Modularity in Static and Dynamic Networks

Total Page:16

File Type:pdf, Size:1020Kb

Modularity in Static and Dynamic Networks Modularity in static and dynamic networks Sarah F. Muldoon University at Buffalo, SUNY OHBM – Brain Graphs Workshop June 25, 2017 Outline 1. Introduction: What is modularity? 2. Determining community structure (static networks) 3. Comparing community structure 4. Multilayer networks: Constructing multitask and temporal multilayer dynamic networks 5. Dynamic community structure 6. Useful references OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Introduction: What is Modularity? OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon What is Modularity? Modularity (Community Structure) • A module (community) is a subset of vertices in a graph that have more connections to each other than to the rest of the network • Example social networks: groups of friends Modularity in the brain: • Structural networks: communities are groups of brain areas that are more highly connected to each other than the rest of the brain • Functional networks: communities are groups of brain areas with synchronous activity that is not synchronous with other brain activity OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The Brain is Modular Structural networks: cortical thickness correlations sensorimotor/spatial strategic/executive mnemonic/emotion olfactocentric auditory/language visual processing Chen et al. (2008) Cereb Cortex OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The Brain is Modular • Functional networks: resting state fMRI He et al. (2009) PLOS One OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The Brain is Hierarchical Modules are composed of modules Meunier et al. (2009) Front. Neuroinform. OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Determining Community Structure OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon How to Determine Community Structure? MANY methods (see Fortunato 2010 Phys Rep for a review) Here we focus on method of modularity maximization Define a quantity called modularity: Or more generally: Resolution Parameter Adjacency For Module Standard Standard Matrix Size Community i Null Model Resolution Parameter Null Model Community j Adjacency Matrix Newman (2006) PNAS; Bassett et al. (2013) Chaos OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Maximize Modularity Function! Examine all possible partitions of communities – choose partition that has the highest modularity Problem is NP-hard – need search algorithm: Louvain algorithm: Blondel et al. (2008) J of Stat Mech OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Importance of Multiple Optimizations The modularity energy landscape has multiple near- degeneracies • Must run optimization algorithm many times (~100) and determine a consensus partition • Consensus partitioning using Nodal Association Matrix – see Bassett et al. (2013) Chaos and Lancichinetti et al. (2012) Sci Rep Good et al. (2010) Phys Rev E OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon The Resolution Parameter Traditionally, γ=1 was used. However, changing γ changes the number of detected communities. γ<1: fewer communities γ>1: more communities Determining γ: 1. Look for stable regime 2. Run consensus over multiple parameter choices Alexander-Bloch et al. (2010) Front Syst Neurosci OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Summary: Static Community Detection 1. Construct brain network (structural or functional) 2. Compute modularity and determine community structure – determine choice of resolution parameter 3. Run community detection multiple times and determine a consensus partition. 4. INTERPRET the results! OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Comparing Community Structure OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Normalized Mutual Information How do we tell if detected community structure is alike between conditions? Compute NMI: 0 if partitions are independent 1 if the partitions are identical Many different ways to be “wrong” Can be adjusted for chance (see Vinh et al (2010)) Danon et al. (2005) J Stat Mech; Vinh et al. (2010) J of Mach Learn Res OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Dynamic Networks Cognitive processes are dynamic! • Memory • Learning • Attention • Fatigue • etc… Medaglia et al. (2015) J Cogn Neurosci How does network structure change and evolve over time? • Different tasks • As a function of learning • As people age • Progression of a disease Bassett et al. (2013) Chaos OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Introduction: Multilayer Networks OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Multilayer Networks • Each node belongs to any subset of the layers, and edges encompass pairwise connections between all possible combinations of nodes and layers 4 Nodes 4 Layers 2 Aspects (X,Y) 2 Elementary-Layer Sets {A,B}, {X,Y} Edges can connect nodes in any layers. • An aspect is a feature of the edge: for example, its definition, or the time in which it exists. Kivelä et al. (2014) J Complex Networks OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Multilayer Networks: An Example Network scientists who talked to each other or went to talks by each other across 3 different network science conferences/workshops. Kivelä et al. (2014) J Complex Networks OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Multiplex Networks • Sequential graphs Inter-layer identity links Temporal data: layers coupled to layers adjacent in time Categorical data: all-to-all layer coupling Mucha et al. (2010) Science OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Multitask Networks • How are networks extracted from the brain at rest different from networks extracted during task conditions? • How do task networks differ from one another? How are they similar? • How does the brain transition between tasks in a large task battery? Cole et al. (2014) Neuron OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Temporal Networks • Perhaps the most common type of network in neuroscience • How do connections evolve over time? • Learning • Development • Progression of disease • Resting state fluctuations Bassett et al. (2013) PLoS Comp. Bio OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Constructing Multitask and Temporal Multilayer Dynamic Networks OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Multiplex Networks • Sequential graphs Each layer is a single functional Inter-layer connectivity matrix identity links Temporal Networks 1 2 3 Multitask Networks 4 Mucha et al. (2010) Science OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Multitask Networks • Step 1: Create separate pairwise functional connectivity matrices for each task • Step 2: Couple each node to itself in every matrix (all-to- all between layer coupling) Task 1 Task 2 Task 3 Task 4 Task 1 Task 2 Task 3 Task 4 OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Temporal Networks • Step 1: Choose a windowing scheme (can overlap) • Step 2: Create pairwise functional connectivity matrices for each time window • Step 3: Align matrices in time and couple each node to itself in the following matrix Window 1 Window 2 Window 3 Window 4 Window 1 Window 2 Window 3 60 120 180 240 Window 4 Time (s) OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Dynamic Community Structure OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Multilayer Modularity We can define a multilayer modularity to estimate the community structure in multilayer networks. Resolution Parameter For Module Size For i and j Community Adjacency in same i in time Matrix community slice l Null Model Resolution Community Adjacency Parameter j in time Matrix for Module slice r Dynamics Optimize using a Louvain-like locally greedy algorithm. Blondel et al. 2008; Mucha et al. (2010) Science. • Mucha et al. (2010) Science; Bassett et al. (2013) Chaos OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Community Assignment in Layers Communities exist across layers (throughout time) and can come in and out of existence Bassett et al. (2013) Chaos OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Flexibility and Promiscuity Regional flexibility: The number of times a region changes its module allegiance normalized by possible number of changes Regional promiscuity: The fraction of communities in which a region participates Flexibility Promiscuity Region 1 0 0.25 Region 2 0.11 0.5 Region 3 0.33 1 Region 4 0.89 1 Region 5 0.11 0.5 Network (ave of regions) 0.29 0.65 Bassett et al. (2011) PNAS; Bassett et al. (2013) PLoS Comp Bio; Papadopoulos et al. (2016) Phys Rev E OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Flexibility Predicts Learning • Regional flexibility changes over the course of learning • Higher flexibility predicted higher learning Bassett et al. (2011) PNAS OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Module Allegiance Matrix We can also create a module allegiance matrix which gives the probability of two nodes being assigned to the same community Step 1: look across all partitions and make contingency tables Region 1 2 3 4 5 • 0 if assigned different communities • 1 if assigned same community 1 X 1 1 1 0 2 1 X 1 1 0 3 1 1 X 1 0 4 1 1 1 X 0 5 0 0 0 0 X Bassett et al. (2015) Nat Neuro OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Module Allegiance Matrix Step 2: Average across all contingency tables (across all variables) Region 1 2 3 4 5 Region 1 2 3 4 5 Subjects 1 X 1 1 1 0 1 X .5 .1 .1 0 2 1 X 1 1 0 Time/task 2 .5 X .3 .4 .4 3 1 1 X 1 0 3 .1 .3 X .7 .3 4 1 1 1 X 0 4 .1 .4 .7 X .6 Optimizations 5 0 0 0 0 X 5 0 .4 .3 .6 X Contingency table Module Allegiance Matrix Bassett et al. (2015) Nat Neuro OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Recruitment and Integration Measures interaction between larger scale groups (ex: visual and motor groupings) • Recruitment: strength of interaction within a group • Integration: strength of interaction between groups Module Allegiance Matrix Number of regions in group ki Recruitment if k1=k2 Integration if k1≠k2 Bassett et al. (2015) Nat Neuro OHBM 2017 – Brain Graphs Workshop – Sarah F.
Recommended publications
  • Networks in Nature: Dynamics, Evolution, and Modularity
    Networks in Nature: Dynamics, Evolution, and Modularity Sumeet Agarwal Merton College University of Oxford A thesis submitted for the degree of Doctor of Philosophy Hilary 2012 2 To my entire social network; for no man is an island Acknowledgements Primary thanks go to my supervisors, Nick Jones, Charlotte Deane, and Mason Porter, whose ideas and guidance have of course played a major role in shaping this thesis. I would also like to acknowledge the very useful sug- gestions of my examiners, Mark Fricker and Jukka-Pekka Onnela, which have helped improve this work. I am very grateful to all the members of the three Oxford groups I have had the fortune to be associated with: Sys- tems and Signals, Protein Informatics, and the Systems Biology Doctoral Training Centre. Their companionship has served greatly to educate and motivate me during the course of my time in Oxford. In particular, Anna Lewis and Ben Fulcher, both working on closely related D.Phil. projects, have been invaluable throughout, and have assisted and inspired my work in many different ways. Gabriel Villar and Samuel Johnson have been col- laborators and co-authors who have helped me to develop some of the ideas and methods used here. There are several other people who have gener- ously provided data, code, or information that has been directly useful for my work: Waqar Ali, Binh-Minh Bui-Xuan, Pao-Yang Chen, Dan Fenn, Katherine Huang, Patrick Kemmeren, Max Little, Aur´elienMazurie, Aziz Mithani, Peter Mucha, George Nicholson, Eli Owens, Stephen Reid, Nico- las Simonis, Dave Smith, Ian Taylor, Amanda Traud, and Jeffrey Wrana.
    [Show full text]
  • Multidimensional Network Analysis
    Universita` degli Studi di Pisa Dipartimento di Informatica Dottorato di Ricerca in Informatica Ph.D. Thesis Multidimensional Network Analysis Michele Coscia Supervisor Supervisor Fosca Giannotti Dino Pedreschi May 9, 2012 Abstract This thesis is focused on the study of multidimensional networks. A multidimensional network is a network in which among the nodes there may be multiple different qualitative and quantitative relations. Traditionally, complex network analysis has focused on networks with only one kind of relation. Even with this constraint, monodimensional networks posed many analytic challenges, being representations of ubiquitous complex systems in nature. However, it is a matter of common experience that the constraint of considering only one single relation at a time limits the set of real world phenomena that can be represented with complex networks. When multiple different relations act at the same time, traditional complex network analysis cannot provide suitable an- alytic tools. To provide the suitable tools for this scenario is exactly the aim of this thesis: the creation and study of a Multidimensional Network Analysis, to extend the toolbox of complex network analysis and grasp the complexity of real world phenomena. The urgency and need for a multidimensional network analysis is here presented, along with an empirical proof of the ubiquity of this multifaceted reality in different complex networks, and some related works that in the last two years were proposed in this novel setting, yet to be systematically defined. Then, we tackle the foundations of the multidimensional setting at different levels, both by looking at the basic exten- sions of the known model and by developing novel algorithms and frameworks for well-understood and useful problems, such as community discovery (our main case study), temporal analysis, link prediction and more.
    [Show full text]
  • A Network Approach to Define Modularity of Components In
    A Network Approach to Define Modularity of Components Manuel E. Sosa1 Technology and Operations Management Area, in Complex Products INSEAD, 77305 Fontainebleau, France Modularity has been defined at the product and system levels. However, little effort has e-mail: [email protected] gone into defining and quantifying modularity at the component level. We consider com- plex products as a network of components that share technical interfaces (or connections) Steven D. Eppinger in order to function as a whole and define component modularity based on the lack of Sloan School of Management, connectivity among them. Building upon previous work in graph theory and social net- Massachusetts Institute of Technology, work analysis, we define three measures of component modularity based on the notion of Cambridge, Massachusetts 02139 centrality. Our measures consider how components share direct interfaces with adjacent components, how design interfaces may propagate to nonadjacent components in the Craig M. Rowles product, and how components may act as bridges among other components through their Pratt & Whitney Aircraft, interfaces. We calculate and interpret all three measures of component modularity by East Hartford, Connecticut 06108 studying the product architecture of a large commercial aircraft engine. We illustrate the use of these measures to test the impact of modularity on component redesign. Our results show that the relationship between component modularity and component redesign de- pends on the type of interfaces connecting product components. We also discuss direc- tions for future work. ͓DOI: 10.1115/1.2771182͔ 1 Introduction The need to measure modularity has been highlighted implicitly by Saleh ͓12͔ in his recent invitation “to contribute to the growing Previous research on product architecture has defined modular- field of flexibility in system design” ͑p.
    [Show full text]
  • Task-Dependent Evolution of Modularity in Neural Networks1
    Task-dependent evolution of modularity in neural networks1 Michael Husk¨ en, Christian Igel, and Marc Toussaint Institut fur¨ Neuroinformatik, Ruhr-Universit¨at Bochum, 44780 Bochum, Germany Telephone: +49 234 32 25558, Fax: +49 234 32 14209 fhuesken,igel,[email protected] Connection Science Vol 14, No 3, 2002, p. 219-229 Abstract. There exist many ideas and assumptions about the development and meaning of modu- larity in biological and technical neural systems. We empirically study the evolution of connectionist models in the context of modular problems. For this purpose, we define quantitative measures for the degree of modularity and monitor them during evolutionary processes under different constraints. It turns out that the modularity of the problem is reflected by the architecture of adapted systems, although learning can counterbalance some imperfection of the architecture. The demand for fast learning systems increases the selective pressure towards modularity. 1 Introduction The performance of biological as well as technical neural systems crucially depends on their ar- chitectures. In case of a feed-forward neural network (NN), architecture may be defined as the underlying graph constituting the number of neurons and the way these neurons are connected. Particularly one property of architectures, modularity, has raised a lot of interest among researchers dealing with biological and technical aspects of neural computation. It appears to be obvious to emphasise modularity in neural systems because the vertebrate brain is highly modular both in an anatomical and in a functional sense. It is important to stress that there are different concepts of modules. `When a neuroscientist uses the word \module", s/he is usually referring to the fact that brains are structured, with cells, columns, layers, and regions which divide up the labour of information processing in various ways' 1This paper is a revised and extended version of the GECCO 2001 Late-Breaking Paper by Husk¨ en, Igel, & Toussaint (2001).
    [Show full text]
  • Analyzing Social Media Network for Students in Presidential Election 2019 with Nodexl
    ANALYZING SOCIAL MEDIA NETWORK FOR STUDENTS IN PRESIDENTIAL ELECTION 2019 WITH NODEXL Irwan Dwi Arianto Doctoral Candidate of Communication Sciences, Airlangga University Corresponding Authors: [email protected] Abstract. Twitter is widely used in digital political campaigns. Twitter as a social media that is useful for building networks and even connecting political participants with the community. Indonesia will get a demographic bonus starting next year until 2030. The number of productive ages that will become a demographic bonus if not recognized correctly can be a problem. The election organizer must seize this opportunity for the benefit of voter participation. This study aims to describe the network structure of students in the 2019 presidential election. The first debate was held on January 17, 2019 as a starting point for data retrieval on Twitter social media. This study uses data sources derived from Twitter talks from 17 January 2019 to 20 August 2019 with keywords “#pilpres2019 OR #mahasiswa since: 2019-01-17”. The data obtained were analyzed by the communication network analysis method using NodeXL software. Our Analysis found that Top Influencer is @jokowi, as well as Top, Mentioned also @jokowi while Top Tweeters @okezonenews and Top Replied-To @hasmi_bakhtiar. Jokowi is incumbent running for re-election with Ma’ruf Amin (Senior Muslim Cleric) as his running mate against Prabowo Subianto (a former general) and Sandiaga Uno as his running mate (former vice governor). This shows that the more concentrated in the millennial generation in this case students are presidential candidates @jokowi. @okezonenews, the official twitter account of okezone.com (MNC Media Group).
    [Show full text]
  • Ensemble-Based Community Detection in Multilayer Networks
    ENSEMBLE-BASED COMMUNITY DETECTION IN MULTILAYER NETWORKS Andrea Tagarelli, Alessia Amelio, Francesco Gullo The 2017 European Conference on Machine Learning & Principles and Practice of Knowledge Discovery in DataBases Experimental evaluation Datasets • Our experimental evaluation was mainly conducted on seven real-world multilayer network datasets Experimental evaluation Datasets • We also resorted to a synthetic multilayer network generator, mLFR Benchmark, mainly for our evaluation of efficiency of the M-EMCD method • We used mLFR to create a multilayer network with 1 million of nodes, setting other available parameters as follows: • 10 layers, • average degree 30, • maximum degree 100, • mixing at 20% , • layer mixing 2. Experimental evaluation Competing methods • flattening methods • apply a community detection method on the flattened graph of the input multilayer network • it is a weighted multigraph having V as set of nodes, the set of edges, and edge weights that express the number of layers on which two nodes are connected • Nerstrand algorithm1 1 D. LaSalle and G. Karypis, "Multi-threaded modularity based graph clustering using the multilevel paradigm", J. Parallel Distrib. Comput., 76:66–80, 2015. Experimental evaluation Competing methods • aggregation methods • detect a community structure separately for each network layer, after that an aggregation mechanism is used to obtain the final community structure • Principal Modularity Maximization (PMM)2 • frequent pAttern mining-BAsed Community discoverer in mUltidimensional networkS (ABACUS)3 2 L. Tang, X. Wang, and H. Liu, “Uncovering groups via heterogeneous interaction analysis,” in Proc. ICDM, 2009, pp. 503–512. 3 M. Berlingerio, F. Pinelli, and F. Calabrese, "ABACUS: frequent pattern mining-based community discovery in multidimensional networks", Data Min.
    [Show full text]
  • On Community Structure in Complex Networks: Challenges and Opportunities
    On community structure in complex networks: challenges and opportunities Hocine Cherifi · Gergely Palla · Boleslaw K. Szymanski · Xiaoyan Lu Received: November 5, 2019 Abstract Community structure is one of the most relevant features encoun- tered in numerous real-world applications of networked systems. Despite the tremendous effort of a large interdisciplinary community of scientists working on this subject over the past few decades to characterize, model, and analyze communities, more investigations are needed in order to better understand the impact of community structure and its dynamics on networked systems. Here, we first focus on generative models of communities in complex networks and their role in developing strong foundation for community detection algorithms. We discuss modularity and the use of modularity maximization as the basis for community detection. Then, we follow with an overview of the Stochastic Block Model and its different variants as well as inference of community structures from such models. Next, we focus on time evolving networks, where existing nodes and links can disappear, and in parallel new nodes and links may be introduced. The extraction of communities under such circumstances poses an interesting and non-trivial problem that has gained considerable interest over Hocine Cherifi LIB EA 7534 University of Burgundy, Esplanade Erasme, Dijon, France E-mail: hocine.cherifi@u-bourgogne.fr Gergely Palla MTA-ELTE Statistical and Biological Physics Research Group P´azm´any P. stny. 1/A, Budapest, H-1117, Hungary E-mail: [email protected] Boleslaw K. Szymanski Department of Computer Science & Network Science and Technology Center Rensselaer Polytechnic Institute 110 8th Street, Troy, NY 12180, USA E-mail: [email protected] Xiaoyan Lu Department of Computer Science & Network Science and Technology Center Rensselaer Polytechnic Institute 110 8th Street, Troy, NY 12180, USA E-mail: [email protected] arXiv:1908.04901v3 [physics.soc-ph] 6 Nov 2019 2 Cherifi, Palla, Szymanski, Lu the last decade.
    [Show full text]
  • Generalized Louvain Method for Community Detection in Large Networks
    Generalized Louvain method for community detection in large networks Pasquale De Meo∗, Emilio Ferrarax, Giacomo Fiumara∗, Alessandro Provetti∗,∗∗ ∗Dept. of Physics, Informatics Section. xDept. of Mathematics. University of Messina, Italy. ∗∗Oxford-Man Institute, University of Oxford, UK. fpdemeo, eferrara, gfiumara, [email protected] Abstract—In this paper we present a novel strategy to a novel measure of edge centrality, in order to rank all discover the community structure of (possibly, large) networks. the edges of the network with respect to their proclivity This approach is based on the well-know concept of network to propagate information through the network itself; iii) modularity optimization. To do so, our algorithm exploits a novel measure of edge centrality, based on the κ-paths. its computational cost is low, making it feasible even for This technique allows to efficiently compute a edge ranking large network analysis; iv) it is able to produce reliable in large networks in near linear time. Once the centrality results, even if compared with those calculated by using ranking is calculated, the algorithm computes the pairwise more complex techniques, when this is possible; in fact, proximity between nodes of the network. Finally, it discovers because of the computational constraints, the adoption of the community structure adopting a strategy inspired by the well-known state-of-the-art Louvain method (henceforth, LM), some existing techniques is not viable when considering efficiently maximizing the network modularity. The experi- large networks, and their application is only limited to small ments we carried out show that our algorithm outperforms case-studies. other techniques and slightly improves results of the original This paper is organized as follows: in the next Section we LM, providing reliable results.
    [Show full text]
  • Comm 645 Handout – Nodexl Basics
    COMM 645 HANDOUT – NODEXL BASICS NodeXL: Network Overview, Discovery and Exploration for Excel. Download from nodexl.codeplex.com Plugin for social media/Facebook import: socialnetimporter.codeplex.com Plugin for Microsoft Exchange import: exchangespigot.codeplex.com Plugin for Voson hyperlink network import: voson.anu.edu.au/node/13#VOSON-NodeXL Note that NodeXL requires MS Office 2007 or 2010. If your system does not support those (or you do not have them installed), try using one of the computers in the PhD office. Major sections within NodeXL: • Edges Tab: Edge list (Vertex 1 = source, Vertex 2 = destination) and attributes (Fig.1→1a) • Vertices Tab: Nodes and attribute (nodes can be imported from the edge list) (Fig.1→1b) • Groups Tab: Groups of nodes defined by attribute, clusters, or components (Fig.1→1c) • Groups Vertices Tab: Nodes belonging to each group (Fig.1→1d) • Overall Metrics Tab: Network and node measures & graphs (Fig.1→1e) Figure 1: The NodeXL Interface 3 6 8 2 7 9 13 14 5 12 4 10 11 1 1a 1b 1c 1d 1e Download more network handouts at www.kateto.net / www.ognyanova.net 1 After you install the NodeXL template, a new NodeXL tab will appear in your Excel interface. The following features will be available in it: Fig.1 → 1: Switch between different data tabs. The most important two tabs are "Edges" and "Vertices". Fig.1 → 2: Import data into NodeXL. The formats you can use include GraphML, UCINET DL files, and Pajek .net files, among others. You can also import data from social media: Flickr, YouTube, Twitter, Facebook (requires a plugin), or a hyperlink networks (requires a plugin).
    [Show full text]
  • Graph and Network Analysis
    Graph and Network Analysis Dr. Derek Greene Clique Research Cluster, University College Dublin Web Science Doctoral Summer School 2011 Tutorial Overview • Practical Network Analysis • Basic concepts • Network types and structural properties • Identifying central nodes in a network • Communities in Networks • Clustering and graph partitioning • Finding communities in static networks • Finding communities in dynamic networks • Applications of Network Analysis Web Science Summer School 2011 2 Tutorial Resources • NetworkX: Python software for network analysis (v1.5) http://networkx.lanl.gov • Python 2.6.x / 2.7.x http://www.python.org • Gephi: Java interactive visualisation platform and toolkit. http://gephi.org • Slides, full resource list, sample networks, sample code snippets online here: http://mlg.ucd.ie/summer Web Science Summer School 2011 3 Introduction • Social network analysis - an old field, rediscovered... [Moreno,1934] Web Science Summer School 2011 4 Introduction • We now have the computational resources to perform network analysis on large-scale data... http://www.facebook.com/note.php?note_id=469716398919 Web Science Summer School 2011 5 Basic Concepts • Graph: a way of representing the relationships among a collection of objects. • Consists of a set of objects, called nodes, with certain pairs of these objects connected by links called edges. A B A B C D C D Undirected Graph Directed Graph • Two nodes are neighbours if they are connected by an edge. • Degree of a node is the number of edges ending at that node. • For a directed graph, the in-degree and out-degree of a node refer to numbers of edges incoming to or outgoing from the node.
    [Show full text]
  • Social Network Analysis: Homophily
    Social Network Analysis: homophily Donglei Du ([email protected]) Faculty of Business Administration, University of New Brunswick, NB Canada Fredericton E3B 9Y2 Donglei Du (UNB) Social Network Analysis 1 / 41 Table of contents 1 Homophily the Schelling model 2 Test of Homophily 3 Mechanisms Underlying Homophily: Selection and Social Influence 4 Affiliation Networks and link formation Affiliation Networks Three types of link formation in social affiliation Network 5 Empirical evidence Triadic closure: Empirical evidence Focal Closure: empirical evidence Membership closure: empirical evidence (Backstrom et al., 2006; Crandall et al., 2008) Quantifying the Interplay Between Selection and Social Influence: empirical evidence—a case study 6 Appendix A: Assortative mixing in general Quantify assortative mixing Donglei Du (UNB) Social Network Analysis 2 / 41 Homophily: introduction The material is adopted from Chapter 4 of (Easley and Kleinberg, 2010). "love of the same"; "birds of a feather flock together" At the aggregate level, links in a social network tend to connect people who are similar to one another in dimensions like Immutable characteristics: racial and ethnic groups; ages; etc. Mutable characteristics: places living, occupations, levels of affluence, and interests, beliefs, and opinions; etc. A.k.a., assortative mixing Donglei Du (UNB) Social Network Analysis 4 / 41 Homophily at action: racial segregation Figure: Credit: (Easley and Kleinberg, 2010) Donglei Du (UNB) Social Network Analysis 5 / 41 Homophily at action: racial segregation Credit: (Easley and Kleinberg, 2010) Figure: Credit: (Easley and Kleinberg, 2010) Donglei Du (UNB) Social Network Analysis 6 / 41 Homophily: the Schelling model Thomas Crombie Schelling (born 14 April 1921): An American economist, and Professor of foreign affairs, national security, nuclear strategy, and arms control at the School of Public Policy at University of Maryland, College Park.
    [Show full text]
  • Network Modularity
    Modularity CMSC 858L Module-detection for Function Prediction • Biological networks generally modular (Hartwell+, 1999) • We can try to find the modules within a network. • Once we find modules, we can look at over-represented functions within a module, e.g.: - If a majority of the proteins within a module have annotation A, predict annotation A for the other proteins in the module. • ⇒ Graph clustering methods - Min Multiway Cut, Graph Summarization, VI-Cut: examples we’ve already seen. - Methods often borrowed from other “community detection” applications. Modularity Modularity eii = % edges in module i eii=|{(u,v) : u ∈ Vi, v ∈ Vi, (u,v)∈E}| / |E| 2 ai = % edges with at least 1 i end in module i ai=|{(u,v) : u ∈ Vi, (u,v) ∈ E}| / |E| probability a random 3 edge would fall into Modularity is: module i k Q = e a2 ii − i i=1 High modularity ⇒ more edges probability edge within the module that you expect is in module i by chance. Examples 28 5 22 27 6 3 4 23 7 16 19 8 21 12 13 14 24 2 9 9 29 17 3 10 5 1 7 1 20 10 30 25 26 2 11 8 18 6 4 Communities Assigned 15 to a small graph Communities assigned to Note: maximizing a random graph modularity will find it’s own # of clusters Modularity Algorithm #1 • Modularity is NP-hard to optimize (Brandes, 2007) • Greedy Heuristic: (Newman, 2003) - C = trivial clustering with each node in its own cluster - Repeat: • Merge the two clusters that will increase the modularity by the largest amount • Stop when all merges would reduce the modularity.
    [Show full text]