Visualization of Large Networks Using Recursive Community Detection

Total Page:16

File Type:pdf, Size:1020Kb

Visualization of Large Networks Using Recursive Community Detection San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Fall 12-18-2020 Visualization of Large Networks Using Recursive Community Detection Xinyuan Fan San Jose State University Follow this and additional works at: https://scholarworks.sjsu.edu/etd_projects Part of the Other Computer Sciences Commons Recommended Citation Fan, Xinyuan, "Visualization of Large Networks Using Recursive Community Detection" (2020). Master's Projects. 965. DOI: https://doi.org/10.31979/etd.atus-pbv9 https://scholarworks.sjsu.edu/etd_projects/965 This Master's Project is brought to you for free and open access by the Master's Theses and Graduate Research at SJSU ScholarWorks. It has been accepted for inclusion in Master's Projects by an authorized administrator of SJSU ScholarWorks. For more information, please contact [email protected]. Visualization of Large Networks Using Recursive Community Detection CS298 Report Presented to Department of Computer Science San Jose State University In Partial Fulfillment of the Requirements for the Degree Master of Science By Xinyuan Fan December 2020 c 2020 Xinyuan Fan ALL RIGHTS RESERVED The Designated Project Committee Approves the Project Titled Visualization of Large Networks Using Recursive Community Detection by Xinyuan Fan APPROVED FOR THE DEPARTMENT OF COMPUTER SCIENCE SAN JOSE STATE UNIVERSITY December 2020 Dr. Katerina Potika Department of Computer Science Dr. Teng Moh Department of Computer Science Prof. Kevin Smith Department of Computer Science ABSTRACT Networks show relationships between people or things. For instance, a person has a social network of friends, and websites are connected through a network of hyperlinks. Networks are most commonly represented as graphs, so graph drawing becomes significant for network visualization. An effective graph drawing can quickly reveal connections and patterns within a network that would be difficult to discern without visual aid. But graph drawing becomes a challenge for large networks. Am- biguous edge crossings are inevitable in large networks with numerous nodes and edges, and large graphs often become a complicated tangle of lines. These issues greatly reduce graph readability and makes analyzing complex networks an arduous task. This project aims to address the large network visualization problem by com- bining recursive community detection, node size scaling, layout formation, labeling, edge coloring, and interactivity to make large graphs more readable. Experiments are performed on five known datasets to test the effectiveness of the proposed approach. Asurveyofthevisualizationresultsisconductedtomeasuretheresults. Keywords: Networks, Visualization, Graph Drawing, Community Detec- tion, Louvain, Node Size, Graph Layout, Labeling, Edge Coloring, Inter- activity ACKNOWLEDGMENTS I would like to thank Dr. Katerina Potika for serving as the advisor for this project. Dr. Potika’s continued guidance and feedback have been invaluable towards the completion of this project. I would also like to thank committee members Dr. Teng Moh and Prof. Kevin Smith for taking an interest in this project. I am very appreciative of their time and efforts. Lastly, I would like to thank my family and friends for their encouragement and support. v TABLE OF CONTENTS CHAPTER 1Introduction................................ 1 2RelatedWorks.............................. 4 2.1 Network Tools ............................. 4 2.2 Network Embedding ......................... 4 2.3 Network Nucleus ........................... 5 2.4 Layouts ................................ 5 2.5 Edge Coloring ............................. 6 3Methodology............................... 8 3.1 Community Detection ......................... 8 3.1.1 Louvain Method for Community Detection ......... 10 3.1.2 Recursive Community Detection ............... 10 3.2 Node Size Scaling ........................... 16 3.2.1 Minimum Node Size Scaling ................. 16 3.2.2 Logarithmic Node Size Scaling ................ 17 3.2.3 Capped Node Size Scaling .................. 19 3.3 Layouts ................................ 20 3.3.1 Static Layouts ......................... 20 3.3.2 Interactive Layouts ...................... 21 3.4 Labeling ................................ 24 3.5 Edge Coloring ............................. 25 vi 3.5.1 Node Coloring ......................... 25 3.6 Interactivity .............................. 27 4SetupofExperiments.......................... 29 4.1 Datasets ................................ 29 4.2 Technologies .............................. 30 4.3 System Requirements ......................... 31 4.4 User Survey .............................. 31 5Results................................... 34 5.1 Recursive Community Detection ................... 34 5.2 Node Size Scaling ........................... 37 5.2.1 Minimum Node Size Scaling ................. 37 5.2.2 Logarithmic Node Size Scaling ................ 38 5.2.3 Capped Node Size Scaling .................. 39 5.3 Layouts ................................ 40 5.4 Labeling ................................ 43 5.5 Edge Coloring ............................. 43 5.6 Interactivity .............................. 44 6ConclusionandFutureWork..................... 48 6.1 Conclusion ............................... 48 6.2 Future Work .............................. 49 List of References .............................. 51 APPENDIX vii Data Visualization User Survey Questions .............. 55 A.1 Email Iteration 1 ........................... 56 A.2 Email Iteration 2 ........................... 57 A.3 DBLP Iteration 1 ........................... 58 A.4 DBLP Iteration 2 ........................... 59 A.5 Amazon Iteration 1 .......................... 60 A.6 YouTube Iteration 1 ......................... 61 A.7 YouTube Iteration 2 ......................... 62 A.8 Wikipedia Iteration 1 ......................... 63 viii CHAPTER 1 Introduction Anetworkisaninterconnectedsystemofpeopleorthings.Networksareubiq- uitous, as everyone has their own social network of friends, colleagues, and other acquaintances. Furthermore, with the rise of online social networks, such as Face- book and Twitter, millions of people can connect from across the world. With these rapidly growing networks, visualization becomes more important than ever to com- prehending complex networks. Network analysis provides valuable information about network organization and community structure. These insights can be most clearly seen by representing networks as graphs. Actors correspond to nodes, and relation- ships are depicted as edges. A simple example of a social network in graph format can be seen in Figure 1. Figure 1: Social network graph [1]. 1 Avisualgraphrepresentationcanrevealdetailsaboutanetworkfromjusta glance. This makes understanding network connections quick and easy compared to trying to analyze raw data. However, as networks grow larger, they become in- creasingly difficult to visualize. With so many nodes and overlapping edges, a graph can become a giant hairball, impossible to extract valuable information from. This project aims to address this problem by combining various visualization techniques and proposing some for the first time in order to improve the readability of large graphs. The objective of this research is to create a method to effectively visualize large networks. This is achieved by proposing a recursive community detection approach, • exploring various node size scaling approaches, • applying various layout formations, • applying node labeling to convey information on community size, • incorporating edge coloring for the inter-connections between communities, • and experimenting with interactive visualization capabilities. • Community detection is necessary for identifying closely related groups within a network [2]. Node size scaling ensures nodes are sized proportionately compared to other communities based on how many nodes from the original network are contained in the community. Visualizations are drawn in different layouts [3, 4]dependingonthe graph characteristics. Communities are labeled with the number of nodes contained in the community. This labeling enables the viewer to quickly gather insights on 2 community size. Edge coloring [5]isappliedtoedgesbetweencommunities,inter- connections, to minimize confusion from edge crossings. Finally, making the graphs interactive allows for useful capabilities such as zooming, dragging, and highlighting node connections. These topics have been individually researched, but there has yet to be a vi- sualization method that utilizes all of these features together to generate optimal network visualizations. Combining these methods for graph visualization is the pri- mary innovation and challenge of this project. Additionally, a user survey on different visualizations of five known datasets for community detection is used to rate the ex- perimental results. 3 CHAPTER 2 Related Works There have been various research efforts towards developing new graph analy- sis and visualization techniques. This project draws inspiration from some of these existing solutions to produce a comprehensive graph visualization approach. 2.1 Network Tools Many network analysis tools have arisen in recent years with the boom of social networks. These tools provide community detection and mining capabilities to extract useful information from networks. However, with so many different tools available, it can be difficult to determine which tool is best for a job. Amin, Ahmad, and Choi [1]hopedtoaddressthisissuebycomparingfourmajornetworkanalysistools:
Recommended publications
  • Sagemath and Sagemathcloud
    Viviane Pons Ma^ıtrede conf´erence,Universit´eParis-Sud Orsay [email protected] { @PyViv SageMath and SageMathCloud Introduction SageMath SageMath is a free open source mathematics software I Created in 2005 by William Stein. I http://www.sagemath.org/ I Mission: Creating a viable free open source alternative to Magma, Maple, Mathematica and Matlab. Viviane Pons (U-PSud) SageMath and SageMathCloud October 19, 2016 2 / 7 SageMath Source and language I the main language of Sage is python (but there are many other source languages: cython, C, C++, fortran) I the source is distributed under the GPL licence. Viviane Pons (U-PSud) SageMath and SageMathCloud October 19, 2016 3 / 7 SageMath Sage and libraries One of the original purpose of Sage was to put together the many existent open source mathematics software programs: Atlas, GAP, GMP, Linbox, Maxima, MPFR, PARI/GP, NetworkX, NTL, Numpy/Scipy, Singular, Symmetrica,... Sage is all-inclusive: it installs all those libraries and gives you a common python-based interface to work on them. On top of it is the python / cython Sage library it-self. Viviane Pons (U-PSud) SageMath and SageMathCloud October 19, 2016 4 / 7 SageMath Sage and libraries I You can use a library explicitly: sage: n = gap(20062006) sage: type(n) <c l a s s 'sage. interfaces .gap.GapElement'> sage: n.Factors() [ 2, 17, 59, 73, 137 ] I But also, many of Sage computation are done through those libraries without necessarily telling you: sage: G = PermutationGroup([[(1,2,3),(4,5)],[(3,4)]]) sage : G . g a p () Group( [ (3,4), (1,2,3)(4,5) ] ) Viviane Pons (U-PSud) SageMath and SageMathCloud October 19, 2016 5 / 7 SageMath Development model Development model I Sage is developed by researchers for researchers: the original philosophy is to develop what you need for your research and share it with the community.
    [Show full text]
  • Networkx Tutorial
    5.03.2020 tutorial NetworkX tutorial Source: https://github.com/networkx/notebooks (https://github.com/networkx/notebooks) Minor corrections: JS, 27.02.2019 Creating a graph Create an empty graph with no nodes and no edges. In [1]: import networkx as nx In [2]: G = nx.Graph() By definition, a Graph is a collection of nodes (vertices) along with identified pairs of nodes (called edges, links, etc). In NetworkX, nodes can be any hashable object e.g. a text string, an image, an XML object, another Graph, a customized node object, etc. (Note: Python's None object should not be used as a node as it determines whether optional function arguments have been assigned in many functions.) Nodes The graph G can be grown in several ways. NetworkX includes many graph generator functions and facilities to read and write graphs in many formats. To get started though we'll look at simple manipulations. You can add one node at a time, In [3]: G.add_node(1) add a list of nodes, In [4]: G.add_nodes_from([2, 3]) or add any nbunch of nodes. An nbunch is any iterable container of nodes that is not itself a node in the graph. (e.g. a list, set, graph, file, etc..) In [5]: H = nx.path_graph(10) file:///home/szwabin/Dropbox/Praca/Zajecia/Diffusion/Lectures/1_intro/networkx_tutorial/tutorial.html 1/18 5.03.2020 tutorial In [6]: G.add_nodes_from(H) Note that G now contains the nodes of H as nodes of G. In contrast, you could use the graph H as a node in G.
    [Show full text]
  • Networkx: Network Analysis with Python
    NetworkX: Network Analysis with Python Salvatore Scellato Full tutorial presented at the XXX SunBelt Conference “NetworkX introduction: Hacking social networks using the Python programming language” by Aric Hagberg & Drew Conway Outline 1. Introduction to NetworkX 2. Getting started with Python and NetworkX 3. Basic network analysis 4. Writing your own code 5. You are ready for your project! 1. Introduction to NetworkX. Introduction to NetworkX - network analysis Vast amounts of network data are being generated and collected • Sociology: web pages, mobile phones, social networks • Technology: Internet routers, vehicular flows, power grids How can we analyze this networks? Introduction to NetworkX - Python awesomeness Introduction to NetworkX “Python package for the creation, manipulation and study of the structure, dynamics and functions of complex networks.” • Data structures for representing many types of networks, or graphs • Nodes can be any (hashable) Python object, edges can contain arbitrary data • Flexibility ideal for representing networks found in many different fields • Easy to install on multiple platforms • Online up-to-date documentation • First public release in April 2005 Introduction to NetworkX - design requirements • Tool to study the structure and dynamics of social, biological, and infrastructure networks • Ease-of-use and rapid development in a collaborative, multidisciplinary environment • Easy to learn, easy to teach • Open-source tool base that can easily grow in a multidisciplinary environment with non-expert users
    [Show full text]
  • Detecting Statistically Significant Communities
    1 Detecting Statistically Significant Communities Zengyou He, Hao Liang, Zheng Chen, Can Zhao, Yan Liu Abstract—Community detection is a key data analysis problem across different fields. During the past decades, numerous algorithms have been proposed to address this issue. However, most work on community detection does not address the issue of statistical significance. Although some research efforts have been made towards mining statistically significant communities, deriving an analytical solution of p-value for one community under the configuration model is still a challenging mission that remains unsolved. The configuration model is a widely used random graph model in community detection, in which the degree of each node is preserved in the generated random networks. To partially fulfill this void, we present a tight upper bound on the p-value of a single community under the configuration model, which can be used for quantifying the statistical significance of each community analytically. Meanwhile, we present a local search method to detect statistically significant communities in an iterative manner. Experimental results demonstrate that our method is comparable with the competing methods on detecting statistically significant communities. Index Terms—Community Detection, Random Graphs, Configuration Model, Statistical Significance. F 1 INTRODUCTION ETWORKS are widely used for modeling the structure function that is able to always achieve the best performance N of complex systems in many fields, such as biology, in all possible scenarios. engineering, and social science. Within the networks, some However, most of these objective functions (metrics) do vertices with similar properties will form communities that not address the issue of statistical significance of commu- have more internal connections and less external links.
    [Show full text]
  • Networks in Nature: Dynamics, Evolution, and Modularity
    Networks in Nature: Dynamics, Evolution, and Modularity Sumeet Agarwal Merton College University of Oxford A thesis submitted for the degree of Doctor of Philosophy Hilary 2012 2 To my entire social network; for no man is an island Acknowledgements Primary thanks go to my supervisors, Nick Jones, Charlotte Deane, and Mason Porter, whose ideas and guidance have of course played a major role in shaping this thesis. I would also like to acknowledge the very useful sug- gestions of my examiners, Mark Fricker and Jukka-Pekka Onnela, which have helped improve this work. I am very grateful to all the members of the three Oxford groups I have had the fortune to be associated with: Sys- tems and Signals, Protein Informatics, and the Systems Biology Doctoral Training Centre. Their companionship has served greatly to educate and motivate me during the course of my time in Oxford. In particular, Anna Lewis and Ben Fulcher, both working on closely related D.Phil. projects, have been invaluable throughout, and have assisted and inspired my work in many different ways. Gabriel Villar and Samuel Johnson have been col- laborators and co-authors who have helped me to develop some of the ideas and methods used here. There are several other people who have gener- ously provided data, code, or information that has been directly useful for my work: Waqar Ali, Binh-Minh Bui-Xuan, Pao-Yang Chen, Dan Fenn, Katherine Huang, Patrick Kemmeren, Max Little, Aur´elienMazurie, Aziz Mithani, Peter Mucha, George Nicholson, Eli Owens, Stephen Reid, Nico- las Simonis, Dave Smith, Ian Taylor, Amanda Traud, and Jeffrey Wrana.
    [Show full text]
  • Modularity in Static and Dynamic Networks
    Modularity in static and dynamic networks Sarah F. Muldoon University at Buffalo, SUNY OHBM – Brain Graphs Workshop June 25, 2017 Outline 1. Introduction: What is modularity? 2. Determining community structure (static networks) 3. Comparing community structure 4. Multilayer networks: Constructing multitask and temporal multilayer dynamic networks 5. Dynamic community structure 6. Useful references OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Introduction: What is Modularity? OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon What is Modularity? Modularity (Community Structure) • A module (community) is a subset of vertices in a graph that have more connections to each other than to the rest of the network • Example social networks: groups of friends Modularity in the brain: • Structural networks: communities are groups of brain areas that are more highly connected to each other than the rest of the brain • Functional networks: communities are groups of brain areas with synchronous activity that is not synchronous with other brain activity OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The Brain is Modular Structural networks: cortical thickness correlations sensorimotor/spatial strategic/executive mnemonic/emotion olfactocentric auditory/language visual processing Chen et al. (2008) Cereb Cortex OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The Brain is Modular • Functional networks: resting state fMRI He et al. (2009) PLOS One OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The
    [Show full text]
  • Graph Database Fundamental Services
    Bachelor Project Czech Technical University in Prague Faculty of Electrical Engineering F3 Department of Cybernetics Graph Database Fundamental Services Tomáš Roun Supervisor: RNDr. Marko Genyk-Berezovskyj Field of study: Open Informatics Subfield: Computer and Informatic Science May 2018 ii Acknowledgements Declaration I would like to thank my advisor RNDr. I declare that the presented work was de- Marko Genyk-Berezovskyj for his guid- veloped independently and that I have ance and advice. I would also like to thank listed all sources of information used Sergej Kurbanov and Herbert Ullrich for within it in accordance with the methodi- their help and contributions to the project. cal instructions for observing the ethical Special thanks go to my family for their principles in the preparation of university never-ending support. theses. Prague, date ............................ ........................................... signature iii Abstract Abstrakt The goal of this thesis is to provide an Cílem této práce je vyvinout webovou easy-to-use web service offering a database službu nabízející databázi neorientova- of undirected graphs that can be searched ných grafů, kterou bude možno efektivně based on the graph properties. In addi- prohledávat na základě vlastností grafů. tion, it should also allow to compute prop- Tato služba zároveň umožní vypočítávat erties of user-supplied graphs with the grafové vlastnosti pro grafy zadané uži- help graph libraries and generate graph vatelem s pomocí grafových knihoven a images. Last but not least, we implement zobrazovat obrázky grafů. V neposlední a system that allows bulk adding of new řadě je také cílem navrhnout systém na graphs to the database and computing hromadné přidávání grafů do databáze a their properties.
    [Show full text]
  • Graphprism: Compact Visualization of Network Structure
    GraphPrism: Compact Visualization of Network Structure Sanjay Kairam, Diana MacLean, Manolis Savva, Jeffrey Heer Stanford University Computer Science Department {skairam, malcdi, msavva, jheer}@cs.stanford.edu GraphPrism Connectivity ABSTRACT nodeRadius 7.9 strokeWidth 1.12 charge -242 Visual methods for supporting the characterization, com- gravity 0.65 linkDistance 20 parison, and classification of large networks remain an open Transitivity updateGraph Close Controls challenge. Ideally, such techniques should surface useful 0 node(s) selected. structural features { such as effective diameter, small-world properties, and structural holes { not always apparent from Density either summary statistics or typical network visualizations. In this paper, we present GraphPrism, a technique for visu- Conductance ally summarizing arbitrarily large graphs through combina- tions of `facets', each corresponding to a single node- or edge- specific metric (e.g., transitivity). We describe a generalized Jaccard approach for constructing facets by calculating distributions of graph metrics over increasingly large local neighborhoods and representing these as a stacked multi-scale histogram. MeetMin Evaluation with paper prototypes shows that, with minimal training, static GraphPrism diagrams can aid network anal- ysis experts in performing basic analysis tasks with network Created by Sanjay Kairam. Visualization using D3. data. Finally, we contribute the design of an interactive sys- Figure 1: GraphPrism and node-link diagrams for tem using linked selection between GraphPrism overviews the largest component of a co-authorship graph. and node-link detail views. Using a case study of data from a co-authorship network, we illustrate how GraphPrism fa- compactly summarizing networks of arbitrary size. Graph- cilitates interactive exploration of network data.
    [Show full text]
  • Graph Simplification and Interaction
    Graph Clarity, Simplification, & Interaction http://i.imgur.com/cW19IBR.jpg https://www.reddit.com/r/CableManagement/ Today • Today’s Reading: Lombardi Graphs – Bezier Curves • Today’s Reading: Clustering/Hierarchical edge Bundling – Definition of Betweenness Centrality • Emergency Management Graph Visualization – Sean Kim’s masters project • Reading for Tuesday & Homework 3 • Graph Interaction Brainstorming Exercise "Lombardi drawings of graphs", Duncan, Eppstein, Goodrich, Kobourov, Nollenberg, Graph Drawing 2010 • Circular arcs • Perfect angular resolution (edges for equal angles at vertices) • Arcs only intersect 2 vertices (at endpoints) • (not required to be crossing free) • Vertices may be constrained to lie on circle or concentric circles • People are more patient with aesthetically pleasing graphs (will spend longer studying to learn/draw conclusions) • What about relaxing the circular arc requirement and allowing Bezier arcs? • How does it scale to larger data? • Long curved arcs can be much harder to follow • Circular layout of nodes is often very good! • Would like more pseudocode Cubic Bézier Curve • 4 control points • Curve passes through first & last control point • Curve is tangent at P0 to (P1-P0) and at P3 to (P3-P2) http://www.e-cartouche.ch/content_reg/carto http://www.webreference.com/dla uche/graphics/en/html/Curves_learningObject b/9902/bezier.html 2.html “Force-directed Lombardi-style graph drawing”, Chernobelskiy et al., Graph Drawing 2011. • Relaxation of the Lombardi Graph requirements • “straight-line segments
    [Show full text]
  • Constraint Graph Drawing
    Constrained Graph Drawing Dissertation zur Erlangung des akademischen Grades des Doktors der Naturwissenschaften vorgelegt von Barbara Pampel, geb. Schlieper an der Universit¨at Konstanz Mathematisch-Naturwissenschaftliche Sektion Fachbereich Informatik und Informationswissenschaft Tag der mundlichen¨ Prufung:¨ 14. Juli 2011 1. Referent: Prof. Dr. Ulrik Brandes 2. Referent: Prof. Dr. Michael Kaufmann II Teile dieser Arbeit basieren auf Ver¨offentlichungen, die aus der Zusammenar- beit mit anderen Wissenschaftlerinnen und Wissenschaftlern entstanden sind. Zu allen diesen Inhalten wurden wesentliche Beitr¨age geleistet. Kapitel 3 (Bachmaier, Brandes, and Schlieper, 2005; Brandes and Schlieper, 2009) Kapitel 4 (Brandes and Pampel, 2009) Kapitel 6 (Brandes, Cornelsen, Pampel, and Sallaberry, 2010b) Zusammenfassung Netzwerke werden in den unterschiedlichsten Forschungsgebieten zur Repr¨asenta- tion relationaler Daten genutzt. Durch geeignete mathematische Methoden kann man diese Netzwerke als Graphen darstellen. Ein Graph ist ein Gebilde aus Kno- ten und Kanten, welche die Knoten verbinden. Hierbei k¨onnen sowohl die Kan- ten als auch die Knoten weitere Informationen beinhalten. Diese Informationen k¨onnen den einzelnen Elementen zugeordnet sein, sich aber auch aus Anordnung und Verteilung der Elemente ergeben. Mit Algorithmen (strukturierten Reihen von Arbeitsanweisungen) aus dem Gebiet des Graphenzeichnens kann man die unterschiedlichsten Informationen aus verschiedenen Forschungsbereichen visualisieren. Graphische Darstellungen k¨onnen das
    [Show full text]
  • Multidimensional Network Analysis
    Universita` degli Studi di Pisa Dipartimento di Informatica Dottorato di Ricerca in Informatica Ph.D. Thesis Multidimensional Network Analysis Michele Coscia Supervisor Supervisor Fosca Giannotti Dino Pedreschi May 9, 2012 Abstract This thesis is focused on the study of multidimensional networks. A multidimensional network is a network in which among the nodes there may be multiple different qualitative and quantitative relations. Traditionally, complex network analysis has focused on networks with only one kind of relation. Even with this constraint, monodimensional networks posed many analytic challenges, being representations of ubiquitous complex systems in nature. However, it is a matter of common experience that the constraint of considering only one single relation at a time limits the set of real world phenomena that can be represented with complex networks. When multiple different relations act at the same time, traditional complex network analysis cannot provide suitable an- alytic tools. To provide the suitable tools for this scenario is exactly the aim of this thesis: the creation and study of a Multidimensional Network Analysis, to extend the toolbox of complex network analysis and grasp the complexity of real world phenomena. The urgency and need for a multidimensional network analysis is here presented, along with an empirical proof of the ubiquity of this multifaceted reality in different complex networks, and some related works that in the last two years were proposed in this novel setting, yet to be systematically defined. Then, we tackle the foundations of the multidimensional setting at different levels, both by looking at the basic exten- sions of the known model and by developing novel algorithms and frameworks for well-understood and useful problems, such as community discovery (our main case study), temporal analysis, link prediction and more.
    [Show full text]
  • Centrality in Modular Networks
    Ghalmane et al. EPJ Data Science (2019)8:15 https://doi.org/10.1140/epjds/s13688-019-0195-7 REGULAR ARTICLE OpenAccess Centrality in modular networks Zakariya Ghalmane1,3, Mohammed El Hassouni1, Chantal Cherifi2 and Hocine Cherifi3* *Correspondence: hocine.cherifi@u-bourgogne.fr Abstract 3LE2I, UMR6306 CNRS, University of Burgundy, Dijon, France Identifying influential nodes in a network is a fundamental issue due to its wide Full list of author information is applications, such as accelerating information diffusion or halting virus spreading. available at the end of the article Many measures based on the network topology have emerged over the years to identify influential nodes such as Betweenness, Closeness, and Eigenvalue centrality. However, although most real-world networks are made of groups of tightly connected nodes which are sparsely connected with the rest of the network in a so-called modular structure, few measures exploit this property. Recent works have shown that it has a significant effect on the dynamics of networks. In a modular network, a node has two types of influence: a local influence (on the nodes of its community) through its intra-community links and a global influence (on the nodes in other communities) through its inter-community links. Depending on the strength of the community structure, these two components are more or less influential. Based on this idea, we propose to extend all the standard centrality measures defined for networks with no community structure to modular networks. The so-called “Modular centrality” is a two-dimensional vector. Its first component quantifies the local influence of a node in its community while the second component quantifies its global influence on the other communities of the network.
    [Show full text]