Comparison Study on Graph Sampling Algorithms for Interactive Visualizations of Large-Scale Networks

Total Page:16

File Type:pdf, Size:1020Kb

Comparison Study on Graph Sampling Algorithms for Interactive Visualizations of Large-Scale Networks DEGREE PROJECT IN INFORMATION AND COMMUNICATION TECHNOLOGY, SECOND CYCLE, 30 CREDITS STOCKHOLM, SWEDEN 2019 Comparison study on graph sampling algorithms for interactive visualizations of large-scale networks ALEKSANDRA VOROSHILOVA KTH ROYAL INSTITUTE OF TECHNOLOGY SCHOOL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE Comparison study on graph sampling algorithms for interactive visualizations of large-scale networks Aleksandra Voroshilova 2019-06-20 Master’s Thesis Place for Project Stockholm, Sweden Examiner Mihhail Matskin KTH Royal Institute of Technology Supervisor Tino Weinkauf KTH Royal Institute of Technology Industry Supervisor Alexandros Gkogkas Hive Streaming Abstract Networks are present in computer science, sociology, biology, and neuroscience as well as in applied fields such as transportation, communication, medical industries. The growing volumes of data collection are pushing scalability and performance requirements on graph algorithms, and at the same time, a need for a deeper understanding of these structures through visualization arises. Network diagrams or graph drawings can facilitate the understanding of data, making intuitive the identification of the largest clusters, the number of connected components, the overall structure, and detecting anomalies, which is not achievable through textual or matrix representations. The aim of this study was to evaluate approaches that would enable visualization of a large scale peer-to-peer video live streaming networks. The visualization of such large scale graphs has technical limitations which can be overcome by filtering important structural data from the networks. In this study, four sampling algorithms for graph reduction were applied to large overlay peer-to-peer network graphs and compared. The four algorithms cover different approaches: selecting links with the highest weight, selecting nodes with the highest cumulative weight, using betweenness centrality metrics, and constructing a focus-based tree. Through the evaluation process, it was discovered that the algorithm based on betweenness centrality approximation offers the best results. Finally, for each of the algorithms in comparison, their resulting sampled graphs were visualized using a force- directed layout with a 2-step loading approach to depict their effect on the representation of the graphs. Keywords Graph sampling, graph filtering, large graph visualization ii Abstract Nätverk återfinns inom datavetenskap, sociologi, biologi och neurovetenskap samt inom tillämpade områden så som transport, kommunikation och inom medicinindustrin. Den växande mängden datainsamling pressar skalbarheten och prestandakraven på grafalgoritmer, samtidigt som det uppstår ett behov av en djupare förståelse av dessa strukturer genom visualisering. Nätverksdiagram eller grafritningar kan underlätta förståelsen av data, identifiera de största grupperna, ett antal anslutna komponenter, visa en övergripande struktur och upptäcka avvikelser, något som inte kan uppnås med text- eller matrisrepresentationer. Syftet med denna studie var att utvärdera tillvägagångssätt som kunde möjliggöra visualisering av ett omfattande P2P (peer-to-peer) livestreaming- nätverk. Visualiseringen av större grafer har tekniska begränsningar, något som kan lösas genom att samla viktiga strukturella data från nätverken. I den här studien applicerades fyra provtagningsalgoritmer för grafreduktion på stora överlagringar av P2P-nätverksgrafer för att sedan jämföras. De fyra algoritmerna är baserade på val av länkar med högsta vikt, av nodar med högsta kumulativa vikt, betweenness-centralitetsvärden för att konstruera ett fokusbaserat träd som har de längsta vägarna uteslutna. Under utvärderingsprocessen upptäcktes det att algoritmen baserad på betweenness-centralitetstillnärmning visade de bästa resultaten. Dessutom, för varje algoritm i jämförelsen, visualiserades deras slutliga samplade grafer genom att använda en kraftstyrd layout med ett 2-stegs laddningsinfart. Nyckelord grafreduktion, stor graf visualisering iii Acknowledgements I would like to thank my supervisors: Tino Weinkauf for guiding on the process, giving advices on the next steps and keeping it always professional and fun, and Alexandros Gkogkas for giving the opportunity to start this interesting study and sharing his knowledge and helping with everything I needed to successfully finish it. I thank Mihhail Matskin for examining the work and giving the final feedback. I also owe a gratitude to Hive Streaming team, who have made working there very pleasant. Thanks to my friends from EIT Digital Master School and Teknikringen who have made this year very memorable, and encouraged and inspired me to go on. Finally, I want to thank my family: Angela, Artem and Ksenia for always being there for me. Zoja, Maja, and Borja for being an example. iv Contents 1 Introduction 1 1.1 Background ............................... 2 1.2 Problem ................................. 3 1.3 Purpose ................................. 4 1.4 Goal .................................... 4 1.5 Benefits, Ethics and Sustainability ................. 4 1.6 Methodology .............................. 5 1.7 Stakeholders .............................. 5 1.8 Delimitations .............................. 5 1.9 Outline .................................. 6 2 Background 7 2.1 Network theory ............................. 7 2.2 Graph visualization ........................... 12 2.2.1 Graph layout ........................... 12 2.2.2 Edge bundling .......................... 14 2.2.3 Reduction and Clustering .................... 14 2.3 Related Work .............................. 14 2.4 Sampling algorithms .......................... 15 2.4.1 Simple random sample (SRS2) . 16 2.4.2 Weighted Independence Sampling (WIS) . 16 2.4.3 Edge Filtering based on betweenness centrality (BC) . 17 2.4.4 Focus-based Filtering (FF) ................... 18 3 Comparative study design 20 3.1 Research Process and Paradigm .................. 20 3.2 Experimental design .......................... 20 v 3.3 Planned data analysis ......................... 21 3.4 Test data collection ........................... 21 3.4.1 Degree distribution ....................... 22 3.4.2 Reduction size .......................... 23 3.5 Test environment ............................ 24 3.6 Assessing reliability and validity of the data collected ...... 24 3.7 Evaluation framework ......................... 25 4 Implementation 26 4.1 Graph libraries ............................. 26 4.2 Graph-tool graph library ........................ 27 4.3 NetworkX graph library ........................ 28 4.4 Graph sampling implementation ................... 29 4.5 Performance ............................... 29 4.6 Visualization ............................... 30 4.7 Implementation ............................. 32 5 Analysis and evaluation of results 34 5.1 Algorithms performance comparison ................ 34 5.1.1 Hypothesis 1 ........................... 35 5.1.2 Hypothesis 2 ........................... 37 5.1.3 Hypothesis 3 ........................... 39 5.1.4 Hypothesis 4 ........................... 41 5.1.5 Hypothesis 5 ........................... 43 5.1.6 Hypothesis 6 ........................... 45 5.1.7 Conclusions ........................... 47 5.2 Visualization results .......................... 49 5.2.1 Test graphs visualization .................... 49 5.2.2 Overlay network visualization . 53 6 Conclusions 60 References ................................... 62 vi Chapter 1 Introduction Growing amounts of data are constantly generated, collected, analyzed and stored for later use. As the data collection rate increases, data processing algorithms are pushed towards optimization for size and complexity. Furthermore, a need for a deeper understanding of underlying data structures, such as complex graphs, emerges. A graph data structure is a mathematical representation of networks, consisting of data points and relationships among them. Common examples of such are social networks, neural networks, computer networks, traffic networks, and many other [1]. The networks of web pages are examples of tightly interconnected graphs, where the pages are the nodes, and the links among them are directed edges. As more pages are added, the networks are increasing in size. For instance, by the date of this paper, Wikipedia contains around 47 million pages in 293 languages, and 5,8 million interconnected articles in the English language alone [2]. Visualizing such a graph is a challenge from computational, layout, and information visualization points of view. Wikipedia is not the largest example of a network. According to Google, the World Wide Web consists of trillions of interconnected indexed pages. Large networks are also present in other sciences. For example, a neural network of a human brain includes 86 billion interconnected neurons [3]. Graphs can be represented as pairwise relations in a textual format, however, it 1 is hard to make conclusions on the network properties by looking at a text file. A visual representation of a graph is easier to comprehend. The graph structure, connected components, and clusters can be determined by having a quick glance at it, furthermore, it also possible to detect the anomalies and properties by examining the visualization thoroughly. Technical and visual limitations are the two main challenges in the area of large graph visualization. Most graphics engines have a ceiling on the size of a graph they are able to render before running out of memory and computational
Recommended publications
  • Robustness and Assortativity for Diffusion-Like Processes in Scale
    epl draft Robustness and Assortativity for Diffusion-like Processes in Scale- free Networks G. D’Agostino1, A. Scala2,3, V. Zlatic´4 and G. Caldarelli5,3 1 ENEA - CR ”Casaccia” - Via Anguillarese 301 00123, Roma - Italy 2 CNR-ISC and Department of Physics, University of Rome “Sapienza” P.le Aldo Moro 5 00185 Rome, Italy 3 London Institute of Mathematical Sciences, 22 South Audley St Mayfair London W1K 2NY, UK 4 Theoretical Physics Division, Rudjer Boˇskovi´cInstitute, P.O.Box 180, HR-10002 Zagreb, Croatia 5 IMT Lucca Institute for Advanced Studies, Piazza S. Ponziano 6, Lucca, 55100, Italy PACS 89.75.Hc – Networks and genealogical trees PACS 05.70.Ln – Nonequilibrium and irreversible thermodynamics PACS 87.23.Ge – Dynamics of social systems Abstract – By analysing the diffusive dynamics of epidemics and of distress in complex networks, we study the effect of the assortativity on the robustness of the networks. We first determine by spectral analysis the thresholds above which epidemics/failures can spread; we then calculate the slowest diffusional times. Our results shows that disassortative networks exhibit a higher epidemi- ological threshold and are therefore easier to immunize, while in assortative networks there is a longer time for intervention before epidemic/failure spreads. Moreover, we study by computer sim- ulations the sandpile cascade model, a diffusive model of distress propagation (financial contagion). We show that, while assortative networks are more prone to the propagation of epidemic/failures, degree-targeted immunization policies increases their resilience to systemic risk. Introduction. – The heterogeneity in the distribu- propagation of an epidemic [13–15].
    [Show full text]
  • Ollivier Ricci Curvature of Directed Hypergraphs Marzieh Eidi1* & Jürgen Jost1,2
    www.nature.com/scientificreports OPEN Ollivier Ricci curvature of directed hypergraphs Marzieh Eidi1* & Jürgen Jost1,2 Many empirical networks incorporate higher order relations between elements and therefore are naturally modelled as, possibly directed and/or weighted, hypergraphs, rather than merely as graphs. In order to develop a systematic tool for the statistical analysis of such hypergraph, we propose a general defnition of Ricci curvature on directed hypergraphs and explore the consequences of that defnition. The defnition generalizes Ollivier’s defnition for graphs. It involves a carefully designed optimal transport problem between sets of vertices. While the defnition looks somewhat complex, in the end we shall be able to express our curvature in a very simple formula, κ = µ0 − µ2 − 2µ3 . This formula simply counts the fraction of vertices that have to be moved by distances 0, 2 or 3 in an optimal transport plan. We can then characterize various classes of hypergraphs by their curvature. Principles of network analysis. Network analysis constitutes one of the success stories in the study of complex systems3,6,11. For the mathematical analysis, a network is modelled as a (perhaps weighted and/or directed) graph. One can then look at certain graph theoretical properties of an empirical network, like its degree or motiv distribution, its assortativity or clustering coefcient, the spectrum of its Laplacian, and so on. One can also compare an empirical network with certain deterministic or random theoretical models. Successful as this analysis clearly is, we nevertheless see two important limitations. One is that many of the prominent concepts and quantities used in the analysis of empirical networks are node based, like the degree sequence.
    [Show full text]
  • Assortativity and Mixing
    Assortativity and Assortativity and Mixing General mixing between node categories Mixing Assortativity and Mixing Definition Definition I Assume types of nodes are countable, and are Complex Networks General mixing General mixing Assortativity by assigned numbers 1, 2, 3, . Assortativity by CSYS/MATH 303, Spring, 2011 degree degree I Consider networks with directed edges. Contagion Contagion References an edge connects a node of type µ References e = Pr Prof. Peter Dodds µν to a node of type ν Department of Mathematics & Statistics Center for Complex Systems aµ = Pr(an edge comes from a node of type µ) Vermont Advanced Computing Center University of Vermont bν = Pr(an edge leads to a node of type ν) ~ I Write E = [eµν], ~a = [aµ], and b = [bν]. I Requirements: X X X eµν = 1, eµν = aµ, and eµν = bν. µ ν ν µ Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. 1 of 26 4 of 26 Assortativity and Assortativity and Outline Mixing Notes: Mixing Definition Definition General mixing General mixing Assortativity by I Varying eµν allows us to move between the following: Assortativity by degree degree Definition Contagion 1. Perfectly assortative networks where nodes only Contagion References connect to like nodes, and the network breaks into References subnetworks. General mixing P Requires eµν = 0 if µ 6= ν and µ eµµ = 1. 2. Uncorrelated networks (as we have studied so far) Assortativity by degree For these we must have independence: eµν = aµbν . 3. Disassortative networks where nodes connect to nodes distinct from themselves. Contagion I Disassortative networks can be hard to build and may require constraints on the eµν.
    [Show full text]
  • Extraction and Analysis of Facebook Friendship Relations
    Chapter 12 Extraction and Analysis of Facebook Friendship Relations Salvatore Catanese, Pasquale De Meo, Emilio Ferrara, Giacomo Fiumara, and Alessandro Provetti Abstract Online social networks (OSNs) are a unique web and social phenomenon, affecting tastes and behaviors of their users and helping them to maintain/create friendships. It is interesting to analyze the growth and evolution of online social networks both from the point of view of marketing and offer of new services and from a scientific viewpoint, since their structure and evolution may share similarities with real-life social networks. In social sciences, several techniques for analyzing (off-line) social networks have been developed, to evaluate quantitative properties (e.g., defining metrics and measures of structural characteristics of the networks) or qualitative aspects (e.g., studying the attachment model for the network evolution, the binary trust relationships, and the link prediction problem). However, OSN analysis poses novel challenges both to computer and Social scientists. We present our long-term research effort in analyzing Facebook, the largest and arguably most successful OSN today: it gathers more than 500 million users. Access to data about Facebook users and their friendship relations is restricted; thus, we acquired the necessary information directly from the front end of the website, in order to reconstruct a subgraph representing anonymous interconnections among a significant subset of users. We describe our ad hoc, privacy-compliant crawler for Facebook data extraction. To minimize bias, we adopt two different graph mining techniques: breadth-first-search (BFS) and rejection sampling. To analyze the structural properties of samples consisting of millions of nodes, we developed a S.Catanese•P.DeMeo•G.Fiumara Department of Physics, Informatics Section.
    [Show full text]
  • The Perceived Assortativity of Social Networks: Methodological Problems and Solutions
    The perceived assortativity of social networks: Methodological problems and solutions David N. Fisher1, Matthew J. Silk2 and Daniel W. Franks3 Abstract Networks describe a range of social, biological and technical phenomena. An important property of a network is its degree correlation or assortativity, describing how nodes in the network associate based on their number of connections. Social networks are typically thought to be distinct from other networks in being assortative (possessing positive degree correlations); well-connected individuals associate with other well-connected individuals, and poorly-connected individuals associate with each other. We review the evidence for this in the literature and find that, while social networks are more assortative than non-social networks, only when they are built using group-based methods do they tend to be positively assortative. Non-social networks tend to be disassortative. We go on to show that connecting individuals due to shared membership of a group, a commonly used method, biases towards assortativity unless a large enough number of censuses of the network are taken. We present a number of solutions to overcoming this bias by drawing on advances in sociological and biological fields. Adoption of these methods across all fields can greatly enhance our understanding of social networks and networks in general. 1 1. Department of Integrative Biology, University of Guelph, Guelph, ON, N1G 2W1, Canada. Email: [email protected] (primary corresponding author) 2. Environment and Sustainability Institute, University of Exeter, Penryn Campus, Penryn, TR10 9FE, UK Email: [email protected] 3. Department of Biology & Department of Computer Science, University of York, York, YO10 5DD, UK Email: [email protected] Keywords: Assortativity; Degree assortativity; Degree correlation; Null models; Social networks Introduction Network theory is a useful tool that can help us explain a range of social, biological and technical phenomena (Pastor-Satorras et al 2001; Girvan and Newman 2002; Krause et al 2007).
    [Show full text]
  • Assortativity Measures for Weighted and Directed Networks
    Assortativity measures for weighted and directed networks Yelie Yuan1, Jun Yan1, and Panpan Zhang2,∗ 1Department of Statistics, University of Connecticut, Storrs, CT 06269 2Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA 19104 ∗Corresponding author: [email protected] January 15, 2021 arXiv:2101.05389v1 [stat.AP] 13 Jan 2021 Abstract Assortativity measures the tendency of a vertex in a network being connected by other ver- texes with respect to some vertex-specific features. Classical assortativity coefficients are defined for unweighted and undirected networks with respect to vertex degree. We propose a class of assortativity coefficients that capture the assortative characteristics and structure of weighted and directed networks more precisely. The vertex-to-vertex strength correlation is used as an example, but the proposed measure can be applied to any pair of vertex-specific features. The effectiveness of the proposed measure is assessed through extensive simula- tions based on prevalent random network models in comparison with existing assortativity measures. In application World Input-Ouput Networks, the new measures reveal interesting insights that would not be obtained by using existing ones. An implementation is publicly available in a R package wdnet. 1 Introduction In traditional network analysis, assortativity or assortative mixing (Newman, 2002) is a mea- sure assessing the preference of a vertex being connected (by edges) with other vertexes in a network. The measure reflects the principle of homophily (McPherson et al., 2001)|the tendency of the entities to be associated with similar partners in a social network. The prim- itive assortativity measure proposed by Newman(2002) was defined to study the tendency of connections between nodes based on their degrees, which is why it is also called degree-degree correlation (van der Hofstad and Litvak, 2014).
    [Show full text]
  • Spatial Analysis and Modeling of Urban Transportation Networks
    Spatial analysis and modeling of urban transportation networks Jingyi Lin Doctoral Thesis in Geodesy and Geoinformatics with Specialization in Geoinformatics Royal Institute of Technology Stockholm, Sweden, 2017 TRITA SoM 2017-15 ISRN KTH/SoM/2017-15/SE ISBN 978-91-7729-613-3 Doctoral Thesis Royal Institute of Technology (KTH) Department of Urban Planning and Environment Division of Geodesy and Geoinformatics SE-100 44 Stockholm Sweden © Jingyi Lin, 2017 Printed by US AB, Sweden 2017 Abstract Transport systems in general, and urban transportation systems in particular, are the backbone of a country or a city, therefore play an intrinsic role in the socio- economic development. There have been numerous studies on real transportation systems from multiple fileds, including geography, urban planning, and engineering. Geographers have extensively investigated transportation systems, and transport geography has developed as an important branch of geography with various studies on system structure, efficiency optimization, and flow distribution. However, the emergence of complex network theory provided a brand-new perspective for geographers and other researchers; therefore, it invoked more widespread interest in exploring transportation systems that present a typical node-link network structure. This trend inspires the author and, to a large extent, constitutes the motivation of this thesis. The overall objective of this thesis is to study and simulate the structure and dynamics of urban transportation systems, including aviation systems and ground transportation systems. More specificically, topological features, geometric properties, and dynamic evolution processes are explored and discussed in this thesis. To illustrate different construction mechanisms, as well as distinct evolving backgrounds of aviation systems and ground systems, China’s aviation network, U.S.
    [Show full text]
  • Edge Direction and the Structure of Networks
    Edge direction and the structure of networks Jacob G. Foster1, David V. Foster, Peter Grassberger, and Maya Paczuski Complexity Science Group, Department of Physics and Astronomy, University of Calgary, 2500 University Drive NW, Calgary, AB, Canada T2N 1N4 Edited by H. Eugene Stanley, Boston University, Boston, MA, and approved April 6, 2010 (received for review November 2, 2009) Directed networks are ubiquitous and are necessary to represent In directed networks, an edge from source to target (A → B) complex systems with asymmetric interactions—from food webs represents an asymmetric interaction; for example, that Web site to the World Wide Web. Despite the importance of edge direction A contains a hyperlink to Web site B, or organism A is eaten for detecting local and community structure, it has been disregarded by organism B. Edge direction is essential to evaluate and explain in studying a basic type of global diversity in networks: the tendency local structure in such networks. For instance, motif analysis of nodes with similar numbers of edges to connect. This tendency, (13, 14) identifies local connection patterns that appear more called assortativity, affects crucial structural and dynamic properties frequently in the real-world network than in ensembles of rando- of real-world networks, such as error tolerance or epidemic spread- mized networks. In this context, edge direction distinguishes ing. Here we demonstrate that edge direction has profound effects functional units like feed-forward and feedback loops. Takingedge on assortativity. We define a set of four directed assortativity mea- direction into account also overturns the simple picture of the sures and assign statistical significance by comparison to rando- World Wide Web (WWW) as having a short average distance mized networks.
    [Show full text]
  • Positive Network Assortativity of Influenza Vaccination at a High School: Implications for Outbreak Risk and Herd Immunity
    Positive Network Assortativity of Influenza Vaccination at a High School: Implications for Outbreak Risk and Herd Immunity Victoria C. Barclay1*., Timo Smieszek1.¤a¤b, Jianping He2, Guohong Cao2, Jeanette J. Rainey3, Hongjiang Gao3, Amra Uzicanin3, Marcel Salathe´ 1 1 Center for Infectious Disease Dynamics, Department of Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America, 2 Department of Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America, 3 Division of Global Migration and Quarantine, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America Abstract Schools are known to play a significant role in the spread of influenza. High vaccination coverage can reduce infectious disease spread within schools and the wider community through vaccine-induced immunity in vaccinated individuals and through the indirect effects afforded by herd immunity. In general, herd immunity is greatest when vaccination coverage is highest, but clusters of unvaccinated individuals can reduce herd immunity. Here, we empirically assess the extent of such clustering by measuring whether vaccinated individuals are randomly distributed or demonstrate positive assortativity across a United States high school contact network. Using computational models based on these empirical measurements, we further assess the impact of assortativity on influenza disease dynamics. We found that the contact network was positively assortative with respect to influenza vaccination: unvaccinated individuals tended to be in contact more often with other unvaccinated individuals than with vaccinated individuals, and these effects were most pronounced when we analyzed contact data collected over multiple days. Of note, unvaccinated males contributed substantially more than unvaccinated females towards the measured positive vaccination assortativity.
    [Show full text]
  • Basics of Network Analysis
    Basics of Network Analysis Hiroki Sayama [email protected] Graph = Network • G(V, E): graph (network) V: vertices (nodes), E: edges (links) 1 Nodes = 1, 2, 3, 4, 5 Links = 1<->2, 1<->3, 1<->5, 3 2 2<->3, 2<->4, 2<->5, 3<->4, 3<->5, 4<->5 4 (Nodes may have states; links may have directions and weights) 5 2 Representation of a network • Adjacency matrix: A matrix with rows and columns labeled by nodes, where element aij shows the number of links going from node i to node j (becomes symmetric for undirected graph) • Adjacency list: A list of links whose element “i->j” shows a link going from node i to node j (also represented as “i -> {j1, j2, j3, …}”) 3 Exercise • Represent the following network in: 1 – Adjacency matrix 3 – Adjacency list 2 4 5 4 Degree of a node • A degree of node u, deg(u), is the number of links connected to u u1 u2 deg(u1) = 4 deg(u2) = 2 5 Connected graph • A graph in which there is a path between any pair of nodes 6 Connected components Connected Number of component connected components = 2 Connected component 7 Complete graph • A graph in which any pair of nodes are connected (often written as K1, K2, …) 8 Regular graph • A graph in which all nodes have the same degree (often called k-regular graph with degree k) 9 Bipartite graph • A graph whose nodes can be divided into two subsets so that no link connects nodes within the same subset = 10 Directed graph • Each link is directed • Direction repre- sents either order of relationship or accessibility between nodes E.g.
    [Show full text]
  • Immunization Strategies in Directed Networks
    MBE, 17(4): 3925–3952. DOI: 10.3934/mbe.2020218 Received: 12 March 2020 Accepted: 20 May 2020 http://www.aimspress.com/journal/MBE Published: 28 May 2020 Research article Immunization strategies in directed networks Junbo Jia, Wei Shi, Pan Yang and Xinchu Fu∗ Department of Mathematics, Shanghai University, Shanghai 200444, China * Correspondence: Email: [email protected]; Tel: +862166132664; Fax: +862166133292. Abstract: Many complex systems can be modeled as directed networks, which can be regarded as a generalization of undirected networks. In this paper, epidemic dynamics and immunization strategies in directed networks are studied. First, a Susceptible-Infected-Susceptible (SIS) model on a directed network is established employing the mean-field method, and its dynamics and epidemic threshold of the network are studied. Then based on the continuous degree technique, namely, considering the degree of a node as a continuous variable, we propose a method to calculate the epidemic threshold of the immunized network. Besides, some immunization strategies, including optimal immunization, random immunization, combined targeted immunization, and combined acquaintance immunization, and three special networks are considered. Finally, through numerical analysis, all immunization strategies are simulated and compared on different types of networks. We find that the nodes with the largest product of in-degree and out-degree are the most worthy of being immunized. Keywords: directed network; optimal immunization; random immunization; combined targeted immunization; combined acquaintance immunization 1. Introduction Complex network is a powerful tool to describe and study complex systems [1]. Generally, the network is represented by a graph in mathematics, including nodes, which represent the individual component in the system, and edges between the nodes, which represent the relationship or interaction between individuals in the system.
    [Show full text]
  • Co-Posting Author Assortativity in Reddit
    Co-posting Author Assortativity in Reddit Francesco Cauteruccio1, Enrico Corradini2, Giorgio Terracina1, Domenico Ursino2, and Luca Virgili2 1 DEMACS, University of Calabria 2 DII, Polytechnic University of Marche Abstract. In the context of social networks, a renowned paper of New- man introduced the notion of \assortativity", also known as \assortative mixing". Strictly akin to the concept of homophily, it shows how much a node tends to associate with other nodes somewhat similar to it. Degree centrality is the most used similarity metrics for evaluating assortativ- ity between nodes, but several more could be dealt with. Assortativity was deeply investigated in many past researches, given different social platforms. However, Reddit was not one of the social networks taken into account, even if it is a really popular social medium. In this paper, we want to find out the possible presence of a form of assortativity in Reddit; in particular, we focus our analysis on co-posters, i.e. authors posting contents on the same subreddit. Keywords: Reddit; Co-posters; Assortativity; Social Network Analysis; De- gree Centrality 1 Introduction Assortativity and degree assortativity were introduced in a renowned paper of Newman [17]. Here, the author defines a measure of assortativity for networks showing that real social networks are often assortative, whereas technological and biological networks tend to be disassortative. He also models an assortative net- work and exploits it for analytic and numeric studies. At the end of this analysis, he finds that assortative networks tend to percolate more easily than disassor- tative ones and that they are more robust to node removal.
    [Show full text]