Parallel Personalized Pagerank on Dynamic Graphs

Total Page:16

File Type:pdf, Size:1020Kb

Parallel Personalized Pagerank on Dynamic Graphs Parallel Personalized PageRank on Dynamic Graphs Wentian Guo, Yuchen Li, Mo Sha, Kian-Lee Tan School of Computing, National University of Singapore wentian, liyuchen, sham, tankl @comp.nus.edu.sg { } ABSTRACT representative graphs often evolve rapidly in many applica- 9 Personalized PageRank (PPR) is a well-known proximity tions. For example, network traffic data averages 10 pack- measure in graphs. To meet the need for dynamic PPR ets/hour/router for large ISPs [17] and Twitter has more maintenance, recent works have proposed a local update than 500 million tweets per day [3]. Since computation of scheme to support incremental computation. Nevertheless, PPR from scratch is prohibitively slow against high rate of sequential execution of the scheme is still too slow for high- graph updates, recent works [38, 49] focus on incremental speed stream processing. Therefore, we are motivated to PPR maintenance. The state-of-the-art dynamic PPR ap- design a parallel approach for dynamic PPR computation. proaches are based on a local update scheme [6, 5, 11, 31, 29, First, as updates always come in batches, we devise a batch 38, 49]. This scheme takes two steps against each single up- processing method to reduce synchronization cost among ev- date: (1) restore invariants; (2) perform local pushes. More ery single update and enable more parallelism for iterative specifically, each vertex v keeps two values Ps(v)andRs(v) parallel execution. Our theoretical analysis shows that the where Ps(v) is the current estimate to the PPR value of v parallel approach has the same asymptotic complexity as w.r.t. the source vertex s and Rs(v)istheupperboundon the sequential approach. Second, we devise novel optimiza- the estimation bias. We call Ps(v)andRs(v)astheestimate tion techniques to e↵ectively reduce runtime overheads for and residual of v respectively. The invariant is maintained parallel processes. Experimental evaluation shows that our for all vertices to ensure the residual is valid. To handle parallel algorithm can achieve orders of magnitude speedups any update, e.g., an edge insertion/deletion, the invariant is on GPUs and multi-core CPUs compared with the state-of- first restored which results in change in the residual. Subse- the-art sequential algorithm. quently, if the residual of a vertex v is over a user-specified threshold, i.e., ✏, the residual is pushed to v’s neighbors. PVLDB Reference Format: Although it has been shown in [49] that only a few op- Wentian Guo, Yuchen Li, Mo Sha, Kian-Lee Tan. Parallel Per- erations are needed to handle any single edge update (in- sonalized PageRank on Dynamic Graphs. PVLDB,11(1):3-9 sertion/deletion) under the common edge arrival models, it 106, 2017. DOI: https://doi.org/10.14778/3136610.3136618 is practically inefficient. We have observed latency of up to 2 3minutesagainstanupdatebatchconsistingof5000 edges− using the state-of-the-art approach [49] in our exper- 1. INTRODUCTION iments. This hardly matches the streaming rates of real- Personalized PageRank is one of the most widely used world dynamic graphs (e.g., 7000 tweets are generated per proximity measure in graphs. While PageRank quantifies second in Twitter [3]). Meanwhile, the emerging hardware the importance of vertices overall, the PPR vector measures architectures provide the consumers with massive computa- the importance of vertices w.r.t to a given source vertex s. tion power. The two of the most popular hardwares, multi- For instance, a vertex v with a larger PPR value w.r.t. s core CPUs and GPUs have shown their great potentials in implies that a random walk starting from s has a higher accelerating the graph applications [42, 36, 47, 34, 25, 35]. probability to reach v.PPRisusedinapplicationsofmany By utilizing the emerging hardwares, we are motivated to fields, such as web search for Internet individuals [39, 22], devise an efficient parallel approach for dynamic PPR com- user recommendation [8, 19], community detection [48] and putation under the state-of-the-art local update scheme to graph partitioning [6, 5]. achieve high-speed stream processing. Most existing works [39, 22, 6, 5, 30, 31, 29, 11] fo- Parallelizing dynamic PPR computation poses two new cus on PPR computation on static graphs. However, the challenges. First, as the update often comes in batches, it remains unclear if the batch can be updated in parallel for Permission to make digital or hard copies of all or part of this work for the local update scheme. A naive solution is to synchro- personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies nize on each single update to repair the invariant and then bear this notice and the full citation on the first page. To copy otherwise, to launch the local push. However, this solution causes sig- republish, to post on servers or to redistribute to lists, requires prior specific nificant synchronization overheads against batch updates. permission and/or a fee. Articles from this volume were invited to present Second, parallelizing the local push is non-trivial as it in- their results at The 44th International Conference on Very Large Data Bases, curs extra overheads compared with a sequential approach. August 2018, Rio de Janeiro, Brazil. We call the local push getting parallelized as the parallel Proceedings of the VLDB Endowment, Vol. 11, No. 1 Copyright 2017 VLDB Endowment 2150-8097/17/09... $ 10.00. push. The extra overheads of the parallel push come from DOI: https://doi.org/10.14778/3136610.3136618 93 two sources: (a) the parallel push, can take more operations, Algorithm 1 RestoreInvariant as it reads stale values of the residual at the beginning of it- Input: (G,Ps,Rs,u,v,op) erations and then propagates less probability mass from the Require: (u, v, op)istheupdatingedgewithop being the in- frontier vertices (the frontier contains vertices that need to sert/delete operation. 1: procedure Insert(u, v) be pushed in current iteration); (b) the parallel push needs (1 ↵) Ps(v) Ps(u) ↵ Rs(u)+↵ 1u=s 1 2: Rs(u) += − · − − · · to iteratively maintain a vertex frontier and thus results in dout(u) · ↵ 3: procedure Delete(u, v) synchronization costs to merge duplicate vertices. (1 ↵) Ps(v) Ps(u) ↵ Rs(u)+↵ 1u=s 1 4: Rs(u) = − · − − · · In this paper, we propose a novel parallel approach to − dout(u) · ↵ tackle the aforementioned challenges. First, we devise a batch processing method to avoid synchronization costs within a batch as well as to generate enough workloads for paral- are the out degrees of all vertices in G.ThePPRvector⇡s lelism. Our theoretical analysis proves that the parallel ap- w.r.t. s is the solution of the following linear equation: proach has the same asymptotic complexity as its sequential counterparts. Second, we propose novel optimization tech- ⇡s = ↵es +(1 ↵)W ⇡s (1) niques to practically reduce the number of local push op- − erations (eager propagation)andtoefficiently maintain the where ↵ is the teleport probability (which is a constant and vertex frontier in parallel by utilizing the monotonicity of it is typically set to 0.15), es is a personalized unit vector the local push process (local duplicate detection). with a single non-zero entry of 1 at s,andW is the transition We hereby summarize our contributions as follows: T 1 matrix s.t. W = A D− . We introduce a parallel approach to support dynamic Note that general PPR formulation does not restrict the • PPR computation over high-speed graph streams. One personalized vector to be a unit vector. We omit the dis- salient feature is that updates are processed in batches. cussion for the general case as it can be reduced to the case Theoretical analysis shows that the parallel approach with the unit vector scenario. In fact, many existing works has the same asymptotic complexity with its sequen- study how to use the unit vector formulation as a building tial counterpart under the two common edge arrival block to efficiently support the general case by maintaining models. To the best of our knowledge, this is the first multiple PPR vectors with di↵erent personalized unit vec- work on parallelizing dynamic PPR processing. tors. Interested readers are referred to [49, 38, 10, 31, 6, 9, We propose several optimization techniques to improve 29, 11]. Henceforth, this paper focuses on providing efficient • the performance of the parallel push. Eager propaga- parallel approach for the unit vector formalization. tion helps to reduce the number of local push opera- tions. Our novel frontier generation method efficiently 2.2 Dynamic Graph Model maintains the vertex frontier by mitigating synchro- We introduce a typical dynamic graph model [10, 49] to nization overheads to merge duplicate vertices. articulate the dynamic PPR problem. The dynamic model We implement our parallel approach on GPUs and ∆ t • consists of an unbounded sequence of updates E ,which multi-core CPUs. The experiments over real-world indicate the set of edges arriving at time step t.Eachel- datasets have verified the e↵ectiveness of our proposal ement of ∆Et is (u, v, op)thatmeansanedgeu v with over both architectures. The proposed approach can op denoting the type (insertion/deletion). Note that! in pre- achieve orders of magnitude speedups on GPUs and vious works [38, 49] only one single edge update arrives at multi-core CPUs compared to the state-of-the-art se- time step t, in the following we will discuss them with the quential approach. same notations but readers should notice ∆Et = 1 for the The remaining part of this paper is organized as follows. Sec- case of single update. 0 0 0 tion 2 introduces preliminaries and the background.
Recommended publications
  • The Long-Term Benefits of Positive Self-Presentation Via Profile
    fpsyg-08-01981 November 14, 2017 Time: 17:23 # 1 ORIGINAL RESEARCH published: 15 November 2017 doi: 10.3389/fpsyg.2017.01981 The Long-Term Benefits of Positive Self-Presentation via Profile Pictures, Number of Friends and the Initiation of Relationships on Facebook for Adolescents’ Self-Esteem and the Initiation of Offline Relationships Anna Metzler* and Herbert Scheithauer Developmental Science and Applied Developmental Psychology, Freie Universität Berlin, Berlin, Germany Social networking sites are a substantial part of adolescents’ daily lives. By using a longitudinal approach the current study examined the impact of (a) positive self- presentation, (b) number of friends, and (c) the initiation of online relationships on Facebook on adolescents’ self-esteem and their initiation of offline relationships, as Edited by: well as the mediating role of positive feedback. Questionnaire data were obtained Kai S. Cortina, from 217 adolescents (68% girls, mean age 16.7 years) in two waves. Adolescents’ University of Michigan, United States positive self-presentation and number of friends were found to be related to a higher Reviewed by: Burkhard Gniewosz, frequency of receiving positive feedback, which in turn was negatively associated with University of Salzburg, Austria self-esteem. However, the number of Facebook friends had a positive impact on self- Katherine Fiori, esteem, and the initiation of online relationships positively influenced the initiation of Adelphi University, United States offline relationships over time, demonstrating that Facebook
    [Show full text]
  • Friendship Paradox in Growth Networks: Analytical and Empirical Analysis
    Sidorov et al. Appl Netw Sci (2021) 6:51 https://doi.org/10.1007/s41109-021-00391-6 Applied Network Science RESEARCH Open Access Friendship paradox in growth networks: analytical and empirical analysis Sergei P. Sidorov* , Sergei V. Mironov and Alexey A. Grigoriev *Correspondence: [email protected] Abstract Saratov State University, Many empirical studies have shown that in social, citation, collaboration, and other Saratov, Russia 410012 types of networks in real world, the degree of almost every node is less than the aver- age degree of its neighbors. This imbalance is well known in sociology as the friendship paradox and states that your friends are more popular than you on average. If we intro- duce a value equal to the ratio of the average degree of the neighbors for a certain node to the degree of this node (which is called the ‘friendship index’, FI), then the FI value of more than 1 for most nodes indicates the presence of the friendship paradox in the network. In this paper, we study the behavior of the FI over time for networks generated by growth network models. We will focus our analysis on two models based on the use of the preferential attachment mechanism: the Barabási–Albert model and the triadic closure model. Using the mean-feld approach, we obtain diferential equa- tions describing the dynamics of changes in the FI over time, and accordingly, after obtaining their solutions, we fnd the expected values of this index over iterations. The results show that the values of FI are decreasing over time for all nodes in both models.
    [Show full text]
  • Network Interventions Based on Inversity: Leveraging the Friendship Paradox in Unknown Network Structures
    Network Interventions Based on Inversity: Leveraging the Friendship Paradox in Unknown Network Structures Vineet Kumar,1∗ David Krackhardt,2 Scott Feld3 1 Yale School of Management, Yale University. 2Heinz School of Public Policy and Management, Carnegie Mellon University 3 Department of Sociology, College of Liberal Arts, Purdue University. ∗To whom correspondence should be addressed; E-mail: [email protected] Network intervention problems benefit from selecting a more connected node, which is more likely to result in stronger indirect effects. However, in many network contexts, the structure of the network is unknown. We derive and examine the mathematical properties of two distinct “informationally light” strategies, a global strategy and local strategy, that yield higher degree nodes in virtually any network structure. These strategies are based on the friend- ship paradox: “your friends have more friends that you do.” We further iden- tify a novel network property called Inversity, which connects the fundamental parameters of these two strategies. We prove that the sign of Inversity for any given network determines which of the two strategies will be most effective in that network. We assess the performance of these strategies across a wide range of generative network models and real networks, and we show how to leverage network structure through these strategies even when the network structure is unknown. 1 Network-based interventions are of crucial importance in any setting where an individual’s choice or action has a multiplier impact on others, with a wide range of applications. Consider the following network intervention problems: (a) A new infectious disease is spreading through a large population.
    [Show full text]
  • The Structure of Social Networks: Modeling, Sampling, and Inference
    The Structure of Social Networks: Modeling, Sampling, and Inference Naghmeh Momeni Taramsari Doctor of Philosophy Department of Electrical and Computer Engineering McGill University Montreal,Quebec April 2017 A thesis submitted to McGill University in partial fulllment of the requirements of the degree of Doctor of Philosophy c Naghmeh Momeni Taramsari, 2017 TABLE OF CONTENTS ABSTRACT . vi ABRÉGÉ . vii ACKNOWLEDGEMENTS . viii PREFACE AND CONTRIBUTION OF AUTHORS . ix LIST OF FIGURES . x LIST OF TABLES . xiii 1 Introduction . 1 1.1 Network Perspective . 1 1.2 Notation and Terminology . 3 1.3 Historical Background on Network Science . 5 1.4 Social Networks and Social Outcomes . 7 1.5 Thesis Outline . 9 1.6 Publications . 12 2 Network Sampling . 13 2.1 Chapter Outline . 13 iii 2.2 Introduction . 13 2.3 Challenges of Sampling Oine Social Networks . 15 2.4 Fixed-choice Designs . 16 2.5 Chapter Summary . 17 3 Paper: Statistical Inference for Fixed-choice Design Incorporating Strong and Weak Ties . 18 4 Additional Simulation Results for Network Sampling . 52 4.1 Chapter Outline . 52 4.2 Introduction . 52 4.3 Transitivity . 53 4.4 Community Structure . 53 4.5 Popularity Bias . 54 4.6 Chapter Summary . 58 5 Introduction to Alter Sampling . 59 5.1 Chapter Outline . 59 5.2 Introduction . 59 5.3 Applications of Alter Sampling . 60 5.3.1 Acquaintance Vaccination . 60 5.3.2 Health Monitoring . 62 5.3.3 Information Diusion on Social Media . 63 5.3.4 Early Detection of Natural Disasters . 64 5.3.5 Health Intervention . 64 5.4 Chapter Summary . 65 6 Paper: Eectiveness of Alter Sampling in Various Social Networks .
    [Show full text]
  • 1 Peer Effects in Tournaments for Status
    Peer Effects in Tournaments for Status: Evidence from Rank Dynamics of U.S. Colleges and Universities* Noah S. Askin, University of Chicago, [email protected] Matthew S. Bothner, European School of Management and Technology, [email protected] Abstract Individual outcomes in tournaments for status result not only from participants’ own qualities and behaviors, but also from those of their most proximate peers. In this article, we take an alter-centric view of status dynamics, examining the effect of peers’ perceived quality on future changes in a focal organization’s status. Utilizing the yearly tournaments created by U.S. News & World Report’s rankings of national colleges and universities, two competing predictions are investigated. The first is that peer’ advances in perceived quality impair the future status of a focal school, reflecting inter-school competition for finite resources and rewards. The second is that peers’ improvements incite a focal school to make cosmetic or material adjustments, leading to an increase in its status. Peers are defined in two ways: by proximity in the prior iteration of the tournament and by network-based structural equivalence. Using fixed effects models predicting future changes in annual USN ranks, we observe opposing forces at work, depending on the type of peer exerting influence. When peers identified by prior rank proximity improve in perceived quality, they exert status-eroding effects on a focal school. Conversely, when structurally equivalent peers in the college-applicant market show improvement, the focal school subsequently increases in status. We examine the mechanisms responsible for this divergence by focusing on the bases of each type of peer-affiliation, presenting interaction effects that highlight the contextual conditions that shape the influence of peers on status change.
    [Show full text]
  • Contact Strength and the Friendship Paradox in Social Networks
    2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining Which friends are more popular than you? Contact strength and the friendship paradox in social networks James P. Bagrow1, Christopher M. Danforth1 and Lewis Mitchell2 Abstract—The friendship paradox states that in a social degree relates to many personality traits such as introver- network, egos tend to have lower degree than their alters, or, sion [13], and perceptual biases also factor into the effects “your friends have more friends than you do”. Most research of the friendship paradox [14]. All of these phenomena in has focused on the friendship paradox and its implications for information transmission, but treating the network as static and concert provide a deeper understanding of how social networks unweighted. Yet, people can dedicate only a finite fraction of form under cognitive and attentional limits, how information their attention budget to each social interaction: a high-degree can move between individuals via the network, and to what individual may have less time to dedicate to individual social extent the individual’s embedded view corresponds to the links, forcing them to modulate the quantities of contact made to actual network or information within it. their different social ties. Here we study the friendship paradox in the context of differing contact volumes between egos and alters, The friendship paradox along with the strength of weak ties finding a connection between contact volume and the strength play significant roles in how one is exposed to information of the friendship paradox. The most frequently contacted alters socially. By increasing the size of his or her social circle, an exhibit a less pronounced friendship paradox compared with the individual can gain access to a potentially overwhelming array ego, whereas less-frequently contacted alters are more likely to of information.
    [Show full text]
  • Reputation and Networks Studies in Reputation Effects and Network Formation
    Research Collection Doctoral Thesis Reputation and Networks Studies in Reputation Effects and Network Formation Author(s): Wehrli, Stefan Publication Date: 2014 Permanent Link: https://doi.org/10.3929/ethz-a-010432336 Rights / License: In Copyright - Non-Commercial Use Permitted This page was generated automatically upon download from the ETH Zurich Research Collection. For more information please consult the Terms of use. ETH Library DISS. ETH NO. 22476 Reputation and Networks Studies in Reputation Effects and Network Formation A thesis submitted to attain the degree of DOCTOR OF SCIENCES of ETH ZURICH (Dr. sc. ETH Zurich) presented by STEFAN WEHRLI lic. rer. soc., University of Bern born on October 13, 1974 citizen of Küttigen, AG Prof. Dr. Andreas Diekmann (examiner) Prof. Dr. Andreas Flache (co-examiner) Prof. Dr. Ryan Murphy (co-examiner) 2014 Acknowledgments This dissertation could not have been written without the support of many people. First of all, I would like to thank my advisor Andreas Diekmann for giving me the opportunity and the freedom to follow my curiosity. I am especially thankful to Andreas for creating an inspiring atmosphere and community that allowed me and my colleagues to pursue ideas that leave his center of gravity. He has been my advisor for many years now, but time and again his comments and ideas have offered new insights. Andreas has taught me the fundamentals of social theory, practical knowledge of empirical research, statistics, game theory, and also en- couraged me to pursue studies in social network analysis that I had the privilege to teach at the ETH for several years.
    [Show full text]
  • The Friendship Paradox and Systematic Biases in Perceptions and Social Norms
    The Friendship Paradox and Systematic Biases in Perceptions and Social Norms Matthew O. Jackson ∗ Revised: October 2016 Abstract The \friendship paradox" (Feld(1991)) states that, on average, people have strictly fewer friends than their friends do. This is an accounting identity that stems from the fact that people with more friends are counted as friends by more people. The point of this paper is that this over-representation of the most popular people in others' samples distorts perceptions of norms and amplifies resulting behaviors to match those perceived norms. I show that there are two things that drive people with more friends to behave more extremely than people with fewer friends and thus to distort the perceived norms. The first is that in any setting with strategic complementarities, people with more friends have more social interaction and hence more complementarity to their actions. The second is that people who have the most innate preference for a given activity form more relationships as they benefit most from the complementarities. As proven here, these two effects lead people with more friends to choose more extreme actions, which in turn feeds back via the friendship paradox to increase overall perceptions of behavior and then via complementarities to amplify average behavior. These theoretical results are consistent with the multitude of studies finding that students (from middle school through university) consistently overestimate peer consumption of alcohol, cigarettes, and drugs. This amplifies students' own behaviors, and can help explain problems with adolescent abuse of drugs and binge-drinking, as well as other behaviors. The analysis explains why policies that simply inform students of actual norms are effective in improving the accuracy of their perceptions and reducing behavior.
    [Show full text]
  • HZS C2BRNE DIARY – October 2020 1
    1 HZS C2BRNE DIARY – October 2020 www.cbrne-terrorism-newsletter.com 2 HZS C2BRNE DIARY – October 2020 HZS C2BRNE DIARY– 2020© October 2020 Website: www.cbrne-terrorism-newsletter.com Editor-in-Chief BrigGEN (ret.) Ioannis Galatas MD, MSc, MC (Army) PhD cand Consultant in Allergy & Clinical Immunology Medical/Hospital CBRNE Planner & Instructor Senior Asymmetric Threats Analyst Manager, CBRN Knowledge Center @ International CBRNE Institute (BE) Senior CBRN Consultant @ HotZone Solutions Group (NL) Athens, Greece Contact e-mail: [email protected] Editorial Team ⚫ Bellanca Giada, MD, MSc (Italy) ⚫ Hopmeier Michael, BSc/MSc MechEngin (USA) ⚫ Kiourktsoglou George, BSc, Dipl, MSc, MBA, PhD (UK) ⚫ Photiou Steve, MD, MSc EmDisaster (Italy) ⚫ Tarlow Peter, PhD Sociol (USA) A publication of HotZone Solutions Group Prinsessegracht 6, 2514 AN, The Hague, The Netherlands T: +31 70 262 97 04, F: +31 (0) 87 784 68 26 E-mail: [email protected] DISCLAIMER: The HZS C2BRNE DIARY® (former CBRNE-Terrorism Newsletter), is a free online publication for the fellow civilian/military CBRNE First Responders worldwide. The Diary is a collection of papers/articles related to the stated thematology. Relevant sources/authors are included and all info provided herein is from open Internet sources. Opinions and comments from the Editor, the Editorial Team or the authors publishing in the Diary do not necessarily represent those of the HotZone Solutions Group (NL) or the International CBRNE Institute (BE). COVER: UAE-based gynaecologist Dr Samer Cheaib shared
    [Show full text]
  • Investigating Social Networks with Agent Based Simulation and Link Prediction Methods
    Investigating Social Networks with Agent Based Simulation and Link Prediction Methods Angelico Giovanni Fetta School of Mathematics A thesis presented for the degree of Doctor of Philosophy 2014 Summary Social networks are increasingly being investigated in the context of individual behaviours. Research suggests that friendship connections have the ability to influence individual ac- tions, change personal opinions and subsequently impact upon personal wellbeing. This thesis aims to investigate the effects of social networks, through the use of Agent Based Simulation (ABS) and Link Prediction (LP) methods. Three main investigations form this thesis, culminating in the development of a new simulation-based approach to Link Predic- tion (PageRank-Max) and a model of behavioural spread through a connected population (Behavioural PageRank-Max). The first project investigates the suitability of ABS to explore a connected social system. The Peter Principle is a theory of managerial incompetence, having the potential to cause detrimental effects to system efficiency. Through the investigation of a theoretical hierar- chy of workplace social contacts, it is observed that the structure of a social network has the ability to impact system efficiency, demonstrating the importance of social network structure in conjunction with individual behaviours. The second project aims to further understand the structure of social networks, through the exploration of adolescent offline friendship data, taken from ‘A Stop Smoking in Schools Trial’ (ASSIST). An initial analysis of the data suggests certain factors may be pertinent in the formation of school social networks, identifying the importance of centrality mea- sures. An ABS aiming to predict the evolution of the ASSIST social networks is created, developing an algorithm based upon the optimisation of an individual’s eigen-centrality - termed PageRank-Max.
    [Show full text]
  • Identifying Prominent Actors in Social Networks
    Internal Report 2011–10 August 2011 Universiteit Leiden Opleiding Informatica Identifying Prominent Actors in Social Networks Iris Hupkens BACHELOR THESIS Leiden Institute of Advanced Computer Science (LIACS) Leiden University Niels Bohrweg 1 2333 CA Leiden The Netherlands Bachelor Thesis Identifying Prominent Actors in Social Networks Iris Hupkens August 9, 2011 Abstract In this thesis we investigate how to automatically identify the promi- nent actors in social networks, so that they can for instance be targeted for viral marketing. To do this, we look at characteristics of the friendship graph and apply data mining methods from the Weka toolkit. We compare the set of nodes resulting from these methods with a set of nodes that are known to represent celebrities. When only a group of people equal in size to the group of celebrities is considered to possibly be prominent, using only the degree centrality turns out to result in an accuracy larger than methods with a higher time complexity. The HITS algorithm is found to work better on social networks than looking at degree centrality when a larger group is considered to possibly be prominent. A Bayesian network using various qualities, such as the average degree of someone’s friends, can be used to nominate a set of nodes which contains even more celebrities. This implies that the method of looking at degree centrality to determine which actors are prominent is good enough when only a small group of people is to be selected, but that it starts losing some of its effectivity when the goal is to target the largest amount of prominent actors possi- ble.
    [Show full text]
  • Social Networks and Health: New Developments in Diffusion, Online and Offline
    SO45CH05_Centola ARjats.cls July 10, 2019 12:33 Annual Review of Sociology Social Networks and Health: New Developments in Diffusion, Online and Offline Jingwen Zhang1 and Damon Centola2 1Department of Communication, University of California, Davis, California 95616, USA 2Annenberg School for Communication, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA; email: [email protected] Annu. Rev. Sociol. 2019. 45:91–109 Keywords The Annual Review of Sociology is online at simple contagions, complex contagions, behavior change, online social soc.annualreviews.org networks, network interventions, diffusion https://doi.org/10.1146/annurev-soc-073117- 041421 Abstract Copyright © 2019 by Annual Reviews. Annu. Rev. Sociol. 2019.45:91-109. Downloaded from www.annualreviews.org The relationship between social networks and health encompasses every- Access provided by University of Pennsylvania on 09/30/19. For personal use only. All rights reserved thing from the flow of pathogens and information to the diffusion of beliefs and behaviors. This review addresses the vast and multidisciplinary litera- ture that studies social networks as a structural determinant of health. In particular, we report on the current state of knowledge on how social conta- gion dynamics influence individual and collective health outcomes. Wepay specific attention to research that leverages large-scale online data and social network experiments to empirically identify three broad classes of contagion processes: pathogenic diffusion, informational and belief diffusion, and be- havioral diffusion. We conclude by identifying the need for more research on (a) how multiple contagions interact within the same social network, (b) how online social networks impact offline health, andc ( ) the effective- ness of social network interventions for improving population health.
    [Show full text]