Fairness-Aware Network Link Prediction

Total Page:16

File Type:pdf, Size:1020Kb

Fairness-Aware Network Link Prediction The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) Bursting the Filter Bubble: Fairness-Aware Network Link Prediction Farzan Masrour, Tyler Wilson, Heng Yan, Pang-Ning Tan, Abdol-Hossein Esfahanian Michigan State University {masrours, wils1270, yanheng, ptan, esfahanian}@msu.edu Abstract exposed to online users. If left unchecked, the filter bubble may introduce systematic biases in the network data and its Link prediction is an important task in online social network- subsequent analysis. For instance, Hofstra et al. (Hofstra et ing as it can be used to infer new or previously unknown rela- tionships of a network. However, due to the homophily prin- al. 2017) examined the ethnic and gender diversity of social ciple, current algorithms are susceptible to promoting links relationships on Facebook and showed that those who have that may lead to increase segregation of the network—an ef- ample opportunities to befriend other similar users often find fect known as filter bubble. In this study, we examine the fil- themselves in highly segregated networks. This is due to the ter bubble problem from the perspective of algorithm fairness homophily principle (McPherson, Smith-Lovin, and Cook and introduce a dyadic-level fairness criterion based on net- 2001), which is the tendency of individuals to form social work modularity measure. We show how the criterion can be ties with other similar individuals in a network. As current utilized as a postprocessing step to generate more heteroge- algorithms are designed to promote links between similar neous links in order to overcome the filter bubble problem. users, their suggested links may exacerbate the user segre- In addition, we also present a novel framework that com- gation problem. bines adversarial network representation learning with super- vised link prediction to alleviate the filter bubble problem. In addition to online social networks, the filter bubble Experimental results conducted on several real-world datasets problem is also prevalent in recommender systems, which showed the effectiveness of the proposed methods compared can be viewed as a link prediction task applied to a bipar- to other baseline approaches, which include conventional link tite network of users and items. A recent study by Nguyen prediction and fairness-aware methods for i.i.d data. et al. (Nguyen et al. 2014) concluded that recommender sys- tems tend to expose users to “slightly narrowing set of items Introduction over time.” For example, in movie recommendation, movies from a certain genre may only be recommended to users Online social networking sites have transformed the way in- from a specific gender. By addressing the filter bubble prob- dividuals interact and share information with each other. The lem in network link prediction, the proposed method can wealth of social network data available also provide oppor- potentially be used to alleviate the filter bubble problem in tunities to mine them for a variety of business applications. other types of recommender systems. For example, businesses can learn about the users’ interests, This paper examines the filter bubble problem for network sentiment, and online behavior by analyzing the social net- link prediction from algorithm fairness perspective. Specif- work data. The insights gained from such analysis will help ically, we consider a link prediction algorithm to be unfair businesses to increase engagement with their existing cus- if it is biased towards promoting certain types of links (e.g., tomers or connect with new customers. Despite its impor- those between users with similar gender or other protected tance, recent studies have raised concerns about the poten- attributes). As a motivating example, consider the link pre- tial biases and unintended consequences that may arise from diction task on professional networking sites. Certain pro- such automated analysis. fessions, such as software engineering, tend to be dominated For example, link prediction methods (Liben-Nowell and by men, a fact that is likely to be reflected in the link struc- Kleinberg 2007; Al Hasan et al. 2006; Masrour et al. 2015; ture of the professional network. As a result, the links rec- 2018) are commonly employed by social networking sites ommended by the site may reinforce this gender-based seg- to encourage users to expand their social circles. “Suggested regation and primarily recommend links between individ- for you” on Instagram and “People you may know” on uals from the same gender while recommending compara- LinkedIn are two example applications of such methods. tively fewer inter-gender links. Though such a system may However, the rise of link prediction systems have led to an be able to achieve high link prediction accuracy, it may un- effect known as filter bubble (Pariser 2012), which is the re- fairly disadvantage some users. For example, a female soft- inforced segregation and narrowing diversity of information ware engineer may be treated unfairly as they are seldom Copyright c 2020, Association for the Advancement of Artificial recommended to other male software engineers. Intelligence (www.aaai.org). All rights reserved. Unfair practices due to the decisions generated by auto- 841 mated systems is a problem that has been well-documented is significantly fewer than the number of non-links. Recent in many application domains, including criminal justice, years have also witnessed the emergence of deep neural net- mortgage lending, and university admission. For example, work methods for the link prediction task (Li et al. 2014; ; Angwin et al. (Angwin et al. 2016) warned about the poten- Tian et al. 2014). These methods have been shown to achieve tial biases against African Americans in the software used state of the art performance. to predict the risk score of defendants who would likely re- Social networks are increasingly personalizing their con- offend again while O’Neil (O’Neil 2017) cautioned against tent using automated machine learning techniques, which is the manipulative marketing tactics used by for-profit col- a concern as the decisions may lead to adverse effects on the leges in online advertising that exploit vulnerable popula- users. This is due to the so-called “filter bubble” or “echo tions. These concerns have brought increasing scrutiny into chamber” effect (Hofstra et al. 2017; Pariser 2012) in which the issue of fairness in machine learning algorithms. Despite individuals are increasingly isolated to consuming only in- their growing research, existing works are primarily focused formation that conform to their own belief system. In on- on independent and identically distributed (i.i.d) data, and line social networks, the effect of filter bubble is exemplified thus, may not be suitable for link analysis problems. For ex- by the recommendation decisions generated using link pre- ample, previous works have considered the notion of fair- diction algorithms. As link prediction algorithms are com- ness either at individual (Dwork et al. 2012) or group (Hardt monly used to encourage users to expand their networks, et al. 2016; Feldman et al. 2015) level. In contrast, this paper this may lead to adverse consequences such as segregation examines the notion of fairness at a dyadic-level, based on of users (Hofstra et al. 2017; Nguyen et al. 2014). the pairwise interactions between users in a social network. Quantifying fairness has been a subject of intense de- Furthermore, previous approaches have considered fairness bate among AI and ML researchers in recent years (Berk in terms of the unjust decisions against members of a spe- et al. 2018; Dwork et al. 2012; Hardt et al. 2016; Kus- cific underrepresented (protected) group. Instead, we con- ner et al. 2017). Previous works are primarily focused on sider fairness in terms of promoting inter-group connections non-relational data and can be classified into two types— in a network in order to alleviate the filter bubble problem. individual-level or group-level fairness. Fairness definition There are four major contributions of this paper. First, at individual level is based on the premise that similar we empirically assess the influence of protected attributes people should be treated similarly. For example, Dwork et such as gender on the link structure of a network by mea- al. (Dwork et al. 2012) defined a task-specific metric based suring the homophily effect on several real-world network on a probabilistic distance measure between individuals via datasets. Second, we introduce modred as a fairness crite- a Lipschitz condition. The metric is used as constraints to rion for network link prediction. The metric is inspired by optimize a fairness-aware classifier. In contrast, the group- the well-known modularity measure (Newman and Girvan level approach quantifies fairness in terms of statistical mea- 2004) developed for network community detection. We con- sures such as demographic parity, equalized odds (Hardt et sider the reduction in modularity measure as a way to deter- al. 2016) or balanced error rate (Feldman et al. 2015) with mine whether the links predicted by an algorithm may lead respect to the protected groups. The measures are typically to further segregation of the network. We then illustrate how computed from a confusion matrix (Berk et al. 2018) and the measure can be incorporated into a greedy algorithm are used to ensure that the average performance do not vary for postprocessing the results of current link prediction al- significantly among different groups of a protected attribute. gorithms. Finally, we present a novel Fairness-aware LInk In addition, there has been growing literature on develop- Prediction (FLIP) framework that combines adversarial net- ing fairness-aware methods. Current methods can be divided work representation learning with supervised link prediction into three categories. The first category includes prepossess- to mitigate the filter bubble problem. ing algorithms (Zemel et al. 2013; Madras et al. 2018) with the motivation that training data is the main cause of bias Related Work in machine learning.
Recommended publications
  • Discover the Golden Paths, Unique Sequences and Marvelous Associations out of Your Big Data Using Link Analysis in SAS® Enterprise Miner TM
    MWSUG 2016 – Paper AA04 Discover the golden paths, unique sequences and marvelous associations out of your big data using Link Analysis in SAS® Enterprise Miner TM Delali Agbenyegah, Alliance Data Systems, Columbus, OH Candice Zhang, Alliance Data Systems, Columbus, OH ABSTRACT The need to extract useful information from large amount of data to positively influence business decisions is on the rise especially with the hyper expansion of retail data collection and storage and the advancement in computing capabilities. Many enterprises now have well established databases to capture Omni channel customer transactional behavior at the product or Store Keeping Unit (SKU) level. Crafting a robust analytical solution that utilizes these rich transactional data sources to create customized marketing incentives and product recommendations in a timely fashion to meet the expectations of the sophisticated shopper in our current generation can be daunting. Fortunately, the Link Analysis node in SAS® Enterprise Miner TM provides a simple but yet powerful analytical tool to extract, analyze, discover and visualize the relationships or associations (links) and sequences between items in a transactional data set up and develop item-cluster induced segmentation of customers as well as next-best offer recommendations. In this paper, we discuss the basic elements of Link Analysis from a statistical perspective and provide a real life example that leverages Link Analysis within SAS Enterprise Miner to discover amazing transactional paths, sequences and links. INTRODUCTION A financial fraud investigator may be interested in exploring the relationship between the financial transactions of suspicious customers, the BNI may want to analyze the social network of individuals identified as terrorists to conduct further investigation or a medical doctor may be interested in understanding the association between different medical treatments on patients and their corresponding results.
    [Show full text]
  • Networks in Nature: Dynamics, Evolution, and Modularity
    Networks in Nature: Dynamics, Evolution, and Modularity Sumeet Agarwal Merton College University of Oxford A thesis submitted for the degree of Doctor of Philosophy Hilary 2012 2 To my entire social network; for no man is an island Acknowledgements Primary thanks go to my supervisors, Nick Jones, Charlotte Deane, and Mason Porter, whose ideas and guidance have of course played a major role in shaping this thesis. I would also like to acknowledge the very useful sug- gestions of my examiners, Mark Fricker and Jukka-Pekka Onnela, which have helped improve this work. I am very grateful to all the members of the three Oxford groups I have had the fortune to be associated with: Sys- tems and Signals, Protein Informatics, and the Systems Biology Doctoral Training Centre. Their companionship has served greatly to educate and motivate me during the course of my time in Oxford. In particular, Anna Lewis and Ben Fulcher, both working on closely related D.Phil. projects, have been invaluable throughout, and have assisted and inspired my work in many different ways. Gabriel Villar and Samuel Johnson have been col- laborators and co-authors who have helped me to develop some of the ideas and methods used here. There are several other people who have gener- ously provided data, code, or information that has been directly useful for my work: Waqar Ali, Binh-Minh Bui-Xuan, Pao-Yang Chen, Dan Fenn, Katherine Huang, Patrick Kemmeren, Max Little, Aur´elienMazurie, Aziz Mithani, Peter Mucha, George Nicholson, Eli Owens, Stephen Reid, Nico- las Simonis, Dave Smith, Ian Taylor, Amanda Traud, and Jeffrey Wrana.
    [Show full text]
  • Modularity in Static and Dynamic Networks
    Modularity in static and dynamic networks Sarah F. Muldoon University at Buffalo, SUNY OHBM – Brain Graphs Workshop June 25, 2017 Outline 1. Introduction: What is modularity? 2. Determining community structure (static networks) 3. Comparing community structure 4. Multilayer networks: Constructing multitask and temporal multilayer dynamic networks 5. Dynamic community structure 6. Useful references OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Introduction: What is Modularity? OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon What is Modularity? Modularity (Community Structure) • A module (community) is a subset of vertices in a graph that have more connections to each other than to the rest of the network • Example social networks: groups of friends Modularity in the brain: • Structural networks: communities are groups of brain areas that are more highly connected to each other than the rest of the brain • Functional networks: communities are groups of brain areas with synchronous activity that is not synchronous with other brain activity OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The Brain is Modular Structural networks: cortical thickness correlations sensorimotor/spatial strategic/executive mnemonic/emotion olfactocentric auditory/language visual processing Chen et al. (2008) Cereb Cortex OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The Brain is Modular • Functional networks: resting state fMRI He et al. (2009) PLOS One OHBM 2017 – Brain Graphs Workshop – Sarah F. Muldoon Findings: The
    [Show full text]
  • Link Analysis Using SAS Enterprise Miner
    Link Analysis Using SAS® Enterprise Miner™ Ye Liu, Taiyeong Lee, Ruiwen Zhang, and Jared Dean SAS Institute Inc. ABSTRACT The newly added Link Analysis node in SAS® Enterprise MinerTM visualizes a network of items or effects by detecting the linkages among items in transactional data or the linkages among levels of different variables in training data or raw data. This node also provides multiple centrality measures and cluster information among items so that you can better understand the linkage structure. In addition to the typical linkage analysis, the node also provides segmentation that is induced by the item clusters, and uses weighted confidence statistics to provide next-best-offer lists for customers. Examples that include real data sets show how to use the SAS Enterprise Miner Link Analysis node. INTRODUCTION Link analysis is a popular network analysis technique that is used to identify and visualize relationships (links) between different objects. The following questions could be nontrivial: Which websites link to which other ones? What linkage of items can be observed from consumers’ market baskets? How is one movie related to another based on user ratings? How are different petal lengths, width, and color linked by different, but related, species of flowers? How are specific variable levels related to each other? These relationships are all visible in data, and they all contain a wealth of information that most data mining techniques cannot take direct advantage of. In today’s ever-more-connected world, understanding relationships and connections is critical. Link analysis is the data mining technique that addresses this need. In SAS Enterprise Miner, the new Link Analysis node can take two kinds of input data: transactional data and non- transactional data (training data or raw data).
    [Show full text]
  • Evolving Networks and Social Network Analysis Methods And
    DOI: 10.5772/intechopen.79041 ProvisionalChapter chapter 7 Evolving Networks andand SocialSocial NetworkNetwork AnalysisAnalysis Methods and Techniques Mário Cordeiro, Rui P. Sarmento,Sarmento, PavelPavel BrazdilBrazdil andand João Gama Additional information isis available atat thethe endend ofof thethe chapterchapter http://dx.doi.org/10.5772/intechopen.79041 Abstract Evolving networks by definition are networks that change as a function of time. They are a natural extension of network science since almost all real-world networks evolve over time, either by adding or by removing nodes or links over time: elementary actor-level network measures like network centrality change as a function of time, popularity and influence of individuals grow or fade depending on processes, and events occur in net- works during time intervals. Other problems such as network-level statistics computation, link prediction, community detection, and visualization gain additional research impor- tance when applied to dynamic online social networks (OSNs). Due to their temporal dimension, rapid growth of users, velocity of changes in networks, and amount of data that these OSNs generate, effective and efficient methods and techniques for small static networks are now required to scale and deal with the temporal dimension in case of streaming settings. This chapter reviews the state of the art in selected aspects of evolving social networks presenting open research challenges related to OSNs. The challenges suggest that significant further research is required in evolving social networks, i.e., existent methods, techniques, and algorithms must be rethought and designed toward incremental and dynamic versions that allow the efficient analysis of evolving networks. Keywords: evolving networks, social network analysis 1.
    [Show full text]
  • Multidimensional Network Analysis
    Universita` degli Studi di Pisa Dipartimento di Informatica Dottorato di Ricerca in Informatica Ph.D. Thesis Multidimensional Network Analysis Michele Coscia Supervisor Supervisor Fosca Giannotti Dino Pedreschi May 9, 2012 Abstract This thesis is focused on the study of multidimensional networks. A multidimensional network is a network in which among the nodes there may be multiple different qualitative and quantitative relations. Traditionally, complex network analysis has focused on networks with only one kind of relation. Even with this constraint, monodimensional networks posed many analytic challenges, being representations of ubiquitous complex systems in nature. However, it is a matter of common experience that the constraint of considering only one single relation at a time limits the set of real world phenomena that can be represented with complex networks. When multiple different relations act at the same time, traditional complex network analysis cannot provide suitable an- alytic tools. To provide the suitable tools for this scenario is exactly the aim of this thesis: the creation and study of a Multidimensional Network Analysis, to extend the toolbox of complex network analysis and grasp the complexity of real world phenomena. The urgency and need for a multidimensional network analysis is here presented, along with an empirical proof of the ubiquity of this multifaceted reality in different complex networks, and some related works that in the last two years were proposed in this novel setting, yet to be systematically defined. Then, we tackle the foundations of the multidimensional setting at different levels, both by looking at the basic exten- sions of the known model and by developing novel algorithms and frameworks for well-understood and useful problems, such as community discovery (our main case study), temporal analysis, link prediction and more.
    [Show full text]
  • A Network Approach to Define Modularity of Components In
    A Network Approach to Define Modularity of Components Manuel E. Sosa1 Technology and Operations Management Area, in Complex Products INSEAD, 77305 Fontainebleau, France Modularity has been defined at the product and system levels. However, little effort has e-mail: [email protected] gone into defining and quantifying modularity at the component level. We consider com- plex products as a network of components that share technical interfaces (or connections) Steven D. Eppinger in order to function as a whole and define component modularity based on the lack of Sloan School of Management, connectivity among them. Building upon previous work in graph theory and social net- Massachusetts Institute of Technology, work analysis, we define three measures of component modularity based on the notion of Cambridge, Massachusetts 02139 centrality. Our measures consider how components share direct interfaces with adjacent components, how design interfaces may propagate to nonadjacent components in the Craig M. Rowles product, and how components may act as bridges among other components through their Pratt & Whitney Aircraft, interfaces. We calculate and interpret all three measures of component modularity by East Hartford, Connecticut 06108 studying the product architecture of a large commercial aircraft engine. We illustrate the use of these measures to test the impact of modularity on component redesign. Our results show that the relationship between component modularity and component redesign de- pends on the type of interfaces connecting product components. We also discuss direc- tions for future work. ͓DOI: 10.1115/1.2771182͔ 1 Introduction The need to measure modularity has been highlighted implicitly by Saleh ͓12͔ in his recent invitation “to contribute to the growing Previous research on product architecture has defined modular- field of flexibility in system design” ͑p.
    [Show full text]
  • On Some Aspects of Link Analysis and Informal Network in Social Network Platform
    Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320–088X IJCSMC, Vol. 2, Issue. 7, July 2013, pg.371 – 377 RESEARCH ARTICLE On Some Aspects of Link Analysis and Informal Network in Social Network Platform Subrata Paul Department of Computer Science and Engineering, M.I.T.S. Rayagada, Odisha 765017, INDIA [email protected] Abstract— This paper presents a review on the two important aspects of Social Network, namely Link Analysis and Informal Network. Both of these characteristic plays a vital role in analysis of direction of information flow and reliability of the information passed or received. They can be easily visualized or can be studied using the concepts of graph theory. We have deliberately omitted discussing about general definitions of social network and representing the relationship between actors as a graph with nodes and edges. This paper starts with a formal definition of Informal Network and continues with some of its major aspects. Moreover its application is explained with a real life example of a college scenario. In the later part of the paper, some aspects of Informal Network have been presented. The similar kind of real life scenario is also being drawn out here to represent application of informal network. Lastly, this paper ends with a general conclusion on these two topics. Key Terms: - Social Network; Link Analysis; Information Overload; Formal Network; Informal Network I. INTRODUCTION Link analysis is a data-analysis technique used to evaluate relationships (connections) between nodes. Relationships may be identified among various types of nodes (objects), including organizations, people and transactions.
    [Show full text]
  • Task-Dependent Evolution of Modularity in Neural Networks1
    Task-dependent evolution of modularity in neural networks1 Michael Husk¨ en, Christian Igel, and Marc Toussaint Institut fur¨ Neuroinformatik, Ruhr-Universit¨at Bochum, 44780 Bochum, Germany Telephone: +49 234 32 25558, Fax: +49 234 32 14209 fhuesken,igel,[email protected] Connection Science Vol 14, No 3, 2002, p. 219-229 Abstract. There exist many ideas and assumptions about the development and meaning of modu- larity in biological and technical neural systems. We empirically study the evolution of connectionist models in the context of modular problems. For this purpose, we define quantitative measures for the degree of modularity and monitor them during evolutionary processes under different constraints. It turns out that the modularity of the problem is reflected by the architecture of adapted systems, although learning can counterbalance some imperfection of the architecture. The demand for fast learning systems increases the selective pressure towards modularity. 1 Introduction The performance of biological as well as technical neural systems crucially depends on their ar- chitectures. In case of a feed-forward neural network (NN), architecture may be defined as the underlying graph constituting the number of neurons and the way these neurons are connected. Particularly one property of architectures, modularity, has raised a lot of interest among researchers dealing with biological and technical aspects of neural computation. It appears to be obvious to emphasise modularity in neural systems because the vertebrate brain is highly modular both in an anatomical and in a functional sense. It is important to stress that there are different concepts of modules. `When a neuroscientist uses the word \module", s/he is usually referring to the fact that brains are structured, with cells, columns, layers, and regions which divide up the labour of information processing in various ways' 1This paper is a revised and extended version of the GECCO 2001 Late-Breaking Paper by Husk¨ en, Igel, & Toussaint (2001).
    [Show full text]
  • Analyzing Social Media Network for Students in Presidential Election 2019 with Nodexl
    ANALYZING SOCIAL MEDIA NETWORK FOR STUDENTS IN PRESIDENTIAL ELECTION 2019 WITH NODEXL Irwan Dwi Arianto Doctoral Candidate of Communication Sciences, Airlangga University Corresponding Authors: [email protected] Abstract. Twitter is widely used in digital political campaigns. Twitter as a social media that is useful for building networks and even connecting political participants with the community. Indonesia will get a demographic bonus starting next year until 2030. The number of productive ages that will become a demographic bonus if not recognized correctly can be a problem. The election organizer must seize this opportunity for the benefit of voter participation. This study aims to describe the network structure of students in the 2019 presidential election. The first debate was held on January 17, 2019 as a starting point for data retrieval on Twitter social media. This study uses data sources derived from Twitter talks from 17 January 2019 to 20 August 2019 with keywords “#pilpres2019 OR #mahasiswa since: 2019-01-17”. The data obtained were analyzed by the communication network analysis method using NodeXL software. Our Analysis found that Top Influencer is @jokowi, as well as Top, Mentioned also @jokowi while Top Tweeters @okezonenews and Top Replied-To @hasmi_bakhtiar. Jokowi is incumbent running for re-election with Ma’ruf Amin (Senior Muslim Cleric) as his running mate against Prabowo Subianto (a former general) and Sandiaga Uno as his running mate (former vice governor). This shows that the more concentrated in the millennial generation in this case students are presidential candidates @jokowi. @okezonenews, the official twitter account of okezone.com (MNC Media Group).
    [Show full text]
  • Ensemble-Based Community Detection in Multilayer Networks
    ENSEMBLE-BASED COMMUNITY DETECTION IN MULTILAYER NETWORKS Andrea Tagarelli, Alessia Amelio, Francesco Gullo The 2017 European Conference on Machine Learning & Principles and Practice of Knowledge Discovery in DataBases Experimental evaluation Datasets • Our experimental evaluation was mainly conducted on seven real-world multilayer network datasets Experimental evaluation Datasets • We also resorted to a synthetic multilayer network generator, mLFR Benchmark, mainly for our evaluation of efficiency of the M-EMCD method • We used mLFR to create a multilayer network with 1 million of nodes, setting other available parameters as follows: • 10 layers, • average degree 30, • maximum degree 100, • mixing at 20% , • layer mixing 2. Experimental evaluation Competing methods • flattening methods • apply a community detection method on the flattened graph of the input multilayer network • it is a weighted multigraph having V as set of nodes, the set of edges, and edge weights that express the number of layers on which two nodes are connected • Nerstrand algorithm1 1 D. LaSalle and G. Karypis, "Multi-threaded modularity based graph clustering using the multilevel paradigm", J. Parallel Distrib. Comput., 76:66–80, 2015. Experimental evaluation Competing methods • aggregation methods • detect a community structure separately for each network layer, after that an aggregation mechanism is used to obtain the final community structure • Principal Modularity Maximization (PMM)2 • frequent pAttern mining-BAsed Community discoverer in mUltidimensional networkS (ABACUS)3 2 L. Tang, X. Wang, and H. Liu, “Uncovering groups via heterogeneous interaction analysis,” in Proc. ICDM, 2009, pp. 503–512. 3 M. Berlingerio, F. Pinelli, and F. Calabrese, "ABACUS: frequent pattern mining-based community discovery in multidimensional networks", Data Min.
    [Show full text]
  • On Community Structure in Complex Networks: Challenges and Opportunities
    On community structure in complex networks: challenges and opportunities Hocine Cherifi · Gergely Palla · Boleslaw K. Szymanski · Xiaoyan Lu Received: November 5, 2019 Abstract Community structure is one of the most relevant features encoun- tered in numerous real-world applications of networked systems. Despite the tremendous effort of a large interdisciplinary community of scientists working on this subject over the past few decades to characterize, model, and analyze communities, more investigations are needed in order to better understand the impact of community structure and its dynamics on networked systems. Here, we first focus on generative models of communities in complex networks and their role in developing strong foundation for community detection algorithms. We discuss modularity and the use of modularity maximization as the basis for community detection. Then, we follow with an overview of the Stochastic Block Model and its different variants as well as inference of community structures from such models. Next, we focus on time evolving networks, where existing nodes and links can disappear, and in parallel new nodes and links may be introduced. The extraction of communities under such circumstances poses an interesting and non-trivial problem that has gained considerable interest over Hocine Cherifi LIB EA 7534 University of Burgundy, Esplanade Erasme, Dijon, France E-mail: hocine.cherifi@u-bourgogne.fr Gergely Palla MTA-ELTE Statistical and Biological Physics Research Group P´azm´any P. stny. 1/A, Budapest, H-1117, Hungary E-mail: [email protected] Boleslaw K. Szymanski Department of Computer Science & Network Science and Technology Center Rensselaer Polytechnic Institute 110 8th Street, Troy, NY 12180, USA E-mail: [email protected] Xiaoyan Lu Department of Computer Science & Network Science and Technology Center Rensselaer Polytechnic Institute 110 8th Street, Troy, NY 12180, USA E-mail: [email protected] arXiv:1908.04901v3 [physics.soc-ph] 6 Nov 2019 2 Cherifi, Palla, Szymanski, Lu the last decade.
    [Show full text]