Graphlet Decomposition of a Weighted Network

Total Page:16

File Type:pdf, Size:1020Kb

Graphlet Decomposition of a Weighted Network Graphlet decomposition of a weighted network Hossein Azari Edoardo M. Airoldi Harvard University Harvard University Abstract Λ is related to models for binary networks based on the singular value decomposition and other factoriza- We introduce the graphlet decomposition of tions (Hoff, 2009; Kim and Leskovec, 2010). We aban- a weighted network, which encodes a notion don the popular orthogonality constraint (Jiang et al., of social information based on social struc- 2011) among the basis matrices Pis to attain inter- ture. We develop a scalable algorithm, which pretability in terms of multi-scale social structure, and combines EM with Bron-Kerbosch in a novel we chose not model zero edge weights to enable the es- fashion, for estimating the parameters of the timation to scale linearly with the number of positive model underlying graphlets using one net- weights (Leskovec et al., 2010). These choices lead work sample. We explore theoretical prop- to a non-trivial inferential setting (Airoldi and Haas, erties of graphlets, including computational 2011). Related methods utilize a network to encode in- complexity, redundancy and expected accu- ferred dependence among multivariate data (Coifman racy. We test graphlets on synthetic data, and Maggioni, 2006; Lee et al., 2008). We term our and we analyze messaging on Facebook and method “graphlet decomposition”, as it is reminiscent crime associations in the 19th century. of a wavelet decomposition of a graph. An unrelated literature uses the term graphlets to denote network motifs (Przulj et al., 2006; Kondor et al., 2009). 1 Introduction The key features of the graphlet decomposition we in- troduce in this paper are: the basis matrices Pis are In recent years there has been a surge of interest in col- non-orthogonal, but capture overlapping communities laborative projects similar to Wikipedia, social media at multiple scales; the information used for inference platforms such as Facebook and Twitter, and services comes exclusively from positive weights; parameter es- that rely on the social structure underlying these ser- timation is linear in the number of positive weights. vices, including cnn.com “popular on Facebook” and The basic idea is to posit a Poisson model for the edge nytimes.com “most emailed”. Efforts in the computa- weights P (Y Λ) and parametrize the rate matrix Λ in tional social sciences have begun studying patterns of terms of a binary| factor matrix B. The binary factors behavior that result in organized social structure and can be interpreted as latent features that induce social interactions (e.g., see Lazer et al., 2009). Here, we de- structure through homophily. The factor matrix B al- velop a new tool to analyze data about social structure lows for an equivalent interpretation as basis matrices, and interactions routinely collected in this context. which define overlapping cliques of different sizes. In- ference is carried out in two stages: first we identify There is a rich literature of statistical models to ana- candidate basis matrices using Bron-Kerbosch, then lyze binary interactions or networks (Goldenberg et al., we estimate coefficients using EM. The computational 2010). Arguably, however, only a few of them may complexity of the inference is addressed in Section 3.2. be fit to weighted networks. Our approach is to de- compose the graphon Λ, which defines an exchange- able model (Kallenberg, 2005) for integer-valued mea- 2 Graphlet decomposition and model surements of pairs of individuals P (Y Λ), in terms of a number of basis matrices that grows| with the size Consider observing an undirected weighted network, encoded by a symmetric adjacency matrix with inte- of the network, Λ = i µiPi. The factorization of ger entries and diagonal elements equal to zero. Intu- Appearing in ProceedingsP of the 15th International Con- itively, we want to posit a data generating process that ference on Artificial Intelligence and Statistics (AISTATS) can explain edge weights in terms of social information, 2012, La Palma, Canary Islands. Volume XX of JMLR: quantified by community structure at multiple scales W&CP XX. Copyright 2012 by the authors. and possibly overlapping. We choose to represent com- 54 Graphlet decomposition of a weighted network munities in terms of their constituent maximal cliques. We develop a two-stage estimation algorithm. First, we identify a candidate set of Kc basis elements, using Below, we avoid notation whose sole purpose is to set the Bron-Kerbosch algorithm, and obtain a candidate to zero diagonal elements of the resulting matrices. basis matrix Bc. Second, we develop an expectation- Definition 1. The graphlet decomposition (GD) of a maximization (EM) algorithm to estimate the corre- matrix Λ with non-negative entries λ is defined as ij sponding coefficients µ1:Kc ; several of these coefficients Λ = BWB0, where B is an N K binary matrix, W will vanish, thus selecting a final set of basis elements. is a K K diagonal matrix. Explicitly,× we have × We study theoretical properties of this estimation al- gorithm in Section3. Figure1 illustrates the steps of λ11 λ12 . λ1N µ1 0 ... 0 the estimation process on a toy network. λ21 λ22 . λ2N 0 µ2 ... 0 . = B . B0 . .. .. 2.1.1 Identifying candidate basis elements λ λ . λ 0 0 . µ N1 N2 NN K The algorithm to identify the candidate basis elements where λii is set to zero and µi is positive for each i. proceeds by thresholding the observed network Y at a The basis matrix B can be interpreted in terms over- number of levels, t = max(Y ) ... min(Y ), and by iden- lapping communities at multiple scales. To see this, tifying all the maximal cliques in the corresponding sequence of binary networks, Y (t) = 1(Y t), us- denote the i-th column of the matrix B as b i. We can ≥ re-write Λ = K µ P , where each P is an· N N ing the Bron-Kerbosch algorithm (Bron and Kerbosch, i=1 i i i c matrix defined as P = b b , in which the diagonal× is 1973). The resulting cumulative collection of K max- i i 0i (t) set to zero. WeP refer to·P · as basis elements. Think imal cliques found in the sequence of networks Y i c of the i-th column of B as encoding a community of is then turned into candidate basis elements, Pi , for i = 1 ...Kc. Algorithm1 details this procedure. interest; then the vector b i specifies which individuals · are members of that community, and the basis element Bc = empty basis set Pi is the binary graph that specifies the connections For t = max(Y ) to min(Y ) among the member of that community. The same in- Y (t) = 1(Y t) dividual may participate to multiple communities, and C = maximal≥ cliques(Y (t)) using Bron-Kerbosch the communities may span different scales. Bc = Bc C ∪ c The model for an observed network Y is then Algorithm 1: Estimate a basis matrix, B , encoding candidate elements, P c, from a weighted network, Y . K Y Poisson+ ( i=1 µi Pi). (1) ∼ Algorithm1 allows us to avoid dealing with permuta- This is essentially a binaryP factor model with a trun- tions of the input adjacency matrix Y by inferring the cated Poisson link and two important nuances. Stan- basis elements Pi independently of such permutation. dard factor models assume that entire rows of the ma- trix Y are conditionally independent, even when the In Section 3.1 we show that, under certain conditions matrix is square and row and column i refer to the on the true underlying basis matrix B, Algorithm1 same unit of analysis. In contrast, our model treats finds a set of candidate basis elements that contains the true set of basis elements. In addition, in Section the individual random variables yij as exchangeable, and imposes the restriction that positives entries in the 3.2 we show that, under the same conditions on B and factors map (in some way) to maximal cliques in the additional realistic assumptions, we expect to have a observed network. In addition, the matrix Y is sparse; number of candidate basis elements of the same order of magnitude of the true number of basis elements, our model implies that zero edge weights carry no in- c formation about maximal cliques, and thus are not K = O(K), as the network size N grows, given that relevant for estimation. These nuances—map between the maximum weight is stationary, max(Y ) = O(1). cliques and factors, and information from positives en- tries only—produce a new and interesting inferential 2.1.2 Sparse Poisson deconvolution setting that scales to large weighted networks. Given the candidate set of basis elements encoded in Bc, we develop an EM algorithm to estimate the corre- 2.1 Inference sponding coefficients µ1:Kc . Under certain conditions on the true basis matrix B, this algorithm consistently Computing the graphlet decomposition of a network estimates the coefficients by zeroing out the coefficients Y involves the estimation of a few critical parameters: associated with the unnecessary basis elements. the number of basis elements K, the basis matrix B c or equivalently the basis elements P1:K , and the coef- Recall that we have K candidate basis elements c c ficients associated with these basis elements µ1:K . P1:Kc . Let’s define a set of statistics Tk = ij Pk,ij 55 P Azari & Airoldi 3 3 thres = 4 μ = 3.0 3 1 2 2 3 2 2 μ =0.0 1 4 thres = 3 2 4 2 1 4 μ 3 =2.0 1 thres = 2 1 1 1 1 Network Generation μ 4 =1.0 Final Basis 1 thres = 1 Maximal Cliques Candidate Basis Clique Components Setting Thresholds Bron Kerboshch Expectation Maximization Figure 1: An illustration of the two-stage estimation algorithm for the graphlet decomposition.
Recommended publications
  • Evolving Networks and Social Network Analysis Methods And
    DOI: 10.5772/intechopen.79041 ProvisionalChapter chapter 7 Evolving Networks andand SocialSocial NetworkNetwork AnalysisAnalysis Methods and Techniques Mário Cordeiro, Rui P. Sarmento,Sarmento, PavelPavel BrazdilBrazdil andand João Gama Additional information isis available atat thethe endend ofof thethe chapterchapter http://dx.doi.org/10.5772/intechopen.79041 Abstract Evolving networks by definition are networks that change as a function of time. They are a natural extension of network science since almost all real-world networks evolve over time, either by adding or by removing nodes or links over time: elementary actor-level network measures like network centrality change as a function of time, popularity and influence of individuals grow or fade depending on processes, and events occur in net- works during time intervals. Other problems such as network-level statistics computation, link prediction, community detection, and visualization gain additional research impor- tance when applied to dynamic online social networks (OSNs). Due to their temporal dimension, rapid growth of users, velocity of changes in networks, and amount of data that these OSNs generate, effective and efficient methods and techniques for small static networks are now required to scale and deal with the temporal dimension in case of streaming settings. This chapter reviews the state of the art in selected aspects of evolving social networks presenting open research challenges related to OSNs. The challenges suggest that significant further research is required in evolving social networks, i.e., existent methods, techniques, and algorithms must be rethought and designed toward incremental and dynamic versions that allow the efficient analysis of evolving networks. Keywords: evolving networks, social network analysis 1.
    [Show full text]
  • Package 'Qgraph'
    Package ‘qgraph’ January 28, 2021 Type Package Title Graph Plotting Methods, Psychometric Data Visualization and Graphical Model Estimation Version 1.6.9 Maintainer Sacha Epskamp <[email protected]> Depends R (>= 3.0.0) Imports Rcpp (>= 1.0.0), methods, grDevices, psych, lavaan, plyr, Hmisc, igraph, jpeg, png, colorspace, Matrix, corpcor, reshape2, ggplot2, glasso, fdrtool, gtools, parallel, pbapply, abind, dplyr ByteCompile yes Description Weighted network visualization and analysis, as well as Gaussian graphical model com- putation. See Epskamp et al. (2012) <doi:10.18637/jss.v048.i04>. BugReports https://github.com/SachaEpskamp/qgraph License GPL-2 LazyLoad yes LinkingTo Rcpp Suggests BDgraph, huge NeedsCompilation yes Author Sacha Epskamp [aut, cre], Giulio Costantini [aut], Jonas Haslbeck [aut], Adela Isvoranu [aut], Angelique O. J. Cramer [ctb], Lourens J. Waldorp [ctb], Verena D. Schmittmann [ctb], Denny Borsboom [ctb] Repository CRAN Date/Publication 2021-01-28 20:20:02 UTC 1 2 as.igraph.qgraph R topics documented: as.igraph.qgraph . .2 averageLayout . .3 big5.............................................4 big5groups . .4 centrality . .5 centrality and clustering plots . .7 centrality_auto . .8 clustcoef_auto . 10 cor_auto . 12 EBICglasso . 14 FDRnetwork . 16 flow ............................................. 18 getWmat . 19 ggmFit . 20 ggmModSelect . 21 makeBW . 23 mat2vec . 24 mutualInformation . 25 pathways . 25 plot.qgraph . 26 print.qgraph . 27 qgraph . 28 qgraph.animate . 45 qgraph.layout.fruchtermanreingold . 49 qgraph.loadings
    [Show full text]
  • Sparsity of Weighted Networks: Measures and Applications
    Sparsity of weighted networks: measures and applications Swati Goswamiab*, Asit K. Dasa, Subhas C. Nandyb aDepartment of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah – 711103, India bAdvanced Computing and Microelectronics Unit, Indian Statistical Institute, Kolkata, India, Pin 700108 Abstract A majority of real-life networks are weighted and sparse. The present article aims at characterization of weighted networks based on sparsity, as an indicator of inherent diversity of different network parameters. The measure called sparsity index defined on ordered degree sequence of simple networks is extended and new properties of this index are derived. The range of possible values of sparsity index of any connected network, with edge-count in specific intervals, is worked out analytically in terms of node-count and a pattern is uncovered in corresponding degree sequences. Given the edge-weight frequency distribution of a network, an expression of the sparsity index of edge- weights is formulated. Its properties are analyzed under different distributions of edge-weights. For example, the upper and lower bounds of sparsity index of edge-weights of a network, with all distinct edge-weights, is determined in terms of its node-count and edge-density. The article highlights that this summary index with low computational cost, computed on different network parameters, is useful to reveal different structural and organizational aspects of networks for performing analysis. An application of this index is demonstrated through devising a new overlapping community detection method. The results validated on artificial and real-world networks show its efficacy. Keywords Sparsity Index, Sparsity Measure, Gini Index, Edge Weight Distribution, Overlapping Community Detection, Community Detection in Weighted Networks 1.
    [Show full text]
  • An Extended Correlation Dimension of Complex Networks
    entropy Article An Extended Correlation Dimension of Complex Networks Sheng Zhang 1,*, Wenxiang Lan 1,*, Weikai Dai 1, Feng Wu 1 and Caisen Chen 2 1 School of Information Engineering, Nanchang Hangkong University, 696 Fenghe South Avenue, Nanchang 330063, China; [email protected] (W.D.); [email protected] (F.W.) 2 Military Exercise and Training Center, Academy of Army Armored Force, Beijing 100072, China; [email protected] * Correspondence: [email protected] (S.Z.); [email protected] (W.L.) Abstract: Fractal and self-similarity are important characteristics of complex networks. The correla- tion dimension is one of the measures implemented to characterize the fractal nature of unweighted structures, but it has not been extended to weighted networks. In this paper, the correlation dimen- sion is extended to the weighted networks. The proposed method uses edge-weights accumulation to obtain scale distances. It can be used not only for weighted networks but also for unweighted networks. We selected six weighted networks, including two synthetic fractal networks and four real-world networks, to validate it. The results show that the proposed method was effective for the fractal scaling analysis of weighted complex networks. Meanwhile, this method was used to analyze the fractal properties of the Newman–Watts (NW) unweighted small-world networks. Compared with other fractal dimensions, the correlation dimension is more suitable for the quantitative analysis of small-world effects. Keywords: fractal property; correlation dimension; weighted networks; small-world network Citation: Zhang, S.; Lan, W.; Dai, W.; Wu, F.; Chen, C. An Extended 1. Introduction Correlation Dimension of Complex Recently, revealing and characterizing complex systems from a complex networks Networks.
    [Show full text]
  • Weighted Network Analysis Steve Horvath
    Weighted Network Analysis Steve Horvath Weighted Network Analysis Applications in Genomics and Systems Biology ABC Steve Horvath Professor of Human Genetics and Biostatistics University of California, Los Angeles Los Angeles, CA 90095-7088, USA [email protected] ISBN 978-1-4419-8818-8 e-ISBN 978-1-4419-8819-5 DOI 10.1007/978-1-4419-8819-5 Springer New York Dordrecht Heidelberg London Library of Congress Control Number: 2011925163 c Springer Science+Business Media, LLC 2011 All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer Science+Business Media, LLC, 233 Spring Street, New York, NY 10013, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights. Printed on acid-free paper Springer is part of Springer Science+Business Media (www.springer.com) To Lora, my brother Markus, my parents, Joseph O’Brien and Joerg Zimmermann Preface The past decade has seen an incredible growth of network methods following publications by Laszlo Barabasi and others. Excellent text books exist on general networks and graph theory, but these books typically describe unweighted networks.
    [Show full text]
  • Knowledge Discovery in Scientific Databases Using Text Mining and Social Network Analysis
    2012 IEEE Conference on Control, Systems and Industrial Informatics (ICCSII) Bandung, Indonesia, September 23-26,2012 Knowledge Discovery in Scientific Databases Using Text Mining and Social Network Analysis Ammar lalalimanesh Information Engineering Department Iranian Research Institute for Information Science and Technology Tehran, Iran [email protected] Abstract-This paper introduces a novel methodology to extract IRANDOC duties, and thereforeo, analyzing theses contents is core concepts from text corpus. This methodology is based on an interesting research domain for it. text mining and social network analysis. At the text mining phase the keywords are extracted by tokenizing, removing stop-lists In this research we tried to discover core concepts from and generating N-grams. Network analysis phase includes co­ theses repository by applying text mining methods and SNA word occurrence extraction, network representation of linked techniques. Accordingly 650 M.Sc and PhD theses were terms and calculating centrality measure. We applied our selected as test corpus. The text mining routines were applied methodology on a text corpus including 650 thesis titles in the on theses titles. First, key phrases were extracted from theses domain of Industrial engineering. Interpreting enriched titles. Next the network of co-word occurrence was drawn. networks was interesting and gave us valuable knowledge about Finally centrality measures were calculated for central words. corpus content. To conclude we tried to demonstrate these measures visually and to interpret them by exploring enriched network. Keywords-Text nunmg; Social network analysis,' Industrial engineering; concept mapping; Knowledge discovery The organization of this paper is as follows. In the second section, we review related works and experiences in the context I.
    [Show full text]
  • Assortativity Measures for Weighted and Directed Networks
    Assortativity measures for weighted and directed networks Yelie Yuan1, Jun Yan1, and Panpan Zhang2,∗ 1Department of Statistics, University of Connecticut, Storrs, CT 06269 2Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA 19104 ∗Corresponding author: [email protected] January 15, 2021 arXiv:2101.05389v1 [stat.AP] 13 Jan 2021 Abstract Assortativity measures the tendency of a vertex in a network being connected by other ver- texes with respect to some vertex-specific features. Classical assortativity coefficients are defined for unweighted and undirected networks with respect to vertex degree. We propose a class of assortativity coefficients that capture the assortative characteristics and structure of weighted and directed networks more precisely. The vertex-to-vertex strength correlation is used as an example, but the proposed measure can be applied to any pair of vertex-specific features. The effectiveness of the proposed measure is assessed through extensive simula- tions based on prevalent random network models in comparison with existing assortativity measures. In application World Input-Ouput Networks, the new measures reveal interesting insights that would not be obtained by using existing ones. An implementation is publicly available in a R package wdnet. 1 Introduction In traditional network analysis, assortativity or assortative mixing (Newman, 2002) is a mea- sure assessing the preference of a vertex being connected (by edges) with other vertexes in a network. The measure reflects the principle of homophily (McPherson et al., 2001)|the tendency of the entities to be associated with similar partners in a social network. The prim- itive assortativity measure proposed by Newman(2002) was defined to study the tendency of connections between nodes based on their degrees, which is why it is also called degree-degree correlation (van der Hofstad and Litvak, 2014).
    [Show full text]
  • PHYSICAL REVIEW E 98, 042304 (2018) Weight Thresholding on Complex Networks
    PHYSICAL REVIEW E 98, 042304 (2018) Weight thresholding on complex networks Xiaoran Yan,1,* Lucas G. S. Jeub,2 Alessandro Flammini,1,2 Filippo Radicchi,2 and Santo Fortunato1,2 1Indiana University Network Science Institute (IUNI), 1001 E SR 45/46 Bypass, Bloomington, Indiana 47408, USA 2School of Informatics, Computing and Engineering, Indiana University, 700 N Woodlawn Ave, Bloomington, Indiana 47408, USA (Received 21 June 2018; published 8 October 2018) Weight thresholding is a simple technique that aims at reducing the number of edges in weighted networks that are otherwise too dense for the application of standard graph-theoretical methods. We show that the group structure of real weighted networks is very robust under weight thresholding, as it is maintained even when most of the edges are removed. This appears to be related to the correlation between topology and weight that characterizes real networks. On the other hand, the behavior of other properties is generally system dependent. DOI: 10.1103/PhysRevE.98.042304 I. INTRODUCTION alternative to expensive community detection algorithms for testing the robustness of communities under thresholding. Many real networks have weighted edges [1], representing the intensity of the interaction between pairs of vertices. Also, some weighted networks, e.g., financial [2] and brain net- II. METHODS works [3], have a high density of edges. Analyzing very dense graphs with tools of network science is often impossible, Let us consider a weighted and undirected graph G com- unless some preprocessing technique is applied to reduce the posed of N vertices and M edges. Edges have positive number of connections.
    [Show full text]
  • Weighted Network Estimation by the Use of Topological Graph Metrics
    1 Weighted network estimation by the use of topological graph metrics Loukianos Spyrou and Javier Escudero Abstract—Topological metrics of graphs provide a natural way to describe the prominent features of various types of networks. Graph metrics describe the structure and interplay of graph edges and have found applications in many scientific fields. In this work, graph metrics are used in network estimation by developing optimisation methods that incorporate prior knowledge of a network’s topology. The derivatives of graph metrics are used in gradient descent schemes for weighted undirected network denoising, network completion , and network decomposition. The successful performance of our methodology is shown in a number of toy examples and real-world datasets. Most notably, our work establishes a new link between graph theory, network science and optimisation. Index Terms—graph metric derivatives, graph theory, network completion, network denoising, network decomposition, optimisation ! 1 INTRODUCTION RAPH theory has found applications in many scientific satisfy specific properties [17]–[20]. This can be particularly G fields in an attempt to analyse interconnections be- useful when building null models for hypothesis testing. tween phenomena, measurements, and systems. It provides Network inference problem attempts to identify a network a data structure that naturally express those interconnec- structure where the edges have been corrupted by a diffu- tions and also provides a framework for further analysis sion process through the network [21], [22]. More recently, [1], [2]. A graph consists of a set of nodes and edges reducing the noise in social networks has been attempted in describing the connections between the nodes. The edges [23]–[25].
    [Show full text]
  • Research on the Node Importance of a Weighted Network Based on the K-Order Propagation Number Algorithm
    entropy Article Research on the Node Importance of a Weighted Network Based on the K-Order Propagation Number Algorithm Pingchuan Tang 1, Chuancheng Song 2 , Weiwei Ding 1, Junkai Ma 1, Jun Dong 3,* and Liya Huang 1,4,* 1 College of Electronic and Optical Engineering & College of Microelectronics, Nanjing University of Posts and Telecommunications, Nanjing 210023, China; [email protected] (P.T.); [email protected] (W.D.); [email protected] (J.M.) 2 Bell Honors School, Nanjing University of Posts and Telecommunications, Nanjing 210023, China; [email protected] 3 Institute of Intelligent Machines, Hefei Institute of Physical Science Chinese Academy of Sciences, Hefei 230031, China 4 National and Local Joint Engineering Laboratory of RF Integration and Micro-Assembly Technology, Nanjing 210003, China * Correspondence: [email protected] (J.D.); [email protected] (L.H.) Received: 11 February 2020; Accepted: 19 March 2020; Published: 22 March 2020 Abstract: To describe both the global and local characteristics of a network more comprehensively, we propose the weighted K-order propagation number (WKPN) algorithm to extract the disease propagation based on the network topology to evaluate the node importance. Each node is set as the source of infection, and the total number of infected nodes is defined as the K-order propagation number after experiencing the propagation time K. The simulation of the symmetric network with bridge nodes indicated that the WKPN algorithm was more effective for evaluation of the algorithm features. A deliberate attack strategy, which indicated an attack on the network according to the node importance from high to low, was employed to evaluate the WKPN algorithm in real networks.
    [Show full text]
  • A Network Analysis of Money Laundering in the United Kingdom
    Mapping the laundromat: A network analysis of money laundering in the United Kingdom By Ian Goodrich Submitted to Central European University School of Public Policy In partial fulfilment of the requirements for the degree of Master of Arts in Public Policy (one year) Supervisor: Mihály Fazekas CEU eTD Collection Budapest, Hungary 2019 Authorship Declaration I, Ian Goodrich, hereby declare that I am the sole author of this thesis. To the best of my knowledge this thesis contains no material previously published by any other person except where due acknowledgement has been made. This thesis contains no material which has been accepted as part of the requirements of any other academic degree or non-degree program, in English or in any other language. This is a true copy of the thesis, including final revisions. March 4, 2019 CEU eTD Collection i Acknowledgements I’m writing this having returned to university after over a decade out in the wilderness of work. It’s been a life-changing experience, and one that I’ve thoroughly enjoyed, but there’s no chance I’d have done it without the faith others have shown in me. First and foremost, I must thank my beautiful and brilliant wife and partner, Dion. For many years, she has provided the love, support and encouragement which has enabled everything I’ve done. She was the first to suggest I return to university, she packed up her life to be with me in Budapest, and she has been consistently kind, caring and patient as I’ve completed my studies. Thank you, baby, I love you.
    [Show full text]
  • Assortativity and Act Degree Distribution of Some Collaboration Networks
    ARTICLE IN PRESS Physica A 383 (2007) 687–702 www.elsevier.com/locate/physa Assortativity and act degree distribution of some collaboration networks Hui Changa, Bei-Bei Sub, Yue-Ping Zhoua, Da-Ren Hea,Ã aCollege of Physics Science and Technology, Yangzhou University, Yangzhou 225002, China bWuxi Professional College of Science and Technology, Wuxi 214028, China Received 11 February 2007; received in revised form 18 April 2007 Available online 27 April 2007 Abstract Empirical investigation results on weighted and un-weighted assortativity, act degree distribution, degree distribution and node strength distribution of nine real world collaboration networks have been presented. The investigations propose that act degree distribution, degree distribution and node strength distribution usually show so-called ‘‘shifted power law’’ (SPL) function forms, which can continuously vary from an ideal power law to an ideal exponential decay. Two parameters, a and Z, can be used for description of the distribution functions. Another conclusion is that assortativity coefficient and the parameter, a or Z, is monotonously dependent on each other. By the collaboration network evolution model introduced in a reference [P. Zhang et al., Physica A 360 (2006) 599], we analytically derived the SPL distributions, which typically appeared in general situations where nodes are selected partially randomly, with a probability p, and partially by linear preferential principle, with the probability 1 À p. The analytic discussion gives an explicit expression on the relationship between the random selection proportion p and the parameters a and Z. The numerical simulation results by the model show a monotonic dependence of assortativity on the random selection proportion p.
    [Show full text]