Link to the Paper
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Basic Models and Questions in Statistical Network Analysis
Basic models and questions in statistical network analysis Mikl´osZ. R´acz ∗ with S´ebastienBubeck y September 12, 2016 Abstract Extracting information from large graphs has become an important statistical problem since network data is now common in various fields. In this minicourse we will investigate the most natural statistical questions for three canonical probabilistic models of networks: (i) community detection in the stochastic block model, (ii) finding the embedding of a random geometric graph, and (iii) finding the original vertex in a preferential attachment tree. Along the way we will cover many interesting topics in probability theory such as P´olya urns, large deviation theory, concentration of measure in high dimension, entropic central limit theorems, and more. Outline: • Lecture 1: A primer on exact recovery in the general stochastic block model. • Lecture 2: Estimating the dimension of a random geometric graph on a high-dimensional sphere. • Lecture 3: Introduction to entropic central limit theorems and a proof of the fundamental limits of dimension estimation in random geometric graphs. • Lectures 4 & 5: Confidence sets for the root in uniform and preferential attachment trees. Acknowledgements These notes were prepared for a minicourse presented at University of Washington during June 6{10, 2016, and at the XX Brazilian School of Probability held at the S~aoCarlos campus of Universidade de S~aoPaulo during July 4{9, 2016. We thank the organizers of the Brazilian School of Probability, Paulo Faria da Veiga, Roberto Imbuzeiro Oliveira, Leandro Pimentel, and Luiz Renato Fontes, for inviting us to present a minicourse on this topic. We also thank Sham Kakade, Anna Karlin, and Marina Meila for help with organizing at University of Washington. -
Networkx Reference Release 1.9.1
NetworkX Reference Release 1.9.1 Aric Hagberg, Dan Schult, Pieter Swart September 20, 2014 CONTENTS 1 Overview 1 1.1 Who uses NetworkX?..........................................1 1.2 Goals...................................................1 1.3 The Python programming language...................................1 1.4 Free software...............................................2 1.5 History..................................................2 2 Introduction 3 2.1 NetworkX Basics.............................................3 2.2 Nodes and Edges.............................................4 3 Graph types 9 3.1 Which graph class should I use?.....................................9 3.2 Basic graph types.............................................9 4 Algorithms 127 4.1 Approximation.............................................. 127 4.2 Assortativity............................................... 132 4.3 Bipartite................................................. 141 4.4 Blockmodeling.............................................. 161 4.5 Boundary................................................. 162 4.6 Centrality................................................. 163 4.7 Chordal.................................................. 184 4.8 Clique.................................................. 187 4.9 Clustering................................................ 190 4.10 Communities............................................... 193 4.11 Components............................................... 194 4.12 Connectivity.............................................. -
Representing Graphs by Polygons with Edge Contacts in 3D
Representing Graphs by Polygons with Side Contacts in 3D∗ Elena Arseneva1, Linda Kleist2, Boris Klemz3, Maarten Löffler4, André Schulz5, Birgit Vogtenhuber6, and Alexander Wolff7 1 Saint Petersburg State University, Russia [email protected] 2 Technische Universität Braunschweig, Germany [email protected] 3 Freie Universität Berlin, Germany [email protected] 4 Utrecht University, the Netherlands [email protected] 5 FernUniversität in Hagen, Germany [email protected] 6 Technische Universität Graz, Austria [email protected] 7 Universität Würzburg, Germany orcid.org/0000-0001-5872-718X Abstract A graph has a side-contact representation with polygons if its vertices can be represented by interior-disjoint polygons such that two polygons share a common side if and only if the cor- responding vertices are adjacent. In this work we study representations in 3D. We show that every graph has a side-contact representation with polygons in 3D, while this is not the case if we additionally require that the polygons are convex: we show that every supergraph of K5 and every nonplanar 3-tree does not admit a representation with convex polygons. On the other hand, K4,4 admits such a representation, and so does every graph obtained from a complete graph by subdividing each edge once. Finally, we construct an unbounded family of graphs with average vertex degree 12 − o(1) that admit side-contact representations with convex polygons in 3D. Hence, such graphs can be considerably denser than planar graphs. 1 Introduction A graph has a contact representation if its vertices can be represented by interior-disjoint geometric objects1 such that two objects touch exactly if the corresponding vertices are adjacent. -
Latent Distance Estimation for Random Geometric Graphs ∗
Latent Distance Estimation for Random Geometric Graphs ∗ Ernesto Araya Valdivia Laboratoire de Math´ematiquesd'Orsay (LMO) Universit´eParis-Sud 91405 Orsay Cedex France Yohann De Castro Ecole des Ponts ParisTech-CERMICS 6 et 8 avenue Blaise Pascal, Cit´eDescartes Champs sur Marne, 77455 Marne la Vall´ee,Cedex 2 France Abstract: Random geometric graphs are a popular choice for a latent points generative model for networks. Their definition is based on a sample of n points X1;X2; ··· ;Xn on d−1 the Euclidean sphere S which represents the latent positions of nodes of the network. The connection probabilities between the nodes are determined by an unknown function (referred to as the \link" function) evaluated at the distance between the latent points. We introduce a spectral estimator of the pairwise distance between latent points and we prove that its rate of convergence is the same as the nonparametric estimation of a d−1 function on S , up to a logarithmic factor. In addition, we provide an efficient spectral algorithm to compute this estimator without any knowledge on the nonparametric link function. As a byproduct, our method can also consistently estimate the dimension d of the latent space. MSC 2010 subject classifications: Primary 68Q32; secondary 60F99, 68T01. Keywords and phrases: Graphon model, Random Geometric Graph, Latent distances estimation, Latent position graph, Spectral methods. 1. Introduction Random geometric graph (RGG) models have received attention lately as alternative to some simpler yet unrealistic models as the ubiquitous Erd¨os-R´enyi model [11]. They are generative latent point models for graphs, where it is assumed that each node has associated a latent d point in a metric space (usually the Euclidean unit sphere or the unit cube in R ) and the connection probability between two nodes depends on the position of their associated latent points. -
Analytical Framework for Recurrence-Network Analysis of Time Series
Analytical framework for recurrence-network analysis of time series Jonathan F. Donges,1, 2, ∗ Jobst Heitzig,1 Reik V. Donner,1 and J¨urgenKurths1, 2, 3 1Potsdam Institute for Climate Impact Research, P.O. Box 601203, 14412 Potsdam, Germany 2Department of Physics, Humboldt University Berlin, Newtonstr. 15, 12489 Berlin, Germany 3Institute for Complex Systems and Mathematical Biology, University of Aberdeen, Aberdeen AB24 3FX, United Kingdom (Dated: August 13, 2018) Recurrence networks are a powerful nonlinear tool for time series analysis of complex dynamical systems. While there are already many successful applications ranging from medicine to paleoclima- tology, a solid theoretical foundation of the method has still been missing so far. Here, we interpret an "-recurrence network as a discrete subnetwork of a \continuous" graph with uncountably many vertices and edges corresponding to the system's attractor. This step allows us to show that various statistical measures commonly used in complex network analysis can be seen as discrete estimators of newly defined continuous measures of certain complex geometric properties of the attractor on the scale given by ". In particular, we introduce local measures such as the "-clustering coefficient, mesoscopic measures such as "-motif density, path-based measures such as "-betweennesses, and global measures such as "-efficiency. This new analytical basis for the so far heuristically motivated network measures also provides an objective criterion for the choice of " via a percolation threshold, and it shows that estimation can be improved by so-called node splitting invariant versions of the measures. We finally illustrate the framework for a number of archetypical chaotic attractors such as those of the Bernoulli and logistic maps, periodic and two-dimensional quasi-periodic motions, and for hyperballs and hypercubes, by deriving analytical expressions for the novel measures and com- paring them with data from numerical experiments. -
Geometric Biplane Graphs I: Maximal Graphs ∗
Geometric Biplane Graphs I: Maximal Graphs ∗ Alfredo Garc´ıa1 Ferran Hurtado2 Matias Korman3;4 In^esMatos5 Maria Saumell6 Rodrigo I. Silveira5;2 Javier Tejel1 Csaba D. T´oth7 Abstract We study biplane graphs drawn on a finite planar point set S in general position. This is the family of geometric graphs whose vertex set is S and can be decomposed into two plane graphs. We show that two maximal biplane graphs|in the sense that no edge can be added while staying biplane|may differ in the number of edges, and we provide an efficient algorithm for adding edges to a biplane graph to make it maximal. We also study extremal properties of maximal biplane graphs such as the maximum number of edges and the largest maximum connectivity over n-element point sets. 1 Introduction In a geometric graph G = (V; E), the vertices are distinct points in the plane in general position, and the edges are straight line segments between pairs of vertices. A plane graph is a geometric graph in which no two edges cross. Every (abstract) graph has a realization as a geometric graph (by simply mapping the vertices into distinct points in the plane, no three of which are collinear), and every planar graph can be realized as a plane graph by F´ary'stheorem [13]. The number of n-vertex labeled planar graphs is at least 27:22n n! [15]. · However, there are only 2O(n) plane graphs on any given set of n points in the plane [2, 22]. We consider a generalization of plane graphs. -
The Domination Number of On-Line Social Networks and Random Geometric Graphs?
The domination number of on-line social networks and random geometric graphs? Anthony Bonato1, Marc Lozier1, Dieter Mitsche2, Xavier P´erez-Gim´enez1, and Pawe lPra lat1 1 Ryerson University, Toronto, Canada, [email protected], [email protected], [email protected], [email protected] 2 Universit´ede Nice Sophia-Antipolis, Nice, France [email protected] Abstract. We consider the domination number for on-line social networks, both in a stochastic net- work model, and for real-world, networked data. Asymptotic sublinear bounds are rigorously derived for the domination number of graphs generated by the memoryless geometric protean random graph model. We establish sublinear bounds for the domination number of graphs in the Facebook 100 data set, and these bounds are well-correlated with those predicted by the stochastic model. In addition, we derive the asymptotic value of the domination number in classical random geometric graphs. 1. Introduction On-line social networks (or OSNs) such as Facebook have emerged as a hot topic within the network science community. Several studies suggest OSNs satisfy many properties in common with other complex networks, such as: power-law degree distributions [2, 13], high local cluster- ing [36], constant [36] or even shrinking diameter with network size [23], densification [23], and localized information flow bottlenecks [12, 24]. Several models were designed to simulate these properties [19, 20], and one model that rigorously asymptotically captures all these properties is the geometric protean model (GEO-P) [5{7] (see [16, 25, 30, 31] for models where various ranking schemes were first used, and which inspired the GEO-P model). -
Penrose M. Random Geometric Graphs (OUP, 2003)(ISBN 0198506260)(345S) Mac .Pdf
OXFORD STUDIES IN PROBABILITYManaging Editor L. C. G. ROGERS Editorial Board P. BAXENDALE P. GREENWOOD F. P. KELLY J.-F. LE GALL E. PARDOUX D. WILLIAMS OXFORD STUDIES IN PROBABILITY 1. F. B. Knight: Foundations of the prediction process 2. A. D. Barbour, L. Holst, and S. Janson: Poisson approximation 3. J. F. C. Kingman: Poisson processes 4. V. V. Petrov: Limit theorems of probability theory 5. M. Penrose: Random geometric graphs Random Geometric Graphs MATHEW PENROSE University of Bath Great Clarendon Street, Oxford OX26DP Oxford University Press is a department of the University of Oxford. It furthers the University's objective of excellence in research, scholarship, and education by publishing worldwide in Oxford New York Auckland Bangkok Buenos Aires Cape Town Chennai Dar es Salaam Delhi Hong Kong Istanbul Kaarachi Kolkata Kuala Lumpur Madrid Melbourne Mexico City Mumbai Nairobi São Paulo Shanghai Taipei Tokyo Toronto Oxford is a registered trade mark of Oxford University Press in the UK and in certain other countries Published in the United States by Oxford University Press Inc., New York © Mathew Penrose, 2003 The moral rights of the author have been asserted Database right Oxford University Press (maker) First published 2003 Reprinted 2004 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of Oxford University Press, or as expressly permitted by law, or under terms agreed with the appropriate reprographics rights organisation. Enquiries concerning reproduction outside the scope of the above should be sent to the Rights Department, Oxford University Press, at the address above. -
Clique Colourings of Geometric Graphs
CLIQUE COLOURINGS OF GEOMETRIC GRAPHS COLIN MCDIARMID, DIETER MITSCHE, AND PAWELPRA LAT Abstract. A clique colouring of a graph is a colouring of the vertices such that no maximal clique is monochromatic (ignoring isolated vertices). The least number of colours in such a colouring is the clique chromatic number. Given n points x1;:::; xn in the plane, and a threshold r > 0, the corre- sponding geometric graph has vertex set fv1; : : : ; vng, and distinct vi and vj are adjacent when the Euclidean distance between xi and xj is at most r. We investigate the clique chromatic number of such graphs. We first show that the clique chromatic number is at most 9 for any geometric graph in the plane, and briefly consider geometric graphs in higher dimensions. Then we study the asymptotic behaviour of the clique chromatic number for the random geometric graph G(n; r) in the plane, where n random points are independently and uniformly distributed in a suitable square. We see that as r increases from 0, with high probability the clique chromatic number is 1 for very small r, then 2 for small r, then at least 3 for larger r, and finally drops back to 2. 1. Introduction and main results In this section we introduce clique colourings and geometric graphs; and we present our main results, on clique colourings of deterministic and random geometric graphs. Recall that a proper colouring of a graph is a labeling of its vertices with colours such that no two vertices sharing the same edge have the same colour; and the smallest number of colours in a proper colouring of a graph G = (V; E) is its chromatic number, denoted by χ(G). -
Betweenness Centrality in Dense Random Geometric Networks
Betweenness Centrality in Dense Random Geometric Networks Alexander P. Giles Orestis Georgiou Carl P. Dettmann School of Mathematics Toshiba Telecommunications Research Laboratory School of Mathematics University of Bristol 32 Queen Square University of Bristol Bristol, UK, BS8 1TW Bristol, UK, BS1 4ND Bristol, UK, BS8 1TW [email protected] [email protected] [email protected] Abstract—Random geometric networks are mathematical are routed in a multi-hop fashion between any two nodes that structures consisting of a set of nodes placed randomly within wish to communicate, allowing much larger, more flexible d a bounded set V ⊆ R mutually coupled with a probability de- networks (due to the lack of pre-established infrastructure or pendent on their Euclidean separation, and are the classic model used within the expanding field of ad-hoc wireless networks. In the need to be within range of a switch). Commercial ad hoc order to rank the importance of the network’s communicating networks are nowadays realised under Wi-Fi Direct standards, nodes, we consider the well established ‘betweenness’ centrality enabling device-to-device (D2D) offloading in LTE cellular measure (quantifying how often a node is on a shortest path of networks [5]. links between any pair of nodes), providing an analytic treatment This new diversity in machine betweenness needs to be of betweenness within a random graph model by deriving a closed form expression for the expected betweenness of a node understood, and moreover can be harnessed in at least three placed within a dense random geometric network formed inside separate ways: historically, in 2005 Gupta et al. -
Robustness of Community Detection to Random Geometric Perturbations
Robustness of Community Detection to Random Geometric Perturbations Sandrine Péché Vianney Perchet LPSM, University Paris Diderot Crest, ENSAE & Criteo AI Lab [email protected] [email protected] Abstract We consider the stochastic block model where connection between vertices is perturbed by some latent (and unobserved) random geometric graph. The objective is to prove that spectral methods are robust to this type of noise, even if they are agnostic to the presence (or not) of the random graph. We provide explicit regimes where the second eigenvector of the adjacency matrix is highly correlated to the true community vector (and therefore when weak/exact recovery is possible). This is possible thanks to a detailed analysis of the spectrum of the latent random graph, of its own interest. Introduction In a d-dimensional random geometric graph, N vertices are assigned random coordinates in Rd, and only points close enough to each other are connected by an edge. Random geometric graphs are used to model complex networks such as social networks, the world wide web and so on. We refer to [19]- and references therein - for a comprehensive introduction to random geometric graphs. On the other hand, in social networks, users are more likely to connect if they belong to some specific community (groups of friends, political party, etc.). This has motivated the introduction of the stochastic block models (see the recent survey [1] and the more recent breakthrough [5] for more details), where in the simplest case, each of the N vertices belongs to one (and only one) of the two communities that are present in the network. -
COLORING GRAPHS USING TOPOLOGY 11 Realized by Tutte Who Called an Example of a Disc G for Which the Boundary Has Chromatic Number 4 a Chromatic Obstacle
COLORING GRAPHS USING TOPOLOGY OLIVER KNILL Abstract. Higher dimensional graphs can be used to color 2- dimensional geometric graphs G. If G the boundary of a three dimensional graph H for example, we can refine the interior until it is colorable with 4 colors. The later goal is achieved if all inte- rior edge degrees are even. Using a refinement process which cuts the interior along surfaces S we can adapt the degrees along the boundary S. More efficient is a self-cobordism of G with itself with a host graph discretizing the product of G with an interval. It fol- lows from the fact that Euler curvature is zero everywhere for three dimensional geometric graphs, that the odd degree edge set O is a cycle and so a boundary if H is simply connected. A reduction to minimal coloring would imply the four color theorem. The method is expected to give a reason \why 4 colors suffice” and suggests that every two dimensional geometric graph of arbitrary degree and ori- entation can be colored by 5 colors: since the projective plane can not be a boundary of a 3-dimensional graph and because for higher genus surfaces, the interior H is not simply connected, we need in general to embed a surface into a 4-dimensional simply connected graph in order to color it. This explains the appearance of the chromatic number 5 for higher degree or non-orientable situations, a number we believe to be the upper limit. For every surface type, we construct examples with chromatic number 3; 4 or 5, where the construction of surfaces with chromatic number 5 is based on a method of Fisk.