Connected Spatial Networks Over Random Points and a Route-Length Statistic David J

Total Page:16

File Type:pdf, Size:1020Kb

Connected Spatial Networks Over Random Points and a Route-Length Statistic David J Statistical Science 2010, Vol. 25, No. 3, 275–288 DOI: 10.1214/10-STS335 © Institute of Mathematical Statistics,2010 Connected Spatial Networks over Random Points and a Route-Length Statistic David J. Aldous and Julian Shun Abstract. We review mathematically tractable models for connected net- works on random points in the plane, emphasizing the class of proximity graphs which deserves to be better known to applied probabilists and statisti- cians. We introduce and motivate a particular statistic R measuring shortness of routes in a network. We illustrate, via Monte Carlo in part, the trade-off between normalized network length and R in a one-parameter family of prox- imity graphs. How close this family comes to the optimal trade-off over all possible networks remains an intriguing open question. The paper is a write-up of a talk developed by the first author during 2007– 2009. Key words and phrases: Proximity graph, random graph, spatial network, geometric graph. 1. INTRODUCTION Recall that the most studied network model, the ran- dom geometric graph [40]reviewedinSection2.1, The topic called random networks or complex net- does not permit both connectivity and bounded nor- works has attracted huge attention over the last 20 malized length in the n limit. An attractive al- years. Much of this work focuses on examples such as ternative is the class of proximity→∞ graphs,reviewedin social networks or WWW links, in which edges are not Section 2.3,whichinthedeterministiccasehavebeen closely constrained by two-dimensional geometry. In studied within computational geometry. These graphs contrast, in a spatial network not only are vertices and are always connected. Proximity graphs on random edges situated in two-dimensional space, but also it is points have been studied in only a few papers, but are actual distances, rather than number of edges, that are potentially interesting for many purposes other than of interest. To be concrete, we visualize idealized inter- the specific “short route lengths” topic of this paper city road networks, and a feature of interest is the (min- (see Section 6.5). One could also imagine construc- imum) route length between two given cities. Because tions which depend on points having specifically the we work only in two dimensions, the word spatial may Poisson point process distribution, and one novel such be misleading, but equally the word planar would be network, which we name the Hammersley network,is misleading because we do not require networks to be described in Section 2.5. planar graphs (if edges cross, then a junction is cre- Visualizing idealized road networks, it is natural to ated). take total network length as the “cost” of a network, but Our major purpose is to draw the attention of read- what is the corresponding “benefit”? Primarily we are ers from the applied probability and statistics commu- interested in having short route lengths. Choosing an nities to a particular class of spatial network models. appropriate statistic to measure the latter turns out to be rather subtle, and the (only) technical innovation of this David J. Aldous is Professor, Department of Statistics, paper is the introduction (Section 3.2)andmotivation University of California, 367 Evans Hall # 3860, Berkeley, California 94720, USA (e-mail: [email protected]; of a specific statistic R for measuring the effectiveness URL: www.stat.berkeley.edu/users/aldous). Julian Shun is of a network in providing short routes. Graduate Student, Machine Learning Department, In the theory of spatial networks over random points, Carnegie Mellon University, 5000 Forbes Avenue, it is a challenge to quantify the trade-off between net- Pittsburgh, Pennsylvania 15213, USA (e-mail: work length [precisely, the normalized length L de- [email protected]). fined at (2)] and route length efficiency statistics such 275 276 D. J. ALDOUS AND J. SHUN as R.OurparticularstatisticR is not amenable to ex- First (Sections 2.1–2.3)areschemeswhichusede- plicit calculation even in comparatively tractable mod- terministic rules to define edges for an arbitrary deter- els, but in Section 4 we present the results from Monte ministic configuration of cities; then one just applies Carlo simulations. In particular, Figure 7 shows the these rules to a random configuration. Second, one can trade-off for the particular β-skeleton family of prox- have random rules for edges in a deterministic config- imity graphs. uration (e.g., the probability of an edge between cities Given a normalized network length L,foranyreal- i and j is a function of Euclidean distance d(xi,xj ),as ization of cities there is some network of normalized in popular small worlds models [39]), and again apply length L which minimizes R.AsindicatedinSec- to a random configuration. Third, and more subtly, one tion 5,bygeneralabstractmathematicalarguments, can have constructions that depend on the randomness there must exist a deterministic function Ropt(L) giv- model for city positions—Section 2.5 provides a novel ing (in the “number of cities ”limitunderthe example. random model) the minimum value→∞ of R over all pos- We work throughout with reference to Euclidean dis- sible networks of normalized length L.Anintriguing tance d(x,y) on the plane, even though many mod- open question is as follows: els could be defined with reference to other metrics (or even when the triangle inequality does not hold, for the how close are the values Rβ-skel(L) from the MST). β-skeleton proximity graphs to the optimum values Ropt(L)? 2.1 The Geometric Graph As discussed in Section 5.3,atfirstsightitlookseasyto In Sections 2.1–2.3 we have an arbitrary configura- design heuristic algorithms for networks which should tion x xi of city positions, and a deterministic rule for defining={ } the edge-set .Usuallyingraphtheory improve over the β-skeletons, for example, by intro- E ducing Steiner points, but in practice we have not suc- one imagines a finite configuration, but note that every- ceeded in doing so. thing makes sense for locally finite configurations too. This paper focuses on the random model for city po- Where helpful, we assume “general position,” so that sitions because it seems the natural setting for theoret- intercity distances d(xi,xj ) are all distinct. For the geometric graph one fixes 0 <c< and ical study. As a complement, in [10]wegiveempiri- ∞ cal data for the values of (L, R) for certain real-world defines networks (on the 20 largest cities, in each of 10 US (xi,xj ) iff d(xi,xj ) c. States). In [8]wegiveanalyticresultsandboundson ∈ E ≤ For the K-neighbor graph one fixes K 1anddefines the trade-off between L and the mathematically more ≥ tractable stretch statistic R at (4), in both worst-case (xi,xj ) iff xi is one of the K closest max ∈ E and random-case settings for city positions. Let us also neighbors of xj ,orxj is one of the K clos- point out a (perhaps) nonobvious insight discussed in est neighbors of xi. Section 3.3:indesigningnetworkstobeefficientinthe Amoment’sthoughtshowsthesegraphsareingeneral sense of providing short routes, the main difficulty is not connected, so we turn to models which are “by con- providing short routes between city-pairs at a specific struction” connected. We remark that the connectivity distance (2–3 standardized units) apart, rather than be- threshold cn in the finite n-vertex model of the random tween pairs at a large distance apart. geometric graph has been studied in detail—see Chap- Finally, recall this is a nontechnical account. Our ter 13 of [40]. purpose is to elaborate verbally the ideas outlined above; some technical aspects will be pursued else- 2.2 A Nested Sequence of Connected Graphs where. The material here and in the next section was de- veloped in graph theory with a view toward algorith- 2. MODELS FOR CONNECTED SPATIAL mic applications in computational geometry and pat- NETWORKS tern recognition. The 1992 survey [28]givesthehis- There are several conceptually different ways of tory of the subject and 116 citations. But everything defining networks on random points in the plane. To we need is immediate from the (careful choice of) defi- be concrete, we call the points cities;tobeconsistent nitions. On our arbitrary configuration x we can define four graphs whose edge-sets are nested as follows: about language, we regard xi as the position of city i and represent network edges as line segments (xi,xj ). (1) MST relative n’hood Gabriel Delaunay. ⊆ ⊆ ⊆ CONNECTED NETWORKS OVER RANDOM POINTS 277 Here are the definitions (for MST and Delaunay, it 2.3 Proximity Graphs is easy to check these are equivalent to more familiar Write v and v for the points ( 1 , 0) and ( 1 , 0). definitions). In each case, we write the criterion for an − + 2 2 The lune is the intersection of the open− discs of radii 1 edge (xi,xj ) to be present: centered at v and v .Sov and v are not in the − + − + Minimum spanning tree (MST) [24]. There does not lune but are on its boundary. Define a template A to be • 2 exist a sequence i k0,k1,...,km j of cities such asubsetofR such that: = = that (i) A is a subset of the lune. (ii) A contains the open line segment (v ,v ). max(d(xk0 ,xk1 ), d(xk1 ,xk2 ), . , d(xkm 1 ,xkm )) − + − (iii) A is invariant under the “reflection in the y- <d(x,x ). axis” map Reflectx(x1,x2) ( x1,x2) and the “re- i j = − flection in the x-axis” map Reflecty(x1,x2) (x1, Relative neighborhood graph. There does not exist a x ). = • 2 city k such that − (iv) A is open. 2 max(d(xi,xk), d(xk,xj )) < d(xi,xj ). For arbitrary points x,y in R ,defineA(x, y) to be the image of A under the natural transformation Gabriel graph.
Recommended publications
  • Oracle Database 10G: Oracle Spatial Network Data Model
    Oracle Database 10g: Oracle Spatial Network Data Model An Oracle Technical White Paper May 2005 Oracle Spatial Network Data Model Table of Contents Introduction ....................................................................................................... 3 Design Goals and Architecture ....................................................................... 4 Major Steps Using the Network Data Model................................................ 5 Network Modeling........................................................................................ 5 Network Analysis.......................................................................................... 6 Network Modeling ............................................................................................ 6 Metadata and Schema ....................................................................................... 6 Network Java Representation and API.......................................................... 7 Network Creation Using SQL and PL/SQL ................................................ 8 Creating a Logical Network......................................................................... 9 Creating a Spatial Network.......................................................................... 9 Creating an SDO Geometry Network ................................................ 10 Creating an LRS Geometry Network.................................................. 10 Creating a Topology Geometry Network........................................... 11 Network Creation
    [Show full text]
  • A Network-Of-Networks Percolation Analysis of Cascading Failures in Spatially Co-Located Road-Sewer Infrastructure Networks
    Physica A 538 (2020) 122971 Contents lists available at ScienceDirect Physica A journal homepage: www.elsevier.com/locate/physa A network-of-networks percolation analysis of cascading failures in spatially co-located road-sewer infrastructure networks ∗ ∗ Shangjia Dong a, , Haizhong Wang a, , Alireza Mostafizi a, Xuan Song b a School of Civil and Construction Engineering, Oregon State University, Corvallis, OR 97331, United States of America b Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China article info a b s t r a c t Article history: This paper presents a network-of-networks analysis framework of interdependent crit- Received 26 September 2018 ical infrastructure systems, with a focus on the co-located road-sewer network. The Received in revised form 28 May 2019 constructed interdependency considers two types of node dynamics: co-located and Available online 27 September 2019 multiple-to-one dependency, with different robustness metrics based on their function Keywords: logic. The objectives of this paper are twofold: (1) to characterize the impact of the Co-located road-sewer network interdependency on networks' robustness performance, and (2) to unveil the critical Network-of-networks percolation transition threshold of the interdependent road-sewer network. The results Percolation modeling show that (1) road and sewer networks are mutually interdependent and are vulnerable Infrastructure interdependency to the cascading failures initiated by sewer system disruption; (2) the network robust- Cascading failure ness decreases as the number of initial failure sources increases in the localized failure scenarios, but the rate declines as the number of failures increase; and (3) the sewer network contains two types of links: zero exposure and severe exposure to liquefaction, and therefore, it leads to a two-phase percolation transition subject to the probabilistic liquefaction-induced failures.
    [Show full text]
  • Planarity As a Driver of Spatial Network Structure
    DOI: http://dx.doi.org/10.7551/978-0-262-33027-5-ch075 Planarity as a driver of Spatial Network structure Garvin Haslett and Markus Brede Institute for Complex Systems Simulation, School of Electronics and Computer Science, University of Southampton, Southampton SO17 1BJ, United Kingdom [email protected] Downloaded from http://direct.mit.edu/isal/proceedings-pdf/ecal2015/27/423/1903775/978-0-262-33027-5-ch075.pdf by guest on 23 September 2021 Abstract include city science (Cardillo et al., 2006; Xie and Levin- son, 2007; Jiang, 2007; Barthelemy´ and Flammini, 2008; In this paper we introduce a new model of spatial network growth in which nodes are placed at randomly selected lo- Masucci et al., 2009; Chan et al., 2011; Courtat et al., 2011; cations in space over time, forming new connections to old Strano et al., 2012; Levinson, 2012; Rui et al., 2013; Gud- nodes subject to the constraint that edges do not cross. The mundsson and Mohajeri, 2013), electronic circuits (i Can- resulting network has a power law degree distribution, high cho et al., 2001; Bassett et al., 2010; Miralles et al., 2010; clustering and the small world property. We argue that these Tan et al., 2014), wireless networks (Huson and Sen, 1995; characteristics are a consequence of two features of our mech- anism, growth and planarity conservation. We further pro- Lotker and Peleg, 2010), leaf venation (Corson, 2010; Kat- pose that our model can be understood as a variant of ran- ifori et al., 2010), navigability (Kleinberg, 2000; Lee and dom Apollonian growth.
    [Show full text]
  • Arxiv:2006.02870V1 [Cs.SI] 4 Jun 2020
    The why, how, and when of representations for complex systems Leo Torres Ann S. Blevins [email protected] [email protected] Network Science Institute, Department of Bioengineering, Northeastern University University of Pennsylvania Danielle S. Bassett Tina Eliassi-Rad [email protected] [email protected] Department of Bioengineering, Network Science Institute and University of Pennsylvania Khoury College of Computer Sciences, Northeastern University June 5, 2020 arXiv:2006.02870v1 [cs.SI] 4 Jun 2020 1 Contents 1 Introduction 4 1.1 Definitions . .5 2 Dependencies by the system, for the system 6 2.1 Subset dependencies . .7 2.2 Temporal dependencies . .8 2.3 Spatial dependencies . 10 2.4 External sources of dependencies . 11 3 Formal representations of complex systems 12 3.1 Graphs . 13 3.2 Simplicial Complexes . 13 3.3 Hypergraphs . 15 3.4 Variations . 15 3.5 Encoding system dependencies . 18 4 Mathematical relationships between formalisms 21 5 Methods suitable for each representation 24 5.1 Methods for graphs . 24 5.2 Methods for simplicial complexes . 25 5.3 Methods for hypergraphs . 27 5.4 Methods and dependencies . 28 6 Examples 29 6.1 Coauthorship . 29 6.2 Email communications . 32 7 Applications 35 8 Discussion and Conclusion 36 9 Acknowledgments 38 10 Citation diversity statement 38 2 Abstract Complex systems thinking is applied to a wide variety of domains, from neuroscience to computer science and economics. The wide variety of implementations has resulted in two key challenges: the progenation of many domain-specific strategies that are seldom revisited or questioned, and the siloing of ideas within a domain due to inconsistency of complex systems language.
    [Show full text]
  • Query Processing in Spatial Network Databases
    Query Processing in Spatial Network Databases Dimitris Papadias Jun Zhang Department of Computer Science Department of Computer Science Hong Kong University of Science and Technology Hong Kong University of Science and Technology Clearwater Bay, Hong Kong Clearwater Bay, Hong Kong [email protected] [email protected] Nikos Mamoulis Yufei Tao Department of Computer Science and Information Systems Department of Computer Science University of Hong Kong City University of Hong Kong Pokfulam Road, Hong Kong Tat Chee Avenue, Hong Kong [email protected] [email protected] than their Euclidean distance. Previous work on spatial Abstract network databases (SNDB) is scarce and too restrictive Despite the importance of spatial networks in for emerging applications such as mobile computing and real-life applications, most of the spatial database location-based commerce. This necessitates the literature focuses on Euclidean spaces. In this development of novel and comprehensive query paper we propose an architecture that integrates processing methods for SNDB. network and Euclidean information, capturing Every conventional spatial query type (e.g., nearest pragmatic constraints. Based on this architecture, neighbors, range search, spatial joins and closest pairs) we develop a Euclidean restriction and a has a counterpart in SNDB. Consider, for instance, the network expansion framework that take road network of Figure 1.1, where the rectangles advantage of location and connectivity to correspond to hotels. If a user at location q poses the efficiently prune the search space. These range query "find the hotels within a 15km range", the frameworks are successfully applied to the most result will contain a, b and c (the numbers in the figure popular spatial queries, namely nearest correspond to network distance).
    [Show full text]
  • Urban Street Network Analysis in a Computational Notebook
    Urban Street Network Analysis in a Computational Notebook Geoff Boeing Department of Urban Planning and Spatial Analysis Sol Price School of Public Policy University of Southern California [email protected] Abstract: Computational notebooks offer researchers, practitioners, students, and educators the ability to interactively conduct analytics and disseminate re- producible workflows that weave together code, visuals, and narratives. This article explores the potential of computational notebooks in urban analytics and planning, demonstrating their utility through a case study of OSMnx and its tu- torials repository. OSMnx is a Python package for working with OpenStreetMap data and modeling, analyzing, and visualizing street networks anywhere in the world. Its official demos and tutorials are distributed as open-source Jupyter notebooks on GitHub. This article showcases this resource by documenting the repository and demonstrating OSMnx interactively through a synoptic tutorial adapted from the repository. It illustrates how to download urban data and model street networks for various study sites, compute network indicators, vi- sualize street centrality, calculate routes, and work with other spatial data such as building footprints and points of interest. Computational notebooks help in- troduce methods to new users and help researchers reach broader audiences in- terested in learning from, adapting, and remixing their work. Due to their utility and versatility, the ongoing adoption of computational notebooks in urban plan- ning, analytics, and related geocomputation disciplines should continue into the future.1 1 Introduction A traditional academic and professional divide has long existed between code creators and code users. The former would develop software tools and workflows for professional or research ap- plications, which the latter would then use to conduct analyses or answer scientific questions.
    [Show full text]
  • Shape of Shortest Paths in Random Spatial Networks Alexander Kartun-Giles, Marc Barthelemy, Carl Dettmann
    Shape of shortest paths in random spatial networks Alexander Kartun-Giles, Marc Barthelemy, Carl Dettmann To cite this version: Alexander Kartun-Giles, Marc Barthelemy, Carl Dettmann. Shape of shortest paths in random spatial networks. Physical Review E , American Physical Society (APS), 2019, 100 (3), pp.2315. 10.1103/PhysRevE.100.032315. cea-02564008 HAL Id: cea-02564008 https://hal-cea.archives-ouvertes.fr/cea-02564008 Submitted on 2 Sep 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The shape of shortest paths in random spatial networks Alexander P. Kartun-Giles,1 Marc Barthelemy,2 and Carl P. Dettmann3 1Max Plank Institute for Mathematics in the Sciences, Inselstr. 22, Leipzig, Germany 2Institut de Physique Th´eorique,CEA, CNRS-URA 2306, Gif-sur-Yvette, France 3School of Mathematics, University of Bristol, University Walk, Bristol BS8 1TW, UK In the classic model of first passage percolation, for pairs of vertices separated by a Euclidean distance L, geodesics exhibit deviations from their mean length L that are of order Lχ, while the transversal fluctuations, known as wandering, grow as Lξ. We find that when weighting edges directly with their Euclidean span in various spatial network models, we have two distinct classes defined by different exponents ξ = 3=5 and χ = 1=5, or ξ = 7=10 and χ = 2=5, depending only on coarse details of the specific connectivity laws used.
    [Show full text]
  • Experimenting with Spatial Networks to Save Time When Experimenting, It Can Be Useful to Create a Subset of the Airport Dataset
    Experimenting with Spatial Networks To save time when experimenting, it can be useful to create a subset of the airport dataset, for instance keeping only the airports of larger degree. Note also that when computing deterrence function, it is sometimes useful to work with random samples instead of computing distances between all pairs of nodes. This dataset is small enough however to compute all values in less than 1 minute on an average personal computer. To the best of my knowledge, there is no good library to work with spatial networks. networkx has functions to generate random spatial graphs (called geometric graphs https://networkx.org/ documentation/stable/reference/generators.html), but they are not designed to fit deterrence functions, and are not adapted to work with geographical coordinates. 1. Deterrence Function on the airport dataset When studying a network with spatial information, a first step to check if it can indeed be considered as a spatial network is to compute its deterrence function. (a) We will need to discretize distances using bins(similar to a histogram). Choose b bins, for instance every 500 km from 0km to 10,000km. Write a function which, provided a list of distances, return a list with b values, the number of values in each bin. A convenient way is to use numpy.digitize and collections.Counter . (b) Write a function which compute the distance between two positions on Earth in km. You can use function haversine from the package of the same name( pip install haversine ). To obtain standard latitude and longitude from the data in the airport dataset, you need to divide latitude and longitude values by 3600.
    [Show full text]
  • Osmnx: New Methods for Acquiring, Constructing, Analyzing, and Visualizing Complex Street Networks
    OSMnx: New Methods for Acquiring, Constructing, Analyzing, and Visualizing Complex Street Networks Geoff Boeing Email: [email protected] Department of City and Regional Planning University of California, Berkeley May 2017 Abstract Urban scholars have studied street networks in various ways, but there are data availability and consistency limitations to the current urban planning/street network analysis literature. To address these challenges, this article presents OSMnx, a new tool to make the collection of data and creation and analysis of street networks simple, consistent, automatable and sound from the perspectives of graph theory, transportation, and urban design. OSMnx contributes five significant capabilities for researchers and practitioners: first, the automated downloading of political boundaries and building footprints; second, the tailored and automated downloading and constructing of street network data from OpenStreetMap; third, the algorithmic correction of network topology; fourth, the ability to save street networks to disk as shapefiles, GraphML, or SVG files; and fifth, the ability to analyze street networks, including calculating routes, projecting and visualizing networks, and calculating metric and topological measures. These measures include those common in urban design and transportation studies, as well as advanced measures of the structure and topology of the network. Finally, this article presents a simple case study using OSMnx to construct and analyze street networks in Portland, Oregon. This is a preprint. Download the published article at: http://geoffboeing.com/publications/osmnx-complex- street-networks/ Cite as: Boeing, G. 2017. “OSMnx: New Methods for Acquiring, Constructing, Analyzing, and Visualizing Complex Street Networks.” Computers, Environment and Urban Systems 65, 126-139. doi:10.1016/j.compenvurbsys.2017.05.004 BOEING 1.
    [Show full text]
  • Spatial Science and Network Science: Review and Outcomes of a Complex Relationship César Ducruet, Laurent Beauguitte
    Spatial science and network science: Review and outcomes of a complex relationship César Ducruet, Laurent Beauguitte To cite this version: César Ducruet, Laurent Beauguitte. Spatial science and network science: Review and outcomes of a complex relationship. Networks and Spatial Economics, Springer Verlag, 2014, 14 (3-4), pp.297-316. 10.1007/s11067-013-9222-6. halshs-01093664 HAL Id: halshs-01093664 https://halshs.archives-ouvertes.fr/halshs-01093664 Submitted on 10 Dec 2014 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Spatial science and network science: Review and outcomes of a complex relationship DUCRUET César1 French National Centre for Scientific Research (CNRS) UMR 8054 Géographie-Cités, Paris, France Email: [email protected] BEAUGUITTE Laurent UMR 6266 IDEES, Rouen, France Email: [email protected] Pre-final version of the paper published in Networks and Spatial Economics, Vol. 14, Issue 3-4, pp. 297-316. Abstract For decades, the spatial approach to network analysis has principally focused on planar and technical networks from a classic graph theory perspective. Reference to models and methods developed by other disciplines on non-planar networks, such as sociology and physics, is recent, limited, and dispersed.
    [Show full text]
  • Improved Graph Neural Networks for Spatial Networks Using Structure-Aware Sampling
    International Journal of Geo-Information Article Improved Graph Neural Networks for Spatial Networks Using Structure-Aware Sampling Chidubem Iddianozie * and Gavin McArdle School of Computer Science, University College Dublin, Dublin 4, Ireland; [email protected] * Correspondence: [email protected] Received: 30 September 2020; Accepted: 11 November 2020; Published: 13 November 2020 Abstract: Graph Neural Networks (GNNs) have received wide acclaim in recent times due to their performance on inference tasks for unstructured data. Typically, GNNs operate by exploiting local structural information in graphs and disregarding their global structure. This is influenced by assumptions of homophily and unbiased class distributions. As a result, this could impede model performance on noisy real-world graphs such as spatial graphs where these assumptions may not be sufficiently held. In this article, we study the problem of graph learning on spatial graphs. Particularly, we focus on transductive learning methods for the imbalanced case. Given the nature of these graphs, we hypothesize that taking the global structure of the graph into account when aggregating local information would be beneficial especially with respect to generalisability. Thus, we propose a novel approach to training GNNs for these type of graphs. We achieve this through a sampling technique: Structure-Aware Sampling (SAS), which leverages the intra-class and global-geodesic distances between nodes. We model the problem as a node classification one for street networks with high variance between class sizes. We evaluate our approach using large real-world graphs against state-of-the-art methods. In the majority of cases, our approach outperforms traditional methods by up to a mean F1-score of 20%.
    [Show full text]
  • Nonobtuse Triangulations of Pslgs
    NONOBTUSE TRIANGULATIONS OF PSLGS CHRISTOPHER J. BISHOP Abstract. We show that any planar PSLG with n vertices has a conforming triangulation by O(n2.5) nonobtuse triangles, answering the question of whether a polynomial bound exists. The triangles may be chosen to be all acute or all right. A nonobtuse triangulation is Delaunay, so this result improves a previous O(n3) bound of Eldesbrunner and Tan for conforming Delaunay triangulations. In the special case that the PSLG is the triangulation of a simple polygon, we will show that only O(n2) elements are needed, improving an O(n4) bound of Bern and Eppstein. We also show that for any ǫ> 0, every PSLG has a conforming triangulation with O(n2/ǫ2) elements and with all angles bounded above by 90◦ + ǫ. This improves a 3 ◦ 7 ◦ result of S. Mitchell when ǫ = 8 π = 67.5 and Tan when ǫ = 30 π = 42 . Date: March 1, 2011. 1991 Mathematics Subject Classification. Primary: 30C62 Secondary: Key words and phrases. nonobtuse, acute, triangulation, conforming triangulation, PSLG, De- launay, Gabriel, nearest-neighbor learning, quadrilateral mesh, beta-skeleton, thick-thin decompo- sition, polynomial complexity, The author is partially supported by NSF Grant DMS 10-06309. This is a very preliminary version and should not be distributed. Please contact me for the latest version if you wish to share this with someone. 1 NONOBTUSE TRIANGULATIONS OF PSLGS 1 1. Introduction A planar straight line graph (or PSLG from now on) is the union of a finite number (possibly none) of non-intersecting open line segments together with a disjoint finite set that includes all the endpoints of the line segments, but may include other points as well.
    [Show full text]