Boman, A., Nilsson, E., Go with the Flow, 2020

Degree project, 30 credits
June 2020

Go with the flow
A study exploring public transit performance using a flow network model

Axel Boman
Erik Nilsson

Acknowledgments

For 22 weeks in the spring of 2020, we have had the honor of working with Samtrafiken and the Swedish Public Transport Data Lab (KoDa). We gratefully acknowledge Jerry Löfvenhaft for providing us with the meaningful context, network, and direction that formed the foundation of this project. Additionally, we want to express our sincere appreciation to our supervisor, Associate Professor Kristiaan Pelckmans, who has the substance of a genius: he convincingly led us through the world of graph mining. Without his clear-sightedness and persistent vision, the goal of this project would not have been realized.

Moreover, we would like to recognize Gabriella Canas and Timo Palokangas at UL's traffic unit for giving us a warm welcome and insights into the public transport industry. Also, special thanks to Jonathan Gustafsson and Oscar Mörke, the beloved janitors at The Park, for always delivering invaluable advice on life and making us laugh during the late working nights.

Lastly, we would like to thank room 13:135 at Blåsenhus, where most of this thesis was written. To most people, it's just a room with four walls, a table, and a whiteboard. To us, it's a parallel universe providing a level of embowerment and focus-instillment we did not know physical space was capable of. Thank you, Akademiska Hus.

Axel & Erik
June 2020, Uppsala

Popular science summary

Today, half of the world's population lives in urban areas. By 2050, that share is expected to rise to 70 percent [1]. The world is undergoing a structural urbanization with economic, social, and environmental consequences. Urban infrastructure is under heavy pressure to meet the challenges inherent in population growth: congestion and pollution [2].
To reach the global sustainability goals, we need to redefine how we travel and to further develop the systems we travel with. A new reality is needed: one in which mobility is coordinated, evaluated, and designed in line with regional development [2]. The Nordic countries have generally set ambitious goals for this change, particularly when it comes to public transit. Copenhagen aims for 75% of all trips in 2025 to be made by public transit, and Sweden has a shared goal of doubling the number of public transit trips between 2006 and 2020. There is agreement that the systems behind public transit must become more robust, efficient, and data-driven [1]. This study aims to take those watchwords to heart and take a step in the same direction.

This work examines the untapped potential of Google's GTFS¹ format, a format for public transit data that contains, among other things, real-time positioning, information about schedule deviations, and the organization of the network. The appeal of GTFS stems above all from its impressive user base: 1240 transit networks in 672 regions. This spread has made the format a de facto standard, and solutions built on top of it indirectly open doors to public transit networks all over the world. The format was designed primarily to serve applications that perform trip planning for end users. This work instead investigates whether the same data also has potential for evaluating the performance of a public transit network, that is, from a network owner's perspective. More specifically, this thesis examines how local vulnerabilities in the network can be modeled using GTFS data collected from the UL network during January 2020.

¹ A format for public transit data that allows actors in the public transit sector to publish their data and developers to write applications that consume it.
The purpose of this work is twofold. First, we examine how specific vulnerability properties of a public transit network can be evaluated using graph mining² algorithms. Second, we examine the possibility of developing a pipeline that aggregates GTFS data to fit a flow network model. The results show that it is possible, through the proposed analytical framework, to model and assess node-to-node vulnerabilities in a public transit network based on GTFS data. In addition, the results are visualized in context for Uppsala's network (UL) using a web-based tool.

² Graph mining is the set of tools and techniques used to (a) analyze the properties of real-world graphs, (b) predict how the structure and properties of a given graph may affect certain applications, and (c) develop models that can generate realistic graphs matching the patterns found in real-world graphs of interest.

Distribution of work

This work was created by Axel Boman and Erik Nilsson. All areas covered in this thesis have been researched, written, and reviewed in collaboration. Individual responsibilities were apportioned during the course of the project and were audited and aligned every week by both authors. Towards the end of the project, Axel focused on preparing the thesis writing while Erik developed the web-based tool for visualizing the results (Section 5.5). As both writers have worked with all parts of the thesis, the overall distribution of work is close to 50/50.

Abbreviations

GTFS - General Transit Feed Specification
PTN - Public Transit Networks
RISE - Research Institutes of Sweden
UL - Uppsala Lokaltrafik (Uppsala's transit network)
API - Application Programming Interface
HI-HS - High impact, high serviceability
HI-LS - High impact, low serviceability
LI-HS - Low impact, high serviceability
LI-LS - Low impact, low serviceability

Contents

1 Introduction
  1.1 Problem formulation
  1.2 Purpose
    1.2.1 Research questions
    1.2.2 Goals
  1.3 Thesis outline
2 Background
  2.1 Industry setting
    2.1.1 Coordination in a fragmented sector
    2.1.2 From an agency perspective
    2.1.3 Infrastructure for distribution of PT data
  2.2 The potential of modeling PTN as a graph
    2.2.1 Assessing vulnerability with topological features
    2.2.2 Assessing vulnerability with serviceability
3 Theory
  3.1 Key definitions in graph theory
  3.2 The concept of flow networks
  3.3 Modeling the real-world as flow networks
  3.4 Analytical framework
4 Methods
  4.1 Residual capacity model on an edge-level
  4.2 The data
    4.2.1 Data collection together with RISE
    4.2.2 Data preprocessing and cleaning
    4.2.3 Data modeling using NetworkX
  4.3 Vulnerability assessment
  4.4 Visualization using Mapbox
  4.5 General presumptions
  4.6 Data exploration
5 Results
  5.1 Capacity model in a PTN context
  5.2 Vulnerability assessment using the min-cut algorithm
  5.3 Fitting raw GTFS data in a flow network model
  5.4 In-context results from Uppsala's transit network (UL)
    5.4.1 HI-HS: High impact, high serviceability
    5.4.2 LI-HS: Low impact, high serviceability
    5.4.3 LI-LS: Low impact, low serviceability
    5.4.4 HI-LS: High impact, low serviceability
    5.4.5 HI-RLS: High impact, relatively low serviceability
  5.5 Interactive web application to visualize results
6 Discussion
  6.1 Beyond the complete disruption
  6.2 Comparison with previous report
  6.3 Vulnerability characteristics - not a distinct classification
    6.3.1 HI-LS: High impact, low serviceability
    6.3.2 HI-RLS: High impact, relatively low serviceability
7 Conclusion
  7.1 Future work
8 Bibliography

Chapter 1
Introduction

Efficient, flexible, and robust public transport (PT), especially along major commuting arteries, will be needed to reduce traffic congestion caused by urbanization [2].
To future-proof cities, governments must explore ways of enhancing PT so that it remains an appealing alternative to private transportation and meets the mobility needs of the people who depend on it [2]. In recent years, an increasing volume of data related to public transit networks (PTN) has been generated in real time [3]. This data follows the General Transit Feed Specification (GTFS¹) format [4] and is distributed openly by 1240 transit agencies in 672 locations worldwide. This broad adoption of GTFS has made it a de facto standard, so a product built on it is inherently scalable: it could potentially be deployed in PTNs all over the world [3].

¹ A format for real-time public transportation data and associated geographic information, allowing public transit agencies to publish their data and developers to write applications that consume it.

1.1 Problem formulation

In contrast to transit agencies' well-developed data generation capabilities [3], their utilization of that data is often overlooked. Although agencies' data utilization capabilities differ between countries and regions, qualitative legacy methods for evaluating PTNs are still used to a great extent [5][6]. Additionally, the GTFS data format was originally designed to be sufficient for trip planning functionality rather than for transit performance measures. As a consequence, additional data processing is required to expand the data's potential in this context [7].

In this study, we will tap into the potential of using GTFS data from an agency stakeholder perspective to assess transit performance. More specifically, we will outline a data-driven approach for quantifying service disruptions to assess network vulnerabilities in a flow network model². Lastly, our approach will be applied to a real-world context.
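The kind of additional processing referred to above can be pictured with a small sketch. The column names (`trip_id`, `stop_id`, `stop_sequence`) follow the GTFS `stop_times.txt` file, but the rows, the function name `edge_trip_counts`, and the choice of "number of trips per stop-to-stop link" as a stand-in for capacity are illustrative assumptions, not the thesis's own capacity model.

```python
import csv
import io
from collections import defaultdict

# Toy excerpt shaped like GTFS stop_times.txt (trip and stop ids are made up).
STOP_TIMES_CSV = """trip_id,arrival_time,departure_time,stop_id,stop_sequence
t1,08:00:00,08:00:00,A,1
t1,08:05:00,08:05:00,B,2
t1,08:12:00,08:12:00,C,3
t2,08:10:00,08:10:00,A,1
t2,08:15:00,08:15:00,B,2
t3,08:20:00,08:20:00,B,1
t3,08:27:00,08:27:00,C,2
"""


def edge_trip_counts(stop_times_text):
    """Count how many trips traverse each directed stop-to-stop edge.

    Assumption: the trip count is used as a crude proxy for edge capacity.
    """
    stops_by_trip = defaultdict(list)
    for row in csv.DictReader(io.StringIO(stop_times_text)):
        stops_by_trip[row["trip_id"]].append(
            (int(row["stop_sequence"]), row["stop_id"])
        )

    counts = defaultdict(int)
    for visits in stops_by_trip.values():
        visits.sort()  # order the stops of one trip by stop_sequence
        # Each pair of consecutive stops forms one directed edge.
        for (_, origin), (_, destination) in zip(visits, visits[1:]):
            counts[(origin, destination)] += 1
    return dict(counts)


edge_counts = edge_trip_counts(STOP_TIMES_CSV)
print(edge_counts)  # {('A', 'B'): 2, ('B', 'C'): 2}
```

The resulting dictionary maps directed edges to weights, which is the shape of input a flow network algorithm expects.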
1.2 Purpose

The academic purpose of this study is (1) to explore how to assess specific properties of transit performance using flow network algorithms and (2) to develop a pipeline for processing GTFS data to fit a flow network model.
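To make the first purpose concrete, the sketch below runs a textbook maximum-flow computation (Edmonds-Karp) on a made-up four-stop network and reports the saturated edges that form a minimum cut. It is only an illustration under stated assumptions: the stops, capacities, and the function name `max_flow_min_cut` are hypothetical, and the code uses the Python standard library rather than any particular graph library.

```python
from collections import defaultdict, deque


def max_flow_min_cut(capacities, source, sink):
    """Edmonds-Karp: return (max flow value, sorted list of min-cut edges)."""
    if source == sink:
        raise ValueError("source and sink must differ")

    # Residual capacities, with a zero-capacity reverse edge for every edge.
    residual = defaultdict(lambda: defaultdict(int))
    for (u, v), cap in capacities.items():
        residual[u][v] += cap
        residual[v][u] += 0

    def bfs_parents():
        """Breadth-first search over edges with remaining capacity."""
        parents = {source: None}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v, cap in residual[u].items():
                if cap > 0 and v not in parents:
                    parents[v] = u
                    queue.append(v)
        return parents

    flow = 0
    while True:
        parents = bfs_parents()
        if sink not in parents:
            break  # no augmenting path left
        # Walk back from the sink to recover the augmenting path.
        path = []
        v = sink
        while parents[v] is not None:
            path.append((parents[v], v))
            v = parents[v]
        bottleneck = min(residual[u][w] for u, w in path)
        for u, w in path:
            residual[u][w] -= bottleneck
            residual[w][u] += bottleneck
        flow += bottleneck

    # Nodes still reachable from the source define the source side of the cut.
    reachable = set(parents)
    cut = sorted(
        (u, v) for (u, v) in capacities if u in reachable and v not in reachable
    )
    return flow, cut


# Hypothetical capacities, e.g. trips per hour on each directed link.
toy_capacities = {("A", "B"): 2, ("B", "C"): 2, ("A", "D"): 3, ("D", "C"): 1}
flow_value, cut_edges = max_flow_min_cut(toy_capacities, "A", "C")
print(flow_value, cut_edges)  # 3 [('A', 'B'), ('D', 'C')]
```

In this toy network the cut edges are the links that limit how much service can pass from stop A to stop C, so they are natural candidates when asking where a disruption would hurt most.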
