Master of Arts

Total Page:16

File Type:pdf, Size:1020Kb

Master of Arts Discovering the Space-Time Dimensions of Schedule Padding and Delay from GTFS and Real-Time Transit Data Thesis submitted in May of 2015 to the Graduate School of the University of Cincinnati In partial fulfillment of the requirements for the degree of Master of Arts In the Department of Geography of the College of Arts and Sciences by Nathan S. Wessel Bachelor of Urban Planning, University of Cincinnati Committee chaired by Dr. Michael Widener Abstract Schedule padding is the extra time added to transit schedules due to expected random variability in travel times throughout a route. To this point, methods for applying padding to certain route segments and times have been relatively unsophisticated, largely reacting to observed changes in travel time variability relative to the existing schedule. By comparing schedule data and real-time vehicle locations, we aim to locate the segments of routes that are most affected by this random variability, and thus have the most padding. These segments could most benefit from targeted delay reduction techniques, such as signal prioritization or multi-door boarding. We also outline cartographic methods that could be used to depict such results to lay people and policy-makers. Our approach is relevant to any city with both General Transit Feed Specification (GTFS) data and a real-time vehicle location feed, though we take a single large city as our case study. For this research, we focus on Toronto, Ontario, and the Toronto Transit Commission. We use real-time transit vehicle locations, obtained from a publicly available API, to establish what we take to be a reasonable maximum speed for each segment of a route, or any set of routes. We then compare this ideal performance to the scheduled performance, derived from GTFS data, where the difference between the two can be interpreted as the amount of schedule padding on a segment at a particular time. Since schedule padding is a response to stochastic delay, this technique should lead us to the spatio-temporal locations of the most significant sources of delay and guide attempts to reduce delay and maintain acceptable on-time performance. Results suggest that choke-points are observed to crop up in expected places, like downtown at rush-hour, or near major signalized intersections. What is interesting is that we are able to begin to quantify delays in one part of a transit system, and compare them to other delays anywhere else in the same system. This should help to guide infrastructure investments that can minimize the impacts of random delay and to justify such expenses with explicit potential time or money savings. Transit delay has been discussed extensively in the literature and in the press, but this delay is relative to a schedule which can be fairly arbitrary. This thesis is novel in its emphasis on schedule padding as a signal of avoidable delay below the level of the scheduled expectation. II III Table of Contents List of Figures, Maps, Tables and Videos............................................................................................V 1 Introduction........................................................................................................................................1 2 Literature Review..............................................................................................................................5 2.1 Data-derived performance measures for industry......................................................................5 2.2. The great opening of transit data and the real-time revolution.................................................6 2.3 Performance measures outside of industry................................................................................9 2.4 What remains to be considered................................................................................................11 3 Outline of the Project.......................................................................................................................12 4 Passengers as Sources of Delay.......................................................................................................15 4.1 Stopping...................................................................................................................................15 4.2 Internal congestion...................................................................................................................16 5 Data..................................................................................................................................................18 5.1 General Transit Feed Specification..........................................................................................18 5.1.1 Simplification, Segmentation, and Summation....................................................................18 5.2 Real-time Vehicle Locations....................................................................................................21 5.2.1 Handling Errors.....................................................................................................................22 5.2.2 Distinguishing Tracks...........................................................................................................23 6 Taking Measures..............................................................................................................................25 6.1 Establishing reasonable minimum speeds on edges................................................................27 6.2 Calculating schedule padding..................................................................................................28 7 Cartographic Depiction....................................................................................................................31 7.1 Maps.........................................................................................................................................31 7.2 Animated Maps........................................................................................................................57 8 Interpretation and Discussion..........................................................................................................61 8.1 Do the results match basic expectations?.................................................................................61 8.2 Some exemplary segments.......................................................................................................63 9. Concluding remarks........................................................................................................................67 9.1 Lessons learnt and directions for future research.....................................................................67 9.2 In Summary..............................................................................................................................70 References:.........................................................................................................................................72 Appendix 1: Code...............................................................................................................................76 Appendix 2: Example padding calculation.........................................................................................84 IV List of Figures, Maps, Tables and Videos Figures 1. State-space of schedule padding and random delay. page 3. 2. A metaphor for schedule padding. page 12. 3. Map showing redundancy of stops in TTC GTFS data. page 19. 4. Map showing intended effect of stop clustering technique. page 20. 5. Map showing unintended effect of stop clustering technique. page 20. 6. Map showing sample polylines generated from NextBus API vehicle locations feed. page 22. 7. Map showing location reporting error in Downtown Toronto. page 23. 8. Taking measurements: matching vehicle tracks to scheduled edges. page 25. 9. Taking measurements: intersections of vehicle tracks with stop cluster geometries. page 26. 10. Taking measurements: points on line intersections from which the measurements are finally taken. page 26. 11. Cumulative distribution of service hours at observed speeds across all segments. page 28. 12. Scheduled temporal distribution of service hours and the proportion of them which appears to be padding. page 30. 13. Satellite image of East Eglinton. page 66. Maps 1A & 1B: Observed mean speeds on network edges. pages 33, 35. 2A & 2B: Scheduled mean speeds on network edges. pages 37, 39. 3A & 3B: Observed fastest speeds (9th decile observations). pages 41, 43. 4A & 4B: Speed of fastest scheduled trips. pages 45, 47. 5A & 5B: Padding per trip per kilometer given 9th decile observations as maximum speed. pages 49, 51. V 6A & 6B: Padding per trip per kilometer given fastest scheduled speeds as maximum speed. pages 53, 55. 7: Padding on Eglinton Avenue East between Laird Drive and Yong Street. page 64. 8: Padding on Sheppard Avenue West from Allen Road to Keele Street. page 64. 9: Padding on Broadview Avenue and Pape Avenue. page 65. Tables 1. Variables returned by the NextBus API's vehicleLocations command. page 21. 2. Correlation between various values on network edges. page 62. Videos 1. Map 5A animated with 15 minute time steps. page 58. 2. Map 6A animated with 15 minute time steps. page 59. VI 1 Introduction Public transportation is a complex phenomenon, fundamental to the growth and maintenance of major cities in the industrial and postindustrial parts of the world. A great deal has been written about public transit in recent
Recommended publications
  • On the Accuracy of Schedule-Based GTFS for Measuring Accessibility
    On the Accuracy of Schedule-Based GTFS for Measuring Accessibility Nate Wessel1 and Steven Farber2 1 Department of Geography and Planning, University of Toronto 2 Department of Human Geography, University of Toronto Scarborough April 9, 2019 Abstract In this paper we assess the accuracy with which General Transit Feed Specification (GTFS) schedule data can be used to measure accessibility by public transit as it varies over space and time. We use archived Automatic Vehicle Location (AVL) data from four North American transit agencies to produce a detailed reconstruction of actual transit vehicle move- ments over the course of five days in a format that allows for travel time estimation directly comparable to schedule-based GTFS. With travel times estimated on both schedule-based and retrospective networks, we compute and compare a variety of accessibility measures. We find that origin-based accessibility even when averaged over one-hour periods can vary widely between locations. Origins with lower scheduled access tend to produce less reliable estimates with more variability from hour to hour in real accessibility, while higher access zones seem to converge on an estimate 5-15% lower than the schedule predicts. Such over- and under-predictions exhibit strong spatial patterns which should be of concern to those using accessibility metrics in statistical models. Momentary measures of accessibility are briefly discussed and found to be weakly related to momentary changes in real access. These findings bring into question the validity of some recent applications of GTFS data and point the way toward more robust methods for calculating accessibility. 1 Introduction Over the last decade, the General Transit Feed Specification (GTFS) has been established as a standard format for exchanging information on scheduled transit operations.
    [Show full text]
  • On the Accuracy of Schedule-Based GTFS for Measuring Accessibility
    On the Accuracy of Schedule-Based GTFS for Measuring Accessibility Nate Wessel1 and Steven Farber2 1 Department of Geography and Planning, University of Toronto 2 Department of Human Geography, University of Toronto Scarborough Abstract In this paper we assess the accuracy with which General Transit Feed Specification (GTFS) schedule data can be used to measure accessibility by public transit as it varies over space and time. We use archived Automatic Vehicle Location (AVL) data from four North American transit agencies to produce a detailed reconstruction of actual transit vehicle move- ments over the course of five days in a format that allows for travel time estimation directly comparable to schedule-based GTFS. With travel times estimated on both schedule-based and retrospective networks, we compute and compare a variety of accessibility measures. We find that origin-based accessibility even when averaged over one-hour periods can vary widely between locations. Origins with lower scheduled access tend to produce less reliable estimates with more variability from hour to hour in real accessibility, while higher access zones seem to converge on an estimate 5-15% lower than the schedule predicts. Such over- and under-predictions exhibit strong spatial patterns which should be of concern to those using accessibility metrics in statistical models. Momentary measures of accessibility are briefly discussed and found to be weakly related to momentary changes in real access. These findings bring into question the validity of some recent applications of GTFS data and point the way toward more robust methods for calculating accessibility. 1 1 Introduction Over the last decade, the General Transit Feed Specification (GTFS) has been established as a standard format for exchanging information on scheduled transit operations.
    [Show full text]
  • Training Manual
    / TRANSIT SERVICE PLAN NING AND SCHEDULING Training Manual Lehman Center for Transportation Research Florida International University 10555 West Flagler Street, EC 3609 Miami, FL 33174 Tel: 305-348-3144 | Fax: 305-348-2802 Email: [email protected] in association with National Center for Transit Research University of South Florida 4202 E. Fowler Ave., CUT100 Tampa, FL 33620 Tel: 813-974-3120 | Fax: 305-974-5168 Email: [email protected] i Training Manual for Transit Service Planning and Scheduling Copyright © 2015 by Lehman Center for Transportation Research Lehman Center for Transportation Research Florida International University 10555 West Flagler Street, EC 3609 Miami, FL 33174 All rights reserved. No part of this manual may be photocopied or reproduced in any form without written permission from the publisher. Moreover, no part of this publication can be stored in a retrieval system, transmitted by any means, or recorded or otherwise, without written permission from the author. Limits of Liability and Disclaimer of Warranty While every precaution has been taken in preparing this manual, including research, development, and testing, the publisher and author assume no responsibility for errors or omissions. No liability is assumed by either publisher or author for damages resulting in the use of this information. Printed in the United States of America ii Foreword The manual is intended for use by new transit staff, as well as seasoned professionals who want to review key concepts and best practices in the transit industry. The manual consists of two sections: Transit Planning and Transit Scheduling. It covers material for performing essential transit tasks.
    [Show full text]
  • Nurail Project Nurail2012-MIT-RO1
    NURail Project NURail2012-MIT-RO1 The final report for NURail project: NURail2012-MIT-R01 consists of three distinct documents. The first 233 pages are titled “The Impact of Amtrack Performance in the Northeast Corridor” By Tolulope A. Ogunbekun. The second 174 pages are a report titled “Capacity Challenge on the California High-Speed Rail Shared Corridors: How Local Decisions Have Statewide Impacts” By Samuel J. Levy and the last 206 pages consists of “Analysis of Capacity Pricing and Allocation Mechanisms in Shared Railway Systems” by Maite (Maria Teresa) Pena-Alcaraz. These were completed under grant number: DTRT12-G-UTC18. NURail Project ID: NURail2012-MIT-R01 High-Speed Rail as a Complex Sociotechnical System The Impact of Amtrack Performance in the Northeast Corridor By Tolulope A. Ogunbekun Supervised by Professor Joseph M. Sussman 10/1/2015 Grant Number: DTRT12-G-UTC18 1 2 DISCLAIMER The contents of this report reflect the views of the authors, who are responsible for the facts and the accuracy of the information presented herein. This document is disseminated under the sponsorship of the U.S. Department of Transportation’s University Transportation Centers Program, in the interest of information exchange. The U.S. Government assumes no liability for the contents or use thereof. 3 TECHNICAL SUMMARY Title The Impact of Amtrak Performance in the Northeast Corridor Author: Tolulope A. Ogunbekun Introduction The problem addressed in this report: develop an understanding of how Amtrak performs in the Northeast Corridor (NEC) of the US. The primary goal of this research is to study the impact of Amtrak’s performance in the Northeast Corridor.
    [Show full text]
  • Calgary-Bow Valley Mass Transit Feasibility Study Client Ref: RFP 1-500-5330-5320
    REPORT Calgary -Bow Valley Mass Transit Feasibility Study (Client Reference: RFP 1-500-5330-5320) Final Report Prepared for: The Town of Banff Prepared by: CPCS In association with sub-contractors: Dillon Consulting Ltd. Dominion Railway Services Ltd. Iron Moustache CPCS Ref: 17191 November 5, 2018 REPORT | Calgary-Bow Valley Mass Transit Feasibility Study Client Ref: RFP 1-500-5330-5320 Table of Contents Executive Summary ........................................................................................................................ i Acronyms / Abbreviations ............................................................................................................. ix 1 Introduction ............................................................................................................................... 1 Background ................................................................................................................................ 1 Project Objectives ..................................................................................................................... 2 Project Structure ....................................................................................................................... 2 Methodology ............................................................................................................................. 2 Limitations ................................................................................................................................. 3 Outline of this Report
    [Show full text]
  • A Concept for Flexible Operations and Optimized Traffic Into Metroplex Regions
    https://ntrs.nasa.gov/search.jsp?R=20120000031 2019-08-30T18:39:29+00:00Z NASA/CR—2011-217302 A Concept for Flexible Operations and Optimized Traffic into Metroplex Regions Daniel DeLaurentis, Steve Landry, and Dengfeng Sun Purdue University, West Lafayette, Indiana Fred Wieland and Ankit Tyagi Intelligent Automation, Inc., Rockville, Maryland 'HFember 2011 NASA STI Program . in Profile Since its founding, NASA has been dedicated x CONFERENCE PUBLICATION. Collected to the advancement of aeronautics and space papers from scientific and technical science. The NASA scientific and technical conferences, symposia, seminars, or other information (STI) program plays a key part in meetings sponsored or co-sponsored by helping NASA maintain this important role. NASA. The NASA STI program operates under the x SPECIAL PUBLICATION. Scientific, auspices of the Agency Chief Information Officer. technical, or historical information from It collects, organizes, provides for archiving, and NASA programs, projects, and missions, disseminates NASA’s STI. The NASA STI often concerned with subjects having program provides access to the NASA Aeronautics substantial public interest. and Space Database and its public interface, the NASA Technical Report Server, thus providing x TECHNICAL TRANSLATION. English- one of the largest collections of aeronautical and language translations of foreign scientific space science STI in the world. Results are and technical material pertinent to NASA’s published in both non-NASA channels and by mission. NASA in the NASA STI Report Series, which includes the following report types: Specialized services also include creating custom thesauri, building customized databases, x TECHNICAL PUBLICATION. Reports of and organizing and publishing research results.
    [Show full text]