Master Thesis
Total Page:16
File Type:pdf, Size:1020Kb
Faculty of Environmental Sciences Institute of Cartography MASTER THESIS Historical Spatio-temporal data in current GIS Case Study: German-Herero war of resistance 1904 submitted by Susanna Ambondo Abraham 13 July 1991, Oshikuku Matriculation Number: 4576256 International Master of Science in Cartography Technical University of Dresden Supervisors: First Supervisor: Prof. Lars Bernard Second Supervisor: Dr.-Ing. Christian Murphy Submission date: 09 October 2017 Statement of Authorship I hereby declare that I am the sole author of this master thesis and information directly or indirectly taken from other sources have been noted as such. I further declare that I have not submitted this thesis nor any partial work of this thesis to an examination committee. Dresden, 09 October 2017 _____________________ ii Acknowledgements First of all, I would like to thank the Lord for his provision, strength and comfort on this journey. I could have not done it without his grace and mercy upon me. This has been the most challenging and emotional episode of my life. I would like to take this time and express my appreciation to all that helped me make my studies in Germany a worthwhile. I would like to express my sincere gratitude to Prof. Dr. Lars Bernard for agreeing to supervise me and for his guidance on this thesis. I would also like to thank Dr-Ing Christian Murphy my second supervisor and Prof Dirk Burghardt for serving on my thesis committee and for their kind help and time. A special appreciation goes to Mrs Juliane Cron, for her kind assistance from the very first day on this master’s course, for motivating me to not give up when I felt like leaving. I am really grateful for all that you have done for me. Another special thanks goes to Mr Ankama Nadagho for his continuous support and for always believing in me. Equally, I would like to thank my dear friend Donald Akogou for being there to review my work and for always listening to my issues. My dear friend Maja Kalinic, you have not only been a classmate but also a sister to me. I am grateful to have met you and for making my life in Germany less stressful. Finally, my dear mama and sister for being my motivation to take on this amazing challenge. I thank you for your support and love. iii Abstract The growing developments in Geographic Information Retrieval (GIR) and Temporal Information Retrieval (TIR) techniques have given new ways to explore digital text archives for spatio-temporal data. Often, we are faced with questions regarding past events and the answers are hidden in the historical text archives. The question is how to retrieve the answers from the text documents. This thesis aims to contribute to the better understanding of spatio- temporal information extraction from text documents. Natural Language Processing techniques were used to develop an information extraction framework using GATE development framework. The developed framework uses gazetteer matching and pattern based rules to recognize and annotate elements in the documents. The goal is to use the extracted spatio- temporal data to study the time-geography context of the German-Herero war of resistance by using GIS. However, there are shortcomings regarding spatio-temporal functionalities in GIS applications. Space-time cube analysis, trajectory analysis and spatio-temporal clustering tools were used to establish space-time patterns in historical events of this phenomenon. While doing so, we assess the limitations of ArcGIS functionalities in handling time. The results shows that ArcGIS have the capabilities of handling time information. Space-time patterns were identified and presented using 2D and 3D visualization techniques. However, more functionalities are still needed in order to efficiently analyse spatio-temporal information, especially in regards to discrete historical temporal patterns. There is still a lack of spatio-temporal query functions. Additionally, more visualization methods are still needed that explicitly accentuate space-time changes without time animations. Keywords: Spatio-temporal data, Natural Language Processing, Information Extraction, GIS iv Table of Contents List of figures .......................................................................................................................... viii List of tables ............................................................................................................................... x List of abbreviations and symbols ............................................................................................. xi Chapter 1: Introduction .......................................................................................................... 1 1.1 Motivation .................................................................................................................... 1 1.2 Problem definition ....................................................................................................... 2 1.3 Case study Background ............................................................................................... 2 1.4 Research Identification ................................................................................................ 3 1.4.1 Research questions ............................................................................................... 4 1.4.2 Intended innovations ............................................................................................ 4 1.5 Material and methodologies ......................................................................................... 4 1.5.1 Methodologies: ..................................................................................................... 4 1.5.2 Materials: .............................................................................................................. 5 Chapter 2: Literature review................................................................................................... 6 2.1 Introduction .................................................................................................................. 6 2.2 Methods for extracting spatio-temporal data ............................................................... 6 2.3 Spatio-temporal data models ....................................................................................... 8 2.3.1 Snapshot data model ............................................................................................. 8 2.3.2 Event-based spatio-temporal data model (ESTDM) ............................................ 8 2.3.3 Simple time-stamp model ..................................................................................... 8 2.4 Spatio-temporal databases ........................................................................................... 9 2.5 Visualizing and analysing spatio-temporal data ........................................................ 11 2.5.2 Cartographic depict modes ................................................................................. 12 2.5.3 Space-time visualization models ........................................................................ 13 Chapter 3: Extraction of Spatio-temporal data from historical text documents ................... 16 3.1 Introduction ................................................................................................................ 16 v 3.2 Document pre-processing .......................................................................................... 18 3.3 Creation of gazetteers ................................................................................................ 20 3.3.1 Spatial Gazetteer creation ................................................................................... 21 3.3.2 Temporal gazetteer creation ............................................................................... 22 3.4 Contextual Information Extraction ............................................................................ 23 3.4.1 Named Entity Extraction pipeline ...................................................................... 24 3.4.2 Spatiotemporal relationship extraction pipeline ................................................. 28 3.5 Trajectory and location events extraction .................................................................. 31 3.5.1 Geocoding ........................................................................................................... 35 Chapter 4: Historical Spatio-temporal data in ArcGIS ........................................................ 36 4.1 Introduction ................................................................................................................ 36 4.2 Spatio-temporal data .................................................................................................. 36 4.2.1 Spatio-temporal data type ................................................................................... 36 4.2.2 Temporal GIS patterns........................................................................................ 38 4.2.3 Requirement for modelling, analysing and visualizing historical data .............. 40 4.3 Components of a temporal GIS ................................................................................. 40 4.4 Modelling and presenting time in ArcGIS ................................................................. 41 4.4.1 Modelling historical events as point in time ....................................................... 42 4.4.2 Modelling troop trajectory as duration in time ..................................................