A Protocol and Prototype Application to Find Maps

Total Page:16

File Type:pdf, Size:1020Kb

A Protocol and Prototype Application to Find Maps MAPSEARCH: A PROTOCOL AND PROTOTYPE APPLICATION TO FIND MAPS by JUDITH GELERNTER A Dissertation submitted to the Graduate School-New Brunswick Rutgers, The State University of New Jersey In partial fulfillment of the requirements for the degree of Doctor of Philosophy Graduate Program in Communication, Information and Library Studies Written under the direction of Professor Michael E. Lesk and approved by ________________________ ________________________ ________________________ ________________________ New Brunswick, New Jersey May 2008 © 2008 Judith Gelernter ALL RIGHTS RESERVED ABSTRACT OF THE DISSERTATION MAPSEARCH: A Protocol and Prototype Application to Find Maps By Judith Gelernter Dissertation Director: Professor Michael Lesk Even geographers need ways to find what they need among the thousands of maps buried in map libraries and in journal articles. It is not enough to provide search by region and keyword. Studies of queries show that people often want to look for maps showing a certain location at a certain time period or with a subject theme. The difficulties in finding such maps are several. Maps in physical and digital collections often are organized by region. Multi-dimensional manual indexing is time-consuming and so many maps are not indexed. Further, maps in non-geographical publications are indexed rarely, making them essentially invisible. In an attempt to solve actual problems, this dissertation research automatically indexes maps in published documents so that they become visible to searchers. The MapSearch prototype aggregates journal components to allow finer-grained searching of article content. MapSearch allows search by region, time, or theme as well as by keyword (http://scilsresx.rutgers.edu/~gelern/maps/). Automatic classification of maps is a multi-step process. A sample of 150 maps and the text (that becomes metadata) describing the maps have been copied from a random assortment of journal articles. Experience taking metadata manually enabled the writing of instructions to mine data automatically; experience with manual classification allowed for writing algorithms that classify maps by region, time and theme automatically. That classification is supported by ontologies for region, time and theme that have been ii generated or adapted for the purpose and that allow what has been called intelligent search, or smart search. The 150 map training set was loaded into the MapSearch engine repeatedly, each time comparing automatically-assigned classification to manually- assigned classification. Analysis of computer misclassifications suggested whether the ontology or classification algorithm should be modified in order to improve classification accuracy. After repeated trials and analyses to improve the algorithms and ontologies, MapSearch was evaluated with a set of 55 previously unseen maps in a test set. Automated classification of the test set of maps was compared to the manual classification, with the assumption that the manual process provides the most accurate classification obtainable. Results showed an accuracy, or a correspondence between manual and automated classification, of 75% for region, 69% for time, and 84% for theme. The dissertation contributes: (1) a protocol to harvest metadata from maps in published articles that could be adapted to aggregate other sorts of journal article components such as charts, diagrams, cartoons or photographs, (2) a method for ontology-supported metadata processing to allow for improved result relevance that could be applied to other sorts of data, (3) algorithms to classify maps into region, time and theme facets that could be adapted to classify other document types, and (4) a proof-of-concept MapSearch system that could be expanded with heterogeneous map types. iii Acknowledgement I am thankful for my chief advisor Michael Lesk’s wisdom and kindness that have guided me throughout this research. His exceptional knowledge of data mining and word sense disambiguation allowed me to delve deeply into the heart of the problem. His translation of my algorithms into the web programming language Perl breathed life into MapSearch and created the prototype. I will miss our work on this project, but look forward to research together in future. Thanks are due to committee members in my department Dan O’Connor and Marie Radford for advice on the rigors of form and style, and for being available when I have been in need. Thanks also to Michael Goodchild from the University of California at Santa Barbara for sharing some of his latest research and offering advice on this dissertation’s geographic aspects. I thank my colleagues in the graduate school Heather Moulaison and Sarah Legins for the careful attention they gave to indexing the map sample, which was used for the evaluation of the classification algorithms. Nancy Kandoian of the Map Division of the New York Public Library, and Cathay Crosby of the Internet Public Library, were most helpful in providing data on how people ask for maps. This research was supported in part by the Rutgers Academic Excellence Fund. iv Table of Contents 1 Introduction ……………………………………………………………. 1 1.1 Problem statement 1.2 Significance 1.3 Research questions and small scale answer to the problem 1.4 Contributions and long term goals 2 Background for the research ………………………………………… 5 2.1 What is a map? 2.2 How can one find a digital map? 2.3 Why finding a digital map can be a problem: spatial metadata 2.4 Theme maps and those who use them 2.5 MapSearch in response to present problems 2.6 Approach 3 Related research ……………………………………………………… 12 3.1 Information retrieval 3.2 Data mining and automated classification 3.3 Lexical search using an ontology and weighted indexing 3.4 Determining relevance: semantic similarity 3.5 Indexing attributes for maps 3.6 Systems to retrieve maps 4 User-defined facets …………………………………………………… 22 4.1 How do people ask for maps? 4.1.1 New York Public Library Map Division Method Data Findings 4.1.2 Actual questions from the Internet Public Library Method Data Findings 4.2 Interpretation of both studies 5 Overview of automatic cataloging and classification of maps …..... 28 5.1 System architecture 5.2 Harvesting metadata 5.2.1 Collecting the sample 5.2.2 Creating and storing 5.2.3 Data cleaning 5.2.4 Weighting and indexing 5.2.5 Information extraction 5.3 Metadata classification 5.3.1 Classification rules v 5.3.2 Classification categories 5.3.3 An ontology for each facet 5.4 Query processing: assessment and improvement 5.4.1 Algorithms 5.4.2 How to use an ontology to classify metadata or query 5.4.3 How to rank results 5.4.4 How good is a given classification? Scoring 5.5 Improving classification algorithms 6 Classification architecture …………………………………………… 42 6.1 Method 6.2 Region 6.2.1 Classes and their application 6.2.2 Geo-ontology 6.2.3 Algorithm for region 6.2.4 Findings (research question 1) 6.3 Time 6.3.1 Classes and their application 6.3.2 Chrono-ontology 6.3.3 Algorithm for time 6.3.4 Findings (research question 2) 6.4 Theme 6.4.1 Classes and their application 6.4.2 Concept ontology 6.4.3 Algorithm for theme 6.4.4 Findings (research question 3) 6.5 Summary of results 6.6 Discussion 7 Interface design ……………………………………………………… 70 7.1 Method 7.2 Keyword search 7.3 Faceted category search 7.4 Results display 7.5 Messages to the user 7.6 Further testing 8 Evaluation ……………………………………………………………. 77 8.1 Potential approaches for evaluation 8.2 How well do the algorithms work 8.2.1 Method for manual classification 8.2.2 Assembling the results of manual classifications 8.2.3 Evaluation of automatic classification 8.3 Proposed experiment for data mining 8.4 Proposed experiment for classification categories 8.5 Proposed experiment for user satisfaction vi 8.6 Cost effectiveness 8.7 Importance of testing 9 Limitations ………………………..…………………………………… 86 9.1 Collection, current and expanded 9.2 Categories 9.3 Classifying maps 9.4 Ontologies 9.5 Interface design 9.6 MapSearch is a question-answering tool—but what are the questions? 10 Contribution: what is new and why it matters ………………………. 90 10.1 Hybrid method for classification 10.2 Subject ontology 10.3 Algorithms 10.4 MapSearch prototype expandable 10.5 Toward a simpler spatial metadata scheme 11 Future Research ………………………………………………………… 93 11.1 Expanding the search protocol in order to expand the collection 11.2 Adding features for users 11.3 MapSearch as a model vii List of tables Table 1 Findings for region ………………………………………………….. 50 Table 2 Findings for time ……………………………………………………. 58 Table 3 Findings for theme …………………………………………………... 64 Table 4 Comparison of automatic classifications in the training set by facet .. 66 Table 5 Time and theme assignments for the training set ………………….... 68 Table 6 Comparison of automatic classifications in the testing set by facet .... 80 List of illustrations Figure 1 The architecture of the proposed map retrieval system ……………… 28 Figure 2a Map as it appears in the article; Center and Right: Demonstration of the text layer extraction program ………………………………… 30 Figure 2b Words from the article that can be taken using the data mining algorithm outlined in Appendix B…………………………………… 31 Figure 3 Example of weighting: Sample output of classification by theme ….. 38 Figure 4 Library of Congress Classification excerpt ………..………………... 61 Figure 5 MapSearch interface ………………………………………………… 75 Figure 6 MapSearch results display …………………………………………… 76 viii Concluding Material Appendix A. Glossary of terms in the dissertation ……………………………. 97 Appendix B. Instructions to mine data from around maps…………………….. 100 Appendix C. Ontology building with Geonames .…………………………….. 102 Appendix D. Materials for evaluation ………………………………………… 103 D.1 Instructions to indexers D.2 Regions expanded D.3 Regions mapped D.4 Themes expanded References ……………………………………………………………………. 117 Curriculum vitae (abbreviated) …………………………………………….... 126 ix 1 1 Introduction 1.1 Problem statement How do we find maps in non-geographic publications? In digital publications, often we do not—maps are invisible because they are not well-indexed.
Recommended publications
  • Geo-Text Data and Data-Driven Geospatial Semantics
    Geo-Text Data and Data-Driven Geospatial Semantics Yingjie Hu GSDA Lab, Department of Geography, University of Tennessee, Knoxville, TN, 37996, USA Abstract Many datasets nowadays contain links between geographic locations and natural language texts. These links can be geotags, such as geotagged tweets or geotagged Wikipedia pages, in which location coordinates are explicitly attached to texts. These links can also be place mentions, such as those in news articles, travel blogs, or historical archives, in which texts are implicitly connected to the mentioned places. This kind of data is referred to as geo- text data. The availability of large amounts of geo-text data brings both challenges and opportunities. On the one hand, it is challenging to automatically process this kind of data due to the unstructured texts and the complex spatial footprints of some places. On the other hand, geo-text data offers unique research opportunities through the rich information contained in texts and the special links between texts and geography. As a result, geo-text data facilitates various studies especially those in data-driven geospatial semantics. This paper discusses geo-text data and related concepts. With a focus on data-driven research, this paper systematically reviews a large number of studies that have discovered multiple types of knowledge from geo-text data. Based on the literature review, a generalized workflow is extracted and key challenges for future work are discussed. Keywords: geo-text data, spatial analysis, natural language processing, spatial and textual data analysis, data-driven geospatial semantics, spatial data science. 1. Introduction Recent years have witnessed an unprecedented increase in the volume, variety, and veloc- ity of data from different sources (Miller and Goodchild, 2015).
    [Show full text]
  • Web GIS in Practice VI: a Demo Playlist of Geo-Mashups for Public Health Neogeographers Maged N Kamel Boulos*1, Matthew Scotch2, Kei-Hoi Cheung2,3 and David Burden4
    International Journal of Health Geographics BioMed Central Editorial Open Access Web GIS in practice VI: a demo playlist of geo-mashups for public health neogeographers Maged N Kamel Boulos*1, Matthew Scotch2, Kei-Hoi Cheung2,3 and David Burden4 Address: 1Faculty of Health and Social Work, University of Plymouth, Drake Circus, Plymouth, Devon, PL4 8AA, UK, 2Center for Medical Informatics, School of Medicine, Yale University, New Haven, CT, USA, 3Departments of Anesthesiology and Genetics, School of Medicine, and Department of Computer Science, Yale University, New Haven, CT, USA and 4Daden Limited, 103 Oxford Rd, Moseley, Birmingham, B13 9SG, UK Email: Maged N Kamel Boulos* - [email protected]; Matthew Scotch - [email protected]; Kei- Hoi Cheung - [email protected]; David Burden - [email protected] * Corresponding author Published: 18 July 2008 Received: 6 July 2008 Accepted: 18 July 2008 International Journal of Health Geographics 2008, 7:38 doi:10.1186/1476-072X-7-38 This article is available from: http://www.ij-healthgeographics.com/content/7/1/38 © 2008 Boulos et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract 'Mashup' was originally used to describe the mixing together of musical tracks to create a new piece of music. The term now refers to Web sites or services that weave data from different sources into a new data source or service.
    [Show full text]
  • Crowdsourcing, Citizen Science Or Volunteered Geographic Information? the Current State of Crowdsourced Geographic Information
    International Journal of Geo-Information Article Crowdsourcing, Citizen Science or Volunteered Geographic Information? The Current State of Crowdsourced Geographic Information Linda See 1,*, Peter Mooney 2, Giles Foody 3, Lucy Bastin 4, Alexis Comber 5, Jacinto Estima 6, Steffen Fritz 1, Norman Kerle 7, Bin Jiang 8, Mari Laakso 9, Hai-Ying Liu 10, Grega Milˇcinski 11, Matej Nikšiˇc 12, Marco Painho 6, Andrea P˝odör 13, Ana-Maria Olteanu-Raimond 14 and Martin Rutzinger 15 1 International Institute for Applied Systems Analysis (IIASA), Schlossplatz 1, Laxenburg A2361, Austria; [email protected] 2 Department of Computer Science, Maynooth University, Maynooth W23 F2H6, Ireland; [email protected] 3 School of Geography, University of Nottingham, Nottingham NG7 2RD, UK; [email protected] 4 School of Engineering and Applied Science, Aston University, Birmingham B4 7ET, UK; [email protected] 5 School of Geography, University of Leeds, Leeds LS2 9JT, UK; [email protected] 6 NOVA IMS, Universidade Nova de Lisboa (UNL), 1070-312 Lisboa, Portugal; [email protected] (J.E.); [email protected] (M.P.) 7 Department of Earth Systems Analysis, ITC/University of Twente, Enschede 7500 AE, The Netherlands; [email protected] 8 Faculty of Engineering and Sustainable Development, Division of GIScience, University of Gävle, Gävle 80176, Sweden; [email protected] 9 Finnish Geospatial Research Institute, Kirkkonummi 02430, Finland; mari.laakso@nls.fi 10 Norwegian Institute for Air Research (NILU), Kjeller 2027, Norway; [email protected]
    [Show full text]
  • Using Wordpress to Manage Geocontent and Promote Regional Food Products
    Farm2.0: Using Wordpress to Manage Geocontent and Promote Regional Food Products Amenity Applewhite Farm2.0: Using Wordpress to Manage Geocontent and Promote Regional Food Products Dissertation supervised by Ricardo Quirós PhD Dept. Lenguajes y Sistemas Informaticos Universitat Jaume I, Castellón, Spain Co-supervised by Werner Kuhn, PhD Institute for Geoinformatics Westfälische Wilhelms-Universität, Münster, Germany Miguel Neto, PhD Instituto Superior de Estatística e Gestão da Informação Universidade Nova de Lisboa, Lisbon, Portugal March 2009 Farm2.0: Using Wordpress to Manage Geocontent and Promote Regional Food Products Abstract Recent innovations in geospatial technology have dramatically increased the utility and ubiquity of cartographic interfaces and spatially-referenced content on the web. Capitalizing on these developments, the Farm2.0 system demonstrates an approach to manage user-generated geocontent pertaining to European protected designation of origin (PDO) food products. Wordpress, a popular open-source publishing platform, supplies the framework for a geographic content management system, or GeoCMS, to promote PDO products in the Spanish province of Valencia. The Wordpress platform is modified through a suite of plug-ins and customizations to create an extensible application that could be easily deployed in other regions and administrated cooperatively by distributed regulatory councils. Content, either regional recipes or map locations for vendors and farms, is available for syndication as a GeoRSS feed and aggregated with outside feeds in a dynamic web map. To Dad, Thanks for being 2TUF: MTLI 4 EVA. Acknowledgements Without encouragement from Dr. Emilio Camahort, I never would have had the confidence to ensure my thesis handled the topics I was most passionate about studying - sustainable agriculture and web mapping.
    [Show full text]
  • Preserving Geospatial Data
    Technology Watch Report Preserving Geospatial Data Guy McGarva EDINA, University of Edinburgh Steve Morris North Carolina State University (NCSU) Greg Janée University of California, Santa Barbara (UCSB) DPC Technology Watch Series Report 09-01 May 2009 © 2009 1 Executive Summary: Geospatial data are becoming an increasingly important component in decision making processes and planning efforts across a broad range of industries and information sectors. The amount and variety of data is rapidly increasing and, while much of this data is at risk of being lost or becoming unusable, there is a growing recognition of the importance of being able to access historical geospatial data, now and in the future, in order to be able to examine social, environmental and economic processes and changes that occur over time. The geospatial domain is characterized by a broad range of information types, including geographic information systems data, remote sensing imagery, three- dimensional representations and other location-based information. The scope of this report is limited to two-dimensional geospatial data and data that would typically be considered comparable to paper maps or charts including vector data, raster data and spatial databases. There are a number of significant preservation issues that relate specifically to geospatial data, including: the complexity and variety of data formats and structures; the abundance of content that exists in proprietary formats; the need to maintain the technical and social contexts in which the data exists; and the growing importance of web services and dynamic (and ephemeral) data. Standards for geospatial metadata have been defined at both the national and international levels, yet metadata often becomes dissociated from the data, or is incorrect, non-standard in nature, or not created in the first place.
    [Show full text]
  • Review of Web Mapping: Eras, Trends and Directions
    International Journal of Geo-Information Review Review of Web Mapping: Eras, Trends and Directions Bert Veenendaal 1,*, Maria Antonia Brovelli 2 ID and Songnian Li 3 ID 1 Department of Spatial Sciences, Curtin University, GPO Box U1987, Perth 6845, Australia 2 Department of Civil and Environmental Engineering (DICA), Politecnico di Milano, P.zza Leonardo da Vinci 32, 20133 Milan, Italy; [email protected] 3 Department of Civil Engineering, Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3, Canada; [email protected] * Correspondence: [email protected]; Tel.: +618-9266-7701 Received: 28 July 2017; Accepted: 16 October 2017; Published: 21 October 2017 Abstract: Web mapping and the use of geospatial information online have evolved rapidly over the past few decades. Almost everyone in the world uses mapping information, whether or not one realizes it. Almost every mobile phone now has location services and every event and object on the earth has a location. The use of this geospatial location data has expanded rapidly, thanks to the development of the Internet. Huge volumes of geospatial data are available and daily being captured online, and are used in web applications and maps for viewing, analysis, modeling and simulation. This paper reviews the developments of web mapping from the first static online map images to the current highly interactive, multi-sourced web mapping services that have been increasingly moved to cloud computing platforms. The whole environment of web mapping captures the integration and interaction between three components found online, namely, geospatial information, people and functionality. In this paper, the trends and interactions among these components are identified and reviewed in relation to the technology developments.
    [Show full text]
  • Volunteered and Crowdsourced Geographic Information: the Openstreetmap Project
    JOURNAL OF SPATIAL INFORMATION SCIENCE Number 20 (2020), pp. 65–70 doi:10.5311/JOSIS.2020.20.659 INVITED ARTICLE Volunteered and crowdsourced geographic information: the OpenStreetMap project Michela Bertolotto1, Gavin McArdle1, and Bianca Schoen-Phelan2 1School of Computer Science, University College Dublin 2School of Computer Science, Technological University Dublin Received: February 28, 2020; accepted: May 12, 2020 Abstract: Advancements in technology over the last two decades have changed how spa- tial data are created and used. In particular, in the last decade, volunteered geographic information (VGI), i.e., the crowdsourcing of geographic information, has revolutionized the spatial domain by shifting the map-making process from the hands of experts to those of any willing contributor. Started in 2004, OpenStreetMap (OSM) is the pinnacle of VGI due to the large number of volunteers involved and the volume of spatial data generated. While the original objective of OSM was to create a free map of the world, its uses have shown how the potential of such an initiative goes well beyond map-making: ranging from projects such as the Humanitarian OpenStreetMap (HOT) project, that understands itself as a bridge between the OSM community and humanitarian responders, to collaborative projects such as Mapillary, where citizens take street-level images and the system aims to automate mapping. A common trend among these projects using OSM is the fact that the community dynamic tends to create spin-off projects. Currently, we see a drive towards projects that support sustainability goals using OSM. We discuss some such applications and highlight challenges posed by this new paradigm.
    [Show full text]
  • How Good Is Openstreetmap Information
    Haklay, M., 2008, Please note that this is an early version of this article, and that the definitive, peer- reviewed and edited version of this article is published in Environment and Planning B: Planning and Design. Please refer to the publication website for the definitive version. How good is Volunteered Geographical Information? A comparative study of OpenStreetMap and Ordnance Survey datasets Abstract Within the framework of Web Mapping 2.0 applications, the most striking example of a geographical application is the OpenStreetMap project. OpenStreetMap aims to create a free digital map of the world and is implemented through the engagement of participants in a mode similar to software development in Open Source projects. The information is collected by many participants, collated on a central database and distributed in multiple digital formats through the World Wide Web (Web). This type of information was termed ‘Volunteered Geographical Information’ (VGI) by Mike Goodchild (2007). However, to date there has been no systematic analysis of the quality of VGI. This paper aims to fill this gap by analysing OpenStreetMap information. The examination starts with the characteristics of OpenStreetMap contributors, followed by analysis of its quality through a comparison with Ordnance Survey datasets. The analysis focuses on London and England, since OpenStreetMap started in London in August 2004 and therefore the study of these geographies provides the best understanding of the achievements and difficulties of VGI. The analysis shows that OpenStreetMap information can be fairly accurate: on average within about 6 metres of the position recorded by the Ordnance Survey, and with approximately 80% overlap of motorway objects between the two datasets.
    [Show full text]
  • Expanding Library GIS Instruction to Web Mapping in the Age of Neogeography
    Final version published as: Zhang, S. (2021). Expanding Library GIS Instruction to Web Mapping in the Age of Neogeography. Journal of Map & Geography Libraries, 1–19. https://doi.org/10.1080/15420353.2021.1935399. Expanding Library GIS Instruction to Web Mapping in the Age of Neogeography Sarah Zhang, Simon Fraser University Library Abstract The past two decades witnessed the flourishing of GeoWeb, a web infused with geospatial services and applications, which has given rise to a trend that non‐experts are increasingly involved in creating digital maps, collecting spatial data, and developing mapping mashups or applications, known as neogeography. In light of that, the general public and researchers/students in higher education institutions are becoming increasingly interested in these technologies. However, a gap exists between the GIS educational programs offered by public/academic libraries in Canada and the fast developing web mapping technologies as well as the shifting needs of users. This paper describes two web mapping workshop initiatives at a public library and an academic library, arguing web mapping technologies provides new opportunities to adapt ACRL Information Literacy to GIS education, and advocating academic and public libraries’ involvement in web mapping instruction in the age of neogeography. Introduction Revolutionary technologies in the past decade or so have drastically transformed the landscape of maps and mapping‐ digital maps have dramatically outpaced print maps, interactive web maps or apps have become a ubiquitous part of everyone’s daily life. Higher education institutions, however, have only started to respond to this shift by offering web mapping programs and courses. While distance education programs in web mapping have notably jumped in to fill the gap, with the most prominent examples being the Pennsylvania State University and the University of Kentucky, the cartography and GIS programs in North America which offer courses in web mapping have not become the mainstream (Sack 2018).
    [Show full text]
  • Geoweb, Neogeography, and VGI: New Challenges for Geomatics Sciences and Engineering
    Geoweb, Neogeography, and VGI: New Challenges for Geomatics Sciences and Engineering Stéphane ROCHE, Canada SUMMARY An important democratization process characterizes Geospatial technologies. Web mapping and virtual globes, GeoWeb, GPS navigation, Location Based Services - LBS or geocaching are concrete examples of this democratization process. Geospatial technologies are accessible everywhere, every time, and by everybody. Even if such a process seems to impact GI Sciences, very little is known about its characteristics and mechanisms. This process is embedded into the exponential development of the Web 2.0., which is a more opened Web, based on sharing, peering and crowdsourcing, in which the location component plays a dominating part - GeoWeb. In this context, new forms of geospatial information production appear, illustrated by web service like Wikimapia or OpenStreetMap for example. Such a "User-driven generated content "initiative", qualified by Mike Goodchild as "Volunteered Geographic Information - VGI", are more generally linked to the new trend called "Neogeography". Our presentation aims at providing an overview of the concept of neogeography from a geomatics sciences perspective, by providing a critical analysis of the geospatial technology democratization process. We provide an analysis of the most popular forms of neogeoraphy, through a series of case studies (Point Of Interest – POI, creation e- community, VGI, geoblog or wikicarto). This presentation finally highlights to what extent all these initiatives challenge geomatics
    [Show full text]
  • Recent Developments and Future Trends in Volunteered Geographic Information Research: the Case of Openstreetmap
    Future Internet 2014, 6, 76-106; doi:10.3390/fi6010076 OPEN ACCESS future internet ISSN 1999-5903 www.mdpi.com/journal/futureinternet Review Recent Developments and Future Trends in Volunteered Geographic Information Research: The Case of OpenStreetMap Pascal Neis 1,* and Dennis Zielstra 2 1 Geoinformatics Research Group, Department of Geography, University of Heidelberg, Berliner Street 48, D-69120 Heidelberg, Germany 2 Geomatics Program, University of Florida, 3205 College Avenue, Fort Lauderdale, FL 33314, USA; E-Mail: [email protected] * Author to whom correspondence should be addressed; E-Mail: [email protected]; Tel.: +49-6221-54-5504; Fax: +49-6221-54-4529. Received: 10 December 2013; in revised form: 10 January 2014 / Accepted: 13 January 2014 / Published: 27 January 2014 Abstract: User-generated content (UGC) platforms on the Internet have experienced a steep increase in data contributions in recent years. The ubiquitous usage of location-enabled devices, such as smartphones, allows contributors to share their geographic information on a number of selected online portals. The collected information is oftentimes referred to as volunteered geographic information (VGI). One of the most utilized, analyzed and cited VGI-platforms, with an increasing popularity over the past few years, is OpenStreetMap (OSM), whose main goal it is to create a freely available geographic database of the world. This paper presents a comprehensive overview of the latest developments in VGI research, focusing on its collaboratively collected geodata and corresponding contributor patterns. Additionally, trends in the realm of OSM research are discussed, highlighting which aspects need to be investigated more closely in the near future.
    [Show full text]
  • Significance of Collaborative Cartography
    ARTICLE Significance of Collaborative Cartography It is well recognised that mapping has perhaps changed more profoundly in recent years than in any previous era. In the past map production was reserved for professional cartographers and dissemination was radial: users were amateurs who consumed professional products. The revolutions in digital cartography, initiated in the 1980s, did little to alter expert control over mapping, and GIS if anything exacerbated the power of the producer. Now, however, websites such as Google Earth offer everyone the chance to produce their own individual maps, in many cases without the need of any professional qualification. Never before has such democratisation been as widespread. So many users today are brought together via the internet that producers and consumers are no longer distinguishable. Ambivalence These developments evoke ambivalent attitudes: some critical writers have argued they spell the end of traditional cartography. There is an increasing need for dynamic, and particularly interactive, maps, and web 2.0 offers a suitable platform for these new approaches to the acquisition, assembly and publication of geographic information. Others embrace a new era of map production with undreamed-of possibilities offered by the bi-directional collaboration that characterises the new mapping worlds of web 2.0. New terminologies are emerging to reflect this changing conceptual field: in contrast to the old cartography, there is now ‘neogeography', embracing the integrative power of ubiquitous computing in which geography brings together the formerly separate. Yet others cast these developments as volunteered geographic information (VGI), stressing the potential of locally sourced or volunteered information. Control To date, most academic cartographic research has focused on controlling the syntactical dimension of communication and trying to provide better maps by exploring useful semantic aspects.
    [Show full text]