Preserving Geospatial Data
Total Page:16
File Type:pdf, Size:1020Kb
Technology Watch Report Preserving Geospatial Data Guy McGarva EDINA, University of Edinburgh Steve Morris North Carolina State University (NCSU) Greg Janée University of California, Santa Barbara (UCSB) DPC Technology Watch Series Report 09-01 May 2009 © 2009 1 Executive Summary: Geospatial data are becoming an increasingly important component in decision making processes and planning efforts across a broad range of industries and information sectors. The amount and variety of data is rapidly increasing and, while much of this data is at risk of being lost or becoming unusable, there is a growing recognition of the importance of being able to access historical geospatial data, now and in the future, in order to be able to examine social, environmental and economic processes and changes that occur over time. The geospatial domain is characterized by a broad range of information types, including geographic information systems data, remote sensing imagery, three- dimensional representations and other location-based information. The scope of this report is limited to two-dimensional geospatial data and data that would typically be considered comparable to paper maps or charts including vector data, raster data and spatial databases. There are a number of significant preservation issues that relate specifically to geospatial data, including: the complexity and variety of data formats and structures; the abundance of content that exists in proprietary formats; the need to maintain the technical and social contexts in which the data exists; and the growing importance of web services and dynamic (and ephemeral) data. Standards for geospatial metadata have been defined at both the national and international levels, yet metadata often becomes dissociated from the data, or is incorrect, non-standard in nature, or not created in the first place. Additional considerations to be taken into account in preserving geospatial data include coordinate reference systems, cartographic representations, topology, project files and data packaging. Standards bodies are in place at the national and international levels to address general geospatial data standardization issues, yet working groups addressing preservation issues have only recently been formed. A number of technologies and tools that are, or may be, of relevance to geospatial data preservation efforts have emerged, although the nature of the problem is such that there is not a single tool or technology that will be relevant in all cases. A number of projects and activities have been addressing various aspects of geospatial data preservation, creating an initial body of experience from which some initial recommendations can be made. While these recommendations provide a basic checklist of issues to be considered when preserving geospatial data, it must be emphasized that the collective experience in preserving such data is still very much in an early stage and that further investigations are needed. Keywords: Geographic Information Systems, geospatial data, preservation, spatial databases, geospatial formats, web mapping services 2 Contents: 1 Introduction: why preserve geospatial data? ................................................................................... 4 2 Background: key challenges with geospatial data ........................................................................... 4 3 Geospatial Data Preservation Issues ............................................................................................... 6 3.1 Generic Geospatial Data Issues ............................................................................................. 6 3.1.1 Coordinate Reference Systems ......................................................................................... 6 3.1.2 Cartographic Representation ............................................................................................ 7 3.1.3 Topology ........................................................................................................................... 7 3.1.4 Project Files ...................................................................................................................... 8 3.1.5 Data Packaging ................................................................................................................. 8 3.2 Vector Data ........................................................................................................................... 9 3.2.1 Commercial Vector Data Formats .................................................................................... 9 3.2.2 Open Vector Data Formats ............................................................................................. 11 3.3 Raster Data .......................................................................................................................... 13 3.3.1 Georeferencing and Rectification ................................................................................... 13 3.3.2 Compression ................................................................................................................... 13 3.3.3 Raster Formats ................................................................................................................ 14 3.3.4 Mosaicked Raster Data ................................................................................................... 15 3.3.5 Stereo, Oblique and Ground-Level Imagery ................................................................... 15 3.3.6 Raster Data Size .............................................................................................................. 16 3.4 Emerging Data Formats ....................................................................................................... 16 3.4.1 KML ............................................................................................................................... 16 3.4.2 PDF and GeoPDF ........................................................................................................... 17 3.5 Spatial Databases ................................................................................................................. 17 3.5.1 ESRI Geodatabases ......................................................................................................... 18 3.6 Dynamic Geospatial Data .................................................................................................... 19 3.6.1 Web Map Services (WMS) ............................................................................................. 19 3.6.2 Web Feature Services (WFS) ......................................................................................... 20 3.6.3 Other OGC Web Services ............................................................................................... 20 3.7 Legal Issues ......................................................................................................................... 20 3.7.1 UK Legal Landscape ...................................................................................................... 21 3.7.2 US Legal Landscape ....................................................................................................... 22 3.7.3 ‗Open‘ Geospatial Data .................................................................................................. 22 3.8 Geospatial Metadata ............................................................................................................ 22 3.8.1 Metadata Standards ......................................................................................................... 23 3.8.2 Metadata Challenges for Archives .................................................................................. 23 3.8.3 Geospatial Metadata vs. Preservation Metadata ............................................................. 24 3.8.4 Metadata Creation ........................................................................................................... 25 4 Standards Bodies and Working Groups ........................................................................................ 25 4.1 Open Geospatial Consortium (OGC) .................................................................................. 25 4.1.1 OGC Data Preservation Working Group ........................................................................ 25 4.2 U.S. Federal Geographic Data Committee (FGDC) ............................................................ 26 4.2.1 FGDC Historical Data Working Group .......................................................................... 26 5 Technology and Tools ................................................................................................................... 26 5.1 Digital Globe Tools ............................................................................................................. 26 5.2 Geospatial Format Registries and Validation Tools ............................................................ 27 5.3 ESRI Geodatabase Archiving .............................................................................................. 27 5.4 Digital Repository Software ................................................................................................ 27 6 Conclusions and Recommendations ............................................................................................. 28 7 Glossary of Acronyms .................................................................................................................. 29 8 Selected References and Resources