Geosparql: Enabling a Geospatial Semantic Web

Geosparql: Enabling a Geospatial Semantic Web

Undefined 0 (0) 1 1 IOS Press GeoSPARQL: Enabling a Geospatial Semantic Web Robert Battle, Dave Kolas Knowledge Engineering Group, Raytheon BBN Technologies 1300 N 17th Street, Suite 400, Arlington, VA 22209, USA E-mail: {rbattle,dkolas}@bbn.com Abstract. As the amount of Linked Open Data on the web increases, so does the amount of data with an inherent spatial context. Without spatial reasoning, however, the value of this spatial context is limited. Over the past decade there have been several vocabularies and query languages that attempt to exploit this knowledge and enable spatial reasoning. These attempts provide varying levels of support for fundamental geospatial concepts. In this paper, we look at the overall state of geospatial data in the Semantic Web, with a focus on the upcoming OGC standard GeoSPARQL. GeoSPARQL attempts to unify data access for the geospatial Semantic Web. We first describe the motivation for GeoSPARQL, then the current state of the art in industry and research, followed by an example use case, and the implementation of GeoSPARQL in the Parliament triple store. Keywords: GeoSPARQL, SPARQL, RDF, geospatial, geospatial data, query language, geospatial query language, geospatial index 1. Introduction In this paper, we discuss an emerging standard, Geo- SPARQL [24] from the Open Geospatial Consortium Geospatial data is increasingly being made avail- (OGC)1. This standard aims to address the issues of able on the Web in the form of datasets described us- geospatial data representation and access. It provides ing the Resource Description Framework (RDF). The a common representation of geospatial data described principles of Linked Open Data, detailed in [5], en- using RDF, and the ability to query and filter on the courage a set of best practices for publishing and con- relationships between geospatial entities. First, we in- necting structured data on the Web. Linked Open Data troduce some geospatial concepts that are critical to promotes the use of the SPARQL Protocol and RDF understanding some of the design choices for Geo- Query Language (SPARQL) and RDF to query and SPARQL. We then describe topological relationships model data. While this is useful for querying for re- that are important to understand when designing a lan- lationships that are explicitly represented in data, im- guage for querying between spatial entities. Next, we plicit relationships, such as geospatial relationships, describe the motivation for GeoSPARQL, the current cannot easily be queried. For instance, datasets may state of the art in modeling and querying geospatial exist that describe monuments and parks, but being data in the Semantic Web, and we introduce Geo- able to link these datasets based on their undeclared SPARQL. A discussion of the Parliament2 triple store, relationships is difficult. The ability to answer a mean- ingful query, like "What parks are within 3km of the Washington Monument?", depends on how the data is 1Dave Kolas is a co-chair for the GeoSPARQL Standards Work- represented, whether all of the resources are related to ing Group, and Robert Battle has worked on the Parliament imple- mentation and provided feedback to the development of the stan- the Washington Monument, and if that relationship is dard. explicit. 2http://parliament.semwebcentral.org 0000-0000/0-1900/$00.00 c 0 – IOS Press and the authors. All rights reserved 2 its GeoSPARQL spatial index, and a use-case that il- system uses a spherical surface to determine locations. lustrates a simple example of data integration with In such a system, a point is defined by angles measured GeoSPARQL follows. from the center of the Earth to a point on the surface. These are also known as latitudes (horizontal) and lon- gitudes (vertical). A Cartesian coordinate system is a 2. Geospatial Concepts flat coordinate system on the surface. It enables quick and accurate measurements over small distances and is Some basic understanding of geospatial concepts is useful for applications such as surveying. required for discussion of GeoSPARQL. The follow- An ellipsoid defines an approximation for the center ing sections define some of the terms used throughout and shape of the Earth. A datum defines the position the rest of this paper. of an ellipsoid relative to the center of the earth. This provides a frame of reference for measuring locations 2.1. Features and Geometries and, for local datums, allows for accurate locations to be defined for the valid area of the datum. WGS84 is a Features and geometries are two fundamental con- datum that is widely used by GPS devices that approxi- cepts of geospatial science. A feature is simply any en- mates the entire world. Geographic coordinate systems tity in the real world with some spatial location. This use an Earth based datum that transforms an ellipsoid could be a park, an airport, a monument, a restaurant, into a representation of the Earth. etc. A feature can have a spatial location that cannot In order to create a map of the Earth, it must be pro- be precisely defined, such as a swamp or a mountain jected from a curved surface onto the plane. This pro- range. A geometry is any geometric shape, such as a jection will distort the surface in some fashion which point, polygon, or line, and is used as a representation will mean that the coordinates for some locations are of a feature’s spatial location. For instance, Reagan more accurate than those in another. Some projections National Airport is a geospatial feature because it is an will preserve area, so the size of all objects is relative, entity that has a specific location in the world. It has a while others preserve angles, and others try to do both. geometry that is a point with coordinates 38.852222, - A coordinate system projected onto a plane enables 3 77.037778 (in the WGS84 datum). Geometries can be faster performance, as Cartesian calculations require measured at varying resolutions, from a simple point fewer resources than Spherical calculations. Computa- in the center of a feature to a complex, precise mea- tions across the plane, however, are inaccurate when surement of a feature’s entire border. Spatial data typi- they deal with large areas as the curvature of the Earth cally separates features from geometries, although that is not taken into account. is not always the case. The combination of these elements defines a CRS. One common source of well defined coordinate refer- 2.2. Coordinate Reference Systems ence systems is the European Petroleum Survey Group (EPSG)4. An important part of the metadata associated with a geometry is its coordinate reference system (CRS) (al- 2.3. Topological Relationships ternatively known as its spatial reference system). The elements of a coordinate reference system provide con- text for the coordinates that define a geometry in order All spatial entities are inherently related to some to accurately describe their position and establish rela- other spatial entity. Whether two entities intersect tionships between sets of coordinates. There are four somehow or are thousands of miles apart, the relation- parts that make up a CRS: a coordinate system, an el- ship that they share can be described and evaluated. lipsoid, a datum, and a projection. In [28], Randell et al. describe an interval logic for A coordinate system describes a location relative to reasoning about space using a simple ontology that de- some center. A geocentric coordinate system places the fines functions and relations for expressing and rea- center at the center of the Earth and uses standard X, soning over spatial regions. This logic is referred to as Y, Z ordinates. A geographic (or geodetic) coordinate Region Connection Calculus (RCC). A subset of RCC, RCC8, defines eight mutually exhaustive pairwise dis- 3http://en.wikipedia.org/wiki/World_ Geodetic_System 4http://www.epsg-registry.org 3 Table 1 Geospatial reasoning is critical in a large number Simple Features, Egenhofer and RCC8 relations equivalence of application domains (emergency response, trans- Simple Features Egenhofer RCC8 portation planning, hydrology, land use, etc.). Users in equals equal EQ these domains have long utilized relational databases disjoint disjoint DC with spatial extensions [9]. These spatially extended intersects :disjoint :DC databases have given the combination of efficient, sta- touches meet EC ble storage and retrieval of data with geospatial calcu- within inside + coveredBy NTPP + TPP lation and indexing. This allows questions like "Which contains contains + covers NTPPi + TPPi students live within 2km of the school they attend?" to overlaps overlap PO be answered efficiently. Within the last decade, RDF storage solutions have joint relations which can be used to imply the rest of become increasingly popular. These knowledge bases, the relations in RCC. These eight base relations are: sometimes called triple stores, are capable of better handling several types of problems at which relational 1. DC(x, y) (x is disconnected from y) databases struggle or are not intended to perform: 2. x = y (x is identical with y) queries with many joins across entities [32], queries 3. PO(x,y) (x partially overlaps y) with variable properties [32], and ontological inference 4. EC(x,y) (x is externally connected with y) on datasets. These features lend themselves towards 5. TPP(x,y) (x is a tangential proper part of y) problems that involve data exploration, linkage across 6. NTPP(x,y) (x is a non-tangential proper part of datasets, and abstraction from low level data. y) Because of RDF stores’ ability to do inference and 7. TPPi(x,y) (y is a tangential proper part of x) easily link data sets, they have been of growing inter- 8. NTPPi(x,y) (y is a non-tangential proper part of est to the geospatial data community. Often geospa- x) tial domains have complicated type hierarchies which cannot be fully expressed in current geospatial infor- The same set of eight geospatial topological rela- mation systems.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    17 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us