<<

OpenSearch applied to Earth Science

Pedro Pereira Gonçalves Terradue S.r.l. Rome Italy

Summary

• Earth Science Use Cases

• Review of OpenSearch

• Geospatial and Time Extension

Encoding

• Implementation Guidelines and Example Earth Science

• Earth Science applications need different sources of data, technologies and computing power

• Communities store their information according to their own model and metadata schemas

• Communities have their own portal and access model

• Complex cross-community applications need horizontal scaling

Satellite Oceanographic Ships Buoys Airplanes LIDAR Model results GPS GENESI-DEC FP7 Project

• Earth Science e-infrastructure connecting Digital Repositories spread all over the world

• Easy and fast access to heterogeneous data (airborne, in situ, satellite) to authorized users

• On-demand operational processing capabilities: application/algorithms are run on scalable resources

• Processing steps are split taking advantage of available computing nodes

• Expert users bring their own processors GENESI-DEC

• GENESI-DEC exposes an API for data discovery, access and processing

• Data Linking of spatial temporal Earth science data available on the world’s digital data repositories (space, in-situ, models)

More Land surface More Ocean related data More Atmospheric data New Census and (satellite, insitu and spatial indicator data airborne) data

New Seafloor data

Communities Seafloor and ocean Global Atmosphere Observation Global Change Territorial development and spatial planning Black Sea catchment observation

Integration Example Global Atmosphere Observation Community

• Research Aircraft (Cooperation Network EUFAR) • process studies funded by EC and national projects

• in-situ data on chemical species, aerosol and cloud properties, radiation, and meteorological parameter.

• In-service Aircraft (Infrastructure IAGOS-ERI)

• long term observations funded by EC and national contributions;

• in-situ data for upper troposphere and lower stratosphere, greenhouse gases, reactive gases, particles Integration Example Global Atmosphere Observation Community

• Discover data on different infrastructures co-located in time and space USER Products (e.g. co-located aircraft and satellite data)

• Execute on-demand OpenSearch + ATOM API processing order MOZAIC CARIBIC IAGOS FALCON ENVISAT AQUA HALO Mission Data Data Data ENVISAT Data MERIS TERRA Data Support (1994- 1996- (2012ff) AATSR (1995- Level1 MODIS (2012ff) System 2008) dato) 2014 • Integrate results IAGOS-ERI ESA / NASA EUFAR-DLR directly on the community application (here, DLR Mission Support System) Integration

• Automatic acquisition of data (support of data analysis) • Exploration of geographical vicinity of data (extension of data space, representativeness of data)

• Inter-comparison and validation • Each community keeps their own research and operational environment

• Communities share discovery and processing feeds The technology: OpenSearch

• OpenSearch started in an effort built around 's A9.com and now maintained in a community process at opensearch.org

• Search engines have a description document used by client applications and allows syndication of search results

• OASIS Search Web Services group is publishing searchRetrieve Operation with Bindings for SRU 1.2 and OpenSearch http://docs.oasis-open.org/search-ws/

• It was intended to allow syndication of search results

• Search engines have a description document used by client applications

• OASIS Search Web Services group is publishing searchRetrieve Operation with Bindings for SRU 1.2 and OpenSearch http://docs.oasis-open.org/search-ws/ The technology: OpenSearch

• OpenSearch is flexible, results can be returned as HTML interfaces or in ATOM, XML/RDF, KML, WKT, JSON….

• Directly integrated in Web Browsers

• Includes result’s paging OpenSearch Description

• Provides metadata about the contents along with a set of URL Templates which illustrate the parameters accepted by the service and the variety of output formats in which results can be obtained

• The OpenSearch request interface is simple, consisting of a description of a HTTP GET request with a series of optional key- value parameters that can be used to constrain the search

http://www.google.com/?q=question http://www.google.com/?q={searchTerms} Description XML Document

Web Search

Use Acme.com to search the Web. example web [email protected]

Description XML Document

Web Search Use Acme.com to search the Web. example web [email protected] Example.com Web Search http://acme.com/websearch.png microsoft.icon">http://acme.com/websearch.ico Acme.com Development Team Search data Copyright 2005, Acme.com, Inc., All Rights Reserved open false en-us UTF-8 UTF-8 GeoSpatial Extension

• Specify a series of parameters that can be used to spatially constrain search results

• Provision is made to filter results by: • A bounding box

• A geometry using Well Known Text • Within a certain radius from a given latitude-longitude point • Having a certain containment relation (within, overlaps, disjoint) with a geographic constraint

• Matching a geographic name

• All geographic information in EPSG 4326 (WGS84) GeoSpatial Extension Elements

• geo:box -> Geographic bounding box The box is defined by "west, south, east, north" coordinates of longitude, latitude, in a EPSG:4326 decimal degrees.

• geo:geometry -> Geographic area (geometry): The polygon is defined using the Well Known Text standard for geographic shapes, using EPSG:4326

• geo:lat, geo:lon, geo:radius -> The latitude, longitude of a given point with a given radius

• geo:relation -> Spatial relation to result set, Character String; One of “overlaps”, “contains”, “disjoint” (default is “overlaps”)

• geo:name -> A string describing the location to perform the search

The current namespace of the OpenSearch Geo Extension is: http://a9.com/-/opensearch/extensions/geo/1.0/ Time Extension Elements

• Defines the temporal range request

• time:start Start of the temporal interval to search (RFC-3339)

• time:end End of the temporal interval to search (RFC-3339)

• RFC-3339 Example: 2008-01-01T23:43:23.00Z

• Namespace of the OpenSearch Time Extension is: http://a9.com/-/opensearch/extensions/time/1.0/ Geo and Time Extensions

• A search service with

• The geospatial extension allow to formulate geospatial requests e.g. point- plus-radius, a bounding box, or a polygon bbox={geo:box?}

• Together with the Time extension, OpenSearch can specify time start, finish, and slices for searching data. start={time:start?}&stop={time:end?}

http://example.com/xml/? bbox=6.73,36.73,18.45,47.07&start=2011-01-10&end=2011-01-12T23:59:5 Integration

• REST approach allowing the query and registry of datasets

• Allows the easy definition of custom result types to ease the integration and creation of mashup applications

• Responses as HTML interfaces or in Atom, XML/RDF, KML, WKT, METALINK, EOP/HMA Integration using ATOM ATOM Response Mappings

• Usage of ATOM elements

• GEORSS for spatial attributes

• dc:date and gml:validTime for time ranges

• Resource linkage with the atom:link element

• atom:link/[@rel=‘alternate’] for metadata

• atom:link/[@rel=‘enclosure’] for data access services

• atom:link/[@rel=‘related’] to link related resources ATOM Feed and atom:link

Direct Link to the data URL using the relation “enclosure” Adoption

• SEADATANET, EMSO

• ESA Grid - G-POD Catalogue (ESA biggest online data repository)

• NASA Earth Observing System Clearinghouse (ECHO)

• GEO Geohazards Supersite, GEO Web Portal with Search context passing

• Software: Geocommons, GeoNetwork (dev trunk),http://eo-virtual-archive4.esa.int/search/ER02_SAR_IM__0P/html/ OpenLayers (dev trunk)

• ngEO – GMES next generation of user services for EO

• Sentinel-1, -2, -3 and EarthCare, Envisat, ERS and other Earth Explorer missions (GOCE, Cryosat-2, SMOS, ...), third party missions (ALOS, Landsat, SPOT, ...) Adoption Example: SEADATANET

• Search Interface in: • http://seadatanet.maris2.nl/v_cdi_v2/search.asp SEADATANET HTML Interface SEADATANET Search Interface

• Search Interface is a Post to: • http://seadatanet.maris2.nl/v_cdi_v2/result.asp • Bounding Box (upper left, lower right) W33=-22.97 W34=56.48 W35=28.55 W36=23.59 • Dates (start and stop) v0_1=19710921 V0_2=20100921 SEADATANET Search Interface

• Map the query parameters to OpenSearch

OpenSearch GeoSpatial ExtensionOpenSearch Request &instrument={sdn:instrumentType?} OpenSearch W33=-22.97 W34=56.48 W35=28.55PossibleTemporal W36=23.59 future &platform={sdn:platformType?} Request Terms &variable={sdn:variableGrouping?}Needs to be transformed"/> into SEADATAa bboxExtension request specific Requestextensions Terms Examples – NASA ECHO

://testbed.echo.nasa.gov/echo-esip/ GEO Geohazards Supersite

• http://eo-virtual-archive4.esa.int/search/html/ GEO Geohazards Supersite

http://eo-virtual-archive4.esa.int/search/ER02_SAR_IM__0P/html/ ?name=Adana Conclusion

• OpenSearch as the minimal compliance level to develop discovery mechanisms in heterogeneous sites

• Use ATOM to map metadata properties • Develop brokers/gateways for special cases

• Usage of atom:link • Define data access and alternate representations

• Express data relations

• Improve search relevance

• Relation logic between the data from different communities

• Resource clusters according to particular viewpoint