Web Approach for Ontology-Based Classification, Integration, and Interdisciplinary Usage of Geoscience Metadata
Total Page:16
File Type:pdf, Size:1020Kb
Data Science Journal, Volume 11, 18 October 2012 WEB APPROACH FOR ONTOLOGY-BASED CLASSIFICATION, INTEGRATION, AND INTERDISCIPLINARY USAGE OF GEOSCIENCE METADATA B Ritschel1*, V Mende1*, L Gericke2*, R Kornmesser2*, and S Pfeiffer3 1*Data Center, GFZ German Research Centre for Geosciences, Potsdam, German 14473 Potsdam, Germany 2*Hasso-Plattner-Institut für Softwaresystemtechnik, Potsdam, Germany 2-3, 14482 Potsdam, Germany 3Hochschule Neubrandenburg, Neubrandenburg, Germany, Postfach 11 01 21, 17041 Neubrandenburg, Germany ABSTRACT The Semantic Web is a W3C approach that integrates the different sources of semantics within documents and services using ontology-based techniques. The main objective of this approach in the geoscience domain is the improvement of understanding, integration, and usage of Earth and space science related web content in terms of data, information, and knowledge for machines and people. The modeling and representation of semantic attributes and relations within and among documents can be realized by human readable concept maps and machine readable OWL documents. The objectives for the usage of the Semantic Web approach in the GFZ data center ISDC project are the design of an extended classification of metadata documents for product types related to instruments, platforms, and projects as well as the integration of different types of metadata related to data product providers, users, and data centers. Sources of content and semantics for the description of Earth and space science product types and related classes are standardized metadata documents (e.g., DIF documents), publications, grey literature, and Web pages. Other sources are information provided by users, such as tagging data and social navigation information. The integration of controlled vocabularies as well as folksonomies plays an important role in the design of well formed ontologies. Keywords: Semantic Web, Ontologies, Metadata, Web 2.0, DIF (Directory Interchange Format), ISDC (Information System and Data Center), ESSI (Earth and Space Science Informatics) 1 INTRODUCTION The Electronic Geophysical Year (eGY) 2007-2008 (http://www.egy.org/declaration.php), an initiative of the International Union of Geodesy and Geophysics, was founded to strengthen the awareness of a sustainable handling of geoscience data, information, and knowledge and the important role of Earth and Space Science Informatics (ESSI). ESSI connects informatics research, development, and practice with the processing and solution of very specific, but also general, science and societal questions related to the research areas of the Earth and space domains 50 years after the International Geophysical Year 1957-1958. My experiences as European eGY representative and as organizer and participant of various eGY and ESSI related workshops and congresses within the last years, along with the work of the authors at the German Research Center for Geosciences’ data center (www.gfz-potsdam.de) in the design, implementation, and operation of the GFZ Information System and Data Center for Geoscience Data and Information, show clearly the demand for an intensified engagement for the creation of conditions for a sustainable handling of the life cycle of geoscience and space science data from the experiment, through the processing of data and information, to the generation of knowledge. Such ESSI objectives as the classification of geoscience data and information, the metadata extension of data, the development of concept models for the discovery of relations between data classes, and the usage of taxonomies and ontologies play an important role for an on-demand discovery, a sustainable management, and a systematic use of multi-domain data and information for the generation of knowledge for a better understanding of the complex system Earth. In particular, the intense study and practical implementation of the scientific and technical ideas of the Semantic Web approach, according to Daconta et al. (2003), are very important for the creation of interoperable and inter-domain geosciences data, information, and finally, knowledge. This paper demonstrates the usage of metadata classes and related standards according to a Semantic Web approach for the description, classification, and design of the conceptual structure of the geodetic and meteorological data and information of the Global Geodetic Project (GGP) in the framework of the International Association of Geodesy (IAG). IGY140 Data Science Journal, Volume 11, 18 October 2012 2 IAG, GGOS, AND GGP PROJECT The International Association of Geodesy (IAG) (http://www.iugg.org/associations/iag.php) is the scientific organization for the exchange of scientific results and research ideas as well as the promotion of scientific work in geodesy and related geoscientific fields. Geodesy studies the geometry, the gravity, and the rotation of the Earth. Rothacher (2002) calls these three research areas the three pillars of geodesy. In particular, the processing and supply of a highly accurate terrestrial reference frame (TRF), derived by combining results from the three geodetic pillars, are very important for the detailed description and presentation of spatial terrestrial data and information. Computation of the TRF is the IAG’s contribution to the creation of the Global Earth Observing System of Systems (GEOSS) (www.earthobservations.org). The IAG consists of commissions, scientific working groups, a central bureau, and services. The IAG flagship project is the design and implementation of a Global Geodetic Observing System (GGOS). The GGOS integrates geodetic techniques, models, and approaches for better long-term consistency, reliability, and understanding of geodetic, geodynamic, and global variations of the Earth. GGOS provides the scientific and infrastructure basis for global change research in Earth sciences. The main objective of the GGOS project, related to service and infrastructure aspects, is the promotion of geodetic data products and appropriate IAG services. Project requirements are the implementation of sustainable services for the collection, validation, archiving, and distribution of geodetic observation data, processed data products, such as global gravity models, and finally cross-domain knowledge products, such as results according to temporal variations of ocean currents for the estimation of the impact for climate changes (http://www-app2.gfz-potsdam.de/pb1/op/grace/). Figure 1. IAG Data Services Structure Figure 1 shows the current structure of IAG services in a GGOS environment, which are planned also to become services at GEOSS. Geometric related services are: • International Earth Rotation and Reference Systems Service (IERS), • International GNSS Service (IGS), • International Laser Ranging Service (ILRS), • International DORIS Service (IDS). IGY141 Data Science Journal, Volume 11, 18 October 2012 Gravity related services are: • International Gravity Field Service (IGFS), • International Gravimetric Bureau (BGI), • International Geoid Service (IGeS), • International Center for Earth Tides (ICET), • International Centre for Global Earth Models (ICGEM). Sea Level related services are: • Permanent Service for Mean Sea Level (PSMSL), • International Altimeter Service (IAS). Other service elements are: • Time Section of the International Bureau of Weights and Measures (BIPM), • IAG Bibliographic Service (IBS). The International Center for Earth Tides (ICET), as part of the gravity service area of GGOS (Figure 1), is also a World Data Center service, known as WDCC for Earth Tides (http://www.astro.oma.be/ICET/). Furthermore, the results of the international Global Geodynamic Project (GGP), an IAG Inter-Commission project of “Commission 2: The Gravity Field” and “Sub-Commission 3.1: Earth Rotation and Earth Tides”, provide the scientific basis for the calculation of Earth tides. The main objective of the GGP project (http://www.eas.slu.edu/GGP/ggphome.html) is to record the changes of the gravity field of the Earth with a number of highly accurate superconducting gravimeter (SG) observatories (Figure 2) distributed worldwide. The data are also used for researching global motions of the whole Earth, such as the Chandler wobble (named after an American astronomer who in 1884-1885 discovered a movement in the Earth’s axis of rotation that causes latitude to vary with a period of 14 months), up to the surficial gravity effects of atmospheric pressure and groundwater. The SG observatories are run independently by national groups of scientists and technicians who send gravity and meteorological data each month to the GGP Information System and Data Center GGP ISDC (http://ggp.gfz-potsdam.de) at the GFZ Potsdam in Germany. The provided gravity data differ in sampling rate and treatment after measurement. At the GGP ISDC there are calibrated raw data, corrected data related to jumps and gaps, as well as data which are processed according to an ICET standard procedure. These standard data files are processed at ICET (http://www.upf.pf/ICET/). In 2009, ICET operations were transferred from Brussels to Tahiti, the new location for ICET (http://www.icsu-fags.org/ps04icet.htm). All GGP data are public after a certain period of time (normally half a year) and can be requested and downloaded both at the specific GGP ISDC and the GGP section of the GFZ Information System and Data Center (ISDC) portal (http://isdc.gfz-potsdam.de/ggp) at the Helmholtz Centre Potsdam GFZ German Research Centre for Geosciences. The basic architecture, the