Instructions for the Preparation of a Camera-Ready Paper in MS Word

Total Page:16

File Type:pdf, Size:1020Kb

Instructions for the Preparation of a Camera-Ready Paper in MS Word An RDF guide for the Darwin Core standard Editor(s): Solicited review(s): Open review(s): Steven J. Baskaufa,*, John Wieczorekb, John Deckc, and Campbell O. Webbd aDepartment of Biological Sciences, PMB 351634, Vanderbilt University, Nashville, TN 37146, USA E-mail: [email protected] bMuseum of Vertebrate Zoology, University of California at Berkeley, 3101 Valley Life Sciences Building, Berkeley, CA 94720, USA E-mail: [email protected] c Berkeley Natural History Museums, University of California at Berkeley, 1001 Valley Life Sciences Building, Berkeley, CA 94720, USA Email: [email protected] dArnold Arboretum of Harvard University, 1300 Centre St., Boston 02131, USA E-mail: [email protected] Abstract. The Darwin Core vocabulary is widely used to transmit biodiversity data in the form of simple text files. In order to support expression of biodiversity data in the Resource Description Framework (RDF), a guide was created as a non-normative addition to the Darwin Core standard. The guide resolves a number of issues that arise from adapting terms designed to have literal values for use with URI references. Although there are some problems that are beyond the scope of the guide, the guide is an important step towards enabling the biodiversity informatics community to participate in broader Linked Data and Se- mantic Web efforts. Keywords: biodiversity, informatics, standards, semantics, interoperability 1. Introduction Darwin Core roughly follows the Dublin Core Ab- stract Model,4 which was built on Resource Descrip- Darwin Core (DwC) [1] is a technical standard1 of tion Framework5 (RDF) concepts. A DwC term is Biodiversity Information Standards2 (TDWG). Since designated as a URI, which can be dereferenced to its ratification in 2009, Darwin Core has been widely access its normative RDF/XML metadata. Despite used to publish and transmit data. For example, as of these connections to RDF, there has previously been 29 Jan 2014 the Global Biodiversity Information Fa- no guide to describe how DwC vocabulary terms cility (GBIF) network has aggregated over 428 mil- should be used to describe biodiversity resources as lion Darwin Core records.3 RDF. The Darwin Core standard contains a general- The Darwin Core vocabulary was primarily de- purpose vocabulary for describing biodiversity re- signed to facilitate the sharing of data in simple text sources. It includes several guides that describe how files containing a single table of rows and columns. the vocabulary terms should be used to transmit data Extensive discussion on the TDWG mailing list6 be- in various formats such as simple text and XML. tween 2009 and 2011 showed that there was signifi- cant interest in expressing Darwin Core data in RDF * Corresponding author. 1 http://www.tdwg.org/standards/450/; enter reference website at http://rs.tdwg.org/dwc/terms/ 4 http://dublincore.org/documents/abstract-model/ 2 http://www.tdwg.org/ 5 http://www.w3.org/TR/rdf-concepts/ 3 http://www.gbif.org/ 6 http://lists.tdwg.org/pipermail/tdwg-content/ and identified several issues that impeded the effec- namespace dwc:9 suggest using text strings to refer tive use of Darwin Core terms in RDF: to physical and conceptual entities, e.g., names to - Text files often use name strings as values of prop- represent people, citations to represent articles, codes erties, whereas it is often advantageous in RDF to to represent institutions, etc. When a record has mul- have URI reference objects. tiple values for a property, these term definitions - Simple fielded text files (where a line in the file specify that the multiple strings be concatenated and contains the data for a single record) are by their delineated in a single field to avoid forcing the crea- nature “flat”, while RDF is often highly normalized. tion of a more normalized data structure. These (We use the term "flat" to denote data which is ex- specifications are at odds with best practices for RDF, pressed in a single table.) where it is preferable in triples to identify non-literal - Darwin Core defines a number of “ID” terms, objects using URI references. Those URIs can then which are used in flat files to specify identifiers and be associated with additional properties that describe record type, whereas RDF provides different stand- the non-literal resource. ard methods for expressing identity and class mem- The conflicting demands of flat, string-based ta- bership. bles and normalized, graph-based RDF creates a In 2011, an RDF/OWL Task Group7 was chartered problem when terms that were originally designed for by TDWG, and in 2012 a team of writers began work use with text strings are co-opted for use with non- on a Darwin Core RDF guide to address the issues literal objects in RDF. This is a longstanding prob- listed above by providing a set of best practices and lem10 that is not unique to Darwin Core. The Dublin by creating some new Darwin Core terms intended Core RDF Guidelines11 provide a mechanism to per- specifically for use in RDF. The guide8 was com- mit legacy string literal data to be associated with pleted in 2013 and reviewed by the Task Group, terms that were not intended for use with literal ob- which recommended it for adoption. Upon adoption jects. This mechanism, which involves use of the by TDWG, the RDF guide will become a non- rdf:value 12 property, has not been widely im- normative part of the Darwin Core standard. plemented. A more widely used dual-term alterna- In this paper, URIs are sometimes abbreviated as tive allows Dublin Core terms in the legacy dc:13 QNames using standard namespace prefixes, e.g. namespace to be used with literal values,14 while re- dwc:recordedBy. The prefixes are defined in serving terms in the dcterms: 15 namespace that footnotes or legends. URIs, RDF serializations, and have declared non-literal ranges for use with URI SPARQL queries are written in Courier font. In reference or blank node objects. many cases, URIs identify real resources in the wild, although example triples containing those URIs are 2.1.1. New Darwin Core terms in the dwcuri: not necessarily asserted there. namespace Section 2 of this paper lays out how each major is- The Darwin Core RDF guide adopts the dual-term sue was addressed by the guide. Section 3 describes approach by creating a new Darwin Core namespace, several challenges to the implementation of Darwin dwcuri:,16 whose terms are intended for use only Core as RDF that are beyond the scope of the guide. with non-literal objects. For example, the existing Section 4 concludes the paper. Darwin Core term dwc:recordedBy would con- tinue to be used with a value that consisted of a name string for agents who recorded an occurrence, where- 2. Issues and their resolution in the guide as the new term dwcuri:recordedBy would re- late the subject to a non-literal object (URI reference 2.1. Terms intended for use with non-literal objects or blank node) that denotes the actual agent. Because the Darwin Core vocabulary was de- 9 http://rs.tdwg.org/dwc/terms/ signed primarily to facilitate the transfer of text- 10 http://wiki.foaf-project.org/w/UsingDublinCoreCreator based records from relatively flat database tables, 11 http://dublincore.org/documents/dc-rdf/#sect-4 definitions and comments for terms in the general 12 rdf: = http://www.w3.org/1999/02/ 22-rdf-syntax-ns# 13 http://purl.org/dc/elements/1.1/ 14 7 http://code.google.com/p/tdwg-rdf/ http://wiki.dublincore.org/index.php/User_Guide/ 8 The draft guide is at http://code.google.com/p/tdwg- Publishing_Metadata#Legacy_namespace rdf/wiki/DwcRdf with an eventual permanent URL of 15 http://purl.org/dc/terms/ http://rs.tdwg.org/dwc/terms/guides/rdf/ 16 http://rs.tdwg.org/dwc/uri/ <http:// arctos.database.museum/guid/MVZ:Mamm:115956> dwc:recordedBy "Oliver P. Pearson; Anita K. Pearson"; dwcuri:recordedBy <http://viaf.org/viaf/263074474>, <http://museum-x.org/personnel/akp>. Fig. 1. Recorders of an occurrence (serialized as RDF/Turtle) <http://bioimages.vanderbilt.edu/baskauf/00001#loc> a dcterms:Location; dwc:continent "North America"; dwc:country "United States"; dwc:stateProvince "Tennessee"; dwc:county "Robertson". Fig. 2. Darwin Core convenience terms describing the political subdivisions of a location (serialized as RDF/Turtle) <http://bioimages.vanderbilt.edu/baskauf/00001#loc> a dcterms:Location; dwcuri:inDescribedPlace <http://sws.geonames.org/4653638/>. Fig. 3. Using a dwcuri: term to link a location to its lowest level political subdivision (serialized as RDF/Turtle) The guide allows legacy string name data in the dwc:continent allow a location to be placed in form of concatenated lists to continue to be exposed its geographical context. in RDF as a literal object of a dwc: namespace term. No single term value is sufficient to unambiguous- However, the guide specifies that if a record using a ly place the location in its lowest level political sub- term from the general dwc: namespace is serialized division because there may be several low level polit- as RDF using a dwcuri: namespace term, each ical subdivisions having the same name that are con- non-literal resource in a concatenated list of names tained within different upper level political subdivi- should be the object of separate triple. This is illus- sions. Thus each location record must provide the trated in Fig. 1. entire set. In the context of a flat database structure, An advantage of the dual-term approach is that it it is convenient to expose the full set of proper- allows large databases consisting of legacy string ty/value pairs for a location since that would allow a name data to be exposed immediately as RDF with- user to query for locations in the database by specify- out imposing a requirement that the provider imme- ing the particular values of interest for certain proper- diately implement the use of URI identifiers for non- ties in the set (hence the name "convenience terms" literal resources.
Recommended publications
  • Mapping Spatiotemporal Data to RDF: a SPARQL Endpoint for Brussels
    International Journal of Geo-Information Article Mapping Spatiotemporal Data to RDF: A SPARQL Endpoint for Brussels Alejandro Vaisman 1, * and Kevin Chentout 2 1 Instituto Tecnológico de Buenos Aires, Buenos Aires 1424, Argentina 2 Sopra Banking Software, Avenue de Tevuren 226, B-1150 Brussels, Belgium * Correspondence: [email protected]; Tel.: +54-11-3457-4864 Received: 20 June 2019; Accepted: 7 August 2019; Published: 10 August 2019 Abstract: This paper describes how a platform for publishing and querying linked open data for the Brussels Capital region in Belgium is built. Data are provided as relational tables or XML documents and are mapped into the RDF data model using R2RML, a standard language that allows defining customized mappings from relational databases to RDF datasets. In this work, data are spatiotemporal in nature; therefore, R2RML must be adapted to allow producing spatiotemporal Linked Open Data.Data generated in this way are used to populate a SPARQL endpoint, where queries are submitted and the result can be displayed on a map. This endpoint is implemented using Strabon, a spatiotemporal RDF triple store built by extending the RDF store Sesame. The first part of the paper describes how R2RML is adapted to allow producing spatial RDF data and to support XML data sources. These techniques are then used to map data about cultural events and public transport in Brussels into RDF. Spatial data are stored in the form of stRDF triples, the format required by Strabon. In addition, the endpoint is enriched with external data obtained from the Linked Open Data Cloud, from sites like DBpedia, Geonames, and LinkedGeoData, to provide context for analysis.
    [Show full text]
  • Discovering Alignments in Ontologies of Linked Data∗
    Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Discovering Alignments in Ontologies of Linked Data∗ Rahul Parundekar, Craig A. Knoblock, and Jose´ Luis Ambite University of Southern California Information Sciences Institute 4676 Admiralty Way Marina del Rey, CA 90292 Abstract The problem of schema linking (aka schema matching in databases and ontology alignment in the Semantic Web) has Recently, large amounts of data are being published received much attention [Bellahsene et al., 2011; Euzenat and using Semantic Web standards. Simultaneously, Shvaiko, 2007; Bernstein et al., 2011; Gal, 2011]. In this there has been a steady rise in links between objects paper we present a novel extensional approach to generate from multiple sources. However, the ontologies alignments between ontologies of linked data sources. Sim- behind these sources have remained largely dis- ilar to previous work on instance-based matching [Duckham connected, thereby challenging the interoperabil- and Worboys, 2005; Doan et al., 2004; Isaac et al., 2007], ity goal of the Semantic Web. We address this we rely on linked instances to determine the alignments. Two problem by automatically finding alignments be- concepts are equivalent if all (or most of) their respective in- tween concepts from multiple linked data sources. stances are linked (by owl:sameAs or similar links). How- Instead of only considering the existing concepts ever, our search is not limited to the existing concepts in in each ontology, we hypothesize new composite the ontology. We hypothesize new concepts by combining concepts, defined using conjunctions and disjunc- existing elements in the ontologies and seek alignments be- tions of (RDF) types and value restrictions, and tween these more general concepts.
    [Show full text]
  • Linked Data Enables Such Web of Data Global Identifier
    Lift your data Introduction to Linked Data Ghislain Atemezing1 , Boris Villazón-Terrazas2 1 EURECOM, France [email protected] 2 Ontology Engineering Group , FI , UPM [email protected] Slides available at: http://www.slideshare.net/atemezing/ INSPIRE Conference 2012, Istanbul Acknowledgements: Oscar Corcho, Asunción Gómez-Pérez, Luis Vilches, Raphaël Troncy, Olaf Hartig, Luis Vilches and many others that we may have omitted. Workdistributed under the license Creative Commons Attribution- Noncommercial-Share Alike 3.0 Agenda Linked Data Geospatial LD Datasets 5-star deployment scheme for Linked Open Data Tutorial “Lift your data”, Istanbul 2012 2 Agenda Linked Data Geospatial LD Datasets 5-star deployment scheme for Linked Open Data Tutorial “Lift your data”, Istanbul 2012 3 Current Web Geospatial Database (Spain) Data exposed to the Web via HTML, pdf, etc. Statistical Database (Spain) © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig Tutorial “Lift your data”, Istanbul 2012 4 Information from ssgeingle pages can be found via sear ch eegngin es © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig Tutorial “Lift your data”, Istanbul 2012 5 Complex queries over muliltipl e pages/data sources? © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig Tutorial “Lift your data”, Istanbul 2012 6 What do we actually want? • Use the Web like a single global database • web of documents -> web of data Geospatial Statistical Database Database (Spain) (Spain) © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig 7 Tutorial “Lift your data”, Istanbul 2012 7 Linked Data enables such Web of Data Global Identifier: URI (Uniform Resource Identifier), which is a string of characters used to identify a name or a resource on the Internet.
    [Show full text]
  • Historical Gazetteer System Integration: CHGIS, Regnum Francorum, and Geonames
    p. 1 Historical Gazetteer System Integration: CHGIS, Regnum Francorum, and GeoNames Merrick Lex Berman (CGA, Harvard University) Johan Åhlfeldt (Regnum Francorum Online) Marc Wick 24 Jul 2014 [draft chapter for submission to publisher] Abstract: Integrating digital gazetteers involves the matching of place name records, disambiguation of unique places and conflation of duplicates or variant place names. The challenge of mapping between historical instances of place names is also an ongoing concern for several important projects dealing with ancient place names. Here the matching of historical place names from two unrelated datasets (for China and France) to gazetteer web services is undertaken using a basic geospatial and geonomial algorithm. The quantitative results of the matching trials are considered, problems in dealing with vernacular scripts considered, and practical implications for integrating historical gazetteers discussed. 1. Gazetteer Web Services and Digital Historical Gazetteers Online gazetteers provide essential services. They serve as authority records for geocoding place names and for retrieval of place names associated with real world coordinates (reverse geocoding). For example, the expanding interconnections of LinkedOpenData [LOD] 1 on the semantic web have consistently placed GeoNames 2 at or near the center of the semantic web's social graph. The centrality of GeoNames is largely due to three factors: first, it is a free and open Application Programming Interface [API]; second, the API is simple and easy-to-use; and third, it is currently the only global geographic resource with stable URIs. Traffic for the GeoNames web service topped twenty million requests per day (in 2012); and since half of these requests are from smart phones, 3 it is clear that geographic information retrieval [GIR] is being p.
    [Show full text]
  • The Resource Description Framework and Its Schema Fabien Gandon, Reto Krummenacher, Sung-Kook Han, Ioan Toma
    The Resource Description Framework and its Schema Fabien Gandon, Reto Krummenacher, Sung-Kook Han, Ioan Toma To cite this version: Fabien Gandon, Reto Krummenacher, Sung-Kook Han, Ioan Toma. The Resource Description Frame- work and its Schema. Handbook of Semantic Web Technologies, 2011, 978-3-540-92912-3. hal- 01171045 HAL Id: hal-01171045 https://hal.inria.fr/hal-01171045 Submitted on 2 Jul 2015 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The Resource Description Framework and its Schema Fabien L. Gandon, INRIA Sophia Antipolis Reto Krummenacher, STI Innsbruck Sung-Kook Han, STI Innsbruck Ioan Toma, STI Innsbruck 1. Abstract RDF is a framework to publish statements on the web about anything. It allows anyone to describe resources, in particular Web resources, such as the author, creation date, subject, and copyright of an image. Any information portal or data-based web site can be interested in using the graph model of RDF to open its silos of data about persons, documents, events, products, services, places etc. RDF reuses the web approach to identify resources (URI) and to allow one to explicitly represent any relationship between two resources.
    [Show full text]
  • Aligning Ontologies of Linked Data
    Chapter 1 Aligning Ontologies of Linked Data Rahul Parundekar University of Southern California Craig A. Knoblock University of Southern California Jos´eLuis Ambite University of Southern California 1.1 Introduction ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 1 1.2 Linked Data Sources with Heterogeneous Ontologies :::::::::::::::::::: 3 1.3 Finding Alignments Across Ontologies ::::::::::::::::::::::::::::::::::: 3 1.3.1 Source Preprocessing :::::::::::::::::::::::::::::::::::::::::::::: 4 1.3.2 Aligning Atomic Restriction Classes :::::::::::::::::::::::::::::: 4 1.3.3 Aligning Conjunctive Restriction Classes ::::::::::::::::::::::::: 6 1.3.4 Eliminating Implied Alignments :::::::::::::::::::::::::::::::::: 7 1.3.5 Finding Concept Coverings ::::::::::::::::::::::::::::::::::::::: 10 1.3.6 Curating Linked Data ::::::::::::::::::::::::::::::::::::::::::::: 11 1.4 Results :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 12 1.4.1 Representative Examples of Atomic Alignments ::::::::::::::::: 12 1.4.2 Representative Examples of Conjunctive Alignments :::::::::::: 14 1.4.3 Representative Examples of Concept Coverings :::::::::::::::::: 14 1.4.4 Outliers :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 15 1.4.5 Precision and Recall of Country Alignments ::::::::::::::::::::: 15 1.4.6 Representative Alignments from Other Domains ::::::::::::::::: 16 1.5 Related Work ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 16 1.6 Conclusion :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
    [Show full text]
  • Geolod: a Spatial Linked Data Catalog and Recommender
    big data and cognitive computing Article GeoLOD: A Spatial Linked Data Catalog and Recommender Vasilis Kopsachilis *,† and Michail Vaitis † Department of Geography, University of the Aegean, GR-811 00 Mytilene, Greece; [email protected] * Correspondence: [email protected] † Current address: Department of Geography, University Hill, GR-811 00 Mytilene, Greece. Abstract: The increasing availability of linked data poses new challenges for the identification and retrieval of the most appropriate data sources that meet user needs. Recent dataset catalogs and recommenders provide advanced methods that facilitate linked data search, but none exploits the spatial characteristics of datasets. In this paper, we present GeoLOD, a web catalog of spatial datasets and classes and a recommender for spatial datasets and classes possibly relevant for link discovery processes. GeoLOD Catalog parses, maintains and generates metadata about datasets and classes provided by SPARQL endpoints that contain georeferenced point instances. It offers text and map- based search functionality and dataset descriptions in GeoVoID, a spatial dataset metadata template that extends VoID. GeoLOD Recommender pre-computes and maintains, for all identified spatial classes in the Web of Data (WoD), ranked lists of classes relevant for link discovery. In addition, the on-the-fly Recommender allows users to define an uncatalogued SPARQL endpoint, a GeoJSON or a Shapefile and get class recommendations in real time. Furthermore, generated recommendations can be automatically exported in SILK and LIMES configuration files in order to be used for a link discovery task. In the results, we provide statistics about the status and potential connectivity of spatial datasets in the WoD, we assess the applicability of the recommender, and we present the outcome of a system usability study.
    [Show full text]
  • Introduction Au Web Sémantique
    INTRODUCTION AU WEB SÉMANTIQUE Philippe GENOUD – Danielle ZIEBELIN - LIG-Steamer [email protected] 1 https://www.w3.org/standards/semanticweb/ In addition to the classic “Web of documents” W3C is helping to build a technology stack to support a “Web of data,” the sort of data you find in databases. The ultimate goal of the Web of data is to enable computers to do more useful work and to develop systems that can support trusted interactions over the network. The term “Semantic Web” refers to W3C’s vision of the Web of linked data. Semantic Web technologies enable people to create data stores on the Web, build vocabularies, and write rules for handling data. Linked data are empowered by technologies such as RDF, SPARQL, OWL, and SKOS. 2 WEB architecture URL www.google.com Hyper Link HTML <a href="http://fr.wikipedia.oorg/wiki/Georges_Brassens>..</a> fr.wikipedia.org 3 Web Sémantique – Ontologies P. Genoud D. Ziébelin Web evolution * * ? *d’après le cours Technologie du Web – A. Hombiat - Licence Professionnelle Études Statistiques et Systèmes d'Information Géographique (ESSIG) – UPMF - 2015 4 Web Sémantique – Ontologies P. Genoud D. Ziébelin Web evolution * * ? "The web of human-readable document is being merged with a web of machine understandable data. The potential of the mixture of humans and machines working together and communication through the web could be immense." Tim Berners-Lee, The World Wide Web: A very short personal history, May 1998 http://www.w3.org/People/Berners-Lee/ShortHistory.html 5 Web Sémantique – Ontologies P. Genoud D. Ziébelin From Web of Documents to Web of Data "The web of human-readable document is being merged with a web of machine understandable data." • The traditional web (Web of Documents) is for humans – based on the HTML markup language – HTML describes • what information is presented + how it's presented in conjunction with CSS • how information is linked • but not what the information means – meaning (semantics) of information is derived from available information 6 Web Sémantique – Ontologies P.
    [Show full text]
  • A Semantic Enhanced Model for Effective Spatial Information Retrieval Adeyinka Akanbi, Olusanya Agunbiade, Olumuyiwa Dehinbo, Sadiq Kuti
    A Semantic Enhanced Model for Effective Spatial Information Retrieval Adeyinka Akanbi, Olusanya Agunbiade, Olumuyiwa Dehinbo, Sadiq Kuti To cite this version: Adeyinka Akanbi, Olusanya Agunbiade, Olumuyiwa Dehinbo, Sadiq Kuti. A Semantic Enhanced Model for Effective Spatial Information Retrieval: Un modèle sémantique améliorée for Effective Information Retrieval spatiale. Proceedings of the World Congress on Engineering and Computer Science WCECS 2014, IAENG, Oct 2014, San Francisco, United States. pp.6. hal-01252644 HAL Id: hal-01252644 https://hal.archives-ouvertes.fr/hal-01252644 Submitted on 8 Jan 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Proceedings of the World Congress on Engineering and Computer Science 2014 Vol I WCECS 2014, 22-24 October, 2014, San Francisco, USA A Semantic Enhanced Model for Effective Spatial Information Retrieval Adeyinka K. Akanbi, Olusanya Y. Agunbiade, Olumuyiwa J. Dehinbo, and Sadiq T. Kuti Abstract—A lot of information on the web is geographically detailed sematic referencing showing the spatial relationship referenced. Discovering and retrieving this geographic among entities of the geospatial data for easy accessibility information to satisfy various users needs across both open and by the user. According to [2], Spatial Information Retrieval distributed Spatial Data Infrastructures (SDI) poses eminent is mainly concerned with the provision of access to geo- research challenges.
    [Show full text]
  • The Basic Message Was to Leave out Anything the Provider Does Not
    Europeana Data Model – Mapping Guidelines v2.3 18/11/2016 Co-funded by the European Union Document Scope This is The EDM Mapping Guidelines document. It is part of the family of documents about the Europeana Data Model (EDM) aimed at different audiences. They can all be found at http://pro.europeana.eu/edm-documentation, except the object templates which are at https://github.com/europeana/corelib/wiki/EDMObjectTemplatesProviders The EDM Mapping Guidelines – give guidance for providers wanting to map their data to EDM. They show which property relates to which class and contains definitions of the properties, the data types that can be used as values and the obligation level of each property. It also has an example of original data, the same data converted to EDM and diagrams showing the distribution of the properties amongst the classes. The full set of EDM classes and properties are being implemented incrementally and the Mapping Guidelines is the reference document showing which are currently available. The EDM Definition – the formal specification of the Europeana Data Model and lists the classes and properties that could be used in Europeana. Not all of these are currently implemented. Please refer to the Mapping Guidelines for the current subset of classes and properties in use. The EDM Primer – the “story” of EDM and explains how the classes and properties may be used together to model data and support Europeana functionality. The XML schema and the Schematron rules - this is the XML schema and the required Schematron rules for validating the current implementation of EDM. The EDM object templates – a working document is a simple wiki listing that shows which properties apply to which class and states the data types and obligation of the values.
    [Show full text]
  • Linked Sensor Data
    Wright State University CORE Scholar The Ohio Center of Excellence in Knowledge- Kno.e.sis Publications Enabled Computing (Kno.e.sis) 5-2010 Linked Sensor Data Harshal Kamlesh Patni Wright State University - Main Campus Cory Andrew Henson Wright State University - Main Campus Amit P. Sheth Wright State University - Main Campus, [email protected] Follow this and additional works at: https://corescholar.libraries.wright.edu/knoesis Part of the Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, and the Science and Technology Studies Commons Repository Citation Patni, H. K., Henson, C. A., & Sheth, A. P. (2010). Linked Sensor Data. 2010 International Symposium on Collaborative Technologies and Systems, 362-370. https://corescholar.libraries.wright.edu/knoesis/545 This Conference Proceeding is brought to you for free and open access by the The Ohio Center of Excellence in Knowledge-Enabled Computing (Kno.e.sis) at CORE Scholar. It has been accepted for inclusion in Kno.e.sis Publications by an authorized administrator of CORE Scholar. For more information, please contact library- [email protected]. Linked Sensor Data Harshal Patni, Cory Henson, Amit Sheth Kno.e.sis Center, Department of Computer Science and Engineering, Wright State University, Dayton, Ohio, USA {harshal, cory, amit}@knoesis.org ABSTRACT the WindSpeed measurements from these sensors. However, the data is only accessible to the organization A number of government, corporate, and academic involved in the collection of such data. Such data is very organizations are collecting enormous amounts of data useful for research and analysis, and having access to provided by environmental sensors.
    [Show full text]
  • YAGO2: Exploring and Querying World Knowledge in Time, Space
    YAGO2: Exploring and Querying World Knowledge in Time, Space, Context, and Many Languages Johannes Hoffart, Fabian Suchanek, Klaus Berberich, Edwin Lewis-Kelham, Gerard de Melo, Gerhard Weikum To cite this version: Johannes Hoffart, Fabian Suchanek, Klaus Berberich, Edwin Lewis-Kelham, Gerard de Melo, etal.. YAGO2: Exploring and Querying World Knowledge in Time, Space, Context, and Many Languages. WWW - Demos, 2011, Hyderabad, India. inria-00591780 HAL Id: inria-00591780 https://hal.inria.fr/inria-00591780 Submitted on 10 May 2011 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. YAGO2: Exploring and Querying World Knowledge in Time, Space, Context, and Many Languages Johannes Hoffart Fabian M. Suchanek1∗ Klaus Berberich Edwin Lewis-Kelham Gerard de Melo Gerhard Weikum Max Planck Institute for Informatics, Germany ∗INRIA Saclay, France {jhoffart,kberberi,edwin,gdemelo,weikum} @xmpi-inf.mpg.de [email protected] ABSTRACT person is the president of some country, but heads of state We present YAGO2, an extension of the YAGO knowledge or CEOs of companies change. Similar problems can be base with a focus on temporal and spatial knowledge. It is observed at the spatial level. Purely entity-centric represen- automatically built from Wikipedia, GeoNames, and Word- tations know locations and their locatedIn relations, but do Net, and contains nearly 10 million entities and events, as not consistently attach a geographical dimension to events well as 80 million facts representing general world knowl- and entities.
    [Show full text]