Introduction to Ontology- Based Semantics Goals Service

Total Page:16

File Type:pdf, Size:1020Kb

Introduction to Ontology- Based Semantics Goals Service Goals • To provide some insight into the usefulness of ontologies Introduction to Ontology- • To provide an understanding of the based Semantics features of RDFS and OWL and their use in automated reasoning Semantic Web, ontologies, RDF, OWL, • To provide an outline of the N3 syntax N3 • Use N3 to express RDFS and OWL – sufficient for later examples and exercises on service semantics With thanks to Declan O’Sullivan @ David Lewis @ David Lewis Service Semantics Functional Semantics • WSDL provides the syntax we need for interoperating with a service, but little in the way of semantics • What do ‘origin’ and ‘destination’ strings • Examining this example raises many questions about represent? functional and non-functional semantics – Country, city, airport, restrictions (airline, national, <message name=“getcheapestFlightRequest"> regional)? <part name=“origin" type="xsd:string"/> <part name=“destination" type="xsd:string"/> • What does ‘flight’ string represent? <part name=“date" type="xsd:date"/> – Airline, flight number, plane type? </message> <message name=“getcheapestFlightResponse"> • What does ‘time’ string represent? <part name=“flight" type="xsd:string"/> <part name=“time" type="xsd:time"/> – Departure time? <part name=“cost” type=“xsd:float”/> </message> – Note xsd:time probably is adequate - supports time- <portType name=“cheapestFlight"> zone information <operation name="getCheapestFlight"> <input message=“getcheapestFlightRequest"/> • What does ‘cost’ float represent? <output message=“getcheapestFlightResponse"/> </operation> – Currency, to how many decimal points? </portType> @ David Lewis @ David Lewis Non-functional Semantics Need more than XML Schema • XML is Syntax • Availability – DTDs talk about element nesting – Can we assume 24x7x365 availability globally for web – XML Schema schemas give you data types service unless otherwise stated – need anything else? => write comments! • XML DTDs and XML Schemas are sufficient for exchanging data • Channels between parties who have agreed to meaning of terms beforehand – WSDL binding to SOAP supports this • Domain Semantics are complex: • WSDL says nothing about: – implicit assumptions, hidden semantics – Charging Styles ⇒ sources seem unrelated to the non-expert • Need Structure and Semantics beyond XML trees! – Settlement ⇒ employ richer OO models – Service Quality ⇒ make domain semantics and “glue knowledge” explicit – Security and Trust ⇒ use ontologies to fix terminology and conceptualization – Ownership and Rights ⇒ avoid ambiguities by using formal semantics (logics) @ David Lewis @ David Lewis 1 Informally: What is an Ontology: A definition Ontology? •An "ontology” defines the common words and concepts (meaning) • Defines the terms used to describe and used to describe and represent an area of knowledge. represent an area of knowledge •An ontology can range from a – Taxonomy (knowledge with minimal hierarchy or a parent/child • Used by people, databases, applications structure) to a … that need to share domain information – Thesaurus (words and synonyms) to a …. – Conceptual Model (with more complex knowledge) to a… • Ontologies include computer-usable – Logical Theory (with very rich, complex, consistent and meaningful knowledge). definitions of basic concepts in the domain • A well-formed ontology is one that is expressed in a well-defined and the relationships among them syntax that has a well-defined machine interpretation consistent with the above ontology definition • They encode knowledge in a domain and knowledge that spans domains @ David Lewis @ David Lewis An explicit description of a domain An explicit description of a domain • Constraints or axioms on properties and concepts: animal • Concepts (aka: class, set, type, predicate) – value: integer – event, gene, gammaBurst, – domain: cat atrium, molecule, cat vermin domestic – cardinality: at most 1 animal • Properties of concepts and dog – range: 0 <= X <= 100 cat relationships between them – cows are larger than dogs rodent cow vermin domestic (aka: slot) eats – cats cannot eat only vegetation – Taxonomy: generalisation – cats and dogs are disjoint dog mouse cat ordering among concepts isA, cow partOf, subProcess • Values or concrete domains rodent eats – Relationship, Role or Attribute: isA – integer, strings functionOf, hasActivity location, relationships mouse eats, size @ David Lewis @ David Lewis [Carole Goble, Nigel Shadbolt, Ontologies and the Grid Tutorial] [Carole Goble, Nigel Shadbolt, Ontologies and the Grid Tutorial] An explicit description of a domain Types of Ontologies animal describe very general concepts like space, time, event, which are • Nominals independent of a particular problem or domain. It seems reasonable to have unified top-level ontologies for large – Similar to abstract classes/types communities of users. – Concepts that cannot have instances vermin domestic – Instances that are used in conceptual definitions dog cat – ItalianDog = Dog bornIn Italy cow rodent describe the describe the • Individuals or Instances eats vocabulary related to vocabulary related a generic domain by to a generic task – sulphur, trpA Gene, felix specializing the or activity by felix concepts introduced specializing the • Ontology versus Knowledge Base mouse in the top-level top-level ontology. –An ontology = tom ontologies. concepts+properties+axioms+values+ nominals mickey –A knowledge base = jerry These are the most specific ontologies. Concepts in ontology+instances application ontologies often correspond to roles played by domain entities while performing a certain activity. @ David Lewis @ David Lewis [Carole Goble, Nigel Shadbolt, Ontologies and the Grid Tutorial] [Carole Goble, Nigel Shadbolt, Ontologies and the Grid Tutorial] 2 Ontologies - Some Examples Semantic Web • General purpose ontologies: • Meta-Ontologies – WordNet / EuroWordNet, – Semantic Translation, • Initiative of W3C, strongly driven by Sir Tim Berners Lee http://www.cogsci.princeton.edu/~wn http://www.ecimf.org/contrib/o nto/ST/index.html – http://www.w3.org/2001/sw/ – The Upper Cyc Ontology, http://www.cyc.com/cyc- 2-1/index.html – RDFT, • To date Web has been developed as a medium for document http://www.cs.vu.nl/~borys/RD – IEEE Standard Upper Ontology, http://suo.ieee.org/ FT/0.27/RDFT.rdfs manipulation by people • Domain and application-specific – Evolution Ontology, • Aim to augment the web pages with data targeted at computers, and http://kaon.semanticweb.org/e add documents targeted solely at computers, e.g. semantic web ontologies: xamples/Evolution.rdfs – RDF Site Summary RSS, service descriptions http://groups.yahoo.com/group/rss- • Ontologies in a wider • Challenges dev/files/schema.rdf sense – UMLS, http://www.nlm.nih.gov/research/umls/ – Agreeing on standard Markup Language… OWL standardisation – Agrovoc, progressing – KA2 / Science Ontology, http://www.fao.org/agrovoc/ http://ontobroker.semanticweb.org/ontos/ka2.html – Art and Architecture, – Generating sufficient common ontologies that can be referred to people – RETSINA Calendering Agent, http://www.getty.edu/research/ in their own ontologies (e.g. definition of food, definition of person etc.) http://ilrt.org/discovery/2001/06/schemas/ical- tools/vocabulary/aat/ – Availability of indexing crawlers or web service repositories to take full/hybrid.rdf – UNSPSC, advantage of new approach – AIFB Web Page Ontology, http://eccma.org/unspsc/ – Tool support for people creating own ontologies http://ontobroker.semanticweb.org/ontos/aifb.html – DTD standardizations, e.g. – Web-KB Ontology, http://www- HR-XML, http://www.hr- – Tool support for automatic markup of documents for end users, e.g. 2.cs.cmu.edu/afs/cs.cmu.edu/project/theo- xml.org/ incorporation into Word or Frontpage etc. 11/www/wwkb/ – Tool support for automatic markup of web services for end users, e.g. – Dublin Core, http://dublincore.org/ incorporation into .NET, J2EE etc. @ David Lewis @ David Lewis [Carole Goble, Nigel Shadbolt, Ontologies and the Grid Tutorial] Resource Description Framework (RDF) Notational Verbosity •RDF is a W3C recommendation that enables encoding, exchange and reuse of structured metadata in XML • Semantic Web relies on a ‘stack’ of languages – Triples of assertions can be expressed using XML tags – XML for document formatting and referencing elements of other – A bit like “Subject, Property, Object” of a sentence document – XML Schema for encoding data types Concept – RDF – semantic description triple Concept Value – RDFS – definition of classes, properties and subclasses in RDF property – OWL – add assertions about relationships between RDF statement classes/properties – E.g. “Cabernet Sauvignon grape”, “is a type of” ,“Wine grape” • All aim to support machine processing and inference <rdf:Description rdf:about="Cabernet Sauvignon grape“> Subject <rdf:type rdf:resource=“#Wine grape" /> over the WWW </rdf:Description> • BUT …. Results in an incredibly verbose XML-based Property Object – Each resource can be assigned a different Universal Resource Identifier (URI) syntax! • Thus different meanings for the same term can be assigned different URIs – Fine for computers, bad for people – Reference: RDF Primer. W3C draft technical note, 2002 – Need less verbosity for sketching out ideas, presenting examples and teaching ontologies @ David Lewis @ David Lewis N3 Ontology Notation Syntactic shortcuts • Compact, text based language for playing • Several objects with same subject and around with ontologies • Python Platform CWM allows inference and predicate separated by
Recommended publications
  • Metadata for Semantic and Social Applications
    etadata is a key aspect of our evolving infrastructure for information management, social computing, and scientific collaboration. DC-2008M will focus on metadata challenges, solutions, and innovation in initiatives and activities underlying semantic and social applications. Metadata is part of the fabric of social computing, which includes the use of wikis, blogs, and tagging for collaboration and participation. Metadata also underlies the development of semantic applications, and the Semantic Web — the representation and integration of multimedia knowledge structures on the basis of semantic models. These two trends flow together in applications such as Wikipedia, where authors collectively create structured information that can be extracted and used to enhance access to and use of information sources. Recent discussion has focused on how existing bibliographic standards can be expressed as Semantic Metadata for Web vocabularies to facilitate the ingration of library and cultural heritage data with other types of data. Harnessing the efforts of content providers and end-users to link, tag, edit, and describe their Semantic and information in interoperable ways (”participatory metadata”) is a key step towards providing knowledge environments that are scalable, self-correcting, and evolvable. Social Applications DC-2008 will explore conceptual and practical issues in the development and deployment of semantic and social applications to meet the needs of specific communities of practice. Edited by Jane Greenberg and Wolfgang Klas DC-2008
    [Show full text]
  • Provenance and Annotations for Linked Data
    Proc. Int’l Conf. on Dublin Core and Metadata Applications 2013 Provenance and Annotations for Linked Data Kai Eckert University of Mannheim, Germany [email protected] Abstract Provenance tracking for Linked Data requires the identification of Linked Data resources. Annotating Linked Data on the level of single statements requires the identification of these statements. The concept of a Provenance Context is introduced as the basis for a consistent data model for Linked Data that incorporates current best-practices and creates identity for every published Linked Dataset. A comparison of this model with the Dublin Core Abstract Model is provided to gain further understanding, how Linked Data affects the traditional view on metadata and to what extent our approach could help to mediate. Finally, a linking mechanism based on RDF reification is developed to annotate single statements within a Provenance Context. Keywords: Provenance; Annotations; RDF; Linked Data; DCAM; DM2E; 1. Introduction This paper addresses two challenges faced by many Linked Data applications: How to provide, access, and use provenance information about the data; and how to enable data annotations, i.e., further statements about the data, subsets of the data, or even single statements. Both challenges are related as both require the existence of identifiers for the data. We use the Linked Data infrastructure that is currently developed in the DM2E project as an example with typical use- cases and resulting requirements. 1.1. Foundations Linked Data, the publication of data on the Web, enables easy access to data and supports the reuse of data. The Hypertext Transfer Protocol (HTTP) is used to access a Uniform Resource Identifier (URI) and to retrieve data about the resource.
    [Show full text]
  • Rdfa in XHTML: Syntax and Processing Rdfa in XHTML: Syntax and Processing
    RDFa in XHTML: Syntax and Processing RDFa in XHTML: Syntax and Processing RDFa in XHTML: Syntax and Processing A collection of attributes and processing rules for extending XHTML to support RDF W3C Recommendation 14 October 2008 This version: http://www.w3.org/TR/2008/REC-rdfa-syntax-20081014 Latest version: http://www.w3.org/TR/rdfa-syntax Previous version: http://www.w3.org/TR/2008/PR-rdfa-syntax-20080904 Diff from previous version: rdfa-syntax-diff.html Editors: Ben Adida, Creative Commons [email protected] Mark Birbeck, webBackplane [email protected] Shane McCarron, Applied Testing and Technology, Inc. [email protected] Steven Pemberton, CWI Please refer to the errata for this document, which may include some normative corrections. This document is also available in these non-normative formats: PostScript version, PDF version, ZIP archive, and Gzip’d TAR archive. The English version of this specification is the only normative version. Non-normative translations may also be available. Copyright © 2007-2008 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark and document use rules apply. Abstract The current Web is primarily made up of an enormous number of documents that have been created using HTML. These documents contain significant amounts of structured data, which is largely unavailable to tools and applications. When publishers can express this data more completely, and when tools can read it, a new world of user functionality becomes available, letting users transfer structured data between applications and web sites, and allowing browsing applications to improve the user experience: an event on a web page can be directly imported - 1 - How to Read this Document RDFa in XHTML: Syntax and Processing into a user’s desktop calendar; a license on a document can be detected so that users can be informed of their rights automatically; a photo’s creator, camera setting information, resolution, location and topic can be published as easily as the original photo itself, enabling structured search and sharing.
    [Show full text]
  • CODATA Workshop on Big Data Programme Book
    Sponsor I CODACODATA S UU Co-Sponsors Organizer Workshop on Big Data for International Scientific Programmes CONTENTS I. Sponsoring Organizations International Council for Science (ICSU) I. Sponsoring Organizations 2 The International Council for Science (ICSU) is a the international scientific community to II. Programme 7 non-governmental organization with a global strengthen international science for the benefit of membership of national scientific bodies society. (121members, representing 141 countries) and international scientific unions (31 members). ICSU: www.icsu.org III. Remarks and Abstracts 13 ICSU mobilizes the knowledge and resources of Committee on Data for Science and Technology (CODATA) IV. Short Biography of Speakers 28 I CODACODATA S UU V. Conference Venue Layout 41 CODATA, the ICSU Committee on Data for Science challenges and ‘hot topics’ at the frontiers and Technology, was established in 1966 to meet of data science (through CODATA Task a need for an international coordinating body to Groups and Working Groups and other improve the management and preservation of initiatives). VI. General Information 43 scientific data. CODATA has been at the forefront 3. Developing data strategies for international of data science and data policy issues since that science programmes and supporting ICSU date. activities such as Future Earth and Integrated About Beijing 43 Research on Disaster Risk (IRDR) to address CODATA supports ICSU’s mission of ‘strengthening data management needs. international science for the benefit of society’ by ‘promoting improved scientific and technical data Through events like the Workshop on Big Data for About the Workshop Venue 43 management and use’. CODATA achieves this International Scientific Programmes and mission through three strands of activity: SciDataCon 2014, CODATA collaborates with 1.
    [Show full text]
  • Folksonomies - Cooperative Classification and Communication Through Shared Metadata
    Folksonomies - Cooperative Classification and Communication Through Shared Metadata Adam Mathes Computer Mediated Communication - LIS590CMC Graduate School of Library and Information Science University of Illinois Urbana-Champaign December 2004 Abstract This paper examines user-generated metadata as implemented and applied in two web services designed to share and organize digital me- dia to better understand grassroots classification. Metadata - data about data - allows systems to collocate related information, and helps users find relevant information. The creation of metadata has generally been approached in two ways: professional creation and author creation. In li- braries and other organizations, creating metadata, primarily in the form of catalog records, has traditionally been the domain of dedicated profes- sionals working with complex, detailed rule sets and vocabularies. The primary problem with this approach is scalability and its impracticality for the vast amounts of content being produced and used, especially on the World Wide Web. The apparatus and tools built around professional cataloging systems are generally too complicated for anyone without spe- cialized training and knowledge. A second approach is for metadata to be created by authors. The movement towards creator described docu- ments was heralded by SGML, the WWW, and the Dublin Core Metadata Initiative. There are problems with this approach as well - often due to inadequate or inaccurate description, or outright deception. This paper examines a third approach: user-created metadata, where users of the documents and media create metadata for their own individual use that is also shared throughout a community. 1 The Creation of Metadata: Professionals, Con- tent Creators, Users Metadata is often characterized as “data about data.” Metadata is information, often highly structured, about documents, books, articles, photographs, or other items that is designed to support specific functions.
    [Show full text]
  • Extending the Role of Metadata in a Digital Library System*
    Extending the Role of Metadata in a Digital Library System* Alexa T. McCray, Marie E. Gallagher, Michael A. Flannick National Library of Medicine Bethesda, MD, USA {mccray,gallagher,flannick}@nlm.nih.gov Abstract Metadata efforts often fall into the trap of We describe an approach to the development of a digital trying to create a universal metadata schema. library system that is founded on a number of basic Such efforts fail to recognize the basic nature principles. In particular, we discuss the critical role of of metadata: namely, that it is far too diverse metadata in all aspects of the system design. We begin by to fit into one useful taxonomy...the creation, describing how the notion of metadata is sometimes administration, and enhancement of individual interpreted and go on to discuss some of our early metadata forms should be left to the relevant experiences in a digital conversion project. We report on communities of expertise. Ideally this would the Profiles in Science project, which is making the occur within a framework that will support archival collections of prominent biomedical scientists interoperability across data and domains. available on the World Wide Web. We discuss the [4:277] principles that are used in our system design, illustrating Roszkowski and Lukas describe an approach for these throughout the discussion. Our approach has linking distributed collections of metadata so that they are involved interpreting metadata in its broadest sense. We searchable as a single collection [5], and Baldonado et al. capture data about the items in our digital collection for a describe an architecture that facilitates metadata a variety of purposes and use those data to drive the compatibility and interoperability [6].
    [Show full text]
  • A Blockchain-Based Application to Protect Minor Artworks
    A Blockchain-based Application to Protect Minor Artworks Clara Bacciu, Angelica Lo Duca, Andrea Marchetti IIT-CNR - via G. Moruzzi 1, 56124 Pisa (Italy) [name].[surname]@iit.cnr.it Keywords: blockchain, Ethereum, recordkeeping, IPFS, Cultural Heritage, minor artwork Abstract: A new emerging trend concerns the implementation of services and distributed applications through the blockchain technology. A blockchain is an append-only database, which guarantees security, transparency and immutability of records. Blockchains can be used in the field of Cultural Heritage to protect minor artworks, i.e. artistic relevant works not as famous as masterpieces. Minor artworks are subjected to counterfeiting, thefts and natural disasters because they are not well protected as famous artworks. This paper describes a blockchain-based application, called MApp (Minor Artworks application), which lets authenticated users (pri- vate people or organizations), store the information about their artworks in a secure way. The use of blockchain produces three main advantages. Firstly, artworks cannot be deleted from the register thus preventing thieves to remove records associated stolen objects. Secondly, artworks can be added and updated only by authorized users, thus preventing counterfeiting in objects descriptions. Finally, records can be used to keep artworks memory in case of destruction caused by a natural disaster. 1 INTRODUCTION cessible. To complicate things, in the last decades, Italy (as many other places in the world) has suffered Minor artworks are works that are artistically rel- a series of catastrophic events that largely affected evant but not as well-known as famous masterpieces, cultural heritage: earthquakes, landslides, floods, col- or belonging to the so-called minor arts, such as books lapses in important archaeological sites such as Pom- and manuscripts, pottery, lacquerware, furniture, jew- pei.
    [Show full text]
  • Integrating Dublin Core Metadata for Cultural Heritage Collections Using Ontologies**
    2007 Proc. Int’l Conf. on Dublin Core and Metadata Applications Integrating Dublin Core metadata for cultural heritage ** collections using ontologies Constantia Kakali Irene Lourdi Thomais Stasinopoulou Department of Archive and Department of Archive and Department of Archive and Library Sciences, Ionian Library Sciences, Ionian Library Sciences, Ionian University, Greece University, Greece University, Greece [email protected] [email protected] [email protected] Lina Bountouri Christos Papatheodorou Martin Doerr Department of Archive and Department of Archive and Institute of Computer Library Sciences, Ionian Library Sciences, Ionian Science, Foundation for University, Greece University, Greece Research and Technology, [email protected] [email protected] Greece [email protected] Manolis Gergatsoulis Department of Archive and Library Sciences, Ionian University, Greece [email protected] Abstract Metadata interoperability is an active research area, especially for cultural heritage collections, which consist of heterogeneous objects described by a variety of metadata schemas. In this paper we propose an ontology-based metadata interoperability approach, which exploits, in an optimal way, the semantics of metadata schemas. In particular, we propose the use of CIDOC/CRM ontology as a mediating schema and present a methodology for mapping DC Type Vocabulary to CIDOC/CRM, demonstrating a real-world effort for ontology-based metadata integration. Keywords: ontology-based integration; metadata interoperability; CIDOC/CRM; DC-Type. 1. Introduction Heterogeneity is one of the main characteristics of cultural heritage collections. Such collections may be composed of text, written on different materials, paintings, photographs, 3D objects, sound recordings, maps or even digital objects. Furthermore, the objects are strongly related with the social and historical events that take place over time.
    [Show full text]
  • Publishing E-Resources of Digital Institutional Repository As Linked Open Data: an Experimental Study
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Library Philosophy and Practice (e-journal) Libraries at University of Nebraska-Lincoln 11-30-2020 Publishing E-resources of Digital Institutional Repository as Linked Open Data: an experimental study Subhendu Kar Vivekananda Satavarshiki Mahavidyalaya, [email protected] Dr. Rajesh Das The University of Burdwan, [email protected] Follow this and additional works at: https://digitalcommons.unl.edu/libphilprac Part of the Library and Information Science Commons Kar, Subhendu and Das, Dr. Rajesh, "Publishing E-resources of Digital Institutional Repository as Linked Open Data: an experimental study" (2020). Library Philosophy and Practice (e-journal). 4699. https://digitalcommons.unl.edu/libphilprac/4699 Publishing E-resources of Digital Institutional Repository as Linked Open Data: an experimental study Subhendu Kar, Librarian, Vivekananda Satavarshiki Mahavidyalaya, Manikpara, Dist, Jhargram, West Bengal, India Pin-721513 And Dr. Rajesh Das, Assistant Professor, Department of Library and Information Science, The University of Burdwan, Burdwan, West Bengal, India, Pin-713104 Abstract Linked open data (LOD) is an essential component in semantic web architecture and is becoming increasingly important over time due to its ability to share and re-use structured data which is both human and computer readable over the web. Currently, many libraries, archives, museums etc. are using open source digital library software to manage and preserve their digital collections. They may also intend to publish their e-resources as “Linked Open Datasets” for further usage. LOD enables the libraries or information centers to publish and share the structured metadata that is generated and maintained with their own bibliographic and authority data in such a way that the other libraries and general community across the world can consume, interact, enrich and share.
    [Show full text]
  • Developing Cultural Heritage Preservation Databases Based on Dublin Core Data Elements
    Developing cultural heritage preservation databases based on Dublin Core data elements Fenella G. France Art Preservation Services Tel: +1 212 722 6300 x4 [email protected] Michael B. Toth R.B. Toth Associates Tel: +1 703 938 4499 [email protected] Abstract: Preserving our cultural heritage – from fragile historic textiles such as national flags to heavy and seemingly solid artifacts recovered from 9/11 – requires careful monitoring of the state of the artifact and environmental conditions. The standard for a web-accessible textile fiber database is established with Dublin Core elements to address the needs of textile conservation. Dynamic metadata and classification standards are also incorporated to allow flexibility in recording changing conditions and deterioration over the life of an object. Dublin core serves as the basis for data sets of information about the changing state of artifacts and environmental conditions. With common metadata standards, such as Dublin Core, this critical preservation knowledge can be utilized by a range of scientists and conservators to determine optimum conditions for slowing the rate of deterioration, as well as comparative use in the preservation of other artifacts. Keywords: Preservation, cultural heritage, Dublin Core, conservation, deterioration, environmental parameters, preservation vocabularies, textiles, fiber, metadata standards, digital library. 1. Introduction Preserving our cultural heritage for future generations is a critical aspect of the stewardship of historical objects. This requires careful monitoring of the state of the artifact and environmental conditions, and standardized storage of this information to support a range of professionals who play a role in the preservation, storage and display of the artifact – including conservators, scientists, curators, and other cultural heritage professionals.
    [Show full text]
  • Choosing a Metadata Type on Metadata Types This Is Our Future; We Can No Longer Rely on Only One Record Structure
    Choosing a Metadata Type On Metadata Types This is our future; we can no longer rely on only one record structure. We must be able to accept many different kinds of bibliographic record structures, from ONIX to Dublin Core to whatever else comes along that contains useful information. To use these record formats we will need rules and guidelines to follow in their application. We need both general rules and schema-specific rules, similar to the way we have used AACR2 to define what information we capture in MARC. – Roy Tennant. “Building a New Bibliographic Infrastructure.”1 Considerations in choosing a metadata type The following questions to consider were put forth by Dorothea Salo, Digital Repository Services Librarian at George Mason University: • What is the problem domain? • Is the choice baked into the system? For example, if you want to use OAI-PMH, Dublin Core is a natural choice. • What are similar projects using? • What else do you have to interoperate with? • What kind of usage infrastructure is there? Is it open or proprietary? Consider the size of the community around this metadata type. • What will this metadata do? • Is it a good standard that encourages good metadata? • Don’t panic; a crosswalk can be used to convert your data to a new metadata standard later.2 Defining the problem domain at the outset is one of the most important considerations. What is the purpose of the system? Is it data retrieval, resource identification, applying access rights to certain metadata, or a combination of these? 3 Defining these requirements up front will make the choice of metadata type more obvious.
    [Show full text]
  • Dublin Core Tutorial
    Using Dublin Core file:///E:/aaa_DL_FUB/Support%20Material/DCMI/Using%20Dubli... Using Dublin Core Creator: Diane Hillmann Date Issued: 2005-11-07 Identifier: http://dublincore.org/documents/2005/11/07/usageguide/ Replaces: http://dublincore.org/documents/2005/08/15/usageguide/ Is Replaced By: Not applicable Latest Version: http://dublincore.org/documents/usageguide/ Translations: http://dublincore.org/resources/translations/ Status of DCMI Recommended Resource Document: Description of This document is intended as an entry point for users of Dublin Core. For non-specialists, it will assist Document: them in creating simple descriptive records for information resources (for example, electronic documents). Specialists may find the document a useful point of reference to the documentation of Dublin Core, as it changes and grows. Table of Contents 1. Introduction 1.1. What is Metadata? 1.2. What is the Dublin Core? 1.3. The Purpose and Scope of This Guide 2. Syntax, Storage and Maintenance Issues 2.1. HTML 2.2. RDF/XML 2.3. Metadata Storage and Maintenance 3. Element Content and Controlled Vocabularies 4. The Elements 5. Dublin Core Qualifiers 6. Appendix, Roles 7. Glossary 8. Bibliography 1. Introduction 1.1. What is Metadata? Metadata has been with us since the first librarian made a list of the items on a shelf of handwritten scrolls. The term "meta" comes from a Greek word that denotes "alongside, with, after, next." More recent Latin and English usage would employ "meta" to denote something transcendental, or beyond nature. Metadata, then, can be thought of as data about other data. It is the Internet-age term for information that librarians traditionally have put into catalogs, and it most commonly refers to descriptive information about Web resources.
    [Show full text]