Understanding Metadata

Total Page:16

File Type:pdf, Size:1020Kb

Understanding Metadata Understanding Metadata What is Metadata? .................................................................................................. 1 What Does Metadata Do? ................................................................................ 1 Structuring Metadata ................................................................................. 2 Metadata Schemes and Element Sets ............................................... 3 Dublin Core .....................................................................................................................3 TEI and METS ..............................................................................................................4 MODS .......................................................................................................................5 EAD and LOM......................................................................................................6 <indecs>, ONIX, CDWA, and VRA ..................................................................7 MPEG ..........................................................................................................8 FGDC and DDI ........................................................................................9 Creating Metadata ................................................ 10 Interoperability and Exchange of Metadata ....11 Future Directions .................................... 12 More Information on Metadata ........ 13 Glossary ...................................... 15 Acknowledgements Understanding Metadata is a revision and expansion of Metadata Made Simpler: A guide for libraries published by NISO Press in 2001. NISO Press extends its thanks and appreciation to Rebecca Guenther and Jacqueline Radebaugh, staff members in the Library of Congress Network Development and MARC Standards Office, for sharing their expertise and contributing to this publication. About NISO NISO, a non-profit association accredited by the American National Standards Institute (ANSI), identifies, develops, maintains, and publishes technical standards to manage information in our changing and ever-more digital environment. NISO standards apply both traditional and new technologies to the full range of information-related needs, including retrieval, re-purposing, storage, metadata, and preservation. NISO Standards, information about NISO’s activities and membership are featured on the NISO website <http://www.niso.org>. This booklet is available for free on the NISO website (www.niso.org) and in hardcopy from NISO Press. Published by: NISO Press National Information Standards Organization 4733 Bethesda Avenue, Suite 300 Bethesda, MD 20814 USA Email: [email protected] Tel: 301-654-2512 Fax: 301-654-1721 URL: www.niso.org Copyright © 2004 National Information Standards Organization ISBN: 1-880124-62-9 Understanding Metadata What Is Metadata? administrative data; two that in the headers of image files. sometimes are listed as separate Storing metadata with the object it Metadata is structured infor- metadata types are: describes ensures the metadata will mation that describes, explains, not be lost, obviates problems of locates, or otherwise makes it − Rights management meta- linking between data and metadata, easier to retrieve, use, or manage data, which deals with and helps ensure that the metadata an information resource. Metadata intellectual property rights, and object will be updated together. is often called data about data or and However, it is impossible to embed information about information. − Preservation metadata, which metadata in some types of objects The term metadata is used contains information needed (for example, artifacts). Also, storing differently in different communities. to archive and preserve a metadata separately can simplify Some use it to refer to machine resource. the management of the metadata understandable information, while itself and facilitate search and others use it only for records that Metadata can describe re- retrieval. Therefore, metadata is describe electronic resources. In sources at any level of aggregation. commonly stored in a database the library environment, metadata It can describe a collection, a single system and linked to the objects is commonly used for any formal resource, or a component part of a described. scheme of resource description, larger resource (for example, a applying to any type of object, digital photograph in an article). Just as What Does or non-digital. Traditional library cataloging is a form of metadata; Metadata Do? MARC 21 and the rule sets used Metadata is key An important reason for creating with it, such as AACR2, are to ensuring that descriptive metadata is to facilitate metadata standards. Other discovery of relevant information. In metadata schemes have been resources will addition to resource discovery, developed to describe various types survive and metadata can help organize of textual and non-textual objects continue to be electronic resources, facilitate including published books, interoperability and legacy resource electronic documents, archival accessible into integration, provide digital finding aids, art objects, educational the future. identification, and support archiving and training materials, and scientific and preservation. datasets. Resource Discovery There are three main types of catalogers make decisions about metadata: whether a catalog record should be Metadata serves the same • created for a whole set of volumes functions in resource discovery as Descriptive metadata describes or for each particular volume in the good cataloging does by: a resource for purposes such as set, so the metadata creator makes • allowing resources to be found discovery and identification. It similar decisions. Metadata can also by relevant criteria; can include elements such as be used for description at any level title, abstract, author, and of the information model laid out in • identifying resources; keywords. the IFLA (International Federation • bringing similar resources of Library Associations and • Structural metadata indicates together; how compound objects are put Institutions) Functional Require- together, for example, how ments for Bibliographic Records: • distinguishing dissimilar re- pages are ordered to form work, expression, manifestation, or sources; and chapters. item. For example, a metadata • record could describe a report, a giving location information. • Administrative metadata pro- particular edition of the report, or a Organizing Electronic vides information to help specific copy of that edition of the manage a resource, such as report. Resources when and how it was created, file Metadata can be embedded in As the number of Web-based type and other technical a digital object or it can be stored resources grows exponentially, information, and who can access separately. Metadata is often aggregate sites or portals are it. There are several subsets of embedded in HTML documents and increasingly useful in organizing Page 1 links to resources based on digital object may also be given The latter group developed a audience or topic. Such lists can be using a file name, URL (Uniform framework outlining types of built as static webpages, with the Resource Locator), or some more presentation metadata. A follow-up names and locations of the persistent identifier such as a PURL group, PREMIS (PREservation resources “hardcoded” in the (Persistent URL) or DOI (Digital Metadata: Implementation Strat- HTML. However, it is more efficient Object Identifier). Persistent egies)—also sponsored by OCLC and increasingly more common to identifiers are preferred because and RLG—is developing a set of build these pages dynamically from object locations often change, core elements and strategies for the metadata stored in databases. making the standard URL (and encoding, storage, and manage- Various software tools can be used therefore the metadata record) ment of preservation metadata to automatically extract and invalid. In addition to the actual within a digital preservation system. reformat the information for Web elements that point to the object, the Many of these initiatives are based applications. metadata can be combined to act on or compatible with the ISO as a set of identifying data, Reference Model for an Open Interoperability differentiating one object from Archival Information System Describing a resource with another for validation purposes. (OAIS). metadata allows it to be understood Archiving and by both humans and machines in Structuring Metadata ways that promote interoperability. Preservation Metadata schemes (also called Interoperability is the ability of Most current metadata efforts schema) are sets of metadata multiple systems with different center around the discovery of elements designed for a specific hardware and software platforms, recently created resources. purpose, such as describing a data structures, and interfaces to However, there is a growing particular type of information exchange data with minimal loss of concern that digital resources will resource. The definition or meaning content and functionality. Using not survive in usable form into the of the elements themselves is defined metadata schemes, shared future. Digital information is fragile; known as the semantics of the transfer protocols, and crosswalks it can be corrupted or altered, scheme. The values given to between schemes, resources intentionally or unintentionally. It metadata elements are the content. across the network can be may become unusable
Recommended publications
  • Metadata for Semantic and Social Applications
    etadata is a key aspect of our evolving infrastructure for information management, social computing, and scientific collaboration. DC-2008M will focus on metadata challenges, solutions, and innovation in initiatives and activities underlying semantic and social applications. Metadata is part of the fabric of social computing, which includes the use of wikis, blogs, and tagging for collaboration and participation. Metadata also underlies the development of semantic applications, and the Semantic Web — the representation and integration of multimedia knowledge structures on the basis of semantic models. These two trends flow together in applications such as Wikipedia, where authors collectively create structured information that can be extracted and used to enhance access to and use of information sources. Recent discussion has focused on how existing bibliographic standards can be expressed as Semantic Metadata for Web vocabularies to facilitate the ingration of library and cultural heritage data with other types of data. Harnessing the efforts of content providers and end-users to link, tag, edit, and describe their Semantic and information in interoperable ways (”participatory metadata”) is a key step towards providing knowledge environments that are scalable, self-correcting, and evolvable. Social Applications DC-2008 will explore conceptual and practical issues in the development and deployment of semantic and social applications to meet the needs of specific communities of practice. Edited by Jane Greenberg and Wolfgang Klas DC-2008
    [Show full text]
  • Metadata and GIS
    Metadata and GIS ® An ESRI White Paper • October 2002 ESRI 380 New York St., Redlands, CA 92373-8100, USA • TEL 909-793-2853 • FAX 909-793-5953 • E-MAIL [email protected] • WEB www.esri.com Copyright © 2002 ESRI All rights reserved. Printed in the United States of America. The information contained in this document is the exclusive property of ESRI. This work is protected under United States copyright law and other international copyright treaties and conventions. No part of this work may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying and recording, or by any information storage or retrieval system, except as expressly permitted in writing by ESRI. All requests should be sent to Attention: Contracts Manager, ESRI, 380 New York Street, Redlands, CA 92373-8100, USA. The information contained in this document is subject to change without notice. U.S. GOVERNMENT RESTRICTED/LIMITED RIGHTS Any software, documentation, and/or data delivered hereunder is subject to the terms of the License Agreement. In no event shall the U.S. Government acquire greater than RESTRICTED/LIMITED RIGHTS. At a minimum, use, duplication, or disclosure by the U.S. Government is subject to restrictions as set forth in FAR §52.227-14 Alternates I, II, and III (JUN 1987); FAR §52.227-19 (JUN 1987) and/or FAR §12.211/12.212 (Commercial Technical Data/Computer Software); and DFARS §252.227-7015 (NOV 1995) (Technical Data) and/or DFARS §227.7202 (Computer Software), as applicable. Contractor/Manufacturer is ESRI, 380 New York Street, Redlands, CA 92373- 8100, USA.
    [Show full text]
  • Creating Permissionless Blockchains of Metadata Records Dejah Rubel
    Articles No Need to Ask: Creating Permissionless Blockchains of Metadata Records Dejah Rubel ABSTRACT This article will describe how permissionless metadata blockchains could be created to overcome two significant limitations in current cataloging practices: centralization and a lack of traceability. The process would start by creating public and private keys, which could be managed using digital wallet software. After creating a genesis block, nodes would submit either a new record or modifications to a single record for validation. Validation would rely on a Federated Byzantine Agreement consensus algorithm because it offers the most flexibility for institutions to select authoritative peers. Only the top tier nodes would be required to store a copy of the entire blockchain thereby allowing other institutions to decide whether they prefer to use the abridged version or the full version. INTRODUCTION Several libraries and library vendors are investigating how blockchain could improve activities such as scholarly publishing, content dissemination, and copyright enforcement. A few organizations, such as Katalysis, are creating prototypes or alpha versions of blockchain platforms and products.1 Although there has been some discussion about using blockchains for metadata creation and management, only one company appears to be designing such a product. Therefore, this article will describe how permissionless blockchains of metadata records could be created, managed, and stored to overcome current challenges with metadata creation and management. LIMITATIONS OF CURRENT PRACTICES Metadata standards, processes, and systems are changing to meet twenty-first century information needs and expectations. There are two significant limitations, however, to our current metadata creation and modification practices that have not been addressed: centralization and traceability.
    [Show full text]
  • Metadata Demystified: a Guide for Publishers
    ISBN 1-880124-59-9 Metadata Demystified: A Guide for Publishers Table of Contents What Metadata Is 1 What Metadata Isn’t 3 XML 3 Identifiers 4 Why Metadata Is Important 6 What Metadata Means to the Publisher 6 What Metadata Means to the Reader 6 Book-Oriented Metadata Practices 8 ONIX 9 Journal-Oriented Metadata Practices 10 ONIX for Serials 10 JWP On the Exchange of Serials Subscription Information 10 CrossRef 11 The Open Archives Initiative 13 Conclusion 13 Where To Go From Here 13 Compendium of Cited Resources 14 About the Authors and Publishers 15 Published by: The Sheridan Press & NISO Press Contributing Editors: Pat Harris, Susan Parente, Kevin Pirkey, Greg Suprock, Mark Witkowski Authors: Amy Brand, Frank Daly, Barbara Meyers Copyright 2003, The Sheridan Press and NISO Press Printed July 2003 Metadata Demystified: A Guide for Publishers This guide presents an overview of evolving classified according to a variety of specific metadata conventions in publishing, as well as functions, such as technical metadata for related initiatives designed to standardize how technical processes, rights metadata for rights metadata is structured and disseminated resolution, and preservation metadata for online. Focusing on strategic rather than digital archiving, this guide focuses on technical considerations in the business of descriptive metadata, or metadata that publishing, this guide offers insight into how characterizes the content itself. book and journal publishers can streamline the various metadata-based operations at work Occurrences of metadata vary tremendously in their companies and leverage that metadata in richness; that is, how much or how little for added exposure through digital media such of the entity being described is actually as the Web.
    [Show full text]
  • Studying Social Tagging and Folksonomy: a Review and Framework
    Studying Social Tagging and Folksonomy: A Review and Framework Item Type Journal Article (On-line/Unpaginated) Authors Trant, Jennifer Citation Studying Social Tagging and Folksonomy: A Review and Framework 2009-01, 10(1) Journal of Digital Information Journal Journal of Digital Information Download date 02/10/2021 03:25:18 Link to Item http://hdl.handle.net/10150/105375 Trant, Jennifer (2009) Studying Social Tagging and Folksonomy: A Review and Framework. Journal of Digital Information 10(1). Studying Social Tagging and Folksonomy: A Review and Framework J. Trant, University of Toronto / Archives & Museum Informatics 158 Lee Ave, Toronto, ON Canada M4E 2P3 jtrant [at] archimuse.com Abstract This paper reviews research into social tagging and folksonomy (as reflected in about 180 sources published through December 2007). Methods of researching the contribution of social tagging and folksonomy are described, and outstanding research questions are presented. This is a new area of research, where theoretical perspectives and relevant research methods are only now being defined. This paper provides a framework for the study of folksonomy, tagging and social tagging systems. Three broad approaches are identified, focusing first, on the folksonomy itself (and the role of tags in indexing and retrieval); secondly, on tagging (and the behaviour of users); and thirdly, on the nature of social tagging systems (as socio-technical frameworks). Keywords: Social tagging, folksonomy, tagging, literature review, research review 1. Introduction User-generated keywords – tags – have been suggested as a lightweight way of enhancing descriptions of on-line information resources, and improving their access through broader indexing. “Social Tagging” refers to the practice of publicly labeling or categorizing resources in a shared, on-line environment.
    [Show full text]
  • PML, an Object Oriented Process Modelling Language
    PML, an Object Oriented Process Modeling Language Prof. Dr.-Ing. Reiner Anderl 1, and Dipl.-Ing. Jochen Raßler 2 1 Prof. Dr.-Ing. Reiner Anderl, Germany, [email protected] 2 Dipl.-Ing. Jochen Raßler, Germany, [email protected] Abstract: Processes are very important for the success within many business fields. They define the proper application of methods, technologies, tools and company structures in order to reach business goals. Important processes to be defined are manufacturing processes or product development processes for example to guarantee the company’s success. Over the last decades many process modeling languages have been developed to cover the needs of process modeling. Those modeling languages have several limitations, mainly they are still procedural and didn’t follow the paradigm change to object oriented modeling and thus often lead to process models, which are difficult to maintain. In previous papers we have introduced PML, Process Modeling Language, and shown it’s usage in process modeling. PML is derived from UML and hence fully object oriented and uses modern modeling techniques. It is based on process class diagrams that describe methods and resources for process modeling. In this paper the modeling language is described in more detail and new language elements will be introduced to develop the language to a generic usable process modeling language. Keywords: process modeling language, PML, UML 1. Introduction As the tendency of enterprises to collaborate growths steadily, industry faces new challenges managing business processes, product development processes, manufacturing processes and much more. Furthermore, discipline spanning product development processes are increasing, e.
    [Show full text]
  • Metadata Standards
    Chapter 3 METADATA STANDARDS This chapter lists the major metadata standards in use or under development. Standards are subdivided into six areas: general, transportation models, educa- tion, media-specific, preservation, and rights. Each standard lists its official URL, sponsoring agency, community of use, purpose and goals, description, potential for information organizations, and key projects. General metadata standards General metadata standards are the most common, well-known, and univer- sally accepted schemas to date. Some are general and meant to be used as universal or common-denominator standards; others are for specific communi- ties with specific information resources. Most of the metadata standards listed here have emerged as practical applications in use, or will probably emerge as the most commonly applied standards, due to broad international support, history and development, or industry application and acceptance. Dublin Core Metadata Initiative (DCMI) http://dublincore.org The Dublin Core Metadata Initiative (DCMI) is managed by an international board of trustees, but most of the direction and maintenance of the standard has been led by the Online Computer Library Center (OCLC) in Dublin, Ohio. Community of use Library Technology Reports Librarians, Web content providers, Web resource creators, metadata creators, and general public. Purpose and goals Dublin Core assists in the discovery and description of Web and electronic resources. It is designed to provide a simple descriptive metadata standard extensible to Web resources of any format or subject domain. Description www.techsource.ala.org Dublin Core is a set of 15 core elements that assist in simple description and discovery of electronic resources. The standards basic principles are the reasons for its success as a viable common-denominator metadata standard for elec- tronic resources.
    [Show full text]
  • A Framework of Guidance for Building Good Digital Collections
    A Framework of Guidance for Building Good Digital Collections 3rd edition December 2007 A NISO Recommended Practice Prepared by the NISO Framework Working Group with support from the Institute of Museum and Library Services About NISO Recommended Practices A NISO Recommended Practice is a recommended "best practice" or "guideline" for methods, materials, or practices in order to give guidance to the user. Such documents usually represent a leading edge, exceptional model, or proven industry practice. All elements of Recommended Practices are discretionary and may be used as stated or modified by the user to meet specific needs. This recommended practice may be revised or withdrawn at any time. For current information on the status of this publication contact the NISO office or visit the NISO website (www.niso.org). Published by National Information Standards Organization (NISO) One North Charles Street, Suite 1905 Baltimore, MD 21201 www.niso.org Copyright © 2007 by the National Information Standards Organization All rights reserved under International and Pan-American Copyright Conventions. For noncommercial purposes only, this publication may be reproduced or transmitted in any form or by any means without prior permission in writing from the publisher, provided it is reproduced accurately, the source of the material is identified, and the NISO copyright status is acknowledged. All inquires regarding translations into other languages or commercial reproduction or distribution should be addressed to: NISO, One North Charles Street, Suite
    [Show full text]
  • A Metadata Registry for Metadata Interoperability
    Data Science Journal, Volume 6, Supplement, 8 July 2007 A METADATA REGISTRY FOR METADATA INTEROPERABILITY Jian-hui Li *, Jia-xin Gao, Ji-nong Dong, Wei Wu, and Yan-fei Hou Computer Network Information Center, Chinese Academy of Sciences *Email: [email protected] ABSTRACT In order to use distributed and heterogeneous scientific databases effectively, semantic heterogeneities have to be detected and resolved. To solve this problem, we propose architecture for managing metadata and metadata schema using a metadata registry. A metadata registry is a place to keep facts about characteristics of data that are necessary for data sharing and exchange in a specific domain. This paper will explore the role of metadata registries and describe some of the experiences of implementing the registry. Keywords: Metadata, Metadata Registry, Interoperability, Crosswalk, Application Profile 1 INTRODUCTION Users and applications can easily find, locate, access, and use distributed and heterogeneous scientific databases with the help of metadata. Metadata are especially important for open access to and sharing of scientific data and databases. Different domains, however, will develop or follow different metadata specifications; even the same domain develops different metadata application profiles based on the same specifications according to their special requirements. Consequently, interoperability of metadata is a major issue for scientific data sharing and exchanging. Metadata Registry is a key solution to solve this problem. The DESIRE (Heery, Gardner, Day, & Patel, 2000), SCHEMAS (UKOLN, SCHEMAS, 2003), and CORES (UKOLN, CORES, 2003) projects are successful examples. Based on the requirements of the scientific databases of the Chinese Academy of Sciences and the Basic Scientific Data Sharing Network, one project of the National Scientific Data Sharing Program, we have designed and developed a metadata registry – the Scientific Database Metadata Registry (SDBMR).
    [Show full text]
  • Extending and Evaluating the Model-Based Product Definition
    NIST GCR 18-015 Extending and Evaluating the Model-based Product Definition Nathan W. Hartman Jesse Zahner Purdue University This publication is available free of charge from: https://doi.org/10.6028/NIST.GCR.18-015 NIST GCR 18-015 Extending and Evaluating the Model-based Product Definition Prepared for Thomas D. Hedberg, Jr. Allison Barnard Feeney U.S. Department of Commerce Engineering Laboratory National Institute of Standards and Technology Gaithersburg, MD 20899-8260 By Nathan W. Hartman Jesse Zahner PLM Center of Excellence Purdue University This publication is available free of charge from: https://doi.org/10.6028/NIST.GCR.18-015 December 2017 U.S. Department of Commerce Wilbur L. Ross, Jr., Secretary National Institute of Standards and Technology Walter Copan, NIST Director and Undersecretary of Commerce for Standards and Technology Disclaimer Any opinions, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect the views of the National Institute of Standards and Technology (NIST). Additionally, neither NIST nor any of its employees make any warranty, expressed or implied, nor assume any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, product, or process included in this publication. The report was prepared under cooperative agreement 70NANB15H311 between the National Institute of Standards and Technology and Purdue University. The statements and conclusions contained in this report are those of the authors and do not imply recommendations or endorsements by the National Institute of Standards and Technology. Certain commercial systems are identified in this report. Such identification does not imply recommendation or endorsement by the National Institute of Standards and Technology.
    [Show full text]
  • 1 1. Opening Page Good Morning Ladies and Gentlemen My Name Is
    1. Opening page Good morning ladies and gentlemen My name is stephen machin I currently work As a data management consultant in mons in belgium Thank you very much for allowing me to speak to you today And thanks for coming along to listen I hope that you will find the presentation informative The purpose of this presentation Is to resolve the ambiguous term metadata Into its constituent concepts Having separated out the concepts We then provide Unambiguous and "systematic" definitions for each And then we give an exact description of the nature of the relationships between them with an equation which emphasises the separation of the concepts but also shows how they are tightly bound together 1 2. Metadata Metadata is a confused and ambiguous concept. several authors have remarked upon this One even goes so far as to say that the word No longer has any meaning Iso 11179 A much quoted metadata standard Says that the word has evolved and no longer has its old traditional meaning and that it now also refers to many other things but the standard explicitly limits it scope to the traditional sense of the word and this means that these things are not described 2 3. 11179: metadata = DE = container If we take a close look at The specifications for iso 11179 "The Metadata registry“ we note that for at least 7 years between 1994 and 2001 In its original formulation It referred to itself as the "data element" registry And the standard describes a data element as being a "container for data" and so ISO 11179 describes a metadata registry a data element registry a data container registry 3 4.
    [Show full text]
  • Semantic Web: a Review of the Field Pascal Hitzler [email protected] Kansas State University Manhattan, Kansas, USA
    Semantic Web: A Review Of The Field Pascal Hitzler [email protected] Kansas State University Manhattan, Kansas, USA ABSTRACT which would probably produce a rather different narrative of the We review two decades of Semantic Web research and applica- history and the current state of the art of the field. I therefore do tions, discuss relationships to some other disciplines, and current not strive to achieve the impossible task of presenting something challenges in the field. close to a consensus – such a thing seems still elusive. However I do point out here, and sometimes within the narrative, that there CCS CONCEPTS are a good number of alternative perspectives. The review is also necessarily very selective, because Semantic • Information systems → Graph-based database models; In- Web is a rich field of diverse research and applications, borrowing formation integration; Semantic web description languages; from many disciplines within or adjacent to computer science, Ontologies; • Computing methodologies → Description log- and a brief review like this one cannot possibly be exhaustive or ics; Ontology engineering. give due credit to all important individual contributions. I do hope KEYWORDS that I have captured what many would consider key areas of the Semantic Web field. For the reader interested in obtaining amore Semantic Web, ontology, knowledge graph, linked data detailed overview, I recommend perusing the major publication ACM Reference Format: outlets in the field: The Semantic Web journal,1 the Journal of Pascal Hitzler. 2020. Semantic Web: A Review Of The Field. In Proceedings Web Semantics,2 and the proceedings of the annual International of . ACM, New York, NY, USA, 7 pages.
    [Show full text]