PREMIS Data Dictionary for Preservation Metadata, Version


PREMIS Data Dictionary for Preservation Metadata, version 3.0
PREMIS Editorial Committee
June 2015 – Revised November 2015
http://www.loc.gov/standards/premis

CONTENTS

Acknowledgments
    PREMIS Editorial Committee members
    Special thanks
PREMIS Web Sites and E-Mail
Introduction
    Background
        Development of the original PREMIS Data Dictionary
        Implementable, core preservation metadata
        PREMIS Maintenance Activity
        Version History
        PREMIS Awards and Recognition
    The PREMIS Data Model
        More on Objects
        More on Events
        More on Agents
        More on Rights
    General Topics on the Structure and Use of the Data Dictionary
        Identifiers
        Relationships between Objects
        Relationships between entities of different types
        The 1:1 principle
    Implementation Considerations
        PREMIS conformance
        Implementation of the data model
        Storing metadata
        Supplying metadata values
        Extensibility
        Date and time formats in PREMIS
The PREMIS Data Dictionary Version 3.0
    Limits to the scope of the Data Dictionary
    Object Entity
        Entity types
        Entity properties
        Entity semantic units
    Event Entity
        Entity properties
        Entity semantic units
    Agent Entity
        Entity properties
        Entity semantic units
    Rights Entity
        Entity properties
        Entity semantic units
Special Topics
    Format information
    Environment
    Object characteristics and composition level: the “onion” model
    Fixity, integrity, authenticity
    Digital signatures
    Non-core metadata
Glossary

ACKNOWLEDGMENTS

PREMIS Editorial Committee members

Rebecca Guenther, Library of Congress, Chair
Karin Bredenberg, Riksarkivet, Swedish National Archives
Angela Dappert, University of Portsmouth
Angela Di Iorio, Sapienza Università di Roma
Leslie Johnston, U.S. National Archives and Records Administration
Devon Landes, HBO
Peter McKinney, National Library of New Zealand
Evelyn McLellan, Artefactual Systems
Tracy Meehleib, Library of Congress
Sébastien Peyrard, Bibliothèque nationale de France
Pauline Sinclair, Preservica
Eld Zierau, Royal Library of Denmark

Special thanks

The following contributed their expertise to previous versions as former members of the PREMIS Editorial Committee:

Steve Bordwell, General Register Office for Scotland
Yair Brama, ExLibris
Olaf Brandt, Koninklijke Bibliotheek, Netherlands
Priscilla Caplan, Florida Center for Library Automation (co-chair of original PREMIS Working Group)
Gerard Clifton, National Library of Australia
Markus Enders, British Library
Noreen Hill, Library and Archives Canada
Karsten Huth, Sächsisches Staatsarchiv, Saxon State Archives
David Lake, U.S. National Archives and Records Administration
Brian Lavoie, OCLC
Yaniv Levi, ExLibris
Bill Leonard, Library and Archives Canada
Rory McLeod, British Library
Robert Sharpe, Preservica
Robert Wolfe, HBO
Zhiwu Xie, Los Alamos National Laboratory
Sally Vermaaten, Statistics New Zealand
Kate Zwaard, U.S. Government Printing Office, Library of Congress

In addition to Editorial Committee members, the following contributed their expertise to the work on the Environment Working Group for PREMIS 3:

Conçalo Antunes, Instituto de Engenharia de Sistemas e Computadores
Artur Caetano, Instituto de Engenharia de Sistemas e Computadores
Carol Chou, Florida Virtual Campus
Janet Delve, University of Portsmouth
Martin Neumann, University of Karlsruhe
Michael Nolan, Intel

In addition to Editorial Committee members, the following contributed their expertise to the work on the Conformance Statement for PREMIS 3:

Jay Gattuso, National Library of New Zealand
Jan Hutař, Archives New Zealand
Amy Kirchhoff, ITHAKA
Joseph Pawletko, New York University

The following people were the original Preservation Metadata: Implementation Strategies (PREMIS) Working Group that developed version 1 of the Data Dictionary:

Priscilla Caplan, Florida Center for Library Automation, co-chair
Rebecca Guenther, Library of Congress, co-chair
Robin Dale, RLG liaison
Brian Lavoie, OCLC liaison
George Barnum, U.S. Government Printing Office
Charles Blair, University of Chicago
Olaf Brandt, Göttingen State and University Library
Mikki Carpenter, Museum of Modern Art
Adam Farquhar, British Library
David Gewirtz, Yale University
Keith Glavash, MIT/DSpace
Andrea Goethals, Florida Center for Library Automation
Cathy Hartman, University of North Texas
Helen Hodgart, British Library
Nancy Hoebelheinrich, Stanford University
Roger Howard, J. Paul Getty Museum
Sally Hubbard, Getty Research Institute