Metadata Digital Preservation Topical Note 5
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Collecting and Preserving Digital Materials
COLLECTING AND PRESERVING DIGITAL MATERIALS A HOW-TO GUIDE FOR HISTORICAL SOCIETIES BY SOPHIE SHILLING CONTENTS Foreword Preface 1 Introduction 2 Digital material creation Born-digital materials Digitisation 3 Project planning Write a plan Create a workflow Policies and procedures Funding Getting everyone on-board 4 Select Bitstream preservation File formats Image resolution File naming conventions 5 Describe Metadata 6 Ingest Software Digital storage 7 Access and outreach Copyright Culturally sensitive content 8 Community 9 Glossary Bibliography i Foreword FOREWORD How the collection and research landscape has changed!! In 2000 the Federation of Australian Historical Societies commissioned Bronwyn Wilson to prepare a training guide for historical societies on the collection of cultural materials. Its purpose was to advise societies on the need to gather and collect contemporary material of diverse types for the benefit of future generations of researchers. The material that she discussed was essentially in hard copy format, but under the heading of ‘Electronic Media’ Bronwyn included a discussion of video tape, audio tape and the internet. Fast forward to 2018 and we inhabit a very different world because of the digital revolution. Today a very high proportion of the information generated in our technologically-driven society is created and distributed digitally, from emails to publications to images. Increasingly, collecting organisations are making their data available online, so that the modern researcher can achieve much by simply sitting at home on their computer and accessing information via services such as Trove and the increasing body of government and private material that is becoming available on the web. This creates both challenges and opportunities for historical societies. -
Metadata and GIS
Metadata and GIS ® An ESRI White Paper • October 2002 ESRI 380 New York St., Redlands, CA 92373-8100, USA • TEL 909-793-2853 • FAX 909-793-5953 • E-MAIL [email protected] • WEB www.esri.com Copyright © 2002 ESRI All rights reserved. Printed in the United States of America. The information contained in this document is the exclusive property of ESRI. This work is protected under United States copyright law and other international copyright treaties and conventions. No part of this work may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying and recording, or by any information storage or retrieval system, except as expressly permitted in writing by ESRI. All requests should be sent to Attention: Contracts Manager, ESRI, 380 New York Street, Redlands, CA 92373-8100, USA. The information contained in this document is subject to change without notice. U.S. GOVERNMENT RESTRICTED/LIMITED RIGHTS Any software, documentation, and/or data delivered hereunder is subject to the terms of the License Agreement. In no event shall the U.S. Government acquire greater than RESTRICTED/LIMITED RIGHTS. At a minimum, use, duplication, or disclosure by the U.S. Government is subject to restrictions as set forth in FAR §52.227-14 Alternates I, II, and III (JUN 1987); FAR §52.227-19 (JUN 1987) and/or FAR §12.211/12.212 (Commercial Technical Data/Computer Software); and DFARS §252.227-7015 (NOV 1995) (Technical Data) and/or DFARS §227.7202 (Computer Software), as applicable. Contractor/Manufacturer is ESRI, 380 New York Street, Redlands, CA 92373- 8100, USA. -
Module 8 Wiki Guide
Best Practices for Biomedical Research Data Management Harvard Medical School, The Francis A. Countway Library of Medicine Module 8 Wiki Guide Learning Objectives and Outcomes: 1. Emphasize characteristics of long-term data curation and preservation that build on and extend active data management ● It is the purview of permanent archiving and preservation to take over stewardship and ensure that the data do not become technologically obsolete and no longer permanently accessible. ○ The selection of a repository to ensure that certain technical processes are performed routinely and reliably to maintain data integrity ○ Determining the costs and steps necessary to address preservation issues such as technological obsolescence inhibiting data access ○ Consistent, citable access to data and associated contextual records ○ Ensuring that protected data stays protected through repository-governed access control ● Data Management ○ Refers to the handling, manipulation, and retention of data generated within the context of the scientific process ○ Use of this term has become more common as funding agencies require researchers to develop and implement structured plans as part of grant-funded project activities ● Digital Stewardship ○ Contributions to the longevity and usefulness of digital content by its caretakers that may occur within, but often outside of, a formal digital preservation program ○ Encompasses all activities related to the care and management of digital objects over time, and addresses all phases of the digital object lifecycle 2. Distinguish between preservation and curation ● Digital Curation ○ The combination of data curation and digital preservation ○ There tends to be a relatively strong orientation toward authenticity, trustworthiness, and long-term preservation ○ Maintaining and adding value to a trusted body of digital information for future and current use. -
Creating Permissionless Blockchains of Metadata Records Dejah Rubel
Articles No Need to Ask: Creating Permissionless Blockchains of Metadata Records Dejah Rubel ABSTRACT This article will describe how permissionless metadata blockchains could be created to overcome two significant limitations in current cataloging practices: centralization and a lack of traceability. The process would start by creating public and private keys, which could be managed using digital wallet software. After creating a genesis block, nodes would submit either a new record or modifications to a single record for validation. Validation would rely on a Federated Byzantine Agreement consensus algorithm because it offers the most flexibility for institutions to select authoritative peers. Only the top tier nodes would be required to store a copy of the entire blockchain thereby allowing other institutions to decide whether they prefer to use the abridged version or the full version. INTRODUCTION Several libraries and library vendors are investigating how blockchain could improve activities such as scholarly publishing, content dissemination, and copyright enforcement. A few organizations, such as Katalysis, are creating prototypes or alpha versions of blockchain platforms and products.1 Although there has been some discussion about using blockchains for metadata creation and management, only one company appears to be designing such a product. Therefore, this article will describe how permissionless blockchains of metadata records could be created, managed, and stored to overcome current challenges with metadata creation and management. LIMITATIONS OF CURRENT PRACTICES Metadata standards, processes, and systems are changing to meet twenty-first century information needs and expectations. There are two significant limitations, however, to our current metadata creation and modification practices that have not been addressed: centralization and traceability. -
Metadata Demystified: a Guide for Publishers
ISBN 1-880124-59-9 Metadata Demystified: A Guide for Publishers Table of Contents What Metadata Is 1 What Metadata Isn’t 3 XML 3 Identifiers 4 Why Metadata Is Important 6 What Metadata Means to the Publisher 6 What Metadata Means to the Reader 6 Book-Oriented Metadata Practices 8 ONIX 9 Journal-Oriented Metadata Practices 10 ONIX for Serials 10 JWP On the Exchange of Serials Subscription Information 10 CrossRef 11 The Open Archives Initiative 13 Conclusion 13 Where To Go From Here 13 Compendium of Cited Resources 14 About the Authors and Publishers 15 Published by: The Sheridan Press & NISO Press Contributing Editors: Pat Harris, Susan Parente, Kevin Pirkey, Greg Suprock, Mark Witkowski Authors: Amy Brand, Frank Daly, Barbara Meyers Copyright 2003, The Sheridan Press and NISO Press Printed July 2003 Metadata Demystified: A Guide for Publishers This guide presents an overview of evolving classified according to a variety of specific metadata conventions in publishing, as well as functions, such as technical metadata for related initiatives designed to standardize how technical processes, rights metadata for rights metadata is structured and disseminated resolution, and preservation metadata for online. Focusing on strategic rather than digital archiving, this guide focuses on technical considerations in the business of descriptive metadata, or metadata that publishing, this guide offers insight into how characterizes the content itself. book and journal publishers can streamline the various metadata-based operations at work Occurrences of metadata vary tremendously in their companies and leverage that metadata in richness; that is, how much or how little for added exposure through digital media such of the entity being described is actually as the Web. -
Studying Social Tagging and Folksonomy: a Review and Framework
Studying Social Tagging and Folksonomy: A Review and Framework Item Type Journal Article (On-line/Unpaginated) Authors Trant, Jennifer Citation Studying Social Tagging and Folksonomy: A Review and Framework 2009-01, 10(1) Journal of Digital Information Journal Journal of Digital Information Download date 02/10/2021 03:25:18 Link to Item http://hdl.handle.net/10150/105375 Trant, Jennifer (2009) Studying Social Tagging and Folksonomy: A Review and Framework. Journal of Digital Information 10(1). Studying Social Tagging and Folksonomy: A Review and Framework J. Trant, University of Toronto / Archives & Museum Informatics 158 Lee Ave, Toronto, ON Canada M4E 2P3 jtrant [at] archimuse.com Abstract This paper reviews research into social tagging and folksonomy (as reflected in about 180 sources published through December 2007). Methods of researching the contribution of social tagging and folksonomy are described, and outstanding research questions are presented. This is a new area of research, where theoretical perspectives and relevant research methods are only now being defined. This paper provides a framework for the study of folksonomy, tagging and social tagging systems. Three broad approaches are identified, focusing first, on the folksonomy itself (and the role of tags in indexing and retrieval); secondly, on tagging (and the behaviour of users); and thirdly, on the nature of social tagging systems (as socio-technical frameworks). Keywords: Social tagging, folksonomy, tagging, literature review, research review 1. Introduction User-generated keywords – tags – have been suggested as a lightweight way of enhancing descriptions of on-line information resources, and improving their access through broader indexing. “Social Tagging” refers to the practice of publicly labeling or categorizing resources in a shared, on-line environment. -
2016 Technical Guidelines for Digitizing Cultural Heritage Materials
September 2016 Technical Guidelines for Digitizing Cultural Heritage Materials Creation of Raster Image Files i Document Information Title Editor Technical Guidelines for Digitizing Cultural Heritage Materials: Thomas Rieger Creation of Raster Image Files Document Type Technical Guidelines Publication Date September 2016 Source Documents Title Editors Technical Guidelines for Digitizing Cultural Heritage Materials: Don Williams and Michael Creation of Raster Image Master Files Stelmach http://www.digitizationguidelines.gov/guidelines/FADGI_Still_Image- Tech_Guidelines_2010-08-24.pdf Document Type Technical Guidelines Publication Date August 2010 Title Author s Technical Guidelines for Digitizing Archival Records for Electronic Steven Puglia, Jeffrey Reed, and Access: Creation of Production Master Files – Raster Images Erin Rhodes http://www.archives.gov/preservation/technical/guidelines.pdf U.S. National Archives and Records Administration Document Type Technical Guidelines Publication Date June 2004 This work is available for worldwide use and reuse under CC0 1.0 Universal. ii Table of Contents INTRODUCTION ........................................................................................................................................... 7 SCOPE .......................................................................................................................................................... 7 THE FADGI STAR SYSTEM ....................................................................................................................... -
Repurposing Archival Theory in the Practice of Data Curation
Repurposing Archival Theory in the Practice of Data Curation Elizabeth Rolando| Wendy Hagenmaier |Susan Wells Parham Introduction Methodology • Expansion of data curation and digital archiving services at the Georgia Tech Library • Process the same digital collection, once by data curator, once by digital archivist and Archives. • Data curation processing informed by OAIS Reference Model1, ICPSR workflow2, and • How do data curation and archival science intersect? UK Data Archive workflow3 • How can comparing data curation and archival science lead to improvements in • Archival processing informed by concepts, such as appraisal, respect des fonds, local workflows and practices? original order, and archival value4, as well documented practices at peer institutions • Compare processing plans to discover areas of agreement and areas of conflict Data Transfer Data Processing Metadata Processing Preservation Access Unique Data Curation Processing Steps -Deposit agreement modeled on -Format transformation policies -Review and enhancement of -Varied retention periods, -Datasets treated as active and institutional repository license guided by reuse over preservation README file, used to accommodate determined by Board of Regents reusable -Funding model for sustainability -Create derivatives to promote diverse depositor needs Retention Schedule and funding -Datasets linked to publications access and re-use model -Bulk or individual file download -Correct erroneous or missing data Common Processing Steps -Data quarantine -Format identification -
Metadata Creation Practices in Digital Repositories and Collections
Metadata Creation Practices in Digital Repositories and Collections: Schemata, Jung-ran Park and Selection Criteria, and Interoperability Yuji Tosaka This study explores the current state of metadata-creation development of such a mediation mechanism calls for practices across digital repositories and collections by an empirical assessment of various issues surrounding metadata-creation practices. using data collected from a nationwide survey of mostly The critical issues concerning metadata practices cataloging and metadata professionals. Results show across distributed digital collections have been rela- that MARC, AACR2, and LCSH are the most widely tively unexplored. While examining learning objects and used metadata schema, content standard, and subject- e-prints communities of practice, Barton, Currier, and Hey point out the lack of formal investigation of the metadata- controlled vocabulary, respectively. Dublin Core (DC) is creation process.2 As will be discussed in the following the second most widely used metadata schema, followed section, some researchers have begun to assess the current by EAD, MODS, VRA, and TEI. Qualified DC’s wider state of descriptive practices, metadata schemata, and use vis-à-vis Unqualified DC (40.6 percent versus 25.4 content standards. However, the literature has not yet developed to a point where it affords a comprehensive percent) is noteworthy. The leading criteria in selecting picture. Given the propagation of metadata projects, it is metadata and controlled-vocabulary schemata are collec- important to continue to track changes in metadata-cre- tion-specific considerations, such as the types of resources, ation practices while they are still in constant flux. Such efforts are essential for adding new perspectives to digital nature of the collection, and needs of primary users and library research and practices in an environment where communities. -
Geotagging Photos to Share Field Trips with the World During the Past Few
Geotagging photos to share field trips with the world During the past few years, numerous new online tools for collaboration and community building have emerged, providing web-users with a tremendous capability to connect with and share a variety of resources. Coupled with this new technology is the ability to ‘geo-tag’ photos, i.e. give a digital photo a unique spatial location anywhere on the surface of the earth. More precisely geo-tagging is the process of adding geo-spatial identification or ‘metadata’ to various media such as websites, RSS feeds, or images. This data usually consists of latitude and longitude coordinates, though it can also include altitude and place names as well. Therefore adding geo-tags to photographs means adding details as to where as well as when they were taken. Geo-tagging didn’t really used to be an easy thing to do, but now even adding GPS data to Google Earth is fairly straightforward. The basics Creating geo-tagged images is quite straightforward and there are various types of software or websites that will help you ‘tag’ the photos (this is discussed later in the article). In essence, all you need to do is select a photo or group of photos, choose the "Place on map" command (or similar). Most programs will then prompt for an address or postcode. Alternatively a GPS device can be used to store ‘way points’ which represent coordinates of where images were taken. Some of the newest phones (Nokia N96 and i- Phone for instance) have automatic geo-tagging capabilities. These devices automatically add latitude and longitude metadata to the existing EXIF file which is already holds information about the picture such as camera, date, aperture settings etc. -
Don't WARC Away: Preservation Metadata & Web Archives
Don't WARC Away: Preservation Metadata & Web Archives! Jefferson Bailey & Maria LaCalle, Internet Archive ALA 2015 | ALCTS PARS | June 27, 2015 @jefferson_bail | [email protected] Don't WARC Away: Preservation Metadata & Web Archives! Jefferson Bailey & Maria LaCalle, Internet Archive ALA 2015 | ALCTS PARS | June 27, 2015 @jefferson_bail | [email protected] •! We are a non-profit Digital Library & Archive founded in 1996 •! 20+PB unique data: 10PB web, ~8m text, 2m vid, 2m aud, 100K soft, etc •! We work in a former church and it’s awesome •! Developed: Heritrix, Wayback, warcprox, Umbra, NutchWax, ARC format •! Engineers, librarians/archivists, program staff •! https://archive.org/web •! Largest and oldest publicly available web archive in existence •! 485,000,000,000+ URLs (that’s billions) •! Like a billion websites, domain agnostic •! Content in 40+ Languages •! Periodic snapshot; 1b+ URLs per week •! https://archive-it.org/ •! Web archiving service used by 370+ institutions •! 3500+ collection, 10 billion+ URLs •! 49 states and 19 countries •! Libraries, archives, museums, governments, non-profits, etc. •! User groups, Annual Meeting, collaborative and educational projects What is a web archive? •! Web archiving is the process of collecting portions of web content, preserving the collections, and then providing access to the archives - for use and re use. •! A web archive is a collection of archived URLs grouped by theme, event, subject area, or web address. •! A web archive contains as much as possible from the original resources and documents the change over time. It recreates the experience a user would have had if they!had visited the live site on the day it was archived. -
Digital Preservation Metadata for Practitioners Implementing PREMIS
springer.com Computer Science : Computer Applications Dappert, A., Guenther, R.S., Peyrard, S. (Eds.) Digital Preservation Metadata for Practitioners Implementing PREMIS Provides an introduction to fundamental issues related to digital preservation metadata and to its practical use and implementation Bridges the gap between the formal specifications provided in the PREMIS Data Dictionary and specific implementations Addresses the needs of both practitioners and students in Library, Information and Archival Science degree programs or related fields for understanding digital preservation issues This book begins with an introduction to fundamental issues related to digital preservation Springer metadata before proceeding to in-depth coverage of issues concerning its practical use and 1st ed. 2016, XIV, 266 p. 69 implementation. It helps readers to understand which options need to be considered in 1st illus. specifying a digital preservation metadata profile to ensure it matches their individual content edition types, technical infrastructure, and organizational needs. Further, it provides practical guidance and examples, and raises important questions. It does not provide full-fledged implementation solutions, as such solutions can, by definition, only be specific to a given preservation context. Printed book As such, the book effectively bridges the gap between the formal specifications provided in a Hardcover standard, such as the PREMIS Data Dictionary – a de-facto standard that defines the core metadata required by most preservation repositories – and specific implementations.Anybody Printed book who needs to manage digital assets in any form with the intent of preserving them for an Hardcover indefinite period of time will find this book a valuable resource. The PREMIS Data Dictionary ISBN 978-3-319-43761-3 provides a data model consisting of basic entities (objects, agents, events and rights) and basic £ 54,99 | CHF 71,00 | 59,99 € | properties (called “semantic units”) that describe them.