A New Digital Dark Age? Collaborative Web Tools, Social Media and Long-Term Preservation Stuart Jeffrey Version of Record First Published: 05 Dec 2012
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Jamie Shiers, CERN
Investing in Curation A Shared Path to Sustainability [email protected] Data Preservation in HEP (DPHEP) Outline 1. Pick 2 of the messages from the roadmap & comment – I could comment on all – but not in 10’ 1. What (+ve) impact has 4C already had on us? – Avoiding overlap with the above 1. A point for discussion – shared responsiblity / action THE MESSAGES The 4C Roadmap Messages 1. Identify the value of digital assets and make choices 2. Demand and choose more efficient systems 3. Develop scalable services and infrastructure 4. Design digital curation as a sustainable service 5. Make funding dependent on costing digital assets across the whole lifecycle 6. Be collaborative and transparent to drive down costs IMPACT OF 4C ON DPHEP International Collaboration for Data Preservation and Long Term Analysis in High Energy Physics “LHC Cost Model” (simplified) Start with 10PB, then +50PB/year, then +50% every 3y (or +15% / year) 10EB 1EB 6 Case B) increasing archive growth Total cost: ~$59.9M (~$2M / year) 7 1. Identify the value of digital assets and make choices • Today, significant volumes of HEP data are thrown away “at birth” – i.e. via very strict filters (aka triggers) B4 writing to storage To 1st approximation ALL remaining data needs to be kept for a few decades • “Value” can be measured in a number of ways: – Scientific publications / results; – Educational / cultural impact; – “Spin-offs” – e.g. superconductivity, ICT, vacuum technology. Why build an LHC? BEFORE! 1 – Long Tail of Papers 2 – New Theore cal Insights 3 4 3 – “Discovery” to “Precision” Volume: 100PB + ~50PB/year (+400PB/year from 2020) 11 Zimmermann( Alain Blondel TLEP design study r-ECFA 2013-07-20 5 Balance sheet – Tevatron@FNAL • 20 year investment in Tevatron ~ $4B • Students $4B • Magnets and MRI $5-10B } ~ $50B total • Computing $40B Very rough calculation – but confirms our gut feeling that investment in fundamental science pays off I think there is an opportunity for someone to repeat this exercise more rigorously cf. -
Module 8 Wiki Guide
Best Practices for Biomedical Research Data Management Harvard Medical School, The Francis A. Countway Library of Medicine Module 8 Wiki Guide Learning Objectives and Outcomes: 1. Emphasize characteristics of long-term data curation and preservation that build on and extend active data management ● It is the purview of permanent archiving and preservation to take over stewardship and ensure that the data do not become technologically obsolete and no longer permanently accessible. ○ The selection of a repository to ensure that certain technical processes are performed routinely and reliably to maintain data integrity ○ Determining the costs and steps necessary to address preservation issues such as technological obsolescence inhibiting data access ○ Consistent, citable access to data and associated contextual records ○ Ensuring that protected data stays protected through repository-governed access control ● Data Management ○ Refers to the handling, manipulation, and retention of data generated within the context of the scientific process ○ Use of this term has become more common as funding agencies require researchers to develop and implement structured plans as part of grant-funded project activities ● Digital Stewardship ○ Contributions to the longevity and usefulness of digital content by its caretakers that may occur within, but often outside of, a formal digital preservation program ○ Encompasses all activities related to the care and management of digital objects over time, and addresses all phases of the digital object lifecycle 2. Distinguish between preservation and curation ● Digital Curation ○ The combination of data curation and digital preservation ○ There tends to be a relatively strong orientation toward authenticity, trustworthiness, and long-term preservation ○ Maintaining and adding value to a trusted body of digital information for future and current use. -
Archives First: Digital Preservation Further Investigations Into Digital
Archives First: digital preservation Further investigations into digital preservation for local authorities Viv Cothey * 2020 * Gloucestershire County Council ii Not caring about Archives because you have nothing to archive is no different from saying you don’t care about freedom of speech because you have nothing to say. Or that you don’t care about freedom of the press because you don’t like to read. (after Snowden, 2019, p 208) Disclaimer The views and opinions expressed in this report do not necessarily represent those of the institutions to which the author is affiliated. iii iv Executive summary This report is about an investigation into digital preservation by (English) local authorities which was commissioned by the Archives First consortium of eleven local authority record offices or similar memory organisations (Archives). The investigation is partly funded by The National Archives. Archival institutions are uniquely able to serve the public by providing current and future generations with access to authentic unique original records. In the case of local authority Archives these records will include documents related to significant decision making processes and events that bear on individuals and their communities. Archival practice, especially relating to provenance and purposeful preservation, is instrumental in supporting continuing public trust and essential to all of us being able to hold authority to account. The report explains how Archival practice differs from library practice where provenance and purposeful preservation are absent. The current investigation follows an earlier Archives First project in 2016-2017 that investigated local authority digital preservation preparedness. The 2016-2017 investigation revealed that local authority line of business systems in respect of children services, did not support the statutory requirement to retain digital records over the long- term (at least 100 years). -
INLS 752: Digital Preservation and Access
Summer, 2015 Monday-Friday, 8:00-11:30 INLS 752: Digital Preservation and Access The Instructors. Dr. Helen R. Tibbo : 962-8063(w); 929-6248(h) Office: 201 Manning Hall FAX #: (919) 962-8071 : Tibbo at email dot unc dot edu Class Listserv: INLS752- [email protected] Office Hours. I will be available until 12:30 most days. I will probably not be in the office in the afternoons during this intensive course session. Feel free to call me at home in the evening before 9:00 PM. Course Timeline. First Class: Wednesday, May 13, 2015 Last Class: Thursday, May 28, 2015 Final Project Due: Friday, May 29, 2015 Brief Course Description. This course focuses on integrating state-of-the-art information technologies, particularly those related to the digital curation lifecycle, digital repositories, and long-term digital preservation, into the daily operations of archives, records centers, museums, special collections libraries, visual resource collections, historical societies, and other information centers. Issues, topics, and technologies covered will include the promise & challenge of long-term digital preservation and curation; creating durable digital objects, approaches to preservation; development of institutional repositories; image processing; selecting materials for digitization and managing digitization projects; resource allocation and costing, risk management, digitization and metadata; rights management and other legal and ethical issues; digital asset management; standards; file formats; quality control; funding for developing and sustaining digitization projects and programs; and trusted repositories. Goals and Objectives. By the end of the course, the student should be able to: Identify the key events in the history of digital preservation and access. -
Digital Preservation Handbook
Digital Preservation Handbook Digital Preservation Briefing Illustrations by Jørgen Stamp digitalbevaring.dk CC BY 2.5 Denmark Who is it for? Senior administrators (DigCurV Executive Lens), operational managers (DigCurV Manager Lens) and staff (DigCurV Practitioner Lens) within repositories, funding agencies, creators and publishers, anyone requiring an introduction to the subject. Assumed level of knowledge Novice. Purpose To provide a strategic overview and senior management briefing, outlining the broad issues and the rationale for funding to be allocated to the tasks involved in preserving digital resources. To provide a synthesis of current thinking on digital preservation issues. To distinguish between the major categories of issues. To help clarify how various issues will impact on decisions at various stages of the life-cycle of digital materials. To provide a focus for further debate and discussion within organisations and with external audiences. Gold sponsor Silver sponsors Bronze sponsors Reusing this information You may re-use this material in English (not including logos) with required acknowledgements free of charge in any format or medium. See How to use the Handbook for full details of licences and acknowledgements for re-use. For permission for translation into other languages email: [email protected] Please use this form of citation for the Handbook: Digital Preservation Handbook, 2nd Edition, http://handbook.dpconline.org/, Digital Preservation Coalition © 2015. 2 Contents Why Digital Preservation Matters -
The DCC Curation Lifecycle Model Higgins, Sarah
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Aberystwyth Research Portal Aberystwyth University The DCC Curation Lifecycle Model Higgins, Sarah Published in: International Journal of Digital Curation DOI: 10.2218/ijdc.v3i1.48 Publication date: 2008 Citation for published version (APA): Higgins, S. (2008). The DCC Curation Lifecycle Model. International Journal of Digital Curation, 3(1), 134-140. https://doi.org/10.2218/ijdc.v3i1.48 General rights Copyright and moral rights for the publications made accessible in the Aberystwyth Research Portal (the Institutional Repository) are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the Aberystwyth Research Portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the Aberystwyth Research Portal Take down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. tel: +44 1970 62 2400 email: [email protected] Download date: 03. Oct. 2019 134 The DCC Curation Lifecycle Model The International Journal of Digital Curation Issue 1, Volume 3 | 2008 The DCC Curation Lifecycle Model Sarah Higgins, Standards Advisor, Digital Curation Centre, University of Edinburgh June 2008 Summary Lifecycle management of digital materials is necessary to ensure their continuity. -
Web Archiving and You Web Archiving and Us
Web Archiving and You Web Archiving and Us Amy Wickner University of Maryland Libraries Code4Lib 2018 Slides & Resources: https://osf.io/ex6ny/ Hello, thank you for this opportunity to talk about web archives and archiving. This talk is about what stakes the code4lib community might have in documenting particular experiences of the live web. In addition to slides, I’m leading with a list of material, tools, and trainings I read and relied on in putting this talk together. Despite the limited scope of the talk, I hope you’ll each find something of personal relevance to pursue further. “ the process of collecting portions of the World Wide Web, preserving the collections in an archival format, and then serving the archives for access and use International Internet Preservation Coalition To begin, here’s how the International Internet Preservation Consortium or IIPC defines web archiving. Let’s break this down a little. “Collecting portions” means not collecting everything: there’s generally a process of selection. “Archival format” implies that long-term preservation and stewardship are the goals of collecting material from the web. And “serving the archives for access and use” implies a stewarding entity conceptually separate from the bodies of creators and users of archives. It also implies that there is no web archiving without access and use. As we go along, we’ll see examples that both reinforce and trouble these assumptions. A point of clarity about wording: when I say for example “critique,” “question,” or “trouble” as a verb, I mean inquiry rather than judgement or condemnation. we are collectors So, preambles mostly over. -
Introduction to Archives and Digital Curation Fall 2016 | Tuesdays | 6:00 PM – 8:45 PM | HBK 0109 Instructor: Dr
University of Maryland College of Information Studies INST 604: Introduction to Archives and Digital Curation Fall 2016 | Tuesdays | 6:00 PM – 8:45 PM | HBK 0109 Instructor: Dr. Ricardo L. Punzalan, Assistant Professor Office Hours: By appointment | Office: 4117-J Hornbake Bldg., South Wing Office Telephone: 301-405-6518 | Email: [email protected] Archival thinking is an important skill in caring for an increasingly complex, multimedia, and heterogeneous information. This course provides you with an overview of fundamental theories and practices as well as the essential principles and standards that archivists apply in designing and implementing strategies for the preservation and long-term access of information. As a class, we shall examine the changing informational, organizational, societal, and technological landscapes and consider how those changes are affecting archival practices, the information and preservation professions, and the implementation of foundational archival ideas. You will also become acquainted with the values of the archives profession that underlie the mandate to manage and care for a body of information resources in diverse organizational and institutional contexts. This is a foundational course if you are training to become a professional archivist, manuscripts curator, records manager, digital curator, data librarian, etc. Thus, the course will provide you with essential knowledge for pursing a variety of career paths, including: • Professional careers in archives and records management - This course provides you an introduction to the field; introduces terms and concepts that will be used in more advanced courses; and builds a foundation for internships and professional networking. • Careers in related information fields - This course provides you with a survey of broadly applicable concepts used in information management, data curation, information policy, and user services. -
Implications of Preservation of Information in the Digital Age
City Research Online City, University of London Institutional Repository Citation: Roland, L. and Bawden, D. (2012). The Future of History: Investigating the Preservation of Information in the Digital Age. Library & Information History, 28(3), pp. 220- 236. doi: 10.1179/1758348912Z.00000000017 This is the unspecified version of the paper. This version of the publication may differ from the final published version. Permanent repository link: https://openaccess.city.ac.uk/id/eprint/3186/ Link to published version: http://dx.doi.org/10.1179/1758348912Z.00000000017 Copyright: City Research Online aims to make research outputs of City, University of London available to a wider audience. Copyright and Moral Rights remain with the author(s) and/or copyright holders. URLs from City Research Online may be freely distributed and linked to. Reuse: Copies of full items can be used for personal research or study, educational, or not-for-profit purposes without prior permission or charge. Provided that the authors, title and full bibliographic details are credited, a hyperlink and/or URL is given for the original metadata page and the content is not changed in any way. City Research Online: http://openaccess.city.ac.uk/ [email protected] The future of history: implications of preservation of information in the digital age Lena Roland and David Bawden Department of Information Science, City University London Abstract This study investigates the challenges of preserving information in the digital age, and explores how this may affect the future of historical knowledge. The study is based on literature analysis and a series of semi-structured interviews with 41 historians, archivists, librarians, and web researchers. -
Preservation in the Digital Age a Review of Preservation Literature, 2009–10
56(1) LRTS 25 Preservation in the Digital Age A Review of Preservation Literature, 2009–10 Karen F. Gracy and Miriam B. Kahn This paper surveys research and professional literature on preservation-related topics published in 2009 and 2010, identifies key contributions to the field in peri- odicals, monographs, and research reports, and provides a guide to the changing landscape of preservation in the digital age. The authors have organized the reviewed literature into five major areas of interest: tensions in preservation work as libraries embrace digital resources, mass digitization and its effects on collec- tions, risk management and disaster response, digital preservation and curation, and education for preservation in the digital age. his review article critically examines the literature of preservation published T during a two-year period, 2009–10. Almost a decade has passed since the last review of the preservation literature appeared in Library Resources and Technical Services, covering the period of 1999–2001.1 In the interim, the evolu- tion of the preservation field noted by Croft has accelerated, encompassing whole areas of practice that were in their infancy at the turn of the twenty-first century. In her review, Croft identified ten areas of emphasis in the literature: clarifying preservation misconceptions triggered by the publication of Nicholson Baker’s Double Fold; the continued importance of the artifact in the wake of new digital reformatting technologies; remote storage; mass deacidification; physical treat- ment -
Digital Preservation.Pdf
Digital Preservation By Jean-Yves Le Meur project leader of CERN Digital Memory 100 AD End of cuneiform on tablets 350-50 BC Jupiter orbit Tablets -> British Museum 1800- 1300-1400 Jupiter orbit 2014 (again) The missing 29 Jan 2016 ‘rosetta’ tablet Digital preservation in a nutshell ● World wide Landscape ○ Rationale ○ Interesting initiatives ○ Good practices: OAIS ● The different Approaches The Digital “Dark Age” "We are nonchalantly throwing all of our data into what could become an information black hole without realizing it" Vint Cerf (vice-president of Google in Feb 2015) The Digital “Dark Age” ● Very large community worrying about the preservation of digital content ● Digital Preservation Coalition ● Open Preservation Foundation ● UNESCO PERSIST project, EU e-ARK project, National Libraries and Archives ● Many related conferences: iPRES series, etc. “This is not about preserving bits, It is about preserving meaning, much like the Rosetta Stone.” More than 70 major libraries destroyed over time: accidents, disasters, ethnocides How digital data evaporates (I) 1. Physical Obsolescence: Bit rot Ten 2. Redundancy failure Major 3. Technological Obsolescence of readers, formats, OS, HWs New 4. Lost in migrations ! Risks 5. Missing context: no codec ! How digital data evaporates (II) 6. Redundancy failure Ten 7. Economical Failures Major 8. Lost in transitions: people ! New 9. Corruption, mistake or attack 10. Dissipation: out of reach Risks Some examples at CERN ● The very first WWW pages ○ Reconstructed in 2013 - found again in 2018! -
Follow-Up Questions
Follow-Up Questions ASERL Webinar: “Intro to Digital Preservation #2 -- Forbearing the Digital Dark Age: Capturing Metadata for Digital Objects” Speaker = Chris Dietrich, National Park Service Session Recording: https://vimeo.com/63669010 Speaker’s PPT: http://bit.ly/10PKvu8 UPDATED – May 23, 2013 Tools 1. Is photo watermarking available using Windows Explorer? Microsoft Paint, which comes installed with Microsoft Windows, provides basic (albeit inelegant) watermarking capabilities. 2. Do Microsoft tools capture basic metadata automatically, without user intervention? Microsoft Office products capture very basic metadata automatically. The Author, Initials, and Company are captured automatically from a user’s Windows User Account settings. File system properties like File Size, Date, etc. are also automatically captured. The following Microsoft Knowledge Base articles provide details for each Microsoft Office product: http://office.microsoft.com/en-us/access-help/view-or-change-the-properties-for-an-office-file- HA010354245.aspx, http://office.microsoft.com/en-us/help/about-file-properties-HP003071721.aspx. Microsoft SharePoint can be configured to automatically capture metadata for items uploaded to libraries: http://office.microsoft.com/en-us/sharepoint-help/introduction-to-managed-metadata-HA102832521.aspx. 3. Can you recommend tools/services that leverage geospatial data that do not provide latitude & longitude information? For example, I want to plot a photo of “Mt Doom” on a map but have no coordinates…. Embedding geospatial coordinates in digital objects (often called “geotagging”) can be done with a number of software tools. GPS Photo Link (http://www.geospatialexperts.com/gps-photo%20link.php) allows users to add coordinates to embedded metadata manually, or by selecting a photo(s) and then clicking a point on a Bing Maps satellite image.