Repurposing Archival Theory in the Practice of Data Curation

Repurposing Archival Theory in the Practice of Data Curation

Repurposing Archival Theory in the Practice of Data Curation Elizabeth Rolando| Wendy Hagenmaier |Susan Wells Parham Introduction Methodology • Expansion of data curation and digital archiving services at the Georgia Tech Library • Process the same digital collection, once by data curator, once by digital archivist and Archives. • Data curation processing informed by OAIS Reference Model1, ICPSR workflow2, and • How do data curation and archival science intersect? UK Data Archive workflow3 • How can comparing data curation and archival science lead to improvements in • Archival processing informed by concepts, such as appraisal, respect des fonds, local workflows and practices? original order, and archival value4, as well documented practices at peer institutions • Compare processing plans to discover areas of agreement and areas of conflict Data Transfer Data Processing Metadata Processing Preservation Access Unique Data Curation Processing Steps -Deposit agreement modeled on -Format transformation policies -Review and enhancement of -Varied retention periods, -Datasets treated as active and institutional repository license guided by reuse over preservation README file, used to accommodate determined by Board of Regents reusable -Funding model for sustainability -Create derivatives to promote diverse depositor needs Retention Schedule and funding -Datasets linked to publications access and re-use model -Bulk or individual file download -Correct erroneous or missing data Common Processing Steps -Data quarantine -Format identification and -Creation of descriptive, -File format migration -Various levels of access -Collection policy review normalization administrative, technical, and -Storage media refreshment -End user authentication -Integrity checks -Technical metadata extraction preservation metadata -Integrity checks -Terms of use -Confidentiality and privacy review -Preservation events noted in -Processing noted in metadata metadata Unique Archival Processing Steps -Retention and disposition -Format transformation policies -Enhancement of accession record, -Permanent retention of -Records treated as inactive and decided upon and recorded in guided by preservation over re-use based on standardized depositor unprocessed, raw masters, as well as read-only accession record -Create derivatives to protect master survey processed masters -Multiple virtual arrangements -Forensic capture and processing files -Creation of public finding aid -Emulation of original order -Donor agreement, with transfer -Digital exhibits of copyright Figure 1: Highlights from comparison between archival and data curation processing plans. The first row lists those elements of the data curation processing plan that were unique, while the bottom row lists those steps in the archival processing plan that were distinct. The middle row identifies those elements of the processing plans that were common between the two. Images used in the diagram were created by Jørgen Stamp (www.digitalbevaring.dk) and are published under a Creative Commons Attribution 2.5 Denmark license. Findings What data curation might learn from archival science and processing: What archival science and processing might learn from data curation: • Forensic capture and processing may be valuable for certain data sets • Establish a balance between supporting future access and use and maintaining the • Existing repository license agreement models might not work for digital data sets integrity of the record--do disk images support future access? • Retention and disposition should be planned at the point of data transfer • Existing donor agreement and copyright transfer models might not work for digital • Creating virtual arrangements that emulate the data creator’s original environment archives acquisitions could be valuable • Funding model should be planned at the point of record transfer • Data curators might question how much should be done to correct data in order to • Processed records may not be “inactive”; the life of the record continues through re- facilitate re-use--how much effort is enough? use, which enriches the record and should be documented in the record itself References 1 Consultative Committee for Space Data Systems. (2012). Reference model for an Open Archival Information System (OAIS) (Magenta Book CCSDS 650.0-B-1). Retrieved from http://public.ccsds.org/publications/archive/650x0m2.pdf. 2 Inter-university Consortium for Political and Social Research. (n.d.). A Case Study in Repository Management. Retrieved from http://www.icpsr.umich.edu/icpsrweb/content/datamanagement/lifecycle/index.html. 3 UK Data Archive. (2014). How We Curate Data. Retrieved from http://www.data-archive.ac.uk/curate. 4 Society of American Archivists, Glossary of Archival and Records Terminology: http://www2.archivists.org/glossary .

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    1 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us