Evaluation of characterisation tools Part 1: Identification

Authors
Johan van der Knijff (KB)
Carl Wilson (BL)


Revision History

Version  Author                              Date        Changes
0.1      Johan van der Knijff                2011-04-28  DROID 6
0.2      Johan van der Knijff                2011-06-14  Fido
0.25     Johan van der Knijff                2011-06-16  Fido updated
0.3      Carl Wilson / Johan van der Knijff  2011-06-24  Unix File Utility
0.4      Johan van der Knijff                2011-07-12  FITS
0.45     Johan van der Knijff                2011-07-26  Incorporated feedback Andrea Goethals on FITS + layout changes
0.5      Johan van der Knijff                2011-08-15  FITS 2 + additional tests one-file-at-a-time scenario for all tools
1.0      Johan van der Knijff                2011-09-21  Incorporated feedback Stephen Abrams on JHOVE2, Maurice de Rooij on FIDO; added introduction, concluding chapter


Table of Contents

1 Introduction
   1.1 Scope of this document
   1.2 Outline
2 Evaluation framework
   2.1 Tool interface
   2.2 License type
   2.3 Language
   2.4 Platform dependencies
   2.5 Coverage of file formats
   2.6 Extendibility
   2.7 Output format
   2.8 Unique output identifiers
   2.9 Granularity of output
   2.10 Accuracy of reported results
   2.11 Comprehensiveness and completeness of reported results
   2.12 Fit to needs of preservation community
   2.13 Ability to deal with nested objects
   2.14 Ability to deal with composite objects
   2.15 User documentation
   2.16 Computational performance
   2.17 Stability
   2.18 Error handling and reporting
   2.19 Provision of event information
   2.20 Maturity and development stage
   2.21 Development activity
   2.22 Existing experience
3 Data set and test environment
   3.1 KB Scientific Journals set
   3.2 KB Large set
   3.3 Test environment
4 DROID 6.0
   4.1 Overview
   4.2 Tool interface
   4.3 License type
   4.4 Language
   4.5 Platform dependencies
   4.6 Coverage of file formats
   4.7 Extendibility
   4.8 Output format
   4.9 Unique output identifiers
   4.10 Granularity of output
   4.11 Accuracy of reported results
   4.12 Comprehensiveness and completeness of reported results
   4.13 Fit to needs of preservation community
   4.14 Ability to deal with nested objects
   4.15 Ability to deal with composite objects
   4.16 User documentation
   4.17 Computational performance: one file at a time
      4.17.1 KB Scientific Journals data set
      4.17.2 KB Large data set
   4.18 Computational performance: many files at a time
   4.19 Stability
   4.20 Error handling and reporting
   4.21 Provision of event information
   4.22 Maturity and development stage
   4.23 Development activity
   4.24 Existing experience
   4.25 Unidentified files
   4.26 Conclusions
5 Fido 0.9
   5.1 Overview
   5.2 Tool interface
      5.2.1 Python version
      5.2.2 Jython version
   5.3 License type
   5.4 Language
   5.5 Platform dependencies
      5.5.1 Python version
      5.5.2 Jython version
   5.6 Coverage of file formats
   5.7 Extendibility
   5.8 Output format
   5.9 Unique output identifiers
   5.10 Granularity of output
   5.11 Accuracy of reported results
   5.12 Comprehensiveness and completeness of reported results
   5.13 Fit to needs of preservation community
   5.14 Ability to deal with nested objects
      5.14.1 Python version
      5.14.2 Jython version
   5.15 Ability to deal with composite objects
   5.16 User documentation
      5.16.1 Python version
      5.16.2 Jython version
   5.17 Computational performance: one file at a time (Python version)
      5.17.1 KB Scientific Journals data set
      5.17.2 KB Large data set
   5.18 Computational performance: one file at a time (Jython version)
   5.19 Computational performance: many files at a time
      5.19.1 Python version
      5.19.2 Jython version
   5.20 Stability
      5.20.1 Python version
      5.20.2 Jython version
   5.21 Error handling and reporting
   5.22 Provision of event information
   5.23 Maturity and development stage
   5.24 Development activity
   5.25 Existing experience
   5.26 Unidentified files
   5.27 Conclusions
6 Unix File Utility
   6.1 Overview
   6.2 Tool interface
   6.3 License type
   6.4 Language
   6.5 Platform dependencies
   6.6 Coverage of file formats
   6.7 Extendibility
   6.8 Output format
   6.9 Unique output identifiers
   6.10 Granularity of output
   6.11 Accuracy of reported results
   6.12 Comprehensiveness and completeness of reported results
   6.13 Fit to needs of preservation community
   6.14 Ability to deal with nested objects
   6.15 Ability to deal with composite objects
   6.16 User documentation
   6.17 Computational performance: one file at a time
      6.17.1 KB Scientific Journals data set
      6.17.2 KB Large data set
   6.18 Computational performance: many files at a time
   6.19 Stability
   6.20 Error handling and reporting
   6.21 Provision of event information
   6.22 Maturity and development stage
   6.23 Development activity
   6.24 Existing experience
   6.25 Unidentified files
   6.26 Conclusions
7 FITS (File Information Toolset) 0.5
   7.1 Overview
   7.2 Tool interface
   7.3 License type
   7.4 Language
   7.5 Platform dependencies
   7.6 Coverage of file formats
   7.7 Extendibility
   7.8 Output format
   7.9 Unique output identifiers
   7.10 Granularity of output
   7.11 Accuracy of reported results
   7.12 Comprehensiveness and completeness of reported results
   7.13 Fit to needs of preservation community
   7.14 Ability to deal with nested objects
   7.15 Ability to deal with composite objects
   7.16 User documentation
   7.17 Computational performance: one file at a time
      7.17.1 KB Scientific Journals data set
      7.17.2 KB Large data set
      7.17.3 Additional tests on influence of FITS configuration
   7.18 Computational performance: many files at a time
   7.19 Stability
   7.20 Error handling and reporting
   7.21 Provision of event information
   7.22 Maturity and development stage
   7.23 Development activity
   7.24 Existing experience
   7.25 Unidentified files
   7.26 Conclusions
8 JHOVE2
   8.1 Overview
   8.2 Tool interface
   8.3 License type
   8.4 Language
   8.5 Platform dependencies
   8.6 Coverage of file formats
   8.7 Extendibility
   8.8 Output format
   8.9 Unique output identifiers
   8.10 Granularity of output
   8.11 Accuracy of reported results
   8.12 Comprehensiveness and completeness of reported results
   8.13 Fit to needs of preservation community
   8.14 Ability to deal with nested objects
   8.15 Ability to deal with composite objects
   8.16 User documentation
   8.17 Computational performance: one file at a time
      8.17.1 KB Scientific Journals data set
      8.17.2 KB Large data set
      8.17.3 Additional tests on influence of file type and size
   8.18 Computational performance: many files at a time
      8.18.1 Additional note on treatment of source units in JHOVE2
   8.19 Stability
      8.19.1 JHOVE2 'hangs up' on EPUB/ZIP file
      8.19.2 Default location for writing memory objects
      8.19.3 JHOVE2 doesn't clean up its temporary files
   8.20 Error handling and reporting
   8.21 Provision of event information
   8.22 Maturity and development stage
   8.23 Development activity
   8.24 Existing experience
   8.25 Unidentified files
   8.26 Conclusions
9 Concluding observations and suggestions for further work
   9.1 Performance of Java-based tools
   9.2 Identification of text-based formats
   9.3 Extensions to Unix File?
Acknowledgements
References

1 Introduction
This report is part of SCAPE Work Package 9 – Characterisation Components. The overall objective of this work package is the development of a technical infrastructure that enables large-scale characterisation of digital objects in a distributed architecture. A first step towards this end is an analysis of existing tools. What tools are available, what are they capable of, and to what extent can they be deployed in the envisaged architecture?

1.1 Scope of this document
In practice the term "characterisation" is often used to indicate quite different things. To avoid any confusion, this document follows the terminology used in the JHOVE2 project. Here, characterisation is loosely defined as the process of deriving information about a digital object that describes its character or significant nature [1]. This process is subdivided into four aspects:
• identification - the process of determining the presumptive format of a digital object;
• feature extraction - the process of reporting the intrinsic properties of a digital object that are significant to preservation planning and action;
• validation - the process of determining the level of conformance of a digital object to the normative syntactic and semantic rules defined by the authoritative specification of the object's format;
• assessment - the process of determining the level of acceptability of a digital object for a specific purpose on the basis of locally-defined policy rules.

Although the characterisation work in SCAPE will eventually cover all the above aspects of characterisation, the scope of this document is limited to identification only. The selection of tools is based on an inventory that is covered in detail in a separate document (van der Knijff et al., 2011b).

1.2 Outline
The evaluation of the identification tools is based on an evaluation framework. Chapter 2 gives a brief explanation of the evaluation criteria. A detailed description of the evaluation framework is given in a separate document (van der Knijff et al., 2011a).

Chapter 3 gives a description of the data sets that were used for the evaluation, and of the technical environment in which the tests were run.

The actual tool evaluations are covered in Chapters 4 to 8. The currently evaluated tools are:
• DROID 6.0
• Fido 0.9
• Unix File tool
• FITS 0.5
• JHOVE2

[1] JHOVE2 actually uses 2 separate definitions: "(1) Information about a digital object that describes its character or significant nature that can function as an surrogate for the object itself for purposes of much preservation analysis and decision making. (2) The process of deriving this information. This process has four important aspects: identification, feature extraction, validation, and assessment."

Note that some of these tools offer functionality that extends well beyond identification. This applies in particular to FITS and JHOVE2, which can also be used for feature extraction, validation and assessment. However, the scope of the current report is limited to identification only, and these additional functionalities were not included in the evaluation.

An important limitation of the current work is that the quality of the identification results has not been taken into account, even though this is part of the evaluation framework. The main reason for this is the lack of a reliable test corpus in which the 'true' format of each file is unambiguously known. This should be addressed in follow-up work.

Finally, Chapter 9 summarises some concluding observations, and suggests some possible routes for further work.


2 Evaluation framework

The tools were evaluated against the criteria that are defined in "WP 9: evaluation framework for characterisation tools". The full list of criteria is given below.

2.1 Tool interface
This relates to how a tool can be invoked. Examples are: command-line interface, Java API, SOAP. It is mainly relevant to the usability within a distributed architecture (e.g. tools that can only be invoked through a graphical user interface cannot be incorporated in an automated workflow without modification).

2.2 License type
If a tool is available under an open source license, it may be adapted or optimised for a distributed architecture. For closed-source tools this usually isn't an option (unless some agreement can be set up with the license holder).

2.3 Language
Open source tools may need to be adapted or optimised for the SCAPE architecture. However, if a tool is written in a language with which none of the involved SCAPE partners have sufficient experience, this may not be a viable option.

2.4 Platform dependencies
Some tools are only available for specific platforms (e.g. Windows XP). Such platform dependencies may make it difficult or even impossible to use such tools in other (e.g. Linux-based) environments.

2.5 Coverage of file formats
The selection of tools should take into account the file formats in which we are really interested (i.e. the formats that are prevalent in the collections of the SCAPE partners).

2.6 Extendibility
This relates to the degree to which it is possible to extend a tool to include new file formats, or to update or improve the characterisation of existing formats. An example is DROID, for which the coverage of file formats can be extended by adding new format entries to an external signature file.

2.7 Output format
The incorporation into an automated workflow requires that tools report their output in a format that can be easily interpreted. XML formats that are properly defined by a schema are preferable.

2.8 Unique output identifiers
Tools may use identifiers or tags that are different from the ones used in the format registry. An example: BitsPerComponent and BitsPerSample both refer to the same property of a JP2 image. The first is reported by ExifTool, the second by JHOVE1. This means that in many cases the output of individual tools will have to be normalised by mapping tool-specific properties onto unique property instances.
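
A minimal sketch of such a mapping in Python (the normalised name 'bits_per_sample' and the mapping table are invented for illustration; they are not registry terms):

# Map tool-specific property names onto one normalised property name.
# 'bits_per_sample' is a hypothetical normalised name, not a registry term.
PROPERTY_MAP = {
    ("ExifTool", "BitsPerComponent"): "bits_per_sample",
    ("JHOVE1", "BitsPerSample"): "bits_per_sample",
}

def normalise(tool, prop):
    # Fall back to the tool-specific name if no mapping is known.
    return PROPERTY_MAP.get((tool, prop), prop)

print(normalise("ExifTool", "BitsPerComponent"))  # bits_per_sample
print(normalise("JHOVE1", "BitsPerSample"))       # bits_per_sample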

2.9 Granularity of output
In addition, different tools may use different levels of granularity. For instance, DROID reports its identification results as PUIDs, whereas the Unix File command uses MIME types instead.

As an example, any PDF file (irrespective of version) will return MIME type 'application/pdf' with the Unix File command, whereas DROID's PUID classification uses a separate PUID for each PDF version (and PDF profile). The tool evaluation should therefore take into account the required granularity, and the degree to which a tool's output can be mapped back to the registry that is used.
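
The sketch below illustrates this one-to-many relation; the PUID/version pairs follow PRONOM, but treat the exact values as illustrative rather than authoritative:

# Several PRONOM PUIDs (one per PDF version) collapse onto a single MIME
# type, so the reverse mapping is one-to-many.
PUID_TO_MIME = {
    "fmt/14": "application/pdf",  # PDF 1.0
    "fmt/15": "application/pdf",  # PDF 1.1
    "fmt/16": "application/pdf",  # PDF 1.2
    "fmt/17": "application/pdf",  # PDF 1.3
    "fmt/18": "application/pdf",  # PDF 1.4
}

def puids_for_mime(mime_type):
    return sorted(p for p, m in PUID_TO_MIME.items() if m == mime_type)

print(puids_for_mime("application/pdf"))  # five candidate PUIDs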

2.10 Accuracy of reported results
The information that is provided by a tool may not always be accurate. An example: ImageMagick's 'identify' tool reports the presence of embedded ICC profiles in JP2 images that do not contain an ICC profile at all.

2.11 Comprehensiveness and completeness of reported results
Candidate tools should also be evaluated for the completeness of the reported results. Another JPEG 2000 example to illustrate this: if a JPEG 2000 (JP2 or JPX) image contains an embedded ICC profile, JHOVE1 will report the colour specification method used ('Restricted ICC' or 'Any ICC'), which is important for preservation. However, it does not provide any information on the ICC profile itself. ExifTool, on the other hand, gives very detailed output on the ICC profile, but it doesn't give any information on the colour specification method. So tools may be complementary to each other.

2.12 Fit to needs of preservation community
Many characterisation tools were not originally developed with the preservation community in mind. An example is ExifTool, which is aimed more at photographers. This is not necessarily a problem, but the evaluation should address to what extent tools meet the specific needs of preservation.

2.13 Ability to deal with nested objects
Not all tools are able to deal with nested objects. With the rise of formats such as EPUB and the Open Document Format, which use the ZIP format as a physical container, dealing with such objects in a meaningful way is becoming increasingly important.

2.14 Ability to deal with composite objects
Not all tools are able to deal with composite objects. Examples are (again) EPUB and the Open Document Format (where the constituent parts are held together by a physical ZIP container), and HTML.

2.15 User documentation
Tools that are used in an operational workflow should be supported by user documentation of sufficient quality. This should cover all aspects that are relevant to using the tool: system requirements, dependencies, limitations, installation, use (including options), and the interpretation of its output.

2.16 Computational performance
The computational performance of the tools should meet some minimal requirements. These are mainly related to speed and memory usage. If a tool is very slow, this may make its deployment in a large-scale automated workflow problematic. The same applies to a tool that consumes excessive amounts of system resources. For practical reasons, it is useful to distinguish between the following two use cases:
1. performance while working on one object at a time;
2. performance while working on a (very) large number of objects.


The reason for this is that many tools (especially the ones that are written in Java) suffer from slow initialisation after their invocation, due to the slow startup of the Java Virtual Machine and the need to load large signature files. However, if such tools are able to perform many characterisations in one run, the average performance per object may still be very good, although this may imply some specific requirements for the SCAPE framework. Conversely, in some cases the performance of tools that are designed to process only one object at a time may be improved by enabling the processing of many objects in one run.
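
A small back-of-the-envelope calculation makes the effect of batching concrete (the startup and per-file times below are assumed numbers, chosen only for illustration):

# All numbers are assumptions for illustration: an 11-second start-up cost
# (JVM start-up plus signature file loading) and 0.1 seconds of actual
# identification work per file.
STARTUP = 11.0
PER_FILE = 0.1

for n in (1, 100, 10000):
    print("%5d files: %.3f s per object" % (n, (STARTUP + n * PER_FILE) / n))
# -> 11.100, 0.210 and 0.101 seconds per object, respectively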

2.17 Stability
The stability of the tools should meet some minimal requirements (also related to error handling and development stage, see below).

2.18 Error handling and reporting
The handling and reporting of errors should meet some minimal requirements. Any input/output errors and unexpected exceptions should not lead to system crashes, but should be handled by the tool. The tool should also be able to report back any errors in such a way that they can be handled properly by the workflow management system.

2.19 Provision of event information
In addition to error handling and reporting, tools should ideally also provide information about each individual characterisation event, such as:
• version number of the tool;
• version number of the signature file (in the case of identification tools that are based on external signature files);
• outcome: did the tool succeed, fail, or something in between? For identification tools this could include information on whether a tool was able to come up with a positive match for a specific file format, a tentative one, or no match at all.
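
As a minimal sketch, such an event record for an identification tool could look like the following (all field names and values are invented for illustration):

# Hypothetical event record for a single identification event; all field
# names and values are invented for illustration.
event = {
    "toolName": "DROID",
    "toolVersion": "6.0",
    "signatureFileVersion": "45",
    "outcome": "positive match",  # or 'tentative match', 'no match', 'failed'
    "filePath": "1-1-70.pdf",
}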

2.20 Maturity and development stage
We may have reason to have more confidence in tools that have already seen a number of stable releases than in tools that are still in alpha or beta status. However, there are examples to the contrary, and the very nature of SCAPE as a research project implies that we shouldn't overlook 'new' tools that show promise.

2.21 Development activity
Attempts to include or modify tools that show no signs of recent active development may turn out to be a waste of time and effort in the long run (lack of support, updates and user base). There may be exceptions where SCAPE could potentially act as a trigger to re-activate dormant development efforts; the Open Planets Foundation could play an important role here.

2.22 Existing experience
If we already have some experience with a tool, and are familiar with its use, chances are we are already aware of its strengths, weaknesses and limitations.



3 Data set and test environment

3.1 KB Scientific Journals set
This is a set of mostly scientific publications that were extracted from the KB's e-Depot repository. The data set comprises 7,796 archival packages, containing a total of 11,892 file objects (including both content and associated metadata files). Its total (uncompressed) size is about 1.15 GB. The materials in the data set originate from a wide variety of publishers. Each AIP is represented by a directory, which results in the following directory tree:

+---prdrm_1289223594188
|   \---unpacked
|       +---Original metadata
|       \---OriginalEpublication
+---prdrm_1289223611485
|   \---unpacked
|       +---Original metadata
|       \---OriginalEpublication
+--- etc.

The above example shows 2 AIPs. For each, the 'OriginalEpublication' directory contains the content files, and metadata are stored in the 'Original metadata' directory. Table 3-1 and Figure 3-1 give an overview of the data set's file size distribution. The figure shows a marked peak around files of about 1 KB; this peak is mostly made up of small metadata files that are part of the archival packages. In general, large files appear to be somewhat underrepresented. For this reason, an additional data set of (very) large files was used for some of the tests in this report.

Table 3-1 Summary file size statistics of KB Scientific Journals data set (expressed in bytes). N = number of files; q1, median and q3 are the 1st, 2nd and 3rd quartiles, respectively.

N       min  q1   median  mean     q3      max
11,892  11   955  4,737   104,650  43,177  25,495,289


Figure 3-1 Distribution of file size in KB Scientific Journals data set. Note logarithmic scale on horizontal axis.

3.2 KB Large set
This is a data set that consists of 8 large files. Table 3-2 below lists its main characteristics.

Table 3-2 Main characteristics of KB Large data set

File name                         File type                 Size (bytes)
dpo_tonal_00990.tiff              TIFF image                36,420,900
IMAGE000060_lossless_colour.jp2   JP2 image                 49,818,414
IMAGE000060.TIF                   TIFF image                114,664,502
SGD_19451955_0000002_ID371.pdf    PDF 1.4                   325,101,508
DipAsset6984444754559678047.tar   TAR archive               534,968,320
KBDVD.iso                         ISO image                 683,180,032
KBDVD17062011.img                 ISO image                 693,993,472
DNEPABOstg.PST                    MS Outlook email folder   1,895,515,136

3.3 Test environment
All tests were run on a single desktop PC running Windows XP Professional. The test data set was stored on an external USB drive. Table 3-3 below summarises the most important system characteristics.


Table 3-3 Main characteristics of test environment

Processor: x86 Family 15 Model 6 Stepping 5 GenuineIntel ~2992 MHz (32 bits)
Total physical memory: 999 MB
Maximum virtual memory: 2,048 MB
OS: Microsoft Windows XP Professional, Version 5.1.2600 Service Pack 3 Build 2600
Hard drive: LaCie Grand 1 TB USB 2.0 External
Java Runtime Environment: java version "1.6.0_26"; Java(TM) SE Runtime Environment (build 1.6.0_26-b03); Java HotSpot(TM) Client VM (build 20.1-b02, mixed mode, sharing)

For the performance tests, two tools were used for measuring processing time:
1. For single tool invocations (e.g. processing of one single file, or processing of a whole directory tree using a single invocation of a tool) we used Microsoft's 'timeit' utility, which is part of the Windows Server 2003 Resource Kit Tools [2].
2. Some of the tests involved a recursive traversal of the whole directory tree, where all 11,892 file objects were processed using 11,892 separate tool invocations. This was automated with a custom-made Python script called 'treeLaunch', which runs a user-defined command as a sub-process on all file objects in a directory tree. The script includes a built-in timer which measures the time between the start and completion of each sub-process. For each file/sub-process, the script reports the file path of the corresponding file object, its MIME type [3], its size and the duration of the sub-process (a minimal sketch of this approach is given below).

Manual tests on a limited number of tool/file combinations revealed no noticeable differences between the timings produced by the two methods.
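
To make the treeLaunch approach concrete, the following is a minimal sketch of such a script; it is not the actual treeLaunch code, and the output format is simplified for illustration:

# Minimal sketch of a treeLaunch-style wrapper: walk a directory tree, run a
# user-defined command on every file object, and time each sub-process.
import mimetypes
import os
import subprocess
import sys
import time

def tree_launch(root, command):
    # 'command' is the tool invocation as a list, e.g.
    # ["java", "-jar", "droid-command-line-6.0.jar", "-a"]
    for dirpath, dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            start = time.time()
            subprocess.call(command + [path])
            elapsed = time.time() - start
            mime = mimetypes.guess_type(path)[0]  # extension-based guess
            size = os.path.getsize(path)
            print("%s,%s,%d,%.3f" % (path, mime, size, elapsed))

if __name__ == "__main__":
    # usage: python treelaunch_sketch.py <root dir> <command ...>
    tree_launch(sys.argv[1], sys.argv[2:])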

[2] More info: http://www.microsoft.com/downloads/en/details.aspx?FamilyID=9d467a69-57ff-4ae7-96ee-b18c4790cffd&displaylang=en
[3] The MIME type here is guessed from the file extension only, using Python's built-in 'mimetypes' module.


4 DROID 6.0

4.1 Overview
DROID (Digital Record Object Identification) is a tool that identifies digital objects using PRONOM format signatures ('magic numbers') and/or known file extensions. The identification results are reported as PRONOM-compliant Persistent Unique Identifiers (PUIDs). DROID is an open-source, platform-independent Java application. It can be used directly from the command line or, alternatively, through a graphical user interface. Evaluated here is the most recent version, DROID 6.0 (released March 2011).

4.2 Tool interface
DROID can be invoked through a graphical user interface and a command line interface. For the command line interface a Windows batch file and a Unix shell script are provided. The batch file only appears to work if it is invoked directly from the application directory (i.e. the directory in which the batch file and the two main JAR files are installed). Since this is not very practical, calling the executable JAR of the command-line interface directly [4] will often be more convenient. It is possible to tell DROID to analyse all files in a given directory (or set of directories), and the "-R" command-line switch activates recursive processing of all subdirectories (and sub-subdirectories, and so on).

[4] E.g. like this: java -jar C:\droid\droid-command-line-6.0.jar -a ...

4.3 License type
DROID is released under an open-source BSD License, which "permits its use, modification, inclusion in other products and redistribution as long as the terms of the license are adhered to" [DROID 6]. DROID uses a number of third party components, which are all released under a variety of open source license types.

4.4 Language
DROID is written in Java (version 6). The user documentation states that DROID has been tested on Java 6 update 17 to update 22, while it should run under Java 6 update 10 onwards.

4.5 Platform dependencies
According to the user documentation DROID runs on Windows, Linux and Mac, and potentially other platforms that support Java 6 applications (see above).

4.6 Coverage of file formats
DROID tries to identify files using file signatures or, alternatively, known extensions. These are defined in a signature file, which is regularly updated by The National Archives. As an indication, version 45 of the signature file contains 766 file type definitions and 327 file signatures.

4.7 Extendibility
Since the number of formats that DROID can handle is defined by the information in the signature file, new formats can be added by modifying the signature file (or downloading the latest version from The National Archives). Unlike earlier versions, in DROID 6 the signature file is always stored in a separate (configurable) user directory (which is not necessarily the location of the DROID application files). In addition to the regular signature file, DROID 6 also uses a separate file with 'container signatures', which is used for formats that may contain other files within themselves. Examples are ZIP, which is used as a physical container for the Open Document and Microsoft Office Open XML formats, and OLE2, which is a container for the Microsoft Office formats.

4.8 Output format
Older versions of DROID (e.g. DROID 4) used to write their output directly to an XML file. In DROID 6 things are slightly more complicated. Central to its input and output handling is the concept of profiles. A profile is defined as "the files and folders you want to find out about, and the results of profiling them". A profile uses the ZIP format as a physical container, and all relevant DROID input and output results are stored in this container. Crucially, the characterisation results are stored in a database format that is not human-readable. In order to use the results, the information from the profile has to be exported. DROID 6 can only export the profile information to CSV (comma-separated text) format. By default, each row in the exported CSV file represents one file object. However, if a file object results in multiple format matches, the last 4 columns of the CSV output (PUID, MIME type, format name and format version) are repeated for each match. This means that the number of columns per row is variable. Alternatively, DROID offers a 'one row per format' option, where each row represents one unique format identification (which has the implication that one file object may be represented by multiple rows in the CSV file).
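
Because the number of columns varies with the number of matches, parsing the default 'one row per file' export needs a little care. A minimal sketch (the number of fixed columns is an assumption based on our exports; verify it against an actual DROID 6 CSV file):

import csv

FIXED_COLUMNS = 14  # fixed columns before the repeating match groups (assumed)

def read_droid_csv(path):
    # Yields (fixed_columns, matches) per file object, where matches is a
    # list of (puid, mime_type, format_name, format_version) tuples.
    with open(path, "r") as f:
        reader = csv.reader(f)
        next(reader)  # skip the header row
        for row in reader:
            fixed, cells = row[:FIXED_COLUMNS], row[FIXED_COLUMNS:]
            matches = [tuple(cells[i:i + 4]) for i in range(0, len(cells), 4)]
            yield fixed, matches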

In addition to this, DROID 6 also has options to generate reports (which can be in either XML or PDF format), but this report functionality is mainly useful for generating aggregate statistics of the identification results (e.g. for each format the corresponding number of files). None of these output formats are particularly suitable for use in automatic workflow systems [5]. Exporting is done as a separate step, which means that two separate invocations of DROID are necessary to produce the CSV output (i.e. first scan, then export).

4.9 Unique output identifiers
Identification results are reported as PRONOM Unique Identifiers (PUID) and MIME types. In addition, DROID also provides a textual description of the format name and (if applicable) the format version number.

4.10 Granularity of output
The use of PUID as the primary identifier ensures that DROID's output can be mapped directly to the PRONOM registry (as well as the OPF registry).

4.11 Accuracy of reported results
Not analysed yet.

4.12 Comprehensiveness and completeness of reported results
The reporting of the analysis results appears to be complete and sufficiently comprehensive, with the important exception of event information, which is explained further below. The output also contains the full URI and file path for each analysed object.

[5] We have contacted The National Archives about these output format issues, and inquired whether it would be possible to re-introduce the 'old' XML output format. In a reply (e-mail Andrew Fetherston, 11-4-2011) TNA confirmed that the XML output was dropped in recent DROID releases. He did however offer to add a suggestion for unified XML output of results for any future development of DROID.

4.13 Fit to needs of preservation community
DROID is being developed by The National Archives, and the preservation community is its primary target audience. However, the encountered output-related problems suggest that the current focus of DROID is on interactive use through the graphical user interface. The needs of users who wish to incorporate DROID 6 in automated workflows are not served particularly well. The inclusion of more elaborate XML output (similar to what was used in e.g. DROID 4) would make the software much more suitable for such applications.

4.14 Ability to deal with nested objects
DROID 6 is able to look inside ZIP files. DROID analyses all file objects that are embedded in a ZIP file, and these are included in DROID's output. To illustrate this, the example below shows how a Perl script that is embedded inside a ZIP archive is reported:

zip:file:/D:/aipSamplesUnpacked/./prdrm_1289288371730/unpacked/Original%20metadata/1746-4811-1-10-S2.zip!/CDF_masking/easy_script.pl

4.15 Ability to deal with composite objects
Formats such as Microsoft Word 97 and Open Document Format are based on multiple file objects that are held together by a physical container (e.g. OLE2 for Microsoft Word 97, and ZIP for Open Document Format). DROID 6 can handle both cases. Open Document files are identified as Open Document format (and not as plain ZIP, even though ZIP is the container format). Interestingly, EPUB files (which are organised in a similar manner as Open Document files) are still treated (and identified) as regular ZIP files.

HTML represents another class of composite objects, where, for example, an individual HTML file refers to external style sheets and images, which are all needed for proper rendering. The main difference with e.g. Open Document Format or EPUB is the absence of a physical container file. DROID does not currently have any mechanism to recognise the interdependencies between the individual components of this particular class of ‘composite objects’.

4.16 User documentation
The GUI version of DROID has a comprehensive on-line help. Its contents are also provided as a 65-page manual in PDF format. The current evaluation revealed the following issues:
• In the description of the CSV output some columns are left out, and the column headings in the documentation sometimes differ slightly from those in the actual CSV files (see also the section on event information below).
• The documentation of the CSV output doesn't explain that additional columns are added in case of multiple format matches.
• The manual describes how to configure folders for user settings and temporary files. The actual behaviour of the temporary files directory differs slightly from the documented behaviour. In particular, the temporary files directory settings should by default be inherited from the user settings directory settings, but this hasn't been implemented accordingly in the software. The result is that DROID may end up putting large amounts of temporary data at unexpected locations, which can lead to system problems [6].
• In order to be able to work with DROID 6 a basic understanding of the 'Profile' concept is crucial. However, the documentation does not clearly explain what a 'Profile' exactly is. The most specific description describes a profile as "the files and folders you want to find out about, and the results of profiling them", but apart from being rather vague it is hidden away in the 'Create a new profile' section.
• The PDF documentation is a direct derivative of the on-line help. Although certainly very useful, it is somewhat lacking in structure (see also the aforementioned remark on profiles). A properly edited user manual (which would, for example, explain central concepts such as 'profiles' in an introductory chapter) could be a further improvement.

[6] This issue was reported to TNA, and they confirmed that this is a bug that will be corrected in upcoming releases. In the meantime a simple fix is to always set the temporary files folder explicitly using the 'droidTempDir' environment variable (as described on page 62 of the DROID 6 Help document).

4.17 Computational performance: one file at a time

4.17.1 KB Scientific Journals data set
DROID's performance while processing one file at a time was tested using the 'treeLaunch' tool (see section 3.3). The 'treeLaunch' tool was set up to recursively traverse the KB Scientific Journals data set's directory tree, and run DROID for each encountered file object. For an individual file object this results in a command line like:

java -jar C:\droid\droid-command-line-6.0.jar -a 1-1-70.pdf -p test1.droid

Here, DROID's -a switch defines a resource (in this case a PDF file), and -p defines a 'profile' (see above).

It took about 39 hours to analyse the KB Scientific Journals data set in this way. Figure 4-1 and Table 4-1 summarise the main results. On average DROID 6 needs about 12 seconds per file object, but in exceptional cases more than 20 seconds are needed. Figure 4-2 shows that processing time is independent of file size.


Figure 4-1 Distribution of processing time per file object for DROID 6 one-file-at-a-time scenario, KB Scientific Journals data set.

Table 4-1 Summary performance statistics for DROID 6 one-file-at-a-time scenario, KB Scientific Journals data set (expressed in seconds per file, except N). N = number of files; q1, median and q3 are the 1st, 2nd and 3rd quartiles, respectively.

N       min    q1     median  mean   q3     max
11,892  11.19  11.74  11.89   11.90  12.00  23.02


Figure 4-2 Scatter plot of processing time per file object versus file size for DROID 6 one-file-at-a-time scenario, KB Scientific Journals data set. Note logarithmic scale on horizontal axis.

4.17.2 KB Large data set
Table 4-2 shows the results of an additional test that was done on the KB Large data set (again using the 'treeLaunch' tool). The table demonstrates that in general file size does not appear to affect processing time very much. The TAR archive is an exception; however, bearing in mind that TAR is a container format for which DROID also analyses all packed file objects, this is not surprising.

Table 4-2 Performance for individual files in KB Large data set

File name                         Size (bytes)   Processing time (s)
dpo_tonal_00990.tiff              36,420,900     12.63
IMAGE000060_lossless_colour.jp2   49,818,414     12.38
IMAGE000060.TIF                   114,664,502    12.53
SGD_19451955_0000002_ID371.pdf    325,101,508    19.80
DipAsset6984444754559678047.tar   534,968,320    65.28
KBDVD.iso                         683,180,032    12.53
KBDVD17062011.img                 693,993,472    13.52
DNEPABOstg.PST                    1,895,515,136  15.88


4.18 Computational performance: many files at a time
To test DROID's performance while working on a large number of objects, we performed a recursive scan of all file objects in the directory structure of the KB Scientific Journals data set and measured the time needed for this. We used the following command line:

timeit java -jar C:\droid\droid-command-line-6.0.jar -a . -R -p resultsAll.droid

Here, DROID's -a switch again defines a resource (in this case the current directory); -R activates recursing into subdirectories, and -p defines a 'profile' (see above).

This gives the following result:

Elapsed Time: 0:03:12.390

So DROID 6 needs about 3 minutes to scan 1.15 GB of data in the used test environment. To get human- or machine-readable output, the resulting profile needs to be exported to CSV format, which requires an additional DROID invocation:

timeit java -jar C:\droid\droid-command-line-6.0.jar -p resultsAll.droid -e resultsAll.csv

The timing result in this case:

Elapsed Time: 0:00:22.890

At roughly 20 seconds for about 20 thousand records (almost 12 thousand file objects plus about 8 thousand directories), or about 1 ms per record, the additional overhead of the exporting step is unlikely to be a problem.

These (preliminary) tests suggest that the computational performance of DROID 6 is rather good, provided that a sufficiently large number of objects is analysed during each DROID run.

4.19 Stability
Not investigated in detail so far. However, the current tests did not result in any crashes or other stability issues.

4.20 Error handling and reporting
The DROID manual states that errors are sent to the console's standard error output.

4.21 Provision of event information
The XML output format of older DROID versions used to contain event information on the DROID version, the signature file version, and the identification status of each file. Most of this has been abandoned in the CSV output of DROID 6. More specifically:
• The DROID version number is not included.
• The version number of the signature file is not included.
• Event information on the outcome of individual analysed files is largely limited to system errors only (e.g. errors that arise because a file doesn't exist or access to a file is denied). This is recorded in the 'STATUS' column (which is somewhat confusingly called 'Job status' in the documentation).
• DROID 6 does provide information on the method that was used to identify an object (signature or extension), and any mismatches between extensions and signatures are recorded in an 'EXTENSION_MISMATCH' field (which is not described in DROID's documentation!).
• The output provides no information at all on whether an object could be identified in the first place: if DROID encounters an unidentifiable object, most of the output fields are simply left blank.
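
A consequence of the last point is that detecting unidentified files means checking for blank fields. A minimal sketch, building on the read_droid_csv sketch from section 4.8 (the FILE_PATH column index is an assumption to verify against an actual export):

# A file object counts as unidentified when all of its match cells are blank.
for fixed, matches in read_droid_csv("resultsAll.csv"):
    if not any(puid for puid, mime, name, version in matches):
        print("unidentified: %s" % fixed[3])  # index 3 = FILE_PATH (assumed)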

Summarising, the provision of event information (or lack thereof) appears to be a major problem when using DROID 6 in an automated workflow [7].

[7] We reported some of these problems back to TNA. In a reply (e-mail Andrew Fetherston, 11-4-2011), they suggested using the XML reporting feature (which is meant for generating automated reports with aggregate statistics on DROID runs) to generate some of the missing DROID and signature version output, and then combining the results of the CSV file and the XML report using a custom-written script. However, this looks like a step backward compared to DROID's previous XML output format.

4.22 Maturity and development stage
The first version of DROID was released in 2006, and DROID 6 is the sixth major release. All major DROID versions are released as stable versions (so no alpha or beta releases).

4.23 Development activity
With six major releases in six years, the development of DROID appears to be very active. In addition to this, updated versions of the DROID signature file are released on a regular basis (e.g. at the time of writing the current signature file is version 49).

4.24 Existing experience
It appears that many memory institutions that are working with digital records have at least some experience with DROID, although it is not clear how many of them are really using the software operationally.

4.25 Unidentified files
Although not part of the evaluation framework, it is useful to know something about which files can and cannot be identified by a tool. DROID was unable to identify 566 files in the test data set (some of these are objects that are nested inside a ZIP archive). The following table shows the file extensions that correspond to non-identified files:

Extension  Count  Remarks
fil        2      Text files with checksum values (Elsevier)
mol        3      MDL MOL file (chemical table format, http://en.wikipedia.org/wiki/MDL_Molfile)
oa3        69     XML metadata format
pl         3      Perl script
pm         4      Perl script
r          1      R script (R, programming language for statistical applications)
sgc        52     SGML metadata (Elsevier)
sgm        199    SGML metadata (Elsevier)
toc        59     Text table specific to Elsevier
(none)     172    XML files without XML declaration

Note that these are all text-based formats, some of which contain markup (SGML or XML). Also, DROID 6 cannot properly identify XML files that do not contain an XML declaration (which is not mandatory in XML). This is simply a limitation of identification based on file signatures: without an XML declaration, analysing the whole file with an XML parser would be the only way to establish its contents as XML.
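
A minimal sketch of that fallback: since a signature match is impossible without an XML declaration, try parsing the whole file with an XML parser (the helper name and test file are invented for illustration; this is not a DROID feature):

import xml.etree.ElementTree as ET

def is_xml(path):
    # True if the whole file parses as well-formed XML, declaration or not.
    try:
        ET.parse(path)
        return True
    except ET.ParseError:
        return False

print(is_xml("metadata_without_declaration.xml"))  # hypothetical file name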

4.26 Conclusions
The evaluation shows that DROID 6 is potentially suitable for inclusion in the SCAPE architecture. Since DROID 6 initialises quite slowly, it is important that it is deployed in such a way that many files are analysed in one run. Under this condition, DROID's computational performance appears to be more than adequate. The main foreseen problems are related to DROID's current output reporting options, which are not well suited to automated workflows. This applies in particular to the provision of identification results in CSV format (rather than XML), the output handling in case of multiple format matches (or no matches at all), and the lack of event information. Finally, the description of the CSV output files in DROID's documentation is incomplete.



5 Fido 0.9

5.1 Overview
Fido (Format Identification for Digital Objects) is an identification tool that also uses the PRONOM format signatures. It is essentially a DROID clone. The identification results are reported as PRONOM-compliant Persistent Unique Identifiers (PUIDs). Fido is an open-source command line application written in Python. There is also a Jython-based version of Fido (Jython is a Java implementation of the Python language), which provides Fido as a JAR file. The following two versions are evaluated in this chapter:
1. Fido 0.9.3 (released December 2010)
2. Fido_jar 0.9.5 (Jython version, released March 2011)
Note that no 'regular' 0.9.5 version of Fido has been released so far, which is due to the management of the code being transferred from the original developer (Adam Farquhar) to the Open Planets Foundation (Andy Jackson, personal communication). Since version 0.9.5 is the first (and currently only) Jython release, it was not possible to use the same version for both releases.

5.2 Tool interface

5.2.1 Python version
Fido can be invoked through a command line interface. In addition to the Python script, a Windows batch file and a Unix shell script are provided. However, these are not very helpful, since they contain a reference to a non-existing file (fido.run). Moreover, they are not really needed: installing Python under Windows will automatically associate .py files with the Python interpreter, whereas on Linux-based systems the command-line interpreter can be explicitly declared on the first line of the script.

5.2.2 Jython version
Fido can be invoked through a command line interface.

5.3 License type
Fido is open-source software that is released under the Apache version 2.0 license.

5.4 Language
Fido is written in Python. Currently it uses a Python syntax that is compatible with Python versions 2.6 and 2.7 [8]. It is not compatible with the Python 3.x range of interpreters. Even though the 2.x range of Python interpreters is still widely used, it may be worthwhile to consider upgrading to Python 3.x-compatible code in order to make it more future-proof. In many cases it is possible to produce code that works under both Python 2.6 (and 2.7) and Python 3.x [9].
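
As a small generic illustration of such dual-compatible code (this example is not taken from Fido itself):

# Runs unchanged under Python 2.6/2.7 and Python 3.x: the __future__ import
# turns 'print' into a function on the 2.x interpreters as well.
from __future__ import print_function

def greet(name):
    print("hello, {0}".format(name))

greet("world")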

5.5 Platform dependencies

5.5.1 Python version
Fido runs on any platform for which Python 2.6 or Python 2.7 is available (it is currently not compatible with the Python 3.x range of interpreters). There are no other dependencies.

[8] Just one example is the use of the 'print' statement (example: print "hello world"), which is replaced by the 'print' function (example: print("hello world")) in Python 3.0 and later.
[9] In fact Python 2.6 was specifically released as a transition version that already supports much of the 3.x syntax.

5.5.2 Jython version
According to its documentation, the Jython version of Fido requires Java 6 Update 23 or later, with no other dependencies.

5.6 Coverage of file formats
Fido tries to identify files using file signatures or, alternatively, known extensions. These are defined in two files, which are based on information from the PRONOM database. So in principle all formats of the PRONOM database are covered, provided that they have a valid signature. Importantly, the format of these files is not identical to that of the DROID signature files.

5.7 Extendibility
As with DROID, new formats can be added by modifying or updating the signature and extension files. Although Fido's documentation describes how to add new formats to these files, it does not describe how to convert the PRONOM/DROID information to a Fido-compatible format. Not having such an option would severely limit Fido's use in any operational setting [10].

5.8 Output format
Output is written to the standard output device, which can be redirected to a text file. The format is largely configurable using two user-defined command line parameters (matchprintf and nomatchprintf), which are formatting strings (in Python format) that define the output in case of a match or no match, respectively. The default values of these parameters result in output in comma-delimited format, where each line represents one unique format hit (if one file object gives two format hits, they are both written separately). In principle the format strings could be used to generate XML tags, but the creation of well-formed XML would require the possibility to write some leading and trailing text data that could be used to define the root element at the start and the end of the output. In the simplest case this could be implemented through the use of two additional formatting strings.
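
Until then, a root element can only be added by post-processing Fido's output. A minimal sketch of such a wrapper (the invocation path, element names and output handling are assumptions for illustration):

import subprocess
from xml.sax.saxutils import escape

# Assumed invocation; adjust the path to run.py to your Fido installation.
output = subprocess.check_output(
    ["python", "C:\\fido\\run.py", "-recurse", "."])

print("<fidoResults>")
for line in output.decode("utf-8").splitlines():
    # Each line holds one comma-delimited format hit, escaped for XML.
    print("  <result>%s</result>" % escape(line))
print("</fidoResults>")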

5.9 Unique output identifiers
Identification results are reported as PRONOM Unique Identifiers (PUID) and MIME types [11]. In addition, Fido also provides a textual description of the format name and (if applicable) a signature name.

5.10 Granularity of output
The use of PUID as the primary identifier ensures that Fido's output can be mapped directly to the PRONOM registry (as well as the OPF registry).

5.11 Accuracy of reported results
Not analysed yet.

[10] In a response to an earlier version of this report, Maurice de Rooij (OPF / NANETH) comments that this conversion can be done with the (undocumented) 'prepare.py' script that is part of Fido. A first test of this script wasn't successful, but we will look into this later.
[11] However, MIME type reporting doesn't seem to work the way it should; see Section 5.12.

5.12 Comprehensiveness and completeness of reported results
Overall, Fido's output is less verbose than DROID's. Fido can be configured to report any or all of the following output variables (see Fido's documentation for more information):

• count (nth item matched)
• group_size
• filename
• filesize
• time (in msecs)
• group_index
• puid
• formatname
• signaturename
• mimetype

This does not include the identification status (i.e. whether a file object could be identified at all), but this can be derived indirectly through the use of the 'matchprintf' and 'nomatchprintf' strings. A cursory inspection of some test results showed that file paths are reported exactly as they are given on the command line: relative paths for the FILE argument result in relative paths in the output, and full paths result in absolute paths. This could be a problem in some cases, whereas it may be desired in others. In addition, some quick tests with the '-matchprintf' option revealed that reporting of MIME types always produces a 'None' result, which doesn't look quite right. As with DROID, the possibilities for providing event information are rather limited.

5.13 Fit to needs of preservation community
Fido is being developed under the umbrella of the Open Planets Foundation, and the preservation community is its primary target audience. Unlike DROID, the focus appears to be on use in automated workflows.

5.14 Ability to deal with nested objects

5.14.1 Python version
Fido is able to look inside ZIP files. It analyses all file objects that are embedded in a ZIP file, and these are included in the output. To illustrate this, the example below shows how a CSV file that is embedded inside a ZIP archive is reported:

".\prdrm_1289288371730\unpacked\Original metadata\1746-4811-1-10-S1.zip!Supplementary Table I.csv"

5.14.2 Jython version
The Jython version of Fido doesn't yet support ZIP recursing.

5.15 Ability to deal with composite objects
Formats such as Microsoft Word 97 and Open Document Format are based on multiple file objects that are held together by a physical container (e.g. OLE2 for Microsoft Word 97, and ZIP for Open Document Format). Microsoft Office formats (Word, Excel) are all identified as "OLE2 Compound Document Format", which means Fido only recognises the container, which is not very helpful. Open Document files are identified as Open Document format (and not as plain ZIP, even though ZIP is the container format). EPUB files (which are organised in a similar manner as Open Document files) are still treated (and identified) as regular ZIP files.

HTML represents another class of composite objects, where, for example, an individual HTML file refers to external style sheets and images, which are all needed for proper rendering. The main difference with e.g. Open Document Format or EPUB is the absence of a physical container file. Like DROID, Fido does not currently have any mechanism to recognise the interdependencies between the individual components of this particular class of ‘composite objects’.

5.16 User documentation

5.16.1 Python version
The documentation of Fido is limited to a readme text file, which explains the command line interface, the installation of Fido, and its generated output. The evaluation revealed the following problems:
• The installation instructions describe the use of an installation script. Running this script under Windows results in the installation of Fido in a number of (subdirectories of) the Python system directories. However, this is not described in the documentation (which doesn't even mention the installation location at all).
• For running Fido, the documentation erroneously refers to a file called "fido.run", which doesn't exist (this should probably be "run.py").
• To add to the confusion, in the readme information on the main page of Fido's Github site this file is simply called "fido.py", which probably reflects the (yet unreleased) 0.9.5 version (although the same document also refers to the non-existent "fido.run" file).
• The method for calling Fido that is given in the documentation looks unnecessarily verbose. According to the readme, the following command line should be used:

python -m fido.run -h

(which should really be python -m run.py -h!) However, in most cases [12] this can be reduced to:

run.py

This also eliminates the need for using either the batch file or shell script that are provided with Fido (which aren't working anyway!).

5.16.2 Jython version
The Jython version comes with a readme text file that is similar to the one of the Python version. None of the comments on the Python version above apply to the Jython version, and the evaluation didn't reveal any particular problems.

[12] A possible exception would be a Windows-based system on which multiple Python versions are installed.

5.17 Computational performance: one file at a time (Python version)

5.17.1 KB Scientific Journals data set
The 'treeLaunch' tool was used to get an impression of Fido's performance when it is called to process one file at a time. It was set up to recursively traverse the KB Scientific Journals data set's directory tree, and run Fido for each encountered file object. For an individual file object this results in a command line like [13]:

c:\python27\python c:\fido\run.py -zip 1-1-70.pdf

Here, Fido's -zip switch activates recursing into ZIP files. The -zip switch was explicitly activated here in order to make the test results comparable with those of DROID (which recurses into ZIP files by default).

It took 1 hour and 6 minutes to analyse the whole KB Scientific Journals data set in this way. Figure 5-1 and Table 5-1 summarise the main results. On average Fido needs about 0.3 seconds per file object, with a maximum of about 1 second. This means that for the one-file-at-a-time scenario Fido’s overall performance beats DROID 6 by a factor of about 36. Interestingly, Figure 5-2 suggests that processing time increases slightly with file size.

Figure 5-1 Distribution of processing time per file object for Fido one-file-at-a-time scenario, KB Scientific Journals data set.

Table 5-1 Summary performance statistics for Fido one-file-at-a-time scenario (expressed in seconds per file, except N). N = number of files; q1, median and q3 are the 1st, 2nd and 3rd quartiles, respectively.

N       min     q1      median  mean    q3      max
11,892  0.2960  0.3120  0.3130  0.3219  0.3280  1.0780

[13] The explicit call to the Python interpreter was included here because omitting it resulted in a problem with the 'treeLaunch' tool. In most cases it can be omitted, and calling the Python script directly will work fine.

Figure 5-2 Scatter plot of processing time per file object versus file size for Fido one-file-at-a-time scenario, KB Scientific Journals data set. Note logarithmic scale on horizontal axis.

5.17.2 KB Large data set
Table 5-2 shows the results of an additional test that was done on the KB Large data set (again using the 'treeLaunch' tool). The table demonstrates that in general file size does not appear to affect processing time very much. Again the TAR archive is an exception; however, bearing in mind that TAR is a container format for which Fido also analyses all packed file objects, this is not surprising.

Table 5-2 Performance for individual files in KB Large data set

File name                         Size (bytes)   Processing time (s)
dpo_tonal_00990.tiff              36,420,900     0.375
IMAGE000060_lossless_colour.jp2   49,818,414     0.359
IMAGE000060.TIF                   114,664,502    0.375
SGD_19451955_0000002_ID371.pdf    325,101,508    0.359
DipAsset6984444754559678047.tar   534,968,320    17.532
KBDVD.iso                         683,180,032    0.36
KBDVD17062011.img                 693,993,472    0.359
DNEPABOstg.PST                    1,895,515,136  0.375


5.18 Computational performance: one file at a time (Jython version)

Since the documentation states that ZIP recursing doesn't yet work for the Jython version of Fido, the -zip option was not used in any of these tests. For the Jython version we started with the following command line:

timeit java -jar C:\Temp\identtools\fido_jar-0.9.5\fido.jar 1-70.pdf > jfidoOut.csv

However, although Fido didn't produce any error messages, this resulted in an empty output file. After some experimentation it turned out that the Jython version of Fido only works if it is called from its installation directory (in this case: C:\Temp\identtools\fido_jar-0.9.5\). This appears to be a bug. As a workaround we used this command line instead:

timeit java -jar fido.jar D:\aipSamplesUnpackedOneDir\1-70.pdf > jfidoOut.csv

However, this time Fido exited with the message "FIDO: Processed 0 files in 13399.13 msec", again with an empty output file. So in a final attempt we replaced the backward slashes with forward ones:

timeit java -jar fido.jar D:/aipSamplesUnpackedOneDir/1-70.pdf > jfidoOut.csv

This gave identical results to the ones presented above. So the preliminary conclusion is that the Jython version of Fido is not able to process files one at a time.

5.19 Computational performance: many files at a time

To test Fido's performance while working on a large number of objects, we performed a recursive scan of all file objects in the directory structure of the KB Scientific Journals data set and measured the time needed for this.

5.19.1 Python version

For the Python version we initially used the following command line14:

timeit C:\Python27\python C:\Temp\identtools\fido-0.9.3\fido\run.py -recurse -zip . > fidoOut.csv

Here, Fido's -recurse switch activates recursing into subdirectories, and -zip activates recursing into ZIP files (see above). Output is again redirected to a comma-delimited text file ('fidoOut.csv').

This gives the following result:

Elapsed Time: 0:03:35.764

This is similar to the corresponding DROID result, although Fido is marginally slower. However, unlike DROID, no subsequent exporting steps are needed to get human- or machine-readable output.

14 The explicit call to the Python interpreter was included here because omitting it resulted in a problem with the 'timeit' utility. In most cases it can be omitted, and calling the Python script directly will work fine.

After reviewing the resulting output of Fido, it turned out that the number of files that could not be identified was significantly larger than was the case with DROID (1385 for Fido compared to 566 for DROID 6). Upon closer inspection, this is probably due to the fact that Fido's default behaviour is to identify solely based on byte signatures, whereas DROID also uses file extensions if it cannot find a match for any of the format signatures. Fido has an -extension command line switch that, when activated, should result in similar behaviour. However, using this option resulted in a run-time error ('AttributeError'). Because of this, the -extension switch was not used in these tests.

5.19.2 Jython version

For the Jython version we started with the following command line15 (since the documentation states that ZIP recursing doesn't yet work for the Jython version of Fido, the -zip option was not used in any of these tests):

timeit java -jar C:\Temp\identtools\fido_jar-0.9.5\fido.jar -recurse . > jfidoOut.csv

However, although Fido didn't produce any error messages, this again resulted in an empty output file: as noted in the previous section, the Jython version of Fido only works if it is called from its installation directory. As a workaround we used this command line instead:

timeit java -jar fido.jar -recurse D:\aipSamplesUnpacked > jfidoOut.csv

However, this time Fido exited with the message "FIDO: Processed 0 files in 13367.23 msec", again with an empty output file.

Next we added a trailing backslash:

timeit java -jar fido.jar -recurse D:\aipSamplesUnpacked\ > jfidoOut.csv

This raised a Jython exception ("SyntaxError: mismatched character").

In a final attempt we replaced all backward slashes with forward ones:

timeit java -jar fido.jar -recurse D:/aipSamplesUnpacked/ > jfidoOut.csv

This produced the very first run of the Jython version of Fido that resulted in any output. However, upon inspection of the output it turned out that, in spite of the -recurse switch, Fido hadn't actually recursed into any of the subdirectories. This meant that only 6 (out of a total of 11,892) file objects were analysed! In a last attempt to get at least some idea of this tool's performance, Fido was run on a modified version of the dataset in which all file objects are in one single directory:

timeit java -jar fido.jar -recurse D:/aipSamplesUnpackedOneDir/ > jfidoOut.csv

15 As it happens, the -extension switch does actually work for the Jython version, but it was not used in any of these tests.

This gave the following result:

Elapsed Time: 0:49:05.387

So, where both DROID 6 and the Python version of Fido need about 3.5 minutes, the Jython version takes no less than 49 minutes. Compared to the Python version of Fido, the Jython version is roughly 14 times slower.

5.20 Stability

5.20.1 Python version

Although this has not been investigated in any detail so far, run-time errors occurred during the performance tests while using the -extension switch (see above).

5.20.2 Jython version

From the performance tests above it is obvious that stability is a major problem for the Jython version of Fido.

5.21 Error handling and reporting

Fido's documentation does not mention this, but from the tests it appears that error messages are written to the standard error device.
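This means that, in a workflow, error messages can be captured separately from the identification output by redirecting the standard error stream, e.g. (file names are illustrative):

run.py somefile.pdf > fidoOut.csv 2> fidoErrors.txt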

5.22 Provision of event information

Fido provides only a minimal amount of event information:

• The Fido version number is not included.
• The version number of the signature file is not included; in fact, Fido's signature and extension files do not appear to include any version info at all (which could lead to several unpleasant scenarios).
• Information on the method that was used to identify an object (signature or extension) can be derived indirectly from the 'signaturename' field in the output.
• Contrary to DROID, Fido is able to provide information on whether an object could be identified in the first place (and this information is reported by default).

Summarising, the provision of event information is in many ways similar to that of DROID 6, although being able to establish whether a file object could be identified at all is very useful. The current absence of any versioning information in the signature and extension files makes version management in operational settings virtually impossible.

5.23 Maturity and development stage

The first version of Fido was released in November 2010, and its authors currently consider Fido to be in beta stage. The Jython version was released much more recently, and the outcome of the current tests suggests that a pre-alpha stage might be more appropriate for this particular version.

5.24 Development activity

There have been 3 major releases since Fido's first release in November 2010, which indicates a high level of activity.


5.25 Existing experience

Despite the interest in Fido by various memory institutions, experience with this tool is still limited (which is unsurprising for such a young tool).

5.26 Unidentified files

Although not part of the evaluation framework, it is useful to know something about which files can and cannot be identified by a tool. Fido was unable to identify 1385 files in the test dataset (some of these are objects that are nested inside a ZIP archive). The following table shows the file extensions that correspond to non-identified files:

Extension  Count  Remarks
fil        2      Text files with checksum values (Elsevier)
mol        3      MDL MOL file (chemical table format, http://en.wikipedia.org/wiki/MDL_Molfile)
oa3        69     XML metadata format
pl         3      Perl script
pm         4      Perl script
r          1      R script (R, programming language for statistical applications)
sgc        52     SGML metadata (Elsevier)
sgm        199    SGML metadata (Elsevier)
toc        59     Text table specific to Elsevier
xml        172    XML files without XML declaration
pdf        73     Portable Document Format
cif        10     Crystallographic Information Framework
txt        1      Text file
csv        3      Comma delimited text file
raw        737    Extension reserved for 'raw bitmap' format; in test dataset this extension is used for plain text files that are part of Elsevier metadata

Note that these results cannot be directly compared to those of DROID, since DROID uses file extensions as a fallback mechanism if signature-based identification is not possible. Since Fido's -extension switch didn't turn out to work as it should, the above results are solely based on file signatures. The difference with respect to DROID is largely due to the contribution of the ".raw" files in the test dataset. These files are identified by DROID (based on their file extension), but wrongly: DROID assumes 'raw bitmap' format, whereas in reality they are plain text files. The 73 PDF files that could not be identified are more cause for concern, since PDF should always be identifiable using byte signatures. Fido may be using outdated file signatures (to this author's knowledge the PRONOM signatures of a number of PDF versions have recently been updated by TNA), but since the signature file doesn't contain any version information there's no way to find out where the signatures are coming from (which illustrates the point made about this in the preceding section on event information).
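For reference, the extension counts in the table above can be reproduced with a short script along the following lines. This is only a sketch: the positions of the match status and file name columns are assumptions about Fido 0.9's CSV output and should be verified against actual output files:

import csv
import os
from collections import defaultdict

counts = defaultdict(int)
with open("fidoOut.csv", "rb") as f:
    for row in csv.reader(f):
        # assumption: the first column holds the match status ("OK"/"KO"),
        # and the last column holds the file name
        if row and row[0] == "KO":
            ext = os.path.splitext(row[-1])[1].lstrip(".").lower()
            counts[ext] += 1

for ext, n in sorted(counts.items()):
    print "%s\t%d" % (ext, n)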

5.27 Conclusions

The evaluation of Fido revealed a substantial number of problems. Some of these problems only occur in the Jython version, whereas others are specific to the Python version. The following issues apply to both versions:


• Lack of any option to automatically convert information from PRONOM (or DROID signature files) to Fido format.
• Absence of any versioning information in Fido's signature and file extension files.
• Inability to identify Microsoft Office formats beyond the "OLE2 Compound Document Format" level.
• Limited provision of event information (similar to DROID 6).
• Reporting of MIME types doesn't appear to work.
• Reporting of relative file paths in the output file could be a problem in some cases (although this can be avoided by the user by always using full path references when invoking Fido).

Issues that are specific to the Python version of Fido (0.9.3) are:

• Undocumented behaviour of Fido's installation script, resulting in the installation of Fido in some Python system directories on Windows-based systems (this also applies to the Windows installer package).
• The batch files and shell scripts that are provided with Fido will, at best, only work when invoked from the directory in which these scripts are installed (and even then they will mostly fail, because they contain references to nonexistent files and to the Python interpreter without specifying its full file path).
• Confusing and partially outdated information in the readme file about the use of the Fido command line.
• Ditto for the GitHub site (which describes the command-line interface of a Fido version that has never been released as a Python version at all).
• Documentation describes a method for invoking Fido that is, in most cases, unnecessarily verbose.
• The -extension switch does not work, and its activation results in a run-time error.

The following issues are specific to the Jython version (0.9.5):

• The software only works if it is called from the directory in which the JAR file is installed.
• Erroneous handling of file paths under Windows (not tested under Linux).
• Attempts to analyse one file object at a time all failed (empty output without any error messages).
• Recursing into (sub)directories doesn't work.
• Extremely poor computational performance: about 14 times slower than the Python version while analysing many files at a time.

Despite these problems, the Python version of Fido is a potentially interesting candidate for inclusion in the SCAPE architecture, provided that these issues (most of which are relatively small) get fixed. Apart from the code itself, the importance of accurate and up-to-date documentation, utility scripts that work, and sensible behaviour of installation scripts should not be overlooked. In this regard the current situation makes things unnecessarily difficult and sometimes frustrating from a user's point of view, which will not help the acceptance and adoption of this tool. Fido's main strength is its performance when working on one file object at a time, which is about 35 times better than DROID 6. For many files at a time, the performance is similar to DROID. In addition, compared to DROID, Fido's command-line interface is somewhat better suited for use in automated workflows, since no additional data exporting actions are needed after the identification process.


For the Jython version things are rather different. Apart from the fact that none of the tests with this version were particularly successful, its performance is also very poor, much worse even than DROID 6. This raises the question of why anyone would need a Java-based version of Fido in the first place, especially with Python being available for nearly all popular platforms.

On a final note, it may be worthwhile to consider an upgrade of Fido to code that is compatible with the Python 3.x syntax. This is not an urgent issue (also, the Python 2.x range of interpreters is still widely used), but it might make things easier in the long run.


6 Unix File Utility

6.1 Overview

File is a command-line utility that is part of every major Unix and Unix-like operating system. The first version of this tool dates back to 1973. It identifies files based on signatures ('magic numbers') stored in a 'magic' file. An open source implementation of the tool exists, and it has been ported to other operating systems (e.g. Windows). The file utility reports identification results as MIME types. File performs three sets of tests: filesystem tests, magic tests, and language tests. The first test that succeeds terminates the utility.

The filesystem tests are based upon the results of a stat system call, and determine whether the file is empty or a special file, e.g. a symbolic link or a named pipe.

The magic tests check for files with data in particular fixed formats. Any file with some invariant identifier at a small fixed offset into the file can usually be described in this way. The identifiers are read from a compiled magic file, present at one of a set of possible locations or passed as an argument to the utility.
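As an illustration, a (simplified) magic entry for PDF might look like the following; the actual entries in 'magdir' are more elaborate and vary between versions:

# offset   type     test      message
0          string   %PDF-     PDF document
!:mime     application/pdf
>5         byte     x         \b, version %c

The first column gives the offset of the test, the second the data type, the third the value to test against, and the last the text that is reported on a match; lines starting with '>' are continuation tests that refine the result.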

If both the above tests fail, the file is tested to see if it is a text file. ASCII, ISO-8859-x, non-ISO 8-bit extended-ASCII sets, UTF-8 encoded Unicode, UTF-16 encoded Unicode, and EBCDIC are tested for. If a text character set is identified, a language check is performed using keywords, e.g. the presence of the word "struct" indicates a C program. Command line options can be used to disable some of these tests.

All tests that are presented here were done with a Windows port of version 5.03 of the File utility which is part of the GnuWin package16. This version dates back to 2009. The most recent version of File is version 5.07, but for this version no ready-to-use Windows binaries are available (although it is possible to create these by compiling the source code).

6.2 Tool interface

File has a command line interface.

6.3 License type

File is released under the original 4-clause BSD License, which permits use, modification, inclusion in other products, and redistribution, but contains the controversial advertising clause.

6.4 Language

The file utility is written in C.

6.5 Platform dependencies

The file utility comes as standard with all major Unix and Unix-like (Linux) operating systems. It is also available for Windows as a stand-alone utility or as part of the Cygwin distribution, although it is not packaged as part of the basic Cygwin install. According to its website, File is also included in OS X, but we could not confirm this.

16 Link: http://gnuwin32.sourceforge.net/packages/file.htm

6.6 Coverage of file formats

A report by Underwood (2009) mentions that 'magic' file version 4.21 covers about 2000 file types, although that version dates back to 2008 or earlier (the current version is 5.07).

6.7 Extendibility

The file command uses the contents of magic files containing invariant identifiers and offsets to their position. New signatures can be added, compiled and tested by anybody. To get signatures added to the official magic files they need to be supplied in magic format to the maintainer, Christos Zoulas. However, the way in which the signatures are organised is rather clumsy: the 'magic' file is compiled from a set of over 200 separate smaller files which are stored in a directory called 'magdir' in the source distribution. The documentation of File doesn't describe how this is done, and apparently no explicit version management exists either. In fact, the 'Bugs' section of the File man page acknowledges that this is a problem by commenting that "[t]here must be a better way to automate the construction of the Magic file from all the glop in magdir"17.
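For reference, File itself can compile a magic source file into the binary form it loads at run time, e.g.:

file -C -m magic

This writes a pre-parsed 'magic.mgc' file to the current directory (the exact behaviour of the -C option may differ between File versions).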

6.8 Output format

The file command outputs its results as plain text, usually in the format:

file_name ; identification text

The format of the identification text depends upon the arguments used to invoke file. Without arguments the output is free text and contains some basic characterisation information depending upon the file format (e.g. number of words). When invoked with the -i option the output is of the format:

file_name ; mime-type; character-encoding

6.9 Unique output identifiers

Identification results are reported as MIME types (e.g. 'application/pdf') and character encoding. Alternatively, results may be reported as a textual description (e.g. 'PDF document, version 1.3').
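To illustrate, an invocation as used in the tests below might produce output along these lines (the file name is hypothetical, and the exact separators and wording depend on the version, options and magic file used):

file --mime-type --mime-encoding report.pdf
report.pdf; application/pdf; charset=binary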

6.10 Granularity of output

The use of MIME type as the primary identifier implies that there is no one-to-one match with the PRONOM or OPF registries. Since PRONOM also contains MIME type information, a mapping to PRONOM is possible. However, PRONOM's PUID classification uses a finer level of granularity than the more general MIME type subdivision, which means that for many formats the mapping to PRONOM will result in a range of possible PUIDs for one single MIME type. An alternative would be to use File's textual descriptions, which provide a higher level of granularity.

6.11 Accuracy of reported results

Not analysed yet.

6.12 Comprehensiveness and completeness of reported results

Compared to DROID and Fido, the reporting of the analysis results is rather sparse (although it provides additional information on character encoding). As with Fido, File reports relative file paths if relative paths are used on the command line interface, but again this behaviour can be avoided by using full paths on the command line.

17 Link: http://linux.die.net/man/1/file

6.13 Fit to needs of preservation community

Unlike DROID and Fido, File is not specifically targeted at the preservation community. Also, its documentation states that "file uses several algorithms that favor speed over accuracy, thus it can be misled about the contents of text files"18. Nevertheless, inclusion of the tool in automated workflows would be quite straightforward.

6.14 Ability to deal with nested objects

File has an '--uncompress' option, which (according to its documentation) causes the tool to try to look inside compressed files. A simple test on a ZIP file produced an abnormal (non-zero) return code with an accompanying "compressed file format not implemented" message. We didn't do any further tests on other compressed formats, but overall it seems advisable not to use this switch in any workflows19.

6.15 Ability to deal with composite objects

Formats such as Microsoft Word 97 and Open Document Format are based on multiple file objects that are held together by a physical container (e.g. OLE2 for Microsoft Word 97, and ZIP for Open Document Format). Some preliminary tests showed that Open Document Text files are identified as Open Document format (and not as plain ZIP, even though ZIP is the container format). Interestingly, an EPUB file (which also uses ZIP as a container) was identified as 'application/octet-stream', not even as a regular ZIP file. MS Word files were identified as 'application/msword'; however, one MS Excel spreadsheet was identified as 'application/vnd.ms-office' (whereas other Excel files in the test dataset were correctly identified as 'application/vnd.ms-excel'). So overall the identification of these formats appears to be hit and miss. It should be noted here that a relatively old 'magic' file was used, dating back to early 2009, so things may have improved in newer versions. As we were not able to locate a pre-compiled version of a more recent 'magic' file, we could not follow up on this at this stage.

HTML represents another class of composite objects, where, for example, an individual HTML file refers to external style sheets and images, which are all needed for proper rendering. The main difference with e.g. Open Document Format or EPUB is the absence of a physical container file. File does not have any mechanism to recognise the interdependencies between the individual components of this particular class of ‘composite objects’.

6.16 User documentation

There are several different versions of the File command, and the website of its main developer doesn't include any original documentation, but instead links to the manual page of the tool for the OpenBSD platform20. For several other platforms similar (but not quite identical) manual pages exist. Although this is somewhat confusing, the functionality and syntax of all versions of File that are part of any Unix distribution are quite strictly constrained by a technical standard21 from the Open Group, which is the certifying body for the UNIX trademark. As a result, they can all be used more or less interchangeably. In addition, File's --help switch lists all command line options.

18 See following link: http://www.openbsd.org/cgi-bin/man.cgi?query=file&apropos=0&sektion=1
19 This applies in particular to many-files-at-a-time scenarios such as in Section 6.18, since File will exit upon the first encountered ZIP file in that case.
20 Link: http://www.openbsd.org/cgi-bin/man.cgi?query=file&apropos=0&sektion=1
21 See also: http://pubs.opengroup.org/onlinepubs/9699919799/utilities/file.html

6.17 Computational performance: one file at a time

6.17.1 KB Scientific Journals data set

The 'treeLaunch' tool was used to get an impression of File's performance when it is called to process one file at a time. It was set up to recursively traverse the KB Scientific Journals data set's directory tree, and run File for each encountered file object. For an individual file object this results in a command line like:

file --mime-type --mime-encoding --no-pad 1-1-70.pdf

It took slightly over 5 minutes to analyse the whole KB Scientific Journals data set in this way. Figure 6-1 and Table 6-1 summarise the main results. On average File needs less than 0.03 seconds per file object, with a maximum of about half a second. This means that for the one-file-at-a-time scenario File’s overall performance beats DROID 6 by a factor of about 440. As with Fido, File shows a slight increase in processing time with file size (Figure 6-2).

Figure 6-1 Distribution of processing time per file object for Unix File utility one-file-at-a-time scenario, KB Scientific Journals data set.

Table 6-1 Summary performance statistics for Unix File utility one-file-at-a-time scenario, KB Scientific Journals data set (expressed in seconds per file, except N). N = number of files; q1, median and q3 are the 1st, 2nd and 3rd quartiles, respectively.

N       min      q1       median   mean     q3       max
11,892  0.01500  0.01600  0.01600  0.02706  0.03100  0.53100


Figure 6-2 Scatter plot of processing time per file object versus file size for Unix File utility one-file-at-a-time scenario, KB Scientific Journals data set. Note logarithmic scale on horizontal axis.

6.17.2 KB Large data set

Table 6-2 shows the results of an additional test that was done on the KB Large data set (again using the 'treeLaunch' tool). The table demonstrates that in general file size does not affect processing time very much. Once again the TAR archive is an exception. However, unlike DROID or Fido, File does not analyse the contents of container files, which makes the increase in processing time somewhat surprising in this case.

Table 6-2 Performance for individual files in KB Large data set

File name                           Size (bytes)    Processing time (s)
dpo_tonal_00990.tiff                36,420,900      0.109
IMAGE000060_lossless_colour.jp2     49,818,414      0.031
IMAGE000060.TIF                     114,664,502     0.047
SGD_19451955_0000002_ID371.pdf      325,101,508     0.047
DipAsset6984444754559678047.tar     534,968,320     0.422
KBDVD.iso                           683,180,032     0.047
KBDVD17062011.img                   693,993,472     0.031
DNEPABOstg.PST                      1,895,515,136   0.156


6.18 Computational performance: many files at a time

To test File's performance while working on a large number of objects, we performed a recursive scan of all file objects in the directory structure of the KB Scientific Journals data set and measured the time needed for this. Unlike DROID and Fido, File doesn't have a built-in ability to recursively scan a directory structure. However, it is possible to specify all files that need to be analysed in a text file (using the '--files-from' switch). Therefore, we first generated this list using Windows' 'dir' command, and used it as input to File. This could be done even more efficiently using pipes, which eliminates the need to create a temporary file. As the 'timeit' utility appeared to crash on most Windows system commands (including 'dir'), the temporary file approach was used here, using the following commands22:

dir /B /S /A-D D:\aipSamplesUnpacked > temp
timeit C:\Temp\identtools\file-5.03-bin\bin\file.exe --mime-type --mime-encoding --no-pad --files-from temp > fileOut.txt

The first line calls the Windows ‘dir’ command with the options /B (bare format), /S (recursively walk through directory structure) and /A-D (don’t display files that have the ‘directory’ attribute). Output is written to a temporary file. The second line calls File, using the temporary file as input. Unlike the DROID and Fido experiments, this will not recurse inside ZIP archives (see above for the reason for this). Also note that the ‘dir’ command was excluded from the performance measurement due to an incompatibility issue with the timer tool. However, the total added overhead of this first step is negligible (about 2 seconds).

The result for the second step (File) is:

Elapsed Time: 0:01:52.311

This means that File needs about 2 minutes to scan 1.15 GB of data in the test environment used. This is better than both DROID 6 and Fido (although the results are not 100% comparable, since those tools also looked into ZIP archives).

6.19 Stability

Not yet tested explicitly for SCAPE, but File is used:

• In nearly all BSD distributions
• In nearly all Linux distributions
• libmagic (File's underlying library) is used by the Apache httpd server's mod_mime_magic module

This represents a significant user community. Reported bugs are generally fixed quickly. Also, the current tests did not result in any crashes or other stability issues.

6.20 Error handling and reporting

The file utility returns 0 upon success and non-zero on failure.
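In a Windows batch workflow this return code can be checked in the usual way, for example (the file name is purely illustrative):

file --mime-type mystery.dat
if errorlevel 1 echo File reported an error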

22 The equivalent command using pipes would be: dir /B /S /A-D D:\aipSamplesUnpacked | C:\Temp\identtools\file-5.03-bin\bin\file --mime-type --mime-encoding --no-pad --files-from - > fileOut.txt

6.21 Provision of event information

The version of the file command can be obtained by using the -v argument, which also returns the location of the magic file (but not its version, which is unsurprising given that no versioning system appears to exist).
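For example (output abbreviated; the version string and magic file location will differ per installation):

file -v
file-5.03
magic file from C:\Program Files\GnuWin32\share\misc\magic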

For file objects that cannot be identified File returns MIME type 'application/octet-stream'. Zero-byte files result in MIME type 'application/x-empty'. If File is instructed to analyse a file object that doesn't exist, it will report this as follows:

nonexistentFile.dat; cannot open `nonexistentFile.dat' (No such file or directory)

6.22 Maturity and development stage

The original version of file originated in Unix Research Version 4 in 1973. All major BSD and Linux distributions use a free, open-source reimplementation that was written from scratch by Ian Darwin in 1986-87. It was expanded by Geoff Collyer in 1989 and has since had input from many others. From late 1993 onwards its maintenance has been organised by Christos Zoulas.

6.23 Development activity

During the first 6 months of 2011, three major versions (5.05, 5.06 and 5.07) of File were released23. The tool's long history and its inclusion in so many operating systems make it extremely unlikely that development will cease in the foreseeable future.

6.24 Existing experience

Since File has been around for such a long time and its use is so widespread, there is a large user community with extensive experience of this tool. Less is known about its use in the archival community, although File is included in the FITS tool set (which is evaluated in Chapter 7 of this report).

6.25 Unidentified files

Although not part of the evaluation framework, it is useful to know something about which files can and cannot be identified by a tool. File was unable to identify 22 files in the test dataset; they were all reported as 'application/octet-stream'. These were mainly plain text files with a .raw extension which are part of Elsevier's metadata. In addition, one TIFF image could not be identified. However, upon closer inspection it turned out that this file was probably corrupted24. A cursory scan of File's output also revealed that some plain text files were mistakenly identified as C++, Fortran, Lisp and Pascal source code.

6.26 Conclusions

The evaluation of the File utility revealed a number of particular strengths as well as some potential problems.

Its main strengths are:

23 See following link: ftp://ftp.astron.com/pub/file/

24 Interestingly, this file was judged to be 'well-formed and valid' by JHOVE 1.4; however, upon opening in IrfanView the image turned out to be mostly blank, and a further inspection in a Hex editor revealed suspicious amounts of null-bytes.

• Excellent computational performance, both for one- and many-files-at-a-time use cases
• Maturity, stability and width of user base
• Large number of supported formats (although documentation on which formats are supported doesn't exist, and this information cannot be easily extracted from the 'magic' file either)

Potential problems for use in preservation workflows are:

• Management of file signatures (especially creation and updating of the 'magic' file from hundreds of smaller files, for which we were not able to track down any documentation)
• Lack of any versioning information in the 'magic' file
• One-to-one mapping of MIME types to PRONOM identifiers is not possible for every format
• Mapping to textual descriptions would overcome this to a large extent, but creating such a mapping may be quite cumbersome
• Identification of text-based formats can be unreliable (although based on the current analyses it is not possible to say whether DROID or Fido are any better!)
• No scanning inside ZIP archives
• Latest binaries are not always available for all major platforms (e.g. Windows), so it may be necessary to build these from the source code

Despite these problems, File may nevertheless be of considerable value to the SCAPE architecture. Compared to DROID and Fido, it results in a considerably lower number of unidentified files (although the results are not completely comparable because of the non-inclusion of file objects that are embedded in ZIP archives). One possibility would be to use File as a primary identification tool, and use DROID and/or Fido in addition for any formats that are not handled by File in sufficient detail.


7 FITS (File Information Toolset) 0.5

7.1 Overview

FITS is an acronym of 'File Information Tool Set'. It is not a stand-alone characterisation tool, but rather a wrapper around a number of external tools. Currently, these are:

• Jhove (AKA Jhove1)
• Exiftool
• National Library of New Zealand Metadata Extractor
• DROID 3.0
• FFIdent
• Unix File Utility (Windows port)

In addition, FITS supplies two original tools: FileInfo and XmlMetadata. It is also possible to add other tools. FITS converts all native output from these underlying tools into a common format (FITS XML). In addition, FITS is capable of handling conflicting output of these tools in a number of ways. FITS is developed by the Harvard University Library Office for Information Systems, and it is released as open source software. Evaluated here is version 0.5, which was released in February 2011. Apart from identification, FITS can also be used for feature extraction and validation. This report only addresses the identification functionality.

7.2 Tool interface

FITS has a command line interface, which can be accessed using a batch file (Windows) or a shell script (Linux/OS X). The batch file only works if it is invoked directly from the application directory (i.e. the directory in which the batch file is installed). Unlike DROID (which has a similar issue), in the case of FITS there doesn't appear to be any easy way of getting around this. Alternatively, the FITS functionality may also be accessed through a Java application programming interface (API).

7.3 License type

FITS is released under a GNU Lesser GPL license. However, the tools that are bundled with FITS are released under a number of different license types. The same applies to a number of libraries that are used by FITS.

7.4 Language

FITS is written in Java.

7.5 Platform dependencies

According to the user documentation FITS runs on Windows, Linux and OS X (Mac). Furthermore, Java 1.6 or higher is required.

7.6 Coverage of file formats

Coverage depends on the wrapped tools. Both DROID and the Unix File Utility are wrapped by default, so the format coverage is at least the union of the formats that are recognised by these tools.


7.7 Extendibility

In principle any type of tool can be added to FITS. The FITS output format also allows the use of 'external identifiers', which means that any kind of format identifier can be used.

7.8 Output format

FITS reports its output in FITS XML format25. In addition, an option exists to convert FITS output to a standard metadata schema (e.g. textMD for text files or NISO/MIX for image files). As this option is mainly relevant for feature extraction, it will not be discussed further here.

7.9 Unique output identifiers

By default, FITS reports the identification results in terms of a (textual) format description (e.g. "Plain text") and a MIME type. Depending on the tool that was used for the identification, the results may also contain one or more so-called "external identifiers", which are the native identifiers of the wrapped tools. For instance, if one of the tools that identified a particular file was DROID, the FITS output file will contain one or more external identifier elements with the corresponding PUIDs. Note, however, that the FITS output will not contain any PUID information at all for files that are not recognised by DROID.
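By way of illustration, the identification section of a FITS output file has roughly the following shape. This is a hand-written, simplified example: the element names follow the FITS XML schema as we understand it, but the values are invented:

<identification status="SINGLE_RESULT">
  <identity format="Portable Document Format" mimetype="application/pdf">
    <tool toolname="Jhove" toolversion="1.5" />
    <tool toolname="Droid" toolversion="3.0" />
    <externalIdentifier toolname="Droid" toolversion="3.0" type="puid">fmt/18</externalIdentifier>
  </identity>
</identification>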

7.10 Granularity of output

Mapping to PRONOM and the OPF registry is possible if the FITS output file contains PUID information. However, PUIDs are not used as the primary identifier, and the FITS output may not contain any PUIDs at all.

7.11 Accuracy of reported results

Not analysed yet.

7.12 Comprehensiveness and completeness of reported results

The information in the FITS output file is very comprehensive. It contains the status of the identification ("SINGLE_RESULT", "CONFLICT" or "UNKNOWN"), the format name (as a textual description), the MIME type, and the name and version of the tool that was used to establish this information. Also, FITS provides comprehensive event information (see also Section 7.21).

7.13 Fit to needs of preservation community

FITS is specifically targeted at the preservation community (and it is also being developed by a member of that community). The comprehensiveness of its output and the emphasis on providing event information (see Section 7.21) make FITS particularly suited for deployment in automated workflows.

7.14 Ability to deal with nested objects

FITS 0.5 is not able to look inside ZIP files (or other file archive formats).

7.15 Ability to deal with composite objects

Some limited tests on a number of composite formats revealed the following:

• Microsoft Word and Excel files are identified correctly, although without any indication of the specific format version.

25 The schema of FITS XML is located here: http://hul.harvard.edu/ois/xml/xsd/fits/fits_output.xsd


• Any attempt at analysing OpenDocument Text files resulted in an exception being thrown by the NLNZ Metadata Extractor (this was tried on several files). An output file was written nevertheless, and the ODT files were also identified correctly.
• An EPUB file was erroneously identified as an OpenDocument Text file. This is not completely surprising, as EPUB uses the Open Container Format (OCF), which describes the packaging of the individual file components of an EPUB publication in a ZIP archive; OCF is in turn based on the Open Document Format (of which ODT is a sub-format). Incidentally, FITS did not crash on this particular file. Another EPUB file was identified as an ordinary ZIP file.

HTML represents another class of composite objects, where, for example, an individual HTML file refers to external style sheets and images, which are all needed for proper rendering. The main difference with e.g. Open Document Format or EPUB is the absence of a physical container file. FITS does not currently have any mechanism to recognise the interdependencies between the individual components of this particular class of ‘composite objects’.

7.16 User documentation

Documentation is provided as an online 'FITS User Guide'26. The information in the User Guide is adequate for getting started with FITS, although for serious (e.g. operational) use of the tool it is somewhat lacking in detail. This applies in particular to the explanation of the output format.

As an example, for file objects that cannot be identified, FITS reports the identification status attribute as "UNKNOWN". According to the User Guide, this attribute can only have a value of "SINGLE_RESULT" or "CONFLICT" (without any mention of "UNKNOWN")! In fact, the "UNKNOWN" value is not included in the FITS output schema either, which means that the output file is not valid according to its own schema27 (this was confirmed by a check using an XML validator tool)28.

Also, as described in Section 7.9, by default FITS reports the identification results in terms of a (textual) format description (e.g. "Plain text") and MIME type. However, from the documentation it is not clear where either the format descriptions or the MIME type strings originate from. An inspection of the FITS directory structure reveals that this is done using a number of style sheets (one for each tool). Although this information is not necessarily needed to run FITS, having some general idea of where the descriptions and MIME types come from is certainly helpful in interpreting the results29.

A final example: FITS uses the concept of a ‘format tree’ to handle formats that are a subset of a more general format. Although the general idea is mentioned in the User Guide, a more detailed description of the specific format of the format tree is currently missing. Moreover, the format tree is defined in terms of textual format descriptions that are not documented anywhere (see above). Users of the software who wish to wrap their own tools in FITS will however need this information.

26 Link: http://code.google.com/p/fits/wiki/user_guide
27 A commenter on the online User Guide reports a similar issue where the status attribute is "PARTIAL". See following link (bottom of page): http://code.google.com/p/fits/wiki/general
28 After reporting the above issue on the FITS project website we got an almost immediate response from one of the developers, who instantly produced a corrected version of the XML schema. So it would be safe to assume that the schema-related issues will be fixed in an upcoming release.
29 Just an example: since this type of output is so similar to the output of the Unix File tool, this could easily lead to the erroneous assumption that FITS simply copies the identification results of 'File' (which is not the case!). So explaining this in the documentation would be helpful to avoid such misunderstandings.

7.17 Computational performance: one file at a time

7.17.1 KB Scientific Journals data set

The 'treeLaunch' tool was used to get an impression of the performance of FITS when it is called to process one file at a time. It was set up to recursively traverse the KB Scientific Journals data set's directory tree, and run FITS for each encountered file object. For an individual file object this results in a command line like:

fits -i 1-1-70.pdf

It took slightly less than 21 hours to analyse the whole KB Scientific Journals data set in this way. Figure 7-1 and Table 7-1 summarise the main results. On average FITS needs 6.3 seconds per file object, with a maximum of 23.7 seconds. For the one-file-at-a-time scenario this is almost twice as fast as DROID 6, but compared to Fido or the Unix File utility this is still quite slow. Processing time appears to increase quite markedly with file size (Figure 7-2). One noteworthy feature of Figure 7-2 is that it shows some marked clustering around a number of narrowly-defined time intervals. These clusters can also be seen as minor 'peaks' in Figure 7-1, and appear to be caused by the invocation of format-specific tools (most of the files in these clusters are XML).

The ‘issues’ list at the FITS project Wiki also contains an entry about the tool’s processing performance30. In a response to this, Spencer McEwen comments that in his experience JHOVE and NLNZ Metadata Extractor take the longest amount of time, and read much more data from the analysed files than the other tools. So, we repeated the performance test with these two tools disabled. Performance was somewhat better in this case, but FITS still needed about 17 hours to process the dataset.

30 Link: http://code.google.com/p/fits/issues/detail?id=20

Figure 7-1 Distribution of processing time per file object for FITS one-file-at-a-time scenario, KB Scientific Journals data set.

Table 7-1 Summary performance statistics for FITS one-file-at-a-time scenario, KB Scientific Journals data set (expressed in seconds per file, except N). N = number of files; q1, median and q3 are the 1st, 2nd and 3rd quartiles, respectively.

N       min    q1     median  mean   q3     max
11,892  5.390  5.750  5.891   6.269  6.156  23.688


Figure 7-2 Scatter plot of processing time per file object versus file size for FITS one-file-at-a-time scenario, KB Scientific Journals data set. Note logarithmic scale on horizontal axis.

7.17.2 KB Large data set

Table 7-2 shows the results of an additional test that was done on the KB Large data set (again using the 'treeLaunch' tool). The table shows that processing time is strongly related to file size. Figure 7-3 illustrates this graphically.

Table 7-2 Performance for individual files in KB Large data set

File name                           Size (bytes)    Processing time (s)
dpo_tonal_00990.tiff                36,420,900      12.828
IMAGE000060_lossless_colour.jp2     49,818,414      8.563
IMAGE000060.TIF                     114,664,502     11.172
SGD_19451955_0000002_ID371.pdf      325,101,508     156.003
DipAsset6984444754559678047.tar     534,968,320     64.079
KBDVD.iso                           683,180,032     72.361
KBDVD17062011.img                   693,993,472     78.048
DNEPABOstg.PST                      1,895,515,136   150.972


Figure 7-3 Scatter plot of processing time per file object versus file size for FITS one-file-at-a-time scenario, KB Large data set.

7.17.3 Additional tests on influence of FITS configuration

FITS differs from most of the other tools covered by this report in that it wraps multiple tools, each of which adds to the total processing time. It is possible to switch individual tools on or off through FITS' configuration files. In order to get an indication of the impact of this on FITS' performance, we performed an additional test on a small set of 5 files. Performance was measured for two different FITS configurations:

• FITS' standard configuration.
• A 'minimal' configuration, in which all tools except the Unix File tool are disabled.

Table 7-3 shows the results.


Table 7-3 Processing time for some format-size combinations using standard and 'minimal' FITS configuration.

Format    Size    Processing time (s),      Processing time (s),
                  standard configuration    'minimal' configuration
PDF       25 MB   10.5                      2.2
PDF       69 KB   6.2                       2.3
JPEG      69 KB   6.1                       2.3
ZIP       8 MB    20.3                      2.4
MS Word   2.5 MB  13.1                      2.3

The table shows that the processing time per file decreases quite dramatically with the 'minimal' FITS configuration. Despite this, the overall performance doesn't come close to that of the Unix File tool in stand-alone mode, and FITS adds quite a bit of overhead of its own (which is probably due to initialisation).
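Disabling tools is done by editing the tool list in FITS' main configuration file (xml/fits.xml). The sketch below indicates what such a 'minimal' configuration might look like; the class names shown are assumptions based on the FITS 0.5 distribution and should be checked against the actual file:

<tools>
  <!-- only the Unix File utility remains enabled -->
  <tool class="edu.harvard.hul.ois.fits.tools.fileutility.FileUtility" />
  <!-- other tools can be disabled by removing or commenting them out, e.g.: -->
  <!-- <tool class="edu.harvard.hul.ois.fits.tools.jhove.Jhove" /> -->
</tools>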

7.18 Computational performance: many files at a time

Unlike DROID and Fido, FITS 0.5 does not have any functionality for processing multiple files in one run.

7.19 Stability

Not investigated in detail so far. FITS did raise an exception while attempting to analyse a non-existent file. In addition, various exceptions were raised by the NLNZ Metadata Extractor tool, but these didn't lead to any subsequent FITS exceptions.

7.20 Error handling and reporting

The FITS User Guide is not specific on this. It seems that any error messages are written to the console's standard error device.

7.21 Provision of event information

The FITS output file contains extensive event information, including:

• The FITS version number
• A time stamp
• Names and version numbers of all tools that were used in the identification process
• Status fields that indicate whether the output of different tools gives conflicting information
• A status field that indicates whether a file could be identified at all
• Provenance of identification results (i.e. the specific tool and tool version of each result)

One omission is that FITS does not report the version of the DROID signature file that was used (which is a particularly important piece of information). This issue aside, FITS' provision of event information is superior to that of any of the other tools that have been investigated in this report.

7.22 Maturity and development stage

The first version of FITS (0.2.5) was released in (late?) 2009. So far there have been nine releases, four of which were major releases. Although the documentation makes no mention of the current development stage of FITS, the number of the most recent release (0.5) suggests that the software is currently in beta stage.


7.23 Development activity

With seven releases between January 2010 and February 2011, development appears to be quite active.

7.24 Existing experience

Although FITS is fairly well-known within the preservation community, the degree to which it is used by memory institutions is largely unknown.

7.25 Unidentified files

No files were left unidentified. However, a number of text files with a ".raw" extension were wrongly identified as "raw bitmap" (source: DROID).

7.26 Conclusions

The evaluation of FITS 0.5 reveals a mixed picture. Its main strengths are:

• Its ability to combine the identification functionality of several tools, and normalise the output of these tools.
• The possibility to add new tools.
• It supports a large number of file formats.
• Comprehensiveness of the output files.
• Use of XML (and an associated schema) for output enables validation of the output files.
• Provision of detailed event information (except the DROID signature file version, which is strangely absent). This makes FITS closely tailored to the specific needs of digital repositories.
• The behaviour of FITS is highly configurable (e.g. tools can be disabled, even for specific file extensions).
• The FITS 'format tree' provides a sensible mechanism for ensuring that, in case of multiple identification results by different tools, the most specific result is used.

In addition, FITS also offers the option to normalise its output according to standard metadata schemas such as NISO/MIX or textMD (although this is largely outside of the scope of this document).

However, there are some problems too:

• Most importantly, FITS is slow (although calling FITS from the Java API will most likely result in better performance than what was reported here).
• There is a lack of any option to process many files in one FITS run (which would improve performance by reducing initialisation times).
• The batch launcher only works when it is executed from the FITS installation directory, which is both clumsy and unnecessary.
• The User Guide provides enough information to get started with FITS, but for any "serious" (e.g. operational) applications it is lacking in detail.
• This also applies to the documentation of the configuration options: even though almost everything in FITS is highly configurable, the current documentation is not sufficient to take full advantage of this.
• The FITS identification results are reported as textual format descriptions, which are currently not documented anywhere.


• One-to-one mapping of results to PRONOM identifiers is not possible for every format31.
• No scanning inside ZIP archives.

Of the above problems, FITS' performance is the most serious one. However, it is important to stress that the results of FITS cannot be directly compared to those of the other tools in this report. The main reasons for this are:

1. Unlike those other tools, each FITS run involves the invocation of multiple underlying tools, and each tool adds to the total processing time.
2. Identification is only a small part of FITS' functionality: it also includes feature extraction and validation. Computationally such operations are more 'costly' than simple identification, because files may need to be read and analysed in their entirety.
3. FITS has no built-in functionality for analysing many files at a time, which means that the results for this use case had to be obtained by repeating the one-file-at-a-time use case for all file objects in the dataset.
4. The actual performance of FITS is affected by its configuration: performance can be improved by disabling specific tools.

As for the last point: even with JHOVE and the NLNZ Metadata Extractor (supposedly the slowest tools that are wrapped inside FITS) disabled, performance was still very poor in the tests. This casts some doubt on the suitability of FITS for processing large volumes of data in an operational setting.

On a final note, performance is likely to be better if FITS is invoked through its Java API (instead of the command line). Additional tests are necessary to confirm this.

31 This is not really a limitation of FITS: some formats simply do not have an associated PRONOM identifier, and for tools that do not use some other identifier there is no mapping back to PUID.

8 JHOVE2

8.1 Overview

JHOVE2 is the successor of the well-known JHOVE tool. Its creators refer to JHOVE2 as a "Java framework for next-generation format-aware characterization". It is being developed by the California Digital Library, Portico, and Stanford University, with funding from the Library of Congress under its National Digital Information Infrastructure and Preservation Program (NDIIPP). JHOVE2's functionality comprises identification, feature extraction, validation and policy-based assessment. The current analysis is restricted to JHOVE2's identification functionality. JHOVE2 is developed as open-source software. We evaluated JHOVE2 2.0.0 (released April 2011).

8.2 Tool interface

JHOVE2 has a command line interface, for which a Windows batch file and a Unix shell script are provided. The batch file ("jhove2.cmd") contains the following reference to a configuration script ("env.cmd") that sets up a number of environment variables:

call env

This only works if the batch file is launched from its installation directory, and will go wrong otherwise (unless the installation directory is included in Windows' 'path' environment variable).

This can be fixed by changing the call to:

call %~dp0\env

This inserts the path to the directory where “jhove2.cmd” is installed32.

8.3 License type

JHOVE2 is released under an open-source BSD License. JHOVE2 uses a number of third party components, which are all released under a variety of open source license types.

8.4 Language

JHOVE2 is written in Java (version 6).

8.5 Platform dependencies

According to the JHOVE2 User Guide the software is designed to work with any implementation that is fully compliant with Java 6. The developers also state that the software has been tested on Sun's implementation of Java 6 on Windows, Solaris and Linux, and on Apple's implementation of Java 6 on Mac OS [JHOVE2a].

8.6 Coverage of file formats

JHOVE2 uses DROID 4.0 for identification33. DROID's identification is based on file signatures or, alternatively, known extensions. These are defined in a signature file, which is regularly updated by The National Archives. Strangely, the JHOVE2 install contains a very old version (20) of the DROID signature file (the current version is 49!). The DROID version that is used by JHOVE2 doesn't support 'container signatures' for container formats such as ZIP (Open Document and Microsoft Office Open XML formats) and OLE2 (container for the Microsoft Office formats).

32 This issue will be corrected in the next code release (Stephen Abrams, pers. comm.)
33 Strangely, JHOVE2's documentation doesn't appear to mention the included version of DROID, and the DROID license file that is included with JHOVE2 is for DROID 3.0.

8.7 Extendibility

Since the number of formats that DROID can handle is defined by the information in the signature file, new formats can be added by modifying the signature file (or downloading the latest version from The National Archives). However, the configuration file of JHOVE2's identification module contains a comment stating that "[t]he DROID Signature and Configuration files in the JHOVE2 distribution have been edited to, among other things, detect additional signatures for formats". The nature and extent of this "editing" is not further specified, which raises some concerns about what happens if a user replaces the default signature file (which is very old) with a more recent one.

Another relevant feature of JHOVE2 is that external tools can be wrapped. In principle this would make it possible to incorporate additional identification tools into the JHOVE2 framework. Apart from this, individual modules of JHOVE2 can also be switched off (e.g. the policy-based assessment module, or the module that calculates message digests).

8.8 Output format

JHOVE2 can report output in text, JSON and XML format. The output format can be set with a command-line switch. Output is sent to the standard output device (default) or to a user-defined file. In all cases the output is extremely verbose (including detailed information on JHOVE2 sub-processes). As JHOVE2's current documentation doesn't cover its output, making sense of the output files can be quite a challenge. JHOVE2's functional requirements suggest that the current output format is only intended as an intermediate format that can be converted to any desirable form using an XSL style sheet transformation34. The JHOVE2 Wiki contains a note that style sheets to transform JHOVE2 output to METS and PREMIS format will be included with release 2.1.0 of the software35.

On a side note, it is possible to configure JHOVE2's output. The procedure is explained in the section "Configure Unit and Displayer Properties" of the User's Guide, but it involves editing numerous Java properties settings files. This is something that may be too complicated for most general users (besides, the description of the configuration directory structure doesn't match the actual directory structure, which adds to the confusion).

8.9 Unique output identifiers

Neither the User's Guide nor any of the separate module documentation gives any information on how the identification results are reported. An inspection of some JHOVE2 output files36 first reveals some identification-related information at the start of the file:

PresumptiveFormats:
  PresumptiveFormat {FormatIdentification}:
    NativeIdentifier {I8R}:
      Namespace: PUID
      Value: fmt/95
    JHOVE2Identifier {I8R}:
      Namespace: JHOVE2
      Value: http://jhove2.org/terms/format/pdf
    IdentificationProduct {I8R}:
      Namespace: JHOVE2
      Value: http://jhove2.org/terms/reportable/org/jhove2/module/identify/DROIDIdentifier
    Confidence: Tentative

34 Link: https://bytebucket.org/jhove2/main/wiki/documents/JHOVE2-functional-requirements-v1_4.pdf , Imperative Requirement 5 (page 6)
35 See also: https://bitbucket.org/jhove2/main/wiki/StyleSheets
36 For the sake of readability all output examples here are given in plain text format.

Here we can see that JHOVE2 uses two identifiers:
• A PUID (fmt/95, which is PDF/A-1a)
• A “JHOVE2Identifier”. In the above example its value is: http://jhove2.org/terms/format/pdf

Although the JHOVE2 identifier has the format of a URL, it doesn’t point to any existing document(s) on the JHOVE2 website.
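Since the output format is undocumented, any machine processing of it is necessarily ad hoc. Purely as an illustration, the following Python sketch extracts (namespace, value) identifier pairs from a plain-text JHOVE2 output file of the shape shown above; the line layout it assumes is inferred from this single example and may not hold for other output.

# A minimal, ad-hoc sketch: scrape Namespace/Value pairs from plain-text
# JHOVE2 output. The expected line layout is inferred from the example
# above and is an assumption, not a documented format.
import re

def extract_identifiers(path):
    """Return a list of (namespace, value) tuples found in the output file."""
    pairs = []
    namespace = None
    with open(path) as f:
        for line in f:
            m = re.search(r'Namespace:\s*(\S+)', line)
            if m:
                namespace = m.group(1)
                continue
            m = re.search(r'Value:\s*(\S+)', line)
            if m and namespace is not None:
                pairs.append((namespace, m.group(1)))
                namespace = None
    return pairs

# Example (hypothetical file name):
# print(extract_identifiers('jhove2Out.txt'))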

We were initially puzzled by what seemed like a second block of identification-related information (including a list of PUIDs) further down the output file under the Module {BaseFormatModule} heading:

Module {BaseFormatModule}:
  ModuleNotFoundMessage: [ERROR/PROCESS] No module name found for Identifier http://jhove2.org/terms/format/pdf
  Format:
    Name: PostScript
    Identifier {I8R}:
      Namespace: JHOVE2
      Value: http://jhove2.org/terms/format/pdf
    AliasIdentifiers:
      AliasIdentifier {I8R}:
        Namespace: JHOVE2
        Value: http://jhove2.org/terms/format/pdf
      AliasIdentifier {I8R}:
        Namespace: PUID
        Value: fmt/14
      AliasIdentifier {I8R}:
        Namespace: PUID
        Value: fmt/15
      AliasIdentifier {I8R}:
        Namespace: PUID
        Value: fmt/16
      AliasIdentifier {I8R}:
        Namespace: PUID
        Value: fmt/17
      AliasIdentifier {I8R}:
        Namespace: PUID
        Value: fmt/18
      AliasIdentifier {I8R}:
        Namespace: PUID
        Value: fmt/19
      AliasIdentifier {I8R}:
        Namespace: PUID
        Value: fmt/20
      AliasIdentifier {I8R}:
        Namespace: PUID
        Value: fmt/95

The meaning of this information is not completely clear. Since fmt/14 to fmt/20 represent PDF versions 1.0 to 1.6, this might be some meta-information that is related to JHOVE2’s “BaseFormat” module (e.g. all PUIDs that correspond to JHOVE2 identifier “http://jhove2.org/terms/format/pdf”). However, since the User’s Guide doesn’t contain any explanation of JHOVE2’s output, users are left in the dark on this, which is not particularly helpful and rather confusing37. Also note that in the above example there is a mention of the PostScript format38.

8.10 Granularity of output
The use of PUIDs ensures that JHOVE2’s output can be mapped directly to the PRONOM registry (as well as the OPF registry).
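As a simple illustration (not part of JHOVE2), a PUID reported in the output can be turned into a link to the corresponding PRONOM record; the URL pattern below is an assumption based on the current layout of the PRONOM website.

# A minimal sketch (not part of JHOVE2): map a PUID to a PRONOM record
# URL. The URL pattern is an assumption based on the current PRONOM site.
def pronom_url(puid):
    return "http://www.nationalarchives.gov.uk/PRONOM/" + puid

print(pronom_url("fmt/95"))
# http://www.nationalarchives.gov.uk/PRONOM/fmt/95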

8.11 Accuracy of reported results
Not analysed yet.

8.12 Comprehensiveness and completeness of reported results
The reporting of the analysis results is extremely comprehensive. There are two problems here:
1. JHOVE2’s output format is undocumented. This applies to both the individual information units and the overall structure.
2. JHOVE2’s output is extremely verbose (including detailed event information at the level of individual modules).

Because of the combination of 1 and 2 above it is quite difficult to make sense of the output in its current form (which was also illustrated in the previous section). However, JHOVE2’s functional requirements suggest that the current output format is only intended as an intermediate format that can be converted to any desirable form using an XSL style sheet transformation39. The JHOVE2 Wiki contains a note that style sheets to transform JHOVE2 output to METS and PREMIS format will be included with release 2.1.0 of the software40.

8.13 Fit to needs of preservation community
The preservation community is JHOVE2’s primary target audience. Also, the JHOVE2 Project Team partners as well as JHOVE2’s main funder (the Library of Congress) are all actively involved in digital preservation. JHOVE2’s functional requirements explicitly state that “[t]o the fullest extent possible, JHOVE2 should be easily integrated into existing workflows.” [JHOVE2b]

8.14 Ability to deal with nested objects
JHOVE2 is specifically designed to deal with nested objects through its concept of source units, which are “entities that can be independently characterized” [JHOVE2a]. Nested objects are simply one class of aggregate source units. JHOVE2 currently recognises the following cases:
• A file system directory
• A ZIP or TAR file
• Directories within an archive file
• A wrapper file (e.g. a TIFF file can be a wrapper for ICC profiles or XMP metadata)

JHOVE2 can deal with any combination of the above, provided that a module exists for the container/wrapper format.

37 Update from Stephen Abrams in response to this issue: ‘The first block (under "PresumptiveFormat") documents the specific format (fmt/95, PDF/A) returned by the identification tool (i.e. DROID). The second block (under "Module") documents the full set of information JHOVE2 has been configured with regarding this format. Since we do not yet have a PDF module, we consider all of the PDF version PUIDs to be alias of PDF/A. When the PDF module becomes available later this year, we will be more precise in distinguishing the versions and associated more tightly bound aliases.’
38 This turns out to be a bug (Stephen Abrams, pers. comm.)
39 Link: https://bytebucket.org/jhove2/main/wiki/documents/JHOVE2-functional-requirements-v1_4.pdf , Imperative Requirement 5 (page 6)
40 See also: https://bitbucket.org/jhove2/main/wiki/StyleSheets

8.15 Ability to deal with composite objects
Formats such as Microsoft Word 97 and Open Document Format are based on multiple file objects that are held together by a physical container (e.g. OLE2 for Microsoft Word 97, and ZIP for Open Document Format). In principle, these could all be treated as aggregate source units in JHOVE2. In addition, JHOVE2 introduces the concept of a clump, which is “a set of logically related files within a container or file set”. This also applies to, for example, the file objects that constitute an ODF file.

In practice, JHOVE2 still identifies Microsoft Word documents as OLE2 objects; Open Document Text and EPUB files are identified as regular ZIP files. Also, one small (300 KB) EPUB file caused JHOVE2 to ’hang’ (i.e. no exception occurred, but the processing of the file went on indefinitely and could only be stopped by manually interrupting JHOVE2).

HTML represents another class of composite objects, where, for example, an individual HTML file refers to external style sheets and images, which are all needed for proper rendering. The main difference from e.g. Open Document Format or EPUB is the absence of a physical container file. In JHOVE2 terms, the HTML example is simply a clump without a container file. However, the only comparable clump type that is supported by JHOVE2 at this stage is the Shapefile format (a GIS format that is based on a set of files that are not held together by any container).

8.16 User documentation
JHOVE2 is documented in a 39-page User’s Guide in PDF format, which provides instructions on installing, configuring and using the software. The installation instructions in particular are very comprehensive. The User’s Guide doesn’t include a description (or even a mention) of JHOVE2’s output format(s) or reported properties. Properties that are related to JHOVE2’s feature extraction, validation and assessment functionalities are described in separate documents, but it would appear that the identification-specific output remains undocumented at this stage. The basic logic behind JHOVE2’s output information is explained in a separate Architectural Overview document, but this piece of documentation is aimed more at developers than at users. As a general remark, much of the currently available documentation on the JHOVE2 Wiki is very developer-oriented. Although the JHOVE2 team should be praised for taking developer documentation seriously, the currently available documentation will probably not satisfy the needs of JHOVE2’s more general user audience.

8.17 Computational performance: one file at a time

8.17.1 KB Scientific Journals data set
The ‘treeLaunch’ tool was used to get an impression of JHOVE2’s performance when it is called to process one file at a time. It was set up to recursively traverse the KB Scientific Journals data set’s directory tree and run JHOVE2 for each encountered file object. For an individual file object this results in a command line like:

jhove2 1-1-70.pdf
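The ‘treeLaunch’ tool itself is not described further in this report. Purely as an illustration of the test setup, the following Python sketch approximates what it does (this is not treeLaunch’s actual code, and the root directory is hypothetical): invoke the tool once per file object and record the elapsed time for each invocation.

# A minimal sketch of the one-file-at-a-time test setup (an approximation
# of 'treeLaunch', not its actual code; the root directory is hypothetical).
import os
import subprocess
import time

root = r"D:\aipSamplesUnpacked"  # hypothetical data set location

with open(os.devnull, "wb") as devnull:
    for dirpath, dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            start = time.time()
            # One separate JHOVE2 invocation per file object
            # (on some Windows setups running a .cmd script may
            # additionally require shell=True)
            subprocess.call(["jhove2.cmd", path], stdout=devnull)
            elapsed = time.time() - start
            print("%s\t%.3f" % (path, elapsed))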


It took about 25 hours to analyse the whole KB Scientific Journals data set in this way. Figure 8-1 and Table 8-1 summarise the main results. On average JHOVE2 needs 7.6 seconds per file object, with a maximum of almost 20 seconds. This means that for the one-file-at-a-time scenario JHOVE2 is about 50% faster than DROID 6, but compared to Fido or the Unix File utility it is still quite slow. Processing time appears to increase quite markedly with file size (Figure 8-2). One noteworthy feature of Figure 8-2 is that it shows some marked clustering around a number of narrowly-defined time intervals. These clusters can also be seen as minor ‘peaks’ in Figure 8-1, and they correspond to formats that have dedicated modules in JHOVE2 (mostly XML).

Figure 8-1 Distribution of processing time per file object for JHOVE2 one-file-at-a-time scenario, KB Scientific Journals data set.

Table 8-1 Summary performance statistics for JHOVE2 one-file-at-a-time scenario, KB Scientific Journals data set (expressed in seconds per file, except N). N = number of files; q1, median and q3 are the 1st, 2nd and 3rd quartiles, respectively.

N        min     q1      median  mean    q3      max
11,892   5.906   7.156   7.266   7.610   7.422   19.704


Figure 8-2 Scatter plot of processing time per file object versus file size for JHOVE2 one-file-at-a-time scenario, KB Scientific Journals data set. Note logarithmic scale on horizontal axis.

8.17.2 KB Large data set
Table 8-2 shows the results of an additional test that was done on the KB Large data set (again using the ‘treeLaunch’ tool). The table shows that processing time is strongly related to file size; Figure 8-3 illustrates this graphically. This is caused by the behaviour of DROID 4 (the identification tool that JHOVE2 uses ‘under the hood’): DROID 4 always scans each file object in its entirety, which can make it very slow on large files41.

41 See e.g. here: http://www.openplanetsfoundation.org/node/563 (comments section)

Table 8-2 Performance for individual files in KB Large data set

File name                         Size (bytes)     Processing time (s)
dpo_tonal_00990.tiff              36,420,900       16.485
IMAGE000060_lossless_colour.jp2   49,818,414       17.406
IMAGE000060.TIF                   114,664,502      23.86
SGD_19451955_0000002_ID371.pdf    325,101,508      66.689
DipAsset6984444754559678047.tar   534,968,320      100.799
KBDVD.iso                         683,180,032      108.753
KBDVD17062011.img                 693,993,472      101.392
DNEPABOstg.PST                    1,895,515,136    190.847

Figure 8-3 Scatter plot of processing time per file object versus file size for JHOVE2 one-file-at-a-time scenario, KB Large data set.

8.17.3 Additional tests on influence of file type and size
JHOVE2 is different from most of the other tools covered by this report in that it includes feature extraction and validation functionality for a limited number of file formats. One might expect this to influence its performance, since the amount of processing is format-dependent. For instance, one would expect that formats that have a dedicated module (e.g. TIFF) would take more time than unsupported formats, as for the former JHOVE2 also performs feature extraction and validation. In order to get an indication of the impact of this on JHOVE2’s performance, we performed an additional test on a small set of 8 files. Performance was measured for two different JHOVE2 configurations:

• JHOVE2’s standard configuration.
• A ‘minimal’ configuration, where all JHOVE2 modules apart from the “IdentifierCommand” were disabled42.

Table 8-3 shows the results.

Table 8-3 Processing time for some format-size combinations using standard and minimal JHOVE2 configuration. Formats supported by a dedicated JHOVE2 module are marked with *.

Format     Size      Processing time (s),      Processing time (s),
                     standard configuration    ‘minimal’ configuration
PDF        25 MB     10.4                      9.0
PDF        69 KB     7.2                       5.8
JPEG       69 KB     7.2                       5.8
TIFF *     68 KB     8.1                       5.8
TIFF *     110 MB    35.1                      32.1
JP2        49 MB     15.6                      14.2
ZIP *      8 MB      11.9                      7.3
MS Word    2.5 MB    7.6                       6.1

Two things are apparent from the table:
• Given a file of a particular size and format, processing time is not strongly related to whether or not the format has a dedicated JHOVE2 module.
• Performance is somewhat better for the ‘minimal’ JHOVE2 configuration, but the overall differences relative to the ‘standard’ configuration are comparatively small.

This all suggests that most of the processing time is consumed by DROID 4, and that JHOVE2’s format-specific native modules are relatively fast. Since DROID 4 is known to be relatively slow, it may be possible to boost JHOVE2’s performance significantly by using the latest version of DROID (6, see Chapter 4 of this report).

8.18 Computational performance: many files at a time
To test JHOVE2’s performance while working on a large number of objects, we performed a recursive scan of all file objects in the directory structure of the KB Scientific Journals data set and measured the time needed for this. The following command line was used:

42 This was done by commenting all other modules out in the JHOVE2 configuration file (config\spring\jhove2-framework-config.xml):
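The configuration snippet that originally followed this footnote is not reproduced here. As an indication only, an edit of this kind could look roughly like the sketch below; the surrounding property/bean structure and all bean names other than “IdentifierCommand” (which is named above) are assumptions, not the actual contents of the file.

<!-- Sketch only: command list with everything except the identifier
     module commented out. Bean names other than "IdentifierCommand"
     are hypothetical. -->
<property name="commands">
  <list>
    <ref bean="IdentifierCommand"/>
    <!--
    <ref bean="DispatcherCommand"/>
    <ref bean="DigesterCommand"/>
    <ref bean="AssessorCommand"/>
    -->
  </list>
</property>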


timeit jhove2.cmd D:\aipSamplesUnpacked -o D:\identOut\jhove2Out.txt

This gives the following result:

Elapsed Time: 0:35:42.541

So JHOVE2 needs about 35 minutes to scan 1.15 GB of data in the used test environment. For a total of 11,892 file objects this corresponds to an average processing time of about 0.18 seconds per file43.

It is important to stress that these results are not directly comparable to DROID, Fido or the Unix File tool, since JHOVE2 also performs feature extraction and validation (for a limited number of formats). Since the scope of this report is restricted to identification, we did an additional test with the ‘minimal’ configuration that was mentioned in the previous section. This produced the following result:

Elapsed Time: 0:01:19.407

However, it turned out that in this case the output only contains identification results for the top-level directory (‘D:\aipSamplesUnpacked’), which is identified as a ‘directory’. All other subdirectories and files are listed in the output file, but without any identification information. This may seem surprising at first. However, JHOVE2 treats directories (and also file sets) as formats, which have dedicated format modules44. When JHOVE2 is instructed to scan a directory, it needs its ‘directory’ format module for processing the contents of that directory. However, all format modules are invoked through the ‘dispatcher’ command, and if the dispatcher is disabled (as is the case in the ‘minimal’ configuration), JHOVE2 will not get past the level of the root directory. This is not a bug: it is simply a consequence of JHOVE2’s architecture.

8.18.1 Additional note on treatment of source units in JHOVE2
The above observations point to an important difference between JHOVE2 and most of the other tools that are addressed in this report. Basically, JHOVE2 always treats its (set of) command-line argument(s) as one single source unit (which may be either unitary or aggregate). So, when JHOVE2’s command-line argument is a directory, it will treat its root as the principal ‘parent’ unit, and all its underlying sub-directories and files as ‘child’ units. This is different from, for example, analysing a directory in DROID 6, since DROID 6 will simply consider each file (and subdirectory) in that directory as an independent unit. Similarly, consider the following command line:

jhove2 rubbish.txt rubbish.pdf

One may expect that this will simply yield output related to files ‘rubbish.pdf’ and ‘rubbish.txt’. Instead, at the highest level JHOVE2 treats this as a ‘file set’ (which is considered a format!) that contains ‘rubbish.pdf’ and ‘rubbish.txt’ as child units.

In many cases a user may simply want to process a collection of files as separate unitary source units (e.g. as in the ‘old’ JHOVE). This use case is currently not supported by JHOVE2, although it may be included in upcoming versions45.

43 On a side note, the size of the output file was about 191 MB.
44 See also page 39 (‘How JHOVE2 Identifies Source Units’) of the JHOVE2 User’s Guide.
45 Source: e-mail Stephen Abrams, 2 August 2011 (JHOVE2-TECHTALK list)

8.19 Stability

8.19.1 JHOVE2 ‘hangs up’ on EPUB/ZIP file
The analysis of one particular EPUB/ZIP file (size: 300 KB; see also section 8.15) caused JHOVE2 to ‘hang up’: the software continued running for minutes without anything happening, and had to be interrupted manually.

8.19.2 Default location for writing memory objects
Although this is not strictly a stability issue, after installing JHOVE2 we initially experienced a problem where JHOVE2 would produce the following error message:

'Cannot initialize Berkeley DB environment'

followed by a succession of Java exceptions. This turned out to be a configuration issue: by default JHOVE2 uses (under MS Windows) the root of the C:\ drive for writing temporary memory objects. At the KB the root of C:\ is configured as read-only, which means that these writes fail. This can easily be fixed by editing JHOVE2’s configuration settings46. However, using the standard temporary file directory (as defined by Windows’ TMP and/or TEMP environment variables) as a default would avoid this problem altogether (also, using the root of C:\ for writing temporary data is highly unusual, and we know of no other applications that do this).

8.19.3 JHOVE2 doesn’t clean up its temporary files
Again not a stability issue, but problematic anyway: whenever JHOVE2 encounters a container unit (e.g. a ZIP file), it extracts its contents to the Java temporary directory (tmpdir) for further analysis. However, these files are not removed afterwards, which means that over time the Java temporary directory will grow indefinitely, which may lead to a variety of problems.
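Until this is fixed in JHOVE2 itself, a periodic clean-up of the temporary directory is a possible workaround. The sketch below illustrates the idea (the directory path and the age threshold are assumptions; it should obviously not be run while JHOVE2 is still processing).

# A minimal workaround sketch (not part of JHOVE2): remove files older
# than one week from the temporary directory used by JHOVE2. The path
# and the age threshold are assumptions; adjust to your environment.
import os
import time

TMP_DIR = r"C:\Temp"      # hypothetical: the Java tmpdir in effect
MAX_AGE = 7 * 24 * 3600   # one week, in seconds

now = time.time()
for dirpath, dirnames, filenames in os.walk(TMP_DIR, topdown=False):
    for name in filenames:
        path = os.path.join(dirpath, name)
        if now - os.path.getmtime(path) > MAX_AGE:
            os.remove(path)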

8.20 Error handling and reporting
Only very high-level error messages are written to the console’s standard error device. All other error messages, warnings and informative messages are included in JHOVE2’s output files (which are written to the standard output device by default).

8.21 Provision of event information
The event information that is provided by JHOVE2 is extremely elaborate. It includes detailed information on the environment in which JHOVE2 was run, as well as detailed information on individual JHOVE2 modules. In fact, most of the information in a JHOVE2 output file is event information. The sheer amount of event information can even be a little overwhelming. This problem should go away once style sheets are available for transforming JHOVE2 output files into something more digestible (e.g. PREMIS).

8.22 Maturity and development stage
JHOVE2 is still a very young tool. So far two prototypes have been released (August 2009 and April 2010, respectively), followed by a ‘first production release’ in April 2011. This is also the version that is evaluated here. The term ‘production release’ suggests stable software that is ready to be used in an operational setting. This is somewhat misleading, as JHOVE2 in its current form does not appear to be ready for operational use yet47.

46 This is done by changing the property “envHome” in JHOVE2’s memory management configuration file (explained on pages 22/23 of the User’s Guide).
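As an indication only, such an edit could look roughly like the line below. The property name “envHome” is taken from the footnote above; the surrounding Spring syntax and the value shown are assumptions.

<!-- Sketch only: point JHOVE2's Berkeley DB environment at a writable
     directory. The value shown is an assumption. -->
<property name="envHome" value="C:\Temp\jhove2"/>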



8.23 Development activity
The development of JHOVE2 has been very active since the start of the project in 2008. Also, several third parties that are not part of the JHOVE2 project team have started developing JHOVE2 modules. An example is the NetCDF module that is being developed by the Alfred Wegener Institute for Polar and Marine Research. Maintenance and enhancement activity is funded by the three partner institutions in the JHOVE2 project team, which makes it slightly unclear at this stage how (and at what rate) further development will continue.

8.24 Existing experience
Since JHOVE2 is still a very new tool, practical experience with it is very limited.

8.25 Unidentified files
Not investigated (most likely similar to DROID).

8.26 Conclusions
The evaluation of JHOVE2 revealed a number of specific strengths and weaknesses.

JHOVE2’s main strengths appear to be:

• JHOVE2 offers an integrated approach to file characterisation that includes identification, feature extraction, validation and policy-based assessment.
• JHOVE2’s modular architecture is a particular strength. It makes it attractive for third parties to contribute to JHOVE2 by developing new modules. It also makes it possible to wrap external tools in the JHOVE2 framework.
• JHOVE2 provides extremely comprehensive output, including detailed event information. Currently this makes the JHOVE2 output rather difficult to digest, but once the style sheets for transforming JHOVE2’s intermediate output format to METS and PREMIS are in place (they will be included in the next release) this could turn into a major advantage over other existing tools.

Potential problems for use in preservation workflows are:

• JHOVE2 is shipped with a very old version of the DROID signature file, which also contains undocumented edits. It is not clear what happens if a user replaces the default signature file with a more recent one, which raises some concerns about extendibility.
• The User’s Guide doesn’t provide any information on either JHOVE2’s output format or its reported properties. For the feature extraction, validation and assessment modules this is partially covered by separate documents (which were not reviewed here), but it seems that the identification-specific output remains undocumented at this stage. This may not be a problem once style sheets to convert to METS or PREMIS are in place, but these are not included with the current release.

47 Reasons for this include the current use of an undocumented, complex intermediate output format; the lack of any style sheets to transform the output to something more usable; and the fact that some format modules (PDF, JPEG 2000) are still missing.

• Much of JHOVE2’s documentation is very developer-oriented at this stage. This may scare potential users of the software away. This could be remedied by improved and more elaborate user documentation.
• The current development stage of JHOVE2 is unclear. The evaluated version of the software is announced as a production release, but the fact that JHOVE2 is currently only able to produce output in a complex intermediate format which is also completely undocumented would by itself make the use of the software in an operational setting practically impossible. Moreover, the current lack of some format modules, style sheets for output transformations and more user-oriented documentation all suggest that the software is still in beta stage. By itself this is not a problem, but the developers should be clear about this. Presenting JHOVE2 as a production tool at this stage may confuse its potential user base, which could be counter-productive in the long run.
• Although JHOVE2’s data model is particularly suited to dealing with composite objects, on the identification side this doesn’t work out too well in practice because of the dependency on DROID 4. This means that for MS Office documents, Open Document files and EPUB files only the container format (OLE2, ZIP) is identified. A move to DROID 6 would improve this.
• The dependency on DROID 4 results in relatively poor computational performance, especially for large file objects, and when JHOVE2 is invoked for one file object at a time. Here, too, a move to DROID 6 would improve matters.
• JHOVE2 can be used for many-files-at-a-time scenarios (e.g. processing a directory tree), but in that case all objects and subfolders will be interpreted as child units of the root directory. Whether this is a problem or not depends on the context; moreover, the currently used output format (see above) makes it difficult to assess this.
• In principle it is possible to configure JHOVE2 to do identification only, and skip any additional processing. The (mainly architectural) choice to treat directory trees and file sets as formats limits these possibilities in practice, since the identification of file objects inside folders still requires the invocation of format modules.
• The tests revealed a number of issues: the default location for writing temporary memory objects, the creation of temporary files that are not cleaned up, and the Windows launcher scripts that only work from their installation directory. These are all relatively minor and (most likely) easy to solve.

As with the evaluation of FITS, it is important to stress that the results for JHOVE2 cannot be directly compared to those of the other tools in this report. The main difference is that identification is only one part of JHOVE2’s functionality: it also includes feature extraction, validation and policy-based assessment, which are all outside the scope of this evaluation. It also means that the computational performance results cannot be directly compared with those of dedicated identification tools (although JHOVE2’s performance issues appear to be caused mainly by DROID 4, with JHOVE2’s native modules adding very little overhead).

Based on the above considerations, if the scope is limited to identification only, JHOVE2 does not appear to be the most obvious choice for inclusion in the SCAPE architecture. If the scope is widened to include feature extraction, validation and/or policy-based assessment the conclusion may be very different.



9 Concluding observations and suggestions for further work
The previous chapters showed that each of the evaluated tools has its specific strengths and weaknesses. We will not repeat these here. However, there are a couple of themes and issues that are common to many of these tools, and these may point to ways in which SCAPE could contribute to the improvement of identification tools.

9.1 Performance of Java-based tools
For all the evaluated tools that are written in Java (DROID 6, FITS, JHOVE2), performance is problematic for the ‘one file at a time’ use case. This is primarily caused by the slow initialisation of Java applications. The main problem here is that a typical identification operation on a single file object is very fast (order of magnitude: milliseconds), but that this is not worth much if an application first needs several seconds to initialise. This also explains why a tool such as Fido, which is written in Python, outperforms the Java-based tools by several orders of magnitude for this use case. Even though Python is actually a ‘slower’ language than Java, the absence of any significant initialisation penalty ultimately results in much better performance. The results of the DROID and Fido evaluations also demonstrate that these performance differences disappear for the ‘many files at a time’ use case (with DROID being marginally faster). A fairly obvious solution for improving the performance of ‘one file at a time’ use cases would be to invoke the tools through their Java APIs, but this only works if the ‘calling’ workflow management system is also Java-based. The question is: is this a restriction that is acceptable to the designated user community of these tools? There appear to be two important considerations here:

1. The number of file objects per tool invocation: one invocation for each individual file versus one invocation for multiple files.
2. The interface through which the tool is invoked: command-line interface versus Java API.

Table 9-1 below gives a schematic overview of the implications of the above for performance. The combination of ‘one invocation for each individual file’ and invocation through the command-line interface is particularly problematic. Should the majority of (potential) users want to deploy these tools in this way, then this would imply a mismatch between the tools and the requirements of their user base. Whether this is actually the case is a completely different matter, but it is a consideration that has major implications for operational workflows.

Table 9-1 Performance of Java-based tools as a function of number of file objects per invocation and invocation method (‘-’ indicates poor performance, ‘+’ indicates good performance)

                          One invocation - one file    One invocation - multiple files
Command line interface    -                            +
Java API                  +                            +
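The initialisation penalty is easy to demonstrate empirically. The sketch below (Python; the jar name, invocation syntax and test files are hypothetical, and the exact command line differs per tool) times one invocation per file against a single invocation for all files; with a Java-based tool the first variant pays the JVM start-up cost once for every file.

# A minimal timing sketch (tool jar and file names are hypothetical):
# compare per-file invocation with a single batch invocation.
import subprocess
import time

files = ["a.pdf", "b.xml", "c.tif"]  # hypothetical test files

# One invocation per file: the JVM start-up cost is paid for every file.
start = time.time()
for f in files:
    subprocess.call(["java", "-jar", "sometool.jar", f])
print("one invocation per file: %.1f s" % (time.time() - start))

# One invocation for all files: the start-up cost is paid only once.
start = time.time()
subprocess.call(["java", "-jar", "sometool.jar"] + files)
print("one invocation, all files: %.1f s" % (time.time() - start))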

9.2 Identification of text-based formats
Although the current analysis doesn’t directly address identification accuracy, a recurring observation that applies to all of the investigated tools is that the identification of text-based formats (including XML) is problematic. This is largely a limitation of signature-based identification, which works well for most binary formats but is less suited to text-based formats. The identification of such formats could be improved significantly by using alternative identification methods that do not rely on signatures. Some examples are:
• The identification of XML could be improved using the standard XML parsers that are available for all modern programming languages; a minimal sketch is given below. Some ideas and suggestions are given in Blekinge (2011) and van der Knijff (2011).
• Some text-based formats can be identified by the presence of specific keywords. This applies to (but is not limited to) nearly all programming and scripting languages. The results of the current analysis show that text files that contain source code or scripts often remain unidentified. A possible solution may be to identify such files using statistical techniques such as Bayesian spam filtering48.
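A minimal sketch of the parser-based approach to XML identification follows (this is an illustration of the general idea, not the method of the cited experiments; the file name is hypothetical). A file that parses without errors is well-formed XML; everything else is rejected.

# A minimal sketch of parser-based XML identification: try to parse the
# file and treat any parse error as "not XML".
import xml.sax

def is_well_formed_xml(path):
    """Return True if the file parses as well-formed XML."""
    try:
        xml.sax.parse(path, xml.sax.ContentHandler())
        return True
    except (xml.sax.SAXException, ValueError, IOError):
        return False

print(is_well_formed_xml("unknown.file"))  # hypothetical file name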

Both methods can be implemented using readily available software libraries, and the resulting tool(s) could be complementary to signature-based tools.

9.3 Extensions to Unix File?
An earlier study by Underwood (2009) discussed several extensions to the Unix File utility that would make this tool better suited to use in digital archives. These are mostly related to the creation and management of file signatures. Since the Unix File utility is a mature tool that is widely used, supports a wide range of file formats and has outstanding computational performance, addressing the remaining problem areas could be an interesting option for improving and streamlining signature-based identification in archival settings.

48 See for instance the discussion here: http://stackoverflow.com/questions/475033/detecting-programming-language-from-a-snippet


Acknowledgements
We would like to thank everyone who has provided feedback on earlier drafts of this document. The input from the following people has been particularly helpful: Andrew Fetherston (The National Archives), Maurice de Rooij (National Archives of the Netherlands; Open Planets Foundation), Andrea Goethals, Spencer McEwen (Harvard University Library), Stephen Abrams (California Digital Library), Sheila Morrissey (Portico), Miguel Ferreira (KEEP Solutions), René Voorburg, Judith Rog, Bart Kiers, Barbara Sierman (KB/National Library of the Netherlands), Bram Lohman (Tessella).



References

Blekinge, A. A new direction in file characterisation. Blog, Open Planets Foundation, 2011. Link: http://www.openplanetsfoundation.org/blogs/2011-02-17-new-direction-file-characterisation (accessed 20 September 2011).

DROID 6 Help. The National Archives, 2011.

FITS User Guide. Link: http://code.google.com/p/fits/wiki/user_guide (accessed 20 September 2011).

JHOVE2 User’s Guide. JHOVE2 Project Team, 2011.

JHOVE2 Functional Requirements 1.4. JHOVE2 Project Team, 2009.

Underwood, W. Extensions of the UNIX File Command and Magic File for File Type Identification. Georgia Tech Research Institute, 2009.

Van der Knijff, J., Blekinge, A. & Schlarb, S. WP 9: evaluation framework for characterisation tools. Internal report, SCAPE project, 2011a.

Van der Knijff, J., Blekinge, A. & Schlarb, S. WP 9: target characterisation tools. Internal report, SCAPE project, 2011b.

Van der Knijff, J. Improved identification of XML: a Python experiment. Blog, Open Planets Foundation, 2011. Link: http://www.openplanetsfoundation.org/blogs/2011-07-11-improved-identification-xml-python-experiment (accessed 20 September 2011).
