CODA Slutrapport


CODA: Curation of Digital Assets
Final Report (Slutrapport) 2007

The copyright holders at LDB-centrum are behind the production of this publication. Provided that you give advance notice of how the material will be used and obtain consent from the copyright holders, you may be granted permission to reproduce and distribute the content, in whole or in part, for non-commercial purposes. In any such handling and use, the source "© LDB-centrum 2008" must always be cited. Otherwise, current copyright law applies: "Reproduction of the contents of this publication, in whole or in part, is prohibited under copyright law without the consent of the copyright holders. The prohibition applies to every form of reproduction, whether by printing, copying, tape recording, transfer to electronic media, etc."

© LDB-centrum 2008

Abstract

Format Manager (FORM)

The report CODA FORM discusses three areas:
• Criteria for archive formats
• Evaluation of format registers
• Test of software for identification of logical formats

Criteria for archive formats have been created: a total of 22 points to check when estimating a format's suitability for long-term digital preservation. The criteria are ranked as requirement, important, or advantage; the requirement criteria are a minimum and must be met for a good archive format.

Five format registers were evaluated. The register that best fulfilled the examined points was PRONOM.

Four programs for identifying files were tested. After testing 25 different types of files, with varying results, the best programs turned out to be DROID and JHOVE. DROID achieved the best result, with an accuracy rate of 60%. However, none of the programs operates perfectly.
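The ranking rule described above (every requirement criterion must be met, while important and advantage criteria only add merit) can be sketched in a few lines of Python. This is a minimal illustration, not the report's actual checklist: the criterion names, the `Criterion` class, and the `assess` function are all hypothetical, since the 22 criteria themselves are not listed in the abstract.

```python
from dataclasses import dataclass
from enum import Enum


class Rank(Enum):
    REQUIREMENT = "requirement"  # minimum: must be met
    IMPORTANT = "important"
    ADVANTAGE = "advantage"


@dataclass(frozen=True)
class Criterion:
    name: str  # hypothetical label, not taken from the report
    rank: Rank


def assess(criteria, fulfilled):
    """Return (suitable, counts) for one format.

    suitable is True only when every REQUIREMENT criterion is fulfilled;
    counts records how many criteria of each rank were met.
    """
    counts = {rank: 0 for rank in Rank}
    suitable = True
    for criterion in criteria:
        if criterion.name in fulfilled:
            counts[criterion.rank] += 1
        elif criterion.rank is Rank.REQUIREMENT:
            suitable = False
    return suitable, counts


# Hypothetical criteria, for illustration only.
CRITERIA = [
    Criterion("openly documented", Rank.REQUIREMENT),
    Criterion("free of usage restrictions", Rank.REQUIREMENT),
    Criterion("widely adopted", Rank.IMPORTANT),
    Criterion("self-describing metadata", Rank.ADVANTAGE),
]

# A format that misses one requirement is rejected regardless of other merits.
ok, met = assess(CRITERIA, {"openly documented", "widely adopted"})
print(ok, met[Rank.IMPORTANT])
```

Treating the requirement rank as a hard gate rather than a weighted score mirrors the abstract's wording that these criteria "must be met"; how the remaining ranks should be weighed against each other is left open here.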
Processes of Preservation (POP)

The aim of the CODA-POP project was to produce guidelines in four areas:
• Criteria for when it is time to refresh digital files
• Selection criteria for data carriers
• Criteria for when to transcode (convert) to new logical file formats
• Criteria for when system migration is required

The results were as follows.

A list of nine criteria for when to refresh files: three concerning preservation and six concerning efficiency improvement. The report also contains some suggestions for further work in this area.

Three documents intended to improve and simplify the choice of data carriers: one to simplify and improve the selection process, one that defines the "total cost of ownership" of data carriers, and a table for evaluating and comparing various data carriers.

A list of criteria for when to convert files into new file formats, together with a description of the process for deciding when to initiate this work and a table to fill in when checking whether a file format in use is at risk.

For the fourth area, a list of nine criteria that serve as warning signals that system migration is required. We also argue for a written action plan describing how critical each criterion is and how to act when any of the warning signals occurs.

Summary (Sammanfattning)

Format Manager (FORM)

The CODA FORM report covers three points:
• Criteria for archive formats
• Examination of format registers
• Test of software for format identification

Criteria for archive formats have been produced: 22 points in total for assessing a format's suitability for long-term digital preservation. To make it easier to check a format, the criteria have been ranked as requirement, important, or advantage. The requirement criteria are a minimum and must be met for a good archive format. Five format registers were examined against eight points.
The format register that best met the examined points was PRONOM. Next, four programs that identify files were tested. Tested against 25 files, with varying results, the best programs were DROID and JHOVE. DROID gave the best result, with 60% correct. However, none of the programs is flawless.

Processes of Preservation (POP)

The purpose of the CODA-POP project was to provide guidelines in four areas:
• Criteria for when refreshing (recopying) should take place
• Criteria for choosing data carriers
• Criteria for when transcoding/conversion should take place
• Criteria for when a system change should take place

The results were as follows.

A list of criteria for when refreshing should take place, divided into three preservation criteria and six efficiency criteria, with suggestions for how organizations can continue working on this question.

For the choice of data carriers, three documents were created to simplify and improve the work: a description of the steps to take when choosing a data carrier, a list of the components that should be included in a data carrier's lifetime cost, and a table to serve as a basis for evaluating and comparing different data carriers.

A list of criteria that can be seen as warning signals that it is time to start conversion. In addition, the process for deciding on the timing is described, and a table was made as a basis for checking whether the file formats in use are at risk of deteriorating or, in the worst case, becoming unreadable.

The work on the fourth point resulted in a list of nine criteria that indicate the need for a system change and that can also be used as warning signals. It is proposed that organizations describe, in a written action plan, how critical each situation is and how the responsible operator should act when any of the points occurs.
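The written action plan proposed above can be pictured as a lookup from each warning signal to its severity and the operator's response. The following Python sketch is only an illustration under stated assumptions: the signal names, the 1 to 3 severity scale, and the `PlanEntry` and `triage` names are hypothetical, because the report's nine criteria are not reproduced in this summary.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlanEntry:
    severity: int  # 1 = low, 3 = critical (the scale is an assumption)
    action: str    # what the responsible operator should do


# Hypothetical warning signals and responses, for illustration only.
ACTION_PLAN = {
    "vendor support ended": PlanEntry(3, "start migration planning"),
    "spare parts unavailable": PlanEntry(2, "source alternatives"),
    "performance degraded": PlanEntry(1, "monitor and review"),
}


def triage(observed):
    """Return (signal, entry) pairs for the observed signals, most critical first.

    A signal missing from the plan raises KeyError, so gaps in the written
    plan are noticed rather than silently ignored.
    """
    entries = [(signal, ACTION_PLAN[signal]) for signal in observed]
    return sorted(entries, key=lambda pair: pair[1].severity, reverse=True)


for signal, entry in triage(["performance degraded", "vendor support ended"]):
    print(f"[{entry.severity}] {signal}: {entry.action}")
```

Keeping the plan as plain data means it can be reviewed and revised by the organization without touching the lookup logic.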
Contents

Abstract
  Format Manager (FORM)
  Processes of Preservation (POP)
Summary (Sammanfattning)
  Format Manager (FORM)
  Processes of Preservation (POP)
1. Background
2. CODA 2007
3. The CODA-FORM project
  3.1 Purpose
  3.2 Target audience
  3.3 Outline
  3.4 Working group
4. Background - Kungl. biblioteket
  4.1 Problem area
  4.2 The project
5. Criteria for archive formats
  5.1 General criteria
  5.2 Ranked criteria
  5.3 Assessment of formats
6. Format registers
  6.1 Uppsala format register
  6.2 PRONOM
  6.3 Global Digital Format Registry (GDFR)
  6.4 Library of Congress
  6.5 The File Extension Source (FILExt)
  6.6 Analysis