Williams College Archives and Special Collections File Format Recommendations

Total Page:16

File Type:pdf, Size:1020Kb

Williams College Archives and Special Collections File Format Recommendations Williams College Archives and Special Collections File Format Recommendations Based on current digital data curation best practice and recommendations, the Williams College Archives has set the following confidence levels for the longevity of commonly used file formats. Record creators are encouraged to take these recommendations into consideration when determining what file formats to save records as during their active lifecycle. The longer the active lifecycle is anticipated to be, the greater the need to use formats ranked as medium or high confidence. Media type High Confidence Level Medium Confidence Low Confidence Level Level Text - Plain text (encoding: USASCII, - Cascading Style Sheets - PDF (.pdf) (encrypted) UTF-8, UTF-16 with (.css) - Microsoft Word (.doc) BOM) - DTD (.dtd) - WordPerfect (.wpd) - XML (includes XSD/XSL/ - Plain text (ISO 8859-1 - DVI (.dvi) XHTML, etc.; with included or encoding) - All other text forMats accessible schema and - PDF (.pdf) (eMbedded not listed here character encoding explicitly fonts) specified) - Rich Text ForMat 1.x (.rtf) - PDF/A-1 (ISO 19005-1) - HTML (include a DOCTYPE (.pdf) declaration) - SGML (.sgMl) - Open Office (.sxw/.odt) - OOXML (ISO/IEC DIS 29500) (.docx) Raster Images - TIFF (uncoMpressed) - BMP (.bMp) - MrSID (.sid) - JPEG2000 (lossless) (.jp2) - JPEG/JFIF (.jpg) - TIFF (in Planar forMat) -PNG - JPEG2000 (lossy) (.jp2) - FlashPix (.fpx) - TIFF (coMpressed) - PhotoShop (.psd) - GIF (.gif) - RAW - Digital Negative DNG - JPEG 2000 Part 2 (*.jpf, (.dng) .jpx) - PNG (.png) - All other raster image formats not listed here Vector - SVG (no Java script binding) - CoMputer Graphic - Encapsulated Postscript Graphics (.svg) Metafile (EPS) (CGM, WebCGM) (.cgM) - MacroMedia Flash (.swf) - All other vector image formats not listed here Audio - AIFF (PCM) (.aif, .aiff) - SUN Audio - AIFC (coMpressed) - WAV (PCM) (.wav) (uncoMpressed) (.aifc) (.au) - NeXT SND (.snd) - Standard MIDI (.Mid, .midi) - RealNetworks 'Real - Ogg Vorbis (.ogg) Audio' (.ra, rm, - Free Lossless Audio Codec .raM) (.flac) - Windows Media Audio - Advance Audio Coding (.wMa) (.Mp4, .M4a, .aac) - Protected AAC (.m4p) - MP3 (MPEG-1/2, Layer 3) - WAV (coMpressed) (.mp3) (.wav) - All other audio forMats not listed here Video - Motion JPEG 2000 (ISO/IEC - Ogg Theora (.ogg) - AVI (others) (.avi) 15444-4) (.mj2) - MPEG-1, MPEG-2 (.mpg, - QuickTime Movie - AVI (uncoMpressed, motion .mpeg, wrapped in AVI, (others) (.mov) JPEG) (.avi) MOV) - RealNetworks 'Real - QuickTime Movie - MPEG-4 (H.263, H.264) Video' (.rv) (uncoMpressed, Motion JPEG) (.mp4, wrapped in AVI, - Windows Media Video (.mov) MOV) (.wmv) - All other video forMats not listed here Spreadsheet/ - CoMMa Separated Values - DBF (.dbf) - Excel (.xls) Database (.csv) - OpenOffice (.sxc/.ods) - All other spreadsheet/ - Delimited Text (.txt) - OOXML (ISO/IEC DIS database forMats not - SQL DDL 29500) listed here (.xlsx) Virtual Reality - X3D (.x3d) - VRML (.wrl, .vrMl) - All other virtual reality - U3D (Universal 3D file formats not listed here format) Computer - CoMputer prograM source - CoMpiled / Executable Programs code files (EXE, .class, COM, (.c, .c++, .java, .js, .jsp, DLL, BIN, DRV, OVL, .php, .pl, etc.) SYS, PIF) Presentation - OpenOffice (.sxi/.odp) - PowerPoint (.ppt) Files - OOXML (ISO/IEC DIS - All other presentation 29500) formats not listed here (.pptx) Williams College Archives and Special Collections Media Type Preservation Plan Once records have become inactive and are transferred to the Williams College Archives, the Archives’ primary preservation strategy is to normalize files to preservation and access formats upon ingest. The choice of access formats is based on the ubiquity of viewers for the file format. All preservation formats are open standards. Additionally, the choice of preservation format is based on community best practices, availability of open-source normalization tools, and an analysis of the significant characteristics of each media type. The College Archives also maintains the original format of all ingested files to support future migration and emulation preservation strategies as needed. The normalization chart below is based on international best practice standards and the default plan for Archivematica. The Archives monitors standard format registries and technological advances and adjusts the plan as needed. Preservation Access Normalization Media type Original File formats format(s) format(s) tool Audio AC3, AIFF, MP3, WAV, WMA WAVE (LPCM) MP3 FFmpeg Portable DocuMent PDF PDF/A PDF or PDF/A Ghostscript ForMat Presentation PPT ODF PDF OpenOffice files BMP, GIF, JPG, JP2, PNG, PSD*, UncoMpressed Images JPEG ImageMagick TIFF, TGA TIFF Raw caMera DigiKaM DNG NEF DNG JPEG files Converter Spreadsheet/ Original XLS ODF Unoconv/OpenOffice Database format Original Plain text TXT Original forMat None format Vector iMages AI*, EPS*, SVG* SVG PDF Inkscape AVI, FLV, MOV, MPEG-1, MPEG- Video MPEG-2 MPG FFmpeg 2, MPEG-4*, SWF, WMV Word processing DOC, WPD, RTF ODF PDF OpenOffice files .
Recommended publications
  • Architecture of the World Wide Web, First Edition Editor's Draft 14 October 2004
    Architecture of the World Wide Web, First Edition Editor's Draft 14 October 2004 This version: http://www.w3.org/2001/tag/2004/webarch-20041014/ Latest editor's draft: http://www.w3.org/2001/tag/webarch/ Previous version: http://www.w3.org/2001/tag/2004/webarch-20040928/ Latest TR version: http://www.w3.org/TR/webarch/ Editors: Ian Jacobs, W3C Norman Walsh, Sun Microsystems, Inc. Authors: See acknowledgments (§8, pg. 42). Copyright © 2002-2004 W3C ® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply. Your interactions with this site are in accordance with our public and Member privacy statements. Abstract The World Wide Web is an information space of interrelated resources. This information space is the basis of, and is shared by, a number of information systems. In each of these systems, people and software retrieve, create, display, analyze, relate, and reason about resources. The World Wide Web uses relatively simple technologies with sufficient scalability, efficiency and utility that they have resulted in a remarkable information space of interrelated resources, growing across languages, cultures, and media. In an effort to preserve these properties of the information space as the technologies evolve, this architecture document discusses the core design components of the Web. They are identification of resources, representation of resource state, and the protocols that support the interaction between agents and resources in the space. We relate core design components, constraints, and good practices to the principles and properties they support. Status of this document This section describes the status of this document at the time of its publication.
    [Show full text]
  • X3DOM – Declarative (X)3D in HTML5
    X3DOM – Declarative (X)3D in HTML5 Introduction and Tutorial Yvonne Jung Fraunhofer IGD Darmstadt, Germany [email protected] www.igd.fraunhofer.de/vcst © Fraunhofer IGD 3D Information inside the Web n Websites (have) become Web applications n Increasing interest in 3D for n Product presentation n Visualization of abstract information (e.g. time lines) n Enriching experience of Cultural Heritage data n Enhancing user experience with more Example Coform3D: line-up of sophisticated visualizations scanned historic 3D objects n Today: Adobe Flash-based site with videos n Tomorrow: Immersive 3D inside browsers © Fraunhofer IGD OpenGL and GLSL in the Web: WebGL n JavaScript Binding for OpenGL ES 2.0 in Web Browser n à Firefox, Chrome, Safari, Opera n Only GLSL shader based, no fixed function pipeline mehr n No variables from GL state n No Matrix stack, etc. n HTML5 <canvas> element provides 3D rendering context n gl = canvas.getContext(’webgl’); n API calls via GL object n X3D via X3DOM framework n http://www.x3dom.org © Fraunhofer IGD X3DOM – Declarative (X)3D in HTML5 n Allows utilizing well-known JavaScript and DOM infrastructure for 3D n Brings together both n declarative content design as known from web design n “old-school” imperative approaches known from game engine development <html> <body> <h1>Hello X3DOM World</h1> <x3d> <scene> <shape> <box></box> </shape> </scene> </x3d> </body> </html> © Fraunhofer IGD X3DOM – Declarative (X)3D in HTML5 • X3DOM := X3D + DOM • DOM-based integration framework for declarative 3D graphics
    [Show full text]
  • Marantz Guide to Pc Audio
    White paper MARANTZ GUIDE TO PCAUDIO Contents: Introduction • Introduction As you know, in recent years the way to listen to music has changed. There has been a progression from the use of physical • Digital Connections media to a more digital approach, allowing access to unlimited digital entertainment content via the internet or from the library • Audio Formats and TAGs stored on a computer. It can be iTunes, Windows Media Player or streaming music or watching YouTube and many more. The com- • System requirements puter is a centre piece to all this entertainment. • System Setup for PC and MAC The computer is just a simple player and in a standard setup the performance is just average or even less. • Tips and Tricks But there is also a way to lift the experience to a complete new level of enjoyment, making the computer a good player, by giving the • High Resolution audio download responsibility for the audio to an external component, for example a “USB-DAC”. A DAC is a Digital to Analogue Converter and the USB • Audio transmission modes terminal is connected to the USB output of the computer. Doing so we won’t be only able to enjoy the above mentioned standard audio, but gain access to high resolution audio too, exceeding the CD quality of 16-bit / 44.1kHz. It is possible to enjoy studio master quality as 24-bit/192kHz recordings or even the SACD format DSD with a bitstream at 2.8MHz and even 5.6MHz. However to reach the above, some equipment is needed which needs to be set up and adjusted.
    [Show full text]
  • TR 102 199 V1.1.1 (2003-10) Technical Report
    ETSI TR 102 199 V1.1.1 (2003-10) Technical Report Services and Protocols for Advanced Networks (SPAN); Preliminary analysis of Broadband multimedia services 2 ETSI TR 102 199 V1.1.1 (2003-10) Reference DTR/SPAN-130320 Keywords broadband, multimedia, service ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N° 348 623 562 00017 - NAF 742 C Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88 Important notice Individual copies of the present document can be downloaded from: http://www.etsi.org The present document may be made available in more than one electronic version or in print. In any case of existing or perceived difference in contents between such versions, the reference version is the Portable Document Format (PDF). In case of dispute, the reference shall be the printing on ETSI printers of the PDF version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://portal.etsi.org/tb/status/status.asp If you find errors in the present document, send your comment to: [email protected] Copyright Notification No part may be reproduced except as authorized by written permission. The copyright and the foregoing restriction extend to reproduction in all media. © European Telecommunications Standards Institute 2003. All rights reserved.
    [Show full text]
  • Scene Graph Adapter
    Scene Graph Adapter: An efficient Architecture to Improve Interoperability between 3D Formats and 3D Application Engines Rozenn Bouville Berthelot, Jérôme Royan, Thierry Duval, Bruno Arnaldi To cite this version: Rozenn Bouville Berthelot, Jérôme Royan, Thierry Duval, Bruno Arnaldi. Scene Graph Adapter: An efficient Architecture to Improve Interoperability between 3D Formats and 3D Application Engines. Web3D 2011 (16th International Conference on 3D Web technology), Jun 2011, Paris, France. pp.21- 30. inria-00586161 HAL Id: inria-00586161 https://hal.inria.fr/inria-00586161 Submitted on 6 Apr 2014 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Scene Graph Adapter: An efficient Architecture to Improve Interoperability between 3D Formats and 3D Applications Engines Rozenn Bouville Berthelot∗ Jérôme Royan† Thierry Duval‡ Bruno Arnaldi§ Orange Labs and IRISA, Rennes, France Orange Labs France IRISA, Rennes, France IRISA, Rennes, France Figure 1: Our architecture allows the loading of any 3D graphics format simultaneously in any available rendering engine. The scene graph adapter is an interface that adapts a scene graph (SG) of a given format into a renderer scene graph and which also allows the rendering part to request this scene graph.
    [Show full text]
  • Supplement 211: Dicomweb Support for the Application/Zip Payload
    5 Digital Imaging and Communications in Medicine (DICOM) Supplement 211: 10 DICOMweb Support for the application/zip Payload 15 20 Prepared by: Bill Wallace, Brad Genereaux DICOM Standards Committee, Working Group 27 1300 N. 17th Street Rosslyn, Virginia 22209 USA 25 Developed in accordance with work item WI 2018 -09 -C VERSION: 19 January 16, 2020 Table of Contents Scope and Field of Application ........................................................................................................................................ iii 30 Open Questions ....................................................................................................................................................... iii Closed Questions .................................................................................................................................................... iiii 8.6.1.3.1 File Extensions ................................................................................................................................. viv 8.6.1.3.2 BulkData URI ................................................................................................................................... viv 8.6.1.3.3 Logical Format ........................................................................................................................................ viv 35 8.6.1.3.4 Metadata Representations ...................................................................................................................... viv Scope and Field of Application
    [Show full text]
  • Blackberry QNX Multimedia Suite
    PRODUCT BRIEF QNX Multimedia Suite The QNX Multimedia Suite is a comprehensive collection of media technology that has evolved over the years to keep pace with the latest media requirements of current-day embedded systems. Proven in tens of millions of automotive infotainment head units, the suite enables media-rich, high-quality playback, encoding and streaming of audio and video content. The multimedia suite comprises a modular, highly-scalable architecture that enables building high value, customized solutions that range from simple media players to networked systems in the car. The suite is optimized to leverage system-on-chip (SoC) video acceleration, in addition to supporting OpenMAX AL, an industry open standard API for application-level access to a device’s audio, video and imaging capabilities. Overview Consumer’s demand for multimedia has fueled an anywhere- o QNX SDK for Smartphone Connectivity (with support for Apple anytime paradigm, making multimedia ubiquitous in embedded CarPlay and Android Auto) systems. More and more embedded applications have require- o Qt distributions for QNX SDP 7 ments for audio, video and communication processing capabilities. For example, an infotainment system’s media player enables o QNX CAR Platform for Infotainment playback of content, stored either on-board or accessed from an • Support for a variety of external media stores external drive, mobile device or streamed over IP via a browser. Increasingly, these systems also have streaming requirements for Features at a Glance distributing content across a network, for instance from a head Multimedia Playback unit to the digital instrument cluster or rear seat entertainment units. Multimedia is also becoming pervasive in other markets, • Software-based audio CODECs such as medical, industrial, and whitegoods where user interfaces • Hardware accelerated video CODECs are increasingly providing users with a rich media experience.
    [Show full text]
  • SA1OPS English User Manual
    Register your product and get support at www.philips.com/welcome SA1OPS08 SA1OPS16 SA1OPS32 EN User manual Select files and playlists for manual Contents sync 15 Copy files from GoGear Opus to your computer 16 English 1 Important safety information 3 WMP11 playlists 16 General maintenance 3 Create a regular playlist 16 Recycling the product 4 Create an auto playlist 16 Edit playlist 17 2 Your new GoGear Opus 6 Transfer playlists to GoGear Opus 17 What’s in the box 6 Search for music or pictures with WMP11 17 Delete files and playlists from WMP11 3 Getting started 7 library 17 Overview of the controls and Delete files and playlists from GoGear connections 7 Opus 18 Overview of the main menu 7 Edit song information with WMP11 18 Install software 8 Format GoGear Opus with WMP11 19 Connect and charge 8 Connect GoGear Opus to a computer 8 6 Music 20 Battery level indication 8 Listen to music 20 Battery level indication 9 Find your music 20 Disconnect GoGear Opus safely 9 Delete music tracks 20 Turn GoGear Opus on and off 9 Automatic standby and shut-down 9 7 Audiobooks 21 Add audiobooks to GoGear Opus 21 4 Use GoGear Opus to carry files 10 Audiobook controls 21 Select audiobook by book title 21 Adjust audiobook play speed 22 5 Windows Media Player 11 Add a bookmark in an audiobook 22 (WMP11) 11 Find a bookmark in an audiobook 22 Install Windows Media Player 11 Delete a bookmark in an audiobook 22 (WMP11) 11 Transfer music and picture files to WMP11 library 11 8 Video 23 Switch between music and pictures Download, convert and transfer library
    [Show full text]
  • Fast and Scalable Pattern Mining for Media-Type Focused Crawling
    Provided by the author(s) and NUI Galway in accordance with publisher policies. Please cite the published version when available. Title Fast and Scalable Pattern Mining for Media-Type Focused Crawling Author(s) Umbrich, Jürgen; Karnstedt, Marcel; Harth, Andreas Publication Date 2009 Jürgen Umbrich, Marcel Karnstedt, Andreas Harth "Fast and Publication Scalable Pattern Mining for Media-Type Focused Crawling", Information KDML 2009: Knowledge Discovery, Data Mining, and Machine Learning, in conjunction with LWA 2009, 2009. Item record http://hdl.handle.net/10379/1121 Downloaded 2021-09-27T17:53:57Z Some rights reserved. For more information, please see the item record link above. Fast and Scalable Pattern Mining for Media-Type Focused Crawling∗ [experience paper] Jurgen¨ Umbrich and Marcel Karnstedt and Andreas Harthy Digital Enterprise Research Institute (DERI) National University of Ireland, Galway, Ireland fi[email protected] Abstract 1999]) wants to infer the topic of a target page before de- voting bandwidth to download it. Further, a page’s content Search engines targeting content other than hy- may be hidden in images. pertext documents require a crawler that discov- ers resources identifying files of certain media types. Na¨ıve crawling approaches do not guaran- A crawler for media type targeted search engines is fo- tee a sufficient supply of new URIs (Uniform Re- cused on the document formats (such as audio and video) source Identifiers) to visit; effective and scalable instead of the topic covered by the documents. For a scal- mechanisms for discovering and crawling tar- able media type focused crawler it is absolutely essential geted resources are needed.
    [Show full text]
  • Tetra4d Converter 2020 Release Notes 1 Table of Contents
    Tetra4D Converter Version 2020 Release Notes Details of new features, updated formats support and bug fixes for Tetra4D Converter Tetra4D Converter 2020 Release Notes 1 Table of Contents Version 2020 ................................................................................................................................................. 3 Definition of Release Types ....................................................................................................................... 3 Version Information .................................................................................................................................. 3 Installation ................................................................................................................................................ 3 Language Support Overview ...................................................................................................................... 3 Acrobat Pro Compatibility ........................................................................................................................ 4 System Requirements ................................................................................................................................... 4 Licensing........................................................................................................................................................ 5 Message for Tetra4D Converter existing customers ................................................................................ 5 New
    [Show full text]
  • Agisoft Photoscan User Manual Standard Edition, Version 1.3 Agisoft Photoscan User Manual: Standard Edition, Version 1.3
    Agisoft PhotoScan User Manual Standard Edition, Version 1.3 Agisoft PhotoScan User Manual: Standard Edition, Version 1.3 Publication date 2017 Copyright © 2017 Agisoft LLC Table of Contents Overview ......................................................................................................................... iv How it works ............................................................................................................ iv About the manual ...................................................................................................... iv 1. Installation and Activation ................................................................................................ 1 System requirements ................................................................................................... 1 GPU acceleration ........................................................................................................ 1 Installation procedure .................................................................................................. 2 Restrictions of the Demo mode ..................................................................................... 2 Activation procedure ................................................................................................... 3 2. Capturing photos ............................................................................................................ 4 Equipment ................................................................................................................
    [Show full text]
  • (A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]