File Formatsformats

Total Page:16

File Type:pdf, Size:1020Kb

File Formatsformats FILEFILE FORMATSFORMATS Summary records created or formatted electronically are Rapid changes in technology mean that file formats covered under the act. can become obsolete quickly and cause problems for Proprietary, Non-proprietary, Open your records management strategy. A long-term view Standard and Open Source File Formats and careful planning can overcome this risk and ◆ ensure that you can meet your legal and operational Proprietary formats. Proprietary file formats are requirements. controlled and supported by just one software developer. Microsoft Word (.DOC) format is an Legally, your records must be trustworthy, complete, example. accessible, admissible in court, and durable for as ◆ Non-proprietary formats. These formats are long as your approved records retention schedules supported by more than one developer and can require. For example, you can convert a record to be accessed with different software systems. For another, more durable format (e.g., from a nearly example, eXtensible Markup Language (XML) is obsolete software program to a text file). That copy, becoming an increasingly popular non-proprietary as long as it is created in a trustworthy manner, is format. legally acceptable. ◆ Open Source formats. In general, open source The software in which a file is created usually has a refers to any program whose source code is made default format, often indicated by a file name suffix available for use or modification as users or other (e.g., *.PDF for portable document format). Most developers see fit. Open source software may be software allows authors to select from a variety of developed, modified and distributed by formats when they save a file (e.g., document independent software companies for profit. The [DOC], Rich Text Format [RTF], text [TXT]). Some Linux operating system is an example. software, such as Adobe Acrobat, is designed to ◆ convert files from one format to another. Open Standard formats. Open standard software formats are created using publicly available Legal Framework specifications. Although software source codes remain proprietary, the availability of the For more information on the legal framework you standard increases compatibility by allowing must consider when selecting digital file formats, other developers to create hardware and software refer to the chapter Records Management in an solutions that interact with, or substitute for, Electronic Environment in the Electronic Records other software. The Portable Document Format Management Guidelines and Appendix A6 of the (.PDF) is based on an open standard. Trustworthy Information Systems Handbook. Also review the requirements of the: File Format Types ◆ Public Records Act [PRA] (Code of Laws of South There are hundreds of file formats used to encode Carolina, 1976, Section 30-1-10 through digital information. Below are brief descriptions of 30-1-140, as amended) available at the basic files you are likely to encounter. Use the www.scstatehouse.org/code/t30c001.htm, which resources in the Annotated List of Resources for supports government accountability by mandating more detailed information on specific file formats. the use of retention schedules to manage records Basic file format types include: of South Carolina public entities. This law governs the management of all records created by agencies or entities supported in whole or in part MORE ➔ by public funds in South Carolina. Section 30-1-70 establishes your responsibility to protect South Carolina Department of Archives & History the records you create and to make them www.state.sc.us/scdah/erg/erg.htm available for easy use. The act does not January 2005 Version 1 — FF discriminate between media types. Therefore, Page 1 ◆ Text files. Text files are most often created in are widely usable in many different software word processing software programs. Common file programs. TIFF files are either uncompressed formats for text files include: or compressed using a lossless algorithm — Proprietary formats, such as Microsoft Word – Graphics Interchange Format (GIF) files, files and WordPerfect files, which carry the which are widely used for Internet extension of the software in which they were applications. GIF is a lossless compression created. format but is limited to 256 colors or less. — RTF or Rich Text Format files, are supported by – Joint Photographic Experts Group (JPEG) a variety of applications and saved with files, which are used for full-color or gray- formatting instructions (such as page layout). scale images. Used primarily for photographs, — Portable Document Format (PDF) files contain the standard JPEG format uses a lossy an image of the page, including text and compression algorithm that discards some graphics. PDF files are widely used for read- information to achieve a smaller file size. only file sharing and printing. Adobe Acrobat is, by far, the most popular PDF file although – Portable Network Graphics (PNG) files. A other types are available. Acrobat reader, lossless compression designed to replace GIF available for no charge, is necessary for files. PNG is completely patent and license reading an Adobe PDF file. free and is of higher quality than GIF. ◆ ◆ Graphics files. Graphics files store an image (e.g., Data files. Data files are created in database photograph, drawing) and are divided into two software programs. Data files are divided into basic types: fields and tables that contain discrete elements of information. The software builds the — Vector-based files that store the image as relationships between these discrete elements. geometric shapes stored as mathematical For example, a customer service database may formulas, which allow the image to be scaled contain customer name, address, and billing without distortion. Common types of vector- history fields. These fields may be organized into based file formats include: separate tables (e.g., one table for all customer – Drawing Interchange Format (DXF) files, name fields). You may convert data files to a text which are widely used in computer-aided format, but you will lose the relationships among design software programs, such as those used the fields and tables. For example, if you convert by engineers and architects the information in the customer database to text, – Encapsulated PostScript (EPS) files, which you may end up with ten pages of names, ten are widely used in desktop publishing pages of addresses, and a thousand pages of software programs billing information, with no indication of which – Computer Graphics Metafile (CGM) files, information is related. which are widely used in many image- ◆ Spreadsheet files. Spreadsheet files store the oriented software programs (e.g., Photoshop) value of the numbers in their cells, as well as the and offer a high degree of durability relationships of those numbers. For example, one – Shapefiles (SHP), ESRI GIS applications use cell may contain the formula that sums two other vector coordinates to store non-topological cells. Like data files, spreadsheet files are most geometry and attribute information for often in the proprietary format of the software features. program in which they were created. Some software programs can import and export data — Raster-based files that store the image as a from other sources, including software programs collection of pixels. Raster graphics are also designed for such data sharing (e.g., Data referred to as bitmapped images. Raster Interchange Format [DIF]). Spreadsheet files can graphics cannot be scaled without distortion. be exported as text files, but the value and Common types of raster-based file formats relationship of the numbers are lost. include: – Bitmap (BMP) files, which are uncompressed, relatively low-quality files used most often in MORE ➔ word processing applications South Carolina Department of Archives & History – Tagged Image File Format (TIFF) files, which www.state.sc.us/scdah/erg/erg.htm January 2005 Version 1 — FF Page 2 ◆ Video and audio files. These files contain moving with content through the use of pre-defined images (e.g., digitized video, animation) and tags, HTML is simple to use but limited in sound data. They are most often created and scope. Other markup languages such as XHTML viewed in proprietary software programs and and XML offer greater flexibility. stored in proprietary formats. Common files — eXtensible Hypertext Markup Language formats in use include QuickTime, Motion Picture (XHTML) combines the flexibility found in XML Experts Group (MPEG) formats and Real Video. with the ease of use associated with HTML. ◆ Markup languages. Markup languages, also called Strict XHTML rules improve consistency and markup formats, contain embedded instructions provide the ability to create your own markup for displaying or understanding the content of tags. Because they share similar rules, the file. They provide the means to transmit and converting XHTML into XML is easier than share information over the web. The World Wide converting HTML into XML. Web Consortium (W3C) (www.w3c.org) supports — eXtensible Markup Language (XML) is a these standards. Common markup language file relatively simple language based on SGML that formats include the following: is gaining popularity for managing and sharing — Standard Generalized Markup Language (SGML), information. XML provides even greater a common markup language used in flexibility and control than XHTML while government offices worldwide, is an avoiding the complexities associated with international standard. HTML and XML are SGML. derived
Recommended publications
  • The Microsoft Office Open XML Formats New File Formats for “Office 12”
    The Microsoft Office Open XML Formats New File Formats for “Office 12” White Paper Published: June 2005 For the latest information, please see http://www.microsoft.com/office/wave12 Contents Introduction ...............................................................................................................................1 From .doc to .docx: a brief history of the Office file formats.................................................1 Benefits of the Microsoft Office Open XML Formats ................................................................2 Integration with Business Data .............................................................................................2 Openness and Transparency ...............................................................................................4 Robustness...........................................................................................................................7 Description of the Microsoft Office Open XML Format .............................................................9 Document Parts....................................................................................................................9 Microsoft Office Open XML Format specifications ...............................................................9 Compatibility with new file formats........................................................................................9 For more information ..............................................................................................................10
    [Show full text]
  • Background Information History, Licensing, and File Formats Copyright This Document Is Copyright © 2008 by Its Contributors As Listed in the Section Titled Authors
    Getting Started Guide Appendix B Background Information History, licensing, and file formats Copyright This document is Copyright © 2008 by its contributors as listed in the section titled Authors. You may distribute it and/or modify it under the terms of either the GNU General Public License, version 3 or later, or the Creative Commons Attribution License, version 3.0 or later. All trademarks within this guide belong to their legitimate owners. Authors Jean Hollis Weber Feedback Please direct any comments or suggestions about this document to: [email protected] Acknowledgments This Appendix includes material written by Richard Barnes and others for Chapter 1 of Getting Started with OpenOffice.org 2.x. Publication date and software version Published 13 October 2008. Based on OpenOffice.org 3.0. You can download an editable version of this document from http://oooauthors.org/en/authors/userguide3/published/ Contents Introduction...........................................................................................4 A short history of OpenOffice.org..........................................................4 The OpenOffice.org community.............................................................4 How is OpenOffice.org licensed?...........................................................5 What is “open source”?..........................................................................5 What is OpenDocument?........................................................................6 File formats OOo can open.....................................................................6
    [Show full text]
  • File Formats
    Electronic Records Management Guidelines - File Formats Rapid changes in technology mean that file formats can become obsolete quickly and cause problems for your records management strategy. A long-term view and careful planning can overcome this risk and ensure that you can meet your legal and operational requirements. Legally, your records must be authentic, complete, accessible, legally admissible in court, and durable for as long as your approved records retention schedules require. For example, you can convert a record to another, more durable format (e.g., from a nearly obsolete software program to a text file). That copy, as long as it is created in a trustworthy manner, is legally acceptable. The software in which a file is created usually has a default format, often indicated by a file name suffix (e.g., *.PDF for portable document format). Most software allows authors to select from a variety of formats when they save a file (e.g., document [DOC], Rich Text Format [RTF], text [TXT] in Microsoft Word). Some software, such as Adobe Acrobat, is designed to convert files from one format to another. Legal Framework: Key Concepts As you consider the file format options available to you, you will need to be familiar with the following concepts: Proprietary and non-proprietary file formats File format types Preservation: conversion and migration Compression Importance of planning File Format Decisions and Electronic Records Management Goals Proprietary and Non-proprietary File Formats A file format is usually described as either proprietary or non-proprietary: Proprietary formats. Proprietary file formats are controlled and supported by just one software developer, or can only be read by a limited number of other programs.
    [Show full text]
  • Sharing Files with Microsoft Office Users
    Sharing Files with Microsoft Office Users Title: Sharing Files with Microsoft Office Users: Version: 1.0 First edition: November 2004 Contents Overview.........................................................................................................................................iv Copyright and trademark information........................................................................................iv Feedback.................................................................................................................................... iv Acknowledgments......................................................................................................................iv Modifications and updates......................................................................................................... iv File formats...................................................................................................................................... 1 Bulk conversion............................................................................................................................... 1 Opening files....................................................................................................................................2 Opening text format files.............................................................................................................2 Opening spreadsheets..................................................................................................................2 Opening presentations.................................................................................................................2
    [Show full text]
  • Document Conversion Technical Manual Version: 10.3.0
    Kofax Communication Server KCS Document Conversion Technical Manual Version: 10.3.0 Date: 2019-12-13 © 2019 Kofax. All rights reserved. Kofax is a trademark of Kofax, Inc., registered in the U.S. and/or other countries. All other trademarks are the property of their respective owners. No part of this publication may be reproduced, stored, or transmitted in any form without the prior written permission of Kofax. Table of Contents Chapter 1: Preface...................................................................................................................................... 7 Purpose...............................................................................................................................................7 Third Party Licenses................................................................................................................7 Usage..................................................................................................................................................7 Feature Comparison (Links Versus TWS)......................................................................................... 8 Conversion Tools................................................................................................................................ 8 Imgio.......................................................................................................................................10 File Formats......................................................................................................................................10
    [Show full text]
  • Microsoft Office Word 2003 Rich Text Format (RTF) Specification White Paper Published: April 2004 Table of Contents
    Microsoft Office Word 2003 Rich Text Format (RTF) Specification White Paper Published: April 2004 Table of Contents Introduction......................................................................................................................................1 RTF Syntax.......................................................................................................................................2 Conventions of an RTF Reader.............................................................................................................4 Formal Syntax...................................................................................................................................5 Contents of an RTF File.......................................................................................................................6 Header.........................................................................................................................................6 Document Area............................................................................................................................29 East ASIAN Support........................................................................................................................142 Escaped Expressions...................................................................................................................142 Character Set.............................................................................................................................143 Character Mapping......................................................................................................................143
    [Show full text]
  • Microsoft Exchange 2007 Journaling Guide
    Microsoft Exchange 2007 Journaling Guide Digital Archives Updated on 12/9/2010 Document Information Microsoft Exchange 2007 Journaling Guide Published August, 2008 Iron Mountain Support Information U.S. 1.800.888.2774 [email protected] Copyright © 2008 Iron Mountain Incorporated. All Rights Reserved. Trademarks Iron Mountain and the design of the mountain are registered trademarks of Iron Mountain Incorporated. All other trademarks and registered trademarks are the property of their respective owners. Entities under license agreement: Please consult the Iron Mountain & Affiliates Copyright Notices by Country. Confidentiality CONFIDENTIAL AND PROPRIETARY INFORMATION OF IRON MOUNTAIN. The information set forth herein represents the confidential and proprietary information of Iron Mountain. Such information shall only be used for the express purpose authorized by Iron Mountain and shall not be published, communicated, disclosed or divulged to any person, firm, corporation or legal entity, directly or indirectly, or to any third person without the prior written consent of Iron Mountain. Disclaimer While Iron Mountain has made every effort to ensure the accuracy and completeness of this document, it assumes no responsibility for the consequences to users of any errors that may be contained herein. The information in this document is subject to change without notice and should not be considered a commitment by Iron Mountain. Iron Mountain Incorporated 745 Atlantic Avenue Boston, MA 02111 +1.800.934.0956 www.ironmountain.com/digital
    [Show full text]
  • File Format Guidelines for Management and Long-Term Retention of Electronic Records
    FILE FORMAT GUIDELINES FOR MANAGEMENT AND LONG-TERM RETENTION OF ELECTRONIC RECORDS 9/10/2012 State Archives of North Carolina File Format Guidelines for Management and Long-Term Retention of Electronic records Table of Contents 1. GUIDELINES AND RECOMMENDATIONS .................................................................................. 3 2. DESCRIPTION OF FORMATS RECOMMENDED FOR LONG-TERM RETENTION ......................... 7 2.1 Word Processing Documents ...................................................................................................................... 7 2.1.1 PDF/A-1a (.pdf) (ISO 19005-1 compliant PDF/A) ........................................................................ 7 2.1.2 OpenDocument Text (.odt) ................................................................................................................... 3 2.1.3 Special Note on Google Docs™ .......................................................................................................... 4 2.2 Plain Text Documents ................................................................................................................................... 5 2.2.1 Plain Text (.txt) US-ASCII or UTF-8 encoding ................................................................................... 6 2.2.2 Comma-separated file (.csv) US-ASCII or UTF-8 encoding ........................................................... 7 2.2.3 Tab-delimited file (.txt) US-ASCII or UTF-8 encoding .................................................................... 8 2.3
    [Show full text]
  • "Digital Video and Graphics for Planetaria and Exhibits" © Jeff Bowen
    3924 Pendleton Way • Indianapolis, IN• 46226• 317-536-1283 www.bowentechnovation.com "Digital Video and Graphics for Planetaria and Exhibits" © Jeff Bowen. June 25, 2004 [email protected] Section 1.0 Workshop Synopsis As we are heading into an era in which all visual media in the planetarium will be digital, attendees will learn what are current and forthcoming visual file formats and production methodologies. What is MPEG...DVI...when do you import an avi or mov? What are SD files...is 16:9 HD? Why is editing MPEG a bad idea? How do you archive and access digital files? What formats will soon be obsolete and which new ones will stay? Which of the new DVD formats will stick around? Jeff Bowen will share his extensive experience in producing award-winning digital media for education, planetariums, exhibits and broadcast. Section 2.0 Qualification of Attendees Number who have worked with video editing? Number who have worked with Photoshop? Number who have worked with digital video editing? Number who use digital playback in the dome? Number who have worked with digital audio editing? Number who have worked with digital editing? Created by Jeff Bowen Page 1 7/6/2004 Section 3.0 Digital Video Terminology and Specs Analog Video is represented as a continuous (time varying) signal. Digital Video is represented as a sequence of digital images. NTSC Video- 525 scan lines per frame, 30 frames per second (or be exact, 29.97 fps, 33.37 msec/frame) Interlaced, each frame is divided into 2 fields, 262.5 lines/field.
    [Show full text]
  • Rastermaster® for Java™ V20.2 Programmer's Guide
    RasterMaster® for Java™ v20.2 Programmer’s Guide RasterMaster® is the industry’s leading document/image conversion and imaging library for Java. It is continually enhanced with new functionality and formats and was developed by Snowbound’s experts who have nearly a hundred years of combined imaging expertise. It provides High-Speed File Conversion as well as Extensive Format Support. This guide is designed to provide developers with a high-level overview of RasterMaster’s functionality and capabilities, including conceptual information as well as code samples. For the full API reference, please see the Javadocs included with your build or visit docs.snowbound.com/rastermaster/latest/java/rastermaster-api/. IMPORTANT NOTICE: The online version of this manual contains information on the latest updates to RasterMaster. To find the most recent version of this manual, please visit the online version at www.rastermaster.com or download the most recent version from our website at www.snowbound.com/support/manuals.html. Copyright Information While Snowbound® Software believes the information included in this publication is correct as of the publication date, information in this document is subject to change without notice. UNLESS EXPRESSLY SET FORTH IN A WRITTEN AGREEMENT SIGNED BY AN AUTHORIZED REPRESENTATIVE OF SNOWBOUND SOFTWARE CORPORATION MAKES NO WARRANTY OR REPRESENTATION OF ANY KIND WITH RESPECT TO THE INFORMATION CONTAINED HEREIN, INCLUDING WARRANTY OF MERCHANTABILITY AND FITNESS FOR A PURPOSE, NON-INFRINGEMENT, OR THOSE WHICH MAY BE IMPLIED THROUGH COURSE OF DEALING OR CUSTOM OF TRADE. WITHOUT LIMITING THE FOREGOING, CUSTOMER UNDERSTANDS THAT SNOWBOUND DOES NOT WARRANT THAT CUSTOMER’S OPERATION OF THE SOFTWARE WILL BE UNINTERRUPTED OR ERROR-FREE, THAT ALL DEFECTS IN THE SOFTWARE WILL BE CORRECTED, OR THAT THE RESULTS OF THE SOFTWARE WILL BE ERROR-FREE.
    [Show full text]
  • Office File Formats Overview
    Office File Formats Overview Tom Jebo Sr Escalation Engineer Agenda • Microsoft Office Supported Formats • Open Specifications File Format Documents and Resources • Benefits of broadly-adopted standards • Microsoft Office Extensibility • OOXML Format Overview Microsoft Office 2016 File Format Support • Office Open XML (.docx, .xlsx, .pptx) • Microsoft Office Binary Formats (.doc, .xls, .ppt) (legacy) • OpenDocument Format (.odt, .ods, .odp) • Portable Document Format (.pdf) • Open XML Paper Specification (.xps) Microsoft File Formats Documents and Resources File Format Related Documents • Documentation Intro & Reference Binary Formats Standards • https://msdn.microsoft.com/en- [MS-OFFDI] [MS-DOC] [MS-DOCX] us/library/gg134029.aspx [MS-OFCGLOS] [MS-XLS] [MS-XLSX] [MS-OFREF] [MS-XLSB] [MS-PPTX] • [MS-OFFDI] start here [MS-OSHARED] [MS-PPT] [MS-OE376] • Standards implementation notes [MS-OFFCRYPTO] [MS-OI29500] • File format documentation Macros [MS-OODF] OneNote [MS-OFFMACRO] [MS-OODF2] • SharePoint & Exchange/Outlook client-server protocols [MS-ONE] [MS-OFFMACRO2] [MS-OODF3] • Windows client and server protocols [MS-ONESTORE] [MS-OVBA] [MS-ODRAWXML] • .NET Framework Office Drawing/Graphics Other • XAML Customization [MS-CTDOC] [MS-ODRAW] [MS-DSEXPORT] • Support [MS-CTXLS] [MS-OGRAPH] [MS-ODCFF] [MS-CUSTOMUI] [MS-OFORMS] • [email protected] [MS-CUSTOMUI2] [MS-WORDLFF] • MSDN Open Specifications forums [MS-OWEMXML] Outlook [MS-XLDM] [MS-PST] [MS-3DMDTP] Reviewing Binary Formats • CFB – [MS-CFB] storages and streams Binary Formats • Drawing
    [Show full text]
  • How to Create Accessible Documents in Microsoft Office (Word, Powerpoint Or Excel) Or Google Doc
    How to Create Accessible Documents in Microsoft Office (Word, PowerPoint or Excel) or Google Doc Microsoft Word application is commonly used among individuals with and without disabilities, and it has accessibility features. The following part provides information to guide you to build accessible documents in Microsoft Word. Those steps are applicable in other Microsoft Office (PowerPoint or Excel) and Google Doc as well. The major components include use of Font, ​ ​ Headings, Lists, Meaningful Hyperlinks, Alternate Text for Images, Table, and Document ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ Language Identification. ​ ❖ Use Sans Serif Fonts ​ ➢ Sans serif fonts (Helvetica, Arial, Verdana, Calibri, Comic Sans, and Geneva) are easier to read by digital, for learners with dyslexia ➢ Serif fonts (i.e., Times New Roman) are easier to read in print ➢ Suggest the use of an 11- or 12-point font size ❖ Use Headings to Organize the Document Instead of Manual Formatting ​ ​ ➢ Point your mouse to Home tab ➢ Find and click on Styles ➢ Use the built-in Heading styles, such as “Heading 1” (as the main heading) and “Heading 2” (as subheadings) ➢ The default, light blue, can be hard to read and is not accessible. Use the right mouse button to click on Heading 1, select Modify, and change the color of the heading from blue to black ➢ Change the default font from Calibri to any other san serif font, select a different size, or select bold; the American Federation for the Blind suggests, “Use bold. Avoid using italics or all capital letters.” ➢ Form an outline with headings ➢ With additional levels of headings within the document’s outline, use “Heading 3”, “Heading 4”, etc.
    [Show full text]