Forcepoint DLP Supported File Formats and Size Limits

Total Page:16

File Type:pdf, Size:1020Kb

Forcepoint DLP Supported File Formats and Size Limits Forcepoint DLP Supported File Formats and Size Limits Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x This article provides a list of the file formats that can be analyzed by Forcepoint DLP, as well as the file size limits for network, endpoint, and discovery functions. See: ● Supported File Formats ● File Size Limits © 2018 Forcepoint LLC Supported File Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x The following tables lists the file formats supported by Forcepoint DLP. File formats are in alphabetical order by format group. ● Archive Formats, page 3 ● Backup Formats, page 5 ● Business Intelligence (BI) and Analysis Formats, page 6 ● Computer-Aided Design Formats, page 7 ● Cryptography Formats, page 8 ● Database Formats, page 9 ● Desktop Publishing Formats, page 10 ● Executable Formats, page 11 ● Font Formats, page 12 ● Library Formats, page 13 ● Mail Formats, page 14 ● Miscellaneous Formats, page 15 ● Multimedia Formats, page 19 ● Object Formats, page 22 ● Presentation Formats, page 23 ● Project Management Formats, page 24 ● Raster Graphics Formats, page 25 ● Spreadsheet Formats, page 28 ● Text and Markup Formats, page 30 ● Vector Graphics Formats, page 31 ● Word Processing Formats, page 33 Supported file formats are added to and updated frequently. Supported File Formats and Size Limits 2 Archive Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x File Format Description 7-Zip 7-Zip format ACE ACE Archive AppleDouble AppleDouble AppleSingle AppleSingle ARC/PAK Archive ARC/PAK Archive ARJ ARJ Archive ARJ (Archive by Robert Jung) File format ARJ (Archive by Robert Jung) File format B1 B1 BinHex BinHex 4.0 encoded file Bzip 2 Compressed File Bzip 2 Compressed File BZIP2 BZIP CHM CHM Compactor / Compact Pro Archive Compactor / Compact Pro Archive COMPOUND COMPOUND CPIO CPIO cpio Archive CHRhdr cpio archive (CHR Header) cpio Archive CRChdr cpio archive (CRC Header) DEB DEB Disk Doubler Compression Disk Doubler Compression format Disk Image Disk Image Expert Witness Compression Format Expert Witness Compression Format Ghost Disk Image File Ghost Disk Image File Git Packfile format Git Packfile format GZ Compress GZ Compress GZIP GZIP reader Incomplete ZIP Incomplete ZIP InstallShield Cabinet Archive format InstallShield Cabinet Archive format InstallShield Z Archive format InstallShield Z Archive format Internet Archive ARC format Internet Archive ARC format ISO ISO ISO-9660 CD Disc Image Format Reader ISO-9660 CD Disc Image Format Reader Legato EMailXtender Archive Legato EMailXtender Archives Format Supported File Formats and Size Limits 3 File Format Description LZH LHA Archive LZMA compressed data format LZMA compressed data format Mac Disk Copy Disk Image Mac Disk Copy Disk Image MacBinary MacBinary Microsoft Cabinet format Microsoft Cabinet File Microsoft Compress Folder Unix Compress Microsoft Compressed Folder Microsoft Compressed Folder NSIS NSIS PEX Binary Archive SUN PEX Binary Archive PKZIP PKZip format RAR Format RAR archive RAR5 RAR5 RPM RPM SAP compression archive SAR format SAP compression archive SAR format Security Accounts Manager (SAM) Security Accounts Manager (SAM) SFX Self Extracting Archive SPLIT SPLIT Stuff It Archive (Mac) Stuff It Archive (Mac) UNIX SHAR UNIX SHAR UNIX Tape ARchiver TAR UU Encoded UU encoded encryption file Wang Office GDL Wang Office GDL Header Web ARChive Files Web ARChive Files WIM WIM xz xz ZZ Supported File Formats and Size Limits 4 Backup Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x File Format Description Microsoft Tape Format Microsoft Tape Format (MTF) Windows Backup File Windows Backup File Supported File Formats and Size Limits 5 Business Intelligence (BI) and Analysis Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x File Format Description Microsoft Power BI Desktop format Microsoft Power BI Desktop format Tableau Data Source format Tableau Data Source format Tableau Extract format Tableau Extract format Tableau Map Source format Tableau Map Source format Tableau Packaged Data Source format Tableau Packaged Data Source format Tableau Packaged Workbook format Tableau Packaged Workbook format Tableau Preferences format Tableau Preferences format Tableau Workbook format Tableau Workbook format Supported File Formats and Size Limits 6 Computer-Aided Design Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x File Format Description 3D Systems STL Binary Format 3D Systems STL Binary Format Abaqus ODB Format Abaqus ODB Format AutoCAD DXF binary format AutoCAD DXF AutoCAD DXF text format AutoCAD DXF Autodesk Design Web Format Autodesk Design Web Format Autodesk DWG Autodesk Drawing (DWG) Autodesk Maya binary file Autodesk Maya binary file Autodesk Maya Textual file format Autodesk Maya Textual file format CANape ASC format CANape ASC format CATIA formats CATIA formats (CAT*) LS-DYNA binary output (binout) format LS-DYNA Binary Output (Binout) format LS-DYNA State Database format LS-DYNA State Database format Macro-enabled Microsoft Visio Drawing Macro-enabled Microsoft Visio Drawing Macro-enabled Microsoft Visio Stencil Macro-enabled Microsoft Visio Stencil Macro-enabled Microsoft Visio Template Macro-enabled Microsoft Visio Template Microsoft Visio Microsoft Visio Microsoft Visio Drawing Microsoft Visio Drawing Microsoft Visio Stencil Microsoft Visio Stencil Microsoft Visio Template Microsoft Visio Template MicroStation MicroStation V8 DGN (OLE) Nastran OP2 format Nastran OP2 format PTC Creo ASM Format PTC Creo ASM Format PTC Creo DRW Format PTC Creo DRW Format PTC Creo FRM Format PTC Creo FRM Format PTC Creo PRT Format PTC Creo PRT Format Siemens NX CAD Format Siemens NX CAD Format STEP format ISO 10303-21 STEP format UGS Jupiter Tesselation file Jupiter Tesselation file Various PTC Creo CAD Formats Various PTC Creo CAD Formats Supported File Formats and Size Limits 7 Cryptography Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x File Format Description ASCII-armored PGP Encoded ASCII-armored PGP encoded ASCII-armored PGP Signed ASCII-armored PGP signed ASCII-armored Public Keyring ASCII-armored PGP Public Keyring Encrypted 7-Zip File Encrypted 7-Zip File Encrypted Access Database files (Legacy) Encrypted Access Database files (Legacy) Encrypted B1 File Encrypted B1 File Encrypted Excel Binary Files (Legacy) Encrypted Excel Binary files (Legacy) Encrypted files of unknown format Encrypted files of unknown format Encrypted PowerPoint Binary Files Encrypted PowerPoint Binary files (Legacy) (Legacy) Encrypted RAR5 File Encrypted RAR5 File Encrypted Word Binary Files (Legacy) Encrypted Word Binary files (Legacy) Microsoft Office Encrypted Files Microsoft Office Encrypted files (OOXML) (OOXML) Nero Encrypted File Nero Encrypted File Open PGP Open PGP Message Format (with new packet format) PDF Encrypted format Word processor encrypted PDF PGP Compressed Data PGP Compressed Data PGP Encrypted Data PGP Encrypted Data PGP Public Keyring PGP Public Keyring PGP Secret Keyring PGP Secret Keyring PGP Signature Certificate PGP Signature Certificate PGP Signed and Encrypted Data PGP Signed and Encrypted Data PGP Signed Data PGP Signed Data RAR Encrypted Format Encrypted RAR archive ZIP encrypted format Encapsulation encrypted ZIP archive Supported File Formats and Size Limits 8 Database Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x File Format Description Ability Database Database Ability Access Database Template Files Access Database Template files Borland Reflex 2 Borland Reflex 2 dBase Database dBase database FileMaker Mac Filemaker Mac Microsoft Access Microsoft Access Microsoft Access 2000 Microsoft Access 2000 Microsoft Access 95 Microsoft Access 95 Microsoft Access 97 Microsoft Access 97 Microsoft Access Database (.accdb) Microsoft Access Database (.accdb) Microsoft Exchange Server Database Microsoft Exchange Server database files Files Microsoft Program Database format Microsoft Program Database format Microsoft Works Database (DOS) Microsoft Works database for DOS Microsoft Works Database (Mac) Microsoft Works database for Macintosh Microsoft Works Database (Windows) Microsoft Works database for Windows MySQL table definition file MySQL table definition file Paradox (PC) Database Paradox (PC) database Reflex Reflex SAS7BDAT database storage format SAS7BDAT database storage format SmartWare II Database SmartWare II SQLite database format SQLite database format Supported File Formats and Size Limits 9 Desktop Publishing Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x File Format Description Adobe FrameMaker Adobe FrameMaker Adobe FrameMaker Book Adobe FrameMaker Aldus PageMaker (DOS) Aldus PageMaker for Windows Aldus PageMaker (Mac) Aldus PageMaker for Macintosh Maker Markup Language Adobe FrameMaker Markup Language Microsoft Publisher Microsoft Publisher Plan Perfect Plan Perfect Quark Express Quark Xpress Mac QuarkXPress Intel format QuarkXPress Intel format Supported File Formats and Size Limits 10 Executable Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.6.x File Formats Description ELF Executable MELF Executable Executable DOS/Windows Program MS DOS Device Driver MS DOS Device Driver Program Information File Program Information File Unix Executable 3B20 Unix Executable (3B20) Unix Executable Basic 16 Unix Executable (Basic-16) Unix Executable Bell 5.0 Unix Executable (Bell 5.0) Unix Executable iAPX 286 Unix Executable (iAPX 286) Unix
Recommended publications
  • Autocad 2011 DXF Reference
    AutoCAD 2011 DXF Reference February 2010 © 2010 Autodesk, Inc. All Rights Reserved. Except as otherwise permitted by Autodesk, Inc., this publication, or parts thereof, may not be reproduced in any form, by any method, for any purpose. Certain materials included in this publication are reprinted with the permission of the copyright holder. Trademarks The following are registered trademarks or trademarks of Autodesk, Inc., and/or its subsidiaries and/or affiliates in the USA and other countries: 3DEC (design/logo), 3December, 3December.com, 3ds Max, Algor, Alias, Alias (swirl design/logo), AliasStudio, Alias|Wavefront (design/logo), ATC, AUGI, AutoCAD, AutoCAD Learning Assistance, AutoCAD LT, AutoCAD Simulator, AutoCAD SQL Extension, AutoCAD SQL Interface, Autodesk, Autodesk Envision, Autodesk Intent, Autodesk Inventor, Autodesk Map, Autodesk MapGuide, Autodesk Streamline, AutoLISP, AutoSnap, AutoSketch, AutoTrack, Backburner, Backdraft, Built with ObjectARX (logo), Burn, Buzzsaw, CAiCE, Civil 3D, Cleaner, Cleaner Central, ClearScale, Colour Warper, Combustion, Communication Specification, Constructware, Content Explorer, Dancing Baby (image), DesignCenter, Design Doctor, Designer's Toolkit, DesignKids, DesignProf, DesignServer, DesignStudio, Design Web Format, Discreet, DWF, DWG, DWG (logo), DWG Extreme, DWG TrueConvert, DWG TrueView, DXF, Ecotect, Exposure, Extending the Design Team, Face Robot, FBX, Fempro, Fire, Flame, Flare, Flint, FMDesktop, Freewheel, GDX Driver, Green Building Studio, Heads-up Design, Heidi, HumanIK, IDEA Server,
    [Show full text]
  • Contrasting the Performance of Compression Algorithms on Genomic Data
    Contrasting the Performance of Compression Algorithms on Genomic Data Cornel Constantinescu, IBM Research Almaden Outline of the Talk: • Introduction / Motivation • Data used in experiments • General purpose compressors comparison • Simple Improvements • Special purpose compression • Transparent compression – working on compressed data (prototype) • Parallelism / Multithreading • Conclusion Introduction / Motivation • Despite the large number of research papers and compression algorithms proposed for compressing genomic data generated by sequencing machines, by far the most commonly used compression algorithm in the industry for FASTQ data is gzip. • The main drawbacks of the proposed alternative special-purpose compression algorithms are: • slow speed of either compression or decompression or both, and also their • brittleness by making various limiting assumptions about the input FASTQ format (for example, the structure of the headers or fixed lengths of the records [1]) in order to further improve their specialized compression. 1. Ibrahim Numanagic, James K Bonfield, Faraz Hach, Jan Voges, Jorn Ostermann, Claudio Alberti, Marco Mattavelli, and S Cenk Sahinalp. Comparison of high-throughput sequencing data compression tools. Nature Methods, 13(12):1005–1008, October 2016. Fast and Efficient Compression of Next Generation Sequencing Data 2 2 General Purpose Compression of Genomic Data As stated earlier, gzip/zlib compression is the method of choice by the industry for FASTQ genomic data. FASTQ genomic data is a text-based format (ASCII readable text) for storing a biological sequence and the corresponding quality scores. Each sequence letter and quality score is encoded with a single ASCII character. FASTQ data is structured in four fields per record (a “read”). The first field is the SEQUENCE ID or the header of the read.
    [Show full text]
  • DXF Reference
    AutoCAD 2012 DXF Reference February 2011 © 2011 Autodesk, Inc. All Rights Reserved. Except as otherwise permitted by Autodesk, Inc., this publication, or parts thereof, may not be reproduced in any form, by any method, for any purpose. Certain materials included in this publication are reprinted with the permission of the copyright holder. Trademarks The following are registered trademarks or trademarks of Autodesk, Inc., and/or its subsidiaries and/or affiliates in the USA and other countries: 3DEC (design/logo), 3December, 3December.com, 3ds Max, Algor, Alias, Alias (swirl design/logo), AliasStudio, Alias|Wavefront (design/logo), ATC, AUGI, AutoCAD, AutoCAD Learning Assistance, AutoCAD LT, AutoCAD Simulator, AutoCAD SQL Extension, AutoCAD SQL Interface, Autodesk, Autodesk Intent, Autodesk Inventor, Autodesk MapGuide, Autodesk Streamline, AutoLISP, AutoSnap, AutoSketch, AutoTrack, Backburner, Backdraft, Beast, Built with ObjectARX (logo), Burn, Buzzsaw, CAiCE, Civil 3D, Cleaner, Cleaner Central, ClearScale, Colour Warper, Combustion, Communication Specification, Constructware, Content Explorer, Dancing Baby (image), DesignCenter, Design Doctor, Designer's Toolkit, DesignKids, DesignProf, DesignServer, DesignStudio, Design Web Format, Discreet, DWF, DWG, DWG (logo), DWG Extreme, DWG TrueConvert, DWG TrueView, DXF, Ecotect, Exposure, Extending the Design Team, Face Robot, FBX, Fempro, Fire, Flame, Flare, Flint, FMDesktop, Freewheel, GDX Driver, Green Building Studio, Heads-up Design, Heidi, HumanIK, IDEA Server, i-drop, Illuminate Labs
    [Show full text]
  • Parallel Data Analysis Directly on Scientific File Formats
    Parallel Data Analysis Directly on Scientific File Formats Spyros Blanas 1 Kesheng Wu 2 Surendra Byna 2 Bin Dong 2 Arie Shoshani 2 1 The Ohio State University 2 Lawrence Berkeley National Laboratory [email protected] {kwu, sbyna, dbin, ashosani}@lbl.gov ABSTRACT and physics, produce massive amounts of data. The size of Scientific experiments and large-scale simulations produce these datasets typically ranges from hundreds of gigabytes massive amounts of data. Many of these scientific datasets to tens of petabytes. For example, the Intergovernmen- are arrays, and are stored in file formats such as HDF5 tal Panel on Climate Change (IPCC) multi-model CMIP-5 and NetCDF. Although scientific data management systems, archive, which is used for the AR-5 report [22], contains over petabytes such as SciDB, are designed to manipulate arrays, there are 10 of climate model data. Scientific experiments, challenges in integrating these systems into existing analy- such as the LHC experiment routinely store many gigabytes sis workflows. Major barriers include the expensive task of of data per second for future analysis. As the resolution of preparing and loading data before querying, and convert- scientific data is increasing rapidly due to novel measure- ing the final results to a format that is understood by the ment techniques for experimental data and computational existing post-processing and visualization tools. As a con- advances for simulation data, the data volume is expected sequence, integrating a data management system into an to grow even further in the near future. existing scientific data analysis workflow is time-consuming Scientific data are often stored in data formats that sup- and requires extensive user involvement.
    [Show full text]
  • Full Document
    R&D Centre for Mobile Applications (RDC) FEE, Dept of Telecommunications Engineering Czech Technical University in Prague RDC Technical Report TR-13-4 Internship report Evaluation of Compressibility of the Output of the Information-Concealing Algorithm Julien Mamelli, [email protected] 2nd year student at the Ecole´ des Mines d'Al`es (N^ımes,France) Internship supervisor: Luk´aˇsKencl, [email protected] August 2013 Abstract Compression is a key element to exchange files over the Internet. By generating re- dundancies, the concealing algorithm proposed by Kencl and Loebl [?], appears at first glance to be particularly designed to be combined with a compression scheme [?]. Is the output of the concealing algorithm actually compressible? We have tried 16 compression techniques on 1 120 files, and the result is that we have not found a solution which could advantageously use repetitions of the concealing method. Acknowledgments I would like to express my gratitude to my supervisor, Dr Luk´aˇsKencl, for his guidance and expertise throughout the course of this work. I would like to thank Prof. Robert Beˇst´akand Mr Pierre Runtz, for giving me the opportunity to carry out my internship at the Czech Technical University in Prague. I would also like to thank all the members of the Research and Development Center for Mobile Applications as well as my colleagues for the assistance they have given me during this period. 1 Contents 1 Introduction 3 2 Related Work 4 2.1 Information concealing method . 4 2.2 Archive formats . 5 2.3 Compression algorithms . 5 2.3.1 Lempel-Ziv algorithm .
    [Show full text]
  • Archicad Windows Bricscad Windows Autocad® Windows
    TurboCAD® BricsCAD Windows AutoCAD® Windows ArchiCAD Windows TurboCAD porovnání verzí včetně nástrojů jiných CAD od výrobce Pro Platinum 2018 Expert 2018 Deluxe 2018 Designer 2018 Platinum Pro Classic 2018 LT Suggested Retail Price $1 499,99 $499,99 $149,99 $49,99 $1110 $750 $590 $1,535.00/ year $380.00/ year$3750 /year including annual subscripon PRODUCT POSITIONING 2D/3D Drafting with Solid and Surface Modeling ✓ ✓ ✓ ✓ ✓ 2D/3D with 3D Surface Modeling ✓ ✓ ✓ ✓ ✓ ✓ ✓ 2D Drafting with AutoCAD® like User Interface Option ✓ ✓ ✓ ✓ ✓ ✓ ✓ 2D Drafting ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ USABILITY & INTERFACE 32 bit and 64 bit versions ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Command Line ✓ ✓ ✓ ✓ ✓ ✓ ✓ PUBLISH command ✓ ✓ ✓ ✓ ✓ FLATSHOT command ✓ ✓ ✓ XEDGES command ✓ ✓ ✓ ✓ ADDSELECTED command ✓ ✓ ✓ ✓ ✓ SELECTSIMILAR command ✓ ✓ ✓ ✓ ✓ RESETBLOCK command ✓ ✓ ✓ ✓ ✓ Design Director for object property management ✓ ✓ ✓ ✓ ✓ Draw Order by Layer ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Dynamic Input Cursor ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Conceptual Selector ✓ ✓ ✓ ✓ Explode Viewports ✓ ✓ Explorer Palette ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Compass Rose ✓ ✓ ✓ ✓ ✓ ✓ Image Manager ✓ ✓ ✓ ✓ ✓ ✓ Intelligent Cursor ✓ ✓ ✓ ✓ ✓ ✓ ✓ Intelligent File Send (E pack) ✓ ✓ ✓ ✓ ✓ ✓ Layer preview ✓ ✓ ✓ ✓ ✓ ✓ ✓ Layer Filters ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Layer Management (Layer States Manager) ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Deletion of $Construction and $Constraints layers ✓ ✓ ✓ ✓ Measurement Tool ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Distance Tool ✓ Object SNAP Prioritization ✓ ✓ ✓ ✓ ✓ ✓ SNAP between two points ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Protractor Tool ✓ ✓ ✓ Flexible UI ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Walkthrough navigation ✓ ✓
    [Show full text]
  • A Metadata Based Approach for Supporting Subsetting Queries Over Parallel HDF5 Datasets
    A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Thesis Presented in Partial Fulfillment of the Requirements for the Degree Master of Science in the Graduate School of The Ohio State University By Vignesh Santhanagopalan, B.S. Graduate Program in Computer Science and Engineering The Ohio State University 2011 Thesis Committee: Dr. Gagan Agrawal, Advisor Dr. Radu Teodorescu ABSTRACT A key challenge in scientific data management is to manage the data as the data sizes are growing at a very rapid speed. Scientific datasets are typically stored using low-level formats which store the data as binary. This makes specification of the data and processing very hard. Also, as the volume of data is huge, parallel configurations must be used to process the data to enable efficient access. We have developed a data virtualization approach for supporting subsetting queries on scientific datasets stored in native format. The data is stored in Hierarchical Data Format (HDF5) which is one of the popular formats for storing scientific data. Our system supports SQL queries using the Select, From and Where clauses. We support queries based on the dimensions of the dataset and also queries which are based on the dimensions and attributes (which provide extra information about the dataset) of the dataset. In order to support the different types of queries, we have the pre-processing and post-processing modules. We also parallelize the selection queries involving the dimensions and the attributes. Our system offers the following advantages. We provide SQL like abstraction for specifying subsets of interest which is a powerful mechanism.
    [Show full text]
  • (A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • ARC File Revision 3.0 Proposal
    ARC file Revision 3.0 Proposal Steen Christensen, Det Kongelige Bibliotek <ssc at kb dot dk> Michael Stack, Internet Archive <stack at archive dot org> Edited by Michael Stack Revision History Revision 1 09/09/2004 Initial conversion of wiki working doc. [http://crawler.archive.org/cgi-bin/wiki.pl?ArcRevisionProposal] to docbook. Added suggested edits suggested by Gordon Mohr (Others made are still up for consideration). This revision is what is being submitted to the IIPC Framework Group for review at their London, 09/20/2004 meeting. Table of Contents 1. Introduction ............................................................................................................................2 1.1. IIPC Archival Data Format Requirements .......................................................................... 2 1.2. Input ...........................................................................................................................2 1.3. Scope ..........................................................................................................................3 1.4. Acronyms, Abbreviations and Definitions .......................................................................... 3 2. ARC Record Addressing ........................................................................................................... 4 2.1. Reference ....................................................................................................................4 2.2. The ari Scheme ............................................................................................................
    [Show full text]
  • GNU CPIO GNU Cpio 2.5 June 2002
    GNU CPIO GNU cpio 2.5 June 2002 by Robert Carleton Copyright c 1995, 2001, 2002 Free Software Foundation, Inc. This is the first edition of the GNU cpio documentation, and is consistent with GNU cpio 2.5. Published by the Free Software Foundation 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA Permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and this permission notice are preserved on all copies. Permission is granted to copy and distribute modified versions of this manual under the con- ditions for verbatim copying, provided that the entire resulting derived work is distributed under the terms of a permission notice identical to this one. Permission is granted to copy and distribute translations of this manual into another lan- guage, under the above conditions for modified versions, except that this permission notice may be stated in a translation approved by the Free Software Foundation. Chapter 2: Tutorial 1 1 Introduction GNU cpio copies files into or out of a cpio or tar archive, The archive can be another file on the disk, a magnetic tape, or a pipe. GNU cpio supports the following archive formats: binary, old ASCII, new ASCII, crc, HPUX binary, HPUX old ASCII, old tar, and POSIX.1 tar. The tar format is provided for compatability with the tar program. By default, cpio creates binary format archives, for compatibility with older cpio programs. When extracting from archives, cpio automatically recognizes which kind of archive it is reading and can read archives created on machines with a different byte-order.
    [Show full text]
  • Lossless Compression of Internal Files in Parallel Reservoir Simulation
    Lossless Compression of Internal Files in Parallel Reservoir Simulation Suha Kayum Marcin Rogowski Florian Mannuss 9/26/2019 Outline • I/O Challenges in Reservoir Simulation • Evaluation of Compression Algorithms on Reservoir Simulation Data • Real-world application - Constraints - Algorithm - Results • Conclusions 2 Challenge Reservoir simulation 1 3 Reservoir Simulation • Largest field in the world are represented as 50 million – 1 billion grid block models • Each runs takes hours on 500-5000 cores • Calibrating the model requires 100s of runs and sophisticated methods • “History matched” model is only a beginning 4 Files in Reservoir Simulation • Internal Files • Input / Output Files - Interact with pre- & post-processing tools Date Restart/Checkpoint Files 5 Reservoir Simulation in Saudi Aramco • 100’000+ simulations annually • The largest simulation of 10 billion cells • Currently multiple machines in TOP500 • Petabytes of storage required 600x • Resources are Finite • File Compression is one solution 50x 6 Compression algorithm evaluation 2 7 Compression ratio Tested a number of algorithms on a GRID restart file for two models 4 - Model A – 77.3 million active grid blocks 3.5 - Model K – 8.7 million active grid blocks 3 - 15.6 GB and 7.2 GB respectively 2.5 2 Compression ratio is between 1.5 1 compression ratio compression - From 2.27 for snappy (Model A) 0.5 0 - Up to 3.5 for bzip2 -9 (Model K) Model A Model K lz4 snappy gzip -1 gzip -9 bzip2 -1 bzip2 -9 8 Compression speed • LZ4 and Snappy significantly outperformed other algorithms
    [Show full text]
  • The Ark Handbook
    The Ark Handbook Matt Johnston Henrique Pinto Ragnar Thomsen The Ark Handbook 2 Contents 1 Introduction 5 2 Using Ark 6 2.1 Opening Archives . .6 2.1.1 Archive Operations . .6 2.1.2 Archive Comments . .6 2.2 Working with Files . .7 2.2.1 Editing Files . .7 2.3 Extracting Files . .7 2.3.1 The Extract dialog . .8 2.4 Creating Archives and Adding Files . .8 2.4.1 Compression . .9 2.4.2 Password Protection . .9 2.4.3 Multi-volume Archive . 10 3 Using Ark in the Filemanager 11 4 Advanced Batch Mode 12 5 Credits and License 13 Abstract Ark is an archive manager by KDE. The Ark Handbook Chapter 1 Introduction Ark is a program for viewing, extracting, creating and modifying archives. Ark can handle vari- ous archive formats such as tar, gzip, bzip2, zip, rar, 7zip, xz, rpm, cab, deb, xar and AppImage (support for certain archive formats depends on the appropriate command-line programs being installed). In order to successfully use Ark, you need KDE Frameworks 5. The library libarchive version 3.1 or above is needed to handle most archive types, including tar, compressed tar, rpm, deb and cab archives. To handle other file formats, you need the appropriate command line programs, such as zipinfo, zip, unzip, rar, unrar, 7z, lsar, unar and lrzip. 5 The Ark Handbook Chapter 2 Using Ark 2.1 Opening Archives To open an archive in Ark, choose Open... (Ctrl+O) from the Archive menu. You can also open archive files by dragging and dropping from Dolphin.
    [Show full text]