The Recovery and Preservation of Critical Exploration Datasets for a Large Multinational Oil Company

Guy C. Holmes – BSc, MBA
Chief Executive Officer, SpectrumData
Suite 1, 14 Brodie Hall Drive, BENTLEY WA 6102
[email protected]

Introduction

In February of 2002, a large multinational oil company requested that a project be undertaken to consolidate, and in many cases reconstruct, a large dataset consisting of approximately 80,000 original magnetic tapes of various ages, formats, media types, and condition. The collection contained data acquired during 30 years of oil and gas exploration in over 50 different countries.

The project requirements were unique for a number of reasons. The most interesting and challenging of these was that this was the company's second attempt at the project, a first attempt by another party having failed. The failed attempt left portions of the data in jeopardy of being permanently lost, corrupted, or disassociated from their invaluable metadata.

The project involved reading the tapes, consolidating the data into logical data sets, converting the various data types to an industry standard format, and outputting the data to a new set of high density data cartridges in triplicate. The vast majority of the data in this collection was seismic survey data; seismic surveying is the principal exploration methodology used in oil and gas exploration.

The tape collection consisted of the following tape types:
− 9 track reel to reel tape
− 3480 cartridge
− 3490E cartridge
− 8mm Helical Scan Cartridge
− 4mm DDS DAT Cartridges
− Digital Linear Tape (DLT)
− A variety of smaller, less known media types including DC2120, DC6150, and 7 track magnetic tapes.

The Consequences of Removing or Modifying Blocking Structures From Magnetic Tape Files

As this project had already been attempted once by another party, the first essential element of the task was to isolate exactly what had been done prior to our involvement. An initial review of the data found that most of the low density tapes that still needed to be read were severely damaged and deteriorated. In most cases the tapes that had not been converted in the previous failed project represented small portions of a larger dataset that had been successfully copied to higher density media. As an example, a data set that had previously been recorded on 800 9 track tapes was now on 10 DLT IV cartridges, with the exception of 40 of the original 9 track tapes that had not been read due to deterioration or damage.

The higher density DLT IV cartridges created in the previous project were not one to one identical copies of the original 9 track tapes. Instead, each DLT IV cartridge contained the contents of many individual 9 track tapes, written to DLT IV in an altered, de-blocked format, with only a file mark between the end of one original 9 track tape and the start of the next.

To fully appreciate the complexity of this restoration and migration project, one needs a basic understanding of the underlying structure of data stored on magnetic tape. Magnetic tape is a linear recording medium. Locating a specific record on a linear magnetic tape requires reading or passing over every record recorded on the tape before it, so a tape drive may have to read through almost the entire spool of tape before it can read the record requested by the user. As an example, to get to the fifth record on a tape the drive must pass over the first four records before it can read the fifth. To write data to tape, the tape drive writes sequentially, one record after another along the length of the tape.
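
The sequential access described above can be sketched in a few lines of Python. This is only an illustrative model, not code from the project: the `LinearTape` class and its method names are hypothetical, and a real tape drive is controlled through operating system device calls rather than an in-memory list.

```python
from typing import List


class LinearTape:
    """Toy model of a forward-only tape: records can only be reached in order."""

    def __init__(self, records: List[bytes]) -> None:
        self._records = list(records)
        self.position = 0          # index of the next record under the head
        self.records_passed = 0    # total records the head has moved across

    def rewind(self) -> None:
        self.position = 0

    def read_next(self) -> bytes:
        if self.position >= len(self._records):
            raise EOFError("end of recorded data")
        record = self._records[self.position]
        self.position += 1
        self.records_passed += 1
        return record

    def read_record(self, number: int) -> bytes:
        """Return record `number` (1-based), passing over all earlier records."""
        if number < 1:
            raise ValueError("record numbers start at 1")
        if number <= self.position:
            self.rewind()          # the target is behind the head
        while self.position < number - 1:
            self.read_next()       # earlier records must be traversed first
        return self.read_next()


tape = LinearTape([b"rec1", b"rec2", b"rec3", b"rec4", b"rec5"])
fifth = tape.read_record(5)
print(fifth, tape.records_passed)  # b'rec5' 5
```

Reaching the fifth record moves the head across five records in total: the four that precede it and the record itself.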
Data cannot be written to linear tape in any random location without the risk of overwriting existing data. For all pre-existing data on a tape to remain intact, new data must be written after the end of the existing data sets.

Tape drives write data to tape in blocks. Each block consists of a number of bytes, and typically the software controlling the tape drive determines how many bytes per block it will write. These blocks are separated by inter-record gaps (effectively blank tape). A group of blocks written on a tape and followed by a marker called a file mark constitutes a logical file on the tape. Tape drives use these file marks and inter-record gaps to seek to particular locations on the tape for specific data. More than one logical file can be written to a tape, and each may contain many physical files. A logical file on tape contains at least one block of data but typically contains many hundreds or thousands of blocks. In most cases, software used to read data from tape requires that the data match a defined file and blocking structure for the data to be successfully read and interpreted.

To further appreciate the complexity of this project, it is important to understand how even the smallest modification to the blocking structure of a specified data format can directly affect the ability of software to interpret the data. As this project required the conversion of a vast amount of seismic data, I have chosen a highly specified seismic data format known as SEGB to demonstrate that a small change in blocking structure can have a very large impact on data integrity.

Field Seismic Recording

Exploration companies use the seismic method as their primary means of geo-scientific investigation when exploring for oil. A seismic survey essentially consists of a seismograph, an array of seismic receivers known as geophones, and a synthetic source of seismic energy.
This synthetic seismic energy, when released, travels through the different layers of the earth and is eventually reflected back to the surface. The time it takes for the energy to reach the surface and the wavelength of the returning seismic energy are measured by the geophones. For each burst of seismic energy a seismic shot record is created and written to tape as a single file. This seismic shot file is typically a multiplexed file and is generally written to tape as either one or two blocks of data per logical file.

As discussed earlier in this paper, many of the tapes received for this project were duplicates in which the data had been copied from many original 9 track tapes to a single new DLT IV cartridge. Because the capacity of a DLT IV cartridge is much greater than that of the original 9 track tapes, it was not uncommon to find that several hundred original 9 track tapes had been copied onto a single DLT IV cartridge.

One of the critical issues created by the transfer of these original 9 track tapes to DLT IV cartridges during the first failed attempt at this project is that the original file and blocking structure stored on the 9 track tapes was not transferred to the new DLT IV cartridges. Essentially, data from a single 9 track tape consisting of many files, where each file contained many blocks, was transferred into a single file on a new tape with a different block structure.

The removal of the original blocking and file structure from this data during the previous attempt created some interesting and challenging technical issues. Firstly, true preservation of the data required that the data first be returned to its original recording format, including all vital file and blocking structures. This would then allow each seismic shot to be identified, validated, and preserved before any conversion or migration processes were applied.
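
The loss of structure described above can be illustrated with a small model. The sketch below is not the software used in either project. It assumes that a tape can be represented as a list of logical files, each of which is a list of byte blocks, and the names `Tape`, `deblock_tape`, and `block_size` are hypothetical. The toy block sizes are chosen only to keep the output readable.

```python
from typing import List

Tape = List[List[bytes]]  # a tape is a list of logical files; a file is a list of blocks


def deblock_tape(tape: Tape, block_size: int) -> Tape:
    """Model a de-blocking copy: join every block of every file into one
    byte stream, then re-cut the stream into fixed-size blocks in one file."""
    stream = b"".join(block for file in tape for block in file)
    blocks = [stream[i:i + block_size] for i in range(0, len(stream), block_size)]
    return [blocks]


# Three toy shot files, each a 4-byte header block and a 10-byte data block.
original: Tape = [[b"HDR%d" % n, bytes([n]) * 10] for n in range(3)]
copied = deblock_tape(original, block_size=8)

print(len(original), [len(b) for b in original[0]])  # 3 [4, 10]
print(len(copied), [len(b) for b in copied[0]])      # 1 [8, 8, 8, 8, 8, 2]
```

Every byte survives the copy, but the file count and block lengths that told a reader where each shot and each header began are gone.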
For most SEGB seismic field data, the first block of a SEGB shot file is referred to as a “header” block, and the second block as the “data” block. Most software applications that read field seismic data require that the header block be correctly formatted and a specific number of bytes in length. The header block often contains vital information about the data block that follows it on the linear tape, and in most cases a data block in isolation (without a header block) cannot be interpreted by software. The length in bytes of a SEGB header or data block may vary from one shot file to another.

As the data was binary and had lost its original blocking structure during copying, the resulting file was a stream of bytes that no longer contained the vital blocking structures needed to delineate one shot from another, or one header from another. Instead of 100 seismic shot files, each 960,240 bytes long (consisting of a 240 byte header block followed by a 960,000 byte data block), a new file of 96,024,000 bytes (100 x 960,240 byte original files concatenated together) had been created on tape. This new file was also written to tape with a block length of 10,240 bytes. The original blocking structure of the data was now lost, and what was once only two blocks per file had now become a single logical file of over 9,000 blocks. See Figure 1.

Figure 1 – Blocking Structure Changes Through Migration Process

To conventional seismic software, this new data structure would have been completely uninterpretable, as there is a high degree of dependency between the interpretation of data from tape by software and the blocking structure of the data itself. SpectrumData was able to develop complex software routines that navigated the new blocking and file structure of the data and converted it back to its original format.
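
The arithmetic in the example above, and the general idea of restoring the header and data blocks, can be sketched as follows. This is a simplified illustration and not SpectrumData's software: it assumes every shot has the fixed 240 byte header and 960,000 byte data block used in the example, whereas real SEGB block lengths can vary between shots and would have to be derived from the header contents. The function name `reblock` and the constant names are hypothetical.

```python
from typing import Iterator, Tuple

HEADER_LEN = 240          # bytes per header block in the example
DATA_LEN = 960_000        # bytes per data block in the example
SHOTS = 100               # shot files concatenated into one stream
COPY_BLOCK_LEN = 10_240   # block length used on the de-blocked copy

shot_len = HEADER_LEN + DATA_LEN     # 960,240 bytes per original shot file
stream_len = SHOTS * shot_len        # 96,024,000 bytes in the new file
# Blocks needed on the copy, rounding up for the final partial block.
copy_blocks = (stream_len + COPY_BLOCK_LEN - 1) // COPY_BLOCK_LEN
last_block_len = stream_len - (copy_blocks - 1) * COPY_BLOCK_LEN

print(shot_len, stream_len, copy_blocks, last_block_len)
# 960240 96024000 9378 3520


def reblock(stream: bytes, header_len: int, data_len: int) -> Iterator[Tuple[bytes, bytes]]:
    """Yield (header_block, data_block) pairs from a de-blocked byte stream,
    assuming every shot uses the same fixed header and data lengths."""
    shot = header_len + data_len
    if len(stream) % shot != 0:
        raise ValueError("stream length is not a whole number of shots")
    for start in range(0, len(stream), shot):
        header = stream[start:start + header_len]
        data = stream[start + header_len:start + shot]
        yield header, data


# Small demonstration with 3-byte headers and 5-byte data blocks.
pairs = list(reblock(b"AAAbbbbbCCCddddd", header_len=3, data_len=5))
print(pairs)  # [(b'AAA', b'bbbbb'), (b'CCC', b'ddddd')]
```

Under these assumptions the de-blocked copy holds 9,378 blocks, the last of which is only 3,520 bytes long, which is consistent with the "over 9,000 blocks" figure quoted above.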