Browsing Through High Quality Document

Total Page:16

File Type:pdf, Size:1020Kb

Browsing Through High Quality Document Browsing through High Quality Do cument Images with DjVu Patrick Haner Leon Bottou Paul G Howard Patrice Simard Yoshua Bengio and Yann Le Cun ATT LabsResearch Schultz Drive Red Bank NJ fhanerleonbpghpatriceyoshuayanngresearchattcom Abstract Even if pictures and drawings are scanned and in tegrated into the web page much of the visual asp ect We present a new image compression technique of the original do cument is likely to be lost Visual cal led DjVu that is specical ly geared towards the details including font irregularities pap er color and compression of highresolution highquality images of pap er texture are particularly imp ortant for histori scanned documents in color With DjVu any scr een cal do cuments and may also b e crucial in do cuments connected to the Internet can access and display im with tables mathematical or chemical formulae and ages of scannedpages while faithful ly reproducing the handwritten text font color drawing pictures and paper texture A typical magazine page in color at dpi can becom A simple alternativewould b e to scan the original pressed down to between to KB approximately page and simply compress the image as a JPEG or to times better than JPEG for a similar level of GIF le Unfortunately those les tend to be quite subjective quality BW documents are typical ly large if one wants to preserv e the readability of the to KBytes at dpi or to times better than text Compressed with JPEG a color image of a ecient version of CCITTG Arealtime memory typical magazine page scanned at dpi dots per the decoder was implemented and is available as a inch would be around KBytes to KBytes plugin for popular web browsers and would be barely readable The same page at Keywords digital libraries image compression im dpi would b e of acceptable quality but would o c age segmentation arithmetic coding wavelet coding cupy around KBytes Even worse not only would JBIG the decompressed image ll up the entire memory of an average PC but only a small p ortion of it would Intro duction b e visible on the screen at once A justreadable black As electronic storage retrieval and distribution of and white page in GIF would be around to do cuments b ecomes faster and cheap er libraries are KBytes b ecoming increasingly digital Recent studies have To summarize the current situation it is clear that shown that it is already less costly to store do cuments the complete digitization of the worlds ma jor library digitally than to provide for buildings and shelves to collections is only a matter of time In this context it house them Unfortunately the diculty of con seems paradoxical that there exist no universal stan verting existing do cuments to electronic form is a ma dard for ecient storage retrieval and transmission jor obstacle to the development of digital libraries of highquality do cument images in color Existing do cuments are usually retyp ed and con verted to HTML or Adob es PDF format a tedious Tomake remote access to digital libraries a pleasant and exp ensive task sometimes facilitated by the use exp erience pages must app ear on the screen after only of Optical Character Recognition OCR While the a few seconds delay Assuming a kilobits p er second accuracy of OCR systems has b een steadily improv kbps connection this means that the most relevant ing over the last decade they are still far from b eing parts of the do cument the text must b e compressed able to translate faithfully a scanned do cumentinto a down to ab out to KBytes With a progressive computerreadable format without extensive manual compression technique the text would b e transmitted correction and displayed rst Then the pictures drawings and backgrounds would b e transmitted and displayed im background The p erformance of b oth of these meth proving the quality of the image as more bits arrive o ds heavily relies on a new adaptive binary arithmetic The overall size of the le should b e on the order of co ding technique called the Zco der also briey de to KBytes to keep the overall transmission time scrib ed Section intro duces the plugin that allows and storage requirements within reasonable b ounds to browse DjVu do cuments through standard appli cations such as Netscap e Navigator or Microsoft Ex Another p eculiarity of do cument images their large plorer with a DjVu plugin Comparative results on a size makes current image compression techniques in wide variety of do cumenttyp es are given in Section appropriate A magazinesize page at dots per inch is pixel high and pixel wide Uncom ImageBased Digital Libraries pressed it o ccupies MBytes of memory more than The imagebased approach to digital libraries is what the average PC can handle A practical do cu to store and to transmit do cuments as images To ment image viewer would need to keep the image in a achieve that we need to devise a metho d for com compressed form in the memory of the machine and pressing do cument images that makes it p ossible to only decompress ondemand the part of the image that transfer a highquality page over lowsp eed links mo is displayed on the screen dem or ISDN in a few seconds OCR would only b e The DjVu do cument image compression technique used for indexing with less stringent constraints on describ ed in this pap er is an answer to all the ab ove accuracy problems With DjVu scanned pages at dpi in full Several authors have prop osed imagebased ap color can b e compressed down to to KBytes les proaches to digital libraries The most notable ex from MBytes originals with excellent quality Black ample is the RightPages system The RightPages and white pages typically o ccupy to KBytes system was designed for do cument image transmission once compressed This puts the size of highquality over a lo cal area network It was used for some years scanned pages in the same order of magnitude as an bytheATT Bell Labs technical community and dis average HTML page KBytes according to the lat tributed commercially for customized applications in est statistics DjVu pages are displayed within the several countries The absence of a universal and op en browser window through a plugin The DjVu plugin platform for networking and bro wsing suchastodays allows easy panning and zo oming of very large images Internet limited the dissemination of the RightPages This is made p ossible by an onthey decompression system Similar prop osals have b een made more re metho d whichallows images that would normally re cently quire MBytes of RAM once decompressed to re All of the ab ove imagebased approaches and most quire only MBytes of RAM commercially available do cument image management The basic idea b ehind DjVu is to separate the text systems are restricted to black and white bilevel from the backgrounds and pictures and to use dier images This is adequate for technical and business ent techniques to compress each of those comp onents do cuments but insucient for other typ es of do cu Traditional metho ds are either designed to compress ments such as magazines catalogs or historical do c natural images with few edges JPEG or to compress uments Many formats exist for co ding bilevel do c black and white do cument images almost entirely com ument images notably the CCITT G and G fax p osed of sharp edges CCITT G G and JBIG standards the recent JBIG standard and the up The DjVutechnique improves on b oth and combines coming JBIG standard Using ATTs prop osals to the b est of b oth approaches the JBIG standard images of a t ypical black and white page at dpi dots p er inch can b e transmit Section reviews the currentavailable compression ted over a mo dem link in a few seconds Work on and displaying technologies and states the require bilevel image compression standards is motivated by ments for do cument image compression Section de the fact that until recently do cument images were scrib es the DjVu metho d of separately co ding the text primarily destined to b e printed on pap er Most low and drawings on one hand and the pictures and back cost printer technologies excel at printing bilevel im grounds on the other hand Section turns this idea ages but they must rely on dithering and halftoning into an actual image compression format It starts to print greylevel or color images thus reducing their with a description of the metho d used by DjVutoen eective resolution co de the text and the drawings This metho d is a variation of ATTs prop osal to the new JBIG fax The low cost and availability of highresolution standard It also includes a description of the IW color displays is causing more and more users to rely waveletbased compression metho d for pictures and on their screen rather than on their printer to dis play do cument images Even mo dern lowend PCs can display x pixel images with bits per pixelbitsperRGB comp onent while highend PCs and workstations can display x at bits p er pixel Most do cuments displayed in bilevel mo de are readable at dpi but are not pleasanttoread At dpi the quality is quite acceptable in bilevel mo de Displaying an entire x lettersize page at such high resolution requires a screen resolution of pix els vertically and pixels horizontallywhichisbe yond traditional display technology Fortunately us ing color or graylevels when displaying do cument im ages at lower resolutions drastically improves readabil ity and sub jective quality Most do cuments are read able when displayed at dpi on a color or greyscale display Only do cuments with particularly small fonts require dpi for eortless readability At dpi a typical
Recommended publications
  • PDF/A for Scanned Documents
    Webinar www.pdfa.org PDF/A for Scanned Documents Paper Becomes Digital Mark McKinney, LuraTech, Inc., President Armin Ortmann, LuraTech, CTO Mark McKinney President, LuraTech, Inc. © 2009 PDF/A Competence Center, www.pdfa.org Existing Solutions for Scanned Documents www.pdfa.org Black & White: TIFF G4 Color: Mostly JPEG, but sometimes PNG, BMP and other raster graphics formats Often special version formats like “JPEG in TIFF” Disadvantages: Several formats already for scanned documents Even more formats for born digital documents Loss of information, e.g. with TIFF G4 Bad image quality and huge file size, e.g. with JPEG No standardized metadata spread over all formats Not full text searchable (OCR) inside of files Black/White: Color: - TIFF FAX G4 - TIFF - TIFF LZW Mark McKinney - JPEG President, LuraTech, Inc. - PDF 2 Existing Solutions for Scanned Documents www.pdfa.org Bad image quality vs. file size TIFF/BMP JPEG TIFF G4 23.8 MB 180 kB 60 kB Mark McKinney President, LuraTech, Inc. 3 Alternative Solution: PDF www.pdfa.org PDF is already widely used to: Unify file formats Image à PDF “Office” Documents à PDF Other sources à PDF Create full-text searchable files Apply modern compression technology (e.g. the JPEG2000 file formats family) Harmonize metadata Conclusion: PDF avoids the disadvantages of the legacy formats “So if you are already using PDF as archival Mark McKinney format, why not use PDF/A with its many President, LuraTech, Inc. advantages?” 4 PDF/A www.pdfa.org What is PDF/A? • ISO 19005-1, Document Management • Electronic document file format for long-term preservation Goals of PDF/A: • Maintain static visual representation of documents • Consistent handing of Metadata • Option to maintain structure and semantic meaning of content • Transparency to guarantee access • Limit the number of restrictions Mark McKinney President, LuraTech, Inc.
    [Show full text]
  • Chapter 9 Image Compression Standards
    Fundamentals of Multimedia, Chapter 9 Chapter 9 Image Compression Standards 9.1 The JPEG Standard 9.2 The JPEG2000 Standard 9.3 The JPEG-LS Standard 9.4 Bi-level Image Compression Standards 9.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 ! Fundamentals of Multimedia, Chapter 9 9.1 The JPEG Standard JPEG is an image compression standard that was developed • by the “Joint Photographic Experts Group”. JPEG was for- mally accepted as an international standard in 1992. JPEG is a lossy image compression method. It employs a • transform coding method using the DCT (Discrete Cosine Transform). An image is a function of i and j (or conventionally x and y) • in the spatial domain. The 2D DCT is used as one step in JPEG in order to yield a frequency response which is a function F (u, v) in the spatial frequency domain, indexed by two integers u and v. 2 Li & Drew c Prentice Hall 2003 ! Fundamentals of Multimedia, Chapter 9 Observations for JPEG Image Compression The effectiveness of the DCT transform coding method in • JPEG relies on 3 major observations: Observation 1: Useful image contents change relatively slowly across the image, i.e., it is unusual for intensity values to vary widely several times in a small area, for example, within an 8 8 × image block. much of the information in an image is repeated, hence “spa- • tial redundancy”. 3 Li & Drew c Prentice Hall 2003 ! Fundamentals of Multimedia, Chapter 9 Observations for JPEG Image Compression (cont’d) Observation 2: Psychophysical experiments suggest that hu- mans are much less likely to notice the loss of very high spatial frequency components than the loss of lower frequency compo- nents.
    [Show full text]
  • Electronics Engineering
    INTERNATIONAL JOURNAL OF ELECTRONICS ENGINEERING ISSN : 0973-7383 Volume 11 • Number 1 • 2019 Study of Different Image File formats for Raster images Prof. S. S. Thakare1, Prof. Dr. S. N. Kale2 1Assistant professor, GCOEA, Amravati, India, [email protected] 2Assistant professor, SGBAU,Amaravti,India, [email protected] Abstract: In the current digital world, the usage of images are very high. The development of multimedia and digital imaging requires very large disk space for storage and very long bandwidth of network for transmission. As these two are relatively expensive, Image compression is required to represent a digital image yielding compact representation of image without affecting its essential information with reducing transmission time. This paper attempts compression in some of the image representation formats and the experimental results for some image file format are also shown. Keywords: ImageFileFormats, JPEG, PNG, TIFF, BITMAP, GIF,CompressionTechniques,Compressed image processing. 1. INTRODUCTION Digital images generally occupy a large amount of storage space and therefore take longer time to transmit and download (Sayood 2012;Salomonetal 2010;Miano 1999). To reduce this time image compression is necessary. Image compression is a technique used to identify internal data redundancy and then develop a compact representation that takes up less storage space than the original image size and the reverse process is called decompression (Javed 2016; Kia 1997). There are two types of image compression (Gonzalez and Woods 2009). 1. Lossy image compression 2. Lossless image compression In case of lossy compression techniques, it removes some part of data, so it is used when a perfect consistency with the original data is not necessary after decompression.
    [Show full text]
  • JPEG and JPEG 2000
    JPEG and JPEG 2000 Past, present, and future Richard Clark Elysium Ltd, Crowborough, UK [email protected] Planned presentation Brief introduction JPEG – 25 years of standards… Shortfalls and issues Why JPEG 2000? JPEG 2000 – imaging architecture JPEG 2000 – what it is (should be!) Current activities New and continuing work… +44 1892 667411 - [email protected] Introductions Richard Clark – Working in technical standardisation since early 70’s – Fax, email, character coding (8859-1 is basis of HTML), image coding, multimedia – Elysium, set up in ’91 as SME innovator on the Web – Currently looks after JPEG web site, historical archive, some PR, some standards as editor (extensions to JPEG, JPEG-LS, MIME type RFC and software reference for JPEG 2000), HD Photo in JPEG, and the UK MPEG and JPEG committees – Plus some work that is actually funded……. +44 1892 667411 - [email protected] Elysium in Europe ACTS project – SPEAR – advanced JPEG tools ESPRIT project – Eurostill – consensus building on JPEG 2000 IST – Migrator 2000 – tool migration and feature exploitation of JPEG 2000 – 2KAN – JPEG 2000 advanced networking Plus some other involvement through CEN in cultural heritage and medical imaging, Interreg and others +44 1892 667411 - [email protected] 25 years of standards JPEG – Joint Photographic Experts Group, joint venture between ISO and CCITT (now ITU-T) Evolved from photo-videotex, character coding First meeting March 83 – JPEG proper started in July 86. 42nd meeting in Lausanne, next week… Attendance through national
    [Show full text]
  • Analysis and Comparison of Compression Algorithm for Light Field Mask
    International Journal of Applied Engineering Research ISSN 0973-4562 Volume 12, Number 12 (2017) pp. 3553-3556 © Research India Publications. http://www.ripublication.com Analysis and Comparison of Compression Algorithm for Light Field Mask Hyunji Cho1 and Hoon Yoo2* 1Department of Computer Science, SangMyung University, Korea. 2Associate Professor, Department of Media Software SangMyung University, Korea. *Corresponding author Abstract This paper describes comparison and analysis of state-of-the- art lossless image compression algorithms for light field mask data that are very useful in transmitting and refocusing the light field images. Recently, light field cameras have received wide attention in that they provide 3D information. Also, there has been a wide interest in studying the light field data compression due to a huge light field data. However, most of existing light field compression methods ignore the mask information which is one of important features of light field images. In this paper, we reports compression algorithms and further use this to achieve binary image compression by realizing analysis and comparison of the standard compression methods such as JBIG, JBIG 2 and PNG algorithm. The results seem to confirm that the PNG method for text data compression provides better results than the state-of-the-art methods of JBIG and JBIG2 for binary image compression. Keywords: Lossless compression, Image compression, Light Figure. 2. Basic architecture from raw images to RGB and filed compression, Plenoptic coding mask images INTRODUCTION The LF camera provides a raw image captured from photosensor with microlens, as depicted in Fig. 1. The raw Light field (LF) cameras, also referred to as plenoptic cameras, data consists of 10 bits per pixel precision in little-endian differ from regular cameras by providing 3D information of format.
    [Show full text]
  • PDF Image JBIG2 Compression and Decompression with JBIG2 Encoding and Decoding SDK Library | 1
    PDF image JBIG2 compression and decompression with JBIG2 encoding and decoding SDK library | 1 JBIG2 is an image compression standard for bi-level images developed by the Joint bi-level Image Expert Group. It is suitable for lossless compression and lossy compression. According to the group’s press release, in its lossless mode, JBIG2 usually generates files that are one- third to one-fifth the size of the fax group 4 and twice the size of JBIG, which was previously released by the group. The double-layer compression standard. JBIG2 was released as an international standard ITU in 2000. JBIG2 compression JBIG2 is an international standard for bi-level image compression. By segmenting the image into overlapping and/or non-overlapping areas of text, halftones and general content, compression techniques optimized for each content type are used: *Text area: The text area is composed of characters that are well suited for symbol-based encoding methods. Usually, each symbol will correspond to a character bitmap, and a sub-image represents a character or text. For each uppercase and lowercase character used on the front face, there is usually only one character bitmap (or sub-image) in the symbol dictionary. For example, the dictionary will have an “a” bitmap, an “A” bitmap, a “b” bitmap, and so on. VeryUtils.com PDF image JBIG2 compression and decompression with JBIG2 encoding and decoding SDK library | 1 PDF image JBIG2 compression and decompression with JBIG2 encoding and decoding SDK library | 2 *Halftone area: Halftone areas are similar to text areas because they consist of patterns arranged in a regular grid.
    [Show full text]
  • Lossless Image Compression
    Lossless Image Compression C.M. Liu Perceptual Signal Processing Lab College of Computer Science National Chiao-Tung University http://www.csie.nctu.edu.tw/~cmliu/Courses/Compression/ Office: EC538 (03)5731877 [email protected] Lossless JPEG (1992) 2 ITU Recommendation T.81 (09/92) Compression based on 8 predictive modes ( “selection values): 0 P(x) = x (no prediction) 1 P(x) = W 2 P(x) = N 3 P(x) = NW NW N 4 P(x) = W + N - NW 5 P(x) = W + ⎣(N-NW)/2⎦ W x 6 P(x) = N + ⎣(W-NW)/2⎦ 7 P(x) = ⎣(W + N)/2⎦ Sequence is then entropy-coded (Huffman/AC) Lossless JPEG (2) 3 Value 0 used for differential coding only in hierarchical mode Values 1, 2, 3 One-dimensional predictors Values 4, 5, 6, 7 Two-dimensional Value 1 (W) Used in the first line of samples At the beginning of each restart Selected predictor used for the other lines Value 2 (N) Used at the start of each line, except first P-1 Default predictor value: 2 At the start of first line Beginning of each restart Lossless JPEG Performance 4 JPEG prediction mode comparisons JPEG vs. GIF vs. PNG Context-Adaptive Lossless Image Compression [Wu 95/96] 5 Two modes: gray-scale & bi-level We are skipping the lossy scheme for now Basic ideas find the best context from the info available to encoder/decoder estimate the presence/lack of horizontal/vertical features CALIC: Initial Prediction 6 if dh−dv > 80 // sharp horizontal edge X* = N else if dv−dh > 80 // sharp vertical edge X* = W else { // assume smoothness first X* = (N+W)/2 +(NE−NW)/4 if dh−dv > 32 // horizontal edge X* = (X*+N)/2
    [Show full text]
  • Widetek® Wide Format MFP Solutions 17
    ® The WideTEK 36CL-MF scanner forms the basis of all of the MF solutions and is the fastest, most productive wide format MFP so- lution on the market, running at 10 inches per second at 200 dpi in full color, at the lowest possible price. At the full width of 36“ and 600 dpi resolution, the scanner still runs at 1.7 inches per second. ® WideTEK 36CL-MF is the best choice to build a powerful and high quality MFP with either an integrated printer series specifi c stand, a high stand or a standalone stand. All MFP solutions come with a built in closed loop color calibration function producing the best possible copies on your selected printer and paper combination. Scanner Optics CIS camera, scans in color, grayscale, B&W Resolution 1200 x 1200 dpi scanner resolution 9600 x 9600 dpi interpolated (optional) Document Size 965 mm (38 inch) width, nearly unlimited length Scans 915 mm (36 inch) widths Scanning Speed 200 dpi - 15 m/min (10 inch/s) in color Face up scanning with on the fl y rotation and modifi cation 600 dpi - 8 m/min (5 inch/s) in grayscale of images without rescanning. Automated crop & deskew. Color Depth 48 bit color, 16 bit grayscale Built in closed loop color calibration function for best possible copies Scan output 24 bit color, 8 bit grayscale, bitonal, enhanced half- tone LED lamps, no warm up, IR/UV free Output Formats Multipage PDF (PDF/A) and TIFF, JPEG, JPEG 2000, Large 21“ full HD color touchscreen for simple operation in PNM, PNG, BMP, TIFF (Raw, G3, G4, LZW, JPEG), optional bundles AutoCAD DWF, JBIG, DjVu, DICOM, PCX, Postscript, EPS, Raw data Scan2USB, FTP, SMB and Network.
    [Show full text]
  • Optimization of Image Compression for Scanned Document's Storage
    ISSN : 0976-8491 (Online) | ISSN : 2229-4333 (Print) IJCST VOL . 4, Iss UE 1, JAN - MAR C H 2013 Optimization of Image Compression for Scanned Document’s Storage and Transfer Madhu Ronda S Soft Engineer, K L University, Green fields, Vaddeswaram, Guntur, AP, India Abstract coding, and lossy compression for bitonal (monochrome) images. Today highly efficient file storage rests on quality controlled This allows for high-quality, readable images to be stored in a scanner image data technology. Equivalently necessary and faster minimum of space, so that they can be made available on the web. file transfers depend on greater lossy but higher compression DjVu pages are highly compressed bitmaps (raster images)[20]. technology. A turn around in time is necessitated by reconsidering DjVu has been promoted as an alternative to PDF, promising parameters of data storage and transfer regarding both quality smaller files than PDF for most scanned documents. The DjVu and quantity of corresponding image data used. This situation is developers report that color magazine pages compress to 40–70 actuated by improvised and patented advanced image compression kB, black and white technical papers compress to 15–40 kB, and algorithms that have potential to regulate commercial market. ancient manuscripts compress to around 100 kB; a satisfactory At the same time free and rapid distribution of widely available JPEG image typically requires 500 kB. Like PDF, DjVu can image data on internet made much of operating software reach contain an OCR text layer, making it easy to perform copy and open source platform. In summary, application compression of paste and text search operations.
    [Show full text]
  • Kompresja Statycznego Obrazu 138
    Damian Karwowski Zrozumieć Kompresję Obrazu Podstawy Technik Kodowania Stratnego oraz Bezstratnego Obrazów Wydanie Pierwsze, wersja 1.2 Poznań 2019r. ISBN 978-83-953420-0-4 9 788395 342004 © Damian Karwowski – „Zrozumieć Kompresję Obrazu” © Copyright by DAMIAN KARWOWSKI. All rights reserved. Książka jest chroniona prawem autorskim i prawami pokrewnymi. Egzemplarz książki został zdeponowany w Kancelarii Notarialnej. Książka jest dostępna pod adresem: www.zrozumieckompresje.pl ISBN 978-83-953420-0-4 Projekt okładki książki oraz strona internetowa zostały wykonane przez Marka Piskulskiego. Serdecznie dziękuję za profesjonalną pracę Obraz „Miś Panda”, który jest częścią okładki, namalowała na płótnie Natalka Karwowska. Natalko, dziękuję Ci za Twój wysiłek Autor książki dołożył ogromnych starań, żeby zamieszczone w niej informacje były prawdziwe i rzetelnie przedstawione. Korzystając z książki Czytelnik robi to jednak wyłącznie na własną odpowiedzialność. Tym samym autor nie odpowiada za jakiekolwiek szkody, będące następstwem wykorzystania zawartej w książce wiedzy. Autor książki Dr inż. Damian Karwowski. Absolwent Politechniki Poznańskiej. Uzyskał tytuł zawodowy magistra inżyniera oraz stopień doktora nauk technicznych na Politechnice Poznańskiej, odpowiednio w latach 2003 i 2008. Obecnie pracownik naukowo-dydaktyczny na wyżej wymienionej uczelni. Od roku 2003 zawodowo zajmuje się kompresją obrazu. Autor ponad 50 publikacji naukowych o tematyce kompresji i przetwarzania obrazów. Brał udział w licznych projektach naukowych dotyczących wydajnej reprezentacji multimedialnych danych. Dodatkowo członek wielu zespołów badawczych, które w tematyce kompresji obrazu i dźwięku realizowały prace badawczo-wdrożeniowe dla przemysłu. Jego zainteresowania obejmują kompresję danych, techniki kodowania entropijnego danych oraz realizację kodeków obrazu i dźwięku na procesorach x86 oraz DSP. 4 | S t r o n a © Damian Karwowski – „Zrozumieć Kompresję Obrazu” Spis treści Spis treści 5 “Słowo” wstępu 11 Rozdział 1 15 Obraz i jego reprezentacja przestrzenna 15 1.1.
    [Show full text]
  • R-Photo User's Manual
    User's Manual © R-Tools Technology Inc 2020. All rights reserved. www.r-tt.com © R-tools Technology Inc 2020. All rights reserved. No part of this User's Manual may be copied, altered, or transferred to, any other media without written, explicit consent from R-tools Technology Inc.. All brand or product names appearing herein are trademarks or registered trademarks of their respective holders. R-tools Technology Inc. has developed this User's Manual to the best of its knowledge, but does not guarantee that the program will fulfill all the desires of the user. No warranty is made in regard to specifications or features. R-tools Technology Inc. retains the right to make alterations to the content of this Manual without the obligation to inform third parties. Contents I Table of Contents I Start 1 II Quick Start Guide in 3 Steps 1 1 Step 1. Di.s..k.. .S..e..l.e..c..t.i.o..n.. .............................................................................................................. 1 2 Step 2. Fi.l.e..s.. .M..a..r..k.i.n..g.. ................................................................................................................ 4 3 Step 3. Re..c..o..v..e..r.y.. ...................................................................................................................... 6 III Features 9 1 File Sorti.n..g.. .............................................................................................................................. 9 2 File Sea.r.c..h.. ............................................................................................................................
    [Show full text]
  • Forcepoint DLP Supported File Formats and Size Limits
    Forcepoint DLP Supported File Formats and Size Limits Supported File Formats and Size Limits | Forcepoint DLP | v8.8.1 This article provides a list of the file formats that can be analyzed by Forcepoint DLP, file formats from which content and meta data can be extracted, and the file size limits for network, endpoint, and discovery functions. See: ● Supported File Formats ● File Size Limits © 2021 Forcepoint LLC Supported File Formats Supported File Formats and Size Limits | Forcepoint DLP | v8.8.1 The following tables lists the file formats supported by Forcepoint DLP. File formats are in alphabetical order by format group. ● Archive For mats, page 3 ● Backup Formats, page 7 ● Business Intelligence (BI) and Analysis Formats, page 8 ● Computer-Aided Design Formats, page 9 ● Cryptography Formats, page 12 ● Database Formats, page 14 ● Desktop publishing formats, page 16 ● eBook/Audio book formats, page 17 ● Executable formats, page 18 ● Font formats, page 20 ● Graphics formats - general, page 21 ● Graphics formats - vector graphics, page 26 ● Library formats, page 29 ● Log formats, page 30 ● Mail formats, page 31 ● Multimedia formats, page 32 ● Object formats, page 37 ● Presentation formats, page 38 ● Project management formats, page 40 ● Spreadsheet formats, page 41 ● Text and markup formats, page 43 ● Word processing formats, page 45 ● Miscellaneous formats, page 53 Supported file formats are added and updated frequently. Key to support tables Symbol Description Y The format is supported N The format is not supported P Partial metadata
    [Show full text]