DLI Implementation and Reference Guide


This guide is part of the Adobe® PDF Library v6.1.1Plus suite; 02/15/05.

Copyright 1999-2005 Datalogics Incorporated. All Rights Reserved. Use of Datalogics software is subject to the applicable license agreement.

DL Interface is a trademark of Datalogics Incorporated. Other products mentioned herein as Datalogics products are also trademarks or registered trademarks of Datalogics, Incorporated. Adobe, Adobe PDF Library, Portable Document Format (PDF), PostScript, Acrobat, Distiller, Exchange and Reader are trademarks of Adobe Systems Incorporated. HP and HP-UX are registered trademarks of Hewlett Packard Corporation. IBM, AIX, AS/400, OS/400, MVS, and OS/390 are registered trademarks of International Business Machines. Java, J2EE, J2SE, J2ME, all Java-based marks, Sun and Solaris are trademarks or registered trademarks of Sun Microsystems, Inc. in the United States and other countries. Linux is a registered trademark of Linus Torvalds. Microsoft, Windows and Windows NT are trademarks or registered trademarks of Microsoft Corporation. SAS/C is a registered trademark of SAS Institute Inc. UNIX is a registered trademark of The Open Group. VeriSign® is a registered trademark of VeriSign, Inc. in the United States and/or other countries. All other trademarks and registered trademarks are the property of their respective owners.
For additional information, contact:

Datalogics, Incorporated
101 North Wacker Drive, Suite 1800
Chicago, Illinois 60606-7301
Phone: 312-853-8200
Fax: 312-853-8282
www.datalogics.com
[email protected]

Table of Contents

1 Getting Started
  An Introduction to DLI
  How to Create a PDF Document with DLI
  How to Use this Book
  What’s New in This Release
  Enhancements in Prior Releases
2 Initializing and Terminating the Library
  Overview
  Adobe PDF Library Data Structure
  Adobe PDF Library Version Control
  Files In Memory Activation
  Initializing and Terminating via DLI
  Writing PDF Output to Memory
  API Comparison
3 Beginning and Ending a Document
  Overview
4 Fonts
  PDF Font Overview
  Structure of a DLI Font
  Font Creation Calls
  Predefined Font Encodings
  Unicode Text Support
  Code Page Support
  Performance Considerations
  Accessing Fonts
5 Multibyte Text Work
  Introduction
  Loading and Creating Fonts
  Creating DLPDFTEXT Areas
  Working With DLPDFTEXT Areas
  Performance Considerations
6 Working with Pages
  Introduction to Page Interface
  Page Interface Calls
7 Containers within Pages
  What are Containers?
8 Working with Content
  Overview of Content Interface
  Content Interface Calls
9 Working with Forms
  Overview of Forms
  Form Calls
10 Displaying Line Drawings
  Overview
  Approaches to Line Drawing
  Graphic State and Line Drawing
11 Image Display
  Overview
  Graphic Image Structures
  Graphic Image Forms
  Image Creation Methods
  Creating Transparent Graphics
12 Color and its Use
  Library Color Descriptions
  Colors in Images
  Creating and Destroying Color Spaces
  Values for Color Channels
  Basic Color Spaces
  Advanced Color Spaces
  Building Patterned Color Spaces
  Conversion between Models
13 Annotations and Links
  Overview
  Annotation Components
  Modifying the Link Cos Object
14 Bookmark Creation
  Overview
15 Digital Signatures
  Overview
  Public and Private Keys
  Digital Signature Calls
16 Error Testing and Recovery
  Overview
  OS/390 Platform Concerns
17 Samples and Links
  Running DLI Sample Applications
A DLI Reference Guide

1 Getting Started

This chapter introduces the Datalogics Interface. Experienced users may want to skip directly to the section “What’s New in This Release” for information on the latest enhancements and additions.

An Introduction to DLI

The Datalogics Interface (DLI) facilitates the rapid creation of PDF documents and improves performance, throughput and graphics handling. It does this by bypassing many of the functions of the PDF Edit layer and eliminating redundant calls to the COS layer used in the Adobe® PDF Library. For information on the various layers of Adobe PDF Library please see the Acrobat Core API Overview.
How to Create a PDF Document with DLI

The DLI package sits at the output end of the page creation process. As such, most of the intricacies of line breaking, page breaking and general composition are outside its scope. DLI fits into an application at the point where pages are written. An overview of this process follows:

1 Initialize DLI. (This is the start of all processing.)
2 Define the document.
3 Define fonts: identify and describe to DLI each font to be used in the job. This can be done as each font is encountered, if desired.
4 Define forms: identify and describe to DLI each form to be used in the job. This can be done as each form is encountered, if desired.
5 Define graphics: identify and define to DLI each graphic that will be used multiple times. This can be done as each graphic is encountered, if desired.
6 For each page:
  • Create a dlpdfpage object.
  • Create a dlpdfcontent object.
7 Generate the content of a single page:
  • For each graphic reference on the page, create the graphic in content.
  • For each text line on the page, create the text in content.
8 End the content and inform DLI that the content is complete.
9 End the page and inform DLI that the page is complete.
10 End the document.
11 End the job.

Errors during application execution are reported and handled by raising and handling exceptions, as defined by the Adobe PDF Library. These are similar to operation exceptions in C++. For details on handling exceptions, please see the chapter “Error Testing and Recovery”.

What You Should Know

This document is intended for programmers who are familiar with text composition and the creation of output drivers, and for application designers who are constructing an application based on the DLI package.

You should have access to the Adobe PDF Library Applications Programming Interface (API) manual and the Adobe PDF Specifications manual for your system.
For Adobe PDF Library v5.x releases, Adobe PDF Specification 1.4 is appropriate. For Adobe PDF Library v6.x releases, Adobe PDF Specification 1.5 is appropriate.

NOTE: Some structures permitted in Adobe PDF Specification 1.5 may not be permitted in Adobe PDF Specification 1.4, and some structures defined in Adobe PDF Specification 1.4 are not available in Adobe PDF Specification 1.3.

The explanations, assumptions and samples provided in this guide refer to Adobe PDF Library v6.1.0Plus and DLI v3.0 or higher.

DLI Initialization Required

Starting with DLI v2.1, the initialization process has been simplified: you must initialize DLI, which in turn automatically initializes the Adobe PDF Library. Though it may be functionally possible to bypass the initialization of DLI for versions 2.1 and higher, an application should not do so. Using the DLI initialization and termination routines not only simplifies application programming but also allows the use of the Datalogics Files In Memory (FIM) System and enables certain optimizations in jobs that span multiple documents.

NOTE: The Adobe PDF Library and DLI are not intended to be initialized more than once within a single instance of an application. Doing so can produce undesirable results. Versions of Adobe PDF Library prior to v6.1 are not thread-safe.

gcc Compilation Version

Adobe compiles its PDF Library components with gcc 2.95.2 on the Solaris®, AIX® and Linux® operating systems. Datalogics does not recompile Adobe components with any other compiler on these operating systems, and distributes the Adobe PDF Library and the DLI components only as compiled with gcc on these operating systems. For the latest information on supported operating-system compilers and versions, please see the readme.txt file of last-minute updates accompanying your software release files.

Unix Compiler Run-Time Libraries

For clients who are using native (i.e. platform-resident) UNIX® compilers, the gcc run-time libraries for each of these operating systems are available from the Free Software Foundation. Any clients wishing to integrate the Adobe PDF Library and DLI components with their natively-compiled applications can retrieve these run-time libraries free of charge from the Free Software Foundation at http://www.gnu.org/software/gcc/.

How to Use this Book

This book has been created to guide you through the process of creating PDF documents with DLI. It consists of two main sections:

• An Implementation section
• A Reference Guide appendix

The Implementation section begins with Chapter 1: Getting Started. It follows the steps needed to create a PDF document, introduces DLI in relation to the Adobe PDF Library, explains the methods used in DLI and how they fit together, and provides various samples.
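
The eleven-step process listed under “How to Create a PDF Document with DLI” can be easier to follow when the nesting is written out. The sketch below is a plain Python model of the call order only. It does not call DLI or the Adobe PDF Library, and the function name write_job, the page dictionaries and the step labels are all hypothetical, chosen for illustration. The actual DLI calls are described in the later chapters of the guide and are not reproduced here.

```python
# Order-of-operations model of a DLI-style job.
# ASSUMPTION: nothing here is a DLI call; every name is hypothetical.

def write_job(pages):
    """Return the ordered list of steps for one single-document job.

    Each page is a dict with optional 'graphics' and 'text_lines' lists.
    """
    steps = []
    steps.append("initialize DLI")        # step 1: start of all processing
    steps.append("define document")       # step 2
    steps.append("define fonts")          # step 3 (could also be done as encountered)
    steps.append("define forms")          # step 4 (could also be done as encountered)
    steps.append("define graphics")       # step 5 (graphics used multiple times)
    for number, page in enumerate(pages, start=1):
        # Step 6: one page object and one content object per page.
        steps.append(f"page {number}: create page object")
        steps.append(f"page {number}: create content object")
        # Step 7: graphics, then text lines, are created in the content.
        for graphic in page.get("graphics", []):
            steps.append(f"page {number}: place graphic {graphic}")
        for line in page.get("text_lines", []):
            steps.append(f"page {number}: place text {line!r}")
        steps.append(f"page {number}: end content")   # step 8
        steps.append(f"page {number}: end page")      # step 9
    steps.append("end document")          # step 10
    steps.append("end job")               # step 11
    return steps


if __name__ == "__main__":
    for step in write_job([{"graphics": ["logo"], "text_lines": ["Hello"]}]):
        print(step)
```

The point of the model is the bracketing: content is ended before its page, every page is ended before the document, and initialization and job termination each happen exactly once, which matches the note that DLI should not be initialized more than once per application instance.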