Server-Side PDF There’S More to PDF Than Just Printing Documents
Total Page:16
File Type:pdf, Size:1020Kb
Server-Side PDF there’s more to PDF than just printing documents Leonard Rosenthol Senior Software Engineer Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Overview You are here because... · You’re currently working with PDF, but on the · PDF Creation client side · PDF Manipulation · You’re interested in doing things server side with · PDF Data Extraction PDF files · PDF Forms · You’re already doing some server side PDF · PDF Rendering & Printing solutions but looking for others · Libraries · You were already awake and had to find something to kill time · You’re a friend of mine, and wanted to heckle Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. What is “Server-side How I do things PDF”? · You already have a draft version of this document in your proceedings, so you shouldn’t need to take · too many notes Any program that works with PDF documents and runs in an unattended, most likely batch, mode. · A copy of the final document is on Digital · Application’s website (<http://www.digapp.com>) Does NOT require use of any portion of Adobe’s for your downloading pleasure Acrobat software · since that would be a violation of the Acrobat EULA · Although I’ve left time at the end for Q & A, I’m more than happy to take questions at any time. Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. PDF Creation Postscript · Postscript · HTML/XML · Ghostscript · Images · PStill · “Office” documents · Distiller Server · TeX Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Ghostscript PStill Aladdin Enterprises Frank Siegert · Supports PDF 1.3 · Supports PDF 1.3 · Supports standard PDFMarks · Supports standard as well as custom PDFMarks · Available for Mac OS, Windows, Unix, etc. · Available for Windows, Unix, etc. · Open Source · Free for non-commercial usage · Ships with most (all?) Linux distributions Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Distiller Server HTML/XML Adobe Systems · html2ps · HTMLDoc · It’s the “standard” · activePDF WebGrabber · Not currently available · SVG2PDF · but announced for Windows, Linux & Solaris · FOP · XEP Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. html2ps HTMLDoc Jan Kärrman Easy Software · Written in Perl · Available for Mac OS, Windows, Unix, etc. · Open Source · command line & GUI versions · must compile/build yourself · Open Source · Supports HTML4, including CSS · Only supports HTML 3.2 (+ some extensions) · allows for global CSS overrides · Lots of special formatting options · Requires Ghostscript Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. activePDF WebGrabber SVG2PDF Active4 Digital Applications, Inc. · Windows (COM) Only · Available for Mac OS and Windows · Only supports HTML 3.2 · Technology Preview Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. FOP XEP apache.org RenderX.com · XML/XSL Formatting Objects (FO’s) -> PDF · XML/XSL Formatting Objects (FO’s) -> PDF · Written in Java · Written in Java · Open Source · Source is available · must compile/build yourself · must compile/build yourself · Under development, limited functionality · Unsupported - “evaluation only” Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Images TIFF2PDF FastIO Systems · TIFF2PDF · Trans-TIFF · Convert single & multi-page TIFF’s to PDF · ImageMagick · Windows & Unix · Image Alchemy · Source is available · ImagePDF · PDF*It Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Trans-TIFF ImageMagick ePage Solutions · Convert ANY image (or set of images) to PDF · Windows only · Mac OS, Windows, Unix, etc. · Does NOT support multi-page TIFF’s · Open Source · Requires Ghostscript Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Image Alchemy ImagePDF Handmade Software Apex · Convert TIFF, BMP, JPEG & PNG to PDF · Convert ANY image (or set of images) to PDF · Windows & Linux · Windows, Unix, etc. · Available with COM, Java, C or command line API’s Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. PDF*It “Office” documents DocuLex · Win2PDF · PDFDriver · Scan documents direct to PDF · PDFConverter · Includes OCR capabilities · activePDF Doc Converter · Windows only · PrintToPDF · Roundabout options Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Win2PDF PDFDriver Dane Prairie ZEON Corporation · Supports any application that can print! · Supports any application that can print! · Windows only · Windows only · Full support for CJKV localizations · Adds watermarking & security options Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. activePDF Doc PDFConverter Converter Amyuni Active4 · Supports MSOffice (‘97 & 2000) and WordPerfect · Supports any application that can print! · includes bookmarking, TOC, hyperlinks, etc. · Windows only · Windows only · Can combine multiple jobs into a single PDF · Postscript based! · Adds encryption & linearization options Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. PrintToPDF Roundabout options James W. Walker or how to convert w/o printing? · Supports any application that can print! · Mac OS only · Convert to HTML using wvHTML then to PDF · Supports CJKV · Open with Abiword, print to PS, “distill” · Supports auto-conversion of URL’s, bookmarks, and more Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. TeX PDFTeX · Extended version of TeX for producing PDF documents · PDFTeX · Available for Mac OS, Windows, Unix, etc. · Lout · Open source · Includes tools for specifying PDF specific options · font embedding, hyperlinks, annotations, bookmarks, and more! Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Lout PDF Manipulation · AppendPDF/AppendPDF Pro · Layout Changer · TeX-like markup language with PDF output · Glance PDFTools · Windows and Unix only · Ari’s PDF Splitter · Open Source · StampPDFBatch · Support for non-Roman languages · Pitstop Server · AudienceOne Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. AppendPDF/AppendPDF Layout Changer Pro Digital Applications GDykes · Available for Mac OS, Windows, and Unix · Create new PDF documents from portions (or the · Available for Mac OS and Windows entirety) of other PDF documents. · Create new PDF documents from portions (or the · Have cover pages, TOC, bookmarks, etc. all entirety) of other PDF documents. generated automatically · Basic imposition capabilities by specifying · Convert pages to a “2-up” layout “layout” of multiple existing pages onto a new · Apply page numbers, watermarking, images, etc. page to newly created documents Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Glance PDFTools Ari’s PDF Splitter Glance Dionis · Available for Windows only · Create new PDF documents from portions (or the · Available for Mac OS and Windows entirety) of other PDF documents. · Create new documents by splitting existing · Have cover pages, TOC, bookmarks, etc. all documents up generated automatically · Basic image watermarking of documents Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. StampPDFBatch Pitstop Server Digital Applications Enfocus · Available for Mac OS and Windows · Available for Mac OS, Windows, and Unix · Preflighting and Correction · Adds new “permanent” content (as opposed to · Media Size, Font Embedding and Usage, Spot to CMYK Color annotations) to existing PDF documents Conversion, Black Text Overprint and White Text Knock-Out · text in any Type 1 font, in any color or drawing mode & rotation · Modify existing content · watermarks, headers & footers, etc. · remove items, change attributes (font, colors, object styles) · JPEG images · Add images Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. AudienceOne PDF Data Extraction Cardiff Software · Xpdf Tools · Customized solutions for PDF creation & · Ghostscript manipulation · Argus Server · business cards, information delivery, etc. · PDFDecoder · WinPDFData Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Xpdf Tools Ghostscript FooLabs Aladdin Enterprises · Available for Mac OS, Windows, Unix, etc. · Open Source · Available for Mac OS, Windows, Unix, etc. · Command line tools for · Open Source · document information · Command line & GUI tools for · text extraction · text extraction · image extraction · PDF->PS · PDF->PS · PDF->HTML Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. Argus Server PDFDecoder Iceni LimitPoint Software · Available for Windows and Solaris · Command line tool for · Available for Mac OS only · text extraction · Scriptable application for · image extraction · document information · PDF->XML · text extraction · PDF->HTML Copyright©1999-2000 Digital Applications, Inc. Copyright©1999-2000 Digital Applications, Inc. WinPDFData PDF Forms SANFACE · FDFMerge · Written in Perl · eFN Technology · extracts document info · Office Courier · Open