ABBYY Finereader Engine 10 User's Guide

Total Page:16

File Type:pdf, Size:1020Kb

ABBYY Finereader Engine 10 User's Guide USER’S GUIDE ABBYY FineReader Engine 10 User's Guide Contents Introducing ABBYY FineReader Engine 10........................................................................ 4 Basic Usage Scenarios Overview.......................................................................................................................................................................................................5 Key Features.....................................................................................................................................................................................................................................................7 Document Scanning and Image Import ................................................................................................................... 9 Image Preprocessing...............................................................................................................................................10 Document Analysis .................................................................................................................................................12 OCR and Other Recognition Technologies .............................................................................................................. 13 PDF Conversion......................................................................................................................................................16 Advanced Development Tools ................................................................................................................................18 Receiving and Exporting Recognized Text..............................................................................................................19 MultiCPU Recognition Architecture .......................................................................................................................20 Benefits.............................................................................................................................................................................................................................................................20 Short Specifications.................................................................................................................................................................................................................................20 Getting Started............................................................................................................................................................................................................................................21 Guided Tour.................................................................................................................... 22 Basic Usage Scenarios Implementation....................................................................................................................................................................................22 Document Conversion............................................................................................................................................22 Document Archiving...............................................................................................................................................27 Book Archiving ....................................................................................................................................................... 32 Text Extraction....................................................................................................................................................... 37 FieldLevel Recognition...........................................................................................................................................41 Barcode Recognition...............................................................................................................................................45 Image Preprocessing...............................................................................................................................................49 Scanning................................................................................................................................................................. 53 Advanced Techniques............................................................................................................................................................................................................................57 Programming Aspects.............................................................................................................................................58 Error Handling .......................................................................................................................................................59 Working with Properties .........................................................................................................................................59 Working with Connectable Objects.........................................................................................................................62 Working with COM Interfaces from a Scripting Language...................................................................................... 63 Using ABBYY FineReader Engine in Delphi.............................................................................................................64 Working with Profiles..............................................................................................................................................65 Tuning Analysis, Recognition, and Synthesis Parameters.......................................................................................66 Tuning Export Parameters .....................................................................................................................................68 Working with Images ..............................................................................................................................................70 Working with Languages ........................................................................................................................................71 Working with Layout and Blocks ............................................................................................................................ 73 Working with Text ..................................................................................................................................................75 Working with the Logical Structure of a Document ................................................................................................76 Using Voting API......................................................................................................................................................79 Using Text Type Autodetection ...............................................................................................................................82 Recognizing Checkmarks........................................................................................................................................82 Recognizing Handprinted Texts .............................................................................................................................85 Recognizing Hieroglyphic Languages......................................................................................................................87 1 ABBYY FineReader Engine 10 User's Guide Recognizing with Training......................................................................................................................................88 Training User Patterns............................................................................................................................................90 Pattern Training Dialog Box ..................................................................................................................................91 Working with Dictionaries...................................................................................................................................... 93 Working with ABBYY FineReader Engine Regular Expressions...............................................................................96 Recognizing Words with Spaces..............................................................................................................................97 Setting up Scanning Options...................................................................................................................................99 Best Practices ............................................................................................................................................................................................................................................100 Tips for Document Scanning ................................................................................................................................101 Tips for Taking Photos..........................................................................................................................................102 Improving Recognition Quality.............................................................................................................................104 Description of the ABBYY FineReader Engine Samples............................................................................................................................................105
Recommended publications
  • Adobe Indesign Introduction to Digital Humanities
    Platform Study Adobe InDesign Introduction to Digital Humanities 2015 Matt Higgins Design is mind control. Introduction Modernist designers sought to find universal concepts within design. They wanted to know how visual elements affected human beings on a psychological level. This is why the works of Modernists such as Josef Müller-Brockmann, El Lissitzky, and Jan Tschichold, feature basic colors and shapes. They believed stripping design down to its most basic elements would remove any sentiment or bias that certain visuals could produce and allow for an objective study on how humans are affected by design. There have been countless movements like Modernism. They have invariably found their way into design. Many of those movements would reject the principles of Modernism and their universals. But it is plain to see, regardless of philosophy or ideology, that design affects human beings. If it did not, why would we continue designing? The nature of graphic design has always been to communicate. To affect people. Fresh Dialogue Sagmeister & Walsh This differentiates it from traditional fine arts. Certainly a We can think of design in terms of verbal conversation. What painting can communicate. The medium only matters in how it words are spoken is just as important as how the words are relates to the relaying of the message. But we tend to think of spoken. Then we take into account body language. From there fine art as a form of self expression. The artists is much more we can list a whole host of factors beyond the words spoken that involved in the work.
    [Show full text]
  • Canon Solutions America, Inc. 5200 Upper Metro Place, 150 Dublin, OH 43017 Phone: 800.815.4000 CURRENT PRICELI
    Canon Solutions America, Inc. 5200 Upper Metro Place, 150 Dublin, OH 43017 Phone: 800.815.4000 www.csa.canon.com CURRENT PRICELIST SUPPLIER: CANON SOLUTIONS AMERICA, INC. INDEX NUMBER: STS 096 SCHEDULE NUMBER: 800324 EFFECTIVE DATES: 02/25/2020 TO 4/30/2020 Chapter 1: VarioStream 7000 - Simplex Section 2: Primary Interface (Mandatory - Must Select) Section 3: Second and Third Interface (Optional) Section 4: Options (updated for MICR options) Section 5: Color Starter Kits Section 6: Upgrades Chapter 2: VarioStream 7000 - Twins Section 1: Twin Printers Section 2: Primary Interface (Mandatory - Must Select) Section 3: Second and Third Interface (Optional) Section 4: Twin Options Section 5: Color Starter Kits Section 6: Upgrades Chapter 3: VarioStream 7000 - Triplex Section 1: Triplex Printers Section 2: Primary Interface (Mandatory - Must Select) Section 3: Second and Third Interface (Optional) Section 4: Options Section 5: Color Starter Kits Section 6: Upgrades Chapter 4: VarioStream 8000 - Simplex Section 1: Simplex Printers Section 2: Second and Third Interface (Optional) Section 3: Options Section 4: MICR Options Section 5: CustomTone Options Section 6: Upgrades Section 7: Accessories Section 8: Remote Diagnostics Chapter 5: VarioStream 8000- Twin Section 1: Twin Printers Section 2: Second and Third Interface (Optional) Section 3: Options Section 4: MICR Options Section 5: CustomTone Options Section 6: Upgrades Section 7: Accessories Section 8: Remote Diagnostics Chapter 10: CS3000 TWIN ColorStream Twin Series Section 0: Standard Configuration
    [Show full text]
  • Desktop Publishing 45, Anurag Nagar, Behind Press Complex, Indore
    B.Com 1st Year (Plain) Subject- Desktop Publishing SYLLABUS Class – B.Com. I Year Subject – Desktop Publishing UNIT – I Importance and Advantages of DTP, DTP Software and Hardware, Commercial DTP Packages, Page Layout programs, Introduction to Word Processing, Commercial DTP Packages, Difference between DTP Software and word Processing. UNIT – II Types of Graphics, Uses of Computer Graphics Introduction to Graphics Programs, Font and Typeface, Types of Fonts, Creation of Fonts (Photographer), Anatomy of Typefaces, Printers, Types of Printers used in DTP, Plotter, Scanner. UNIT – III History and Versions of PageMaker, Creating a New Page, Document Setup Dialog Box, Paper size, Page Orientation, Margins, Different Methods of Placing text and graphics in a document, master Page, Story Editor, Formatting of Text, Indent, Leading, Hyphenation, Spelling Check, Creating Index, Text Wrap, Position (Superscript/Subscript), Control Palette. UNIT – IV History of Multimedia Elements, Text, Images, Sound, Animation and Video, Text, concept of Plain Text and Formatted Text, RTF& HTML Text, Image, Importance of Graphics in Multimedia, Image Capturing Methods, Scanner, Digital Camera, Sound0 Sound and its effect in Multimedia, Analog and Digital sound, Animation, Basics, Principles and use of Animation, video, Basics of Video, Analog and Digital Video. UNIT – V Features Of Multimedia, Overview of Multimedia, Multimedia Software Tools, Multimedia Authoring- Production and Presentation, Graphics File Formats, MIDI-Overviews, Concepts, Structure of MIDI, MIDI Devices, MIDI Messages. 45, Anurag Nagar, Behind Press Complex, Indore (M.P.) Ph.: 4262100, www.rccmindore.com 1 B.Com 1st Year (Plain) Subject- Desktop Publishing UNIT I 1.1 Introduction to Desktop Publishing Desktop Publishing (DTP) is the creation of electronic forms of information documents using page layout skills on a personal computer primarily for print.
    [Show full text]
  • Desktop Publishers
    Network and Computer Systems Administrators Desktop Publishers TORQ Analysis of Network and Computer Systems Administrators to Desktop Publishers INPUT SECTION: Transfer Title O*NET Filters Network and Computer Systems Importance Weight: From Title: 15-1071.00 Abilities: Administrators LeveL: 50 1 Importance Weight: To Title: Desktop Publishers 43-9031.00 Skills: LeveL: 69 1 Labor Market Importance Weight: Maine Statewide Knowledge: Area: Level: 69 1 OUTPUT SECTION: Grand TORQ: 90 Ability TORQ Skills TORQ Knowledge TORQ Level Level Level 94 84 91 Gaps To Narrow if Possible Upgrade These Skills Knowledge to Add Ability Level Gap Impt Skill Level Gap Impt Knowledge Level Gap Impt Speech No Skills Upgrade Required! No Knowledge Upgrades Required! 44 3 62 Clarity LEVEL and IMPT (IMPORTANCE) refer to the Target Desktop Publishers. GAP refers to level difference between Network and Computer Systems Administrators and Desktop Publishers. ASK ANALYSIS Ability Level Comparison - Abilities with importance scores over 50 Network and Computer Description Systems Administrators Desktop Publishers Importance Written Comprehension 67 51 81 Near Vision 66 62 78 Visualization 62 57 72 Oral Comprehension 66 55 68 Written Expression 51 50 68 Fluency of Ideas 60 55 65 Problem Sensitivity 69 44 65 Oral Expression 53 62 73 Originality 59 51 62 Speech Clarity 41 44 62 TORQ Analysis Page 1 of 12. Copyright 2009. Workforce Associates, Inc. Network and Computer Systems Administrators Desktop Publishers Category Flexibility 57 51 59 Selective Attention 53 42 59
    [Show full text]
  • Download Issue 6
    £2.50 PageStream 4 from screen to page Issue 6, Autumn 2000 Gary Peake Interview Accelerators Feature ADSL Monitors and Scandoublers Heretic II Virtual GrandPrix Top Tips What’s new in OS 3.5? Hard Drivin’ Part 2 And much more... CONTENTS By Contents Editor Robert Williams News Welcome to the biggest issue of thank you to all the Clubbed ever! The extra three pages of contributors who SEAL Update ............................... 4 editorial in this issue have been made helped me with News Items .................................. 5 possible by two well known Amiga com- this issue, and to Amiga Update.............................. 9 panies, Eyetech and Analogic, agreeing Sharon who Gary Peake Interview .................. 10 to advertise with us. I would like to reas- checked an MorphOS ..................................... 12 sure readers that this additional adver- avalanche of articles in record time. tising will not bias us in any way, nor Despite the lack of time we’ve got some does it mean Clubbed is turning into a interesting articles in this issue. Mick Features profit making publication. All revenue has been playing Hyperion’s first received from advertising will be used to Acceleration!................................ 14 product, a port of the magical romp improve and enlarge the mag over the ADSL ........................................... 18 Heretic II that will push your PPC and base size paid for by subscriptions. BVision to the limit! I’ve reviewed Reviews Unfortunately you may find this maga- PageStream 4, as used to produce zine isn’t quite a polished as previous Clubbed, and Gary Storm has been PageStream 4.............................. 20 issues. I had to work long days and speaking to Gary Peake, head of devel- Fiasco .........................................
    [Show full text]
  • Les Formats De Fichier Images
    Les formats de fichier images Extension Description Catégorie Quel logiciel pour ouvrir ? diffusion ai Fichier Adobe Illustrator Image Vectorielle Adobe Illustrator pdf Adobe Acrobat Pro png Adobe Photoshop Adobe Photoshop Elements Adobe Flash Adobe Reader Apple Preview IMSI TurboCAD Deluxe ACD Systems Canvas CorelDRAW Graphics Suite Inkscape blend Fichier de données 3D Blender Image 3D Blender jpg/jpeg png avi bmp Fichier Image Bitmap Image Bitmap Apple Preview jpg Adobe Photoshop png Adobe Photoshop Elements Adobe Illustrator The GIMP Paint Paint.NET Roxio Toast The Logo Creator Corel Paint Shop Photo Pro X3 ACDSee Photo Manager 2009 Microsoft Windows Photo Gallery Viewer Nuance PaperPort Nuance OmniPage Professional Roxio Creator Inkscape dwg AtoCAD Drawing Database File Image 3D IMSI TurboCAD Deluxe jpg/jpeg PowerDWG Translator png Microspot DWG Viewer avi SolidWorks eDrawings Viewer Adobe Illustrator Autodesk AutoCAD Autodesk DWG TrueView SolidWorks eDrawings Viewer ACD Systems Canvas Corel Paint Shop Photo Pro Caddie DWG converter AutoDWG Solid Converter DWG IrfanView Formats de fichiers Images 1/7 Les formats de fichier images Extension Description Catégorie Quel logiciel pour ouvrir ? diffusion dxf Drawing Exchange Format File Image 3D TurboCAD Deluxe 16 jpg/jpeg PowerCADD PowerDWG translator png Microspot DWG Viewer avi NeoOffice Draw DataViz MacLink Plus Autodesk AutoCAD IMSI TurboCAD Deluxe SolidWorks eDrawings Viewer Corel Paint Shop Photo Pro ACD Systems Canvas DWG converter DWG2Image Converter OpenOffice.org Draw Adobe Illustrator
    [Show full text]
  • Oracle Application Server Reports Services Publishing Reports to the Web, 10G Release 2 (10.1.2) B14048-02
    Oracle® Application Server Reports Services Publishing Reports to the Web 10g Release 2 (10.1.2) B14048-02 December 2005 Oracle Application Server Reports Services Publishing Reports to the Web, 10g Release 2 (10.1.2) B14048-02 Copyright © 2003, 2005, Oracle. All rights reserved. Primary Author: Ingrid Snedecor Contributors: Ellen Gravina, Panna Hegde, Vinayak Hegde, Rohit Marwaha, Ratheesh Pai, Vinodkumar Pandurangan, Pravin Prabhakar, Rajesh Ramachandran, Vishal Sharma, Navneet Singh, Puvanenthiran Subbaraj, Philipp Weckerle The Programs (which include both the software and documentation) contain proprietary information; they are provided under a license agreement containing restrictions on use and disclosure and are also protected by copyright, patent, and other intellectual and industrial property laws. Reverse engineering, disassembly, or decompilation of the Programs, except to the extent required to obtain interoperability with other independently created software or as specified by law, is prohibited. The information contained in this document is subject to change without notice. If you find any problems in the documentation, please report them to us in writing. This document is not warranted to be error-free. Except as may be expressly permitted in your license agreement for these Programs, no part of these Programs may be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose. If the Programs are delivered to the United States Government or anyone licensing or using the Programs on behalf of the United States Government, the following notice is applicable: U.S. GOVERNMENT RIGHTS Programs, software, databases, and related documentation and technical data delivered to U.S. Government customers are "commercial computer software" or "commercial technical data" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations.
    [Show full text]
  • Layout a Bid
    Layout a bid Adapted from IACURH Award Resource Guide Layout Overview What is bid layout? The layout of a bid is the organization of information on each page and throughout the bid. Oftentimes, this includes different outlets for displaying text and information, graphical depictions of thematic elements and more. Layout Methods Paragraphs Bulleted Lists Text Boxes Graphics Pictures Backgrounds Why does layout matter? The layout of a bid can be just as important as who or what the bid is written about. The most deserving subject could be buried by a bid with poor layout. Information If information is hard to find or comprehend in depth and at a glance, the bid becomes less effective. This can be taken care of with effective layout methods (bulleted lists, text boxes, etc.). Theme The layout of a bid is the easiest way to incorporate thematic elements (images, wording, attitude, etc.). This will help you better expose the audience to the theme and, as a result, aid them in developing a better understanding of your nominee. Fun & Readability It’s simple; a well laid-out bid is just easier and more fun to read and makes the bidding process exciting. Without being able to read the text, which of the following would you prefer to read? Layout Basics Once you’ve selected your layout software program, it’s time to start thinking about the layout of your bid. The following suggestions will help you to set the foundation for your bid layout: Select Layout the List Topics Storyboard Review Theme Bid 1. Select Theme First, select the theme of your bid if you will be using one.
    [Show full text]
  • Tutorial Kit Omega Semester
    COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: MASS COMMUNICATION COURSE: MAC 322 DISCLAIMER The contents of this document are intended for practice and leaning purposes at the undergraduate level. The materials are from different sources including the internet and the contributors do not in any way claim authorship or ownership of them. The materials are also not to be used for any commercial purpose. 2 MAC 322: DESKTOP PUBLISHING Contributor: OMOREGBE 1. Explain the concept “publishing”. 2. Explain the concept “desktop publishing”. 3. List at least ten major strengths/advantages of desktop publishing over the traditional publishing. 4. State two detailed procedure for inserting columns into your publication. 5. State the detailed procedure for editing your stories in MS Word and going back to Publisher using the MS Publisher package. 6. Give a vivid explanation of the term “Microsoft publisher”. 7. State a detailed procedure for connecting frames for overflowing texts in Publisher. 8. State the detailed procedure for turning your publication into book form. 9. State the detailed procedure for inserting Drop Caps into your stories. 10. One of the features of Desktop Publishing software is the ability to import texts or documents from word processing software. State one detailed procedure for importing text from Microsoft Word into Publisher without launching MS Word. 11. State two procedures for removing unwanted pages from your publication in Ms Publisher. 12. What are the components of Desktop publishing? 13. Why do we use pictures/photographs and illustrations in publications? 14. List and explain the features of Desktop Publishing. 15. List at least ten examples of Desktop publishing.
    [Show full text]
  • Download Issue 14
    IssueBiggest Ever! £4.00 Issue 14, Spring 2003 8.00Euro Quake 2 Read our comprehensive review of Hyperions’s latest port. Hollywood Take a seat and enjoy our full review of this exciting new multimedia blockbuster! Contents Features The Show Must Go On! Editorial Welcome to another issue of Candy for SEAL’s Mick Sutton gives us an insight into the production of WoASE. Total Amiga, as you will no-doubt Issue 14 usergroups can afford. To give balance between space for the have noticed this issue is rather ack in the good old days we you an idea a venue capable of punters and giving the exhibitors late, which is a pity as we had Candy Factory is a graphics A built-in character generator had World of Amiga shows holding between 300 and 500 the stand space they require improved our punctuality over OS4 B the last few issues. application designed for allows you to add effects to Spring 2002 put on every year, usually at a people can cost anywhere from (some companies get a real bee high profile site (Wembley) and £500 to £1000 (outside London) in their bonnet about where they Unfortunately the main reason making logos and other text in any font without leaving texture again based on the all well attended. Everybody for a day. are situated). The floorplan goes behind the delay was that the graphics with high quality 3D the program. You can also load Contents wanted to be there and be seen, through many revisions before SCSI controller and PPC on my textured effects quickly and shapes (for example a logo) light source.
    [Show full text]
  • La Rivoluzione Digitale Dal Desktop Publishing All'editoria Multimediale Relazione Per Il Seminario Di Cultura Digitale, AA 2011/2012
    Giulia Santi matricola 409810 La rivoluzione digitale dal desktop publishing all'editoria multimediale Relazione per il Seminario di Cultura Digitale, AA 2011/2012 Introduzione Il seminario tenuto dal professor Carlo Bordoni mi ha incuriosito ed ho cercato informazioni riguardanti l'editoria digitale: come è nata, attraverso quali programmi si è sviluppata, come ha cambiato il mondo editoriale. Se la mia impressione iniziale era che i programmi di impaginazione avessero semplicemente semplificato il modo tradizionale di creare libri, ho invece scoperto la profonda rivoluzione che hanno operato nell'edizione di prodotti editoriali: la sempre maggiore integrazione col web ha permesso di passare dal libro cartaceo impaginato elettronicamente al cd-rom, all'e- book, al giornale on line. Le conseguenze non sono state irrilevanti: la revisione completa del ciclo produttivo editoriale, la scomparsa di vecchie figure professionali legate alla stampa e la nascita di nuovi ruoli legati all'informatica e alla grafica applicate all'editoria. La "rivoluzione digitale" ha portato vantaggi, come la riduzione dei costi, la semplificazione nella creazione di prodotti culturali fino ad arrivare al book on demand, la multimedialità, la sempre più grande facilità di accesso alla cultura; allo stesso tempo ha degli svantaggi, come i problemi legati alla pirateria e all'abbassamento della qualità della letteratura prodotta. Inoltre se alcuni prodotti sono entrati nell'uso comune e stanno aumentando il loro bacino di utenza (ad esempio i giornali on line) è ancora grande la diffidenza nei confronti di altri (il rapporto tra aumento dei titoli di e-book in commercio e ricavi delle vendite è ancora esiguo, anche se in crescita).
    [Show full text]
  • Cycle Three English Language Teachers' Perceptions of Their Ict Use in Teaching English Language in the Uae
    United Arab Emirates University Scholarworks@UAEU Theses Electronic Theses and Dissertations 6-2012 Cycle Three English Language Teachers’ Perceptions of Their ICT Use in Teaching English Language in the UAE. Ali Hussein Haidar Mohammed Follow this and additional works at: https://scholarworks.uaeu.ac.ae/all_theses Part of the Curriculum and Instruction Commons Recommended Citation Haidar Mohammed, Ali Hussein, "Cycle Three English Language Teachers’ Perceptions of Their ICT Use in Teaching English Language in the UAE." (2012). Theses. 193. https://scholarworks.uaeu.ac.ae/all_theses/193 This Thesis is brought to you for free and open access by the Electronic Theses and Dissertations at Scholarworks@UAEU. It has been accepted for inclusion in Theses by an authorized administrator of Scholarworks@UAEU. For more information, please contact [email protected]. ),ioJl �j-SUI ulJLaYI Ci.sLal? Faculty of nited Arab Emirates University �u Education United Arab Emirates University Faculty of Education Department of Curriculum and Instruction Ma ter of Education Cycle Three Engli h Language Teachers' Perceptions of their ICT Use in Teaching English Langu age in the UAE By Ali Hu ein Haidar Mohammed A Thesis Submitted to United Arab Emirates University For the Degree of Master of Education Curriculum and Instruction: English Language June 2012 CYCLE THREE ENGLISH LANGUAGE TEACHERS' PERCEPTIONS OF THEIR ICT USE IN TEACHING ENGLISH LANGUAGE IN THE UAE By Ali Hussein Haidar Mohammed Thesis Approved by: Dr. Abdulrahman Almekhlafi ---------------- ----------- Advisor and Chair = gQ?-tC� Dr. Sadiq Abdulwahed Ismail Member Dr. Najem Aldeen Alshaikh Member June 2012 AB TRACT Thl tud tried to cxarnll1e the perception of yc1e Three English language teachers' (ElT ) of thclr Inti nnatlOn and Communicati e Technology (lCT) use.
    [Show full text]