TEXT LINE EXTRACTION USING SEAM CARVING a Thesis

Total Page:16

File Type:pdf, Size:1020Kb

TEXT LINE EXTRACTION USING SEAM CARVING a Thesis TEXT LINE EXTRACTION USING SEAM CARVING A Thesis Presented to The Graduate Faculty of The University of Akron In Partial Fulfillment of the Requirement for the Degree Master of Science Christopher Stoll May, 2015 TEXT LINE EXTRACTION USING SEAM CARVING Christopher Stoll Thesis Approved: Accepted: __________________________________ __________________________________ Advisor Dean of the College Dr. Zhong-Hui Duan Dr. Chand Midha __________________________________ __________________________________ Faculty Reader Interim Dean of the Graduate School Dr. Chien-Chung Chan Dr. Rex Ramsier __________________________________ __________________________________ Faculty Reader Date Dr. Yingcai Xiao __________________________________ Department Chair Dr. Timothy Norfolk !ii ABSTRACT Optical character recognition (OCR) is a well researched area of computer science; presently there are numerous commercial and open source applications which can perform OCR, albeit with varying levels of accuracy. Since the process of performing OCR requires extracting text from images, it should follow that text line extraction is also a well researched area. And indeed there are many methods to extract text from images of scanned documents, the process known in that field as document analysis and recognition. However, the existing text extraction techniques were largely devised to feed existing character recognition techniques. Since this work was originally conceived from the perspective of computer vision and pattern recognition and pattern analysis and machine intelligence, a new approach seemed necessary to meet the new objectives. Out of that need an apparently novel approach to text extraction was devised which relies upon the central idea behind seam carving. Text images are examined for seams, but rather than removing the lowest energy seams, they are evaluated to determine where text is located within the image. The approach can be run iteratively, alternating the direction of seam evaluation, to recognize !iii increasingly specific areas of the image. The ultimate goal is to create an algorithm which can provide data to a new type of recognition algorithm, one that can understand information at higher levels of the document structure and possibly use information gained from those higher levels (words or phrases) to increase the accuracy of identifying data in the lower levels (characters). This paper explores the existing methods used for text extraction, touching upon existing OCR techniques, and then describes a novel technique for information extraction based upon seam carving. Modifications needed to adapt the seam carving process to the new problem domain are explained. Then, two output methods, direct area detection and information masking, are described. Finally, potential modifications to the technique, which could make it suitable for use in other domains such as general computer vision or bioinformatics, are discussed. !iv ACKNOWLEDGEMENTS First, I would like to thank Dr. Zhong-Hui Duan for making this work possible. Her guidance on this project was invaluable and, perhaps more importantly, I would have never even seen the possibilities without the knowledge I gained from Dr. Duan’s instruction. I would also like to thank my committee members, Dr. Chien-Chung Chan and Dr. Yingcai Xiao, for the time they took reviewing my work and providing valuable feedback. Additionally, I would like to thank Dr. Kathy J. Liszka, who helped an unlikely candidate get a chance to succeed in a Computer Science program. I would also like to acknowledge the contributions of Dr. Michael L. Collard and Dr. Timothy W. O’Neil, who influenced the methodologies and technical implementations used in this project. Finally, I would like to thank my wife Heather, my boys, my family, and my friends. Without their support, occasional diversions, and encouragement I would have never been able to see this project through to completion. Namaste. !v TABLE OF CONTENTS Page LIST OF FIGURES............................................................................................................. x LIST OF EQUATIONS.................................................................................................... xii CHAPTER I I.INTRODUCTION INTRODUCTION............................................................................................................. 1 EXISTINGII. EXISTING APPROACHES APPROACHES.............................................................................................. 5 Text Extraction Methods.............................................................................................. 5 Run-length Smearing.............................................................................................. 6 X-Y Cut...................................................................................................................... 8 Docstrum.................................................................................................................. 9 Whitespace Analysis............................................................................................. 11 Voronoi.................................................................................................................... 12 Text Extraction Software............................................................................................ 13 Cuneiform............................................................................................................... 14 OCRopus................................................................................................................. 14 !vi Tesseract.................................................................................................................. 15 OCRFeeder............................................................................................................. 15 III.PROPOSED PROPOSED APPROACH APPROACH.............................................................................................. 17 Seam Carving.............................................................................................................. 18 Preprocessing Functions....................................................................................... 20 Energy Functions................................................................................................... 21 Simple Gradient................................................................................................ 21 Sobel Edge Detection........................................................................................ 22 Laplacian of Gaussians.................................................................................... 22 Difference of Gaussians................................................................................... 23 Difference of Gaussians with Sobel................................................................ 24 Seam Traversal....................................................................................................... 25 Text Extraction Steps.................................................................................................. 27 Direct Area Detection............................................................................................ 30 Information Masking............................................................................................ 31 Method Comparison............................................................................................. 32 Overcoming Limitations....................................................................................... 34 !vii Improved Skew Handling............................................................................... 35 Complicated Layouts....................................................................................... 36 Extracting Finer Details............................................................................................. 37 Asymptotic Analysis.................................................................................................. 38 Related Work............................................................................................................... 40 Future Work................................................................................................................ 41 CONCLUSIONSIV. CONCLUSIONS.............................................................................................................. 46 REFERENCES.................................................................................................................. 49 APPENDICES.................................................................................................................. 52 PROGRAMAPPENDIX A:MAKEFILE PROGRAM........................................................................................... MAKEFILE 53 PROGRAMAPPENDIX MAINB: PROGRAM — SC.C MAIN..................................................................................... — SC.C 54 SEAMAPPENDIX CARVING C: SEAM FUNCTIONS CARVING — FUNCTIONS LIBSEAMCARVE.C — LIBSEAMCARVE.C.................................... 57 PNGAPPENDIX IMPORT D: FUNCTIONSPNG IMPORT — FUNCTIONS LIBPNGHELPER.C — LIBPNGHELPER.C........................................... 74 IMAGEAPPENDIX RESIZE E: IMAGE FUNCTIONS RESIZE — FUNCTIONS LIBRESIZE.C —.................................................. ILBRESIZE.C 78 IMAGEAPPENDIX BINARIZATION F: IMAGE BINARIZATION — LIBBINARIZATION.C — LIBBINARIZATION.C........................................... 80 IMAGEAPPENDIX ENERGY G: IMAGE FUNCTIONS ENERGY —
Recommended publications
  • OCR Pwds and Assistive Qatari Using OCR Issue No
    Arabic Optical State of the Smart Character Art in Arabic Apps for Recognition OCR PWDs and Assistive Qatari using OCR Issue no. 15 Technology Research Nafath Efforts Page 04 Page 07 Page 27 Machine Learning, Deep Learning and OCR Revitalizing Technology Arabic Optical Character Recognition (OCR) Technology at Qatar National Library Overview of Arabic OCR and Related Applications www.mada.org.qa Nafath About AboutIssue 15 Content Mada Nafath3 Page Nafath aims to be a key information 04 Arabic Optical Character resource for disseminating the facts about Recognition and Assistive Mada Center is a private institution for public benefit, which latest trends and innovation in the field of Technology was founded in 2010 as an initiative that aims at promoting ICT Accessibility. It is published in English digital inclusion and building a technology-based community and Arabic languages on a quarterly basis 07 State of the Art in Arabic OCR that meets the needs of persons with functional limitations and intends to be a window of information Qatari Research Efforts (PFLs) – persons with disabilities (PWDs) and the elderly in to the world, highlighting the pioneering Qatar. Mada today is the world’s Center of Excellence in digital work done in our field to meet the growing access in Arabic. Overview of Arabic demands of ICT Accessibility and Assistive 11 OCR and Related Through strategic partnerships, the center works to Technology products and services in Qatar Applications enable the education, culture and community sectors and the Arab region. through ICT to achieve an inclusive community and educational system. The Center achieves its goals 14 Examples of Optical by building partners’ capabilities and supporting the Character Recognition Tools development and accreditation of digital platforms in accordance with international standards of digital access.
    [Show full text]
  • Glosar De Termeni
    CUPRINS 1. Reprezentarea digitală a informaţiei……………………………………………………….8 1.1. Reducerea datelor………………………………………………………………..10 1.2. Entropia şi câştigul informaţional………………………………………………. 10 2. Identificarea şi analiza tipurilor formatelor de documente pe suport electronic……... 12 2.1. Tipuri de formate pentru documentele pe suport electronic……………………..…13 2.1.1 Formate text……………………………………………………………… 13 2.1.2 Formate imagine……………………...........……………………………....26 2.1.3 Formate audio……………………………………………………………...43 2.1.4 Formate video……………………………………………………………. 48 2.1.5. Formate multimedia.....................................................................................55 2.2. Comparaţie între formate………………………….………………………………72 3. Conţinutul web………………………………………………………………………………74 3.1. Identificarea formatelor de documente pentru conţinutul web……………………. 74 4. Conversia documentelor din format tradiţional în format electronic…………………101 4.1. Metode de digitizare……………………………………………………………..104 4.1.1 Cerinţe în procesul de digitizare……………………………………………104 4.1.2. Scanarea……………………………………………………………………104 4.1.3. Fotografiere digitală………………………………………………………105 4.1.4. Scanarea fotografiilor analogice………………………………………106 4.2. Arhitectura unui sistem de digitizare………………………………………………106 4.3. Moduri de livrare a conţinutului digital………………………………………….108 4.3.1. Text sub imagine………….……………………………………………..108 4.3.2. Text peste imagine…………………………………………….…………108 4.4. Alegerea unui format………………………………………………………………108 4.5. Tehnici de conversie prin Recunoaşterea Optică a Caracterelor (Optical Charater Recognition-OCR)……………………………………………109 4.5.1. Scurt istoric al conversiei documentelor din format tradiţional prin scanare şi OCR-izare…………………... …..109 4.5.2. Starea actuală a tehnologiei OCR……………………………………….110 4.5.3. Tehnologii curente, produse software pentru OCR-izare………………112 4.5.4. Suportul lingvistic al produselor pentru OCR-izare………………….…114 1 4.5.5. Particularităţi ale sistemelor OCR actuale………………………..……..118 5. Elaborarea unui model conceptual pentru un sistem de informare şi documentare………………………………………………………126 5.1.
    [Show full text]
  • Reconocimiento De Escritura Lecture 4/5 --- Layout Analysis
    Reconocimiento de Escritura Lecture 4/5 | Layout Analysis Daniel Keysers Jan/Feb-2008 Keysers: RES-08 1 Jan/Feb-2008 Outline Detection of Geometric Primitives The Hough-Transform RAST Document Layout Analysis Introduction Algorithms for Layout Analysis A `New' Algorithm: Whitespace Cuts Evaluation of Layout Analyis Statistical Layout Analysis OCR OCR - Introduction OCR fonts Tesseract Sources of OCR Errors Keysers: RES-08 2 Jan/Feb-2008 Outline Detection of Geometric Primitives The Hough-Transform RAST Document Layout Analysis Introduction Algorithms for Layout Analysis A `New' Algorithm: Whitespace Cuts Evaluation of Layout Analyis Statistical Layout Analysis OCR OCR - Introduction OCR fonts Tesseract Sources of OCR Errors Keysers: RES-08 3 Jan/Feb-2008 Detection of Geometric Primitives some geometric entities important for DIA: I text lines I whitespace rectangles (background in documents) Keysers: RES-08 4 Jan/Feb-2008 Outline Detection of Geometric Primitives The Hough-Transform RAST Document Layout Analysis Introduction Algorithms for Layout Analysis A `New' Algorithm: Whitespace Cuts Evaluation of Layout Analyis Statistical Layout Analysis OCR OCR - Introduction OCR fonts Tesseract Sources of OCR Errors Keysers: RES-08 5 Jan/Feb-2008 Hough-Transform for Line Detection Assume we are given a set of points (xn; yn) in the image plane. For all points on a line we must have yn = a0 + a1xn If we want to determine the line, each point implies a constraint yn 1 a1 = − a0 xn xn Keysers: RES-08 6 Jan/Feb-2008 Hough-Transform for Line Detection The space spanned by the model parameters a0 and a1 is called model space, parameter space, or Hough space.
    [Show full text]
  • An Accuracy Examination of OCR Tools
    International Journal of Innovative Technology and Exploring Engineering (IJITEE) ISSN: 2278-3075, Volume-8, Issue-9S4, July 2019 An Accuracy Examination of OCR Tools Jayesh Majumdar, Richa Gupta texts, pen computing, developing technologies for assisting Abstract—In this research paper, the authors have aimed to do a the visually impaired, making electronic images searchable comparative study of optical character recognition using of hard copies, defeating or evaluating the robustness of different open source OCR tools. Optical character recognition CAPTCHA. (OCR) method has been used in extracting the text from images. OCR has various applications which include extracting text from any document or image or involves just for reading and processing the text available in digital form. The accuracy of OCR can be dependent on text segmentation and pre-processing algorithms. Sometimes it is difficult to retrieve text from the image because of different size, style, orientation, a complex background of image etc. From vehicle number plate the authors tried to extract vehicle number by using various OCR tools like Tesseract, GOCR, Ocrad and Tensor flow. The authors in this research paper have tried to diagnose the best possible method for optical character recognition and have provided with a comparative analysis of their accuracy. Keywords— OCR tools; Orcad; GOCR; Tensorflow; Tesseract; I. INTRODUCTION Optical character recognition is a method with which text in images of handwritten documents, scripts, passport documents, invoices, vehicle number plate, bank statements, Fig.1: Functioning of OCR [2] computerized receipts, business cards, mail, printouts of static-data, any appropriate documentation or any II. OCR PROCDURE AND PROCESSING computerized receipts, business cards, mail, printouts of To improve the probability of successful processing of an static-data, any appropriate documentation or any picture image, the input image is often ‘pre-processed’; it may be with text in it gets processed and the text in the picture is de-skewed or despeckled.
    [Show full text]
  • Gradu04243.Pdf
    Paperilomakkeesta tietomalliin Kalle Malin Tampereen yliopisto Tietojenkäsittelytieteiden laitos Tietojenkäsittelyoppi Pro gradu -tutkielma Ohjaaja: Erkki Mäkinen Toukokuu 2010 i Tampereen yliopisto Tietojenkäsittelytieteiden laitos Tietojenkäsittelyoppi Kalle Malin: Paperilomakkeesta tietomalliin Pro gradu -tutkielma, 61 sivua, 3 liitesivua Toukokuu 2010 Tässä tutkimuksessa käsitellään paperilomakkeiden digitalisointiin liittyvää kokonaisprosessia yleisellä tasolla. Prosessiin tutustutaan tarkastelemalla eri osa-alueiden toimintoja ja laitteita kokonaisjärjestelmän vaatimusten näkökul- masta. Tarkastelu aloitetaan paperilomakkeiden skannaamisesta ja lopetetaan kerättyjen tietojen tallentamiseen tietomalliin. Lisäksi luodaan silmäys markki- noilla oleviin valmisratkaisuihin, jotka sisältävät prosessin kannalta oleelliset toiminnot. Avainsanat ja -sanonnat: lomake, skannaus, lomakerakenne, lomakemalli, OCR, OFR, tietomalli. ii Lyhenteet ADRT = Adaptive Document Recoginition Technology API = Application Programming Interface BAG = Block Adjacency Graph DIR = Document Image Recognition dpi= Dots Per Inch ICR = Intelligent Character Recognition IFPS = Intelligent Forms Processing System IR = Information Retrieval IRM = Image and Records Management IWR = Intelligent Word Recognition NAS = Network Attached Storage OCR = Optical Character Recognition OFR = Optical Form Recognition OHR = Optical Handwriting Recognition OMR = Optical Mark Recognition PDF = Portable Document Format SAN = Storage Area Networks SDK = Software Development Kit SLM
    [Show full text]
  • Integral Estimation in Quantum Physics
    INTEGRAL ESTIMATION IN QUANTUM PHYSICS by Jane Doe A dissertation submitted to the faculty of The University of Utah in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Mathematical Physics Department of Mathematics The University of Utah May 2016 Copyright c Jane Doe 2016 All Rights Reserved The University of Utah Graduate School STATEMENT OF DISSERTATION APPROVAL The dissertation of Jane Doe has been approved by the following supervisory committee members: Cornelius L´anczos , Chair(s) 17 Feb 2016 Date Approved Hans Bethe , Member 17 Feb 2016 Date Approved Niels Bohr , Member 17 Feb 2016 Date Approved Max Born , Member 17 Feb 2016 Date Approved Paul A. M. Dirac , Member 17 Feb 2016 Date Approved by Petrus Marcus Aurelius Featherstone-Hough , Chair/Dean of the Department/College/School of Mathematics and by Alice B. Toklas , Dean of The Graduate School. ABSTRACT Blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah. Blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah. Blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah. Blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah. Blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah. Blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah. Blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah. Blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah.
    [Show full text]
  • JETIR Research Journal
    © 2019 JETIR June 2019, Volume 6, Issue 6 www.jetir.org (ISSN-2349-5162) IMAGE TEXT CONVERSION FROM REGIONAL LANGUAGE TO SPEECH/TEXT IN LOCAL LANGUAGE 1Sayee Tale,2Manali Umbarkar,3Vishwajit Singh Javriya,4Mamta Wanjre 1Student,2Student,3Student,4Assistant Professor 1Electronics and Telecommunication, 1AISSMS, Institute of Information Technology, Pune, India. Abstract: The drivers who drive in other states are unable to recognize the languages on the sign board. So this project helps them to understand the signs in different language and also they will be able to listen it through speaker. This paper describes the working of two module image processing module and voice processing module. This is done majorly using Raspberry Pi using the technology OCR (optical character recognition) technique. This system is constituted by raspberry Pi, camera, speaker, audio playback module. So the system will help in decreasing the accidents causes due to wrong sign recognition. Keywords: Raspberry pi model B+, Tesseract OCR, camera module, recording and playback module,switch. 1. Introduction: In today’s world life is too important and one cannot loose it simply in accidents. The accident rates in today’s world are increasing day by day. The last data says that 78% accidents were because of driver’s fault. There are many faults of drivers and one of them is that they are unable to read the signs and instructions written on boards when they drove into other states. Though the instruction are for them only but they are not able to make it. So keeping this thing in mind we have proposed a system which will help them to understand the sign boards written in regional language.
    [Show full text]
  • Optical Character Recognition As a Cloud Service in Azure Architecture
    International Journal of Computer Applications (0975 – 8887) Volume 146 – No.13, July 2016 Optical Character Recognition as a Cloud Service in Azure Architecture Onyejegbu L. N. Ikechukwu O. A. Department of Computer Science, Department of Computer Science, University of Port Harcourt. University of Port Harcourt. Port Harcourt. Rivers State. Nigeria. Port Harcourt. Rivers State. Nigeria ABSTRACT The dawn of digital computers has made it unavoidable that Cloud computing and Optical Character Recognition (OCR) everything processed by the digital computer must be technology has become an extremely attractive area of processed in digital form. For example, most famous libraries research over the last few years. Sometimes it is difficult to in the world like the Boston public library has over 6 million retrieve text from the image because of different size, style, books, for public consumption, inescapably has to change all orientation and complex background of image. There is need its paper-based books into digital documents in order that they to convert paper books and documents into text. OCR is still could be stored on a storage drive. It can also be projected that imperfect as it occasionally mis-recognizes letters and falsely every year over 200 million books are being published[22], identifies scanned text, leading to misspellings and linguistics many of which are disseminated and published on papers [6]. errors in the OCR output text. A cloud based Optical Thus, for several document-input jobs, Opitcal Character Character Recognition Technology was used. This was Reconigtion is the utmost economical and swift process powered on Microsoft Windows Azure in form of a Web obtainable.
    [Show full text]
  • Operational Platform with Additional Tools Integrated
    DELIVERABLE SUBMISSION SHEET To: Cristina Maier (Project Officer) DG CONNECT G.2 Creativity Unit EUFO 01/162 - European Commission L-2557 Luxembourg From: Support Action Centre of Competence in Digitisation Project acronym: Succeed Project Number: 600555 Project Manager: Rafael C. Carrasco Jiménez Project Coordinator: Universidad de Alicante The following deliverable: Deliverable title: Operational platform with additional tools integrated Deliverable number: D2.4 Deliverable date: M23 Partners responsible: UA, KB Status: X Public Restricted Confidential is now complete X It is available for your inspection X Relevant descriptive documents are attached The deliverable is: X A document X A website Url: http://succeed-project.eu/demonstrator- platform X Software An event Other Sent to Project Officer: Sent to functional mailbox: On date: [email protected] [email protected] 04/12/2014 Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante. D2.4 Operational platform with additional tools integrated Support Action Centre of Competence in Digitisation (Succeed) December 4, 2014 Abstract Deliverable 2.4 (Operational platform with additional tools integrated) is an online service and open-source platform which complements and enhances deliverable D2.3 (Operational platform, submitted in Decem- ber 2013 and approved in February 2014). This report summarises the digitisation tools incorporated into the online platform, as well as the management of the releases of its open-source components. In 2014, 13 tools have been integrated in addition to those initially available in the operational platform or subsequently added during Suc- ceed's first year. The operational platform, currently provides 41 tools for text digitisation which can be executed and tested online without the need to install any software on the user's computer.
    [Show full text]
  • Choosing Character Recognition Software To
    CHOOSING CHARACTER RECOGNITION SOFTWARE TO SPEED UP INPUT OF PERSONIFIED DATA ON CONTRIBUTIONS TO THE PENSION FUND OF UKRAINE Prepared by USAID/PADCO under Social Sector Restructuring Project Kyiv 1999 <CHARACTERRECOGNITION_E_ZH.DOC> printed on June 25, 2002 2 CONTENTS LIST OF ACRONYMS.......................................................................................................................................................................... 3 INTRODUCTION................................................................................................................................................................................ 4 1. TYPES OF INFORMATION SYSTEMS....................................................................................................................................... 4 2. ANALYSIS OF EXISTING SYSTEMS FOR AUTOMATED TEXT RECOGNITION................................................................... 5 2.1. Classification of automated text recognition systems .............................................................................................. 5 3. ATRS BASIC CHARACTERISTICS............................................................................................................................................ 6 3.1. CuneiForm....................................................................................................................................................................... 6 3.1.1. Some information on Cognitive Technologies ..................................................................................................
    [Show full text]
  • An Open-Source Tool Providing a (Semi-)Automatic OCR Workflow For
    Article OCR4all - An Open-Source Tool Providing a (Semi-)Automatic OCR Workflow for Historical Printings Christian Reul 1, Dennis Christ 1, Alexander Hartelt 1, Nico Balbach 1, Maximilian Wehner 1, Uwe Springmann 2, Christoph Wick 1, Christine Grundig 3, Andreas Büttner 4, and Frank Puppe 1 1 Chair for Artificial Intelligence and Applied Computer Science, University of Würzburg, 97074 Würzburg, Germany 2 Center for Information and Language Processing, LMU Munich, 80538 Munich, Germany 3 Institute for Modern Art History, University of Zurich, 8006 Zurich, Switzerland 4 Institute for Philosophy, University of Würzburg, 97074 Würzburg, Germany * Correspondence: [email protected] Version September 11, 2019 submitted to Appl. Sci. Abstract: Optical Character Recognition (OCR) on historical printings is a challenging task mainly due to the complexity of the layout and the highly variant typography. Nevertheless, in the last few years great progress has been made in the area of historical OCR, resulting in several powerful open-source tools for preprocessing, layout recognition and segmentation, character recognition and post-processing. The drawback of these tools often is their limited applicability by non-technical users like humanist scholars and in particular the combined use of several tools in a workflow. In this paper we present an open-source OCR software called OCR4all, which combines state-of-the-art OCR components and continuous model training into a comprehensive workflow. A comfortable GUI allows error corrections not only in the final output, but already in early stages to minimize error propagations. Further on, extensive configuration capabilities are provided to set the degree of automation of the workflow and to make adaptations to the carefully selected default parameters for specific printings, if necessary.
    [Show full text]
  • Analiza I Optičko Prepoznavanje Rukopisa S Herbarijskih Etiketa U Zbirci Herbarium Croaticum
    SVEUČILIŠTE U ZAGREBU FILOZOFSKI FAKULTET ODSJEK ZA INFORMACIJSKE I KOMUNIKACIJSKE ZNANOSTI KATEDRA ZA ARHIVISTIKU I DOKUMENTALISTIKU Marta Dević Hameršmit Analiza i optičko prepoznavanje rukopisa s herbarijskih etiketa u zbirci Herbarium Croaticum Diplomski rad Mentor: dr. sc. Hrvoje Stančić, red. prof. Neposredni voditelj: Vedran Šegota, dipl. ing. bio. (Botanički zavod, Biološki odsjek, PMF) Zagreb, srpanj 2018. Sadržaj 1. Uvod ....................................................................................................................................... 1 1.1. Optičko prepoznavanje .................................................................................................... 2 1.1.1. Optičko prepoznavanje znakova .............................................................................. 2 1.1.2. Inteligentno prepoznavanje znakova ........................................................................ 3 1.1.3. Prepoznavanje rukopisa ............................................................................................ 4 1.2. Herbarij ............................................................................................................................ 5 1.2.1. Herbarijske zbirke .................................................................................................... 6 1.2.2. Herbarijske etikete .................................................................................................... 6 2. Ciljevi ....................................................................................................................................
    [Show full text]