Comparative Study of Table Layout Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Comparative Study of Table Layout Analysis Master of Science in Computer Science March 2018 Comparative study of table layout analysis Layout analysis solutions study for Swedish historical hand-written document Xusheng Liang Faculty of Computing, Blekinge Institute of Technology, 371 79 Karlskrona, Sweden This thesis is submitted to the Faculty of Computing at Blekinge Institute of Technology in partial fulfilment of the requirements for the degree of Master of Science in Computer Science. The thesis is equivalent to 20 weeks of full time studies. The authors declare that they are the sole authors of this thesis and that they have not used any sources other than those listed in the bibliography and identified as references. They further declare that they have not submitted this thesis at any other institution to obtain a degree. Contact Information: Author(s): Xusheng Liang E-mail: [email protected] University advisor: Cheddad, Abbas E-mail: [email protected] Department of Computer Science and Engineering Faculty of Computing Internet : www.bth.se Blekinge Institute of Technology Phone : +46 455 38 50 00 SE-371 79 Karlskrona, Sweden Fax : +46 455 38 50 57 ii ABSTRACT Background. Nowadays, information retrieval system become more and more popular, it helps people retrieve information more efficiently and accelerates daily task. Within this context, Image processing technology play an important role that help transcribing content in printed or handwritten documents into digital data in information retrieval system. This transcribing procedure is called document digitization. In this transcribing procedure, image processing technique such as layout analysis and word recognition are employed to segment the document content and transcribe the image content into words. At this point, a Swedish company (ArkivDigital® AB) has a demand to transcribe their document data into digital data. Objectives. In this study, the aim is to find out effective solution to extract document layout regard to the Swedish handwritten historical documents, which are featured by their tabular forms containing the handwritten content. In this case, outcome of application of OCRopus, OCRfeeder, traditional image processing techniques, machine learning techniques on Swedish historical hand-written document is compared and studied. Methods. Implementation and experiment are used to develop three comparative solutions in this study. One is Hessian filtering with mask operation; another one is Gabor filtering with morphological open operation; the last one is Gabor filtering with machine learning classification. In the last solution, different alternatives were explored to build up document layout extraction pipeline. Hessian filter and Gabor filter are evaluated; Secondly, filter images with the better filter evaluated at previous stage, then refine the filtered image with Hough line transform method. Third, extract transfer learning feature and custom feature. Fourth, feed classifier with previous extracted features and analyze the result. After implementing all the solutions, sample set of the Swedish historical handwritten document is applied with these solutions and compare their performance with survey. Results. Both open source OCR system OCRopus and OCRfeeder fail to deliver the outcome due to these systems are designed to handle general document layout instead of table layout. Tradition image processing solutions work in more than a half of the cases, but it does not work well. Combining traditional image process technique and machine leaning technique give the best result, but with great time cost. Conclusions. Results shows that existing OCR system cannot carry layout analysis task in our Swedish historical handwritten document. Traditional image processing techniques are capable to extract the general table layout in these documents. By introducing machine learning technique, better and more accurate table layout can be extracted, but comes with a bigger time cost. Keywords: layout analysis, pattern recognition, document digitalization, table layout extraction, transfer learning, machine learning, image processing, Hessian filter, Gabor filter iii ACKNOWLEDGEMENT Due to personal issue, this study was spent longer time than expected to finish. During this period, thanks my supervisor Abbas being patient and provide good guidance and support to me. Also, thanks my family unconditional support and the fellows participated in evaluation work. Besides, thanks Swedish medical care taking good care of me. iv ACRONYMS DSSHDUHGSDJHV ͳʹǦͳ͵ǡͳͷǡʹ͸Ǧ͵͵ǡ͵ͷǡ͵͹ǡ ͵ͻǡͶ͵ǡͶͷǦͶͺ ʹʹǡʹ͸ǡʹ͹ǡ͵ͷ ͳͷǦͳ͸ Ǧ Ǥ ͳͶ Ǧ ͵ͲǦ͵ͳ ͵ͷǡ͵ͺ ͵ͳǡ͵͹ Ǥ ͳͳǡͳ͵ǡͳͷǡʹͳǦʹʹǡ͵ͻ ͵ͻ ͵ͳǦ͵ͶǡͶ͸ǦͶͺ Ǥ ͳͳǡ͵ͳǦ͵͵ǡͶ͸ǦͶͺ ͳͶ Ǥ ͳͳǡͳͷǡ͵ͳǦ͵͸ǡͶ͸ǦͶͺǡͷͲǦ ͸͵ v GLOSSARY DSSHDUHGSDJHV ௪௜ௗ௧௛ Ǥ ͳ͵ǡͳͷǡ͵Ͳ ௛௘௜௚௛௧ ͳͷ Ǥ ͵ͺ ͳͷ Ǥ Ǥ ͳͷ Ǥ ʹͻ ͵͵ ⅵ-ⅶ, ͳͷǡʹͶ Ǥ ͳ ̵ ͵ʹǦ͵ͶǡͶ͸ǦͶͺ ǡ οǡⅷǡͳͲǦͳͳǡͳͶǦ Ǥ ͳͷǡͳ͹Ǧʹ͹ǡ͵͸Ǧ͵͹ǡ ǡǡ ͵ͻǡͶ͵ǡͶͻǦ͸ʹ Ǥ ͵Ͳ ͵Ͳ ǡ Ǥ ͳ͹Ǧͳͺ Ǥ ǡ ͳ͹ǡ͵Ͳǡ͵ͷǡ͵ͺ Ǥ Ǥ ⅷǡʹͶǦʹͺǡ͵ͷ Ǥ ǡ Ǥ ୲୰୳ୣ୮୭ୱ୧୲୧୴ୣ Ǣ ͳͶǡͶͷǦͶ͸ ୲୰୳ୣ୮୭ୱ୧୲୧୴ୣା୤ୟ୪ୱୣ୮୭ୱ୧୲୧୴ୣ Ǥ Ȁ ͳ͵Ǧͳͷǡʹ͹ǡͶʹ ȀǤ Ȁ Ǥ ୲୰୳ୣ୮୭ୱ୧୲୧୴ୣ Ǣ ͵͵ǡͶ͸ǦͶ͹ ୲୰୳ୣ୮୭ୱ୧୲୧୴ୣା୤ୟ୪ୱୣ୬ୣ୥ୟ୲୧୴ୣ Ǥ ͳ͵ǦͳͶǡʹͳ Ǥ Ǥ ͳ͵Ǧͳͷ Ǥ vi ʹͷǦʹͺ Ǥ vii CONTENTS ABSTRACT ......................................................................................................................................................... III ACKNOWLEDGEMENT .................................................................................................................................. IV ACRONYMS ......................................................................................................................................................... V GLOSSARY ......................................................................................................................................................... VI CONTENTS ...................................................................................................................................................... VIII 1 INTRODUCTION ...................................................................................................................................... 10 1.1 BACKGROUND ...................................................................................................................................... 10 1.2 AIM AND OBJECTIVES ........................................................................................................................... 10 1.2.1 Research gap ................................................................................................................................... 10 1.2.2 Aim .................................................................................................................................................. 11 1.2.3 Objectives ........................................................................................................................................ 11 1.3 RESEARCH QUESTIONS ......................................................................................................................... 11 1.4 STRUCTURE OF STUDY .......................................................................................................................... 11 1.5 LIMITATION .......................................................................................................................................... 11 1.6 CONTRIBUTION ..................................................................................................................................... 12 1.7 ETHICS ................................................................................................................................................. 12 2 RELATED WORK .................................................................................................................................... 13 3 METHOD .................................................................................................................................................... 17 3.1 FILTER APPLICATION ............................................................................................................................ 17 3.1.1 Hessian filter ................................................................................................................................... 17 3.1.2 Gabor filter ..................................................................................................................................... 18 3.2 OPEN SOURCE TOOL APPLICATIONS ...................................................................................................... 21 3.2.1 OCRopus testing .............................................................................................................................. 21 3.2.2 OCRfeeder testing ........................................................................................................................... 22 3.3 IMPLEMENTATION ................................................................................................................................ 22 3.3.1 Image processing solution I - Hessian filter with mask application................................................ 22 3.3.2 Image processing solution II - Gabor filter with morphological opening ....................................... 24 3.3.3 Solution III - Connected Component with Machine Learning Classifier ........................................ 26 3.3.4 Experiments in implantation ..........................................................................................................
Recommended publications
  • OCR Pwds and Assistive Qatari Using OCR Issue No
    Arabic Optical State of the Smart Character Art in Arabic Apps for Recognition OCR PWDs and Assistive Qatari using OCR Issue no. 15 Technology Research Nafath Efforts Page 04 Page 07 Page 27 Machine Learning, Deep Learning and OCR Revitalizing Technology Arabic Optical Character Recognition (OCR) Technology at Qatar National Library Overview of Arabic OCR and Related Applications www.mada.org.qa Nafath About AboutIssue 15 Content Mada Nafath3 Page Nafath aims to be a key information 04 Arabic Optical Character resource for disseminating the facts about Recognition and Assistive Mada Center is a private institution for public benefit, which latest trends and innovation in the field of Technology was founded in 2010 as an initiative that aims at promoting ICT Accessibility. It is published in English digital inclusion and building a technology-based community and Arabic languages on a quarterly basis 07 State of the Art in Arabic OCR that meets the needs of persons with functional limitations and intends to be a window of information Qatari Research Efforts (PFLs) – persons with disabilities (PWDs) and the elderly in to the world, highlighting the pioneering Qatar. Mada today is the world’s Center of Excellence in digital work done in our field to meet the growing access in Arabic. Overview of Arabic demands of ICT Accessibility and Assistive 11 OCR and Related Through strategic partnerships, the center works to Technology products and services in Qatar Applications enable the education, culture and community sectors and the Arab region. through ICT to achieve an inclusive community and educational system. The Center achieves its goals 14 Examples of Optical by building partners’ capabilities and supporting the Character Recognition Tools development and accreditation of digital platforms in accordance with international standards of digital access.
    [Show full text]
  • Durchleuchtet PDF Ist Der Standard Für Den Austausch Von Dokumenten, Denn PDF-Dateien Sehen Auf
    WORKSHOP PDF-Dateien © alphaspirit, 123RF © alphaspirit, PDF-Dateien verarbeiten und durchsuchbar machen Durchleuchtet PDF ist der Standard für den Austausch von Dokumenten, denn PDF-Dateien sehen auf Daniel Tibi, allen Rechnern gleich aus. Für Linux gibt es zahlreiche Tools, mit denen Sie alle Möglich- Christoph Langner, Hans-Georg Eßer keiten dieses Dateiformats ausreizen. okumente unterschiedlichster Art, in einem gedruckten Text, Textstellen mar- denen Sie über eine Texterkennung noch von Rechnungen über Bedie- kieren oder Anmerkungen hinzufügen. eine Textebene hinzufügen müssen. D nungsanleitungen bis hin zu Bü- Als Texterkennungsprogramm für Linux chern und wissenschaftlichen Arbeiten, Texterkennung empfiehlt sich die OCR-Engine Tesseract werden heute digital verschickt, verbrei- Um die Möglichkeiten des PDF-Formats [1]. Die meisten Distributionen führen das tet und genutzt – vorzugsweise im platt- voll auszureizen, sollten PDF-Dateien Programm in ihren Paketquellen: formunabhängigen PDF-Format. Durch- durchsuchbar sein. So durchstöbern Sie l Unter OpenSuse installieren Sie tesse­ suchbare Dokumente erleichtern das etwa gleich mehrere Dokumente nach be- ract­­ocr und eines der Sprachpakete, schnelle Auffinden einer bestimmten stimmten Wörtern und finden innerhalb z. B. tesseract­ocr­traineddata­german. Stelle in der Datei, Metadaten liefern zu- einer Datei über die Suchfunktion des (Das Paket für die englische Sprache sätzliche Informationen. PDF-Betrachters schnell die richtige Stelle. richtet OpenSuse automatisch mit ein.) Zudem gibt es zahlreiche Möglichkei- PDF-Dateien, die Sie mit LaTeX oder Libre- l Für Ubuntu und Linux Mint wählen ten, PDF-Dokumente zu bearbeiten: Ganz Office erstellen, lassen sich üblicherweise Sie tesseract­ocr und ein Sprachpaket, nach Bedarf lassen sich Seiten entfernen, bereits durchsuchen. Anders sieht es je- wie etwa tesseract­ocr­deu.
    [Show full text]
  • Igalia Desktop Summit, Berlin, Aug 2011
    Igalia Desktop Summit, Berlin, Aug 2011 Juan José Sánchez Penas | [email protected] | www.igalia.com About Igalia 2 ● Open source consultancy founded in 2001 ● Privately owned (independent), flat internal structure ● Headquarters: north west of Spain (A Coruña, Galicia) ● ~45 open source developers from many countries, working from different locations ● What we do: ● Development, consultancy, training,... ● We offer our upstream expertise to help others building platforms, products and solutions Juan José SánchezPenas | [email protected] | www.igalia.com What we do 3 ● Areas/Teams: Kernel/OS, multimedia, graphics, browsers, compilers, accessibility, ... ● Platforms/Technologies: GNOME, WebKit, MeeGo, Freedesktop.org, Qt, ... ● Experience: Many projects for relevant international companies Related to platform, middleware and app development Creation of and contribution to upstream components Juan José Sánchez | [email protected] | www.igalia.com Main affiliations 4 ● Members of GNOME Foundation's Advisory Board (2007) ● Patrons of FSF (2011) ● Members of Linux Foundation (2011) Juan José Sánchez | [email protected] | www.igalia.com Events 5 ● WebKitGTK+ Hackfest, A Coruña, 2009, 2010 and 2011 ● GUADEC Hispana, A Coruña, 2005 and 2010 ● GUADEMY, A Coruña, 2007 ● GTK+ Hackfest, A Coruña, October 2010 ● ATK Hackfest, A Coruña, May 2011 Juan José Sánchez | [email protected] | www.igalia.com WebKit 6 Juan José Sánchez | [email protected] | www.igalia.com WebKit 7 ● Key for integration of web technologies in the desktop ● Stable
    [Show full text]
  • Tecnologías Libres Para La Traducción Y Su Evaluación
    FACULTAD DE CIENCIAS HUMANAS Y SOCIALES DEPARTAMENTO DE TRADUCCIÓN Y COMUNICACIÓN Tecnologías libres para la traducción y su evaluación Presentado por: Silvia Andrea Flórez Giraldo Dirigido por: Dra. Amparo Alcina Caudet Universitat Jaume I Castellón de la Plana, diciembre de 2012 AGRADECIMIENTOS Quiero agradecer muy especialmente a la Dra. Amparo Alcina, directora de esta tesis, en primer lugar por haberme acogido en el máster Tecnoloc y el grupo de investigación TecnoLeTTra y por haberme animado luego a continuar con mi investigación como proyecto de doctorado. Sus sugerencias y comentarios fueron fundamentales para el desarrollo de esta tesis. Agradezco también al Dr. Grabriel Quiroz, quien como profesor durante mi último año en la Licenciatura en Traducción en la Universidad de Antioquia (Medellín, Colombia) despertó mi interés por la informática aplicada a la traducción. De igual manera, agradezco a mis estudiantes de Traducción Asistida por Computador en la misma universidad por interesarse en el software libre y por motivarme a buscar herramientas alternativas que pudiéramos utilizar en clase sin tener que depender de versiones de demostración ni recurrir a la piratería. A mi colega Pedro, que comparte conmigo el interés por la informática aplicada a la traducción y por el software libre, le agradezco la oportunidad de llevar la teoría a la práctica profesional durante todos estos años. Quisiera agradecer a Esperanza, Anna, Verónica y Ewelina, compañeras de aventuras en la UJI, por haber sido mi grupo de apoyo y estar siempre ahí para escucharme en los momentos más difíciles. Mis más sinceros agradecimientos también a María por ser esa voz de aliento y cordura que necesitaba escuchar para seguir adelante y llegar a feliz término con este proyecto.
    [Show full text]
  • Gradu04243.Pdf
    Paperilomakkeesta tietomalliin Kalle Malin Tampereen yliopisto Tietojenkäsittelytieteiden laitos Tietojenkäsittelyoppi Pro gradu -tutkielma Ohjaaja: Erkki Mäkinen Toukokuu 2010 i Tampereen yliopisto Tietojenkäsittelytieteiden laitos Tietojenkäsittelyoppi Kalle Malin: Paperilomakkeesta tietomalliin Pro gradu -tutkielma, 61 sivua, 3 liitesivua Toukokuu 2010 Tässä tutkimuksessa käsitellään paperilomakkeiden digitalisointiin liittyvää kokonaisprosessia yleisellä tasolla. Prosessiin tutustutaan tarkastelemalla eri osa-alueiden toimintoja ja laitteita kokonaisjärjestelmän vaatimusten näkökul- masta. Tarkastelu aloitetaan paperilomakkeiden skannaamisesta ja lopetetaan kerättyjen tietojen tallentamiseen tietomalliin. Lisäksi luodaan silmäys markki- noilla oleviin valmisratkaisuihin, jotka sisältävät prosessin kannalta oleelliset toiminnot. Avainsanat ja -sanonnat: lomake, skannaus, lomakerakenne, lomakemalli, OCR, OFR, tietomalli. ii Lyhenteet ADRT = Adaptive Document Recoginition Technology API = Application Programming Interface BAG = Block Adjacency Graph DIR = Document Image Recognition dpi= Dots Per Inch ICR = Intelligent Character Recognition IFPS = Intelligent Forms Processing System IR = Information Retrieval IRM = Image and Records Management IWR = Intelligent Word Recognition NAS = Network Attached Storage OCR = Optical Character Recognition OFR = Optical Form Recognition OHR = Optical Handwriting Recognition OMR = Optical Mark Recognition PDF = Portable Document Format SAN = Storage Area Networks SDK = Software Development Kit SLM
    [Show full text]
  • Informe Tradución Ao Galego Do Contorno GNOME 3.0
    INFORME DE TRADUCIÓN AO GALEGO DO CONTORNO GNOME 3.0 ABRIL 2011 Oficina de Software Libre da USC www.usc.es/osl [email protected] LICENZA DO DOCUMENTO Este documento pode empregarse, modificarse e redistribuírse baixo dos termos de unha das seguintes licenzas, a escoller: GNU Free Documentation License 1.3 Copyright (C) 2009 Oficina de Software Libre da USC. Garántese o permiso para copiar, distribuír e/ou modificar este documento baixo dos termos da GNU Free Documentation License versión 1.3 ou, baixo o seu criterio, calquera versión posterior publicada pola Free Software Foundation; sen seccións invariantes, sen textos de portada e sen textos de contraportada. Pode achar o texto íntegro da licenza en: http://www.gnu.org/copyleft/fdl.html Creative Commons Atribución – CompartirIgual 3.0 Copyright (C) 2009 Oficina de Software Libre da USC. Vostede é libre de: • Copiar, distribuír e comunicar publicamente a obra • Facer obras derivadas Baixo das condicións seguintes: • Recoñecemento. Debe recoñecer os créditos da obra do xeito especificado polo autor ou polo licenciador (pero non de xeito que suxira que ten o seu apoio ou apoian o uso que fan da súa obra. • Compartir baixo a mesma licenza.. Se transforma ou modifica esta obra para crear unha obra derivada, só pode distribuír a obra resultante baixo a mesma licenza, unha similar ou unha compatíbel. Pode achar o texto íntegro da licenza en: http://creativecommons.org/licenses/by-sa/3.0/es/deed.gl TÁBOA DE CONTIDOS Licenza do documento............................................................................................................3
    [Show full text]
  • Field Guide to Software for Nonprofit Immigration Advocates, Organizers, and Service Providers
    THE FIELD GUIDE TO SOFTWARE FOR NONPROFIT IMMIGRATION ADVOCATES, ORGANIZERS, AND SERVICE PROVIDERS By the Immigration Advocates Network and Idealware THE FIELD GUIDE TO SOFTWARE FOR NONPROFIT IMMIGRATION ADVOCATES, ORGANIZERS, AND SERVICE PROVIDERS By the Immigration Advocates Network and Idealware THE FIELD GUIDE TO SOFTWARE FOREWORD Welcome, The Field Guide to Software is a joint effort between the Immigration Advocates Network and Idealware. Through straightforward overviews, it helps pinpoint the types of software that might be useful for the needs of nonprofit immigration advocates, organizers, and service providers and provides user- friendly summaries to demystify the possible options. It covers tried-and-true and emerging tools and technolgies, and best practices and specific aspects of nonprofit software. There’s also a section to guide you through the sometimes daunting process of choosing and implementing software. We know you have your hands full and don’t always have time to keep up with the latest information about the software that can help your organization. That’s where this guide can help. Thank you for all you do to make the world a better place. We hope this Field Guide will help you do it all just a little more easily. Matthew Burnett Karen Graham Director, Executive Director, Immigration Advocates Network Idealware iii THE FIELD GUIDE TO SOFTWARE TABLE OF CONTENTS TABLE 1. Introduction 7 • Understanding What You Need 8 • Every Organization Needs 10 2. Case Studies: Putting Tools to Use 13 • Using Technology to Expand Legal Services: Ayuda Delaware 14 • A Holistic Approach to Serving Immigrants: Benevolent Charities of Oklahoma 17 • Giving Voice to Immigrants: Idaho Coalition for Immigrants and Refugees 20 3.
    [Show full text]
  • Indicators for Missing Maintainership in Collaborative Open Source Projects
    TECHNISCHE UNIVERSITÄT CAROLO-WILHELMINA ZU BRAUNSCHWEIG Studienarbeit Indicators for Missing Maintainership in Collaborative Open Source Projects Andre Klapper February 04, 2013 Institute of Software Engineering and Automotive Informatics Prof. Dr.-Ing. Ina Schaefer Supervisor: Michael Dukaczewski Affidavit Hereby I, Andre Klapper, declare that I wrote the present thesis without any assis- tance from third parties and without any sources than those indicated in the thesis itself. Braunschweig / Prague, February 04, 2013 Abstract The thesis provides an attempt to use freely accessible metadata in order to identify missing maintainership in free and open source software projects by querying various data sources and rating the gathered information. GNOME and Apache are used as case studies. License This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) license. Keywords Maintenance, Activity, Open Source, Free Software, Metrics, Metadata, DOAP Contents List of Tablesx 1 Introduction1 1.1 Problem and Motivation.........................1 1.2 Objective.................................2 1.3 Outline...................................3 2 Theoretical Background4 2.1 Reasons for Inactivity..........................4 2.2 Problems Caused by Inactivity......................4 2.3 Ways to Pass Maintainership.......................5 3 Data Sources in Projects7 3.1 Identification and Accessibility......................7 3.2 Potential Sources and their Exploitability................7 3.2.1 Code Repositories.........................8 3.2.2 Mailing Lists...........................9 3.2.3 IRC Chat.............................9 3.2.4 Wikis............................... 10 3.2.5 Issue Tracking Systems...................... 11 3.2.6 Forums............................... 12 3.2.7 Releases.............................. 12 3.2.8 Patch Review........................... 13 3.2.9 Social Media............................ 13 3.2.10 Other Sources..........................
    [Show full text]
  • GNOME 3 Application Development Beginner's Guide
    GNOME 3 Application Development Beginner's Guide Step-by-step practical guide to get to grips with GNOME application development Mohammad Anwari BIRMINGHAM - MUMBAI GNOME 3 Application Development Beginner's Guide Copyright © 2013 Packt Publishing All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews. Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book. Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information. First published: February 2013 Production Reference: 1080213 Published by Packt Publishing Ltd. Livery Place 35 Livery Street Birmingham B3 2PB, UK. ISBN 978-1-84951-942-7 www.packtpub.com Cover Image by Duraid Fatouhi ([email protected]) Credits Author Project Coordinator Mohammad Anwari Abhishek Kori Reviewers Proofreader Dhi Aurrahman Mario Cecere Joaquim Rocha Indexer Acquisition Editor Tejal Soni Mary Jasmine Graphics Lead Technical Editor Aditi Gajjar Ankita Shashi Production Coordinator Technical Editors Aparna Bhagat Charmaine Pereira Cover Work Dominic Pereira Aparna Bhagat Copy Editors Laxmi Subramanian Aditya Nair Alfida Paiva Ruta Waghmare Insiya Morbiwala About the Author Mohammad Anwari is a software hacker from Indonesia with more than 13 years of experience in software development.
    [Show full text]
  • Pdfswqokdvt2o.Pdf
    GNOME 3 Application Development Beginner's Guide Step-by-step practical guide to get to grips with GNOME application development Mohammad Anwari BIRMINGHAM - MUMBAI GNOME 3 Application Development Beginner's Guide Copyright © 2013 Packt Publishing All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews. Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book. Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information. First published: February 2013 Production Reference: 1080213 Published by Packt Publishing Ltd. Livery Place 35 Livery Street Birmingham B3 2PB, UK. ISBN 978-1-84951-942-7 www.packtpub.com Cover Image by Duraid Fatouhi ([email protected]) Credits Author Project Coordinator Mohammad Anwari Abhishek Kori Reviewers Proofreader Dhi Aurrahman Mario Cecere Joaquim Rocha Indexer Acquisition Editor Tejal Soni Mary Jasmine Graphics Lead Technical Editor Aditi Gajjar Ankita Shashi Production Coordinator Technical Editors Aparna Bhagat Charmaine Pereira Cover Work Dominic Pereira Aparna Bhagat Copy Editors Laxmi Subramanian Aditya Nair Alfida Paiva Ruta Waghmare Insiya Morbiwala About the Author Mohammad Anwari is a software hacker from Indonesia with more than 13 years of experience in software development.
    [Show full text]
  • JETIR Research Journal
    © 2019 JETIR June 2019, Volume 6, Issue 6 www.jetir.org (ISSN-2349-5162) IMAGE TEXT CONVERSION FROM REGIONAL LANGUAGE TO SPEECH/TEXT IN LOCAL LANGUAGE 1Sayee Tale,2Manali Umbarkar,3Vishwajit Singh Javriya,4Mamta Wanjre 1Student,2Student,3Student,4Assistant Professor 1Electronics and Telecommunication, 1AISSMS, Institute of Information Technology, Pune, India. Abstract: The drivers who drive in other states are unable to recognize the languages on the sign board. So this project helps them to understand the signs in different language and also they will be able to listen it through speaker. This paper describes the working of two module image processing module and voice processing module. This is done majorly using Raspberry Pi using the technology OCR (optical character recognition) technique. This system is constituted by raspberry Pi, camera, speaker, audio playback module. So the system will help in decreasing the accidents causes due to wrong sign recognition. Keywords: Raspberry pi model B+, Tesseract OCR, camera module, recording and playback module,switch. 1. Introduction: In today’s world life is too important and one cannot loose it simply in accidents. The accident rates in today’s world are increasing day by day. The last data says that 78% accidents were because of driver’s fault. There are many faults of drivers and one of them is that they are unable to read the signs and instructions written on boards when they drove into other states. Though the instruction are for them only but they are not able to make it. So keeping this thing in mind we have proposed a system which will help them to understand the sign boards written in regional language.
    [Show full text]
  • Upgrade Issues
    Upgrade issues Graph of new conflicts libsiloh5-0 libhdf5-lam-1.8.4 (x 3) xul-ext-dispmua (x 2) liboss4-salsa-asound2 (x 2) why sysklogd console-cyrillic (x 9) libxqilla-dev libxerces-c2-dev iceape xul-ext-adblock-plus gnat-4.4 pcscada-dbg Explanations of conflicts pcscada-dbg libpcscada2-dev gnat-4.6 gnat-4.4 Similar to gnat-4.4: libpolyorb1-dev libapq-postgresql1-dev adacontrol libxmlada3.2-dev libapq1-dev libaws-bin libtexttools2-dev libpolyorb-dbg libnarval1-dev libgnat-4.4-dbg libapq-dbg libncursesada1-dev libtemplates-parser11.5-dev asis-programs libgnadeodbc1-dev libalog-base-dbg liblog4ada1-dev libgnomeada2.14.2-dbg libgnomeada2.14.2-dev adabrowse libgnadecommon1-dev libgnatvsn4.4-dbg libgnatvsn4.4-dev libflorist2009-dev libopentoken2-dev libgnadesqlite3-1-dev libnarval-dbg libalog1-full-dev adacgi0 libalog0.3-base libasis2008-dbg libxmlezout1-dev libasis2008-dev libgnatvsn-dev libalog0.3-full libaws2.7-dev libgmpada2-dev libgtkada2.14.2-dbg libgtkada2.14.2-dev libasis2008 ghdl libgnatprj-dev gnat libgnatprj4.4-dbg libgnatprj4.4-dev libaunit1-dev libadasockets3-dev libalog1-base-dev libapq-postgresql-dbg libalog-full-dbg Weight: 5 Problematic packages: pcscada-dbg hostapd initscripts sysklogd Weight: 993 Problematic packages: hostapd | initscripts initscripts sysklogd Similar to initscripts: conglomerate libnet-akamai-perl erlang-base screenlets xlbiff plasma-widget-yawp-dbg fso-config- general gforge-mta-courier libnet-jifty-perl bind9 libplack-middleware-session-perl libmail-listdetector-perl masqmail libcomedi0 taxbird ukopp
    [Show full text]