ECEN 4024 Capstone Design Project: Text Recognition for an Automated Test System

Ben Jespersen
Student
Electrical and Computer Engineering Department
Oklahoma State University
Stillwater, OK, USA

Matt Litchfield
Student
Electrical and Computer Engineering Department
Oklahoma State University
Stillwater, OK, USA

Chengjie Lin
Student
Electrical and Computer Engineering Department
Oklahoma State University
Stillwater, OK, USA

Abstract—Testing is a vital component of engineering design. Sometimes this can be a tedious task that keeps engineers busy and delays designing of new features and systems. This is particularly true in the medical industry, where regulations require devices to be tested extremely thoroughly to ensure they pose no threat to human health. One such medical device is a tablet computer used to program an implanted nerve stimulator developed by Cyberonics. Up until now, Cyberonics has tested this device by hand, requiring people to physically tap the tablet screen and verify that it performs as expected. The company would like to be able to test it automatically; however, certain constraints in the tablet’s software prevent this from being done through the traditional method of inserting additional test features into the main software. To solve this problem, Tietronix Software Inc. is currently in the process of developing an Automated Test System to test the tablet using a robotic stylus and camera instead. This system will test the tablet in a completely non-intrusive way, without requiring extra software on the tablet. It has great potential to streamline the engineering process by allowing engineers to work on other tasks while testing is carried out automatically. As part of this effort, Tietronix recruited a Capstone Design team at Oklahoma State University to develop a software module for the system. This module uses image processing technology to recognize specific strings of text on the tablet screen. The goal was for a test engineer to be able to input a specific text string by hand or from a file prepared in advance. The string recognition module will then search for this text on the tablet screen. When a string is found, the software module also taps the string on the tablet screen using the robotic stylus. This allows the test system to navigate through an application on the tablet without requiring an engineer to be present. The module also logs the results of each search to a file that can be referenced later. This allows an engineer to automatically run a sequence of text searches to determine if the tablet is working properly and displaying the correct information. The engineer need not be present at all; once the program is started, he can work on other things and check the output log file at his convenience. This module was successfully prototyped, and the details of its design will be discussed in this report.

Index Terms—Automated Test System (ATS); Capstone Design; Image Processing; Levenshtein; Optical Character Recognition (OCR);

Contents
I. Introduction
II. Project Description
   A. Overview
   B. Design Specifications
III. Design Details
   A. Work Breakdown
   B. Image Enhancement
   C. Optical Character Recognition (OCR)
   D. Image Capture
   E. ATS/OCR Interface
   F. Similarity Calculation
   G. User Input
   H. XY Plotter
   I. Stylus Control
   J. Log File Generation
   K. Overall Logic
IV. Test Methods and Results
V. Possibilities for Further Development
VI. Conclusion
VII. References
Appendix A: Report Authorship
Appendix B: Progress Reports


I. INTRODUCTION

The purpose of this report is to describe in detail a project undertaken by Ben Jespersen, Matt Litchfield, and Chengjie Lin for the electrical and computer engineering Capstone Design course at Oklahoma State University (OSU). The overall project goals and purpose will be described, followed by details of the subsystems designed by each team member. Methods of testing will also be discussed, along with the results thereof. Finally, possible future expansion of the project will be explored.

Graders for ECEN 4024 may wish to refer to Appendix A while reading. This appendix lists which sections of the report were authored by each member of the team.

II. PROJECT DESCRIPTION

A. Overview

Unlike most of the projects undertaken for Capstone Design, this one was sponsored by a company outside OSU. Tietronix Software Inc. is working on an Automated Test System (ATS) to test a tablet developed by Cyberonics for use in the medical field. This tablet is used by physicians to read data from and reprogram electronic devices implanted in patients. Due to the critical timing constraints involved in communicating with the implanted device, the programming tablet cannot be tested using traditional automation methods involving extra software running alongside the primary application. The tablet must be tested externally, with no additional software installed. Traditionally this has been done by test engineers who operate the tablet manually and record the results. However, Tietronix is in the process of developing the ATS to do this job automatically instead. The ATS consists of a robotic XY Plotter with a stylus attached and a camera mounted overhead. The tablet is placed under the system, and the system is able to take pictures of the tablet, process them, and tap certain locations on the screen in response. This method of testing could greatly improve the efficiency of development for the medical tablet because it will allow engineers to work on other tasks while the long, repetitive work of testing is being performed.

As part of this effort, Tietronix contacted OSU to recruit a Capstone Design team to add a software feature to the ATS. This new feature is a text recognition module, and it is the project detailed in this report. The goal was for the module to take in a string of text from a user or an input file, capture an image of the tablet screen with the camera, and process the image to determine if the input text appears on the tablet screen. If the text does appear, the ATS should tap the screen at the location of the text. The system should also log the results of the search to a file that a test engineer can read to analyze the tablet's performance.

This software module has great potential for usefulness in the ATS. First, it can allow automated testing of new text features when they are added to the tablet. An example might be adding support for a new language. One reason this would need to be tested is to ensure that a word in the new language does not become so long that it overlaps the edges of a button on the tablet screen. Another use for the module comes from the fact that the ATS taps the screen at the location of the text. If the text being searched for appears on a button, then this feature can be used to navigate to different pages in the tablet software or activate features of the tablet. This allows the ATS to automatically navigate throughout the software simply by having a test engineer specify a sequence of text strings to tap. A tester would not need to set up specific (x,y) coordinates to tap on the screen in advance; he could simply prepare a file containing a list of the text on each button he wants pressed.

B. Design Specifications

The following design specifications were defined early in the project. Many of these were given to the team directly by Tietronix, though some of them were defined by the team with the help of its mentor, Dr. George Scheets. The specifications were selected such that the work load would be appropriate for a single-semester capstone project for three students.

1. The string recognition shall accept string input from a language file or user input.
2. The string recognition shall locate a given string within a region of interest based on the string input. The region of interest may be a subsection of the whole screen.
3. The string recognition shall support the English language and at least one other language from the following list: Danish, Dutch, French, German, Italian, Norwegian, Spanish and Swedish.
4. The string recognition shall positively identify the string by activating (lowering) the stylus at the location of the string.
5. The string recognition shall be able to produce a similarity value that represents how likely the string was located, expressed as a percentage from 0 to 100%.
6. The string recognition shall be able to translate the string coordinates on an image to a physical (x,y) position on the XY Plotter.


7. The string recognition shall navigate four layers deep into the physician programming software application.
8. The string recognition shall take less than 30 seconds to identify the string at runtime.
9. The string recognition shall be able to function under typical indoor lighting conditions with minimal glare on the tablet screen.
10. The string recognition shall be able to recognize strings with the following font characteristics:
   a. High-contrast (font color is significantly different than background color).
   b. Size between 12 point and 20 point.
   c. Standard serif or sans-serif style (not script, gothic, or other atypical font styles).
11. The string recognition shall provide the ability to log "pass/fail" results to a text file to indicate if an expected string was presented on a given static screen.
12. The string recognition shall be able to run on one of the following operating systems: Windows, Linux/GNU, or UNIX.

III. DESIGN DETAILS

A. Work Breakdown

For this project, none of the members had any prior experience with image processing or stepper motor control. Because of this, blocks were made based on preliminary research and our requirements list. It made sense to estimate the difficulty of each task and then group the tasks into closely related blocks, such that the overall difficulty of each block would be about even.

The image enhancement and optical character recognition blocks made sense to group together. Because there was no prior experience with OCR, enhancing an image to yield the best results from Tesseract would require a lot of trial and error, and it would be very difficult to have two people work on those blocks independently. Ben took charge of these two blocks.

The next grouping was the image acquisition, user input, and OCR interface. These made sense to group together because they all happen before the image enhancement and OCR. In order to balance the workload, the similarity calculation was added to this group. This group was tasked to Matt.

The last grouping was the blocks that need to run after an image has been processed. They mainly make the plotter move to the correct location and generate the log file after the cycle is complete. Chengjie worked on these blocks.

Figure 1: Functional Block Diagram


B. Image Enhancement

Before the text on the tablet can be recognized, the image from the camera must be cleaned up and prepared. This is the job of the Image Enhancement subsystem. The subsystem consists of the following functions:

• detectLetters
• eraseBackground
• getAvgBoxHeight
• isolateText

All of these functions utilize an open source image processing suite called OpenCV. This package was already in use by Tietronix in the existing ATS software, so the team chose to continue using it for compatibility. It is also an extremely popular package, so there are many resources available for learning to use it.

The first important function in the image enhancement subsystem is detectLetters. This function is a modified version of code provided by user dhanushka at stackoverflow.com, and its purpose is to determine regions of the image where text appears. The function takes in an image and an expected font size and returns a set of rectangles, where each rectangle represents a region of the screen which is likely to contain text. Each region is as small as possible without cutting off portions of letters.

The core operation of the function detectLetters is unchanged from its original version on stackoverflow.com, but some changes were made to fit the needs of this project. First, the function was translated to Java from its original language, C++. Second, some constants were changed to better fit the type of image being processed in this project. Third, a feature was added to allow the function to search for text differently based on the expected font size. Finally, another feature was added to merge regions produced by the function if they are close to one another. This helps to filter out overlapping regions and combine regions containing individual words into single lines of text.

The function detectLetters works in the following way: first, the image is converted to grayscale. A new image is then constructed in which each point represents the gradient of the original image at that point. The result is that locations on the original image which contain sudden changes from dark pixels to light pixels (or vice-versa) show up as light points on the new image, while all other points show up as dark on the new image. A threshold is then applied to the new image to convert it to pure black and white instead of grayscale. The resulting image is therefore black with white pixels arranged along the outlines of objects from the original input image. These outlines can represent text or other objects that have sharp, contrasting edges.

Next, a morphological operation called "close" is performed on the image. This converts all pixels which are adjacent to white pixels to white, then immediately converts all pixels adjacent to black pixels to black. If the expected font size parameter is large, then the close operation extends 11 pixels to the sides instead of only affecting immediately adjacent pixels. If the font size is small, the operation extends only 7 pixels to the sides. The effect is that if multiple small objects were arranged close to one another horizontally in the original image, then the objects will be merged into wide, horizontal, predominantly white regions. This occurs most often in regions of text, because the letters are small objects arranged horizontally. Because the characters in text are so close together, the close operation tends to overlap white pixels between characters, resulting in a single white region instead of multiple small ones. It also tends to remove black pixels within lines of text. Non-text objects, however, tend to have more than 11 black pixels on either side of them. Since this is beyond the range of the close operation, the white in these objects is widened and then immediately shrunk back to the original size, resulting in no net change.

Next, the contours of all the white regions in the closed image are calculated. The contours represent the outlines of each white region. For each contour, the bounding box is calculated, resulting in a rectangular box around each white region. If the pixels inside a box are 35% white or more, if the box is at least eight pixels wide, and if the box is between 12 and 200 pixels high, then the box is saved as a region of detected text. All boxes which do not fit these criteria are discarded. This tends to filter out the objects which were not expanded by the close operation, because there are insufficient white pixels in these regions for them to be saved.

Finally, the boxes are checked to determine if any of them are very close together. Any boxes which are close to each other horizontally and which have very similar heights in the image are likely to represent text that appears on the same line. These boxes are therefore merged into one so that each single line is represented as a single box. After this, all the boxes which remain are returned as the set of detected text regions.
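As a concrete illustration of the steps just described, the sketch below shows one way the same pipeline could be written with the OpenCV Java bindings. It is a minimal sketch, not the project's actual detectLetters code: the kernel sizes, the use of an Otsu threshold, and the helper name detectTextRegions are assumptions made for illustration, and the merging of nearby boxes on the same line is omitted.

import org.opencv.core.*;
import org.opencv.imgproc.Imgproc;
import java.util.ArrayList;
import java.util.List;

public class TextRegionSketch {
    // Rough bounding boxes of likely text regions in a BGR image, following the steps above.
    static List<Rect> detectTextRegions(Mat input, boolean largeFont) {
        Mat gray = new Mat(), grad = new Mat(), bw = new Mat(), closed = new Mat();
        Imgproc.cvtColor(input, gray, Imgproc.COLOR_BGR2GRAY);

        // Morphological gradient: bright wherever the image changes sharply from dark to light.
        Mat edgeKernel = Imgproc.getStructuringElement(Imgproc.MORPH_ELLIPSE, new Size(3, 3));
        Imgproc.morphologyEx(gray, grad, Imgproc.MORPH_GRADIENT, edgeKernel);

        // Threshold the gradient image to pure black and white (Otsu chooses the level).
        Imgproc.threshold(grad, bw, 0, 255, Imgproc.THRESH_BINARY | Imgproc.THRESH_OTSU);

        // Horizontal "close" so neighboring letters merge into one white blob per line of text.
        int reach = largeFont ? 11 : 7;   // reach of the close operation, per the text above
        Mat closeKernel = Imgproc.getStructuringElement(Imgproc.MORPH_RECT, new Size(reach, 1));
        Imgproc.morphologyEx(bw, closed, Imgproc.MORPH_CLOSE, closeKernel);

        // Bounding boxes of the white blobs, filtered by fill ratio and size.
        List<MatOfPoint> contours = new ArrayList<>();
        Imgproc.findContours(closed, contours, new Mat(), Imgproc.RETR_EXTERNAL, Imgproc.CHAIN_APPROX_SIMPLE);
        List<Rect> regions = new ArrayList<>();
        for (MatOfPoint contour : contours) {
            Rect box = Imgproc.boundingRect(contour);
            double whiteRatio = Core.countNonZero(closed.submat(box)) / box.area();
            if (whiteRatio >= 0.35 && box.width >= 8 && box.height >= 12 && box.height <= 200) {
                regions.add(box);   // merging of nearby boxes on the same line is omitted here
            }
        }
        return regions;
    }
}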


The next function to discuss is eraseBackground. This function takes in an image and the set of detected text regions as inputs. It sets all pixels in the image to white if they are outside the regions of text. Pixels inside the regions are left unchanged. The resulting image is then returned. This erases all background elements which are not contained within the regions of text.

The third function is getAvgBoxHeight. This function takes in the set of detected text regions and calculates the average height of the regions. This average height is then used outside the function to determine what font size should be expected.

Finally, the last function to discuss is isolateText. This is the primary function which is called to execute the image enhancement subsystem. It takes in a raw image captured from a camera and a rectangular region of interest as inputs, and it returns an enhanced image as its output.

The function begins by increasing the contrast of the image and calling detectLetters with a large expected font size. The heights of the resulting boxes representing text regions are then averaged using getAvgBoxHeight. If the average height is smaller than a specific threshold, then detectLetters is called again with a smaller expected font size to improve its accuracy.

Next, the image is converted to grayscale and everything outside the region of interest is erased (set to white). This ensures that any text which appeared outside the region of interest is not recognized. The resolution of the image is then doubled. While this does not increase the sharpness or focus of the image, it does increase the pixel density, which improves the accuracy of the morphological operations which follow.

Once the resolution has been increased, a set of sub-images is created, where each sub-image is the portion of the full image inside one of the detected text regions. Each region in the set has a corresponding sub-image created. Note that these sub-images still exist as part of the full image; in other words, changing a sub-image also affects the corresponding location in the full image.

For each sub-image, a basic threshold is applied to convert the image from grayscale into plain black and white. This is known as a binary image. If the outline of the resulting sub-image is predominantly black, then the region must have contained light text on a dark background. The sub-image is then inverted to change it to black text on a white background. If instead the binary sub-image was predominantly white to begin with, then the region was already dark text on a light background. Instead of inverting, the threshold is re-done as an adaptive threshold. This improves the clarity of the text by accounting for the fact that different sub-images may have different brightness and contrast. The adaptive threshold automatically selects a threshold value to fit the characteristics of each sub-image.

The next step is to erase the background outside the text regions, which still appears as shades of gray. This is done by calling eraseBackground, which was described previously. At this point, the full image consists only of black text on a plain white background. However, because of inherent inaccuracies in the thresholding process, there is still noise in the image which must be filtered out for the character recognition to work properly.

To filter out noise, the morphological operations "close" and "open" are performed. The close operation was described above; any pixel adjacent to a white one is converted to white, then any pixel adjacent to a black one is converted to black. The result is that small black specks of a few pixels or less are erased completely from white regions. This filters out noise in the white background of the image. The open operation is exactly the opposite; the conversion to black is done first, then the conversion to white. This has exactly the same effect except that it removes small white specks from black regions. This filters out noise which appears within the black text.

At this point, the enhanced image is complete. The raw input image has been converted to plain black text on a white background. Both light text on a dark background and dark text on a light background have been converted to plain black text. Some non-text objects may still appear in the image, but by this point they are few enough that they will not impede the character recognition. Noise has also been filtered out to produce very clean, sharp text that can be interpreted easily.

Figure 2 demonstrates the operation of the image enhancement. The left half is a raw input image taken directly from the ATS camera, while the right half is the result of running the image enhancement. It can be seen that the edges of the tablet, the table, and the non-text elements on the tablet screen have been filtered out, leaving only the text remaining.


Figure 2: Image Enhancement Example
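To make the per-region processing in isolateText more concrete, the following is a small sketch of the two pieces described above: deciding whether a detected region needs to be inverted, and the close/open pass that removes specks of noise. It assumes the OpenCV Java bindings and an 8-bit grayscale sub-image; the threshold level, block size, and method names are illustrative rather than the project's exact values.

import org.opencv.core.*;
import org.opencv.imgproc.Imgproc;

public class RegionThresholdSketch {
    // Convert one detected text region (an 8-bit grayscale sub-image that is a view into the
    // full image) to black text on a white background, as described for isolateText above.
    static void binarizeRegion(Mat region) {
        Mat binary = new Mat();
        Imgproc.threshold(region, binary, 128, 255, Imgproc.THRESH_BINARY);

        double whiteFraction = Core.countNonZero(binary) / (double) (binary.rows() * binary.cols());
        if (whiteFraction < 0.5) {
            // Mostly black: the region held light text on a dark background, so invert it.
            Core.bitwise_not(binary, binary);
            binary.copyTo(region);
        } else {
            // Already dark text on a light background: redo the threshold adaptively so each
            // region gets a level suited to its own brightness and contrast.
            Mat adaptive = new Mat();
            Imgproc.adaptiveThreshold(region, adaptive, 255,
                    Imgproc.ADAPTIVE_THRESH_MEAN_C, Imgproc.THRESH_BINARY, 15, 10);
            adaptive.copyTo(region);
        }
    }

    // Close then open with a small kernel: the close removes black specks from the white
    // background, and the open removes white specks from within the black letters.
    static void removeNoise(Mat image) {
        Mat kernel = Imgproc.getStructuringElement(Imgproc.MORPH_RECT, new Size(3, 3));
        Imgproc.morphologyEx(image, image, Imgproc.MORPH_CLOSE, kernel);
        Imgproc.morphologyEx(image, image, Imgproc.MORPH_OPEN, kernel);
    }
}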

C. Optical Character Recognition (OCR)

The OCR subsystem is responsible for converting the enhanced image into a text file that can be read by the program. It is also responsible for determining the location of a specific string of text on the image.

Converting an image into a text file is a very complex task that has already been solved by large engineering teams working for companies. For this reason, an existing open-source program was selected to perform this step. This program is called Tesseract, and it is currently maintained by Google. It supports a multitude of languages, including English and Dutch, which were chosen as the two languages to be supported by this project. Users of the Object Recognition System described in this report will need to install Tesseract on their systems. Once this is done, the user still only needs to run the Object Recognition System program, and this program will handle using Tesseract correctly. The user never needs to interact with Tesseract aside from the initial installation.

The black and white image produced by the enhancement subsystem is passed to Tesseract as an input. Tesseract then produces two files made up of plain text. The first is a basic .txt file which contains all the text from the image. Tesseract breaks the text up into separate lines in an attempt to give it the same appearance and relative positioning as the text in the original image. The second file is a custom .box file. This file lists each individual character contained in the .txt file. The coordinates of a rectangle appear next to each character; this rectangle represents the region where that character appears in the image.

Once the conversion performed by Tesseract is complete, the resulting .txt file is passed to the similarity calculation subsystem, which will be described later. This subsystem returns the string of text from the .txt file which is most similar to the desired string. This occurs only if a string was found which meets a minimum similarity threshold.

After this string has been returned to the OCR subsystem, the .box file is searched for this specific string by calling the function getLocationInImage. In this function, the rectangles representing the locations of the first character in the string and the last character in the string are extracted from the file. From these, a new rectangle is created which extends from the upper left corner of the first character to the lower right corner of the last character. This results in a rectangle that represents the location of the entire string. Finally, the center point of this new rectangle is calculated. This center point is considered to be the location of the string in the image.

This location is then passed to the string position calculation subsystem in order for it to be converted to a physical position in the ATS work area.

Figure 3 shows the Tesseract conversion in action. The left half of the image is an example of an image that has already gone through the enhancement process. The right half of the image is the corresponding text file generated by Tesseract. It can be seen that the text recognition by Tesseract is very accurate once the image has been well enhanced.

Figure 3: Optical Character Recognition Example
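A sketch of the location lookup is shown below. It assumes the standard Tesseract box format, in which each line contains a character followed by the left, bottom, right, and top coordinates of its rectangle (measured from the bottom-left corner of the image) plus a page number. The method name and the space-stripping shortcut are simplifications of the real getLocationInImage, not the project's actual code.

import java.awt.Point;
import java.awt.Rectangle;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

public class BoxFileSketch {
    // Finds the center of 'target' using a Tesseract .box file. Box lines are assumed to be
    // "symbol left bottom right top page" with the origin at the bottom-left of the image,
    // and spaces are not listed, so they are stripped from the target before matching.
    static Point locate(String boxFile, String target, int imageHeight) throws Exception {
        List<String> lines = Files.readAllLines(Paths.get(boxFile));
        StringBuilder chars = new StringBuilder();
        List<Rectangle> boxes = new ArrayList<>();
        for (String line : lines) {
            String[] f = line.split(" ");
            if (f.length < 5) continue;
            int left = Integer.parseInt(f[1]), bottom = Integer.parseInt(f[2]);
            int right = Integer.parseInt(f[3]), top = Integer.parseInt(f[4]);
            chars.append(f[0]);
            // Flip the y-axis so the rectangle is in top-left-origin image coordinates.
            boxes.add(new Rectangle(left, imageHeight - top, right - left, top - bottom));
        }
        String needle = target.replace(" ", "");
        int start = chars.indexOf(needle);
        if (start < 0) return null;   // string not present in the box file
        Rectangle span = boxes.get(start).union(boxes.get(start + needle.length() - 1));
        return new Point(span.x + span.width / 2, span.y + span.height / 2);
    }
}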

D. Image Capture

Before any attempt can be made to recognize a string, a picture must be taken with the provided webcam. Because OpenCV is used for other parts of the program, it made sense to continue using it. There are already built-in objects for instantiating a web camera and capturing an image. One problem we did face was focusing the camera. In order to have a clear enough image to enhance, a better focus than what was being provided by the built-in autofocus of the camera had to be achieved. Logitech provides a program that gives manual control of the camera's focus. This program has to be run before ours, but only after a redeployment of the system that might have caused loss of focus.

E. ATS/OCR Interface

Tesseract, the program that is being used to perform the OCR, is a command line utility. As such, it needs a way to be called from the Java runtime environment. Because all that is needed is a simple command call with arguments, the Java Runtime library can be used. First, an instance of the current runtime must be created. Then the exec() method can be called with a string as its argument. If Tesseract is not installed, or the PATH environment variable does not include the directory where Tesseract was installed, trying to run the program will produce an error because the command cannot be found.
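The two subsystems above amount to only a few calls, sketched below. The camera index, file names, and language flag are placeholders, and error handling is reduced to a console message; the sketch assumes OpenCV's VideoCapture class and a tesseract executable reachable through the PATH.

import org.opencv.core.Core;
import org.opencv.core.Mat;
import org.opencv.imgcodecs.Imgcodecs;
import org.opencv.videoio.VideoCapture;

public class CaptureAndOcrSketch {
    public static void main(String[] args) throws Exception {
        System.loadLibrary(Core.NATIVE_LIBRARY_NAME);

        // Grab one frame from the first attached webcam and save it to disk.
        VideoCapture camera = new VideoCapture(0);
        Mat frame = new Mat();
        if (!camera.read(frame)) {
            System.out.println("Could not read a frame from the camera");
            return;
        }
        camera.release();
        Imgcodecs.imwrite("capture.png", frame);

        // Hand the (enhanced) image to Tesseract through the Java Runtime.
        // Using the "makebox" config instead would produce a .box file rather than a .txt file.
        Process p = Runtime.getRuntime().exec("tesseract capture.png output -l eng");
        if (p.waitFor() != 0) {
            System.out.println("Tesseract failed; is it installed and on the PATH?");
        }
    }
}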

F. Similarity Calculation

To determine how similar two strings are, the Levenshtein distance was calculated and subtracted from the length of the two strings, and the result was then divided by the length of the smaller of the two strings. Levenshtein distance determines the number of character insertions, deletions, or substitutions that are required in order to make one string into another. The reason it was chosen is that it allows for comparisons between different sized strings, and there was not a need for transpositions because they are not a common problem with Tesseract. It is a very common computer science problem, and the actual implementation will be left as an exercise to the reader.

Once we have the text from the processed image and the string we are looking for, we begin a windowed slide looking for a match above our threshold. For example, if the string we were looking for was 4024 and the text from the image was ECEN 4024, a window of 4 would be used.

In the current implementation, the first iteration that returns a similarity above the threshold will return the windowed string. As a possible improvement, a best fit search could be implemented instead of a first fit method.

There are three methods in the LevenshteinDistance class. All are implemented statically. The first returns the Levenshtein distance as an integer; this is the method where the Levenshtein algorithm is implemented, and an iterative version was chosen for its clarity. The next returns the confidence value as a double. The final method is the window slide that is explained above.

Iteration | To Find | Windowed String | Levenshtein Distance | Similarity
    1     |  4024   |     "ECEN"      |          4           |     0%
    2     |  4024   |     "CEN "      |          4           |     0%
    3     |  4024   |     "EN 4"      |          4           |     0%
    4     |  4024   |     "N 40"      |          4           |     0%
    5     |  4024   |     " 402"      |          2           |    50%
    6     |  4024   |     "4024"      |          0           |   100%

Table 1: Levenshtein Distance
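Since the implementation is left as an exercise above, the following sketch shows one way the three methods of the LevenshteinDistance class could look: an iterative distance, the similarity value, and the first-fit window slide. The exact similarity formula in the project may differ; this version divides by the length of the shorter string, which gives 50% for " 402" and 100% for "4024" as in the last two rows of Table 1.

public class LevenshteinSketch {
    // Iterative (two-row) Levenshtein distance between two strings.
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) prev[j] = j;
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = (a.charAt(i - 1) == b.charAt(j - 1)) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev; prev = curr; curr = tmp;
        }
        return prev[b.length()];
    }

    // Similarity as a fraction of the shorter string's length, per the description above.
    static double similarity(String a, String b) {
        int shorter = Math.min(a.length(), b.length());
        return Math.max(0, shorter - distance(a, b)) / (double) shorter;
    }

    // Slide a window the size of the target across the OCR text and return the first
    // window whose similarity meets the threshold (a "first fit" search).
    static String findWindow(String ocrText, String target, double threshold) {
        for (int i = 0; i + target.length() <= ocrText.length(); i++) {
            String window = ocrText.substring(i, i + target.length());
            if (similarity(target, window) >= threshold) return window;
        }
        return null;
    }
}

With these definitions, findWindow("ECEN 4024", "4024", 0.85) walks through the same six windows listed in Table 1 and returns "4024".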

G. User Input

The goal of the user input block was to allow both manual input and a test bench file. This was accomplished by constructing a graphical user interface (GUI). The GUI had input boxes for various search functions along the bottom and a test bench selector under the "File" tab. Among the search functions were a number of options, such as language and boundaries, that allowed for more exhaustive testing. The image is 1920 x 1080, so entering values between those numbers bounds the search to the box drawn by the two sets of coordinates.

The GUI also displayed three images. From left to right: the raw captured image, the enhanced image with the text identified and non-text stripped away, and the original image with a red indicator dot showing where the string was found. Creating a full GUI was beyond our specs; however, it does a good job of showing the overall process, as well as creating an easy to use product.

The GUI was constructed from a base JFrame. In the constructor the frame was configured with a BorderLayout. JTextFields and JLabels were placed in a JPanel, which was placed in the south area. Three JLabels were constructed and placed in the middle three regions. They were given the three different images to show the user how the image was processed. In the north section, a JMenu was made for starting the JFileChooser and calibration window.

In order for the images to be updated, a thread was started that polled the image files at a set interval. The buttons also needed ActionListeners so that they would know what to do when pressed. The frame implemented the ActionListener interface and assigned itself to be the ActionListener for all of its objects. A case statement was used to determine what to do on each button press. In the case of performing a test, a threaded call was made to the string finding code. All of the threads were written as anonymous inner classes because the calls were not very lengthy.


Figure 4: Graphical User Interface
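A stripped-down skeleton of the layout described above is shown below. It is only a sketch: the component labels, menu entry, and placeholder search call are illustrative, and the image-polling thread and calibration window are omitted.

import javax.swing.*;
import java.awt.*;
import java.awt.event.ActionEvent;
import java.awt.event.ActionListener;

public class GuiSketch extends JFrame implements ActionListener {
    private final JTextField searchField = new JTextField(20);
    private final JButton searchButton = new JButton("Search");

    GuiSketch() {
        super("String Recognition");
        setLayout(new BorderLayout());

        // Three labels across the middle show the raw, enhanced, and annotated images.
        JPanel images = new JPanel(new GridLayout(1, 3));
        images.add(new JLabel("raw image"));
        images.add(new JLabel("enhanced image"));
        images.add(new JLabel("result image"));
        add(images, BorderLayout.CENTER);

        // Search controls along the bottom.
        JPanel controls = new JPanel();
        controls.add(new JLabel("String:"));
        controls.add(searchField);
        controls.add(searchButton);
        add(controls, BorderLayout.SOUTH);
        searchButton.addActionListener(this);

        // "File" menu for loading a prepared test bench file.
        JMenuBar menuBar = new JMenuBar();
        JMenu fileMenu = new JMenu("File");
        fileMenu.add(new JMenuItem("Open Test Bench..."));
        menuBar.add(fileMenu);
        setJMenuBar(menuBar);

        setDefaultCloseOperation(JFrame.EXIT_ON_CLOSE);
        pack();
    }

    @Override
    public void actionPerformed(ActionEvent e) {
        if (e.getSource() == searchButton) {
            final String target = searchField.getText();
            // Run the (long) search off the event thread, as the report describes.
            new Thread(new Runnable() {
                public void run() { System.out.println("searching for: " + target); }
            }).start();
        }
    }

    public static void main(String[] args) {
        SwingUtilities.invokeLater(new Runnable() {
            public void run() { new GuiSketch().setVisible(true); }
        });
    }
}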

H. XY Plotter

1) Introduction to the XY Plotter

The XY plotter used in this project is provided by Tietronix: the Makeblock XY-Plotter Robot Kit v2.0. The XY Plotter Robot Kit is a drawing robot that can move a pen or other instrument to draw digital artwork on a flat surface. It can also be developed into a laser engraver by adding the Laser Engraver Upgrade Pack. It has the following characteristics:

• Frame: anodized aluminum
• Physical dimensions: 620 mm x 620 mm x 140 mm
• Working area: 310 mm x 390 mm
• XY accuracy: 0.1 mm
• Max working speed: 50 mm/s
• Power: 100-240 V, 50/60 Hz AC/DC power adapter, 12 V / 3.0 A
• Main controller board: Makeblock Orion (Arduino Leonardo)
• Makeblock A4988 stepper motor driver
• Makeblock K-Power 9g micro servo
• Makeblock 42BYG stepper motor
• USB A-Male to B-Male cable

Figure 5: XY Plotter Side View

Figure 6: XY Plotter Top View

2) Configuring the XY Plotter

The XY plotter must first be configured for this project. First, we plug in the power and turn on the XY plotter.

Second, we connect the Arduino Leonardo to the laptop. After that, we download Tietronix's firmware code for the XY plotter and open it in the Arduino IDE. Finally, we upload the code to the Arduino Leonardo. Once this is done, the XY plotter is configured for this project.

I. Stylus Control

To control the stylus, serial communication must first be established between the laptop and the Arduino Leonardo. To move the stylus, G-code commands are sent to the Arduino Leonardo from the laptop. The easiest way to do this is to use the Arduino IDE, but that is not really what we want because the whole project is written in Java. We therefore need serial communication code for the Java environment so that the Java code can also send G-code to the Arduino Leonardo. Fortunately, Tietronix's baseline code provides this function, and we only need to modify it for our project. The baseline code looks like the following:

private void setPenPosition(float Z) {
    if (isConnected) {
        String cmd = "G01 Z" + Z;
        while (!CM.sendCmd(cmd, SetPenPosCommandTimeout));
        current_Z = Z;
    } else {
        System.out.println("XYPlotter: Not Connected!");
    }
}

Next, we need to translate coordinates on the image into physical coordinates on the XY plotter so that the plotter can move to the correct point. To do that, we first place the tablet in a known position, which we can verify using the Calibration box in the camera view; the tablet must be positioned so that it fits inside the Calibration box. We then move the stylus to each of the four corners of the tablet and record the physical coordinates of the corners, which are shown in Figures 7 through 10.

Figure 7: Upper Left Corner's Physical Coordinates: (105,340)

Figure 8: Upper Right Corner's Physical Coordinates: (290,340)

Figure 9: Lower Left Corner's Physical Coordinates: (105,35)

Figure 10: Lower Right Corner's Physical Coordinates: (290,35)
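The baseline only shows the Z (pen up/down) command; an XY move can be sent with the same pattern. The method below is not part of the baseline code shown above. It is only a sketch of how a G01 move to a physical (X,Y) position might be issued, assuming current_X and current_Y fields exist alongside current_Z:

// Hypothetical companion to setPenPosition(): move the stylus to a physical (X, Y)
// position by sending a G01 command over the same serial link. Not in the baseline code.
private void setXYPosition(float X, float Y) {
    if (isConnected) {
        String cmd = "G01 X" + X + " Y" + Y;
        while (!CM.sendCmd(cmd, SetPenPosCommandTimeout));
        current_X = X;   // assumed fields, mirroring current_Z above
        current_Y = Y;
    } else {
        System.out.println("XYPlotter: Not Connected!");
    }
}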


So we know the physical coordinates of the tablet's four corners, and it is now straightforward to translate coordinates on the image into physical coordinates.

For example, suppose we have an image coordinate (x1,y1); it is also known that the image is 1920*1080 pixels, and the physical coordinate is assumed to be (x2,y2). We know that (0,0) on the image corresponds to (105,35) on the XY plotter and that (1080,1920) corresponds to (290,340) on the XY plotter. We then calculate the relative pixel distance between (x1,y1) and (0,0), translate the pixel distance into a physical distance, and add it to the (105,35) physical coordinate. The formula is as follows:

double x1 = a.strLocation.x;
double y1 = a.strLocation.y;
double boundX = 1080;
double boundY = 1920;
double x2 = (x1 / boundX) * 185 + 105;
double y2 = (1 - y1 / boundY) * 305 + 35;
if (y2 > 300) {
    y2 = y2 + 10;
}

Notice that if y2 is greater than 300, we add 10 more, because in our experience the higher the point the stylus moves to, the more error there is. Adding an offset of 10 to the original formula reduces this error to a great extent.

To make the stylus tap the screen for an appropriate amount of time, we need to change the ZDown parameter to a smaller number.

Original:

Modified:

Finally, in XYPlotter.java, the old dependencies on Tietronix's baseline classes are removed and dependencies on the classes of this project are added.

J. Log File Generation

To make the system more user friendly, we decided to generate log files to keep track of things and record the status. Log file generation is an easy part; a log file is generated every time a cycle is processed. The contents of the log file are as follows:

• Time stamp: when the process begins
• Success or failure: whether the search was a success or a failure
• Target string: the string we are looking for
• Location of the string: the position of the string on the image
• Confidence value: confidence in the correctness of this process
• Confidence threshold: the confidence we set as a threshold to recognize the string

An example log file is shown in Figure 11; as can be seen, it has all the elements needed to show useful information about the status of the process.

Figure 11: Log File

K. Overall Logic

Finally, we need to organize the project into different function blocks, each of which contains several classes. The String Recognition block is in charge of recognizing the desired string in the processed image and outputting the location of the string. It has the following classes: FindString.java, LevenshteinDistance.java, RunCommand.java and StringRecognizer.java.

The Utilities block provides utilities to generate the log file, move the stylus, perform the coordinate transformation, and so on.


The classes used for this block are AbstractExternalDevice.java, Attributes.java, CommwithArduino.java, CommunicationManager.java, DeviceManager.java, Serial.java, XYplotter.java and Util.java.

The GUI block serves as the GUI for the user, making the system more user friendly. Its class list is FileTreeView.java, JImagePanel.java and SingleRootFileSystemView.java.

Last but not least, we have the Image Preprocess block. It helps to preprocess the image for the string recognition. It has the following classes: ImageInput.java, OCRFrame.java, CalibrationFrame.java and Enhance.java.

Figure 12: Overall Logic Flowchart

IV. TEST METHODS AND RESULTS

Once the project was integrated and complete, it was important to perform final tests to ensure it met the specifications agreed upon. Some of the specifications can be verified simply by observing the program functioning, while some required slightly more careful testing. While reading this section, the reader may find it useful to periodically turn back to Section II.B of this report to reference the project specifications.

The GUI described previously contains a text field where a user can enter text for the program to search for. In addition, it also contains a menu in the upper left corner labeled "File." One of the options in this menu is to load a file saved on the user's computer. This allows the user to open a file containing a sequence of text strings that he or she has prepared in advance. The program will then run through these strings in order, searching for each one. This satisfies specification 1, which requires the program to accept user or file input.

Specification 2 requires the program to search within a region of interest that may be defined by the user. This was met by including entry fields on the GUI to allow entry of a rectangular region of interest. Because the GUI displays all text which was recognized in each run, correct operation of the region of interest was verified by observing that the image of recognized text was blank everywhere outside the region of interest.

Specification 10 requires the program to recognize high-contrast font between 12 and 20 point size. To test this, a sheet was printed with both black text on a white background and white text on a black background. Both colors were also printed in the maximum and minimum required font sizes, resulting in four test regions on the page. Text was successfully recognized in all four regions when tests were run, verifying these requirements.

The remaining specifications were tested using an input file specifying a sequence of text strings to search for automatically. Almost all of the text in this sequence represents buttons, so the sequence navigates through the tablet software as it executes. The sequence begins on the tablet's home page, which contains four buttons. The sequence detects the text on each button in turn, which navigates the tablet to the corresponding page. When each page is opened, the test sequence searches for "Exit" or "Cancel" to move back to the home page in preparation for searching for the next button. This results in navigation through four different screens as required by specification 7. The fourth screen of the tablet software includes a language selection option. Instead of cancelling out of this screen, the test sequence searches for and taps the appropriate text to change the language from English to Dutch. It then backs out to the home screen, then returns to the language selection to verify that recognition of Dutch text is successful. The sequence then changes the language back to English, backs out to the home screen, and searches for a final non-button string of text.

This test sequence runs successfully. The program is able to navigate through all the desired screens and options using only text recognition. A log file was successfully generated during the test, and it was analyzed after the run. It correctly displayed the date and time of each text search, a pass/fail indicator showing whether the text was found, an assurance percentage of how likely it was that the correct text was found, the location in the image where the text was found, and the threshold assurance percentage required for the recognition to count as a pass. The test sequence was used multiple times for demonstration at the end of the course, allowing the program to run automatically while the team discussed it with observers. This very successfully demonstrated the goal of the project; the program ran automatically without requiring intervention from the team until the test sequence ended.

V. POSSIBILITIES FOR FURTHER DEVELOPMENT

As this project is merely one component of a larger work in progress, it is important to identify potential ways in which it can be improved and expanded as it is added into the full system.

Throughout the design process, the team has identified areas which can be improved or expanded upon but which were not able to be addressed as part of this project due to time constraints.

The first area of improvement relates to the accuracy of the OCR subsystem. Tesseract has a feature called training, which allows it to be specifically tailored for certain fonts and text styles. This is done by taking several sample images of the text that will be recognized and telling Tesseract exactly what all the text says. If this is done with several different images, Tesseract learns how to recognize that type of text more accurately. The team considered doing this for the project, but there was also another alternative for increasing accuracy: improving the image enhancement subsystem. There was not time in a single semester to do both, and the decision was ultimately made to spend what time was available on the image enhancement rather than training Tesseract. However, the system can benefit greatly from both, so training could provide a very significant improvement.

The next area of improvement relates to comparing the text found on the tablet to the text being searched for. As previously mentioned, the current implementation locates the first string on the tablet which has a similarity percentage above the threshold defined by the user. It is possible, however, for more than one string on the screen to meet the threshold. If the first string which meets the threshold is not quite as similar to the desired string as the second string which meets the threshold is, it may be preferable to return the string which is more similar rather than simply the first one found. This is a relatively simple feature that could be added in the future.

Finally, the accuracy of the XY plotter's taps on the tablet screen can be improved as well. The current implementation uses a simple linear transformation from the string's coordinates on the camera image to its physical coordinates in the XY plotter's area, and this transformation results in some error along the y-axis. A more robust transformation could be developed to ensure the stylus always taps directly in the center of the located string.

VI. CONCLUSION

This project ended with great success. While there is certainly room for expansion and improvement, the team was able to fully demonstrate proof of concept for using image processing and text recognition technologies in a testing environment. Because the end product is a standalone module that will be integrated into the Tietronix ATS, there is a wide range of ways it could be used, including verification of text, simple automated navigation through software screens, and more.

With any design project, it is important to ensure that the end product delivered meets the expectations and needs of the client for whom it was made. About two weeks before the final course deadline, the team was able to demonstrate the working design to Chris DuPont, the Director of Product and Business Development in the Life Science Division at Tietronix, who had been the primary point of contact with the team throughout the project. After the demonstration, the team requested that Mr. DuPont provide a brief statement describing his opinion of what he had seen. The following quote is the statement he gave:

In the medical device industry, the overhead of design controls, testing, and regulatory compliance can be daunting. The challenge for this Senior Design Team was to take a "real-life" problem of designing and implementing a working prototype to passively test a commercial FDA Class III Physician Programmer using a robotic system which would ultimately relieve the human of the burdensome task of repetitive testing. The Physician Programmer interrogates and programs a human implantable pulse generator used to treat neurological disorders. This team, mentored by Dr. Scheets, met the task head-on and made steady progress each week. The team never missed a goal and our weekly status meetings always started on time - which I much appreciated. I was most impressed as I personally observed a demonstration of the fully working prototype last week in Engineering South. A BIG thanks goes out to OSU, Dr. Scheets, and this Senior Design Team - Ben Jespersen, Matt Litchfield, Chengjie Lin - for a "Job Well Done!"

VII. REFERENCES

[1] A. Gohr, "Linux OCR Software Comparison", Splitbrain.org, 2010. [Online]. Available: http://www.splitbrain.org/blog/2010-06/15-linux_ocr_software_comparison. [Accessed: 26-Jan-2016].
[2] A. Wilson, "Developers Look to Open Sources for Vision Algorithms", Vision-systems.com, 2014. [Online]. Available: http://www.vision-systems.com/articles/print/volume-19/issue-3/features/developers-look-to-open-sources-for-vision-algorithms.html. [Accessed: 25-Jan-2016].
[3] C. Hodges, "Automated Test System (ATS) Use Cases and Features," Tietronix Software, Inc., Houston, TX, Tech. Rep. 1.0, May 2015.
[4] Cognitiveforms.com, "CuneiForm". [Online]. Available: http://cognitiveforms.com/ru/products_and_services/cuneiform. [Accessed: 26-Jan-2016].
[5] Embedded Vision Alliance, "SimpleCV: Is An 'OpenCV For The Masses' Necessary?", 2011. [Online]. Available: http://www.embedded-vision.com/news/2011/09/30/simplecv-opencv-masses-necessary. [Accessed: 25-Jan-2016].
[6] "Extracting text OpenCV", Stackoverflow.com, 2014. [Online]. Available: http://stackoverflow.com/questions/23506105/extracting-text-opencv. [Accessed: 29-Apr-2016].
[7] Opencv.org, "OpenCV". [Online]. Available: http://opencv.org/. [Accessed: 24-Jan-2016].
[8] P. Harker et al., "Object Recognition For a Robotic User Interface Automated Test System," Stillwater, OK, Tech. Rep., Dec. 2015.
[9] Researchgate.net, "Which is the best opencv or matlab for image processing?". [Online]. Available: https://www.researchgate.net/post/Which_is_the_best_opencv_or_matlab_for_image_processing. [Accessed: 25-Jan-2016].
[10] S. Dhiman and A. Singh, "Tesseract Vs Gocr A Comparative Study", International Journal of Recent Technology and Engineering, vol. 2, no. 4, 2013.
[11] Simplecv.org, "SimpleCV". [Online]. Available: http://simplecv.org/. [Accessed: 25-Jan-2016].


APPENDIX A: REPORT AUTHORSHIP

The purpose of this appendix is to indicate which members of the team wrote each section of this report. This was a requirement for the Capstone Design course to facilitate grading. Each team member is listed below, along with the sections of the report he wrote.

Ben Jespersen:
• Abstract
• I: Introduction
• II.A: Project Description
• II.B: Design Specifications
• III.B: Image Enhancement
• III.C: Optical Character Recognition (OCR)
• IV: Test Methods and Results
• V: Possibilities for Further Development
• VI: Conclusion
• Appendix A: Report Authorship
• Appendix B: Progress Reports

Matt Litchfield:
• III.A: Work Breakdown
• III.D: Image Capture
• III.E: ATS/OCR Interface
• III.F: Similarity Calculation
• III.G: User Input

Chengjie Lin:
• III.H: XY Plotter
• III.I: Stylus Control
• III.J: Log File Generation
• III.K: Overall Logic


APPENDIX B: PROGRESS REPORTS

The following pages contain the progress reports given in meetings the team held with its mentor, Dr. Scheets, and with representatives from Tietronix Software Inc. Early in the semester, these reports took the form of simple meeting notes and were used only for the team's benefit. Later, in response to a request from Tietronix, they were changed to more of a progress report format, describing what was accomplished each week, what the goal for the next week was, and what roadblocks were encountered each week. These reports were used to keep Tietronix up to date on the team's progress as well as to help the team stay organized and set goals. They are reproduced here in their original format, ordered by date.

Mentor Meeting 1/21/16

Meeting Goals

 Obtain a detailed description of the project from Tietronix  Determine specifications for the project. What does Tietronix want to have delivered by the end of the semester?  Check specifications with Dr. Scheets. Does he feel that they meet the requirements of the course?  Determine a weekly meeting time.

Meeting Notes

Project Description:

Primary objective: text recognition. Read a text file, read a string from it, find string in the tablet screen. 9 different languages need to be supported. Need to verify whether a string exists on the screen.

We will be provided a working baseline. We can choose to use last semester's code or start from tietronix's baseline. Last year's code will be available to us. Design team's code has some glitches, but tietronix's is mostly glitch-free right now.

Specs will be provided hopefully by close of business Friday (tomorrow).

Step-by-step example: text file is opened. String is picked out. Text is “program patient”. Camera moves to center of tablet and takes screenshot. Analyze screen, look for string. If not found, test case fails. If so, case passes.

Second example: same string. Plotter presses a button saying “interrogate patient.” Data is retrieved and displayed. Select option to change output current. This activates program patient button. Identify string and pass or fail.

Goals Before Next Meeting

 Read last year's proposal and feature document.  Find test system and start experimenting.  Create proposal presentation.  Meet Dr. Scheets between 10:30am – 3:30pm on Monday to show work completed over the weekend.


Mentor Meeting 1/25/16

Meeting Goals

 Show Dr. Scheets our draft of the proposal presentation; discuss potential changes.  Discuss Tietronix’s specifications and potential changes with Dr. Scheets.

Meeting Notes

 Proposal presentation looks good overall  Some of Tietronix’s specifications do not fit the nature of the class very well o Remove Tietronix specs 9 and 10 due to difficulty of testing them o Change Tietronix spec 11 to specify simply a general type of room lighting rather than a specific percent variation.  Some specs should be added to narrow our scope a little o Add spec 11 detailing types of strings that should be recognizable o Add spec 13 to detail supported /s  For clarification, add the phrase “at the location of the string” to spec 4.

Goals Before Next Meeting

 Send Dr. Scheets a copy of presentation for him to read on his own time  Give proposal presentation

Mentor Meeting 2/4/16

Meeting Goals

 Report on current progress: o Ran Makeblock’s firmware on ATS. Able to use it to move plotter around. o Currently setting up OpenCV environment. Has proven difficult in Windows, so now using Linux.  Ask about pulse generator interrogation. It hasn’t worked for us so far; wand battery is probably dead.  Discuss Cyberonics app backlighting bug.

Meeting Notes

 Backlight bug should have been fixed in Cyberonics app version 11.0.4.3.  If we cannot get the backlight setting to work in the app, there is an “Easter egg” to allow setting it outside the app.  Programming wand should have a fresh battery in it already.  Power light on programming wand is burnt out.  Data light on programming wand should blink if wand is operating.  One of the pulse generators is known to have a dead battery.

Goals Before Next Meeting

 Email Cody to set up file transfer for ATS baseline  Get Tietronix’s baseline running on the ATS  Verify Cyberonics app software version is 11.0.4.3 and configuration is 2.0-OUS


Mentor Meeting 2/11/16

Meeting Goals

 Progress report for Tietronix and Cyberonics.  Baseline is running successfully?  Programming wand works now after replacing the battery. We have successfully communicated with the two working pulse generators.  Can now compile with OpenCV and have begun experimenting with image enhancement.  Still cannot change backlight, despite verifying app version is 11.0.4.3 with configuration 2.0-OUS. Can we be told how to get outside the app to change brightness that way?

Meeting Notes

 Baseline is not running successfully after all. It appears we misunderstood what exactly Tietronix wanted us to get running.  Tietronix baseline is the netbeans project from bitbucket.  To change the backlight on the tablet, one must first exit out of the app. o There are two invisible buttons on either side of the battery status bar on the User Preferences page. o To exit the app, press the buttons in this order: left 3 times, right 1 time, left 2 times, right 2 times, left 1 time, right 3 times. o If you make a mistake, you must leave the User Preferences page, then come back and try again.

Goals Before Next Meeting

 Get the Tietronix baseline running (netbeans project)

Progress Report 2/18/16

Accomplishments Since Last Report

 The team has successfully built and run the NetBeans project containing the Tietronix ATS baseline software. We were able to use the baseline to communicate with the ATS plotter and create and run a simple test case.
 We have made progress with OpenCV image enhancement. Text can currently be recognized from the app's home screen with approximately 81.7% accuracy. There are two main reasons which drive this number down. The first is that some blocks of text are not recognized at all by the algorithm, so they are completely deleted in the process of enhancing the image. This occurs before the Optical Character Recognition (OCR) phase even begins. The second is that the algorithm currently only works well with dark text on a light background, but the app's home screen has a line of light text on a dark background. If only the blocks of dark text which were successfully recognized by the algorithm which determines where text appears are considered, then the accuracy is approximately 96.6%. The greatest improvement, then, can be achieved by improving the way in which the algorithm determines the regions of the screen which contain text and by adding support for light text on a dark background. Note that the accuracy numbers are very rough estimates, calculated using the simple formula accuracy = 1 - (mistaken characters) / (total correct characters).


App home screen test image.

Result of the algorithm which determines the regions of the screen to enhance for the OCR phase.


Result of the OCR phase.

Goals for Next Report

 Examine the software baselines from both Tietronix and the Fall 2015 Capstone Design team to determine which we would like to ultimately integrate our software into. This will affect which language we need to use, so although it is not necessary to decide before our prototyping presentation, we would like to decide as soon as possible to avoid redoing work later on.  Continue to improve the accuracy of the image enhancement algorithm. The focus will be on improving the way the algorithm determines which regions of the screen to enhance prior to the OCR phase.

Roadblocks

 There are no roadblocks that we are aware of at the moment.

OSU Capstone Design ATS Project Progress Report February 25, 2016

Accomplishments Since Last Report

 The team has decided we will integrate our code into the Java baseline from Tietronix. Because we had to start prototyping before this decision was reached, our prototypes are currently coded in C++. We will therefore continue using C++ for the next week or two until our prototyping demonstration is complete, then as part of the integration phase we will convert our code to Java for compatibility with the baseline.  The image enhancement algorithm has been improved. The algorithm now produces much higher quality isolated text images than it did last week. Because of this improvement, the camera resolution is much less likely to be a problem than we were afraid it might be last week.

Goals for Next Week

 We will make a presentation for our prototype presentation. The presentation will be given on March 3. Note that, because this presentation is scheduled for 3:30 to 6:30, we will not be able to hold our weekly Skype meeting next week. We will resume the following week.

Roadblocks

 The tablet is not recognizing taps from the stylus currently mounted on the ATS plotter. We believe this can be solved by replacing the stylus with a different one.
 We are still unable to change the brightness of the tablet. As a result, we have decided to remove the specification concerning tablet brightness from our spec sheet.

OSU Capstone Design ATS Project Progress Report March 10, 2016

Accomplishments Since Last Report

 We gave a presentation in our Capstone Design course about where each team member stands currently on our individual prototypes. This presentation will be included in the email along with this report. This marks the end of the prototyping phase of the project and the beginning of the integration phase. We will be combining our subsystems together into a final product over the next several weeks.  We have decided not to order a higher resolution camera. After doing some testing, we believe we will be able to meet all of our project specifications with the current camera.

Goals for Next Meeting

 We will not be holding a meeting next week (March 17) because this is during OSU’s spring break. We will resume meeting in two weeks, on March 24.  We will begin integrating our subsystems together between now and the next meeting. Because we will be going separate ways over spring break and the integration phase requires close collaboration, we do not intend to have two weeks’ worth of work done by the next meeting. We do, however, plan to have at least some of the subsystems working together.

Roadblocks

 There are no roadblocks that we are aware of at the moment.

OSU Capstone Design ATS Project Progress Report March 24, 2016

Accomplishments Since Last Report

 We have begun making the necessary modifications to the code for our individual subsystems in preparation for combining the subsystems together. This primarily consists of removing extra logic used to allow the subsystems to function as stand-alone programs for prototype testing.

Goals for Next Meeting

 We plan to have Matt’s subsystems integrated with Ben’s subsystems. These systems include user input, image acquisition, ATS/OCR interface, string similarity calculation, image enhancement, and optical character recognition. These systems can fully function together without adding Chengjie’s systems, so this is where we will begin. We intend to have them fully working together by the next meeting.

Roadblocks

 There are no roadblocks that we are aware of at the moment.


OSU Capstone Design ATS Project Progress Report March 31, 2016

Accomplishments Since Last Report

 We have integrated the following components of the program: o Image Acquisition o ATS/OCR Interface o Image Enhancement o Optical Character Recognition o Similarity Calculation These components can now be run together as a single stand-alone program. Please see the next page for an example test case demonstrating the operation of the program as it exists now.

Goals for Next Meeting

 We will continue integrating this week. The components which still have yet to be integrated are: o User Input o String Position Calculation o Log File Generation o Stylus Control o Overall Test Logic We are not certain yet how many of these components we will be able to integrate by next week. Our goal is to complete as many of them as possible.

Roadblocks

 There are no roadblocks that we are aware of at the moment.


Example Test Case

When the program as it stands now is run, the following sequence of steps is executed. This test case was run on the “View Database” screen in the Cyberonics app as an example case. 1. Capture image from the webcam.

2. Enhance the image by isolating the text. This stage may still be improved if time permits, but it is functioning well as it stands now.


3. Run Tesseract on the image to produce a plain text file. The following image shows the text output from Tesseract. It is not 100% perfect, nor do we expect it to be completely flawless this semester. But it is successfully recognizing the great majority of text.

4. Locate a string which satisfies the desired confidence value. For this test, we were searching for the string “View Parameter History” with 85% confidence. The program was able to find this entire string (it can be seen in line 17 of the image above). 5. Use Tesseract’s output file to determine where in the image the string was located and draw a circle on the original image in the location where the stylus will eventually tap the image. The image below shows that the program successfully found the string’s location.


OSU Capstone Design ATS Project Progress Report April 7, 2016

Accomplishments Since Last Report

 We have integrated the following components of the project: o Overall Test Logic o String Position Calculation o Stylus Control These components have been added to the integrated components we listed last week.  We have successfully tested navigating through multiple screens with the string recognition module.

Goals for Next Meeting

 We will add the final components that have yet to be integrated: o Log File Generation o User Input  We will be meeting with Chris in person next Thursday, April 14. We will demonstrate the project’s operation here at OSU. Note that this means we will not need to have our usual Skype meeting next week.

Roadblocks

 We have come across an issue that really is not a roadblock for us, but it is something you should be aware of. Occasionally the blue highlighting of a button can flicker continuously after the tablet is turned on. Navigating to a different screen and coming back does not stop the button flickering. If this occurs, the string recognition will have trouble recognizing the text on the flickering button. The flickering can, however, be stopped by restarting the tablet. So this does not impede our ability to test using the tablet, but it may cause future problems when someone tries to use the string recognition module for an automated test and is not aware the flickering is occurring.


Example Test Case

When the program as it stands now is run, the following sequence of steps is executed. This test case was run on the “View Database” screen in the Cyberonics app as an example case. 6. Capture image from the webcam.

7. Enhance the image by isolating the text. This stage may still be improved if time permits, but it is functioning well as it stands now.


8. Run Tesseract on the image to produce a plain text file. The following image shows the text output from Tesseract. It is not 100% perfect, nor do we expect it to be completely flawless this semester. But it is successfully recognizing the great majority of text.

9. Locate a string which satisfies the desired confidence value. For this test, we were searching for the string “View Parameter History” with 85% confidence. The program was able to find this entire string (it can be seen in line 17 of the image above). 10. Use Tesseract’s output file to determine where in the image the string was located and draw a circle on the original image in the location where the stylus will eventually tap the image. The image below shows that the program successfully found the string’s location.
