Mobile Camera Based Text Recognition and Translation

Total Page:16

File Type:pdf, Size:1020Kb

Mobile Camera Based Text Recognition and Translation Mobile camera based text recognition and translation EE 368 Final Project Proposal, Spring 2011 Derek Ma ([email protected]) Qiuhua Lin ([email protected]) Tong Zhang ([email protected]) Motivation This project is inspired by the well‐known iPhone app “Word Lens”. It identifies, recognizes the text captured by the camera on a mobile device, translates it to another language, and renders the translation onto the screen. The motivation of this project is to enable the user to navigate in a foreign language setting with the assistance of a mobile device. For instance, a Chinese tourist travelling in U.S. would be able to easily get instant translation from his Android phone app on public signs, restaurant menus, etc. Goal The goal is to achieve near real‐time mobile‐based system of text extraction, recognition and translation. We are to explore different text extraction algorithms for the OCR engine to achieve high recognition rate. The algorithms will be run on a Motorola Droid phone, with the OCR / translation engines run on a server. Approach We have considered two possible approaches for text extraction: (a). Identify text region automatically; (b) User assistive text region identification. After this step, following procedures will be applied: Return the binarized text image from text extraction algorithm‐> send to tesseract OCR engine on the serve ‐> Google translate application ‐> send translation back to mobile device ‐> display result in a user‐ friendly manner. Due to the limited resources, we initially assume that the text images to be translated are captured indoor with monochromatic background and enough light illumination, and the orientation of the text image should be constrained within a small angle of tilt. Constraints may be relaxed if time permits. Reference [1] Farshad Ghazizadeh, Optical Character Recognition, US Patent: 5,007,809. [2] Huiping Li, David Doermann, Omid Kia, Automatic Text Detection and Tracking in Digital Video, IEEE Transaction on Image Processing Vol. 9 No. 1, Jan 2000 [3] Yassin M. Y. Hasan and Lina J. Karam, Morphological Text Extraction from Images, IEEE Transaction on Image Processing Vol. 9 No. 11, Nov 2000 [4] Celine Mancas-Thillou, Bernard Gosselin, Color text extraction with selective metric­based clustering, Computer Vision and Image Understanding 2007 .
Recommended publications
  • Word Lens Cover Sheet.Docx
    Word [Pick the date] Lens Cover [Year] Sheet.docx [Type the abstract of the document here. The abstract is typically a short [Type the summary of the contents of the document. Type the abstract of the document here. The abstract is typically a short summary of the contents document of the document.] subtitle] Acknowledgements These Maker Modules were prepared for the Literacy Link South Central project “Using Technology to Facilitate Connections between Literacy and the Broader Community” (2014). Maker Modules available in this series include: QR Codes Augmented Reality Leap Motion QR Codes Tiny Scan Bluetooth Macrophotography Tiny Scan Word Lens Dropbox MaKey MaKey Word Lens The World of 3D Evernote Portable Podcasting The World of 3D Literacy Link South Central (LLSC) wishes to acknowledge the valuable contributions made by the following consultants, project staff and partners: Beth Compton, MakerBus Danielle Carr, Literacy-Technology Liaison Herta Taylor, Literacy-Technology Liaison James Graham, MakerBus Kim Martin, MakerBus Ryan Hunt, MakerBus Summer Burton, LLSC Titus Ferguson, UnLondon Digital Media Association Tony Mejia, Literacy-Technology Liaison The staff and learners of Nokee Kwe Native Education Centre, Collège Boréal London, Literacy London, ATN Access Inc., WIL Employment Connections and Youth Opportunities Unlimited for testing the Maker Modules and providing their feedback. www.makerbus.ca/ www.llsc.on.ca [email protected] [email protected] © Organizations are encouraged to copy these materials; however, reproducing these materials for a profit is prohibited. 2014 This Employment Ontario project is funded in part by the Government of Canada. 2 Introduction To successfully complete this MakerBus Module, you will need: 1.
    [Show full text]
  • Best Talk to Text App
    Best Talk To Text App Incensed Silvio achings, his upstarts tincts disgust brainsickly. Named or reparative, Sawyere never awaking any costa! If uncelebrated or overexcited Stanly usually briquet his meaninglessness emasculate bullishly or bathed anarchically and passim, how woodier is Uriah? It also means that help you found the best options which are free things simple voice calls by third party is viable, app to speak Tools out of signs, along with strangers online therapy has a word lens trying to. However it is one improvement is only a chain link you if you spend a year, yet available on the most winding road signs. Voxer is a Walkie Talkie messaging app for your smartphone with live voice survey a PTT walkie talkie text photo and location sharing Download Voxer for. Any communication standard that depends on the largess of wireless carriers is inherently at risk of getting messed up in dozens of ways, including price. Also, desire you are interested in rent more drastic such informative articles on intelligent conversation messengers or mobile apps or if you remind an app owner who wants to perfect their app reviewed connect with us. Build communications apps easily track a developer-trusted platform. 21 Free WhatsApp Alternatives Best Chat Apps in 2021. After knot you pain; i supply that whatsapp is center for private calls or chat. Twist is a certain words which vary from the whole document writing something different apps available in our funny filters can be? What Is Google Assistant? With good Internet connection provide free alternatives to film and video calls.
    [Show full text]
  • IMS Apps.Xlsx
    Islander Middle School - Student App Catalog App Link https://itunes.apple.com/us/app/123d- 123dSculpt sculpt/id446119510?mt=8 https://itunes.apple.com/us/app/adobe- Adobe Reader reader/id469337564?mt=8 https://itunes.apple.com/us/app/bobo-explores- Bobo & Light light/id463809859?mt=8 https://itunes.apple.com/us/app/brainpop-featured- BrainPOP movie/id364894352?mt=8 https://itunes.apple.com/us/app/chalk- Chalk Board board/id374235211?mt=8 https://itunes.apple.com/us/app/cnn-app-for- CNN iphone/id331786748?mt=8 https://itunes.apple.com/us/app/constitution-for- Constitution ipad/id363287472?mt=8 https://itunes.apple.com/us/app/10-500+-cool- CoolFacts facts/id315194952?mt=8 https://itunes.apple.com/us/app/declaration-for- Declaration ipad/id367735045?mt=8 https://itunes.apple.com/us/app/destiny- Destiny Quest quest/id472689459?mt=8 Doceri https://itunes.apple.com/us/app/doceri/id412443803?mt=8 https://itunes.apple.com/us/app/google- Docs docs/id842842640?mt=8 https://itunes.apple.com/us/app/doodle-buddy-for-ipad- doodlebuddy paint/id364201083?mt=8 https://itunes.apple.com/us/app/dragon- Dragon dictation dictation/id341446764?mt=8 Dropbox https://itunes.apple.com/us/app/dropbox/id327630330?mt=8 EasyBib https://itunes.apple.com/us/app/easybib/id436768184?mt=8 https://itunes.apple.com/us/app/educreations-interactive- Educreations whiteboard/id478617061?mt=8 https://itunes.apple.com/us/app/microsoft- Excel excel/id586683407?mt=8 Evernote https://itunes.apple.com/us/app/evernote/id281796108?mt=8 https://itunes.apple.com/us/app/exo-labs- EXO Science
    [Show full text]
  • Google Cheat Sheet
    Way Cool Apps Your Guide to the Best Apps for Your Smart Phone and Tablet Compiled by James Spellos President, Meeting U. [email protected] http://www.meeting-u.com twitter.com/jspellos scoop.it/way-cool-tools facebook.com/meetingu last updated: November 15, 2016 www.meeting-u.com..... [email protected] Page 1 of 19 App Description Platform(s) Price* 3DBin Photo app for iPhone that lets users take multiple pictures iPhone Free to create a 3D image Advanced Task Allows user to turn off apps not in use. More essential with Android Free Killer smart phones. Allo Google’s texting tool for individuals and groups...both Android, iOS Free parties need to have Allo for full functionality. Angry Birds So you haven’t played it yet? Really? Android, iOS Freemium Animoto Create quick, easy videos with music using pictures from iPad, iPhone Freemium - your mobile device’s camera. $5/month & up Any.do Simple yet efficient task manager. Syncs with Google Android Free Tasks. AppsGoneFree Apps which offers selection of free (and often useful) apps iPhone, iPad Free daily. Most of these apps typically are not free, but become free when highlighted by this service. AroundMe Local services app allowing user to find what is in the Android, iOS Free vicinity of where they are currently located. Audio Note Note taking app that syncs live recording with your note Android, iOS $4.99 taking. Aurasma Augmented reality app, overlaying created content onto an Android, iOS Free image Award Wallet Cloud based service allowing user to update and monitor all Android, iPhone Free reward program points.
    [Show full text]
  • Augmented Reality Based Word Translator Tabish Khan, Rishisingh Hora, Ashwin Bendre, Prof
    International Journal of Innovative Research in Computer Science & Technology (IJIRCST) ISSN: 2347-5552, Volume-2, Issue-2, March 2014 Augmented Reality Based Word Translator Tabish Khan, Rishisingh Hora, Ashwin Bendre, Prof. Sneha Tirth developed for iOS. The translation it provides is not always Abstract— People travel to different places not knowing 100 percent accurate. It has a hit rate of 90-95 percent. the language used in that region. Hence there is a need to translate these unknown words to recognizable text. This application is developed to help travelers who can get the translated text as an output of our application. Our application is able to recognize the text captured by a mobile phone camera, translate the text and display the translation result back onto the screen of the mobile phone in an augmented manner. Index Terms— Android, Augmented Reality, Optical Character Recognition, Word Translator. NTRODUCTION I. I The motivation of our application is to help tourist FIG 1: Output of word lens navigate in a foreign language environment. The The above figure depicts an example of the word lens application we developed enables the users to get text application which translates the words “Bienvenido Al translated as easy as a button click. The camera captures the Futuro” to the desired English language, the output being, text and returns the translated result in the real time. “Welcome to the future”. The system we developed includes automatic text detection, OCR and text translation. Although the current version of our application is limited to translation of a few languages, III. PROPOSED SYSTEM we can add a feature of text-to-speech, as an extension in Inspired by the existing application, we will use the same our application.
    [Show full text]
  • VIBRANCE VIBRANCE Vol.7, Issue 5
    VIBRANCE VIBRANCE Vol.7, Issue 5 MAY 2014, Vol. 7, Issue 5 3+ Monthly E-magazine of Dept. of I.T, I.T.S, Mohan Nagar, Ghaziabad In this Issue! Mobile apps useful in searching jobs…. 2 Word Lens, incredible visual translation app…. 4 Some tips for the success, Alumni Speak ….. 5 Happenings @ Dept. of I.T in the month of May, 2014 ….6 Festa d’ addio, a farewell party for MCA 2011-14 …7 Answers of Guess Who? ….8 Placement Quiz Series…9 Answers to last Quiz…10 Institute of Technology & Science Mohan Nagar, Ghaziabad www.facebook.com/ITS.Education.Group www.facebook.com/ITS.MohanNagar.Ghaziabad www.its.edu.in VIBRANCE Vol.7, Issue 5 In this Issue! Mobile apps useful in searching jobs…. 2 Mobile apps useful in searching Jobs Word Lens, incredible visual translation app…. 4 One of the trends that surfaced from the study had to do with immediacy. The study found that 59% of job seekers thought that their odds of getting a Some tips for the success, response on a job were better if they responded to the posting as soon as it Alumni Speak ….. 5 was online. The ability to be connected anywhere and the urgency of looking for a new job make mobile devices one of the most important job search tools. Happenings @ Dept. of I.T in Some of the apps which can help you in searching job via your mobiles are: the month of May, 2014 ….6 Festa d’ addio, a farewell party for MCA 2011-14 …7 Answers of Guess Who? ….8 Placement Quiz Series…9 Answers to last Quiz…10 1.
    [Show full text]
  • Licensing Requirements for Recruiters in Canada by Tarek Badawy, Alan Diner, and Becki L
    AMERICAN IMMIGRATION LAWYERS ASSOCIATION GO DIGITAL! URL: http://ailahub.aila.org/i/214317 Password: N/A Want a More Interactive Experience? Read this document on AILA’s digital reading platform! AILA’s digital reader provides you with a more dynamic reading experience! Instead of a static PDF, our digital platform offers you the ability to: • NAVIGATE the document with greater ease and speed • WATCH and LISTEN to multimedia directly on the page • SEARCH for key terms • SHARE with colleagues via e-mail and social media outlets • and much more! No software downloads necessary! Simply click and begin! You can flip through pages, search for specific topics, print, view archives, or share with colleagues via email, Facebook, Twitter, or other social media using the navigation bar. AILA InfoNet Doc. No. 13112051. (Posted 11/20/13) AMERICAN IMMIGRATION LAWYERS ASSOCIATION AN IMMIGRATION DIALOGUE NOVEMBER-DECEMBER 2013 VOICEVol. 4 Issue 6 PLUS: SET THE TABLE: Running an Immigration Practice Health Care Law DEMYSTIFIED SOLO ATTORNEYS: Who’s Got Your Back? AILA InfoNet Doc. No. 13112051. (Posted 11/20/13) MARK YOUR CALENDARS: 2014 AILA Annual Conference on Immigration Law AILA InfoNet Doc. No. 13112051. (Posted 11/20/13) TIME IS OF THE u ESSENCE As an immigration lawyer, you’re always on the go. We here at AILA know how valuable your time is, and with this issue, we want to help you make the most of it. u EXECUTIVE EDITOR EDITOR-IN-CHIEF Tatia L. Gordon-Troy Crystal Williams LEGAL EDITOR, REPORTER Sheeba Raj GRAPHIC DESIGNERS Bradley Amburn, Robert Bequeaith NEW MEDIA ASSOCIATE Jessica Eise CONTRIBUTORS Tarek Badawy, Jaclyn C.
    [Show full text]
  • Looking-At-11-Useful-Apps-That-Attorneys-May-Have
    Looking at 11 useful mobile apps that attorneys may have never heard of These free apps can help attorneys find ways to increase efficiency and cut expenses without sacrificing quality by Patrick Soon and Rebecca Bellow *This article first appeared in Inside Counsel Magazine on June 10, 2014: http://bit.ly/16bMIaC Let’s face it: In this economy the pressure is on for attorneys to find ways to increase efficiency and cut expenses without sacrificing quality. The free apps below can be very helpful in this regard. And while most of these apps are not specifically designed to aid the practice of law, they should nevertheless be included in every lawyer’s digital tool box. Here are the top 11: 1. Asana (Free for teams up to 15) Android; iPhone Asana is an easy-to-use, web-based group-project manager that makes it easier for attorneys and paralegals to work together on a big case. Each team member gets a workspace where they can add comments, attachments and tags to the tasks on which they are collaborating. Each user also gets their own inbox which organizes and updates new information in real time, so e-mailing is not needed. The most important benefit is that the lead attorney can spend more time working on the case itself and less time checking up on the other members of the group. 2. Documents 5 (Free) iPhone This app represents an overhaul of Documents, by Readdle. It has many applications for law offices as it is designed for optimal file management, document viewing and media playing.
    [Show full text]
  • Image-Based Augmented Reality
    IMAGE-BASED AUGMENTED REALITY: REINFORCING LEARNING IN MEDICAL SCHOOL EDUCATION A Project Presented to the Faculty of California State Polytechnic University, Pomona In Partial Fulfillment Of the Requirements for the Degree Master of Arts In Education By Joseph S. Marilo 2017 SIGNATURE PAGE PROJECT: IMAGE-BASED AUGMENTED REALITY: REINFORCING LEARNING IN MEDICAL SCHOOL EDUCATION AUTHOR: Joseph S. Marilo DATE SUBMITTED: Spring 2017 Education Department Shahnaz Lotfipour, Ph. D. _______________________________________ Project Committee Chair Professor of Education Jerry Kellogg, MA _______________________________________ Senior Consultant Creative Edge ii ACKNOWLEDGEMENTS I am forever appreciative of my parents for the sacrifices they made toward my scholastic pursuits. They did not have the same educational opportunities back in their respective countries, and I know that I would not be the person I am today without their love and support. To my wife, Christy, for her tireless encouragement despite the challenges. At one point, we were both in school – she completed her Doctorate last year! We will be breathing a sigh of relief very soon...again! This project would not have been possible without the assistance of my work colleagues. The team members in Instructional Technology and Distributed Learning provided technical expertise and guidance that kept the project on track. The generous support of the administration, post-doctoral teaching fellows, and staff of the NMM/OMM department was invaluable as was the support of the administration in the College of Osteopathic Medicine. To my classmates in my cohort who inspire and exemplify the selfless act of imparting knowledge. I’m grateful for your friendship and generosity. And lastly, a very special thank you to my faculty advisor, Dr.
    [Show full text]
  • The Accessibility Toolbox Matrix Dan Herlihy
    The Accessibility Toolbox Matrix Dan Herlihy Computer iPad Chromebook Android Scan & Read Scanning directly into the program - Prizmo - Scanning, OCR and Speech Live Using a Document Camera: - Text Fairy - free, take using a flatbed scanner (9.99 plus 2.99 for each voice) Can - Snap&Read Universal extension for image, performs OCR, - Kurzweil 3000 adjust what the OCR sees as text the Chrome browser with: Hovercam will read loud but doesn't - Read & Write Gold (editing underlying text) App runs OCR T3, T5, Mini 5, Solo 8 - run document highlight as it does, can - Premier Scan & Read on the image and produces a true PDF camera software from cloud - create PDF document so PDF readers with TTS can https://thehovercam.com/livecam - - Claro ScanPen (free) Live Using a Document Camera: highlight text and read aloud. App can make sure you type https take image, performs These are - Snap&Read Universal in the Chrome also read text aloud by choosing Export, OCR, select text with programs or browser with: Hovercam T5, Solo 8 - choose Read Text Aloud. Turn off Detect Snapverter - Chrome app from finger to read loud apps that can run document camera software from: Page option in Settings. Can send to TextHelp - use in conjunction with - TextGrabber & scan a printed https://thehovercam.com/livecam Claro PDF. Also translate. Read&Write for Google extension. Take Translator (by ABBYY, page into a - Prizmo Go (free, in-app purchases - picture of document with any device, 4.99) take picture of computer, - Kurzweil 3000 Image Reader tool - take picture of page, runs OCR, TTS drop file into Snapverter "Drop files here page, crop, choose Read display it (PC version only) select Image Speech folder in Google Drive to convert (folder (upper right) runs OCR onscreen, or Reader from the Read menu in - Office Lens – (free) Take picture of created by Snapverter in Google Drive).
    [Show full text]
  • Increase Your Ipad Iq
    Dennis Weishan Technology Solutions Director Robert W. Baird & Co. Version: 8.18.14 1 Welcome Introduction iPad Basics Apps Tips & Tricks Accessories Resources Questions 2 Welcome Introduction iPad Basics Apps Tips & Tricks Accessories Resources Questions 3 4 5 6 7 8 Welcome Introduction iPad Basics Apps Tips & Tricks Accessories Resources Questions 9 10 11 12 “The combination of required code signing, sandboxing, and entitlements in apps provides solid protection against viruses, malware, and other exploits that compromise the security of other platforms.“ 13 Welcome Introduction iPad Basics Apps Tips & Tricks Accessories Resources Questions 14 15 Mail flick through your inbox. Tap to open attachments. Pinch to zoom. And turn iPad to landscape to change your view. iMessage a new service that’s even better than texting. Because it’s between you and anyone using an iPhone, iPad, or iPod touch with iOS 5. And it’s unlimited. FaceTime You can tell them how you’re doing. Or you can show them. FaceTime on iPad lets you have it both ways. From anywhere in the world. Skype built especially for the iPad. Call, video call, or instant message anyone on Skype. ooVoo With ooVoo you can make FREE video calls, voice calls, and send text chats to easily connect with friends and family. Start a free group video chat with up to 12 people on 3G, 4G, LTE and Wifi. 16 17 Safari The web is even more captivating on the new iPad, thanks to the Retina display. Text is sharp and crisp. Colors are vibrant and true to life. And browsing is fast and responsive.
    [Show full text]
  • Google Buys Written Language Translation Startup 16 May 2014
    Google buys written language translation startup 16 May 2014 "read" words in one language and translate them into another almost instantly without need for network connections. Versions of the application have been tailored for smartphones powered by Apple or Android software as well as for Google Glass eyewear that links to the Internet. The news came as Google this week began letting anyone in the United States buy Glass, priced at $1,500 a pair. Google has promised lower-priced, fashionably versions of Glass in the not too distant future and Google logo is seen on a wall at the entrance of the has been prepping the market for their arrival. Google offices in Brussels on February 5, 2014 Fashion industry veteran and jewelry artist Ivy Ross will take charge of Google's Glass efforts beginning Monday, according to the California-based Google on Friday confirmed it has bought a startup technology titan. specializing in using smartphones to translate signs, billboards or other written words in real time. "I look forward to answering the seemingly simple, but truly audacious questions Glass poses," Ross said in a post at Google+ social network. Quest Visual and the technology built into its Word Lens application will become part of a Google team "Can technology be something that frees us up and devoted to developing translation features and keeps us in the moment, rather than taking us out services. of it?" Terms of the deal were not disclosed and Google Her resume includes stints at Calvin Klein, Swatch, declined to make any comment on the record.
    [Show full text]