Zone Based Hybrid Feature Extraction Algorithm for Handwritten Numeral

Total Page:16

File Type:pdf, Size:1020Kb

Zone Based Hybrid Feature Extraction Algorithm for Handwritten Numeral Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Zone-Based Hybrid Feature Extraction Algorithm for Handwritten Numeral Recognition of Four Indian Scripts *S.V.Rajashekararadhya Vanaja Ranjan P Research Scholar Asst.Professor Department of Electrical Engineering, CEG Department of Electrical Engineering, CEG Anna Anna University, Chennai, India University, Chennai, India *e-mail: [email protected] e-mail: [email protected] Abstract-India is a multi-lingual and multi-script country, recognition and post processing. Research in HCR is popular where eighteen official scripts are accepted and there are over for various practical application potential such as, reading aids hundred regional languages. In this paper we propose a zone- for the blind, bank cheques, vehicle number plates, automatic based hybrid feature extraction system. The character centroid is pin code reading to sort postal mail. There is a lot of demand computed and the image (character/numeral) is further divided on Indian scripts character recognition and a review of the into n equal zones. An average angle from the character centroid OCR work done on Indian languages is excellently reviewed to the pixels present in the zone, is computed (one feature). in [5]. In [6] a survey on the feature extraction methods for Similarly, the zone centroid is also computed (two features). The character recognition is reviewed. The feature extraction average angle from the zone centroid to the pixels present in the method includes Template matching, Deformable templates, zone is computed (one feature). This procedure is sequentially Unitary Image transforms, Graph description, Projection repeated for all the zones/grids/boxes present in the numeral Histograms, Contour profiles, Zoning, Geometric moment image. There could be some zones that are empty; then, the value invariants, Zernike Moments, Spline curve approximation, of that particular zone image in the feature vector is zero. Finally, Fourier descriptors , Gradient feature, Gabor feature etc. 4*n such features are extracted. The nearest neighbor and support vector machine classifiers are used for subsequent India is a multi-lingual and multi-script country, classification and recognition purposes. We obtained 97.85 %, comprising eighteen official languages, namely, Assamese, 96.8 %, 95.1% and 95 % recognition rates for Kannada, Telugu, Bangla, English, Gujarati, Hindi, Kankanai, Kannada, Tamil and Malayalam numerals respectively, using support Kashmiri, Malayalam, Marathi, Nepali, Oriya, Punjabi, vector machine. Rajasthani, Sanskrit, Tamil, Telugu and Urdu. Recognition of handwritten Indian scripts is difficult because of the presence Keywords—Handwriiten character recognition, image of numerals, vowels, consonants, vowel modifiers and processing, Indian scripts, Feature extraction, Nearest neighbor compound characters. classifier, Support vector machine We will now briefly review the few important works done I. INTRODUCTION towards HCR with reference to the Indian language scripts. In Handwritten character recognition (HCR) is an important [7] the grid based feature extraction method is used to area in image processing and pattern recognition fields. HCR recognize the handwritten Bangla numerals using a multifier has received extensive attention in academic and production classifier. An accuracy of 97.23% is reported. The recognition fields. The recognition system can be either on-line or off-line. of conjunctive Bangla Characters by the artificial neural In on-line handwriting recognition, words are generally network is reported in [8]. Recognition of Devanagari written on a pressure sensitive surface (digital tablet PCs) from characters using gradient features and fuzzy-neural network is which real time information, such as the order of the stroke reported in [9] and [10] respectively. We found curvature made by the writer is obtained and preserved. This is feature for recognizing Oriya characters in [11]. significantly different to off-line handwriting recognition In [12] the zone/grid based feature extraction for where no dynamic information is available [1]. Off-line handwritten Hindi numerals is reported. For extracting the handwriting recognition is the process of finding letters and features, each character image is divided into 24 zones. By words that are present in the digital image of the handwritten considering the bottom left corner of the image as an absolute text. It is the subfield of optical character recognition(OCR). reference, the average vector distance for the pixels present in Several methods of recognition of English, Latin, Arabic, the zone is computed. Character recognition for Telugu scripts Chinese scripts are excellently reviewed in [1, 2, 3, 4]. using multi-resolution analysis and associative memory is There are five major stages in the HCR problem: Image reported in [13]. The Fuzzy technique and neural network preprocessing, segmentation, feature extraction, training and based off-line Tamil character recognition are found in [14]. 5295 978-1-4244-2794-9/09/$25.00 ©2009 IEEE In [15] the author has described a scheme for extracting Kannada is one of the major Dravidian languages of South features from the gray scale images of the handwritten India and one of the earliest languages evidenced characters on their state-space map with eight directional epigraphically in India, and spoken by about 50 million people space variations. In [16] for feature computation, the bounding in the Indian states of Karnataka, Tamil Nadu, Andhra Pradesh box of a numeral image is segmented into blocks and the and Maharasthra. The script has 49 characters in its directional features are computed in each of the blocks. These alphasyllabary and is phonetic. The characters are classified blocks are then down-sampled by a Gaussian filter and the into three categories: swaras(vowels), vyanjans(consonants) features obtained from the down-sampled blocks are fed to a and yogavaahas (part vowel, part consonants). The script also modified quadratic classifier for recognition. Off-line HCR for includes 10 different Kannada numerals of the decimal numeral recognition is also found in [17-21]. number system. Recognition of isolated handwritten Kannada numerals Telugu is a Dravidian language and it is the third most based on the image fusion method is found in [17]. In [18] popular script in India. It is the official language of the south Zone and Distance metric based feature extraction method is Indian state, Andhra Pradesh and also spoken by neighboring used. The character centroid is computed and the image is states. The Telugu script is closely related to the Kannada further divided into n equal zones. The average distance from script. Telugu is a syllabic language. Similar to most the character centroid to pixels present in the zone is languages of India, each symbol in the Telugu script computed. This procedure is repeated for all the zones present represents a complete syllable. Officially, there are 10 in the numeral image. Finally n such features are extracted for numerals, 18 vowels, 36 consonants, and three dual symbols. classification and recognition. Tamil is a Dravidian language and one of the oldest In [19] the zone and vertical projection distance metric is languages in the world. It is the official language of the Indian used. The preprocessed numeral image (50x50) is fed to the state of Tamil Nadu; it also has official status in Sri Lanka, feature extraction module. The image is divided into 25 zones Malaysia and Singapore. The Tamil script has 10 numerals, 12 (each zone size is 10x10). The average pixel distance of each vowels, 18 consonants and five grantha letters. The script, column present in the zone is computed vertically. In [20] however, is syllabic and not alphabetic. The complete script, image is further divided in to n equal zones. The average therefore, consists of 31 letters in their independent form, and distance from the zone centroid to pixels present in the zone is an additional 216 combining letters representing every computed. possible combination of a vowel and a consonant. From the above literature survey it is clear that not much Malayalam is a Dravidian language and it is the eighth work has been done on the south-Indian scripts. This most popular script in India, and spoken by about 30 million motivated us to work on south-Indian scripts. The selection of people in the Indian state of Kerala. Both the language and the the feature extraction method is also a very important factor writing system are closely related to Tamil. However, for achieving efficient character recognition. In this paper we, Malayalam has its own script. The script has 16 vowels, 37 propose a simple and efficient zone-based hybrid feature consonants and 10 numerals. extraction algorithm. The Nearest Neighbor Classifier (NNC) and Support Vector Machine (SVM) classifiers are used for The challenging part of the Indian handwritten the recognition and classification of a numeral image. We numeral/character recognition is the distinction between the have tested our method for Kannada, Telugu, Tamil and similar-shaped components. A very small variation between Malayalam numerals and also experimentation is carried out two characters or numerals leads to recognition complexity on MNIST digit database and ISI Bangla digit database. and a certain degree of recognition accuracy. The style of writing the characters is highly different
Recommended publications
  • Handwriting Recognition in Indian Regional Scripts: a Survey of Offline Techniques
    1 Handwriting Recognition in Indian Regional Scripts: A Survey of Offline Techniques UMAPADA PAL, Indian Statistical Institute RAMACHANDRAN JAYADEVAN, Pune Institute of Computer Technology NABIN SHARMA, Indian Statistical Institute Offline handwriting recognition in Indian regional scripts is an interesting area of research as almost 460 million people in India use regional scripts. The nine major Indian regional scripts are Bangla (for Bengali and Assamese languages), Gujarati, Kannada, Malayalam, Oriya, Gurumukhi (for Punjabi lan- guage), Tamil, Telugu, and Nastaliq (for Urdu language). A state-of-the-art survey about the techniques available in the area of offline handwriting recognition (OHR) in Indian regional scripts will be of a great aid to the researchers in the subcontinent and hence a sincere attempt is made in this article to discuss the advancements reported in this regard during the last few decades. The survey is organized into different sections. A brief introduction is given initially about automatic recognition of handwriting and official re- gional scripts in India. The nine regional scripts are then categorized into four subgroups based on their similarity and evolution information. The first group contains Bangla, Oriya, Gujarati and Gurumukhi scripts. The second group contains Kannada and Telugu scripts and the third group contains Tamil and Malayalam scripts. The fourth group contains only Nastaliq script (Perso-Arabic script for Urdu), which is not an Indo-Aryan script. Various feature extraction and classification techniques associated with the offline handwriting recognition of the regional scripts are discussed in this survey. As it is important to identify the script before the recognition step, a section is dedicated to handwritten script identification techniques.
    [Show full text]
  • Numerical Notation: a Comparative History
    This page intentionally left blank Numerical Notation Th is book is a cross-cultural reference volume of all attested numerical notation systems (graphic, nonphonetic systems for representing numbers), encompassing more than 100 such systems used over the past 5,500 years. Using a typology that defi es progressive, unilinear evolutionary models of change, Stephen Chrisomalis identifi es fi ve basic types of numerical notation systems, using a cultural phylo- genetic framework to show relationships between systems and to create a general theory of change in numerical systems. Numerical notation systems are prima- rily representational systems, not computational technologies. Cognitive factors that help explain how numerical systems change relate to general principles, such as conciseness and avoidance of ambiguity, which also apply to writing systems. Th e transformation and replacement of numerical notation systems relate to spe- cifi c social, economic, and technological changes, such as the development of the printing press and the expansion of the global world-system. Stephen Chrisomalis is an assistant professor of anthropology at Wayne State Uni- versity in Detroit, Michigan. He completed his Ph.D. at McGill University in Montreal, Quebec, where he studied under the late Bruce Trigger. Chrisomalis’s work has appeared in journals including Antiquity, Cambridge Archaeological Jour- nal, and Cross-Cultural Research. He is the editor of the Stop: Toutes Directions project and the author of the academic weblog Glossographia. Numerical Notation A Comparative History Stephen Chrisomalis Wayne State University CAMBRIDGE UNIVERSITY PRESS Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo, Delhi, Dubai, Tokyo Cambridge University Press The Edinburgh Building, Cambridge CB2 8RU, UK Published in the United States of America by Cambridge University Press, New York www.cambridge.org Information on this title: www.cambridge.org/9780521878180 © Stephen Chrisomalis 2010 This publication is in copyright.
    [Show full text]
  • Miramo Mmcomposer Reference Guide
    Miramo® mmComposer mmComposer Reference Guide VERSION 1.5 Copyright © 2014–2018 Datazone Ltd. All rights reserved. Miramo®, mmChart™, mmComposer™ and fmComposer™ are trademarks of Datazone Ltd. All other trademarks are the property of their respective owners. Readers of this documentation should note that its contents are intended for guidance only, and do not constitute formal offers or undertakings. ‘License Agreement’ This software, called Miramo, is licensed for use by the user subject to the terms of a License Agreement between the user and Datazone Ltd. Use of this software outside the terms of this license agreement is strictly prohibited. Unless agreed otherwise, this License Agreement grants a non-exclusive, non-transferable license to use the software programs and related document- ation in this package (collectively referred to as Miramo) on licensed computers only. Any attempted sublicense, assignment, rental, sale or other transfer of the software or the rights or obligations of the License Agreement without prior written con- sent of Datazone shall be void. In the case of a Miramo Development License, it shall be used to develop applications only and no attempt shall be made to remove the associated watermark included in output documents by any method. The documentation accompanying this software must not be copied or re-distributed to any third-party in either printed, photocopied, scanned or electronic form. The software and documentation are copyrighted. Unless otherwise agreed in writ- ing, copies of the software may be made only for backup and archival purposes. Unauthorized copying, reverse engineering, decompiling, disassembling, and creating derivative works based on the software are prohibited.
    [Show full text]
  • N Umerations in Tke Sink Ala Language Numerations in the Sinhala Language
    Numerations in the Sinhala Language N umerations in tke Sink ala Language Numerations in the Sinhala Language N umerations in tke Sinkala Language by Harsha Wijayawardhana edited by Aruni Goonetilleke 3 Num erations in the Sinhala Language Numerations in the Sinhala Language © Harsha Wijayawardhana 9th Lane, Nawala Road, Rajagiriya, Sri Lanka. [email protected] www.ucsc.cbm.ac.lk/sdu 2009 October ISBN- 978-955-1199-05-0 Design Sanjaya Epa Senevirathna Information and Communication Technology Agency of Sri Lanka All rights reserved. No part of this document may be reporoduced or transmitted in any form or by any means without prior written permission from ICTA. Published Strategic Communications and Media Unit - ICTA 160/24, Kirimandala Mawatha, Colombo 05, Sri Lanka. TP: +94 11 2369099 FAX:+ 94 11 2369091 email: [email protected] web: www.icta.lk Numerations in the Sinhala Language To my parents and to my daughter Panchali. um a.tlions m llie Sinhala Language Preface The research into Sinhala numerals that ICTA initiated has yielded the fact the Sinhala language had several sets of Sinhala numerals, of which two sets had been widely used: one set (Sinhala Illakkam) was in use up to the early part of the nineteenth century, and the other set (Lith Illakkam) was in use well into the 20th century. The latter set clearly includes a zero and a zero place holder. ICTA’s Local Language Working Group, after reviewing the research and after extensive discussions with experts and stakeholders agreed that these two sets should be encoded in (he Sri Lanka Standard Sinhala Character Code for Information Interchange (SLS 1134 : 2004), in the Unicode standard and in ISO/IEC 10646 (the Universal Character set).
    [Show full text]
  • Elements of South-Indian Palaeography, from the Fourth To
    This is a reproduction of a library book that was digitized by Google as part of an ongoing effort to preserve the information in books and make it universally accessible. https://books.google.com ELEMENTS SOUTH-INDIAN PALfi3&BAPBY FROM THE FOURTH TO THE SEVENTEENTH CENTURY A. D. BEIN1 AN INTRODUCTION TO ?TIK STUDY OF SOUTH-INDIAN INSCRIPTIONS AND MSS. BY A. C. BURNELL HON'. PH. O. OF TUE UNIVERSITY M. K. A, ri'VORE PIS I. A SOClfcTE MANGALORE \ BASEL MISSION BOOK & TRACT DEPOSITORY ft !<3 1874 19 Vi? TRUBNER & Co. 57 & 69 LUDOATE HILL' . ' \jj *£=ggs3|fg r DISTRIBUTION of S INDIAN alphabets up to 1550 a d. ELEMENTS OF SOUTH-INDIAN PALEOGRAPHY FROM THE FOURTH TO THE SEVENTEENTH CENTURY A. D. BEING AN INTRODUCTION TO THE STUDY OF SOUTH-INDIAN INSCRIPTIONS AND MSS. BY A. p. j^URNELL HON. PH. D. OF THE UNIVERSITY OF STRASSBUB.G; M. R. A. S.; MEMBKE DE LA S0CIETE ASIATIQUE, ETC. ETC. MANGALORE PRINTED BY STOLZ & HIRNER, BASEL MISSION PRESS 1874 LONDON TRtlBNER & Co. 57 & 59 LUDGATE HILL 3« w i d m « t als ^'ctdjcn kr §anltekcit fiir Mc i|jm bdic<jcnc JJoctorMvk ttcsc fetlings^kit auf rincm fejjcr mtfrckntcn Jfclk bet 1®4 INTRODUCTION. I trust that this elementary Sketch of South-Indian Palaeography may supply a want long felt by those who are desirous of investigating the real history of the peninsula of India. Trom the beginning of this century (when Buchanan executed the only archaeological survey that has ever been done in even a part of the South of India) up to the present time, a number of well meaning persons have gone about with much simplicity and faith collecting a mass of rubbish which they term traditions and accept as history.
    [Show full text]
  • Year 5 Maths – Summer 2 Week Beginning: 29.6.20 LINK
    Year 5 maths – Summer 2 Week beginning: 29.6.20 Lesson 3 of 12 Lesson 1 of 2 Lesson 2 of 2 Lesson 1 of 12 Lesson 2 of 12 CONSOLIDATION LESSON Roman Numerals Roman Numerals CONSOLIDATION LESSON CONSOLIDATION LESSON Theme Word problems To write Roman numerals to To write thousands numbers in Formal methods Formal methods To solve addition and 1000 Roman numerals Addition within 1,000,000 Addition and subtraction subtraction word problems Factual Practise comparing numbers Practise choosing multiples Practise multiplication patterns Practise estimating products Practise estimating products fluency (to aid fluency) using multiplication activity activity activity activity activity (Lesson 1 resources below) (Lesson 2 resources below) (Lesson 3 resources below) (Lesson 4 resources below) (Lesson 5 resources below) MAKING LINKS: Last week we MAKING LINKS: Yesterday we MAKING LINKS: Earlier in the year MAKING LINKS: Yesterday we MAKING LINKS: This week we solved problems involving volume. wrote Roman numerals up to we worked with formal addition worked with formal addition worked with formal addition and Today we will be writing Roman 1000.Today we will be writing methods. Today we will be methods. Today we will be subtraction methods. Today we numerals up to 1000. 1000s using Roman numerals. continuing with that. continuing with that. will be using these to solve word problems THINK: (support below) THINK: (support below) THINK: (support below) THINK: (support below) Can you help me with this Can you help me with this Can you help me with this Can you help me with this THINK: (support below) problem? My friend says all problem? We sometimes see problem? My friend has digit problem? My friend has digit Can you help me with this Roman numerals are based Roman numerals on buildings to cards: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9.
    [Show full text]
  • Numbering Systems Developed by the Ancient Mesopotamians
    Emergent Culture 2011 August http://emergent-culture.com/2011/08/ Home About Contact RSS-Email Alerts Current Events Emergent Featured Global Crisis Know Your Culture Legend of 2012 Synchronicity August, 2011 Legend of 2012 Wednesday, August 31, 2011 11:43 - 4 Comments Cosmic Time Meets Earth Time: The Numbers of Supreme Wholeness and Reconciliation Revealed In the process of writing about the precessional cycle I fell down a rabbit hole of sorts and in the process of finding my way around I made what I think are 4 significant discoveries about cycles of time and the numbers that underlie and unify cosmic and earthly time . Discovery number 1: A painting by Salvador Dali. It turns that clocks are not as bad as we think them to be. The units of time that segment the day into hours, minutes and seconds are in fact reconciled by the units of time that compose the Meso American Calendrical system or MAC for short. It was a surprise to me because one of the world’s foremost authorities in calendrical science the late Dr.Jose Arguelles had vilified the numbers of Western timekeeping as a most grievious error . So much so that he attributed much of the worlds problems to the use of the 12 month calendar and the 24 hour, 60 minute, 60 second day, also known by its handy acronym 12-60 time. I never bought into his argument that the use of those time factors was at fault for our largely miserable human-planetary condition. But I was content to dismiss mechanized time as nothing more than a convenient tool to facilitate the activities of complex societies.
    [Show full text]
  • Year 5 Maths – Summer 2 Week Beginning
    Year 5 maths – Summer 2 Week beginning: 29.6.20 Lesson 3 of 12 Lesson 1 of 2 Lesson 2 of 2 Lesson 1 of 12 Lesson 2 of 12 CONSOLIDATION LESSON Roman Numerals Roman Numerals CONSOLIDATION LESSON CONSOLIDATION LESSON Theme Word problems To write Roman numerals to To write thousands numbers in Formal methods Formal methods To solve addition and 1000 Roman numerals Addition within 1,000,000 Addition and subtraction subtraction word problems Factual Practise comparing numbers Practise choosing multiples Practise multiplication patterns Practise estimating products Practise estimating products fluency (to aid fluency) using multiplication activity activity activity activity activity (Lesson 1 resources below) (Lesson 2 resources below) (Lesson 3 resources below) (Lesson 4 resources below) (Lesson 5 resources below) MAKING LINKS: Last week we MAKING LINKS: Yesterday we MAKING LINKS: Earlier in the year MAKING LINKS: Yesterday we MAKING LINKS: This week we solved problems involving volume. wrote Roman numerals up to we worked with formal addition worked with formal addition worked with formal addition and Today we will be writing Roman 1000.Today we will be writing methods. Today we will be methods. Today we will be subtraction methods. Today we numerals up to 1000. 1000s using Roman numerals. continuing with that. continuing with that. will be using these to solve word problems THINK: (support below) THINK: (support below) THINK: (support below) THINK: (support below) Can you help me with this Can you help me with this Can you help me with this Can you help me with this THINK: (support below) problem? My friend says all problem? We sometimes see problem? My friend has digit problem? My friend has digit Can you help me with this Roman numerals are based Roman numerals on buildings to cards: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9.
    [Show full text]
  • The Hindu-Arabic Numerals
    I V" - LIBRARY OF THE University of California. Class THE HINDU-ARABIC NUMERALS BY . DAVID EUGENE SMITH AND LOUIS CHARLES KARPINSKI BOSTON AND LONDON GINN AND COMPANY, PUBLISHERS 1911 OOPTinGHT. 1P11. T.Y PATTD EUGENE SMITH AND LOUTS CHAElEf KAEPORSKI ATT, EIGHT; ED 811.7 a.A. PREFACE So familiar are we with the numerals that bear the misleading name of Arabic, and so extensive is their use in Europe and the Americas, that it is difficult for us to realize that their general acceptance in the transactions of commerce is a matter of only the last four centuries, and that they are unknown to a very large part of the human race to-day. It seems strange that such a labor- saving device should have struggled for nearly a thou- sand years after its system of place value was perfected before it replaced such crude notations as the one that the Roman conqueror made substantially universal in Europe. Such, however, is the case, and there is prob- ably no one who has not at least some slight passing interest in the story of this struggle. To the mathema tician and the student of civilization the interest is gen- a one to the teacher of the elements of erally deep ; knowledge the interest may be less marked, but never- theless it is real ; and even the business man who makes daily use of the curious symbols by which we express the numbers of commerce, cannot fail to have some appreciation for the story of the rise and progress of these tools of his trade.
    [Show full text]
  • Patttterrn Anallysiis Techniiques Fforr Ttherecogniittion Off
    Pattern Analysis Techniques for the Recognition of Unconstrained Handwritten Malayalam Character Images Thesis submitted to Cochin University of Science and Technology in partial fulfillment for the award of the Degree of DOCTOR OF PHILOSOPHY Under the Faculty of Technology ByByBy JOMY JOHN (Reg. No. 3539) Under the guidance of Dr. K. V. PRAMOD Department of Computer Applications Cochin University of Science and Technology Cochin – 682 022, Kerala, India January 2014 Dr. K. V. Pramod (Supervising Guide) Associate Professor Dept. of Computer Applications Cochin University of Science and Technology Certified that the thesis entitled “PatternPattern AnalysisAnalysis Techniques for the Recognition of Unconstrained Handwritten Malayalam Character Images”Images is a bonafide record of research carried out by Jomy JohnJohn under my guidance in the Department of Computer Applications, Cochin University of Science and Technology, Kochi-22. The work does not form part of any dissertation submitted for the award of any degree, diploma, associate ship or any other title or recognition from any University. Kochi - 22 Date:30-1-2014 Dr. K. V. Pramod Dr. K. V. Pramod (Supervising Guide) Associate Professor Dept. of Computer Applications Cochin University of Science and Technology This is to certify that all the relevant corrections and modifications suggested by the audience during the Pre-synopsis seminar and recommendations by the Doctoral Committee of the candidate have been incorporated in the thesis. Kochi - 22 Date:30-1-2014 Dr. K. V. PPramodramod I hereby declare that the thesis entitled “““Pattern“Pattern Analysis Techniques for the Recognition of Unconstrained Handwritten Malayalam Character ImagesImages”””” is the outcome of the original work done by me under the guidance of Dr.
    [Show full text]
  • Research Article Recognition of Multi-Lingual Handwritten Numerals Using Partial Derivatives
    Research Journal of Applied Sciences, Engineering and Technology 12(1): 27-36, 2016 DOI:10.19026/rjaset.12.2300 ISSN: 2040-7459; e-ISSN: 2040-7467 © 2016 Maxwell Scientific Publication Corp. Submitted: July 2, 2015 Accepted: August 15, 2015 Published: January 05, 2016 Research Article Recognition of Multi-lingual Handwritten Numerals Using Partial Derivatives 1K.N. Saravanan and 2R. Anitha, 1Christ University, Bangalore, Karnataka, 2Muthayammal Engineering College, Rasipuram, Tamilnadu, India Abstract: The multi-font and multi-lingual handwritten numerals recognition has been a demanding requirement in this decade. This research work proposes multi-lingual handwritten numerals recognition using partial derivatives for classifying handwritten numerals of five major Indian languages. The objective of the proposed work aims at designing and developing a recognition algorithm for multilingual handwritten numerals. This objective is achieved through data collection and preprocessing which involves creation of handwritten numeral databases, data collection, round off mean aspect ratio value based representation and identification of features using partial derivatives. The features derived from partial derivatives are stored in a five dimensional column vector which yielded a recognition rate of 94.80, 95.89, 96.44, 95.81 and 92.03%, respectively for Kannada, Gurumukhi, Sindhi, Malayalam and Tamil Handwritten Numerals respectively. Keywords: Binary numeral image, feature identification, multi-lingual handwritten numeral recognition, representation,
    [Show full text]
  • Universal Numeric Segment Display for Indian Scheduled Languages
    International Journal of Computer Trends and Technology- volume2Issue2- 2011 Universal Numeric Segment Display for Indian Scheduled Languages: an Architectural View Partha Pratim Ray Surendra Institute of Engineering and Management Siliguri, Darjeeling-734009, West Bengal, India Abstract— India is country of several hundred different languages. Though twenty two languages have only been devised II. RELATED WORK as scheduled to the Eighth Schedule of Indian Constitution in So far 7-segment display is used for English digits, those 2007. But as there is yet no proposed compact display are Latin digits and 10-segment display is proposed for architecture to display all the scheduled language numerals at a Bengali digits [2, 3]. [4] proposed an 8-segment display is time, this paper proposes a uniform display architecture to display all twenty two different language digits with higher English and Bengali digits. Literature [5] proposed a 12- accuracy and simplicity by using a 17-segment display, which is segment display for Arabic digits. [6] described an 18- an improvement over the 16-segment display. segmented display for twelve international languages. Another report, [7] presented a 31-segmented display for Bengali Keywords— 17-segment display, Indian scheduled language, characters. [8] proposed a 26-segment display for Bengali numerals, English. characters. But these segment architectures are different to each other and we cannot use the same architecture for I. INTRODUCTION uniform display of multiple languages, that’s why I have India is country of multiple languages. Most Indians speak proposed 17-segment display as a uniform architecture to a language belonging either to the Indo-European (74%), the display multiple numerals at a time.
    [Show full text]