Handwritten Gujarati Numeral Recognition Using Wavelet Transform Mikita Gandhi 1, Dr

Total Page:16

File Type:pdf, Size:1020Kb

Handwritten Gujarati Numeral Recognition Using Wavelet Transform Mikita Gandhi 1, Dr JASC: Journal of Applied Science and Computations ISSN NO: 1076-5131 Handwritten Gujarati Numeral Recognition using Wavelet Transform Mikita Gandhi 1, Dr. V. K. Thakar 2, Dr. H.N.Patel 3 Department of Electronics & Communication, ADIT, New V. V. Nagar Gujarat Technological University [email protected] [email protected] [email protected] Abstract—Handwritten character recognition is leading research area in last few decades. In India, different languages are used in different regions. The common algorithm cannot be used for all Indian languages. In this paper, the feature extraction using wavelet transform was used for recognition of handwritten Gujarati numerals. Wavelet transform was decomposed the image into four sub images: one approximation and three detailed sub images. The approximation was used as feature vector. Total 5000 samples were taken for analysis of the proposed algorithm, in which 80% was used during training and 20 % was used for testing. The Accuracy obtained using KNN classifier was 95.86%, 98.30% and 94.16% for level 2, level 3 and level 4 wavelet decomposition levels respectively. Keywords— Handwritten Gujarati Numeral Recognition, Wavelet Transform, DWT, KNN I. INTRODUCTION This The basic difficulty in any handwritten document processing system is in recognizing the handwritten characters contained in the article. In the field of pattern recognition and image processing, the most ongoing challenging research area is hand written character recognition. A lot of work has been carried out for recognizing handwritten characters in English language and several softwares are available for handwritten reorganization of English language, but no such software was available for Indian languages, particular for Guajarati language. Across India there are approximately 46 million speakers use the Dravidian language that is Gujarati (around 4% of the total population of India). Unlike the English numerals, Gujarati numerals are written by curved lines, so there were large varieties of writing styles founded in handwritten Gujarati Numerals. Ten symbols 0,1,2,3,4,5,6,7,8,9 are used to represent 0,1,2,3,4,5,6,7,8,9 numbers respectively in Guajarati Numeral system. The objective of the system development is to help kids to be taught and recognize numerals in “learn with play” manner and also development of automatic recognition of amount written on bank cheque for banking application and marks written on answer sheet for academic application. In this paper, the offline handwritten Guajarati numeral recognition work was carried out using wavelet transformation as a feature extraction method. The accuracy was measured using KNN (K-Nearest Neighbor) classifier. Paper is organized as follows: Section 2 discusses about related work carried out in Guajarati numeral recognition using wavelet transform, Section 3 describe the Proposed Algorithm, Section 4 illustrates the Discrete Wavelet Transform; KNN discussed in Section 5; Results and Discussions in section 6; Section 7 presents the conclusion and remarks. II. RELATED WORK The most of the people in Gujarat state used Gujarati as first language for communication. Due to various applications of handwritten Gujarati numerals, an OCR system is required. A number of recognition systems have been proposed for handwritten Gujarati numerals. Survey relevant to recognition of handwritten Gujarati Numeral is given below. Kamal Moro et al [1] in 2013 have proposed pattern based feature extraction method in which done the sum of horizontal, vertical, left and right diagonal pixels sum. Using neural network classifier, 80.5% accuracy was achieved on 600 testing samples. Baheti M. J. et al [2] in 2012 worked on Affine Invariant Moments as a feature extraction method and have shown the comparison between different classifier. The algorithm was tested on 1600 samples in which 50%was applied for training and 50% was applied for testing and got recognition rate of 92.28% for SVM, 87.2% for Gaussian distribution function, 90.04% K-NN classifier and 84.1%for PCA classifier. Swital J. Macwan et al [3] in 2015 have worked on Gujarati character recognition and testing was done on 7800 samples database. Volume VI, Issue IV, April/2019 Page No:2699 JASC: Journal of Applied Science and Computations ISSN NO: 1076-5131 The SVM classifier was applied on feature set taken by Discrete Wavelet Transform, Discrete Cosine Transform and Discrete Fourier Transform. The preprocessing metods like noise removal, binarization, segmentation, size normalization Thinning and Spur removal was used to increase the recognition rate of classifier. The accuracy achieved 89.46% by DWT,89.31% by DCT and 96.06% by DFT. Jignesh Dholakia et. al. [4] in 2005 worked on zonal identification. Apurva Desai [5] in 2010 has proposed object profile based feature extraction method and got 82% success for recognize Guajarati numerals using a multi layered feed forward neural network classifier. V. A. Naik et al [6] in 2018 have used different structural and statistical features set of 22 features. These features were generated using chain code and tested on 2000 numeral samples. Using polynomial kernel in SVM classifier, the algorithm achieved 95% accuracy. Statistical features were used by Dinesh Satange et al [7] to identify offline handwritten Gujarati Numerals using Multi Layer Perceptron (MLP) network and got 90% accuracy. Sekhar Mandal et al [8] in 2011 have proposed algorithm on machine-printed character recognition in Bangla language. The two dimensional wavelet transform and gradient information was used as a feature extraction method. 1475 sampled images features were applied in KNN classifier which had achieved 88.95% accuracy. Saleem Pasha et al [9] in 2015 have shown problem of handwritten Kannada Character Recognition. Statistical features like Corner Detection, Corner Detection, Quadrant Density, Aspect Ratio, Width Features and Discrete wavelet transform were used to extract the features. The preprocessed was applied to DWT transform and then the resultant four sub bands were applied to IDWT to produce one dimensional 128 feature set. The combination of these two feature sets and ANN classifier has been reached 91% accuracy with 1000 data samples. S.Tharani et al [10] presented the application of DWT in Tamil and English handwritten characters. Using Haar wavelet transform, the feature sets were computed at decomposition levels 1, 2 and 3. Abdurazzag Ali ABURAS et al [11] in 2007 worked on handwriting Arabic characters using wavelet transformation. The Euclidean distance method was used for classified the characters and got approximately 80% accuracy average and for some characters reached up to97.9% accuracy. III. DATABASE PREPARATION The database was prepared by 500 writers of all age groups, with variety of variation pen or pencil in thickness of nib and variety of color and style of writing. Fig 1 show the scan copy of handwritten Gujarati Numeral database. Fig 1 sample copy of handwritten Gujarati numarals. The scan copy was converted in to binary form and using morphological operations handwritten numerals were segmented and resized into 64X64 pixels. Single sample of each digit from each writers were collected, so total 500X10 =5000 samples database was formed. IV. PROPOSED METHOD The sampled image was first converted into grayscale image then Discrete Wavelet Transform (DWT) was applied on grayscale image and produced four sub band images. The approximation component of sub band images was taken as a feature set. The detail of wavelet transform was described in next section. Volume VI, Issue IV, April/2019 Page No:2700 JASC: Journal of Applied Science and Computations ISSN NO: 1076-5131 KNN was used as classification method with k=1, 3 and 5. Fivefold cross validation was used to find resultant accuracy. Figure 2 shown proposed method block diagram for Handwritten Gujarati Numeral Read the input image Convert it into Gray scale Image Apply Discrete Wavelet Transform for level 2, 3 and 4 Consider Approximation sub image as Feature Set Classification using KNN Classified Numeral Fig .2 Block diagram of proposed method V. FEATURE EXTRACTION METHOD A. Concepts of Wavelets Wavelet is a mathematical function which was used in digital signal processing to recover weak signal from noise and used in image compression to upload image on internet. Wavelet analysis has ability to analyze rapidly changing transient which attract it used in various application. For provide more accurate temporal and frequency information wavelet transform is better choice than Fourier transform. The wavelet provides the time-frequency representation of signal. To analyze the wavelet consider the complex valued function 휓 satisfy the following two conditions: i. The function ψ has finite energy that implies ∞ | |2 ( ) ∫−∞ 휓 푑푡 < ∞ … … 1 ii. It is the admissibility condition. ∞ |횿(ω)|2 푐 = 2휋 ∫ 푑휔 < ∞ ……….(2) 휓 −∞ |휔| Where 횿 is Fourier transform of 휓 . This condition says that if 횿(ω) is smooth than 횿(0) = 0 The function 휓 called mother wavelet. The continuous wavelet transform (CWT) of 1-D f(x) signal is given by 1 +∞ 푋−푏 푊(푎, 푏) = ∫ 푓(푥)Ψ∗ ( ) 푑푥 ……(3) √푎 −∞ 푎 Where Ψ∗ denotes complex conjugates of , a is time dilation parameter and b is translation parameter. Discrete wavelet transforms (DWT), which transforms a discrete time signal to a discrete wavelet representation. It converts an input series X0, X1, ..Xn-1, into one high-pass wavelet coefficient series and one low-pass wavelet coefficient series (of length n/2 each) given by: 푘−1 퐻푖 = ∑푛=0 푋2푖−푛푠푛(푍)…..(4) 푘−1 퐿푖 = ∑푛=0 푋2푖−푛푡푛(푍)…..(5) Volume VI, Issue IV, April/2019 Page No:2701 JASC: Journal of Applied Science and Computations ISSN NO: 1076-5131 Where sn(Z) and tn(Z) are called wavelet filters, K is the length of the filter and i=0,...,[n/2]-1. Most popular mother wavelets are shown in figure 3: Fig. 3 Mother wavelets B.
Recommended publications
  • The Origins, Evolution and Decline of the Khojki Script
    The origins, evolution and decline of the Khojki script Juan Bruce The origins, evolution and decline of the Khojki script Juan Bruce Dissertation submitted in partial fulfilment of the requirements for the Master of Arts in Typeface Design, University of Reading, 2015. 5 Abstract The Khojki script is an Indian script whose origins are in Sindh (now southern Pakistan), a region that has witnessed the conflict between Islam and Hinduism for more than 1,200 years. After the gradual occupation of the region by Muslims from the 8th century onwards, the region underwent significant cultural changes. This dissertation reviews the history of the script and the different uses that it took on among the Khoja people since Muslim missionaries began their activities in Sindh communities in the 14th century. It questions the origins of the Khojas and exposes the impact that their transition from a Hindu merchant caste to a broader Muslim community had on the development of the script. During this process of transformation, a rich and complex creed, known as Satpanth, resulted from the blend of these cultures. The study also considers the roots of the Khojki writing system, especially the modernization that the script went through in order to suit more sophisticated means of expression. As a result, through recording the religious Satpanth literature, Khojki evolved and left behind its mercantile features, insufficient for this purpose. Through comparative analysis of printed Khojki texts, this dissertation examines the use of the script in Bombay at the beginning of the 20th century in the shape of Khoja Ismaili literature.
    [Show full text]
  • Gujarati Handwritten Numeral Optical Character Through Neural Network and Skeletonization
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Diponegoro University Institutional Repository GUJARATI HANDWRITTEN NUMERAL OPTICAL CHARACTER THROUGH NEURAL NETWORK AND SKELETONIZATION Kamal MORO*, Mohammed FAKIR, Badr Dine EL KESSAB, Belaid BOUIKHALENE, Cherki DAOUI (dont delete this line. It is used to insert authors detail) Abstract — This paper deals with an optical character recognition (OCR) system for handwritten Gujarati numbers. One may find so much of work for Indian languages like Hindi, Kannada, Tamil, Bangala, Malayalam, Gurumukhi etc, but Gujarati is a language for which hardly any work is Fig. 2 Confusing Gujarati digits traceable especially for handwritten characters. The features of Gujarati digits are abstracted by four different profiles of This paper addresses the problem of handwritten digits. Skeletonization and binarization are also done for Gujarati numeral recognition. Gujarati numeral preprocessing of handwritten numerals before their recognition requires binarization and skeletonozation as classification. This work has achieved approximately 80,5% of preprocess. Further, profiles are used for feature extraction success rate for Gujarati handwritten digit identification. and artificial neural network (ANN) is suggested for the classification. Index Terms —Optical character recognition, neural network, feature extraction, Gujarati handwritten digits, II. DATABASE skeletonization, classification. For handwritten English numerals, we have the CEDAR (Centre of Excellence for Document Analysis I. INTRODUCTION and Recognition at the University of New York at ujarati belonging to Devnagari family of Buffalo, USA) numeral database. It contains Glanguages, which originated and flourished in approximately 5000 samples of numerals. It contains Gujarat a western state of India, is spoken by over 50 approximately 5000 samples of numerals.
    [Show full text]
  • Miramo Mmcomposer Reference Guide
    Miramo® mmComposer mmComposer Reference Guide VERSION 1.5 Copyright © 2014–2018 Datazone Ltd. All rights reserved. Miramo®, mmChart™, mmComposer™ and fmComposer™ are trademarks of Datazone Ltd. All other trademarks are the property of their respective owners. Readers of this documentation should note that its contents are intended for guidance only, and do not constitute formal offers or undertakings. ‘License Agreement’ This software, called Miramo, is licensed for use by the user subject to the terms of a License Agreement between the user and Datazone Ltd. Use of this software outside the terms of this license agreement is strictly prohibited. Unless agreed otherwise, this License Agreement grants a non-exclusive, non-transferable license to use the software programs and related document- ation in this package (collectively referred to as Miramo) on licensed computers only. Any attempted sublicense, assignment, rental, sale or other transfer of the software or the rights or obligations of the License Agreement without prior written con- sent of Datazone shall be void. In the case of a Miramo Development License, it shall be used to develop applications only and no attempt shall be made to remove the associated watermark included in output documents by any method. The documentation accompanying this software must not be copied or re-distributed to any third-party in either printed, photocopied, scanned or electronic form. The software and documentation are copyrighted. Unless otherwise agreed in writ- ing, copies of the software may be made only for backup and archival purposes. Unauthorized copying, reverse engineering, decompiling, disassembling, and creating derivative works based on the software are prohibited.
    [Show full text]
  • A Survey of Gujarati Handwritten Character Recognition Techniques
    6 IX September 2018 International Journal for Research in Applied Science & Engineering Technology (IJRASET) ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 6.887 Volume 6 Issue IX, Sep 2018- Available at www.ijraset.com A Survey of Gujarati Handwritten Character Recognition Techniques Arpit A. Jain1, Harshal A. Arolkar2 Assistant Professor1, Associate Professor2 , GLS University Abstract: OCR termed as Optical Character Recognition, is a technique to convert mechanically or electronically an image, photo or scanned document of a handwritten or printed text into machine encoded text. HCR termed as Handwritten Character Recognition, is a form of OCR that is specifically designed to recognize the handwritten text. OCR and HCR nowadays are used extensively for information entry from printed or handwritten data records. In this paper we have done a survey on Gujarati Handwritten Character Recognition techniques. Keywords: OCR, Optical Character Recognition, HCR, Handwritten Character Recognition, Image Processing, Gujarati HCR, Gujarati Handwritten Character Recognition. I. INTRODUCTION Characters of any language are created using two types of mechanism namely; Digital and Handwritten Format. The digital characters are created with the help of a computer. The handwritten characters are the one’s that are written by person. Handwritten characters can further be classified into two categories: Offline and Online. The offline characters are written using any normal pen; while online characters are created using an optical pen or stylus on an electronic device. Figure 1 and Figure 2 shows the sample of offline and online characters. Figure 1: Offline Characters Figure 2: Online Characters ©IJRASET: All Rights are Reserved 461 International Journal for Research in Applied Science & Engineering Technology (IJRASET) ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 6.887 Volume 6 Issue IX, Sep 2018- Available at www.ijraset.com Languages like English, French, and Spanish have alphabets and vowels.
    [Show full text]
  • Auction 47 9Th January 2016 Nashik
    Auction 47 9th January 2016 Nashik World of Coins Highlights of Auction 21 31 32 50 43 55 66 76 83 77 105 107 167 168 172 177 170 Auction 47 World of Coins Saturday, 9th January 2016 6.00 pm onwards at Indraprashtha Hall VIEWING Old Gangapur Naka, Gangapur Road, Monday 4 Jan. 2016 11:00 am - 6:00 pm Nashik 422002 Tuesday 5 Jan. 2016 11:00 am - 6:00 pm In Association with Wednesday 6 Jan. 2016 11:00 am - 6:00 pm Rare Fair 2016 Thursday 7 Jan. 2016 11:00 am - 6:00 pm Category Lots At Rajgor’s SaleRoom Ancient Coins 1-44 605 Majestic Shopping Centre, Near Church, 144 JSS Road, Hindu Coins of Medieval India 45-48 Opera House, Mumbai 400004 Sultanate Coins of Islamic India 49-61 Friday 8 Jan. 2016 11:00 am - 6:00 pm Coins of Mughal Empire 62-75 Coins of Independent Kingdoms 76-94 Saturday 9 Jan. 2016 11:00 pm - 4:00 pm Princely States of India 95-171 At the Nashikvenue European Powers in India 172-183 British India 184-191 Republic of India 192-193 DELIVERY OF LOTS Foreign Coins 194-204 Delivery of Auction Lots will be done from the Badges 205 Mumbai Office of the Rajgor’s. Medals 206-208 Philately 209-216 BUYING AT RAJGOR’S For an overview of the process, see the Numismatic Books 217-327 Easy to buy at Rajgor’s Paper Money 328-333 Front cover: Lot 76 • Back cover: Lot 329 CONDITIONS OF SALE This auction is subject to Important Notices, Conditions of Sale and to Reserves To download the free Android App on your ONLINE CATALOGUE Android Mobile Phone, View catalogue and leave your bids online at point the QR code reader application on your www.Rajgors.com smart phone at the image on left side.
    [Show full text]
  • Numbering Systems Developed by the Ancient Mesopotamians
    Emergent Culture 2011 August http://emergent-culture.com/2011/08/ Home About Contact RSS-Email Alerts Current Events Emergent Featured Global Crisis Know Your Culture Legend of 2012 Synchronicity August, 2011 Legend of 2012 Wednesday, August 31, 2011 11:43 - 4 Comments Cosmic Time Meets Earth Time: The Numbers of Supreme Wholeness and Reconciliation Revealed In the process of writing about the precessional cycle I fell down a rabbit hole of sorts and in the process of finding my way around I made what I think are 4 significant discoveries about cycles of time and the numbers that underlie and unify cosmic and earthly time . Discovery number 1: A painting by Salvador Dali. It turns that clocks are not as bad as we think them to be. The units of time that segment the day into hours, minutes and seconds are in fact reconciled by the units of time that compose the Meso American Calendrical system or MAC for short. It was a surprise to me because one of the world’s foremost authorities in calendrical science the late Dr.Jose Arguelles had vilified the numbers of Western timekeeping as a most grievious error . So much so that he attributed much of the worlds problems to the use of the 12 month calendar and the 24 hour, 60 minute, 60 second day, also known by its handy acronym 12-60 time. I never bought into his argument that the use of those time factors was at fault for our largely miserable human-planetary condition. But I was content to dismiss mechanized time as nothing more than a convenient tool to facilitate the activities of complex societies.
    [Show full text]
  • Recognition of Spoken Gujarati Numeral and Its Conversion Into Electronic Form
    International Journal of Engineering Research & Technology (IJERT) ISSN: 2278-0181 Vol. 3 Issue 9, September- 2014 Recognition of Spoken Gujarati Numeral and Its Conversion into Electronic Form Bharat C. Patel Apurva A. Desai Smt. Tanuben & Dr. Manubhai Trivedi Dept. of Computer Science, Veer Narmad South Gujarat . College of information science, University, Surat, Gujarat, India Surat, Gujarat, India, Abstract— Speech synthesis and speech recognition are the area of interest for computer scientists. More and more A. Gujarati language researchers are working to make computer understand Gujarati is an Indo-Aryan language, descended from naturally spoken language. For International language like Sanskrit. Gujarati is the native language of the Indian state of English this technology has grown to a matured level. Here in this paper we present a model which recognize Gujarati TABLE I. PRONUNCIATION OF EQUIVALENT ENGLISH AND GUJARATI numeral spoken by speaker and convert it into machine editable NUMERALS. text of numeral. The proposed model makes use of Mel- Frequency Cepstral Coefficients (MFCC) as a feature set and K- English Pronunciation Gujarati Pronunciation Nearest Neighbor (K-NN) as classifier. The proposed model Digits Numerals 1 One Ek achieved average success rate of Gujarati spoken numeral is 2 Two Be about 78.13%. 3 Three Tran Keywords—speech recognition;MFCC; spoken Gujarati 4 Four Chaar numeral; KNN 5 Five Panch 6 Six Chha NTRODUCTION I. I 7 Seven Saat Speech recognition is a process in which a computer can 8 Eight Aath identify words or phrases spoken by different speakers in 9 Nine Nav different languages and translate them into a machineIJERTIJERT 0 Zero Shoonya readable-format.
    [Show full text]
  • International Journal of Computer Sciences and Engineering Open Access Research Paper Vol.-6, Issue-9, Sept
    International Journal of Computer Sciences and Engineering Open Access Research Paper Vol.-6, Issue-9, Sept. 2018 E-ISSN: 2347-2693 Online Handwritten Gujarati Numeral Recognition Using Support Vector Machine V. A. Naik1*, A. A. Desai2 1,2Department of Computer Science, Veer Narmad South Gujarat University, Surat, Gujarat, India * Corresponding Author: [email protected] Available online at: www.ijcseonline.org Accepted: 22/Sept/2018, Published: 30/Sept/2018 Abstract - In this paper, online handwritten numeral recognition for Gujarati is proposed. Online handwritten character recognition is in trend for research due to a rapid growth of handheld devices. The authors have compared Support Vector Machine (SVM) with linear, polynomial, and radial basis function kernels. The authors have used hybrid feature set. The authors have used zoning and chain code directional features which are extracted from each stroke. The dataset of the system is of 2000 samples and was collected by 200 writers and tested by 50 writers. The authors have achieved an accuracy of 92.60%, 95%, and 93.80% for linear, polynomial, RBF kernel and an average processing time of 0.13 seconds, 0.15seconds, and 0.18 seconds per stroke for linear, polynomial, RBF kernel. Keywords: Online Handwritten Character Recognition (OHCR), Handwritten Character Recognition (HCR), Optical Character Recognition (OCR), Support Vector Machine (SVM), Gujarati Numeral, Gujarati Digits I. INTRODUCTION There are many challenges in recognition of Gujarati digits because of variation in writing style and handwriting. Gujarati is an Indo-Aryan language and one of the official Gujarati digits have more curves than lines and there are languages of India, spoken by people of Gujarat state, union similar curves in some characters.
    [Show full text]
  • Invariant Moments Approach for Gujarati Numerals
    International Journal of Engineering and Applied Sciences (IJEAS) ISSN: 2394-3661, Volume-2, Issue-2, February 2015 Invariant Moments Approach for Gujarati Numerals Dr. Mamta Baheti characters in terms of lines, words and connected Abstract— Due to less reported work for Gujarati numerals components. By this effort, for connected component we have been motivated for same as Gujarati is a language not recognition rate was 78.34% for upper modifier recognition only of Indian states but widely spoken across world. We have. rate was 50% where as for lower modifier it was 77.55% and We have used noisy numerals for training and testing. Images for punctuation marks it was 29.6%, cumulative for overall it are pre-processed and then subjected to the proposed algorithm. was 72.3%. in our proposed algorithm we have used invariant moments as feature extraction technique and Gaussian distribution function In another work, Yajnik [5] had proposed an approach of as classifier. We found satisfactory results for some numerals. wavelet descriptors (Daubechies D4 wavelet coefficients) for The results can be improved by giving better quality images for image compression of printed Gujarati letters. They further training and testing. computed coefficients which were considered as an input to the recognizer (like nearest neighborhood or Neural Network Index Terms—Gujarati, Invariant moments,Gaussian architectures [6]-[7]) that reported them with results up to distribution function . 75% in compression. While reviewing literature, it was found that in 2005, Dholakia [8] have presented an algorithm to identify various zones. They have projected the use of I. INTRODUCTION horizontal and vertical profiles.
    [Show full text]
  • Nuicone 2015)
    2015 5th Nirma University International Conference on Engineering (NUiCONE 2015) Ahmedabad, India 26-28 November 2015 IEEE Catalog Number: CFP1555R-POD ISBN: 978-1-4799-9992-7 LIST OF FULL PAPERS ID No. Paper ID Title/Page No. Authors Analyzing effect of bad measurement data on load Jigar Patel, Daivat Desai, Vaibhav 38 555 flow and state estimation in power system 199 Patel, Dishang D. Trivedi and Santosh C. Vora VM Placement of Multidimensional Resources using Naisargi Patel and Govind Patel 39 350 Cartesian Co-ordinates Based Approach 205 Hierarchical Clustering Technique for Word Sense Nirali Patel, Bhargesh Patel, Rajvi 40 234 Disambiguation using Hindi WordNet 210 Parikh and Brijesh Bhatt Pose, Illumination and Expression Invariant Face Pradip Panchal, Palak Patel, 41 118 Recognition using Laplacian of Gaussian and Local Vandit Thakkar and Rachana Binary Pattern 215 Gupta Performance Enhancement of 12 X 160 Gbps (1.92 Rohit Patel and Dilip Kothari 42 519 Tbps) WDM Optical System for Transmission Distance upto 8000 km with Differential Coding 221 Arduino Controlled War Field Spy Robot using Jigneshkumar Patoliya, Haard 43 512 NightVision Wireless Camera and Android 227 Mehta and Hiteshkumar Patel Optimized Unscheduled Interchange Based Secondary Shital Pujara and Chetan Kotwal 44 310 Control for Two Area Deregulated Electricity Market 232 Introducing the Conceptual Model of Industrial Aarthi Raghavan 45 126 MOOCs (I-MOOCs) for Engineering Classes 240 Design Of Low Voltage Bandgap Reference Circuit Sushma Suresh Sangolli and 46 253
    [Show full text]
  • A Study and Comparative Analysis of Different Stemmer and Character Recognition Algorithms for Indian Gujarati Script
    International Journal of Computer Applications (0975 – 8887) Volume 106 – No.2, November 2014 A Study and Comparative Analysis of Different Stemmer and Character Recognition Algorithms for Indian Gujarati Script Rajnish M. Rakholia Jatinderkumar R. Saini Ph.D PhD Scholar, R K University Associate Professor and Director I/C Bhavnagar Highway Narmada College of Computer Application Rajkot – Gujarat, India Bharuch – Gujarat, India ABSTRACT 1.1.2 Area below and above the Baseline: used A lot of work has been reported on optical character for below-base and above-base dependent vowels recognition for various non-Indian scripts like Chinese, respectively. English and Japanese and Indian scripts like Tamil, Hindi Telugu, etc., in this paper, we present a literature review on 1.1.3 Area before and after the Baseline: this is stemmer, optical character recognition (OCR) and Text the placeholder for consonants and independent vowels mining work on Indian scripts, mainly on the Gujarati [31]. languages. We have discussed the different techniques for OCR and text mining in Gujarati scripts, and summarized 2. STEMMER FOR GUJARATI SCRIPT most of the published work on this topic and gives future Stemming is the process to transform the words in texts into directions of research in the field of Indian script. their grammatical root form. Sheth J and Patel B (2014) suggested DHIYA a stemmer for General Terms Gujarati language, EMILLE corpus is used for training and Stemmer, Gujarati character recognition evaluation of the stemmer's performance. They obtained Keywords accuracy of 92.41% [26]. Classification, feature extraction, Gujarati script, Gujarati In (Sheth and Patel, 2012) they discussed different stemming stemmer, Indian script, pre-processing and segmentation.
    [Show full text]
  • Gujarati Handwritten Character Recognition Using Hybrid Method
    International Journal of Engineering Research & Technology (IJERT) ISSN: 2278-0181 Vol. 2 Issue 6, June - 2013 Gujarati Handwritten Character Recognition Using Hybrid Method Based On Binary Tree-Classifier And K-Nearest Neighbour Chhaya Patel Apurva Desai MCA Department Department of Computer Science, Anand Institute of Information Science , Veer Narmad South Gujarat University, Anand, India Surat, India Abstract than handwritten form. A rich cultural heritage is available in Gujarati is a language used by more than 50 million handwritten form for this script. Being official language of people worldwide. Due to dissemination of ICT in India need state of Gujarat major correspondence within various for Optical Character Recognition (OCR) activities for Indian Government departments and other institutes is carried out script is in demand. One can obtain very less OCR related using Gujarati, either in handwritten or printed form. Many research work for Gujarati script, especially for handwritten OCR solutions are available for the other languages of Indian form. This paper describes a hybrid approach based on tree origin like Bangala, Devnagri, Gurumukhi but OCR solution classifier and k-Nearest Neighbor (k-NN) for recognition of for Gujarati handwritten form is not available. The optical handwritten Gujarati characters. Combination of structural character recognition of Gujarati script will definitely be features and statistical features is used for classification and helpful for developing a full-fledged OCR system for Gujarati. identification of characters. The features are relatively simple to derive. The structural features are selected by studying the This paper describes an approach to identify numerals and appearance of various handwritten characters. The moment characters of Gujarati script.
    [Show full text]