Deep Convolutional Neural Network for Recognition of Unified Multi-Language Handwritten Numerals
Total Page:16
File Type:pdf, Size:1020Kb
2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR) Deep Convolutional Neural Network for Recognition of Unified Multi-Language Handwritten Numerals Jaafar Alghazo Ghazanfar Latif Loay Alzubaidi Computer Engineering Department, Computer Science Department, Computer Science Department, Prince Mohammad Bin Fahd University, Prince Mohammad Bin Fahd University, Prince Mohammad Bin Fahd University, Al Khobar, Saudi Arabia. Al Khobar, Saudi Arabia. Al Khobar, Saudi Arabia. [email protected] [email protected] [email protected] M. Muzzamal Naseer Yazan Alghazo College of Engineering, Humanities Department, Prince Australian National University, Mohammad Bin Fahd University, Al Canberra, Australia. Khobar, Saudi Arabia. [email protected] [email protected] Abstract— Deep learning systems have recently gained handwritten. The classification phase involves using well- importance as the architecture of choice in artificial intelligence developed classification techniques such as Multilayer (AI). Handwritten numeral recognition is essential for the Perceptron (MLP) and Support Vector Machines (SVM) among development of systems that can accurately recognize digits in many others. This phase involves training the Artificial different languages which is a challenging task due to variant Intelligence (AI) system and then recognizing the remaining writing styles. This is still an open area of research for developing portion of the dataset or new data. These algorithms can also an optimized Multilanguage writer independent technique for utilize either supervised or unsupervised learning while training numerals. In this paper, we propose a deep learning architecture the AI system on the dataset. It is observed that mostly research for the recognition of handwritten Multilanguage (mixed papers have only tackled numerals written in one language numerals belongs to multiple languages) numerals (Eastern Arabic, Persian, Devanagari, Urdu, Western Arabic). The overall The importance of recognizing handwritten numerals is accuracy of the combined Multilanguage database was 99.26% apparent due to the increase usage of handheld devices in both with a precision of 99.29% on average. The average accuracy of personal and professional settings. Banking industry relies each individual language was found to be 99.322%. Results heavily on handwritten numeral recognition especially for indicate that the proposed deep learning architecture produces numerals written on checks. Other industries also would benefit better results compared to methods suggested in the previous from handwritten numeral recognition algorithms in addition to literature. individuals with different handheld device applications developed that allow users to write numbers rather than input Keywords—Arabic Numberals; Mul-Language Numerals Recognition; Hand Written Numerals; Deep Convolutional Neural them using a keyboard. Networks In this research, a deep Convolutional Neural Network (CNN) architecture is developed for handwritten Eastern Arabic I. INTRODUCTION and Persian numerals while experiments are also performed on some other languages including Urdu, Devanagari, Western Due to the infinite number of variations of handwritten Arabic which in total, are spoken by approximately 1.86 billion numerals that are subject to many variable, the recognition of people. The rest of this paper is structured as follows: section 2 handwritten numeral is a complex problem that has been subject confers the literature review, section 3 presents the proposed of research. A large number of algorithms have been developed system, section 4 details the experimental datasets, section 5 to tackle this problem for both offline and online recognition. discusses the results of the proposed recognition system and Offline recognition refers to recognizing numerals written on section 6 concludes the paper. paper using a pen and online recognition refers to numerals written on handheld devices e.g. stylus. Handwritten numerals can differ even for the same writer when writing on paper or a II. LITERATURE REVIEW handheld device and can vary even in the same medium. The Though the concept of Deep Learning has been around for algorithm consisting of pre-processing, feature extraction and quite some time, yet the utilization of Deep learning to its full the final phase of classification is used for both cases. In the potential was not achieved until recently with advances in preprocessing phase, the image is extracted whether in the technology, computing power and computing resources. Deep spatio-temporal or spatio-luminance representation, noise learning algorithms have achieved new records in different reduction, centering the image, etc. In the feature extraction fields such as image recognition and speech recognition among phase, researchers proposed different methods for feature others [1-2]. The concept of Deep Learning relies on what is extraction that can be used for in final step of classification of known as Convolutional Neural Networks (ConvNets or CNN). XXX-X-XXXX-XXXX-X/XX/$XX.00 ©20XX IEEE 978-1-5386-1459-4/18/$31.00 ©2018 IEEE 90 Authorized licensed use limited to: Prince Mohammad Bin Fahd University. Downloaded on August 13,2020 at 11:49:58 UTC from IEEE Xplore. Restrictions apply. 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR) LeNet 5 developed by LeCun, Y. et al. was one of the first are used in [20] for improving handwritten Chinese text convolutional networks used in the 1990s [3]. In 2012, recognition. Deep Neural Network is used as part of an Krizhevsky, A. et al. [4] introduced a newer CNN that was wider algorithm implemented in [21] for SAR target configuration and deeper than LeNet and subsequently won the ImageNet- recognition. Fisher Vector (FV) and CNN are used in [22] for 2012 Large Scale Visual Recognition Challenge (ILSVRC). visual recognition of the ancient inscriptions. This is considered by many as the initial point of the subsequent Since Deep learning algorithm handwritten numeral interest in Deep Learning. The accuracy rate achieved was recognition rate will be compared to widely studied Artificial 84.7%. The Architecture of [4] was modified into a new Intelligence Algorithms, a few references will be mentioned architecture proposed by M. D. Zeiler and R. Fergus in [5]. This here in the literature review. Different novel feature extraction new architecture ended up winning the ILSVRC 2013. It methods were developed and used in [23-25] for numeral and achieved an accuracy rate of 88.3%. An Inception module was character recognition. Different classifier have also been used to developed that shrinks the number of parameters in the network test the proposed algorithms. and was the contribution of the CNN developed by Szegedy, C. et al. and ended up winning the ILSVRC 2014 [6]. The accuracy rate achieved was 93.33%. In the same year and in the same III. PROPOSED SYSTEM competition Simonyan, K., & Zisserman, A. introduced a CNN proving that the number of layers in a deep network affects the A. Preprocessing performance [7]. They showed that results are improved by Numerals in Arabic, Urdu and Persian are written from left increasing the depth to 16-19 weighted layers. Their network to right. Their digits are mostly similar except for the numbers ended up winning second place in the 2014 ILSVRC. He, K. et (4) four, (5) five and (6) six. In this paper, we target the al. in [8] introduced residual network CNN and won the recognition of handwritten digits of these languages in addition ILSVRC 2015. The error rate achieved was 3.6%. Huang et al. to Western Arabic (English) and Devanagari which are all [9] introduced the densely connected CNN which exhibited shown in table 1. improvement over some previous architectures. Alghazo et al. also proposed an efficient geometric feature based numeral TABLE I. REPRESENTATION NUMERALS IN EASTERN ARABIC, recognition system for Arabic and Persian numerals [10]. Yet WESTERN ARABIC, PERSIAN, URDU AND DEVANAGARI the CNN introduced in [8] is still the most popular and preferred Western Eastern CNN architecture for practical applications. Persian Urdu Devanagari Arabic Arabic ० ٠ ٠ A number of papers have already been completed on digit or 0 १ ١ ١ character recognition using some types of Deep Learning 1 Architectures. For example, Singh et al. [11] compared the use २ ٢ ٢ of the deep Convolutional Neural Network and the fully 2 ३ ٣ ٣ connected Feed-forward Neural Network for the recognition of 3 the Devangari handwritten characters. They reported a 98.11% ४ ۴ ٤ 4 recognition rate. In [12], Deep-Learning Feed forward ۵ ५ ٥ Backpropagation Neural Network (DFBNN) along with 5 ۶ ६ ٦ Extreme Learning Machine (ELM) were used in handwritten 6 Numeral Recognition. The methods were applied to datasets in ७ ٧ ٧ Thai, Bangla and Devangari. DRBNN outperformed ELM 7 ८ ٨ ٨ slightly. In DFBNN, the accuracy rates were 98.4% for Thai, 8 95.84% for Bangla, and 78.4% for Devangari. Deep Belief ९ ٩ ٩ 9 Networks (DBN), A Deep learning Structure, was used in [13] for Bangla Handwritten Numeral Recognition. The recognition accuracy was 90.27%. DBN was also used in [14] for high- Handwriting styles are different from one person to another resolution Synthetic Aperture Radar (SAR) image classification. and even the handwriting of the same person can be different Alghazo proposed an online numerals recognition system using according to several variables. For example, more than 52 structural