Machine Learning Based Braille Transliteration of Odia Language Vinod Jha, K

International Journal of Innovative Technology and Exploring Engineering (IJITEE) ISSN: 2278-3075, Volume-9 Issue-5, March 2020 Machine Learning Based Braille Transliteration of Odia Language Vinod Jha, K. Parvathi reads the letters from the text file and converts each letter into Abstract: Braille transliteration of natural languages is Braille using Unicode mapping to Braille code. The Braille required for providing a better opportunity of learning and codes are mapped to corresponding Braille cells and then the creating opportunities of ceceity people. It allows a bigger Braille cells are organized into a standard Braille sheet. diaspora of non-blind teachers to have written communication However there is still no method which can be used to make with blind people. The present paper proposes a method of Braille transliteration of Handwritten and printed Odia characters the conversion of printed sheets into Odia Braille or automatically into Braille. The current work proposes a method of handwritten sheets into Odia Braille. This task has become Braille transliteration of Handwritten Odia text with industry physically realizable with considerably good accuracy applicable accuracy. The method first preprocesses the text and because of the design of high accuracy Optical character then segments it into characters and then uses an SVM classifier recognizer (OCR) for Odia language. The present paper trained on HOG features of Odia handwritten characters to proposes a method to automatically convert handwritten predict characters and maps the predicted printable character to characters into Braille, which can then further be used to its corresponding Braille with a very good accuracy. The method can further be used with text to speech engines to help the blind convert handwritten texts and printed texts into Braille. The students use this technique with refreshable Braille having audio method proposes a way to convert Odia handwritten texts into facility to listen the same. Braille as follows: take an image of the text as input and Keywords: Braille, Optical Character recognition, HOG, SVM, convert it into a printable text format by using Odia OCR, Unicode convert characters in the printable text into the corresponding Unicode and the map the Unicode to corresponding Braille I. INTRODUCTION code and finally to Braille cell. The ODIA OCR is designed using NIT Rourkela dataset of handwritten ODIA characters [26] which contains 320 unique samples each of 47 different India has a lion share of blind people of the world and it also characters. The SVM with linear kernel is trained using has one of the highest percentages of unemployable and Histogram of oriented gradient features with 8x8 block size. illiterate blind people. Therefore Bharati Braille is a language The highest accuracy on the OCR so obtained is 96 present. of utmost importance for the upliftment of blind people in The accuracy of printable character transliteration into Braille India. But unfortunately, there is a scarcity of resources like is 100 per cent. books, schools and even teachers to teach blind people. It is very time cumbersome and complex task to write books in Braille and even costly. However with the recent advancement in the field of technology it is possible to reduce the complexity and cost of teaching blind people. The automatic conversion of Braille documents into natural languages and speech helps any arbitrary tutor to teach blind students without even knowing Braille fluently. This reduces the need of special tutors to teach Braille. The automatic conversion of documents in natural languages to Braille helps in printing books written in any natural language into Braille. It is also possible to teach the blind students using a Fig.1. A Braille Fig.2. Braille Specification refreshable Braille slate which can refresh the dot pattern after cell taking input directly from the optical character recogniser and read the text like any non-blind person would do. This A braille cell for Indian scripts comprises of 6 dots. By conversion of Braille into natural language and vice versa is using one or more dots at a time 64 letters (symbols) in any called as Braille transliteration. There are many works done in language can be represented. the field of Braille transliteration of languages like English, Braille dots are embossed on a particular type of sheet and Arabic, etc. However there is scarcity of work in the field of it is read by sensing the embossment over the braille sheet by Braille transliteration of Indian local languages like Odia. The using fingers. For the mapping purpose the braille cells are work presented in [1] describes a realistic method of Braille given representation. In this paper a raised dot is considered transliteration of printable text in Odia. The transliterator as ‘1’ and a non-raised dot is considered as ‘0’. So the Braille cell shown in figure 1 is represented as ‘100100’(in a rowwise Revised Manuscript Received on March 06, 2020. reading). Vinod Jha, Assistant Professor, Department of Electronics and Telecommunication, KIIT, India. Dr. K. Parvathi, Administrator, Department of Electronics and Telecommunication, Andhra University, Visakhapatnam, India. Published By: Retrieval Number: E2820039520/2020©BEIESP Blue Eyes Intelligence Engineering DOI: 10.35940/ijitee.E2820.039520 1866 & Sciences Publication Machine Learning Based Braille Transliteration of Odia Language In this way one can have all the possible braille cells accuracies. Madhuri Yadav et.al in [16] [22] proposed an represented using 6 bit binary system. All the measures OCR based on shape description of the alphabet such as provided in Fig.2 are in inches. Braille dots are embossed on a histogram of oriented gradients and geometric moments. particular type of sheet and it is read by sensing the These features are used to train the classifier such as SVM and embossment over the braille sheet by using fingers. Although multi-layer perceptron. The work claims to achieve highest the thickness of the sheet is also standardized, it varies place recognition rate of 96.8% on one of the dataset. Another to place. Thickness of the braille sheet is directly proportional method [17] uses 8-directional and 16-directional gradient to its life. Less the thickness, less the number of times it could feature (DGF) input values to train two layer fully connected be read. A standard braille sheet consists of 40 lines with back propagation feed forward network. Sobel operator is maximum 25 characters each. The present work is carried out used to get the gradient features. The method claims to be on MATLAB with the screen resolution of 96 dpi for printing approximately 96% accurate with 16-DGF which also gives standard Braille cells. much more complexity. In another method [18] Hidden Markov model is used to recognize the middle layer after A. Related Works segmenting the word into three layers. A new feature named There is a scarcity of Braille Transliteration in Indian regional pyramid histogram of oriented gradient (PHOG) is used to languages. Recently in the last decade researchers have done recognise the middle zone. The work has also compared the considerable amount of work for the Braille Transliteration of efficiency of using various features such as Marte-Bunke Hindi, Bengali, Tamil, and Telugu etc. However, feature, PHOG, Gaber, G-PHOG and claims to get highest Transliteration of Odia braille is a work in progress. Braille accuracy of 94.51% for GPHOG features. Another approach Transliteration of Odia consists of two problems, converting [19] of character recognition used k-mean clustering for Braille document to Odia and vice-versa. An Optical Braille features extraction and features are used on SVM classifier Recognition (OBR) is achieved by first segmenting the with linear kernel. The highest accuracy which was claimed Braille cells from a document and then converting them into with SVM classifier is 95.86% and that with Euclidean Odia language characters. The present work focuses on distance is 81.7%. Another approach [20] introduces a converting scanned Odia texts into printable Braille form. In Devanagiri handwritten character dataset (DHCD) with 2018[1] describes a method of Odia Braille Transliteration of 92000 images of 46 different classes of characters of printable texts. This method also describes a common Devanagiri script segmented from handwritten documents. It algorithm for forward and reverse text transmission. It uses a proposed a deep convolution neural network with the use of Unicode of the letters and maps it to the corresponding braille dropout and dataset increment approach and claimed an cell. In another work [2] a method has been proposed for fast accuracy of 98.47% on the said dataset. İn [21], the proposed transliteration of Odia Braille. This method takes a printable has used curvelet transform which approximates the curved Odia text and maps it to the corresponding Braille. Further, it singularities of images very well. Then KNN is trained with searches for the number of dots present in the cell to reduce curvelet features which are obtained by evaluating statistics of the time taken in the mapping process. Researches have been thick and thin images by applying curvelet transform. The done for transcription of English [3], Tamil [3][4], Telugu, proposed recognition accuracy is 90% but the dataset valuate Bengali [5], Kannada etc. But they are limited to the was very small. The paper in [22] introduces a new transcription of printable texts or printed texts into Braille. handwritten character database and also lists and compares Gayathri Devi et.al [6] proposed a methodology in 2018 the existing database’s advantages and pitfalls. The standard which transliterate Tamil printed text into Braille using Deep databases are scanned using 300 dpi flatband scanner. The learning with claimed accuracy of 95.7%. Another work [7] biggest database with 92000 characters has been used but the proposed a method of transliteration of Kannada Braille into drawback is the lack of vowels in the dataset.

Machine Learning Based Braille Transliteration of Odia Language Vinod Jha, K

UNIT 2 Braille

Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress

Translation Strategies of the Non-Native Odia Translators (1807-1874)

A New Research Resource for Optical Recognition of Embossed and Hand-Punched Hindi Devanagari Braille Characters: Bharati Braille Bank

The Fontspec Package Font Selection for XƎLATEX and Lualatex

Transliteration of Indian Ancient Script to Braille Script Using Pattern Recognition Technique: a Review

Mapping India's Language and Mother Tongue Diversity and Its

JETIR Research Journal

Odia Identity, Language and Regionalism: a Historical Perspective

+3 Cbcs Syllabus

(IJTSRD) Volume 4 Issue 1, December 2019 Available Online: E-ISSN: 2456 – 6470

2013 Cover Page M1