Improvement Accuracy of Recognition Isolated Balinese Characters with Deep Convolution Neural Network
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Method to Accommodate Backward Compatibility on the Learning Application-Based Transliteration to the Balinese Script
(IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 12, No. 6, 2021 A Method to Accommodate Backward Compatibility on the Learning Application-based Transliteration to the Balinese Script 1 3 4 Gede Indrawan , I Gede Nurhayata , Sariyasa I Ketut Paramarta2 Department of Computer Science Department of Balinese Language Education Universitas Pendidikan Ganesha (Undiksha) Universitas Pendidikan Ganesha (Undiksha) Singaraja, Indonesia Singaraja, Indonesia Abstract—This research proposed a method to accommodate transliteration rules (for short, the older rules) from The backward compatibility on the learning application-based Balinese Alphabet document 1 . It exposes the backward transliteration to the Balinese Script. The objective is to compatibility method to accommodate the standard accommodate the standard transliteration rules from the transliteration rules (for short, the standard rules) from the Balinese Language, Script, and Literature Advisory Agency. It is Balinese Language, Script, and Literature Advisory Agency considered as the main contribution since there has not been a [7]. This Bali Province government agency [4] carries out workaround in this research area. This multi-discipline guidance and formulates programs for the maintenance, study, collaboration work is one of the efforts to preserve digitally the development, and preservation of the Balinese Language, endangered Balinese local language knowledge in Indonesia. The Script, and Literature. proposed method covered two aspects, i.e. (1) Its backward compatibility allows for interoperability at a certain level with This study was conducted on the developed web-based the older transliteration rules; and (2) Breaking backward transliteration learning application, BaliScript, for further compatibility at a certain level is unavoidable since, for the same ubiquitous Balinese Language learning since the proposed aspect, there is a contradictory treatment between the standard method reusable for the mobile application [8], [9]. -
1 Brahmi Word Recognition by Supervised Techniques
Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 5 June 2020 doi:10.20944/preprints202006.0048.v1 Brahmi word recognition by supervised techniques Neha Gautam 1*, Soo See Chai1, Sadia Afrin2, Jais Jose3 1 Faculty of Computer Science and Information Technology, University Malaysia Sarawak 2 Institute of Cognitive Science, Universität Osnabrück 3Amity University, Noida, India *[email protected] [email protected] Abstract: Significant progress has made in pattern recognition technology. However, one obstacle that has not yet overcome is the recognition of words in the Brahmi script, specifically the recognition of characters, compound characters, and word because of complex structure. For this kind of complex pattern recognition problem, it is always difficult to decide which feature extraction and classifier would be the best choice. Moreover, it is also true that different feature extraction and classifiers offer complementary information about the patterns to be classified. Therefore, combining feature extraction and classifiers, in an intelligent way, can be beneficial compared to using any single feature extraction. This study proposed the combination of HOG +zonal density with SVM to recognize the Brahmi words. Keeping these facts in mind, in this paper, information provided by structural and statistical based features are combined using SVM classifier for script recognition (word-level) purpose from the Brahmi words images. Brahmi word dataset contains 6,475 and 536 images of Brahmi words of 170 classes for the training and testing, respectively, and the database is made freely available. The word samples from the mentioned database are classified based on the confidence scores provided by support vector machine (SVM) classifier while HOG and zonal density use to extract the features of Brahmi words. -
Balinese Latin Text Becomes Aksara Bali Using Rule Base Method
I Putu Agus Eka Darma Udayana et. al., International Journal of Research in IT, Management and Engineering, ISSN 2249-1619, Impact Factor: 6.123, Volume 07 Issue 05, May 2017, Page 1-7 Balinese Latin Text Becomes Aksara Bali Using Rule Base Method I Putu Agus Eka Darma Udayana1, Made Sudarma2, and I Nyoman Satya Kumara3 1(Magister Program of Electrical and Computer Engineering, Udayana University Graduate Program, Sudirman Denpasar, Bali-Indonesia) 2,3(Department of Electrical and Computer Engineering, Faculty of Engineering, Udayana University Jimbaran Campus, Bali-Indonesia) Abstract: Lontar is one of the cultural heritage which has information about history of Balinese civilization in the past away. Problems encountered today are that the lontar are not well maintained. While, the lontar that used as letter of Balinese alphabet will be worn out soon, because it doesn’t have good endurance for long time. The exploiting of technology is one of media that can be used as a solution to solve the problems. The digitalization process of letter Balinese alphabet can be done by rewrite the script of that lontar in Balinese alphabet by using translation script with Rule Base and Levenshtein Distance Approach. The exploiting of technology will make the lontar becoming digital form and it won’t be worn out when the lontar is kept safe for long time and information that consisted in lontar can be protected for long time. Keywords: Translitation, rule base, levenshtein distance, aksara bali. I. INTRODUCTION Aksara Bali (Balinese Alphabet) is a symbol visual system that showed in a media, it has function to uncover the elements that expressing a language(Sudarma et al., 2016). -
Association Relative À La Télévision Européenne G.E.I.E. Gtld: .Arte Status: ICANN Review Status Date: 2015-10-27 17:43:58 Print Date: 2015-10-27 17:44:13
ICANN Registry Request Service Ticket ID: S1K9W-8K6J2 Registry Name: Association Relative à la Télévision Européenne G.E.I.E. gTLD: .arte Status: ICANN Review Status Date: 2015-10-27 17:43:58 Print Date: 2015-10-27 17:44:13 Proposed Service Name of Proposed Service: Technical description of Proposed Service: ARTE registry operator, Association Relative à la Télévision Européenne G.E.I.E. is applying to add IDN domain registration services. ARTE IDN domain name registration services will be fully compliant with IDNA 2008, as well as ICANN's Guidelines for implementation of IDNs. The language tables are submitted separately via the GDD portal together with IDN policies. ARTE is a brand TLD, as defined by the Specification 13, and as such only the registry and its affiliates are eligible to register .ARTE domain names. The full list of languages/scripts is: Azerbaijani language Belarusian language Bulgarian language Chinese language Croatian language French language Greek, Modern language Japanese language Korean language Kurdish language Macedonian language Moldavian language Polish language Russian language Serbian language Spanish language Swedish language Ukrainian language Arabic script Armenian script Avestan script Balinese script Bamum script Batak script Page 1 ICANN Registry Request Service Ticket ID: S1K9W-8K6J2 Registry Name: Association Relative à la Télévision Européenne G.E.I.E. gTLD: .arte Status: ICANN Review Status Date: 2015-10-27 17:43:58 Print Date: 2015-10-27 17:44:13 Bengali script Bopomofo script Brahmi script Buginese -
Amendment No. 2 to Registry Agreement
Amendment No. 2 to Registry Agreement The Internet Corporation for Assigned Names and Numbers and Infibeam Incorporation Limited agree, effective as of _______________________________ (“Amendment No. 2 Effective Date”), that the modification set forth in this amendment No. 2 (the “Amendment”) is made to the 09 January 2014 .OOO Registry Agreement between the parties, as amended (the “Agreement”). The parties hereby agree to amend Exhibit A of the Agreement by deleting section 5.3 in its entirety: [OLD TEXT] “5.3 Registry Operator may offer registration of IDNs in the following languages/scripts (IDN Tables and IDN Registration Rules will be published by the Registry Operator as specified in the ICANN IDN Implementation Guidelines): 5.3.1. Arabic script 5.3.2. Armenian script 5.3.3. Avestan script 5.3.4. Azerbaijani language 5.3.5. Balinese script 5.3.6. Bamum script 5.3.7. Batak script 5.3.8. Belarusian language 5.3.9. Bengali script 5.3.10. Bopomofo script 5.3.11. Brahmi script 5.3.12. Buginese script 5.3.13. Buhid script 5.3.14. Bulgarian language 5.3.15. Canadian Aboriginal script 5.3.16. Carian script 5.3.17. Cham script 5.3.18. Cherokee script 5.3.19. Chinese language 5.3.20. Coptic script 5.3.21. Croatian language 5.3.22. Cuneiform script 5.3.23. Cyrillic script 5.3.24. Devanagari script 5.3.25. Egyptian Hieroglyphs script 5.3.26. Ethiopic script 5.3.27. French language 5.3.28. Georgian script 5.3.29. Glagolitic script 5.3.30. -
Hindi Transliteration and Translation of Sanskrit-Old
__________________________________ Journal of Religious Culture Journal für Religionskultur Ed. by / Hrsg. von Edmund Weber in Association with / in Zusammenarbeit mit Matthias Benad, Mustafa Cimsit, Natalia Diefenbach, Martin Mittwede, Vladislav Serikov, Ajit S. Sikand, Ida Bagus Putu Suamba & Roger Töpelmann Goethe-Universität Frankfurt am Main in Cooperation with the Institute for Religious Peace Research / in Kooperation mit dem Institut für Wissenschaftliche Irenik Assistant Editor/ Redaktionsassistentin: Susan Stephanie Tsomakaeva ISSN 1434-5935 - © E.Weber – E-mail: [email protected]; [email protected] ; irenik.org/publikationen/jrc; http://publikationen.ub.uni-frankfurt.de/solrsearch/index/search/searchtype/series/id/16137; http://web.uni-frankfurt.de/irenik/ew.htm; http://irenik.org/; http://www.wissenschaftliche-irenik.org/ ____________________________________ No. 237 (2018) Hindi Transliteration and Translation of Sanskrit-Old Javanese Texts: An attempt to introduce Indian cultural influences abroad to native India1 By Ida Bagus Putu Suamba* 2 Abstract Strong imprints of Indian culture in various forms or modes of expressions are significantly found in Java. Sanskrit-Old Javanese texts, amongst those texts and traditions, were produced in the island in the periods between 9th to 15th cen. A.D. It covers various genres and subjects enriching indigenous culture in the archipelago. Tutur or tattva texts were one genre of them recorded the dynamic of Javanese intellectuals or poet-sages in pursuing the truth; they reveal metaphysical or theological aspects of Brahmanism, Saivism, Buddhism, Tantrism, Samkhya, Yoga, etc. This paper attempts to study ideas behind the Hindi transliteration and translation of those texts. This is a library reserach, the data were collected from Hindi translation of those texts. -
Benchmarking of Document Image Analysis Tasks for Palm Leaf Manuscripts from Southeast Asia
Journal of Imaging Article Benchmarking of Document Image Analysis Tasks for Palm Leaf Manuscripts from Southeast Asia Made Windu Antara Kesiman 1,2,* ID , Dona Valy 3,4, Jean-Christophe Burie 1, Erick Paulus 5, Mira Suryani 5, Setiawan Hadi 5, Michel Verleysen 2, Sophea Chhun 4 and Jean-Marc Ogier 1 1 Laboratoire Informatique Image Interaction (L3i), Université de La Rochelle, 17042 La Rochelle, France; [email protected] (J.-C.B.); [email protected] (J.-M.O.) 2 Laboratory of Cultural Informatics (LCI), Universitas Pendidikan Ganesha, Singaraja, Bali 81116, Indonesia; [email protected] 3 Institute of Information and Communication Technologies, Electronic, and Applied Mathematics (ICTEAM), Université Catholique de Louvain, 1348 Louvain-la-Neuve, Belgium; [email protected] 4 Department of Information and Communication Engineering, Institute of Technology of Cambodia, Phnom Penh, Cambodia; [email protected] 5 Department of Computer Science, Universitas Padjadjaran, Bandung 45363, Indonesia; [email protected] (E.P.); [email protected] (M.S.); [email protected] (S.H.) * Correspondence: [email protected] Received: 15 December 2017; Accepted: 18 February 2018; Published: 22 February 2018 Abstract: This paper presents a comprehensive test of the principal tasks in document image analysis (DIA), starting with binarization, text line segmentation, and isolated character/glyph recognition, and continuing on to word recognition and transliteration for a new and challenging collection of palm leaf manuscripts from Southeast Asia. This research presents and is performed on a complete dataset collection of Southeast Asian palm leaf manuscripts. It contains three different scripts: Khmer script from Cambodia, and Balinese script and Sundanese script from Indonesia. -
Universal Scripts Project: Statement of Significance and Impact
Universal Scripts Project: Statement of Significance and Impact The Universal Scripts Project expands the capabilities of the Internet by providing digital access to text materials from a variety of modern and historical cultures whose writing systems are not currently included in the international standard for electronic representation of scripts, known as Unicode. People who write in these scripts find it difficult to use email, compose and send documents electronically, and post documents on the World Wide Web, without relying on nonstandard fonts or other cumbersome workarounds, and are therefore left out of the “technological revolution.” About 66 scripts are currently included in the Unicode standard, but over 80 are not. Some 40 of these missing scripts belong to modern linguistic minorities in Africa, the Indian subcontinent, China, and other countries in Southeast Asia; about 40 are scripts of historical importance. The project’s goal for 2007–2008 is to provide the standards bodies overseeing character sets with proposals for 15 scripts to be included in the Unicode standard. The scripts selected for inclusion include 9 modern minority scripts and 6 historical scripts. The need is urgent, because the entire process, from first proposal to acceptance, typically takes from 2 to 5 years, and support among corporations and national bodies for adding more scripts to Unicode is uncertain. If the proposals are not submitted soon, these user communities will not be able to use their scripts in the near future. The scripts selected for this grant have established scholarly and user-community connections, which will help guarantee that the proposals meet the users' needs. -
Redalyc.Knowledge Representation and Phonological Rules for The
Computación y Sistemas ISSN: 1405-5546 [email protected] Instituto Politécnico Nacional México Antara Kesiman, Made Windu; Burie, Jean-Christophe; Ogier, Jean-Marc; Grangé, Philippe Knowledge Representation and Phonological Rules for the Automatic Transliteration of Balinese Script on Palm Leaf Manuscript Computación y Sistemas, vol. 21, núm. 4, 2017, pp. 739-747 Instituto Politécnico Nacional Distrito Federal, México Available in: http://www.redalyc.org/articulo.oa?id=61553900016 How to cite Complete issue Scientific Information System More information about this article Network of Scientific Journals from Latin America, the Caribbean, Spain and Portugal Journal's homepage in redalyc.org Non-profit academic project, developed under the open access initiative ISSN 2007-9737 Knowledge Representation and Phonological Rules for the Automatic Transliteration of Balinese Script on Palm Leaf Manuscript Made Windu Antara Kesiman1,3, Jean-Christophe Burie1, Jean-Marc Ogier1, Philippe Grangé2 1 University of La Rochelle, Laboratoire Informatique Image Interaction (L3i), La Rochelle, France 2 University of La Rochelle, Departement of Indonesian Language, La Rochelle, France 3 Universitas Pendidikan Ganesha, Laboratory of Cultural Informatics (LCI), Singaraja, Bali, Indonesia {made_windu_antara.kesiman, jean-christophe.burie, jean-marc.ogier, pgrange}@univ-lr.fr Abstract. Balinese ancient palm leaf manuscripts record Keywords. Knowledge representation, phonological many important knowledges about world civilization rules, automatic transliteration, Balinese script, palm leaf histories. They vary from ordinary texts to Bali’s most manuscript. sacred writings. In reality, the majority of Balinese can not read it because of language obstacles as well as tradition which perceived them as a sacrilege. Palm leaf 1 Introduction manuscripts attract the historians, philologists, and archaeologists to discover more about the ancient ways In many Southeast Asian countries, the literary of life. -
Study on Feature Extraction Methods for Character Recognition Of
Study on Feature Extraction Methods for Character Recognition of Balinese Script on Palm Leaf Manuscript Images Made Windu Antara Kesiman, Sophea Prum, Jean-Christophe Burie, Jean-Marc Ogier To cite this version: Made Windu Antara Kesiman, Sophea Prum, Jean-Christophe Burie, Jean-Marc Ogier. Study on Feature Extraction Methods for Character Recognition of Balinese Script on Palm Leaf Manuscript Images. 23rd International Conference on Pattern Recognition, Dec 2016, Cancun, Mexico. pp.4006- 4011. hal-01422135 HAL Id: hal-01422135 https://hal.archives-ouvertes.fr/hal-01422135 Submitted on 4 Jan 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Study on Feature Extraction Methods for Character Recognition of Balinese Script on Palm Leaf Manuscript Images Made Windu Antara Kesiman1, Sophea Prum2, Jean-Christophe Burie1, Jean-Marc Ogier1 1Laboratoire Informatique Image Interaction (L3i) University of La Rochelle, Avenue Michel Crépeau, 17042, La Rochelle Cedex 1, France {made_windu_antara.kesiman, jcburie, jean-marc.ogier}@univ-lr.fr 2Mimos Berhad, Kuala Lumpur, Malaysia [email protected] Abstract—The complexity of Balinese script and the poor The majority of Balinese have never read any palm leaf quality of palm leaf manuscripts provide a new challenge for manuscript because of language obstacles as well as tradition testing and evaluation of robustness of feature extraction which perceived them as sacrilege. -
A Historic Overview of Oriental Solmisation Systems Followed by an Inquiry Into the Current Use of Solmisation in Aural Training at South African Universities
A historic overview of oriental solmisation systems followed by an inquiry into the current use of solmisation in aural training at South African universities by Theunis Gabriël Louw (Student number: 13428330) Thesis presented in fulfilment of the requirements for the degree Magister Musicae at Stellenbosch University Department of Music Faculty of Arts and Social Sciences Supervisor: Mr Theo Herbst Co-supervisor: Prof. Hans Roosenschoon Date: December 2010 DECLARATION By submitting this thesis electronically, I declare that the entirety of the work contained therein is my own, original work, that I am the owner of the copyright thereof (unless to the extent explicitly otherwise stated) and that I have not previously in its entirety or in part submitted it for obtaining any qualification. Date: 1 November 2010 Copyright © 2010 Stellenbosch University All rights reserved i ABSTRACT Title: A historic overview of oriental solmisation systems followed by an inquiry into the current use of solmisation in aural training at South African universities Description: The purpose of the present study is twofold: I. In the first instance, it is aimed at promoting a better acquaintance with and a deeper understanding of the generally less well-known solmisation systems that have emerged within the oriental music sphere. In this regard a general definition of solmisation is provided, followed by a historic overview of indigenous solmisation systems that have been developed in China, Korea, Japan, India, Indonesia and the Arab world, thereby also confirming the status of solmisation as a truly global phenomenon. II. The second objective of the study was to investigate the current use of solmisation, and the Tonic Sol-fa system in particular, in aural training at South African universities. -
A Method for Scriptio Continua Management on the Transliteration to the Balinese Script
Turkish Journal of Computer and Mathematics Education Vol.12 No.6 (2021), 4182-4191 Research Article A Method for Scriptio Continua Management on the Transliteration to the Balinese Script G. Indrawan1, I W. Sutaya1, K. U. Ariawan1, M. S. Gitakarma1, I G. Nurhayata1, I K. Paramarta2 1Electrical Engineering and Computer Science Department 2Balinese Language Education Department Universitas Pendidikan Ganesha (Undiksha), Singaraja, Bali, Indonesia 81116 Abstract: This research proposed a method for scriptio continua management on the learning application-based transliteration to the Balinese Script. It has not been conducted yet and is important since the proposed method eases the learning of the Balinese Script as the transliteration output. This research is a technological preservation effort to the endangered Balinese local language knowledge in Indonesia through the multi- discipline collaboration between Engineering (especially on applied Computer Science) and Language discipline. The proposed method took care of two related aspects, i.e.: (1) The Balinese Script uses scriptio continua as a style of writing without spaces, so do its transliteration result to the Balinese Script; and (2) A switching management between scriptio continua and non-scriptio continua style is needed by the learning application for the ease of learning through the visual analysis of the transliteration output which is the Balinese Script. This study was conducted on the pioneering web-based transliteration learning application, BaliScript, with the Latin text as the input and the Balinese Script as the output. Through the experiment, the proposed method gave the expected transliteration result on scriptio continua management, which can switch between three styles of writing, i.e.: (1) scriptio continua; (2) unconditional non-scriptio continua; and (3) conditional non-scriptio continua.