1 Brahmi Word Recognition by Supervised Techniques
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Method to Accommodate Backward Compatibility on the Learning Application-Based Transliteration to the Balinese Script
(IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 12, No. 6, 2021 A Method to Accommodate Backward Compatibility on the Learning Application-based Transliteration to the Balinese Script 1 3 4 Gede Indrawan , I Gede Nurhayata , Sariyasa I Ketut Paramarta2 Department of Computer Science Department of Balinese Language Education Universitas Pendidikan Ganesha (Undiksha) Universitas Pendidikan Ganesha (Undiksha) Singaraja, Indonesia Singaraja, Indonesia Abstract—This research proposed a method to accommodate transliteration rules (for short, the older rules) from The backward compatibility on the learning application-based Balinese Alphabet document 1 . It exposes the backward transliteration to the Balinese Script. The objective is to compatibility method to accommodate the standard accommodate the standard transliteration rules from the transliteration rules (for short, the standard rules) from the Balinese Language, Script, and Literature Advisory Agency. It is Balinese Language, Script, and Literature Advisory Agency considered as the main contribution since there has not been a [7]. This Bali Province government agency [4] carries out workaround in this research area. This multi-discipline guidance and formulates programs for the maintenance, study, collaboration work is one of the efforts to preserve digitally the development, and preservation of the Balinese Language, endangered Balinese local language knowledge in Indonesia. The Script, and Literature. proposed method covered two aspects, i.e. (1) Its backward compatibility allows for interoperability at a certain level with This study was conducted on the developed web-based the older transliteration rules; and (2) Breaking backward transliteration learning application, BaliScript, for further compatibility at a certain level is unavoidable since, for the same ubiquitous Balinese Language learning since the proposed aspect, there is a contradictory treatment between the standard method reusable for the mobile application [8], [9]. -
Proposal for Generation Panel for Sinhala Script Label
Proposal for the Generation Panel for the Sinhala Script Label Generation Ruleset for the Root Zone 1 General information Sinhala belongs to the Indo-European language family with its roots deeply associated with Indo-Aryan sub family to which the languages such as Persian and Hindi belong. Although it is not very clear whether people in Sri Lanka spoke a dialect of Prakrit at the time of arrival of Buddhism in the island, there is enough evidence that Sinhala evolved from mixing of Sanskrit, Magadi (the language which was spoken in Magada Province of India where Lord Buddha was born) and local language which was spoken by people of Sri Lanka prior to the arrival of Vijaya, the founder of Sinhala Kingdom. It is also surmised that Sinhala had evolved from an ancient variant of Apabramsa (middle Indic) which is known as ‘Elu’. When tracing history of Elu, it was preceded by Hela or Pali Sihala. Sinhala though has close relationships with Indo Aryan languages which are spoken primarily in the north, north eastern and central India, was very much influenced by Tamil which belongs to the Dravidian family of languages. Though Sinhala is related closely to Indic languages, it also has its own unique characteristics: Sinhala has symbols for two vowels which are not found in any other Indic languages in India: ‘æ’ (ඇ) and ‘æ:’ (ඈ). Sinhala script had evolved from Southern Brahmi script from which almost all the Southern Indic Scripts such as Telugu and Oriya had evolved. Later Sinhala was influenced by Pallava Grantha writing of Southern India. -
ED192569.Pdf
DOCUMENT RESUME ED 192 569 FL 011 690 AUTHOR MacDougall, Bonnie Graham: de Abrew, Kamini TITLE Sinhala: Basic Course. Module 1: Beginning Signsand Letters. IN Foreign Service (Dept. of State), Washington,D.C. Foreign Service Inst. PUB DATE 79 NOTE 119p.: For related documents, see FL 011 699-700. Photographs will not reproduce well. AVAILABLE FROM Superintendent of Documents, U.S. Government Printing Office, Washington, DC 20402 (No. 044-000-01764-4, $4.25) LANGUAGE English: Singhalese ErfS PRICE MF01/PC05 Plus Postage. DESCRIPTORS *Alphabets: Learning Modules: Postsecondary Education: Reading Instruction: SecondLanguage Instruction: *Singhalese: *Writing Instruction: Written Language ABSTRACT This course on the language of Sri Lanka is intended to be taken under d S=LL'hala-speaking instructor. Thismodule introduces the Sinhala writing system. The emphasisof the module is on letter recognition. Directions for writing the symbols in the "basic" alphabet are provided so that studentswill have a culturally appropriate and phonetically accurate method of writingdown words. Manv photographs of Sinhala signs are included. Each of2B lessons covers a specific aspect of reading and writing characters. With the addition of four practice reading sections at the conclusionof the module, this first part of the coursecan be completed in about 15 hours. ( 8) **** * ***** *** * * **** ******** * **** Reproductions supplied by ERRS are the best thatcan be made from the original document. *****4**************************************** **** ** MODULE I -
Proposal for a Kannada Script Root Zone Label Generation Ruleset (LGR)
Proposal for a Kannada Script Root Zone Label Generation Ruleset (LGR) Proposal for a Kannada Script Root Zone Label Generation Ruleset (LGR) LGR Version: 3.0 Date: 2019-03-06 Document version: 2.6 Authors: Neo-Brahmi Generation Panel [NBGP] 1. General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Kannada LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document: proposal-kannada-lgr-06mar19-en.xml Labels for testing can be found in the accompanying text document: kannada-test-labels-06mar19-en.txt 2. Script for which the LGR is Proposed ISO 15924 Code: Knda ISO 15924 N°: 345 ISO 15924 English Name: Kannada Latin transliteration of the native script name: Native name of the script: ಕನ#ಡ Maximal Starting Repertoire (MSR) version: MSR-4 Some languages using the script and their ISO 639-3 codes: Kannada (kan), Tulu (tcy), Beary, Konkani (kok), Havyaka, Kodava (kfa) 1 Proposal for a Kannada Script Root Zone Label Generation Ruleset (LGR) 3. Background on Script and Principal Languages Using It 3.1 Kannada language Kannada is one of the scheduled languages of India. It is spoken predominantly by the people of Karnataka State of India. It is one of the major languages among the Dravidian languages. Kannada is also spoken by significant linguistic minorities in the states of Andhra Pradesh, Telangana, Tamil Nadu, Maharashtra, Kerala, Goa and abroad. -
Improvement Accuracy of Recognition Isolated Balinese Characters with Deep Convolution Neural Network
Journal of Applied Intelligent System (e-ISSN : 2502-9401 | p-ISSN : 2503-0493) Vol. 4 No. 1, 2019, pp. 22 – 27 Improvement Accuracy of Recognition Isolated Balinese Characters with Deep Convolution Neural Network Ida Bagus Teguh Teja Murti*1 Universitas Pendidikan Ganesha, Jalan Udayana No 11 Singaraja Bali 8116, (+62362) 22570 E-mail : [email protected]*1 *Corresponding author Abstract - The numbers of Balinese script and the low quality of palm leaf manuscripts provide a challenge for testing and evaluation for character recognition. The aim of high accuracy for character recognition of Balinese script,we implementation Deep Convolution Neural Network using SmallerVGG (Visual Geometry Group) Architectur for character recognition on palm leaf manuscripts. We evaluated the performance that methods and we get accuracy 87,23% . Keywords - Classfication, Deep Neural Network, Balinese Characters, Visual Geometry Group 1. INTRODUCTION Isolated handwritten character recognition (IHCR) has been the subject of vigorous research in recent years. Some populer methods in this case are CNN for recognition numbers in the MNIST digits image database [1] . There are challenges in character recognition in some scripts, such as Chinese characters [2] and Balinese characters [3] , which have more characters than numbers in MNIST. Beside numbers character there is issue in recognition character in document made palm leaf manuscript that is the condition of document has been degenerated. Therefore the researcher must carry out the process of digitizing the palm leaf manuscript, automatic analysis and the indexing system of the script simultaneously. The activity goal is bring added value to digital palm leaf manuscripts by developing tools for analyzing, developing and accessing quickly and efficiently script content. -
View Entire Book
ODISHA REVIEW VOL. LXX NO. 8 MARCH - 2014 PRADEEP KUMAR JENA, I.A.S. Principal Secretary PRAMOD KUMAR DAS, O.A.S.(SAG) Director DR. LENIN MOHANTY Editor Editorial Assistance Production Assistance Bibhu Chandra Mishra Debasis Pattnaik Bikram Maharana Sadhana Mishra Cover Design & Illustration D.T.P. & Design Manas Ranjan Nayak Hemanta Kumar Sahoo Photo Raju Singh Manoranjan Mohanty The Odisha Review aims at disseminating knowledge and information concerning Odisha’s socio-economic development, art and culture. Views, records, statistics and information published in the Odisha Review are not necessarily those of the Government of Odisha. Published by Information & Public Relations Department, Government of Odisha, Bhubaneswar - 751001 and Printed at Odisha Government Press, Cuttack - 753010. For subscription and trade inquiry, please contact : Manager, Publications, Information & Public Relations Department, Loksampark Bhawan, Bhubaneswar - 751001. Five Rupees / Copy E-mail : [email protected] Visit : http://odisha.gov.in Contact : 9937057528(M) CONTENTS Sri Krsna - Jagannath Consciousness : Vyasa - Jayadeva - Sarala Dasa Dr. Satyabrata Das ... 1 Good Governance ... 3 Classical Language : Odia Subrat Kumar Prusty ... 4 Language and Language Policy in India Prof. Surya Narayan Misra ... 14 Rise of the Odia Novel : 1897-1930 Jitendra Narayan Patnaik ... 18 Gangadhar Literature : A Bird’s Eye View Jagabandhu Panda ... 23 Medieval Odia Literature and Bhanja Dynasty Dr. Sarat Chandra Rath ... 25 The Evolution of Odia Language : An Introspection Dr. Jyotirmati Samantaray ... 29 Biju - The Greatest Odia in Living Memory Rajkishore Mishra ... 31 Binode Kanungo (1912-1990) - A Versatile Genius ... 34 Role of Maharaja Sriram Chandra Bhanj Deo in the Odia Language Movement Harapriya Das Swain ... 38 Odissi Vocal : A Unique Classical School Kirtan Narayan Parhi .. -
Sino-Tibetan Languages 393
Sino-Tibetan Languages 393 Gair J W (1998). Studies in South Asian linguistics: Sinhala Government Press. [Reprinted Sri Lanka Sahitya and other South Asian languages. Oxford: Oxford Uni- Mandalaya, Colombo: 1962.] versity Press. Karunatillake W S (1992). An introduction to spoken Sin- Gair J W & Karunatillake W S (1974). Literary Sinhala. hala. Colombo: Gunasena. Ithaca, NY: Cornell University South Asia Program. Karunatillake W S (2001). Historical phonology of Sinha- Gair J W & Karunatillake W S (1976). Literary Sinhala lese: from old Indo-Aryan to the 14th century AD. inflected forms: a synopsis with a transliteration guide to Colombo: S. Godage and Brothers. Sinhala script. Ithaca, NY: Cornell University South Asia Macdougall B G (1979). Sinhala: basic course. Program. Washington D.C.: Foreign Service Institute, Department Gair J W & Paolillo J C (1997). Sinhala (Languages of the of State. world/materials 34). Mu¨ nchen: Lincom. Matzel K & Jayawardena-Moser P (2001). Singhalesisch: Gair J W, Karunatillake W S & Paolillo J C (1987). Read- Eine Einfu¨ hrung. Wiesbaden: Harrassowitz. ings in colloquial Sinhala. Ithaca, NY: Cornell University Reynolds C H B (ed.) (1970). An anthology of Sinhalese South Asia Program. literature up to 1815. London: George Allen and Unwin Geiger W (1938). A grammar of the Sinhalese language. (English translations). Colombo: Royal Asiatic Society. Reynolds C H B (ed.) (1987). An anthology of Sinhalese Godakumbura C E (1955). Sinhalese literature. Colombo: literature of the twentieth century. Woodchurch, Kent: Colombo Apothecaries Ltd. Paul Norbury/Unesco (English translations). Gunasekara A M (1891). A grammar of the Sinhalese Reynolds C H B (1995). Sinhalese: an introductory course language. -
The Unicode Standard, Version 4.0--Online Edition
This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consor- tium and published by Addison-Wesley. The material has been modified slightly for this online edi- tion, however the PDF files have not been modified to reflect the corrections found on the Updates and Errata page (http://www.unicode.org/errata/). For information on more recent versions of the standard, see http://www.unicode.org/standard/versions/enumeratedversions.html. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The Unicode® Consortium is a registered trademark, and Unicode™ is a trademark of Unicode, Inc. The Unicode logo is a trademark of Unicode, Inc., and may be registered in some jurisdictions. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. -
Balinese Latin Text Becomes Aksara Bali Using Rule Base Method
I Putu Agus Eka Darma Udayana et. al., International Journal of Research in IT, Management and Engineering, ISSN 2249-1619, Impact Factor: 6.123, Volume 07 Issue 05, May 2017, Page 1-7 Balinese Latin Text Becomes Aksara Bali Using Rule Base Method I Putu Agus Eka Darma Udayana1, Made Sudarma2, and I Nyoman Satya Kumara3 1(Magister Program of Electrical and Computer Engineering, Udayana University Graduate Program, Sudirman Denpasar, Bali-Indonesia) 2,3(Department of Electrical and Computer Engineering, Faculty of Engineering, Udayana University Jimbaran Campus, Bali-Indonesia) Abstract: Lontar is one of the cultural heritage which has information about history of Balinese civilization in the past away. Problems encountered today are that the lontar are not well maintained. While, the lontar that used as letter of Balinese alphabet will be worn out soon, because it doesn’t have good endurance for long time. The exploiting of technology is one of media that can be used as a solution to solve the problems. The digitalization process of letter Balinese alphabet can be done by rewrite the script of that lontar in Balinese alphabet by using translation script with Rule Base and Levenshtein Distance Approach. The exploiting of technology will make the lontar becoming digital form and it won’t be worn out when the lontar is kept safe for long time and information that consisted in lontar can be protected for long time. Keywords: Translitation, rule base, levenshtein distance, aksara bali. I. INTRODUCTION Aksara Bali (Balinese Alphabet) is a symbol visual system that showed in a media, it has function to uncover the elements that expressing a language(Sudarma et al., 2016). -
Association Relative À La Télévision Européenne G.E.I.E. Gtld: .Arte Status: ICANN Review Status Date: 2015-10-27 17:43:58 Print Date: 2015-10-27 17:44:13
ICANN Registry Request Service Ticket ID: S1K9W-8K6J2 Registry Name: Association Relative à la Télévision Européenne G.E.I.E. gTLD: .arte Status: ICANN Review Status Date: 2015-10-27 17:43:58 Print Date: 2015-10-27 17:44:13 Proposed Service Name of Proposed Service: Technical description of Proposed Service: ARTE registry operator, Association Relative à la Télévision Européenne G.E.I.E. is applying to add IDN domain registration services. ARTE IDN domain name registration services will be fully compliant with IDNA 2008, as well as ICANN's Guidelines for implementation of IDNs. The language tables are submitted separately via the GDD portal together with IDN policies. ARTE is a brand TLD, as defined by the Specification 13, and as such only the registry and its affiliates are eligible to register .ARTE domain names. The full list of languages/scripts is: Azerbaijani language Belarusian language Bulgarian language Chinese language Croatian language French language Greek, Modern language Japanese language Korean language Kurdish language Macedonian language Moldavian language Polish language Russian language Serbian language Spanish language Swedish language Ukrainian language Arabic script Armenian script Avestan script Balinese script Bamum script Batak script Page 1 ICANN Registry Request Service Ticket ID: S1K9W-8K6J2 Registry Name: Association Relative à la Télévision Européenne G.E.I.E. gTLD: .arte Status: ICANN Review Status Date: 2015-10-27 17:43:58 Print Date: 2015-10-27 17:44:13 Bengali script Bopomofo script Brahmi script Buginese -
Amendment No. 2 to Registry Agreement
Amendment No. 2 to Registry Agreement The Internet Corporation for Assigned Names and Numbers and Infibeam Incorporation Limited agree, effective as of _______________________________ (“Amendment No. 2 Effective Date”), that the modification set forth in this amendment No. 2 (the “Amendment”) is made to the 09 January 2014 .OOO Registry Agreement between the parties, as amended (the “Agreement”). The parties hereby agree to amend Exhibit A of the Agreement by deleting section 5.3 in its entirety: [OLD TEXT] “5.3 Registry Operator may offer registration of IDNs in the following languages/scripts (IDN Tables and IDN Registration Rules will be published by the Registry Operator as specified in the ICANN IDN Implementation Guidelines): 5.3.1. Arabic script 5.3.2. Armenian script 5.3.3. Avestan script 5.3.4. Azerbaijani language 5.3.5. Balinese script 5.3.6. Bamum script 5.3.7. Batak script 5.3.8. Belarusian language 5.3.9. Bengali script 5.3.10. Bopomofo script 5.3.11. Brahmi script 5.3.12. Buginese script 5.3.13. Buhid script 5.3.14. Bulgarian language 5.3.15. Canadian Aboriginal script 5.3.16. Carian script 5.3.17. Cham script 5.3.18. Cherokee script 5.3.19. Chinese language 5.3.20. Coptic script 5.3.21. Croatian language 5.3.22. Cuneiform script 5.3.23. Cyrillic script 5.3.24. Devanagari script 5.3.25. Egyptian Hieroglyphs script 5.3.26. Ethiopic script 5.3.27. French language 5.3.28. Georgian script 5.3.29. Glagolitic script 5.3.30. -
Proposal to Encode ORIYA SIGN OVERLINE in the UCS From: Lorna Priest Evans (SIL International) Date: 1 January 2019
To: UTC and ISO/IEC JTC1/SC2 WG2 Title: Proposal to encode ORIYA SIGN OVERLINE in the UCS From: Lorna Priest Evans (SIL International) Date: 1 January 2019 We wish to propose the addition of ORIYA SIGN OVERLINE in the “Oriya (Odia)” block at the expected (preserved) position for length marks: U+0B55. The character is a combining mark and the glyph should appear as a straight macron above the base character. The Kuvi [kxv] language of India is spoken by 158,000 [Ethnologue]. The Kuvi language is written using the Oriya (Odia) script. However, length is quite important to distinguish in the Kuvi orthography. At this point there is no ordinary “LENGTH MARK” in the Oriya script (there are two characters 0B56 ◌ୖ ORIYA AI LENGTH MARK and 0B57 ◌ୗ ORIYA AU LENGTH MARK which have a different purpose and different appearance, and are described in The Unicode Standard, Chapter 12 South and Central Asia-I: Official Scripts of India). The length mark proposed here has the appearance of a macron above: ◌̅ In the Table below, one can see the same words listed without and with the length mark. Transliteration is marked, and the definition of the word is included. It is quite clear that the length mark is needed in order to understand the writing system. There have been some language materials printed using this orthography and those examples will be shown below. Proposal to Encode ORIYA SIGN OVERLINE Page 1 of 10 Lorna Priest Evans January 01, 2019 The ORIYA SIGN OVERLINE acts like a secondary modifier.