Balinese Latin Text Becomes Aksara Bali Using Rule Base Method
Total Page:16
File Type:pdf, Size:1020Kb
I Putu Agus Eka Darma Udayana et. al., International Journal of Research in IT, Management and Engineering, ISSN 2249-1619, Impact Factor: 6.123, Volume 07 Issue 05, May 2017, Page 1-7 Balinese Latin Text Becomes Aksara Bali Using Rule Base Method I Putu Agus Eka Darma Udayana1, Made Sudarma2, and I Nyoman Satya Kumara3 1(Magister Program of Electrical and Computer Engineering, Udayana University Graduate Program, Sudirman Denpasar, Bali-Indonesia) 2,3(Department of Electrical and Computer Engineering, Faculty of Engineering, Udayana University Jimbaran Campus, Bali-Indonesia) Abstract: Lontar is one of the cultural heritage which has information about history of Balinese civilization in the past away. Problems encountered today are that the lontar are not well maintained. While, the lontar that used as letter of Balinese alphabet will be worn out soon, because it doesn’t have good endurance for long time. The exploiting of technology is one of media that can be used as a solution to solve the problems. The digitalization process of letter Balinese alphabet can be done by rewrite the script of that lontar in Balinese alphabet by using translation script with Rule Base and Levenshtein Distance Approach. The exploiting of technology will make the lontar becoming digital form and it won’t be worn out when the lontar is kept safe for long time and information that consisted in lontar can be protected for long time. Keywords: Translitation, rule base, levenshtein distance, aksara bali. I. INTRODUCTION Aksara Bali (Balinese Alphabet) is a symbol visual system that showed in a media, it has function to uncover the elements that expressing a language(Sudarma et al., 2016). Based on the pronouncing, Aksara Bali (Balinese Alphabet) devided itno some groups called warga aksara. This division is based by Sansekerta Panini’s principle such as; warga kanthya, talawya, murdhanya, dantya dan osthya (Surada, 2007). That’s not easy for Aksara Bali (Balinese Alphabet) to hold in every development of new era. Lontar is one of traditional documentation media that has been endowed by the Balinese ancestors which has information about all culture in Bali by using Aksara Bali (Balinese alphabet) as the mediator (Ginarsa, 1976). So that’s why, the Aksara Bali (Balinese alphabet) that written in lontar must be protected. But, there will be a lot of challenge that should be faced in trying this preservation. One of them is limited financing by the government to take care all these documents. While, that lontar which is used as media of aksara Bali (Balinese alphabet) text doesn’t have good endurance to keep for long time. The exploiting of this technology is one of great media to be used in preserve the Balinese culture which has in lontar written in aksara Bali (Balinese alphabet), this is appropriate with S.K. nomor 179 tahun 1995 and circulation letter no 01/1995 that explained to the society in using aksara Bali (Balinese Alphabet) (Mahendra et al., 1996). Other than that, one of the efforts from the government in preserve the Balinese culture in form of lontar is by doing the translation the lontar from aksara Bali (Balinese alphabet) become text in Balinese language. For preserving the Balinese culture in form of lontar is not enough if you translate all the lontar only. The real lontar which is written in lontars leaf in Balinese alphabet should be become modern. The step of Balinese alphabet being modern in that lontar is done by rewriting the result of lontar translation in Balinese language Latin text become Balinese alphabet in digital form or computerization. The result of rewriting process from Latin text in Balinese language become Balinese alphabet called translation The same papers explain about translation had been done by using Syllabification Approach method (Joshi et al., 2013). In this paper, the method is used for solving the problems which is faced the similarity of two different languages. There are some other method that can be used for solving the translation problem, those methods are Rule Base, Decision Treedan Hybrid (Bhalla et al., 2013)(Kaur et al., 2014)(Kaur et al., 2012). There is a paper which do reviewing about the solving of translation process, in the paper is explained the great method for doing translation with structure case in different language called Rule Base(Kaur and Kaur, 2014). Based on the support of paper and problems in doing translation Latin text become Balinese alphabet, so this research uses Rule-Base method which combined with the application of spell character uses Levenshtein Distance method on preprocessing step. This combination method is used to decrease the level of mistakes, because there are some words in Latin of Balinese language which has homonym meaning, so that will often do mistakes in translation progress when using Rule Base method. http://indusedu.org Page 1 This work is licensed under a Creative Commons Attribution 4.0 International License I Putu Agus Eka Darma Udayana et. al., International Journal of Research in IT, Management and Engineering, ISSN 2249-1619, Impact Factor: 6.123, Volume 07 Issue 05, May 2017, Page 1-7 Need of the Study Lontar is a Balinese cultural heritage which has information about the history of Balinese civilization in the past away. Till today, that cultural heritage still tries to be kept its preservation by the society although it start to lose the application and comprehension about letter of Balinese alphabet in Balinese’s society from generation to the next generation. As time passes the lontar will be worn out soon, because it doesn’t have good endurance for long time. This technology is needed to rewrite existing manuscripts lontar in digital form, so that the manuscript lontar can hold for a long time. II. FUNDAMENTAL THEORY Aksara Bali Balinese alphabet is one of Balinese legacy by Balinese people which is valuable because it has so many culture values of society in Bali (Darma et al., 2015). The history of Balinese alphabet has close relation with the alphabet in India. Balinese alphabet is from India when Hindu and Buddha came to Indonesia (Sudarma and Surya, 2014). Aksara (Alphabet) can be meant as a visual symbol system which written on a media, it has function to shows the elements which express the language (Trieha, 2014). Based on its pronunciation aksara Bali (Balinese alphabet) is divided become some groups which is called warga akasara (Surada, 2007). Based on the Pesamuhan Agung which is done in 1963 that have decided kinds of letters which is made to write or presenting Balinese alphabet by Latin text (Sudarma and Sutramiani, 2014). Aksara suara is basically as same as vocal letter in Latin. Table 1 is aksara group which include in Aksara suara (Sutramiani et al., 2015). In Balinese alphabet (aksara Bali), AksaraWianjana is also called consonant. Although, its written without vocal letter in the way of writing. Table 2 is the classification of AksaraWianjana based on Warga Aksara. Table-1: Aksara Suara Table-2: Aksara Wianjana Lontar One of the heritages from the ancestors in the Indonesia Archipelago is a text which is called lontar. Lontar (Java: ron tal, “daun tal”(tal leaf)) is siwalan leaf or tal (Borassus flabellifer atau palmyra) that had been dried. Lontar tree (Borassus flabellifer) is a kind of palma(palm) which grow in South East and South of Asia. In Indonesia Archipelago, the relic of lontar had been found in some area such as in Bali, Java, Lombok and Sulawesi. That Lontar is used as place for writing before paper (Sudarma, 2015). Figure1: Capture Lontar Babad Brahmana Other than Lontar, there was also another media which was used by our ancestors as writing media before paper like nipa’s leaf, wood’s skin and goat’s skin. The old script is one of heritage by our ancestor which has important value. For now, The dominant old script which are in Indonesia are kept by a person and the other are kept in government instance. Transliteration Transliteration can be called as a progress of changing words in one language into the other language and still have the same structure (Kaur and Kaur, 2014). Transliteration can be as a words conversion progress which is written in one language into another language too by keeping the pronunciation. In this case, you will http://indusedu.org Page 2 This work is licensed under a Creative Commons Attribution 4.0 International License I Putu Agus Eka Darma Udayana et. al., International Journal of Research in IT, Management and Engineering, ISSN 2249-1619, Impact Factor: 6.123, Volume 07 Issue 05, May 2017, Page 1-7 hear about transliteration and translation. Transliteration can be used to translate the meaning and the great technique from language, but translation will interpret from the text or communicate the same message from original language. Tokenizing Tokenizing is a segmentation progress or divide a text documents become token, it means the structure of characters which represent the words by tokenizer (Indranandita et al., 2011). Tokenizer divides a string into some tokens based on the certain characters. In tokenizing progress, the characters usually divided as guidance for doing the segmentation sentence become token such as spasi, tab, new row (white space character). This is the example of tokenizing in the sentence “tiang sampun ngajeng”. Input : tiang sampun ngajeng Output : [tiang] [sampun] [ngajeng] Rule Base Rule Base method is a technique that used basic language rules in doing transliteration progress (Kaur and Kaur, 2014). Other than rules, dictionary data is needed too for every word in two languages. So, every word is translated one by one, and then arranged again based on the basic language rule.