Machine Transliteration Design for Old Malay Manuscript

Total Page:16

File Type:pdf, Size:1020Kb

Machine Transliteration Design for Old Malay Manuscript 2nd International Conference on Machine Learning and Computer Science (IMLCS'2013) May 6-7, 2013 Kuala Lumpur (Malaysia) Machine Transliteration Design for Old Malay Manuscript Che Wan Shamsul Bahri Che Wan Ahmad, Khairuddin Omar, Mohammad Faidzul Nasrudin, Mohd Zamri Murah, Sanusi Mohd Azmi name of Pedoman Ejaan Jawi Yang Disempurnakan Abstract—Jawi script is a script that has the Arabic influence. In (Guidelines of the Enhanced Jawi Spelling) is intended to meet the past, these writings are widely used by the Malay community as the present the needs of the future. This system is the result of well as foreigners who have diplomatic relations, business, a formula written in the Jawi National Convention held in missionary and such. At that time, the Malay language is the lingua 1984 at Kuala Terengganu, Malaysia. Jawi spelling system is franca of this region. So there are many Malay heritages such as manuscripts, religious books, letters, documents and other compiled by [2] as its basis. According to [3], the system agreements in the Jawi script. There are significant needs to do the involves five processes, which are: maintaining confirmed transliteration of the Jawi text on the materials to Malay Roman. words, perfecting the imperfect, creating the non-existence, Thus, research on machine transliteration will help the effort. Many clarifying the vague, tidying up the loose. In this paper, we researches in machine transliteration in the world for high-level define old Jawi as Jawi spelling system before era of Za’ba language have been done; such as English, European, and Asia TABLE I languages such as Chinese, Japanese, Korean and Arabic. However SAMPLE OLD AND NEW JAWI the research in the context of the Malay language is still lacking, Old Jawi New Jawi Roman (bagi (for ﺑﺎڬﻲ ﺑﮏ .especially those involving the Romanized transliteration of Jawi (segala (all ﺳڬﺎﻻ ﺳڬﻞ Jawi writing is quite different from the Urdu and Arabic although (olehmu (by you ﺍﻭﻟﻬﻴﻤﻮ ﺍﻭﻟﻬﻢ they share the same characters. Modern Jawi uses more vowel than (takut (fear ﺗﺎﮐﻮﺓ ﺗﺎﮐﺔ the old version. These papers discuss the previous studies related to (saudara (brother ﺳﺎﻭﺩﺍﺭﺍ ﺳﻮﺩﺭﺍ machine transliteration of the Malay language and approaches that can be used to develop it. (before 1949). According to [4] there is a 30% difference Keywords—Jawi, machine transliteration, Malay, rule base between the old and new Jawi spelling. The old and new Jawi spelling systems are differences as shown in Table I below. I. INTRODUCTION II. PROBLEM STATEMENT AWI writing today is actually a Malay writing with Arabic Jin fluences that have been used nearly 700 years ago. This is In the context of the Malay language, transliteration is used evidenced by the discovery of Terengganu Inscribed Stone, to change the Jawi spelling to Rumi or vice versa. The main dated 1303 AD [1]. However, the old Jawi is different problem in the transliteration is when there are no matching compare Jawi script today in terms of the use of the vowel, the characters in target text [5]. Jawi spelling system and Rumi writing techniques and the use of new letters. It is seen in line (Roman) spelling are two different spelling systems. Jawi with the development of the Malay language itself began to spelling read and written from right to left, while the reverse include some foreign words and technical terms especially in spelling of Rumi [6]. the advancement of science and technology today. In addition, old Jawi spelling were not using more letters in New Jawi spelling system was introduced in (1986) with the practicing the concept of economy in the use of Jawi character than modern today [7]. According to [8] Jawi spelling as long as it is not consistent (varies according to the author) and many Che Wan Shamsul Bahri C.W.Ahmad is with the International Islamic University College Selangor (KUIS), Bangi, 43000 Kajang, Selangor, occur ambiguous word compared to new Jawi spelling system. Malaysia (phone: +60389254251; fax: +6038925447; e-mail: The problems mentioned above are main challenges in the [email protected]). development machine transliteration for old Jawi to Rumi. Prof. Dr. Khairuddin Omar, Dr. Mohammad Faidzul Nasrudin and Mohd Zamri Murah was with Center for Artificial Intelligence Technology (CAIT), Universiti Kebangsaan Malaysia, Bangi, 43600 Kajang, Selangor, Malaysia. III. RELATED WORK (e-mail: [email protected], [email protected], [email protected]). Machine transliteration is an important matter in the Mohd Sanusi Azmi is with the Universiti Teknikal Malaysia (UTEM), Malacca, Malaysia (e-mail: [email protected]). application of natural language processing (NLP), especially in translating an entity name from one language to another [9]. 23 2nd International Conference on Machine Learning and Computer Science (IMLCS'2013) May 6-7, 2013 Kuala Lumpur (Malaysia) Among the world's fastest-growing language in its A. Homograph6B and Less Vowel transliteration mechanism is English, European, Asian Homograph is referring to two or more words that have the languages such as Chinese, Japanese, Korean, Arabic, Urdu, same spelling but different meanings[23]. Homograph can also Hindi, Punjabi, Taiwan, Korea, Japan, China, Thailand and coincided with a homonymous (same sound), for example, the others. Among them are [10], [11], [12], [13], [14], [15] and word mereka (to create) with mereka (they) which is similar in [5]. terms of raw sound[24]. In the Jawi script, most homograph In Malaysia, research on machine transliteration is long, but word occurred due to Jawi writing system itself that only use its development is quite slow particularly the Malay language -compared with six vowels (a, e (e (ﺍ ، ﻭ، ﻱ، ﻯ) four vowels as compared with foreign languages. pepet), e (e-taling), i, o, u) in Rumi writing. There are many studies in Malay machine transliteration There are more homograph problems in old Jawi spelling have done such as by [16], [17], [18], [19], [20], [21] and because the economic principle of the lesser use of the vowel [22]. All researches on Malay transliteration are related to modern Jawi, except study done by [19] and [20] are related to [7]. Here is an example of the old Jawi sentences occurs old Jawi. homograph. ﺍﻳﻪ ﭬﺎﻛﻲ ﺗﻮ ﻲﭬ ، ﺗﻴﻤﺒﻖ ﺗﻮ ﻲﭬ ﺑﺮﺳﺎﻡ ﺍﻧﭽﺊ ﺟﻮﻫﺮ ﺩ ﺟﻮﻫﺮ Several researchers such as [16], [17] and [18], using a .to Ayah pakai topiU ,U tembak tupaiU U bersama En. Johar di Johor ﺏ ,to a ﺍ character mapping technique that matches directly b, and so on. Therefore, there are many words that cannot be (Dad wear a hat, shoot squirrels with Mr Johar in Johor) converted correctly in the machine transliteration process. This ﺍﻳﻪ ﻣﺎﻛﻦ ڬﻮﻟﻲ، ڠﺳﺪ ﻜﻦ ﺍﻧﻖ ﺑﺮﻣﺎءﻳﻦ ڬﻮﻟﻲ is because there are more than matches for some of the letters in the target text. Ayah makan gulaiU ,U sedangkan anak bermain guliU .U In contrast to previous studies, [21] use rule-based (Dad eat curry, while children play marbles) transliteration technique in the study of Rumi to Jawi B. Spelling7B Not Consistency transliteration. Each Rumi word is processed following the In old Malay manuscripts, spelling varies used even in a Jawi spelling patterns. First, the word is divided into syllables, same book for some words. It also varies spelling between and then the syllables are matched with rule for Jawi authors for some words. There are classification of old Jawi conversion into Jawi spelling. Based on [21] research, there writer, including educated people in religion and palace are words that cannot be converted into Jawi precisely because writers anonymous, scholars and the general public[8]. Some some of the problems; loan words from Arabic and English, of these people are not educated about the method of Jawi the difficulty in distinguishing e-taling and e-pepet vowel, and writing. Thus they write not according to the rules, not [presentable and uncertain sentences. For example, [8 .ک or ﻕ the difficulty to determine whether the word has glottis In the Jawi vocabulary, several words classified as past law identified is in the pamphlets of ilmu wafaq (wafaq or exempt from the law. Several words are not subject to the knowledge), ilmu hikmat (wisdom knowledge), mantera rule of Jawi. These are among the difficulties faced by [21] (mantras), traditional medicine and others. because Jawi spelling system is so unique. [19] and [20] study is different as studying old Malay C. Sentence8B Structure manuscripts. [19] using a Kitab Nazam (old Malay Most of the old Jawi spelling or old manuscripts do not use Manuscript) that has been digitized in advance for the study. punctuation [25]. The beginning of a new sentence usually [19] uses stemming and filtering model which is to make the begins with the word bahawasanya (whereas), maka process of rooting and separation between old and new Jawi (therefore), lagi (again) as sentence delimiter signal, even words based on a Jawi corpus of built. While [20] use a though sometimes the words maka (then) and lagi (again) act transliteration based on grapheme to the epic of the old Malay as conjunctions. Word dan (and) also used at the beginning of manuscript, Merong Mahawangsa. With grapheme method, a sentence. In addition, there is a hadith or Quranic verses that .for old Jawi words, can be used as a delimiter sentence ڤ to ﻑ successfully converted letters [20] instead of using grapheme p in Rumi, not f. This is because D. Spelling System ڤ most of the old Jawi script does not distinguish phonemes ﻱ Most of the old spelling does not use a point on the letter can reduce the gap between the old and new Jawi. (ya) if in the end of the word. This is because many of the old[20] .ﻑ and ya) does) ﻯ Many other studies are more focused on Rumi to new Jawi books printed in Arabia [25].
Recommended publications
  • Issues on Arabic Scriptures in Malay/Indonesia Language: a Study of Its Contributions in the Development of Regional Culture, Language & Literature Literacy
    International Journal of Cultural and Art Studies (IJCAS) Vol. 03, No. 1, 2019 | 15 – 21 IJCAS International Journal of Cultural and Art Studies Issues on Arabic Scriptures in Malay/Indonesia Language: A Study of its Contributions in the Development of Regional Culture, Language & Literature Literacy Mardiah Mawar Kembaren 1*, Noordin Mohd Noor 2, M. Husnan Lubis 3 1 Malay Literature Study Program, Faculty of Cultural Sciences, University of Sumatera Utara, Medan, Indonesia 2 School of Languages, Literacies and Translation (SoLLaT), Universiti Sains Malaysia, Penang, Malaysia 3 Arabic Literature Study Program, Faculty of Cultural Sciences, University of Sumatera Utara, Medan, Indonesia Abstract. This paper touches on the important role of Islam in Malay literature and culture literacy, in the region. Malay literary and cultural treasures that were originally oral traditions have changed after the local community has embraced the teachings of Islam that bring along the Arabic scripture or Arabic writing system. This system of Arabic writing has greatly influenced Malay literature and culture. This paper therefore explores academically the role, contribution and discussion regarding the future of Arabic Scripture in the region. It would not be too much to say that it was the Arabic writing system that made the Malay community illiterate. This paper is expected to benefit not only to those who study literature and linguistic but for those with a background in Islamic studies too to study the regional literature and culture, especially those related to Malay studies. Keywords: Arabic scriptures, Malay Arabic Spelling, Literacy Received 20 September 2019 | Revised 11 October 2019 | Accepted 12 October 2019 1 Introduction The Arabic scripts also known as Jawi (also known as Arab Gundul or Arab Melayu in Indonesia) is a form of writing inherited by the Malay language derived from the Arabic language spelling system.
    [Show full text]
  • Methods of Learning and Writing Jawi Scripts Within the Malay Community: Past and Present Experiences
    METHODS OF LEARNING AND WRITING JAWI SCRIPTS WITHIN THE MALAY COMMUNITY: PAST AND PRESENT EXPERIENCES 1ASYRAF HJ AB RAHMAN, 2ABDUL MANAN ALI, 3FARHANAH BT ABDULLAH, 4FIRDAUS KHAIRI ABDUL KADIR, 5FADZLI ADAM, 6DAUD ISMAIL 1,2,3,4Centre for Fundamental and Liberal Education, Universiti Malaysia Terengganu 5,6Centre for Islamic Product and Malay Civilization (Inspire), Unisza Email: [email protected] Abstract - Jawi script is considered as the heritage of the Malay people and thus preserving and sustaining the script is vital since it shapes the identity of the Malays and their cultural heritage. This paper discusses some methods of learning and writing Jawi scripts within the Malay community and how the script writing system has gone through a systematic evolution through times due to requirements set by relevant authorities. Despite the latest development and evolution, the Jawi scripting system had played important roles as a medium to spread Islam, language development and Malay culture enhancements. Towards the end of the 20th century, the Jawi script began to fade away as it was gradually replaced by the Rumi (Roman or Latin alphabet) script that have a very strong influences in the education curriculum system. As a result, not many Malay younger generations know about Jawi alphabet yet again are unable to read or write in Jawi literature. These younger generations no longer fluent in Jawi literature compared to earlier days where everybody were Jawi literate regardless of their age category. Using qualitative research design involving content analysis of secondary data and interviews with selected informants amongst teachers and students of secondary schools, the study traced back the evolution of learning and writings Jawi script from past to the present days and demonstrated how different approaches were employed by those experts of Jawi scripts towards preserving the script and promoted its usage and interest amongst the Malay generation.
    [Show full text]
  • Title Writing As a Medium in Modern Malay Literature Author Muhammad Ariff Bin Ahmad Source Aksara
    Title Writing as a medium in modern Malay literature Author Muhammad Ariff bin Ahmad Source Aksara: Menjejaki tulisan Melayu = Aksara: The passage of Malay scripts (231 - 239) Published by National Library Board Singapore This document was digitised and archived with permission from the National Library Board Singapore. Citation: Muhammad Ariff Ahmad. (2009). Writing as a medium in modern Malay literature. In Juffri Supa’at, & Nazeerah Gopaul (Eds.), Aksara: Menjejaki tulisan Melayu = Aksara: The passage of Malay scripts (231 - 239). Singapore: National Library Board. ,gkinan wujud e-sastera (sastera elektronik) .i komputer. Writing as a Medium in Modern ~aimanapun, pada hemat saya, selagi e-sastera 1 1 melalui komputer masih perlu dibaca maka Malay Literature rumi akan tetap relevan dipakai sebagai mya. Muhammad Ariff bin Ahmad .llahu alam bissawab ... .billahi-taufiq wal-hidayah ... N HIS BOOK 'SARINAH' (1947), THE LATE IR. SOEKARNO, Ifirst President of Indonesia, had mentioned, "we learn history to make us intellectual". And everyone knows that 231 history is a record of life experiences, be it an individual's or a nation's. That statement has an effect: those who deny history will not be intellectuals. asuk Kesatuan Guru-guru Melayu Slngapura dan History often shows the evolution and progress of a life, according to the condi­ :atan Sasterawan 50. ( termasuk beberapa perwakilan penerbit dari Singapura tions, period and what is desired of the living. The same happens to Malay literature. ohor Bahru yang tel ah bertindak keluar (walk-out) History proves that Malay literature started off as oral literature because liter­ inggalkan persldangan kerana permohonannya menjadi ature develops by word of mouth.
    [Show full text]
  • Jawi Script and the Malay Society: Historical Background and Development
    International Journal of Management (IJM) Volume 11, Issue 7, July 2020, pp. 68-78, Article ID: IJM_11_07_008 Available online at http://iaeme.com/Home/issue/IJM?Volume=11&Issue=7 ISSN Print: 0976-6502 and ISSN Online: 0976-6510 DOI: 10.34218/IJM.11.7.2020.008 © IAEME Publication Scopus Indexed JAWI SCRIPT AND THE MALAY SOCIETY: HISTORICAL BACKGROUND AND DEVELOPMENT Farhanah Abdullah, Asyraf Hj Ab Rahman*, Abdul Manan Ali, Firdaus Khairi Abdul Kadir Centre for Fundamental and Continuing Education, Universiti Malaysia Terengganu, Malaysia Fadzli Adam Centre for Islamic Product and Malay Civilization (INSPIRE), Universiti Sultan Zainal Abidin Terengganu, Malaysia *Corresponding Author Email: [email protected] ABSTRACT Jawi script is always associated with the identity of the Malay community. Since decades, Jawi script has been growing rapidly among the Malay community and is associated with the mastery of the Quranic recitation. This is because most of the Jawi script uses Arabic letters in the Quran with some modification to suit the Malay pronunciation to some extents. Lately, there are some issues related to Jawi script and writing. The disadvantages of writing and reading Jawi writings amongst new generation of Malay Muslims are quite significant. This also affects the reading of the Qur'an either among the young or old generations. Several studies conducted confirm the issue. This paper will therefore discuss the development of Jawi script and its association with the Malay community. Some of the research findings related to the study on Jawi writings will be analyzed, besides looking at the future prospects of the Jawi script in Malaysian context.
    [Show full text]
  • Perspectives on Language Policies in Malaysia
    Durham E-Theses Perspectives on language policies in Malaysia Mariasoosay, Thanaraj How to cite: Mariasoosay, Thanaraj (1996) Perspectives on language policies in Malaysia, Durham theses, Durham University. Available at Durham E-Theses Online: http://etheses.dur.ac.uk/5185/ Use policy The full-text may be used and/or reproduced, and given to third parties in any format or medium, without prior permission or charge, for personal research or study, educational, or not-for-prot purposes provided that: • a full bibliographic reference is made to the original source • a link is made to the metadata record in Durham E-Theses • the full-text is not changed in any way The full-text must not be sold in any format or medium without the formal permission of the copyright holders. Please consult the full Durham E-Theses policy for further details. Academic Support Oce, Durham University, University Oce, Old Elvet, Durham DH1 3HP e-mail: [email protected] Tel: +44 0191 334 6107 http://etheses.dur.ac.uk PERSPECTIVES ON LANGUAGE POLICIES IN MALAYSIA by THANARAJ MARIASOOSAY A DISSERTATION SUBMITTED FOR THE DEGREE OF MASTER OF ARTS IN EDUCATION The copyright of this thesis rests with the author. No quotation from it should be published without the written consent of the author and information derived from it should be acknowledged. SCHOOL OF EDUCATION UNIVERSITY OF DURHAM 1996 ^ MAR 1998 PERSPECTiyES ON LANGUAGE POLICIES IN MALAYSLV by Thanaraj Mariasoosay for Master of Arts in Education 1996 ABSTRACT alaysia is a multi-cultural society with major ethnic divisions between MMalays, Chinese and Indians, each group having associated linguistic and religious affiliations which intensify the divisions of Malaysian society.
    [Show full text]
  • Orthographic Reforms of Standard Malay Online: Towards Better Pronunciation and Construction of a Cross-Language Environment*
    Mohd Zaidi Abd Rozan & Yoshiki Mikami 129 Journal of Universal Language 8 March 2007, 129-159 Orthographic Reforms of Standard Malay Online: Towards Better Pronunciation and Construction of a Cross-language Environment* Mohd Zaidi Abd Rozan & Yoshiki Mikami1 Nagaoka University of Technology Abstract The widespread use of communication tools on the Internet has provided greater possibilities as well as increased flexibility for orthographic reforms. As a result, spellings have been established that are closely matched to ‘written speech’, which emulate verbal expression plus the elegant addition of letters from foreign languages. This study will look in particular at Evolutionized Malay (EM), which is cultivated on the Internet. Our analysis of EM reveals two principal functions: 1) EM words performed better in *1 The authors would like to express their thanks to Professor Ahmad Zaki Abu Bakar (Universiti Teknologi Malaysia, Malaysia), Professor Robin Nagano Lee (University of Miskolc, Hungary), Dr. Chandrajith Ashuboda Marasinghe (Nagaoka University of Technology, Japan), Dr. Shigeaki Kodama (Asian Language Research Network), and Professor Brian Drier (Nagaoka University of Technology) for their careful attention, comments, and advice and also to Mr. Chew Yew Choong and Ms. Rizza Caminero (GII Lab, Nagaoka University of Technology, Japan) for their kind assistance. 130 Orthographic Reforms of Standard Malay Online displaying pronunciation properties and 2) EM words are able to form an environment we termed as Cross Language Environment (CLE). It may be that these roles hold a greater degree of attraction and motivates toward higher interaction within a cyberspace community. Surveys on the frequency of the occurrence of EM in the cyberspace of Malaysia are shown based on web forums and blogs pages.
    [Show full text]
  • The Systemic Rules of Malay Standard Borrowing from Arabic: Guidelines for Linguists and Translators
    Issues in Language Studies (Vol. 6 No. 2 – 2017) THE SYSTEMIC RULES OF MALAY STANDARD BORROWING FROM ARABIC: GUIDELINES FOR LINGUISTS AND TRANSLATORS IDris MANSOR Section of Translation Studies and Interpreting, School of Humanities Universiti Sains Malaysia [email protected] Manuscript received 29 May 2017 Manuscript accepted 29 September 2017 ABSTRACT Borrowing has been an important process in the development of the Malay language. Malay has a great number of borrowed terms from a variety of languages, such as Sanskrit, Arabic, Persian, Dutch, Hindi, Javanese, Siamese, Tamil, Portuguese, Chinese, Japanese and English. Among these languages, Arabic is one of the main sources of Malay borrowing. This research is a descriptive study of Malay borrowing from Arabic. It aims to produce a model of the systemic rules of standard Malay borrowing from the Arabic language. Data for this research were obtained from Kamus Dewan, the main reference for Malay lexicon. The data, then, were analysed manually based on their trends and patterns. The result of the study shows that there are several trends implemented in transferring Arabic words into Malay. This study aims to become a useful guideline for linguists and translators in borrowing new terms from Arabic. Keywords: Malay language, standard borrowing, Arabic, nativised borrowing, borrowing guidelines IntroDuction Borrowing has been an important process in the development of the Malay language. Malay has a great number of borrowed terms from a variety of languages, such as Sanskrit, Arabic, Persian, Dutch, Hindi, Javanese, Siamese, Tamil, Portuguese, Chinese, Japanese and English (Jones, 2007). After independence in 1957, Malay, as the national language of the country, and the medium of instruction in education, underwent major changes in its development.
    [Show full text]
  • A Study on Reusing Resources of Speech Synthesis for Closely-Related Languages
    A Study on Reusing Resources of Speech Synthesis for Closely-Related Languages by Nur Hana Samsudin A thesis submitted to The University of Birmingham for the degree of Doctor of Philosophy School of Computer Science College of Engineering and Physical Sciences The University of Birmingham October 2017 University of Birmingham Research Archive e-theses repository This unpublished thesis/dissertation is copyright of the author and/or third parties. The intellectual property rights of the author or third parties in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this thesis/dissertation must be in accordance with that legislation and must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the permission of the copyright holder. “It does not matter how slowly you go as long as you do not stop." Confucius Thank you, mom Abstract A Study on Reusing Resources of Speech Synthesis for Closely-Related Languages by Nur Hana Samsudin This thesis describes research on building a text-to-speech (TTS) framework that can accommodate the lack of linguistic information of under-resource languages by using existing resources from another language. It describes the adaptation process required when such limited resource is used. The main natural languages involved in this research are Malay and Iban language. Malay represents a language with sufficient speech re- sources while Iban language represents a language with very limited resources. Overall thesis revolves around the two languages. The thesis includes a study on grapheme to phoneme mapping and the substitution of phonemes.
    [Show full text]
  • Segmental and Suprasegmental Analysis: a Case Study of a Malay Learner's Utterances of an English Song
    Segmental and Suprasegmental Analysis: A Case Study of a Malay Learner’s Utterances of an English Song SRI KANDY PUTRI NARU ABDUL HAMID NARU a, and SHAFIQ HIZWARI MD HASHIM b a,b Centre for International Languages, Universiti Malaysia Perlis, Malaysia a [email protected] b [email protected] Abstract – Pronunciation mistakes are among the common belongs as one of the members of the Western occurrences made by L2 Malay learners of English. Bahasa Group of the Austronesian family, where all the Melayu and English vary in the numbers of vowels and diphthongs, as well as the origin of the consonants (Goay and indigenous languages of Southeast Asia are related. Choo, 2003). Despite having sundry disparities, both are This language is widely spoken among Malay- phonetics languages and use Roman characters in the written speaking countries namely Malaysia, Singapore, form (Wai, Siew and Roziati, 2007). Hence, this paper aims to analyze the differences and similarities of English sound with Thailand, Brunei and the like (Abdullah, 1974; Bahasa Melayu, by focusing on two features; Segmental and Swan and Smith, 2001; Wai, Siew and Roziati, Suprasegmental (Prosodic Rules – Assimilation, Dissimilation, 2007). The varieties of Malay language used in Insertion, Deletion and Linking). The analysis was gathered from a case study, where a Malay subject’s utterances of an English these countries vary according to the assimilation song was recorded and transcribed by using IPA transcription. process that the language has undergone. Such An in-depth analysis was done by comparing the subject’s occurrence has brought along some major written utterances to the original lyric of the song.
    [Show full text]
  • Processing Malaysian Indigenous Languages: a Focus on Phonology and Grammar
    Open Journal of Modern Linguistics, 2014, 4, 728-738 Published Online December 2014 in SciRes. http://www.scirp.org/journal/ojml http://dx.doi.org/10.4236/ojml.2014.45063 Processing Malaysian Indigenous Languages: A Focus on Phonology and Grammar Asmah Haji Omar University of Malaya, Kuala Lumpur, Malaysia Email: [email protected] Received 3 October 2014; revised 16 November 2014; accepted 4 December 2014 Copyright © 2014 by author and Scientific Research Publishing Inc. This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/ Abstract Malaysian indigenous languages are of two entirely different families: Austronesian and Austroa- siatic. The former consists of Malay and all the languages of Sabah and Sarawak, while the latter the aboriginal languages found only in Peninsular Malaysia. Except for Malay and a few more in Sabah and Sarawak, most of these languages have not been put into writing. This means that no writing system has been ascribed to them, despite the fact that quite a number have been de- scribed in terms of phonology, morphology and syntax. From the descriptions available, one gets a picture of their typologies and systems for processing purposes. Concerning typology, there is not much difference between the two families as far as phonemic inventories go, but there are differ- ences in the phonological structures of the syllable and the word. As for morphology, the Austro- nesian languages are agglutinative, while the Austroasiatic ones are isolative. There is also a dif- ference in the syntactical status of the word, where the former has the two categories of the full word and the particle, and the latter only the full word.
    [Show full text]
  • Malaysia a Country Study Foreign Area Studies of MINNESOTA 1 UNIVERSITY the American University DEP03I1URY PU8N
    Malaysia a country study Foreign Area Studies OF MINNESOTA 1 UNIVERSITY The American University DEP03I1URY PU8N. Edited by U.S.-G.P.O.-D-301-A Frederica M. Bunge Research completed AUG 4 1986 January 1984 DOCUMENTS ST. PAUL CAMPUS LIBRARIES Contents Poge Foreword iii Acknowledgments v Preface xi Country Profile xv Introduction xxiii Chapter 1. Historical Setting 1 Donald M . Seekins EARLY HISTORY OF THE MALAY WORLD— Srivijaya to the Fourteenth Century —Borneo —THE MALACCA SULTANATE AND THE EVOLUTION OF MALAY IDENTITY — Malacca and Islam —European Intrusion and the Fall of Malacca —The Malay States to the End of the Eighteenth Century — Brunei and the Northern Borneo Coast— THE BRITISH COLONIAL PRESENCE— Fed erated and Unfederated Malay States, 1826-1909— De velopments on Borneo— COLONIAL ECONOMY AND SOCIETY—The Immigrant Communities —The Indian, Arab, and Eurasian Communities —The Malay Response — WAR, EMERGENCY, AND INDEPENDENCE— The Malayan Union and the Growth of Political Movements — The Emergency —The Alliance and Independence — Sarawak and Sabah— THE CREATION OF MALAYSIA— External Threats: "Confrontation" and the Philippine Claim to Sabah—The Secession of Singapore —The Kuala Lumpur Riots of May 1969 and Their Aftermath — Economic Development and the New Economic Policy Chapter 2. The Society and Its Environment 67 Charles Hirschman ENVIRONMENT— Topography— Climate and Vegeta tion —Natural Resources — Transportation — Regional Di visions— POPULATION GROWTH AND DISTRIBU TION— Mortality, Fertility, and Family Planning — Popu lation
    [Show full text]
  • Chicahuaxtla Triqui Digital Wordlist and Preliminary Observations
    Vol. 6 (2012), pp. 208-236 http://nflrc.hawaii.edu/ldc/ http://hdl.handle.net/10125/4509 Notes from the Field: Chicahuaxtla Triqui Digital Wordlist and Preliminary Observations A. Raymond Elliott The University of Texas at Arlington Fulgencio Sandoval Cruz Triqui language consultant, San Andrés Chicahuaxtla, Oaxaca, Mexico Felipe Santiago Rojas Triqui language consultant, San Andrés Chicahuaxtla, Oaxaca, Mexico. “En la cumbre de una alta montaña, y rodeada por todas partes de mixtecas, se encuentra Chicahuaztla, cabecera de parroquia, en la cual, así como en cuatro pueblos sujetos y en Copala, se habla un idioma extraño, el triqui, llamado así por la repeticion [sic] frecuente de ciertas consonantes (tr, pr, gr, etc.). ¿Los que lo hablan son mixtecas ó tienen otro origen? Se ignora; ni queda memoria siquiera de que antiguamente se hablase el triqui en la mixteca.”1 –José Antonio Gay, Historia de Oaxaca (1881:54) This article presents a 200-item list consisting of words and sample sentences from Chi- cahuaxtla Triqui, an Otomanguean language spoken in San Andrés Chicahuaxtla in the State of Oaxaca, Mexico. The wordlists include broad phonetic transcriptions, English glosses, Spanish cues, individual WAV recordings, and comments. Since the key to un- derstanding Chicahuaxtla Triqui lies in the ability to distinguish tone, the list is divided into two parts: 1) a section consisting of minimal pairs with contrastive phonemic tone and/or lexical items illustrating other interesting phonological characteristics, such as tone, fortis-lenis contrasts, prenasalized velars/pre-voicing, and velar onset nasals; and 2) lexical items that evidence tonal contours but may or may not operate contrastively in the language.
    [Show full text]