Codeswitching in Communication: A Sociolinguistic Study of Malaysian Secondary School Students

ISSN: 0128-7702
Pertanika J. Soc. Sci. & Hum. 18 (2): 407 - 415 (2010)
© Universiti Putra Malaysia Press

Paramasivam Muthusamy
Department of Foreign Languages, Faculty of Modern Languages and Communication, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
*E-mail: [email protected]

ABSTRACT
Codeswitching is a common phenomenon in a multilingual society like Malaysia. Furthermore, the language policy of the Malaysian government has made it mandatory for students to learn both bahasa Melayu and English from primary school level. Indian and Chinese students also acquire their respective mother tongues, namely Tamil and Mandarin, besides the two compulsory languages (bahasa Melayu and English). This study was conducted at four secondary schools in the Klang Valley: one in an urban setting, two in suburban settings, and one in a rural area. Twenty participants were selected, of whom twelve were Indian students, four Malay, and four Chinese. The participants were given two topics for discussion, and their conversations were recorded and transcribed. In addition, a questionnaire was administered to determine the respondents' demographic details and their language choice at home. The findings indicated that codeswitching from bahasa Melayu to English, or from Tamil or Chinese to English, or vice versa, during conversations is largely habitual in nature. Respondents from the average economic and educational category were found to use both English and their mother tongue as their matrix language. However, respondents from the lower economic and educational category used their mother tongue as the dominant language or matrix language.
Similarly, when members of the same ethnic group converse, their mother tongue becomes the dominant language, with English and bahasa Melayu as the embedded languages.

Keywords: Codeswitching, matrix language, embedded language, multilingual society

Received: 31 December 2009 Accepted: 26 April 2010

INTRODUCTION
Malaysia is a multilingual nation comprising Malays and natives (62%), Chinese (27%), Indians (8%), and other races (the remaining 3%). This mix of ethnicities contributes to the variation in language use and cultural practices. Due to various factors, such as the numerical strength of the dominant Malay community, their language, political support for the Malay language, culture and, above all, the administrative and educational dominance of the Malay language in every respect, the Malay language seems to have significant control over the weaker races, which include all the minority communities living in Malaysia (Asmah, 1992).

Apart from the above socio-political reasons, another factor which contributes to switching codes between Malay and English by Tamil speakers is the phenomenon of globalization. Globalization is the result of the advancement of science and technology at the world level. In other words, the unequivocal advancement in technology is often initiated by the developed nations. Meanwhile, technical advancement and up-to-date knowledge of technology automatically contribute to economic power and bring in job opportunities. This cause-and-effect relationship often drives developing or under-developed nations to obtain technical knowledge as early as possible for the reasons mentioned above. Needless to say, the process of globalization is always initiated by the developed nations. Another hard truth is that in the modern world, English is the language through which one can gain easy access to the needed technological knowledge and meet the process of globalization with ease (Jacobson, 2004). Consequently, most countries in the world are interested in incorporating English in their educational curricula, and Malaysia is no exception. Another sociolinguistic behaviour found among language users is that everybody, irrespective of socio-economic and educational status, wants to be familiar with English. This linguistic behaviour of the interlocutors has made English a prestigious language of the world, including in Malaysia.

Language use in the Malaysian situation needs to be viewed with reference to the general language policy of the Malaysian government. Under this policy, it is mandatory for Malaysian students to learn both bahasa Melayu and English starting from primary school level. Furthermore, Indian and Chinese students also acquire their respective mother tongues, namely Tamil and Mandarin, besides the two compulsory languages (bahasa Melayu and English). This enables multilingual students to codeswitch with ease and confidence during communication (Paramasivam, 2006).

In general, codeswitching (CS) can be defined as switching from one language code to another during a single communicative event. It also comprises the alternation between two or more languages or dialects in the middle of a discourse between people who have more than one language in common. Sometimes the switch takes place after a few sentences, and at other times after a single phrase. Those who codeswitch may not even be aware of their behaviour and, when asked, will deny that they resorted to such a practice in their speech (Jacobson, 2001).

With this background, if we look into the phenomenon of CS among the students of Malaysia, three types of CS patterns have been identified. First, there are Malaysian students who use bahasa Melayu as their dominant language with embedded English words in their discourse. Second, there are interlocutors who use English as their dominant language with embedded bahasa Melayu words in their speech. Finally, there are students who embed English or bahasa Melayu in their mother tongue (L1), such as Tamil or Mandarin, in their discourse. These communicative patterns are observed in most of the situations examined in the present study.

REVIEW OF RELATED LITERATURE
In discussing CS in different sociolinguistic and multiethnic situations, various researchers have defined the phenomenon with reference to the particular situation in which their study was undertaken. For instance, Zaitul Azma Zainon Hamzah (2006) suggests that pragmatic research focuses on the relationships that exist between language structures and the mannerisms of the interlocutors. For example, in an informal conversation among a group of multiracial students, the conversation depends on the context and the outcome, as well as on how the listeners process the information or message conveyed by the speaker. Here, the use of multiple languages or codeswitching is accepted because the speech has distinct language structures that are clear and concise.

According to Jacobson (2004), there is a common understanding that CS is regarded as bahasa rojak or bahasa pasar, i.e. a substandard language. This connotative meaning refers to language that is impure, unsystematic, and has elements of foreign languages in bahasa Melayu. Awang Sariyan (1996) further supports this view of CS as a substandard language, saying that CS is not new but has existed as a form of pidgin language in the history of human languages. In the Malaysian context, the Baba Melaka community's spoken language is considered to be a variation of the Malay language. This is further supported by the study of Chng Lee Swee Li (1995).

Asmah Hj. Omar (2007) claims that CS among Malaysians exists at all social levels and among all races. English is usually used along with bahasa Melayu during CS. She further states that CS in English occurs frequently among Malay speakers in formal situations, such as meetings, talks, speeches, and official interviews. In informal situations, CS involving bahasa Melayu, English, and other languages such as Tamil, Cantonese, and Malay dialects is frequent among the Malays in general, and the choice of language varies according to the sociolinguistic situation (Asmah, 1992). Similarly, Nik Safiah Karim (1992) states that CS resembles language in transition, where the society uses more than one language to communicate. However, the interlocutors are not proficient in any of the languages that are used in their speech.

Furthermore, CS is seen as a natural language development process in usage, where the speaker has a repertoire to effectively manipulate two or more languages in any given speech event.

[...] or informal settings and at all levels of the language, i.e. at the phonological, semantic and lexical levels. As is evident from the works of the different scholars discussed above, the phenomenon of CS is inevitable in every sociolinguistic situation in a multilingual country. However, the patterns of CS differ according to the communication situation. In other words, all the CS patterns are unique in their own ways and are attested in particular situations.

AIM OF THE STUDY
The present study was designed to uncover the intricacies of CS among the students in the selected secondary schools in the Klang Valley, with the following aims:
1. To determine the patterns of codeswitching among secondary school students.
2. To investigate how linguistic patterns of codeswitching are structured.
3. To determine the students' dominance in [...]