Building Online Corpora of Philippine Languages*
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Computational Linguistics Research on Philippine Languages
Computational Linguistics Research on Philippine Languages Rachel Edita O. ROXAS Allan BORRA Software Technology Department Software Technology Department De La Salle University De La Salle University 2401 Taft Avenue, Manila, Philippines 2401 Taft Avenue, Manila, Philippines [email protected] [email protected] languages spoken in Southern Philippines. But Abstract as of yet, extensive research has already been done on theoretical linguistics and little is This is a paper that describes computational known for computational linguistics. In fact, the linguistic activities on Philippines computational linguistics researches on languages. The Philippines is an archipelago Philippine languages are mainly focused on 1 with vast numbers of islands and numerous Tagalog. There are also notable work done on languages. The tasks of understanding, Ilocano. representing and implementing these Kroeger (1993) showed the importance of the languages require enormous work. An grammatical relations in Tagalog, such as extensive amount of work has been done on subject and object relations, and the understanding at least some of the major insufficiency of a surface phrase structure Philippine languages, but little has been paradigm to represent these relations. This issue done on the computational aspect. Majority was further discussed in the LFG98, which is on of the latter has been on the purpose of the problem of voice and grammatical functions machine translation. in Western Austronesian Languages. Musgrave (1998) introduced the problem certain verbs in 1 Philippine Languages these languages that can head more than one transitive clause type. Foley (1998) and Kroeger Within the 7,200 islands of the Philippine (1998), in particular, discussed about long archipelago, there are about one hundred and debated issues such as nouns in Tagalog that can one (101) languages that are spoken. -
Resources for Philippine Languages: Collection, Annotation, and Modeling
PACLIC 30 Proceedings Resources for Philippine Languages: Collection, Annotation, and Modeling Nathaniel Ocoa, Leif Romeritch Syliongkaa, Tod Allmanb, Rachel Edita Roxasa aNational University 551 M.F. Jhocson St., Sampaloc, Manila, PH 1008 bGraduate Institute of Applied Linguistics 7500 W. Camp Wisdom Rd., Dallas, TX 75236 {nathanoco,lairusi,todallman,rachel_roxas2001}@yahoo.com The paper’s structure is as follows: section 2 Abstract discusses initiatives in the country and the various language resources we collected; section In this paper, we present our collective 3 discusses annotation and documentation effort to gather, annotate, and model efforts; section 4 discusses language modeling; various language resources for use in and we conclude our work in section 5. different research projects. This includes those that are available online such as 2 Collection tweets, Wikipedia articles, game chat, online radio, and religious text. The Research works in language studies in the different applications, issues and Philippines – particularly in language directions are also discussed in the paper. documentation and in corpus building – often Future works include developing a involve one or a combination of the following: language web service. A subset of the “(1) residing in the place where the language is resources will be made temporarily spoken, (2) working with a native speaker, or (3) available online at: using printed or published material” (Dita and http://bit.ly/1MpcFoT. Roxas, 2011). Among these, working with resources available is the most feasible option given ordinary circumstances. Following this 1 Introduction consideration, the Philippines as a developing country is making its way towards a digital age, The Philippines is a country in Southeast Asia which highlights – as Jenkins (1998) would put it composed of 7,107 islands and 187 listed – a “technological culture of computers”. -
'Undergoer Voice in Borneo: Penan, Punan, Kenyah and Kayan
Undergoer Voice in Borneo Penan, Punan, Kenyah and Kayan languages Antonia SORIENTE University of Naples “L’Orientale” Max Planck Institute for Evolutionary Anthropology-Jakarta This paper describes the morphosyntactic characteristics of a few languages in Borneo, which belong to the North Borneo phylum. It is a typological sketch of how these languages express undergoer voice. It is based on data from Penan Benalui, Punan Tubu’, Punan Malinau in East Kalimantan Province, and from two Kenyah languages as well as secondary source data from Kayanic languages in East Kalimantan and in Sawarak (Malaysia). Another aim of this paper is to explore how the morphosyntactic features of North Borneo languages might shed light on the linguistic subgrouping of Borneo’s heterogeneous hunter-gatherer groups, broadly referred to as ‘Penan’ in Sarawak and ‘Punan’ in Kalimantan. 1. The North Borneo languages The island of Borneo is home to a great variety of languages and language groups. One of the main groups is the North Borneo phylum that is part of a still larger Greater North Borneo (GNB) subgroup (Blust 2010) that includes all languages of Borneo except the Barito languages of southeast Kalimantan (and Malagasy) (see Table 1). According to Blust (2010), this subgroup includes, in addition to Bornean languages, various languages outside Borneo, namely, Malayo-Chamic, Moken, Rejang, and Sundanese. The languages of this study belong to different subgroups within the North Borneo phylum. They include the North Sarawakan subgroup with (1) languages that are spoken by hunter-gatherers (Penan Benalui (a Western Penan dialect), Punan Tubu’, and Punan Malinau), and (2) languages that are spoken by agriculturalists, that is Òma Lóngh and Lebu’ Kulit Kenyah (belonging respectively to the Upper Pujungan and Wahau Kenyah subgroups in Ethnologue 2009) as well as the Kayan languages Uma’ Pu (Baram Kayan), Busang, Hwang Tring and Long Gleaat (Kayan Bahau). -
Revisiting the Position of Philippine Languages in the Austronesian Family
The Br Andrew Gonzalez FSC (BAG) Distinguished Professorial Chair Lecture, 2017 De La Salle University Revisiting the Position of Philippine Languages in the Austronesian Family Lawrence A. Reid University of Hawai`i National Museum of the Philippines Abstract With recent claims from non-linguists that there is no such thing as an Austronesian language family, and that Philippine languages could have a different origin from one that all comparative linguists claim, it is appropriate to revisit the claims that have been made over the last few hundred years. Each has been popular in its day, and each has been based on evidence that under scrutiny has been shown to have problems, leading to new claims. This presentation will examine the range of views from early Spanish ideas about the relationship of Philippine languages, to modern Bayesian phylogenetic views, outlining the data upon which the claims have been made and pointing out the problems that each has. 1. Introduction Sometime in 1915 (or early 1916) (UP 1916), when Otto Scheerer was an assistant professor of German at the University of the Philippines, he gave a lecture to students in which he outlined three positions that had been held in the Philippines since the early 1600’s about the internal and external relations of Philippine languages. He wrote the following: 1. As early as 1604, the principal Philippine languages were recognized as constituting a linguistic unit. 2. Since an equally early time the belief was sustained that these languages were born of the Malay language as spoken on the Peninsula of Malacca. -
Tagalog and Philippine Languages.Qxd
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by ScholarSpace at University of Hawai'i at Manoa Tagalog and Philippine Languages Philippine Languages national language, now known as Filipino. Furthermore, it has become the main language of Over 150 languages are spoken by the more than movies and comics, and much of the Philippine mass 76,500,000 Filipinos who live in an archipelago of media. It is required to be taught in all the schools in around 7,000 islands that stretches over 1,500 kilome- the Philippines, and is rapidly becoming the main sec- ters from north to south, and about 800 kilometers ond language that people speak throughout the coun- from the most western point of Palawan to the most try. Sebuano, Ilokano, and Hiligaynon are widely easterly point of Mindanao. Most of the languages are spoken as regional trade languages. Ilokano is the dialectally diverse, with a number constituting exten- main language of trade and wider communication spo- sive dialect chains. ken throughout northern Luzon. It is also spoken in All Philippine languages belong to the Western some areas of southern Mindanao and is the main Malayo-Polynesian group of the Austronesian lan- Philippine language spoken in the United States and guage family. The archeological record suggests that other countries to which Filipinos have migrated. the earliest Austronesian speakers arrived in the north- Sebuano is used not only in the Visayan area of the ern Philippines, probably from what is now called Central Philippines, but also in much of southern Taiwan about 5,500 years ago, at the beginning of the Mindanao. -
563290Pub0box31guage0of0in
Mother tongue as bridge language of instruction: Public Disclosure Authorized k policies and experiences in :: ~\ Southeast Asia "w-.,.-, '. .... Public Disclosure Authorized Public Disclosure Authorized Public Disclosure Authorized Southeast Asian THE Ministers of Education WORLD 1!:X Organization E BANK ~ FastTrack Initiative Edited by Kimmo Kosonen and Catherine Young Foreword Dato’ Dr Ahamad bin Sipon, Director, SEAMEO Secretariat Chapter 1 Introduction 8 Kimmo Kosonen and Catherine Young Chapter 2 Language-in-education policies in 22 Southeast Asia: an overview Kimmo Kosonen Chapter 3 Various policies in Southeast Asian 44 countries Introduction 44 The evolution of language-in-education policies 49 in Brunei Darussalam Gary Jones Education policies for ethnic minorities in 62 Cambodia Neou Sun Regional and local languages as oral languages 69 of instruction in Indonesia Maryanto Policies, developments, and challenges in mother 76 tongue education in Malaysian public schools Ramanathan Nagarathinam Language-in-education policies and their 84 implementation in Philippine public schools Yolanda S Quijano and Ofelia H Eustaquio Language and language-in-education policies 93 and their implementation in Singapore Elizabeth S Pang Language policy and practice in public 102 schools in Thailand Busaba Prapasapong Language-in-education policies in Vietnam 109 Bui Thi Ngoc Diep and Bui Van Thanh Chapter 4 Good practices in mother tongue-first 120 multilingual education Catherine Young Chapter 5 Case studies from different countries 136 Introduction -
The Austronesian Languages of Asia and Madagascar Colonial History and Language Policy in Insular Southeast Asia and Madagascar
This article was downloaded by: 10.3.98.104 On: 30 Sep 2021 Access details: subscription number Publisher: Routledge Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: 5 Howick Place, London SW1P 1WG, UK The Austronesian Languages of Asia and Madagascar Alexander Adelaar, Nikolaus P. Himmelmann Colonial History and Language Policy in Insular Southeast Asia and Madagascar Publication details https://www.routledgehandbooks.com/doi/10.4324/9780203821121.ch3 Hein Steinhauer Published online on: 25 Nov 2004 How to cite :- Hein Steinhauer. 25 Nov 2004, Colonial History and Language Policy in Insular Southeast Asia and Madagascar from: The Austronesian Languages of Asia and Madagascar Routledge Accessed on: 30 Sep 2021 https://www.routledgehandbooks.com/doi/10.4324/9780203821121.ch3 PLEASE SCROLL DOWN FOR DOCUMENT Full terms and conditions of use: https://www.routledgehandbooks.com/legal-notices/terms This Document PDF may be used for research, teaching and private study purposes. Any substantial or systematic reproductions, re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The publisher shall not be liable for an loss, actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material. CHAPTER THREE COLONIAL HISTORY AND LANGUAGE POLICY IN INSULAR SOUTHEAST ASIA AND MADAGASCAR Hein Steinhauer 1 INTRODUCTION The present chapter discusses background and practice of language policy in those countries in which a majority of the population has a western Austronesian language as its mother tongue, i.e. -
Typological Overview of the Languages of Central and Southern Philippines
DRAFT (comments welcome) Typological overview of the languages of central and southern Philippines Daniel Kaufman 1.0 Introduction This chapter provides a typological overview of the languages of the Central and Southern Philippines (henceforth, CSP languages). Despite not forming a discrete phylogenetic group, the CSP languages share certain morphosyntactic retentions from Malayo-Polynesian (henceforth MP) which make them a useful unit for typological generalizations. Like other Philippine languages, almost all the CSP languages maintain the full MP voice system. However, unlike many languages of the northern Philippines, many CSP languages still maintain a distinction between the independent and dependent paradigms of the voice system (Wolff 1973). They may also reflect the Proto-Malayo-Polynesian (PMP) voice system more directly in disallowing innovative combinations of voice morphology commonly found in the north (e.g. the combination of actor voice and conveyance voice morphology). In one case, at least, there is a syntactic retention from Proto-Austronesian (PAn) that is only found in tiny handful of MP languages, discussed in §4.5.4. On the other hand, the voice system mutates in interesting ways in the southernmost CSP languages, specifically, in the Bilic and Sama languages. The larger question of a Philippine subgroup has been recently addressed by Blust (2019), who argues for its existence on the basis of a large set of proposed lexical innovations. Others have argued that the Philippine languages are best understood as a collection of primary subgroups of MP, rather than the daughters of a single MP proto-language. Phonologically, the only potential innovation that would characterize Proto-Philippines is the merger of PMP *z and *d, which is so common outside Philippine languages that it is of little probative value. -
Jason Lobel's Dissertation
PHILIPPINE AND NORTH BORNEAN LANGUAGES: ISSUES IN DESCRIPTION, SUBGROUPING, AND RECONSTRUCTION A DISSERTATION SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI‘I AT MĀNOA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY IN LINGUISTICS MAY 2013 BY JASON WILLIAM LOBEL Dissertation Committee: Robert A. Blust, Chairperson Michael L. Forman Kenneth L. Rehg R. David Zorc Ruth Elynia S. Mabanglo © Copyright 2013 by Jason William Lobel IMPORTANT NOTE: Permission is granted to the native speakers of the languages represented herein to reproduce this dissertation, or any part thereof, for the purpose of protecting, promoting, developing, or preserving their native languages, cultures, and tribal integrity, as long as proper credit is given to the author of this work. No librarian or other holder of a copy of this dissertation in any country shall have the right to require any additional proof of permission from this author in order to photocopy or print this dissertation, or any part thereof, for any native speaker of any language represented herein. ii We certify that we have read this dissertation and that, in our opinion, it is satisfactory in scope and quality as a dissertation for the degree of Doctor of Philosophy in Linguistics. ____________________________________ Chairperson ____________________________________ ____________________________________ ____________________________________ ____________________________________ iii iv ABSTRACT The Philippines, northern Sulawesi, and northern Borneo are home to two or three hundred languages that can be described as Philippine-type. In spite of nearly five hundred years of language documentation in the Philippines, and at least a century of work in Borneo and Sulawesi, the majority of these languages remain grossly underdocumented, and an alarming number of languages remain almost completely undocumented. -
Exploring Lexical Rapport of the Bisayan and Tausug Languages
International Journal of Linguistics, Literature and Translation (IJLLT) ISSN: 2617-0299 (Online); ISSN: 2708-0099 (Print) DOI: 10.32996/ijllt Journal Homepage: www.al-kindipublisher.com/index.php/ijllt Traces and Roots: Exploring Lexical Rapport of the Bisayan and Tausug Languages Sharifa Khalid Masorong English Department, College of Social Sciences and Humanities, Mindanao State University, Marawi City, Philippines Corresponding Author: Sharifa Khalid Masorong, E-mail: [email protected] ARTICLE INFORMATION ABSTRACT Received: December 22, 2020 Languages in a country are considered to share common characteristics and possibly Accepted: February 17, 2021 similar features. This can be attributed to many factors, but one holds true, language Volume: 4 is essential in keeping a community. Its vitality molds a society’s identity. The more Issue: 2 active the language is, the more its culture gets the exposure it needs. This notion DOI: 10.32996/ijllt.2021.4.2.15 goes two ways for culture cannot also stand without the usage of a language. In this study, Bisayan and Tausug lexicons were compared using a Samarin list to identify KEYWORDS factors of similarities as well as their relationship in terms of their semantic and ontological categories. The list was subjected to a qualitative-descriptive analysis. The Language, culture, Malayo result of the study revealed that both Tausug and Bisayan (Cebuano) cultures have Polynesian Languages, come from the same linguistic family. Both belong to the Philippine VISAYAN Austronesian Languages, Cultural LANGUAGES. Of all the Visayan cultures, the TAUSUG is a Muslim dominated culture Identity, Translation, Asian which makes it the reason why people associate Tausug language as part of the Languages Mindanao language. -
A Grammar of West Coast Bajau
A GRAMMAR OF WEST COAST BAJAU by MARK T. MILLER Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial Fulfillment of the Requirements for the Degree of DOCTOR OF PHILOSOPHY THE UNIVERSITY OF TEXAS AT ARLINGTON August 2007 Copyright © by Mark T. Miller 2007 All Rights Reserved This dissertation is dedicated to the memory of Edith Mirafuentes, whose loving service to the West Coast Bajau people has always inspired me. ACKNOWLEDGEMENTS I would like to thank the Department of Linguistics and TESOL and the Graduate School at the University of Texas at Arlington for awarding me the scholarships and travel funds that made possible my research abroad and writing at home. I would also like to thank the Sabah State Museum for its role in sponsoring my work. I am greatly thankful and indebted to my supervisor for this project, Paul Kroeger, who generously gave of his time and expertise to work patiently with me through the successive drafts of this grammar. I have much respect for his scholarly gifts as well as his willingness to serve others with those gifts. I also acknowledge, with gratitude for their time and valuable input, the other members of my doctoral committee: Laurel Smith Stvan, Don Burquest, and Tom Headland. My thanks also to Shin Ja Hwang for her comments on portions of my work; and to Robert D. Van Valin, Jr., who made himself available to look over my data and answer questions, whether in person (at workshops) or via email. I want to express my deep appreciation to the West Coast Bajau people of Menunggui village in Kota Belud, and especially to Jumel Hj. -
Dupaningan Agta
DUPANINGAN AGTA: GRAMMAR, VOCABULARY, AND TEXTS A DISSERTATION SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI‘I IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY IN LINGUISTICS MAY 2008 By Laura C. Robinson Dissertation Committee: Robert A. Blust, Chairperson Michael L. Forman Yuko Otsuka Kenneth L. Rehg Nicholas A. Thieberger Aurelio S. Agcaoili © Copyright 2008 by Laura C. Robinson iii Dedicated to Ronald “Ukong” Agcaoili, for sharing his language with me iv ACKNOWLEDGMENTS First and foremost, I would like to thank all the Dupaningan Agta people who shared their language with me, without whom this work would not have been possible. I am especially grateful to Mr. Ronald “Ukong” Agcaoili, who shared not only his language, but also his friendship. Thank you. My gratitude also extends to Manet Agcaoili, Gilliam “Ballek” Cabaldo, Idring “Garwet” Malingling, Lebi and Amboy Agcaoili, Nardeng “Burog” Doniego, Michael and Anit Andres, Maribel and Luby Agcaoili, Balsig Baclig, and all the Agta of Santa Ana. My sincere gratitude also goes out to those who have helped me in my academic career. I wish to thank Robert Blust, for his support and encouragement over the years; Ken Rehg, for sharing his knowledge of phonology and language documentation; Yuko Otsuka, for asking difficult questions; Mike Forman, for asking interesting questions; Nick Thieberger, for his friendship; and Aurelio Agcaoili, for his willingness to jump on board at the last minute. I also wish to express my gratitude to Bion Griffin, for sharing his experiences with the Dupaningan Agta, and to Lawrence Reid, for leading me to discover the world of Philippine Negrito languages.