Transfer Learning for Singlish Universal Dependencies Parsing and POS Tagging
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
Portuguese Language in Angola: Luso-Creoles' Missing Link? John M
Portuguese language in Angola: luso-creoles' missing link? John M. Lipski {presented at annual meeting of the AATSP, San Diego, August 9, 1995} 0. Introduction Portuguese explorers first reached the Congo Basin in the late 15th century, beginning a linguistic and cultural presence that in some regions was to last for 500 years. In other areas of Africa, Portuguese-based creoles rapidly developed, while for several centuries pidginized Portuguese was a major lingua franca for the Atlantic slave trade, and has been implicated in the formation of many Afro- American creoles. The original Portuguese presence in southwestern Africa was confined to limited missionary activity, and to slave trading in coastal depots, but in the late 19th century, Portugal reentered the Congo-Angola region as a colonial power, committed to establishing permanent European settlements in Africa, and to Europeanizing the native African population. In the intervening centuries, Angola and the Portuguese Congo were the source of thousands of slaves sent to the Americas, whose language and culture profoundly influenced Latin American varieties of Portuguese and Spanish. Despite the key position of the Congo-Angola region for Ibero-American linguistic development, little is known of the continuing use of the Portuguese language by Africans in Congo-Angola during most of the five centuries in question. Only in recent years has some attention been directed to the Portuguese language spoken non-natively but extensively in Angola and Mozambique (Gonçalves 1983). In Angola, the urban second-language varieties of Portuguese, especially as spoken in the squatter communities of Luanda, have been referred to as Musseque Portuguese, a name derived from the KiMbundu term used to designate the shantytowns themselves. -
Universidaddesonora
U N I V E R S I D A D D E S O N O R A División de Humanidades y Bellas Artes Maestría en Lingüística Nominal and Adjectival Predication in Yoreme/Mayo of Sonora and Sinaloa TESIS Que para optar por el grado de Maestra en Lingüística presenta Rosario Melina Rodríguez Villanueva 2012 Universidad de Sonora Repositorio Institucional UNISON Excepto si se seala otra cosa, la licencia del tem se describe como openAccess CONTENTS DEDICATION……………………………………………………………………….. 4 ACKNOWLEDGEMENTS………………………………………………………….. 5 ABBREVIATIONS………………………………………………………………….. 7 INTRODUCTION…………………………………………………………………… 12 CHAPTER 1: The Yoreme/Mayo and their language……………………………….. 16 1.1 Ethnographic and Sociolinguistic Context………………………………………. 16 1.1.1 Geographic Location of the Yoreme/Mayo…………………………………… 16 1.1.2 Social Organization of the Yoreme/Mayo……………………………………... 20 1.1.3 Economy and Working Trades………………………………………………… 21 1.1.4 Religion and Cosmogony……………………………………………………… 22 1.2 The Yoreme/Mayo language…………………………………………………….. 23 1.2.1 Geographical location and genetic affiliation………………………………….. 23 1.2.2 Phonology……………………………………………………………………… 27 1.2.2.1 Consonants………………………………………………………………….. 27 1.2.2.2 Vowels……………………………………………………………………….. 30 1.2.3 Typological Characteristics……………………………………………………. 33 1.2.3.1 Classification………………………………………………………………… 33 1.2.3.2 Marking: head or dependent?........................................................................... 35 1.2.3.3 Word Order…………………………………………………………………. 40 1.2.3.4 Case-Marking………………………………………………………………. 43 1.3 Previous Documentation and Description of Yoreme/Mayo……………………. 48 1 CHAPTER 2: Theoretical Preliminaries…………………………………………….. 52 2.0 Introduction……………………………………………………………………… 52 2.1 Predication: Verbal and Non-verbal 53 2.1.1 Definition………………………………………………………………………. 53 2.1.2 Verbal Predication……………………………………………………………. 56 2.1.3 Non-verbal Predication………………………………………………………… 61 2.2. The Syntax of Non-verbal Predication………………………………………… 71 2.2.1 Nominal Predication…………………………………………………………. -
Capeverdean Words Derived from Portuguese Non-Infinitive Verbal Forms: a Descriptive and Comparative Study Nicolas Quint
Capeverdean words derived from Portuguese non-infinitive verbal forms: a descriptive and comparative study Nicolas Quint To cite this version: Nicolas Quint. Capeverdean words derived from Portuguese non-infinitive verbal forms: a descriptive and comparative study. Papia, 2015, 2 (25), pp.189 - 215. halshs-01482124 HAL Id: halshs-01482124 https://halshs.archives-ouvertes.fr/halshs-01482124 Submitted on 8 Mar 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. PAPIA, São Paulo, 25(2), p. 189-215, Jul/Dez 2015. Capeverdean words derived from Portuguese non-infinitive verbal forms: a descriptive and comparative study Palavras do cabo-verdiano derivadas de formas verbais não-infinitivas: um estudo descritivo e comparativo Nicolas Quint1 llacan, Villejuif, France [email protected] Abstract: This article focuses on Santiaguense Capeverdean words derived from Portuguese non-infinitive verbal forms and among these, more specifically on those elements which were not recently borrowed from modern Portuguese and do not compete with Capeverdean more basilectal items. In section 1, the category of Capeverdean words under scrutiny is defined contrastively with other similar types of words. In section 2, all known members of this category are examined in turn and according to the characteristics of their respective Portuguese sources. -
A Grammar of Gyeli
A Grammar of Gyeli Dissertation zur Erlangung des akademischen Grades doctor philosophiae (Dr. phil.) eingereicht an der Kultur-, Sozial- und Bildungswissenschaftlichen Fakultät der Humboldt-Universität zu Berlin von M.A. Nadine Grimm, geb. Borchardt geboren am 28.01.1982 in Rheda-Wiedenbrück Präsident der Humboldt-Universität zu Berlin Prof. Dr. Jan-Hendrik Olbertz Dekanin der Kultur-, Sozial- und Bildungswissenschaftlichen Fakultät Prof. Dr. Julia von Blumenthal Gutachter: 1. 2. Tag der mündlichen Prüfung: Table of Contents List of Tables xi List of Figures xii Abbreviations xiii Acknowledgments xv 1 Introduction 1 1.1 The Gyeli Language . 1 1.1.1 The Language’s Name . 2 1.1.2 Classification . 4 1.1.3 Language Contact . 9 1.1.4 Dialects . 14 1.1.5 Language Endangerment . 16 1.1.6 Special Features of Gyeli . 18 1.1.7 Previous Literature . 19 1.2 The Gyeli Speakers . 21 1.2.1 Environment . 21 1.2.2 Subsistence and Culture . 23 1.3 Methodology . 26 1.3.1 The Project . 27 1.3.2 The Construction of a Speech Community . 27 1.3.3 Data . 28 1.4 Structure of the Grammar . 30 2 Phonology 32 2.1 Consonants . 33 2.1.1 Phonemic Inventory . 34 i Nadine Grimm A Grammar of Gyeli 2.1.2 Realization Rules . 42 2.1.2.1 Labial Velars . 43 2.1.2.2 Allophones . 44 2.1.2.3 Pre-glottalization of Labial and Alveolar Stops and the Issue of Implosives . 47 2.1.2.4 Voicing and Devoicing of Stops . 51 2.1.3 Consonant Clusters . -
Identification of Zero Copulas in Hungarian Using
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 4802–4810 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC Much Ado About Nothing Identification of Zero Copulas in Hungarian Using an NMT Model Andrea Dömötör1;2, Zijian Gyoz˝ o˝ Yang1, Attila Novák1 1MTA-PPKE Hungarian Language Technology Research Group, Pázmány Péter Catholic University, Faculty of Information Technology and Bionics Práter u. 50/a, 1083 Budapest, Hungary 2Pázmány Péter Catholic University, Faculty of Humanities and Social Sciences Egyetem u. 1, 2087 Piliscsaba, Hungary {surname.firstname}@itk.ppke.hu Abstract The research presented in this paper concerns zero copulas in Hungarian, i.e. the phenomenon that nominal predicates lack an explicit verbal copula in the default present tense 3rd person indicative case. We created a tool based on the state-of-the-art transformer architecture implemented in Marian NMT framework that can identify and mark the location of zero copulas, i.e. the position where an overt copula would appear in the non-default cases. Our primary aim was to support quantitative corpus-based linguistic research by creating a tool that can be used to compile a corpus of significant size containing examples of nominal predicates including the location of the zero copulas. We created the training corpus for our system transforming sentences containing overt copulas into ones containing zero copula labels. However, we first needed to disambiguate occurrences of the massively ambiguous verb van ‘exist/be/have’. We performed this using a rule-base classifier relying on English translations in the English-Hungarian parallel subcorpus of the OpenSubtitles corpus. -
NWAV 46 Booklet-Oct29
1 PROGRAM BOOKLET October 29, 2017 CONTENTS • The venue and the town • The program • Welcome to NWAV 46 • The team and the reviewers • Sponsors and Book Exhibitors • Student Travel Awards https://english.wisc.edu/nwav46/ • Abstracts o Plenaries Workshops o nwav46 o Panels o Posters and oral presentations • Best student paper and poster @nwav46 • NWAV sexual harassment policy • Participant email addresses Look, folks, this is an electronic booklet. This Table of Contents gives you clues for what to search for and we trust that’s all you need. 2 We’ll have buttons with sets of pronouns … and some with a blank space to write in your own set. 3 The venue and the town We’re assuming you’ll navigate using electronic devices, but here’s some basic info. Here’s a good campus map: http://map.wisc.edu/. The conference will be in Union South, in red below, except for Saturday talks, which will be in the Brogden Psychology Building, just across Johnson Street to the northeast on the map. There are a few places to grab a bite or a drink near Union South and the big concentration of places is on and near State Street, a pedestrian zone that runs east from Memorial Library (top right). 4 The program 5 NWAV 46 2017 Madison, WI Thursday, November 2nd, 2017 12:00 Registration – 5th Quarter Room, Union South pm-6:00 pm Industry Landmark Northwoods Agriculture 1:00- Progress in regression: Discourse analysis for Sociolinguistics and Texts as data 3:00 Statistical and practical variationists forensic speech sources for improvements to Rbrul science: Knowledge- -
Hyman Lusoga Noun Phrase Tonology PLAR
UC Berkeley Phonetics and Phonology Lab Annual Report (2017) Lusoga Noun Phrase Tonology Larry M. Hyman Department of Linguistics University of California, Berkeley 1. Introduction Bantu tone systems have long been known for their syntagmatic properties, including the ability of a tone to assimilate or shift over long distances. Most systems have a surface binary contrast between H(igh) and L(ow) tone, some also a downstepped H which produces a contrast between H-H and H-ꜜH.1 Given their considerable complexity, much research has focused on the tonal alternations that are produced both lexically and post-lexically. Lusoga, the language under examination in this study, is no exception. Although the closest relative to Luganda, whose tone system has been widely studied (see references in Hyman & Katamba 2010), the only two discussions of Lusoga tonology that I am aware of are Yukawa (2000) and van der Wal (2004:20-30), who outline the surface tone patterns of words in isolation, including certain verb tenses, and illustrate some of the alternations. The latter also points out certain resemblances with Luganda: “Two similarities between Luganda and Lusoga are the clear restriction against LH syllables and the maximum of one H to L pitch drop per word” (van der Wal 2004:29). In this paper I extend the tonal description, with particular attention on the relation between underlying and surface tonal representations. As I have pointed out in a number of studies, two-height tone systems are subject to more than one interpretation: First, the contrast may be either equipolent, H vs. -
Predicative Clauses in Today Persian - a Typological Analysis
Asian Social Science; Vol. 11, No. 15; 2015 ISSN 1911-2017 E-ISSN 1911-2025 Published by Canadian Center of Science and Education Predicative Clauses in Today Persian - A Typological Analysis Pooneh Mostafavi1 1 Research Institute of Cultural Heritage and Tourism of Islamic republic of Iran, Tehran, Iran Correspondence: Pooneh Mostafavi, Assistant Professor in General Linguistics, Research Center of linguistics, Inscriptions, and Texts, Research Institute to Iranian Cultural Heritage and Tourism of Islamic republic of Iran, Tehran, Iran. Tel: 98-21-66-736-5867. E-mail: [email protected] Received: December 24, 2014 Accepted: February 23, 2015 Online Published: May 15, 2015 doi:10.5539/ass.v11n15p104 URL: http://dx.doi.org/10.5539/ass.v11n15p104 Abstract Predicative clauses in world languages encode differently. There are typological researches on this subject in other languages but none of them is on Persian. Stassen in his typological studies shows that the world languages have yielded two strategies for encoding predicative clauses. Copulative and locative/existential constructions are considered as predicative clauses. The strategies which Stassen claims are ‘share’ and ‘split’ strategies. According to Stassen some languages employ one of them and some employ both strategies. He shows in his studies, the language type can be determined by examining the said strategies in them. Using normal data, the present study examines two strategies for today Persian in order to determine its language type. The findings of the present study show that Persian could be categorized in both language types as both strategies are employed by it. Keywords: predicative clauses, copulative construction, locative/existential construction, share strategy, split strategy 1. -
Vowel and Consonant Lessening: a Study of Articulating Reductions
Title: Vowel and Consonant Lessening: A Study of Articulating Reductions and Their Relations to Genders Author name(s): Grace Hui Chin Lin & Paul Shih Chieh Chien General Education Center, China Medical University (Lin) English Department, Changhwa University of Education General Education Center, Taipei Medical University (Chien) Academic Writing Education Center, National Taiwan University Page Number: p. 70-81 Publication date: 2011, Dec. 20 Conference information for conference papers (name, date and location of the conference) : Name : International Conference on Computer Assisted Language Learning in 2011, National Yunlin University of Science and Technology Date: Nov. 5, 2011, Location: Yunlin, Taiwan 69 Vowel and Consonant Lessening: A Study of Articulating Reductions and Their Relations to Genders Grace Hui Chin Lin Paul Shih Chieh Chien ABSTRACT Using English as a global communicating tool makes Taiwanese people have to speak in English in diverse international situations. However, consonants and vowels in English are not all effortless for them to articulate. This phonological reduction study explores concepts about phonological (articulating system) approximation. From Taiwanese folks’ perspectives, it analyzes phonological type, rate, and their associations with 2 genders. This quantitative research discovers Taiwanese people’s vocalization problems and their facilitating solutions by articulating lessening. In other words, this study explains how English emerging as a global language can be adapted and fluently articulated by Taiwanese. It was conducted at National Changhwa University of Education from 2010 fall to 2011 spring, investigating Taiwanese university students’ phonological lessening systems. It reveals how they face the phonetics challenges during interactions and give speeches by ways of phonological lessening. Taiwanese folks’ lessening patterns belong to simplified pronouncing methods, being evolved through Mandarin, Hakka, and Holo phonetic patterns. -
'Speaking Singlish' Comic Strips
International Journal of Innovation, Creativity and Change. www.ijicc.net Volume 12, Issue 12, 2020 The Use of Colloquial Singaporean English in ‘Speaking Singlish’ Comic Strips: A Syntactic Analysis Delianaa*, Felicia Oscarb, a,bEnglish department, Faculty of Cultural Studies, Universitas Sumatera Utara, Email: a*[email protected] This study explores the sentence structure of Colloquial Singaporean English (CSE) and how it differs from Standard English (SE). A descriptive qualitative method is employed as the research design. The data source is the dialogue of five comic strips which are purposively chosen from Speaking Singlish comic strips. Data is in the form of sentences totalling 34 declaratives, 20 wh- interrogatives, 14 yes -no interrogatives, 3 imperatives and 1 exclamative. The results present the sentence structure of CSE found in the data generally constructed by one subject, one predicate, and occasionally one discourse element. The subject is a noun phrase while the predicate varies amongst noun, adjective, adverb, and verb phrases– particularly in copula deletion. On the other hand, there are several differences between the sentence structure of CSE and SE in the data including the use of copula, topic sentence, discourse elements, adverbs, unmarked plural noun and past tense. Key words: Colloquial Singaporean English (CSE), Speaking Singlish, sentence structure. Introduction Standard English (SE) is a variety of the English language. This view is perhaps more acceptable in the case of Non-Standard English (NSE). The classification of SE being a dialect goes against the lay understanding that a dialect is a subset of a language, usually with a geographical restriction regarding its distribution (Kerswill, 2006). -
Does Singlish Contribute to Singaporean's National Identity
International Journal of Academic Research in Progressive Education and Development Vol. 9 , No. 2, 2020, E-ISSN: 2226-6348 © 2020 HRMARS Does Singlish Contribute to Singaporean’s National Identity, and do Singaporeans Support Formal Recognition of it? Wang Shih Ching To Link this Article: http://dx.doi.org/10.6007/IJARPED/v9-i2/7244 DOI:10.6007/IJARPED/v9-i2/7244 Received: 15 Jan 2020, Revised: 15 Feb 2020, Accepted: 10 Mar 2020 Published Online: 23 Mar 2020 In-Text Citation: (Wang, 2020) To Cite this Article: Wang, S. C. (2020). Does Singlish Contribute to Singaporean’s National Identity, and do Singaporeans Support Formal Recognition of it? International Journal of Academic Research in Progressive Education and Development, 9(2), 96–112. Copyright: © 2020 The Author(s) Published by Human Resource Management Academic Research Society (www.hrmars.com) This article is published under the Creative Commons Attribution (CC BY 4.0) license. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this license may be seen at: http://creativecommons.org/licences/by/4.0/legalcode Vol. 9(2) 2020, Pg. 96 - 112 http://hrmars.com/index.php/pages/detail/IJARPED JOURNAL HOMEPAGE Full Terms & Conditions of access and use can be found at http://hrmars.com/index.php/pages/detail/publication-ethics 96 International Journal of Academic Research in Progressive Education and Development Vol. 9 , No. 2, 2020, E-ISSN: 2226-6348 © 2020 HRMARS Does Singlish Contribute to Singaporean’s National Identity, and do Singaporeans Support Formal Recognition of it? Wang Shih Ching Research Scholar, Faculty of Social Science, Arts and Humanities, Lincoln University College, Malaysia Abstract ‘Singlish’ is a colloquial form of English that was influenced by other languages used in Singapore, such as Chinese Mandarin, Malay and Tamil. -
Singlish to English
TO English Singlish In a country like Singapore, where a non-native language is adopted as a native language, the style, vocabulary, grammar, pronunciation, etc. will have its own flavour and even its own unique code of usage, like a grammar. This chapter will focus on specific problems students have when studying English in Singapore. A comparison will be made between standard English and Singlish in terms of structure and form. It should be noted that Singlish and Singapore English are two very different things. A person who speaks Singapore English uses standard grammar and vocabulary but pronounces words the way most Singaporean English speakers do. Singlish, on the other hand, includes the same kind of pronunciation, although Singlish speakers may have a stronger accent in that some sounds are changed or dropped entirely. 1. CAN and the omission of the subject. In Chinese, the verb “can” has different meanings. One means ability (I can speak English) and the other meaning possibility (We can go now) or permission (We can’t smoke here), etc. The biggest difference in these expressions between Chinese and English (and Malay too) is that in Chinese and Malay, “can” is sometimes used without the subject, as it is not necessary when the meaning is understood. Also, in Chinese, a simple question marker “mah” is used to make questions (in Malay, it is necessary only to change intonation for a question). This “mah” cannot be translated into a word in English because we change word order and intonation to make questions. The result is often sentences like these: Can go? This one can? Can? Cannot.