Dialectology
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Adapting MARBERT for Improved Arabic Dialect Identification
Adapting MARBERT for Improved Arabic Dialect Identification: Submission to the NADI 2021 Shared Task Badr AlKhamissi∗ Mohamed Gabr∗ Independent Microsoft EGDC badr [at] khamissi.com mohamed.gabr [at] microsoft.com Muhammed ElNokrashy Khaled Essam Microsoft EGDC Mendel.ai muelnokr [at] microsoft.com khaled.essam [at] mendel.ai Abstract syntax, morphology, vocabulary, and even orthog- raphy. Dialects may be heavily influenced by pre- In this paper, we tackle the Nuanced Ara- viously dominant local languages. For example, bic Dialect Identification (NADI) shared task Egyptian variants are influenced by the Coptic lan- (Abdul-Mageed et al., 2021) and demonstrate guage, while Sudanese variants are influenced by state-of-the-art results on all of its four sub- the Nubian language. tasks. Tasks are to identify the geographic ori- gin of short Dialectal (DA) and Modern Stan- In this paper, we study the classification of such dard Arabic (MSA) utterances at the levels of variants and describe our model that achieves state- both country and province. Our final model is of-the-art results on all of the four Nuanced Ara- an ensemble of variants built on top of MAR- bic Dialect Identification (NADI) subtasks (Abdul- BERT that achieves an F1-score of 34:03% for Mageed et al., 2021). The task focuses on distin- DA at the country-level development set—an guishing both MSA and DA by their geographi- improvement of 7:63% from previous work. cal origin at both the country and province levels. The data is a collection of tweets covering 100 1 Introduction provinces from 21 Arab countries. -
Linguistics Development Team
Development Team Principal Investigator: Prof. Pramod Pandey Centre for Linguistics / SLL&CS Jawaharlal Nehru University, New Delhi Email: [email protected] Paper Coordinator: Prof. K. S. Nagaraja Department of Linguistics, Deccan College Post-Graduate Research Institute, Pune- 411006, [email protected] Content Writer: Prof. K. S. Nagaraja Prof H. S. Ananthanarayana Content Reviewer: Retd Prof, Department of Linguistics Osmania University, Hyderabad 500007 Paper : Historical and Comparative Linguistics Linguistics Module : Indo-Aryan Language Family Description of Module Subject Name Linguistics Paper Name Historical and Comparative Linguistics Module Title Indo-Aryan Language Family Module ID Lings_P7_M1 Quadrant 1 E-Text Paper : Historical and Comparative Linguistics Linguistics Module : Indo-Aryan Language Family INDO-ARYAN LANGUAGE FAMILY The Indo-Aryan migration theory proposes that the Indo-Aryans migrated from the Central Asian steppes into South Asia during the early part of the 2nd millennium BCE, bringing with them the Indo-Aryan languages. Migration by an Indo-European people was first hypothesized in the late 18th century, following the discovery of the Indo-European language family, when similarities between Western and Indian languages had been noted. Given these similarities, a single source or origin was proposed, which was diffused by migrations from some original homeland. This linguistic argument is supported by archaeological and anthropological research. Genetic research reveals that those migrations form part of a complex genetical puzzle on the origin and spread of the various components of the Indian population. Literary research reveals similarities between various, geographically distinct, Indo-Aryan historical cultures. The Indo-Aryan migrations started in approximately 1800 BCE, after the invention of the war chariot, and also brought Indo-Aryan languages into the Levant and possibly Inner Asia. -
On Burgenland Croatian Isoglosses Peter
Dutch Contributions to the Fourteenth International Congress of Slavists, Ohrid: Linguistics (SSGL 34), Amsterdam – New York, Rodopi, 293-331. ON BURGENLAND CROATIAN ISOGLOSSES PETER HOUTZAGERS 1. Introduction Among the Croatian dialects spoken in the Austrian province of Burgenland and the adjoining areas1 all three main dialect groups of central South Slavic2 are represented. However, the dialects have a considerable number of characteris- tics in common.3 The usual explanation for this is (1) the fact that they have been neighbours from the 16th century, when the Ot- toman invasions caused mass migrations from Croatia, Slavonia and Bos- nia; (2) the assumption that at least most of them were already neighbours before that. Ad (1) Map 14 shows the present-day and past situation in the Burgenland. The different varieties of Burgenland Croatian (henceforth “BC groups”) that are spoken nowadays and from which linguistic material is available each have their own icon. 5 1 For the sake of brevity the term “Burgenland” in this paper will include the adjoining areas inside and outside Austria where speakers of Croatian dialects can or could be found: the prov- ince of Niederösterreich, the region around Bratislava in Slovakia, a small area in the south of Moravia (Czech Republic), the Hungarian side of the Austrian-Hungarian border and an area somewhat deeper into Hungary east of Sopron and between Bratislava and Gyǡr. As can be seen from Map 1, many locations are very far from the Burgenland in the administrative sense. 2 With this term I refer to the dialect continuum formerly known as “Serbo-Croatian”. -
Exonormativity, Endonormativity Or Multilingualism: Teachers' Attitudes
Exonormativity, Endonormativity or Multilingualism: Teachers’ Attitudes towards Pronunciation Issues in Three Kachruian Circles Mohammad Amin Mozaheb Abbas Monfared* Department of Foreign Languages, Language Center, Imam Sadiq University, Tehran, Iran *Corresponding author Abstract Despite the accumulated body of debates surrounding English as an international language (EIL), and stronger orientations towards mutual intelligibility, little research has been done on teachers’ attitudes towards English pronunciation pedagogy in ELT classes. To address this gap, this study explores the perceptions of 352 English teachers from all Kachruvian Circles towards pronunciation pedagogy within the framework of English as an international language. Using a questionnaire, supplemented with interviews, the findings demonstrated an exonormative-endonormative gap among teachers in expanding circles (EC) and outer circles (OC). While teachers in the EC circle were in favour of native-speakerism, OC teachers highly valued their own local forms of English while they were in favour of native English. Native English teachers’ replies were also indicative of their acceptance of different varieties of English. Teachers’ preferences in regard to their attitudes towards varieties of English also show a disconnection between teachers’ theoretical knowledge and practical knowledge about world Englishes (WEs) in ELT classes which might have influenced the construction of their professional identity. This article argues that together with encouraging and valuing -
Voicing Distinctions in the Dutch-German Dialect Continuum
Voicing distinctions in the Dutch-German dialect continuum Nina Ouddeken Meertens Instituut This study investigates the phonetics and phonology of voicing distinctions in the Dutch-German dialect continuum, which forms a transition zone between voicing and aspiration systems. Two phonological approaches to represent this contrast exist in the literature: a [±voice] approach and Laryngeal Realism. The implementation of the change between the two language types in the transition zone will provide new insights in the nature of the phonological representa- tion of the contrast. In this paper I will locate the transition zone by looking at phonetic overlap between VOT values of fortis and lenis plosives, and I will compare the two phonological approaches, showing that both face analytical problems as they cannot explain the variation observed in word-initial plosives and plosive clusters. Keywords: dialectology, phonology, phonetics, Laryngeal Realism, transition zones 1. Introduction Voicing distinctions in plosive systems have been studied extensively in phonolo- gy. Lisker & Abramson (1964) discovered that the phonetic realisations of ‘voiced’ and ‘voiceless’ plosives are not identical across languages: in word-initial posi- tion fortis and lenis plosives can be distinguished in different ways with respect to Voice Onset Time (VOT; describing the onset of vocal fold vibration relative to the moment of plosive release). Languages contrasting two plosive series typically contrast prevoiced plosives with plain voiceless plosives (voicing languages), or plain voiceless plosives with voiceless aspirated plosives (aspiration languages). The boundary between plain voiceless and aspirated plosives is placed around 20– 35 msec (depending on Place of Articulation (PoA)) by Keating (1984): Linguistics in the Netherlands 2016, 106–120. -
Judeo-‐Spanish and the Sephardi
The Last Generation of Native Ladino Speakers? Judeo-Spanish and the Sephardic Community in Seattle Mary K. FitzMorris A thesis submitted in partial fulfillment of the requirements for the degree of Master of Arts University of Washington 2014 Faculty advisor: Devin E. Naar Program Authorized to Offer Degree: Spanish & Portuguese Studies 2 © Copyright 2014 Mary K. FitzMorris 3 University of Washington Abstract The Last Generation of Native Ladino Speakers? Judeo-Spanish and the Sephardic Community in Seattle Mary K. FitzMorris Faculty advisor: Devin E. Naar, Marsha and Jay Glazer Chair in Jewish Studies, Assistant Professor of History, Chair of the Sephardic Studies Program La comunidad sefardí de Seattle, Washington es única no sólo por su tamaño en comparación con el tamaño de la ciudad, sino también por la cohesión que se percibe que existe aquí (Bejarano y Aizenberg, 2012, p. 40n2). Esta comunidad tiene dos sinagogas, varios organizaciones y grupos religiosos y culturales, y, más importantemente, un grupo de hablantes que se reúne cada semana para leer textos en judeo-español y “echar lashon” sobre sus experiencias con esta lengua. De hecho, Seattle es una de las pocas ciudades en el mundo que quedan con una población respetable de ladinohablantes. El judeo-español, o ladino, la lengua histórica de los judíos sefardíes, nació cuando los judíos hispanohablantes fueron expulsados de España en 1492 y se trasladaron a varias partes del mundo, particularmente al Imperio Otomano, integrando elementos de las lenguas que encontraron a su propia 4 lengua ibérica. Un gran porcentaje de la generación más vieja de los sefardíes de Seattle creció, si no hablando, por lo menos escuchando el ladino en casa; eran hijos de inmigrantes recientes, pero no hablaban la lengua con sus propios hijos. -
The Romance Advantage — the Significance of the Romance Languages As a Pathway to Multilingualism
ISSN 1799-2591 Theory and Practice in Language Studies, Vol. 8, No. 10, pp. 1253-1260, October 2018 DOI: http://dx.doi.org/10.17507/tpls.0810.01 The Romance Advantage — The Significance of the Romance Languages as a Pathway to Multilingualism Kathleen Stein-Smith Fairleigh Dickinson University, Metropolitan Campus, Teaneck, NJ, USA Abstract—As 41M in the US speak a Romance language in the home, it is necessary to personally and professionally empower L1 speakers of a Romance language through acquisition of one or more additional Romance languages. The challenge is that Romance language speakers, parents, and communities may be unaware of both the advantages of bilingual and multilingual skills and also of the relative ease in developing proficiency, and even fluency, in a second or third closely related language. In order for students to maximize their Romance language skills, it is essential for parents, educators, and other language stakeholders to work together to increase awareness, to develop curriculum, and to provide teacher training -- especially for Spanish-speakers, who form the vast majority of L1 Romance language speakers in the US, to learn additional Romance languages. Index Terms—romance languages, bilingual education, multilingualism, foreign language learning, romance advantage I. INTRODUCTION The Romance languages, generally considered to be French, Spanish, Italian, Portuguese, and Romanian, and in addition, regional languages including Occitan and Catalan, developed from Latin over a significant period of time and across a considerable geographic area. The beginnings of the Romance languages can be traced to the disappearance of the Roman Empire, along with Latin, its lingua franca. -
INTELLIGIBILITY of STANDARD GERMAN and LOW GERMAN to SPEAKERS of DUTCH Charlotte Gooskens1, Sebastian Kürschner2, Renée Van Be
INTELLIGIBILITY OF STANDARD GERMAN AND LOW GERMAN TO SPEAKERS OF DUTCH Charlotte Gooskens 1, Sebastian Kürschner 2, Renée van Bezooijen 1 1University of Groningen, The Netherlands 2 University of Erlangen-Nürnberg, Germany [email protected], [email protected], [email protected] Abstract This paper reports on the intelligibility of spoken Low German and Standard German for speakers of Dutch. Two aspects are considered. First, the relative potential for intelligibility of the Low German variety of Bremen and the High German variety of Modern Standard German for speakers of Dutch is tested. Second, the question is raised whether Low German is understood more easily by subjects from the Dutch-German border area than subjects from other areas of the Netherlands. This is investigated empirically. The results show that in general Dutch people are better at understanding Standard German than the Low German variety, but that subjects from the border area are better at understanding Low German than subjects from other parts of the country. A larger amount of previous experience with the German standard variety than with Low German dialects could explain the first result, while proximity on the sound level could explain the second result. Key words Intelligibility, German, Low German, Dutch, Levenshtein distance, language contact 1. Introduction Dutch and German originate from the same branch of West Germanic. In the Middle Ages these neighbouring languages constituted a common dialect continuum. Only when linguistic standardisation came about in connection with nation building did the two languages evolve into separate social units. A High German variety spread out over the German language area and constitutes what is regarded as Modern Standard German today. -
The Rhaeto-Romance Languages
Romance Linguistics Editorial Statement Routledge publish the Romance Linguistics series under the editorship of Martin Harris (University of Essex) and Nigel Vincent (University of Manchester). Romance Philogy and General Linguistics have followed sometimes converging sometimes diverging paths over the last century and a half. With the present series we wish to recognise and promote the mutual interaction of the two disciplines. The focus is deliberately wide, seeking to encompass not only work in the phonetics, phonology, morphology, syntax, and lexis of the Romance languages, but also studies in the history of Romance linguistics and linguistic thought in the Romance cultural area. Some of the volumes will be devoted to particular aspects of individual languages, some will be comparative in nature; some will adopt a synchronic and some a diachronic slant; some will concentrate on linguistic structures, and some will investigate the sociocultural dimensions of language and language use in the Romance-speaking territories. Yet all will endorse the view that a General Linguistics that ignores the always rich and often unique data of Romance is as impoverished as a Romance Philogy that turns its back on the insights of linguistics theory. Other books in the Romance Linguistics series include: Structures and Transformations Christopher J. Pountain Studies in the Romance Verb eds Nigel Vincent and Martin Harris Weakening Processes in the History of Spanish Consonants Raymond Harris-N orthall Spanish Word Formation M.F. Lang Tense and Text -
A New Model of Indo–European Subgrouping and Dispersal
A New Model of Indo—European Subgrouping and Dispersal For Prof. Murray Emeneau on his 95th birthday, 28 February 1999 Andrew Garrett University of California, Berkeley 1. Introduction In this century two great discoveries have shaken our view of the Indo—European family tree and protolanguage. The first was the discovery of Hittite, which in turn revealed the existence of an Anatolian branch of Indo—European; the second was the discovery in Central Asia of languages belonging to the previously unknown Tocharian branch of the family. Yet as important as these are, they are not the only twentieth century archaeological finds with Indo—European ramifications. In this paper I will explore the implications of a less dramatic set of discoveries for Indo—European subgrouping. I begin with a question posed in recent work by Johanna Nichols. Like many profound questions, this one is both shockingly obvious and disturbingly obscure: Why does Indo—European have so many branches? Ten are fully documented, and the count rises if you add the so—called ‘minor’ languages – Phrygian, Macedonian, Thracian, Venetic, and others known only through inscriptional remains. This is shown graphically in (1): 1 Celtic Italic Germanic Albanian Greek Indo—European Armenian Anatolian Balto—Slavic Indo—Iranian Tocharian ‘Minor’ languages: Phrygian, Macedonian, etc. Typical subgrouping situations are of two distinct types. One is the family tree with binary or ternary branching. This corresponds to situations where a speech community is separated for some reason, such as population movement into or out of the area it occupies, and the newly separated communities evolve in relative linguistic isolation. -
LNGT0101 Introduction to Linguistics
LNGT0101 Announcements Introduction to Linguistics If you don’t hear from me about your LAP proposal, then you’re good to go. Presentations on Monday: Myth 2: Some languages are just not good enough. Myth 4: French is a logical language. Myth 11: Italian is beautiful; German is ugly. Myth 6: Women talk too much. Lecture #17 Reactions to The Linguists? Nov 9th, 2011 Any questions? Transition from last class Sociolinguistics Sociolinguistics is the study of language in social contexts. It focuses on the language of the speech community and variation within Short clips from ‘Do you speak American?’ with Dennis Preston and Bill Labov. that speech community. There are several sociological variables that affect our usage of language, and sociolinguists are interested in studying linguistic variation correlated with these variables. So, … Variables affecting language use Region. What are some of the sociological variables Ethnicity. that may correlate with linguistic variation? Socio-economic background. Education. Age. Gender. Register/Style Whether or not you know another language. 1 Do you speak American? Northern Cities Vowel Shift http://www.pbs.org/speak/ahead/change/vowelpower/vowel.html The Northern Cities Vowel Shift: Another excerpt from ‘Do you speak American?’ From O’Grady et al 2005, p. 511. The language-dialect distinction The language-dialect distinction So, if one of you grew up in New England and Sociolinguists focus on linguistic diversity another one was born and raised in Georgia, internal to speech communities. One such case you’re still able to understand one another, of linguistic diversity is dialectal variation. despite differences in the language variety So, what’s the difference between a language each of you speaks. -
Narratives of Educational and Social Endeavour W RIGHT EDITED by HAZEL R
Discourses We Live By Narratives of Educational and Social Endeavour W RIGHT EDITED BY HAZEL R. WRIGHT AND MARIANNE HØYEN Discourses We Live By AND What are the infl uences that govern how people view their worlds? What are the embedded Narratives of Educational values and prac� ces that underpin the ways people think and act? H ØYEN and Social Endeavour Discourses We Live By approaches these ques� ons through narra� ve research, in a process that uses words, images, ac� vi� es or artefacts to ask people – either individually or collec� vely within social groupings – to examine, discuss, portray or otherwise make public their place in the world, their sense of belonging to (and iden� ty within) the physical and cultural space they inhabit. DITED BY AZEL RIGHT E H R. W This book is a rich and mul� faceted collec� on of twenty-eight chapters that use varied lenses to examine the discourses that shape people’s lives. The contributors are themselves from many AND MARIANNE HØYEN backgrounds – diff erent academic disciplines within the humani� es and social sciences, diverse professional prac� ces and a range of countries and cultures. They represent a broad spectrum of age, status and outlook, and variously apply their research methods – but share a common interest in people, their lives, thoughts and ac� ons. Gathering such eclec� c experiences as those of student-teachers in Kenya, a released prisoner in Denmark, academics in Colombia, a group of migrants learning English, and gambling addic� on support-workers in Italy, alongside D more mainstream educa� onal themes, the book presents a fascina� ng array of insights.