Morphological Analysis and Lemmatization for Swiss German Using Weighted Transducers
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Syntactic Transformations for Swiss German Dialects
Syntactic transformations for Swiss German dialects Yves Scherrer LATL Universite´ de Geneve` Geneva, Switzerland [email protected] Abstract These transformations are accomplished by a set of hand-crafted rules, developed and evaluated on While most dialectological research so far fo- the basis of the dependency version of the Standard cuses on phonetic and lexical phenomena, we German TIGER treebank. Ultimately, the rule set use recent fieldwork in the domain of dia- lect syntax to guide the development of mul- can be used either as a tool for treebank transduction tidialectal natural language processing tools. (i.e. deriving Swiss German treebanks from Stan- In particular, we develop a set of rules that dard German ones), or as the syntactic transfer mod- transform Standard German sentence struc- ule of a transfer-based machine translation system. tures into syntactically valid Swiss German After the discussion of related work (Section 2), sentence structures. These rules are sensitive we present the major syntactic differences between to the dialect area, so that the dialects of more Standard German and Swiss German dialects (Sec- than 300 towns are covered. We evaluate the tion 3). We then show how these differences can transformation rules on a Standard German treebank and obtain accuracy figures of 85% be covered by a set of transformation rules that ap- and above for most rules. We analyze the most ply to syntactically annotated Standard German text, frequent errors and discuss the benefit of these such as found in treebanks (Section 4). In Section transformations for various natural language 5, we give some coverage figures and discuss the processing tasks. -
THE SWISS GERMAN LANGUAGE and IDENTITY: STEREOTYPING BETWEEN the AARGAU and the ZÜRICH DIALECTS by Jessica Rohr
Purdue University Purdue e-Pubs Open Access Theses Theses and Dissertations 12-2016 The wS iss German language and identity: Stereotyping between the Aargau and the Zurich dialects Jessica Rohr Purdue University Follow this and additional works at: https://docs.lib.purdue.edu/open_access_theses Part of the Anthropological Linguistics and Sociolinguistics Commons, and the Language Interpretation and Translation Commons Recommended Citation Rohr, Jessica, "The wS iss German language and identity: Stereotyping between the Aargau and the Zurich dialects" (2016). Open Access Theses. 892. https://docs.lib.purdue.edu/open_access_theses/892 This document has been made available through Purdue e-Pubs, a service of the Purdue University Libraries. Please contact [email protected] for additional information. THE SWISS GERMAN LANGUAGE AND IDENTITY: STEREOTYPING BETWEEN THE AARGAU AND THE ZÜRICH DIALECTS by Jessica Rohr A Thesis Submitted to the Faculty of Purdue University In Partial Fulfillment of the Requirements for the degree of Master of Arts Department of Languages and Cultures West Lafayette, Indiana December 2016 ii THE PURDUE UNIVERSITY GRADUATE SCHOOL STATEMENT OF THESIS APPROVAL Dr. John Sundquist, Chair Department of German and Russian Dr. Daniel J. Olson Department of Spanish Dr. Myrdene Anderson Department of Anthropology Approved by: Dr. Madeleine M Henry Head of the Departmental Graduate Program iii To my Friends and Family iv ACKNOWLEDGMENTS I would like to thank my major professor, Dr. Sundquist, and my committee members, Dr. Olson, and Dr. Anderson, for their support and guidance during this process. Your guidance kept me motivated and helped me put the entire project together, and that is greatly appreciated. -
Searching for the Bilingual Advantage in Executive Functions in Speakers of Hunsrückisch and Other Minority Languages: a Literature Review
Searching for the bilingual advantage in executive functions in speakers of Hunsrückisch and other minority languages: a literature review Elisabeth Abreu and Bernardo Limberger (Rio Grande do Sul) Abstract The issue of a bilingual advantage on executive functions has been a hotbed for research and debate. Brazilian studies with the minority language Hunsrückisch have failed to replicate the finding of a bilingual advantage found in international studies with majority languages. This raises the question of whether the reasons behind the discrepant results are related to the lan- guage’s minority status. The goal of this paper was to investigate the bilingual advantage on executive functions in studies with minority languages. This is a literature review focusing on state of the art literature on bilingualism and executive functions; as well as a qualitative anal- ysis of the selected corpus in order to tackle the following questions: (1) is there evidence of a bilingual advantage in empirical studies involving minority languages? (2) are there common underlying causes for the presence or absence of the bilingual advantage? (3) can factors per- taining to the language’s minority status be linked to the presence or absence of a bilingual advantage? The analysis revealed that studies form a highly variable group, with mixed results regarding the bilingual advantage, as well as inconsistent controlling of social, cognitive and linguistic factors, as well as different sample sizes. As such, it was not possible to isolate which factors are responsible for the inconsistent results across studies. It is hoped this study will provide an overview that can serve as common ground for future studies involving the issue of a bilingual advantage with speakers of minority languages. -
Kashubian INDO-IRANIAN IRANIAN INDO-ARYAN WESTERN
2/27/2018 https://upload.wikimedia.org/wikipedia/commons/4/4f/IndoEuropeanTree.svg INDO-EUROPEAN ANATOLIAN Luwian Hittite Carian Lydian Lycian Palaic Pisidian HELLENIC INDO-IRANIAN DORIAN Mycenaean AEOLIC INDO-ARYAN Doric Attic ACHAEAN Aegean Northwest Greek Ionic Beotian Vedic Sanskrit Classical Greek Arcado Thessalian Tsakonian Koine Greek Epic Greek Cypriot Sanskrit Prakrit Greek Maharashtri Gandhari Shauraseni Magadhi Niya ITALIC INSULAR INDIC Konkani Paisaci Oriya Assamese BIHARI CELTIC Pali Bengali LATINO-FALISCAN SABELLIC Dhivehi Marathi Halbi Chittagonian Bhojpuri CONTINENTAL Sinhalese CENTRAL INDIC Magahi Faliscan Oscan Vedda Maithili Latin Umbrian Celtiberian WESTERN INDIC HINDUSTANI PAHARI INSULAR Galatian Classical Latin Aequian Gaulish NORTH Bhil DARDIC Hindi Urdu CENTRAL EASTERN Vulgar Latin Marsian GOIDELIC BRYTHONIC Lepontic Domari Ecclesiastical Latin Volscian Noric Dogri Gujarati Kashmiri Haryanvi Dakhini Garhwali Nepali Irish Common Brittonic Lahnda Rajasthani Nuristani Rekhta Kumaoni Palpa Manx Ivernic Potwari Romani Pashayi Scottish Gaelic Pictish Breton Punjabi Shina Cornish Sindhi IRANIAN ROMANCE Cumbric ITALO-WESTERN Welsh EASTERN Avestan WESTERN Sardinian EASTERN ITALO-DALMATIAN Corsican NORTH SOUTH NORTH Logudorese Aromanian Dalmatian Scythian Sogdian Campidanese Istro-Romanian Istriot Bactrian CASPIAN Megleno-Romanian Italian Khotanese Romanian GALLO-IBERIAN Neapolitan Ossetian Khwarezmian Yaghnobi Deilami Sassarese Saka Gilaki IBERIAN Sicilian Sarmatian Old Persian Mazanderani GALLIC SOUTH Shahmirzadi Alanic -
Toward an Orthography: the Textualization of Swiss German
Toward an Orthography: The Textualization of Swiss German Yiwei Luo This paper discusses the future of Swiss German. Currently Swiss German is considered a variety of the Standard German dialect because it is not a written language. The author argues, however, that Swiss German has a much larger presence currently than being a variety. In a number of examples shown, Swiss German is growing in numbers of speakers and in ways it is used. Because of this, there are more instances of written Swiss German being produced. There are currently a few online sources dedicated to standardizing Swiss German spelling, yet the future of Swiss Ger- man is uncertain. An Introduction A motif of human civilization has been to accord authority com mensurate to age. Writing, however, stands out as an exception: de spite being predated by spoken language by eons, it is writing that commands greater respect. For many centuries, people considered a written form to be the key determiner for language hood, and that forms restricted to the domain of speech were merely “vernac ulars.”1 This particular belief prevailed in the West until about a thousand years ago when vernaculars began to slowly gain prestige and find their way onto paper.2 However, traces of this former bias persist where Alemannic dialects are concerned. Swiss German— though mutually unintelligible with Standard German—is not considered a real language due to its lack of presence in writing. Native speakers of Standard German recognize the Swiss variety as distinct, but in contrasting it with “real German,” “actual German,” and “proper German” (just to name a few monikers for Standard German given by a Berliner), their views on the subject become clear: Swiss German is not a language. -
Vocalisations: Evidence from Germanic Gary Taylor-Raebel A
Vocalisations: Evidence from Germanic Gary Taylor-Raebel A thesis submitted for the degree of doctor of philosophy Department of Language and Linguistics University of Essex October 2016 Abstract A vocalisation may be described as a historical linguistic change where a sound which is formerly consonantal within a language becomes pronounced as a vowel. Although vocalisations have occurred sporadically in many languages they are particularly prevalent in the history of Germanic languages and have affected sounds from all places of articulation. This study will address two main questions. The first is why vocalisations happen so regularly in Germanic languages in comparison with other language families. The second is what exactly happens in the vocalisation process. For the first question there will be a discussion of the concept of ‘drift’ where related languages undergo similar changes independently and this will therefore describe the features of the earliest Germanic languages which have been the basis for later changes. The second question will include a comprehensive presentation of vocalisations which have occurred in Germanic languages with a description of underlying features in each of the sounds which have vocalised. When considering phonological changes a degree of phonetic information must necessarily be included which may be irrelevant synchronically, but forms the basis of the change diachronically. A phonological representation of vocalisations must therefore address how best to display the phonological information whilst allowing for the inclusion of relevant diachronic phonetic information. Vocalisations involve a small articulatory change, but using a model which describes vowels and consonants with separate terminology would conceal the subtleness of change in a vocalisation. -
Swiping in Germanic
Swiping in Germanic Jason Merchant University of Chicago Establishing the level of representation or the point in a derivation at which movement takes place has never been a trivial matter, and as such remains an topic of substantial ongoing interest. For overt movement, this question is complicated by the availability in principle of two components in which movement could take place with indistinguishable effects on word order: in the derivation leading to Spell-Out, or in the mapping from Spell-Out to PF. To a great extent, the reasoning brought to bear on this question has been concentrated on A- and A'-movement and their properties; head-movement, in contrast, has remained a distant third. In this paper, I show that a little-studied peculiarity of elliptical wh-questions in Germanic can cast new light on this question, providing evidence that there is indeed head-movement which takes place late in the derivation at PF, after Spell-Out.* The peculiarity in question comes from a range of data found only under sluicing in a subset of the Germanic languages. In particular, it is found in sluices involving certain prepositions, in which the [+wh] object of the preposition appears not after the preposition in the usual head-complement order, but before it, as in (1). (1) Peter went to the movies, but I don’t know who with. I will call this kind of exceptional inversion of the usual order of the preposition and its argument swiping, for sluiced wh-word inversion with prepositions (in Northern Germanic). Any examination of swiping or other curiosities of sluicing must, of course, begin with the necessary background on sluicing itself, which is given in section 1 with particular attention to the facts across Germanic. -
Short-Form ''Doubling Verbs" in Schwyzerdiitsch Ko . · Fo
OSU Working Papers in Linguistics 52, 89-95 Short-form ''Doubling Verbs" in Schwyzerdiitsch Charlotte Christ Schaengold Certain verbs in Schwyzerdiitsch appear .twice in ·a sentence without a corresponding repetition of meaning (Niibling 1995:173). This paper explores some possible explanations of this "doubling verb'' phenomenon, using the current.literature on the subject and interviews with native speakers.' . The first appearance of the "doubled" verb in a sentence is an inflected form of the verb, and the second- appearance is a short form CV stem. The sentences below illustrate this phenomenon; the short forms are,in bold. In the first example "go"·is the short form and "gang" is the inflected 1st person form of the verb "goo" (gehen in Modem High German) 'go.' There are four verbs in Schwyzerdlltsch which can be "doubled" in this way, with each "doubled" verb showing different patterns of usage and frequency of use; there are also historical differences and differences in .use among the different regional dialects of the country. e.g.: l. i gang , · go schwimm~ 'I'm going swimming' I go(lst sg) go(stem) swim(inf) 2. es kunnt ko . rruigne . 'it's about to rain' it come(3rd sg) come(stem) rain(inf) 3. si foot a · fo ·schaffe' 'she's beginning to work' ' Native-speaking experts: Jakob Christ, a~ 70; Rike Christ-Kutter, age 63; Max Reif, age -45; Christa,:·· Baltzer, age 35; Daniel Sachs, age 27; Rosina Christ, age 20. 90 DOUBLING VERBS IN SCHWY2ERD()TSCI{ she begin(3rd sg) begin(stem) work(inf) 4. er loot lo griiesse 'he sends greetings' he let(3rd sg) let(stem) greet(inf) The histories of the "doubled" verbs are diverse: while the doubled use of "go" is found already in Middle High German literature and is used in modem times all over German-speaking Switzerland, as well as in parts of Southern Germany, the doubled use of "afo" and "lo" is found only in certain Swiss dialects. -
Austrian German in Teaching the German Language
International Journal of Scientific and Research Publications, Volume 5, Issue 2, February 2015 1 ISSN 2250-3153 Austrian German in Teaching the German Language Valentina Ilieva PhD, Faculty of Education, Bitola, Macedonia Abstract- The German language and German speaking countries out. The results indicate that there is inadequate application of are the center of attention of all teachers of the German language the DACHL Concept and the pluricentricity of the German worldwide. Teachers of the German language master the German language, which, consequently, implies the necessity of utilizing language in the course of their studies and then they teach it to additional language teaching materials which would alleviate not their students or adult learners. In the process of preparing and only the mastering of the German language but also the process giving their lectures they come across a lot of issues and, of adaptation of students and adult learners of German in the consequently, they do their utmost to handle them. German speaking countries. This would also prevent the Understandably, the main reason why both students and adult occurrence of cultural and linguistic shocks, as well as the learners try to master the German language is to be able to emergence of empathy towards the German language and communicate effectively with native speakers of the German culture. language in their own country or in the German speaking countries. Bearing in mind the fact that approximately 100 Index Terms- language community, nation, the German million people speak German, it is not difficult to assume that language as an official language, dialect regions, pluricentricity there are many diverse dialects of the German language. -
Parsing Approaches for Swiss German
Master’s thesis presented to the Faculty of Arts and Social Sciences of the University of Zurich for the degree of Master of Arts UZH nmod case compound amod Parsing Approaches for Swiss German NOUN NOUN ADP ADJ NOUN Author: Noëmi Aepli Student ID Nr.: 10-719-250 Examiner: Prof. Dr. Martin Volk Supervisor: Dr. Simon Clematide Institute of Computational Linguistics Submission date: 10.01.2018 nmod case compound amod Parsing Approaches for Swiss German Abstract NOUN NOUN ADP ADJ NOUN Abstract This thesis presents work on universal dependency parsing for Swiss German. Natural language pars- ing describes the process of syntactic analysis in Natural Language Processing (NLP) and is a key area as many applications rely on its output. Building a statistical parser requires expensive resources which are only available for a few dozen languages. Hence, for the majority of the world’s languages, other ways have to be found to circumvent the low-resource problem. Triggered by such scenarios, research on different approaches to cross-lingual learning is going on. These methods seem promising for closely related languages and hence especially for dialects and varieties. Swiss German is a dialect continuum of the Alemannic dialect group. It comprises numerous vari- eties used in the German-speaking part of Switzerland. Although mainly oral varieties (Mundarten), they are frequently used in written communication. On the basis of their high acceptance in the Swiss culture and with the introduction of digital communication, Swiss German has undergone a spread over all kinds of communication forms and social media. Considering the lack of standard spelling rules, this leads to a huge linguistic variability because people write the way they speak. -
The Sociolinguistic Situation of Two Language Islands in Ohio and Argentina
The sociolinguistic situation of two language islands in Ohio and Argentina Robert Klosinski Pennsylvania State University Abstract Argentina and Ohio are homes of two distinct language islands (‘Sprachinseln’). This paper provides a basic overview of the history of these two heritage grammars, and comment on the current state of the speech commu- nities. Analyses suggest that Swiss German is now moribund (i.e. the community consists of the final generation of fluent speakers, as it was not passed on to the next generation). Similarly, numbers of fluent MSG speakers are decreasing. Yet, the degree of decay is different to that from Kidron, as there is a general cultural awareness within the speaker group to maintain parts of the language. 1 Introduction The following paper focuses on two language islands (i.e. Sprachinseln) located in North and South America, respectively. More specifically, the sociolinguistic situation will be introduced to trace down the nature of the language shift in the respective communities. Research on either community has been scarce, mainly focusing on syntactic and phonological phenomena of these Sprachinseln (Hoff- man & Klosinski 2018 for Kidron Swiss German; Putnam & Lipski 2016; Putnam & Schwarz 2014 for Misionero German). Both communities are far removed from the home country and thus their base dialect. Nevertheless, they speak distinct va- rieties of German. The language island in Ohio is Swiss German, and in Argentina it is a German variety, mainly from the Hunsrück area of Germany. Crucially, the sociolinguistics background has been neglected to date. However, before the two communities of this paper are introduced, it is important to discuss the term Sprachinsel (‘speech island’) as it is seen in the literature. -
Maintenance and Change of Bernese in Ohio and Misiones (Argentina)
AG 1: ESA1 HG HS J Maintenance and change of Bernese in Ohio and Misiones (Argentina) Donnerstag, Robert Klosinski 05.03.2020 The Pennsylvania State University 13:45–14:15 ESA1 HG HS J [email protected] Overview. Misionero Swiss German (hereafter: MSG) and Ohio Swiss German (hereafter: OSG) are two moribund heritage Swiss German varieties spoken in the Misiones Province of North-eastern AG 1 Argentina and in Ohio, USA respectively. While OSG has been subject to limited investigations, MSG has not been the subject to any known linguistic investigations. For the sake of this pre- sentation, I focus on three MSG and seven OSG speakers of Bernese descent. The current study investigates three phonological features, namely (i) l-vocalization, (ii) velarization of -nd to [ŋ], and (iii) singleton-geminate distinction as well as the lexicon to shed light at potential transfer or the lack thereof from the surrounding languages, i.e. English and Spanish. A previous study by Hoffman and Klosinski (2018) identified that the singleton-geminate distinction common in various Swiss dialects is maintained in OSG while contact with English has triggered a phonetic change. Importantly, Polinsky (2018) argues, that transfer by itself “cannot fully account for all the changes in the weaker language”, i.e. the heritage language (here: Bernese). Research questions. i) Is there change in the speech of heritage Bernese in comparison with the continental variety and with respect to each other? ii) if a change in heritage Bernese is found, can this be attributed to transfer or to other internal developments? Method. Three MSG-Spanish as well as seven OSG-English bilinguals were recruited in their home area and a picture-naming as well as semi-structured elicitation tasks were conducted.