Aarhus 231 Talk
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Syntactic Transformations for Swiss German Dialects
Syntactic transformations for Swiss German dialects Yves Scherrer LATL Universite´ de Geneve` Geneva, Switzerland [email protected] Abstract These transformations are accomplished by a set of hand-crafted rules, developed and evaluated on While most dialectological research so far fo- the basis of the dependency version of the Standard cuses on phonetic and lexical phenomena, we German TIGER treebank. Ultimately, the rule set use recent fieldwork in the domain of dia- lect syntax to guide the development of mul- can be used either as a tool for treebank transduction tidialectal natural language processing tools. (i.e. deriving Swiss German treebanks from Stan- In particular, we develop a set of rules that dard German ones), or as the syntactic transfer mod- transform Standard German sentence struc- ule of a transfer-based machine translation system. tures into syntactically valid Swiss German After the discussion of related work (Section 2), sentence structures. These rules are sensitive we present the major syntactic differences between to the dialect area, so that the dialects of more Standard German and Swiss German dialects (Sec- than 300 towns are covered. We evaluate the tion 3). We then show how these differences can transformation rules on a Standard German treebank and obtain accuracy figures of 85% be covered by a set of transformation rules that ap- and above for most rules. We analyze the most ply to syntactically annotated Standard German text, frequent errors and discuss the benefit of these such as found in treebanks (Section 4). In Section transformations for various natural language 5, we give some coverage figures and discuss the processing tasks. -
A Bavarian-Speaking Exception in Alemannic-Speaking Switzerland: the Case of Samnaun 48 Located
47 A Bavarian -speaking 1. Introduction1 data2 is missing, as the study by Gröger is the Exception in Alemannic- only detailed linguistic description of the speaking Switzerland: he municipality of Samnaun, situated German dialect in Samnaun. in the extreme east of Switzerland, In this paper, I present a new research The Case of Samnaun project which seeks to fill this gap. The T stands out from the rest of German- speaking Switzerland: Samnaun is usually project is dedicated to the current linguistic A Project Presentation described as the only place where not an situation in Samnaun. Its working title is Alemannic, but a Bavarian dialect is spoken Bairisch-alemannischer Sprachkontakt. Das Journal Article (cf., e.g., Sonderegger 2003: 2839; Wiesinger Spektrum der sprachlichen Variation in Susanne Oberholzer 1983: 817). This claim is based on a study by Samnaun (‘Bavarian-Alemannic Language Gröger (1924) and has found its way into Contact. The Range of Linguistic Variation in Samnaun has been described as the only Samnaun’). This paper describes Samnaun Bavarian-speaking municipality in Ale- many linguistic descriptions of (German- mannic-speaking Switzerland on the basis speaking) Switzerland. In the literature, this and summarises the available descriptions of of a study done in 1924. Hints in the viewpoint has been almost unchallenged for its language situation as well as the aims, literature about the presence of other over 90 years. Hints about the presence of research questions, and methods of the varieties for everyday communication – an other varieties (an Alemannic dialect as well planned project. intermediate variety on the dialect- In section 2, I sketch the municipality of standard -axis as well as an Alemannic as an intermediate variety on the dialect- dialect – have not resulted in more recent standard-axis) from the second half of the Samnaun. -
Morphological Analysis and Lemmatization for Swiss German Using Weighted Transducers
Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016) Morphological analysis and lemmatization for Swiss German using weighted transducers Reto Baumgartner University of Zurich [email protected] Abstract haar and Wyler, 1997, p. 37). Conversely it pos- sesses infinitive particles that are not known to StG. With written Swiss German becoming SwG consists of different local dialects that more popular in everyday use, it has be- mainly differ in phonology and to a lesser extent in come a target for text processing. The ab- vocabulary. There is no standard orthography, but sence of a standard orthography and the there are proposals for sound-character assignment variety of dialects, however, lead to a vast like Dieth-Schreibung (Dieth, 1986) or Bärndüt- variation in different spellings which makes schi Schrybwys (Marti, 1985a) that are, however, this task difficult. We built a system based not known to everyone. This results in a high vari- on weighted transducers that recognizes ability of spellings influenced by both dialects and over 90% of the tokens in certain texts. personal writing preferences. As an example for Weights ensure preferring the best analysis the StG word Jahr “year”, we found in our corpus for most words while at the same time al- Jahr and Jaar, Johr and Joor, even Joh for different lowing for very broad range of spelling pronunciations and spelling preferences. variations. Our morphological tagset that The lack of a standard orthography and the vast- we defined for this purpose and lemmas in ness of variants motivate the choices that have to Standard German open the possibility for be made to process these dialects. -
THE SWISS GERMAN LANGUAGE and IDENTITY: STEREOTYPING BETWEEN the AARGAU and the ZÜRICH DIALECTS by Jessica Rohr
Purdue University Purdue e-Pubs Open Access Theses Theses and Dissertations 12-2016 The wS iss German language and identity: Stereotyping between the Aargau and the Zurich dialects Jessica Rohr Purdue University Follow this and additional works at: https://docs.lib.purdue.edu/open_access_theses Part of the Anthropological Linguistics and Sociolinguistics Commons, and the Language Interpretation and Translation Commons Recommended Citation Rohr, Jessica, "The wS iss German language and identity: Stereotyping between the Aargau and the Zurich dialects" (2016). Open Access Theses. 892. https://docs.lib.purdue.edu/open_access_theses/892 This document has been made available through Purdue e-Pubs, a service of the Purdue University Libraries. Please contact [email protected] for additional information. THE SWISS GERMAN LANGUAGE AND IDENTITY: STEREOTYPING BETWEEN THE AARGAU AND THE ZÜRICH DIALECTS by Jessica Rohr A Thesis Submitted to the Faculty of Purdue University In Partial Fulfillment of the Requirements for the degree of Master of Arts Department of Languages and Cultures West Lafayette, Indiana December 2016 ii THE PURDUE UNIVERSITY GRADUATE SCHOOL STATEMENT OF THESIS APPROVAL Dr. John Sundquist, Chair Department of German and Russian Dr. Daniel J. Olson Department of Spanish Dr. Myrdene Anderson Department of Anthropology Approved by: Dr. Madeleine M Henry Head of the Departmental Graduate Program iii To my Friends and Family iv ACKNOWLEDGMENTS I would like to thank my major professor, Dr. Sundquist, and my committee members, Dr. Olson, and Dr. Anderson, for their support and guidance during this process. Your guidance kept me motivated and helped me put the entire project together, and that is greatly appreciated. -
Hatt Or Si? Neuter and Feminine Gender Assignment in Reference to Female Persons in Luxembourgish
Hatt or si? Neuter and feminine gender assignment in reference to female persons in Luxembourgish Abstract: In Luxembourgish, feminine as well as neuter gender can be assigned to female persons. Here, female first names are morphologically treated as neuter and therefore trigger neuter gender on their targets (e.g. definite article, personal pronoun). Last names referring to women, however, are feminine and take feminine targets respectively. While the use of neuter and feminine in prototypical and invariable reference contexts are well-known, morphological conflicts often arise regarding more complex name types (e.g. female first name + last name) leading to different degrees of variation between both genders. Building especially upon previous findings by Döhmer (2016), the present contribution offers a first extensive empirical analysis on the use of neuter and feminine personal pronouns considering different female referents as well as familiarity, the referent’s and the speaker’s as decisive (socio-pragmatical) factors for gender assignment. The results are based on elicited data retrieved from an online survey and audio recordings collected by means of the the Luxembourgish language app Schnëssen and allow a quantification of the phenomenon going beyond previous contributions and descriptions in reference grammars. The apparent-time analysis, carried out in order to identify potential tendencies in language change, suggests a preference for neuter pronominalization for younger speakers of Luxembourgish in variable reference contexts. keywords: gender assignment, variation, pronominalization, neuter, Luxembourgish 1 Introduction While a correlation between gender and sex usually applies to appellatives and anthroponyms, one can use feminine as well as neuter when referring to female persons in Luxembourgish. -
Searching for the Bilingual Advantage in Executive Functions in Speakers of Hunsrückisch and Other Minority Languages: a Literature Review
Searching for the bilingual advantage in executive functions in speakers of Hunsrückisch and other minority languages: a literature review Elisabeth Abreu and Bernardo Limberger (Rio Grande do Sul) Abstract The issue of a bilingual advantage on executive functions has been a hotbed for research and debate. Brazilian studies with the minority language Hunsrückisch have failed to replicate the finding of a bilingual advantage found in international studies with majority languages. This raises the question of whether the reasons behind the discrepant results are related to the lan- guage’s minority status. The goal of this paper was to investigate the bilingual advantage on executive functions in studies with minority languages. This is a literature review focusing on state of the art literature on bilingualism and executive functions; as well as a qualitative anal- ysis of the selected corpus in order to tackle the following questions: (1) is there evidence of a bilingual advantage in empirical studies involving minority languages? (2) are there common underlying causes for the presence or absence of the bilingual advantage? (3) can factors per- taining to the language’s minority status be linked to the presence or absence of a bilingual advantage? The analysis revealed that studies form a highly variable group, with mixed results regarding the bilingual advantage, as well as inconsistent controlling of social, cognitive and linguistic factors, as well as different sample sizes. As such, it was not possible to isolate which factors are responsible for the inconsistent results across studies. It is hoped this study will provide an overview that can serve as common ground for future studies involving the issue of a bilingual advantage with speakers of minority languages. -
Kashubian INDO-IRANIAN IRANIAN INDO-ARYAN WESTERN
2/27/2018 https://upload.wikimedia.org/wikipedia/commons/4/4f/IndoEuropeanTree.svg INDO-EUROPEAN ANATOLIAN Luwian Hittite Carian Lydian Lycian Palaic Pisidian HELLENIC INDO-IRANIAN DORIAN Mycenaean AEOLIC INDO-ARYAN Doric Attic ACHAEAN Aegean Northwest Greek Ionic Beotian Vedic Sanskrit Classical Greek Arcado Thessalian Tsakonian Koine Greek Epic Greek Cypriot Sanskrit Prakrit Greek Maharashtri Gandhari Shauraseni Magadhi Niya ITALIC INSULAR INDIC Konkani Paisaci Oriya Assamese BIHARI CELTIC Pali Bengali LATINO-FALISCAN SABELLIC Dhivehi Marathi Halbi Chittagonian Bhojpuri CONTINENTAL Sinhalese CENTRAL INDIC Magahi Faliscan Oscan Vedda Maithili Latin Umbrian Celtiberian WESTERN INDIC HINDUSTANI PAHARI INSULAR Galatian Classical Latin Aequian Gaulish NORTH Bhil DARDIC Hindi Urdu CENTRAL EASTERN Vulgar Latin Marsian GOIDELIC BRYTHONIC Lepontic Domari Ecclesiastical Latin Volscian Noric Dogri Gujarati Kashmiri Haryanvi Dakhini Garhwali Nepali Irish Common Brittonic Lahnda Rajasthani Nuristani Rekhta Kumaoni Palpa Manx Ivernic Potwari Romani Pashayi Scottish Gaelic Pictish Breton Punjabi Shina Cornish Sindhi IRANIAN ROMANCE Cumbric ITALO-WESTERN Welsh EASTERN Avestan WESTERN Sardinian EASTERN ITALO-DALMATIAN Corsican NORTH SOUTH NORTH Logudorese Aromanian Dalmatian Scythian Sogdian Campidanese Istro-Romanian Istriot Bactrian CASPIAN Megleno-Romanian Italian Khotanese Romanian GALLO-IBERIAN Neapolitan Ossetian Khwarezmian Yaghnobi Deilami Sassarese Saka Gilaki IBERIAN Sicilian Sarmatian Old Persian Mazanderani GALLIC SOUTH Shahmirzadi Alanic -
Toward an Orthography: the Textualization of Swiss German
Toward an Orthography: The Textualization of Swiss German Yiwei Luo This paper discusses the future of Swiss German. Currently Swiss German is considered a variety of the Standard German dialect because it is not a written language. The author argues, however, that Swiss German has a much larger presence currently than being a variety. In a number of examples shown, Swiss German is growing in numbers of speakers and in ways it is used. Because of this, there are more instances of written Swiss German being produced. There are currently a few online sources dedicated to standardizing Swiss German spelling, yet the future of Swiss Ger- man is uncertain. An Introduction A motif of human civilization has been to accord authority com mensurate to age. Writing, however, stands out as an exception: de spite being predated by spoken language by eons, it is writing that commands greater respect. For many centuries, people considered a written form to be the key determiner for language hood, and that forms restricted to the domain of speech were merely “vernac ulars.”1 This particular belief prevailed in the West until about a thousand years ago when vernaculars began to slowly gain prestige and find their way onto paper.2 However, traces of this former bias persist where Alemannic dialects are concerned. Swiss German— though mutually unintelligible with Standard German—is not considered a real language due to its lack of presence in writing. Native speakers of Standard German recognize the Swiss variety as distinct, but in contrasting it with “real German,” “actual German,” and “proper German” (just to name a few monikers for Standard German given by a Berliner), their views on the subject become clear: Swiss German is not a language. -
The Wili Benchmark Dataset for Written Natural Language Identification
1 The WiLI benchmark dataset for written II. WRITTEN LANGUAGE BASICS language identification A language is a system of communication. This could be spoken language, written language or sign language. A spoken language Martin Thoma can have several forms of written language. For example, the E-Mail: [email protected] spoken Chinese language has at least three writing systems: Traditional Chinese characters, Simplified Chinese characters and Pinyin — the official romanization system for Standard Chinese. Abstract—This paper describes the WiLI-2018 benchmark dataset for monolingual written natural language identification. Languages evolve. New words like googling, television and WiLI-2018 is a publicly available,1 free of charge dataset of Internet get added, but written languages are also refactored. short text extracts from Wikipedia. It contains 1000 paragraphs For example, the German orthography reform of 1996 aimed at of 235 languages, totaling in 235 000 paragraphs. WiLI is a classification dataset: Given an unknown paragraph written in making the written language simpler. This means any system one dominant language, it has to be decided which language it which recognizes language and any benchmark needs to be is. adapted over time. Hence WiLI is versioned by year. Languages do not necessarily have only one name. According to Wikipedia, the Sranan language is also known as Sranan Tongo, Sranantongo, Surinaams, Surinamese, Surinamese Creole and I. INTRODUCTION Taki Taki. This makes ISO 369-3 valuable, but not all languages are represented in ISO 369-3. As ISO 369-3 uses combinations The identification of written natural language is a task which of 3 Latin letters and has 547 reserved combinations, it can appears often in web applications. -
Spatial Boundaries and Transitions in Language and Interaction
URPP Language and Space www.spur.uzh.ch International Conference SPATIAL BOUNDARIES AND TRANSITIONS IN LANGUAGE AND INTERACTION Perspectives from Linguistics and Geography April 23 – 28, 2017 Monte Verità, Ascona http://www.spur.uzh.ch/boundaries PROGRAM MONDAY, 24. APRIL TUESDAY, 25. APRIL 08:00 Breakfast Breakfast 09:00 Welcome 09:15 1 KEYNOTE TALK: Tom Güldemann 9 KEYNOTE TALK: Setha Low Linguistic macro-areas in Africa: when Language, Discourse and Space: boundaries are areas themselves p. 01 A Conceptual Framework for the Ethnography of Space and Place p. 09 10:15 Break Break 10:45 2 INPUT TALK SESSION 1: Peter Auer 10 INPUT TALK SESSION 2: Alfred Lameli Walking and talking: how speakers jointly Intangible Borders – Linguistic Areas and manoeuver in space p. 02 Socio-Cultural Practices p. 10 11:30 3 Paul Luff, Christian Heath, Menisha Patel 11 Stefanie Siebenhuetter Boundaries in interaction spaces: embodied Conceptual transfer of spatial reference due interaction within a large working to language contact? A semantic approach environment p. 03 to cultural conceptualization in the linguistic area Mainland Southeast Asia p. 11 12:00 Lunch Lunch 14:00 4 Martin de Heaver, Paul Luff, 12 Grossenbacher, Britain, Leemann, Christian Heath Kolly, Blaxter, Wanitsch Crossing Boundaries: interactions through Smartphone app methodologies for regional locations within a moving environment p. 04 dialectology: the English North-South divide in data from the English Dialects App p. 12 14:30 5 Albert Acedo and Marco Painho: 13 Daan Hovens “You should participate” or “I want What is the language of the euregio to participate” – engaging spatial rhine-meuse-north? Euroregional integration boundaries p. -
Vocalisations: Evidence from Germanic Gary Taylor-Raebel A
Vocalisations: Evidence from Germanic Gary Taylor-Raebel A thesis submitted for the degree of doctor of philosophy Department of Language and Linguistics University of Essex October 2016 Abstract A vocalisation may be described as a historical linguistic change where a sound which is formerly consonantal within a language becomes pronounced as a vowel. Although vocalisations have occurred sporadically in many languages they are particularly prevalent in the history of Germanic languages and have affected sounds from all places of articulation. This study will address two main questions. The first is why vocalisations happen so regularly in Germanic languages in comparison with other language families. The second is what exactly happens in the vocalisation process. For the first question there will be a discussion of the concept of ‘drift’ where related languages undergo similar changes independently and this will therefore describe the features of the earliest Germanic languages which have been the basis for later changes. The second question will include a comprehensive presentation of vocalisations which have occurred in Germanic languages with a description of underlying features in each of the sounds which have vocalised. When considering phonological changes a degree of phonetic information must necessarily be included which may be irrelevant synchronically, but forms the basis of the change diachronically. A phonological representation of vocalisations must therefore address how best to display the phonological information whilst allowing for the inclusion of relevant diachronic phonetic information. Vocalisations involve a small articulatory change, but using a model which describes vowels and consonants with separate terminology would conceal the subtleness of change in a vocalisation. -
Swiping in Germanic
Swiping in Germanic Jason Merchant University of Chicago Establishing the level of representation or the point in a derivation at which movement takes place has never been a trivial matter, and as such remains an topic of substantial ongoing interest. For overt movement, this question is complicated by the availability in principle of two components in which movement could take place with indistinguishable effects on word order: in the derivation leading to Spell-Out, or in the mapping from Spell-Out to PF. To a great extent, the reasoning brought to bear on this question has been concentrated on A- and A'-movement and their properties; head-movement, in contrast, has remained a distant third. In this paper, I show that a little-studied peculiarity of elliptical wh-questions in Germanic can cast new light on this question, providing evidence that there is indeed head-movement which takes place late in the derivation at PF, after Spell-Out.* The peculiarity in question comes from a range of data found only under sluicing in a subset of the Germanic languages. In particular, it is found in sluices involving certain prepositions, in which the [+wh] object of the preposition appears not after the preposition in the usual head-complement order, but before it, as in (1). (1) Peter went to the movies, but I don’t know who with. I will call this kind of exceptional inversion of the usual order of the preposition and its argument swiping, for sluiced wh-word inversion with prepositions (in Northern Germanic). Any examination of swiping or other curiosities of sluicing must, of course, begin with the necessary background on sluicing itself, which is given in section 1 with particular attention to the facts across Germanic.