Orthography, Standardization, and Register: the Case of Manding
Recommended publications
The Assignment of Grammatical Gender in German: Testing Optimal Gender Assignment Theory
Emma Charlotte Corteen, Trinity Hall, September 2018. This dissertation is submitted for the degree of Doctor of Philosophy. Abstract: The assignment of grammatical gender in German is a notoriously problematic phenomenon due to the apparent opacity of the gender assignment system (e.g. Comrie 1999: 461). Various models of German gender assignment have been proposed (e.g. Spitz 1965, Köpcke 1982, Corbett 1991, Wegener 1995), but none of these is able to account for all of the German data. This thesis investigates a relatively under-explored, recent approach to German gender assignment in the form of Optimal Gender Assignment Theory (OGAT), proposed by Rice (2006). Using the framework of Optimality Theory, OGAT claims that the form and meaning of a noun are of equal importance with respect to its gender. This is formally represented by the crucial equal ranking of all gender assignment constraints in a block of GENDER FEATURES, which is in turn ranked above a default markedness hierarchy *NEUTER » *FEMININE » *MASCULINE, which is based on category size. A key weakness of OGAT is that it does not specify what constitutes a valid GENDER FEATURES constraint. This means that, in theory, any constraint can be proposed ad hoc to ensure that an OGAT analysis yields the correct result. In order to prevent any constraints based on ‘postfactum rationalisations’ (Comrie 1999: 461) from being included in the investigation, the GENDER FEATURES constraints which have been proposed in the literature for German are assessed according to six criteria suggested by Enger (2009), which seek to determine whether there is independent evidence for a GENDER FEATURES constraint.
The Role of Grammatical Gender in Noun-Formation: a Diachronic Perspective from Norwegian
Philipp Conzett. 1. The relationship between gender and word-formation. According to Corbett (1991: 1) “[g]ender is the most puzzling of the grammatical categories”. In modern languages, however, gender is most often seen as nothing more than an abstract inherent classificatory feature of nouns that triggers agreement in associated words. Given this perspective of gender as a redundant category, the question arises of why it nonetheless is so persistent in a great number of languages. This question has been answered inter alia by referring to the identifying and disambiguating function gender can have in discourse (e.g., Corbett 1991: 320–321). In this article further evidence is provided for viewing grammatical gender (henceforth gender) as an integral part of Cognitive Grammar, more specifically the domain of word-formation. The relationship between grammatical gender and word-formation can be approached from (at least) two different angles. In literature dealing with gender assignment, characteristics of word-formation are often used as a base for assigning gender to nouns. This approach is presented in section 1.1. On the other hand, gender is described as a feature involved in the formation of new nouns. This perspective is introduced in 1.2. 1.1. Gender assignment based on word-formation. Regularities between the gender of nouns and their derivational morphology can be detected in a number of languages. Gender can be tied to overt or covert derivational features. The former type is usually realized by suffixation, e.g., Norwegian klok (adj.
Adapting MARBERT for Improved Arabic Dialect Identification
Adapting MARBERT for Improved Arabic Dialect Identification: Submission to the NADI 2021 Shared Task. Badr AlKhamissi∗ (Independent, badr [at] khamissi.com), Mohamed Gabr∗ (Microsoft EGDC, mohamed.gabr [at] microsoft.com), Muhammed ElNokrashy (Microsoft EGDC, muelnokr [at] microsoft.com), Khaled Essam (Mendel.ai, khaled.essam [at] mendel.ai). Abstract: In this paper, we tackle the Nuanced Arabic Dialect Identification (NADI) shared task (Abdul-Mageed et al., 2021) and demonstrate state-of-the-art results on all of its four subtasks. Tasks are to identify the geographic origin of short Dialectal (DA) and Modern Standard Arabic (MSA) utterances at the levels of both country and province. Our final model is an ensemble of variants built on top of MARBERT that achieves an F1-score of 34.03% for DA at the country-level development set, an improvement of 7.63% from previous work. 1 Introduction: [...] syntax, morphology, vocabulary, and even orthography. Dialects may be heavily influenced by previously dominant local languages. For example, Egyptian variants are influenced by the Coptic language, while Sudanese variants are influenced by the Nubian language. In this paper, we study the classification of such variants and describe our model that achieves state-of-the-art results on all of the four Nuanced Arabic Dialect Identification (NADI) subtasks (Abdul-Mageed et al., 2021). The task focuses on distinguishing both MSA and DA by their geographical origin at both the country and province levels. The data is a collection of tweets covering 100 provinces from 21 Arab countries.
Linguistics Development Team
Development Team. Principal Investigator: Prof. Pramod Pandey, Centre for Linguistics / SLL&CS, Jawaharlal Nehru University, New Delhi, Email: [email protected]. Paper Coordinator: Prof. K. S. Nagaraja, Department of Linguistics, Deccan College Post-Graduate Research Institute, Pune- 411006, [email protected]. Content Writer: Prof. K. S. Nagaraja. Content Reviewer: Prof H. S. Ananthanarayana, Retd Prof, Department of Linguistics, Osmania University, Hyderabad 500007. Paper : Historical and Comparative Linguistics. Linguistics Module : Indo-Aryan Language Family. Description of Module. Subject Name: Linguistics. Paper Name: Historical and Comparative Linguistics. Module Title: Indo-Aryan Language Family. Module ID: Lings_P7_M1. Quadrant 1 E-Text.
INDO-ARYAN LANGUAGE FAMILY
The Indo-Aryan migration theory proposes that the Indo-Aryans migrated from the Central Asian steppes into South Asia during the early part of the 2nd millennium BCE, bringing with them the Indo-Aryan languages. Migration by an Indo-European people was first hypothesized in the late 18th century, following the discovery of the Indo-European language family, when similarities between Western and Indian languages had been noted. Given these similarities, a single source or origin was proposed, which was diffused by migrations from some original homeland. This linguistic argument is supported by archaeological and anthropological research. Genetic research reveals that those migrations form part of a complex genetical puzzle on the origin and spread of the various components of the Indian population. Literary research reveals similarities between various, geographically distinct, Indo-Aryan historical cultures. The Indo-Aryan migrations started in approximately 1800 BCE, after the invention of the war chariot, and also brought Indo-Aryan languages into the Levant and possibly Inner Asia.
On Burgenland Croatian Isoglosses Peter
Dutch Contributions to the Fourteenth International Congress of Slavists, Ohrid: Linguistics (SSGL 34), Amsterdam – New York, Rodopi, 293-331. ON BURGENLAND CROATIAN ISOGLOSSES. PETER HOUTZAGERS. 1. Introduction. Among the Croatian dialects spoken in the Austrian province of Burgenland and the adjoining areas [1] all three main dialect groups of central South Slavic [2] are represented. However, the dialects have a considerable number of characteristics in common. [3] The usual explanation for this is (1) the fact that they have been neighbours from the 16th century, when the Ottoman invasions caused mass migrations from Croatia, Slavonia and Bosnia; (2) the assumption that at least most of them were already neighbours before that. Ad (1) Map 1 [4] shows the present-day and past situation in the Burgenland. The different varieties of Burgenland Croatian (henceforth “BC groups”) that are spoken nowadays and from which linguistic material is available each have their own icon. [5]
Footnotes:
[1] For the sake of brevity the term “Burgenland” in this paper will include the adjoining areas inside and outside Austria where speakers of Croatian dialects can or could be found: the province of Niederösterreich, the region around Bratislava in Slovakia, a small area in the south of Moravia (Czech Republic), the Hungarian side of the Austrian-Hungarian border and an area somewhat deeper into Hungary east of Sopron and between Bratislava and Győr. As can be seen from Map 1, many locations are very far from the Burgenland in the administrative sense.
[2] With this term I refer to the dialect continuum formerly known as “Serbo-Croatian”.
Graphemic Methods for Gender-Neutral
Graphemic Methods for Gender-Neutral Writing. Yannis Haralambous & Joseph Dichy. Abstract. In this paper we present a model and a classification of graphemic gender-neutral writing methods, we explore current practices in French, German, Greek, Italian, Portuguese and Spanish languages, and we investigate interactions between gender-neutral writing forms and regular expressions. 1. Introduction: The General Issue of, and behind, Gender-Neutral Writing. The issue behind gender-neutral writing is that of the representation of intergender relations carried by languages. What is at stake is the representation of equality, or not, between genders. The issue is also referred to as “inclusive writing,” which apparently refers to human rights, but does not cover all cases, i.e., lesbian, gay, bisexual and transgender persons. The term “Gender-Neutral Writing” is, in fact, both clearer and more inclusive. Generally speaking, languages are conservative, if not archaic. A significant example is that of the idea of time, which is traditionally represented as a dot sliding along a straight line in a continuous movement. This has been a philosophical image of time since ancient Greek philosophers and throughout the Middle Ages both in Arabic and European philosophy, but can no longer be considered as a valid representation after 20th century existentialist philosophers and Heidegger’s Sein und Zeit.
Yannis Haralambous, 0000-0003-1443-6115, IMT Atlantique & Lab-STICC UMR CNRS 6285, Brest, France, [email protected]. Joseph Dichy, Professor of Arabic linguistics, Lyon, France, [email protected]. Y. Haralambous (Ed.), Graphemics in the 21st Century.
Learning About Phonology and Orthography
EFFECTIVE LITERACY PRACTICES MODULE REFERENCE GUIDE: Learning About Phonology and Orthography
Module Focus: Learning about the relationships between the letters of written language and the sounds of spoken language (often referred to as letter-sound associations, graphophonics, sound-symbol relationships)
Definitions: phonology: study of speech sounds in a language; orthography: study of the system of written language (spelling); continuous text: a complete text or substantive part of a complete text
What Children Have to Learn: Children need to learn to work out how their spoken language relates to messages in print. They need to learn (Clay, 2002, 2006, p. 112)
• to hear sounds buried in words
• to visually discriminate the symbols we use in print
• to link single symbols and clusters of symbols with the sounds they represent
• that there are many exceptions and alternatives in our English system of putting sounds into print
Children also begin to work on relationships among things they already know, often long before the teacher attends to those relationships. For example, children discover that
• it is more efficient to work with larger chunks
• sometimes it is more efficient to work with relationships (like some word or word part I know)
• often it is more efficient to use a vague sense of a rule
How Children Learn About Phonology and Orthography:
Writing
• Building a known writing vocabulary
• Analyzing words by hearing and recording sounds in words
• Using known words and word parts to solve new unknown words
• Noticing and learning about exceptions in English orthography
Reading
• Building a known reading vocabulary
• Using known words and word parts to get to unknown words
• Taking words apart while reading
Manipulating Words and Word Parts
• Using magnetic letters to manipulate and explore words and word parts
Key Points for Teachers: Through reading and writing continuous text, children learn about sound-symbol relationships, they take on known reading and writing vocabularies, and they can use what they know about words to generate new learning.
Proposal for a Korean Script Root Zone LGR 1 General Information
(internal doc. #: klgp220_101f_proposal_korean_lgr-25jan18-en_v103.doc) Proposal for a Korean Script Root Zone LGR. LGR Version 1.0. Date: 2018-01-25. Document version: 1.03. Authors: Korean Script Generation Panel.
1 General Information/ Overview/ Abstract
The purpose of this document is to give an overview of the proposed Korean Script LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document below:
• proposal-korean-lgr-25jan18-en.xml
Labels for testing can be found in the accompanying text document below:
• korean-test-labels-25jan18-en.txt
In Section 3, we will see the background on Korean script (Hangul + Hanja) and principal language using it, i.e., Korean language. The overall development process and methodology will be reviewed in Section 4. The repertoire and variant groups in K-LGR will be discussed in Sections 5 and 6, respectively. In Section 7, Whole Label Evaluation Rules (WLE) will be described and then contributors for K-LGR are shown in Section 8. Several appendices are included with separate files.
2 Script for which the LGR is proposed
ISO 15924 Code: Kore
ISO 15924 Key Number: 287 (= 286 + 500)
ISO 15924 English Name: Korean (alias for Hangul + Han)
Native name of the script: 한글 + 한자
Maximal Starting Repertoire (MSR) version: MSR-2 [241]
Note.
Voicing Distinctions in the Dutch-German Dialect Continuum
Nina Ouddeken, Meertens Instituut. Linguistics in the Netherlands 2016, 106–120. This study investigates the phonetics and phonology of voicing distinctions in the Dutch-German dialect continuum, which forms a transition zone between voicing and aspiration systems. Two phonological approaches to represent this contrast exist in the literature: a [±voice] approach and Laryngeal Realism. The implementation of the change between the two language types in the transition zone will provide new insights in the nature of the phonological representation of the contrast. In this paper I will locate the transition zone by looking at phonetic overlap between VOT values of fortis and lenis plosives, and I will compare the two phonological approaches, showing that both face analytical problems as they cannot explain the variation observed in word-initial plosives and plosive clusters. Keywords: dialectology, phonology, phonetics, Laryngeal Realism, transition zones. 1. Introduction. Voicing distinctions in plosive systems have been studied extensively in phonology. Lisker & Abramson (1964) discovered that the phonetic realisations of ‘voiced’ and ‘voiceless’ plosives are not identical across languages: in word-initial position fortis and lenis plosives can be distinguished in different ways with respect to Voice Onset Time (VOT; describing the onset of vocal fold vibration relative to the moment of plosive release). Languages contrasting two plosive series typically contrast prevoiced plosives with plain voiceless plosives (voicing languages), or plain voiceless plosives with voiceless aspirated plosives (aspiration languages). The boundary between plain voiceless and aspirated plosives is placed around 20–35 msec (depending on Place of Articulation (PoA)) by Keating (1984):
Orthographies in Grammar Books
Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 30 July 2018 doi:10.20944/preprints201807.0565.v1 Tomislav Stojanov, [email protected], [email protected] Institute of Croatian Language and Linguistics, Republike Austrije 16, 10.000 Zagreb, Croatia. Orthographies in Grammar Books – Antiquity and Humanism. Summary: This paper researches the as yet unstudied topic of orthographic content in antique, medieval, and Renaissance grammar books in European languages, as part of a wider research of the origin of orthographic standards in European languages. As a central place for teachings about language, grammar books contained orthographic instructions from the very beginning, and such practice continued also in later periods. Understanding the function, content, and orthographic forms in the past provides for a better description of the nature of the orthographic standard in the present. The evolution of grammatographic practice clearly shows the continuity of development of orthographic content from a constituent of grammar studies through the littera unit gradually to an independent unit, then into annexed orthographic sections, and later into separate orthographic manuals. Five antique, 22 Latin, and 17 vernacular grammars were analyzed, describing 19 European languages. The research methodology is based on distinguishing orthographic content in the narrower sense (grapheme to meaning) from the broader sense (grapheme to phoneme). In this way, the function of orthographic description was established separately from the study of spelling. As for the traditional description of orthographic content in the broader sense in old grammar books, it is shown that orthographic content can also be studied within the grammatographic framework of a specific period, similar to the description of morphology or syntax.
Harmonizing the Orthography of Gĩkũyũ and Kĩkamba
THE HARMONIZATION AND STANDARDIZATION OF KENYAN LANGUAGES. CHAPTER THREE: HARMONIZING THE ORTHOGRAPHY OF GĨKŨYŨ AND KĨKAMBA. Angelina Nduku Kioko, Martin C. Njoroge and Peter Mburu Kuria. INTRODUCTION. The term orthography is derived from the Greek word ‘orthos’ which means ‘correct’, and ‘graphein’, which stands for ‘to write’ (Sampson, 1985). The orthography of a language describes or defines the set of symbols (graphemes and diacritics) used to represent the phonemic inventory of that language in writing and the rules on how to write these symbols. According to Massamba (1986), a language takes a limited number of sounds from the central pool of speech sounds to form its phonetic inventory. In this chapter, orthography is used to refer to the system of symbols used in the writing system of Gĩkũyũ and Kĩkamba. There are three types of orthographies (Read, 1983: 143-152). The first is the ‘phonemic orthography’. In a ‘phonemic’ orthography there is a one-to-one correspondence between phonemes and graphemes. This type of orthography has a dedicated symbol or sequence of symbols for each phoneme. Examples of languages that have phonemic orthographies are Korean and Kiswahili. The second type is the ‘morpho-phonemic orthography’ which considers both the phonemic features and the underlying structure of words. In this case, words may be written in the same way despite differences in pronunciation. For example, the pronunciation of the plural marker in English {s} is conditioned by the phonetic environment in which it occurs, yet it is written with the same grapheme <s>. The plural forms ‘cats’ and ‘dogs’ are pronounced as [kæts] and [dɒɡz] respectively although the two final sounds are written with the grapheme <s>.
English Spelling and Pronunciation
ISSN: 2456-8104 http://www.jrspelt.com Issue 5, Vol. 2, 2018. English Spelling and Pronunciation - A Brief Study. Prof. V. Chandra Sekhar Rao ([email protected]), Professor in English, SITECH, Hyderabad. Abstract: The present paper examines the correlation between the spelling and pronunciation of English words. English spelling is almost divorced from its pronunciation and there is no perfect guide to learning the pronunciation of the words. The letters of the alphabet are always inadequate to represent the sounds. The English alphabet contains only 26 letters, but the language has 44 sounds. IPA symbols are needed to understand the intelligibility of the pronunciation and the spelling-designed. Learners of the English language have to understand that words from other languages may be adopted without being adapted to the spelling system. Most of the letters of the English alphabet produce multiple pronunciations. An English Pronouncing Dictionary is needed for a better understanding of spelling and pronunciation. Keywords: Spelling and Pronunciation, Orthography, Intelligibility, Phonetic Symbols. Introduction. "If we know the sounds of a word (in English) we can't know how to spell it; if we know the spelling, we can't know how to pronounce it." (Otto Jespersen, philologist, Essentials of English Grammar, 1905, page 11). "English spelling is almost divorced from its pronunciation and forms hardly any guide as to how words should be pronounced." (Mont Follick, The Case for Spelling Reform, 1964, page 87). English, as a global language of communication, is spoken, written and used widely for many different purposes: international diplomatic relations, business, science and technology. It is also called the library language and is the medium of instruction in higher education: science and technology, computer and software engineering, medicine and law, pharmacy and nursing, commerce and management, fashion technology and so on.