Analogy in Formal Grammar

Total Page:16

File Type:pdf, Size:1020Kb

Analogy in Formal Grammar Analogy in formal grammar An der Philologischen Fakultät der Universität Leipzig eingereichte DISSERTATION zur Erlangung des akademischen Grades Doctor philosophiae (Dr. phil.) von: Matías Guzmán Naranjo Gutachter: Prof. Dr. Doris Schönefeld Prof. Dr. Olivier Bonami Erweitertes Komitee: Prof. Dr. Martin Haspelmath Datum der Einreichung: Datum der Verleihung: Selbstständigkeitserklärung und Erklärung über frühere Promotionsversuche Hiermit versichere ich, dass ich die vorliegende Arbeit ohne unzulässige Hilfe Dritter und ohne Benutzung anderer als der angegebenen Hilfsmittel angefertigt habe; die aus fremden Quellen direkt oder indirekt über- nommenen Gedanken sind als solche kenntlich gemacht. Ich habe nicht die Hilfe eines Promotionsberaters in Anspruch genommen. Die Arbeit wurde zuvor weder im Inland noch im Ausland in gleicher oder ähnlicher Form einer anderen In- stitution vorgelegt. Sie ist weder Bestandteil eines ruhenden Verfahrens noch wurde sie in einem gleichartigen Promotionsverfahren für endgültig nicht bestanden erklärt. Die Arbeit ist vorher auch noch nicht veröffentlicht worden. Ich erkläre, dass ich bisher noch keine Promotionsversuche unternommen habe. Ort, Datum -------------- Unterschrift -------------- Matías Guzmán Naranjo Acknowledgements I want to thank, in no particular order, Doris Schönefeld, Olivier Bonami, and Martin Haspel- math for their support and guidance writing this dissertation. I would also like to thank Laura Becker for her generous help during various stages of my writing of this book. I also want to mention all other IGRA students, with whom I had the change to discuss several of the ideas presented here, in particular Joanna Zaleska and Ludger Paschen. I also want to thank the IGRA program for all the opportunities to improve myself through courses and conferences. I also thank Farrel Ackerman, Tapani Salminen, John Newman, Manfred Sailer, Stefan Müller, and Alexandr Rosen, for their comments, criticisms and advice. i Contents 1 Introduction 1 I Analogies 3 2 Remarks on analogy 5 2.1 The many meanings of analogy .......................... 5 2.1.1 Single case analogy ............................. 5 2.1.2 Proportional analogies ........................... 8 2.1.3 Analogical classifiers ............................ 11 2.1.4 Summing up ................................. 14 2.2 The mechanism for analogy ............................ 15 2.2.1 Simple rules ................................. 15 2.2.2 Schemata .................................. 18 2.2.3 Multiple-rule systems ........................... 19 2.2.4 Neural networks and Analogical Modelling ............... 23 2.2.5 Analogy or rules .............................. 24 2.2.6 Mental representations vs grammatical relations ............ 27 2.2.7 Summing up ................................. 28 2.3 Missing pieces .................................... 29 2.4 Final considerations ................................. 31 3 Modelling analogy in grammar 33 3.1 Basic assumptions .................................. 33 3.1.1 Feature structures .............................. 33 3.1.2 Type hierarchies .............................. 34 3.2 Analogy as type constraints ............................ 38 3.2.1 Analogy is categorical ........................... 39 3.2.2 Analogy runs through the hierarchy ................... 41 3.3 The (semi-)formal model .............................. 42 ii 3.4 Final remarks .................................... 48 II Case studies 49 4 Methodological notes 51 4.1 On the general methodology ............................ 51 4.2 Statistical models and methodology ........................ 52 4.3 Analogical models using neural networks ..................... 54 4.4 Clustering and distances between classes ..................... 64 4.5 Summing up ..................................... 67 5 Gender systems 69 5.1 Masculine-Feminine syncretism: Latin ...................... 71 5.1.1 The Latin third declension ......................... 71 5.1.2 Data ..................................... 72 5.1.3 Results .................................... 73 5.2 Gender vs inflection class: Romanian ....................... 76 5.2.1 The Romanian gender and plural system ................. 76 5.2.2 Modelling the system ............................ 80 5.2.3 Data ..................................... 86 5.2.4 Results .................................... 87 5.3 Interim conclusion .................................. 97 6 Hybrid classes 99 6.1 Overabundant inflection: Croatian singular instrumental ............ 99 6.1.1 Modelling the system ............................ 101 6.1.2 Materials .................................. 102 6.1.3 Results .................................... 102 6.2 Frequency and analogical similarity: Russian diminutives ............ 105 6.2.1 Russian diminutives ............................ 105 6.2.2 Modelling the system ............................ 107 6.2.3 The dataset ................................. 108 6.2.4 Results .................................... 109 6.3 Interim conclusion .................................. 111 7 Morphological processes and analogy 113 7.1 Prefixes and gender: Swahili noun classes ..................... 113 7.1.1 Materials .................................. 116 iii 7.1.2 Results .................................... 117 7.2 Prefixes and inflection classes: Eastern Highland Otomi ............. 120 7.2.1 Verb classes in Eastern Highland Otomi ................. 120 7.2.2 Materials .................................. 120 7.2.3 Results .................................... 122 7.3 Stem changing processes: Hausa plural classes .................. 123 7.3.1 The Hausa plural system .......................... 123 7.3.2 Materials .................................. 125 7.3.3 Results .................................... 125 7.4 Interim conclusion .................................. 129 8 Complex inflectional classes 132 8.1 Multiple inheritance and cross-hierarchies: Spanish verbal inflection ..... 132 8.1.1 Spanish inflection classes ......................... 132 8.1.2 Previous takes on the Spanish verbal system ............... 136 8.1.3 Modelling the system ............................ 138 8.1.4 Materials .................................. 145 8.1.5 Results .................................... 145 8.2 Cross-classifications between plural and singular: Kasem ............ 157 8.2.1 ATR in Kasem ................................ 159 8.2.2 A simple analysis of Kasem noun classes ................. 160 8.2.3 Materials .................................. 172 8.2.4 Modelling the system ............................ 173 8.2.5 Methodological considerations ...................... 178 8.2.6 Results .................................... 179 8.3 Interim Conclusion ................................. 199 9 Conclusions 201 9.1 The path forward .................................. 201 9.1.1 The limits of analogy ............................ 201 9.1.2 Analogical classifiers or proportional analogies ............. 201 9.1.3 The features of analogy .......................... 202 9.1.4 Coverage .................................. 202 9.2 Final considerations ................................. 202 iv List of Figures 4.1 Representation of Model 1. ............................. 57 4.2 Representation of Model 2. ............................. 62 4.3 Dendrogram based on correlation distances for Model 3. ............ 66 4.4 MDS based on correlation distances for Model 3. ................. 67 5.1 Clustering analysis for Latin gender assignment. ................. 74 5.2 Class frequency. ................................... 88 5.3 Additive (left) and substractive (right) accuracy and kappa scores for themodel predicting plural in Romanian. .......................... 92 5.4 Heat map for the model predicting inflection class in Romanian. ........ 93 5.5 Clustering analysis of singular-plural class in Romanian. ............ 94 5.6 Heat map for the model predicting inflection class by gender in Romanian. .. 95 5.7 Clustering analysis of gender and class in Romanian. .............. 96 6.1 Additive (left) and substractive (right) accuracy and kappa scores for the model predicting Croatian instrumental. ..................... 103 6.2 Proportion of confusion between classes in Croatian. .............. 104 6.3 Type frequency of Russian suffix classes. ..................... 109 6.4 Additive (left) and sbustractive (right) accuracy and kappa scores for themodel predicting Russian diminutives. .......................... 110 6.5 Proportion of confusion between classes in Russian. ............... 111 7.1 Type frequency of Swahili genders. ........................ 116 7.2 Additive (left) and substractive (right) accuracy and kappa scores for the model predicting gender in Swahili. ........................ 119 7.3 Additive (left) and substractive (right) accuracy and kappa scores for the model predicting inflection class. .......................... 123 7.4 Type frequency of macro-classes in Hausa. .................... 126 7.5 Heatmap for the model predicting plural forms in Hausa. ............ 127 v 7.6 Additive (left) and substractive (right) accuracy and kappa scores for thestruc- turally defined model. ................................ 128 7.7 Additive (left) and substractive (right) accuracy and kappa scores for thenon- structurally defined model. ............................. 129 8.1 Heat map for the model predicting inflection class of Spanish verbs. ...... 150 8.2 Overall validation for the model predicting inflection class of Spanish verbs. 152 8.3 Multidimensional scaling with hierarchical
Recommended publications
  • Language Group Specific Informational Reports
    Rhode Island College M.Ed. In TESL Program Language Group Specific Informational Reports Produced by Graduate Students in the M.Ed. In TESL Program In the Feinstein School of Education and Human Development Language Group: Romanian Author: Jean Civil Program Contact Person: Nancy Cloud ([email protected]) RHODE ISLAND COLLEGE TESL 539-01: ROMANIAN LANGUAGE Student: Jean Ocelin Civil Prof: Nancy Cloud Spring 2010 Introduction Thousands of languages, dialects, creoles and pidgins are spoken worldwide. Some people, endowed by either an integrative or extrinsic motivation, want to be bilingual, trilingual or multilingual. So, the interlanguage interference becomes unavoidable. Those bilingual individuals are omnipresent in the State of Rhode Island. As prospective ESL teachers, our job requirement is to help them to achieve English proficiency. Knowing the interference problems attributable to their native language is the sine qua none pre- condition to helping them. However, you may have trouble understanding Romanian native speakers due to communication barriers here in the State of Rhode Island. The nature of our research is to use a Contrastive Analysis Approach to the Romanian language, so we can inquire about their predicted errors. In the following PowerPoint presentation, we will put emphasis specifically on phonology, grammar, communication style, and semantic problems. Romanian History 1. Before 106 AD, the Dacians lived in Romanian territory. They spoke Thracian tongue. 2. 106 AD, the defeat of the Dacians, (an indo-European people), led to a period of intense Romanization. A vulgar Latin became the language of commerce and administration. Thracian and Latin combined gave birth to Romanian Language. 3.
    [Show full text]
  • Romanian Timebank: an Annotated Parallel Corpus for Temporal Information
    Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information Corina Forăscu1,2, Dan Tufiş1 1 Research Institute in Artificial Intelligence, Romanian Academy, Bucharest 1,2 Faculty of Computer Science, Alexandru Ioan Cuza University of Iași, Romania 16, General Berthelot, Iasi 700483, Romania E-mail: [email protected], [email protected] Abstract The paper describes the main steps for the construction, annotation and validation of the Romanian version of the TimeBank corpus. Starting from the English TimeBank corpus – the reference annotated corpus in the temporal domain, we have translated all the 183 English news texts into Romanian and mapped the English annotations onto Romanian, with a success rate of 96.53%. Based on ISO-Time - the emerging standard for representing temporal information, which includes many of the previous annotations schemes -, we have evaluated the automatic transfer onto Romanian and, and, when necessary, corrected the Romanian annotations so that in the end we obtained a 99.18% transfer rate for the TimeML annotations. In very few cases, due to language peculiarities, some original annotations could not be transferred. For the portability of the temporal annotation standard to Romanian, we suggested some additions for the ISO-Time standard, concerning especially the EVENT tag, based on linguistic evidence, the Romanian grammar, and also on the localisations of TimeML to other Romance languages. Future improvements to the Ro-TimeBank will take into consideration all temporal expressions, signals and events in texts, even those with a not very clear temporal anchoring. Keywords: temporal information, annotation standards, Romanian language expressions – references to a calendar or clock system, 1.
    [Show full text]
  • A Pedagogical Perspective on the Definite and the Indefinite Article in the Romanian Language
    Ovidiu Ivancu Institute of Romanian Language, Bucharest, Caransebeș 1, Bucharest Vilnius University, University 5, Vilnius [email protected] Research interests: cultural studies, literature, linguistics, theory of literature A Pedagogical Perspective on the Definite and the Indefinite Article in the Romanian Language. Challenges for Foreign Learners Abstract All Romance languages have developed the definite and the indefinite article via the Vulgar Latin (Classical Latin did not use articles), the language of the Roman colonists. According to Joseph H. Greenberg (1978), the definite article predated the indefinite one by approximately two centuries, being developed from demonstratives through a complex process of grammaticalization. Many areas of nowadays` Romania were incorporated into the Roman Empire for about 170 years. After two military campaign, the Roman emperor Trajan conquered Dacia, east of Danube. The Romans imposed their own administration and inforced Latin as lingua franca. The language of the colonists, mixed with the native language and, later on, with various languages spoken by the many migrant populations that followed the Roman retreat resulted in a new language (Romanian), of Latin origins. The Romanian language, attested in the 16th Century, in documents written by foreign travellers, uses four different types of articles. Being a highly inflected language, Romanian changes the form of the articles according to the gender, the number and the case of the noun As compared to the other Romance languages, Romanian uses the definite article enclitically. Thus, the definite article and the noun constitute a single word. The present paper aims at discussing, analysing and providing an overview of the use of definite and indefinite articles.
    [Show full text]
  • Romanian; an Essential Grammar
    Romanian Now in its second edition, Romanian: An Essential Grammar is a concise, user- friendly guide to modern Romanian. It takes the student through the essentials of the language, explaining each concept clearly and providing many examples of contemporary Romanian usage. This fully revised second edition contains: • a chapter of each of the most common grammatical areas with Romanian and English examples • extensive examples of the more difficult areas of the grammar • a section with exercises to consolidate the learning and the answer key • a list of useful verbs • an appendix listing useful websites for further information • a glossary of grammatical terms used in the book • a useful bibliographical list Suitable for both classroom use and independent study, this book is ideal for beginner to intermediate students. Ramona Gönczöl is a lecturer in Romanian language at the School of Slavonic and East European Studies, University College London. She is co-author, with Dennis Deletant, of Colloquial Romanian, 4th edition, Routledge, 2012. Ramona’s research interests include language acquisition, learner experience, online learning, sociolinguistics, psycholinguistics, cultural identities and language contact. Routledge Essential Grammars Essential Grammars describe clearly and succinctly the core rules of each language and are up-to-date and practical reference guides to the most important aspects of languages used by contemporary native speakers. They are designed for elementary to intermediate learners and present an accessible description of the language, focusing on the real patterns of use today. Essential Grammars are a reference source for the learner and user of the language, irrespective of level, setting out the complexities of the language in short, readable sections that are clear and free from jargon.
    [Show full text]
  • (Romanian) on L2/L3 Learning (Catalan/Spanish) Simona Popa
    Nom/Logotip de la Universitat on s’ha llegit la tesi Language transfer in second language acquisition. Some effects of L1 instruction (Romanian) on L2/L3 learning (Catalan/Spanish) Simona Popa http://hdl.handle.net/10803/380548 ADVERTIMENT. L'accés als continguts d'aquesta tesi doctoral i la seva utilització ha de respectar els drets de la persona autora. Pot ser utilitzada per a consulta o estudi personal, així com en activitats o materials d'investigació i docència en els termes establerts a l'art. 32 del Text Refós de la Llei de Propietat Intel·lectual (RDL 1/1996). Per altres utilitzacions es requereix l'autorització prèvia i expressa de la persona autora. En qualsevol cas, en la utilització dels seus continguts caldrà indicar de forma clara el nom i cognoms de la persona autora i el títol de la tesi doctoral. No s'autoritza la seva reproducció o altres formes d'explotació efectuades amb finalitats de lucre ni la seva comunicació pública des d'un lloc aliè al servei TDX. Tampoc s'autoritza la presentació del seu contingut en una finestra o marc aliè a TDX (framing). Aquesta reserva de drets afecta tant als continguts de la tesi com als seus resums i índexs. ADVERTENCIA. El acceso a los contenidos de esta tesis doctoral y su utilización debe respetar los derechos de la persona autora. Puede ser utilizada para consulta o estudio personal, así como en actividades o materiales de investigación y docencia en los términos establecidos en el art. 32 del Texto Refundido de la Ley de Propiedad Intelectual (RDL 1/1996).
    [Show full text]
  • Romanian Grammar
    1 Cojocaru Romanian Grammar 0. INTRODUCTION 0.1. Romania and the Romanians 0.2. The Romanian language 1. ALPHABET AND PHONETICS 1.1. The Romanian alphabet 1.2. Potential difficulties related to pronunciation and reading 1.2.1. Pronunciation 1.2.1.1. Vowels [ ǝ ] and [y] 1.2.1.2. Consonants [r], [t] and [d] 1.2.2. Reading 1.2.2.1. Unique letters 1.2.2.2. The letter i in final position 1.2.2.3. The letter e in the initial position 1.2.2.4. The ce, ci, ge, gi, che, chi, ghe, ghi groups 1.2.2.5. Diphthongs and triphthongs 1.2.2.6. Vowels in hiatus 1.2.2.7. Stress 1.2.2.8. Liaison 2. MORPHOPHONEMICS 2.1. Inflection 2.1.1. Declension of nominals 2.1.2. Conjugation of verbs 2.1.3. Invariable parts of speech 2.2. Common morphophonemic alternations 2.2.1. Vowel mutations 2.2.1.1. the o/oa mutation 2.2.1.2. the e/ea mutation 2.2.1.3. the ă/e mutation 2.2.1.4. the a/e mutation 2.2.1.5. the a/ă mutation 2.2.1.6. the ea/e mutation 2.2.1.7. the oa/o mutation 2.2.1.8. the ie/ia mutation 2.2.1.9. the â/i mutation 2.2.1.10. the a/ă mutation 2.2.1.11. the u/o mutation 2.2.2. Consonant mutations 2.2.2.1. the c/ce or ci mutation 2.2.2.2.
    [Show full text]
  • 'Until' and 'Before' Clauses: Zooming in on Linguistic Diversity
    BALTIC LINGUISTICS ISSN 2081- 9 (2018), 141–2 ‘As long as’, ‘until’ and ‘before’ clauses: Zooming in on linguistic diversity Be W Stockholm University This paper investigates ‘before’, ‘until’ and ‘as long as’ clauses in the Baltic languages in their wider areal and genealogical context in a sample of 2 modern and ancient doculects of European and Indo-European languages. In a bottom-up construction of the semantic map of ‘before’, ‘until’ and ‘as long as’ connectors from parallel text data, a fourth cluster intermediate between ‘before’ and ‘until’ with negative main clauses is identified. The typology resulting from the different overlaps of clusters locates Baltic languages in an intermediate zone between Western, Eastern, and Northern European languages. This goes hand-in-hand with a high diversity of Baltic languages in their typology of ‘before’, ‘until’ and ‘as long as’ clauses. The temporal connectors found in Baltic varieties can be classified according to whether they originate from strategies expressing temporal identity (simultaneity) or non-identity (non-simultaneity). Many connectors in Baltic derive from correlative constructions and originally express iden- tity, but can then shift from simultaneity towards posteriority as they gradually lose their association with correlative constructions. Since temporal clauses are never at- emporal and are hence incompatible with permanent states and since negation often expresses permanent states, negation—a marker of non-identity—is prone to develop non-polarity functions in ‘before’ and ‘until’ clauses. The Baltic and Slavic languages are rich in various kinds of expanded negation (translation equivalents in other languages lack negation) and expletive negation (negation does not have the function of expressing negative polarity) in ‘before’ and ‘until’ clauses.
    [Show full text]
  • The Hungarian Language in Education in Romania
    The Hungarian language in education in Romania European Research Centre on Multilingualism and Language Learning hosted by HUNGARIAN The Hungarian language in education in Romania c/o Fryske Akademy Doelestrjitte 8 P.O. Box 54 NL-8900 AB Ljouwert/Leeuwarden The Netherlands T 0031 (0) 58 - 234 3027 W www.mercator-research.eu E [email protected] | Regional dossiers series | tca r cum n n i- ual e : Available in this series: This document was published by the Mercator European Research Centre on Multilingualism Albanian; the Albanian language in education in Italy Aragonese; the Aragonese language in education in Spain and Language Learning with financial support from the Fryske Akademy and the Province Asturian; the Asturian language in education in Spain (2nd ed.) of Fryslân. Basque; the Basque language in education in France (2nd ed.) Basque; the Basque language in education in Spain (2nd ed.) Breton; the Breton language in education in France (2nd ed.) Catalan; the Catalan language in education in France Catalan; the Catalan language in education in Spain (2nd ed.) © Mercator European Research Centre on Multilingualism Cornish; the Cornish language in education in the UK (2nd ed.) Corsican; the Corsican language in education in France (2nd ed.) and Language Learning, 2019 Croatian; the Croatian language in education in Austria Danish; The Danish language in education in Germany ISSN: 1570 – 1239 Frisian; the Frisian language in education in the Netherlands (4th ed.) Friulian; the Friulian language in education in Italy The contents of this dossier may be reproduced in print, except for commercial purposes, Gàidhlig; The Gaelic Language in Education in Scotland (2nd ed.) Galician; the Galician language in education in Spain (2nd ed.) provided that the extract is proceeded by a complete reference to the Mercator European German; the German language in education in Alsace, France (2nd ed.) Research Centre on Multilingualism and Language Learning.
    [Show full text]
  • Lexical-Functional Grammar∗
    Lexical-Functional Grammar∗ Ash Asudeh & Ida Toivonen August 10, 2009 [19:17] CORRECTED FINAL DRAFT To appear in Bernd Heine and Heiko Narrog, eds., The Oxford Handbook of Linguistic Analysis. Oxford: Oxford University Press. Contents 1 Introduction 1 2 C-structure and F-structure 2 2.1 C-structure ........................................ 3 2.2 F-structure ......................................... 4 3 Structures and Structural Descriptions 8 3.1 Constraints on C-structures ................................ 8 3.2 Constraints on F-structures ................................ 10 4 The C-structure to F-structure Correspondence 15 4.1 How the C-structure to F-structure Mapping Works ................... 15 4.2 Flexibility in Mapping .................................. 18 5 The Correspondence Architecture 19 6 Some Recent Developments 20 7 Concluding Remarks 21 A Example: Unbounded Dependency, Adjuncts, Raising, Control 23 References 25 ∗This work was supported by the Social Sciences and Humanities Research Council of Canada, Standard Research Grant 410-2006-1650. We would like to thank Mary Dalrymple, Bernd Heine, Daniela Isac, Helge Lødrup, Kumiko Murasugi, Heiko Narrog, Kenji Oda, Chris Potts, and Nigel Vincent for helpful comments and suggestions. We would also like to thank Daniela Isac and Cristina Moldoveanu for checking the Romanian data. Any remaining errors are our own. i Abstract Lexical-Functional Grammar (LFG) is a lexicalist, declarative (non-transformational), constraint- based theory of generative grammar. LFG has a detailed, industrial-strength computational imple- mentation. The theory has also proven useful for descriptive/documentary linguistics. The gram- matical architecture of LFG, sometimes called the ‘Correspondence Architecture’, posits that dif- ferent kinds of linguistic information are modelled by distinct, simultaneously present grammatical structures, each having its own formal representation.
    [Show full text]
  • Free Foreign Language Teacher Resources from the Language Resource Centers
    Free Foreign Language Teacher Resources from the Common LRC Website Language Resource Centers http://www.nflrc.org Download a PDF with hotlinks from the LRC Common Website at http://www.nflrc.org/lrc_broc_full.pdf Visit the LRC Common Website for regular updates and additional information. Visit the LRC Common Website for information on summer workshops, institutes, and scholarship opportunities Last updated 09/2013 Free Resources from Language Resource Centers For updated information visit the LRC common website at www.nflrc.org Page 1 of 26 Entries marked with * are new for 2012-2013 / Entries marked with ** have been expanded in 2012-2013 Teacher Guides & Tools – General Alphabet Charts Full color alphabet charts with transcription and sound examples, IPA symbols, and letter CeLCAR http://iub.edu/~celcar/language_informat names for Azerbaijani, Dari, Kazakh, Kyrgyz, Mongolian, Pashto, Tajiki, Turkmen, Uyghur, ional_materials.php Uzbek. Arabic Online Corpus Online corpus mostly from the Arabic press and search tools that provide students, teachers, NMELRC http://nmelrc.org/online-arabic-corpus and material developers powerful tools to access authentic language in context. Bringing the Standards to the Handbook for teachers on how to implement the standards in their classrooms. NFLRC – K12 archived Classroom: A Teacher’s Guide http://web.archive.org/web/2010110314 5446/http://nflrc.iastate.edu/pubs/standa rds/standards.html **California Subject Examinations for These materials attempt to provide CSET-Arabic takers with basic review notes and practice LARC http://larc.sdsu.edu/downloads/CSET/CSE Teachers (CSET): Arabic Language questions covering solely domains in Subtest I: General Linguistics and Linguistics of the TArabicLanguagePowerPoint.pdf Preparation Material Target Language, and Literary and Cultural Texts and Traditions.
    [Show full text]
  • Most Vs. the Most in Languages Where the More Means Most Elizabeth Coppock Boston University University of Gothenburg Linnea Strand University of Gothenburg
    Chapter 12 Most vs. the most in languages where the more means most Elizabeth Coppock Boston University University of Gothenburg Linnea Strand University of Gothenburg This paper focuses on languages in which a superlative interpretation is typically indicated merely by a combination of a definiteness marker with a comparative marker, including French, Spanish, Italian, Romanian, and Greek (def+cmp lan- guages). Despite ostensibly using definiteness markers to form the superlative, su- perlatives are not always definite-marked in these languages, and the distribution of definiteness-marking varies across languages. Constituency structure appears to vary across languages as well. To account for these patterns of variation, we iden- tify conflicting pressures that all of the languages in consideration may besubject to, and suggest that different languages prioritize differently in the resolution of these conflicts. What these languages have in common, we suggest, is amecha- nism of Definite Null Instantiation for the degree-type standard argument ofthe comparative. Among the parameters along which languages are proposed to differ is the relative importance of marking uniqueness vs. avoiding determiners with predicates of entities that are not individuals. 1 Introduction In French, placing a definite article before a comparative adjective, as in(1), suf- fices to produce a superlative interpretation: Elizabeth Coppock & Linnea Strand. 2019. Most vs. the most in languages where the more means most. In Ana Aguilar-Guevara, Julia Pozas Loyo & Violeta Vázquez-Rojas Maldonado (eds.), Definiteness across languages, 371–417. Berlin: Language Science Press. DOI:10.5281/zenodo.3252030 Elizabeth Coppock & Linnea Strand (1) Elle est la plus grande. (French) she is the cmp tall ‘She is the tallest.’ French is not alone; other Romance languages, as well as Modern Greek, Mal- tese and others, make do with the same limited resources.
    [Show full text]
  • The Loss of Case Inflection in Bulgarian and Macedonian
    SLAVICA HELSINGIENSIA 47 THE LOSS OF CASE INFLECTION IN BULGARIAN AND MACEDONIAN Max Wahlström HELSINKI 2015 SLAVICA HELSINGIENSIA 47 Series editors Tomi Huttunen, Jouko Lindstedt, Ahti Nikunlassi Published by: Department of Modern Languages P.O. Box 24 (Unioninkatu 40 B) 00014 University of Helsinki Finland Copyright © by Max Wahlström ISBN 978-951-51-1185-2 (paperback) ISBN 978-951-51-1186-9 (PDF) ISSN-L 0780-3281, ISSN 0780-3281 (Print), ISSN 1799-5779 (Online) Summary Case inflection, characteristic of Slavic languages, was lost in Bulgarian and Macedonian approximately between the 11th and 16th centuries. My doctoral dissertation examines the process of this language change and sets out to find its causes and evaluate its consequences. In the earlier research literature, the case loss has been attributed either to language contacts or language internal sound changes, yet none of the theories based on a single explaining factor has proven satisfactory. In this study, I argue that the previous researchers of the Late Medieval manuscripts have often tried to date changes in the language earlier than what is plausible in light of the textual evidence. Also, I propose that the high number of second language speakers is among the key factors that reduced the number of morphological categories in the language, but, at the same time, several minor developments related to the case loss—for instance, in the marking of possession—are likely to result from a specific contact mechanism known as the Balkan linguistic area. My main methodological argument is that the study of language contacts must take into account a general typological perspective to determine the uniqueness of the suspected contact-induced changes.
    [Show full text]