Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch
Year: 2013
Object-conditioned differential marking in Chintang and Nepali
Schikowski, Robert
Abstract: This study deals with two syntactic alternation patterns found in two languages of Nepal: differential object marking (DOM) in Nepali (Indo-European > Indo-Aryan) and differential objectin- dexing (DOI) linked to differential agent marking in Chintang (Tibeto-Burman > Kiranti). The aim of the study is to conduct in-depth analyses of these two phenomena based on corpus annotations and including necessary methodological innovations, as well as to compare them in order to reach conclusions about alternations conditioned by properties of objects in general. The investigation of Nepali DOM showed that while this phenomenon is formally simple, it is functionally highly complex. It is condi- tioned by a plethora of properties of the object, many of which had not been noticed before, such as animacy, specificity, quantifiability, topicality, part of speech, focus, affectedness, and role ambiguity. A rule-based approach can neither cover the full range of variation found in this area in natural language nor can it model DOM in a conceptually sound way. For these reasons, a probabilistic analysis was conducted in the form of logistic regression and shown to be superior to the rule-based analysis. By con- trast, Chintang DOI turned out to be functionally simple in being basically conditioned by a single factor (specificity) but to be formally intertwined in complex ways with many other areas of morphosyntax such as valency classes, non-finite clauses, raising, and marking of aspect. In addition, it could be shownthat quantifiability is a pre-condition for specificity and that these two properties interact in various ways in Chintang. Most of the differences between Nepali DOM and Chintang DOI can be traced backto fundamental differences between case marking and agreement. The general picture emerging is thatthe main function of DOM is to mark objects with unusual properties, whereas DOI marks objects that are easy to track in discourse.
Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-85666 Dissertation
Originally published at: Schikowski, Robert. Object-conditioned differential marking in Chintang and Nepali. 2013, University of Zurich, Faculty of Arts. Object-conditioned differential marking in Chintang and Nepali
Abhandlung zur Erlangung der Doktorwürde der Philosophischen Fakultät der Universität Zürich
vorgelegt von Robert Schikowski
Angenommen im Frühjahrssemester 2013 auf Antrag der Promotionskommission: Prof. Dr. Balthasar Bickel (hauptverantwortliche Betreuungsperson) Prof. Dr. Fernando Zûñiga
Zürich 2013 Contents
Anowledgements v
Conventions vii 0.1 Transcription ...... vii 0.2 Interlinearisation ...... x 0.3 Abbreviations ...... xi 0.4 Sources of data ...... xiii
1 Preliminary considerations 1 1.1 A quick introduction to the problem ...... 1 1.2 Some basic definitions ...... 2 1.2.1 Valency ...... 2 1.2.2 Roles ...... 4 1.2.3 Grammatical relation ...... 7 1.2.4 Verb class and frames ...... 8 1.3 Object-conditioned differential marking ...... 10 1.3.1 Differential marking in general ...... 10 1.3.2 Differential marking of and conditioned by arguments ...... 11 1.3.3 Differential marking conditioned by objects ...... 13 1.4 Analytical questions ...... 15 1.4.1 Description versus explanation ...... 15 1.4.2 Functions versus conditions ...... 16 1.4.3 Modelling grammatical decisions ...... 17
2 Chintang: S/A detransitivisation 21 2.1 Language background ...... 21 2.2 Overview of relevant morphology ...... 23 2.2.1 Parts of speech ...... 23 2.2.2 Nominal morphology ...... 24 2.2.3 Verbal morphology ...... 26 2.3 Overview of relevant syntax ...... 28 2.3.1 Valency and basic frames ...... 28 2.3.2 Word order ...... 29 2.3.3 Frames and classes ...... 30 2.3.4 Differential marking ...... 32 2.3.5 Raising of case and agreement ...... 35 2.4 Formal properties of S/A detransitivisation ...... 38 2.4.1 S/A detransitivisation as differential framing ...... 38 2.4.2 Arguments selected by S/A detransitivisation ...... 41 2.4.3 Syntactic independence of detransitivised objects ...... 45 2.4.4 S/A detransitivisation in complex sentences ...... 49 2.5 Functional preliminaries: Identifying referents ...... 57
i CONTENTS
2.5.1 Introduction ...... 57 2.5.2 e identification process ...... 58 2.5.3 Definiteness ...... 62 2.5.4 Specificity ...... 64 2.5.5 e basis of unique identifiability ...... 67 2.6 Functional properties of S/A detransitivisation ...... 68 2.6.1 antifiability ...... 68 2.6.2 Specificity and arbitrary reference ...... 69 2.6.3 antifiability in detail ...... 74 2.6.4 Interaction with other factors ...... 84 2.6.5 Conventionalisation ...... 91 2.6.6 Some irrelevant variables ...... 101 2.7 antitative analysis based on corpus data ...... 105 2.7.1 Introduction ...... 105 2.7.2 Syntactic annotation and primary variables ...... 105 2.7.3 e centrality of quantifiability ...... 107 2.7.4 e role of exceptions ...... 109 2.8 Summary ...... 110 2.9 S/A detransitivisation in other Kiranti languages ...... 111 2.9.1 Overview ...... 111 2.9.2 Limbu ...... 113 2.9.3 Yakkha ...... 115 2.9.4 Athpare ...... 115 2.9.5 Belhare ...... 116 2.9.6 Chiling ...... 116 2.9.7 Bantawa ...... 117 2.9.8 Puma ...... 117 2.9.9 Summary ...... 119
3 Nepali: Differential A and O marking 121 3.1 Language background ...... 121 3.2 Overview of relevant morphology ...... 122 3.2.1 Parts of speech ...... 122 3.2.2 Nominal morphology ...... 122 3.2.3 Verbal morphology ...... 124 3.3 Overview of relevant syntax ...... 125 3.3.1 Word order ...... 125 3.3.2 Frames and classes ...... 125 3.3.3 Differential marking and valency manipulation ...... 130 3.4 Formal properties of DOM ...... 132 3.4.1 DOM as an isolated paern ...... 132 3.4.2 Arguments selected by DOM ...... 133 3.4.3 Position of the marker ...... 135 3.4.4 Double datives ...... 136 3.4.5 e question of incorporation ...... 138 3.4.6 DOM and agreement ...... 140 3.4.7 DOM in complex predicates ...... 141 3.5 Functional properties of DOM ...... 144 3.5.1 Uses of the dative ...... 144 3.5.2 Literature review ...... 146 3.5.3 Animacy ...... 148 3.5.4 Specificity ...... 150 3.5.5 antifiability ...... 153 3.5.6 Interplay of animacy and specificity ...... 154 ii CONTENTS
3.5.7 Topicality ...... 156 3.5.8 DOM with demonstratives and pronouns ...... 159 3.5.9 DOM with proper nouns ...... 161 3.5.10 Modification ...... 162 3.5.11 Unexpectedness ...... 163 3.5.12 Disambiguation ...... 168 3.5.13 Affectedness ...... 170 3.5.14 Some irrelevant variables ...... 171 3.5.15 One form, one function? ...... 173 3.6 antitative analysis based on corpus data ...... 174 3.6.1 Introduction ...... 174 3.6.2 Syntactic annotation and primary variables ...... 175 3.6.3 Calculation of secondary variables ...... 178 3.6.4 Impact of individual variables ...... 179 3.6.5 Interplay of variables ...... 197 3.6.6 Predicting DOM by probability and rules ...... 203 3.7 Some notes on the history of DOM ...... 208 3.7.1 e appearance of -lai ...... 208 3.7.2 An etymology of -lai ...... 211 3.8 Summary ...... 213 3.9 DOM in other Indo-Aryan languages ...... 215 3.9.1 Overview ...... 215 3.9.2 Panjabi ...... 218 3.9.3 Kumauni ...... 219 3.9.4 Maithili ...... 220 3.9.5 Bhojpuri ...... 222 3.9.6 Hindi-Urdu ...... 223 3.9.7 Bengali ...... 226 3.9.8 Gujarati ...... 227 3.9.9 Oriya ...... 229 3.9.10 Marathi ...... 230 3.9.11 Sinhala ...... 231 3.9.12 Summary ...... 233
4 Conclusions 235 4.1 Chintang vs Nepali ...... 235 4.1.1 Commonalities ...... 235 4.1.2 Differences ...... 235 4.1.3 Mutual influence? ...... 237 4.2 Repercussions for general linguistics ...... 238 4.2.1 Differential marking ...... 238 4.2.2 Identifiability and quantifiability ...... 239 4.2.3 Non-reductionist explanation ...... 239 4.2.4 eories of DOM ...... 240
A Annotation guidelines Chintang 241 A.1 How to tag files ...... 241 A.2 Variables, carriers, and values: overview ...... 241 A.3 Variables, carriers, and values: details ...... 243 A.4 Additional helps for assessing values ...... 250
iii CONTENTS
B Annotation guidelines Nepali 257 B.1 How to tag files ...... 257 B.2 Variables, carriers, and values: overview ...... 259 B.3 Variables, carriers, and values: details ...... 261 B.4 Problems in complex sentences ...... 271
C Scripts 275 C.1 sad-parse.pl ...... 275 C.2 sad-consistency.pl ...... 281 C.3 sad-analysis.R ...... 288 C.4 dom-parse.pl ...... 290 C.5 dom-consistency.pl ...... 302 C.6 dom-analysis.R ...... 307 C.7 dom-regression.R ...... 310
D Verb paradigms 317 D.1 Chintang ...... 317 D.2 Nepali ...... 326
Lebenslauf 345
iv Anowledgements
Writing this thesis has been fun and a fight. anks go to all the people who helped me.
• De Balthasar Bickel hät mer d glägehait gää, im CLRP z schaffe und hät mi mit filne mänsche und technologie zämepracht. Är hät sich immer wider ziit gnaa zum mini arbet mit mir diskutiere und isch debii immer sachlich und fair gsii. Danken au an Fernando ür di super zwaitbetroiig.
• रो सहकम जीलाई प न ध र ध यवाद दन चाहा छ । वहा बना म पालमा हराउ र वहा को राजन तक कौशल बना म िछ ताङमा काम गन सि दन । • Baiʔ Rʌbinɨŋ upʌriwarcenɨŋ yaŋ dhʌnnebad pimace konnoʔ. Chintaŋbeʔ hunikhimbeʔ yuŋma upidaŋnɨhẽ, sapphi limlok kok uthuktaŋbidaŋnɨhẽ, kina oliolikha kamacebeʔ uphadaŋnɨhẽ. Rʌbinɨŋ unisa Sarmilanɨŋ thekthekthekwa khasɨŋsɨcɨŋ nusayaŋ huncirek maikaiʔmata ucek- taktace. Hiccibaŋŋa ta narek tuklok somma Nʌspati Kʌtha ukhemsace. Rʌbinɨŋ Kathman- dubeʔyaŋ tubaktacehẽ kina paĩyaŋ kam likhi luŋaʔã.
• Suryʌŋa yaŋ baddhe uphadehẽ. Chintaŋge, Bantawa, Panirɨŋko miʔmikha sʌbdʌce somma jamma nisoko kha raicha. Bamu pode numde haŋ bhasabʌigyanik lise phe hola. Teĩbe huŋkhiya miccinɨŋ toŋnogo maʔmi chiuhẽ pache anɨŋwa tiye.
• Sarah Schneider half mir bei der Annotation des Chintang Language Corpus. Danke!
• मलाई NNC को ट पणी गन र DOM को बनावट ब न सघाउन भएकोमा बलराम साइ जी र क ण पौड लजीलाई ध र ध यवाद दन चाह छ ।
• यस गरी रा पाली भाषाका ग ल मी नाथ जीलाई प न ध र ध यवाद दन चाह छ । वहा मलाई पाली भाषा न सकाउन भएको भए िछ ताङमा मा ह स ग कसरी क रा गन स ? • Meine Kollegen Taras Zakharko, Alena Witzlack-Makarevich und Giorgio Iemmolo boten mir ein angenehmes Umfeld und haen stets ein offenes Ohr ür meine merkwürdigen Ideen. Ohne euch wäre ich nur halb so gern ins Büro gekommen! Besonderer Dank geht an Taras ür seine Verrücktheit, an Alena ür leckeres Essen und an Giorgio ür ausgedehnte Kaffee- pausen, sowie an alle drei ür Kommentare zu dieser Arbeit. Спасиба, дзякуй, grazii.
• I would also like to thank all people working in the CLRP: transcribers, translators, and glossers. Your work has been the base for the Chintang Corpus of which my work has greatly profited. So thanks to all the Rais: रखी माया, जानकी आ , न का, ग बहा र, गण श, दयाराम, ग क मारी, अ नता, च क मारी, शाि त माया. anks to Diana Schackow and स तोष िघिम , who became a friend while I was in Nepal, as well as to all glossers in Germany and Switzer- land: Sebastian Sauppe, Krissi Labs, Claudia Polkau, Kodjo Vissiennon, Saskia Wunder, Anne Wienholz, Joel Prokopchuk, Rachel Weymuth, Marcel Sanjuan, Shirin Hegetschweiler, and Michael Erlach.
• Rechd sche dànga mechd i a no maina familje und am Bernd. Oafach fias dòsai.
v CHAPTER 0. ACKNOWLEDGEMENTS
e research for this thesis has been financed by the European Science Foundation under the EUROCORES programme EuroBABEL (CRP “Referential Hierarchies in Morphosyntax”, IP “Dif- ferential agreement in Chintang and differential case marking in Nepali”; DFG/ESF Grant No. BI 799/5-1, 2009-2013).
vi Conventions
0.1 Transcription ree main writing systems are used for the object languages in this book. e default is simplified, phonological IPA for both Chintang and Nepali. A further simplified, ASCII-compatible alphabet is used for a few frequent proper names such as Chintang and Nepali, which it would be cumber- some to write differently. Finally, the International Alphabet of Sanskrit Transliteration (IAST) is used for writing proper names outside examples and for historical materials. IAST can capture all graphemic distinctions made in Devanagari, the native script of Nepali (and of Chintang as far as it is wrien), which is important for proper names because orthography is distinctive there and for historical materials because we can only speculate about its pronunciation. In addition to the three main systems, phonetic IPA and Devanagari will occasionally be used to illustrate the precise pronunciation and graphic representation of words, respectively. Table 1 shows the spelling of the words Chintang and Nepali in all five writing systems. e detailed conventions for writing each language are described below.
Phonetic IPA Phonological IPA English Roman Devanagari IAST
[ˈts̻ ʰiɳʈaŋ]̻ /chintaŋ/ Chintang िछ ताङ Chintāṅa [n̪eˈpa:li] /nepali/ Nepali पाली Nepālī
Table 1: Spelling the words Chintang and Nepali
Chintang was not wrien at all before its linguistic description. e only exception were proper names used by the Nepali administration and therefore wrien in Devanagari. e version of the IPA I use here is the one developed by the Chintang and Puma Documentation Project (CPDP, Volkswagenstiung DoBeS programme, grant no. II/79 092, 2004-2008). It is shown in Table 2. e recent Chintang-Nepali-English dictionary by R¯a¯ı et al. (2011) uses Devanagari with some special conventions. is system is likely to be taken over by the Chintāṅa Bhāṣā Saṃskṛti tathā Sahitya Pariṣada (‘Chintang Language Culture and Literature Council’), which is planning to introduce Chintang classes in schools. It is also shown below for the sake of interest.
Phonetic IPA Phonological IPA English Roman Devanagari
p p p प pʰ ph ph फ b b b ɦ ब b bh bh भ m m m म w w w व ʈ t t ट ʈʰ th th ठ ɖ d d ड
Table 2: Writing Chintang vii CHAPTER 0. CONVENTIONS
Phonetic IPA Phonological IPA English Roman Devanagari ɦ ɖ dh dh ढ ɳ n n न s s s स l l l ɦ ल l lh lh ल r r r र ts̻ ̻ c c च ts̻ ʰ̻ ch छ d̻z̻ j j ɦ ज d̻z̻ jh jh झ j y y य k k k क kʰ kh kh ख g g g ɦ ग g gh gh घ ŋ ŋ ng ङ ʔ ʔ ' : ɦ h h ह i i i इ ɨ ɨ i उ̣ u u u उ e e e ए o o o ओ a a a आ ˜ ˜ - ँ
Table 2: Writing Chintang
Nepali has a long tradition as a wrien language, the earliest inscriptions stemming from the 13th century AD (Hu 1988:79). Its traditional script is Devanagari, which is also used for a couple of other South Asian languages such as Hindi and Marathi. Presently there is no generally accepted Devanagari standard orthography for Nepali but a multitude of competing conventions. One big difficulty is that many words display a great degree of pronunciational variation, mainly reflecting the education of the speaker and his knowledge of Sanskrit. For instance, the word vyavasthā ‘handling’ is a loanword from Sanskrit. Its pronunciations range from the most learned variant [ʋyʌʋʌsˈtʰaː]̪ to the most informal [bɛbəsˈtaː].̪ Accordingly, it can be spelt <