The Andean and Pacific regions of South America are home to a remark- able variety of languages and language families, with a range of typologi- cal differences. This linguistic diversity results from a complex historical background, comprising periods of greater communication between dif- ferent peoples and languages, and periods of fragmentation and individual development. The Languages of the Andes is the first book in English to document in a single volume the indigenous languages spoken and for- merly spoken in this linguistically rich region, as well as in adjacent areas. Grouping the languages into different cultural spheres, it describes their characteristics in terms of language typology, language contact, and the social perspectives of present-day languages. The authors provide both historical and contemporary information, and illustrate the languages with detailed grammatical sketches. Written in a clear and accessible style, this book will be a valuable source for students and scholars of linguistics and anthropology alike.
. . is Professor of Amerindian Languages and Cul- tures at Leiden University. He has travelled widely in South America and has conducted fieldwork in Peru on different varieties of Quechua and minor languages of the area. He has also worked on the historical- comparative reconstruction of South American languages, and since 1991 has been involved in international activities addressing the issue of lan- guage endangerment. His previously published books include Tarma Quechua (1977) and Het Boek van Huarochir´ı (1988).
. is Professor of Linguistics at the University of Nijmegen. He has travelled widely in the Caribbean and the Andes, and was previously Professor of Sociolinguistics and Creole Studies at the Uni- versity of Amsterdam and Professor of Linguistics and Latin American Studies at Leiden University. He is co-editor of the Cambridge jour- nal Bilingualism: Language and Cognition, and his previously published books include Bilingual Speech: a Typology of Code-mixing (Cambridge, 2000), and One Speaker, Two Languages (co-edited with Lesley Milroy, Cambridge, 1995). CAMBRIDGE LANGUAGE SURVEYS
General editors P. Austin (University of Melbourne) J. Bresnan (Stanford University) B. Comrie (Max Planck Institute for Evolutionary Anthropology, Leipzig) W. Dressler (University of Vienna) C. Ewen (University of Leiden) R. Lass (University of Cape Town) D. Lightfoot (University of Maryland) I. Roberts (University of Cambridge) S. Romaine (University of Oxford) N. V. Smith (University College London)
This series offers general accounts of the major language families of the world, with volumes organised either on a purely genetic basis or on a geographical basis, whichever yields the most convenient and intelligible grouping in each case. Each volume compares and contrasts the typological features of the languages it deals with. It also treats the relevant genetic re- lationships, historical development and sociolinguistic issues arising from their role and use in the world today. The books are intended for linguists from undergraduate level upwards, but no special knowledge of the lan- guages under consideration is assumed. Volumessuch as those on Australia and the Amazon Basin are also of wider relevance, as the future of the lan- guages and their speakers raises important social and political issues.
Volumes already published include Chinese Jerry Norman The languages of Japan Masayoshi Shibatani Pidgins and Creoles (volume I: Theory and structure; volume II: Reference survey) John A. Holm The Indo-Aryan languages Colin Masica The Celtic languages edited by Donald MacAulay The Romance languages Rebecca Posner The Amazonian languages edited by R. M. W Dixon and Alexandra Y. Aikhenvald The languages of Native North America Marianne Mithun The Korean language Ho-Him Sohn Australian languages R. M. W. Dixon The Dravidian languages Bhadriraju Krishnamurti THE LANGUAGES OF THE ANDES
WILLEM F. H. ADELAAR with the collaboration of PIETER C. MUYSKEN cambridge university press Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo
Cambridge University Press The Edinburgh Building, Cambridge cb2 2ru, UK Published in the United States of America by Cambridge University Press, New York www.cambridge.org Information on this title: www.cambridge.org/9780521362757
© Willem F. H. Adelaar and Pieter C. Muysken 2004
This publication is in copyright. Subject to statutory exception and to the provision of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press.
First published in print format 2004 isbn-13 978-0-511-21587-2 eBook (NetLibrary) isbn-10 0-511-21587-8 eBook (NetLibrary) isbn-13 978-0-521-36275-7 hardback isbn-10 0-521-36275-x hardback
Cambridge University Press has no responsibility for the persistence or accuracy of urls for external or third-party internet websites referred to in this publication, and does not guarantee that any content on such websites is, or will remain, accurate or appropriate. CONTENTS
List of tables page x List of maps xiv Preface xv Orthographic conventions xvi List of abbreviations xx
1 Introduction 1 1.1 The languages of the Andes4 1.2 Physical description6 1.3 Brief history of the region7 1.4 A brief overview of the different Andean countries 10 1.5 History of the study of the Andean languages 15 1.5.1 The colonial period 15 1.5.2 The nineteenth century 18 1.5.3 Contemporary Andean linguistics 19 1.6 Sources for the study of the languages of the Andes 20 1.7 Genetic relations of South American Indian languages 22 1.7.1 History of classificatory efforts 23 1.7.2 Quechuan and Aymaran, Quechumaran 34 1.7.3 Other proposals for individual language families 36 1.7.4 The Greenberg (1987) proposal 41
2 The Chibcha Sphere 46 2.1 The language groups and their distribution 50 2.2 Research on the native languages of Colombia 54 2.3 Chocoan 56 2.4 Yurumangu´ı60 2.5 Cuna 61 2.6 The languages of the Sierra Nevada de Santa Marta 66 vi Contents
2.7 Chimila 75 2.8 Bar´ı80 2.9 The Muisca language 81 2.9.1 Sources 82 2.9.2 Phonology 83 2.9.3 Grammar 89 2.9.4 Lexicon 103 2.9.5 A Muisca text 106 2.10 Tunebo (Uw Cuwa) 109 2.11 Yukpa and Magdalena valley Cariban 112 2.12 The Arawakan languages of the Caribbean coast 115 2.13 Timote–Cuica 124 2.14 Jirajaran 129 2.15 P´aez (Nasa Yuwe) 130 2.16 Andaqu´ı and the languages of the Upper Magdalena valley 138 2.17 Barbacoan languages 141 2.18 Kams´a 151 2.19 Esmeralde˜no 155 2.20 Overview of the languages of the eastern Colombian lowlands 161
3 The Inca Sphere 165 3.1 The languages and their distribution 168 3.2 The Quechuan language family 179 3.2.1 The Quechua homeland 180 3.2.2 Historical overview of the colonial period 182 3.2.3 Dialect situation 183 3.2.4 Quechua studies 191 3.2.5 Phonology 194 3.2.6 Grammar 207 3.2.7 Characteristics of the Quechua lexicon 233 3.2.8 A sketch of an Ecuadorian Quechua dialect (Salasaca) 237 3.2.9 A sketch of a Peruvian Quechua dialect (Pacaraos) 242 3.2.10 A Cuzco Quechua text fragment 249 3.2.11 Literary production in Quechua 254 3.2.12 Social factors influencing the future of Quechua 256 3.3 The Aymaran language family 259 3.3.1 Past and present distribution 260 3.3.2 Homeland and expansion 263 3.3.3 Internal variation in the Aymaran language family 264 Contents vii
3.3.4 Salient features of the Aymaran language family 267 3.3.5 Aymara phonology 270 3.3.6 Aymara grammar 274 3.3.7 Aymara lexicon 293 3.3.8 Literary production in Aymara 296 3.3.9 Aymara sample text 296 3.3.10 The Jaqaru language 301 3.3.11 Jaqaru sample text 315 3.4 The Mochica language 319 3.4.1 The sounds of Mochica 321 3.4.2 Mochica grammar 328 3.4.3 Mochica sample texts 344 3.5 Puquina and Callahuaya 350 3.6 The Uru–Chipaya languages 362 3.7 The Atacame˜no language 375 3.8 The Lule–Tonocot´e language 385 3.9 Extinct and poorly documented languages of the Inca Sphere 391 3.9.1 Ecuador 392 3.9.2 Northern Peru 397 3.9.3 Northwestern Argentina 407
4 The languages of the eastern slopes 411 4.1 The Pano–Tacanan languages 418 4.2 The Arawakan languages 422 4.2.1 Yanesha phonology 424 4.2.2 The principal grammatical features of Yanesha 425 4.2.3 Complex sentences in Yanesha 430 4.3 Tupi–Guaran´ı 430 4.4 The Jivaroan languages 432 4.4.1 Shuar phonology 433 4.4.2 The principal grammatical features of Shuar 435 4.4.3 Complex sentences in Shuar 445 4.5 Cahuapana 447 4.6 Bora–Huitoto 449 4.7 The Zaparoan languages 451 4.8 The Tucanoan languages 453 4.9 Small families and supposed language isolates in Ecuador 454 4.10 Small families and supposed language isolates in Peru 456 4.11 Chol´on 460 viii Contents
4.11.1 The Chol´on lexicon and relationship with Hibito 461 4.11.2 Gender-determined language use 462 4.11.3 Chol´on phonology 463 4.11.4 The principal grammatical features of Chol´on 463 4.11.5 The basic word order of Chol´on 475 4.12 Small families and supposed language isolates in Bolivia 475 4.13 Chiquitano 477 4.13.1 Gender-determined language use in Chiquitano 478 4.13.2 Chiquitano phonology 479 4.13.3 The principal grammatical features of Chiquitano 480 4.13.4 Chiquitano word order 488 4.14 The languages of the Chaco region: Guaicuruan, Matacoan, Zamucoan and Lengua–Mascoy 488 4.15 Quechua influences on eastern slopes languages 499
5 The Araucanian Sphere 502 5.1 Araucanian or Mapuche 508 5.1.1 Mapuche studies 510 5.1.2 The sounds of Mapuche 512 5.1.3 Grammar 517 5.1.4 Lexicon 537 5.1.5 Mapuche sample text 539 5.2 The Allentiac language 544
6 The languages of Tierra del Fuego 550 6.1 The languages and their distribution 552 6.2 Ethnohistory 555 6.3 Problems in classification 556 6.4 Linguistic features 558 6.4.1 The Chonan languages 558 6.4.2 Chono and Kawesqar 564 6.4.3 Yahgan 567 6.4.4 Areal-typological features of the Fuegian languages 578 6.5 Oral literature 580 6.6 Language contact 580 6.7 A Tehuelche text 582
7 The Spanish presence 585 7.1 Characteristics of Andean Spanish 585 Contents ix
7.1.1 Demography and the Iberian dialectal origins 586 7.1.2 Linguistic features 587 7.2 Amerindian substratum influence 589 7.3 Language mixture and pidginisation in the Andes and the Amazon basin 602 7.4 African influences 604 7.5 Language planning and policy with respect to the Amerindian languages and to bilingual education 605 7.6 Andean languages in the modern world 608
Appendix: Inventory of languages and language families of the Andean region 610 References 625 Author index 681 Index of languages and ethnic groups 690 Subject index 703 TABLES
1.1 Percentage of Indian population in the different Andean countries page 11 1.2 Early grammars of Andean languages 16 1.3 Greenberg’s (1956) classification of the languages of the Andes 28 1.4 The four networks proposed by Swadesh (1959, 1962) 29 1.5 The language families relevant to the Andes listed in Loukotka (1968) 31 1.6 Groupings suggested by Su´arez (1974) of language families and isolates included in Loukotka (1968) 32 1.7 Language families relevant to the Andes listed in Kaufman (1990) with their correlates in Loukotka (1968) 33 1.8 Greenberg’s (1987) classification of the languages of the Andes 44 2.1 Overview of the consonant inventories of Chocoan languages and dialects 58 2.2 Cuna consonant inventory 63 2.3 Overview of the consonant inventories of the Arhuacan languages 68 2.4 Possessive modifiers in Chimila 77 2.5 Personal reference markers for subject and object in Chimila 78 2.6 Inventory of Muisca consonant phonemes 88 2.7 Inventory of Muisca vowel phonemes 88 2.8 Personal reference in Muisca 97 2.9 Inventory of Muisca pronoun and case combinations 100 2.10 Uw Cuwa (Tunebo) consonant inventory 110 2.11 Proto-Yukpa consonants 113 2.12 Guajiro consonant inventory 117 2.13 Personal prefixes and pronouns in Guajiro 120 List of tables xi
2.14 Caldono P´aez obstruents 131 2.15 Caldono P´aez continuants 132 2.16 Caldono P´aez vowels 132 2.17 Person markers in Nasa Yuwe (P´aez) 135 2.18 Guambiano consonant inventory 143 2.19 Awa Pit (Cuaiquer) consonant inventory 144 2.20 Tsafiki (Colorado) consonant inventory 144 2.21 Cha palaachi (Cayapa) consonant inventory 145 2.22 Awa Pit pronouns 148 2.23 Cha palaachi pronouns 148 2.24 Kams´a consonant phonemes 152 2.25 Kams´a personal reference markers 153 3.1 Proto-Quechua consonants 196 3.2 Proto-Quechua vowels 196 3.3 The Quechua four-person system 211 3.4 Subject conjugation in Ayacucho Quechua 219 3.5 Valency-changing suffixes in Quechua 229 3.6 Jaqaru personal reference markers 269 3.7 La Paz Aymara consonant inventory 271 3.8 Aymara subject and subject–object paradigm for the unmarked tense 282 3.9 Subject and subject–object endings for the future tense in La Paz and Sitajara Aymara 284 3.10 Aymara subject and subject–object paradigm for the imperative mood 285 3.11 Subject and subject–object endings for the present and past potential mood in La Paz Aymara 286 3.12 Nominalising affixes in Aymara 288 3.13 Jaqaru consonant inventory 302 3.14 Subject and subject–object endings of the unmarked tense in Jaqaru 307 3.15 Subject and subject–object endings of the future tense in Jaqaru 308 3.16 Mochica vowels as represented in Carrera Daza (1644) and Middendorf (1892) 324 3.17 Sibilants in seventeenth-century Mochica 326 3.18 Overview of the consonant symbols in the Mochica grammars of Carrera Daza (1644) and Middendorf (1892) 329 3.19 Personal reference in Mochica 331 xii List of tables
3.20 Mochica preterit and future tenses 337 3.21 Numerals 1 to 10 in Mochica 343 3.22 Puquina personal and possessive pronouns 353 3.23 Possessed nouns in Callahuaya 359 3.24 Personal and possessive pronouns in Callahuaya 360 3.25 Callahuaya consonant inventory 361 3.26 Chipaya consonant inventory 364 3.27 Unmarked present tense in Chipaya 371 3.28 Tentative inventory of Atacame˜no sounds 380 3.29 Possessive nominal paradigm in Atacame˜no 381 3.30 Atacame˜no personal and possessive pronouns 382 3.31 Verbal past-tense paradigm in Atacame˜no 383 3.32 Lule personal endings for unmarked tense and nominal possession 389 3.33 Lule future and imperative verbal paradigms 389 4.1 The relationship between the Arawakan languages of the pre-Andean area 423 4.2 Yanesha (Amuesha) phoneme inventory 424 4.3 The relationship among the pre-Andean members of the Tupi–Guaran´ı language family 431 4.4 Shuar phoneme inventory 434 4.5 The Shuar switch-reference system illustrated with the verb ant- ‘to hear’ 446 4.6 Phoneme inventory of Bora 450 4.7 Phoneme inventory of Huao 454 4.8 The sound inventory of Chol´on 464 4.9 Chiquitano phoneme inventory 480 4.10 Phoneme inventory of Toba 489 4.11 Phoneme inventory of Bolivian (Noctenes) Mataco 493 4.12 Reconstructed consonant system of Proto-Matacoan 495 4.13 Phoneme inventory of Ayoreo 496 5.1 Mapuche consonant inventory 517 5.2 Personal and possessive pronouns in Mapuche 519 5.3 Mapuche subject endings 523 5.4 Unmarked verbal paradigm in Allentiac 546 5.5 Interrogative verbal paradigm in Allentiac 547 6.1 The relation between putative Chono words identified by Bausani (1975) and their possible equivalents in the Alacalufan materials of Skottsberg (1913) and Clairis (1987) 553 List of tables xiii
6.2 Historical demographic data for the canoe nomads 555 6.3 Historical demographic data for the hunter nomads 555 6.4 Phoneme inventory of Selk nam 559 6.5 Phoneme inventory of G¨un¨una Yajich 562 6.6 Phoneme inventory of Tehuelche 563 6.7 Tentative sound inventory of Chono 565 6.8 Phonemes of Kawesqar 566 6.9 Phoneme inventory of Yahgan (based on Golbert de Goodbar 1977 and Poblete and Salas 1999) 568 6.10 Phoneme inventory of Yahgan (based on Adam 1885) 569 6.11 Person inflection in Yahgan 575 6.12 Phonological features of the Fuegian languages 579 6.13 Morphological features of the Fuegian languages 579 6.14 Constituent order features of the Fuegian languages 579 7.1 Major isoglosses in the Andean areas of Latin American Spanish 588 7.2 Types of speakers of Spanish that may show influence from Quechua 590 7.3 Sprachbund phenomena in the pronunciation of liquids and vibrants in different varieties of Quechua and Spanish in Ecuador 591 7.4 Features claimed to be due to Quechua in different varieties of Spanish 593 MAPS
Map 1 The Chibcha Sphere: overview of ethnolinguistic groups attested in premodern sources page 47 Map 2 The Chibcha Sphere: approximate distribution of indigenous languages in the mid twentieth century 51 Map 3 The Inca Sphere: approximate distribution of indigenous languages in the sixteenth century 166 Map 4 The Inca Sphere: approximate distribution of indigenous languages in the mid twentieth century 169 Map 5 Approximate distribution of Quechua dialects in Peru and adjacent areas 184 Map 6 Distribution of Aymaran and Uru–Chipaya languages 260 Map 7 Eastern lowland languages: Ecuador and northern Peru 412 Map 8 Eastern lowland languages: southern Peru 413 Map 9 Eastern lowland languages: Bolivia 414 Map 10 Eastern lowland languages: the Chaco area 415 Map 11 The Araucanian Sphere: approximate distribution of languages at the time of the Spanish conquest (sixteenth century) 503 Map 12 The Araucanian Sphere: twentieth-century distribution of indigenous languages 504 Map 13 The languages of Tierra del Fuego 551 PREFACE
This book took much longer to write than originally intended, particularly because very little was known about some of the regions to be covered, while much new material has become available these last few years. We hope this survey will in its turn inspire new research in the years to come. We wish to thank first of all Bernard Comrie for his precise and encouraging comments on earlier chapter drafts. We are very grateful to Ana Fern´andez, Timothy Curnow, Knut Olawsky and Nicholas Ostler for reading and commenting on specific chapters of the book. A special word of thanks goes to Rodolfo Cerr´on-Palomino for providing us with data from his ongoing research on the Chipaya language, and to Alfredo Torero for permitting us to use his unpublished work on Puquina. Many colleagues and friends have contributed over the years with valuable advice and commentary, by providing us with newly published or little-known publications, or by calling our attention to new materials and research results. Their generosity is duly remembered, although space does not allow us to mention each of them individually. Our gratitude extends in particular to those academic institutions that have provided the environment and the facilities necessary for an undertaking such as the present one: the Netherlands Institute of Advanced Studies (NIAS) in Wassenaar, the Netherlands, the Research Centre for Linguistic Typology (RCLT) at La Trobe University in Victoria, Australia, the Research School CNWS of Asian, African and Amerindian Studies and the Faculty of Letters of Leiden University. While the book was written under the primary authorship and responsibility of Willem Adelaar, the individual chapters were divided as follows: Pieter Muysken wrote sections 1.1–1.5 of chapter 1 and Adelaar the introductory pages of chapter 1 as well as sections 1.6–1.7. Adelaar also wrote chapter 2 except for section 2.15, chapter 3 except for section 3.2.8, and all of chapter 5. Muysken wrote chapters 4, 6 and 7, as well as the sections 2.15 and 3.2.8. ORTHOGRAPHIC CONVENTIONS
This book on Andean languages relies on heterogeneous sources, including premodern grammatical studies and vocabularies, as well as contemporary descriptions obtained by direct observation of living languages. These circumstances made it difficult to adopt a uniform orthographic practice. The spelling of colonial and other traditional sources has been preserved, allowing for marginal adaptations where the sources are internally inconsistent. Languages such as Allentiac, Muisca and Puquina, which have long been extinct, are known from premodern sources only, and the interpretation of the symbols used to represent them remains tentative. In the case of relatively recent data from languages that became extinct during the twentieth century,such as Mochica and Timote– Cuica, the identification of sounds can be problematic when the recorded materials are not in agreement with modern linguistic standards. Such materials are exemplified in the original spelling as well. Finally, there are premodern sources relating to languages still spoken, for instance, Valdivia’s Araucanian grammar of 1606. The use of premodern sources includes exotic symbols, such as cɥ, c−h, h, γ and œ. Among the languages primarily known from premodern sources, the only one presented in a reconstructed, rather than an original spelling is Chol´on (see section 4.11.3). For most of the living languages we are on firmer ground, although for these too we have to rely on published sources with different methodological approaches, theoretical backgrounds and degrees of phonological abstraction. In view of the necessity to repre- sent such heterogeneous materials, we have opted for a phonetically based orthography such as commonly found in North American linguistic journals dealing with Amerindian languages (e.g. International Journal of American Linguistics). Consequently, several of the original symbols have been replaced with others, and adjustments have been made at the level of individual languages so as to facilitate the presentation of the linguistic facts in a unified way.1 In a number of cases (e.g. Guajiro l and r, Mapuche r, Quechua n and q) concessions have been made to established practice. Such deviations of the
1 We wish to apologise beforehand for the inevitable errors and inconsistencies that are inherent to this procedure. Orthographic Conventions xvii overall orthographic practice adhered to in this book are duly explained in the respective sections.
Vowels When only vowel quality is taken into consideration, most languages of the Andean region select their vowels from a set of five, including two front vowels (e, i), two rounded back vowels (o, u) and one low vowel (a). These vowels usually exhibit a certain amount of non-distinctive variation, which is not shown in the orthography except when the phonetic realisation itself is a topic of discussion. In addition, many Andean languages also have an unrounded vowel which may be high central, mid central, or high back. We represent this sixth vowel by means of the symbol ,regardless of its exact phonetic nature and possible existing spelling conventions. For the representation of languages exhibiting an additional contrast between a high central and a mid central vowel we use the symbols and ə to distinguish between the two. The main reason for following this procedure is to preserve unity in the presentation. It is, furthermore, justified by the consideration that the sixth vowel often shows a wide range of non-contrastive variation, depending on the phonetic environment in which it occurs, and the fact that the observations of different authors rarely coincide, even when they are dealing with the same language. In the absence of specialised phonetic studies, almost non-existent in the case of Andean languages, the exact phonetic nature of the sixth vowel generally remains uncertain. Vowel systems of a different qualitative structure are found in Mochica, in languages of the Amazonian lowlands, and in languages of Tierra del Fuego. They will be discussed in the respective chapters (sections 3.4.1, 4.6 and 6.4). For these cases, as well as in explanative phonetic representations relating to more current Andean systems, additional symbols (ɑ, α, œ, ε, ¨ı, ɔ, ɯ) are used.
Secondary articulations of the vowels – Vowel length is indicated by a colon (a:, e:, i:, o:, u:), except when the long vowel consists of several tone-bearing units. In that case, the vowels are written separately (aa, etc.). Extra short vowels are marked as follows: aˇ, eˇ, ˇı, oˇ, uˇ. – Nasal vowels: a˜, e˜, ˜ı, o˜, u˜, etc. – Aspirated vowels: ah, eh, ih, oh,uh, etc. – Glottalised vowels: aʔ, eʔ, iʔ, oʔ, uʔ, etc. – Voiceless vowels: a, e, i, o, u, etc. ˚ ˚ ˚ ˚ ˚ – Tonal contrast is indicated by means of an acute accent (for high or rising tone), a grave accent (for low or falling tone), a superscript level stroke (for mid level tone), or a circumflex (for a descending tonal glide): a´, a`, a¯, xviii Orthographic Conventions
aˆ. Contrastive stress is also indicated by means of an acute accent. Stress and tone are indicated only when contrastive.
Non-syllabic vowels When non-syllabic, the high vowels i and u are analysed as glides, hence they are written as y and w, respectively. This is always the case in syllable-initial position (yV, wV ), and it is the preferred option in postvocalic syllable-final position (Vy, Vw). Occasionally, however, postvocalic glides are represented as vowels (Vi, Vu), when the status of vowel sequences in the language under discussion appears to favour that choice.
Consonants In the following overview the consonant symbols are grouped in categories: – Bilabial: plain stops p, b; implosive stop: ; fricatives ϕ, β; nasal m; glide w. – Labiodental: fricatives f, v. – Interdental: stops t, d; fricatives θ, ð; nasal n. ˆ ˆ ˆ – Alveodental: plain stops t, d; implosive stop ;affricates c (ts in phonetic explanations), dz (dz before secondary articulation markers, as in dzy); fricatives s, z; nasal n. – Apico-alveolar: fricative s¸;affricate ts¸. – Alveopalatal: affricates cˇ, d zˇ; fricatives sˇ, zˇ; glide y. – Retroflex: stops .t, d. ;affricates c.ˇ, d.zˇ; fricatives .sˇ, .zˇ; nasal n. ; glide ɺ. – Palatalised velar (ich-laut): fricative: c¸;affricate: tc¸ . – Velar: stops: k, g; fricatives x, γ; nasal ŋ. – Uvular (or postvelar): stops q, G; fricatives x. , ʁ. – Glottal: fricative h; stop ʔ. (Note: h can also refer to a velar fricative because many Andean languages tend to use glottal and velar fricatives in a non-distinctive way.) – Laterals: plain (voiced alveodental) l; interdental l; retroflex .l;voiceless fricative l˜;voiceless affricate λ-. – Vibrants: voiced tap r; trill rr; tap with palatal affrication rˇ; retroflex flap .
Secondary articulations of the consonants – Gemination is indicated by doubling the consonant symbol (pp, kk, nn, etc.). Double rr represents a trill, rather than only a geminate. (Quechua nn is a cluster [ŋn]; see section 3.2.5.) Orthographic Conventions xix
– Coarticulation is indicated by juxtaposition of the symbols: kp, pk, pkw. – Glottalisation: p’, t’, c’ˇ , k’, q’, etc. – Aspiration (of stops and affricates): ph, th, cˇh, kh, qh, etc. – Preaspiration or voicelessness (of resonants): hm, hn, hr, etc. – Palatalisation: ty, ky, ny, ly, etc. – Labialisation: pw, mw, kw, xw, etc. – Prenasalisation (or postoralisation): mb, nd, ŋg. – Postnasalisation (or preoralisation): bm, dn, gŋ. – Click-like articulation: p<, m<. – Syllabic resonants: l, n. ˚ ˚
Other symbols and conventions VVowel (only in phonological explanation). C Consonant. [. . .] Phonetic representation or tentative pronunciation. Etymological provenance or borrowing source. <...> Symbols used in premodern sources. { ...} Explanation of morphological structure. - Morpheme boundary. Division of morphemic glosses. = Division of constituents in reduplicated forms. .Division of speech elements covered by a single morphemic gloss. Division of morphemic glosses relating to a portmanteau speech element. ABBREVIATIONS
In the example sentences of this book morphemic glosses may consist of numbers, letters, or letter combinations. For reasons of presentation, all letter combinations have been limited to a maximum of two elements. Grammatical person is indicated by means of the numbers 1, 2, 3 and 4, which refer to the four-person system characteristic of the structure of some of the languages treated in this work:
1 first person (speaker) 2 second person (addressee) 3 third person (neither) 4 fourth person (speaker + addressee)
These numbers can be followed by the symbols S (subject), O (object), or P (possessor) without an intervening hyphen or dot:
1S, 2S, 3S, 4S first-person subject, etc. 1O, 2O, 3O, 4O first-person object, etc. 1P, 2P, 3P, 4P first-person possessor, etc.
The following list is an inventory of all the remaining morphemic glosses, which consist of letters or letter combinations.
A absolute (as opposed to relational) absolutive (as opposed to ergative) AB ablative case AC accusative case AD additive (‘also’, ‘even’) AF affirmative (evidential) AG agentive nominaliser AJ adjective adjectiviser AL allative case List of abbreviations xxi
AN anticompletive (‘still’, ‘yet’) AO aorist AP applicative AR attributive AS assistance AT attenuator AU augmentative AV adverbialiser AX auxiliary B benefactive case BN beneficiary C comitative case CA causative CD conditional mood CE centripetal (converging motion) CF centrifugal (diverging motion) CL classifier or shape morpheme (with semantic specification, e.g. CL: round object) CM completive (‘already’) CN continuative CO co-ordination CP comparative (‘like’) CR circular motion (‘go around’) CS courteous CT change of topic CU customary CV copula verbaliser D dual DA dative case DB distributive DC deictic DD different directions DE desiderative DF definite DG degree DI diminutive DL delimitative (‘just’, ‘only’) DM detrimental xxii List of abbreviations
DP deductive past DR directional DS different subjects (in switch-reference systems) DT distal DU dubitative DV declarative DW downward motion Eergative case EC exclamation EM emphatic ES external subject EU euphonic element EV event/action EX exclusive (addressee excluded) F future FA factitive (‘make’) FE feminine FM factual mood FN future-oriented nominaliser FO focus FR far remote G genitive case GA genitive agent GO goal GP generic pair GR gerund H hither (motion towards speaker) HB habitual past HN honorific HO hortative HS hearsay (evidential) HY hypothetical Iinverse IA imperfective aspect IC inchoative ID indicative mood IE irrealis mood IF infinitive List of abbreviations xxiii
II indefinite IK indirect knowledge IL inferential IM imperative mood IN inclusion (‘provided with’) IP inferential past IR interrogative IS instrumental case IT intensive IU immediate future IV inclusive (addressee included) IW inward motion L locative case LA lack (‘not having’) LB left-behind object LI limitative case (‘until’) LK linking element LN locality nominaliser LP lexicalised prefix LS lexicalised suffix LV locative verbaliser M momentaneous MA motion across (‘traverse’) MD medial MS masculine MT motion N noun nominaliser NA narrative past NC non-control ND non-determinate NE negation NF near future NM nominative case NP nominal past NR near remote NS non-speaker NT non-transitive xxiv List of abbreviations
NU neutral O object OB obligation OC oblique case OE ongoing event OS ownership (‘having’) OV obviative P possessor PA past tense PC paucal PD predicate marker PE perfect tense PF perfective aspect PI privative PL plural PM permissive PN present tense PO potential mood PR progressive PS passive PT perlative case (path) PU pronoun PV previous event PX proximate Q question marker QU quotative R relativiser RC reciprocal RD realis mood RE recent past RF reflexive RL relational (possessed) RM remote past RO reportative RP repetitive RR referential RS restitutive RU remote future List of abbreviations xxv
S subject SA simple aspect SD sudden discovery tense SG singular SI simulation SJ subjunctive mood SM simultaneous SO source SN stative nominaliser SP supine SQ sequential SR speaker SS same subject (in switch-reference systems) ST state SU subordination T transitive transitiviser TF transformative (‘become’) TH thither (motion not towards speaker) TO topic TS thematic suffix TV thematic vowel Uurgency UF unfulfilled UG undergoer UN unspecified subject UW upward motion Vverb verbaliser VE verbal extension VO vocative Z zero complement ZP zero person
1
Introduction
In his book Visi´on hist´orica del Per´u (A Historical Vision of Peru) the Peruvian historian Pablo Macera (1978) dates the beginning of human presence in the middle Andes at about 20,000 BC. The supposition of such an early human occupation, difficult to explain within the context of New World prehistory, is based on datings relating to excavations conducted by MacNeish at the highland site of Pikimachay of the Pacaicasa complex near Ayacucho (cf. MacNeish 1979). These datings are now considered very controversial (cf. Rick 1988). Although Macera himself recognises the uncertain character of the 20,000 BC date, its value is more than just scientific. It acquires the character of a fictitious date, needed to express the emotional feeling of timeless antiquity often associated with Andean culture and tradition, a feeling that is best put into words by the expression milenarismo andino (‘Andean millenarism’). It is not the cold evidence of radiocarbon datings, but the conscience of an immobile human society that clings fatalistically to age-old agricultural traditions perfectly adjusted to the formidable Andean landscape, that determines the view of the Andean intellectual until today. It is the view of a reality which has always been there, seemingly immune to the triviality of programmes aimed at modernisation and globalisation. In the meantime, the antiquity of human settlement in the Andean region, indeed in all of South America, remains a matter of debate. The rise of sea levels at the end of the Ice Age (± 10,000–8000 BC) may have hidden the traces of early coastal occupation. Excavations conducted by Dillehay at Monte Verde, near Puerto Montt in the south of Chile, have brought evidence of a relatively well-developed village culture that had its beginnings as early as 11,500 BC. (Dillehay 1989–97; cf. also Fiedel 1992). The inhospitable southern tip of South America at the Strait of Magellan (Fell’s Cave) was inhabited about 9000 BC. When considering the linguistic evidence, the bewildering variety of mutually unrelated languages found in South America suggests a protracted, gradual process of penetration, followed by long periods of isolation. This evidence appears to be in conflict with the traditional concept of a rapid colonisation of the subcontinent by big-game hunters, associated with the Clovis horizon of the North 2 1 Introduction
American plains (± 9500 BC). For an overview of the arguments in favour of a rapid colonisation of South America after 9500 BC, see Lynch (1999). For the Pacific side of the South American continent, the alternative of an early human penetration in a context of marine and coastal activity remains attractive to those familiar with the Andean situation, even though there is little support from archaeology. The date at which human activity throughout the Andean region becomes unequivocally visible is 9000 BC. From a cultural point of view, the Andean civilisation initially did not lag behind the rest of the world. Its agricultural beginnings were among the oldest in the world. The site of Guitarrero cave in the Callej´on de Huaylas (north-central Peru) contains evidence of plant domestication (beans, peppers) before 8000 BC (Lynch 1980; Fiedel 1992: 193). Agriculture in the Andes reached a high degree of sophistication, both in diversity of crops and in engineering techniques (terraces, raised fields, irrigation works). The Andean camelids possibly became domesticated as early as 4000 BC (evidence from Telarmachay, Jun´ın, in central Peru; Fiedel 1992: 195). The mummification techniques of the Chinchorro fishermen of the coast near Arica in northern Chile (5000 to 1500 BC) predated those of the Egyptians (Arriaza 1995). The construction of the extensive (pre- ceramic) urban settlement of Caral-Chupacigarro, which has been excavated since 1996 near Supe in the central Peruvian coastal area, has been dated at about 2650 BC (Shady Sol´ıs 1997). Curiously, the Andean society failed to develop an indigenous writing sys- tem, a circumstance that sets it apart from other areas of civilisation elsewhere in the world. The variety of native cultures and languages in South America, in particular in the Andes and on its eastern slopes, is remarkable even within the context of the New World. Kaufman (1990) has calculated the number of language families and geneti- cally isolated languages in the subcontinent at 118. Recent advances in the study of historical-comparative relations have tended to reduce this number, but proposed group- ings reducing the number of families mainly concern the eastern part of South America (cf. Dixon and Aikhenvald 1999). The Andean area, with its wealth of mutually unre- lated languages, has remained as opaque as ever in this respect. The linguistic diversity is not only genetic; the typological distance between some of the language groups is also impressive. It suffices to have a quick look at almost neighbouring languages such as Quechua, Mochica and Harakmbut to be struck by the differences. The historical picture is further obscured by the radical changes that have affected South America during the last five centuries of the second millennium. Scores of native languages, including entire families, have disappeared, often without leaving a trace. Others have dwindled to insignificant numbers. A few of them, including Aymara, Mapuche and Quechua, maintained a prominent position during the colonial period, partly at the cost of other languages, only to become endangered themselves 1 Introduction 3 in the subsequent period. Mapuche, Muisca and Quechua acted as linguae francae for local tongues, which were considered obstacles to evangelisation and effective domi- nation. Most languages, however, gave way to Spanish, the language introduced by the conquerors. The Spanish occupation, which for the Andean region began in Panam´a, the Caribbean coast of Colombia and Venezuela, and at the mouth of the River Plate, brought death and destruction for many native groups. The prosperous and numerous Cueva people of the Darien region in eastern Panam´awere exterminated between 1510 and 1535, their country depopulated, given back to the jungle, and partly occupied by other na- tive groups (Romoli 1987). Many others were forced to participate in civil wars or to join discovery parties geared at finding the legendary country of El Dorado (Hemming 1978). Epidemics of devastating dimensions swept through the continent even before the conquest. Huayna Capac, the last ruler of the undivided Inca empire, became one of their victims. After the arrival of the Europeans and during the first half of the colonial period the native population dropped dramatically. Many nations, such as the Quimbaya of the Cauca river valley in Colombia, known as the New World’s most talented goldsmiths, disappeared with their languages during that period. At the same time a benign and protective colonial rule guaranteed a state of relative quietude and prosperity. During most of the colonial period widely used native languages, such as Quechua, benefited from a certain prestige and legal protection. In 1770 the new Bourbon administration headed by Charles III banned the use of the indigenous languages from his domains and started a period of effective repression (Triana y Antorveza 1987: 499–511; Mannheim 1991: 74–9). In Peru the repression gained momentum in 1781 after the unsuccessful outcome of the Indian rebellion led by Tupac Amaru II. The independence of the South American nations was at first a new drawback for the native populations. As a last manifestation of indigenous sentiment, the act of 1816 declaring the independence of the United Provinces of R´ıo de la Plata, the predecessor of Argentina, was printed in Tucum´an both in Spanish and in Quechua. Subsequently, the linguistic and cultural rights of native South Americans were discontinued everywhere. In the more traditional areas with large indigenous populations, the hacienda system with its oppressive bondage practices reached its worst dimensions. Physical elimination by military forces or headhunters struck the Indians of Argentina, Uruguay and Chilean Tierra del Fuego, who had largely remained independent throughout the colonial period. The Araucanians of southern Chile lost their independence and integrity as a nation. The increase in the exploitation of rubber around the turn of the nineteenth century brought untold misery to the tribes of the Peruvian and Colombian rainforest, including slavery, deportation and ruthless massacres (Taussig 1987; Gray 1996). Until the second half of the twentieth century, the attitude of the South American governments and national societies remained indifferent to the existence of the native 4 1 Introduction languages, if not overtly hostile. The survival of these languages depended on the per- severance of their speakers, occasionally with the support of sympathising groups, such as indigenista circles or missionaries. Only during the last decades has there been a growing awareness at the national level of the importance of the cultural and linguistic heritage and the practical consequences of a multilingual reality. It started in 1975 in Peru with the recognition of Quechua as a second national language, a measure now largely forgotten. Meanwhile, the multicultural and multilingual character of the Bolivian na- tion has been recognised at the official level. A strong movement of highland Indians has come to play a crucial role in Ecuadorian politics. Finally, the cultural and terri- torial rights of native groups have been recognised in Colombia’s constitution of 1991 (see section 1.4 below). There have been several more or less successful attempts to introduce bilingual education in Bolivia, Ecuador and Peru. Needless to say, the prac- tical elaboration of all these measures and their effectiveness still leave much to be desired.
1.1 The languages of the Andes The languages of the Andes are not at all structurally similar, nor directly related, and are spoken in a huge area. Nonetheless, there are many connections between them, and they share a recent history of domination by Spanish. To us falls the task of both pointing out general traits, and doing justice to their various properties. It is only when compared to each other that their individuality emerges most clearly. In addition, we must try to stay clear of viewing these languages as static. In Race et histoire (1952) L´evi-Strauss warns against viewing other civilisations as either infant or stationary. When we sit in a train, our perception of the movement of other trains depends on the direction they are travelling in, with respect to our own train. The history of the Andes is characterised by an alternation between periods of greater communication and integration of different peoples and languages, and periods of frag- mentation and individual development. For this reason we must find, on occasion, a middle perspective between the Andean region as a whole and individual languages. We have tried to establish this by describing the Andean languages grouped into different ‘spheres’, zones which at different points in time have functioned as single units. Within these cultural spheres, the languages have influenced each other, sometimes rather pro- foundly. Hence our repeated insistence on the phenomenon of language contact in the chapters that follow. This book consists of seven chapters. In the introductory chapter we begin by sketching the geographical and the historical context in which the languages of the Andes attained their present form and use. We then turn to an overview of the linguistic and demographic situation of the Indians in each of the Andean countries, and to the history of descriptive and comparative studies of the languages of the Andes. Finally, we give a brief outline of 1.1 The languages of the Andes 5 the history of classificatory efforts for the Andean languages. More details are provided in chapters 2 to 6, which deal with specific regions or spheres. Chapter 2 deals with the Chibcha Sphere, which we define as the Venezuelan Andes and Colombia, including some of the border areas of Colombia with Peru and Ecuador. In chapter 3 the Inca Sphere is discussed, roughly the area covered by the Inca empire: highland and coastal Ecuador and Peru, highland Bolivia, northern Chile and north- west Argentina. Chapter 4 deals with the eastern slopes of the Andes and the up- per Amazon basin in Ecuador, Peru and Bolivia, and some information on the Gran Chaco area of Argentina, Bolivia and Paraguay will be included. Chapter 5 is dedi- cated to the Araucanian Sphere: the Chilean coast and highlands and part of south- central Argentina. Chapter 6 treats the languages of Tierra del Fuego and adjacent Patagonia. Chapter 7, finally, deals with the non-Indian languages, primarily Spanish, but also Afro-Hispanic survivals, as well as Amerindian contact vernaculars. In addition, policies of bilingual education and language maintenance are surveyed. Language contact in the Andes has taken several forms. First, we find the use of spe- cific lexical items (e.g. the Quechua word waranqa ‘thousand’ or reflexes of Quechua atawalypa ‘chicken’, ‘rooster’) in a wide variety of languages, many of which have never been in direct contact with the language of source. This suggests extensive and, as Nordenski¨old (1922) argues, sometimes fairly rapid transmission, over a wide area. Second, we find cases of intense lexical influence from either a demographically or a culturally dominant language, as in the case of Mapuche influence on G¨un¨una K¨une in Argentina, or Quechua influence on Amuesha on the Andean foothills in Peru. Third, there is evidence of highly complex patterns of long-term convergence, inter- ference and mutual lexical influence in the contact between languages of the Aymaran and Quechuan families. The contact has been so intense in this case that for a long time the two families were thought to be directly related. We will address the question whether the situation of Aymara and Quechua is unique in South America, or whether there are other cases of intense mutual influence as well. Fourth, there are documented patterns of language mixture through relexification, e.g. in the case of Media Lengua in Ecuador and the Callahuaya in Bolivia, a group of itinerant herbal curers, who used a sort of secret language with elements of Puquina and Quechua. Fifth, a phenomenon frequently observed is the fusion of the remnants of a tribe that has been decimated in number with another more vigorous tribe. Under the protection of their new social environment such tiny groups may preserve a language for gener- ations, and only gradually adopt the dominant language, as in the Chiquitano area in Bolivia. 6 1 Introduction
Finally, the Indian languages of the Andes have responded in different ways to the pressures from Spanish, from incidental lexical borrowing, through convergence and relexification, to shift and substrate. In the chapters to follow these different types of contact will be explored in some detail.
1.2 Physical description The Andes – or, more properly speaking, the Cordilleras de los Andes – constitute a mountain range about 7,000 kilometres long. They stretch all along the west coast of South America, from near Caracas to Cape Horn. On the average, the Andes are 400 kilometres wide, but at the twentieth parallel, at the altitude of Bolivia, over 900 kilometres. Steep on the western or Pacific side, the Andes are flanked by lower ridges on the eastern side, providing a more gradual transition to the Amazon and La Plata basins. In the south the Andes start out as a single ridge, but in northern Chile they split up into several ridges, enclosing the widening altiplano (high plain) of Bolivia and southern Peru. Through northern Peru and Ecuador there are two ranges, with a valley in between. In southern Colombia these join again before fanning out over this country in three separate cordilleras, the easternmost of which reaches into Venezuela. The Andes are a very high range, with several dozen peaks above 6,000 metres, and generally very high passes. Only the Chamaya highlands, at the border between Ecuador and Peru, provide an easy passage from the Amazon basin to the Pacific, as Raymond (1988: 281) points out, providing the opportunity for tropical forest/coastal plain contacts starting in early prehistory. For our purposes, the physical characteristics of the Andes are important for a number of reasons. First of all, because of their inhospitable character they have provided zones of refuge for numerous indigenous groups. We have but to compare Bolivia, where both remote mountainous regions and inaccessible Andean foothill areas have provided niches for Indian languages, with the Argentinian plains, where widely spread Indian groups were destroyed by the regular Argentinian army in the nineteenth century, to realise the effect that the physical environment has in this respect. The linguistic and cultural zones of refuge exist both where extremely harsh conditions or poor soils made colonisation difficult or unprofitable, and where the terrain made communication with and travel to regional centres an ordeal. Within the ecological perspective taken here, it is important to ask ourselves, for each indigenous language in South America and each group, how come it still exists, resisting or escaping destruction or assimilation? A second crucial aspect of the Andes, with its often steep slopes, is that it has made available different ecosystems even to a single ethnic group. Thus we find the Quechua- speaking Saraguro Indians in the province of Loja, southern Ecuador, cultivating maize and other cereal crops in the highlands in alternation with the raising of cattle on the eastern slopes. Murra (1975) has documented a very extensive system of ‘vertically’ 1.3 Brief history of the region 7 organised barter and economic cooperation networks in the Andes of Bolivia and Peru, in which groups located at different altitudes were allied. Sometimes these subgroups belonged to the same ethnolinguistic group, sometimes they did not. Altiplano groups such as that of the Bolivian Lupaca kingdom relied on an archipelago of lower-down settlements for their coca leaves and maize crops. Third, the mountains influence the climates in the Andean region enormously, in conjunction with the Humboldt current. In the extreme south, the western slopes are humid, and on the eastern side, Patagonia, it is dry. Near Valparaiso, however, where the Humboldt current reaches the Chilean coastline, the coast becomes a desert and the east- ern side more humid. This is the situation throughout Peru. At the altitude of the Ecuadorian border it changes again: tropical rains fall on both sides of the Andes. In Colombia, the coastal zone is hot and humid, and the central valleys are cooler. Thus we have virtually all existing climates represented in the region we are studying: from the Pacific deserts of Chile and Peru through the rainforest near the Brazilian borders of Bolivia, Peru and Colombia, to the permanent snow of the mountains in the altiplano regions around Lake Titicaca.
1.3 Brief history of the region We will sketch the prehistory of the Andean region on the basis of Peruvian prehistory, since it has been studied in the greatest detail and provides a point of reference for the whole region. At various points we will link developments in the northern and southern Andes to the central region focused on here. In Keatinge (1988a, b) the archaeological evidence is reviewed, and it is concluded that the earliest human occupation of the central Andes that is well documented dates back to 9000–8000 BC. The early occupants were hunters and gatherers, and they had well-defined lithic technologies. Soon settlements emerged, on the coast centred around fishing and gathering shellfish, and in the highlands based on the domestication of plants and animals. Although in the central Andes the preceramic period lasted till around 1800 BC, there is evidence of high levels of cultural evolution, e.g. in large constructions such as at Sech´ın Alto. The ceramic period is characterised by phases in which cultural elements were shared by groups in the whole central Andes, called Horizons, and intermediary periods in which cultural developments (as reflected, for instance, in ceramic patterns) were more regional. The Early Horizon (900 BC– AD 200), is associated with the Chavin de Hu´antar religious shrine and represents the consolidation of a pan-Andean religious foundation. The Middle Horizon (AD 600–1000, according to Keatinge 1988a, b) is linked with the two large urban centres of Huari and Tiahuanaco, which may have been the capitals of two empires: Tiahuanaco around the Titicaca basin and extending into western Bolivia and northern Chile, and Huari extending as far as the northern Peruvian coastal plains. These zones of influence did not last for more than two centuries, but they formed the 8 1 Introduction scene for a large network of exchange of goods, visual motifs and patterns of organisation throughout the whole central Andes. The shared religious heritage remains, however, in the subsequent period of regionalisation and is preserved in such centres as Pachacamac. During this period of regionalisation we do see large kingdoms emerging, particularly on the coastal plains of northern Peru, such as the Chim´u kingdom. The Late Horizon corresponds to the Inca period, to which we will turn shortly. To the north in Ecuador we find equally old early settlements, both near Quito and on the Santa Elena Peninsula, where there are traces of some of the earliest New World ceramics and textiles. Ecuador was at the crossroads between the Peruvian civilisations just mentioned and circum-Caribbean cultures. The bivalve shell spondylus, fished along the Pacific coast of Ecuador (and later much further north as well), was a highly valued object of trade, not just in Ecuador but also in Peru, particularly in the Chav´ın culture. Colombian archaeological remains date back to 8000 BC at the site of Tequendama (Correal and Van der Hammen 1977; Lynch 1999). Ceramic techniques were known as early as 3000 BC at the site of Puerto Hormiga on the Colombian Caribbean coast (Reichel Dolmatoff 1965; Rojas de Perdomo 1979; Allaire 1999). The earliest construc- tions at San Agust´ın date back to 500 BC (Rojas de Perdomo 1979), but the highly developed gold-working techniques, which inspired the Spanish thirst for gold and led to the myth of El Dorado, generally can be dated as having arisen in the first millennium. The San Agust´ın culture lasted until shortly before the arrival of the Spaniards. Although the Andes are associated in popular opinion with the Inca civilisation, historically the Incas played a relatively minor role. In the early part of the fifteenth century, they rose as a military power in southern Peru. Under Pachacuti Inca Yupanqui the southern highlands were conquered, and one by one the earlier Peruvian states, including powerful Chim´u, were toppled. After 1460 his son Tupac Inca conquered the northern highlands, as far as Quito in Ecuador, and after 1471 highland Bolivia and adjacent parts of Chile and Argentina were incorporated into the growing Inca empire. While most highland territories thus became Inca, the tropical forest remained out of reach for the new conquerors. Unlike earlier military powers, the Incas were not content with looting new territory, but rather they organised and restructured it. Huayna Capac, who succeeded around 1492, only added small parts to the empire, and withstood the first major assault on it, from the Chiriguanos in the southeast. When Huayna Capac died in 1527, his two sons, Huascar and Atahuallpa, fought over domination for five years, and when a small group of Spaniards under Francisco Pizarro invaded Peru in 1532 they could profit from the divisions caused by the wars of succession and from the disaffections among local elites of nations recently conquered by the Incas. In addition, the Incas were greatly debilitated by wavesofEuropean epidemic diseases, smallpox and measles, which had reached the Andes even before the advent of the Spaniards themselves. 1.3 Brief history of the region 9
From 1538 on Peru was firmly under the control of the Spanish colonialists, and it remained under their control inspite of uprisings and resistance during the seventeenth and eighteenth centuries. Resistance to Spanish colonial rule took several forms. While the Spaniards had been able to conquer most of the Andean region rather rapidly, in the monta˜na,onthe eastern slopes, the Incas held out for a long time, in fact until 1572, in their stronghold at Vilcabamba. While Europeans played the dominant role in the highlands, it should not be thought that there were no Indian rebellions during the colonial and republican periods. Messianic movements (Ossio 1973) kept flaring up throughout the colonial and early republican periods. Only a few can be mentioned here. In 1564 there was a revolt in the Ayacucho region of Peru, inspired by native religious leaders, called Taki Onqoy (lit. ‘dancing sickness’, i.e. ‘dancing into a trance’), which spread through large parts of central Peru and lasted seven years (Millones 1973, 1990). Around 1780 there was the famous uprising in southern Peru of Tupac Amaru II, a remote descendant of the Incas, which gained enormous peasant support before being squashed. The P´aez in Colombia withstood the attacks by various Spanish conquistadores, in- cluding Belalc´azar,1 but here also the Spaniards profited from conflicts between the various Indian nations. The Magdalena valley remained difficult to control for the colo- nial rulers until the nineteenth century. Just as the Incas had never been able to conquer the Mapuche in Chile, the Spanish conquistadores, after some initial successes, were unable to bring this Araucanian group down. Until the latter part of the nineteenth century, the Mapuche retained their inde- pendence and proved to be a formidable enemy, requiring repeated military expeditions by the colonial and later by the republican powers. After new hostilities, in 1869 and again in 1880, they were finally subdued or ‘pacified’ in 1882. They were forced to share their limited agricultural area with intrusive settlers, but a majority has stayed in the heartland south of the Biob´ıo river. The independence from Spain of the Andean regions and the formation of new nation-states brought some changes for the Indian populations, but in many ways the patterns established in the colonial period persisted. The overt rebellion against Spain started in 1810 at the two opposite ends of the Spanish empire – Caracas and Buenos Aires – and spread from there to the central Andean regions. Bol´ıvar in the north first liberated Venezuela, then Colombia and then Ecuador, with the help of Sucre. San Mart´ın started in the Argentine and then liberated Chile. The two met in Peru, where the battle of Ayacucho in 1824 marked the effective end of Spanish rule in South America.
1 Benalcazar or Benalca¸car in the colonial sources. 10 1 Introduction
Even though Bol´ıvar attempted to form larger nation-states, e.g. uniting Peru and Bolivia, and uniting Colombia, Venezuela and Ecuador, the contours of what were to become the Spanish American republics became clearer as the Wars of Independence were fought. The republics founded in the early nineteenth century have remained till today, and have now developed strong national identities. It would be a mistake to think that independence was inspired by the Indians or was beneficial to them. The opposite is the case, for several reasons. To begin with, independence from Spain did not mean full autonomy. When the Spanish officials were gone, European bankers, traders and settlers stepped in. The nineteenth century was a period of more intensive exploitation of a new series of natural resources in Latin America. The guano dug up along the Peruvian coast, the saltpetre mined in northern Chile and the rubber gathered in the upper Amazon basin are examples of this. In many areas, the Indians were driven from their home- steads by new colonists or forced to participate in the new explorations under hardship conditions. Further, the nationalism accompanying the forging of new nations was often translated into a desire for the cultural homogeneity of the citizenry. Public education, in Spanish, wasextended into rural areas. Cities expanded, and urban norms and values were seen as signs of modernity. All of this meant that Indian lifestyles were depreciated and threatened. An extreme result is the genocide perpetrated against the Indians of the Argentinian pampas under the command of General Roca (1878–82). Finally, independence had been fought for and won by elites associated with the import and export sectors of the colonial economies, who had been clamouring for trading possibilities with different nations, against the Spanish monopoly. These elites favoured the breaking up of the traditional feudal landholding system, which had exploited the Indian work force but at the same time sheltered their culture, or rather a complex amalgam of their traditional culture and colonial patterns. Modernisation of agriculture was accompanied by the increasing mobility of rural labourers, and hence by the splitting up of traditional Indian communities. These three factors still hold and help shape the relations between Indians and non- Indians in the Andes. In recent history organised Indian movements have allied them- selves with political movements, but major guerrilla activities such as Sendero Luminoso in Peru and FARCin Colombia are only peripherally related to Indian movements (cf. the contributions in Eckstein 1989).
1.4 A brief overview of the different Andean countries All Andean countries have a native population which speaks several native languages. However, the number of languages that became extinct since 1500 probably exceeds 1.4 Brief overview of the Andean countries 11
Table 1.1 Percentage of Indian population in the different Andean countries (a fairly conservative estimate based on Instituto Indigenista Interamericano 1993)
Total population Amerindian population Percentage
Venezuela 21,300,000 315,815 1.48 Colombia 35,600,000 620,052 1.74 Ecuador 10,600,000 2,634,494 24.85 Peru 22,900,000 8,793,295 38.39 Bolivia 8,200,000 4,142,187 50.51 Chile 14,000,000 989,745 7.06 Argentina 33,900,000 372,996 1.10 that of the languages still spoken. There is no longer a full coincidence between Indian descent and the preservation of the native languages. Nor are languages always spoken in their original locations. The social developments of the second half of the twentieth century have induced many Indians to migrate to urban centres both within and outside their original living areas. The countries involved in our study differ widely in the variety and relative impor- tance of native languages. Consider the figures for 1993 of the percentages of the Indian ethnic groups of the total population in different countries (these figures refer to cultur- ally identifiable ethnic groups, not to speakers of Indian languages) in table 1.1. Bolivia, Ecuador and Peru differ clearly from Argentina, Chile, Colombia and Venezuela in the size of their indigenous populations. Although overall figures have changed somewhat since 1993, the percentages remain comparable. These figures are rather conservative estimates; for individual countries, sometimes higher figures are given. With the excep- tion of Chile, all Andean countries mention Amerindian languages explicitly in their constitutions. Of course, these documents are no more than that, and explicit mention in the constitution is no guarantee of the existence or otherwise of an indigenous language. However, these texts do reflect the self-perception of the political classes, at least, of the different countries. Let us briefly look at these countries one by one. Venezuela.Insofar as the region covered by this book is concerned, the Indian languages of Venezuela have been preserved mainly in the area to the west of Lake Maracaibo. In the Andes all native languages, including several isolates and small fam- ilies, are considered extinct. The 1999 Venezuelan constitution states in its article 9 that Spanish is the official language, while the indigenous languages are also for official use by the indigenous peoples and must be respected throughout the Republic, since they constitute a cultural 12 1 Introduction heritage of the nation and of humanity at large.2 Article 100 provides for the recog- nition of cultural diversity and equality of all cultures that constitute the Venezuelan identity.3 Finally, there is a transitory measure regulating the election of indigenous po- litical representatives, which includes the requirement that they speak their indigenous languages. Colombia. The Andean region of Colombia occupies the western, central and northern parts of the country, and is bordered by tropical lowlands in the east. The Andean mountain ranges, which run from north to south, are separated by mighty river valleys and extensive forest areas, especially along the Pacific coast. At the arrival of the Europeans many autonomous ethnic groups inhabited Andean Colombia, which has preserved part of its original multilingualism. The languages that have survived belong to different families and constitute linguistic islands in a largely Hispanicised country. The Indian groups of the Colombian Andes are known for their spirit of independence and ethnic awareness. Their high level of organisation is rooted in the defence of their rights to reserved areas (resguardos) inherited from the colonial period. Many of them encourage efforts to codify and preserve their languages as long as these are compatible with the interests of the community. The 1991 constitution of Colombia (with modifications in 1997) states in article 10 that Spanish is the official language of Colombia while the languages and dialects of the ethnic groups are also official in their territories. Education in communities with their own linguistic traditions will be bilingual.4 Ecuador. Ecuador has one of the highest percentages of Indian population in South America. It is mainly concentrated in the inter-Andean valleys and the Amazonian low- land to the east, referred to as the oriente in Ecuador. Some small communities inhabit the northern part of the forested region separating the Pacific Ocean from the Andean ranges. The majority of Ecuadorian Indians speak Quichua, the local variety of the Quechua lan- guage spoken in the Andean countries further south. Quichua is found in the entire high- land region except for its northern and southern extremities (in the provinces of Carchi and Loja). The Quechuanisation of highland Ecuador became complete during the colo- nial period when it replaced a multitude of local languages. At the same time Quichua was introduced in the Ecuadorian oriente,where it also gradually replaced some of the local
2 El idioma oficial es el castellano. Los idiomas ind´ıgenas tambi´en son de uso oficial para los pueblos ind´ıgenas y deben ser respetados en todo el territorio de la Rep´ublica, por constituir patrimonio cultural de la Naci´on y de la humanidad. 3 Las culturas populares constitutivas de la venezolanidad gozan de atenci´on especial, recono- ci´endose y respet´andose la interculturalidad bajo el principio de igualdad de las culturas. 4 El castellano es el idioma oficial de Colombia. Las lenguas y dialectos de los gruposetnicos ´ son tambi´en oficiales en sus territorios. La ense˜nanza que se imparta en las comunidades con tradiciones ling¨u´ısticas propias ser´a biling¨ue. 1.4 Brief overview of the Andean countries 13 languages. Varese(1983) estimated the Indian population of the Ecuadorian oriente at be- tween 30,703 (minimum) and 58,353 (maximum). The Abya–Yala cultural organisation on its website Peoples of Ecuador (http://abyayala.nativeweb.org/ecuador/pueblos/php) mentions a figure of over a 100,000. Article 1 of the 1998 constitution of Ecuador perhaps goes furthest in declaring that the state respects and stimulates the development of all the Ecuadorian languages. While Spanish is the official language, Quechua, Shuar and the other ancestral languages are to be used officially for the indigenous peoples.5 Article 23 states that every person has the right to be informed in her or his mother tongue, of proceedings against her or him.6 Article 69, finally, guarantees a form of bilingual education in which the indigenous language is the principal one and Spanish the language for intercultural relations.7 Peru. Like Ecuador, Peru has an Indian population of several millions concentrated mainly in the Andes. The Peruvian eastern lowlands (selva) and the foothills (monta˜na) separating them from the Andean highlands are inhabited by a substantial number of ethnic groups. Their number has been estimated between 200,000 and 220,850 (Varese 1983). More recent estimates (Pozzi-Escot 1998) are slightly higher, but the number of speakers of lowland languages of five years and older has been calculated at 130,803 by Chirinos Rivera (2001). The Peruvian coast harbours several communities that have native American roots but have lost their language. The Andean highlands are dominated by the presence of two languages, Quechua and Aymara.Peruvian Quechua shows a considerable amount of internal dialect diversity. The number of Quechua speakers in Peru has been calculated at 4,402,023 (Cerr´on- Palomino 1987a). A recent estimate by Chirinos Rivera (2001), based on the national census of 1993, is much lower, however: 3,199,474 speakers of five years and older. Aymara is mainly confined to the southern departments of Puno, Moquegua and Tacna and has around 350,000 speakers in Peru. It has 412,215 speakers of five years and older according to Chirinos Rivera. Article 48 of the 1993 Peruvian constitution talks of official languages in the plural, declaring these to be Spanish and, in those zones where they are dominant, Quechua, Aymara and the other aboriginal languages.8 In the general motivation for the 1993 con- stitution it is mentioned that Peru is to be conceived of as a multiethnic and multicultural
5 El Estado respeta y estimula el desarrollo de todas las lenguas de los ecuatorianos. El castellano es el idioma oficial. El quichua, el shuar y los dem´as idiomas ancestrales son de uso oficial para los pueblos ind´ıgenas, en los t´erminos que fija la ley. 6 Toda persona tendr´aelderecho a ser oportuna y debidamente informada, en su lengua materna, de las acciones iniciadas en su contra. 7 El estado garantizar´aelsistema de educaci´on intercultural biling¨ue; enel ´ se utilizar´a como lengua principal la de la cultura respectiva, y el castellano como idioma de relaci´on intercultural. 8 Son idiomas oficiales el castellano y, en las zonas donde predominen, tambi´en lo son el quechua, el aimara y las dem´as lenguas abor´ıgenes, seg´un la ley. 14 1 Introduction country, where all citizens have the right to express themselves in their own language before any authority.9 Bolivia. The Bolivian highland region is again characterised by the dominance of Quechua and Aymara. Only small pockets of speakers of other highland languages remain. In the lowlands surrounding the northern edge of the Andean high plateau there is a wide array of small, genetically isolated languages. The number of Quechua speakers in Bolivia is estimated at 2,194,099 on the basis of the 1992 census figures, that of Aymara speakers at 1,503,754 (Alb´o 1995, I: 19). Speakers of the lowland languages are estimated at more than 96,000 (Alb´o 1995, I: 19). The 1967 constitution of Bolivia, with modifications dating from 1994, guarantees in article 171 the rights of the indigenous peoples, including those concerning their identity, values, languages and customs, and institutions.10 Chile. The Mapuche people, who constitute the majority of the Chilean indigenous population, are mainly concentrated in the region called La Araucan´ıa in the south of that country. Originally, they inhabited most of the central and southern mainland parts of Chile, including the island of Chilo´e, but centuries of war and colonising pressure have reduced their territorial space. Although there may be a million people of Mapuche descent, only an estimated 40 per cent continue to speak the language. There are no reliable figures as to the actual number of speakers, however. In addition to Mapuche only a few other native languages are found in the northern and southern extremities of the country. The current Chilean constitution makes no reference, as far as we could establish, to language and culture, indigenous or not. This may reflect the fact that Chileans tend not to perceive themselves as a partly Amerindian nation. Argentina. The northwestern part of Argentina is inhabited by Indians and mestizos belonging to the Andean cultural sphere. Many of them speak Quechua or did so in the past. The Gran Chaco, to the east of the northern Argentinian Andes, is inhabited by the Tupi–Guaran´ı Chiriguano and several other important indigenous groups speaking Guaicuruan and Matacoan languages. Araucanian (Mapuche) is the dominant Indian language in the south and southwest of Argentinia. Other groups in Patagonia and Tierra del Fuego are extinct or nearly so. In the central and western part of the country all
9 La primera visi´on que tiene el Proyecto concibe al Per´u como pa´ıs pluri´etnico y pluricultural; en consideraci´on a ello el Proyecto comienza estableciendo por ejemplo, que todos los peruanos tienen el derecho a expresarse en su propio idioma, no solamente en castellano, sino tambi´en en quechua o en aymara, ante cualquier autoridad. 10 Se reconocen, respetan y protegen en el marco de la ley, los derechos sociales, econ´omicos y culturales de los pueblos ind´ıgenas que habitan en el territorio nacional, especialmente los relativos a sus tierras comunitarias de origen garantizando del uso y aprovechamiento sostenible de los recursos naturales, a su identidad, valores, lenguas y costumbres e instituciones. 1.5 History of the study of the Andean languages 15 aboriginal Indian languages have disappeared. Indians or people of local indigenous descent number several hundred thousand in Argentina. Article 75 of the 1994 constitution of Argentina states that Congress must recognise the ethnic and cultural pre-existence of the Argentinian indigenous peoples, and guarantee the respect for their identity and right to bilingual and intercultural education.11 The term ‘pre-existence’ presumably refers to the Argentinian self-perception as a non- Indian immigrant nation.
1.5 History of the study of the Andean languages Little if anything is known about linguistics in the preconquest era, although there may have been awareness of linguistic differences in the Inca empire. The colonial era, in which missionary priests started recording the richness of the Indian linguistic heritage, is well worth describing in some detail. The nineteenth century is dominated by laymen: primarily European scholars, often with an archaeological and historical interest. In the twentieth century we find a stronger American presence, in addition to the emergence of groups of researchers in the different Andean countries themselves. The recent period shows several new developments: the participation of speakers of native languages in the research, the widening concern for language use, the concern about language endangerment and the role of multilingualism in Andean society. We will describe the developments in these periods in sequence, first looking at the external history of language study in each period, and then consider the treatment of a particular grammatical construction. Relative clauses in Quechua are used as an example to illustrate different phases in the thinking about Andean languages: they are complex and unfamiliar enough to have posed a challenge for different generations of outside scholars.
1.5.1 The colonial period The first data on any Andean language gathered by an outsider, as far as we know, consist of vocabulary noted down by Pigafetta, as Vocables Des G´eantz Pathagoniens. Antonio de Pigafetta accompanied Fernando de Magalh˜aes (Magellan) on his voyage around the world between 1519 and 1522 (Pigafetta 1956: 177 ff). The list of words includes body parts, some terms referring to social status (her ‘chief’) and the physical universe. No grammatical items were recorded, except chen [ˇcen] ‘us’. The list reveals only a superficial dialogue context and probably reflects some profound misunderstandings. Contacts did not lead to immediate colonisation. This was different, of course, when
11 Reconocer la preexistenciaetnica ´ y cultural de los pueblos ind´ıgenas argentinos. Garantizar el respeto a su identidad y el derecho a una educaci´on biling¨ue e intercultural... 16 1 Introduction
Table 1.2 Early grammars of Andean languages (arranged from north to south)
Chibcha 1609 Bernardo de Lugo Mochica 1644 Fernando de la Carrera Quechua (Inca) 1560 Domingo de Santo Tom´as 1607 Diego Gonz´alez Holgu´ın Aymara 1603 Ludovico Bertonio Morocosi (Mojo) 1699 Anonymous Jesuit Mapuche 1606 Luis de Valdivia Allentiac 1607 Luis de Valdivia Millcayac 1607 Luis de Valdivia
the Andes were taken from the north by Spanish bands of conquistadores in the 1530s. With the conquistadores came the priests. From the very beginning of the European presence in South America Indian languages were studied and documented by Roman Catholic missionaries. Some of the early lin- guistic descriptions written in the sixteenth and seventeenth centuries are admirably accurate for the time. They contain a wealth of information, which is particularly valu- able when placed in the interpretative context of recent findings. Rowe (1974) calculates that by 1700 grammars of 21 Amerindian languages had been published, 19 of which dealt with languages of Mexico and South America. In addition, we find a large num- ber of vocabularies, catechisms, etc. Some of the first grammars written for Andean languages are listed in table 1.2. The early grammars had an ecclesiastical use, and this meant that they were couched in a vocabulary a priest could understand. Grammatical descriptions often ‘reduced’ grammars to the Latin mould. Most missionaries used the model developed by the Spanish grammarian Antonio de Nebrija in his Introductiones Latinae (c.1488); see D¨ummler (1997). Still, we should not underestimate the achievement of early colonial grammarians. First of all, they were writing at a time when very few languages had been described yetinEurope. Fray Domingo de Santo Tom´as’s (1560a) grammar of Quechua antedates the first grammars of German (1573) and of English (1586). Second, we find, next to or subsequent to fumbling first efforts, early seventeenth-century masterpieces, still un- surpassed, of grammatical description and lexicographic exploration. Examples include Bertonio (1603a, b) on Aymara, Valdivia (1606) on Mapuche and Gonz´alez Holgu´ın (1607) on Quechua. An adequate assessment of their intellectual background and insights into the organisation of grammar is still lacking, even though a number of studies are filling the gap (Su´arez Roca 1992; Troiani 1995; Zimmermann 1997; Dedenbach-Salazar 1.5 History of the study of the Andean languages 17
S´aenz and Crickmay 1999; Nowak 1999; Zwartjes 2000). The work of Herv´as (Herv´as y Panduro 1784, 1800–5) reflects the detailed knowledge which Jesuits and missionaries of other congregations had in the eighteenth century of the linguistic situation in South America. We will illustrate the evolution of the grammatical tradition by using the example of the treatment of Quechua relative clauses. Quechua relative clauses are generally formed with nominalising particles. When the subject is relativised, -q is used (examples are given from the Cuzco variety):
(1) kay-man hamu-q runa ruwa-nqa this-AL come-AG man do-3S.F ‘The man who comes here will do it.’
The relative clause precedes the antecedent, and the relativised element is generally not expressed in the clause itself; cf. Weber (1983, 1989), Cole et al. (1982), Lefebvre and Muysken (1982, 1988). In the Central Peruvian varieties the relative clause generally follows the antecedent; if it precedes the antecedent, it is often interpreted generically. When a non-subject is relativised, a different nominalising particle, such as -sqa,is used:
(2) qaynunˇcaw riku-sqa-yki runa ruwa-nqa-n yesterday see-SN-2S man do-3S.F-AF ‘The man you saw yesterday will do it.’
The grammars by Santo Tom´as (1560a) and Gonz´alez Holgu´ın (1607) illustrate the early tradition. The unfamiliar features of Quechua relative clauses, such as the subject/non-subject distinction, their position and the absence of tense marking, posed a considerable challenge for Spanish priests. Santo Tom´as (1560a) does not discuss nom- inalising particles in relation with relative clauses. Rather, he introduces a periphrastic construction in his fifteenth chapter (‘Of relatives’), as in (3) (his spelling):
(3) Pedro pori-rca, pay-pas, o chay-pas, o quiquin-pas, micu-rca Pedro walk-PA.3S he-AD or that-AD or self-AD eat-PA.3S ‘Pedro walked, the same ate.’ (Pedro anduvo, el cual comi´o.)
This coordinate construction occurs in Quechua, but it is not the most common way to form a relative. Gonz´alez Holgu´ın’s (1607) grammar constitutes a considerable improvement. It is noted that the primary way of forming relative clauses is with a participle and without a relative pronoun, as in (1), but then Gonz´alez Holguin goes on to say that a clearer and more elegant way (oraci´on muy clara y elegante)exists to express the same meaning, as 18 1 Introduction in (4), involving the question word pi ‘who’, which again recalls the European models (original spelling):
(4) Dios-pa gracia-n-pac pi-ch camari-cu-n, o pi-pas camari-cu-n chay-ca usachi-cu-nca-tac-mi God-G grace-3P-B who-DU prepare-RF-3S or who-AD prepare-RF-3S that-TO attain-RF-3S.F-EM-AF ‘Whoever prepares himself for God’s grace, that person will certainly attain it.’
Gonz´alez Holgu´ın also mentions a third way of forming relatives, corresponding to Santo Tom´as’s example (3). An admirably detailed and generally faithful account of the language is coupled with Jesuit certainty and missionary zeal, and with a desire to cultivate the language (somewhat along the lines of Latin). This cultivation of Quechua was part of Spanish colonial policy (see chapters 3 and 7). University chairs were established for Quechua and Aymara at the University of San Marcos in Lima, for Quechua in Quito and for Muisca in Santaf´edeBogot´a. Half a century later we find the first grammar of Quechua in which different varieties are explicitly treated (Torres Rubio 1619), and this coincides with the beginning of the comparative tradition in Andean language studies, which will be the subject of the final sections of this chapter.
1.5.2 The nineteenth century The opening up of Latin America to other European nations after independence was accompanied by a new type of scholar dedicated to the languages of the Andes. Wilhelm von Humboldt had already paid attention to languages such as Quechua, basing himself on seventeenth- and eighteenth-century sources (1836, 1971). He saw the Quechua relative clauses in an evolutionary perspective, in a fashion typical of nineteenth-century historicist thought. The nominalised participles, as in (1) and (2), reflect an early, more primitive phase in Quechua language and culture; the relative clauses involving question words as in (4) correspond to the higher attainments of Inca civilisation. The same tendency to view Andean culture patterns in terms of the Great Civilisations is found in Ernst Middendorf, a curious but highly impressive figure who arrived in Peru as a physician in 1855, twenty-five years old. He spent a total of twenty-five years in the country, in three extended periods, first as a physician in Lima and then as a hacendado, a landowner, in the Cuzco region. Middendorf combined the edition of surviving texts in a number of Andean languages with actual fieldwork. In his Quechua grammar (1890a, 1970) the relatives formed through coordination, (3), and those formed with a 1.5 History of the study of the Andean languages 19 question word, (4), move to the second plan. The crucial subject (1)/non-subject (2) distinction is stated clearly for the first time. For the first time as well we find an explicit discussion of possible Spanish influence on the Andean languages: are relative clauses formed with a question word the result of Spanish grammatical influence? Middendorf considers the question unanswerable, and indeed the issue is highly complex (see, for instance, Lefebvre 1984; Appel and Muysken 1987: 161), since already in the earliest Quechua sources we find a special type of correlative formed with question words, but no ordinary relative clauses formed this way.
1.5.3 Contemporary Andean linguistics Particularly after the Second World War, the study of Andean languages underwent a new upsurge, stimulated by linguists from the United States and Europe, both secular and evangelist. The activities of the early missionary grammarians have received a modern continuation in the work of the Summer Institute of Linguistics (SIL; in Spanish: Instituto Ling¨u´ıstico de Verano, ILV),whose members study the native Indian languages for purposes of evangelisation. SIL members have been active in most South American countries. Two typical examples of post-Second World War descriptive linguistics are Yokoyama (1951) and Levinsohn (1976). In contrast with the notionally based grammars of tra- ditional and nineteenth-century linguists, these scholars focus on formal patterns and resemblances. Yokoyama discusses relative clauses in Quechua in terms of their con- stituent morphemes, going through some of the different uses of a suffix such as -sqa, before coming up with some examples of a relative clause. Levinsohn (1976), working in the framework of tagmemics, sets up numerical for- mulas for different verb and suffix combinations. We get examples such as (5a), which can be glossed as (5b):
(5) a. Vb 1331.1 – H:nighua b. ni-g-wa say-AG-C ‘with the one who says’
While the formula in (5a) may have made it possible for the author to describe the lan- guage economically and consistently, for outsiders it is difficult to gain much information from it without a lot of puzzle work. Work in more contemporary grammatical frame- works such as Lefebvre and Muysken’s (1988) analysis, couched in the Government and Binding model, while providing more examples, similarly has proven little more accessible. 20 1 Introduction
In work done by scholars from the Andean countries often a more tradition-based, practical perspective is adopted.
1.6 Sources for the study of the languages of the Andes Linguistic fieldwork is, of course, the principal source of information concerning the languages presently spoken. Cases where this type of research is particularly urgent are numerous. Notwithstanding its essential importance, field research cannot answer all the questions relevant to the past and present state of the South American languages. Manuscripts with word lists, travel accounts and recordings in public archives or libraries, or in private possession, may provide data concerning extinct languages and earlier stages of living languages. As we have said before, early Spanish grammarians contributed substantially to our knowledge of native American languages, and some of them played a role in efforts towards the standardisation of the Quechua language to suit its use as an instrument of evangelisation. Among these was the team that translated the Doctrina Christiana and the Catechism into Quechua following the Third Lima Council of 1583. They designed a unified, phonologically simple version of Quechua in which the regional differences were to some extent attenuated (Cerr´on-Palomino 1987b: 84–90). Others, such as Bertonio and Gonz´alez Holgu´ın, wrote grammars and dictionaries (of Aymara and Cuzco Quechua, respectively) considered among the classics of renaissance linguistic description. How- ever, they too focused their attention mainly upon those languages considered useful for purposes of colonisation and evangelisation. When unimportant from a numerical point of view, languages were mostly left unrecorded, or the grammars, dictionaries and catechisms dedicated to them remained in single manuscript versions. Many of these subsequently became lost. One manuscript grammar that fortunately has sur- vived is de la Mata (1748), kept in the British Library in London. It is a grammar of the Chol´on language spoken until recently in the Huallaga valley in the Peruvian department of San Mart´ın. However, the previous existence of Barzana’sgrammar (pub- lished in 1590; see Brinton 1891: 170) of the Diaguita or Kak´an language, once spoken in what is now northwestern Argentina and northern Chile, is only known to us from an indirect source. Most unfortunately, there is no grammatical nor lexical information on this language apart from a substantial number of place names and a few terms in local use. Not until shortly before the end of Spanish colonial presence in the Andean region, do we find a renewed interest in local linguistic and cultural conditions. A remarkable figure representing this current, inspired by the European Enlightenment, is Baltasar Jaime Mart´ınez Compa˜n´on, who collected word lists of the native languages still spoken about 1780 in the northern Peruvian coastal and sierra regions (Mart´ınez Compa˜n´on 1985). Mart´ınez Compa˜n´on stood at the beginning of a tradition of systematically collecting 1.6 Sources for study 21 data for a variety of Amerindian languages that lasted through the nineteenth and part of the twentieth centuries. The analysis of texts in native languages collected for different purposes, published and unpublished, constitutes another promising field. Much work remains to be done in this area. Several areas in South America, especially in the Andes, exhibit an ancient agricultural tradition rooted in the pre-Columbian and early colonial past but have lost their original languages. In such areas, the study of toponymy can provide information about the linguistic situation as it was in early postconquest times. Such an undertaking can help not only to identify the languages formerly spoken in a particular area, but also to provide an indication of their territorial extension at the time before they became extinct. This type of research has been carried out in recent years in relation to the coast and sierra of northern Peru (Torero 1986, 1989; Adelaar 1988, 1999). As regards the issue of extinction, a precaution is in order. Rediscoveries of languages thought extinct are not unusual. Elderly people may remember a language whereas the younger generation is hardly aware that it ever existed. Chol´on, for instance, was found to be still in use with a few individuals in the Peruvian Huallaga valley (Barbira 1979, cited in Cerr´on-Palomino 1987a), although it is probably extinct now. Van de Kerke (1998, 2000) found a number of speakers of Leco, a language on the slopes of the Andes in Bolivia that had been considered extinct as well. Several surveys dealing with the present-day situation of the native languages in the South American countries have provided data for this book. For Colombia the principal sources are publications of the Centro Colombiano de Estudios de Lenguas Abor´ıgenes (CCELA) of the Universidad de Los Andes and Gonz´alez and Rodr´ıguez (2000). For both Colombia and Ecuador a basic source are publications of the Summer Institute of Linguistics. Torero (1974) and Cerr´on Palomino (1987a) are important reference works for Quechua. For Aymara there is Hardman et al. (1988), and Cerr´on Palomino (2000). For Chile the main reference for Mapuche is Salas (1992a). The Argentinian situation is treated in Klein (1985) and Censabella (1999); as far as Patagonia and Tierra del Fuego are concerned, a basic source is Clairis (1985a). With respect to the Amazon area, important compilations have appeared, such as Key (1979), Pottier (1983), Klein and Stark (1985a), Derbyshire and Pullum (1986, 1990, 1991, 1998), Doris L. Payne (1990a), Queixal´os and Renault-Lescure (2000) and Dixon and Aikhenvald (2000). Loukotka (1968), Tovar and Larrucea de Tovar (1984), Migliazza and Campbell (1988), Campbell (1997) and Fabre (1998) provide the most complete recent bibliographical data. These books give a fairly complete listing of South America’snative languages, accompanied by remarks concerning existing classificatory proposals. Their aim is not to contribute a reasoned classification in itself. The Atlas of the World’s Languages (Moseley and Asher 1994), with a contribution by Kaufman, provides additional information. 22 1 Introduction
1.7 Genetic relations of South American Indian languages South America has rightly been called ‘the least known continent’ (Lyon 1974). This statement holds true, in particular, when applied to the genetic relationships of the native languages of its western half. Much remains to be done before a comprehensive classification can be established of the numerous native languages spoken, or once spoken, in the Andes and in the pre-Andean lowlands. Genetically isolated languages and small language families predominate in the area. Not the number of languages, but rather the number of irreducible genetic units constitutes its most striking feature. The resulting impression of extreme linguistic diversity is partly due to insufficient documentation, but in cases where good data are available the situation seems to be no less complex. The linguistic situation in the Andes is comparable in many ways to that in other parts of the Americas, except for the circumstance that maybe more languages became extinct here during the last five centuries than anywhere else on the continent. In a majority of cases, such languages have remained undocumented. Their extinction implies the loss of just as many potential genetic links between the languages still in use. Consequently, some of the best-known languages of the Andean region, such as Araucanian or Mochica, do not form part of any genetic grouping that could meet with the consensus of linguistic scholarship. The same holds true for ‘shallow’ genetic units such as Quechua, a conglomerate of closely related dialects, and Aymaran (also known as Aru or Jaqi), the language family that includes Aymara as its most important representative. The existence in South America of a number of large genetic groupings including many widely scattered individual languages has been known for a long time. They in- clude the Arawakan, Cariban, Chibchan and Tupi linguistic stocks. All four of them are represented in the Andes, in the pre-Andean lowlands or in both. However, with the exception of Chibchan in the northern Andes, they all occupy a predominant place elsewhere in South America. There have been several attempts to link linguistic isolates of the eastern Andean slopes to one of the larger stocks just mentioned, but in most cases without lasting success. The genetic status of several languages that were once tenta- tively classified as Arawakan or Chibchan will have to be reconsidered. The Amuesha language of the central Peruvian forest slopes constitutes a remarkable exception to this. Its supposed membership of the Arawakan family was long considered controversial, but its close genetic relationship with other Arawakan languages of the Pre-Andine subgroup has now become established beyond reasonable doubt (Wise 1976). Like in North America, the diversity of languages seems to have been the greatest along the mountainous spine on the Pacific side of the continent, which may also have constituted the scene of the earliest wave of migrations. Early colonial observers, such as Bibar (1558), Cieza de Le´on (1553), Cobo (1653) and Sim´on ([1626] 1882–1892, II: 116, 284; cited in P´erez de Barradas 1955: 17–19) speak of an amazing variety 1.7 Genetic relations of South American Indian languages 23 of languages in what is now Colombia, Peru and northern Chile. It is often not clear, however, whether the languages referred to were actually different languages or merely local varieties of more widespread languages. When Sim´on, for instance, speaks of the numerous languages used in the highlands surrounding Santaf´edeBogot´a and Tunja in New Granada (present-day Colombia), he may have been referring to dialects of the prevailing Muisca and Duit languages with the possible inclusion of some unrelated neighbouring idiomatic groups. On the other hand, the linguistic variety in the northern Peruvian bishopric of Trujillo observed by Garcilaso de la Vega (Royal Commentaries of the Incas, 1609, Book VII, chapter 3) turned out to be quite real. Except for Mochica, these languages remained virtually without record throughout the colonial period and hardly received any attention until their extinction at a relatively recent date. The cultural and political developments that took place in the Andes have favoured the spread of a few languages, i.e. Quechua, Aymara and Araucanian, at the cost of scores of local languages originally present in the area. Their expansion was initiated before the establishment of the Inca empire and the European invasion. In quite a few cases, the local languages did not disappear before the end of the colonial period. A few of them remained vital well into the twentieth century. In addition to the expansion of the major native languages, the rapid spread of Spanish also contributed to further reducing the complexity of the linguistic situation in the Andes. Spanish, for instance, has directly replaced Mochica and several other languages on the northern Peruvian coast as well as in its mountainous hinterland. The disappearance without record of so many potential relatives has doubtlessly con- tributed to the apparent genetic isolation of the languages still spoken. Tenacious efforts to relate some of the surviving languages of the area to each other, rather than con- sider them within the framework of more comprehensive genetic constructs, may have contributed to further obscuring the situation. This holds true, in particular, for the two Indian languages most widely spoken in the Andes, Quechua and Aymara. After centuries of increasing uniformisation, considerable language diversity, as may once have existed everywhere in the Andes, is still found nowadays on the eastern slopes along the upper reaches of the Amazonian rivers. Lowland Bolivia and eastern Ecuador, as well as the Colombian and Peruvian Amazonian regions adjacent to Ecuador, are typical areas in this respect.
1.7.1 History of classificatory efforts Opinions about the origin of the South American Indians and their languages date back to the early years of European presence on the continent. If we leave aside the initial postdiscovery belief that America was part of Asia, Acosta in his Historia natural y moral de las Indias (Natural and Moral History of the Indies; 1590, Book I, chapters 20–4) was probably the first to intuitively sense the way in which the ancestors of America’s Indian 24 1 Introduction population had entered the New World in times long gone. He assumed that they came mainly over land, crossing estuaries whenever necessary, in small bands and without the intention of peopling a new continent. Acosta also indicated the areas where this could have taken place: in the Arctic regions, where the boundaries of the Old and New World had not yet been discovered by his contemporaries. It was precisely the diversity of nations and languages found throughout the Americas that incited Acosta to reject prevalent theories about shipwrecks and maritime expeditions organised by historically known peoples. Quite correctly, he predicted that such diversity must have taken a long time to develop. In the linguistic domain, Cobo in his Historia del Nuevo Mundo (History of the New World; 1653, Book XI, chapter 10) assigned a common origin to Quechua and Aymara, the two predominant languages in the Inca empire, on account of their being remarkably similar (cf. Mannheim 1985b, 1991). He compared their relationship to that of Spanish and Italian, both descendants from Latin. By doing so, Cobo underestimated the power of long-term contacts between two languages sharing a common geographic and cultural space, a fact of which he was well aware nevertheless. As he posited the genetic unity of Quechua and Aymara, Cobo also laid the foundations for a debate bound to continue until the present. In the eighteenth century, as a result of expanding missionary activity, the awareness grew that the languages of many South American ethnic groups could be reduced to a few comprehensive genetic units. Missionaries were well informed about the existence of a multitude of tribes speaking different languages in the South American lowlands. Among them, the Italian Gilij (1780–4) drew the contours of several genetic groupings of lasting validity, including Arawakan (still referred to as Maipuran at that time) and Cariban (Hoff 1968). Another representative of the church, the abbot Herv´as (1784, 1800–5), published a substantial amount of data, collected from, among others, Jesuit missionaries residing in Europe after their expulsion from the Spanish American domains in 1767. When pointing out the fact that the Omagua of the Upper Amazon valley and the Guaran´ıofthe Paraguayan missions spoke closely related languages, despite geographic separation and different environments, Herv´as showed himself to be aware of the existence of a Tupi–Guaran´ı linguistic family. Herv´as refers to information from Gilij, among others, when assigning a number of languages to the Maipuran or Arawakan family. Among them is the language of the Achagua people of the Colombian llanos ‘plains’, of whose good disposition towards Spanish rule and religion Herv´as speaks highly. On the other hand, his concept of the existence of a Cariban language family seemed to be partly inspired by the idea that the Caribs did not easily submit to Spanish colonial domination. Arawakan nations who resisted colonisation efforts were classified as Cariban in company with a number of unsubjugated North American tribes. Herv´as held an interesting concept of language families. If several languages appeared to be 1.7 Genetic relations of South American Indian languages 25 related, one of them was assigned the status of lengua matriz (‘mother language’ or ‘principal language’), from which the others, by extension, were derived. This approach must be viewed in relation to the necessity felt at that time to select a limited number of languages for the purpose of evangelisation carried out among a much larger number of linguistic groups. There is a definite contrast between the amount of information provided by Herv´as concerning the Andes and the Pacific coast on one hand and the Amazonian lowlands on the other. The latter is remarkably accurate and full of detail; the former exhibits many lacunae, particularly in the sphere of the minor languages spoken within the limits of the former Inca empire. As it appears, the availability of Quechua as a vehicle used for evangelising purposes made these languages less interesting in the opinion of the missionaries and information concerning them was not passed on systematically. It may explain why languages such as Culli, Sechura and Tall´an, spoken until the nineteenth century in the northern Peruvian coastal plains and highlands, and the Pre-Quechuan languages of highland Ecuador received almost no attention before they eventually became extinct. The nineteenth century marks a renewed interest in native American linguistics and ethnography. In the southern half of the Andean region, including the adjacent lowlands, the French traveller and natural scientist d’Orbigny recorded word lists and ethnographic information on many Indian groups (d’Orbigny 1839). His information on the lowland Bolivian tribes brought together in the Chiquitos and Moxos missions is particularly valuable. D’Orbigny’s classification of Indian nations is still primarily based on ethno- graphic and geographic rather than on linguistic considerations, however. Efforts towards a genetically based classification of the South American languages gained importance during the last quarter of the nineteenth century. Brinton published The American Race (1891), a survey of the native American peoples with a strong emphasis on linguistic classification. Almost simultaneously, Uhle (1890) proposed the existence of a Chibchan family, and Middendorf (1890–2) published his monumental work on the indigenous languages of Peru, in which he also discusses the nature of the relationship between Quechua and Aymara. Henceforth, two types of contributions must be distinguished, those aiming at overall classifications meant to include all documented South American languages, or, at the least, as many of them as possible, and those aiming at establishing genetic links between specific languages or language families. Both activities have been going on, often without much mutual feedback. Proposals concerning genetic relations involving native Andean languages have been numerous and often contradictory. Here they will be dealt with very selectively. We will first enumerate and discuss some of the proposed classifications. Propos- als concerning individual languages or language families will be treated subsequently. 26 1 Introduction
Classifications are termed conservative if their authors refrain from including genetic units of which the internal cohesion is still open to doubt. Virtually no classifications are entirely consistent in this respect. Two classifications stand at the conservative end of the scale, Loukotka (1968), with 117 independent units for all of South America and the Caribbean islands, and Kaufman (1990), with 118. At the other end we find Greenberg (1987) with one comprehensive Amerind phylum thought to include all the native lan- guages of South and Central America, the Caribbean and most of North America as well. Conservative classifications do not necessarily imply a rejection of possible com- prehensive groupings, as their authors explicitly indicate, but they are meant to provide a list of firmly established ‘shallow’ language families, which can be used in further rearrangements. One of the earliest overall classifications of the South American languages, apart from Brinton’s, is that of Chamberlain (1913). It is a conservative classification containing 84 groups, most of which are represented in the Andes and the eastern foothills. The lower number of units in relation to, for instance, Loukotka’s classification is due to the fact that many languages and small families were still absent from Chamberlain’s account. In Rivet’s classification of the South American and Caribbean languages, which ap- peared in Meillet and Cohen’s well-known handbook Les langues du monde (1924), some reductions in the number of groups can be observed (to a total of 77; expanded to 108 in the revised edition of 1952). These reductions reflect Rivet’s comparative views and concern, in particular, the Chibchan language family, to which several groups previ- ously thought independent had been added. Languages thus classified as Chibchan were Andaqu´ı, the Barbacoan languages (Cayapa, Colorado and Cuaiquer), the Coconucan languages (including Guambiano), the Paniquitan languages (including P´aez), Cof´an and Cuna. Rivet also proposed a subgroup consisting of Barbacoan, Cuna and the Costa Rican groups Guatuso and Talamancan (the latter comprising all Costa Rican languages except Guatuso). Further innovations of Rivet’s classification concern the inclusion of the Tacanan family of northern Bolivia within Arawakan and the inclusion of Uruan and Puquina (wrongly considered a unit) within that same family. Of these proposals only the classification of Cuna as a member of the Chibchan family became generally accepted. Mason’s contribution to the Handbook of South American Indians (1950) contains an extensive and very useful discussion of previous classificatory efforts. It also brings a further reduction of the number of language families. Proposals to establish new groupings are partly taken from other authors. For the Chibchan family and its possible expansions (classified as Chibchan, probably Chibchan, or doubtful), Jij´on y Caama˜no (1940–5) is the main source, although the latter’s proposals are presented with much re- serve. Following up suggestions of Rivet, Harrington (1944) and Jij´on y Caama˜no, Mason groups the Huitotoan, Boran and Zaparoan languages of the Colombian, Ecuadorean and northern Peruvian lowlands with Tupi–Guaran´ı. Two innovations in Mason’s 1.7 Genetic relations of South American Indian languages 27 classification are the terms Kechumaran and Ataguitan, the former referring to a ge- netic grouping consisting of Quechua and Aymara and the latter bringing together the Atacame˜no, Diaguita and Humahuaca languages of northern Argentina and northern Chile. Mason presents the proposed Ataguitan grouping with much hesitation. It has never gained much support, in particular, because the Diaguita and Humahuaca groups are virtually undocumented. In contrast, Mason considers Kechumaran ‘yet unproved but highly probable’ (again referring to Jij´on y Caama˜no’swork). Subsequently, the term has become widely used in the alternative spelling Quechumaran.Itisremarkable that Mason explicity excludes the Cauqui or Jaqaru language from this grouping, although Cauqui was already thought (and is now known) to be closely related to Aymara. McQuown’s classification (1955) follows Mason’s in several respects, including the acceptance of the Ataguitan and Quechumaran groupings, which he considers less con- vincing than Mason does (1955: 562), the unjustified exclusion of Cauqui and the ex- panded Tupi–Guaran´ı. On the other hand, it is a conservative classification since it allows for no less than 629 unclassified languages in South America in addition to 12 large fam- ilies and 38 minor families. Characteristic of McQuown’s contribution is an attempt to enumerate and locate on maps all the native Latin American languages ever mentioned in literature (1820 for all Latin America). Although doubtlessly useful, McQuown’s lan- guage list contains many items which are geographic denominations rather than language names. This is the case, in particular, of the Quechua-dominated middle Andean region, where names of towns, provinces and valleys figure as just as many separate languages. This procedure apparently rests on the assumption that Quechua was introduced at a recent stage in most places where it is now spoken (or known to have been spoken), and that in each case a different language must underlie it. The linguistic parcelling that results from it is merely hypothetical and is also accessory to a spectacular increase in the number of unclassified languages. In 1956 Greenberg presents a classification which is distinguished from the previous ones by its greater sophistication and classificatory explicitness (Greenberg 1960a). In it few South American languages, however poorly documented, remain unaccounted for. Although it was published without a factual justification of the groupings proposed, it became widely known after its appearance in the 1958 edition of the Encyclopaedia Britannica,inSteward and Faron (1959) and in Current Anthropology (Greenberg 1960b). Much later work on individual South American native languages begins with a statement locating the language at issue in Greenberg’s classification. It was also used and given credit in the development of anthropological and archaeological theory concerning past migrations; see, for instance, Lathrap (1970: 83) and Meggers (1979). Greenberg’s initial classification was superseded by a rather revolutionary proposal advanced in his book Language in the Americas (Greenberg 1987). It will be the subject of a separate section (1.7.4). Since the influence of Greenberg’s initial classification has been particularly great, the essentials of it regarding the Andean region are reproduced 28 1 Introduction
Table 1.3 Greenberg’s (1956) classification of the languages of the Andes
Yur umangui
- A. Chibchan proper Chibcha–Duit,Tunebo group, Aruaco group, Cuna–Cueva. B. Paezan Choco, Cuaiquer, Andaki, Paez–Coconuco, Colorado–Cayapa, Jirajira, Yunca (=Chim´u, Mochica), Atacameno (=Kunza), Itonama. –– A. Macro-Ge 1. Ge: Caingang, Chiquita, Guato. 2. Bororo. B. Macro-Panoan Tacana–Pano, Moseten, Mataco, Lule, Vilela, Mascoy, Charrua, Guaycuru-Opaie. [C. not applicable] D. Huarpe E. Macro-Carib Carib, Peban (=Yaguan), Witotoan. – A. Andean A 1. Ona, Yahgan (=Yamana), Alakuluf (=Kawesqar), Tehuelche, Puelche (=Gennaken), Araucanian (=Mapuche). 2. Quechua, Aymara. 3. Zaparoan (including Omurano, Sabela), Cahuapana. 4. Leco, Sec, Culle, Xibito–Cholon, Catacao, Colan. 5. Simacu (=Itucale, Urarina). B. Andean B Jibaro-Kandoshi, Esmeralda, Cofan, Yaruro. C. Macro-Tucanoan 1. Tucano (including Auixira), Ticuna, Muniche, Yuri, Canichana, Mobima. 2. Puinave. D. Equatorial Arawak (including Chapacura–Uanhaman, Chamicuro, Apolista, Amuesha, Araua, Uru), Tupi, Timote, Zamuco, Guahibo–Pamigua, Saliban, Otomaco–Taparita, Mocoa (=Kams´a, Sibundoy), Tuyuneri (=Toyeri, Harakmbut), Yurucare, Cayuvava. in table 1.3. For this classification see also Key (1979) with some minor orthographic variation. Occasionally, alternative language names are added in parentheses and pre- ceded by an equation sign in order to facilitate comparison with other classifications. There are minor differences between the versions in circulation of Greenberg’sclassifi- cation. In the Encyclopaedia Britannica and Current Anthropology versions, Greenberg’s classification is mapped onto McQuown’s (1955) language list; Simacu (better known as Itucale or Urarina) is classified as Macro-Tucanoan, not Andean; Atacame˜no (also known as Kunza) is left out (possibly as a result of a confusion with the extinct Ecuadorian language called Atacame or Esmeralde˜no); the subdivisions are more detailed and ex- plicit and their denominations (phylum, stock, family, subfamily) more differentiated. In the Steward and Faron version, Simacu is not mentioned. 1.7 Genetic relations of South American Indian languages 29
Table 1.4 The four networks proposed by Swadesh (1959, 1962)
-: Chibchan (in its limited sense, see below, but including Timote) and Tucanoan; most languages of Meso-America. -: Cariban (including Jirajaran), Zaparoan (including Yaguan), Arawa, Kaingangan (including Ge), Guamo and Guat´o; many language groups of Brazil. -: Campan (including part of the Arawakan family: Campa, Machiguenga, Chamicuro, Amuesha, and additionally Chirino), Arawakan, Guahiboan, Cams´a, Chapacuran, Saliban, Yur´ıan (including Ticuna and Cof´an), Mobima, Tup´ı, Bororoan (including Chiquitano), Sec, Lecoan (including Choc´o) and Mochica; several language groups of Brazil, Venezuela and Florida (Timucua). -: Quechuan (including Aymara, Cauqui and Uru–Chipaya), Paezan (including the Barbacoan and Coconucan languages, Andaqu´ı, Atacame˜no and the Brazilian Kapishana and Mashub´ı), Cullian (including Hibito–Chol´on), Itonama, Cayuvavan (including Esmeralda), Pano–Tacanan, Sonchon (which includes Moset´en, Chon and Hongote), Yuracare, Macuan (which includes Mac´u–Puinave, Het, Charr´uan, Ahuishiri, Zamucoan, Yurumangu´ı, Canichana, the Brazilian Ot´ı, Ofai´e–Xavante and Catuquina, and the Venezuelan Macu), Muran (which includes Jivaroan–Cahuapanan, Boran and Huitotoan, along with the Brazilian Matanau´ı and Mura–Pirah´a), Puelche (Gennaken), Huarpean, Urarina, Guaicuruan, Mapuchean (including Matacoan), Guachian (including Vilela and Guach´ıofMato Grosso, Brazil), Yamanan (including Alacaluf), Lule, Otomaco and Yaruro (Venezuela), Lengua-Mascoy (Paraguay), Trumai and Huari (Brazil). Outside South America: Tarascan (Mexico) and Zuni (New Mexico, USA).
Swadesh’s classification (1959, 1962) is to a certain extent comparable with Greenberg’s in that it seeks to account for as many languages as possible. Swadesh sees the differentiation of languages as a geographic continuum. He does not define it exclusively in terms of genetically independent units which are internally structured by chronologically ordered moments of splitting. Instead of the usual tree model, Swadesh opts for a model of interconnected networks designed to cover the whole world, not only the Americas. As in the case of Greenberg’s initial classification, the publication of factual evidence supporting the classification was announced but remained fragmentary. Swadesh’s classification is less well known than Greenberg’s. Nevertheless, some of the surface-level proposals brought forward in it have of late received renewed attention (for instance, the J´ıvaro–Cahuapana connection and the proposed link between Atacame˜no and the Brazilian KanoˆeorKapishana; see Kaufman 1990). Swadesh distinguishes four networks in South America represented in table 1.4. Tiniguan, Omurano and Nambikwara (Brazil) have a status independent from the networks. Some languages are left unclassified for lack of data: Puruh´a, Ca˜nari, Aconipa 30 1 Introduction
(Tabancale), Copall´en, Diaguita, Gorgotoqui, Humahuaca, Munichi, Sabela and Mayna (considered by others to be a group with Omurano). The networks are linked to each other at different points and also to the North American Macro-Hokan network. Loukotka (1968) was published posthumously by Wilbert. It had been preceded by several other classifications, elaborated by the same author, the first of which dates back to 1935. Loukotka’s work is well known among scholars of South American Indian languages because it provides the reader with short word lists of almost every language, whether spoken or extinct, that had been documented before 1960. Although the data presented are frequently inaccurate, the availability in one single work of some basic vocabulary of so many different languages constitutes an invitation to browsing and amateur linguistic comparison. Loukotka divides the South American and Caribbean languages into languages of Paleo-American tribes, languages of Tropical Forest tribes and languages of Andean tribes. This general division, together with its subsequent subdivisions, seems to have a geographic or an anthropological inspiration, rather than a linguistic one. More essential are the 117 genetic units (stocks, small stocks and isolated languages) which Loukotka distinguishes and his endeavour to assign to as many languages as possible a place in the classificatory framework which he develops. Loukotka’sclassification is conservative in the sense that the proposed groupings basically contain languages of which the genetic unity is unquestioned. Unnecessary splitting, such as observed elsewhere in the separa- tion of Aymara and Cauqui (Mason, McQuown) or in that of Puquina and Callahuaya (Kaufman 1990: 44), is successfully avoided. On the other hand, two cases of unjustified grouping occur, both of them concerning the northern Andes (see also Kaufman 1990: 37–8): (1) the inclusion into the Arawakan family of the Guahiboan languages of eastern Colombia; and (2) the inclusion into the Chibchan family of Yaruro, Esmeralde˜no and a substantial number of language groups of Ecuador and southern and eastern Colombia that are not visibly related to it: Betoi, Andaqu´ı, P´aez, Coconuco, Barbacoa and Sibundoy. The assignment of the Misumalpa family of Central America to the Chibchan stock must equally be rejected if indisputable internal genetic cohesion is to be the leading criterion (Constenla Uma˜na 1981). Loukotka’s postulation of two separate isolates in the south of Chile, Alacaluf and Aksan´as, will be discussed in chapter 6. In table 1.5, those of Loukotka’s117 stocks and families relevant to the Andean region are enumerated. We will distinguish three approximate categories: (I) groups located in the Andes and along the Pacific coast, (II) groups which are predominantly located in the eastern lowlands of Colombia, Ecuador, Peru, Bolivia and Argentina, and (III) groups which are strongly represented in other areas, but also in the Andes, or in the eastern lowlands as defined above. The original numbering is retained. As can be observed from table 1.5, 75 of Loukotka’s 117 units are represented in the Andes or in the eastern lowlands of the Andean countries. Admittedly, some groups are 1.7 Genetic relations of South American Indian languages 31
Table 1.5 Language families relevant to the Andes listed in Loukotka (1968)
I Linguistic groups located in the Andes and along the Pacific coast: Y´amana (1), Alacaluf (2), Aksan´as (3), Patagon or Tshon (4), Timote (95), Jirajara (96), Choc´o (97), Idabaez (98), Yurimangui (99),* Sechura (101), Catacao (102), Culli (103), Tabancale (104), Copall´en (105), Chim´u (106), Quechua (107), Aymara (108), Puquina (109), Uro (110), Atacama (111), Mapuche (113), Diaguit (114), Humahuaca (115), Huarpe (117). II Linguistic groups located in the eastern lowlands of Colombia, Ecuador, Peru, Bolivia and Argentina: Gennaken (5), Chechehet (6), Sanaviron (7), Vilela (9), Chiquito (13), Gorgotoqui (14), Tinigua (51), Yagua (54), Kahuapana (55), Munichi (56), Cholona (57), Mayna (58), Murato (59), Auishiri (60), Itucale (61), J´ıbaro (62), Sabela (63), Z´aparo (64), Cayuvava (71), Mobima (72), Itonama (73), Canichana (74), Tacana (76), Toyeri (77), Yuracare (78), Mosetene (79), Andoque (82), Uitoto (83), Bora (84), Cofan (100), Leco (112), Lule (116). III Linguistic groups partly or mainly represented in other areas: Guaicuru (8), Mataco (10), Lengua (11), Zamuco (12), Charrua (15), Kaing´an (16), Bor´oro (27), Tupi (45), Arawak (46), Otomac (47), Guamo (48), Piaroa (50), Tucuna (53), Chapacura (65), Pano (75), Guat´o (80), Tucano (81), Yuri (85), Mak´u (86), Araw´a (88), Karaib (89), Chibcha (94).
* The original sources refer to this group as Yurumangu´ı, which is also the name of a river in the present-day Colombian department Valle del Cauca. The form Yurimangu´ıisfound in Loukotka (1968) and, as Yurimangi, in Kaufman (1990, 1994). represented very marginally (Arawa, Guat´o, Kaing´an, Yuri). On the other hand, there is an additional enumeration of ‘unclassified or unknown languages’, some of which probably represent separate groups, and, as we saw earlier, the Arawak and Chibcha groups are subject to further splitting. Another relatively conservative classification was carried out by Su´arez, and published in the fifteenth edition (1974) of the Encyclopaedia Britannica.Itdistinguishes 82 language groups for all South America. In relation to Loukotka (1968), Su´arez proposes the groupings listed in table 1.6. Apart from these groupings, which apparently meet with Su´arez’s approval, he in- dicates possible additional linkings advanced by others. Of Su´arez’s groupings some are relatively well established, such as Lulean (Balmori 1967; Lozano 1977), and Pano– Tacanan (Key 1968) without its Fuegian–Patagonian extension. Others have been refuted (Choc´o and Cariban; Uru–Chipaya and Mayan, see below) or rest upon extensive bor- rowing (Quechua and Aymara). Kaufman’sclassification of 1990 (see also Kaufman 1994) is a conservative proposal, comparable to Loukotka’s insofar as the number of genetic groups (118 for all South America) is concerned. According to the author, every group ‘is either obvious on inspection or has been demonstrated by standard procedures’ (Kaufman 1990: 37). 32 1 Introduction
Table 1.6 Groupings suggested by Su´arez (1974) of language families and isolates included in Loukotka (1968)
Su´arez (1974) Loukotka (1968)
Alacalufan Aksan´as, Alacaluf Bora–Huitotoan Bora, Uitoto Cariban Choc´o, Karaib Guaycur´u–Charruan Charrua, Guaicuru Jebero–Jivaroan J´ıbaro, Kahuapana Lulean Lule, Vilela Macro-Chibchan Chibcha, Itonama, Warao (Venezuela), Yanoama (Brazil, Venezuela) Macro-Ge Bor´oro, Kaing´an and a number of Brazilian groups Macro-Mayan Uro and the Mayan languages (Meso-America) Macro–Pano-Tacanan Mosetene, Pano, Patagon/Tshon, Tacana, Yuracare Quechumaran Aymara, Quechua
Suggestions for further grouping are accompanied by the qualifications ‘good’, ‘good?’, ‘promising’, ‘probable’ or ‘maybe’. Considering the greater methodological rigidity observed by Kaufman, one may wonder why the number of groups in his classification are not substantially higher. This is mainly because a number of poorly documented extinct languages and language groups have not been included. In table 1.7 we have arranged the genetic groups of Kaufman’sclassification insofar as they concern the Andean region by using the same geographic distinctions as observed in relation to Loukotka’s work above. As can be deduced from tables 1.5 and 1.7, the extinct languages and language groups figuring in Loukotka’s classification, but not included in Kaufman’s, are Idabaez in Colombia, Tabancale and Copall´en in northern Peru and four Argentinian groups, Diaguit, Humahuaca, Chechehet and Sanaviron because they are undocumented. Kaufman observes that maybe Gorgotoqui should be excluded as well for the same reason.12 The differences between the two classifications reside in the treatment of the Chibchan family (L94) (Kaufman has six units where Loukotka has one), the Arawakan family (L46) (Kaufman keeps Guahibo apart from Arawakan), Puquina (L109) (Kaufman has two units where Loukotka has one), the Je family (K74) (Kaufman has one unit where Loukotka has two), and Kaw´eskar (K58) (Kaufman has one unit where Loukotka has two). Otherwise, apart from a minor readjustment concerning the demar- cation between Arawakan and Har´akmbut/Toyeri (K18, L77), the two classifications are identical insofar as the Andean region is concerned.
12 The Gorgotoqui people are well attested historically, and so is the existence of a grammar of the language written by a father Ru´ız (Gonzales de Barc´ıa 1737–8). Unfortunately, no one has been able to locate this grammar in recent years. 1.7 Genetic relations of South American Indian languages 33
Table 1.7 Language families relevant to the Andes listed in Kaufman (1990) with their correlates in Loukotka (1968)*
I Linguistic groups located in the Andes and along the Pacific coast: Yurimangi (1, L99 Yurimangui), Tim´otean (2, L95 Timote), Hirah´aran (3, L96 Jirajara), Chok´o (4, L97 Choc´o), P´aesan (6, L94 Chibcha: Andaqu´ı/ Paez/ Coconuco), Barbak´oan (7, L94 Chibcha: Barb´acoa), Ezmeralda (27, L94 Chibcha: Esmeralda), Chim´uan (41, L106 Chim´u), Kulyi (43, L103 Culli), Sechura (44, L101), Katak´aoan (45, L102 Catacao), Kechua (47, L107 Quechua), Haki (48, L108 Aymara), Chipaya (49, L110 Uro), Pukina (50, L109 Puquina: Puquina), Kolyawaya (51, L109 Puquina: Callahuaya), Chon (56, L4 Patagon or Tshon), Y´amana (57, L1), Kaw´eskar (58, L2+L3 Alacaluf, Aksan´as), Mapudungu (59, L113 Mapuche), Warpe (61, L117 Huarpe), Kunsa (99, L111 Atacama). II Linguistic groups located in the eastern lowlands of Colombia, Ecuador, Peru, Bolivia and Argentina: Betoi (5, L94 Chibcha: Betoi), Kams´a (10, L94 Chibcha: Sebondoy), Tin´ıwan (11, L51 Tinigua), Wah´ıvoan (15, L46 Arawak: Guahibo), Har´akmbut (18, L46+L77 Arawak: Mashco, Toyeri), Tekiraka (21, L60 Auishiri), Kanichana (22, L74 Canichana), Munichi (26, L56), Kof´an (29, L100 Cof´an), Kandoshi (30, L59 Murato), H´ıvaro (31, L62 J´ıbaro), Kawap´anan (32, L55 Kahuapana), S´aparoan (33, L64 Z´aparo), Y´awan (34, L54 Yagua), Omurano (35, L58 Mayna), Sabela (36, L63), Urarina (37, L61 Itucale), B´oran (38, L84 Bora), Wit´otoan (39, L83 Uitoto), Andoke (40, L82 Andoque), Chol´onan (42, L57 Cholona), Leko (46, L112 Leco), Yurakare (52, L78 Yuracare), Tak´anan (54, L76 Tacana), Moset´en (55, L79 Mosetene), Puelche (60, L5 Gennaken), Lule (65, L116), Vilela (66, L9), Gorgotoki (69, L14 Gorgotoqui), Chikitano (70, L13 Chiquito), Itonama (98, L73), Movima (107, L72 Mobima), Kayuvava (108, L71 Cayuvava). III Linguistic groups partly or mainly represented in other areas: Ch´ıbchan (8, L94 Chibcha), Otom´akoan (12, L47 Otomac), Wamo (13, L48 Guamo), Chapak´uran (14, L65 Chapacura), Maip´urean (16, L46 Arawak), Araw´an (17, L88 Arawa), Puin´avean (19, L86 Mak´u), Tuk´anoan (23, L81 Tucano), Tikuna (24, L53 Tucuna), Jur´ı (25, L85 Yuri), Jaruro (28, L94 Chibcha: Yaruro), P´anoan (53, L75 Pano), Mat´akoan (62, L10 Mataco), Waikur´uan (63, L8 Guaicuru), Charr´uan (64, L15 Charrua), Mask´oian (67, L11 Lengua), Sam´ukoan (68, L12 Zamuco), Bor´oroan (71, L27 Bor´oro), Je (74, L16+L24, Kaing´an, Ge), Guat´o (82, L80), Tup´ıan (109, L45 Tupi), K´ariban (110, L89 Karaib), S´alivan (114, L50 Piaroa).
* Along with the group numbers introduced by Kaufman, the numbers of Loukotka’s classifi- cation are given in the formula Lx, followed by his group or language names when different from those used by Kaufman. In the main text, Kaufman’s group numbers are referred to as Kx.
Kaufman suggests that further grouping may be possible in the following cases (the spelling is Kaufman’s):P´aesan (K6) and Barbak´oan (K7); Chibchan (K8) and Misumalpa (a Central American group); Wamo (K13) and Chapak´uran (K14); Tikuna (K24) and Jur´ı (K25); Ezmeralda (K27) and Jaruro (K28); H´ıvaro (K31) and Kawap´anan (K32); S´aparoan (K33) and Y´awan (K34); B´oran (K38), Wit´otoan (K39) and Andoke (K40); Sechura (K44) and Katak´aoan (K45); Kechua (K47) and Haki (K48); Pukina (K50) and Kolyawaya (K51); P´anoan (K53) and Tak´anan (K54); Moset´en (K55) and Chon (K56); 34 1 Introduction
Lule (K65) and Vilela (K66); Chikitano (K70), Bor´oroan (K71), Je (K74), Guat´o (K82) and nine other Brazilian groups; Kunsa (K99) and Kapishan´a. In connection with other classifications, we commented en passant upon some of these suggested groupings (Lule and Vilela, Panoan and Tacanan, Chibchan and Misumalpa, Aymara and Quechua). Arguments for a comprehensive Macro-Ge grouping including Ge, Guat´o and Bororoan, as well as several other language groups, can be found in Davis (1968) and in Rodrigues (1986, 1999), but do not, as yet, extend to Chiquitano. The proposal of a special genetic relationship between Esmeralda and Yaruro was ad- vanced by Seler (1902); see section 2.19. Loukotka located them in the same subgroup of Chibchan. Doris Payne (1984) has presented evidence for a genetic relationship of Yaguan (K34) and Zaparoan (K33). Callahuaya (Kaufman’s Kolyawaya) is a profes- sional jargon composed of roots taken from a Puquina dialect and Quechua endings (see section 3.5). The unity of the extinct Sechura and the equally extinct Tall´an languages of the Piura area (K45 Katak´aoan) was proposed by Rivet (1949) but the evidence for it considered inconclusive by Torero (Torero 1986); see section 3.9.2. The possibility of a genetic relationship between the Boran and Huitotoan languages, on one hand, and Andoque, on the other, was considered unconvincing by a leading expert on this lan- guage (Landaburu 1979). As for the proposed special relationship between the Kunza or Atacame˜no language and the Brazilian KanoˆeorKapishana, the geographic and cultural barriers seem formidable, and there would need to be a very strong linguistic case to support it (see also section 3.7).
1.7.2 Quechuan and Aymaran, Quechumaran The two dominant language groups of the central Andes, Quechuan and Aymaran, must be viewed as language families rather than as single languages. Traditionally, however, Quechuan is more often referred to as the Quechua language. The internal comparison of Quechuan, a linguistic entity consisting of numerous local varieties, became an issue in the 1960s when Parker (1963) and Torero (1964) published their well- known articles about the Quechua dialect situation (see section 3.2.3). Hardman (1975, 1978a, b) introduced the name Jaqi (‘man’, ‘human being’) for the Aymaran family, which, according to her, has three living members to be treated as separate languages: Aymara, Cauqui and Jaqaru. For a discussion of the terminology and a justification of our use of the term Aymaran see section 3.1. Quechua(n) and Aymara(n) have repeatedly been compared to each other, but rarely to other languages. Harrington (1943) suggested a relationship between Quechua and Hokan; Dum´ezil (1954, 1955) compared some of the Quechua numerals to those of Turkish. Following an unconvincing first attempt by Swadesh (1967), Liedtke collected a list of lexical and grammatical resemblances between Quechua and Tarascan, some 1.7 Genetic relations of South American Indian languages 35 of which are quite suggestive (Liedtke 1996). No thorough comparative study has been carried out, however. Orr and Longacre (1968) set out to prove Mason’sQuechumaran hypothesis by trying to reconstruct the phoneme system and part of the lexicon of the proto-language un- derlying it. Although they apparently achieved their aim, the lexicon they reconstructed consists almost exclusively of shared vocabulary, which is evidently due to intensive borrowing between the two languages at an early stage of their development. Given the virtually identical form of the shared items, the radically different character of the remainder of the lexicon is left unexplained. The same holds for the grammatical com- ponents of the two language groups, which show quite a few semantic but hardly any formal similarities (Davidson 1977). Notwithstanding the lack of proof, the idea of a Quechumaran genetic unity exclusive of all other languages still has supporters. For an attempt to revive the Quechumaran hypothesis on a more sophisticated basis see Campbell (1995). The relationship between the Quechuan and Aymaran linguistic families is indeed unique. When the effects of loan traffic between individual Quechua dialects and the different languages of the Aymaran family are left aside, a substantial basis of common lexicon remains (about 20 per cent of the root vocabulary in each group), which can be traced back to the proto-languages. The phoneme inventories of the two proto-languages were probably very similar, as most of the existing differences may be explained by later internal developments in each of the two families. The existence of glottalised and aspirated consonants in Aymaran and in a number of Quechua dialects (Cuzco, Puno, Arequipa, north and south Bolivian Quechua) is generally attributed to diffusion from Aymaran into Quechuan, although its distribution within the latter group is far from predictable (see section 3.2.5). Morphological and lexico-semantic coincidences are highly specific and difficult to ascribe to parallel developments of a typological nature (see chapter 3 for more details). For a systematic inventory and discussion of all the coincidences see also Cerr´on-Palomino (1994a). The obvious similarities that have united Quechuan and Aymaran since the stage of the proto-languages stand in contrast with differences that are equally impressive. The very characteristic phonotac- tics and vowel suppression rules of all Aymaran languages (see section 3.3.4) are not found in Quechuan. The structure of the verbal inflection, personal reference marking in particular, differs considerably between the two language groups, and, of course, a major part of the lexicon and affixes do not show any systematic formal relationship at all. All this leads to the conclusion that Proto-Quechua and Proto-Aymaran were spoken in contiguous areas, if not in the same area, which were probably situated in central Peru, the heartland of the Middle Andean civilisation. The bi-directional loan influence between the two linguistic families was so intense, that possible surviving correspondences of a 36 1 Introduction genetic kind became hard to detect. If the languages were not genetically related – and there is no decisive evidence that they were – at the least one of them must have suffered a profound structural transformation adopting the phonological and morphosyntactic model represented by the other. This scenario presupposes a period of intense interaction and common development prior to the stage of the proto-languages. It may have begun well before the beginning of our era. Although it is risky to venture a statement on such a speculative matter, a variety of Aymaran would be the best candidate for having provided such a model because of the more homogeneous character of Aymaran verbal and nominal inflection in comparison to Quechuan inflection. The remaining languages of the central Andean region do not participate in the same sort of lexical and grammatical entwining that characterises the relationship between Quechuan and Aymaran, although lexical borrowing has occurred. Since these languages have been poorly studied so far, further research may eventually cast additional light on their relationship with either Quechuan, or Aymaran (or both).
1.7.3 Other proposals for individual language families As we anticipated, there have been many proposals of genetic connections between specific groups which were formulated outside the framework of an overall classification. For the earlier period (before 1960), two scholars, Rivet and Jij´on y Caama˜no, deserve to be specially commended for the size of their contribution to South American Indian linguistics, including much classificatory work. Many of their classificatory proposals have been the subject of drastic reconsideration. Therefore, it is not necessary to treat them in detail here, but the amount of data they brought together and their influence have been considerable. Among Rivet’s classificatory contributions we find the proposed connection of Arawakan and Tacanan (see above), the inclusion of Uru and Puquina within the Arawakan family (see also above), a rearrangement of the Chibchan family involving many groups in Ecuador and southern Colombia (see also above), and the association of the isolated Yurumangu´ı language of the Colombian Pacific coast with Sapir’s Hokan phylum (Rivet 1942). Well known was also Rivet’s conviction that the Chon languages (Tehuelche and Ona) of Patagonia were genetically related to languages spoken by the Australian aborigines (Rivet 1925). Rivet’s comparative methods have met with much criticism. In the case of Yurumangu´ı, for instance, he compared the vocabulary of this poorly documented extinct language with that of a wide array of North American and Mesoamerican languages of supposed Hokan affiliation. A lexical similarity between Yurumangu´ı and any of these languages would be considered evidence of a genetic relationship. In his monumental El Ecuador Interandino y Occidental antes de la Conquista Espa- nola˜ (Inter-Andean and Western Ecuador before the Spanish Conquest, 1940–5), Jij´on y Caama˜no assigned most languages of northwestern South America to a Macro-Chibchan 1.7 Genetic relations of South American Indian languages 37 phylum, which, in its turn, would fit into Hokan–Siouan. His Macro-Chibchan was more comprehensive than any of the previous proposals concerning Chibchan and its connections. Jij´on y Caama˜no’s Macro-Chibchan not only included all the languages in Loukotka’sChibcha, but also Timote, Cof´an, Murato (Candoshi), Yurumangu´ı, Mochica (Chim´u), Cholona and the Central American Lenca, Xinca, Jicaque and Subtiaba. By contrast, Tucano and Huitoto–Bora–Z´aparo are listed as separate phyla. Jij´on y Caama˜no’sinterpretation of the comparative method has been much criticised, inter alia, for its acceptance of systematic equations of phonetically unrelated sounds. Constenla Uma˜na (1981) mentions some striking examples of this procedure. Nevertheless, both Rivet and Jij´on y Caama˜no must be credited with having brought to public attention a wealth of data on many extinct and poorly documented languages, which until then had been virtually unknown. Recent investigations of the Chibchan family have tended to reduce the number of languages associated with it. In his thorough phonological reconstruction of Proto- Chibchan, Constenla Uma˜na (1981) found that the Barbacoan, Paezan, Andaqu´ı, Kams´a, Betoi, Jirajaran and Misumalpa languages are not Chibchan. What is left is a family based primarily in Central America and represented in Colombia and Venezuela by the Cundinamarcan Chibchan languages Muisca and Duit, Tunebo, the Arhuacan languages of the Sierra Nevada de Santa Marta, Bar´ı, Chimila and Cuna. In a revised version of his reconstruction, Constenla Uma˜na (1989) proposes a Paya–Chibchan family con- sisting of a Paya branch (represented by the sole Paya language of Honduras) and a Chibchan branch. The Chibchan branch comprises several subgroups. One of them is a Colombian Chibchan group which comprises the Arhuacan, Tunebo and Cundinamarcan Chibchan languages. Cuna is found to belong to a different subgroup with the extinct Dorasquean languages of Panama. Chimila and Bar´ı remain unclassified as to subgroup for lack of data. Several poorly documented languages once spoken in the Colombian department of Antioquia (Nutabe, Cat´ıo Chibcha) are also classified as Chibchan. The linguistic evidence seems to point to a relatively recent arrival of the Chibchan people from Central America, making it less likely for all proposed South American connec- tions to be correct. The alleged genetic relationship of Chibchan with Warao (in the Venezuelan Orinoco delta) and with Yanomama (in the Brazilian–Venezuelan border lands; Greenberg 1959, 1960a, b; Migliazza 1978a) has been the object of an investi- gation by Weisshar (1982). Among many other similar proposals, we may mention that of L´evi-Strauss (1948), who suggested a genetic relationship between Chibchan and the Brazilian Nambikwara languages (refuted in Constenla Uma˜na 1981). The reduced Chibchan family, such as proposed by Constenla Uma˜na, is almost the same as that originally outlined by Uhle (1890). The affinities of the different mem- bers of the family thus being reconsidered, many languages previously classified as Chibchan are again left unclassified. There is convincing evidence that the Barbacoan 38 1 Introduction languages Cayapa, Colorado and Cuaiquer are related in a family which also includes the Coconucan languages Guambiano and Totor´o (Constenla Uma˜na 1991; Curnow and Liddicoat 1998). Whether the extinct languages of the northern Ecuadorian highlands and the adjacent highlands in the Colombian department of Nari˜no (Cara, Pasto) also belonged to the same grouping is a question which deserves further investigation (see section 3.9.1). Although Greenberg and Kaufman classify Guambiano as Paezan, the distance between Guambiano and P´aez seems to be greater than that between Guambiano and the Barbacoan languages. The position of P´aez, Andaqu´ı, Kams´a and Betoi requires renewed attention. The Cariban language family, which has it greatest concentration of speakers in the Guyanas and eastern Venezuela, is represented in the Colombian–Venezuelan border mountains, west of Lake Maracaibo, with the Yukpa or Motilones group. Rivet (1943a) assumed a more generalised presence of Carib-speaking peoples in the Colombian Andes, by assigning the (extinct) Muzo, Colima, Panche, Pijao, Pant´agora and Op´on– Carare languages of the Magdalena basin to the same family. He also believed the Chocoan languages of the Colombian Pacific area to be related to Cariban (Rivet 1943b). Durbin and Seijas (1973a, b, 1975) have shown that only the Op´on and Carare languages, located to the northwest of the Cundinamarcan highlands, were demonstrably Cariban. Of the other languages Muzo and Colima may have been Cariban as well. For the three remaining languages, however, the lexical similarity with Cariban is not such that it can provide the assumed relationship with a solid basis (see section 2.11). The alleged connection of Chocoan and Cariban has been superseded by Greenberg’s proposal relat- ing the Chocoan languages to Paezan. For similarities between Chocoan and Barbacoan see section 2.3. Similarly, Constenla and Margery (1991) have published evidence for a relation between Chocoan and Chibchan. Genetic connections have been sought between the Mochica language (also known as Yunga and, erroneously, as Chim´u) and Chibchan (Jij´on y Caama˜no, Greenberg), and also with the Mayan language family in Mesoamerica (Stark 1972a, 1978). Another language that has been associated with Mayan is Mapuche (Stark 1970, Hamp 1971). Olson (1964, 1965) has proposed a genetic relationship between Mayan and the Chipaya language of the Bolivian altiplano (closely related to Uru). As this theory became widely accepted – Longacre (1968: 320) considers it proven – Uru–Chipaya came to be included, along with Mayan, into the North America-based Macro-Penutian phylum in published classifications of the North American and Mesoamerican languages (e.g. Voegelin and Voegelin 1965). Campbell (1973) later showed that many of the similarities observed by Olson between Chipaya and Mayan could have been the result of contact with Quechua or Aymara. Very little is left of the arguments that seemed to have convinced the Americanist linguistic community for some time. A serious drawback is the lack of a good grammar 1.7 Genetic relations of South American Indian languages 39 and dictionary of Uru and Chipaya, which makes verification a difficult task for the non-initiated. Family-internal reconstruction work was carried out for Panoan by Shell (1965, 1975), and for Tacanan by Key (1968) and by Girard (1971). Another complex of proposals concerns the connection of Panoan and Tacanan with the Bolivian Moset´en language (Su´arez 1969) and the relationship of both groups to Uru–Chipaya, to Yuracar´e (also in Bolivia) and to the Chon languages of Patagonia and Tierra del Fuego (Su´arez 1973, following Swadesh 1962 in the latter two suggestions). Key (1978) compared Pano– Tacanan and Moset´en to Araucanian. The extensive Arawakan or Maipuran family has been the subject of much comparative and classificatory work (Shafer 1959, Kingsley Noble 1965, Matteson 1972, Tovar 1986, Valenti 1986, Payne 1991a). Kingsley Noble includes the Uru and Puquina groups and Araw´ainhis comparison. Arawakan and Araw´a share rather specific features of their gender systems but are quite far apart lexically. Matteson includes Madi (Araw´a) and Harakmbut, a procedure which is rejected by Tovar. David L. Payne (1991a) does not include Araw´a and Harakmbut within the Arawakan family. Harakmbut has now been shown to be related to the Brazilian Katukina family with a suggested further connection to Macro-Ge (Adelaar 2000). Although Arawakan as a whole constitutes a closely-knit family with strong lexical resemblances (Rodrigues 1986: 70), the supposed affiliation between Arawakan and a number of languages in the Andean region has met with reserve. Such languages are Amuesha (recognised as Arawakan by Tello in 1913), Apolista (shown to be Arawakan by David L. Payne 1991b), Chamicuro (Parker 1987) and Res´ıgaro. Although Amuesha (spoken in the Andean foothills of the Peruvian department of Pasco) seemed at the best a highly divergent member of the Arawakan family, Wise (1976) convincingly demonstrated that Amuesha is closely related to pre-Andean Arawakan groups, such as Campa, Machiguenga and Piro, the relationship being obscured by rather unusual phonetic changes that took place in Amuesha. These changes must have occurred recently because they have also affected loan words from Spanish. The classificatory status of Res´ıgaro is discussed by Allin (1975) and by David L. Payne (1985); see also Aikhenvald (2001). The fact that Uru–Chipaya and Puquina are by no means closely related invalidates their inclusion into Arawakan as a subgroup, a hypothesis which is nevertheless defended by de Cr´equi-Montfort and Rivet (1925–7), Kingsley Noble (1965) and Greenberg (1987). It does not, however, preclude the possibility that one of the two languages, i.e. Puquina, exhibits a remote Arawakan affinity. There are, in fact, similarities in the lexicon and, above all, in the pronominal system (cf. section 3.5). The ongoing inves- tigation of the internal relations within the Arawakan family, which once spread over 40 1 Introduction much of South America and the Caribbean, holds great promise for the unravelling of the continent’s linguistic puzzle. David L. Payne (1990) has pointed at some very striking similarities concerning the formation of possessive nouns in four South American families, Arawakan, Araw´a, Cariban and Candoshi, which are difficult to explain through borrowing. On the other hand, Rodrigues (1985a) has presented well-documented lexical evidence relating the Cariban family to the Tupi stock (tronco Tup´ı ), a huge genetic construct attaining its maximum differentiation in the Brazilian Madeira basin. Lexical similarities between Tupi and Macro-Ge (tronco Macro-Jˆe)were already noticed by Davis (1968, 1985) and confirmed in Rodrigues (1985b). The suggested genetic relationship of Tupi, Cariban and Macro-Ge is supported by typological similarities (a relatively loose morphological structure and a lack of polysynthesis), as observed by Doris Payne (1990b); see also Rodrigues (2000). David L. Payne (1981) investigated the alleged relationship of the Jivaroan languages and Candoshi (proposed in both Greenberg’s classifications). Although he found lexical similarities, these lay in the sphere of flora and fauna and seemed to point at bor- rowing. David L. Payne (1990: 84–5) no longer considers the evidence for a J´ıvaro– Candoshi grouping convincing, but he mentions some grammatical similarities between Arawakan, Candoshi and Cariban. In Su´arez’sclassification (1974), Jivaroan is linked to Cahuapanan, another small language family of the northern Peruvian foothills. Kaufman (1990) considers this a possible relationship. Among several other suggestions, Kaufman (1994: 63) offers an interesting new proposal concerning a genetic relationship between Candoshi, Omurano (Mayna), and Taushiro (all in the Peruvian Amazon). Anew proposal concerning a possible genetic relationship between two linguistic groups that had never been associated before has been made by Croese and Payne (Croese 1990). They observed rather striking lexical similarities between Araucanian (Mapuche) and the Arawakan family. The matter requires further investigation. For the Panoan languages Migliazza (1978a, mentioned in Migliazza 1985) proposes a rather close genetic relationship with Yanomaman,based on a cognate number of about 40 per cent, and a more remote relationship with Chibchan (see above). The Guaicuruan language family (including Toba as its principal representative in the pre-Andean space) and the Matacoan language family have, together with a third group, Lengua–Mascoy, their centre of gravity in the Gran Chaco. Tovar (1981) found considerable lexical resemblance between Toba and the Matacoan languages. Whether this is due to a common genetic origin or borrowing is an issue awaiting further in- vestigation. Tovar also observes lexical similarities between Matacoan and Arawakan. In Greenberg’s classification Matacoan, Guaicuruan and Lengua–Mascoy are taken together. For Charr´uan, a group of extinct languages once located in Uruguay and 1.7 Genetic relations of South American Indian languages 41
Argentinian Entre R´ıos, Arawakan, Matacoan, Lule–Vilelan and Guaicuruan connec- tions have been proposed by Perea y Alonso (1937), Ferrario (Ms), Rona (1964) and Su´arez (1974), respectively. See for the three first proposals the discussion in Longacre (1968: 353–4), who seems to give most credit to Ferrario’s arguments for the Matacoan connection, cited in Censabella (1999: 61). Suˇsnik (1978: 94) appears to favour a Guaicuruan connection. Among the languages of the southernmost part of South America, a grouping was recognised as early as 1913 by Lehmann-Nitsche. It consisted of the Patagonian languages (Tehuelche, Tehues) and the languages of Tierra del Fuego’s main island (Selk nam or Ona, Haush). This grouping was called Tshon or Chon,adenomination that includes elements of the words ‘Tehuelche’ and ‘Ona’. For many languages once spoken in Argentina it will probably never be possible to even approximately determine a genetic affiliation because the populations in question were exterminated before their languages could be recorded. Viegas Barros has proposed a genetic relationship between Kawesqar and Yahgan (Censabella 1999: 88). A final word about possible trans-Pacific genetic connections. Although there have been many proponents of such connections (Rivet 1925; Imbelloni 1928; Ibarra Grasso 1958), no valid arguments were brought forward to support them. The search for them, however, has shown at the least two lexical items shared by Polynesian languages and languages in South America. One of them is the name of a plant domesticated in the New World, the sweet potato (Ipomoea batatas), Easter Island kumara,Hawaiian ʔuala, which is found as k’umar or k’umara in Quechua and Aymara. The second word is toki, Easter Island ‘stone axe’, Mapuche ‘stone axe’, ‘military chief (the holder of the axe)’; compare also Yurumangu´ı totoki ‘axe’ (Jij´on y Caama˜no 1945). Although the former case constitutes near proof of incidental contact between inhabitants of the Andean region and the South Pacific, the latter is not nearly as convincing but certainly deserves attention. Apparently, there were sporadic contacts that led to an occasional interchange of words, not to migrations of entire populations that could have brought along their languages.
1.7.4 The Greenberg (1987) proposal The appearance of Greenberg (1987) brought the discussion about the origins of lan- guage, both in the Andes and elsewhere in the New World, into a new phase. Rather than merely containing the long expected factual justification of Greenberg’searlier proposal, which, in part, it did, the terrain of comparison was widened to include the whole of native America. It also brought a revision of the classification of the South American Indian languages proposed before. In short, Greenberg (1987) contains the following new elements: 42 1 Introduction
A. All the languages of the New World belong to three families: Eskimo–Aleut, Na– Dene and Amerind; Eskimo–Aleut and Na–Dene are limited to the Arctic and parts of North America. Consequently, all South American and Mesoamerican Indian languages, as well as most North American Indian languages are related. They belong to a single family: Amerind. The tripartite division of the native American languages is associated with three consecutive waves of migration, the first of which is represented by speakers of Amerind. Support for this hypothesis is sought from physical anthropology (blood groups, dental structure) and archaeology; a first outline of it had already been published, before the appearance of Greenberg (1987), in Greenberg, Turner and Zegura (1985, 1986). B. The South American languages are divided into seven subgroups: Macro-Ge, Macro-Panoan, Macro-Carib, Equatorial, Macro-Tucanoan, Andean and Chibchan– Paezan. As in Greenberg’s earlier classification, one language, Yurumangu´ıisassigned to a North American subgroup, Hokan. In addition, Macro-Ge, Macro-Panoan and Macro-Carib are said to form a group at an indermediate level, which corresponds to the Ge–Pano–Carib of Greenberg’s earlier classification; the same holds for Equa- torial and Macro-Tucanoan, which formed part of Andean–Equatorial in the earlier classification. Andean–Equatorial as such is abandoned. So there are four groups at the intermediate level between Amerind and the seven subgroups just enumerated: Ge– Pano–Carib, Andean, Equatorial–Tucanoan and Chibchan–Paezan (the former Macro- Chibchan). For Amerind as a whole, including North America and Mexico, Greenberg posits eleven subgroups and six groups at the intermediate level. As Swadesh (1962) did before him, Greenberg finds more genetic diversity in South America than in North America (except for the presence of non-Amerind Na–Dene and Eskimo– Aleut). C. Chibchan–Paezan receives extensions in North America and elsewhere in the Americas. Its Chibchan division is made to include Tarascan and Cuitlatec, two lan- guage isolates located in Mexico; the Paezan division now includes Timucua, a lan- guage isolate once to be found in Florida. Huarpean (Allentiac), originally classified as Ge–Pano–Carib, has been reassigned to Paezan, where it finds itself together with Atacame˜no. J´ıvaro–Candoshi and the languages associated with it (Esmeralda, Yaruro and Cof´an), originally a separate division of Andean–Equatorial, have been reclassi- fied as Equatorial. Quechua and Aymara are both classified as Andean but no longer treated as a unit. Greenberg (1987: 100) admits ‘that Aymara appears relatively isolated within Andean’. Greenberg’s new classification comes at a point in time when historical linguists tend to be increasingly reluctant to accept distant genetic relationships if not accompanied by solid proof (see, in particular, Campbell and Mithun 1979). Greenberg shows him- self highly critical of the current methods of obtaining such proof, ‘the use of sound 1.7 Genetic relations of South American Indian languages 43 correspondence tables and asterisked reconstructed forms’ (1987: 1). Instead, Greenberg advocates the search for lexical and grammatical similarities that become apparent from a comparison of many languages at the same time (‘multilateral comparison’). He does not pursue phonetic exactitude and considers it premature to look for regular sound cor- respondence lest other significant similarities should be missed. Greenberg also allows for a substantial amount of error in his data: ‘the method of multilateral comparison is so powerful that it will give reliable results even with the poorest of materials. In- correct material should have merely a randomizing effect’ (1987: 29). As a matter of fact, the data included in Greenberg (1987) are riddled with errors. For instance, Cochabamba Quechua ‘to see’ is given as ruk, instead of rikhu-, and quechuologists are puzzled about identifying the ‘Huanacucho dialect’ (probably a designation for Ayacucho affected by a confusion with the name of some other dialect, such as Hu´anuco or Huanca). Old misunderstandings are perpetuated and even reinforced. For instance, the alleged unity of Uru–Chipaya and Puquina is not only defended, but Olson’s work is quoted as evidence for it (Greenberg 1987: 84). In reality, Olson (1964) merely observed that some Chipaya call their language Puquina, which says nothing about the undeni- able fact that there also existed a Puquina language quite distinct from present-day Chipaya. Identical namegiving is no proof of identity. On the other hand, possible close connections, such as Guambiano and Barbacoan, or Harakmbut and Katukina, were missed. It is not surprising that Greenberg’s work has met with vigorous criticism; see, for instance, the discussion in Current Anthropology 28: 647–67, Kaufman (1990), and the reviews by Adelaar (1989) and Matisoff (1990). Nevertheless, not all his proposals should be dismissed lightly. Some of the proposed genetic links will undoubtedly turn out valid, even though the factual basis is still insufficient. Greenberg also gives an inventory of grammatical elements that are widespread in the Amerindian languages. Of some cases Greenberg was not the first to have noticed them (see, for instance, Swadesh 1954). The grammatical elements in question are not merely cases of typological resemblances because they concern the formal aspects of morphemes. One of the best-known cases is a pattern consisting of n,oranother non-labial nasal, for reference to first person, in combination with m for reference to second (both usually followed by a vowel). One may be tempted, for instance, to investigate the possibility of a genetic link between, say, Araucanian and Californian Penutian on the basis of Araucanian nyi ‘my’ and mi ‘your’, on the one hand, and Wintu ni ‘I’ and mi ‘you’, on the other, only to find out later that many other Amerindian languages exhibit similar or related patterns of personal reference. Whatever the origin of such resemblances may be, they can hardly be due to borrowing. In table 1.8, we summarise the classification proposed in Greenberg (1987) insofar as it concerns the languages and language groups located in the Andean region. Alternative 44 1 Introduction
Table 1.8 Greenberg’s (1987) classification of the languages of the Andes
I. C. Hokan 5. Yurumangui. III. - A. Chibchan 3. Nuclear Chibchan: a. Antioquia (incl. Katio, Nutabe). b. Aruak (incl. Guamaca and Kagaba). c. Chibcha (incl. Duit and Tunebo) d. Cuna. f. Malibu (incl. Chimila). h. Motilon. B. Paezan 1. Allentiac (incl. Millcayac). 2. Atacama. 3. Betoi. 4. Chimu. 5. Itonama. 6. Jirajara. 8. Nuclear Paezan: a. Andaqui. b. Barbacoa (incl. Cara, Cayapa, Colorado and Cuaiquer). c. Choco. d. Paez (incl. Guambiano). IV. A. Aymara Aymara, Jaqaru. B. Itucale–Sabela 1. Itucale. 2. Mayna. 3. Sabela. C. Kahuapana–Zaparo 1. Kahuapana (incl. Jebero and Chayahuita). 2. Zaparo (incl. Arabela and Iquito). D. Northern 1. Catacao. 2. Cholona (incl. Hibito). 3. Culli. 4. Leco. 5. Sechura. E. Quechua. F. Southern 1. Alakaluf. 2. Araucanian. 3. Gennaken (=G¨un¨una K¨une). 4. Patagon (incl. Ona). 5. Yamana. V. – A. Macro-Tucanoan 1. Auixiri. 2. Canichana. 10. Mobima. 11. Muniche. 15. Puinave. 17. Ticuna– Yuri: a. Ticuna. b. Yuri. 18. Tucano. B. Equatorial 1. Macro-Arawakan: a. Guahibo. c. Otomaco. d. Tinigua. e. Arawakan: (i) Arawa. (ii) Maipuran (incl. Amuesha, Apolista, Chamicuro, Res´ıgaro and the Harakmbut languages). (iii) Chapacura. (iv) Guamo. (v) Uro (incl. Puquina and Callahuaya). 2. Cayuvava. 3. Coche (=Kams´a). 4. Jibaro–Kandoshi: a. Cofan. b. Esmeralda. c. Jibaro. d. Kandoshi. e. Yaruro. 5. Kariri–Tupi: b. Tupi. 6. Piaroa (incl. Saliba). 8. Timote. 11. Yuracare. 12. Zamuco. VI. –– A. Macro-Carib 1. Andoke. 2. Bora–Uitoto: a. Bora. b. Uitoto. 3. Carib. 5. Yagua. B. Macro-Panoan 1. Charruan. 2. Lengua. 3. Lule–Vilela: a. Lule. b. Vilela. 4. Mataco–Guaicuru: a. Guaicuru. b. Mataco. 5. Moseten. 6. Pano–Tacana: a. Panoan. b. Tacanan. C. Macro-Ge 1. Bororo. 4. Chiquito. 7. Ge–Kaingan: a. Kaingan. 8. Guato. 1.7 Genetic relations of South American Indian languages 45 names are occasionally added, preceded by an equals sign (=)inorder to ease identi- fication. The lowest level of the classification is left out because not all the language names listed in Greenberg’s classification actually represent different languages but, rather, dialects or different designations of the same language. In other cases, however, they do represent different languages, a fact which may give rise to confusion. 2
The Chibcha Sphere
The present chapter deals with the languages of the northern Andes; the term ‘Chibcha Sphere’ has been chosen because of the historically important role of the Chibcha people in that area. In the sixteenth century the Chibcha or Muisca were the inhabitants of the highland region that coincides with the modern Colombian departments of Boyac´a and Cundinamarca. Although historical sources insist that there was no linguistic unity, it is likely that most Chibcha spoke closely related languages or dialects belonging to a subgroup of the Chibchan language family. At least two languages have been identified: Muisca was spoken on the upland plain (sabana) surrounding the present-day Colombian capital Santaf´edeBogot´a (department of Cundinamarca) and Duit in the department of Boyac´a. By their location in the highlands east of the Magdalena river valley, close to the Amazonian plains, the Chibcha held a peripheral position in the Colombian Andes. Therefore, their linguistic influence on other parts of that area must not be overestimated. The Chibcha were a populous agricultural nation, who specialised in the cultivation of potatoes and cotton. They were divided into several chiefdoms, two of which occupied a leading position. A southern chiefdom, centred in Bacat´a or Muequet´a (near the modern town of Funza, close to present-day Bogot´a), was ruled by a king called the zipa.At the time of the arrival of the conquistador Gonzalo Jim´enez de Quesada in 1537, the valley of Bogot´awas filled with a multitude of high wooden buildings, which impressed the Spaniards so much that they gave it the name of Valle de los Alcazares (‘Valley of the Castles’). The zipa’s northern neighbour, located in Hunza (today’s Tunja, the capital of the Boyac´a department), was known as the zaque.Athird town of importance wasSogamoso (Sugamuxi), the religious centre of the Chibcha and the seat of a highly venerated wooden temple of huge dimensions. According to tradition, the temple of Sogamoso was burned down accidentally by two greedy Spanish conquistadores,who let go of their torches as they beheld the richness of the gold decorations inside (Hemming 1978: 86–7). The high priest of Sogamoso subsequently changed his name to don Alonso and became one of the most faithful propagandists of the Christian faith (Triana y Antorveza 1987: 555). 2 The Chibcha Sphere 47
O JIR UA G Santa Marta NA TÍO TAIRO QUE CA ARHUACO S A C E RAJARA H N JI Cartagena N MALIBÚ I O A M AMÁN
IL AY C I
L T
O YÓN A A O G M PACABUEY M CUICA C U E E SINÚ OT
V . T I M
Panamá A GUAMO R Mérida HACARITAMA Í GUACA- a
n U e
NORI l G . a I VENEZUELA PANAMA d R YAMESÍ R a g CHITARERO CUNA c a A u Y a M A C OPÓN-CARARE OTOMACO t r CATÍO a GUANE BETOI t o NUTABE LACHE IDABAEZ R
C . AGATANO H LIMA SÁLIBA O CO DUIT
C RA
S NTÁGO Sogamoso Ó PA . O MUZO MAIPURE R C ARMA- E TEGUA ACHAGUA N Tunja n POZO H A MUISCA a C u H IRRA N J C Bogotá ANSERMA A S. P QUIMBAYA SUTAGAO QUINDÍO O A LILI J A I YURUMANGUÍ C P JAMUNDÍ A COLOMBIA N Í JITIRIJITI APÁEZ PUBENZA U U G Q TIMANÁ A Popayán YALCÓN D N BARBACOA A TAMA San Agustín SINDAGUA QUILLACINGA NIGUA PASTO MALABA ESMERALDEÑO SIONA YUMBO CARA O QUIJO N O H C
L ECUADOR I
Z
A
R PERU B
Map 1 The Chibcha Sphere: overview of ethnolinguistic groups attested in premodern sources
The Chibcha heartland also became known worldwide as the source of the El Dorado legend, a major incentive for conquest and exploration in the northern Amazon. At regular intervals, the cacique or chieftain of Guatavita, one of the most influential vassals of the zipa,would anoint himself with gold dust and plunge into a volcanic lake. The 48 2 The Chibcha Sphere story of El Dorado had a tremendous impact on Spanish conquistadores and adventurers. During the decades following the conquest, they would organise numerous expeditions geared at finding other El Dorados. These expeditions brought considerable havoc and misery to the Chibcha and their neighbours. Their damage in terms of human losses and social disruption was such that the emperor Charles V forbade all such expeditions in 1550 (Hemming 1978: 139). There were some remarkable cultural achievements, such as the goldsmith’s art of the Quimbaya people of the Cauca river valley, the monumental stone sculptures of San Agust´ın in the department of Huila, the Ciudad Perdida (‘lost city’) of the Tairona in the Sierra Nevada de Santa Marta, and the pictographic writing system of the Cuna (cf. Nordenski¨old 1928–30). At the same time, the native peoples of the northern Andes were divided. Apart from the powerful Muisca kingdoms, there was no political unity, but rather a conglomerate of small chiefdoms and tribes, living in an almost permanent state of war. These small political units were usually referred to as behetr´ıas by the Spaniards (see, for instance, Cieza de Le´on 1553; Acosta 1590). The difference in this respect between New Granada, as Colombia was called in colonial times, and Peru (including present-day Ecuador) was emphasised by most of the chroniclers. The Chibcha, although highly organised internally, were besieged by the warlike tribes of the Magdalena valley, including the Panche, the Pant´agora and the Pijao, who blocked their way to the west and contributed to their isolation. Within the neighbouring highlands, the Muzo and the Colima were encroaching upon the Chibcha heartland from the northwest. A special caste of warriors, the guecha (probably g¨uecha [wetya]; cf. Uricoechea 1871: 253) were in charge of the defence of the Chibcha realm, which is otherwise described as relatively peaceful. After the arrival of the Spaniards, many Colombian tribes refused to submit and continued to fight the colonial rulers, taking advantage of the rugged physiognomy of the country. Famous is the story, possibly a legend, of la Gaitana,afemale cacique of Timan´ainthe Upper Magdalena valley. In 1543 she is said to have hunted down and ferociously killed the Spanish conquistador Pedro de A˜nasco, who was responsible for burning her son.1 Many sectors of the Magdalena river valley remained dangerous and insecure for travellers well into the twentieth century. The Chimila, who inhabited the region east of the lower Magdalena valley (departments of Cesar and Magdalena) and the Andaqu´ıofthe forest region east of San Agust´ın (departments of Huila and Caquet´a), are known for their long and tenacious resistance. The fearsome Pijao of Huila and Tolima challenged Spanish rule in a large-scale rebellion in the early seventeenth century.
1 The story of La Gaitana is related by the chroniclers Juan de Castellanos (1589) and Pedro Sim´on (1625). Her ethnic background has not been established. She may have belonged either to the P´aez or to the local Yalc´on nation. 2 The Chibcha Sphere 49
Outside the Chibcha heartland, another concentration of highly developed and popu- lous societies was found in the valley of the Cauca river, in the modern departments of Caldas, Risaralda and Quind´ıo. An outstanding position was occupied by the Quimbaya federation, centred around the modern towns of Chinchin´a (near Manizales) and Pereira (Chaves, Morales and Calle 1995: 156). The Quimbaya are known as the most talented goldsmiths of pre-Columbian America. Although the first contacts with the Spanish conquistador Jorge Robledo in 1539 were not particularly hostile, the subsequent re- pression and exploitation by Sebasti´an de Belalc´azar and his men from Peru led to a series of rebellions, which brought about the near annihilation of the Cauca peoples. The Quimbaya became extinct as a recognisable group around 1700 (Duque G´omez 1970). Their language remains unknown and its affiliations a matter of speculation. Sixteenth-century chroniclers report the existence of almost innumerable different languages. Some of them give a fair account of the situation which can help to make us aware of the loss. Pascual de Andagoya (1545) mentions the Atunceta, Ciaman, Jitirijiti and Lili languages spoken in the area of Cali and Popay´an. Only the names of these languages have been preserved, as well as the observation that they were so different from each other that the use of interpreters was required. Pedro de Cieza de Le´on (1553), the chronicler of Peru, who accompanied Captain Robledo in his conquest of the lower and middle Cauca valley, has left very precise information about the language situation of Antioquia, Caldas, Quind´ıo and Risaralda. Through an analysis of Cieza’s linguistic observations, Jij´on y Caama˜no (1938: Appendix, pp. 109–12) points at the existence of four different languages in the Caldas–Quind´ıo–Risaralda region: Arma– Pozo, Quimbaya–Carrapa–Picara–Paucura, Quind´ıo and Irra. However, this enumeration does not include the languages of the Anserma, of the Chancos nation and of several other local groups. They may either have been separate languages, or be included in one of the groupings just mentioned. The information on all these languages is too limited to permit any conclusion as to their genetic affiliation. An exception are the languages of Antioquia (known as Old Cat´ıo and Nutabe), which were identified as Chibchan (Rivet 1943–6; Constenla Uma˜na 1991: 31). Interestingly, one of the few words mentioned by Cieza de Le´on for the language spoken in the towns of Arma and Pozo (department of Caldas) is ume ‘woman’, which corresponds to ome in the Cuna language of the Colombian and Panamanian Caribbean coast. A frequent ending -racua is reminiscent of the Cuna derivational ending -kwa (cf. Llerena Villalobos 1987: 72–3). Such similarities, as well as some others, were observed by Rivet (1943–6) but remain merely suggestive as long as no additional data are found concerning the languages of the Cauca valley. Since most of the indigenous languages were lost without possibility of recovery, the extent of linguistic variety in the northern Andes may never be fully appreci- ated. The few languages that have survived the contact with the European invaders may or 50 2 The Chibcha Sphere may not be representative. As it stands, none of the original languages of the Cauca and Magdalena valleys have survived, and there is hardly any documentation on them. The main languages of the sabana, Muisca and Duit, became extinct in the eighteenth century, although in the first case the available documentation is relatively extensive. On the other hand, some surviving groups (e.g. the Chocoan Ember´a, the Cuna, the P´aez) have been remarkably expansive in recent times. From this perspective, the original linguistic situation and the present-day one are hardly comparable.
2.1 The language groups and their distribution Colombia and western Venezuela form the northernmost section of the Andean region. This area was a meeting point of linguistic and cultural influences from the central Andes, the Amazon basin, the Caribbean and Central America. From an archaeological and cultural point of view, it is part of a region often referred to as the ‘Intermediate Area’ (Area Intermedia), negatively defined as an area belonging neither to Mesoamerica, nor to the Central Andean civilisation domain. In addition to Colombia and western Venezuela, the Intermediate Area also comprises a substantial part of Central America. The linguistic features of the Intermediate Area have been studied by Constenla Uma˜na (1991), who finds it subdivided into three main typological regions: a Central American–Northern Colombian area (including the Cof´an language isolate as an outlier), an Ecuadorian– Southern Colombian area, and a Guajiro–Western Venezuelan area. Two important language families, Cariban and Chibchan,have left their mark in the northern Andes since precontact times. Cariban has its origin in the Amazonian and Guyanese regions, whereas Chibchan has Central American connections. Considering their distribution and the amount of internal differentiation within the area under dis- cussion, the intrusion of the Chibchan languages in the northern Andes is clearly much older than that of the Cariban languages. Nevertheless, a Central American origin for the Chibchan languages seems likely because some of the most fundamental diversity internal to the family is found in Costa Rica and western Panama (Constenla Uma˜na 1990). Furthermore, the closest presumable relatives of the Chibchan family as a whole, Lenca and Misumalpa, are located at the northern, Central American borders of the Chibchan domain (Constenla Uma˜na 1991). The area of the Caribbean coast of Colombia comprises two important nuclei of Chibchan-speaking populations: the Cuna, around the Gulf of Urab´a and adjacent areas of (Atlantic) Panama, at one end, and the complex of indigenous peoples of the Sierra Nevada de Santa Marta (Ika, Kankuamo, Kogui and Wiwa) and the Sierra de Perij´a(Bar´ı), at the other. Between these two geographical extremes, several other groups were decimated or eliminated early in the colonial process. In the present-day departments of Bol´ıvar, C´ordoba and Sucre, three prosperous chiefdoms, Fincen´u, Pancen´u and Cen´ufana represented the Sin´u culture, renowned for its burials rich in gold 2.1 The language groups and their distribution 51
O O N JIR A A J U AU G R A PA N †) IA A Í( ER M U PR KOGUI A K JA D N Maracaibo Cartagena IKA A K YUKPA GUAJIRO CHIMILA recent expansion BARÍ ( † ) EMBERÁ T E T I M O VENEZUELA CUNA . PANAMA R a c OPÓN- au CARARE(†) C TUNEBO YARURO A E Á t M r R a HITNÜ CUIBA t E B o B
E Medellín R
M R . SÁLIBA .
E Á . R R a . a et R n e M n l I a a N u d A g U J ACHAGUA K PIAROA WAUNANA a Bogotá I n S
a M S P PIJAO(†) IAP OCO PUINAVE YAVITERO(†) Cali PÁEZ GU AMB COLOMBIA BANIVA DEL IAN viare R O TINIGUA Gua . GUAINÍA EMBERÁ GUAYABERO TOTORÓ CURRIPACO Popayán NUKAK IN CARIJONA CUBEO GA KAMSÁ N YURUTÍ KAKUA O PISAMIRA GUANANO KOREGUAJE- SIRIANO AWA CARAPANA TARIANA TAMA DESANO PIT TATUYO PIRATAPUYO CHA’PALAACHI N SIONA BARÁ TUCANO Á CABIYARÍ F MAKAGUAJE TUYUCA O BARASANA C TETETÉ (†) Caquetá R. Quito MACUNA HUPDÁ TSAFIKI SECOYA TANIMUCA ANGUTERO ANDOQUE NA YUCU CARIJONA P H M NONUYA u U UIN L tu I ANE ECUADOR m TO MIRAÑA a T y O BORA o I R OCAINA YURÍ . RESÍGARO OREJÓN Z YAGUA TICUNA COCAMA A PERU R B
Map 2 The Chibcha Sphere: approximate distribution of indigenous languages (mid twentieth century). The shaded area surrounding Lake Maracaibo indicates a recent expansion of the Guajiro language. 52 2 The Chibcha Sphere
(Chaves et al. 1995). The surviving descendants of the Sin´u, who live at San Andr´es de Sotavento (C´ordoba), not far from the town of Sincelejo, have no record of their origi- nal languages. The language of the lower Magdalena river (between Tamalameque and Trinidad) was known as Malib´u.Itand the extinct languages of neighbouring peoples, such as the Mocana and the Pacabueyes,have been grouped with the (Chibchan) lan- guage of the Chimila by Loukotka (1968: 244) without any factual basis (cf. Constenla Uma˜na 1991). The Chimila language is still spoken today. The language of the Tairona, whowere destroyed in 1600 after almost a century of warfare with the colonists of Santa Marta, may have been related to (or even identical with) one of its Chibchan neighbours further east in the Sierra Nevada.2 The Andes northeast of the Chibcha heartland were inhabited by several agricultural highland peoples who shared some of its cultural characteristics. They include the Lache, wholived near the Sierra Nevada del Cocuy (northeastern Boyac´a), the Guane in the southern part of Santander department (south of Bucaramanga) and the Chitarero of the Pamplona area (department of Norte de Santander). The Agatano occupied the mountains west of V´elez (Santander). Most of these peoples have been classified as Chibchan, although there are hardly any linguistic data to support such a supposition. Since the territory of the Lache bordered on that of the modern (Chibchan) Tunebo or Uwa of the eastern Andean slopes and lowlands in Casanare, it is tempting to speculate that they represented the same linguistic grouping. Here again, however, the necessary data on Lache are lacking. The Venezuelan Andes form a geographical continuation of the area just referred to. The high Andes of the states of M´erida and Trujillo comprise a substantial indigenous (now Spanish-speaking) population, representing the (extinct) Timote–Cuica family, not related to Chibchan. In the pre-Andean hills of the states of Lara and Falc´on, the Jirajaran family (also extinct) constituted another linguistic isolate. Cariban-speaking peoples survive in the Colombian–Venezuelan border area west of Lake Maracaibo (Yukpa and Japreria). Elsewhere in the northern Andean region the identification of Cariban languages has been problematic as a result of their poor docu- mentation and early extinction. In addition, Spanish chroniclers did not employ the term ‘Carib’ (caribe) primarily as a linguistic concept, but rather as a cover term for Indians who remained intractable in their contacts with the colonial authorities, and, especially, for those who used arrow-poison and practised cannibalism. Such tribes inhabited the
2 The linguistic connection between the area of Santa Marta and the Muisca highlands of Bogot´a is illustrated by Piedrahita’s account of the conquest of 1537. In 1676 he wrote: Quienes m´as percibieron el idioma fueron Peric´on, y las Indias, que se llevaron de la costa de Santa Marta, y R´ıo Grande, que con facilidad lo pronunciaban, y se comunicaban enel ´ con los Bogot´aes (‘Those who best understood the language [of the Muisca] were Peric´on [the expedition’s guide] and the Indian women, who hailed from the coast of Santa Marta, and the R´ıo Grande [Magdalena], who with ease pronounced it, and communicated in it with the Bogot´aes’; Ostler 2000, following Uhle 1890). 2.1 The language groups and their distribution 53
Magdalena valley, where at least one group, Op´on–Carare, has been identified as Cariban on linguistic grounds (Durbin and Seijas 1973a, b). In the case of other Magdalenan peoples, such as the Panche and the Pijao,aCariban affiliation remains hypotheti- cal, although the poor lexical data of the Pijao language that are left exhibit traces of Cariban influence in its basic vocabulary (cf. Constenla Uma˜na 1991: 62). Cariban influ- ence in the Magdalena valley is also suggested by its unique Carib-sounding toponymy (Coyaima, Natagaima, Tocaima), which is found in both the ancient Panche and Pijao areas. In other areas, however, where a Carib presence has been suspected mainly on cultural grounds, e.g. in the Cauca valley, there is no linguistic evidence to support it. In addition to Cariban and Chibchan, two more families which have their origin out- side the northern Andes are found in the area under discussion. The Arawakan family of probable Amazonian origin is represented near the Caribbean coast (Guajiro and Paraujano). The Quechuan group (cf. chapter 3) is represented by the Inga or Ingano language in the southern Colombian departments of Nari˜no and Caquet´a. The influence of Quechua is particularly noticeable in the southern Andes of Colombia, notwith- standing the fact that local languages, such as Pasto, Quillacinga and Sindagua contin- ued to remain in use for a long time. It is not unlikely that Quillacinga and Sindagua survive in present-day Kams´a or Sibundoy and in (Barbacoan) Cuaiquer or Awa Pit, respectively (Groot and Hooykaas 1991). The department of Nari˜no, bordering on Ecuador, has a substantial indigenous population, even though the use of Spanish is now predominant. The extent of Quechua influence in southern Colombia, as well as the moment of its introduction, is a matter of debate. It may already have been present before the arrival of the Spaniards, or it may have been introduced by the yanacona (serfs) from Quito, who accompanied Belalc´azar and other conquistadores on their expeditions. The Quechua- speaking yanacona played an important part in the conquest of New Granada. They were eventually allowed to settle down at several locations north of Popay´an, in the Bogot´a area and in Huila (Triana y Antorveza 1987: 118–20). As early as 1540, Andagoya (1986: 133) observed a sort of mixed use of Quechua and Spanish among the Jitirijiti tribe, who lived in the neighbourhood of the present-day town of Cali. He quotes the words of a recently christianised Indian woman turning down an improper proposal made to her by a Spaniard: mana se˜nor que soy casada y tern´a Santa Mar´ıa ternan hancha pi˜na, ‘no, sir, I am married, and the Holy Mary will be very angry’; cf. Quechua mana ‘no’, anˇca pinya ‘very angry’ (tern´a and ternan may represent the Spanish verb tendr´a ‘he/she will have’). In 1758 the Franciscan friar Juan de Santa Gertrudis visited the archaeological remains at San Agust´ın, leaving a detailed account of his findings (Reichel-Dolmatoff 1972). He reported that Quechua, la lengua linga (an adulteration of la lengua del Inga ‘the language of the Inca’), was used in the Upper Magdalena region, an area which had been highly multilingual in the sixteenth century (Triana y Antorveza 1987: 169). 54 2 The Chibcha Sphere
As might be expected, not all northern Andean languages can be assigned to one of the four widely extended families discussed so far. In addition to the two extinct Venezuelan families mentioned earlier, there are at least two local families, Barbacoan in the southern departments of Cauca and Nari˜no (as well as in Ecuador) and Chocoan in the Pacific region. There are five living Barbacoan languages: Awa Pit (Cuaiquer), Cha palaachi (Cayapa), Guambiano, Totor´o and Tsafiki (Colorado). Cha palaachi and Tsafiki are only spoken in Ecuador. Chocoan is represented by two languages: Ember´a and Waunana. The P´aez language of Cauca and Huila is either an isolate or the surviving member of a small family. On the eastern slopes Kams´a or Sibundoy and Andaqu´ı (the latter extinct) are isolates, and so are Yurumangu´ı and Esmeralde˜no,extinct languages of the Pacific region of Colombia and Ecuador. In the following sections we will discuss some general characteristics of the principal documented languages of the Colombian and Venezuelan Andes, as well as the north- western part of Ecuador. Because of its historical importance, Muisca will be treated in somewhat more detail. Demographically important languages, such as Guajiro and P´aez, will also receive particular attention. The final section of this chapter contains an overview of the languages of the eastern lowlands that are adjacent to the northern Andes.
2.2 Research on the native languages of Colombia Among the precursors of academic studies concerning the languages of the northern Andes two scholars must be mentioned, the natural scientist and physicist Jos´e Celestino Mutis (1732–1808) and the linguist Ezequiel Uricoechea (1834–80). After his arrival in New Granada in 1761, Mutis became an enthusiastic collector of descriptive material of difficult accessibility concerning the indigenous languages of the Spanish American domain. In 1787 he received the commission to make a collection of indigenous language materials as the result of a request addressed by Catherine II of Russia to the king of Spain. The empress needed these materials for her ambitious project to document the languages of the world in Saint Petersburg. Mutis carried out his assignment with great care, making copies so as to avoid losses. As it appears, most of Mutis’s materials never reached Russia (Ortega Ricaurte 1978: 102), but copies remain in Madrid. Born in Colombia, Uricoechea, a man of wide interests, spent part of his life in Belgium, where he held a professorship in the Arabic language. He was the founder of the series Biblioth`eque Linguistique Am´ericaine (1871–1903), in which much early work on the indigenous languages of the Americas, including his own on Muisca, was brought together. In more recent times, the Frenchman Paul Rivet (1876–1958) collected and published material on numerous languages of Colombia and the Venezuelan Andes, including extinct languages of which only a minimum amount of data could be found. Landaburu 2.2 Research on the native languages of Colombia 55
(1996a, 1998, 1999) has published substantial parts of Rivet’s Colombian archive, kept at the Mus´ee de l’Homme in Paris. In Colombia a number of important studies, both descriptive and historical, have appeared, especially in recent decades. In 1965 Sergio El´ıas Ortiz published a survey of the Colombian indigenous languages. The Instituto Caro y Cuervo in Bogot´a, a venerable institution originally dedicated to Hispanic studies, issued several works of indigenous linguistic interest. Among these are a history of studies dedicated to the Colombian native languages (Ortega Ricaurte 1978), a history of the fate of the indigenous languages in colonial society (Triana y Antorveza 1987), an overview of historical-comparative efforts concerning these languages (Rodr´ıguez de Montes 1993) and two fundamental works on Muisca (Gonz´alez de P´erez 1980, 1987; see section 2.9 on Muisca). A most remarkable achievement is Lenguas Ind´ıgenas de Colombia: una visi´on descriptiva (Indigenous Languages of Colombia: a Descriptive Vision) (Gonz´alez and Rodr´ıguez 2000), a monumental book containing descriptive sketches and information by different specialists on all the native languages spoken in Colombia today. In 1984 the Universidad de los Andes in Bogot´a, in co-operation with the French Centre National de la Recherche Scientifique (CNRS), initiated a programme for the training of descriptive linguists under the direction of Jon Landaburu, in order to study and document the indigenous languages of Colombia in a systematic way. The institution harbouring this programme, the Centro Colombiano de Estudios de Lenguas Abor´ıgenes (CCELA), issues a series of descriptive studies and dictionaries, which have appeared regularly since 1987. Up to now, the series includes work on Achagua, Cuna, Ember´a, Chimila, Damana, Guambiano, Guayabero, Kogui, P´aez, Sikuani and Ticuna. Most of the contributions in Gonz´alez and Rodr´ıguez (2000) have been written by linguists trained in the CCELA programme. Members of the Summer Institute of Linguistics, which has deployed activities in Colombia for several decades, have produced descriptive studies, inter alia,ofAchagua, Ember´a, Ika and several Tucanoan languages. Also worth mentioning is a useful com- parative vocabulary of Colombian indigenous languages, compiled by Huber and Reed (1992). The journal Estudios de Ling¨u´ıstica Chibcha, published by the University of Costa Rica at San Jos´e, contains much data and discussion concerning the reconstruction of earlier stages of Chibchan in particular. Other important work by the Costa Rican school is Constenla’s dissertation on the reconstruction of Chibchan phonology (Constenla Uma˜na 1981) and his book on the typology of the Intermediate Area (Constenla Uma˜na 1991). Morphosyntactic reconstruction of Colombian Chibchan languages is attempted in Ostler (2000). The indigenous literature of the northern Andean area is mainly confined to collec- tions of folkloric text. An overview of the traditional literature in the Chibchan languages 56 2 The Chibcha Sphere can be found in Constenla Uma˜na (1990a). Two Chibchan languages, Cuna and Kogui, are well represented in this respect. Many Cuna texts were collected in the first half of the twentieth century by researchers of the Ethnographic Museum in G¨oteborg (Nordenski¨old 1928, 1930a, 1938; Holmer 1951; Holmer and Wass´en 1953). An eval- uation of the literary value of these texts is provided by Kramer (1970). More recent text material and an extensive study of Cuna discourse and speech styles can be found in Sherzer (1983, 1986). A rich corpus of Kogui folkloric text was collected by Preuss in 1915 and published in different issues of the journal Anthropos (Preuss 1921–5; cf. also Fischer and Preuss 1989). Further information on Kogui traditional literature can be found in Reichel-Dolmatoff (1950–1). Of great interest is an extensive collection of Huitoto myths, transcribed and with a vocabulary and a German translation, again by Preuss (1921–3). Finally, Howard and Sch¨ottelndreyer (1977) present some texts in Kams´a and in Cat´ıo (Choc´o), respectively.
2.3 Chocoan The Chocoan language family is located on the Pacific side of Colombia and eastern Panama.3 It consists of two languages, Ember´a and Waunana. The Waunana language has its principal location in the lower part of the San Juan river valley in the Colombian department of Choc´o. Additionally, a substantial number of Waunana speakers have migrated to coastal areas adjacent to Panama and to the Darien region of Panama itself. The Ember´a constitute a flexible and expanding population, which have colonised new territories whenever external or demographic pressure incited them to do so. At present, the Ember´a not only inhabit Darien and the Choc´o, but also parts of Antioquia, Cauca, C´ordoba, Nari˜no, Risaralda and Valle del Cauca, thus occupying areas both to the north and to the south of the Waunana. In recent times some Ember´ahave reached Ecuador, and others have crossed the Andes into the Amazon region (department of Caquet´a). Chocoan presence is easily traceable by the frequent occurrence of the ending -d´o ‘river’ in place names (e.g. Apartad´o, Baud´o, Docampad´o, Opogad´o). The number of Ember´a speakers in Colombia is estimated at more than 70,000, while that of the Colombian Waunana has been calculated at ca. 8,000 (Arango and S´anchez 1998).4
3 Loukotka (1968) reports the existence of an extinct language isolate in the area of Bah´ıa de Solano (northern Choc´o), which he calls Idabaez.Itisbased on a report concerning a short-lived missionary adventure of the Franciscans between 1632 and 1646 (Rowe 1950a). Only one word (tubete ‘medicine-man’) and the name of a chief (Hijuoba)were recorded, too little to attribute a separate identity to this group on linguistic grounds only. 4 The population numbers supplied by Arango and S´anchez (1998) often seem to be inflated in relation to figures taken from other sources. It has nothing to do with a difference in number be- tween speakers of the language and members of the ethnic group. In many Colombian indigenous groups the ancestral language is used by all members. 2.3 Chocoan 57
In their expansion the Ember´ahave settled in areas formerly occupied by other peo- ples who became extinct during the process of colonisation. Ember´aisknown under different names according to the region where it is spoken (Cat´ıo in central Antioquia and C´ordoba, Cham´ı in southern Antioquia and Risaralda, Saija in the area south of Buenaventura, Samb´u in the Panamanian border area). At least in one case, such a name (Cat´ıo) originally designated a Chibchan people who preceded the Ember´ain the same area (central Antioquia). Whether or not there has been any continuity be- tween the modern Choc´o and extinct societies such as the Quimbaya and Sin´u, re- mains an open question.5 A connection that certainly seems promising is that between Chocoan and the language of the Cueva people, who inhabited central and eastern Panama at the beginning of the sixteenth century (Loewen 1963a; Constenla and Margery 1991). Constenla and Margery (1991) point out a number of lexical and morphological similarities between the Chocoan and the Chibchan languages, which may indicate historical contact or a possible genetic relationship. It seems useful to observe that there are a number of close lexical similarities with the Barbacoan languages as well, e.g. Proto-Barbacoan (Curnow and Liddicoat 1998) *kim-ϕu ‘nose’, Guambiano (Curnow 1998) kalu,Awa Pit (Calvache Due˜nas 2000) kail y ‘ear’; Proto-Chocoan (Constenla and Margery 1991) *k˜eb´u ‘nose’, *k w r´ ‘ear’. Greenberg (1987) links Chocoan with Barbacoan and P´aez. Rivet’s proposal of a genetic link between Chocoan and Cariban (Rivet 1943b) has been rejected upon several occasions (cf. Pardo and Aguirre 1993: 278–92). The difference between the two Chocoan languages, Ember´a and Waunana, is in the first place lexical. Ember´a itself is best treated as a dialect continuum. From a phonolog- ical point of view, the Chocoan languages are complex and show a considerable amount of internal variation. Comparative studies of the phonology of the different varieties can be found in Loewen (1963b), and Pardo and Aguirre (1993). Llerena Villalobos (1995) contains the views of different authors on the phonology of several Ember´avarieties. Waunana has many syllable- and word-final obstruents, whereas Ember´a has a preference for open syllables. All varieties of Chocoan present three series of stops, which can be either voiceless aspirated/voiceless unaspirated/voiced lax (Waunana, Saija), or voice- less plain (somewhat aspirated)/voiced tense/voiced implosive (the northern dialects of Ember´a). In addition to these basic distinctions, there is much allophonic variation; in
5 In a discussion of Rivet (1943b), Constenla and Margery (1991) refer to a short word list collected by Bastian (1878) among possible descendants of the Quimbaya. It seems relatable to Chocoan. Considering the late date, so long after the recorded extinction of Quimbaya, one has to take into account the possibility that the interviewed persons were Chocoans who had migrated into the area. A closer analysis of the place and circumstances under which this word list was collected can possibly throw light on the matter. 58 2 The Chibcha Sphere
Table 2.1 Overview of the consonant inventories of Chocoan languages and dialects (adapted from Pardo and Aguirre 1993)
Atrato, Upper San Antioquia, Waunana, Saija Lower Baud´o Juan N. Choc´o
Stops Aspirated ph th kh ph th kh ph th kh ph th kh
Plain p t k ʔ ptk
Voiced b d g b d b d b d
Implosive /ð
Affricates Voiced z dzˇ and β Fricatives Voicelesscsh ˇ
Vibrants r rr
Resonants l m n
Approximants w y
a nasal environment the stops generally have prenasalised allophones. The dialect of the lower Baud´oriver is unique in that it presents four series in the labial and dental articulations (voiceless aspirated/voiceless unaspirated/voiced lax/voiced implosive). A constant feature of all the Chocoan languages is a phonemic opposition between a sim- ple (r) and a multiple vibrant (rr), both of which occur in intervocalic and syllable-final position. The number of affricates and fricatives varies according to the dialect. Only the southern varieties have a phonemic glottal stop. Table 2.1 contains a synopsis of the consonant systems of Waunana and a number of Ember´a dialects. Constenla and Margery (1991) propose a reconstruction of Proto-Chocoan phonology. In spite of the rather elaborate inventories of stops found today in the Chocoan varieties, they reconstruct a stop system with voiced (b, d ) and voiceless (p, t, k) elements only. This reconstruction is claimed to be valid for Proto-Ember´a, as well as for Proto-Chocoan in its totality. The minimal vowel inventory found in all present-day varieties comprises three high vowels (i, , u), two mid vowels (e, o) and one low vowel (a). In addition, Waunana has an extra row of lowered (open) high vowels; Saija has an extra central vowel ( ). A nasality opposition is relevant for all six vowels. Progressive nasal spread can affect whole words unless checked by an opaque consonant (one of the obstruents, or rr). In an analysis of 2.3 Chocoan 59
Saija (Harms 1994), the nasal consonants are treated as allophones of b and d before a nasal vowel. Also in Saija stress can be contrastive. Two detailed and recent studies address the structure of Ember´avarieties. Harms (1994) deals with the Saija variety of Nari˜no, Cauca and Valle, whereas Aguirre Licht (1998, 1999) describes the Cham´ı dialect of Cristian´ıa (southwestern Antioquia). Llerena Villalobos (1994) treats predication in the Cham´ı dialect of the Upper And´agueda (interior of Choc´o). In Gonz´alez and Rodr´ıguez (2000), Hoyos Ben´ıtez presents a sketch of the Napip´ıriver dialect (in the Baud´o hills, coast of north-central Choc´o) and Mej´ıa Fonnegra a sketch of Waunana. A detailed study of loan words in Waunana by Loewen (1960) documents language contact with Spanish. Chocoan word forms present an agglutinating structure which is mainly based on suffixes. Prefixes are exceptional. Nominal suffixes refer to case and number. Verbal suffixes refer to aspect, tense, number and mood. The verbal morphology furthermore includes suffixes for causative, directionality and other derivational categories. Auxil- iaries and copula verbs play an important role in the grammar. There is a rich choice of compound verbs, including some cases of object incorporation. In a genitive construction a possessor precedes its head without any special marker. This is also the case when the modifier is a demonstrative pronoun. Adjectives and numerals, however, follow their heads. Personal reference is indicated by pronouns, which precede a noun (when referring to a possessor), or a verb (when referring to subject and object). There is no indication of personal reference in the verb form itself, except that Waunana has a set of auxiliary verbs reflecting person-of-subject, which are limited to stative constructions (Mej´ıa Fonnegra 2000). All Chocoan languages are ergative. The case system comprises an ergative marker which is obligatory with agents of transitive constructions. Objects of transitive verbs and subjects of intransitive verbs remain unmarked. The preferred constituent order is Agent–Patient–Verb. For instance, in Waunana, we find:
(1) khum-au su:rr bu:rr-pi-hi-m jaguar-E deer fall-CA-PA-DV ‘The jaguar caused the deer to fall down.’ (Mej´ıa Fonnegra 2000: 89) (2) su:rr bu:rr-hi-m deer fall-PA-DV ‘The deer fell down.’ (Mej´ıa Fonnegra 2000: 89)
A special ergative marker is used with singular pronouns:
(3) mu-a p -rik binʔe de:-hi-m I-E you-DA medicine give-PA-DV ‘I gave medicine to you.’ (Mej´ıa Fonnegra 2000: 90) 60 2 The Chibcha Sphere
When plurality is marked on the verb in Ember´a, it refers to the agent, regardless of whether the latter is in the ergative or in the absolutive case. Examples from Saija (southern Ember´a) are:
(4) eper˜a:-r˜a-pa´ phokhura´ 6 kho-pa-ˇci-d´a person-PL-E toasted.corn eat-HB-PA-PL ‘The Epena used to eat toasted corn.’ (Harms 1994: 98) (5) m warr´a-r˜aˇc˜o:-pa-ˇci-d´a I son-PL fight-HB-PA-PL ‘My children used to fight.’ (Harms 1994: 103)
Ember´a has a series of copula verbs used in adjectival and locative constructions. They (lexically) encode such distinctions as number, aspect, honorific and size. In example (6), also from Saija, ci-to:n-a-ˇ is the honorific plural past stem of the copula verb. A past- tense marker (-ˇci-)isthen still required.
(6) cupˇ r´ıaci-to:n-a-pa-ˇ ˇ ci-d´a poor be.HN-PL-PA-HB-PA-PL ‘They were very poor.’ (Harms 1994: 32)
The following examples from Cham´ı illustrate a predicative construction with a dual copula verb (7), and a locative construction with a possessive interpretation (8), respectively.
(7) d´aiep˜ ˜ era´ pan´u-ma we Ember´a be.D-DV ‘The two of us are Ember´a.’ (Aguirre Licht 1999: 55) (8) ciˇ w´ar mar´ıa-e u´ DC child Mar´ıa-L be.SG ‘The child belongs to Mar´ıa.’ (Aguirre Licht 1999: 56)
2.4 Yurumangu´ı Yurumangu´ı is the name of a group of Indians who inhabited the upper reaches of the Cajambre, Nava and Yurumangu´ırivers. These rivers descend to the Pacific Ocean from the mountain range Los Farallones de Cali, situated southwest of the modern town of Cali in the department of Valle del Cauca. The Yurumangu´ıwere visited between 1765 and 1768 in two expeditions organised by a local prospector, Sebasti´an Lanchas de Estrada. He left a diary and a list of words and expressions in the language, as well as some interesting ethnographic notes. These materials were published in 1940 by Arcila
6 Intervocalic kh has a fricative pronunciation [x]. 2.5 Cuna 61
Robledo (cf. Ortiz 1946). The fate of the Indians of Yurumangu´ı after the two visits of the 1760s is unknown. There have been no further records of their existence. Rivet (1942) made a study of the language data recorded by Lanchas and concluded that Yurumangu´ıwas not visibly related to any other language of the area. He proposed a genetic relationship with the Hokan languages of North America, leaving Yurumangu´ı as the only language with such an exclusive connection outside South America. The suggestion was explicitly followed by Greenberg (1960a, 1987: 132). Jij´on y Caama˜no (1945) proposed both a Hokan and a Chibchan affinity. Many other researchers (e.g. Kaufman 1990, Constenla Uma˜na 1991) prefer to treat the language as an isolate. The data on Yurumangu´ı are very limited and probably unfit to establish genetic connections that are not particularly close. Its simple sound system with a preference for open syllables does not plead in favour of a North American connection. Even though the Yurumangu´ı territory was situated along rivers descending to the Pacific, it was an inland tribe, one of the reasons for which the group could remain unnoticed for so long. Instead of speculating about a marine origin for the Yurumangu´ı, it would probably be more logical to consider them as survivors of one of the groups of the Cali region who may have fled exploitation by the cruel Belalc´azar and his men. Some morphological elements can be deduced from the Yurumangu´ı data, such as a suffix -sa that characterises the citation form of several verbs (e.g. anga-sa ‘to sleep’, sai-sa ‘to die’, ulsa-sa ‘to take out’) and a prefix ca(i)- that occurs frequently in kinship terms and words referring to body parts (e.g. cai-g´ı ‘mother’, cai-enai´e ‘grandmother’, cai-cona ‘head’, cai-lusa ‘hair’). Assumedly, ca(i)- may have been one of the possessive personal reference markers. Interrogative pronouns began with c [k], as, for instance, cana ‘what?’, and cu or co ‘where?’ (cu-na ‘where is it?’, cu-cae ‘where are you?’, co-cobica ‘where do you come from?’, co-cuebiquen ‘where are you going?’). Constenla Uma˜na (1991: 53) observes word order preferences to the effect that a genitive must precede its head, and an adjective must follow its head, characteristics that the language shares with Chocoan. Alexical item worth mentioning is chuma ‘to drink’ (e.g. in chuma-´e ‘drink!’). Ortiz (1946: 25) observes that it is used in southern Colombian Spanish to denote drunkenness, suggesting that it could be a loan from Yurumangu´ı. However, the same expression is widely used in Ecuador and northern Peru. Its most likely source is Mochica cɥuma- ‘(get) drunk’ (cf. section 3.4). The case suggests that, notwithstanding their apparent isolation, the Yurumangu´ıwere not entirely free of contact with neighbouring speakers of Spanish.
2.5 Cuna In Colombia approximately 1,000 Cuna (autodenomination Tule) occupy two villages near the Gulf of Urab´a: Arqu´ıa (department of Choc´o) and Caim´an Nuevo (department of 62 2 The Chibcha Sphere
Antioquia). A majority of the Cuna people (over 40,000) inhabit neighbouring Panama, where they occupy an autonomous region or comarca, comprising the archipelago of San Blas in the Caribbean and parts of the Darien mainland. The Cunas of Panama obtained their autonomy as a result of the 1925 uprising led by the nele (‘shaman’) Kantule.7 Notwithstanding their present homeland, the Panamanian Cuna are origi- nally from Colombian territory. They were probably chased from the Atrato river valley and the adjacent Pacific region by their hereditary enemies, the Ember´a. On the coast of northern Choc´oaplace name Jurad´o means ‘river of the Cuna’ in Ember´a, sug- gesting that Cuna once may have inhabited that area (cf. Rowe 1950a). During the seventeenth and eighteenth centuries they moved eastward, attacking the Spanish set- tlements on the Sin´uriver (Fals Borda 1976), and westward, occupying empty areas in the Darien region, which had been inhabited by the extinct Cueva people. In 1681 the Darien Cuna were visited by the English ship’s doctor Lionel Wafer, who stayed with them for a while and published a word list of the language (Friedemann and Arocha 1985).8 The Cuna language belongs to the Chibchan family, of which it constitutes a separate branch (Constenla Uma˜na 1993: 119). Descriptive material on the Cuna language can be found in Holmer (1946, 1947, 1951, 1952), Sherzer (1975, 1978) and Llerena Villalobos (1987, 2000). The sound inventory of Cuna, which is relatively small, is subject to morpheme-internal allophonic variation and an elaborate set of sandhi rules operating at morpheme boundaries. The language has five vowels (a, e, i, o, u) and a series of diphthongs or, rather, vowel sequences. Vowels are automatically long in stressed open syllables (stress is normally on the penultimate). Extra-long vowels, analysable as sequences of same vowels, occur as well, in particular, in monosyllabic words (Holmer 1946: 186). Within the consonants there is a distinction between geminate (tense) and simple (lax), as can be seen in table 2.2. Geminate consonants only occur intervocalically; the geminate stops pp, tt, kk and kkw are always voiceless. By contrast, the plain stops p, t, k and kw occur in all positions. They are normally voiced between vowels or when adjacent to a voiced consonant; in word-initial position voicing is optional. The affricate cˇ is always voiceless and acts
7 The kantule, literally, ‘flute (kammu) man (tule)’, is the name of the central figure at girls’ puberty rites (Sherzer 1974). 8 Loukotka (1968: 238–9) apparently treated Wafer’s Cuna word list as a specimen of the Cueva language. The mistaken view that the Cuna are close relatives or even direct descendants of the Cueva is still upheld in literature (e.g. Greenberg 1987: 117; Whitehead 1999: 887). Cuna lan- guage and culture are very different from those of the unfortunate Cueva, who were exterminated during the early years of Spanish colonisation (Romoli 1987; Constenla Uma˜na 1991: 47). 2.5 Cuna 63
Table 2.2 Cuna consonant inventory
Labial Alveolar Palatal Velar Labiovelar
Stops Simple p t k kw Geminated pp tt kk kkw Affricatec ˇ Fricative s Nasals Simple m n Geminated mm nn Laterals Simple l Geminated ll Vibrant r Approximants w y
as the geminate counterpart of s.Asaresult, the two are usually not distinguished in loan words, where cˇ is banned from positions reserved for non-geminates. The sound r is a trill and has no geminate counterpart. It cannot occur in word-initial position, and neither does l,which is automatically replaced by n in that position. The existence of separate labiovelar phonemes kw and kkw is defended in Holmer (1947: 13; 1951: 35) and Sherzer (1975: 49), but rejected by Llerena Villalobos. The existence of an opposition between kue ‘to be, to become’ and kwa (classifier for seed- like objects and also a frequent multifunctional suffix) seems to favour the former position. Sandhi in Cuna mainly consists in the syncope of morpheme-final and word-final vowels and syllables, sometimes accompanied by other processes, such as replacement of l by r (9) and various types of fusion (for an account of the possible changes see Llerena Villalobos 1987: 209–16). Particularly frequent is the suppression of final e in polysyllabic morphemes, when followed by any other morpheme. Voicing may be preserved, even where the triggering intervocalic context has disappeared as a result of syncope (10).
(9) mola-makke [morm´akke] cloth-prick ‘to sew’ (Llerena Villalobos 1987: 26) (10) ape-takke [abd´ akke] want-see ‘to await’ (Llerena Villalobos 1987: 216) 64 2 The Chibcha Sphere
When, as a result of syncope, the velar stops k and kk end up in syllable-final position before another (non-velar) consonant, they are often replaced by a high vocalic glide y, as in (11):
(11) pakke-nae-te [payna:de ∼ panya:de] buy-go-TH ‘He went to buy it.’ (Llerena Villalobos 2000: 29)
In spite of its frequency, it is not always predictable whether sandhi will occur or not. Holmer (1951: 41) observes that syncope is highly dependent on register. The use of non-syncopated forms is often found in songs and in ceremonial speech, of crucial importance to Cuna society.Syncopated forms are more frequently found in men’sspeech than in the speech of women and children. In some cases sandhi is obligatory, whereas in other cases it is merely a matter of stylistic variation. The different registers of the Cuna language have been studied by Sherzer (1983) in his work on the ethnography of speech tradition. He observes the use of separate registers for public discourse, daily language and specific rites, such as puberty rites. In the remaining part of this section, the example sentences will be given in their non-syncopated forms, as they are presented in the sources. Cuna is an agglutinating, predominantly suffixing language. The occurrence of pre- fixes is limited to the function of indicating a difference in verbal valency. There are matching pairs of transitive and intransitive verbs differentiated by the initial vowel, e- for the transitives and a- or zero for the intransitives (12)(13).9 In addition, causativisation can be indicated by means of a prefix o- (14).
(12) kwane e-kwane ‘to fall (of fruit)’ ‘to drop (fruit)’ (Llerena Villalobos 1987: 24) (13) a-tinne e-tinne10 ‘to get tied’ ‘to tie’ (Llerena Villalobos 1987: 24) (14) purkwe o-purkwe ‘to die’ ‘to kill’ (Llerena Villalobos 1987: 25)
Verbal morphology expressed by suffixes includes voice (passive), tense, number and directionality. In addition to the directional suffixes, there is a set of verb stems referring to direction. These directional verb stems can be either affixed or used as main verbs
9 This situation is reminiscent of similar paired categories in Chiquitano (see section 4.13.3). 10 Llerena Villalobos (2000: 71) gives a phonetic representation [ett´ınne] for this form, suggesting a geminating effect of the prefix. 2.5 Cuna 65 themselves. Even when affixed, they frequently combine with the directional suffixes, as illustrated in (11) and (15).
(15) se-tani-kki carry-come-H ‘He came to bring it.’ (Llerena Villalobos 1987: 29)
Another set of stems combine the functions of bodily posture with durative aspect and ‘to be’. They can be used either as main verbs or as affixed auxiliaries. For instance, in (16) the stem kwiˇci indicates that the subject performs his action standing.
(16) we sunmakke-kwiˇci-t sayla ospino this speak-stand-N chief Ospino ‘The one who stands there talking is chief Ospino.’ (Llerena Villalobos 1987: 130)
As in Chocoan, personal reference is not expressed morphologically. Subject and object (with verbs) and possessor (with nouns) are all indicated by the same uninflected personal pronouns an ‘first person’, pe ‘second person’ and e ‘third person’ (Llerena Villalobos 1987: 63–4). Third-person subject is often omitted. The corresponding plural forms are obtained by adding the universal pluraliser -mala, and dual forms by adding -po ‘two’. The personal pronouns accompany verbs on an SOV order basis. When used as possessive pronouns, they must precede the head noun immediately. The same holds for the demonstrative pronouns we ‘this’ and a ‘that’. It should be observed that not all authors agree as to the status of the personal and demonstrative pronouns. Holmer (1946, 1951) treats them as prefixes, thus expanding the prefix inventory of the language, but makes an exception for the plural forms an-mal(a) ‘we’, pe-mal(a)‘you (plural)’, which are clearly independent forms. Adjectives and numerals follow their heads, again as in Chocoan. Cuna has several relational suffixes which together divide the oblique case functions. For instance, -kala indicates possession, destination and beneficiary; -kine locative, ablative, instrument and cause.
(17) an maˇci sayla-kala kue-oe I son chief-DA become-F ‘My son will be a chief.’ (Llerena Villalobos 1987: 47) (18) akkwa-kine makke-sa-mala stone-IS hit-PA-PL ‘They hit him with a stone.’ (Llerena Villalobos 1987: 53) 66 2 The Chibcha Sphere
Cuna has a series of numeral classifiers based mainly on shapes and measures (Sherzer 1978). Even though there are special classifiers for fish and personal adornments, there is none for human beings. These are classified among the long objects (wala- ∼ war-). The numerals and the stem for ‘how many?’ (-pikwa) are suffixed to the numeral classifier, which itself follows the noun it classifies, as in (19):
(19) maˇci-mala wala-pikwapenikka son-PL CL-how.many you have ‘How many children do you have?’ (Llerena Villalobos 1987: 67)
Cuna has a great variety of compounds, including verb–verb compounds and combina- tions of verbs with an incorporated object; e.g. mas-kunne ‘to eat’ (from masi ‘banana’, ‘food’ and kunne ‘to chew’); kap-takke ‘to dream’ (from kape ‘to sleep’ and takke ‘to see’) (Llerena Villalobos 1987: 25–6). A more complex form is op-nake-t-akkwa ‘millstone’ (from opa ‘maize’; nake- ‘to grind’; -t ‘nominaliser’, and akkwa ‘stone’) (Llerena Villalobos 1987: 37). There are borrowed words not only from Spanish (kannira ‘chicken’, from Sp. gallina), but also from English; e.g. mani ‘money’; waˇci ‘time, hour (watch)’.
2.6 The languages of the Sierra Nevada de Santa Marta The languages of the Sierra Nevada de Santa Marta, also known as the Arhuacan lan- guages (not to be confused with Arawakan; cf. Shafer 1962), form a group of mutually related languages of undisputed Chibchan affiliation. There are three living languages, which are known under several names. The Ika language is spoken in the southern part of the mountain massif (departments of Cesar and Magdalena) by some 14,000 people also known as Arhuaco or B´ıntukua (Arango and S´anchez 1998). The Ika are well known for their dynamic organisation and level of political awareness. Their principal centre is the town of Nabus´ımake (former San Sebasti´an de R´abago). North of the Ika, in the highest part of the Sierra Nevada (departments of Cesar, La Guajira and Magdalena), some 9,000 traditional Kogui or K´aggaba are speakers of the Kogui language, called Kougian in Kogui, and Peibu in Ika (Frank 1990: 41). The Kogui are an individualis- tic people, known for their inaccessibility and religious conservatism. Their principal ceremonial centre is Macotama. The third language of the Sierra Nevada, Damana,is found in the eastern and northeastern part of the Sierra Nevada (departments of Cesar and la Guajira). The approximately 2,000 speakers of Damana are less numerous than the Ika and the Kogui. As an ethnic group they are known by no less than six dif- ferent names: Arsario, Guamaca, Malayo, Marocasero, Sanj´a or Sank´a, and Wiwa; see Trillos Amaya (1989: 15–17) for a discussion of the respective merits of all these names. 2.6 Languages of the Sierra Nevada de Santa Marta 67
In spite of the fact that the Sierra Nevada people have been successful in preserv- ing their languages and traditions, they have felt the constant pressure of non-native colonisation. In addition to the three nations mentioned so far, a fourth group the Kankuamo, located near the town of At´anquez (Cesar) lost its identity in the twen- tieth century. Its language, Kanku´ı, closely related to Damana, is no longer spoken, although some people claim knowledge of it (Ortiz Ricaurte 2000). In order to escape colonisation and preserve their secluded life the Kogui have had to abandon most of the lower strata of their original habitat. The sixteenth-century Tairona,who occupied the area east of Santa Marta, were almost certainly part of the Arhuacan culture complex. It is likely that survivors of the Tairona group sought refuge with the Kogui after their defeat in 1600. It is not known to what extent the languages spoken by the Kogui and Tairona differed. The religious leaders of the Kogui claim knowledge of a ceremonial language called T´eiˇzua, and it is tempting to interpret this as a relict of Tairona. Land- aburu (1994: 375) mentions that the speakers of Damana also have a sacred language called Terruna shayama. The survival of the cultural and linguistic identity of the Arhuacan peoples is largely due to the strength of their traditional authorities and spiritual leaders, the mama (‘grand- fathers’), who distribute their knowledge at ceremonial temples known as kankurua.11 They have successfully fought the influence from outside, in particular, missionary ac- tivity. The native languages are part of the traditional values they seek to protect. At the same time, mama refuse to take part in formal education (Trillos Amaya 2000: 750). Some actively discourage the learning of Spanish, in particular by women. On the other hand, multilingualism seems to be the rule in the Sierra Nevada de Santa Marta. Due to mixed marriages and contact, many people speak several languages (especially Damana and Kogui), and some know as many as four languages. Since the pioneering work by Celed´on (1886), Preuss (1921–5), Holmer (1953) and Reichel-Dolmatoff (1950–1, 1989), the study of the Arhuacan languages has developed considerably. The Ika language has been the object of studies by Frank (1985, 1990) and Landaburu (1988, 1992, 1996b, 2000a). The Damana language is treated in Trillos Amaya (1989, 1994, 1999, 2000). There are partial studies of Kogui in Ortiz Ricaurte (1989, 1994, 2000) and in Olaya Perdomo (2000). Each of the three languages of the Sierra Nevada de Santa Marta has a clearly distinct character. Among them, the Kogui language is most conservative and particularly well suited for historical comparison with other Chibchan languages. Its nominal derivational morphology, involving several old classifying elements, is quite elaborate; see Ortiz Ricaurte (2000) for an inventory.
11 K nkurwa,anIka word; cf. Landaburu (1988: 163) and Trillos Amaya (2000: 749). 68 2 The Chibcha Sphere
Table 2.3 Overview of the consonant inventories of the Arhuacan languages
Labial Alveolar Palatal Velar Glottal
Stops Voiceless p t k ʔ**** Voiced b d g Affricates Voicelessc ˇ Voiced dzˇ * Fricatives Voiceless ss** ˇ h** Voiced w zz ˇ Nasals m n [ŋ]***** Vibrants Simple r* Multiple rr*** Laterals l**
* not in Kogui, ** marginal or absent in Ika, *** only in Damana, **** not in Damana, ***** only in Ika
However, many of the characteristics that are of interest in a general survey like the present one coincide and, therefore, justify a combined treatment. All the Arhuacan languages have voiceless and voiced stops: p, b; t, d; k, g (intervocalic k is realised as a fricative [x] in Kogui). They also distinguish between voiceless and voiced alveodental sibilants: s, z. The distinction between voiceless and voiced palatal sibilants (sˇ, zˇ)isfound in all three languages, with the observation that sˇ is marginal in Ika. Furthermore, a voiceless palatal affricate cˇ occurs in all three languages, its voiced counterpart d zˇ being limited to Damana and Ika. The glottal stop (ʔ ) plays an important role in Ika, a marginal one in Kogui, and it is absent from Damana. Conversely, the velar or glottal fricative h is frequent in Damana and Kogui, but marginal in Ika. As far as the vibrants and laterals are concerned, Damana stands out with three distinct sounds: lateral l, simple vibrant r, and multiple vibrant rr. Ika only has r, and Kogui only has l. All three languages have labial and alveodental nasals (m, n); the latter assimilates to the articulation place of a following stop. In Ika the velar nasalŋ can occur between vowels at morpheme boundaries, and may thus be interpreted as a separate, albeit peripheral phoneme. Ika has a bilabial approximant w,which is pronounced fricatively [β], except before a.Itisalso found in Damana, where a fricative realisation obtains between vowels. Its absence from the consonant inventory of Kogui (Ortiz Ricaurte 2000) seems to be the result of different analyses; cf. Ika wak ,Kogui uaka [w´axa] ‘fish’. Table 2.3 contains a combined representation of the consonant inventories of Damana, Ika and Kogui. Consonant clusters can be found in initial position (stop + r in Ika; sibilant + stop in Kogui). In these cases there is often a suppressed vowel that can be restored even 2.6 Languages of the Sierra Nevada de Santa Marta 69 synchronically. All Arhuacan languages have labiovelar consonants (kw, gw). These are generally not treated as separate phonemes in the available descriptions, but as consonant clusters or consonant–vowel sequences. In Ika most syllable-final stops are disallowed, and the number of consonant-clusters is limited. Prevocalic k in Ika often corresponds with a glottal stop in word- or syllable-final position, e.g. nak-a ‘to come’ (perfective participle) vs. naʔ (indefinite participle) (Landaburu 2000: 743); cf. also Ika aʔn ‘stone’, Kogui hag-, hagi; Ika kaʔ ‘earth’, Kogui kag-, kagi. The vowel inventory is basically the same in all three languages and consists of six basic vowels: a, e, i, o, u and a sixth vowel ,which has been described as a central vowel [ə]inDamana and in Ika, and as an unrounded high back vowel [ ]inKogui. For Ika, Frank (1990) recognises two central vowels, high central and mid central [ə], instead of one, but admits that their distribution is largely predictable. Damana has an additional nasal vowel u˜ of unclear status (Trillos Amaya 2000: 751). Vocalic nasality is also found in Kogui, where it is limited to a small number of instances (e.g. hay˜u´ ‘coca’), including very few minimal pairs (Ortiz Ricaurte 2000: 760). Stress is largely predictable in Damana and Ika, but distinctive in Kogui. The morphophonology and morphosyntax of the Arhuacan languages stand out on account of their great complexity and highly unusual character. In the context of this chapter we can only mention a few examples, referring to the literature mentioned above for a fuller picture. The Arhuacan languages are mainly verb-final with a preferred word order of subject–object–verb. Adjectives follow the substantive they modify, whereas genitive modifiers and possessors precede it. In Ika many adjectives in postnominal modifier position are followed by a nominalised verb kaw-a ‘seeming’ in a sort of adjective phrase reminiscent of a relative clause.
(20) dˇze k kaw-a water cold seem-N ‘cold water’ (Frank 1990: 32)
Nouns and noun phrases can receive case markers, as illustrated by the following example from Damana, where -ˇze indicates genitive and -rga allative case:
(21) ra-n-ˇze ade te-rga I-EU-G father field-AL ‘My father is in the field (has gone to the field).’ (Trillos Amaya 2000: 753)
Pronominal possession is often expressed by a genitive phrase. In addition, prefixed possessive markers indicating personal reference also occur. Prefixed possessives are fre- quently found with kinship terms, but not exclusively. Kogui, for instance, has two ways to express the notion ‘my house’, with a prefix and by means of a genitive construction; see example (22). The full word form for ‘house’, hu-´ı, contains a petrified suffix element 70 2 The Chibcha Sphere
-´ı,which can be suppressed both in compounds (e.g. h´u-kala ‘roof of house’) and in inflected forms (e.g. na-h´u).12
(22) na-h´u na-h´ı hu-´ı 1P.SG-house 1.SG-G house-LS ‘my house’ ‘my house’ (Ortiz Ricaurte 2000: 765)
Subject and object are not obligatorily marked for case. Nevertheless, Ika has a locative case marker -seʔ,which can be added to the agent–subject of a transitive verb in an ergative function. This marker can remove doubts as to which constituent plays the role of subject in a transitive construction. It is used, for instance, when the subject stands immediately before the verb, either because there is no overtly expressed object, or because the prevalent SOV order has been modified for pragmatic or other reasons. The marker -seʔ is frequently found in combination with a topicalising suffix -ri.Foran extensive discussion of the use of both suffixes see Frank (1990: 115–34).
(23) maikən per -ri ∅-kə-g-a na gwiadzˇ ina-seʔ-ri three dog-TO 3O-DA-eat-N be/past13 puma-E-TO ‘The puma ate his three dogs.’ (Frank 1990: 116)
Verbs in Arhuacan languages often exhibit a set of competing stems which can differ in their final consonants and internal vowels. Several types of nominalised and non- finite verb forms are derived from these stems. The morphosyntax of the Arhuacan languages is furthermore characterised by an extensive use of auxiliary verbs, which may appear in strings following a main verb. The choice of the appropriate form both for non-independent main verbs and for auxiliaries depends on the auxiliary that follows. In Ika the verb phrase is regularly ended by a suffix indicating the epistemic status of the sentence (validation), whereas in Kogui prefixes fulfill such a role. Auxiliary verbs may carry the morphology that cannot be accomodated on the main verb, but in other cases their presence is merely pragmatic or serves the purpose of refining temporal and aspectual distinctions. A characteristic case in which a morpho- logical element is transferred from the main verb to an auxiliary is negation. In Ika the negative marker -uʔ is suffixed to the root of a main verb, creating some sort of negative participle; the latter is followed by a form of nan ‘to be’. In example (24) the main verb
12 The bound form hu- coincides with a widespread Chibchan root for ‘house’. The conservative character of Kogui appears clearly in the Chibchan reconstructions presented in Constenla Uma˜na (1993: 111–13). For the suffix -´ı see also Constenla Uma˜na (1981: 362). 13 The element na has received two interpretations, (1) that of a ‘distal’ past marker (Frank 1990: 63–4); and (2) that of a nominalised form of the verb nan ‘to be’. According to Landaburu (2000), it acts as an auxiliary verb in indefinite past forms. 2.6 Languages of the Sierra Nevada de Santa Marta 71 is cwaˇ ‘to see’; when -uʔ is attached to it, it becomes c-uˇ ʔ. Before the suffix -w- the appropriate stem for the auxilary verb ‘to be’ is nar-.14
(24) c-uˇ ʔ nar-w-in see-NE be-1S.SG-DV ‘I do not see.’ (Landaburu 2000: 743)
The expression in (24) can be made more complex by adding a form of the auxiliary verb u/aw- ‘to do’, ‘to have’, as in (25). The negative participle of u/aw-isa-uʔ:
(25) cwaˇ a-uʔ nar-w-in see have-NE be-1S.SG-DV ‘I have not seen.’ (Landaburu 2000: 743)
Personal reference marking in the Arhuacan languages is remarkable for the num- ber of syntactic roles that can be expressed. In Ika Landaburu (1992: 12; 2000a: 740) distinguishes as many as five formal possibilities. These are subject, direct object (ac- cusative), indirect object (dative), beneficiary and possession acted upon. Non-subject personal reference is indicated by means of prefixes. The latter consist of more or less constant elements accompanied by an additional marker identifying the syntactic role (dative, beneficiary, etc.). Fusion of the respective elements occurs. The non-subject roles can be expressed in combination with the subject role, but they cannot be combined with each other within a single verb form. The distinction between non-subject roles in the verb form is all the more important due to the fact that many Arhuacan verbs are im- personal, their main participant being encoded as accusative, dative, etc. Example (26) from Damana illustrates the combined use of subject and accusative markers.
(26) m -n -paˇs-ka 2S.SG-1O.SG-beat-FM.2S.SG ‘You beat me.’ (Trillos Amaya 1989: 54)
The subject marker in (26) is a prefix, but it is accompanied in Damana by a suffix -ka (-ga after monosyllabic roots beginning with k, z or zˇ), which plays an accessory role in the identification of the subject. Its primary function is that of a marker of a factual mood, indicating the general validity of the stated event. It varies according to person and number, the form -ka being limited to third-person subjects and second-person-singular subjects. Dative marking with an impersonal verb (zˇ n- ‘to dream’) is illustrated in (27).
14 In Landaburu’sanalysis, the reference of -w- to a first-person-singular subject is not the primary, but rather a derived function of that suffix, which he defines as an ‘intralocutive’. It denotes a combination of speaker–subject and present tense. After roots ending in -k/-ʔ its allomorph is -kw-; see example (30) below. 72 2 The Chibcha Sphere
The choice of the prefix mi- identifies it as dative, not accusative; there is no special marker for third-person subject.
(27) mi-n-ˇz n-ga 2.SG.DA-EU-dream-FM.3S ‘You dream.’ (lit. ‘Dream comes to you.’) (Trillos Amaya 1989: 54)
A comparable example from Kogui (28) shows the use of a special dative marker -k-, which follows the person marker. The impersonal verb is nuni ‘to want’. The initial n of the stem changes to l according to a frequent morphophonemic rule of the Kogui language.
(28) big´ıˇza na-k-l´uni pineapple 1O.SG-DA-want ‘I want a pineapple.’ (Ortiz Ricaurte 2000: 774)
Accusative marking with an impersonal verb can be observed in example (29) from Ika. It is followed by (30), which illustrates a verb form with a beneficiary marker. (The auxiliary verb nuk (∼nuʔ ) ‘to be’ in combination with the nominalisation in - n expresses a progressive form.) The future tense in Ika is expressed by an impersonal auxiliary verb -ŋgwa.Itobligatorily takes an accusative object marker that refers to the actor. The auxiliary verb -ŋgwa is preceded by the imperfective participle (in - n)ofthe verb it dominates (31).
(29) n-aʔtikuma na 1O.SG-forget be/past ‘I forgot.’ (Frank 1990: 9) (30) akusa win-i-zas- nnuʔ-kw-in [Spanish aguja ‘needle’] needle 3O.PL-BN-save-N be-1S.SG-DV ‘I am saving needles for them.’ (Frank 1990: 71) (31) pinna dzˇ una was- nn -ŋgwa all kind chase-N 1O.SG-F ‘I will chase all kinds (of animals).’ (Frank 1990: 61)
The interplay of prefixes and suffixes referring to person of subject is illustrated in (32) by means of the factual (habitual) mood paradigm of the verb buˇs- ‘to spin’ in Damana (Trillos Amaya 1989: 87): 2.6 Languages of the Sierra Nevada de Santa Marta 73
(32) 1S.SG buˇs-uga ‘I spin’ 1S.PL buˇs-kurra ‘we spin’ 2S.SG m -buˇs-ka ‘you (sg.) spin’ 2S.PL m -buˇs-kwa ‘you (pl.) spin’ 3S.SG buˇs-ka ‘he/she spins’ 3S.PL yi-buˇs-ka ‘they spin’
The Damana affixes for virtual (non-realised) and real (realised) mood are inserted between the root and the variable suffix, as in buˇs- n-kurra ‘we may spin’ or m -buˇs- an-ka ‘you spun’. In Ika the interplay of subject-marking prefixes and corresponding suffixes that also convey such meaning is even more subtle, because the latter are selected on the basis of deictic distinctions involving both personal reference and tense. Further temporal distinctions are made explicit by variation in the stems to which the suffixes can be attached. For a detailed discussion of this complicated part of Ika verbal morphology see Landaburu (1988, 1992, 2000a); cf. also Frank (1990: 63–6). The system of Damana and Ika pronouns and object markers is based on the distinction of three persons and singular or plural number. At the least in its set of possessive markers Kogui expresses a distinction between inclusive and exclusive first person plural. The form s n- ‘our (exclusive)’ reflects a Chibchan first-person-plural marker not recorded in the other Arhuacan languages.15 Its inclusive counterpart na-wi- is the regularly derived plural of the first-person-singular marker na-.Inother contexts the element s n- is used indistinctively both for inclusive and exclusive (33).
(33) gaik´a-li ni ub´ıˇsa s n-ka-l´ai snow.mountain-L water much 1O.PL-DA-be ‘On the snow-capped mountain we have a lot of water.’ (Olaya Perdomo 2000: 782)
Some valency-changing processes show a clear affinity with other Chibchan lan- guages (cf. section 2.5 on Cuna). It is the case in example (34), where u- is a transitiviser, and (35), where a- is an intransitiviser.16 The examples are from Kogui. (The substi- tution of stem-initial consonants is part of a regular morphophonemic alternation; cf. also (28).)
15 Constenla Uma˜na (1981: 430) reconstructs *se˜´ʔ for Proto-Chibchan on the basis of the Talamancan (Costa Rican) languages Bribri, Cab´ecar and T´erraba. 16 A Chibchan language of Central America (Honduras) that uses the prefix u- as a transitiviser is Paya (see Holt 1989). 74 2 The Chibcha Sphere
(34) n´aˇsi ‘to come’ u-l´aˇsi ‘to bring’ (Olaya Perdomo 2000: 781) (35) gwaˇ´si ‘to kill’ a-kwaˇ´si ‘to kill oneself’ (Constenla Uma˜na 1990: 117)
In Ika causative derivation can be achieved by adding a suffix -s- to the root, usually in combination with further morphophonemic adaptations; cf. (36) and (37). Another strategy consists in raising the root-vowel with simultaneous replacement of root-final -n- by -ŋ- (Frank 1990: 66–7); cf. (38).
(36) k mma- ‘to sleep’ k mma-s- ‘to cause to sleep’ (37) aruk- ‘to go up’ aru-s- ‘to lift’ (38) con-ˇ ‘to enter’ cuˇ ŋ- ‘to let someone enter’
Also in relation to Ika, Frank (1990: 55) lists a series of transitive verbs differing semantically by the shape of the object that is handled (39). Similarly, intransitive verbs referring to position may differ according to the shape of the subject (40).
(39) gaka ‘to place long and thin objects’ sa ‘to place three-dimensional objects’ pan ‘to place flat objects’ coˇ ʔ ‘to place things with an upright position’ (40) aʔ-gei-kwa ‘to be in (of long objects)’ aʔ-p n-kwa ‘to be in (of flat objects)’ aʔ-ni-kwa ‘to be in (of three-dimensional objects)’ aʔ-nuk ‘to be in (of upright objects)’
Arhuacan languages have fully developed numeral systems formed on a decimal basis. As Constenla Uma˜na (1988) has pointed out, some of the Arhuacan numerals contain petrified classifier elements also found in other Chibchan languages. One of them is *kwa2 ‘seed’, for instance, in Kogui m´aigwa ‘three’, k´ugwa ‘seven’, ug´ wa ‘ten’, m´uˇzugwa ‘twenty’. The Arhuacan languages have several lexical borrowings from Spanish, some of which are quite old. An example is Damana turruma,Kogui tuluma, from Spanish turma,anearly term for ‘potato’ that was widely used in New Granada. Damana galina [galyina] ‘chicken’ (Spanish gallina) and paka ‘cow’ (Spanish vaca) are evident loans. In spite of their isolation the Kogui have adopted a few characteristic loan words, such as k´alta ‘book’, ‘paper’ (from Spanish carta) and gw´ıbu ‘egg’ (from Spanish huevo). Avery important cultural term is kw´ıbulu or kw´ıbulo,which refers to the traditional Kogui village. Although it is perfectly adapted to Kogui pronunciation, it does not seem far-fetched to derive its etymology from Spanish pueblo ‘village’. 2.7 Chimila 75
2.7 Chimila The Chimila language (or Ette Taara)isspoken by the descendants of a once numerous population (Ette Ennaka) inhabiting the space situated between the lower Magdalena river to the west and southwest, and the Sierra Nevada de Santa Marta and the Cesar river to the east. The strategic centre of the Chimila homeland was formed by the Ariguan´ıriver valley, which constitutes the border of the modern departments of Cesar and Magdalena. During the conquest the Chimila were much feared because of their use of poisoned arrows. They were referred to as ‘Caribs’ (Hemming 1978: 70; cf. section 2.1). In reality, the Chimila speak a language of the Chibchan family. About 1720 the Chimila people, exasperated by the excesses of colonisation, began a guerrilla war that was conducted with great cruelty on both sides (Chaves et al. 1995: 127–34). Although this war reached its highest intensity in the eighteenth century, ruth- less violence against the Chimila aiming at their extermination continued until well into the twentieth century. In 1990 the last traumatised and largely assimilated survivors were assigned a small reservation called Issa Oristuna, near San Angel (Magdalena), which harbours some eighty families (Trillos Amaya 1997: 25). The language, which had been practised in secret for a long time, is now spoken more openly, but field research is still difficult due to the Chimila’s understandable distrust of outsiders. The total number of Chimila is estimated at 900 (Arango and S´anchez 1998). In 1947 the anthropologist Reichel-Dolmatoff published an article on the Chimila language, summarised in Mel´endez Lozano (2000a). The information on Chimila con- tained in the present study is based on Trillos Amaya (1997). She provides a very good and honest impression of the language, admitting that many of the analyses offered must remain provisional for lack of data. Trillos Amaya respected the restrictions set by the community upon her fieldwork in the expectation of a more open attitude that may develop over the coming years. The phonology of Chimila still retains several unsolved questions. According to Trillos Amaya (1997: 66), its five vowels (a, e, i, o, u) can be short, long, interrupted (glottalised) and aspirated. Monosyllabic words ending in a long vowel can bear contrastive tone, e.g. t´o: ‘maraca (a musical instrument)’ with rising tone, versus t`o: ‘heart’ with descending tone. Polysyllabic words contain a tone-bearing syllable of which the tone appears to be partly determined by the nature of a following consonant (descending tone before geminates and r, rising tone before non-geminate obstruents). In Trillos Amaya’s study, tonal distinctions are not indicated except for a few relevant pairs in the phonological chapter (Trillos Amaya 1997: 75–6). The consonant inventory comprises a series of voiceless oral stops and affricates (p, t, cˇ, k), a prenasalised series (mb, nd, ndzˇ, ŋg) and a nasal series (m, n, ny, ŋ). In addition, there is a voiced stop (g) and a voiced affricate (dzˇ). All these oppositions are fairly well exploited, e.g. ka: ‘breast’, ga: ‘excrement’, ŋga: ‘wing, feather’, ŋa:` ‘to hoist up’, 76 2 The Chibcha Sphere
ŋa:´ ‘to lie down’. All the velar sounds have labiovelar counterparts (kw, gw, ŋw, ŋgw) The remaining consonants are a lateral l,apreglottalised tap r, the laminal and velar fricatives s and h, and an approximant w. Syllable-initial consonant clusters may consist of a stop + r.Inclusters after velar consonants r is pronounced as a uvular trill. Most consonants (not the prenasalised ones, and not h, r, w)have tense or geminate counterparts. Gemination appears to be an automatic process, occurring after a stressed short vowel. Nevertheless, gemination is sometimes found more than once in a word (even in consecutive syllables), and Trillos Amaya mentions the case of suffix pairs in which gemination appears to be contrastive, as in (41):
(41) ki:ro-tikwi ki:ro-tikkwi chicken-PL.DI chicken-not.fully.grown ‘small chickens’ ‘small chick’ (Trillos Amaya 1997: 75)
The interaction between gemination, vowel length, accent and tone in Chimila is a field that requires further investigation. Some of these aspects are discussed in Malone (1997–8). As might be expected, Chimila shares a number of characteristics with the related languages of the Arhuacan group, with which it has been in contact over centuries. As in Ika, some verbs exhibit internal stem variation depending on the affixes with which they are combined.
(42) kenne-riddzˇ akonni-ddzˇ a eat-AG eat-PS.N ‘eater’ ‘food’ (Trillos Amaya 1997: 83–4)
Most conspicuous among these similarities is the use of a set of auxiliary verbs, which follow nominalised or otherwise non-finite main verbs. The selection of a particular auxiliary verb (here glossed as ‘be’ or ‘do’) may reflect tense or mood distinctions, but in many cases there appears to be no clear semantic distinction that determines the choice. The matter requires further research. Finite verbs must either be specified as declarative or interrogative. The following examples illustrate the use of auxiliaries and the declarative marker -(t)te.
(43) hoggwa17 ŋa-∅-tte bathe be-3S.SG-DV ‘He bathes.’ (Trillos Amaya 1997: 157)
17 Trillos Amaya suggests that the final vowel in a full verb, such as hoggwa, has an aspectual function. Note that verbs such as hoggwa in constructions with an auxiliary verb must also be nominalised. If not, it may be necessary to reinterpret the auxiliary verb stems as suffixes. Considering that the relevant information is imprecise and insufficient, we will not represent aspect and (covert) nominalisation in our glosses. 2.7 Chimila 77
Table 2.4 Possessive modifiers in Chimila (from Trillos Amaya 1997: 126)
Singular Dual Plural
1 pers. naʔ na na-ra 2 pers. maʔ ma ma-ra 3 pers. nih ni nih-ne
(44) hoggwadzˇ a-∅-tte bathe do-3S.SG-DV ‘He will bathe’. (Trillos Amaya 1997: 157)
When directly affixed to a noun, the declarative marker -(t)te indicates that the noun is used predicatively (45).
(45) iŋŋŋŋa-te smoke-DV ‘It is smoke.’ (Trillos Amaya 1997: 161)
In contrast to the Arhuacan languages, Chimila is predominantly or entirely suffixing. The language has a set of possessive personal reference markers, which are prefix-like elements encoding the distinctions of first, second and third person, as well as singular, dual and plural number. In spite of the syntactic limitations, there are arguments to consider these possessive markers as free forms, rather than as prefixes. As a matter of fact, a numeral can intervene between a possessive marker and the noun it modifies (46).18
(46) naʔ mbuhna oggwe 1P.SG two child ‘my two children’ (Trillos Amaya 1997: 141)
Possessive constructions are obtained by the juxtaposition of a possessive marker, the possessed and the possessor in that order. No genitive case marker occurs.
(47) nih oggwe taki:riddzˇ aʔ o 3P.SG child chief ‘the son of the chief’ (Trillos Amaya 1997: 127)
In the verb personal reference and number are indicated by means of suffixes. These suffixes bear no formal relationship to the possessive markers. There are different sets
18 Compare the case of the Araucanian possessive markers in chapter 5. 78 2 The Chibcha Sphere
Table 2.5 Personal reference markers for subject and object in Chimila (based on Trillos Amaya 1997: 123, 125, 163)
Subject markers Subject markers in negation Object markers Singular Dual Plural Singular Non-singular Singular Non-singular
1 -n -ŋkre -ŋkre-mbre -na -na-ŋkre -nu -nu-ra 2 -uka -uka-ra -uka-ra-mbre -ka -ka-ra -dzˇu ∼ -ˇcu -dzˇu-ra 3 -Ø -(n)ne -(n)ne-mbre -Ø -ne -wi -(n)ne
for the identification of subject and object, respectively. In addition, there is a sep- arate set of personal reference markers used to indicate the subject in a negation. Table 2.5 represents the three sets of verbal personal markers that refer to subject and object. As can be seen in table 2.5, the category plural for subject markers is systematically distinguished from the dual by the addition of a plural marker -mbre. This marker -mbre is also used to indicate plural with nouns. It can be left out, so that the difference between plural and dual is not always overtly expressed. In the third person plural the order of the suffixes represented in table 2.5 is not always respected, for instance, in dzˇuŋŋa- mbre-nne-tte ‘they (more than two) walk’ (Trillos Amaya 1997: 123). The elements -ŋkre, -ra and -(n)ne are non-singular markers, with functions limited to the pronominal system. Object markers have only been found in verbs with a third-person subject (49). The scarce examples of use recorded so far do not yet allow a full appreciation of the possibilities. In negative sentences personal reference markers referring to first- and second-person subject are attached to the adverbial negation marker dzˇumma, not to the verb itself (50). By contrast, the third-person non-singular marker -ne is attached to the main verb, not to the negative adverb dzˇumma (51).
(48) kenne ka-uka-ra-tte eat be-2S-D-DV ‘The two of you eat.’ (Trillos Amaya 1997: 124) (49) ce:-riˇ seddzˇ akoʔ -dzˇ u-∅-tte money-DA give be-2O-3S.SG-DV ‘He gave you money.’ (Trillos Amaya 1997: 109) (50) dzˇ umma-ka dzˇ uŋŋŋŋa not-2S.SG walk ‘You do not walk.’ (Trillos Amaya 1997: 163) 2.7 Chimila 79
(51) dzˇ umma dzˇ uŋŋŋŋa-ne not walk-3S.PL ‘They do not walk.’ (Trillos Amaya 1997: 163)
The noun morphology of Chimila displays an elaborate inventory of case markers and postpositions with a particular emphasis on spatial distinctions and types of association. Strings of case markers referring to spatial relations occur; in (52), for instance, the case marker -ŋa refers to ‘action within a space’, whereas -(s)sa implies a previous motion into that space.
(52) ummaŋenta ka-∅-tte assu ndiddzˇ o-ŋa-ssa become.quiet be-3S-DV only rain-L-AL ‘He only became quiet under the rain.’ (Trillos Amaya 1997: 115)
Both subject and object can remain unmarked in a constituent order which is predom- inantly SVO. An ergative–dative case marker -ri is available for the purpose of marking off the actor of a transitive verb from its object, as illustrated in (53). However, in (49) we have seen that -ri can also mark a direct object, when there is no independent expression of either the subject or a dative. The matter requires further investigation.
(53) noggwe-ri assu tukku wi-∅-tte kawi-manta he-E only see do-3S-DV cabildo-LS ‘He only saw the cabildo.’19 (Trillos Amaya 1997: 108)
Chimila has a set of numeral classifiers: gwa- for round objects; ti:- for long objects; kra:- for corncobs; mbri:- for animals and clothing. As in Cuna, these are prefixed to the numerals.
(54) ti:-muhna CL-two ‘two pencils, bananas, etc.’ (Trillos Amaya 1997: 139) (55) mbri:-mahna CL-three ‘three animals, clothes, etc.’ (Trillos Amaya 1997: 139)
The Chibchan affinity with Chimila is not only clearly evident from its lexicon, but also from its frequent use of the lexical suffixes -kwa/-gwa and -kra; for instance, in kak-kwa ‘mouth’, wa:-kwa ‘eye’, rug-gwa ‘neck’; ha:-kra ‘head’, kik-kra ‘bone’. These
19 Cabildo: traditional administrative board of an indigenous community. The element -manta is translated as ‘skin’ (Trillos Amaya 1997: 86–8). Its function in this particular context remains obscure. 80 2 The Chibcha Sphere elements reflect the classifiers *kua2 ‘seed’ and *kara3 ‘bone, stick’ reconstructed for the proto-language by Constenla Uma˜na (1990).20
2.8 Bar´ı The Bar´ıorDobocub´ı occupy an area at the border between Colombia and Venezuela corresponding to the southern part of the Sierra de Perij´a (department of Norte de Santander in Colombia and state of Zulia in Venezuela). Together with their neighbours to the north, the Cariban Yukpa, the Bar´ıwere traditionally referred to as Motilones ‘shaven heads’. Due to their hostile attitude towards outsiders and, among other things, their resistance to oil prospectors working in the area, the Bar´ıwere known as Motilones bravos ‘wild Motilones’ in contrast to the Yukpa, who were called mansos ‘tame’. The Bar´ı inhabit thirteen villages on the Colombian side, as well as a number of villages in Venezuela. In Colombia their number is estimated at approximately 3,500 (Arango and S´anchez 1998). The Bar´ı language, which belongs to the Chibchan family, has only recently begun to be studied (Mogoll´on P´erez 2000). The vocabularies of Catarroja and Guti´errez published by Rivet and Armellada (1950) have only a few words in common with the more recently collected data. Like its Proto-Chibchan ancestor, Bar´ıisatonal language. Two basic tone levels, low and non-low, are distinguished. The non-low tone is subject to some subtonemic variation. No examples are provided of tonal contrast between vowels that are part of the same root.
(56) k´abb´u ‘he sleeps’ k`abb`u ‘heron’ (Mogoll´on P´erez 2000: 724)
Noun phrases display special internal tonal patterns, which suggest grammatical func- tions such as ‘genitive’ or ‘subordination’. No further grammatical markers specifying the relation appear to be used. Modifiers precede the modified in a genitive construction, but follow their heads in a noun–adjective combination.
(57) c´ˇıd´u´u´abb´ı [ˇc¯ıd¯u:abb¯ ¯ ı] snake blood ‘snake blood’ (Mogoll´on P´erez 2000: 724) (58) c´ˇıd´u´u`abb`ı`ı [ˇc¯ıd¯u:abb¯ ` ı`:] snake big ‘big snake’ (Mogoll´on P´erez 2000: 724)
20 The superscript numbers 2 and 3 in the reconstructed forms refer to mid and high tone, respec- tively (Constenla Uma˜na 1989: 37). 2.9 The Muisca language 81
The Bar´ı language distinguishes six oral (a, e, i, , o, u) and six nasal vowels (a˜, e˜, ˜ , ˜ , o˜, u˜). Long vowels are interpreted by Mogoll´on P´erez as sequences of same vowels and tone-bearing segments, rather than as single phonemes. Bar´ı has a small, asymmetrical consonant inventory, which consists of four stops (b, t, d, k), two fricatives (h, s), one true nasal (m), a multiple vibrant (rr), and two variable resonants. In Mogoll´on P´erez (2000) these resonants were classified as nasals (n, ny). However, a fully nasal realisation ([n], [ny]) is found in one specific environment only, namely, in word-initial position before a nasal vowel. Elsewhere, they are either oral ([r], [y]), or slightly nasalised. As a matter of fact, the least environmentally influenced allophones of the two resonant phonemes are non-nasal; e.g. in /nyinu/ [yiru] ‘yesterday’. The opposition of the alveodental resonant [r], which is found between two oral vowels, and the multiple vibrant rr is that of two vibrants; e.g. /kinu/ [kiru] ‘to spin’ versus /kiru/ [kirru] ‘to rub with tobacco.’ Alveodental consonants have palatal allophones before high front vowels, as could be seen in examples (57) and (58). Geminate consonants and consonant clusters of two consonants occur frequently at syllable boundaries, and most combinations are permitted. Within a syllable the only consonant cluster found occasionally is that of a stop followed by an alveodental resonant, e.g. [tr], in syllable-initial position. This fact and the contrast between the two vibrants are both Chibchan and northern Colombian areal features shared by Bar´ı.
2.9 The Muisca language The importance of the Muisca or Mosca language (in Muisca: muysc cubun [mw sk kuβun] ‘language of the Indians’) in the sixteenth century can be measured from the amount of descriptive material prepared in the colonial period. In 1538 the high plain of Boyac´aand Cundinamarca was densely populated by speakers of Muisca and related dialects. In spite of the fact that the Spanish colonial authorities and clergy were aware of the linguistic diversity in the area, they chose Muisca as a so-called ‘general language’ (lengua general )tobeused for administration and evangelisation. A chair for Muisca was established in Santaf´edeBogot´ain1582. The first chairholder was a parish priest called Gonzalo Berm´udez (Gonz´alez de P´erez 1980: 60–75). In the meantime, the practicability of Muisca as a general language remained a matter of contention. Apparently, history put the opponents in the right, as the language died out during the eighteenth century. According to Uricoechea (1871: xliv), the language was no longer spoken in 1765. The degree of linguistic diversity found in the Muisca realm becomes evident from observations of chroniclers of the sixteenth and seventeenth centuries. Unfortunately, almost nothing is known about varieties spoken in the Boyac´a and Cundinamarca high- lands, other than Muisca itself. The only specimen of Duit, the language of Boyac´a, is a fragment of a catechism published and analysed by Uricoechea (1871), who reports that this was only a sample of a larger document to which he had access. He, furthermore, 82 2 The Chibcha Sphere affirms that it was written in the language of Duitama, which in his opinion was different from that of Tunja, the capital of Boyac´a. It is unfortunate that the Duit document used by Uricoechea was never located after his death. Uricoechea’s Duit was a distinct language, clearly related to Muisca, but probably not on a level of mutual intelligibility. An interesting element of Duit is the frequent occurrence of r,asound rarely found in Muisca. Correspondences between Duit r,on the one hand, and Muisca s [s,s] ˇ or z [ts], on the other, occur; e.g. Duit sir, Muisca sis(y) [sis( ) ∼ sis(ˇ )] ‘this’; Duit pcuare, Muisca pquahaza [pkwahatsa] ‘lightning’. Constenla Uma˜na (1984: 87) furthermore observes a coincidence of dialectal r with standard Muisca ch [ty], as can be distilled from place names and local borrowings of the native language into Spanish. For instance, in the Spanish–Muisca vocabulary contained in the anonymous manuscript no. 158 of the Colombian National Library (see below) the Muisca version of the name of the town of Zipaquir´aisgiven as Chicaquicha.21 As we shall see, the treatment of r is also an element of differentiation between the sources for the Muisca language itself. Another interesting observation made by Uricoechea (1871: xlii) concerns the ex- istence around 1600 of a mixed dialect of Spanish and Muisca, which he calls jitano (‘gipsy’). He provides a few examples, such as hicabai for ‘horse’ (Spanish caballo) and zebos for ‘lover’ (Spanish mancebo). Unfortunately, no source is mentioned for Uricoechea’s jitano data.
2.9.1 Sources Of the many grammatical studies dealing with the Muisca language mentioned in the historical sources only three are known to have survived. One of these studies is the work of a Dominican and scholar of the Muisca language, Bernardo de Lugo (1619); the two others have no known authors. Lugo’s work has been the subject of a modern facsimile edition with an introduction by Alvar (1978). The two anonymous gram- mars belong to a different tradition from Lugo’s. One of them was sent to Madrid in 1789 by Mutis (cf. section 2.2) and was kept as manuscript no. 2922 in the Library of the Royal Palace in Madrid, together with a Spanish–Muisca vocabulary (cf. Ostler 1999). Most of the contents of the Royal Palace Library grammar were published by Lucena Salmoral (1967, 1970); the vocabulary by Quesada Pacheco (1991). A much earlier publication by Quijano Otero (1883) was apparently based on a copy of the same manuscript that remained in Colombia until it was lost. In 1970, after its partial publication by Lucena Salmoral, the manuscript of the Royal Palace Library grammar
21 The ending -quir´a,sofar unexplained, is highly frequent in Boyac´a and Cundinamarca (e.g. Chiquinquir´a, Moniquir´a, Raquir´a). As Constenla Uma˜na points out, it has often mistakenly been identified with the word quica (also recorded as quyca) ‘place’, ‘town’. 2.9 The Muisca language 83 became lost as well, while being transferred from Madrid to Salamanca (L´opez Garc´ıa 1995: 20).22 The other anonymous work is kept as manuscript no. 158 in the National Library of Colombia. Apart from a grammar, it contains an extensive vocabulary and religious texts. It is available in print in an edition of the Instituto Caro y Cuervo (Gonz´alez de P´erez 1987) and certainly constitutes the most important extant source on Muisca. The authorship of the two anonymous grammars has been the object of much spec- ulation. A candidate for the authorship of either one of the manuscripts is the Italian Jesuit Joseph Dadey (1576–1660), known in his time as the foremost authority on the Muisca language. The history of the three Muisca grammars and their interrelations is discussed in Gonz´alez de P´erez (1980, 1987); see also Ostler (1994). Several works on Muisca date from the nineteenth century. Uricoechea’s Muisca grammar and vocabulary of 1871 is based on a compilation of elements from the two anonymous manuscripts. Adam (1878) presents an interesting study of Muisca word formation and syntax, based on Uricoechea, and Middendorf (1892) a grammatical overview of the language. Acosta Orteg´on (1938) is useful because it contains an exten- sive Muisca–Spanish vocabulary. It, however, must be used with utmost care due to the author’s highly personal and erratic interpretation of the symbols found in the original Muisca sources. For the following succinct discussion of the Muisca facts, we will refer to and draw from a substantial body of linguistic literature that appeared in the 1980s and 1990s. Constenla Uma˜na (1984) presents a modern interpretation of the Muisca sound system. Morphological and syntactic aspects of the Muisca language are treated in Ostler (1993, 1994) and, in a historical-comparative perspective, in Ostler (2000). Translated and annotated specimens of colonial texts written in the Muisca language are found in Ostler (1995, 1999). A further study dealing with Muisca is L´opez Garc´ıa (1995).
2.9.2 Phonology The orthography that was developed for Muisca during the colonial period diverged in several respects from general Spanish usage in accordance with the necessities of the language. For the recovery and interpretation of the sounds of Muisca, Lugo’swork is of particular importance. It contains three symbols not found in the anonymous grammars: