Applying Finite-State Techniques to a Native American Language: Quechua
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Forms and Functions of Negation in Huaraz Quechua (Ancash, Peru): Analyzing the Interplay of Common Knowledge and Sociocultural Settings
Forms and Functions of Negation in Huaraz Quechua (Ancash, Peru): Analyzing the Interplay of Common Knowledge and Sociocultural Settings Dissertation zur Erlangung des Grades eines Doktors der Philosophie am Fachbereich Geschichts- und Kulturwissenschaften der Freien Universität Berlin vorgelegt von Cristina Villari aus Verona (Italien) Berlin 2017 1. Gutachter: Prof. Dr. Michael Dürr 2. Gutachterin: Prof. Dr. Ingrid Kummels Tag der Disputation: 18.07.2017 To Ani and Leonel III Acknowledgements I wish to thank my teachers, colleagues and friends who have provided guidance, comments and encouragement through this process. I gratefully acknowledge the support received for this project from the Stiftung Lateinamerikanische Literatur. Many thanks go to my first supervisor Prof. Michael Dürr for his constructive comments and suggestions at every stage of this work. Many of his questions led to findings presented here. I am indebted to him for his precious counsel and detailed review of my drafts. Many thanks also go to my second supervisor Prof. Ingrid Kummels. She introduced me to the world of cultural anthropology during the doctoral colloquium at the Latin American Institute at the Free University of Berlin. The feedback she and my colleagues provided was instrumental in composing the sociolinguistic part of this work. I owe enormous gratitude to Leonel Menacho López and Anita Julca de Menacho. In fact, this project would not have been possible without their invaluable advice. During these years of research they have been more than consultants; Quechua teachers, comrades, guides and friends. With Leonel I have discussed most of the examples presented in this dissertation. It is only thanks to his contributions that I was able to explain nuances of meanings and the cultural background of the different expressions presented. -
Semantic Transparency in the Lowland Quechua Morphosyntax
Semantic transparency in Lowland Ecuadorian Quechua morphosyntax1 PIETER MUYSKEN Abstract In this paper the properties of Lowland Ecuadorian Quechua, a possibly pidginized variety from this Andean indigenous language family, are eval- uated in the light of the semantic-transparency hypothesis. It is argued that the typological perspective created by looking at a wider range of languages brings some of the basic ideas developed for creole languages into focus. 1. Introduction One of Pieter Seuren’s contributions to the field of creole studies has been the idea that creoles somehow represent semantically transparent structures, as a result of their special history. Together with the late Herman Wekker, Seuren has particularly elaborated this idea in their joint 1986 paper. The dimensions of semantic transparency proposed by Seuren and Wekker (1986: 64) are uniformity, universality, and simplicity. Furthermore, Pieter Seuren has repeatedly stressed the importance of typological considerations, most recently in his Western Linguistics (1998). In this brief paper I will start to illustrate the workings of the principle of semantic transparency for the possibly pidginized Quechua of the Amazonian lowlands of eastern Ecuador, Lowland Ecuadorian Quechua (LEQ). This variety has been described by Leonardi (1966) and Mugica (1967) and is represented in texts gathered by Oberem and Hartmann (1971), but the present paper is based mostly on my own fieldwork in Arajuno (Tena province). Quechua is spoken (by more than eight million speakers) mostly in rural areas of the highlands of Bolivia, Peru, and Ecuador, but small pockets of speakers are also found on the slopes of the Amazon basin of Peru, Colombia, Bolivia, and Ecuador. -
Text Segmentation by Language Using Minimum Description Length
Text Segmentation by Language Using Minimum Description Length Hiroshi Yamaguchi Kumiko Tanaka-Ishii Graduate School of Faculty and Graduate School of Information Information Science and Technology, Science and Electrical Engineering, University of Tokyo Kyushu University [email protected] [email protected] Abstract addressed in this paper is rare. The most similar The problem addressed in this paper is to seg- previous work that we know of comes from two ment a given multilingual document into seg- sources and can be summarized as follows. First, ments for each language and then identify the (Teahan, 2000) attempted to segment multilingual language of each segment. The problem was texts by using text segmentation methods used for motivated by an attempt to collect a large non-segmented languages. For this purpose, he used amount of linguistic data for non-major lan- a gold standard of multilingual texts annotated by guages from the web. The problem is formu- lated in terms of obtaining the minimum de- borders and languages. This segmentation approach scription length of a text, and the proposed so- is similar to that of word segmentation for non- lution finds the segments and their languages segmented texts, and he tested it on six different through dynamic programming. Empirical re- European languages. Although the problem set- sults demonstrating the potential of this ap- ting is similar to ours, the formulation and solution proach are presented for experiments using are different, particularly in that our method uses texts taken from the Universal Declaration of only a monolingual gold standard, not a multilin- Human Rights and Wikipedia, covering more than 200 languages. -
Languages of the Middle Andes in Areal-Typological Perspective: Emphasis on Quechuan and Aymaran
Languages of the Middle Andes in areal-typological perspective: Emphasis on Quechuan and Aymaran Willem F.H. Adelaar 1. Introduction1 Among the indigenous languages of the Andean region of Ecuador, Peru, Bolivia, northern Chile and northern Argentina, Quechuan and Aymaran have traditionally occupied a dominant position. Both Quechuan and Aymaran are language families of several million speakers each. Quechuan consists of a conglomerate of geo- graphically defined varieties, traditionally referred to as Quechua “dialects”, not- withstanding the fact that mutual intelligibility is often lacking. Present-day Ayma- ran consists of two distinct languages that are not normally referred to as “dialects”. The absence of a demonstrable genetic relationship between the Quechuan and Aymaran language families, accompanied by a lack of recognizable external gen- etic connections, suggests a long period of independent development, which may hark back to a period of incipient subsistence agriculture roughly dated between 8000 and 5000 BP (Torero 2002: 123–124), long before the Andean civilization at- tained its highest stages of complexity. Quechuan and Aymaran feature a great amount of detailed structural, phono- logical and lexical similarities and thus exemplify one of the most intriguing and intense cases of language contact to be found in the entire world. Often treated as a product of long-term convergence, the similarities between the Quechuan and Ay- maran families can best be understood as the result of an intense period of social and cultural intertwinement, which must have pre-dated the stage of the proto-lan- guages and was in turn followed by a protracted process of incidental and locally confined diffusion. -
Intonation in Quechua: Questions and Analysis
INTONATION IN QUECHUA: QUESTIONS AND ANALYSIS Erin O’Rourke University of Pittsburgh [email protected] ABSTRACT information about the peaks and valleys occurring in non-final position within the utterance. Research on the suprasegmental system of Quechua has largely focused on the placement of 2. BACKGROUND stress within a word. Previous descriptions of Quechua intonation found in the literature offer a Quechua is an agglutinative language with SOV schematic representation of the intonation contour. word order which is spoken by approximately 8 In order to examine Quechua intonation within the million people primarily in Peru, Bolivia and current framework of Autosegmental Metrical Ecuador and also in parts of Argentina, Colombia (AM) phonology, data from field recordings in and Chile [2]. The Quechuan language family can Cuzco of Southern Peruvian Quechua have been be divided into two main varieties, Central and analyzed. The current paper offers a preliminary Peripheral [8, 15]. In Peru the Peripheral variety sketch of the basic units of intonation employed in with the greatest number of speakers is Southern Quechua, including pitch accents and boundary Peruvian Quechua [3]. Cuzco Quechua, one of the tones. This analysis may provide additional data in Southern Peruvian varieties, has been chosen for the cross-comparison of intonation systems and this description of intonation given its large also aid in the task of applying the principals of the distribution of speakers. AM model to less-commonly studied intonation 3. STRESS IN QUECHUA systems. Quechua has a fixed location for primary stress. As noted in Cerrón-Palomino [2:128], research on Keywords: intonation, Autosegmental Metrical Quechua suprasegmentals has focused mainly on (AM) model, Quechua, pitch accent, boundary stress placement across different varieties: “The tone phenomena of accent, rhythm and intonation are 1. -
(REELA) 5-7 September 2015, Leiden University Centre for Linguistics
Fourth Conference of the Red Europea para el Estudio de las Lenguas Andinas (REELA) 5-7 September 2015, Leiden University Centre for Linguistics Fourth Conference of the European Association for the Study of Andean Languages - Abstracts Saturday 5 September Lengua X, an Andean puzzle Matthias Pache Leiden University In the southern central Andes, different researchers have come across series of numerals which are difficult to attribute to one of the language groups known to be or have been spoken in this area: Quechuan, Aymaran, Uru-Chipayan, or Puquina (cf. Ibarra Grasso 1982: 97-107). In a specific chapter headed “La lengua X”, Ibarra Grasso (1982) discusses different series of numerals which he attributes to this language. Although subsumed under one heading, Lengua X, the numerals in question may vary across the sources, both with respect to form and meaning. An exemplary paradigm of Lengua X numerals recorded during own fieldwork is as follows: 1 mayti 2 payti 3 kimsti 4 taksi 5 takiri 6 iriti 7 wanaku 8 atʃ͡atʃ͡i 9 tʃ͡ipana 10 tʃ͡ˀutx Whereas some of these numerals resemble their Aymara counterparts (mayti ‘one’, payti ‘two’, cf. Aymara maya ‘one’, paya ‘two’), others seem to have parallels in Uru or Puquina numerals (taksi ‘four’, cf. Irohito Uru táxˀs núko ‘six’ (Vellard 1967: 37), Puquina tacpa ‘five’ (Torero 2002: 454)). Among numerals above five, there are some cases of homonymy with Quechua/Aymara terms referring to specific entities, as for instance Lengua X tʃ͡ipana ‘nine’ and Quechua/Aymara tʃ͡ipana ‘fetter, bracelet’. In this talk, I will discuss two questions: (1) What is the origin of Lengua X numerals? (2) What do Lengua X numerals reveal about the linguistic past of the southern central Andes? References Ibarra Grasso, Dick. -
On the External Relations of Purepecha: an Investigation Into Classification, Contact and Patterns of Word Formation Kate Bellamy
On the external relations of Purepecha: An investigation into classification, contact and patterns of word formation Kate Bellamy To cite this version: Kate Bellamy. On the external relations of Purepecha: An investigation into classification, contact and patterns of word formation. Linguistics. Leiden University, 2018. English. tel-03280941 HAL Id: tel-03280941 https://halshs.archives-ouvertes.fr/tel-03280941 Submitted on 7 Jul 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Cover Page The handle http://hdl.handle.net/1887/61624 holds various files of this Leiden University dissertation. Author: Bellamy, K.R. Title: On the external relations of Purepecha : an investigation into classification, contact and patterns of word formation Issue Date: 2018-04-26 On the external relations of Purepecha An investigation into classification, contact and patterns of word formation Published by LOT Telephone: +31 30 253 6111 Trans 10 3512 JK Utrecht Email: [email protected] The Netherlands http://www.lotschool.nl Cover illustration: Kate Bellamy. ISBN: 978-94-6093-282-3 NUR 616 Copyright © 2018: Kate Bellamy. All rights reserved. On the external relations of Purepecha An investigation into classification, contact and patterns of word formation PROEFSCHRIFT te verkrijging van de graad van Doctor aan de Universiteit Leiden, op gezag van de Rector Magnificus prof. -
The Attributive Suffix in Pastaza Kichwa
Brigham Young University BYU ScholarsArchive Theses and Dissertations 2020-06-08 The attributive suffix in Pastaza Kichwa Barrett Wilson Hamp Brigham Young University Follow this and additional works at: https://scholarsarchive.byu.edu/etd Part of the Arts and Humanities Commons BYU ScholarsArchive Citation Hamp, Barrett Wilson, "The attributive suffix in Pastaza Kichwa" (2020). Theses and Dissertations. 8443. https://scholarsarchive.byu.edu/etd/8443 This Thesis is brought to you for free and open access by BYU ScholarsArchive. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of BYU ScholarsArchive. For more information, please contact [email protected], [email protected]. The Attributive Suffix in Pastaza Kichwa Barrett Wilson Hamp A thesis submitted to the faculty of Brigham Young University in partial fulfillment of the requirements for the degree of Master of Arts Janis Nuckolls, Chair Chris Rogers Jeff Parker Department of Linguistics Brigham Young University Copyright © 2020 Barrett Wilson Hamp All Rights Reserved ABSTRACT The Attributive Suffix in Pastaza Kichwa Barrett Wilson Hamp Department of Linguistics, BYU Master of Arts This thesis is a corpus-based description of the attributive suffix -k in Pastaza Kichwa, a Quechuan language spoken in lowland Amazonian Ecuador. The goal of this work is, first, to describe the behaviors, characteristics, and functions of the suffix using data from the Corpus of Pastaza Kichwa (Rice 2018a), and second, to offer a typological analysis of these behaviors in order to identify the most appropriate classification for the suffix. The suffix has previously been described as a nominalizer (Nuckolls & Swanson, forthcoming), and the equivalent suffix in other Quechuan varieties has been described as an agentive nominal relativizer (Weber 1983; Weber 1989; Cole 1985; Lefebvre & Muysken 1988) or a participle (Markham 1864; Weber 1989; Guardia Mayorga 1973; Catta Quelen 1985; Debenbach-Salazar Saenz 1993, Muysken 1994). -
A Grammar of Yauyos Quechua
A grammar of Yauyos Quechua Aviva Shimelman language Studies in Diversity Linguistics 9 science press Studies in Diversity Linguistics Chief Editor: Martin Haspelmath Consulting Editors: Fernando Zúñiga, Peter Arkadiev, Ruth Singer, Pilar Valen zuela In this series: 1. Handschuh, Corinna. A typology of marked-S languages. 2. Rießler, Michael. Adjective attribution. 3. Klamer, Marian (ed.). The Alor-Pantar languages: History and typology. 4. Berghäll, Liisa. A grammar of Mauwake (Papua New Guinea). 5. Wilbur, Joshua. A grammar of Pite Saami. 6. Dahl, Östen. Grammaticalization in the North: Noun phrase morphosyntax in Scandinavian vernaculars. 7. Schackow, Diana. A grammar of Yakkha. 8. Liljegren, Henrik. A grammar of Palula. 9. Shimelman, Aviva. A grammar of Yauyos Quechua. 10. Rudin, Catherine & Bryan James Gordon (eds.). Advances in the study of Siouan languages and linguistics. 11. Kluge, Angela. A grammar of Papuan Malay. 12. Kieviet, Paulus. A grammar of Rapa Nui. 13. Michaud, Alexis. Tone in Yongning Na: Lexical tones and morphotonology. ISSN: 2363-5568 A grammar of Yauyos Quechua Aviva Shimelman language science press Aviva Shimelman. 2017. A grammar of Yauyos Quechua (Studies in Diversity Linguistics 9). Berlin: Language Science Press. This title can be downloaded at: http://langsci-press.org/catalog/book/83 © 2017, Aviva Shimelman Published under the Creative Commons Attribution 4.0 Licence (CC BY 4.0): http://creativecommons.org/licenses/by/4.0/ ISBN: 978-3-946234-21-0 (Digital) 978-3-946234-22-7 (Hardcover) 978-3-946234-23-4 -
Exploring Phonological Areality in the Circum-Andean Region Using a Naive Bayes Classifier
Exploring phonological areality in the circum-Andean region using a naive Bayes classifier Lev Michael, Will Chang, and Tammy Stark This paper describes the Core and Periphery technique, a quantitative method for exploring areality that uses a naive Bayes classifier, a statistical tool for inferring class membership based on training sets assembled from members of those classes. The Core and Periphery technique is applied to the exploration of phonological areality in the Andes and surrounding lowland regions, based on the South American Phonological Inventory Database (SAPhon 1.1.3; Michael et al., 2013). Evidence is found for a phonological area centering on the Andean highlands, and extending to parts of the northern and central Andean foothills regions, the Chaco, and Patagonia. Evidence is also found for Southern and North-Central phonological sub-areas within this larger phonological area. Keywords: naive Bayes classifier; linguistic areality; Andean languages; South American languages 1. Introduction The goals of this paper are to describe the Core and Periphery technique, an intuitively appealing quantitative method for exploring large linguistic datasets for evidence of linguistic areality, and to illustrate the utility of this technique by applying it to a dataset of South American phonological inventories, focusing on the evidence of phonological areality in the Andes and surrounding lowland areas. Core and Periphery is a method that uses as a starting point linguists’ knowl- edge of the languages and history of a region to generate initial hypotheses regarding ‘cores’: sets of languages that constitute possible linguistic areas (Campbell et al., 1986; Thomason, 2000; Muysken, 2008), or parts of ones. -
The Languages of the Andes
THE LANGUAGES OF THE ANDES The Andean and Pacific regions of South America are home to a remark- able variety of languages and language families, with a range of typologi- cal differences. This linguistic diversity results from a complex historical background, comprising periods of greater communication between dif- ferent peoples and languages, and periods of fragmentation and individual development. The Languages of the Andes is the first book in English to document in a single volume the indigenous languages spoken and for- merly spoken in this linguistically rich region, as well as in adjacent areas. Grouping the languages into different cultural spheres, it describes their characteristics in terms of language typology, language contact, and the social perspectives of present-day languages. The authors provide both historical and contemporary information, and illustrate the languages with detailed grammatical sketches. Written in a clear and accessible style, this book will be a valuable source for students and scholars of linguistics and anthropology alike. . is Professor of Amerindian Languages and Cul- tures at Leiden University. He has travelled widely in South America and has conducted fieldwork in Peru on different varieties of Quechua and minor languages of the area. He has also worked on the historical- comparative reconstruction of South American languages, and since 1991 has been involved in international activities addressing the issue of lan- guage endangerment. His previously published books include Tarma Quechua (1977) and Het Boek van Huarochir´ı (1988). . is Professor of Linguistics at the University of Nijmegen. He has travelled widely in the Caribbean and the Andes, and was previously Professor of Sociolinguistics and Creole Studies at the Uni- versity of Amsterdam and Professor of Linguistics and Latin American Studies at Leiden University. -
OSU WPL Ff29 (1984) 70-L04 Why -Ski? a Study of '.'-Erbal Aspect in Conchucos Quechua Anne M. Stewart O. Introduction the Verbal
OSU WPL ff29 (1984) 70-l04 Why -skI? A Study of '.'-erbal Aspect in Conchucos Quechua Anne M. Stewart O. Introduction The verbal sui'fix -skl of the Conchucos dialect of Ancash Quechua[l] is completely absent in the dialect of neighboring Huaraz. Impressionistically speaking, -skI might be said to "characterize" the Quechua of Conchucos. Although it is beyond the scope of this present paper to determine precisely why -skl is restricted almost exclusively to the Conchucos dialect and neighboring areas,(2] the specific function which -skI performs calls for more precise examination. The claim of this paper is that -skI performs a specific and vital function in the modal- aspectual system of the verb 1n the Quechua of Conchucos. Moreover, the complex role which -skl plays in this dialect indicates the likelihood of similar complexities in the other Quechua dialects which employ this suffix, Any further insight into this complex system of verbal derivation which all of the Quechua languages share is expedient for adequate analyses of the languages within the Quechua family, and is likely also to provide insights into the verbal morphology of typologically similar languages. 1. About Quechua Quechua is a language family with a number of members, rather than a single language with a number of dialects. There are approximately six million speakers of these languages, located geographically from Colombia, in the northern portion of South America, to the province of Santiago del Estero, in Argentina, to the south. Quechua is centered along the Andean chain and occurs in adjacent Jungle areas, such as the Quechua of the Napo in Ecuador and the Quechua of the Pastaza in Peru.