Linguistic Phylogeny with Bayesian Markov Chain Monte Carlo: the Case of Indo-European

Total Page:16

File Type:pdf, Size:1020Kb

Linguistic Phylogeny with Bayesian Markov Chain Monte Carlo: the Case of Indo-European Linguistic Phylogeny with Bayesian Markov Chain Monte Carlo: The Case of Indo-European by Stephen Tyndall A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Linguistics) in the University of Michigan 2019 Doctoral Committee: Professor Sarah Thomason, Chair Associate Professor Steven Abney Professor William H. Baxter Professor Benjamin Fortson ©Stephen Tyndall [email protected] ORDID iD: 0000-0001-7276-6695 2019 ACKNOWLEDGMENTS First and foremost, I would like to thank Sally Thomason, without whose encouragement and support I’d never have completed this the- sis. And thank you as well to the rest of the committee for provid- ing advice and ideas. Their suggestions and help have been invaluable. Thanks as well to my friends and colleagues, particularly Kate Sherwood, Terry Szymanski, Rob Gillezeau, Lauren Squires, Tayfun Bilgin, Jon Yip, and Kevin McGowan, who provided support, friendship, and mo- tivation through this long process. Further thanks to the members of GLaM and GEO for all the social and professional aid, and to Bu and Ping for their calming presence through the final stages of this thesis. Finally, thank you to my family, Karen Brichetto, Eric Tyndall, David Richardson, and Doug Tyndall, for all your support during graduate school. I could not have done this without you. ii TABLE OF CONTENTS Acknowledgments ................................... ii List of Figures ..................................... iv List of Tables ...................................... v Abstract ......................................... vi Chapter 1 Introduction ..................................... 1 1.1 Background Information..........................1 1.2 Statistical Methods in Linguistics......................2 1.3 The Particular Problem of Indo-European (Computational) Phylogeny..3 1.4 Dissertation Objectives...........................4 1.5 Scope and Limitations...........................6 1.6 Structure of the Dissertation........................6 1.6.1 Literature Review..........................6 1.6.2 Methodology............................7 1.6.3 Results...............................7 1.6.4 Conclusion.............................7 2 Literature Review .................................. 9 2.1 Computational Phylogeny in Linguistics..................9 2.1.1 Prior Computational Phylogeny Work in Linguistics....... 15 2.2 Models of the spread of the Indo-European Language Family....... 18 2.2.1 Tradition Family Tree(s) of IE................... 18 2.2.2 Non-Computational Dating Methods for the Breakup of PIE... 18 2.2.3 Models of the Breakup and Spread of Proto-Indo-European... 21 2.2.4 The Farming Model........................ 21 2.2.5 The Pastoralist Model....................... 23 2.3 Computational Phylogeny of Proto-Indo-European............ 25 2.3.1 Gray and Atkinson......................... 25 2.3.2 Chang et al. 2005.......................... 30 2.3.3 The Ringe and Warnow Method.................. 31 2.3.4 Computational Phylogeny and Estimating the Dates of Language Splits................................ 35 iii 3 Methodology ..................................... 38 3.1 Specifying Retentions with MRBAYES.................. 39 3.2 Capturing Phonological Changes as Characters for Bayesian MCMC... 40 3.3 Data Coding................................. 42 4 Results ........................................ 47 4.1 A Brief Survey of the Results........................ 47 4.2 Summary of the Results........................... 47 4.2.1 Characteristics and Subdivisions of the Set of Swadesh Lists... 49 4.2.2 Kitchen Sink Results........................ 52 4.3 Six-Way Results............................... 54 4.3.1 Phonological Characters Only................... 54 4.3.2 Lexical Characters Only...................... 58 4.3.3 Lexical and Phonological Characters............... 61 4.3.4 Overall Consideration of the Six Conditions............ 64 4.4 Replication of Gray and Atkinson 2003.................. 64 4.4.1 Replication without Root Node Specification........... 66 4.4.2 Replication with Root Node Specification............. 66 4.5 Discussion.................................. 66 4.5.1 Root Node Selection........................ 68 4.6 Implications and Limitations........................ 70 5 Conclusion ...................................... 72 5.1 Primary Benefits.............................. 72 5.2 Secondary Benefits............................. 73 5.3 Future Work................................. 73 Appendix ........................................ 75 Bibliography ...................................... 81 iv LIST OF FIGURES 2.1 Network from [Holden and Gray, 2006]..................... 10 2.2 Network from [Greenhill and Gray, 2005]. This splitstree network graph of Austronesian demonstrates a strong tree-like signal in the data used to create the graph...................................... 11 2.3 Network from [Bakker et al., 2011]. Splitstree network showing very weak treelike signal in source data............................ 12 2.4 Tree from Gray and Atkinson (2003)...................... 29 4.1 Entire IELEX Lexical Tree............................ 53 4.2 Ancient Languages, Phonology Only, No Retentions Specified......... 56 4.3 Ancient Languages, Phonology Only, Retentions Specified........... 57 4.4 Ancient Languages, Lexical Characters Only, No Retentions Specified..... 59 4.5 Ancient Languages, Lexical Characters Only, Retentions Specified....... 61 4.6 Ancient Languages, Lexical and Phonological Characters, No Retentions Spec- ified........................................ 62 4.7 Ancient Languages, Lexical and Phonological Characters, Retentions Specified 63 4.8 Dyen List, No Specification of Root....................... 67 4.9 Dyen List, Specification of Root......................... 68 5.1 A splitstree network showing the relationships among the Mande languages.. 74 v LIST OF TABLES 3.1 Manner Outcomes of PIE *p........................... 41 3.2 Manner Outcomes of PIE *t........................... 41 3.3 Manner Outcomes of PIE *k´........................... 42 vi ABSTRACT This study aims to improve the application of computational phylogeny to lan- guage families by testing a series of possible improvements in the structure and use of input data, particularly in order to help reconcile computational techniques with traditional historical linguistic family tree creation. The method used is Bayesian Markov Chain Monte Carlo (Bayesian MCMC), with the MRBAYES software package, replicating several prior computational phylogenetic studies of the Indo-European language family. The study examines two hypotheses about changes to the input data for the method: distinguishing innovations from reten- tions in the input data will improve the resulting trees, and providing a bias-free mechanism for incorporating sound change data in addition to lexical data will further improve both the output and the usability of Bayesian MCMC. The results of the study demonstrate small improvements over baseline under both hypothe- ses, demonstrating the utility of both kinds of additional data. Further work will include application of these new data types to other language families. vii CHAPTER 1 Introduction In this dissertation, I aim to improve the application of computational phylogeny to lan- guage families by testing a series of possible improvements in the structure and use of input data, particularly in order to help reconcile computational techniques with traditional historical linguistic family tree creation. The use of computational phylogeny for linguis- tics is both recent and controversial, particularly in its use by Gray & Atkinson (2003) to date the breakup of the Indo-European language family. I extend the method used by Gray & Atkinson in two ways: by including sound change data without pre-selecting particular changes, and by carefully distinguishing innovations from retentions in both lexical and sound change data. 1.1 Background Information The process of constructing family trees (phylogenies) for languages is very similar to con- structing phylogenies for biological species. Even as far back as the 19th century, Darwin observed parallels between languages and species: “The formation of different languages and of distinct species, and the proofs that both have been developed through a gradual pro- cess, are curiously parallel. We find in distinct languages striking homologies due to com- munity of descent, and analogies due to a similar process of formation. ”[Darwin, 1871, 57-58]. A number of techniques have been developed by biologists to produce species family trees and to date the breakup of populations. It should come as no surprise, then, that some of these techniques have been adapted to producing language phylogenies from linguistic data. In particular, a number of scholars, primarily non-linguists, have applied a computational and statistical technique called the Bayesian Markov Chain Monte Carlo (hereafter Bayesian MCMC) method to language families. This technique has been used with considerable success in biology, and some uses within linguistics have shown great promise in generating accurate family trees and dating 1 the breakup of linguistic groups. Recently, the use of the technique to date the breakup of a particularly well-studied language family, Indo-European, has generated significant controversy among historical linguists. The Indo-European language family has one of the longest histories of study in linguis- tic science. The family contains ten major sub-families, the earliest of which,
Recommended publications
  • When Political Institutions Use Sociolinguistic Concepts
    IJSL 2020; 263: 13–18 David Karlander* When political institutions use sociolinguistic concepts https://doi.org/10.1515/ijsl-2020-2076 Abstract: In this essay, David Karlander examines what happens when concepts developed by scholars of language circulate and become embedded in policies and law. In exploring how the distinction between a “language” and a “dialect” became encoded in the European Charter for Regional or Minority Languages (ECRML), Karlander examines the consequences when applied to the status and state support of minority languages in Sweden. What counts as a language, he demonstrates, is not simply an “academic” matter. When sociolinguistics enters the public arena, it has the potential to affect the political and social standing of real communities. Keywords: European Charter for Regional or Minority Languages (ECRML), language policy, Meänkieli, Övdalsk (Elfdalian, Övdalian), politics of linguistics How do committee-type bodies contribute to the formation, circulation, and societal impact of sociolinguistics? How have such institutions inserted socio- linguistic concepts into institutional or political practice? How has sociolinguistics reacted to the reuse of its conceptual goods? These are some of the important questions that Monica Heller poses in her September 2018 Items essay on the SSRC’s Committee on Sociolinguistics. In what follows, I delve further into them. Like Heller, I turn my attention to the borderlands between policymaking, advocacy, and academic research. However, rather than exploring how commit- tee-type institutions attempt to influence or regulate the research priorities of academics, I will focus on processes through which sociolinguistic notions are put to practice by nonacademic institutions. As an example, I discuss some dimensions of the implementation of the European Charter for Regional or Minority Languages (ECRML), a European treaty on “the protection of the historical regional or minority languages of Europe” under the auspices of the Council of Europe (CoE).
    [Show full text]
  • The Shared Lexicon of Baltic, Slavic and Germanic
    THE SHARED LEXICON OF BALTIC, SLAVIC AND GERMANIC VINCENT F. VAN DER HEIJDEN ******** Thesis for the Master Comparative Indo-European Linguistics under supervision of prof.dr. A.M. Lubotsky Universiteit Leiden, 2018 Table of contents 1. Introduction 2 2. Background topics 3 2.1. Non-lexical similarities between Baltic, Slavic and Germanic 3 2.2. The Prehistory of Balto-Slavic and Germanic 3 2.2.1. Northwestern Indo-European 3 2.2.2. The Origins of Baltic, Slavic and Germanic 4 2.3. Possible substrates in Balto-Slavic and Germanic 6 2.3.1. Hunter-gatherer languages 6 2.3.2. Neolithic languages 7 2.3.3. The Corded Ware culture 7 2.3.4. Temematic 7 2.3.5. Uralic 9 2.4. Recapitulation 9 3. The shared lexicon of Baltic, Slavic and Germanic 11 3.1. Forms that belong to the shared lexicon 11 3.1.1. Baltic-Slavic-Germanic forms 11 3.1.2. Baltic-Germanic forms 19 3.1.3. Slavic-Germanic forms 24 3.2. Forms that do not belong to the shared lexicon 27 3.2.1. Indo-European forms 27 3.2.2. Forms restricted to Europe 32 3.2.3. Possible Germanic borrowings into Baltic and Slavic 40 3.2.4. Uncertain forms and invalid comparisons 42 4. Analysis 48 4.1. Morphology of the forms 49 4.2. Semantics of the forms 49 4.2.1. Natural terms 49 4.2.2. Cultural terms 50 4.3. Origin of the forms 52 5. Conclusion 54 Abbreviations 56 Bibliography 57 1 1.
    [Show full text]
  • Indo-European: History, Families & Origins
    Indo-European: History, Families & Origins 1 Historical Linguistics Early observations pointed to language relatedness: The Sanscrit language, whatever by its antiquity, is of a wonderful structure; more perfect than the Greek, more copious than the Latin, and more exquisitely refined than either, yet bearing to both of them a stronger affinity, both in the roots of verbs and in the forms of grammar, than could possibly have been produced by accident; so strong indeed, that no philologer could examine them all three, without believing them to have sprung from some common source, which, perhaps, no longer exists .... Sir William Jones -- Presidential Discourse at the February 2, 1786 meeting of the Asiatic Society 2 Darwin & Historical Linguistics Darwin's On The Origin of Species published in 1859 German biologist Ernst Haeckel persuades his friend - philologist August Schleicher, to read it Schleicher (and generations of historical linguists) apply evolutionary principle to comparative/historical linguistics 3 Test The Methodology Ferdinand de Saussure (1857-1913) used the comparative method to predict that a certain group of sounds had to have existed in Indo-European 20 years later, when Hittite texts were discovered, Saussure's laryngeals were attested! 4 Indo-European Family 5 Indo-European Homeland 6 Family Branching 7 Earliest Attestation of IE Daughter Languages Anatolian 17th c. B.C.E Indo-Aryan (Sanskrit) 15th c. B.C.E Greek 13th c. B.C.E. Iranian (Avestan / Old Persian) 7th c. B.C.E / 6th c. B.C.E. Italic 6th c. B.C.E Tocharian 6th c. B.C.E Germanic 1st c.
    [Show full text]
  • Indo-European Linguistics: an Introduction Indo-European Linguistics an Introduction
    This page intentionally left blank Indo-European Linguistics The Indo-European language family comprises several hun- dred languages and dialects, including most of those spoken in Europe, and south, south-west and central Asia. Spoken by an estimated 3 billion people, it has the largest number of native speakers in the world today. This textbook provides an accessible introduction to the study of the Indo-European proto-language. It clearly sets out the methods for relating the languages to one another, presents an engaging discussion of the current debates and controversies concerning their clas- sification, and offers sample problems and suggestions for how to solve them. Complete with a comprehensive glossary, almost 100 tables in which language data and examples are clearly laid out, suggestions for further reading, discussion points and a range of exercises, this text will be an essential toolkit for all those studying historical linguistics, language typology and the Indo-European proto-language for the first time. james clackson is Senior Lecturer in the Faculty of Classics, University of Cambridge, and is Fellow and Direc- tor of Studies, Jesus College, University of Cambridge. His previous books include The Linguistic Relationship between Armenian and Greek (1994) and Indo-European Word For- mation (co-edited with Birgit Anette Olson, 2004). CAMBRIDGE TEXTBOOKS IN LINGUISTICS General editors: p. austin, j. bresnan, b. comrie, s. crain, w. dressler, c. ewen, r. lass, d. lightfoot, k. rice, i. roberts, s. romaine, n. v. smith Indo-European Linguistics An Introduction In this series: j. allwood, l.-g. anderson and o.¨ dahl Logic in Linguistics d.
    [Show full text]
  • Comments Received for ISO 639-3 Change Request 2015-046 Outcome
    Comments received for ISO 639-3 Change Request 2015-046 Outcome: Accepted after appeal Effective date: May 27, 2016 SIL International ISO 639-3 Registration Authority 7500 W. Camp Wisdom Rd., Dallas, TX 75236 PHONE: (972) 708-7400 FAX: (972) 708-7380 (GMT-6) E-MAIL: [email protected] INTERNET: http://www.sil.org/iso639-3/ Registration Authority decision on Change Request no. 2015-046: to create the code element [ovd] Ӧvdalian . The request to create the code [ovd] Ӧvdalian has been reevaluated, based on additional information from the original requesters and extensive discussion from outside parties on the IETF list. The additional information has strengthened the case and changed the decision of the Registration Authority to accept the code request. In particular, the long bibliography submitted shows that Ӧvdalian has undergone significant language development, and now has close to 50 publications. In addition, it has been studied extensively, and the academic works should have a distinct code to distinguish them from publications on Swedish. One revision being added by the Registration Authority is the added English name “Elfdalian” which was used in most of the extensive discussion on the IETF list. Michael Everson [email protected] May 4, 2016 This is an appeal by the group responsible for the IETF language subtags to the ISO 639 RA to reconsider and revert their earlier decision and to assign an ISO 639-3 language code to Elfdalian. The undersigned members of the group responsible for the IETF language subtag are concerned about the rejection of the Elfdalian language. There is no doubt that its linguistic features are unique in the continuum of North Germanic languages.
    [Show full text]
  • Sanskrit-Slavic-Sinitic Their Common Linguistic Heritage © 2017 IJSR Received: 14-09-2017 Milorad Ivankovic Accepted: 15-10-2017
    International Journal of Sanskrit Research 2017; 3(6): 70-75 International Journal of Sanskrit Research2015; 1(3):07-12 ISSN: 2394-7519 IJSR 2017; 3(6): 70-75 Sanskrit-Slavic-Sinitic their common linguistic heritage © 2017 IJSR www.anantaajournal.com Received: 14-09-2017 Milorad Ivankovic Accepted: 15-10-2017 Milorad Ivankovic Abstract Omladinski trg 6/4, SRB-26300 Though viewing from the modern perspective they seem to belong to very distant and alien traditions, the Vrsac, Serbia Aryans, the Slavs and the Chinese share the same linguistic and cultural heritage. They are the only three cultures that have developed and preserved the religio-philosophical concept of Integral Dualism, viz. ś ukram-kr̥ sṇ aṃ or yang-yin (see Note 1). And the existing linguistic data firmly supports the above thesis. Key Words: l-forms, l-formant, l-participles, ping, apple, kolo Introduction In spite of persistent skepticism among so called Proto-Indo-Europeanists, in recent years many scholars made attempts at detecting the genetic relationship between Old Chinese and Proto-Indo-European languages (e.g. T.T. Chang, R.S. Bauer, J.X. Zhou, J.L. Wei, etc.), but they proposed solely lexical correspondences with no morphological ones at all. However, there indeed exist some very important morphological correspondences too. The L-Forms in Chinese In Modern Standard Mandarin Chinese there is a particle spelled le and a verb spelled liao, both functioning as verb-suffixes and represented in writing by identical characters. Some researchers hold that the particle le actually derived from liao since “the verb liao (meaning “to finish, complete”), found at the end of the Eastern Han (25-220 CE) and onwards, around Wei and Jin Dynasties (220-581 CE) along with other verbs meaning “to finish” such as jing, qi, yi and bi started to occur in the form Verb (Object) + completive to indicate the completion of the action indicated by the main verb.
    [Show full text]
  • Language Attrition: the Next Phase Barbara Köpke, Monika Schmid
    Language Attrition: The next phase Barbara Köpke, Monika Schmid To cite this version: Barbara Köpke, Monika Schmid. Language Attrition: The next phase. Monika S. Schmid, Barbara Köpke, Merel Keijzer, Lina Weilemar. First Language Attrition: Interdisciplinary perspectives on methodological issues, John Benjamins, pp.1-43, 2004, Studies in Bilingualism, 9027241392. hal- 00879106 HAL Id: hal-00879106 https://hal.archives-ouvertes.fr/hal-00879106 Submitted on 31 Oct 2013 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Language Attrition: The Next Phase Barbara Köpke (Université de Toulouse – Le Mirail) and Monika S. Schmid Vrije Universiteit Amsterdam Barbara Köpke Laboratoire de Neuropsycholinguistique Jacques Lordat Institut des Sciences du Cerveau de Toulouse Université de Toulouse-Le Mirail 31058 Toulouse Cedex France [email protected] Monika S. Schmid Engelse Taal en Cultuur Faculteit der Letteren Vrije Universiteit 1081 HV Amsterdam The Netherlands [email protected] Published in : M.S. Schmid, B. Köpke, M. Keijzer & L. Weilemar (2004). First Language Attrition. Interdisciplinary perspectives on methodological issues (pp. 1-43). Amsterdam: John Benjamins. Köpke, B. & Schmid, M.S. (2004). Language Attrition: The Next Phase. In M.S. Schmid, B.
    [Show full text]
  • Languages of the World--Native America
    REPOR TRESUMES ED 010 352 46 LANGUAGES OF THE WORLD-NATIVE AMERICA FASCICLE ONE. BY- VOEGELIN, C. F. VOEGELIN, FLORENCE N. INDIANA UNIV., BLOOMINGTON REPORT NUMBER NDEA-VI-63-5 PUB DATE JUN64 CONTRACT MC-SAE-9486 EDRS PRICENF-$0.27 HC-C6.20 155P. ANTHROPOLOGICAL LINGUISTICS, 6(6)/1-149, JUNE 1964 DESCRIPTORS- *AMERICAN INDIAN LANGUAGES, *LANGUAGES, BLOOMINGTON, INDIANA, ARCHIVES OF LANGUAGES OF THE WORLD THE NATIVE LANGUAGES AND DIALECTS OF THE NEW WORLD"ARE DISCUSSED.PROVIDED ARE COMPREHENSIVE LISTINGS AND DESCRIPTIONS OF THE LANGUAGES OF AMERICAN INDIANSNORTH OF MEXICO ANDOF THOSE ABORIGINAL TO LATIN AMERICA..(THIS REPOR4 IS PART OF A SEkIES, ED 010 350 TO ED 010 367.)(JK) $. DEPARTMENT OF HEALTH,EDUCATION nib Office ofEduc.442n MD WELNicitt weenment Lasbeenreproduced a l l e a l O exactly r o n o odianeting es receivromed f the Sabi donot rfrocestarity it. Pondsof viewor position raimentofficial opinions or pritcy. Offkce ofEducation rithrppologicalLinguistics Volume 6 Number 6 ,Tune 1964 LANGUAGES OF TEM'WORLD: NATIVE AMER/CAFASCICLEN. A Publication of this ARC IVES OF LANGUAGESor 111-E w oRLD Anthropology Doparignont Indiana, University ANTHROPOLOGICAL LINGUISTICS is designed primarily, butnot exclusively, for the immediate publication of data-oriented papers for which attestation is available in the form oftape recordings on deposit in the Archives of Languages of the World. This does not imply that contributors will bere- stricted to scholars working in the Archives at Indiana University; in fact,one motivation for the publication
    [Show full text]
  • Binary Tree — up to 3 Related Nodes (List Is Special-Case)
    trees 1 are lists enough? for correctness — sure want to efficiently access items better than linear time to find something want to represent relationships more naturally 2 inter-item relationships in lists 1 2 3 4 5 List: nodes related to predecessor/successor 3 trees trees: allow representing more relationships (but not arbitrary relationships — see graphs later in semester) restriction: single path from root to every node implies single path from every node to every other node (possibly through root) 4 natural trees: phylogenetic tree image: Ivicia Letunic and Mariana Ruiz Villarreal, via the tool iTOL (Interative Tree of Life), via Wikipedia 5 natural trees: phylogenetic tree (zoom) image: Ivicia Letunic and Mariana Ruiz Villarreal, via the tool iTOL (Interative Tree of Life), via Wikipedia 6 natural trees: Indo-European languages INDO-EUROPEAN ANATOLIAN Luwian Hittite Carian Lydian Lycian Palaic Pisidian HELLENIC INDO-IRANIAN DORIAN Mycenaean AEOLIC INDO-ARYAN Doric Attic ACHAEAN Aegean Northwest Greek Ionic Beotian Vedic Sanskrit Classical Greek Arcado Thessalian Tsakonian Koine Greek Epic Greek Cypriot Sanskrit Prakrit Greek Maharashtri Gandhari Shauraseni Magadhi Niya ITALIC INSULAR INDIC Konkani Paisaci Oriya Assamese BIHARI CELTIC Pali Bengali LATINO-FALISCAN SABELLIC Dhivehi Marathi Halbi Chittagonian Bhojpuri CONTINENTAL Sinhalese CENTRAL INDIC Magahi Faliscan Oscan Vedda Maithili Latin Umbrian Celtiberian WESTERN INDIC HINDUSTANI PAHARI INSULAR Galatian Classical Latin Aequian Gaulish NORTH Bhil DARDIC Hindi Urdu CENTRAL EASTERN
    [Show full text]
  • 1 Introduction 2 Background L2/15-023
    Title: Preliminary Proposal to Encode the Tocharian Script Author: Lee Wilson ([email protected]) Date: 2015-01-26 1 Introduction Tocharian is a Brahmi-based script historically used to write the Tocharian languages (ISO 639-3: xto, txb), traditionally referred to as Tocharian A and Tocharian B, which belong to the Tocharian branch of the Indo-European language family. The script was in use primarily during the 8th century CE by the people inhabiting the northern edge of the Tarim Basin of the Taklamakan Desert in what is now Xinjiang in western China, an area formerly known as Turkestan. The Tocharian script is attested in over 4,000 extant manuscripts that were discovered in the early 20th century in the Tarim Basin. The discovery was significant for two reasons: first, it revealed a previously unknown branch of the Indo-European language family, and second, it marked the easternmost geographical extent of that language family. The most distinctive aspect of Tocharian script is the vowel transcribed as ä, commonly known as the Fremdvokal, and the modifications to the script that it entails. This vowel is indicated with a vowel sign not employed in scripts outside of the Tarim Basin region. However, this vowel sign is not commonly used when writing Tocharian. Instead, sequences of a consonant followed by the Fremdvokal are most typically indicated by 11 CV signs known as Fremdzeichen, which are unique to Tocharian. 2 Background The Tocharian script is a north-eastern descendant of Brahmi script, related to Khotanese, Tibetan, and Siddham scripts, which was used along the Tarim Basin in the Taklamakan Desert in what is now Xinjiang in western China.
    [Show full text]
  • 002.Alex.Comitato.2A Bozza
    Xaverio Ballester /A/ Y EL VOCALISMO INDOEUROPEO Negli últimi quarant’anni la linguística indo- europea si è in gran parte perduta dietro al mito delle laringali, di cui non intendo tenere alcun conto, e dello strutturalismo, di cui ten- go un conto molto limitato. Giuliano Bonfante, I dialetti indoeuropei, p. 8 De la regla a la ley o de mal en peor Bien digna de mención entre las primerísimas descripciones del mode- lo vocálico indoeuropeo es la propuesta de un inventario fonemático con únicamente tres timbres vocálicos /a i u/, una propuesta empero que fue desgraciadamente y demasiado pronto abandonada, siendo quizá la más conspicua consecuencia de este abandono el hecho de que para la Lin- güística indoeuropea oficialista la ausencia de /a/ devino en la práctica un axioma, de modo que, casi en cualquier posterior propuesta sobre el voca- lismo indoeuropeo, se ha venido adoptando la idea de que no hubiese exis- tido nunca la vocal /a/, y explicándose los ineluctables casos de presencia de /a/ en el material indoeuropeo con variados y bizarros argumentos del tipo de vocalismo despectivo, infantil o popular. Sin embargo, si conside- rada hoy spregiudicatamente, la argumentación que motivó el desalojo de la primitiva /a/ indoeuropea no presenta, al menos desde una perspectiva fonotipológica hodierna, ninguna validez en absoluto. Invocaremos un testimonio objetivo del tema para exponer brevemen- te la cuestión. Escribía O. Szemerényi: «Sotto l’impressione dell’arcai- cità del sanscrito, i fondatori dell’indoeuropeistica e i loro immediati suc- cessori pensavano che il sistema triangolare del sanscrito i–a–u rappre- sentasse la situazione originaria.
    [Show full text]
  • Indo-European Languages and Branches
    Indo-European languages and branches Language Relations One of the first hurdles anyone encounters in studying a foreign language is learning a new vocabulary. Faced with a list of words in a foreign language, we instinctively scan it to see how many of the words may be like those of our own language.We can provide a practical example by surveying a list of very common words in English and their equivalents in Dutch, Czech, and Spanish. A glance at the table suggests that some words are more similar to their English counterparts than others and that for an English speaker the easiest or at least most similar vocabulary will certainly be that of Dutch. The similarities here are so great that with the exception of the words for ‘dog’ (Dutch hond which compares easily with English ‘hound’) and ‘pig’ (where Dutch zwijn is the equivalent of English ‘swine’), there would be a nearly irresistible temptation for an English speaker to see Dutch as a bizarrely misspelled variety of English (a Dutch reader will no doubt choose to reverse the insult). When our myopic English speaker turns to the list of Czech words, he discovers to his pleasant surprise that he knows more Czech than he thought. The Czech words bratr, sestra,and syn are near hits of their English equivalents. Finally, he might be struck at how different the vocabulary of Spanish is (except for madre) although a few useful correspondences could be devised from the list, e.g. English pork and Spanish puerco. The exercise that we have just performed must have occurred millions of times in European history as people encountered their neighbours’ languages.
    [Show full text]