From Wordlists to Proto-Wordlists: Reconstruction As ʻoptimal Selectionʼ

Total Page:16

File Type:pdf, Size:1020Kb

From Wordlists to Proto-Wordlists: Reconstruction As ʻoptimal Selectionʼ From wordlists to proto-wordlists: reconstruction as ʻoptimal selectionʼ George Starostin* INTRODUCTION In several of my previous publications (Starostin 2010, 2013a, 2013b), I have repeatedly stressed the importance of combining, rather than opposing the classic comparative method, elaborated by several generations of historical linguists over the past two hundred years, and lexicostatistical methodology, originally introduced by Morris Swadesh and his colleagues in the 1950s and once again popular these days due to a massive influx of computational phylogenetic methods from adjacent branches of science. It was with that precise purpose — to integrate the two approaches — that a team of Moscow-based historical linguists set up The Global Lexicostatistical Database (GLD)1, a large-scale project that aims at applying a uniform, maximally formalized lexicostatistical methodology to all the languages of the world in order to arrive at a reasonable genetic classification, while at the same time corroborating the results with knowledge gained from traditional historical linguistics and philology. One of the most important postulates of the GLD is that lexicostatistics, applied on the basis of superficial phonetic comparison of words that belong to thousands of different languages (e.g. similar to the procedure that is currently employed in S. Wichmann's ASJP project, see Holman et al. 2008), will not be of much use to describe language relationships that go beyond several thousand years. In other words, the best way to explore the issue of, say, the external relations of Indo-European on a lexicostatistical basis would be to operate with a single wordlist for Proto-Indo-European, rather than with several hundred wordlists for its daughter languages. Even more than that — to explore the far less controversial, but almost as problematic issue of the internal classification of Indo-European, it would make more sense to operate with the reconstructed proto-wordlists for Proto-Germanic, Proto-Celtic, Proto-Slavic, etc., than with mass data from modern languages, which clutter the data with large amounts of secondary accumulated ʻnoiseʼ and raise the risks of distorting the resulting classification. * Russian State University for the Humanities, Moscow; Russian Presidential Academy, Moscow: [email protected] 1 The GLD currently operates as an autonomous website, subordinate to the larger ʻTower of Babelʼ project, originally launched by Sergei Starostin as a digital hub for etymological databases (http://starling.rinet.ru). Downloaded from Brill.com09/24/2021 07:10:16AM via free access 178 George Starostin Consequently, in this paper I will discuss some important, but frequently overlooked or understated details of how one should go about trying to reconstruct what might be called an ʻoptimalʼ proto-wordlist for a protolanguage. By wordlist I typically understand a Swadesh-type wordlist, i.e. a list of basic words whose meanings are relatively well defined2 and in most cases (although there may be solitary exceptions) find precise equivalents in any given language of the world. As a rule, the GLD operates with the regular 100-item Swadesh wordlist or its truncated 50-item version (see Starostin 2013a for details on the validity and usage of the shortened version), but the rules and recommendations presented below would work equally well for any other version, as long as the elements on the wordlist are assigned discrete meanings3. 1. PRELIMINARY REMARKS Although from the point of view of phylogenetic thinking, step-by-step reconstruction of proto-wordlists would seem to be a logical move, such a procedure is not commonly encountered in historical linguistics. Reconstructing a proto-wordlist is not at all the same thing as reconstructing morphemes, a far more natural occupation for etymologists all over the world. Above all, it presumes that the reconstructed morpheme (or word) (1) will be assigned a concrete, rather than vague, semantic definition, coinciding with the required meaning of the corresponding element on the wordlist (e.g. ʻseeʼ rather than ʻlook, see, observe, examine, thinkʼ; ʻliverʼ rather than ʻk. of internal organ or intestineʼ; ʻredʼ rather than ʻblood, red, brightly colouredʼ, etc.); (2) will have a better chance of representing this particular meaning in its basic (most stylistically neutral or frequent) usage on the proto-level than any other reconstructible morpheme (or word). We should stress that within the context of a lexicostatistical evaluation, the main goal of reconstructing a proto-wordlist is not a thorough and conclusive study of phonetic correspondences, including rare and non-trivial ones (although such a study is always useful). First and foremost, such a reconstruction helps define the optimal probable set of etyma4 that could have served as the main 2 It is, of course, well-known that many of the elements on the original Swadesh wordlist were not all that precisely defined in Swadesh's original works, which gave researchers a solid pretext to replace any of the elements that they deemed uncomfortable for their own purposes. An attempt to eliminate many of these problems by associating various Swadesh items with diagnostic syntactic contexts and semantic commentary has been published as Kassian et al. 2010 and is currently used as a set of guidelines in the construction of the GLD. 3 This paper essentially represents a somewhat reworked and improved English translation of one single subsection in the ʻMethodologyʼ chapter of the author's recent Russian language monograph on the lexicostatistical classification of the languages of Africa (Starostin 2013), with the addition of several important details and a special illustrative appendix. 4 We typically define an etymon as a correlated pair <sound (X) : meaning (Y)> that may be reconstructed for a proto-language based on regular reflexes in descendant languages. Downloaded from Brill.com09/24/2021 07:10:16AM via free access From wordlists to proto-wordlists 179 carriers for meanings constituting the wordlist on the proto-level. It is sufficient for our starred forms to be phonetically compatible, i.e. cognacy decisions should be made based on transparent argumentation and not contradict any historical or typological evidence. This circumstance is important inasmuch as the application of this methodology to language units all over the world will inevitably run into situations where there is simply not enough linguistic data to allow for a thorough clarification of all phonetic correspondences between poorly described, extinct, or too distantly related languages. Much more significant is to properly account for the distribution of various roots with identical basic semantics across languages belonging to the investigated taxon. Just as it is very important to minimize (preferably, completely eliminate) synonymy while collecting material for Swadesh-type wordlists of attested languages, reconstruction of the proto-wordlist also demands minimization of available candidates: operating on the principle of uniformitarianism, we have no theoretical reason to assume that lexical synonymy on the proto-level, particularly when it comes to strictly defined basic lexicon items, should have been more rampant in reconstructed proto-languages than it is in their modern descendants5. To the best of our knowledge, so far there have not been any detailed formalized attempts at describing any manual algorithms for the selection of optimal proto-language etyma in comparative-historical linguistics; this can be attributed, on one hand, to a generally skeptical attitude towards lexicostatistics (for which this procedure is of particular importance), and on the other, to the traditional unpopularity of rigorous semantic reconstruction among historical linguists. One of the most important exceptions from this overall tendency is a (relatively) recent study by Leonid Kogan (2006), where such an attempt has been undertaken on the basis of Semitic languages; nevertheless, the procedure suggested by Kogan clearly does not exhaust the entire potential of the comparative-historical method and is therefore in need of a series of additions, a brief list of which was suggested in Starostin 2010. In order to make the description of the procedure as transparent and demonstrative as possible, let us define it on a concrete object — a so-called The term should be technically distinguished from root or stem, since the latter refer primarily to the phonetic shape of the entity, leaving its semantic content relatively vague and ambiguous; that said, it is natural that in many particular contexts all these terms may be partially synonymous and interchangeable. 5 Our views on the issue of synonymy (particularly in the sphere of basic lexicon, where it is easier to operate with discrete, rather than vague and continuous meanings, than in the sphere of cultural lexicon) have been originally described in Starostin 2010. Later, in Starostin 2013b: 109-116, it was even more rigidly stated that there are almost no cases of total synonymy (i.e. complete interchangeability of two different words with the exact same meaning in any given context, register, or sociolinguistic situation) that could be reliably attested in any of the world's languages, and that most cases where such a synonymy could be suspected are really cases of technical synonymy (where we simply do not have sufficient data to understand the proper differentiation) or transit synonymy (where we deal with a temporary situation in
Recommended publications
  • Recent Developments in Spanish (And Romance) Historical Semantics
    Recent Developments in Spanish (and Romance) Historical Semantics Steven N. Dworkin University of Michigan Diachronic semantics has long been the stepchild of Spanish (and Romance) historical linguistics. Although many studies have examined (often in searching detail) the semantic evolution of individual lexical items, Hispanists have ignored broader patterns of semantic change and the relevant theoretical and methodological issues posed by this phenomenon. Working within the framework of cognitive semantics, an approach which perforce requires a comparative perspective, a team of Romanists at the University of Tübingen headed by Peter Koch has offered over the last ten years new insights into questions of diachronic Spanish (and Romance) semantics, with emphasis on the causes of semantic change (see the essays in Blank & Koch 1999, 2003, Mihatsch & Steinberg 2004). I wish to survey here some of this recent German work in diachronic Spanish (and Romance) semantics and to discuss the insights provided by cognitive semantics into the nature of semantic change, especially with regard to cross-linguistic instances of metaphorical and metonymic changes in certain semantic categories (e.g., the designation of body parts, spatial and temporal adjectives and adverbials). Cognitive semantics has been concerned with the polysemous nature of lexical items and the cognitive principles that motivate the relations between their different senses. These same mechanisms can also account for diachronic changes in these relationships and in the internal relationships of lexical categories. Diachronic cognitive semantics focuses particularly on universal causes of semantic change brought about by mechanisms related to human cognition and perception. Although workers in diachronic cognitive semantics have also paid considerable attention to the semantic processes involved in grammaticalization (cf.
    [Show full text]
  • Eric Smith, Santa Fe Institute
    Dating and Relationships (or...) Computational historical linguistics and long-range reconstruction Eric Smith, Santa Fe Institute Part of the Evolution of Human Languages project at SFI <http://ehl.santafe.edu/> in collaboration with the Tower of Babel project <http://starling.rinet.ru/main.html> Thanks to Murray Gell-Mann, George Starostin, Ilia Peiros, in memory of Sergei Starostin Work done jointly with Tanmoy Bhattacharya, Jon Wilkins, Dan Hruschka, William Croft, Ian Maddieson, Logan Sutton, and Mark Pagel Outline • Goals of historical linguistics • The classical comparative method • Attempts at deep reconstruction • New observations change the landscape • Our attempts at quantitative reconstruction Goals of historical linguistics • To understand how languages change, and how they have changed historically • To identify relations among languages due to common ancestry or cultural contact • To reconstruct the languages of past speakers • To contribute to an understanding of human populations and migrations • To understand what is possible in language as a window on cognitive constraints The interaction of history with process • History-dependent phenomena combine lawful dynamics with historical accident • Accidents make branching processes -- h0ROTO 4URKICv help us infer diachronic relations from synchronic variability • Diachronic relations assign the v correct weights to processes h# h HULYM 3 h7ESTERN ENISEI A 9 Y which act probabilistically h AN v 4URKICv v v UT v LTAI /LD AK 9 ! h h 4URKIC 4 - 1YZ 3 4 8 3 . 8 9 + # $ 3 OFA UV AR AK AK YR ! HOR ALAJ HUV IDDLE# OLGAN ! OTHERCLOSELANGUAGES Y 5 G LTAI/IR A UT AS YL8 LTAI YZ ASH IGHUR AK HULYM AS OT The classical comparative method of historical linguistics: to interpret innovations • A hypothesis of relationship among a set of languages.
    [Show full text]
  • 16 Semantic Change
    Semantic Change Semantic change can thus occur because the relation between signifidttt and Semantic Change signifii is arbitrary. Meaning is at the core of language by its very nature —no language would he 16 possible if its linguistic units did not have a meaning. En spite of its central role, Eugenio R. Lujin the study of meaning has been somewhat neglected in some approaches to Ian- gunge. It is sometimes considered the less linguistic port of language, in the sense that meaning has a direct connection to natural and sneial realities, which is not the case with other areas of language —we do not expect that the stud}' of the social structures of the speakers of a language will cast any light on the understanding of the phonology of their language nor that there is a correlation Chapter Overview between ergathdty or accusativity and hunter-gatherer societies in opposition to agricultural societies, but we do expect a difference of vocabulary and the 286 1. Introduction organization of the meaning of words between societies with a different level of 288 2. Types of Semantic Change technological development. 3. Semantic Change beyond the Word 296 From a historical perspective, this means that a change in thesociocultural or 299 environmental conditions of the speakers of a language may have an impact on 4. Causes of Semantic Change 5. Towards a Deeper Understanding of Semantic Change 304 this area of language—new words may be coined or borrowed or new mean ings of words may arise; compare Spanish raton 'mouse' (both animal and com Notes puter device, as a caique from English mouse; see section 4,3) vs.
    [Show full text]
  • GRAMMAR of SOLRESOL Or the Universal Language of François SUDRE
    GRAMMAR OF SOLRESOL or the Universal Language of François SUDRE by BOLESLAS GAJEWSKI, Professor [M. Vincent GAJEWSKI, professor, d. Paris in 1881, is the father of the author of this Grammar. He was for thirty years the president of the Central committee for the study and advancement of Solresol, a committee founded in Paris in 1869 by Madame SUDRE, widow of the Inventor.] [This edition from taken from: Copyright © 1997, Stephen L. Rice, Last update: Nov. 19, 1997 URL: http://www2.polarnet.com/~srice/solresol/sorsoeng.htm Edits in [brackets], as well as chapter headings and formatting by Doug Bigham, 2005, for LIN 312.] I. Introduction II. General concepts of solresol III. Words of one [and two] syllable[s] IV. Suppression of synonyms V. Reversed meanings VI. Important note VII. Word groups VIII. Classification of ideas: 1º simple notes IX. Classification of ideas: 2º repeated notes X. Genders XI. Numbers XII. Parts of speech XIII. Number of words XIV. Separation of homonyms XV. Verbs XVI. Subjunctive XVII. Passive verbs XVIII. Reflexive verbs XIX. Impersonal verbs XX. Interrogation and negation XXI. Syntax XXII. Fasi, sifa XXIII. Partitive XXIV. Different kinds of writing XXV. Different ways of communicating XXVI. Brief extract from the dictionary I. Introduction In all the business of life, people must understand one another. But how is it possible to understand foreigners, when there are around three thousand different languages spoken on earth? For everyone's sake, to facilitate travel and international relations, and to promote the progress of beneficial science, a language is needed that is easy, shared by all peoples, and capable of serving as a means of interpretation in all countries.
    [Show full text]
  • North Caucasian Languages
    The Turkish Online Journal of Design, Art and Communication - TOJDAC April 2017 Special Edition COMPARISON OF VARIOUS QUANTITATIVE MEASURES OF PROXIMITY OF LANGUAGES: NORTH CAUCASIAN LANGUAGES Galeev Timur Ildarovich Kazan federal university (KFU), Kazan, Russia [email protected] Solovyev Valery Dmitrievich Kazan federal university (KFU), Kazan, Russia ABSTRACT A comparison of North Caucasian languages is performed in the article according to various measures of proximity constructed using grammatical, lexical and genetic databases. Statistical methods are applied to the study of correlations among these proximity measures, and also between them and both geographical proximity and genealogical kinship. A full correlation has been found among language kinship, geographic situation and genetic kinship of peoples. Also, a high correlation was found between each of them and lexical similarity. In general these correlations persist at different levels – starting at the whole set of studied languages until the level of the smallest groups of related languages. It is shown that a significant factor in the analysis of geographical situation is the existence of a common boundary between the regions of distribution of languages. Keywords: Various Quantitative Measures, geographical situation common boundary, Creativity, languages 1. INTRODUCTION The classification of languages by genetic kinship, developed in the last two centuries within the framework of historical linguistics applying the comparative historical method, offers a qualitative characteristic of language proximity by including them into macrofamilies, families, branches, groups, etc. Glottochronology provides a quantitative measure of proximity that, in particular, allows assessing the age of families and other language groups. Unfortunately, in many cases there is no consensus among experts about languages kinship; it must be said also that lexicostatistical data are controversial.
    [Show full text]
  • Lectures on English Lexicology
    МИНИСТЕРСТВО ОБРАЗОВАНИЯ И НАУКИ РОССИЙСКОЙ ФЕДЕРАЦИИ ГОУ ВПО «Татарский государственный гуманитарно-педагогический университет» LECTURES ON ENGLISH LEXICOLOGY Курс лекций по лексикологии английского языка Казань 2010 МИНИСТЕРСТВО ОБРАЗОВАНИЯ И НАУКИ РОССИЙСКОЙ ФЕДЕРАЦИИ ГОУ ВПО «Татарский государственный гуманитарно-педагогический университет» LECTURES ON ENGLISH LEXICOLOGY Курс лекций по лексикологии английского языка для студентов факультетов иностранных языков Казань 2010 ББК УДК Л Печатается по решению Методического совета факультета иностранных языков Татарского государственного гуманитарно-педагогического университета в качестве учебного пособия Л Lectures on English Lexicology. Курс лекций по лексикологии английского языка. Учебное пособие для студентов иностранных языков. – Казань: ТГГПУ, 2010 - 92 с. Составитель: к.филол.н., доцент Давлетбаева Д.Н. Научный редактор: д.филол.н., профессор Садыкова А.Г. Рецензенты: д.филол.н., профессор Арсентьева Е.Ф. (КГУ) к.филол.н., доцент Мухаметдинова Р.Г. (ТГГПУ) © Давлетбаева Д.Н. © Татарский государственный гуманитарно-педагогический университет INTRODUCTION The book is intended for English language students at Pedagogical Universities taking the course of English lexicology and fully meets the requirements of the programme in the subject. It may also be of interest to all readers, whose command of English is sufficient to enable them to read texts of average difficulty and who would like to gain some information about the vocabulary resources of Modern English (for example, about synonyms
    [Show full text]
  • Semantic Reanalysis and Language Change Regine Eckardt* University of Go¨Ttingen
    Erschienen in: Language and Linguistics Compass ; 5 (2011), 1. - S. 33-46 https://dx.doi.org/10.1111/j.1749-818x.2010.00260.x Semantic Reanalysis and Language Change Regine Eckardt* University of Go¨ttingen Abstract Reanalysis is a well-known process of language change in morpho-syntax. However, the semantic composition of sentence meanings can also undergo reanalysis and lead to meaning changes for parts of the sentence. The article provides the basic notions of compositional semantics ⁄ pragmatics that underlie semantic reanalysis, surveys possible constellations and causes of reanalysis, and con- trasts the process to other ways of semantic change. I will, moreover, illustrate semantic reanalysis on basis of a case study which highlights some of its typical features. 1. Meaning Change and Grammaticalization Many people enjoy the fascinating task of tracing the history of words and expressions. Meaning changes often allow insights in cultural history, and major processes of change can be observed both in history and in daily life. For instance, speakers can decide to restrict the use of a word to what was formerly just the most frequent referents, like in Old English mete (‘food’) which turned into modern meat (food like in ‘steak’). This is called semantic narrowing, and we will use the example to introduce a few convenient notations. I will use [[a]] to refer to the meaning of a word a, e.g. [[dog]] stands for the meaning of the word dog. If the meaning of a word changes, we will use indices to indicate the two different meanings. Our instance of narrowing looks as follows: [[mete]]old was the concept ‘food’ or ‘meal’ in general, whereas [[mete]]new is the narrower concept ‘meat, edible muscle matter of animals’.
    [Show full text]
  • Comparative-Historical Linguistics and Lexicostatistics
    COMPARATIVE-HISTORICAL LINGUISTICS AND LEXICOSTATISTICS Sergei Starostin COMPARATIVE-HISTORICAL LINGUISTICS AND LEXICOSTATISTICS [This is a translation, done by I. Peiros and N. Evans, of my paper "Sravnitel'no-istoričeskoe jazykoznanie i leksikostatistika", in "Lingvističeskaja rekonstrukcija i drevnejšaja istorija Vostoka", Moscow 1989. I have introduced, however, a number of modifications into the final English text — basically rewritten it again, since the English version needs English examples and etymologies, not Russian ones.] The last two decades have witnessed a fundamental advance in the techniques of comparative linguistic research. A prolonged period of comparative work with a wide range of language families has laid the foundation for the study of genetic relationships between remotely related languages or language groups. The first step in this direction was taken by V.M. Illich-Svitych in his seminal work 'Towards a comparison of the Nostratic languages' in which, with a combination of rigorous methods and intuitive flare, he begins to demonstrate the relatedness of a number of languages of the Old World. This new level of comparative studies appears completely legitimate. In fact, if we take the theory of language divergence as axiomatic, we have to concede the fact that from around the sixth millenium B.C. to the first millenium B.C. there was quite a number of different reconstructable proto-languages throughout the world. Once the level of reconstruction of various proto-languages is improved, the question inevitably arises: are any of these proto-languages genetically related and, if so, can we prove this relationship? To the first part of this question we must now answer in the affirmative.
    [Show full text]
  • Current Progress in Altaic Etymology
    Blažek : Current Progress in Altaic Etymology Linguistica ONLINE. Added: January, 30th 2006. tjurkskix jazykov: obščetjurkskie i mežtjurkskie leksičeskie osnovy na bukvy "k", "q", http://www.phil.muni.cz/linguistica/art/blazek/bla-004.pdf Moskva: Jazyki russkoj kuľtury 1997). In his publications O. Mudrak especially ISSN 1801-5336 concentrates on two topics, historical phonology of Chuvash (1987, 1989, 1993, 1994) and reconstruction of Jurchen (1985, 1988). I. Gruntov has published an article on the historical phonology of Japanese in the Altaic context (2000). [*] The first version of the present review was finished in May 2005. This new version Current Progress in Altaic Etymology originates to up-to-date the sources on the one hand. The second reason consists in the Václav Blažek unexpected death of the head of the author’s team, Sergei Starostin, on 30th September 2005. Sergei Starostin, Anna Dybo, Oleg Mudrak, with assistance of Ilya Gruntov and Vladi- The Etymological Dictionary of the Altaic Languages (= EDAL) is open by Preface mir Glumov: Etymological Dictionary of the Altaic Languages, Part One [A-K], Part (pp. 7-9) where the purposes of the present dictionary and a short history of Altaic Two [L-Z], Part Three [Indices], Leiden-Boston: Brill 2003, published in the prestigeous studies are described. The Introduction (pp. 11-236) starts with the Chapter One devoted series Handbook of Oriental Studies / Handbuch der Orientalistik 8/1-3, edited by Denis to The problem of Interlingual borrowings in Altaic languages (pp. 13-21). Here it is Sinor & Nicola di Cosmo [ISSN 0169-8524]. These three volumes are of a respectable demonstrated, how the phonetic criteria may serve to distinguish the borrowings from size: pp 1-858, 859-1556, 1557-2096 respectively.
    [Show full text]
  • Time Depth in Historical Linguistics”, Edited by Colin Renfrew, April
    Time Depth 1 Review of “Time Depth in Historical Linguistics”, edited by Colin Renfrew, April McMahon, and Larry Trask Brett Kessler Washington University in St. Louis Brett Kessler Psychology Department Washington University in St. Louis Campus Box 1125 One Brookings Drive St. Louis MO 63130-4899 USA Email: [email protected] FAX: 1-314-935-7588 Time Depth 2 Review of “Time Depth in Historical Linguistics”, edited by Colin Renfrew, April McMahon, and Larry Trask Time depth in historical linguistics. Ed. by Colin Renfrew, April McMahon, and Larry Trask. (Papers in the prehistory of languages.) Cambridge, England: McDonald Institute for Archaeological Research, 2000. Distributed by Oxbow Books. 2 vol. (xiv, 681 p.) paperback, 50 GBP. http://www.mcdonald.cam.ac.uk/Publications/Time-depth.htm This is a collection of 27 papers, mostly presentations at a symposium held at the McDonald Institute in 1999. Contributions focus on two related issues: methods for establishing absolute chronology, and linguistic knowledge about the remote past. Most papers are restatements of the authors’ well-known theories, but many contain innovations, and some do describe new work. The ideological balance of the collection feels just left of center. We do not find here wild multilateral phantasms, reconstructions of Proto-World vocabulary, or idylls about pre-Indo-European matriarchal society. Or not much. These are mostly sober academics pushing the envelope in attempts to reason under extreme uncertainty. One of the recurrent themes was that the development of agriculture may drive the expansion of language families and therefore imply a date for the protolanguage. Colin Renfrew describes his idea that that is what happened in the case of Indo-European (IE): PIE was introduced into Europe at an early date, perhaps 8,000 BC.
    [Show full text]
  • A Pipeline for Computational Historical Linguistics
    A Pipeline for Computational Historical Linguistics Lydia Steiner Peter F. Stadler Bioinformatics Group, Interdisciplinary Bioinformatics Group, University of Center for Bioinformatics, University of Leipzig, Interdisciplinary Center for Leipzig ∗ Bioinformatics, University of Leipzig∗; Max-Planck-Institute for Mathematics in the Science; Fraunhofer Institute for Cell Therapy and Immunology; Center for non-coding RNA in Technology and Health, University of Copenhagen; Institute for Theoretical Chemistry, University of Vienna; Santa Fe Institute Michael Cysouw Research unit Quantitative Language Comparison, LMU München ∗∗ There are many parallels between historical linguistics and molecular phylogenetics. In this paper we describe an algorithmic pipeline that mimics, as closely as possible, the traditional workflow of language reconstruction known as the comparative method. The pipeline consists of suitably modified algorithms based on recent research in bioinformatics, that are adapted to the specifics of linguistic data. This approach can alleviate much of the laborious research needed to establish proof of historical relationships between languages. Equally important to our proposal is that each step in the workflow of the comparative method is implemented independently, so language specialists have the possibility to scrutinize intermediate results. We have used our pipeline to investigate two groups of languages, the Tsezic languages from the Caucasus and the Mataco-Guaicuruan languages from South America, based on the lexical data from the Intercontinental Dictionary Series (IDS). The results of these tests show that the current approach is a viable and useful extension to historical linguistic research. 1. Introduction Molecular phylogenetics and historical linguistics are both concerned with the recon- struction of evolutionary histories, the former of biological organisms, the latter of human languages.
    [Show full text]
  • "Evolution of Human Languages": Current State of Affairs
    «Evolution of Human Languages»: current state of affairs (03.2014) Contents: I. Currently active members of the project . 2 II. Linguistic experts associated with the project . 4 III. General description of EHL's goals and major lines of research . 6 IV. Up-to-date results / achievements of EHL research . 9 V. A concise list of actual problems and tasks for future resolution. 18 VI. EHL resources and links . 20 2 I. Currently active members of the project. Primary affiliation: Senior researcher, Center for Comparative Studies, Russian State University for the Humanities (Moscow). Web info: http://ivka.rsuh.ru/article.html?id=80197 George Publications: http://rggu.academia.edu/GeorgeStarostin Starostin Research interests: Methodology of historical linguistics; long- vs. short-range linguistic comparison; history and classification of African languages; history of the Chinese language; comparative and historical linguistics of various language families (Indo-European, Altaic, Yeniseian, Dravidian, etc.). Primary affiliation: Visiting researcher, Santa Fe Institute. Formerly, professor of linguistics at the University of Melbourne. Ilia Publications: http://orlabs.oclc.org/identities/lccn-n97-4759 Research interests: Genetic and areal language relationships in Southeast Asia; Peiros history and classification of Sino-Tibetan, Austronesian, Austroasiatic languages; macro- and micro-families of the Americas; methodology of historical linguistics. Primary affiliation: Senior researcher, Institute of Slavic Studies, Russian Academy of Sciences (Moscow / Novosibirsk). Web info / publications list (in Russian): Sergei http://www.inslav.ru/index.php?option- Nikolayev =com_content&view=article&id=358:2010-06-09-18-14-01 Research interests: Comparative Indo-European and Slavic studies; internal and external genetic relations of North Caucasian languages; internal and external genetic relations of North American languages (Na-Dene; Algic; Mosan).
    [Show full text]