June 18, 2012 14:12 WSPC - Proceedings Trim Size: 9in x 6in master 230

UNRAVELING THE TANGLES OF LANGUAGE EVOLUTION

F. PETRONI
DIMADEFAS, Facoltà di Economia, Università di Roma “La Sapienza”,
Via del Castro Laurenziano 9, 00161 Roma, Italy
E-mail:
[email protected]

M. SERVA
Dipartimento di Matematica, Università dell’Aquila,
I-67010 L’Aquila, Italy
E-mail:
[email protected]

D. VOLCHENKOV
Cognitive Interaction Technology – Center of Excellence, Universität Bielefeld,
Postfach 10 01 31, 33501 Bielefeld, Germany
E-mail:
[email protected]

The relationships between languages, molded by extremely complex social, cultural and political factors, are assessed by an automated method in which the distance between languages is estimated by the average normalized Levenshtein distance between words from the list of 200 meanings maximally resistant to change. A sequential process of language classification, described by random walks on the matrix of lexical distances, makes it possible to represent complex relationships between languages geometrically, in terms of distances and angles. We have tested the method on a sample of 50 Indo-European and 50 Austronesian languages. The geometric representation of language taxonomy allows accurate inferences to be made about the most significant events of human history by tracing changes in language families through time. The Anatolian and Kurgan hypotheses of the Indo-European origin and the “express train” model of the Polynesian origin are thoroughly discussed.

Keywords: Language taxonomy; lexicostatistic data analysis; Indo-European and Polynesian origins.

Chaos, Complexity and Transport. Downloaded from www.worldscientific.com by WSPC on 11/27/12. For personal use only.
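The lexical distance named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that the Levenshtein distance between two words is normalized by the length of the longer word, and that the distance between two languages is the plain average over the meanings both word lists share. The function names and the three-word toy lists are hypothetical and are not taken from the 200-meaning data set used in the paper.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions
    or substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def normalized_levenshtein(a: str, b: str) -> float:
    """Levenshtein distance divided by the longer word's length
    (assumed normalization), giving a value between 0 and 1."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0


def lexical_distance(words_a: dict, words_b: dict) -> float:
    """Average normalized distance over the meanings present in both lists."""
    shared = [meaning for meaning in words_a if meaning in words_b]
    if not shared:
        raise ValueError("the two word lists share no meanings")
    total = sum(normalized_levenshtein(words_a[m], words_b[m]) for m in shared)
    return total / len(shared)


# Toy word lists keyed by meaning (illustrative only, not the paper's data).
english = {"night": "night", "water": "water", "mother": "mother"}
german = {"night": "nacht", "water": "wasser", "mother": "mutter"}

if __name__ == "__main__":
    print(round(lexical_distance(english, german), 3))
```

Applying `lexical_distance` to every pair of languages in a sample would give a symmetric matrix of lexical distances of the kind the abstract refers to. How that matrix is turned into random walks is not specified here and is left out of the sketch.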