RESEARCH ARTICLE Towards a Formal Genealogical Classification of the Lezgian Languages (North Caucasus): Testing Various Phylogenetic Methods on Lexical Data Alexei Kassian1,2* 1 Department of Anatolian and Celtic Languages, Institute of Linguistics of the Russian Academy of Sciences, Moscow, Russia, 2 School for Advanced Studies in the Humanities, Russian Presidential Academy of National Economy and Public Administration, Moscow, Russia *
[email protected] Abstract A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, Global Lexicostatistical Database OPEN ACCESS published as part of the project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both dis- Citation: Kassian A (2015) Towards a Formal Genealogical Classification of the Lezgian Languages tance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (North Caucasus): Testing Various Phylogenetic (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov Methods on Lexical Data. PLoS ONE 10(2): chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes e0116950. doi:10.1371/journal.pone.0116950 within the input matrix were marked by two different algorithms: traditional etymological ap- Academic Editor: Maria Anisimova, Swiss Federal proach and phonetic similarity, i.e., the automatic method of consonant classes (Levensh- Institute of Technology (ETH Zurich), SWITZERLAND tein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists Received: June 10, 2014 and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian data- Accepted: December 17, 2014 base is a perfect testing area for appraisal of phylogenetic methods.