Basics for the Construction of Phylogenetic Trees

Total Page:16

File Type:pdf, Size:1020Kb

Basics for the Construction of Phylogenetic Trees Article ID: WMC002563 ISSN 2046-1690 Basics for the Construction of Phylogenetic Trees Corresponding Author: Mr. B P Niranjan Reddy, Senior Research Fellow, School of Sciences in Biotechnology, Jiwaji University - India Submitting Author: Mr. B.P.Niranjan Reddy, Senior Research Fellow, School of Sciences in Biotechnology, Jiwaji University - India Article ID: WMC002563 Article Type: Review articles Submitted on:03-Dec-2011, 11:47:15 AM GMT Published on: 03-Dec-2011, 08:06:40 PM GMT Article URL: http://www.webmedcentral.com/article_view/2563 Subject Categories:BIOLOGY Keywords:Phylogenetic tree, Model selection, Bootstrapping, Phylogeny free software How to cite the article:Niranjan Reddy B P. Basics for the Construction of Phylogenetic Trees . WebmedCentral BIOLOGY 2011;2(12):WMC002563 Source(s) of Funding: None Competing Interests: None Additional Files: Links to some very useful web pages for phylogenet WebmedCentral > Review articles Page 1 of 11 WMC002563 Downloaded from http://www.webmedcentral.com on 05-Dec-2011, 05:19:58 AM Basics for the Construction of Phylogenetic Trees Author(s): Niranjan Reddy B P Abstract the genes into various classes like orthologs, paralogs, in- or out-paralogs, to understand the evolution of the new functions through duplications, horizontal gene transfers, gene conversion, recombination, and Phylogeny- A Diagram for Evolutionary Network-is co-evolution etc. (Hafner and Nadler, 1988; Nei, 2003; used to infer the phylogenetic relationships among the Pagel, 2000). Phylogenetic analysis provides a species or genes. The phylogenetic analysis including powerful tool for comparative genomics (Pagel, 2000). morphological, biological, and bionomic characters, Genome sequencing projects are providing valuable allozyme, RFLP data have been extensively used to sequence information that is widely used to infer the infer the evolutionary relationship among the species evolutionary relationship between different species or during the pre-genomic era. With the advent of high genes. The species' phylogenies are generally inferred throughput sequencing technologies and the based on the paleontological/geological information or development of extensive statistical analytical tools, an morphological traits (Nei, 2003). These phylogenies increased amount of sequence information is made act as a reference to assess the veracity of the available in the public domains. This particular phylogenetic tree constructed based on any situation has revolutionarized the field of phylogenetics, phylogenetic informative marker. With the increased and has opened up opportunities for drawing and availability of whole genome sequences, the field of reconstructing the phylogenetic relationships with phylogenomics (i.e. use of either whole genome or a more confidence and accuracy. Consequently, today, large number of genes for phylogenetics analysis) is phylogenetics has become an integral part of any becoming popular among the evolutionary biologists sequencing associated research projects. Although, (Fitz Gibbon and House, 1999; Korbel et al., 2002; many publications related to the understanding of the Snel et al., 1999; Thornton and DeSalle, 2000). Many phylogenetic tree are available, most of them are phylogenomics based reports have been published, either for the experts in the field or for and most of them are true reflective of reference bioinformaticians. It is essentially needed for the species' phylogenies that are inferred from beginner to start from a document that includes all the paleontological and/or geological information (Kumar basics together with briefings of the modern and Filipski, 2001). Furthermore, phylogenomics developments in phylogenetics. Considering the reconstruction helps in supplementing or correcting importance of phylogenetic analysis in modern science, the earlier working phylogenetic relationships (Kumar here in this review, an attempt was made to simplify and Filipski, 2001). Phylogenetic trees can be drawn the understanding of the phylogenetic tree from genes (nucleotide or protein sequences), construction, availability and usability of the different morphological, biological and bionomic characters, methods and software tools for inferring the trees. restriction fragment polymorphisms, or whole genome Introduction orthologs, or geological records (Horner and Pesole, 2004; Klenk and Göker, 2010; Nikaido et al., 2001; Snel et al., 1999). Although it is very easy to construct The field of phylogenetics has become an integral part the phylogenetic trees using the user-friendly software of any modern biological research. Construction of tools, often it is observed that having basic information phylogenetic tree becoming such an easy task that regarding the processes that undergo behind the novice can also construct relatively near to perfect scenes will greatly helps in improving the quality of the phylogenetic tree with little hard work. This is majorly phylogenetic tree construction by giving better input due to free availability of many tree construction, values into the programs. Thus, in this review article, viewing and editing tools that demand very little our writing centered in basic concepts of construction knowledge regarding the phylogenetic construction of phylogenetic analysis using nucleotide or amino procedures (i. e., it is not mandatory to know the acid sequences. basics of the models and algorithm procedures which Basics concepts and definition involves in behind the scenes). Phylogenetic analysis can be performed to infer the evolutionary relationship among the members of the taxa, to understand the Phylogenetic tree also known as “evolutionary tree” is evolution of the genomes and gene families, to classify WebmedCentral > Review articles Page 2 of 11 WMC002563 Downloaded from http://www.webmedcentral.com on 05-Dec-2011, 05:19:58 AM the graphical representation of the evolutionary A rooted tree represents the divergence of a group of relationship between the taxa/genes in question. A related species from their last common ancestor (root) dendrogram is a broad term for the diagrammatic by successive branching events over the time period. representation of a phylogenetic tree. Different In contrary the unrooted phylogenetic tree reveal inter terminologies are used to describe the characteristics species/taxa relationships excluding the identification of a phylogenetic tree. The cladogram is a dendogram of most recent common ancestor or the root. The which explains only genealogy of the taxa but says rooted phylogenies are constructed using unrelated nothing about the branch lengths or time periods of species/genes involving the phylogenetic divergence (Page and Holmes, 1998; Procter et al., reconstruction. Very distantly related taxa or relatively 2010). The phylogram (additive tree) is a phylogenetic related taxa are considered for tree rooting called tree that explicitly represents a number of character out-group and in-group, respectively. The terminal changes (nucleotide/amino acid changes/number of nodes in the phylogenetic tree are called as character variations) through its branch lengths (Page operational taxonomic units (OTU). The branches that and Holmes, 1998; Procter et al., 2010). In case of do not join any of the terminal/leaves/OTUs (fig. 1) phylogram the evolutionary distance between any two directly but via internal nodes are called “ancestral taxa is given by sum of the branch lengths connected states” or “hypothetical taxonomic units” that might them. Though these trees may be rooted or unrooted, have appeared during evolution and cannot be seen at often these trees lack a root. A chronogram present (Page and Holmes, 1998; Pagel, 2000). The (ultrametric) is a rooted phylogenetic tree that posses internal branch points in a species phylogenetic tree all the characteristics of an additive tree, in addition represents the speciation events, while gene families' with the assumption of molecular clock determination phylogenetic tree, they mean for duplication events of the molecular divergence time between taxa can be (Pagel, 2000). The internal branches may be possible (Page and Holmes, 1998). The molecular bifurcating or multi-furcating. Analysis of the gene clock hypothesis assumes that every site in a protein families generally forms multi-furcating branches and or coding nucleotide sequence from all the species each of the small multi-furcating branches forms a sub evolve at a constant rate (Zuckerkandl and Pauling, tree or a clade (Kao et al., 1999; Nei et al., 1997; Nei 1962). Furthermore, the chronogram consists of taxa and Rooney, 2005). placed equidistant from the ancestor which cannot be The whole process of construction of the phylogenetic seen in case of phylogram. Phenetics (taximetrics) tree is divided into five different steps, viz. infers the relationship between the taxa that usually Step 1: Choosing an appropriate markers for the involves morphology or other observable traits as phylogenetic analysis phylogenetic informative markers (Duncan and Baum, Step 2: Multiple sequence alignments 1981; Mayr, 1965; Page and Holmes, 1998). Step 3: Selection of an evolutionary model A tree that shows the evolution of the genes is known Step 4: Phylogenetic reconstruction as gene tree (Snel et al., 1999). While, tree that shows Step 5: Evaluation of the phylogenetic tree the evolution of species is known as species' tree. It is Step 1: Choosing an appropriate markers for the important to note that gene trees are not necessary to phylogenetic analysis follow
Recommended publications
  • Investgating Determinants of Phylogeneic Accuracy
    IMPACT OF MOLECULAR EVOLUTIONARY FOOTPRINTS ON PHYLOGENETIC ACCURACY – A SIMULATION STUDY Dissertation Submitted to The College of Arts and Sciences of the UNIVERSITY OF DAYTON In Partial Fulfillment of the Requirements for The Degree Doctor of Philosophy in Biology by Bhakti Dwivedi UNIVERSITY OF DAYTON August, 2009 i APPROVED BY: _________________________ Gadagkar, R. Sudhindra Ph.D. Major Advisor _________________________ Robinson, Jayne Ph.D. Committee Member Chair Department of Biology _________________________ Nielsen, R. Mark Ph.D. Committee Member _________________________ Rowe, J. John Ph.D. Committee Member _________________________ Goldman, Dan Ph.D. Committee Member ii ABSTRACT IMPACT OF MOLECULAR EVOLUTIONARY FOOTPRINTS ON PHYLOGENETIC ACCURACY – A SIMULATION STUDY Dwivedi Bhakti University of Dayton Advisor: Dr. Sudhindra R. Gadagkar An accurately inferred phylogeny is important to the study of molecular evolution. Factors impacting the accuracy of a phylogenetic tree can be traced to several consecutive steps leading to the inference of the phylogeny. In this simulation-based study our focus is on the impact of the certain evolutionary features of the nucleotide sequences themselves in the alignment rather than any source of error during the process of sequence alignment or due to the choice of the method of phylogenetic inference. Nucleotide sequences can be characterized by summary statistics such as sequence length and base composition. When two or more such sequences need to be compared to each other (as in an alignment prior to phylogenetic analysis) additional evolutionary features come into play, such as the overall rate of nucleotide substitution, the ratio of two specific instantaneous, rates of substitution (rate at which transitions and transversions occur), and the shape parameter, of the gamma distribution (that quantifies the extent of iii heterogeneity in substitution rate among sites in an alignment).
    [Show full text]
  • Phylogeny Inference Based on Parsimony and Other Methods Using Paup*
    8 Phylogeny inference based on parsimony and other methods using Paup* THEORY David L. Swofford and Jack Sullivan 8.1 Introduction Methods for inferring evolutionary trees can be divided into two broad categories: thosethatoperateonamatrixofdiscretecharactersthatassignsoneormore attributes or character states to each taxon (i.e. sequence or gene-family member); and those that operate on a matrix of pairwise distances between taxa, with each distance representing an estimate of the amount of divergence between two taxa since they last shared a common ancestor (see Chapter 1). The most commonly employed discrete-character methods used in molecular phylogenetics are parsi- mony and maximum likelihood methods. For molecular data, the character-state matrix is typically an aligned set of DNA or protein sequences, in which the states are the nucleotides A, C, G, and T (i.e. DNA sequences) or symbols representing the 20 common amino acids (i.e. protein sequences); however, other forms of discrete data such as restriction-site presence/absence and gene-order information also may be used. Parsimony, maximum likelihood, and some distance methods are examples of a broader class of phylogenetic methods that rely on the use of optimality criteria. Methods in this class all operate by explicitly defining an objective function that returns a score for any input tree topology. This tree score thus allows any two or more trees to be ranked according to the chosen optimality criterion. Ordinarily, phylogenetic inference under criterion-based methods couples the selection of The Phylogenetic Handbook: a Practical Approach to Phylogenetic Analysis and Hypothesis Testing, Philippe Lemey, Marco Salemi, and Anne-Mieke Vandamme (eds.).
    [Show full text]
  • Species Tree Likelihood Computation Given SNP Data Using Ancestral Configurations
    Species Tree Likelihood Computation Given SNP Data Using Ancestral Configurations DISSERTATION Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Hang Fan, M.S. Graduate Program in Statistics The Ohio State University 2013 Dissertation Committee: Professor Laura Kubatko, Advisor Professor Bryan Carstens Professor Radu Herbei 1 Copyright by Hang Fan 2013 2 Abstract Inferring species trees given genetic data has been a challenge in the field of phylogenetics because of the high intensity during computation. In the coalescent framework, this dissertation proposes an innovative method of estimating the likelihood of a species tree directly from Single Nucleotide Polymorphism (SNP) data with a certain nucleotide substitution model. This method uses the idea of Ancestral Configurations (Wu, 2011) to avoid the computation burden brought by the enumeration of coalescent histories. Importance sampling is used to in Monte Carlo integration to approximate the expectations in the computation, where the accuracy of the approximation is tested in different tree models. The SNP data is processed beforehand which vastly boosts the efficiency of the method. Gene tree sampling given the species tree under the coalescent model is employed to make the computation feasible for large trees. Further, the branch lengths on the species tree are optimized according to the computed species tree likelihood, which provides the likelihood of the species tree topology given the SNP data. For inference, this likelihood computation method is implemented in the stepwise addition algorithm to infer the maximum likelihood species tree in the tree space given the SNP data, and simulations are conduced to test the performance.
    [Show full text]
  • Phylogeny Diversity: a Phylogenetic Framework Phenetics
    Speciation is a Phylogeny process by which lineages split. -the evolutionary relationships among We should be able organisms to the reconstruct the history of these (millions of Time years) -the patterns of lineage branching produced splits (i.e., build an by the true evolutionary history evolutionary tree) Linnaean System Diversity: of Classification A Phylogenetic Framework I. Taxonomy II. Phenetics III. Cladistics Carl von Linné a.k.a. Linnaeus (1707-1778) Phenetics Taxonomy does not always reflect evolutionary history. • Grouping is based on phenotypic similarity • Does not distinguish the cause of similarities • Not widely used today because many Example: Example: Example: similarities are not due to common ancestry. Class Aves Kingdom Protista Class Reptilia 1 Convergent Evolution Cladistics • Emphasizes common ancestry (≠ similarity) • Distinguishes between two kinds of similarities: 1) similarity due to common ancestry 2) similarity due to convergent evolution • Most widely used approach to reconstructing phylogenies. Homology Bird feathers are made of keratin. a phenotype that is similar in two species because it was inherited from a common ancestor. fly wings ≠ bird wings Homoplasy a phenotype that is similar in two species because of convergent evolution. Insect wings are made of chiton. An Example of Cladistics Relate the following taxa: • Horse • Seal • Lion • House Cat Homoplasy 2 Data for Cladistic Analysis How to do a Cladistic Analysis * * Cat Lion Horse Seal Cat Lion Horse 1. Define a set of traits. Seal Lizard Trait Lizard 2. Analyze traits for homology. Retractable Claws 1 0 0 0 0 Pointed Molars 1 1 0 0 0 3. Assign phenotypic states to each taxon.
    [Show full text]
  • EVOLUTIONARY INFERENCE: Some Basics of Phylogenetic Analyses
    EVOLUTIONARY INFERENCE: Some basics of phylogenetic analyses. Ana Rojas Mendoza CNIO-Madrid-Spain. Alfonso Valencia’s lab. Aims of this talk: • 1.To introduce relevant concepts of evolution to practice phylogenetic inference from molecular data. • 2.To introduce some of the most useful methods and computer programmes to practice phylogenetic inference. • • 3.To show some examples I’ve worked in. SOME BASICS 11--ConceptsConcepts ofof MolecularMolecular EvolutionEvolution • Homology vs Analogy. • Homology vs similarity. • Ortologous vs Paralogous genes. • Species tree vs genes tree. • Molecular clock. • Allele mutation vs allele substitution. • Rates of allele substitution. • Neutral theory of evolution. SOME BASICS Owen’s definition of homology Richard Owen, 1843 • Homologue: the same organ under every variety of form and function (true or essential correspondence). •Analogy: superficial or misleading similarity. SOME BASICS 1.Concepts1.Concepts ofof MolecularMolecular EvolutionEvolution • Homology vs Analogy. • Homology vs similarity. • Ortologous vs Paralogous genes. • Species tree vs genes tree. • Molecular clock. • Allele mutation vs allele substitution. • Rates of allele substitution. • Neutral theory of evolution. SOME BASICS Similarity ≠ Homology • Similarity: mathematical concept . Homology: biological concept Common Ancestry!!! SOME BASICS 1.Concepts1.Concepts ofof MolecularMolecular EvolutionEvolution • Homology vs Analogy. • Homology vs similarity. • Ortologous vs Paralogous genes. • Species tree vs genes tree. • Molecular clock.
    [Show full text]
  • The Probability of Monophyly of a Sample of Gene Lineages on a Species Tree
    PAPER The probability of monophyly of a sample of gene COLLOQUIUM lineages on a species tree Rohan S. Mehtaa,1, David Bryantb, and Noah A. Rosenberga aDepartment of Biology, Stanford University, Stanford, CA 94305; and bDepartment of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand Edited by John C. Avise, University of California, Irvine, CA, and approved April 18, 2016 (received for review February 5, 2016) Monophyletic groups—groups that consist of all of the descendants loci that are reciprocally monophyletic is informative about the of a most recent common ancestor—arise naturally as a conse- time since species divergence and can assist in representing the quence of descent processes that result in meaningful distinctions level of differentiation between groups (4, 18). between organisms. Aspects of monophyly are therefore central to Many empirical investigations of genealogical phenomena have fields that examine and use genealogical descent. In particular, stud- made use of conceptual and statistical properties of monophyly ies in conservation genetics, phylogeography, population genetics, (19). Comparisons of observed monophyly levels to model pre- species delimitation, and systematics can all make use of mathemat- dictions have been used to provide information about species di- ical predictions under evolutionary models about features of mono- vergence times (20, 21). Model-based monophyly computations phyly. One important calculation, the probability that a set of gene have been used alongside DNA sequence differences between and lineages is monophyletic under a two-species neutral coalescent within proposed clades to argue for the existence of the clades model, has been used in many studies. Here, we extend this calcu- (22), and tests involving reciprocal monophyly have been used to lation for a species tree model that contains arbitrarily many species.
    [Show full text]
  • 'The Realm of Hard Evidence': Novelty, Persuasion And
    Stud. Hist. Phil. Biol. & Biomed. Sci., Vol. 32, No. 2, pp. 343–360, 2001 Pergamon 2001 Elsevier Science Ltd. All rights reserved. Printed in Great Britain 1369-8486/01 $ - see front matter www.elsevier.com/locate/shpsc ‘The Realm of Hard Evidence’: Novelty, Persuasion and Collaboration in Botanical Cladistics Jim Endersby* In 1998 a new classification of flowering plants generated headlines in the non- specialist press in Britain. By interviewing those involved with, or critical of, the new classification, this essay examines the participants’ motives and strategies for creating and maintaining a research group. It argues that the classification was produced by an informal alliance whose members collaborated despite their disagreements. This collaboration was possible because standardised methods and common theoretical assumptions served as ‘boundary objects’. The group also created a novel form of collective publication that helped to unite them. Both the collaboration and the pub- lishing strategy were partly motivated by the need to give taxonomy a degree of ‘big science’ credibility that it had previously lacked: creating an international team allowed more comprehensive results; and collective publication served to emphasise both the novelty of the work and its claims to objectivity. Creating a group identity also served to exclude practitioners of alternative forms of taxonomy. Finally, the need to obtain funding for continuing work both created the need to collaborate and influenced the way the classification was presented to the public. 2001 Elsevier Science Ltd. All rights reserved. Keywords: Cladistics; Botanical Taxonomy; Boundary Objects; Sociology of Science; Rhetoric of Science. 1. Introduction: ‘A Rose is Still a Rose’ On 23rd November 1998, the Independent newspaper announced that ‘A rose is still a rose, but everything else in botany is turned on its head’.
    [Show full text]
  • A Comparative Phenetic and Cladistic Analysis of the Genus Holcaspis Chaudoir (Coleoptera: .Carabidae)
    Lincoln University Digital Thesis Copyright Statement The digital copy of this thesis is protected by the Copyright Act 1994 (New Zealand). This thesis may be consulted by you, provided you comply with the provisions of the Act and the following conditions of use: you will use the copy only for the purposes of research or private study you will recognise the author's right to be identified as the author of the thesis and due acknowledgement will be made to the author where appropriate you will obtain the author's permission before publishing any material from the thesis. A COMPARATIVE PHENETIC AND CLADISTIC ANALYSIS OF THE GENUS HOLCASPIS CHAUDOIR (COLEOPTERA: CARABIDAE) ********* A thesis submitted in partial fulfilment of the requirements for the degree of Doctor of Philosophy at Lincoln University by Yupa Hanboonsong ********* Lincoln University 1994 Abstract of a thesis submitted in partial fulfilment of the requirements for the degree of Ph.D. A comparative phenetic and cladistic analysis of the genus Holcaspis Chaudoir (Coleoptera: .Carabidae) by Yupa Hanboonsong The systematics of the endemic New Zealand carabid genus Holcaspis are investigated, using phenetic and cladistic methods, to construct phenetic and phylogenetic relationships. Three different character data sets: morphological, allozyme and random amplified polymorphic DNA (RAPD) based on the polymerase chain reaction (PCR), are used to estimate the relationships. Cladistic and morphometric analyses are undertaken on adult morphological characters. Twenty six external morphological characters, including male and female genitalia, are used for cladistic analysis. The results from the cladistic analysis are strongly congruent with previous publications. The morphometric analysis uses multivariate discriminant functions, with 18 morphometric variables, to derive a phenogram by clustering from Mahalanobis distances (D2) of the discrimination analysis using the unweighted pair-group method with arithmetical averages (UPGMA).
    [Show full text]
  • Outer and Inner Indo-Aryan, and Northern India As an Ancient Linguistic Area
    Acta Orientalia 2016: 77, 71–132. Copyright © 2016 Printed in India – all rights reserved ACTA ORIENTALIA ISSN 0001-6483 Outer and Inner Indo-Aryan, and northern India as an ancient linguistic area Claus Peter Zoller University of Oslo Abstract The article presents a new approach to the old controversy concerning the veracity of a distinction between Outer and Inner Languages in Indo-Aryan. A number of arguments and data are presented which substantiate the reality of this distinction. This new approach combines this issue with a new interpretation of the history of Indo- Iranian and with the linguistic prehistory of northern India. Data are presented to show that prehistorical northern India was dominated by Munda/Austro-Asiatic languages. Keywords: Indo-Aryan, Indo-Iranian, Nuristani, Munda/Austro- Asiatic history and prehistory. Introduction This article gives a summary of the most important arguments contained in my forthcoming book on Outer and Inner languages before and after the arrival of Indo-Aryan in South Asia. The 72 Claus Peter Zoller traditional version of the hypothesis of Outer and Inner Indo-Aryan purports the idea that the Indo-Aryan Language immigration1 was not a singular event. Yet, even though it is known that the actual historical movements and processes in connection with this immigration were remarkably complex, the concerns of the hypothesis are not to reconstruct the details of these events but merely to show that the original non-singular immigrations have left revealing linguistic traces in the modern Indo-Aryan languages. Actually, this task is challenging enough, as the long-lasting controversy shows.2 Previous and present proponents of the hypothesis have tried to fix the difference between Outer and Inner Languages in terms of language geography (one graphical attempt as an example is shown below p.
    [Show full text]
  • The Shape of Phylogenetic Trees (Review Paper)
    REVIEW PAPER: THE SHAPE OF PHYLOGENETIC TREESPACE KATHERINE ST. JOHN Abstract: Trees are a canonical structure for representing evolutionary histories. Many popular criteria used to infer optimal trees are computationally hard, and the number of possible tree shapes grows super-exponentially in the number of taxa. The underlying structure of the spaces of trees yields rich insights that can improve the search for optimal trees, both in accuracy and running time, and the analysis and visualization of results. We review the past work on analyzing and comparing trees by their shape as well as recent work that incorporates trees with weighted branch lengths. Keywords: tree metrics, treespace, maximum parsimony, maximum likelihood. Tree structures have long been used to represent the evolutionary histories of sets of species. For example, the tips of the trees represent extant species and the internal nodes represent speciation events. Despite its simplicity, the tree model captures much of the complexity of the underlying phenomena. However, the sheer number of possibilities forces many simply presented operations on trees to be computationally hard. For example, the maximum parsimony criteria (Farris, 1970; Fitch, 1971) that seeks the tree with the minimal number of changes across the edges is compu- tationally hard to compute exactly (Foulds and Graham, 1982). The addition of weights on the branches, to denote quantities such as amount of evolutionary change, the time, or the confidence on the existence of the branch, adds complexity to the model (Felsenstein, 1973, 1978). A pop- ular corresponding optimality criteria for weighted trees, the maximum likelihood criteria, is also computationally hard (Roch, 2006).
    [Show full text]
  • Clustering and Phylogenetic Approaches to Classification: Illustration on Stellar Tracks Didier Fraix-Burnet, Marc Thuillard
    Clustering and Phylogenetic Approaches to Classification: Illustration on Stellar Tracks Didier Fraix-Burnet, Marc Thuillard To cite this version: Didier Fraix-Burnet, Marc Thuillard. Clustering and Phylogenetic Approaches to Classification: Il- lustration on Stellar Tracks. 2014. hal-01703341 HAL Id: hal-01703341 https://hal.archives-ouvertes.fr/hal-01703341 Preprint submitted on 7 Feb 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Clustering and Phylogenetic Approaches to Classification: Illustration on Stellar Tracks D. Fraix-Burnet1, M. Thuillard2 1 Univ. Grenoble Alpes, CNRS, IPAG, 38000 Grenoble, France email: [email protected] 2 La Colline, 2072 St-Blaise, Switzerland February 7, 2018 This pedagogicalarticle was written in 2014 and is yet unpublished. Partofit canbefoundin Fraix-Burnet (2015). Abstract Classifying objects into groups is a natural activity which is most often a prerequisite before any physical analysis of the data. Clustering and phylogenetic approaches are two different and comple- mentary ways in this purpose: the first one relies on similarities and the second one on relationships. In this paper, we describe very simply these approaches and show how phylogenetic techniques can be used in astrophysics by using a toy example based on a sample of stars obtained from models of stellar evolution.
    [Show full text]
  • Autochthonous Aryans? the Evidence from Old Indian and Iranian Texts
    Michael Witzel Harvard University Autochthonous Aryans? The Evidence from Old Indian and Iranian Texts. INTRODUCTION §1. Terminology § 2. Texts § 3. Dates §4. Indo-Aryans in the RV §5. Irano-Aryans in the Avesta §6. The Indo-Iranians §7. An ''Aryan'' Race? §8. Immigration §9. Remembrance of immigration §10. Linguistic and cultural acculturation THE AUTOCHTHONOUS ARYAN THEORY § 11. The ''Aryan Invasion'' and the "Out of India" theories LANGUAGE §12. Vedic, Iranian and Indo-European §13. Absence of Indian influences in Indo-Iranian §14. Date of Indo-Aryan innovations §15. Absence of retroflexes in Iranian §16. Absence of 'Indian' words in Iranian §17. Indo-European words in Indo-Iranian; Indo-European archaisms vs. Indian innovations §18. Absence of Indian influence in Mitanni Indo-Aryan Summary: Linguistics CHRONOLOGY §19. Lack of agreement of the autochthonous theory with the historical evidence: dating of kings and teachers ARCHAEOLOGY __________________________________________ Electronic Journal of Vedic Studies 7-3 (EJVS) 2001(1-115) Autochthonous Aryans? 2 §20. Archaeology and texts §21. RV and the Indus civilization: horses and chariots §22. Absence of towns in the RV §23. Absence of wheat and rice in the RV §24. RV class society and the Indus civilization §25. The Sarasvatī and dating of the RV and the Bråhmaas §26. Harappan fire rituals? §27. Cultural continuity: pottery and the Indus script VEDIC TEXTS AND SCIENCE §28. The ''astronomical code of the RV'' §29. Astronomy: the equinoxes in ŚB §30. Astronomy: Jyotia Vedåga and the
    [Show full text]