Maximum Parsimony
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Phylogenetics Topic 2: Phylogenetic and Genealogical Homology
Phylogenetics Topic 2: Phylogenetic and genealogical homology Phylogenies distinguish homology from similarity Previously, we examined how rooted phylogenies provide a framework for distinguishing similarity due to common ancestry (HOMOLOGY) from non-phylogenetic similarity (ANALOGY). Here we extend the concept of phylogenetic homology by making a further distinction between a HOMOLOGOUS CHARACTER and a HOMOLOGOUS CHARACTER STATE. This distinction is important to molecular evolution, as we often deal with data comprised of homologous characters with non-homologous character states. The figure below shows three hypothetical protein-coding nucleotide sequences (for simplicity, only three codons long) that are related to each other according to a phylogenetic tree. In the figure the nucleotide sequences are aligned to each other; in so doing we are making the implicit assumption that the characters aligned vertically are homologous characters. In the specific case of nucleotide and amino acid alignments this assumption is called POSITIONAL HOMOLOGY. Under positional homology it is implicit that a given position, say the first position in the gene sequence, was the same in the gene sequence of the common ancestor. In the figure below it is clear that some positions do not have identical character states (see red characters in figure below). In such a case the involved position is considered to be a homologous character, while the state of that character will be non-homologous where there are differences. Phylogenetic perspective on homologous characters and homologous character states ACG TAC TAA SYNAPOMORPHY: a shared derived character state in C two or more lineages. ACG TAT TAA These must be homologous in state. -
An Introduction to Phylogenetic Analysis
This article reprinted from: Kosinski, R.J. 2006. An introduction to phylogenetic analysis. Pages 57-106, in Tested Studies for Laboratory Teaching, Volume 27 (M.A. O'Donnell, Editor). Proceedings of the 27th Workshop/Conference of the Association for Biology Laboratory Education (ABLE), 383 pages. Compilation copyright © 2006 by the Association for Biology Laboratory Education (ABLE) ISBN 1-890444-09-X All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the copyright owner. Use solely at one’s own institution with no intent for profit is excluded from the preceding copyright restriction, unless otherwise noted on the copyright notice of the individual chapter in this volume. Proper credit to this publication must be included in your laboratory outline for each use; a sample citation is given above. Upon obtaining permission or with the “sole use at one’s own institution” exclusion, ABLE strongly encourages individuals to use the exercises in this proceedings volume in their teaching program. Although the laboratory exercises in this proceedings volume have been tested and due consideration has been given to safety, individuals performing these exercises must assume all responsibilities for risk. The Association for Biology Laboratory Education (ABLE) disclaims any liability with regards to safety in connection with the use of the exercises in this volume. The focus of ABLE is to improve the undergraduate biology laboratory experience by promoting the development and dissemination of interesting, innovative, and reliable laboratory exercises. -
Phylogenetic Comparative Methods: a User's Guide for Paleontologists
Phylogenetic Comparative Methods: A User’s Guide for Paleontologists Laura C. Soul - Department of Paleobiology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA David F. Wright - Division of Paleontology, American Museum of Natural History, Central Park West at 79th Street, New York, New York 10024, USA and Department of Paleobiology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA Abstract. Recent advances in statistical approaches called Phylogenetic Comparative Methods (PCMs) have provided paleontologists with a powerful set of analytical tools for investigating evolutionary tempo and mode in fossil lineages. However, attempts to integrate PCMs with fossil data often present workers with practical challenges or unfamiliar literature. In this paper, we present guides to the theory behind, and application of, PCMs with fossil taxa. Based on an empirical dataset of Paleozoic crinoids, we present example analyses to illustrate common applications of PCMs to fossil data, including investigating patterns of correlated trait evolution, and macroevolutionary models of morphological change. We emphasize the importance of accounting for sources of uncertainty, and discuss how to evaluate model fit and adequacy. Finally, we discuss several promising methods for modelling heterogenous evolutionary dynamics with fossil phylogenies. Integrating phylogeny-based approaches with the fossil record provides a rigorous, quantitative perspective to understanding key patterns in the history of life. 1. Introduction A fundamental prediction of biological evolution is that a species will most commonly share many characteristics with lineages from which it has recently diverged, and fewer characteristics with lineages from which it diverged further in the past. This principle, which results from descent with modification, is one of the most basic in biology (Darwin 1859). -
II. a Cladistic Analysis of Rbcl Sequences and Morphology
Systematic Botany (2003), 28(2): pp. 270±292 q Copyright 2003 by the American Society of Plant Taxonomists Phylogenetic Relationships in the Commelinaceae: II. A Cladistic Analysis of rbcL Sequences and Morphology TIMOTHY M. EVANS,1,3 KENNETH J. SYTSMA,1 ROBERT B. FADEN,2 and THOMAS J. GIVNISH1 1Department of Botany, University of Wisconsin, 430 Lincoln Drive, Madison, Wisconsin 53706; 2Department of Systematic Biology-Botany, MRC 166, National Museum of Natural History, Smithsonian Institution, P.O. Box 37012, Washington, DC 20013-7012; 3Present address, author for correspondence: Department of Biology, Hope College, 35 East 12th Street, Holland, Michigan 49423-9000 ([email protected]) Communicating Editor: John V. Freudenstein ABSTRACT. The chloroplast-encoded gene rbcL was sequenced in 30 genera of Commelinaceae to evaluate intergeneric relationships within the family. The Australian Cartonema was consistently placed as sister to the rest of the family. The Commelineae is monophyletic, while the monophyly of Tradescantieae is in question, due to the position of Palisota as sister to all other Tradescantieae plus Commelineae. The phylogeny supports the most recent classi®cation of the family with monophyletic tribes Tradescantieae (minus Palisota) and Commelineae, but is highly incongruent with a morphology-based phylogeny. This incongruence is attributed to convergent evolution of morphological characters associated with pollination strategies, especially those of the androecium and in¯orescence. Analysis of the combined data sets produced a phylogeny similar to the rbcL phylogeny. The combined analysis differed from the molecular one, however, in supporting the monophyly of Dichorisandrinae. The family appears to have arisen in the Old World, with one or possibly two movements to the New World in the Tradescantieae, and two (or possibly one) subsequent movements back to the Old World; the latter are required to account for the Old World distribution of Coleotrypinae and Cyanotinae, which are nested within a New World clade. -
Hemiplasy and Homoplasy in the Karyotypic Phylogenies of Mammals
Hemiplasy and homoplasy in the karyotypic phylogenies of mammals Terence J. Robinson*†, Aurora Ruiz-Herrera‡, and John C. Avise†§ *Evolutionary Genomics Group, Department of Botany and Zoology, University of Stellenbosch, Private Bag X1, Matieland 7602, South Africa; ‡Laboratorio di Biologia Cellulare e Molecolare, Dipartimento di Genetica e Microbiologia, Universita`degli Studi di Pavia, via Ferrata 1, 27100 Pavia, Italy; and §Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697 Contributed by John C. Avise, July 31, 2008 (sent for review May 28, 2008) Phylogenetic reconstructions are often plagued by difficulties in (2). These syntenic blocks, sometimes shared across even dis- distinguishing phylogenetic signal (due to shared ancestry) from tantly related species, may involve entire chromosomes, chro- phylogenetic noise or homoplasy (due to character-state conver- mosomal arms, or chromosomal segments. Based on the premise gences or reversals). We use a new interpretive hypothesis, termed that each syntenic assemblage in extant species is of monophy- hemiplasy, to show how random lineage sorting might account for letic origin, researchers have reconstructed phylogenies and specific instances of seeming ‘‘phylogenetic discordance’’ among ancestral karyotypes for numerous mammalian taxa (ref. 3 and different chromosomal traits, or between karyotypic features and references therein). Normally, the phylogenetic inferences from probable species phylogenies. We posit that hemiplasy is generally these cladistic appraisals are self-consistent (across syntenic less likely for underdominant chromosomal polymorphisms (i.e., blocks) and taxonomically reasonable, and they have helped those with heterozygous disadvantage) than for neutral polymor- greatly to identify particular mammalian clades. However, in a phisms or especially for overdominant rearrangements (which few cases problematic phylogenetic patterns of the sort described should tend to be longer-lived), and we illustrate this concept by above have emerged. -
Collecting Reliable Clades Using the Greedy Strict Consensus Merger
Collecting reliable clades using the Greedy Strict Consensus Merger Markus Fleischauer and Sebastian Böcker Lehrstuhl für Bioinformatik, Friedrich-Schiller Universität, Jena, Thüringen, Germany ABSTRACT Supertree methods combine a set of phylogenetic trees into a single supertree. Similar to supermatrix methods, these methods provide a way to reconstruct larger parts of the Tree of Life, potentially evading the computational complexity of phylogenetic inference methods such as maximum likelihood. The supertree problem can be formalized in different ways, to cope with contradictory information in the input. Many supertree methods have been developed. Some of them solve NP-hard optimization problems like the well-known Matrix Representation with Parsimony, while others have polynomial worst-case running time but work in a greedy fashion (FlipCut). Both can profit from a set of clades that are already known to be part of the supertree. The Superfine approach shows how the Greedy Strict Consensus Merger (GSCM) can be used as preprocessing to find these clades. We introduce different scoring functions for the GSCM, a randomization, as well as a combination thereof to improve the GSCM to find more clades. This helps, in turn, to improve the resolution of the GSCM supertree. We find this modifications to increase the number of true positive clades by 18% compared to the currently used Overlap scoring. Subjects Bioinformatics, Computational Biology, Evolutionary Studies, Genetics, Taxonomy Keywords Consensus, Supertree, Supermatrix, Divide and Conquer, FlipCut, Phylogeny INTRODUCTION Submitted 14 September 2015 Supertree methods are used to combine a set of phylogenetic trees with non-identical but Accepted 3 June 2016 Published 28 June 2016 overlapping taxon sets, into a larger supertree that contains all the taxa of every input tree. -
Supertree Construction: Opportunities and Challenges
Supertree Construction: Opportunities and Challenges Tandy Warnow Abstract Supertree construction is the process by which a set of phylogenetic trees, each on a subset of the overall set S of species, is combined into a tree on the full set S. The traditional use of supertree methods is the assembly of a large species tree from previously computed smaller species trees; however, supertree methods are also used to address large-scale tree estimation using divide-and-conquer (i.e., a dataset is divided into overlapping subsets, trees are constructed on the subsets, and then combined using the supertree method). Because most supertree methods are heuristics for NP-hard optimization problems, the use of supertree estimation on large datasets is challenging, both in terms of scalability and accuracy. In this chap- ter, we describe the current state of the art in supertree construction and the use of supertree methods in divide-and-conquer strategies. Finally, we identify directions where future research could lead to improved supertree methods. Key words: supertrees, phylogenetics, species trees, divide-and-conquer, incom- plete lineage sorting, Tree of Life 1 Introduction The estimation of phylogenetic trees, whether gene trees (that is, the phylogeny re- lating the sequences found at a specific locus in different species) or species trees, is a precursor to many downstream analyses, including protein structure and func- arXiv:1805.03530v1 [q-bio.PE] 9 May 2018 tion prediction, the detection of positive selection, co-evolution, human population genetics, etc. (see [58] for just some of these applications). Indeed, as Dobzhanksy said, “Nothing in biology makes sense except in the light of evolution” [44]. -
Phylogenetic Analysis
Phylogenetic Analysis Aristotle • Through classification, one might discover the essence and purpose of species. Nelson & Platnick (1981) Systematics and Biogeography Carl Linnaeus • Swedish botanist (1700s) • Listed all known species • Developed scheme of classification to discover the plan of the Creator 1 Linnaeus’ Main Contributions 1) Hierarchical classification scheme Kingdom: Phylum: Class: Order: Family: Genus: Species 2) Binomial nomenclature Before Linnaeus physalis amno ramosissime ramis angulosis glabris foliis dentoserratis After Linnaeus Physalis angulata (aka Cutleaf groundcherry) 3) Originated the practice of using the ♂ - (shield and arrow) Mars and ♀ - (hand mirror) Venus glyphs as the symbol for male and female. Charles Darwin • Species evolved from common ancestors. • Concept of closely related species being more recently diverged from a common ancestor. Therefore taxonomy might actually represent phylogeny! The phylogeny and classification of life a proposed by Haeckel (1866). 2 Trees - Rooted and Unrooted 3 Trees - Rooted and Unrooted ABCDEFGHIJ A BCDEH I J F G ROOT ROOT D E ROOT A F B H J G C I 4 Monophyletic: A group composed of a collection of organisms, including the most recent common ancestor of all those organisms and all the descendants of that most recent common ancestor. A monophyletic taxon is also called a clade. Paraphyletic: A group composed of a collection of organisms, including the most recent common ancestor of all those organisms. Unlike a monophyletic group, a paraphyletic group does not include all the descendants of the most recent common ancestor. Polyphyletic: A group composed of a collection of organisms in which the most recent common ancestor of all the included organisms is not included, usually because the common ancestor lacks the characteristics of the group. -
Stylophoran Supertrees Revisited
Stylophoran supertrees revisited BERTRAND LEFEBVRE Lefebvre, B. 2005. Stylophoran supertrees revisited. Acta Palaeontologica Polonica 50 (3): 477–486. Supertree analysis is a recent exploratory method that involves the simultaneous combination of two or more charac− ter−based source trees into a single consensus supertree. This method was recently applied by Ruta to a fossil group of enigmatic Palaeozoic forms, the stylophoran echinoderms. Ruta’s supertree suggested that mitrates are polyphyletic and originated from paraphyletic cornutes. Re−examination of Ruta’s data matrix strongly suggests that most source trees were based on dubious homologies resulting from theory−laden assumptions (calcichordate model) or superficial similar− ities (ankyroid scenario). A new supertree analysis was performed using a slightly corrected version of Ruta’s original combined matrix; the 70% majority−rule consensus of 24,168 most parsimonious supertrees suggests that mitrates are monophyletic and derived from paraphyletic cornutes. A second new supertree analysis was generated to test the influ− ence of the pruning of three taxa in some calcichordate source trees; the 70% majority−rule consensus of 3,720 shortest supertrees indicates that both cornutes and mitrates are monophyletic and derived from a Ceratocystis−like ancestor. The two new supertree analyses demonstrate the dramatic influence of the relative contributions of each initial assumption of plate homologies (and underlying anatomical interpretations), in original source trees, on the final topology of supertrees. Key words: Echinodermata, Stylophora, Cornuta, Mitrata, Ankyroida, Calcichordata, supertree, Palaeozoic. Bertrand Lefebvre [bertrand.lefebvre@u−bourgogne.fr], Biogéosciences, Université de Bourgogne, 6 boulevard Ga− briel, F−21000 Dijon, France. Introduction disparate, character−derived trees (Purvis 1995; Bininda− Emonds et al. -
Phylogenetic Supertree Reveals Detailed Evolution of SARS-Cov-2
www.nature.com/scientificreports OPEN Phylogenetic supertree reveals detailed evolution of SARS‑CoV‑2 Tingting Li1, Dongxia Liu2, Yadi Yang1, Jiali Guo3, Yujie Feng3, Xinmo Zhang3, Shilong Cheng4 & Jie Feng2* Corona Virus Disease 2019 (COVID‑19) caused by the emerged coronavirus SARS‑CoV‑2 is spreading globally. The origin of SARS‑Cov‑2 and its evolutionary relationship is still ambiguous. Several reports attempted to fgure out this critical issue by genome‑based phylogenetic analysis, yet limited progress was obtained, principally owing to the disability of these methods to reasonably integrate phylogenetic information from all genes of SARS‑CoV‑2. Supertree method based on multiple trees can produce the overall reasonable phylogenetic tree. However, the supertree method has been barely used for phylogenetic analysis of viruses. Here we applied the matrix representation with parsimony (MRP) pseudo‑sequence supertree analysis to study the origin and evolution of SARS‑CoV‑2. Compared with other phylogenetic analysis methods, the supertree method showed more resolution power for phylogenetic analysis of coronaviruses. In particular, the MRP pseudo‑sequence supertree analysis frmly disputes bat coronavirus RaTG13 be the last common ancestor of SARS‑CoV‑2, which was implied by other phylogenetic tree analysis based on viral genome sequences. Furthermore, the discovery of evolution and mutation in SARS‑CoV‑2 was achieved by MRP pseudo‑sequence supertree analysis. Taken together, the MRP pseudo‑sequence supertree provided more information on the SARS‑CoV‑2 evolution inference relative to the normal phylogenetic tree based on full‑length genomic sequences. Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), a novel coronavirus emerged in December 2019, causes an ongoing outbreak of Corona Virus Disease 2019 (COVID-19). -
Chapter Some Desiderata for Liberal Supertrees
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by MURAL - Maynooth University Research Archive Library Chapter Some Desiderata for Liberal Supertrees 1 2 3, MARK WILKINSON , JOSEPH L. THORLEY , DAVIDE PISANI FRANCOIS- 4 5 JOSEPH LAPOINTE , JAMES O. MCINERNEY 1Department of Zoology, The Natural History Museum, London SW7 5BD, UK 2Fisheries Research Service, Freshwater Laboratory, Faskally, Pitlochry, Perthshire PH16 5LB, UK 3Department of Biology, The Pennsylvania State University. 208 Mueller Laboratory, University Park, 16802, PA, USA 4Département de Sciences Biologiques, Université de Montréal, C.P. 6128, Succ. Centre-ville, Montréal (Québec), H3C 3J7, Canada. 5Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland. Abstract: Although a variety of supertree methods has been proposed, our understanding of these methods is limited. In turn, this limits the potential for biologists who seek to construct supertrees to make informed choices among the available methods. In this chapter we distinguish between supertree methods that offer a conservative synthesis of the relationships that are agreed upon or uncontradicted by all the input trees and liberal supertree methods which have the potential to resolve conflict. We list a series of potential desiderata (i.e. desirable properties) of liberal supertree methods, discuss their relevance to biologists and highlight where it is known that particular methods do or do not satisfy them. For biologists, the primary aim of liberal supertree construction is to produce accurate phylogenies and most of our desiderata relate to this prime objective. Secondary desiderata pertain to the practicality of supertree methods, particularly their speed. -
Quantifying the Risk of Hemiplasy in Phylogenetic Inference
Quantifying the risk of hemiplasy in phylogenetic inference Rafael F. Guerreroa,b,1 and Matthew W. Hahna,b aDepartment of Biology, Indiana University, Bloomington, IN 47405; and bDepartment of Computer Science, Indiana University, Bloomington, IN 47405 Edited by David M. Hillis, The University of Texas at Austin, Austin, TX, and approved November 5, 2018 (received for review June 29, 2018) Convergent evolution—the appearance of the same character paralogs as orthologs, and a lack of fit between the sequence state in apparently unrelated organisms—is often inferred when a data and the substitution model used to infer the tree. The bi- trait is incongruent with the species tree. However, trait incongru- ological causes of gene-tree discordance are incomplete lineage ence can also arise from changes that occur on discordant gene sorting (ILS) and introgression/hybridization (18), although trees, a process referred to as hemiplasy. Hemiplasy is rarely taken natural selection can sometimes result in discordance (e.g., ref. into account in studies of convergent evolution, despite the fact 19). In the presence of any of these biological processes, indi- that phylogenomic studies have revealed rampant discordance. vidual loci can have different histories from the species tree; Here, we study the relative probabilities of homoplasy (including discordance due to either introgression or ILS in the history of convergence and reversal) and hemiplasy for an incongruent trait. extant lineages does not go away over time, so studies of both We derive expressions for the probabilities of the two events, ancient and recent divergences can be affected. showing that they depend on many of the same parameters.