Rooting Phylogenetic Trees with Distant Outgroups: a Case Study from the Commelinoid Monocots
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Lecture Notes: the Mathematics of Phylogenetics
Lecture Notes: The Mathematics of Phylogenetics Elizabeth S. Allman, John A. Rhodes IAS/Park City Mathematics Institute June-July, 2005 University of Alaska Fairbanks Spring 2009, 2012, 2016 c 2005, Elizabeth S. Allman and John A. Rhodes ii Contents 1 Sequences and Molecular Evolution 3 1.1 DNA structure . .4 1.2 Mutations . .5 1.3 Aligned Orthologous Sequences . .7 2 Combinatorics of Trees I 9 2.1 Graphs and Trees . .9 2.2 Counting Binary Trees . 14 2.3 Metric Trees . 15 2.4 Ultrametric Trees and Molecular Clocks . 17 2.5 Rooting Trees with Outgroups . 18 2.6 Newick Notation . 19 2.7 Exercises . 20 3 Parsimony 25 3.1 The Parsimony Criterion . 25 3.2 The Fitch-Hartigan Algorithm . 28 3.3 Informative Characters . 33 3.4 Complexity . 35 3.5 Weighted Parsimony . 36 3.6 Recovering Minimal Extensions . 38 3.7 Further Issues . 39 3.8 Exercises . 40 4 Combinatorics of Trees II 45 4.1 Splits and Clades . 45 4.2 Refinements and Consensus Trees . 49 4.3 Quartets . 52 4.4 Supertrees . 53 4.5 Final Comments . 54 4.6 Exercises . 55 iii iv CONTENTS 5 Distance Methods 57 5.1 Dissimilarity Measures . 57 5.2 An Algorithmic Construction: UPGMA . 60 5.3 Unequal Branch Lengths . 62 5.4 The Four-point Condition . 66 5.5 The Neighbor Joining Algorithm . 70 5.6 Additional Comments . 72 5.7 Exercises . 73 6 Probabilistic Models of DNA Mutation 81 6.1 A first example . 81 6.2 Markov Models on Trees . 87 6.3 Jukes-Cantor and Kimura Models . -
A Phylogenomic Analysis of Turtles ⇑ Nicholas G
Molecular Phylogenetics and Evolution 83 (2015) 250–257 Contents lists available at ScienceDirect Molecular Phylogenetics and Evolution journal homepage: www.elsevier.com/locate/ympev A phylogenomic analysis of turtles ⇑ Nicholas G. Crawford a,b,1, James F. Parham c, ,1, Anna B. Sellas a, Brant C. Faircloth d, Travis C. Glenn e, Theodore J. Papenfuss f, James B. Henderson a, Madison H. Hansen a,g, W. Brian Simison a a Center for Comparative Genomics, California Academy of Sciences, 55 Music Concourse Drive, San Francisco, CA 94118, USA b Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA c John D. Cooper Archaeological and Paleontological Center, Department of Geological Sciences, California State University, Fullerton, CA 92834, USA d Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA e Department of Environmental Health Science, University of Georgia, Athens, GA 30602, USA f Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA g Mathematical and Computational Biology Department, Harvey Mudd College, 301 Platt Boulevard, Claremont, CA 9171, USA article info abstract Article history: Molecular analyses of turtle relationships have overturned prevailing morphological hypotheses and Received 11 July 2014 prompted the development of a new taxonomy. Here we provide the first genome-scale analysis of turtle Revised 16 October 2014 phylogeny. We sequenced 2381 ultraconserved element (UCE) loci representing a total of 1,718,154 bp of Accepted 28 October 2014 aligned sequence. Our sampling includes 32 turtle taxa representing all 14 recognized turtle families and Available online 4 November 2014 an additional six outgroups. Maximum likelihood, Bayesian, and species tree methods produce a single resolved phylogeny. -
Phylogenetic Comparative Methods: a User's Guide for Paleontologists
Phylogenetic Comparative Methods: A User’s Guide for Paleontologists Laura C. Soul - Department of Paleobiology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA David F. Wright - Division of Paleontology, American Museum of Natural History, Central Park West at 79th Street, New York, New York 10024, USA and Department of Paleobiology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA Abstract. Recent advances in statistical approaches called Phylogenetic Comparative Methods (PCMs) have provided paleontologists with a powerful set of analytical tools for investigating evolutionary tempo and mode in fossil lineages. However, attempts to integrate PCMs with fossil data often present workers with practical challenges or unfamiliar literature. In this paper, we present guides to the theory behind, and application of, PCMs with fossil taxa. Based on an empirical dataset of Paleozoic crinoids, we present example analyses to illustrate common applications of PCMs to fossil data, including investigating patterns of correlated trait evolution, and macroevolutionary models of morphological change. We emphasize the importance of accounting for sources of uncertainty, and discuss how to evaluate model fit and adequacy. Finally, we discuss several promising methods for modelling heterogenous evolutionary dynamics with fossil phylogenies. Integrating phylogeny-based approaches with the fossil record provides a rigorous, quantitative perspective to understanding key patterns in the history of life. 1. Introduction A fundamental prediction of biological evolution is that a species will most commonly share many characteristics with lineages from which it has recently diverged, and fewer characteristics with lineages from which it diverged further in the past. This principle, which results from descent with modification, is one of the most basic in biology (Darwin 1859). -
Phylogenetic Definitions in the Pre-Phylocode Era; Implications for Naming Clades Under the Phylocode
PaleoBios 27(1):1–6, April 30, 2007 © 2006 University of California Museum of Paleontology Phylogenetic definitions in the pre-PhyloCode era; implications for naming clades under the PhyloCode MiChAel P. TAylor Palaeobiology research Group, School of earth and environmental Sciences, University of Portsmouth, Portsmouth Po1 3Ql, UK; [email protected] The last twenty years of work on phylogenetic nomenclature have given rise to many names and definitions that are now considered suboptimal. in formulating permanent definitions under the PhyloCode when it is implemented, it will be necessary to evaluate the corpus of existing names and make judgements about which to establish and which to discard. This is not straightforward, because early definitions are often inexplicit and ambiguous, generally do not meet the requirements of the PhyloCode, and in some cases may not be easily recognizable as phylogenetic definitions at all. recognition of synonyms is also complicated by the use of different kinds of specifiers (species, specimens, clades, genera, suprageneric rank-based names, and vernacular names) and by definitions whose content changes under different phylogenetic hypotheses. in light of these difficulties, five principles are suggested to guide the interpreta- tion of pre-PhyloCode clade-names and to inform the process of naming clades under the PhyloCode: (1) do not recognize “accidental” definitions; (2) malformed definitions should be interpreted according to the intention of the author when and where this is obvious; (3) apomorphy-based and other definitions must be recognized as well as node-based and stem-based definitions; (4) definitions using any kind of specifier taxon should be recognized; and (5) priority of synonyms and homonyms should guide but not prescribe. -
Phylogeny Codon Models • Last Lecture: Poor Man’S Way of Calculating Dn/Ds (Ka/Ks) • Tabulate Synonymous/Non-Synonymous Substitutions • Normalize by the Possibilities
Phylogeny Codon models • Last lecture: poor man’s way of calculating dN/dS (Ka/Ks) • Tabulate synonymous/non-synonymous substitutions • Normalize by the possibilities • Transform to genetic distance KJC or Kk2p • In reality we use codon model • Amino acid substitution rates meet nucleotide models • Codon(nucleotide triplet) Codon model parameterization Stop codons are not allowed, reducing the matrix from 64x64 to 61x61 The entire codon matrix can be parameterized using: κ kappa, the transition/transversionratio ω omega, the dN/dS ratio – optimizing this parameter gives the an estimate of selection force πj the equilibrium codon frequency of codon j (Goldman and Yang. MBE 1994) Empirical codon substitution matrix Observations: Instantaneous rates of double nucleotide changes seem to be non-zero There should be a mechanism for mutating 2 adjacent nucleotides at once! (Kosiol and Goldman) • • Phylogeny • • Last lecture: Inferring distance from Phylogenetic trees given an alignment How to infer trees and distance distance How do we infer trees given an alignment • • Branch length Topology d 6-p E 6'B o F P Edo 3 vvi"oH!.- !fi*+nYolF r66HiH- .) Od-:oXP m a^--'*A ]9; E F: i ts X o Q I E itl Fl xo_-+,<Po r! UoaQrj*l.AP-^PA NJ o - +p-5 H .lXei:i'tH 'i,x+<ox;+x"'o 4 + = '" I = 9o FF^' ^X i! .poxHo dF*x€;. lqEgrE x< f <QrDGYa u5l =.ID * c 3 < 6+6_ y+ltl+5<->-^Hry ni F.O+O* E 3E E-f e= FaFO;o E rH y hl o < H ! E Y P /-)^\-B 91 X-6p-a' 6J. -
Diversity-Dependent Cladogenesis Throughout Western Mexico: Evolutionary Biogeography of Rattlesnakes (Viperidae: Crotalinae: Crotalus and Sistrurus)
City University of New York (CUNY) CUNY Academic Works Publications and Research New York City College of Technology 2016 Diversity-dependent cladogenesis throughout western Mexico: Evolutionary biogeography of rattlesnakes (Viperidae: Crotalinae: Crotalus and Sistrurus) Christopher Blair CUNY New York City College of Technology Santiago Sánchez-Ramírez University of Toronto How does access to this work benefit ou?y Let us know! More information about this work at: https://academicworks.cuny.edu/ny_pubs/344 Discover additional works at: https://academicworks.cuny.edu This work is made publicly available by the City University of New York (CUNY). Contact: [email protected] 1Blair, C., Sánchez-Ramírez, S., 2016. Diversity-dependent cladogenesis throughout 2 western Mexico: Evolutionary biogeography of rattlesnakes (Viperidae: Crotalinae: 3 Crotalus and Sistrurus ). Molecular Phylogenetics and Evolution 97, 145–154. 4 https://doi.org/10.1016/j.ympev.2015.12.020. © 2016. This manuscript version is made 5 available under the CC-BY-NC-ND 4.0 license. 6 7 8 Diversity-dependent cladogenesis throughout western Mexico: evolutionary 9 biogeography of rattlesnakes (Viperidae: Crotalinae: Crotalus and Sistrurus) 10 11 12 CHRISTOPHER BLAIR1*, SANTIAGO SÁNCHEZ-RAMÍREZ2,3,4 13 14 15 1Department of Biological Sciences, New York City College of Technology, Biology PhD 16 Program, Graduate Center, The City University of New York, 300 Jay Street, Brooklyn, 17 NY 11201, USA. 18 2Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks 19 Street, Toronto, ON, M5S 3B2, Canada. 20 3Department of Natural History, Royal Ontario Museum, 100 Queen’s Park, Toronto, 21 ON, M5S 2C6, Canada. 22 4Present address: Environmental Genomics Group, Max Planck Institute for 23 Evolutionary Biology, August-Thienemann-Str. -
A Phylogenetic Analysis of the Basal Ornithischia (Reptilia, Dinosauria)
A PHYLOGENETIC ANALYSIS OF THE BASAL ORNITHISCHIA (REPTILIA, DINOSAURIA) Marc Richard Spencer A Thesis Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements of the degree of MASTER OF SCIENCE December 2007 Committee: Margaret M. Yacobucci, Advisor Don C. Steinker Daniel M. Pavuk © 2007 Marc Richard Spencer All Rights Reserved iii ABSTRACT Margaret M. Yacobucci, Advisor The placement of Lesothosaurus diagnosticus and the Heterodontosauridae within the Ornithischia has been problematic. Historically, Lesothosaurus has been regarded as a basal ornithischian dinosaur, the sister taxon to the Genasauria. Recent phylogenetic analyses, however, have placed Lesothosaurus as a more derived ornithischian within the Genasauria. The Fabrosauridae, of which Lesothosaurus was considered a member, has never been phylogenetically corroborated and has been considered a paraphyletic assemblage. Prior to recent phylogenetic analyses, the problematic Heterodontosauridae was placed within the Ornithopoda as the sister taxon to the Euornithopoda. The heterodontosaurids have also been considered as the basal member of the Cerapoda (Ornithopoda + Marginocephalia), the sister taxon to the Marginocephalia, and as the sister taxon to the Genasauria. To reevaluate the placement of these taxa, along with other basal ornithischians and more derived subclades, a phylogenetic analysis of 19 taxonomic units, including two outgroup taxa, was performed. Analysis of 97 characters and their associated character states culled, modified, and/or rescored from published literature based on published descriptions, produced four most parsimonious trees. Consistency and retention indices were calculated and a bootstrap analysis was performed to determine the relative support for the resultant phylogeny. The Ornithischia was recovered with Pisanosaurus as its basalmost member. -
Is Ellipura Monophyletic? a Combined Analysis of Basal Hexapod
ARTICLE IN PRESS Organisms, Diversity & Evolution 4 (2004) 319–340 www.elsevier.de/ode Is Ellipura monophyletic? A combined analysis of basal hexapod relationships with emphasis on the origin of insects Gonzalo Giribeta,Ã, Gregory D.Edgecombe b, James M.Carpenter c, Cyrille A.D’Haese d, Ward C.Wheeler c aDepartment of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA bAustralian Museum, 6 College Street, Sydney, New South Wales 2010, Australia cDivision of Invertebrate Zoology, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA dFRE 2695 CNRS, De´partement Syste´matique et Evolution, Muse´um National d’Histoire Naturelle, 45 rue Buffon, F-75005 Paris, France Received 27 February 2004; accepted 18 May 2004 Abstract Hexapoda includes 33 commonly recognized orders, most of them insects.Ongoing controversy concerns the grouping of Protura and Collembola as a taxon Ellipura, the monophyly of Diplura, a single or multiple origins of entognathy, and the monophyly or paraphyly of the silverfish (Lepidotrichidae and Zygentoma s.s.) with respect to other dicondylous insects.Here we analyze relationships among basal hexapod orders via a cladistic analysis of sequence data for five molecular markers and 189 morphological characters in a simultaneous analysis framework using myriapod and crustacean outgroups.Using a sensitivity analysis approach and testing for stability, the most congruent parameters resolve Tricholepidion as sister group to the remaining Dicondylia, whereas most suboptimal parameter sets group Tricholepidion with Zygentoma.Stable hypotheses include the monophyly of Diplura, and a sister group relationship between Diplura and Protura, contradicting the Ellipura hypothesis.Hexapod monophyly is contradicted by an alliance between Collembola, Crustacea and Ectognatha (i.e., exclusive of Diplura and Protura) in molecular and combined analyses. -
1 Integrative Biology 200 "PRINCIPLES OF
Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley B.D. Mishler March 14, 2018. Classification II: Phylogenetic taxonomy including incorporation of fossils; PhyloCode I. Phylogenetic Taxonomy - the argument for rank-free classification A number of recent calls have been made for the reformation of the Linnaean hierarchy (e.g., De Queiroz & Gauthier, 1992). These authors have emphasized that the existing system is based in a non-evolutionary world-view; the roots of the Linnaean hierarchy are in a specially- created world-view. Perhaps the idea of fixed, comparable ranks made some sense under that view, but under an evolutionary world view they don't make sense. There are several problems with the current nomenclatorial system: 1. The current system, with its single type for a name, cannot be used to precisely name a clade. E.g., you may name a family based on a certain type specimen, and even if you were clear about what node you meant to name in your original publication, the exact phylogenetic application of your name would not be clear subsequently, after new clades are added. 2. There are not nearly enough ranks to name the thousands of levels of monophyletic groups in the tree of life. Therefore people are increasingly using informal rank-free names for higher- level nodes, but without any clear, formal specification of what clade is meant. 3. Most aspects of the current code, including priority, revolve around the ranks, which leads to instability of usage. For example, when a change in relationships is discovered, several names often need to be changed to adjust, including those of groups whose circumscription has not changed. -
Bioinfo 11 Phylogenetictree
BIO 390/CSCI 390/MATH 390 Bioinformatics II Programming Lecture 11 Phylogenetic Tree Instructor: Lei Qian Fisk University Phylogenetic Tree Applications • Study ancestor-descendant relationships (Evolutionary biology, adaption, genetic drift, selection, speciation, etc.) • Paleogenomics: inferring ancestral genomic information from extinct species (Comparing Chimpanzee, Neanderthal and Human DNA) • Origins of epidemics (Comparing, at the molecular level, various virus strains) • Drug design: specially targeting groups of organisms (Efficient enumeration of phylogenetically informative substrings) • Forensic (Relationships among HIV strains) • Linguistics (Languages tree divergence times) Phylogenetic Tree Illustrating success stories in phylogenetics (I) For roughly 100 years (more exactly, 1870-1985), scientists were unable to figure out which family the giant panda belongs to. Giant pandas look like bears, but have features that are unusual for bears but typical to raccoons: they do not hibernate, they do not roar, their male genitalia are small and backward-pointing. Anatomical features were the dominant criteria used to derive evolutionary relationships between species since Darwin till early 1960s. The evolutionary relationships derived from these relatively subjective observations were often inconclusive. Some of them were later proved incorrect. In 1985, Steven O’Brien and colleagues solved the giant panda classification problem using DNA sequences and phylogenetic algorithms. Phylogenetic Tree Phylogenetic Tree Illustrating success stories in phylogenetics (II) In 1994, a woman from Lafayette, Louisiana (USA), clamed that her ex-lover (who was a physician) injected her with HIV+ blood. Records showed that the physician had drawn blood from a HIV+ patient that day. But how to prove that the blood from that HIV+ patient ended up in the woman? Phylogenetic Tree HIV has a high mutation rate, which can be used to trace paths of transmission. -
Phylocode: a Phylogenetic Code of Biological Nomenclature
PhyloCode: A Phylogenetic Code of Biological Nomenclature Philip D. Cantino and Kevin de Queiroz (equal contributors; names listed alphabetically) Advisory Group: William S. Alverson, David A. Baum, Harold N. Bryant, David C. Cannatella, Peter R. Crane, Michael J. Donoghue, Torsten Eriksson*, Jacques Gauthier, Kenneth Halanych, David S. Hibbett, David M. Hillis, Kathleen A. Kron, Michael S. Y. Lee, Alessandro Minelli, Richard G. Olmstead, Fredrik Pleijel*, J. Mark Porter, Heidi E. Robeck, Greg W. Rouse, Timothy Rowe*, Christoffer Schander, Per Sundberg, Mikael Thollesson, and Andre R. Wyss. *Chaired a committee that authored a portion of the current draft. Most recent revision: April 8, 2000 1 Table of Contents Preface Preamble Division I. Principles Division II. Rules Chapter I. Taxa Article 1. The Nature of Taxa Article 2. Clades Article 3. Hierarchy and Rank Chapter II. Publication Article 4. Publication Requirements Article 5. Publication Date Chapter III. Names Section 1. Status Article 6 Section 2. Establishment Article 7. General Requirements Article 8. Registration Chapter IV. Clade Names Article 9. General Requirements for Establishment of Clade Names Article 10. Selection of Clade Names for Establishment Article 11. Specifiers and Qualifying Clauses Chapter V. Selection of Accepted Names Article 12. Precedence Article 13. Homonymy Article 14. Synonymy Article 15. Conservation Chapter VI. Provisions for Hybrids Article 16. Chapter VII. Orthography Article 17. Orthographic Requirements for Establishment Article 18. Subsequent Use and Correction of Established Names Chapter VIII. Authorship of Names Article 19. Chapter IX. Citation of Authors and Registration Numbers Article 20. Chapter X. Governance Article 21. Glossary Table 1. Equivalence of Nomenclatural Terms Appendix A. -
Hemidactylium Scutatum): out of Appalachia and Into the Glacial Aftermath
RANGE-WIDE PHYLOGEOGRAPHY OF THE FOUR-TOED SALAMANDER (HEMIDACTYLIUM SCUTATUM): OUT OF APPALACHIA AND INTO THE GLACIAL AFTERMATH Timothy A. Herman A Thesis Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE August 2009 Committee: Juan Bouzat, Advisor Christopher Phillips Karen Root ii ABSTRACT Juan Bouzat, Advisor Due to its limited vagility, deep ancestry, and broad distribution, the four-toed salamander (Hemidactylium scutatum) is well suited to track biogeographic patterns across eastern North America. The range of the monotypic genus Hemidactylium is highly disjunct in its southern and western portions, and even within contiguous portions is highly localized around pockets of preferred nesting habitat. Over 330 Hemidactylium genetic samples from 79 field locations were collected and analyzed via mtDNA sequencing of the cytochrome oxidase 1 gene (co1). Phylogenetic analyses showed deep divergences at this marker (>10% between some haplotypes) and strong support for regional monophyletic clades with minimal overlap. Patterns of haplotype distribution suggest major river drainages, both ancient and modern, as boundaries to dispersal. Two distinct allopatric clades account for all sampling sites within glaciated areas of North America yet show differing patterns of recolonization. High levels of haplotype diversity were detected in the southern Appalachians, with several members of widely ranging clades represented in the region as well as other unique, endemic, and highly divergent lineages. Bayesian divergence time analyses estimated the common ancestor of all living Hemidactylium included in the study at roughly 8 million years ago, with the most basal splits in the species confined to the Blue Ridge Mountains.