Phylogenetics Topic 3: Methods of Inferring Phylogenies

Total Page:16

File Type:pdf, Size:1020Kb

Phylogenetics Topic 3: Methods of Inferring Phylogenies Phylogenetics Topic 3: Methods of inferring phylogenies Because no person was present to directly observe the evolution of a group of organisms, biologists must infer phylogenies from the characters of living and fossil taxa. These days, the vast majority of phylogenies are reconstructed from variation among nucleotide or amino acid sequences. However, a wide variety of other types of molecular data can be used to reconstruct phylogenies; examples include restriction fragment length polymorphisms (RFLPS), insertion-deletion events (INDELS), chromosomal rearrangements, DNA-DNA hybridization, to name a few. Numerous methods of reconstructing trees have been implemented. This lecture covers a very brief, and non-technical, introduction to the most common methods. A generalized protocol for molecular phylogenetics, and the associated concerns with each step Concerns: Collect homolgous sequences gene tree-species tree / paralogy–orthology / trees within trees Multiple sequence alignment positional homology / gaps / subjectivity-objectivity / methods Phylogeny estimation philosophy / methods / consistency / power and accuracy Test reliability or fit of phylogenetic branch support / tree comparison / statistic issues with trees estimates independent contrasts / impact of error on conclusions Interpretation and application Classification of tree-reconstruction methods PARSIMONY METHODS: These methods utilize variation in CHARACTER STATES to reconstruct phylogenies. Character states are most often variation in the nucleotide (see below) or amino acid “states” at a site in a sequence of such characters. Such sequences often correspond to genes, but other sorts of sequences of characters could be just as useful; examples include nucleotides of introns or inter-genic regions, restriction site polymorphisms, or morphological characters. Alignment of the nucleotide character states of the β-globin gene from five species of mammals human cow rabbit rat opossum GTG CTG TCT CCT GCC GAC AAG ACC AAC GTC AAG GCC GCC TGG GGC AAG GTT GGC GCG CAC ... ... ... G.C ... ... ... T.. ..T ... ... ... ... ... ... ... ... ... .GC A.. ... ... ... ..C ..T ... ... ... ... A.. ... A.T ... ... .AA ... A.C ... AGC ... ... ..C ... G.A .AT ... ..A ... ... A.. ... AA. TG. ... ..G ... A.. ..T .GC ..T ... ..C ..G GA. ..T ... ... ..T C.. ..G ..A ... AT. ... ..T ... ..G ..A .GC ... GCT GGC GAG TAT GGT GCG GAG GCC CTG GAG AGG ATG TTC CTG TCC TTC CCC ACC ACC AAG ... ..A .CT ... ..C ..A ... ..T ... ... ... ... ... ... AG. ... ... ... ... ... .G. ... ... ... ..C ..C ... ... G.. ... ... ... ... T.. GG. ... ... ... ... ... .G. ..T ..A ... ..C .A. ... ... ..A C.. ... ... ... GCT G.. ... ... ... ... ... ..C ..T .CC ..C .CA ..T ..A ..T ..T .CC ..A .CC ... ..C ... ... ... ..T ... ..A ACC TAC TTC CCG CAC TTC GAC CTG AGC CAC GGC TCT GCC CAG GTT AAG GGC CAC GGC AAG ... ... ... ..C ... ... ... ... ... ... ... ..G ... ... ..C ... ... ... ... G.. ... ... ... ..C ... ... ... T.C .C. ... ... ... .AG ... A.C ..A .C. ... ... ... ... ... ... T.T ... A.T ..T G.A ... .C. ... ... ... ... ..C ... .CT ... ... ... ..T ... ... ..C ... ... ... ... TC. .C. ... ..C ... ... A.C C.. ..T ..T ..T ... The order of DNA sequences in the alignment is specified by the order of the taxa in the list. To fit it on the page, the alignment is broken into three parts; such alignments are called INTERLEAVED. The complete DNA sequence is shown for the fist taxon (human). All the other sequences are shown relative to human, with the dot, “.”, signifying a match in the character state with the human sequences. Differences are indicated by using the single-letter nucleotide code (A,C,T or G). Note that this alignment could also be analyzed by using distance, likelihood, and Bayesian methods. The parsimony principle is derived from the principle of philosophy called Occam’s Razor: plurality should not be posited without necessity (Pluralitas non est poneneda sine necessitate, William of Occam, medieval English philosopher [ca. 1285-1349]). Thus the “simplest” hypothesis is the one that is chosen under the MAXIMUM PARSIMONY criterion. Let’s take a nucleotide dataset as an example. In this case an individual tree is a hypothesis, and the “best tree” for the dataset is the one that requires the fewest number of nucleotide substitutions to explain those data. One first computes the minimum number of evolutionary changes required to fit a given dataset to a tree. This number, often called the “number of STEPS”, is recorded for all candidate trees. The tree that requires the minimum number of steps is selected as the best estimate of the phylogenetic tree, and is called the MAXIMUM PARSIMONY TREE. When there are one or more trees with the same minimum number of steps, such trees are called EQUALLY PARSIMONIOUS TREES. The length of a tree in steps is called the “TREE LENGTH”. The appeal of maximum parsimony is that the shortest tree is the one that requires the fewest number of homoplasies. Remember, homoplasies are events such as parallelisms, convergences, reversals; and as such they represent non-phylogenetic similarities. “Longer trees” require more assumptions of homoplasies and thus are more complex than the maximum parsimony tree. When the truth is not parsimonious, parsimony tree length underestimates the true evolutionary distances. Example of the maximum parsimony principle in phylogenetics: SITE 1 2 3 4 5 6 7 8 9 0 1 2 Lengths of three possible trees: SPECIES 1 A T G T T G T G A T A A SPECIES 2 A T G T T c T G G T A A TREE 1: 5 steps SPECIES 3 A T G T T A T C A T A A TREE 2: 6 steps SPECIES 4 A T G T T A T C G T A A TREE 3: 6 steps SITE 6 SITE 8 SITE 9 1 A 1 G A 3 1 G C 3 A 3 G A[G] A[G] G A C TREE 1 2 G 2 G 2 C A 4 C 4 G 4 1 A 1 G C 2 1 G G 2 G 2 A G TREE 2 A A C C 3 A 3 C 3 A A 4 C 4 G 4 1 G C 2 1 G G 2 1 A G 2 TREE 3 A A C C G[A] G[A] 4 A A 3 4 C 4 G C 3 A 3 A problem arises when the underlying mechanism of molecular evolution is sufficiently complex that the number of homoplasies exceeds the true phylogenetic signal in the data. When this happens, methods which choose simple solutions are sometimes “fooled” by the data. What happens is that the simplest way to fit a tree to such data is to consider the homoplasies as the true signal and the true signal as the homopalsies. When this happens we say that maximum parsimony is INCONSISTENT under such a mode of molecular evolution. DISTANCE MATRIX METHODS: If one looks at a phylogeny with branch lengths scaled to some evolutionary distance such as the mean number of changes per site in a gene, it is easy to see that there is a relationship between evolutionary distance and a measure of pair-wise similarity between the lineages. For example, a pair of sister taxa on a tree will have a shorter distance between each other than either will have with any other lineage on such a tree. Distance methods seek to utilize this form of information to reconstruct phylogenies. All distance methods start by converting the original data, say a set of gene sequences, into a matrix of pairwise distance values between all pairs of lineages in the sample. Next a tree is inferred either (i) by some type of sequential joining method, or (ii) by evaluating a set of candidate trees and applying a type of OPTIMALITY CRITERION to select the best tree. Note that maximum parsimony methods described above is one example of an optimality criterion that may be used on discrete character data. An optimality criterion for distance data with a similar justification as parsimony is MINIMUM EVOLUTION. Under the minimum evolution criterion, the tree with the smallest sum of branch lengths is chosen as the best estimate. As with character-based datasets, there are a variety of optimality criteria that one can use with distance data. Example of distance based approach to molecular phylogenetics: Obtain set of homologous gene sequences and produce an alignment. Transform primary data into a matrix of pairwise genetic distance values. Select a method of inferring a phylogenetic tree from distance data; in this case it is the least squares method. human In this case, determine the S statistic for the set of chimp candidate trees, and select a tree that minimizes S. gorilla Note that S is a function of both the tree topology and orang its branch lengths Distance methods have a number of attractive qualities for phylogenetics. First and foremost, the distance calculations between all pairs of sequences are based on an explicit model of molecular evolution. If the most important features of the process of evolution are contained in the model then inconsistency problems such as long-branch attraction are reduced or eliminated. For those who are interested, I have placed on the course website a short summary of the more popular models of nucleotide and amino acid evolution. We will return to the problem of using model-based methods to obtain “corrected” estimates of evolutionary distance later in this course. Another very useful feature of distance methods is the statistical framework that can be used to evaluate models or hypotheses that are not available under parsimony methods. A noteworthy drawback of distance methods is that the information content of the dataset is reduced in the step of transforming the primary data into a matrix of pairwise distance values. The practical effect is that the power of distance methods could be lower than character-based methods in certain circumstances. MAXIMUM LIKELIHOOD METHODS: Maximum likelihood is a standard statistical framework that can be applied to the problem of tree-reconstruction when a stochastic model of evolution is assumed.
Recommended publications
  • Investgating Determinants of Phylogeneic Accuracy
    IMPACT OF MOLECULAR EVOLUTIONARY FOOTPRINTS ON PHYLOGENETIC ACCURACY – A SIMULATION STUDY Dissertation Submitted to The College of Arts and Sciences of the UNIVERSITY OF DAYTON In Partial Fulfillment of the Requirements for The Degree Doctor of Philosophy in Biology by Bhakti Dwivedi UNIVERSITY OF DAYTON August, 2009 i APPROVED BY: _________________________ Gadagkar, R. Sudhindra Ph.D. Major Advisor _________________________ Robinson, Jayne Ph.D. Committee Member Chair Department of Biology _________________________ Nielsen, R. Mark Ph.D. Committee Member _________________________ Rowe, J. John Ph.D. Committee Member _________________________ Goldman, Dan Ph.D. Committee Member ii ABSTRACT IMPACT OF MOLECULAR EVOLUTIONARY FOOTPRINTS ON PHYLOGENETIC ACCURACY – A SIMULATION STUDY Dwivedi Bhakti University of Dayton Advisor: Dr. Sudhindra R. Gadagkar An accurately inferred phylogeny is important to the study of molecular evolution. Factors impacting the accuracy of a phylogenetic tree can be traced to several consecutive steps leading to the inference of the phylogeny. In this simulation-based study our focus is on the impact of the certain evolutionary features of the nucleotide sequences themselves in the alignment rather than any source of error during the process of sequence alignment or due to the choice of the method of phylogenetic inference. Nucleotide sequences can be characterized by summary statistics such as sequence length and base composition. When two or more such sequences need to be compared to each other (as in an alignment prior to phylogenetic analysis) additional evolutionary features come into play, such as the overall rate of nucleotide substitution, the ratio of two specific instantaneous, rates of substitution (rate at which transitions and transversions occur), and the shape parameter, of the gamma distribution (that quantifies the extent of iii heterogeneity in substitution rate among sites in an alignment).
    [Show full text]
  • Phylogeny Inference Based on Parsimony and Other Methods Using Paup*
    8 Phylogeny inference based on parsimony and other methods using Paup* THEORY David L. Swofford and Jack Sullivan 8.1 Introduction Methods for inferring evolutionary trees can be divided into two broad categories: thosethatoperateonamatrixofdiscretecharactersthatassignsoneormore attributes or character states to each taxon (i.e. sequence or gene-family member); and those that operate on a matrix of pairwise distances between taxa, with each distance representing an estimate of the amount of divergence between two taxa since they last shared a common ancestor (see Chapter 1). The most commonly employed discrete-character methods used in molecular phylogenetics are parsi- mony and maximum likelihood methods. For molecular data, the character-state matrix is typically an aligned set of DNA or protein sequences, in which the states are the nucleotides A, C, G, and T (i.e. DNA sequences) or symbols representing the 20 common amino acids (i.e. protein sequences); however, other forms of discrete data such as restriction-site presence/absence and gene-order information also may be used. Parsimony, maximum likelihood, and some distance methods are examples of a broader class of phylogenetic methods that rely on the use of optimality criteria. Methods in this class all operate by explicitly defining an objective function that returns a score for any input tree topology. This tree score thus allows any two or more trees to be ranked according to the chosen optimality criterion. Ordinarily, phylogenetic inference under criterion-based methods couples the selection of The Phylogenetic Handbook: a Practical Approach to Phylogenetic Analysis and Hypothesis Testing, Philippe Lemey, Marco Salemi, and Anne-Mieke Vandamme (eds.).
    [Show full text]
  • A Phylogenomic Analysis of Turtles ⇑ Nicholas G
    Molecular Phylogenetics and Evolution 83 (2015) 250–257 Contents lists available at ScienceDirect Molecular Phylogenetics and Evolution journal homepage: www.elsevier.com/locate/ympev A phylogenomic analysis of turtles ⇑ Nicholas G. Crawford a,b,1, James F. Parham c, ,1, Anna B. Sellas a, Brant C. Faircloth d, Travis C. Glenn e, Theodore J. Papenfuss f, James B. Henderson a, Madison H. Hansen a,g, W. Brian Simison a a Center for Comparative Genomics, California Academy of Sciences, 55 Music Concourse Drive, San Francisco, CA 94118, USA b Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA c John D. Cooper Archaeological and Paleontological Center, Department of Geological Sciences, California State University, Fullerton, CA 92834, USA d Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA e Department of Environmental Health Science, University of Georgia, Athens, GA 30602, USA f Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA g Mathematical and Computational Biology Department, Harvey Mudd College, 301 Platt Boulevard, Claremont, CA 9171, USA article info abstract Article history: Molecular analyses of turtle relationships have overturned prevailing morphological hypotheses and Received 11 July 2014 prompted the development of a new taxonomy. Here we provide the first genome-scale analysis of turtle Revised 16 October 2014 phylogeny. We sequenced 2381 ultraconserved element (UCE) loci representing a total of 1,718,154 bp of Accepted 28 October 2014 aligned sequence. Our sampling includes 32 turtle taxa representing all 14 recognized turtle families and Available online 4 November 2014 an additional six outgroups. Maximum likelihood, Bayesian, and species tree methods produce a single resolved phylogeny.
    [Show full text]
  • Phylogenetic Comparative Methods: a User's Guide for Paleontologists
    Phylogenetic Comparative Methods: A User’s Guide for Paleontologists Laura C. Soul - Department of Paleobiology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA David F. Wright - Division of Paleontology, American Museum of Natural History, Central Park West at 79th Street, New York, New York 10024, USA and Department of Paleobiology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA Abstract. Recent advances in statistical approaches called Phylogenetic Comparative Methods (PCMs) have provided paleontologists with a powerful set of analytical tools for investigating evolutionary tempo and mode in fossil lineages. However, attempts to integrate PCMs with fossil data often present workers with practical challenges or unfamiliar literature. In this paper, we present guides to the theory behind, and application of, PCMs with fossil taxa. Based on an empirical dataset of Paleozoic crinoids, we present example analyses to illustrate common applications of PCMs to fossil data, including investigating patterns of correlated trait evolution, and macroevolutionary models of morphological change. We emphasize the importance of accounting for sources of uncertainty, and discuss how to evaluate model fit and adequacy. Finally, we discuss several promising methods for modelling heterogenous evolutionary dynamics with fossil phylogenies. Integrating phylogeny-based approaches with the fossil record provides a rigorous, quantitative perspective to understanding key patterns in the history of life. 1. Introduction A fundamental prediction of biological evolution is that a species will most commonly share many characteristics with lineages from which it has recently diverged, and fewer characteristics with lineages from which it diverged further in the past. This principle, which results from descent with modification, is one of the most basic in biology (Darwin 1859).
    [Show full text]
  • Phylogenetic Definitions in the Pre-Phylocode Era; Implications for Naming Clades Under the Phylocode
    PaleoBios 27(1):1–6, April 30, 2007 © 2006 University of California Museum of Paleontology Phylogenetic definitions in the pre-PhyloCode era; implications for naming clades under the PhyloCode MiChAel P. TAylor Palaeobiology research Group, School of earth and environmental Sciences, University of Portsmouth, Portsmouth Po1 3Ql, UK; [email protected] The last twenty years of work on phylogenetic nomenclature have given rise to many names and definitions that are now considered suboptimal. in formulating permanent definitions under the PhyloCode when it is implemented, it will be necessary to evaluate the corpus of existing names and make judgements about which to establish and which to discard. This is not straightforward, because early definitions are often inexplicit and ambiguous, generally do not meet the requirements of the PhyloCode, and in some cases may not be easily recognizable as phylogenetic definitions at all. recognition of synonyms is also complicated by the use of different kinds of specifiers (species, specimens, clades, genera, suprageneric rank-based names, and vernacular names) and by definitions whose content changes under different phylogenetic hypotheses. in light of these difficulties, five principles are suggested to guide the interpreta- tion of pre-PhyloCode clade-names and to inform the process of naming clades under the PhyloCode: (1) do not recognize “accidental” definitions; (2) malformed definitions should be interpreted according to the intention of the author when and where this is obvious; (3) apomorphy-based and other definitions must be recognized as well as node-based and stem-based definitions; (4) definitions using any kind of specifier taxon should be recognized; and (5) priority of synonyms and homonyms should guide but not prescribe.
    [Show full text]
  • Phylogeny Codon Models • Last Lecture: Poor Man’S Way of Calculating Dn/Ds (Ka/Ks) • Tabulate Synonymous/Non-Synonymous Substitutions • Normalize by the Possibilities
    Phylogeny Codon models • Last lecture: poor man’s way of calculating dN/dS (Ka/Ks) • Tabulate synonymous/non-synonymous substitutions • Normalize by the possibilities • Transform to genetic distance KJC or Kk2p • In reality we use codon model • Amino acid substitution rates meet nucleotide models • Codon(nucleotide triplet) Codon model parameterization Stop codons are not allowed, reducing the matrix from 64x64 to 61x61 The entire codon matrix can be parameterized using: κ kappa, the transition/transversionratio ω omega, the dN/dS ratio – optimizing this parameter gives the an estimate of selection force πj the equilibrium codon frequency of codon j (Goldman and Yang. MBE 1994) Empirical codon substitution matrix Observations: Instantaneous rates of double nucleotide changes seem to be non-zero There should be a mechanism for mutating 2 adjacent nucleotides at once! (Kosiol and Goldman) • • Phylogeny • • Last lecture: Inferring distance from Phylogenetic trees given an alignment How to infer trees and distance distance How do we infer trees given an alignment • • Branch length Topology d 6-p E 6'B o F P Edo 3 vvi"oH!.- !fi*+nYolF r66HiH- .) Od-:oXP m a^--'*A ]9; E F: i ts X o Q I E itl Fl xo_-+,<Po r! UoaQrj*l.AP-^PA NJ o - +p-5 H .lXei:i'tH 'i,x+<ox;+x"'o 4 + = '" I = 9o FF^' ^X i! .poxHo dF*x€;. lqEgrE x< f <QrDGYa u5l =.ID * c 3 < 6+6_ y+ltl+5<->-^Hry ni F.O+O* E 3E E-f e= FaFO;o E rH y hl o < H ! E Y P /-)^\-B 91 X-6p-a' 6J.
    [Show full text]
  • Diversity-Dependent Cladogenesis Throughout Western Mexico: Evolutionary Biogeography of Rattlesnakes (Viperidae: Crotalinae: Crotalus and Sistrurus)
    City University of New York (CUNY) CUNY Academic Works Publications and Research New York City College of Technology 2016 Diversity-dependent cladogenesis throughout western Mexico: Evolutionary biogeography of rattlesnakes (Viperidae: Crotalinae: Crotalus and Sistrurus) Christopher Blair CUNY New York City College of Technology Santiago Sánchez-Ramírez University of Toronto How does access to this work benefit ou?y Let us know! More information about this work at: https://academicworks.cuny.edu/ny_pubs/344 Discover additional works at: https://academicworks.cuny.edu This work is made publicly available by the City University of New York (CUNY). Contact: [email protected] 1Blair, C., Sánchez-Ramírez, S., 2016. Diversity-dependent cladogenesis throughout 2 western Mexico: Evolutionary biogeography of rattlesnakes (Viperidae: Crotalinae: 3 Crotalus and Sistrurus ). Molecular Phylogenetics and Evolution 97, 145–154. 4 https://doi.org/10.1016/j.ympev.2015.12.020. © 2016. This manuscript version is made 5 available under the CC-BY-NC-ND 4.0 license. 6 7 8 Diversity-dependent cladogenesis throughout western Mexico: evolutionary 9 biogeography of rattlesnakes (Viperidae: Crotalinae: Crotalus and Sistrurus) 10 11 12 CHRISTOPHER BLAIR1*, SANTIAGO SÁNCHEZ-RAMÍREZ2,3,4 13 14 15 1Department of Biological Sciences, New York City College of Technology, Biology PhD 16 Program, Graduate Center, The City University of New York, 300 Jay Street, Brooklyn, 17 NY 11201, USA. 18 2Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks 19 Street, Toronto, ON, M5S 3B2, Canada. 20 3Department of Natural History, Royal Ontario Museum, 100 Queen’s Park, Toronto, 21 ON, M5S 2C6, Canada. 22 4Present address: Environmental Genomics Group, Max Planck Institute for 23 Evolutionary Biology, August-Thienemann-Str.
    [Show full text]
  • A Phylogenetic Analysis of the Basal Ornithischia (Reptilia, Dinosauria)
    A PHYLOGENETIC ANALYSIS OF THE BASAL ORNITHISCHIA (REPTILIA, DINOSAURIA) Marc Richard Spencer A Thesis Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements of the degree of MASTER OF SCIENCE December 2007 Committee: Margaret M. Yacobucci, Advisor Don C. Steinker Daniel M. Pavuk © 2007 Marc Richard Spencer All Rights Reserved iii ABSTRACT Margaret M. Yacobucci, Advisor The placement of Lesothosaurus diagnosticus and the Heterodontosauridae within the Ornithischia has been problematic. Historically, Lesothosaurus has been regarded as a basal ornithischian dinosaur, the sister taxon to the Genasauria. Recent phylogenetic analyses, however, have placed Lesothosaurus as a more derived ornithischian within the Genasauria. The Fabrosauridae, of which Lesothosaurus was considered a member, has never been phylogenetically corroborated and has been considered a paraphyletic assemblage. Prior to recent phylogenetic analyses, the problematic Heterodontosauridae was placed within the Ornithopoda as the sister taxon to the Euornithopoda. The heterodontosaurids have also been considered as the basal member of the Cerapoda (Ornithopoda + Marginocephalia), the sister taxon to the Marginocephalia, and as the sister taxon to the Genasauria. To reevaluate the placement of these taxa, along with other basal ornithischians and more derived subclades, a phylogenetic analysis of 19 taxonomic units, including two outgroup taxa, was performed. Analysis of 97 characters and their associated character states culled, modified, and/or rescored from published literature based on published descriptions, produced four most parsimonious trees. Consistency and retention indices were calculated and a bootstrap analysis was performed to determine the relative support for the resultant phylogeny. The Ornithischia was recovered with Pisanosaurus as its basalmost member.
    [Show full text]
  • Is Ellipura Monophyletic? a Combined Analysis of Basal Hexapod
    ARTICLE IN PRESS Organisms, Diversity & Evolution 4 (2004) 319–340 www.elsevier.de/ode Is Ellipura monophyletic? A combined analysis of basal hexapod relationships with emphasis on the origin of insects Gonzalo Giribeta,Ã, Gregory D.Edgecombe b, James M.Carpenter c, Cyrille A.D’Haese d, Ward C.Wheeler c aDepartment of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA bAustralian Museum, 6 College Street, Sydney, New South Wales 2010, Australia cDivision of Invertebrate Zoology, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA dFRE 2695 CNRS, De´partement Syste´matique et Evolution, Muse´um National d’Histoire Naturelle, 45 rue Buffon, F-75005 Paris, France Received 27 February 2004; accepted 18 May 2004 Abstract Hexapoda includes 33 commonly recognized orders, most of them insects.Ongoing controversy concerns the grouping of Protura and Collembola as a taxon Ellipura, the monophyly of Diplura, a single or multiple origins of entognathy, and the monophyly or paraphyly of the silverfish (Lepidotrichidae and Zygentoma s.s.) with respect to other dicondylous insects.Here we analyze relationships among basal hexapod orders via a cladistic analysis of sequence data for five molecular markers and 189 morphological characters in a simultaneous analysis framework using myriapod and crustacean outgroups.Using a sensitivity analysis approach and testing for stability, the most congruent parameters resolve Tricholepidion as sister group to the remaining Dicondylia, whereas most suboptimal parameter sets group Tricholepidion with Zygentoma.Stable hypotheses include the monophyly of Diplura, and a sister group relationship between Diplura and Protura, contradicting the Ellipura hypothesis.Hexapod monophyly is contradicted by an alliance between Collembola, Crustacea and Ectognatha (i.e., exclusive of Diplura and Protura) in molecular and combined analyses.
    [Show full text]
  • EVOLUTIONARY INFERENCE: Some Basics of Phylogenetic Analyses
    EVOLUTIONARY INFERENCE: Some basics of phylogenetic analyses. Ana Rojas Mendoza CNIO-Madrid-Spain. Alfonso Valencia’s lab. Aims of this talk: • 1.To introduce relevant concepts of evolution to practice phylogenetic inference from molecular data. • 2.To introduce some of the most useful methods and computer programmes to practice phylogenetic inference. • • 3.To show some examples I’ve worked in. SOME BASICS 11--ConceptsConcepts ofof MolecularMolecular EvolutionEvolution • Homology vs Analogy. • Homology vs similarity. • Ortologous vs Paralogous genes. • Species tree vs genes tree. • Molecular clock. • Allele mutation vs allele substitution. • Rates of allele substitution. • Neutral theory of evolution. SOME BASICS Owen’s definition of homology Richard Owen, 1843 • Homologue: the same organ under every variety of form and function (true or essential correspondence). •Analogy: superficial or misleading similarity. SOME BASICS 1.Concepts1.Concepts ofof MolecularMolecular EvolutionEvolution • Homology vs Analogy. • Homology vs similarity. • Ortologous vs Paralogous genes. • Species tree vs genes tree. • Molecular clock. • Allele mutation vs allele substitution. • Rates of allele substitution. • Neutral theory of evolution. SOME BASICS Similarity ≠ Homology • Similarity: mathematical concept . Homology: biological concept Common Ancestry!!! SOME BASICS 1.Concepts1.Concepts ofof MolecularMolecular EvolutionEvolution • Homology vs Analogy. • Homology vs similarity. • Ortologous vs Paralogous genes. • Species tree vs genes tree. • Molecular clock.
    [Show full text]
  • Family Classification
    1.0 GENERAL INTRODUCTION 1.1 Henckelia sect. Loxocarpus Loxocarpus R.Br., a taxon characterised by flowers with two stamens and plagiocarpic (held at an angle of 90–135° with pedicel) capsular fruit that splits dorsally has been treated as a section within Henckelia Spreng. (Weber & Burtt, 1998 [1997]). Loxocarpus as a genus was established based on L. incanus (Brown, 1839). It is principally recognised by its conical, short capsule with a broader base often with a hump-like swelling at the upper side (Banka & Kiew, 2009). It was reduced to sectional level within the genus Didymocarpus (Bentham, 1876; Clarke, 1883; Ridley, 1896) but again raised to generic level several times by different authors (Ridley, 1905; Burtt, 1958). In 1998, Weber & Burtt (1998 ['1997']) re-modelled Didymocarpus. Didymocarpus s.s. was redefined to a natural group, while most of the rest Malesian Didymocarpus s.l. and a few others morphologically close genera including Loxocarpus were transferred to Henckelia within which it was recognised as a section within. See Section 4.1 for its full taxonomic history. Molecular data now suggests that Henckelia sect. Loxocarpus is nested within ‗Twisted-fruited Asian and Malesian genera‘ group and distinct from other didymocarpoid genera (Möller et al. 2009; 2011). 1.2 State of knowledge and problem statements Henckelia sect. Loxocarpus includes 10 species in Peninsular Malaysia (with one species extending into Peninsular Thailand), 12 in Borneo, two in Sumatra and one in Lingga (Banka & Kiew, 2009). The genus Loxocarpus has never been monographed. Peninsular Malaysian taxa are well studied (Ridley, 1923; Banka, 1996; Banka & Kiew, 2009) but the Bornean and Sumatran taxa are poorly known.
    [Show full text]
  • The Probability of Monophyly of a Sample of Gene Lineages on a Species Tree
    PAPER The probability of monophyly of a sample of gene COLLOQUIUM lineages on a species tree Rohan S. Mehtaa,1, David Bryantb, and Noah A. Rosenberga aDepartment of Biology, Stanford University, Stanford, CA 94305; and bDepartment of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand Edited by John C. Avise, University of California, Irvine, CA, and approved April 18, 2016 (received for review February 5, 2016) Monophyletic groups—groups that consist of all of the descendants loci that are reciprocally monophyletic is informative about the of a most recent common ancestor—arise naturally as a conse- time since species divergence and can assist in representing the quence of descent processes that result in meaningful distinctions level of differentiation between groups (4, 18). between organisms. Aspects of monophyly are therefore central to Many empirical investigations of genealogical phenomena have fields that examine and use genealogical descent. In particular, stud- made use of conceptual and statistical properties of monophyly ies in conservation genetics, phylogeography, population genetics, (19). Comparisons of observed monophyly levels to model pre- species delimitation, and systematics can all make use of mathemat- dictions have been used to provide information about species di- ical predictions under evolutionary models about features of mono- vergence times (20, 21). Model-based monophyly computations phyly. One important calculation, the probability that a set of gene have been used alongside DNA sequence differences between and lineages is monophyletic under a two-species neutral coalescent within proposed clades to argue for the existence of the clades model, has been used in many studies. Here, we extend this calcu- (22), and tests involving reciprocal monophyly have been used to lation for a species tree model that contains arbitrarily many species.
    [Show full text]