Splits and Phylogenetic Networks

Total Page:16

File Type:pdf, Size:1020Kb

Splits and Phylogenetic Networks SplitsSplits andand PhylogeneticPhylogenetic NetworksNetworks DanielDaniel H.H. HusonHuson Paris, June 21, 2005 1 ContentsContents 1.1. PhylogeneticPhylogenetic treestrees 2.2. SplitsSplits networksnetworks 3.3. ConsensusConsensus networksnetworks 4.4. HybridizationHybridization andand reticulatereticulate networksnetworks 5.5. RecombinationRecombination networksnetworks 2 PhylogeneticPhylogenetic NetworksNetworks Bandelt (1991): Network displaying evolutionary relationships Other types of Splits Phylogenetic Reticulate phylogenetic networks trees networks networks Median Consensus Hybridization Recombination Augmented networks (super) networks networks trees networks Special case: “Galled trees” Any graph Ancestor Split decomposition, representing Split decomposition, recombination evolutionary Neighbor-net graphs data 3 PhylogeneticPhylogenetic NetworksNetworks 22 11 Other types of Splits Phylogenetic Reticulate phylogenetic networks trees networks networks 33 44 55 Median Consensus Hybridization Recombination Augmented networks (super) networks networks trees networks from sequences Special case: “Galled trees” from trees Any graph Ancestor Split decomposition, representing Split decomposition, recombination evolutionary Neighbor-net graphs data from distances 4 PhylogeneticPhylogenetic NetworksNetworksDan Gusfield: “Phylogenetic network” 22 11 “Generalized Other types of Splits Phylogenetic Reticulate phylogeneticphylogenetic networks trees networks network”networks 33 44 \ 55 Median Consensus Hybridization Recombination“More generalizedAugmented networks (super) networks networks trees networks Phylogenetic network” from sequences Special case: “Galled trees” from trees Any graph Ancestor Split decomposition, representing Split decomposition, recombination evolutionary Neighbor-net graphs data from distances 5 PartPart II 1.1. PhylogeneticPhylogenetic treestrees 2.2. SplitsSplits networksnetworks 3.3. ConsensusConsensus networksnetworks 4.4. HybridizationHybridization andand reticulatereticulate networksnetworks 5.5. RecombinationRecombination networksnetworks 6 PhylogeneticPhylogenetic TreesTrees Ernst Haeckel, Tree of Life 1866 7 PhylogeneticPhylogenetic TreesTrees LetLet XX == {x{x1,...,x,...,xn}} denotedenote aa setset ofof taxa.taxa. AA phylogeneticphylogenetic treetree TT (or(or XX--treetree)) isis givengiven byby labelinglabeling thethe leavesleaves ofof aa treetree byby thethe setset X:X: Cow Fin Whale Blue Whale Habor Seal Rat Mouse Chimp Human Gorilla Taxa + tree ⇒ phylogenetic tree 8 UnrootedUnrooted vsvs RootedRooted TreesTrees Unrooted tree Rooted tree, rooted using Chicken as outgroup mathematically and algorithmically biologically relevant, defines easier to deal with clades of related taxa 9 AA SimpleSimple ModelModel ofof EvolutionEvolution TAA G C CG T ACT C CG A T AC GC C time time AC C C A G C T Evolutionary tree AC G C C T •Mutations along branches •Speciation events at nodes A C Sequence of common ancestor 12 TreeTree ReconstructionReconstruction ProblemProblem TAA G C CG T AC T C CG A T AC GC C Tree? Evolutionary tree 13 TreeTree ofof LifeLife BasedBased onon 16S16S rRNArRNA (Doolittle, 2000) 14 PartPart IIII 1.1. PhylogeneticPhylogenetic treestrees 2.2. SplitsSplits networksnetworks 3.3. ConsensusConsensus networksnetworks 4.4. HybridizationHybridization andand reticulatereticulate networksnetworks 5.5. RecombinationRecombination networksnetworks 15 SplitsSplits NetworksNetworks RepresentsRepresents incompatibleincompatible signalssignals inin data,data, from:from: SequencesSequences,, e.g.:e.g.: Median network (Bandelt et al 1994) Spectral analysis (Hendy and Penny 1993) DistancesDistances,, e.g.:e.g.: E.g. Split decomposition (Bandelt and Dress 1992) Neighbor-Net (Bryant and Moulton 2002) TreesTrees,, e.g.:e.g.: Consensus network (Holland and Moulton 2003) Super network (H., Dezulian, Kloepper and Steel 2004) Bootstrap network (H., implemented in SplitsTree4) 16 SequencesSequences toto SplitsSplits NetworkNetwork If characters have only 2 states and not too conflicting: interpret columns as splits and draw full splits network 17 DistancesDistances toto SplitsSplits NetworkNetwork Split decomposition or Neighbor-Net produces network from distances 18 TreesTrees toto SplitsSplits NetworkNetwork A collection of trees can be represented by a consensus network or super network 19 BootstrapBootstrap NetworkNetwork Draw all splits that have positive bootstrap score 20 SplitSplit DecompositionDecomposition && BootstrapBootstrap NetworkNetwork CompareCompare thethe resultresult ofof SplitSplit DecompositionDecomposition withwith anan NJNJ treetree andand bootstrapbootstrap network:network: A.cerana A.mellifer A.cerana A.mellifer A.cerana A.mellifer A.dorsata A.dorsata A.dorsata A.andrenof A.andrenof A.florea A.andrenof A.koschev A.florea A.koschev A.florea A.koschev Bio-NJ tree Bootstrap network Splits network obtained via the Split Decomposition Bill Martin: Splits networks show which signals tree reconstruction methods are fighting over… 21 RootedRooted SplitsSplits NetworksNetworks Splits network can be “rooted” e.g. using an outgroup (Gambette and H, manuscript) 22 BetterBetter layoutlayout ofof splitssplits networksnetworks Philippe Gambette, currently visiting Tuebingen from ENS Cachan 23 PartPart IIIIII 1.1. PhylogeneticPhylogenetic treestrees 2.2. SplitsSplits networksnetworks 3.3. ConsensusConsensus networksnetworks 4.4. HybridizationHybridization andand reticulatereticulate networksnetworks 5.5. RecombinationRecombination networksnetworks 24 GeneGene TreesTrees CanCan DifferDiffer AlsoAlso allowallow genegene duplicationduplication andand lossloss:: x1 x2 x3 A’ A” B’ x x x A’ A’’ B’ x1 x2 x3 ≠ A B Gene duplication Gene Tree Species Tree G 25 GeneGene TreesTrees vsvs SpeciesSpecies TreesTrees Differing gene trees give rise to “mosaic sequences” Gene A Gene B Gene C Gene D 26 ConsensusConsensus ofof DifferentDifferent GeneGene TreesTrees ForFor aa givengiven setset ofof species,species, differentdifferent genesgenes leadlead toto differentdifferent treestrees HowHow toto formform aa consensusconsensus ofof thethe trees?trees? ConsensusConsensus treestrees ConsensusConsensus networksnetworks ConsensusConsensus supersuper networksnetworks 27 TheThe SplitsSplits ofof aa TreeTree EveryEvery edgeedge ofof aa treetree definesdefines aa splitsplit ofof thethe taxontaxon setset X:X: x6 x4 x1 x 8 e x5 x3 x7 x2 xx ,x,x ,x,x ,x,x ,x,x vsvs xx ,x,x ,x,x 1 3 4 6 7 2 5 8 28 TheThe SplitSplit EncodingEncoding ofof aa TreeTree dd aa Tree T: cc bb ee Split encoding Σ(T): 5 trivial splits: 2 non-trivial splits: 29 CompatibilityCompatibility TwoTwo splitssplits AA1|B|B1 andand AA2|B|B2 ofof XX areare compatiblecompatible,, ifif ∅∈∅∈{A{A1∩∩AA2,,AA1∩∩BB2,B,B1∩∩AA2,B,B1∩∩ AA2}} Two compatible splits: A1 B1 x x4 A2 B2 x3 x8 x7 8 x2 x9 x5 x 1 x 6 X 30 CompatibilityCompatibility TwoTwo splitssplits AA1|B|B1 andand AA2|B|B2 ofof XX areare compatiblecompatible,, ifif ∅∈∅∈{A{A1∩∩AA2,A,A1∩∩BB2,B,B1∩∩AA2,B,B1∩∩ AA2}} Two incompatible splits: x x4 A1 B1 x5 A2 x6 B2 x2 x3 X x1 X 1 x7 31 ConsensusConsensus ofof TreesTrees Six gene trees: Σ(1/2): majority consensus: Σ(1/6): Σ(0): splits contained in more splits contained in splits contained than 50% of trees more than one tree in at least one tree 34 ExampleExample ofof AA SuperSuper NetworkNetwork (Plants)(Plants) Partial trees for five plant genes Super network 35 Joint work with Kim McBreen and Pete Lockhart ZZ--ClosureClosure MethodMethod Idea:Idea: ExtendExtend partialpartial splits.splits. ∩ A A ∪ A ZZ--rule:rule: A1 A2 1 1 2 , B1 B2 B1∪ B2 B2 B RepeatedlyRepeatedly applyapply 1toto completion.completion. A2 ReturnReturn allall fullfull splits.splits. A1 B2 [Huson, Dezulian, Kloepper and Steel, 2004] 36 ExampleExample FiveFive fungalfungal treestrees fromfrom [[Pryor,Pryor, 2000]2000] andand [Pryor,[Pryor, 2003]2003] Trees:Trees: ITSITS (two(two trees)trees) SSUSSU (two(two trees)trees) GpdGpd (one(one tree)tree) NumbersNumbers ofof taxataxa differ:differ: “partial“partial trees”trees” 37 ExampleExample 4beta25: User manual now available from www.splitstree.org! 38 IndividualIndividual GeneGene TreesTrees ITS00 46 taxa 39 IndividualIndividual GeneGene TreesTrees ITS03 40 taxa 40 IndividualIndividual GeneGene TreesTrees SSU00 29 taxa 41 IndividualIndividual GeneGene TreesTrees SSU03 40 taxa 42 IndividualIndividual GeneGene TreesTrees Gpd03 40 taxa 43 GeneGene TreesTrees asas SuperSuper NetworkNetwork Z-closure: a fast super-network method 44 GeneGene TreesTrees asas SuperSuper NetworkNetwork ITS00+ ITS03 45 GeneGene TreesTrees asas SuperSuper NetworkNetwork ITS03+ SSU00 46 GeneGene TreesTrees asas SuperSuper NetworkNetwork ITS00+ ITS00+ SSU03 47 GeneGene TreesTrees asas SuperSuper NetworkNetwork ITS00+ ITS03+ SSU03+ Gpd03 48 GeneGene TreesTrees asas SuperSuper NetworkNetwork ITS00+ ITS03+ SSU00+ SSU03+ Gpd03 49 PartPart IVIV 1.1. PhylogeneticPhylogenetic treestrees 2.2. SplitsSplits networksnetworks 3.3. ConsensusConsensus networksnetworks 4.4. HybridizationHybridization andand reticulatereticulate networksnetworks 5.5. RecombinationRecombination networksnetworks 50 HybridizationHybridization OccursOccurs whenwhen twotwo organismsorganisms fromfrom differentdifferent speciesspecies interbreedinterbreed andand combinecombine theirtheir genomesgenomes Copyright © 2003 University of Illinois Copyright © 2003 University of Illinois Copyright © 2003 University of Illinois Water hemp Hybrid Pigs weed 51 SpeciationSpeciation byby HybridizationHybridization
Recommended publications
  • Reticulate Evolution and Phylogenetic Networks 6
    CHAPTER Reticulate Evolution and Phylogenetic Networks 6 So far, we have modeled the evolutionary process as a bifurcating treelike process. The main feature of trees is that the flow of information in one lineage is independent of that in another lineage. In real life, however, many process- es—including recombination, hybridization, polyploidy, introgression, genome fusion, endosymbiosis, and horizontal gene transfer—do not abide by lineage independence and cannot be modeled by trees. Reticulate evolution refers to the origination of lineages through the com- plete or partial merging of two or more ancestor lineages. The nonvertical con- nections between lineages are referred to as reticulations. In this chapter, we introduce the language of networks and explain how networks can be used to study non-treelike phylogenetic processes. A substantial part of this chapter will be dedicated to the evolutionary relationships among prokaryotes and the very intriguing question of the origin of the eukaryotic cell. Networks Networks were briefly introduced in Chapter 4 in the context of protein-protein interactions. Here, we provide a more formal description of networks and their use in phylogenetics. As mentioned in Chapter 5, trees are connected graphs in which any two nodes are connected by a single path. In phylogenetics, a network is a connected graph in which at least two nodes are connected by two or more pathways. In other words, a network is a connected graph that is not a tree. All networks contain at least one cycle, i.e., a path of branches that can be followed in one direction from any starting point back to the starting point.
    [Show full text]
  • The Probability of Monophyly of a Sample of Gene Lineages on a Species Tree
    PAPER The probability of monophyly of a sample of gene COLLOQUIUM lineages on a species tree Rohan S. Mehtaa,1, David Bryantb, and Noah A. Rosenberga aDepartment of Biology, Stanford University, Stanford, CA 94305; and bDepartment of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand Edited by John C. Avise, University of California, Irvine, CA, and approved April 18, 2016 (received for review February 5, 2016) Monophyletic groups—groups that consist of all of the descendants loci that are reciprocally monophyletic is informative about the of a most recent common ancestor—arise naturally as a conse- time since species divergence and can assist in representing the quence of descent processes that result in meaningful distinctions level of differentiation between groups (4, 18). between organisms. Aspects of monophyly are therefore central to Many empirical investigations of genealogical phenomena have fields that examine and use genealogical descent. In particular, stud- made use of conceptual and statistical properties of monophyly ies in conservation genetics, phylogeography, population genetics, (19). Comparisons of observed monophyly levels to model pre- species delimitation, and systematics can all make use of mathemat- dictions have been used to provide information about species di- ical predictions under evolutionary models about features of mono- vergence times (20, 21). Model-based monophyly computations phyly. One important calculation, the probability that a set of gene have been used alongside DNA sequence differences between and lineages is monophyletic under a two-species neutral coalescent within proposed clades to argue for the existence of the clades model, has been used in many studies. Here, we extend this calcu- (22), and tests involving reciprocal monophyly have been used to lation for a species tree model that contains arbitrarily many species.
    [Show full text]
  • Stochastic Models in Phylogenetic Comparative Methods: Analytical Properties and Parameter Estimation
    Thesis for the Degree of Doctor of Philosophy Stochastic models in phylogenetic comparative methods: analytical properties and parameter estimation Krzysztof Bartoszek Division of Mathematical Statistics Department of Mathematical Sciences Chalmers University of Technology and University of Gothenburg G¨oteborg, Sweden 2013 Stochastic models in phylogenetic comparative methods: analytical properties and parameter estimation Krzysztof Bartoszek Copyright c Krzysztof Bartoszek, 2013. ISBN 978–91–628–8740–7 Department of Mathematical Sciences Division of Mathematical Statistics Chalmers University of Technology and University of Gothenburg SE-412 96 GOTEBORG,¨ Sweden Phone: +46 (0)31-772 10 00 Author e-mail: [email protected] Typeset with LATEX. Department of Mathematical Sciences Printed in G¨oteborg, Sweden 2013 Stochastic models in phylogenetic comparative methods: analytical properties and parameter estimation Krzysztof Bartoszek Abstract Phylogenetic comparative methods are well established tools for using inter–species variation to analyse phenotypic evolution and adaptation. They are generally hampered, however, by predominantly univariate ap- proaches and failure to include uncertainty and measurement error in the phylogeny as well as the measured traits. This thesis addresses all these three issues. First, by investigating the effects of correlated measurement errors on a phylogenetic regression. Second, by developing a multivariate Ornstein– Uhlenbeck model combined with a maximum–likelihood estimation pack- age in R. This model allows, uniquely, a direct way of testing adaptive coevolution. Third, accounting for the often substantial phylogenetic uncertainty in comparative studies requires an explicit model for the tree. Based on recently developed conditioned branching processes, with Brownian and Ornstein–Uhlenbeck evolution on top, expected species similarities are derived, together with phylogenetic confidence intervals for the optimal trait value.
    [Show full text]
  • Application of Phylogenetic Networks in Evolutionary Studies
    Application of Phylogenetic Networks in Evolutionary Studies Daniel H. Huson* and David Bryant *Center for Bioinformatics (ZBIT), Tu¨bingen University, Tu¨bingen, Germany; and Department of Mathematics, Auckland University, Auckland, New Zealand The evolutionary history of a set of taxa is usually represented by a phylogenetic tree, and this model has greatly facilitated the discussion and testing of hypotheses. However, it is well known that more complex evolutionary scenarios are poorly described by such models. Further, even when evolution proceeds in a tree-like manner, analysis of the data may not be best served by using methods that enforce a tree structure but rather by a richer visualization of the data to evaluate its properties, at least as an essential first step. Thus, phylogenetic networks should be employed when reticulate events such as hybridization, horizontal gene transfer, recombination, or gene duplication and loss are believed to be involved, and, even in the absence of such events, phylogenetic networks have a useful role to play. This article reviews the terminology used for phylogenetic networks and covers both split networks and reticulate networks, how they are defined, and how they can be interpreted. Additionally, the article outlines the beginnings of a comprehensive statistical framework for applying split network methods. We show how split networks can represent confidence sets of trees and introduce a conservative statistical test for whether the conflicting signal in a network is treelike. Finally, this article describes a new program, SplitsTree4, an interactive and comprehensive tool for inferring different types of phylogenetic networks from sequences, distances, and trees. Introduction The familiar evolutionary model assumes a tree, a Finally, we present our new program SplitsTree4, model that has greatly facilitated the discussion and testing which provides a comprehensive and interactive frame- of hypotheses.
    [Show full text]
  • Species Delimitation and DNA Barcoding of Atlantic Ensis (Bivalvia, Pharidae)
    Species delimitation and DNA barcoding of Atlantic Ensis (Bivalvia, Pharidae) Joaquín Viernaa,b, Joël Cuperusc, Andrés Martínez-Lagea, Jeroen M. Jansenc, Alejandra Perinaa,b, Hilde Van Peltc, Ana M. González-Tizóna a Evolutionary Biology Group (GIBE), Department of Molecular and Cell Biology,Universidade da Coruña, A Fraga 10, A Coruña, E-15008, Spain b AllGenetics, Edificio de Servicios Centrales de Investigación, Campus de Elviña s/n, A Coruña, E-15008, Spain c IMARES Wageningen UR, Ambachtsweg 8a, Den Helder, NL-1785 AJ, The Netherlands Zoologica Scripta Volume 43, Issue 2, pages 161–171, March 2014 Issue online: 17 February 2014, Version of record online: 3 September 2013, Manuscript Accepted: 11 August 2013, Manuscript Received: 17 May 2013. This is the peer reviewed version of the following article: Vierna, J., Cuperus, J., Martínez-Lage, A., Jansen, J.M., Perina, A., Van Pelt, H. & González-Tizón, A.M. (2013). Species delimitation and DNA barcoding of Atlantic Ensis (Bivalvia, Pharidae). Zoologica Scripta, 43, 161–171. which has been published in final form at https://doi.org/10.1111/zsc.12038. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving. Abstract Ensis Schumacher, 1817 razor shells occur at both sides of the Atlantic and along the Pacific coasts of tropical west America, Peru, and Chile. Many of them are marketed in various regions. However, the absence of clear autapomorphies in the shell and the sympatric distributions of some species often prevent a correct identification of specimens. As a consequence, populations cannot be properly managed, and edible species are almost always mislabelled along the production chain.
    [Show full text]
  • Computational Methods for Phylogenetic Analysis
    Computational Methods for Phylogenetic Analysis Student: Mohd Abdul Hai Zahid Supervisors: Dr. R. C. Joshi and Dr. Ankush Mittal Phylogenetics is the study of relationship among species or genes with the combination of molecular biology and mathematics. Most of the present phy- logenetic analysis softwares and algorithms have limitations of low accuracy, restricting assumptions on size of the dataset, high time complexity, complex results which are difficult to interpret and several others which inhibits their widespread use by the researchers. In this work, we address several problems of phylogenetic analysis and propose better methods addressing prominent issues. It is well known that the network representation of the evolutionary rela- tionship provides a better understanding of the evolutionary process and the non-tree like events such as horizontal gene transfer, hybridization, recom- bination and homoplasy. A pattern recognition based sequence alignment algorithm is proposed which not only employs the similarity of SNP sites, as is generally done, but also the dissimilarity for the classification of the nodes into mutation and recombination nodes. Unlike the existing algo- rithms [1, 2, 3, 4, 5, 6], the proposed algorithm [7] conducts a row-based search to detect the recombination nodes. The existing algorithms search the columns for the detection of recombination. The number of columns 1 in a sequence may be far greater than the rows, which results in increased complexity of the previous algorithms. Most of the individual researchers and research teams are concentrating on the evolutionary pathways of specific phylogenetic groups. Many effi- cient phylogenetic reconstruction methods, such as Maximum Parsimony [8] and Maximum Likelihood [9], are available.
    [Show full text]
  • Towards the Development of Computational Tools for Evaluating Phylogenetic Network Reconstruction Methods
    Towards the Development of Computational Tools for Evaluating Phylogenetic Network Reconstruction Methods L. Nakhleh, J. Sun, T. Warnow, C.R. Linder, B.M.E. Moret, A. Tholse Pacific Symposium on Biocomputing 8:315-326(2003) TOWARDS THE DEVELOPMENT OF COMPUTATIONAL TOOLS FOR EVALUATING PHYLOGENETIC NETWORK RECONSTRUCTION METHODS LUAY NAKHLEH, JERRY SUN, TANDY WARNOW Dept. of Computer Sciences, U. of Texas, Austin, TX 78712 C. RANDAL LINDER Section of Integrative Biology, U. of Texas, Austin, TX 78712 BERNARD M.E. MORET, ANNA THOLSE Dept. of Computer Science, U. of New Mexico, Albuquerque, NM 87131 We report on a suite of algorithms and techniques that together provide a simulation flow for study- ing the topological accuracy of methods for reconstructing phylogenetic networks. We imple- mented those algorithms and techniques and used three phylogenetic reconstruction methods for a case study of our tools. We present the results of our experimental studies in analyzing the relative performance of these methods. Our results indicate that our simulator and our proposed measure of accuracy, the latter an extension of the widely used Robinson-Foulds measure, offer a robust platform for the evaluation of network reconstruction algorithms. 1 Introduction Phylogenies, i.e., the evolutionary histories of groups of organisms, play a major role in representing the interrelationships among biological entities. Many methods for reconstructing such phylogenies have been proposed, but almost all of them assume that the underlying evolutionary history of a given set of species can be represented by a tree. While this model gives a satisfactory first-order approximation for many families of organisms, other families exhibit evolutionary mechanisms that cannot be represented by a tree.
    [Show full text]
  • A Note on Encodings of Phylogenetic Networks of Bounded Level
    A NOTE ON ENCODINGS OF PHYLOGENETIC NETWORKS OF BOUNDED LEVEL PHILIPPE GAMBETTE, KATHARINA T. HUBER Abstract. Driven by the need for better models that allow one to shed light into the question how life’s diversity has evolved, phylogenetic networks have now joined phylogenetic trees in the center of phylogenetics research. Like phylogenetic trees, such networks canonically induce collections of phyloge- netic trees, clusters, and triplets, respectively. Thus it is not surprising that many network approaches aim to reconstruct a phylogenetic network from such collections. Related to the well-studied perfect phylogeny problem, the follow- ing question is of fundamental importance in this context: When does one of the above collections encode (i.e. uniquely describe) the network that induces it? In this note, we present a complete answer to this question for the special case of a level-1 (phylogenetic) network by characterizing those level-1 net- works for which an encoding in terms of one (or equivalently all) of the above collections exists. Given that this type of network forms the first layer of the rich hierarchy of level-k networks, k a non-negative integer, it is natural to wonder whether our arguments could be extended to members of that hierar- chy for higher values for k. By giving examples, we show that this is not the case. Keywords: Phylogeny, phylogenetic networks, triplets, clusters, supernet- work, level-1 network, perfect phylogeny problem. 1. Introduction An improved understanding of the complex processes that drive evolution has arXiv:0906.4324v1 [math.CO] 23 Jun 2009 lent support to the idea that reticulate evolutionary events such as lateral gene transfer or hybridization are more common than originally thought rendering a phylogenetic tree (essentially a rooted leaf labelled graph-theoretical tree) too sim- plistic a model to fully understand the complex processes that drive evolution.
    [Show full text]
  • The Anâtaxis Phylogenetic Reconstruction Algorithm
    U´ G` F´ Departement´ d’informatique Professeur Bastien Chopard Institut Suisse de Bioinformatique Dr Gabriel Bittar The Anataxisˆ phylogenetic reconstruction algorithm THESE` present´ ee´ a` la Faculte´ des sciences de l’Universite´ de Geneve` pour obtenir le grade de Docteur es` sciences, mention bioinformatique par Bernhard Pascal Sonderegger de Heiden (AR) These` No 3863 Geneve` Atelier d’impression de la Section de Physique 2007 FACULTE´ DES SCIENCES Doctorat es` Sciences mention bioinformatique These` de Monsieur Bernhard SONDEREGGER Intitulee:´ The Anataxisˆ phylogenetic reconstruction algorithm La faculte´ des sciences, sur le preavis´ de Messieurs B. CHOPARD, professeur ad- joint et directeur de these` (Departement´ d’ informatique), G. Bittar, Docteur et co- directeur de these` (Institut Suisse de Bioinformatique, Geneve,` Suisse), A. BAIROCH, professeur adjoint (Faculte´ de medecine,´ Section de medecine´ fon- dementale, Departement´ de biologie structurale et bioinformatique) et N. SALAMIN, docteur (Universite´ de Lausanne, Faculte´ de biologie et de mede- cine Departement´ d’ecologie´ et evolution,´ Lausanne, Suisse), autorise l’impression de la presente´ these,` sans exprimer d’opinion sur les propositions qui y sont enonc´ ees.´ Geneve,` le 26.06.2007 Th`ese-3863- Le Doyen, Pierre SPIERER Contents Contents i Remerciements 1 Preface 1 R´esum´een franc¸ais 5 Introduction a` la phylogen´ etique´ ....................... 5 L’algorithme Anataxisˆ ............................. 8 Calcul de dissimilitudes ............................ 11 Validation numerique´ .............................. 11 Implementation´ ................................. 12 Exemple biologique ............................... 13 Conclusion .................................... 14 1 An introduction to phylogenetics 17 1.1 Homology and homoplasy ........................ 18 1.1.1 Characters and their states .................... 18 1.1.2 Homology is a phylogenetic hypothesis ............ 20 1.1.3 Homoplasy, a pitfall in phylogenetics ............
    [Show full text]
  • Progress in Erigonine Spider Phylogeny—The Savignia-Group Is Not Monophyletic (Araneae: Linyphiidae)
    Org Divers Evol (2010) 10:297–310 DOI 10.1007/s13127-010-0023-1 ORIGINAL ARTICLE Progress in erigonine spider phylogeny—the Savignia-group is not monophyletic (Araneae: Linyphiidae) Holger Frick & Wolfgang Nentwig & Christian Kropf Received: 1 December 2009 /Accepted: 16 March 2010 /Published online: 11 June 2010 # Gesellschaft für Biologische Systematik 2010 Abstract We present the most inclusive study on the Savignia frontata, and two representatives each of Erigonella, higher-level phylogeny of erigonine spiders, including Dicymbium and Araeoncus combine to form a monophyletic about 30% of all erigonine genera. By expanding the clade. previously most comprehensive analysis (Miller and Hormiga Cladistics 20:385–442, 2004) we tested the robustness of its Keywords Phylogeny. Morphology . Complex genital results to the addition of closely related taxa, and also the organs . Dwarf spiders . Erigoninae monophyly of the Savignia-group defined by Millidge (Bulletin of the British Arachnological Society 4:1–60, 1977). The character matrix was expanded by adding 18 Introduction newly scored species in 15 genera, and also includes all species scored by other authors. This adds up to 98 species Linyphiidae are the second most diverse spider family in in 91 erigonine genera plus 13 outgroup taxa. The the world and the most diverse in the northern hemisphere, parsimony analysis led to eight fully resolved most including 4359 species in 576 genera (Platnick 2010). The parsimonious trees (L=1084). The topology concerning systematics of Linyphiidae struggles with a tremendous the taxa basal to the ‘distal erigonines’ remained amount of genera with ambiguous genus delimitations. A unchanged, and the latter clade still shares 67% of all morphological phylogeny at genus level is therefore nodes with the original analysis.
    [Show full text]
  • The Uses and Abuses of Tree Thinking in Cultural Evolution
    The Uses and Abuses of Tree Thinking in Cultural Evolution Cara L. Evans1*, Simon J. Greenhill1,2 Joseph Watts3,4, Johann-Mattis List1, Carlos A. Botero5, Russell D. Gray1,6, Kathryn R. Kirby1,7* 1Department of Linguistic and Cultural Evolution, Max Planck Institute for the Science of Human History, Jena 07745, Germany; 2ARC Center of Excellence for the Dynamics of Language, ANU College of Asia and the Pacific, Australian National University, Canberra 2700, Australia; 3Religion Programme, University of Otago, Dunedin 9016, New Zealand; 4Centre for Research on Evolution, Belief and Behaviour, University of Otago, Dunedin 9016, New Zealand; 5Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA; 6School of Psychology, University of Auckland, Auckland 1010, New Zealand; 7Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, M5S 3B2 *Authors for correspondence ([email protected]; [email protected]). Keywords: cultural evolution, phylogenetic comparative methods, cross-cultural research, cultural macro-evolution Abstract Modern phylogenetic methods are increasingly being used to address questions about macro-level patterns in cultural evolution. These methods can illuminate the unobservable histories of cultural traits and identify the evolutionary drivers of trait-change over time, but their application is not without pitfalls. Here we outline the current scope of research in cultural tree thinking, highlighting a toolkit of best practices to navigate and avoid the pitfalls and ‘abuses’ associated with their application. We emphasise two principles that support the appropriate application of phylogenetic methodologies in cross-cultural research: researchers should (1) draw on multiple lines of evidence when deciding if and which types of phylogenetic methods and models are suitable for their cross-cultural data, and (2) carefully consider how different cultural traits might have different evolutionary histories across space and time.
    [Show full text]
  • New Reassortant Clade 2.3.4.4B Avian Influenza A(H5N6) Virus in Wild Birds, South Korea, 2017–2018
    Article DOI: https://doi.org/10.3201/eid2410.180461 New Reassortant Clade 2.3.4.4b Avian Influenza A(H5N6) Virus in Wild Birds, South Korea, 2017–2018 Technical Appendix Methods Virus Isolation Fecal samples were tested for influenza A virus by egg inoculation using 9–1-day-old specific-pathogen-free embryonating chicken eggs. After a 72-h incubation period, allantoic fluids were harvested from an axenic medium, and undiluted allantoic fluids were tested for hemagglutinin (HA) activity. For the definitive diagnosis of influenza A virus, RNA was extracted from HA-positive allantoic fluid using an RNeasy Kit (QIAGEN, Valencia, CA, USA) according to the manufacturer’s instructions. The extracted RNA was tested for the influenza A virus matrix gene by real-time reverse transcription PCR (RT-PCR) as described previously (1). The hosts of the positive fecal sample were identified as Mandarin ducks (Aix galericulata) and mallards (Anas platyrhynchos) using DNA barcoding techniques as previously described (2). Full-genome Sequencing Full-length genome sequencing was performed for 19 of 40 isolates (5 from Cheongmi- cheon River collected on 22 December 2017, 13 from Gokgyo-cheon River collected on 23 December 2017, and 1 from Gokgyo-cheon River collected on 18 January 2018). For molecular analysis, RNA was extracted using the RNeasy kit (QIAGEN) according to the manufacturer’s instructions. The 8 genes of each virus were amplified using 1-step RT-PCR. The RT-PCR amplicons (2 g) of all 8 gene segments was used to prepare Ion Fragment sequencing libraries (Life Technologies, Carlsbad, CA, USA) according to the manufacturer’s instructions.
    [Show full text]