The Anâtaxis Phylogenetic Reconstruction Algorithm

Total Page:16

File Type:pdf, Size:1020Kb

The Anâtaxis Phylogenetic Reconstruction Algorithm U´ G` F´ Departement´ d’informatique Professeur Bastien Chopard Institut Suisse de Bioinformatique Dr Gabriel Bittar The Anataxisˆ phylogenetic reconstruction algorithm THESE` present´ ee´ a` la Faculte´ des sciences de l’Universite´ de Geneve` pour obtenir le grade de Docteur es` sciences, mention bioinformatique par Bernhard Pascal Sonderegger de Heiden (AR) These` No 3863 Geneve` Atelier d’impression de la Section de Physique 2007 FACULTE´ DES SCIENCES Doctorat es` Sciences mention bioinformatique These` de Monsieur Bernhard SONDEREGGER Intitulee:´ The Anataxisˆ phylogenetic reconstruction algorithm La faculte´ des sciences, sur le preavis´ de Messieurs B. CHOPARD, professeur ad- joint et directeur de these` (Departement´ d’ informatique), G. Bittar, Docteur et co- directeur de these` (Institut Suisse de Bioinformatique, Geneve,` Suisse), A. BAIROCH, professeur adjoint (Faculte´ de medecine,´ Section de medecine´ fon- dementale, Departement´ de biologie structurale et bioinformatique) et N. SALAMIN, docteur (Universite´ de Lausanne, Faculte´ de biologie et de mede- cine Departement´ d’ecologie´ et evolution,´ Lausanne, Suisse), autorise l’impression de la presente´ these,` sans exprimer d’opinion sur les propositions qui y sont enonc´ ees.´ Geneve,` le 26.06.2007 Th`ese-3863- Le Doyen, Pierre SPIERER Contents Contents i Remerciements 1 Preface 1 R´esum´een franc¸ais 5 Introduction a` la phylogen´ etique´ ....................... 5 L’algorithme Anataxisˆ ............................. 8 Calcul de dissimilitudes ............................ 11 Validation numerique´ .............................. 11 Implementation´ ................................. 12 Exemple biologique ............................... 13 Conclusion .................................... 14 1 An introduction to phylogenetics 17 1.1 Homology and homoplasy ........................ 18 1.1.1 Characters and their states .................... 18 1.1.2 Homology is a phylogenetic hypothesis ............ 20 1.1.3 Homoplasy, a pitfall in phylogenetics ............. 21 1.2 Molecular phylogenetics ......................... 23 1.2.1 Speciation / duplication , orthologs / paralogs ........ 24 1.2.2 Sequence alignment as a homology hypothesis ........ 26 1.2.3 Evolutionary time ........................ 27 1.3 Tree reconstitution ............................ 28 1.3.1 Numerical Taxonomic Phenetics (NTP) ............ 29 1.3.2 Cladistic Maximum Parsimony (CMP) methods ....... 30 1.3.3 Probabilistic Methods ...................... 32 i Anataxisˆ Bernhard P. Sonderegger 1.3.4 Searching for the optimal tree .................. 33 1.3.5 Estimating tree robustness .................... 36 1.4 Uses of phylogenetics in molecular biology .............. 38 1.4.1 Prediction of gene function ................... 38 1.4.2 New directions in phylogenetics ................ 39 1.4.3 Very large trees .......................... 40 2 The Anˆataxisalgorithm 41 2.1 Overview .................................. 41 2.2 Normalisation ............................... 42 2.3 Ingroup division ............................. 43 2.4 Combined effects of homoplasy and heterogeneous rates of evolution 47 2.4.1 Homoplasy within the ingroup ................. 47 2.4.2 Homoplasy between the outgroup and the ingroup ..... 48 3 Dissimilarity calculation 51 3.1 Background ................................ 51 3.2 Methods used with Anataxisˆ ...................... 53 3.2.1 Universal dissimilarity calculation methods ......... 54 3.2.2 Dissimilarity calculation methods for nucleotide sequences 54 3.2.3 External dissimilarity calculation methods .......... 55 3.3 Dissimilarity values containing uncertainty .............. 58 3.3.1 Types of uncertainty ....................... 58 3.3.2 Comparison of uncertain dissimilarities ............ 59 4 Numerical validation 63 4.1 Validation of the normalisation step .................. 63 4.1.1 Normalisation with comparable branches ........... 63 4.1.2 Normalisation with long branches ............... 67 4.2 Validation of the ingroup division step ................. 80 5 Implementation 83 5.1 Complete application with graphical user interface ......... 83 5.1.1 Loading sequence alignments .................. 83 5.1.2 Calculating the dissimilarity matrix .............. 83 5.1.3 Ingroup and outgroup selection ................ 85 ii CONTENTS 5.1.4 Running Anataxisˆ ........................ 86 5.2 Command-line version .......................... 87 5.3 Execution times .............................. 87 5.4 Parallelisation ............................... 89 5.4.1 Using MPI in an object-oriented environment ........ 90 5.4.2 Implementation details ..................... 91 5.4.3 Speed-up obtained ........................ 94 5.4.4 Conclusion ............................ 94 6 Evaluating the performance of Anˆataxisusing a biological example 97 6.1 Phylogenetic tree evaluation ....................... 97 6.1.1 Automatic species-tree annotation and comparison ..... 99 6.1.2 Implementation .......................... 100 6.2 Biological examples ............................ 101 6.2.1 Sequence alignment ....................... 101 6.2.2 The rps4 dataset .......................... 102 Conclusion 106 Appendices 108 A Details of the Neighbor-Joining algorithm 109 A.1 Principle .................................. 109 A.2 Algorithm ................................. 110 A.3 Mathematical proof ............................ 111 B Pseudocode 115 B.1 Recursion .................................. 115 B.2 Normalisation ............................... 116 B.3 Graph creation ............................... 117 C Usage details for the command-line version of Anˆataxis 119 D NEWT-tree extraction and automatic tree annotation tools 121 D.1 newt.rb ................................... 121 D.2 simple tree.rb ............................... 122 D.3 make newt tree.rb ............................ 122 iii Anataxisˆ Bernhard P. Sonderegger D.4 annotate tree.rb .............................. 123 E Obj-MPI: an example 125 Bibliography 129 List of Figures 136 List of Tables 139 Index 140 iv Contents Remerciements Je tiens a` remercier Bastien Chopard pour l’acceuil dans son groupe de recherche et pour son aide durant toutes ces annees.´ Je remercie Gabriel Bittar qui a pris le temps de m’encadrer depuis l’Australie et qui m’a apporte´ son soutient scienti- fique, technique et philosophique. Il a toujours fait son possible pour m’aider sans avoir peur de se ‘salir les mains’ avec des taches penibles´ comme l’alignement manuel des sequences.´ Mes collegues de bureau ont rendu la routine de tous les jours tellement plus vi- vable. Plus particulierement, j’aimerais nommer Jonas pour de multiples sessions de ‘debugging’ C++ et son sens d’humour. Jean-Luc pour de nombreuses discu- tions sur des sujets aussi varies´ que la cuisine medievale´ ou les details´ techniques des denieres` distributions de linux. Davide pour ses acrobaties linguistiques et son bon humeur contagieux. Rafik, le seul musulmane que je ne connaisse qui defend la notion de “intelligent design”, pour des debats´ amusants. Je remercie egalement´ mes parents qui m’ont toujours encourages´ a` poursuivre mes buts et qui m’ont motives´ a suivre mon interet pour la science depuis mon plus jeun age. Il est clair que le plus grand remerciement est du a` ma femme Lina qui m’a motive´ quand j’en avait marre, qui m’a pousse´ quand je trainait les pieds et qui a toujours et´ e´ la` pour moi. Sachant ce qui implique une thse` de doctorat, elle a non seulement consenti d’epouser´ un thesard,´ mais lui a aussi donnee´ une superbe fille. Merci ma petite Genevieve` d’avoir et´ e´ un petit rayon de soleil, toujours capable de me faire rire ou sourire, memeˆ pendant les periodes´ difficiles de redaction´ de ce document. 1 Anataxisˆ Bernhard P. Sonderegger 2 Preface Phylogenetics, the science of evolutionary relationships, is a dynamic field. In the century since Darwin proposed his theory of evolution and in the thirty years since the advent of efficient DNA sequencing, tremendous advances have been made in phylogenetic methodology. New methods of phylogenetic tree reconstruction are regularly proposed and phylogenetics is applied in ever more diverse fields of molecular and computational biology. Nevertheless, two facets of phylogenetic reconstruction remain limiting: Certain properties inherent in the data make reconstruction a non-trivial • task. These properties can be summarised as a) similarities which are not due to a common ancestor, and b) rates of evolution which differ between lineages and between sites. Phylogenetic reconstruction is an NP-hard problem. Computation time and • memory quickly become limiting as larger problems are approached. This thesis proposes a new phylogenetic reconstruction algorithm developed by Dr. Gabriel Bittar and Bernhard Sonderegger. The algorithm uses some truly novel ideas and does not easily fit into existing categories of reconstruction methods. One of its principal aims is to be able to deal with very large datasets (thousands of taxons). 3 Anataxisˆ Bernhard P. Sonderegger 4 R´esum´een franc¸ais Introduction `ala phylog´en´etique La phylogen´ etique,´ du grec phylon (‘race, tribu, clan’) et genˆetikos (‘relatif a` la naissance, la gen´ eration,´ la gen´ ese’),` est l’etude´ des relations entre objets evoluant´ dans le temps. C’est bien evidemment´ le cas des objets vivants, qui depuis pres` de 4 milliards d’annees´ sur cette planete` montrent dans l’ensemble une tendance a` la diversification (cladogen´ ese,` du grec klados, ‘arbre’ : le nombre d’especes` vivantes
Recommended publications
  • Reticulate Evolution and Phylogenetic Networks 6
    CHAPTER Reticulate Evolution and Phylogenetic Networks 6 So far, we have modeled the evolutionary process as a bifurcating treelike process. The main feature of trees is that the flow of information in one lineage is independent of that in another lineage. In real life, however, many process- es—including recombination, hybridization, polyploidy, introgression, genome fusion, endosymbiosis, and horizontal gene transfer—do not abide by lineage independence and cannot be modeled by trees. Reticulate evolution refers to the origination of lineages through the com- plete or partial merging of two or more ancestor lineages. The nonvertical con- nections between lineages are referred to as reticulations. In this chapter, we introduce the language of networks and explain how networks can be used to study non-treelike phylogenetic processes. A substantial part of this chapter will be dedicated to the evolutionary relationships among prokaryotes and the very intriguing question of the origin of the eukaryotic cell. Networks Networks were briefly introduced in Chapter 4 in the context of protein-protein interactions. Here, we provide a more formal description of networks and their use in phylogenetics. As mentioned in Chapter 5, trees are connected graphs in which any two nodes are connected by a single path. In phylogenetics, a network is a connected graph in which at least two nodes are connected by two or more pathways. In other words, a network is a connected graph that is not a tree. All networks contain at least one cycle, i.e., a path of branches that can be followed in one direction from any starting point back to the starting point.
    [Show full text]
  • Splits and Phylogenetic Networks
    SplitsSplits andand PhylogeneticPhylogenetic NetworksNetworks DanielDaniel H.H. HusonHuson Paris, June 21, 2005 1 ContentsContents 1.1. PhylogeneticPhylogenetic treestrees 2.2. SplitsSplits networksnetworks 3.3. ConsensusConsensus networksnetworks 4.4. HybridizationHybridization andand reticulatereticulate networksnetworks 5.5. RecombinationRecombination networksnetworks 2 PhylogeneticPhylogenetic NetworksNetworks Bandelt (1991): Network displaying evolutionary relationships Other types of Splits Phylogenetic Reticulate phylogenetic networks trees networks networks Median Consensus Hybridization Recombination Augmented networks (super) networks networks trees networks Special case: “Galled trees” Any graph Ancestor Split decomposition, representing Split decomposition, recombination evolutionary Neighbor-net graphs data 3 PhylogeneticPhylogenetic NetworksNetworks 22 11 Other types of Splits Phylogenetic Reticulate phylogenetic networks trees networks networks 33 44 55 Median Consensus Hybridization Recombination Augmented networks (super) networks networks trees networks from sequences Special case: “Galled trees” from trees Any graph Ancestor Split decomposition, representing Split decomposition, recombination evolutionary Neighbor-net graphs data from distances 4 PhylogeneticPhylogenetic NetworksNetworksDan Gusfield: “Phylogenetic network” 22 11 “Generalized Other types of Splits Phylogenetic Reticulate phylogeneticphylogenetic networks trees networks network”networks 33 44 \ 55 Median Consensus Hybridization Recombination“More generalizedAugmented
    [Show full text]
  • The Probability of Monophyly of a Sample of Gene Lineages on a Species Tree
    PAPER The probability of monophyly of a sample of gene COLLOQUIUM lineages on a species tree Rohan S. Mehtaa,1, David Bryantb, and Noah A. Rosenberga aDepartment of Biology, Stanford University, Stanford, CA 94305; and bDepartment of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand Edited by John C. Avise, University of California, Irvine, CA, and approved April 18, 2016 (received for review February 5, 2016) Monophyletic groups—groups that consist of all of the descendants loci that are reciprocally monophyletic is informative about the of a most recent common ancestor—arise naturally as a conse- time since species divergence and can assist in representing the quence of descent processes that result in meaningful distinctions level of differentiation between groups (4, 18). between organisms. Aspects of monophyly are therefore central to Many empirical investigations of genealogical phenomena have fields that examine and use genealogical descent. In particular, stud- made use of conceptual and statistical properties of monophyly ies in conservation genetics, phylogeography, population genetics, (19). Comparisons of observed monophyly levels to model pre- species delimitation, and systematics can all make use of mathemat- dictions have been used to provide information about species di- ical predictions under evolutionary models about features of mono- vergence times (20, 21). Model-based monophyly computations phyly. One important calculation, the probability that a set of gene have been used alongside DNA sequence differences between and lineages is monophyletic under a two-species neutral coalescent within proposed clades to argue for the existence of the clades model, has been used in many studies. Here, we extend this calcu- (22), and tests involving reciprocal monophyly have been used to lation for a species tree model that contains arbitrarily many species.
    [Show full text]
  • Math/C SC 5610 Computational Biology Lecture 12: Phylogenetics
    Math/C SC 5610 Computational Biology Lecture 12: Phylogenetics Stephen Billups University of Colorado at Denver Math/C SC 5610Computational Biology – p.1/25 Announcements Project Guidelines and Ideas are posted. (proposal due March 8) CCB Seminar, Friday (Mar. 4) Speaker: Jack Horner, SAIC Title: Phylogenetic Methods for Characterizing the Signature of Stage I Ovarian Cancer in Serum Protein Mas Time: 11-12 (Followed by lunch) Place: Media Center, AU008 Math/C SC 5610Computational Biology – p.2/25 Outline Distance based methods for phylogenetics UPGMA WPGMA Neighbor-Joining Character based methods Maximum Likelihood Maximum Parsimony Math/C SC 5610Computational Biology – p.3/25 Review: Distance Based Clustering Methods Main Idea: Requires a distance matrix D, (defining distances between each pair of elements). Repeatedly group together closest elements. Different algorithms differ by how they treat distances between groups. UPGMA (unweighted pair group method with arithmetic mean). WPGMA (weighted pair group method with arithmetic mean). Math/C SC 5610Computational Biology – p.4/25 UPGMA 1. Initialize C to the n singleton clusters f1g; : : : ; fng. 2. Initialize dist(c; d) on C by defining dist(fig; fjg) = D(i; j): 3. Repeat n ¡ 1 times: (a) determine pair c; d of clusters in C such that dist(c; d) is minimal; define dmin = dist(c; d). (b) define new cluster e = c S d; update C = C ¡ fc; dg Sfeg. (c) define a node with label e and daughters c; d, where e has distance dmin=2 to its leaves. (d) define for all f 2 C with f 6= e, dist(c; f) + dist(d; f) dist(e; f) = dist(f; e) = : (avg.
    [Show full text]
  • Chapter 7 Phylogenesis Versus
    108 The Evolution of Culturol Diversity Clades Lineages CHAPTER 7 PHYLOGENESIS VERSUS ETHNOGENESIS IN vvvv TURKMEN CULTURAL EVOLUTION vv v Mark Collard and Jamshid Tehrani INTRODUCTION The processes responsible for producing the similarities and differences among cultures have been the focus of much debate in recent years, as has the corollary vv issue of linking cultural data with the patterns recorded by linguists and Figure 6.10 Clodes versus lineages. All nine diagrams represent the same hiologists working with human populations (eg, Romney 1957; Vogt 1964; phylogeny, with clades highlighted on the left and lineages on the tight Chakraborty ct a11976; Brace and Hinton 1981; Cavalli-Sforza and Feldman 1981; Additional lineages can be counted from various internal nodes to the branch Lumsden and Wilson 1981~ Ammerman and Cavalli-Sforza 1984; Boyd and tips (after de Quelroz 1998). Richerson 1985; Terrell 1986, 1988; Kirch and Green 1987, 2001; Renfrew 19H7, 1992, 2000b, 2001; Atkinson 1989; Croes 1989; Bateman et a11990; Durham 1990, Archaeologists are uniquely capable of ans\vering these questions, and cladistics 1991,1992; Moore 1994b; Cavalli-Sforza and Cavalli-Sforza 1995; Guglielmino et a[ offers a means to answer them. 19Q5; Laland et af 1995; Zvelebil 1995; Bellwood 1996a, 2001; Boyd clal 1997~ But are we simply borrowing techniques of biological origin \vithout a firm Shennan 2000, 2002; Smith 2001; Whaley 2001; Terrell cI ill 20CH; Jordan dnd basis for so doing? No. We view cultural phenomena as residing in a series of Shennan 2003). To date, this debate has concentrated on two cornpeting nested hierarchies that comprise traditions, or lineages, at ever more-inclusive hypotheses, which have been termed the 'genetic', 'demie diffusion', 'branching' scales and that are held together by cultural as \veU as genetic transmission.
    [Show full text]
  • Understanding the Processes Underpinning Patterns Of
    Opinion Understanding the Processes Underpinning Patterns of Phylogenetic Regionalization 1,2, 3,4 1 Barnabas H. Daru, * Tammy L. Elliott, Daniel S. Park, and 5,6 T. Jonathan Davies A key step in understanding the distribution of biodiversity is the grouping of regions based on their shared elements. Historically, regionalization schemes have been largely species centric. Recently, there has been interest in incor- porating phylogenetic information into regionalization schemes. Phylogenetic regionalization can provide novel insights into the mechanisms that generate, distribute, and maintain biodiversity. We argue that four processes (dispersal limitation, extinction, speciation, and niche conservatism) underlie the forma- tion of species assemblages into phylogenetically distinct biogeographic units. We outline how it can be possible to distinguish among these processes, and identify centers of evolutionary radiation, museums of diversity, and extinction hotspots. We suggest that phylogenetic regionalization provides a rigorous and objective classification of regional diversity and enhances our knowledge of biodiversity patterns. Biogeographical Regionalization in a Phylogenetic Era 1 Department of Organismic and Biogeographical boundaries delineate the basic macrounits of diversity in biogeography, Evolutionary Biology, Harvard conservation, and macroecology. Their location and composition of species on either side of University, Cambridge, MA 02138, these boundaries can reflect the historical processes that have shaped the present-day USA th 2 Department of Plant Sciences, distribution of biodiversity [1]. During the early 19 century, de Candolle [2] created one of the University of Pretoria, Private Bag first global geographic regionalization schemes for plant diversity based on both ecological X20, Hatfield 0028, Pretoria, South and historical information. This was followed by Sclater [3], who defined six zoological regions Africa 3 Department of Biological Sciences, based on the global distribution of birds.
    [Show full text]
  • Stochastic Models in Phylogenetic Comparative Methods: Analytical Properties and Parameter Estimation
    Thesis for the Degree of Doctor of Philosophy Stochastic models in phylogenetic comparative methods: analytical properties and parameter estimation Krzysztof Bartoszek Division of Mathematical Statistics Department of Mathematical Sciences Chalmers University of Technology and University of Gothenburg G¨oteborg, Sweden 2013 Stochastic models in phylogenetic comparative methods: analytical properties and parameter estimation Krzysztof Bartoszek Copyright c Krzysztof Bartoszek, 2013. ISBN 978–91–628–8740–7 Department of Mathematical Sciences Division of Mathematical Statistics Chalmers University of Technology and University of Gothenburg SE-412 96 GOTEBORG,¨ Sweden Phone: +46 (0)31-772 10 00 Author e-mail: [email protected] Typeset with LATEX. Department of Mathematical Sciences Printed in G¨oteborg, Sweden 2013 Stochastic models in phylogenetic comparative methods: analytical properties and parameter estimation Krzysztof Bartoszek Abstract Phylogenetic comparative methods are well established tools for using inter–species variation to analyse phenotypic evolution and adaptation. They are generally hampered, however, by predominantly univariate ap- proaches and failure to include uncertainty and measurement error in the phylogeny as well as the measured traits. This thesis addresses all these three issues. First, by investigating the effects of correlated measurement errors on a phylogenetic regression. Second, by developing a multivariate Ornstein– Uhlenbeck model combined with a maximum–likelihood estimation pack- age in R. This model allows, uniquely, a direct way of testing adaptive coevolution. Third, accounting for the often substantial phylogenetic uncertainty in comparative studies requires an explicit model for the tree. Based on recently developed conditioned branching processes, with Brownian and Ornstein–Uhlenbeck evolution on top, expected species similarities are derived, together with phylogenetic confidence intervals for the optimal trait value.
    [Show full text]
  • Application of Phylogenetic Networks in Evolutionary Studies
    Application of Phylogenetic Networks in Evolutionary Studies Daniel H. Huson* and David Bryant *Center for Bioinformatics (ZBIT), Tu¨bingen University, Tu¨bingen, Germany; and Department of Mathematics, Auckland University, Auckland, New Zealand The evolutionary history of a set of taxa is usually represented by a phylogenetic tree, and this model has greatly facilitated the discussion and testing of hypotheses. However, it is well known that more complex evolutionary scenarios are poorly described by such models. Further, even when evolution proceeds in a tree-like manner, analysis of the data may not be best served by using methods that enforce a tree structure but rather by a richer visualization of the data to evaluate its properties, at least as an essential first step. Thus, phylogenetic networks should be employed when reticulate events such as hybridization, horizontal gene transfer, recombination, or gene duplication and loss are believed to be involved, and, even in the absence of such events, phylogenetic networks have a useful role to play. This article reviews the terminology used for phylogenetic networks and covers both split networks and reticulate networks, how they are defined, and how they can be interpreted. Additionally, the article outlines the beginnings of a comprehensive statistical framework for applying split network methods. We show how split networks can represent confidence sets of trees and introduce a conservative statistical test for whether the conflicting signal in a network is treelike. Finally, this article describes a new program, SplitsTree4, an interactive and comprehensive tool for inferring different types of phylogenetic networks from sequences, distances, and trees. Introduction The familiar evolutionary model assumes a tree, a Finally, we present our new program SplitsTree4, model that has greatly facilitated the discussion and testing which provides a comprehensive and interactive frame- of hypotheses.
    [Show full text]
  • Thesis (PDF, 13.51MB)
    Insects and their endosymbionts: phylogenetics and evolutionary rates Daej A Kh A M Arab The University of Sydney Faculty of Science 2021 A thesis submitted in fulfilment of the requirements for the degree of Doctor of Philosophy Authorship contribution statement During my doctoral candidature I published as first-author or co-author three stand-alone papers in peer-reviewed, internationally recognised journals. These publications form the three research chapters of this thesis in accordance with The University of Sydney’s policy for doctoral theses. These chapters are linked by the use of the latest phylogenetic and molecular evolutionary techniques for analysing obligate mutualistic endosymbionts and their host mitochondrial genomes to shed light on the evolutionary history of the two partners. Therefore, there is inevitably some repetition between chapters, as they share common themes. In the general introduction and discussion, I use the singular “I” as I am the sole author of these chapters. All other chapters are co-authored and therefore the plural “we” is used, including appendices belonging to these chapters. Part of chapter 2 has been published as: Bourguignon, T., Tang, Q., Ho, S.Y., Juna, F., Wang, Z., Arab, D.A., Cameron, S.L., Walker, J., Rentz, D., Evans, T.A. and Lo, N., 2018. Transoceanic dispersal and plate tectonics shaped global cockroach distributions: evidence from mitochondrial phylogenomics. Molecular Biology and Evolution, 35(4), pp.970-983. The chapter was reformatted to include additional data and analyses that I undertook towards this paper. My role was in the paper was to sequence samples, assemble mitochondrial genomes, perform phylogenetic analyses, and contribute to the writing of the manuscript.
    [Show full text]
  • Journal of Pharmacology and Toxicological Studies
    e-ISSN: 2319-9873 p-ISSN: 2347-2324 Research & Reviews: Journal of Pharmacology and Toxicological Studies A Short Study on Phylogenetics Indu Sama* Department of plant Pathology and forest genetics, Bharat University, India Review Article ABSTRACT Received: 06/09/2016 Accepted : 09/09/2016 Published: 12/09/2016 In science, phylogenetics is the investigation of transformative *For Correspondence connections among gatherings of life forms (e.g. species,populations), which are found through atomic sequencing information and morphological information grids. The term phylogeneticsderives from the Greek Department of plant Pathology expressions phylé (φυλή) and phylon (φῦλον), meaning "tribe", "faction", and forest genetics, Bharat "race" and the descriptive structure, genetikós(γενετικός), of the word University, India genesis (γένεσις) "source", "source", "conception". Indeed, phylogenesis is the procedure, phylogeny is science on this procedure, and phylogenetics is E-mail: [email protected] phylogeny in view of examination of successions of organic macromolecules (DNA, RNA and proteins, in the first). The consequence of phylogenetic Keywords: Phylogenetics, studies is a speculation about the developmental history of taxonomic Devolopment, Computation gatherings: their phylogeny. INTRODUCTION Development is a procedure whereby populaces are modified over the long run and may part into particular branches, hybridize together, or end by elimination [1 -15]. The transformative expanding procedure may be portrayed as a phylogenetic tree, and the spot of each of the different living beings on the tree is in view of a speculation about the arrangement in which developmental spreading occasions happened. Inhistorical phonetics, comparative ideas are utilized regarding connections in the middle of dialects; and in literary feedback withstemmatics [15-25].
    [Show full text]
  • Phylogenomics of Reichenowia Parasitica, an Alphaproteobacterial Endosymbiont of the Freshwater Leech Placobdella Parasitica
    City University of New York (CUNY) CUNY Academic Works Publications and Research CUNY Graduate Center 2011 Phylogenomics of Reichenowia parasitica, an Alphaproteobacterial Endosymbiont of the Freshwater Leech Placobdella parasitica Sebastian Kvist American Museum of Natural History Apurva Narechania American Museum of Natural History Alejandro Oceguera-Figueroa CUNY Graduate Center Bella Fuks Long Island University Mark E. Siddall American Museum of Natural History How does access to this work benefit ou?y Let us know! More information about this work at: https://academicworks.cuny.edu/gc_pubs/365 Discover additional works at: https://academicworks.cuny.edu This work is made publicly available by the City University of New York (CUNY). Contact: [email protected] Phylogenomics of Reichenowia parasitica,an Alphaproteobacterial Endosymbiont of the Freshwater Leech Placobdella parasitica Sebastian Kvist1,2*, Apurva Narechania3, Alejandro Oceguera-Figueroa2,4, Bella Fuks5, Mark E. Siddall2,3 1 Richard Gilder Graduate School, American Museum of Natural History, New York, New York, United States of America, 2 Division of Invertebrate Zoology, American Museum of Natural History, New York, New York, United States of America, 3 Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America, 4 Department of Biology, The Graduate Center, The City University of New York, New York, New York, United States of America, 5 Long Island University Brooklyn Campus, Brooklyn, New York, United States of America Abstract Although several commensal alphaproteobacteria form close relationships with plant hosts where they aid in (e.g.,) nitrogen fixation and nodulation, only a few inhabit animal hosts. Among these, Reichenowia picta, R.
    [Show full text]
  • Molecular Phylogenetics (Hannes Luz)
    Tree: minimum but fully connected (no loop, one breaks) Molecular Phylogenetics (Hannes Luz) Contents: • Phylogenetic Trees, basic notions • A character based method: Maximum Parsimony • Trees from distances • Markov Models of Sequence Evolution, Maximum Likelihood Trees References for lectures • Joseph Felsenstein, Inferring Phylogenies, Sinauer Associates (2004) • Dan Graur, Weng-Hsiun Li, Fundamentals of Molecular Evolution, Sinauer Associates • D.W. Mount. Bioinformatics: Sequences and Genome analysis, 2001. • D.L. Swofford, G.J. Olsen, P.J.Waddell & D.M. Hillis, Phylogenetic Inference, in: D.M. Hillis (ed.), Molecular Systematics, 2 ed., Sunder- land Mass., 1996. • R. Durbin, S. Eddy, A. Krogh & G. Mitchison, Biological sequence analysis, Cambridge, 1998 References for lectures, cont’d • S. Rahmann, Spezielle Methoden und Anwendungen der Statistik in der Bioinformatik (http://www.molgen.mpg.de/~rahmann/afw-rahmann. pdf) • K. Schmid, A Phylogenetic Parsimony Method Considering Neigh- bored Gaps (Bachelor thesis, FU Berlin, 2007) • Martin Vingron, Hannes Luz, Jens Stoye, Lecture notes on ’Al- gorithms for Phylogenetic Reconstructions’, http://lectures.molgen. mpg.de/Algorithmische_Bioinformatik_WS0405/phylogeny_script.pdf Recommended reading/watching • Video streams of Arndt von Haeseler’s lectures held at the Otto Warburg Summer School on Evolutionary Genomics 2006 (http: //ows.molgen.mpg.de/2006/von_haeseler.shtml) • Dirk Metzler, Algorithmen und Modelle der Bioinformatik, http://www. cs.uni-frankfurt.de/~metzler/WS0708/skriptWS0708.pdf Software links • Felsenstein’s list of software packages: http://evolution.genetics.washington.edu/phylip/software.html • PHYLIP is Felsenstein’s free software package for inferring phyloge- nies, http://evolution.genetics.washington.edu/phylip.html • Webinterface for PHYLIP maintained at Institute Pasteur, http://bioweb.pasteur.fr/seqanal/phylogeny/phylip-uk.html • Puzzle (Strimmer, v.
    [Show full text]