Molecular Phylogeny and Evolution

Total Page:16

File Type:pdf, Size:1020Kb

Molecular Phylogeny and Evolution Molecular Phylogeny and Evolution 10 – 14 February 2020 Juan I. MONTOYA BURGOS Lab of Molecular Phylogeny and Evolution in Vertebrates Title of the course: Molecular Phylogeny and Molecular Evolution Evolutionary relationships among organisms = tree topology A better understanding of molecular evolution improves: First phylogenetic methods did not - topology and branch length make use of models of molecular reconstruction (=phylogenetic tree) evolution (UPGMA, Maximum Parsimony) Better phylogenetic trees improve: - the understanding of evolutionary processes => Models of molecular evolution Lab of Molecular Phylogeny and Evolution in Vertebrates 1 Why should we care about phylogenies? Are you using phylogenetics ? Lab of Molecular Phylogeny and Evolution in Vertebrates Current phylogenetic methods allow: - reconstruction of evolutionary relationships But also the analysis of: Gene/genome duplication Recombination Evolutionary rates Divergence time among lineages Selective pressure Demography Biodiversity Genetic variability Conservation Biogeography Spread of contagious disease Discovery of biomedical compounds Adaptive evolution Biomonitoring Protein-protein co-evolution And more …. Lab of Molecular Phylogeny and Evolution in Vertebrates 2 Phylogenetic analyses: many uses Understanding evolutionary relationships Determining the closest extant relative to tetrapods Amemiya et al., 2013. The African coelacanth genome provides insights into tetrapod evolution. Nature 496. Lab of Molecular Phylogeny and Evolution in Vertebrates Phylogenetic analyses: many uses Understanding morphological evolution Using a species-level phylogeny of Aquilegia, Whittall and Hodges showed a significant evolutionary trend for longer nectar spur during directional shifts to pollinators with longer tongues (moths). Whittall and Hodges, 2007. Pollinator shifts drive increasingly long nectar spurs in columbine flowers. Nature 447, 706-709 Lab of Molecular Phylogeny and Evolution in Vertebrates 3 Phylogenetic analyses: many uses Understanding bio-diversification Calibrated phylogeny of Zosterops and relatives Moyle et al. 2009. Explosive Pleistocene diversification and hemispheric expansion of a "great speciator." PNAS Lab of Molecular Phylogeny and Evolution in Vertebrates Phylogenetic analyses: many uses Understanding bio-diversification Calibrated phylogeny of Zosterops and relatives Lineage-through-time plots (LTT) Moyle et al. 2009. Explosive Pleistocene diversification and hemispheric expansion of a "great speciator." PNAS Lab of Molecular Phylogeny and Evolution in Vertebrates 4 Phylogenetic analyses: many uses New biodiversity 1 Unexpected diversity in the catfish Pseudancistrus brevispinis 2 reveals dispersal routes in a Neotropical center of endemism: the 3 Guyanas Region 4 5 6 YAMILA P. CARDOSO1 and JUAN I. MONTOYA-BURGOS 1* 7 8 9 10 11 1 Department of Zoology and Animal Biology, University of Geneva, 30 quai Ernest 12 Ansermet, 1211 Geneva 4, Switzerland Lab of Molecular Phylogeny and Evolution in Vertebrates Phylogenetic analyses: many uses New biodiversity 1 species à 6 species Lab of Molecular Phylogeny and Evolution in Vertebrates 5 Phylogenetic analyses: many uses Biogeography Biogeography: four possible entrance scenarios Topological tests: -SH test -AU test Lab of Molecular Phylogeny and Evolution in Vertebrates Phylogenetic analyses: many uses DNA taxonomy of Animals: species identification Hebert et al. PNAS 101:14812 (2004) Lab of Molecular Phylogeny and Evolution in Vertebrates 6 Phylogenetic analyses: many uses Evolutionary rates and selective pressure acting on genes The evolutionary histories of genes bear the marks of the functional demands to which they have been subjected. A maximum likelihood phylogenetic tree of Hmr built by the free-ratio model in PAML FIG. 3.ム A maximum likelihood phylogenetic tree of Hmr built by the free-ratio model in PAML. The likelihood of this model was significantly better than the one-ratio model (2l = 54.224, degrees of freedom = 11, P < 10ミ7). The tree length is defined as the number of nucleotide substitutions per codon. The number shown above each lineage is the estimated DN/ DS value of that lineage Maheshwari, S. et al. Mol Biol Evol 2008 25:2421-2430; doi:10.1093/molbev/msn190 Lab of Molecular Phylogeny and Evolution in Vertebrates Phylogenetic analyses: many uses Analysis of: - duplicated genes - gene families - genome duplication / polyploidisation Lab of Molecular Phylogeny and Evolution in Vertebrates 7 REVIEWS REVIEWS Phylogenetic analyses: many uses COMPUTATIONAL TOOLS COMPUTATIONAL TOOLS EmergingEvolution methods and selectionEmerging in protein on proteins, methods protein in protein co-evolutioninteractionsco-evolution and protein networks David de Juan1, Florencio Pazos2 and AlfonsoDavid Valencia de Juan11, Florencio Pazos2 and Alfonso Valencia1 Abstract | Co-evolution is a fundamental componentAbstract of| Co-evolution the theory of is evolution a fundamental and is component of the theory of evolution and is essential for understanding the relationshipsessential between for species understanding in complex the ecological relationships between species in complex ecological networks. A wide range of co-evolution-inspirednetworks. computational A wide range methods of co-evolution has been -inspired computational methods has been designed to predict molecular interactions, butdesigned it is only to recentlypredict molecular that important interactions, advances but it is only recently that important advances have been made. Breakthroughs in the handlinghave of been phylogenetic made. Breakthroughs information inand the in handling of phylogenetic information and in disentangling indirect relationships have resulteddisentangling in an improved indirect capacity relationships to predict have resulted in an improved capacity to predict interactions between proteins and contacts interactionsbetween different between protein proteins residues. and contacts Here, we between different protein residues. Here, we review the main co-evolution-based computationalreview the approaches, main co-evolution their theoretical-based computational basis, approaches, their theoretical basis, potential applications and foreseeable developments.potential applications and foreseeable developments. As a part of evolutionary theory, co-evolution is essential Tools at the residue level were inspired by the existence Molecular phylogenetics As a part of evolutionary theory,Molecular co-evolution phylogenetics is essential Tools at the residue level were inspired by the existence The study of evolutionary for understanding living systems.The study In itsof evolutionarysimplest definition, for understandingof interdependent living changes systems. inIn groupsits simplest of variable definition, amino of interdependent changes in groups of variable amino phenomena using biomolecular co-evolution refers to the coordinatedphenomena using changes biomolecular that occur co-evolution acids, as refersformulated to the coordinated for the first changes time by that the occur covarion acids, as formulated for the first time by the covarion data, generally in the form of in pairs of organisms or biomolecules,data, generally typically in the form to of mainin- pairsmodel of 6organisms, and they or typically biomolecules, use the typically multiple to mainsequence- model 6, and they typically use the multiple sequence Lab of Molecular Phylogenysequences and of Evolution nucleic acids in Vertebrates sequences of nucleic acids tain or to refine functional interactions between thosetain alignmentor to refine (MSA) functional for ainteractions protein family between of homologues those alignment (MSA) for a protein family of homologues or proteins. or proteins. pairs. Darwin himself initiated the study of co-evolution,pairs. to Darwin search forhimself correlated initiated mutations the study. Such of co-evolution,correlated muta -to search for correlated mutations. Such correlated muta- Covarion model and his observation on the Covarionrelationship model between the sizeand tionshis observation are suggestive on the of compensatory relationship between changes the that size occur tions are suggestive of compensatory changes that occur A phylogenetic model in of orchids’ corollae and the lengthA phylogenetic of the modelproboscis in of polof- orchids’between corollae entangled and theresidues length (for of the example, proboscis those of inpol prox- -between entangled residues (for example, those in prox- which the evolutionary linators led him to predict successfullywhich the evolutionary the existence of alinators imity, led direct him to contact predict or successfully acting together the existence in catalytic of a orimity, direct contact or acting together in catalytic or rate of different codons rate of different codons new species that was able to suck from the large spur of binding sites) to maintain protein stability, function are interdependent. new species that was able toare suck interdependent. from the large spur of binding sites) to maintain protein stability, function Darwin’s orchid. The studies of Dobzhansky1 and othersDarwin’s2 or folding orchid.7–10 The. Furthermore, studies of Dobzhansky extending1 andthese others methods2 or folding7–10. Furthermore, extending these methods Phylogenetic analyses: many uses Protein family contributed to the establishmentProtein of family this concept in geneticcontributed to search to the for establishment correlated ofmutations this concept between in genetic
Recommended publications
  • Lecture Notes: the Mathematics of Phylogenetics
    Lecture Notes: The Mathematics of Phylogenetics Elizabeth S. Allman, John A. Rhodes IAS/Park City Mathematics Institute June-July, 2005 University of Alaska Fairbanks Spring 2009, 2012, 2016 c 2005, Elizabeth S. Allman and John A. Rhodes ii Contents 1 Sequences and Molecular Evolution 3 1.1 DNA structure . .4 1.2 Mutations . .5 1.3 Aligned Orthologous Sequences . .7 2 Combinatorics of Trees I 9 2.1 Graphs and Trees . .9 2.2 Counting Binary Trees . 14 2.3 Metric Trees . 15 2.4 Ultrametric Trees and Molecular Clocks . 17 2.5 Rooting Trees with Outgroups . 18 2.6 Newick Notation . 19 2.7 Exercises . 20 3 Parsimony 25 3.1 The Parsimony Criterion . 25 3.2 The Fitch-Hartigan Algorithm . 28 3.3 Informative Characters . 33 3.4 Complexity . 35 3.5 Weighted Parsimony . 36 3.6 Recovering Minimal Extensions . 38 3.7 Further Issues . 39 3.8 Exercises . 40 4 Combinatorics of Trees II 45 4.1 Splits and Clades . 45 4.2 Refinements and Consensus Trees . 49 4.3 Quartets . 52 4.4 Supertrees . 53 4.5 Final Comments . 54 4.6 Exercises . 55 iii iv CONTENTS 5 Distance Methods 57 5.1 Dissimilarity Measures . 57 5.2 An Algorithmic Construction: UPGMA . 60 5.3 Unequal Branch Lengths . 62 5.4 The Four-point Condition . 66 5.5 The Neighbor Joining Algorithm . 70 5.6 Additional Comments . 72 5.7 Exercises . 73 6 Probabilistic Models of DNA Mutation 81 6.1 A first example . 81 6.2 Markov Models on Trees . 87 6.3 Jukes-Cantor and Kimura Models .
    [Show full text]
  • Translation Readthrough Mitigation Joshua A
    LETTER doi:10.1038/nature18308 Translation readthrough mitigation Joshua A. Arribere1, Elif S. Cenik1, Nimit Jain2, Gaelen T. Hess3, Cameron H. Lee3, Michael C. Bassik3 & Andrew Z. Fire1,3 A fraction of ribosomes engaged in translation will fail to into the 3′ UTR can confer substantial loss of protein expression for at terminate when reaching a stop codon, yielding nascent proteins least these three 3′ UTRs in C. elegans. inappropriately extended on their C termini. Although such To test whether translation into 3′ UTRs could confer a loss of extended proteins can interfere with normal cellular processes, protein expression more generally, a two-fluorescent-reporter system known mechanisms of translational surveillance1 are insufficient to with each fluorophore transgene containing an identical 3′ UTR was protect cells from potential dominant consequences. Here, through used. Nine genes were chosen to reflect a variety of functions and a combination of transgenics and CRISPR–Cas9 gene editing in expression levels: rps-17 (small ribosomal subunit component), r74.6 Caenorhabditis elegans, we demonstrate a consistent ability of cells (dom34/pelota release factor homologue), hlh-1 (muscle transcription to block accumulation of C-terminal-extended proteins that result factor), eef-1A.1 (also known as eft-3, translation elongation factor), from failure to terminate at stop codons. Sequences encoded by myo-2 (a pharyngeal myosin), mut-16 (involved in gene/transposon the 3′ untranslated region (UTR) were sufficient to lower protein silencing), bar-1 (a beta catenin), daf-6 (involved in amphid mor- levels. Measurements of mRNA levels and translation suggested a phogenesis), and alr-1 (neuronal transcription factor).
    [Show full text]
  • In Defence of the Three-Domains of Life Paradigm P.T.S
    van der Gulik et al. BMC Evolutionary Biology (2017) 17:218 DOI 10.1186/s12862-017-1059-z REVIEW Open Access In defence of the three-domains of life paradigm P.T.S. van der Gulik1*, W.D. Hoff2 and D. Speijer3* Abstract Background: Recently, important discoveries regarding the archaeon that functioned as the “host” in the merger with a bacterium that led to the eukaryotes, its “complex” nature, and its phylogenetic relationship to eukaryotes, have been reported. Based on these new insights proposals have been put forward to get rid of the three-domain Model of life, and replace it with a two-domain model. Results: We present arguments (both regarding timing, complexity, and chemical nature of specific evolutionary processes, as well as regarding genetic structure) to resist such proposals. The three-domain Model represents an accurate description of the differences at the most fundamental level of living organisms, as the eukaryotic lineage that arose from this unique merging event is distinct from both Archaea and Bacteria in a myriad of crucial ways. Conclusions: We maintain that “a natural system of organisms”, as proposed when the three-domain Model of life was introduced, should not be revised when considering the recent discoveries, however exciting they may be. Keywords: Eucarya, LECA, Phylogenetics, Eukaryogenesis, Three-domain model Background (English: domain) as a higher taxonomic level than The discovery that methanogenic microbes differ funda- regnum: introducing the three domains of cellular life, Ar- mentally from Bacteria such as Escherichia coli or Bacillus chaea, Bacteria and Eucarya. This paradigm was proposed subtilis constitutes one of the most important biological by Woese, Kandler and Wheelis in PNAS in 1990 [2].
    [Show full text]
  • Phylogenetics Topic 2: Phylogenetic and Genealogical Homology
    Phylogenetics Topic 2: Phylogenetic and genealogical homology Phylogenies distinguish homology from similarity Previously, we examined how rooted phylogenies provide a framework for distinguishing similarity due to common ancestry (HOMOLOGY) from non-phylogenetic similarity (ANALOGY). Here we extend the concept of phylogenetic homology by making a further distinction between a HOMOLOGOUS CHARACTER and a HOMOLOGOUS CHARACTER STATE. This distinction is important to molecular evolution, as we often deal with data comprised of homologous characters with non-homologous character states. The figure below shows three hypothetical protein-coding nucleotide sequences (for simplicity, only three codons long) that are related to each other according to a phylogenetic tree. In the figure the nucleotide sequences are aligned to each other; in so doing we are making the implicit assumption that the characters aligned vertically are homologous characters. In the specific case of nucleotide and amino acid alignments this assumption is called POSITIONAL HOMOLOGY. Under positional homology it is implicit that a given position, say the first position in the gene sequence, was the same in the gene sequence of the common ancestor. In the figure below it is clear that some positions do not have identical character states (see red characters in figure below). In such a case the involved position is considered to be a homologous character, while the state of that character will be non-homologous where there are differences. Phylogenetic perspective on homologous characters and homologous character states ACG TAC TAA SYNAPOMORPHY: a shared derived character state in C two or more lineages. ACG TAT TAA These must be homologous in state.
    [Show full text]
  • Phylogenetics of Buchnera Aphidicola Munson Et Al., 1991
    Türk. entomol. derg., 2019, 43 (2): 227-237 ISSN 1010-6960 DOI: http://dx.doi.org/10.16970/entoted.527118 E-ISSN 2536-491X Original article (Orijinal araştırma) Phylogenetics of Buchnera aphidicola Munson et al., 1991 (Enterobacteriales: Enterobacteriaceae) based on 16S rRNA amplified from seven aphid species1 Farklı yaprak biti türlerinden izole edilen Buchnera aphidicola Munson et al., 1991 (Enterobacteriales: Enterobacteriaceae)’nın 16S rRNA’ya göre filogenetiği Gül SATAR2* Abstract The obligate symbiont, Buchnera aphidicola Munson et al., 1991 (Enterobacteriales: Enterobacteriaceae) is important for the physiological processes of aphids. Buchnera aphidicola genes detected in seven aphid species, collected in 2017 from different plants and altitudes in Adana Province, Turkey were analyzed to reveal phylogenetic interactions between Buchnera and aphids. The 16S rRNA gene was amplified and sequenced for this purpose and a phylogenetic tree built up by the neighbor-joining method. A significant correlation between B. aphidicola genes and the aphid species was revealed by this phylogenetic tree and the haplotype network. Specimens collected in Feke from Solanum melongena L. was distinguished from the other B. aphidicola genes on Aphis gossypii Glover, 1877 (Hemiptera: Aphididae) with a high bootstrap value of 99. Buchnera aphidicola in Myzus spp. was differentiated from others, and the difference between Myzus cerasi (Fabricius, 1775) and Myzus persicae (Sulzer, 1776) was clear. Although, B. aphidicola is specific to its host aphid, certain nucleotide differences obtained within the species could enable specification to geographic region or host plant in the future. Keywords: Aphid, genetic similarity, phylogenetics, symbiotic bacterium Öz Obligat simbiyont, Buchnera aphidicola Munson et al., 1991 (Enterobacteriales: Enterobacteriaceae), yaprak bitlerinin fizyolojik olaylarının sürdürülmesinde önemli bir rol oynar.
    [Show full text]
  • Introductory Activities
    TEACHER’S GUIDE Introduction Dean Madden Introductory NCBE, University of Reading activities Version 1.0 CaseCase Studies introduction Introductory activities The activities in this section explain the basic principles behind the construction of phylogenetic trees, DNA structure and sequence alignment. Students are also intoduced to the Geneious software. Before carrying out the activities in the DNA to Darwin Case studies, students will need to understand: • the basic principles behind the construction of an evolutionary tree or phylogeny; • the basic structure of DNA and proteins; • the reasons for and the principle of alignment; • use of some features of the Geneious computer software (basic version). The activities in this introduction are designed to achieve this. Some of them will reinforce what students may already know; others involve new concepts. The material includes extension activities for more able students. Evolutionary trees In 1837, 12 years before the publication of On the Origin of Species, Charles Darwin famously drew an evolutionary tree in one of his notebooks. The Origin also included a diagram of an evolutionary tree — the only illustration in the book. Two years before, Darwin had written to his friend Thomas Henry Huxley, saying: ‘The time will come, I believe, though I shall not live to see it, when we shall have fairly true genealogical trees of each great kingdom of Nature.’ Today, scientists are trying to produce the ‘Tree of Life’ Darwin foresaw, using protein, DNA and RNA sequence data. Evolutionary trees are covered on pages 2–7 of the Student’s guide and in the PowerPoint and Keynote slide presentations.
    [Show full text]
  • 1 "Principles of Phylogenetics: Ecology
    "PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200 Spring 2016 University of California, Berkeley D.D. Ackerly March 7, 2016. Phylogenetics and Adaptation What is to be explained? • What is the evolutionary history of trait x that we see in a lineage (homology) or multiple lineages (homoplasy) - adaptations as states • Is natural selection the primary evolutionary process leading to the ‘fit’ of organisms to their environment? • Why are some traits more prevalent (occur in more species): number of origins vs. trait- dependent diversification rates (speciation – extinction) Some high points in the history of the adaptation debate: 1950s • Modern Synthesis of Genetics (Dobzhansky), Paleontology (Simpson) and Systematics (Mayr, Grant) 1960s • Rise of evolutionary ecology – synthesis of ecology with strong adaptationism via optimality theory, with little to no history; leads to Sociobiology in the 70s • Appearance of cladistics (Hennig) 1972 • Eldredge and Gould – punctuated equilibrium – argue that Modern Synthesis can’t explain pervasive observation of stasis in fossil record; Gould focuses on development and constraint as explanations, Eldredge more on ecology and importance of migration to minimize selective pressure 1979 • Gould and Lewontin – Spandrels – general critique of adaptationist program and call for rigorous hypothesis testing of alternatives for the ‘fit’ between organism and environment 1980’s • Debate on whether macroevolution can be explained by microevolutionary processes • Comparative methods
    [Show full text]
  • Comparative Evolution: Latent Potentials for Anagenetic Advance (Adaptive Shifts/Constraints/Anagenesis) G
    Proc. Natl. Acad. Sci. USA Vol. 85, pp. 5141-5145, July 1988 Evolution Comparative evolution: Latent potentials for anagenetic advance (adaptive shifts/constraints/anagenesis) G. LEDYARD STEBBINS* AND DANIEL L. HARTLtt *Department of Genetics, University of California, Davis, CA 95616; and tDepartment of Genetics, Washington University School of Medicine, Box 8031, 660 South Euclid Avenue, Saint Louis, MO 63110 Contributed by G. Ledyard Stebbins, April 4, 1988 ABSTRACT One of the principles that has emerged from genetic variation available for evolutionary changes (2), a experimental evolutionary studies of microorganisms is that major concern of modem evolutionists is explaining how the polymorphic alleles or new mutations can sometimes possess a vast amount of genetic variation that actually exists can be latent potential to respond to selection in different environ- maintained. Given the fact that in complex higher organisms ments, although the alleles may be functionally equivalent or most new mutations with visible effects on phenotype are disfavored under typical conditions. We suggest that such deleterious, many biologists, particularly Kimura (3), have responses to selection in microorganisms serve as experimental sought to solve the problem by proposing that much genetic models of evolutionary advances that occur over much longer variation is selectively neutral or nearly so, at least at the periods of time in higher organisms. We propose as a general molecular level. Amidst a background of what may be largely evolutionary principle that anagenic advances often come from neutral or nearly neutral genetic variation, adaptive evolution capitalizing on preexisting latent selection potentials in the nevertheless occurs. While much of natural selection at the presence of novel ecological opportunity.
    [Show full text]
  • The Caper Package: Comparative Analysis of Phylogenetics and Evolution in R
    The caper package: comparative analysis of phylogenetics and evolution in R David Orme April 16, 2018 This vignette documents the use of the caper package for R (R Development Core Team, 2011) in carrying out a range of comparative analysis methods for phylogenetic data. The caper package, and the code in this vignette, requires the ape package (Paradis et al., 2004) along with the packages mvtnorm and MASS. Contents 1 Background 2 2 Comparative datasets 3 2.1 The comparative.data class and objects. .3 2.1.1 na.omit ......................................3 2.1.2 subset ......................................5 2.1.3 [ ..........................................5 2.2 Example datasets . .6 3 Methods and functions provided by caper. 7 3.0.1 Phylogenetic linear models . .7 3.0.2 Fitting phylogenetic GLS models: pgls ....................8 3.1 Optimising branch length transformations: profile.pgls...............9 3.1.1 Criticism and simplification ofpgls models: plot, anova and AIC...... 11 3.2 Phylogenetic independent contrasts . 12 3.2.1 Variable names in contrast functions . 12 3.2.2 Continuous variables: crunch .......................... 13 3.2.3 Categorical variables: brunch .......................... 13 3.2.4 Species richness contrasts: macrocaic ..................... 14 3.2.5 Phylogenetic signal: phylo.d .......................... 15 3.3 Checking and comparing contrast models. 16 3.3.1 Testing evolutionary assumptions: caic.diagnostics............. 16 3.3.2 Robust contrasts: caic.robust ......................... 17 3.3.3 Model criticism: plot .............................. 19 3.3.4 Model comparison: anova & AIC ........................ 20 3.4 Other comparative functions . 21 3.4.1 Tree imbalance: fusco.test .......................... 21 3.5 Phylogenetic diversity: pd.calc, pd.bootstrap and ed.calc............
    [Show full text]
  • Molecular Evolution
    An Introduction to Bioinformatics Algorithms www.bioalgorithms.info Molecular Evolution An Introduction to Bioinformatics Algorithms www.bioalgorithms.info Outline • Evolutionary Tree Reconstruction • “Out of Africa” hypothesis • Did we evolve from Neanderthals? • Distance Based Phylogeny • Neighbor Joining Algorithm • Additive Phylogeny • Least Squares Distance Phylogeny • UPGMA • Character Based Phylogeny • Small Parsimony Problem • Fitch and Sankoff Algorithms • Large Parsimony Problem • Evolution of Wings • HIV Evolution • Evolution of Human Repeats An Introduction to Bioinformatics Algorithms www.bioalgorithms.info Early Evolutionary Studies • Anatomical features were the dominant criteria used to derive evolutionary relationships between species since Darwin till early 1960s • The evolutionary relationships derived from these relatively subjective observations were often inconclusive. Some of them were later proved incorrect An Introduction to Bioinformatics Algorithms www.bioalgorithms.info Evolution and DNA Analysis: the Giant Panda Riddle • For roughly 100 years scientists were unable to figure out which family the giant panda belongs to • Giant pandas look like bears but have features that are unusual for bears and typical for raccoons, e.g., they do not hibernate • In 1985, Steven O’Brien and colleagues solved the giant panda classification problem using DNA sequences and algorithms An Introduction to Bioinformatics Algorithms www.bioalgorithms.info Evolutionary Tree of Bears and Raccoons An Introduction to Bioinformatics Algorithms www.bioalgorithms.info Evolutionary Trees: DNA-based Approach • 40 years ago: Emile Zuckerkandl and Linus Pauling brought reconstructing evolutionary relationships with DNA into the spotlight • In the first few years after Zuckerkandl and Pauling proposed using DNA for evolutionary studies, the possibility of reconstructing evolutionary trees by DNA analysis was hotly debated • Now it is a dominant approach to study evolution.
    [Show full text]
  • Non-Synonymous to Synonymous Substitutions Suggest That Orthologs Tend to Keep Their Functions, While Paralogs Are a Source of Functional Novelty
    bioRxiv preprint doi: https://doi.org/10.1101/354704; this version posted July 23, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Non-synonymous to synonymous substitutions suggest that orthologs tend to keep their functions, while paralogs are a source of functional novelty Mario Esposito1, Gabriel Moreno-Hagelsieb1,* 1 Dept of Biology, Wilfrid Laurier University, Waterloo, Ontario, Canada N2L 3C5 * [email protected] Abstract Because orthologs diverge after speciation events, and paralogs after gene duplication, it is expected that orthologs should tend to keep their functions, while paralogs have been proposed as a source of new functions. This does not mean that paralogs should diverge much more than orthologs, but it certainly means that, if there is a difference, then orthologs should be more functionally stable. Since protein functional divergence follows from non-synonymous substitutions, here we present an analysis based on the ratio of non-synonymous to synonymous substitutions (dN=dS). The results showed orthologs to have noticeable and statistically significant lower values of dN=dS than paralogs, not only confirming that orthologs keep their functions better, butalso suggesting that paralogs are a readily source of functional novelty. Author summary Homologs are characters diverging from a common ancestor, with orthologs being homologs diverging after a speciation event, and paralogs diverging after a duplication event. Given those definitions, orthologs are expected to preserve their ancestral function, while paralogs have been proposed as potential sources of functional novelty.
    [Show full text]
  • 5. EVOLUTION AS a POPULATION-GENETIC PROCESS 5 April 2020
    æ 5. EVOLUTION AS A POPULATION-GENETIC PROCESS 5 April 2020 With knowledge on rates of mutation, recombination, and random genetic drift in hand, we now consider how the magnitudes of these population-genetic features dictate the paths that are open vs. closed to evolutionary exploitation in various phylogenetic lineages. Because historical contingencies exist throughout the Tree of Life, we cannot expect to derive from first principles the source of every molecular detail of cellular diversification. We can, however, use established theory to address more general issues, such as the degree of attainable molecular refinement, rates of transition from one state to another, and the degree to which nonadaptive processes (mutation and random genetic drift) contribute to phylogenetic diversification. Substantial reviews of the field of evolutionary theory appear in Charlesworth and Charlesworth (2010) and Walsh and Lynch (2018). Much of the field is con- cerned with the mechanisms maintaining genetic variation within populations, as this ultimately dictates various aspects of the short-term response to selection. Here, however, we are primarily concerned with long-term patterns of phylogenetic diver- sification, so the focus is on the divergence of mean phenotypes. This still requires some knowledge of the principles of population genetics, as evolutionary divergence is ultimately a consequence of the accrual of genetic modifications at the population level. All evolutionary change initiates as a transient phase of genetic polymor- phism, during which mutant alleles navigate the rough sea of random genetic drift, often being evaluated on various genetic backgrounds, with some paths being more accessible to natural selection than others.
    [Show full text]