Molecular Evolution Charles F

V.1 Molecular Evolution

Charles F. Aquadro

OUTLINE

1. What is molecular evolution and why does it occur?
2. Origins of molecular evolution, the molecular clock, and the neutral theory
3. Predictions of the neutral theory for variation within and between species
4. The impact of natural selection on molecular variation and evolution
5. Biological insights from the study of molecular evolution
6. Conclusions

The molecules of life (DNA, RNA, and proteins) change over evolutionary time. Much can be learned about evolutionary process and biological function from the rates and patterns of change in these molecules. The study of these changes is the study of molecular evolution. This chapter discusses why these molecules change, what can be learned about pattern and process from these changes, and how the changes in the molecules of life can be used to infer important past evolutionary events.

GLOSSARY

Fixation. The population process in which, either by drift or by natural selection, a new mutation increases in frequency in a population until it replaces all other variants and reaches a frequency of 100 percent.

Molecular Clock. When the time at which organisms last shared a common ancestor is plotted over time (e.g., estimated time from the fossil record), a roughly linear accumulation of genetic changes in DNA and the encoded proteins is frequently observed. In 1965, the rough linearity of this accumulation of change motivated Emile Zuckerkandl and Linus Pauling to propose that these data represent a sort of “molecular clock” by which the amount of molecular divergence could be used to infer the date of a last common ancestor.

Molecular Evolution. Changes in the molecules of life (DNA, RNA, and protein) over generations, for many reasons, including mutation, genetic drift, and natural selection, resulting in different sequences of these molecules in different descendant lineages. The study of molecular evolution is the study of the patterns and process of change that result in these different sequences.

Mutation. Heritable change in genetic material, including base substitutions, insertions, deletions, and rearrangements; the ultimate source of new variation in populations.

Neutral Theory. Short for neutral mutation–random drift theory of molecular evolution, proposing that molecular variation is equivalent in function (selectively neutral), making genetic drift the main driver of molecular genetic change in populations over time.

Positive Selection. New advantageous mutations, or changing environments, can present opportunities for new, or currently existing, variants to now have a reproductive advantage. They thus relentlessly increase in frequency until they fix in the relevant population. The selective pressure that leads to this fixation is termed positive selection.

Purifying Selection. Selection against harmful (i.e., deleterious) mutations “purifies” the population of these harmful variants. Such selection is due to constraint, typically to maintain a specific important biological function.

1. WHAT IS MOLECULAR EVOLUTION AND WHY DOES IT OCCUR?

The molecules of life (DNA, RNA, and proteins) are not static. They change over evolutionary time, hence the term molecular evolution. Some molecules evolve rapidly and some only very, very slowly. Their change is due to the interplay between two fundamental evolutionary processes: mutation and fixation. Before discussing these two processes, it is important to note that mutations come in two basic varieties: heritable mutations that occur in the germ line and are passed on in the genome of progeny, and somatic mutations that can occur in the process of cell division during normal growth and development. For example, if the latter affect the control of certain cellular processes, they can lead to uncontrolled cell growth and cancer. Molecular evolutionary studies have traditionally focused only on heritable genetic changes that accumulate within and between organisms.

Inherited mutations occur primarily during DNA replication in the production of gametes and introduce new genetic variants into the population. For animals and plants, the relevant genetic molecule is DNA. For some viruses, the heritable molecule is RNA. Certain segments of DNA or RNA genomes are translated into proteins by the cells, and some changes lead to changes in the encoded protein sequence, leading to molecular evolution at the amino acid level as well. Heritable mutations are largely considered to occur at random in time and space and location across our genome, and at a relatively constant rate, at least for many organisms (see chapter IV.2).

The second process of molecular evolution is fixation, which is fundamentally the “population” phase leading to molecular evolutionary change. Most new mutations are lost from populations as a result of chance (genetic drift; see chapter IV.1), or because they are harmful (deleterious). Mutations remain in populations because of chance or because they increase the reproductive success of offspring carrying them. Drift can lead to rapid changes in frequencies of mutations in very small populations, but is much less influential in large populations. Ultimately, the outcome of drift alone is always the loss or fixation of every new mutation; in other words, every mutation will eventually reach 0 or 100 percent frequency. Fixation means that the new mutation now replaces all previous variants present (segregating) in the population at a particular site in the genome.

Fixation can also be caused, or assisted, by natural selection. Selection acting to directly increase a variant frequency, as the result of an increased relative reproductive success or survival of individuals carrying it, is termed positive selection, or often simply adaptation. Such selection can cause rapid changes of allele frequencies in populations, over tens of generations, versus millions of generations by genetic drift alone in large populations. While the underlying mutation rate is thought not responsive to selective challenges, the fixation process is strongly influenced by selection.

Harmful or deleterious mutations, or ones that reduce reproductive output or success, often will not be fixed but rather reduced in frequency or eliminated from populations. This is known as purifying selection, and it prevents the otherwise-inexorable, but very slow, march of successive neutral mutation fixations over time in all finite populations.

2. ORIGINS OF MOLECULAR EVOLUTION, THE MOLECULAR CLOCK, AND THE NEUTRAL THEORY

Molecular evolution as a field of study originated sort of accidentally, as biologists discovered how to determine the sequence of proteins and started collecting data from diverse organisms in the 1950s and 1960s. Key early data were for hemoglobin and histones, proteins chosen for their biomedical importance. The study of proteins was emphasized because of their clear functional relevance, and because methods were developed first for sequencing these biological molecules. Comparison of sequence divergence among proteins revealed two important patterns: specifically, fibrinopeptides, hemoglobin, and histones from various vertebrates with well-defined fossil records revealed that (1) different proteins evolved at different rates, but (2) each protein seemed to accumulate changes at a surprisingly consistent rate, a pattern that led to the concept of a “molecular clock” (figure 1).

The term molecular clock refers to both a mechanism and a tool for evolutionary studies. As a mechanism, the presence of a roughly constant accumulation of change led to the inference that chance, not local adaptation, is the cause of much of the observed molecular change. As a tool, the presence of a clock representing a molecule also meant that if its rate could be calibrated with organisms of known age (e.g., from the fossil record), then observed sequence differences for organisms without a fossil record could also be “dated.”

Until the mid-1960s, most evolutionary biologists considered natural selection the primary determinant of evolutionary change. Genetic drift was considered important only in small populations; for example, drift played an important role in Wright’s shifting balance theory, but not as a primary driving force in evolution, rather as a source of new combinations of alleles on which selection could act; thus, drift was rarely considered, and most models focused on selection.

Several observations in the mid-1960s and early 1970s challenged the dominance of selection as the driver of evolution. First, high levels of protein polymorphism (i.e., variation within populations) were observed in fruit flies, humans, and bacteria. Could all that variation within populations really be maintained by natural selection? Second, extrapolating from available data,

[Figure 1: plot of corrected amino acid changes per 100 residues against millions of years since divergence, with lines for fibrinopeptides, hemoglobin, and cytochrome c.]

Figure 1. Molecular clocks. Rates of amino acid substitution in three proteins: fibrinopeptides, hemoglobin, and cytochrome c. The number of amino acid differences (per 100 residues, … proteins are from the same pair of organisms and time of divergence, the three proteins are evolving at very different rates (i.e., fibrino-
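
The use of a molecular clock as a dating tool, as described in section 2, can be illustrated with a small calculation. The sketch below is only a minimal illustration under stated assumptions: the calibration points are hypothetical numbers invented for the example (they are not read from figure 1), the clock is assumed to be perfectly linear and to pass through the origin, and the function names `calibrate_rate` and `estimate_divergence` are made up here. The rate is fitted as the least-squares slope of amino acid changes per 100 residues against millions of years since divergence, and that slope is then used to date a pair of organisms lacking a fossil record.

```python
# Minimal molecular-clock sketch. All numbers are hypothetical and chosen
# only to illustrate the arithmetic; they are not data from the chapter.

def calibrate_rate(calibration_points):
    """Fit a clock rate from (divergence time in Myr, changes per 100 residues) pairs.

    Assumes a strictly linear clock through the origin, so the rate is the
    least-squares slope: sum(t * d) / sum(t * t).
    """
    numerator = sum(t * d for t, d in calibration_points)
    denominator = sum(t * t for t, _ in calibration_points)
    if denominator == 0:
        raise ValueError("need at least one calibration point with nonzero time")
    return numerator / denominator


def estimate_divergence(changes_per_100, rate):
    """Convert an observed divergence into an estimated time (Myr) using the rate."""
    if rate <= 0:
        raise ValueError("rate must be positive")
    return changes_per_100 / rate


# Hypothetical calibration pairs whose divergence times are assumed to be
# known from the fossil record: (millions of years, changes per 100 residues).
calibration = [(100.0, 12.5), (200.0, 25.0), (400.0, 50.0)]

rate = calibrate_rate(calibration)           # changes per 100 residues per Myr
estimate = estimate_divergence(25.0, rate)   # pair with no fossil record

print(f"rate: {rate:.3f} changes per 100 residues per Myr")
print(f"estimated divergence: {estimate:.1f} Myr")
```

With these invented inputs the fitted rate is 0.125 changes per 100 residues per million years, so an observed divergence of 25 changes per 100 residues corresponds to an estimate of 200 million years. A faster-evolving protein would give a steeper slope, which is why each protein needs its own calibration.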