Inferring Phylogeny

Total Page:16

File Type:pdf, Size:1020Kb

Inferring Phylogeny Today’s topics Inferring phylogeny Overview of phylogenetic inferences Methodology Introduction ! Distance methods ! Parsimony method Methods using distance data UPGMA, Neighbor-joining, minimum evolution Methods using discrete data !"#$%&'(!)* Maximum parsimony +,- Maximum likelihood Bayesian inference .'/01!23454(6!7!2845*0&4'9#6!:&454(6 ;<=>?@AB=C?DEF 1 2 Milestones of molecular evolution studies Contributions to molecular evolution 1800 1900 2000 The patterns of evolution Cladistics and phylogenetics: Emergence of cladistics: Willi Hennig et al. The blooming of Willi Hennig (1913-1976) evo devo studies Discovery of DNA structure: James Watson & Francis Technical breakthrough Crick (1953) in the 1990s: computer The mechanisms Genetics: Gregor Mendel speed, methodological (1856-1863) First comparison of amino improvement The neutral theory of molecular evolution: acid sequence (insulin) Motoo Kimura (!"#$, 1924-1994) among different species: Technical breakthrough in Fred Sanger et al. (1955) the 1980s: PCR, DNA sequencing, etc. First example of molecular systematics: Vince Sarich The models and methodological concerns & Allan Wilson (1967) on The uses of parsimony, maximum likelihood methods in phylogenetics: Origin of species: Charles human evolution Joseph Felsenstein Darwin (1859) Genome Science and of Biology, University of Washington, Seattle Nature (1968) 217: 6243 3 The neutral theory of molecular evolution 4 Reconstructing phylogeny The basics of evolutionary trees The concepts of trees The tree shapes The data used to construct phylogenetic trees Morphological data Molecular data Homology of characters in data matrices Sequence alignment Are these two trees different? 5 Baum et al. (2005) Science 310:9796 5 6 Most recent common ancestor (MRCA) Types of data Character and character types Quantitative and qualitative characters Binary and multistate characters Assumptions about character evolution Substitution models Step matrix Which is the MRCA of a mushroom and a sponge? d Which is the MRCA of a mouse and a fern? b Plant Molecular Evolution 8 7 8 Distance method Evolution of DNA sequences UPGMA (Unweighted Pair Group Method using A SpA ACAGCCGATTGT Arithmetic averages) TCAGCCGACTGT Transformed distance method T TCAGCCGACTGT Neighbor-Joining method C SpB TCAGCCGATTGT Minimum evolution TCAGACGACTGT SpC TCAGCCGACTGT TCAGACGACTGT SpD TCAGACGACTGT 9 10 9 10 In distance method, sequences were grouped based on their Measuring genetic distance similarity, we need to measure the differences among sequences How different are the two sequences we have? SpA ACAGCCGATTGT The ways of measuring nucleotide substitution Simplest way Hamming distance = n/N x100% SpB TCAGCCGATTGT SpA ACAGCCGA-TGT SpB TCAGCCGA-TGT SpC TCAGCCGACTGT 2/12 = 16.6% SpC TCAGCCGACTGT 1/12 = 8.3% SpD TCAGACGACTGT SpD TCAGACGACTGT 11 12 Measuring differences between sequences The probability of nucleotide substitution The common ways SpC TCAGCCGACTGT Direct counting SpD TCAGACGACTGT The probability of substitution at certain nucleotide site change from i to j. In order to know the probability of the Nucleotide substitution models sequences to be different (p), we can calculate Jukes & Cantor(1969)’s one parameter model the probability of the sequences being the same Kimura(1980)'s two parameter model (I(t)) first. F81 (Felsenstein, 1981) Let’s start with Jukes-Cantor model HKY85 (Hasegawa et al., 1985) Generalized time reversible model (GTR) 13 13 14 We can just use another estimator, K, the number of substitutions per site since time of Example of calculation divergence between the two sequences If there are 80 transitions and 20 transversions between two " sequences (length=1000bp), the number of substitutions per A G site K can be estimated: X Under J-C model, 3"t 3"t " " " " 3 4 Y Z K = ! ln(1! p) = 0.10732 (p=0.1) " 4 3 C T Therefore, K = 2 x 3"t = 6"t Under Kimura 2P model: 1 1 1 1 4 3 4 K = ln( ) + ln( ) (P=0.08, Q=0.02) ! 8!t = "ln(1" p) " K = ! ln(1! p) 2 1! 2P ! Q 4 1! 2Q 3 4 3 = 0.10943 where p = the observed proportions of different nucleotides between the two sequences where P = the proportions of transitions and Q = the proportions of transversions. p is the proportion of total substitution. 16 15 16 Overview of Distance method UPGMA OTU A B C 1. Transform the data OTU (AB) C d matrix into distance table B AB d C (AB)C d d C AC BC d d D (AB)D CD d d d D AD BD CD where d(AB)C = (dAC + dBC)/2 and d(AB)D = (dAD + dBD)/2 2. Use this new data matrix to evaluate/ construct the tree (Page & Holme 1998)17 (Graur & Li 2000)18 17 18 Example Example Felsenstein (2004) 19 Felsenstein (2004) 20 19 20 Example UPGMA tree Felsenstein (2004) 21 Felsenstein (2004) 22 21 22 3 Neighbor-Joining (NJ) method True tree SP1: AAACGCGCTTAGATCGCCTAACGTACAAGC 10 SP2: TATAAAGGGTAGAAAACCTAAGGATCAACG Saitou & Nei (1987); Studier N1: AATCGCGCTTAGATCGCCTAACGTACAACG 2 & Keppler (1988) N2: AATCGCGGGTAGATCGCCTAACGTACAACG 3 SP3: AATCGCGGGTATTTCGCCTAACGTACATCG NJ does not assume a clock and approximates the minimum evolution method 7 SP1 SP2 SP2 15 4 SP1 3 SP3 8 13 4 SP3 SP1 SP2 SP3 24 23 24 NJ procedure 1. For each tip, compute n Limitation on distance methods Di ui = # j: j"i (n ! 2) All nucleotide sites change independently. i j D -u -u 2. Choose the and for which ij i j When evolutionary rates vary from site to site, is smallest than the data set needs to be corrected 3. Join i and j, compute the new node The substitution rate is constant over time and in (ij) to every other nodes different lineages. (Dik + D jk ! Dij ) The base composition is at equilibrium. D(ij ),k = 2 The conditional probabilities of nucleotide 4. Delete tips i and j and make new substitutions are the same for all sites and do not table with the new node (ij) change over time. 5. Continue till resolve the tree 25 26 25 26 Discrete methods Parsimony methods Maximum parsimony (%&'()*) The goal is to find the most parsimonious tree Maximum likelihood (+,-./)*) The criteria are to calculate the changes of Bayesian inference (01234567) character states, i.e. the evolutionary steps Others First, we have to know the way to evaluate a given tree 28 27 28 A simple data matrix w/ discrete characters A simple data matrix w/ discrete characters SpA: TCAGACGATTGTCAGACCATTG For position no. 3 SpA: TCAGACGATTGTCAGACCATTG SpB: TCAGTCGACTGTCAAACCATTG SpB: TCAGTCGACTGTCAAACCATTG SpC: TCGGTCAATTGTCAAACGATTG SpC: TCGGTCAATTGTCAAACGATTG SpD: TCGGTCAATTGTCAAACGATTG SpD: TCGGTCAATTGTCAAACGATTG A A G G A A G G A G G A SpA SpB SpC SpD SpA SpB SpD SpC SpA SpC SpD SpB SpA SpB SpC SpD SpA SpB SpD SpC SpA SpC SpD SpB A to G A A G G SpA SpB SpC SpD Character change (step) = 2 Character change (step) = 1 29 Character change (step) = 2 30 29 30 A simple data matrix w/ discrete characters A simple data matrix w/ discrete characters For position no. 7 SpA: TCAGACGATTGTCAGACCATTG SpA: TCAGACGATTGTCAGACCATTG SpB: TCAGTCGACTGTCAAACCATTG SpB: TCAGTCGACTGTCAAACCATTG SpC: TCGGTCAATTGTCAAACGATTG SpC: TCGGTCAATTGTCAAACGATTG SpD: TCGGTCAATTGTCAAACGATTG SpD: TCGGTCAATTGTCAAACGATTG G G A A G G A A G A A G G G A A G G A A G A A G SpA SpB SpC SpD SpA SpB SpD SpC SpA SpC SpD SpB SpA SpB SpC SpD SpA SpB SpD SpC SpA SpC SpD SpB G to A Character change (step) = 9 Character change (step) = 9 Character change (step) = 6 Character change (step) = 2 Character change (step) = 2 Character change (step) = 1 31 32 31 32 Consensus trees Finding the underlying information among trees So, how many trees out there are we dealing with? (Page & Holme 1998)33 34 33 34 Tree searching 3 taxa The number of unrooted trees: (2n ! 5)! N = unrooted 2n!3(n ! 3)! 4 taxa The number of rooted trees: (2n ! 3)! N = rooted 2n!2(n ! 2)! 35 36 35 36 Taxon number All possible unrooted tree number 3 1 Note 4 3 5 15 6 105 6 7 945 Say a computer can evaluate 10 trees per 8 10,395 9 135,135 second. 10 2,027,025 11 34,459,425 If we want to evaluate all of the trees for 25 taxa, 12 654,729,075 13 13,749,310,575 we will need 14 316,234,143,225 x = 1029/106/60/60/24/365 = 3.17x1015!"#$% 15 7,905,853,580,625 16 213,458,046,676,875 !& 17 6,190,283,353,629,375 18 191,898,783,962,510,625 19 6,332,659,870,762,850,625 We got to have some ways to approximate the true tree. 20 221,643,095,476,699,771,875 21 8,200,794,532,637,891,559,375 22 319,830,986,772,877,770,815,625 23 13,113,070,457,687,988,603,440,625 24 563,862,029,680,583,509,947,946,875 25 25,373,791,335,626,257,947,657,609,375 >1029 ... etc. 37 38 37 38 Ways to solve time paradigm The procedure can be simplified Skip unnecessary calculation SpA: TCAGACGATTGTCAGACCATTG Change the tree searching ways SpB: TCGGTCGACTGTCAGACCATTG SpC: TCAGTCGATTGTCA-ACGATTG SpD: TCAGTCGATTGTCA-ACGATTG SpE: TCAGTCGATCGTCA-ACGATTG Parsimonious Parsimonious uninformative informative 39 40 39 40 Searching for optimal trees Exhausted search Exhausted search Branch-and-bound method Heuristic approach Stepwise/closest/random addition Star decomposition Branch swapping 41 Page & Holme (1998) Molecular evolution 41 42 Tree-island profile: First Last First Times Distribution of tree length Island Size tree tree Score replicate hit ---------------------------------------------------------------------- Frequency distribution of lengths of 1000000 random trees: 1 101 1471 1571 3890 3 1 Results of heuristic search, 2 69 4961 5029 3890 10 1 3 1 14236 14236 3890 30 1 with random addition from mean=6528.095481 sd=65.005662 g1=-0.322317 g2=0.159848 4 5 14237 14241 3890 31 1 random starting points 6114.00000 /---------------------------------------------------------------------
Recommended publications
  • Uncorrelated Genetic Drift of Gene Frequencies and Linkage
    Genet. Res., Camb. (1974), 24, pp. 281-294 281 Printed in Great Britain Uncorrelated genetic drift of gene frequencies and linkage disequilibrium in some models of linked overdominant polymorphisms BY JOSEPH FELSENSTEIN Department of Genetics, University of Washington, Seattle, Washington 98195 {Received 15 April 1974) SUMMARY For large population sizes, gene frequencies p and q at two linked over- dominant loci and the linkage disequilibrium parameter D will remain close to their equilibrium values. We can treat selection and recombination as approximately linear forces on^J, q and D, and we can treat genetic drift as a multivariate normal perturbation with constant variance-covariance matrix. For the additive-multiplicative family of two-locus models, p, q and D are shown to be (approximately) uncorrelated. Expressions for their variances are obtained. When selection coefficients are small the variances of p and q are those previously given by Robertson for a single locus. For small recombination fractions the variance of D is that obtained for neutral loci by Ohta & Kimura. For larger recombination fractions the result differs from theirs, so that for unlinked loci r2~ 2/(3N) instead of l/(2N). For the Lewontin-Kojima and Bodmer symmetric viability models, and for a model symmetric at only one of the loci, a more exact argument is possible. In the asymptotic conditional distribution in these cases, various of p, q and D are uncorrelated, depending on the type of symmetiy in the model. 1. INTRODUCTION Much of the work on linked genes in the last few years has centred on the deter- ministic theory of natural selection of linked polymorphisms.
    [Show full text]
  • Workshop on Molecular Evolution (Extended Special Topics Session July 27-August 8,2003 August 8-August 15,2003) Course Director
    Workshop on Molecular Evolution July 27-August 8,2003 (Extended Special Topics Session August 8-August 15,2003) Course Director: Michael P. Cummings, University of Maryland and Marine Biological Laboratory Molecular evolution has become the nexus of many areas of biological research. It both brings together and enriches such areas as biochemistry, molecular biology, microbiology, population genetics, systematics, developmental biology, genomics, bioinformatics, in vitro evolution, and molecular ecology. The Workshop provides an important contribution to these fields in that it promotes interdisciplinary research and interaction, and thus provides a glue that sticks together disparate fields. Due to the wide range of fields addressed by the study of molecular evolution, it is difficult to offer a comprehensive course in a university setting. It is rare for a single institution to maintain expertise in all necessary areas. In contrast, the Workshop is uniquely able to provide necessary breadth and depth by utilizing a large number of faculty with appropriate expertise. Furthermore, the flexible nature of the Workshop allows for rapid adaptation to changes in the dynamic field of molecular evolution. For example, the 2003 Workshop included recently emergent research areas of molecular evolution of development and genomics. The interest in the Workshop remains very strong and is increasing. The number of applications for the 2003 course was 143, continuing the trend of increased applications since 2000. In 2003 there were 60 students participating in the Workshop, which was taught by 19 faculty and 4 teaching assistants. The students came from all over the world (1 7 countries), and represented several career stages: graduate students (57%), postdoctoral researchers (1 3%), faculty/principal investigators (27%), and other (3%).
    [Show full text]
  • American Society of Naturalists Honorary Lifetime Membership Awards
    The University of Chicago American Society of Naturalists Honorary Lifetime Membership Awards. Source: The American Naturalist, Vol. 183, No. 6 (June 2014), pp. ii-v Published by: The University of Chicago Press for The American Society of Naturalists Stable URL: http://www.jstor.org/stable/10.1086/676468 . Accessed: 28/09/2014 17:59 Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp . JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. The University of Chicago Press, The American Society of Naturalists, The University of Chicago are collaborating with JSTOR to digitize, preserve and extend access to The American Naturalist. http://www.jstor.org This content downloaded from 142.103.160.110 on Sun, 28 Sep 2014 17:59:55 PM All use subject to JSTOR Terms and Conditions American Society of Naturalists Honorary Lifetime Membership Awards Jane Lubchenco The American Society of Naturalists is pleased to award step of effective involvement in the political processes Jane Lubchenco an Honorary Lifetime Membership. Jane that direct and fund our endeavors. She has cofounded received her BA in biology from Colorado College, an three organizations: the Leopold Leadership Program, the MS in zoology from the University of Washington, and Communications Partnership for Science and the Sea, her doctorate from Harvard University, studying with and Climate Central.
    [Show full text]
  • JUMPSTARTING PHYLOGENETIC SEARCHES by Jesse L. Mecham A
    JUMPSTARTING PHYLOGENETIC SEARCHES by Jesse L. Mecham A thesis submitted to the faculty of Brigham Young University in partial fulfillment of the requirements for the degree of Master of Science Department of Computer Science Brigham Young University April 2006 Copyright c 2006 Jesse L. Mecham All Rights Reserved BRIGHAM YOUNG UNIVERSITY GRADUATE COMMITTEE APPROVAL of a thesis submitted by Jesse L. Mecham This thesis has been read by each member of the following graduate committee and by majority vote has been found to be satisfactory. Date Mark J. Clement, Chair Date Quinn O. Snell Date Dennis Ng BRIGHAM YOUNG UNIVERSITY As chair of the candidate’s graduate committee, I have read the thesis of Jesse L. Mecham in its final form and have found that (1) its format, citations, and biblio- graphical style are consistent and acceptable and fulfill university and department style requirements; (2) its illustrative materials including figures, tables, and charts are in place; and (3) the final manuscript is satisfactory to the graduate committee and is ready for submission to the university library. Date Mark J. Clement Chair, Graduate Committee Accepted for the Department Parris Egbert Graduate Coordinator Accepted for the College Tony Martinez Dean, College of Engineering and Technology ABSTRACT JUMPSTARTING PHYLOGENETIC SEARCHES Jesse L. Mecham Department of Computer Science Master of Science Phylogenetic analysis is a central tool in studies of comparative genomics. When a new region of DNA is isolated and sequenced, researchers are often forced to throw away months of computation on an existing phylogeny of homologous sequences in order to incorporate this new sequence.
    [Show full text]
  • A General Population Genetic Theory for the Evolution of Developmental Interactions
    A general population genetic theory for the evolution of developmental interactions Sean H. Rice* Department of Ecology and Evolutionary Biology, Osborn Memorial Laboratories, Yale University, New Haven, CT 06520 Communicated by Joseph Felsenstein, University of Washington, Seattle, WA, October 11, 2002 (received for review October 7, 2001) The development of most phenotypic traits involves complex The geometry of this surface is determined by how the interactions between many underlying factors, both genetic and underlying factors interact to influence phenotype, in other environmental. To study the evolution of such processes, a set of words, by development. There is a straightforward relation mathematical relationships is derived that describe how selection between the terminology of gene interaction and the geometry acts to change the distribution of genetic variation given arbitrarily of the phenotype landscape. If u1 and u2 are genetic (heritable) complex developmental interactions and any distribution of ge- factors, then the degree to which the value of u1 influences the netic and environmental variation. The result is illustrated by using phenotypic consequences of changing u2 (i.e., the epistatic Ѩ2␾͞Ѩ Ѩ it to derive models for the evolution of dominance and for the interaction between them) is u1 u2, which measures the evolutionary consequences of asymmetry in the distribution of Ѩ2␾͞Ѩ 2 curvature of the landscape in one direction. Similarly, u1 genetic variation. measures the nonlinear effects of changing u1 and thus (when u1 is genetic) provides a measure of dominance (10). If u1 is a Ѩ2␾͞Ѩ Ѩ uring development of a phenotypic trait, gene products genetic factor and u2 an environmental factor, then u1 u2 Dinteract in highly nonadditive ways with one another and measures the genotype by environment (G ϫ E) interaction.
    [Show full text]
  • Dobzhansky's Evolution of Tropical Populations, and the Science and Politics Of
    CARVALHO, Tito. “A most bountiful source of inspiration:” Dobzhansky’s evolution of tropical populations, and the science and politics of genetic variation. História, Ciências, Saúde – Manguinhos, Rio de Janeiro, v.26, n.1, “A most bountiful jan.-mar. 2019, p.281-297. Abstract source of inspiration:” Theodosius Dobzhansky has been studied for how he integrated Dobzhansky’s evolution of field naturalism and laboratory experimentation in ways that helped tropical populations, and produce the Modern Synthesis, as well as how he leveraged biological expertise to support liberal and cosmopolitan the science and politics of values amidst Second World War and the Cold War. Moreover, Dobzhansky genetic variation has been central in analyses of the institutionalization of genetics in Brazil, where he spent several years. This article “A mais abundante fonte situates Dobzhansky’s Brazilian research within the science of variation and the de inspiração”: Dobzhansky politics of diversity. I conclude by raising questions about how the ways in which e sua evolução sobre as science figured in politics depended on ideas about the role of scientists in society whichwere advanced in populações dos trópicos, parallel, suggesting research on the “co- a ciência e a política da production” of natural and social orders. Keywords: evolutionary genetics; variabilidade genética transnational science; eugenics; race; tropics; Theodosius Dobzhansky (1900- 1975). Resumo Theodosius Dobzhansky tem sido estudado pelo modo como ele integrou o naturalismo de campo e a experimentação científica, que deram origem à síntese moderna, assim como a alavanca que ele deu ao conhecimento biológico para apoiar valores liberais e cosmopolitas em meio à Segunda Guerra Mundial e à Guerra Fria.
    [Show full text]
  • Indo-European Phylogenetics with R a Tutorial Introduction
    Indo-European Linguistics (2020) 1–71 brill.com/ieul Indo-European phylogenetics with R A tutorial introduction David Goldstein University of California, Los Angeles, CA, USA [email protected] Abstract The last twenty or so years have witnessed a dramatic increase in the use of computa- tional methods for inferring linguistic phylogenies. Although the results of this research have been controversial, the methods themselves are an undeniable boon for histori- cal and Indo-European linguistics, if for no other reason than that they allow the field to pursue questions that were previously intractable. After a review of the advantages and disadvantages of computational phylogenetic methods, I introduce the following methods of phylogenetic inference in R: maximum parsimony; distance-based meth- ods (UPGMA and neighbor joining); and maximum likelihood estimation. I discuss the strengths and weaknesses of each of these methods and in addition explicate var- ious measures associated with phylogenetic estimation, including homoplasy indices and bootstrapping. Phylogenetic inference is carried out on the Indo-European dataset compiled by Don Ringe and Ann Taylor, which includes phonological, morphological, and lexical characters. Keywords phylogenetics – computational methods – parsimony – UPGMA – neighbor joining – maximum likelihood – homoplasy – bootstrapping 1 Introduction Phylogenetic trees model linguistic descent. More specifically, they are hypoth- eses about the order of lineage-splitting events from an often unobservable common ancestor to a set of observable descendants (Bowern & Koch 2004: 8–9, Pagel 2017: 152). The phylogeny of the Indo-European languages is a mat- © david goldstein, 2020 | doi:10.1163/22125892-20201000 This is an open access article distributed under the terms of the CC BY-NCDownloaded 4.0 license.
    [Show full text]
  • Phylip and Phylogenetics
    ® Genes, Genomes and Genomics ©2009 Global Science Books Phylip and Phylogenetics Ahmed Mansour* Genetics Department, Faculty of Agriculture, Zagazig University, Zagazig, Egypt Correspondence : * [email protected] ; [email protected] ABSTRACT Phylogenetics studies are mainly concerned with evolutionary relatedness among various groups of organisms. Recently, phylogenetic analyses have been performed on a genomic scale to address issues ranging from the prediction of gene and protein function to organismal relationships. Computing the relatedness of organisms either by phylogenetic (gene by gene analyses) or phylogenomic (the whole genome comparison) methods reveals high-quality results for demonstrating phylogenies. In this regard, Phylip (Phylogeny Inference Package) software is a free package of programs for inferring phylogenies of living species and organisms. It is now one of the most widely used packages for computing accurate phylogenetic trees and carrying out certain related tasks. This paper provides an overview on Phylip package and its applications and contribution to phylogenetic analyses. _____________________________________________________________________________________________________________ Keywords: bioinformatics, evolutionary relatedness, genetic diversity INTRODUCTION Phylip: Different useful programs The word phylogenetics is derived from the Greek words, The PHYLIP programs could be classified into five cate- phylon, which means tribe or race, and genetikos, which gories (Table 1): means birth. Phylogenetic
    [Show full text]
  • Phylogeny of Five Taxa in the Felsenstein and Farris Zones
    Phylogeny of Five Taxa in the Felsenstein and Farris Zones Eric Trung Lam Thesis submitted to the Faculty of Science in partial fulfillment of the requirements for the degree of Masters in Statistics with Specialization in Bioinformatics1 Mathematics and Statistics Faculty of Science University of Ottawa © Eric Trung Lam, Ottawa, Canada, 2021 1The program is a joint program with Carleton University, administered by the Ottawa-Carleton Institute of Mathematics and Statistics Abstract Mathematical conditions which showed where parsimony was not consistent for four taxa were first introduced by Felsenstein in 1978. This was subsequently labelled the \Felsenstein zone". Following Felsenstein's findings, `frequentists' conjectured that for five taxa there would also be a region in parameter space where parsimony is not consistent. In response, `cladists' claimed that parsimony was consistent in a different region of parameter space, which is called the \Farris zone". However, no analytical description of the region in which this consistency occurs has been made. Furthermore, no mathematical extensions of this Felsenstein theory to five taxa or more has been made. The same is true for the Farris zone. In this thesis, we give a complete account for the Felsenstein zone and Farris zone for four and five taxa and interpret these in terms of the shape of the phylogenetic tree. ii Dedications I would like to dedicate this to my parents and sister, whose love and support has helped me throughout my university career. iii Acknowledgements Firstly, I would like to express my thanks and gratitude to my supervisor, David Sankoff, without whom this thesis would not have been completed.
    [Show full text]
  • The Troubled Growth of Statistical Phylogenetics
    Syst. Biol. 50(4):465–467, 2001 The Troubled Growth of Statistical Phylogenetics JOSEPH FELSENSTEIN Department of Genetics, University of Washington, Seattle, Washington 98919 USA; E-mail: [email protected] Statistical inference of phylogenies almost of a small band of pioneers exploring new didn’t happen. The story of the origin, territory. growth, and spread of “statistical phyloge- The stage would now seem set for a grad- netics” needs to be told, because it is so ual spread of statistical methods, but reality strange. It is not the straightforward story of was not to be this simple. In 1969 I began gradual spread that one might imagine. to attend the annual Numerical Taxonomy It starts with the development of numeri- conferences convened by Bob Sokal. In cal methods in systematics, whose modern 1971, at that meeting in Ann Arbor, Gareth proponents were Sokal and Sneath. Their Nelson advocated Willi Hennig’s strictly work, embodied in their book Principles of monophyletic classication. It became clear Numerical Taxonomy, set off an explosion of that some systematists wanted to take a well- work by mathematical clusterers, but did not dened, almost algorithmic approach. Hen- win many converts in systematics. In the nig set forth well-dened methods for infer- early 1960s two groups started work on nu- ring phylogenies (provided there was no in- merical inference of phylogenies. Edwards ternal conict in the data), an approach with and Cavalli-Sforza, working on trees of enormous appeal to a new generation of mor- human populations and using gene frequen- phological systematists. cies, invented parsimony and distance ma- The difculty was that although well- trix methods.
    [Show full text]
  • Punctuated Evolution Shaped Modern Vertebrate Diversity
    bioRxiv preprint doi: https://doi.org/10.1101/151175; this version posted June 18, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Punctuated evolution shaped modern vertebrate diversity Michael J. Landis1 and Joshua G. Schraiber2, 3 1Department of Ecology and Evolutionary Biology, Yale University 2Department of Biology, Temple University 3Institute for Genomics and Evolutionary Medicine, Temple University June 16, 2017 Abstract The relative importance of different modes of evolution in shaping phenotypic diversity remains a hotly debated question. Fossil data suggest that stasis may be a common mode of evolution, while modern data suggest very fast rates of evolution. One way to reconcile these observations is to imagine that evolution is punctuated, rather than gradual, on geological time scales. To test this hypothesis, we developed a novel maximum likelihood framework for fitting L´evyprocesses to comparative morphological data. This class of stochastic processes includes both a gradual and punctuated component. We found that a plurality of modern vertebrate clades examined are best fit by punctuated processes over models of gradual change, gradual stasis, and adaptive radiation. When we compare our results to theoretical expectations of the rate and speed of regime shifts for models that detail fitness landscape dynamics, we find that our quantitative results are broadly compatible with both microevolutionary models and with observations from the fossil record. A key debate in evolutionary biology centers around the seeming contradictions regarding the tempo and mode of evolution as seen in fossil data compared to ecological data.
    [Show full text]
  • A Primer to Phylogenetic Analysis Using the PHYLIP Package
    A primer to phylogenetic analysis using the PHYLIP package Jarno Tuimala Fifth Edition All rights reserved. The PDF version of this book or parts of it can be used in Finnish universities as course material, provided that this copyright notice is included. However, this publication may not be sold or included as part of other publications without permission of the publisher. © The author and CSC – Scientific Computing Ltd. 2006 ISBN 952-5520-02-1 2 Index Index ..............................................................................................................................................................3 Preface ...........................................................................................................................................................4 Introduction ..................................................................................................................................................5 What is PHYLIP? ...................................................................................................................................5 Installation ..............................................................................................................................................5 User interface..........................................................................................................................................6 Getting started – datafiles and programs...................................................................................................6 Always keep
    [Show full text]