Gene Dosage and the Evolution of Gene Expression
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
Applied Category Theory for Genomics – an Initiative
Applied Category Theory for Genomics { An Initiative Yanying Wu1,2 1Centre for Neural Circuits and Behaviour, University of Oxford, UK 2Department of Physiology, Anatomy and Genetics, University of Oxford, UK 06 Sept, 2020 Abstract The ultimate secret of all lives on earth is hidden in their genomes { a totality of DNA sequences. We currently know the whole genome sequence of many organisms, while our understanding of the genome architecture on a systematic level remains rudimentary. Applied category theory opens a promising way to integrate the humongous amount of heterogeneous informations in genomics, to advance our knowledge regarding genome organization, and to provide us with a deep and holistic view of our own genomes. In this work we explain why applied category theory carries such a hope, and we move on to show how it could actually do so, albeit in baby steps. The manuscript intends to be readable to both mathematicians and biologists, therefore no prior knowledge is required from either side. arXiv:2009.02822v1 [q-bio.GN] 6 Sep 2020 1 Introduction DNA, the genetic material of all living beings on this planet, holds the secret of life. The complete set of DNA sequences in an organism constitutes its genome { the blueprint and instruction manual of that organism, be it a human or fly [1]. Therefore, genomics, which studies the contents and meaning of genomes, has been standing in the central stage of scientific research since its birth. The twentieth century witnessed three milestones of genomics research [1]. It began with the discovery of Mendel's laws of inheritance [2], sparked a climax in the middle with the reveal of DNA double helix structure [3], and ended with the accomplishment of a first draft of complete human genome sequences [4]. -
UNIVERSITY of CALIFORNIA SAN DIEGO Making Sense of Microbial
UNIVERSITY OF CALIFORNIA SAN DIEGO Making sense of microbial populations from representative samples A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Computer Science by James T. Morton Committee in charge: Professor Rob Knight, Chair Professor Pieter Dorrestein Professor Rachel Dutton Professor Yoav Freund Professor Siavash Mirarab 2018 Copyright James T. Morton, 2018 All rights reserved. The dissertation of James T. Morton is approved, and it is acceptable in quality and form for publication on microfilm and electronically: Chair University of California San Diego 2018 iii DEDICATION To my friends and family who paved the road and lit the journey. iv EPIGRAPH The ‘paradox’ is only a conflict between reality and your feeling of what reality ‘ought to be’ —Richard Feynman v TABLE OF CONTENTS Signature Page . iii Dedication . iv Epigraph . .v Table of Contents . vi List of Abbreviations . ix List of Figures . .x List of Tables . xi Acknowledgements . xii Vita ............................................. xiv Abstract of the Dissertation . xvii Chapter 1 Methods for phylogenetic analysis of microbiome data . .1 1.1 Introduction . .2 1.2 Phylogenetic Inference . .4 1.3 Phylogenetic Comparative Methods . .6 1.4 Ancestral State Reconstruction . .9 1.5 Analysis of phylogenetic variables . 11 1.6 Using Phylogeny-Aware Distances . 15 1.7 Challenges of phylogenetic analysis . 18 1.8 Discussion . 19 1.9 Acknowledgements . 21 Chapter 2 Uncovering the horseshoe effect in microbial analyses . 23 2.1 Introduction . 24 2.2 Materials and Methods . 34 2.3 Acknowledgements . 35 Chapter 3 Balance trees reveal microbial niche differentiation . 36 3.1 Introduction . -
2R Or Not 2R Is Not the Question Anymore
CORRESPONDENCE LINK TO ORIGINAL ARTICLE LINK TO INITIAL CORRESPONDENCE origin of evolutionary novelties are highly interesting questions, but they remain 2R or not 2R is not the largely unsolved11. However, evidence for the two rounds of genome duplication dur- question anymore ing chordate evolution is very strong, and it would seem safe to say that the debate Yves Van de Peer, Steven Maere and Axel Meyer over 2R is settled and is no longer an open question12. As we point out in our Opinion article1, it is only the evolutionary effects of In his comments on our Opinion article in passing in our recent review on WGDs events such as WGDs on evolution that are (The evolutionary significance of ancient and their significance for evolution in Nature debated1,11, and no longer whether or not genome duplications. Nature Rev. Genet. Reviews Genetics1. two rounds of WGD occurred during the 10, 725–732 (2009))1 Amir Ali Abbasi Abbasi2, however, questions the support evolution of chordates. (Piecemeal or big bangs: correlating the for the 2R hypothesis, claiming that it still Yves Van de Peer and Steven Maere are at the vertebrate evolution with proposed models remains debated today. He points out cor- Department of Plant Systems Biology, VIB (Flanders of gene expansion events. Nature Rev. Genet. rectly that evidence for the 2R hypothesis Institute of Biotechnology), B-9052 Ghent, Belgium. 6 Jan 2010 (doi:10.1038/nrg2600-c1))2 was based initially on data from only a Axel Meyer is at the Department of Biology, University argues that it is not justified to speculate small number of genes and vertebrate and of Konstanz, D-78457 Konstanz, Germany. -
Standardised Benchmarking in the Quest for Orthologs
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Harvard University - DASH Standardised Benchmarking in the Quest for Orthologs The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Altenhoff, A. M., B. Boeckmann, S. Capella-Gutierrez, D. A. Dalquen, T. DeLuca, K. Forslund, J. Huerta-Cepas, et al. 2016. “Standardised Benchmarking in the Quest for Orthologs.” Nature methods 13 (5): 425-430. doi:10.1038/nmeth.3830. http://dx.doi.org/10.1038/ nmeth.3830. Published Version doi:10.1038/nmeth.3830 Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:29408292 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Other Posted Material, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#LAA HHS Public Access Author manuscript Author ManuscriptAuthor Manuscript Author Nat Methods Manuscript Author . Author manuscript; Manuscript Author available in PMC 2016 October 04. Published in final edited form as: Nat Methods. 2016 May ; 13(5): 425–430. doi:10.1038/nmeth.3830. Standardised Benchmarking in the Quest for Orthologs Adrian M. Altenhoff1,2, Brigitte Boeckmann3, Salvador Capella-Gutierrez4,5,6, Daniel A. Dalquen7, Todd DeLuca8, Kristoffer Forslund9, Jaime Huerta-Cepas9, Benjamin Linard10, Cécile Pereira11,12, Leszek P. Pryszcz4, Fabian Schreiber13, Alan Sousa da Silva13, Damian Szklarczyk14,15, Clément-Marie Train1, Peer Bork9,16,17, Odile Lecompte18, Christian von Mering14,15, Ioannis Xenarios3,19,20, Kimmen Sjölander21, Lars Juhl Jensen22, Maria J. -
Increased Gene Dosage Plays a Predominant Role in the Initial Stages of Evolution of Duplicate Tem-1 Beta Lactamase Genes
ORIGINAL ARTICLE doi:10.1111/evo.12373 INCREASED GENE DOSAGE PLAYS A PREDOMINANT ROLE IN THE INITIAL STAGES OF EVOLUTION OF DUPLICATE TEM-1 BETA LACTAMASE GENES Riddhiman Dhar,1,2,3 Tobias Bergmiller,4,5 and Andreas Wagner1,2,6,7 1Institute of Evolutionary Biology and Environmental Studies, University of Zurich, CH-8057 Zurich, Switzerland 2The Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland 3Current address: Centre for Genomic Regulation (CRG), C/Dr. Aiguader 88, 08003 Barcelona, Spain 4ETH Zurich and Eawag, CH-8600 Dubendorf,¨ Switzerland 5Current address: Institute of Science and Technology, Am Campus 1, 3400 Klosterneuburg, Austria 6The Santa Fe Institute, Santa Fe, New Mexico 87501 7E-mail: [email protected] Received August 8, 2013 Accepted January 22, 2014 Gene duplication is important in evolution, because it provides new raw material for evolutionary adaptations. Several existing hy- potheses about the causes of duplicate retention and diversification differ in their emphasis on gene dosage, subfunctionalization, and neofunctionalization. Little experimental data exist on the relative importance of gene expression changes and changes in coding regions for the evolution of duplicate genes. Furthermore, we do not know how strongly the environment could affect this importance. To address these questions, we performed evolution experiments with the TEM-1 beta lactamase gene in Escherichia coli to study the initial stages of duplicate gene evolution in the laboratory. We mimicked tandem duplication by inserting two copies of the TEM-1 gene on the same plasmid. We then subjected these copies to repeated cycles of mutagenesis and selection in various environments that contained antibiotics in different combinations and concentrations. -
Changing the Name of the NBPF/DUF1220 Domain to the Olduvai Domain [Version 2; Peer Review: 3 Approved]
F1000Research 2018, 6:2185 Last updated: 31 AUG 2021 OPINION ARTICLE Changing the name of the NBPF/DUF1220 domain to the Olduvai domain [version 2; peer review: 3 approved] Previously titled: A proposal to change the name of the NBPF/DUF1220 domain to the Olduvai domain James M. Sikela 1, Frans van Roy2,3 1Department of Biochemistry and Molecular Genetics, Human Medical Genetics and Neuroscience Programs, University of Colorado School of Medicine, Aurora, CO, 80045, USA 2Department of Biomedical Molecular Biology, Ghent University, Ghent, 9052, Belgium 3VIB-UGent Center for Inflammation Research, Ghent, 9052, Belgium v2 First published: 28 Dec 2017, 6:2185 Open Peer Review https://doi.org/10.12688/f1000research.13586.1 Latest published: 17 Jul 2018, 6:2185 https://doi.org/10.12688/f1000research.13586.2 Reviewer Status Invited Reviewers Abstract We are jointly proposing a new name for a protein domain of 1 2 3 approximately 65 amino acids that has been previously termed NBPF or DUF1220. Our two labs independently reported the initial studies of version 2 this domain, which is encoded almost entirely within a single gene (revision) report report family. The name Neuroblastoma Breakpoint Family (NBPF) was 17 Jul 2018 applied to this gene family when the first identified member of the family was found to be interrupted in an individual with version 1 neuroblastoma. 28 Dec 2017 report report report Prior to this discovery, the Pfam database had termed the domain DUF1220, denoting it as one of many protein domains of unknown f unction. It has been Pfam’s intention to use “DUF” nomenclature to 1. -
Multiscale Modeling in Systems Biology
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 2051 Multiscale Modeling in Systems Biology Methods and Perspectives ADRIEN COULIER ACTA UNIVERSITATIS UPSALIENSIS ISSN 1651-6214 ISBN 978-91-513-1225-5 UPPSALA URN urn:nbn:se:uu:diva-442412 2021 Dissertation presented at Uppsala University to be publicly examined in 2446 ITC, Lägerhyddsvägen 2, Uppsala, Friday, 10 September 2021 at 10:15 for the degree of Doctor of Philosophy. The examination will be conducted in English. Faculty examiner: Professor Mark Chaplain (University of St Andrews). Abstract Coulier, A. 2021. Multiscale Modeling in Systems Biology. Methods and Perspectives. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 2051. 60 pp. Uppsala: Acta Universitatis Upsaliensis. ISBN 978-91-513-1225-5. In the last decades, mathematical and computational models have become ubiquitous to the field of systems biology. Specifically, the multiscale nature of biological processes makes the design and simulation of such models challenging. In this thesis we offer a perspective on available methods to study and simulate such models and how they can be combined to handle biological processes evolving at different scales. The contribution of this thesis is threefold. First, we introduce Orchestral, a multiscale modular framework to simulate multicellular models. By decoupling intracellular chemical kinetics, cell-cell signaling, and cellular mechanics by means of operator-splitting, it is able to combine existing software into one massively parallel simulation. Its modular structure makes it easy to replace its components, e.g. to adjust the level of modeling details. We demonstrate the scalability of our framework on both high performance clusters and in a cloud environment. -
OMA, a Comprehensive, Automated Project for the Identification of Orthologs from Complete Genome Data: Introduction and First Achievements
OMA, A Comprehensive, Automated Project for the Identification of Orthologs from Complete Genome Data: Introduction and First Achievements Christophe Dessimoz, Gina Cannarozzi, Manuel Gil, Daniel Margadant, Alexander Roth, Adrian Schneider, and Gaston H. Gonnet ETH Zurich, Institute of Computational Science, CH-8092 Z¨urich [email protected] Abstract. The OMA project is a large-scale effort to identify groups of orthologs from complete genome data, currently 150 species. The algo- rithm relies solely on protein sequence information and does not require any human supervision. It has several original features, in particular a verification step that detects paralogs and prevents them from being clustered together. Consistency checks and verification are performed throughout the process. The resulting groups, whenever a comparison could be made, are highly consistent both with EC assignments, and with assignments from the manually curated database HAMAP. A highly ac- curate set of orthologous sequences constitutes the basis for several other investigations, including phylogenetic analysis and protein classification. Complete genomes give scientists a valuable resource to assign functions to se- quences and to analyze their evolutionary history. These analyses rely heavily on gene comparison through sequence alignment algorithms that output the level of similarity, which gives an indication of homology. When homologous sequences are of interest, care must often be taken to distinguish between orthologous and paralogous proteins [1]. Both orthologs and paralogs come from the same ancestral sequence, and therefore are homologous, but they differ in the way they arise: paralogous se- quences are the product of gene duplication, while orthologous sequences are the product of speciation. -
Timing and Mechanism of Ancient Vertebrate Genome Duplications – the Adventure of a Hypothesis
ARTICLE IN PRESS TIGS 365 Review TRENDS in Genetics Vol.xx No.xx Monthxxxx Timing and mechanism of ancient vertebrate genome duplications – the adventure of a hypothesis Georgia Panopoulou and Albert J. Poustka Evolution and Development Group, Department of Vertebrate Genomics, Max-Planck Institut fu¨ r Molekulare Genetik, Ihnestrasse 73, D-14195 Berlin, Germany Complete genome doubling has long-term conse- period following the split of the cephalochordate and quences for the genome structure and the subsequent vertebrate lineages and before the emergence of gnathos- evolution of an organism. It has been suggested that tomes (Figure 1). Based on the apparent stepwise increase two genome duplications occurred at the origin of in the gene copy-number from invertebrates to jawless vertebrates (known as the 2R hypothesis). However, there has been considerable debate as to whether these were two successive duplications, or whether a single Glossary duplication occurred, followed by large-scale segmental (AB)(CD) topology measure: the nodes of the phylogenetic tree of four duplications. In this article, we review and compare the duplicates generated from two duplication events should have the (AB)(CD) evidence for the 2R duplications from vertebrate genomes topology where the dates of duplication for the (AB) and (CD) nodes are the same. Neighbor genes within paralogons that have the same topology are with similar data from other more recent polyploids. assumed to have been generated through the same event. Agnathans: jawless vertebrates. Aneuploidy: the loss or addition of one or more specific chromosomes to the normal set of chromosomes of an organism (e.g. a form of aneuploidy is Introduction trisomy 21). -
“Parent-Daughter” Relationships Among Vertebrate Paralogs
Reconstruction of the deep history of “Parent-Daughter” relationships among vertebrate paralogs Haiming Tang*, Angela Wilkins Mercury Data Science, Houston, TX, 77098 * Corresponding author Abstract: Gene duplication is a major mechanism through which new genetic material is generated. Although numerous methods have been developed to differentiate the ortholog and paralogs, very few differentiate the “Parent-Daughter” relationship among paralogous pairs. As coined by the Mira et al, we refer the “Parent” copy as the paralogous copy that stays at the original genomic position of the “original copy” before the duplication event, while the “Daughter” copy occupies a new genomic locus. Here we present a novel method which combines the phylogenetic reconstruction of duplications at different evolutionary periods and the synteny evidence collected from the preserved homologous gene orders. We reconstructed for the first time a deep evolutionary history of “Parent-Daughter” relationships among genes that were descendants from 2 rounds of whole genome duplications (2R WGDs) at early vertebrates and were further duplicated in later ceancestors like early Mammalia and early Primates. Our analysis reveals that the “Parent” copy has significantly fewer accumulated mutations compared with the “Daughter” copy since their divergence after the duplication event. More strikingly, we found that the “Parent” copy in a duplication event continues to be the “Parent” of the younger successive duplication events which lead to “grand-daughters”. Data availability: we have made the “Parent-Daughter” relationships publicly available at https://github.com/haimingt/Parent-Daughter-In-Paralogs/ Introduction Gene duplication has been widely accepted as a shaping force in evolution (Zhang, et al. -
A Newly Identified Myomegalin Isoform Functions in Golgi Microtubule Organization and ER–Golgi Transport
ß 2014. Published by The Company of Biologists Ltd | Journal of Cell Science (2014) 127, 4904–4917 doi:10.1242/jcs.155408 RESEARCH ARTICLE A newly identified myomegalin isoform functions in Golgi microtubule organization and ER–Golgi transport Zhe Wang, Chao Zhang and Robert Z. Qi* ABSTRACT the formation of ER–Golgi cargo carriers that associate with dynein–dynactin to move along microtubules towards the Golgi The Golgi of mammalian cells is known to be a major microtubule- (Presley et al., 1997; Scales et al., 1997; Watson et al., 2005). organizing site that requires microtubules for its organization and The Golgi serves as a major microtubule-organizing center protein trafficking. However, the mechanisms underlying the (Chabin-Brion et al., 2001; Efimov et al., 2007; Miller et al., 2009; microtubule organization of the Golgi remain obscure. We used Rivero et al., 2009). For example, almost half of all cellular immunoprecipitation coupled with mass spectrometry to identify a microtubules originate from the Golgi in human retinal pigment widely expressed isoform of the poorly characterized muscle protein epithelial RPE1 cells (Efimov et al., 2007). Moreover, microtubule myomegalin. This newly identified isoform, myomegalin variant 8 nucleation at the Golgi does not require centrosomes, and it (MMG8), localized predominantly to cis-Golgi networks by interacting depends instead on c-tubulin (Efimov et al., 2007), the principal with AKAP450 (also known as AKAP9), and this interaction with microtubule nucleator in cells, which exists in the form of c-tubulin AKAP450 was required for the stability of both proteins. Disrupting complexes (cTuCs). The cis-Golgi proteins AKAP450 (also known MMG8 expression affected endoplasmic reticulum (ER)-to-Golgi as AKAP9, AKAP350, CG-NAP and hyperion) and GMAP210 trafficking and caused Golgi fragmentation. -
Supplementary Data
Supplementary Fig. 1 A B Responder_Xenograft_ Responder_Xenograft_ NON- NON- Lu7336, Vehicle vs Lu7466, Vehicle vs Responder_Xenograft_ Responder_Xenograft_ Sagopilone, Welch- Sagopilone, Welch- Lu7187, Vehicle vs Lu7406, Vehicle vs Test: 638 Test: 600 Sagopilone, Welch- Sagopilone, Welch- Test: 468 Test: 482 Responder_Xenograft_ NON- Lu7860, Vehicle vs Responder_Xenograft_ Sagopilone, Welch - Lu7558, Vehicle vs Test: 605 Sagopilone, Welch- Test: 333 Supplementary Fig. 2 Supplementary Fig. 3 Supplementary Figure S1. Venn diagrams comparing probe sets regulated by Sagopilone treatment (10mg/kg for 24h) between individual models (Welsh Test ellipse p-value<0.001 or 5-fold change). A Sagopilone responder models, B Sagopilone non-responder models. Supplementary Figure S2. Pathway analysis of genes regulated by Sagopilone treatment in responder xenograft models 24h after Sagopilone treatment by GeneGo Metacore; the most significant pathway map representing cell cycle/spindle assembly and chromosome separation is shown, genes upregulated by Sagopilone treatment are marked with red thermometers. Supplementary Figure S3. GeneGo Metacore pathway analysis of genes differentially expressed between Sagopilone Responder and Non-Responder models displaying –log(p-Values) of most significant pathway maps. Supplementary Tables Supplementary Table 1. Response and activity in 22 non-small-cell lung cancer (NSCLC) xenograft models after treatment with Sagopilone and other cytotoxic agents commonly used in the management of NSCLC Tumor Model Response type