What Is a Gene, Post-ENCODE? History and Updated Definition
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Asymmetric Cryo-EM Reconstruction of Phage MS2 Reveals Genome Structure in Situ
ARTICLE Received 8 Feb 2016 | Accepted 11 Jul 2016 | Published 26 Aug 2016 DOI: 10.1038/ncomms12524 OPEN Asymmetric cryo-EM reconstruction of phage MS2 reveals genome structure in situ Roman I. Koning1,2, Josue Gomez-Blanco3, Inara Akopjana4, Javier Vargas3, Andris Kazaks4, Kaspars Tars4, Jose´ Marı´a Carazo3 & Abraham J. Koster1,2 In single-stranded ribonucleic acid (RNA) viruses, virus capsid assembly and genome packaging are intertwined processes. Using cryo-electron microscopy and single particle analysis we determined the asymmetric virion structure of bacteriophage MS2, which includes 178 copies of the coat protein, a single copy of the A-protein and the RNA genome. This reveals that in situ, the viral RNA genome can adopt a defined conformation. The RNA forms a branched network of stem-loops that almost all allocate near the capsid inner surface, while predominantly binding to coat protein dimers that are located in one-half of the capsid. This suggests that genomic RNA is highly involved in genome packaging and virion assembly. 1 Department of Molecular Cell Biology, Leiden University Medical Center, P.O. Box 9600, 2300 RC Leiden, The Netherlands. 2 Netherlands Centre for Electron Nanoscopy, Institute of Biology Leiden, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands. 3 Biocomputing Unit, Centro Nacional de Biotecnologı´a, Consejo Superior de Investigaciones Cientı´ficas (CNB-CSIC), Darwin 3, Campus Universidad Auto´noma, Cantoblanco, Madrid 28049, Spain. 4 Biomedical Research and Study Centre, Ratsupites 1, LV-1067 Riga, Latvia. Correspondence and requests for materials should be addressed to R.I.K. (email: [email protected]). -
Applied Category Theory for Genomics – an Initiative
Applied Category Theory for Genomics { An Initiative Yanying Wu1,2 1Centre for Neural Circuits and Behaviour, University of Oxford, UK 2Department of Physiology, Anatomy and Genetics, University of Oxford, UK 06 Sept, 2020 Abstract The ultimate secret of all lives on earth is hidden in their genomes { a totality of DNA sequences. We currently know the whole genome sequence of many organisms, while our understanding of the genome architecture on a systematic level remains rudimentary. Applied category theory opens a promising way to integrate the humongous amount of heterogeneous informations in genomics, to advance our knowledge regarding genome organization, and to provide us with a deep and holistic view of our own genomes. In this work we explain why applied category theory carries such a hope, and we move on to show how it could actually do so, albeit in baby steps. The manuscript intends to be readable to both mathematicians and biologists, therefore no prior knowledge is required from either side. arXiv:2009.02822v1 [q-bio.GN] 6 Sep 2020 1 Introduction DNA, the genetic material of all living beings on this planet, holds the secret of life. The complete set of DNA sequences in an organism constitutes its genome { the blueprint and instruction manual of that organism, be it a human or fly [1]. Therefore, genomics, which studies the contents and meaning of genomes, has been standing in the central stage of scientific research since its birth. The twentieth century witnessed three milestones of genomics research [1]. It began with the discovery of Mendel's laws of inheritance [2], sparked a climax in the middle with the reveal of DNA double helix structure [3], and ended with the accomplishment of a first draft of complete human genome sequences [4]. -
Gene Prediction: the End of the Beginning Comment Colin Semple
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by PubMed Central http://genomebiology.com/2000/1/2/reports/4012.1 Meeting report Gene prediction: the end of the beginning comment Colin Semple Address: Department of Medical Sciences, Molecular Medicine Centre, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, UK. E-mail: [email protected] Published: 28 July 2000 reviews Genome Biology 2000, 1(2):reports4012.1–4012.3 The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2000/1/2/reports/4012 © GenomeBiology.com (Print ISSN 1465-6906; Online ISSN 1465-6914) Reducing genomes to genes reports A report from the conference entitled Genome Based Gene All ab initio gene prediction programs have to balance sensi- Structure Determination, Hinxton, UK, 1-2 June, 2000, tivity against accuracy. It is often only possible to detect all organised by the European Bioinformatics Institute (EBI). the real exons present in a sequence at the expense of detect- ing many false ones. Alternatively, one may accept only pre- dictions scoring above a more stringent threshold but lose The draft sequence of the human genome will become avail- those real exons that have lower scores. The trick is to try and able later this year. For some time now it has been accepted increase accuracy without any large loss of sensitivity; this deposited research that this will mark a beginning rather than an end. A vast can be done by comparing the prediction with additional, amount of work will remain to be done, from detailing independent evidence. -
Distinctive Regulatory Architectures of Germline-Active and Somatic Genes in C
Downloaded from genome.cshlp.org on October 7, 2021 - Published by Cold Spring Harbor Laboratory Press Research Distinctive regulatory architectures of germline-active and somatic genes in C. elegans Jacques Serizay, Yan Dong, Jürgen Jänes, Michael Chesney, Chiara Cerrato, and Julie Ahringer The Gurdon Institute and Department of Genetics, University of Cambridge, CB2 1QN Cambridge, United Kingdom RNA profiling has provided increasingly detailed knowledge of gene expression patterns, yet the different regulatory ar- chitectures that drive them are not well understood. To address this, we profiled and compared transcriptional and regu- latory element activities across five tissues of Caenorhabditis elegans, covering ∼90% of cells. We find that the majority of promoters and enhancers have tissue-specific accessibility, and we discover regulatory grammars associated with ubiquitous, germline, and somatic tissue–specific gene expression patterns. In addition, we find that germline-active and soma-specific promoters have distinct features. Germline-active promoters have well-positioned +1 and −1 nucleosomes associated with a periodic 10-bp WW signal (W = A/T). Somatic tissue–specific promoters lack positioned nucleosomes and this signal, have wide nucleosome-depleted regions, and are more enriched for core promoter elements, which largely differ between tissues. We observe the 10-bp periodic WW signal at ubiquitous promoters in other animals, suggesting it is an ancient conserved signal. Our results show fundamental differences in regulatory architectures of germline and somatic tissue–specific genes, uncover regulatory rules for generating diverse gene expression patterns, and provide a tissue-specific resource for future studies. [Supplemental material is available for this article.] Cell type–specific transcription regulation underlies production tissues are achieved and whether expression is governed by dis- of the myriad of different cells generated during development. -
Encapsulation of Biomolecules in Bacteriophage MS2 Viral Capsids
Encapsulation of Biomolecules in Bacteriophage MS2 Viral Capsids By Jeff Edward Glasgow A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Chemistry in the Graduate Division of the University of California, Berkeley Committee in Charge: Professor Matthew Francis, Co-Chair Professor Danielle Tullman-Ercek, Co-Chair Professor Michelle Chang Professor John Dueber Spring 2014 Encapsulation of Biomolecules in Bacteriophage MS2 Viral Capsids Copyright ©2014 By: Jeff Edward Glasgow Abstract Encapsulation of Biomolecules in Bacteriophage MS2 Viral Capsids By Jeff Edward Glasgow Doctor of Philosophy in Chemistry University of California, Berkeley Professor Matthew Francis, Co-Chair Professor Danielle Tullman-Ercek, Co-Chair Nanometer-scale molecular assemblies have numerous applications in materials, catalysis, and medicine. Self-assembly has been used to create many structures, but approaches can match the extraordinary combination of stability, homogeneity, and chemical flexibility found in viral capsids. In particular, the bacteriophage MS2 capsid has provided a porous scaffold for several engineered nanomaterials for drug delivery, targeted cellular imaging, and photodynamic thera- py by chemical modification of the inner and outer surfaces of the shell. This work describes the development of new methods for reassembly of the capsid with concomitant encapsulation of large biomolecules. These methods were then used to encapsulate a variety of interesting cargoes, including RNA, DNA, protein-nucleic acid and protein-polymer conjugates, metal nanoparticles, and enzymes. To develop a stable, scalable method for encapsulation of biomolecules, the assembly of the capsid from its constituent subunits was analyzed in detail. It was found that combinations of negatively charged biomolecules and protein stabilizing agents could enhance reassembly, while electrostatic interactions of the biomolecules with the positively charged inner surface led to en- capsulation. -
Exploring the Structure of Long Non-Coding Rnas, J
IMF YJMBI-63988; No. of pages: 15; 4C: 3, 4, 7, 8, 10 1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 Irina V. Novikova, Scott P. Hennelly, Chang-Shung Tung and Karissa Y. Sanbonmatsu Q15 6 Los Alamos National Laboratory, Los Alamos, NM 87545, USA 7 Correspondence to Karissa Y. Sanbonmatsu: [email protected] 8 http://dx.doi.org/10.1016/j.jmb.2013.02.030 9 Edited by A. Pyle 1011 12 Abstract 13 Novel, profound and unexpected roles of long non-coding RNAs (lncRNAs) are emerging in critical aspects of 14 gene regulation. Thousands of lncRNAs have been recently discovered in a wide range of mammalian 15 systems, related to development, epigenetics, cancer, brain function and hereditary disease. The structural 16 biology of these lncRNAs presents a brave new RNA world, which may contain a diverse zoo of new 17 architectures and mechanisms. While structural studies of lncRNAs are in their infancy, we describe existing 18 structural data for lncRNAs, as well as crystallographic studies of other RNA machines and their implications 19 for lncRNAs. We also discuss the importance of dynamics in RNA machine mechanism. Determining 20 commonalities between lncRNA systems will help elucidate the evolution and mechanistic role of lncRNAs in 21 disease, creating a structural framework necessary to pursue lncRNA-based therapeutics. 22 © 2013 Published by Elsevier Ltd. 24 23 25 Introduction rather than the exception in the case of eukaryotic 50 organisms. 51 26 RNA is primarily known as an intermediary in gene LncRNAs are defined by the following: (i) lack of 52 11 27 expression between DNA and proteins. -
Differential Analysis of Gene Expression and Alternative Splicing
i bioRxiv preprint doi: https://doi.org/10.1101/2020.08.23.234237; this version posted August 24, 2020. The copyright holder for this preprint i (which was not certified by peer review)“main” is the author/funder, — 2020/8/23 who has — granted 9:38 — bioRxiv page a1 license — #1 to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. i i EmpiReS: Differential Analysis of Gene Expression and Alternative Splicing Gergely Csaba∗, Evi Berchtold*,y, Armin Hadziahmetovic, Markus Gruber, Constantin Ammar and Ralf Zimmer Department of Informatics, Ludwig-Maximilians-Universitat¨ Munchen,¨ 80333, Munich, Germany ABSTRACT to translation. Different transcripts of the same gene are often expressed at the same time, possibly at different abundance, While absolute quantification is challenging in high- yielding a defined mixture of isoforms. Changes to this throughput measurements, changes of features composition are further referred to as differential alternative between conditions can often be determined with splicing (DAS). Various diseases can be linked to DAS high precision. Therefore, analysis of fold changes events (3). Most hallmarks of cancer like tumor progression is the standard method, but often, a doubly and immune escape (4, 5) can be attributed to changes in differential analysis of changes of changes is the transcript composition (6). DAS plays a prominent role required. Differential alternative splicing is an in various types of muscular dystrophy (MD) (7), splicing example of a doubly differential analysis, i.e. fold intervention by targeted exon removal is a therapeutic option changes between conditions for different isoforms in Duchenne MD (8). -
Upstream Sequences Other Than AAUAAA Are Required for Efficient Messenger RNA 3’-End Formation in Plants
The Plant Cell, Vol. 2, 1261-1272, December 1990 O 1990 American Society of Plant Physiologists Upstream Sequences Other than AAUAAA Are Required for Efficient Messenger RNA 3’-End Formation in Plants Bradley D. Mogen, Margaret H. MacDonald, Robert Graybosch,’ and Arthur G. Hunt2 Plant Physiology/Biochemistry/MolecularBiology Program, Department of Agronomy, University of Kentucky, Lexington, Kentucky 40546-009 1 We have characterized the upstream nucleotide sequences involved in mRNA 3’-end formation in the 3‘ regions of the cauliflower mosaic virus (CaMV) 19S/35S transcription unit and a pea gene encoding ribulose-l,5-bisphosphate carboxylase small subunit (rbcs). Sequences between 57 bases and 181 bases upstream from the CaMV polyade- nylation site were required for efficient polyadenylation at this site. In addition, an AAUAAA sequence located 13 bases to 18 bases upstream from this site was also important for efficient mRNA 3’-end formation. An element located between 60 bases and 137 bases upstream from the poly(A) addition sites in a pea rbcS gene was needed for functioning of these sites. The CaMV -181/-57 and rbcS -137/-60 elements were different in location and sequence composition from upstream sequences needed for polyadenylation in mammalian genes, but resembled the signals that direct mRNA 3’-end formation in yeast. However, the role of the AAUAAA motif in 3’-end formation in the CaMV 3’ region was reminiscent of mRNA polyadenylation in animals. We suggest that multiple elements are involved in mRNA 3‘-end formation in plants, and that interactions of different components of the plant polyadenyl- ation apparatus with their respective sequence elements and with each other are needed for efficient mRNA 3‘-end formation. -
Personal and Population Genomics of Human Regulatory Variation
Downloaded from genome.cshlp.org on September 24, 2021 - Published by Cold Spring Harbor Laboratory Press Research Personal and population genomics of human regulatory variation Benjamin Vernot, Andrew B. Stergachis, Matthew T. Maurano, Jeff Vierstra, Shane Neph, Robert E. Thurman, John A. Stamatoyannopoulos,1 and Joshua M. Akey1 Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA The characteristics and evolutionary forces acting on regulatory variation in humans remains elusive because of the difficulty in defining functionally important noncoding DNA. Here, we combine genome-scale maps of regulatory DNA marked by DNase I hypersensitive sites (DHSs) from 138 cell and tissue types with whole-genome sequences of 53 geo- graphically diverse individuals in order to better delimit the patterns of regulatory variation in humans. We estimate that individuals likely harbor many more functionally important variants in regulatory DNA compared with protein-coding regions, although they are likely to have, on average, smaller effect sizes. Moreover, we demonstrate that there is sig- nificant heterogeneity in the level of functional constraint in regulatory DNA among different cell types. We also find marked variability in functional constraint among transcription factor motifs in regulatory DNA, with sequence motifs for major developmental regulators, such as HOX proteins, exhibiting levels of constraint comparable to protein-coding regions. Finally, we perform a genome-wide scan of recent positive selection and identify hundreds of novel substrates of adaptive regulatory evolution that are enriched for biologically interesting pathways such as melanogenesis and adipo- cytokine signaling. These data and results provide new insights into patterns of regulatory variation in individuals and populations and demonstrate that a large proportion of functionally important variation lies beyond the exome. -
POST-TRANSCRIPTIONAL REGULATION of AFP and Igm GENES
University of Kentucky UKnowledge University of Kentucky Doctoral Dissertations Graduate School 2011 POST-TRANSCRIPTIONAL REGULATION OF AFP AND IgM GENES Lilia M. Turcios University of Kentucky, [email protected] Right click to open a feedback form in a new tab to let us know how this document benefits ou.y Recommended Citation Turcios, Lilia M., "POST-TRANSCRIPTIONAL REGULATION OF AFP AND IgM GENES" (2011). University of Kentucky Doctoral Dissertations. 210. https://uknowledge.uky.edu/gradschool_diss/210 This Dissertation is brought to you for free and open access by the Graduate School at UKnowledge. It has been accepted for inclusion in University of Kentucky Doctoral Dissertations by an authorized administrator of UKnowledge. For more information, please contact [email protected]. ABSTRACT OF DISSERTATION Lilia M. Turcios The Graduate School University of Kentucky 2011 POST-TRANSCRIPTIONAL REGULATION OF AFP AND IgM GENES ABSTRACT OF DISSERTATION A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the College of Medicine at the University of Kentucky By Lilia M. Turcios Director: Dr. Martha Peterson Lexington, KY 2011 Copyright © Lilia M. Turcios 2011 ABSTRACT OF DISSERTATION POST-TRANSCRIPTIONAL REGULATION OF AFP AND IgM GENES Gene expression can be regulated at multiple steps once transcription is initiated. I have studied two different gene models, the α-Fetoprotein (AFP) and the immunoglobulin heavy chain (IgM) genes, to better understand post-transcriptional gene regulation mechanisms. The AFP gene is highly expressed during fetal liver development and dramatically repressed after birth. There is a mouse strain-specific difference between adult levels of AFP, with BALB/cJ mice expressing 10 to 20-fold higher levels compared to other mouse strains. -
RNA Components of the Spliceosome Regulate Tissue- and Cancer-Specific Alternative Splicing
Downloaded from genome.cshlp.org on September 29, 2021 - Published by Cold Spring Harbor Laboratory Press Research RNA components of the spliceosome regulate tissue- and cancer-specific alternative splicing Heidi Dvinge,1,2,4 Jamie Guenthoer,3 Peggy L. Porter,3 and Robert K. Bradley1,2 1Computational Biology Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA; 2Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA; 3Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA Alternative splicing of pre-mRNAs plays a pivotal role during the establishment and maintenance of human cell types. Characterizing the trans-acting regulatory proteins that control alternative splicing has therefore been the focus of much research. Recent work has established that even core protein components of the spliceosome, which are required for splicing to proceed, can nonetheless contribute to splicing regulation by modulating splice site choice. We here show that the RNA components of the spliceosome likewise influence alternative splicing decisions. Although these small nuclear RNAs (snRNAs), termed U1, U2, U4, U5, and U6 snRNA, are present in equal stoichiometry within the spliceosome, we found that their relative levels vary by an order of magnitude during development, across tissues, and across cancer samples. Physiologically relevant perturbation of individual snRNAs drove widespread gene-specific differences in alternative splic- ing but not transcriptome-wide splicing failure. Genes that were particularly sensitive to variations in snRNA abundance in a breast cancer cell line model were likewise preferentially misspliced within a clinically diverse cohort of invasive breast ductal carcinomas. -
Sensitive Detection of Live Escherichia Coli by Bacteriophage Amplification-Coupled
bioRxiv preprint doi: https://doi.org/10.1101/318071; this version posted May 9, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Sensitive detection of Live Escherichia coli by bacteriophage amplification-coupled 2 immunoassay on the Luminex® MAGPIX instrument 3 Tomotaka Midoa*, Eric M. Schafferb,c, Robert W. Dorseyc, Shanmuga Sozhamannand,e, E. 4 Randal Hofmannc,f 5 6 CBRN Detection Technology Section, CBRN Defense Technology Division, Advanced Defense 7 Technology Center, Acquisition, Technology and Logistics Agency (ATLA), Tokyo, Japana; 8 Leidos, Inc., Aberdeen Proving Ground, Maryland, USAb; Biosciences Division, U.S. Army 9 Edgewood Chemical Biological Center, Aberdeen Proving Grounds, Edgewood, MD, USAc; The 10 Tauri Group, LLC, Alexandria, VA, USAd; Defense Biological Product Assurance Office, JPM 11 G, JPEO, Frederick, MD, USAe; EXCET, Inc. Springfield, VA, USAf. 12 13 Running Head: Bacterial detection via phage on a MAGPIX instrument 14 15 #Address correspondence to [email protected] 16 *Present address: International Affairs Office, International Cooperation Division, Department 17 of Equipment Policy, Acquisition, Technology and Logistics Agency (ATLA), Tokyo, Japan 18 19 20 1 bioRxiv preprint doi: https://doi.org/10.1101/318071; this version posted May 9, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 21 Abstract (250 words) 22 Phages are natural predators of bacteria and have been exploited in bacterial detection because of 23 their exquisite specificity to their cognate bacterial hosts.