Consensus Folding of Aligned Sequences As a New Measure for the Detection of Functional Rnas by Comparative Genomics
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
The ELIXIR Core Data Resources: Fundamental Infrastructure for The
Supplementary Data: The ELIXIR Core Data Resources: fundamental infrastructure for the life sciences The “Supporting Material” referred to within this Supplementary Data can be found in the Supporting.Material.CDR.infrastructure file, DOI: 10.5281/zenodo.2625247 (https://zenodo.org/record/2625247). Figure 1. Scale of the Core Data Resources Table S1. Data from which Figure 1 is derived: Year 2013 2014 2015 2016 2017 Data entries 765881651 997794559 1726529931 1853429002 2715599247 Monthly user/IP addresses 1700660 2109586 2413724 2502617 2867265 FTEs 270 292.65 295.65 289.7 311.2 Figure 1 includes data from the following Core Data Resources: ArrayExpress, BRENDA, CATH, ChEBI, ChEMBL, EGA, ENA, Ensembl, Ensembl Genomes, EuropePMC, HPA, IntAct /MINT , InterPro, PDBe, PRIDE, SILVA, STRING, UniProt ● Note that Ensembl’s compute infrastructure physically relocated in 2016, so “Users/IP address” data are not available for that year. In this case, the 2015 numbers were rolled forward to 2016. ● Note that STRING makes only minor releases in 2014 and 2016, in that the interactions are re-computed, but the number of “Data entries” remains unchanged. The major releases that change the number of “Data entries” happened in 2013 and 2015. So, for “Data entries” , the number for 2013 was rolled forward to 2014, and the number for 2015 was rolled forward to 2016. The ELIXIR Core Data Resources: fundamental infrastructure for the life sciences 1 Figure 2: Usage of Core Data Resources in research The following steps were taken: 1. API calls were run on open access full text articles in Europe PMC to identify articles that mention Core Data Resource by name or include specific data record accession numbers. -
Methods in and Applications of the Sequencing of Short Non-Coding Rnas" (2013)
University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2013 Methods in and Applications of the Sequencing of Short Non- Coding RNAs Paul Ryvkin University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Bioinformatics Commons, Genetics Commons, and the Molecular Biology Commons Recommended Citation Ryvkin, Paul, "Methods in and Applications of the Sequencing of Short Non-Coding RNAs" (2013). Publicly Accessible Penn Dissertations. 922. https://repository.upenn.edu/edissertations/922 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/922 For more information, please contact [email protected]. Methods in and Applications of the Sequencing of Short Non-Coding RNAs Abstract Short non-coding RNAs are important for all domains of life. With the advent of modern molecular biology their applicability to medicine has become apparent in settings ranging from diagonistic biomarkers to therapeutics and fields angingr from oncology to neurology. In addition, a critical, recent technological development is high-throughput sequencing of nucleic acids. The convergence of modern biotechnology with developments in RNA biology presents opportunities in both basic research and medical settings. Here I present two novel methods for leveraging high-throughput sequencing in the study of short non- coding RNAs, as well as a study in which they are applied to Alzheimer's Disease (AD). The computational methods presented here include High-throughput Annotation of Modified Ribonucleotides (HAMR), which enables researchers to detect post-transcriptional covalent modifications ot RNAs in a high-throughput manner. In addition, I describe Classification of RNAs by Analysis of Length (CoRAL), a computational method that allows researchers to characterize the pathways responsible for short non-coding RNA biogenesis. -
Fast and Reliable Prediction of Noncoding Rnas
Fast and reliable prediction of noncoding RNAs Stefan Washietl*, Ivo L. Hofacker*, and Peter F. Stadler*†‡ *Department of Theoretical Chemistry and Structural Biology, University of Vienna, Wa¨hringerstrasse 17, A-1090 Wien, Austria; and †Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Ha¨rtelstrasse 16-18, D-04107 Leipzig, Germany Communicated by Hans Frauenfelder, Los Alamos National Laboratory, Los Alamos, NM, December 14, 2004 (received for review November 2, 2004) We report an efficient method for detecting functional RNAs. The served noncoding elements in mammalian (or, more generally, approach, which combines comparative sequence analysis and vertebrate) genomes, and it must be expected that a significant structure prediction, already has yielded excellent results for a fraction of them are functional RNAs. small number of aligned sequences and is suitable for large-scale Possible candidates, however, have been identified only spo- genomic screens. It consists of two basic components: (i) a measure radically so far (19, 21), simply because there are no reliable tools for RNA secondary structure conservation based on computing a to scan multiple sequence alignments for functional RNAs. The consensus secondary structure, and (ii) a measure for thermody- most widely used program QRNA (22), which has been success- namic stability, which, in the spirit of a z score, is normalized with fully used to identify ncRNAs in bacteria (23) and yeast (24), is respect to both sequence length and base composition but can be not suitable for screens of large genomes. QRNA is limited to calculated without sampling from shuffled sequences. Functional pairwise alignments, and its reliability is low, especially if the RNA secondary structures can be identified in multiple sequence evolutionary distance of the two sequences lies outside of the alignments with high sensitivity and high specificity. -
Comparing Tools for Non-Coding RNA Multiple Sequence Alignment Based On
Downloaded from rnajournal.cshlp.org on September 26, 2021 - Published by Cold Spring Harbor Laboratory Press ES Wright 1 1 TITLE 2 RNAconTest: Comparing tools for non-coding RNA multiple sequence alignment based on 3 structural consistency 4 Running title: RNAconTest: benchmarking comparative RNA programs 5 Author: Erik S. Wright1,* 6 1 Department of Biomedical Informatics, University of Pittsburgh (Pittsburgh, PA) 7 * Corresponding author: Erik S. Wright ([email protected]) 8 Keywords: Multiple sequence alignment, Secondary structure prediction, Benchmark, non- 9 coding RNA, Consensus secondary structure 10 Downloaded from rnajournal.cshlp.org on September 26, 2021 - Published by Cold Spring Harbor Laboratory Press ES Wright 2 11 ABSTRACT 12 The importance of non-coding RNA sequences has become increasingly clear over the past 13 decade. New RNA families are often detected and analyzed using comparative methods based on 14 multiple sequence alignments. Accordingly, a number of programs have been developed for 15 aligning and deriving secondary structures from sets of RNA sequences. Yet, the best tools for 16 these tasks remain unclear because existing benchmarks contain too few sequences belonging to 17 only a small number of RNA families. RNAconTest (RNA consistency test) is a new 18 benchmarking approach relying on the observation that secondary structure is often conserved 19 across highly divergent RNA sequences from the same family. RNAconTest scores multiple 20 sequence alignments based on the level of consistency among known secondary structures 21 belonging to reference sequences in their output alignment. Similarly, consensus secondary 22 structure predictions are scored according to their agreement with one or more known structures 23 in a family. -
Annual Scientific Report 2013 on the Cover Structure 3Fof in the Protein Data Bank, Determined by Laponogov, I
EMBL-European Bioinformatics Institute Annual Scientific Report 2013 On the cover Structure 3fof in the Protein Data Bank, determined by Laponogov, I. et al. (2009) Structural insight into the quinolone-DNA cleavage complex of type IIA topoisomerases. Nature Structural & Molecular Biology 16, 667-669. © 2014 European Molecular Biology Laboratory This publication was produced by the External Relations team at the European Bioinformatics Institute (EMBL-EBI) A digital version of the brochure can be found at www.ebi.ac.uk/about/brochures For more information about EMBL-EBI please contact: [email protected] Contents Introduction & overview 3 Services 8 Genes, genomes and variation 8 Molecular atlas 12 Proteins and protein families 14 Molecular and cellular structures 18 Chemical biology 20 Molecular systems 22 Cross-domain tools and resources 24 Research 26 Support 32 ELIXIR 36 Facts and figures 38 Funding & resource allocation 38 Growth of core resources 40 Collaborations 42 Our staff in 2013 44 Scientific advisory committees 46 Major database collaborations 50 Publications 52 Organisation of EMBL-EBI leadership 61 2013 EMBL-EBI Annual Scientific Report 1 Foreword Welcome to EMBL-EBI’s 2013 Annual Scientific Report. Here we look back on our major achievements during the year, reflecting on the delivery of our world-class services, research, training, industry collaboration and European coordination of life-science data. The past year has been one full of exciting changes, both scientifically and organisationally. We unveiled a new website that helps users explore our resources more seamlessly, saw the publication of ground-breaking work in data storage and synthetic biology, joined the global alliance for global health, built important new relationships with our partners in industry and celebrated the launch of ELIXIR. -
A Unicellular Relative of Animals Generates a Layer of Polarized Cells
RESEARCH ARTICLE A unicellular relative of animals generates a layer of polarized cells by actomyosin- dependent cellularization Omaya Dudin1†*, Andrej Ondracka1†, Xavier Grau-Bove´ 1,2, Arthur AB Haraldsen3, Atsushi Toyoda4, Hiroshi Suga5, Jon Bra˚ te3, In˜ aki Ruiz-Trillo1,6,7* 1Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Spain; 2Department of Vector Biology, Liverpool School of Tropical Medicine, Liverpool, United Kingdom; 3Section for Genetics and Evolutionary Biology (EVOGENE), Department of Biosciences, University of Oslo, Oslo, Norway; 4Department of Genomics and Evolutionary Biology, National Institute of Genetics, Mishima, Japan; 5Faculty of Life and Environmental Sciences, Prefectural University of Hiroshima, Hiroshima, Japan; 6Departament de Gene`tica, Microbiologia i Estadı´stica, Universitat de Barcelona, Barcelona, Spain; 7ICREA, Barcelona, Spain Abstract In animals, cellularization of a coenocyte is a specialized form of cytokinesis that results in the formation of a polarized epithelium during early embryonic development. It is characterized by coordinated assembly of an actomyosin network, which drives inward membrane invaginations. However, whether coordinated cellularization driven by membrane invagination exists outside animals is not known. To that end, we investigate cellularization in the ichthyosporean Sphaeroforma arctica, a close unicellular relative of animals. We show that the process of cellularization involves coordinated inward plasma membrane invaginations dependent on an *For correspondence: actomyosin network and reveal the temporal order of its assembly. This leads to the formation of a [email protected] (OD); polarized layer of cells resembling an epithelium. We show that this stage is associated with tightly [email protected] (IR-T) regulated transcriptional activation of genes involved in cell adhesion. -
Deep Evolutionary Origin of Nematode SL2 Trans-Splicing Revealed by Genome-Wide
bioRxiv preprint doi: https://doi.org/10.1101/642082; this version posted May 27, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Deep evolutionary origin of nematode SL2 trans-splicing revealed by genome-wide analysis of the Trichinella spiralis transcriptome Marius Wenzel2, Christopher Johnston1, Berndt Müller1, Jonathan Pettitt1 and Bernadette Connolly1 Running Title: Conservation of nematode SL2 trans-splicing 1 School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, Institute of Medical Sciences, Foresterhill, Aberdeen, AB25 2ZD, UK 2 Centre of Genome-Enabled Biology and Medicine, University of Aberdeen, 23 St Machar Drive, Aberdeen AB24 3RY, UK Corresponding Author: Jonathan Pettitt, School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, Institute of Medical Sciences, Foresterhill, Aberdeen, AB25 2ZD, UK. Tel.: +44 1224 437516. Email: [email protected] Keywords: spliced leader trans-splicing; polycistronic RNA processing; eukaryotic operons; RNA splicing; nematode genome evolution 1 bioRxiv preprint doi: https://doi.org/10.1101/642082; this version posted May 27, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. ABSTRACT Spliced leader trans-splicing is intimately associated with the presence of eukaryotic operons, allowing the processing of polycistronic RNAs into individual mRNAs. -
U4 Small Nuclear RNA Dissociates from a Yeast Spliceosome And
MOLECULAR AND CELLULAR BIOLOGY, Nov. 1991, p. 5571-5577 Vol. 11, No. 11 0270-7306/91/115571-07$02.00/0 Copyright C) 1991, American Society for Microbiology U4 Small Nuclear RNA Dissociates from a Yeast Spliceosome and Does Not Participate in the Subsequent Splicing Reaction SHYUE-LEE YEAN AND REN-JANG LIN* Department of Microbiology, University of Texas at Austin, Austin, Texas 78712-1095 Received 16 April 1991/Accepted 19 August 1991 U4 and U6 small nuclear RNAs reside in a single ribonucleoprotein particle, and both are required for pre-mRNA splicing. The U4/U6 and U5 small nuclear ribonucleoproteins join Ul and U2 on the pre-mRNA during spliceosome assembly. Binding of U4 is then destabilized prior to or concomitant with the 5' cleavage-ligation. In order to test the role of U4 RNA, we isolated a functional spliceosome by using extracts prepared from yeast cells carrying a temperature-sensitive allele ofprp2 (rna2). The isolated prp2A spliceosome contains U2, U5, U6, and possibly also I11 and can be activated to splice the bound pre-mRNA. U4 RNA does not associate with the isolated spliceosomes and is shown not to be involved in the subsequent cleavage-ligation reactions. These results are consistent with the hypothesis that the role of U4 in pre-mRNA splicing is to deliver U6 to the spliceosome. Splicing of introns from nuclear pre-mRNAs occurs by mRNA in a spliceosome (19, 22). This prp2A spliceosome is two cleavage-ligation (transesterification) reactions. The first functional, since it can be activated to splice if supplemented reaction is a cleavage at the 5' splice site and the formation with splicing factors and ATP. -
Strategic Plan 2011-2016
Strategic Plan 2011-2016 Wellcome Trust Sanger Institute Strategic Plan 2011-2016 Mission The Wellcome Trust Sanger Institute uses genome sequences to advance understanding of the biology of humans and pathogens in order to improve human health. -i- Wellcome Trust Sanger Institute Strategic Plan 2011-2016 - ii - Wellcome Trust Sanger Institute Strategic Plan 2011-2016 CONTENTS Foreword ....................................................................................................................................1 Overview .....................................................................................................................................2 1. History and philosophy ............................................................................................................ 5 2. Organisation of the science ..................................................................................................... 5 3. Developments in the scientific portfolio ................................................................................... 7 4. Summary of the Scientific Programmes 2011 – 2016 .............................................................. 8 4.1 Cancer Genetics and Genomics ................................................................................ 8 4.2 Human Genetics ...................................................................................................... 10 4.3 Pathogen Variation .................................................................................................. 13 4.4 Malaria -
Forward Genetics
MOLECULAR AND CELLULAR BIOLOGY, Sept. 1992, p. 3939-3947 Vol. 12, No. 9 0270-7306/92/093939-09$02.00/0 Copyright X 1992, American Society for Microbiology PRP38 Encodes a Yeast Protein Required for Pre-mRNA Splicing and Maintenance of Stable U6 Small Nuclear RNA Levels STEVEN BLANTON, APARNA SRINIVASAN, AND BRIAN C. RYMOND* T. H. Morgan School ofBiological Sciences, University ofKentucky, Lexington, Kentucky 40506-0225 Downloaded from Received 7 April 1992/Returned for modification 14 May 1992/Accepted 17 June 1992 An essential pre-mRNA splicing factor, the product of the PRP38 gene, has been genetically identified in a screen of temperature-sensitive mutants of Saccharomyces cerevisiae. Shifting temperature-sensitive prp38 cultures from 23 to 37°C prevents the first cleavage-ligation event in the excision of introns from mRNA precursors. In vitro splicing inactivation and complementation studies suggest that the PRP38-encoded factor functions, at least in part, after stable splicing complex formation. The PRP38 locus contains a 726-bp open reading frame coding for an acidic 28-kDa polypeptide (PRP38). While PRP38 lacks obvious structural similarity to previously defined splicing factors, heat inactivation of PRP38, PRP19, or any of the known U6 http://mcb.asm.org/ (or U4/U6) small nuclear ribonucleoprotein-associating proteins (i.e., PRP3, PRP4, PRP6, and PRP24) leads to a common, unexpected consequence: intracellular U6 small nuclear RNA (snRNA) levels decrease as splicing activity is lost. Curiously, U4 snRNA, normally extensively base paired with U6 snRNA, persists in the virtual absence of U6 snRNA. The excision of intervening sequences from eukaryotic ates from the spliceosome (11, 37). -
Domains of Yeast U4 Spliceosomal RNA Required for PRP4 Protein Binding, Snrnp-Snrnp Interactions, and Pre- Mrna Splicing in Vivo
Downloaded from genesdev.cshlp.org on September 27, 2021 - Published by Cold Spring Harbor Laboratory Press Domains of yeast U4 spliceosomal RNA required for PRP4 protein binding, snRNP-snRNP interactions, and pre- mRNA splicing in vivo R6my Bordonn4, Josette Banroques,~, 2 John Abelson, 1 and Christine Guthrie s Department of Biochemistry and Biophysics, University of California at San Francisco, San Francisco, California 94143 USA; ~Division of Biology, California Institute of Technology, Pasadena, California 91125 USA U4 small nuclear RNA (snRNA) contains two intramolecular stem-loop structures, located near each end of the molecule. The 5' stem-loop is highly conserved in structure and separates two regions of U4 snRNA that base- pair with U6 snRNA in the U4/U6 small nuclear ribonucleoprotein particle (snRNP). The 3' stem-loop is highly divergent in structure among species and lies immediately upstream of the binding site for Sm proteins. To investigate the function of these two domains, mutants were constructed that delete the yeast U4 snRNA 5' stem-loop and that replace the yeast 3' stem-loop with that from trypanosome U4 snRNA. Both mutants fail to complement a null allele of the yeast U4 gene. The defects of the mutants have been examined in heterozygous strains by native gel electrophoresis, glycerol gradient centrifugation, and immunoprecipitation. The chimeric yeast-trypanosome RNA does not associate efficiently with U6 snRNA, suggesting that the 3' stem-loop of yeast U4 snRNA might be a binding site for a putative protein that facilitates assembly of the U4/U6 complex. In contrast, the 5' hairpin deletion mutant associates efficiently with U6 snRNA. -
Caenorhabditis Elegans Mrnas That Encode a Protein Similar to Adars Derive from an Operon Containing Six Genes Ronald F
3424–3432 Nucleic Acids Research, 1999, Vol. 27, No. 17 © 1999 Oxford University Press Caenorhabditis elegans mRNAs that encode a protein similar to ADARs derive from an operon containing six genes Ronald F. Hough, Arunth T. Lingam and Brenda L. Bass* Department of Biochemistry and Howard Hughes Medical Institute, University of Utah, 50 North Medical Drive, Salt Lake City, UT 84132, USA Received June 8, 1999; Revised and Accepted July 15, 1999 DDBJ/EMBL/GenBank accession nos+ ABSTRACT adenosines within base-paired regions of cellular pre-mRNAs and viral RNAs (reviewed in 5). In addition to the C-terminal The Caenorhabditis elegans T20H4.4 open reading domain, which contains the catalytic active site (2,6,7), frame (GenBank accession no. U00037) predicted by ADARs contain variable numbers of an amino acid sequence Genefinder encodes a 367 amino acid protein that is known as the dsRNA binding motif (dsRBM) (8,9). 32–35% identical to the C-terminal domain of adeno- We isolated several T20H4.4 clones from a C.elegans cDNA sine deaminases that act on RNA. We show that library. The cDNAs included two exons in addition to those T20H4.4 cDNAs (GenBank accession no. AF051275) identified as part of the T20H4.4 ORF by Genefinder (10), and encode a larger 495 amino acid protein that is encoded a larger protein (55.3 kDa) that contained a single extended at its N-terminus to include a single double- dsRBM. While one of the newly identified exons was created stranded RNA-binding motif, and that T20H4.4 occu- by conventional cis-splicing, the second derived from a trans- pies the second position in a six-gene operon splicing event since the 5' ends of the three longest clones con- (5'-T20H4.5, T20H4.4, R151.8A, R151.8B, R151.7, tained non-genomic spliced-leader (SL) sequences.