BMC Genomics Biomed Central

Total Page:16

File Type:pdf, Size:1020Kb

BMC Genomics Biomed Central BMC Genomics BioMed Central Research article Open Access Evidence for conservation and selection of upstream open reading frames suggests probable encoding of bioactive peptides Mark L Crowe*†1,2, Xue-Qing Wang†3 and Joseph A Rothnagel1,2,3 Address: 1The Australian Research Council Special Research Centre for Functional and Applied Genomics, The University of Queensland, Brisbane, Queensland 4072, Australia, 2Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia and 3School of Molecular and Microbial Sciences, The University of Queensland, Brisbane, Queensland 4072, Australia Email: Mark L Crowe* - [email protected]; Xue-Qing Wang - [email protected]; Joseph A Rothnagel - [email protected] * Corresponding author †Equal contributors Published: 26 January 2006 Received: 05 September 2005 Accepted: 26 January 2006 BMC Genomics 2006, 7:16 doi:10.1186/1471-2164-7-16 This article is available from: http://www.biomedcentral.com/1471-2164/7/16 © 2006 Crowe et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background: Approximately 40% of mammalian mRNA sequences contain AUG trinucleotides upstream of the main coding sequence, with a quarter of these AUGs demarcating open reading frames of 20 or more codons. In order to investigate whether these open reading frames may encode functional peptides, we have carried out a comparative genomic analysis of human and mouse mRNA 'untranslated regions' using sequences from the RefSeq mRNA sequence database. Results: We have identified over 200 upstream open reading frames which are strongly conserved between the human and mouse genomes. Consensus sequences associated with efficient initiation of translation are overrepresented at the AUG trinucleotides of these upstream open reading frames, while comparative analysis of their DNA and putative peptide sequences shows evidence of purifying selection. Conclusion: The occurrence of a large number of conserved upstream open reading frames, in association with features consistent with protein translation, strongly suggests evolutionary maintenance of the coding sequence and indicates probable functional expression of the peptides encoded within these upstream open reading frames. Background ecule until reaching the first AUG, at which point transla- The 5' untranslated regions (5' UTRs) of vertebrate tion is initiated. Further work has since resulted in mRNAs typically vary from a few tens of bases up to sev- refinements to this model, such as the recognition of leaky eral hundred bases in length, and contain a variety of fea- scanning, where uAUGs in a sequence context which var- tures which affect the efficiency of translation of the main ies from the optimal consensus for initiation of transla- coding sequence (CDS) of the transcript. These include tion are bypassed by the ribosome complex, and of the length and secondary structure of the 5' UTR, the reinitiation, where the ribosome resumes scanning after sequence context of the initiation codon of the main CDS translation of upstream open reading frames (uORFs) and the presence of upstream AUG codons (uAUGs). The (reviewed in [2]). original scanning model of mRNA translation (reviewed in [1]) postulates that the translating ribosome enters the Both the leaky scanning and ribosome reinitiation mech- mRNA at the 5' end and processes linearly down the mol- anisms typically result in a decrease in efficiency of trans- Page 1 of 10 (page number not for citation purposes) BMC Genomics 2006, 7:16 http://www.biomedcentral.com/1471-2164/7/16 Table 1: Frequency of upstream AUGs and ORFs. Human Mouse Number of mRNA sequences in initial dataset (1) 16504 11291 Number of mRNA sequences containing > 1 uAUG (2) 9531 6352 Total number of uAUGs (3) 35599 24308 Number of mRNA sequences containing > 1 uORF (4) 4557 2820 Total number of uORFs (5) 8216 5487 Number of mRNA sequences containing > 1 uORF after removal of duplicates (6) 3924 2795 Total number of uORFs after removal of duplicates (7) 7138 5430 Number of mRNA sequences containing > 1 uORF after removal of blast matches (8) 3650 2678 Total number of uORFs after removal of blast matches (9) 6454 5089 Frequency of upstream AUGs and ORFs in human and mouse RefSeq mRNA sequences. Initial dataset refers to all sequences of > 60 nucleotides annotated as 5' UTRs. Numbers in parentheses indicate the corresponding stage in the filtering flowchart (figure 1). lation from subsequent initiation codons, in some cases Despite the examples above, most studies on the effects of virtually eliminating translation of the main CDS, uAUGs have considered the role of the uAUG as acting although more usually causing between two and 50-fold solely as an alternative site of ribosome initiation and reduction in protein levels [3-8]. It has therefore been pro- therefore to repress translation of the main CDS; the role posed that the presence of uAUGs may represent a of the associated uORF is apparently considered largely method of post-transcriptional regulation, through incidental. Even in reports describing the function of the repression of translation from the main AUG. This theory uORF itself, the role of the translated peptide is primarily is supported by the identification of transcriptional and a cis-acting one, either causing ribosome stalling at the ter- splice variants with an identical main CDS but with 5' mination codon of the uORF and a high level of repres- UTRs containing varying numbers of uAUGs [6,9,10], as sion of the main CDS [25,26] or, more occasionally, well as cases of an uAUG repressing translation in one tis- affecting mRNA stability. In this report we describe our sue type while not affecting it in another [11,12]. identification of a substantial number of uORF-encoded peptides (uPEPs) which are highly evolutionarily con- Furthermore a number of diseases have been associated served and which we propose have functions beyond that with uAUGs. These are caused either through mutations of simply reducing translation of the main CDS. introducing or eliminating uAUGs, resulting in a conse- quent decrease or increase in translation efficiency [13- Results 16], or by physiological changes which may affect the Presence of uAUGs and uORFs degree of repression by a particular uAUG [17] or may For this experiment, we restricted our analysis to uORFs of change splicing patterns of the 5' UTR to alter the number 20 to 99 codons in length. While these specific values are of uAUGs [6]. somewhat arbitrary, we chose them to maximise the prob- ability that the uORFs chosen for further analysis were In addition to repression of translation by uAUGs, addi- genuine and not affected by sequencing errors or cloning tional regulation of mRNAs containing uORFs can be artefacts. While it is possible that a uORF of less than 20 brought about by the uORF affecting transcript stability. codons could encode a functional peptide, ORFs longer The premature termination codon model for nonsense- than this are potentially easier to identify as conserved mediated decay (NMD) of mRNA (reviewed in [18-20]) between species and furthermore would be more amena- suggests that transcripts containing uORFs would effec- ble to subsequent experimental validation. We chose the tively have a premature termination codon at the end of upper limit after our initial results indicated that only a the uORF. Translation of the uORF would reduce progres- small fraction of uORFs (~5%) were more than 100 sion of the ribosome to the main CDS and ultimately to codons long, so their exclusion was unlikely to greatly its termination codon, and therefore may result in bias the analysis, while their inclusion would lead to the increased targetting of these molecules for NMD, thereby potential for contamination of our results by artefacts reducing the steady-state level of that particular transcript. such as co-ligated cDNA clones or misidentification of the Some uORFs do appear to have such a role in destabiliza- main CDS in a cDNA clone. tion of their mRNA by promoting NMD [21-23] while others have been shown to similarly destabilize mRNA, In the RefSeq release 6 database, there are 21768 human but via an NMD-independent pathway [24]. and 17106 mouse mature mRNA sequences. Of these, Page 2 of 10 (page number not for citation purposes) BMC Genomics 2006, 7:16 http://www.biomedcentral.com/1471-2164/7/16 Table 2: Frequency of downstream AUGs and ORFs. Human Mouse Number of mRNA sequences in initial dataset (1) 21597 14790 Number of mRNA sequences containing > 1 dAUG (2) 19853 14327 Total number of dAUGs (3) 352301 229945 Number of mRNA sequences containing > 1 dORF (4) 16965 11598 Total number of dORFs (5) 85876 55815 Number of mRNA sequences containing > 1 dORF after removal of duplicates (6) 14252 11377 Total number of dORFs after removal of duplicates (7) 69202 54403 Number of mRNA sequences containing > 1 dORF after removal of blast matches (8) 13899 11299 Total number of dORFs after removal of blast matches (9) 65258 52786 Frequency of downstream AUGs and ORFs (dAUGs and dORFs) in human and mouse RefSeq mRNA sequences. Initial dataset refers to all sequences of > 60 nucleotides annotated as 3' UTRs. Numbers in parentheses indicate the corresponding stage in the filtering flowchart (figure 1). approximately 75% and 66% respectively have an anno- in the 5' UTR. By using a relatively stringent filter, we tated 5' UTR of 60 bases or more in length (exact figures removed approximately 10% of the uPEPs which showed given in table 1). The difference in the proportion of long more than a small degree of similarity to proteins in the 5' UTRs is primarily because of a much higher number of NCBI non-redundant peptide sequence database, thus mouse sequences containing only the main CDS, with no minimising the possibility that the uORFs are in fact part 5' UTR given (11% of mouse sequences, compared with of a main CDS, and appear to be in the 5' UTR as a conse- only 4.5% of human ones).
Recommended publications
  • Mayr, Annu Rev Genet 2017
    GE51CH09-Mayr ARI 12 October 2017 10:21 Annual Review of Genetics Regulation by 3-Untranslated Regions Christine Mayr Department of Cancer Biology and Genetics, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA; email: [email protected] Annu. Rev. Genet. 2017. 51:171–94 Keywords First published as a Review in Advance on August noncoding RNA, regulatory RNA, alternative 3-UTRs, alternative 30, 2017 polyadenylation, cotranslational protein complex formation, cellular The Annual Review of Genetics is online at organization, mRNA localization, RNA-binding proteins, cooperativity, genet.annualreviews.org accessibility of regulatory elements https://doi.org/10.1146/annurev-genet-120116- 024704 Abstract Copyright c 2017 by Annual Reviews. 3 -untranslated regions (3 -UTRs) are the noncoding parts of mRNAs. Com- All rights reserved pared to yeast, in humans, median 3-UTR length has expanded approx- imately tenfold alongside an increased generation of alternative 3-UTR isoforms. In contrast, the number of coding genes, as well as coding region length, has remained similar. This suggests an important role for 3-UTRs in the biology of higher organisms. 3-UTRs are best known to regulate ANNUAL REVIEWS Further diverse fates of mRNAs, including degradation, translation, and localiza- Click here to view this article's tion, but they can also function like long noncoding or small RNAs, as has Annu. Rev. Genet. 2017.51:171-194. Downloaded from www.annualreviews.org online features: • Download figures as PPT slides been shown for whole 3 -UTRs as well as for cleaved fragments. Further- • Navigate linked references • Download citations more, 3 -UTRs determine the fate of proteins through the regulation of Access provided by Memorial Sloan-Kettering Cancer Center on 11/30/17.
    [Show full text]
  • Regulation of Translation by the 3' Untranslated Region of Barley Yellow Dwarf Virus-PAV RNA Shanping Wang Iowa State University
    Iowa State University Capstones, Theses and Retrospective Theses and Dissertations Dissertations 1996 Regulation of translation by the 3' untranslated region of barley yellow dwarf virus-PAV RNA Shanping Wang Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/rtd Part of the Molecular Biology Commons, and the Plant Pathology Commons Recommended Citation Wang, Shanping, "Regulation of translation by the 3' untranslated region of barley yellow dwarf virus-PAV RNA " (1996). Retrospective Theses and Dissertations. 11426. https://lib.dr.iastate.edu/rtd/11426 This Dissertation is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Retrospective Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. INFORMATION TO USERS This manuscript has been reproduced from the microfilm master. UMI films the text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer. The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleedthrough, substandard margins, and improper alignment can adversely afTect reproduction. In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-hand comer and continuing from left to right in equal sections with small overlaps.
    [Show full text]
  • Mechanisms of Mrna Polyadenylation
    Turkish Journal of Biology Turk J Biol (2016) 40: 529-538 http://journals.tubitak.gov.tr/biology/ © TÜBİTAK Review Article doi:10.3906/biy-1505-94 Mechanisms of mRNA polyadenylation Hızlan Hıncal AĞUŞ, Ayşe Elif ERSON BENSAN* Department of Biology, Arts and Sciences, Middle East Technical University, Ankara, Turkey Received: 26.05.2015 Accepted/Published Online: 21.08.2015 Final Version: 18.05.2016 Abstract: mRNA 3’-end processing involves the addition of a poly(A) tail based on the recognition of the poly(A) signal and subsequent cleavage of the mRNA at the poly(A) site. Alternative polyadenylation (APA) is emerging as a novel mechanism of gene expression regulation in normal and in disease states. APA results from the recognition of less canonical proximal or distal poly(A) signals leading to changes in the 3’ untranslated region (UTR) lengths and even in some cases changes in the coding sequence of the distal part of the transcript. Consequently, RNA-binding proteins and/or microRNAs may differentially bind to shorter or longer isoforms. These changes may eventually alter the stability, localization, and/or translational efficiency of the mRNAs. Overall, the 3’ UTRs are gaining more attention as they possess a significant posttranscriptional regulation potential guided by APA, microRNAs, and RNA-binding proteins. Here we provide an overview of the recent developments in the APA field in connection with cancer as a potential oncogene activator and/or tumor suppressor silencing mechanism. A better understanding of the extent and significance of APA deregulation will pave the way to possible new developments to utilize the APA machinery and its downstream effects in cancer cells for diagnostic and therapeutic applications.
    [Show full text]
  • Translation from the 5′ Untranslated Region Shapes the Integrated Stress Response Shelley R
    RESEARCH ◥ duce tracer peptides. These translation prod- RESEARCH ARTICLE SUMMARY ucts are processed and loaded onto major histocompatibility complex class I (MHC I) molecules in the ER and transit to the cell STRESS RESPONSE surface, where they can be detected by specific T cell hybridomas that are activated and quan- ′ tified using a colorimetric reagent. 3T provides Translation from the 5 untranslated an approach to interrogate the thousands of predicted uORFs in mammalian genomes, char- region shapes the integrated acterize the importance of uORF biology for regulation, and generate fundamental insights stress response into uORF mutation-based diseases. Shelley R. Starck,* Jordan C. Tsai, Keling Chen, Michael Shodiya, Lei Wang, RESULTS: 3T proved to be a sensitive and Kinnosuke Yahiro, Manuela Martins-Green, Nilabh Shastri,* Peter Walter* robust indicator of uORF expression. We mea- sured uORF expression in the 5′ UTR of INTRODUCTION: Protein synthesis is con- mRNAs, such as ATF4 and CHOP, that harbor mRNAs at multiple distinct regions, while trolled by a plethora of developmental and small upstream open reading frames (uORFs) ◥ simultaneously detecting environmental conditions. One intracellular in their 5′ untranslated regions (5′ UTRs). Still ON OUR WEB SITE expression of the CDS. We signaling network, the integrated stress re- other mRNAs sustain translation despite ISR Read the full article directly measured uORF sponse (ISR), activates one of four kinases in activation. We developed tracing translation at http://dx.doi. peptide expression from response to a variety of distinct stress stimuli: by T cells (3T) as an exquisitely sensitive tech- org/10.1126/ ATF4 mRNA and showed the endoplasmic reticulum (ER)–resident kinase nique to probe the translational dynamics of science.aad3867 that its translation per- .................................................
    [Show full text]
  • Gene Expression Regulation by Upstream Open Reading Frames In
    Silva J, Fernandes R, Romão L. J Rare Dis Res Treat. (2017) 2(4): 33-38 Journal of www.rarediseasesjournal.com Rare Diseases Research & Treatment Mini review Open Access Gene expression regulation by upstream open reading frames in rare diseases Joana Silva1,2, Rafael Fernandes1,2, and Luísa Romão1,2* 1Department of Human Genetics, Instituto Nacional de Saúde Doutor Ricardo Jorge, Lisboa, Portugal 2Gene Expression and Regulation Group, Biosystems & Integrative Sciences Institute (BioISI), Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal Article Info ABSTRACT Article Notes Upstream open reading frames (uORFs) constitute a class of cis-acting Received: June 19, 2017 elements that regulate translation initiation. Mutations or polymorphisms that Accepted: July 21, 2017 alter, create or disrupt a uORF have been widely associated with several human *Correspondence: disorders, including rare diseases. In this mini-review, we intend to highlight the Dr. Luísa Romão, Department of Human Genetics, Instituto mechanisms associated with the uORF-mediated translational regulation and Nacional de Saúde Doutor Ricardo Jorge, Av. Padre Cruz, describe recent examples of their deregulation in the etiology of human rare 1649-016, Lisboa, Portugal; Tel: (+351) 21 750 8155; Fax: diseases. Additionally, we discuss new insights arising from ribosome profiling (+351) 21 752 6410; studies and reporter assays regarding uORF features and their intrinsic role E-mail: [email protected]. in translational regulation. This type of knowledge is of most importance to © 2017 Romão L. This article is distributed under the terms of design and implement new or improved diagnostic and/or treatment strategies the Creative Commons Attribution 4.0 International License.
    [Show full text]
  • Reviews/0004.1
    http://genomebiology.com/2002/3/3/reviews/0004.1 Review Untranslated regions of mRNAs comment Flavio Mignone*, Carmela Gissi*, Sabino Liuni† and Graziano Pesole* Addresses: *Dipartimento di Fisiologia e Biochimica Generali, Università di Milano, Via Celoria, 26, 20133 Milano, Italy. †Centro di Studio sui Mitocondri e Metabolismo Energetico, Via Amendola 166/5, 70126 Bari, Italy. Correspondence: Graziano Pesole. E-mail: [email protected] reviews Published: 28 February 2002 Genome Biology 2002, 3(3):reviews0004.1–0004.10 The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2002/3/3/reviews/0004 © BioMed Central Ltd (Print ISSN 1465-6906; Online ISSN 1465-6914) Abstract reports Gene expression is finely regulated at the post-transcriptional level. Features of the untranslated regions of mRNAs that control their translation, degradation and localization include stem-loop structures, upstream initiation codons and open reading frames, internal ribosome entry sites and various cis-acting elements that are bound by RNA-binding proteins. deposited research The recent analysis of the human genome [1,2] and the data 5? untranslated region (5? UTR), a coding region made up of available about other higher eukaryotic genomes have triplet codons that each encode an amino acid and a 3? revealed that only a small fraction of the genetic material - untranslated region (3? UTR). Figure 1 shows these and about 1.5 % - codes for protein. Indeed, most genomic DNA other features of mRNAs. is
    [Show full text]
  • The Role of RNA Structure at 5 Untranslated Region in Microrna
    Downloaded from rnajournal.cshlp.org on September 27, 2021 - Published by Cold Spring Harbor Laboratory Press BIOINFORMATICS The role of RNA structure at 5′ untranslated region in microRNA-mediated gene regulation WANJUN GU,1 YUMING XU,2 XUEYING XIE,1 TING WANG,3 JAE-HONG KO,4 and TONG ZHOU3 1Research Center for Learning Sciences, Southeast University, Nanjing, Jiangsu 210096, China 2School of Biological Sciences and Medical Engineering, Southeast University, Nanjing, Jiangsu 210096, China 3Department of Medicine, University of Arizona, Tucson, Arizona 85721, USA 4Department of Physiology, College of Medicine, Chung-Ang University, Seoul 156-756, South Korea ABSTRACT Recent studies have suggested that the secondary structure of the 5′ untranslated region (5′ UTR) of messenger RNA (mRNA) is important for microRNA (miRNA)-mediated gene regulation in humans. mRNAs that are targeted by miRNA tend to have a higher degree of local secondary structure in their 5′ UTR; however, the general role of the 5′ UTR in miRNA-mediated gene regulation remains unknown. We systematically surveyed the secondary structure of 5′ UTRs in both plant and animal species and found a universal trend of increased mRNA stability near the 5′ cap in mRNAs that are regulated by miRNA in animals, but not in plants. Intra-genome comparison showed that gene expression level, GC content of the 5′ UTR, number of miRNA target sites, and 5′ UTR length may influence mRNA structure near the 5′ cap. Our results suggest that the 5′ UTR secondary structure performs multiple functions in regulating post-transcriptional processes. Although the local structure immediately upstream of the start codon is involved in translation initiation, RNA structure near the 5′ cap site, rather than the structure of the full-length 5′ UTR sequences, plays an important role in miRNA-mediated gene regulation.
    [Show full text]
  • The 3' Untranslated Region of the Human Interferon-Fl Mrna Has an Inhibitory Effect on Translation (SP6 RNA Transcripts/Translational Efficiency) V
    Proc. Natl. Acad. Sci. USA Vol. 84, pp. 6030-6034, September 1987 Biochemistry The 3' untranslated region of the human interferon-fl mRNA has an inhibitory effect on translation (SP6 RNA transcripts/translational efficiency) V. KRUYS*, M. WATHELET*, P. POUPARTt, R. CONTRERASt, W. FIERSt, J. CONTENTt, AND G. HUEZ*§ *D6partement de Biologie Moldculaire, Universitd Libre de Bruxelles, 1640 Rhode-St-Gen~se, Belgium; tf~partement de Virologie, Institut Pasteur du Brabant, 1180 Bruxelles, Belgium; and fLaboratory of Molecular Biology, State University of Ghent, 9000 Ghent, Belgium Communicated by M. Van Montagu, April 13, 1987 (received for review January 20, 1987) ABSTRACT In vitro-transcribed human interferon-I3 transfection of the IFN-,8 gene, that the 3' region of the (IFN-1P) mRNA, which contains all the sequence of the natural mRNA influences its stability. Nevertheless, the authors molecule, is poorly translated in a reticulocyte lysate or when could not conclude whether the coding sgequence or the 3' injected in Xenopus oocytes. This low level of translation is due UTR is responsible for the destabilization of the mRNA. to an inhibition by the 5' ;and 3' untranslated regions (UTRs). It has been shown that the IFN-,8 mRNA is posttranscrip- Indeed, the replacement of these sequences by those ofXenopus tionally regulated (7). During the course of a study on the .-globin mRNA dramatically increases the translational effi- possible mechanism of such regulation, we observed that an ciency of the mRNA, especially in oocytes. This phenomenon is in vitro-transcribed IFN-p3 mRNA was actually very poorly not due to a difference in mRNA stability since both native and translated in certain translation systems and that this phe- chimeric mRNAs remain undegraded, at least during 'the nomenon was not due to a rapid degradation of the molecule.
    [Show full text]
  • Untranslated Region
    viruses Article The RNA Architecture of the SARS-CoV-2 0 3 -Untranslated Region Junxing Zhao 1 , Jianming Qiu 2 , Sadikshya Aryal 1, Jennifer L. Hackett 3 and Jingxin Wang 1,* 1 Department of Medicinal Chemistry, University of Kansas, Lawrence, KS 66047, USA; [email protected] (J.Z.); [email protected] (S.A.) 2 Department of Microbiology, Molecular Genetics & Immunology, University of Kansas Medical Center, Kansas, KS 66160, USA; [email protected] 3 Genome Sequencing Core, University of Kansas, Lawrence, KS 66045, USA; [email protected] * Correspondence: [email protected] Academic Editor: John S. L. Parker Received: 23 October 2020; Accepted: 15 December 2020; Published: 21 December 2020 Abstract: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the current COVID-19 pandemic. The 30 untranslated region (UTR) of this β-CoV contains essential cis-acting RNA elements for the viral genome transcription and replication. These elements include an equilibrium between an extended bulged stem-loop (BSL) and a pseudoknot. The existence of such an equilibrium is supported by reverse genetic studies and phylogenetic covariation analysis and is further proposed as a molecular switch essential for the control of the viral RNA polymerase binding. Here, we report the SARS-CoV-2 30 UTR structures in cells that transcribe the viral UTRs harbored in a minigene plasmid and isolated infectious virions using a chemical probing technique, namely dimethyl sulfate (DMS)-mutational profiling with sequencing (MaPseq). Interestingly, the putative pseudoknotted conformation was not observed, indicating that its abundance in our systems is low in the absence of the viral nonstructural proteins (nsps).
    [Show full text]
  • Deep Learning of the Regulatory Grammar of Yeast 5′ Untranslated Regions from 500,000 Random Sequences
    Downloaded from genome.cshlp.org on September 27, 2021 - Published by Cold Spring Harbor Laboratory Press Research Deep learning of the regulatory grammar of yeast 5′ untranslated regions from 500,000 random sequences Josh T. Cuperus,1,2,7 Benjamin Groves,3,7 Anna Kuchina,3,7 Alexander B. Rosenberg,3,7 Nebojsa Jojic,4 Stanley Fields,1,2,5 and Georg Seelig3,6 1Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA; 2Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA; 3Department of Electrical Engineering, University of Washington, Seattle, Washington 98195, USA; 4Microsoft Research, Seattle, Washington 98195, USA; 5Department of Medicine, University of Washington, Seattle, Washington 98195, USA; 6Department of Computer Science & Engineering, University of Washington, Seattle, Washington 98195, USA Our ability to predict protein expression from DNA sequence alone remains poor, reflecting our limited understanding of cis-regulatory grammar and hampering the design of engineered genes for synthetic biology applications. Here, we generate a model that predicts the protein expression of the 5′ untranslated region (UTR) of mRNAs in the yeast Saccharomyces cerevisiae. We constructed a library of half a million 50-nucleotide-long random 5′ UTRs and assayed their activity in a massively parallel growth selection experiment. The resulting data allow us to quantify the impact on protein expression of Kozak sequence composition, upstream open reading frames (uORFs), and secondary structure. We trained a convolu- tional neural network (CNN) on the random library and showed that it performs well at predicting the protein expression of both a held-out set of the random 5′ UTRs as well as native S.
    [Show full text]
  • Identification of Regional Motifs in the 5'UTR and Their Implication In
    Iowa State University Capstones, Theses and Retrospective Theses and Dissertations Dissertations 1-1-2003 Identification of egionalr motifs in the 5'UTR and their implication in translational control mechanisms Sachet Ashok Shukla Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/rtd Recommended Citation Shukla, Sachet Ashok, "Identification of egionalr motifs in the 5'UTR and their implication in translational control mechanisms" (2003). Retrospective Theses and Dissertations. 20043. https://lib.dr.iastate.edu/rtd/20043 This Thesis is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Retrospective Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Identification ofregional motifs in the 5'UTR and their implication in translational control mechanisms by Sachet Ashok Shukla A thesis submitted to the graduate faculty in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Major: Bioinformatics and Computational Biology Program of Study Committee: Srinivas Aluru, Co-major Professor Charles Link, Jr., Co-major Professor Philip Dixon W. Allen Miller Iowa State University Ames, Iowa 2003 Copyright© Sachet Ashok Shukla, 2003. All rights reserved. 11 Graduate College Iowa State University This is to certify that the master's thesis of Sachet Ashok Shukla has met the thesis requirements of Iowa State University Signatures have been redacted for privacy 111 To my mother, father and dear brother Sanket. IV TABLE OF CONTENTS LIST OF FIGURES v LIST OF TABLES VI ACKNOWLEDGEMENTS Vlll ABSTRACT x CHAPTER 1.
    [Show full text]
  • Regulated Polyadenylation Controls Mrna Translation During Meiotic Maturation of Mouse Oocytes
    Downloaded from genesdev.cshlp.org on September 23, 2021 - Published by Cold Spring Harbor Laboratory Press Regulated polyadenylation controls mRNA translation during meiotic maturation of mouse oocytes Jean-Dominique Vassalli, 1 Joaquin Huarte, 1 Dominique Belin, 2 Pascale Gubler, 1 Anne Vassalli, 1,4 Marcia L. O'Connell, 3 Lance A. Parton, 3 Richard J. Rickles, 3 and Sidney Strickland 3 1Institute of Histology and Embryology, and 2Department of Pathology, University of Geneva Medical School, CH 1211 Geneva 4, Switzerland; 3Departrnent of Molecular Pharmacology, State University of New York, Stony Brook, New York 11794 USA The translational activation of dormant tissue-type plasminogen activator mRNA during meiotic maturation of mouse oocytes is accompanied by elongation of its 3'-poly(A) tract. Injected RNA fragments that correspond to part of the 3'-untranslated region (3'UTR) of this mRNA are also subject to regulated polyadenylation. Chimeric mRNAs containing part of this 3'UTR are polyadenylated and translated following resumption of meiosis. Polyadenylation and translation of chimeric mRNAs require both specific sequences in the 3'UTR and the canonical 3'-processing signal AAUAAA. Injection of 3'-blocked mRNAs and in vitro polyadenylated mRNAs shows that the presence of a long poly(A) tract is necessary and sufficient for translation. These results establish a role for regulated polyadenylation in the post-transcriptional control of gene expression. [Key Words: Plasminogen activators; meiosis; poly(A); 3'UTR; translational control] Received August 18, 1989; revised version accepted October 13, 1989. A rapid and efficient mechanism to regulate gene ex- in polyadenylation trigger or merely accompany transla- pression involves the post-transcriptional modulation of tion is not known.
    [Show full text]