The Genetic Code & Translation

Total Page:16

File Type:pdf, Size:1020Kb

The Genetic Code & Translation 11 The Genetic Code & Translation To understand how mRNA is At the heart of the central dogma is the concept that information in the Goal translated into protein. form of the four-letter alphabet (A, G, C and T) of the genetic material is translated into the 20-letter (amino acids) alphabet of proteins. As we Objectives have seen, the intermediary between the genetic material, DNA, and the After this chapter, you should be able to translation machinery, the ribosome, is the messenger RNA or mRNA. The mRNA is copied from the DNA in a process called transcription (Chapter • describe the principal features of the 10) and is then decoded on the ribosome in a process called translation, genetic code. where it directs the ordered polymerization of amino acids into polypeptide • explain how tRNAs mediate chains. Here we focus on the nature and logic of the genetic code, the RNA information transfer and do so adaptors that decipher the genetic code, the workings of the molecular accurately. machine that translates mRNA into protein and does so with high accuracy, • describe the peptide bond cycle and and the chemistry of the formation of the peptide bond. how it achieves accuracy. • describe the ribosome cycle and how open reading frames are set. The four-letter alphabet of the genetic material is read in units of three How many bases are required to specify an amino acid? Because nucleic acids have only four bases and proteins are composed of twenty amino acids, the coding unit or codon for each amino acid must consist of more than one base. Even two bases would not be enough. If codons consisted of two bases, then only 16 (or 42) different codons would be possible, and there would insufficient codons to specify 20 amino acids. However, if codons are composed of three bases, then 64 (or 43) codons are possible, more than enough for the amino acid alphabet. Therefore, the minimum number of bases needed to specify 20 different amino acids is three. Indeed, the genetic code is a triplet code. Chapter 11 The Genetic Code & Translation 2 Second position Figure 1 Each codon corresponds to a particular amino acid U C A G UUU UCU UAU UGU U Each codon is written from 5’ to 3’. The Phe Tyr Cys start codon (AUG) is shown in green. Stop UUC UCC UAC UGC C U Ser codons (UAA, UAG, and UGA) are shown UUA UCA UAA UGA Stop A in red. Leu Stop UUG UCG UAG UGG Trp G CUU CCU CAU CGU U His CUC CCC CAC CGC C C Leu Pro Arg position (3’) Third CUA CCA CAA CGA A Gln CUG CCG CAG CGG G AUU ACU AAU AGU U Asn Ser First Position (5’) Position First AUC Ile ACC AAC AGC C A Thr AUA ACA AAA AGA A Lys Arg AUG Met ACG AAG AGG G GUU GCU GAU GGU U Asp GUC GCC GAC GGC C G Val Ala Gly GUA GCA GAA GGA A Glu GUG GCG GAG GGG G Of the 64 possible codons in the genetic code, 61 specify amino acids, indicating that many amino acids are encoded by more than one codon (Figure 1). Thus, the code isdegenerate in the sense that some amino acids are specified by more than one synonymous codon. At one extreme, leucine, serine, and arginine are each specified by six synonymous codons. At the other extreme, methionine and tryptophan have unique codons (AUG and UGG, respectively). The methionine codon AUG has an additional function as a start codon: it signals the beginning of a coding sequence in the mRNA. The remaining three (of the 64) triplets—UAA, UGA, and UAG—do not specify any amino acids. Instead, these triplets are stop codons that signal the end of the coding sequence for a messenger RNA. Thus, coding sequences have two kinds of punctuation marks: an AUG at the beginning that marks the start and one (or more successive) stop codons at the end. We will return to start and stop codons near the end of the chapter, including the question of how AUG can serve both as a methionine codon internal to a coding sequence and as the start signal at the beginning. The entire genetic code, which is sometimes called the “Rosetta Stone of Life” (because it deciphers codons), is shown in its entirety in Figure 1. Notice that the left-hand vertical column indicates the first (5’) position in a codon, the horizontal bar across the top indicates the second position, and the right-hand vertical column indicates the third (3’) position. Start and stop codons are highlighted in green and red, respectively. Finally, we return to the 5’-to-3’ directionality of polynucleotides. Codons have a 5’-to-3’ orientation with respect to the directionality of the RNA Chapter 11 The Genetic Code & Translation 3 transcript in which they are embedded. Thus, for example, the CGA codon for arginine has the orientation 5’-CGA-3’. This is in keeping with the three foundational rules of directionality introduced in Chapter 8: polynucleotide synthesis proceeds in a 5’-to-3’ direction (rule 1), polypeptide synthesis proceeds in an NH2-terminal-to-COOH-terminal direction (rule 2), and information for the order of amino acids in the mRNA is specified sequentially in a 5’-to-3’ direction (rule 3). Thus, codons are lined up in the same 5’-to-3’ orientation as the direction of translation. Transfer RNAs are adaptors between codons and amino acids How are the 61 codons that specify amino acids deciphered such that each directs the incorporation of the appropriate, cognate amino acid? The answer is that codons are recognized by adaptor molecules known as transfer RNAs or tRNAs. In contrast to mRNAs, tRNAs are non-protein- coding RNAs that directly act as adaptors through their tertiary structure, as we will explain. (A second example of non-coding RNAs is the RNA components of the ribosome, as we will also discuss.) Each tRNA recognizes the codon for a particular amino acid. Recognition is mediated by base pairing between the codon and a corresponding anti-codon in the tRNA molecule, which align in an anti-parallel orientation. Covalently attached to each tRNA at its 3’ terminus is its cognate amino acid, that is, the amino (A) O acid that corresponds to that specified by the codon. Thus, the amino acid N NH aspartic acid is covalently attached to a tRNA whose anti-codon pairs with the aspartic acid codon 5’-GAC-3’. N N Some tRNAs recognize more than one synonymous codon. This is possible Inosine due to a phenomenon known as wobble, which takes advantage of the fact that synonymous codons often differ from each other at the third (3’) (B) H position. Part of the explanation for wobble is that the 5’ base in the anti- N O H N codon is not as spatially restricted as the other two, allowing it to “wobble” and form hydrogen bonds with bases other than its cognate base at the 3’ N I N H N C position of the codon. Also, some tRNAs have an unusual base, inosine N N (Figure 2A), at the 5’ (wobble) position of the anti-codon; inosine is able O to pair with A, U or C at the 3’ position of the codon (Figure 2B). Thus, the base-pair rules that form the basis for double-helical structures are not O strictly adhered to in the special case of codon/anti-codon interactions. N O H N U tRNAs are approximately 80 nucleotides in length. They contain regions N N of self-complementarity that enable them to fold back on themselves in a I N H O N characteristic cloverleaf-like pattern of loops and short stretches of double helix (analogous to secondary structure in proteins). The cloverleaf, in turn, folds into a precise three-dimensional structure (analogous to the tertiary H structure of proteins) that resembles the capital letter “L” (an upside-down N N N O H L in Figure 3). A key feature of tRNAs is that the anti-codon and the site of N A N attachment of the amino acid are at opposite ends of the L-shaped molecule. I N H N N N The anti-codon is displayed in a loop at one end, and the amino acid is attached to the 3’ terminus at the other end. Notice that the 5’ and 3’ termini Figure 2 Inosine can form base are both near the same end of the molecule but that the 3’ end protrudes as pairs with A, U, or C a short stretch of single-stranded RNA beyond the 5’ terminus. Chapter 11 The Genetic Code & Translation 4 (A) (B) amino acid attachment point 3’ 5’ 3’ 5’ amino acid attachment point anti-codon anti-codon Figure 3 tRNAs exhibit secondary and tertiary structure (A) Shown is a cartoon representing the two-dimensional, cloverleaf-like secondary structure of a generic tRNA. Each circle represents an individual nucleotide, with the nucleotides of the anti-codon shown in blue. Black lines indicate base pairing within the strand. (B) Shown is the X-ray crystal structure of a tRNA molecule. The tRNA folds into an L-shaped structure, with the anti-codon at one end and the 3’ hydroxyl, which is where the amino acid becomes attached, at the other end. All tRNAs have this overall folded structure, even though their specific nucleotide sequences vary. Shown is a tRNA for phenylalanine; as such, it is known as tRNAPhe.
Recommended publications
  • Effects of Single Amino Acid Deficiency on Mrna Translation Are Markedly
    www.nature.com/scientificreports OPEN Efects of single amino acid defciency on mRNA translation are markedly diferent for methionine Received: 12 December 2016 Accepted: 4 May 2018 versus leucine Published: xx xx xxxx Kevin M. Mazor, Leiming Dong, Yuanhui Mao, Robert V. Swanda, Shu-Bing Qian & Martha H. Stipanuk Although amino acids are known regulators of translation, the unique contributions of specifc amino acids are not well understood. We compared efects of culturing HEK293T cells in medium lacking either leucine, methionine, histidine, or arginine on eIF2 and 4EBP1 phosphorylation and measures of mRNA translation. Methionine starvation caused the most drastic decrease in translation as assessed by polysome formation, ribosome profling, and a measure of protein synthesis (puromycin-labeled polypeptides) but had no signifcant efect on eIF2 phosphorylation, 4EBP1 hyperphosphorylation or 4EBP1 binding to eIF4E. Leucine starvation suppressed polysome formation and was the only tested condition that caused a signifcant decrease in 4EBP1 phosphorylation or increase in 4EBP1 binding to eIF4E, but efects of leucine starvation were not replicated by overexpressing nonphosphorylatable 4EBP1. This suggests the binding of 4EBP1 to eIF4E may not by itself explain the suppression of mRNA translation under conditions of leucine starvation. Ribosome profling suggested that leucine deprivation may primarily inhibit ribosome loading, whereas methionine deprivation may primarily impair start site recognition. These data underscore our lack of a full
    [Show full text]
  • Detecting Translational Regulation by Change Point Analysis of Ribosome Profiling Datasets
    bioRxiv preprint doi: https://doi.org/10.1101/003210; this version posted March 5, 2014. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Detecting translational regulation by change point analysis of ribosome profiling datasets Zupanic A1, Meplan C2, Grellscheid SN3, Mathers JC1, Kirkwood TBL1, Hesketh JE2, Shanley DP1 1Centre for Integrated Systems Biology of Ageing & Nutrition, Institute for Ageing and Health, Newcastle University, Newcastle-upon-Tyne, NE4 5PL, UK 2Institute for Cell and Molecular Biosciences and Human Nutrition Research Centre, Newcastle University, Newcastle-upon-Tyne, NE2 4HH, UK 3School of Biological and Biomedical Sciences, Durham University, Durham DH1 3LE, UK Running title: Translational regulation in ribosome profiles Keywords: ribosome profiling, translation regulation, change point, mathematical model Corresponding author: Daryl Shanley Centre for Integrated Systems Biology of Ageing & Nutrition, Institute for Ageing and Health, Newcastle University, NE4 5PL, UK Email: [email protected] Fax: +441912481101 1 bioRxiv preprint doi: https://doi.org/10.1101/003210; this version posted March 5, 2014. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Abstract Ribo-Seq maps the location of translating ribosomes on mature mRNA transcripts. While ribosome density is constant along the length of the mRNA coding region, it can be altered by translational regulatory events.
    [Show full text]
  • YEAST CELLS MAY USE AUC OR AAG AS INITIATION CODON for PROTEIN SYNTHESIS by OLE OLSEN
    Carlsberg Res. Commun. Vol. 52, p. 83-90, 1987 YEAST CELLS MAY USE AUC OR AAG AS INITIATION CODON FOR PROTEIN SYNTHESIS by OLE OLSEN Department of Physiology, Carlsberg Laboratory, Gamle Cadsbergvej 10, DK-2500 Copenhagen Valby Keywords: Recombinant DNA, translation initiation, secretion A yeast expression plasmid without an ATG codon for initiation of mouse a-amylase protein synthesis directs the synthesis and secretion of active enzyme indistinguishable from both native mouse a-amylase and amylase synthesized from plasmids with normal AT(3 initiation codons. The initiation of amylase synthesis directed by this plasmid is at either an AUC or an AAG codon. In either case the amino acid sequence of the hydrophobic core and peptidase cleaving region of the signal peptide are normal, and the protein translation remains in frame with the structural gene of the mouse a-amylase. 1. INTRODUCTION mal context for initiation was 6NNAUGGA A where Initiation of protein synthesis in eukaryotes is a purine in position -3 (three nucleotides up- catalysed by a relatively large number of specific stream of the AUG initiator codon) is most eukaryotic initation factors (eIFs) bringing highly conserved. about the formation of the 80S initiation com- Until recently it was generally believed that plex composed ofmRNA, an 80S ribosome and eukaryotic ribosomes initiate exclusively at met-tRNAi (l 3). Eukaryotic met-tRNAi differs AUG codons although it had been shown by from its prokaryotic counterpart by being asso- RAJBHANDARYand GHOSH (24) that yeast met- ciated with the 40S pre-initation complex before tRNAi may function with either AUG or GUG binding to mRNA (13).
    [Show full text]
  • Bio 102 Practice Problems Genetic Code and Mutation
    Bio 102 Practice Problems Genetic Code and Mutation Multiple choice: Unless otherwise directed, circle the one best answer: 1. Choose the one best answer: Beadle and Tatum mutagenized Neurospora to find strains that required arginine to live. Based on the classification of their mutants, they concluded that: A. one gene corresponds to one protein. B. DNA is the genetic material. C. "inborn errors of metabolism" were responsible for many diseases. D. DNA replication is semi-conservative. E. protein cannot be the genetic material. 2. Choose the one best answer. Which one of the following is NOT part of the definition of a gene? A. A physical unit of heredity B. Encodes a protein C. Segement of a chromosome D. Responsible for an inherited characteristic E. May be linked to other genes 3. A mutation converts an AGA codon to a TGA codon (in DNA). This mutation is a: A. Termination mutation B. Missense mutation C. Frameshift mutation D. Nonsense mutation E. Non-coding mutation 4. Beadle and Tatum performed a series of complex experiments that led to the idea that one gene encodes one enzyme. Which one of the following statements does not describe their experiments? A. They deduced the metabolic pathway for the synthesis of an amino acid. B. Many different auxotrophic mutants of Neurospora were isolated. C. Cells unable to make arginine cannot survive on minimal media. D. Some mutant cells could survive on minimal media if they were provided with citrulline or ornithine. E. Homogentisic acid accumulates and is excreted in the urine of diseased individuals. 5.
    [Show full text]
  • Structural Insights Into Mrna Reading Frame Regulation by Trna
    RESEARCH ARTICLE Structural insights into mRNA reading frame regulation by tRNA modification and slippery codon–anticodon pairing Eric D Hoffer1, Samuel Hong1, S Sunita1, Tatsuya Maehigashi1, Ruben L Gonzalez Jnr2, Paul C Whitford3, Christine M Dunham1* 1Department of Biochemistry, Emory University School of Medicine, Atlanta, United States; 2Department of Chemistry, Columbia University, New York, United States; 3Department of Physics, Northeastern University, Boston, United States Abstract Modifications in the tRNA anticodon loop, adjacent to the three-nucleotide anticodon, influence translation fidelity by stabilizing the tRNA to allow for accurate reading of the mRNA genetic code. One example is the N1-methylguanosine modification at guanine nucleotide 37 (m1G37) located in the anticodon loop andimmediately adjacent to the anticodon nucleotides 34, 35, 36. The absence of m1G37 in tRNAPro causes +1 frameshifting on polynucleotide, slippery codons. Here, we report structures of the bacterial ribosome containing tRNAPro bound to either cognate or slippery codons to determine how the m1G37 modification prevents mRNA frameshifting. The structures reveal that certain codon–anticodon contexts and the lack of m1G37 destabilize interactions of tRNAPro with the P site of the ribosome, causing large conformational changes typically only seen during EF-G-mediated translocation of the mRNA-tRNA pairs. These studies provide molecular insights into how m1G37 stabilizes the interactions of tRNAPro with the ribosome in the context of a slippery mRNA codon. *For correspondence: Introduction [email protected] Post-transcriptionally modified RNAs, including ribosomal RNA (rRNA), transfer RNA (tRNA) and messenger RNA (mRNA), stabilize RNA tertiary structures during ribonucleoprotein biogenesis, reg- Competing interests: The ulate mRNA metabolism, and influence other facets of gene expression.
    [Show full text]
  • Circular Code Motifs in the Ribosome: a Missing Link in the Evolution of Translation?
    Downloaded from rnajournal.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press Circular code motifs in the ribosome: a missing link in the evolution of translation? Gopal Dila1, Raymond Ripp1, Claudine Mayer1,2,3, Olivier Poch1, Christian J. Michel1,* and Julie D. Thompson1,* 1 Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg, France 2 Unité de Microbiologie Structurale, Institut Pasteur, CNRS, 75724 Paris Cedex 15, France 3 Université Paris Diderot, Sorbonne Paris Cité, 75724 Paris Cedex 15, France * To whom correspondence should be addressed; Email: [email protected] *Corresponding authors: Names: Christian J. Michel, Julie D. Thompson Address: Department of Computer Science, ICube, Strasbourg, France Phone: (33) 0368853296 Email: [email protected], [email protected] Running title: circular code motifs in the ribosome Keywords: origin of life, genetic code, circular code, translation, ribosome evolution 1 Dila et al. Downloaded from rnajournal.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press Abstract The origin of the genetic code remains enigmatic five decades after it was elucidated, although there is growing evidence that the code co-evolved progressively with the ribosome. A number of primordial codes were proposed as ancestors of the modern genetic code, including comma-free codes such as the RRY, RNY or GNC codes (R = G or A, Y = C or T, N = any nucleotide), and the X circular code, an error-correcting code that also allows identification and maintenance of the reading frame. It was demonstrated previously that motifs of the X circular code are significantly enriched in the protein-coding genes of most organisms, from bacteria to eukaryotes.
    [Show full text]
  • Ef-G:Trna Dynamics During the Elongation Cycle of Protein Synthesis
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2015 Ef-G:trna Dynamics During the Elongation Cycle of Protein Synthesis Rong Shen University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Biochemistry Commons Recommended Citation Shen, Rong, "Ef-G:trna Dynamics During the Elongation Cycle of Protein Synthesis" (2015). Publicly Accessible Penn Dissertations. 1131. https://repository.upenn.edu/edissertations/1131 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/1131 For more information, please contact [email protected]. Ef-G:trna Dynamics During the Elongation Cycle of Protein Synthesis Abstract During polypeptide elongation cycle, prokaryotic elongation factor G (EF-G) catalyzes the coupled translocations on the ribosome of mRNA and A- and P-site bound tRNAs. Continued progress has been achieved in understanding this key process, including results of structural, ensemble kinetic and single- molecule studies. However, most of work has been focused on the pre-equilibrium states of this fast process, leaving the real time dynamics, especially how EF-G interacts with the A-site tRNA in the pretranslocation complex, not fully elucidated. In this thesis, the kinetics of EF-G catalyzed translocation is investigated by both ensemble and single molecule fluorescence resonance energy transfer studies to further explore the underlying mechanism. In the ensemble work, EF-G mutants were designed and expressed successfully. The labeled EF-G mutants show good translocation activity in two different assays. In the smFRET work, by attachment of a fluorescent probe at position 693 on EF-G permits monitoring of FRET efficiencies to sites in both ribosomal protein L11 and A-site tRNA.
    [Show full text]
  • Super-Resolution Ribosome Profiling Reveals Unannotated Translation Events in Arabidopsis
    Super-resolution ribosome profiling reveals unannotated translation events in Arabidopsis Polly Yingshan Hsua, Lorenzo Calviellob,c, Hsin-Yen Larry Wud,1, Fay-Wei Lia,e,f,1, Carl J. Rothfelse,f, Uwe Ohlerb,c, and Philip N. Benfeya,g,2 aDepartment of Biology, Duke University, Durham, NC 27708; bBerlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, 13125 Berlin, Germany; cDepartment of Biology, Humboldt Universität zu Berlin, 10099 Berlin, Germany; dBioinformatics Research Center and Department of Statistics, North Carolina State University, Raleigh, NC 27695; eUniversity Herbarium, University of California, Berkeley, CA 94720; fDepartment of Integrative Biology, University of California, Berkeley, CA 94720; and gHoward Hughes Medical Institute, Duke University, Durham, NC 27708 Contributed by Philip N. Benfey, September 13, 2016 (sent for review June 30, 2016; reviewed by Pam J. Green and Albrecht G. von Arnim) Deep sequencing of ribosome footprints (ribosome profiling) maps and contaminants. Several metrics associated with translation have and quantifies mRNA translation. Because ribosomes decode mRNA been exploited (11), for example, the following: (i)ribosomesre- every 3 nt, the periodic property of ribosome footprints could be lease after encountering a stop codon (9), (ii) local enrichment of used to identify novel translated ORFs. However, due to the limited footprints within the predicted ORF (4, 13), (iii) ribosome footprint resolution of existing methods, the 3-nt periodicity is observed length distribution (7), and (iv) 3-nt periodicity displayed by trans- mostly in a global analysis, but not in individual transcripts. Here, we lating ribosomes (2, 6, 10, 14, 15). Among these features, some work report a protocol applied to Arabidopsis that maps over 90% of the well in distinguishing groups of coding vs.
    [Show full text]
  • An Eif4e Allele Confers Resistance to an Uncapped and Non-Polyadenylated RNA Virus in Melon
    The Plant Journal (2006) 48, 452–462 doi: 10.1111/j.1365-313X.2006.02885.x An eIF4E allele confers resistance to an uncapped and non-polyadenylated RNA virus in melon Cristina Nieto1,3,‡, Monica Morales2,3,†,‡, Gisella Orjeda3,‡, Christian Clepet3, Amparo Monfort2, Benedicte Sturbois3, Pere Puigdome` nech4, Michel Pitrat5, Michel Caboche3, Catherine Dogimont5, Jordi Garcia-Mas2, Miguel. A. Aranda1 and Abdelhafid Bendahmane3,* 1Centro de Edafologı´a y Biologı´a Aplicada del Segura (CEBAS)- CSIC, Apdo. correos 164, 30100 Espinardo, Murcia, Spain, 2Departament de Gene` tica Vegetal, Laboratori de Gene` tica Molecular Vegetal CSIC-IRTA, carretera de Cabrils s/n, 08348 Cabrils, Barcelona, Spain, 3Unite´ de Recherche en Ge´ nomique Ve´ ge´ tale, 2, rue Gaston Cre´ mieux CP 5708, 91057 Evry Cedex, France, 4Departament de Gene` tica Molecular, Laboratori de Gene` tica Molecular Vegetal CSIC-IRTA, Jordi Girona 18-26, 08034 Barcelona, Spain, and 5INRA, Unite´ de Ge´ netique et Ame´ lioration des Plantes, BP 94, Montfavet, F-84143, France Received 6 April 2006; revised 5 July 2006; accepted 18 July 2006. *For correspondence (fax þ33 160874510; e-mail [email protected]). †Present address: Department of Disease and Stress Biology and Molecular Microbiology, John Innes Center, Norwich NR4 7UH, UK. ‡These authors contributed equally to this work. Summary The characterization of natural recessive resistance genes and virus-resistant mutants of Arabidopsis have implicated translation initiation factors of the 4E family [eIF4E and eIF(iso)4E] as susceptibility factors required for virus multiplication and resistance expression. To date, viruses controlled by these genes mainly belong to the family Potyviridae.
    [Show full text]
  • Measurements of Translation Initiation from All 64 Codons in E. Coli
    bioRxiv preprint doi: https://doi.org/10.1101/063800; this version posted July 14, 2016. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Measurements of translation initiation from all 64 codons in E. coli Ariel Hecht1,2,3,‡, Jeff Glasgow1,2,3,‡, Paul R. Jaschke3,4,‡, Lukmaan Bawazer1,2,3,‡, Matthew S. Munson1,2,3, Jennifer Cochran1,3, Drew Endy1,3,* and Marc Salit1,2,3,* 1Joint Initiative for Metrology in Biology, Stanford, CA, 2Genome-scale Measurements Group, National Institute of Standards and Technology, Stanford, CA, 3Department of Bioengineering, Stanford, CA, 4Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW, Australia *To whom correspondence should be addressed. Email: [email protected]. Correspondence may also be addressed to Drew Endy. Email: [email protected]. ‡These authors contributed equally. ABSTRACT Our understanding of translation is one cornerstone of molecular biology that underpins our capacity to engineer living matter. The canonical start codon (AUG) and a few near-cognates (GUG, UUG) are typically considered as the “start codons” for translation initiation in Escherichia coli (E. coli). Translation is typically not thought to initiate from the 61 remaining codons. Here, we systematically quantified translation initiation in E. coli from all 64 triplet codons. We detected protein synthesis above background initiating from at least 46 codons. Translation initiated from these non-canonical start codons at levels ranging from 0.01% to 2% relative to AUG.
    [Show full text]
  • Genes Preferring Non-AUG Start Codons in Bacteria Abstract
    Genes Preferring Non-AUG Start Codons in Bacteria 1,2 2,3,4 Anne Gvozdjak ​ and Manoj P. Samanta ​ ​ 1. Bellevue High School, 10416 SE Wolverine Way, Bellevue, WA 98004 2. Coding for Life Science, Redmond, WA 98053 3. Systemix Institute, Redmond, WA 98053 4. [email protected] Abstract Here we investigate translational regulation in bacteria by analyzing the distribution of start codons in fully assembled genomes. We report 36 genes (infC, rpoC, rnpA, etc.) showing a preference for non-AUG start codons in evolutionarily diverse phyla (“non-AUG genes”). Most of the non-AUG genes are functionally associated with translation, transcription or replication. In E. ​ coli, the percentage of essential genes among these 36 is significantly higher than among all ​ genes. Furthermore, the functional distribution of these genes suggests that non-AUG start codons may be used to reduce gene expression during starvation conditions, possibly through translational autoregulation or IF3-mediated regulation. Introduction Understanding the regulation of gene expression at the transcriptional and translational levels is a key step in deciphering the information contained by genomes. Whereas inexpensive, high-throughput technologies (ChIP-seq, RNAseq) are helping in extensive experimental investigation of transcriptional regulation [1,2], the measurement of translational regulation using ​ ​ high-throughput mass spectroscopy [3] is less widely adopted. Bioinformatics provides another ​ avenue to leverage the rapidly falling cost of DNA sequencing to explore translational regulation. Prior to 2000, computational researchers identified a number of relevant patterns through the comparative analysis of gene sequences [4,5] . With the availability of ~250,000 ​ prokaryotic genomes, thousands of eukaryotic genomes, and additional metagenomic sequences, those patterns can now be studied at a significantly larger scale, and new patterns may also be identified.
    [Show full text]
  • Rps3/Us3 Promotes Mrna Binding at the 40S Ribosome Entry Channel and Stabilizes Preinitiation Complexes at Start Codons
    Rps3/uS3 promotes mRNA binding at the 40S ribosome entry channel and stabilizes preinitiation complexes at start codons Jinsheng Donga, Colin Echeverría Aitkenb, Anil Thakura, Byung-Sik Shina, Jon R. Lorschb,1, and Alan G. Hinnebuscha,1 aLaboratory of Gene Regulation and Development, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892; and bLaboratory on the Mechanism and Regulation of Protein Synthesis, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892 Contributed by Alan G. Hinnebusch, January 24, 2017 (sent for review December 15, 2016; reviewed by Jamie H. D. Cate and Matthew S. Sachs) Met The eukaryotic 43S preinitiation complex (PIC) bearing Met-tRNAi rearrangement to PIN at both near-cognate start codons (e.g., in a ternary complex (TC) with eukaryotic initiation factor (eIF)2-GTP UUG) and cognate (AUG) codons in poor Kozak context; hence scans the mRNA leader for an AUG codon in favorable “Kozak” eIF1 must dissociate from the 40S subunit for start-codon rec- context. AUG recognition provokes rearrangement from an open ognition (Fig. 1A). Consistent with this, structural analyses of PIC conformation with TC bound in a state not fully engaged with partial PICs reveal that eIF1 and eIF1A promote rotation of the “ ” the P site ( POUT ) to a closed, arrested conformation with TC tightly 40S head relative to the body (2, 3), thought to be instrumental bound in the “P ” state. Yeast ribosomal protein Rps3/uS3 resides IN in TC binding in the POUT conformation, but that eIF1 physically in the mRNA entry channel of the 40S subunit and contacts mRNA Met clashes with Met-tRNAi in the PIN state (2, 4), and is both via conserved residues whose functional importance was unknown.
    [Show full text]