Corner Structural Motifs in Proteins

Total Page:16

File Type:pdf, Size:1020Kb

Corner Structural Motifs in Proteins S S symmetry Article Use of the Molecular Dynamics Method to Investigate the Stability of α-α-Corner Structural Motifs in Proteins Vladimir R. Rudnev 1,2, Liudmila I. Kulikova 1,3, Anna L. Kaysheva 2,*, Alexander V. Efimov 4 and Dmitry A. Tikhonov 1,3 1 Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, 142290 Pushchino, Russia; [email protected] (V.R.R.); [email protected] (L.I.K.); [email protected] (D.A.T.) 2 V.N. Orekhovich Institute of Biomedical Chemistry, 119121 Moscow, Russia 3 Institute of Mathematical Problems of Biology RAS-the Branch of Keldysh Institute of Applied Mathematics of Russian Academy of Sciences, 142290 Pushchino, Russia 4 Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Russia; efi[email protected] * Correspondence: [email protected]; Tel.: +7-(499)-764-98-78 Abstract: This study investigated the stability of structural motifs via molecular dynamics, using α-α-corners as an example. A molecular dynamics experiment was performed on a sample of α-α-corners selected by the authors from the PDB database. For the first time during a molecular dynamics experiment, we investigated the characteristics of structural motifs by describing their geometry, including the interplanar distance, area of polygon of the helices projections intersection, and torsion angles between axes of helices in helical pairs. The torsion angles for the constriction amino acids in the equilibrium portion of the molecular dynamics trajectory were analyzed. Using the molecular dynamics method, α-α-corners were found to be autonomous structures that are stable Citation: Rudnev, V.R.; Kulikova, in aquatic environments. L.I.; Kaysheva, A.L.; Efimov, A.V.; Tikhonov, D.A. Use of the Molecular Keywords: structural motifs of proteins; helical pairs; stability; molecular dynamics Dynamics Method to Investigate the Stability of α-α-Corner Structural Motifs in Proteins. Symmetry 2021, 13, 1193. https://doi.org/10.3390/ 1. Introduction sym13071193 Simple structural motifs are a current focus of researchers. These consist of two Academic Editor: Francesco Merlino elements of the secondary structure, which are characterized by unique folding of the polypeptide chain in space. This interest in structural motifs is due to the uniqueness of the Received: 12 June 2021 structures and their ability to act as focuses in the process of protein folding [1]. Structural Accepted: 28 June 2021 motifs can be used as starting structures to identify possible folds in a polypeptide chain Published: 2 July 2021 when modeling protein structures. They can also be used as stable structures in studies to predict the tertiary structure of a protein. Publisher’s Note: MDPI stays neutral Previously, Efimov [1] presented a classification of structural motifs, including two with regard to jurisdictional claims in α-helices, with unique folds of the polypeptide chain in space (α-α-corners, α-α-hairpins, published maps and institutional affil- and L-shaped and V-shaped structures). Types of helical pairs are presented in the Figure1 . iations. The α-α-hairpin motif is a supersecondary structure comprising two adjacent α-helices in the chain, which are constricted and packed antiparallel. The hairpin can be left or right, depending on how the second α-helix is located relative to the first. The length of the constrictions between the helices can also differ, and each standard α-α-hairpin contains Copyright: © 2021 by the authors. hydrophobic, hydrophilic, and glycine residues at strictly defined locations [2]. Licensee MDPI, Basel, Switzerland. Supersecondary L- and V-shaped structures are formed by two helices. Proline plays This article is an open access article a special role in the formation of L-shaped structures and promotes a link between two distributed under the terms and α-helices. Similar to studs, L-shaped structures can be right-handed or left-handed [3]. conditions of the Creative Commons V-shaped structures are also very similar to α-α-hairpins, in which the unconnected ends of Attribution (CC BY) license (https:// α-helices are very far from each other; they also appear as L-shaped structures. In V-shaped creativecommons.org/licenses/by/ structures, α-helices have a length that does not exceed three to four turns. 4.0/). Symmetry 2021, 13, 1193. https://doi.org/10.3390/sym13071193 https://www.mdpi.com/journal/symmetry Symmetry 2021, 13, x FOR PEER REVIEW 2 of 14 Symmetry 2021, 13, 1193 2 of 12 Figure 1. Types of the helical pairs, including two α-helices, with unique folds of the polypeptide Figurechain 1. Types in space. of the helical pairs, including two α-helices, with unique folds of the polypeptide chain in space. One of the most common structural motifs in homologous and non-homologous Theproteins α-α-hairpin is the α motif-α-corner is a supersecondary [4]. This supersecondary structure comprising structure is two formed adjacent by two α-hel- adjacent ices inα -helicesthe chain, along which the are polypeptide constricted chain, and connectedpacked antiparallel. by a constriction The hairpin and packed can be orthogonally left or right,in depending space. In proteins,on how αthe-α -cornerssecond α occur-helix in is the located form ofrelative a left supercoil.to the first. Their The sequences length of have the constrictionsa definite arrangement between the in helices chains can of hydrophobic,also differ, and hydrophilic, each standard and α glycine-α-hairpin residues. con- The tains sidehydrophobic, chains of residueshydrophilic, buried and in glycine the hydrophobic residues at core strictly are hydrophobic.defined locations Hydrophilic [2]. side Supersecondarychains or other polar L- and groups V-shaped can be structures recessed intoare formed a hydrophobic by two corehelices. (or otherProline hydrophobic plays a specialenvironment) role in the formation if they participate of L-shaped in the structures formation and of promotes hydrogen a link or ionic between bonds two [1 ].α- Thus, helices.each Similarα-helix to must studs, have L-shaped at least structures one hydrophobic can be residueright-handed per turn; or inleft-handed this case, hydrophobic [3]. V- shapedresidues structures should are bealso located very similar on one to side α-α of-hairpins, the α-helix in which and form the aunconnected continuous hydrophobicends of α-helicescluster are [ 4very]. far from each other; they also appear as L-shaped structures. In V- shaped structures,Previously α-helices [5–8], we have investigated a length that the geometricdoes not exceed characteristics three to four of double-helical turns. struc- Onetures. of Importantthe most common characteristics structural of helical motifspairs in homologous include inter-helical and non-homologous distances, the pro- torsion teins anglesis the α between-α-corner helical [4]. This axes insupersecondary helical pairs, the structure number is of formed amino acidsby two between adjacent helixes, α- the heliceslength along of the the polypeptide helixes, and chain, the area connected and perimeter by a constriction of the intersection and packed of helicalorthogonally projections. in space.In a previousIn proteins, study α-α [-corners9], we formulated occur in the rules form for theof a classification left supercoil. of Their double-helical sequences motifs have baseda definite on thearrangement geometric in and chains spatial of characteristicshydrophobic, hydrophilic, of their packing. and glycine residues. The side chainsIn proteins, of residuesα-helices buried are in tightlythe hydrophobic packed. Two core αare-helices hydrophobic. are packed Hydrophilic most densely side chainswhen thereor other is an polar antiparallel, groups can perpendicular, be recessed andinto canteda hydrophobic orientation core between (or other the hy- helices; drophobicexamples environment) of such packing if they include participate the supersecondary in the formation structures of hydrogenα-α-corners, or ionicα -bondsα-hairpins, [1]. Thus,and L-shapedeach α-helix and must V-shaped have structuresat least one (Figure hydrophobic1). The most residue studied per turn;α-α-corners in this case, are those hydrophobicwith short residues constrictions. should be The located constriction on one representsside of the theα-helix disordered and form part a continuous of the molecule, hydrophobicand each cluster amino [4]. acid residue within the constriction has a certain conformation from Previouslythe allowed [5–8], regions we on investigated the Ramachandran the geometric map. Since characteristics the conformational of double-helical change in the structures.molecule Important chain is characteristics significant (close of helical to rotation), pairs include the conformations inter-helical ofdistances, residues the in thetor- short sion anglesconstriction between are helical clamped axes within in helical a narrower pairs, the range. number These of patternsamino acids were between noted in he- [4] and lixes, requirethe length a comprehensive of the helixes, study and the and area statistical and perimeter analysis. of the intersection of helical projections.Many In a smallprevious proteins study exist[9], we that form compriseulated onlyrules onefor the or twoclassification known structural of double- motifs, α α helicalindicating motifs based that on such the structuralgeometric motifsand spatial are stablecharacteristics [4]. The of stability their packing. of - -corners was demonstrated indirectly in 1993 by Canadian researchers F. Tsai and J. Sherman (Univer- Symmetry
Recommended publications
  • RNA Structural Motif Recognition Based on Least-Squares Distance
    Downloaded from rnajournal.cshlp.org on October 1, 2021 - Published by Cold Spring Harbor Laboratory Press BIOINFORMATICS RNA structural motif recognition based on least-squares distance YING SHEN,1 HAU-SAN WONG,2,4 SHAOHONG ZHANG,3 and LIN ZHANG1 1School of Software Engineering, Tongji University, Shanghai 200092, China 2Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong 3Department of Computer Science, Guangzhou University, Guangzhou 510006, China ABSTRACT RNA structural motifs are recurrent structural elements occurring in RNA molecules. RNA structural motif recognition aims to find RNA substructures that are similar to a query motif, and it is important for RNA structure analysis and RNA function prediction. In view of this, we propose a new method known as RNA Structural Motif Recognition based on Least-Squares distance (LS-RSMR) to effectively recognize RNA structural motifs. A test set consisting of five types of RNA structural motifs occurring in Escherichia coli ribosomal RNA is compiled by us. Experiments are conducted for recognizing these five types of motifs. The experimental results fully reveal the superiority of the proposed LS-RSMR compared with four other state-of-the-art methods. Keywords: RNA structural motif; RNA motif recognition INTRODUCTION RNA molecule (i.e., the search space). There are two prevalent ways to search for RNA structural motifs. The first class of LongnoncodingRNA(ncRNA),whichconsistsof>200ntand methods is to find motifs based on their geometric features. has little or no protein-coding capability, is a remarkable class of RNAs. They are important in various biological processes NASSAM (Harrison et al. 2003) represents RNA motifs and (Dinger et al.
    [Show full text]
  • Cross-Talk Between Overlap Interactions in Biomolecules: a Case Study of the Β-Turn Motif
    molecules Article Cross-Talk between Overlap Interactions in Biomolecules: A Case Study of the b-Turn Motif Jayashree Nagesh Solid State and Structural Chemistry Unit, Indian Institute of Science Bangalore, Bengaluru 560012, Karnataka, India; [email protected] Abstract: Noncovalent interactions play a pivotal role in regulating protein conformation, stability and dynamics. Among the quantum mechanical (QM) overlap-based noncovalent interactions, n ! p∗ is the best understood with studies ranging from small molecules to b-turns of model proteins such as GB1. However, these investigations do not explore the interplay between multiple overlap interactions in contributing to local structure and stability. In this work, we identify and characterize all noncovalent overlap interactions in the b-turn, an important secondary structural element that facilitates the folding of a polypeptide chain. Invoking a QM framework of natural bond orbitals, we demonstrate the role of several additional interactions such as n ! s∗ and p ! p∗ that are energetically comparable to or larger than n ! p∗. We find that these interactions are sensitive to changes in the side chain of the residues in the b-turn of GB1, suggesting that the n ! p∗ may not be the only component in dictating b-turn conformation and stability. Furthermore, a database search of n ! s∗ and p ! p∗ in the PDB reveals that they are prevalent in most proteins and have significant interaction energies (∼1 kcal/mol). This indicates that all overlap interactions must be taken into account to obtain a comprehensive picture of their contributions to protein structure and energetics. Lastly, based on the extent of QM overlaps and interaction energies, we propose geometric criteria using which these additional interactions can be efficiently tracked in broad database searches.
    [Show full text]
  • Comparison of GHT-Based Approaches to Structural Motif Retrieval
    Comparison of GHT-Based Approaches to Structural Motif Retrieval Alessio Ferone1 and Ozlem Ozbudak2 1 University of Naples Parthenope, Department of Applied Science, Centro Direzionale Napoli - Isola C4, 80143, Napoli, Italy [email protected] 2 Istanbul Technical University, Department of Electronics and Communication Engineering, 34469, Istanbul, Turkey [email protected] Abstract. The structure of a protein gives important information about its function and can be used for understanding the evolutionary relation- ships among proteins, predicting protein functions, and predicting pro- tein folding. A structural motif is a compact 3D protein block referring to a small specific combination of secondary structural elements which appears in a variety of molecules. In this paper we present a compari- son between few approaches for motif retrieval based on the Generalized Hough Transform (GHT). Performance comparisons, in terms of preci- sion and computation time, are presented considering the retrieval of motifs composed by three to five SSs for more than 15 million searches. The approaches object of this study can be easily applied to the retrieval of greater blocks, up to protein domains, or even entire proteins. Keywords: Hough transform, Protein motif retrieval, Protein structure comparison. 1 Introduction Proteins are central molecules in biological phenomena because they form the functional and structural cell components of every organisms and their function is determined, to a large extend, by their spatial structures. Starting from the linear sequence of amino acid given in Protein Data Bank (PDB) [1], two basic regular 3D structures can be envisaged [11], called SSs: helices and sheets.Small specific combinations of SSs, which appear in a variety of molecules, are called motifs, and can be considered as super-SSs [12].
    [Show full text]
  • For Use of the Conserved Helix-Turn-Helix Motif in DNA Binding (Escherichia Coli/Operator Recognition/Hydroxylamine Mutagenesis/Tetracycline Resistance) PAUL J
    Proc. Natl. Acad. Sci. USA Vol. 82, pp. 6226-6230, September 1985 Genetics Dominant negative mutations in the TnIO tet repressor: Evidence for use of the conserved helix-turn-helix motif in DNA binding (Escherichia coli/operator recognition/hydroxylamine mutagenesis/tetracycline resistance) PAUL J. ISACKSON AND KEVIN P. BERTRAND Department of Microbiology and Molecular Genetics, California College of Medicine, University of California, Irvine, CA 92717 Communicated by Charles Yanofsky, May 20, 1985 ABSTRACT The Tn1O tet repressor regulates transcrip- not yet known. These observations have led several groups tion of the tetracycline-resistance determinant in transposon to propose that many sequence-specific DNA-binding pro- Tn1O. Previous DNA sequencing studies identified a region of teins use similar helix-turn-helix structures for DNA binding tet repressor (amino acids 26-47) that is homologous to the (19-21). helix-turn-helix regions of X Cro, X repressor, and catabolite We previously reported that an amino-terminal region of gene activator protein that are implicated in sequence-specific the TnJO tet repressor shows amino acid sequence homology DNA binding. Here we report the isolation of dominant tetR with the characteristic helix-turn-helix regions of Cro, X mutations that result in tet repressors deficient in tet operator repressor, and CAP (8). Here we report that mutations in binding but that retain some capacity to form dimers with, and TWJO tetR that impair repressor-operator binding, but not thereby inactivate, wild-type repressor monomers. The muta- tetracycline binding or subunit aggregation, are clustered in tions were isolated by transforming a MeMR+ tetA-lacZ fusion the region of helix-turn-helix sequence homology.
    [Show full text]
  • Computational Design of Asymmetric Three-Dimensional RNA Structures and 2 Machines
    bioRxiv preprint doi: https://doi.org/10.1101/223479; this version posted November 21, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Title: Computational Design of Asymmetric Three-dimensional RNA Structures and 2 Machines. 3 4 Authors: Joseph D. Yesselman1, Daniel Eiler2, Erik D. Carlson3,4, 5, Alexandra N. 5 Ooms6, Wipapat Kladwang1, Xuesong Shi1, David A. Costantino2, Daniel Herschlag1,7, 8, 6 Michael C. Jewett3,4,5, Jeffrey S. Kieft2, Rhiju Das1,9,* 7 8 1 Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 9 94305, USA. 10 2 Department of Biochemistry and Molecular Genetics, University of Colorado Denver 11 School of Medicine, Aurora, Colorado 80045, USA. 12 3 Department of Chemical and Biological Engineering, Northwestern University, 2145 13 Sheridan Road, Tech E-136, Evanston, Illinois 60208, USA. 14 4 Chemistry of Life Processes Institute, Northwestern University, 2170 Campus Drive, 15 Evanston, Illinois 60208, USA. 16 4 Center for Synthetic Biology, Northwestern University, 2145 Sheridan Road, Evanston, 17 Illinois 60208, USA. 18 6 Department of Cancer Genetics & Genomics, Stanford University School of Medicine, 19 Stanford, CA 94305, USA. 20 7 Department of Chemistry, Stanford University School of Medicine, Stanford, CA 94305, 21 USA. 22 8 Stanford ChEM-H (Chemistry, Engineering, and Medicine for Human Health), Stanford 23 University, Stanford, California 94305, USA. 24 9 Department of Physics, Stanford University, Stanford, CA 94305, USA. 25 26 *Correspondence should be addressed to R.D. ([email protected]) 27 1 bioRxiv preprint doi: https://doi.org/10.1101/223479; this version posted November 21, 2017.
    [Show full text]
  • Computational Prediction of RNA Structural Motifs Involved in Posttranscriptional Regulatory Processes
    Computational prediction of RNA structural motifs involved in posttranscriptional regulatory processes Michal Rabani*, Michael Kertesz*, and Eran Segal*† *Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel Edited by Sean R. Eddy, Howard Hughes Medical Institute, Janelia Farm, Ashburn, VA, and accepted by the Editorial Board August 6, 2008 (received for review April 12, 2008) Messenger RNA molecules are tightly regulated, mostly through Here we develop RNApromo (RNA prediction of motifs), a new interactions with proteins and other RNAs, but the mechanisms that computational method to identify short structural RNA motifs in confer the specificity of such interactions are poorly understood. It is sets of long unaligned RNAs. Using our method, we identify clear, however, that this specificity is determined by both the nucle- putative motifs in sets of mRNAs with substantial experimental otide sequence and secondary structure of the mRNA. Here, we evidence for a common posttranscriptional regulation, and we develop RNApromo, an efficient computational tool for identifying support these findings with cross-validation analysis. In some cases, structural elements within mRNAs that are involved in specifying sequence conservation of the putative motifs provides strong posttranscriptional regulations. By analyzing experimental data on independent support for our findings. The identified motifs include mRNA decay rates, we identify common structural elements in fast- motifs for mRNAs with similar decay rates, mRNAs that are bound decaying and slow-decaying mRNAs and link them with binding by the same RBP, and mRNAs with a common cellular localization. preferences of several RNA binding proteins. We also predict struc- Additionally, analysis of pre-microRNA structures identified dif- tural elements in sets of mRNAs with common subcellular localization ferences between animals and plants in the sizes of stem and loop in mouse neurons and fly embryos.
    [Show full text]
  • An Approach Used in DNA-Binding Helix-Turn-Helix Motif Prediction with Sequence Information Wenwei Xiong, Tonghua Li*, Kai Chen and Kailin Tang
    5632–5640 Nucleic Acids Research, 2009, Vol. 37, No. 17 Published online 3 August 2009 doi:10.1093/nar/gkp628 Local combinational variables: an approach used in DNA-binding helix-turn-helix motif prediction with sequence information Wenwei Xiong, Tonghua Li*, Kai Chen and Kailin Tang Department of Chemistry, Tongji University, Shanghai, 200092, China Received April 28, 2009; Revised and Accepted July 14, 2009 ABSTRACT INTRODUCTION Sequence-based approach for motif prediction Since the discovery of the DNA double helix in 1953 and is of great interest and remains a challenge. In this the ‘central dogma’ of molecular biology (1), which has work, we develop a local combinational variable been questioned and subsequently revised, research and approach for sequence-based helix-turn-helix debate on the flow of genetic information have been con- tinuous (2). The mechanisms for encoding, decoding and (HTH) motif prediction. First we choose a sequence transmitting genetic information have been the focus of data set for 88 proteins of 22 amino acids in length much attention. DNA-binding proteins play a vital role to launch an optimized traversal for extracting local in the delivery of this information DNA-binding proteins combinational segments (LCS) from the data set. (3) include transcription factors that modulate the tran- Then after LCS refinement, local combinational scription process, nucleases that cleave DNA molecules variables (LCV) are generated to construct predic- and histones that are involved in DNA packaging in the tion models for HTH motifs. Prediction ability of LCV cell nucleus (4). DNA-binding proteins comprise DNA- sets at different thresholds is calculated to settle a binding domains such as the helix-turn-helix (HTH), the moderate threshold.
    [Show full text]
  • Disclosing the Impact of Carcinogenic Sf3b Mutations on Pre-Mrna Recognition Via All-Atom Simulations
    biomolecules Article Disclosing the Impact of Carcinogenic SF3b Mutations on Pre-mRNA Recognition Via All-Atom Simulations Jure Borišek 1,2 , Andrea Saltalamacchia 3 , Anna Gallì 4 , Giulia Palermo 5, Elisabetta Molteni 6, Luca Malcovati 4,6 and Alessandra Magistrato 1,* 1 CNR-IOM-Democritos National Simulation Center c/o SISSA, 34136 Trieste, Italy; [email protected] 2 National Institute of Chemistry, 1000 Ljubljana, Slovenia 3 International School for Advanced Studies (SISSA), 34136 Trieste, Italy; [email protected] 4 Department of Hematology, IRCCS S. Matteo Hospital Foundation, 27100 Pavia, Italy; [email protected] (A.G.); [email protected] (L.M.) 5 Department of Bioengineering, University of California Riverside, Riverside, CA 92521, USA; [email protected] 6 Department of Molecular Medicine, University of Pavia, 27100 Pavia, Italy; [email protected] * Correspondence: [email protected] Received: 21 August 2019; Accepted: 17 October 2019; Published: 21 October 2019 Abstract: The spliceosome accurately promotes precursor messenger-RNA splicing by recognizing specific noncoding intronic tracts including the branch point sequence (BPS) and the 3’-splice-site (3’SS). Mutations of Hsh155 (yeast)/SF3B1 (human), which is a protein of the SF3b factor involved in BPS recognition and induces altered BPS binding and 3’SS selection, lead to mis-spliced mRNA transcripts. Although these mutations recur in hematologic malignancies, the mechanism by which they change gene expression remains unclear. In this study, multi-microsecond-long molecular-dynamics simulations of eighth distinct 700,000 atom models of the spliceosome Bact complex, and gene ∼ sequencing of SF3B1, disclose that these carcinogenic isoforms destabilize intron binding and/or affect the functional dynamics of Hsh155/SF3B1 only when binding non-consensus BPSs, as opposed to the non-pathogenic variants newly annotated here.
    [Show full text]
  • Automated Construction of Structural Motifs for Predicting Functional Sites on Protein Structures
    Automated Construction of Structural Motifs for Predicting Functional Sites on Protein Structures M.P. Liang, D.L. Brutlag, R.B. Altman Pacific Symposium on Biocomputing 8:204-215(2003) AUTOMATED CONSTRUCTION OF STRUCTURAL MOTIFS FOR PREDICTING FUNCTIONAL SITES ON PROTEIN STRUCTURES M.P. LIANG, D.L. BRUTLAG, R.B. ALTMAN Stanford University, 251 Campus Drive X-215, Stanford, CA 94305-5479, USA E-mail: {mliang, brutlag, altman}@smi.stanford.edu Structural genomics initiatives are beginning to rapidly generate vast numbers of protein structures. For many of the structures, functions are not yet determined and high-throughput methods for determining function are necessary. Although there has been extensive work in function prediction at the sequence level, predicting function at the structure level may provide better sensitivity and predictive value. We describe a method to predict functional sites by automatically creating three dimensional structural motifs from amino acid sequence motifs. These structural motifs perform comparably well with manually generated structural motifs and perform better than sequence motifs. Automatically generated structural motifs can be used for structural-genomic scale function prediction on protein structures. 1 Introduction 1.1 Structural Genomics With the sequencing of the human genome and many other organisms, rapid determination of gene and protein function is becoming increasingly important. Structural genomics initiatives are helping elucidate the function of these gene products by developing high-throughput methods for determining structures for all unique protein folds. These new structure targets are specifically selected not to have sequence similarity to existing proteins1-4. With the anticipated explosion of available structures, it is imperative to develop computational methods for high- throughput function prediction on protein structures.
    [Show full text]
  • The Structure and Function of DNA G-Quadruplexes
    Trends in Chemistry Review The Structure and Function of DNA G-Quadruplexes Jochen Spiegel,1,4 Santosh Adhikari,2,4 and Shankar Balasubramanian1,2,3,* Guanine-rich DNA sequences can fold into four-stranded, noncanonical secondary structures Highlights called G-quadruplexes (G4s). G4s were initially considered a structural curiosity, but recent evi- Endogenous DNA G-quadruplex dence suggests their involvement in key genome functions such as transcription, replication, (G4) structures have been detected genome stability, and epigenetic regulation, together with numerous connections to cancer in human cells and mapped in biology. Collectively, these advances have stimulated research probing G4 mechanisms and genomic DNA and in an endoge- consequent opportunities for therapeutic intervention. Here, we provide a perspective on the nous chromatin context by adapt- structure and function of G4s with an emphasis on key molecules and methodological advances ing next-generation sequencing that enable the study of G4 structures in human cells. We also critically examine recent mecha- approaches, to reveal cell type- and cell state-specific G4 land- nistic insights into G4 biology and protein interaction partners and highlight opportunities for scapes and a strong link of G4s with drug discovery. elevated transcription. Synthetic small molecules and engineered antibodies have been vital to probe Beyond the DNA Double Helix G4 existence and functions in cells. A chemist’s perspective on the function of a molecule, or a system of molecules, is typically led by a consideration of how molecular structure dictates function. The most widely recognised DNA struc- Several endogenous proteins have ture is that of the classical DNA double helix [1], which defines a structural basis for the genetic code been found to interact with DNA G4s, including helicases, transcrip- via defined base-pairing.
    [Show full text]
  • Structure of a Left-Handed DNA G-Quadruplex
    Structure of a left-handed DNA G-quadruplex Wan Jun Chunga, Brahim Heddia, Emmanuelle Schmittb, Kah Wai Lima, Yves Mechulamb, and Anh Tuân Phana,1 aSchool of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371, Singapore; and bLaboratoire de Biochimie, UMR 7654, CNRS, Ecole Polytechnique, Palaiseau 91128, France Edited by Stephen Neidle, University College London, London, United Kingdom, and accepted by the Editorial Board January 19, 2015 (received for review September 29, 2014) Aside from the well-known double helix, DNA can also adopt an G-tetrad layers arranged in the same hydrogen-bond direction- alternative four-stranded structure known as G-quadruplex. Implica- alities (G2·G5·G8·G11 and G3·G6·G9·G12 for the top G4 block tions of such a structure in cellular processes, as well as its therapeutic and G15·G18·G21·G24 and G17·G20·G23·G26 for the bottom G4 and diagnostic applications, have been reported. The G-quadruplex block) (SI Appendix,Fig.S3), connected through a central linker structure is highly polymorphic, but so far, only right-handed helical (T13–T14). All guanine residues adopt the anti glycosidic confor- forms have been observed. Here we present the NMR solution and mation with the exception of G2, which is in the syn conformation. X-ray crystal structures of a left-handed DNA G-quadruplex. The struc- Z-G4 was successfully crystallized in the presence of potassium ture displays unprecedented features that can be exploited as unique ions, using methylpentanediol as precipitating agent. Crystals recognition elements. belonged to the P3221 group and diffracted to 1.5-Å resolution.
    [Show full text]
  • Structural Motifs in Ribosomal Rnas: Implications for RNA Design And
    Julie Zorn1 Hin Hark Gan1,3 Nahum Shiffeldrim1,3 Structural Motifs in Tamar Schlick1–3 Ribosomal RNAs: 1 Department of Chemistry, New York University, Implications for RNA Design 1021 Silver, 100 Washington Square East and Genomics New York, NY 10003 2 Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012 3 Howard Hughes Medical Institute Received 29 April 2003; accepted 15 May 2003 Abstract: The various motifs of RNA molecules are closely related to their structural and functional properties. To better understand the nature and distributions of such structural motifs (i.e., paired and unpaired bases in stems, junctions, hairpin loops, bulges, and internal loops) and uncover characteristic features, we analyze the large 16S and 23S ribosomal RNAs of Escherichia coli. We find that the paired and unpaired bases in structural motifs have characteristic distribution shapes and ranges; for example, the frequency distribution of paired bases in stems declines linearly with the number of bases, whereas that for unpaired bases in junctions has a pronounced peak. Significantly, our survey reveals that the ratio of total (over the entire molecule) unpaired to paired bases (0.75) and the fraction of bases in stems (0.6), junctions (0.16), hairpin loops (0.12), and bulges/internal loops (0.12) are shared by 16S and 23S ribosomal RNAs, suggesting that natural RNAs may maintain certain proportions of bases in various motifs to ensure structural integrity. These findings may help in the design of novel RNAs and in the search (via constraints) for RNA-coding motifs in genomes, problems of intense current focus.
    [Show full text]