A Unified Computational View of DNA Duplex, Triplex, Quadruplex and Their

Total Page:16

File Type:pdf, Size:1020Kb

A Unified Computational View of DNA Duplex, Triplex, Quadruplex and Their Published online 24 April 2021 Nucleic Acids Research, 2021, Vol. 49, No. 9 4919–4933 doi: 10.1093/nar/gkab285 A unified computational view of DNA duplex, triplex, quadruplex and their donor–acceptor interactions Gyuri Park1, Byunghwa Kang1, Soyeon V. Park1, Donghwa Lee1,2,3,* and Seung Soo Oh 1,3,4,* 1Department of Materials Science and Engineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, South Korea, 2Division of Advanced Materials Science, Pohang University of Science and Technology (POSTECH), Pohang 37673, South Korea, 3Institute for Convergence Research and Education in Advanced Technology (I-CREATE), Yonsei University, Incheon 21983, South Korea and 4School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, South Korea Received October 15, 2020; Revised April 07, 2021; Editorial Decision April 07, 2021; Accepted April 14, 2021 ABSTRACT INTRODUCTION DNA can assume various structures as a result Deoxyribonucleic acid (DNA) is a unique material as com- of interactions at atomic and molecular levels posed of nitrogenous bases (adenine (A), thymine (T), (e.g., hydrogen bonds, ␲–␲ stacking interactions, guanine (G) or cytosine (C)), sugar rings, and phosphate and electrostatic potentials), so understanding of groups. DNA is programmable, so it can be rationally the consequences of these interactions could guide designed into molecular structures ranging from simple Watson–Crick base-pairing primers to DNA origami-based development of ways to produce elaborate pro- complex 3D constructs (1,2). Even spatial and temporal grammable DNA for applications in bio- and nan- control of DNA nanostructures is achievable; sophisticated otechnology. We conducted advanced ab initio cal- DNA molecular machines can perform a series of nanome- culations to investigate nucleobase model struc- chanical motions in a controllable manner, so this abil- tures by componentizing their donor-acceptor inter- ity provides unprecedented applications in bio- and nan- actions. By unifying computational conditions, we otechnology (3–6). Such active use of DNA requires funda- compared the independent interactions of DNA du- mental understanding of DNA folding and its stabilization. plexes, triplexes, and quadruplexes, which led us In particular, DNA hybridization programming extensively to evaluate a stability trend among Watson–Crick exploits understanding of various donor-acceptor interac- ␲ ␲ and Hoogsteen base pairing, stacking, and even tions of DNA, including hydrogen bonds, – stacking in- ion binding. For a realistic solution-like environ- teractions, and electrostatic potentials (7). Hydrogen bonds and ␲–␲ stacking are among the most ment, the influence of water molecules was care- important intra- and inter-molecular interactions in DNA fully considered, and the potassium-ion preference (8–11). In general, nucleobases create specific hydrogen of G-quadruplex was first analyzed at an ab initio bonds between a purine (A or G) and a pyrimidine (C and level by considering both base-base and ion-water T), which yield planar Watson–Crick base pairs (A–T; G– interactions. We devised new structure factors in- C). Between adjacent base pairs, ␲–␲ stacking interactions cluding hydrogen bond length, glycosidic vector an- result in sequential stacking of base pairs. Therefore, hy- gle, and twist angle, which were highly effective for drogen bonds and ␲–␲ stacking drive the formation of a comparison between computationally-predicted and duplex as a basic structure of DNA. However, the nucle- experimentally-determined structures; we clarified obases can be influenced by different interactions due to pH the function of phosphate backbone during nucle- and metal ions, so triplexes and quadruplexes are sometimes obase ordering. The simulated tendency of net inter- produced (12,13). For instance, C is protonated at slightly acidic pH to become a hydrogen-bond acceptor C+, which action energies agreed well with that of real world, binds to the guanine of a G–C pair as a bond donor to and this agreement validates the potential of ab ini- yield a C+•G–C triad in a DNA triplex (14). The C+ can tio study to guide programming of complicated DNA be also paired with a non-protonated form of C, and the re- constructs. sulting C–C+ pairs involve in formation of C-quadruplex, i.e., i-motif (15). Moreover, guanines can interact strongly *To whom correspondence should be addressed. Tel: +82 54 279 2144; Email: [email protected] Correspondence may also be addressed to Donghwa Lee. Tel: +82 54 279 2160; Email: [email protected] C The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. 4920 Nucleic Acids Research, 2021, Vol. 49, No. 9 with metal ions, and monovalent cations can influence a structures by ab initio calculations and experimentally- guanine-rich DNA strand to form a G-quadruplex (16,17). determined structures by X-ray crystallography and nuclear DNA should build thermodynamically-favored structures, magnetic resonance (NMR) spectroscopy, and those anal- which is a basic principle behind the spatiotemporal control yses were highly useful to identify how phosphate back- of DNA nanoconstructs by modifying their environmental bones influence nucleobase interactions. Owing to the uni- conditions. fied computational view, various DNA constructs could Rational creation of advanced DNA structures requires be compared to each other to provide useful information, in-depth understanding of DNA interactions under a vari- such as a stability trend among Watson–Crick base pairing, ety of conditions; ab initio calculations can provide valu- Hoogsteen base pairing, stacking, and metal-ion binding. able insights in the behavior of proposed designs (18). This method provides reasonable values of dipole moments, charge distributions, and vibrational frequencies, so it can MATERIALS AND METHODS be useful to describe non-covalent molecular interactions (19,20). Importantly, all donor-acceptor interactions can For our ab initio calculations, we used the Gaussian 09 be easily itemized, so that their independent functions in (G09) program. To effectively investigate nucleobase inter- structure stabilization can be readily analyzed, and the rele- actions, DNA backbones including sugar rings and phos- vant net energy is computed precisely. Therefore, the ab ini- phate groups were replaced with hydrogen atoms. Based on tio simulation has been exploited to interpret nucleic acid the ideal structure of B-DNA, an initial model of two-layer interactions. However, most previous studies have consid- structure was constructed with a parallel arrangement of base-pair layers, maintaining 3.4-A˚ stacking distance and ered only interactions in gas phase (21–24), whereas actual ◦ DNA structures are in aqueous solutions. Some researchers 36 angle under its center of mass. Previous calculation have conducted phase-dependent simulation (25–27), but and experimental determination reports have validated the feasibility of this model (24,29). The double-layer model the calculation works used different computational condi- = tions, making itemized DNA interactions not comparable was guided to initially have a fixed dihedral angle (Opt to each other. Furthermore, the difference between compu- ModRed), minimizing possible non-planarity caused by the tational and experimental results could not be well inter- absence of a backbone, and its optimization process was then followed. Every structure optimization and relaxation preted due to lack of effective analytical factors (26,28); in / // / some calculations, the Watson–Crick base pair was less sta- was conducted under M05–2X 6–31G(d,p) M05–2X 6– ble than a mismatch (MM) pair, i.e., a non-Watson–Crick 31G(d,p) condition, and the basis set superposition error base pair (26), and stacking of bases did not yield planarity (BSSE) calculations were further corrected by the counter- (28). Therefore, there is a strong need for more realistic cal- poise (CP) method (30). The water solvent effect was real- culations of itemized DNA interactions under the unified ized by using a conductor-like screening model (COSMO) conditions and analyses, making the computational results (31). comparable with experimental ones. Shapes of DNA structures were determined by consider- In this work, we conducted advanced ab initio calcu- ing the numbers of bases and layers. The number of bases af- fects the hydrogen bonding among the nucleobases, whereas lations to simulate DNA duplexes, triplexes, and quadru- ␲ ␲ plexes by using identical computation conditions and ratio- the number of layers represents – stacking interactions nally itemized donor-acceptor interactions (Scheme 1a). To among the base-pair layers. Metal ions were added to con- provide the fundamental information for designing and ma- struct G-tetrad or G-quadruplex structures in which the nipulating complicated DNA structures along with predic- cations interact electrochemically with guanines. All interactions are expressible in simple formulae. The tions of their potential stability and transient motions, we systematically analyzed stability orders of the itemized in- binding energy Ebind is determined as the
Recommended publications
  • 1 History for Canonical and Non-Canonical Structures of Nucleic Acids
    1 1 History for Canonical and Non-canonical Structures of Nucleic Acids The main points of the learning: Understand canonical and non-canonical structures of nucleic acids and think of historical scientists in the research field of nucleic acids. 1.1 Introduction This book is to interpret the non-canonical structures and their stabilities of nucleic acids from the viewpoint of the chemistry and study their biological significances. There is more than 60 years’ history after the discovery of the double helix DNA structure by James Dewey Watson and Francis Harry Compton Crick in 1953, and chemical biology of nucleic acids is facing a new aspect today. Through this book, I expect that readers understand how the uncommon structure of nucleic acids became one of the common structures that fascinate us now. In this chapter, I introduce the history of nucleic acid structures and the perspective of research for non-canonical nucleic acid structures (see also Chapter 15). 1.2 History of Duplex The opening of the history of genetics was mainly done by three researchers. Charles Robert Darwin, who was a scientist of natural science, pioneered genetics. The proposition of genetic concept is indicated in his book On the Origin of Species published in 1859. He indicated the theory of biological evolution, which is the basic scientific hypothesis of natural diversity. In other words, he proposed biological evolution, which changed among individuals by adapting to the environment and be passed on to the next generation. However, that was still a primitive idea for the genetic concept. After that, Gregor Johann Mendel, who was a priest in Brno, Czech Republic, confirmed the mechanism of gene evolution by using “factor” inherited from parent to children using pea plant in 1865.
    [Show full text]
  • Accurate Energies of Hydrogen Bonded Nucleic Acid Base Pairs and Triplets in Trna Tertiary Interactions Romina Oliva1,2,*, Luigi Cavallo3 and Anna Tramontano2
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by PubMed Central Nucleic Acids Research, 2006, Vol. 34, No. 3 865–879 doi:10.1093/nar/gkj491 Accurate energies of hydrogen bonded nucleic acid base pairs and triplets in tRNA tertiary interactions Romina Oliva1,2,*, Luigi Cavallo3 and Anna Tramontano2 1Centro Linceo Interdisciplinare ‘Beniamino Segre’, Accademia dei Lincei, I-00165 Rome, Italy, 2Dipartimento di Scienze Biochimiche ‘A. Rossi Fanelli’, Universita` di Roma ‘La Sapienza’, I-00185 Rome, Italy and 3Dipartimento di Chimica, Universita` di Salerno, Via Salvador Allende, Baronissi (SA), I-84081, Italy Received December 4, 2005; Revised and Accepted January 19, 2006 ABSTRACT intercalated A58–G18–A/G57–G19 purine bases. All these interactions are highly conserved in cytosolic tRNAs, indic- Tertiary interactions are crucial in maintaining the ating their importance for the tRNA function. Available tRNA structure and functionality. We used a com- experimental data also support the functional importance of bined sequence analysis and quantum mechanics such a region. Mutants of the Escherichia coli tRNAAla(CUA) approach to calculate accurate energies of the most obtained by random mutagenesis and having at least one of frequent tRNA tertiary base pairing interactions. Our the above-mentioned interactions disrupted were shown to be analysis indicates that six out of the nine classical non-functional (6), and insertion of additional nucleotides into tertiary interactions are held in place mainly by the T-loop or deletion of G19 from the D-loop affected the H-bonds between the bases. In the remaining three accuracy of the codon–anticodon recognition, resulting in a cases other effects have to be considered.
    [Show full text]
  • Plant G-Quadruplex (G4) Motifs in DNA and RNA; Abundant, Intriguing Sequences of Unknown Function T
    Plant Science 269 (2018) 143–147 Contents lists available at ScienceDirect Plant Science journal homepage: www.elsevier.com/locate/plantsci Review article Review: Plant G-quadruplex (G4) motifs in DNA and RNA; abundant, intriguing sequences of unknown function T ⁎ Brianna D. Griffin, Hank W. Bass Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, FL, 32306-4295, USA ARTICLE INFO ABSTRACT Keywords: DNA sequences capable of forming G-quadruplex (G4) structures can be predicted and mapped in plant genomes G-quadruplex using computerized pattern search programs. Non-telomeric G4 motifs have recently been found to number in G4 the thousands across many plant species and enriched around gene promoters, prompting speculation that they Gene regulation may represent a newly uncovered and ubiquitous family of cis-acting elements. Comparative analysis shows that cis-regulatory element monocots exhibit five to ten times higher G4 motif density than eudicots, but the significance of this difference has not been determined. The vast scale and complexity of G4 functions, actual or theoretical, are reviewed in relation to the multiple modes of action and myriad genetic functions for which G4s have been implicated in DNA and RNA. Future experimental strategies and opportunities include identifying plant G4-interactomes, resolving the structures of G4s with and without their binding partners, and defining molecular mechanisms through reporter gene, genetic, or genome editing approaches. Given the global importance of plants for food, clothing, medicine, and energy, together with the potential role of G4 motifs as a widely conserved set of DNA sequences that could coordinate gene regulation, future plant G4 research holds great potential for use in plant improvement strategies.
    [Show full text]
  • De Novo Nucleic Acids: a Review of Synthetic Alternatives to DNA and RNA That Could Act As † Bio-Information Storage Molecules
    life Review De Novo Nucleic Acids: A Review of Synthetic Alternatives to DNA and RNA That Could Act as y Bio-Information Storage Molecules Kevin G Devine 1 and Sohan Jheeta 2,* 1 School of Human Sciences, London Metropolitan University, 166-220 Holloway Rd, London N7 8BD, UK; [email protected] 2 Network of Researchers on the Chemical Evolution of Life (NoR CEL), Leeds LS7 3RB, UK * Correspondence: [email protected] This paper is dedicated to Professor Colin B Reese, Daniell Professor of Chemistry, Kings College London, y on the occasion of his 90th Birthday. Received: 17 November 2020; Accepted: 9 December 2020; Published: 11 December 2020 Abstract: Modern terran life uses several essential biopolymers like nucleic acids, proteins and polysaccharides. The nucleic acids, DNA and RNA are arguably life’s most important, acting as the stores and translators of genetic information contained in their base sequences, which ultimately manifest themselves in the amino acid sequences of proteins. But just what is it about their structures; an aromatic heterocyclic base appended to a (five-atom ring) sugar-phosphate backbone that enables them to carry out these functions with such high fidelity? In the past three decades, leading chemists have created in their laboratories synthetic analogues of nucleic acids which differ from their natural counterparts in three key areas as follows: (a) replacement of the phosphate moiety with an uncharged analogue, (b) replacement of the pentose sugars ribose and deoxyribose with alternative acyclic, pentose and hexose derivatives and, finally, (c) replacement of the two heterocyclic base pairs adenine/thymine and guanine/cytosine with non-standard analogues that obey the Watson–Crick pairing rules.
    [Show full text]
  • Conformation and Polymorphism in PNA-DNA and PNA-RNA Hybrids (Chimera/Duplex/Triple Helix/Molecular Mechanics)
    Proc. Natl. Acad. Sci. USA Vol. 90, pp. 9542-9546, October 1993 Biochemistry Peptide nucleic acid (PNA) conformation and polymorphism in PNA-DNA and PNA-RNA hybrids (chimera/duplex/triple helix/molecular mechanics) ORN ALMARSSON AND THOMAS C. BRUICE Department of Chemistry, University of California at Santa Barbara, Santa Barbara, CA 93106 Contributed by Thomas C. Bruice, July 19, 1993 ABSTRACT Two hydrogen-bonding motifs have been pro- nucleic acids, is their stability toward nucleases. Although the posed to account for the extraordinary stability of polyamide designing of the PNA analogues of DNA has been described "peptide" nucleic acid (PNA) hybrids with nucleic acids. These by the inventors (4), the design rationale does not explain the interresidue- and intraresidue-hydrogen-bond motifs were in- great stability of the PNA hybrids with nucleic acids, as vestigated by molecular mechanics calculations. Energy- exemplified by PNA-strand invasion of double-helical DNA. iinimized structures of Watson-Crick base-paired decameric Our recent molecular mechanics study (5) with the B-helix duplexes of PNA with A-, B-, and Z-DNA and A-RNA poly- structure of the DNA*PNA duplex (dAp)lo'(pnaT)lo [where morphs indicate that the inherent stability of the complemen- (dAp)10 is the decanucleotide of 2'-deoxyadenosine 5'- tary PNA helical structures is derived from interresidue, rather phosphate and (pnaT)lo is the decamer of thyminyl PNA with than from intraresidue, hydrogen bonds in all hybrids studied. lysine amide attached to the C terminus] strongly suggests that Intraresidue-hydrogen-bond lengths are consistently longer the stability of DNA*PNA duplexes is due to interresidue than interresidue hydrogen bonds.
    [Show full text]
  • Binary Complex Crystal Structure of DNA Polymerase Β Reveals Multiple Conformations of the Templating 8-Oxoguanine Lesion
    Binary complex crystal structure of DNA polymerase β reveals multiple conformations of the templating 8-oxoguanine lesion Vinod K. Batraa, David D. Shocka, William A. Bearda, Charles E. McKennab, and Samuel H. Wilsona,1 aLaboratory of Structural Biology, National Institute of Environmental Health Sciences, National Institutes of Health, P.O. Box 12233, Research Triangle Park, NC 27709-2233; and bDepartment of Chemistry, University of Southern California, Los Angeles, CA 90089-0744 Edited by Philip C. Hanawalt, Stanford University, Stanford, CA, and approved October 31, 2011 (received for review July 27, 2011) Oxidation of genomic DNA forms the guanine lesion 7,8-dihydro-8- well-extended by DNA polymerases facilitating G to T mutagen- oxoguanine (8-oxoG). When in the template base position during esis. The structural basis for DNA polymerase preferences in DNA synthesis the 8-oxoG lesion has dual coding potential by vir- mutagenic bypass of the 8-oxoG lesion is emerging across the tue of its anti- and syn-conformations, base pairing with cytosine DNA polymerase families (8–12) and is the subject of the present and adenine, respectively. This impacts mutagenesis, because inser- report. tion of adenine opposite template 8-oxoG can result in a G to T Repair of the 8-oxoG-DNA lesion in mammalian cells pro- transversion. DNA polymerases vary by orders of magnitude in ceeds by several DNA repair pathways (13–15), but the major their preferences for mutagenic vs. error-free 8-oxoG lesion bypass. pathway is considered to be a form of the base excision repair Yet, the structural basis for lesion bypass specificity is not well un- (BER) pathway referred to as the “GO system” (16).
    [Show full text]
  • Structural Basis of Error-Prone Replication and Stalling at a Thymine Base by Human DNA Polymerase Iota Kevin N
    Western University Scholarship@Western Biochemistry Publications Biochemistry Department 6-3-2009 Structural Basis of Error-prone Replication and Stalling at a Thymine Base by Human DNA Polymerase iota Kevin N. Kirouac University of Western Ontario, [email protected] Hong Ling University of Western Ontario, [email protected] Follow this and additional works at: https://ir.lib.uwo.ca/biochempub Part of the Biochemistry Commons Citation of this paper: Kirouac, Kevin N. and Ling, Hong, "Structural Basis of Error-prone Replication and Stalling at a Thymine Base by Human DNA Polymerase iota" (2009). Biochemistry Publications. 66. https://ir.lib.uwo.ca/biochempub/66 Structural basis of error-prone replication and stalling at a thymine base by human DNA polymerase ι Kevin N Kirouac & Hong Ling* Department of Biochemistry, University of Western Ontario, London, Ontario, Canada, N6A 5C1 *Correspondence should be addressed to H. L . ( [email protected] ) Subject Category: Structural Biology; Genome Stability and Dynamics Characters (with spaces) 48 073 Figures 5 Table 1 Running Title: Polymerase ι misinserts and stalls at T bases Competing interest statement: The authors declare no competing financial interests 1 ABSTRACT Human DNA polymerase ι (pol ιιι) is a unique member of Y-family polymerases, which preferentially misincorporates nucleotides opposite thymines (T), and halts replication at T bases. The structural basis of the high error rates remains elusive. We present three crystal structures of pol ι complexed with DNA containing a thymine base, paired with correct or incorrect incoming nucleotides. A narrowed active site supports a pyrimidine:pyrimidine mismatch and excludes Watson-Crick base pairing in pol ιιι.
    [Show full text]
  • How Does Human 8-Oxoguanine DNA Glycosylase 1 (OGG1) Find 8-Oxoguanine in the Genome? Ostiane D’Augustin, Sébastien Huet, Anna Campalans, Juan Pablo Radicella
    Lost in the Crowd: How Does Human 8-Oxoguanine DNA Glycosylase 1 (OGG1) Find 8-Oxoguanine in the Genome? Ostiane D’augustin, Sébastien Huet, Anna Campalans, Juan Pablo Radicella To cite this version: Ostiane D’augustin, Sébastien Huet, Anna Campalans, Juan Pablo Radicella. Lost in the Crowd: How Does Human 8-Oxoguanine DNA Glycosylase 1 (OGG1) Find 8-Oxoguanine in the Genome?. International Journal of Molecular Sciences, MDPI, 2020, 21 (21), 10.3390/ijms21218360. hal- 03007117 HAL Id: hal-03007117 https://hal.archives-ouvertes.fr/hal-03007117 Submitted on 16 Nov 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution| 4.0 International License International Journal of Molecular Sciences Review Lost in the Crowd: How Does Human 8-Oxoguanine DNA Glycosylase 1 (OGG1) Find 8-Oxoguanine in the Genome? Ostiane D’Augustin 1,2 ,Sébastien Huet 2,3,* , Anna Campalans 1,* and Juan Pablo Radicella 1,* 1 Institute of Cellular and Molecular Radiobiology, Institut de Biologie François Jacob, CEA, Université Paris-Saclay,
    [Show full text]
  • Reverse Watson-Crick Base-Pairs in Theá D(GCGCGCG)/D(TCGCGCG)Á and D(GCGCGCG)/ D(CCGCGCG) Heteroduplexes, Respectively
    J. Mol. Biol. (1997) 269, 796±810 The Structures and Relative Stabilities of d(G G) Reverse Hoogsteen, d(G T) Reverse Wobble,Á and d(G C) Reverse Watson-CrickÁ Base-pairs in DNAÁ Crystals BlaineH.M.Mooers,BrandtF.EichmanandP.ShingHo* 5 Department of Biochemistry We have solved the structures of the homoduplex d(Gm CGCGCG)2, and and Biophysics, ALS 2011 the heteroduplexes d(GCGCGCG)/d(TCGCGCG) and d(GCGCGCG)/ Oregon State University d(CCGCGCG). The structures form six base-pairs of identical Z-DNA Corvallis, OR 97331, USA duplexes with single nucleotides overhanging at the 50-ends. The over- hanging nucleotide from one strand remains stacked and sandwiched between the blunt-ends of two adjacent Z-DNA duplexes, while the over- hanging base of the opposing strand is extra-helical. The stacked and the extra-helical bases from adjacent duplexes pair to form a distorted 5 d(G G) reverse Hoogsteen base-pair in the d(Gm CGCGCG)2 homo- duplex,Á and d(G T) reverse wobble and d(G C) reverse Watson-Crick base-pairs in theÁ d(GCGCGCG)/d(TCGCGCG)Á and d(GCGCGCG)/ d(CCGCGCG) heteroduplexes, respectively. Interestingly, only the d(G,T) and d(G C) base-pairs were observed in the heteroduplexes, suggesting that bothÁ the d(G T) reverse wobble and d(G C) reverse Watson-Crick base-pairs are moreÁ stable in this crystal environmentÁ than the d(G G) reverse Hoogsteen base-pair. To estimate the relative stability of the threeÁ types of reverse base-pairs, crystals were grown using various mixtures of sequences and their strand compositions analyzed by mass spec- trometry.
    [Show full text]
  • A DNA G-Quadruplex/I-Motif Hybrid
    bioRxiv preprint doi: https://doi.org/10.1101/737296; this version posted August 15, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. A DNA G-quadruplex/i-motif hybrid Betty Chu, Daoning Zhang, and Paul J. Paukstelis* Department of Chemistry and Biochemistry, Center for Biomolecular Structure and Organization, University of Maryland, College Park, Maryland, 20742, United States Correspondence: Paul J. Paukstelis [email protected] 1 bioRxiv preprint doi: https://doi.org/10.1101/737296; this version posted August 15, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract DNA can form many structures beyond the canonical Watson-Crick double helix. It is now clear that noncanonical structures are present in genomic DNA and have biological functions. G-rich G-quadruplexes and C-rich i-motifs are the most well-characterized noncanonical DNA motifs that have been detected in vivo with either proscribed or postulated biological roles. Because of their independent sequence requirements, these structures have largely been considered distinct types of quadruplexes. Here, we describe the crystal structure of the DNA oligonucleotide, d(CCAGGCTGCAA), that self-associates to form a quadruplex structure containing two central antiparallel G-tetrads and six i-motif C-C+ base pairs. Solution studies suggest a robust structural motif capable of assembling as a tetramer of individual strands or as a dimer when composed of tandem repeats. This hybrid structure highlights the growing structural diversity of DNA and suggests that biological systems may harbor many functionally important non-duplex structures.
    [Show full text]
  • Canonical and Non-Canonical Base Pairs in DNA Or RNA: Structure, Function and Dynamics
    Canonical and Non-canonical Base Pairs in DNA or RNA: Structure, Function and Dynamics Dhananjay Bhattacharyya1 and Abhijit Mitra2 1 Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata 700064, INDIA 2 Center for Computational Natural Sciences and Bioinformatics (CCNSB), International Institute of Information Technology (IIIT-H), Gachibowli, Hyderabad 500032, INDIA ABSTRACT: Non-canonical base pairs are planar hydrogen bonded pairs of nucleobases, having hydrogen bonding patterns which differ from the patterns observed in Watson-Crick base pairs, as in the classic double helical DNA. The structures of polynucleotide strands of both DNA and RNA molecules can be understood in terms of sugar-phosphate backbones consisting of phosphodiester-linked D-2‘-deoxyribofuranose (D-ribofuranose in RNA) sugar moieties, with purine or pyrimidine nucleobases covalently linked to them. Here, the N9 atoms of the purines, guanine and adenine, and the N1 atoms of the pyrimidines, cytosine and thymine (uracil in RNA), respectively, form glycosidic linkages with the C1‘ atom of the sugars. These nucleobases can be schematically represented as triangles with one of their vertices linked to the sugar, and the three sides accounting for three edges through which they can form hydrogen bonds with other moieties, including with other nucleobases. As also explained in greater details later in this article, the side opposite to the sugar linked vertex is traditionally called the Watson-Crick edge, since they are involved in forming the Watson- Crick base pairs which constitute building blocks of double helical DNA. The two sides adjacent to the sugar-linked vertex are referred to, respectively, as the sugar and Hoogsteen (C-H for pyrimidines) edges.
    [Show full text]
  • Hoogsteen Base Pairs Increase the Susceptibility of Double-Stranded DNA To
    bioRxiv preprint doi: https://doi.org/10.1101/2020.05.24.113951; this version posted May 27, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Hoogsteen base pairs increase the susceptibility of double-stranded DNA to cytotoxic damage Yu Xu1#, Akanksha Manghrani2#, Bei Liu2, Honglue Shi1, Uyen Pham2, Amy Liu, and Hashim M. Al-Hashimi*1,2 1Department of Chemistry, Duke University, Durham, NC, 27710, USA 2Department of Biochemistry, Duke University School of Medicine, Durham, NC, 27710, USA *Corresponding author: Hashim M. Al-Hashimi Email: [email protected] #These authors contributed equally to this work Running title: Hoogsteen-mediated DNA damage Keywords. DNA dynamics; sequencing; m1A; AlkB; echinomycin 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.05.24.113951; this version posted May 27, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract: As the Watson-Crick faces of nucleobases are protected in double-stranded DNA (dsDNA), it is commonly assumed that deleterious alkylation damage to the Watson-Crick faces of nucleobases predominantly occurs when DNA becomes single-stranded during replication and transcription. However, damage to the Watson-Crick faces of nucleobases has been reported in dsDNA in vitro through mechanisms that are not understood. In addition, the extent of protection from methylation damage conferred by dsDNA relative to single-stranded DNA (ssDNA) has not been quantified. Watson-Crick base-pairs in dsDNA exist in dynamic equilibrium with Hoogsteen base-pairs that expose the Watson-Crick faces of purine nucleobases to solvent.
    [Show full text]