The Influence of Commonly Used Tags on Structural


Monatshefte für Chemie - Chemical Monthly (2019) 150:913–925
https://doi.org/10.1007/s00706-019-02401-x

ORIGINAL PAPER

The influence of commonly used tags on structural propensities and internal dynamics of peptides

Maria Bräuer1 · Maria Theresia Zich1 · Kamil Önder2 · Norbert Müller1,3

Received: 11 January 2019 / Accepted: 19 February 2019 / Published online: 29 April 2019
© The Author(s) 2019

Abstract
The influence of biotin and fluorescein tags attached to the N-terminus of peptides on their structural propensities is assessed by NMR methods. While the small peptides investigated are highly mobile with no uniquely preferred conformation, the introduction of the tags, in particular hydrophobic ones, clearly shows an influence on NMR parameters such as chemical shifts and relaxation properties, which are not restricted to the nearby residues, but also affect distant parts. Thus, long-range effects on structural propensities become evident and are cause for concern with respect to the interpretation of weak interaction tests, which rely upon the assumption that tags do not exert influence on intermolecular interactions.

Keywords Peptides · NMR spectroscopy · Conformation · Fluorescence tags · Affinity tags

Electronic supplementary material The online version of this article (https://doi.org/10.1007/s00706-019-02401-x) contains supplementary material, which is available to authorized users.

* Norbert Müller, [email protected]
1 Institut für Organische Chemie, Johannes Kepler Universität Linz, Altenbergerstraße 69, 4040 Linz, Austria
2 ProComCure BioTech GesmbH, 5000 Anif, Austria
3 Faculty of Science, University of South Bohemia, Branišovská 31, 37005 České Budějovice, Czech Republic

Introduction

The covalent attachment of tag groups to peptides and proteins for either purification, immobilization, or spectroscopic detection is a widely applied practice in biomolecular research. For example, affinity tags are included in cloned sequences to facilitate purification of overexpressed recombinant proteins or to improve solubility or to facilitate folding of the target protein [1, 2]. Tags are also employed in Western blotting, enzyme-linked immunosorbent assay (ELISA), or other biological assays [3]. Probably, the most common application of tags is the labelling of biomolecules with fluorescent dyes, for analytical purposes based on fluorophore-based techniques including confocal microscopy, fluorescence resonance energy transfer (FRET), luminescence resonance energy transfer (LRET), or microscale thermophoresis (MST) [4–7].

It is often tacitly assumed that such tags have no or negligible influence on the secondary and tertiary structures of the fused peptides or proteins as well as on their conformational dynamics. Especially small tags, like hexa-histidine, FLAG, Strep II, or chitin-binding protein (CBP), are assumed to have no or only slight impact on protein function and, therefore, often remain attached at the N- or C-termini of proteins, even during structure determination studies. Larger affinity tags including glutathione-S-transferase (GST) and maltose-binding protein (MBP), although increasing solubility [8] need to be removed before structure determination using X-ray or NMR and have also been found to modify intermolecular interactions significantly [9]. While affinity tags introduced by recombinant DNA methods can be removed, e.g., by breaking linker sequence encoded between the tag and the target protein using specific proteases [3], tags introduced via chemical derivatization, like most fluorescence tags, are normally not removed. Although small tags are claimed to have minimal or no impact on the secondary structure of bio-macromolecules, there are some reports of opposite experiences using fluorescent as well as affinity tags like fluorescein, biotin [10–13], or hexa-histidine [14–17]. Most reports about tags affecting structures or functions can be found for hexa-histidine tagged bio-macromolecules, probably because it is the most frequently used tag for the purification of recombinant proteins [18]. The influence of short affinity tags like hexa-histidine and FLAG on crystallisation [14, 19] and on the dynamics in the picosecond range of His-tagged myoglobin used in infrared vibrational echo spectroscopy [15] are prominent examples of such reports.

Conversely, not many reports for effects of biotinylation on structure and function could be found in the literature. It is frequently used to attach peptides to solid surfaces as, for example, in surface plasmon resonance [20–22]. Biotin labelling at a specific lysine residue of human lysozyme did not show any alteration of structure or effects on the binding behaviour [10]. Only a slight decrease in enzymatic activity of the biotinylated sample was observed and explained by the steric hindrance of the biotin moiety preventing binding in spatial proximity to the introduced tag. Another recently published report about a structure modification owed to the biotinylation of aminoacyl-tRNA describes altered growth behaviour at higher temperatures (37 °C), lower heat stability, and less efficiency than the unlabelled counterpart [11].

Fluorescein is frequently used as fluorophore in fluorescence microscopy, immunofluorescence assays, and flow cytometry or fluorescence polarization [23]. Fluorescent tags apparently can impair the functionality of DNA mismatch repair proteins when attached at the C-terminus [24]. Studies on MreB, the actin homologue in bacteria, which is essential for cell wall synthesis, showed that an alpha helix reported in an earlier study was actually induced by a fluorescent tag [25].

For fluorescence polarization assays, especially in the context for high-throughput screening, it has been suggested to determine a competitive binding curve between the labelled probe and the corresponding unlabelled molecule [23]. If the binding curve fits, indicating similar affinity and competitivity, it can be assumed that the tag of the labelled compound does not influence the binding behaviour. This procedure was used in the development of a high-throughput fluorescence polarization anisotropy assay. Although NMR experiments had earlier confirmed that labelled and unlabelled peptides were binding in the same manner, they also revealed small changes in the chemical shifts, differential broadening of several peaks, and a slightly higher binding affinity of the fluorescein isothiocyanate (FITC)-labelled peptide derivative [26]. As reasons for the changes, the authors suggested additional interactions of the protein with either the hydrophilic fluorescein moiety or the aliphatic aminohexanoic (Ahx) linker. The introduction of a spacer between the tag and the first amino acid of the peptide sequence is usually considered sufficient to prevent the influences of the bulky fluorophore on conformation. Short linker sequences can also prevent side reactions upon the acidic cleavage of peptides produced by solid-phase peptide synthesis from the resin, as the cleavage reaction using trifluoroacetic acid (TFA) can lead to reactions following the Edman degradation pathway [27], resulting in the formation of fluorescein thiohydantoin including a cleavage of the first amino acid [28].

During our studies of antimicrobial, conformationally labile peptides, some significant differences in the NMR spectroscopic properties between tagged and native peptides were observed. Here, we report experimental results concerning a dodecapeptide attached to an FITC tag and two 22 amino acid peptides attached to biotin. For all peptides, NMR experiments with and without a tag were performed, and their secondary structure propensities as well as local order parameters were compared on the basis of chemical shifts, scalar coupling constants, and relaxation parameters.

In spite of the lower number of signals, the interpretation of peptide NMR data is often less straightforward than for structured proteins. The problem with structure determination of peptides using solution state NMR is the high flexibility of the molecules leading to the co-existence of a plethora of different conformers in solution. Therefore, the experimental NMR parameters represent weighted averages of several conformers rather than a small number of well-defined global folds. The structural propensities are highly dependent on the environment of the peptide; in polar solvents the contributions of intramolecular hydrogen bonds are minor. Owed to the high flexibility of small peptides, crystallisation is accompanied by conformational selection, which is obscuring or biasing the structural information relevant in solution. Solution NMR is considered a most effective technique for the assessment of peptide conformation and conformational dynamics [29, 30].

The peptides investigated are part of a larger quest for antibacterial activity [31], they are: A0 having the sequence RKKRLKLLKRLL, with fA0 bearing a fluorescein tag (with an Ahx spacer) at the N-terminus; S2 having the amino acid sequence RIQALYSKQPKLVERILTKNEQ; and P1 having the sequence DSLDIAELVMELEDEFGTEIPD, with bioS2 and bioP1 respectively being biotinylated at the N-terminus.

[…] not deviate much from zero, with exception of the backbone nitrogen atoms. The major conformation according to the re-referenced secondary chemical shifts appears to be random coil or dynamic. Remarkably, some of the largest deviations […]
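
The secondary chemical shifts referred to in the text are conventionally obtained by subtracting a random-coil reference value from each observed shift, so that values close to zero point to random-coil-like or dynamically averaged behaviour. A minimal Python sketch of that subtraction is given here for orientation only; the function name `secondary_shifts`, the residue labels, and all numerical values are hypothetical placeholders and are not data or code from the paper.

```python
def secondary_shifts(observed, random_coil):
    """Return per-residue secondary shifts (observed minus random coil), in ppm.

    Both arguments map a residue label to a chemical shift in ppm.
    Residues without a random-coil reference value are skipped.
    """
    return {
        residue: round(shift - random_coil[residue], 3)
        for residue, shift in observed.items()
        if residue in random_coil
    }


# Hypothetical placeholder values (ppm); NOT taken from the paper.
observed = {"L5": 4.30, "K6": 4.25, "L7": 4.40}
random_coil = {"L5": 4.35, "K6": 4.25}

print(secondary_shifts(observed, random_coil))
```

Under these assumptions, a different choice of random-coil reference set would change only the `random_coil` input, not the subtraction itself.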