Digital Biology

Total Page:16

File Type:pdf, Size:1020Kb

Digital Biology Digital Biology L. Ridgway Scott University of Chicago Ariel Fern´andez Rice University Release 0.4 DO NOT DISTRIBUTE October 3, 2014 Draft: October 3, 2014, do not distribute Contents 1 How is biology digital? 1 1.1 Exercises....................................... ... 4 2 Digital rules for proteins 5 2.1 Digitalnatureofmolecules. ........ 5 2.1.1 Digitalnatureofatoms. .... 5 2.1.2 Ionicrules.................................... 9 2.1.3 Carbon/hydrogenrules. .... 9 2.2 Digitalnatureofproteins. ........ 10 2.2.1 Hydrophobicityandhydrophilicity . ....... 11 2.2.2 Solvation =salvation............................... 11 6 2.2.3 Energeticambivalence . .... 12 2.3 Pchemomics...................................... .. 13 2.3.1 Anewtool?.................................... 14 2.3.2 Dataminingdefinition . ... 15 2.3.3 Datamininglens ................................ 15 2.3.4 Hydrogen bonds are orientation-dependent . ......... 16 2.3.5 Whatisananswer?............................... 16 2.4 Multiscalemodels................................ ..... 17 2.5 Hydrophobicinteractions. ....... 19 2.5.1 Solventmediationofelectricforces . ......... 20 2.5.2 Dehydrons..................................... 22 2.5.3 Dynamicsofdehydrons. ... 22 2.5.4 Simulateddynamics ............................. .. 25 2.5.5 Stickinessofdehydrons. ..... 26 2.6 Dehydronswitch .................................. ... 26 2.7 Exercises....................................... ... 27 3 Electrostatic forces 29 3.1 Directbonds..................................... ... 29 3.1.1 Covalentbonds ................................. 29 3.1.2 Ionicbonds/saltbridges . ..... 30 i CONTENTS CONTENTS 3.1.3 Hydrogenbonds.................................. 31 3.1.4 Cation-π interactions............................... 32 3.2 Charge-forcerelationship . ......... 32 3.3 Interactionsinvolvingdipoles . .......... 33 3.3.1 Single-file dipole-dipole interactions . ............ 34 3.3.2 Parallel dipole-dipole interactions . ........... 36 3.3.3 Dipolestability ............................... ... 37 3.3.4 Differentdipoles............................... ... 39 3.4 vanderWaalsforces ............................... .... 39 3.4.1 Lennard-Jonespotentials. ...... 40 3.5 Induceddipoles .................................. .... 41 3.5.1 Debyeforces................................... 41 3.5.2 Londondispersionforces . ..... 42 3.5.3 Dipole-neutralinteractions . ........ 43 3.6 Exercises....................................... ... 44 4 Protein basics 45 4.1 Chainsofaminoacidresidues . ....... 45 4.1.1 Taxonomiesofaminoacids. .... 46 4.1.2 Wrappingofhydrogenbonds. .... 49 4.2 Specialbonds .................................... ... 51 4.2.1 SaltBridges................................... 51 4.2.2 Disulfidebonds ................................. 51 4.3 Post-translationalmodifications . ........... 53 4.4 Specialsidechains ............................... ..... 53 4.4.1 Glycine(andAlanine) ............................ .. 54 4.4.2 Proline....................................... 54 4.4.3 Arginine...................................... 54 4.4.4 Lysine ....................................... 55 4.4.5 Cysteine...................................... 55 4.4.6 Aromaticsidechains ............................. .. 55 4.4.7 Rememberingcodenames . .. 56 4.5 Sidechainionization. ....... 56 4.6 Saltions........................................ .. 58 4.7 Aminoacidfrequencies............................. ..... 58 4.7.1 Choiceofdatabase .............................. .. 59 4.7.2 PDBdetails.................................... 60 4.7.3 Hetatoms ..................................... 61 4.8 Exercises....................................... ... 61 Draft: October 3, 2014, do not distribute ii CONTENTS CONTENTS 5 Protein Structure 65 5.1 Hydrogenbondsandsecondarystructure . ......... 65 5.1.1 Secondarystructureunits . ..... 66 5.1.2 Folds/domains ................................. 68 5.2 Mechanicalpropertiesofproteins . .......... 69 5.2.1 Conformationalgeometryofproteins . ........ 69 5.2.2 Dihedralangles ................................ .. 71 5.2.3 φ,ψ versus ψ,φ: the role of θ .......................... 73 5.2.4 Sidechainrotamers ............................. ... 73 5.3 Pairfrequenciesinproteins. ......... 74 5.3.1 Pairfrequencydefinitions . ..... 74 5.3.2 Applicationtoproteinsequences . ...... 76 5.3.3 Like-chargedpairs ............................. ... 76 5.4 Volumeofproteinconstituents. ......... 77 5.5 Foldnetworks .................................... ... 80 5.6 Exercises....................................... ... 82 6 Hydrogen bonds 85 6.1 Typesofhydrogenbonds. ..... 86 6.1.1 Hydrogenbondsinproteins . ... 87 6.1.2 Hydrogenbondstrength . .. 87 6.2 Identificationofhydrogenpositions . ........... 90 6.2.1 Fixedhydrogenpositions. ..... 91 6.2.2 Variablehydrogenpositions . ...... 91 6.2.3 Ambiguoushydrogenpositions. ..... 91 6.3 Geometriccriteriaforhydrogenbonds . .......... 93 6.4 Specialclassesofhydrogenbonds . ......... 94 6.4.1 Saltbridgehydrogenbonds . .... 94 6.4.2 Carboxyl-carboxylate hydrogen bonds. ......... 94 6.4.3 Aromatichydrogenbonds . .. 95 6.4.4 Carbonaceoushydrogenbonds. ..... 95 6.5 Exercises....................................... ... 95 7 Determinants of protein-protein interfaces 97 7.1 Directbonds..................................... ... 97 7.2 Hydrophobicinteractions. ....... 98 7.3 Amino acids at protein-protein interfaces . ............ 100 7.4 Interfacepropensity. ....... 103 7.5 Aminoacidpairsatinterfaces . ....... 106 7.5.1 Pairinteractionsatinterfaces . ........ 106 7.5.2 Coreversusrim................................. 108 7.5.3 Furtherstudies ................................ 110 Draft: October 3, 2014, do not distribute iii CONTENTS CONTENTS 7.6 Hotspots ......................................... 111 7.7 Conclusionsaboutinterfaces . ......... 111 7.8 Exercises....................................... 112 8 Wrapping electrostatic bonds 115 8.1 Roleofnonpolargroups ............................ ..... 115 8.1.1 Unitofhydrophobicity . 117 8.1.2 Defininghydrophobicity . .... 117 8.2 Assessingpolarity................................ ..... 119 8.2.1 Electronegativityscale . ...... 119 8.2.2 Polarityofgroups.............................. 120 8.3 Countingresidues................................ ..... 122 8.3.1 Desolvationdomain. 123 8.3.2 Predictingaggregation . ..... 125 8.4 Countingnonpolargroups . ...... 126 8.4.1 Distribution of wrapping for an antibody complex . ........... 127 8.4.2 Residuesversuspolargroups. ...... 128 8.5 Defining dehydrons via geometric requirements . ............. 129 8.6 Dynamicmodels ................................... 132 8.7 Geneticcode..................................... 133 8.7.1 Interpretationofthegeneticcode . ........ 135 8.7.2 Sizematters................................... 137 8.8 Exercises....................................... 138 9 Stickiness of dehydrons 141 9.1 Surfaceadherenceforce. ....... 141 9.1.1 Biologicalsurfaces . .... 141 9.1.2 Solubleproteinsonasurface. ...... 142 9.2 Atwo-zonemodel.................................. 142 9.2.1 Boundaryzonemodel. 143 9.2.2 Diffusionzonemodel ............................. 143 9.2.3 Modelvalidity................................. 144 9.3 Directforcemeasurement . ...... 144 9.4 Membranemorphology .............................. .... 146 9.4.1 Proteinadsorption ............................. 146 9.4.2 Densityofinvaginations . ..... 147 9.5 Kineticmodelofmorphology. ....... 147 9.6 Exercises....................................... 148 10 Electrostatic force details 151 10.1 Globalaccumulationofelectricforce . ............ 151 10.2 Dipole-dipoleinteractions . .......... 153 10.2.1 Dipole-dipoleconfigurations . ........ 154 Draft: October 3, 2014, do not distribute iv CONTENTS CONTENTS 10.2.2 Two-parameter interaction configuration . ........... 156 10.2.3 Threedimensionalinteractions. ......... 159 10.2.4 Applicationtotwo-angleproblem . ....... 161 10.3 Chargedinteractions . ....... 161 10.3.1 Charge-dipoleinteractions . ........ 162 10.3.2 Charge-chargeinteractions . ........ 163 10.4 Generalformofachargegroup . ....... 165 10.4.1 Asymptoticsofgeneralpotentials . ........ 166 10.4.2 Applicationof(10.54) . .... 167 10.5 Quadrupolepotential . ....... 168 10.5.1 Opposingdipoles .............................. 168 10.5.2 Four-cornerquadrupole. ...... 169 10.5.3 Quadrupoleexample . 170 10.5.4 Water: dipoleorquadrupole? . ...... 171 10.6 Multipolesummary............................... ..... 173 10.7 Madelungconstants. ...... 173 10.8Furtherresults ................................. ..... 175 10.8.1 Dipoleinductionbydipoles . ...... 175 10.8.2 Modifieddipoleinteraction. ....... 176 10.8.3 HydrogenplacementforSerandThr . ..... 177 10.9Exercises...................................... .... 178 11 Case studies 181 11.1Basiccases..................................... .... 181 11.1.1 Asingularcase: signaling . ...... 181 11.1.2 Formingstructures . .... 182 11.1.3 Crystalcontacts. .... 183 11.2 Enzymaticactivity . .. .. .. .. .. ... .. .. .. .. .. ... .. ...... 184 11.3 Neurophysin/vasopressinbinding . .......... 185 11.3.1 Theroleoftyrosine-99 . ..... 185 11.3.2 Theroleofphenylalanine-98 . ....... 187 11.3.3 2BN2versus1JK4 ................................ 188 11.4 Variations in proteomic interactivity . ............. 188 11.4.1 Interactivitycorrelation . ........ 190 11.4.2 Structuralinteractivity . ........ 192 11.5 Sheetsofdehydrons. ...... 195 11.6Exercises...................................... .... 195 12 Aromatic interactions 197 12.1 Partialchargemodel . ...... 198 12.2 Cation-π interactions..................................
Recommended publications
  • Amino Acid Building Block Models – in Brief
    Amino Acid Building Block Models – In Brief Key Teaching Points for Amino Acid Building Block Models© Overall Student Learning Objective: What Dictates How a Protein Folds? Amino acids are the building blocks of proteins. All amino acids have an identical core structure consisting of an alpha-carbon, carboxyl group, amino group and R-group (sidechain). A linear chain of amino acids is a polypeptide. The primary sequence of a protein is the linear sequence of amino acids in a polypeptide. Proteins are made up of amino acid monomers linked together by peptide bonds. Peptide bond formation between amino acids results in the release of water (dehydration synthesis or condensation reaction). The protein backbone is characterized by the “N-C-C-N-C-C. .” pattern. The “ends” of the protein can be identified by the N-terminus (amino group) end and the C-terminus (carboxyl group) end. For a more complete lesson guide, please visit: http://www.3dmoleculardesigns.com/3DMD-Files/AABB/ContentsandAssembly.pdf Amino Acid Core Structure Build an amino acid according to the diagram to the right: 1. Identify the alpha carbon, amino group, carboxyl group and R-group (sidechain representation) in the structure you have constructed. Two amino acids can be chemically linked by a reaction called “condensation” or “dehydration synthesis” to form a dipeptide bond linking the two amino acids. A chain of amino acid units (monomers) linked together by peptide bonds is called a polypeptide. General Dipeptide Structure Construct a model of a dipeptide using the amino acid models previously built. 2. What are the products of the condensation reaction (dehydration synthesis)? 3.
    [Show full text]
  • Introduction to Proteins and Amino Acids Introduction
    Introduction to Proteins and Amino Acids Introduction • Twenty percent of the human body is made up of proteins. Proteins are the large, complex molecules that are critical for normal functioning of cells. • They are essential for the structure, function, and regulation of the body’s tissues and organs. • Proteins are made up of smaller units called amino acids, which are building blocks of proteins. They are attached to one another by peptide bonds forming a long chain of proteins. Amino acid structure and its classification • An amino acid contains both a carboxylic group and an amino group. Amino acids that have an amino group bonded directly to the alpha-carbon are referred to as alpha amino acids. • Every alpha amino acid has a carbon atom, called an alpha carbon, Cα ; bonded to a carboxylic acid, –COOH group; an amino, –NH2 group; a hydrogen atom; and an R group that is unique for every amino acid. Classification of amino acids • There are 20 amino acids. Based on the nature of their ‘R’ group, they are classified based on their polarity as: Classification based on essentiality: Essential amino acids are the amino acids which you need through your diet because your body cannot make them. Whereas non essential amino acids are the amino acids which are not an essential part of your diet because they can be synthesized by your body. Essential Non essential Histidine Alanine Isoleucine Arginine Leucine Aspargine Methionine Aspartate Phenyl alanine Cystine Threonine Glutamic acid Tryptophan Glycine Valine Ornithine Proline Serine Tyrosine Peptide bonds • Amino acids are linked together by ‘amide groups’ called peptide bonds.
    [Show full text]
  • Assigning Folds to the Proteins Encoded by the Genome of Mycoplasma Genitalium (Protein Fold Recognition͞computer Analysis of Genome Sequences)
    Proc. Natl. Acad. Sci. USA Vol. 94, pp. 11929–11934, October 1997 Biophysics Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium (protein fold recognitionycomputer analysis of genome sequences) DANIEL FISCHER* AND DAVID EISENBERG University of California, Los Angeles–Department of Energy Laboratory of Structural Biology and Molecular Medicine, Molecular Biology Institute, University of California, Los Angeles, Box 951570, Los Angeles, CA 90095-1570 Contributed by David Eisenberg, August 8, 1997 ABSTRACT A crucial step in exploiting the information genitalium (MG) (10), as a test of the capabilities of our inherent in genome sequences is to assign to each protein automatic fold recognition server and as a case study to sequence its three-dimensional fold and biological function. identify the difficulties facing automated fold assignment. Here we describe fold assignment for the proteins encoded by the small genome of Mycoplasma genitalium. The assignment MATERIALS AND METHODS was carried out by our computer server (http:yywww.doe- mbi.ucla.eduypeopleyfrsvryfrsvr.html), which assigns folds to The MG Sequences. The 468 MG sequences were obtained amino acid sequences by comparing sequence-derived predic- from The Institute for Genome Research (TIGR) through its tions with known structures. Of the total of 468 protein ORFs, Web address: http:yywww.tigr.orgytdbymdbymgdbymgd- 103 (22%) can be assigned a known protein fold with high b.html. Three types of annotation (based on searches in the confidence, as cross-validated with tests on known structures. sequence database) accompany each TIGR sequence (10): (i) Of these sequences, 75 (16%) show enough sequence similarity functional assignment—a clear sequence similarity with a to proteins of known structure that they can also be detected protein of known function from another organism was found by traditional sequence–sequence comparison methods.
    [Show full text]
  • Practical Structure-Sequence Alignment of Pseudoknotted Rnas Wei Wang
    Practical structure-sequence alignment of pseudoknotted RNAs Wei Wang To cite this version: Wei Wang. Practical structure-sequence alignment of pseudoknotted RNAs. Bioinformatics [q- bio.QM]. Université Paris Saclay (COmUE), 2017. English. NNT : 2017SACLS563. tel-01697889 HAL Id: tel-01697889 https://tel.archives-ouvertes.fr/tel-01697889 Submitted on 31 Jan 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 1 NNT : 2017SACLS563 Thèse de doctorat de l’Université Paris-Saclay préparée à L’Université Paris-Sud Ecole doctorale n◦580 (STIC) Sciences et Technologies de l’Information et de la Communication Spécialité de doctorat : Informatique par M. Wei WANG Alignement pratique de structure-séquence d’ARN avec pseudonœuds Thèse présentée et soutenue à Orsay, le 18 Décembre 2017. Composition du Jury : Mme Hélène TOUZET Directrice de Recherche (Présidente) CNRS, Université Lille 1 M. Guillaume FERTIN Professeur (Rapporteur) Université de Nantes M. Jan GORODKIN Professeur (Rapporteur) University of Copenhagen Mme Johanne COHEN Directrice de Recherche (Examinatrice)
    [Show full text]
  • Introduction 1.1 Post-Translational Modifications
    Chapter I: Introduction 1.1 Post-translational modifications Post-translational modifications (PTMs) are responsible for dynamic regulation of protein structure and function (Jensen et al., 2006). One example of the function of post-translational modifications in the dynamic regulation of protein structure and function is signal transduction; post-translational modification of key regulatory proteins in signal transduction pathways controls cellular proliferation, metabolism, gene expression, cytoskeletal organization and apoptosis (Jensen et al., 2006). Post-translational modifications not only change the mass of the protein, they are also often responsible for changes in charge, structure and conformation. This leads to changes in the biological activity of proteins like enzymes, the binding affinity of proteins to receptors and other protein-protein interactions (Hoffmann et al., 2008) (Murray et al., 1998). For example, the interaction of the cytokine TGF-beta with its receptor complex results in the phosphorylation of SMAD family proteins which activates these proteins to regulate target gene expression (Massague et al., 2000). Post-translational modified amino acid residues can act as binding sites on proteins for a specific recognition domain on other proteins. For example, phosphorylated tyrosine residues can bind to SH2 or PTB domains (Pawson et al., 1997). Depending on the type of PTM they can be very abundant and have a large number of target proteins, whereas other PTM`s have only a few target proteins. Moreover, some PTM`s are connected with several different residues and not only with one residue. For example multi-site phosphorylation that primarily targets Serine, Threonine and Tyrosine residues is a crucial mechanism for the regulation of protein localization and functional activity (Cohen et al., 2000).
    [Show full text]
  • Peptides & Proteins
    Peptides & Proteins (thanks to Hans Börner) 1 Proteins & Peptides Proteuos: Proteus (Gr. mythological figure who could change form) proteuo: „"first, ref. the basic constituents of all living cells” peptos: „Cooked referring to digestion” Proteins essential for: Structure, metabolism & cell functions 2 Bacterium Proteins (15 weight%) 50% (without water) • Construction materials Collagen Spider silk proteins 3 Proteins Structural proteins structural role & mechanical support "cell skeleton“: complex network of protein filaments. muscle contraction results from action of large protein assemblies Myosin und Myogen. Other organic material (hair and bone) are also based on proteins. Collagen is found in all multi cellular animals, occurring in almost every tissue. • It is the most abundant vertebrate protein • approximately a quarter of mammalian protein is collagen 4 Proteins Storage Various ions, small molecules and other metabolites are stored by complexation with proteins • hemoglobin stores oxygen (free O2 in the blood would be toxic) • iron is stored by ferritin Transport Proteins are involved in the transportation of particles ranging from electrons to macromolecules. • Iron is transported by transferrin • Oxygen via hemoglobin. • Some proteins form pores in cellular membranes through which ions pass; the transport of proteins themselves across membranes also depends on other proteins. Beside: Regulation, enzymes, defense, functional properties 5 Our universal container system: Albumines 6 zones: IA, IB, IIA, IIB, IIIA, IIIB; IIB and
    [Show full text]
  • Structure and Synthesis of Antifungal Disulfide
    microorganisms Review Structure and Synthesis of Antifungal Disulfide β-Strand Proteins from Filamentous Fungi Györgyi Váradi 1,*, Gábor K. Tóth 1,2 and Gyula Batta 3 1 Department of Medical Chemistry, Faculty of Medicine, University of Szeged, H-6720 Szeged, Hungary; [email protected] 2 MTA-SZTE Biomimetic Systems Research Group, University of Szeged, H-6720 Szeged, Hungary 3 Department of Organic Chemistry, University of Debrecen, H-4032 Debrecen, Hungary; [email protected] * Correspondence: [email protected]; Tel.: +36-62-545-142; Fax: +36-62-545-971 Received: 20 November 2018; Accepted: 24 December 2018; Published: 27 December 2018 Abstract: The discovery and understanding of the mode of action of new antimicrobial agents is extremely urgent, since fungal infections cause 1.5 million deaths annually. Antifungal peptides and proteins represent a significant group of compounds that are able to kill pathogenic fungi. Based on phylogenetic analyses the ascomycetous, cysteine-rich antifungal proteins can be divided into three different groups: Penicillium chrysogenum antifungal protein (PAF), Neosartorya fischeri antifungal protein 2 (NFAP2) and “bubble-proteins” (BP) produced, for example, by P. brevicompactum. They all dominantly have β-strand secondary structures that are stabilized by several disulfide bonds. The PAF group (AFP antifungal protein from Aspergillus giganteus, PAF and PAFB from P. chrysogenum, Neosartorya fischeri antifungal protein (NFAP)) is the best characterized with their common β-barrel tertiary structure. These proteins and variants can efficiently be obtained either from fungi production or by recombinant expression. However, chemical synthesis may be a complementary aid for preparing unusual modifications, e.g., the incorporation of non-coded amino acids, fluorophores, or even unnatural disulfide bonds.
    [Show full text]
  • UC San Diego UC San Diego Electronic Theses and Dissertations
    UC San Diego UC San Diego Electronic Theses and Dissertations Title Analysis and applications of conserved sequence patterns in proteins Permalink https://escholarship.org/uc/item/5kh9z3fr Author Ie, Tze Way Eugene Publication Date 2007 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA, SAN DIEGO Analysis and Applications of Conserved Sequence Patterns in Proteins A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Computer Science by Tze Way Eugene Ie Committee in charge: Professor Yoav Freund, Chair Professor Sanjoy Dasgupta Professor Charles Elkan Professor Terry Gaasterland Professor Philip Papadopoulos Professor Pavel Pevzner 2007 Copyright Tze Way Eugene Ie, 2007 All rights reserved. The dissertation of Tze Way Eugene Ie is ap- proved, and it is acceptable in quality and form for publication on microfilm: Chair University of California, San Diego 2007 iii DEDICATION This dissertation is dedicated in memory of my beloved father, Ie It Sim (1951–1997). iv TABLE OF CONTENTS Signature Page . iii Dedication . iv Table of Contents . v List of Figures . viii List of Tables . x Acknowledgements . xi Vita, Publications, and Fields of Study . xiii Abstract of the Dissertation . xv 1 Introduction . 1 1.1 Protein Homology Search . 1 1.2 Sequence Comparison Methods . 2 1.3 Statistical Analysis of Protein Motifs . 4 1.4 Motif Finding using Random Projections . 5 1.5 Microbial Gene Finding without Assembly . 7 2 Multi-Class Protein Classification using Adaptive Codes . 9 2.1 Profile-based Protein Classifiers . 14 2.2 Embedding Base Classifiers in Code Space .
    [Show full text]
  • ABSTRACT DATA DRIVEN APPROACHES to IDENTIFY DETERMINANTS of HEART DISEASES and CANCER RESISTANCE Avinash Das Sahu, Doctor Of
    ABSTRACT Title of dissertation: DATA DRIVEN APPROACHES TO IDENTIFY DETERMINANTS OF HEART DISEASES AND CANCER RESISTANCE Avinash Das Sahu, Doctor of Philosophy, 2016 Dissertation directed by: Professor Sridhar Hannenhalli Department of Computer Science Cancer and cardio-vascular diseases are the leading causes of death world-wide. Caused by systemic genetic and molecular disruptions in cells, these disorders are the manifestation of profound disturbance of normal cellular homeostasis. People suffering or at high risk for these disorders need early diagnosis and personalized therapeutic intervention. Successful implementation of such clinical measures can significantly improve global health. However, development of effective therapies is hindered by the challenges in identifying genetic and molecular determinants of the onset of diseases; and in cases where therapies already exist, the main challenge is to identify molecular determinants that drive resistance to the therapies. Due to the progress in sequencing technologies, the access to a large genome-wide biolog- ical data is now extended far beyond few experimental labs to the global research community. The unprecedented availability of the data has revolutionized the ca- pabilities of computational researchers, enabling them to collaboratively address the long standing problems from many different perspectives. Likewise, this thesis tackles the two main public health related challenges using data driven approaches. Numerous association studies have been proposed to identify genomic variants that determine disease. However, their clinical utility remains limited due to their inability to distinguish causal variants from associated variants. In the presented thesis, we first propose a simple scheme that improves association studies in su- pervised fashion and has shown its applicability in identifying genomic regulatory variants associated with hypertension.
    [Show full text]
  • The Ubiquitin-Proteasome System
    Review The ubiquitin-proteasome system DIPANKAR NANDI*, PANKAJ TAHILIANI, ANUJITH KUMAR and DILIP CHANDU Department of Biochemistry, Indian Institute of Science, Bangalore 560 012, India *Corresponding author (Fax, 91-80-23600814; Email, [email protected]) The 2004 Nobel Prize in chemistry for the discovery of protein ubiquitination has led to the recognition of cellular proteolysis as a central area of research in biology. Eukaryotic proteins targeted for degradation by this pathway are first ‘tagged’ by multimers of a protein known as ubiquitin and are later proteolyzed by a giant enzyme known as the proteasome. This article recounts the key observations that led to the discovery of ubiquitin-proteasome system (UPS). In addition, different aspects of proteasome biology are highlighted. Finally, some key roles of the UPS in different areas of biology and the use of inhibitors of this pathway as possible drug targets are discussed. [Nandi D, Tahiliani P, Kumar A and Chandu D 2006 The ubiquitin-proteasome system; J. Biosci. 31 137–155] 1. Introduction biological processes, e.g. transcription, cell cycle, antigen processing, cellular defense, signalling etc. is now well In an incisive article, J Goldstein, the 1985 Nobel laureate established (Ciechanover and Iwai 2004; Varshavsky 2005). for the regulation of cholesterol metabolism (together with During the early days in the field of cytosolic protein M Brown) and Chair for the Jury for the Lasker awards, degradation, cell biologists were intrigued by the requirement laments the fact that it is hard to pick out truly original dis- of ATP in this process as it is well known that peptide bond coveries among the plethora of scientific publications hydrolysis does not require metabolic energy.
    [Show full text]
  • Proteolytic Cleavage—Mechanisms, Function
    Review Cite This: Chem. Rev. 2018, 118, 1137−1168 pubs.acs.org/CR Proteolytic CleavageMechanisms, Function, and “Omic” Approaches for a Near-Ubiquitous Posttranslational Modification Theo Klein,†,⊥ Ulrich Eckhard,†,§ Antoine Dufour,†,¶ Nestor Solis,† and Christopher M. Overall*,†,‡ † ‡ Life Sciences Institute, Department of Oral Biological and Medical Sciences, and Department of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada ABSTRACT: Proteases enzymatically hydrolyze peptide bonds in substrate proteins, resulting in a widespread, irreversible posttranslational modification of the protein’s structure and biological function. Often regarded as a mere degradative mechanism in destruction of proteins or turnover in maintaining physiological homeostasis, recent research in the field of degradomics has led to the recognition of two main yet unexpected concepts. First, that targeted, limited proteolytic cleavage events by a wide repertoire of proteases are pivotal regulators of most, if not all, physiological and pathological processes. Second, an unexpected in vivo abundance of stable cleaved proteins revealed pervasive, functionally relevant protein processing in normal and diseased tissuefrom 40 to 70% of proteins also occur in vivo as distinct stable proteoforms with undocumented N- or C- termini, meaning these proteoforms are stable functional cleavage products, most with unknown functional implications. In this Review, we discuss the structural biology aspects and mechanisms
    [Show full text]
  • Histone Methylation by Temozolomide; a Classic DNA Methylating Anticancer Drug
    ANTICANCER RESEARCH 36: 3289-3300 (2016) Histone Methylation by Temozolomide; A Classic DNA Methylating Anticancer Drug TIELI WANG1, AMANDA J. PICKARD2 and JAMES M. GALLO2 1Department of Chemistry and Biochemistry, California State University Dominguez Hills, Carson, CA, U.S.A.; 2Department of Pharmacology and Systems Therapeutics, Icahn School of Medicine at Mount Sinai, New York, NY, U.S.A. Abstract. Background/Aim: The alkylating agent, adults (1). Despite advances in multimodal therapies of temozolomide (TMZ), is considered the standard-of-care for surgical extirpation, radiotherapy and chemotherapy, GBMs high-grade astrocytomas –known as glioblastoma multiforme remain a cancer of poor prognosis that can be attributed to (GBM)– an aggressive type of tumor with poor prognosis. The the aggressive nature of the tumors and the development of therapeutic benefit of TMZ is attributed to formation of DNA resistance to radiation and chemotherapy (2). Out of adducts involving the methylation of purine bases in DNA. We chemotherapeutic approaches to GBMs, temozolomide investigated the effects of TMZ on arginine and lysine amino (TMZ) is the main alkylating agent used, based on its ability acids, histone H3 peptides and histone H3 proteins. Materials to increase the median survival (3-6). and Methods: Chemical modification of amino acids, histone TMZ possesses favorable pharmacokinetic characteristics H3 peptide and protein by TMZ was performed in phosphate with high oral bioavailability and blood-brain barrier (BBB) buffer at physiological pH.
    [Show full text]