Predicting Kinase Inhibitor Resistance: Physics-Based and Data-Driven Approaches Supporting Information

Total Page:16

File Type:pdf, Size:1020Kb

Predicting Kinase Inhibitor Resistance: Physics-Based and Data-Driven Approaches Supporting Information Predicting Kinase Inhibitor Resistance: Physics-Based and Data-Driven Approaches Supporting Information Matteo Aldeghi∗[a], Vytautas Gapsys[a] and Bert L. de Groot†[a] [a]Computational Biomolecular Dynamics Group, Max Planck Institute for Biophysical Chemistry, 37077 Göttingen, Germany This PDF file includes: • Methods • Figures S1-S11 • Tables S1-S2 Other supporting materials for this manuscript include the following: • Data S1: detailed information on the data set and numerical results for all calculations, provided as an Excel spreadsheet (XLSX). • Data S2: input files pertaining to the molecular-dynamics-based free energy calculations and Rosetta calculations, provided as a compressed archive file (ZIP, 1.6 MB). • Data S3: input files and Jupyter notebooks used for the training and testing of the machine learning model, provided as a compressed archive file (ZIP, 832 MB). This file can be downloaded from the following address: https://doi.org/10.5281/zenodo.3350897. ∗[email protected][email protected] S1 Contents Methods S3 Dataset . S3 System Setup . S3 Free Energy Calculations . S4 Rosetta Calculations . S4 Machine Learning . S5 Training Dataset . S5 Features and Feature Selection . S5 Data Analysis . S6 Figures S7 Figure S1 Thermodynamic cycle used in the MD-based calculations . S7 Figure S2 Accuracy of the ∆∆G estimates for all force fields . S8 Figure S3 Effect of halogen bond modelling on Charmm calculations . S9 Figure S4 Analysis of the consensus force field approach . S10 Figure S5 Results by net charge change . S11 Figure S6 Uncertainty of the MD-based calculations . S12 Figure S7 Analysis of the combined Rosetta+MD consensus approach . S13 Figure S8 Scatter plot of ∆∆G estimates obtained with the avg(A99,R15) consensus score S14 Figure S9 Scatter plot of ∆∆G estimates obtained with mCSM-Lig . S15 Figure S10 Precision-recall curves . S16 Figure S11 Scatter plot of ∆∆G estimates obtained with the max(A99,R15) consensus score S17 Tables S18 Table S1 Summary of all MD-based calculations and their performance . S18 Table S2 Binary classification performance . S19 References S20 S2 Methods Dataset A dataset containing 144 binding affinity changes (∆∆G) for eight TKIs due to point mutations in human Abl kinase is provided by Hauser et al.[1]. The authors also provide structure files of the Abl:TKI complexes. Six of these are structures resolved experimentally via X-ray crystallography (4WA9, 3UE4, 4XEY, 1OPJ, 3CS9, 3OXZ) and two were obtained via docking (referred to as DOK1 and DOK2). For calculations using Charmm force fields, glycine mutations were excluded, as it is currently not possible to interpolate between different grid-based energy correction maps (CMAPs) in Gromacs. Thus, the effective dataset size used in Charmm calculations was of 137 ∆∆G values rather than 144. The standard error associated with the ∆∆G values was taken to be 0.32 kcal/mol, based on the interlaboratory variability of the IC50 derived ∆∆G measurements.[1]. Details of the dataset can be found in Data S1. System Setup The structures of the Abl:TKI complexes were taken from Hauser et al.[1]. Apo structures were generated by discarding the ligand atoms. Crystallographic water molecules were retained. All mutant structures were generated using FoldX (v4)[2]. Protein protonation states were assigned at pH 7.4 with the protein preparation tool in HTMD (v1.12)[3], which uses PDB2PQR[4] and PROPKA v3.1[5, 6]. The protonation states of the ligands were kept as they were determined in Hauser et al.[1]. Proteins were modelled with the Amber99sb*-ILDN[7, 8, 9] (we abbreviate this as “A99”), Am- ber14sb[10] (A14), Charmm22*[11, 12, 13] (C22), and Charmm36[14] (C36) force fields. In addition, we also tested a modified version of Amber99sb*-ILDN in which the parameters for hydroxyl groups were adapted as described in Fennel et al.[15]; we refer to this force field as Amber99sb*-ILDN-DC (A99dc). The TIP3P water model[16] was used. Ligands were modelled with GAFF2 (v2.1)[17] via AmberTools 16 and CGenFF (v3.0.1)[18] via paramchem[19, 20]. In the GAFF2 models, restrained electrostatic potential (RESP)[21] charges were used. Geometry optimizations and molecular elec- trostatic potential calculations (ESP) were performed with Gaussian 09 (Rev. D.01), both at the HF/6-31G* level of theory. Only 3 optimization steps were carried out to keep ligands’ conformations close to the bound poses. ESP points were sampled according to the Merz-Kollman scheme[22, 23]. In addition, the σ-hole on halogen atoms was modelled as described by Kolář and Hobza[24]. All ligand parameters can be found in the input files in Data S2. The protein-ligand systems were solvated in a dodecahedral box with periodic boundary conditions and a minimum distance between the solute and the box of 12 Å. Sodium and chloride ions were added to neutralize the wild type systems at the concentration of 0.15 M. For the mutant systems, the same number of ions as in the wild type systems was added; i.e. the net charge of the wild type systems was always zero, while the net charge of the mutant systems was allowed to deviate from zero according to the mutation. Because FoldX does not consider the presence of ligands when mutating the protein, clashes with the ligands in the mutated complexes may occur. A clash was considered present if any protein heavy atom was within 1.5 Å of any ligand heavy atom. If one or more clashes were present, an approach similar to the one reported for alchembed[25] was used to resolve them: after 2,000 steepest descent steps, the ligand vdW interactions were switched on in 2,000 MD steps carried out with a 0.5 fs timestep, while using position restraints (1,000 kJ mol−1 nm−2) on all heavy atoms. S3 Free Energy Calculations All simulations were carried out in Gromacs 2016[26, 27] on cluster nodes equipped with an Intel Xeon processor of Ivy Bridge (4 cores, e.g. E3-1270 v2) or Broadwell (10 cores, e.g. E5-2630 v4) architecture and a Nvidia GeForce GPU 10 series (GTX 1070, GTX 1080, or GTX 1080 Ti). Compute times for a single ∆∆G estimate with a representative node architecture are shown in Table 1. 10,000 energy minimization steps were performed using a steepest descent algorithm. The systems were subsequently simulated for 100 ps in the isothermal-isobaric ensemble (NPT) with harmonic position restraints applied to all solute heavy atoms with a force constant of 1,000 kJ mol−1 nm−2. Equations of motion were integrated with a leap-frog integrator and a time-step of 2 fs. The tempera- ture was coupled with the stochastic v-rescale thermostat of Bussi et al.[28] at the target temperature of 300 K. The pressure was controlled with the Berendsen weak coupling algorithm[29] at a target pressure of 1 bar. The particle mesh Ewald (PME) algorithm[30] was used for electrostatic interac- tions with a real space cut-off of 10 Å when using Amber force fields and 12 Å when using Charmm force fields, a spline order of 4, a relative tolerance of 10−5 and a Fourier spacing of 1.2 Å. The Verlet cut-off scheme[31] with the potential-shift modifier was used with a Lennard-Jones interaction cut-off of 10 Å with Amber and 12 Å with Charmm force fields, and a buffer tolerance of 0.005 kJ mol−1 ps−1. All bonds were constrained with the P-LINCS algorithm[32]. For equilibration, 1 ns unrestrained MD simulations were then performed in the NPT ensemble with the Parrinello-Rahman pressure coupling algorithm[33] at 1 bar with a time constant of 2 ps. Production simulations were then performed for 3 ns. For the more expensive A99` protocol, simulations of 5 ns were used. For each free energy calculation, the above procedure for equilibrium simulations (from system setup to minimization, equilibration, and production MD) was repeated ten times on both the apo and complex states, of both wild-type and mutant Abl kinase, for each ∆∆G estimate. From each of these ten equilibrium simulations, 30 equally spaced frames were extracted as the starting configurations for the non-equilibrium part of the calculations, for a total of 300 non-equilibrium trajectories (in both directions, wild-type to mutant and mutant to wild-type) for each mutation. For A99`, ten repeated equilibrium simulations were used for charge-conserving mutations, and twenty for charge-changing mutations; from these, a total of 400 frames for charge-conserving mutations, and 800 frames for charge-changing mutations, were extracted. The non-interacting (“dummy”) atoms needed to morph the wild-type residues into mutant ones were introduced at this stage with the pmx package[34], using the mutant structure proposed by FoldX as a template. The positions of the dummy atoms were minimized while freezing the rest of the system. These systems containing hybrid residues were then simulated for 10 ps to equilibrate velocities. Amino acid side chains were finally alchemically morphed at constant speed during non-equilibrium simulations of 80 ps in length (100 ps were used for A99`). The work values associated with each non-equilibrium transition were extracted using thermodynamic integration (TI)[35] and then used to estimate the free energy differences with the Bennett’s Acceptance Ratio (BAR)[36, 37, 38]. apo holo Point estimates of the free energy differences (Figure S1: ∆GWT →MT and ∆GWT →MT ) were cal- culated with BAR after pooling all available forward and reverse work values coming from the non- apo equilibrium trajectories spawned from all equilibrium simulation repeats. Uncertainties in ∆GWT →MT holo and ∆GWT →MT were estimated as standard errors (σ∆G) by separately considering each equilibrium simulation and its related non-equilibrium trajectories as independent calculations.
Recommended publications
  • Foldx As Protein Engineering Tool: Better Than Random Based Approaches?
    Computational and Structural Biotechnology Journal 16 (2018) 25–33 Contents lists available at ScienceDirect journal homepage: www.elsevier.com/locate/csbj FoldX as Protein Engineering Tool: Better Than Random Based Approaches? Oliver Buß ⁎,JensRudat,KatrinOchsenreither Institute of Process Engineering in Life Sciences, Section II: Technical Biology, Karlsruhe Institute of Technology, Karlsruhe, Germany article info abstract Article history: Improving protein stability is an important goal for basic research as well as for clinical and industrial applications Received 31 October 2017 but no commonly accepted and widely used strategy for efficient engineering is known. Beside random ap- Received in revised form 21 December 2017 proaches like error prone PCR or physical techniques to stabilize proteins, e.g. by immobilization, in silico ap- Accepted 20 January 2018 proaches are gaining more attention to apply target-oriented mutagenesis. In this review different algorithms Available online 03 February 2018 for the prediction of beneficial mutation sites to enhance protein stability are summarized and the advantages Keywords: and disadvantages of FoldX are highlighted. The question whether the prediction of mutation sites by the algo- FoldX rithm FoldX is more accurate than random based approaches is addressed. Fold-X © 2018 Buß et al. Published by Elsevier B.V. on behalf of the Research Network of Computational and Structural Thermostability Biotechnology. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). Protein stabilization Protein engineering Enzyme engineering 1. Introduction per unit of product [10–12][13]. For industrial enzymes improved sta- bility against heat, solvents and other relevant process parameters, Increasing protein stability is a desirable goal for different life science e.g.
    [Show full text]
  • Foldx Accurate Structural Protein–DNA Binding
    3852–3863 Nucleic Acids Research, 2018, Vol. 46, No. 8 Published online 28 March 2018 doi: 10.1093/nar/gky228 FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) Downloaded from https://academic.oup.com/nar/article-abstract/46/8/3852/4955761 by Biblioteca de la Universitat Pompeu Fabra user on 25 November 2019 Javier Delgado Blanco1,†, Leandro Radusky1,†,Hector´ Climente-Gonzalez´ 1 and Luis Serrano1,2,3,* 1Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain, 2Universitat Pompeu Fabra (UPF), Barcelona, Spain and 3Institucio´ Catalana de Recerca i Estudis Avanc¸ats (ICREA), Pg. Lluis Companys 23, 08010 Barcelona, Spain Received January 18, 2018; Revised March 12, 2018; Editorial Decision March 13, 2018; Accepted March 20, 2018 ABSTRACT are involved in numerous processes including DNA repli- cation, DNA repair, gene regulation, recombination, DNA The speed at which new genomes are being se- packing, etc. Currently, although there are >120 000 struc- quenced highlights the need for genome-wide meth- tures deposited in the PDB (3), only ∼5000 of them in- ods capable of predicting protein–DNA interactions. volve DNA–protein complexes. When considering the rate Here, we present PADA1, a generic algorithm that ac- at which new genomes are being sequenced, the resolu- curately models structural complexes and predicts tion of novel DNA–protein structures is relatively scarce. the DNA-binding regions of resolved protein struc- As such, we need to develop methods not only capable of tures. PADA1 relies on a library of protein and double- predicting whether a protein can interact with DNA, but stranded DNA fragment pairs obtained from a train- also capable of determining the protein’s 3D binding re- ing set of 2103 DNA–protein complexes.
    [Show full text]
  • Expanding Molecular Modeling and Design Tools to Nonnatural Sidechains
    WWW.C-CHEM.ORG FULL PAPER Expanding Molecular Modeling and Design Tools to Non-Natural Sidechains David Gfeller,[a] Olivier Michielin,*[a,b,c] and Vincent Zoete*[a] Protein–protein interactions encode the wiring diagram of increase peptide ligand binding affinity. Our results obtained on cellular signaling pathways and their deregulations underlie a non-natural mutants of a BCL9 peptide targeting beta-catenin variety of diseases, such as cancer. Inhibiting protein–protein show very good correlation between predicted and interactions with peptide derivatives is a promising way to experimental binding free-energies, indicating that such develop new biological and therapeutic tools. Here, we develop predictions can be used to design new inhibitors. Data a general framework to computationally handle hundreds of generated in this work, as well as PyMOL and UCSF Chimera non-natural amino acid sidechains and predict the effect of plug-ins for user-friendly visualization of non-natural sidechains, inserting them into peptides or proteins. We first generate all are all available at http://www.swisssidechain.ch. Our results structural files (pdb and mol2), as well as parameters and enable researchers to rapidly and efficiently work with hundreds topologies for standard molecular mechanics software of non-natural sidechains. VC 2012 Wiley Periodicals, Inc. (CHARMM and Gromacs). Accurate predictions of rotamer probabilities are provided using a novel combined knowledge and physics based strategy. Non-natural sidechains are useful to DOI: 10.1002/jcc.22982 Introduction are currently expanding peptide screening technologies to include non-natural sidechains. Correct interpretation of these Protein–protein interactions are fundamental to most biologi- results at the atomic level requires structural and biochemical cal and biochemical processes.
    [Show full text]
  • Understanding Protein-RNA Interactions for Alternative Splicing
    Understanding Protein-RNA Interactions for Alternative Splicing During Gene Expression Using Molecular Dynamic Simulations Joseph Daou Department of Chemistry & Chemical Engineering/B.S. Chemistry Dequan Xiao Ph.D. Abstract The alternative splicing during gene expression is regulated by a type of protein called splicing factors. Recently, a splicing factor, SRSF2 protein was formed to recognize guanines and cytosines at the binding site of the RNA segment. A series of mutations were performed in experiments on SRSF2 to understand the binding interactions of the protein and RNA. However, the changes of binding interactions at the atomistic level due to the mutations have not been revealed. Here, we used molecular dynamics simulations to investigate the conformational changes and protein-RNA binding free energy changes due to a series of mutations. Through the extrapolation of intermediate states of molecular interactions during a fixed length of time throughout point mutations, we used free energy perturbation methods to compute the binding affinity between the mutated proteins and RNA segments. In addition, we also computed the binding free energy by comparing the free energy difference between the folded and unfolded states of the proteins. Our computed free energy changes due to mutations showed consistency with experimental findings. In addition, by molecular dynamic simulations for the solvated protein-RNA complexes, we revealed here the detailed interactions between the protein residues and the RNA for different mutants. Our results provide new insights on understanding the interaction between the splicing factor proteins and the RNA, which can be linked to understanding the pathological mechanisms of Leukemia and cancers. Introduction the option to solvate the RRM system for an in-vitro model Biophysical researchers have shown, comparable the environment of the protein-RNA complex.
    [Show full text]
  • Enabling Biomolecular Analysis and Engineering Along Structural Ensembles
    bioRxiv preprint doi: https://doi.org/10.1101/2021.08.16.456210; this version posted August 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. pyFoldX: enabling biomolecular analysis and engineering along structural ensembles Leandro G. Radusky1 & Luis Serrano1,2,3 1 Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Barcelona 08003, Spain 2 Universitat Pompeu Fabra (UPF), Barcelona, Spain 3 ICREA, Pg. Lluis Companys 23, Barcelona 08010, Spain Abstract Recent years have seen an increase in the number of structures available, not only for new proteins but also for the same protein crystallized with different molecules and proteins. While protein design software have proven to be successful in designing and modifying proteins, they can also be overly sensitive to small conformational differences between structures of the same protein. To cope with this, we introduce here pyFoldX, a python library that allows the integrative analysis of structures of the same protein using FoldX, an established forcefield and modeling software. The library offers new functionalities for handling different structures of the same protein, an improved molecular parametrization module, and an easy integration with the data analysis ecosystem of the python programming language. Availability and implementation pyFoldX is an open-source library that uses the FoldX software for energy calculations and modelling. The latter can be downloaded upon registration in http://foldxsuite.crg.eu/ and is free of charge for academics.
    [Show full text]
  • DIP/Dpr Interactions and the Evolutionary Design of Specificity In
    ARTICLE https://doi.org/10.1038/s41467-020-15981-8 OPEN DIP/Dpr interactions and the evolutionary design of specificity in protein families Alina P. Sergeeva 1, Phinikoula S. Katsamba2, Filip Cosmanescu2, Joshua J. Brewer3, Goran Ahlsen2, ✉ ✉ Seetha Mannepalli2, Lawrence Shapiro2,3 & Barry Honig1,2,3,4 Differential binding affinities among closely related protein family members underlie many biological phenomena, including cell-cell recognition. Drosophila DIP and Dpr proteins med- 1234567890():,; iate neuronal targeting in the fly through highly specific protein-protein interactions. We show here that DIPs/Dprs segregate into seven specificity subgroups defined by binding preferences between their DIP and Dpr members. We then describe a sequence-, structure- and energy-based computational approach, combined with experimental binding affinity measurements, to reveal how specificity is coded on the canonical DIP/Dpr interface. We show that binding specificity of DIP/Dpr subgroups is controlled by “negative constraints”, which interfere with binding. To achieve specificity, each subgroup utilizes a different com- bination of negative constraints, which are broadly distributed and cover the majority of the protein-protein interface. We discuss the structural origins of negative constraints, and potential general implications for the evolutionary origins of binding specificity in multi- protein families. 1 Department of Systems Biology, Columbia University, New York, NY, USA. 2 Zuckerman Mind, Brain and Behavior Institute, Columbia University, New York,
    [Show full text]
  • Foldx Accurate Structural Protein–DNA Binding Prediction
    Nucleic Acids Research, 2018 1 doi: 10.1093/nar/gky228 FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) Javier Delgado Blanco1,†, Leandro Radusky1,†,Hector´ Climente-Gonzalez´ 1 and Luis Serrano1,2,3,* 1Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain, 2Universitat Pompeu Fabra (UPF), Barcelona, Spain and 3Institucio´ Catalana de Recerca i Estudis Avanc¸ats (ICREA), Pg. Lluis Companys 23, 08010 Barcelona, Spain Received January 18, 2018; Revised March 12, 2018; Editorial Decision March 13, 2018; Accepted March 20, 2018 ABSTRACT are involved in numerous processes including DNA repli- cation, DNA repair, gene regulation, recombination, DNA The speed at which new genomes are being se- packing, etc. Currently, although there are >120 000 struc- quenced highlights the need for genome-wide meth- tures deposited in the PDB (3), only ∼5000 of them in- ods capable of predicting protein–DNA interactions. volve DNA–protein complexes. When considering the rate Here, we present PADA1, a generic algorithm that ac- at which new genomes are being sequenced, the resolu- curately models structural complexes and predicts tion of novel DNA–protein structures is relatively scarce. the DNA-binding regions of resolved protein struc- As such, we need to develop methods not only capable of tures. PADA1 relies on a library of protein and double- predicting whether a protein can interact with DNA, but stranded DNA fragment pairs obtained from a train- also capable of determining the protein’s 3D binding re- ing set of 2103 DNA–protein complexes.
    [Show full text]
  • Mutational Effects on Stability Are Largely Conserved During Protein Evolution
    Mutational effects on stability are largely conserved during protein evolution Orr Ashenberg1, L. Ian Gong1, and Jesse D. Bloom2 Division of Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 Edited by Andrew J. Roger, Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, Canada, and accepted bythe Editorial Board November 15, 2013 (received for review August 5, 2013) Protein stability and folding are the result of cooperative inter- approaches used to model protein evolution? The standard way actions among many residues, yet phylogenetic approaches to address this question has been to simulate or analyze protein assume that sites are independent. This discrepancy has engen- sequences, using some computational force field that predicts the dered concerns about large evolutionary shifts in mutational stability of different variants (2–5). Unfortunately, there is no effects that might confound phylogenetic approaches. Here we good a priori reason to believe that the force fields themselves experimentally investigate this issue by introducing the same authentically represent interactions among sites. Even state-of- fl mutations into a set of diverged homologs of the in uenza the-art force fields are at best modestly accurate at predicting fi nucleoprotein and measuring the effects on stability. We nd that mutational effects (6, 7). Furthermore, all tractable force fields mutational effects on stability are largely conserved across the approximate the true multibody quantum–mechanical interactions homologs. We reach qualitatively similar conclusions when we – simulate protein evolution with molecular-mechanics force fields. among protein sites in terms of pairwise interactions (8 10). Our results do not mean that proteins evolve without epistasis, Here we experimentally assess the extent to which mutational which can still arise even when mutational stability effects are effects on stability change during protein evolution.
    [Show full text]
  • Targeting Tmprss2, S-Protein:Ace2, and 3Clpro for Synergetic Inhibitory Engagement Mathew Coban1, Juliet Morrison Phd2, William
    Targeting Tmprss2, S-protein:Ace2, and 3CLpro for Synergetic Inhibitory Engagement Mathew Coban1, Juliet Morrison PhD2, William D. Freeman MD3, Evette Radisky PhD1, Karine G. Le Roch PhD4, Thomas R. Caulfield, PhD1,5-8 Affiliations 1 Department of Cancer Biology, Mayo Clinic, 4500 San Pablo Road South, Jacksonville, FL, 32224 USA 2 Department of Microbiology and Plant Pathology, University of California, Riverside, 900 University, Riverside, CA, 92521 USA 3 Department of Neurology, Mayo Clinic, 4500 San Pablo South, Jacksonville, FL, 32224 USA 4Department of Molecular, Cell and Systems Biology, University of California, Riverside, 900 University, Riverside, CA, 92521 USA 5 Department of Neuroscience, Mayo Clinic, 4500 San Pablo South, Jacksonville, FL, 32224 USA 6 Department of Neurosurgery, Mayo Clinic, 4500 San Pablo South, Jacksonville, FL, 32224 USA 7 Department of Health Science Research (BSI), Mayo Clinic, 4500 San Pablo South, Jacksonville, FL, 32224 USA 8Department of Clinical Genomics (Enterprise), Mayo Clinic, Rochester, MN 55905, USA Correspondence to: Thomas R. Caulfield, PhD, Dept of Neuroscience, Cancer Biology, Neurosurgery, Health Science Research, & Clinical Genomics Mayo Clinic, 4500 San Pablo Road South Jacksonville, FL 32224 Telephone: +1 904-953-6072, E-mail: [email protected] SUMMARY Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a devastating respiratory and inflammatory illness caused by a new coronavirus that is rapidly spreading throughout the human population. Over the past 6 months, SARS-CoV-2, the virus responsible for COVID-19, has already infected over 11.6 million (25% located in United States) and killed more than 540K people around the world. As we face one of the most challenging times in our recent history, there is an urgent need to identify drug candidates that can attack SARS-CoV-2 on multiple fronts.
    [Show full text]
  • Proceedings Journal 2020 "Ideas Shape the Course of History." J
    VECTOR The Unofficial iGEM Proceedings Journal 2020 "Ideas shape the course of history." J. M. Keynes History of iGEM and Some Exciting Projects of the Past. Team MIT_MAHE Pages 2-4 SARS-CoV-2 Novel Diagnostics Interview COVID-19: A Creation of a Novel, The Importance of Current Review Diagnostic Method vaccines On Pathology, for Endometriosis Prof. Dr. Kremsner Progression, and Using Menstrual Evaluates the Current Intervention Effluent. Pandemic Situation. Team MIT Team Rochester Team Tübingen Pages 33-35 Pages 125-128 Pages 139-142 October 2020, Vol 1., created by MSP-Maastricht, www.igem-maastricht.nl Editors in Chief Larissa Markus is a 3rd year bachelor student at the Maastricht Science Programme and the Head of management of team MSP-Maastricht. Her organisation skills, goal-oriented work and motivated attitude are what made this Journal possible. Contact: [email protected] university.nl Juliette Passariello-Jansen is a 3rd year bachelor student at the Maastricht Science Programme and one of the Team leaders of team MSP-Maastricht. Her incredible dedication, patience and eye for detail are what made this Journal possible. Contact: [email protected] Not a normal iGEM year….. iGEM can be challenging. Even in a normal year. You need to come up with a great project, do your research, navigate obstacles, work as a team in- and outside of the lab and deal with all the challenges along the way. The wrong bands on the gel, the trouble to get actual funding money instead of five packs of free polymerase, hours upon hours upon hours of work, not just in the lab, but also in meetings and at home, and in the end, the last bit of sleep-deprived cramming to fit everything you did into a great wiki.
    [Show full text]
  • Mutatex: an Automated Pipeline for In-Silico Saturation Mutagenesis of Protein Structures and Structural Ensembles
    bioRxiv preprint doi: https://doi.org/10.1101/824938; this version posted November 3, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. MutateX: an automated pipeline for in-silico saturation mutagenesis of protein structures and structural ensembles Matteo Tiberti1*, Thilde Terkelsen1, Tycho Canter Cremers1, Miriam Di Marco1, Isabelle da Piedade1, Emiliano Maiani1, Elena Papaleo1,2* 1 Computational Biology Laboratory, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark 2 Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark. [email protected] To whom correspondence should be addressed. Elena Papaleo, Email: [email protected], Matteo Tiberti: [email protected] Keywords Free energy; mutations; post-translational modifications; structural ensembles; binding; stability Abstract Mutations resulting in amino acid substitution influence the stability of proteins along with their binding to other biomolecules. A molecular understanding of the effects induced by protein mutations are both of biotechnological and medical relevance. The availability of empirical free energy functions that quickly estimate the free energy change upon mutation (ΔΔG) can be exploited for systematic screenings of proteins and protein complexes. Indeed, in silico saturation mutagenesis can guide the design of new experiments or rationalize the consequences of already-known mutations at the atomic level. Often software such as FoldX, while fast and reliable, lack the necessary automation features to make them useful in high-throughput scenarios. Here we introduce MutateX, a software which aims to automate the prediction of ΔΔGs associated with the systematic mutation of each available residue within a protein or protein complex to all other possible residue types, by employing the FoldX energy function.
    [Show full text]
  • Identification of Pathogenic Missense Mutations Using Protein Stability
    www.nature.com/scientificreports OPEN Identifcation of pathogenic missense mutations using protein stability predictors Lukas Gerasimavicius, Xin Liu & Joseph A. Marsh* Attempts at using protein structures to identify disease-causing mutations have been dominated by the idea that most pathogenic mutations are disruptive at a structural level. Therefore, computational stability predictors, which assess whether a mutation is likely to be stabilising or destabilising to protein structure, have been commonly used when evaluating new candidate disease variants, despite not having been developed specifcally for this purpose. We therefore tested 13 diferent stability predictors for their ability to discriminate between pathogenic and putatively benign missense variants. We fnd that one method, FoldX, signifcantly outperforms all other predictors in the identifcation of disease variants. Moreover, we demonstrate that employing predicted absolute energy change scores improves performance of nearly all predictors in distinguishing pathogenic from benign variants. Importantly, however, we observe that the utility of computational stability predictors is highly heterogeneous across diferent proteins, and that they are all inferior to the best performing variant efect predictors for identifying pathogenic mutations. We suggest that this is largely due to alternate molecular mechanisms other than protein destabilisation underlying many pathogenic mutations. Thus, better ways of incorporating protein structural information and molecular mechanisms into computational variant efect predictors will be required for improved disease variant prioritisation. Advances in next generation sequencing technologies have revolutionised research of genetic variation, increas- ing our ability to explore the basis of human disorders and enabling huge databases covering both pathogenic and putatively benign variants 1,2. Novel sequencing methodologies allow the rapid identifcation of variation in the clinic and are helping facilitate a paradigm shif towards precision medicine 3,4.
    [Show full text]