Assessing the Accuracy of Octanol–Water Partition Coefficient

Total Page:16

File Type:pdf, Size:1020Kb

Assessing the Accuracy of Octanol–Water Partition Coefficient Journal of Computer-Aided Molecular Design (2020) 34:335–370 https://doi.org/10.1007/s10822-020-00295-0 Assessing the accuracy of octanol–water partition coefcient predictions in the SAMPL6 Part II log P Challenge Mehtap Işık1,2 · Teresa Danielle Bergazin3 · Thomas Fox5 · Andrea Rizzi1,6 · John D. Chodera1 · David L. Mobley3,4 Received: 7 November 2019 / Accepted: 24 January 2020 / Published online: 27 February 2020 © Springer Nature Switzerland AG 2020 Abstract The SAMPL Challenges aim to focus the biomolecular and physical modeling community on issues that limit the accuracy of predictive modeling of protein-ligand binding for rational drug design. In the SAMPL5 log D Challenge, designed to benchmark the accuracy of methods for predicting drug-like small molecule transfer free energies from aqueous to nonpolar phases, participants found it difcult to make accurate predictions due to the complexity of protonation state issues. In the SAMPL6 log P Challenge, we asked participants to make blind predictions of the octanol–water partition coefcients of neutral species of 11 compounds and assessed how well these methods performed absent the complication of protonation state K K efects. This challenge builds on the SAMPL6 p a Challenge, which asked participants to predict p a values of a superset of the compounds considered in this log P challenge. Blind prediction sets of 91 prediction methods were collected from 27 research groups, spanning a variety of quantum mechanics (QM) or molecular mechanics (MM)-based physical meth- ods, knowledge-based empirical methods, and mixed approaches. There was a 50% increase in the number of participating groups and a 20% increase in the number of submissions compared to the SAMPL5 log D Challenge. Overall, the accuracy of octanol–water log P predictions in SAMPL6 Challenge was higher than cyclohexane–water log D predictions in SAMPL5, likely because modeling only the neutral species was necessary for log P and several categories of method benefted from the vast amounts of experimental octanol–water log P data. There were many highly accurate methods: 10 diverse methods achieved RMSE less than 0.5 log P units. These included QM-based methods, empirical methods, and mixed methods with physical modeling supported with empirical corrections. A comparison of physical modeling methods showed that QM-based methods outperformed MM-based methods. The average RMSE of the most accurate fve MM-based, QM-based, empirical, and mixed approach methods based on RMSE were 0.92 ± 0.13, 0.48 ± 0.06, 0.47 ± 0.05, and 0.50 ± 0.06, respectively. Keywords Octanol–water partition coefcient · log P · Blind prediction challenge · SAMPL · Free energy calculations · Solvation modeling Abbreviations log D log10 of organic solvent-water distribution coef- SAMPL Statistical Assessment of the Modeling of Pro- fcient ( Dow) teins and Ligands pKa − log10 of the acid dissociation equilibrium log P log10 of the organic solvent-water partition coef- constant fcient ( Kow ) of neutral species SEM Standard error of the mean RMSE Root mean squared error MAE Mean absolute error Mehtap Işık and Teresa Danielle Bergazin have contributed Kendall’s rank correlation coefcient (Tau) 2 equally to this work. R2 Coefcient of determination (R ) Electronic supplementary material The online version of this QM Quantum mechanics article (https ://doi.org/10.1007/s1082 2-020-00295 -0) contains MM Molecular mechanics supplementary material, which is available to authorized users. * Mehtap Işık [email protected] Extended author information available on the last page of the article Vol.:(0123456789)1 3 336 Journal of Computer-Aided Molecular Design (2020) 34:335–370 Introduction The log P challenge examines how well we model transfer free energy of molecules between diferent solvent environ- The development of computational biomolecular modeling ments in the absence of any complications coming from methodologies is motivated by the goal of enabling quanti- predicting protonation states and p Ka values. Assessing tative molecular design, prediction of properties and biomo- log P prediction accuracy also allows evaluating methods lecular interactions and achieving a detailed understanding for modeling protein-ligand afnities in terms of how well of mechanisms (chemical and biological) via computational they capture solvation efects. predictions. While many approaches are available for making such predictions, methods often sufer from poor or unpredict- History and motivation of the SAMPL challenges able performance, ultimately limiting their predictive power. It is often difcult to know which method would give the most The SAMPL blind challenges aim to focus the feld of quan- accurate predictions for a target system without extensive titative biomolecular modeling on major issues that limit the evaluation of methods. However, such extensive compara- accuracy of protein-ligand binding prediction. Companion tive evaluations are infrequent and difcult to perform, partly exercises such as the Drug Design Data Resource (D3R) because no single group has expertise in or access to all rel- blind challenges aim to assess the current accuracy of bio- evant methods and also because of the scarcity of blind experi- molecular modeling methods in predicting bound ligand mental data sets that would allow prospective evaluations. In poses and afnities on real drug discovery project data. D3R addition, many publications which report method comparisons blind challenges serve as an accurate barometer for accuracy. for a target system constructs these studies with the intention However, due to the confation of multiple accuracy-limit- of highlighting the success of a method being developed. ing problems in these complex test systems, it is difcult The Statistical Assessment of the Modeling of Proteins and to derive clear insights into how to make further progress Ligands (SAMPL) Challenges [http://sampl chall enges .githu towards better accuracy. b.io] provide a forum to test and compare methods with the Instead, SAMPL seeks to isolate and focus attention on following goals: individual accuracy-limiting issues. We aim to feld blind challenges just at the limit of tractability in order to identify 1. Determine prospective predictive power rather than underlying sources of error and help overcome these chal- accuracy in retrospective tests. lenges. Working on similar model systems or the same target 2. Allow a head to head comparison of a wide variety of with new blinded datasets in multiple iterations of prediction methods on the same data. challenges maximize our ability to learn from successes and failures. Often, these challenges focus on physical properties Regular SAMPL challenges focus attention on modeling of high relevance to drug discovery in their own right, such areas that need improvement, and sometimes revisit key test as partition or distribution coefcients critical to the devel- systems, providing a crowdsourcing mechanism to drive pro- opment of potent, selective, and bioavailable compounds gress. Systems are carefully selected to create challenges of (Fig. 1). gradually increasing complexity spanning between predic- tion objectives that are tractable and that are understood to be slightly beyond the capabilities of contemporary methods. So SAMPL5 logD Challenge far, most frequent SAMPL challenges have been on solvation and binding systems. Iterated blind prediction challenges have + played a key role in driving innovations in the prediction of cyclohexane water physical properties and binding. Here we report on a SAMPL6 SAMPL6 pKa Challenge SAMPL6 logP Challenge log P Challenge on octanol-water partition coefcients, treat- + ing molecules resembling fragments of kinase inhibitors. This K is a follow-on to the earlier SAMPL6 p a Challenge which octanol included the same compounds. water water + The partition coefcient describes the equilibrium concen- tration ratio of the neutral state of a substance between two phases: Fig. 1 The desire to deconvolute the distinct sources of error contrib- [unionized solute]octanol uting to the large errors observed in the SAMPL5 log D Challenge log P = log10 Kow = log10 K [unionized solute] (1) motivated the separation of p a and log P challenges in SAMPL6. water K The SAMPL6 p a and log P Challenges aim to evaluate protonation state predictions of small molecules in water and transfer free energy predictions between two solvents, isolating these prediction problems 1 3 Journal of Computer-Aided Molecular Design (2020) 34:335–370 337 The partition coefficient (log P) and the distribution accuracy such as a heterogeneous environment with poten- coefcient (log D) are driven by the free energy of transfer tially micelle-like bubbles [3–6], resulting in relatively slow from an aqueous to a nonpolar phase. Transfer free energy solute transitions between environments [4, 7]. The precise of only neutral species is considered for log P, whereas both water content of wet octanol is unknown, as it is afected by neutral and ionized species contribute to log D. Such sol- environmental conditions such as temperature as well as the ute partitioning models are a simple proxy for the transfer presence of solutes, the organic molecule of interest, and salts free energy of a drug-like molecule to a relatively hydro- (added to control pH and ionic strength). Inverse micelles tran- phobic receptor binding pocket, in the absence of specifc siently formed in wet octanol create spatial heterogeneity and interactions. Protein-ligand
Recommended publications
  • Evaluating Small Molecule Microscopic and Macroscopic Pka
    bioRxiv preprint doi: https://doi.org/10.1101/2020.10.15.341792; this version posted October 15, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 Overview of the SAMPL6 pK a Challenge: 2 Evaluating small molecule microscopic and 3 macroscopic pK a predictions 4 Mehtap Işık (ORCID: 0000-0002-6789-952X)1,2*, Ariën S. Rustenburg (ORCID: 0000-0002-3422-0613)1,3, Andrea 5 Rizzi (ORCID: 0000-0001-7693-2013)1,4, M. R. Gunner (ORCID: 0000-0003-1120-5776)6, David L. Mobley (ORCID: 6 0000-0002-1083-5533)5, John D. Chodera (ORCID: 0000-0003-0542-119X)1 7 1Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, 8 New York, NY 10065, United States; 2Tri-Institutional PhD Program in Chemical Biology, Weill Cornell Graduate 9 School of Medical Sciences, Cornell University, New York, NY 10065, United States; 3Graduate Program in 10 Physiology, Biophysics, and Systems Biology, Weill Cornell Medical College, New York, NY 10065, United States; 11 4Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Graduate School of Medical 12 Sciences, Cornell University, New York, NY 10065, United States; 5Department of Pharmaceutical Sciences and 13 Department of Chemistry, University of California, Irvine, Irvine, California 92697, United States; 6Department of 14 Physics, City College of New York, New York NY 10031 15 *For correspondence: 16 [email protected] (MI) 17 18 Abstract 19 K The prediction of acid dissociation constants (p a) is a prerequisite for predicting many other properties of a small molecule, 20 such as its protein-ligand binding affinity, distribution coefficient (log D), membrane permeability, and solubility.
    [Show full text]
  • Accurate Receptor-Ligand Binding Free Energies from Fast QM Conformational Chemical Space Sampling
    International Journal of Molecular Sciences Article Accurate Receptor-Ligand Binding Free Energies from Fast QM Conformational Chemical Space Sampling Esra Boz and Matthias Stein * Max Planck Institute for Dynamics of Complex Technical Systems, Molecular Simulations and Design Group, 39106 Magdeburg, Germany; [email protected] * Correspondence: [email protected] Abstract: Small molecule receptor-binding is dominated by weak, non-covalent interactions such as van-der-Waals hydrogen bonding or electrostatics. Calculating these non-covalent ligand-receptor interactions is a challenge to computational means in terms of accuracy and efficacy since the ligand may bind in a number of thermally accessible conformations. The conformational rotamer ensemble sampling tool (CREST) uses an iterative scheme to efficiently sample the conformational space and calculates energies using the semi-empirical ‘Geometry, Frequency, Noncovalent, eXtended Tight Binding’ (GFN2-xTB) method. This combined approach is applied to blind predictions of the modes and free energies of binding for a set of 10 drug molecule ligands to the cucurbit[n]urils CB[8] receptor from the recent ‘Statistical Assessment of the Modeling of Proteins and Ligands’ (SAMPL) challenge including morphine, hydromorphine, cocaine, fentanyl, and ketamine. For each system, the conformational space was sufficiently sampled for the free ligand and the ligand-receptor complexes using the quantum chemical Hamiltonian. A multitude of structures makes up the final conformer-rotamer ensemble, for which then free energies of binding are calculated. For those large and complex molecules, the results are in good agreement with experimental values with a mean Citation: Boz, E.; Stein, M. Accurate error of 3 kcal/mol.
    [Show full text]
  • 20180921 Michel SAMPL6-Ms-Revised
    Edinburgh Research Explorer Blinded predictions of standard binding free energies: lessons learned from the SAMPL6 challenge Citation for published version: Papadourakis, M, Bosisio, S & Michel, J 2018, 'Blinded predictions of standard binding free energies: lessons learned from the SAMPL6 challenge', Journal of computer-Aided molecular design. https://doi.org/10.1007/s10822-018-0154-6 Digital Object Identifier (DOI): 10.1007/s10822-018-0154-6 Link: Link to publication record in Edinburgh Research Explorer Document Version: Peer reviewed version Published In: Journal of computer-Aided molecular design General rights Copyright for the publications made accessible via the Edinburgh Research Explorer is retained by the author(s) and / or other copyright owners and it is a condition of accessing these publications that users recognise and abide by the legal requirements associated with these rights. Take down policy The University of Edinburgh has made every reasonable effort to ensure that Edinburgh Research Explorer content complies with UK legislation. If you believe that the public display of this file breaches copyright please contact [email protected] providing details, and we will remove access to the work immediately and investigate your claim. Download date: 08. Oct. 2021 Noname manuscript No. (will be inserted by the editor) Blinded predictions of standard binding free energies: lessons learned from the SAMPL6 challenge Michail Papadourakis, Stefano Bosisio, Julien Michel* EaStCHEM School of Chemistry, Joseph Black Building, King’s Buildings, Edinburgh EH9 3FJ, UK. Abstract In the context of the SAMPL6 challenges, series of blinded predictions of standard binding free energies were made with the SOMD software for a dataset of 27 host-guest systems featuring two octa-acids hosts (OA and TEMOA) and a cucurbituril ring (CB8) host.
    [Show full text]
  • Anew Drug Design Strategy in the Liht of Molecular Hybridization Concept
    www.ijcrt.org © 2020 IJCRT | Volume 8, Issue 12 December 2020 | ISSN: 2320-2882 “Drug Design strategy and chemical process maximization in the light of Molecular Hybridization Concept.” Subhasis Basu, Ph D Registration No: VB 1198 of 2018-2019. Department Of Chemistry, Visva-Bharati University A Draft Thesis is submitted for the partial fulfilment of PhD in Chemistry Thesis/Degree proceeding. DECLARATION I Certify that a. The Work contained in this thesis is original and has been done by me under the guidance of my supervisor. b. The work has not been submitted to any other Institute for any degree or diploma. c. I have followed the guidelines provided by the Institute in preparing the thesis. d. I have conformed to the norms and guidelines given in the Ethical Code of Conduct of the Institute. e. Whenever I have used materials (data, theoretical analysis, figures and text) from other sources, I have given due credit to them by citing them in the text of the thesis and giving their details in the references. Further, I have taken permission from the copyright owners of the sources, whenever necessary. IJCRT2012039 International Journal of Creative Research Thoughts (IJCRT) www.ijcrt.org 284 www.ijcrt.org © 2020 IJCRT | Volume 8, Issue 12 December 2020 | ISSN: 2320-2882 f. Whenever I have quoted written materials from other sources I have put them under quotation marks and given due credit to the sources by citing them and giving required details in the references. (Subhasis Basu) ACKNOWLEDGEMENT This preface is to extend an appreciation to all those individuals who with their generous co- operation guided us in every aspect to make this design and drawing successful.
    [Show full text]
  • Understanding TRPV1 Activation by Ligands: Insights from the Binding
    Understanding TRPV1 activation by ligands: Insights PNAS PLUS from the binding modes of capsaicin and resiniferatoxin Khaled Elokelya,b,c, Phanindra Velisettyd, Lucie Delemottea, Eugene Palovcaka, Michael L. Kleina,b,1, Tibor Rohacsd, and Vincenzo Carnevalea,1 aInstitute for Computational Molecular Science, Temple University, Philadelphia, PA 19122; bDepartment of Chemistry, Temple University, Philadelphia, PA 19122; cDepartment of Pharmaceutical Chemistry, Tanta University, 31527 Tanta, Egypt; and dDepartment of Pharmacology, Physiology and Neuroscience, Rutgers–New Jersey Medical School, Newark, NJ 07103 Contributed by Michael L. Klein, December 7, 2015 (sent for review November 6, 2015; reviewed by Kenton J. Swartz and Vladimir Yarov-Yarovoy) The transient receptor potential cation channel subfamily V member TRPV1 is known to be the target of capsaicin (CAPS), the active 1 (TRPV1) or vanilloid receptor 1 is a nonselective cation channel that component of chili peppers, and it can also be referred to as the is involved in the detection and transduction of nociceptive stimuli. capsaicin receptor (18). Resiniferatoxin (RTX), a phorbol ester Inflammation and nerve damage result in the up-regulation of TRPV1 isolated from the irritant lattices of the Moroccan cactus, shows a transcription, and, therefore, modulators of TRPV1 channels are much higher affinity for TRPV1 than CAPS (19). Both compounds potentially useful in the treatment of inflammatory and neuropathic activate TRPV1, causing the channel to be more permeable to pain. Understanding the binding modes of known ligands would cations, ultimately resulting in an analgesic effect due to channel significantly contribute to the success of TRPV1 modulator drug desensitization. CAPS can be subdivided into three structural re- design programs.
    [Show full text]
  • Survey Results
    Results of the 2017 Roadmap survey of the Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) challenge community David L. Mobley1, John D. Chodera2 and Michael K. Gilson3 1Departments of Pharmaceutical Sciences and Chemistry, University of California, Irvine 2Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center 3Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego Abstract: The Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) series of blind prediction challenges provide unbiased, prospective tests of computational methods, serving as a crowdsourcing mechanism to drive innovation. These challenges solicit submissions from the computational community to predict properties of interest, such as solvation, partition, or binding of drug-like molecules. These challenges provided substantial benefit to the community, and have led to roughly 100 publications, many of which are broadly cited (see attached bibliography). We are currently seeking funding from the NIH and surveyed the community concerning experiences with SAMPL and how our future plans for SAMPL can best align with the community’s interests and needs. This document summarizes the results of this survey and describes our findings. On the whole, the community enthusiastically supports our plans for the future of SAMPL, and provided modest suggestions to further strengthen our plans. For up-to-date info please see the SAMPL website. Survey methods and results Here, we reproduce the results of the survey in full except for anonymizing respondents. Questions 1 and 2 dealt with identifying information (name and e-mail) and are thus bypassed here. There were 44 respondents, though not all respondents answered all questions.
    [Show full text]
  • Multipose Binding in Molecular Docking
    Int. J. Mol. Sci. 2014, 15, 2622-2645; doi:10.3390/ijms15022622 OPEN ACCESS International Journal of Molecular Sciences ISSN 1422-0067 www.mdpi.com/journal/ijms Article Multipose Binding in Molecular Docking Kalina Atkovska 1,†, Sergey A. Samsonov 1,†,*, Maciej Paszkowski-Rogacz 2 and M. Teresa Pisabarro 1,* 1 Structural Bioinformatics, BIOTEC TU Dresden, Tatzberg 47-51, Dresden 01307, Germany; E-Mail: [email protected] 2 Department of Medical Systems Biology, TU Dresden, Medical Faculty, University Hospital Carl Gustav Carus, Dresden 01307, Germany; E-Mail: [email protected] † These authors contributed equally to this work. * Authors to whom correspondence should be addressed; E-Mails: [email protected] (M.T.P.); [email protected] (S.A.S.); Tel.: +49-351-463-40071 (M.T.P. & S.A.S.); Fax: +49-351-463-40087 (M.T.P. & S.A.S.). Received: 18 December 2013; in revised form: 17 January 2014 / Accepted: 21 January 2014 / Published: 14 February 2014 Abstract: Molecular docking has been extensively applied in virtual screening of small molecule libraries for lead identification and optimization. A necessary prerequisite for successful differentiation between active and non-active ligands is the accurate prediction of their binding affinities in the complex by use of docking scoring functions. However, many studies have shown rather poor correlations between docking scores and experimental binding affinities. Our work aimed to improve this correlation by implementing a multipose binding concept in the docking scoring scheme. Multipose binding, i.e., the property of certain protein-ligand complexes to exhibit different ligand binding modes, has been shown to occur in nature for a variety of molecules.
    [Show full text]
  • Overview of the SAMPL5 Host-Guest Challenge: Are We Doing Better? Jian Yin†, Niel M
    Overview of the SAMPL5 Host-Guest Challenge: Are We Doing Better? Jian Yin†, Niel M. Henriksen†, David R. Slochower†, Michael R. Shirts‡, Michael W. ¶ † * Chiu , David L. Mobleyѱ and Michael K. Gilson †Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA ‡Department of Chemical Engineering, University of Virginia, Charlottesville, Virginia 22904, USA ¶Qualcomm Institute, University of California, San Diego, La Jolla, CA 92093, USA ѱDepartments of Pharmaceutical Sciences and Chemistry, University of California Irvine, Irvine, CA 92697, USA Keywords Host-guest, molecular recognition, computer-aided drug design, Blind challenge, Binding affinity *Corresponding author: Prof. Michael K. Gilson Email: [email protected] Telephone: 858-822-0622 Fax: 858-822-7726 Abstract The ability to computationally predict protein-small molecule Binding affinities with high accuracy would accelerate drug discovery and reduce its cost By eliminating rounds of trial-and- error synthesis and experimental evaluation of candidate ligands. As academic and industrial groups work toward this capability, there is an ongoing need for datasets that can Be used to rigorously test new computational methods. Although protein-ligand data are clearly important for this purpose, their size and complexity make it difficult to oBtain well-converged results and to trouBleshoot computational methods. Host-guest systems offer a valuable alternative class of test cases, as they exemplify noncovalent molecular recognition But are far smaller and simpler. As a consequence, host-guest systems have Been part of the prior two rounds of SAMPL prediction exercises, and they also figure in the present SAMPL5 round. In addition to Being Blinded, and thus avoiding Biases that may arise in retrospective studies, the SAMPL challenges have the merit of focusing multiple researchers on a common set of molecular systems, so that methods may Be compared and ideas exchanged.
    [Show full text]
  • Pka Measurements for the SAMPL6 Prediction Challenge for A
    bioRxiv preprint doi: https://doi.org/10.1101/368787; this version posted July 13, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. PrEPRINT AHEAD OF SUBMISSION — July 13, 2018 P MEASUREMENTS FOR THE SAMPL6 1 K A PREDICTION CHALLENGE FOR A SET OF KINASE 2 inhibitor-likE FRAGMENTS 3 1,2 3 1,4 5 4 Mehtap Işık , DorOTHY LeVORSE , Ariën S. RustenburG , IKENNA E. Ndukwe , Heather 6 5 5 5 6 5 WANG , Xiao WANG , Mikhail Reibarkh , Gary E. Martin , AleXEY A. MakarOV , David L. 7 3* 1* 6 MobleY , Timothy Rhodes , John D. ChoderA Computational AND Systems Biology PrOGRam, Sloan Kettering Institute, Memorial Sloan 7 1 Kettering Cancer Center, NeW York, NY 10065, United States; Tri-INSTITUTIONAL PhD PrOGRAM IN 8 2 Chemical Biology, WEILL Cornell GrADUATE School OF Medical Sciences, Cornell University, NeW York, 9 NY 10065, United States; Pharmaceutical Sciences, MRL, MerCK & Co., Inc., 126 East Lincoln 10 3 Avenue, Rahway, NeW JerseY 07065, United States; GrADUATE PrOGRAM IN Physiology, Biophysics, 11 4 AND Systems Biology, WEILL Cornell Medical College, NeW York, NY 10065, United States; PrOCESS 12 5 AND Analytical ResearCH AND Development, MerCK & Co., Inc., Rahway, NJ 07065, United States; 13 Analytical ResearCH & Development, MRL, MerCK & Co., Inc., MRL, 126 East Lincoln Avenue, 14 6 Rahway, NeW JerseY 07065, United States; Department OF Pharmaceutical
    [Show full text]
  • Biomolecular Modeling and Simulation: a Prospering Multidisciplinary Field
    Annual Review of Biophysics Biomolecular Modeling and Simulation: A Prospering Multidisciplinary Field Tamar Schlick,1,2,3 Stephanie Portillo-Ledesma,1 Christopher G. Myers,1 Lauren Beljak,4 Justin Chen,4 Sami Dakhel,4 Daniel Darling,4 Sayak Ghosh,4 Joseph Hall,4 Mikaeel Jan,4 Emily Liang,4 Sera Saju,4 Mackenzie Vohr,4 Chris Wu,4 Yifan Xu,4 and Eva Xue4 1Department of Chemistry, New York University, New York, New York 10003, USA; email: [email protected] 2Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA 3New York University–East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 200122, China 4College of Arts and Science, New York University, New York, New York 10003, USA Annu. Rev. Biophys. 2021. 50:267–301 Keywords First published as a Review in Advance on biomolecular modeling, biomolecular simulation, biomolecular dynamics, February 19, 2021 structure prediction, protein folding, DNA folding, RNA folding, The Annual Review of Biophysics is online at multiscale modeling, citizen science projects, machine learning, artificial biophys.annualreviews.org intelligence https://doi.org/10.1146/annurev-biophys-091720- 102019 Abstract Copyright © 2021 by Annual Reviews. We reassess progress in the field of biomolecular modeling and simulation, Annu. Rev. Biophys. 2021.50:267-301. Downloaded from www.annualreviews.org All rights reserved following up on our perspective published in 2011. By reviewing metrics Access provided by New York University - Bobst Library on 05/12/21. For personal use only. for the field’s productivity and providing examples of success, we under- score the productive phase of the field, whose short-term expectations were overestimated and long-term effects underestimated.
    [Show full text]
  • Octanol-Water Partition Coefficient Measurements for the SAMPL6
    bioRxiv preprint doi: https://doi.org/10.1101/757393; this version posted September 8, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. PrEPRINT AHEAD OF SUBMISSION — September 8, 2019 Octanol-water PARTITION COEffiCIENT 1 MEASUREMENTS FOR THE SAMPL6 Blind PrEDICTION 2 Challenge 3 1,2 3 4 Mehtap Işık (ORCID: 0000-0002-6789-952X) , DorOTHY LeVORSE , David L. MobleY (ORCID: 4 3* 5 0000-0002-1083-5533) , Timothy Rhodes (ORCID: 0000-0001-7534-9221) , John D. ChoderA (ORCID: 1* 6 0000-0003-0542-119X) Computational AND Systems Biology PrOGRam, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, 7 1 NeW York, NY 10065, United States; Tri-INSTITUTIONAL PhD PrOGRAM IN Chemical Biology, WEILL Cornell GrADUATE School 8 2 OF Medical Sciences, Cornell University, NeW York, NY 10065, United States; Pharmaceutical Sciences, MRL, MerCK & 9 3 Co., Inc., 126 East Lincoln Avenue, Rahway, NeW JerseY 07065, United States; Department OF Pharmaceutical 10 4 Sciences AND Department OF Chemistry, University OF California, Irvine, Irvine, California 92697, United States 11 12 *For CORRespondence: 13 [email protected] (TR); [email protected] (JDC) 14 15 AbstrACT Partition COEffiCIENTS DESCRIBE THE EQUILIBRIUM PARTITIONING OF A single, DEfiNED CHARGE STATE OF A SOLUTE BETWEEN 16 TWO IMMISCIBLE phases, TYPICALLY A NEUTRAL solute. Octanol-water PARTITION COEffiCIENTS (Kow), OR THEIR LOGARITHMS (log P), ARE 17 FREQUENTLY USED AS A MEASURE OF LIPOPHILICITY IN DRUG DISCOVERY.
    [Show full text]
  • UC Irvine Mobley Lab Datasets
    UC Irvine Mobley Lab Datasets Title Advancing predictive modeling through focused development of model systems to drive new modeling innovations Permalink https://escholarship.org/uc/item/7cf8c6cr Authors Mobley, David L Chodera, John D Isaacs, Lyle et al. Publication Date 2016-10-05 License https://creativecommons.org/licenses/by-nc-nd/4.0/ 4.0 eScholarship.org Powered by the California Digital Library University of California Advancing predictive modeling through focused development of model systems to drive new modeling innovations A funding proposal recently submitted to the NIH David L. Mobley Associate Professor, Departments of Pharmaceutical Sciences and Chemistry, University of California, Irvine John D. Chodera Assistant Faculty Member, Computational Biology Program, Memorial Sloan-Kettering Cancer Center Lyle Isaacs Professor, Department of Chemistry and Biochemistry, University of Maryland Bruce C. Gibb Professor, Department of Chemistry, Tulane University ABSTRACT This work seeks to advance quantitative methods for biomolecular design, especially for predicting biomolecular interactions, via a focused series of community blind prediction challenges. Physical methods for predicting binding free energies, or “free energy methods”, are poised to dramatically reshape early stage drug discovery, and are already finding applications in pharmaceutical lead optimization. However, performance is unreliable, the domain of applicability is limited, and failures in pharmaceutical applications are often hard to understand and fix. On the other hand, these methods can now typically predict a variety of simple physical properties such as solvation free energies or relative solubilities, though there is still clear room for improvement in accuracy. In recent years, competitions and crowdsourcing have proven an effective model for driving innovations in diverse fields.
    [Show full text]