The Effect of Data Curation on the Accuracy of Quantitative Structure-Activity Relationship Models
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
FDA-Approved Drugs with Potent in Vitro Antiviral Activity Against Severe Acute Respiratory Syndrome Coronavirus 2
pharmaceuticals Article FDA-Approved Drugs with Potent In Vitro Antiviral Activity against Severe Acute Respiratory Syndrome Coronavirus 2 1, , 1, 2 1 Ahmed Mostafa * y , Ahmed Kandeil y , Yaseen A. M. M. Elshaier , Omnia Kutkat , Yassmin Moatasim 1, Adel A. Rashad 3 , Mahmoud Shehata 1 , Mokhtar R. Gomaa 1, Noura Mahrous 1, Sara H. Mahmoud 1, Mohamed GabAllah 1, Hisham Abbas 4 , Ahmed El Taweel 1, Ahmed E. Kayed 1, Mina Nabil Kamel 1, Mohamed El Sayes 1, Dina B. Mahmoud 5 , Rabeh El-Shesheny 1 , Ghazi Kayali 6,7,* and Mohamed A. Ali 1,* 1 Center of Scientific Excellence for Influenza Viruses, National Research Centre, Giza 12622, Egypt; [email protected] (A.K.); [email protected] (O.K.); [email protected] (Y.M.); [email protected] (M.S.); [email protected] (M.R.G.); [email protected] (N.M.); [email protected] (S.H.M.); [email protected] (M.G.); [email protected] (A.E.T.); [email protected] (A.E.K.); [email protected] (M.N.K.); [email protected] (M.E.S.); [email protected] (R.E.-S.) 2 Organic & Medicinal Chemistry Department, Faculty of Pharmacy, University of Sadat City, Menoufia 32897, Egypt; [email protected] 3 Department of Biochemistry & Molecular Biology, Drexel University College of Medicine, Philadelphia, PA 19102, USA; [email protected] 4 Department of Microbiology and Immunology, Zagazig University, Zagazig 44519, Egypt; [email protected] 5 Pharmaceutics Department, National Organization for Drug Control and Research, Giza 12654, Egypt; [email protected] 6 Department of Epidemiology, Human Genetics, and Environmental Sciences, University of Texas, Houston, TX 77030, USA 7 Human Link, Baabda 1109, Lebanon * Correspondence: [email protected] (A.M.); [email protected] (G.K.); [email protected] (M.A.A.) Contributed equally to this work. -
Medical Prescription Market Definitions
Horvath Health Policy Innovations in Healthcare Financing Policy PO Box 196, College Park, MD 20741 202/465-5836 [email protected] Medical Prescription (Rx) Market Definitions Drug Product Terminology Small Molecule Drugs: Drugs with an active chemical ingredient that is not live, but chemically synthesized and typically are taken orally or topically, such as capsules, tablets, powders, ointments, and sprays. Brands: Also referred to “small molecule drugs,” brand drugs require original research and development for FDA licensure (also called “FDA approval”). They are approved (or licensed) under a New Drug Application (NDA). They are patent protected for 20 years total (which usually includes years clinical research before the drug is approved, or licensed). They are still referred to as brands even after the patent has expired (which distinguishes these drugs from generics). A brand can be ‘first in class” if it is a new chemical entity or new mechanism of action. It is a “me too drug” if it is not first in class. The definition of “me too” varies – different chemical composition, different mechanism of action. The drug is quite similar but different enough to get a patent. Generic: Small molecule products that are demonstrated to be clinically equivalent to a branded product (e.g., same active ingredient and route of administration, same mechanism of action). Generics do not require original research for FDA approval. Generics come to market only after the patent has expired on the brand product. Licensed under an Abbreviated New Drug Application (ANDA) by the FDA. Large Molecule Drugs: Commonly referred to as “biologics” and “biosimilars.” They contain live active ingredients and are generally infused or injected, and otherwise not taken orally or topically. -
The Alexandria Library, a Quantum-Chemical Database of Molecular Properties for Force field Development 9 2017 Received: October 1 1 1 Mohammad M
www.nature.com/scientificdata OPEN Data Descriptor: The Alexandria library, a quantum-chemical database of molecular properties for force field development 9 2017 Received: October 1 1 1 Mohammad M. Ghahremanpour , Paul J. van Maaren & David van der Spoel Accepted: 19 February 2018 Published: 10 April 2018 Data quality as well as library size are crucial issues for force field development. In order to predict molecular properties in a large chemical space, the foundation to build force fields on needs to encompass a large variety of chemical compounds. The tabulated molecular physicochemical properties also need to be accurate. Due to the limited transparency in data used for development of existing force fields it is hard to establish data quality and reusability is low. This paper presents the Alexandria library as an open and freely accessible database of optimized molecular geometries, frequencies, electrostatic moments up to the hexadecupole, electrostatic potential, polarizabilities, and thermochemistry, obtained from quantum chemistry calculations for 2704 compounds. Values are tabulated and where available compared to experimental data. This library can assist systematic development and training of empirical force fields for a broad range of molecules. Design Type(s) data integration objective • molecular physical property analysis objective Measurement Type(s) physicochemical characterization Technology Type(s) Computational Chemistry Factor Type(s) Sample Characteristic(s) 1 Uppsala Centre for Computational Chemistry, Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Husargatan 3, Box 596, SE-75124 Uppsala, Sweden. Correspondence and requests for materials should be addressed to D.v.d.S. (email: [email protected]). -
IND Exemptions Chart
IND EXEMPTIONS Involving Drugs or Biologics Exemption 1 Exemption 4 Drug/Biologic US product Placebo * Drug/biologic product lawfully marketed in US * The clinical investigation involves the use of a placebo and the investigation does not otherwise * The investigation is not intended to be reported to FDA require submission of an IND as a well controlled study in support of a new indication for use nor intended to be used to support any other significant change in the labeling for the drug/biologic. * The investigation proposed is not intended to support a significant change in the advertising for the drug/ Exemption 5 biologic In Vivo Bioavailability or Bioequivalence * The investigation does not involve a route of * Test product does not contain a new chemical administration, dose, patient population, or other factor entity”’ as defined in 21 CFR 314.108(a) [** a drug that significantly increases the risks (or decreases the that contains no active moiety that has been acceptability of the risks) associated with the use of the approved by FDA in any other application. drug/biologic * The study does not involve a radioactively labeled * The investigation will be conducted in compliance with drug product. FDA regulations for the Protection of Human Subjects and Institutional Review Boards 21 CFR 50 & 56 * The study does not involve a cytotoxic drug product. * The investigation will be conducted in compliance with The investigator will conduct a bioavailability or the FDA requirements for Promotion and Charging for bioequivalence study -
Report on an NIH Workshop on Ultralarge Chemistry Databases Wendy A
1 Report on an NIH Workshop on Ultralarge Chemistry Databases Wendy A. Warr Wendy Warr & Associates, 6 Berwick Court, Holmes Chapel, Cheshire, CW4 7HZ, United Kingdom. Email: [email protected] Introduction The virtual workshop took place on December 1-3, 2020. It was aimed at researchers, groups, and companies that generate, manage, sell, search, and screen databases of more than one billion small molecules (Figure 1). There were about 550 “attendees” from 37 different countries. Recent advances in computational chemistry have enabled researchers to navigate virtual chemical spaces containing billions of chemical structures, carrying out similarity searches, studying structure-activity relationships (SAR), experimenting with scaffold-hopping, and using other drug discovery methodologies.1 For clarity, one could differentiate “spaces” from “libraries”, and “libraries” from “databases”. Spaces are combinatorially constructed collections of compounds; they are usually very big indeed and it is not possible to enumerate all the precise chemical structures that are covered. Libraries are enumerated collections of full structures: usually fewer than 1010 molecules. Databases are a way to storing libraries, for example, in a relational database management system. Figure 1. Ultralarge chemical databases. (Source: Marcus Gastreich based on the publication by Hoffmann and Gastreich.) This report summarizes talks from about 30 practitioners in the field of ultralarge collections of molecules. The aim is to represent as accurately as possible the information that was delivered by the speakers; the report does not seek to be evaluative. 2 Welcoming remarks; defining a drug discovery gateway Susan Gregurick, Office of Data Science and Strategy, NIH, USA Data should be “findable, accessible, interoperable and reusable” (FAIR)2 and with this in mind, NIH has been creating, curating, integrating, and querying ultralarge chemistry databases. -
Qsar Methods Development, Virtual and Experimental Screening for Cannabinoid Ligand Discovery
QSAR METHODS DEVELOPMENT, VIRTUAL AND EXPERIMENTAL SCREENING FOR CANNABINOID LIGAND DISCOVERY by Kyaw Zeyar Myint BS, Biology, BS, Computer Science, Hampden-Sydney College, 2007 Submitted to the Graduate Faculty of School of Medicine in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2012 UNIVERSITY OF PITTSBURGH SCHOOL OF MEDICINE This dissertation was presented by Kyaw Zeyar Myint It was defended on August 20th, 2012 and approved by Dr. Ivet Bahar, Professor, Department of Computational and Systems Biology Dr. Billy W. Day, Professor, Department of Pharmaceutical Sciences Dr. Christopher Langmead, Associate Professor, Department of Computer Science, CMU Dissertation Advisor: Dr. Xiang-Qun Xie, Professor, Department of Pharmaceutical Sciences ii Copyright © by Kyaw Zeyar Myint 2012 iii QSAR METHODS DEVELOPMENT, VIRTUAL AND EXPERIMENTAL SCREENING FOR CANNABINOID LIGAND DISCOVERY Kyaw Zeyar Myint, PhD University of Pittsburgh, 2012 G protein coupled receptors (GPCRs) are the largest receptor family in mammalian genomes and are known to regulate wide variety of signals such as ions, hormones and neurotransmitters. It has been estimated that GPCRs represent more than 30% of current drug targets and have attracted many pharmaceutical industries as well as academic groups for potential drug discovery. Cannabinoid (CB) receptors, members of GPCR superfamily, are also involved in the activation of multiple intracellular signal transductions and their endogenous ligands or cannabinoids have attracted pharmacological research because of their potential therapeutic effects. In particular, the cannabinoid subtype-2 (CB2) receptor is known to be involved in immune system signal transductions and its ligands have the potential to be developed as drugs to treat many immune system disorders without potential psychotic side- effects. -
QSAR Without Borders
Chem Soc Rev View Article Online REVIEW ARTICLE View Journal | View Issue QSAR without borders ab c d Cite this: Chem. Soc. Rev., 2020, Eugene N. Muratov, Ju¨rgen Bajorath, Robert P. Sheridan, e f f ghi 49,3525 Igor V. Tetko, Dmitry Filimonov, Vladimir Poroikov, Tudor I. Oprea, Igor I. Baskin, jk Alexandre Varnek, j Adrian Roitberg, l Olexandr Isayev, a Stefano Curtalolo, m Denis Fourches, n Yoram Cohen, o Alan Aspuru-Guzik, p David A. Winkler, qrst Dimitris Agrafiotis, u Artem Cherkasov *v and Alexander Tropsha *a Prediction of chemical bioactivity and physical properties has been one of the most important applications of statistical and more recently, machine learning and artificial intelligence methods in chemical sciences. This field of research, broadly known as quantitative structure–activity relationships (QSAR) modeling, has developed many important algorithms and has found a broad range of applications in physical organic and medicinal chemistry in the past 55+ years. This Perspective summarizes recent technological advances in QSAR modeling but it also highlights the applicability of algorithms, modeling methods, and validation Creative Commons Attribution-NonCommercial 3.0 Unported Licence. practices developed in QSAR to a wide range of research areas outside of traditional QSAR boundaries including synthesis planning, nanotechnology, materials science, biomaterials, and clinical informatics. Received 7th February 2020 As modern research methods generate rapidly increasing amounts of data, the knowledge of robust data- DOI: 10.1039/d0cs00098a driven modelling methods professed within the QSAR field can become essential for scientists working both within and outside of chemical research. We hope that this contribution highlighting the generalizable rsc.li/chem-soc-rev components of QSAR modeling will serve to address this challenge. -
Chemistrymedicinal Chemistry Series
Methods and Principles in ChemistryMedicinal Chemistry Series CASEPROFESSIONAL STUDY SAMPLER SCIENCE SAMPLER INCLUDING Chapter 2:21: The GPR81 Role HTS of Chemistry Case Study in Addressingby Eric Wellner Hunger and andOla FoodFjellström Security from LeadFrom Generation:The Chemical Methods,Element: Chemistry’s Strategies, Contribution and Case toStudies Our Global edited Future, by Jörg Holenz First Edition. Edited by Prof. Javier Garcia-Martinez and Dr. Elena Serrano-Torregrosa Chapter 22: The Integrated Optimization of Safety and DMPK Properties Enabling Chapter 5: Metal Sustainability from Global E-waste Management Preclinical Development: A Case History with S1P1 Agonists by Simon Taylor from EarlyFrom MetalDrug Sustainability:Development: Global Bringing Challenges, a Preclinical Consequences, Candidate and toProspects. the Clinic edited by Edited by Reed M. Izatt Fabrizio Giordanetto. Chapter 13: A Two-Phase Anaerobic Digestion Process for Biogas Production Chapter 14: BACE Inhibitors by Daniel F. Wyss, Jared N. Cumming, Corey O. Strickland, for Combined Heat and Power Generation for Remote Communities Fromand Andrew the Handbook W. Stamford of Clean from Energy Fragment-based Systems, Volume Drug 1. Edited Discovery: by Jinhu WuLessons and Outlook edited by Daniel A. Erlanson and Wolfgang Jahnke 597 21 GPR81 HTS Case Study Eric Wellner and Ola Fjellström 21.1 General Remarks One of the key lead generation strategies to identify new chemical entities against a certain target is high-throughput screening (HTS). Running an HTS requires a clear line of sight regarding the pharmacodynamic (PD) and pharma- cokinetic (PK) profile of the compounds one is interested in. This means that there has to be a clear screening and deconvolution strategy in place to success- fully assess the hits from an HTS output. -
Semantic Web Integration of Cheminformatics Resources with the SADI Framework Leonid L Chepelev1* and Michel Dumontier1,2,3
Chepelev and Dumontier Journal of Cheminformatics 2011, 3:16 http://www.jcheminf.com/content/3/1/16 METHODOLOGY Open Access Semantic Web integration of Cheminformatics resources with the SADI framework Leonid L Chepelev1* and Michel Dumontier1,2,3 Abstract Background: The diversity and the largely independent nature of chemical research efforts over the past half century are, most likely, the major contributors to the current poor state of chemical computational resource and database interoperability. While open software for chemical format interconversion and database entry cross-linking have partially addressed database interoperability, computational resource integration is hindered by the great diversity of software interfaces, languages, access methods, and platforms, among others. This has, in turn, translated into limited reproducibility of computational experiments and the need for application-specific computational workflow construction and semi-automated enactment by human experts, especially where emerging interdisciplinary fields, such as systems chemistry, are pursued. Fortunately, the advent of the Semantic Web, and the very recent introduction of RESTful Semantic Web Services (SWS) may present an opportunity to integrate all of the existing computational and database resources in chemistry into a machine-understandable, unified system that draws on the entirety of the Semantic Web. Results: We have created a prototype framework of Semantic Automated Discovery and Integration (SADI) framework SWS that exposes the QSAR descriptor functionality of the Chemistry Development Kit. Since each of these services has formal ontology-defined input and output classes, and each service consumes and produces RDF graphs, clients can automatically reason about the services and available reference information necessary to complete a given overall computational task specified through a simple SPARQL query. -
Flexible Heuristic Algorithm for Automatic Molecule Fragmentation: Application to the UNIFAC Group Contribution Model Simon Müller*
Müller J Cheminform (2019) 11:57 https://doi.org/10.1186/s13321-019-0382-3 Journal of Cheminformatics RESEARCH ARTICLE Open Access Flexible heuristic algorithm for automatic molecule fragmentation: application to the UNIFAC group contribution model Simon Müller* Abstract A priori calculation of thermophysical properties and predictive thermodynamic models can be very helpful for developing new industrial processes. Group contribution methods link the target property to contributions based on chemical groups or other molecular subunits of a given molecule. However, the fragmentation of the molecule into its subunits is usually done manually impeding the fast testing and development of new group contribution methods based on large databases of molecules. The aim of this work is to develop strategies to overcome the challenges that arise when attempting to fragment molecules automatically while keeping the defnition of the groups as simple as possible. Furthermore, these strategies are implemented in two fragmentation algorithms. The frst algorithm fnds only one solution while the second algorithm fnds all possible fragmentations. Both algorithms are tested to frag- ment a database of 20,000 molecules for use with the group contribution model Universal Quasichemical Func- tional Group Activity Coefcients+ (UNIFAC). Comparison of the results with a reference database shows that both algorithms are capable of successfully fragmenting all the molecules automatically. Furthermore, when applying them on a larger database it is shown, that the newly developed algorithms are capable of fragmenting structures previously thought not possible to fragment. Keywords: Molecule fragmentation, Cheminformatics, RDKit, Property prediction, Group contribution method, UNIFAC, Incrementation Introduction named QSPR methods (Quantitative Structure Property Cheminformatics is a growing feld due to the increas- Relationship). -
Open Chemoinformatic Resources to Explore the Structure, Properties and Chemical Space of Cite This: RSC Adv.,2017,7,54153 Molecules
RSC Advances REVIEW View Article Online View Journal | View Issue Open chemoinformatic resources to explore the structure, properties and chemical space of Cite this: RSC Adv.,2017,7,54153 molecules a ab a Mariana Gonzalez-Medina,´ J. Jesus´ Naveja, Norberto Sanchez-Cruz´ a and Jose´ L. Medina-Franco * New technologies are shaping the way drug discovery data is analyzed and shared. Open data initiatives and web servers are assisting the analysis of the large amounts of data that we are now able to produce. The final goal is to accelerate the process of moving from new data to useful information that could lead to Received 27th October 2017 treatments for human diseases. This review discusses open chemoinformatic resources to analyze the Accepted 21st November 2017 diversity and coverage of the chemical space of screening libraries and to explore structure–activity DOI: 10.1039/c7ra11831g relationships of screening data sets. Free resources to implement workflows and representative web- rsc.li/rsc-advances based applications are emphasized. Future directions in this field are also discussed. Creative Commons Attribution 3.0 Unported Licence. 1. Introduction connections between biological activities, ligands and proteins.3 During the past few years, there has been an important increase Herein we review representative chemoinformatic tools in open data initiatives to promote the availability of free essential to explore the structure, chemical space and properties research-based tools and information.1 While there is still some of molecules. The review is focused on recent and representative resistance to open data in some chemistry and drug discovery free web-based applications. -
A New Ligand-Based Approach to Virtual Screening and Profiling of Large Chemical Libraries
A new ligand-based approach to virtual screening and profiling of large chemical libraries Elisabet Gregori Puigjané Memòria presentada per optar al grau de Doctor en Biologia per la Universitat Pompeu Fabra. Aquesta Tesi Doctoral ha estat realitzada sota la direcció del Dr. Jordi Mestres al Departament de Ciències Experimentals i de la Salut de la Universitat Pompeu Fabra Jordi Mestres Elisabet Gregori Puigjané Barcelona, Maig 2008 The research in this thesis has been carried out at the Chemogenomics Laboratory (CGL) within the Unitat de Recerca en Informàtica Biomèdica (GRIB) at the Parc de Recerca Biomèdica de Barcelona (PRBB). The research carried out in this thesis has been supported by Chemotargets S. L. Table of contents Acknowledgements ........................................................................................... 3 Abstract .............................................................................................................. 5 Objectives ........................................................................................................... 7 List of publications ............................................................................................ 9 Part I – INTRODUCTION .................................................................................. 11 Chapter I.1. Drug discovery ..................................................................... 13 I.1.1. Obtaining a drug candidate ....................................................... 14 I.1.1.1. Hit identification ..........................................................