Managing, Profiling and Analyzing a Library of 2.6 Million
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Report on an NIH Workshop on Ultralarge Chemistry Databases Wendy A
1 Report on an NIH Workshop on Ultralarge Chemistry Databases Wendy A. Warr Wendy Warr & Associates, 6 Berwick Court, Holmes Chapel, Cheshire, CW4 7HZ, United Kingdom. Email: [email protected] Introduction The virtual workshop took place on December 1-3, 2020. It was aimed at researchers, groups, and companies that generate, manage, sell, search, and screen databases of more than one billion small molecules (Figure 1). There were about 550 “attendees” from 37 different countries. Recent advances in computational chemistry have enabled researchers to navigate virtual chemical spaces containing billions of chemical structures, carrying out similarity searches, studying structure-activity relationships (SAR), experimenting with scaffold-hopping, and using other drug discovery methodologies.1 For clarity, one could differentiate “spaces” from “libraries”, and “libraries” from “databases”. Spaces are combinatorially constructed collections of compounds; they are usually very big indeed and it is not possible to enumerate all the precise chemical structures that are covered. Libraries are enumerated collections of full structures: usually fewer than 1010 molecules. Databases are a way to storing libraries, for example, in a relational database management system. Figure 1. Ultralarge chemical databases. (Source: Marcus Gastreich based on the publication by Hoffmann and Gastreich.) This report summarizes talks from about 30 practitioners in the field of ultralarge collections of molecules. The aim is to represent as accurately as possible the information that was delivered by the speakers; the report does not seek to be evaluative. 2 Welcoming remarks; defining a drug discovery gateway Susan Gregurick, Office of Data Science and Strategy, NIH, USA Data should be “findable, accessible, interoperable and reusable” (FAIR)2 and with this in mind, NIH has been creating, curating, integrating, and querying ultralarge chemistry databases. -
Qsar Methods Development, Virtual and Experimental Screening for Cannabinoid Ligand Discovery
QSAR METHODS DEVELOPMENT, VIRTUAL AND EXPERIMENTAL SCREENING FOR CANNABINOID LIGAND DISCOVERY by Kyaw Zeyar Myint BS, Biology, BS, Computer Science, Hampden-Sydney College, 2007 Submitted to the Graduate Faculty of School of Medicine in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2012 UNIVERSITY OF PITTSBURGH SCHOOL OF MEDICINE This dissertation was presented by Kyaw Zeyar Myint It was defended on August 20th, 2012 and approved by Dr. Ivet Bahar, Professor, Department of Computational and Systems Biology Dr. Billy W. Day, Professor, Department of Pharmaceutical Sciences Dr. Christopher Langmead, Associate Professor, Department of Computer Science, CMU Dissertation Advisor: Dr. Xiang-Qun Xie, Professor, Department of Pharmaceutical Sciences ii Copyright © by Kyaw Zeyar Myint 2012 iii QSAR METHODS DEVELOPMENT, VIRTUAL AND EXPERIMENTAL SCREENING FOR CANNABINOID LIGAND DISCOVERY Kyaw Zeyar Myint, PhD University of Pittsburgh, 2012 G protein coupled receptors (GPCRs) are the largest receptor family in mammalian genomes and are known to regulate wide variety of signals such as ions, hormones and neurotransmitters. It has been estimated that GPCRs represent more than 30% of current drug targets and have attracted many pharmaceutical industries as well as academic groups for potential drug discovery. Cannabinoid (CB) receptors, members of GPCR superfamily, are also involved in the activation of multiple intracellular signal transductions and their endogenous ligands or cannabinoids have attracted pharmacological research because of their potential therapeutic effects. In particular, the cannabinoid subtype-2 (CB2) receptor is known to be involved in immune system signal transductions and its ligands have the potential to be developed as drugs to treat many immune system disorders without potential psychotic side- effects. -
Second Generation Inhibitors of BCR- ABL for the Treatment of Imatinib- Resistant Chronic Myeloid Leukaemia
REVIEWS Second generation inhibitors of BCR- ABL for the treatment of imatinib- resistant chronic myeloid leukaemia Ellen Weisberg*, Paul W. Manley‡, Sandra W. Cowan-Jacob§, Andreas Hochhaus|| and James D. Griffin¶ Abstract | Imatinib, a small-molecule ABL kinase inhibitor, is a highly effective therapy for early-phase chronic myeloid leukaemia (CML), which has constitutively active ABL kinase activity owing to the expression of the BCR-ABL fusion protein. However, there is a high relapse rate among advanced- and blast-crisis-phase patients owing to the development of mutations in the ABL kinase domain that cause drug resistance. Several second-generation ABL kinase inhibitors have been or are being developed for the treatment of imatinib- resistant CML. Here, we describe the mechanism of action of imatinib in CML, the structural basis of imatinib resistance, and the potential of second-generation BCR-ABL inhibitors to circumvent resistance. The BCR-ABL oncogene, which is the product of the design of new drugs to circumvent resistance, and Philadelphia chromosome (Ph) 22q, encodes a chimeric several new agents have been developed specifically BCR-ABL protein that has constitutively activated ABL for this purpose. These compounds have been well tyrosine kinase activity; it is the underlying cause of characterized for efficacy against the mutant enzymes chronic myeloid leukaemia (CML)1–3. Whereas the 210 in preclinical studies, and impressive therapeutic activ- kDa BCR-ABL protein is expressed in patients with ity has now been reported for two second generation CML, a 190 kDa BCR-ABL protein, resulting from an drugs in phase I and II clinical trials in patients with *Dana Farber Cancer alternative breakpoint in the BCR gene, is expressed in imatinib-resistant CML. -
Open Chemoinformatic Resources to Explore the Structure, Properties and Chemical Space of Cite This: RSC Adv.,2017,7,54153 Molecules
RSC Advances REVIEW View Article Online View Journal | View Issue Open chemoinformatic resources to explore the structure, properties and chemical space of Cite this: RSC Adv.,2017,7,54153 molecules a ab a Mariana Gonzalez-Medina,´ J. Jesus´ Naveja, Norberto Sanchez-Cruz´ a and Jose´ L. Medina-Franco * New technologies are shaping the way drug discovery data is analyzed and shared. Open data initiatives and web servers are assisting the analysis of the large amounts of data that we are now able to produce. The final goal is to accelerate the process of moving from new data to useful information that could lead to Received 27th October 2017 treatments for human diseases. This review discusses open chemoinformatic resources to analyze the Accepted 21st November 2017 diversity and coverage of the chemical space of screening libraries and to explore structure–activity DOI: 10.1039/c7ra11831g relationships of screening data sets. Free resources to implement workflows and representative web- rsc.li/rsc-advances based applications are emphasized. Future directions in this field are also discussed. Creative Commons Attribution 3.0 Unported Licence. 1. Introduction connections between biological activities, ligands and proteins.3 During the past few years, there has been an important increase Herein we review representative chemoinformatic tools in open data initiatives to promote the availability of free essential to explore the structure, chemical space and properties research-based tools and information.1 While there is still some of molecules. The review is focused on recent and representative resistance to open data in some chemistry and drug discovery free web-based applications. -
Medicinal Chemistry
Medicinal Chemistry Dr. Shuaib Alahmad Medicinal and Pharmaceutical Chemistry References: 1- Wilson and Gisvold’s Text book of Organic Medicinal & Pharmaceutical Chemistry, Twelfthe Edition, 2011 2- Gareth Thomas’’ Medicinal Chemistry; An Introduction,2nd Edition. 2007. 3- Dr.Iyad Allous lectures 2 Introduction Medicinal Chemistry: is the discovery, the development, the identification and the interpretation of the mode of action of biologically active compounds that can be used as drugs for the prevention, treatment or cure of human and animal diseases. Medicinal chemistry includes the study of already existing drugs, of their biological properties and their structure-activity relationships. During the early stages of medicinal chemistry development, scientists were primarily concerned with the isolation of medicinal agents found in plants. Today, scientists in this field are also equally concerned with the creation of new synthetic compounds as drugs. Medicinal chemistry is devoted to the discovery and development of new agents for treating diseases. 3 Introduction The primary objective of Medicinal Chemistry is the design and discovery of new compounds that are suitable for use as drugs. This process involves a team of workers from a wide range of disciplines such as chemistry, biology, biochemistry, pharmacology, mathematics, medicine and computing, amongst others 4 Introduction Medicinal chemistry covers the following stages: I.The first stage is lead discovery in which new active substances or drugs are identified and prepared -
Retro Drug Design: from Target Properties to Molecular Structures
Retro Drug Design: From Target Properties to Molecular Structures Yuhong Wang*, Sam Michael, Ruili Huang, Jinghua Zhao, Katlin Recabo, Danielle Bougie, Qiang Shu, Paul Shinn, Hongmao Sun* National Center for Advancing Translational Sciences (NCATS) 9800 Medical Center Drive, Rockville, MD 20850 *Contact information for the corresponding authors: Yuhong Wang Hongmao Sun, PhD NCATS/NIH NCATS/NIH 9800 Medical Center Dr. 9800 Medical Center Dr. Rockville, MD 20854 Rockville, MD 20854 Phone: 301-480-9855 Phone: 301-480-9839 e-mail: [email protected] e-mail: [email protected] Abstract: To generate drug molecules of desired properties with computational methods is the holy grail in pharmaceutical research. Here we describe an AI strategy, retro drug design, or RDD, to generate novel small molecule drugs from scratch to meet predefined requirements, including but not limited to biological activity against a drug target, and optimal range of physicochemical and ADMET properties. Traditional predictive models were first trained over experimental data for the target properties, using an atom typing based molecular descriptor system, ATP. Monte Carlo sampling algorithm was then utilized to find the solutions in the ATP space defined by the target properties, and the deep learning model of Seq2Seq was employed to decode molecular structures from the solutions. To test feasibility of the algorithm, we challenged RDD to generate novel drugs that can activate µ opioid receptor (MOR) and penetrate blood brain barrier (BBB). Starting from vectors of random numbers, RDD generated 180,000 chemical structures, of which 78% were chemically valid. About 42,000 (31%) of the valid structures fell into the property space defined by MOR activity and BBB permeability. -
A Chemaxon/KNIME Based Tool for Designing Chemical Libraries
A ChemAxon/KNIME based tool for designing chemical libraries Tim Parrott Brock Luty Dart NeuroScience Dart NeuroScience September 25, 2013 ChemAxon UGM Dart NeuroScience Small molecules to maintain cognitive vitality (LTM) Currently about 200 FTEs with build-out expected at 260 Privately held LLC by a single individual Scientific Computing Scientific Computing collaborates with other DNS Departments to deliver solutions that simplify and accelerate the drug discovery process. We rely on our (non-traditional) knowledge and experience in both Science and Technology to develop novel and efficient systems to meet this goal Scientific Computing Groups Computational Bioinformatics Chemistry Project Support Project Support - Modeling Philip Cheung - Target ID *Tami Marrone - SBDD/Library Design Doug Fenger - Expression Analysis / Pathways Meg McCarrick + 1 FTE James Na - Apply Methods - Novel Software algorithms Amy Shih - Pre-LO/LO/PCC - Enterprise Software (with Methods) Bill Sinko Information Methods Management Development Software Development Data / Biz Analysis - Informatics Software Development - Data Capture + 1 Group Lead Ron Blanford - Developing new methods John Jaeger - Analytics Daniel Garden - Enterprise Scale Architecture Tim Parrott - Data Access Kevin Neal - RIA (MVC) with SOA James Harr Hari Muddana - QA/Scientific Support - Extensions for ELN, Spotfire, IJC, etc Eileen Tompkins - Project Management + 1 FTE Heather Jones Background Dart NeuroScience (DNS) 200+ Scientists 50+ Chemists Parallel Synthesis Group We need a About 20 -
Topic 7 Drug Discovery and Design Pt. 1
Topic 7 Drug Discovery and design Pt. 1 Chapter 9 Patrick Contents Part 1: Sections 9.1-9.3 1. Target disease 2. Drug Targets 3. Testing Drugs 3.1. In vivo Tests 3.2. In vitro Tests 3.2.1. Enzyme Inhibition Tests 3.2.2. Testing with Receptors DRUG DESIGN AND DEVELOPMENT Stages 1) Identify target disease 2) Identify drug target 3) Establish testing procedures 4) Find a lead compound 5) Structure Activity Relationships (SAR) 6) Identify a pharmacophore 7) Drug design- optimising target interactions 8) Drug design - optimising pharmacokinetic properties 9) Toxicological and safety tests 10) Chemical development and production 11) Patenting and regulatory affairs 12) Clinical trials 1. TARGET DISEASE Priority for the Pharmaceutical Industry • Can the profits from marketing a new drug outweigh the cost of developing and testing that drug? Questions to be addressed • Is the disease widespread? (e.g. cardiovascular disease, ulcers, malaria) • Does the disease affect the first world? (e.g. cardiovascular disease, ulcers) • Are there drugs already on the market? • If so, what are there advantages and disadvantages? (e.g. side effects) • Can one identify a market advantage for a new therapy? 2. DRUG TARGETS-Remember? A) LIPIDS Cell Membrane Lipids B) PROTEINS Receptors Enzymes Carrier Proteins Structural Proteins (tubulin) C) NUCLEIC ACIDS DNA RNA D) CARBOHYDRATES Cell surface carbohydrates Antigens and recognition molecules 2. DRUG TARGETS TARGET SELECTIVITY Between species • Antibacterial and antiviral agents • Identify targets which are unique to the invading pathogen • Identify targets which are shared but which are significantly different in structure Within the body • Selectivity between different enzymes, receptors etc. -
Optimizing the Use of Open-Source Software Applications in Drug
DDT • Volume 11, Number 3/4 • February 2006 REVIEWS TICS INFORMA Optimizing the use of open-source • software applications in drug discovery Reviews Werner J. Geldenhuys1, Kevin E. Gaasch2, Mark Watson2, David D. Allen1 and Cornelis J.Van der Schyf1,3 1Department of Pharmaceutical Sciences, School of Pharmacy,Texas Tech University Health Sciences Center, Amarillo,TX, USA 2West Texas A&M University, Canyon,TX, USA 3Pharmaceutical Chemistry, School of Pharmacy, North-West University, Potchefstroom, South Africa Drug discovery is a time consuming and costly process. Recently, a trend towards the use of in silico computational chemistry and molecular modeling for computer-aided drug design has gained significant momentum. This review investigates the application of free and/or open-source software in the drug discovery process. Among the reviewed software programs are applications programmed in JAVA, Perl and Python, as well as resources including software libraries. These programs might be useful for cheminformatics approaches to drug discovery, including QSAR studies, energy minimization and docking studies in drug design endeavors. Furthermore, this review explores options for integrating available computer modeling open-source software applications in drug discovery programs. To bring a new drug to the market is very costly, with the current of combinatorial approaches and HTS. The addition of computer- price tag approximating US$800 million, according to data reported aided drug design technologies to the R&D approaches of a com- in a recent study [1]. Therefore, it is not surprising that pharma- pany, could lead to a reduction in the cost of drug design and ceutical companies are seeking ways to optimize costs associated development by up to 50% [6,7]. -
Chapter 7. Fragment-Based Drug Discovery
CHAPTER 7 Fragment-Based Drug Discovery Venkata Velvadapu1, Bennett T. Farmer2 and Allen B. Reitz1 1Fox Chase Chemical Diversity Center, Inc., Doylestown, PA, USA; 2Boehringer Ingelheim Pharmaceuticals, Inc., Ridgefield, CT, USA OUTLINE I. LigandÀProtein Interactions: First C. SAR by Mass Spectroscopy 171 Principles 162 D. Interferometry and Isothermal Titration A. Binding Energy as the Sum of the Parts 162 Calorimetry 171 B. Historical Development 162 E. Virtual Screening 172 C. First Principles: Ligand Efficiency 163 VI. Fragment Merging/Linking/Growing 172 II. What is Fragment-Based Drug Discovery? 164 A. Fragment Merging 172 A. Overview of FBDD 164 B. Fragment Linking 172 B. The Concept of FBDD 165 C. Fragment Growing 173 C. Fragments in FBDD 165 VII. Fragment Hit Follow-Up, and Pitfalls D. Fragments Hits vs. HTS Hits 165 to Avoid 174 III. Creation and Analysis of FBDD Libraries 166 A. How to Best Reduce False Positives A. Fragment Library Design 166 (NMR, MS) and False Negatives (X-ray) 174 B. Analysis of Fragments 167 B. Isothermal and Isothermal Titration C. The Role of the Medicinal Chemist in Calorimetry and Further Secondary FBDD 167 Analysis 174 C. Pitfalls to Avoid 175 IV. Fragment Screening Methods 167 A. Nuclear Magnetic Resonance 168 VIII. Zelborafs, First Approved Drug from B. 2D (protein-based) Screening 169 FBDD 175 C. X-ray Crystallography 169 IX. Limitations of FBDD 176 V. Other Biochemical and Biophysical X. Trends for the Future 176 Methods 171 A. Substrate Activity Screening 171 References 177 B. SPR Spectroscopy 171 Learn from yesterday; live for today; hope for tomorrow. -
Bringing Open Source to Drug Discovery
Bringing Open Source to Drug Discovery Chris Swain Cambridge MedChem Consulting Standing on the shoulders of giants • There are a huge number of people involved in writing open source software • It is impossible to acknowledge them all individually • The slide deck will be available for download and includes 25 slides of details and download links – Copy on my website www.cambridgemedchemconsulting.com Why us Open Source software? • Allows access to source code – You can customise the code to suit your needs – If developer ceases trading the code can continue to be developed – Outside scrutiny improves stability and security What Resources are available • Toolkits • Databases • Web Services • Workflows • Applications • Scripts Toolkits • OpenBabel (htttp://openbabel.org) is a chemical toolbox – Ready-to-use programs, and complete programmer's toolkit – Read, write and convert over 110 chemical file formats – Filter and search molecular files using SMARTS and other methods, KNIME add-on – Supports molecular modeling, cheminformatics, bioinformatics – Organic chemistry, inorganic chemistry, solid-state materials, nuclear chemistry – Written in C++ but accessible from Python, Ruby, Perl, Shell scripts… Toolkits • OpenBabel • R • CDK • OpenCL • RDkit • SciPy • Indigo • NumPy • ChemmineR • Pandas • Helium • Flot • FROWNS • GNU Octave • Perlmol • OpenMPI Toolkits • RDKit (http://www.rdkit.org) – A collection of cheminformatics and machine-learning software written in C++ and Python. – Knime nodes – The core algorithms and data structures are written in C ++. Wrappers are provided to use the toolkit from either Python or Java. – Additionally, the RDKit distribution includes a PostgreSQL-based cartridge that allows molecules to be stored in relational database and retrieved via substructure and similarity searches. -
Hit Discovery and Hit-To-Lead Approaches Reviews
Drug Discovery Today Volume 11, Numbers 15/16 August 2006 REVIEWS POST SCREEN Hit discovery and hit-to-lead approaches Reviews Gyo¨ rgy M. Keseru˝ 1 and Gergely M. Makara2 1 CADD&HTS Unit, Gedeon Richter Ltd, 19-21 Gyo¨mro˝iu´t, Budapest, H-1103, Hungary 2 Merck Research Laboratories, Merck & Co, RY80Y-325, 126 E. Lincoln Ave, Rahway, New Jersey, 07065, USA Hit discovery technologies range from traditional high-throughput screening to affinity selection of large libraries, fragment-based techniques and computer-aided de novo design, many of which have been extensively reviewed. Development of quality leads using hit confirmation and hit-to-lead approaches present their own challenges, depending on the hit discovery method used to identify the initial hits. In this paper, we summarize common industry practices adopted to tackle hit-to-lead challenges and review how the advantages and drawbacks of different hit discovery techniques could affect the various issues hit-to-lead groups face. It has been shown that marketed drugs are very frequently highly All hit discovery approaches have – often orthogonal – short- similar to the leads from which they were derived [1]. Thus, both comings prompting the frequent use of multiple techniques for hit the quality and the quantity of lead classes available to medicinal confirmation (Table 1). Challenges facing traditional HTS tech- chemists are primary drivers for discovering best-in-class medi- nologies include high false-positive rates, the need for reporter cines. This makes lead generation a crucial step in the drug dis- assays and the limitation in throughput imposed by testing com- covery process.