Virtual Screening of One Billion Compound Libraries Using Novel

Total Page:16

File Type:pdf, Size:1020Kb

Virtual Screening of One Billion Compound Libraries Using Novel Dr. Olexandr Isayev and Prof. Denis Fourches Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, USA Decline in Pharmaceutical R&D efficiency The cost of developing a new drug roughly doubles every nine years. 1033 drug-like chemicals* 108 compounds in PubChem 106 compounds in ChEMBL with ≥1known bioactivity Scannell et al. Nature Reviews Drug Discovery, 2012, 11, 191-200 Need of novel bio/cheminformatics methods that (i) Fully exploit the potential of modern chemical biological data streams; (ii) Reliably forecast compounds’ bioactivity and safety profiles; (iii) Accelerate the translation from basic research to drug candidates * Polishchuk, Madzhidov, Varnek. J Comput Aided Mol Des. 2013, 27(8):675-9. 2 Ligand-based Virtual Screening to identify potential hits Empirical Rules/Filters Similarity Search ConsensusQSAR MODELS QSA VIRTUAL SCREENING ~102 – 103 molecules Potential ~106 – 109 Hits molecules 3 Thousands of molecular descriptors O are available for organic compounds N constitutional, topological, structural, 0.613 CC quantum mechanics based, fragmental, steric, O pharmacophoric, geometrical, 0.380 A O N thermodynamical, conformational, etc. A O -0.222 O D Samples Features (descriptors) CC M N 0.708 M O E (compounds) N Descriptor X X ... X 1 2 1.146 TTm PP O matrixS Quantitative N C 1 X11 X12 0.491... X1m O StructureACTIVITY (i) II O N R 0.301 2 X X ... X O I 21 22 2m Activity 0.141 VV UU N P O R...elationships ... ... 0.956... ... N T II N O N O 0.256 N - Buildingn of modelsX X ... X R n1 n2 TTnm DD using machine learning 0.799 S methods (NN, SVM, RF) O 1.195 Y N Y SS - Validation of models 1.005 according to numerous statistical procedures, and their applicability domains. 4 Discovery of Novel Antimalarial Compounds Enabled by QSAR-Based Virtual Screening - Severe infectious diseases (~700,000 deaths per year worldwide) - Caused by unicellular eukaryotic parasites, mainly Plasmodium falciparum - Modeling Set: 158 actives, 2,975 inactives. kNN and SVM with ISIDA descriptors External predictive power of QSAR models is critical 176 putative hits Chembridgeto enable Similaritytheir applicationDrug-likeness to virtualQSAR screening. Database Filters Filters models 42 putative inactives 454,638 44,112 39,944 chemicalsTechnically chemicals challenging chemicals to compute molecular properties and descriptors for more >109 compounds. EXPERIMENTAL CONFIRMATION (Dr. Guy, St Jude Res. Hosp) -Most potent hit (SJ000565000) with EC = 95.6 nM and novel 50 9 Nomolecular cheminformatics scaffold architecture is able to screen >10 -7 compounds with EC less than 2 μM compounds. 50 -18 compounds with moderate activity (EC50 2-8 μM) -All of the 42 putative inactives have EC50 >10 μM 14.2% hit rate >> HTS hit rate (0.1 – 5%) SJ000565000 Zhang, Fourches, et al. JCIM, 2013, 53, 475-492 5 Study Design 6 6 Chemical Datasets Largest publicly available virtual libraries GDB-13 955 M compounds GDB-13-ABCDE subset 141 M GDB-17 subset 50 M 1 Blum and Reymond, 2009, J Am Chem Soc, 131, 8732–8733 2 Ruddigkeit et al., 2012, J Chem Inf Model, 52, 2864-2875 Setup Hardware Stack Software Stack • Intel Core i7 4770 • Ubuntu 12.04 CPU 3.4GHz, • Anaconda Scientific Py • Intel H87-based thon 2.7.6 Distribution motherboard, • Pandas / Pytables • 32GB of DDR3 1600 • MKL optimized NumPy memory • NUMBAPRO for CPU • Nvidia Tesla K20 for optimization GPU accelerated • RDKit calculations • C / CUDA High throughput -Data parsing from descriptor generator Data Smiles Mol weight, logP, Processing -2D structure H acceptors/donors, generation Rot bonds, Daylight Chemical 30M/hr -Automatic curation fingerprints. Library Screening & Modeling High throughput -Data parsing from descriptor generator Data Smiles Mol weight, logP, Processing -2D structure H acceptors/donors, generation Rot bonds, Daylight Chemical 30M/hr -Automatic curation fingerprints. Library Smaller datasets (<1M) directly allocated in RAM Storage & Interactive Manipulation analytics with IPython Indexed, fully searchable, and accessible via high level API, e.g., (data. MolWt > 150) & (data.logP == 3) Access in chunks or streaming compound by compound. High throughput -Data parsing from descriptor generator Data Smiles Mol weight, logP, Processing -2D structure H acceptors/donors, generation Rot bonds, Daylight Chemical 30M/hr -Automatic curation fingerprints. Library Smaller datasets (<1M) directly allocated in RAM Storage & Interactive Manipulation analytics with IPython Indexed, fully searchable, and accessible via high level API, e.g., (data. MolWt > 150) & (data.logP == 3) Access in chunks or streaming compound by compound. Modeling & Screening GPU accelerated Rapid screening of extremely large libraries with similarity search multiple molecular probes and QSAR/QSPR models ~1M Tanimoto/s GPU - Case Study 1 Fast Computation of Molecular Properties for Extremely Large Chemical Libraries GDB-13 GDB-17 Subset of 141 M Random sample of 50 M Our GPU-accelerated cheminformatics platform is able to compute key molecular properties for GDB-13 (855M), GBD-13-ABCDE (141M), and a subset of GDB-17 (50M) compounds. GPU - Case Study 1 Fast Computation of Molecular Properties for Extremely Large Chemical Libraries Our GPU-accelerated cheminformatics platform is able to compute key molecular properties for GDB-13 (855M), GBD-13-ABCDE (141M), and a subset of GDB-17 (50M) compounds. GPU - Case Study 2 Virtual Screening of Very Large Chemical Libraries to Identify Bioactive Compounds - Lacosamide (trade name Vimpat) is an anticonvulsant drug used to prevent seizures for patients treated for epilepsy; Lacosamide - Functionalized amino acid; - Many active analogues have been synthesized in Prof. Harold Kohn’s laboratory* at UNC-CH. *Wang et al., 2011, ACS Chem Neurosci, 2, 90–106 GPU - Case Study 2 Virtual Screening of Very Large Chemical Libraries to Identify Bioactive Compounds Analog 1 Analog 2 Analog 3 Analog 4 Analog 5 Similarity search using 200M compound subset of GDB-13/17 Lacosamide as molecular probe GPU - Case Study 2 Virtual Screening of Very Large Chemical Libraries to Identify Bioactive Compounds Compound ID Tanimoto Ts The GPU-accelerated screening Analog 2 0.997 platform was able to retrieve: Analog 3 0.995 -known active analogues of Analog 1 0.994 lacosamide, Analog 4 0.992 -several functionalized amino Analog 5 0.978 acids present in GDB-13, Gdb13-a10573585 0.977 -a novel compound (Gdb17- Gdb13-b28137563 0.977 Gdb13-a36264983 0.976 44140083) fully matching the Gdb13-a36264952 0.976 pharmacophore of lacosamide. Gdb13-a10616005 0.976 Gdb13-a3011053 0.976 Gdb13-b21242261 0.976 Gdb17-44140083 0.976 Gdb13-a30878321 0.975 Gdb13-b3485216 0.975 In Summary • GPU-accelerated cheminformatics platform for high performance virtual screening of extremely large chemical libraries. • Tested for the analysis of the largest publicly available dataset GDB-13 (~900M compounds) and (2) the screening of ~200M compound library for similarity search using an anticonvulsant drug as the molecular probe. • Our platform aims to virtually screen billions of compounds using similarity filters and QSAR models. Acknowledgements • Professor Alex Tropsha (UNC-CH) • Colleagues at MML laboratory • NVIDIA & Mark Berger for generous hardware donation Funding - NSF ABI program - Office of Naval Research Molecular fingerprints - bit string encodings of structural features and/or calculated molecular properties. INFORMATION ABOUT THE PRESENCE OF MOLECULAR FRAGMENTS 1 – FRAGMENT IS PRESENT 0 – FRAGMENT IS ABSENT Similarity Search Similarity searching using fingerprint representations of molecules is one of the most widely used approaches for chemical database mining: it assumes that similar compounds possess similar biological activities. Tanimoto Coefficient From J. Bajorath, SSS Cheminformatics, Obernai 2008 .
Recommended publications
  • Structure-Based Virtual Screening of Hypothetical Inhibitors of the Enzyme Longiborneol Synthase—A Potential Target to Reduce Fusarium Head Blight Disease
    J Mol Model (2016) 22: 163 DOI 10.1007/s00894-016-3021-1 ORIGINAL PAPER Structure-based virtual screening of hypothetical inhibitors of the enzyme longiborneol synthase—a potential target to reduce Fusarium head blight disease E. Bresso1 & V. L ero ux 2 & M. Urban3 & K. E. Hammond-Kosack3 & B. Maigret2 & N. F. Martins1 Received: 17 December 2015 /Accepted: 27 May 2016 /Published online: 21 June 2016 # Springer-Verlag Berlin Heidelberg 2016 Abstract Fusarium head blight (FHB) is one of the most compounds from a library of 15,000 drug-like compounds. destructive diseases of wheat and other cereals worldwide. These putative inhibitors of longiborneol synthase provide a During infection, the Fusarium fungi produce mycotoxins that sound starting point for further studies involving molecular represent a high risk to human and animal health. Developing modeling coupled to biochemical experiments. This process small-molecule inhibitors to specifically reduce mycotoxin could eventually lead to the development of novel approaches levels would be highly beneficial since current treatments to reduce mycotoxin contamination in harvested grain. unspecifically target the Fusarium pathogen. Culmorin pos- sesses a well-known important synergistically virulence role Keywords Fusarium mycotoxins . Culmorin . Inhibitors . among mycotoxins, and longiborneol synthase appears to be a Homology modeling . Molecular dynamics . Ensemble key enzyme for its synthesis, thus making longiborneol syn- docking thase a particularly interesting target. This study aims to dis- cover potent and less toxic agrochemicals against FHB. These compounds would hamper culmorin synthesis by inhibiting Introduction longiborneol synthase. In order to select starting molecules for further investigation, we have conducted a structure- Fusarium head blight (FHB), caused by Fusarium based virtual screening investigation.
    [Show full text]
  • Virtual Screening of the Inhibitors Targeting at the Viral Protein 40 of Ebola Virus V
    Karthick et al. Infectious Diseases of Poverty (2016) 5:12 DOI 10.1186/s40249-016-0105-1 RESEARCH ARTICLE Open Access Virtual screening of the inhibitors targeting at the viral protein 40 of Ebola virus V. Karthick1, N. Nagasundaram1, C. George Priya Doss1,2, Chiranjib Chakraborty1,3, R. Siva4, Aiping Lu1, Ge Zhang1 and Hailong Zhu1* Abstract Background: The Ebola virus is highly pathogenic and destructive to humans and other primates. The Ebola virus encodes viral protein 40 (VP40), which is highly expressed and regulates the assembly and release of viral particles in the host cell. Because VP40 plays a prominent role in the life cycle of the Ebola virus, it is considered as a key target for antiviral treatment. However, there is currently no FDA-approved drug for treating Ebola virus infection, resulting in an urgent need to develop effective antiviral inhibitors that display good safety profiles in a short duration. Methods: This study aimed to screen the effective lead candidate against Ebola infection. First, the lead molecules were filtered based on the docking score. Second, Lipinski rule of five and the other drug likeliness properties are predicted to assess the safety profile of the lead candidates. Finally, molecular dynamics simulations was performed to validate the lead compound. Results: Our results revealed that emodin-8-beta-D-glucoside from the Traditional Chinese Medicine Database (TCMD) represents an active lead candidate that targets the Ebola virus by inhibiting the activity of VP40, and displays good pharmacokinetic properties. Conclusion: This report will considerably assist in the development of the competitive and robust antiviral agents against Ebola infection.
    [Show full text]
  • Report on an NIH Workshop on Ultralarge Chemistry Databases Wendy A
    1 Report on an NIH Workshop on Ultralarge Chemistry Databases Wendy A. Warr Wendy Warr & Associates, 6 Berwick Court, Holmes Chapel, Cheshire, CW4 7HZ, United Kingdom. Email: [email protected] Introduction The virtual workshop took place on December 1-3, 2020. It was aimed at researchers, groups, and companies that generate, manage, sell, search, and screen databases of more than one billion small molecules (Figure 1). There were about 550 “attendees” from 37 different countries. Recent advances in computational chemistry have enabled researchers to navigate virtual chemical spaces containing billions of chemical structures, carrying out similarity searches, studying structure-activity relationships (SAR), experimenting with scaffold-hopping, and using other drug discovery methodologies.1 For clarity, one could differentiate “spaces” from “libraries”, and “libraries” from “databases”. Spaces are combinatorially constructed collections of compounds; they are usually very big indeed and it is not possible to enumerate all the precise chemical structures that are covered. Libraries are enumerated collections of full structures: usually fewer than 1010 molecules. Databases are a way to storing libraries, for example, in a relational database management system. Figure 1. Ultralarge chemical databases. (Source: Marcus Gastreich based on the publication by Hoffmann and Gastreich.) This report summarizes talks from about 30 practitioners in the field of ultralarge collections of molecules. The aim is to represent as accurately as possible the information that was delivered by the speakers; the report does not seek to be evaluative. 2 Welcoming remarks; defining a drug discovery gateway Susan Gregurick, Office of Data Science and Strategy, NIH, USA Data should be “findable, accessible, interoperable and reusable” (FAIR)2 and with this in mind, NIH has been creating, curating, integrating, and querying ultralarge chemistry databases.
    [Show full text]
  • Qsar Methods Development, Virtual and Experimental Screening for Cannabinoid Ligand Discovery
    QSAR METHODS DEVELOPMENT, VIRTUAL AND EXPERIMENTAL SCREENING FOR CANNABINOID LIGAND DISCOVERY by Kyaw Zeyar Myint BS, Biology, BS, Computer Science, Hampden-Sydney College, 2007 Submitted to the Graduate Faculty of School of Medicine in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2012 UNIVERSITY OF PITTSBURGH SCHOOL OF MEDICINE This dissertation was presented by Kyaw Zeyar Myint It was defended on August 20th, 2012 and approved by Dr. Ivet Bahar, Professor, Department of Computational and Systems Biology Dr. Billy W. Day, Professor, Department of Pharmaceutical Sciences Dr. Christopher Langmead, Associate Professor, Department of Computer Science, CMU Dissertation Advisor: Dr. Xiang-Qun Xie, Professor, Department of Pharmaceutical Sciences ii Copyright © by Kyaw Zeyar Myint 2012 iii QSAR METHODS DEVELOPMENT, VIRTUAL AND EXPERIMENTAL SCREENING FOR CANNABINOID LIGAND DISCOVERY Kyaw Zeyar Myint, PhD University of Pittsburgh, 2012 G protein coupled receptors (GPCRs) are the largest receptor family in mammalian genomes and are known to regulate wide variety of signals such as ions, hormones and neurotransmitters. It has been estimated that GPCRs represent more than 30% of current drug targets and have attracted many pharmaceutical industries as well as academic groups for potential drug discovery. Cannabinoid (CB) receptors, members of GPCR superfamily, are also involved in the activation of multiple intracellular signal transductions and their endogenous ligands or cannabinoids have attracted pharmacological research because of their potential therapeutic effects. In particular, the cannabinoid subtype-2 (CB2) receptor is known to be involved in immune system signal transductions and its ligands have the potential to be developed as drugs to treat many immune system disorders without potential psychotic side- effects.
    [Show full text]
  • Fast Three Dimensional Pharmacophore Virtual Screening of New Potent Non-Steroid Aromatase Inhibitors
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE J. Med. Chem. 2009, 52, 143–150 143 provided by Estudo Geral Fast Three Dimensional Pharmacophore Virtual Screening of New Potent Non-Steroid Aromatase Inhibitors Marco A. C. Neves,† Teresa C. P. Dinis,‡ Giorgio Colombo,*,§ and M. Luisa Sa´ e Melo*,† Centro de Estudos Farmaceˆuticos, Laborato´rio de Quı´mica Farmaceˆutica, Faculdade de Farma´cia, UniVersidade de Coimbra, 3000-295, Coimbra, Portugal, Centro de Neurocieˆncias, Laborato´rio de Bioquı´mica, Faculdade de Farma´cia, UniVersidade de Coimbra, 3000-295, Coimbra, Portugal, and Istituto di Chimica del Riconoscimento Molecolare, CNR, 20131, Milano, Italy ReceiVed July 28, 2008 Suppression of estrogen biosynthesis by aromatase inhibition is an effective approach for the treatment of hormone sensitive breast cancer. Third generation non-steroid aromatase inhibitors have shown important benefits in recent clinical trials with postmenopausal women. In this study we have developed a new ligand- based strategy combining important pharmacophoric and structural features according to the postulated aromatase binding mode, useful for the virtual screening of new potent non-steroid inhibitors. A small subset of promising drug candidates was identified from the large NCI database, and their antiaromatase activity was assessed on an in vitro biochemical assay with aromatase extracted from human term placenta. New potent aromatase inhibitors were discovered to be active in the low nanomolar range, and a common binding mode was proposed. These results confirm the potential of our methodology for a fast in silico high-throughput screening of potent non-steroid aromatase inhibitors. Introduction built and proved to be valuable in understanding the binding 14-16 Aromatase, a member of the cytochrome P450 superfamily determinants of several classes of inhibitors.
    [Show full text]
  • Bigger Is Better in Virtual Drug Screens
    NEWS & VIEWS RESEARCH might no longer be elusive, but the nitrogen Doney, S. C. Biogeosciences 11, 691–708 (2014). cycle remains a treasure trove for discovery. ■ 6. Knapp, A. N., Casciotti, K. L., Berelson, W. M., Prokopenko, M. G. & Capone, D. G. Proc. Natl Acad. Sci. USA 113, 4398–4403 (2016). Nicolas Gruber is at the Institute of 7. Bonnet, S., Caffin, M., Berthelot, H. & Moutin, T. Biogeochemistry and Pollutant Dynamics, Proc. Natl Acad. Sci. USA 114, E2800–E2801 (2017). ETH Zurich, 8092 Zurich, Switzerland. 8. Wang, W.-L., Moore, J. K., Martiny, A. C. & e-mail: [email protected] Primeau, F. W. Nature 566, 205–211 (2019). 9. DeVries, T. & Primeau, F. J. Phys. Oceanogr. 41, 1. Redfield, A. C. in James Johnstone Memorial Volume 2381–2401 (2011). (ed. Daniel, R. J.) 176–192 (Univ. College Liverpool, 10. Gruber, N. in Nitrogen in the Marine Environment 50 Years Ago 1934). 2nd edn (eds Capone, D. G., Bronk, D. A., 2. Carpenter, E. J. & Capone, D. G. in Nitrogen in the Mulholland, M. R. & Carpenter, E. J.) 1–150 Test tube babies may not be just Marine Environment 2nd edn (eds Capone, D. G., (Elsevier, 2008). Bronk, D. A., Mulholland, M. R. & Carpenter, E. J.) 11. Yang, S., Gruber, N., Long, M. C. & Vogt, M. Glob. round the corner, but the day 141–198 (Elsevier, 2008). Biogeochem. Cycles 31, 1470–1487 (2017). when all the knowledge necessary 3. Gruber, N. Proc. Natl Acad. Sci. USA 113, 12. Kim, I.-N. et al. Science 346, 1102–1106 (2014). to produce them will be available 4246–4248 (2016).
    [Show full text]
  • Autodock Vina Manual
    4/21/2021 AutoDock Vina - molecular docking and virtual screening program Home Features Download Tutorial FAQ Manual Questions? AutoDock Vina Manual Contents Features License Tutorial Frequently Asked Questions Platform Notes and Installation Windows Linux Mac Building from Source Other Software Usage Summary Configuration File Search Space Exhaustiveness Output Advanced Options Virtual Screening History Citation Getting Help Reporting Bugs Features Accuracy AutoDock Vina significantly improves the average accuracy of the binding mode predictions compared to AutoDock 4, judging by our tests on the training set used in AutoDock 4 development.[*] Additionally and independently, AutoDock Vina has been tested against a virtual screening benchmark called the Directory of Useful Decoys by the Watowich group, and was found to be "a strong competitor against the other programs, and at the top of the pack in many cases". It should be noted that all six of the other docking programs, to which it was compared, are distributed commercially. AutoDock Tools Compatibility For its input and output, Vina uses the same PDBQT molecular structure file Binding mode prediction accuracy on the test set. format used by AutoDock. PDBQT files can be generated (interactively or in "AutoDock" refers to AutoDock 4, and "Vina" to AutoDock batch mode) and viewed using MGLTools. Other files, such as the AutoDock Vina 1. and AutoGrid parameter files (GPF, DPF) and grid map files are not needed. Ease of Use Vina's design philosophy is not to require the user to understand its implementation details, tweak obscure search parameters, cluster results or know advanced algebra (quaternions). All that is required is the structures of the molecules being docked and the specification of the search space including the binding site.
    [Show full text]
  • Structure Based Pharmacophore Modeling, Virtual Screening
    www.nature.com/scientificreports OPEN Structure based pharmacophore modeling, virtual screening, molecular docking and ADMET approaches for identifcation of natural anti‑cancer agents targeting XIAP protein Firoz A. Dain Md Opo1,2, Mohammed M. Rahman3*, Foysal Ahammad4, Istiak Ahmed5, Mohiuddin Ahmed Bhuiyan2 & Abdullah M. Asiri3 X‑linked inhibitor of apoptosis protein (XIAP) is a member of inhibitor of apoptosis protein (IAP) family responsible for neutralizing the caspases‑3, caspases‑7, and caspases‑9. Overexpression of the protein decreased the apoptosis process in the cell and resulting development of cancer. Diferent types of XIAP antagonists are generally used to repair the defective apoptosis process that can eliminate carcinoma from living bodies. The chemically synthesis compounds discovered till now as XIAP inhibitors exhibiting side efects, which is making difculties during the treatment of chemotherapy. So, the study has design to identifying new natural compounds that are able to induce apoptosis by freeing up caspases and will be low toxic. To identify natural compound, a structure‑ based pharmacophore model to the protein active site cavity was generated following by virtual screening, molecular docking and molecular dynamics (MD) simulation. Initially, seven hit compounds were retrieved and based on molecular docking approach four compounds has chosen for further evaluation. To confrm stability of the selected drug candidate to the target protein the MD simulation approach were employed, which confrmed stability of the three compounds. Based on the fnding, three newly obtained compounds namely Caucasicoside A (ZINC77257307), Polygalaxanthone III (ZINC247950187), and MCULE‑9896837409 (ZINC107434573) may serve as lead compounds to fght against the treatment of XIAP related cancer, although further evaluation through wet lab is necessary to measure the efcacy of the compounds.
    [Show full text]
  • Virtual Screening in Drug Design – Overview of Most Frequent Techniques
    Mil. Med. Sci. Lett. (Voj. Zdrav. Listy) 2016, vol. 85(2), p. 75-79 ISSN 0372-7025 DOI: 10.31482/mmsl.2016.014 REVIEW ARTICLE VIRTUAL SCREENING IN DRUG DESIGN – OVERVIEW OF MOST FREQUENT TECHNIQUES Tomas Kucera Department of Toxicology and Military Pharmacy, Faculty of Military Health Sciences, University of Defence, Trebesska 1575, 500 01 Hradec Kralove, Czech Republic Received 22 nd May 2016. Revised 5 th June 2016. Published 14 th June 2016. Summary New and modern techniques of drug design are extensively used in parallel or instead of the classic ones. Applicability of virtual screening (VS) is growing with the computational performance. This article includes list and short description of most frequent used methods in VS. These methods are divided into two groups – ligand-based VS and structure-based VS. Ligand based methods include chemical similarity, pharmacophore and quantitative structure-activity relationship. Molecular docking and scoring are methods of the structure-based VS. Key words: virtual screening, drug design, docking, scoring, pharmacophore, fingerprint, QSAR INTRODUCTION technologies and growing computational perfor - mance [2]. The VS has three main advantages – per - Drug design is a costly and time-consuming pro - formance, price and safety – compared with traditi- cess. Some new methods are carried out in parallel onal methods or with in vitro high through-put scre - or instead of the traditional ones in the process of se - ening. It enables to test large databases of compounds arching for new drugs. This new performance tech - without the necessity to hold them. A lot of com - niques are called screening methods. We can enume- pounds predicted to be inactive can be eliminated rate e.g.
    [Show full text]
  • Virtual Screening, Docking, ADMET and System Pharmacology Studies
    www.nature.com/scientificreports OPEN Virtual screening, Docking, ADMET and System Pharmacology studies on Garcinia caged Xanthone Received: 16 November 2017 Accepted: 20 March 2018 derivatives for Anticancer activity Published: xx xx xxxx Sarfaraz Alam1,2 & Feroz Khan 1,2,3 Caged xanthones are bioactive compounds mainly derived from the Garcinia genus. In this study, a structure-activity relationship (SAR) of caged xanthones and their derivatives for anticancer activity against diferent cancer cell lines such as A549, HepG2 and U251 were developed through quantitative (Q)-SAR modeling approach. The regression coefcient (r2), internal cross-validation regression coefcient (q2) and external cross-validation regression coefcient (pred_r2) of derived QSAR models were 0.87, 0.81 and 0.82, for A549, whereas, 0.87, 0.84 and 0.90, for HepG2, and 0.86, 0.83 and 0.83, for U251 respectively. These models were used to design and screened the potential caged xanthone derivatives. Further, the compounds were fltered through the rule of fve, ADMET-risk and synthetic accessibility. Filtered compounds were then docked to identify the possible target binding pocket, to obtain a set of aligned ligand poses and to prioritize the predicted active compounds. The scrutinized compounds, as well as their metabolites, were evaluated for diferent pharmacokinetics parameters such as absorption, distribution, metabolism, excretion, and toxicity. Finally, the top hit compound 1G was analyzed by system pharmacology approaches such as gene ontology, metabolic networks, process networks, drug target network, signaling pathway maps as well as identifcation of of-target proteins that may cause adverse reactions. Cancer is a major public health problem worldwide and is the second-leading cause of death in the United States.
    [Show full text]
  • Cheminformatics in Drug Discovery, an Industrial Perspective
    Cheminformatics in Drug Discovery, an Industrial Perspective Hongming Chen, Thierry Kogej, Ola Engkvist Hit Discovery, Discovery Sciences, Astrazeneca R&D Gothenburg Drug Discovery Today An industry perspective of Today’s Challenge We need to become better, faster & cheaper Source: Paul et al., Nat. Rev. Drug Disc. (2010) 9:March Source: PhRMA profile 2016 2 IMED Biotech Unit I Discovery Sciences Cheminformatics @ AstraZeneca • HTS work-up • Library design • Virtual screening • Machine learning & AI 3 High Throughput Screening From Millions to just a few Low cost/compound High cost/compound Slide modified from Mark Wigglesworth, AZ, with permission HTS Analysis: Clustering analysis iHAT: An Spotfire add-in for HTS Analysis Early days • Heavily dependent on computational chemistry resources • Leverage the powerful visualization function of • Linux, scripts, static workflows, data in flat files Spotfire • Cutting, pasting and reformatting between • Annotation of compounds with in-house applications experimental and predicted data • Difficult to revisit or take over an analysis from a • Data integration from multiple sources colleague • Clustering of compounds • Time-consuming • Visualization and manipulation of cluster tree • NN search 5 iHAT: Clustering and Reclustering 6 Library design @ AstraZeneca • Diversity library is generally out of fashion • Focused library fit for specific project need • DNA encoded libraries become popular, but analysis is challenging, >60M to 8B library sizes Currently, use classical library design method to
    [Show full text]
  • Probabilistic Approach for Virtual Screening Based on Multiple Pharmacophores
    molecules Communication Probabilistic Approach for Virtual Screening Based on Multiple Pharmacophores Timur I. Madzhidov 1 , Assima Rakhimbekova 1, Alina Kutlushuna 1,2 and Pavel Polishchuk 2,* 1 A.M. Butlerov Institute of Chemistry, Kazan Federal University, 420008 Kazan, Russia; [email protected] (T.I.M.); [email protected] (A.R.); [email protected] (A.K.) 2 Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University and University Hospital in Olomouc, 77900 Olomouc, Czech Republic * Correspondence: [email protected]; Tel.: +420-585632298 Received: 7 December 2019; Accepted: 14 January 2020; Published: 17 January 2020 Abstract: Pharmacophore modeling is usually considered as a special type of virtual screening without probabilistic nature. Correspondence of at least one conformation of a molecule to pharmacophore is considered as evidence of its bioactivity. We show that pharmacophores can be treated as one-class machine learning models, and the probability the reflecting model’s confidence can be assigned to a pharmacophore on the basis of their precision of active compounds identification on a calibration set. Two schemes (Max and Mean) of probability calculation for consensus prediction based on individual pharmacophore models were proposed. Both approaches to some extent correspond to commonly used consensus approaches like the common hit approach or the one based on a logical OR operation uniting hit lists of individual models. Unlike some known approaches, the proposed ones can rank compounds retrieved by multiple models. These approaches were benchmarked on multiple ChEMBL datasets used for ligand-based pharmacophore modeling and externally validated on corresponding DUD-E datasets.
    [Show full text]