An Extensive Survey of Molecular Docking Tools and Their Applications Using

1 An extensive survey of molecular docking tools and their applications using 2 text mining and deep curation strategies. 3 4 Kamal Rawal1#, Tanishka Khurana1, Himanshu Sharma1, Sadika Verma1, Simmi Gupta1, 5 Chahat Kubba1, Ulrich Strych2, Peter Hotez2, 3, Maria Elena Bottazzi2, 3 6 7 1. Department of Biotechnology, Jaypee Institute of Information and Technology, 8 Noida, India. 9 2. Texas Children’s Hospital Center for Vaccine Development, Departments of 10 Pediatrics and Molecular Virology and Microbiology, National School of Tropical 11 Medicine, Baylor College of Medicine, Houston, TX, USA. 12 3. Department of Biology, Baylor University, Waco, Texas, USA. 13 14 #Corresponding Author: 15 Dr. Kamal Rawal 16 Jaypee Institute of Information Technology, A-10, Sector -62, NOIDA-201307, Uttar 17 Pradesh, India 18 Email address: [email protected] 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.27538v1 | CC BY 4.0 Open Access | rec: 15 Feb 2019, publ: 15 Feb 2019 35 ABSTRACT 36 The technology of docking molecules in-silico has evolved significantly in recent years and has 37 become a crucial component of the drug discovery tool process that includes virtual screening, 38 lead optimization, and side-effect predictions. To date over 43,000 abstracts/papers have been 39 published on docking, thereby highlighting the importance of this computational approach in the 40 context of drug development. Considering the large amount of genomic and proteomic consortia 41 active in the public domain, docking can exploit this data on a correspondingly ‘large scale’ to 42 address a variety of research questions. Over 160 robust and accurate molecular docking tools 43 based on different algorithms have been made available to users across the world. Further, 109 44 scoring functions have been reported in the literature till date. Despite these advancements, there 45 continue to be several bottlenecks during the implementation stage. These problems or issues 46 range from choosing the right docking algorithm, selecting a binding site in target proteins, 47 performance of the given docking tool, integration of molecular dynamics information, ligand- 48 induced conformational changes, use of solvent molecules, choice of docking pose, and choice of 49 databases. Further, so far, not always have experimental studies been used to validate the 50 docking results. In this review, basic features and key concepts of docking have been 51 highlighted, with particular emphasis on its applications such as drug repositioning and 52 prediction of side effects. Also, the use of docking in conjunction with wet lab experimentations 53 and epitope predictions has been summarized. Attempts have been made to systematically 54 address the above-mentioned challenges using expert-curation and text mining strategies. Our 55 work shows the use of machine-assisted literature mining to process and analyze huge amounts 56 of available information in a short time frame. With this work, we also propose to build a 57 platform that combines human expertise (deep curation) and machine learning in a collaborative 58 way and thus helps to solve ambitious problems (i.e. building fast, efficient docking systems by 59 combining the best tools or to perform large scale docking at human proteome level). 60 61 Website and other links: We have created web based forms and a website so that scientists, 62 developers and users of molecular docking tools can share their experiences and expertise to 63 build a comprehensive resource on molecular docking. In addition, the collected information 64 shall be used to update the molecular docking website and future versions of this manuscript. PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.27538v1 | CC BY 4.0 Open Access | rec: 15 Feb 2019, publ: 15 Feb 2019 65 The website(s) associated with this paper contain additional information in the form of tables and 66 figures. The information provided on the website(s) is updated on periodic basis. 67 A) https://tinyurl.com/sci-net2000 68 B) https://tinyurl.com/docking-tools 69 C) https://tinyurl.com/networks-docking 70 D) https://tinyurl.com/docking-review 71 72 Keywords: 73 Side effect prediction; adverse drug reactions prediction; drug repositioning; drug repurposing; 74 drug indication prediction, docking, tools, software, database, benchmarking, wet lab validations, 75 collaborative writing. 76 77 78 INTRODUCTION 79 A major challenge in the healthcare field is to devise a systematic strategy to integrate diverse 80 biological datasets to provide insight into disease, pathogenesis or discover new and safe 81 drugs/vaccines against complex diseases. The process encompasses a period of intense research, 82 typically involving a span of 10-15 years and a huge investment of sometimes more than $1 83 billion per product [Hughes et al. 2011]. Given the experimental difficulties of attaining 84 knowledge on the ligand-target interaction at the molecular level, numerous high performing 85 computational platforms and a wealth of structural data are now being increasingly used for 86 enhancing the efficiency and speed of the drug discovery process. As it has been said, substantial 87 progress has been witnessed in recent years for studying protein-ligand interactions over the 88 traditional paradigm. The computational technique known as “docking” has permeated all 89 aspects of the drug discovery process such as virtual screening, lead optimization, and side effect 90 predictions and essentially acts as a complementary tool to predict the structure of a specific 91 complex formed by two given interacting proteins. Docking holds a significant promise to screen 92 potential drugs as well as drug targets and elucidate biomolecular interactions. Its applications (at 93 larger scale) can be seen through public projects such as OpenZika (http://openzika.ufg.br/), 94 which involves the screening of potential compounds against the models of Zika protein 95 structures. The mechanistic approach of docking can also play a pivotal role in predicting PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.27538v1 | CC BY 4.0 Open Access | rec: 15 Feb 2019, publ: 15 Feb 2019 96 adverse drug reactions (ADRs) for early screening of hazardous drug molecules, which is 97 initiated by intended, on-target binding or promiscuous binding of drugs to an off-target protein. 98 Highly publicized examples of phase IV failures including rosiglitazone (“Avandia”) [Nissen et 99 al. 2010] and rofecoxib (“Vioxx”) [Karha et al. 2004] are indicative of the fact that the current 100 approach of the pharmaceutical industry involving the use of in vitro toxicity panels to assay 101 small molecule binding is inadequate [Blomme et al. 2015] and there exists a necessity to 102 explore docking technologies in order to develop safer medicines. Another field where docking 103 finds its application is drug repositioning in which already existing compounds can be 104 repurposed to new potential therapeutic targets. The technique has become progressively main- 105 stream in recent years and is believed to be of particular use in speeding up drug discovery by 106 inspecting new uses of existing, accepted drugs [Ekins et al. 2017]. This review thus provides 107 basic insights into the specific features and concepts of docking, with particular emphasis on 108 applications of docking in the field of side effect prediction and drug repositioning, so as to 109 develop a more rational and targeted therapy. We also discuss the role of software tools and 110 online web services and provide a critical analysis to compare their performance on benchmark 111 datasets along with the challenges of current docking models. To make this review 112 comprehensive and accurate, we used Perl and Python based text mining/machine learning 113 systems (developed in-house) to assist expert curators to analyse and curate a large number of 114 papers/abstracts [Kuhl et al. 1984]. Further, to keep this review updated and to build an 115 ambitious large-scale docking pipeline using the expertise of practitioners/users of molecular 116 docking and tool developers, we have initiated an international collaborative effort using 117 network sciences involving multiple organizations and researchers as co-authors of future 118 versions of this paper. This initiative based upon the principles of network sciences, is expected 119 to improve research quality, advance efficiency of the scientific production, and foster 120 breakthroughs in a shorter time. Here, we also discuss our ongoing collaborative efforts to 121 discover new vaccine targets using network sciences and the use of docking combined with 122 experimental techniques in the area of Chagas Disease. 123 124 125 126 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.27538v1 | CC BY 4.0 Open Access | rec: 15 Feb 2019, publ: 15 Feb 2019 127 2. BACKGROUND 128 2.1: The illustrious history of docking engines (algorithms) 129 Following the advent of docking algorithms in the 1980s [Billeter et al. 1987] along with the 130 advancement of techniques such as X-ray crystallography, nuclear magnetic resonance 131 spectroscopy and high-throughput protein purification, molecular docking has now become the 132 most commonly used method among the various rational approaches that are currently being 133 pursued for drug discovery and development [Lemmon et al. 2012]. Simulated docking processes 134 aim to predict the interaction of known structures (i.e. receptors,

An Extensive Survey of Molecular Docking Tools and Their Applications Using

GROMACS: Fast, Flexible, and Free

Identifying of Selective Agonists Targeting Lxrβ from Terpene Compounds of Alismatis Rhizoma

Synthesis and a Combined Simil

HP Dock Solutions

Design, Synthesis and Evaluation of Bitopic Arylpiperazine-Phthalimides As Selective Dopamine D3 Receptor Agonists

Energy Functions and Their Relationship to Molecular Conformation

Molecular Docking : Current Advances and Challenges

Recent Advances in Molecular Docking for the Research and Discovery of Potential Marine Drugs

Use of QSAR Global Models and Molecular Docking for Developing New

Computer?Aided Docking: an Invaluable Tool in Drug Discovery

Application of Various Molecular Modelling Methods in the Study of Estrogens and Xenoestrogens

Recent Advances in in Silico Target Fishing