Development and Application of Web-Based Open Source Drug Discovery Platforms Yuri Pevzner University of South Florida, Yuri [email protected]

Total Page:16

File Type:pdf, Size:1020Kb

Development and Application of Web-Based Open Source Drug Discovery Platforms Yuri Pevzner University of South Florida, Yuri Pevzner@Email.Com University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School 4-15-2015 Development and application of web-based open source drug discovery platforms Yuri Pevzner University of South Florida, [email protected] Follow this and additional works at: https://scholarcommons.usf.edu/etd Part of the Chemistry Commons Scholar Commons Citation Pevzner, Yuri, "Development and application of web-based open source drug discovery platforms" (2015). Graduate Theses and Dissertations. https://scholarcommons.usf.edu/etd/5550 This Dissertation is brought to you for free and open access by the Graduate School at Scholar Commons. It has been accepted for inclusion in Graduate Theses and Dissertations by an authorized administrator of Scholar Commons. For more information, please contact [email protected]. Development and Application of Web-Based Open Source Drug Discovery Platforms by Yuri Pevzner A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Chemistry with a concentration in Computational Chemistry Department of Chemistry College of Arts and Sciences University of South Florida Co-Major Professor: Henry Lee Woodcock, Ph.D. Co-Major Professor: Wayne C. Guida, Ph.D. Brian Space, Ph.D. Kenyon G. Daniel, Ph.D. Date of Approval: April 15, 2015 Keywords: CADD, Docking, Virtual Target Screening, CHARMMing Copyright © 2015, Yuri Pevzner DEDICATION I dedicate this work to my mother. Although many people have been instrumental in helping me in this achievement, she alone has seen me through the entire journey that has culminated in this work. From filling my childhood with love and happiness, to giving much needed guidance and encouragement to the developing individual, to having complete faith and confidence in every step I’ve taken since, she has been there. Knowingly or otherwise, she has instilled in me every trait that I value about myself, every trait that defines who I am. Mom, I want to thank you for being there through time and distance, thank you for making home the warm and welcoming place it has always been. Thank you for being my dearest friend. Happy birthday mom, this one is for you! ACKNOWLEDGMENTS I would like to thank my major advisors Dr. H. Lee Woodcock and Dr. Wayne C. Guida. Dr. Woodcock’s energy and infectious excitement about science was what had helped me stay productive and enthusiastic about my work throughout the long and arduous years of studies. Dr. Guida’s wisdom and valuable guidance will serve me well for many years, as in my eyes, he has always been an epitome of a scientist and a teacher. I would also like to thank Dr. Brian Space and Dr. Kenyon G. Daniel. Dr. Space for taking his time and patience to teach me the most important concepts of the physics of chemistry and giving me the tools and the confidence needed to navigate the waters of science. Dr. Daniel for instilling into me the appreciation for biology and, most importantly, for having a much appreciated faith in my scientific abilities and decisions. I would also like to thank the University of South Florida department of chemistry. The faculty and students that have helped create fun and stimulating environment. The administrative staff, in particular Ms. Adrienne McCain, for helping me navigate the administrative requirements, staying on track with the program and, finally, graduating. TABLE OF CONTENTS List of Tables ................................................................................................................................. iii List of Figures ................................................................................................................................ iv Abstract .......................................................................................................................................... vi Chapter One: Introduction ...............................................................................................................1 The Need for Web-Based Open Source Drug Discovery Platforms....................................1 Challenges of Open Source Projects ....................................................................................4 List of References ................................................................................................................7 Chapter Two: Development of the CHARMMing Web User Interface as a Platform for Computer-Aided Drug Design ....................................................................................................9 Note to Reader .....................................................................................................................9 CHARMM Interface and Graphics Platform .......................................................................9 Fragment-Based Docking Protocol in CHARMMing .......................................................11 Implementation Details ..........................................................................................11 Target Preparation ......................................................................................11 Compound Library and Ligand Upload .....................................................11 Binding Site Definition ..............................................................................12 Docking Protocol .......................................................................................15 Job Submission and Monitoring ................................................................19 Performance and Local Execution .............................................................19 Results and Discussion ..........................................................................................22 Conclusion .............................................................................................................26 Acknowledgments..................................................................................................27 List of References ..............................................................................................................28 Chapter Three: Development and Application of the Virtual Target Screening Protocol .............33 Development of the Virtual Target Screening Server ........................................................33 Methodology Background .....................................................................................33 VTS Protocol .........................................................................................................34 VTS Web-Based Interface .....................................................................................35 Application of VTS to Identify Protein Targets of Natural Products in Drug Discovery .38 Natural Products and VTS .....................................................................................38 Materials and Methods ...........................................................................................39 Virtual Compound Libraries ......................................................................39 Ligand Preparation .....................................................................................40 i Virtual Target Screening ............................................................................41 Hardware ....................................................................................................41 Graphics and Modeling ..............................................................................42 Results ....................................................................................................................42 VTS on Natural Products ...........................................................................42 VTS on CDDI Natural Products ................................................................45 Discussion ..............................................................................................................48 Conclusion .............................................................................................................53 Using Molecular Modeling, Chemo- and Bioinformatics to Search for Biomolecular Targets of Vitamin E δ-tocotrienol ..............................................................................53 Introduction ............................................................................................................53 Methods..................................................................................................................57 VEDT Preparation .....................................................................................57 Virtual Target Screening ............................................................................57 PharmMapper .............................................................................................57 PASS (Prediction of Activity Spectra of Substances) ...............................58 ProBiS (Protein Binding Site Detection) ...................................................59 Results ....................................................................................................................59 Discussion ..............................................................................................................64 Consensus between Computational Studies...............................................64 Consistency with Experimental Data .........................................................65 List of References ..............................................................................................................66
Recommended publications
  • Structure-Based Function Prediction of Uncharacterized Protein Using Binding Sites Comparison
    Structure-Based Function Prediction of Uncharacterized Protein Using Binding Sites Comparison Janez Konc1, Milan Hodosˇcˇek1, Mitja Ogrizek1, Joanna Trykowska Konc1, Dusˇanka Janezˇicˇ1,2* 1 National Institute of Chemistry, Ljubljana, Slovenia, 2 University of Primorska, Faculty of Mathematics, Natural Sciences and Information Technologies, Koper, Slovenia Abstract A challenge in structural genomics is prediction of the function of uncharacterized proteins. When proteins cannot be related to other proteins of known activity, identification of function based on sequence or structural homology is impossible and in such cases it would be useful to assess structurally conserved binding sites in connection with the protein’s function. In this paper, we propose the function of a protein of unknown activity, the Tm1631 protein from Thermotoga maritima, by comparing its predicted binding site to a library containing thousands of candidate structures. The comparison revealed numerous similarities with nucleotide binding sites including specifically, a DNA-binding site of endonuclease IV. We constructed a model of this Tm1631 protein with a DNA-ligand from the newly found similar binding site using ProBiS, and validated this model by molecular dynamics. The interactions predicted by the Tm1631-DNA model corresponded to those known to be important in endonuclease IV-DNA complex model and the corresponding binding free energies, calculated from these models were in close agreement. We thus propose that Tm1631 is a DNA binding enzyme with endonuclease activity that recognizes DNA lesions in which at least two consecutive nucleotides are unpaired. Our approach is general, and can be applied to any protein of unknown function. It might also be useful to guide experimental determination of function of uncharacterized proteins.
    [Show full text]
  • An Improved Branch and Bound Algorithm for the Maximum Clique Problem Janez Konc and Duˇsanka Janeˇziˇc∗
    MATCH MATCH Commun. Math. Comput. Chem. 58 (2007) 569-590 Communications in Mathematical and in Computer Chemistry ISSN 0340 - 6253 An improved branch and bound algorithm for the maximum clique problem Janez Konc and Duˇsanka Janeˇziˇc∗ National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia (Received June 21, 2007) Abstract A new algorithm for finding a maximum clique in an undirected graph is described. An approximate coloring algorithm has been improved and used to provide bounds to the size of the maximum clique in a basic algorithm which finds a maximum clique. This basic algorithm was then extended to include dynamically varying bounds. The resulting algorithm is significantly faster than the comparable algorithm. ∗ Author to whom correspondence should be addressed email: [email protected] telephone: 00386-14760200 fax: 00386-14760300 - 570 - 1 Introduction A clique is a subset S of vertices in a graph G such that each pair of vertices in S is connected by an edge. The maximum clique problem is the problem of finding in a given graph the clique with the largest number of vertices. The algorithms for finding a maximum clique are frequently used in chem- ical information, bioinformatics and computational biology applications [1], where their main application is to search for similarity between molecules. These algorithms are used for screening databases of compounds to filter out molecules that are similar to known biologically active molecules and are feasible to be active themselves [2]. Also, these algorithms are used for comparing protein structures, to provide the information about protein function [3] and also the information about possible interactions between proteins [4, 5].
    [Show full text]
  • Propedia: a Database for Protein–Peptide Identification Based on A
    Martins et al. BMC Bioinformatics (2021) 22:1 https://doi.org/10.1186/s12859-020-03881-z DATABASE Open Access Propedia: a database for protein–peptide identifcation based on a hybrid clustering algorithm Pedro M. Martins1 , Lucianna H. Santos1, Diego Mariano1, Felippe C. Queiroz3, Luana L. Bastos1, Isabela de S. Gomes3, Pedro H. C. Fischer5, Rafael E. O. Rocha1, Sabrina A. Silveira3, Leonardo H. F. de Lima5, Mariana T. Q. de Magalhães2, Maria G. A. Oliveira4 and Raquel C. de Melo‑Minardi1* *Correspondence: [email protected] Abstract 1 Laboratory of Bioinformatics Background: Protein–peptide interactions play a fundamental role in a wide variety and Systems (LBS), Department of Computer of biological processes, such as cell signaling, regulatory networks, immune responses, Science, Universidade and enzyme inhibition. Peptides are characterized by low toxicity and small interface Federal de Minas Gerais, Av areas; therefore, they are good targets for therapeutic strategies, rational drug plan‑ Pres. Antônio Carlos, Belo Horizonte, MG 31720‑901, ning and protein inhibition. Approximately 10% of the ethical pharmaceutical market Brazil is protein/peptide‑based. Furthermore, it is estimated that 40% of protein interactions Full list of author information are mediated by peptides. Despite the fast increase in the volume of biological data, is available at the end of the article particularly on sequences and structures, there remains a lack of broad and compre‑ hensive protein–peptide databases and tools that allow the retrieval, characterization and understanding of protein–peptide recognition and consequently support peptide design. Results: We introduce Propedia, a comprehensive and up‑to‑date database with a web interface that permits clustering, searching and visualizing of protein–peptide complexes according to varied criteria.
    [Show full text]
  • Rna 3D Motifs: Identification, Clustering, and Analysis
    RNA 3D MOTIFS: IDENTIFICATION, CLUSTERING, AND ANALYSIS ANTON I. PETROV A Dissertation Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2012 Committee: Neocles Leontis, Advisor Craig L. Zirbel Graduate Faculty Representative Paul Morris Scott Rogers Ray Larsen i ABSTRACT Neocles B. Leontis, Advisor Many hairpin and internal RNA 3D motif structures are recurrent, occurring in various types of RNA molecules, not necessarily homologs. Although usually drawn as single-strand “loops” in RNA 2D diagrams, recurrent motifs share a common 3D structure, but can vary in sequence. It is essential to understand the sequence variability of RNA 3D motifs in order to advance the RNA 2D and 3D structure prediction and ncRNA discovery methods, to interpret mutations that affect ncRNAs, and to guide experimental functional studies. The dissertation is organized into two parts as follows. First, the development of a new online resource called RNA 3D Hub is described, which is intended to provide a useful resource for structure modeling and prediction. It houses non-redundant sets of RNA-containing 3D structures, RNA 3D motifs extracted from all RNA 3D structures, and the RNA 3D Motif Atlas, a representative collection of RNA 3D motifs. Unique and stable ids are assigned to all non- redundant equivalence classes of structure files, to all motifs, and to all motif instances. RNA 3D Hub is updated automatically on a regular schedule and is available at http://rna.bgsu.edu/rna3dhub. In the second part of the dissertation, the development of WebFR3D (http://rna.bgsu.edu/webfr3d), a new webserver for finding and aligning RNA 3D motifs, is described and its use in a biologically relevant context is then illustrated using two RNA 3D motifs.
    [Show full text]