A Glance Into the Evolution of Template-Free Protein Structure Prediction Methodologies arXiv:2002.06616v2 [q-bio.QM] 24 Apr 2
Recommended publications
-
Alberto Cisneros, Amanda M. Duran, Jessica A. Finn, Darwin
This is an open access article published under an ACS AuthorChoice License, which permits copying and redistribution of the article or any adaptations for non-commercial purposes. Current Topic pubs.acs.org/biochemistry Protocols for Molecular Modeling with Rosetta3 and RosettaScripts Brian J. Bender,†,‡ Alberto Cisneros, III,‡,§ Amanda M. Duran,‡,∥ Jessica A. Finn,‡,⊥ Darwin Fu,‡,∥ Alyssa D. Lokits,‡,# Benjamin K. Mueller,‡,∥ Amandeep K. Sangha,‡,∥ Marion F. Sauer,‡,§ Alexander M. Sevy,‡,§ Gregory Sliwoski,‡,∥ Jonathan H. Sheehan,‡ Frank DiMaio,@ Jens Meiler,†,‡,§,∥,⊥,# and Rocco Moretti*,‡,∥ † Department of Pharmacology, Vanderbilt University, Nashville, Tennessee 37232-6600, United States ‡ Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States § Chemical and Physical Biology Program, Vanderbilt University, Nashville, Tennessee 37232-0301, United States ∥ Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States ⊥ Department of Pathology, Microbiology and Immunology, Vanderbilt University, Nashville, Tennessee 37232-2561, United States # Neuroscience Program, Vanderbilt University, Nashville, Tennessee 37235, United States @ Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States Supporting Information ABSTRACT: Previously, we published an article providing an overview of the Rosetta suite of biomacromolecular modeling software and a series of step-by-step tutorials [Kaufmann, K. W., et al. (2010) Biochemistry 49, 2987−2998]. The overwhelmingly positive response we received to this publication motivates us to share here the next iteration of these tutorials, which feature de novo folding, comparative modeling, loop construction, protein docking, small molecule docking, and protein design. 
This updated and expanded set of tutorials is needed, as since 2010 Rosetta has been fully redesigned into an object-oriented protein modeling program Rosetta3. -
The Architecture of the Protein Domain Universe
The architecture of the protein domain universe Nikolay V. Dokholyan Department of Biochemistry and Biophysics, The University of North Carolina at Chapel Hill, School of Medicine, Chapel Hill, NC 27599 ABSTRACT Understanding the design of the universe of protein structures may provide insights into protein evolution. We study the architecture of the protein domain universe, which has been found to possess peculiar scale-free properties (Dokholyan et al., Proc. Natl. Acad. Sci. USA 99: 14132-14136 (2002)). We examine the origin of these scale-free properties of the graph of protein domain structures (PDUG) and determine that the PDUG is not modular, i.e. it does not consist of modules with uniform properties. Instead, we find the PDUG to be self-similar at all scales. We further characterize the PDUG architecture by studying the properties of the hub nodes that are responsible for the scale-free connectivity of the PDUG. We introduce a measure of the betweenness centrality of protein domains in the PDUG and find a power-law distribution of the betweenness centrality values. The scale-free distribution of hubs in the protein universe suggests that a set of specific statistical mechanics models, such as the self-organized criticality model, can potentially identify the principal driving forces of molecular evolution. We also find a gatekeeper protein domain, removal of which partitions the largest cluster into two large subclusters. We suggest that the loss of such gatekeeper protein domains in the course of evolution is responsible for the creation of new fold families. INTRODUCTION The principles of molecular evolution remain elusive despite fundamental breakthroughs on the theoretical front 1-5 and a growing amount of genomic and proteomic data, over 23,000 solved protein structures 6 and protein functional annotations 7-9. -
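To make the betweenness idea in the abstract above concrete, here is a minimal sketch. It assumes the standard shortest-path definition of betweenness centrality, computed with Brandes' algorithm on an unweighted, undirected graph; the excerpt does not say which exact measure the paper uses, so this may differ from it. The toy graph and its node names ("a" to "g") are hypothetical and only mimic two clusters joined by a single gatekeeper-like node.

```python
from collections import deque


def betweenness_centrality(adjacency):
    """Shortest-path betweenness for an unweighted, undirected graph.

    `adjacency` maps each node to an iterable of its neighbours.
    Uses Brandes' algorithm: one BFS per source node, followed by a
    back-propagation of pair dependencies.
    """
    centrality = {node: 0.0 for node in adjacency}
    for source in adjacency:
        visit_order = []  # nodes in order of non-decreasing distance
        predecessors = {node: [] for node in adjacency}
        num_paths = {node: 0 for node in adjacency}
        distance = {node: -1 for node in adjacency}
        num_paths[source] = 1
        distance[source] = 0
        queue = deque([source])
        while queue:
            v = queue.popleft()
            visit_order.append(v)
            for w in adjacency[v]:
                if distance[w] < 0:  # first time w is reached
                    distance[w] = distance[v] + 1
                    queue.append(w)
                if distance[w] == distance[v] + 1:  # v lies on a shortest path to w
                    num_paths[w] += num_paths[v]
                    predecessors[w].append(v)
        dependency = {node: 0.0 for node in adjacency}
        while visit_order:
            w = visit_order.pop()  # farthest nodes first
            for v in predecessors[w]:
                share = num_paths[v] / num_paths[w]
                dependency[v] += share * (1.0 + dependency[w])
            if w != source:
                centrality[w] += dependency[w]
    # Each unordered pair was counted from both endpoints.
    return {node: value / 2.0 for node, value in centrality.items()}


# Hypothetical toy graph: two triangles joined only through node "g".
toy_graph = {
    "a": ["b", "c"],
    "b": ["a", "c"],
    "c": ["a", "b", "g"],
    "g": ["c", "d"],
    "d": ["g", "e", "f"],
    "e": ["d", "f"],
    "f": ["d", "e"],
}

scores = betweenness_centrality(toy_graph)
gatekeeper = max(scores, key=scores.get)
print(gatekeeper, scores[gatekeeper])
```

In this toy graph every shortest path between the two triangles passes through "g", so it receives the highest score, and deleting it splits the graph into two pieces, which is the kind of role the abstract attributes to a gatekeeper domain.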
Jonathan Sheehan
Jonathan H. Sheehan, Ph.D. Director, Program in Personalized Structural Biology / Precision Medicine Research Associate Professor Phone (W): 615-936-2516 Center for Structural Biology and Dept. of Biochemistry Phone (H): 615-673-9457 Vanderbilt University PMB 407917 Fax: 615-936-2211 5137 BIOSCI/MRB III [email protected] 465 21st Ave. South http://structbio.vanderbilt.edu/~sheehajh/ Nashville, TN 37232-8725 DOB: 1/29/1966, New Haven, CT Education 1984 - 1988 Harvard College (Cambridge, MA), A.B. with honors, Classics Summer 1987 Aegean Institute (Galatas, Greece) with honors in Greek and classical drama 2000 - 2006 Ph.D. Biochemistry, Vanderbilt University. Mentor: Walter Chazin Dissertation Title: Probing functional diversity of EF-hand calcium-binding proteins through mutant design and structural analysis 2006 - 2008 Postdoctoral Training: Vanderbilt University. Mentor: Jens Meiler Academic Appointments 1990 - 1995 Research Asst. in Jim Forman's lab, Immunology at UTSW Med. Ctr., Dallas, TX 1995 - 1996 Research Asst. in the Embryonic Stem Cell Core Lab at VUMC 1996 - 1997 Research Asst. in Mark Magnuson's lab, Molecular Physiol. and Biophys., VUMC 1997 - 2000 Manager of Dave Piston's Cell Imaging Core Lab at VUMC 2009 - 2011 Research Instructor, Dept. of Biochemistry and V.U. Center for Structural Biology 2011 - 2016 Research Assistant Professor, Dept. of Biochemistry and Ctr. for Structural Biology 2016 - present Research Associate Professor, Dept. of Biochemistry and Ctr. for Structural Biology 2016 - present Program Director in Personalized Structural Biology / Precision Medicine Employment (other than academic appointments) 1989 - 1990 Peace Corps Volunteer, building potable water systems in Cañar, Ecuador Teaching June 2007 Vanderbilt University Center for Structural Biology Workshop: Molecular Visualization Tools and Analysis (with Eric Dawson) Aug. -
2011 Annual Report of the Georgia Research Alliance Not Every Day Brings a Major Win, but in 2011, Many Did
2011 Annual Report of the Georgia Research Alliance. Not every day brings a major win, but in 2011, many did. An advance in science, a venture investment, the growth of a company – these kinds of developments, all connected in some way to the catalytic efforts of the Georgia Research Alliance, signaled that Georgia’s technology-rich economy is alive and growing. And that’s good news. In GRA, Georgia has a game plan for building the science- and technology-driven economy of tomorrow. And in 2011, that game plan was strengthened when newly elected Governor Nathan Deal moved to align more closely several of Georgia’s greatest assets for creating, expanding and attracting companies that foster high-wage jobs. Beginning in July, GRA partnered with the Georgia Department of Economic Development to determine how the state’s seven Centers of Innovation, which help grow strategic industries, can maximize their potential. And the Georgia Cancer Coalition, the state’s signature initiative for cancer research and care, was moved under the GRA umbrella. These and other 2011 events and activities represent a foundation for future advances, investments and economic growth. As such, they are a reminder that it pays to make every day count. Competition among states for economic development and job creation is as intense as ever. Every advantage matters. GRA bolsters Georgia’s competitive advantage by expanding university R&D capacity and adding the fuel needed to launch new enterprises. The events of 2011 | January – February. Jan 1: Expert in autism named GRA Eminent Scholar. Jan 11: Collaboration aims to develop new treatment for ovarian cancer. Georgia Tech researchers have proposed a filtration system that could potentially remove free-floating cancer cells that cause secondary tumors. (Meg Buscema; Billy Howard) Jan 4 -
For Ligand-Binding Site Prediction and Functional Annotation
A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation Michal Brylinski and Jeffrey Skolnick* Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 250 14th Street NW, Atlanta, GA 30318 Edited by Harold A. Scheraga, Cornell University, Ithaca, NY, and approved November 19, 2007 (received for review August 15, 2007) The detection of ligand-binding sites is often the starting point for protein function identification and drug discovery. Because of the inaccuracies in predicted protein structures, extant binding pocket-detection methods are limited to experimentally solved structures. Here, FINDSITE, a method for ligand-binding site prediction and functional annotation based on binding-site similarity across groups of weakly homologous template structures identified from threading, is described. For crystal structures, considering a cutoff distance of 4 Å as the hit criterion, the success rate is 70.9% for identifying … Connolly surface (21) and then reranks the identified pockets by degree of conservation of identified surface residues. A systematic analysis of known protein structures grouped according to SCOP (22) reveals a general tendency of certain protein folds to bind substrates at a similar location, suggesting that analogous or very distantly homologous proteins can have common binding sites (11). If so, it should be possible to develop an approach for ligand-binding site identification that is less sensitive than pocket-detection methods to distortions in the -
Daisuke Kihara, Ph.D. Professor of Biological Sciences and Computer Science Purdue University
Updated on June 21, 2020 Daisuke Kihara, Ph.D. Professor of Biological Sciences and Computer Science Purdue University Contact Information Purdue University Department of Biological Sciences and Computer Science West Lafayette, IN, 47907 Office: Hockmyer 229 Tel: (765) 496-2284 E-mail: [email protected] https://www.bio.purdue.edu/People/faculty_dm/directory.php?refID=166 https://www.cs.purdue.edu/people/faculty/dkihara/ http://kiharalab.org (Lab) Education 1999 Ph.D. (Science) in Bioinformatics Kyoto University, Faculty of Science, Japan, Advisor: Minoru Kanehisa 1996 M.S. in Bioinformatics Kyoto University, Faculty of Science, Japan 1994 B.S. in Interdisciplinary Science The University of Tokyo, College of Arts and Sciences, Japan Positions held 2019.3-present Full member, Purdue University Center for Cancer Research 2014.8-present Full Professor 2018.3-present Adjunct Professor, University of Cincinnati, Department of Pediatrics 2015.1-2015.8 Visiting Scientist, Eli Lilly, Indianapolis 2009.8-2014.8 Associate professor 2003.8-2009.8 tenure-track Assistant professor Purdue University, West Lafayette, Indiana Department of Biological Sciences/Computer Sciences (joint appointment) 2002.9-2003.7 Senior Postdoctoral Research Associate Advisor: Jeffrey Skolnick Buffalo Center of Excellence in Bioinformatics, Buffalo, NY, USA 1999-2002.9 Postdoctoral Research Associate Advisor: Jeffrey Skolnick Donald Danforth Plant Science Center, St. Louis, MO, USA 1998-1999 Research Assistant Advisor: Minoru Kanehisa Bioinformatics Center, Institute for -
Cheminfo - Qualitative Analysis of Machine Learning Models for Activation of HSD Involved in Alzheimer’S Disease
Bcl::ChemInfo - Qualitative Analysis Of Machine Learning Models For Activation Of HSD Involved In Alzheimer’s Disease Mariusz Butkiewicz, Edward W. Lowe, Jr., Jens Meiler Abstract— In this case study, a ligand-based virtual high throughput screening suite, bcl::ChemInfo, was applied to screen for activation of the protein target 17-beta hydroxysteroid dehydrogenase type 10 (HSD) involved in Alzheimer’s Disease. bcl::ChemInfo implements a diverse set of machine learning techniques such as artificial neural networks (ANN), support vector machines (SVM) with the extension for regression, kappa nearest neighbor (KNN), and decision trees (DT). Molecular structures were converted into a distinct collection of descriptor groups involving 2D- and 3D-autocorrelation, and radial distribution functions. A confirmatory high-throughput screening data set contained over 72,000 experimentally validated compounds, available through PubChem. Here, the systematical model development was achieved through optimization of feature sets and algorithmic parameters resulting in a theoretical enrichment of … type 10 (HSD) has been found in elevated concentrations in the hippocampi of Alzheimer’s disease patients. HSD may play a role in the degradation of neuroprotective agents. The inhibition of HSD has been indicated as a possible means of treating Alzheimer's disease. Dysfunctions in human 17 beta-hydroxysteroid dehydrogenases result in disorders of biology of reproduction and neuronal diseases; the enzymes are also involved in the pathogenesis of various cancers. HSD has a high affinity for amyloid proteins. Thus, it has been proposed that HSD may contribute to the amyloid plaques found in Alzheimer's patients [2]. Furthermore, HSD degrades neuroprotective agents like allopregnanolone, which may lead to memory loss. Therefore, it has been postulated that inhibition of HSD may help lessen the symptoms associated with Alzheimer's. -
Cmse 520 Biomolecular Structure, Function And
CMSE 520 BIOMOLECULAR STRUCTURE, FUNCTION AND DYNAMICS (Computational Structural Biology) OUTLINE Review: Molecular biology. Proteins: structure, conformation and function (5 lectures). Generalized coordinates, phi and psi angles. DNA/RNA: structure and function (3 lectures). Structural and functional databases (PDB, SCOP, CATH, functional domain database, Gene Ontology). Use scripting languages (e.g. Python) to cross-reference between these databases: starting from sequence to find the function. Relationship between sequence, structure and function. Molecular modeling, homology modeling. Conservation, CONSURF. Relationship between function and dynamics. Conformational changes in proteins (structural changes due to ligation, hinge motions, allosteric changes in proteins and consecutive function change). Molecular dynamics. Monte Carlo. Protein-protein interaction: recognition, structural matching, docking. PPI databases: DIP, BIND, MINT, etc. References: CURRENT PROTOCOLS IN BIOINFORMATICS (e-book) (http://www.mrw.interscience.wiley.com/cp/cpbi/articles/bi0101/frame.html) Andreas D. Baxevanis, Daniel B. Davison, Roderic D.M. Page, Gregory A. Petsko, Lincoln D. Stein, and Gary D. Stormo (eds.) 2003 John Wiley & Sons, Inc. INTRODUCTION TO PROTEIN STRUCTURE Branden C & Tooze, 2nd ed. 1999, Garland Publishing COMPUTER SIMULATION OF BIOMOLECULAR SYSTEMS van Gunsteren, Weiner, Wilkinson Internet sources Ref: Department of Energy Rapid growth in experimental technologies Human Genome Projects Two major goals 1. DNA mapping 2. DNA sequencing Rapid growth in experimental technologies • Microarray technologies – serial gene expression patterns and mutations • Time-resolved optical, rapid mixing techniques – folding & function mechanisms (→ ns) • Techniques for probing single molecule mechanics (AFM, STM) (→ pN) → more accurate models/data for computer-aided studies Weiss, S. (1999). Fluorescence spectroscopy of single molecules. -
CASP)-Round V
PROTEINS: Structure, Function, and Genetics 53:334–339 (2003) Critical Assessment of Methods of Protein Structure Prediction (CASP)-Round V John Moult,1 Krzysztof Fidelis,2 Adam Zemla,2 and Tim Hubbard3 1Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, Rockville, Maryland 2Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, Livermore, California 3Sanger Institute, Wellcome Trust Genome Campus, Cambridgeshire, United Kingdom ABSTRACT This article provides an introduction to the special issue of the journal Proteins dedicated to the fifth CASP experiment to assess the state of the art in protein structure prediction. The article describes the conduct, the categories of prediction, and the evaluation and assessment procedures of the experiment. A brief summary of progress over the five CASP experiments is provided. Related developments in the field are also described. Proteins 2003;53:334–339. © 2003 Wiley-Liss, Inc. Key words: protein structure prediction; communitywide experiment; CASP … The role and importance of automated servers in the structure prediction field continue to grow. Another main section of the issue deals with this topic. The first of these articles describes the CAFASP3 experiment. The goal of CAFASP is to assess the state of the art in automatic methods of structure prediction.16 Whereas CASP allows any combination of computational and human methods, CAFASP captures predictions directly from fully automatic servers. CAFASP makes use of the CASP target distribution and prediction collection infrastructure, but is otherwise independent. The results of the CAFASP3 experiment were also evaluated by the CASP assessors, providing a comparison of fully automatic and hybrid methods. -
Biocreative II.5 Workshop 2009 Special Session on Digital Annotations
BioCreative II.5 Workshop 2009 Special Session on Digital Annotations. irf4: The purified IRF-4 was also shown to be capable of binding DNA in a PU.1-dependent manner by electrophoretic mobility shift analysis. brca2: The main role of BRCA2 appears to involve regulating the function of RAD51 in the repair by homologous recombination. olig2: We found that cells expressing Olig2, Nkx2.2, and NG2 were enriched among virus-infected, GFP-positive (GFP+) cells. carma1: Moreover, expression of Carma1 induces phosphorylation of Bcl10 and activation of the transcription factor NF-kappaB. vhl: The region of VHL mediating interaction with HIF-1 alpha overlapped with a putative macromolecular binding site within the crystal structure. rab5: The Rab5 effector Rabaptin-5 and its isoform Rabaptin-5delta differ in their ability to interact with the small GTPase Rab4. tfe3: Translocation RCC, bearing TFE3 or TFEB gene fusions, are recently recognized entities for which risk factors have not been identified. erbb2: We show that ERBB2-dependent medulloblastoma cell invasion and prometastatic gene expression can be blocked using the ERBB tyrosine kinase inhibitor OSI-774. atf1: Both ATF-1 homodimers and ATF-1/CREB heterodimers bind to the CRE but not to the related phorbol ester response element. BioCreative: Critical Assessment of Information Extraction in Biology. October 7th - 9th, 2009 www.BioCreative.org BioCreative II.5 Workshop 2009 special session | Digital Annotations Auditorium of the Spanish National -
BIOINFORMATICS doi:10.1093/bioinformatics/btq499
Vol. 26 ECCB 2010, pages i409–i411 BIOINFORMATICS doi:10.1093/bioinformatics/btq499 ECCB 2010 Organization. CONFERENCE CHAIR: Yves Moreau, Katholieke Universiteit Leuven, Belgium. PROCEEDINGS CHAIR: Jaap Heringa, Free University of Amsterdam, The Netherlands. LOCAL ORGANIZING COMMITTEE: Yves Moreau, Katholieke Universiteit Leuven, Belgium; Jaap Heringa, Free University of Amsterdam, The Netherlands; Gert Vriend, Radboud University, Nijmegen, The Netherlands; Yves Van de Peer, University of Ghent & VIB, Belgium; Kathleen Marchal, Katholieke Universiteit Leuven, Belgium; Jacques van Helden, Université Libre de Bruxelles, Belgium; Louis Wehenkel, Université de Liège, Belgium; Antoine van Kampen, University of Amsterdam & Netherlands Bioinformatics Center (NBIC); Peter van der Spek, Erasmus MC, Rotterdam, The Netherlands. STEERING COMMITTEE: Michal Linial (Chair), Hebrew University, Jerusalem, Israel. B. Comparative Genomics, Phylogeny, and Evolution: Martijn Huynen, Radboud University Nijmegen Medical Centre, The Netherlands; Yves Van de Peer, Ghent University & VIB, Belgium. C. Protein and Nucleotide Structure: Anna Tramontano, University of Rome ‘La Sapienza’, Italy; Jan Gorodkin, University of Copenhagen, Denmark. D. Annotation and Prediction of Molecular Function: Nir Ben-Tal, Tel-Aviv University, Israel; Fritz Roth, Harvard Medical School, USA. E. Gene Regulation and Transcriptomics: Jaak Vilo, University of Tartu, Estonia; Zohar Yakhini, Agilent Laboratories, Tel-Aviv & the Technion, Haifa, Israel. F. Text Mining, Ontologies, and Databases: Alfonso Valencia, National -
Benchmarking the POEM@HOME Network for Protein Structure
3rd International Workshop on Science Gateways for Life Sciences (IWSG 2011), 8-10 JUNE 2011 Benchmarking the POEM@HOME Network for Protein Structure Prediction Timo Strunk1, Priya Anand1, Martin Brieg2, Moritz Wolf1, Konstantin Klenin2, Irene Meliciani1, Frank Tristram1, Ivan Kondov2 and Wolfgang Wenzel1,* 1Institute of Nanotechnology, Karlsruhe Institute of Technology, PO Box 3640, 76021 Karlsruhe, Germany. 2Steinbuch Centre for Computing, Karlsruhe Institute of Technology, PO Box 3640, 76021 Karlsruhe, Germany ABSTRACT Motivation: Structure based methods for drug design offer great potential for in-silico discovery of novel drugs but require accurate models of the target protein. Because many proteins, in particular transmembrane proteins, are difficult to characterize experimentally, methods of protein structure prediction are required to close the gap between sequence and structure information. Established methods for protein structure prediction work well only for targets of high homology to known proteins, while biophysics based simulation methods are restricted to small systems and require enormous computational resources. … forcefields (Fitzgerald, et al., 2007). Knowledge-based potentials, in contrast, perform very well in differentiating native from non-native protein structures (Wang, et al., 2004; Zhou, et al., 2007; Zhou, et al., 2006) and have recently made inroads into the area of protein folding. Physics-based models retain the appeal of high transferability, but the present lack of truly transferable potentials calls for the development of novel forcefields for protein structure prediction and modeling (Schug, et al., 2006; Verma, et al., 2007; Verma and Wenzel, 2009). We have earlier reported the rational development of transferable free energy forcefields PFF01/02 (Schug, et al., 2005; Verma and Wenzel, 2009) that correctly predict the native conformation of