Casp9 Abstract Book

Total Page:16

File Type:pdf, Size:1020Kb

Casp9 Abstract Book CASP9 ABSTRACT BOOK Critical Assessment of Techniques for Protein Structure Prediction Ninth Meeting PACIFIC GROVE, CALIFORNIA, USA DECEMBER 5-9, 2010 TABLE OF CONTENT 3D-JIGSAW-4.0 .................................................................................................................... 15 3D-JIGSAW-4.5 .................................................................................................................... 15 3D-JIGSAW-4.0 & 3D-JIGSAW-4.5 ....................................................................................................................... 15 3DLIGANDSITE1-4 ................................................................................................................ 17 Using 3DLigandSite to making binding site predictions in CASP9 ....................................................................... 17 3SP-TSAILAB ........................................................................................................................ 19 A side-chain centric method for template-based structure prediction ............................................................... 19 4_BODY_POTENTIALS .......................................................................................................... 21 CASP9: Four Body Potentials for the Prediction of Protein Structure .................................................................. 21 ALADEGAP .......................................................................................................................... 23 Improvement of the Quality of Model Structures by Improving the Template-Target Alignments .................... 23 AOBA .................................................................................................................................. 25 Quality Assessment by Structural Consensus and Statistical Scoring Functions and Modeling by Hybridization of Server Models ................................................................................................................................................. 25 ATOME2_CBS ...................................................................................................................... 27 BAKER ................................................................................................................................. 29 Modeling of Protein Structures Using Rosetta in CASP9 ..................................................................................... 29 BALTYMUS .......................................................................................................................... 31 Quality assesment of single protein structure models using geometrical and statistical techniques ................. 31 BHAGEERATH ...................................................................................................................... 32 Bhageerath: an energy based web-enabled computer software suite for predicting the tertiary structures of soluble proteins ................................................................................................................................................... 32 BHAGEERATH_SCFBIO ......................................................................................................... 34 Bhageerath-H: An ab-initio, homology combined hybrid model for protein tertiary structure prediction ......... 34 BILAB-ENABLE ..................................................................................................................... 36 BILAB-SOLO ......................................................................................................................... 36 BILAB .................................................................................................................................. 36 2 Tertiary Structure Prediction by Combination of Fold Recognition, Realignment, Fragment Assembly and Consensus-based Model Quality Prediction ........................................................................................................ 36 BIO_ICM .............................................................................................................................. 38 Protein structure modeling using 3D-Jury and pyROSETTA ................................................................................ 38 BIOMINE ............................................................................................................................. 40 Prediction of Disorder Regions by Multilayer Information Fusion ...................................................................... 40 BIOSERF .............................................................................................................................. 42 Server-based de novo and fold recognition predictions using BioSerf ................................................................ 42 BUJNICKI-KOLINSKI .............................................................................................................. 43 Protein structure prediction by CABS and TRACER with restraints derived from MQAP-scored models. ........... 43 CBRC_POODLE ..................................................................................................................... 45 POODLE-I: Prediction of disordered region by integrating .................................................................................. 45 POODLE series based on workflow approach ..................................................................................................... 45 CHICKEN_GEORGE ............................................................................................................... 47 Protein structure prediction with SimFold in CASP9............................................................................................ 47 CHUO-FAMS ........................................................................................................................ 49 Construction of the Function for Protein Structure Prediction and the Homology Modeling System ................. 49 CHUNK-TASSER ................................................................................................................... 52 Chunk-TASSER server for protein structure prediction in CASP9 ......................................................................... 52 CIRCLE ................................................................................................................................. 54 Template based modeling server with Model Quality Assessment Program circle ............................................ 54 CNIO ................................................................................................................................... 55 Using predicted contacts to select model structures .......................................................................................... 55 CONFUZZ ............................................................................................................................. 57 ConFuzz residue-residue proximity prediction metaserver ................................................................................. 57 CONQUASS .......................................................................................................................... 59 ConQuass: using evolutionary conservation for quality assessment of protein model structures ...................... 59 CPU_HSFANG ...................................................................................................................... 61 Identification of native-like protein structures among sets of decoys employing a novel average measures approach ............................................................................................................................................................. 61 3 DCLAB ................................................................................................................................. 63 Combining Spectral Analysis with Motif Search and Homology Modeling for Protein Structure Prediction ...... 63 DILL ..................................................................................................................................... 65 Physics-Based Structure Prediction by Zipping and Assembly ............................................................................ 65 DISTILL ................................................................................................................................ 66 DISTILL_HUMAN .................................................................................................................. 66 Distill: protein structure prediction by Machine Learning ................................................................................... 66 DISTILL_NNPIF ..................................................................................................................... 68 DOKHLAB ............................................................................................................................ 69 Protein Structure Prediction by Ab Initio Folding using Discrete Molecular Dynamics ....................................... 69 ELOFSSON ........................................................................................................................... 70 PCONS ................................................................................................................................. 70 PCOMB ............................................................................................................................... 70 PROQ .................................................................................................................................
Recommended publications
  • A Community Proposal to Integrate Structural
    F1000Research 2020, 9(ELIXIR):278 Last updated: 11 JUN 2020 OPINION ARTICLE A community proposal to integrate structural bioinformatics activities in ELIXIR (3D-Bioinfo Community) [version 1; peer review: 1 approved, 3 approved with reservations] Christine Orengo1, Sameer Velankar2, Shoshana Wodak3, Vincent Zoete4, Alexandre M.J.J. Bonvin 5, Arne Elofsson 6, K. Anton Feenstra 7, Dietland L. Gerloff8, Thomas Hamelryck9, John M. Hancock 10, Manuela Helmer-Citterich11, Adam Hospital12, Modesto Orozco12, Anastassis Perrakis 13, Matthias Rarey14, Claudio Soares15, Joel L. Sussman16, Janet M. Thornton17, Pierre Tuffery 18, Gabor Tusnady19, Rikkert Wierenga20, Tiina Salminen21, Bohdan Schneider 22 1Structural and Molecular Biology Department, University College, London, UK 2Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, CB10 1SD, UK 3VIB-VUB Center for Structural Biology, Brussels, Belgium 4Department of Oncology, Lausanne University, Swiss Institute of Bioinformatics, Lausanne, Switzerland 5Bijvoet Center, Faculty of Science – Chemistry, Utrecht University, Utrecht, 3584CH, The Netherlands 6Science for Life Laboratory, Stockholm University, Solna, S-17121, Sweden 7Dept. Computer Science, Center for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit, Amsterdam, 1081 HV, The Netherlands 8Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg 9Bioinformatics center, Department of Biology, University of Copenhagen, Copenhagen, DK-2200,
    [Show full text]
  • Characterization of TSET, an Ancient and Widespread Membrane
    RESEARCH ARTICLE elifesciences.org Characterization of TSET, an ancient and widespread membrane trafficking complex Jennifer Hirst1*†, Alexander Schlacht2†, John P Norcott3‡, David Traynor4‡, Gareth Bloomfield4, Robin Antrobus1, Robert R Kay4, Joel B Dacks2*, Margaret S Robinson1* 1Cambridge Institute for Medical Research, University of Cambridge, Cambridge, United Kingdom; 2Department of Cell Biology, University of Alberta, Edmonton, Canada; 3Department of Engineering, University of Cambridge, Cambridge, United Kingdom; 4Cell Biology, MRC Laboratory of Molecular Biology, Cambridge, United Kingdom Abstract The heterotetrameric AP and F-COPI complexes help to define the cellular map of modern eukaryotes. To search for related machinery, we developed a structure-based bioinformatics tool, and identified the core subunits of TSET, a 'missing link' between the APs and COPI. Studies in Dictyostelium indicate that TSET is a heterohexamer, with two associated scaffolding proteins. TSET is non-essential in Dictyostelium, but may act in plasma membrane turnover, and is essentially identical to the recently described TPLATE complex, TPC. However, whereas TPC was reported to be plant-specific, we can identify a full or partial complex in every *For correspondence: jh228@ eukaryotic supergroup. An evolutionary path can be deduced from the earliest origins of the cam.ac.uk (JH); [email protected] (JBD); [email protected] (MSR) heterotetramer/scaffold coat to its multiple manifestations in modern organisms, including the mammalian muniscins, descendants of the TSET medium subunits. Thus, we have uncovered † These authors contributed the machinery for an ancient and widespread pathway, which provides new insights into early equally to this work eukaryotic evolution. ‡These authors also contributed DOI: 10.7554/eLife.02866.001 equally to this work Competing interests: The authors declare that no competing interests exist.
    [Show full text]
  • Structure Prediction, Fold Recognition and Homology Modelling
    Structure prediction, fold recognition and homology modelling Marjolein Thunnissen Lund September 2009 Steps in protein modelling Similarity search (BLAST) Multiple alignment 3-D structure known No Structure known Comparative Modelling Secondary structure prediction Fold recognition Ab initio (Rosetta) Structure Prediction 1) Prediction of secondary structure. a) Method of Chou and Fasman b) Neural networks c) hydrophobicity plots 2) Prediction of tertiary structure. a) Ab initio structure prediction b) Threading - 1D-3D profiles - Knowledge based potentials c) Homology modelling How does sequence identity correlate with structural similarity Analysis by Chotia and Lesk (89) • 100% sequence identity: rmsd = experimental error • <25% (twilight zone), structures might be similar but can also be different • Rigid body movements make rmsd bigger Secondary structure prediction: Take the sequence and, using rules derived from known structures, predict the secondary structure that is most likely to be adopted by each residue Why secondary structure prediction ? • A major part of the general folding prediction problem. • The first method of obtaining some structural information from a newly determined sequence. Rules governing !-helix and "-sheet structures provide guidelines for selecting specific mutations. • Assignment of sec. str. can help to confirm structural and functional relationship between proteins when sequences homology is weak (used in threading experiments). • Important in establishing alignments during model building by homology; the first step in attempts to generate 3D models Some interesting facts 2nd structure predictions • based on primary sequence only • accuracy 64% -75% • higher accuracy for !-helices than "! strands • accuracy is dependent on protein family • predictions of engineered proteins are less accurate Methods: •Statistical methods based on studies of databases of known protein structures from which structural propensities for all amino acids are calculated.
    [Show full text]
  • CASP)-Round V
    PROTEINS: Structure, Function, and Genetics 53:334–339 (2003) Critical Assessment of Methods of Protein Structure Prediction (CASP)-Round V John Moult,1 Krzysztof Fidelis,2 Adam Zemla,2 and Tim Hubbard3 1Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, Rockville, Maryland 2Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, Livermore, California 3Sanger Institute, Wellcome Trust Genome Campus, Cambridgeshire, United Kingdom ABSTRACT This article provides an introduc- The role and importance of automated servers in the tion to the special issue of the journal Proteins structure prediction field continue to grow. Another main dedicated to the fifth CASP experiment to assess the section of the issue deals with this topic. The first of these state of the art in protein structure prediction. The articles describes the CAFASP3 experiment. The goal of article describes the conduct, the categories of pre- CAFASP is to assess the state of the art in automatic diction, and the evaluation and assessment proce- methods of structure prediction.16 Whereas CASP allows dures of the experiment. A brief summary of progress any combination of computational and human methods, over the five CASP experiments is provided. Related CAFASP captures predictions directly from fully auto- developments in the field are also described. Proteins matic servers. CAFASP makes use of the CASP target 2003;53:334–339. © 2003 Wiley-Liss, Inc. distribution and prediction collection infrastructure, but is otherwise independent. The results of the CAFASP3 experi- Key words: protein structure prediction; communi- ment were also evaluated by the CASP assessors, provid- tywide experiment; CASP ing a comparison of fully automatic and hybrid methods.
    [Show full text]
  • Benchmarking the POEM@HOME Network for Protein Structure
    3rd International Workshop on Science Gateways for Life Sciences (IWSG 2011), 8-10 JUNE 2011 Benchmarking the POEM@HOME Network for Protein Structure Prediction Timo Strunk1, Priya Anand1, Martin Brieg2, Moritz Wolf1, Konstantin Klenin2, Irene Meliciani1, Frank Tristram1, Ivan Kondov2 and Wolfgang Wenzel1,* 1Institute of Nanotechnology, Karlsruhe Institute of Technology, PO Box 3640, 76021 Karlsruhe, Germany. 2Steinbuch Centre for Computing, Karlsruhe Institute of Technology, PO Box 3640, 76021 Karlsruhe, Germany ABSTRACT forcefields (Fitzgerald, et al., 2007). Knowledge-based potentials, in contrast, perform very well in differentiating native from non- Motivation: Structure based methods for drug design offer great native protein structures (Wang, et al., 2004; Zhou, et al., 2007; potential for in-silico discovery of novel drugs but require accurate Zhou, et al., 2006) and have recently made inroads into the area of models of the target protein. Because many proteins, in particular protein folding. Physics-based models retain the appeal of high transmembrane proteins, are difficult to characterize experimentally, transferability, but the present lack of truly transferable potentials methods of protein structure prediction are required to close the gap calls for the development of novel forcefields for protein structure prediction and modeling (Schug, et al., 2006; Verma, et al., 2007; between sequence and structure information. Established methods Verma and Wenzel, 2009). for protein structure prediction work well only for targets of high We have earlier reported the rational development of transferable homology to known proteins, while biophysics based simulation free energy forcefields PFF01/02 (Schug, et al., 2005; Verma and methods are restricted to small systems and require enormous Wenzel, 2009) that correctly predict the native conformation of computational resources.
    [Show full text]
  • Methods for the Refinement of Protein Structure 3D Models
    International Journal of Molecular Sciences Review Methods for the Refinement of Protein Structure 3D Models Recep Adiyaman and Liam James McGuffin * School of Biological Sciences, University of Reading, Reading RG6 6AS, UK; [email protected] * Correspondence: l.j.mcguffi[email protected]; Tel.: +44-0-118-378-6332 Received: 2 April 2019; Accepted: 7 May 2019; Published: 1 May 2019 Abstract: The refinement of predicted 3D protein models is crucial in bringing them closer towards experimental accuracy for further computational studies. Refinement approaches can be divided into two main stages: The sampling and scoring stages. Sampling strategies, such as the popular Molecular Dynamics (MD)-based protocols, aim to generate improved 3D models. However, generating 3D models that are closer to the native structure than the initial model remains challenging, as structural deviations from the native basin can be encountered due to force-field inaccuracies. Therefore, different restraint strategies have been applied in order to avoid deviations away from the native structure. For example, the accurate prediction of local errors and/or contacts in the initial models can be used to guide restraints. MD-based protocols, using physics-based force fields and smart restraints, have made significant progress towards a more consistent refinement of 3D models. The scoring stage, including energy functions and Model Quality Assessment Programs (MQAPs) are also used to discriminate near-native conformations from non-native conformations. Nevertheless, there are often very small differences among generated 3D models in refinement pipelines, which makes model discrimination and selection problematic. For this reason, the identification of the most native-like conformations remains a major challenge.
    [Show full text]
  • Advances in Rosetta Protein Structure Prediction on Massively Parallel Systems
    UC San Diego UC San Diego Previously Published Works Title Advances in Rosetta protein structure prediction on massively parallel systems Permalink https://escholarship.org/uc/item/87g6q6bw Journal IBM Journal of Research and Development, 52(1) ISSN 0018-8646 Authors Raman, S. Baker, D. Qian, B. et al. Publication Date 2008 Peer reviewed eScholarship.org Powered by the California Digital Library University of California Advances in Rosetta protein S. Raman B. Qian structure prediction on D. Baker massively parallel systems R. C. Walker One of the key challenges in computational biology is prediction of three-dimensional protein structures from amino-acid sequences. For most proteins, the ‘‘native state’’ lies at the bottom of a free- energy landscape. Protein structure prediction involves varying the degrees of freedom of the protein in a constrained manner until it approaches its native state. In the Rosetta protein structure prediction protocols, a large number of independent folding trajectories are simulated, and several lowest-energy results are likely to be close to the native state. The availability of hundred-teraflop, and shortly, petaflop, computing resources is revolutionizing the approaches available for protein structure prediction. Here, we discuss issues involved in utilizing such machines efficiently with the Rosetta code, including an overview of recent results of the Critical Assessment of Techniques for Protein Structure Prediction 7 (CASP7) in which the computationally demanding structure-refinement process was run on 16 racks of the IBM Blue Gene/Le system at the IBM T. J. Watson Research Center. We highlight recent advances in high-performance computing and discuss future development paths that make use of the next-generation petascale (.1012 floating-point operations per second) machines.
    [Show full text]
  • Gathering Them in to the Fold
    © 1996 Nature Publishing Group http://www.nature.com/nsmb • comment tion trials, preferring the shorter The first was held in 19947 (see Gathering sequences which are members of a http://iris4.carb.nist.gov/); the sec­ sequence family: shorter sequences ond will run throughout 19968 were felt to be easier to predict, and ( CASP2: second meeting for the them in to methods such as secondary struc­ critical assessment of methods of ture prediction are significantly protein structure prediction, Asilo­ the fold more accurate when based on a mar, December 1996; URL: multiple sequence alignment than http://iris4.carb.nist.gov/casp2/ or A definition of the state of the art in on a single sequence: one or two http://www.mrc-cpe.cam.ac.uk/casp the prediction of protein folding homologous sequences whisper 21). pattern from amino acid sequence about their three-dimensional What do the 'hard' results of the was the object of the course/work­ structure; a full multiple alignment past predication competition and shop "Frontiers of protein structure shouts out loud. the 'soft' results of the feelings of prediction" held at the Istituto di Program systems used included the workshop partiCipants say 1 Ricerche di Biologia Molecolare fold recognition methods by Sippl , about the possibilities of meaning­ (IRBM), outside Rome, on 8-17 Jones2, Barton (unpublished) and ful prediction? Certainly it is worth October 1995 (organized by T. Rost 3; secondary structure predic­ trying these methods on your target Hubbard & A. Tramontano; see tion by Rost and Sander\ analysis sequence-if you find nothing http://www.mrc-cpe.cam.ac.uk/irbm­ of correlated mutations by Goebel, quickly you can stop and return to course951).
    [Show full text]
  • Improving the Performance of Rosetta Using Multiple Sequence Alignment
    PROTEINS:Structure,Function,andGenetics43:1–11(2001) ImprovingthePerformanceofRosettaUsingMultiple SequenceAlignmentInformationandGlobalMeasures ofHydrophobicCoreFormation RichardBonneau,1 CharlieE.M.Strauss,2 andDavidBaker1* 1DepartmentofBiochemistry,Box357350,UniversityofWashington,Seattle,Washington 2BiosciencesDivision,LosAlamosNationalLaboratory,LosAlamos,NewMexico ABSTRACT Thisstudyexplorestheuseofmul- alwayssharethesamefold.8–11 Eachsequencealignment tiplesequencealignment(MSA)informationand canthereforebethoughtofasrepresentingasinglefold, globalmeasuresofhydrophobiccoreformationfor witheachpositionintheMSArepresentingthepreference improvingtheRosettaabinitioproteinstructure fordifferentaminoacidsandgapsatthecorresponding predictionmethod.Themosteffectiveuseofthe positioninthefold.BadretdinovandFinkelsteinand MSAinformationisachievedbycarryingoutinde- colleaguesusedthetheoreticalframeworkoftherandom pendentfoldingsimulationsforasubsetofthe energymodeltoarguethatthedecoydiscrimination homologoussequencesintheMSAandthenidentify- problemisnotcurrentlypossibleunlesstheinformationin ingthefreeenergyminimacommontoallfolded theMSAisusedtosmooththeenergylandscape.12,13 sequencesviasimultaneousclusteringoftheinde- Usingathree-dimensional(3D)cubiclatticemodelfora pendentfoldingruns.Globalmeasuresofhydropho- polypeptide,theseinvestigatorsdemonstratedthataverag- biccoreformation,usingellipsoidalratherthan ingascoringfunctioncontainingrandomerrorsover sphericalrepresentationsofthehydrophobiccore, severalhomologoussequencesallowedthecorrectfoldto
    [Show full text]
  • Precise Structural Contact Prediction Using Sparse Inverse Covariance Estimation on Large Multiple Sequence Alignments David T
    Vol. 28 no. 2 2012, pages 184–190 BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btr638 Sequence analysis Advance Access publication November 17, 2011 PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments David T. Jones1,∗, Daniel W. A. Buchan1, Domenico Cozzetto1 and Massimiliano Pontil2 1Department of Computer Science, Bioinformatics Group and 2Department of Computer Science, Centre for Computational Statistics and Machine Learning, University College London, Malet Place, London WC1E 6BT, UK Associate Editor: Mario Albrecht ABSTRACT separated with regards to the primary sequence but display close Downloaded from https://academic.oup.com/bioinformatics/article/28/2/184/198108 by guest on 30 September 2021 Motivation: The accurate prediction of residue–residue contacts, proximity within the 3D structure. Although different geometric critical for maintaining the native fold of a protein, remains an criteria for defining contacts have been given in the literature, open problem in the field of structural bioinformatics. Interest in typically, contacts are regarded as those pairs of residues where the this long-standing problem has increased recently with algorithmic C-β atoms approach within 8 Å of one another (C-α in the case of improvements and the rapid growth in the sizes of sequence families. Glycine) (Fischer et al., 2001; Graña et al., 2005a, b; Hamilton and Progress could have major impacts in both structure and function Huber, 2008). prediction to name but two benefits. Sequence-based contact It has long been observed that with sufficient correct information predictions are usually made by identifying correlated mutations about a protein’s residue–residue contacts, it is possible to elucidate within multiple sequence alignments (MSAs), most commonly the fold of the protein (Gobel et al., 1994; Olmea and Valencia, through the information-theoretic approach of calculating mutual 1997).
    [Show full text]
  • Rosetta @ Home and the Foldit Game
    Rosetta @ Home and the Foldit game Observations of a distributed computing project, for the Interconnect & Neuroscience course by prof.dr.ir. R.H.J.M. Otten By Frank Razenberg – [email protected] – student id. 0636007 Contents 1 Introduction .......................................................................................................................................... 3 2 Distributed Computing .......................................................................................................................... 4 2.1 Concept ......................................................................................................................................... 4 2.2 Applications ................................................................................................................................... 4 3 The BOINC Platform .............................................................................................................................. 5 3.1 Design ............................................................................................................................................ 5 3.2 Processing power .......................................................................................................................... 5 3.3 Interesting projects ....................................................................................................................... 6 3.3.1 SETI@home ..........................................................................................................................
    [Show full text]
  • Chapter 1 Ab Initio Protein Structure Prediction
    Chapter 1 Ab Initio Protein Structure Prediction Jooyoung Lee, Peter L. Freddolino and Yang Zhang Abstract Predicting a protein’s structure from its amino acid sequence remains an unsolved problem after several decades of efforts. If the query protein has a homolog of known structure, the task is relatively easy and high-resolution models can often be built by copying and refining the framework of the solved structure. However, a template-based modeling procedure does not help answer the questions of how and why a protein adopts its specific structure. In particular, if structural homologs do not exist, or exist but cannot be identified, models have to be con- structed from scratch. This procedure, called ab initio modeling, is essential for a complete solution to the protein structure prediction problem; it can also help us understand the physicochemical principle of how proteins fold in nature. Currently, the accuracy of ab initio modeling is low and the success is generally limited to small proteins (<120 residues). With the help of co-evolution based contact map predictions, success in folding larger-size proteins was recently witnessed in blind testing experiments. In this chapter, we give a review on the field of ab initio structure modeling. Our focus will be on three key components of the modeling algorithms: energy function design, conformational search, and model selection. Progress and advances of several representative algorithms will be discussed. Keyword Protein structure prediction Ab initio folding Contact prediction Force field Á Á Á J. Lee School of Computational Sciences, Korea Institute for Advanced Study, Seoul 130-722, Korea P.L.
    [Show full text]