RCSB Protein Data Bank Tools for 3D Structure-Guided Cancer Research: Human Papillomavirus (HPV) Case Study
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Uniprot at EMBL-EBI's Role in CTTV
Barbara P. Palka, Daniel Gonzalez, Edd Turner, Xavier Watkins, Maria J. Martin, Claire O’Donovan European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK UniProt at EMBL-EBI’s role in CTTV: contributing to improved disease knowledge Introduction The mission of UniProt is to provide the scientific community with a The Centre for Therapeutic Target Validation (CTTV) comprehensive, high quality and freely accessible resource of launched in Dec 2015 a new web platform for life- protein sequence and functional information. science researchers that helps them identify The UniProt Knowledgebase (UniProtKB) is the central hub for the collection of therapeutic targets for new and repurposed medicines. functional information on proteins, with accurate, consistent and rich CTTV is a public-private initiative to generate evidence on the annotation. As much annotation information as possible is added to each validity of therapeutic targets based on genome-scale experiments UniProtKB record and this includes widely accepted biological ontologies, and analysis. CTTV is working to create an R&D framework that classifications and cross-references, and clear indications of the quality of applies to a wide range of human diseases, and is committed to annotation in the form of evidence attribution of experimental and sharing its data openly with the scientific community. CTTV brings computational data. together expertise from four complementary institutions: GSK, Biogen, EMBL-EBI and Wellcome Trust Sanger Institute. UniProt’s disease expert curation Q5VWK5 (IL23R_HUMAN) This section provides information on the disease(s) associated with genetic variations in a given protein. The information is extracted from the scientific literature and diseases that are also described in the OMIM database are represented with a controlled vocabulary. -
Itcontents 9..22
INTERNATIONAL TABLES FOR CRYSTALLOGRAPHY Volume F CRYSTALLOGRAPHY OF BIOLOGICAL MACROMOLECULES Edited by MICHAEL G. ROSSMANN AND EDDY ARNOLD Advisors and Advisory Board Advisors: J. Drenth, A. Liljas. Advisory Board: U. W. Arndt, E. N. Baker, S. C. Harrison, W. G. J. Hol, K. C. Holmes, L. N. Johnson, H. M. Berman, T. L. Blundell, M. Bolognesi, A. T. Brunger, C. E. Bugg, K. K. Kannan, S.-H. Kim, A. Klug, D. Moras, R. J. Read, R. Chandrasekaran, P. M. Colman, D. R. Davies, J. Deisenhofer, T. J. Richmond, G. E. Schulz, P. B. Sigler,² D. I. Stuart, T. Tsukihara, R. E. Dickerson, G. G. Dodson, H. Eklund, R. GiegeÂ,J.P.Glusker, M. Vijayan, A. Yonath. Contributing authors E. E. Abola: The Department of Molecular Biology, The Scripps Research W. Chiu: Verna and Marrs McLean Department of Biochemistry and Molecular Institute, La Jolla, CA 92037, USA. [24.1] Biology, Baylor College of Medicine, Houston, Texas 77030, USA. [19.2] P. D. Adams: The Howard Hughes Medical Institute and Department of Molecular J. C. Cole: Cambridge Crystallographic Data Centre, 12 Union Road, Cambridge Biophysics and Biochemistry, Yale University, New Haven, CT 06511, USA. CB2 1EZ, England. [22.4] [18.2, 25.2.3] M. L. Connolly: 1259 El Camino Real #184, Menlo Park, CA 94025, USA. F. H. Allen: Cambridge Crystallographic Data Centre, 12 Union Road, Cambridge [22.1.2] CB2 1EZ, England. [22.4, 24.3] K. D. Cowtan: Department of Chemistry, University of York, York YO1 5DD, U. W. Arndt: Laboratory of Molecular Biology, Medical Research Council, Hills England. -
Assessment of the Model Refinement Category in CASP12 Ladislav
Assessment of the model refinement category in CASP12 1, * 1, * 1, * Ladislav Hovan, Vladimiras Oleinikovas, Havva Yalinca, Andriy 2 1 1, 3 Kryshtafovych, Giorgio Saladino, and Francesco Luigi Gervasio 1Department of Chemistry, University College London, London WC1E 6BT, United Kingdom 2Genome Center, University of California, Davis, California 95616, USA 3Institute of Structural and Molecular Biology, University College London, London WC1E 6BT, United Kingdom. ∗ Contributed equally. Correspondence: Francesco Luigi Gervasio, Department of Chemistry, University College London, London WC1E 6BT, United Kingdom. Email:[email protected] ABSTRACT We here report on the assessment of the model refinement predictions submitted to the 12th Experiment on the Critical Assessment of Protein Structure Prediction (CASP12). This is the fifth refinement experiment since CASP8 (2008) and, as with the previous experiments, the predictors were invited to refine selected server models received in the regular (non- refinement) stage of the CASP experiment. We assessed the submitted models using a combination of standard CASP measures. The coefficients for the linear combination of Z-scores (the CASP12 score) have been obtained by a machine learning algorithm trained on the results of visual inspection. We identified 8 groups that improve both the backbone conformation and the side chain positioning for the majority of targets. Albeit the top methods adopted distinctively different approaches, their overall performance was almost indistinguishable, with each of them excelling in different scores or target subsets. What is more, there were a few novel approaches that, while doing worse than average in most cases, provided the best refinements for a few targets, showing significant latitude for further innovation in the field. -
Wefold: a Coopetition for Protein Structure Prediction George A
proteins STRUCTURE O FUNCTION O BIOINFORMATICS WeFold: A coopetition for protein structure prediction George A. Khoury,1 Adam Liwo,2 Firas Khatib,3,15 Hongyi Zhou,4 Gaurav Chopra,5,6 Jaume Bacardit,7 Leandro O. Bortot,8 Rodrigo A. Faccioli,9 Xin Deng,10 Yi He,11 Pawel Krupa,2,11 Jilong Li,10 Magdalena A. Mozolewska,2,11 Adam K. Sieradzan,2 James Smadbeck,1 Tomasz Wirecki,2,11 Seth Cooper,12 Jeff Flatten,12 Kefan Xu,12 David Baker,3 Jianlin Cheng,10 Alexandre C. B. Delbem,9 Christodoulos A. Floudas,1 Chen Keasar,13 Michael Levitt,5 Zoran Popovic´,12 Harold A. Scheraga,11 Jeffrey Skolnick,4 Silvia N. Crivelli ,14* and Foldit Players 1 Department of Chemical and Biological Engineering, Princeton University, Princeton 2 Faculty of Chemistry, University of Gdansk, Gdansk, Poland 3 Department of Biochemistry, University of Washington, Seattle 4 Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta 5 Department of Structural Biology, School of Medicine, Stanford University, Palo Alto 6 Diabetes Center, School of Medicine, University of California San Francisco (UCSF), San Francisco 7 School of Computing Science, Newcastle University, Newcastle, United Kingdom 8 Laboratory of Biological Physics, Faculty of Pharmaceutical Sciences at Ribeir~ao Preto, University of S~ao Paulo, S~ao Paulo, Brazil 9 Institute of Mathematical and Computer Sciences, University of S~ao Paulo, S~ao Paulo, Brazil 10 Department of Computer Science, University of Missouri, Columbia 11 Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca 12 Department of Computer Science and Engineering, Center for Game Science, University of Washington, Seattle 13 Departments of Computer Science and Life Sciences, Ben Gurion University of the Negev, Israel 14 Department of Computer Science, University of California, Davis, Davis 15 Department of Computer and Information Science, University of Massachusetts Dartmouth, Dartmouth ABSTRACT The protein structure prediction problem continues to elude scientists. -
Webnetcoffee
Hu et al. BMC Bioinformatics (2018) 19:422 https://doi.org/10.1186/s12859-018-2443-4 SOFTWARE Open Access WebNetCoffee: a web-based application to identify functionally conserved proteins from Multiple PPI networks Jialu Hu1,2, Yiqun Gao1, Junhao He1, Yan Zheng1 and Xuequn Shang1* Abstract Background: The discovery of functionally conserved proteins is a tough and important task in system biology. Global network alignment provides a systematic framework to search for these proteins from multiple protein-protein interaction (PPI) networks. Although there exist many web servers for network alignment, no one allows to perform global multiple network alignment tasks on users’ test datasets. Results: Here, we developed a web server WebNetcoffee based on the algorithm of NetCoffee to search for a global network alignment from multiple networks. To build a series of online test datasets, we manually collected 218,339 proteins, 4,009,541 interactions and many other associated protein annotations from several public databases. All these datasets and alignment results are available for download, which can support users to perform algorithm comparison and downstream analyses. Conclusion: WebNetCoffee provides a versatile, interactive and user-friendly interface for easily running alignment tasks on both online datasets and users’ test datasets, managing submitted jobs and visualizing the alignment results through a web browser. Additionally, our web server also facilitates graphical visualization of induced subnetworks for a given protein and its neighborhood. To the best of our knowledge, it is the first web server that facilitates the performing of global alignment for multiple PPI networks. Availability: http://www.nwpu-bioinformatics.com/WebNetCoffee Keywords: Multiple network alignment, Webserver, PPI networks, Protein databases, Gene ontology Background tools [7–10] have been developed to understand molec- Proteins are involved in almost all life processes. -
Stretch and Twist of HEAT Repeats Leads to Activation of DNA-PK Kinase
Combined ManuscriptbioRxiv preprint File doi: https://doi.org/10.1101/2020.10.19.346148; this version posted October 21, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Chen, et al., 2020 Stretch and Twist of HEAT Repeats Leads to Activation of DNA-PK Kinase Xuemin Chen1, Xiang Xu1,*,&, Yun Chen1,*,#, Joyce C. Cheung1,%, Huaibin Wang2, Jiansen Jiang3, Natalia de Val4, Tara Fox4, Martin Gellert1 and Wei Yang1 1 Laboratory of Molecular Biology and 2 Laboratory of Cell and Molecular Biology, NIDDK, and 3 Laboratory of Membrane Proteins and Structural Biology, NHLBI, National Institutes of Health, Bethesda, MD 20892. 4 Cancer Research Technology Program Frederick National Laboratory for Cancer Research, Leidos Biomedical Research Inc., Frederick, MD 21701, USA. * These authors contributed equally. & Current address: [email protected] # Current address: [email protected] % Current address: [email protected] Running title: Slinky-like HEAT-repeat Movement Leads to DNA-PK Activation Keyword: DNA-PKcs, Ku70, Ku80, PIKKs, DNA-end binding Correspondence: Wei Yang ([email protected]) Martin Gellert ([email protected]) 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.10.19.346148; this version posted October 21, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. -
Impact of the Protein Data Bank Across Scientific Disciplines.Data Science Journal, 19: 25, Pp
Feng, Z, et al. 2020. Impact of the Protein Data Bank Across Scientific Disciplines. Data Science Journal, 19: 25, pp. 1–14. DOI: https://doi.org/10.5334/dsj-2020-025 RESEARCH PAPER Impact of the Protein Data Bank Across Scientific Disciplines Zukang Feng1,2, Natalie Verdiguel3, Luigi Di Costanzo1,4, David S. Goodsell1,5, John D. Westbrook1,2, Stephen K. Burley1,2,6,7,8 and Christine Zardecki1,2 1 Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ, US 2 Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, US 3 University of Central Florida, Orlando, Florida, US 4 Department of Agricultural Sciences, University of Naples Federico II, Portici, IT 5 Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, US 6 Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, US 7 Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ, US 8 Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, US Corresponding author: Christine Zardecki ([email protected]) The Protein Data Bank archive (PDB) was established in 1971 as the 1st open access digital data resource for biology and medicine. Today, the PDB contains >160,000 atomic-level, experimentally-determined 3D biomolecular structures. PDB data are freely and publicly available for download, without restrictions. Each entry contains summary information about the structure and experiment, atomic coordinates, and in most cases, a citation to a corresponding scien- tific publication. -
The Biogrid Interaction Database
D470–D478 Nucleic Acids Research, 2015, Vol. 43, Database issue Published online 26 November 2014 doi: 10.1093/nar/gku1204 The BioGRID interaction database: 2015 update Andrew Chatr-aryamontri1, Bobby-Joe Breitkreutz2, Rose Oughtred3, Lorrie Boucher2, Sven Heinicke3, Daici Chen1, Chris Stark2, Ashton Breitkreutz2, Nadine Kolas2, Lara O’Donnell2, Teresa Reguly2, Julie Nixon4, Lindsay Ramage4, Andrew Winter4, Adnane Sellam5, Christie Chang3, Jodi Hirschman3, Chandra Theesfeld3, Jennifer Rust3, Michael S. Livstone3, Kara Dolinski3 and Mike Tyers1,2,4,* 1Institute for Research in Immunology and Cancer, Universite´ de Montreal,´ Montreal,´ Quebec H3C 3J7, Canada, 2The Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario M5G 1X5, Canada, 3Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA, 4School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JR, UK and 5Centre Hospitalier de l’UniversiteLaval´ (CHUL), Quebec,´ Quebec´ G1V 4G2, Canada Received September 26, 2014; Revised November 4, 2014; Accepted November 5, 2014 ABSTRACT semi-automated text-mining approaches, and to en- hance curation quality control. The Biological General Repository for Interaction Datasets (BioGRID: http://thebiogrid.org) is an open access database that houses genetic and protein in- INTRODUCTION teractions curated from the primary biomedical lit- Massive increases in high-throughput DNA sequencing erature for all major model organism species and technologies (1) have enabled an unprecedented level of humans. As of September 2014, the BioGRID con- genome annotation for many hundreds of species (2–6), tains 749 912 interactions as drawn from 43 149 pub- which has led to tremendous progress in the understand- lications that represent 30 model organisms. -
4.0-Е Resolution Cryo-EM Structure of the Mammalian Chaperonin Tric
4.0-Å resolution cryo-EM structure of the mammalian chaperonin TRiC/CCT reveals its unique subunit arrangement Yao Conga, Matthew L. Bakera, Joanita Jakanaa, David Woolforda, Erik J. Millerb, Stefanie Reissmannb,2, Ramya N. Kumarb, Alyssa M. Redding-Johansonc, Tanveer S. Batthc, Aindrila Mukhopadhyayc, Steven J. Ludtkea, Judith Frydmanb, and Wah Chiua,1 aNational Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemsitry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030; bDepartment of Biology and BioX Program, Stanford University, Stanford, CA 94305; and cPhysical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 Communicated by Michael Levitt, Stanford University School of Medicine, Stanford, CA, December 7, 2009 (received for review October 21, 2009) The essential double-ring eukaryotic chaperonin TRiC/CCT (TCP1- consists of eight distinct but related subunits sharing 27–39% ring complex or chaperonin containing TCP1) assists the folding sequence identity (Fig. S1) (13, 17). In contrast, bacterial (18) of ∼5–10% of the cellular proteome. Many TRiC substrates cannot and archael chaperonins (19) only have 1–3 different types of be folded by other chaperonins from prokaryotes or archaea. These subunits, and for those archaea with three types of subunits, it unique folding properties are likely linked to TRiC’s unique hetero- is unclear whether in the natural organism they form homo- or oligomeric subunit organization, whereby each ring consists of heterooligomeric chaperonins (20). The divergence of TRiC sub- eight different paralogous subunits in an arrangement that re- units occurred early in the evolution of eukaryotes, because all mains uncertain. Using single particle cryo-EM without imposing eukaryotes sequenced to date carry genes for all eight subunits; symmetry, we determined the mammalian TRiC structure at 4.7- orthologs of the various subunits across species are more similar Å resolution. -
NEWSLETTER Number 2 Summer 2005
AMERICAN CRYSTALLOGRAPHIC ASSOCIATION NEWSLETTER Number 2 Summer 2005 Blood 2,000,000X Art in Crystallography Table of Contents - President's Column Summer 2005 President's Column Table of Contents Presidentʼs Column ..........................................................1-2 I would like to thank all of the volunteers Guest Editorial .................................................................... 2 who do so much for News from Canada .............................................................. 3 the ACA, especially ACA Awards - Warren - Buerger - Etter ...........................6-7 those involved in our Other Awards to ACA Members .......................................7-9 annual meetings. We NAS News - Bridging the Sciences ...............................8-10 had a very successful News from Latin America ............................................10-12 meeting in Orlando, Contributors to this Issue ................................................... 10 FL thanks in large part US National Committee for Crystallography .................... 13 to our Program Chair, Crystmol 2.1 ...................................................................... 14 Edward Collins, Local Chairs, Khalil Abboud 35th Mid-Atlantic Protein Meeting ................................... 15 and Thomas Selby and 16 17th West Coast Protein Workshop ................................... their hard working com- Second Annual SERT-CAT Symposium ........................... 17 mittees. All of our initial GM/CA CAT Dedication .................................................. -
Pdbefold Tutorial Tutorial Pdbefold Can May Be Accessed from Multiple Locations on the Pdbe Website
PDBe TUTORIAL PDBeFold (SSM: Secondary Structure Matching) http://pdbe.org/fold/ This PDBe tutorial introduces PDBeFold, an interactive service for comparing protein structures in 3D. This service provides: . Pairwise and multiple comparison and 3D alignment of protein structures . Examination of a protein structure for similarity with the whole Protein Data Bank (PDB) archive or SCOP. Best C -alignment of compared structures . Download and visualisation of best-superposed structures using various graphical packages PDBeFold structure alignment is based on identification of residues occupying “equivalent” geometrical positions. In other words, unlike sequence alignment, residue type is neglected. The PDBeFold service is a very powerful structure alignment tool which can perform both pairwise and multiple three dimensional alignment. In addition to this there are various options by which the results of the structural alignment query can be sorted. The results of the Secondary Structure Matching can be sorted based on the Q score (Cα- alignment), P score (taking into account RMSD, number of aligned residues, number of gaps, number of matched Secondary Structure Elements and the SSE match score), Z score (based on Gaussian Statistics), RMSD and % Sequence Identity. It is hoped that at the end of this tutorial users will be able to use PDbeFold for the analysis of their own uploaded structures or entries already in the PDB archive. Protein Data Bank in Europe http://pdbe.org PDBeFOLD Tutorial Tutorial PDBeFold can may be accessed from multiple locations on the PDBe website. From the PDBe home page (http://pdbe.org/), there are two access points for the program as shown below. -
EMBL-EBI-Overview.Pdf
EMBL-EBI Overview EMBL-EBI Overview Welcome Welcome to the European Bioinformatics Institute (EMBL-EBI), a global hub for big data in biology. We promote scientific progress by providing freely available data to the life-science research community, and by conducting exceptional research in computational biology. At EMBL-EBI, we manage public life-science data on a very large scale, offering a rich resource of carefully curated information. We make our data, tools and infrastructure openly available to an increasingly data-driven scientific community, adjusting to the changing needs of our users, researchers, trainees and industry partners. This proactive approach allows us to deliver relevant, up-to-date data and tools to the millions of scientists who depend on our services. We are a founding member of ELIXIR, the European infrastructure for biological information, and are central to global efforts to exchange information, set standards, develop new methods and curate complex information. Our core databases are produced in collaboration with other world leaders including the National Center for Biotechnology Information in the US, the National Institute of Genetics in Japan, SIB Swiss Institute of Bioinformatics and the Wellcome Trust Sanger Institute in the UK. We are also a world leader in computational biology research, and are well integrated with experimental and computational groups on all EMBL sites. Our research programme is highly collaborative and interdisciplinary, regularly producing high-impact works on sequence and structural alignment, genome analysis, basic biological breakthroughs, algorithms and methods of widespread importance. EMBL-EBI is an international treaty organisation, and we serve the global scientific community.