Interpro, Progress and Status in 2005
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
The SUPERFAMILY 2.0 Database: a Significant Proteome Update and a New Webserver Arun Prasad Pandurangan 1,*, Jonathan Stahlhacke2, Matt E
D490–D494 Nucleic Acids Research, 2019, Vol. 47, Database issue Published online 16 November 2018 doi: 10.1093/nar/gky1130 The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver Arun Prasad Pandurangan 1,*, Jonathan Stahlhacke2, Matt E. Oates2, Ben Smithers 2 and Julian Gough1 1MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK and 2Computer Science, University of Bristol, Bristol BS8 1UB, UK Received September 24, 2018; Revised October 23, 2018; Editorial Decision October 23, 2018; Accepted October 25, 2018 ABSTRACT level, most homologous proteins cluster together with high sequence similarity suggesting clear evolutionary relation- Here, we present a major update to the SUPERFAM- ship and functional consistency (3). The SUPERFAMILY ILY database and the webserver. We describe the ad- database provides domain annotations at both Superfamily dition of new SUPERFAMILY 2.0 profile HMM library and Family levels (4). containing a total of 27 623 HMMs. The database SUPERFAMILYprovides various analysis tools to facil- now includes Superfamily domain annotations for itate better analysis and interpretation of the database con- millions of protein sequences taken from the Uni- tent. They include the identification of under- and overrep- versal Protein Recourse Knowledgebase (UniPro- resentation of domains between genomes (5), construction tKB) and the National Center for Biotechnology In- of phylogenetic trees (6), analysis of the domain distribution formation (NCBI). This addition constitutes about 51 of superfamilies and families across the tree of life (7)aswell and 45 million distinct protein sequences obtained as providing ontology based annotations for SUPERFAM- ILY domains and architectures (8,9). -
Enhanced Representation of Natural Product Metabolism in Uniprotkb
H OH metabolites OH Article Diverse Taxonomies for Diverse Chemistries: Enhanced Representation of Natural Product Metabolism in UniProtKB Marc Feuermann 1,* , Emmanuel Boutet 1,* , Anne Morgat 1 , Kristian B. Axelsen 1, Parit Bansal 1, Jerven Bolleman 1 , Edouard de Castro 1, Elisabeth Coudert 1, Elisabeth Gasteiger 1,Sébastien Géhant 1, Damien Lieberherr 1, Thierry Lombardot 1,†, Teresa B. Neto 1, Ivo Pedruzzi 1, Sylvain Poux 1, Monica Pozzato 1, Nicole Redaschi 1 , Alan Bridge 1 and on behalf of the UniProt Consortium 1,2,3,4,‡ 1 Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; [email protected] (A.M.); [email protected] (K.B.A.); [email protected] (P.B.); [email protected] (J.B.); [email protected] (E.d.C.); [email protected] (E.C.); [email protected] (E.G.); [email protected] (S.G.); [email protected] (D.L.); [email protected] (T.L.); [email protected] (T.B.N.); [email protected] (I.P.); [email protected] (S.P.); [email protected] (M.P.); [email protected] (N.R.); [email protected] (A.B.); [email protected] (U.C.) 2 European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK 3 Protein Information Resource, University of Delaware, 15 Innovation Way, Suite 205, Newark, DE 19711, USA 4 Protein Information Resource, Georgetown University Medical Center, 3300 Whitehaven Street NorthWest, Suite 1200, Washington, DC 20007, USA * Correspondence: [email protected] (M.F.); [email protected] (E.B.); Tel.: +41-22-379-58-75 (M.F.); +41-22-379-49-10 (E.B.) † Current address: Centre Informatique, Division Calcul et Soutien à la Recherche, University of Lausanne, CH-1015 Lausanne, Switzerland. -
The EMBL-European Bioinformatics Institute the Hub for Bioinformatics in Europe
The EMBL-European Bioinformatics Institute The hub for bioinformatics in Europe Blaise T.F. Alako, PhD [email protected] www.ebi.ac.uk What is EMBL-EBI? • Part of the European Molecular Biology Laboratory • International, non-profit research institute • Europe’s hub for biological data, services and research The European Molecular Biology Laboratory Heidelberg Hamburg Hinxton, Cambridge Basic research Structural biology Bioinformatics Administration Grenoble Monterotondo, Rome EMBO EMBL staff: 1500 people Structural biology Mouse biology >60 nationalities EMBL member states Austria, Belgium, Croatia, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Israel, Italy, Luxembourg, the Netherlands, Norway, Portugal, Spain, Sweden, Switzerland and the United Kingdom Associate member state: Australia Who we are ~500 members of staff ~400 work in services & support >53 nationalities ~120 focus on basic research EMBL-EBI’s mission • Provide freely available data and bioinformatics services to all facets of the scientific community in ways that promote scientific progress • Contribute to the advancement of biology through basic investigator-driven research in bioinformatics • Provide advanced bioinformatics training to scientists at all levels, from PhD students to independent investigators • Help disseminate cutting-edge technologies to industry • Coordinate biological data provision throughout Europe Services Data and tools for molecular life science www.ebi.ac.uk/services Browse our services 9 What services do we provide? Labs around the -
Tum1 Is Involved in the Metabolism of Sterol Esters in Saccharomyces Cerevisiae Katja Uršič1,4, Mojca Ogrizović1,Dušan Kordiš1, Klaus Natter2 and Uroš Petrovič1,3*
Uršič et al. BMC Microbiology (2017) 17:181 DOI 10.1186/s12866-017-1088-1 RESEARCHARTICLE Open Access Tum1 is involved in the metabolism of sterol esters in Saccharomyces cerevisiae Katja Uršič1,4, Mojca Ogrizović1,Dušan Kordiš1, Klaus Natter2 and Uroš Petrovič1,3* Abstract Background: The only hitherto known biological role of yeast Saccharomyces cerevisiae Tum1 protein is in the tRNA thiolation pathway. The mammalian homologue of the yeast TUM1 gene, the thiosulfate sulfurtransferase (a.k.a. rhodanese) Tst, has been proposed as an obesity-resistance and antidiabetic gene. To assess the role of Tum1 in cell metabolism and the putative functional connection between lipid metabolism and tRNA modification, we analysed evolutionary conservation of the rhodanese protein superfamily, investigated the role of Tum1 in lipid metabolism, and examined the phenotype of yeast strains expressing the mouse homologue of Tum1, TST. Results: We analysed evolutionary relationships in the rhodanese superfamily and established that its members are widespread in bacteria, archaea and in all major eukaryotic groups. We found that the amount of sterol esters was significantly higher in the deletion strain tum1Δ than in the wild-type strain. Expression of the mouse TST protein in the deletion strain did not rescue this phenotype. Moreover, although Tum1 deficiency in the thiolation pathway was complemented by re-introducing TUM1, it was not complemented by the introduction of the mouse homologue Tst. We further showed that the tRNA thiolation pathway is not involved in the regulation of sterol ester content in S. cerevisiae,asoverexpressionofthetEUUC,tKUUU and tQUUG tRNAs did not rescue the lipid phenotype in the tum1Δ deletion strain, and, additionally, deletion of the key gene for the tRNA thiolation pathway, UBA4, did not affect sterol ester content. -
Article Reference
Article Incidences of problematic cell lines are lower in papers that use RRIDs to identify cell lines BABIC, Zeljana, et al. Abstract The use of misidentified and contaminated cell lines continues to be a problem in biomedical research. Research Resource Identifiers (RRIDs) should reduce the prevalence of misidentified and contaminated cell lines in the literature by alerting researchers to cell lines that are on the list of problematic cell lines, which is maintained by the International Cell Line Authentication Committee (ICLAC) and the Cellosaurus database. To test this assertion, we text-mined the methods sections of about two million papers in PubMed Central, identifying 305,161 unique cell-line names in 150,459 articles. We estimate that 8.6% of these cell lines were on the list of problematic cell lines, whereas only 3.3% of the cell lines in the 634 papers that included RRIDs were on the problematic list. This suggests that the use of RRIDs is associated with a lower reported use of problematic cell lines. Reference BABIC, Zeljana, et al. Incidences of problematic cell lines are lower in papers that use RRIDs to identify cell lines. eLife, 2019, vol. 8, p. e41676 DOI : 10.7554/eLife.41676 PMID : 30693867 Available at: http://archive-ouverte.unige.ch/unige:119832 Disclaimer: layout of this document may differ from the published version. 1 / 1 FEATURE ARTICLE META-RESEARCH Incidences of problematic cell lines are lower in papers that use RRIDs to identify cell lines Abstract The use of misidentified and contaminated cell lines continues to be a problem in biomedical research. -
Smurflite: Combining Simplified Markov Random Fields With
SMURFLite: combining simplified Markov random fields with simulated evolution improves remote homology detection for beta-structural proteins into the twilight zone Noah M. Daniels 1, Raghavendra Hosur 2, Bonnie Berger 2∗, and Lenore J. Cowen 1∗ 1Department of Computer Science, Tufts University, Medford, MA 02155 2Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139 ABSTRACT are limited in their power to recognize remote homologs because of Motivation: One of the most successful methods to date for their inability to model statistical dependencies between amino-acid recognizing protein sequences that are evolutionarily related has residues that are close in space but far apart in sequence (Lifson and been profile Hidden Markov Models (HMMs). However, these models Sander (1980); Zhu and Braun (1999); Olmea et al. (1999); Cowen do not capture pairwise statistical preferences of residues that are et al. (2002); Steward and Thorton (2002)). hydrogen bonded in beta sheets. These dependencies have been For this reason, many have suggested (White et al. (1994); partially captured in the HMM setting by simulated evolution in the Lathrop and Smith (1996); Thomas et al. (2008); Liu et al. (2009); training phase and can be fully captured by Markov Random Fields Menke et al. (2010); Peng and Xu (2011)) that more powerful (MRFs). However, the MRFs can be computationally prohibitive when Markov Random Fields (MRFs) be used. MRFs employ an auxiliary beta strands are interleaved in complex topologies. dependency graph which allows them to model more complex We introduce SMURFLite, a method that combines both simplified statistical dependencies, including statistical dependencies that Markov Random Fields and simulated evolution to substantially occur between amino-acid residues that are hydrogen bonded in beta improve remote homology detection for beta structures. -
Tunca Doğan , Alex Bateman , Maria J. Martin Your Choice
(—THIS SIDEBAR DOES NOT PRINT—) UniProt Domain Architecture Alignment: A New Approach for Protein Similarity QUICK START (cont.) DESIGN GUIDE Search using InterPro Domain Annotation How to change the template color theme This PowerPoint 2007 template produces a 44”x44” You can easily change the color theme of your poster by going to presentation poster. You can use it to create your research 1 1 1 the DESIGN menu, click on COLORS, and choose the color theme of poster and save valuable time placing titles, subtitles, text, Tunca Doğan , Alex Bateman , Maria J. Martin your choice. You can also create your own color theme. and graphics. European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), We provide a series of online tutorials that will guide you Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK through the poster design process and answer your poster Correspondence: [email protected] production questions. To view our template tutorials, go online to PosterPresentations.com and click on HELP DESK. ABSTRACT METHODOLOGY RESULTS & DISCUSSION When you are ready to print your poster, go online to InterPro Domains, DAs and DA Alignment PosterPresentations.com Motivation: Similarity based methods have been widely used in order to Generation of the Domain Architectures: You can also manually change the color of your background by going to VIEW > SLIDE MASTER. After you finish working on the master be infer the properties of genes and gene products containing little or no 1) Collect the hits for each protein from InterPro. Domain annotation coverage Overlap domain hits problem in Need assistance? Call us at 1.510.649.3001 difference b/w domain databases: the InterPro database: sure to go to VIEW > NORMAL to continue working on your poster. -
Evolution and Function of Drososphila Melanogaster Cis-Regulatory Sequences
Evolution and Function of Drososphila melanogaster cis-regulatory Sequences By Aaron Hardin A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Molecular and Cell Biology in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Michael Eisen, Chair Professor Doris Bachtrog Professor Gary Karpen Professor Lior Pachter Fall 2013 Evolution and Function of Drososphila melanogaster cis-regulatory Sequences This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License 2013 by Aaron Hardin 1 Abstract Evolution and Function of Drososphila melanogaster cis-regulatory Sequences by Aaron Hardin Doctor of Philosophy in Molecular and Cell Biology University of California, Berkeley Professor Michael Eisen, Chair In this work, I describe my doctoral work studying the regulation of transcription with both computational and experimental methods on the natural genetic variation in a population. This works integrates an investigation of the consequences of polymorphisms at three stages of gene regulation in the developing fly embryo: the diversity at cis-regulatory modules, the integration of transcription factor binding into changes in chromatin state and the effects of these inputs on the final phenotype of embryonic gene expression. i I dedicate this dissertation to Mela Hardin who has been here for me at all times, even when we were apart. ii Contents List of Figures iv List of Tables vi Acknowledgments vii 1 Introduction1 2 Within Species Diversity in cis-Regulatory Modules6 2.1 Introduction....................................6 2.2 Results.......................................8 2.2.1 Genome wide diversity in transcription factor binding sites......8 2.2.2 Genome wide purifying selection on cis-regulatory modules......9 2.3 Discussion.....................................9 2.4 Methods for finding polymorphisms...................... -
General Assembly and Consortium Meeting 2020
EJP RD GA and Consortium meeting Final program EJP RD General Assembly and Consortium meeting 2020 Online September 14th – 18th Page 1 of 46 EJP RD GA and Consortium meeting Final program Program at a glance: Page 2 of 46 EJP RD GA and Consortium meeting Final program Table of contents Program per day ....................................................................................................................... 4 Plenary GA Session .................................................................................................................. 8 Parallel Session Pillar 1 .......................................................................................................... 10 Parallel Session Pillar 2 .......................................................................................................... 11 Parallel Session Pillar 3 .......................................................................................................... 14 Parallel Session Pillar 4 .......................................................................................................... 16 Experimental data resources available in the EJP RD ............................................................. 18 Introduction to translational research for clinical or pre-clinical researchers .......................... 21 Funding opportunities in EJP RD & How to build a successful proposal ............................... 22 RD-Connect Genome-Phenome Analysis Platform ................................................................. 26 Improvement -
2003 Mulder Nucl Acids Res {22
The InterPro Database, 2003 brings increased coverage and new features Nicola J Mulder, Rolf Apweiler, Teresa K Attwood, Amos Bairoch, Daniel Barrell, Alex Bateman, David Binns, Margaret Biswas, Paul Bradley, Peer Bork, et al. To cite this version: Nicola J Mulder, Rolf Apweiler, Teresa K Attwood, Amos Bairoch, Daniel Barrell, et al.. The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Research, Oxford University Press, 2003, 31 (1), pp.315-318. 10.1093/nar/gkg046. hal-01214149 HAL Id: hal-01214149 https://hal.archives-ouvertes.fr/hal-01214149 Submitted on 9 Oct 2015 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. # 2003 Oxford University Press Nucleic Acids Research, 2003, Vol. 31, No. 1 315–318 DOI: 10.1093/nar/gkg046 The InterPro Database, 2003 brings increased coverage and new features Nicola J. Mulder1,*, Rolf Apweiler1, Teresa K. Attwood3, Amos Bairoch4, Daniel Barrell1, Alex Bateman2, David Binns1, Margaret Biswas5, Paul Bradley1,3, Peer Bork6, Phillip Bucher7, Richard R. Copley8, Emmanuel Courcelle9, Ujjwal Das1, Richard Durbin2, Laurent Falquet7, Wolfgang Fleischmann1, Sam Griffiths-Jones2, Downloaded from Daniel Haft10, Nicola Harte1, Nicolas Hulo4, Daniel Kahn9, Alexander Kanapin1, Maria Krestyaninova1, Rodrigo Lopez1, Ivica Letunic6, David Lonsdale1, Ville Silventoinen1, Sandra E. -
MPGM: Scalable and Accurate Multiple Network Alignment
MPGM: Scalable and Accurate Multiple Network Alignment Ehsan Kazemi1 and Matthias Grossglauser2 1Yale Institute for Network Science, Yale University 2School of Computer and Communication Sciences, EPFL Abstract Protein-protein interaction (PPI) network alignment is a canonical operation to transfer biological knowledge among species. The alignment of PPI-networks has many applica- tions, such as the prediction of protein function, detection of conserved network motifs, and the reconstruction of species’ phylogenetic relationships. A good multiple-network align- ment (MNA), by considering the data related to several species, provides a deep understand- ing of biological networks and system-level cellular processes. With the massive amounts of available PPI data and the increasing number of known PPI networks, the problem of MNA is gaining more attention in the systems-biology studies. In this paper, we introduce a new scalable and accurate algorithm, called MPGM, for aligning multiple networks. The MPGM algorithm has two main steps: (i) SEEDGENERA- TION and (ii) MULTIPLEPERCOLATION. In the first step, to generate an initial set of seed tuples, the SEEDGENERATION algorithm uses only protein sequence similarities. In the second step, to align remaining unmatched nodes, the MULTIPLEPERCOLATION algorithm uses network structures and the seed tuples generated from the first step. We show that, with respect to different evaluation criteria, MPGM outperforms the other state-of-the-art algo- rithms. In addition, we guarantee the performance of MPGM under certain classes of net- work models. We introduce a sampling-based stochastic model for generating k correlated networks. We prove that for this model if a sufficient number of seed tuples are available, the MULTIPLEPERCOLATION algorithm correctly aligns almost all the nodes. -
Molecular Genetics & Genomics
page 46 Lab Times 5-2010 Ranking Illustration: Christina Ullman Publication Analysis 1997-2008 Molecular Genetics & Genomics Under the premise of a “narrow” definition of the field, Germany and England co-dominated European molecular genetics/genomics. The most frequently citated sub-fields were bioinformatical genomics, epigenetics, RNA biology and DNA repair. irst of all, a little science history (you’ll soon see why). As and expression. That’s where so-called computational biology is well known, in the 1950s genetics went molecular – and and systems biology enter research into basic genetic problems. Fdid not just become molecular genetics but rather molec- Given that development, it is not easy to answer the question ular bio logy. In 1963, however, Sydney Brenner wrote in his fa- what “molecular genetics & genomics” today actually is – and, mous letter to Max Perutz: “[...] I have long felt that the future of in particular, what is it in the context of our publication analy- molecular biology lies in the extension of research to other fields sis of the field? It is obvious that, as for example science historian of biology, notably development and the nervous system.” He Robert Olby put it, a “wide” definition can be distinguished from appeared not to be alone with this view and, as a consequence, a “narrow” definition of the field. The wide definition includes along with Brenner many of the leading molecular biologists all fields, into which molecular biology has entered as an exper- from the classical period redirected their research agendas, utilis- imental and theoretical paradigm. The “narrow” definition, on ing the newly developed molecular techniques to investigate un- the other hand, still tries to maintain the status as an explicit bio- solved problems in other fields.