Uniprot: the Universal Protein Knowledgebase the Uniprot Consortium1,2,3,4,*
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Clinical Spectrum and Genetic Landscape for Hereditary Spastic
Dong et al. Molecular Neurodegeneration (2018) 13:36 https://doi.org/10.1186/s13024-018-0269-1 RESEARCH ARTICLE Open Access Clinical spectrum and genetic landscape for hereditary spastic paraplegias in China En-Lin Dong1†, Chong Wang1†, Shuang Wu1†, Ying-Qian Lu1, Xiao-Hong Lin1, Hui-Zhen Su1, Miao Zhao1, Jin He1, Li-Xiang Ma2, Ning Wang1,3, Wan-Jin Chen1,3* and Xiang Lin1* Abstract Background: Hereditary spastic paraplegias (HSP) is a heterogeneous group of rare neurodegenerative disorders affecting the corticospinal tracts. To date, more than 78 HSP loci have been mapped to cause HSP. However, both the clinical and mutational spectrum of Chinese patients with HSP remained unclear. In this study, we aim to perform a comprehensive analysis of clinical phenotypes and genetic distributions in a large cohort of Chinese HSP patients, and to elucidate the primary pathogenesis in this population. Methods: We firstly performed next-generation sequencing targeting 149 genes correlated with HSP in 99 index cases of our cohort. Multiplex ligation-dependent probe amplification testing was further carried out among those patients without known disease-causing gene mutations. We simultaneously performed a retrospective study on the reported patients exhibiting HSP in other Chinese cohorts. All clinical and molecular characterization from above two groups of Chinese HSP patients were analyzed and summarized. Eventually, we further validated the cellular changes in fibroblasts of two major spastic paraplegia (SPG) patients (SPG4 and SPG11) in vitro. Results: Most patients of ADHSP (94%) are pure forms, whereas most patients of ARHSP (78%) tend to be complicated forms. In ADHSP, we found that SPG4 (79%) was the most prevalent, followed by SPG3A (11%), SPG6 (4%) and SPG33 (2%). -
Analysis of the Impact of Sequencing Errors on Blast Using Fault Injection
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Illinois Digital Environment for Access to Learning and Scholarship Repository ANALYSIS OF THE IMPACT OF SEQUENCING ERRORS ON BLAST USING FAULT INJECTION BY SO YOUN LEE THESIS Submitted in partial fulfillment of the requirements for the degree of Master of Science in Electrical and Computer Engineering in the Graduate College of the University of Illinois at Urbana-Champaign, 2013 Urbana, Illinois Adviser: Professor Ravishankar K. Iyer ABSTRACT This thesis investigates the impact of sequencing errors in post-sequence computational analyses, including local alignment search and multiple sequence alignment. While the error rates of sequencing technology are commonly reported, the significance of these numbers cannot be fully grasped without putting them in the perspective of their impact on the downstream analyses that are used for biological research, forensics, diagnosis of diseases, etc. I approached the quantification of the impact using fault injection. Faults were injected in the input sequence data, and the analyses were run. Change in the output of the analyses was interpreted as the impact of faults, or errors. Three commonly used algorithms were used: BLAST, SSEARCH, and ProbCons. The main contributions of this work are the application of fault injection to the reliability analysis in bioinformatics and the quantitative demonstration that a small error rate in the sequence data can alter the output of the analysis in a significant way. BLAST and SSEARCH are both local alignment search tools, but BLAST is a heuristic implementation, while SSEARCH is based on the optimal Smith-Waterman algorithm. -
Advancing Solutions to the Carbohydrate Sequencing Challenge † † † ‡ § Christopher J
Perspective Cite This: J. Am. Chem. Soc. 2019, 141, 14463−14479 pubs.acs.org/JACS Advancing Solutions to the Carbohydrate Sequencing Challenge † † † ‡ § Christopher J. Gray, Lukasz G. Migas, Perdita E. Barran, Kevin Pagel, Peter H. Seeberger, ∥ ⊥ # ⊗ ∇ Claire E. Eyers, Geert-Jan Boons, Nicola L. B. Pohl, Isabelle Compagnon, , × † Göran Widmalm, and Sabine L. Flitsch*, † School of Chemistry & Manchester Institute of Biotechnology, The University of Manchester, 131 Princess Street, Manchester M1 7DN, U.K. ‡ Institute for Chemistry and Biochemistry, Freie Universitaẗ Berlin, Takustraße 3, 14195 Berlin, Germany § Biomolecular Systems Department, Max Planck Institute for Colloids and Interfaces, Am Muehlenberg 1, 14476 Potsdam, Germany ∥ Department of Biochemistry, Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, U.K. ⊥ Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States # Department of Chemistry, Indiana University, Bloomington, Indiana 47405, United States ⊗ Institut Lumierè Matiere,̀ UMR5306 UniversitéLyon 1-CNRS, Universitéde Lyon, 69622 Villeurbanne Cedex, France ∇ Institut Universitaire de France IUF, 103 Blvd St Michel, 75005 Paris, France × Department of Organic Chemistry, Arrhenius Laboratory, Stockholm University, S-106 91 Stockholm, Sweden *S Supporting Information established connection between their structure and their ABSTRACT: Carbohydrates possess a variety of distinct function, full characterization of unknown carbohydrates features with -
Identification of the Drosophila Melanogaster Homolog of the Human Spastin Gene
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by RERO DOC Digital Library Dev Genes Evol (2003) 213:412–415 DOI 10.1007/s00427-003-0340-x EXPRESSION NOTE Lars Kammermeier · Jrg Spring · Michael Stierwald · Jean-Marc Burgunder · Heinrich Reichert Identification of the Drosophila melanogaster homolog of the human spastin gene Received: 14 April 2003 / Accepted: 5 May 2003 / Published online: 5 June 2003 Springer-Verlag 2003 Abstract The human SPG4 locus encodes the spastin gene encoding spastin. This gene is expressed ubiqui- gene, which is responsible for the most prevalent form tously in fetal and adult human tissues (Hazan et al. of autosomal dominant hereditary spastic paraplegia 1999). The highest expression levels are found in the (AD-HSP), a neurodegenerative disorder. Here we iden- brain, with selective expression in the cortex and striatum. tify the predicted gene product CG5977 as the Drosophila In the spinal cord, spastin is expressed exclusively in homolog of the human spastin gene, with much higher nuclei of motor neurons, suggesting that the strong sequence similarities than any other related AAA domain neurodegenerative defects observed in patients are caused protein in the fly. Furthermore we report a new potential by a primary defect of spastin in neurons (Charvin et al. transmembrane domain in the N-terminus of the two 2003). The human spastin gene encodes a predicted 616- homologous proteins. During embryogenesis, the expres- amino-acid long protein and is a member of the large sion pattern of Drosophila spastin becomes restricted family of proteins with an AAA domain (ATPases primarily to the central nervous system, in contrast to the Associated with diverse cellular Activities). -
Microtubule-Severing Enzymes at the Cutting Edge
Commentary 2561 Microtubule-severing enzymes at the cutting edge David J. Sharp1,* and Jennifer L. Ross2 1Department of Physiology and Biophysics, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA 2Department of Physics, University of Massachusetts-Amherst, 302 Hasbrouck Laboratory, Amherst, MA 01003, USA *Author for correspondence ([email protected]) Journal of Cell Science 125, 2561–2569 ß 2012. Published by The Company of Biologists Ltd doi: 10.1242/jcs.101139 Summary ATP-dependent severing of microtubules was first reported in Xenopus laevis egg extracts in 1991. Two years later this observation led to the purification of the first known microtubule-severing enzyme, katanin. Katanin homologs have now been identified throughout the animal kingdom and in plants. Moreover, members of two closely related enzyme subfamilies, spastin and fidgetin, have been found to sever microtubules and might act alongside katanins in some contexts (Roll-Mecak and McNally, 2010; Yu et al., 2008; Zhang et al., 2007). Over the past few years, it has become clear that microtubule-severing enzymes contribute to a wide range of cellular activities including mitosis and meiosis, morphogenesis, cilia biogenesis and disassembly, and migration. Thus, this group of enzymes is revealing itself to be among the most important of the microtubule regulators. This Commentary focuses on our growing understanding of how microtubule-severing enzymes contribute to the organization and dynamics of diverse microtubule arrays, as well as the structural and biophysical characteristics that afford them the unique capacity to catalyze the removal of tubulin from the interior microtubule lattice. Our goal is to provide a broader perspective, focusing on a limited number of particularly informative, representative and/or timely findings. -
"Phylogenetic Analysis of Protein Sequence Data Using The
Phylogenetic Analysis of Protein Sequence UNIT 19.11 Data Using the Randomized Axelerated Maximum Likelihood (RAXML) Program Antonis Rokas1 1Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee ABSTRACT Phylogenetic analysis is the study of evolutionary relationships among molecules, phenotypes, and organisms. In the context of protein sequence data, phylogenetic analysis is one of the cornerstones of comparative sequence analysis and has many applications in the study of protein evolution and function. This unit provides a brief review of the principles of phylogenetic analysis and describes several different standard phylogenetic analyses of protein sequence data using the RAXML (Randomized Axelerated Maximum Likelihood) Program. Curr. Protoc. Mol. Biol. 96:19.11.1-19.11.14. C 2011 by John Wiley & Sons, Inc. Keywords: molecular evolution r bootstrap r multiple sequence alignment r amino acid substitution matrix r evolutionary relationship r systematics INTRODUCTION the baboon-colobus monkey lineage almost Phylogenetic analysis is a standard and es- 25 million years ago, whereas baboons and sential tool in any molecular biologist’s bioin- colobus monkeys diverged less than 15 mil- formatics toolkit that, in the context of pro- lion years ago (Sterner et al., 2006). Clearly, tein sequence analysis, enables us to study degree of sequence similarity does not equate the evolutionary history and change of pro- with degree of evolutionary relationship. teins and their function. Such analysis is es- A typical phylogenetic analysis of protein sential to understanding major evolutionary sequence data involves five distinct steps: (a) questions, such as the origins and history of data collection, (b) inference of homology, (c) macromolecules, developmental mechanisms, sequence alignment, (d) alignment trimming, phenotypes, and life itself. -
A Deficiency Screen for Genetic Interactions with Spastin Overexpression Reveals a Role for P21-Activated Kinase 3
Genetics:Genetics: PublishedPublished ArticlesArticles AheadAhead ofof Print,Print, publishedpublished onon JuneJuly 29,24, 20112011 asas 10.1534/genetics.111.13083110.1534/genetics.111.130831 1 Loss of Drosophila melanogaster p21-activated kinase 3 (pak3) suppresses defects in synapse structure and function caused by spastin mutations Emily F. Ozdowski*, Sophia Gayle*, Hong Bao§, Bing Zhang§, Nina T. Sherwood* *Department of Biology / Institute for Genome Sciences and Policy Duke University Durham, NC 27710 §Department of Zoology University of Oklahoma Norman, OK 73019 Copyright 2011. E. F. Ozdowski, et al. 2 Running Title: pak3 and spastin in synapse development Keywords: spastin (CG5977) pak3 (CG14895) Microtubule Actin Neuromuscular junction Corresponding Author: Nina Tang Sherwood Duke University Medical Center CARL Bldg Box 3577 Durham, NC 27710 Office phone: (919) 684-8658 Lab phone: (919) 668-3656 Fax: (919) 681-9193 Email: [email protected] E. F. Ozdowski, et al. 3 ABSTRACT Microtubules are dynamic structures that must elongate, disassemble, and be cleaved into smaller pieces for proper neuronal development and function. The AAA ATPase Spastin severs microtubules along their lengths and is thought to regulate the balance between long, stable filaments and shorter fragments that seed extension or are transported. In both Drosophila and humans, loss of Spastin function results in reduction of synaptic connections and disabling motor defects. To gain insight into how spastin is regulated, we screened the Drosophila melanogaster genome for deletions that modify a spastin overexpression phenotype, eye size reduction. One suppressor region deleted p21-activated kinase 3 (pak3), which encodes a member of the Pak family of actin-regulatory enzymes, but whose in vivo function is unknown. -
EMBL-EBI Powerpoint Presentation
Processing data from high-throughput sequencing experiments Simon Anders Use-cases for HTS · de-novo sequencing and assembly of small genomes · transcriptome analysis (RNA-Seq, sRNA-Seq, ...) • identifying transcripted regions • expression profiling · Resequencing to find genetic polymorphisms: • SNPs, micro-indels • CNVs · ChIP-Seq, nucleosome positions, etc. · DNA methylation studies (after bisulfite treatment) · environmental sampling (metagenomics) · reading bar codes Use cases for HTS: Bioinformatics challenges Established procedures may not be suitable. New algorithms are required for · assembly · alignment · statistical tests (counting statistics) · visualization · segmentation · ... Where does Bioconductor come in? Several steps: · Processing of the images and determining of the read sequencest • typically done by core facility with software from the manufacturer of the sequencing machine · Aligning the reads to a reference genome (or assembling the reads into a new genome) • Done with community-developed stand-alone tools. · Downstream statistical analyis. • Write your own scripts with the help of Bioconductor infrastructure. Solexa standard workflow SolexaPipeline · "Firecrest": Identifying clusters ⇨ typically 15..20 mio good clusters per lane · "Bustard": Base calling ⇨ sequence for each cluster, with Phred-like scores · "Eland": Aligning to reference Firecrest output Large tab-separated text files with one row per identified cluster, specifying · lane index and tile index · x and y coordinates of cluster on tile · for each -
A SARS-Cov-2 Sequence Submission Tool for the European Nucleotide
Databases and ontologies Downloaded from https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btab421/6294398 by guest on 25 June 2021 A SARS-CoV-2 sequence submission tool for the European Nucleotide Archive Miguel Roncoroni 1,2,∗, Bert Droesbeke 1,2, Ignacio Eguinoa 1,2, Kim De Ruyck 1,2, Flora D’Anna 1,2, Dilmurat Yusuf 3, Björn Grüning 3, Rolf Backofen 3 and Frederik Coppens 1,2 1Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium, 1VIB Center for Plant Systems Biology, 9052 Ghent, Belgium and 2University of Freiburg, Department of Computer Science, Freiburg im Breisgau, Baden-Württemberg, Germany ∗To whom correspondence should be addressed. Associate Editor: XXXXXXX Received on XXXXX; revised on XXXXX; accepted on XXXXX Abstract Summary: Many aspects of the global response to the COVID-19 pandemic are enabled by the fast and open publication of SARS-CoV-2 genetic sequence data. The European Nucleotide Archive (ENA) is the European recommended open repository for genetic sequences. In this work, we present a tool for submitting raw sequencing reads of SARS-CoV-2 to ENA. The tool features a single-step submission process, a graphical user interface, tabular-formatted metadata and the possibility to remove human reads prior to submission. A Galaxy wrap of the tool allows users with little or no bioinformatic knowledge to do bulk sequencing read submissions. The tool is also packed in a Docker container to ease deployment. Availability: CLI ENA upload tool is available at github.com/usegalaxy- eu/ena-upload-cli (DOI 10.5281/zenodo.4537621); Galaxy ENA upload tool at toolshed.g2.bx.psu.edu/view/iuc/ena_upload/382518f24d6d and https://github.com/galaxyproject/tools- iuc/tree/master/tools/ena_upload (development) and; ENA upload Galaxy container at github.com/ELIXIR- Belgium/ena-upload-container (DOI 10.5281/zenodo.4730785) Contact: [email protected] 1 Introduction Nucleotide Archive (ENA). -
Genomic Sequencing of SARS-Cov-2: a Guide to Implementation for Maximum Impact on Public Health
Genomic sequencing of SARS-CoV-2 A guide to implementation for maximum impact on public health 8 January 2021 Genomic sequencing of SARS-CoV-2 A guide to implementation for maximum impact on public health 8 January 2021 Genomic sequencing of SARS-CoV-2: a guide to implementation for maximum impact on public health ISBN 978-92-4-001844-0 (electronic version) ISBN 978-92-4-001845-7 (print version) © World Health Organization 2021 Some rights reserved. This work is available under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 IGO licence (CC BY-NC-SA 3.0 IGO; https://creativecommons.org/licenses/by-nc-sa/3.0/igo). Under the terms of this licence, you may copy, redistribute and adapt the work for non-commercial purposes, provided the work is appropriately cited, as indicated below. In any use of this work, there should be no suggestion that WHO endorses any specific organization, products or services. The use of the WHO logo is not permitted. If you adapt the work, then you must license your work under the same or equivalent Creative Commons licence. If you create a translation of this work, you should add the following disclaimer along with the suggested citation: “This translation was not created by the World Health Organization (WHO). WHO is not responsible for the content or accuracy of this translation. The original English edition shall be the binding and authentic edition”. Any mediation relating to disputes arising under the licence shall be conducted in accordance with the mediation rules of the World Intellectual Property Organization (http://www.wipo.int/amc/en/mediation/rules/). -
Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase -
The Biogrid Interaction Database
D470–D478 Nucleic Acids Research, 2015, Vol. 43, Database issue Published online 26 November 2014 doi: 10.1093/nar/gku1204 The BioGRID interaction database: 2015 update Andrew Chatr-aryamontri1, Bobby-Joe Breitkreutz2, Rose Oughtred3, Lorrie Boucher2, Sven Heinicke3, Daici Chen1, Chris Stark2, Ashton Breitkreutz2, Nadine Kolas2, Lara O’Donnell2, Teresa Reguly2, Julie Nixon4, Lindsay Ramage4, Andrew Winter4, Adnane Sellam5, Christie Chang3, Jodi Hirschman3, Chandra Theesfeld3, Jennifer Rust3, Michael S. Livstone3, Kara Dolinski3 and Mike Tyers1,2,4,* 1Institute for Research in Immunology and Cancer, Universite´ de Montreal,´ Montreal,´ Quebec H3C 3J7, Canada, 2The Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario M5G 1X5, Canada, 3Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA, 4School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JR, UK and 5Centre Hospitalier de l’UniversiteLaval´ (CHUL), Quebec,´ Quebec´ G1V 4G2, Canada Received September 26, 2014; Revised November 4, 2014; Accepted November 5, 2014 ABSTRACT semi-automated text-mining approaches, and to en- hance curation quality control. The Biological General Repository for Interaction Datasets (BioGRID: http://thebiogrid.org) is an open access database that houses genetic and protein in- INTRODUCTION teractions curated from the primary biomedical lit- Massive increases in high-throughput DNA sequencing erature for all major model organism species and technologies (1) have enabled an unprecedented level of humans. As of September 2014, the BioGRID con- genome annotation for many hundreds of species (2–6), tains 749 912 interactions as drawn from 43 149 pub- which has led to tremendous progress in the understand- lications that represent 30 model organisms.