Improved Prediction of Signal Peptides: Signalp 3.0

Total Page:16

File Type:pdf, Size:1020Kb

Improved Prediction of Signal Peptides: Signalp 3.0 doi:10.1016/j.jmb.2004.05.028 J. Mol. Biol. (2004) 340, 783–795 Improved Prediction of Signal Peptides: SignalP 3.0 Jannick Dyrløv Bendtsen1, Henrik Nielsen1, Gunnar von Heijne2 and Søren Brunak1* 1Center for Biological Sequence We describe improvements of the currently most popular method for pre- Analysis, BioCentrum-DTU diction of classically secreted proteins, SignalP. SignalP consists of two Building 208, Technical different predictors based on neural network and hidden Markov model University of Denmark algorithms, where both components have been updated. Motivated by DK-2800 Lyngby, Denmark the idea that the cleavage site position and the amino acid composition of the signal peptide are correlated, new features have been included as 2Department of Biochemistry input to the neural network. This addition, combined with a thorough and Biophysics, Stockholm error-correction of a new data set, have improved the performance of the Bioinformatics Center predictor significantly over SignalP version 2. In version 3, correctness of Stockholm University, SE-106 the cleavage site predictions has increased notably for all three organism 91 Stockholm, Sweden groups, eukaryotes, Gram-negative and Gram-positive bacteria. The accuracy of cleavage site prediction has increased in the range 6–17% over the previous version, whereas the signal peptide discrimination improvement is mainly due to the elimination of false-positive predic- tions, as well as the introduction of a new discrimination score for the neural network. The new method has been benchmarked against other available methods. Predictions can be made at the publicly available web server http://www.cbs.dtu.dk/services/SignalP/ q 2004 Elsevier Ltd. All rights reserved. Keywords: signal peptide; signal peptidase I; neural network; hidden *Corresponding author Markov model; SignalP Introduction This is particularly important in this area, where the predictive performance is approaching the Numerous attempts to predict the correct sub- performance calculated from interpretation of cellular location of proteins using machine learning experimental data, which is not always perfect. techniques have been developed.1–9 Computational Incorrect annotation of signal peptide cleavage methods for prediction of N-terminal signal pep- sites in the databases stems from trivial database tides were published around 20 years ago, initially errors, and from peptide sequencing, where it using a weight matrix approach.1,2 Development may be hard to control the level of post-processing of prediction methods shifted to machine learning of the protein by other peptidases after the signal algorithms in the mid 1990s,10,11 with a signifi- peptidase I has made its initial cleavage. Such cant increase in performance.12 SignalP, one of the post-processing typically leads to cleavage site currently most used methods, predicts the pre- assignments shifted downstream relative to the sence of signal peptidase I cleavage sites. For signal true signal peptidase I cleavage site. peptidase II cleavage sites found in lipoproteins, In the process of training the new version of the LipoP predictor has been constructed.13 SignalP SignalP we have generated a new, thoroughly produces both classification and cleavage site curated dataset based on the extraction and redun- assignment, while most of the other methods dancy reduction method published earlier.14 Other classify proteins as secretory or non-secretory. methods were used for cleaning the new dataset, A consistent assessment of the predictive per- and we found a surprisingly high error rate in formance requires a reliable benchmark dataset. Swiss-Prot, where, for example, of the order of 7% of the Gram-positive entries had either wrong Abbreviations used: HMM, hidden Markov model; cleavage site position and/or wrong annotation of NN, neural network. the experimental evidence. Also, we found many E-mail address of the corresponding author: errors in a previously used benchmark set [email protected] (stemming from automatic extraction from Swiss- 0022-2836/$ - see front matter q 2004 Elsevier Ltd. All rights reserved. 784 Signal Peptide Prediction by SignalP Prot),12 and it appears that some programs are in actually include probable propeptides. In such fact better than the performance reported (predic- cases, convertase cleavage sites are mixed together tions are correct, while feature annotation is with signal peptidase I cleavage sites. incorrect). For comparison, we made use of this independent benchmark dataset that was used Removal of spurious cleavage site residues initially for evaluation of five different signal pep- tide predictors.12 Experimental assessment of the effect of certain In the new version of SignalP we have intro- amino acids in the cleavage site region has shown duced novel amino acid composition units as well that rare residues do not allow for efficient as sequence position units in the neural network cleavage.17,18 Examination of amino acids around input layer in order to obtain better performance. the signal peptidase I cleavage site in the data set Moreover, we have changed the window sizes revealed a number of sequences containing amino slightly compared to the previous version. We acids which appear at the cleavage site very rarely. have used fivefold cross-validation tests for direct In the eukaryotic dataset we found and removed comparison to the previous version of SignalP.10 In seven sequences containing lysine (K) and 13 the previous version of SignalP a combination sequences containing arginine (R) at the 21 pos- score, Y, was created from the cleavage site score, ition. All sequences with either lysine or arginine C, and the signal peptide score, S, and used to at position 21 were investigated manually. All of obtain a better prediction of the position of the them, except one, had a predicted cleavage site cleavage site. In the new version, we also use the upstream of the annotated one. Most of these C-score to obtain a better discrimination between sequences probably undergo N-terminal matu- secreted and non-secreted sequences, and have ration by different proteases, either in the trans constructed a new D-score for this classification Golgi network (TGN) or after release from the cell task. The architecture of the hidden Markov as mentioned below in the section on propeptide model (HMM) SignalP has not changed, but the analysis. In one clear case we found an obvious models have been retrained on the new data set, error in the Swiss-Prot entry NPAB_LOCMI. and have increased their performance significantly. According to the annotation, the cleavage site is located between residues 24 and 25 (arginine in position 21), but in the original paper the authors Results and Discussion identified the cleavage to occur between amino acid residues 22 and 23. In this case, the two Generation of data sets residues, ER, are removed by a dipeptidase.19 Furthermore, we removed sequences where As the predictive performance of the earlier other amino acids appeared at position 21 in very SignalP method was quite high, assessment of few of the sequences. For the eukaryotic dataset, potential improvements is critically dependent on the only residues allowed at position 21 were the quality of the data annotation. We generated a alanine (A), cysteine (C), glycine (G), leucine (L), new positive signal peptide data set from Swiss- proline (P), glutamine (Q), serine (S) and threonine Prot15 release 40.0, retaining the negative dataset (T). By allowing only the latter amino acids extracted from the previous work. The method for we might have removed a few true, unusual redundancy reduction was the same as in the pre- sequences. For instance, tyrosine (Y) and histidine vious work14, and was based on the reduction (H) at position 21 were found only in one case principle developed by Hobohm et al.16 Our final each in the entire eukaryotic dataset. We removed positive signal peptide datasets contain 1192, 334 eight sequences with aspartic acid (D) and eight and 153 sequences for eukaryotes, Gram-negative with phenylalanine (F), seven each with glutamic and Gram-positive bacteria, respectively. acid (E) and asparagine (N), respectively. Five In the previous work, we found many errors by with methionine (M), three containing isoleucine detailed inspection of hard-to-learn examples (I) and two sequences containing tryptophan (W) during training and wrongly predicted examples. at position 21 were removed. Some of these are Nevertheless, we were quite sure that even after in fact provable errors, in one of the aspartic acid careful examination in this manner, the dataset examples, CLUS_BOVIN,20 the N-terminal peptide would probably still contain errors obtained from sequencing in the published paper reports the incorrect database annotation and wrongly inter- cleavage as MKTLLLLMGLLLSWESGWA—-ISDK preted laboratory results. ELQEMST…, while Swiss-Prot annotates the Therefore, we developed a new feature-based sequence as being cleaved between D and K, approach where abnormal examples can be thereby changing a common position 21 amino detected by inspecting rare amino acid occurrences acid, alanine, into a rare one. Interestingly, SignalP and outlier physical–chemical properties of signal predicts the cleavage site as reported in the peptides. In the following, we show that the iso- published paper. electric point of signal peptides can help in finding For Gram-positive and Gram-negative bacteria, possible annotation errors and other errors, where only four residues were allowed at position 21. these errors may be due to the fact that some These residues were alanine (A), glycine (G), serine (long) signal peptides annotated in Swiss-Prot (S) and threonine (T).17,18 For the Gram-positive Signal Peptide Prediction by SignalP 785 dataset, this approach removed four sequences IAA2_STRGS is not verified experimentally. It is containing arginine (R), three containing valine predicted to have a cleavage site at position 26 (V), two containing lysine (K) and one sequence (SignalP) or 24 (PSORT). Calculation of pI using each of glutamic acid (E), leucine (L), asparagine the SignalP predicted signal peptide length gave a (N), glutamine (Q), threonine (T) and tyrosine (Y).
Recommended publications
  • Structural and Biochemical Characterizations of Three Potential Drug Targets from Pathogens
    Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 2020 Structural and Biochemical Characterizations of Three Potential Drug Targets from Pathogens LU LU ACTA UNIVERSITATIS UPSALIENSIS ISSN 1651-6214 ISBN 978-91-513-1148-7 UPPSALA urn:nbn:se:uu:diva-435815 2021 Dissertation presented at Uppsala University to be publicly examined in Room A1:111a, BMC, Husargatan 3, Uppsala, Friday, 16 April 2021 at 13:15 for the degree of Doctor of Philosophy. The examination will be conducted in English. Faculty examiner: Christian Cambillau. Abstract Lu, L. 2021. Structural and Biochemical Characterizations of Three Potential Drug Targets from Pathogens. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 2020. 91 pp. Uppsala: Acta Universitatis Upsaliensis. ISBN 978-91-513-1148-7. As antibiotic resistance of various pathogens emerged globally, the need for new effective drugs with novel modes of action became urgent. In this thesis, we focus on infectious diseases, e.g. tuberculosis, malaria, and nosocomial infections, and the corresponding causative pathogens, Mycobacterium tuberculosis, Plasmodium falciparum, and the Gram-negative ESKAPE pathogens that underlie so many healthcare-acquired diseases. Following the same- target-other-pathogen (STOP) strategy, we attempted to comprehensively explore the properties of three promising drug targets. Signal peptidase I (SPase I), existing both in Gram-negative and Gram-positive bacteria, as well as in parasites, is vital for cell viability, due to its critical role in signal peptide cleavage, thus, protein maturation, and secreted protein transport. Three factors, comprising essentiality, a unique mode of action, and easy accessibility, make it an attractive drug target.
    [Show full text]
  • Serine Proteases with Altered Sensitivity to Activity-Modulating
    (19) & (11) EP 2 045 321 A2 (12) EUROPEAN PATENT APPLICATION (43) Date of publication: (51) Int Cl.: 08.04.2009 Bulletin 2009/15 C12N 9/00 (2006.01) C12N 15/00 (2006.01) C12Q 1/37 (2006.01) (21) Application number: 09150549.5 (22) Date of filing: 26.05.2006 (84) Designated Contracting States: • Haupts, Ulrich AT BE BG CH CY CZ DE DK EE ES FI FR GB GR 51519 Odenthal (DE) HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI • Coco, Wayne SK TR 50737 Köln (DE) •Tebbe, Jan (30) Priority: 27.05.2005 EP 05104543 50733 Köln (DE) • Votsmeier, Christian (62) Document number(s) of the earlier application(s) in 50259 Pulheim (DE) accordance with Art. 76 EPC: • Scheidig, Andreas 06763303.2 / 1 883 696 50823 Köln (DE) (71) Applicant: Direvo Biotech AG (74) Representative: von Kreisler Selting Werner 50829 Köln (DE) Patentanwälte P.O. Box 10 22 41 (72) Inventors: 50462 Köln (DE) • Koltermann, André 82057 Icking (DE) Remarks: • Kettling, Ulrich This application was filed on 14-01-2009 as a 81477 München (DE) divisional application to the application mentioned under INID code 62. (54) Serine proteases with altered sensitivity to activity-modulating substances (57) The present invention provides variants of ser- screening of the library in the presence of one or several ine proteases of the S1 class with altered sensitivity to activity-modulating substances, selection of variants with one or more activity-modulating substances. A method altered sensitivity to one or several activity-modulating for the generation of such proteases is disclosed, com- substances and isolation of those polynucleotide se- prising the provision of a protease library encoding poly- quences that encode for the selected variants.
    [Show full text]
  • Structure of the Human Signal Peptidase Complex Reveals the Determinants for Signal Peptide Cleavage
    bioRxiv preprint doi: https://doi.org/10.1101/2020.11.11.378711; this version posted November 11, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Structure of the Human Signal Peptidase Complex Reveals the Determinants for Signal Peptide Cleavage A. Manuel Liaci1, Barbara Steigenberger2,3, Sem Tamara2,3, Paulo Cesar Telles de Souza4, Mariska Gröllers-Mulderij1, Patrick Ogrissek1,5, Siewert J. Marrink4, Richard A. Scheltema2,3, Friedrich Förster1* Abstract The signal peptidase complex (SPC) is an essential membrane complex in the endoplasmic reticulum (ER), where it removes signal peptides (SPs) from a large variety of secretory pre-proteins with exquisite specificity. Although the determinants of this process have been established empirically, the molecular details of SP recognition and removal remain elusive. Here, we show that the human SPC exists in two functional paralogs with distinct proteolytic subunits. We determined the atomic structures of both paralogs using electron cryo- microscopy and structural proteomics. The active site is formed by a catalytic triad and abuts the ER membrane, where a transmembrane window collectively formed by all subunits locally thins the bilayer. This unique architecture generates specificity for thousands of SPs based on the length of their hydrophobic segments. Keywords Signal Peptidase Complex, Signal Peptide, Protein Maturation, Membrane Thinning, cryo-EM, Crosslinking Mass Spectrometry, Molecular Dynamics Simulations, Protein Secretion, ER Translocon 1Cryo-Electron Microscopy, Bijvoet Centre for Biomolecular Research, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands.
    [Show full text]
  • Sequence and Functional Analysis of Schistosoma
    MINING FOR CONSERVED MOTIFS AND SIGNIFICANT FUNCTIONS IN S. MANSONI CERCARIAL SECRETIONS Amy L. Schmidbauer Submitted to the faculty of the School of Informatics in partial fulfillment of the requirements for the degree Master of Science in Bioinformatics, Indiana University December 30, 2006 ii Accepted by the Faculty of Indiana University, in partial fulfillment of the requirements for the degree of Master of Science in Bioinformatics. (Committee Chair’s signature)_______________________ Sean D. Mooney, Ph.D., Chair Master’s Thesis Committee (Second member’s signature)________________________ Xiaoman Shawn Li, Ph.D. (Third member’s signature)__________ _______________ William J. Sullivan, Ph.D. ii © 2006 Amy L. Schmidbauer All Rights Reserved iii ACKNOWLEDGMENTS This project would not have been possible without the guidance and support of many people including faculty, advisors, colleagues, friends and family. I am extremely grateful to my advisor, Dr. Sean Mooney, for welcoming me into his laboratory as a graduate student, and for providing, not only computing resources, but continued support, suggestions, and guidance as, what started out as an independent study project, grew into what became this thesis. I extend my sincere appreciation to Dr. Giselle Knudsen, an honorary member of my thesis committee, for the original project inspiration, for her enthusiastic encouragement, insight, and direction throughout the project, and for her thoughtful review of this thesis. I would like also like to extend a heartfelt thank you to Dr. William Sullivan and Dr. Xiaoman Li for their willingness to lend their time to reviewing my thesis and for the insightful feedback they provided. For statistical expertise and support I would like to extend my deepest appreciation to Dr.
    [Show full text]
  • Bunyamwera Orthobunyavirus Glycoprotein Precursor Is Processed by Cellular Signal Peptidase and Signal Peptide Peptidase
    Bunyamwera orthobunyavirus glycoprotein precursor is processed by cellular signal peptidase and signal peptide peptidase Xiaohong Shia,1, Catherine H. Bottingb, Ping Lia, Mark Niglasb, Benjamin Brennana, Sally L. Shirranb, Agnieszka M. Szemiela, and Richard M. Elliotta,2 aMedical Research Council–University of Glasgow Centre for Virus Research, University of Glasgow, Glasgow G61 1QH, United Kingdom; and bBiomedical Sciences Research Complex, University of St. Andrews, St. Andrews KY16 9ST, United Kingdom Edited by Peter Palese, Icahn School of Medicine at Mount Sinai, New York, NY, and approved June 17, 2016 (received for review February 29, 2016) The M genome segment of Bunyamwera virus (BUNV)—the pro- (II and IV) (Fig. S1A), and its N-terminal domain (I) is required totype of both the Bunyaviridae family and the Orthobunyavirus for BUNV replication (8). genus—encodes the glycoprotein precursor (GPC) that is proteo- Cleavage of BUNV GPC is mediated by host proteases, but the lytically cleaved to yield two viral structural glycoproteins, Gn and Gc, details of which proteases are involved and the precise cleavage sites and a nonstructural protein, NSm. The cleavage mechanism of ortho- have not been clarified. Experimental data on GPC processing have bunyavirus GPCs and the host proteases involved have not been only been reported for snowshoe hare orthobunyavirus (SSHV); the clarified. In this study, we investigated the processing of BUNV GPC C terminus of SSHV Gn was determined by C-terminal amino acid and found that both NSm and Gc proteins were cleaved at their own sequencing to be an arginine (R) residue at position 299 (9) (Fig.
    [Show full text]
  • Intrinsic Evolutionary Constraints on Protease Structure, Enzyme
    Intrinsic evolutionary constraints on protease PNAS PLUS structure, enzyme acylation, and the identity of the catalytic triad Andrew R. Buller and Craig A. Townsend1 Departments of Biophysics and Chemistry, The Johns Hopkins University, Baltimore MD 21218 Edited by David Baker, University of Washington, Seattle, WA, and approved January 11, 2013 (received for review December 6, 2012) The study of proteolysis lies at the heart of our understanding of enzyme evolution remain unanswered. Because evolution oper- biocatalysis, enzyme evolution, and drug development. To un- ates through random forces, rationalizing why a particular out- derstand the degree of natural variation in protease active sites, come occurs is a difficult challenge. For example, the hydroxyl we systematically evaluated simple active site features from all nucleophile of a Ser protease was swapped for the thiol of Cys at serine, cysteine and threonine proteases of independent lineage. least twice in evolutionary history (9). However, there is not This convergent evolutionary analysis revealed several interre- a single example of Thr naturally substituting for Ser in the lated and previously unrecognized relationships. The reactive protease catalytic triad, despite its greater chemical similarity rotamer of the nucleophile determines which neighboring amide (9). Instead, the Thr proteases generate their N-terminal nu- can be used in the local oxyanion hole. Each rotamer–oxyanion cleophile through a posttranslational modification: cis-autopro- hole combination limits the location of the moiety facilitating pro- teolysis (10, 11). These facts constitute clear evidence that there ton transfer and, combined together, fixes the stereochemistry of is a strong selective pressure against Thr in the catalytic triad that catalysis.
    [Show full text]
  • Secretory Signal Peptide Modification for Optimized Antibody-Fragment Expression-Secretion in Leishmania Tarentolae Stephan Klatt1,2 and Zoltán Konthur1*
    Klatt and Konthur Microbial Cell Factories 2012, 11:97 http://www.microbialcellfactories.com/content/11/1/97 RESEARCH Open Access Secretory signal peptide modification for optimized antibody-fragment expression-secretion in Leishmania tarentolae Stephan Klatt1,2 and Zoltán Konthur1* Abstract Background: Secretory signal peptides (SPs) are well-known sequence motifs targeting proteins for translocation across the endoplasmic reticulum membrane. After passing through the secretory pathway, most proteins are secreted to the environment. Here, we describe the modification of an expression vector containing the SP from secreted acid phosphatase 1 (SAP1) of Leishmania mexicana for optimized protein expression-secretion in the eukaryotic parasite Leishmania tarentolae with regard to recombinant antibody fragments. For experimental design the online tool SignalP was used, which predicts the presence and location of SPs and their cleavage sites in polypeptides. To evaluate the signal peptide cleavage site as well as changes of expression, SPs were N-terminally linked to single-chain Fragment variables (scFv’s). The ability of L. tarentolae to express complex eukaryotic proteins with highly diverse post-translational modifications and its easy bacteria-like handling, makes the parasite a promising expression system for secretory proteins. Results: We generated four vectors with different SP-sequence modifications based on in-silico analyses with SignalP in respect to cleavage probability and location, named pLTEX-2 to pLTEX-5. To evaluate their functionality, we cloned four individual scFv-fragments into the vectors and transfected all 16 constructs into L. tarentolae. Independently from the expressed scFv, pLTEX-5 derived constructs showed the highest expression rate, followed by pLTEX-4 and pLTEX-2, whereas only low amounts of protein could be obtained from pLTEX-3 clones, indicating dysfunction of the SP.
    [Show full text]
  • Supporting Information
    Comparative genomic and transcriptomic analysis of Wangiella dermatitidis, a major cause of phaeohyphomycosis and a model black yeast human pathogen Zehua Chen*, Diego A. Martinez*, Sharvari Gujja*, Sean M. Sykes*, Qiandong Zeng*, Paul J. Szaniszlo§, Zheng Wang†,1, Christina A. Cuomo*,1 *Broad Institute of MIT and Harvard, Cambridge, MA 02142. §The Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712. †Center for Bio/Molecular Science and Engineering, Naval Research Laboratory, Washington, D.C. 20375. 1Corresponding authors: [email protected] and [email protected] Data availability: The assembly and annotation of Wangiella dermatitidis are available at the NCBI nucleotide database under the accession number AFPA01000000; RNA-Seq differential expression analysis of pH stress is available at the NCBI GEO database under record GSE51646. DOI: 10.1534/g3.113.009241 Figure S1 Independent expansion of MFS and APC transporter families in W. dermatitidis and selected aspergilli. (A) Average number of genes per genome for different category of MFS and APC families (Core families are the ortholog clusters shared by all four fungal groups; Shared, present in at least two out of the four fungal groups; Unique, unique to each group, including species-specific paralogous clusters and unclustered genes). (B) Enrichment of different category of MFS and APC transporters under different stress conditions (low pH or radiation). A positive normalized enrichment score (NES) indicates enrichment under stress conditions (pH 2.5 or with radiation), and a negative score indicates enrichment underChen normal et al, Figure conditions S1 (pH 6 or no radiation). Significant enrichments noted with **: q-value < 0.05; ***: q-value < 0.01.
    [Show full text]
  • Roles of Secreted Virulence Factors in Pathogenicity of Haemophilus Influenzae: a Dissertation
    University of Massachusetts Medical School eScholarship@UMMS GSBS Dissertations and Theses Graduate School of Biomedical Sciences 2011-05-12 Roles of Secreted Virulence Factors in Pathogenicity of Haemophilus Influenzae: A Dissertation Charles V. Rosadini University of Massachussetts Medical School Let us know how access to this document benefits ou.y Follow this and additional works at: https://escholarship.umassmed.edu/gsbs_diss Part of the Amino Acids, Peptides, and Proteins Commons, Bacteria Commons, Biological Factors Commons, Microbiology Commons, and the Respiratory System Commons Repository Citation Rosadini CV. (2011). Roles of Secreted Virulence Factors in Pathogenicity of Haemophilus Influenzae: A Dissertation. GSBS Dissertations and Theses. https://doi.org/10.13028/4d0p-mp61. Retrieved from https://escholarship.umassmed.edu/gsbs_diss/541 This material is brought to you by eScholarship@UMMS. It has been accepted for inclusion in GSBS Dissertations and Theses by an authorized administrator of eScholarship@UMMS. For more information, please contact [email protected]. ROLES OF SECRETED VIRULENCE FACTORS IN PATHOGENICITY OF HAEMOPHILUS INFLUENZAE A Dissertation Presented By Charles Victor Rosadini Submitted to the Faculty of the University of Massachusetts Graduate School of Biomedical Sciences, Worcester in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 12th, 2011 Molecular Genetics and Microbiology ROLES OF SECRETED VIRULENCE FACTORS IN PATHOGENICITY OF HAEMOPHILUS INFLUENZAE A Dissertation Presented By Charles Victor Rosadini The signatures of the Dissertation Defense Committee signifies completion and approval as to style and content of the Dissertation _________________________ Dr. Brian J. Akerley, Thesis Advisor _________________________ Dr. Jon Goguen, Member of Committee _________________________ Dr. Christopher Sassetti, Member of Committee _________________________ Dr.
    [Show full text]
  • Shi 2016 PNAS Cleavage AAM.Pdf (845.0Kb)
    1 Bunyamwera Orthobunyavirus Glycoprotein Precursor Is Processed by 2 Cellular Signal Peptidase and Signal Peptide Peptidase 3 4 Xiaohong Shia1, Catherine H. Bottingb, Ping Lia, Mark Niglasb, Benjamin Brennana, Sally L. 5 Shirranb, Agnieszka M. Szemiela and Richard M. Elliotta2 6 7 aMRC-University of Glasgow Centre for Virus Research, University of Glasgow, Glasgow G61 8 1QH, United Kingdom 9 bBiomedical Sciences Research Complex, University of St Andrews, St Andrews, KY16 9ST, 10 United Kingdom 11 12 1To whom Correspondence should be addressed. Email: [email protected] 13 2 This paper is dedicated to the memory of our colleague Richard M. Elliott who died on 14 June 5 2015 whist this work was ongoing. 15 Running title: Bunyamwera virus glycoprotein precursor processing 16 Author contributions: X.S., and R.M.E. designed research; X.S., P.L., M.N., B.B., and A.S. 17 performed research; C.H.B., and S.L.S. performed MS; X.S. and R.M.E. wrote the Paper. 1 18 Significance 19 Bunyamwera virus (BUNV) is the prototype of the Orthobunyavirus genus and Bunyaviridae 20 family that contains important human and animal pathogens. The cleavage mechanism of 21 orthobunyavirus glycoprotein precursor (GPC) and the host proteases involved have not 22 been clarified. Here we found that NSm and Gc contain their own internal signal peptides 23 (SPs) which mediate the GPC cleavage by host signal peptidase (SPase) and signal peptide 24 peptidase (SPP). Furthermore, the NSm domain-I (SPNSm) plays an important post-cleavage 25 role in cell fusion. Our data clarified the implication of host proteases in the processing of 26 the orthobunyavirus GPC.
    [Show full text]
  • Cleavage Entropy As Quantitative Measure of Protease Specificity
    Cleavage Entropy as Quantitative Measure of Protease Specificity Julian E. Fuchs, Susanne von Grafenstein, Roland G. Huber, Michael A. Margreiter, Gudrun M. Spitzer, Hannes G. Wallnoefer, Klaus R. Liedl* Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria Abstract A purely information theory-guided approach to quantitatively characterize protease specificity is established. We calculate an entropy value for each protease subpocket based on sequences of cleaved substrates extracted from the MEROPS database. We compare our results with known subpocket specificity profiles for individual proteases and protease groups (e.g. serine proteases, metallo proteases) and reflect them quantitatively. Summation of subpocket-wise cleavage entropy contributions yields a measure for overall protease substrate specificity. This total cleavage entropy allows ranking of different proteases with respect to their specificity, separating unspecific digestive enzymes showing high total cleavage entropy from specific proteases involved in signaling cascades. The development of a quantitative cleavage entropy score allows an unbiased comparison of subpocket-wise and overall protease specificity. Thus, it enables assessment of relative importance of physicochemical and structural descriptors in protease recognition. We present an exemplary application of cleavage entropy in tracing substrate specificity in protease evolution. This highlights the wide range of substrate promiscuity within homologue proteases and hence the heavy impact of a limited number of mutations on individual substrate specificity. Citation: Fuchs JE, von Grafenstein S, Huber RG, Margreiter MA, Spitzer GM, et al. (2013) Cleavage Entropy as Quantitative Measure of Protease Specificity. PLoS Comput Biol 9(4): e1003007. doi:10.1371/journal.pcbi.1003007 Editor: James M.
    [Show full text]
  • Structural and Kinetic Analysis of Escherichia Coli Signal Peptide Peptidase A
    Structural and Kinetic Analysis of Escherichia coli Signal Peptide Peptidase A by Apollos C. Kim B.A. (Psychology), Simon Fraser University, 2001 B.Sc. (Biology), Seoul National University, Korea, 1992 Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy in the Department of Molecular Biology and Biochemistry Faculty of Science Apollos C. Kim 2013 SIMON FRASER UNIVERSITY Summer 2013 All rights reserved. However, in accordance with the Copyright Act of Canada, this work may be reproduced, without authorization, under the conditions for “Fair Dealing.” Therefore, limited reproduction of this work for the purposes of private study, research, criticism, review and news reporting is likely to be in accordance with the law, particularly if cited appropriately. Approval Name: Apollos C. Kim Degree: Doctor of Philosophy (Molecular Biology and Biochemistry) Title of Thesis: Structural and kinetic analysis of Escherichia coli signal peptide peptidase A Examining Committee: Chair: Frederic Pio, Associate Professor Mark Paetzel Senior Supervisor Associate Professor Nicholas Harden Supervisor Professor Edgar C. Young Supervisor Associate Professor Dipankar Sen Internal Examiner Professor Ross MacGillivray External Examiner Professor Department of Biochemistry and Molecular Biology University of British Columbia Date Defended/Approved: June 19, 2013 ii Partial Copyright Licence iii Abstract Secretory proteins contain a signal peptide at their N-terminus. The signal peptide functions to guide proteins to the membrane and is cleaved off by signal peptidase. The remnant signal peptides must be removed from the membrane to prevent their accumulation which can lead to membrane destabilization. Escherichia coli signal peptide peptidase A (SppAEC) has been identified as a major enzyme that processes the remnant signal peptide to smaller fragments.
    [Show full text]