Appub0358.Pdf
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Supp. Table S2: Domains and Protein Families with a Putative Role in Host-Symbiont Interactions
Supp. Table S2: Domains and protein families with a putative role in host-symbiont interactions. The domains and protein families listed here were included in the comparisons in Figure 5 and Supp. Figure S5, which show the percentage of the respective protein groups in the Riftia symbiont metagenome and in metagenomes of other symbiotic and free-living organisms. % bacterial, total number bacterial: Percentage and total number of bacterial species in which this domain is found in the SMART database (January 2019). Domain name Pfam/SMART % bacterial (total Literature/comment annotation number bacterial) Alpha-2- alpha-2- A2M: 42.05% (2057) A2Ms: protease inhibitors which are important for eukaryotic macroglobulin macroglobulin innate immunity, if present in prokaryotes apparently fulfill a family (A2M), similar role, e.g. protection against host proteases (1) including N- terminal MG1 domain ANAPC Anaphase- APC2: 0 Ubiquitin ligase, important for cell cycle control in eukaryotes (2) promoting complex Bacterial proteins might interact with ubiquitination pathways in subunits the host (3) Ankyrin Ankyrin repeats 10.88% (8348) Mediate protein-protein interactions without sequence specificity (4) Sponge symbiont ankyrin-repeat proteins inhibit amoebal phagocytosis (5) Present in sponge microbiome metatranscriptomes, putative role in symbiont-host interactions (6) Present in obligate intracellular amoeba symbiont Candidatus Amoebophilus asiaticus genome, probable function in interactions with the host (7) Armadillo Armadillo repeats 0.83% (67) -
Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase -
Eukaryotic Genome Annotation
Comparative Features of Multicellular Eukaryotic Genomes (2017) (First three statistics from www.ensembl.org; other from original papers) C. elegans A. thaliana D. melanogaster M. musculus H. sapiens Species name Nematode Thale Cress Fruit Fly Mouse Human Size (Mb) 103 136 143 3,482 3,555 # Protein-coding genes 20,362 27,655 13,918 22,598 20,338 (25,498 (13,601 original (30,000 (30,000 original est.) original est.) original est.) est.) Transcripts 58,941 55,157 34,749 131,195 200,310 Gene density (#/kb) 1/5 1/4.5 1/8.8 1/83 1/97 LINE/SINE (%) 0.4 0.5 0.7 27.4 33.6 LTR (%) 0.0 4.8 1.5 9.9 8.6 DNA Elements 5.3 5.1 0.7 0.9 3.1 Total repeats 6.5 10.5 3.1 38.6 46.4 Exons % genome size 27 28.8 24.0 per gene 4.0 5.4 4.1 8.4 8.7 average size (bp) 250 506 Introns % genome size 15.6 average size (bp) 168 Arabidopsis Chromosome Structures Sorghum Whole Genome Details Characterizing the Proteome The Protein World • Sequencing has defined o Many, many proteins • How can we use this data to: o Define genes in new genomes o Look for evolutionarily related genes o Follow evolution of genes ▪ Mixing of domains to create new proteins o Uncover important subsets of genes that ▪ That deep phylogenies • Plants vs. animals • Placental vs. non-placental animals • Monocots vs. dicots plants • Common nomenclature needed o Ensure consistency of interpretations InterPro (http://www.ebi.ac.uk/interpro/) Classification of Protein Families • Intergrated documentation resource for protein super families, families, domains and functional sites o Mitchell AL, Attwood TK, Babbitt PC, et al. -
Appendix 2. Significantly Differentially Regulated Genes in Term Compared with Second Trimester Amniotic Fluid Supernatant
Appendix 2. Significantly Differentially Regulated Genes in Term Compared With Second Trimester Amniotic Fluid Supernatant Fold Change in term vs second trimester Amniotic Affymetrix Duplicate Fluid Probe ID probes Symbol Entrez Gene Name 1019.9 217059_at D MUC7 mucin 7, secreted 424.5 211735_x_at D SFTPC surfactant protein C 416.2 206835_at STATH statherin 363.4 214387_x_at D SFTPC surfactant protein C 295.5 205982_x_at D SFTPC surfactant protein C 288.7 1553454_at RPTN repetin solute carrier family 34 (sodium 251.3 204124_at SLC34A2 phosphate), member 2 238.9 206786_at HTN3 histatin 3 161.5 220191_at GKN1 gastrokine 1 152.7 223678_s_at D SFTPA2 surfactant protein A2 130.9 207430_s_at D MSMB microseminoprotein, beta- 99.0 214199_at SFTPD surfactant protein D major histocompatibility complex, class II, 96.5 210982_s_at D HLA-DRA DR alpha 96.5 221133_s_at D CLDN18 claudin 18 94.4 238222_at GKN2 gastrokine 2 93.7 1557961_s_at D LOC100127983 uncharacterized LOC100127983 93.1 229584_at LRRK2 leucine-rich repeat kinase 2 HOXD cluster antisense RNA 1 (non- 88.6 242042_s_at D HOXD-AS1 protein coding) 86.0 205569_at LAMP3 lysosomal-associated membrane protein 3 85.4 232698_at BPIFB2 BPI fold containing family B, member 2 84.4 205979_at SCGB2A1 secretoglobin, family 2A, member 1 84.3 230469_at RTKN2 rhotekin 2 82.2 204130_at HSD11B2 hydroxysteroid (11-beta) dehydrogenase 2 81.9 222242_s_at KLK5 kallikrein-related peptidase 5 77.0 237281_at AKAP14 A kinase (PRKA) anchor protein 14 76.7 1553602_at MUCL1 mucin-like 1 76.3 216359_at D MUC7 mucin 7, -
Armc5 Deletion Causes Developmental Defects and Compromises T-Cell Immune Responses
ARTICLE Received 26 Jan 2016 | Accepted 4 Nov 2016 | Published 7 Feb 2017 DOI: 10.1038/ncomms13834 OPEN Armc5 deletion causes developmental defects and compromises T-cell immune responses Yan Hu 1,*, Linjiang Lao1,*, Jianning Mao1, Wei Jin1, Hongyu Luo1, Tania Charpentier2, Shijie Qi1, Junzheng Peng1, Bing Hu3, Mieczyslaw Martin Marcinkiewicz4, Alain Lamarre2 & Jiangping Wu1,5 Armadillo repeat containing 5 (ARMC5) is a cytosolic protein with no enzymatic activities. Little is known about its function and mechanisms of action, except that gene mutations are associated with risks of primary macronodular adrenal gland hyperplasia. Here we map Armc5 expression by in situ hybridization, and generate Armc5 knockout mice, which are small in body size. Armc5 knockout mice have compromised T-cell proliferation and differentiation into Th1 and Th17 cells, increased T-cell apoptosis, reduced severity of experimental autoimmune encephalitis, and defective immune responses to lymphocytic choriomeningitis virus infection. These mice also develop adrenal gland hyperplasia in old age. Yeast 2-hybrid assays identify 16 ARMC5-binding partners. Together these data indicate that ARMC5 is crucial in fetal development, T-cell function and adrenal gland growth homeostasis, and that the functions of ARMC5 probably depend on interaction with multiple signalling pathways. 1 Centre de recherche (CR), Centre hospitalier de l’Universite´ de Montre´al (CHUM), 900 Rue Saint Denis, Montre´al, Que´bec, Canada H2X 0A9. 2 Institut national de la recherche scientifique—Institut Armand-Frappier (INRS-IAF), 531 Boul. des Prairies, Laval, Que´bec, Canada H7V 1B7. 3 Anatomic Pathology, AmeriPath Central Florida, 4225 Fowler Ave. Tampa, Orlando, Florida 33617, USA. -
An Unusual Hydrophobic Core Confers Extreme Flexibility to HEAT Repeat Proteins
1596 Biophysical Journal Volume 99 September 2010 1596–1603 An Unusual Hydrophobic Core Confers Extreme Flexibility to HEAT Repeat Proteins Christian Kappel, Ulrich Zachariae, Nicole Do¨lker, and Helmut Grubmu¨ller* Department of Theoretical and Computational Biophysics, Max Planck Institute for Biophysical Chemistry, Go¨ttingen, Germany ABSTRACT Alpha-solenoid proteins are suggested to constitute highly flexible macromolecules, whose structural variability and large surface area is instrumental in many important protein-protein binding processes. By equilibrium and nonequilibrium molecular dynamics simulations, we show that importin-b, an archetypical a-solenoid, displays unprecedentedly large and fully reversible elasticity. Our stretching molecular dynamics simulations reveal full elasticity over up to twofold end-to-end extensions compared to its bound state. Despite the absence of any long-range intramolecular contacts, the protein can return to its equilibrium structure to within 3 A˚ backbone RMSD after the release of mechanical stress. We find that this extreme degree of flexibility is based on an unusually flexible hydrophobic core that differs substantially from that of structurally similar but more rigid globular proteins. In that respect, the core of importin-b resembles molten globules. The elastic behavior is dominated by nonpolar interactions between HEAT repeats, combined with conformational entropic effects. Our results suggest that a-sole- noid structures such as importin-b may bridge the molecular gap between completely structured and intrinsically disordered proteins. INTRODUCTION Solenoid proteins, consisting of repeating arrays of simple on repeat proteins focus on the folding and unfolding basic structural motifs, account for >5% of the genome of mechanism (14–16), an atomic force microscopy study on multicellular organisms (1). -
Supporting Information
Supporting Information Barquilla et al. 10.1073/pnas.0802668105 SI Materials and Methods The purification of recombinant GST proteins using glutathione Bioinformatics, Plasmids Constructs, and Cell Lines. Searches for T. Sepharose 4B (Amersham) was performed according to the brucei TOR, raptor, AVO3, and FKBP12 orthologues were manufacturer’s instructions and the proteins were dialyzed performed using GeneDB (www.genedb.org) and NCBI Blast against PBS. (www.ncbi.nlm.nih.gov/BLAST/). Motif analyses were carried out using InterPro and Prosite. Antibodies. Mouse anti-TbFKBP12 monoclonal antibodies and GeneDB database accession numbers of these orthologues rabbit polyclonal anti-TOR antibodies were produced by inject- are: TbTOR1 (Tb10.6k15.2060), TbTOR2 (Tb927.4.420), ing the recombinant proteins into a BALB-C mouse or rabbit TbTOR-like 1 (Tb927.4.800), TbTOR-like 2 (Tb927.1.1930), using standard immunization protocols. Experiments involving TbRaptor (Tb11.03.0460), TbAVO3 (Tb927.8.3200), animals were approved by the National Animal Research Com- TbFKBP12 (Tb927.7.3420), and TbRaptor-like (Tb927.1.2190). mittee. Anti-TbFKBP12 antibodies were raised against the DNA fragments (400–700 bp) of the TbTOR1, TbTOR2, whole protein fused to GST, and hybridomes F12–3C7H11 and TbTOR-like 1, TbRaptor, TbAVO3, TbFKBP12, and TbRaptor- F12–7D10 H8 were selected by standard monoclonal selection like were amplified by PCR using pairs of specific primers (Table procedures as anti-TbFKBP12 monoclonal antibody producers. S1) and cloned into pGEM-T (Promega). All constructs were Anti-TbTOR antibodies were raised against nonhomologous sequenced to confirm that the subcloned DNA fragments had no TbTOR CЈ-terminal regions between the kinase domain and the mutations. -
Comprehensive Computational Analysis of Leucine-Rich Repeat (LRR) Proteins Encoded in the Genome of the Diatom Phaeodactylum Tricornutum
Erschienen in: Marine Genomics ; 21 (2015). - S. 43-51 https://dx.doi.org/10.1016/j.margen.2015.02.007 Comprehensive computational analysis of leucine-rich repeat (LRR) proteins encoded in the genome of the diatom Phaeodactylum tricornutum Birgit Schulze ⁎, Matthias T. Buhmann 1, Carolina Río Bártulos, Peter G. Kroth Department of Biological Sciences, Universität Konstanz, 78457 Konstanz, Germany abstract We have screened the genome of the marine diatom Phaeodactylum tricornutum for gene models encoding proteins exhibiting leucine rich repeat (LRR) structures. In order to reveal the functionality of these proteins, their amino acid sequences were scanned for known domains and for homologies to other proteins. Additionally, proteins were categorized into different LRR families according to the variable sequence part of their LRR. This approach enabled us to group proteins with potentially similar functionality and to classify also LRR proteins Keywords: where no characterized homologues in other organisms exist. Most interestingly, we were able to indentify LRR several transmembrane LRR proteins, which are likely to function as receptor like molecules. However, none Receptor-like protein of them carry additional domains that are typical for mammalian or plant like receptors. Thus, the respective Adhesion protein signal recognition pathways seem to be substantially different in diatoms. Moreover, P. tricornutum encodes a RanGAP family of secreted LRR proteins likely to function as adhesion or binding proteins as part of the extracellular matrix. Additionally, intracellular LRR only proteins were divided into proteins similar to RasGTPase activators, regulators of nuclear transport, and mitotic regulation. Our approach allowed us to draw a detailed picture of the conservation and diversification of LRR proteins in the marine diatom P. -
Armadillo Repeat Proteins: Beyond the Animal Kingdom Coates, Juliet
University of Birmingham Armadillo repeat proteins: beyond the animal kingdom Coates, Juliet DOI: 10.1016/S0962-8924(03)00167-3 Document Version Early version, also known as pre-print Citation for published version (Harvard): Coates, J 2003, 'Armadillo repeat proteins: beyond the animal kingdom', Trends in Cell Biology, vol. 13, no. 9, pp. 463-471. https://doi.org/10.1016/S0962-8924(03)00167-3 Link to publication on Research at Birmingham portal General rights Unless a licence is specified above, all rights (including copyright and moral rights) in this document are retained by the authors and/or the copyright holders. The express permission of the copyright holder must be obtained for any use of this material other than for purposes permitted by law. •Users may freely distribute the URL that is used to identify this publication. •Users may download and/or print one copy of the publication from the University of Birmingham research portal for the purpose of private study or non-commercial research. •User may use extracts from the document in line with the concept of ‘fair dealing’ under the Copyright, Designs and Patents Act 1988 (?) •Users may not further distribute the material nor use it for the purposes of commercial gain. Where a licence is displayed above, please note the terms and conditions of the licence govern your use of this document. When citing, please reference the published version. Take down policy While the University of Birmingham exercises care and attention in making items available there are rare occasions when an item has been uploaded in error or has been deemed to be commercially or otherwise sensitive. -
Rigid Fusions of Designed Helical Repeat Binding Proteins Efficiently
www.nature.com/scientificreports OPEN Rigid fusions of designed helical repeat binding proteins efciently protect a binding surface from crystal contacts Patrick Ernst1, Annemarie Honegger1, Floor van der Valk1, Christina Ewald1,2, Peer R. E. Mittl1 & Andreas Plückthun1* Designed armadillo repeat proteins (dArmRPs) bind extended peptides in a modular way. The consensus version recognises alternating arginines and lysines, with one dipeptide per repeat. For generating new binding specifcities, the rapid and robust analysis by crystallography is key. Yet, we have previously found that crystal contacts can strongly infuence this analysis, by displacing the peptide and potentially distorting the overall geometry of the scafold. Therefore, we now used protein design to minimise these efects and expand the previously described concept of shared helices to rigidly connect dArmRPs and designed ankyrin repeat proteins (DARPins), which serve as a crystallisation chaperone. To shield the peptide-binding surface from crystal contacts, we rigidly fused two DARPins to the N- and C-terminal repeat of the dArmRP and linked the two DARPins by a disulfde bond. In this ring-like structure, peptide binding, on the inside of the ring, is very regular and undistorted, highlighting the truly modular binding mode. Thus, protein design was utilised to construct a well crystallising scafold that prevents interference from crystal contacts with peptide binding and maintains the equilibrium structure of the dArmRP. Rigid DARPin-dArmRPs fusions will also be useful when chimeric binding proteins with predefned geometries are required. Designed Armadillo repeat proteins (dArmRPs) bind to elongated peptide sequences and may eventually com- plement classical detection antibodies (a review on alternative binding scafolds can be found in ref.1, their use in therapeutics is reviewed in ref.2). -
'Design of Armadillo Repeat Protein Scaffolds'
Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2008 Design of armadillo repeat protein scaffolds Parmeggiani, F Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-4818 Dissertation Published Version Originally published at: Parmeggiani, F. Design of armadillo repeat protein scaffolds. 2008, University of Zurich, Faculty of Medicine. Design of Armadillo Repeat Protein Scaffolds Dissertation zur Erlangung der naturwissenschaftlichen Doktorwürde (Dr. sc. nat.) vorgelegt der Mathematisch-naturwissenschaftlichen Fakultät der Universität Zürich von Fabio Parmeggiani aus Italien Promotionskomitee Prof. Dr. Andreas Plückthun (Leitung der Dissertation) Prof. Dr. Markus Grütter Prof. Dr. Donald Hilvert Zürich, 2008 To my grandfather, who showed me the essence of curiosity “It is easy to add more of the same to old knowledge, but it is difficult to explain the new” Benno Müller-Hill Table of Contents 1 Table of contents Summary 3 Chapter 1: Peptide binders 7 Part I: Peptide binders 8 Modular interaction domains as natural peptide binders 9 SH2 domains 10 PTB domains 10 PDZ domains 12 14-3-3 domains 12 SH3 domains 12 WW domains 12 Peptide recognition by major histocompatibility complexes 13 Repeat proteins targeting peptides 15 Armadillo repeat proteins 16 Tetratricopeptide repeat proteins (TPR) 16 Beta propellers 16 Peptide binding by antibodies 18 Alternative scaffolds: a new approach to peptide binding 20 Part -
Dissecting and Reprogramming the Folding and Assembly of Tandem-Repeat Proteins
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Apollo Dissecting and reprogramming the folding and assembly of tandem-repeat proteins Pamela J.E. Rowling1, Elin M. Sivertsson1, Albert Perez-Riba1, Ewan R. G. Main2*, Laura S. Itzhaki1* 1University of Cambridge Department of Pharmacology, Tennis Court Road, Cambridge CB2 1PD, UK. 2School of Biological and Chemical Sciences, Queen Mary, University of London, London E1 4NS, UK. *To whom correspondence should be addressed: [email protected] and [email protected] Studying protein folding and protein design in globular proteins presents significant challenges because of the two related features, topological complexity and cooperativity. In contrast, tandem-repeat proteins have regular and modular structures composed of linearly arrayed motifs. This means that the biophysics of even giant repeat proteins is highly amenable to dissection and to rational design. Here we discuss what has been learnt about the folding mechanisms of tandem- repeat proteins. The defining features that have emerged are: (i) accessibility of multiple distinct routes between denatured and native states, both at equilibrium and under kinetic conditions; (ii) different routes are favoured for folding versus unfolding; (iii) unfolding energy barriers are broad, reflecting stepwise unraveling of an array repeat by repeat; (iv) highly cooperative unfolding at equilibrium and the potential for exceptionally high thermodynamic stabilities by introducing consensus residues; (v) under force, helical-repeat structures are very weak with non-cooperative unfolding leading to elasticity and buffering effects. This level of understanding should enable us to create repeat proteins with made-to-measure folding mechanisms, in which one can dial into the sequence the order of repeat folding, number of pathways taken, step size (cooperativity) and fine-structure of the kinetic energy barriers.