Spencer Bliven1,2,* Maria Anisimova1,2

Total Page:16

File Type:pdf, Size:1020Kb

Spencer Bliven1,2,* Maria Anisimova1,2 Phylogenetics of Tandem Repeats with Circular HMMs: A Case Study on Armadillo Repeat Proteins Spencer Bliven1,2,* Maria Anisimova1,2 1 Institute for Applied Simulation, Zurich University of Applied Sciences, Wädenswil, Switzerland 2 Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland Abstract Evolution Tandem repeat proteins are characterized by multiple sequential Determining the evolution of tandem repeat proteins is a copies of repeats with significant structural or sequence TRAL availability challenge. The repeats typically have significant homology (for similarity. Tandem repeats evolve via repeat expansion, http://elkeschaper.github.io/tral/ instance, Asn39 is conserved in 51% of human ArmRP; Leu22 duplication and loss, and many protein families exhibit very in 67% (Gul, 2017)). This leads to the hypothesis that most diverse repeat counts. Identifying the complex relationships tandem repeats evolved through duplication and fusion events. A between homologous proteins and between individual repeats is number of mechanisms exist for the duplication/fusion of a challenging task. Using tools developed in our group, we repeats, both individually and collectively through gene present a detailed phylogenetic analysis of the repeats in the duplication/fusion. Combining this with gene-level speciation Armadillo Repeat Protein (ArmRP) family. and duplications creating paralogs can explain the complex The ArmRP family is very diverse, appearing throughout the relationships we see among TR families. eukaryotes and having a wide range of functions. They are well Tandem Repeat Annotation characterized structurally, with ~42 amino acid repeats forming a) 1 b) 1 c) 1 three alpha-helices which assemble into a solenoid structure. One issue with identifying tandem repeats is that the first and ArmRP are exciting candidates for protein design, as they have last repeats are often truncated or atypical. With standard 12 12 12 been shown to bind peptides in a modular manner (Reichen sequence alignment methods, this can lead to missing partial 2016). repeats and incorrect phase identification: 1234 A1B1 A2 B2 A1B1 A2 B2 Phylogenetic analysis of tandem repeats has several unique challenges. Identifying homologous regions is complicated by Canonical phase the repetitive nature of the sequence, which can cause register A1B1 A3 B3 A2 B2 A4 B4 A1 A3B1 B3 A2 A4 B2 B4 A1 A3A2 B1 A4 B3B2 B4 shifts when applying standard alignment tools. We surmount this Key: Duplication Speciation problem using the Tandem Repeat Annotation Library (TRAL), Possible evolutionary histories for two 4-repeat proteins. a) TR expansion occurs before speciation, giving identical TR orders in both species. b) Both species undergo independent a tool for accurately identifying repeats using circular profile Detected phase whole-gene duplications, giving similar subtree patterns in both species. c) Both species hidden Markov models (Schaper 2015). After constructing a undergo independent TR duplications showing different patterns. multiple alignment of the repeats of ArmRP representatives, we (Top) Protein with 3.5 repeats with repeat boundaries assigned from literature. (Bottom) KPNA2 1 infer a phylogenetic tree relating the different ArmRP and use it KPNA4 1 Incorrect tandem repeat identification, with non-standard boundaries, a missing partial KPNA5 1 KPNA1 1 to analyze the conservation and diversification patterns through repeat, and overlapping repeats. KPNA6 1 KPNA1 6 KPNA5 6 evolution, based on the information about tandem repeat KPNA6 6 KPNA2 6 KPNA4 6 number, order and their distribution on phylogenies. Additionally, misalignments near the repeat boundaries can cause KPNA1 9 KPNA5 9 overlapping repeats which must be reconciled. KPNA6 9 KPNA4 9 KPNA2 9 KPNA1 10 These issues are resolved in the Tandem Repeat Annotation KPNA5 10 KPNA6 10 KPNA4 10 Library (TRAL) (Schaper 2015). The library makes use of KPNA2 10 KPNA1 8 KPNA5 8 circular profile hidden Markov models (cpHMM) to capture TR KPNA6 8 KPNA4 8 KPNA2 8 profiles. KPNA1 3 KPNA6 3 KPNA5 3 KPNA2 3 KPNA4 3 KPNA1 7 KPNA5 7 Armadillo Repeat Proteins KPNA6 7 KPNA2 7 KPNA4 7 KPNA1 4 KPNA5 4 KPNA6 4 B M1 ... MN E KPNA2 4 KPNA4 4 KPNA1 2 KPNA6 2 KPNA5 2 KPNA2 2 KPNA4 2 KPNA1 5 KPNA5 5 DN KPNA6 5 DN D1 ... D1 KPNA2 5 KPNA4 5 0.7 Phylogenetic tree for human karyopherin (KPNA) proteins. Each protein has a 10 Arm IN I1 ... IN repeats. Each repeat clusters together, indicating that the order of repeats has not changed since the paralogous expansion. The support for this tree is too weak to draw conclusions about the relationships between repeats. Calculated using IQ-Tree (Nguyen 2015). Circular TR sequence profile HMM (cpHMM). The alignment starts in any position within the TR with equal probability (blue). States representing alignment matches, insertions and API5 3 RQCD1 3 UNC45A 7 CAB39L 1 deletions are included as in the HMMER model (black). Additional transitions are added CAB39 1 CTNNBL1 2 Preliminary tree of human ArmRP proteins CTNNBL1 3 DIAPH1 4 FMNL1 4 from the end of a repeat back to the beginning (red) to align multiple repeats. The alignment FMNL2 4 RQCD1 6 with structural information in the Protein UNC45A 8 CTNNB1 7 JUP 7 UNC45A 1 may end at any position with equal probability (green). UNC45A 6 APC 2 Data Bank. The shown tree does not CTNND2 2 PKP1 2 PKP2 2 Designed ArmRP YIIIM5AII bound to a (KR)5 peptide [5AEI] APC 3 UNC45A 3 explicitly model duplication events. Leaves APC 4 DAAM1 4 RQCD1 1 USO1 8 RQCD1 5 CTNND2 3 are colored by gene. Calculated using PKP1 3 PKP2 3 CTNNBL1 6 HSPBP1 4 CTNNB1 8 JUP 8 IQ-Tree. The cpHMM allows the TR region to start and end at any point APC 7 CTNNB1 10 Armadillo repeat proteins (ArmRP) are composed of ~40 JUP 10 CTNND2 6 DAAM1 1 PKP1 6 PKP2 6 CAB39L 4 in the repeat, making the phase inconsequential. Additionally, it CAB39 4 DAAM1 3 amino acid repeats forming a alpha solenoid structure. Each FMNL1 3 FMNL2 3 USO1 11 CTNNBL1 5 UNC45A 5 HSPBP1 1 prevents overlaps and provides a rigorous statistical model for HSPBP1 2 USO1 3 repeat contains three alpha helices, with a hydrophobic core and USO1 6 USO1 1 USO1 4 USO1 5 CTNNBL1 1 dealing with insertions between and within repeats. CTNNBL1 4 DAAM1 5 DIAPH1 2 conserved binding residues along the third helix. FMNL1 5 FMNL2 5 DIAPH1 5 KPNA1 10 KPNA5 10 KPNA6 10 KPNA4 10 KPNA2 10 1.67 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 TRAL contains functionality for constructing a cpHMM from a CTNNB1 12 JUP 12 APC 5 CTNND2 4 PKP1 4 PKP2 4 CTNNB1 6 JUP 6 set of possibly inconsistent or overlapping seed TR hits. CTNND2 5 PKP1 5 PKP2 5 APC 6 CAB39L 5 CAB39 5 KPNA1 1 KPNA6 1 0.83 CTNNB1 4 JUP 4 CTNNB1 5 L JUP 5 L USO1 10 P I S CTNND2 9 V AG KPNA2 1 G K L I Q V KPNA5 1 A E I GG D WAV KPNA4 1 L T PKP1 9 L QE N N R S L L I Q V A S E E N K V I UNC45A 2 E V E S EA R E L R Q L E E A A PKP2 9 S A P I Q L L E S E V V R Q A D V K S V N R I S Q D S L K I V CTNNB1 2 D N L AY QA RKM I N QKP S PD A E D Q CR T NQQAA E V A K D S L QS T A JUP 2 G EN YM K R R S S R C EQ V P HKR K D K T MF D T I T C G I N F C E USO1 9 P F F T M L S G T K S DM D F MN Q T AM N R N T N F K C F F Q N R N D F C N K T F Q TQ I D F R Y M H F R D N D T CTNNB1 9 Y R M CK Y C H H Q M WC F MC FM I C D Q Y Q 0 H C HAYQHWMWHMHM NHY C CQL F DHH C H I C H MCWWW CWY M I T T H JUP 9 0.25 0.47 0.80 0.89 0.93 0.95 0.99 1.00 1.00 1.00 1.00 1.00 0.95 0.98 0.99 0.99 0.99 1.00 0.98 0.99 1.00 1.00 1.00 1.00 0.99 0.99 0.90 0.96 0.99 1.00 1.00 1.00 0.99 0.99 0.99 1.00 1.00 0.98 0.97 0.96 0.95 0.90 DIAPH1 1 RQCD1 4 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 PKP2 7 PKP1 7 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 1.9 0 CAB39L 3 CAB39 3 PKP1 8 CTNND2 8 PKP2 8 USO1 2 CTNNB1 1 JUP 1 KPNA1 5 KPNA5 5 KPNA6 5 KPNA2 5 KPNA4 5 CAB39L 2 CAB39 2 Consensus Arm repeat.
Recommended publications
  • The Origin of the Eukaryotic Cell Based on Conservation of Existing
    The Origin of the Eukaryotic Albert D. G. de Roos The Beagle Armada Cell Based on Conservation Bioinformatics Division of Existing Interfaces Einsteinstraat 67 3316GG Dordrecht, The Netherlands [email protected] Abstract Current theories about the origin of the eukaryotic Keywords cell all assume that during evolution a prokaryotic cell acquired a Evolution, nucleus, eukaryotes, self-assembly, cellular membranes nucleus. Here, it is shown that a scenario in which the nucleus acquired a plasma membrane is inherently less complex because existing interfaces remain intact during evolution. Using this scenario, the evolution to the first eukaryotic cell can be modeled in three steps, based on the self-assembly of cellular membranes by lipid-protein interactions. First, the inclusion of chromosomes in a nuclear membrane is mediated by interactions between laminar proteins and lipid vesicles. Second, the formation of a primitive endoplasmic reticulum, or exomembrane, is induced by the expression of intrinsic membrane proteins. Third, a plasma membrane is formed by fusion of exomembrane vesicles on the cytoskeletal protein scaffold. All three self-assembly processes occur both in vivo and in vitro. This new model provides a gradual Darwinistic evolutionary model of the origins of the eukaryotic cell and suggests an inherent ability of an ancestral, primitive genome to induce its own inclusion in a membrane. 1 Introduction The origin of eukaryotes is one of the major challenges in evolutionary cell biology. No inter- mediates between prokaryotes and eukaryotes have been found, and the steps leading to eukaryotic endomembranes and endoskeleton are poorly understood. There are basically two competing classes of hypotheses: the endosymbiotic and the autogenic.
    [Show full text]
  • An Unusual Hydrophobic Core Confers Extreme Flexibility to HEAT Repeat Proteins
    1596 Biophysical Journal Volume 99 September 2010 1596–1603 An Unusual Hydrophobic Core Confers Extreme Flexibility to HEAT Repeat Proteins Christian Kappel, Ulrich Zachariae, Nicole Do¨lker, and Helmut Grubmu¨ller* Department of Theoretical and Computational Biophysics, Max Planck Institute for Biophysical Chemistry, Go¨ttingen, Germany ABSTRACT Alpha-solenoid proteins are suggested to constitute highly flexible macromolecules, whose structural variability and large surface area is instrumental in many important protein-protein binding processes. By equilibrium and nonequilibrium molecular dynamics simulations, we show that importin-b, an archetypical a-solenoid, displays unprecedentedly large and fully reversible elasticity. Our stretching molecular dynamics simulations reveal full elasticity over up to twofold end-to-end extensions compared to its bound state. Despite the absence of any long-range intramolecular contacts, the protein can return to its equilibrium structure to within 3 A˚ backbone RMSD after the release of mechanical stress. We find that this extreme degree of flexibility is based on an unusually flexible hydrophobic core that differs substantially from that of structurally similar but more rigid globular proteins. In that respect, the core of importin-b resembles molten globules. The elastic behavior is dominated by nonpolar interactions between HEAT repeats, combined with conformational entropic effects. Our results suggest that a-sole- noid structures such as importin-b may bridge the molecular gap between completely structured and intrinsically disordered proteins. INTRODUCTION Solenoid proteins, consisting of repeating arrays of simple on repeat proteins focus on the folding and unfolding basic structural motifs, account for >5% of the genome of mechanism (14–16), an atomic force microscopy study on multicellular organisms (1).
    [Show full text]
  • Genomic and Evolutionary Exploration of Asgard Archaea
    Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 1861 Genomic and evolutionary exploration of Asgard archaea EVA F. CACERES ACTA UNIVERSITATIS UPSALIENSIS ISSN 1651-6214 ISBN 978-91-513-0761-9 UPPSALA urn:nbn:se:uu:diva-393710 2019 Dissertation presented at Uppsala University to be publicly examined in B22, Biomedical Center (BMC), Husargatan 3, Uppsala, Thursday, 14 November 2019 at 09:15 for the degree of Doctor of Philosophy. The examination will be conducted in English. Faculty examiner: Professor Simonetta Gribaldo (Institut Pasteur, Department of Microbiology). Abstract Caceres, E. F. 2019. Genomic and evolutionary exploration of Asgard archaea. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 1861. 88 pp. Uppsala: Acta Universitatis Upsaliensis. ISBN 978-91-513-0761-9. Current evolutionary theories postulate that eukaryotes emerged from the symbiosis of an archaeal host with, at least, one bacterial symbiont. However, our limited grasp of microbial diversity hampers insights into the features of the prokaryotic ancestors of eukaryotes. This thesis focuses on the study of a group of uncultured archaea to better understand both existing archaeal diversity and the origin of eukaryotes. In a first study, we used short-read metagenomic approaches to obtain eight genomes of Lokiarchaeum relatives. Using these data we described the Asgard superphylum, comprised of at least four different phyla: Lokiarchaeota, Odinarchaeota, Thorarchaeota and Heimdallarchaoeta. Phylogenetic analyses suggested that eukaryotes affiliate with the Asgard group, albeit the exact position of eukaryotes with respect to Asgard archaea members remained inconclusive. Comparative genomics showed that Asgard archaea genomes encoded homologs of numerous eukaryotic signature proteins (ESPs), which had never been observed in Archaea before.
    [Show full text]
  • The C-Terminal Domain of TPX2 Is Made of Alpha-Helical Tandem Repeats
    Edinburgh Research Explorer The C-terminal domain of TPX2 is made of alpha-helical tandem repeats Citation for published version: Sanchez-Pulido, L, Perez, L, Kuhn, S, Vernos, I & Andrade-Navarro, MA 2016, 'The C-terminal domain of TPX2 is made of alpha-helical tandem repeats', BMC Structural Biology, vol. 16, no. 1, pp. 1-8. https://doi.org/10.1186/s12900-016-0070-8 Digital Object Identifier (DOI): 10.1186/s12900-016-0070-8 Link: Link to publication record in Edinburgh Research Explorer Document Version: Publisher's PDF, also known as Version of record Published In: BMC Structural Biology Publisher Rights Statement: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. General rights Copyright for the publications made accessible via the Edinburgh Research Explorer is retained by the author(s) and / or other copyright owners and it is a condition of accessing these publications that users recognise and abide by the legal requirements associated with these rights. Take down policy The University of Edinburgh has made every reasonable effort to ensure that Edinburgh Research Explorer content complies with UK legislation. If you believe that the public display of this file breaches copyright please contact [email protected] providing details, and we will remove access to the work immediately and investigate your claim.
    [Show full text]
  • Solenoid and Non-Solenoid Protein Recognition Using Stationary Wavelet
    Vol. 26 ECCB 2010, pages i467–i473 BIOINFORMATICS doi:10.1093/bioinformatics/btq371 Solenoid and non-solenoid protein recognition using stationary wavelet packet transform An Vo1,∗,†, Nha Nguyen2,† and Heng Huang3 1The Feinstein Institute for Medical Research, North Shore LIJ Health System, NY, 2Department of Electrical Engineering and 3Department of Computer Science and Engineering, University of Texas at Arlington, TX, USA ABSTRACT the length of their repeats, which can provide information about Motivation: Solenoid proteins are emerging as a protein class a possible 3D structure of the repetitive protein (Kajava, 2001). with properties intermediate between structured and intrinsically There are four main structural classes (Kajava, 2001): Class I, unstructured proteins. Containing repeating structural units, solenoid crystalline structures (1- to 2-residue repeats); Class II, fibrous proteins are expected to share sequence similarities. However, in proteins (3- to 4-residue repeats); Class III, solenoid proteins many cases, the sequence similarities are weak and non-detectable. (5- to 42-residue repeats); and Class IV, domain-forming repeats Moreover, solenoids can be degenerated and widely vary in the (30 or more residues). Solenoid proteins contain a superhelical number of units. So that it is difficult to detect them. Recently, arrangement of repeating structure units (Kobe et al., 2000). This several solenoid repeats detection methods have been proposed, arrangement contrasts the structure of most Class-IV proteins that such as self-alignment of the sequence, spectral analysis and fold into globular domains in more complex manners. discrete Fourier transform of sequence. Although these methods Repeats in Class I and Class II have only 1−4 residues, hence have shown good performance on certain data sets, they often fail they have low sequence complexity and can be easily detected.
    [Show full text]
  • Materials and Methods………………………………………………………………….9
    UC San Diego UC San Diego Electronic Theses and Dissertations Title A Molecular Characterization of the Role of the Vertebrate Nucleoporin Nup155 in Nuclear Protein Import, Export, mRNA Export, and Nuclear Pore Assembly / Permalink https://escholarship.org/uc/item/7wx5h207 Author Martell, Steven Walter Publication Date 2014 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA, SAN DIEGO A Molecular Characterization of the Role of the Vertebrate Nucleoporin Nup155 in Nuclear Protein Import, Export, mRNA Export, and Nuclear Pore Assembly A thesis submitted in partial satisfaction of the requirements for the degree Master of Science in Biology by Steven Walter Martell Committee in charge: Professor Douglass Forbes, Chair Professor Maho Niwa Professor Ella Tour 2014 The Thesis of Steven Walter Martell is approved, and it is acceptable in quality and form for publication on microfilm and electronically: Chair University of California, San Diego 2014 iii DEDICATION Without the love and support of my parents, David and Judy Martell, I would be lost. Their unconditional love has been the foundation of who I am today. Knowing that I have the unwavering support of these two is one of the great constants in my life. I am infinitely lucky to have parents who can both share their wisdom with me and be my friend. Their continued emotional and financial support has allowed me to enjoy the best years of my life at UCSD. No words can adequately express my admiration
    [Show full text]
  • Sequence Evidence for Common Ancestry of Eukaryotic Endomembrane Coatomers 2015|03|29
    bioRxiv preprint doi: https://doi.org/10.1101/020990; this version posted June 16, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Promponas et al. Deep divergence of endomembrane coatomers Sequence evidence for common ancestry of eukaryotic endomembrane coatomers 2015|03|29 Vasilis J. Promponas1, Katerina R. Katsani2, Benjamin J. Blencowe3, Christos A. Ouzounis1,3,4* 1 Bioinformatics Research Laboratory, Department of Biological Sciences, New Campus, University of Cyprus, PO Box 20537, CY-1678 Nicosia, Cyprus 2 Department of Molecular Biology & Genetics, Democritus University of Thrace, GR-68100 Alexandroupolis, Greece 3 Donnelly Centre for Cellular & Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada 4§ Biological Computation & Process Laboratory (BCPL), Chemical Process Research Institute (CPERI), Centre for Research & Technology (CERTH), PO Box 361, GR-57001 Thessalonica, Greece *corresponding author: CAO, email: [email protected] {§present address} ABSTRACT Eukaryotic cells are defined by compartments through which the trafficking of macromolecules is mediated by large complexes, such as the nuclear pore, transport vesicles and intraflagellar transport. The assembly and maintenance of these complexes is facilitated by endomembrane coatomers, long suspected to be divergently related on the basis of structural and more recently phylogenomic analysis. By performing supervised walks in sequence space across coatomer superfamilies, we uncover subtle sequence patterns that have remained elusive to date, ultimately unifying eukaryotic coatomers by divergent evolution. The conserved residues shared by 3,502 endomembrane coatomer components are mapped onto the solenoid superhelix of nucleoporin and COPII protein structures, thus determining the invariant elements of coatomer architecture.
    [Show full text]
  • Disease-Related Single-Point Mutations Alter the Global Dynamics of a Tetratricopeptide (TPR) A-Solenoid Domain
    bioRxiv preprint doi: https://doi.org/10.1101/598672; this version posted April 4, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Disease-related single-point mutations alter the global dynamics of a tetratricopeptide (TPR) a-solenoid domain Salomé Llabrés1*, Ulrich Zachariae1,2* 1 Computational Biology, School of Life Sciences, University of Dundee, Dundee, UK 2 Physics, School of Science and Engineering, University of Dundee, Dundee, UK *Correspondence to: Salomé Llabrés: [email protected] Ulrich Zachariae: [email protected] ABSTRACT Tetratricopeptide repeat (TPR) proteins belong to the class of α-solenoid proteins, in which repetitive units of α-helical hairpin motifs stack to form superhelical, often highly flexible structures. TPR domains occur in a wide variety of proteins, and perform key functional roles including protein folding, protein trafficking, cell cycle control and post-translational modification. Here, we look at the TPR domain of the enzyme O-linked GlcNAc-transferase (OGT), which catalyses O-GlcNAcylation of a broad range of substrate proteins. A number of single-point mutations in the TPR domain of human OGT have been associated with the disease Intellectual Disability (ID). By extended steered and equilibrium atomistic simulations, we show that the OGT-TPR domain acts as a reversibly elastic nanospring, and that each of the ID-related local mutations substantially affect the global dynamics of the TPR domain. Since the nanospring character of the OGT-TPR domain is key to its function in binding and releasing OGT substrates, these changes of its biomechanics likely lead to defective substrate interaction.
    [Show full text]
  • First and Last Ancestors
    Available online at www.sciencedirect.com First and last ancestors: reconstructing evolution of the endomembrane system with ESCRTs, vesicle coat proteins, and nuclear pore complexes Mark C Field1 and Joel B Dacks2 The eukaryotic endomembrane system is responsible for the cytoskeleton and many other factors [1]. Comparative biosynthesis and transport of proteins and lipids, and for the genomic and phylogenetic analyses have revealed a great definition of the major subcellular compartments. Recent work deal of conservation in the machinery of membrane indicates that the endomembrane system is ancient, with near trafficking, not only from yeast to man, but in the broad modern complexity predating the radiation of the major diversity of eukaryotes. These studies have suggested eukaryotic lineages. The challenge is to look beyond the last that the last eukaryotic common ancestor (LECA, a eukaryotic common ancestor and to attempt to deduce the hypothetical lineage that gave rise to all modern eukar- evolutionary steps in the rise of membrane-trafficking yote supergroups) possessed a complex membrane-traf- complexity. Relationships between the endomembrane ficking system, with a near modern array of organelles and coatomer complexes and their evolutionary connection to the representatives of almost all of the major protein families nuclear pore complex are emerging. These studies, plus the [2–4]. Furthermore, much of the machinery involved in realization of a role for the ESCRT complex as an alternate, but vesicle-trafficking steps is composed of paralogous equally ancient, system for membrane deformation are protein families but with organelle-specific family mem- providing insight into the earliest stages of endomembrane bers [5].
    [Show full text]
  • Measuring Uncertainty of Protein Secondary Structure
    Wright State University CORE Scholar Browse all Theses and Dissertations Theses and Dissertations 2011 Measuring Uncertainty of Protein Secondary Structure Alan Eugene Herner Wright State University Follow this and additional works at: https://corescholar.libraries.wright.edu/etd_all Part of the Computer Engineering Commons, and the Computer Sciences Commons Repository Citation Herner, Alan Eugene, "Measuring Uncertainty of Protein Secondary Structure" (2011). Browse all Theses and Dissertations. 422. https://corescholar.libraries.wright.edu/etd_all/422 This Dissertation is brought to you for free and open access by the Theses and Dissertations at CORE Scholar. It has been accepted for inclusion in Browse all Theses and Dissertations by an authorized administrator of CORE Scholar. For more information, please contact [email protected]. MEASURING UNCERTAINTY OF PROTEIN SECONDARY STRUCTURE A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy By Alan Eugene Herner B.A. Wright State University, 1980 M.S. Wright State University, 1980 M.S. Wright State University, 2001 _____________________________________________ 2011 Wright State University COPYRIGHT BY Alan E. Herner 2011 WRIGHT STATE UNIVERSITY SCHOOL OF GRADUATE STUDIES January 7, 2011 I HEREBY RECOMMEND THAT THE DISSERTATION PREPARED UNDER MY SUPERVISION BY ALAN E. HERNER ENTITLED Measuring Uncertainty of Protein Secondary Structure BE ACCEPTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY. ________________________ Michael L. Raymer, PhD Dissertation Director ________________________ Arthur A. Goshtasby, PhD Director, Computer Science and Engineering PhD Program ________________________ Andrew Hsu, PhD Dean School of Graduate Studies Committee Final Examination ____________________ Michael L. Raymer, PhD ____________________ Gerald Alter, PhD ____________________ Travis Doom, PhD ____________________ Ruth Pachter, PhD _____________________ Mateen Rizki, PhD ABSTRACT Herner, Alan E.
    [Show full text]
  • Functional and Genomic Analyses of Alpha-Solenoid Proteins
    Functional and Genomic Analyses of Alpha-Solenoid Proteins David Fournier1, Gareth A. Palidwor2, Sergey Shcherbinin3, Angelika Szengel1, Martin H. Schaefer1, Carol Perez-Iratxeta2, Miguel A. Andrade-Navarro1* 1 Max-Delbru¨ck Center for Molecular Medicine, Berlin, Germany, 2 Ottawa Hospital Research Institute, Ottawa, Ontario, Canada, 3 The University of British Columbia, Vancouver, British Columbia, Canada Abstract Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated to increased cellular complexity.
    [Show full text]
  • The Nuclear Pore Complex Core Scaffold And
    Available online at www.sciencedirect.com ScienceDirect The nuclear pore complex core scaffold and permeability barrier: variations of a common theme Ryo Hayama, Michael P Rout and Javier Fernandez-Martinez The study of the nuclear pore complex (NPC) is a fascinating that fenestrates the NE and forms a channel of approxi- endeavor, as it not only implies uncovering the ‘engineering mately 40 nm wide; it is an eightfold symmetrical assem- marvel’ of its architecture and function, but also provides a key bly, formed by more than 500 copies of 30 different window into a significant evolutionary event: the origin of the proteins called nucleoporins (nups) [1] that have been eukaryotic cell. The combined efforts of many groups in the shown to be arranged in conserved, biochemically stable field, with the help of novel methodologies and new model subcomplexes acting as the NPC’s building blocks [2–4]. organisms, are facilitating a much deeper understanding of this These building blocks coat the NE membrane with a complex assembly. Here we cover recent advances on the highly modular symmetrical scaffold, formed by eight characterization of the structure of the NPC scaffold and of the protomers called spokes, connecting radially to form biophysical mechanisms that define the permeability barrier. several concentric rings [2,3]: the outer ring—formed We identify common architectural and functional principles by a head-to-tail arrangement of a Y-shaped complex between those two NPC compartments, expanding the [5–7]; the inner ring; and the membrane ring. Attached previous protocoatomer hypothesis to suggest possible to these rings are two asymmetrically localized modules, evolutionary origins for the FG nucleoporins and the NPC the cytoplasmic Nup82 complex and the nuclear basket.
    [Show full text]