The Missing Tailed Phages: Prediction of Small Capsid Candidates

Total Page:16

File Type:pdf, Size:1020Kb

The Missing Tailed Phages: Prediction of Small Capsid Candidates microorganisms Article The Missing Tailed Phages: Prediction of Small Capsid Candidates Antoni Luque 1,2,3,* , Sean Benler 4 , Diana Y. Lee 1,2, Colin Brown 1,5 and Simon White 6 1 Viral Information Institute, San Diego State University, San Diego, CA 92182, USA; [email protected] (D.Y.L.); [email protected] (C.B.) 2 Computational Science Research Center, San Diego State University, San Diego, CA 92182, USA 3 Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA 4 National Center for Biotechnology Information (NCBI), Bethesda, MD 20894, USA; [email protected] 5 Department of Physics, San Diego State University, San Diego, CA 92182, USA 6 Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269, USA; [email protected] * Correspondence: [email protected] Received: 29 October 2020; Accepted: 5 December 2020; Published: 8 December 2020 Abstract: Tailed phages are the most abundant and diverse group of viruses on the planet. Yet, the smallest tailed phages display relatively complex capsids and large genomes compared to other viruses. The lack of tailed phages forming the common icosahedral capsid architectures T = 1 and T = 3 is puzzling. Here, we extracted geometrical features from high-resolution tailed phage capsid reconstructions and built a statistical model based on physical principles to predict the capsid diameter and genome length of the missing small-tailed phage capsids. We applied the model to 3348 isolated tailed phage genomes and 1496 gut metagenome-assembled tailed phage genomes. Four isolated tailed phages were predicted to form T = 3 icosahedral capsids, and twenty-one metagenome-assembled tailed phages were predicted to form T < 3 capsids. The smallest capsid predicted was a T = 4/3 1.33 architecture. No tailed phages were predicted to form the smallest ≈ icosahedral architecture, T = 1. We discuss the feasibility of the missing T = 1 tailed phage capsids and the implications of isolating and characterizing small-tailed phages for viral evolution and phage therapy. Keywords: bacteriophage; tailed phages; icosahedral capsids; capsid modeling; statistical learning; isolated genomes; metagenome-assembled genomes 1. Introduction Tailed phages are viruses that infect bacteria and are the most abundant biological entity on Earth [1]. They are responsible for the regulation of biogeochemical processes at a planetary scale [2,3], the control of microbial populations [4,5], and the mobility of genes across hosts and ecosystems [6,7]. The broad functionality of tailed phages is facilitated by their vast reservoir of genes, resulting in a large range of genome lengths, from 10 to 500 kilobase pairs (kbp) [8,9]. Tailed phages can accommodate these genomes because the proteins building their protective capsids display a high degree of plasticity, allowing for a broad range of different size capsids, from about 30 to 160 nm in diameter [10]. The majority of tailed phages, about 80–90%, form capsids with icosahedral symmetry [11], while the remaining 10–20% form elongated capsids with icosahedral caps [12,13]. The formation of icosahedral capsids is not unique to tailed phages—it is the most common viral capsid architecture in the virosphere [14]. However, despite the abundance and diversity of tailed phages, even the smallest tailed phages form far more complex capsids and store far larger genomes Microorganisms 2020, 8, 1944; doi:10.3390/microorganisms8121944 www.mdpi.com/journal/microorganisms Microorganisms 2020, 8, 1944 2 of 18 Microorganisms 2020, 8, x FOR PEER REVIEW 2 of 19 than most icosahedral viral families [15]. Furthermore, the major capsid proteins of tailed phages adoptthe theHK97-fold, HK97-fold, which which is also is found also found in the inbuilding the building blocks of blocks bacterial of bacterial and archeal and enzymatic archeal enzymatic cellular cellularnanocompartments nanocompartments called called encapsulins encapsulins [16,17]. [16 ,17Intriguingly,]. Intriguingly, encapsulins encapsulins can canform form the the smallest smallest icosahedralicosahedral protein protein shells shells that that are are absent absent among among tailed tailedphages phages (Figure(Figure1 1).). Figure 1. HK97-fold protein compartments. The left side of the figure focuses on encapsulins, nanocompartmentsFigure 1. HK97-fold responsible protein forcompartments. chemical storage The andleft biochemicalside of the reactionsfigure focuses in bacteria on encapsulins, and archaea. Thenanocompartments right side of the figure responsible focuses onfor viruses chemical and stor geneage transfer and biochemical agents. The reactions viruses belong in bacteria to the realmand of Duplodnaviriaarchaea. The right. Tailed side phages of the figure in the focuses phylum onUroviricota viruses andand gene class transferCauviricetes agents. areThe the viruses most belong diverse andto abundant the realm representatives of Duplodnaviria of. Tailed this group. phages in the phylum Uroviricota and class Cauviricetes are the most diverse and abundant representatives of this group. Icosahedral capsids are characterized by the triangulation number T, which determines the number of quasi-equivalentIcosahedral capsids proteins are (complexity) characterized and by the the total triangulation number of proteinsnumber T, forming which thedetermines capsid [18 the,19 ]. number of quasi-equivalent proteins (complexity) and the total number of proteins forming the As illustrated in Figure2, icosahedral capsids can organize their proteins in four di fferent lattices, capsid [18,19]. As illustrated in Figure 2, icosahedral capsids can organize their proteins in four accommodating different stoichiometries of major and minor capsid proteins [19]. The generalized different lattices, accommodating different stoichiometries of major and minor capsid proteins [19]. T-number is Ti(h,k) = αi T0(h,k), where h and k are the steps in the hexagonal sublattice joining two The generalized T-number is Ti(h,k) = αi T0(h,k), where h and k are the steps in the hexagonal consecutive vertices in the icosahedral capsid. T0 is the classic T-number associated with the hexagonal sublattice joining two consecutive vertices in the icosahedral capsid. T0 is the classic T-number lattice and is given by the equation T (h,k) = h2 + hk + k2. The subindex i takes the values h, t, s, and r, associated with the hexagonal lattice0 and is given by the equation T0(h,k) = h2 + hk + k2. The subindex associated,i takes the respectively, values h, t, with s, and the r, hexagonal, associated, trihexagonal, respectively, snubwith hexagonal,the hexagonal, and rhombitrihexagonaltrihexagonal, snub icosahedralhexagonal, lattices. and rhombitrihexagonal The number of major icosahedral capsid proteins lattices. in The a capsid number is 60T of 0major(h,k). capsid The hexagonal proteins latticein a casecapsid only containsis 60T0(h,k). major The capsid hexagonal proteins, lattice that case is, only Th(h,k) contains= T0(h,k), major with capsidαh =proteins,1. If the that size is, of T theh(h,k) major = capsidT0(h,k), protein with is αh conserved = 1. If the size across of the lattices, major the capsid other protei threen latticesis conserved must across include lattices, minor the capsid other proteins three occupyinglattices must the secondary include minor polygons capsid (triangles proteins and occupying squares). Thisthe secondary increasesthe polygons surface (triangles of the capsid and by a factorsquares).αt = This4/3 increases1.33 (trihexagonal), the surface ofα thes = capsid7/3 2.33by a (snubfactor hexagonal),αt = 4/3 ≈ 1.33 and (trihexagonal),αr = 4/3 + 2 α/ ps =3 7/32.49 ≈ ≈ ≈ ≈ (rhombitrihexagonal).2.33 (snub hexagonal), When and combiningαr = 4/3 + 2/ all√3 lattices, ≈ 2.49 (rhombitrihexagonal). the first four elements When of the combining generalized all T-numberlattices, arethe T = first1, 1.33, four 2.33,elements and 2.49,of the containing generalized 60 T-number major capsid are T proteins = 1, 1.33, each.2.33, and The 2.49, fifth containing element of 60 the major series is Tcapsid= 3 and proteins contains each. 180 The major fifth capsidelement proteins. of the series Tailed is T phages = 3 and have contains been 180 observed major capsid to form proteins. capsids adoptingTailed thephages hexagonal have been and observed trihexagonal to form lattices capsid [19s adopting,20], but nothe tailedhexagonal phages and have trihexagonal been observed lattices to MicroorganismsMicroorganisms2020 2020,,8 8,, 1944 x FOR PEER REVIEW 33 ofof 1819 [19,20], but no tailed phages have been observed to form T ≤ 3 capsids (Figure 1). The smallest form T 3 capsids (Figure1). The smallest characterized tailed phage structure corresponds to the characterized≤ tailed phage structure corresponds to the Bacillus phage phi29, which adopts an Bacillus phage phi29, which adopts an elongated structure with icosahedral T = 3 caps and a Q = 5 elongated structure with icosahedral T = 3 caps and a Q = 5 body [12,21,22], and Streptococcus phage body [12,21,22], and Streptococcus phage C1, which adopts a T = 4 icosahedral capsid [10,23]. C1, which adopts a T = 4 icosahedral capsid [10,23]. Figure 2. Icosahedral lattices for T(2,1) = 7 capsids. The label on the top displays the name of the lattice.Figure 2. The Icosahedral blue arrows lattices and for blue T(2,1) dots = display 7 capsids. the The h and label k on steps the intop the displays hexagonal the name lattice, of hthe= lattice.2 and kThe= 1 blue in this arrows case.
Recommended publications
  • Multiple Origins of Viral Capsid Proteins from Cellular Ancestors
    Multiple origins of viral capsid proteins from PNAS PLUS cellular ancestors Mart Krupovica,1 and Eugene V. Kooninb,1 aInstitut Pasteur, Department of Microbiology, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, 75015 Paris, France; and bNational Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894 Contributed by Eugene V. Koonin, February 3, 2017 (sent for review December 21, 2016; reviewed by C. Martin Lawrence and Kenneth Stedman) Viruses are the most abundant biological entities on earth and show genome replication. Understanding the origin of any virus group is remarkable diversity of genome sequences, replication and expres- possible only if the provenances of both components are elucidated sion strategies, and virion structures. Evolutionary genomics of (11). Given that viral replication proteins often have no closely viruses revealed many unexpected connections but the general related homologs in known cellular organisms (6, 12), it has been scenario(s) for the evolution of the virosphere remains a matter of suggested that many of these proteins evolved in the precellular intense debate among proponents of the cellular regression, escaped world (4, 6) or in primordial, now extinct, cellular lineages (5, 10, genes, and primordial virus world hypotheses. A comprehensive 13). The ability to transfer the genetic information encased within sequence and structure analysis of major virion proteins indicates capsids—the protective proteinaceous shells that comprise the that they evolved on about 20 independent occasions, and in some of cores of virus particles (virions)—is unique to bona fide viruses and these cases likely ancestors are identifiable among the proteins of distinguishes them from other types of selfish genetic elements cellular organisms.
    [Show full text]
  • Copyrighted Material
    CHAPTER 1 Viruses and Their Importance CHAPTER 1 AT A GLANCE Viruses infect: • Humans • Other vertebrates • Invertebrates Smallpox1 Bluetongue virus-infected sheep2 Tipula sp. larvae (leatherjackets) infected with invertebrate iridescent virus 1 • Plants • Fungi • Bacteria COPYRIGHTED MATERIAL Escherichia coli Delayed emergence of Damaged potato Mushroom virus X4 cell potato caused by tobacco (spraing) caused by with phage T4 attached5 rattle virus infection3 tobacco rattle virus infection3 1 cc01.indd01.indd 1 118/01/138/01/13 88:14:14 PPMM 2 CHAPTER 1 VIRUSES AND THEIR IMPORTANCE CHAPTER 1 AT A GLANCE (continued) Some viruses are useful: • Phage typing of bacteria • Sources of enzymes • Pesticides • Anti-bacterial agents • Anti-cancer agents • Gene vectors Viruses are parasites; they depend on cells for molecular building blocks, machinery, and energy. Virus particles are small; dimensions range approx. from 20–400 nm. A virus genome is composed of one of the following: double-stranded single-stranded double-stranded single-stranded DNA DNA RNA RNA Photographs reproduced with permission of 1 World Health Organization. 2 From Umeshappa et al . (2011) Veterinary Immunology and Immunopathology , 141, 230. Reproduced by permission of Elsevier and the authors. 3 MacFarlane and Robinson (2004) Chapter 11, Microbe-Vector Interactions in Vector-Borne Diseases , 63rd Symposium of the Society for General Microbiology, Cambridge University Press. Reprinted with permission. 4 University of Warwick. 5 Cornell Integrated Microscopy Center. The presence of viruses is obvious in host organ- 1.1 VIRUSES ARE UBIQUITOUS ON EARTH isms showing signs of disease. Many healthy organisms, however, are hosts of non-pathogenic virus infections, Viruses infect all cellular life forms: eukaryotes (ver- some of which are active, while some are quiescent.
    [Show full text]
  • 2020 Taxonomic Update for Phylum Negarnaviricota (Riboviria: Orthornavirae), Including the Large Orders Bunyavirales and Mononegavirales
    Archives of Virology https://doi.org/10.1007/s00705-020-04731-2 VIROLOGY DIVISION NEWS 2020 taxonomic update for phylum Negarnaviricota (Riboviria: Orthornavirae), including the large orders Bunyavirales and Mononegavirales Jens H. Kuhn1 · Scott Adkins2 · Daniela Alioto3 · Sergey V. Alkhovsky4 · Gaya K. Amarasinghe5 · Simon J. Anthony6,7 · Tatjana Avšič‑Županc8 · María A. Ayllón9,10 · Justin Bahl11 · Anne Balkema‑Buschmann12 · Matthew J. Ballinger13 · Tomáš Bartonička14 · Christopher Basler15 · Sina Bavari16 · Martin Beer17 · Dennis A. Bente18 · Éric Bergeron19 · Brian H. Bird20 · Carol Blair21 · Kim R. Blasdell22 · Steven B. Bradfute23 · Rachel Breyta24 · Thomas Briese25 · Paul A. Brown26 · Ursula J. Buchholz27 · Michael J. Buchmeier28 · Alexander Bukreyev18,29 · Felicity Burt30 · Nihal Buzkan31 · Charles H. Calisher32 · Mengji Cao33,34 · Inmaculada Casas35 · John Chamberlain36 · Kartik Chandran37 · Rémi N. Charrel38 · Biao Chen39 · Michela Chiumenti40 · Il‑Ryong Choi41 · J. Christopher S. Clegg42 · Ian Crozier43 · John V. da Graça44 · Elena Dal Bó45 · Alberto M. R. Dávila46 · Juan Carlos de la Torre47 · Xavier de Lamballerie38 · Rik L. de Swart48 · Patrick L. Di Bello49 · Nicholas Di Paola50 · Francesco Di Serio40 · Ralf G. Dietzgen51 · Michele Digiaro52 · Valerian V. Dolja53 · Olga Dolnik54 · Michael A. Drebot55 · Jan Felix Drexler56 · Ralf Dürrwald57 · Lucie Dufkova58 · William G. Dundon59 · W. Paul Duprex60 · John M. Dye50 · Andrew J. Easton61 · Hideki Ebihara62 · Toufc Elbeaino63 · Koray Ergünay64 · Jorlan Fernandes195 · Anthony R. Fooks65 · Pierre B. H. Formenty66 · Leonie F. Forth17 · Ron A. M. Fouchier48 · Juliana Freitas‑Astúa67 · Selma Gago‑Zachert68,69 · George Fú Gāo70 · María Laura García71 · Adolfo García‑Sastre72 · Aura R. Garrison50 · Aiah Gbakima73 · Tracey Goldstein74 · Jean‑Paul J. Gonzalez75,76 · Anthony Grifths77 · Martin H. Groschup12 · Stephan Günther78 · Alexandro Guterres195 · Roy A.
    [Show full text]
  • A Persistent Giant Algal Virus, with a Unique Morphology, Encodes An
    bioRxiv preprint doi: https://doi.org/10.1101/2020.07.30.228163; this version posted January 13, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 A persistent giant algal virus, with a unique morphology, encodes an 2 unprecedented number of genes involved in energy metabolism 3 4 Romain Blanc-Mathieu1,2, Håkon Dahle3, Antje Hofgaard4, David Brandt5, Hiroki 5 Ban1, Jörn Kalinowski5, Hiroyuki Ogata1 and Ruth-Anne Sandaa6* 6 7 1: Institute for Chemical Research, Kyoto University, Gokasho, Uji, 611-0011, Japan 8 2: Laboratoire de Physiologie Cellulaire & Végétale, CEA, Univ. Grenoble Alpes, 9 CNRS, INRA, IRIG, Grenoble, France 10 3: Department of Biological Sciences and K.G. Jebsen Center for Deep Sea Research, 11 University of Bergen, Bergen, Norway 12 4: Department of Biosciences, University of Oslo, Norway 13 5: Center for Biotechnology, Universität Bielefeld, Bielefeld, 33615, Germany 14 6: Department of Biological Sciences, University of Bergen, Bergen, Norway 15 *Corresponding author: Ruth-Anne Sandaa, +47 55584646, [email protected] 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.30.228163; this version posted January 13, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 16 Abstract 17 Viruses have long been viewed as entities possessing extremely limited metabolic 18 capacities.
    [Show full text]
  • The LUCA and Its Complex Virome in Another Recent Synthesis, We Examined the Origins of the Replication and Structural Mart Krupovic , Valerian V
    PERSPECTIVES archaea that form several distinct, seemingly unrelated groups16–18. The LUCA and its complex virome In another recent synthesis, we examined the origins of the replication and structural Mart Krupovic , Valerian V. Dolja and Eugene V. Koonin modules of viruses and posited a ‘chimeric’ scenario of virus evolution19. Under this Abstract | The last universal cellular ancestor (LUCA) is the most recent population model, the replication machineries of each of of organisms from which all cellular life on Earth descends. The reconstruction of the four realms derive from the primordial the genome and phenotype of the LUCA is a major challenge in evolutionary pool of genetic elements, whereas the major biology. Given that all life forms are associated with viruses and/or other mobile virion structural proteins were acquired genetic elements, there is no doubt that the LUCA was a host to viruses. Here, by from cellular hosts at different stages of evolution giving rise to bona fide viruses. projecting back in time using the extant distribution of viruses across the two In this Perspective article, we combine primary domains of life, bacteria and archaea, and tracing the evolutionary this recent work with observations on the histories of some key virus genes, we attempt a reconstruction of the LUCA virome. host ranges of viruses in each of the four Even a conservative version of this reconstruction suggests a remarkably complex realms, along with deeper reconstructions virome that already included the main groups of extant viruses of bacteria and of virus evolution, to tentatively infer archaea. We further present evidence of extensive virus evolution antedating the the composition of the virome of the last universal cellular ancestor (LUCA; also LUCA.
    [Show full text]
  • How Influenza Virus Uses Host Cell Pathways During Uncoating
    cells Review How Influenza Virus Uses Host Cell Pathways during Uncoating Etori Aguiar Moreira 1 , Yohei Yamauchi 2 and Patrick Matthias 1,3,* 1 Friedrich Miescher Institute for Biomedical Research, 4058 Basel, Switzerland; [email protected] 2 Faculty of Life Sciences, School of Cellular and Molecular Medicine, University of Bristol, Bristol BS8 1TD, UK; [email protected] 3 Faculty of Sciences, University of Basel, 4031 Basel, Switzerland * Correspondence: [email protected] Abstract: Influenza is a zoonotic respiratory disease of major public health interest due to its pan- demic potential, and a threat to animals and the human population. The influenza A virus genome consists of eight single-stranded RNA segments sequestered within a protein capsid and a lipid bilayer envelope. During host cell entry, cellular cues contribute to viral conformational changes that promote critical events such as fusion with late endosomes, capsid uncoating and viral genome release into the cytosol. In this focused review, we concisely describe the virus infection cycle and highlight the recent findings of host cell pathways and cytosolic proteins that assist influenza uncoating during host cell entry. Keywords: influenza; capsid uncoating; HDAC6; ubiquitin; EPS8; TNPO1; pandemic; M1; virus– host interaction Citation: Moreira, E.A.; Yamauchi, Y.; Matthias, P. How Influenza Virus Uses Host Cell Pathways during 1. Introduction Uncoating. Cells 2021, 10, 1722. Viruses are microscopic parasites that, unable to self-replicate, subvert a host cell https://doi.org/10.3390/ for their replication and propagation. Despite their apparent simplicity, they can cause cells10071722 severe diseases and even pose pandemic threats [1–3].
    [Show full text]
  • Rapid Protein Sequence Evolution Via Compensatory Frameshift Is Widespread in RNA Virus Genomes
    Park and Hahn BMC Bioinformatics (2021) 22:251 https://doi.org/10.1186/s12859-021-04182-9 RESEARCH Open Access Rapid protein sequence evolution via compensatory frameshift is widespread in RNA virus genomes Dongbin Park and Yoonsoo Hahn* *Correspondence: [email protected] Abstract Department of Life Science, Background: RNA viruses possess remarkable evolutionary versatility driven by the Chung-Ang University, Seoul 06794, South Korea high mutability of their genomes. Frameshifting nucleotide insertions or deletions (indels), which cause the premature termination of proteins, are frequently observed in the coding sequences of various viral genomes. When a secondary indel occurs near the primary indel site, the open reading frame can be restored to produce functional proteins, a phenomenon known as the compensatory frameshift. Results: In this study, we systematically analyzed publicly available viral genome sequences and identifed compensatory frameshift events in hundreds of viral protein- coding sequences. Compensatory frameshift events resulted in large-scale amino acid diferences between the compensatory frameshift form and the wild type even though their nucleotide sequences were almost identical. Phylogenetic analyses revealed that the evolutionary distance between proteins with and without a compensatory frameshift were signifcantly overestimated because amino acid mismatches caused by compensatory frameshifts were counted as substitutions. Further, this could cause compensatory frameshift forms to branch in diferent locations in the protein and nucleotide trees, which may obscure the correct interpretation of phylogenetic rela- tionships between variant viruses. Conclusions: Our results imply that the compensatory frameshift is one of the mecha- nisms driving the rapid protein evolution of RNA viruses and potentially assisting their host-range expansion and adaptation.
    [Show full text]
  • Viral Diversity in Tree Species
    Universidade de Brasília Instituto de Ciências Biológicas Departamento de Fitopatologia Programa de Pós-Graduação em Biologia Microbiana Doctoral Thesis Viral diversity in tree species FLÁVIA MILENE BARROS NERY Brasília - DF, 2020 FLÁVIA MILENE BARROS NERY Viral diversity in tree species Thesis presented to the University of Brasília as a partial requirement for obtaining the title of Doctor in Microbiology by the Post - Graduate Program in Microbiology. Advisor Dra. Rita de Cássia Pereira Carvalho Co-advisor Dr. Fernando Lucas Melo BRASÍLIA, DF - BRAZIL FICHA CATALOGRÁFICA NERY, F.M.B Viral diversity in tree species Flávia Milene Barros Nery Brasília, 2025 Pages number: 126 Doctoral Thesis - Programa de Pós-Graduação em Biologia Microbiana, Universidade de Brasília, DF. I - Virus, tree species, metagenomics, High-throughput sequencing II - Universidade de Brasília, PPBM/ IB III - Viral diversity in tree species A minha mãe Ruth Ao meu noivo Neil Dedico Agradecimentos A Deus, gratidão por tudo e por ter me dado uma família e amigos que me amam e me apoiam em todas as minhas escolhas. Minha mãe Ruth e meu noivo Neil por todo o apoio e cuidado durante os momentos mais difíceis que enfrentei durante minha jornada. Aos meus irmãos André, Diego e meu sobrinho Bruno Kawai, gratidão. Aos meus amigos de longa data Rafaelle, Evanessa, Chênia, Tati, Leo, Suzi, Camilets, Ricardito, Jorgito e Diego, saudade da nossa amizade e dos bons tempos. Amo vocês com todo o meu coração! Minha orientadora e grande amiga Profa Rita de Cássia Pereira Carvalho, a quem escolhi e fui escolhida para amar e fazer parte da família.
    [Show full text]
  • Abstract 9 10 Bacteriophages (Phages) Are Bacterial Parasites That Can Themselves Be Parasitized by Phage 11 Satellites
    bioRxiv preprint doi: https://doi.org/10.1101/2021.03.30.437493; this version posted March 30, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 To catch a hijacker: abundance, evolution and genetic diversity of 2 P4-like bacteriophage satellites 3 Jorge A. Moura de Sousa1,*& Eduardo P.C. Rocha1,* 4 5 1Microbial Evolutionary Genomics, Institut Pasteur, CNRS, UMR3525, Paris 75015, France 6 * corresponding authors: [email protected], [email protected] 7 8 Abstract 9 10 Bacteriophages (phages) are bacterial parasites that can themselves be parasitized by phage 11 satellites. The molecular mechanisms used by satellites to hijack phages are sometimes 12 understood in great detail, but the origins, abundance, distribution, and composition of these 13 elements are poorly known. Here, we show that P4-like elements are present in more than 14 10% of the genomes of Enterobacterales, and in almost half of those of Escherichia coli, 15 sometimes in multiple distinct copies. We identified over 1000 P4-like elements with very 16 conserved genetic organization of the core genome and a few hotspots with highly variable 17 genes. These elements are never found in plasmids and have very little homology to known 18 phages, suggesting an independent evolutionary origin. Instead, they are scattered across 19 chromosomes, possibly because their integrases are often exchanged with other elements. 20 The rooted phylogenies of hijacking functions are correlated and suggest longstanding co- 21 evolution.
    [Show full text]
  • Virus World As an Evolutionary Network of Viruses and Capsidless Selfish Elements
    Virus World as an Evolutionary Network of Viruses and Capsidless Selfish Elements Koonin, E. V., & Dolja, V. V. (2014). Virus World as an Evolutionary Network of Viruses and Capsidless Selfish Elements. Microbiology and Molecular Biology Reviews, 78(2), 278-303. doi:10.1128/MMBR.00049-13 10.1128/MMBR.00049-13 American Society for Microbiology Version of Record http://cdss.library.oregonstate.edu/sa-termsofuse Virus World as an Evolutionary Network of Viruses and Capsidless Selfish Elements Eugene V. Koonin,a Valerian V. Doljab National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, USAa; Department of Botany and Plant Pathology and Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon, USAb Downloaded from SUMMARY ..................................................................................................................................................278 INTRODUCTION ............................................................................................................................................278 PREVALENCE OF REPLICATION SYSTEM COMPONENTS COMPARED TO CAPSID PROTEINS AMONG VIRUS HALLMARK GENES.......................279 CLASSIFICATION OF VIRUSES BY REPLICATION-EXPRESSION STRATEGY: TYPICAL VIRUSES AND CAPSIDLESS FORMS ................................279 EVOLUTIONARY RELATIONSHIPS BETWEEN VIRUSES AND CAPSIDLESS VIRUS-LIKE GENETIC ELEMENTS ..............................................280 Capsidless Derivatives of Positive-Strand RNA Viruses....................................................................................................280
    [Show full text]
  • Capsid Structures, Variation and Flexibility. This Project Brings Together the Skills of Laboratories at Cornell University
    Principal Investigator/Program Director (Last, first, middle): Parrish, Colin, R. Capsid structures, variation and flexibility. This project brings together the skills of laboratories at Cornell University and at Pennsylvania State University Medical Center to provide a detailed understanding of the roles of structural variation and flexibility in the parvoviral capsid, and their effects on receptor and antibody binding and the controls of cell infection and host range. These are fundamental problems that apply to all animal viruses, where the capsid must protect the genome in the environment, interact with host molecules including cell receptors and antibodies, and undergo a series of regulated structural transitions during cell entry to eventually release the genome for replication. Viral capsid binding to host receptors and antibodies can have varying and often unpredictable effects on infection, and those interactions also control many other replication steps. Where these virus-host interactions are specific they can control the viral host ranges. A model for understanding virus cell infection and host range control through differential receptor binding. We study two viruses that differ in host range due to 3 or 4 capsid protein mutations that control specific receptor binding. Canine parvovirus (CPV) arose around 1976 as a variant of feline panleukopenia virus (FPV), and caused a pandemic of disease during 1978 and 1979. That virus has continued to circulate worldwide as a serious canine pathogen, and has also evolved new antigenic, receptor binding, and host range variants. FPV and CPV both can bind the feline transferrin receptor 1 (TfR) to infect cat cells, and CPV gained the host range for dogs by gaining the ability to bind the canine TfR.
    [Show full text]
  • Viral Metagenomics in the Clinical Realm: Lessons Learned from a Swiss-Wide Ring Trial
    G C A T T A C G G C A T genes Article Viral Metagenomics in the Clinical Realm: Lessons Learned from a Swiss-Wide Ring Trial 1, 2, 2 2, Thomas Junier *, Michael Huber * , Stefan Schmutz , Verena Kufner y, 2, 3, 3, 4, Osvaldo Zagordi y, Stefan Neuenschwander y, Alban Ramette y , Jakub Kubacki y , 4, 5, 6, 6, Claudia Bachofen y, Weihong Qi y, Florian Laubscher y, Samuel Cordey y , 6, 7, 8 1,9 8, Laurent Kaiser y, Christian Beuret y, Valérie Barbié , Jacques Fellay and Aitana Lebrand * 1 Global Health Institute, Swiss Federal Institute of Technology (ETH Lausanne) & SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland 2 Institute of Medical Virology, University of Zurich, 8057 Zurich, Switzerland 3 Institute for Infectious Diseases, University of Bern, 3001 Bern, Switzerland 4 Institute of Virology, VetSuisse Faculty, University of Zurich, 8057 Zurich, Switzerland 5 Functional Genomics Center Zurich, Swiss Federal Institute of Technology (ETH Zurich) & University of Zurich, 8057 Zurich, Switzerland 6 Laboratory of Virology, University Hospitals of Geneva, 1205 Geneva, Switzerland; University of Geneva Medical School, 1206 Geneva, Switzerland 7 Biology Department, Spiez Laboratory, 3700 Spiez, Switzerland 8 Clinical Bioinformatics, SIB Swiss Institute of Bioinformatics, 1202 Geneva, Switzerland 9 Precision Medicine Unit, Lausanne University Hospital and University of Lausanne, 1010 Lausanne, Switzerland * Correspondence: [email protected] (T.J.); [email protected] (M.H.); [email protected] (A.L.) These authors contributed equally. y Received: 30 July 2019; Accepted: 24 August 2019; Published: 28 August 2019 Abstract: Shotgun metagenomics using next generation sequencing (NGS) is a promising technique to analyze both DNA and RNA microbial material from patient samples.
    [Show full text]