Pandoravirus Celtis Illustrates the Microevolution Processes at Work In

Total Page:16

File Type:pdf, Size:1020Kb

Pandoravirus Celtis Illustrates the Microevolution Processes at Work In bioRxiv preprint doi: https://doi.org/10.1101/500207; this version posted February 11, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Pandoravirus celtis illustrates the microevolution processes at work 2 in the giant Pandoraviridae genomes 3 Running title: Evolutionary mechanisms in Pandoraviridae 4 Matthieu Legendre1, Jean-Marie Alempic1, Nadège Philippe1, Audrey Lartigue1, Sandra 5 Jeudy1, Olivier Poirot1, Ngan Thi Ta1, Sébastien Nin1, Yohann Couté2, Chantal Abergel1*, 6 Jean-Michel Claverie1* 7 1Aix Marseille Univ, CNRS, IGS, Structural and Genomic Information Laboratory (UMR7256), 8 Mediterranean Institute of Microbiology (FR3479), 163 Avenue de Luminy, F-13288 Marseille, 9 France 10 2Univ. Grenoble Alpes, CEA, Inserm, BIG-BGE, 38000 Grenoble, France. 11 * Correspondence: 12 Jean-Michel Claverie 13 [email protected] 14 Chantal Abergel 15 [email protected] 16 Keywords: de novo gene creation, comparative genomics, Acanthamoeba, Giant viruses, Soil 17 viruses, hAT transposase 18 Abstract 19 With genomes of up to 2.7 Mb propagated in µm-long oblong particles and initially predicted to 20 encode more than 2000 proteins, members of the Pandoraviridae family display the most extreme 21 features of the known viral world. The mere existence of such giant viruses raises fundamental 22 questions about their origin and the processes governing their evolution. A previous analysis of 23 six newly available isolates, independently confirmed by a study including 3 others, established 24 that the Pandoraviridae pan-genome is open, meaning that each new strain exhibits protein- 25 coding genes not previously identified in other family members. With an average increment of 26 about 60 proteins, the gene repertoire shows no sign of reaching a limit and remains largely 27 coding for proteins without recognizable homologs in other viruses or cells (ORFans). To explain 28 these results, we proposed that most new protein-coding genes were created de novo, from pre- 29 existing non-coding regions of the G+C rich pandoravirus genomes. The comparison of the gene 30 content of a new isolate, P. celtis, closely related (96 % identical genome) to the previously 31 described P. quercus is now used to test this hypothesis by studying genomic changes in a 32 microevolution range. Our results confirm that the differences between these two similar gene 33 contents mostly consist of protein-coding genes without known homologs (ORFans), with 34 statistical signatures close to that of intergenic regions. These newborn proteins are under slight 35 negative selection, perhaps to maintain stable folds and prevent protein aggregation pending the 36 eventual emergence of fitness-increasing functions. Our study also unraveled several insertion 37 events mediated by a transposase of the hAT family, 3 copies of which are found in P. celtis and 38 are presumably active. Members of the Pandoraviridae are presently the first viruses known to 39 encode this type of transposase. bioRxiv preprint doi: https://doi.org/10.1101/500207; this version posted February 11, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Evolutionary mechanisms in Pandoraviridae 40 1 Introduction 41 The Pandoraviridae is a proposed family of giant dsDNA viruses - not yet registered by the 42 International Committee on Taxonomy of Viruses (ICTV) - multiplying in various species of 43 Acanthamoeba through a lytic infectious cycle. Their linear genomes, flanked by large terminal 44 repeats, range from 1.9 to 2.7 Mb in size, and are propagated in elongated oblong particles 45 approximately 1.2 µm long and 0.6 µm in diameter (Fig. S1). The prototype strain (and the one 46 with the largest known genome) is Pandoravirus salinus, isolated from shallow marine sediments 47 off the coast of central Chile (Philippe et al., 2013). Other members were soon after isolated from 48 worldwide locations. Complete genome sequences have been determined for P. dulcis 49 (Melbourne, Australia) (Philippe et al., 2013), P. inopinatum (Germany) (Antwerpen et al., 50 2015), P. macleodensis (Australia), P. neocaledonia (New Caledonia), and P. quercus (France) 51 (Legendre et al., 2018), and three isolates from Brazil (P. braziliensis, P. pampulha, and P. 52 massiliensis) (Aherfi et al., 2018). A standard phylogenetic analysis of the above strains 53 suggested that the Pandoraviridae family consists of two separate clades (Claverie et al., 2018; 54 Fig. 1). The average proportion of identical amino acids between Pandoraviridae orthologs within 55 each clade is above 70% while it is below 55% between members of the A and B clades. 56 Following a stringent reannotation of the predicted protein-coding genes using transcriptomic and 57 proteomic data, our comparative genomic analysis reached the main following conclusions 58 (Legendre et al., 2018): 59 1) the uniquely large proportion of predicted proteins without homologs outside of the 60 Pandoraviridae (ORFans) is real and not due to bioinformatic errors induced by the 61 above-average G+C content (>60%) of Pandoravirus genomes; 62 2) the Pandoraviridae pan genome appears “open” (i.e. unbounded); 63 3) as most of the genes are unique to each strain are ORFans, they were not horizontally 64 acquired from other (known) organisms; 65 4) they are neither predominantly the result of gene duplications. 66 The scenario of de novo and in situ gene creation, supported by the analysis of their sequence 67 statistical signatures, thus became our preferred explanation for the origin of strain specific genes. 68 In the present study, we take advantage of the high similarity (96.7% DNA sequence identity) 69 between P. quercus and a newly characterized isolate, P. celtis, to investigate the microevolution 70 processes initiating the divergence between Pandoraviruses. Our results further support de novo 71 gene creation as a main diversifying force of the Pandoraviridae family. 72 2 Materials and Methods 73 2.1 Virus Isolation, Production, and Purification 74 P. celtis and P. quercus were isolated in November 2014 from samples of surface soil taken at the 75 base of two trees (Celtis autralis and Quercus ilex) less than 50 meters apart in an urban green 76 space of Marseille city (GPS: 43°15'16.00"N, 5°25'4.00"E). Their particles were morphologically 77 identical to previously characterized pandoraviruses (Fig. S1). The viral populations were 78 amplified by co-cultivation with A. castellanii. They were then cloned, mass-produced and 79 purified as previously described (Philippe et al., 2013). 80 2.2 Genome and transcriptome sequencing, annotation 2 bioRxiv preprint doi: https://doi.org/10.1101/500207; this version posted February 11, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Evolutionary mechanisms in Pandoraviridae 81 The P. celtis genome was fully assembled from one PacBio SMRT cell sequence data with the 82 HGAP 4 assembler (Chin et al., 2013) from the SMRT link package version 5.0.1.9585 with 83 default parameters and the “aggressive” option=true. Genome polishing was finally performed 84 using the SMRT package. Stringent gene annotation was performed as previously described 85 (Legendre et al., 2018). Briefly, data from proteomic characterization of the purified virions were 86 combined with stranded RNA-seq transcriptomic data, as well as protein homology among 87 previously characterized pandoraviruses. The transcriptomic data were generated from cells 88 collected every hour over an infectious cycle of 15 hours. They were pooled and RNAs were 89 extracted prior poly(A)+ enrichment. The RNA were then sent for sequencing. Stranded RNA-seq 90 reads were then used to accurately annotate protein-coding as well as non-coding RNA genes. A 91 threshold of gene expression (median read coverage >5 over the whole transcript) larger than the 92 lowest one associated to proteins detected in proteomic analyses was required to validate all 93 predicted genes (including novel genes) (Table 1). Genomic regions exhibiting similar expression 94 levels but did not encompass predicted proteins or did overlap with protein-coding genes 95 expressed from opposite strands were annotated as “non-coding RNA” (ncRNA) after assembly 96 by Trinity (Grabherr et al., 2011). When only genomic data were available, namely for P. 97 pampulha, P. massiliensis and P. braziliensis (Aherfi et al., 2018), we annotated protein-coding 98 genes using ab initio prediction coupled with sequence conservation information as previously 99 described (Legendre et al., 2018). 100 Gene clustering was performed on all available Pandoraviruses’ protein-coding genes using 101 Orthofinder (Emms & Kelly, 2015) with defaults parameters except for the “msa” option for the 102 gene tree inference method. 103 Functional annotation of protein-coding genes was performed using a combination of protein 104 domains search with the CD-search tool (Marchler-Bauer & Bryant, 2004) and HMM-HMM 105 search against the Uniclust30 database with the HHblits tool (Remmert et al., 2011). In addition, 106 we used the same procedure to update the functional gene annotation of P. salinus, P. dulcis, P. 107 quercus, P. macleodensis and P. neocaledonia (Genbank IDs: KC977571, KC977570, 108 MG011689, MG011691 and MG011690). 109 2.3 Phylogenetic and Selection pressure analysis 110 The phylogenetic tree (Fig. 1) was computed from the concatenated multiple alignment of the 111 sequences of Pandoravirus core proteins corresponding to single-copy genes. The alignments of 112 orthologous genes peptide sequences were done using Mafft (Katoh et al., 2002). The tree was 113 computed using IQtree (Hoang et al., 2018) with the following options: -m MFP –bb 10000 –st 114 codon –bnni. The best model chosen was: GY+F+R5. Codon sequences were subsequently 115 mapped on these alignments.
Recommended publications
  • Diversity and Evolution of the Emerging Pandoraviridae Family
    bioRxiv preprint doi: https://doi.org/10.1101/230904; this version posted December 8, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. PNAS formated 30/08/17 Pandoraviridae Title: Diversity and evolution of the emerging Pandoraviridae family Authors: Matthieu Legendre1, Elisabeth Fabre1, Olivier Poirot1, Sandra Jeudy1, Audrey Lartigue1, Jean- Marie Alempic1, Laure Beucher2, Nadège Philippe1, Lionel Bertaux1, Karine Labadie3, Yohann Couté2, Chantal Abergel1, Jean-Michel Claverie1 Adresses: 1Structural and Genomic Information Laboratory, UMR 7256 (IMM FR 3479) CNRS Aix- Marseille Université, 163 Avenue de Luminy, Case 934, 13288 Marseille cedex 9, France. 2CEA-Institut de Génomique, GENOSCOPE, Centre National de Séquençage, 2 rue Gaston Crémieux, CP5706, 91057 Evry Cedex, France. 3 Univ. Grenoble Alpes, CEA, Inserm, BIG-BGE, 38000 Grenoble, France. Corresponding author: Jean-Michel Claverie Structural and Genomic Information Laboratory, UMR 7256, 163 Avenue de Luminy, Case 934, 13288 Marseille cedex 9, France. Tel: +33 491825447 , Email: [email protected] Co-corresponding author: Chantal Abergel Structural and Genomic Information Laboratory, UMR 7256, 163 Avenue de Luminy, Case 934, 13288 Marseille cedex 9, France. Tel: +33 491825420 , Email: [email protected] Keywords: Nucleocytoplasmic large DNA virus; environmental isolates; comparative genomics; de novo gene creation. 1 bioRxiv preprint doi:
    [Show full text]
  • On the Occurrence of Cytochrome P450 in Viruses
    On the occurrence of cytochrome P450 in viruses David C. Lamba,b,1, Alec H. Follmerc,1, Jared V. Goldstonea,1, David R. Nelsond, Andrew G. Warrilowb, Claire L. Priceb, Marie Y. Truee, Steven L. Kellyb, Thomas L. Poulosc,e,f, and John J. Stegemana,2 aBiology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543; bInstitute of Life Science, Swansea University Medical School, Swansea University, Swansea, SA2 8PP Wales, United Kingdom; cDepartment of Chemistry, University of California, Irvine, CA 92697-3900; dDepartment of Microbiology, Immunology and Biochemistry, University of Tennessee Health Science Center, Memphis, TN 38163; eDepartment of Pharmaceutical Sciences, University of California, Irvine, CA 92697-3900; and fDepartment of Molecular Biology and Biochemistry, University of California, Irvine, CA 92697-3900 Edited by Michael A. Marletta, University of California, Berkeley, CA, and approved May 8, 2019 (received for review February 7, 2019) Genes encoding cytochrome P450 (CYP; P450) enzymes occur A core set of genes involved in viral replication and lysis is most widely in the Archaea, Bacteria, and Eukarya, where they play often conserved in these viruses (10), yet most genes in the giant important roles in metabolism of endogenous regulatory mole- viruses (70–90%) do not have any obvious homolog in existing cules and exogenous chemicals. We now report that genes for virus databases. multiple and unique P450s occur commonly in giant viruses in the Strikingly, many of the giant viruses possess genes coding for Mimiviridae Pandoraviridae , , and other families in the proposed proteins typically involved in cellular processes, including protein order Megavirales. P450 genes were also identified in a herpesvi- translation, DNA repair, and eukaryotic metabolic pathways Ranid herpesvirus 3 Mycobacterium rus ( ) and a phage ( phage previously thought not to occur in viruses (17).
    [Show full text]
  • Exploration of the Propagation of Transpovirons Within Mimiviridae Reveals a Unique Example of Commensalism in the Viral World
    The ISME Journal (2020) 14:727–739 https://doi.org/10.1038/s41396-019-0565-y ARTICLE Exploration of the propagation of transpovirons within Mimiviridae reveals a unique example of commensalism in the viral world 1 1 1 1 1 Sandra Jeudy ● Lionel Bertaux ● Jean-Marie Alempic ● Audrey Lartigue ● Matthieu Legendre ● 2 1 1 2 3 4 Lucid Belmudes ● Sébastien Santini ● Nadège Philippe ● Laure Beucher ● Emanuele G. Biondi ● Sissel Juul ● 4 2 1 1 Daniel J. Turner ● Yohann Couté ● Jean-Michel Claverie ● Chantal Abergel Received: 9 September 2019 / Revised: 27 November 2019 / Accepted: 28 November 2019 / Published online: 10 December 2019 © The Author(s) 2019. This article is published with open access Abstract Acanthamoeba-infecting Mimiviridae are giant viruses with dsDNA genome up to 1.5 Mb. They build viral factories in the host cytoplasm in which the nuclear-like virus-encoded functions take place. They are themselves the target of infections by 20-kb-dsDNA virophages, replicating in the giant virus factories and can also be found associated with 7-kb-DNA episomes, dubbed transpovirons. Here we isolated a virophage (Zamilon vitis) and two transpovirons respectively associated to B- and C-clade mimiviruses. We found that the virophage could transfer each transpoviron provided the host viruses were devoid of 1234567890();,: 1234567890();,: a resident transpoviron (permissive effect). If not, only the resident transpoviron originally isolated from the corresponding virus was replicated and propagated within the virophage progeny (dominance effect). Although B- and C-clade viruses devoid of transpoviron could replicate each transpoviron, they did it with a lower efficiency across clades, suggesting an ongoing process of adaptive co-evolution.
    [Show full text]
  • Comparative Analysis of Transcriptional Regulation Patterns: Understanding the Gene Expression Profile in Nucleocytoviricota
    pathogens Review Comparative Analysis of Transcriptional Regulation Patterns: Understanding the Gene Expression Profile in Nucleocytoviricota Fernanda Gil de Souza †,Jônatas Santos Abrahão * and Rodrigo Araújo Lima Rodrigues *,† Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais 31270-901, Brazil; [email protected] * Correspondence: [email protected] (J.S.A.); [email protected] (R.A.L.R.) † These authors contributed equally to this work. Abstract: The nucleocytoplasmic large DNA viruses (NCLDV) possess unique characteristics that have drawn the attention of the scientific community, and they are now classified in the phylum Nucleocytoviricota. They are characterized by sharing many genes and have their own transcriptional apparatus, which provides certain independence from their host’s machinery. Thus, the presence of a robust transcriptional apparatus has raised much discussion about the evolutionary aspects of these viruses and their genomes. Understanding the transcriptional process in NCLDV would provide information regarding their evolutionary history and a better comprehension of the biology of these viruses and their interaction with hosts. In this work, we reviewed NCLDV transcription and performed a comparative functional analysis of the groups of genes expressed at different times of infection of representatives of six different viral families of giant viruses. With this analysis, it was possible to observe
    [Show full text]
  • Searching for Megaviruses in Iceland Delanie Baker Ohio Wesleyan University
    Ohio Wesleyan University Digital Commons @ OWU Student Symposium 2019 Apr 25th, 6:00 PM - 7:00 PM Searching for Megaviruses in Iceland Delanie Baker Ohio Wesleyan University Follow this and additional works at: https://digitalcommons.owu.edu/studentsymposium Part of the Microbiology Commons Baker, Delanie, "Searching for Megaviruses in Iceland" (2019). Student Symposium. 2. https://digitalcommons.owu.edu/studentsymposium/2019/poster_session/2 This Poster is brought to you for free and open access by the Student Scholarship at Digital Commons @ OWU. It has been accepted for inclusion in Student Symposium by an authorized administrator of Digital Commons @ OWU. For more information, please contact [email protected]. Presence of Megaviruses from Diverse Icelandic Environments Delanie Baker1, Laura Tuhela1, and Surendra Ambegaokar1,2 1 Department of Botany and Microbiology, Ohio Wesleyan University, 2 Neuroscience Program, Ohio Wesleyan University ABSTRACT METHODS QUESTION SAMPLING The proposed Megavirales order comprises members of the previously 1. Are megaviruses present in Iceland? known nucleocytoplasmic large DNA viruses (NCLDVs). Virus families in the (a) (b) Megavirales order include Poxviridae, Ascoviridae, and the recently explored families of megaviruses infecting free living amoeba such as Mimiviridae, Figure 4. Marseilleviridae, and Pandoraviridae. Megaviruses have been isolated from The top 4cm of soil was water and soil samples from Chile, France, India, and the United States. We collected with a sterile RESULTS chose to study the occurrence of megaviruses in Iceland because of the diverse scoop. 2-3 replicates were Percent lysis of amoeba in various Icelandic soils habitats all within one island. No research has been carried out on the presence taken on a 5m transect.
    [Show full text]
  • List of the Main “Giant” Viruses Known As of Today (April, 18 , 2018)
    List of the main “giant” viruses known as of today (April, 18th, 2018) (complete genomes only)(unpublished in green)(largest genome among their group in red) Family Prototype Virion type Dimension Genome, size, GC% Specific feature Host Mimiviridae Icosahedral Linear DNA MutS, Virophage Acanthamoeba Megavirinae Long fibers Clade A Mimivirus Icosahedral 500+250 nm 1.18 Mb, 28% 4 AaRS Acanthamoeba Clade B Moumouvirus Icosahedral 420+200 nm 1.02 Mb, 25% 5 AaRS Acanthamoeba Clade C Megavirus chilensis Icosahedral 520+150 nm 1.26 Mb, 25% 7 AaRS Acanthamoeba Clade ? Tupanvirus soda lake Icosah. + tail 520+150 nm 1.44 – 1.51 Mb, 28% 20 AaRS Acant. & Verma. Platanovirus KSL-5 Icosahedral 290+140 nm ? ? Saccamoeba Mesomimivirinae Algae-infecting P. globosa virus (PgV) Icosahedral 150 nm 460 kb, 32% Phaeocystis H. ericina virus (CeV) Icosahedral 160 nm 474 kb, 25% Haptolina Aav Icosahedral 140 nm 371 kb, 28.7% Aureococcus TetV Icosahedral 257 nm 668 kb, 41.2%, C DNA Fermentation ? Tetraselmis Aquavirinae CroV Icosahedral 300 nm 693 kb, 23% 1 AaRS Cafeteria BsV Icosahedral 300 nm 1.386 Mb, 25% Bodo saltans Klosneuvirus No isolate ? 1.57 Mb, 28.6% 19 AaRS unknown Catovirus No isolate ? 1.53 Mb, 26.4% 15 AaRS unknown Hokovirus No isolate ? 1.33 Mb, 21.4% 13 AaRS unknown Indivirus No isolate ? 860 kb, 26.6% 3 AaRS unknown Family Prototype Virion type Dimension Genome, size, GC% Host Marseilleviridae icosahedral Acanthamoeba Clade A Marseillevirus (A) icosahedral 200 nm C DNA, 368 kb, 45% Acanthamoeba Melbournevirus icosahedral 200 nm C DNA, 369, 44.7%
    [Show full text]
  • Diversity and Evolution of the Emerging Pandoraviridae Family
    ARTICLE DOI: 10.1038/s41467-018-04698-4 OPEN Diversity and evolution of the emerging Pandoraviridae family Matthieu Legendre 1, Elisabeth Fabre1, Olivier Poirot1, Sandra Jeudy1, Audrey Lartigue1, Jean-Marie Alempic1, Laure Beucher2, Nadège Philippe 1, Lionel Bertaux1, Eugène Christo-Foroux1, Karine Labadie3, Yohann Couté 2, Chantal Abergel 1 & Jean-Michel Claverie1 With DNA genomes reaching 2.5 Mb packed in particles of bacterium-like shape and 1234567890():,; dimension, the first two Acanthamoeba-infecting pandoraviruses remained up to now the most complex viruses since their discovery in 2013. Our isolation of three new strains from distant locations and environments is now used to perform the first comparative genomics analysis of the emerging worldwide-distributed Pandoraviridae family. Thorough annotation of the genomes combining transcriptomic, proteomic, and bioinformatic analyses reveals many non-coding transcripts and significantly reduces the former set of predicted protein-coding genes. Here we show that the pandoraviruses exhibit an open pan-genome, the enormous size of which is not adequately explained by gene duplications or horizontal transfers. As most of the strain-specific genes have no extant homolog and exhibit statistical features comparable to intergenic regions, we suggest that de novo gene creation could contribute to the evolution of the giant pandoravirus genomes. 1 Aix Marseille Univ, CNRS, Structural and Genomic Information Laboratory, UMR 7256 (IMM FR 3479), 163 Avenue de Luminy, Case 934, 13288 Marseille cedex 9, France. 2 Univ. Grenoble Alpes, CEA, Inserm, BIG-BGE, 38000 Grenoble, France. 3 CEA-Institut de Génomique, GENOSCOPE, Centre National de Séquençage, 2 rue Gaston Crémieux, CP5706, 91057 Evry Cedex, France.
    [Show full text]
  • Pandoravirus Celtis Illustrate
    Pandoravirus Celtis Illustrates the Microevolution Processes at Work in the Giant Pandoraviridae Genomes Matthieu Legendre, Jean-Marie Alempic, Nadège Philippe, Audrey Lartigue, Sandra Jeudy, Olivier Poirot, Ngan Thi Ta, Sébastien Nin, Yohann Couté, Chantal Abergel, et al. To cite this version: Matthieu Legendre, Jean-Marie Alempic, Nadège Philippe, Audrey Lartigue, Sandra Jeudy, et al.. Pandoravirus Celtis Illustrates the Microevolution Processes at Work in the Giant Pandoraviridae Genomes. Frontiers in Microbiology, Frontiers Media, 2019, 10, pp.430. 10.3389/fmicb.2019.00430. hal-02135607 HAL Id: hal-02135607 https://hal.archives-ouvertes.fr/hal-02135607 Submitted on 5 Nov 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution| 4.0 International License fmicb-10-00430 March 6, 2019 Time: 17:27 # 1 ORIGINAL RESEARCH published: 08 March 2019 doi: 10.3389/fmicb.2019.00430 Pandoravirus Celtis Illustrates the Microevolution Processes at Work in the Giant Pandoraviridae Genomes Matthieu Legendre1, Jean-Marie
    [Show full text]
  • A Novel 80-Nm Virus Infecting Acanthamoeba Castellanii
    Yaravirus: A novel 80-nm virus infecting Acanthamoeba castellanii Paulo V. M. Borattoa,b,1, Graziele P. Oliveiraa,b,1, Talita B. Machadoa, Ana Cláudia S. P. Andradea, Jean-Pierre Baudoinb,c, Thomas Klosed, Frederik Schulze, Saïd Azzab,c, Philippe Decloquementb,c, Eric Chabrièreb,c, Philippe Colsonb,c, Anthony Levasseurb,c, Bernard La Scolab,c,2, and Jônatas S. Abrahãoa,2 aLaboratório de Vírus, Instituto de Ciências Biológicas, Departamento de Microbiologia, Universidade Federal de Minas Gerais, Belo Horizonte, MG 31270-901, Brazil; bMicrobes, Evolution, Phylogeny and Infection, Aix-Marseille Université UM63, Institut de Recherche pour le Développement 198, Assistance Publique–Hôpitaux de Marseille, 13005 Marseille, France; cInstitut Hospitalo-Universitaire Méditerranée Infection, Faculté de Médecine, 13005 Marseille, France; dDepartment of Biological Sciences, Purdue University, West Lafayette, IN 47907; and eDepartment of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 Edited by James L. Van Etten, University of Nebraska-Lincoln, Lincoln, NE, and approved June 2, 2020 (received for review January 29, 2020) Here we report the discovery of Yaravirus, a lineage of amoebal others. NCDLVs have dsDNA genomes and were proposed to virus with a puzzling origin and evolution. Yaravirus presents share a monophyletic origin based on criteria that include the 80-nm-sized particles and a 44,924-bp dsDNA genome encoding sharing of a set of ancestral vertically inherited genes (17, 18). for 74 predicted proteins. Yaravirus genome annotation showed From this handful set of genes, a core gene cluster is found to be that none of its genes matched with sequences of known organ- present in almost all members of the NCLDVs, being composed isms at the nucleotide level; at the amino acid level, six predicted by five distinct genes, namely a DNA polymerase family B, a proteins had distant matches in the nr database.
    [Show full text]
  • The Multiple Origins of Proteins Present in Tupanvirus Particles
    Available online at www.sciencedirect.com ScienceDirect The multiple origins of proteins present in tupanvirus particles 1 Paulo Victor de Miranda Boratto , 1 Ana Cla´ udia dos Santos Pereira Andrade , 1 2 Rodrigo Arau´ jo Lima Rodrigues , Bernard La Scola and 1 Joˆ natas Santos Abraha˜ o In the last few decades, the isolation of amoebae-infecting that stands out in this scenario is the giant viruses. giant viruses has challenged established principles related to These viruses are classified as nucleo-cytoplasmic large the definition of virus, their evolution, and their particle DNA viruses (NCLDVs – proposed order Megavirales). structures represented by a variety of shapes and sizes. They have dimensions larger than 200 nm and exten- Tupanviruses are one of the most recently described amoebae- sive genomes reaching up to 2.5 Mb that can encode infecting viruses and exhibit a peculiar morphology with a thousands proteins [2–5]. cylindrical tail attached to the capsid. Proteomic analysis of purified viral particles revealed that virions are composed of Most giant viruses, such as mimivirus, pandoravirus, over one hundred proteins with different functions. The putative and pithovirus, are associated with free-living amoebae origin of these proteins had not yet been investigated. Here, we of the genus Acanthamoeba [3–5]. Some giant viruses provide evidences for multiple origins of the proteins present in though, have been described infecting flagellate micro- tupanvirus particles, wherein 20% originate from members of organisms such as Cafeteria roenbergensis virus and Bodo the archaea, bacteria and eukarya. saltans virus, and both groups are phylogenetically related to the family Mimiviridae [6,7].
    [Show full text]
  • Host Range and Coding Potential of Eukaryotic Giant Viruses
    viruses Review Host Range and Coding Potential of Eukaryotic Giant Viruses Tsu-Wang Sun 1,2 , Chia-Ling Yang 1, Tzu-Tong Kao 1 , Tzu-Haw Wang 1, Ming-Wei Lai 1 and Chuan Ku 1,2,* 1 Institute of Plant and Microbial Biology, Academia Sinica, Taipei 11529, Taiwan; [email protected] (T.-W.S.); [email protected] (C.-L.Y.); [email protected] (T.-T.K.); [email protected] (T.-H.W.); [email protected] (M.-W.L.) 2 Genome and Systems Biology Degree Program, National Taiwan University and Academia Sinica, Taipei 10617, Taiwan * Correspondence: [email protected] Received: 7 November 2020; Accepted: 19 November 2020; Published: 21 November 2020 Abstract: Giant viruses are a group of eukaryotic double-stranded DNA viruses with large virion and genome size that challenged the traditional view of virus. Newly isolated strains and sequenced genomes in the last two decades have substantially advanced our knowledge of their host diversity, gene functions, and evolutionary history. Giant viruses are now known to infect hosts from all major supergroups in the eukaryotic tree of life, which predominantly comprises microbial organisms. The seven well-recognized viral clades (taxonomic families) have drastically different host range. Mimiviridae and Phycodnaviridae, both with notable intrafamilial genome variation and high abundance in environmental samples, have members that infect the most diverse eukaryotic lineages. Laboratory experiments and comparative genomics have shed light on the unprecedented functional potential of giant viruses, encoding proteins for genetic information flow, energy metabolism, synthesis of biomolecules, membrane transport, and sensing that allow for sophisticated control of intracellular conditions and cell-environment interactions.
    [Show full text]
  • Mimiviridae: an Expanding Family of Highly Diverse Large Dsdna Viruses Infecting a Wide Phylogenetic Range of Aquatic Eukaryotes Jean-Michel Claverie, Chantal Abergel
    Mimiviridae: An Expanding Family of Highly Diverse Large dsDNA Viruses Infecting a Wide Phylogenetic Range of Aquatic Eukaryotes Jean-Michel Claverie, Chantal Abergel To cite this version: Jean-Michel Claverie, Chantal Abergel. Mimiviridae: An Expanding Family of Highly Diverse Large dsDNA Viruses Infecting a Wide Phylogenetic Range of Aquatic Eukaryotes. Viruses, MDPI, 2018, 10 (9), pp.506. 10.3390/v10090506. hal-01887346 HAL Id: hal-01887346 https://hal-amu.archives-ouvertes.fr/hal-01887346 Submitted on 5 Nov 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution| 4.0 International License viruses Review Mimiviridae: An Expanding Family of Highly Diverse Large dsDNA Viruses Infecting a Wide Phylogenetic Range of Aquatic Eukaryotes Jean-Michel Claverie * and Chantal Abergel Aix Marseille Univ, CNRS, IGS, Structural and Genomic Information Laboratory (UMR7256), Mediterranean Institute of Microbiology (FR3479), 163 Avenue de Luminy, F-13288 Marseille, France; [email protected] * Correspondence: [email protected]; Tel.: +33-49-182-5420 Received: 14 August 2018; Accepted: 15 September 2018; Published: 18 September 2018 Abstract: Since 1998, when Jim van Etten’s team initiated its characterization, Paramecium bursaria Chlorella virus 1 (PBCV-1) had been the largest known DNA virus, both in terms of particle size and genome complexity.
    [Show full text]