A Novel Rubi-Like Virus in the Pacific Electric Ray (Tetronarce Californica) Reveals the Complex Evolutionary History of The

A Novel Rubi-Like Virus in the Pacific Electric Ray (Tetronarce Californica) Reveals the Complex Evolutionary History of The

bioRxiv preprint doi: https://doi.org/10.1101/2021.02.08.430315; this version posted February 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. 1 A novel rubi-like virus in the Pacific electric ray 2 (Tetronarce californica) reveals the complex 3 evolutionary history of the Matonaviridae 4 Rebecca M. Grimwood1, Edward C. Holmes2, Jemma L. Geoghegan1,3* 5 1Department of Microbiology and Immunology, University of Otago, Dunedin 9016, New Zealand; 6 [email protected] (R.M.G.); [email protected] (J.L.G.) 7 2Marie Bashir Institute for Infectious Diseases and Biosecurity, School of Life and Environmental Sciences and 8 School of Medical Sciences, University of Sydney, NSW 2006, Australia; [email protected] 9 3Institute of Environmental Science and Research, Wellington 5018, New Zealand; 10 [email protected] 11 *Correspondence: [email protected] 12 13 Abstract: Rubella virus (RuV) is the causative agent of rubella (“German measles”) and remains a 14 global health concern. Until recently, RuV was the only known member of the genus Rubivirus and 15 the only virus species classified within the Matonaviridae family of positive-sense RNA viruses. 16 Other matonaviruses, including two new rubella-like viruses, Rustrela virus and Ruhugu virus, have 17 been identified in several mammalian species, along with more divergent viruses in fish and 18 reptiles. To screen for the presence of additional novel rubella-like viruses we mined published 19 transcriptome data using genome sequences from Rubella, Rustrela, and Ruhugu viruses as baits. 20 From this, we identified a novel rubella-like virus in a transcriptome of Tetronarce californica (Pacific 21 electric ray) that is more closely related to mammalian Rustrela virus than to the divergent fish 22 matonavirus and indicative of a complex pattern of cross-species virus transmission. Analysis of 23 host reads confirmed that the sample analysed was indeed from a Pacific electric ray, and two other 24 viruses identified in this animal, from the Arenaviridae and Reoviridae, grouped with other fish 25 viruses. These findings indicate that the evolutionary history of the Matonaviridae is more complex 26 than previously thought and highlights the vast number of viruses still to be discovered. 27 28 Keywords: meta-transcriptomics; virus discovery; rubella; ruhugu; rustrela; Matonaviridae; 29 Arenaviridae; Reoviridae; fish; virus evolution; phylogeny 30 31 1. Introduction 32 Rubella virus (RuV) (Matonaviridae: Rubivirus) is a single-stranded positive-sense RNA virus [1]. It is 33 best known as the causative agent of rubella, sometimes known as “German measles” - a relatively 34 mild measles-like illness [2]. More severely, RuV is also teratogenic and can result in complications 35 such as miscarriage or congenital rubella syndrome (CRS) if contracted during pregnancy [3, 4]. 36 Despite the availability of effective vaccines, including the combination measles, mumps, and 37 rubella (MMR) vaccine [5], they are deployed in only around half of the world’s countries [6]. RuV 38 therefore remains a significant global health concern, particularly in developing countries where 39 childhood infection rates are high and vaccination efforts are low or non-existent [7]. 40 The disease rubella was first described in 1814 [8] and until recently RuV was the only known 41 species within the genus Rubivirus, itself the only genus in the family Matonaviridae [9]. This picture 42 has changed with the discovery of other rubella-like viruses through metagenomic technologies. 43 The first of these, Guangdong Chinese water snake rubivirus, was discovered via a large virological bioRxiv preprint doi: https://doi.org/10.1101/2021.02.08.430315; this version posted February 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. 2 of 12 44 survey of vertebrates in China and is highly divergent from RuV, sharing around 34% nucleotide 45 similarity [10]. Tiger flathead matonavirus was identified through a meta-transcriptomic exploration 46 of Australian marine fish [11], which, along with Guangdong Chinese water snake rubivirus, forms a 47 sister clade to RuV. Even more recently, two novel rubella-like virus species were described: 48 Rustrela virus, identified in several mammalian species close by and in a zoo in Germany, and 49 Ruhugu virus from bats in Uganda [12]. These viruses represent the first non-human mammalian 50 viruses within this family. Such findings begin to shed light on the deeper evolutionary history of 51 this previously elusive viral family. 52 The introduction and popularisation of metagenomic methods to identify novel viruses within 53 tissue or environmental samples has enabled their rapid discovery and provided important insights 54 into viral diversity and evolution [13, 14]. Here, we performed data-mining of metagenomic data to 55 determine if related matonviruses were present in other vertebrate taxa, screening the National 56 Center for Biotechnology Information (NCBI) Transcriptome Shotgun Assembly (TSA) database 57 against the genome sequences of Rubella, Ruhugu, and Rustrela viruses. 58 59 2. Materials and Methods 60 2.1 TSA mining for novel matonaviruses. To identify the presence of novel rubella-like viruses in 61 vertebrates, translated genome sequences of Rustrela virus (MN552442 , MT274274, and MT274275), 62 Ruhugu virus (MN547623), and RuV strain 1A (KU958641) were screened against transcriptome 63 assemblies available in NCBI’s TSA database (https://www.ncbi.nlm.nih.gov/genbank/tsa/) using 64 the translated Basic Local Alignment Search Tool (tBLASTn) algorithm [15]. Searches were 65 restricted to vertebrates (taxonomic identifier: 7742), excluding Homo sapiens (taxonomic identifier: 66 9606) and utilised the BLOSUM45 matrix to increase the chance of finding highly divergent viruses. 67 Putative viral sequences were then queried using the DIAMOND BLASTx (v.2.02.2) [16] algorithm 68 against the non-redundant (nr) protein database for confirmation. 69 2.2 Pacific electric ray transcriptome assembly and annotation. To recover further fragments of the novel 70 matonavirus genome and screen for other viruses in the Pacific electric ray transcriptome (see 71 Results), raw sequencing reads were downloaded from the NCBI Short Read Archive (SRA) 72 (Bioproject: PRJNA322346). These reads were first quality trimmed and then assembled de novo 73 using Trinity RNA-Seq (v2.11) [17]. Assembled contigs were annotated based on similarity searches 74 against the NCBI nucleotide (nt) database using the BLASTn algorithm and the non-redundant 75 protein (nr) database using DIAMOND BLASTx (v.2.02.2) [16]. Novel viral sequences were collated 76 by searching the annotated contigs and these potential viruses were confirmed with additional 77 BLASTx searches against nr and nt databases. 78 2.3 Abundance estimations. Virus abundance, expressed as the standardised total number of raw viral 79 sequencing reads that comprise a given contig, was calculated using the ‘align and estimate’ 80 module available for Trinity RNA-seq with RNA-seq by Expectation-Maximization (RSEM) [18] as 81 the abundance estimation method and Bowtie 2 [19] as the alignment method. 82 2.4 Phylogenetic analysis. A range of virus genomes representative of each virus family and sub- 83 family under consideration, and for a range of host species, were retrieved from GenBank for 84 phylogenetic analysis. The amino acid sequences of previously described viruses were aligned with bioRxiv preprint doi: https://doi.org/10.1101/2021.02.08.430315; this version posted February 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. 3 of 12 85 those generated here with Multiple Alignment using Fast Fourier Transform (MAFFT) (v.7.4) [20] 86 employing the E-INS-i algorithm. All sequence alignments were visualised in Geneious Prime 87 (v2020.2.4) [21]. Ambiguously aligned regions were removed using trimAl (v.1.2) [22] employing 88 the ‘gappyout’ flag. The maximum likelihood approach available in IQ-TREE (v.1.6.12) [23] was 89 used to generate phylogenetic trees for each family. ModelFinder [24] was used to determine the 90 best-fit model of amino acid substitution for each tree and 1000 ultrafast bootstrap replicates [25] 91 were performed to assess nodal support. Phylogenetic trees were annotated with FigTree (v.1.4.4) 92 [26]. 93 2.5 Virus nomenclature. The new T. californica viruses described were named with ‘Tetronarce’ 94 preceding their respective family name to signify their electric ray host. 95 2.6 Data availability. The Rustrela and Ruhugu virus sequences used for data mining in the project are 96 available under Bioproject PRJNA576343. The raw T. californica sequence reads are available at 97 Bioproject PRJNA322346. All other publicly available sequences used in analyses can be accessed 98 via their corresponding accession codes (see relevant figures). Consensus sequences of the new 99 viruses identified in this study will be available on GenBank. 100 101 3. Results 102 3.1 Identification of a novel Matonaviridae in Pacific electric ray. We identified a complete structural 103 polyprotein and several fragments of the RNA-dependent RNA polymerase (RdRp) from a 104 divergent rubi-like virus that we tentatively named Tetronarce matonavirus. Screening of Rustrela, 105 Ruhugu, and Rubella 1A virus sequences against the TSA database revealed matches to a contig of 106 3006 nucleotides (1001 amino acids) from a transcriptome of the electric organ of a female Pacific 107 electric ray (Tetronarce californica) [27], ranging in sequence identity from 37.6 – 44.9%.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    12 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us