Four Novel Algal Virus Genomes Discovered from Yellowstone Lake

Total Page:16

File Type:pdf, Size:1020Kb

Four Novel Algal Virus Genomes Discovered from Yellowstone Lake www.nature.com/scientificreports OPEN Four novel algal virus genomes discovered from Yellowstone Lake metagenomes Received: 09 March 2015 1 1,2 3 1,4 1,4 1,5 Accepted: 17 September 2015 Weijia Zhang , Jinglie Zhou , Taigang Liu , Yongxin Yu , Yingjie Pan , Shuling Yan 1,4 Published: 13 October 2015 & Yongjie Wang Phycodnaviruses are algae-infecting large dsDNA viruses that are widely distributed in aquatic environments. Here, partial genomic sequences of four novel algal viruses were assembled from a Yellowstone Lake metagenomic data set. Genomic analyses revealed that three Yellowstone Lake phycodnaviruses (YSLPVs) had genome lengths of 178,262 bp, 171,045 bp, and 171,454 bp, respectively, and were phylogenetically closely related to prasinoviruses (Phycodnaviridae). The fourth (YSLGV), with a genome length of 73,689 bp, was related to group III in the extended family Mimiviridae comprising Organic Lake phycodnaviruses and Phaeocystis globosa virus 16 T (OLPG). A pair of inverted terminal repeats was detected in YSLPV1, suggesting that its genome is nearly complete. Interestingly, these four putative YSL giant viruses also bear some genetic similarities to Yellowstone Lake virophages (YSLVs). For example, they share nine non-redundant homologous genes, including ribonucleotide reductase small subunit (a gene conserved in nucleo-cytoplasmic large DNA viruses) and Organic Lake virophage OLV2 (conserved in the majority of YSLVs). Additionally, putative multidrug resistance genes (emrE) were found in YSLPV1 and YSLPV2 but not in other viruses. Phylogenetic trees of emrE grouped YSLPVs with algae, suggesting that horizontal gene transfer occurred between giant viruses and their potential algal hosts. Phytoplankton (microalgae), based on conservative estimates of more than 100,000 species1, is abundant in the sea. These algae form the base of the marine food web as their photosynthetic activities provide carbon sources and energy for life in the marine ecosystem and they regulate numerous aspects of the global environment1. Marine algae-infecting viruses, particularly phycodnaviruses, are important for controlling the composition of planktonic communities2. The phycodnaviruses are a genetically diverse, morphologically similar group of double-stranded DNA viruses that infect eukaryotic algae and presently contain six genera3. Members of the Coccolithovirus, Phaeovirus, Prasinovirus, Prymnesiovirus and Raphidovirus genera infect marine algae, while chlorovi- ruses (Chlorovirus) infect freshwater algae1,4. The genomes of phycodnaviruses are generally smaller (160 to 560 kb) than those of mimiviruses belonging to Lineage A of Group I, which typically have genomes larger than 1 Mb. Thus far, complete genomes of phycodnaviruses, such as Ostreococcus viruses (OtV1, OtV5)5,6, Micromonas sp. RCC1109 virus MpV17 and five chloroviruses8–10, have been well characterized. The Phycodnaviridae, together with six other giant virus families, were defined as nucleo-cytoplasmic large DNA viruses (NCLDVs)11 and were proposed to be reclassified into a new order Megavirales due to the following shared features12: i) giant viral particles with capsid diameters > 150 nm and genome sizes 1College of Food Science and Technology, Shanghai Ocean University, Shanghai, China. 2Department of Biological Sciences, Auburn University, Auburn, AL, USA. 3College of Information Technology, Shanghai Ocean University, Shanghai, China. 4Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage & Preservation, Ministry of Agriculture, Shanghai, China. 5Institute of Biochemistry and Molecular Cell Biology, University of Göttingen, Göttingen, Germany. Correspondence and requests for materials should be addressed to Y.W. (email: [email protected]) SCIENTIFIC REPORTS | 5:15131 | DOI: 10.1038/srep15131 1 www.nature.com/scientificreports/ > 100 kb; ii) the presence of nine class I core genes in all seven families or 47 NCLDV conserved genes in one or two families13; iii) potential for infection with virophages. Interestingly, virophages were found to be associated with giant viruses since the first virophage Sputnik was isolated in association with a mamavirus, a relative of mimivirus, in a water-cooling tower in Paris14. Virophages have circular double-stranded DNA genomes of 18–30 kb which encode more than 20 genes. Recently, Zhou et al.15,16 observed extensive genetic diversity of virophages in Yellowstone Lake metagenomic datasets and have assembled seven complete virophage genomes (Yellowstone Lake virophages, YSLVs). In this study, to provide insight into the diversity of giant viruses in Yellowstone Lake, we assembled giant viral genomes from the same metagenomic datasets in which YSLVs were discovered15,16. Based on comparative genomic and phylogenetic analyses, four novel giant viruses detected in YSL appeared to infect algae, and horizontal gene transfer was observed among giant viruses, YSLVs, and their potential algae hosts. Material and Methods Sequence assembly. Sequence assembly was performed as previously described by Zhou et al.15,16. Briefly, the Yellowstone Lake metagenomics dataset17 was downloaded from the CAMERA 2.0 Portal and assembled de novo using Newbler v2.6 (Roche). Contigs derived from the assembly were con- structed as a local database in order to perform tBLASTx searches for viral major capsid protein (MCP)- related sequences. Since YSLVs are most closely related to Organic Lake virophage (OLV)15,16, and OLV is thought to be the viral parasite of Organic Lake phycodnaviruses (OLPVs)15,18, the potential giant viral hosts of YSLVs may be closely related to OLPVs. Therefore, MCPs of OLPVs were used as refer- ence sequences to search (tblastx, E-value < 10−5) for homologous sequences in the constructed contig database described above. The OLPV MCP-related contigs over 10 kb in length with good quality (low E-value and high identity) were re-assembled using the Yellowstone Lake metagenomic dataset until the assembled sequences no longer extended. All sequence assemblies (with a minimum overlap length of 25 bp and minimum overlap identity of 95%) were performed using GeneiousPro15. Assembly check. To further validate the assembled consensus sequences, duplicate reads were first removed from the data sets of scaffold reads with CD-HIT Suite19. Sequence identity cut-offs were set as: 0.97, 0.95 and 0.90. The obtained unique reads were then re-assembled, and the consensus sequences were compared to the original sequences using the data sets without prior removal of duplicates in order to check the quality and accuracy of sequence assembly. Genomic sequence analysis. The prediction and annotation of open reading frames (ORF) was performed as described by Zhou et al. in 201315. Each predicted ORF contained an ATG start codon, and had a minimum size of 150 bp, standard genetic code, and a stop codon. Translated amino acid sequences were used to search (E-value < 10−1) for homologs in NCBI nr database using the BLASTp program. One top hit to virus and/or non-virus was recorded. Functional annotation of ORFs was performed using the InterProScan program (http://www.ebi.ac.uk/Tools/pfa/iprscan/), Conserved Domain search20 on the NCBI server, and HHpred (http://toolkit.tuebingen.mpg.de/hhpred). In addition, 412 proteins of OLPV1, 326 of OLPV2 and 434 of Phaeocystis globosa virus 16 T (PgV- 16 T) were analyzed for orthologous protein clusters shared with YSLGV proteins using the COG algorithm21. All predicted ORFs (E-value < 10−1) were searched against a local database, which is comprised of all predicted proteins of seven YSLVs. Analysis of PgVV (a pro-virophage associated with PgV-16 T) proteins was performed by searching their homolog against an extensive database containing sequences of all the predicted proteins of known virophages. Genomic sequences were aligned using the Mauve program on the Geneious Pro platform (default parameter)22. Repetitive sequences were checked on the softberry website (http://linux1.softberry.com/ berry.phtml? topic = frep&subgroup = repeat&group = programs) with default parameters, and long ter- minal repeats using LTR-Finder23. Phylogenetic analysis. Homologs of MCP, DNA polymerase B family (PolB), poxvirus late tran- scription factor 3 (Pox-VLTF3), topoisomerase II (Topo II), vaccinia virus (VV) A32-like packaging ATPase, ribonucleotide reductase small subunit (RNR2), multidrug resistance protein (emrE), and OLV ORF2 (OLV2) were used to reconstruct phylogenetic trees. Reconstruction was initiated by aligning multiple amino acid sequences using the MUSCLE program24, followed by tree construction using the JTT model with a bootstrap value of 100. Phylogenetic analysis of emrE included sequences from three cellular life domains and was based on Bayesian Inference (parameter set: rate variation: gamma; rate matrix: poisson). All analyses were performed on the Geneious Pro platform. Accession numbers. The genomic sequences of four YSL algal viruses have been deposited in DDBJ under accession numbers LC015646-LC015649 for YSLGV and YSLPV 1–3, respectively. SCIENTIFIC REPORTS | 5:15131 | DOI: 10.1038/srep15131 2 www.nature.com/scientificreports/ Figure 1. Physical maps of partial genomes of YSLPVs and YSLGV. ORFs are indicated in box arrows and are labeled in different colors that represent different functional categories. Repeats are shown in diamond. Numbers outside the genomes indicate nucleotide positions. Viral names, genomic length and G+ C content, and the total number of predicted ORFs are shown
Recommended publications
  • A New Family of Hybrid Virophages from an Animal Gut Metagenome Natalya Yutin, Vladimir V Kapitonov and Eugene V Koonin*
    Yutin et al. Biology Direct (2015) 10:19 DOI 10.1186/s13062-015-0054-9 DISCOVERY NOTES Open Access A new family of hybrid virophages from an animal gut metagenome Natalya Yutin, Vladimir V Kapitonov and Eugene V Koonin* Abstract Search of metagenomics sequence databases for homologs of virophage capsid proteins resulted in the discovery of a new family of virophages in the sheep rumen metagenome. The genomes of the rumen virophages (RVP) encode a typical virophage major capsid protein, ATPase and protease combined with a Polinton-type, protein primed family B DNA polymerase. The RVP genomes appear to be linear molecules, with terminal inverted repeats. Thus, the RVP seem to represent virophage-Polinton hybrids that are likely capable of formation of infectious virions. Virion proteins of mimiviruses were detected in the same metagenomes as the RVP suggesting that the virophages of the new family parasitize on giant viruses that infect protist inhabitants of the rumen. This article was reviewed by Mart Krupovic and Kenneth Stedman; for complete reviews, see the Reviewers’ Reports section. Findings been isolated as infectious particles and shown to be associ- With the rapid increase of the quantity and quality of ated with different members of the family Mimiviridae available metagenomics sequences, metagenomes have [11-13], and 9 additional virophage genomes have been as- become a rich source for the discovery of novel viruses sembled from aquatic metagenomes [14-16]. The first vir- [1-3]. A prominent case in point is the recent discovery ophage, named Sputnik, was discovered as a parasite of of a novel, abundant and diversified group of viruses that Acanthamoeba castellani mimivirus [11], and reproduction are chimeras of genes from single-stranded DNA and of the related Zamilon virophage is supported by several positive-strand RNA viruses [4-7].
    [Show full text]
  • Chapitre Quatre La Spécificité D'hôtes Des Virophages Sputnik
    AIX-MARSEILLE UNIVERSITE FACULTE DE MEDECINE DE MARSEILLE ECOLE DOCTORALE DES SCIENCES DE LA VIE ET DE LA SANTE THESE DE DOCTORAT Présentée par Morgan GAÏA Né le 24 Octobre 1987 à Aubagne, France Pour obtenir le grade de DOCTEUR de l’UNIVERSITE AIX -MARSEILLE SPECIALITE : Pathologie Humaine, Maladies Infectieuses Les virophages de Mimiviridae The Mimiviridae virophages Présentée et publiquement soutenue devant la FACULTE DE MEDECINE de MARSEILLE le 10 décembre 2013 Membres du jury de la thèse : Pr. Bernard La Scola Directeur de thèse Pr. Jean -Marc Rolain Président du jury Pr. Bruno Pozzetto Rapporteur Dr. Hervé Lecoq Rapporteur Faculté de Médecine, 13385 Marseille Cedex 05, France URMITE, UM63, CNRS 7278, IRD 198, Inserm 1095 Directeur : Pr. Didier RAOULT Avant-propos Le format de présentation de cette thèse correspond à une recommandation de la spécialité Maladies Infectieuses et Microbiologie, à l’intérieur du Master des Sciences de la Vie et de la Santé qui dépend de l’Ecole Doctorale des Sciences de la Vie de Marseille. Le candidat est amené à respecter des règles qui lui sont imposées et qui comportent un format de thèse utilisé dans le Nord de l’Europe permettant un meilleur rangement que les thèses traditionnelles. Par ailleurs, la partie introduction et bibliographie est remplacée par une revue envoyée dans un journal afin de permettre une évaluation extérieure de la qualité de la revue et de permettre à l’étudiant de commencer le plus tôt possible une bibliographie exhaustive sur le domaine de cette thèse. Par ailleurs, la thèse est présentée sur article publié, accepté ou soumis associé d’un bref commentaire donnant le sens général du travail.
    [Show full text]
  • Complete Sections As Applicable
    This form should be used for all taxonomic proposals. Please complete all those modules that are applicable (and then delete the unwanted sections). For guidance, see the notes written in blue and the separate document “Help with completing a taxonomic proposal” Please try to keep related proposals within a single document; you can copy the modules to create more than one genus within a new family, for example. MODULE 1: TITLE, AUTHORS, etc (to be completed by ICTV Code assigned: 2015.001a-kF officers) Short title: A new family and two new genera for classification of virophages two new species (e.g. 6 new species in the genus Zetavirus) Modules attached 1 2 3 4 5 (modules 1 and 10 are required) 6 7 8 9 10 Author(s): Matthias Fischer – Max Planck Institute for Medical Research, Germany Mart Krupovic – Institut Pasteur, France Jens H. Kuhn – NIH/NIAID/IRF-Frederick, Maryland, USA Bernard La Scola – Aix Marseille Université, France Didier Raoult – Aix Marseille Université, France Corresponding author with e-mail address: Matthias Fischer, [email protected] Mart Krupovic, [email protected] List the ICTV study group(s) that have seen this proposal: A list of study groups and contacts is provided at http://www.ictvonline.org/subcommittees.asp . If in doubt, contact the appropriate subcommittee chair (fungal, invertebrate, plant, prokaryote or vertebrate viruses) ICTV Study Group comments (if any) and response of the proposer: Date first submitted to ICTV: June 11, 2015 Date of this revision (if different to above): ICTV-EC comments and response of the proposer: Fungal and Protist Viruses Subcommittee Chair: Proposal approved for submission.
    [Show full text]
  • Elisabeth Mendes Martins De Moura Diversidade De Vírus DNA
    Elisabeth Mendes Martins de Moura Diversidade de vírus DNA autóctones e alóctones de mananciais e de esgoto da região metropolitana de São Paulo Tese apresentada ao Programa de Pós- Graduação em Microbiologia do Instituto de Ciências Biomédicas da Universidade de São Paulo, para obtenção do Titulo de Doutor em Ciências. Área de concentração: Microbiologia Orienta: Prof (a). Dr (a). Dolores Ursula Mehnert versão original São Paulo 2017 RESUMO MOURA, E. M. M. Diversidade de vírus DNA autóctones e alóctones de mananciais e de esgoto da região metropolitana de São Paulo. 2017. 134f. Tese (Doutorado em Microbiologia) - Instituto de Ciências Biomédicas, Universidade de São Paulo, São Paulo, 2017. A água doce no Brasil, assim como o seu consumo é extremamente importante para as diversas atividades criadas pelo ser humano. Por esta razão o consumo deste bem é muito grande e consequentemente, provocando o seu impacto. Os mananciais são normalmente usados para abastecimento doméstico, comercial, industrial e outros fins. Os estudos na área de ecologia de micro-organismos nos ecossistemas aquáticos (mananciais) e em esgotos vêm sendo realizados com mais intensidade nos últimos anos. Nas últimas décadas foi introduzido o conceito de virioplâncton com base na abundância e diversidade de partículas virais presentes no ambiente aquático. O virioplâncton influencia muitos processos ecológicos e biogeoquímicos, como ciclagem de nutriente, taxa de sedimentação de partículas, diversidade e distribuição de espécies de algas e bactérias, controle de florações de fitoplâncton e transferência genética horizontal. Os estudos nesta área da virologia molecular ainda estão muito restritos no país, bem como muito pouco se conhece sobre a diversidade viral na água no Brasil.
    [Show full text]
  • Changes to Virus Taxonomy 2004
    Arch Virol (2005) 150: 189–198 DOI 10.1007/s00705-004-0429-1 Changes to virus taxonomy 2004 M. A. Mayo (ICTV Secretary) Scottish Crop Research Institute, Invergowrie, Dundee, U.K. Received July 30, 2004; accepted September 25, 2004 Published online November 10, 2004 c Springer-Verlag 2004 This note presents a compilation of recent changes to virus taxonomy decided by voting by the ICTV membership following recommendations from the ICTV Executive Committee. The changes are presented in the Table as decisions promoted by the Subcommittees of the EC and are grouped according to the major hosts of the viruses involved. These new taxa will be presented in more detail in the 8th ICTV Report scheduled to be published near the end of 2004 (Fauquet et al., 2004). Fauquet, C.M., Mayo, M.A., Maniloff, J., Desselberger, U., and Ball, L.A. (eds) (2004). Virus Taxonomy, VIIIth Report of the ICTV. Elsevier/Academic Press, London, pp. 1258. Recent changes to virus taxonomy Viruses of vertebrates Family Arenaviridae • Designate Cupixi virus as a species in the genus Arenavirus • Designate Bear Canyon virus as a species in the genus Arenavirus • Designate Allpahuayo virus as a species in the genus Arenavirus Family Birnaviridae • Assign Blotched snakehead virus as an unassigned species in family Birnaviridae Family Circoviridae • Create a new genus (Anellovirus) with Torque teno virus as type species Family Coronaviridae • Recognize a new species Severe acute respiratory syndrome coronavirus in the genus Coro- navirus, family Coronaviridae, order Nidovirales
    [Show full text]
  • RESEARCH ARTICLES Gene Cluster Statistics with Gene Families
    RESEARCH ARTICLES Gene Cluster Statistics with Gene Families Narayanan Raghupathy*1 and Dannie Durand* *Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA; and Department of Computer Science, Carnegie Mellon University, Pittsburgh, PA Identifying genomic regions that descended from a common ancestor is important for understanding the function and evolution of genomes. In distantly related genomes, clusters of homologous gene pairs are evidence of candidate homologous regions. Demonstrating the statistical significance of such ‘‘gene clusters’’ is an essential component of comparative genomic analyses. However, currently there are no practical statistical tests for gene clusters that model the influence of the number of homologs in each gene family on cluster significance. In this work, we demonstrate empirically that failure to incorporate gene family size in gene cluster statistics results in overestimation of significance, leading to incorrect conclusions. We further present novel analytical methods for estimating gene cluster significance that take gene family size into account. Our methods do not require complete genome data and are suitable for testing individual clusters found in local regions, such as contigs in an unfinished assembly. We consider pairs of regions drawn from the same genome (paralogous clusters), as well as regions drawn from two different genomes (orthologous clusters). Determining cluster significance under general models of gene family size is computationally intractable. By assuming that all gene families are of equal size, we obtain analytical expressions that allow fast approximation of cluster probabilities. We evaluate the accuracy of this approximation by comparing the resulting gene cluster probabilities with cluster probabilities obtained by simulating a realistic, power-law distributed model of gene family size, with parameters inferred from genomic data.
    [Show full text]
  • Chlorovirus: a Genus of Phycodnaviridae That Infects Certain Chlorella-Like Green Algae
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Papers in Plant Pathology Plant Pathology Department 2005 Chlorovirus: A Genus of Phycodnaviridae that Infects Certain Chlorella-Like Green Algae Ming Kang University of Nebraska-Lincoln, [email protected] David Dunigan University of Nebraska-Lincoln, [email protected] James L. Van Etten University of Nebraska-Lincoln, [email protected] Follow this and additional works at: https://digitalcommons.unl.edu/plantpathpapers Part of the Plant Pathology Commons Kang, Ming; Dunigan, David; and Van Etten, James L., "Chlorovirus: A Genus of Phycodnaviridae that Infects Certain Chlorella-Like Green Algae" (2005). Papers in Plant Pathology. 130. https://digitalcommons.unl.edu/plantpathpapers/130 This Article is brought to you for free and open access by the Plant Pathology Department at DigitalCommons@University of Nebraska - Lincoln. It has been accepted for inclusion in Papers in Plant Pathology by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln. Published in Molecular Plant Pathology 6:3 (2005), pp. 213–224; doi 10.1111/j.1364-3703.2005.00281.x Copyright © 2005 Black- well Publishing Ltd. Used by permission. http://www3.interscience.wiley.com/journal/118491680/ Published online May 23, 2005. Chlorovirus: a genus of Phycodnaviridae that infects certain chlorella-like green algae Ming Kang,1 David D. Dunigan,1,2 and James L. Van Etten 1,2 1 Department of Plant Pathology and 2 Nebraska Center for Virology, University of Nebraska-Lincoln, Lincoln, NE 68583-0722, USA Correspondence — M. Kang, [email protected] ; J. L. Van Etten, [email protected] Abstract INTRODUCTION Taxonomy: Chlorella viruses are assigned to the family Phy- The chlorella viruses are large, icosahedral, plaque-form- codnaviridae, genus Chlorovirus, and are divided into ing, dsDNA viruses that infect certain unicellular, chlorella- three species: Chlorella NC64A viruses, Chlorella Pbi vi- like green algae (Van Etten et al., 1991).
    [Show full text]
  • A Field Guide to Eukaryotic Transposable Elements
    GE54CH23_Feschotte ARjats.cls September 12, 2020 7:34 Annual Review of Genetics A Field Guide to Eukaryotic Transposable Elements Jonathan N. Wells and Cédric Feschotte Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850; email: [email protected], [email protected] Annu. Rev. Genet. 2020. 54:23.1–23.23 Keywords The Annual Review of Genetics is online at transposons, retrotransposons, transposition mechanisms, transposable genet.annualreviews.org element origins, genome evolution https://doi.org/10.1146/annurev-genet-040620- 022145 Abstract Annu. Rev. Genet. 2020.54. Downloaded from www.annualreviews.org Access provided by Cornell University on 09/26/20. For personal use only. Copyright © 2020 by Annual Reviews. Transposable elements (TEs) are mobile DNA sequences that propagate All rights reserved within genomes. Through diverse invasion strategies, TEs have come to oc- cupy a substantial fraction of nearly all eukaryotic genomes, and they rep- resent a major source of genetic variation and novelty. Here we review the defining features of each major group of eukaryotic TEs and explore their evolutionary origins and relationships. We discuss how the unique biology of different TEs influences their propagation and distribution within and across genomes. Environmental and genetic factors acting at the level of the host species further modulate the activity, diversification, and fate of TEs, producing the dramatic variation in TE content observed across eukaryotes. We argue that cataloging TE diversity and dissecting the idiosyncratic be- havior of individual elements are crucial to expanding our comprehension of their impact on the biology of genomes and the evolution of species. 23.1 Review in Advance first posted on , September 21, 2020.
    [Show full text]
  • Evolution of Orthologous Tandemly Arrayed Gene Clusters
    Tremblay Savard et al. BMC Bioinformatics 2011, 12(Suppl 9):S2 http://www.biomedcentral.com/1471-2105/12/S9/S2 PROCEEDINGS Open Access Evolution of orthologous tandemly arrayed gene clusters Olivier Tremblay Savard1*, Denis Bertrand2*, Nadia El-Mabrouk1* From Ninth Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Com- parative Genomics Galway, Ireland. 8-10 October 2011 Abstract Background: Tandemly Arrayed Gene (TAG) clusters are groups of paralogous genes that are found adjacent on a chromosome. TAGs represent an important repertoire of genes in eukaryotes. In addition to tandem duplication events, TAG clusters are affected during their evolution by other mechanisms, such as inversion and deletion events, that affect the order and orientation of genes. The DILTAG algorithm developed in [1] makes it possible to infer a set of optimal evolutionary histories explaining the evolution of a single TAG cluster, from an ancestral single gene, through tandem duplications (simple or multiple, direct or inverted), deletions and inversion events. Results: We present a general methodology, which is an extension of DILTAG, for the study of the evolutionary history of a set of orthologous TAG clusters in multiple species. In addition to the speciation events reflected by the phylogenetic tree of the considered species, the evolutionary events that are taken into account are simple or multiple tandem duplications, direct or inverted, simple or multiple deletions, and inversions. We analysed the performance of our algorithm on simulated data sets and we applied it to the protocadherin gene clusters of human, chimpanzee, mouse and rat. Conclusions: Our results obtained on simulated data sets showed a good performance in inferring the total number and size distribution of duplication events.
    [Show full text]
  • Nematostella Genome
    Sea anemone genome reveals the gene repertoire and genomic organization of the eumetazoan ancestor Nicholas H. Putnam[1], Mansi Srivastava[2], Uffe Hellsten[1], Bill Dirks[2], Jarrod Chapman[1], Asaf Salamov[1], Astrid Terry[1], Harris Shapiro[1], Erika Lindquist[1], Vladimir V. Kapitonov[3], Jerzy Jurka[3], Grigory Genikhovich[4], Igor Grigoriev[1], JGI Sequencing Team[1], Robert E. Steele[5], John Finnerty[6], Ulrich Technau[4], Mark Q. Martindale[7], Daniel S. Rokhsar[1,2] [1] Department of Energy Joint Genome Institute, Walnut Creek, CA 94598 [2] Center for Integrative Genomics and Department of Molecular and Cell Biology, University of California, Berkeley CA 94720 [3] Genetic Information Research Institute, 1925 Landings Drive, Mountain View, CA 94043 [4] Sars International Centre for Marine Molecular Biology, University of Bergen, Thormoeøhlensgt 55; 5008, Bergen, Norway [5] Department of Biological Chemistry and the Developmental Biology Center, University of California, Irvine, CA 92697 [6] Department of Biology, Boston University, Boston, MA 02215 [7] Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI 96813 Abstract Sea anemones are seemingly primitive animals that, along with corals, jellyfish, and hydras, constitute the Cnidaria, the oldest eumetazoan phylum. Here we report a comparative analysis of the draft genome of an emerging cnidarian model, the starlet anemone Nematostella vectensis. The anemone genome is surprisingly complex, with a gene repertoire, exon-intron structure, and large-scale gene linkage more similar to vertebrates than to flies or nematodes. These results imply that the genome of the eumetazoan ancestor was similarly complex, and that fly and nematode genomes have been modified via sequence divergence, gene and intron loss, and genomic rearrangement.
    [Show full text]
  • Partial Retrotransposon-Like DNA Sequence in the Genomic Clone of Aspergillus flavus, Paf28
    Mycol. Res. 107 (7): 841–846 (July 2003). f The British Mycological Society 841 DOI: 10.1017/S0953756203008116 Printed in the United Kingdom. Partial retrotransposon-like DNA sequence in the genomic clone of Aspergillus flavus, pAF28 1 1 1 2 Patricia A. OKUBARA #, Brian K. TIBBOT , Alice S. TARUN , Cesaria E. MCALPIN and Sui-Sheng T. HUA1* 1 USDA, Agricultural Research Service, Western Regional Research Center, 800 Buchanan Street, Albany, CA 94710, USA. 2 USDA, Agricultural Research Service, National Center for Agricultural Utilization Research, Peoria, IL, USA. E-mail : [email protected] Received 18 February 2003; accepted 21 May 2003. A genomic clone of the aflatoxin-producing fungus Aspergillus flavus, designated pAF28, has been used as a probe for Southern blot fingerprinting of fungal strains. A large number of A. flavus strains isolated from corn fields and tree-nut orchards can be distinguished because the DNA fingerprint patterns are highly polymorphic. We have completed the sequencing of a 6355 bp insert in pAF28. The sequence features motifs and open reading frames characteristic of transposable elements of the gypsy class. We have named this new element AfRTL-1, for A. flavus retrotransposon-like DNA. INTRODUCTION characterized vegetative compatibility groups (Papa 1986, McAlpin & Mannarelli 1995, McAlpin et al. Aspergillus flavus is the most common aflatoxin-pro- 2002) indicates its further utility. In this study, we ducing species on corn, cotton, peanuts and tree nuts demonstrate that the genomic insert of pAF28 carries (Diener et al. 1987). Aflatoxin is a potent hepatoxin and a partial retrotransposon-like element homologous to carcinogen that poses a serious food safety hazard to the gypsy-class retrotransposon MAGGY, originally both humans and animals.
    [Show full text]
  • The Evolutionary Life History of P Transposons: from Horizontal Invaders to Domesticated Neogenes
    Chromosoma (2001) 110:148–158 DOI 10.1007/s004120100144 CHROMOSOMA FOCUS Wilhelm Pinsker · Elisabeth Haring Sylvia Hagemann · Wolfgang J. Miller The evolutionary life history of P transposons: from horizontal invaders to domesticated neogenes Received: 5 February 2001 / In revised form: 15 March 2001 / Accepted: 15 March 2001 / Published online: 3 May 2001 © Springer-Verlag 2001 Abstract P elements, a family of DNA transposons, are uct of their self-propagating lifestyle. One of the most known as aggressive intruders into the hitherto uninfected intensively studied examples is the P element of Dro- gene pool of Drosophila melanogaster. Invading through sophila, a family of DNA transposons that has proved horizontal transmission from an external source they useful not only as a genetic tool (e.g., transposon tag- managed to spread rapidly through natural populations ging, germline transformation vector), but also as a model within a few decades. Owing to their propensity for rapid system for investigating general features of the evolu- propagation within genomes as well as within popula- tionary behavior of mobile DNA (Kidwell 1994). P ele- tions, they are considered as the classic example of self- ments were first discovered as the causative agent of hy- ish DNA, causing havoc in a genomic environment per- brid dysgenesis in Drosophila melanogaster (Kidwell et missive for transpositional activity. Tracing the fate of P al. 1977) and were later characterized as a family of transposons on an evolutionary scale we describe differ- DNA transposons
    [Show full text]