Retroviral Long Terminal Repeats; Structure, Detection and Phylogeny

Total Page:16

File Type:pdf, Size:1020Kb

Retroviral Long Terminal Repeats; Structure, Detection and Phylogeny !"#$ %&#$&'( !$)*)+,) !"$-./*-*0*..0,*, 12111 *+,,+/ ! " # $%&& &'(&&) * ) )+* *," )- ./0* 1 2 */ 3 * * "/%&&/4 0 4 56 +* / # / 7/%$ / /8639'!:': ;:;&:&/ < ,<04. : ) * : )<04 /0* )<04 * */ - ,--. 1 ) 1**/0* )*1 1 <04 ) <04 --/ + 8 ** )-- )) )<04 ) ,=4>. ,24>./ 0* )--1 1) **) * ) <04/ 0* * -- ) ) 1** 1 ) )=4>/0* ) )* )) ) <041 * /0*1 * * 0? @# *#0:* *1** 1*0#0# ##0### / 8 + 88 * )* ) <041 --: /0*<041 1* * : 1 <04 * / 8 + 888** + 8 881 <04) * * )* ) 1 <04/" *) <04 * * /0*1 1 *<04 * )) )*+ /0* <04 ) 81) <04/+8 ) % ** 0?A@#/ + 8>1 * )B ) * )24>/24>1 ) * * * )* /C**) ) 24>/# 24> =4> 1 ) * * 1* 1 ) 24>B / + > 1 ) ) ) ) 24> ) ) ) D /8 ) * )* 24> * / 8 --1 ) <04/+ 1 * * /0** ** )) ) : 9#B / 4 * - * ): ! "# #$# %&'()*( E" 3 * * %&& 8669$ :$%&$ 8639'!:': ;:;&:& ( ((( :%&&%!,* (AA //A F G ( ((( :%&&%!. To my family List of papers I Benachenhou, F., Jern, P., Oja, M., Sperber, G., Blikstad, V., Somervuo, P., Kaski, S. and Blomberg, J. Evolutionary conser- vation of orthoretroviral long terminal repeats (LTRs) and ab initio detection of single LTRs in genomic data. PLoS ONE 4 (2009b), p. e5179 II Benachenhou, F., Blikstad, V. and Blomberg, J. The phylogeny of orthoretroviral long terminal repeats (LTRs). Gene 448 (2009a), pp. 134-8. III Benachenhou, F., Sperber, G. and Blomberg, J. Universal struc- ture and phylogeny of Long Terminal Repeats (LTRs). Manu- script. IV Blomberg, J., Benachenhou, F., Blikstad, V., Sperber, G. and Mayer, J. Classification and nomenclature of endogenous retro- viral sequences (ERVs): problems and recommendations. Gene 448 (2009), pp. 115-23. V Blikstad, V., Benachenhou, F., Sperber, G.O. and Blomberg, J. Evolution of human endogenous retroviral sequences: a concep- tual account. Cell Mol Life Sci 65 (2008), pp. 3348-65. Contents Introduction.....................................................................................................1 Retroviruses and LTR retrotransposons.....................................................1 Background............................................................................................1 XRVs and ERVs....................................................................................3 Relation to other genomic elements ......................................................4 LTRs ......................................................................................................5 Hidden Markov models..............................................................................7 Why HMMs? .........................................................................................7 Generalities............................................................................................7 Building HMMs for LTRs.....................................................................8 Overfitting and regularisation................................................................8 Model surgery........................................................................................8 Scoring...................................................................................................9 Evaluation..............................................................................................9 SuperViterbi...........................................................................................9 Summary of papers .......................................................................................10 Paper I ......................................................................................................10 Paper II .....................................................................................................13 Paper III....................................................................................................17 Paper IV ...................................................................................................19 Paper V.....................................................................................................19 Conclusions and future prospects .................................................................21 Acknowledgements.......................................................................................23 References.....................................................................................................24 Abbreviations env Envelope gene ERV Endogenous retrovirus gag Group specific antigen gene HERV Human endogenous retrovirus HML Human MMTV-like retrovirus HMM Hidden Markov model HTLV Human T-cell leukemia virus IN Integrase domain JSRV Jaagsiekte sheep retrovirus LTR Long terminal repeats MAP Maximum a posteriori ML Maximum likelihood MLV Mouse leukemia virus MMTV Mouse mammary tumor virus ORF Open reading frame PAS Polyadenylation site PBS Primer binding site PLE Penelope like elements pol Polymerase gene PPT Polypurine tract pro Protease gene pSIVgml Grey mouse lemur prosimian immu- nodeficiency virus R Repeat region (in LTR) RELIK Rabbit endogenous lentivirus type K RH RNAse H domain RT Reverse transcriptase domain TSD Target site duplication TSS Transcription start site U3 Unique 3’ region U5 Unique 5’ region XRV Exogenous retrovirus YR Tyrosine recombinase Introduction The present work is devoted to the study of a specific part of retroviruses and related retrotransposons, the long terminal repeats (LTRs). The study of LTRs is important for two reasons. First, there are very few systematic investigations of LTRs published in the literature. This is partly because of the scarcity of conserved motifs and partly because LTRs do not code for proteins and as a result standard alignment techniques such as ClustalW cannot be used. Second, LTRs are ubiquitous components of many eukaryote genomes. In the human genome for example, seven to eight per- cent is of retroviral origin (Lander et al., 2001). The aim of the present work is threefold. i) To create reliable alignments of LTRs. ii) To study the evolutionary relationships between LTR retrotrans- posons using the alignments. iii) To detect LTRs in genomic DNA. This was done by utilising the mathematical tool hidden Markov models (HMMs). Retroviruses and LTR retrotransposons Background Retroviruses are a family of RNA viruses which infect vertebrates (Coffin et al., 1997; Mager and Medstrand, 2003). Their genome consists of two iden- tical positive-strand RNAs. One of their characteristic features is their ability to make a DNA copy of their RNA genome, reverse transcription. Retroviruses integrate into the host genome as part of their replication. This property, although unique among viruses, is shared with other retroelements, the LTR retrotransposons (Boeke and Stote, 1997; Eickbush and Jamburut- hugoda, 2008). It is remarkable that the Copia/Ty1 retrotransposons (Flavell et al., 1997; Peterson-Burch and Voytas, 2002), found in fungi, plants, invertebrates and some vertebrates, have almost the same genetic structure as retroviruses. The only difference is that they generally lack the env-gene and therefore cannot leave the cell. They are however capable of reintegrating into the cellular genome and make multiple copies of their own genome. However, one line- 1 age of Copia/Ty1, referred to as Agroviruses or Sireviruses (Peterson-Burch and Voytas, 2002; Llorens et al., 2009), has an env-like gene. The Gypsy/Ty3 elements (Kordis, 2005; Llorens et al., 2008) found in fungi, plants, invertebrates and some vertebrates, are even more similar to retrovi- ruses. Like the Copia/Ty1 group they are generally non-infectious transpos- able elements except for the errantividae of insects which have infectious properties (Terzian et al., 2001) thanks to an env-like gene. The remaining group of LTR-containing retrotransposons is the Bel/Pao group in metazoans (Boeke and Stote, 1997). Retroviruses and other env-containing retroelements can occasionally infect the germ line and be transmitted in a Mendelian fashion thereby becoming endogenous retroviruses (ERVs, HERVs in the human genome). The genome of LTR retrotransposons comprises four protein-coding genes (Coffin et al., 1997): The gag-, the pro-, the pol- and the env-gene. Pol is the best conserved gene and encodes the reverse transcriptase (RT), the RNAse H (RH) and integrase (IN) proteins. The env-encoded proteins allow for the fusion between retrotransposon and cell, thus conveying infectivity. In the genomic, full length, RNA, the four coding genes are flanked by two non-coding sequences. Both of them end in an identical region, the repeat region R, but they also contain two unique regions adjacent to R: The U5 region at the 5’ end and the U3 region at the 3’ end. After reverse transcrip- tion the flanking non-coding sequences become identical and are called LTRs. They consist of U3 followed by R and U5. Downstream of the 5’ LTR is the primer binding site (PBS) that pairs to the host t-RNA priming the synthesis of the first DNA-strand in the reverse transcription reaction. The 3’ LTR is preceded by the polypurine tract (PPT) which primes the syn- thesis of the second DNA-strand. ERVs are often present as solo LTRs, lacking the internal genes. They were formed through homologous recombination between the two LTRs, and elimination
Recommended publications
  • Interfering with Retrotransposition by Two Types of CRISPR Effectors
    Zhang et al. Cell Discovery (2020) 6:30 Cell Discovery https://doi.org/10.1038/s41421-020-0164-0 www.nature.com/celldisc ARTICLE Open Access Interfering with retrotransposition by two types of CRISPR effectors: Cas12a and Cas13a Niubing Zhang1,2, Xinyun Jing1, Yuanhua Liu3, Minjie Chen1,2, Xianfeng Zhu2, Jing Jiang2, Hongbing Wang4, Xuan Li1 and Pei Hao3 Abstract CRISPRs are a promising tool being explored in combating exogenous retroviral pathogens and in disabling endogenous retroviruses for organ transplantation. The Cas12a and Cas13a systems offer novel mechanisms of CRISPR actions that have not been evaluated for retrovirus interference. Particularly, a latest study revealed that the activated Cas13a provided bacterial hosts with a “passive protection” mechanism to defend against DNA phage infection by inducing cell growth arrest in infected cells, which is especially significant as it endows Cas13a, a RNA-targeting CRISPR effector, with mount defense against both RNA and DNA invaders. Here, by refitting long terminal repeat retrotransposon Tf1 as a model system, which shares common features with retrovirus regarding their replication mechanism and life cycle, we repurposed CRISPR-Cas12a and -Cas13a to interfere with Tf1 retrotransposition, and evaluated their different mechanisms of action. Cas12a exhibited strong inhibition on retrotransposition, allowing marginal Tf1 transposition that was likely the result of a lasting pool of Tf1 RNA/cDNA intermediates protected within virus-like particles. The residual activities, however, were completely eliminated with new constructs for persistent crRNA targeting. On the other hand, targeting Cas13a to Tf1 RNA intermediates significantly inhibited Tf1 retrotransposition. However, unlike in bacterial hosts, the sustained activation of Cas13a by Tf1 transcripts did not cause cell growth arrest in S.
    [Show full text]
  • Retrovirus-Like Elements in Plants
    Research Signpost 37/661 (2), Fort P.O., Trivandrum-695 023, Kerala, India Recent Res. Devel. Plant Sci., 3(2005): ISBN: 81-7736-245-3 Retrovirus-like elements in plants Antonio Marco and Ignacio Marín Departamento de Genética, Universidad de Valencia, Calle Doctor Moliner, 50 46100 Burjassot (Valencia), Spain Abstract LTR retrotransposons with structures that are identical to those found in simple vertebrate retroviruses, including a putative env gene, have been discovered in plants. Those potential plant retroviruses can be classified into two classes. The first one is formed by the Arabidopsis thaliana Athila elements and many other closely related env- containing elements. All of them belong to the Ty3/Gypsy group of LTR retrotransposons. The second class, in which the best-known element was first found in soybean and called SIRE1, belongs to the Ty1/Copia group. Thus, two distantly related lineages have convergent features that suggest that the transition between intracellular and infective ways of life may have occurred several times independently. Correspondence/Reprint request: Dr. Ignacio Marín, Departamento de Genética, Universidad de Valencia Calle Doctor Moliner, 50, 46100 Burjassot (Valencia), Spain. E-mail: [email protected] 2 Antonio Marco & Ignacio Marín Introduction First discovered in maize by Barbara McClintock and later recognized in all types of organisms, transposable elements are ubiquitous components of plant genomes. Eukaryotic transposable elements are generally divided into two classes. Class II elements or DNA transposons do not require an intermediate RNA. Class I elements however, require an intermediate RNA and cannot complete its replicative cycle without the use of the enzyme reverse transcriptase (RT), able to generate DNA molecules from RNA templates.
    [Show full text]
  • A Species-Specific Retrotransposon Drives a Conserved Cdk2ap1 Isoform Essential for Preimplantation Development
    bioRxiv preprint doi: https://doi.org/10.1101/2021.03.24.436683; this version posted March 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Title: A species-specific retrotransposon drives a conserved Cdk2ap1 isoform essential for preimplantation development Authors: Andrew Modzelewski1†, Wanqing Shao2†, Jingqi Chen1, Angus Lee1, Xin Qi1, Mackenzie Noon1, Kristy Tjokro1, Gabriele Sales3, Anne Biton4, Terence Speed5, Zhenyu Xuan6, Ting Wang2#, Davide Risso3# and Lin He1# Affiliations: 1 Division of Cellular and Developmental Biology, MCB department, University of California at Berkeley, Berkeley, CA, USA. 2 Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA 3 Department of Statistical Sciences, University of Padova, Italy. 4 Division of Biostatistics, University of California at Berkeley, Berkeley, CA 94720, USA. 5 Bioinformatics Division, WEHI, Parkville, VIC 3052, Australia. 6 Department of Biological Sciences, The University of Texas at Dallas, Richardson Texas, USA # Correspondence to: [email protected], [email protected] and [email protected] †These authors contributed equally. Abstract: Retrotransposons mediate gene regulation in multiple developmental and pathological processes. Here, we characterized the transient retrotransposon induction in preimplantation development of eight mammalian species. While species-specific in sequences, induced retrotransposons exhibit a similar preimplantation profile, conferring gene regulatory activities particularly through LTR retrotransposon promoters. We investigated a mouse-specific MT2B2 retrotransposon promoter, which generates an N-terminally truncated, preimplantation-specific Cdk2ap1ΔN isoform to promote cell proliferation. Cdk2ap1ΔN functionally contrasts to the canonical Cdk2ap1, which represses cell proliferation and peaks in mid-gestation stage.
    [Show full text]
  • LINE1 Inhibition (With Nucleoside Reverse Transcriptase Inhibitors)
    Cognitive Vitality Reports® are reports written by neuroscientists at the Alzheimer’s Drug Discovery Foundation (ADDF). These scientific reports include analysis of drugs, drugs-in- development, drug targets, supplements, nutraceuticals, food/drink, non-pharmacologic interventions, and risk factors. Neuroscientists evaluate the potential benefit (or harm) for brain health, as well as for age-related health concerns that can affect brain health (e.g., cardiovascular diseases, cancers, diabetes/metabolic syndrome). In addition, these reports include evaluation of safety data, from clinical trials if available, and from preclinical models. LINE1 inhibition (with nucleoside reverse transcriptase inhibitors) Evidence Summary L1 activation may be associated with age-related diseases, and nucleoside reverse transcriptase inhibitors (NRTIs) suppress L1 activation and may be beneficial in age-related diseases. Neuroprotective Benefit: Increased retrotransposition of transposable elements is implicated in neurodegenerative diseases, so NRTIs could theoretically reverse these actions, but only preclinical evidence exists to date. Aging and related health concerns: Preclinical studies suggest that NRTIs may be an effective anti-aging therapy, and clinical studies suggest they may be effective for cancer. Safety: NRTIs are associated with some mild side effects and rare severe, possibly life- threatening, side effects. 1 Availability: Available with Dose: Depends on the Molecular Formula: N/A prescription (as generics) NRTI Molecular weight: N/A Half-life: Depends on the BBB: Possibly penetrant NRTI in animal models Clinical trials: Many for HIV, Observational studies: one for ALS Many What is it? Retrotransposons are stretches of DNA that can replicate and reinsert into other parts of the genome. Their presence in the DNA comes from retroviral infection of germline cells throughout evolutionary history.
    [Show full text]
  • 2007Murciaphd.Pdf
    https://theses.gla.ac.uk/ Theses Digitisation: https://www.gla.ac.uk/myglasgow/research/enlighten/theses/digitisation/ This is a digitised version of the original print thesis. Copyright and moral rights for this work are retained by the author A copy can be downloaded for personal non-commercial research or study, without prior permission or charge This work cannot be reproduced or quoted extensively from without first obtaining permission in writing from the author The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the author When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given Enlighten: Theses https://theses.gla.ac.uk/ [email protected] LATE RESTRICTION INDUCED BY AN ENDOGENOUS RETROVIRUS Pablo Ramiro Murcia August 2007 Thesis presented to the School of Veterinary Medicine at the University of Glasgow for the degree of Doctor of Philosophy Institute of Comparative Medicine 464 Bearsden Road Glasgow G61 IQH ©Pablo Murcia ProQuest Number: 10390741 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. uest ProQuest 10390741 Published by ProQuest LLO (2017). Copyright of the Dissertation is held by the Author. All rights reserved. This work is protected against unauthorized copying under Title 17, United States Code Microform Edition © ProQuest LLO.
    [Show full text]
  • Multiple Non-LTR Retrotransposons in the Genome of Arabidqpsis Thulium
    Copyright 0 1996 by the Genetics Society of America Multiple Non-LTR Retrotransposons in the Genome of ArabidqPsis thulium David A. Wright,* Ning Ke," Jan Smalle,t" Brian M. Hauge,t'2Howard M. Goodmant and Daniel F. Voytas* *Department of Zoology and Genetics, Iowa State University, Ames, Iowa 50011 and tDepartment of Genetics, Haward Medical School and Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 021 14 Manuscript received June 13, 1995 Accepted for publication October 14, 1995 ABSTRACT DNA sequence analysis near the Arabidopsis thaliana AB13 gene revealed the presence of a non-LTR retrotransposon insertion thatwe have designated Tal 1-1.This insertion is 6.2 kb in length and encodes two overlapping reading frames with similarity to non-LTR retrotransposon proteins, including reverse transcriptase. A polymerase chain reaction assaywas developed based on conserved amino acid sequences shared between the Tall-1 reverse transcriptase and those of non-LTR retrotransposons from other species. Seventeen additionalA. thaliana reverse transcriptases were identified that range in nucleotide similarity from 4848% (Ta12-Ta28). Phylogenetic analyses indicated that the A. thaliana sequences are more closely related to each other than to elements from other organisms, consistent with the vertical evolution of these sequences over mostof their evolutionary history. One sequence, Ta17,is located in the mitochondrial genome. The remaining are nuclear andof low copy number among 17 diverse A. thaliana ecotypes tested, suggesting that they are not highly active in transposition. The paucity of retrotransposons and the small genome size of A. thaliana support the hypothesis that most repetitive sequences have been lost from the genome and that mechanisms may exist to prevent amplification of extant element families.
    [Show full text]
  • Sap1 Dependent Replication Fork Barriers Guide Integration of Ltr Retrotransposons in S
    SAP1 DEPENDENT REPLICATION FORK BARRIERS GUIDE INTEGRATION OF LTR RETROTRANSPOSONS IN S. POMBE By! JAKE ZACHARY JACOBS A Dissertation submitted to the! Graduate School-New Brunswick! Rutgers, The State University of New Jersey !In partial fulfillment of the requirements! For the degree of !Doctor of Philosophy !Graduate Program in Biochemistry !Written under the direction of Dr. Mikel Zaratiegui, Ph.D.! And approved by _____________________________________________________ _____________________________________________________ _____________________________________________________ _____________________________________________________! New Brunswick, New Jersey! October 2016 ABSTRACT OF THE DISSERTATION SAP1 DEPENDENT REPLICATION FORK BARRIERS GUIDE INTEGRATION OF LTR RETROTRANSPOSONS IN S. POMBE By! JAKE ZACHARY JACOBS Dissertation Director: Dr. Mikel Zaratiegui Long Terminal Repeat retrotransposable elements (LTR-TEs) are a large group of eukaryotic Transposable Elements characterized by flanking repeats in tandem orientation—the LTRs. The LTRs of these elements contain sequences that recruit proteins involved in their expression, replication, silencing, organization, and stability. A successful transposable element must maximize its reproductive amplification without jeopardizing its host, and several characterized LTR-TEs appear to accomplish this through the selection of integration sites away from protein coding sequences. However, despite their high relatedness, a universal mechanism that explains how these parasitic elements avoid
    [Show full text]
  • Genome-Wide Mapping and Characterization of Microsatellites In
    www.nature.com/scientificreports OPEN Genome-wide mapping and characterization of microsatellites in the swamp eel genome Received: 14 February 2017 Zhigang Li, Feng Chen, Chunhua Huang, Weixin Zheng, Chunlai Yu, Hanhua Cheng & Accepted: 26 April 2017 Rongjia Zhou Published: xx xx xxxx We described genome-wide screening and characterization of microsatellites in the swamp eel genome. A total of 99,293 microsatellite loci were identified in the genome with an overall density of 179 microsatellites per megabase of genomic sequences. The dinucleotide microsatellites were the most abundant type representing 71% of the total microsatellite loci and the AC-rich motifs were the most recurrent in all repeat types. Microsatellite frequency decreased as numbers of repeat units increased, which was more obvious in long than short microsatellite motifs. Most of microsatellites were located in non-coding regions, whereas only approximately 1% of the microsatellites were detected in coding regions. Trinucleotide repeats were most abundant microsatellites in the coding regions, which represented amino acid repeats in proteins. There was a chromosome-biased distribution of microsatellites in non-coding regions, with the highest density of 203.95/Mb on chromosome 8 and the least on chromosome 7 (164.06/Mb). The most abundant dinucleotides (AC)n was mainly located on chromosome 8. Notably, genomic mapping showed that there was a chromosome-biased association of genomic distributions between microsatellites and transposon elements. Thus, the novel dataset of microsatellites in swamp eel provides a valuable resource for further studies on QTL-based selection breeding, genetic resource conservation and evolutionary genetics. Swamp eel (Monopterus albus) taxonomically belongs to teleosts, the family Synbranchidae of the order Synbranchiformes (Neoteleostei, Teleostei, and Vertebrata).
    [Show full text]
  • Genomic Neighborhoods for Arabidopsisretrotransposons: a Role for Targeted Integration in the Distribution of the Metaviridae Brooke D
    Statistics Publications Statistics 2004 Genomic neighborhoods for Arabidopsisretrotransposons: a role for targeted integration in the distribution of the Metaviridae Brooke D. Peterson-Burch U.S. Department of Agriculture Dan Nettleton Iowa State University, [email protected] Daniel F. Voytas Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/stat_las_pubs Part of the Applied Statistics Commons, Cell and Developmental Biology Commons, Genetics and Genomics Commons, and the Statistical Models Commons The ompc lete bibliographic information for this item can be found at https://lib.dr.iastate.edu/ stat_las_pubs/228. For information on how to cite this item, please visit http://lib.dr.iastate.edu/ howtocite.html. This Article is brought to you for free and open access by the Statistics at Iowa State University Digital Repository. It has been accepted for inclusion in Statistics Publications by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Genomic neighborhoods for Arabidopsisretrotransposons: a role for targeted integration in the distribution of the Metaviridae Abstract Background: Retrotransposons are an abundant component of eukaryotic genomes. The high quality of the Arabidopsis thaliana genome sequence makes it possible to comprehensively characterize retroelement populations and explore factors that contribute to their genomic distribution. Results: We identified the full complement of A. thaliana long terminal repeat (LTR) retroelements using RetroMap, a software tool that iteratively searches genome sequences for reverse transcriptases and then defines retroelement insertions. Relative ages of full-length elements were estimated by assessing sequence divergence between LTRs: the Pseudoviridae were significantly younger than the Metaviridae.
    [Show full text]
  • Virus World As an Evolutionary Network of Viruses and Capsidless Selfish Elements
    Virus World as an Evolutionary Network of Viruses and Capsidless Selfish Elements Koonin, E. V., & Dolja, V. V. (2014). Virus World as an Evolutionary Network of Viruses and Capsidless Selfish Elements. Microbiology and Molecular Biology Reviews, 78(2), 278-303. doi:10.1128/MMBR.00049-13 10.1128/MMBR.00049-13 American Society for Microbiology Version of Record http://cdss.library.oregonstate.edu/sa-termsofuse Virus World as an Evolutionary Network of Viruses and Capsidless Selfish Elements Eugene V. Koonin,a Valerian V. Doljab National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, USAa; Department of Botany and Plant Pathology and Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon, USAb Downloaded from SUMMARY ..................................................................................................................................................278 INTRODUCTION ............................................................................................................................................278 PREVALENCE OF REPLICATION SYSTEM COMPONENTS COMPARED TO CAPSID PROTEINS AMONG VIRUS HALLMARK GENES.......................279 CLASSIFICATION OF VIRUSES BY REPLICATION-EXPRESSION STRATEGY: TYPICAL VIRUSES AND CAPSIDLESS FORMS ................................279 EVOLUTIONARY RELATIONSHIPS BETWEEN VIRUSES AND CAPSIDLESS VIRUS-LIKE GENETIC ELEMENTS ..............................................280 Capsidless Derivatives of Positive-Strand RNA Viruses....................................................................................................280
    [Show full text]
  • Downloads/Repeatmaskedgenomes
    Kojima Mobile DNA (2018) 9:2 DOI 10.1186/s13100-017-0107-y REVIEW Open Access Human transposable elements in Repbase: genomic footprints from fish to humans Kenji K. Kojima1,2 Abstract Repbase is a comprehensive database of eukaryotic transposable elements (TEs) and repeat sequences, containing over 1300 human repeat sequences. Recent analyses of these repeat sequences have accumulated evidences for their contribution to human evolution through becoming functional elements, such as protein-coding regions or binding sites of transcriptional regulators. However, resolving the origins of repeat sequences is a challenge, due to their age, divergence, and degradation. Ancient repeats have been continuously classified as TEs by finding similar TEs from other organisms. Here, the most comprehensive picture of human repeat sequences is presented. The human genome contains traces of 10 clades (L1, CR1, L2, Crack, RTE, RTEX, R4, Vingi, Tx1 and Penelope) of non-long terminal repeat (non-LTR) retrotransposons (long interspersed elements, LINEs), 3 types (SINE1/7SL, SINE2/tRNA, and SINE3/5S) of short interspersed elements (SINEs), 1 composite retrotransposon (SVA) family, 5 classes (ERV1, ERV2, ERV3, Gypsy and DIRS) of LTR retrotransposons, and 12 superfamilies (Crypton, Ginger1, Harbinger, hAT, Helitron, Kolobok, Mariner, Merlin, MuDR, P, piggyBac and Transib) of DNA transposons. These TE footprints demonstrate an evolutionary continuum of the human genome. Keywords: Human repeat, Transposable elements, Repbase, Non-LTR retrotransposons, LTR retrotransposons, DNA transposons, SINE, Crypton, MER, UCON Background contrast, MER4 was revealed to be comprised of LTRs of Repbase and conserved noncoding elements endogenous retroviruses (ERVs) [1]. Right now, Repbase Repbase is now one of the most comprehensive data- keeps MER1 to MER136, some of which are further bases of eukaryotic transposable elements and repeats divided into several subfamilies.
    [Show full text]
  • Drosophila: Retrotransposons Making up Telomeres
    Review Drosophila: Retrotransposons Making up Telomeres Elena Casacuberta Institute of Evolutionary Biology, IBE, CSIC—Pompeu Fabra University, Barcelona Spain, Passeig de la Barceloneta 37‐49, 08003 Barcelona, Spain; [email protected]‐csic.es; Tel.: +34‐932309637; Fax: +34‐932309555 Guest Editors: Paul M. Liebermann and Benedikt B. Kaufer Received: 9 June 2017; Accepted: 17 July 2017; Published: 19 July 2017 Abstract: Drosophila and extant species are the best‐studied telomerase exception. In this organism, telomere elongation is coupled with targeted retrotransposition of Healing Transposon (HeT‐A) and Telomere Associated Retrotransposon (TART) with sporadic additions of Telomere Associated and HeT‐A Related (TAHRE), all three specialized non‐Long Terminal Repeat (non‐LTR) retrotransposons. These three very special retroelements transpose in head to tail arrays, always in the same orientation at the end of the chromosomes but never in interior locations. Apparently, retrotransposon and telomerase telomeres might seem very different, but a detailed view of their mechanisms reveals similarities explaining how the loss of telomerase in a Drosophila ancestor could successfully have been replaced by the telomere retrotransposons. In this review, we will discover that although HeT‐A, TART, and TAHRE are still the only examples to date where their targeted transposition is perfectly tamed into the telomere biology of Drosophila, there are other examples of retrotransposons that manage to successfully integrate inside and at the end of telomeres. Because the aim of this special issue is viral integration at telomeres, understanding the base of the telomerase exceptions will help to obtain clues on similar strategies that mobile elements and viruses could have acquired in order to ensure their survival in the host genome.
    [Show full text]