"Evolutionary Emergence of Genes Through Retrotransposition"
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Mechanisms and Impact of Post-Transcriptional Exon Shuffling
Mechanisms and impact of Post-Transcriptional Exon Shuffling (PTES) Ginikachukwu Osagie Izuogu Doctor of Philosophy Institute of Genetic Medicine Newcastle University September 2016 i ABSTRACT Most eukaryotic genes undergo splicing to remove introns and join exons sequentially to produce protein-coding or non-coding transcripts. Post-transcriptional Exon Shuffling (PTES) describes a new class of RNA molecules, characterized by exon order different from the underlying genomic context. PTES can result in linear and circular RNA (circRNA) molecules and enhance the complexity of transcriptomes. Prior to my studies, I developed PTESFinder, a computational tool for PTES identification from high-throughput RNAseq data. As various sources of artefacts (including pseudogenes, template-switching and others) can confound PTES identification, I first assessed the effectiveness of filters within PTESFinder devised to systematically exclude artefacts. When compared to 4 published methods, PTESFinder achieves the highest specificity (~0.99) and comparable sensitivity (~0.85). To define sub-cellular distribution of PTES, I performed in silico analyses of data from various cellular compartments and revealed diverse populations of PTES in nuclei and enrichment in cytosol of various cell lines. Identification of PTES from chromatin-associated RNAseq data and an assessment of co-transcriptional splicing, established that PTES may occur during transcription. To assess if PTES contribute to the proteome, I analyzed sucrose-gradient fractionated data from HEK293, treated with arsenite to induce translational arrest and dislodge ribosomes. My results showed no effect of arsenite treatment on ribosome occupancy within PTES transcripts, indicating that these transcripts are not generally bound by polysomes and do not contribute to the proteome. -
Evolution of Pogo, a Separate Superfamily of IS630-Tc1-Mariner
Gao et al. Mobile DNA (2020) 11:25 https://doi.org/10.1186/s13100-020-00220-0 RESEARCH Open Access Evolution of pogo, a separate superfamily of IS630-Tc1-mariner transposons, revealing recurrent domestication events in vertebrates Bo Gao, Yali Wang, Mohamed Diaby, Wencheng Zong, Dan Shen, Saisai Wang, Cai Chen, Xiaoyan Wang and Chengyi Song* Abstracts Background: Tc1/mariner and Zator, as two superfamilies of IS630-Tc1-mariner (ITm) group, have been well-defined. However, the molecular evolution and domestication of pogo transposons, once designated as an important family of the Tc1/mariner superfamily, are still poorly understood. Results: Here, phylogenetic analysis show that pogo transposases, together with Tc1/mariner,DD34E/Gambol,and Zator transposases form four distinct monophyletic clades with high bootstrap supports (> = 74%), suggesting that they are separate superfamilies of ITm group. The pogo superfamily represents high diversity with six distinct families (Passer, Tigger, pogoR, Lemi, Mover,andFot/Fot-like) and wide distribution with an expansion spanning across all the kingdoms of eukaryotes. It shows widespread occurrences in animals and fungi, but restricted taxonomic distribution in land plants. It has invaded almost all lineages of animals—even mammals—and has been domesticated repeatedly in vertebrates, with 12 genes, including centromere-associated protein B (CENPB), CENPB DNA-binding domain containing 1 (CENPBD1), Jrk helix–turn–helix protein (JRK), JRK like (JRKL), pogo transposable element derived with KRAB domain (POGK), and with ZNF domain (POGZ), and Tigger transposable element-derived 2 to 7 (TIGD2–7), deduced as originating from this superfamily. Two of them (JRKL and TIGD2) seem to have been co-domesticated, and the others represent independent domestication events. -
Retrotransposon Long Interspersed Nucleotide Element1 (LINE1) Is
The Japanese Society of Developmental Biologists Develop. Growth Differ. (2012) 54, 673–685 doi: 10.1111/j.1440-169X.2012.01368.x Original Article Retrotransposon long interspersed nucleotide element-1 (LINE-1) is activated during salamander limb regeneration Wei Zhu,1 Dwight Kuo,2 Jason Nathanson,3 Akira Satoh,4,5 Gerald M. Pao,1 Gene W. Yeo,3 Susan V. Bryant,5 S. Randal Voss,6 David M. Gardiner5*and Tony Hunter1* 1Molecular and Cell Biology Laboratory and Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, California 92037, Departments of 2Bioengineering and 3Cellular and Molecular Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA; 4Okayama University, R.C.I.S. Okayama-city, Okayama, 700-8530, Japan; 5Department of Developmental and Cell Biology, University of California at Irvine, Irvine, California 92697, and 6Department of Biology and Spinal Cord and Brain Injury Research Center, University of Kentucky, Lexington, Kentucky 40506, USA Salamanders possess an extraordinary capacity for tissue and organ regeneration when compared to mam- mals. In our effort to characterize the unique transcriptional fingerprint emerging during the early phase of sala- mander limb regeneration, we identified transcriptional activation of some germline-specific genes within the Mexican axolotl (Ambystoma mexicanum) that is indicative of cellular reprogramming of differentiated cells into a germline-like state. In this work, we focus on one of these genes, the long interspersed nucleotide element-1 (LINE-1) retrotransposon, which is usually active in germ cells and silent in most of the somatic tissues in other organisms. LINE-1 was found to be dramatically upregulated during regeneration. -
B2 and ALU Retrotransposons Are Self-Cleaving Ribozymes Whose Activity Is Enhanced by EZH2
B2 and ALU retrotransposons are self-cleaving ribozymes whose activity is enhanced by EZH2 Alfredo J. Hernandeza,1, Athanasios Zovoilisb,c,1,2, Catherine Cifuentes-Rojasb,c,1,3, Lu Hanb,c, Bojan Bujisicb,c, and Jeannie T. Leeb,c,4 aDepartment of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115; bDepartment of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114; and cDepartment of Genetics, The Blavatnik Institute, Harvard Medical School, Boston, MA 02114 Contributed by Jeannie T. Lee, November 18, 2019 (sent for review October 9, 2019; reviewed by Vivian G. Cheung and Karissa Y. Sanbonmatsu) Transposable elements make up half of the mammalian genome. B2s are present in ∼350,000 copies in the mouse genome (10) One of the most abundant is the short interspersed nuclear and give rise to 180- to 200-nucleotide (nt) polymerase III complex element (SINE). Among their million copies, B2 accounts for (POL-III) transcripts. B2 SINEs are transcriptionally active, with ∼350,000 in the mouse genome and has garnered special interest expression being highest in germ cells and in early development, because of emerging roles in epigenetic regulation. Our recent work after which they become epigenetically silenced. However, one of demonstrated that B2 RNA binds stress genes to retard transcription the most intriguing aspects of SINE biology is that these elements elongation. Although epigenetically silenced, B2s become massively become highly up-regulated when somatic cells are acutely stressed up-regulated during thermal and other types of stress. Specifically, (11–13), including during viral infection (8, 11), autoimmune pro- an interaction between B2 RNA and the Polycomb protein, EZH2, cesses such as macular degeneration (14, 15), and progression to results in cleavage of B2 RNA, release of B2 RNA from chromatin, cancer (16, 17). -
A Review on the Population Genomics of Transposable Elements
G C A T T A C G G C A T genes Review On the Population Dynamics of Junk: A Review on the Population Genomics of Transposable Elements Yann Bourgeois and Stéphane Boissinot * New York University Abu Dhabi, P.O. 129188 Saadiyat Island, Abu Dhabi, UAE; [email protected] * Correspondence: [email protected] Received: 4 April 2019; Accepted: 21 May 2019; Published: 30 May 2019 Abstract: Transposable elements (TEs) play an important role in shaping genomic organization and structure, and may cause dramatic changes in phenotypes. Despite the genetic load they may impose on their host and their importance in microevolutionary processes such as adaptation and speciation, the number of population genetics studies focused on TEs has been rather limited so far compared to single nucleotide polymorphisms (SNPs). Here, we review the current knowledge about the dynamics of transposable elements at recent evolutionary time scales, and discuss the mechanisms that condition their abundance and frequency. We first discuss non-adaptive mechanisms such as purifying selection and the variable rates of transposition and elimination, and then focus on positive and balancing selection, to finally conclude on the potential role of TEs in causing genomic incompatibilities and eventually speciation. We also suggest possible ways to better model TEs dynamics in a population genomics context by incorporating recent advances in TEs into the rich information provided by SNPs about the demography, selection, and intrinsic properties of genomes. Keywords: transposable elements; population genetics; selection; drift; coevolution 1. Introduction Transposable elements (TEs) are repetitive DNA sequences that are ubiquitous in the living world and have the ability to replicate and multiply within genomes. -
Genome Organization/ Human
Genome Organization/ Secondary article Human Article Contents . Introduction David H Kass, Eastern Michigan University, Ypsilanti, Michigan, USA . Sequence Complexity Mark A Batzer, Louisiana State University Health Sciences Center, New Orleans, Louisiana, USA . Single-copy Sequences . Repetitive Sequences . The human nuclear genome is a highly complex arrangement of two sets of 23 Macrosatellites, Minisatellites and Microsatellites . chromosomes, or DNA molecules. There are various types of DNA sequences and Gene Families . chromosomal arrangements, including single-copy protein-encoding genes, repetitive Gene Superfamilies . sequences and spacer DNA. Transposable Elements . Pseudogenes . Mitochondrial Genome Introduction . Genome Evolution . Acknowledgements The human nuclear genome contains 3000 million base pairs (bp) of DNA, of which only an estimated 3% possess protein-encoding sequences. As shown in Figure 1, the DNA sequences of the eukaryotic genome can be classified sequences such as the ribosomal RNA genes. Repetitive into several types, including single-copy protein-encoding sequences with no known function include the various genes, DNA that is present in more than one copy highly repeated satellite families, and the dispersed, (repetitive sequences) and intergenic (spacer) DNA. The moderately repeated transposable element families. The most complex of these are the repetitive sequences, some of remainder of the genome consists of spacer DNA, which is which are functional and some of which are without simply a broad category of undefined DNA sequences. function. Functional repetitive sequences are classified into The human nuclear genome consists of 23 pairs of dispersed and/or tandemly repeated gene families that chromosomes, or 46 DNA molecules, of differing sizes either encode proteins (and may include noncoding (Table 1). -
The Future of Transposable Element Annotation and Their Classification in the Light of Functional Genomics
The future of transposable element annotation and their classification in the light of functional genomics -what we can learn from the fables of Jean de la Fontaine? Peter Arensburger, Benoit Piegu, Yves Bigot To cite this version: Peter Arensburger, Benoit Piegu, Yves Bigot. The future of transposable element annotation and their classification in the light of functional genomics - what we can learn from thefables of Jean de la Fontaine?. Mobile Genetic Elements, Taylor & Francis, 2016, 6 (6), pp.e1256852. 10.1080/2159256X.2016.1256852. hal-02385838 HAL Id: hal-02385838 https://hal.archives-ouvertes.fr/hal-02385838 Submitted on 27 May 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Copyright The future of transposable element annotation and their classification in the light of functional genomics - what we can learn from the fables of Jean de la Fontaine? Peter Arensburger1, Benoît Piégu2, and Yves Bigot2 1 Biological Sciences Department, California State Polytechnic University, Pomona, CA 91768 - United States of America. 2 Physiologie de la reproduction et des Comportements, UMR INRA-CNRS 7247, PRC, 37380 Nouzilly – France Corresponding author address: Biological Sciences Department, California State Polytechnic University, Pomona, CA 91768 - United States of America. -
Repeated Horizontal Transfers of Four DNA Transposons in Invertebrates and Bats Zhou Tang1†, Hua-Hao Zhang2†, Ke Huang3, Xiao-Gu Zhang2, Min-Jin Han1 and Ze Zhang1*
Tang et al. Mobile DNA (2015) 6:3 DOI 10.1186/s13100-014-0033-1 RESEARCH Open Access Repeated horizontal transfers of four DNA transposons in invertebrates and bats Zhou Tang1†, Hua-Hao Zhang2†, Ke Huang3, Xiao-Gu Zhang2, Min-Jin Han1 and Ze Zhang1* Abstract Background: Horizontal transfer (HT) of transposable elements (TEs) into a new genome is considered as an important force to drive genome variation and biological innovation. However, most of the HT of DNA transposons previously described occurred between closely related species or insects. Results: In this study, we carried out a detailed analysis of four DNA transposons, which were found in the first sequenced twisted-wing parasite, Mengenilla moldrzyki. Through the homology-based strategy, these transposons were also identified in other insects, freshwater planarian, hydrozoans, and bats. The phylogenetic distribution of these transposons was discontinuous, and they showed extremely high sequence identities (>87%) over their entire length in spite of their hosts diverging more than 300 million years ago (Mya). Additionally, phylogenies and comparisons of transposons versus orthologous gene identities demonstrated that these transposons have transferred into their hosts by independent HTs. Conclusions: Here, we provided the first documented example of HT of CACTA transposons, which have been so far extensively studied in plants. Our results demonstrated that bats had continuously acquired new DNA elements via HT. This implies that predation on a large quantity of insects might increase bat exposure to HT. In addition, parasite-host interaction might facilitate exchanging of their genetic materials. Keywords: Horizontal transfer, CACTA transposons, Mammals, Recent activity Background or isolated species. -
The Saccharomyces Ty5 Retrotransposon Family Is Associated with Origins of DNA Replication at the Telomeres and the Silent Mating Locus HMR SIGE Zou, DAVID A
Proc. Natl. Acad. Sci. USA Vol. 92, pp. 920-924, January 1995 Genetics The Saccharomyces Ty5 retrotransposon family is associated with origins of DNA replication at the telomeres and the silent mating locus HMR SIGE Zou, DAVID A. WRIGHT, AND DANIEL F. VOYTAS* Department of Zoology and Genetics, Iowa State University, Ames, IA 50011 Communicated by Mary Lou Pardue, Massachusetts Institute of Technology, Cambridge, MA, October 3, 1994 ABSTRACT We have characterized the genomic organi- with tRNA genes has been well documented, and most Ty3 zation of the TyS retrotransposons among diverse strains of insertions are located within a few bases of the transcription Saccharomyces cerevisiae and the related species Saccharomyces start site of genes transcribed by RNA polymerase III (pol III) paradoxus. The S. cerevisiae strain S288C (or its derivatives) (4). The fourth retrotransposon family, Ty4, is represented by carries eight Ty5 insertions. Six of these are located near the a single insertion on chr III. This element is within 300 bp of telomeres, and five are found within 500 bp of autonomously a tRNA gene, and tRNA genes are associated with 10 of 12 Ty4 replicating sequences present in the type X subtelomeric insertions currently in the sequence data base (GenBank, repeat. The remaining two S. cerevisiae elements are adjacent release 83.0). Thus, a target bias, and in particular a preference to the silent mating locus HMR and are located within 500 bp for tRNA genes, is readily apparent from the genomic orga- of the origin of replication present in the transcriptional nization of endogenous Tyl-4 elements. -
Estimation of Duplication History Under a Stochastic Model for Tandem Repeats Farzad Farnoud1* , Moshe Schwartz2 and Jehoshua Bruck3
Farnoud et al. BMC Bioinformatics (2019) 20:64 https://doi.org/10.1186/s12859-019-2603-1 RESEARCH ARTICLE Open Access Estimation of duplication history under a stochastic model for tandem repeats Farzad Farnoud1* , Moshe Schwartz2 and Jehoshua Bruck3 Abstract Background: Tandem repeat sequences are common in the genomes of many organisms and are known to cause important phenomena such as gene silencing and rapid morphological changes. Due to the presence of multiple copies of the same pattern in tandem repeats and their high variability, they contain a wealth of information about the mutations that have led to their formation. The ability to extract this information can enhance our understanding of evolutionary mechanisms. Results: We present a stochastic model for the formation of tandem repeats via tandem duplication and substitution mutations. Based on the analysis of this model, we develop a method for estimating the relative mutation rates of duplications and substitutions, as well as the total number of mutations, in the history of a tandem repeat sequence. We validate our estimation method via Monte Carlo simulation and show that it outperforms the state-of-the-art algorithm for discovering the duplication history. We also apply our method to tandem repeat sequences in the human genome, where it demonstrates the different behaviors of micro- and mini-satellites and can be used to compare mutation rates across chromosomes. It is observed that chromosomes that exhibit the highest mutation activity in tandem repeat regions are the same as those thought to have the highest overall mutation rates. However, unlike previous works that rely on comparing human and chimpanzee genomes to measure mutation rates, the proposed method allows us to find chromosomes with the highest mutation activity based on a single genome, in essence by comparing (approximate) copies of the pattern in tandem repeats. -
Stable Integration of Transgenes Delivered by a Retrotransposon–Adenovirus Hybrid Vector
HUMAN GENE THERAPY 12:1417–1428 (July 20, 2001) Mary Ann Liebert, Inc. Stable Integration of Transgenes Delivered by a Retrotransposon–Adenovirus Hybrid Vector HARRIS SOIFER, 1,2 COLLIN HIGO, 1,2 HAIG H. KAZAZIAN, JR., 3 JOHN V. MORAN, 4 KOHNOSUKE MITANI, 5 and NORIYUKI KASAHARA 1,2,6 ABSTRACT Helper-dependent adenoviruses show great promise as gene delivery vectors. However, because they do not integrate into the host chromosome, transgene expression cannot be maintained indefinitely. To overcome these limitations, we have inserted an L1 retrotransposon/transgene element into a helper-dependent adeno- virus to create a novel chimeric gene delivery vector. Efficient adenovirus-mediated delivery of the L1 ele- ment into cultured human cells results in subsequent retrotransposition and stable integration of the trans- gene. L1 retrotransposition frequency was found to correlate with increasing multiplicity of infection by the chimeric vector, and further retrotransposition from newly integrated elements was not observed on pro- longed culture. Therefore, this vector, which utilizes components of low immunogenic potential, represents a novel two-stage gene delivery system capable of achieving high titers via the initial helper-dependent adeno- virus stage and permanent transgene integration via the retrotransposition stage. OVERVIEW SUMMARY ing to achieve efficient gene transfer, a variety of viruses have been adapted for use as gene delivery vectors, including retro- A retrotransposition-competent LINE-1 element (RC-L1) virus, adenovirus, and adeno-associated virus. Of these, ade- was inserted into a helper-dependent adenovirus (HDAd) to noviral vectors are capable of achieving high titers and have generate a hybrid adenovirus–retrotransposon vector. -
Genomic Comparison of Closely Related Giant Viruses Supports an Accordion-Like Model of Evolution
ORIGINAL RESEARCH published: 16 June 2015 doi: 10.3389/fmicb.2015.00593 Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution Jonathan Filée * Laboratoire Evolution, Génome, Comportement, Ecologie, Centre National de la Recherche Scientifique UMR 9191, IRD UMR 247, Université Paris-Saclay, Gif-sur-Yvette, France Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales). Origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, availability of a collection of closely related GV genomes enabling genomic comparisons offer the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for five groups of viruses belonging to the Mimiviridae and Phycodnaviridae families show that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over the time with gene gains compensating the Edited by: different losses. In addition, each lineage displays specific patterns of genome evolution. Bernard La Scola, Mimiviridae (megaviruses and mimiviruses) and Chlorella Phycodnaviruses evolved Aix Marseille Université, France mainly by duplications and losses of genes belonging to large paralogous families Reviewed by: (including movements of diverse mobiles genetic elements), whereas Micromonas and Dahlene N. Fusco, Massachusetts General Hospital, USA Ostreococcus Phycodnaviruses derive most of their genetic novelties thought lateral Philippe Colson, gene transfers. Taken together, these data support an accordion-like model of evolution Aix-Marseille Université, France in which GV genomes have undergone successive steps of gene gain and gene loss, *Correspondence: Jonathan Filée, accrediting the hypothesis that genome gigantism appears early, before the diversification Laboratoire Evolution, Génome, of the different GV lineages.