Computational Approaches and Tools Used in Identification of Dispersed Repetitive DNA Sequences
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Multiple Origins of Viral Capsid Proteins from Cellular Ancestors
Multiple origins of viral capsid proteins from PNAS PLUS cellular ancestors Mart Krupovica,1 and Eugene V. Kooninb,1 aInstitut Pasteur, Department of Microbiology, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, 75015 Paris, France; and bNational Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894 Contributed by Eugene V. Koonin, February 3, 2017 (sent for review December 21, 2016; reviewed by C. Martin Lawrence and Kenneth Stedman) Viruses are the most abundant biological entities on earth and show genome replication. Understanding the origin of any virus group is remarkable diversity of genome sequences, replication and expres- possible only if the provenances of both components are elucidated sion strategies, and virion structures. Evolutionary genomics of (11). Given that viral replication proteins often have no closely viruses revealed many unexpected connections but the general related homologs in known cellular organisms (6, 12), it has been scenario(s) for the evolution of the virosphere remains a matter of suggested that many of these proteins evolved in the precellular intense debate among proponents of the cellular regression, escaped world (4, 6) or in primordial, now extinct, cellular lineages (5, 10, genes, and primordial virus world hypotheses. A comprehensive 13). The ability to transfer the genetic information encased within sequence and structure analysis of major virion proteins indicates capsids—the protective proteinaceous shells that comprise the that they evolved on about 20 independent occasions, and in some of cores of virus particles (virions)—is unique to bona fide viruses and these cases likely ancestors are identifiable among the proteins of distinguishes them from other types of selfish genetic elements cellular organisms. -
Table 1. Unclassified Proteins Encoded by Polintons Polintons
Table 1. Unclassified proteins encoded by Polintons Protein Polintons Length, Similar Description aa GenBank Family Species proteins* Polinton-1_DR Fish 280 Polinton-2_DR Fish 282 Polinton-1_XT Frog 248 Polinton-2_XT Frog 247 Polinton-1_SPU Lizard - Polinton-1_CI Sea squirt 249 A profile derived from the PX multiple alignment PX Polinton-2_CI Sea squirt 248 — does not match (based on PSI-BLAST) any proteins Polinton-1_SP Sea urchin 252 that are not encoded by Polintons. Polinton-2_SP Sea urchin 255 Polinton-3_SP Sea urchin 253 Polinton-5_SP Sea urchin 255 Polinton-1_TC Beatle 291 Polinton-1_DY Fruit fly 182 Polinton-1_DR Fish 435 Polinton-2_DR Fish 435 Polinton-1_XT Frog 436 Polinton-2_XT Frog 435 Polinton-1_SPU Lizard 435 This protein is the most conserved one among the Polinton-1_CI Sea squirt 431 Polinton-encoded unclassified proteins. Its Polinton-2_CI Sea squirt 428 conservation is comparable to that of POLB and PY Polinton-1_SP Sea urchin 442 — INT. Polinton-2_SP Sea urchin 442 A profile derived from the PX multiple alignment Polinton-3_SP Sea urchin 444 does not match any proteins that are not encoded by Polinton-4_SP Sea urchin 444 Polintons. Polinton-5_SP Sea urchin 444 Polinton-1_TC Beatle 437 Polinton-1_DY Fruit fly 430 Polinton-1_CB Nematode 287 Polinton-1_DR Fish 151 Polinton-2_DR Fish 156 Polinton-1_XT Frog 146 Polinton-2_XT Frog 147 Polinton-1_CI Sea squirt 141 Polinton-2_CI Sea squirt 140 A profile derived from the PX multiple alignment PW Polinton-1_SP Sea urchin 121 — does not match any proteins that are not encoded by Polinton-2_SP Sea urchin 120 Polintons. -
A Field Guide to Eukaryotic Transposable Elements
GE54CH23_Feschotte ARjats.cls September 12, 2020 7:34 Annual Review of Genetics A Field Guide to Eukaryotic Transposable Elements Jonathan N. Wells and Cédric Feschotte Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850; email: [email protected], [email protected] Annu. Rev. Genet. 2020. 54:23.1–23.23 Keywords The Annual Review of Genetics is online at transposons, retrotransposons, transposition mechanisms, transposable genet.annualreviews.org element origins, genome evolution https://doi.org/10.1146/annurev-genet-040620- 022145 Abstract Annu. Rev. Genet. 2020.54. Downloaded from www.annualreviews.org Access provided by Cornell University on 09/26/20. For personal use only. Copyright © 2020 by Annual Reviews. Transposable elements (TEs) are mobile DNA sequences that propagate All rights reserved within genomes. Through diverse invasion strategies, TEs have come to oc- cupy a substantial fraction of nearly all eukaryotic genomes, and they rep- resent a major source of genetic variation and novelty. Here we review the defining features of each major group of eukaryotic TEs and explore their evolutionary origins and relationships. We discuss how the unique biology of different TEs influences their propagation and distribution within and across genomes. Environmental and genetic factors acting at the level of the host species further modulate the activity, diversification, and fate of TEs, producing the dramatic variation in TE content observed across eukaryotes. We argue that cataloging TE diversity and dissecting the idiosyncratic be- havior of individual elements are crucial to expanding our comprehension of their impact on the biology of genomes and the evolution of species. 23.1 Review in Advance first posted on , September 21, 2020. -
Frequent Homozygous Deletions in the FRA3B Region in Tumor Cell Lines Still Leave the FHIT Exons Intact
Oncogene (1998) 16, 635 ± 642 1998 Stockton Press All rights reserved 0950 ± 9232/98 $12.00 Frequent homozygous deletions in the FRA3B region in tumor cell lines still leave the FHIT exons intact Liang Wang1, John Darling1, Jin-San Zhang1, Chi-Ping Qian1, Lynn Hartmann2, Cheryl Conover2, Robert Jenkins1 and David I Smith1 1Division of Experimental Pathology, Department of Laboratory Medicine and Pathology; and 2Department of Medical Oncology, Mayo Clinic/Foundation, 200 First Street, SW, Rochester, Maine 55902, USA FRA3B at human chromosomal band 3p14.2 is the most Soreng, 1984). However, whether or not fragile sites active common fragile site in the human genome. The play a causative role in these structural chromosome molecular mechanism of fragility at this region remains alterations has yet to be determined. unknown but does not involve expansion of a trinucleo- FRA3B, at chromosome band 3p14.2, is the most tide or minisatellite repeat as has been observed for highly inducible fragile site in the human genome several of the cloned rare fragile sites. Deletions and (Smeets et al., 1986). The constitutive familial renal cell rearrangements at FRA3B have been observed in a carcinoma-associated translocation t(3;8)(p14.2;q24) number of distinct tumors. The recently identi®ed (hRCC) (Cohen et al., 1979) was localized immedi- putative tumor suppressor gene FHIT spans FRA3B, ately centromeric of FRA3B (Paradee et al., 1995; and various groups have reported identifying deletions in Boldog et al., 1993). Structural rearrangements and this gene in dierent tumors. Using a high density of deletions at FRA3B were reported in a variety of PCR ampli®able markers within FRA3B searching for histologically dierent cancers including lung (Todd et deletions in the FRA3B region, we have analysed 21 al., 1997), breast (Panagopoulos et al., 1996), tumor cell lines derived from renal cell, pancreatic, and esophageal (Wang et al., 1996), ovarian (Ehlen et al., ovarian carcinomas. -
Repetitive Elements in Humans
International Journal of Molecular Sciences Review Repetitive Elements in Humans Thomas Liehr Institute of Human Genetics, Jena University Hospital, Friedrich Schiller University, Am Klinikum 1, D-07747 Jena, Germany; [email protected] Abstract: Repetitive DNA in humans is still widely considered to be meaningless, and variations within this part of the genome are generally considered to be harmless to the carrier. In contrast, for euchromatic variation, one becomes more careful in classifying inter-individual differences as meaningless and rather tends to see them as possible influencers of the so-called ‘genetic background’, being able to at least potentially influence disease susceptibilities. Here, the known ‘bad boys’ among repetitive DNAs are reviewed. Variable numbers of tandem repeats (VNTRs = micro- and minisatellites), small-scale repetitive elements (SSREs) and even chromosomal heteromorphisms (CHs) may therefore have direct or indirect influences on human diseases and susceptibilities. Summarizing this specific aspect here for the first time should contribute to stimulating more research on human repetitive DNA. It should also become clear that these kinds of studies must be done at all available levels of resolution, i.e., from the base pair to chromosomal level and, importantly, the epigenetic level, as well. Keywords: variable numbers of tandem repeats (VNTRs); microsatellites; minisatellites; small-scale repetitive elements (SSREs); chromosomal heteromorphisms (CHs); higher-order repeat (HOR); retroviral DNA 1. Introduction Citation: Liehr, T. Repetitive In humans, like in other higher species, the genome of one individual never looks 100% Elements in Humans. Int. J. Mol. Sci. alike to another one [1], even among those of the same gender or between monozygotic 2021, 22, 2072. -
MNS16A Tandem Repeat Minisatellite of Human Telomerase Gene: Functional Studies in Colorectal, Lung and Prostate Cancer
www.impactjournals.com/oncotarget/ Oncotarget, 2017, Vol. 8, (No. 17), pp: 28021-28027 Research Paper MNS16A tandem repeat minisatellite of human telomerase gene: functional studies in colorectal, lung and prostate cancer Philipp Hofer1, Cornelia Zöchmeister1, Christian Behm1, Stefanie Brezina1, Andreas Baierl2, Angelina Doriguzzi1, Vanita Vanas1, Klaus Holzmann1, Hedwig Sutterlüty- Fall1, Andrea Gsur1 1Medical University of Vienna, Institute of Cancer Research, A-1090 Vienna, Austria 2University of Vienna, Department of Statistics and Operations Research, A-1010 Vienna, Austria Correspondence to: Andrea Gsur, email: [email protected] Keywords: genetic variation, MNS16A, functional polymorphism, telomerase, TERT regulation Received: September 23, 2016 Accepted: February 21, 2017 Published: March 03, 2017 Copyright: Hofer et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC-BY), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. ABSTRACT MNS16A, a functional polymorphic tandem repeat minisatellite, is located in the promoter region of an antisense transcript of the human telomerase reverse transcriptase gene. MNS16A promoter activity depends on the variable number of tandem repeats (VNTR) presenting varying numbers of transcription factor binding sites for GATA binding protein 1. Although MNS16A has been investigated in multiple cancer epidemiology studies with incongruent findings, functional data of only two VNTRs (VNTR-243 and VNTR-302) were available thus far, linking the shorter VNTR to higher promoter activity. For the first time, we investigated promoter activity of all six VNTRs of MNS16A in cell lines of colorectal, lung and prostate cancer using Luciferase reporter assay. -
The Genome of Schmidtea Mediterranea and the Evolution Of
OPEN ArtICLE doi:10.1038/nature25473 The genome of Schmidtea mediterranea and the evolution of core cellular mechanisms Markus Alexander Grohme1*, Siegfried Schloissnig2*, Andrei Rozanski1, Martin Pippel2, George Robert Young3, Sylke Winkler1, Holger Brandl1, Ian Henry1, Andreas Dahl4, Sean Powell2, Michael Hiller1,5, Eugene Myers1 & Jochen Christian Rink1 The planarian Schmidtea mediterranea is an important model for stem cell research and regeneration, but adequate genome resources for this species have been lacking. Here we report a highly contiguous genome assembly of S. mediterranea, using long-read sequencing and a de novo assembler (MARVEL) enhanced for low-complexity reads. The S. mediterranea genome is highly polymorphic and repetitive, and harbours a novel class of giant retroelements. Furthermore, the genome assembly lacks a number of highly conserved genes, including critical components of the mitotic spindle assembly checkpoint, but planarians maintain checkpoint function. Our genome assembly provides a key model system resource that will be useful for studying regeneration and the evolutionary plasticity of core cell biological mechanisms. Rapid regeneration from tiny pieces of tissue makes planarians a prime De novo long read assembly of the planarian genome model system for regeneration. Abundant adult pluripotent stem cells, In preparation for genome sequencing, we inbred the sexual strain termed neoblasts, power regeneration and the continuous turnover of S. mediterranea (Fig. 1a) for more than 17 successive sib- mating of all cell types1–3, and transplantation of a single neoblast can rescue generations in the hope of decreasing heterozygosity. We also developed a lethally irradiated animal4. Planarians therefore also constitute a a new DNA isolation protocol that meets the purity and high molecular prime model system for stem cell pluripotency and its evolutionary weight requirements of PacBio long-read sequencing12 (Extended Data underpinnings5. -
Identification and Characterization of a Minisatellite Contained Within A
Klein et al. Mobile DNA (2015) 6:18 DOI 10.1186/s13100-015-0049-1 RESEARCH Open Access Identification and characterization of a minisatellite contained within a novel miniature inverted-repeat transposable element (MITE) of Porphyromonas gingivalis Brian A. Klein1,2, Tsute Chen2, Jodie C. Scott2, Andrea L. Koenigsberg1, Margaret J. Duncan2 and Linden T. Hu1* Abstract Background: Repetitive regions of DNA and transposable elements have been found to constitute large percentages of eukaryotic and prokaryotic genomes. Such elements are known to be involved in transcriptional regulation, host-pathogen interactions and genome evolution. Results: We identified a minisatellite contained within a miniature inverted-repeat transposable element (MITE) in Porphyromonas gingivalis.TheP. gingivalis minisatellite and associated MITE, named ‘BrickBuilt’, comprises a tandemly repeating twenty-three nucleotide DNA sequence lacking spacer regions between repeats, and with flanking ‘leader’ and ‘tail’ subunits that include small inverted-repeat ends. Forms of the BrickBuilt MITE are found 19 times in thegenomeofP. gingivalis strain ATCC 33277, and also multiple times within the strains W83, TDC60, HG66 and JCVI SC001. BrickBuilt is always located intergenically ranging between 49 and 591 nucleotides from the nearest upstream and downstream coding sequences. Segments of BrickBuilt contain promoter elements with bidirectional transcription capabilities. Conclusions: We performed a bioinformatic analysis of BrickBuilt utilizing existing whole genome sequencing, microarray and RNAseq data, as well as performing in vitro promoter probe assays to determine potential roles, mechanisms and regulation of the expression of these elements and their affect on surrounding loci. The multiplicity, localization and limited host range nature of MITEs and MITE-like elements in P. -
Unfolding of Quadruplex Structure in the G-Rich Strand of the Minisatellite Repeat by the Binding Protein UP1
Unfolding of quadruplex structure in the G-rich strand of the minisatellite repeat by the binding protein UP1 Hirokazu Fukuda*, Masato Katahira†, Naoto Tsuchiya*, Yoshiaki Enokizono†, Takashi Sugimura*, Minako Nagao*, and Hitoshi Nakagama*‡ *Biochemistry Division, National Cancer Center Research Institute, 1-1, Tsukiji 5, Chuo-ku, Tokyo 104-0045, Japan; and †Department of Environment and Natural Sciences, Graduate School of Environment and Information Sciences, Yokohama National University, 79-7 Tokiwadai, Hodogaya-ku, Yokohama 240-8501, Japan Contributed by Takashi Sugimura, July 31, 2002 The mouse hypervariable minisatellite (MN) Pc-1 consists of tan- the other four, MNBP-D, MNBP-E, MNBP-F, and MNBP-G, to dem repeats of d(GGCAG) and flanked sequences. We have previ- the complementary C-rich strand. ously demonstrated that single-stranded d(GGCAG)n folds into the In this article, we document isolation of cDNA clones encod- intramolecular folded-back quadruplex structure under physiolog- ing MNBP-B and characterization of a recombinant MNBP-B. ical conditions. Because DNA polymerase progression in vitro is Sequences of seven proteolytic peptides of purified MNBP-B blocked at the repeat, the characteristic intramolecular quadruplex were determined, and cDNA clones were subsequently isolated. structure of the repeat, at least in part, could be responsible for the MNBP-B was revealed to be identical to the single-stranded hypermutable feature of Pc-1 and other MNs with similar repetitive DNA binding protein, UP1 (17), which is a proteolytic product units. On the other hand, we have isolated six MN Pc-1 binding corresponding to the N-terminal 195 aa of the 34-kDa hetero- proteins (MNBPs) from nuclear extracts of NIH 3T3 cells. -
Repbase Update, a Database of Repetitive Elements in Eukaryotic Genomes Weidong Bao1*, Kenji K
Bao et al. Mobile DNA (2015) 6:11 DOI 10.1186/s13100-015-0041-9 SOFTWARE Open Access Repbase Update, a database of repetitive elements in eukaryotic genomes Weidong Bao1*, Kenji K. Kojima1,2,3* and Oleksiy Kohany1 Abstract Repbase Update (RU) is a database of representative repeat sequences in eukaryotic genomes. Since its first development as a database of human repetitive sequences in 1992, RU has been serving as a well-curated reference database fundamental for almost all eukaryotic genome sequence analyses. Here, we introduce recent updates of RU, focusing on technical issues concerning the submission and updating of Repbase entries and will give short examples of using RU data. RU sincerely invites a broader submission of repeat sequences from the research community. Keywords: Repbase Update, Repbase Reports, Transposable element, RepbaseSubmitter, Database Background submission and updating of Repbase entries, and will Repbase Update (RU), or simply “Repbase” for short, is a give short examples of using RU data. database of transposable elements (TEs) and other types of repeats in eukaryotic genomes [1]. Being a well- curated reference database, RU has been commonly used RU and TE identification for eukaryotic genome sequence analyses and in studies In eukaryotic genomes, most TEs exist in families of vari- concerning the evolution of TEs and their impact on ge- able sizes, i.e., TEs of one specific family are derived from nomes [2–6]. RU was initiated by the late Dr. Jerzy Jurka a common ancestor through its major burst of multiplica- in the early 1990s and had been developed under his dir- tion in the evolutionary history. -
Exploration of the Propagation of Transpovirons Within Mimiviridae Reveals a Unique Example of Commensalism in the Viral World
The ISME Journal (2020) 14:727–739 https://doi.org/10.1038/s41396-019-0565-y ARTICLE Exploration of the propagation of transpovirons within Mimiviridae reveals a unique example of commensalism in the viral world 1 1 1 1 1 Sandra Jeudy ● Lionel Bertaux ● Jean-Marie Alempic ● Audrey Lartigue ● Matthieu Legendre ● 2 1 1 2 3 4 Lucid Belmudes ● Sébastien Santini ● Nadège Philippe ● Laure Beucher ● Emanuele G. Biondi ● Sissel Juul ● 4 2 1 1 Daniel J. Turner ● Yohann Couté ● Jean-Michel Claverie ● Chantal Abergel Received: 9 September 2019 / Revised: 27 November 2019 / Accepted: 28 November 2019 / Published online: 10 December 2019 © The Author(s) 2019. This article is published with open access Abstract Acanthamoeba-infecting Mimiviridae are giant viruses with dsDNA genome up to 1.5 Mb. They build viral factories in the host cytoplasm in which the nuclear-like virus-encoded functions take place. They are themselves the target of infections by 20-kb-dsDNA virophages, replicating in the giant virus factories and can also be found associated with 7-kb-DNA episomes, dubbed transpovirons. Here we isolated a virophage (Zamilon vitis) and two transpovirons respectively associated to B- and C-clade mimiviruses. We found that the virophage could transfer each transpoviron provided the host viruses were devoid of 1234567890();,: 1234567890();,: a resident transpoviron (permissive effect). If not, only the resident transpoviron originally isolated from the corresponding virus was replicated and propagated within the virophage progeny (dominance effect). Although B- and C-clade viruses devoid of transpoviron could replicate each transpoviron, they did it with a lower efficiency across clades, suggesting an ongoing process of adaptive co-evolution. -
Downloads/Repeatmaskedgenomes
Kojima Mobile DNA (2018) 9:2 DOI 10.1186/s13100-017-0107-y REVIEW Open Access Human transposable elements in Repbase: genomic footprints from fish to humans Kenji K. Kojima1,2 Abstract Repbase is a comprehensive database of eukaryotic transposable elements (TEs) and repeat sequences, containing over 1300 human repeat sequences. Recent analyses of these repeat sequences have accumulated evidences for their contribution to human evolution through becoming functional elements, such as protein-coding regions or binding sites of transcriptional regulators. However, resolving the origins of repeat sequences is a challenge, due to their age, divergence, and degradation. Ancient repeats have been continuously classified as TEs by finding similar TEs from other organisms. Here, the most comprehensive picture of human repeat sequences is presented. The human genome contains traces of 10 clades (L1, CR1, L2, Crack, RTE, RTEX, R4, Vingi, Tx1 and Penelope) of non-long terminal repeat (non-LTR) retrotransposons (long interspersed elements, LINEs), 3 types (SINE1/7SL, SINE2/tRNA, and SINE3/5S) of short interspersed elements (SINEs), 1 composite retrotransposon (SVA) family, 5 classes (ERV1, ERV2, ERV3, Gypsy and DIRS) of LTR retrotransposons, and 12 superfamilies (Crypton, Ginger1, Harbinger, hAT, Helitron, Kolobok, Mariner, Merlin, MuDR, P, piggyBac and Transib) of DNA transposons. These TE footprints demonstrate an evolutionary continuum of the human genome. Keywords: Human repeat, Transposable elements, Repbase, Non-LTR retrotransposons, LTR retrotransposons, DNA transposons, SINE, Crypton, MER, UCON Background contrast, MER4 was revealed to be comprised of LTRs of Repbase and conserved noncoding elements endogenous retroviruses (ERVs) [1]. Right now, Repbase Repbase is now one of the most comprehensive data- keeps MER1 to MER136, some of which are further bases of eukaryotic transposable elements and repeats divided into several subfamilies.