Genome-Wide Characterization of Human Minisatellite Vntrs
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Genome-Wide Analysis of Macrosatellite
Schaap et al. BMC Genomics 2013, 14:143 http://www.biomedcentral.com/1471-2164/14/143 RESEARCH ARTICLE Open Access Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: evidence for differences and commonalities in size distributions and size restrictions Mireille Schaap1, Richard JLF Lemmers1, Roel Maassen1, Patrick J van der Vliet1, Lennart F Hoogerheide2, Herman K van Dijk3, Nalan Baştürk4,5, Peter de Knijff1 and Silvère M van der Maarel1* Abstract Background: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and function is largely understudied. Here, we describe a detailed study of six autosomal and two X chromosomal MSRs among 270 HapMap individuals from Central Europe, Asia and Africa. Copy number variation, stability and genetic heterogeneity of the autosomal macrosatellite repeats RS447 (chromosome 4p), MSR5p (5p), FLJ40296 (13q), RNU2 (17q) and D4Z4 (4q and 10q) and X chromosomal DXZ4 and CT47 were investigated. Results: Repeat array size distribution analysis shows that all of these MSRs are highly polymorphic with the most genetic variation among Africans and the least among Asians. A mitotic mutation rate of 0.4-2.2% was observed, exceeding meiotic mutation rates and possibly explaining the large size variability found for these MSRs. By means of a novel Bayesian approach, statistical support for a distinct multimodal rather than a uniform allele size distribution was detected in seven out of eight MSRs, with evidence for equidistant intervals between the modes. -
DUX4, a Zygotic Genome Activator, Is Involved in Oncogenesis and Genetic Diseases Anna Karpukhina, Yegor Vassetzky
DUX4, a Zygotic Genome Activator, Is Involved in Oncogenesis and Genetic Diseases Anna Karpukhina, Yegor Vassetzky To cite this version: Anna Karpukhina, Yegor Vassetzky. DUX4, a Zygotic Genome Activator, Is Involved in Onco- genesis and Genetic Diseases. Ontogenez / Russian Journal of Developmental Biology, MAIK Nauka/Interperiodica, 2020, 51 (3), pp.176-182. 10.1134/S1062360420030078. hal-02988675 HAL Id: hal-02988675 https://hal.archives-ouvertes.fr/hal-02988675 Submitted on 17 Nov 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. ISSN 1062-3604, Russian Journal of Developmental Biology, 2020, Vol. 51, No. 3, pp. 176–182. © Pleiades Publishing, Inc., 2020. Published in Russian in Ontogenez, 2020, Vol. 51, No. 3, pp. 210–217. REVIEWS DUX4, a Zygotic Genome Activator, Is Involved in Oncogenesis and Genetic Diseases Anna Karpukhinaa, b, c, d and Yegor Vassetzkya, b, * aCNRS UMR9018, Université Paris-Sud Paris-Saclay, Institut Gustave Roussy, Villejuif, F-94805 France bKoltzov Institute of Developmental Biology of the Russian Academy of Sciences, -
A Multistep Bioinformatic Approach Detects Putative Regulatory
BMC Bioinformatics BioMed Central Research article Open Access A multistep bioinformatic approach detects putative regulatory elements in gene promoters Stefania Bortoluzzi1, Alessandro Coppe1, Andrea Bisognin1, Cinzia Pizzi2 and Gian Antonio Danieli*1 Address: 1Department of Biology, University of Padova – Via Bassi 58/B, 35131, Padova, Italy and 2Department of Information Engineering, University of Padova – Via Gradenigo 6/B, 35131, Padova, Italy Email: Stefania Bortoluzzi - [email protected]; Alessandro Coppe - [email protected]; Andrea Bisognin - [email protected]; Cinzia Pizzi - [email protected]; Gian Antonio Danieli* - [email protected] * Corresponding author Published: 18 May 2005 Received: 12 November 2004 Accepted: 18 May 2005 BMC Bioinformatics 2005, 6:121 doi:10.1186/1471-2105-6-121 This article is available from: http://www.biomedcentral.com/1471-2105/6/121 © 2005 Bortoluzzi et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background: Searching for approximate patterns in large promoter sequences frequently produces an exceedingly high numbers of results. Our aim was to exploit biological knowledge for definition of a sheltered search space and of appropriate search parameters, in order to develop a method for identification of a tractable number of sequence motifs. Results: Novel software (COOP) was developed for extraction of sequence motifs, based on clustering of exact or approximate patterns according to the frequency of their overlapping occurrences. -
Bioinformatics Analyses of Genomic Imprinting
Bioinformatics Analyses of Genomic Imprinting Dissertation zur Erlangung des Grades des Doktors der Naturwissenschaften der Naturwissenschaftlich-Technischen Fakultät III Chemie, Pharmazie, Bio- und Werkstoffwissenschaften der Universität des Saarlandes von Barbara Hutter Saarbrücken 2009 Tag des Kolloquiums: 08.12.2009 Dekan: Prof. Dr.-Ing. Stefan Diebels Berichterstatter: Prof. Dr. Volkhard Helms Priv.-Doz. Dr. Martina Paulsen Vorsitz: Prof. Dr. Jörn Walter Akad. Mitarbeiter: Dr. Tihamér Geyer Table of contents Summary________________________________________________________________ I Zusammenfassung ________________________________________________________ I Acknowledgements _______________________________________________________II Abbreviations ___________________________________________________________ III Chapter 1 – Introduction __________________________________________________ 1 1.1 Important terms and concepts related to genomic imprinting __________________________ 2 1.2 CpG islands as regulatory elements ______________________________________________ 3 1.3 Differentially methylated regions and imprinting clusters_____________________________ 6 1.4 Reading the imprint __________________________________________________________ 8 1.5 Chromatin marks at imprinted regions___________________________________________ 10 1.6 Roles of repetitive elements ___________________________________________________ 12 1.7 Functional implications of imprinted genes _______________________________________ 14 1.8 Evolution and parental conflict ________________________________________________ -
A Novel Trna Variable Number Tandem Repeat at Human Chromosome 1Q23.3 Is Implicated As a Boundary Element Based on Conservation of a CTCF Motif in Mouse Emily M
Published online 21 April 2014 Nucleic Acids Research, 2014, Vol. 42, No. 10 6421–6435 doi: 10.1093/nar/gku280 A novel tRNA variable number tandem repeat at human chromosome 1q23.3 is implicated as a boundary element based on conservation of a CTCF motif in mouse Emily M. Darrow and Brian P. Chadwick* Department of Biological Science, Florida State University, Tallahassee, FL 32306-4295, USA Received February 14, 2014; Revised March 24, 2014; Accepted March 25, 2014 ABSTRACT dividuals making macrosatellites some of the largest vari- able number tandem repeats (VNTRs) in the genome The human genome contains numerous large tan- (3,5,6,7,8,9,10,11,12). dem repeats, many of which remain poorly charac- What role macrosatellites fulfill in our genome is un- terized. Here we report a novel transfer RNA (tRNA) clear. Some contain open reading frames (ORFs) that tandem repeat on human chromosome 1q23.3 that are predominantly expressed in the testis or certain can- shows extensive copy number variation with 9–43 cers (13,14,15), whereas the expression of others is more repeat units per allele and displays evidence of mei- widespread (16,17,18,19). However, reduced copy number otic and mitotic instability. Each repeat unit consists of some is associated with disease (5,20) due to inappro- of a 7.3 kb GC-rich sequence that binds the insulator priate reactivation of expression (21). Others contain no protein CTCF and bears the chromatin hallmarks of obvious ORF (6,11); further complicating what purpose a bivalent domain in human embryonic stem cells. -
Application to Neuroimaging Biomarkers in Alzheimer's
The Author(s) BMC Medical Informatics and Decision Making 2017, 17(Suppl 1):61 DOI 10.1186/s12911-017-0454-0 RESEARCH Open Access Knowledge-driven binning approach for rare variant association analysis: application to neuroimaging biomarkers in Alzheimer’s disease Dokyoon Kim1,2, Anna O. Basile2, Lisa Bang1, Emrin Horgusluoglu4, Seunggeun Lee3, Marylyn D. Ritchie1,2, Andrew J. Saykin4 and Kwangsik Nho4* From The 6th Translational Bioinformatics Conference Je Ju Island, Korea. 15-17 October 2016 Abstract Background: Rapid advancement of next generation sequencing technologies such as whole genome sequencing (WGS) has facilitated the search for genetic factors that influence disease risk in the field of human genetics. To identify rare variants associated with human diseases or traits, an efficient genome-wide binning approach is needed. In this study we developed a novel biological knowledge-based binning approach for rare-variant association analysis and then applied the approach to structural neuroimaging endophenotypes related to late-onset Alzheimer’sdisease(LOAD). Methods: For rare-variant analysis, we used the knowledge-driven binning approach implemented in Bin-KAT, an automated tool, that provides 1) binning/collapsing methods for multi-level variant aggregation with a flexible, biologically informed binning strategy and 2) an option of performing unified collapsing and statistical rare variant analyses in one tool. A total of 750 non-Hispanic Caucasian participants from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort who had both WGS data and magnetic resonance imaging (MRI) scans were used in this study. Mean bilateral cortical thickness of the entorhinal cortex extracted from MRI scans was used as an AD-related neuroimaging endophenotype. -
The Cancer-Associated CTCFL/BORIS Protein Targets Multiple Classes of Genomic Repeats, with a Distinct Binding and Functional Pr
Pugacheva et al. Epigenetics & Chromatin (2016) 9:35 DOI 10.1186/s13072-016-0084-2 Epigenetics & Chromatin RESEARCH Open Access The cancer‑associated CTCFL/BORIS protein targets multiple classes of genomic repeats, with a distinct binding and functional preference for humanoid‑specific SVA transposable elements Elena M. Pugacheva2, Evgeny Teplyakov1, Qiongfang Wu1, Jingjing Li1, Cheng Chen1, Chengcheng Meng1, Jian Liu1, Susan Robinson2, Dmitry Loukinov2, Abdelhalim Boukaba1, Andrew Paul Hutchins3, Victor Lobanenkov2 and Alexander Strunnikov1* Abstract Background: A common aberration in cancer is the activation of germline-specific proteins. The DNA-binding pro- teins among them could generate novel chromatin states, not found in normal cells. The germline-specific transcrip- tion factor BORIS/CTCFL, a paralog of chromatin architecture protein CTCF, is often erroneously activated in cancers and rewires the epigenome for the germline-like transcription program. Another common feature of malignancies is the changed expression and epigenetic states of genomic repeats, which could alter the transcription of neighboring genes and cause somatic mutations upon transposition. The role of BORIS in transposable elements and other repeats has never been assessed. Results: The investigation of BORIS and CTCF binding to DNA repeats in the K562 cancer cells dependent on BORIS for self-renewal by ChIP-chip and ChIP-seq revealed three classes of occupancy by these proteins: elements cohabited by BORIS and CTCF, CTCF-only bound, or BORIS-only bound. The CTCF-only enrichment is characteristic for evolu- tionary old and inactive repeat classes, while BORIS and CTCF co-binding predominately occurs at uncharacterized tandem repeats. These repeats form staggered cluster binding sites, which are a prerequisite for CTCF and BORIS co-binding. -
Estimation of the RNU2 Macrosatellite Mutation Rate by BRCA1 Mutation
Published online 17 July 2014 Nucleic Acids Research, 2014, Vol. 42, No. 14 9121–9130 doi: 10.1093/nar/gku639 Estimation of the RNU2 macrosatellite mutation rate by BRCA1 mutation tracing Chloe´ Tessereau1,2, Yann Lesecque3, Nastasia Monnet1, Monique Buisson1, Laure Barjhoux1,Melanie´ Leon´ e´ 4, Bingjian Feng5, David E. Goldgar5,Olga M. Sinilnikova1,4, Sylvain Mousset3, Laurent Duret3 and Sylvie Mazoyer1,* 1Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Universite´ Lyon 1, Centre Leon´ Berard,´ Lyon, France, 2Genomic Vision, Bagneux, Paris, France, 3Laboratoire de Biometrie´ et Biologie Evolutive, CNRS UMR5558, Universite´ Lyon 1, France, 4Unite´ Mixte de Gen´ etique´ Constitutionnelle des Cancers Frequents,´ Hospices Civils de Lyon/Centre Leon´ Berard,´ Lyon, France and 5Department of Dermatology and Huntsman Cancer Institute University of Utah School of Medicine, Salt Lake City, Utah, USA Received April 4, 2014; Revised June 26, 2014; Accepted July 1, 2014 ABSTRACT traits, including diseases. The most extensively studied tan- dem repeat DNA sequences so far have been microsatellites Large tandem repeat sequences have been poorly (composed of units from 1 to 10 bp), in particular those in- investigated as severe technical limitations and their volved in trinucleotide repeat disorders, and minisatellites frequent absence from the genome reference hinder (units from 10 to a few hundreds bp long). Comparatively, their analysis. Extensive allelotyping of this class of little is known about tandemly repeated sequences longer variation has not been possible until now and their than 1 kb, also known as multi-allelic tandem copy number mutational dynamics are still poorly known. -
Changes in Mirna Expression in a Model of Microcephaly Shan Parikh University of Connecticut - Storrs, [email protected]
University of Connecticut OpenCommons@UConn Honors Scholar Theses Honors Scholar Program Spring 5-9-2010 Changes in miRNA Expression in a Model of Microcephaly Shan Parikh University of Connecticut - Storrs, [email protected] Follow this and additional works at: https://opencommons.uconn.edu/srhonors_theses Part of the Cellular and Molecular Physiology Commons, and the Other Physiology Commons Recommended Citation Parikh, Shan, "Changes in miRNA Expression in a Model of Microcephaly" (2010). Honors Scholar Theses. 140. https://opencommons.uconn.edu/srhonors_theses/140 Changes in miRNA expression in a model of Microcephaly Shan Parikh Physiology and Neurobiology Abstract miRNAs function to regulate gene expression through post-transcriptional mechanisms to potentially regulate multiple aspects of physiology and development. Whole transcriptome analysis has been conducted on the citron kinase mutant rat, a mutant that shows decreases in brain growth and development. The resulting differences in RNA between mutant and wild-type controls can be used to identify genetic pathways that may be regulated differentially in normal compared to abnormal neurogenesis. The goal of this thesis was to verify, with quantitative reverse transcriptase polymerase chain reaction (qRT-PCR), changes in miRNA expression in Cit-k mutants and wild types. In addition to confirming miRNA expression changes, bio- informatics software TargetScan 5.1 was used to identify potential mRNA targets of the differentially expressed miRNAs. The miRNAs that were confirmed to change include: rno-miR- 466c, mmu-miR-493, mmu-miR-297a, hsa-miR-765, and hsa-miR-1270. The TargetScan analysis revealed 347 potential targets which have known roles in development. A subset of these potential targets include genes involved in the Wnt signaling pathway which is known to be an important regulator of stem cell development. -
Stage-Specific Gene Expression Is a Fundamental Characteristic of Rat Spermatogenic Cells and Sertoli Cells
Stage-specific gene expression is a fundamental characteristic of rat spermatogenic cells and Sertoli cells Daniel S. Johnston*, William W. Wright†‡, Paul DiCandeloro*§, Ewa Wilson¶, Gregory S. Kopf*ʈ, and Scott A. Jelinsky¶ *Contraception, Women’s Health and Musculoskeletal Biology, Wyeth Research, 500 Arcola Road, Collegeville, PA 19426; †Division of Reproductive Biology, Department of Biochemistry and Molecular Biology, The Johns Hopkins Bloomberg School of Public Health, 615 North Wolfe Street, Baltimore, MD 21205- 2179; and ¶Biological Technologies, Biological Research, Wyeth Research, 87 Cambridge Park Drive, Cambridge, MA 02140 Edited by Ryuzo Yanagimachi, University of Hawaii, Honolulu, HI, and approved April 1, 2008 (received for review October 17, 2007) Mammalian spermatogenesis is a complex biological process that study detected 16,971 probe sets,** and analysis of their cell type occurs within a highly organized tissue, the seminiferous epithelium. and stage-regulated expression supported the conclusion that cyclic The coordinated maturation of spermatogonia, spermatocytes, and gene expression is a widespread and, therefore, fundamental charac- spermatids suggests the existence of precise programs of gene teristic of both spermatogenic and Sertoli cells. These data predicted expression in these cells and in their neighboring somatic Sertoli cells. that important biological pathways and processes are regulated as The objective of this study was to identify the genes that execute specific cell types progress through the stages of the cycle. these programs. Rat seminiferous tubules at stages I, II–III, IV–V, VI, VIIa,b, VIIc,d, VIII, IX–XI, XII, and XIII–XIV of the cycle were isolated by Results and Discussion microdissection, whereas Sertoli cells, spermatogonia plus early sper- Characterization of the Testicular Transcriptome. -
The Intertwining of Transposable Elements and Non-Coding Rnas
Int. J. Mol. Sci. 2013, 14, 13307-13328; doi:10.3390/ijms140713307 OPEN ACCESS International Journal of Molecular Sciences ISSN 1422-0067 www.mdpi.com/journal/ijms Review The Intertwining of Transposable Elements and Non-Coding RNAs Michael Hadjiargyrou 1 and Nicholas Delihas 2,* 1 Department of Life Sciences, Theobald Science Center, Room 420, New York Institute of Technology, Old Westbury, NY 11568, USA; E-Mail: [email protected] 2 Department of Molecular Genetics and Microbiology, School of Medicine, Stony Brook University, Stony Brook, NY 11794, USA * Author to whom correspondence should be addressed; E-Mail: [email protected]; Tel.: +1-631-286-9427; Fax: +1-631-632-9797. Received: 20 May 2013; in revised form: 5 June 2013 / Accepted: 5 June 2013 / Published: 26 June 2013 Abstract: Growing evidence shows a close association of transposable elements (TE) with non-coding RNAs (ncRNA), and a significant number of small ncRNAs originate from TEs. Further, ncRNAs linked with TE sequences participate in a wide-range of regulatory functions. Alu elements in particular are critical players in gene regulation and molecular pathways. Alu sequences embedded in both long non-coding RNAs (lncRNA) and mRNAs form the basis of targeted mRNA decay via short imperfect base-pairing. Imperfect pairing is prominent in most ncRNA/target RNA interactions and found throughout all biological kingdoms. The piRNA-Piwi complex is multifunctional, but plays a major role in protection against invasion by transposons. This is an RNA-based genetic immune system similar to the one found in prokaryotes, the CRISPR system. Thousands of long intergenic non-coding RNAs (lincRNAs) are associated with endogenous retrovirus LTR transposable elements in human cells. -
A SARS-Cov-2 Protein Interaction Map Reveals Targets for Drug Repurposing
Article A SARS-CoV-2 protein interaction map reveals targets for drug repurposing https://doi.org/10.1038/s41586-020-2286-9 A list of authors and affiliations appears at the end of the paper Received: 23 March 2020 Accepted: 22 April 2020 A newly described coronavirus named severe acute respiratory syndrome Published online: 30 April 2020 coronavirus 2 (SARS-CoV-2), which is the causative agent of coronavirus disease 2019 (COVID-19), has infected over 2.3 million people, led to the death of more than Check for updates 160,000 individuals and caused worldwide social and economic disruption1,2. There are no antiviral drugs with proven clinical efcacy for the treatment of COVID-19, nor are there any vaccines that prevent infection with SARS-CoV-2, and eforts to develop drugs and vaccines are hampered by the limited knowledge of the molecular details of how SARS-CoV-2 infects cells. Here we cloned, tagged and expressed 26 of the 29 SARS-CoV-2 proteins in human cells and identifed the human proteins that physically associated with each of the SARS-CoV-2 proteins using afnity-purifcation mass spectrometry, identifying 332 high-confdence protein–protein interactions between SARS-CoV-2 and human proteins. Among these, we identify 66 druggable human proteins or host factors targeted by 69 compounds (of which, 29 drugs are approved by the US Food and Drug Administration, 12 are in clinical trials and 28 are preclinical compounds). We screened a subset of these in multiple viral assays and found two sets of pharmacological agents that displayed antiviral activity: inhibitors of mRNA translation and predicted regulators of the sigma-1 and sigma-2 receptors.