Partial Bisulfite Conversion for Unique Template Sequencing

Published online 17 November 2017. Nucleic Acids Research, 2018, Vol. 46, No. 2 e10. doi: 10.1093/nar/gkx1054

Vijay Kumar, Julie Rosenbaum, Zihua Wang, Talitha Forcier, Michael Ronemus, Michael Wigler and Dan Levy*

Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724 USA

Received May 25, 2017; Revised October 10, 2017; Editorial Decision October 16, 2017; Accepted October 18, 2017

*To whom correspondence should be addressed. Tel: +1 516 367 8377; Fax: +1 516 367 8381; Email: [email protected]

© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected]

ABSTRACT

We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135,000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone.

INTRODUCTION

Long-read sequencing platforms such as PacBio and Oxford Nanopore are costly and error-prone, but provide the long-range information required for high quality assemblies (1). Short-read sequencers are relatively inexpensive and have excellent precision; however, the read lengths are sufficient only for simpler assemblies. The specific problems with short-read sequencers are readily enumerated. Whenever the distinguishing variants in the template molecules are more than one read length apart, multiple distinct assemblies are equally consistent with the read data. This prevents resolving haplotypes, observing transcript isoforms, and assembling complex repetitive regions. Although sequence fidelity is good, low-frequency variants are not distinguishable from PCR and sequencing error. Finally, distortion during PCR amplification makes for an unreliable estimate of count in RNA expression and DNA copy number.

There are a host of new technologies designed to address the shortcomings of short-read sequencing, utilizing various strategies of limiting dilution and tagging (2–7). In this paper, we present a fundamentally different approach that embeds a unique mutational tag within the sequence of each template molecule, and discuss the merits of a diverse set of tools for enhanced short-read sequencing. Previously, we described the theoretical practicality of such a method (8). By marking each initial template molecule with a random mutation pattern, all subsequent copies of the original molecule will carry the same pattern. Thus overlapping copies from the same initial template can be joined if they have near identical patterns that far exceed chance agreement. With sufficient coverage, this property enables the long-range assembly of each mutated template molecule. Such information is also useful for problems of haplotype phasing and measuring repeat lengths. As with all template tagging methods, this method also allows accurate counting and low-error sequencing.

We demonstrate a protocol and informatics that realize the theory: marking each initial template with a demonstrably unique mutational pattern and reconstructing identity, assembly, and count from noisy real-world data. We call this method mutational sequencing or muSeq. We use partial sodium bisulfite conversion to mark double-stranded template DNA molecules or first-strand cDNAs. The bisulfite reaction deaminates unmethylated cytosines, and is typically used for studying cytosine methylation patterns in the genome (9). For that application, the deamination reaction is run to completion, converting nearly every unmethylated cytosine to uracil. For randomly marking templates, however, we require partial conversion. By adjusting the time and temperature of a step in the bisulfite reaction, we can reliably control the rate of conversion. Reflecting the binary nature of the conversion, we refer to cytosines in this context as 'bits.'

To test the operating characteristics of the muSeq protocol, we conducted two series of experiments. In the first we studied the application of muSeq to a genomic representation (Figure 1), focusing our attention on ∼135,000 restriction fragments of the proper size and uniqueness. Our experiments show that the rate of deamination is independent of position and uncorrelated within the template. We observe that fragment counts are linear with copy number and that allele ratios follow the expected binomial distribution. We determine that the method does not contribute any measurable sequence error. In the second series of experiments, we applied the muSeq protocol to cDNA derived from reverse transcribed poly(A)+ cellular RNA. Applying a simple algorithm, we clustered sequence reads into longer consensus templates. The resulting templates comprised thousands of unique transcripts and compare favorably to reference transcript assemblies. Analyses of the data demonstrate the ability to reconstruct splicing patterns at the level of individual transcripts.

MATERIALS AND METHODS

Representations

Genomic DNAs were extracted from whole blood, cleaved with PstI, end-repaired and ligated to custom Illumina sequencing primers (Figure 1A and B). The primers are rendered bisulfite conversion-resistant by substituting 5-methyl-cytosine (5mC) for cytosine during oligo synthesis. The complete conversion protocol uses the MethylEasy Xceed Rapid DNA Bisulphite Modification Kit Mix (Human Genetic Signatures/Clontech) according to standard instructions. The partial conversion protocol (Figure 1C) uses the same kit and instructions, but we reduce the temperature and time during incubation with the combined Reagents 1 and 2 (step 5 in the instructions). We started with 75 ng of input DNA for each reaction and carried out both complete (45 min, 80°C) and partial conversion at 3, 6 and 9 min at 73°C. After conversion, we sampled 4% from each converted sample and PCR-amplified (Figure 1D) using Illumina P5 and P7 sequencing adapters (for the complete conversion library, we sampled 40%). The resulting libraries were sequenced (Figure 1E) on an Illumina MiSeq (∼17 million paired-end reads per sample). We also sampled 2% from the 6 min conversion, then amplified for five linear rounds with just one primer (P7); we then completed the PCR as above, sequencing the resulting libraries on two lanes of an Illumina NextSeq (∼800 million paired-[…]

[…] read 1 and all G to A in read 2 (Figure 1F). These modified reads are then mapped by Bowtie2 (11) to two modified versions of the reference genome (hg38 assembly): one with all C converted to T (hg38 CT), and one with all G converted to A (hg38 GA). Selection of the best mapping determines the strand of origin (Figure 1G). By referencing the original read pair, we determine the conversion pattern.

From the reference genome, there are 162,353 expected PstI fragments between 150 and 400 bp in length. We convert these fragments in silico, both C-to-T and G-to-A, and map them to hg38 CT and hg38 GA. Those in which both top and bottom strands map unambiguously (MAPQ ≥ 40) comprise 135,262 high quality representation fragments (HQRFs). We further consider only those reads that map with high quality alignments to HQRFs. These reads account for about 50% of the raw sequence.

Read pairs are binned by restriction fragment and the RT or RB of the initial template (Figure 1H). Each of the 270,000 bins is analyzed separately to determine the set of initial template conversion patterns. While many read pairs from the same initial template fragment bear identical conversion patterns and sequence, sequencing and PCR errors are sufficiently frequent to require methods for inferring their consensus, or common ancestral template. Consequently, we extract all bits from each read pair (Figure 1H), establishing a bit string where 0 indicates that a position is unconverted and 1 indicates that a position is converted by sodium bisulfite. To cluster read pairs, we use transitive propagation, an algorithm we developed to find an optimal clustering (see supplement for details). Given a model for base-calling error and a model for conversion rates, transitive propagation identifies a clustering solution that optimizes pairwise probabilities of belonging to the same cluster (a = b) or not (a ≠ b) under the condition that belonging to the same cluster is a transitive relation.

cDNA

We extracted total RNA from 3 million fibroblasts from a line derived from the same donor as the whole blood sample. We sampled 3.3% of the RNA for conversion to cDNA by reverse transcriptase (100 U; SMARTScribe reverse transcriptase; Clontech), employing custom oligo d(T) primers and template switch primers, each with a sample tag and random barcode. We made two such samples with distinct pairs of sample tags. We subjected the first strand cDNA to 6-minute partial bisulfite conversion as above.