Downloading and Demultiplexing Fastq Files
Total Page:16
File Type:pdf, Size:1020Kb
bioRxiv preprint doi: https://doi.org/10.1101/2020.09.09.284505; this version posted September 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Transcriptome Alterations in Myotonic Dystrophy Frontal Cortex Brittney A. Otero1, Kiril Poukalov1*, Ryan P. Hildebrandt1*, Charles A. Thornton2, Kenji Jinnai3, Harutoshi Fujimura4, Takashi Kimura5, Katharine A. Hagerman6, JaCinda B. Sampson6, John W. Day6, EriC T. Wang1,7 1Dept. of Molecular Genetics & Microbiology, Center for NeuroGenetics, Genetics Institute, University of Florida 2Department of Neurology, University of Rochester Medical Center 3Department of Neurology, National Hospital Organization Hyogo-Chuo Hospital 4Department of Neurology, National Hospital Organization Toneyama Hospital 5Department of Neurology, Hyogo College of Medicine 6Department of Neurology, Stanford University 7To whom correspondence should be addressed *These authors contributed equally to this worK Abstract MyotoniC dystrophy (dystrophia myotoniCa, DM) is Caused by expanded CTG/CCTG miCrosatellite repeats, leading to multi-systemic symptoms in skeletal muscle, heart, gastrointestinal, endocrine, and central nervous systems (CNS), among others. For some patients, CNS issues can be as debilitating or more so than muscle symptoms; they include hypersomnolence, executive dysfunction, white matter atrophy, and neurofibrillary tangles. Although transCriptomes from DM type 1 (DM1) skeletal muscle have provided useful insights into pathomeChanisms and biomarkers, limited studies of transCriptomes have been performed in the CNS. To eluCidate underlying Causes of CNS dysfunCtion in patients, we have generated and analyzed RNA-seq transcriptomes from the frontal cortex of 21 DM1 patients, 4 DM type 2 (DM2) patients, and 8 unaffeCted controls. One hundred and thirty high confidenCe spliCing changes were identified, most occurring exclusively in the CNS and not in skeletal muscle or heart. Mis-spliced exons were found in neurotransmitter receptors, ion channels, and synaptic scaffolds, and we identified an alternative exon in GRIP1 that modulates association with kinesins. Splicing changes exhibited a gradient of severity correlating with CTG repeat length, as measured by optical mapping of individual DNA molecules. All individuals studied, including those with modest spliCing defeCts, showed extreme somatiC mosaiCism, with a subset of alleles having >1000 CTGs. Analyses of gene expression changes showed up-regulation of genes transCribed in miCroglia and endothelial cells, suggesting neuroinflammation, and down- regulation of genes transCribed in neurons. Gene expression of RNAs enCoding proteins detectable in cerebrospinal fluid were also found to correlate with mis-splicing, with implications for CNS biomarkers of disease severity. These findings provide a framework for future meChanistiC and therapeutiC studies of CNS issues in DM. Introduction Myotonic dystrophy (DM) is a multi-systemic, progressive disease Caused by expanded CTG/ CCTG repeats in the 3’ UTR of the dystrophia myotoniCa protein kinase gene (DMPK, DM Type 1, DM1) 1 or the first intron of the cellular nucleic acid binding protein gene (CNBP, DM Type 2, DM2) 2, respeCtively. Both DM1 and DM2 are highly variable in age of onset, cliniCal features, and disease severity 3, 4. Although DM is well studied in the context of peripheral symptoms such as myotonia and muscle weakness, central nervous system (CNS) symptoms are also common in DM and can contribute significantly to neurological impairment 3, 4. These symptoms inClude hypersomnia, executive functioning deficits, memory deficits, and emotional bioRxiv preprint doi: https://doi.org/10.1101/2020.09.09.284505; this version posted September 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. disturbances 5, 6, 7. Imaging studies show white and gray matter abnormalities in multiple brain regions, and ventriCle enlargement (reviewed in 8). However, moleCular meChanisms driving these neurobiologiCal changes remain largely unknown. While there is strong evidenCe to support a pathomeChanism in which MBNL proteins are funCtionally depleted in both peripheral and CNS tissues in DM 9, 10, 11, 12, 13, 14, limited analyses have been performed to comprehensively identify transcriptome changes in the CNS across a broad subset of patients, assess the extent to which MBNLs are depleted, and address whether other RNA binding proteins are perturbed. RNA-seq has not been applied to CNS tissues in DM2, and the extent to whiCh RBFOX proteins may modulate CNS pathogenesis in DM2 15 is also unexplored. Furthermore, as opposed to skeletal muscle in which dysfunction is predominantly driven by myonuClei, the extent to whiCh different cell types are affeCted in the CNS of DM-affected individuals is unknown. CUG foci have been observed in neurons, glia, and oligodendrocytes 9, but the contribution of eaCh cell type to pathology has not been extensively explored. Although transcriptome dysregulation has been extensively studied in peripheral DM1 tissues 16, 17, 18, 19, the repertoire of mRNAs expressed in the CNS is distinCt, and therefore a different set of exons may be mis-regulated in this tissue. Finally, although there are clear examples of how speCifiC spliCing events in peripheral tissues Cause particular DM symptoms (ClCn1 and myotonia 20, Bin1 and musCle weakness 21, SCn5a and cardiaC arrhythmias 19), no clear examples exist in the CNS. To lay the groundwork required to answer some of these questions, here we generate and analyze transcriptomes derived from a set of post-mortem frontal cortex (FC) samples (Brodmann Area 10) from DM1 patients, DM2 patients, and unaffected controls. We identify high ConfidenCe mis-spliced exons that show a gradient of changes across patients and study how a mis-splicing event in GRIP1 may alter its efficacy as a synaptic adaptor. We analyze DM transcriptomes together with additional single cell transCriptome datasets 22 to assess potential changes in cell type composition and determine whiCh Cell types are potentially responsible for Changes in gene expression and spliCing patterns. SomatiC instability is well established as a major driver of age of onset in DM1 and other repeat expansion diseases 23, 24, 25, but teChniCal challenges have preCluded faCile, direCt assessment of full length repeat-containing alleles in the DM1 CNS. Furthermore, the CTG repeat lengths required to sequester suffiCient MBNL to eliCit robust mis-splicing have not been assessed in this tissue. New teChnologies suCh as long read sequenCing provide some advantages over Southern blotting and small pool PCR 26, but often still require amplifiCation or sub-cloning, whiCh Can introduCe some biases. Here, we apply a new optiCal mapping approaCh to size expanded CTG repeats at single molecule resolution in DM1; this approach allows for unbiased, amplification-free measurement of repeat lengths from genomiC DNA 27, 28. Finally, while studies of transCriptomes Can identify moleCular events driving disease features, they may also suggest potential markers of disease status. Here, we study changes in the frontal cortex, but similar moleCular changes may oCCur in other brain regions. The aggregate of these pathologiCal proCesses may be refleCted in the composition of cerebrospinal fluid (CSF), faCilitating the development of aCCessible biomarkers to more precisely characterize disease severity and measure changes following therapeutic intervention. Indeed, CSF biomarkers have been developed for other neurological diseases such as Huntington’s Disease, C9ALS/FTD, and Alzheimer’s Disease 29, 30, 31, 32; in parallel, investigators have used proteomiCs to broadly characterize proteins present in CSF. In DM1, splicing biomarkers correlate with muscle strength 33 and potentially reflect the concentration of free MBNL in skeletal muscle 17; subsequent approaches have been developed to profile RNAs present in urine 34, potentially providing a non-invasive route to measure disease status. By integrating our transcriptome bioRxiv preprint doi: https://doi.org/10.1101/2020.09.09.284505; this version posted September 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. analyses with existing knowledge of proteins present in CSF, we lay critical groundwork to properly inform and interpret future biomarker discovery efforts. Results Splicing dysregulation in the DM1 frontal cortex exhibits a gradient of severity We performed RNA-seq using RNA from post-mortem frontal Cortex (Brodmann Area 10) of 21 DM1, 4 DM2, and 8 unaffeCted age- and sex- matChed individuals (Fig. 1A, see Methods). All libraries satisfied typical quality metrics in FASTQC 35 and were sequenced to a depth of at least 88 million reads to provide sufficient coverage for analyses of gene expression and alternative splicing. Percent spliced in (ψ) values were estimated by MISO 36, and using a threshold of at least 20% Change in mean ψ (p < 0.01, rank-sum test), 130 exons were identified to be