Open Xinwei Han Thesis.Pdf

Total Page:16

File Type:pdf, Size:1020Kb

Open Xinwei Han Thesis.Pdf The Pennsylvania State University The Graduate School Intercollege Graduate Program in Genetics UNDERSTANDING GENE EXPRESSION AND GENETIC RECOMBINATION BY NEXT GENERATION SEQUENCING A Dissertation in Genetics by Xinwei Han 2012 Xinwei Han Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy May 2012 The dissertation of Xinwei Han was reviewed and approved* by the following: Stephen W. Schaeffer Professor of Biology Chair of Committee Hong Ma Distinguished Professor of Biology Dissertation Advisor Naomi S. Altman Professor of Statistics Gong Chen Associate Professor of Biology Anton Nekrutenko Associate Professor of Biochemistry and Molecular Biology Robert F. Paulson Associate Professor of Veterinary and Biomedical Sciences Chair of Intercollege Graduate Degree Program in Genetics *Signatures are on file in the Graduate School iii ABSTRACT The introduction of next-generation sequencing technologies has been changing the landscape of biological research. The plummeting cost of massive sequencing not only leads to the flourishing of various genome projects, but also opens up many opportunities in previously uncharted areas, two remarkable examples of which are sequencing different individuals of the same species and sequencing transcriptomes. With the availability of multiple genome sequences from a population, it is possible to systematically catalog natural variations and, more importantly, investigate their genome-wide distribution to deduce functional elements through conservation or selection. Moreover, as genomic changes reflect evolution at work, the comprehensive map of natural variations serves as a basis for studying the properties and effects of evolutionary forces. For transcriptomics, sequencing has become a revolutionary way to characterize gene expression. It not only offers high replicability and unparalleled accuracy, but also requires less input material, enabling transcriptome study in tiny structures, and no prior knowledge of gene structure, allowing detection of unknown transcripts. Here, three sequencing-based studies are presented. One study explored natural variations between two Arabidopsis ecotypes, Col and Ler, and then investigated one crucial evolutionary force to shape variations, meiotic recombination. The other two studies investigated the small RNA transcriptome in Arabidopsis meiocytes and mRNA transcriptomes in developing mouse cortex. In the first study, the sequencing and comparison between Col and Ler uncovered 249,171 SNPs, 58,085 small and 2,315 large indels, with highly correlated genome-wide distributions of SNPs and small indels. Disease resistance genes contain significantly more variations, suggesting adaptation to specific environmental niches. These variations were then used as markers to investigate meiotic recombinations, crossovers and gene conversions, in two tetrads, detecting 18 crossovers, 6 of which had an associated gene conversion event, and 4 iv independent gene conversions. The number and length of identified recombination events suggest that Arabidopsis gene conversions are likely fewer and with shorter tracts than those in yeast. In addition, the analysis of variations in offspring plants showed meiosis provided a rapid mechanism to generate copy number variations (CNVs) by reshuffling existing variations. In the second study, a recently developed method was applied to collect Arabidopsis meiocytes, a limited number of cells undergoing meiosis, and then small RNAs were profiled using SOLiD sequencing. 97 of 266 known miRNAs show expression in meiocytes. Interestingly, five miRNAs were found to account for more than half of the total miRNA expression in meiocytes, among which miR158a takes up about one third. The target genes of these five miRNAs have little or low expression in meiocytes. One putative novel miRNA was identified, which shows conservation with rice and maize. Analysis of longer reads provided clues for possible long ncRNAs in meiocytes. The mouse transcritpome study uncovered 3,758 differentially expressed genes between two critical stages of cortex development, embryonic day 18 (E18) and postnatal day (P7). Neurogenesis-related genes, such as Sox4 and Sox11, were more highly expressed at E18 than at P7. In contrast, the genes encoding synaptic proteins were up-regulated from E18 to P7, suggesting cortex development changes focus from neuron generation to synapse formation. In addition, approximately 500 genes with unknown function show dramatic change in expression level, serving as a blueprint for further experimental studies. Thousands of novel splice variants from 2,930 genes were identified, providing clues about another layer of dynamic expression. These sequencing-based studies pushed the limit of previous work. The characterization of meiotic recombination reached single base-pair resolution. The mouse transcriptome work is one of several early studies utilizing RNA-seq. The small RNA transcriptome in meiocytes, single type cells at microscopic scale, was profiled for the first time. All these studies provided unprecedented information about complicated biological processes. v TABLE OF CONTENTS LIST OF FIGURES ................................................................................................................. vii LIST OF TABLES ................................................................................................................... ix ACKNOWLEDGEMENTS ..................................................................................................... x CHAPTER 1 INTRODUCTION ............................................................................................ 1 1.1 Overview of Next Generation Sequencing................................................................. 2 1.1.1 Brief history of DNA sequencing .................................................................... 2 1.1.2 Application of next generation sequencing ..................................................... 6 1.1.3 The development of read mapping tools ......................................................... 10 1.2 Arabidopsis Natural Variation and Meiotic Recombination as One Crucial Shaping Force ........................................................................................................... 12 1.3 Brief Overview of Small RNAs Studies in Arabidopsis ............................................ 15 1.4 Transcriptome Studies in Mouse and Brief Introduction of Early Cortex Development ............................................................................................................ 18 CHAPTER 2 ANALYSIS OF ARABIDOPSIS GENOME-WIDE VARIATIONS BEFORE AND AFTER MEIOSIS AND MEIOTIC RECOMBINATION BY RE- SEQUENCING LANDSBERG ERECTA AND ALL FOUR PRODUCTS OF A SINGLE MEIOSIS .......................................................................................................... 20 2.1 Summary .................................................................................................................... 21 2.2 Introduction ................................................................................................................ 22 2.3 Material and Methods ................................................................................................ 25 2.3.1 Plant material and growth conditions .............................................................. 25 2.3.2 DNA isolation, genotyping, and genome re-sequencing ................................. 25 2.3.3 Calling of small polymorphisms ..................................................................... 26 2.3.4 Identification of larger indels .......................................................................... 26 2.3.5 Further bioinformatic analysis of SNP/indel affected genes ........................... 27 2.3.6 Identification of crossovers and non-crossovers ............................................. 28 2.3.7 Meiosis simulation and statistical analysis ...................................................... 29 2.3.8 Data access ...................................................................................................... 29 2.4 Results ........................................................................................................................ 30 2.4.1 Sequencing of the Ler genome uncovered numerous SNPs with functional Implications ...................................................................................................... 30 2.4.2 Numerous small indels with similar distribution patterns to those of SNPs ... 33 2.4.3 Detection of large indels and CNVs ................................................................ 35 2.4.4 Generating and sequencing “tetrads” of meiotic progeny plants .................... 38 2.4.5 Single-base resolution analysis of COs and NCOs/GCs ................................. 38 2.4.6 Redistribution of genome variations after meiosis .......................................... 43 2.4.7 CNVs due to meiotic reshuffling of structural variants................................... 44 2.5 Discussion .................................................................................................................. 46 2.5.1 Genetic variation and phenotypic variation ..................................................... 46 2.5.2 Possible relationship between frequency of CO, genome size and length of synaptonemal complex ................................................................................. 47 vi 2.5.3 The low frequency of detected NCOs and possible short GC
Recommended publications
  • The Rnase H-Like Superfamily: New Members, Comparative Structural Analysis and Evolutionary Classification Karolina A
    4160–4179 Nucleic Acids Research, 2014, Vol. 42, No. 7 Published online 23 January 2014 doi:10.1093/nar/gkt1414 The RNase H-like superfamily: new members, comparative structural analysis and evolutionary classification Karolina A. Majorek1,2,3,y, Stanislaw Dunin-Horkawicz1,y, Kamil Steczkiewicz4, Anna Muszewska4,5, Marcin Nowotny6, Krzysztof Ginalski4 and Janusz M. Bujnicki1,3,* 1Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland, 2Department of Molecular Physiology and Biological Physics, University of Virginia, 1340 Jefferson Park Avenue, Charlottesville, VA USA-22908, USA, 3Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, 4Laboratory of Bioinformatics and Systems Biology, Centre of New Technologies, University of Warsaw, Zwirki i Wigury 93, PL-02-089 Warsaw, Poland, 5Institute of Biochemistry and Biophysics PAS, Pawinskiego 5A, PL-02-106 Warsaw, Poland and 6Laboratory of Protein Structure, International Institute of Molecular and Cell Biology, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland Received September 23, 2013; Revised December 12, 2013; Accepted December 26, 2013 ABSTRACT revealed a correlation between the orientation of Ribonuclease H-like (RNHL) superfamily, also called the C-terminal helix with the exonuclease/endo- the retroviral integrase superfamily, groups together nuclease function and the architecture of the numerous enzymes involved in nucleic acid metab- active site. Our analysis provides a comprehensive olism and implicated in many biological processes, picture of sequence-structure-function relation- including replication, homologous recombination, ships in the RNHL superfamily that may guide func- DNA repair, transposition and RNA interference.
    [Show full text]
  • Identification and Functional Analysis of Novel Genes Associated with Inherited Bone Marrow Failure Syndromes
    Identification and Functional Analysis of Novel Genes Associated with Inherited Bone Marrow Failure Syndromes by Anna Matveev A thesis submitted in conformity with the requirements for the degree of Master of Science Institute of Medical Science University of Toronto © Copyright by Anna Matveev 2020 Abstract Identification and Functional Analysis of Novel Genes Associated with Inherited Bone Marrow Failure Syndromes Anna Matveev Master of Science Institute of Medical Science University of Toronto 2020 Inherited bone marrow failure syndromes are multisystem-disorders that affect development of hematopoietic system. One of IBMFSs is Shwachman-Diamond-syndrome and about 80-90% of patients have mutations in the Shwachman-Bodian-Diamond-Syndrome gene. To unravel the genetic cause of the disease in the remaining 10-20% of patients, we performed WES as well as SNP-genotyping in families with SDS-phenotype and no mutations in SBDS. The results showed a region of homozygosity in chromosome 5p-arm DNAJC21 is in this region. Western blotting revealed reduced/null protein in patient. DNAJC21-homolog in yeast has been shown facilitating the release of the Arx1/Alb1 heterodimer from pre-60S.To investigate the cellular functions of DNAJC21 we knocked-down it in HEK293T-cells. We observed a high-level of ROS, which led to reduced cell proliferation. Our data indicate that mutations in DNAJC21 contribute to SDS. We hypothesize that DNAJC21 related ribosomal defects lead to increased levels of ROS therefore altering development and maturation of hematopoietic cells. ii Acknowledgments I would like to take this opportunity to extend my deepest gratitude to everyone who has helped me throughout my degree.
    [Show full text]
  • Gigantea: Uncovering New Functions in Flower Development
    G C A T T A C G G C A T genes Review Gigantea: Uncovering New Functions in Flower Development Claudio Brandoli 1, Cesar Petri 2 , Marcos Egea-Cortines 1 and Julia Weiss 1,* 1 Genética Molecular, Instituto de Biotecnología Vegetal, Edificio I+D+I, Plaza del Hospital s/n, Universidad Politécnica de Cartagena, 30202 Cartagena, Spain; [email protected] (C.B.); [email protected] (M.E.-C.) 2 Instituto de Hortofruticultura Subtropical y Mediterránea-UMA-CSIC, Departamento de Fruticultura Subtropical y Mediterránea, 29750 Algarrobo-costa, Málaga, Spain; [email protected] * Correspondence: [email protected]; Tel.: +34-868-071-078 Received: 27 August 2020; Accepted: 22 September 2020; Published: 28 September 2020 Abstract: GIGANTEA (GI) is a gene involved in multiple biological functions, which have been analysed and are partially conserved in a series of mono- and dicotyledonous plant species. The identified biological functions include control over the circadian rhythm, light signalling, cold tolerance, hormone signalling and photoperiodic flowering. The latter function is a central role of GI, as it involves a multitude of pathways, both dependent and independent of the gene CONSTANS(CO), as well as on the basis of interaction with miRNA. The complexity of the gene function of GI increases due to the existence of paralogs showing changes in genome structure as well as incidences of sub- and neofunctionalization. We present an updated report of the biological function of GI, integrating late insights into its role in floral initiation, flower development and volatile flower production. Keywords: Gene ontology; molecular function; cellular localisation; biological function; circadian clock; flowering time; flower development; floral scent 1.
    [Show full text]
  • S41467-019-10037-Y.Pdf
    ARTICLE https://doi.org/10.1038/s41467-019-10037-y OPEN Feedback inhibition of cAMP effector signaling by a chaperone-assisted ubiquitin system Laura Rinaldi1, Rossella Delle Donne1, Bruno Catalanotti 2, Omar Torres-Quesada3, Florian Enzler3, Federica Moraca 4, Robert Nisticò5, Francesco Chiuso1, Sonia Piccinin5, Verena Bachmann3, Herbert H Lindner6, Corrado Garbi1, Antonella Scorziello7, Nicola Antonino Russo8, Matthis Synofzik9, Ulrich Stelzl 10, Lucio Annunziato11, Eduard Stefan 3 & Antonio Feliciello1 1234567890():,; Activation of G-protein coupled receptors elevates cAMP levels promoting dissociation of protein kinase A (PKA) holoenzymes and release of catalytic subunits (PKAc). This results in PKAc-mediated phosphorylation of compartmentalized substrates that control central aspects of cell physiology. The mechanism of PKAc activation and signaling have been largely characterized. However, the modes of PKAc inactivation by regulated proteolysis were unknown. Here, we identify a regulatory mechanism that precisely tunes PKAc stability and downstream signaling. Following agonist stimulation, the recruitment of the chaperone- bound E3 ligase CHIP promotes ubiquitylation and proteolysis of PKAc, thus attenuating cAMP signaling. Genetic inactivation of CHIP or pharmacological inhibition of HSP70 enhances PKAc signaling and sustains hippocampal long-term potentiation. Interestingly, primary fibroblasts from autosomal recessive spinocerebellar ataxia 16 (SCAR16) patients carrying germline inactivating mutations of CHIP show a dramatic dysregulation of PKA signaling. This suggests the existence of a negative feedback mechanism for restricting hormonally controlled PKA activities. 1 Department of Molecular Medicine and Medical Biotechnologies, University Federico II, 80131 Naples, Italy. 2 Department of Pharmacy, University Federico II, 80131 Naples, Italy. 3 Institute of Biochemistry and Center for Molecular Biosciences, University of Innsbruck, A-6020 Innsbruck, Austria.
    [Show full text]
  • RNA–Protein Interaction Mapping Via MS2- Or Cas13-Based APEX Targeting
    RNA–protein interaction mapping via MS2- or Cas13-based APEX targeting Shuo Hana,b,c,1, Boxuan Simen Zhaoa,b,c,1, Samuel A. Myersd, Steven A. Carrd, Chuan Hee,f, and Alice Y. Tinga,b,c,2 aDepartment of Genetics, Chan Zuckerberg Biohub, Stanford University, Stanford, CA 94305; bDepartment of Biology, Chan Zuckerberg Biohub, Stanford University, Stanford, CA 94305; cDepartment of Chemistry, Chan Zuckerberg Biohub, Stanford University, Stanford, CA 94305; dThe Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA 02142; eDepartment of Chemistry, Institute for Biophysical Dynamics, Howard Hughes Medical Institute, University of Chicago, Chicago, IL 60637; and fDepartment of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, Howard Hughes Medical Institute, University of Chicago, Chicago, IL 60637 Edited by Robert H. Singer, Albert Einstein College of Medicine, Bronx, NY, and approved July 24, 2020 (received for review April 8, 2020) RNA–protein interactions underlie a wide range of cellular pro- and oncogenesis by serving as the template for reverse transcrip- cesses. Improved methods are needed to systematically map RNA– tion of telomeres (26, 27). While hTR’s interaction with the protein interactions in living cells in an unbiased manner. We used telomerase complex has been extensively characterized (28), hTR two approaches to target the engineered peroxidase APEX2 to is present in stoichiometric excess over telomerase in cancer cells specific cellular RNAs for RNA-centered proximity biotinylation of (29) and is broadly expressed in tissues lacking telomerase protein protein interaction partners. Both an MS2-MCP system and an (30). These observations suggest that hTR could also function engineered CRISPR-Cas13 system were used to deliver APEX2 to outside of the telomerase complex (31), and uncharacterized the human telomerase RNA hTR with high specificity.
    [Show full text]
  • Defining Trends in Global Gene Expression in Arabian Horses with Cerebellar Abiotrophy
    Cerebellum DOI 10.1007/s12311-016-0823-8 ORIGINAL PAPER Defining Trends in Global Gene Expression in Arabian Horses with Cerebellar Abiotrophy E. Y. Scott1 & M. C. T. Penedo2,3 & J. D. Murray1,3 & C. J. Finno3 # Springer Science+Business Media New York 2016 Abstract Equine cerebellar abiotrophy (CA) is a hereditary expression (DE) analysis were used (Tophat2/Cuffdiff2, neurodegenerative disease that affects the Purkinje neurons of Kallisto/EdgeR, and Kallisto/Sleuth) with 151 significant the cerebellum and causes ataxia in Arabian foals. Signs of DE genes identified by all three pipelines in CA-affected CA are typically first recognized either at birth to any time up horses. TOE1 (Log2(foldchange) = 0.92, p =0.66)and to 6 months of age. CA is inherited as an autosomal recessive MUTYH (Log2(foldchange) = 1.13, p = 0.66) were not dif- trait and is associated with a single nucleotide polymorphism ferentially expressed. Among the major pathways that were (SNP) on equine chromosome 2 (13074277G>A), located in differentially expressed, genes associated with calcium ho- the fourth exon of TOE1 and in proximity to MUTYH on the meostasis and specifically expressed in Purkinje neurons, antisense strand. We hypothesize that unraveling the func- CALB1 (Log2(foldchange) = −1.7, p < 0.01) and CA8 tional consequences of the CA SNP using RNA-seq will elu- (Log2(foldchange) = −0.97, p < 0.01), were significantly cidate the molecular pathways underlying the CA phenotype. down-regulated, confirming loss of Purkinje neurons. There RNA-seq (100 bp PE strand-specific) was performed in cer- was also a significant up-regulation of markers for microglial ebellar tissue from four CA-affected and five age-matched phagocytosis, TYROBP (Log2(foldchange) = 1.99, p <0.01) unaffected horses.
    [Show full text]
  • Wholegenome Screening Identifies Proteins Localized to Distinct Nuclear Bodies
    JCB: Tools Whole-genome screening identifies proteins localized to distinct nuclear bodies Ka-wing Fong,1 Yujing Li,2,3 Wenqi Wang,1 Wenbin Ma,2,3 Kunpeng Li,2 Robert Z. Qi,4 Dan Liu,5 Zhou Songyang,2,3,5 and Junjie Chen1 1Department of Experimental Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030 2Key Laboratory of Gene Engineering of Ministry of Education and 3State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China 4State Key Laboratory of Molecular Neuroscience, Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong, China 5The Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030 he nucleus is a unique organelle that contains essen- 325 proteins localized to distinct nuclear bodies, including tial genetic materials in chromosome territories. The nucleoli (148), promyelocytic leukemia nuclear bodies (38), Tinterchromatin space is composed of nuclear sub- nuclear speckles (27), paraspeckles (24), Cajal bodies compartments, which are defined by several distinctive (17), Sam68 nuclear bodies (5), Polycomb bodies (2), nuclear bodies believed to be factories of DNA or RNA and uncharacterized nuclear bodies (64). Functional vali- processing and sites of transcriptional and/or posttranscrip- dation revealed several proteins potentially involved in tional regulation. In this paper, we performed a genome- the assembly of Cajal bodies and paraspeckles. Together, wide microscopy-based screening for proteins that form these data establish the first atlas of human proteins in nuclear foci and characterized their localizations using different nuclear bodies and provide key information for markers of known nuclear bodies.
    [Show full text]
  • 393LN V 393P 344SQ V 393P Probe Set Entrez Gene
    393LN v 393P 344SQ v 393P Entrez fold fold probe set Gene Gene Symbol Gene cluster Gene Title p-value change p-value change chemokine (C-C motif) ligand 21b /// chemokine (C-C motif) ligand 21a /// chemokine (C-C motif) ligand 21c 1419426_s_at 18829 /// Ccl21b /// Ccl2 1 - up 393 LN only (leucine) 0.0047 9.199837 0.45212 6.847887 nuclear factor of activated T-cells, cytoplasmic, calcineurin- 1447085_s_at 18018 Nfatc1 1 - up 393 LN only dependent 1 0.009048 12.065 0.13718 4.81 RIKEN cDNA 1453647_at 78668 9530059J11Rik1 - up 393 LN only 9530059J11 gene 0.002208 5.482897 0.27642 3.45171 transient receptor potential cation channel, subfamily 1457164_at 277328 Trpa1 1 - up 393 LN only A, member 1 0.000111 9.180344 0.01771 3.048114 regulating synaptic membrane 1422809_at 116838 Rims2 1 - up 393 LN only exocytosis 2 0.001891 8.560424 0.13159 2.980501 glial cell line derived neurotrophic factor family receptor alpha 1433716_x_at 14586 Gfra2 1 - up 393 LN only 2 0.006868 30.88736 0.01066 2.811211 1446936_at --- --- 1 - up 393 LN only --- 0.007695 6.373955 0.11733 2.480287 zinc finger protein 1438742_at 320683 Zfp629 1 - up 393 LN only 629 0.002644 5.231855 0.38124 2.377016 phospholipase A2, 1426019_at 18786 Plaa 1 - up 393 LN only activating protein 0.008657 6.2364 0.12336 2.262117 1445314_at 14009 Etv1 1 - up 393 LN only ets variant gene 1 0.007224 3.643646 0.36434 2.01989 ciliary rootlet coiled- 1427338_at 230872 Crocc 1 - up 393 LN only coil, rootletin 0.002482 7.783242 0.49977 1.794171 expressed sequence 1436585_at 99463 BB182297 1 - up 393
    [Show full text]
  • Gene Expression Profiling Reveals a Signaling Role of Glutathione in Redox Regulation
    Gene expression profiling reveals a signaling role of glutathione in redox regulation Maddalena Fratelli*, Leslie O. Goodwin†, Ulf Andersson Ørom*‡, Sarah Lombardi†, Rossella Tonelli*, Manuela Mengozzi*, and Pietro Ghezzi*§ *Laboratory of Neuroimmunology ‘‘Mario Negri,’’ Institute for Pharmacological Research, 20157 Milan, Italy; and †North Shore–Long Island Jewish Research Institute, Manhasset, NY 11030 Edited by Charles A. Dinarello, University of Colorado Health Sciences Center, Denver, CO, and approved July 22, 2005 (received for review May 27, 2005) Proteins can form reversible mixed disulfides with glutathione reversible oxidations to form disulfides and glutathionylated (GSH). It has been hypothesized that protein glutathionylation may proteins (2–5). The reversibility of protein glutathionylation, represent a mechanism of redox regulation, in a fashion similar to catalyzed by glutaredoxin and, to a lesser extent, thioredoxin (6), that mediated by protein phosphorylation. We investigated makes this posttranslational modification a likely molecular whether GSH has a signaling role in the response of HL60 cells to mechanism by which GSH could act as a redox-dependent hydrogen peroxide (H2O2), in addition to its obvious antioxidant signaling molecule, in analogy with protein phosphorylation. role. We identified early changes in gene expression induced at However, despite the identification of several proteins under- different times by H2O2 treatment, under conditions that increase going glutathionylation (5, 7), in our opinion a signaling role of protein glutathionylation and minimal toxicity. We then investi- glutathionylation is far from being established. gated the effect of prior GSH depletion by buthionine sulfoximine Because glutathionylation is a redox-dependent modification, and diethylmaleate on this response. The analysis revealed 2,016 we reasoned that it might participate in signaling the exposure genes regulated by H2O2.
    [Show full text]
  • The Neurodegenerative Diseases ALS and SMA Are Linked at The
    Nucleic Acids Research, 2019 1 doi: 10.1093/nar/gky1093 The neurodegenerative diseases ALS and SMA are linked at the molecular level via the ASC-1 complex Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1093/5162471 by [email protected] on 06 November 2018 Binkai Chi, Jeremy D. O’Connell, Alexander D. Iocolano, Jordan A. Coady, Yong Yu, Jaya Gangopadhyay, Steven P. Gygi and Robin Reed* Department of Cell Biology, Harvard Medical School, 240 Longwood Ave. Boston MA 02115, USA Received July 17, 2018; Revised October 16, 2018; Editorial Decision October 18, 2018; Accepted October 19, 2018 ABSTRACT Fused in Sarcoma (FUS) and TAR DNA Binding Protein (TARDBP) (9–13). FUS is one of the three members of Understanding the molecular pathways disrupted in the structurally related FET (FUS, EWSR1 and TAF15) motor neuron diseases is urgently needed. Here, we family of RNA/DNA binding proteins (14). In addition to employed CRISPR knockout (KO) to investigate the the RNA/DNA binding domains, the FET proteins also functions of four ALS-causative RNA/DNA binding contain low-complexity domains, and these domains are proteins (FUS, EWSR1, TAF15 and MATR3) within the thought to be involved in ALS pathogenesis (5,15). In light RNAP II/U1 snRNP machinery. We found that each of of the discovery that mutations in FUS are ALS-causative, these structurally related proteins has distinct roles several groups carried out studies to determine whether the with FUS KO resulting in loss of U1 snRNP and the other two members of the FET family, TATA-Box Bind- SMN complex, EWSR1 KO causing dissociation of ing Protein Associated Factor 15 (TAF15) and EWS RNA the tRNA ligase complex, and TAF15 KO resulting in Binding Protein 1 (EWSR1), have a role in ALS.
    [Show full text]
  • HLT/NAACL 2004 Template
    BioLINK 2004: Linking Biological Literature, Ontologies, and Databases, pp. 69-76. Association for Computational Linguistics Using Natural Language Processing, Locus Link, and the Gene Ontology to Compare OMIM to MEDLINE Bisharah Halil Thomas C. James G. Alan R. Libbus Kilicoglu Rindflesch Mork Aronson Lister Hill National Center for Biomedical Communications National Library of Medicine Bethesda, Maryland, 20894 {libbus|halil|tcr|mork|alan}@nlm.nih.gov database can be searched by gene symbol, chromosomal Abstract location, or disorder. More recently, automated techniques for information Researchers in the biomedical and molecular and knowledge extraction from the literature are being biology fields are faced with a wide variety of developed to complement scientific reviews. These information sources. These are presented in methods address the need to condense and efficiently the form of images, free text, and structured present large amounts of data to the user. The feasibil- data files that include medical records, gene ity of applying natural language processing techniques and protein sequence data, and whole genome to the biomedical literature (Friedman and Hripcsak microarray data, all gathered from a variety of 1999; de Bruijn and Martin 2002) and to the wealth of experimental organisms and clinical subjects. genomics data now available (Jenssen et al. 2001; Yan- The need to organize and relate this informa- dell and Majoros 2002) is increasingly being recog- tion, particularly concerning genes, has moti- nized. Efforts to develop systems that work toward this vated the development of resources, such as goal focus on the identification of such items as gene the Unified Medical Language System, Gene and protein names (Tanabe and Wilbur 2002) or groups Ontology, LocusLink, and the Online Inheri- of genes with similar function (Jenssen et al.
    [Show full text]
  • On the Origin of Pontocerebellar Hypoplasia: Finding Genes for a Rare Disease
    UvA-DARE (Digital Academic Repository) On the origin of pontocerebellar hypoplasia: Finding genes for a rare disease Eggens, V.R.C. Publication date 2016 Document Version Final published version Link to publication Citation for published version (APA): Eggens, V. R. C. (2016). On the origin of pontocerebellar hypoplasia: Finding genes for a rare disease. General rights It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons). Disclaimer/Complaints regulations If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible. UvA-DARE is a service provided by the library of the University of Amsterdam (https://dare.uva.nl) Download date:06 Oct 2021 Al ben ik meer dan eens de weg kwijt En rijd ik hele einden om Ik ben op weg om je te vinden Dus wees gerust, ik kom - Boudewijn de Groot TOE1 mutatiOns cause pontOcerebellar hypoplasia and disOrders Of sex deVelOPment Veerle rC eggens, david Chitayat, Hülya Kayserili,
    [Show full text]