The Use of Next Generation Sequencing Technologies to Dissect the Aetiologies of Neurodegenerative Diseases

Total Page:16

File Type:pdf, Size:1020Kb

The Use of Next Generation Sequencing Technologies to Dissect the Aetiologies of Neurodegenerative Diseases May 2013 The use of next generation sequencing technologies to dissect the aetiologies of neurodegenerative diseases Arianna Tucci, MD This thesis is submitted to the University College of London for the degree of Doctor of Philosophy 1 Declaration I, Arianna Tucci, hereby declare that the work presented in this thesis was performed by myself. Exceptions to this are indicated elsewhere in the thesis. 2 A Martino “Meglio aggiungere vita ai giorni che non giorni alla vita.” Rita Levi Montalcini 3 Abstract Advances in next generation sequencing technologies have brought a paradigm shift in how medical researchers investigate human disorders. Whole exome sequencing (WES) allows to comprehensively study, in a single experiment, coding variations of a human genome. My PhD thesis focuses on the use of WES to dissect genetic aetiologies of neurodegenerative diseases. First, I present a pilot study where I prove the feasibility of this technology and I test the methods used in the following projects. I then describe the use of WES in rare Mendelian disorders: i) in patients with Kohlschütter-Tönz Syndrome, where WES has failed to identify the disease gene, because the mutation is in a non-coding region; ii) in a family with Charcot-Marie- Tooth type 6, where coupled with linkage data, WES rapidly identified the mutation causing the disease. Last, I present two projects where next generation sequencing plays an important role in elucidating the role of variants present in two loci (PARK16 and EIF4G1) in Parkinson’s Disease. 4 Acknowledgements The work presented here was made possible by the people that surround, or have surrounded myself in the past. I am deeply grateful to my supervisor John Hardy, for his exceptional mentorship and constant support. Particular thanks to Prof. Henry Houlden for his support, Dr Vincent Plagnol for his advice, Dr Coro Paisán-Ruiz for her patience teaching me the basis of genetic research, Prof. Nick Wood for his support and advice, Dr Elia Stupka and Francesco Lescai for introducing myself to bioinformatics. Thanks to the members of the laboratory of neurogenetics at Institute of Neurology; Anna Sailer, Boniface Mok, Reema Paudel, Nuria Seto, Lucia Schottlaender, Eleanna Kara, Niccolo’ Mencacci, Una Sheerin, Alan Pittman, Mina Ryten, Daniah Trabzuni, Raquel Duran, Selina Wray, Lee Stanyer, June Smalley, Jana Vandrovcova, Dalia Kasperaviciute, Rohan de Silva and Roberto Simone. Thanks to the members of the neurogenetics lab at NIH: Dr Andy Singleton, Dena Hernandez, Dr Mike Nalls and especially Celeste Sassi. I am also very grateful to the people who first introduced myself to research, from the Laboratory of Genetics and Biochemistry in Milan: Stefania Corti, Monica Nizzardo and Giacomo Comi. Thanks to my lifetime friends sorelle Rugis Stefania e Ale, Giulia. Special thanks to Viola, Silvia and Licia; to my university fellows Alberto, Sara, Mimi’, Chiarina, SDM, Patoz, Luca, Leo, Tommi. Thanks to M’and the pizzica, Salvatore and Marta Wolek. Thanks to my family: my late grandma Nonna Giovanna, Zia Margherita. Thanks to the wonderful Emilia and Luigi, to Laura e Pietro and the little cousins Elena and Alessandro. Thanks to my parents for their encouragement, help and support. Thanks to the person that knows me better than I know myself, Fiora. Thanks to the astonishing beauty of Manarola and Tellaro. 5 Thanks to Martino and the brand new Riccardo, thanks for coming to this world, my little ones. And thanks to Pietro, for being always there – with a smile. 6 Table of contents 1 INTRODUCTION ................................................................................................... 17 1.1 NEURODEGENERATIVE DISEASES: IMPORTANCE OF GENETIC ANALYSIS .... 17 1.2 GENETIC RESEARCH OVERVIEW .................................................................... 18 1.2.1 Genetic analysis using high throughput SNP genotyping ...................... 21 1.3 WHOLE EXOME SEQUENCING ....................................................................... 28 1.3.1 Overview: next generation sequencing technologies ........................... 28 1.3.2 Whole exome sequencing: premises ..................................................... 30 1.3.3 The technology ....................................................................................... 31 1.3.4 How to identify the causal variants in Mendelian disorders ................. 35 1.3.5 How is WES changing neurogenetics ..................................................... 39 1.3.6 Limitations and challenges ..................................................................... 42 1.4 GENETICS OF SELECTED NEURODEGENERATIVE DISEASES .......................... 44 1.4.1 Genetics of Parkinson’s Disease ............................................................ 44 1.4.2 Genetics of Kohlschütter-Tönz syndrome ............................................. 51 1.4.3 Genetics of complex neuropathies ........................................................ 57 2 WHOLE EXOME SEQUENCING PILOT PROJECT ................................................... 61 2.1 STATEMENT OF CONTRIBUTION TO THIS RESEARCH ................................... 61 2.2 BACKGROUND ............................................................................................... 61 2.3 MATERIALS AND METHODS .......................................................................... 61 2.4 DNA samples selection .................................................................................. 61 2.4.1 Library preparation ................................................................................ 62 2.4.2 Sequencing ............................................................................................. 66 2.4.3 Bioinformatics ........................................................................................ 67 2.5 RESULTS ......................................................................................................... 77 2.5.1 Coupling NimbleGen SeqCap EZ to Illumina sequencing ....................... 77 2.5.2 Data analysis .......................................................................................... 85 2.6 CONCLUSIONS ............................................................................................... 91 7 3 KOHLSCHÜTTER-TÖNZ SYNDROME: ROGDI MUTATIONS AND GENETIC HETEROGENEITY ........................................................................................................ 92 3.1 STATEMENT OF CONTRIBUTION TO THIS RESEARCH ................................... 92 3.2 BACKGROUND ............................................................................................... 92 3.3 MATERIALS AND METHODS .......................................................................... 92 3.3.1 Samples .................................................................................................. 92 3.3.2 Genetic investigations ............................................................................ 98 3.4 RESULTS ....................................................................................................... 100 3.5 DISCUSSION ................................................................................................. 110 4 C12orf65 MUTATIONS CAUSE AXONAL NEUROPATHY WITH OPTIC ATROPHY 115 4.1 STATEMENT OF CONTRIBUTION TO THIS RESEARCH ................................. 115 4.2 BACKGROUND ............................................................................................. 115 4.3 METHODS .................................................................................................... 116 4.3.1 Samples ................................................................................................ 116 4.3.2 Nerve biopsy ........................................................................................ 116 4.3.3 SNP genotyping and autozygosity mapping ........................................ 117 4.3.4 Mutation validation and screening in the additional cohorts ............. 118 4.3.5 Lymphoblast cells cultures ................................................................... 118 4.3.6 Transcript analyses ............................................................................... 118 4.3.7 Blue native in-gel complex V assay ...................................................... 120 4.3.8 Oxygen Consumption ........................................................................... 121 4.4 RESULTS ....................................................................................................... 121 4.4.1 Clinical details ...................................................................................... 121 4.4.2 Genetic analyses .................................................................................. 127 4.4.3 Mitochondrial impairment ................................................................... 133 4.4.4 Oxygen consumption ........................................................................... 133 4.5 DISCUSSION ................................................................................................. 136 5 A GENOME-WIDE ASSOCIATION STUDY FOLLOW UP: THE PARK16 LOCUS .... 139 5.1 STATEMENT OF CONTRIBUTION TO THIS RESEARCH ................................. 139 5.2 BACKGROUND ............................................................................................. 139 5.3 MATERIALS AND METHODS ........................................................................ 140 8 5.3.1 Samples ................................................................................................ 140 5.3.2
Recommended publications
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Mutational Landscape Differences Between Young-Onset and Older-Onset Breast Cancer Patients Nicole E
    Mealey et al. BMC Cancer (2020) 20:212 https://doi.org/10.1186/s12885-020-6684-z RESEARCH ARTICLE Open Access Mutational landscape differences between young-onset and older-onset breast cancer patients Nicole E. Mealey1 , Dylan E. O’Sullivan2 , Joy Pader3 , Yibing Ruan3 , Edwin Wang4 , May Lynn Quan1,5,6 and Darren R. Brenner1,3,5* Abstract Background: The incidence of breast cancer among young women (aged ≤40 years) has increased in North America and Europe. Fewer than 10% of cases among young women are attributable to inherited BRCA1 or BRCA2 mutations, suggesting an important role for somatic mutations. This study investigated genomic differences between young- and older-onset breast tumours. Methods: In this study we characterized the mutational landscape of 89 young-onset breast tumours (≤40 years) and examined differences with 949 older-onset tumours (> 40 years) using data from The Cancer Genome Atlas. We examined mutated genes, mutational load, and types of mutations. We used complementary R packages “deconstructSigs” and “SomaticSignatures” to extract mutational signatures. A recursively partitioned mixture model was used to identify whether combinations of mutational signatures were related to age of onset. Results: Older patients had a higher proportion of mutations in PIK3CA, CDH1, and MAP3K1 genes, while young- onset patients had a higher proportion of mutations in GATA3 and CTNNB1. Mutational load was lower for young- onset tumours, and a higher proportion of these mutations were C > A mutations, but a lower proportion were C > T mutations compared to older-onset tumours. The most common mutational signatures identified in both age groups were signatures 1 and 3 from the COSMIC database.
    [Show full text]
  • Comparison of Orthologs Across Multiple Species by Various Strategies
    COMPARISON OF ORTHOLOGS ACROSS MULTIPLE SPECIES BY VARIOUS STRATEGIES BY HUI LIU DISSERTATION Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Biophysics and Computational Biology in the Graduate College of the University of Illinois at Urbana-Champaign, 2014 Urbana, Illinois Doctoral Committee: Professor Eric Jakobsson, Chair, Director of Research Professor Gene E. Robinson Associate Professor Saurabh Sinha Assistant Professor Jian Ma Abstract Thanks to the improvement of genome sequencing technology, abundant multi-species genomic data now became available and comparative genomics continues to be a fast prospering filed of biological research. Through the comparison of genomes of different organisms, we can understand what, at the molecular level, distinguishes different life forms from each other. It shed light on revealing the evolution of biology. And it also helps to refine the annotations and functions of individual genomes. For example, through comparisons across mammalian genomes, we can give an estimate of the conserved set of genes across mammals and correspondingly, find the species-specific sets of genes or functions. However, comparative genomics can be feasible only if a meaningful classification of genes exists. A natural way to do so is to delineate sets of orthologous genes. However, debates exist about the appropriate way to define orthologs. It is originally defined as genes in different species which derive from speciation events. But such definition is not sufficient to derive orthologous genes due to the complexity of evolutionary events such as gene duplication and gene loss. While it is possible to correctly figure out all the evolutionary events with the true phylogenetic tree, the true phylogenetic tree itself is impractical to be inferred.
    [Show full text]
  • Análise Integrativa De Perfis Transcricionais De Pacientes Com
    UNIVERSIDADE DE SÃO PAULO FACULDADE DE MEDICINA DE RIBEIRÃO PRETO PROGRAMA DE PÓS-GRADUAÇÃO EM GENÉTICA ADRIANE FEIJÓ EVANGELISTA Análise integrativa de perfis transcricionais de pacientes com diabetes mellitus tipo 1, tipo 2 e gestacional, comparando-os com manifestações demográficas, clínicas, laboratoriais, fisiopatológicas e terapêuticas Ribeirão Preto – 2012 ADRIANE FEIJÓ EVANGELISTA Análise integrativa de perfis transcricionais de pacientes com diabetes mellitus tipo 1, tipo 2 e gestacional, comparando-os com manifestações demográficas, clínicas, laboratoriais, fisiopatológicas e terapêuticas Tese apresentada à Faculdade de Medicina de Ribeirão Preto da Universidade de São Paulo para obtenção do título de Doutor em Ciências. Área de Concentração: Genética Orientador: Prof. Dr. Eduardo Antonio Donadi Co-orientador: Prof. Dr. Geraldo A. S. Passos Ribeirão Preto – 2012 AUTORIZO A REPRODUÇÃO E DIVULGAÇÃO TOTAL OU PARCIAL DESTE TRABALHO, POR QUALQUER MEIO CONVENCIONAL OU ELETRÔNICO, PARA FINS DE ESTUDO E PESQUISA, DESDE QUE CITADA A FONTE. FICHA CATALOGRÁFICA Evangelista, Adriane Feijó Análise integrativa de perfis transcricionais de pacientes com diabetes mellitus tipo 1, tipo 2 e gestacional, comparando-os com manifestações demográficas, clínicas, laboratoriais, fisiopatológicas e terapêuticas. Ribeirão Preto, 2012 192p. Tese de Doutorado apresentada à Faculdade de Medicina de Ribeirão Preto da Universidade de São Paulo. Área de Concentração: Genética. Orientador: Donadi, Eduardo Antonio Co-orientador: Passos, Geraldo A. 1. Expressão gênica – microarrays 2. Análise bioinformática por module maps 3. Diabetes mellitus tipo 1 4. Diabetes mellitus tipo 2 5. Diabetes mellitus gestacional FOLHA DE APROVAÇÃO ADRIANE FEIJÓ EVANGELISTA Análise integrativa de perfis transcricionais de pacientes com diabetes mellitus tipo 1, tipo 2 e gestacional, comparando-os com manifestações demográficas, clínicas, laboratoriais, fisiopatológicas e terapêuticas.
    [Show full text]
  • Supplementary Materials
    Supplementary materials Supplementary Table S1: MGNC compound library Ingredien Molecule Caco- Mol ID MW AlogP OB (%) BBB DL FASA- HL t Name Name 2 shengdi MOL012254 campesterol 400.8 7.63 37.58 1.34 0.98 0.7 0.21 20.2 shengdi MOL000519 coniferin 314.4 3.16 31.11 0.42 -0.2 0.3 0.27 74.6 beta- shengdi MOL000359 414.8 8.08 36.91 1.32 0.99 0.8 0.23 20.2 sitosterol pachymic shengdi MOL000289 528.9 6.54 33.63 0.1 -0.6 0.8 0 9.27 acid Poricoic acid shengdi MOL000291 484.7 5.64 30.52 -0.08 -0.9 0.8 0 8.67 B Chrysanthem shengdi MOL004492 585 8.24 38.72 0.51 -1 0.6 0.3 17.5 axanthin 20- shengdi MOL011455 Hexadecano 418.6 1.91 32.7 -0.24 -0.4 0.7 0.29 104 ylingenol huanglian MOL001454 berberine 336.4 3.45 36.86 1.24 0.57 0.8 0.19 6.57 huanglian MOL013352 Obacunone 454.6 2.68 43.29 0.01 -0.4 0.8 0.31 -13 huanglian MOL002894 berberrubine 322.4 3.2 35.74 1.07 0.17 0.7 0.24 6.46 huanglian MOL002897 epiberberine 336.4 3.45 43.09 1.17 0.4 0.8 0.19 6.1 huanglian MOL002903 (R)-Canadine 339.4 3.4 55.37 1.04 0.57 0.8 0.2 6.41 huanglian MOL002904 Berlambine 351.4 2.49 36.68 0.97 0.17 0.8 0.28 7.33 Corchorosid huanglian MOL002907 404.6 1.34 105 -0.91 -1.3 0.8 0.29 6.68 e A_qt Magnogrand huanglian MOL000622 266.4 1.18 63.71 0.02 -0.2 0.2 0.3 3.17 iolide huanglian MOL000762 Palmidin A 510.5 4.52 35.36 -0.38 -1.5 0.7 0.39 33.2 huanglian MOL000785 palmatine 352.4 3.65 64.6 1.33 0.37 0.7 0.13 2.25 huanglian MOL000098 quercetin 302.3 1.5 46.43 0.05 -0.8 0.3 0.38 14.4 huanglian MOL001458 coptisine 320.3 3.25 30.67 1.21 0.32 0.9 0.26 9.33 huanglian MOL002668 Worenine
    [Show full text]
  • The Function and Evolution of C2H2 Zinc Finger Proteins and Transposons
    The function and evolution of C2H2 zinc finger proteins and transposons by Laura Francesca Campitelli A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Department of Molecular Genetics University of Toronto © Copyright by Laura Francesca Campitelli 2020 The function and evolution of C2H2 zinc finger proteins and transposons Laura Francesca Campitelli Doctor of Philosophy Department of Molecular Genetics University of Toronto 2020 Abstract Transcription factors (TFs) confer specificity to transcriptional regulation by binding specific DNA sequences and ultimately affecting the ability of RNA polymerase to transcribe a locus. The C2H2 zinc finger proteins (C2H2 ZFPs) are a TF class with the unique ability to diversify their DNA-binding specificities in a short evolutionary time. C2H2 ZFPs comprise the largest class of TFs in Mammalian genomes, including nearly half of all Human TFs (747/1,639). Positive selection on the DNA-binding specificities of C2H2 ZFPs is explained by an evolutionary arms race with endogenous retroelements (EREs; copy-and-paste transposable elements), where the C2H2 ZFPs containing a KRAB repressor domain (KZFPs; 344/747 Human C2H2 ZFPs) are thought to diversify to bind new EREs and repress deleterious transposition events. However, evidence of the gain and loss of KZFP binding sites on the ERE sequence is sparse due to poor resolution of ERE sequence evolution, despite the recent publication of binding preferences for 242/344 Human KZFPs. The goal of my doctoral work has been to characterize the Human C2H2 ZFPs, with specific interest in their evolutionary history, functional diversity, and coevolution with LINE EREs.
    [Show full text]
  • CCAR1 Polyclonal Antibody
    PRODUCT DATA SHEET Bioworld Technology,Inc. CCAR1 polyclonal antibody Catalog: BS8140 Host: Rabbit Reactivity: Human BackGround: by affinity-chromatography using epitope-specific im- CARP-1 (cell division cycle and apoptosis regulator 1), munogen and the purity is > 95% (by SDS-PAGE). also known as CCAR1 or DIS, is a 1,150 amino acid pro- Applications: tein that localizes to the perinuclear region of the cyto- WB 1:500 - 1:2000 plasm and contains one SAP domain. Expressed in sever- IF 1:50 - 1:200 al epithelial cancer cell lines, including breast, colon, IP 1:50 - 1:100 prostate and leukemia, CARP-1 is involved in apoptotic Storage&Stability: signaling, as well as in cell cycle progression and cell Store at 4°C short term. Aliquot and store at -20°C long proliferation via interaction with c-Myc and cyclin B1. term. Avoid freeze-thaw cycles. CARP-1 is subject to DNA damage-induced phosphory- Specificity: lation, probably by ATM or ATR. The gene encoding CCAR1 polyclonal antibody detects endogenous levels of CARP-1 maps to human chromosome 10, which houses CCAR1 protein. over 1,200 genes and comprises nearly 4.5% of the hu- DATA: man genome. Defects in some of the genes that map to chromosome 10 are associated with Charcot-Marie Tooth disease, Jackson-Weiss syndrome, Usher syndrome, non- syndromatic deafness, Wolman’s syndrome, Cowden syndrome, multiple endocrine neoplasia type 2 and por- phyria. Product: Rabbit IgG, 1mg/ml in PBS with 0.02% sodium azide, WesternBlot (WB) analysis of CCAR1 polyclonal antibody 50% glycerol, pH7.2 Note: Molecular Weight: For research use only, not for use in diagnostic procedure.
    [Show full text]
  • Anti-MEF2A Phospho (Ser408) Antibody (ARG51784)
    Product datasheet [email protected] ARG51784 Package: 100 μl, 50 μl anti-MEF2A phospho (Ser408) antibody Store at: -20°C Summary Product Description Rabbit Polyclonal antibody recognizes MEF2A phospho (Ser408) Tested Reactivity Hu, Ms, Rat Tested Application IHC-P Host Rabbit Clonality Polyclonal Isotype IgG Target Name MEF2A Antigen Species Human Immunogen Peptide sequence around phosphorylation site of serine 408 (P-I-S(p)-P-P) derived from Human MEF2A. Conjugation Un-conjugated Alternate Names RSRFC4; RSRFC9; ADCAD1; mef2; Myocyte-specific enhancer factor 2A; Serum response factor-like protein 1 Application Instructions Application table Application Dilution IHC-P 1:50 - 1:100 Application Note * The dilutions indicate recommended starting dilutions and the optimal dilutions or concentrations should be determined by the scientist. Calculated Mw 55 kDa Properties Form Liquid Purification Antibodies were produced by immunizing rabbits with KLH-conjugated synthetic phosphopeptide. Antibodies were purified by affinity-chromatography using epitope-specific phosphopeptide. In addition, non-phospho specific antibodies were removed by chromatogramphy using non- phosphopeptide. Buffer PBS (without Mg2+ and Ca2+, pH 7.4), 150mM NaCl, 0.02% Sodium azide and 50% Glycerol. Preservative 0.02% Sodium azide Stabilizer 50% Glycerol Concentration 1 mg/ml Storage instruction For continuous use, store undiluted antibody at 2-8°C for up to a week. For long-term storage, aliquot and store at -20°C. Storage in frost free freezers is not recommended. Avoid repeated freeze/thaw cycles. Suggest spin the vial prior to opening. The antibody solution should be gently mixed before use. www.arigobio.com 1/2 Note For laboratory research only, not for drug, diagnostic or other use.
    [Show full text]
  • 35Th International Society for Animal Genetics Conference 7
    35th INTERNATIONAL SOCIETY FOR ANIMAL GENETICS CONFERENCE 7. 23.16 – 7.27. 2016 Salt Lake City, Utah ABSTRACT BOOK https://www.asas.org/meetings/isag2016 INVITED SPEAKERS S0100 – S0124 https://www.asas.org/meetings/isag2016 epigenetic modifications, such as DNA methylation, and measuring different proteins and cellular metab- INVITED SPEAKERS: FUNCTIONAL olites. These advancements provide unprecedented ANNOTATION OF ANIMAL opportunities to uncover the genetic architecture GENOMES (FAANG) ASAS-ISAG underlying phenotypic variation. In this context, the JOINT SYMPOSIUM main challenge is to decipher the flow of biological information that lies between the genotypes and phe- notypes under study. In other words, the new challenge S0100 Important lessons from complex genomes. is to integrate multiple sources of molecular infor- T. R. Gingeras* (Cold Spring Harbor Laboratory, mation (i.e., multiple layers of omics data to reveal Functional Genomics, Cold Spring Harbor, NY) the causal biological networks that underlie complex traits). It is important to note that knowledge regarding The ~3 billion base pairs of the human DNA rep- causal relationships among genes and phenotypes can resent a storage devise encoding information for be used to predict the behavior of complex systems, as hundreds of thousands of processes that can go on well as optimize management practices and selection within and outside a human cell. This information is strategies. Here, we describe a multi-step procedure revealed in the RNAs that are composed of 12 billion for inferring causal gene-phenotype networks underly- nucleotides, considering the strandedness and allelic ing complex phenotypes integrating multi-omics data. content of each of the diploid copies of the genome.
    [Show full text]
  • Ultraconserved Elements Are Associated with Homeostatic Control of Splicing Regulators by Alternative Splicing and Nonsense-Mediated Decay
    Downloaded from genesdev.cshlp.org on September 24, 2021 - Published by Cold Spring Harbor Laboratory Press Ultraconserved elements are associated with homeostatic control of splicing regulators by alternative splicing and nonsense-mediated decay Julie Z. Ni,1 Leslie Grate,1 John Paul Donohue,1 Christine Preston,2 Naomi Nobida,2 Georgeann O’Brien,2 Lily Shiue,1 Tyson A. Clark,3 John E. Blume,3 and Manuel Ares Jr.1,2,4 1Center for Molecular Biology of RNA and Department of Molecular, Cell, and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA; 2Hughes Undergraduate Research Laboratory, University of California at Santa Cruz, Santa Cruz, California 95064, USA; 3Affymetrix, Inc., Santa Clara, California 95051, USA Many alternative splicing events create RNAs with premature stop codons, suggesting that alternative splicing coupled with nonsense-mediated decay (AS-NMD) may regulate gene expression post-transcriptionally. We tested this idea in mice by blocking NMD and measuring changes in isoform representation using splicing-sensitive microarrays. We found a striking class of highly conserved stop codon-containing exons whose inclusion renders the transcript sensitive to NMD. A genomic search for additional examples identified >50 such exons in genes with a variety of functions. These exons are unusually frequent in genes that encode splicing activators and are unexpectedly enriched in the so-called “ultraconserved” elements in the mammalian lineage. Further analysis show that NMD of mRNAs for splicing activators such as SR proteins is triggered by splicing activation events, whereas NMD of the mRNAs for negatively acting hnRNP proteins is triggered by splicing repression, a polarity consistent with widespread homeostatic control of splicing regulator gene expression.
    [Show full text]
  • Gene Regulatory Factors in the Evolutionary History of Humans
    Gene regulatory factors in the evolutionary history of humans Von der Fakultät für Mathematik und Informatik der Universität Leipzig angenommene Dissertation zur Erlangung des akademischen Grades DOCTOR RERUM NATURALIUM (Dr. rer. nat.) im Fachgebiet Informatik vorgelegt von MSc. Alvaro Perdomo-Sabogal geboren am 08. Januar 1979 in Armenia, Quindio (Kolumbien) Die Annahme der Dissertation wurde empfohlen von: 1. Prof. Dr. Peter F. Stadler, Institut für Informatik, Leipzig 2. Prof. Dr. Andrew Torda, Zentrum für Bioinformatik, Hamburg Die Verleihung des akademischen Grades erfolgt mit Bestehen der Verteidigung am 24. August 2016 mit dem Gesamtprädikat “magna cum laude" Abstract Changes in cis‐ and trans‐regulatory elements are among the prime sources of genetic and phenotypical variation at species level. The introduction of cis‐ and trans regulatory variation, as evolutionary processes, has played important roles in driving evolution, diversity and phenotypical differentiation in humans. Therefore, exploring and identifying variation that occurs on cis‐ and trans‐ regulatory elements becomes imperative to better understanding of human evolution and its genetic diversity. In this research, around 3360 gene regulatory factors in the human genome were catalogued. This catalog includes genes that code for proteins that perform gene regulatory activities such DNA‐depending transcription, RNA polymerase II transcription cofactor and co‐repressor activity, chromatin binding, and remodeling, among other 218 gene ontology terms. Using the classification of DNA‐ binding GRFs (Wingender et al. 2015), we were able to group 1521 GRF genes (~46%) into 41 different GRF classes. This GRF catalog allowed us to initially explore and discuss how some GRF genes have evolved in humans, archaic humans (Neandertal and Denisovan) and non‐human primates species.
    [Show full text]
  • Mrna Editing, Processing and Quality Control in Caenorhabditis Elegans
    | WORMBOOK mRNA Editing, Processing and Quality Control in Caenorhabditis elegans Joshua A. Arribere,*,1 Hidehito Kuroyanagi,†,1 and Heather A. Hundley‡,1 *Department of MCD Biology, UC Santa Cruz, California 95064, †Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan, and ‡Medical Sciences Program, Indiana University School of Medicine-Bloomington, Indiana 47405 ABSTRACT While DNA serves as the blueprint of life, the distinct functions of each cell are determined by the dynamic expression of genes from the static genome. The amount and specific sequences of RNAs expressed in a given cell involves a number of regulated processes including RNA synthesis (transcription), processing, splicing, modification, polyadenylation, stability, translation, and degradation. As errors during mRNA production can create gene products that are deleterious to the organism, quality control mechanisms exist to survey and remove errors in mRNA expression and processing. Here, we will provide an overview of mRNA processing and quality control mechanisms that occur in Caenorhabditis elegans, with a focus on those that occur on protein-coding genes after transcription initiation. In addition, we will describe the genetic and technical approaches that have allowed studies in C. elegans to reveal important mechanistic insight into these processes. KEYWORDS Caenorhabditis elegans; splicing; RNA editing; RNA modification; polyadenylation; quality control; WormBook TABLE OF CONTENTS Abstract 531 RNA Editing and Modification 533 Adenosine-to-inosine RNA editing 533 The C. elegans A-to-I editing machinery 534 RNA editing in space and time 535 ADARs regulate the levels and fates of endogenous dsRNA 537 Are other modifications present in C.
    [Show full text]