Sequence Based Characterization of Structural Variation in the Mouse Genome

Total Page:16

File Type:pdf, Size:1020Kb

Sequence Based Characterization of Structural Variation in the Mouse Genome Sequence based characterization of structural variation in the mouse genome Binnaz Yalcin1†, Kim Wong2†, Avigail Agam†1, Martin Goodson†1, Thomas M. Keane2, Leo Goodstadt1, Jérôme Nicod1, Amarjit Bhomra1, Polinka Hernandez-Pliego1, Helen Whitley1, James Cleak1, Rebekah Dutton1, Deborah Janowitz1, Richard Mott1, David J. Adams2,*, Jonathan Flint2,* 1The Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK, 2The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1HH, UK 3MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, UK. †Co-first authors *Correspondence to: Dr. David Adams Dr. Jonathan Flint Wellcome Trust Sanger Institute Wellcome Trust Centre for Human Genetics Hinxton, Cambs, CB10 1SA, UK Oxford, OX3 7BN, UK Ph: +44 (0) 1223 86862 Ph: +44 (0) 1865 287512 Fax: +44 (0) 1223 494919 Fax: +44 (0) 1865 287501 Email: [email protected] Email: [email protected] Abstract The importance of structural variants (SVs) in DNA as a cause of quantitative variation and as a contributor to disease is unknown, but without knowing how many SVs there are, and how they arise, it is difficult to discover what they do. Combining experimental with automated analyses of the mouse genome sequence, we identified 0.71M SVs at 0.28M sites in the genomes of thirteen classical and four wild-derived inbred mouse strains. The majority of SVs are less than 1 kilobase in size and 98% are deletions or insertions. The breakpoints of 58% were mapped to base pair resolution allowing us to infer that insertion of retrotransposons causes more than half of SVs. Yet, despite their prevalence, SVs are less likely than other sequence variants to cause gene-expression or quantitative phenotypic variation. We identified 24 SVs that disrupt coding exons, acting as rare variants of large effect on gene function. One third of the genes so affected have immunological functions. Our catalogue provides a starting point for the analysis of the most dynamic and complex regions of genomes from a genetically tractable model organism. 2 Introduction Structural variation is believed to be widespread in mammalian genomes1-5 and is an important cause of disease6-8, but just how abundant and important structural variants (SVs) are in shaping phenotypic variation remains unclear. Understanding what SVs do depends on understanding what they are, where they occur and how they arise: large SVs that keep recurring and coincide with genes are far more likely to contribute to phenotypic variation than small non-recurrent SVs within intergenic regions. The preeminent organism for modeling the relationship between phenotype and genotype, including SVs, is the mouse, but our catalogue of SVs in this animal is incomplete. Estimates of SV numbers and the proportion of the mouse genome they occupy, vary considerably, from figures of a few hundred to over 7,0009-13, affecting from 3.2% to more than 10% of the genome14,15. Incompleteness and inconsistencies are largely due to reliance on differential hybridization of genomic DNA to oligonucleotide arrays16, a technology blind to some SV categories (such as inversions and insertions) and only limited ability to detect others (segmental duplications and transposable elements). Sequence based methods of SV detection, with higher resolution and greater sensitivity, have so far had limited application12,17. Along with SV catalogues, we need to know how SVs arise, as this will tell us what SVs may or may not do. The major molecular mechanism producing SVs in the mouse genome is believed to be retrotransposition12,17, which, may account for more than 80% of SVs between 100 nucleotides to 10 kilobases in length17. In cell culture, about 10% of LINE-1 insertions delete 3 DNA18,19, a process that also occurs in mouse genomic DNA20. It is not known to what extent retrotransposons, or other mechanisms of SV formation, contribute to mouse phenotypic variation and disease. What we know about the burden of SVs’ impact on phenotypes in the mouse comes primarily from analyses of gene expression14,15,21. Up to 28% of the between-strain variation in gene expression in hematopoietic stem and progenitor cells has been attributed to SVs14; for genes lying within SVs, the latter account for between 66% to 74% of between-strain expression variation in kidney, liver, lung and testis15. If the genome is replete with SVs, and given that their influence on gene expression could extend up to 500 Kb from their margins15, then SVs might be responsible for a considerable fraction of heritable gene expression variance. Since gene expression variation is believed to contribute to variation in phenotypes in the whole organism21, SVs may turn out to have a major role in the genetic determination of all aspects of mouse, and mammalian, biology. In this paper we use next generation sequencing to address three critical questions: what are the extent and complexity of SVs in the mouse genome, what are the likely mechanisms of SV formation, and to what extent do SVs contribute to phenotypic variation? Our molecular characterization of SVs in the mouse genome allows us to determine the extent to which SVs contribute to genetic and phenotypic diversity. 4 Results SV identification Using short-read paired-end mapping, we found SVs at 0.28M sites in the mouse genome, amounting to 0.71M SVs in 17 inbred strains of mice: A/J, AKR/J, BALB/cJ, C3H/HeJ, C57BL/6N, CBA/J, DBA/2J, LP/J, NOD/ShiLtJ, NZO/HlLtJ, 129S5SvEvBrd, 129P2/OlaHsd, 129S1/SvImJ, WSB/EiJ, PWK/PhJ, CAST/EiJ and SPRET/EiJ. Our catalogue contains far more SVs than previously recognized (Fig. 1a) and consists of a greater variety of molecular structures (Fig. 1b&1c). To explain why we found more, we start by describing how we went about finding SVs. We combined visual inspection of short-read sequencing data with molecular validation to improve automated SV detection across the genome. We used two criteria to identify SVs manually: read depth and anomalous paired-end mapping (PEM). We did this using data from the mouse’s smallest chromosome (19) in its entirety, and a random set of other chromosomal regions, for eight classical strains (A/J, AKR/J, BALB/cJ, C3H/HeJ, C57BL/6J, CBA/J, DBA/2J and LP/J), founder strains of heterogeneous stock (HS) population22. Based on read depth and PEM we expected to find eleven patterns that classify SVs. We refer to these as type H (“High-confidence”) patterns (H1-H11: Supplementary Fig. 1). For example, some deletions and inversions leave precise, easily identifiable signatures (Fig. 1d). In addition, we found ten patterns whose interpretation was ambiguous. We refer to these as type Q (“Questionable”) patterns (Q1-Q10: Supplementary Fig. 1, Fig. 1e). We investigated the molecular structure of all 21 patterns using a PCR 5 strategy (Supplementary Fig. 2, Supplementary Methods). We designed 584 pairs of primers and successfully amplified 547 SV sites across the eight strains (Supplementary Table 1). Our categorization of predicted SV structures, based on manual inspection of PEM patterns, resulted in the confident identification of an SV for nineteen of the 21 patterns in all instances that we examined by PCR (Supplementary Table 2). Two patterns were always false (Q6 and Q10), and arose because of the presence of a retrotransposed pseudogene giving mapping errors. Recognizing these patterns, we were able to predict underlying SV structure with high confidence. PCR confirmed that 12 patterns were indicative of a single SV and six patterns indicative of multiple adjacent SVs (Supplementary Table 2). However, SVs of type Q7 (45 cases) were due to a variable number tandem repeat, for which we could not predict the number of repeats or molecular structure. Available automated methods to identify SVs are unable to differentiate all 19 PEM patterns, and may also classify some SVs incorrectly; for example, the PEM patterns of linked insertions (Q5 and Q9: Supplementary Fig. 1) are similar to those for inversions or deletions. Therefore we adapted automated methods to recognize 15 types (Q1, Q2, Q3 and Q7 could not be unambiguously identified) identified by manual inspection and PCR validation (Supplementary Methods and 23). 6 Sensitivity and specificity analyses We established false positive and false negative rates for the automated analysis in three ways. First, we used our manually identified set of SVs on chromosome 19 (Supplementary Table 3) where we found 932 deletions (684 type H and 248 type Q), 15 inversions (2 type H and 13 type Q) and three copy number gains (all type H). False negative rates per strain range from 14% to 17% (Supplementary Table 4a); false positive rates range from 3.1% to 4.6% (Supplementary Table 4b). Second, to ensure that our sensitivity and specificity analyses were not vitiated because we used chromosome 19 as a training set for the automated analysis, we derived a second, smaller, set of manually curated deletions from a randomly chosen 10 Mb region (101 Mb to 111 Mb) from chromosome 3 in the strain C3H/HeJ. Automated analysis of this region correctly identified 43 (82.7%) and called 2 false positive deletions (4.4%). Third, we investigated the false negative rate for the automated detection of deletions across the genome using a PCR validation dataset of 267 simple deletions (Supplementary Table 1). Consistent with the chromosome 19 and chromosome 3 analyses we found that the false negative rate for deletions was between 9% and 15%, respectively (Supplementary Table 5a). We could not assess the performance of automated analysis to detect SV types other than deletions from manual inspection of chromosome 19 because so few of these rearrangements were called. So we turned to PCR- based validation of insertions, inversions and tandem duplications (n = 62 to n = 76) and found that the average false negative rate was higher than for deletions, ranging from 21% to 33% per strain (Supplementary Table 5b).
Recommended publications
  • SPACE Exploration of Chromatin Proteome to Reveal Associated RNA- Binding Proteins
    bioRxiv preprint doi: https://doi.org/10.1101/2020.07.13.200212; this version posted July 15, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. SPACE exploration of chromatin proteome to reveal associated RNA- binding proteins Mahmoud-Reza Rafiee1*, Julian A Zagalak1,2, Giulia Tyzack1,3,4, Rickie Patani1,3,4, Jernej Ule1,2, Nicholas M Luscombe1,5,6* 1The Francis Crick Institute, 1 Midland Road, London NW1 1AT, UK. 2 Department of Molecular Neuroscience, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK. 3 Sobell Department of Motor Neuroscience and Movement Disorders, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK. 4 Department of Neuroinflammation, UCL Institute of Neurology, Queen Square, London WC1N 1PJ, UK. 5 UCL Genetics Institute, University College London, Gower Street, London WC1E 6BT, UK. 6 Okinawa Institute of Science & Technology Graduate University, Okinawa 904-0495, Japan * corresponding authors Abstract Chromatin is composed of many proteins that mediate intermolecular transactions with the genome. Comprehensive knowledge of these components and their interactions is necessary for insights into gene regulation and other activities; however, reliable identification of chromatin-associated proteins remains technically challenging. Here, we present SPACE (Silica Particle Assisted Chromatin Enrichment), a stringent and straightforward chromatin- purification method that helps identify direct DNA-binders separately from chromatin- associated proteins. We demonstrate SPACE’s unique strengths in three experimental set- ups: the sensitivity to detect novel chromatin-associated proteins, the quantitative nature to measure dynamic protein use across distinct cellular conditions, and the ability to handle 10- 25 times less starting material than competing methods.
    [Show full text]
  • A Case Report of Congenital Erythropoietic Anemia II in China with a Novel Mutation
    Annals of Hematology https://doi.org/10.1007/s00277-019-03612-2 LETTER TO THE EDITOR A case report of congenital erythropoietic anemia II in China with a novel mutation Hong Zhang1 & Wuqing Wan1 & Xiaoyan Liu1 & Chuan Wen1 & Ying Liu1 & Senlin Luo1 & Xiao Sun1 & Shizhe Liu1 Received: 19 December 2018 /Accepted: 4 January 2019 # The Author(s) 2019 Dear Editor, 53.9 μmol/L (normal, 0–21), of which 42.7 μmol/L was Congenital erythropoietic anemias (CDAs) are a indirect (normal, 0–19). G6PD deficiency was not found. group of rare inherited diseases [1]. So far, the CDAs Red blood cell folate and hemoglobin electrophoresis are mainly divided into four types (type I to type IV), gave results within normal limits. Serum vitamin B12 and the CDA type II is the most common type. It is was 736 pmol/L (normal, 133–675). Serum iron, ferritin, caused by a mutation in the SEC23B gene. To date, 67 and transferrin were all within normal limits. Erythrocyte causative mutations in the SEC23B gene have been de- osmotic fragility test was normal. Acidified glycerol he- scribed [2–5] (the complete mutational spectrum of molysis test and Coombs test were negative. Light micro- SEC23B isshowninTable1). scope observation of a bone marrow smear revealed hy- We report a patient with typical clinical manifesta- perplasia and binucleated late erythroblasts (Fig. 1a). tions and laboratory findings, a 6-year-old girl who Genetic testing of the proband, her little brother, and hadsufferedjaundiceattheageof6monthswithlow her parents performed at Shanghai Xin Peijing Medical hemoglobin levels at 80 g/L.
    [Show full text]
  • Genetic Drivers of Pancreatic Islet Function
    | INVESTIGATION Genetic Drivers of Pancreatic Islet Function Mark P. Keller,*,1 Daniel M. Gatti,†,1 Kathryn L. Schueler,* Mary E. Rabaglia,* Donnie S. Stapleton,* Petr Simecek,† Matthew Vincent,† Sadie Allen,‡ Aimee Teo Broman,§ Rhonda Bacher,§ Christina Kendziorski,§ Karl W. Broman,§ Brian S. Yandell,** Gary A. Churchill,†,2 and Alan D. Attie*,2 *Department of Biochemistry, §Department of Biostatistics and Medical Informatics, and **Department of Horticulture, University of Wisconsin–Madison, Wisconsin 53706-1544, †The Jackson Laboratory, Bar Harbor, Maine 06409, and ‡Maine School of Science and Mathematics, Limestone, Maine 06409, ORCID IDs: 0000-0002-7405-5552 (M.P.K.); 0000-0002-4914-6671 (K.W.B.); 0000-0001-9190-9284 (G.A.C.); 0000-0002-0568-2261 (A.D.A.) ABSTRACT The majority of gene loci that have been associated with type 2 diabetes play a role in pancreatic islet function. To evaluate the role of islet gene expression in the etiology of diabetes, we sensitized a genetically diverse mouse population with a Western diet high in fat (45% kcal) and sucrose (34%) and carried out genome-wide association mapping of diabetes-related phenotypes. We quantified mRNA abundance in the islets and identified 18,820 expression QTL. We applied mediation analysis to identify candidate causal driver genes at loci that affect the abundance of numerous transcripts. These include two genes previously associated with monogenic diabetes (PDX1 and HNF4A), as well as three genes with nominal association with diabetes-related traits in humans (FAM83E, IL6ST, and SAT2). We grouped transcripts into gene modules and mapped regulatory loci for modules enriched with transcripts specific for a-cells, and another specific for d-cells.
    [Show full text]
  • Genome-Wide DNA Methylation Profiling in the Superior Temporal Gyrus Reveals Epigenetic Signatures Associated with Alzheimer's
    Watson et al. Genome Medicine (2016) 8:5 DOI 10.1186/s13073-015-0258-8 RESEARCH Open Access Genome-wide DNA methylation profiling in the superior temporal gyrus reveals epigenetic signatures associated with Alzheimer’s disease Corey T. Watson1, Panos Roussos1,2,3, Paras Garg1, Daniel J. Ho1, Nidha Azam1, Pavel L. Katsel2,4, Vahram Haroutunian2,3,4 and Andrew J. Sharp1* Abstract Background: Alzheimer’s disease affects ~13 % of people in the United States 65 years and older, making it the most common neurodegenerative disorder. Recent work has identified roles for environmental, genetic, and epigenetic factors in Alzheimer’s disease risk. Methods: We performed a genome-wide screen of DNA methylation using the Illumina Infinium HumanMethylation450 platform on bulk tissue samples from the superior temporal gyrus of patients with Alzheimer’s disease and non-demented controls. We paired a sliding window approach with multivariate linear regression to characterize Alzheimer’s disease-associated differentially methylated regions (DMRs). Results: We identified 479 DMRs exhibiting a strong bias for hypermethylated changes, a subset of which were independently associated with aging. DMR intervals overlapped 475 RefSeq genes enriched for gene ontology categories with relevant roles in neuron function and development, as well as cellular metabolism, and included genes reported in Alzheimer’s disease genome-wide and epigenome-wide association studies. DMRs were enriched for brain-specific histone signatures and for binding motifs of transcription factors with rolesinthebrainandAlzheimer’s disease pathology. Notably, hypermethylated DMRs preferentially overlapped poised promoter regions, marked by H3K27me3 and H3K4me3, previously shown to co-localize with aging- associated hypermethylation. Finally, the integration of DMR-associated single nucleotide polymorphisms with Alzheimer’s disease genome-wide association study risk loci and brain expression quantitative trait loci highlights multiple potential DMRs of interest for further functional analysis.
    [Show full text]
  • Congenital Dyserythropoietic Anemia Type II
    Punzo et al. Orphanet Journal of Rare Diseases 2011, 6:89 http://www.ojrd.com/content/6/1/89 RESEARCH Open Access Congenital Dyserythropoietic Anemia Type II: molecular analysis and expression of the SEC23B Gene Francesca Punzo1,2, Aida M Bertoli-Avella1, Saverio Scianguetta2, Fulvio Della Ragione3, Maddalena Casale2, Luisa Ronzoni4, Maria D Cappellini4, Gianluca Forni5, Ben A Oostra1 and Silverio Perrotta2* Abstract Background: Congenital dyserythropoietic anemia type II (CDAII), the most common form of CDA, is an autosomal recessive condition. CDAII diagnosis is based on invasive, expensive, and time consuming tests that are available only in specialized laboratories. The recent identification of SEC23B mutations as the cause of CDAII opens new possibilities for the molecular diagnosis of the disease. The aim of this study was to characterize molecular genomic SEC23B defects in 16 unrelated patients affected by CDAII and correlate the identified genetic alterations with SEC23B transcript and protein levels in erythroid precursors. Methods: SEC23B was sequenced in 16 patients, their relatives and 100 control participants. SEC23B transcript level were studied by quantitative PCR (qPCR) in peripheral erythroid precursors and lymphocytes from the patients and healthy control participants. Sec23B protein content was analyzed by immunoblotting in samples of erythroblast cells from CDAII patients and healthy controls. Results: All of the investigated cases carried SEC23B mutations on both alleles, with the exception of two patients in which a single heterozygous mutation was found. We identified 15 different SEC23B mutations, of which four represent novel mutations: p.Gln214Stop, p.Thr485Ala, p.Val637Gly, and p.Ser727Phe. The CDAII patients exhibited a 40-60% decrease of SEC23B mRNA levels in erythroid precursors when compared with the corresponding cell type from healthy participants.
    [Show full text]
  • Consequences of Mutations in the Genes of the ER Export Machinery COPII in Vertebrates
    Biomedical Sciences Publications Biomedical Sciences 1-22-2020 Consequences of mutations in the genes of the ER export machinery COPII in vertebrates Chung-Ling Lu Iowa State University, [email protected] Jinoh Kim Iowa State University, [email protected] Follow this and additional works at: https://lib.dr.iastate.edu/bms_pubs Part of the Cellular and Molecular Physiology Commons, Molecular Biology Commons, and the Molecular Genetics Commons The complete bibliographic information for this item can be found at https://lib.dr.iastate.edu/ bms_pubs/81. For information on how to cite this item, please visit http://lib.dr.iastate.edu/ howtocite.html. This Article is brought to you for free and open access by the Biomedical Sciences at Iowa State University Digital Repository. It has been accepted for inclusion in Biomedical Sciences Publications by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Consequences of mutations in the genes of the ER export machinery COPII in vertebrates Abstract Coat protein complex II (COPII) plays an essential role in the export of cargo molecules such as secretory proteins, membrane proteins, and lipids from the endoplasmic reticulum (ER). In yeast, the COPII machinery is critical for cell viability as most COPII knockout mutants fail to survive. In mice and fish, homozygous knockout mutants of most COPII genes are embryonic lethal, reflecting the essentiality of the COPII machinery in the early stages of vertebrate development. In humans, COPII mutations, which are often hypomorphic, cause diseases having distinct clinical features. This is interesting as the fundamental cellular defect of these diseases, that is, failure of ER export, is similar.
    [Show full text]
  • Mutations in the Coat Complex II Component SEC23B
    Yang et al. Cell Death and Disease (2020) 11:157 https://doi.org/10.1038/s41419-020-2358-7 Cell Death & Disease ARTICLE Open Access Mutations in the coat complex II component SEC23B promote colorectal cancer metastasis Chunyuan Yang1,NanChen2,XiangLi1, Dan Lu3,ZhiyuanHou3,YuhuaLi3,YanJin3,JinGu2 and Yuxin Yin1,3 Abstract Metastasis is the leading cause of death for colorectal cancer (CRC). However, the protein transport process involved in CRC metastasis remains unclear. In this report, we use whole-exome sequencing and bioinformatics analysis to identify somatic mutations in CRC samples and found mutations of the protein transport gene Sec23 homolog B (SEC23B)in patients with metachronous liver metastasis. We show that deletion of SEC23B suppresses the membrane localization of adhesion proteins and augments cell mobility. SEC23B mutations either cause a premature stop (C649T) or impair its protein transport activity (C1467G and T488C + G791A + G2153A). Furthermore, SEC23B mutations inhibit the transport of epithelial cell adhesion molecule (EPCAM) and CD9 molecule, thereby attenuating cell adhesion and promoting invasiveness both in vitro and in vivo. Taken together, these data demonstrate the important impact of SEC23B mutations on metastasis, and we propose that SEC23B is a potential suppressor of CRC metastasis. Introduction pathway consists of a set of organelles and cargo-bearing Colorectal cancer (CRC) is one of the most prevalent vesicles10,11. The first step of this pathway is the transport 1234567890():,; 1234567890():,; 1234567890():,; 1234567890():,; cancers worldwide. According to the World Health of protein from endoplasmic reticulum (ER) to Golgi Organization (WHO), nearly 1.1 million individuals are apparatus, mediated by the coat protein II (COPII) com- diagnosed with CRC each year1.
    [Show full text]
  • Mai Muudatuntuu Ti on Man Mini
    MAIMUUDATUNTUU US009809854B2 TI ON MAN MINI (12 ) United States Patent ( 10 ) Patent No. : US 9 ,809 ,854 B2 Crow et al. (45 ) Date of Patent : Nov . 7 , 2017 Whitehead et al. (2005 ) Variation in tissue - specific gene expression ( 54 ) BIOMARKERS FOR DISEASE ACTIVITY among natural populations. Genome Biology, 6 :R13 . * AND CLINICAL MANIFESTATIONS Villanueva et al. ( 2011 ) Netting Neutrophils Induce Endothelial SYSTEMIC LUPUS ERYTHEMATOSUS Damage , Infiltrate Tissues, and Expose Immunostimulatory Mol ecules in Systemic Lupus Erythematosus . The Journal of Immunol @(71 ) Applicant: NEW YORK SOCIETY FOR THE ogy , 187 : 538 - 552 . * RUPTURED AND CRIPPLED Bijl et al. (2001 ) Fas expression on peripheral blood lymphocytes in MAINTAINING THE HOSPITAL , systemic lupus erythematosus ( SLE ) : relation to lymphocyte acti vation and disease activity . Lupus, 10 :866 - 872 . * New York , NY (US ) Crow et al . (2003 ) Microarray analysis of gene expression in lupus. Arthritis Research and Therapy , 5 :279 - 287 . * @(72 ) Inventors : Mary K . Crow , New York , NY (US ) ; Baechler et al . ( 2003 ) Interferon - inducible gene expression signa Mikhail Olferiev , Mount Kisco , NY ture in peripheral blood cells of patients with severe lupus . PNAS , (US ) 100 ( 5 ) : 2610 - 2615. * GeneCards database entry for IFIT3 ( obtained from < http : / /www . ( 73 ) Assignee : NEW YORK SOCIETY FOR THE genecards. org /cgi - bin / carddisp .pl ? gene = IFIT3 > on May 26 , 2016 , RUPTURED AND CRIPPLED 15 pages ) . * Navarra et al. (2011 ) Efficacy and safety of belimumab in patients MAINTAINING THE HOSPITAL with active systemic lupus erythematosus : a randomised , placebo FOR SPECIAL SURGERY , New controlled , phase 3 trial . The Lancet , 377 :721 - 731. * York , NY (US ) Abramson et al . ( 1983 ) Arthritis Rheum .
    [Show full text]
  • Clinical Interpretation and Implications of Whole-Genome Sequencing
    Dewey, et al WGS for clinical medicine Supplemental methods Supplementary methods ....................................................................................................... 2 Genomic sequence data generation ............................................................................................ 2 Sequence alignment, genetic variant and clinical genotype identification .................. 2 Genetic variant and clinical genotype annotation ................................................................. 5 Bioinformatic prioritization of candidate inherited disease risk variants ................... 5 Manual review of candidate inherited disease risk variants ............................................. 6 Assessment of genetic risk for coronary artery disease and diabetes ........................... 8 Supplementary Tables ........................................................................................................... 9 Box. Data sources used for whole genome sequence variant annotation. .................... 9 Table S1. Summary statement definitions for inherited risk candidates. .................. 10 Table S2. Phenotypic characterization of inherited disease conditions. .................... 15 Table S3. Common single nucleotide variants used for diabetes genetic risk assessment in Caucasians. .......................................................................................................... 16 Table S4. Common single nucleotide variants used for coronary artery disease genetic risk assessment in Caucasians.
    [Show full text]
  • 1 the TRAPP Complex Mediates Secretion Arrest Induced by Stress Granule Assembly Francesca Zappa1, Cathal Wilson1, Giusepp
    bioRxiv preprint doi: https://doi.org/10.1101/528380; this version posted February 5, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. The TRAPP complex mediates secretion arrest induced by stress granule assembly Francesca Zappa1, Cathal Wilson1, Giuseppe Di Tullio1, Michele Santoro1, Piero Pucci2, Maria Monti2, Davide D’Amico1, Sandra Pisonero Vaquero1, Rossella De Cegli1, Alessia Romano1, Moin A. Saleem3, Elena Polishchuk1, Mario Failli1, Laura Giaquinto1, Maria Antonietta De Matteis1, 2 1 Telethon Institute of Genetics and Medicine, Pozzuoli (Naples), Italy 2 Federico II University, Naples, Italy 3 Bristol Renal, Bristol Medical School, University of Bristol, UK Correspondence to: [email protected], [email protected] 1 bioRxiv preprint doi: https://doi.org/10.1101/528380; this version posted February 5, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. The TRAnsport-Protein-Particle (TRAPP) complex controls multiple membrane trafficking steps and is thus strategically positioned to mediate cell adaptation to diverse environmental conditions, including acute stress. We have identified TRAPP as a key component of a branch of the integrated stress response that impinges on the early secretory pathway. TRAPP associates with and drives the recruitment of the COPII coat to stress granules (SGs) leading to vesiculation of the Golgi complex and an arrest of ER export.
    [Show full text]
  • The ULK1-FBXW5-SEC23B Nexus Controls Autophagy
    bioRxiv preprint doi: https://doi.org/10.1101/441923; this version posted October 12, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. The ULK1‐FBXW5‐SEC23B nexus controls autophagy Yeon‐Tae Jeong1,2, Daniele Simoneschi1,2, Sarah Keegan1,2,3, David Melville4, Natalia S. Adler5, Anita Saraf6, Laurence Florens6, Michael P. Washburn6,7, Claudio N. Cavasotto5, David Fenyö1,2,3, Ana‐Maria Cuervo8, Mario Rossi1,2,5, and Michele Pagano1,2,9* 1Department of Biochemistry and Molecular Pharmacology, 2Perlmutter NYU Cancer Center, 3Institute for System Genetics, and 9Howard Hughes Medical Institute; New York University School of Medicine, 522 First Avenue, New York, NY 10016, USA. 4Department of Molecular and Cellular Biology and Howard Hughes Medical Institute, University of California, Berkeley, CA, USA. 5Instituto de Investigación en Biomedicina de Buenos Aires‐ CONICET‐‐Partner Institute of the Max Planck Society, C1425FQD Buenos Aires, Argentina. 6The Stowers Institute for Medical Research, 1000 East 50th Street, Kansas City, MO 64110, USA. 7Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Boulevard, Kansas City, Kansas 66160, USA. 8Department of Developmental and Molecular Biology and Institute for Aging Studies, Albert Einstein College of Medicine, Bronx, NY USA. *Correspondence: [email protected] 1 | Page bioRxiv preprint doi: https://doi.org/10.1101/441923; this version posted October 12, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
    [Show full text]
  • The DNA Sequence and Comparative Analysis of Human Chromosome 20
    articles The DNA sequence and comparative analysis of human chromosome 20 P. Deloukas, L. H. Matthews, J. Ashurst, J. Burton, J. G. R. Gilbert, M. Jones, G. Stavrides, J. P. Almeida, A. K. Babbage, C. L. Bagguley, J. Bailey, K. F. Barlow, K. N. Bates, L. M. Beard, D. M. Beare, O. P. Beasley, C. P. Bird, S. E. Blakey, A. M. Bridgeman, A. J. Brown, D. Buck, W. Burrill, A. P. Butler, C. Carder, N. P. Carter, J. C. Chapman, M. Clamp, G. Clark, L. N. Clark, S. Y. Clark, C. M. Clee, S. Clegg, V. E. Cobley, R. E. Collier, R. Connor, N. R. Corby, A. Coulson, G. J. Coville, R. Deadman, P. Dhami, M. Dunn, A. G. Ellington, J. A. Frankland, A. Fraser, L. French, P. Garner, D. V. Grafham, C. Grif®ths, M. N. D. Grif®ths, R. Gwilliam, R. E. Hall, S. Hammond, J. L. Harley, P. D. Heath, S. Ho, J. L. Holden, P. J. Howden, E. Huckle, A. R. Hunt, S. E. Hunt, K. Jekosch, C. M. Johnson, D. Johnson, M. P. Kay, A. M. Kimberley, A. King, A. Knights, G. K. Laird, S. Lawlor, M. H. Lehvaslaiho, M. Leversha, C. Lloyd, D. M. Lloyd, J. D. Lovell, V. L. Marsh, S. L. Martin, L. J. McConnachie, K. McLay, A. A. McMurray, S. Milne, D. Mistry, M. J. F. Moore, J. C. Mullikin, T. Nickerson, K. Oliver, A. Parker, R. Patel, T. A. V. Pearce, A. I. Peck, B. J. C. T. Phillimore, S. R. Prathalingam, R. W. Plumb, H. Ramsay, C. M.
    [Show full text]