Expressed Genes (Degs) and Annotated Under GO, COG, KEGG and TF

Total Page:16

File Type:pdf, Size:1020Kb

Expressed Genes (Degs) and Annotated Under GO, COG, KEGG and TF bioRxiv preprint doi: https://doi.org/10.1101/563916; this version posted March 1, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 Transcriptome Profiling of Mouse Brain and Lung under 2 Dip2A Regulation Using RNA-Sequencing 3 4 Rajiv kumar sah1, Anlan Yang1, Fatoumata Binta Bah1, Salah Adlat1, Ameer Ali 5 Bohio2, Zin Mar Oo1, Chenhao Wang1, May zun Zaw Myint1, Noor Bahadar1, 6 Luqing Zhang1,2*, Xuechao Feng1,2*, Yaowu Zheng1,2* 7 1 Transgenic Research Center, School of Life Sciences, Northeast Normal University, 8 Changchun, China, 9 2 Key Laboratory of Molecular Epigenetics of Ministry of Education, Northeast 10 Normal University, Changchun, China, 11 12 * [email protected] (LQZ), *[email protected] (XCF); [email protected] (YWZ) 13 Abstract: 14 Disconnected interacting 2 homolog A (DIP2A) gene is highly 15 expressed in nervous system and respiratory system of developing 16 embryos. However, genes regulated by Dip2a in developing brain and 17 lung have not been systemically studied. Transcriptome of brain and lung 18 in embryonic 19.5 day (E19.5) were compared between wild type and 19 Dip2a-/- mice. Total RNAs were extracted from brain and lung of E19.5 20 embryos for RNA-Seq. Clean reads were mapped to mouse reference 21 sequence (mm9) using Tophat and assembled into transcripts by 22 Cufflinks. Edge R and DESeq were applied to identify differentially 1 bioRxiv preprint doi: https://doi.org/10.1101/563916; this version posted March 1, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 23 expressed genes (DEGs) and annotated under GO, COG, KEGG and TF. 24 An average of 50 million reads per sample was mapped to the reference 25 sequence. A total of 214 DEGs were detected in brain (82 up and 132 26 down) and 1900 DEGs in lung (1259 up and 641 down). GO enrichment 27 analysis indicated that DEGs in both Brain and Lung were mainly 28 enriched in biological processes ‘DNA-templated transcription and 29 Transcription from RNA polymerase II promoter’, ‘multicellular 30 organism development’, ‘cell differentiation’ and ‘apoptotic process’. In 31 addition, COG classification showed that both were mostly involved in 32 ‘Replication, Recombination and Repair’, ‘Signal transduction and 33 mechanism’, ‘Translation, Ribosomal structure and Biogenesis’ and 34 ‘Transcription’. KEGG enrichment analysis showed that brain was 35 mainly enriched in ‘Thryoid cancer’ pathway whereas lung in 36 ‘Complement and Coagulation Cascades’ pathway. Transcription factor 37 (TF) annotation analysis identified Zinc finger domain containing (ZF) 38 proteins were mostly regulated in lung and brain. Interestingly, study 39 identified genes Skor2, Gpr3711, Runx1, Erbb3, Frmd7, Fut10, Sox11, 40 Hapln1, Tfap2c and Plxnb3 from brain that play important roles in 41 neuronal cell maturation, differentiation and survival; genes Hoxa5, 42 Eya1, Ctsh, Erff1, Lama1, Lama2, Rspo2, Sox11, Spry4, Shh, Igf1 and 43 Wnt7a from lung are important in lung development and morphogenesis. 44 Expression levels of the candidate genes were validated by qRT-PCR. 2 bioRxiv preprint doi: https://doi.org/10.1101/563916; this version posted March 1, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 45 Genome wide transcriptional analysis using wild type and Dip2a 46 knockout mice in brain and lung at embryonic day 19.5 (E19.5) provided 47 a genetic basis of molecular function of these genes. 48 49 Introduction 50 DIP2A is a member of Disconnected (disco)-interacting 2 (DIP2) 51 protein family whose molecular anatomical function remains to be 52 clarified. DIP2A was firstly identified in Drosophila as a novel 53 transcription factor that interacts with Disconnected (disco) gene needed 54 for proper neural connection during visual system development in 55 Drosophila [1-3]. Previous studies have shown that Dip2A is highly 56 expressed in human brain and may play a role in axon patterning in 57 Central Nervous System (CNS) [4]. Bioinformatics analysis using 58 Homolog Gene suggests that DIP2A is a receptor molecule with DMAP, 59 AMP and CaiC binding domains [5]. At DNA replication site, DIP2A, in 60 a complex with DMAP1-DNMT1-HDAC, regulates neurite outgrowth 61 and synaptic plasticity [6]. Moreover, Dip2a has been previously 62 identified as a risk gene associated with neurodevelopment diseases like 63 autism spectrum disorder, Development dyslexia and Alzheimer diseases 64 [7-9]. These all evidence strongly supports the role of Dip2a gene in both 65 vertebrate and invertebrate nervous system development. However, 66 which biological process or molecular function is regulated by Dip2a 3 bioRxiv preprint doi: https://doi.org/10.1101/563916; this version posted March 1, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 67 gene during embryonic brain development is not known. 68 In order to systematically understand expression pattern and 69 physiological role of Dip2a gene, Dip2a-LacZ knockin and Dip2a 65kb 70 knockout (Dip2a-/-) mouse models were generated using CRISPR/Cas9 71 system [10]. Dip2a expression via endogenous expression of 72 β-Galactosidase gene (LacZ) have shown that Dip2a is highly expressed 73 in brain neurons, retinal ganglia cell, reproductive, vascular and Lung 74 tissue in adult and ectodermal tissue in developing embryos [11]. RNA 75 sequencing (RNA-Seq) has rapidly emerged as a favorite approach for 76 high throughput gene expression and function studies. Through 77 RNA-Seq, gene expression and gene interactions at any time point or in a 78 particular tissue can be investigated [12]. In present study, Transcriptome 79 (RNA-seq) analysis of E19.5 brain and lung of WT and Dip2a-/- embryo 80 was performed. 81 Dip2a role in brain and lung development has not been studied before. 82 A global Transcriptome analysis of brain and lung will help us in 83 understanding of Dip2a function in regulating brain and lung 84 development. A total of 214 genes in brain and 1900 genes in lung were 85 identified differentially expressed under Dip2A, suggesting that these 86 genes are potentially relevant to brain and lung development and function. 87 Those genes are further explicated and discussed in this study. 88 4 bioRxiv preprint doi: https://doi.org/10.1101/563916; this version posted March 1, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 89 Materials and methods 90 Animals 91 All procedures were conducted following guidelines recommended in 92 the guide for Care and Use of Laboratory Animals of National Institutes 93 of Health with approval of Institutional Animal Care and Use Committee 94 of Northeast Normal University (NENU/IACUC, AP2013011). Mice 95 were housed in clean facility in individual IVC cages under a normal 96 12:12h light:dark cycles in a temperature of 20oC and humidity 50 ± 20% 97 in Northeast Normal University. All mice were anesthetized before 98 euthanasia with 1% pentobarbital at a dose of 10mg/kg and all effort was 99 made to minimize suffering. 100 RNA isolation and library preparation for RNA-Seq 101 Dip2a heterozygous mice (Dip2a+/-) were intercrossed, female mice 102 checked for presence of copulation plug (Vaginal plug) and designated as 103 E0.5 day. At age of E19.5, pregnant dams were euthanized and embryos 104 were collected on ice-cold 1X PBS. Brains and lungs were dissected out 105 from each embryo and frozen immediately in liquid nitrogen. Total RNA 106 from brain and lung was isolated using RNAiso plus reagent (Takara, 107 Dalian) in accordance with the manufacturer’s instruction and followed 108 by additional step of DNase I digestion to eliminate genomic DNA 109 contamination. The quality and purity of RNA was checked by Nano drop 110 ND-1000 spectrophotometer (Thermo Fisher Scientific, USA) and 5 bioRxiv preprint doi: https://doi.org/10.1101/563916; this version posted March 1, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 111 Agilent 2100 Bio analyzer (Santa Clara, CA, USA). Library was 112 constructed and sequenced using Illumina HiseqTM 2500 (Biomarker, 113 Beijing, China). 114 Sequence Mapping, assembly and gene annotation 115 Clean data (clean reads) were obtained by filtering out adapter only 116 reads, ambiguous “N” base containing reads (N>5%) and reads with low 117 Q-value (< 10) from raw data. The clean reads were then mapped to 118 mouse reference genome (mm9) using Tophat v2.2.0 that allows up to 119 two mismatches [13]. The mapped reads were then assembled into 120 transcripts using Cufflinks v2.2.1 [13]. The transcripts were defined as 121 unigenes and analyzed via BLASTX alignment (E-value < 1.0E-5) with 122 NR (NCBI non-redundant protein) and Swiss-Prot databases under 123 BLAST2GO platform [14-15].
Recommended publications
  • X-Linked Diseases: Susceptible Females
    REVIEW ARTICLE X-linked diseases: susceptible females Barbara R. Migeon, MD 1 The role of X-inactivation is often ignored as a prime cause of sex data include reasons why women are often protected from the differences in disease. Yet, the way males and females express their deleterious variants carried on their X chromosome, and the factors X-linked genes has a major role in the dissimilar phenotypes that that render women susceptible in some instances. underlie many rare and common disorders, such as intellectual deficiency, epilepsy, congenital abnormalities, and diseases of the Genetics in Medicine (2020) 22:1156–1174; https://doi.org/10.1038/s41436- heart, blood, skin, muscle, and bones. Summarized here are many 020-0779-4 examples of the different presentations in males and females. Other INTRODUCTION SEX DIFFERENCES ARE DUE TO X-INACTIVATION Sex differences in human disease are usually attributed to The sex differences in the effect of X-linked pathologic variants sex specific life experiences, and sex hormones that is due to our method of X chromosome dosage compensation, influence the function of susceptible genes throughout the called X-inactivation;9 humans and most placental mammals – genome.1 5 Such factors do account for some dissimilarities. compensate for the sex difference in number of X chromosomes However, a major cause of sex-determined expression of (that is, XX females versus XY males) by transcribing only one disease has to do with differences in how males and females of the two female X chromosomes. X-inactivation silences all X transcribe their gene-rich human X chromosomes, which is chromosomes but one; therefore, both males and females have a often underappreciated as a cause of sex differences in single active X.10,11 disease.6 Males are the usual ones affected by X-linked For 46 XY males, that X is the only one they have; it always pathogenic variants.6 Females are biologically superior; a comes from their mother, as fathers contribute their Y female usually has no disease, or much less severe disease chromosome.
    [Show full text]
  • X-Linked Infantile Nystagmus
    X-linked infantile nystagmus Description X-linked infantile nystagmus is a condition characterized by abnormal eye movements. Nystagmus is a term that refers to involuntary side-to-side movements of the eyes. In people with this condition, nystagmus is present at birth or develops within the first six months of life. The abnormal eye movements may worsen when an affected person is feeling anxious or tries to stare directly at an object. The severity of nystagmus varies, even among affected individuals within the same family. Sometimes, affected individuals will turn or tilt their head to compensate for the irregular eye movements. Frequency The incidence of all forms of infantile nystagmus is estimated to be 1 in 5,000 newborns; however, the precise incidence of X-linked infantile nystagmus is unknown. Causes Mutations in the FRMD7 gene cause X-linked infantile nystagmus. The FRMD7 gene provides instructions for making a protein whose exact function is unknown. This protein is found mostly in areas of the brain that control eye movement and in the light-sensitive tissue at the back of the eye (retina). Research suggests that FRMD7 gene mutations cause nystagmus by disrupting the development of certain nerve cells in the brain and retina. In some people with X-linked infantile nystagmus, no mutation in the FRMD7 gene has been found. The genetic cause of the disorder is unknown in these individuals. Researchers believe that mutations in at least one other gene, which has not been identified, can cause this disorder. Learn more about the gene associated with X-linked infantile nystagmus • FRMD7 Inheritance This condition is inherited in an X-linked pattern.
    [Show full text]
  • X-Linked Idiopathic Infantile Nystagmus Associated with a Missense Mutation in FRMD7 Alan Shiels Washington University School of Medicine in St
    Washington University School of Medicine Digital Commons@Becker Open Access Publications 2007 X-linked idiopathic infantile nystagmus associated with a missense mutation in FRMD7 Alan Shiels Washington University School of Medicine in St. Louis Thomas M. Bennett Washington University School of Medicine in St. Louis Jessica B. Prince Washington University School of Medicine in St. Louis Lawrence Tychsen Washington University School of Medicine in St. Louis Follow this and additional works at: https://digitalcommons.wustl.edu/open_access_pubs Recommended Citation Shiels, Alan; Bennett, Thomas M.; Prince, Jessica B.; and Tychsen, Lawrence, ,"X-linked idiopathic infantile nystagmus associated with a missense mutation in FRMD7." Molecular Vision.13,. 2233-2241. (2007). https://digitalcommons.wustl.edu/open_access_pubs/1787 This Open Access Publication is brought to you for free and open access by Digital Commons@Becker. It has been accepted for inclusion in Open Access Publications by an authorized administrator of Digital Commons@Becker. For more information, please contact [email protected]. Molecular Vision 2007; 13:2233-41 <http://www.molvis.org/molvis/v13/a253/> ©2007 Molecular Vision Received 5 November 2007 | Accepted 26 November 2007 | Published 29 November 2007 X-linked idiopathic infantile nystagmus associated with a missense mutation in FRMD7 Alan Shiels, Thomas M. Bennett, Jessica B. Prince, Lawrence Tychsen Department of Ophthalmology and Visual Sciences, Washington University School of Medicine, St. Louis, MO Purpose: Infantile nystagmus is a clinically and genetically heterogeneous eye movement disorder. Here we map and identify the genetic mutation underlying X-linked idiopathic infantile nystagmus (XL-IIN) segregating in two Caucasian- American families. Methods: Eye movements were recorded using binocular infrared digital video-oculography.
    [Show full text]
  • Analysis of Gene Expression Profiles to Study Malaria Vaccine Dose Efficacy & Immune Response Modulation
    bioRxiv preprint doi: https://doi.org/10.1101/2021.08.05.454986; this version posted August 6, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Analysis of Gene Expression Profiles to study Malaria Vaccine Dose Efficacy & Immune Response Modulation. Supantha Dey*1,2, Harpreet Kaur2, Mohit Mazumder2, Elia Brodsky2 1- Department of Genetic Engineering & Biotechnology, University of Dhaka, Bangladesh. 2 - Pine Biotech, New Orleans, LA, USA Emails of Authors: Supantha Dey: [email protected] Harpreet Kaur:[email protected]; [email protected] Mohit Mazumder: [email protected] Elia Brodsky: [email protected] Abstract Malaria is a life-threatening disease, and the Africa is still one of the most affected endemic regions despite years of policy to limit infection and transmission rates. Further, studies into the variable efficacy of the vaccine are needed to provide a better understanding of protective immunity. Thus, the current study is designed to delineate the effect of the different vaccination doses on the transcriptional profiles of subjects to determine its efficacy and understand the molecular mechanisms underlying the protection this vaccine provides. Here, we used gene expression profiles of pre and post-vaccination patients after various doses of RTS,S based on 275 and 583 samples collected from the GEO datasets. At first, exploratory data analysis based Principal component analysis (PCA) shown the distinct pattern of different doses. Subsequently, differential gene expression analysis using edgeR revealed the significantly (FDR <0.005) 158 down-regulated and 61 upregulated genes between control vs.
    [Show full text]
  • Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham a Thesis Submitted in Conformity
    Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham A thesis submitted in conformity with the requirements for the degree of Master of Science Graduate Department of Molecular Genetics University of Toronto © Copyright by Edward James Higginbotham 2020 i Abstract Characterizing Genomic Duplication in Autism Spectrum Disorder Edward James Higginbotham Master of Science Graduate Department of Molecular Genetics University of Toronto 2020 Duplication, the gain of additional copies of genomic material relative to its ancestral diploid state is yet to achieve full appreciation for its role in human traits and disease. Challenges include accurately genotyping, annotating, and characterizing the properties of duplications, and resolving duplication mechanisms. Whole genome sequencing, in principle, should enable accurate detection of duplications in a single experiment. This thesis makes use of the technology to catalogue disease relevant duplications in the genomes of 2,739 individuals with Autism Spectrum Disorder (ASD) who enrolled in the Autism Speaks MSSNG Project. Fine-mapping the breakpoint junctions of 259 ASD-relevant duplications identified 34 (13.1%) variants with complex genomic structures as well as tandem (193/259, 74.5%) and NAHR- mediated (6/259, 2.3%) duplications. As whole genome sequencing-based studies expand in scale and reach, a continued focus on generating high-quality, standardized duplication data will be prerequisite to addressing their associated biological mechanisms. ii Acknowledgements I thank Dr. Stephen Scherer for his leadership par excellence, his generosity, and for giving me a chance. I am grateful for his investment and the opportunities afforded me, from which I have learned and benefited. I would next thank Drs.
    [Show full text]
  • Glucocorticoid Receptor and Klf4 Co-Regulate Anti-Inflammatory Genes in Keratinocytes
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Digital.CSIC Glucocorticoid receptor and Klf4 co-regulate anti-inflammatory genes in keratinocytes Lisa M. Sevilla1, Víctor Latorre1,2, Elena Carceller1, Julia Boix1, Daniel Vodák3, Ian Geoffrey Mills4, 5, 6, and Paloma Pérez1 1 Instituto de Biomedicina de Valencia-Consejo Superior de Investigaciones Científicas (IBV- CSIC), Jaime Roig 11, E-46010 Valencia, Spain. 2 Faculty of Human and Medical Sciences, The University of Manchester, Manchester UK 3 Bioinformatics Core Facility, Institute for Cancer Genetics and Informatics, The Norwegian Radium Hospital, Oslo University Hospital, Norway 4 Prostate Cancer Research Group, Centre for Molecular Medicine (Norway), University of Oslo and Oslo University Hospitals, Oslo, Norway 5 Department of Cancer Prevention, Oslo University Hospitals, Oslo, Norway. 6 Department of Urology, Oslo University Hospitals, Oslo, Norway. Corresponding autor and person to whom reprint requests should be addressed: Paloma Pérez, PhD Instituto de Biomedicina de Valencia-Consejo Superior de Investigaciones Científicas (IBV- CSIC) Jaime Roig 11, E-46010, Valencia, Spain Phone: +34-96-339-1766 Fax: +34-96-369-0800 e-mail: [email protected] Grants and fellowships supporting the writing of paper: SAF2011-28115 of the Ministerio de Economía y Competitividad from the Spanish government. EC and JB are recipients of FPU and FPI fellowships of the Spanish Ministery, respectively Disclosure Statement: The authors have nothing to disclose. Abstract The glucocorticoid (GC) receptor (GR) and Kruppel-like factor Klf4 are transcription factors that play major roles in the skin homeostasis by regulating the expression of overlapping gene subsets.
    [Show full text]
  • Development and Clinical Utility of a Novel Diagnostic Nystagmus Gene Panel Using Targeted Next-Generation Sequencing
    European Journal of Human Genetics (2017) 25, 725–734 Official journal of The European Society of Human Genetics www.nature.com/ejhg ARTICLE Development and clinical utility of a novel diagnostic nystagmus gene panel using targeted next-generation sequencing Mervyn G Thomas*,1,2, Gail DE Maconachie1,2, Viral Sheth1, Rebecca J McLean1 and Irene Gottlob*,1 Infantile nystagmus (IN) is a genetically heterogeneous disorder arising from variants of genes expressed within the developing retina and brain. IN presents a diagnostic challenge and patients often undergo numerous investigations. We aimed to develop and assess the utility of a next-generation sequencing (NGS) panel to enhance the diagnosis of IN. We identified 336 genes associated with IN from the literature and OMIM. NimbleGen Human custom array was used to enrich the target genes and sequencing was performed using HiSeq2000. Using reference genome material (NA12878), we show the sensitivity (98.5%) and specificity (99.9%) of the panel. Fifteen patients with familial IN were sequenced using the panel. Two authors were masked to the clinical diagnosis. We identified variants in 12/15 patients in the following genes: FRMD7 (n = 3), CACNA1F (n = 2), TYR (n = 5), CRYBA1 (n = 1) and TYRP1 (n = 1). In 9/12 patients, the clinical diagnosis was consistent with the genetic diagnosis. In 3/12 patients, the results from the genetic diagnoses (TYR, CRYBA1 and TYRP1 variants) enabled revision of clinical diagnoses. In 3/15 patients, we were unable to determine a genetic diagnosis. In one patient, copy number variation analysis revealed a FRMD7 deletion. This is the first study establishing the clinical utility of a diagnostic NGS panel for IN.
    [Show full text]
  • Content Based Search in Gene Expression Databases and a Meta-Analysis of Host Responses to Infection
    Content Based Search in Gene Expression Databases and a Meta-analysis of Host Responses to Infection A Thesis Submitted to the Faculty of Drexel University by Francis X. Bell in partial fulfillment of the requirements for the degree of Doctor of Philosophy November 2015 c Copyright 2015 Francis X. Bell. All Rights Reserved. ii Acknowledgments I would like to acknowledge and thank my advisor, Dr. Ahmet Sacan. Without his advice, support, and patience I would not have been able to accomplish all that I have. I would also like to thank my committee members and the Biomed Faculty that have guided me. I would like to give a special thanks for the members of the bioinformatics lab, in particular the members of the Sacan lab: Rehman Qureshi, Daisy Heng Yang, April Chunyu Zhao, and Yiqian Zhou. Thank you for creating a pleasant and friendly environment in the lab. I give the members of my family my sincerest gratitude for all that they have done for me. I cannot begin to repay my parents for their sacrifices. I am eternally grateful for everything they have done. The support of my sisters and their encouragement gave me the strength to persevere to the end. iii Table of Contents LIST OF TABLES.......................................................................... vii LIST OF FIGURES ........................................................................ xiv ABSTRACT ................................................................................ xvii 1. A BRIEF INTRODUCTION TO GENE EXPRESSION............................. 1 1.1 Central Dogma of Molecular Biology........................................... 1 1.1.1 Basic Transfers .......................................................... 1 1.1.2 Uncommon Transfers ................................................... 3 1.2 Gene Expression ................................................................. 4 1.2.1 Estimating Gene Expression ............................................ 4 1.2.2 DNA Microarrays ......................................................
    [Show full text]
  • Gnomad Lof Supplement
    1 gnomAD supplement gnomAD supplement 1 Data processing 4 Alignment and read processing 4 Variant Calling 4 Coverage information 5 Data processing 5 Sample QC 7 Hard filters 7 Supplementary Table 1 | Sample counts before and after hard and release filters 8 Supplementary Table 2 | Counts by data type and hard filter 9 Platform imputation for exomes 9 Supplementary Table 3 | Exome platform assignments 10 Supplementary Table 4 | Confusion matrix for exome samples with Known platform labels 11 Relatedness filters 11 Supplementary Table 5 | Pair counts by degree of relatedness 12 Supplementary Table 6 | Sample counts by relatedness status 13 Population and subpopulation inference 13 Supplementary Figure 1 | Continental ancestry principal components. 14 Supplementary Table 7 | Population and subpopulation counts 16 Population- and platform-specific filters 16 Supplementary Table 8 | Summary of outliers per population and platform grouping 17 Finalizing samples in the gnomAD v2.1 release 18 Supplementary Table 9 | Sample counts by filtering stage 18 Supplementary Table 10 | Sample counts for genomes and exomes in gnomAD subsets 19 Variant QC 20 Hard filters 20 Random Forest model 20 Features 21 Supplementary Table 11 | Features used in final random forest model 21 Training 22 Supplementary Table 12 | Random forest training examples 22 Evaluation and threshold selection 22 Final variant counts 24 Supplementary Table 13 | Variant counts by filtering status 25 Comparison of whole-exome and whole-genome coverage in coding regions 25 Variant annotation 30 Frequency and context annotation 30 2 Functional annotation 31 Supplementary Table 14 | Variants observed by category in 125,748 exomes 32 Supplementary Figure 5 | Percent observed by methylation.
    [Show full text]
  • Novel FRMD7 Mutations and Genomic Rearrangement Expand the Molecular Pathogenesis of X-Linked Idiopathic Infantile Nystagmus
    Genetics Novel FRMD7 Mutations and Genomic Rearrangement Expand the Molecular Pathogenesis of X-Linked Idiopathic Infantile Nystagmus Basamat AlMoallem,1,2 Miriam Bauwens,1 Sophie Walraedt,3 Patricia Delbeke,3 Julie De Zaeytijd,3 Philippe Kestelyn,3 Fran¸coise Meire,4 Sandra Janssens,1 Caroline van Cauwenbergh,1 Hannah Verdin,1 Sally Hooghe,1 Prasoon Kumar Thakur,1 Frauke Coppieters,1 Kim De Leeneer,1 Koenraad Devriendt,5 Bart P. Leroy,1,3,6 and Elfride De Baere1 1Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium 2Department of Ophthalmology, King Abdul-Aziz University Hospital, College of Medicine, King Saud University, Riyadh, Saudi Arabia 3Department of Ophthalmology, Ghent University Hospital, Ghent, Belgium 4Department of Ophthalmology, Queen Fabiola Children’s University Hospital, Brussels, Belgium 5Center for Human Genetics, Leuven University Hospitals, Leuven, Belgium 6Division of Ophthalmology, The Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, United States Correspondence: Elfride De Baere, PURPOSE. Idiopathic infantile nystagmus (IIN; OMIM 31700) with X-linked inheritance is one of Center for Medical Genetics, Ghent the most common forms of infantile nystagmus. Up to date, three X-linked loci have been University Hospital, De Pintelaan identified, Xp11.4-p11.3 (calcium/calmodulin-dependent serine protein kinase [CASK]), Xp22 185, B-9000 Ghent, Belgium; (GPR143), and Xq26-q27 (FRMD7), respectively. Here, we investigated the role of mutations [email protected]. and copy number variations (CNV) of FRMD7 and GPR143 in the molecular pathogenesis of Submitted: October 26, 2014 IIN in 49 unrelated Belgian probands. Accepted: February 5, 2015 METHODS. We set up a comprehensive molecular genetic workflow based on Sanger Citation: AlMoallem B, Bauwens M, sequencing, targeted next generation sequencing (NGS) and CNV analysis using multiplex Walraedt S, et al.
    [Show full text]
  • Structural Variations in a Non-Coding Region at 1Q32.1 Are Responsible For
    Human Genetics https://doi.org/10.1007/s00439-020-02156-0 ORIGINAL INVESTIGATION Structural variations in a non‑coding region at 1q32.1 are responsible for the NYS7 locus in two large families Wenmin Sun1 · Shiqiang Li1 · Xiaoyun Jia1 · Panfeng Wang1 · J. Fielding Hejtmancik2 · Xueshan Xiao1 · Qingjiong Zhang1 Received: 24 October 2019 / Accepted: 24 March 2020 © The Author(s) 2020 Abstract Congenital motor nystagmus (CMN) is characterized by early-onset bilateral ocular oscillations without other ocular defcits. To date, mutations in only one gene have been identifed to be responsible for CMN, i.e., FRMD7 for X-linked CMN. Four loci for autosomal dominant CMN, including NYS7 (OMIM 614826), have been mapped but the causative genes have yet to be identifed. NYS7 was mapped to 1q32.1 based on independent genome-wide linkage scan on two large families with CMN. In this study, mutations in all known protein-coding genes, both intronic sequence with predicted efect and coding sequence, in the linkage interval were excluded by whole-genome sequencing. Then, long-read genome sequencing based on the Nanopore platform was performed with a sample from each of the two families. Two deletions with an overlapping region of 775,699 bp, located in a region without any known protein-coding genes, were identifed in the two families in the linkage region. The two deletions as well as their breakpoints were confrmed by Sanger sequencing and co-segregated with CMN in the two families. The 775,699 bp deleted region contains uncharacterized non-protein-coding expressed sequences and pseudogenes but no protein-coding genes.
    [Show full text]
  • Identification of Variants in CNGA3 As Cause for Achromatopsia by Exome Sequencing of a Single Patient
    OPHTHALMIC MOLECULAR GENETICS SECTION EDITOR: JANEY L. WIGGS, MD, PhD Identification of Variants in CNGA3 as Cause for Achromatopsia by Exome Sequencing of a Single Patient Kevin Lam, MSc; Haiyan Guo, PhD; Graham A. Wilson, MBChB; Susanne Kohl, PhD; Fulton Wong, PhD, MPP Objective: To report disease-causing mutations in the Results: Analysis yielded a molecular diagnosis of achro- cyclic nucleotide-gated channel ␣ 3 gene (CNGA3) iden- matopsia. Two compound heterozygous mutations were tified by exome sequencing and bioinformatics filtering identified in CNGA3 of this patient, c.829CϾT p.R277C in a single patient. and c.1580TϾG p.L527R; they were not observed in the normal population and cosegregated with the pheno- Methods: The entire protein-coding sequence of a pa- type of achromatopsia in the patient’s family. tient with a retinal disease was enriched by in-solution targeted capture and massively parallel sequenced at 50- Conclusion: These mutations are the cause of achro- fold coverage. The assembled sequence was compared matopsia in this family. with databases of normal genomic sequences to identify Clinical Relevance: The key advantages of massively nonsynonymous variants, which were further filtered (1) parallel sequencing over linkage mapping and cloning with a prioritization of genes associated with retinal dis- are highlighted by (1) the small sample size required for eases, (2) according to the likelihood of variant damage successful analysis and (2) the rapid and high- to protein function, (3) following the predictions of a re- throughput manner in which the mutations are identi- cessive model, and (4) against common polymorphisms fied. This new tool will likely have major effects on the observed in normal genomes.
    [Show full text]