Mouse Wdfy3 Conditional Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Wdfy3 Conditional Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Wdfy3 Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Wdfy3 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Wdfy3 gene (NCBI Reference Sequence: NM_172882 ; Ensembl: ENSMUSG00000043940 ) is located on Mouse chromosome 5. 67 exons are identified, with the ATG start codon in exon 4 and the TGA stop codon in exon 67 (Transcript: ENSMUST00000053177). Exon 8 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Wdfy3 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-447M5 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Mice homozygous for hypomorphic mutations of this gene exhibit perinatal lethality, altered neural progenitor divisions and neuronal migration, a regionally enlarged cerebral cortex, and focal cortical dysplasias. Exon 8 starts from about 5.48% of the coding region. The knockout of Exon 8 will result in frameshift of the gene. The size of intron 7 for 5'-loxP site insertion: 4210 bp, and the size of intron 8 for 3'-loxP site insertion: 1569 bp. The size of effective cKO region: ~693 bp. The cKO region does not have any other known gene. Page 1 of 8 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele gRNA region 5' gRNA region 3' 1 8 9 67 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Exon of mouse Wdfy3 Homology arm cKO region loxP site Page 2 of 8 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Overview of the GC Content Distribution Window size: 300 bp Sequence 12 Summary: Full Length(7193bp) | A(27.99% 2013) | C(20.8% 1496) | T(28.81% 2072) | G(22.41% 1612) Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 8 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr5 - 101953424 101956423 3000 browser details YourSeq 172 955 2560 3000 86.2% chr2 + 168319466 168451442 131977 browser details YourSeq 160 663 1042 3000 88.6% chr9 - 69790273 69790623 351 browser details YourSeq 157 897 2561 3000 92.0% chr10 - 93629571 93692640 63070 browser details YourSeq 150 2469 2935 3000 82.5% chr9 + 103317367 103317821 455 browser details YourSeq 148 882 1063 3000 92.1% chr16 - 61216703 61216919 217 browser details YourSeq 139 891 1060 3000 89.5% chr13 + 100668126 100668277 152 browser details YourSeq 136 892 1071 3000 85.6% chr8 + 66232498 66232664 167 browser details YourSeq 136 891 1054 3000 93.2% chr19 + 5189290 5189471 182 browser details YourSeq 135 897 2489 3000 92.5% chr11 + 59949049 60011541 62493 browser details YourSeq 132 891 1042 3000 94.1% chr12 + 78870293 78870447 155 browser details YourSeq 132 891 1060 3000 88.0% chr1 + 134472871 134473033 163 browser details YourSeq 131 893 1050 3000 90.9% chr2 - 170569426 170569581 156 browser details YourSeq 130 891 1060 3000 88.6% chr14 - 62781481 62781647 167 browser details YourSeq 130 893 1038 3000 94.6% chr11 + 86602604 86602749 146 browser details YourSeq 129 884 1060 3000 84.0% chr12 + 12231528 12231696 169 browser details YourSeq 128 880 1039 3000 89.8% chr1 - 6194789 6194941 153 browser details YourSeq 128 881 1039 3000 88.6% chr5 + 136999391 136999542 152 browser details YourSeq 128 883 1039 3000 88.9% chr10 + 57194229 57194382 154 browser details YourSeq 127 891 1039 3000 93.3% chr11 + 98634687 98634848 162 Note: The 3000 bp section upstream of Exon 8 is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr5 - 101949731 101952730 3000 browser details YourSeq 221 279 693 3000 91.1% chr5 - 103937663 103938125 463 browser details YourSeq 219 279 693 3000 89.3% chr5 - 106728336 106728775 440 browser details YourSeq 213 258 712 3000 88.3% chr1 + 10115070 10115578 509 browser details YourSeq 209 290 704 3000 87.8% chr1 - 74705775 74706233 459 browser details YourSeq 202 273 707 3000 85.4% chr7 - 65165525 65166017 493 browser details YourSeq 200 279 669 3000 85.3% chr14 - 60428893 60429332 440 browser details YourSeq 199 289 946 3000 84.5% chr13 - 17843691 17844310 620 browser details YourSeq 199 279 693 3000 91.4% chr11 + 105359258 105359716 459 browser details YourSeq 189 285 703 3000 88.0% chr6 - 97607811 97608286 476 browser details YourSeq 189 279 707 3000 87.6% chr13 - 73751455 73751900 446 browser details YourSeq 186 279 693 3000 86.1% chr1 - 164957159 164957626 468 browser details YourSeq 185 279 588 3000 88.4% chr5 + 38857366 38857745 380 browser details YourSeq 182 286 691 3000 82.7% chr11 - 76731232 76731663 432 browser details YourSeq 181 267 693 3000 88.2% chr18 - 82843675 82844127 453 browser details YourSeq 181 288 707 3000 93.0% chr12 - 55472794 55473257 464 browser details YourSeq 181 279 693 3000 88.4% chr10 - 89449788 89450279 492 browser details YourSeq 179 286 699 3000 89.5% chr9 + 102389155 102389612 458 browser details YourSeq 179 279 588 3000 88.3% chr16 + 4743049 4743391 343 browser details YourSeq 179 285 705 3000 89.8% chr10 + 117932641 117933105 465 Note: The 3000 bp section downstream of Exon 8 is BLAT searched against the genome. No significant similarity is found. Page 4 of 8 https://www.alphaknockout.com Gene and protein information: Wdfy3 WD repeat and FYVE domain containing 3 [ Mus musculus (house mouse) ] Gene ID: 72145, updated on 19-Oct-2019 Gene summary Official Symbol Wdfy3 provided by MGI Official Full Name WD repeat and FYVE domain containing 3 provided by MGI Primary source MGI:MGI:1096875 See related Ensembl:ENSMUSG00000043940 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as ALFY; BWF1; Bchs; Ggtb3; ZFYVE25; AW319683; D5Ertd66e; mKIAA0993; B930017C24; 2610509D04Rik Expression Ubiquitous expression in cerebellum adult (RPKM 13.9), whole brain E14.5 (RPKM 11.6) and 28 other tissues See more Orthologs human all Genomic context Location: 5 E4-E5; 5 48.95 cM See Wdfy3 in Genome Data Viewer Exon count: 70 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 5 NC_000071.6 (101832953..102072215, complement) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 5 NC_000071.5 (102261972..102498940, complement) Chromosome 5 - NC_000071.6 Page 5 of 8 https://www.alphaknockout.com Transcript information: This gene has 7 transcripts Gene: Wdfy3 ENSMUSG00000043940 Description WD repeat and FYVE domain containing 3 [Source:MGI Symbol;Acc:MGI:1096875] Gene Synonyms 2610509D04Rik, Alfy, Bchs, Bwf1, D5Ertd66e, Ggtb3 Location Chromosome 5: 101,832,956-102,069,921 reverse strand. GRCm38:CM000998.2 About this gene This gene has 7 transcripts (splice variants), 227 orthologues, 7 paralogues, is a member of 1 Ensembl protein family and is associated with 27 phenotypes. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Wdfy3- ENSMUST00000053177.13 14274 3508aa ENSMUSP00000052607.7 Protein coding CCDS19473 Q6VNB8 TSL:1 201 GENCODE basic APPRIS P2 Wdfy3- ENSMUST00000174598.7 10581 3526aa ENSMUSP00000134244.1 Protein coding - G3UYW1 TSL:5 205 GENCODE basic APPRIS ALT1 Wdfy3- ENSMUST00000212024.1 10539 3512aa ENSMUSP00000148521.1 Protein coding - A0A1D5RLV7 TSL:5 207 GENCODE basic Wdfy3- ENSMUST00000174698.1 3858 913aa ENSMUSP00000134541.1 Protein coding - Q6VNB8 TSL:1 206 GENCODE basic Wdfy3- ENSMUST00000172927.1 641 124aa ENSMUSP00000133979.1 Nonsense mediated - G3UY81 CDS 5' 203 decay incomplete TSL:5 Wdfy3- ENSMUST00000173955.1 2713 No - Retained intron - - TSL:1 204 protein Wdfy3- ENSMUST00000172512.1 704 No - lncRNA - - TSL:3 202 protein Page 6 of 8 https://www.alphaknockout.com 256.97 kb Forward strand 101.85Mb 101.90Mb 101.95Mb 102.00Mb 102.05Mb Genes Cds1-201 >protein coding Gm42934-201 >TEC Gm29707-202 >lncRNA (Comprehensive set... Gm20548-201 >lncRNA Gm29707-201 >lncRNA Gm20548-202 >lncRNA Gm20548-205 >lncRNA Gm20548-204 >lncRNA Gm20548-203 >lncRNA Contigs < AC171109.2 < AC131915.7 AC158143.6 > Genes (Comprehensive set... < Wdfy3-201protein coding < Wdfy3-204retained intron < Wdfy3-203nonsense mediated decay < Gm43787-201TEC < Wdfy3-207protein coding < Wdfy3-205protein coding < Wdfy3-206protein coding < Wdfy3-202lncRNA Regulatory Build 101.85Mb 101.90Mb 101.95Mb 102.00Mb 102.05Mb Reverse strand 256.97 kb Regulation Legend CTCF Enhancer Open Chromatin Promoter Promoter
Recommended publications
  • Whole Genome Sequencing of Nearly Isogeneic WMI and WLI Inbred Rats Identifies Genes Potentially Involved in Depression
    bioRxiv preprint doi: https://doi.org/10.1101/2020.12.04.411769; this version posted January 11, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Whole genome sequencing of nearly isogeneic WMI and WLI inbred rats identifies genes potentially involved in depression Tristan de Jong, Panjun Kim, Victor Guryev, Megan Mulligan, Robert W Williams, Eva E Redei, Hao Chen Abstract Background: The WMI and WLI inbred rat substrains were generated from the stress-prone, and not yet fully inbred, Wistar Kyoto (WKY) strain using bi-directional selection for immobility in the forced swim test followed by over 38 generations of inbreeding. Despite the low level of genetic diversity among WKY progenitors, the WMI substrain is more vulnerable to stress relative to its WLI control substrain. Here we quantify numbers and classes of sequence variants distinguishing these substrains and test the hypothesis that they are nearly isogenic. Results: The WLI and WMI genomic DNA were sequenced using Illumina xTen, IonTorrent and 10X Chromium technologies to obtain a combined coverage of over 100X. We identified 4,296 high quality homozygous SNPs and indels that differ between the WMI and WLI substrains. Gene ontology analysis of these variants showed an enrichment for neurogenesis related pathways. In addition, high impact variations were detected in genes previously implicated in depression (e.g. Gnat2), depression-like behavior (e.g. Prlr, Nlrp1a), other psychiatric disease (e.g.
    [Show full text]
  • Mechanisms Underlying Phenotypic Heterogeneity in Simplex Autism Spectrum Disorders
    Mechanisms Underlying Phenotypic Heterogeneity in Simplex Autism Spectrum Disorders Andrew H. Chiang Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy under the Executive Committee of the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2021 © 2021 Andrew H. Chiang All Rights Reserved Abstract Mechanisms Underlying Phenotypic Heterogeneity in Simplex Autism Spectrum Disorders Andrew H. Chiang Autism spectrum disorders (ASD) are a group of related neurodevelopmental diseases displaying significant genetic and phenotypic heterogeneity. Despite recent progress in ASD genetics, the nature of phenotypic heterogeneity across probands is not well understood. Notably, likely gene- disrupting (LGD) de novo mutations affecting the same gene often result in substantially different ASD phenotypes. We find that truncating mutations in a gene can result in a range of relatively mild decreases (15-30%) in gene expression due to nonsense-mediated decay (NMD), and show that more severe autism phenotypes are associated with greater decreases in expression. We also find that each gene with recurrent ASD mutations can be described by a parameter, phenotype dosage sensitivity (PDS), which characteriZes the relationship between changes in a gene’s dosage and changes in a given phenotype. Using simple linear models, we show that changes in gene dosage account for a substantial fraction of phenotypic variability in ASD. We further observe that LGD mutations affecting the same exon frequently lead to strikingly similar phenotypes in unrelated ASD probands. These patterns are observed for two independent proband cohorts and multiple important ASD-associated phenotypes. The observed phenotypic similarities are likely mediated by similar changes in gene dosage and similar perturbations to the relative expression of splicing isoforms.
    [Show full text]
  • Identification of Protein Features Encoded by Alternative Exons Using Exon Ontology
    Downloaded from genome.cshlp.org on October 2, 2021 - Published by Cold Spring Harbor Laboratory Press Resource Identification of protein features encoded by alternative exons using Exon Ontology Léon-Charles Tranchevent,1 Fabien Aubé,1 Louis Dulaurier,1 Clara Benoit-Pilven,1 Amandine Rey,1 Arnaud Poret,1 Emilie Chautard,2 Hussein Mortada,1 François-Olivier Desmet,1 Fatima Zahra Chakrama,1 Maira Alejandra Moreno-Garcia,1 Evelyne Goillot,3 Stéphane Janczarski,1 Franck Mortreux,1 Cyril F. Bourgeois,1,4 and Didier Auboeuf1,4 1Université Lyon 1, ENS de Lyon, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, F-69007, Lyon, France; 2Laboratoire de Biométrie et Biologie Évolutive, Université Lyon 1, UMR CNRS 5558, INRIA Erable, Villeurbanne, F-69622, France; 3Institut NeuroMyoGène, CNRS UMR 5310, INSERM U1217, Université Lyon 1, Lyon, F-69007 France Transcriptomic genome-wide analyses demonstrate massive variation of alternative splicing in many physiological and pathological situations. One major challenge is now to establish the biological contribution of alternative splicing var- iation in physiological- or pathological-associated cellular phenotypes. Toward this end, we developed a computational approach, named Exon Ontology, based on terms corresponding to well-characterized protein features organized in an ontology tree. Exon Ontology is conceptually similar to Gene Ontology-based approaches but focuses on exon-encod- ed protein features instead of gene level functional annotations. Exon Ontology describes the protein features encoded by a selected list of exons and looks for potential Exon Ontology term enrichment. By applying this strategy to exons that are differentially spliced between epithelial and mesenchymal cells and after extensive experimental validation, we demonstrate that Exon Ontology provides support to discover specific protein features regulated by alternative splic- ing.
    [Show full text]
  • Human Leucine-Rich Repeat Proteins: a Genome-Wide Bioinformatic Categorization and Functional Analysis in Innate Immunity
    Human leucine-rich repeat proteins: a genome-wide bioinformatic categorization and functional analysis in innate immunity Aylwin C. Y. Nga,b,1, Jason M. Eisenberga,b,1, Robert J. W. Heatha, Alan Huetta, Cory M. Robinsonc, Gerard J. Nauc, and Ramnik J. Xaviera,b,2 aCenter for Computational and Integrative Biology, and Gastrointestinal Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114; bThe Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142; and cMicrobiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15261 Edited by Jeffrey I. Gordon, Washington University School of Medicine, St. Louis, MO, and approved June 11, 2010 (received for review February 17, 2010) In innate immune sensing, the detection of pathogen-associated proteins have been implicated in human diseases to date, notably molecular patterns by recognition receptors typically involve polymorphisms in NOD2 in Crohn disease (8, 9), CIITA in leucine-rich repeats (LRRs). We provide a categorization of 375 rheumatoid arthritis and multiple sclerosis (10), and TLR5 in human LRR-containing proteins, almost half of which lack other Legionnaire disease (11). identifiable functional domains. We clustered human LRR proteins Most LRR domains consist of a chain of between 2 and 45 by first assigning LRRs to LRR classes and then grouping the proteins LRRs (12). Each repeat in turn is typically 20 to 30 residues long based on these class assignments, revealing several of the resulting and can be divided into a highly conserved segment (HCS) fol- protein groups containing a large number of proteins with certain lowed by a variable segment (VS).
    [Show full text]
  • Inflammatory, Regulatory, and Autophagy Co-Expression Modules
    Durocher et al. Journal of Neuroinflammation (2019) 16:56 https://doi.org/10.1186/s12974-019-1433-4 RESEARCH Open Access Inflammatory, regulatory, and autophagy co-expression modules and hub genes underlie the peripheral immune response to human intracerebral hemorrhage Marc Durocher1, Bradley P. Ander1, Glen Jickling1, Farah Hamade1, Heather Hull1, Bodie Knepp1, Da Zhi Liu1, Xinhua Zhan1, Anh Tran1, Xiyuan Cheng1, Kwan Ng1, Alan Yee1, Frank R. Sharp1 and Boryana Stamova1,2* Abstract Background: Intracerebral hemorrhage (ICH) has a high morbidity and mortality. The peripheral immune system and cross-talk between peripheral blood and brain have been implicated in the ICH immune response. Thus, we delineated the gene networks associated with human ICH in the peripheral blood transcriptome. We also compared the differentially expressed genes in blood following ICH to a prior human study of perihematomal brain tissue. Methods: We performed peripheral blood whole-transcriptome analysis of ICH and matched vascular risk factor control subjects (n = 66). Gene co-expression network analysis identified groups of co-expressed genes (modules) associated with ICH and their most interconnected genes (hubs). Mixed-effects regression identified differentially expressed genes in ICH compared to controls. Results: Of seven ICH-associated modules, six were enriched with cell-specific genes: one neutrophil module, one neutrophil plus monocyte module, one T cell module, one Natural Killer cell module, and two erythroblast modules. The neutrophil/monocyte modules were enriched in inflammatory/immune pathways; the T cell module in T cell receptor signaling genes; and the Natural Killer cell module in genes regulating alternative splicing, epigenetic, and post-translational modifications.
    [Show full text]
  • 393LN V 393P 344SQ V 393P Probe Set Entrez Gene
    393LN v 393P 344SQ v 393P Entrez fold fold probe set Gene Gene Symbol Gene cluster Gene Title p-value change p-value change chemokine (C-C motif) ligand 21b /// chemokine (C-C motif) ligand 21a /// chemokine (C-C motif) ligand 21c 1419426_s_at 18829 /// Ccl21b /// Ccl2 1 - up 393 LN only (leucine) 0.0047 9.199837 0.45212 6.847887 nuclear factor of activated T-cells, cytoplasmic, calcineurin- 1447085_s_at 18018 Nfatc1 1 - up 393 LN only dependent 1 0.009048 12.065 0.13718 4.81 RIKEN cDNA 1453647_at 78668 9530059J11Rik1 - up 393 LN only 9530059J11 gene 0.002208 5.482897 0.27642 3.45171 transient receptor potential cation channel, subfamily 1457164_at 277328 Trpa1 1 - up 393 LN only A, member 1 0.000111 9.180344 0.01771 3.048114 regulating synaptic membrane 1422809_at 116838 Rims2 1 - up 393 LN only exocytosis 2 0.001891 8.560424 0.13159 2.980501 glial cell line derived neurotrophic factor family receptor alpha 1433716_x_at 14586 Gfra2 1 - up 393 LN only 2 0.006868 30.88736 0.01066 2.811211 1446936_at --- --- 1 - up 393 LN only --- 0.007695 6.373955 0.11733 2.480287 zinc finger protein 1438742_at 320683 Zfp629 1 - up 393 LN only 629 0.002644 5.231855 0.38124 2.377016 phospholipase A2, 1426019_at 18786 Plaa 1 - up 393 LN only activating protein 0.008657 6.2364 0.12336 2.262117 1445314_at 14009 Etv1 1 - up 393 LN only ets variant gene 1 0.007224 3.643646 0.36434 2.01989 ciliary rootlet coiled- 1427338_at 230872 Crocc 1 - up 393 LN only coil, rootletin 0.002482 7.783242 0.49977 1.794171 expressed sequence 1436585_at 99463 BB182297 1 - up 393
    [Show full text]
  • Cytotoxicity of Thymus Vulgaris Essential Oil Towards Human Oral Cavity Squamous Cell Carcinoma
    ANTICANCER RESEARCH 31: 81-88 (2011) Cytotoxicity of Thymus vulgaris Essential Oil Towards Human Oral Cavity Squamous Cell Carcinoma SERKAN SERTEL1,2,3, TOLGA EICHHORN2,3, PETER K. PLINKERT1 and THOMAS EFFERTH2,3 1Department of Otorhinolaryngology, Head and Neck Surgery, University of Heidelberg, Heidelberg, Germany; 2Pharmaceutical Biology (C015), German Cancer Research Center, Heidelberg, Germany; 3Department of Pharmaceutical Biology, Institute of Pharmacy and Biochemistry, University of Mainz, Mainz, Germany Abstract. Background: Oral cavity squamous cell carcinoma inhabitants per year (2). Treatment of head and neck (OCSCC) accounts for 2% to 3% of all malignancies and has carcinomas comprises surgery commonly followed by a high mortality rate. The majority of anticancer drugs are of concurrent chemo- and radiation therapy for advanced tumors. natural origin. However, it is unknown whether the medicinal However, despite the improvements which have been achieved plant Thymus vulgaris L. (thyme) is cytotoxic towards head in concurrent therapies, the overall 5-year survival rate for and neck squamous cell carcinoma (HNSCC). Materials and OCSCC remains at 50% and has not significantly improved in Methods: Cytotoxicity of thyme essential oil was investigated the past 30 years (3). Novel tumor-specific therapies are on the HNSCC cell line, UMSCC1. The IC50 of thyme required to be less toxic while maintaining a high degree of essential oil extract was 369 μg/ml. Moreover, we performed efficacy. As the majority of anticancer drugs are of natural pharmacogenomics analyses. Results: Genes involved in the origin, natural products represent a valuable source for the cell cycle, cell death and cancer were involved in the cytotoxic identification and development of novel treatment options for activity of thyme essential oil at the transcriptional level.
    [Show full text]
  • Recurrent De Novo Mutations in Neurodevelopmental Disorders: Properties and Clinical Implications Amy B
    Wilfert et al. Genome Medicine (2017) 9:101 DOI 10.1186/s13073-017-0498-x REVIEW Open Access Recurrent de novo mutations in neurodevelopmental disorders: properties and clinical implications Amy B. Wilfert1†, Arvis Sulovari1†, Tychele N. Turner1, Bradley P. Coe1 and Evan E. Eichler1,2* Abstract Next-generation sequencing (NGS) is now more accessible to clinicians and researchers. As a result, our understanding of the genetics of neurodevelopmental disorders (NDDs) has rapidly advanced over the past few years. NGS has led to the discovery of new NDD genes with an excess of recurrent de novo mutations (DNMs) when compared to controls. Development of large-scale databases of normal and disease variation has given rise to metrics exploring the relative tolerance of individual genes to human mutation. Genetic etiology and diagnosis rates have improved, which have led to the discovery of new pathways and tissue types relevant to NDDs. In this review, we highlight several key findings based on the discovery of recurrent DNMs ranging from copy number variants to point mutations. We explore biases and patterns of DNM enrichment and the role of mosaicism and secondary mutations in variable expressivity. We discuss the benefit of whole-genome sequencing (WGS) over whole-exome sequencing (WES) to understand more complex, multifactorial cases of NDD and explain how this improved understanding aids diagnosis and management of these disorders. Comprehensive assessment of the DNM landscape across the genome using WGS and other technologies will lead to the development of novel functional and bioinformatics approaches to interpret DNMs and drive new insights into NDD biology.
    [Show full text]
  • Gene Expression Network Analysis Provides Potential Targets Against SARS-Cov-2
    bioRxiv preprint doi: https://doi.org/10.1101/2020.07.06.182634; this version posted July 6, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Gene Expression Network Analysis Provides Potential Targets Against SARS-CoV-2 2 Ana I. Hernández Cordero1*, Xuan Li1, Chen Xi Yang1, Stephen Milne1,2,3, Yohan Bossé4, Philippe 3 Joubert4, Wim Timens5, Maarten van den Berge6, David Nickle7, Ke Hao8, Don D. Sin1,2 4 5 1 Centre for Heart Lung Innovation, University of British Columbia, Vancouver, BC, Canada 6 2 Division of Respiratory Medicine, Faculty of Medicine, University of British Columbia, Vancouver, 7 BC, Canada 8 3 Faculty of Medicine and Health, University of Sydney, Sydney, New South Wales, Australia 9 4 Institut universitaire de cardiologie et de pneumologie de Québec, Université Laval, Québec City, 10 QC, Canada 11 5 University of Groningen, University Medical Centre Groningen, Department of Pathology and 12 Medical Biology, Groningen, The Netherlands 13 6 University of Groningen, University Medical Center Groningen, Department of Pulmonary Diseases, 14 Groningen, The Netherlands 15 7 Merck Research Laboratories, Genetics and Pharmacogenomics, Boston, Massachusetts, United 16 States of America 17 8 Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic 18 Technology, Icahn School of Medicine at Mount Sinai, New York, NY, United States of America 19 20 * Corresponding author 21 E-mail: [email protected] 22 23 24 25 26 27 28 29 30 31 32 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.06.182634; this version posted July 6, 2020.
    [Show full text]
  • Transposon Mutagenesis Identifies Genetic Drivers of Brafv600e Melanoma
    ARTICLES Transposon mutagenesis identifies genetic drivers of BrafV600E melanoma Michael B Mann1,2, Michael A Black3, Devin J Jones1, Jerrold M Ward2,12, Christopher Chin Kuan Yew2,12, Justin Y Newberg1, Adam J Dupuy4, Alistair G Rust5,12, Marcus W Bosenberg6,7, Martin McMahon8,9, Cristin G Print10,11, Neal G Copeland1,2,13 & Nancy A Jenkins1,2,13 Although nearly half of human melanomas harbor oncogenic BRAFV600E mutations, the genetic events that cooperate with these mutations to drive melanogenesis are still largely unknown. Here we show that Sleeping Beauty (SB) transposon-mediated mutagenesis drives melanoma progression in BrafV600E mutant mice and identify 1,232 recurrently mutated candidate cancer genes (CCGs) from 70 SB-driven melanomas. CCGs are enriched in Wnt, PI3K, MAPK and netrin signaling pathway components and are more highly connected to one another than predicted by chance, indicating that SB targets cooperative genetic networks in melanoma. Human orthologs of >500 CCGs are enriched for mutations in human melanoma or showed statistically significant clinical associations between RNA abundance and survival of patients with metastatic melanoma. We also functionally validate CEP350 as a new tumor-suppressor gene in human melanoma. SB mutagenesis has thus helped to catalog the cooperative molecular mechanisms driving BRAFV600E melanoma and discover new genes with potential clinical importance in human melanoma. Substantial sun exposure and numerous genetic factors, including including BrafV600E, recapitulate the genetic and histological hallmarks skin type and family history, are the most important melanoma risk of human melanoma. In these models, increased MEK-ERK signaling factors. Familial melanoma, which accounts for <10% of cases, is asso- initiates clonal expansion of melanocytes, which is limited by oncogene- ciated with mutations in CDKN2A1, MITF2 and POT1 (refs.
    [Show full text]
  • Autismnext Positive Sample Report
    SAMPLE REPORT FINAL REPORT - 08/25/2020 Ordered By Patient Name: Patient, Sample Medical Professional: Sample Physician, MD Accession #: 00-109625 Specimen #: N/A Client: Sample Facility AP2 Order #: 835495 Specimen: Blood EDTA Gender: M Additional Authorized Recipient: Birthdate: 01/01/1976 Collected: 07/29/2020 Sample GC, CGC MRN #: N/A Received: 07/30/2020 Indication: Diagnostic AutismNext®: Analyses of 72 Genes Associated with Autism Spectrum Disorders ​ and/or Intellectual Disability RESULTS Gene Inheritance Alteration Proband POGZ Autosomal dominant Pathogenic Mutation: c.3528C>A (p.Y1176*) Heterozygous (NM_015100) ANKRD11 Autosomal dominant Variant of Uncertain Significance: c.5351C>T (p.S1784F) Heterozygous (NM_013275) SUMMARY POSITIVE: Pathogenic Mutation Detected INTERPRETATION This individual is is heterozygous for the c.3528C>A (p.Y1176*) pathogenic mutation in the POGZ gene. This result is consistent with a diagnosis of POGZ-related neurodevelopmental disorder. The expression and severity of disease for this individual cannot be predicted. Genetic testing for pathogenic mutations in family members can be helpful in identifying at-risk individuals. Genetic counseling is a recommended option for all individuals undergoing genetic testing. This individual is also heterozygous for the c.5351C>T (p.S1784F) variant of uncertain significance in the ANKRD11 gene, which may or may not contribute to this individual’s clinical history. Refer to the supplementary pages for additional information on these variants. No additional pathogenic mutations, variants of uncertain significance, or gross deletions or duplications were detected. POGZ Additional Information Genomic Coordinates Gene Symbol RefSeq ID (GRCh37) Genomic Size (bp) Total Exons Coding Exons Number of Amino Acids POGZ NM_015100 chr1:151375200-151431941 56742 19 18 1410 aa VARIANT DETAILS: The c.3528C>A (p.Y1176*) alteration, located in exon 19 (coding exon 18) of the POGZ gene, consists of a C to A substitution at nucleotide position 3528.
    [Show full text]
  • Table S1. 103 Ferroptosis-Related Genes Retrieved from the Genecards
    Table S1. 103 ferroptosis-related genes retrieved from the GeneCards. Gene Symbol Description Category GPX4 Glutathione Peroxidase 4 Protein Coding AIFM2 Apoptosis Inducing Factor Mitochondria Associated 2 Protein Coding TP53 Tumor Protein P53 Protein Coding ACSL4 Acyl-CoA Synthetase Long Chain Family Member 4 Protein Coding SLC7A11 Solute Carrier Family 7 Member 11 Protein Coding VDAC2 Voltage Dependent Anion Channel 2 Protein Coding VDAC3 Voltage Dependent Anion Channel 3 Protein Coding ATG5 Autophagy Related 5 Protein Coding ATG7 Autophagy Related 7 Protein Coding NCOA4 Nuclear Receptor Coactivator 4 Protein Coding HMOX1 Heme Oxygenase 1 Protein Coding SLC3A2 Solute Carrier Family 3 Member 2 Protein Coding ALOX15 Arachidonate 15-Lipoxygenase Protein Coding BECN1 Beclin 1 Protein Coding PRKAA1 Protein Kinase AMP-Activated Catalytic Subunit Alpha 1 Protein Coding SAT1 Spermidine/Spermine N1-Acetyltransferase 1 Protein Coding NF2 Neurofibromin 2 Protein Coding YAP1 Yes1 Associated Transcriptional Regulator Protein Coding FTH1 Ferritin Heavy Chain 1 Protein Coding TF Transferrin Protein Coding TFRC Transferrin Receptor Protein Coding FTL Ferritin Light Chain Protein Coding CYBB Cytochrome B-245 Beta Chain Protein Coding GSS Glutathione Synthetase Protein Coding CP Ceruloplasmin Protein Coding PRNP Prion Protein Protein Coding SLC11A2 Solute Carrier Family 11 Member 2 Protein Coding SLC40A1 Solute Carrier Family 40 Member 1 Protein Coding STEAP3 STEAP3 Metalloreductase Protein Coding ACSL1 Acyl-CoA Synthetase Long Chain Family Member 1 Protein
    [Show full text]