Mouse Smoc2 Conditional Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Smoc2 Conditional Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Smoc2 Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Smoc2 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Smoc2 gene (NCBI Reference Sequence: NM_022315 ; Ensembl: ENSMUSG00000023886 ) is located on Mouse chromosome 17. 13 exons are identified, with the ATG start codon in exon 1 and the TGA stop codon in exon 13 (Transcript: ENSMUST00000024660). Exon 3 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Smoc2 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-94H11 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Mice homozygous for one KO allele exhibit protection from induced kidney fibrosis and reduced interstitial myofibroblast accumulation. Another KO allele leads to shortening and widening of the skull. Exon 3 starts from about 19.16% of the coding region. The knockout of Exon 3 will result in frameshift of the gene. The size of intron 2 for 5'-loxP site insertion: 10840 bp, and the size of intron 3 for 3'-loxP site insertion: 1008 bp. The size of effective cKO region: ~607 bp. The cKO region does not have any other known gene. Page 1 of 8 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele gRNA region 5' gRNA region 3' 1 3 4 13 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Exon of mouse Smoc2 Homology arm cKO region loxP site Page 2 of 8 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. It may be difficult to construct this targeting vector. Overview of the GC Content Distribution Window size: 300 bp Sequence 12 Summary: Full Length(7107bp) | A(26.21% 1863) | C(20.88% 1484) | T(28.1% 1997) | G(24.81% 1763) Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 8 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr17 + 14333298 14336297 3000 browser details YourSeq 144 2177 2384 3000 94.0% chr3 + 149745486 149745985 500 browser details YourSeq 132 2178 2382 3000 85.1% chrX + 152390492 152390682 191 browser details YourSeq 127 2178 2384 3000 81.5% chr5 - 59803566 59803742 177 browser details YourSeq 120 2190 2500 3000 85.7% chr17 + 7574138 7574432 295 browser details YourSeq 120 2178 2384 3000 81.1% chr13 + 115247573 115247737 165 browser details YourSeq 119 2167 2388 3000 89.8% chr14 - 35902263 35902645 383 browser details YourSeq 119 2198 2384 3000 82.3% chr13 - 82858849 82859014 166 browser details YourSeq 119 2177 2384 3000 89.4% chr19 + 34498078 34498286 209 browser details YourSeq 113 2177 2376 3000 81.9% chr9 + 31594499 31594660 162 browser details YourSeq 110 2189 2391 3000 91.0% chr14 + 71583722 71583946 225 browser details YourSeq 110 2184 2384 3000 85.2% chr11 + 91201266 91201424 159 browser details YourSeq 109 2214 2384 3000 90.3% chr19 + 46129035 46129305 271 browser details YourSeq 107 2178 2334 3000 85.3% chr10 - 45007530 45007676 147 browser details YourSeq 105 2179 2383 3000 89.2% chr11 + 32779604 32779804 201 browser details YourSeq 102 2167 2334 3000 83.5% chr1 - 162384446 162384600 155 browser details YourSeq 102 2178 2384 3000 78.6% chr13 + 115247597 115247725 129 browser details YourSeq 101 2178 2334 3000 82.7% chr18 - 29947440 29947570 131 browser details YourSeq 98 2196 2376 3000 80.5% chr16 - 78074773 78074923 151 browser details YourSeq 97 2200 2382 3000 81.9% chr6 - 97711215 97711359 145 Note: The 3000 bp section upstream of Exon 3 is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr17 + 14336905 14339904 3000 browser details YourSeq 112 2360 2672 3000 91.8% chr9 + 113482567 113483303 737 browser details YourSeq 111 2370 2669 3000 83.4% chr1 - 34672870 34673163 294 browser details YourSeq 108 2359 2681 3000 88.3% chr10 + 67387121 67387633 513 browser details YourSeq 105 2357 2672 3000 79.2% chr11 - 8955268 8955554 287 browser details YourSeq 97 2362 2649 3000 81.5% chr10 + 6981484 6981755 272 browser details YourSeq 94 2435 2673 3000 80.0% chr17 + 69258310 69258489 180 browser details YourSeq 90 2400 2668 3000 80.0% chr15 - 101982413 101982632 220 browser details YourSeq 90 2438 2685 3000 91.6% chr10 - 59257132 59257389 258 browser details YourSeq 90 2400 2681 3000 79.9% chr8 + 121238655 121238879 225 browser details YourSeq 88 2203 2656 3000 75.8% chr12 - 19203156 19203387 232 browser details YourSeq 85 2450 2681 3000 94.0% chr1 - 130563666 130564091 426 browser details YourSeq 85 2360 2656 3000 89.0% chr11 + 98655715 98656014 300 browser details YourSeq 80 2406 2690 3000 79.4% chr11 - 111531567 111531791 225 browser details YourSeq 80 2466 2690 3000 88.4% chr4 + 153190046 153190301 256 browser details YourSeq 79 2436 2690 3000 91.7% chr8 - 120032191 120032562 372 browser details YourSeq 78 2438 2669 3000 90.6% chr8 + 121238657 121238891 235 browser details YourSeq 77 2394 2690 3000 76.4% chr11 - 38909971 38910155 185 browser details YourSeq 77 2440 2672 3000 80.0% chr1 - 192514567 192514741 175 browser details YourSeq 77 2374 2670 3000 89.5% chr11 + 6783461 6783806 346 Note: The 3000 bp section downstream of Exon 3 is BLAT searched against the genome. No significant similarity is found. Page 4 of 8 https://www.alphaknockout.com Gene and protein information: Smoc2 SPARC related modular calcium binding 2 [ Mus musculus (house mouse) ] Gene ID: 64074, updated on 14-Oct-2019 Gene summary Official Symbol Smoc2 provided by MGI Official Full Name SPARC related modular calcium binding 2 provided by MGI Primary source MGI:MGI:1929881 See related Ensembl:ENSMUSG00000023886 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as Smoc2l; 1700056C05Rik; 5430426J21Rik Expression Broad expression in ovary adult (RPKM 125.2), subcutaneous fat pad adult (RPKM 55.4) and 16 other tissues See more Orthologs human all Genomic context Location: 17 A2; 17 8.95 cM See Smoc2 in Genome Data Viewer Exon count: 13 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 17 NC_000083.6 (14279490..14404790) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 17 NC_000083.5 (14416513..14541797) Chromosome 17 - NC_000083.6 Page 5 of 8 https://www.alphaknockout.com Transcript information: This gene has 7 transcripts Gene: Smoc2 ENSMUSG00000023886 Description SPARC related modular calcium binding 2 [Source:MGI Symbol;Acc:MGI:1929881] Gene Synonyms 1700056C05Rik, 5430426J21Rik, Smoc2l Location Chromosome 17: 14,279,506-14,404,790 forward strand. GRCm38:CM001010.2 About this gene This gene has 7 transcripts (splice variants), 200 orthologues, 1 paralogue, is a member of 1 Ensembl protein family and is associated with 6 phenotypes. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Smoc2-201 ENSMUST00000024660.8 2832 447aa ENSMUSP00000024660.8 Protein coding CCDS37442 Q8CD91 TSL:1 GENCODE basic APPRIS P1 Smoc2-204 ENSMUST00000233344.1 2817 458aa ENSMUSP00000156572.1 Protein coding - A0A3B2WCK7 GENCODE basic Smoc2-202 ENSMUST00000232923.1 588 89aa ENSMUSP00000156632.1 Protein coding - A0A3B2WCQ8 GENCODE basic Smoc2-206 ENSMUST00000233540.1 3972 No protein - Retained intron - - - Smoc2-205 ENSMUST00000233526.1 572 No protein - lncRNA - - - Smoc2-203 ENSMUST00000232936.1 521 No protein - lncRNA - - - Smoc2-207 ENSMUST00000233686.1 337 No protein - lncRNA - - - Page 6 of 8 https://www.alphaknockout.com 145.28 kb Forward strand 14.30Mb 14.35Mb 14.40Mb Genes Gm34567-201 >lncRNA Gm49720-201 >processed pseudogene (Comprehensive set... Smoc2-201 >protein coding Smoc2-204 >protein coding Smoc2-206 >retained intron Smoc2-205 >lncRNA Smoc2-202 >protein coding Smoc2-203 >lncRNA Smoc2-207 >lncRNA Contigs < AC171111.2 < AC169675.2 < CT010437.15 Genes < 4930474M22Rik-201lncRNA (Comprehensive set... Regulatory Build 14.30Mb 14.35Mb 14.40Mb Reverse strand 145.28 kb Regulation Legend CTCF Enhancer Open Chromatin Promoter Promoter Flank Transcription Factor Binding Site Gene Legend Protein Coding Ensembl protein coding merged Ensembl/Havana Non-Protein Coding processed transcript RNA gene pseudogene Page 7 of 8 https://www.alphaknockout.com Transcript: ENSMUST00000024660 125.28 kb Forward strand Smoc2-201 >protein coding ENSMUSP00000024... MobiDB lite Low complexity (Seg) Cleavage site (Sign... Superfamily Thyroglobulin type-1 superfamily EF-hand
Recommended publications
  • Molecular Classification Reveals the Diverse Genetic Features and Prognosis
    bioRxiv preprint doi: https://doi.org/10.1101/2021.06.07.447364; this version posted June 7, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Molecular classification reveals the diverse genetic features and prognosis of gastric cancer: a multi-omics consensus ensemble clustering 1 Xianyu Hu1#, Zhenglin Wang1#, Qing Wang2#, Ke Chen1, Qijun Han1, Suwen Bai3, 2 Juan Du3,4*, Wei Chen1* 3 1 Department of General Surgery, The First Affiliated Hospital of Anhui Medical 4 University Hefei, Anhui Province 230022, P.R. China 5 2 Department of Biliary-Pancreatic Minimally Invasive Surgery, The First Affiliated 6 Hospital of Shantou University Medical College, Shantou, Guangdong Province 515000, 7 P.R. China 8 3 Longgang District People’s Hospital of Shenzhen & The Third Affiliated Hospital 9 (Provisional) of The Chinese University of Hong Kong, Shenzhen, Shenzhen, 10 Guangdong Province 518172, P.R. China 11 4 School of Medicine, The Chinese University of Hong Kong, Shenzhen, Shenzhen, 12 Guangdong Province 518172, P.R. China 13 14 # These authors contributed equally to the study. 15 Correspondence should be addressed to Wei Chen ([email protected]) 16 17 18 bioRxiv preprint doi: https://doi.org/10.1101/2021.06.07.447364; this version posted June 7, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 19 *Correspondence to: 20 Prof. Juan Du 21 Longgang District People’s Hospital of Shenzhen & The Third Affiliated Hospital 22 (Provisional) of The Chinese University of Hong Kong, Shenzhen, Shenzhen, 23 Guangdong Province 518172, P.R.
    [Show full text]
  • Molecular Cytogenetic Characterisation of a Novel De Novo Ring
    Pace et al. Molecular Cytogenetics (2017) 10:9 DOI 10.1186/s13039-017-0311-y CASEREPORT Open Access Molecular cytogenetic characterisation of a novel de novo ring chromosome 6 involving a terminal 6p deletion and terminal 6q duplication in the different arms of the same chromosome Nikolai Paul Pace1, Frideriki Maggouta2, Melissa Twigden2 and Isabella Borg1,3,4* Abstract Background: Ring chromosome 6 is a rare sporadic chromosomal abnormality, associated with extreme variability in clinical phenotypes. Most ring chromosomes are known to have deletions on one or both chromosomal arms. Here, we report an atypical and unique ring chromosome 6 involving both a distal deletion and a distal duplication on the different arms of the same chromosome. Case presentation: In a patient with intellectual disability, short stature, microcephaly, facial dysmorphology, congenital heart defects and renovascular disease, a ring chromosome 6 was characterised using array-CGH and dual-colour FISH. The de-novo ring chromosome 6 involved a 1.8 Mb terminal deletion in the distal short arm and a 2.5 Mb duplication in the distal long arm of the same chromosome 6. This results in monosomy for the region 6pter to 6p25.3 and trisomy for the region 6q27 to 6qter. Analysis of genes in these chromosomal regions suggests that haploinsufficiency for FOXC1 and GMDS genes accounts for the cardiac and neurodevelopmental phenotypes in the proband. The ring chromosome 6 reported here is atypical as it involves a unique duplication of the distal long arm. Furthermore, the presence of renovascular disease is also a unique feature identified in this patient.
    [Show full text]
  • Heterogeneity Between Primary Colon Carcinoma and Paired Lymphatic and Hepatic Metastases
    MOLECULAR MEDICINE REPORTS 6: 1057-1068, 2012 Heterogeneity between primary colon carcinoma and paired lymphatic and hepatic metastases HUANRONG LAN1, KETAO JIN2,3, BOJIAN XIE4, NA HAN5, BINBIN CUI2, FEILIN CAO2 and LISONG TENG3 Departments of 1Gynecology and Obstetrics, and 2Surgical Oncology, Taizhou Hospital, Wenzhou Medical College, Linhai, Zhejiang; 3Department of Surgical Oncology, First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, Zhejiang; 4Department of Surgical Oncology, Sir Run Run Shaw Hospital, College of Medicine, Zhejiang University, Hangzhou, Zhejiang; 5Cancer Chemotherapy Center, Zhejiang Cancer Hospital, Zhejiang University of Chinese Medicine, Hangzhou, Zhejiang, P.R. China Received January 26, 2012; Accepted May 8, 2012 DOI: 10.3892/mmr.2012.1051 Abstract. Heterogeneity is one of the recognized characteris- Introduction tics of human tumors, and occurs on multiple levels in a wide range of tumors. A number of studies have focused on the Intratumor heterogeneity is one of the recognized charac- heterogeneity found in primary tumors and related metastases teristics of human tumors, which occurs on multiple levels, with the consideration that the evaluation of metastatic rather including genetic, protein and macroscopic, in a wide range than primary sites could be of clinical relevance. Numerous of tumors, including breast, colorectal cancer (CRC), non- studies have demonstrated particularly high rates of hetero- small cell lung cancer (NSCLC), prostate, ovarian, pancreatic, geneity between primary colorectal tumors and their paired gastric, brain and renal clear cell carcinoma (1). Over the past lymphatic and hepatic metastases. It has also been proposed decade, a number of studies have focused on the heterogeneity that the heterogeneity between primary colon carcinomas and found in primary tumors and related metastases with the their paired lymphatic and hepatic metastases may result in consideration that the evaluation of metastatic rather than different responses to anticancer therapies.
    [Show full text]
  • The Structure, Function and Evolution of the Extracellular Matrix: a Systems-Level Analysis
    The Structure, Function and Evolution of the Extracellular Matrix: A Systems-Level Analysis by Graham L. Cromar A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Department of Molecular Genetics University of Toronto © Copyright by Graham L. Cromar 2014 ii The Structure, Function and Evolution of the Extracellular Matrix: A Systems-Level Analysis Graham L. Cromar Doctor of Philosophy Department of Molecular Genetics University of Toronto 2014 Abstract The extracellular matrix (ECM) is a three-dimensional meshwork of proteins, proteoglycans and polysaccharides imparting structure and mechanical stability to tissues. ECM dysfunction has been implicated in a number of debilitating conditions including cancer, atherosclerosis, asthma, fibrosis and arthritis. Identifying the components that comprise the ECM and understanding how they are organised within the matrix is key to uncovering its role in health and disease. This study defines a rigorous protocol for the rapid categorization of proteins comprising a biological system. Beginning with over 2000 candidate extracellular proteins, 357 core ECM genes and 524 functionally related (non-ECM) genes are identified. A network of high quality protein-protein interactions constructed from these core genes reveals the ECM is organised into biologically relevant functional modules whose components exhibit a mosaic of expression and conservation patterns. This suggests module innovations were widespread and evolved in parallel to convey tissue specific functionality on otherwise broadly expressed modules. Phylogenetic profiles of ECM proteins highlight components restricted and/or expanded in metazoans, vertebrates and mammals, indicating taxon-specific tissue innovations. Modules enriched for medical subject headings illustrate the potential for systems based analyses to predict new functional and disease associations on the basis of network topology.
    [Show full text]
  • The Use of Kosher Phenotyping for Mapping QTL Affecting Susceptibility to Bovine Respiratory Disease
    RESEARCH ARTICLE The Use of Kosher Phenotyping for Mapping QTL Affecting Susceptibility to Bovine Respiratory Disease Ehud Lipkin1☯*, Maria Giuseppina Strillacci2☯, Harel Eitam3, Moran Yishay3, Fausta Schiavini2, Morris Soller1, Alessandro Bagnato2, Ariel Shabtay3* 1 Department of Genetics, Hebrew University of Jerusalem, Jerusalem, Israel, 2 Università degli Studi di Milano (UNIMI), Milan, Italy, 3 Department of Ruminant Sciences, Agricultural Research Organization (ARO), Bet-Dagan, Israel ☯ These authors contributed equally to this work. * [email protected] (EL); [email protected] (AS) Abstract a11111 Bovine respiratory disease (BRD) is the leading cause of morbidity and mortality in feedlot cattle, caused by multiple pathogens that become more virulent in response to stress. As clinical signs often go undetected and various preventive strategies failed, identification of genes affecting BRD is essential for selection for resistance. Selective DNA pooling (SDP) was applied in a genome wide association study (GWAS) to map BRD QTLs in Israeli Hol- OPEN ACCESS stein male calves. Kosher scoring of lung adhesions was used to allocate 122 and 62 Citation: Lipkin E, Strillacci MG, Eitam H, Yishay M, animals to High (Glatt Kosher) and Low (Non-Kosher) resistant groups, respectively. Geno- Schiavini F, Soller M, et al. (2016) The Use of Kosher typing was performed using the Illumina BovineHD BeadChip according to the Infinium pro- Phenotyping for Mapping QTL Affecting Susceptibility tocol. Moving average of -logP was used to map QTLs and Log drop was used to define to Bovine Respiratory Disease. PLoS ONE 11(4): e0153423. doi:10.1371/journal.pone.0153423 their boundaries (QTLRs). The combined procedure was efficient for high resolution map- ping.
    [Show full text]
  • PB #ISAG2017 1 @Isagofficial #ISAG2017 #ISAG2017
    Bioinformatics · Comparative Genomics · Computational Biology Epigenetics · Functional Genomics · Genome Diversity · Geno Genome Sequencing · Immunogenetics · Integrative Geno · Microbiomics · Population Genomics · Systems Biolog Genetic Markers and Selection · Genetics and Dis Gene Editing · Bioinformatics · Comparative Computational Biology · Epigenetics · Fun Genome Diversity · Genome Sequeng Integrative Genomics · Microbiom Population Genomics · Syste Genetic Markers and Sel Genetics and Disease Gene Editing · Bi O’Brien Centre for Science Bioinformati and O’Reilly Hall, University College Dublin, Dublin, Ireland ABSTRACTMINI PROGRAMME BOOK www.isag.us/2017 PB #ISAG2017 1 @isagofficial #ISAG2017 #ISAG2017 Contents ORAL PRESENTATIONS 1 Animal Forensic Genetics Workshop 1 Applied Genetics and Genomics in Other Species of Economic Importance 3 Domestic Animal Sequencing and Annotation 5 Genome Edited Animals 8 Horse Genetics and Genomics 9 Avian Genetics and Genomics 12 Comparative MHC Genetics: Populations and Polymorphism 16 Equine Genetics and Thoroughbred Parentage Testing Workshop 19 Genetics of Immune Response and Disease Resistance 20 ISAG-FAO Genetic Diversity 24 Ruminant Genetics and Genomics 28 Animal Epigenetics 31 Cattle Molecular Markers and Parentage Testing 33 Companion Animal Genetics and Genomics 34 Microbiomes 37 Pig Genetics and Genomics 40 Novel, Groundbreaking Research/Methodology Presentation 44 Applied Genetics of Companion Animals 44 Applied Sheep and Goat Genetics 45 Comparative and Functional Genomics 47 Genetics
    [Show full text]
  • Table S1. 103 Ferroptosis-Related Genes Retrieved from the Genecards
    Table S1. 103 ferroptosis-related genes retrieved from the GeneCards. Gene Symbol Description Category GPX4 Glutathione Peroxidase 4 Protein Coding AIFM2 Apoptosis Inducing Factor Mitochondria Associated 2 Protein Coding TP53 Tumor Protein P53 Protein Coding ACSL4 Acyl-CoA Synthetase Long Chain Family Member 4 Protein Coding SLC7A11 Solute Carrier Family 7 Member 11 Protein Coding VDAC2 Voltage Dependent Anion Channel 2 Protein Coding VDAC3 Voltage Dependent Anion Channel 3 Protein Coding ATG5 Autophagy Related 5 Protein Coding ATG7 Autophagy Related 7 Protein Coding NCOA4 Nuclear Receptor Coactivator 4 Protein Coding HMOX1 Heme Oxygenase 1 Protein Coding SLC3A2 Solute Carrier Family 3 Member 2 Protein Coding ALOX15 Arachidonate 15-Lipoxygenase Protein Coding BECN1 Beclin 1 Protein Coding PRKAA1 Protein Kinase AMP-Activated Catalytic Subunit Alpha 1 Protein Coding SAT1 Spermidine/Spermine N1-Acetyltransferase 1 Protein Coding NF2 Neurofibromin 2 Protein Coding YAP1 Yes1 Associated Transcriptional Regulator Protein Coding FTH1 Ferritin Heavy Chain 1 Protein Coding TF Transferrin Protein Coding TFRC Transferrin Receptor Protein Coding FTL Ferritin Light Chain Protein Coding CYBB Cytochrome B-245 Beta Chain Protein Coding GSS Glutathione Synthetase Protein Coding CP Ceruloplasmin Protein Coding PRNP Prion Protein Protein Coding SLC11A2 Solute Carrier Family 11 Member 2 Protein Coding SLC40A1 Solute Carrier Family 40 Member 1 Protein Coding STEAP3 STEAP3 Metalloreductase Protein Coding ACSL1 Acyl-CoA Synthetase Long Chain Family Member 1 Protein
    [Show full text]
  • Cell Type Marker Enrichment Across Brain Regions and Experimental Conditions
    Cell type marker enrichment across brain regions and experimental conditions by Powell Patrick Cheng Tan B. Sc. (Honours), Simon Fraser University, 2010 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Science in THE FACULTY OF GRADUATE STUDIES (Bioinformatics) The University Of British Columbia (Vancouver) November 2012 c Powell Patrick Cheng Tan, 2012 Abstract The first chapter of this thesis explored the dominant gene expression pattern in the adult human brain. We discovered that the largest source of variation can be explained by cell type marker expression. Across brain regions, expression of neuron cell type markers are anti-correlated with the expression of oligodendrocyte cell type markers. Next, we explored gene function convergence and divergence in the adult mouse brain. Our contributions are as follows. First, we provide candidate cell type markers for investigating specific cell type populations. Second, we highlight orthologous genes that show functional divergence between human and mouse brains. In the second chapter, we present our preliminary work on the effects of tissue types and experimen- tal conditions on human microarray studies. First, we measured the expression and differential expression levels of tissue-enriched genes. Next, we identified modules with similar expression levels and differen- tial expression p-values. Our results show that expression levels reflect tissue type variation. In contrast, differential expression levels are more complex, owing to the large diversity of experimental conditions in the data. In summary, our work provides a different perspective on the functional roles of genes in human microarray studies. ii Table of Contents Abstract .
    [Show full text]
  • Unbalanced Translocations Arise from Diverse Mutational Mechanisms Including Chromothripsis
    Downloaded from genome.cshlp.org on June 15, 2015 - Published by Cold Spring Harbor Laboratory Press Research Unbalanced translocations arise from diverse mutational mechanisms including chromothripsis Brooke Weckselblatt, Karen E. Hermetz, and M. Katharine Rudd Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia 30322, USA Unbalanced translocations are a relatively common type of copy number variation and a major contributor to neurodeve- lopmental disorders. We analyzed the breakpoints of 57 unique unbalanced translocations to investigate the mechanisms of how they form. Fifty-one are simple unbalanced translocations between two different chromosome ends, and six rearrange- ments have more than three breakpoints involving two to five chromosomes. Sequencing 37 breakpoint junctions revealed that simple translocations have between 0 and 4 base pairs (bp) of microhomology (n = 26), short inserted sequences (n = 8), or paralogous repeats (n = 3) at the junctions, indicating that translocations do not arise primarily from nonallelic homol- ogous recombination but instead form most often via nonhomologous end joining or microhomology-mediated break-in- duced replication. Three simple translocations fuse genes that are predicted to produce in-frame transcripts of SIRPG-WWOX, SMOC2-PROX1, and PIEZO2-MTA1, which may lead to gain of function. Three complex translocations have inversions, inser- tions, and multiple breakpoint junctions between only two chromosomes. Whole-genome sequencing and fluorescence in situ hybridization analysis of two de novo translocations revealed at least 18 and 33 breakpoints involving five different chro- mosomes. Breakpoint sequencing of one maternally inherited translocation involving four chromosomes uncovered mul- tiple breakpoints with inversions and insertions. All of these breakpoint junctions had 0–4 bp of microhomology consistent with chromothripsis, and both de novo events occurred on paternal alleles.
    [Show full text]
  • Spatially and Functionally Distinct Subclasses of Breast Cancer-Associated fibroblasts Revealed by Single Cell RNA Sequencing
    ARTICLE DOI: 10.1038/s41467-018-07582-3 OPEN Spatially and functionally distinct subclasses of breast cancer-associated fibroblasts revealed by single cell RNA sequencing Michael Bartoschek 1, Nikolay Oskolkov2, Matteo Bocci 1, John Lövrot 3, Christer Larsson1, Mikael Sommarin4, Chris D. Madsen 1, David Lindgren1, Gyula Pekar5, Göran Karlsson4, Markus Ringnér 2, Jonas Bergh3, Åsa Björklund 6 & Kristian Pietras 1 1234567890():,; Cancer-associated fibroblasts (CAFs) are a major constituent of the tumor microenviron- ment, although their origin and roles in shaping disease initiation, progression and treatment response remain unclear due to significant heterogeneity. Here, following a negative selection strategy combined with single-cell RNA sequencing of 768 transcriptomes of mesenchymal cells from a genetically engineered mouse model of breast cancer, we define three distinct subpopulations of CAFs. Validation at the transcriptional and protein level in several experimental models of cancer and human tumors reveal spatial separation of the CAF subclasses attributable to different origins, including the peri-vascular niche, the mammary fat pad and the transformed epithelium. Gene profiles for each CAF subtype correlate to distinctive functional programs and hold independent prognostic capability in clinical cohorts by association to metastatic disease. In conclusion, the improved resolution of the widely defined CAF population opens the possibility for biomarker-driven development of drugs for precision targeting of CAFs. 1 Division of Translational Cancer Research, Department of Laboratory Medicine, BioCARE, Lund University, Medicon Village, 22381 Lund, Sweden. 2 Department of Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Lund University, Sölvegatan 35, 22362 Lund, Sweden. 3 Department of Oncology and Pathology, Karolinska Institutet, Karolinska Universititetssjukhuset Z1:01, 17176 Stockholm, Sweden.
    [Show full text]
  • Druggable Vitiligo Genome: a Fast Track Approach to Take the Genome Wide Association to the Clinic
    MOJ Proteomics & Bioinformatics Research Article Open Access Druggable vitiligo genome: a fast track approach to take the genome wide association to the clinic Abstract Volume 2 Issue 3 - 2015 Vitiligo, a skin depigmentation disorder, affects 0.5 to 1% of the population Ramaswamy Narayanan around the globe. While not life threatening, the disorder is associated with serious Department of Biological Sciences, Charles E. Schmidt College psychological trauma. Currently no known cure is available and the precise etiology of Science, Florida Atlantic University, USA is unknown. Genome-Wide Association Studies (GWAS) may provide clues to better understanding the disorder for the development of novel therapeutics. Mining Correspondence: Ramaswamy Narayanan, Department of the GWAS databases resulted in the identification of 51 Vitiligo-associated genes Biological Sciences, Charles E. Schmidt College of Science, (VAG) encompassing protein-coding sequences, noncoding RNAs and pseudogenes. Florida Atlantic University, 777 Glades Road, Boca Raton, FL A druggable class of proteins including enzymes, transporters, transcription factors 33431, USA, Tel +15612972247, Fax +15612973859, and secretome products was part of the VAG. The VAG were also genetically linked Email [email protected] to autoimmune, cancer, cardiovascular, inflammation, infections and neurological diseases. Two genes, Interleukin 2 receptor alpha (IL2RA) and Tyrosinase (TYR) are Received: May 28, 2015 | Published: July 24, 2015 FDA approved targets. Unique population-specific genes were identified in the GWAS databases. Chemogenomics approaches identified 246 compounds targeting the VAG. Key pathways involving the VAG’s mechanism included apoptosis, endocrine, immune, infection, metabolic, neuronal and transcription factor signaling. Five lead targets, four enzymes and one transporter, emerged from this study with bioactive drug-like compounds (<100nM).
    [Show full text]
  • Wnt and Other Pathways in Nonsyndromic Tooth Agenesis
    Received: 18 June 2018 | Revised: 26 June 2018 | Accepted: 27 June 2018 DOI: 10.1111/odi.12931 INVITED MEDICAL REVIEW Genetic analysis: Wnt and other pathways in nonsyndromic tooth agenesis Miao Yu1* | Sing‐Wai Wong2,3* | Dong Han1 | Tao Cai4 1Department of Prosthodontics, National Engineering Laboratory for Digital and Abstract Material Technology of Stomatology, Tooth agenesis (TA) is one of the most common developmental anomalies that af‐ Beijing Key Laboratory of Digital Stomatology, Peking University School and fects the number of teeth. An extensive analysis of publicly accessible databases re‐ Hospital of Stomatology, Beijing, China vealed 15 causative genes responsible for nonsyndromic TA, along with their signaling 2Oral and Craniofacial Biomedicine pathways in Wnt/β‐catenin, TGF‐β/BMP, and Eda/Edar/NF‐κB. However, genotype– Curriculum, School of Dentistry, University of North Carolina at Chapel Hill, Chapel Hill, phenotype correlation analysis showed that most of the causal genes are also re‐ North Carolina sponsible for syndromic TA or other conditions. In a total of 198 different mutations 3Immunity, Inflammation, and Disease of the 15 genes responsible for nonsyndromic TA, 182 mutations (91.9%) are derived Laboratory, National Institute of Environmental Health Sciences, Research from seven genes (AXIN2, EDA, LRP6, MSX1, PAX9, WNT10A, and WNT10B) compared Triangle Park, North Carolina with the remaining 16 mutations (8.1%) identified in the remaining eight genes (BMP4, 4National Institute of Dental and Craniofacial Research, NIH, Bethesda, DKK1, EDAR, EDARADD, GREM2, KREMEN1, LTBP3, and SMOC2). Furthermore, speci‐ Maryland ficity analysis in terms of the ratio of nonsyndromic TA mutations versus syndromic Correspondence mutations in each of the aforementioned seven genes showed a 98.2% specificity Dong Han, Department of Prosthodontics, rate in PAX9, 58.9% in WNT10A, 56.6% in MSX1, 41.2% in WNT10B, 31.4% in LRP6, Peking University School and Hospital of Stomatology, 22# Zhongguancun South 23.8% in AXIN2%, and 8.4% in EDA.
    [Show full text]