Mouse Tmem106c Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Tmem106c Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Tmem106c Knockout Project (CRISPR/Cas9) Objective: To create a Tmem106c knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Tmem106c gene (NCBI Reference Sequence: NM_201359 ; Ensembl: ENSMUSG00000052369 ) is located on Mouse chromosome 15. 8 exons are identified, with the ATG start codon in exon 2 and the TAG stop codon in exon 8 (Transcript: ENSMUST00000064200). Exon 2~8 will be selected as target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 2 starts from about 0.13% of the coding region. Exon 2~8 covers 100.0% of the coding region. The size of effective KO region: ~4867 bp. The KO region does not have any other known gene. Page 1 of 9 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 3 4 5 6 7 8 Legends Exon of mouse Tmem106c Knockout region Page 2 of 9 https://www.alphaknockout.com Overview of the Dot Plot (up) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section upstream of start codon is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Overview of the Dot Plot (down) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section downstream of stop codon is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats. Page 3 of 9 https://www.alphaknockout.com Overview of the GC Content Distribution (up) Window size: 300 bp Sequence 12 Summary: Full Length(2000bp) | A(23.75% 475) | C(25.3% 506) | T(25.05% 501) | G(25.9% 518) Note: The 2000 bp section upstream of start codon is analyzed to determine the GC content. Significant high GC-content regions are found. The gRNA site is selected outside of these high GC-content regions. Overview of the GC Content Distribution (down) Window size: 300 bp Sequence 12 Summary: Full Length(2000bp) | A(24.0% 480) | C(25.35% 507) | T(26.35% 527) | G(24.3% 486) Note: The 2000 bp section downstream of stop codon is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 4 of 9 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 2000 1 2000 2000 100.0% chr15 + 97962851 97964850 2000 browser details YourSeq 132 95 689 2000 79.0% chr9 - 53542846 53543090 245 browser details YourSeq 132 536 713 2000 89.3% chr2 + 145955648 145955831 184 browser details YourSeq 128 545 716 2000 88.8% chr11 - 74732800 74732970 171 browser details YourSeq 127 545 716 2000 90.0% chr10 - 61403512 61403890 379 browser details YourSeq 126 544 713 2000 90.9% chr14 + 102397628 102397798 171 browser details YourSeq 126 545 716 2000 88.9% chr1 + 36163767 36163946 180 browser details YourSeq 125 536 713 2000 83.6% chrX + 25501643 25501809 167 browser details YourSeq 125 549 716 2000 87.5% chr2 + 71390408 71390737 330 browser details YourSeq 124 548 713 2000 87.5% chrX - 144270550 144270711 162 browser details YourSeq 123 545 707 2000 90.2% chr4 + 139849381 139849544 164 browser details YourSeq 122 400 689 2000 88.7% chr8 - 32769572 32770137 566 browser details YourSeq 121 545 687 2000 92.4% chr8 - 88244061 88244203 143 browser details YourSeq 121 545 689 2000 91.8% chr12 + 84509301 84509445 145 browser details YourSeq 120 549 716 2000 90.6% chr1 - 54965170 54965342 173 browser details YourSeq 119 543 689 2000 90.5% chr1 - 9841486 9841632 147 browser details YourSeq 119 545 689 2000 91.1% chr4 + 143030059 143030203 145 browser details YourSeq 118 542 689 2000 89.9% chr9 - 106597930 106598077 148 browser details YourSeq 117 545 713 2000 89.4% chrX - 101181833 101182008 176 browser details YourSeq 117 545 689 2000 90.4% chr1 - 80299696 80299840 145 Note: The 2000 bp section upstream of start codon is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 2000 1 2000 2000 100.0% chr15 + 97969718 97971717 2000 browser details YourSeq 139 1845 2000 2000 94.9% chrX + 6858350 6858507 158 browser details YourSeq 137 1849 2000 2000 95.4% chr19 + 21537646 21537799 154 browser details YourSeq 136 1849 2000 2000 96.0% chr17 - 82389282 82389434 153 browser details YourSeq 136 1848 2000 2000 95.4% chr2 + 33888926 33889078 153 browser details YourSeq 135 1832 2000 2000 87.7% chr8 - 94724245 94724406 162 browser details YourSeq 135 1849 2000 2000 94.8% chr5 - 63543892 63544045 154 browser details YourSeq 135 1847 2000 2000 93.4% chr17 - 53464865 53465017 153 browser details YourSeq 135 1849 2000 2000 94.8% chr6 + 35185430 35185582 153 browser details YourSeq 135 1842 2000 2000 93.1% chr5 + 117340337 117340502 166 browser details YourSeq 135 1849 2000 2000 94.8% chr4 + 35213982 35214135 154 browser details YourSeq 135 1848 1999 2000 94.8% chr13 + 92262611 92262764 154 browser details YourSeq 134 1849 2000 2000 95.4% chr7 - 96164071 96164223 153 browser details YourSeq 134 1849 2000 2000 94.1% chr2 - 3497712 3497863 152 browser details YourSeq 134 1849 2000 2000 94.1% chr15 - 35322514 35322665 152 browser details YourSeq 134 1848 2000 2000 94.2% chr14 - 45265181 45265335 155 browser details YourSeq 134 1848 2000 2000 93.4% chr14 + 106164371 106164522 152 browser details YourSeq 134 1845 2000 2000 91.7% chr10 + 59299196 59299350 155 browser details YourSeq 133 1848 2000 2000 94.7% chr8 - 120675421 120675573 153 browser details YourSeq 133 1847 2000 2000 93.6% chr3 - 53775410 53775565 156 Note: The 2000 bp section downstream of stop codon is BLAT searched against the genome. No significant similarity is found. Page 5 of 9 https://www.alphaknockout.com Gene and protein information: Tmem106c transmembrane protein 106C [ Mus musculus (house mouse) ] Gene ID: 380967, updated on 24-Oct-2019 Gene summary Official Symbol Tmem106c provided by MGI Official Full Name transmembrane protein 106C provided by MGI Primary source MGI:MGI:1196384 See related Ensembl:ENSMUSG00000052369 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as AI046681; BC046621; D15Ertd405e Expression Ubiquitous expression in genital fat pad adult (RPKM 24.2), bladder adult (RPKM 18.3) and 28 other tissues See more Orthologs human all Genomic context Location: 15 F1; 15 53.96 cM See Tmem106c in Genome Data Viewer Exon count: 8 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 15 NC_000081.6 (97963177..97970286) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 15 NC_000081.5 (97794710..97800706) Chromosome 15 - NC_000081.6 Page 6 of 9 https://www.alphaknockout.com Transcript information: This gene has 9 transcripts Gene: Tmem106c ENSMUSG00000052369 Description transmembrane protein 106C [Source:MGI Symbol;Acc:MGI:1196384] Gene Synonyms D15Ertd405e Location Chromosome 15: 97,964,200-97,970,275 forward strand. GRCm38:CM001008.2 About this gene This gene has 9 transcripts (splice variants), 166 orthologues and is a member of 1 Ensembl protein family. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Tmem106c- ENSMUST00000064200.8 1623 260aa ENSMUSP00000069764.7 Protein coding CCDS27785 Q80VP8 TSL:1 201 GENCODE basic APPRIS P1 Tmem106c- ENSMUST00000229428.1 1455 260aa ENSMUSP00000154819.1 Protein coding CCDS27785 Q80VP8 GENCODE basic 202 APPRIS P1 Tmem106c- ENSMUST00000231144.1 734 166aa ENSMUSP00000155384.1 Protein coding - A0A2R8VHS8 CDS 3' 209 incomplete Tmem106c- ENSMUST00000229433.1 631 165aa ENSMUSP00000154837.1 Protein coding - A0A2R8VJM7 CDS 3' 203 incomplete Tmem106c- ENSMUST00000230072.1 401 116aa ENSMUSP00000155091.1 Protein coding - A0A2R8W6L2 CDS 3' 205 incomplete Tmem106c- ENSMUST00000230005.1 927 No - Retained - - - 204 protein intron Tmem106c- ENSMUST00000230144.1 727 No - Retained - - - 206 protein intron Tmem106c- ENSMUST00000231079.1 571 No - Retained - - - 208 protein intron Tmem106c- ENSMUST00000230361.1 585 No - lncRNA - - - 207 protein Page 7 of 9 https://www.alphaknockout.com 26.08 kb Forward strand 97.96Mb 97.97Mb 97.98Mb Genes (Comprehensive set... Tmem106c-203 >protein coding Tmem106c-202 >protein coding Tmem106c-201 >protein coding Tmem106c-205 >protein coding Tmem106c-209 >protein coding Tmem106c-206 >retained intron Tmem106c-207 >lncRNA Tmem106c-204 >retained intron Tmem106c-208 >retained intron Contigs AC158787.14 > AC134554.6 > Genes < Col2a1-201protein coding (Comprehensive set... < Col2a1-202protein coding < Col2a1-205lncRNA Regulatory Build 97.96Mb 97.97Mb 97.98Mb Reverse strand 26.08 kb Regulation Legend CTCF Open Chromatin Promoter Promoter Flank Gene Legend Protein Coding Ensembl protein coding merged Ensembl/Havana Non-Protein Coding processed transcript RNA gene Page 8 of 9 https://www.alphaknockout.com Transcript: ENSMUST00000064200 6.05 kb Forward strand Tmem106c-201 >protein coding ENSMUSP00000069... Transmembrane heli... Low complexity (Seg) Pfam Protein of unknown function DUF1356, TMEM106 PANTHER PTHR28556:SF5 Protein of unknown function DUF1356, TMEM106 All sequence SNPs/i... Sequence variants (dbSNP and all other sources) Variant Legend missense variant synonymous variant Scale bar 0 40 80 120 160 200 260 We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC. Page 9 of 9.
Recommended publications
  • Evolutionary Plasticity in Detoxification Gene Modules: the Preservation
    International Journal of Molecular Sciences Article Evolutionary Plasticity in Detoxification Gene Modules: The Preservation and Loss of the Pregnane X Receptor in Chondrichthyes Lineages Elza S. S. Fonseca 1,2, Raquel Ruivo 1 , André M. Machado 1 , Francisca Conrado 1, Boon-Hui Tay 3, Byrappa Venkatesh 3, Miguel M. Santos 1,2 and L. Filipe C. Castro 1,2,* 1 CIIMAR/CIMAR—Interdisciplinary Centre of Marine and Environmental Research, 4450-208 Matosinhos, Portugal; [email protected] (E.S.S.F.); [email protected] (R.R.); [email protected] (A.M.M.); fi[email protected] (F.C.); [email protected] (M.M.S.) 2 FCUP—Faculty of Sciences, Department of Biology, University of Porto, 4150-177 Porto, Portugal 3 Comparative Genomics Laboratory, Institute of Molecular and Cell Biology, A*STAR (Agency for Science, Technology and Research), Biopolis, Singapore 138673, Singapore; [email protected] (B.-H.T.); [email protected] (B.V.) * Correspondence: fi[email protected]; Tel.: +351-22-3401800 Received: 25 February 2019; Accepted: 6 May 2019; Published: 10 May 2019 Abstract: To appraise how evolutionary processes, such as gene duplication and loss, influence an organism’s xenobiotic sensitivity is a critical question in toxicology. Of particular importance are gene families involved in the mediation of detoxification responses, such as members of the nuclear receptor subfamily 1 group I (NR1I), the pregnane X receptor (PXR), and the constitutive androstane receptor (CAR). While documented in multiple vertebrate genomes, PXR and CAR display an intriguing gene distribution. PXR is absent in birds and reptiles, while CAR shows a tetrapod-specific occurrence.
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Increased Expression of the Frontotemporal Dementia Risk Factor Tmem106b Causes C9orf72-Dependent Alterations in Lysosomes
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2016 Increased Expression of Frontotemporal Dementia Risk Factor Tmem106b Alters Lysosomal and Autophagosomal Pathways Johanna Irene Busch University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Cell Biology Commons, and the Neuroscience and Neurobiology Commons Recommended Citation Busch, Johanna Irene, "Increased Expression of Frontotemporal Dementia Risk Factor Tmem106b Alters Lysosomal and Autophagosomal Pathways" (2016). Publicly Accessible Penn Dissertations. 1630. https://repository.upenn.edu/edissertations/1630 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/1630 For more information, please contact [email protected]. Increased Expression of Frontotemporal Dementia Risk Factor Tmem106b Alters Lysosomal and Autophagosomal Pathways Abstract Frontotemporal lobar degeneration (FTLD) is an important cause of dementia in individuals under age 65. Common variants in the TMEM106B gene were previously discovered by genome-wide association (GWAS) to confer genetic risk for FTLD-TDP, the largest neuropathological subset of FTLD (p=1x10-11, OR=1.6). Prior to its discovery in the GWAS, TMEM106B, or Transmembrane Protein 106B, was uncharacterized. To further understand the role of TMEM106B in disease pathogenesis, we used immortalized as well as primary neurons to assess the cell biological effects of disease-relevant levels of TMEM106B overexpression and the interaction of TMEM106B with additional disease-associated proteins. We also employed immunostaining to assess its expression pattern in human brain from controls and FTLD cases. We discovered that TMEM106B is a highly glycosylated, Type II late endosomal/lysosomal transmembrane protein. We found that it is expressed by neurons, glia, and peri-vascular cells in disease- affected and unaffected regions of human brain from normal controls in a cytoplasmic, perikaryal distribution.
    [Show full text]
  • Evolutionary History of Tibetans Inferred from Whole-Genome Sequencing
    RESEARCH ARTICLE Evolutionary history of Tibetans inferred from whole-genome sequencing Hao Hu1, Nayia Petousi2, Gustavo Glusman3, Yao Yu1, Ryan Bohlender4, Tsewang Tashi5, Jonathan M. Downie6, Jared C. Roach3, Amy M. Cole7, Felipe R. Lorenzo5, Alan R. Rogers4, Mary E. Brunkow3, Gianpiero Cavalleri7, Leroy Hood3, Sama M. Alpatty8, Josef T. Prchal5,6☯, Lynn B. Jorde6☯, Peter A. Robbins9☯, Tatum S. Simonson10☯, Chad D. Huff1☯* 1 Department of Epidemiology, University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America, 2 Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom, a1111111111 3 Institute for Systems Biology, Seattle, Washington, United States of America, 4 Department of a1111111111 Anthropology, University of Utah, Salt Lake City, Utah, United States of America, 5 Department of Medicine, a1111111111 University of Utah School of Medicine and George E. Wahlin Veterans Administration Medical Center, Salt a1111111111 Lake City, Utah, United States of America, 6 Department of Human Genetics, University of Utah, Salt Lake a1111111111 City, Utah, United States of America, 7 Department of Molecular and Cellular Therapeutics, The Royal College of Surgeons in Ireland, Dublin, Ireland, 8 Skaggs School of Pharmacy and Pharmaceutical Science, UC San Diego, La Jolla, California, United States of America, 9 Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom, 10 Department of Medicine, Division of Physiology, University of California San Diego, La Jolla, California, United States of America OPEN ACCESS ☯ These authors contributed equally to this work. * [email protected] Citation: Hu H, Petousi N, Glusman G, Yu Y, Bohlender R, Tashi T, et al. (2017) Evolutionary history of Tibetans inferred from whole-genome sequencing.
    [Show full text]
  • Nuclear Receptors in Metazoan Lineages: the Cross-Talk Between Evolution and Endocrine Disruption
    Nuclear Receptors in Metazoan lineages: the cross -talk between Evolution and Endocrine Disruption Elza Sofia Silva Fonseca Tese de Doutoramento apresentada à Faculdade de Ciências da Universidade do Porto Biologia D 2020 Nuclear Receptors in Metazoan lineages: the cross-talk between Evolution and Endocrine Disruption D Elza Sofia Silva Foseca Doutoramento em Biologia Departamento de Biologia 2020 Orientador Doutor Luís Filipe Costa Castro, Professor Auxiliar, Faculdade de Ciências da Universidade do Porto, Centro Interdisciplinar de Investigação Marinha e Ambiental (CIIMAR) Coorientador Professor Doutor Miguel Alberto Fernandes Machado e Santos, Professor Auxiliar, Faculdade de Ciências da Universidade do Porto Centro Interdisciplinar de Investigação Marinha e Ambiental (CIIMAR) FCUP i Nuclear Receptors in Metazoan lineages: the cross-talk between Evolution and Endocrine Disruption This thesis was supported by FCT (ref: SFRH/BD/100262/2014), Norte2020 and FEDER (Coral – Sustainable Ocean Exploitation – Norte-01-0145-FEDER-000036 and EvoDis – Norte-01-0145-FEDER-031342). ii FCUP Nuclear Receptors in Metazoan lineages: the cross-talk between Evolution and Endocrine Disruption The present thesis is organized into seven chapters. Chapter 1 consists of a general introduction, providing an overview on Metazoa definition, and a review on the current knowledge of evolution and function of nuclear receptors and their role in endocrine disruption processes. Chapters 2, 3, 4 and 6 correspond to several projects developed during the doctoral programme presented here as independent articles, listed below (three articles published in peer reviewed international journals and one article in final preparation for submission). Chapter 5 was adapted from an article published in a peer reviewed international journal (listed below), in which I executed the methodology regarding the structural and functional analyses of rotifer RXR and I contributed to the writing of the sections referring to these analyses (Material and Methods, Results and Discussion).
    [Show full text]
  • Content Based Search in Gene Expression Databases and a Meta-Analysis of Host Responses to Infection
    Content Based Search in Gene Expression Databases and a Meta-analysis of Host Responses to Infection A Thesis Submitted to the Faculty of Drexel University by Francis X. Bell in partial fulfillment of the requirements for the degree of Doctor of Philosophy November 2015 c Copyright 2015 Francis X. Bell. All Rights Reserved. ii Acknowledgments I would like to acknowledge and thank my advisor, Dr. Ahmet Sacan. Without his advice, support, and patience I would not have been able to accomplish all that I have. I would also like to thank my committee members and the Biomed Faculty that have guided me. I would like to give a special thanks for the members of the bioinformatics lab, in particular the members of the Sacan lab: Rehman Qureshi, Daisy Heng Yang, April Chunyu Zhao, and Yiqian Zhou. Thank you for creating a pleasant and friendly environment in the lab. I give the members of my family my sincerest gratitude for all that they have done for me. I cannot begin to repay my parents for their sacrifices. I am eternally grateful for everything they have done. The support of my sisters and their encouragement gave me the strength to persevere to the end. iii Table of Contents LIST OF TABLES.......................................................................... vii LIST OF FIGURES ........................................................................ xiv ABSTRACT ................................................................................ xvii 1. A BRIEF INTRODUCTION TO GENE EXPRESSION............................. 1 1.1 Central Dogma of Molecular Biology........................................... 1 1.1.1 Basic Transfers .......................................................... 1 1.1.2 Uncommon Transfers ................................................... 3 1.2 Gene Expression ................................................................. 4 1.2.1 Estimating Gene Expression ............................................ 4 1.2.2 DNA Microarrays ......................................................
    [Show full text]
  • Discerning the Role of Foxa1 in Mammary Gland
    DISCERNING THE ROLE OF FOXA1 IN MAMMARY GLAND DEVELOPMENT AND BREAST CANCER by GINA MARIE BERNARDO Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy Dissertation Adviser: Dr. Ruth A. Keri Department of Pharmacology CASE WESTERN RESERVE UNIVERSITY January, 2012 CASE WESTERN RESERVE UNIVERSITY SCHOOL OF GRADUATE STUDIES We hereby approve the thesis/dissertation of Gina M. Bernardo ______________________________________________________ Ph.D. candidate for the ________________________________degree *. Monica Montano, Ph.D. (signed)_______________________________________________ (chair of the committee) Richard Hanson, Ph.D. ________________________________________________ Mark Jackson, Ph.D. ________________________________________________ Noa Noy, Ph.D. ________________________________________________ Ruth Keri, Ph.D. ________________________________________________ ________________________________________________ July 29, 2011 (date) _______________________ *We also certify that written approval has been obtained for any proprietary material contained therein. DEDICATION To my parents, I will forever be indebted. iii TABLE OF CONTENTS Signature Page ii Dedication iii Table of Contents iv List of Tables vii List of Figures ix Acknowledgements xi List of Abbreviations xiii Abstract 1 Chapter 1 Introduction 3 1.1 The FOXA family of transcription factors 3 1.2 The nuclear receptor superfamily 6 1.2.1 The androgen receptor 1.2.2 The estrogen receptor 1.3 FOXA1 in development 13 1.3.1 Pancreas and Kidney
    [Show full text]
  • Sea Anemone Genome Reveals the Gene Repertoire and Genomic Organization of the Eumetazoan Ancestor
    Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory Title Sea anemone genome reveals the gene repertoire and genomic organization of the eumetazoan ancestor Permalink https://escholarship.org/uc/item/3b01p9bc Authors Putnam, Nicholas H. Srivastava, Mansi Hellsten, Uffe et al. Publication Date 2007 Peer reviewed eScholarship.org Powered by the California Digital Library University of California Sea anemone genome reveals the gene repertoire and genomic organization of the eumetazoan ancestor Nicholas H. Putnam[1], Mansi Srivastava[2], Uffe Hellsten[1], Bill Dirks[2], Jarrod Chapman[1], Asaf Salamov[1], Astrid Terry[1], Harris Shapiro[1], Erika Lindquist[1], Vladimir V. Kapitonov[3], Jerzy Jurka[3], Grigory Genikhovich[4], Igor Grigoriev[1], JGI Sequencing Team[1], Robert E. Steele[5], John Finnerty[6], Ulrich Technau[4], Mark Q. Martindale[7], Daniel S. Rokhsar[1,2] [1] Department of Energy Joint Genome Institute, Walnut Creek, CA 94598 [2] Center for Integrative Genomics and Department of Molecular and Cell Biology, University of California, Berkeley CA 94720 [3] Genetic Information Research Institute, 1925 Landings Drive, Mountain View, CA 94043 [4] Sars International Centre for Marine Molecular Biology, University of Bergen, Thormoeøhlensgt 55; 5008, Bergen, Norway [5] Department of Biological Chemistry and the Developmental Biology Center, University of California, Irvine, CA 92697 [6] Department of Biology, Boston University, Boston, MA 02215 [7] Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI 96813 Abstract Sea anemones are seemingly primitive animals that, along with corals, jellyfish, and hydras, constitute the Cnidaria, the oldest eumetazoan phylum. Here we report a comparative analysis of the draft genome of an emerging cnidarian model, the starlet anemone Nematostella vectensis.
    [Show full text]
  • Leveraging Models of Cell Regulation and GWAS Data in Integrative Network-Based Association Studies
    PERSPECTIVE Leveraging models of cell regulation and GWAS data in integrative network-based association studies Andrea Califano1–3,11, Atul J Butte4,5, Stephen Friend6, Trey Ideker7–9 & Eric Schadt10,11 Over the last decade, the genome-wide study of both heritable and straightforward: within the space of all possible genetic and epigenetic somatic human variability has gone from a theoretical concept to a variants, those contributing to a specific trait or disease likely have broadly implemented, practical reality, covering the entire spectrum some coalescent properties, allowing their effect to be functionally of human disease. Although several findings have emerged from these canalized via the cell communication and cell regulatory machin- studies1, the results of genome-wide association studies (GWAS) have ery that allows distinct cells to interact and regulates their behavior. been mostly sobering. For instance, although several genes showing Notably, contrary to random networks, whose output is essentially medium-to-high penetrance within heritable traits were identified by unconstrained, regulatory networks produced by adaptation to spe- these approaches, the majority of heritable genetic risk factors for most cific fitness landscapes are optimized to produce only a finite number common diseases remain elusive2–7. Additionally, due to impractical of well-defined outcomes as a function of a very large number of exog- requirements for cohort size8 and lack of methodologies to maximize enous and endogenous signals. Thus, if a comprehensive and accurate power for such detections, few epistatic interactions and low-pen- map of all intra- and intercellular molecular interactions were avail- etrance variants have been identified9. At the opposite end of the able, then genetic and epigenetic events implicated in a specific trait or germline versus somatic event spectrum, considering that tumor cells disease should cluster in subnetworks of closely interacting genes.
    [Show full text]
  • A Network Inference Approach to Understanding Musculoskeletal
    A NETWORK INFERENCE APPROACH TO UNDERSTANDING MUSCULOSKELETAL DISORDERS by NIL TURAN A thesis submitted to The University of Birmingham for the degree of Doctor of Philosophy College of Life and Environmental Sciences School of Biosciences The University of Birmingham June 2013 University of Birmingham Research Archive e-theses repository This unpublished thesis/dissertation is copyright of the author and/or third parties. The intellectual property rights of the author or third parties in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this thesis/dissertation must be in accordance with that legislation and must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the permission of the copyright holder. ABSTRACT Musculoskeletal disorders are among the most important health problem affecting the quality of life and contributing to a high burden on healthcare systems worldwide. Understanding the molecular mechanisms underlying these disorders is crucial for the development of efficient treatments. In this thesis, musculoskeletal disorders including muscle wasting, bone loss and cartilage deformation have been studied using systems biology approaches. Muscle wasting occurring as a systemic effect in COPD patients has been investigated with an integrative network inference approach. This work has lead to a model describing the relationship between muscle molecular and physiological response to training and systemic inflammatory mediators. This model has shown for the first time that oxygen dependent changes in the expression of epigenetic modifiers and not chronic inflammation may be causally linked to muscle dysfunction.
    [Show full text]
  • Supplementary Table S2. List of Genes with Expression That Is Positively Correlated (Pearson Correlation Coefficient P>0.3)
    Supplementary Table S2. List of genes with expression that is positively correlated (Pearson correlation coefficient p>0.3) with HOXA9 expression in the study by Sun et al (1). Most HOXA genes (indicated by bold face) show highly significant positive correlation with HOXA9 expression, except for HOXA6 and HOXA13, similar to our findings in the UCSF and MDA tumor sets. Note that HOXA11, which did not demonstrate a statistically significant positive correlation with HOXA9 expression in both the UCSF and MDA tumor sets, demonstrates a substantially lower correlation coefficient relative to the other HOXA genes that positively correlate with HOXA9 expression in the study by Sun et al. These data were obtained from the online ONCOMINE database (www.oncomine.org) (2) searching for transcripts positively correlated with HOXA9 expression. Gene Name Gene Symbol Reporter ID Correlation (p) homeobox A9 HOXA9 214651_s_at .7682 homeobox A9 HOXA9 209905_at .7682 homeobox A10 HOXA10 213150_at .6058 homeobox A7 HOXA7 235753_at .6058 homeobox A10 HOXA10 213147_at .6058 homeobox A7 HOXA7 206847_s_at .6058 homeobox A4 HOXA4 206289_at .5741 homeobox A2 HOXA2 1557051_s_at .5379 homeobox A1 HOXA1 214639_s_at .5379 homeobox A3 HOXA3 235521_at .5379 homeobox B2 HOXB2 205453_at .5379 homeobox B3 HOXB3 228904_at .5379 EST EST 1555907_at .5379 homeobox A4 HOXA4 230080_at .5379 homeobox A2 HOXA2 228642_at .5379 homeobox A2 HOXA2 1557050_at .5379 homeobox A5 HOXA5 213844_at .5379 homeobox A2 HOXA2 214457_at .5113 homeobox B7 HOXB7 204778_x_at .5042 homeobox C6 HOXC6
    [Show full text]
  • Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Fall 2010 Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress Renuka Nayak University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Computational Biology Commons, and the Genomics Commons Recommended Citation Nayak, Renuka, "Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress" (2010). Publicly Accessible Penn Dissertations. 1559. https://repository.upenn.edu/edissertations/1559 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/1559 For more information, please contact [email protected]. Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress Abstract Genes interact in networks to orchestrate cellular processes. Here, we used coexpression networks based on natural variation in gene expression to study the functions and interactions of human genes. We asked how these networks change in response to stress. First, we studied human coexpression networks at baseline. We constructed networks by identifying correlations in expression levels of 8.9 million gene pairs in immortalized B cells from 295 individuals comprising three independent samples. The resulting networks allowed us to infer interactions between biological processes. We used the network to predict the functions of poorly-characterized human genes, and provided some experimental support. Examining genes implicated in disease, we found that IFIH1, a diabetes susceptibility gene, interacts with YES1, which affects glucose transport. Genes predisposing to the same diseases are clustered non-randomly in the network, suggesting that the network may be used to identify candidate genes that influence disease susceptibility.
    [Show full text]