Function of Core Promoters in Differential Gene Regulation During Embryogenesis

Total Page:16

File Type:pdf, Size:1020Kb

Function of Core Promoters in Differential Gene Regulation During Embryogenesis FUNCTION OF CORE PROMOTERS IN DIFFERENTIAL GENE REGULATION DURING EMBRYOGENESIS By JOCHEN GEHRIG A thesis submitted to The University of Birmingham for the degree of DOCTOR OF PHILOSOPHY Department of Medical and Molecular Genetics School of Clinical and Experimental Medicine College of Medical and Dental Sciences The University of Birmingham January 2010 University of Birmingham Research Archive e-theses repository This unpublished thesis/dissertation is copyright of the author and/or third parties. The intellectual property rights of the author or third parties in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this thesis/dissertation must be in accordance with that legislation and must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the permission of the copyright holder. Abstract The core promoter is the ultimate target of all transcriptional regulatory processes. The recently discovered diversity of core promoters and basal transcription factors suggests a regulatory role in differential gene expression. However, the direct contribution of the core promoter remains poorly understood. I investigated core promoters and their putative role in differential gene regulation using the zebrafish embryo as an in vivo model system. To analyse the functional requirement for the general transcription factor TATA‐box binding protein (TBP), a diverse set of promoters was tested for their TBP dependence. This analysis revealed a differential requirement of TBP for promoter activity. To further explore the roles of core promoters the ability of various core promoters to interact with tissue‐specific enhancers was investigated. A high‐throughput pipeline combining automated imaging with custom‐designed software for registration of spatial reporter gene activity in thousands of zebrafish embryos was developed. The technology was applied in a large‐scale screen analysing the tissue specific activities of 202 enhancer ‐ core promoter combinations. A variety of interaction specificities observed suggests an important role of the core promoter in combinatorial gene regulation. Overall, these findings indicate that the core promoter significantly contributes to differential transcriptional regulation in the vertebrate embryo. I Acknowledgements First I would like to thank my supervisor Dr Ferenc Müller for accepting me as a PhD student and for his continuous support, advice, encouragement and enthusiasm throughout my research. Thanks also go to all past and current members of the laboratory in Germany and in the UK for helpful discussions and the great working atmosphere: Marco Ferg, Yavor Hadzhiev, Eva Kalmar, Nan Li, Agnes Lovas, Matthew Rawlings, Jenny Roberts, Simone Schindler, Chengyi Song and Andreas Zaucker. I am especially grateful to all the people involved in the screening project: Dr Urban Liebel (Institute of Toxicology and Genetics (ITG), Karlsruhe Institute of Technology (KIT), Germany) for providing the screening platform and for his support and enthusiasm, Dr Markus Reischl (Institute for Applied Computer Sciences, KIT, Germany) for the development of the custom design software and the great collaboration despite the geographical distance, Eva Kalmar for her initial work and the good teamwork, and all the helping hands involved in the handling of thousands of zebrafish embryos: Marco Ferg, Andreas Zaucker, Simone Schindler, Yavor Hadzhiev, Chengyi Song and Nadine Gröbner (ITG, KIT). I would like to thank Dr Agnes Lovas for preliminary experiments and sharing 5’‐RACE data. Finally, I would like to thank my parents and my partner Silke for their continued emotional support and encouragement during this project. II Table of contents 1. GENERAL INTRODUCTION ............................................................................................................... 1 1.1. Regulation of RNA polymerase II dependent transcription .................................................... 1 1.1.1. A classical model of transcription initiation .................................................................................. 3 1.2. Diversity of core promoters .................................................................................................... 4 1.2.1. Promoter discovery in eukaryotic genomes ................................................................................. 4 1.2.2. Transcription start site distributions define promoter classes ..................................................... 6 1.2.3. Alternative promoters ................................................................................................................... 8 1.2.4. Bidirectional promoters .............................................................................................................. 10 1.2.5. Chromatin modifications at the core promoter .......................................................................... 10 1.2.6. Core promoter elements ............................................................................................................. 12 1.2.6.1. The TATA‐box .......................................................................................................................... 14 1.2.6.2. The initiator ............................................................................................................................ 15 1.2.6.3. The Downstream promoter element ...................................................................................... 16 1.2.6.4. The TFIIB recognition elements .............................................................................................. 17 1.2.6.5. Other core promoter elements............................................................................................... 18 1.2.7. CpG islands .................................................................................................................................. 19 1.3. Diversity of core promoter recognition factors .................................................................... 20 1.3.1. Variability in TAF composition of TFIID ....................................................................................... 20 1.3.2. The TATA‐box binding protein .................................................................................................... 23 1.3.3. The TBP related factor 1 .............................................................................................................. 26 1.3.4. The TBP like factor ...................................................................................................................... 26 1.3.5. The TATA‐box binding protein 2 ................................................................................................. 28 1.4. Core promoters in combinatorial gene regulation ............................................................... 30 1.4.1. Cis‐regulatory elements besides the core promoter .................................................................. 30 1.4.1.1. The proximal promoter ........................................................................................................... 31 1.4.1.2. Enhancers and Silencers ......................................................................................................... 32 1.4.1.3. Insulators ................................................................................................................................ 33 1.4.2. Transcription factors and enhancer activity................................................................................ 34 1.4.2.1. Activator mediated initiation of transcription ........................................................................ 36 1.4.3. Enhancer‐promoter interaction specificity ................................................................................. 38 1.4.3.1. Core promoter architecture and enhancer‐promoter specificity ........................................... 39 1.4.3.2. Promoter features of targets of long‐range regulation .......................................................... 40 1.5. Cis‐regulatory elements and human disease ........................................................................ 41 1.6. The maternal to zygotic transition ........................................................................................ 44 1.6.1. Regulation of transcription and mRNA levels at the MZT ........................................................... 45 1.6.2. The role of TBP‐family members in the mid‐blastula transition ................................................. 47 1.7. Zebrafish as a model organism ............................................................................................. 50 1.7.1. Studying gene regulatory function by transgenesis in zebrafish ................................................ 52 III 1.7.2. Large‐scale screening approaches .............................................................................................. 54 2. MATERIALS AND METHODS .......................................................................................................... 56 2.1. Materials ............................................................................................................................... 56 2.1.1. Chemicals .................................................................................................................................... 56 2.1.2. Enzymes .....................................................................................................................................
Recommended publications
  • Ferguson, Alison (2012) Identification and Characterisation of Arabidopsis
    IDENTIFICATION AND CHARACTERISATION OF ARABIDOPSIS ER ACCESSORY PROTEINS Alison Ferguson, BSc Thesis submitted to the University of Nottingham for the degree of Doctor of Philosophy February 2012 i ABSTRACT ER accessory proteins are a novel class of endoplasmic reticulum (ER) proteins that facilitate the exit of polytopic membrane proteins from the ER. They are important for the correct targeting of their cognate polytopic membrane proteins to the plasma membrane (PM) and their absence leads to abnormal accumulation of their target in the ER. Until recently, it was not clear if such proteins exist in plants. However, work by Dharmasiri et al (2006) and Gonzales et al (2005) suggest that such proteins exists in plants too. Polytopic membrane proteins such as nutrient transporters, hormone transporters and sugar transporters are a very important class of proteins as they regulate many important physiological and biochemical processes. Better understanding of the targeting of these proteins to the PM is of considerable agronomic interest due to the importance of efficient use of resources in sustainable agriculture. One of the projects aims is to identify novel ER accessory proteins in Arabidopsis. Using a bioinformatics approach, 40 novel ER resident proteins were identified from a protein localisation database (LOPIT) generated by Dunkley et al (2006) as potential candidates for ER accessory proteins. Genetic, phenotypic and molecular approaches have been used to assess their role as potential ER accessory proteins. A few promising candidates have been identified, one of which AtBPL1 and related family. The AtBPL1 family has similarity to mammalian BAP31 which has been shown to function as an ER accessory protein (Ladasky et al, 2006).
    [Show full text]
  • Development of Selective Electrophoresis for Proteins and Peptides Within Proteomes
    DDEVELOPMENT OF SELECTIVE ELECTROPHORESIS FOR PROTEINS AND PEPTIDES WITHIN PROTEOMES Thesis submitted in fulfilment of the requirements for the degree of Doctor of Philosophy LINDA LY School of Biotechnology and Biomolecular Sciences The University of New South Wales August 2008 THE UNIVERSITY OF NEW SOUTH WALES Thesis/Dissertation Sheet Surname or Family name: LY First name: LINDA Other name/s: Abbreviation for degree as given in the University calendar: PhD School: Biotechnology and Biomolecular Sciences Faculty: Science Title: Development of selective electrophoresis for proteins and peptides within proteomes Abstract 350 words maximum: Analysis of complex protein samples is demanding due to the wide dynamic range of expression levels and the limited detection range of technology. Proteomics relies heavily on the development of new fractionation strategies to help reduce complexity, and overcome the technological and biological challenges associated with proteome analysis. Here, the development of a prototype instrument named ‘Microflow MF10’ was explored to enrich for particular classes of proteins. The MF10 was found to have a number of advantages over commercially available fractionation systems. Due to the reduced separation electrode distance, fractionation was rapid, occuring within ~0.125 kVH over 2-6 fractions under native conditions but longer under denaturing conditions. As low as 2 ng peptide could be fractionated with recovery for downstream analysis achievable. The ability to alter protein charge by changing the pH (acidic (pI 3.6) to basic environments (pI 10.4)) allows selection of proteins based on charge/mobility, size, shape, buffer ionic strength, pH and field strength. Proteins <10 kDa are also not routinely analysed because current technology is unable to cater for this region of the proteome.
    [Show full text]
  • The Structure, Function and Evolution of the Extracellular Matrix: a Systems-Level Analysis
    The Structure, Function and Evolution of the Extracellular Matrix: A Systems-Level Analysis by Graham L. Cromar A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Department of Molecular Genetics University of Toronto © Copyright by Graham L. Cromar 2014 ii The Structure, Function and Evolution of the Extracellular Matrix: A Systems-Level Analysis Graham L. Cromar Doctor of Philosophy Department of Molecular Genetics University of Toronto 2014 Abstract The extracellular matrix (ECM) is a three-dimensional meshwork of proteins, proteoglycans and polysaccharides imparting structure and mechanical stability to tissues. ECM dysfunction has been implicated in a number of debilitating conditions including cancer, atherosclerosis, asthma, fibrosis and arthritis. Identifying the components that comprise the ECM and understanding how they are organised within the matrix is key to uncovering its role in health and disease. This study defines a rigorous protocol for the rapid categorization of proteins comprising a biological system. Beginning with over 2000 candidate extracellular proteins, 357 core ECM genes and 524 functionally related (non-ECM) genes are identified. A network of high quality protein-protein interactions constructed from these core genes reveals the ECM is organised into biologically relevant functional modules whose components exhibit a mosaic of expression and conservation patterns. This suggests module innovations were widespread and evolved in parallel to convey tissue specific functionality on otherwise broadly expressed modules. Phylogenetic profiles of ECM proteins highlight components restricted and/or expanded in metazoans, vertebrates and mammals, indicating taxon-specific tissue innovations. Modules enriched for medical subject headings illustrate the potential for systems based analyses to predict new functional and disease associations on the basis of network topology.
    [Show full text]
  • COIMBATORE – 641 046 Supportive
    M.Sc. Zoology-16-17 onwards -revised-UD Annexure No:52A Page 1 of 40 SCAA Dated: 03.07.2017 BHARATHIAR UNIVERSITY: COIMBATORE – 641 046 M. Sc. ZOOLOGY (CBCS – Univ. Dept.) (For the students admitted for the academic year 2016-2017 batch and onwards-Revised) Scheme of Examination Title of the Paper ZOOA Paper 6 Internal External Semester 1 /Supportive Hours/week Instructional Instructional Total Marks Total Subject Code Subject Core/Elective Core/Elective Credits Total Core-I 13A Functional Morphology of Invertebrate and Vertebrates 4 25 75 100 4 Core-II 13B Molecular Cell Biology 4 25 75 100 4 Core-III 13C Animal Physiology 4 25 75 100 4 Core-IV 13D Endocrinology 4 25 75 100 4 Lab Course - I 13P Functional Morphology of Invertebrate and Vertebrates, 8 25 75 100 4 I Molecular Cell Biology, Animal Physiology and Endocrinology Elective-I 1EA Nanobiotechnology and Applications 4 25 75 100 4 1EB Applied Entomology Supportive-I 1GS Offered from other Departments 2 12 38 50 2 Core-V 23A Experimental Embryology 4 25 75 100 4 Core-VI 23B Immunology 4 25 75 100 4 Core-VII 23C Molecular Genetics 4 25 75 100 4 Core-VIII 23D Biochemistry and Biostatistics 4 25 75 100 4 II Lab Course-II 23P Experimental Embryology, Immunology, Molecular 8 25 75 100 4 Genetics and Biochemistry & Biostatistics Elective-II 2EA Molecular Taxonomy 4 25 75 100 4 2EB Global Warming – Animal Migration and Behaviour Supportive-II 2GS Offered from other Departments 2 12 38 50 2 Core-IX 33A Animal Phylogeny and Evolution 4 25 75 100 4 Core-X 33B Environmental Biology and Toxicology
    [Show full text]
  • What's That Gene
    What’s that gene (or protein)? Online resources for exploring functions of genes, transcripts, and proteins James Hutchins To cite this version: James Hutchins. What’s that gene (or protein)? Online resources for exploring functions of genes, transcripts, and proteins. Molecular Biology of the Cell, American Society for Cell Biology, 2014, 25 (8), pp.1187-1201. 10.1091/mbc.E13-10-0602. hal-02354657 HAL Id: hal-02354657 https://hal.archives-ouvertes.fr/hal-02354657 Submitted on 7 Nov 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. M BoC | TECHNICAL PERSPECTIVE What’s that gene (or protein)? Online resources for exploring functions of genes, transcripts, and proteins James R. A. Hutchins Institute of Human Genetics, Centre National de la Recherche Scientifique (CNRS), 34396 Montpellier, France ABSTRACT The genomic era has enabled research projects that use approaches including Monitoring Editor genome-scale screens, microarray analysis, next-generation sequencing, and mass spectrom- Doug Kellogg etry–based proteomics to discover genes and proteins involved in biological processes. Such University of California, Santa Cruz methods generate data sets of gene, transcript, or protein hits that researchers wish to ex- plore to understand their properties and functions and thus their possible roles in biological Received: Oct 21, 2013 systems of interest.
    [Show full text]
  • Genetic Analysis of Rainbow Trout
    Graduate Theses, Dissertations, and Problem Reports 2006 Genetic analysis of rainbow trout (Oncorhynchus mykiss): Strain identification via microsatellites and analysis of expressed sequence tags in intestine, liver, kidney, and ovary Amanda B. Stewart West Virginia University Follow this and additional works at: https://researchrepository.wvu.edu/etd Recommended Citation Stewart, Amanda B., "Genetic analysis of rainbow trout (Oncorhynchus mykiss): Strain identification via microsatellites and analysis of expressed sequence tags in intestine, liver, kidney, and ovary" (2006). Graduate Theses, Dissertations, and Problem Reports. 4272. https://researchrepository.wvu.edu/etd/4272 This Dissertation is protected by copyright and/or related rights. It has been brought to you by the The Research Repository @ WVU with permission from the rights-holder(s). You are free to use this Dissertation in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you must obtain permission from the rights-holder(s) directly, unless additional rights are indicated by a Creative Commons license in the record and/ or on the work itself. This Dissertation has been accepted for inclusion in WVU Graduate Theses, Dissertations, and Problem Reports collection by an authorized administrator of The Research Repository @ WVU. For more information, please contact [email protected]. Genetic Analysis of Rainbow Trout (Oncorhynchus mykiss): Strain Identification via Microsatellites and Analysis of Expressed Sequence Tags in Intestine, Liver, Kidney, and Ovary Amanda B. Stewart Dissertation submitted to the College of Agriculture, Forestry and Consumer Sciences at West Virginia University in partial fulfillment of the requirements for the degree of Doctor of Philosophy In Agricultural Sciences Robert A.
    [Show full text]
  • Research Platelet Membrane Proteomics
    From bloodjournal.hematologylibrary.org at GRAND VALLEY STATE UNIV on November 19, 2013. For personal use only. 2009 114: e10-e19 Prepublished online May 12, 2009; doi:10.1182/blood-2009-02-203828 Platelet membrane proteomics: a novel repository for functional research Urs Lewandrowski, Stefanie Wortelkamp, Katharina Lohrig, René P. Zahedi, Dirk A. Wolters, Ulrich Walter and Albert Sickmann Updated information and services can be found at: http://bloodjournal.hematologylibrary.org/content/114/1/e10.full.html Articles on similar topics can be found in the following Blood collections e-Blood (113 articles) Platelets and Thrombopoiesis (407 articles) Information about reproducing this article in parts or in its entirety may be found online at: http://bloodjournal.hematologylibrary.org/site/misc/rights.xhtml#repub_requests Information about ordering reprints may be found online at: http://bloodjournal.hematologylibrary.org/site/misc/rights.xhtml#reprints Information about subscriptions and ASH membership may be found online at: http://bloodjournal.hematologylibrary.org/site/subscriptions/index.xhtml Blood (print ISSN 0006-4971, online ISSN 1528-0020), is published weekly by the American Society of Hematology, 2021 L St, NW, Suite 900, Washington DC 20036. Copyright 2011 by The American Society of Hematology; all rights reserved. From bloodjournal.hematologylibrary.org at GRAND VALLEY STATE UNIV on November 19, 2013. For personal use only. PLATELETS AND THROMBOPOIESIS e-Blood Platelet membrane proteomics: a novel repository for functional
    [Show full text]
  • And Type the TITLE of YOUR WORK in All Caps
    EXAMINING THE REGULATION OF ADIPOCYTOKINE EXPRESSION DURING CHRONIC INSULIN RESISTANCE by EDITH ELIZABETH HAYDEN (Under the Direction of Lance Wells) ABSTRACT Type II diabetes, characterized by hyperglycemia and hyperinsulinemia, results in many costly and debilitating patient complications. A broad range of tissues respond to insulin and help to mediate its effects. In particular, adipose tissue secretes proteins known as adipocytokines, which play an important role in whole body energy homeostasis and have been implicated in the pathogenesis of diabetes. Increased flux through the hexosamine biosynthetic pathway and the corresponding increase in intracellular glycosylation of proteins via O-GlcNAc is sufficient to induce insulin resistance (IR) in multiple systems. Previously, our group used shotgun proteomics to identify rodent adipocytokines whose levels are modulated upon the induction of IR by indirectly and directly modulating O-GlcNAc levels. Since adipocytokines levels are regulated primarily at the level of transcription and O-GlcNAc alters the function of many transcription factors, we hypothesized that elevated O-GlcNAc levels on key transcription factors are modulating adipocytokine expression. Here, we show that upon the elevation of O- GlcNAc levels and the induction of IR in mature 3T3-F442a adipocytes, the transcript levels of multiple adipocytokines, as measured by quantitative RT-PCR, reflect the modulation observed at the protein level. We have gone on to validate the adipocytokine transcript levels in mouse models of diabetes. Using inguinal fat pads from the db/db mouse model and the diet-induced obesity mouse model, we have confirmed that the adipocytokines regulated by O-GlcNAc modulation in cell culture are likewise modulated in the whole animal upon a shift to IR.
    [Show full text]
  • Towards Effective Biomedical Knowledge Discovery Through Subject-Centric Semantic Integration of the Life-Science Information Space
    TECHNISCHE UNIVERSITÄT MÜNCHEN Lehrstuhl für Genomorientierte Bioinformatik Towards Effective Biomedical Knowledge Discovery through Subject-Centric Semantic Integration of the Life-Science Information Space Karamfilka Krasimirova Nenova Vollständiger Abdruck der von der Fakultät Wissenschaftszentrum Weihenstephan für Ernährung, Landnutzung und Umwelt der Technischen Universität München zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften genehmigten Dissertation. Vorsitzender: Prof. Dr. M. Hrabé de Angelis Prüfer der Dissertation: 1. Univ.-Prof. Dr. H.-W. Mewes 2. Univ.-Prof. Dr. R. Zimmer (Ludwig-Maximilians-Universität München) Die Dissertation wurde am 20.10.2008 bei der Technischen Universität München eingereicht und durch die Fakultät Wissenschaftszentrum Weihenstephan für Ernährung, Landnutzung und Umwelt am 19.03.2009 angenommen. Acknowledgements This is a great opportunity to express my deep gratitude to many people who supported me during my Ph.D. research and in the writing process of this thesis. Without their help, guidance and encouragement, this study would not have been completed. Above all, I would like to thank Dr. Volker Stümpflen, who inspired me for the undertaken research. His professional competence, valuable advices and commitment were indispensable for my work. As a mentor and leader of the Biological Information Systems group at MIPS, he has been always open-minded for constructive discussions and core contributor to the very friendly atmosphere in the group. I am also very grateful to Prof. Dr. Hans-Werner Mewes for giving me the opportunity to acquire doctor’s degree at the Institute for Bioinformatics and Systems Biology. I highly appreciate his trust in me and my work and I am thankful for his guidance in the key moments of my research, the unreserved support and the patience during the writing process.
    [Show full text]
  • Biological Data Managing: a Opportunity for Molecular Biology and Pharmaceuticals
    Volume 4, Issue 2, September – October 2010; Article 010 ISSN 0976 – 044X BIOLOGICAL DATA MANAGING: A OPPORTUNITY FOR MOLECULAR BIOLOGY AND PHARMACEUTICALS. *Sofiya Verma1, Deepak Jain2 1Research Scholar, Dept. of Pharmaceutics, Shri Ram Institute of Technology, Jabalpur (M.P.) 2Research Scholar, Dept.of Pharmacology, Shri Ram Institute of Technology, Jabalpur (M.P.) *Email: [email protected] ABSTRACT Database management system (DBMS) A set of computer programs that control the creation, maintenance, and utilization of the databases of an organization DBMS uses application development: developing information systems by a process of investigation, analysis, design, implementation, and maintenance. Also called systems development life cycle (SDLC), information systems development. We demonstrate Biological Data Base Management (bdbm), an extensible database engine for biological databases. Bdbms started on the observation that database technology has not kept pace with the specific requirements of biological databases and that several needed key functionalities are not supported at the engine level. While bdbms aims at supporting several of these functionalities, this demo focuses on: (1) Annotation and provenance management including storage, indexing, querying, and propagation, (2) Local dependency tracking of dependencies and derivations among data items, and (3) Update authorization to support data curation. We demonstrate how bdbms enables biologists to manipulate their databases, annotations, and derivation information in a unified
    [Show full text]