The NFI-Regulome Database: a Tool

Total Page:16

File Type:pdf, Size:1020Kb

The NFI-Regulome Database: a Tool Gronostajski et al. Journal of Clinical Bioinformatics 2011, 1:4 http://www.jclinbioinformatics.com/content/1/1/4 JOURNAL OF CLINICAL BIOINFORMATICS DATABASE Open Access The NFI-Regulome Database: A tool for annotation and analysis of control regions of genes regulated by Nuclear Factor I transcription factors Richard M Gronostajski1,2*, Joseph Guaneri2,3, Dong Hyun Lee4, Steven M Gallo5 Abstract Background: Genome annotation plays an essential role in the interpretation and use of genome sequence information. While great strides have been made in the annotation of coding regions of genes, less success has been achieved in the annotation of the regulatory regions of genes, including promoters, enhancers/silencers, and other regulatory elements. One reason for this disparity in annotated information is that coding regions can be assessed using high-throughput techniques such as EST sequencing, while annotation of regulatory regions often requires a gene-by-gene approach. Results: The NFI-Regulome database http://nfiregulome.ccr.buffalo.edu was designed to promote easy annotation of the regulatory regions of genes that contain binding sites for the NFI (Nuclear Factor I) family of transcription factors, using data from the published literature. Binding sites are annotated together with the sequence of the gene, obtained from the UCSC Genome site, and the locations of all binding sites for multiple genes can be displayed in a number of formats designed to facilitate inter-gene comparisons. Classes of genes based on expression pattern, disease involvement, or types of binding sites present can be readily compared in order to assess common “architectural” structures in the regulatory regions. Conclusions: The NFI-Regulome database allows rapid display of the relative locations and number of transcription factor binding sites of individual or defined sets of genes that contain binding sites for NFI transcription factors. This database may in the future be expanded into a distributed database structure including other families of transcription factors. Such databases may be useful for identifying common regulatory structures in genes essential for organ development, tissue-specific gene expression or those genes related to specific diseases. Background High-throughput sequencing techniques now allow Genome annotation, and the ability to extract and use human and other complex genomes to be sequenced information stored in genome databases, is an essential relatively easily [8,9]. However determining the func- part of genomic and bioinformatic analysis [1-4]. While tional significance of sequence changes, particularly now primarily a basic research tool, analysis of genome changes that affect regulatory elements in the genome, is annotation information is rapidly becoming an impor- still in its infancy. The annotation of regulatory elements tant part of Medical and Health Care informatics. in genomes has lagged behind the analysis of coding As more patient genomes are determined, the ability to regions [1,2]. While coding regions can be readily correlate changes in the regulatory regions of genes with assessed by comparing genome sequence with cDNA specific disease states will become increasingly impor- sequences, regulatory regions are still identified primarily tant for Personalized Medicine [5-7]. on a gene-by-gene basis. Even with the use of such powerfultoolsaswholegenomeChIP-seqandthe Encode project [4,10,11], the functional significance of * Correspondence: [email protected] binding sites found by large scale screening can only be 1Department of Biochemistry, State University of New York at Buffalo, 140 definitively tested by mutational analysis of binding sites Farber Hall, Buffalo, NY, 14214, USA within genes and determining the effect of loss of binding Full list of author information is available at the end of the article © 2011 Gronostajski et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Gronostajski et al. Journal of Clinical Bioinformatics 2011, 1:4 Page 2 of 10 http://www.jclinbioinformatics.com/content/1/1/4 on gene expression. Thus, the wealth of published data occur are handled through the software due to the lim- on the analysis of regulatory elements in genes remains itation of MyISAM not having transaction management an important asset to be mined by bioinformatic or foreign key management. approaches. The overall structure of the tables (Figure 1) was devel- Gene expression can be regulated at many levels oped specifically for this database. The tables can be sepa- including control of transcription rate, transcript trans- rated into smaller groups which have a complete port and degradation, translation rate, protein folding dependency on each other (Figure 2), two additional tables and assembly into multi-subunit structures, and protein provide static information, and one table is used for hold- stability [12]. We have focused on the analysis of cis- ing user information. Each group is responsible for acting regulatory elements as mediators of gene expression as the data warehouse for a specific set of information. [13-17]. In particular, we have created a database for the The AuthorDB and ArticleDB grouping is used for annotation and analysis of binding sites for site-specific holding information relatedtoPubMedarticles.Asthe transcription factors in the promoter and enhancer PubMed ID is a unique feature, it is used in this group- regions of genes. To provide a focus for such a broad ing as the primary key. The Author field is set currently topic, we’ve considered only genes that contain binding to only 64 characters maximum as no current value sites for the Nuclear Factor I (NFI) family of transcrip- even approaches that maximum. This field maximum tion factors. can be adjusted if a value were to supersede this arbi- The NFI family of transcription factors is essential for trary default. The Abstract field is set to longtext due to the development of multiple organ systems including the varying length of article abstracts. Utilizing the MyI- brain, lung, muscle, hematopoietic cells, and teeth [18-21]. SAM engine allows for searching of this field for key- The NFI-Regulome database contains the control regions words at the lowest level which is preferable to creating of genes that have been shown to be regulated by NFI a piece of software to accomplish the same task. transcription factors in the primary literature. These con- GeneDB, GeneSynonymDB, SpeciesDB, BindingSiteDB, trol regions are annotated with transcription start sites, and TF_connect_TFBS_DB form the main grouping of translation start sites, NFI binding sites, and the location tables used for holding gene and binding site informa- and identity of other known or unknown site-specific tran- tion. The TF_connect_TFBS_DB acts as a directory to scription factors. Since there are hundreds of known site- allow a gene to know what binding sites it has and for a specific transcription factors [13,16,17], a comprehensive binding site to know what gene it belongs to. The Gen- coverage of all known cis-regulatory sites within a single eDB table includes a Cell_memo field that is used to hold database is daunting, therefore restricting our analysis to important keywords. The keywords are generally sepa- NFI-site containing regulatory regions provides us with a rated by a comma but this is an in-house practice. The defined starting point for our analysis. Since this gene use of MyISAM here allows for this field to be searched family has been shown to be essential for a number of for string values and can be changed to a text type if the developmental processes, the database should also provide varchar length, currently set to the max of 254, gets useful information on the structures of regulatory regions exceeded. GeneSynonymDB provides a table that lists the of genes involved in development and disease. alternative names for a particular gene. While this infor- mation could have been listed underneath the Cell_- Construction and content memo field by providing a separate table for this Structure information, fast indexing and access can be provided. The database is built using MySQL with the MyISAM This feature can be expanded to other attributes located engine. This provides the full text support needed for in the Cell_memo field. The BindingSiteDB table houses searches. The table structure is designed to be in first all of the binding site information. This table also normal form, which states that the attributes of the rela- includes a TFBS_memo field which allows an annotator tion contain only atomic values [22]. While third normal to list important keywords. In NFI-Regulome these are form (3NF) can be achieved easily through the use of an separatedbycommasalso.Duetotheshortlengthof algorithm [22], the decomposed tables are not practical binding site sequences, the binding site sequence field for the queries utilized by the website pages. The tables uses a variable type of varchar instead of the longtext are separated by major characteristics and generally all used by GeneDB. The BindingSiteDB table also provides fields of a given table are utilized by a given query. This the link to the previous group by the inclusion of the enables a single query to retrieve all the information Pubmed_id field. The TF_connect_TFBS_DB is the needed for a particular object such as a binding site or a central table of the entire database. The TF_- gene. In this case performance concerns outweigh the connect_TFBS_DB table connects a particular gene for a concerns of anomalies [22] appearing in the relation particular species to a particular binding site and gives it scheme. Most situations where anomalies can possibly transcription factor information. Most queries utilized by Gronostajski et al. Journal of Clinical Bioinformatics 2011, 1:4 Page 3 of 10 http://www.jclinbioinformatics.com/content/1/1/4 Figure 1 Table structure of NFI-Regulome Database.
Recommended publications
  • Systems Biology Lecture Outline (Systems Biology) Michael K
    Spring 2019 – Epigenetics and Systems Biology Lecture Outline (Systems Biology) Michael K. Skinner – Biol 476/576 CUE 418, 10:35-11:50 am, Tuesdays & Thursdays January 22 & 29, 2019 Weeks 3 and 4 Systems Biology (Components & Technology) Components (DNA, Expression, Cellular, Organ, Physiology, Organism, Differentiation, Development, Phenotype, Evolution) Technology (Genomics, Transcriptomes, Proteomics) (Interaction, Signaling, Metabolism) Omics (Data Processing and Resources) Required Reading ENCODE (2012) ENCODE Explained. Nature 489:52-55. Tavassoly I, Goldfarb J, Iyengar R. (2018) Essays Biochem. 62(4):487-500. Literature Sundaram V, Wang T. Transposable Element Mediated Innovation in Gene Regulatory Landscapes of Cells: Re-Visiting the "Gene-Battery" Model. Bioessays. 2018 40(1). doi: 10.1002/bies.201700155. Abil Z, Ellefson JW, Gollihar JD, Watkins E, Ellington AD. Compartmentalized partnered replication for the directed evolution of genetic parts and circuits. Nat Protoc. 2017 Dec;12(12):2493- 2512. Filipp FV. Crosstalk between epigenetics and metabolism-Yin and Yang of histone demethylases and methyltransferases in cancer. Brief Funct Genomics. 2017 Nov 1;16(6):320-325. Chen Z, Li S, Subramaniam S, Shyy JY, Chien S. Epigenetic Regulation: A New Frontier for Biomedical Engineers. Annu Rev Biomed Eng. 2017 Jun 21;19:195-219. Hollick JB. Paramutation and related phenomena in diverse species. Nat Rev Genet. 2017 Jan;18(1):5-23. Macovei A, Pagano A, Leonetti P, Carbonera D, Balestrazzi A, Araújo SS. Systems biology and genome-wide approaches to unveil the molecular players involved in the pre-germinative metabolism: implications on seed technology traits. Plant Cell Rep. 2017 May;36(5):669-688. Bagchi DN, Iyer VR.
    [Show full text]
  • The Human Proteome Co-Regulation Map Reveals Functional Relationships Between Proteins
    bioRxiv preprint doi: https://doi.org/10.1101/582247; this version posted March 19, 2019. The copyright holder for this preprint (which was 19/03/2019not certified by peer review) is the author/funder, who hasNBT_manuscript_revised granted bioRxiv a license - toGoogle display Docs the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. 1 The human proteome co-regulation map reveals functional relationships 2 between proteins 3 Georg Kustatscher1 *, Piotr Grabowski2 *, Tina A. Schrader3 , Josiah B. Passmore3 , Michael 4 Schrader3 , Juri Rappsilber1,2# 5 1 Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK 6 2 Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, 13355 Berlin, 7 Germany 8 3 Biosciences, University of Exeter, Exeter EX4 4QD, UK 9 * Equal contribution 10 # Communicating author: [email protected] 11 Submission as Resource article. 12 The annotation of protein function is a longstanding challenge of cell biology that 13 suffers from the sheer magnitude of the task. Here we present ProteomeHD, which 14 documents the response of 10,323 human proteins to 294 biological perturbations, 15 measured by isotope-labelling mass spectrometry. Using this data matrix and robust 16 machine learning we create a co-regulation map of the cell that reflects functional 17 associations between human proteins. The map identifies a functional context for 18 many uncharacterized proteins, including microproteins that are difficult to study with 19 traditional methods. Co-regulation also captures relationships between proteins 20 which do not physically interact or co-localize. For example, co-regulation of the 21 peroxisomal membrane protein PEX11β with mitochondrial respiration factors led us 22 to discover a novel organelle interface between peroxisomes and mitochondria in 23 mammalian cells.
    [Show full text]
  • Back-Of-The-Envelope Guide (A Tutorial) to 10 Intracellular Landscapes
    MOJ Proteomics & Bioinformatics Review Article Open Access Back-of-the-envelope guide (a tutorial) to 10 intracellular landscapes Abstract Volume 7 Issue 1 - 2018 Landscape is a metaphor for conceptualizing and visualizing a score across one or Maurice HT Ling1,2 more biological entities or concepts. This review provides a cursory overview of 1Colossus Technologies LLP, Republic of Singapore 10 landscapes (in alphabetical order, copy number, fluxome, genome, molecular, 2HOHY Pte. Ltd., Republic of Singapore metabolome, mutation, phenome, proteome, regulome, and transcriptome) in intracellular biology without going into extensive depth; hence, this article can act as Correspondence: Maurice HT Ling, Colossus Technologies a first tutorial into intracellular landscapes. The value ahead is to be able to compare LLP, 8 Burns Road, Trivex, Singapore 369977, Republic of and interrogate across multiple landscapes at different resolutions. Singapore, Tel +6596669233, Email [email protected] Received: January 29, 2018 | Published: February 06, 2018 Definition of a “landscape” Since Wright,1 the concept of landscape had been used to reference a metaphoric representation of an overview of one or more biological 1 The concept of landscape was first introduced by Sewell Wright, entities. This article reviews 10 landscapes (in alphabetical order, as a metaphor to visual a scale or score across a biological entity, copy number, fluxome, genome, molecular, metabolome, mutation, such as a chromosome (Figure 1). In a linear biological entity, such as phenome, proteome, regulome, and transcriptome) in intracellular a chromosome; the biological entity is represented on the horizontal biology with the aim of presenting a cursory, back of the envelope axis and the score, such as GC content, is represented on the vertical overview without going into extensive depth into each landscape.
    [Show full text]
  • Bioinformatics Strategies for Identification of Cancer Biomarkers and Targets in Pathogens Associated with Cancer
    FEDERAL UNIVERSITY OF MINAS GERAIS INSTITUTE OF BIOLOGICAL SCIENCE DEPARTMENT OF GENERAL BIOLOGY GRADUATE PROGRAM OF BIOINFORMATICS BIOINFORMATICS STRATEGIES FOR IDENTIFICATION OF CANCER BIOMARKERS AND TARGETS IN PATHOGENS ASSOCIATED WITH CANCER PhD STUDENT: Debmalya Barh SUPERVISOR: Prof. Dr. Vasco Ariston de Carvalho Azevedo Page 1 of 237 DEBMALYA BARH Bioinformatics strategies for identification of cancer biomarkers and targets in pathogens associated with cancer PhD STUDENT: Debmalya Barh SUPERVISOR: Prof. Dr. Vasco Ariston de Carvalho Azevedo FEDERAL UNIVERSITY OF MINAS GERAIS INSTITUTE OF BIOLOGICAL SCIENCES BELO HORIZONTE - MG February – 2017 Page 2 of 237 043 Barh, Debmalya. Bioinformatics strategies for identification of cancer biomarkers and targets in pathogens associated with cancer [manuscrito] / Debmalya Barh. – 2017. 237 f. : il. ; 29,5 cm. Supervisor: Vasco Ariston de Carvalho Azevedo. Tese (doutorado) – Federal University of Minas Gerais, Institute of Biological Sciences. 1. Bioinformática - Teses. 2. Marcadores biológicos. 3. Doenças transmissíveis - Teses. 4. Transcriptômica - Teses. 5. MicroRNAs - Teses. I. Azevedo , Vasco Ariston de Carvalho. II. Universidade Federal de Minas Gerais. Instituto de Ciências Biológicas. III. Título. CDU: 573:004 TABLE OF CONTENTS TABLE OF CONTENTS……………………………………………………………………………………i DEDICATION………………………………………………………………………………………..……iii ACKNOWLEDGEMENTS…………………………………………………………………………..……iv LIST OF ABBREVIATIONS…………………………………………………………………….……….. v ABSTRACT………………………………………………………………………………………...……..vii
    [Show full text]
  • Mrna Structure Dynamics Identifies the Embryonic RNA Regulome
    bioRxiv preprint doi: https://doi.org/10.1101/274290; this version posted March 1, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. mRNA structure dynamics identifies the embryonic RNA regulome Jean-Denis Beaudoin1,*,‡, Eva Maria Novoa2,3,4,5,*, Charles E Vejnar1, Valeria Yartseva1, Carter Takacs1, Manolis Kellis2,3 and Antonio J Giraldez1,6,‡ RNA folding plays a crucial role in RNA function. However, our knowledge of the global structure of the transcriptome is limited to steady-state conditions, hindering our understanding of how RNA structure dynamics influences gene function. Here, we have characterized mRNA structure dynamics during zebrafish development. We observe that on a global level, translation guides structure rather than structure guiding translation. We detect a decrease in structure in translated regions, and we identify the ribosome as a major remodeler of RNA structure in vivo. In contrast, we find that 3’-UTRs form highly folded structures in vivo, which can affect gene expression by modulating miRNA activity. Furthermore, we find that dynamic 3’-UTR structures encode RNA decay elements, including regulatory elements in nanog and cyclin A1, key maternal factors orchestrating the maternal-to-zygotic transition. These results reveal a central role of RNA structure dynamics in gene regulatory programs. Introduction Previous studies have focused on the global analysis of RNA structures in cells at RNA carries out a broad range of functions,1 and steady-state. However, in an organism, critical RNA structure has emerged as a fundamental developmental and metabolic decisions are often regulatory mechanism, modulating various post- made when cells transition between cellular transcriptional events such as splicing2, states.
    [Show full text]
  • Organizational Properties of a Functional Mammalian Cis-Regulome
    bioRxiv preprint doi: https://doi.org/10.1101/550897; this version posted February 15, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Organizational properties of a functional mammalian cis-regulome Virendra K. Chaudhri, Krista Dienger-Stambaugh, Zhiguo Wu, Mahesh Shrestha and Harinder Singh*# Division of Immunobiology and Center for Systems Immunology, Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, 45220, USA #Current address: Center for Systems Immunology, Departments of Immunology and Computational and Systems Biology, The University of Pittsburgh, Pittsburgh, PA 15213 *Corresponding author Email: [email protected] bioRxiv preprint doi: https://doi.org/10.1101/550897; this version posted February 15, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract Mammalian genomic states are distinguished by their chromatin and transcription profiles. Most genomic analyses rely on chromatin profiling to infer cis-regulomes controlling distinctive cellular states. By coupling FAIRE-seq with STARR-seq and integrating Hi-C we assemble a functional cis-regulome for activated murine B- cells. Within 55,130 accessible chromatin regions we delineate 9,989 active enhancers communicating with 7,530 promoters. The cis-regulome is dominated by long range enhancer-promoter interactions (>100kb) and complex combinatorics, implying rapid evolvability. Genes with multiple enhancers display higher rates of transcription and multi-genic enhancers manifest graded levels of H3K4me1 and H3K27ac in poised and activated states, respectively. Motif analysis of pathway-specific enhancers reveals diverse transcription factor (TF) codes controlling discrete processes.
    [Show full text]
  • Genome Sequencing: Then & Now
    “Aint you got an ‘ome to go to?” Genomics Transcriptomics Proteomics Metabolomics Physiomics toxicogenomics pharmacogenomics ecotoxicopharmacogenomics phosphatome, glycome, secretome metronome, mobilome, gardenome, etc… http://omics.org/index.php/Alphabetically_ordered_list_of_omes_and_omics --A-- Alignmentome: conceived before 2003. The whole set of Bacteriome: an organelle of bacteria. (Bacteriome.org). Also Carbome: BiO center. 2003. The whole set of carbone multiple sequence and structure alignments in the totality of baterial genes and proteins. based living organisms in the universe. (Carbome.org) bioinformatics. Alignments are the most important Bacterome: The totality of (Bacterome.org) Carbomics: BiO center. 2003. (Carbomics.org) representation in bioinformatics especially for homology and Behaviorome: The totality of (Behaviorome.org) Cardiogenomics: The omics approach research of evolution study. (Alignmentome.org) Behavioromics: The omics approach research of Cardiogenome (Cardiogenomics.org) in biology Alignmentomics: conceived before 2003. The study of Behavioromics (Behavioromics.org) in biology Cellome: concept is derived from the understanding that aligning strings and sequences especially in bioinformatics. Behaviourome: The totality of (Behaviourome.org) cells as a whole can be used for therapeutic purposes. As an (Alignmentomics.org) Behaviouromics: The omics approach research of important bioresource, cells are kept for biotechnology. As Alignome: 2003 . The whole set of string alignment Behaviouromics (Behaviouromics.org) in biology a distinct group concept, cellome refers to such cells and algorithms such as FASTA, BLAST and HMMER. Bibliome: 1999. EBI (European Bioinformatics Institute). their genetic materials. (Cellome.org) (Alignome.org) The whole of biological science and technology literature. Cellomics: The study of Cellome. Cellomics is also a Alignomics: The omics approach research of Alignomics Within bibliome, each word and context is interconnected company name Cellomics, Inc.
    [Show full text]
  • Identification of a Primitive Intestinal Transcription Factor Network Shared Between Esophageal Adenocarcinoma and Its Precancerous Precursor State
    Downloaded from genome.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press Research Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its precancerous precursor state Connor Rogerson,1 Edward Britton,1 Sarah Withey,2 Neil Hanley,2,3 Yeng S. Ang,2,4 and Andrew D. Sharrocks1 1School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PT, United Kingdom; 2School of Medical Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Sciences Centre, University of Manchester, Manchester M13 9PT, United Kingdom; 3Endocrinology Department, Central Manchester University Hospitals NHS Foundation Trust, Manchester M13 9WU, United Kingdom; 4GI Science Centre, Salford Royal NHS FT, University of Manchester, Salford M6 8HD, United Kingdom Esophageal adenocarcinoma (EAC) is one of the most frequent causes of cancer death, and yet compared to other common cancers, we know relatively little about the molecular composition of this tumor type. To further our understanding of this cancer, we have used open chromatin profiling to decipher the transcriptional regulatory networks that are operational in EAC. We have uncovered a transcription factor network that is usually found in primitive intestinal cells during embryonic development, centered on HNF4A and GATA6. These transcription factors work together to control the EAC transcrip- tome. We show that this network is activated in Barrett’s esophagus, the putative precursor state to EAC, thereby providing novel molecular evidence in support of stepwise malignant transition. Furthermore, we show that HNF4A alone is sufficient to drive chromatin opening and activation of a Barrett’s-like chromatin signature when expressed in normal human epithe- lial cells.
    [Show full text]
  • Institute for Systems Biology Scientific Strategic Plan
    Institute for Systems Biology Scientific Strategic Plan October 2014 VISION: Facilitate wellness, predict and prevent disease and enable environmental stability. STRATEGY: Develop a powerful approach to predict and engineer state transitions in biological systems. TACTICS: We will execute this strategy through four tactics: 1. Develop a theoretical framework to define states and state transitions in biological systems. 2. Use experimental systems to quantitatively characterize states and state transitions. 3. Invent technologies to measure and manipulate biological systems. 4. Invent computational methods to identify states and model dynamics of underlying networks. APPLICATIONS: We will drive tactics with applications to enable predictive, preventive, personalized, and participatory (P4) medicine and a sustainable environment. INTRODUCTION MISSION We will revolutionize science with a powerful systems approach to predict and prevent disease, and enable a sustainable environment. STRATEGY ISB will develop a systems approach to characterize, predict, and manipulate state transitions in biological systems. This strategy is the common denominator to accomplish the two vision goals. By studying similar phenomena across diverse biological systems (microbes to humans) we will leverage varying expertise, ideas, resources, and opportunities to develop a unified systems biology framework. 2 Learn more about our scientists at: systemsbiology.org/people INTRODUCTION ISB IS A UNIQUE ORGANIZATION ISB is neither a classical definition of an academic organization nor is it a biotechnology company. We are unique. Unlike in traditional academic departments, our faculty are cross-disciplinary and work collaboratively toward a common vision. This cross-disciplinary approach enables us to take on big, complex problems. While we are vision-driven, ISB is also deeply committed to transferring knowledge to society.
    [Show full text]
  • Σ54-Dependent Regulome in Desulfovibrio Vulgaris Hildenborough Alexey E
    Kazakov et al. BMC Genomics (2015) 16:919 DOI 10.1186/s12864-015-2176-y RESEARCH ARTICLE Open Access σ54-dependent regulome in Desulfovibrio vulgaris Hildenborough Alexey E. Kazakov1, Lara Rajeev1, Amy Chen1, Eric G. Luning1, Inna Dubchak1,2, Aindrila Mukhopadhyay1 and Pavel S. Novichkov1* Abstract Background: The σ54 subunit controls a unique class of promoters in bacteria. Such promoters, without exception, require enhancer binding proteins (EBPs) for transcription initiation. Desulfovibrio vulgaris Hildenborough, a model bacterium for sulfate reduction studies, has a high number of EBPs, more than most sequenced bacteria. The cellular processes regulated by many of these EBPs remain unknown. Results: To characterize the σ54-dependent regulome of D. vulgaris Hildenborough, we identified EBP binding motifs and regulated genes by a combination of computational and experimental techniques. These predictions were supported by our reconstruction of σ54-dependent promoters by comparative genomics. We reassessed and refined the results of earlier studies on regulation in D. vulgaris Hildenborough and consolidated them with our new findings. It allowed us to reconstruct the σ54 regulome in D. vulgaris Hildenborough. This regulome includes 36 regulons that consist of 201 coding genes and 4 non-coding RNAs, and is involved in nitrogen, carbon and energy metabolism, regulation, transmembrane transport and various extracellular functions. To the best of our knowledge,thisisthefirstreportofdirectregulationof alanine dehydrogenase, pyruvate metabolism genes and type III secretion system by σ54-dependent regulators. Conclusions: The σ54-dependent regulome is an important component of transcriptional regulatory network in D. vulgaris Hildenborough and related free-living Deltaproteobacteria. Our study provides a representative collection of σ54-dependent regulons that can be used for regulation prediction in Deltaproteobacteria and other taxa.
    [Show full text]
  • An Integrated Regulome-Interactome Approach for Establishing
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Archive Ouverte en Sciences de l'Information et de la Communication HTS-Net: An integrated regulome-interactome approach for establishing network regulation models in high-throughput screenings Claire Rioualen, Quentin da Costa, Bernard Chetrit, Emmanuelle Charafe-Jauffret, Christophe Ginestier, Ghislain Bidaut To cite this version: Claire Rioualen, Quentin da Costa, Bernard Chetrit, Emmanuelle Charafe-Jauffret, Christophe Ginestier, et al.. HTS-Net: An integrated regulome-interactome approach for establishing network regulation models in high-throughput screenings. PLoS ONE, Public Library of Science, 2017. hal- 01784477 HAL Id: hal-01784477 https://hal.archives-ouvertes.fr/hal-01784477 Submitted on 3 May 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. RESEARCH ARTICLE HTS-Net: An integrated regulome-interactome approach for establishing network regulation models in high-throughput screenings Claire Rioualen1,2,3,4, Quentin Da Costa1,2,3,4, Bernard Chetrit1,2,3,4,
    [Show full text]
  • Dcas9-Targeted Locus-Specific Protein Isolation Method Identifies Histone Gene Regulators
    dCas9-targeted locus-specific protein isolation method identifies histone gene regulators Chiahao Tsuia,b, Carla Inouyea,b, Michaella Levyc, Andrew Lua, Laurence Florensc, Michael P. Washburnc,d, and Robert Tjiana,b,1 aDepartment of Molecular and Cell Biology, Li Ka Shing Center for Biomedical and Health Sciences, California Institute for Regenerative Medicine Center of Excellence, University of California, Berkeley, CA 94720; bHoward Hughes Medical Institute, University of California, Berkeley, CA 94720; cStowers Institute for Medical Research, Kansas City, MO 64110; and dDepartment of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160 Contributed by Robert Tjian, February 7, 2018 (sent for review October 30, 2017; reviewed by James T. Kadonaga and Robert E. Kingston) Eukaryotic gene regulation is a complex process, often coordi- regulatory functions within the nucleus. These challenges have nated by the action of tens to hundreds of proteins. Although made the discovery of a more complete “regulome” responsible previous biochemical studies have identified many components of for various stages of gene-expression control at specific genomic the basal machinery and various ancillary factors involved in gene loci a difficult and experimentally arduous process. regulation, numerous gene-specific regulators remain undiscovered. One of the many essential loci for which our understanding of To comprehensively survey the proteome directing gene expression gene regulation remains stubbornly incomplete is the canonical at a specific genomic locus of interest, we developed an in vitro histone gene locus, which exists as highly repetitive clusters of nuclease-deficient Cas9 (dCas9)-targeted chromatin-based purifica- unique sequence in eukaryotic genomes (4).
    [Show full text]