Histone Modification Levels Are Predictive for Gene Expression

Total Page:16

File Type:pdf, Size:1020Kb

Histone Modification Levels Are Predictive for Gene Expression Histone modification levels are predictive for gene expression Rosa Karlića,b,1, Ho-Ryun Chunga,1,2, Julia Lasserrea, Kristian Vlahovičekb,c, and Martin Vingrona aMax-Planck-Institut für Molekulare Genetik, Department of Computational Molecular Biology, Ihnestraße 73, 14195 Berlin, Germany; bBioinformatics Group, Division of Biology, Faculty of Science, Zagreb University, Horvatovac 102a, 10000 Zagreb, Croatia; and cDepartment of Informatics, University of Oslo, P.O. Box 1080, Blindern, NO-0316 Oslo, Norway Edited by Robert G. Roeder, The Rockefeller University, New York, NY, and approved January 7, 2010 (received for review August 20, 2009) Histones are frequently decorated with covalent modifications. and are tightly regulated to achieve a precise control of gene ex- These histone modifications are thought to be involved in various pression. The regulatory mechanisms depend on the action of chromatin-dependent processes including transcription. To eluci- transcription factors, which facilitate the recruitment of pol II date the relationship between histone modifications and transcrip- and/or chromatin modifying complexes. Histone modifications tion, we derived quantitative models to predict the expression can therefore be viewed as a read out of the activity of transcrip- level of genes from histone modification levels. We found that his- tion factors. In line with this idea, there are established links be- tone modification levels and gene expression are very well corre- tween the distinct steps in the transcription cycle and some lated. Moreover, we show that only a small number of histone histone modifications, e.g., H3K4me3 is associated with tran- modifications are necessary to accurately predict gene expression. scription initiation [(14) and references therein], H3K27me3 with We show that different sets of histone modifications are necessary the repression of transcription elongation [(15) and references to predict gene expression driven by high CpG content promoters therein], and H3K36me3 with the removal of histone acetylations (HCPs) or low CpG content promoters (LCPs). Quantitative models in the wake of an elongating pol II (for review, see ref. 11). How- involving H3K4me3 and H3K79me1 are the most predictive of the ever, in general, little is known about the relationship between expression levels in LCPs, whereas HCPs require H3K27ac and histone modifications and the transcriptional process. H4K20me1. Finally, we show that the connections between histone Here, we systematically study this relationship. We analyzed modifications and gene expression seem to be general, as we were the recently published genome-wide localization data of 38 his- BIOPHYSICS AND able to predict gene expression levels of one cell type using a mod- tone modifications and one histone variant measured in human COMPUTATIONAL BIOLOGY el trained on another one. CD4þ T-cells (16, 17). We address four major questions: (i)Is there a quantitative relationship between histone modifications high CpG content promoter ∣ low CpG content promoter ∣ levels and transcription? (ii) Are there histone modifications that regression analysis ∣ transcription are more important than others to predict transcript levels? (iii) Are there different requirements for different promoter iv he DNA of eukaryotic organisms is packaged into chromatin, types? ( ) Are the relationships general? To answer these ques- Twhose basic repeating unit is the nucleosome. A nucleosome tions, we derived models that quantitatively relate the measured is formed by wrapping 147 base pairs of DNA around an octamer expression level of genes (18) to the level of modifications at their of four core histones, H2A, H2B, H3, and H4 (1–5) which are promoters. We show that our models faithfully capture the mea- subject to a number of posttranslational covalent modifications sured expression levels of genes, suggesting that the levels of [(6); for review, see ref. 7]. These modifications can alter the modifications are quantitatively related to gene expression and chromatin structure and function by changing the charge of that there is a tight link between these histone modifications the nucleosome particle, and/or by recruiting protein complexes and the transcriptional process. Furthermore, combinations of either individually or in combination (8). Hence, histone modifi- only two to three modifications are sufficient to build models that cations are thought to constitute a “Histone Code,” which is read give rise to at least 95% of the performance obtained by using all out by proteins to bring about specific downstream effects (9, 10). modifications. The separation of low CpG content promoters Histone modifications have been linked to a number of chro- (LCPs) from high CpG content promoters (HCPs) revealed that matin-dependent processes, including replication, DNA-repair, different histone modifications are identified dependent on the and transcription. The link between histone modifications and CpG content of the promoters. Finally, gene expression levels transcription has been particularly intensively studied. It has been of another cell type were accurately predicted using a model 4þ found that individual modifications can be associated with trained on the CD T-cells, suggesting that the relationship be- transcriptional activation or repression. Acetylation and phos- tween histone modifications and gene expression is a general one. phorylation generally accompany transcription; sumoylation, Results deimination, and proline isomerization are usually found in transcriptionally silent regions; methylation and ubiquitination Histone Modification Levels Are Predictive of Gene Expression in CD4þ are implicated in both activation and repression of transcription T-Cells. The presence or absence of certain histone modifi- (8). Furthermore, the establishment of some modifications is de- cations has been shown to correlate with the expression status of pendent on the presence of other modifications, e.g., the catalysis of H3K4me3 requires the presence of H2BK120ub1 (the Author contributions: R.K., H.-R.C., K.V., and M.V. designed research; R.K. and H.-R.C. so-called trans-tail pathway) and the phosphorylation on serine performed research; J.L. contributed new reagents/analytic tools; R.K. and H.-R.C. 5 on the C-terminal domain of RNA polymerase II (pol II) analyzed data; R.K., H.-R.C., J.L., K.V., and M.V. wrote the paper. (for review, see ref. 11, which also reviews other examples for The authors declare no conflict of interest. the combinatorial action of histone modifications). This article is a PNAS Direct Submission. Transcription proceeds in a series of steps, also referred to as Freely available online through the PNAS open access option. transcription cycle, starting with preinitiation complex formation, 1R.K. and H.-R.C. contributed equally to this work. pol II recruitment, the transition to an initiating and thereafter 2To whom correspondence should be addressed. E-mail: [email protected]. elongating pol II, elongation, and finally termination (for review, This article contains supporting information online at www.pnas.org/cgi/content/full/ see refs. 12, 13). The first four steps take place at the promoter 0909344107/DCSupplemental. www.pnas.org/cgi/doi/10.1073/pnas.0909344107 PNAS Early Edition ∣ 1of6 Downloaded by guest on September 27, 2021 genes (19). To get a better understanding of the relationships One-modification model (41 models) between histone modifications and gene expression, we analyzed f(N'i) =a+bi *N'i the publicly available genome-wide localization data for 38 his- N' f(N' ) y tone modifications and one histone variant in human CD4þ i i T-cells (ChIP-seq data) (16, 17). We used the numbers of tags Two-modifications model (820 models) for each histone modification or variant, found in a region of f(N' N' ) =a+b N' +b N' 4,001 base pairs surrounding the transcription start sites of i, j i i j j 14,802 RefSeq genes, as an estimation of the level of histone N'i modifications. Furthermore, we examined microarray data mea- f(N' N' ) y 4þ i, j suring the transcript abundance of each of these genes in CD N'j T-cells (18), where the logarithm of the intensity served as expres- sion value. Because open chromatin regions preferentially have Three-modifications model (10,660 models) higher tag counts than closed chromatin regions (20, 21), we also f(N' N' N' ) =a+b N' +b N' +b N' obtained ChIP-seq data for unspecific control ChIPs using goat i, j, k i i j j k k and rabbit IgG antibodies in CD4þ T-cells (22) and mapped N'i these tags to the same regions as the histone modification data. We derived a simple model that relates the expression values to N' y j f(N'i, N'j, N'k ) the histone modification levels (for details, see Methods). Briefly, N i we transformed the number of tags ij for each modification and N'k promoter j to a logarithmic scale. Because some Nij were zero, we had to add a pseudocount αi to the levels of each modification Full model (1 model) to assure the logarithm would always be defined (N0 ¼ ij f(N' ... N' ) =a+b N' + ... + b N' logðNij þ αiÞ). We chose the αi that maximizes the correlation 1, , 41 1 1 41 41 of Nij to the expression value (estimated using 4,934 randomly N' selected promoters). We then built for the remaining 9,868 pro- 1 moters a linear regression model where the entire set of modifi- f(N' , ..., N' ) y cations and the control IgG data served as input (Fig. 1; referred ... 1 41 “ ” to as full model ). We used 10-fold cross-validation to ascertain N'41 that a possible quantitative relationship is of general nature and not limited to a subset of genes. We evaluated the performance of Fig. 1. Modeling framework. Models are equations that linearly relate the N0 the model by determining the Pearson correlation coefficient r levels of histone modifications to the measured expression value. i between modeled and measured expression. corresponds to a vector of length L (the number of promoters), where the r ¼ 0 77 components are the transformed levels of a histone modification i The full model is very well correlated to expression ( .
Recommended publications
  • Functional Roles of Bromodomain Proteins in Cancer
    cancers Review Functional Roles of Bromodomain Proteins in Cancer Samuel P. Boyson 1,2, Cong Gao 3, Kathleen Quinn 2,3, Joseph Boyd 3, Hana Paculova 3 , Seth Frietze 3,4,* and Karen C. Glass 1,2,4,* 1 Department of Pharmaceutical Sciences, Albany College of Pharmacy and Health Sciences, Colchester, VT 05446, USA; [email protected] 2 Department of Pharmacology, Larner College of Medicine, University of Vermont, Burlington, VT 05405, USA; [email protected] 3 Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT 05405, USA; [email protected] (C.G.); [email protected] (J.B.); [email protected] (H.P.) 4 University of Vermont Cancer Center, Burlington, VT 05405, USA * Correspondence: [email protected] (S.F.); [email protected] (K.C.G.) Simple Summary: This review provides an in depth analysis of the role of bromodomain-containing proteins in cancer development. As readers of acetylated lysine on nucleosomal histones, bromod- omain proteins are poised to activate gene expression, and often promote cancer progression. We examined changes in gene expression patterns that are observed in bromodomain-containing proteins and associated with specific cancer types. We also mapped the protein–protein interaction network for the human bromodomain-containing proteins, discuss the cellular roles of these epigenetic regu- lators as part of nine different functional groups, and identify bromodomain-specific mechanisms in cancer development. Lastly, we summarize emerging strategies to target bromodomain proteins in cancer therapy, including those that may be essential for overcoming resistance. Overall, this review provides a timely discussion of the different mechanisms of bromodomain-containing pro- Citation: Boyson, S.P.; Gao, C.; teins in cancer, and an updated assessment of their utility as a therapeutic target for a variety of Quinn, K.; Boyd, J.; Paculova, H.; cancer subtypes.
    [Show full text]
  • Four Enzymes Cooperate to Displace Histone H1 During the First Minute of Hormonal Gene Activation
    Downloaded from genesdev.cshlp.org on September 26, 2021 - Published by Cold Spring Harbor Laboratory Press Four enzymes cooperate to displace histone H1 during the first minute of hormonal gene activation Guillermo Pablo Vicent,1,2 A. Silvina Nacht,1 Jofre Font-Mateu, Giancarlo Castellano, Laura Gaveglia, Cecilia Ballare´, and Miguel Beato Centre de Regulacio´ Geno` mica (CRG), Universitat Pompeu Fabra (UPF), E-08003 Barcelona, Spain Gene regulation by external signals requires access of transcription factors to DNA sequences of target genes, which is limited by the compaction of DNA in chromatin. Although we have gained insight into how core histones and their modifications influence this process, the role of linker histones remains unclear. Here we show that, within the first minute of progesterone action, a complex cooperation between different enzymes acting on chromatin mediates histone H1 displacement as a requisite for gene induction and cell proliferation. First, activated progesterone receptor (PR) recruits the chromatin remodeling complexes NURF and ASCOM (ASC-2 [activating signal cointegrator-2] complex) to hormone target genes. The trimethylation of histone H3 at Lys 4 by the MLL2/MLL3 subunits of ASCOM, enhanced by the hormone-induced displacement of the H3K4 demethylase KDM5B, stabilizes NURF binding. NURF facilitates the PR-mediated recruitment of Cdk2/CyclinA, which is required for histone H1 displacement. Cooperation of ATP-dependent remodeling, histone methylation, and kinase activation, followed by H1 displacement, is a prerequisite for the subsequent displacement of histone H2A/H2B catalyzed by PCAF and BAF. Chromatin immunoprecipitation (ChIP) and sequencing (ChIP-seq) and expression arrays show that H1 displacement is required for hormone induction of most hormone target genes, some of which are involved in cell proliferation.
    [Show full text]
  • Quantitative Proteomics Methods for the Analysis of Histone Post-Translational Modifications
    Université de Montréal Quantitative Proteomics Methods for the Analysis of Histone Post-translational Modifications par Nebiyu Ali Abshiru Département de Chimie Faculté des arts et des sciences Thèse présentée à la Faculté des études supérieures et postdoctorales en vue de l’obtention du grade de philosophiae doctor (Ph.D.) en chimie Septembre 2015 ©Nebiyu Ali Abshiru, 2015 i Résumé Les histones sont des protéines nucléaires hautement conservées chez les cellules des eucaryotes. Elles permettent d’organiser et de compacter l’ADN sous la forme de nucléosomes, ceux-ci representant les sous unités de base de la chromatine. Les histones peuvent être modifiées par de nombreuses modifications post-traductionnelles (PTMs) telles que l’acétylation, la méthylation et la phosphorylation. Ces modifications jouent un rôle essentiel dans la réplication de l’ADN, la transcription et l’assemblage de la chromatine. L’abondance de ces modifications peut varier de facon significative lors du developpement des maladies incluant plusieurs types de cancer. Par exemple, la perte totale de la triméthylation sur H4K20 ainsi que l’acétylation sur H4K16 sont des marqueurs tumoraux spécifiques a certains types de cancer chez l’humain. Par conséquent, l’étude de ces modifications et des événements determinant la dynamique des leurs changements d’abondance sont des atouts importants pour mieux comprendre les fonctions cellulaires et moléculaires lors du développement de la maladie. De manière générale, les modifications des histones sont étudiées par des approches biochimiques telles que les immuno-buvardage de type Western ou les méthodes d’immunoprécipitation de la chromatine (ChIP). Cependant, ces approches présentent plusieurs inconvénients telles que le manque de spécificité ou la disponibilité des anticorps, leur coût ou encore la difficulté de les produire et de les valider.
    [Show full text]
  • Inferring Causal Relationships Among Different Histone Modifications and Gene Expression
    Downloaded from genome.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press Inferring Causal Relationships among Different Histone Modifications and Gene Expression Hong Yu*, Shanshan Zhu*, Bing Zhou*, Huiling Xue and Jing-Dong J. Han Chinese Academy of Sciences Key Laboratory of Molecular Developmental Biology, Center for Molecular Systems Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Datun Road, Beijing, 100101, China * These authors contributed equally to this work. Correspondence should be addressed to J.-D.J.H ([email protected]). Running title: histone modification and gene expression network Keywords: histone modification, gene expression, Bayesian network, causal relationship, epigenetics 1 Downloaded from genome.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press Abstract Histone modifications are major epigenetic factors regulating gene expression. They play important roles in maintaining stem cell pluripotency and in cancer pathogenesis. Different modifications may combine to form complex ‘histone codes’. Recent high throughput technologies, such as ‘ChIP-chip’ and ‘ChIP-seq’, have generated high resolution maps for many histone modifications on the human genome. Here we use these maps to build a Bayesian network to infer causal and combinatorial relationships among histone modifications and gene expression. A pilot network derived by the same method among polycomb group (PcG) genes and H3K27 trimethylation is accurately supported by current literature. Our unbiased network model among histone modifications is also well supported by cross validation results. It not only confirmed already known relationships, such as those of H3K27me3 to gene silencing, H3K4me3 to gene activation, and the effect of bivalent modification of both H3K4me3 and H3K27me3, but also identified many other relationships that may predict new epigenetic interactions important in epigenetic gene regulation.
    [Show full text]
  • Anti-H3r2me2(Asym) Antibody
    FOR RESEARCH USE ONLY! 01/20 Anti-H3R2me2(asym) Antibody CATALOG NO.: A2025-100 (100 µl) BACKGROUND DESCRIPTION: Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H3 family. Transcripts from this gene lack poly A tails; instead, they contain a palindromic termination element. This gene is located separately from the other H3 genes that are in the histone gene cluster on chromosome 6p22-p21.3. H3.4; H3/g; H3FT; H3t; HIST3H3; Histone H3; HIST1H3A ALTERNATE NAMES: ANTIBODY TYPE: Polyclonal HOST/ISOTYPE: Rabbit / IgG IMMUNOGEN: A synthetic methylated peptide targeting residues around Arginine 2 of human Histone H3 MOLECULAR WEIGHT: 17 kDa PURIFICATION: Affinity purified FORM: Liquid FORMULATION: In PBS with 0.02% sodium azide, 50% glycerol, pH 7.3 SPECIES REACTIVITY: Human, Mouse, Rat STORAGE CONDITIONS: Store at -20ºC. Avoid freeze / thaw cycles APPLICATIONS AND USAGE: WB 1:500 - 1:2000, IHC 1:50 - 1:200, IF 1:50 - 1:200 Note: This information is only intended as a guide. The optimal dilutions must be determined by the user Western blot analysis of H3R2me2(asym) expression in Dot-blot analysis of methylation peptides using Anti- HeLa cells and H3 protein.
    [Show full text]
  • MAP3K4/CBP-Regulated H2B Acetylation Controls Epithelial-Mesenchymal Transition in Trophoblast Stem Cells
    Cell Stem Cell Article MAP3K4/CBP-Regulated H2B Acetylation Controls Epithelial-Mesenchymal Transition in Trophoblast Stem Cells Amy N. Abell,1,2,7,* Nicole Vincent Jordan,1,2,7 Weichun Huang,6 Aleix Prat,2,4 Alicia A. Midland,5 Nancy L. Johnson,1,2 Deborah A. Granger,1,2 Piotr A. Mieczkowski,2,3 Charles M. Perou,2,4 Shawn M. Gomez,5 Leping Li,6 and Gary L. Johnson1,2,* 1Department of Pharmacology 2Lineberger Comprehensive Cancer Center 3Department of Genetics and Carolina Center for Genome Sciences 4Department of Genetics 5Department of Biomedical Engineering and Curriculum in Bioinformatics and Computational Biology University of North Carolina School of Medicine, Chapel Hill, NC 27599-7365, USA 6Biostatistics Branch, National Institute of Environmental Health Sciences RTP, NC 27709, USA 7These authors contributed equally to this work *Correspondence: [email protected] (A.N.A.), [email protected] (G.L.J.) DOI 10.1016/j.stem.2011.03.008 SUMMARY berg, 2008). During development, EMT is responsible for proper formation of the body plan and for differentiation of many tissues Epithelial stem cells self-renew while maintaining and organs. Examples of EMT in mammalian development multipotency, but the dependence of stem cell prop- include implantation, gastrulation, and neural crest formation erties on maintenance of the epithelial phenotype is (Thiery et al., 2009). Initiation of placenta formation regulated unclear. We previously showed that trophoblast by trophoectoderm differentiation is the first, and yet most poorly stem (TS) cells lacking the protein kinase MAP3K4 defined, developmental EMT. maintain properties of both stemness and epithelial- The commitment of stem cells to specialized cell types requires extensive reprogramming of gene expression, involving, in part, mesenchymal transition (EMT).
    [Show full text]
  • Symmetric-Di-Methyl-Histone H3 (Arg8) Rabbit
    Symmetric -Di-methyl-Histone H3 (Arg8) rabbit pAb ( H3R8me2(sy) pAb) www.ptm-biolab.com.cn Cat#: PTM-672 [email protected] Species: Rabbit 0571-2883 3567 Size: 100 μl Form: supplied in liquid form Recommend Species Swissprot Molecular Application Immunogen dilution Reactivity ID Weight H3R8me2(sy) WB, Dot, ELISA 1:2000 for WB H, M, R, Mk P68431 17 KDa -KLH **Species reactivity is determined by WB. Kept at -20oC after reconstituted. *** Anti-rabbit secondary antibodies must be used to detect this antibody. Source/Purification: This product is produced by immunizing rabbits with a synthetic di-methyl(sy) peptide corresponding to residues surrounding Arg8 of human histone H3. Antibodies are purified by protein A-conjugated agarose followed by di-methylated(sy) histone H3 (Arg8) peptide affinity chromatography. Recommended Applications: ELISA, Dot blot, Western blot. Recommended antibody dilution: WB: 1:2000 NOTE: For WB, incubate membrane with diluted antibody in 5% nonfat milk, 1 x TBS, 0.1% Tween- 20 for two hours at room temperature with gentle shaking. Prepare working dilution immediately before use. Use at an assay dependent concentration. Optimal dilutions/concentrations should be determined by the end user. Not yet tested in other applications Scientific Description: Proteins post-translational Modification (PTM) is an important reversible event controlling protein activity. The amino-terminal tails of core histones undergo multiple modifications in multiple sites, termed as “histone code” or “epigenetic code”. Lysine methylation or arginine methylation in core histones is a major determinant for the formation of active and inactive regions of the genome and therefore plays vital roles in multiple cellular events.
    [Show full text]
  • BMC Genomics Biomed Central
    BMC Genomics BioMed Central Research article Open Access Determination of enriched histone modifications in non-genic portions of the human genome Jeffrey A Rosenfeld1,2,3, Zhibin Wang4, Dustin E Schones4, Keji Zhao4, Rob DeSalle3 and Michael Q Zhang*1 Address: 1Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA, 2Department of Biology, New York University, New York, NY USA, 3American Museum of Natural History, New York, NY USA and 4Laboratory of Molecular Immunology, National Heart, Lung and Blood Institute, NIH, Bethesda, MD USA Email: Jeffrey A Rosenfeld - [email protected]; Zhibin Wang - [email protected]; Dustin E Schones - [email protected]; Keji Zhao - [email protected]; Rob DeSalle - [email protected]; Michael Q Zhang* - [email protected] * Corresponding author Published: 31 March 2009 Received: 8 September 2008 Accepted: 31 March 2009 BMC Genomics 2009, 10:143 doi:10.1186/1471-2164-10-143 This article is available from: http://www.biomedcentral.com/1471-2164/10/143 © 2009 Rosenfeld et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP- seq) has recently been used to identify the modification patterns for the methylation and acetylation of many different histone tails in genes and enhancers. Results: We have extended the analysis of histone modifications to gene deserts, pericentromeres and subtelomeres. Using data from human CD4+ T cells, we have found that each of these non- genic regions has a particular profile of histone modifications that distinguish it from the other non- coding regions.
    [Show full text]
  • Le G´Enome En Action
    LEGENOME´ EN ACTION SEQUENC´ ¸ AGE HAUT DEBIT´ ET EPIG´ ENOMIQUE´ Epig´enomique´ ? IFT6299 H2014 ? UdeM ? Mikl´osCs}ur¨os Regulation´ d’expression la transcription d’une region´ de l’ADN necessite´ ? liaisons proteine-ADN´ (facteur de transcription et son site reconnu) ? accessibilite´ de la chromatine REVIEWS Identification of regions that control transcription An initial step in the analysis of any gene is the identifi- cation of larger regions that might harbour regulatory control elements. Several advances have facilitated the prediction of such regions in the absence of knowl- edge about the specific characteristics of individual cis- Chromatin regulatory elements. These tools broadly fall into two categories: promoter (transcription start site; TSS) and enhancer detection. The methods are influenced Distal TFBS by sequence conservation between ORTHOLOGOUS genes (PHYLOGENETIC FOOTPRINTING), nucleotide composition and the assessment of available transcript data. Functional regulatory regions that control transcrip- tion rates tend to be proximal to the initiation site(s) of transcription. Although there is some circularity in the Co-activator complex data-collection process (regulatory sequences are sought near TSSs and are therefore found most often in these regions), the current set of laboratory-annotated regula- tory sequences indicates that sequences near a TSS are Transcription more likely to contain functionally important regulatory initiation complex Transcription controls than those that are more distal. However, specifi- initiation cation of the position of a TSS can be difficult. This is fur- ther complicated by the growing number of genes that CRM Proximal TFBS selectively use alternative start sites in certain contexts. Underlying most algorithms for promoter prediction is a Figure 1 | Components of transcriptional regulation.
    [Show full text]
  • Lysine Benzoylation Is a Histone Mark Regulated by SIRT2
    ARTICLE DOI: 10.1038/s41467-018-05567-w OPEN Lysine benzoylation is a histone mark regulated by SIRT2 He Huang1, Di Zhang1, Yi Wang2, Mathew Perez-Neut1, Zhen Han3, Y. George Zheng3, Quan Hao2 & Yingming Zhao1 Metabolic regulation of histone marks is associated with diverse biological processes through dynamically modulating chromatin structure and functions. Here we report the identification 1234567890():,; and characterization of a histone mark, lysine benzoylation (Kbz). Our study identifies 22 Kbz sites on histones from HepG2 and RAW cells. This type of histone mark can be stimulated by sodium benzoate (SB), an FDA-approved drug and a widely used chemical food preservative, via generation of benzoyl CoA. By ChIP-seq and RNA-seq analysis, we demonstrate that histone Kbz marks are associated with gene expression and have physiological relevance distinct from histone acetylation. In addition, we demonstrate that SIRT2, a NAD+-dependent protein deacetylase, removes histone Kbz both in vitro and in vivo. This study therefore reveals a new type of histone marks with potential physiological relevance and identifies possible non-canonical functions of a widely used chemical food preservative. 1 Ben May Department for Cancer Research, The University of Chicago, Chicago, IL 60637, USA. 2 School of Biomedical Sciences, University of Hong Kong, Hong Kong, China. 3 Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, GA 30602, USA. Correspondence and requests for materials should be addressed to Y.Z. (email: [email protected]) NATURE COMMUNICATIONS | (2018) 9:3374 | DOI: 10.1038/s41467-018-05567-w | www.nature.com/naturecommunications 1 ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/s41467-018-05567-w hromatin structure and transcriptional activity of genes and coelution analysis were performed to compare the synthetic are regulated by diverse protein posttranslational mod- peptide with its in vivo counterpart (Fig.
    [Show full text]
  • The Determinants of Nucleosome Patterns and the Impact of Phosphate Starvation on Nucleosome Patterns and Gene Expression in Rice
    Louisiana State University LSU Digital Commons LSU Doctoral Dissertations Graduate School 3-14-2018 The etD erminants of Nucleosome Patterns and the Impact of Phosphate Starvation on Nucleosome Patterns and Gene Expression in Rice Qi Zhang Louisiana State University and Agricultural and Mechanical College, [email protected] Follow this and additional works at: https://digitalcommons.lsu.edu/gradschool_dissertations Part of the Agronomy and Crop Sciences Commons, Bioinformatics Commons, Genetics Commons, Genomics Commons, and the Plant Breeding and Genetics Commons Recommended Citation Zhang, Qi, "The eD terminants of Nucleosome Patterns and the Impact of Phosphate Starvation on Nucleosome Patterns and Gene Expression in Rice" (2018). LSU Doctoral Dissertations. 4502. https://digitalcommons.lsu.edu/gradschool_dissertations/4502 This Dissertation is brought to you for free and open access by the Graduate School at LSU Digital Commons. It has been accepted for inclusion in LSU Doctoral Dissertations by an authorized graduate school editor of LSU Digital Commons. For more information, please [email protected]. THE DETERMINANTS OF NUCLEOSOME PATTERNS AND THE IMPACT OF PHOSPHATE STARVATION ON NUCLEOSOME PATTERNS AND GENE EXPRESSION IN RICE A Dissertation Submitted to the Graduate Faculty of the Louisiana State University and Agricultural and Mechanical College in partial fulfillment of the requirements for the degree of Doctor of Philosophy in The Department of Biological Sciences by Qi Zhang B.S., Sichuan Agricultural University, 2012 May 2018 ACKNOWLEDGEMENTS The journey towards the completion of the degree hasn’t been easy, and I would like to express my heartfelt gratitude to everyone who has been around me, listening to me, and supporting me.
    [Show full text]
  • Machine Learning & Learning Machines
    A Brief History • Bootstrapping • Bagging • Boosting (Schapire 1989) • Adaboost (Schapire 1995) What’s So Good About Adaboost • Improves classification accuracy • Can be used with many different classifiers • Commonly used in many areas • Simple to implement • Not prone to over-fitting Ex. “How May I Help You?” easy to find “rules of thumb” that are “often” correct • e.g.: “IF ‘card’ occurs in utterance THEN predict ‘Calling Card’ ” • hard to find single highly accurate prediction rule Bootstrap Estimation • Repeatedly draw n samples from D • For each set of samples, estimate a statistic • The bootstrap estimate is the mean of the individual estimates • Used to estimate a statistic (parameter) and its variance Bagging - Aggregate Bootstrapping • For i = 1 .. M – Draw n*<n samples from D with replacement – Learn classifier Ci • Final classifier is a vote of C1 .. CM • Increases classifier stability/reduces variance Boosting (Schapire 1989) • Randomly select n1 < n samples from D without replacement to obtain D1 – Train weak learner C1 • Select n2 < n samples from D with half of the samples misclassified by C1 to obtain D2 – Train weak learner C2 • Select all samples from D that C1 and C2 disagree on – Train weak learner C3 • Final classifier is vote of weak learners Adaboost - Adaptive Boosting • Instead of sampling, re-weight – Previous weak learner has only 50% accuracy over new distribution • Can be used to learn weak classifiers • Final classification based on weighted vote of weak classifiers Adaboost Terms • Learner = Hypothesis = Classifier • Weak Learner: < 50% error over any distribution • Strong Classifier: thresholded linear combination of weak learner outputs Basic idea of boosting technique In general Boosting Bagging Single Tree.
    [Show full text]