<<

Enhancer DNA methylation: implications for regulation

Allegra Angeloni1, 2 and Ozren Bogdanovic1, 3

1Genomics and Division, Garvan Institute of Medical Research, Sydney, New South Wales, 2010, Australia 2St Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Sydney, New South Wales, 2010, Australia. 3School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, 2052, Australia. Correspondence to: [email protected]

Abstract DNA methylation involves the addition of a methyl group to the fifth carbon of the pyrimidine cytosine ring (5-methylcytosine, 5mC). 5mC is widespread in vertebrate where it is predominantly found within CpG dinucleotides. In mammals, 5mC participates in long term silencing processes such as X inactivation, genomic imprinting, somatic silencing of germline , and silencing of repetitive DNA elements. The evidence for 5mC as a dynamic gene-regulatory mechanism is mostly limited to specific examples, and is far from being completely understood. Recent work from diverse model systems suggests that 5mC might not always act as a dominant repressive mechanism and that hypermethylated promoters and enhancers can be permissive to in vivo and in vitro. In this review we discuss the links between 5mC and enhancer activity, and evaluate the role of this biochemical mechanism in various biological contexts.

Introduction Chemical modifications to the DNA have the potential to transient and permanent changes in gene-regulatory activity that are heritable through cell division and through generations [1, 2]. In , the major form of DNA methylation involves the addition of a methyl group to the fifth position of the cytosine ring (5-methylcytosine, 5mC) by DNA methyltransferase enzymes (DNMTs) [3]. 5mC is established de novo by DNMT3A and DNMT3B and is maintained through DNMT1, which recognises hemi-methylated CpGs through the UHRF1 [4]. Maintenance of 5mC is essential to guarantee faithful re- establishment of 5mC on the newly synthesised strand following DNA replication. 5mC can be removed from the DNA through two major mechanisms: passive and active DNA demethylation. During passive DNA demethylation, 5mC is diluted following several rounds of DNA replication due to the absence of DNA methylation maintenance following replication (absence of DNMT1) [5, 6]. Conversely, in active DNA demethylation, 5mC can be removed enzymatically in a stepwise manner. Active DNA demethylation involves the iterative oxidation of 5mC by Ten-Eleven Translocation (TET) dioxygenase enzymes to the intermediates 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC), and 5- carboxylcytosine (5caC) followed by their subsequent removal via the BER pathway [7]. In the animal kingdom, the predominant form of 5mC is usually found within CpG dinucleotides, however 5mC in CpH dinucleotides (where H is A, C or T) that is common in plant genomes, has also been identified in invertebrate genomes [8], and mammalian embryonic stem cells (ESCs) and brain tissue [9, 10]. The levels of 5mC in the CpG context can vary greatly between and within animal phyla [11, 12]. For example, in insects, genomic CpG methylation levels can range from 0% (Diptera) to 14% (Blattella asahinai - asian cockroach) [13]. Invertebrates on average display lower global 5mC levels than vertebrates, with the majority of 5mC localised to gene bodies of active genes [14]. Such intragenic 5mC has also been observed in highly transcribed genes in mouse ESCs where it is thought to prevent spurious transcription initiation as a result of cryptic entry by RNA polymerase II [15]. Unlike invertebrates, vertebrate genomes are characterized by high genomic 5mC that often reaches 70-80% [16]. Due to this high CpG 5mC content, CpG sites have been evolutionarily depleted from vertebrate genomes, which is a phenomenon that can be explained by increased deamination (C to T base change) of 5mC [17]. The exception to this global hypermethylated state are CpG islands (CGIs), short stretches of high CpG density that are mostly unmethylated and that often coincide with gene promoters [18]. Initial experiments that aimed to demonstrate the gene-repressive potential of 5mC involved the transfection of in vitro methylated templates to cultured cells and the subsequent measurement of their activity when compared to unmethylated templates [19, 20]. This was followed by the description of precise biochemical mechanisms of 5mC-mediated gene repression such as the recruitment of methyl-CpG-binding (MBD) and deacetylase (HDAC) containing complexes [21, 22]. These important observations resulted in speculations of 5mC potentially acting as a dynamic gene regulatory mechanism, with broad reaching implications for embryogenesis and cell differentiation. The advent of massively parallel sequencing technologies a couple of decades later allowed for unbiased insights into the dynamics of 5mC in diverse model systems. Collectively, this work unravelled that CpG island (CGI) promoters (i.e the majority of promoters in vertebrates) are by and large unmethylated independently of gene activity [23-25]. Those observations notwithstanding, it is important to note that when methylated, CGIs are strongly associated with transcriptional repression. For example, in both and mammals, a set of germline CGI promoters become methylated at early embryonic stages, resulting in permanent somatic silencing of germline genes [26, 27]. Distal regulatory elements such as enhancers display much greater differences in 5mC patterning between different tissues, cell types and developmental stages [28-30]. In general, enhancer activity as defined by (TF) binding is anti- correlated with 5mC content [31, 32]. However, the relationship between 5mC removal and enhancer activation is still not well understood. Here we discuss the functional links between 5mC and the activity of gene-regulatory elements, with a particular focus on transcriptional enhancers.

5mC and the regulation CpG island promoters CpG island (CGI) promoters are the most common type in vertebrate genomes. They are characterised by high CpG richness and elevated G+C content [18]. Notably, the majority of CGI promoters are unmethylated independently of gene activity, even though some CGI promoters become methylated and silenced during and X chromosome inactivation [26, 33, 34]. It is also worth noting that forceful methylation using targeted technologies does not result in permanent CGI 5mC and silencing [35]. The constitutively unmethylated state of CGIs is established and maintained by active histone marks, transcription factor binding, recruitment of CxxC domain proteins, and TET [36-43]. When methylated, CGIs can act as recruitment sites for MBDs such as MBD1, MBD2, MBD4, and MeCP2 [44]. These proteins appear to be constitutively bound to a selection of methylated CGIs in both pluripotent and differentiated states [44], in line with the absence of major 5mC changes during this developmental transition [33]. On the other hand, Polycomb group proteins display more extensive dynamics at CGIs during development and differentiation [33]. Overall, CGIs act as platforms for transcriptional regulation that are largely independent of 5mC dynamics. In the modest number of cases where 5mC is targeted to CGIs it appears that this process acts as an additional silencing safe-lock rather than the primary determinant of gene activity [26, 45].

5mC levels at enhancers Enhancers are traditionally defined as distal cis-acting regulatory elements that enhance transcription through recruitment of transcription factors (TFs) and co- proteins [46]. Enhancers activate and regulate expression of their target genes over distances that range from several hundreds of base pairs to a few megabases. In the absence of enhancers, transcription of associated genes is often weakened. On average, enhancers display low CpG density, even though CpG-rich enhancers have also been described [24, 28, 47, 48]. Based on their activity, enhancers can broadly be classified as: active, primed, poised, or silent. Active enhancers are characterised by depletion and the presence of active histone modifications such as and [46, 49]. Poised enhancers are depleted of and decorated with the permissive H3K4me1 and the repressive Polycomb H3K27me3 mark, whereas primed enhancers bear H3K4me1 and also display nucleosome depletion. Enhancers exhibit variable 5mC levels (10-50%) that are dependent on the extent of TF binding [31, 32]. An ENCODE study explored these trends globally by performing linear regression analysis between 5mC and accessibility (as measured by DNAseI hypersensitivity) at > 34,000 genomic sites, and revealed a significant association [50]. Increased 5mC was negatively correlated with chromatin accessibility at 20% of the examined sites. The same study compared TF expression levels to average 5mC levels at their corresponding binding sites across the . Similarly, a significant negative correlation was found between TF expression and 5mC content at TF binding sites for 70% of the TFs examined, arguing that 5mC deposition might be a direct consequence of the absence of TF expression [50]. Altogether, data from the studies discussed above suggest that active enhancers on average display less 5mC than poised or silent enhancers, yet their 5mC levels are not as low as 5mC levels observed in constitutively unmethylated CGIs [32].

5mC, 5hmC and TF-binding While TF-binding at enhancers is necessary for transcriptional regulation of the associated genes, it is not yet established whether TF-binding induces the formation and maintenance of hypomethylation at enhancers. A recent high throughput study examined the effects of 5mC on TF binding and found that the majority of TFs assayed (33%) were not repelled by 5mC, whereas 34% and 23% of TFs displayed increased and decreased binding to methylated sequences, respectively [51] (Fig. 1a). Moreover, recent work has demonstrated that 5mC as well its oxidative intermediates: 5hmC, 5fC, and 5caC can recruit transcriptional activators in vitro [52]. The protein CTCF has been studied extensively in terms of its interactions with 5mC. Data from mouse embryonic stem cells (ESCs) suggest that CTCF can mediate active DNA demethylation at bound regions [31] and that the introduction of CTCF sites in stably-integrated reporter constructs results in decreased 5mC [32]. Similarly, reintroduction of TFs is able to reverse a hypermethylated regulatory region to a hypomethylated state once bound by a TF. This has been demonstrated in mouse ESCs that lack the transcription factor REST, in which REST-/- cells display increased 5mC and reduced 5hmC at REST binding sites compared to wild-type cells. Reintroduction of REST leads to decrease in 5mC and increase in 5hmC at binding sites, indicative of an active, TET-dependent mechanism of demethylation [31]. This is further supported by studies of the glucocorticoid receptor binding to tissue-specific enhancers, in which binding of the receptor induces DNA demethylation resulting in enhancer activation [53]. Given the modest number of reports describing such examples, it is currently difficult to propose a unifying molecular mechanism of TF-mediated demethylation of regulatory regions. It is likely however that TET methylcytosine dioxygenases play a role in at least a subset of such interactions (Fig. 1b,c). Enrichment of 5hmC, the major 5mC oxidised derivative, has been identified on predicted enhancers in mammalian ESCs [54, 55]. In human ESCs, 5hmC-enriched enhancers coincide with binding sites of pluripotency regulators such as NANOG and OCT4 [54] and TET2 knockouts in mouse ESCs result in extensive loss of 5hmC at enhancers, accompanied by 5mC enrichment (hypermethylation) and reduction of enhancer activity [55]. This is indicative of robust 5mC oxidation and TF turnover on key regulatory elements associated with mammalian pluripotency. It is also worth noting that DNMT1 has low binding affinity for 5hmC, therefore 5hmC may further promote passive DNA demethylation by excluding DNMT1 and thus preventing methylation of cytosines on the nascent DNA strand [56]. Mechanistically, 5hmC has been shown to disrupt nucleosome stability in vitro, and to decrease binding affinity of a major transcriptional , MeCP2 [57, 58]. Taken together, this suggests that 5hmC might contribute to enhancer activity by promoting both active and passive DNA demethylation and by influencing chromatin structure.

Enhancer methylation and embryonic development Once established in adult tissues, 5mC is a relatively static mark. However, during early mammalian embryogenesis it undergoes two rounds of global reprogramming. The first wave of genome-wide demethylation occurs shortly after fertilisation whereas the second wave takes place in primordial germ cells (PGCs) [1, 2]. While such global reprograming appears to be a mammalian-specific phenomenon, active DNA demethylation at distal regulatory elements during vertebrate body plan formation is an evolutionarily conserved trait. A wave of active DNA demethylation at thousands of enhancers associated with conserved regulatory pathways such as Wnt, Notch and transforming growth factor (TGF)-β results in coordinated transcriptional changes and morphological similarities during the vertebrate phylotypic period, the most highly conserved stage of vertebrate embryogenesis [28]. The necessity of this epigenetic remodelling event is highlighted through triple TET zebrafish morphants, which display embryonic lethality and severe developmental including short and blended axes, reduced pigmentation and impaired head structures, and which coincide with increased 5mC at phylotypic stage enhancers. Notably, these ~ 2500 phylotypic stage enhancers are highly CpG rich, displaying a CpG density comparable to that of CGIs, which might make them more prone to regulation via 5mC. Importantly, triple TET morphants also display reduced chromatin accessibility at phylotypic enhancers, as measured by ATAC-seq, indicative of 5mC being an upstream regulatory mark, the removal of which might be required for proper enhancer function. Similar impact on chromatin accessibility at regulatory regions has been demonstrated in mammalian systems where TET2/TET3 function was ablated [59, 60]. Studies of mouse and human organogenesis during early embryogenesis have also revealed a conserved and elaborate methylation reprogramming of enhancers strongly associated with the development of almost all human organs, displaying the canonical inverse correlation between 5mC levels and enhancer activity [61]. Moreover, recent work suggests that developmental removal of enhancer methylation might have preceded the origin of vertebrates. The invertebrate chordate Amphioxus (Branchiostoma lanceolatum) has been shown to display robust demethylation of regulatory regions (as defined by ATAC-seq) during tissue differentiation [62]. Thus, evolutionary conservation of developmental enhancer demethylation and the expression of associated developmental genes, appears to be a feature required for proper embryonic development of vertebrates and beyond.

Compatibility of 5mC and active chromatin signatures Despite the insights discussed above, the relationships between 5mC levels and enhancer activity are far from being fully understood. For example, in mouse genomes, distal regulatory elements display tissue-specific differential methylation. A subset of these differentially methylated regions is hypomethylated yet lack the active histone marks H3K4me1 and H3K27ac, and are consequently inactive in adult tissues. However, these enhancers ("vestigial" enhancers) are active during embryonic development, displaying hypomethylation as well as H3K4me1 and H3K27ac, with their putative targets being developmental genes, suggesting that a form of “epigenetic memory” is retained from development to adulthood in the form of DNA hypomethylation [29, 63]. The methylation status of an enhancer can therefore be utilised as an indicator of the past activity of an enhancer. More notably however, these findings demonstrate that hypomethylation at these regions is not necessary or sufficient for enhancer activity. Moreover, a recent study exploring the correlation between chromatin structure and 5mC has challenged the typical inverse relationship between 5mC content and enhancer activity, demonstrating that 5mC can co-exist with active histone marks at distal regulatory sequences [64]. This report suggests that chromatin can exist in a bivalent state, characterised by the coexistence of both active histone marks such as H3K27ac and 5mC in human cells. Specifically, high levels of 5mC were detected on the same nucleosome with H3K27ac marks at regular and super-enhancers in the , co-existing in an unprecedented relationship that is not observed at promoters. Moreover, some of these regions were highly CpG dense, displaying CpG content comparable to that of CGIs. As these bivalent regions appear to be located outside TF- binding sites in most of the cases, it is possible that 5mC in this context does not repress enhancer activity, but rather stabilises DNA and defines demethylated TF-binding sites [64]. A similar observation regarding the compatibility of 5mC and active histone marks was also recently made in zebrafish embryos [65]. These observations notwithstanding, it is important to note that the majority of vertebrate enhancers are CpG poor and that it is not clear what impact 5mC has on their activity. Despite the compelling evidence suggesting that 5mC state plays an important role in determining enhancer activity, whether there is a strict causal relationship between methylation and enhancer activity remains unresolved.

Conclusion While 5mC is one of the most well studied epigenetic modifications, the functional consequence of this mark in certain genomic contexts remains enigmatic. The significance of 5mC at distal regulatory elements is augmented by the fact that it is an evolutionarily conserved feature that must be carefully orchestrated for normal embryonic development and cell differentiation. Evidently, a number of studies support that enhancer methylation influences transcription of the associated gene, however there is currently a lack of a definitive causal link that explains whether 5mC is directly responsible for enhancer activity. Furthermore, the chromatin environment that is necessary for 5mC to influence is not entirely understood. Studies performed in diverse biological systems typically suggest that active enhancers display lower 5mC levels than poised or silent enhancers, as well as TF binding, 5hmC enrichment and the presence of active histone marks H3K4me1 and H3K27ac [32]. Conversely, hypermethylation at enhancers is generally associated with reduced chromatin accessibility and decreased TF binding. This consensus has been challenged by recent studies on the epigenetic environment of enhancers. Enhancers can exist in a bivalent state, displaying the co-existence of characteristic active and inactive biochemical features [66]. It is also important to note that enhancer CpG density might play a major role in determining the regulatory inputs and outputs of enhancers. Thousands of CGI enhancers identified in human tissues display considerable regulatory differences when compared to their CpG poor counterparts. For example, CGI enhancers are: i) more highly conserved, ii) more highly enriched in R loops and 3D genomic contacts, iii) on average provide stronger signal in functional assays, and iv) are prone to hypermethylation in cancer [47]. Future work involving technologies such as precise epigenome editing [67] and single cell genomic and transcriptomic assays should aim to conclusively define how and to what extent 5mC presence at distinct enhancer classes is necessary for interaction with the linked promoter and consequently for the control of gene expression.

Summary

• In general, at least in vertebrates, enhancer activity is correlated with DNA hypomethylation. Some studies, however, suggest that 5mC and active chromatin marks such as H3K27ac are compatible at enhancers. • DNA hypomethylation is not sufficient to drive enhancer activity as evidenced by embryonic enhancers that remain hypomethylated and yet are inactive in adult mammalian tissues. • A number of studies have shown that TF binding to enhancers is sufficient to promote 5mC turnover that can be mediated by TET-dependent mechanisms of active demethylation. • Developmental 5mC remodeling at enhancers appears to be a conserved feature of vertebrate and perhaps even chordate and to some extent invertebrate development.

Conflicts of interest The authors declare no conflict of interest associated with this manuscript.

Acknowledgments This work was supported by the ARC Discovery Project (DP190103852) to O.B. The figures were made with the help of Biorender software.

Author contributions A.A and O.B wrote the paper. O.B drafted the figures.

References 1 Lee, H. J., Hore, T. A. and Reik, W. (2014) Reprogramming the methylome: erasing memory and creating diversity. Cell stem cell. 14, 710-719 2 Skvortsova, K., Iovino, N. and Bogdanovic, O. (2018) Functions and mechanisms of epigenetic inheritance in animals. Nat Rev Mol Cell Biol 3 Bestor, T. H. (2000) The DNA methyltransferases of mammals. Hum Mol Genet. 9, 2395-2402 4 Liu, X., Gao, Q., Li, P., Zhao, Q., Zhang, J., Li, J., Koseki, H. and Wong, J. (2013) UHRF1 targets DNMT1 for DNA methylation through cooperative binding of hemi- methylated DNA and methylated H3K9. Nature communications. 4, 1563 5 Rougier, N., Bourc'his, D., Gomes, D. M., Niveleau, A., Plachot, M., Paldi, A. and Viegas-Pequignot, E. (1998) Chromosome methylation patterns during mammalian preimplantation development. Genes & development. 12, 2108-2113 6 Cardoso, M. C. and Leonhardt, H. (1999) DNA methyltransferase is actively retained in the cytoplasm during early development. The Journal of cell biology. 147, 25-32 7 Kohli, R. M. and Zhang, Y. (2013) TET enzymes, TDG and the dynamics of DNA demethylation. Nature. 502, 472-479 8 Bonasio, R., Li, Q., Lian, J., Mutti, N. S., Jin, L., Zhao, H., Zhang, P., Wen, P., Xiang, H., Ding, Y., Jin, Z., Shen, S. S., Wang, Z., Wang, W., Wang, J., Berger, S. L., Liebig, J., Zhang, G. and Reinberg, D. (2012) Genome-wide and caste-specific DNA methylomes of the ants Camponotus floridanus and Harpegnathos saltator. Current biology : CB. 22, 1755-1764 9 Lister, R., Mukamel, E. A., Nery, J. R., Urich, M., Puddifoot, C. A., Johnson, N. D., Lucero, J., Huang, Y., Dwork, A. J., Schultz, M. D., Yu, M., Tonti-Filippini, J., Heyn, H., Hu, S., Wu, J. C., Rao, A., Esteller, M., He, C., Haghighi, F. G., Sejnowski, T. J., Behrens, M. M. and Ecker, J. R. (2013) Global epigenomic reconfiguration during mammalian brain development. Science. 341, 1237905 10 He, Y. and Ecker, J. R. (2015) Non-CG Methylation in the Human Genome. Annu Rev Genomics Hum Genet. 16, 55-77 11 Feng, S., Cokus, S. J., Zhang, X., Chen, P. Y., Bostick, M., Goll, M. G., Hetzel, J., Jain, J., Strauss, S. H., Halpern, M. E., Ukomadu, C., Sadler, K. C., Pradhan, S., Pellegrini, M. and Jacobsen, S. E. (2010) Conservation and divergence of methylation patterning in plants and animals. Proceedings of the National Academy of Sciences of the United States of America. 107, 8689-8694 12 Zemach, A., McDaniel, I. E., Silva, P. and Zilberman, D. (2010) Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science. 328, 916-919 13 Bewick, A. J., Vogel, K. J., Moore, A. J. and Schmitz, R. J. (2017) of DNA Methylation across Insects. Mol Biol Evol. 34, 654-665 14 Suzuki, M. M., Kerr, A. R., De Sousa, D. and Bird, A. (2007) CpG methylation is targeted to transcription units in an invertebrate genome. Genome research. 17, 625-631 15 Neri, F., Rapelli, S., Krepelova, A., Incarnato, D., Parlato, C., Basile, G., Maldotti, M., Anselmi, F. and Oliviero, S. (2017) Intragenic DNA methylation prevents spurious transcription initiation. Nature. 543, 72-77 16 Bird, A. (2002) DNA methylation patterns and epigenetic memory. Genes & development. 16, 6-21 17 Duncan, B. K. and Miller, J. H. (1980) Mutagenic deamination of cytosine residues in DNA. Nature. 287, 560-561 18 Deaton, A. M. and Bird, A. (2011) CpG islands and the regulation of transcription. Genes & development. 25, 1010-1022 19 Busslinger, M., Hurst, J. and Flavell, R. A. (1983) DNA methylation and the regulation of globin gene expression. Cell. 34, 197-206 20 Stein, R., Razin, A. and Cedar, H. (1982) In vitro methylation of the hamster adenine phosphoribosyltransferase gene inhibits its expression in mouse L cells. Proceedings of the National Academy of Sciences of the United States of America. 79, 3418-3422 21 Jones, P. L., Veenstra, G. J., Wade, P. A., Vermaak, D., Kass, S. U., Landsberger, N., Strouboulis, J. and Wolffe, A. P. (1998) Methylated DNA and MeCP2 recruit to repress transcription. Nature . 19, 187-191 22 Nan, X., Ng, H. H., Johnson, C. A., Laherty, C. D., Turner, B. M., Eisenman, R. N. and Bird, A. (1998) Transcriptional repression by the methyl-CpG-binding protein MeCP2 involves a histone deacetylase complex. Nature. 393, 386-389 23 Lister, R., Pelizzola, M., Dowen, R. H., Hawkins, R. D., Hon, G., Tonti-Filippini, J., Nery, J. R., Lee, L., Ye, Z., Ngo, Q. M., Edsall, L., Antosiewicz-Bourget, J., Stewart, R., Ruotti, V., Millar, A. H., Thomson, J. A., Ren, B. and Ecker, J. R. (2009) Human DNA methylomes at base resolution show widespread epigenomic differences. Nature. 462, 315- 322 24 Long, H. K., Sims, D., Heger, A., Blackledge, N. P., Kutter, C., Wright, M. L., Grutzner, F., Odom, D. T., Patient, R., Ponting, C. P. and Klose, R. J. (2013) Epigenetic conservation at gene regulatory elements revealed by non-methylated DNA profiling in seven vertebrates. eLife. 2, e00348 25 Weber, M., Hellmann, I., Stadler, M. B., Ramos, L., Paabo, S., Rebhan, M. and Schubeler, D. (2007) Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome. Nature genetics. 39, 457-466 26 Borgel, J., Guibert, S., Li, Y., Chiba, H., Schubeler, D., Sasaki, H., Forne, T. and Weber, M. (2010) Targets and dynamics of promoter DNA methylation during early mouse development. Nature genetics. 42, 1093-1100 27 Skvortsova, K., Tarbashevich, K., Stehling, M., Lister, R., Irimia, M., Raz, E. and Bogdanovic, O. (2019) Retention of paternal DNA methylome in the developing zebrafish germline. Nature communications. 10, 3054 28 Bogdanovic, O., Smits, A. H., de la Calle Mustienes, E., Tena, J. J., Ford, E., Williams, R., Senanayake, U., Schultz, M. D., Hontelez, S., van Kruijsbergen, I., Rayon, T., Gnerlich, F., Carell, T., Veenstra, G. J., Manzanares, M., Sauka-Spengler, T., Ecker, J. R., Vermeulen, M., Gomez-Skarmeta, J. L. and Lister, R. (2016) Active DNA demethylation at enhancers during the vertebrate phylotypic period. Nature genetics. 48, 417-426 29 Hon, G. C., Rajagopal, N., Shen, Y., McCleary, D. F., Yue, F., Dang, M. D. and Ren, B. (2013) Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. Nature genetics. 45, 1198-1206 30 Ziller, M. J., Gu, H., Muller, F., Donaghey, J., Tsai, L. T., Kohlbacher, O., De Jager, P. L., Rosen, E. D., Bennett, D. A., Bernstein, B. E., Gnirke, A. and Meissner, A. (2013) Charting a dynamic DNA methylation landscape of the human genome. Nature. 500, 477-481 31 Feldmann, A., Ivanek, R., Murr, R., Gaidatzis, D., Burger, L. and Schubeler, D. (2013) Transcription factor occupancy can mediate active turnover of DNA methylation at regulatory regions. PLoS genetics. 9, e1003994 32 Stadler, M. B., Murr, R., Burger, L., Ivanek, R., Lienert, F., Scholer, A., van Nimwegen, E., Wirbelauer, C., Oakeley, E. J., Gaidatzis, D., Tiwari, V. K. and Schubeler, D. (2011) DNA-binding factors shape the mouse methylome at distal regulatory regions. Nature. 480, 490-495 33 Mohn, F., Weber, M., Rebhan, M., Roloff, T. C., Richter, J., Stadler, M. B., Bibel, M. and Schubeler, D. (2008) Lineage-specific polycomb targets and de novo DNA methylation define restriction and potential of neuronal progenitors. Molecular cell. 30, 755-766 34 Payer, B. and Lee, J. T. (2008) X chromosome dosage compensation: how mammals keep the balance. Annu Rev Genet. 42, 733-772 35 Ford, E. E., Grimmer, M. R., Stolzenburg, S., Bogdanovic, O., De Mendoza, A., Farnham, P. J., Blancafort, P. and Lister, R. (2017) Frequent lack of repressive capacity of promoter DNA methylation identified through genome-wide epigenomic manipulation. biorXiv 36 Wu, H., D'Alessio, A. C., Ito, S., Xia, K., Wang, Z., Cui, K., Zhao, K., Sun, Y. E. and Zhang, Y. (2011) Dual functions of Tet1 in transcriptional regulation in mouse embryonic stem cells. Nature. 473, 389-393 37 Brown, D. A., Di Cerbo, V., Feldmann, A., Ahn, J., Ito, S., Blackledge, N. P., Nakayama, M., McClellan, M., Dimitrova, E., Turberfield, A. H., Long, H. K., King, H. W., Kriaucionis, S., Schermelleh, L., Kutateladze, T. G., Koseki, H. and Klose, R. J. (2017) The SET1 Complex Selects Actively Transcribed Target Genes via Multivalent Interaction with CpG Island Chromatin. Cell reports. 20, 2313-2327 38 Boulard, M., Edwards, J. R. and Bestor, T. H. (2015) FBXL10 protects Polycomb- bound genes from hypermethylation. Nature genetics. 47, 479-485 39 Blackledge, N. P. and Klose, R. (2011) CpG island chromatin: a platform for gene regulation. Epigenetics : official journal of the DNA Methylation Society. 6, 147-152 40 Krebs, A. R., Dessus-Babus, S., Burger, L. and Schubeler, D. (2014) High-throughput engineering of a mammalian genome reveals building principles of methylation states at CG rich regions. eLife. 3, e04094 41 Wachter, E., Quante, T., Merusi, C., Arczewska, A., Stewart, F., Webb, S. and Bird, A. (2014) Synthetic CpG islands reveal DNA sequence determinants of chromatin structure. eLife. 3, e03397 42 Dimitrova, E., Kondo, T., Feldmann, A., Nakayama, M., Koseki, Y., Konietzny, R., Kessler, B. M., Koseki, H. and Klose, R. J. (2018) FBXL19 recruits CDK- to CpG islands of developmental genes priming them for activation during lineage commitment. eLife. 7 43 Thomson, J. P., Skene, P. J., Selfridge, J., Clouaire, T., Guy, J., Webb, S., Kerr, A. R., Deaton, A., Andrews, R., James, K. D., Turner, D. J., Illingworth, R. and Bird, A. (2010) CpG islands influence chromatin structure via the CpG-binding protein Cfp1. Nature. 464, 1082-1086 44 Baubec, T., Ivanek, R., Lienert, F. and Schubeler, D. (2013) Methylation-dependent and -independent genomic targeting principles of the MBD protein family. Cell. 153, 480- 492 45 Okamoto, I. and Heard, E. (2009) Lessons from comparative analysis of X- chromosome inactivation in mammals. Chromosome Res. 17, 659-669 46 Shlyueva, D., Stampfel, G. and Stark, A. (2014) Transcriptional enhancers: from properties to genome-wide predictions. Nature reviews. Genetics. 15, 272-286 47 Bell, J. S. K. and Vertino, P. M. (2017) Orphan CpG islands define a novel class of highly active enhancers. Epigenetics : official journal of the DNA Methylation Society. 12, 449-464 48 Bogdanovic, O., Fernandez-Minan, A., Tena, J. J., de la Calle-Mustienes, E., Hidalgo, C., van Kruysbergen, I., van Heeringen, S. J., Veenstra, G. J. and Gomez-Skarmeta, J. L. (2012) Dynamics of enhancer chromatin signatures mark the transition from pluripotency to cell specification during embryogenesis. Genome research. 22, 2043-2053 49 Rada-Iglesias, A., Bajpai, R., Swigut, T., Brugmann, S. A., Flynn, R. A. and Wysocka, J. (2011) A unique chromatin signature uncovers early developmental enhancers in . Nature. 470, 279-283 50 Thurman, R. E., Rynes, E., Humbert, R., Vierstra, J., Maurano, M. T., Haugen, E., Sheffield, N. C., Stergachis, A. B., Wang, H., Vernot, B., Garg, K., John, S., Sandstrom, R., Bates, D., Boatman, L., Canfield, T. K., Diegel, M., Dunn, D., Ebersol, A. K., Frum, T., Giste, E., Johnson, A. K., Johnson, E. M., Kutyavin, T., Lajoie, B., Lee, B. K., Lee, K., London, D., Lotakis, D., Neph, S., Neri, F., Nguyen, E. D., Qu, H., Reynolds, A. P., Roach, V., Safi, A., Sanchez, M. E., Sanyal, A., Shafer, A., Simon, J. M., Song, L., Vong, S., Weaver, M., Yan, Y., Zhang, Z., Zhang, Z., Lenhard, B., Tewari, M., Dorschner, M. O., Hansen, R. S., Navas, P. A., Stamatoyannopoulos, G., Iyer, V. R., Lieb, J. D., Sunyaev, S. R., Akey, J. M., Sabo, P. J., Kaul, R., Furey, T. S., Dekker, J., Crawford, G. E. and Stamatoyannopoulos, J. A. (2012) The accessible chromatin landscape of the human genome. Nature. 489, 75-82 51 Yin, Y., Morgunova, E., Jolma, A., Kaasinen, E., Sahu, B., Khund-Sayeed, S., Das, P. K., Kivioja, T., Dave, K., Zhong, F., Nitta, K. R., Taipale, M., Popov, A., Ginno, P. A., Domcke, S., Yan, J., Schubeler, D., Vinson, C. and Taipale, J. (2017) Impact of cytosine methylation on DNA binding specificities of human transcription factors. Science. 356 52 Spruijt, C. G., Gnerlich, F., Smits, A. H., Pfaffeneder, T., Jansen, P. W., Bauer, C., Munzel, M., Wagner, M., Muller, M., Khan, F., Eberl, H. C., Mensinga, A., Brinkman, A. B., Lephikov, K., Muller, U., Walter, J., Boelens, R., van Ingen, H., Leonhardt, H., Carell, T. and Vermeulen, M. (2013) Dynamic readers for 5-(hydroxy)methylcytosine and its oxidized derivatives. Cell. 152, 1146-1159 53 Wiench, M., John, S., Baek, S., Johnson, T. A., Sung, M. H., Escobar, T., Simmons, C. A., Pearce, K. H., Biddie, S. C., Sabo, P. J., Thurman, R. E., Stamatoyannopoulos, J. A. and Hager, G. L. (2011) DNA methylation status predicts cell type-specific enhancer activity. EMBO J. 30, 3028-3039 54 Stroud, H., Feng, S., Morey Kinney, S., Pradhan, S. and Jacobsen, S. E. (2011) 5- Hydroxymethylcytosine is associated with enhancers and gene bodies in human embryonic stem cells. Genome biology. 12, R54 55 Hon, G. C., Song, C. X., Du, T., Jin, F., Selvaraj, S., Lee, A. Y., Yen, C. A., Ye, Z., Mao, S. Q., Wang, B. A., Kuan, S., Edsall, L. E., Zhao, B. S., Xu, G. L., He, C. and Ren, B. (2014) 5mC oxidation by Tet2 modulates enhancer activity and timing of transcriptome reprogramming during differentiation. Molecular cell. 56, 286-297 56 Hashimoto, H., Liu, Y., Upadhyay, A. K., Chang, Y., Howerton, S. B., Vertino, P. M., Zhang, X. and Cheng, X. (2012) Recognition and potential mechanisms for replication and erasure of cytosine hydroxymethylation. Nucleic acids research. 40, 4841-4849 57 Khrapunov, S., Warren, C., Cheng, H., Berko, E. R., Greally, J. M. and Brenowitz, M. (2014) Unusual characteristics of the DNA binding domain of epigenetic regulatory protein MeCP2 determine its binding specificity. Biochemistry. 53, 3379-3391 58 Mendonca, A., Chang, E. H., Liu, W. and Yuan, C. (2014) Hydroxymethylation of DNA influences nucleosomal conformation and stability in vitro. Biochim Biophys Acta. 1839, 1323-1329 59 Lio, C. W., Zhang, J., Gonzalez-Avalos, E., Hogan, P. G., Chang, X. and Rao, A. (2016) Tet2 and Tet3 cooperate with B-lineage transcription factors to regulate DNA modification and chromatin accessibility. eLife. 5 60 Reizel, Y., Sabag, O., Skversky, Y., Spiro, A., Steinberg, B., Bernstein, D., Wang, A., Kieckhaefer, J., Li, C., Pikarsky, E., Levin-Klein, R., Goren, A., Rajewsky, K., Kaestner, K. H. and Cedar, H. (2018) Postnatal DNA demethylation and its role in tissue maturation. Nature communications. 9, 2040 61 Li, C., Fan, Y., Li, G., Xu, X., Duan, J., Li, R., Kang, X., Ma, X., Chen, X., Ke, Y., Yan, J., Lian, Y., Liu, P., Zhao, Y., Zhao, H., Chen, Y., Yu, Y. and Liu, J. (2018) DNA methylation reprogramming of functional elements during mammalian embryonic development. Cell Discov. 4, 41 62 Marletaz, F., Firbas, P. N., Maeso, I., Tena, J. J., Bogdanovic, O., Perry, M., Wyatt, C. D. R., de la Calle-Mustienes, E., Bertrand, S., Burguera, D., Acemel, R. D., van Heeringen, S. J., Naranjo, S., Herrera-Ubeda, C., Skvortsova, K., Jimenez-Gancedo, S., Aldea, D., Marquez, Y., Buono, L., Kozmikova, I., Permanyer, J., Louis, A., Albuixech- Crespo, B., Le Petillon, Y., Leon, A., Subirana, L., Balwierz, P. J., Duckett, P. E., Farahani, E., Aury, J. M., Mangenot, S., Wincker, P., Albalat, R., Benito-Gutierrez, E., Canestro, C., Castro, F., D'Aniello, S., Ferrier, D. E. K., Huang, S., Laudet, V., Marais, G. A. B., Pontarotti, P., Schubert, M., Seitz, H., Somorjai, I., Takahashi, T., Mirabeau, O., Xu, A., Yu, J. K., Carninci, P., Martinez-Morales, J. R., Crollius, H. R., Kozmik, Z., Weirauch, M. T., Garcia-Fernandez, J., Lister, R., Lenhard, B., Holland, P. W. H., Escriva, H., Gomez- Skarmeta, J. L. and Irimia, M. (2018) Amphioxus functional genomics and the origins of vertebrate gene regulation. Nature 63 Jadhav, U., Cavazza, A., Banerjee, K. K., Xie, H., O'Neill, N. K., Saenz-Vash, V., Herbert, Z., Madha, S., Orkin, S. H., Zhai, H. and Shivdasani, R. A. (2019) Extensive Recovery of Embryonic Enhancer and Gene Memory Stored in Hypomethylated Enhancer DNA. Molecular cell. 74, 542-554 e545 64 Charlet, J., Duymich, C. E., Lay, F. D., Mundbjerg, K., Dalsgaard Sorensen, K., Liang, G. and Jones, P. A. (2016) Bivalent Regions of Cytosine Methylation and H3K27 Acetylation Suggest an Active Role for DNA Methylation at Enhancers. Molecular cell. 62, 422-431 65 Kaaij, L. J., Mokry, M., Zhou, M., Musheev, M., Geeven, G., Melquiond, A. S., de Jesus Domingues, A. M., de Laat, W., Niehrs, C., Smith, A. D. and Ketting, R. F. (2016) Enhancers reside in a unique epigenetic environment during early zebrafish development. Genome biology. 17, 146 66 Charlet, J., Duymich, C. E., Lay, F. D., Mundbjerg, K., Dalsgaard Sorensen, K., Liang, G. and Jones, P. A. (2016) Bivalent Regions of Cytosine Methylation and H3K27 Acetylation Suggest an Active Role for DNA Methylation at Enhancers. Molecular cell. 62, 422-431 67 Pflueger, C., Tan, D., Swain, T., Nguyen, T., Pflueger, J., Nefzger, C., Polo, J. M., Ford, E. and Lister, R. (2018) A modular dCas9-SunTag DNMT3A epigenome editing system overcomes pervasive off-target activity of direct fusion dCas9-DNMT3A constructs. Genome research

Figure 1. 5mC turnover at gene-regulatory regions. a) 5mC sensitivity of > 500 TFs assayed through methyl-SELEX and bisulfite-SELEX approaches [51]. The majority of TFs were not affected by 5mC whereas more than 30% of assayed TF displayed increased affinity toward methylated motifs. This was followed by TFs that displayed reduced affinity towards 5mC (> 20%). A small percentage of TFs displayed minor or multiple effects. b-c) Schematic examples of cell lines where a transcription factor (TF) is silenced (b) or expressed (c). Work from multiple sources [31, 32, 50] suggests that TF occupancy anti-correlates with 5mC at regulatory regions and that forceful expression of TFs is sufficient to induce a hypomethylated state at TF binding sites, likely through the recruitment of TET demethylases.

Figure 2. Non-canonical chromatin configuration at enhancers. a) Vestigial enhancers [29] are embryonic enhancers that remain hypomethylated in adult tissues, however, without displaying any active chromatin marks. It is unclear to what extent TF binding participates in the formation of this type of chromatin and what the function of such "epigenetic memory is". b) Bivalent enhancers display active chromatin marks such as H3K27ac ad 5mC on the same nucleosome. Such enhancers also encompass super-enhancers (enhancer clusters). It is postulated that the combination of H3K27ac and 5mC participates in enhancer integrity and the definition of TF binding sites.