Published online 31 May 2018 Nucleic Acids Research, 2018, Vol. 46, No. 14 7097–7107 doi: 10.1093/nar/gky483 Choice of binding sites for CTCFL compared to CTCF is driven by chromatin and by sequence preference Philipp Bergmaier1, Oliver Weth1, Sven Dienstbach1, Thomas Boettger2, Niels Galjart3, Downloaded from https://academic.oup.com/nar/article-abstract/46/14/7097/5025896 by Erasmus University Rotterdam user on 22 January 2019 Marco Mernberger4, Marek Bartkuhn1 and Rainer Renkawitz1,*
1Institute for Genetics, Justus-Liebig-University, 35392 Giessen, Germany, 2Department Cardiac Development and Remodelling, Max-Planck-Institute, D61231 Bad Nauheim, Germany, 3Department of Cell Biology and Genetics, Erasmus MC, 3000CA Rotterdam, The Netherlands and 4Institute of Molecular Oncology, Philipps-University Marburg, 35043 Marburg, Germany
Received April 18, 2018; Revised May 14, 2018; Editorial Decision May 15, 2018; Accepted May 23, 2018
ABSTRACT nomic architecture to a functional output such as the reg- ulation of genes through an enhancer or insulator (6). Uti- The two paralogous zinc finger factors CTCF and lizing techniques like 3C (chromatin conformation capture) CTCFL differ in expression such that CTCF is ubiq- and its genome-wide derivatives such as Hi-C, topologically uitously expressed, whereas CTCFL is found during associated domains (TADs) could be identified and CTCF spermatogenesis and in some cancer types in addi- was found to be enriched in the border areas of such do- tion to other cell types. Both factors share the highly mains (7). Disruption of CTCF binding and binding sites conserved DNA binding domain and are bound to leads to changes in TAD patterns and has effects on proper DNA sequences with an identical consensus. In con- gene expression programs (8). Taken together, CTCF is one trast, both factors differ substantially in the number of the central factors in bridging genome architecture to of bound sites in the genome. Here, we addressed function. the molecular features for this binding specificity. In contrast to the established role of CTCF, the cellular role of the only known CTCF-paralogue, CTCFL, remains In contrast to CTCF we found CTCFL highly en- to be solved. CTCFL was identified in 20029 ( ) and is be- riched at ‘open’ chromatin marked by H3K27 acety- lieved to result from a gene duplication event in the early lation, H3K4 di- and trimethylation, H3K79 dimethy- amniotic evolution (10), with CTCF and CTCFL sharing a lation and H3K9 acetylation plus the histone vari- highly conserved 11 zinc finger (ZF) DNA binding domain. ant H2A.Z. CTCFL is enriched at transcriptional The N- and C-termini of the two proteins are different, with start sites and regions bound by transcription fac- an amino acid similarity of <20% between mammalian ver- tors. Consequently, genes deregulated by CTCFL are sions (11). First reports described CTCFL expression to be highly cell specific. In addition to a chromatin-driven testis specific and mutually exclusive to CTCF. Later, more choice of binding sites, we determined nucleotide detailed analysis could show CTCFL to be transiently ex- positions critical for DNA binding by CTCFL, but not pressed during spermatogenesis, prior to the onset of meio- by CTCF. sis, overlapping with CTCF expression (12). Some func- tional differences regarding the two proteins have been iden- tified, for instance it seems that only CTCF binds compo- INTRODUCTION nents of the Cohesin complex like Smc1 in mouse (12)or In recent years the multifunctional and highly conserved RAD21 in human (13). CTCFL also failed to substitute factor CTCF (CCCTC-binding factor) has been identified for a loss of CTCF in CTCF KO experiments (12). Further as a key player in 3D chromatin architecture and gene reg- Knockout experiments of CTCFL showed it to be impor- ulation (1–3). CTCF binds DNA through a combination of tant in proper testicular development. This is exemplified by 11 zinc-fingers from its central DNA binding domain4 ( ). At the deregulation of important testis-specific genes, such as its binding sites it can interact with a variety of co-factors, Gal3st1 and Prss50 (12,14,15). Tissue specificity of CTCFL most importantly the cohesin complex to mediate the for- expression has been questioned (16) by showing a more mation of long-distance DNA interaction and DNA loops widespread expression in normal and in cancer cells. Aber- (5). Such looping events can then link three-dimensional ge- rant expression of CTCFL was identified in some cancers
*To whom correspondence should be addressed. Tel: +49 6419 93 5461; Fax: +49 6419 93 5469; Email: [email protected] Present addresses: Philipp Bergmaier, Institute for Molecular Biology and Tumor Research, Philipps-University Marburg, Marburg, Germany. Sven Dienstbach, Institute for Biology, University Siegen, Siegen, Germany.