ARTICLE

Received 25 Sep 2012 | Accepted 12 Dec 2012 | Published 5 Feb 2013 DOI: 10.1038/ncomms2438 Genome-wide profiles of CtBP link metabolism with genome stability and epithelial reprogramming in breast cancer

Li-Jun Di1, Jung S. Byun1, Madeline M. Wong1, Clay Wakano1, Tara Taylor1, Sven Bilke1, Songjoon Baek2, Kent Hunter3, Howard Yang4, Maxwell Lee4, Cecilia Zvosec5, Galina Khramtsova5, Fan Cheng6, Charles M. Perou6, C. Ryan Miller6, Rachel Raab7, Olufunmilayo I. Olopade5 & Kevin Gardner1

The C-terminal binding (CtBP) is a NADH-dependent transcriptional repressor that links carbohydrate metabolism to epigenetic regulation by recruiting diverse histone-mod- ifying complexes to chromatin. Here global profiling of CtBP in breast cancer cells reveals that it drives epithelial-to-mesenchymal transition, stem cell pathways and genome instability. CtBP expression induces mesenchymal and stem cell-like features, whereas CtBP depletion or caloric restriction reverses repression and increases DNA repair. Multiple members of the CtBP-targeted gene network are selectively downregulated in aggressive breast cancer subtypes. Differential expression of CtBP-targeted predicts poor clinical outcome in breast cancer patients, and elevated levels of CtBP in patient tumours predict shorter median survival. Finally, both CtBP promoter targeting and gene repression can be reversed by small molecule inhibition. These findings define broad roles for CtBP in breast cancer biology and suggest novel chromatin-based strategies for pharmacologic and metabolic intervention in cancer.

1 Genetics Branch, National Cancer Institute, Bethesda, Maryland 20892, USA. 2 Laboratory of Biology and , National Cancer Institute, Bethesda, Maryland 20892, USA. 3 Laboratory of Cancer Biology and Genetics, National Cancer Institute, Bethesda, Maryland 20892, USA. 4 Laboratory of Population Genetics, National Cancer Institute, Bethesda, Maryland 20892, USA. 5 Department of Medicine, University of Chicago, Chicago, Illinois 60637, USA. 6 Lineberger Comprehensive Cancer Care Center, University of North Carolina, Chapel Hill, North Carolina 27599, USA. 7 Leo W. Jenkins Cancer Center, East Carolina University, Greenville, North Carolina 27834, USA. Correspondence and requests for materials should be addressed to K.G. (email: [email protected]).

NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications 1 & 2013 Macmillan Publishers Limited. All rights reserved. ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438

ancer evolves through a multi-step process driven by a diagnostic and prognostic importance. The more primitive global reprogramming of cellular gene expression patterns tumours (for example, basal-like and claudin low) usually show Cthat confers adaptive advantages for tumour growth, a more aggressive behaviour with worse clinical outcome9,10. proliferation and dissemination1. This phenotypic transformation Properties of these tumours include mesenchymal features is accomplished by diverse molecular strategies that control associated with reactivation of embryonic programs that programs of cellular function by directing large-scale changes in promote EMT, acquisition of stem cell-like self-renewal gene expression2. Although our understanding of how specific attributes, increased genome instability, and the production of genetic mutations can act as drivers of cancer is well established, cellular progenitors with the ability to seed new tumours, often the paradigms addressing how epigenetic changes are referred to as ‘tumour-initiating cells’ (TICs)11–13. Such features orchestrated to influence hallmarks of cellular malignancy are are recognized as important hallmarks or drivers of cancer1. only just beginning to evolve3. Epigenetic changes represent We profiled the binding of CtBP across the genome of the potentially reversible covalent modifications to chromatin that human breast cancer cell line MCF-7, a well-differentiated can be transmitted to subsequent generations in the absence of oestrogen receptor positive luminal subtype, using antibodies changes to genetic sequence. In combination with DNA that recognize epitopes common to both CtBP1 and CtBP2 methylation and histone modifications (including acetylation, (Fig. 1). Genome-wide ChIP-seq analysis identified a total of methylation, phosphorylation and ubiquitylation), these covalent 6,607 binding sites for CtBP with a false discovery rate modifications constitute a ‘histone code’ that is sculpted and (FDR)o0.00001. A total of 1,823 of these binding sites were in interpreted by an assortment of chromatin regulatory complexes promoter regions (Table 1). Consistent with the established role that bind (‘read’), place (‘write’) and remove (‘erase’) chromatin for CtBP in animal cell development14, ontology analysis of the marks to create the living ‘libretto’ that we now refer to as the 1,823 gene promoters demonstrates that CtBP interacts with gene ‘epigenome’3. How the spatial and kinetic distribution of these networks that have broad roles in cellular homeostasis including chromatin regulatory complexes are coordinated to influence the cellular macromolecule metabolic processes, RNA processing, epigenome has become the focus of extensive investigation4. gene expression and cellular metabolic processes (Supplementary The C-terminal binding (CtBP1/2) are a dimeric Table S1). However, a large number of CtBP-targeted genes family of proteins encoded by two paralogous genes, CtBP1 and belong to categories that are important in malignant tumour CtBP2, that have extensive roles in animal cell development5. transformation and progression, including embryonic CtBP homo- and heterodimerize in the presence of NADH to development, cellular response to DNA damage, cell cycle, cell recruit various chromatin-modifying complexes, including proliferation, cell death, cell adhesion and chromatin histone methyltransferases (HMTs), histone demethylases modifications (Fig. 1a). CtBP binding sites located outside of (HDMs) and histone deacetylases (HDACs) (for example, promoter regions are also nearby genes that show a similar LSD1, HDAC1/2/4/6/7, G9a and EHMT) to chromatin-bound distribution of these categories (Supplementary Figs S1a–d). In sequence-specific transcription factors5. In this way, CtBP has the particular, many of the CtBP target genes belong to functional potential to link metabolic status to specific changes in the gene categories that have major roles in driving the more epigenetic landscape of the nucleus and have a dominant role in aggressive mesenchymal phenotypes of basal-like and claudin low determining cellular behaviour and fate6,7. However, with the tumours including genome instability, EMT and stem cell-like/ exception of a small set of tumour suppressor genes (for example, TIC pathways10,13 (Fig. 1b). The core list of 30 CtBP bound genes CDH1 (E-Cadherin), CDKN2A (p16), Sirtuin 1 and BRCA1)6,8, in Fig. 1b (10 from each category, each with Z-scores430.17), the genome-wide targets of CtBP in the mammalian nucleus include multiple genes that have been previously shown to have remain unknown. major roles in breast cancer aetiology, genetic susceptibility and Previously, we showed that CtBP repressed the transcriptional tumour progression15,16. It was therefore selected for use in expression of the early-onset breast cancer gene, BRCA1, by subsequent validation studies. recruiting HDAC activity to the BRCA1 promoter to antagonize In silico analysis of consensus binding motifs centred under p300-driven histone acetylation8. In this current work we extend CtBP peaks shows a large enrichment of this observation by profiling the global association of CtBP with binding sites for ETS, CREB, STAT and EGR1/SP1 families of the genome of breast cancer cells by combining chromatin transcription factors (Fig. 1c and Supplementary Fig. S2). immunoprecipitation with deep sequencing (ChIP-Seq) to define Notably, these motifs are overrepresented in the promoters of cellular programs driven by CtBP with clinical importance, and both bidirectional and DNA repair genes including BRCA1, potential for therapeutic targeting. Here we reveal that CtBP has a PALB2, FANCD2, FANCM and RAD51C17,18 (also see Fig. 1b). In prominent role in epigenetic reprogramming that drives major addition, the ETS pathway has been identified as a major hallmarks of cancer through transcriptional mechanisms that are programme in basal-like breast cancers19. This finding suggests both linked to metabolism and susceptible to pharmacologic that CtBP recruitment is likely to be coordinated through a intervention. common promoter context to control specific cellular programs. CtBP is often found in complexes with the HDM, LSD1, where it has a broad role in both repressive and activating Results transcriptional programs in various cell types20–22. In breast CtBP targets cellular reprograming and genome stability. epithelia, LSD1 represses invasion and metastasis23. However, Recent molecular and morphological studies have shown that comparison of the targets of CtBP and LSD1 in MCF-7 and most breast cancers can be separated into distinct subtypes that human ES cells reveals less than a 7% and 10% overlap, segregate along the hierarchy of normal mammary epithelial respectively (Fig. 1d). This indicates that the major functions of differentiation and development, and include luminal A, luminal CtBP in epigenetic regulation are likely to involve complexes that B, human epithelial growth factor receptor 2 (HER2) positive, are distinct from LSD1 targeting in mammary epithelial cells. basal-like and claudin low9,10. Luminal A and B are well- differentiated tumours and usually oestrogen receptor positive. The basal-like and claudin low subtypes are much more primitive CtBP promotes EMT and enhances TIC traits. Though CtBP and usually deficient in receptors for oestrogen and progesterone predominantly produces repression of target genes, both the and HER2 (refs 9,10). This classification has substantial dimeric state of CtBP and composition of the CtBP-containing

2 NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications & 2013 Macmillan Publishers Limited. All rights reserved. NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438 ARTICLE

Genome stability EMT Stem/TIC pathways 2 kb 18 80 1 kb 20 1 kb HES1 1.6 CtBP FAN1 NME1

nsity 0 0 0 1.4 Chromatin FAN1 NME1 HES1 Cellular response 1 kb 1 kb 2 kb modification (54) 25 15 CLDN4 31 OVOL2 1.2 to DNA damage (86) FANCD2 1.0 Embryonic 0 CIDECP 0 1 tive tag de DNA repair (56) FANCD2 CLDN4 OVOL2 0.8 development (56) 2 kb FOXA1 2 kb Rela 17 2 kb 16 23 0.6 FANCM CST6 –20000 2000 Cell cycle (141) 0 1 1 FKBP3 CST6 FOXA1 FANCM Distance from TSS 20 67 2 kb 15 2 kb GATA3 Gene expression Cell adhesion (48) 2 kb PARD6B (533) PALB2 ETS family (GABP/ELK) 0 2 0 PALB2 PARD6B FLJ45983 DCTN5 GATA3 Cell proliferation 2 kb 1 kb CREB family (CREB/ATF1) Developmental 60 2 kb 22 12 DKK1 process (101) RAD51C ANXA3 STAT family (STAT1/5A/5B) (294) 0 1 0 Cell death (157) TEX14 ANXA3 LOC100506939 DKK1 RAD51C EGR1 SP1 SP3 SP4 Promoter (–2500 to +2500 bp) 13 2 kb 150 1 kb 38 2 kb RARG CtBP bound (1823) MUS81 GRHL2 1 0 1 Embryonic Cellular response Chromatin CFL1 GRHL2 RARG development to DNA damage modification MUS81 11 2 kb 5 kb THAP10 15 2 kb 20 KRT18 CtBP-bound genes FGF9 MSH5 SMARCD2 ERCC5 1 CTNNB1 RAD51L1 PRMT5 THAP10 1823 0 1 LRRC49 CEBPB BRCA1 SIRT1 ERCC5 KRT18 1 kb 30 2 kb ADIPOQ MDC1 SETDB1 15 2 kb 24 CEBPB 1534 HOPX RAD18 SMARCA5 ATR CDS1 1 107 175 LMX1B XRCC4 ARID1A 0 1 CEBPB MAFB RFC3 CHD8 ATR CDS1 1 kb 7 2 kb 2 kb 41 OAZ3 AXIN2 RAD51AP1 KDM3B 20 30 1591117 1566 LIMS1 XRCC1 MYST3 XRCC5 CLDN9 0 0 1 BMP7 EXO1 MLL XRCC5 CLDN9 MRPL9 1822 1865 1 kb OAZ3 70 5 kb 30 2 kb 14 AMOTL2 LSD1-bound genes LSD1-bound genes BRIP1 ZNF165 MCF-7 ES cells 1 21,23 22 0 BRIP1 ZNF165 0 (Wang et al ) (Adamo et al ) INTS2 AMOTL2

Figure 1 | Global targeting of genome stability and developmental pathways by CtBP differentiation. (a) analysis indicates that CtBP targets numerous cancer-related pathways linked to DNA repair, chromatin modifications, cell adhesion and other pathways important in developmental processes. (b) ChIP-Seq profiles of the binding of CtBP to selected genes that are key genetic drivers of human malignancy including genome stability, epithelial-to-mesenchymal transition (EMT) and stem cell/tumour initiation cell (TIC) pathways in MCF-7 cells. Red indicates genes encoded 50–30 on the upper strand, whereas green indicates genes encoded 50–30 on the lower strand. (c) Binding motifs enriched under CtBP ChIP-seq peaks revealed by in silico analysis. (d) The large majority of promoter binding sites for CtBP are distinct from those bound by LSD1 in MCF-7 and human ES cells.

Table 1 | Genome-wide binding sites for CtBP. size of the overlap is likely a reflection of direct and indirect influences of CtBP depletion in combination with the insensitivity Number of sites and low dynamic range of hybridization-based array Promoter (o2.5 kb of TSS) 1,823 technology27. Therefore, to generate a more accurate view of Downstream peaks 916 the relationship between CtBP occupancy and gene expression, a Distal upstream peaks 1,559 total of 71 genes (30 genes from Fig. 1b and 41 additional genes Intron 2,056 collected from gene categories described in Fig. 1; in total Exon 253 representing 26% of the EMT overlap and 38% of the TIC overlap TSS, transcription start site. in Fig. 2a) were selected for validation by quantitative mRNA Binding site distribution of CtBP in MCF-7 cells with Z-scoreZ30. expression (quantitative reverse transcription–PCR) and quantitative ChIP. Both cells depleted of CtBP by RNAi and cells overexpressing CtBP were analysed and compared (Fig. 2c complex are major determinants of whether CtBP will repress or and Supplementary Figs S3–S6). By this analysis, 56% of the 21,24,25 induce gene expression . An analysis of the relative validation genes showed significant (Po0.05) upregulation enrichment of CtBP-targeted genes in molecular signatures of following CtBP depletion, whereas 14% showed downregulation EMT26 shows a significant (P ¼ 9.5 E-11) overlap with genes that (Supplementary Figs S4–S6). Conversely, 46% of genes in cells are differentially repressed (downregulated) compared with overexpressing CtBP showed significant repression (Po0.05), activated (upregulated) during EMT (Fig. 2a). Similarly, CtBP- whereas 15% showed upregulation. Hundred percentage of genes bound genes are significantly more enriched (P ¼ 7.37E-10) in the tested by quantitative ChIP showed a significant peak (Po0.05) genes that are differentially repressed in the cancer TIC/Stem cell and 90% (27 of 30) genes showed a significant decrease (Po0.05) signature12 (Fig. 2a). These findings implicate a predominant role of CtBP binding following CtBP gene depletion by RNAi for CtBP in driving both EMT and stem cell-like attributes (Supplementary Figs S3 and S6). These data show that many of through transcriptional repression. the genes identified by ChIP-Seq analysis are likely to be bona fide To obtain a general impression of the correlation between functional targets of CtBP. CtBP promoter occupancy and gene expression, microarray analysis was used to compare gene expression patterns in control versus MCF-7 cells that had been depleted of CtBP by RNAi Global depletion of CtBP increases DNA repair. CtBP-bound (Fig. 2b). This screen identified 1,585 genes that showed sequences are enriched in transcription factor binding sites found significant (Po0.05) upregulation and 1,248 genes that showed in the promoters of genes involved in DNA repair (Fig. 1b,c). downregulation by either direct or indirect CtBP influence. Using Many of these are derepressed following CtBP depletion (Fig. 2c the FDR cutoff described above, 179 of the upregulated and 100 and Supplementary Fig. S4). These findings, in combination with of the downregulated genes were identified as CtBP targets by the previously reported repressive effects of CtBP on BRCA1 ChIP-Seq. Although the specific functional distribution of expression8, suggests that CtBP levels may have a strong influence differentially expressed gene classes is similar to the ChIP-Seq on DNA repair. To test this, comet assays were performed on distribution shown in Fig. 1a (Supplementary Fig. S7), the modest MCF-7 cells exposed to oxidative DNA damage before or after

NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications 3 & 2013 Macmillan Publishers Limited. All rights reserved. ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438

Ctrl CtBP RNAi EMT signature Ctrl CtBP Ctrl KD CtBP KD ZNF165 TOB2 Downregulated RASEF ATR Untxd CEBPB EMT P =9.5 E–11 CtBP- Downregulated RARG Control RNAi CLDN9 downregulated bound genes CtBP-RNAi RSF1 RAD9A 161 1792 1823 ZAP70 26 130 31 SMAD6 (Loboda et al ) AGR2 H O IGHMBP 2 2 NARAS

ression MUS81 T-PCR PRLR BNIP1 Upregulated SMARCA5 H O GATA3 2 2 –0.60 BUB3 +

its EMT P =0.999 CtBP- ression FAN1 VAV3 upregulated bound genes CLDN4 2 h recovery un PLS1

2

mRNA, qR

149 1819 1823 0.00 145 4 Relative exp SMAD3 26 KRT18 (Loboda et al ) BPNT1 Log CLN3 Upregulated MAT2B

0.60 TMPRSS4 Comet assay CDS1 Relative exp MLL FGF9 400 RAD51C Cancer TIC/ XRCC5 WWP1 Stem cell signature MED6 ANAPC5 P -value DKK1 –9 Downregulated PALB2 300 =6.0 x 10 CtBP-bound genes ‘CtBP CD44 OVOL2

P =7.37 E–010 CtBP- repressed’ GRHL2 il moment Cancer TIC/Stem cell 1823 NME1 * bound genes HES1 downregulated ERCC5 200 223 1773 1823 GADD45B 50 1544 FANCD2 273 CDH17 12 CTNNB1 (Creighton et al ) NDC80 179 100 BRIP1 CDH1 Upregulated CD24 100 SIRT1

CtBP- 1406 1148 XRCC4 Comet assay ta Cancer TIC/ Stem cell P =0.071 TBL1X bound genes THAP10 upregulated 1585 1248 CLDN7 1823 FANCM 142 1809 AMOTL2 0 128 14 CtBP KD CtBP KD SETDB1 12 BIRC2 Ctrl RNAi + – + – + – (Creighton et al ) upregulated downregulated PARD6B PROM2 CtBP RNAi – + – + – + (P <0.05) (P <0.05) MYST3 FOXA1 OAZ3 MRPL13 CtrlH2O2 H2O2 ANXA3 CST6 recovery

Figure 2 | CtBP downregulated targets control differentiation and DNA repair. (a) Venn diagrams showing the overlap between CtBP targets and genes downregulated and upregulated in established EMT and cancer tumour-initiating cell (TIC)/stem cell gene signatures. P-values indicate the significance of overlap determined from hypergeometric distribution analysis. (b) Unsupervised hierarchical clustering of microarray analysis (HGU133plus) of differentially expressed (Po0.05) genes in control and MCF-7 cells depleted of CtBP by RNAi. Venn diagram shows the overlap of repressed and induced genes with CtBP targets identified by ChIP-Seq. (c) Hierarchical clustering of the expression of multiple CtBP targets belonging to the gene classes, described in Fig. 1, measured by quantitative real-time PCR in cells overexpressing CtBP (CtBP) and cells depleted of CtBP by RNAi (CtBP KD) compared with GFP control and non-targeting RNAi, respectively. Bracket indicates CtBP repressed genes that were uniformly upregulated by CtBP depletion and repressed by CtBP overexpression. Corresponding expression values with error bars are provided in Supplementary Figs S4–S6 (d) Comet assay of MCF-7 cells before and after oxidative DNA damage by peroxide treatment and following 2 h recovery in control cells and cells depleted of CtBP by RNAi. (e) Analysis of change in tail moment in control and CtBP-depleted cells following DNA damage and recovery. The error bars represent the s.d. of the mean from two independent experiments. Scale bar, 20 mm.

CtBP gene depletion by RNAi (Fig. 2d). Analysis reveals that cells expression8. However, in the absence of direct empirical depleted of CtBP show significantly increased DNA repair information or further study of the specific forms of CtBP (P ¼ 6.0 E-09) compared with control cells (Fig. 2e). In complexes in different cell types or knowledge of the differential contrast, gene depletion of BRCA1 by RNAi has the opposite promoter recruitment of histone acetyl-, HDACs, effect (decreased DNA repair), though with lower relative HMTs and HDMs, it is not possible to readily predict what significance (P ¼ 1.50 E-05) (Supplementary Fig. S8). These modifications will be altered and in what direction at each findings establish a substantial role for CtBP in governing individual CtBP target. Regardless, CtBP gene depletion produces transcriptional programs that control genome stability. substantial changes in both H3 and H4 histone acetylation at most of the 30 CtBP gene targets (Fig. 3c and Supplementary Fig. S9). A well-characterized attribute of the acquisition of CtBP drives acquisition of mesenchymal traits. To assess the mesenchymal features and EMT is increasing vimentin functional influence of CtBP on the acquisition of mesenchymal expression accompanied by decreasing E-cadherin expression11. traits, we compared the effect of CtBP expression on the prop- To profile the influence of CtBP on these properties, the change erties of two cell lines at opposite poles of the hierarchy of in the E-cadherin/Vimentin ratio28 was measured (Fig. 3d). In mammary differentiation (Fig. 3). MCF-7 cells serve as a repre- both MCF-7 cells and MDA-MB-231 cells, CtBP gene depletion sentative of luminal differentiation, whereas MDA-MB-231, an increased the ratio of E-cadherin/Vimentin, whereas CtBP oestrogen receptor negative and highly metastatic cell line, is expression lowered it, consistent with the ability of CtBP to representative of the claudin low subtype. In both cell lines, CtBP drive the mesenchymal phenotype in both mesenchymal and depletion induces derepression of most of the 30 CtBP-targeted luminal cells. Furthermore, the role of CtBP in driving genes in Fig. 1b; however, gene depletion seems to derepress a mesenchymal features is well illustrated by the ability of substantially larger portion of the MDA-MB-231 cells than the enforced CtBP expression to substantially increase MCF-7 MCF-7 (80% versus 53%) (Fig. 3a,b and Supplementary Fig. S4). mobility in wound-healing assays (Fig. 3e). In contrast, CtBP overexpression has a more substantial influence EMT has recently been shown to activate programs that on the repression of the 30 CtBP targets in MCF-7 compared with promote the acquisition of stem cell-like properties11. This often MDA-MB-231 (43% versus 6%) (Fig. 3a,b and Supplementary occurs in progenitor cells with increases in CD44 as opposed to Fig. S5). CtBP is associated with a variety of chromatin-modifying CD24 expression11. The influence of CtBP on stem-like features complexes5. At the BRCA1 promoter, loss of CtBP results in of MCF-7 and MDA-MB-231 was measured by profiling changes increased BRCA1 promoter acetylation and increased BRCA1 in the CD44/CD24 ratio10 following enforced CtBP expression or

4 NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications & 2013 Macmillan Publishers Limited. All rights reserved. NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438 ARTICLE

MCF-7 2.5 FANCM 10 CST6 HES1 ** ** 1.6 MCF-7 2.0 ** 8 IgG H4Ac 1.2 1.5 6 CST6 T-PCR) 0.016 FANCM HES1 * 0.20 * MCF-7 MDA-MB-231 1.0 4 0.8 0.03 0.012 0.15 3.0 5 * MCF7 MB-231 * 0.5 2 0.4 0.02 0.008 0.10 * mRNA (qR * 2.5

Ctrl

OE Ctrl KD OE 0.0 0 0.0 KD 4 ANXA3 0.004 0.01 * 0.05 BRIP1 ** GRHL2 OVOL2 FANCD2 1.6 to input) 2.0 1.2 1.2 FANCM 0.000 0.00 0.00 BRIP1 3 1.2 BRIP1 GRHL2 OVOL2

GRHL2 c ChIP signal T-PCR) ** ** 1.5 0.8 ** 0.8 * CLDN9 0.025 0.016 * 0.06 0.8 CST6 H4A 2

(Normalized 0.012 ERCC5 0.015

E-cad/Vim ratio 0.4 0.4 0.04 E-cad/Vim ratio 1.0 0.4 CLDN4 0.008

mRNA (qR PARD6B 0.005 0.02 FAN1 0.004 * 1 0.0 0.0 0.0 0.5 XRCC5 * CtBP – ++––––– – + – – MUS81 0.00 0.00 0.000

units CtBP KD–––+++ ++ – – – +

CtBP KD ––––– ++– – –– + 2 FOXA1 0.0 0 AMOTL2 MDA-MB-231

MDA-MB-231 Log PALB2 CtBP – + – – CtBP – + – – RAD51C IgG H4Ac 12 FANCM 14 CST66 HES1 CtBP KD – ––– + CtBP KD – – + ** OVOL2 FANCM CST6 HES1 ** 0.20 10 12 5 ** HES1 ** GATA3 10 0.02 0.20 * 8 4 –2.32 0.00 2.32 CDS1 0.15 T-PCR) 8 ZNF165 6 3 MCF-7 MCF-7 + CtBP1 6 ATR 0.10 * 4 CEBPB 0.01 0.10 4 2 RARG 0.05 2 1 mRNA (qR 2 NME1 * KRT18 0 to input) 0.00 0.00 0.00 0 h 0 0 DKK1 5 hIP signal BRIP1 GRHL2 OVOL2 10 BRIP1 ** 10 GRHL2 OVOL2 THAP10 ** ** OAZ3 0.08 0.008 8 8 4 ** Gene expression 0.02 H4Ac C 0.06 6 6 3 (qRT-PCR) (Normalized 0.04 0.004 32 h 0.01

(qRT-PCR) 4 4 2 0.02 2 2 1 0.00 0.00 0.000 Wound healing mRNA * CtBP KD––+ ++ – – + ––+ + 0 0 0 CtBP –––++––––+ – – CtBP KD –––––––++– – +

MCF-7 MDA-MB-231 CD44/CD24 CD44/CD24 1.4 1.2 1.6 1.2 1.2 1.0 1.4 * * 1.0 atio 1.0 1.2 0.8 atio 0.8 1.0 0.8 * 0.6 0.6 0.8 0.6 0.4 0.6 0.4 0.4

CD44/CD24 r 0.4 0.2 CD44/CD24 r 0.2 0.2 0.2 0 0 0 0 CtBP – +–– CtBP – +–– CtBP KD –– – + CtBP KD –– – + Figure 3 | CtBP drives acquisition of mesenchymal traits in mammary epithelial cells. (a) Gene expression pattern of CtBP target genes controlling genome stability, EMTand stem cell pathways in MCF-7 cells (top) and MDA-MB-231 cells (bottom) overexpressing CtBP or depleted of CtBP by RNAi as indicated. (b) Unsupervised hierarchical clustering comparing of the 30 gene validation set expression (see Fig. 1b) in cells overexpressing or depleted of CtBP. Expression values with error bars are shown in Supplementary Figs S4,S5. (c) Promoter Histone H4 acetylation profile (K5, K8, K12, K16) of genes shown in a in MCF-7 cells (top) and MDA-MB-231 cells (bottom) following CtBP depletion by RNAi. Expression values and errors bars including the remaining 30 gene validation set are provided in Supplementary Fig. S9. (d) E-cadherin/Vimentin expression ratio in cells overexpressing CtBP or depleted of CtBP by RNAi. (e) Wound-healing assay shows MCF-7 control cells and cells overexpressing CtBP1. Vertical line indicates centre of wound in the scratch assay. White bar, 250 mm. (f) CD44/CD24 expression ratio in MCF-7 and MDA-MB-231 cells overexpressing CtBP or depleted of CtBP by RNAi. The error bars represent the s.d. of the mean from three independent experiments (a,d,f) or two independent experiments (c). *indicates Po0.05 and **indicates Po0.01.

CtBP gene depletion (Fig. 3f). In both cell lines, the expression of cells grown under conditions of high glucose (Fig. 4a–c). These CtBP increased the CD44/CD24 ratio consistent with the changes are, in turn, associated with increased nuclear levels of attributes of cells with progenitor/stem cell-like features, BRCA1 protein, decreased levels of CtBP loading at the BRCA1 whereas CtBP depletion decreased that ratio (Fig. 3f). Thus, promoter, compensatory increases in relative histone 4 CtBP appears to be able to drive the mesenchymal phenotype in acetylation8 (Fig. 4c–e and Supplementary Fig. S10a) and a mammary cells regardless of what position they are along the significant increase in the expression of BRCA1 mRNA and other spectrum of mammary differentiation. CtBP-targeted genes important in DNA repair (Fig. 4f and Supplementary Fig. S10b). Finally, as predicted, cells incubated under high glucose condition show a demonstrably reduced DNA CtBP links cellular metabolic status to genome stability. repair capacity, which is not due to differences in cell cycle entry Pharmacological manipulation of endogenous NADH levels (Fig. 4g,h and Supplementary Fig. S10c). influences BRCA1 expression through CtBP, with higher levels of NADH causing BRCA1 repression8. To ask whether manipulation of endogenous NADH levels by carbohydrate over-loading could CtBP gene networks distinguish aggressive breast cancer. The influence expression of CtBP targets, we grew MCF-7 cells in high embryonic properties linked to EMT, including cellular plasticity, or ‘diabetic’ levels of glucose (450 mg dl À 1) versus normal dedifferentiation, deregulated cell growth and genome instability, concentration (100 mg dl À 1) (Fig. 4). CtBP dimerization, are common features associated with more aggressive molecular nuclear localization, and stability are enhanced when bound to subtypes of breast cancer30,31. To ask whether CtBP target genes NADH7,29. As demonstrated by both immunohistochemistry and define networks that are more associated with aggressive subtypes western blot analysis, MCF-7 cells grown in low levels of glucose, of breast cancer, we profiled the expression of CtBP target genes demonstrate decreased levels of NADH relative to NAD þ , and in publically available breast cancer patient gene expression data show lower nuclear accumulation of CtBP in comparison with sets (Fig. 5a). Analysis by unsupervised hierarchical clustering

NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications 5 & 2013 Macmillan Publishers Limited. All rights reserved. ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438

1a 1b 2 24 BRCA1 High glucose MCF-7 MCF-7 Low glucose High glucose Low glucose –1 kb 0 kb +16 kb +82 kb 4.0 0.035

Low glucose + I glucose Low glucose High High glucose

o 0.030

3.0 t) Low glucose 203- α-BRCA1 0.025 0.020 2.0 52-

/NADH rati

Relative + α-CtBP2 0.015 CtBP ChIP 1.0 0.010

NAD

ChIP enrichment 38- (relative to inpu 0 α-NPM1 0.005 0.000 –1 kb 0 kb +16 kb +82 kb

–3 –10 12 0.710 32 62 82 83 84 85 kb a bdcefijg h k l BRCA1 BRIP1 2.0 MCF-7 MCF-7 P = 0.0021 2.0 4 BRCA1 IR ** High glucose Low glucose * 2.5 1.5 1.5 3 Low glucose epair

8S rRNA) 0 h

4) * High glucose recovery 2.0 1.0 1.0 2

tylation

relative expression 3 h and total H 1.5 0.5 0.5 1 recovery (foci change rate) *P =0.003 Relative DNA r

(normalized to 1

qRT-PCR 0.0 0 1.0 **P =0.001 0.0 6 h Low Low Low Low Low High High Glucose Glucose recovery Glucose High 0.5 Insulin – – + Insulin – – + DNA repair

Relative ChIP H4 ace DNA repair

(normalize to input 0.0 abce d fijklg h

Figure 4 | Calorie restriction decreases CtBP activity and increases DNA repair. (a) CtBP immunohistochemical staining of MCF-7 cells grown in high glucose (4.5 g l À 1) and low glucose (1.0 g l À 1). Scale bar, 25 mm. Inset is two-fold magnification. (b) NAD þ/NADH ratio in lysates of MCF-7 cells grown in high and low glucose. (c) Western blot analysis of BRCA1 and CtBP2 expression in high and low glucose-treated cells. NPM1 was used as loading control. I, insulin. (d) ChIP profile of CtBP binding to the BRCA1 promoter in cells grown in high glucose versus low glucose as indicated. (e) ChIP profile of relative histone 4 lysine acetylation after normalization to ChIP for total histone 4. (f) Quantitative reverse transcription–PCR expression of BRCA1 and BRIP1 mRNA in high and low glucose-treated cells with and without insulin. (g) Phospho-gamma H2AX foci profile of cells following ionizing radiation at 0, 3 and 6 h. Scale bar, 10 mm. (h) Relative rate of DNA repair expressed as relative rate of decrease in foci per cell over time. The error bars represent the s.d. of the mean from three independent experiments (b,d,f,h) or five independent experiments (e).

identified a large class or cluster of CtBP-targeted genes that are High CtBP predicts poor survival in breast cancer patients. The selectively downregulated in the basal-like and claudin low data presented thus far indicate that CtBP is likely to have a subtype of cancers (Fig. 5a). Moreover, ANOVA (analysis of substantial role in the aetiology and progression of human breast variance) analysis10 of expression of the CtBP-targeted gene cancer. To examine CtBP expression in patient tissues, tumour categories (Fig. 1b), shows that downregulation of many of the samples from breast cancer patients were stained for CtBP pro- genes within the EMT and Stem Cell/TIC categories significantly tein expression by immunohistochemistry using antibodies distinguish (P-values between 1.08 E-11 and 1.17 E-140) basal- against CtBP (Fig. 6a). In normal breast CtBP, nuclear immu- like and claudin low from the other subtypes (Fig. 5b and noreactivity is generally light and non-uniform with many nuclei Supplementary Fig. S11). GRHL2 has recently been shown to showing little or no CtBP staining, whereas in patients with basal- have a dominant role in EMT by regulating cell polarity and is a like, triple-negative breast cancer, CtBP1 staining is much more strong discriminator of claudin low subtypes32,33. FOXA1 intense (Fig. 6a). When digitally scored for CtBP nuclear staining potently distinguishes basal-like and claudin low from the more to measure percent of nuclei with scores of 0–3 (nuclear intensity) luminal subtypes and has recently been shown to actively repress or a score weighted by nuclear size (nuclear score), triple-negative the basal-like phenotype34,35 (Figs 1b and 5b and Supplementary breast cancer shows a nuclear intensity and nuclear score that is Fig. S11). Similarly, gene set enrichment analysis36 of the genes 25 and 22 times higher than normal breast, respectively (Fig. 6a). altered by CtBP RNAi depletion using microarray analysis This system was then used to score the patient tissue cohort (Fig. 2b) also reveals substantial CtBP-dependent participation (Fig. 6b). When this scoring system was used to segregate patient in multiple pathways important in breast cancer biology samples into three groups of low (nuclear score o100; nuclear (Supplementary Figs S12a–g)37. Moreover, the clinical relevance intensity o2), medium (nuclear score 100–300; nuclear intensity of the CtBP-targeted gene list is further supported by analysis of 2–5) and high (nuclear score 4300; nuclear intensity 45) CtBP two independent breast cancer gene expression studies revealing score, there was a clear trend showing an inverse relationship that patients, whose tumours can be classified as showing between CtBP staining and patient median survival by Kaplan– differential expression of CtBP-target genes, have significantly Meier analysis (Fig. 6b). This suggests that elevated CtBP has shorter metastasis-free survival by Kaplan–Meier analysis potential as a surrogate biomarker for altered epigenetic regula- (Fig. 5c). tion in breast cancer patients who may progress to more

6 NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications & 2013 Macmillan Publishers Limited. All rights reserved. NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438 ARTICLE

low

l A/B BRIP1 P =2.58 e-16 GSE11121 1.0 2 y HR= 2.695

Normal Basal\claudin

HER2 Lumena 0.8 0

babilit –2 0.6 0.4 Downregulated High risk n =96

LB

Ba

LA

H2

CL genes NL 0.2 Low risk n =104

GRHL2 P =2.54 e-28 Survival pro 0.0 P =0.001432 3 2 0 50 100 150 200 1 Months NME1 0 CST6 –1 GSE2034 PARD6 –2 1.0 –3 y HR= 2.08 ‘CtBP- GRHL2 targeted 0.8

Ba LA LB

NL H2 CLDN4 CL genes’ CDS1 OVOL2 P =9.23 e-53 0.6 KRT18

l probabilit ZNF165 2 0.4 High risk n =144 0 Low risk n =142 PALB2 0.2

–2 Surviva P =0.0001681 0.0 HES1 –4 OVOL2 0 50 100 150 200 GATA3 Months

LA Ba LB

H2

NL FOXA1 CL

Figure 5 | CtBP targets distinguish clinically aggressive subtypes of breast cancer. (a) Unsupervised hierarchical clustering of CtBP target genes in publically available breast cancer gene expression studies (UNC337). Highlighted rectangle indicates group of CtBP-bound genes that are downregulated/ repressed preferentially in the basal-like subtype. (b) ANOVA profiling of one representative each of the genome stability (BRIP1), EMT (GRHL2) and TIC/ Stem cell (OVOL2) gene groups, described in Fig. 1b, using the UNC337 patient gene expression data set. P-values were calculated by comparing mean expression across all subtypes. (c) Patients whose tumours differentially express CtBP-bound genes show worse clinical outcome (metastasis-free survival) by Kaplan–Meier analysis. Patient gene expression data obtained from the Gene Expression Omnibus (GSE11121 and GSE2034) were separated into two groups based on differential expression CtBP target genes (red), and were analysed by Kaplan–Meier analysis. The 95% confidence interval for median hazard ratio (HR) ¼ 2.08 is (1.420, 3.070). The 95% confidence interval for median HR ¼ 2.70 is (1.490, 5.411). P-values and HRs were derived as described in Methods. advanced disease, although the prognostic value of CtBP will displacement (Po0.05) of CtBP from 67% of the promoters in require further independent validation. MCF-7 and only 30% from promoters in MDA-MB-231 (Fig. 7d and Supplementary Fig. S14). The lower MTOB-induced CtBP displacement in MDA-MB-231 could be due to the lower level of Reversal of CtBP function by small molecule inhibition. CtBP CtBP binding found at these genes (Supplementary Fig. S3). This is a potent epigenetic regulator that responds to cellular meta- could explain, in part, why the concordance between changes in bolism through its interaction with NADH. Therefore, pharma- CtBP occupancy and gene expression is significantly higher for cological targeting of CtBP may, in principle, provide a means of MCF-7 (470%) compared with MDA-MB-231 (50%) following derepressing its transcriptional targets. Though CtBP is a member MTOB treatment (Fig. 7a,d and Supplementary Figs S13–S15). of the d-2-hydroxyacid dehydrogenase family, its true substrate is Nonetheless, these data indicate that the predominant mode of not known38. Recent studies indicate that 2-Keto-4- MTOB action is through its eviction of CtBP from occupied methylthiobutyrate (MTOB), an intermediate in the methionine promoter regions. Finally, treatment with MTOB antagonizes the salvage pathway, can bind CtBP and reverse repression of the mesenchymal phenotype (Fig. 7e,f). Addition of MTOB to both proapoptotic gene, BIK, in colon cancer cells39,40. To test whether MCF-7 and MDA-MD-231 increases the pro-epithelial MTOB could disrupt expression of CtBP target genes in breast E-cadherin/Vimentin ratio while reducing the pro-mesenchymal cancer, both MCF-7 and MDA-MB-231 were incubated in the CD44/CD24 ratio, with a more significant trend (Po0.05) in presence and absence of MTOB and the 30 CtBP-targeted genes MCF-7 (Fig. 7e,f). Taken all together, these findings provide (Fig. 1b) were screened for changes in gene expression (Fig. 7a,b substantial evidence that pharmacological targeting of CtBP to and Supplementary Fig. S13) and promoter occupancy (Fig. 7d,e disrupt malignant cellular reprogramming may be a feasible and Supplementary Figs S14,S15). MTOB treatment caused ‘epigenetic strategy’ for therapeutic intervention. significant derepression (Po0.05) of 40% of these genes in MCF-7 and 46% in MBA-MD-231. Approximately 3% and 10% of genes, respectively, were repressed (Fig. 7a,b and Discussion Supplementary Fig. 13). The concordance of the MTOB effect The evidence of a link between obesity and diabetes and increased between the two cell lines was 70% (21/30), indicating that mortality from breast cancer is incontrovertible41–44.An MTOB action is relatively independent of breast cancer subtype important feature of the dysfunctional energetics associated and epithelial programming (Fig. 7b and Supplementary Fig. with obesity and diabetes and malignant transformation, is S13). However, though it is difficult to know the extent to which elevated carbohydrate metabolism, a central component of the this derepression is due to direct targeting of CtBP occupancy or Warburg effect45,46. This elevated level of carbohydrate to indirect effects; incubation with MTOB caused a significant metabolism, whether due to the over-nutrition of obesity or the

NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications 7 & 2013 Macmillan Publishers Limited. All rights reserved. ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438

Breast cancer patients Normal gland Breast cancer Survival 100 CtBP nuc score <100 CtBP nuc score 100–300 Nuclear CtBP nuc score >300 intensity 3+ 2+ 1+ 0+ 50

Median survival 140 mos Median survival 135 mos Median survival 116 mos 0 0 50 100 150

Survival 100 CtBP nuc intensity <2 CtBP nuc intensity 2–5 CtBP nuc intensity >5

50

Nuclear intensity = 0.857 Nuclear intensity = 21.824 Percent survival Percent survival Median survival 164 mos Nuclear score = 44.003 Nuclear score = 1009.975 Median survival 135 mos Median survival 64 mos α -CtBP1 immunohistochemistry 0 0 50 100 150 Months

Figure 6 | Elevated CtBP protein in patient breast cancers predicts lower median survival. (a) CtBP1 immunohistochemical staining of normal breast tissue and tissue from triple-negative breast cancer patients, analysed for staining intensity using the Aperio nuclear algorithm software and then scored for nuclear intensity and nuclear score (weighted by nuclear area). Scale bar, 25 mm. (b) Relative survival of a breast cancer patient cohort (98) scored with low (o100 or o2.0), median (100–300 or 2–5) and high (4300 or45) scores for CtBP staining analysed for survival trends by Kaplan–Meier analysis. Median survival for all three groups are shown. Cox regression analysis of hazard ratio for nuclear intensity 45 is 2.10 (P-value ¼ 0.048) after adjusting for age.

Warburg effect of cancer bioenergetics, results in increased levels consistent with a role CtBP-regulated pathways had in such of NADH47–49. In this study, we propose CtBP is a key tumours. Notably, decreased expression specifically of DNA downstream epigenetic effector of elevated NADH. Therefore repair proteins is associated with shortened time to recurrence in through CtBP, changes in cellular metabolic status can drive triple-negative breast cancer52. This is consistent with the genome-wide changes in chromatin through targeted recruitment demonstration, in this current study, of the impact of CtBP of CtBP that facilitates the acquisition of epigenetically targeting on genome stability. Nearly one third of the Fanconi reprogrammed properties that promote genome instability, Anaemia complementation group is targeted by CtBP. Therefore, dedifferentiation and the transformation to a more it is not surprising that loss of CtBP expression or function results mesenchymal phenotype. in a significant improvement in DNA repair in breast cancer cell Though this study provides one of the first to profile the lines (see Figs 1d and 2d,e). Most importantly, targeting by CtBP binding of CtBP throughout the mammalian genome, how, when suggests that these hereditary risk factors for breast cancer may be and where the different CtBP complexes target and coordinate worsened by lifestyle factors influencing metabolic imbalance. the recruitment of specific chromatin modifiers, and their Finally, many of the new driver mutations identified by recent subsequent effect on the epigenome remain to be defined. These systematic sequencing of cancer genomes has uncovered several epigenetic networks and programs are likely to differ by cellular genes with functional roles in epigenetic regulation53. CtBP process and cell type and are likely to reflect a hierarchy of CtBP represents a novel class of versatile, multi-potent epigenetic complexes formed under specific cellular conditions and regulators that is likely to have many different roles in cancer environments as we have seen in comparing the mesenchymal aetiology and progression. The finding that MTOB can act as a MDA-MB-231 cell line with the luminal MCF-7 (Figs 3 and 7). small molecular inhibitor that can reverse genomic targeting by This difference has been demonstrated in prior studies where loss CtBP, provides a proof of principle that pharmacological of CtBP had a much great effect on mitotic fidelity in MDA-MB- manipulation of CtBP is feasible. Thus, epigenetic targeting 231 than MCF-750. Future studies to correlate global alterations through CtBP promises to be a new and exciting area of future in histone and DNA modification with changes in CtBP levels therapeutic intervention. New efforts will have to be directed at (via either genetic or metabolic disruption) in multiple breast finding compounds that will function in the nanomolar to cancer subtypes will be necessary to better define the mechanism micromolar range. Given that weight gain and obesity represent underlying these differences. modifiable cancer risk factors linked to lifestyle, a better Approximately 5–10% of breast cancers are secondary to understanding of CtBP will fuel new ideas and creative inherited mutations of cancer predisposing genes. It is striking strategies for combined behavioural and therapeutic approaches that, of the known and newly identified breast cancer predispos- to cancer treatment and prevention. ing genetic mutations, a substantial number are targeted for repression by CtBP, including PALB2, BRIP1, RAD51C and Methods BRCA1 (ref. 16). Thus, the observation that many patients Reagents. Hydrogen peroxide is from Invitrogen as 30% stock. MTOB (4- develop breast cancers with features of inherited disease without methylthio-2-oxobutyric Acid ) is from Sigma-Aldrich and was dissolved in demonstrating mutation in genes characteristic of the disease51,is medium to 250 mM and diluted to 10 mM final concentration in cell culture.

8 NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications & 2013 Macmillan Publishers Limited. All rights reserved. NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438 ARTICLE

IgG CtBP MCF-7 MCF7 MB-231 4-methylthio-2-oxobutyric acid FANCM CST6 * HES1 (MTOB) 0.30 0.2 0.6 MCF-7 MDA-MB-231 0.20 0.4

Ctrl Ctrl MTOB MTOB 0.1 *

FANCM CST6HES1FANCM CST6 HES1 FAN1 t) CST6 0.10 0.2 * R) 1.2 PALB2 ** 4 ** 8 ** 2.0 * 2.0 1.2 R) ERCC5 CLDN9 0.00 0.0 0.0 1.5 1.5 OVOL2 BRIP1 GRHL2 OVOL2 3 6 0.8 0.5 0.8 BRIP1 zed to inpu 2 4 1.0 1.0 FANCM 0.4 0.4 0.4 RAD51C 1.2 0.4 A (qRT-PC XRCC5 0.5 CtBP ChIP signal 0.3 1 2 0.5 CLDN4 0.3 (normali 0.8 mRNA (qRT-PC CEBPB –1.72 0.2 0.2 0 0.0 mRN 0.0 0.0 0.0 0 AMOTL2 * MUS81 0.1 0.4 0.1 BRIP1 GRHL2 OVOL2 BRIP1 GRHL2 OVOL2 ZNF165 * ** 1.6 units 2 CDS1 0.0 0.0 0.0

0.00

* R) 2.5 ** 2.0 2.0 4 * FANCD2 MTOB – – + + – – +––++ +

R) 1.2 1.2 ** Log PARD6B 2.0 NME1 1.5 1.5 3 THAP10 MDA-MB-231 1.5 0.8 KRT18 IgG CtBP

0.8 1.72 1.0 1.0 2 FOXA1 FANCM CST6 HES1 1.0 A (qRT-PC OAZ3 0.04 0.4 0.4 0.5 0.5 1 ANXA3 0.012 0.5 GRHL2 0.03 0.020 mRN RARG mRNA (qRT-PC 0.0 0.0 0.0 0.0 0.0 0 ATR 0.02 0.008 MTOB – + – + – + MTOB – + – + – + HES1 GATA3 t) * 0.010 DKK1 0.01 0.004 Gene expression MCF-7 MB-231 0.00 0.000 0.000 (qRT-PCR) MTOB ––++ BRIP1 GRHL2 OVOL2 FAN1 0.06 0.016 * CST6 0.008 PALB2

CtBP ChIP Signal 0.012

ERCC5 (normalized to inpu 0.04 CLDN9 0.008 OVOL2 MCF-7 MDA-MB-231MCF-7 MDA-MB-231 0.004 BRIP1 0.02 ** 0.004 FANCM E-Cad/Vim CD44 / CD24 RAD51C 5 2.0 0.000 XRCC5 1.2 1.6 0.00 0.000 CLDN4 MTOB – – + + ––––++++ 0.00 4 * 1.4 CEBPB 1.5 1.0 G and input) AMOTL2 1.2 MUS81 0.8 3 ** 1.0 ZNF165 4 ratio CDS1 1.0 0.6 0.8 FANCD2 2 CD24 ratio PARD6B 0.6

4.97 CtBP ChIP signal 0.4 NME1

E-cad/Vim ratio 0.5 E-cad/Vim ratio 0.4 THAP10 1 0.2

CD44 / KRT18 CD44/CD2 0.2

(normalized to Ig FOXA1 0.0 0.0 OAZ3 0 0.0 ANXA3 MTOB – + MTOB – + MTOB – + MTOB – + GRHL2 RARG ATR HES1 GATA3 DKK1

Figure 7 | Small molecule inhibition reverses gene repression by CtBP eviction. (a) Gene expression pattern of CtBP target genes controlling genome stability, EMT and stem cell pathways in MCF-7 cells (left) and MDA-MB-231 cells (right) with and without treatment with 10 mM MTOB. (b) Heat map of gene expression of the 30 gene validation set (Fig. 1b) in MCF-7 (left) and MDA-MB-231 (right) cells treated with and without MTOB. Gene expression values and error bars including the remaining 30 genes are provided in Supplementary Fig. S13. (c) CtBP qChIP profiles of genome stability, EMT and stem cell pathways genes in MCF-7 (top) and MDA-MB-231 (bottom) cells treated with or without MTOB. (d) Heat map of ChIP intensities of 30 gene validation list in MCF-7 and MDA-MB-231 cells treated with or without MTOB. Quantitative ChIP values and error bars are provided in Supplementary Fig. S14. (e) E-cadherin/Vimentin ratio in MCF-7 and MDA-MB-231 cells treated with and without MTOB. (f) CD44/CD24 ratio in MCF- 7 and MDA-MB-231 cells treated with and without MTOB. The error bars represent the s.d. of the mean from three independent experiments (a,e,f)or two independent experiments (c). *Indicates Po0.05 and **indicates Po0.01.

The antibody to CtBP used for ChIP was purchased from Santa Cruz based on the binomial distribution. The hotspots are defined by a z-score calculated Biotechnology and is cross-reactive with both CtBP1 and CtBP2. The anti-CtBP1- using the target window and the background window signals both centred on the specific antibody and anti-CtBP2 specific antibody were purchased from BD tag. In addition, ChIP hotspots were refined into 150 bp peaks using a peak-finding biosciences. Anti-acetylated histone H3, anti-acetylated histone H4 antibodies and procedure. The sequencing data from matching input samples are used for the anti-gH2AX antibody were obtained from Millipore. processing of the ChIP data, as a measure of background signal.

Cell culture and tissues. Both MCF-7 cells and MDA-MB-231 cells were main- tained in regular DMEM supplemented with 10% (v/v) FBS, penicillin– Motif discovery and enrichment analysis. A motif discovery analysis was per- streptomycin (Invitrogen) and insulin. In addition, the regular DMEM has formed on selected DNA sequences using MEME54 on parallel clusters at the NIH À 1 4.5 g l glucose and is considered as high glucose culturing (HG) compared with Biowulf supercomputing facility. DNA sequences for MEME input were from the À 1 1.0 g l glucose DMEM medium (LG). The low glucose-cultured cells were used top 1,500 (by tag density) hotspots among all CtBP binding hotspots. To limit the for experiments only after 3 months of continuous culture in low-glucose medium. computational load, only the 200 bp regions with the highest tag density were used instead of the entire width of a hotspot in cases where the hotspot spanned 4 ChIP and ChIP-seq. All ChIP experiments were carried out as described8.The 200 bp. The width of motifs to discover was set to 6 and 20 for minimum and detailed procedure is provided in the Supplementary Methods. maximum, respectively. To identify motifs for known transcription factor binding, individual position-specific probability matrices against the Transfac database were queried using the TomTom software (http://meme.nbcr.net/meme/cgi-bin/ ChIP-seq data analysis. The detailed ChIP-seq data analysis is provided in the tomtom.cgi). Statistically significant matches were retrieved that share the majority Supplementary Methods. Briefly, the 36-mer short-read tags were mapped to the of specific nucleotides in the sequence motifs. To generate consensus read densities (UCSC HG19). Enrichment of tags in a 250 bp target window for positions relative to transcription start sites (TSS), the total number rd* of read relative to a 200 kb surrounding window (local background) was gauged by a model tags summed over all Refseq annotated TSS normalized to the length L of the

NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications 9 & 2013 Macmillan Publishers Limited. All rights reserved. ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438 genome and the total number N of aligned reads (rd* ¼ rd*L/N) was profiled such (http://www.affymetrix.com/analysis/index.affx). Expression data were filtered to that rd* ¼ 1 approximately corresponds to an un-enriched distribution of reads. exclude any probe set that was not a component of the signatures tested, and to eliminate any probe set whose expression variation across the data set was P40.05. Kaplan–Meier analysis was performed using the Survival Risk Prediction tool, Gene expression and microarray analysis. The total RNA from three biological specifying two risk groups, with fitting to a Cox proportional hazard model with replicates of control MCF-7 cells and CtBP knockdown MCF-7 cells were prepared P-valuer0.05. Distributions of the hazard ratios and the logrank test P-values using the RNAeasy kit (Qiagen) following the manufacturer’s protocol. Synthesis of was determined based on 1,000 Bootstrap samples where each bootstrap sample cDNA from total RNA and hybridization/scanning of microarrays were performed consists of 50% of cases randomly selected from the whole set56. with Affymetrix GeneChip products (HGU133plus2) as described in the GeneChip manual. Raw data files (.CEL) were converted into probe set values by Robust Mult-array Average (RMA) normalization. Following RMA normalization, Bio- Statistical analysis. All the error bars represent the s.d. of the mean from at least conductor packages was applied in R statistical environment to generate a list of three independent biological replicates unless otherwise indicated. Comparison genes that are differentially expressed between control cells and CtBP knockdown between two groups was done using a two-sided Student’s t test. P-valueo0.05 was cells and Po0.05 was considered as significant. The data was stored as NCBI considered statistically significant. GSE36529. References Comet assay. Comet assays were performed according to Olive et al.55 Briefly, a 1. Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the next generation. Cell single-cell suspension was prepared using disaggregation. The cells were 144, 646–674 (2011). À 1 exposed to neutral lysis buffer (2% sarkosyl, 0.5M Na2EDTA, 0.5 mg ml 2. Vogelstein, B. & Kinzler, K. W. Cancer genes and the pathways they control. proteinase K (pH 8.0); equilibrated at 4 °C) for overnight at 37 °C. Following Nat. Med. 10, 789–799 (2004). electrophoresis the cells were stained by SYBR Green and the images were obtained 3. Baylin, S. B. & Jones, P. A. A decade of exploring the cancer epigenome - using fluorescent microscopy. The tail moment was calculated by the following biological and translational implications. Nat. Rev. Cancer 11, 726–734 (2011). formula: Tail moment ¼ tail length  percentage of Tail DNA. Percentage of Tail 4. Ram, O. et al. Combinatorial patterning of chromatin regulators uncovered by DNA ¼ aT  iT/(aT  iT þ aH  iH), where aT ¼ the tail area, iT ¼ average genome-wide location analysis in human cells. Cell 147, 1628–1639 (2011). intensity of tail, aH ¼ the head area and iH ¼ average intensity of Head. Comet 5. Chinnadurai, G. The transcriptional corepressor CtBP: a foe of multiple tumor Score was used to analyse the comet pictures. suppressors. Cancer Res. 69, 731–734 (2009). 6. Zhang, Q. et al. Redox sensor CtBP mediates hypoxia-induced tumor cell Immunofluorescence staining of cells. Cells were grown on coverslips and fixed migration. Proc. Natl Acad. Sci. USA 103, 9029–9033 (2006). in 3.5% paraformaldehyde. For gH2AX staining the cells were incubated with 7. Zhang, Q., Piston, D. W. & Goodman, R. H. Regulation of corepressor function Alexa Fluor 488 Goat Anti-Mouse IgG for 1 h. Cells were irradiated at 10 Gy to by nuclear NADH. Science 295, 1895–1897 (2002). induce DNA damage. 8. Di, L. J., Fernandez, A. G., De, S. A., Longo, D. L. & Gardner, K. Transcriptional regulation of BRCA1 expression by a metabolic switch. Nat. Struct. Mol. Biol. 17, 1406–1413 (2010). Immunohistochemistry staining of tissues. Detailed methods for immunohis- 9. Perou, C. M. et al. Molecular portraits of human breast tumours. Nature 406, tochemistry is provided in the Supplementary Methods. Formalin-fixed, paraffin- 747–752 (2000). embedded tissues were de-paraffinized by submerging the slides in xylene. Antigen 10. Prat, A. et al. Phenotypic and molecular characterization of the claudin-low retrieval was performed in buffers containing 100 ml of 1 mM EDTA (pH 8.0). intrinsic subtype of breast cancer. Breast Cancer Res. 12, R68 (2010). Staining was developed using secondary antibody conjugated with horse-radish 11. Mani, S. A. et al. The epithelial-mesenchymal transition generates cells with peroxidase (HRP) (Dako EnVision) þ System-HRP Labelled Polymer Anti-Rabbit properties of stem cells. Cell 133, 704–715 (2008). or Anti-mouse and counterstained with haematoxylin. 12. Creighton, C. J. et al. Residual breast cancers after conventional therapy display mesenchymal as well as tumor-initiating features. Proc. Natl Acad. Sci. USA Analysis of tissue microarrays. Immunohistochemically stained tissue slides were 106, 13820–13825 (2009). converted to digital slide images by scanning the slides on an Aperio ScanScope XT 13. Herschkowitz, J. I. et al. Comparative oncogenomics identifies breast tumors slide scanner. High-resolution digital slide images were then archived into Aperio’s enriched in functional tumor-initiating cells. Proc. Natl. Acad. Sci. USA 109, digital pathology information management system ‘Spectrum’. Digital slide images 2778–2783 (2012). were analysed using Aperio’s IHC Nuclear Image Analysis algorithm to assess the 14. Hildebrand, J. D. & Soriano, P. Overlapping and unique roles for C-terminal nuclear staining for CtBP in MCF7 cells and quantify the intensity of individual binding protein 1 (CtBP1) and CtBP2 during mouse development. Mol. Cell cells. Values and colours are assigned to individual cells based on the intensity of Biol. 22, 5296–5307 (2002). nuclear staining with a classification of 0 (blue), 1 þ (yellow), 2 þ (orange) and 15. Valastyan, S. & Weinberg, R. A. Tumor metastasis: molecular insights and 3 þ (red). Nuclear intensity was calculated from the sum of the product of the % of evolving paradigms. Cell 147, 275–292 (2011). cells with 3 þ and 2 þ scores divided by the sum of the product of % cells with 16. Gage, M., Wattendorf, D. & Henry, L. R. Translational advances regarding 1 þ and 0 scores. Nuclear score was calculated as the product of nuclear area (mm2) hereditary breast cancer syndromes. J. Surg. Oncol. 105, 444–451 (2012). and the nuclear intensity. Archival formalin-fixed, paraffin-embedded tissues from 17. Wakano, C., Byun, J. S., Di, L. J. & Gardner, K. The dual lives of bidirectional breast cancer patients were obtained from the surgical pathology archive of the promoters. Biochim. Biophys. Acta 1819, 688–693 (2012). University of Chicago for tissue microarray construction. The study was approved 18. DeSiervi, A. et al. Transcriptional autoregulation by BRCA1. Cancer Res. 70, by the Institutional Review Board of the University of Chicago and East Carolina 532–542 (2010). University. Pathological features, including histological diagnosis, grade, tumour 19. Kao, J. et al. Molecular profiling of breast cancer cell lines defines relevant size and axillary lymph node metastasis, were abstracted from the pathology report. tumor models and provides a resource for cancer gene discovery. PLoS ONE 4, There were survival data on 98–100 breast cancer patients from each data set with a e6146 (2009). median follow-up of 8.3 years. 20. Shi, Y. et al. Coordinated histone modifications mediated by a CtBP co-repressor complex. Nature 422, 735–738 (2003). 21. Wang, J. et al. Opposing LSD1 complexes function in developmental gene Analysis of breast cancer gene expression profiles. Expression patterns of the activation and repression programmes. Nature 446, 882–887 (2007). 1,823 genes identified by CtBP ChIP-Seq were examined in a previously published breast cancer containing microarray and patient clinical data set available from the 22. Adamo, A. et al. LSD1 regulates the balance between self-renewal and University of North Carolina (UNC), which includes 337 human breast tumours differentiation in human embryonic stem cells. Nat. Cell Biol. 13, 652–659 (UNC337) and is available in the Gene Expression Omnibus (GEO) under (2011). accession number GEO:(GSE18229)10. All data sets were median centred within 23. Wang, Y. et al. LSD1 is a subunit of the NuRD complex and targets the each data set and standardized to zero mean and unit variance before downstream metastasis programs in breast cancer. Cell 138, 660–672 (2009). analysis10. ANOVA analysis of representative gene expression in tumour samples 24. Bhambhani, C., Chang, J. L., Akey, D. L. & Cadigan, K. M. The oligomeric state was determined using the UNC337 gene expression data set. To determine whether of CtBP determines its role as a transcriptional co-activator and co-repressor of the overlap of CtBP target gene lists with other referenced gene lists, is statistically Wingless targets. EMBO J. 30, 2031–2043 (2011). significant in Venn Diagrams, a hypergeometric distribution was calculated to 25. McDonald, O. G., Wu, H., Timp, W., Doi, A. & Feinberg, A. P. Genome-scale derive the statistical P-value based on 3,7630 TSSs in refseq (HG19, USCS). epigenetic reprogramming during epithelial-to-mesenchymal transition. Nat. Analysis of patient survival associated with gene expression from breast cancer data Struct. Mol. Biol. 18, 867–874 (2011). sets was performed using BRB ArrayTools Version: 4.1.0—Beta_3 Release. 26. Loboda, A. et al. EMT is the dominant program in human colon cancer. BMC Affymetrix data sets were downloaded from the NCBI GEO (http:// Med. Genomics 4, 9 (2011). www.ncbi.nlm.nih.gov/geo/). Expression data were loaded into BRB ArrayTools 27. Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Mapping using the Data Import Wizard. U133A probe sets for the individual gene signatures and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5, were identified by using the Affymetrix NetAffx Analysis Center Batch Query tool 621–628 (2008).

10 NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications & 2013 Macmillan Publishers Limited. All rights reserved. NATURE COMMUNICATIONS | DOI: 10.1038/ncomms2438 ARTICLE

28. Buijs, J. T. et al. BMP7, a putative regulator of epithelial homeostasis in the 50. Bergman, L. M., Birts, C. N., Darley, M., Gabrielli, B. & Blaydes, J. P. CtBPs human prostate, is a potent inhibitor of prostate cancer bone metastasis in vivo. promote cell survival through the maintenance of mitotic fidelity. Mol. Cell Am. J. Pathol. 171, 1047–1057 (2007). Biol. 29, 4539–4551 (2009). 29. Zhao, L. J., Subramanian, T., Vijayalingam, S. & Chinnadurai, G. PLDLS- 51. Issa, J. P. & Garber, J. E. Time to think outside the (genetic) box. Cancer Prev. dependent interaction of E1A with CtBP: regulation of CtBP nuclear Res. 4, 6–8 (2011). localization and transcriptional functions. Oncogene 26, 7544–7551 (2007). 52. Alexander, B. M. et al. DNA repair protein biomarkers associated with time to 30. Chung, C. H., Bernard, P. S. & Perou, C. M. Molecular portraits and the family recurrence in triple-negative breast cancer. Clin. Cancer Res. 16, 5796–5804 tree of cancer. Nat. Genet. 32(Suppl): 533–540 (2002). (2010). 31. Turner, N., Tutt, A. & Ashworth, A. Hallmarks of ‘BRCAness’ in sporadic 53. Stratton, M. R. Exploring the genomes of cancer cells: progress and promise. cancers. Nat. Rev. Cancer 4, 814–819 (2004). Science 331, 1553–1558 (2011). 32. Cieply, B. et al. Suppression of the epithelial-mesenchymal transition by 54. Bailey, T. L. & Gribskov, M. Methods and statistics for combining motif match Grainyhead-like-2. Cancer Res. 72, 2440–2453 (2012). scores. J. Comput. Biol. 5, 211–221 (1998). 33. Werth, M. et al. The transcription factor grainyhead-like 2 regulates the 55. Olive, P. L. & Banath, J. P. The comet assay: a method to measure DNA damage molecular composition of the epithelial apical junctional complex. Development in individual cells. Nat. Protoc. 1, 23–29 (2006). 137, 3835–3845 (2010). 56. Li, J. et al. Identification of high-quality cancer prognostic markers and 34. Bernardo, G. M. et al. FOXA1 represses the molecular phenotype of basal breast metastasis network modules. Nat. Commun. 1, 34 (2010). cancer cells. Oncogene doi: 10.1038/onc.2012.62 (2012). 35. Badve, S. et al. FOXA1 expression in breast cancer--correlation with luminal subtype A and survival. Clin. Cancer Res. 13, 4415–4421 (2007). Acknowledgements 36. Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based This research was supported by the Intramural Research Program of the US National approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Institutes of Health, the US National Cancer Institute, the US National Institute on Sci. USA 102, 15545–15550 (2005). Aging, the US National Institute on Minority Health and Health Disparities, and the 37. Charafe-Jauffret, E. et al. Gene expression profiling of breast cell lines identifies University of Chicago Breast Cancer SPORE P50 CA125183. potential new basal markers. Oncogene 25, 2273–2284 (2006). 38. Kumar, V. et al. Transcription corepressor CtBP is an NAD( þ )-regulated dehydrogenase. Mol. Cell 10, 857–869 (2002). Author contributions 39. Achouri, Y., Noel, G. & Van, S. E. 2-Keto-4-methylthiobutyrate, an L-J.D. performed most of the experiments and generated data and figures and co-wrote intermediate in the methionine salvage pathway, is a good substrate for CtBP1. the manuscript. J.S.B. performed experiments and generated data and figures and Biochem. Biophys. Res. Commun. 352, 903–906 (2007). co-wrote the manuscript. M.M.W. performed experiments and generated data. C.W. 40. Straza, M. W. et al. Therapeutic targeting of C-terminal binding protein in performed experiments. T.T. generated and analysed data. S.B. analysed data and human cancer. Cell Cycle 9, 3740–3750 (2010). generated figures. S.B., H.Y. and M.L. generated methods and analysed data. K.H. 41. Calle, E. E. & Kaaks, R. Overweight, obesity and cancer: epidemiological analysed data. C.Z. and G.K. collected clinical information and analysed data. F.C. and evidence and proposed mechanisms. Nat. Rev. Cancer 4, 579–591 (2004). C.M.P. generated methods, analysed data and generated figures. C.R.M. analysed data. 42. Ewertz, M. et al. Effect of obesity on prognosis after early-stage breast cancer. R.R. and O.I.O. collected clinical information and analysed data. K.G. designed study, J. Clin. Oncol 29, 25–31 (2011). supervised the study, analysed data and co-wrote the manuscript. 43. Sinicrope, F. A. & Dannenberg, A. J. Obesity and breast cancer prognosis: weight of the evidence. J. Clin. Oncol. 29, 4–7 (2011). 44. Taubes, G. Cancer research. Unraveling the obesity-cancer connection. Science Additional information 335, 28 30-32 (2012). Accession codes: Microarray data have been deposited in the Gene Expression Omnibus 45. Vander Heiden, M. G., Cantley, L. C. & Thompson, C. B. Understanding the database under series accession code GSE36529. Warburg effect: the metabolic requirements of cell proliferation. Science 324, 1029–1033 (2009). Supplementary Information accompanies this paper at http://www.nature.com/ 46. Koppenol, W. H., Bounds, P. L. & Dang, C. V. Otto Warburg’s contributions to naturecommunications current concepts of cancer metabolism. Nat. Rev. Cancer 11, 325–337 (2011). 47. Yu, Q. & Heikal, A. A. Two-photon autofluorescence dynamics imaging reveals Competing financial interests: C.M.P is an equity stock holder of BioClassifier LLC and sensitivity of intracellular NADH concentration and conformation to cell University Genomics. C.M.P. has also filed a patent on the PAM50 assay (‘GENE physiology at the single-cell level. J. Photochem. Photobiol. B 95, 46–57 (2009). EXPRESSION PROFILES TO PREDICT BREAST CANCER OUTCOMES’, US Patent 48. Uppal, A. & Gupta, P. K. Measurement of NADH concentration in normal and Application 20110145176). All other authors declare no competing financial interests. malignant human tissues from breast and oral cavity. Biotechnol. Appl. Reprints and permission information is available online at http://npg.nature.com/ Biochem. 37, 45–50 (2003). reprintsandpermissions/ 49. Schwartz, J. P., Passonneau, J. V., Johnson, G. S. & Pastan, I. The effect of growth conditions on NAD þ and NADH concentrations and the How to cite this article: Di, L-J. et al. Genome-wide profiles of CtBP link metabolism NAD þ :NADH ratio in normal and transformed fibroblasts. J. Biol. Chem. 249, with genome stability and epithelial reprogramming in breast cancer. Nat. Commun. 4138–4143 (1974). 4:1449 doi: 10.1038/ncomms2438 (2013).

NATURE COMMUNICATIONS | 4:1449 | DOI: 10.1038/ncomms2438 | www.nature.com/naturecommunications 11 & 2013 Macmillan Publishers Limited. All rights reserved.