Published OnlineFirst September 24, 2019; DOI: 10.1158/0008-5472.CAN-19-2132 Cancer Genome and Epigenome Research
Epigenomic Profiling Discovers Trans-lineage SOX2 Partnerships Driving Tumor Heterogeneity in Lung Squamous Cell Carcinoma Takashi Sato1,2, Seungyeul Yoo3, Ranran Kong1,2,4, Abhilasha Sinha1,2, Prashanth Chandramani-Shivalingappa1, Ayushi Patel1,2, Maya Fridrikh1,2, Osamu Nagano5, Takashi Masuko6, Mary Beth Beasley7, Charles A. Powell1, Jun Zhu2,3,8, and Hideo Watanabe1,2,3
Abstract
Molecular characterization of lung squamous Transcriptional programs cell carcinoma (LUSC), one of the major subtypes Super-enhancers Sox2 of lung cancer, has not sufficiently improved its p63 nonstratified treatment strategies over decades. Sox2 Sox2 p63 Accumulating evidence suggests that lineage- specific transcriptional regulators control differen- Genes involved in epithelial/ tiation states during cancer evolution and underlie p63 epidermal development Classical lineage their distinct biological behaviors. In this study, by Differential dependency Trans-lineage investigating the super-enhancer landscape of LUSC, we identified a previously undescribed Sox2 Brn2 fi "neural" subtype de ned by Sox2 and a neural Sox2 lineage factor Brn2, as well as the classical LUSC Lung squamous cell carcinoma Sox2 Brn2 fi subtype de ned by Sox2 and its classical squamous Neural lineage Brn2 Genes involved in neurogenesis partner p63. Robust protein–protein interaction and genomic cooccupancy of Sox2 and Brn2, in Transcriptional cooperation of Sox2 with p63 or Brn2 determines cancer lineage state in lung squamous cell carcinoma. place for p63 in the classical LUSC, indicated their © 2019 American Association for Cancer Research transcriptional cooperation imparting this unique lineage state in the "neural" LUSC. Forced expres- sion of p63 downregulated Brn2 in the "neural" LUSC cells and invoked the classical LUSC lineage with more squamous/ epithelial features, which were accompanied by increased activities of ErbB/Akt and MAPK–ERK pathways, suggesting differential dependency. Collectively, our data demonstrate heterogeneous cell lineage states of LUSC featured by Sox2 cooperation with Brn2 or p63, for which distinct therapeutic approaches may be warranted.
Significance: Epigenomic profiling reveals a novel subtype of lung squamous cell carcinoma with neural differentiation. Graphical Abstract: http://cancerres.aacrjournals.org/content/canres/79/24/6084/F1.large.jpg.
Introduction which provide a considerable survival benefit for patients with lung cancer, the overall 5-year survival remains less than adequate Lung cancer is the leading cause of cancer-related death world- at 19% (2). Lung squamous cell carcinoma (LUSC) is the second wide (1). Despite recent progress in diagnosis and treatment most common histologic subtype of lung cancer and is strongly including molecular targeted therapeutics and immunotherapy, associated with cigarette smoking. Although achievements in
1Division of Pulmonary, Critical Care and Sleep Medicine, Department of Med- Note: Supplementary data for this article are available at Cancer Research icine, Icahn School of Medicine at Mount Sinai, New York, New York. 2Tisch Online (http://cancerres.aacrjournals.org/). Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, New York. 3Department of Genetics and Genomic Sciences, Icahn School of Medicine at Corresponding Author: Hideo Watanabe, Icahn School of Medicine at Mount Mount Sinai, New York, New York. 4Department of Thoracic Surgery, The Second Sinai, One Gustave L. Levy Place, Box 1232, New York, NY 10029. Phone: 212-241- Affiliated Hospital of Medical School, Xi'an Jiaotong University, Xi'an, Shaanxi, 9242; Fax: 212-876-5519; E-mail: [email protected] China. 5Division of Gene Regulation, Institute for Advanced Medical Research, Keio University School of Medicine, Shinjuku-ku, Tokyo, Japan. 6Cell Biology Cancer Res 2019;79:1–17 Laboratory, School of Pharmacy, Kindai University, Higashiosaka, Osaka, Japan. 7Department of Pathology and Laboratory Medicine, Icahn School of Medicine at doi: 10.1158/0008-5472.CAN-19-2132 Mount Sinai, New York, New York. 8Sema4, a Mount Sinai venture, Stamford, Connecticut. 2019 American Association for Cancer Research.
www.aacrjournals.org OF1
Downloaded from cancerres.aacrjournals.org on September 27, 2021. © 2019 American Association for Cancer Research. Published OnlineFirst September 24, 2019; DOI: 10.1158/0008-5472.CAN-19-2132
Sato et al.
tobacco control in developed countries have contributed to a and that forced expression of p63 leads to classical squamous cell decline in the mortality rate of lung cancer, LUSC still remains a differentiation and suppression of Brn2 in this novel LUSC major cause of cancer-related death worldwide and shows even subtype. higher incidence than lung adenocarcinoma in several coun- tries (3, 4). Comprehensive genomic characterization of LUSC conducted by The Cancer Genome Atlas (TCGA) project revealed Materials and Methods its heterogeneous features with complex genomic alterations (5). Cell lines Unfortunately, these findings have not resulted in the successful Lung cancer cell lines (HCC95, KNS62, HCC2814, LK2, development of clinically approved targeted drugs for LUSC. NCI-H520, SQ-1, EBC-1, HCC2279, NCI-H2887, HCC2450, Compared with patients with lung adenocarcinoma, where mul- Calu-1, HARA, LC-1/SQSF) were maintained in RPMI1640 tiple genome-based targeted therapeutics have been approved, (Gibco) with 10% FBS and 1% penicillin–streptomycin (Gibco). patients with LUSC have very limited therapeutic options. More HEK293T cells were maintained in DMEM with 10% FBS and 1% recently, immune checkpoint inhibitors have emerged as key penicillin–streptomycin (Gibco). Original sources for these cell therapeutic options for solid tumors including LUSC (6, 7); lines are shown in Supplementary Table S1. HCC95, HCC2814, however, the response rate for unselected LUSC population HCC2279, NCI-H2887, and HCC2450 cells were a gift from remains only approximately 20% and initially responded tumors Dr. John Minna (University of Texas Southwestern Medical eventually progress in most cases. Therefore, innovative strategies Center, Dallas, TX). Cells were authenticated by copy number to better characterize and classify LUSC for a better patient variation analyses and regularly tested for Mycoplasma using the stratification for current and future therapeutic options are des- MycoAlert Detection Kit (Lonza). perately needed. We previously identified SOX2 as the most commonly ampli- Chromatin immunoprecipitation sequencing fied oncogene in LUSC (8). We proposed SOX2 as a lineage- For H3K27ac, Sox2, and p63 chromatin immunoprecipitation survival oncogene in squamous cell cancers for its essential role (ChIP), cells were performed as described previously with mod- during the development in the specification of the squamous cell ifications (10). Cells were crosslinked with 1% formaldehyde in lineages by opposing the role of Nkx2-1 in the dividing foregut PBS for 10 minutes at room temperature, washed in 5 mg/mL BSA and its essentiality for LUSC cell survival (9). In a following study, in PBS and then in just cold PBS, resuspended in lysis buffer we identified another squamous lineage factor, p63, as an impor- [50 mmol/L Tris-HCl pH 8.1, 10 mmol/L EDTA, 1% SDS, 1 tant cooperative partner of Sox2 in LUSC (10). SOX2 amplifica- protease inhibitor cocktail (Thermo Fisher Scientific)] and son- tion on chromosome 3q in LUSC often extends to its telomeric icated with the Covaris M220 sonicator or the Diagenode side to include the locus of TP63, which encodes p63. While focal Bioruptor sonicator to obtain chromatin fragment lengths of amplification and/or overexpression of both SOX2 and TP63 100-to-1,000 bp judged by Bioanalyzer DNA High Sensitivity Kit genes is found in only 7% of LUSC tumors, broader copy number (Agilent). Fragmented chromatin was diluted in immunoprecip- gains on 3q telomeric ends are observed in the vast majority of itation (IP) buffer (20 mmol/L Tris-HCl pH 8.1, 150 mmol/L LUSCs (5). Studies on expression profiles classified LUSCs into NaCl, 2 mmol/L EDTA, 1% Triton X-100) and incubated over- four expression subtypes (primitive, classical, secretory, and bas- night at 4 C with protein G magnetic beads (Dynabeads, Thermo al), suggesting the heterogeneity of transcriptional programs Fisher Scientific) that had been preincubated with anti-H3K27ac within LUSCs (5, 11, 12). On the basis of this classification, (Abcam, ab4729) or anti-Sox2 (R&D Systems, AF2018) or anti- coamplification of SOX2 and TP63 is enriched in the classical p63 (Santa Cruz Biotechnology, sc-8431) antibodies. subtype of LUSC, whereas TP63 expression is relatively low in the For Brn2 ChIP, cells were crosslinked with 2 mmol/L disucci- primitive and secretory subtypes. However, it remains largely nimidyl glutarate in PBS with 1 mmol/L MgCl2 for 45 minutes at unknown what mechanisms are involved in controlling transcrip- room temperature and 1% formaldehyde in PBS for 11 minutes at tional programs in the heterogeneous group of LUSCs. room temperature, washed in 5 mg/mL BSA in PBS and then A series of recent genome-wide histone modification analyses in just cold PBS, re-suspended in nuclear extraction buffer have demonstrated the presence of large clusters of putative (10 mmol/L Tris-HCl pH 7.5, 10 mmol/L NaCl, 3 mmol/L MgCl2, enhancers in close genomic proximity, coined super- 0.1 % IGEPAL CA-630, 1 protease inhibitor cocktail), and enhancers (13, 14). Because these regions typically exhibit cell incubated on ice for 10 minutes. Extracted nuclei were then lineage–specific patterns in health and in disease, super-enhancer washed and resuspended in micrococcal nuclease digestion buffer profiling is becoming a powerful tool to identify novel cancer cell (0.3 mol/L sucrose, 20 mmol/L Tris-HCl pH 7.5, 3 mmol/L CaCl2, lineages governed by specific transcriptional regulators. For exam- 1 protease inhibitor cocktail) and incubated with 12,000 gel ple, analysis of the super-enhancer landscape of neuroblastoma units/mL micrococcal nuclease (NEB, M0247S) at 37 C with has demonstrated heterogeneity of lineage states governed by frequent mixing to digest chromatin to lengths of approximately specifi c transcriptional programs that may underlie the cause of 100-to-1,000 bp. Digestion was stopped with 25 mmol/L EDTA relapse after chemotherapy (15). Similar analysis of the super- pH 8.0 and nuclei were resuspended in lysis buffer (50 mmol/L enhancer landscape in acute myeloid leukemia identified a novel Tris-HCl pH 8.0, 10 mmol/L EDTA, 1% SDS, 1 protease epigenomic subtype, which has therapeutic implications for inhibitor cocktail) and sonicated with the Diagenode Bioruptor differentiation therapy (16). In this study, we investigated sonicator (low intensity, 30 seconds on and 30 seconds off, super-enhancer profiles in LUSC and identified a novel subtype 2 cycles). Fragmented chromatin was diluted in IP buffer in which Sox2 and a neural transcription factor Brn2 have key (20 mmol/L Tris-HCl pH 8.1, 150 mmol/L NaCl, 2 mmol/L roles in determining its distinctive differentiation state. In addi- EDTA, 1% Triton X-100) and incubated with anti-Brn2 (Cell tion, we show that Brn2 serves as an interacting partner for Sox2 in Signaling Technology, #12137) antibody overnight at 4 C. this novel subtype, instead of p63 in the classical subtype of LUSC Protein G magnetic beads (Dynabeads, Thermo Fisher Scientific)
OF2 Cancer Res; 79(24) December 15, 2019 Cancer Research
Downloaded from cancerres.aacrjournals.org on September 27, 2021. © 2019 American Association for Cancer Research. Published OnlineFirst September 24, 2019; DOI: 10.1158/0008-5472.CAN-19-2132
Lineage Heterogeneity in Squamous Lung Cancer
were then added and the IP reaction was incubated for 2 hours sequencing library construction using NEBNext Ultra DNA at 4 C. Library Prep Kit (NEB, E7645). Paired-end sequencing was per- Immunoprecipitates were washed six times with the wash formed on NextSeq 500 (Illumina) for 38 nucleotides from each buffer (50 mmol/L HEPES pH 7.6, 0.5 mol/L LiCl, 1 mmol/L end according to the manufacturer's instructions. EDTA, 0.7% sodium deoxycholate, 1% IGEPAL CA-630) and twice with Tris-EDTA buffer. Immunoprecipitated (or no IP RNA-sequencing analyses input) DNA was treated with RNase A and Proteinase K on the RNA-sequencing (RNA-seq) gene expression data as well as beads, recovered in 1% SDS and 0.1 mol/L NaHCO3 over a period SNP array copy number data for SOX2 were obtained from Cancer of 7 hours at 65 C, and purified with DNA clean and concentrator- Cell Line Encyclopedia (CCLE; http://www.broadinstitute.org/ 25 (Zymo Research). Up to 10 ng of DNA was used for the library ccle/home). RNA-seq data of 501 tumor tissues were obtained construction using NEBNext Ultra II DNA Library Prep Kit (NEB, from TCGA-LUSC dataset (5). We downloaded htseq-counts as E7645). Sequencing was performed on NextSeq500 (Illumina) read counts for each gene in May 2018 (https://portal.gdc.cancer. for 38 nucleotides each from paired ends according to the man- gov/). The TCGA read counts originally aligned for Ensemble ufacturer's instructions. transcripts were converted to corresponding RefGene symbol based on the USCS database. mRNA abundance was estimated ChIP sequencing data analysis from read counts in transcripts per million (TPM) as described in For H3K27ac ChIP, ChIP-enriched regions (peaks) were iden- Wagner and colleagues (22). Genotype-Tissue Expression (GTEx) tified by Model-based Analysis of ChIP-seq (MACS; ref. 17) after TPM matrix (https://gtexportal.org/home/datasets) was down- aligning to the human reference genome hg19/GRCh37. To loaded from GTEx data portal in July 2018 (23). Data from the define "super-enhancers," we used the ROSE2 pipeline (18). brain hypothalamus region were used for the analysis. Log2- H3K27ac-enriched peaks were stitched together, using an optimal transformed TPM values were used as log2(TPMþ1) for the distance determined by the algorithm per sample up to 12.5 kb. following analysis. Two RNA-seq data (EGAD00001001244 and The algorithm generates plots for all stitched enhancers per each GSE60052) were downloaded for small cell lung cancer (SCLC; sample and defines super-enhancers on the basis of top-ranked refs. 24, 25). We downloaded raw fastq files, aligned to the human enhancers with highest read counts at the tangential cutoff. reference genome hg19, and estimated mRNA abundance in Unsupervised hierarchical clustering was performed using signals terms of TPM. (read counts on defined super-enhancer regions) near transcrip- For the TCGA-LUSC RNA-seq data, the 501 LUSC tumor tissues tional regulators. A list of the transcriptional regulators was were classified on the basis of the expression level of TP63, SOX2, obtained from the genes annotated with gene ontology term, and POU3F2. We used TPM cutoffs using (mean – SD) for TP63 DNA binding (GO: 0003677). Principal component analysis and SOX2 as their expressions are normally distributed (Fig. 2D; (PCA) was performed on the basis of the same signals per region, Supplementary Fig. S2D and S2E) and TPM>1 for POU3F2 and top 3 components were depicted in three-dimensional plots. because most of the tumors have little expression of the gene ChIP signals for each sample were visualized on Integrative (Fig. 2A). Hierarchical clustering of the LUSC tumors (n ¼ 416) Genome Viewer (IGV) genome browser (19) using wiggle files was performed using the differentially expressed genes between with a 10-bp resolution for H3K27ac modification generated by POU3F2-high/ TP63-low and POU3F2-low/TP63-high tumors MACS with tag shift that was rescaled to normalize to a total with complete-linkage clustering (Fig. 3A). Kaplan–Meier curves number of uniquely alignable sequences by WigMath function of were plotted to compare survival for patients with POU3F2-high/ Java-Genomic Toolkit. Differentially enriched super-enhancers TP63-low tumors versus patients with POU3F2-low/TP63-high were determined by comparing the "neural" subgroup against tumors (Fig. 3C). To compare the POU3F2-high/TP63-low tumors the "classical" subgroup using "samr" R package (20), based on with previously reported LUSC expression subtypes, the TCGA cutoffs of fold change > 2 and SAM score > 4. tumors were classified on the basis of expression of the 16 subtype For Sox2, Brn2, and p63 ChIP, peaks were identified by exemplar genes for each subtype (5). Using the k-means clustering MACS (17) after aligning to the human reference genome with k ¼ 4, these genes classified 497 tumors into 4 clusters with hg19/GRCh37. ChIP signals on Sox2, Brn2, and p63 peaks 106 primitive, 158 classical, 127 secretory, and 106 basal subtype detected on any samples were visualized as a heatmap generated tumors (Supplementary Fig. S2G). by "heatmap" function in Cistrome analysis pipeline (http:// To perform RNA-seq analyses for engineered LK2 cell lines, cistrome.org/) or plotHeatmap function of deepTools (21) using sequencing reads were aligned to the human reference genome wiggle files with a 10-bp resolution generated by MACS with tag hg19/GRCh37 using Tophat (v2.1) and read counts per RefGene shift that was rescaled to normalize to a total number of uniquely symbol on the USCS database was estimated using the htseq- alignable sequences by WigMath function of Java-Genomic count function in SAMtools (26). Then DEseq2 (27) was used to Toolkit. identify the differentially expressed genes between control LK2 cells (two stable GFP-overexpressed LK2 cells and duplicates of RNA sequencing inducible DNp63-expressing LK2 cells without doxycycline) and Total RNAs from engineered LK2 cell lines were extracted using DNp63-overexpressed LK2 cells (two stable DNp63-overex- the Qiagen RNeasy Kit. Polyadenylated RNA was enriched from pressed cells and duplicates of inducible DNp63-expressing LK2 1 mg of RNA for each sample with the NEBNext PolyA mRNA cells after treatment with 2 mg/mL doxycycline for 6 days) and to Magnetic Isolation Module (NEB, E7490), incubated at 94 C for identify genes significantly associated with DNp63 overexpres- 15 minutes and double-strand cDNA was synthesized using sion levels (control ! inducible ! stable). To compare the SuperScript III Reverse Transcriptase (Thermo Fisher Scientific) transcriptomic profiles of the engineered LK2 cells with those of and NEBNext Ultra II Directional RNA Second Strand Synthesis the LUSC cell lines from CCLE, we merged the TPM matrix from Module (NEB). Up to 10 ng of cDNA was used for the Illumina the engineered cells with that from the CCLE cell lines based on
www.aacrjournals.org Cancer Res; 79(24) December 15, 2019 OF3
Downloaded from cancerres.aacrjournals.org on September 27, 2021. © 2019 American Association for Cancer Research. Published OnlineFirst September 24, 2019; DOI: 10.1158/0008-5472.CAN-19-2132
Sato et al.
common genes and performed quantile normalization followed antibody and Dynabeads Protein G (Thermo Fisher Scientific) by ComBat to remove potential batch effects. on a rotator overnight at 4 C. After washing with PBS þ 0.05% To identify potentially enriched functions of selected gene sets Tween 20, the beads were boiled for 10 minutes in 1 SDS of interest, we compared these gene sets with the genes annotated sample buffer. The immunoprecipitates were then processed for by the same Gene Ontology (GO) terms curated in the Molecular immunoblotting. Signature Database (MSigDB; ref. 28). Each of 5,917 GO terms included in "C5" collection (version 6.0 downloaded in 2017) Lentiviral introduction of genes was compared with query gene sets. Any GO terms consisting of POU3F2, DNp63a,orGFP open reading frame (ORF) was more than 2,000 genes were considered nonspecific and removed cloned into pLEX_306 (a gift from Dr. David Root, Broad Insti- from the analysis. tute, Cambridge, MA; Addgene #41391), pLEX_307 (a gift from Dr. David Root, Addgene #41392) or pLIX_403 (a gift from Dr. Immunofluorescence David Root, Addgene #41395) using Gateway cloning methods Cells were grown on glass coverslips coated with poly-D-lysine according to manufacturer's recommendations. For lentiviral (Neuvitro), fixed in 4% paraformaldehyde for 10 minutes, and vectors production, HEK293T cells were seeded in 10-cm tissue permeabilized with 1 PBS containing 0.25% Triton X-100 for 10 culture dish and incubated at 37 C and 5% CO2. Cells at 80% minutes. Primary antibodies were used at the following dilutions: confluency were cotransfected with 10 mg of lentiviral expression anti-Brn2 at 1:200 (Cell Signaling Technology, #12137), and anti- constructs, 7.5 mg of psPAX2, and 2.5 mg pMD2.G vectors using Sox2 at 1:400 (Abcam, ab171380). Fluorescent signal was TransIT-Lenti (Mirus) following manufacturer's recommenda- detected with secondary antibodies conjugated with Alexa Fluor tions. At 48 hours posttransfection, supernatants were collected, (Thermo Fisher Scientific #A-11034, #A-21236) diluted at filtered (0.45 mm), and stored at 80 C. Cells were infected with 1:2,000, and coverslips were mounted (ProLong Diamond Anti- supernatant containing lentivirus supplemented with polybrene fade Mountant with DAPI: Thermo Fisher Scientific). Images were at a final concentration of 8 mg/mL and then selected with obtained with a Leica DM5500 B fluorescence microscope at 20 puromycin (2–3 mg/mL for 4–6 days). Ectopic protein expression objective magnification. Images were processed using the Fiji was confirmed via immunoblotting and compared with physio- distribution of ImageJ (https://fiji.sc/). logic expression levels in the LUSC cells with native expression of the transgenes. Western blotting and coimmunoprecipitation To determine protein expression, cell lysates were prepared by CRISPR-Cas9 genome editing incubating cells in lysis buffer A (150 mmol/L NaCl, 50 mmol/L Cells stably expressing human codon-optimized S. pyogenes Tris-HCl at pH 8.0, 1% IGEPAL CA-630, 0.5% sodium deoxycho- Cas9 were generated by infection with the lentiCas9-Blast plasmid late, 0.1% SDS, protease inhibitors) or lysis buffer B (150 mmol/L (a gift from Dr. Feng Zhang, Massachusetts Institute of Technol- NaCl, 40 mmol/L Tris-HCl at pH 8.0, 1% IGEPAL CA-630, 0.5% ogy, Cambridge, MA; Addgene, # 52962). Single-guide RNAs sodium deoxycholate, 2 mmol/L EDTA at pH 8.0, protease/ (sgRNA) were cloned at BbsI site downstream of the human U6 phosphatase inhibitors) for 30 minutes at 4 C. After centrifuga- promoter in a lentiviral vector containing eGFP downstream of tion to remove insoluble debris, lysates were immunoblotted the human PGK promoter (a kind gift from the Brian Brown with use of an anti-Brn2 antibody (Santa Cruz Biotechnology, sc- laboratory, Icahn School of Medicine at Mount Sinai, New York, 393324 or Cell Signaling Technology, #12137), anti-p63 anti- NY). Lentivirus was produced as described above. Cells were first body (Santa Cruz Biotechnology, sc-8344, sc-8431 or Cell Sig- infected with the lentiCas9-Blast lentivirus and then selected with naling Technology, #13109), anti-Sox2 antibody (R&D Systems, blasticidin (5 mg/mL for 10 days) for cells expressing the Cas9 AF2018 or Cell Signaling Technology, #3579), anti-ErbB4 anti- nuclease. Cells were then infected with pLenti-GFP-sgRNA. Poly- body (Cell Signaling Technology, #4795), anti-phospho-ErbB4 clonal populations of infected cells were used to avoid the effects antibody (Cell Signaling Technology, #4757), anti-ErbB3 anti- of selecting single/unique knockout clones. The knockout effi- body (Cell Signaling Technology, #12708), anti-phospho-ErbB3 ciency was confirmed in the bulk populations via immunoblot- antibody (Cell Signaling Technology, #4791), anti-EGFR anti- ting. sgRNA target sequences were selected from Brunello body (Millipore, #06-847), anti-phospho-EGFR antibody library (29) and Orzol and colleagues (30). Nontarget sgRNAs (Abcam, ab40815) anti-phospho-Akt antibody (Cell Signaling from the Gecko library v2 (31) were used as scramble sgRNAs. Technology, #4060), and anti-phospho-Erk1/2 antibody (Cell sgRNA target sequences are listed in Supplementary Table S2. Signaling Technology, #4370), anti-Ascl1 antibody (Santa Cruz Biotechnology, sc-374104), anti-SYP antibody (Santa Cruz Bio- IHC technology, sc-17750), anti-b-actin antibody (Sigma), or anti- IHC analyses were performed for xenograft tumor specimens vinculin (Sigma). For Western blotting to profile ErbB family and human primary LUSC tumor specimens. Total 23 formalin- signaling, engineered LK2 cells were serum starved for 4 hours, fixed, paraffin-embedded (FFPE) specimens of human primary and then incubated with anti-ErbB4 mAb P6-1 (a kind gift from LUSCs resected from patients at Icahn School of Medicine at the Hideyuki Saya laboratory, Keio University, Tokyo, Japan) for Mount Sinai (ISMMS, New York, NY) between 2011 and 2013 15 minutes, followed by stimulation with recombinant human were obtained and cut into 5-mm–thick sections. Characteristics of NRG1-b1 EGF domain (R&D Systems) for 30 minutes. the human LUSC specimens are shown in Supplementary For coimmunoprecipitation, whole-cell lysate of cells was Table S3. prepared in IP lysis buffer (25 mmol/L Tris-HCl pH 7.4, 150 IHC staining for Brn2, after being deparaffinized and rehy- mmol/L NaCl, 1 mmol/L EDTA, 1% IGEPAL CA-630, 5% glycerol, drated, FFPE tissue slides (5-mm thick) were heated at 95 Cin protease inhibitors) followed by incubation with anti-Brn2 anti- 10 mmol/L citrate buffer (pH 6.0) for antigen retrieval for body, anti-Sox2 antibody, anti-p63 antibody, or normal IgG 30 minutes. The sections were incubated with 0.3% H2O2 in TBS
OF4 Cancer Res; 79(24) December 15, 2019 Cancer Research
Downloaded from cancerres.aacrjournals.org on September 27, 2021. © 2019 American Association for Cancer Research. Published OnlineFirst September 24, 2019; DOI: 10.1158/0008-5472.CAN-19-2132
Lineage Heterogeneity in Squamous Lung Cancer
for 15 minutes to block endogenous peroxidase activity and Quantitative reverse transcription PCR were incubated with 10% normal goat serum (Jackson Immu- For expression analysis, RNA was extracted from cells using the noResearch) in TBS for 30 minutes to block nonspecific staining. Qiagen RNeasy Mini Kit and cDNA was prepared with the Qiagen After rinsing with TBS þ 0.025% Triton X-100, the sections were QuantiTect Reverse Transcription Kit. Real-time quantitative incubated with anti-Brn2 antibody (1:100; Cell Signaling Tech- PCRs were performed in triplicate (Applied Biosystems) on a nology, #12137) at 4 C overnight, followed by incubation with 7500 Real-Time PCR System (Applied Biosystems) with results biotin-conjugated goat anti-rabbit secondary antibody (Vector normalized to GAPDH expression. Primers are listed in Supple- Laboratories) at room temperature for 1 hour. Then, the sections mentary Table S2. were incubated with the ABC reagent (Vector Laboratories) and visualized with ImmPACT-DAB Peroxidase Substrate (Vector Statistical analyses Laboratories). IHC staining for Sox2 and p63 was conducted Differentially enriched super-enhancers were determined using using a Leica Bond-III automated slide stainer. The sections were "samr" R package (20), based on cutoffs of fold change > 2 and deparaffinized and subjected to heat-induced antigen retrieval SAM score > 4. For the TCGA-LUSC RNA-seq data, Pearson with high pH EDTA buffer for 20 minutes and 10 minutes, correlation coefficient between gene expression levels was calcu- respectively. The sections were incubated with anti-Sox2 antibody lated and associated P value was determined. t test was performed (prediluted; CellMarque clone SP76) or anti-p63 antibody (pre- to identify differentially expressed genes between POU3F2-high/ diluted; Biocare Medical, clone 4A4) as per the manufacturer's TP63-low and POU3F2-low/TP63-high tumors based on cutoffs instructions. All slides were counterstained with hematoxylin of fold change > 2 and FDR < 0.01. Significance of the overlap before being mounting. between gene sets was determined using Fisher exact test. To compare the overall survival, Kaplan–Meier curves were plotted Cell proliferation assay and log-rank test was performed. DEseq2 (27) was used to Cells were plated onto sextuplicate wells of a 96-well plate; identify the differentially expressed genes between control and three identical plates were prepared. Cell proliferation was DNp63-overexpressed LK2 cells based on cutoffs of fold change > assayed at 24, 72, and 120 hours after plating with AlamarBlue 1.5 and FDR < 0.05. In addition, likelihood ratio test model was Cell Viability Reagent (Thermo Fisher Scientific) and fluorescence used to identify genes significantly associated with conditions at 585 nm was measured on a Spectra Max3 plate reader (Molec- based on cutoffs of fold change > 2 and FDR < 0.01. For cell ular Devices) according to the manufacturer's protocol at excita- proliferation assay, xenograft tumor growth measurement and tion of 555 nm. Cell viability at 72 and 120 hours were corrected cell-cycle analysis, differences between two groups were examined for the ratio to control cells from the 24-hour reading to account using two-tailed t test. Bonferroni corrections were performed in for plating unevenness. case multiple comparisons were conducted.
Cell-cycle analysis Data availability Cells were fixed with 70% ethanol at least overnight at 4 C. ChIP-seq and RNA-seq data were deposited in the Gene Expres- After washing with PBS, cells were incubated in triplicate in PBS sion Omnibus under the accession number GSE137461. containing 100 mg/mL RNase and 50 mg/mL propidium iodide overnight at 4 C overnight. DNA content was analyzed by FACS- Canto II (BD Biosciences), and quantitative analyses for the Results proportions of cells in cell cycle were performed using FlowJo Super-enhancer profiling identifies a novel subtype of lung software. squamous cell carcinoma To understand the intertumor heterogeneity of cell lineages in Xenograft model LUSC, we examined super-enhancer landscape in a panel of 13 Engineered LK2 cells (4 106 cells) were injected with a 1:1 LUSC cell lines. Unsupervised hierarchical clustering of these cell mixture of 50 mL cell suspension and 50 mL Matrigel (Corning) lines using H3K27 acetylation signals over super-enhancer subcutaneously into both flank regions of 4- to 6-week-old male regions near transcriptional regulators identified three subgroups NOD-scid or NOD-scid gamma mice (Jackson Laboratory). of LUSC (Fig. 1A). PCA using those signals supported this clas- Tumor volume (length width2/2) was measured twice a week. sification (Supplementary Fig. S1A). One of the three subgroups Before tumor size reached at 1,000 mm3, mice were sacrificed and consists of five LUSC cell lines, in which common super- tumors were immersed in formalin for immunohistologic anal- enhancers are observed at genetic loci of 9 transcriptional reg- ysis. All animal procedures and studies were approved by the ulators including SOX2 and TP63 loci (Supplementary Fig. S1B; Mount Sinai Institutional Animal Care and Use Committee Supplementary Table S4), consistent with our previous findings (IACUC; protocol number, IACUC-2018-0021). that these genes play essential roles as lineage oncogenes in typical LUSCs (8, 10). This subgroup is enriched in SOX2-amplified Phospho-RTK array analysis LUSCs with high expression of both SOX2 and TP63 (Fig. 1B). The Human Phospho-RTK Array Kit (R&D Systems) was used to In contrast, LK2 and NCI-H520 cells formed a small subgroup determine the relative levels of tyrosine phosphorylation of 42 (Fig. 1A; Supplementary Fig. S1A), in which the common super- distinct receptor tyrosine kinase (RTK), according to the manu- enhancers with highest signals lie on only two loci at the SOX2 facturer's protocol. Three-hundred micrograms of total protein gene and the POU3F2 gene, which encodes Brn2 (Fig. 1C and D; was used for each membrane. Chemiluminescent signals were Supplementary Table S5). Brn2 is a well-known lineage factor in captured with a Chemidoc MP Imaging System (Bio-Rad neural progenitor cells, particularly in the hypothalamus, where it Laboratories) and images were processed using Image Lab soft- partners with Sox2 to exert its transcriptional functions (32–34). ware (Bio-Rad Laboratories). We also noted that the third subgroup, which consists of five cell
www.aacrjournals.org Cancer Res; 79(24) December 15, 2019 OF5
Downloaded from cancerres.aacrjournals.org on September 27, 2021. © 2019 American Association for Cancer Research. OF6 acrRs 92)Dcme 5 2019 15, December 79(24) Res; Cancer
aoe al. et Sato oio O2(et,T6 mdl) n O32(ih)i UCcl ie.Nraie inl r hw narnefo o5 o l h tracks. the all for 50 to 0 lines. from cell range LUSC a in in control shown loading are a signals as Normalized (right). vinculin lines. cells and cell NCI-H520 Sox2, LUSC and p63, in (left) Brn2, (right) cells of POU3F2 LK2 expression and lines; (middle), cell TP63 LUSC (left), of data SOX2 subset Duplicate of small GenePattern. loci the with in values scores the H3K27ac of using squares plots of enhancer cells. sum NCI-H520 the and using LK2 scores for linear shown to are normalized were signals Super-enhancer genes. regulator transcriptional pro Super-enhancer 1. Figure D A B H520_rep2 H520_rep1 LC1SQSF HCC2279 HCC2450 HCC2814 LK2_rep2 LK2_rep1 CALU1 HCC95 KNS62 2− 2 1 0 −1 −2 H2887 HARA EBC1 H520 532 SEs Near transcriptional regulators SQ1 Downloaded from Row Z-score HCC95 LK2 HARA [0-50] Chr3 HARA KNS62 181,420kb HCC2814 fi LC1SQSF igidenti ling HCC2814 LK2 LC1SQSF H520 Published OnlineFirstSeptember24,2019;DOI:10.1158/0008-5472.CAN-19-2132 HCC2450 KNS62 SQ1 181,430kb H3K27ac N/A fi
SOX2 CALU1 sanvlsbyeo LUSC. of subtype novel a es HCC95 EBC1
cancerres.aacrjournals.org H2887 HCC2450 HCC2279 B, TP63 TP63 POU3F2 SOX2 SOX2 POU3F2 oynme of number Copy 181,440kb H2887 HCC2279 CALU1
181,450kb EBC1
H3K27ac Signal C
SOX2 SQ1 100,000 150,000 POU3F2 mRNA TP63 mRNA SOX2 mRNA SOX2 CN 50,000 A, nuevsdheacia lseigo 3LS ellnsuigsprehne S)sgasnear signals (SE) super-enhancer using lines cell LUSC 13 of clustering hierarchical Unsupervised n RAepeso of expression mRNA and nacr akdb 32a signal H3K27ac by ranked Enhancers Chr3 [0-50] 0 Cancer Research. 189,500kb 08 identified: 624 identified: Super-enhancers 14,339 used: Cutoff on September 27, 2021. © 2019American Association for 2,000 189,550kb 4,000 H3K27ac TP63 POU3F2 LK2 ,0 10,000 6,000 SOX2 189,600kb Vinculin E O2 TP63 SOX2, Brn2 Sox2 p63 , 0 0 0 189,650kb ,and
LK2 POU3F2
H H3K27ac Signal 520 100,000 200,000 300,000 400,000 500,000 nLS ellnsfo CCLE. from lines cell LUSC in [0-50] 920b99,270kb 99,250kb D, Chr6
HCC95 signal H3K27ac by ranked Enhancers eoeve rcso 32a inlat signal H3K27ac of tracks view Genome 0
HCC2814 15,000 10,000 5,000 0 920b9,8k 99,290kb 99,280kb 99,260kb Super-enhancers identified: 583 identified: 21,833 used: Cutoff
KNS H3K27ac 62 H520 HARA POU3F2 SOX2 E, acrResearch Cancer POU3F2 C, LC1/ Protein Super- SQSF Published OnlineFirst September 24, 2019; DOI: 10.1158/0008-5472.CAN-19-2132
Lineage Heterogeneity in Squamous Lung Cancer
lines, shows relatively low expression of both SOX2 and TP63 ever, when we focused on tumors with low TP63 expression, we (Fig. 1B), although human LUSC tumors with low expression found a significant correlation of POU3F2 with SOX2 (Fig. 2D), (< mean SD) of both genes accounts for only 6.6% in TCGA whereas there was a significant correlation of TP63 with SOX2 Lung Squamous Cell Carcinoma (TCGA-LUSC) dataset. While the among LUSCs with high TP63 expression (Supplementary SOX2 super-enhancers were shared by the former two subgroups Fig. S2F). These findings suggest that Brn2 and p63 have con- (Fig. 1C and D; Supplementary Fig. S1B) and the TP63 super- trasting roles representing distinct lineages within LUSC by part- enhancers were found only in the first subgroup (Fig. 1D; Sup- nering with Sox2 to elicit distinct transcriptional outputs. plementary Fig. S1B), POU3F2 super-enhancers were present only To investigate the difference in gene signatures between in the small subset of LK2 and NCI-H520 (Fig. 1D). We also found POU3F2-expressing LUSC tumors and TP63-expressing LUSC the POU3F2 super-enhancers to be among the top differential tumors, we next performed differential expression analysis in the super-enhancers in this subset compared to the "classical" SOX2/ TCGA-LUSC dataset comparing POU3F2-high/TP63-low tumors TP63 subgroup (Supplementary Fig. S1C). Analyses using versus TP63-high/POU3F2-low tumors (Fig. 3A). A functional RNA-seq data on these cell lines obtained through CCLE dataset enrichment analysis of differentially expressed genes identified revealed that expression of POU3F2 was indeed significantly genes that play roles in neural cell differentiation significantly higher in this small subset while the squamous lineage factor enriched in POU3F2-high/TP63-low tumors, while those TP63 was expressed significantly higher in the "classical" SOX2/ involved in epithelial/epidermal cell differentiation were signif- TP63 subgroup of LUSC (Fig. 1B). In contrast, SOX2 expression icantly enriched in TP63-high/POU3F2-low tumors (Fig. 3B). was found in both subgroups and highest in the SOX2/POU3F2 Furthermore, we evaluated the prognostic significance of TP63/ subset (Fig. 1B). Immunoblotting showed the expression pattern POU3F2 expression status in the TCGA-LUSC dataset and found to be consistent with protein level; the LUSC cell lines that harbor that patients with POU3F2-high/TP63-low tumors showed sig- POU3F2 super-enhancers do not express p63 at a detectable level nificantly shorter survival compared with those with TP63-high/ but have high expression of Sox2 and Brn2 (Fig. 1E). These POU3F2-low tumors, suggesting clinically distinct, aggressive findings suggest that, while SOX2 is highly expressed and likely features of this "neural" subtype (Fig. 3C). When we examined serves as a lineage factor for both of these two subgroups, this the relationship between these POU3F2-high/TP63-low tumors small subgroup represents a unique subset of LUSCs signified by and the LUSC expression subtypes (11), consistent with the report the neural lineage factor Brn2. in which the "neural" LK2 and NCI-H520 cells were predicted as primitive (12), all of the POU3F2-high/TP63-low tumors were BRN2 signifies the "neural" subtype in human LUSCs classified as the primitive subtype (Supplementary Fig. S2G), Brn2 has been described to have an oncogenic role in neuro- which was reported to show worse prognosis among LUSC (11). endocrine tumors such as SCLC, neuroendocrine prostate cancer, These findings indicate the presence of a distinct LUSC subset with and glioblastoma (35–39). To determine whether this unique neural differentiation that has not been described previously. "neural" subset exists in human LUSC data and POU3F2 (Brn2) signifies this subgroup, we examined TCGA-LUSC dataset and BRN2 and SOX2 interact and colocalize at genetic loci in the identified a subset of LUSC that expresses POU3F2 (Fig. 2A) and "neural" LUSC cells their expression levels were comparable with that in SCLC (24) Given the positive correlation of POU3F2 and SOX2 expres- and hypothalamus in GTEx data (Fig. 2B; ref. 23), suggesting sions in LUSC tumors with low TP63 expression, and previously functional relevancy of Brn2 in this unique subset of LUSC. Of reported interaction of Brn2 and Sox2 in neural progenitor note, this "neural" subset of LUSC has very low expression of cells (33, 34), we further investigated the relationship between neuroendocrine markers (Supplementary Fig. S2A and S2B), Brn2 and Sox2 in LUSC. First, using immunofluorescence analysis suggesting this subtype is not necessarily closer to SCLC or we found that Sox2 and Brn2 colocalize in the nuclei of the Brn2- neuroendocrine tumors. To investigate how these tumors with positive LUSC cell lines (Fig. 4A; Supplementary Fig. S3A). In Brn2 expression are represented in clinical specimens, we exam- contrast, Sox2, but not Brn2, was detected in nuclei of the ined Brn2 expression pattern at protein level in human LUSC "classical" Sox2/p63-positive LUSC cells confirming the specific- tissues by IHC staining. Brn2 expression was found in 6 of 23 ity of the antibody (Supplementary Fig. S3B). Second, we specimens and all of them were positive for Sox2 (Fig. 2C; observed their robust protein–protein interaction by endogenous Supplementary Fig. S2C). Of note, the Brn2 expression was and reciprocal coimmunoprecipitations in the two "neural" LUSC relatively confined to subpopulations of tumor cells in these cell lines (Fig. 4B). Given their physical interaction, we hypoth- LUSC tissues. This suggests that some of the tumors with inter- esized that Brn2 and Sox2 cooccupy genomic loci to coordinately mediate POU3F2 expression level observed in bulk mRNA in the regulate gene expression in these Brn2-positive LUSC cells. To test TCGA-LUSC dataset may reflect heterogeneous tumor popula- for genomic cooccupancy of Brn2 and Sox2, we explored genome- tions with partial "neural" differentiation and that, perhaps wide binding profiles of Brn2 and Sox2 in the "neural" LUSC cells unlike cell lines, distinct lineage states can coexist within the by ChIP-seq and found that the Brn2-binding peaks substantially same LUSC tumor. This further implies trans-differentiation overlap with the peaks for Sox2 in these cells (Fig. 4C and D). In during the evolution of LUSCs. We next examined the correlation contrast, the Sox2 binding peaks from "classical" LUSC cells of expression of POU3F2 with SOX2 and TP63 expression in the showed distinct profiles with less overlaps with those for Brn2 TCGA-LUSC dataset. We found that expression of POU3F2 was in "neural" LUSC cells (Fig. 4C and D), suggesting that in the anticorrelated with that of TP63 (Fig. 2B; Supplementary absence of p63, Brn2 engages Sox2 to localize at genetic loci Fig. S2D), consistent with the data from LUSC cell lines in which thereby imparting "neural" features on LUSC cells. To investigate high expression of POU3F2 and TP63 were mutually exclusive. We which other factors are involved in collaboration of Sox2 and did not find significant correlation between POU3F2 and SOX2 Brn2 at genetic loci in the "neural" cells, we performed motif expression (Supplementary Fig. S2E) in the entire dataset; how- analysis for the Sox2/Brn2 peak-enriched cluster (Supplementary
www.aacrjournals.org Cancer Res; 79(24) December 15, 2019 OF7
Downloaded from cancerres.aacrjournals.org on September 27, 2021. © 2019 American Association for Cancer Research. Published OnlineFirst September 24, 2019; DOI: 10.1158/0008-5472.CAN-19-2132
Sato et al.
AB D TP63 Low r: 0.40
67 P: 0.002