Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Variations in HSPA1B at 6p21.3 are Associated with Lung Cancer Risk and

Prognosis in Chinese Populations

Huan Guo1, Qifei Deng1, Chen Wu2, Lingmin Hu3, Sheng Wei1, Ping Xu4, Dan Kuang1,

Li Liu5, Zhibin Hu3, Xiaoping Miao2, Hongbing Shen3, Dongxin Lin2, and Tangchun

Wu1

Authors' Affiliations: 1Institute of Occupational Medicine and Ministry of Education

Key Lab for Environment and Health, School of Public Health, Tongji Medical College,

Huazhong University of Science and Technology, Wuhan, China; 2Department of

Etiology and Carcinogenesis, Cancer Institute and Hospital, Chinese Academy of

Medical Sciences and Peking Union Medical College, Beijing, China; 3Department of

Epidemiology and Biostatistics, Cancer Center of Nanjing Medical University, Nanjing,

China; 4Department of Oncology, Wuhan Iron and Steel (Group) Corporation

Staff-Worker Hospital, Wuhan, China; and 5Cancer Center, Union Hospital, Huazhong

University of Science and Technology, Wuhan, China.

Corresponding Author: Tangchun Wu, Institute of Occupational Medicine, School of

Public Health, Tongji Medical College, HUST, 13 Hangkong Rd, Wuhan 430030,

Hubei, China. Phone: +86-27-83692347; Fax: +86-27-83692560; E-mail:

[email protected].

Grant Support

This study is supported by the National Basic Research Program grant

2011CB503800, the National High Technology Project grant 2009AA022705, and the

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

National Natural Scientific Foundation of China grants 30872092 and 30525031 to

T.W.

Running Title: HSPA1B Variants and lung cancer risk and survival

Keywords: Heat shock 70, genetic variation, lung cancer, risk, prognosis

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

Word count: 3, 913

Total number of figures and tables: 6

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Abstract The heat shock protein Hsp70 is crucial for regulating cellular homeostasis in stressed cells. While the tumorigenic potential and prognostic applications of Hsp70 have been widely investigated, it remains unclear whether genetic variations of the human isoforms HSPA1L, HSPA1A and HSPA1B are associated with cancer risk and prognosis. In this study, we genotyped 6 tagSNPs in these in 1,152 paired lung cancer patients and controls, and then validated the results in additional cohorts of 1,781 lung cancer patients and 1,038 controls. We also evaluated the associations of these tagSNPs with survival in 330 advanced non-small-cell lung cancer (NSCLC) patients with additional validation in another 331 advanced NSCLC patients. Functions of the risk variants identified were investigated using cell-based reporter assays. We found that the HSPA1B rs6457452T allele was associated with increased lung cancer risk compared with the rs6457452C allele in both datasets and also pooled analysis (adjusted odds ratio [OR] = 1.41, P = 2.8×10−5). The HSPA1B rs2763979TT genotype conferred poor survival outcomes for advanced NSCLC patients in two independent cohorts and pooled analysis (adjusted hazard ratio [HR] =1.80, 1.61, 1.66, P = 0.013, 0.036, 0.002, respectively). Lastly, we also found that the rs2763979T and rs6457452T alleles were each sufficient to reduce expression of transcriptional reporter constructs, when compared with the rs2763979C and rs6457452C alleles, respectively. Taken together, our findings define functional HSPA1B variants are associated with lung cancer risk and survival. These Hsp70 genetic variants may offer useful biomarkers to predict lung cancer risk and prognosis.

1

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Introduction Lung cancer is a leading cause of cancer-related deaths in the world (1). Although

tobacco smoking accounts for over 80% of all cases (2), only around 15% of all heavy

smokers develop lung cancer, suggesting that genetic factors may also play a crucial

role in susceptibility to the disease. To date, prognosis for patients with non-small-cell

lung carcinoma (NSCLC) remains poor, with an overall 5-year survival rate of only

10-15% (1). Despite intense efforts in the past decades, researchers have still not

identified specific biomarkers for lung cancer risk assessment and prognosis

prediction.

Published genome-wide association studies (GWAS) in Caucasians have mapped

a lung cancer susceptibility to 6p21.3, a -rich region of the

major histocompatibility complex (MHC) (3-5). The reported significant variants

(rs3117582, rs3131379, and rs4324798) at 6p21.3 do not exist in Asians according to

the HapMap project, and whether single nucleotide polymorphisms (SNPs) in this

region are related to lung cancer risk in Chinese populations is unknown.

The three heat-shock protein 70 (Hsp70) genes (HSPA1L, HSPA1A, and HSPA1B),

encoding three highly homologous HSP70 (Hsp70-1, Hsp70-2, and

Hsp70-hom, respectively), are located alongside within the MHC class-III region at

6p21.3, only approximately 50-kb away from rs3131379 (3). Heat-shock proteins

(HSPs), and Hsp70 in particular, are the major stress-inducible proteins that function as

key components in regulating cellular homeostasis, probably through their molecular

chaperone activities to inhibit accumulation of protein aggregates (6-7) or through their

cyto-protective properties to protect against DNA damage (8), promoting cell survival

under stress stimuli (9-10). Accumulating evidence indicates their potential

associations with tumorigenesis and clinical outcomes of cancer patients (11-12). A

case-control study nested in a large-scale Japanese cohort suggested that serum

2

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Hsp70 levels might be an important biomarker for lung cancer risk (13). Because of its

peptide chaperone capacity and its ability to bind to professional antigen-presenting

cells (APCs), the immunological properties of Hsp70 suggest its potential clinical

applications as personalized vaccines for human cancers (14).

Because of the vital role of Hsp70 in cancer development and survival progress

and the results from GWAS in Caucasian populations that identified risk alleles in

HSPA11L-HSPA1A-HSPA1B (3), we hypothesized that genetic variations in

HSPA1L-HSPA1A-HSPA1B may modulate Hsp70 expression levels and/or protein

structures and thus influence lung cancer susceptibility and prognosis in Chinese

populations.

Materials and Methods

Case-control study to identify risk variants

In this study, we performed three independent case-control analyses. The

discovery set included 1,152 lung cancer patients (recruited from Union Hospital

Cancer Center, Wuhan Steel Group/Corporation Staff-Worker Hospital and Wuhan

Zhongnan Hospital from January 2003 to February 2009) and 1,152 age (±5 years) and

sex frequency-matched controls (randomly selected from 4,073 individuals participating

in a community screening program in the same regions during the same period) (15).

The lung cancer patients and controls from northern China was used to validate the

identified risk variants (validation set), and it included an additional 1,781 lung cancer

patients (recruited at the Cancer Hospital, Chinese Academy of Medical Sciences

between July 2000 and November 2008) and 1,038 age (±5 years) and sex matched

controls (cancer-free individuals selected from a community nutritional survey of 6,450

individuals in the same region during the same period) (16). In addition, 500 newly

3

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

enrolled lung cancer patients (from Union Hospital Cancer Center and Wuhan Steel

Group/Corporation Staff-Worker Hospital) and 500 age (±5 years) and sex

frequency-matched controls were chosen for further fine-mapping analysis. These 500

control subjects were also randomly selected from the 4,073 individuals used for

selecting controls of the discovery set, and they were independent of the 1152 controls

used in the discovery set.

Follow-up study to identify prognostic variants

For the survival analysis, we followed up 450 primary lung cancer patients of

Wuhan Steel Group/Corporation Staff-Worker Hospital as the discovery set. These

patients were admitted to the Department of Oncology of Wuhan Iron and Steel

Group/Corporation Staff-Worker Hospital and had available sufficient follow-up data

(15). Patients enrolled in this hospital lived in the same region and had a similar

socio-economic status. Of the 450 patients, four (0.1%) patients did not have the TNM

stage classification. Among the remaining 446 patients with completed

follow-up/clinical information, 28 (6.3%) were SCLC patients and 418 (93.7%) were

NSCLC patients. There were 88 (21.1%) early NSCLC patients (stage I or II) and 330

(78.9%) advanced NSCLC patients (stage III or IV). Eight of the advanced NSCLC

patients were lost to follow-up. As we know, the survival outcomes of NSCLC lung

cancer patients can be affected by many factors, including genetic background,

treatments, and socio-economic status. In order to minimize the bias due to patient

selection, inconsistency of therapies and patient economic status among different

hospitals, we limited our survival analysis to the 330 advanced NSCLC patients (stage

III or IV) from Wuhan Steel Group/Corporation Staff-Worker Hospital.

Of them, 255 advanced NSCLC patients received chemotherapy, 84 (32.9%)

patients received a gemcitabine/cisplatin regimen, 99(38.8%) received a

4

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

vinorelbine/cisplatin regimen, 22 (8.6%) received a docetaxel/cisplatin regimen, 8

(3.1%) received an etoposide/cispatin regimen, and the remaining 42(16.5%) patients

received chemotherapy that was a combination of the above regimens.

Additional 353 advanced NSCLC patients from the Cancer Hospital of Jiangsu

Province and the First Affiliated Hospital of Nanjing Medical University were used as

the validation set (17). Of these 353 patients, only 331 patients with sufficient DNA

were evaluated. In this dataset, 300 patients received chemotherapy. Among these

patients, 142 (47.3%) received a gemcitabine/cisplatin regimen, 14 (4.7%) received a

vinorelbine/cisplatin regimen, 36 (12.0%) received a docetaxel/cisplatin regimen, 13

(4.3%) received an etoposide/cispatin regimen, and 95 (31.7%) received

chemotherapy that was a combination of the above regimens. The TNM stage

classification was evaluated by medical oncologists according to the Staging Manual of

AJCC/UICC (2003) (18). Patients were followed up by telephone calls every three

months. Survival time was calculated from the date when patients first received

confirmed diagnoses until the date of the last follow-up or death. Dates of death were

obtained from inpatient and outpatient records or from the patients’ families through

telephone follow-up. In the discovery set, the mean and median follow-up periods were

17.5 and 12.2 months, respectively, while the follow-up period ranged from 1.0 to 84.0

months. In the validation set, the mean and median follow-up periods were 19.6 and

16.0 months, respectively, while the follow-up periods ranged from 1.0 to 70.2 months.

All subjects in this study were unrelated ethnic Han Chinese and general

information about all the subjects was collected through personal interviews. Those

who had smoked an average of less than one cigarette per day and for less than one

year in their lifetimes were defined as nonsmokers; those who stopped smoking > 1

year previously were considered former smokers, and those who were still smoking in

5

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

the previous year were defined as current smokers. This study was approved by the

Institutional Review Board of Tongji Medical College, the Chinese Academy of Medical

Sciences Cancer Institute, and Nanjing Medical University.

TagSNPs selection

Haplotype-tagging SNPs (tagSNPs) of the HSPA1L-HSPA1A-HSPA1B gene

cluster on 6p21.3 (Chr6:31,885,375 to 31,906,010) in Chinese populations were

chosen based on Han Chinese Beijing (CHB) data in the HapMap project (HapMap

data Rel24/phaseII Nov08, on NCBI B36 assembly, dbSNP b126, minor allele

frequency (MAF ≥ 0.05), Hardy-Weinberg equilibrium (HWE) P value ≥ 0.05, and call

rate ≥ 90%) with Haploview 4.0 software on a block-by-block analysis (r2 threshold,

0.8). Linkage disequilibrium (LD) blocks were determined according to the method of

Gabriel et al. with default settings (19). By this method, we selected five tagSNPs,

rs2075800 (HSPA1L_+2763C>T), rs2227956 (HSPA1L_+2437A>G), rs1043618

(HSPA1A_+190G>C), rs2763979 (HSPA1B_-912C>T), and rs6457452

(HSPA1B_+39C>T). In addition, the rs1008438 (HSPA1A_-110A/C) identified by our

resequencing results was also selected (Supplementary Fig. S1) (20).

The risk polymorphisms [(rs3117582 (chr6:31728499) and rs3131379

(chr6:31829012)] reported by GWAS in Caucasians did not exist in Asians. Because

rs3117582 was located in the block from chr6:31722081 to 31733520, we further

carried out a fine-mapping analysis from this block to the HSPA1L-HSPA1A-HSPA1B

cluster. The same HapMap-CHB database and selection criteria described above were

used to select 29 tagSNPs at 6p21.3 (from 31722081 to 31906010) (Supplementary

Fig. S2).

Genotyping

Six tagSNPs located in the HSPA1L-HSPA1A-HSPA1B cluster were genotyped by

6

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

TaqMan methods, and the additional 24 tagSNPs at 6p21.3 were genotyped by the

MassArray system (Sequenom). However, genotyping of rs6905572 failed because no

appropriate primers could be designed for this SNP. In the 331 advanced NSCLC

patients in the validation set, six samples (1.8%) failed in genotyping of rs2763979 and

eight samples (2.4%) failed in genotyping of rs6457452 due to their lower DNA quantity.

As a result, 325 and 323 advanced NSCLC patients in the validation set were

successfully genotyped for HSPA1B gene rs2763979 and rs6457452, respectively. For

all tested samples, a 5% random sample was reciprocally tested, and the

reproducibility was 100%. All primers and probes used in this study are listed in

Supplementary Table S1.

Luciferase reporter assay

Referring to the transcription initiation site of the HSPA1B gene as “+1”, we

constructed six reporter plasmids based on the pGL3-Basic vector (Promega, Madison,

Wisconsin, USA), including four fragments encompassing from -1031 to +216

according to their haplotypes (rs2763979C/rs6457452C, rs2763979C/rs6457452T,

rs2763979T/rs6457452C and rs2763979T/rs6457452T, respectively) and two core

regulation regions from -456 to +216 containing either the rs6457452C or rs6457452T

allele (Fig. 2A). They were all inserted into Kpn I/Hind III enzyme sites of the

pGL3-Basic. Primer pairs designed for plasmid constructs and site-specific

mutagenesis are listed in Supplementary Table S1. The direction and sequence

authenticity of the six constructs were validated by restriction analysis and DNA

sequencing.

The human bronchial epithelia 16HBE cells were kindly provided by Dr. Yiguo

Jiang of Guangzhou Medical College, China; while the human lung carcinoma A549

cells and hepatoma HepG2 Cells were purchased from the China Center of Type

7

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Culture Collection in Wuhan, China. The 2×104 16HBE, HepG2, and A549 cells were

seeded into 96-well plates and co-transfected with 100ng reporter plasmids and 1ng

pRL-SV40 (Promega, Madison, WI) using Lipofectamine 2000 (Invitrogen, Carlsbad,

CA). The relative luciferase activity (RLA) was determined at 24-hours after

transfection as previously described (15, 20).

Statistical analysis

A two-phase screening was used to search for tagSNPs in the

HSPA1L-HSPA1A-HSPA1B cluster associated with lung cancer risk and survival (Fig.

1). Distributions of numerical data were examined by the 1-sample

Komogorov-Smirnov normality test. We used χ2 or Student's t-tests to examine the

HWE of SNPs, and the distributions of selected characteristics and genotypes between

patients and controls. Multivariate logistic regression models were used to estimate the

associations between SNPs and lung cancer risk after adjustment for age, sex and

pack-years smoked. The interactions of age, sex, and smoking habits with SNP

genotypes in the pooled case-control data were tested by using the Wald test in the

multivariate logistic regression models. The associations between overall survival time

and demographic and clinical characteristics were estimated using the Kaplan-Meier

method and Log-rank test. The effect modifications by these characteristics and the

effects of SNPs on death risk in patients with advanced NSCLC were assessed by

using the Wald test in the multivariate Cox proportional hazards regression models

after adjusting for the confounders. The proportional hazards assumption was

examined by testing interactions between the genotypes and time (all P values > 0.05).

The statistical software packages R and QVALUE were used to calculate false

discovery rate. Differences of RLA were compared using one-way analysis of variance

(ANOVA), Student-Newman-Keuls test, and Student’s t-tests. Two-sided P < 0.05 was

8

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

considered significant and SPSS12.0 software (SPSS Inc., Chicago, IL) was used for

all analyses.

Results

Associations between tagSNPs of the HSPA1L-HSPA1A-HSPA1B cluster and

lung cancer risk

Distributions of selected characteristics for subjects in the discovery and validation

sets and Fine-mapping analysis are listed in Table 1. As showed in Table 2, in the

discovery set, compared with the HSPA1B rs6457452CC genotype, subjects with the

rs6457452CT or combined rs6457452CT+TT genotypes had a 1.39-fold increased risk

of lung cancer (P = 0.013 and 0.010, respectively). There was a dose-response effect

of the increasing number of the rs6457452T alleles on an increased lung cancer risk

(Ptrend = 0.026). However, no associations were found between rs2075800, rs2227956,

rs1008438, rs1043618, and rs2763979 and lung cancer risk (Table 2).

In the validation set, logistic regression analysis indicated that compared with the

HSPA1B rs6457452CC genotype, subjects with rs6457452CT or rs6457452CT+TT

genotypes had a significantly increased risk of lung cancer (adjusted odds ratio [OR] =

1.34, 1.32, and P = 0.009, 0.012, respectively). Pooled analysis of the discovery and

validation sets showed that the adjusted ORs were 1.41 for the rs6457452CT genotype

(P = 3.0×10−5), and 1.41 for the rs6457452CT+TT genotype (P = 2.8×10−5), compared

with the rs6457452CC genotype, respectively (Table 2). We did not observe a

significant interaction of age at diagnosis, sex, and smoking habits with HSPA1B

rs6457452CT+TT genotypes on the risk of lung cancer in the pooled analysis (P =

0.231, 0.982, and 0.291, respectively).

Fine-mapping analysis on chromosome 6p21.3

9

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

A total of 28 tagSNPs at chr6:31722081 to 31906010 were successfully genotyped.

The rs3115672 was not detected in any subject. The allele distributions of rs3130617

and rs805923 deviated from HWE among controls (P < 0.05), and were excluded from

subsequent analysis. Analysis of the remaining 25 tagSNPs revealed that only the

rs6457452T allele on HSPA1B 5’UTR was significantly associated with an increased

lung cancer risk (P = 0.013, Table S2).

HSPA1L-HSPA1A-HSPA1B tagSNPs and survival of advanced NSCLC patients

The demographic and clinical characteristics of lung cancer patients in the survival

discovery and validation sets are presented in Table 3. In the discovery set, in which

the mean age was 63.25 ± 10.07 years, 257 (77.9%) patients died of lung cancer, 89

(27.0%) received surgical operations, 255 (77.3%) received chemotherapies, and 152

(46.1%) received radiotherapies. In the validation set, in which the mean age was

59.65 ± 9.57 years, 242 (73.1%) patients died of lung cancer, 152 (45.9%) received

surgical operations, 300 (90.6%) received chemotherapies, and 123 (37.2%) received

radiotherapies. The five year survival rates for the two sets were 8% and 10%,

respectively. The Kaplan-Meier analysis, Log-rank test, and univariate Cox analysis

revealed that younger patients, or patients with earlier stage, or who had surgical

operations and chemotherapy had a significantly longer median survival time (MST)

and a decreased death risk (P < 0.05 in Table 3). There were no significant effects of

sex, smoking status, and radiotherapy.

In the discovery set, the Kaplan-Meier method and Log-rank test showed that

patients with the rs1008438AC or rs1008438CC genotype had a significantly shorter

MST than patients with the rs1008438AA genotype (Log-rank P = 0.047, 0.009,

respectively) (Table 4). However, this SNP was not associated with death risk for

advanced NSCLC patients after adjustment for the confounders. Multivariate

10

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

proportional hazards regression models and Log-rank test revealed that, when

compared with the HSPA1B rs2763979CC genotype, the rs2763979CT and

rs2763979TT genotypes were associated with poor survival (adjusted HR = 1.43, 1.80

and P = 0.008, 0.013, respectively) and a shorter MST for advanced NSCLC patients

(Log-rank P = 0.011, 0.031, respectively); patients with the rs6457452CT+TT

genotypes had a 51% higher death risk than those with the rs6457452CC genotype (P

= 0.032).

The HSPA1B rs2763979C>T and rs6457452C>T polymorphisms were further

tested in the validation set. In this dataset, when compared with the rs2763979CC

genotype, advanced NSCLC patients with the rs2763979TT genotype had a 1.61-fold

increased death risk (P = 0.036). No significant association was found on the

rs6457452C>T polymorphism. Pooled analysis of the two cohorts showed that, when

compared with the rs2763979CC genotype, the rs2763979CT or rs2763979TT

genotype had a 1.21-fold or 1.66-fold increased death risk (P = 0.045 and 0.002,

respectively), while the rs2763979TT genotype was associated with a significantly

shorter MST (Table 4, Supplementary Fig. S3). We did not observe a significant

interaction of age at diagnosis, stage, surgical operations and chemotherapy with

HSPA1B 2763979CT+TT genotypes on the death risk among patients with advanced

NSCLC in the pooled analysis (P = 0.302, 0.229, 0.971, and 0.574, respectively).

Luciferase reporter activity for HSPA1B rs2763979C>T and rs6457452C>T

Among four constructed reporter plasmids with distinct haplotypes, the RLAs

driven by promoter containing rs2763979C/rs6457452T, rs2763979T/rs6457452C, and

rs2763979T/rs6457452T haplotype were significantly lower than that driven by

rs2763979C/rs6457452C haplotype (all P < 0.05); the RLA driven by promoter

containing rs2763979T/rs6457452T haplotype was significantly lower than that driven

11

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

by rs2763979C/rs6457452T or rs2763979T/rs6457452C haplotype (all P < 0.05, Fig.

2B); however, the RLA driven by promoter containing the rs2763979C/rs6457452T or

rs2763979T/rs6457452C haplotype showed no significant difference (P > 0.05). In two

reporter plasmids containing the rs6457452C or rs6457452T allele of HSPA1B

rs6457452C>T polymorphism, the rs6457452C allele was associated with significantly

higher RLA than rs6457452T allele in three cell lines (P < 0.05).

Discussion

To our knowledge, this study is the first to demonstrate that genetic variants of

HSPA1B are associated with lung cancer susceptibility and survival of advanced

NSCLC patients in Chinese populations. In this study, we found that the HSPA1B

rs6457452T allele was associated with an increased lung cancer risk in two

independent lung cancer case-control studies comprising 5,123 Chinese individuals.

Fine-mapping analysis confirmed that only this SNP within the surrounding 185-kb at

the previously identified chromosome 6p21.3 locus (chr6: 31722081 to 31906010) was

associated with lung cancer risk. In addition, we also observed that the HSPA1B

rs2763979TT genotype was associated with poor survival for advanced NSCLC

patients in two independent cohorts. Our further analysis of the biological functions for

rs2763979C>T and rs6457452C>T by luciferase reporter assay suggested that the

rs2763979T and rs6457452T alleles led to a significantly lower expression of Hsp70 in

both normal bronchial epithelial and in malignant cancer cells.

Hung et al. reported an association between rs4324798 at 6p21.3 and lung cancer

risk (P = 5.6×10-6) (5), but this association was not confirmed in a subsequent

meta-analysis (21). Wang et al. reported that two other SNPs at 6p21.3 were

significantly associated with lung cancer risk (rs3117582 and rs3131379) (3). However,

12

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

these three SNPs are not exist in Asian populations. In contrast, the published

Japanese and Korean GWAS of lung adenocarcinoma did not find 6p21.3 to be a

susceptible locus, possibly because the sample size at the discovery stage was

relatively small or because this locus does not contribute to carcinogenesis for this

specific histology (22-23). Our fine-mapping analysis of 25 tagSNPs from the block

containing rs3117582 to HSPA1L-HSPA1A-HSPA1B cluster found that only the

HSPA1B rs6457452C>T variant was associated with lung cancer risk. However, the

associations with other SNPs in the region investigated were not significant, which may

be due to the limited sample size. Since the LD region of 6p21.3 is extensive and

contains numerous transcripts, dissection and elucidation of the causal variant require

additional investigations with larger sample sizes and different ethnic groups.

The HSPA1B +1267A/G (Gln351Gln, rs1061581) polymorphism was reported to

be associated with susceptibility to nasopharyngeal carcinoma (24), non-Hodgkin

lymphoma (25), and breast carcinoma (25-26). Data from the Environmental Genome

Project showed that the HSPA1B rs1061581 was in linkage disequilibrium (LD) with

rs6457452 (D’ = 1.0). Because the rs1061581 is a silent polymorphism, it is likely to be

a surrogate marker for the rs6457452 in HSPA1B 5’UTR. In our study, the HSPA1B

rs6457452T allele reduced the Hsp70 synthesis levels of normal-status cultured cells.

Wang et al. previously found that the HSPA1A rs1043618C allele was associated with

increased lung cancer risk in 159 lung cancer patients and 202 controls in a Chinese

population, but no significant association was found for HSPA1B rs1061581 and

HSPA1L rs2227956 polymorphisms (27). Rusin et al reported that the 1541-1542delGT

heterozygous genotype of the constitutive heat shock cognate protein 70 (HSC70)

gene was associated with a significantly decreased risk for lung cancer in 96 paired

European lung cancer patients and controls (28). However, the sample sizes for the

13

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

above two studies were small and the results require further validation.

Polycyclic aryl hydrocarbons (PAHs) are the main potent procarcinogens in

tobacco smoke that can induce transformation of normal cells, causing malignancies in

experimental animals (29). Our previous studies demonstrated that Hsp70 could

protect human bronchial epithelia cells against DNA damage caused by

benzo[a]pyrene (30), and could also protect DNA from genotoxic damage introduced by

PAHs contained in coke-oven emissions (31). Since accumulation of damage at

different molecular and cellular levels is the underlying mechanism for lung

carcinogenesis (32), the increased DNA damage levels caused by reduced Hsp70

levels may enhance the process for genetic instability and promote cancer

development.

Cellular studies and animal models have extensively evaluated the functions of

Hsp70 in the carcinogenesis process and in cell proliferation and survival. For example,

high Hsp70 expression was correlated with poor prognosis of breast cancer and

endometrial cancer, but with better prognosis of esophageal, pancreatic, and renal

cancers (12). The contradictory prognostic implications of Hsp70 might be due to its

versatile properties. For example, elevated Hsp70 may confer resistance to

chemotherapy and apoptotic removal of tumor cells; Hsp70 can also participate in

tumor immunogenicity, because Hsp70 expressed on the surface of tumor cells can

elicit tumor cells specific immunological responses through chaperoning and

presentation of cancer-specific antigens (33-34). Tamura et al. found that immunization

of mice with lung carcinoma with cancer-derived Hsp70-peptide complex led to

retarded progression of primary cancer, a reduced metastasis and a prolongation of

life-span (35). In the present study, we can speculate that the low Hsp70 expression

levels driven by the rs2763979T alleles in lung cancer cells may contribute to a reduced

14

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

the immunogenic response, thus conferring unfavorable survival for NSCLC patients.

NSCLC patients are often diagnosed at an advanced stage and have a poor

prognosis. Individual heterogeneity is a major reason for distinct responses of

treatments and outcomes (36). Recently, using high-throughput expression and genetic

microarray platforms, molecular epidemiologic and pharmacogenetic studies have

been aimed at identifying molecular and clinical predictors for outcomes of lung cancer

patients with specific histology, TNM stage, or therapy strategy (37-39). Hsp70 has

become one of the most exciting anticancer molecular targets, and Hsp70-antigen

complex has proved effective in the treatment of rodent lung cancer in preclinical

studies (35) and is now undergoing clinical trials for treatment of human cancers (40).

Therefore, newly established molecular biomarkers as well as identifiable clinical and

pathological features in patients will help pave the road for individualized prevention

and therapy for lung cancer.

The present study has several strengths and limitations. First, a total of 5,123

Chinese subjects included in two independent panels and 661 advanced NSCLC

patients included in two case-only survival cohorts consistently confirmed the

associations of HSPA1B rs6457452C>T and rs2763979C>T polymorphisms with lung

cancer risk and survival respectively, which provided an improved statistical power and

reduced the probability of false positives. Second, the fine-mapping analysis provided

additional confirmation for the potential causal variant of rs6457452T allele. Finally, our

functional studies offered a biologically plausible explanation for the epidemiologic

findings. However, the fine-mapping analysis carried in our study was not deep enough.

A future more well-designed and comprehensive fine-mapping analysis on a longer

region surrounding the real hit of 6p21.3, which contains a large number of SNPs, is

need to investigate the real causal variant. Moreover, since all subjects enrolled in this

15

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

study were ethnic Han Chinese, the associations of these SNPs with lung cancer risk

and survival in other ethnic groups also needs further validation by larger prospective

studies.

In conclusion, this study provided evidence for the first time that the HSPA1B

5’UTR rs6457452T allele may contribute to an increased lung cancer risk, while the

HSPA1B promoter rs2763979TT genotype may confer poor survival outcomes for

advanced NSCLC patients in Chinese populations. Extensive functional evaluations

and additional population-based prospective studies with different ethnic groups and

well-designed clinical investigations are warranted to confirm and extend our findings.

Acknowledgements:

The authors thank all volunteers and medical assistants from cooperating hospitals.

They also appreciate the scientific editing of Professor Helen Neumann of University of

Connecticut and the helpful comments by Professor Frank Hu of the Harvard School of

Public Health and Professor Qingyi Wei of The University of Texas M.D. Anderson

Cancer Center.

16

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

References

1. Jemal A, Siegel R, Ward E, Hao Y, Xu J, Murray T, et al. Cancer statistics, 2008.

CA Cancer J Clin. 2008;58:71-96.

2. Ruano-Ravina A, Figueiras A, Barros-Dios JM. Lung cancer and related risk

factors: an update of the literature. Public Health. 2003;117:149-56.

3. Wang Y, Broderick P, Webb E, Wu X, Vijayakrishnan J, Matakidou A, et al.

Common 5p15.33 and 6p21.33 variants influence lung cancer risk. Nat Genet.

2008;40:1407-9.

4. Broderick P, Wang Y, Vijayakrishnan J, Matakidou A, Spitz MR, Eisen T, et al.

Deciphering the impact of common genetic variation on lung cancer risk: a

genome-wide association study. Cancer Res. 2009;69:6633-41.

5. Hung RJ, McKay JD, Gaborieau V, Boffetta P, Hashibe M, Zaridze D, et al. A

susceptibility locus for lung cancer maps to nicotinic acetylcholine receptor

subunit genes on 15q25. Nature. 2008;452:633-7.

6. Georgopoulos C, Welch WJ. Role of the major heat shock proteins as molecular

chaperones. Annu Rev Cell Biol. 1993;9:601-34.

7. Hartl FU, Hayer-Hartl M. Molecular chaperones in the cytosol: from nascent chain

to folded protein. Science. 2002;295:1852-8.

8. Kotoglou P, Kalaitzakis A, Vezyraki P, Tzavaras T, Michalis LK, Dantzer F, et al.

Hsp70 translocates to the nuclei and nucleoli, binds to XRCC1 and PARP-1, and

protects HeLa cells from single-strand DNA breaks. Cell Stress Chaperones.

2009;14:391-406.

9. Jaattela M. Heat shock proteins as cellular lifeguards. Ann Med. 1999;31:261-71.

10. Nylandsted J, Brand K, Jaattela M. Heat shock protein 70 is required for the

survival of cancer cells. Ann N Y Acad Sci. 2000;926:122-5.

17

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

11. Ciocca DR, Clark GM, Tandon AK, Fuqua SA, Welch WJ, McGuire WL. Heat

shock protein hsp70 in patients with axillary lymph node-negative breast cancer:

prognostic implications. J Natl Cancer Inst. 1993;85:570-4.

12. Ciocca DR, Calderwood SK. Heat shock proteins in cancer: diagnostic, prognostic,

predictive, and treatment implications. Cell Stress Chaperones. 2005;10:86-103.

13. Suzuki K, Ito Y, Wakai K, Kawado M, Hashimoto S, Seki N, et al. Serum heat

shock protein 70 levels and lung cancer risk: a case-control study nested in a

large cohort study. Cancer Epidemiol Biomarkers Prev. 2006;15:1733-7.

14. Castelli C, Rivoltini L, Rini F, Belli F, Testori A, Maio M, et al. Heat shock proteins:

biological functions and clinical application as personalized vaccines for human

cancer. Cancer Immunol Immunother. 2004;53:227-33.

15. Guo H, Bai Y, Xu P, Hu Z, Liu L, Wang F, et al. Functional promoter -1271G>C

variant of HSPB1 predicts lung cancer risk and survival. J Clin Oncol.

2010;28:1928-35.

16. Wu C, Hu Z, Yu D, Huang L, Jin G, Liang J, et al. Genetic variants on chromosome

15q25 associated with lung cancer risk in Chinese populations. Cancer Res.

2009;69:5065-72.

17. Hu Z, Chen J, Tian T, Zhou X, Gu H, Xu L, et al. Genetic variants of miRNA

sequences and non-small cell lung cancer survival. J Clin Invest.

2008;118:2600-8.

18. Sobin LH, Wittekind CH, editors. TNM classification of malignant tumors (ed 6th).

New York: Wiley-Liss; 2002.

19. Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, et al. The

structure of haplotype blocks in the . Science. 2002;296:2225-9.

20. He M, Guo H, Yang X, Zhang X, Zhou L, Cheng L, et al. Functional SNPs in

18

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

HSPA1A gene predict risk of coronary heart disease. PLoS One. 2009;4:e4851.

21. Truong T, Hung RJ, Amos CI, Wu X, Bickeboller H, Rosenberger A, et al.

Replication of lung cancer susceptibility loci at 15q25, 5p15, and

6p21: a pooled analysis from the International Lung Cancer Consortium. J Natl

Cancer Inst. 2010;102:959-71.

22. Hsiung CA, Lan Q, Hong YC, Chen CJ, Hosgood HD, Chang IS, et al. The

5p15.33 locus is associated with risk of lung adenocarcinoma in never-smoking

females in Asia. PLoS Genet. 2010;6:e1001051.

23. Miki D, Kubo M, Takahashi A, Yoon KA, Kim J, Lee GK, et al. Variation in TP63 is

associated with lung adenocarcinoma susceptibility in Japanese and Korean

populations. Nat Genet. 2010;42:893-6.

24. Jalbout M, Bouaouina N, Gargouri J, Corbex M, Ben Ahmed S, Chouchane L.

Polymorphism of the stress protein HSP70-2 gene is associated with the

susceptibility to the nasopharyngeal carcinoma. Cancer Lett. 2003;193:75-81.

25. Chouchane L, Ahmed SB, Baccouche S, Remadi S. Polymorphism in the tumor

necrosis factor-alpha promotor region and in the heat shock protein 70 genes

associated with malignant tumors. Cancer. 1997;80:1489-96.

26. Mestiri S, Bouaouina N, Ahmed SB, Khedhaier A, Jrad BB, Remadi S, et al.

Genetic variation in the tumor necrosis factor-alpha promoter region and in the

stress protein hsp70-2: susceptibility and prognostic implications in breast

carcinoma. Cancer. 2001;91:672-8.

27. Wang Y, Zhou F, Wu Y, Xu D, Li W, Liang S. The relationship between three heat

shock protein 70 gene polymorphisms and susceptibility to lung cancer. Clin

Chem Lab Med. 2010;48:1657-63.

28. Rusin M, Zientek H, Krzesniak M, Malusecka E, Zborek A, Krzyzowska-Gruca S,

19

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

et al. Intronic polymorphism (1541-1542delGT) of the constitutive heat shock

protein 70 gene has functional significance and shows evidence of association

with lung cancer risk. Mol Carcinog. 2004;39:155-63.

29. Heidelberger C. Chemical carcinogenesis. Annu Rev Biochem. 1975;44:79-121.

30. Yang J, Liu X, Niu P, Zou Y, Duan Y. Correlations and co-localizations of Hsp70

with XPA, XPG in human bronchial epithelia cells exposed to benzo[a]pyrene.

Toxicology. 2009;265:10-4.

31. Xiao C, Chen S, Li J, Hai T, Lu Q, Sun E, et al. Association of HSP70 and

genotoxic damage in lymphocytes of workers exposed to coke-oven emission.

Cell Stress Chaperones. 2002;7:396-402.

32. Finkel T, Serrano M, Blasco MA. The common biology of cancer and ageing.

Nature. 2007;448:767-74.

33. Udono H, Srivastava PK. Heat shock protein 70-associated peptides elicit specific

cancer immunity. J Exp Med. 1993;178:1391-6.

34. Basu S, Srivastava PK. Heat shock proteins: the fountainhead of innate and

adaptive immune responses. Cell Stress Chaperones. 2000;5:443-51.

35. Tamura Y, Peng P, Liu K, Daou M, Srivastava PK. Immunotherapy of tumors with

autologous tumor-derived heat shock protein preparations. Science.

1997;278:117-20.

36. Herbst RS, Heymach JV, Lippman SM. Lung cancer. N Engl J Med.

2008;359:1367-80.

37. Tomida S, Takeuchi T, Shimada Y, Arima C, Matsuo K, Mitsudomi T, et al.

Relapse-related molecular signature in lung adenocarcinomas identifies patients

with dismal prognosis. J Clin Oncol. 2009;27:2793-9.

38. Shedden K, Taylor JM, Enkemann SA, Tsao MS, Yeatman TJ, Gerald WL, et al.

20

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Gene expression-based survival prediction in lung adenocarcinoma: a multi-site,

blinded validation study. Nat Med. 2008;14:822-7.

39. Huang YT, Heist RS, Chirieac LR, Lin X, Skaug V, Zienolddiny S, et al.

Genome-wide analysis of survival in early-stage non-small-cell lung cancer. J Clin

Oncol. 2009;27:2660-7.

40. Calderwood SK, Ciocca DR. Heat shock proteins: stress proteins with Janus-like

properties in cancer. Int J Hyperthermia. 2008;24:31-9.

21

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research.

Downloaded from Author manuscriptshavebeenpeerreviewedandacceptedforpublicationbutnotyetedited.

Table 1. Distribution of selected characteristics among lung cancer patients and controls in three panels of Chinese populations Author ManuscriptPublishedOnlineFirstonOctober28,2011;DOI:10.1158/0008-5472.CAN-11-1409 Discovery Set Validation Set Fine-mapping Lung cancer Lung cancer Lung cancer Controls Controls Controls Variable patients patients patients (N=1,152) (N=1,038) (N=500) cancerres.aacrjournals.org (N=1,152) (N=1,781) (N=500) No. (%) No. (%) No. (%) No. (%) No. (%) No. (%) Age (year) ≤ 60 562(48.8) 598(51.9) 974(54.7) 578(55.7) 269(53.8) 269 > 60 590(51.2) 554(48.1) 807(45.3) 460(44.3) 231(46.2) 231 Sex Male 924(80.2) 951(82.6) 1288(72.3) 721(69.5) 386(77.2) 386

on September 26, 2021. © 2011American Association for Cancer Female 228(19.8) 201(17.4) 493(27.7) 317(30.5) 114(22.8) 114

Research. Smoking status Never 340(29.5) 525(45.6) 711(39.9) 644(62.0) 164(32.8) 242(48.4) Former smokers 343(29.8) 129(11.2) 399(22.4) 100(9.6) 161(32.2) 60(12.0) Current smokers 469(40.7) 498(43.2) 671(37.7) 294(28.3) 175(35.0) 198(39.6) Pack-years smoked 0 340(29.5) 525(45.6) 711(39.9) 644(62.0) 164(32.8) 242(48.4) ≤ 26 210(18.2) 314(27.3) 359(20.2) 207(19.9) 90(18.0) 134(26.8) > 26 602(52.3) 313(27.2) 711(39.9) 187(18.0) 246(49.2) 124(24.8) Histological type Adenocarcinoma 370(32.1) 866(48.6) 181(36.2) Squamous cell carcinoma 344(29.9) 678(38.1) 145(29.0) Small cell carcinoma 47( 4.1) 112(6.3) 35(7.0) Others* 391(33.9) 125(7.0) 139(27.8) *Others include large cell, bronchioalveolar, mixed cell, undifferentiated and pathologic not otherwise specified carcinomas.

1 Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Table 2. Genotype frequencies of HSPA1L-HSPA1A-HSPA1B polymorphisms among patients and controls and their associations with risk of lung cancer in Chinese populations Lung cancer Controls Gene Genotype OR(95% CI) * P * patients (n, %) (n, %) Discovery Set N=1,152 N=1,152 HSPA1L rs2075800 (+2763C>T) CC 464(40.3) 440(38.2) 1.00 (Reference) CT 524(45.5) 550(47.7) 0.87(0.72-1.04) 0.131 TT 164(14.2) 162(14.1) 0.96(0.73-1.25) 0.732 HSPA1L rs2227956 (+2437A>G) AA 674(58.5) 695(60.3) 1.00 (Reference) AG 412(35.8) 411(35.7) 0.99(0.83-1.19) 0.919 GG 66(5.7) 46(4.0) 1.49(0.99-2.25) 0.058 AG+GG 478(41.5) 457(39.7) 1.04(0.87-1.24) 0.665 HSPA1A rs1008438 (-110A>C) AA 420(36.5) 403(35.0) 1.00 (Reference) AC 530(46.0) 570(49.5) 0.92(0.76-1.12) 0.413 CC 202(17.5) 179(15.5) 1.09(0.84-1.40) 0.518 HSPA1A rs1043618 (+190G>C) GG 589(51.1) 564(49.0) 1.00 (Reference) GC 457(39.7) 486(42.2) 0.96(0.80-1.15) 0.669 CC 106(9.2) 102(8.9) 1.02(0.75-1.40) 0.880 HSPA1B rs2763979 (-912C>T) CC 601(52.2) 611(53.0) 1.00 (Reference) CT 457(39.7) 457(39.7) 1.00(0.83-1.20) 0.976 TT 94(8.2) 84(7.3) 1.18(0.85-1.64) 0.327 HSPA1B rs6457452 (+39C>T) CC 971(84.3) 1,007(87.4) 1.00 (Reference) CT 171(14.8) 139(12.1) 1.39(1.07-1.79) 0.013 TT 10(0.9) 6(0.5) 1.46(0.51-4.18) 0.480 CT+TT 181(15.7) 145(12.6) 1.39(1.08-1.78) 0.010 * Ptrend 0.026 Validation set N=1,781 N=1,038 HSPA1B rs6457452 (+39C>T) CC 1435(80.6) 879(84.7) 1.00 (Reference) CT 330(18.5) 151(14.5) 1.34(1.08-1.66) 0.009 TT 16(0.9) 8(0.8) 1.01(0.42-2.44) 0.977 CT+TT 346(19.4) 159(15.3) 1.32(1.06-1.63) 0.012 * Ptrend 0.018 Pooled analysis N=2,933 N=2,190 HSPA1B rs6457452 (+39C>T) CC 2,406(82.0) 1,886(86.1) 1.00 (Reference) CT 501(17.1) 290(13.2) 1.41(1.20-1.66) 3.0×10-5 TT 26(0.9) 14(0.6) 1.25(0.64-2.44) 0.518 CT+TT 527(18.0) 304(13.9) 1.41(1.20-1.65) 2.8×10-5 * -5 Ptrend 6.0×10 *Data were calculated by logistic regression, adjusted for age, sex, and pack-years smoked.

2

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Table 3. Demographic and clinical characteristics of lung cancer patients in the survival discovery and validation sets

Discovery Set (N=330) Validation Set (N=331) MST* Log-rank* Variables Lung cancer Deaths Lung cancer Deaths HR(95%CI) * † (month) P Patients (n=257) Patients (n=242) n (%) n (%) Age ≤ 65 172(52.1) 127 228(68.9) 163 17.9 1.00 (Reference) < 0.001 > 65 158(47.9) 130 103(31.1) 79 13.4 1.39(1.16-1.66) Sex Male 274(83.0) 213 247(74.6) 179 15.3 1.00 (Reference) 0.222 Female 56(17.0) 44 84(25.4) 63 19.9 0.88(0.71-1.08) Smoking Never 66(20.0) 51 122(36.9) 93 19.5 1.00 (Reference) Former smokers 160(48.5) 125 32(9.7) 24 14.2 0.073 1.29(1.03-1.63) Current smokers 104(31.5) 81 177(53.5) 125 16.5 1.07(0.86-1.32) Histology Adenocarcinoma 140(42.4) 106 212(64.0) 154 18.4 1.00 (Reference) SCC 93(28.2) 70 98(29.6) 72 16.5 < 0.001 0.94(0.76-1.15) Others‡ 97(29.4) 81 21(6.3) 16 11.2 1.55(1.23-1.96) Stage IIIA 76(23.0) 49 140(42.3) 101 19.6 1.00 (Reference) IIIB 85(25.8) 64 72(21.8) 46 18.0 < 0.001 1.14(0.89-1.46) IV 169(51.2) 144 119(36.0) 95 12.7 1.83(1.49-2.25) Surgery No 241(73.0) 188 179(54.1) 143 13.4 1.00 (Reference) < 0.001 Yes 89(27.0) 69 152(45.9) 99 21.0 0.58(0.48-0.70) Chemotherapy No 75(22.7) 58 31(9.4) 26 10.0 1.00 (Reference) < 0.001 Yes 255(77.3) 199 300(90.6) 216 17.8 0.63(0.50-0.80) Radiotherapy No 178(53.9) 135 208(62.8) 153 14.2 1.00 (Reference) 0.053 Yes 152(46.1) 122 123(37.2) 89 18.6 0.84(0.70-1.00) Abbreviations: MST, median survival time; SCC, squamous cell carcinoma; HR, hazard ratio *Data are for the combined discovery and validation sets. †Data was calculated by univariate cox regression analysis. ‡Others include large cell, bronchioalveolar, mixed cell, undifferentiated and pathologic not otherwise specified carcinomas.

3

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Table 4. Associations between HSPA1L-HSPA1A-HSPA1B genotypes and survival of advanced NSCLC patients Hazard Ratio Lung cancer MST Log-rank Gene SNPs P* Death (95% CI)* patients (month) P Discovery set N=330 n=257 HSPA1L rs2075800 (+2763C>T) CC 1.00 (Reference) 140(42.4) 111 11.7 CT 0.79(0.61-1.04) 0.093 149(45.2) 113 15.0 0.124 TT 0.87(0.57-1.31) 0.495 41(12.4) 33 20.2 0.153 HSPA1L rs2227956 (+2437A>G) AA 1.00 (Reference) 185(56.1) 144 14.2 AG 1.08(0.83-1.40) 0.586 117(35.5) 94 12.9 0.676 GG 0.75(0.46-1.23) 0.257 28(8.5) 19 22.8 0.130 HSPA1A rs1008438 (-110A>C) AA 1.00 (Reference) 118(35.8) 84 18.0 AC 1.15(0.86-1.54) 0.345 155(47.0) 126 13.4 0.047 CC 1.37(0.95-1.98) 0.092 57(17.3) 47 11.2 0.009 HSPA1A rs1043618 (+190G>C) GG 1.00 (Reference) 167(50.6) 123 17.0 GC 1.13(0.87-1.48) 0.363 139(42.1) 115 13.3 0.013 CC 1.11(0.67-1.84) 0.685 24(7.3) 19 11.7 0.237 HSPA1B rs2763979 (-912C>T) CC 1.00 (Reference) 183(55.5) 132 16.5 CT 1.43(1.10-1.87) 0.008 120(36.4) 103 12.2 0.011 TT 1.80(1.13-2.85) 0.013 27(8.2) 22 12.7 0.031 HSPA1B rs6457452 (+39C>T) CC 1.00 (Reference) 291(88.2) 226 14.5 CT+TT 1.51(1.04-2.21) 0.032 39(11.8) 31 10.7 0.025 Validation set HSPA1B rs2763979 N=325 n=239 (-912C>T) CC 1.00 (Reference) 142(43.7) 107 18.6 CT 1.00(0.75-1.32) 0.982 150(46.2) 106 16.8 0.948 TT 1.61(1.03-2.52) 0.036 33(10.2) 26 15.0 0.270 HSPA1B rs6457452 N=323 n=236 (+39C>T) CC 1.00 (Reference) 256(79.3) 192 17.3 CT+TT 0.79(0.56-1.10) 0.161 67(20.7) 44 21.4 0.133 Pooled analysis HSPA1B rs2763979 N=655 n=496 (-912C>T) CC 1.00 (Reference) 325(49.6) 239 17.8 CT 1.21(1.00-1.46) 0.045 270(41.2) 209 15.0 0.128 TT 1.66(1.21-2.27) 0.002 60(9.2) 48 14.2 0.026 Abbreviations: MST, median survival time. Note: The frequency of rs6457452TT genotype was 0.6% (2/322) and 2.2% (7/313) in the discovery and validation sets respectively, so we combined rs6457452CT with rs6457452TT for analyses. *The Cox regression analysis was adjusted for age, sex, pack-years smoked, histology, TNM stage, surgical operation, chemotherapy, and radiotherapy status.

4

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

FIGURE LEGENDS

Figure 1. Patients and study strategy

Figure 2. Luciferase reporter assays for the HSPA1B rs2763979C>T and

rs6457452C>T polymorphisms. (A) Schematic representation of HSPA1B gene and

six reporter gene constructs. (B) Luciferase expression of these constructs. The six

reporter constructs and empty pGL3-Basic were transfected into 16HBE, A549, and

HepG2 cells. All constructs were co-transfected with pRL-SV40 to standardize the

transfection efficiency. Fold increase of luciferase activity was measured by defining

the activity of the empty pGL3-Basic vector as 1. Mean ± SD of RLA values were from

six independent transfection experiments, each done in triplicate.

5

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Guo et al. Figure 1 Downloaded from

HSPA1L-HSPA1A-HSPA1B Author manuscriptshavebeenpeerreviewedandacceptedforpublicationbutnotyetedited. Author ManuscriptPublishedOnlineFirstonOctober28,2011;DOI:10.1158/0008-5472.CAN-11-1409

Case-only Case-Control study survival analysis 6 tagSNPs cancerres.aacrjournals.org HSPA1L: rs2075800, rs2227956 HSPA1A: rs1008438, rs1043618 HSPA1B: rs2763979, rs6457452 Discovery Set 1,152 patients 330 advanced 1,152 controls NSCLC patients Risk SNP Risk SNPs HSPA1B HSPA1B on September 26, 2021. © 2011American Association for Cancer rs6457452 rs2763979, rs6457452 Research. Validate Validate

Validation Set 1,781 patients 331 advanced 1,038 controls NSCLC patients

Risk SNP Risk SNP HSPA1B: rs6457452 HSPA1B: rs2763979

Fine-Mapping analysis: Genotyped 29 tagSNPs Risk SNP within surrounding 185-kb Function analyses region at 6p21.3 500 patients 500 controls Luciferase reporter assay Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Guo et al. Figure 2

A

-1031 -456 +1 +216 HSPA1B Gene Promoter 5’-UTR coding region

pGL3-Basic -912C +39C pGL3 constructs p-C-C -912C +39T p-C-T -912T +39C p-T-C -912T +39T p-T-T +39C p-rs6457452C +39T p-rs6457452T

B

30.0 P < 0. 001 16HBE P = 0.002 A549 P = 0.002 25.0 HepG2

P < 0.001 P < 0.001 P = 0.001 P = 0.001 20. 0 P = 0.006 P = 0.003

15.0 rbitary Units rbitary A A 10.0

5.0

0.0 p-C-C p-C-T p-T-C p-T-T p-rs6457452C p-rs6457452T

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research. Author Manuscript Published OnlineFirst on October 28, 2011; DOI: 10.1158/0008-5472.CAN-11-1409 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited.

Variations in HSPA1B at 6p21.3 are Associated with Lung Cancer Risk and Prognosis in Chinese Populations

Huan Guo, Qifei Deng, Chen Wu, et al.

Cancer Res Published OnlineFirst October 28, 2011.

Updated version Access the most recent version of this article at: doi:10.1158/0008-5472.CAN-11-1409

Supplementary Access the most recent supplemental material at: Material http://cancerres.aacrjournals.org/content/suppl/2011/10/28/0008-5472.CAN-11-1409.DC1

Author Author manuscripts have been peer reviewed and accepted for publication but have not yet been Manuscript edited.

E-mail alerts Sign up to receive free email-alerts related to this article or journal.

Reprints and To order reprints of this article or to subscribe to the journal, contact the AACR Publications Subscriptions Department at [email protected].

Permissions To request permission to re-use all or part of this article, use this link http://cancerres.aacrjournals.org/content/early/2011/10/27/0008-5472.CAN-11-1409. Click on "Request Permissions" which will take you to the Copyright Clearance Center's (CCC) Rightslink site.

Downloaded from cancerres.aacrjournals.org on September 26, 2021. © 2011 American Association for Cancer Research.