Article

pubs.acs.org/jpr

Proteome Atlas of Human 8 and Its Multiple 8p Deficiencies in Tumorigenesis of the Stomach, Colon, and Liver † ○ † ○ ‡ § ○ ∥ ○ † † † Yang Zhang, , Guoquan Yan, , Linhui Zhai, , , Shaohang Xu, , Huali Shen, Jun Yao, Feifei Wu, † # † † † ‡ § ‡ § Liqi Xie, Hailin Tang, Hongxiu Yu, Mingqi Liu, Pengyuan Yang, Ping Xu, , Chengpu Zhang, , ‡ § ‡ § ‡ § ‡ § ‡ § ∥ ⊥ ∥ Liwei Li, , Cheng Chang, , Ning Li, , Songfeng Wu, , Yunping Zhu, , Quanhui Wang, , Bo Wen, ∥ ∥ ∥ □ † ∥ ⊥ ‡ § Liang Lin, Yinzhu Wang, Guiyan Zheng, Lanping Zhou, Haojie Lu,*, Siqi Liu,*, , Fuchu He,*, , † and Fan Zhong*, † Institutes of Biomedical Sciences and Department of Chemistry, Fudan University, Shanghai 200032, China ‡ State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, Beijing 102206, China § National Engineering Research Center for Drugs, Beijing 102206, China ∥ BGI-Shenzhen, Shenzhen 518083, China ⊥ Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, China # College of Mechanical & Electronic Engineering and Automatization, National University of Defense Technology, Changsha 410073, China □ State Key Laboratory of Molecular Oncology, Cancer Institute & Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100021, China

*S Supporting Information

ABSTRACT: , a medium-length euchromatic unit in humans that has an extraordinarily high mutation rate, can be detected not only in evolution but also in multiple mutant diseases, such as tumorigenesis, and further invasion/ metastasis. The Chromosome-Centric Human Proteome Project of China systematically profiles the proteomes of three digestive organs (i.e., stomach, colon, and liver) and their corresponding carcinoma tissues/cell lines according to a chromosome organizational roadmap. By rigorous standards, we have identified 271 (38.7%), 330 (47.1%), and 325 (46.4%) of 701 chromosome 8-coded from stomach, colon, and liver samples, respectively, in Swiss-Prot and observed a total coverage rate of up to 58.9% by 413 identified proteins. Using large-scale label-free proteome quantitation, we also found some 8p deficiencies, such as the presence of 8p21−p23 in tumorigenesis of the above-described digestive organs, which is in good agreement with previous reports. To our best knowledge, this is the first study to have verified these 8p deficiencies at the proteome level, complementing genome and transcriptome data. KEYWORDS: chromosome 8, proteome, 8p deletion, tumorigenesis, liver, colon, stomach

■ INTRODUCTION Chromosome 8 is a medium-length euchromatic unit in humans that has an extraordinarily high mutation rate by The Chromosome-Centric Human Proteome Project (C-HPP) 3,4 5 6,7 fi positive selection. DEF and MCPH1 are widely known has received considerable attention for its signi cance in rapidly evolving clusters in 8p. Telomere shortening, understanding gene function and structure in terms of especially in chromosome 8, appears as a mechanism fostering chromosome proteins.1,2 The International the development of chromosomal instability during aging and Sequencing Consortium recently completed a sequence of the chronic disease.8 This relatively high genomic instability of human genome including chromosome 8. It has reported a chromosome 8 is found not only in evolution but also in manually curated gene catalog, containing 793 gene loci and 301 pseudogene loci, including all previously known on Special Issue: Chromosome-centric Human Proteome Project 3 chromosome 8 and giving 701 nonredundant proteins in Received: August 31, 2012 Swiss-Prot (Version 2012-7-11).

© XXXX American Chemical Society A dx.doi.org/10.1021/pr300834r | J. Proteome Res. XXXX, XXX, XXX−XXX Journal of Proteome Research Article

Figure 1. Chromosome 8 proteome atlas with data on the quantification (A) and identification (B) of gastric (green text and frame), colorectal (blue text and frame), and hepatic (red text and frame) systems. (A) The colors denote the abundance of the protein expression data. The median − normalized protein data were transformed by log2 and rescaled into a [ 1,1] region. The protein data above the median are shown in red, whereas those below the median are shown in blue. The color legend is provided at the top portion. (B) The protein coding genes are shown in blue. multiple mutant diseases, such as tumorigenesis, and further information, is more robust for LTQ data. More details on invasion/metastasis. Chromosome 8 and its 8p deletion have proteome quantification are provided in the related article. The been studied and determined to be associated with mass spectrum results detected rather high incidences of hepatocellular carcinoma (HCC) metastasis for many years.9 missing values (without identification in some replicates). Use As part of the Chinese C-HPP consortium, 1, of each proteomic experimental replicate as input column 8, and 20 have been selected to systematically profile the would disrupt further Gene Set Enrichment Analysis (GSEA) proteomes of three digestive organs, namely, stomach, colon, processing. To solve this problem, we calculated the average of and liver, as well as their corresponding carcinoma tissues/cell SIN or LFQuant values for each sample as representation, rather lines. From the perspective of pathophysiological significance in than deal with each fluctuant experimental replicate in further China and worldwide, digestive cancers, such as gastric cancer analysis. For comparative analysis of liver samples, we further (GC), colorectal cancer (CRC), and HCC, are among the most normalized SIN and LFQuant values by dividing their medians. frequently reported cancers and are characterized by metastatic Biological Category Statistics and Enrichment Analysis potential and poor outcomes. This digestive group includes some of the most critical cancers (among them are those “Data set Files” and “Analyses” under “Human Genes ranked second to fourth in cancer-related mortality) and, Chromosomal Location” in Ingenuity Pathway Analysis (IPA, despite all sustained efforts, maintains a profile of low survival Ingenuity Systems; www.ingenuity.com) Version 14197757 rates and lacks successful therapies.10 Herein, we report the were used for cell, tissue, and organ (CTO) and cell line proteome of chromosome 8 based on measured proteins from distribution statistics as well as biological category enrichment samples of selected digestive tissues/cells and summarize the analysis. total nonredundant proteins reported so far for the said Enrichment analysis of 288 missing chromosome 8-coded proteome. We also discuss the connections of measured 8p proteins was carried out using the web-accessible Database for deletions with these three digestive cancers. Annotation, Visualization, and Integrated Discovery (DAVID) tool.14 DAVID can recognize the UniProt AC from data sets. ■ MATERIALS AND METHODS Medium classification stringency and default items were chosen for enrichment calculation. Proteome Identification and Quantification GSEA Proteomes from 18 samples, namely, liver tissue11 Hep3B, SNU398, SNU449, SNU475, MHCC97L (97L), MHCC97H GSEA15 was performed to find down-regulated enriched (97H), HCCLM3 (LM3), HCCLM6 (LM6), colon tissue, cytobands in chromosome 8. The data set was loaded from CRC tissue, SW480, HCT116, stomach tissue, GC tissue, AGS, HUGO Committee-approved gene BGC823, and SGC7901, were identified or researched as symbols16 and analyzed using the Java GSEA package.17 We described in a companion article (DOI: 10.1021/pr3008286). scanned (Signal2Noise metric, weighted scoring, 1,000 Protein abundance in the human liver data set and that in 17 phenotype permutations) the cytoband-organized c1 gene fi 12 13 other samples were quanti ed by SIN and by LFQuant, sets of Molecular Signature Database v3.0. As GSEA requires respectively. Both are label-free proteome quantification at least three (sample) columns for each phenotype group, we methods, and the SIN method, which does not use XIC replicated stomach, colon, and liver proteome data to appear

B dx.doi.org/10.1021/pr300834r | J. Proteome Res. XXXX, XXX, XXX−XXX Journal of Proteome Research Article

Figure 2. Overlapping status of samples in total and chromosome 8-coded protein identifications. (A−C) Unique (diagonal, red text) and overlapped identification scales within gastric (A), colorectal (B), and hepatic (C) systems. Numbers of chromosome 8-coded proteins are given in brackets. (D−K) Venn diagrams comparing total proteins (D−F) and chromosome 8-coded proteins (G−I) identified within the three digestive systems and among three normal digestive organs (J, K). (L, M) Venn diagrams comparing total proteins (L) and chromosome 8-coded proteins (M) from the present study, PeptideAtlas, and MaxQB. thrice as well as HCT116 and SW480 proteome data to appear respectively, using rigorous standards. The proteomes from twice for comparison. The gene sets with normalized these samples provided 413 identified proteins in total and gave enrichment scores lower than −1.00 were enriched in a a total coverage of up to 58.9%. The total identification down-regulated direction. coverage of chromosome 8-coded proteins in other 5 databases were Ensembl (gene level, v.69) 402/696 = 57.8%, neXtProt ■ RESULTS (v.2012-10-16) 413/700 = 59.0% (Table S2 in Supporting Information), GPMdb (green, v.2012-10-18) 355/472 = 75.2%, Proteomes of the Three Digestive Organs Peptide Atlas (1% FDR at protein level, v.2012-07) 333/385 = All proteome data sets for liver, stomach, and colon samples 86.5%, and Human Protein Atla (v.10.0) 314/467 = 67.2%. have been thoroughly discussed elsewhere for the proteome of These findings suggest that selections of any CTOs (Table S3 chromosome 1 (see the companion article, DOI: 10.1021/ in Supporting Information) and cell lines (Table S4 in pr3008286). In brief, we identified 271 (38.7%), 330 (47.1%), Supporting Information) seldom cover more than 50% of and 325 (46.4%) of 701 chromosome 8-coded proteins in genes or those with expression bias in each chromosome. When Swiss-Prot (Version 2012-7-11) from stomach, colon, and liver the sample was limited to merely normal or paracancer tissues, samples (Figure 1 and Table S1 in Supporting Information), these identification scales decreased to 112 (16.0%, stomach),

C dx.doi.org/10.1021/pr300834r | J. Proteome Res. XXXX, XXX, XXX−XXX Journal of Proteome Research Article

Figure 3. 8p deficiencies in the three digestive organs by cancer versus normal pairing. (A) GSEA normalized enrichment score (NES)of chromosome 8 cytobands in cancer versus normal comparisons. The negative value indicates down-regulation, and |NES| > 1.00 can be significant. (B) Heatmaps of protein abundance located on 8p21−p23 in each sample are displayed based on the protein quantification values, leaving those unidentified as blank blocks. The color legend is shown on the right side of the heatmaps.

303 (43.2%, colon), and 238 (34.0%, liver). The 288 missing detected in the colon (Figure 2K). We also compared our data proteins of chromosome 8-coded proteins were found to be set with published data sets for the whole proteome (Figure mostly enriched in the “disulfide bond, signal peptide, secreted, 2L) and the chromosome 8-coded proteome (Figure 2M). and glycoprotein” cluster (enrichment score = 3.45, p = 1.3 × Aebersold et al.18 created a PeptideAtlas data set including 406 − − 10 5, Benjamini = 7.7 × 10 3) and “defensin, antimicrobial, and proteins of chromosome 8 among 12,173 proteins collected defense response” cluster (enrichment score = 2.68, p = 9.0 × from the literature. In addition, a MaxQB data set with 313 − 10 4, Benjamini = 0.57) according to DAVID. proteins of chromosome 8 among 10,463 proteins measured IPA preanalysis showed no major expression bias of from 11 cell lines was established by Mann et al.19 Altogether, chromosomes among different CTOs (Table S3 in Supporting these three data sets provide 487 nonredundant proteins and an Information) and cell lines (Table S4 in Supporting increased coverage of 69.5% for the chromosome 8-coded Information) at the transcriptome level, thus answering the proteome. intriguing question of whether any expression bias exists as we 8p Deficiencies in Tumorigenesis select a sample for C-HPP profiling. These results also confirm that chromosome 8 is distinctly typical in character, being very We performed GSEA based on quantified proteome data and close to the genome median for each characteristic in terms of found that multiple significantly down-regulated enriched length, gene content, repeat content, and segmentally cytobands existed in chromosome 8, especially 8p21−p23 of duplicated fraction,3 although some chromosomes apparently all three digestive organs (Figure 3A). Detailed heatmaps with exhibited extreme characteristics. There were 2 insufficient cancer versus normal pairing showed that several 8p21−p23- protein identified cytobands that 8p23.1 (52/288) and 8q24.3 coded proteins (Figure 3B) were clearly down-regulated or (39/288), and the proteome quantification distribution among absent and occurred in other cytobands of chromosome 8 the 18 samples and their chromosome 8-coded subsets did not (Figure S2 in Supporting Information). We determined that significantly differ (Figure S1 in Supporting Information). chromosome 8 exhibits the highest enrichment of breast cancer − We observed that the overlap of proteomes was high genes (p = 1.68 × 10 10) and orthologs to (mouse) mammary − between cancer and normal/paracancer tissues of the same tumor genes (p = 2.23 × 10 10) (Table S5 in Supporting organ and among cancer cell lines from the same organ but Information), as well as the highest coverage (32.0%) of relatively low between tissues and cell lines (Figure 2A−F). A tumorigenesis genes among all chromosomes (Table S6 in similar status was observed in chromosome 8-coded proteomes Supporting Information). Our proteome results for chromo- (Figure 2G−I). Not surprisingly, the proteome of the stomach some 8 seem to be in good agreement with the findings for the was almost inclusive of that of the colon (3,254/3,375), mainly genome. due to the similar compositions of these two organs from Particularly, the proteins of some reported HCC, CRC, and gastrointestinal epithelia (Figure 2J). Only two chromosome 8- GC suppressor genes in 8p, namely, CCDC25 (8p21.1),20 coded proteins were identified in the stomach, namely, IDO1 DLC1 (8p21.3−p22),20,21 EPHX2 (8p12−p21),20,22 LZTS1 (8p11−p12) and NACAP1 (8q22.3), but these were not (8p22),23 NRG1 (8p12−p21),24 PCM1 (8p22),25 PROSC

D dx.doi.org/10.1021/pr300834r | J. Proteome Res. XXXX, XXX, XXX−XXX Journal of Proteome Research Article

Figure 4. Protein abundances located on 8p of HCC, CRC, and GC suppressor genes were down-regulated or deficient in the three carcinomatous digestive organs.

(8p11.2),20 SCARA5 (8p21.1),26 SH2D4A (8p21.2),20,27 the missing 214 proteins in chromosome 8 could be cell- SORBS3 (8pter−p23.3),20,28 TUSC3 (8p22),29 WRN specific proteins in cells other than those we have explored. (8p11.2−p12),24 and ZNF703 (8p12),24 were significantly Although each cell type might express a few tens of cell-specific down-regulated or deficient in the three carcinomatous proteins only, the total number of cell-specific proteins could be digestive organs (Figure 4). Some proteins of HCC and CRC accumulated as the amount of measured cell type would suppressor genes located in other cytobands were also found to increase. As such, we strongly suggest that more types of CTOs be down-regulated/deficient, including BIRC2 (11q22);30 CA1 and cell lines be measured to observe all 701 possibly expressed (8q13-q22.1), CA2 (8q22), and CA13 (8q21.2);31 CDH1 proteins in chromosome 8, because the numbers of cell types in (16q22.1); TCF4 (18q21.2);32 TP53 (17p13); and YAP1 the present and previous studies are still considerably lower (11q13)30 (Figure S3 in Supporting Information). than the minimum possible of 230 for all cell types. We investigated some 8p deficiencies, such as the presence of − ■ DISCUSSION 8p21 p23 in tumorigenesis of the stomach, colon, and liver. 8p11.21, 8p21.3, and 8p22−p23 losses and 8q24 gain Using the latest instruments and methodology in this study, we frequently occurred in HCC tumorigenesis and its meta- fi 9,34−37 have determined that the identi ed proteins in each sample stasis. 8p deletions/LOH including 8p21, 8p22−23.1, from the stomach, colon, and liver are less than or close to only and 8q24 gains also promoted CRC and its meta- − half of the proteome (701) of chromosome 8. We thus stasis.24,32,38 41 The 8p deletion/LOH is also a frequent − speculated that the average of half of a chromosome proteome event in GC.42 45 8p contains many tumor suppressor genes might be the upper limit for expressed proteins. Aebersold et al. and is essential for cancer progression and metastasis.45,46 In provided a quantitative description of the proteome of a addition to HCC, CRC, and GC, 8p deletions can be found in commonly used human cell line and demonstrated that the lung cancer,47 larynx cancer,48 and renal cancer.49 8p LOH also − human cultured cells express at least approximately 10,000 occurs in bladder cancer,50,51 breast cancer,45 49,52 B cell 33 − proteins. This number is close to half of the total number of lymphoma,53 prostate cancer,54 56 and head and neck the protein-coded genome (∼20,344), which might explain why squamous cell carcinoma.57,58 Particularly, loss of chromosome only approximately half of or fewer proteins on average can be 8p associated with poor outcome (reduced survival time) and found in chromosome 8 for a cell or several cells. Mann et al. deletion of a cluster of six genes on 8p (CCDC25, DLC1, ELP3, identified 313 proteins belonging to chromosome 8 from PROSC, SH2D4A, and SORBS3) among the 10-gene signature 10,463 proteins in 11 cell lines, comparable with our 325 in HCC from patients with poor outcomes have been proteins from 8 cell lines and tissues from the liver (Figure 1I). reported.9,34 Thus, the deletion of chromosome 8p (e.g., PeptideAtlas has summarized 406 proteins of chromosome 8 8p23) might contribute to the development of HCC among 12,173 proteins from literature data sources (cells and metastasis.9 tissues) to date, and the present work study increases this The 8p deletion/LOH in chromosome 8 can be attributed to number to 487 from a total of 16,079 proteins for the proteome its genome instability. The chromosome has a vast region of of chromosome 8 (Figure 1L and M). Therefore, we feel that approximately 15 megabases on distal 8p with a strikingly high

E dx.doi.org/10.1021/pr300834r | J. Proteome Res. XXXX, XXX, XXX−XXX Journal of Proteome Research Article mutation rate, perhaps due to its high mutation rates in the ■ REFERENCES human genome, proximity to telomeres, high recombination (1) Paik, Y. K.; Jeong, S. K.; Omenn, G. S.; Uhlen, M.; Hanash, S.; rate, and high A+T content in chromosome 8, although this 3 Cho, S. Y.; Lee, H. J.; Na, K.; Choi, E. Y.; Yan, F. The Chromosome- biological basis remains unclear. This 8p deletion/LOH can be Centric Human Proteome Project for cataloging proteins encoded in attributed to the telomere shortening correlated significantly the genome. Nat. Biotechnol. 2012, 30 (3), 221−223. with an increasing copy number of chromosome 8 at the (2) Paik, Y. K.; Omenn, G. S.; Uhlen, M.; Hanash, S.; Marko-Varga, cellular level and linked to a specific genetic alteration G.; Aebersold, R.; Bairoch, A.; Yamamoto, T.; Legrain, P.; Lee, H. J. characteristic of human HCC based on an analysis of Standard guidelines for the chromosome-centric Human Proteome hepatocellular telomere fluorescent intensity and copy number Project. J. Proteome Res 2012, 11 (4), 2005−2013. of chromosome 8.8 In addition, we also doubt that dispersed (3) Nusbaum, C.; Mikkelsen, T. S.; Zody, M. C.; Asakawa, S.; proteins from multiple cells in a sample will be mixed together Taudien, S.; Garber, M.; Kodira, C. D.; Schueler, M. G.; Shimizu, A.; and weaken the 8p deletion in the proteome level. This may Whittaker, C. A.; Chang, J. L.; Cuomo, C. A.; Dewar, K.; FitzGerald, M. G.; Yang, X.; Allen, N. R.; Anderson, S.; Asakawa, T.; cause some proteins to exhibit down-regulation but not Blechschmidt, K.; Bloom, T.; Borowsky, M. L.; Butler, J.; Cook, A.; complete absence in carcinomatous samples. Nevertheless, we Corum, B.; DeArellano, K.; DeCaprio, D.; Dooley, K. T.; Dorris, L., could conclude that a number of genes in the region of high 3rd; Engels, R.; Glockner, G.; Hafez, N.; Hagopian, D. S.; Hall, J. L.; divergence in distal 8p are essential to the development and Ishikawa, S. K.; Jaffe, D. B.; Kamat, A.; Kudoh, J.; Lehmann, R.; metastasis of human digestion cancers. Lokitsang, T.; Macdonald, P.; Major, J. E.; Matthews, C. D.; Mauceli, E.; Menzel, U.; Mihalev, A. H.; Minoshima, S.; Murayama, Y.; Naylor, ■ ASSOCIATED CONTENT J. W.; Nicol, R.; Nguyen, C.; O’Leary, S. B.; O’Neill, K.; Parker, S. C.; Polley, A.; Raymond, C. K.; Reichwald, K.; Rodriguez, J.; Sasaki, T.; *S Supporting Information Schilhabel, M.; Siddiqui, R.; Smith, C. L.; Sneddon, T. P.; Talamas, J. Table S1 and S2 show the protein identification scales and A.; Tenzin, P.; Topham, K.; Venkataraman, V.; Wen, G.; Yamazaki, S.; coverage of each chromosome from all samples in Swissprot Young, S. K.; Zeng, Q.; Zimmer, A. R.; Rosenthal, A.; Birren, B. W.; Platzer, M.; Shimizu, N.; Lander, E. S. DNA sequence and analysis of and neXtprot metrics, respectively. Tables S3 and S4 show the − − human chromosome 8. Nature 2006, 439 (7074), 331 5. chromosome CTO and cell line gene distributions from IPA, (4) Vallender, E. J.; Lahn, B. T. Positive selection on the human respectively. Table S5 lists the top chromosome-enriched genome. Hum. Mol. Genet. 2004, 13 Spec No 2, R245−54. biofunctions from IPA. Table S6 lists the top covered (5) Maxwell, A.; Morrison, G.; Dorin, J. Rapid sequence divergence chromosomes in each biofunction from IPA. Figure S1 in mammalian [beta]-defensins by adaptive evolution. Mol. Immunol. illustrates that the median normalized quants of all and 2003, 40 (7), 413−421. chromosome 8-coded proteins from the 18 samples did not (6) Evans, P. D.; Anderson, J. R.; Vallender, E. J.; Choi, S. S.; Lahn, B. significantly differ. Figure S2 shows the heatmap of protein T. Reconstructing the evolutionary history of microcephalin, a gene abundance located on chromosome 8 in each sample. Figure S3 controlling human brain size. Hum. Mol. Genet. 2004, 13 (11), 1139− demonstrates that the protein abundances of several HCC, 45. CRC, and GC suppressor genes (other than those located on (7) Evans, P. D.; Gilbert, S. L.; Mekel-Bobrov, N.; Vallender, E. J.; 8p) were down-regulated or deficient in the three carcinom- Anderson, J. R.; Vaez-Azizi, L. M.; Tishkoff, S. A.; Hudson, R. R.; Lahn, B. T. Microcephalin, a gene regulating brain size, continues to atous digestive organs. These materials are available free of evolve adaptively in humans. Science 2005, 309 (5741), 1717−20. charge at http://pubs.acs.org.This material is available free of (8) Plentz, R. R.; Schlegelberger, B.; Flemming, P.; Gebel, M.; charge via the Internet at http://pubs.acs.org. Kreipe, H.; Manns, M. P.; Rudolph, K. L.; Wilkens, L. Telomere shortening correlates with increasing aneuploidy of chromosome 8 in ■ AUTHOR INFORMATION human hepatocellular carcinoma. Hepatology 2005, 42 (3), 522−6. (9) Qin, L. X.; Tang, Z. Y.; Sham, J. S.; Ma, Z. C.; Ye, S. L.; Zhou, X. Corresponding Author D.; Wu, Z. Q.; Trent, J. M.; Guan, X. Y. The association of *(H.L.) Tel and Fax: +86 21 54237007. E-mail: luhaojie@ chromosome 8p deletion and tumor metastasis in human − fudan.edu.cn. (S.L.) Tel and Fax: +86 10 80485324. E-mail: hepatocellular carcinoma. Cancer Res. 1999, 59 (22), 5662 5. [email protected]. (F.H.) Tel and Fax: +86 10 68177417. E- (10) Albulescu, R.; Neagu, M.; Albulescu, L.; Tanase, C. Tissular and mail: [email protected]. (F.Z.) Tel and Fax: +86 21 soluble miRNAs for diagnostic and therapy improvement in digestive tract cancers. Expert Rev. Mol. Diagn. 2011, 11 (1), 101−20. 54237158. E-mail: [email protected]. (11) Chinese Human Liver Proteome Profiling Consortium, First Author Contributions insight into human liver proteome from PROTEOMESKY-LIVERHu ○ These authors contribute equally to this work. 1.0, a publicly-available database. J. Proteome Res. 2010, 9, (1), 79-94. (12) Griffin, N. M.; Yu, J.; Long, F.; Oh, P.; Shore, S.; Li, Y.; Koziol, Notes J. A.; Schnitzer, J. E. Label-free, normalized quantification of complex The authors declare no competing financial interest. mass spectrometry data for proteomic analysis. Nat. Biotechnol. 2010, 28 (1), 83−9. (13) Schwanhausser, B.; Busse, D.; Li, N.; Dittmar, G.; Schuchhardt, ■ ACKNOWLEDGMENTS J.; Wolf, J.; Chen, W.; Selbach, M. Global quantification of mammalian − The study was supported by MOST-S973/863 projects control. Nature 2011, 473 (7347), 337 42. (2013CB910802, 2010CB912700, 2012CB910600, (14) Huang, D. W.; Sherman, B. T.; Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics 2012AA020200, 2011CB910600, 2011CB910701, resources. Nat. Protoc. 2008, 4 (1), 44−57. 2012AA020201, and 2012AA020206), National Natural (15) Subramanian, A.; Tamayo, P.; Mootha, V. K.; Mukherjee, S.; Science Foundation of China projects (31000379, 21025519, Ebert, B. L.; Gillette, M. A.; Paulovich, A.; Pomeroy, S. L.; Golub, T. 81201534, 31070673, 31170780, and 91131009), Shanghai R.; Lander, E. S.; Mesirov, J. P. Gene set enrichment analysis: a Health Bureau Project (2010Y005), State Key Project knowledge-based approach for interpreting genome-wide expression Specialized for Infectious Diseases (2012ZX10002012−006). profiles. Proc. Natl. Acad. Sci. U.S.A. 2005, 102 (43), 15545−50.

F dx.doi.org/10.1021/pr300834r | J. Proteome Res. XXXX, XXX, XXX−XXX Journal of Proteome Research Article

(16) Bruford, E. A.; Lush, M. J.; Wright, M. W.; Sneddon, T. P.; CA XIII, in normal and neoplastic colorectal mucosa. BMC Cancer Povey, S.; Birney, E. The HGNC Database in 2008: a resource for the 2005, 5, 41. human genome. Nucleic Acids Res. 2008, 36 (suppl 1), D445−D448. (32) Herbst, A.; Helferich, S.; Behrens, A.; Goke, B.; Kolligs, F. T. (17) Subramanian, A.; Kuehn, H.; Gould, J.; Tamayo, P.; Mesirov, J. The transcription factor ITF-2A induces cell cycle arrest via P. GSEA-P: a desktop application for Gene Set Enrichment Analysis. p21(Cip1). Biochem. Biophys. Res. Commun. 2009, 387 (4), 736−40. Bioinformatics 2007, 23 (23), 3251−3. (33) Beck, M.; Schmidt, A.; Malmstroem, J.; Claassen, M.; Ori, A.; (18) Desiere, F.; Deutsch, E. W.; King, N. L.; Nesvizhskii, A. I.; Szymborska, A.; Herzog, F.; Rinner, O.; Ellenberg, J.; Aebersold, R. Mallick, P.; Eng, J.; Chen, S.; Eddes, J.; Loevenich, S. N.; Aebersold, R. The quantitative proteome of a human cell line. Mol. Syst. Biol. 2011, The PeptideAtlas project. Nucleic Acids Res. 2006, 34 (Database issue), 7, 549. D655−8. (34) Yeh, S. H.; Chen, P. J.; Shau, W. Y.; Chen, Y. W.; Lee, P. H.; (19) Schaab, C.; Geiger, T.; Stoehr, G.; Cox, J.; Mann, M. Analysis of Chen, J. T.; Chen, D. S. Chromosomal allelic imbalance evolving from high accuracy, quantitative proteomics data in the MaxQB database. liver cirrhosis to hepatocellular carcinoma. Gastroenterology 2001, 121 Mol. Cell. Proteomics 2012, 11 (3), M111.014068. (3), 699−709. (20) Roessler, S.; Long, E. L.; Budhu, A.; Chen, Y.; Zhao, X.; Ji, J.; (35) Liao, C.; Zhao, M.; Song, H.; Uchida, K.; Yokoyama, K. K.; Li, Walker, R.; Jia, H. L.; Ye, Q. H.; Qin, L. X.; Tang, Z. Y.; He, P.; T. Identification of the gene for a novel liver-related putative tumor Hunter, K. W.; Thorgeirsson, S. S.; Meltzer, P. S.; Wang, X. W. suppressor at a high-frequency loss of heterozygosity region of Integrative genomic identification of genes on 8p associated with chromosome 8p23 in human hepatocellular carcinoma. Hepatology hepatocellular carcinoma progression and patient survival. Gastro- 2000, 32 (4 Pt 1), 721−7. enterology 2012, 142 (4), 957−66. (36) Farazi, P. A.; DePinho, R. A. Hepatocellular carcinoma (21) Yuan, B. Z.; Miller, M. J.; Keck, C. L.; Zimonjic, D. B.; pathogenesis: from genes to environment. Nat. Rev. Cancer 2006, 6 Thorgeirsson, S. S.; Popescu, N. C. Cloning, characterization, and (9), 674−87. chromosomal localization of a gene frequently deleted in human liver (37) Lu, T.; Hano, H.; Meng, C.; Nagatsuma, K.; Chiba, S.; Ikegami, cancer (DLC-1) homologous to rat RhoGAP. Cancer Res. 1998, 58 M. Frequent loss of heterozygosity in two distinct regions, 8p23.1 and (10), 2196. 8p22, in hepatocellular carcinoma. World J. Gastroenterol. 2007, 13 (7), (22) Woo, H. G.; Park, E. S.; Cheon, J. H.; Kim, J. H.; Lee, J. S.; Park, 1090−7. B. J.; Kim, W.; Park, S. C.; Chung, Y. J.; Kim, B. G.; Yoon, J. H.; Lee, (38) Mourra, N.; Zeitoun, G.; Portier, G.; Blanche, H.; Tubacher, E.; H. S.; Kim, C. Y.; Yi, N. J.; Suh, K. S.; Lee, K. U.; Chu, I. S.; Roskams, Gressin, L.; Flejou, J. F.; Tiret, E.; Thomas, G.; Olschwang, S. High- T.; Thorgeirsson, S. S.; Kim, Y. J. Gene expression-based recurrence resolution genotyping of chromosome 8 in colon adenocarcinomas prediction of hepatitis B virus-related human hepatocellular carcinoma. reveals recurrent break point but no gene mutation in the 8p21 region. Clin. Cancer Res. 2008, 14 (7), 2056−64. Diagn. Mol. Pathol. 2008, 17 (2), 90−3. (23) Ishii, H.; Baffa, R.; Numata, S. I.; Murakumo, Y.; Rattan, S.; (39) Flanagan, J. M.; Healey, S.; Young, J. Whitehall, V.; Trott, D. A.; Inoue, H.; Mori, M.; Fidanza, V.; Alder, H.; Croce, C. M. The FEZ1 Newbold, R. F.; Chenevix-Trench, G., Mapping of a candidate gene at chromosome 8p22 encodes a leucine-zipper protein, and its colorectal cancer tumor-suppressor gene to a 900-kilobase region on expression is altered in multiple human tumors. Proc. Natl. Acad. Sci. the short arm of chromosome 8. Genes Chromosomes Cancer 2004, 40 U.S.A. 1999, 96 (7), 3928. (3), 247−60. (24) Pole, J. C.; Courtay-Cahen, C.; Garcia, M. J.; Blood, K. A.; (40) Ashktorab, H.; Schaffer, A. A.; Daremipouran, M.; Smoot, D. T.; Cooke, S. L.; Alsop, A. E.; Tse, D. M.; Caldas, C.; Edwards, P. A. High- Lee, E.; Brim, H. Distinct genetic alterations in colorectal cancer. PLoS resolution analysis of chromosome rearrangements on 8p in breast, One 2010, 5 (1), e8879. colon and pancreatic cancer reveals a complex pattern of loss, gain and (41) Cunningham, C.; Dunlop, M. G.; Bird, C. C.; Wyllie, A. H. translocation. Oncogene 2006, 25 (41), 5693−706. Deletion analysis of chromosome 8p in sporadic colorectal adenomas. (25) Armes, J. E.; Hammet, F.; De Silva, M.; Ciciulla, J.; Ramus, S. J.; Br. J. Cancer 1994, 70 (1), 18−20. Soo, W. K.; Mahoney, A.; Yarovaya, N.; Henderson, M. A.; Gish, K. (42) Stamouli, M. I.; Ferti, A. D.; Panani, A. D.; Raftakis, J.; Consoli, Candidate tumor-suppressor genes on chromosome arm 8p in early- C.; Raptis, S. A.; Young, B. D. Application of multiplex fluorescence in onset and high-grade breast cancers. Oncogene 2004, 23 (33), 5697− situ hybridization in the cytogenetic analysis of primary gastric 702. carcinoma. Cancer Genet. Cytogenet. 2002, 135 (1), 23−7. (26) Huang, J.; Zheng, D. L.; Qin, F. S.; Cheng, N.; Chen, H.; Wan, (43) Fujii, H.; Ichikawa, K.; Takagaki, T.; Nakanishi, Y.; Ikegami, M.; B. B.; Wang, Y. P.; Xiao, H. S.; Han, Z. G. Genetic and epigenetic Hirose, S.; Shimoda, T. Genetic evolution of alpha fetoprotein silencing of SCARA5 may contribute to human hepatocellular producing gastric cancer. J. Clin. Pathol. 2003, 56 (12), 942−9. carcinoma by activating FAK signaling. J. Clin. Invest. 2010, 120 (1), (44) Baffa, R.; Santoro, R.; Bullrich, F.; Mandes, B.; Ishii, H.; Croce, 223. C. M. Definition and refinement of chromosome 8p regions of loss of (27) Lapinski, P. E.; Oliver, J. A.; Kamen, L. A.; Hughes, E. D.; heterozygosity in gastric cancer. Clin. Cancer Res. 2000, 6 (4), 1372−7. Saunders, T. L.; King, P. D. Genetic analysis of SH2D4A, a novel (45) Nihei, N.; Kouprina, N.; Larionov, V.; Oshima, J.; Martin, G. adapter protein related to T cell-specific adapter and adapter protein in M.; Ichikawa, T.; Barrett, J. C. Functional evidence for a metastasis lymphocytes of unknown function, reveals a redundant function in T suppressor gene for rat prostate cancer within a 60-kilobase region on cells. J. Immunol. 2008, 181 (3), 2019−27. human chromosome 8p21-p12. Cancer Res. 2002, 62 (2), 367−70. (28) Kioka, N.; Sakata, S.; Kawauchi, T.; Amachi, T.; Akiyama, S. K.; (46) Snijders, A. M.; Nowee, M. E.; Fridlyand, J.; Piek, J. M.; Okazaki, K.; Yaen, C.; Yamada, K. M.; Aota, S. Vinexin: a novel Dorsman, J. C.; Jain, A. N.; Pinkel, D.; van Diest, P. J.; Verheijen, R. vinculin-binding protein with multiple SH3 domains enhances actin H.; Albertson, D. G. Genome-wide-array-based comparative genomic cytoskeletal organization. J. Cell Biol. 1999, 144 (1), 59−69. hybridization reveals genetic homogeneity and frequent copy number (29) Pils, D.; Horak, P.; Gleiss, A.; Sax, C.; Fabjani, G.; Moebus, V. J.; increases encompassing CCNE1 in fallopian tube carcinoma. Oncogene Zielinski, C.; Reinthaller, A.; Zeillinger, R.; Krainer, M. Five genes 2003, 22 (27), 4281−6. from chromosomal band 8p22 are significantly down-regulated in (47) Petersen, S.; Aninat-Meyer, M.; Schluns, K.; Gellert, K.; Dietel, ovarian carcinoma. Cancer 2005, 104 (11), 2417−29. M.; Petersen, I. Chromosomal alterations in the clonal evolution to the (30) Zender, L.; Spector, M. S.; Xue, W.; Flemming, P.; Cordon- metastatic stage of squamous cell carcinomas of the lung. Br. J. Cancer Cardo, C.; Silke, J.; Fan, S. T.; Luk, J. M.; Wigler, M.; Hannon, G. J. 2000, 82 (1), 65−73. Identification and validation of oncogenes in liver cancer using an (48) Kujawski, M.; Sarlomo-Rikala, M.; Gabriel, A.; Szyfter, K.; integrative oncogenomic approach. Cell 2006, 125 (7), 1253−67. Knuutila, S. Recurrent DNA copy number losses associated with (31) Kummola, L.; Hamalainen, J. M.; Kivela, J.; Kivela, A. J.; Saarnio, metastasis of larynx carcinoma. Genes Chromosomes Cancer 1999, 26 J.; Karttunen, T.; Parkkila, S. Expression of a novel carbonic anhydrase, (3), 253−7.

G dx.doi.org/10.1021/pr300834r | J. Proteome Res. XXXX, XXX, XXX−XXX Journal of Proteome Research Article

(49) Bissig, H.; Richter, J.; Desper, R.; Meier, V.; Schraml, P.; Schaffer, A. A.; Sauter, G.; Mihatsch, M. J.; Moch, H. Evaluation of the clonal relationship between primary and metastatic renal cell carcinoma by comparative genomic hybridization. Am. J. Pathol. 1999, 155 (1), 267−74. (50) Adams, J.; Williams, S. V.; Aveyard, J. S.; Knowles, M. A. Loss of heterozygosity analysis and DNA copy number measurement on 8p in bladder cancer reveals two mechanisms of allelic loss. Cancer Res. 2005, 65 (1), 66−75. (51) Wagner, U.; Bubendorf, L.; Gasser, T. C.; Moch, H.; Gorog, J. P.; Richter, J.; Mihatsch, M. J.; Waldman, F. M.; Sauter, G. Chromosome 8p deletions are associated with invasive tumor growth in urinary bladder cancer. Am. J. Pathol. 1997, 151 (3), 753−9. (52) Morikawa, A.; Williams, T.; Dirix, L.; Colpaert, C.; Goodman, M.; Lyles, R.; Zhong, D.; Zhou, W. Allelic imbalances of chromosomes 8p and 18q and their roles in distant relapse of early stage, node- negative breast cancer. Breast Cancer Res. 2005, 7 (6), R1051−7. (53) Rubio-Moscardo, F.; Blesa, D.; Mestre, C.; Siebert, R.; Balasas, T.; Benito, A.; Rosenwald, A.; Climent, J.; Martinez, J. I.; Schilhabel, M. Characterization of 8p21. 3 chromosomal deletions in B-cell lymphoma: TRAIL-R1 and TRAIL-R2 as candidate dosage-dependent tumor suppressor genes. Blood 2005, 106 (9), 3214−22. (54) Macoska, J. A.; Paris, P.; Collins, C.; Andaya, A.; Beheshti, B.; Chaib, H.; Kant, R.; Begley, L.; MacDonald, J. W.; Squire, J. A. Evolution of 8p loss in transformed human prostate epithelial cells. Cancer Genet. Cytogenet. 2004, 154 (1), 36−43. (55) Chang, B.; Liu, W.; Sun, J.; Dimitrov, L.; Li, T.; Turner, A. R.; Zheng, S. L.; Isaacs, W. B.; Xu, J. Integration of somatic deletion analysis of prostate cancers and germline linkage analysis of prostate cancer families reveals two small consensus regions for prostate cancer genes at 8p. Cancer Res. 2007, 67 (9), 4098. (56) Lu, W.; Takahashi, H.; Furusato, B.; Maekawa, S.; Ikegami, M.; Sudo, A.; Egawa, S.; Hano, H. Allelotyping analysis at chromosome arm 8p of high-grade prostatic intraepithelial neoplasia and incidental, latent, and clinical prostate cancers. Genes, Chromosomes Cancer 2006, 45 (5), 509−15. (57) Ye, H.; Pungpravat, N.; Huang, B. L.; Muzio, L. L.; Mariggio, M. A.; Chen, Z.; Wong, D. T.; Zhou, X. Genomic assessments of the frequent loss of heterozygosity region on 8p21.3-p22 in head and neck squamous cell carcinoma. Cancer Genet. Cytogenet. 2007, 176 (2), 100−6. (58) Jin, C.; Jin, Y.; Wennerberg, J.; Akervall, J.; Dictor, M.; Mertens, F. Karyotypic heterogeneity and clonal evolution in squamous cell carcinomas of the head and neck. Cancer Genet. Cytogenet. 2002, 132 (2), 85−96.

H dx.doi.org/10.1021/pr300834r | J. Proteome Res. XXXX, XXX, XXX−XXX