Supplementary Document 3 Comparing Our 6 Genes with Published Hypoxia Gene Signatures
Total Page:16
File Type:pdf, Size:1020Kb
Supplementary Document 3 Comparing our 6 genes with published hypoxia gene signatures We compared our 6 classifier genes with 6 other published transcriptomic based hypoxia gene signatures which have been derived using approaches involving clinical samples from other tumor types than cervical cancer. Three of the signatures were developed by identifying genes clustering with well‐known hypoxia‐ regulated genes: A 99‐gene HNSCC‐derived signature (1), a 26‐gene signature derived from HNSCC‐ and breast cancer samples (2), and a 70‐gene breast‐cancer derived signature (3). In addition, a 13‐gene breast‐ cancer signature derived to identify biological features associated with distant metastasis (4), a pimonidazole‐associated 32‐gene prostate cancer‐derived signature (5), and a 15‐gene HNSCC‐derived classifier constructed based on the gene expressions of more and less hypoxic tumors, as defined by pO2 electrode measurements in a training cohort (6) was addressed. Identifying and comparing the genes in the signatures For identifying the genes in the published signatures, Entrez Gene ID was extracted from the annotations provided and the corresponding official gene symbol was retrieved (see details in Table 1‐6 below, where genes appearing in more than one of the 6 published signatures are marked in red, and genes present among our 6 genes are marked with a green square). When comparing the genes in the 6 published signatures and our 6 genes, there were 209 unique genes, whereas 29 genes (14%) were present in more than one signature (Figure 1). Two out of our signature genes, KCTD11 and P4HA2, were present in three (1, 2, 6) or one (6) other signature, respectively. Figure 1: The frequency of genes appearing in more than one of the seven hypoxia gene signatures. 1 Table 1: The Toustrup et al. 2011 signature. Information in publication Retrieved gene information Illumina probec SYMBOL ABI probe Entrez Gene IDa HUGO SYMBOL PROBE_ID ADM Hs02562698 133 ADM ILMN_1708934 ALDOA Hs00605108 226 ALDOA ILMN_1681374 ANKRD37 Hs00699181 353322 ANKRD37 ILMN_1756417 BNIP3 Hs00969293 664 BNIP3 ILMN_1724658 BNIP3L Hs00188949 665 BNIP3L ILMN_1718961 C3orf28 Hs01055823 26355 FAM162A ILMN_1803647 EGLN3 Hs00222966 112399 EGLN3 ILMN_1667626 KCTD11 Hs00922550 147040 KCTD11 ILMN_1777513 LOX Hs00184700 4015 LOX ILMN_1695880 NDRG1 Hs00608389 10397 NDRG1 ILMN_1809931 P4HA1 Hs00914594 5033 P4HA1 ILMN_1747442 P4HA2 Hs00989996 8974 P4HA2 ILMN_2381697 PDK1 Hs00326943 5163 PDK1 ILMN_1670256 PFKFB3 Hs00998700 5209 PFKFB3 ILMN_2186061 SLC2A1 Hs185278117b 6513 SLC2A1 ILMN_1659027 aIdentified from ABI probset annotation. bAssay was not found. Entrez Gene ID was found based on HUGO gene symbol. c For genes in which several probes were present on the Illumina‐arrays, the probe with best correlation with ABrix in our training cohort of 42 cervical cancer patients was used. 2 Table 2: The Eustace et al. 2013 signature. a b Information in publication Retrieved gene information Illumina probe SYMBOL Entrez Gene ID HUGO SYMBOL PROBE_ID ALDOA 226 ALDOA ILMN_1681374 ANGPTL4 51129 ANGPTL4 ILMN_2386444 ANLN 54443 ANLN ILMN_1739645 BNC1 646 BNC1 ILMN_1692748 C20orf20 55257 MRGBP ILMN_1790136 CA9 768 CA9 ILMN_1725139 CDKN3 1033 CDKN3 ILMN_1666305 COL4A6 1288 COL4A6 ILMN_1811289 DCBLD1 285761 DCBLD1 ILMN_1695604 ENO1 2023 ENO1 ILMN_1710756 FAM83B 222584 FAM83B ILMN_1782863 FOSL1 8061 FOSL1 ILMN_1771841 GNAI1 2770 GNAI1 ILMN_1742044 HIG2 29923 HILPDA ILMN_1659990 KCTD11 147040 KCTD11 ILMN_1777513 KRT17 3872 KRT17 ILMN_1666845 LDHA 3939 LDHA ILMN_1807106 MRPS17 51373 MRPS17 ILMN_1804851 P4HA1 5033 P4HA1 ILMN_1747442 PGAM1 5223 PGAM1 ILMN_1661366 PGK1 5230 PGK1 ILMN_2216852 SDC1 6382 SDC1 ILMN_1800038 SLC16A1 6566 SLC16A1 ILMN_1757052 SLC2A1 6513 SLC2A1 ILMN_1659027 TPI1 7167 TPI1 ILMN_2181191 VEGFA 7422 VEGFA ILMN_1693060 aIdentified based on the symbol provided in the publication using the R annotation package org.Hs.eg.db (7). b For genes in which several probes were present on the Illumina‐arrays, the probe with best correlation with ABrix in our training cohort of 42 cervical cancer patients was used. 3 Table 3: The Winter et al. 2007 signature. Information in publication Retrieved gene information Illumina probeb Entrez SYMBOL Affy_ID Gene IDa HUGO SYMBOL PROBE_ID ADORA2B 205891_at 136 ADORA2B ILMN_1703946 230630_at, 204348_s_at, AK3 225342_at,204347_at 205 AK4 ILMN_2338038 ALDOA 238996_x_at, 214687_x_at, 200966_x_at 226 ALDOA ILMN_1681374 ANGPTL4 221009_s_at,223333_s_at 51129 ANGPTL4 ILMN_2386444 Lrp2bp 227337_at 353322 ANKRD37 ILMN_1756417 ANKRD9 230972_at 122416 ANKRD9 ILMN_2048607 ANLN 222608_s_at, 1552619_a_at 54443 ANLN ILMN_1739645 B4GALT2 209413_at 8704 B4GALT2 ILMN_1806508 BCAR1 223116_at 9564 BCAR1 ILMN_1672596 BMS1L 203082_at 9790 BMS1 ILMN_1777854 BNIP3 201848_s_at,201849_at 664 BNIP3 ILMN_1724658 MGC17624 227806_at 404550 C16orf74 ILMN_1806149 203963_at, 214164_x_at, 204508_s_at, CA12 210735_s_at 771 CA12 ILMN_2382942 CA9 205199_at 768 CA9 ILMN_1725139 CDCA4 218399_s_at 55038 CDCA4 ILMN_1684045 HSPC163 218728_s_at,223993_s_at 29097 CNIH4 ILMN_1714759 COL4A5 213110_s_at 1287 COL4A5 ILMN_2375360 CORO1C 222409_at 23603 CORO1C ILMN_1745954 IL8 202859_x_at 3576 CXCL8 ILMN_2184373 DPM2 209391_at 8818 DPM2 ILMN_1732049 MGC2408 227103_s_at 9718 ECE2 ILMN_1762883 EIF2S1 201144_s_at 1965 EIF2S1 ILMN_1739821 GAPD AFFX‐HUMGAPDH/M33197_5_at 2597 GAPDH ILMN_1802252 SIP1 211114_x_at 8487 GEMIN2 ILMN_2344002 GMFB 202543_s_at 2764 GMFB ILMN_2093674 MGC14560 218461_at 51184 GPN3 ILMN_1704943 GSS 201415_at 2937 GSS ILMN_1683462 C15orf25 229208_at 55142 HAUS2 ILMN_1738482 HES2 214521_at 54626 HES2 ILMN_2094266 HIG2 1554452_a_at 29923 HILPDA ILMN_1659990 HOMER1 213793_s_at 9456 HOMER1 ILMN_1804568 IMP‐2 218847_at 10644 IGF2BP2 ILMN_1702447 KCTD11 235857_at 147040 KCTD11 ILMN_1777513 KRT17 205157_s_at 3872 KRT17 ILMN_1666845 LDHA 200650_s_at 3939 LDHA ILMN_1807106 LDLR 202068_s_at 3949 LDLR ILMN_2053415 LOC149464 232823_at NA NA NA MGC2654 218945_at 79091 METTL22 ILMN_1658290 MIF 217871_s_at 4282 MIF ILMN_1807074 4 MNAT1 203565_s_at 4331 MNAT1 ILMN_2083243 C20orf20 218586_at 55257 MRGBP ILMN_1790136 MRPL14 225201_s_at 64928 MRPL14 ILMN_2072603 MRPS17 218982_s_at 51373 MRPS17 ILMN_1804851 MTX1 210386_s_at 4580 MTX1 ILMN_1667222 NDRG1 200632_s_at 10397 NDRG1 ILMN_1809931 LOC56901 218484_at 56901 NDUFA4L2 ILMN_1756573 NME1 201577_at 4830 NME1 ILMN_1741133 AD‐003 223368_s_at 28989 NTMT1 ILMN_2170515 NUDT15 219347_at 55270 NUDT15 ILMN_2172202 P4HA1 207543_s_at 5033 P4HA1 ILMN_1747442 PAWR 226231_at 5074 PAWR ILMN_1806907 PDZK11 223037_at 51248 PDZD11 ILMN_1690376 PFKFB4 228499_at 5210 PFKFB4 ILMN_1653292 PGAM1 200886_s_at 5223 PGAM1 ILMN_1661366 PGF 209652_s_at 5228 PGF ILMN_1809813 200738_s_at, PGK1 227068_at,217356_s_at,200737_at 5230 PGK1 ILMN_2216852 PLAU 205479_s_at 5328 PLAU ILMN_1656057 PLEKHG3 212821_at 26030 PLEKHG3 ILMN_1780671 PPARD 37152_at 5467 PPARD ILMN_1674282 PPP2CZ 200885_at 333926 PPM1J ILMN_1799150 PPP4R1 201594_s_at 9989 PPP4R1 ILMN_1724544 PSMA7 201114_x_at 5688 PSMA7 ILMN_1701962 PSMB7 200786_at 5695 PSMB7 ILMN_1814156 PSMD2 200830_at 5708 PSMD2 ILMN_1712432 PTGFRN 224950_at, 224937_at 5738 PTGFRN ILMN_1743130 PVR 212662_at 5817 PVR ILMN_1677305 PYGL 202990_at 5836 PYGL ILMN_1696187 RAN 200750_s_at 5901 RAN ILMN_1757384 RNF24 204669_s_at 11237 RNF24 ILMN_1717809 RNPS1 200060_s_at 10921 RNPS1 ILMN_2375386 RUVBL2 201459_at 10856 RUVBL2 ILMN_2120340 S100A10 200872_at 6281 S100A10 ILMN_2046730 S100A3 206027_at 6274 S100A3 ILMN_1712545 202235_at, AFARP1/SLC16A1 202234_s_at,1557918_s_at/209900_s_at, 6566 SLC16A1 ILMN_1757052 202236_s_at SLC2A1 201250_s_at, 201249_at 6513 SLC2A1 ILMN_1659027 SLC6A10 215812_s_at 386757 SLC6A10P ILMN_1772482 SLC6A8 202219_at,210854_x_at,213843_x_at 6535 SLC6A8 ILMN_1806349 SLCO1B3 206354_at 28234 SLCO1B3 ILMN_1687319 C14orf156 221434_s_at 81892 SLIRP ILMN_1661945 SNX24 222716_s_at 28966 SNX24 ILMN_1795666 SPTB 229952_at 6710 SPTB ILMN_1663106 DKFZP564D166 224952_at 26115 TANC2 NA TEAD4 41037_at, 204281_at 7004 TEAD4 ILMN_1722824 5 TFAP2C 205286_at 7022 TFAP2C ILMN_1683891 TIMM23 218119_at 100287932 TIMM23 ILMN_1679555 Kua 223186_at 387521 TMEM189 ILMN_1789732 TMEM30B 213285_at 161291 TMEM30B ILMN_1752935 SMILE 1560017_at 160418 TMTC3 ILMN_2119555 CTEN 230398_at 84951 TNS4 ILMN_1716370 TPBG 203476_at 7162 TPBG NA TPD52L2 201379_s_at 7165 TPD52L2 ILMN_1699570 TPI1 213011_s_at, 200822_x_at 7167 TPI1 ILMN_2181191 KIAA1393 227653_at 57570 TRMT5 ILMN_1673872 TUBB2 208977_x_at,213726_x_at 10383 TUBB4B ILMN_1780769 VAPB 202550_s_at 9217 VAPB ILMN_1678459 VEGF 210512_s_at, 212171_x_at,211527_x_at 7422 VEGFA ILMN_1693060 VEZATIN 223675_s_at 55591 VEZT ILMN_2141398 XPO5 223056_s_at 57510 XPO5 ILMN_1759495 aIdentified based on the Affymetrix probe annotations retrieved using the annotation package hgu133plus2.db in R (8). For LOC149464 no Entrez Gene ID was found. For probes mapped to the two genes AFARP1/SLC16A1 in the publication, reannotation of the probes mapped to the same gene, SLC16A, reducing the number of genes in the signature from 99 to 98. b For genes in which several probes were present on the Illumina‐arrays, the probe with best correlation with ABrix in our training cohort of 42 cervical cancer patients was used. 6 Table 4: The Ghazoiu et al. 2011 signature. Information in publication Retrieved gene information Illumina probeb Accession SYMBOL Entrez Gene HUGO number IDa SYMBOL PROBE_ID AK3L1 NM_016282 50808 AK3 ILMN_1778173 ANKRD37 NM_181726 353322 ANKRD37 ILMN_1756417 ATP1B3 NM_001679 483 ATP1B3 ILMN_1654322 ATP5G3 NM_001002256 518 ATP5G3