Author manuscript, published in "Nature Genetics 2012;44(6):694-8" DOI : 10.1038/ng.2256
a Final target bases 46.2 Mb Target bases covered 43.3 Mb Percent target bases covered 93.81%
Average depth 73.3 Median depth 70
Coverage at 25x 75.9% Coverage at 10x 87.0% Coverage at 4x 92.3% Coverage at 1x 96.0%
b 100
75
50
25 Reads depth (X) depth Reads
0 1 2 3 4 5 6 7 8 9 5 6 2 10 11 12 13 14 1 1 17 18 19 20 21 2 X # Chromosome c 1.0
0.8
0.6
0.4 Coverage inserm-00719917, version 1 - 22 Jul 2012
in target regions target in 0.2 Coveragex1 Coveragex4 Coveragex10 Coveragex25 0.0 1 2 3 4 5 6 7 8 9 5 8 1 4 7 8 1 4 10 11 12 13 14 1 16 17 1 19 20 2 22 23 2 25 26 2 28 29 30 31 32 33 34 35 36 37 3 39 40 4 42 43 4 45 46 47 48 # Exomes
Supplementary Figure 1. Statistics of mapping of sequencing reads
a, summary statistics for whole-exome sequence reads of 24 HCC with their non tumor liver tissues. b, mean depth (with 95% IC) of reads on each chromosome, c, cumulative fraction of coding bases covered in captured regions. A 1-fold, 4-fold, 10-fold and 25- fold coverage were considered (mean with 95% IC) per exome (numbered #1-48). $ Bioinforma5cs$workflow$ Variant$calling$Tumor$ ! Variant$calling$Normal$ 1,775,892$variants$
$ Database$extrac5on$ ! dbSNP$ 1,577,724$$
198,168$ $ background$known$polymorphisms$ $ Coding$SNPs/Indels$ (targeted$sequencing$of$exons)$ ! Intronic,$ 16,498$ UTR$ synonymous$$ 65,947$ variants$ non$coding$variants$ 44,172$nonsynonymous$ $coding$SNPs/Indels$ $ $ Quality$check$of$reads$ (Mean$size$±3SD,$Paired$end,$Phred$>=10,$$N>=$6)$ $ !
35,539$ $
$$ Soma5c$variants$ ! 1000$ Proprietary$ Duplicate$variants$$ Genomes$ database$ per$individual$ 29,056$$ germline$ (unique$per$individual) 3,777$ $ nonsynonymous$$ soma5c$novel$nonsynonymous$$ coding$SNPs$
coding$SNPs$ /Indels$ $ Func5onal$Evalua5on$ (visual$control,$PPH2)$ ! Artefact$(visual$control)$`$$1,501$
Predict$to$be$benign$or$neutral$(Polyphen`2)$`$1,190$
1,086$variants$predicted$to$alter$func5on$ $ inserm-00719917, version 1 - 22 Jul 2012
$$ Independant$verifica5on$ (capillary$Sanger$sequencing)$ Verified$as$ Verified$as$ ! Not$validated$ germline$ soma5c$ 23$ 994$ 69$
Supplementary Figure 2. Whole-exome sequencing analysis flowchart. Extrac on of func onally soma c gene muta ons for 24 HCC. Boxes refer to major bioinforma cs steps; arrows indicate the number of variants obtained or removed from subsequent analysis. Variants were filtered for their coding localiza on, annota on in dbSNP31 or 1000 genomes, soma c and func onally impairment. The per-base and reads quality scores were used to filter false posi ve gene muta on events. a
b
100 p< 0.0001 80
60
40
Percent survival Percent 20 None Homozygous Deletion 0 0 20 40 60 Time
subjects 57 43 28 21 at risk 42 19 9 3
Supplementary Figure 3. Genome-wide copy number changes and recurrent homozygous dele ons in HCC. a, Frequency of gains, dele ons, and losses of heterozygosity (LOH) along the genome in a series of 125 HCC. b, Kaplan-Meier analysis comparing survival of cases presen ng homozygous dele ons (n=42) versus those without local losses (n=57). Univariate inserm-00719917, version 1 - 22 Jul 2012 Cox P-value for risk index is included. Only pa ents with cura ve (R0) resec on were included in survival analysis. CHC012T
● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●
CHC031T CHC014T
● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 ● ●● 0 ● ● ● ● ● ●● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ●● ● ● ●● ●● ● ● ●● ● ● ●●● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● CHC031T CHC130T
● ● ● ● ● ● ● ● ● ●● ● ● ● ●●● ● ● ● ● ●● ● ● ● ● ● ● ● ●● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ●●● ●● ●● ● ● ●● ● ● ● 0 ● ● ●● ● ● ● 0 ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● Log R Ratio ● ● ● ● ● ● ●● ● ● ● ● ● CHC1199T CHC205T
● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● 0 ● ● ● ● ● ● ● ●●● ● 0 ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● HBZHBA2 HBQ1 ITFG3 ARHGDIG MPG AXIN1 CHC218T
● ● ● ● ● ● ● ● ● ● ●● ● ● ● ●● ● ● ● ● ● ● ●● ● ● ●● ● ●● ● ●●● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● NPRL3 HBM HBA1 LUC7L RGS11 PDIA2 MRPL28 ● ● ● ● ● ● ● ● ● ● 300 400 CHC235T Genomic position (Kb)
● ● ●● ●● ● ● ● ●● ●● ● ● ● ●● ● ● ●● ● ● ● ● ● ●● ● ●● ●● ● ● ● ● ● ●● 0 ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●
Log R Ratio ●● ● ●● ● ● ● ● ● ● ● ●● ● ● CHC012T ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ●● ● ● ● ● ● ● ● ●● ● ● ● ● ● ●●● ●● ● ●● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ●●● ● ● ● 0 ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● CHC433T ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●●● ● ● ● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ● ● 0 ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●
● ● ● ● ● ● ● ● CHC018T ● ● ● ● ● ● ● ●
● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ●● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●●● ●● ● ● 0 ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ●● ●●● ● ● ●●● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ●● ● ● ● ● ● CHC793T ● ● ● ● ●