Supplemental Data s7
Total Page:16
File Type:pdf, Size:1020Kb
Supplemental Data
Imputation across Genotyping Arrays for Genome-wide Association Studies:
Assessment of Bias and a Correction Strategy
Eric O. Johnson, Dana B. Hancock, Joshua L. Levy, Nathan C. Gaddis, Nancy L. Saccone, Laura
J. Bierut, and Grier P. Page Figure S1. Quantile-quantile plots of SNP association results when combining studies genotyped on different Illumina BeadChip arrays (Human1M or HumanHap550 version 3). The plots compare the observed versus expected P values in (A) European American subjects from SAGE versus PanScan and (B) African American subjects from SAGE versus iControl. SNPs were imputed based on the union of genotyped SNPs available on either array. The number of imputed
SNPs with MAF>1% and the overall λgc are shown in each plot.
A.
34,483 SNPs
Overall λgc=1.075 B.
42,894 SNPs
Overall λgc=1.102 Figure S2. Genomic inflation factors (grey lines) (λgc) and percentages of SNPs having spurious association (black lines) (P<1x10-6), by minor allele frequency (MAF), when combining studies genotyped on different Illumina BeadChip arrays (Human1M or HumanHap550 version 3) and conducting imputation using the IMPUTE2 program. (A, B) African American subjects from
SAGE were compared to iControl subjects. Two different imputed SNP sets were assessed: (A) imputed SNPs based on the union of genotyped SNPs available on either array, and (B) imputed
SNPs based on the intersection of genotyped SNPs available on both arrays. The number of imputed SNPs with MAF>1% and the overall λgc are shown in each plot.
A.
42,826 SNPs
Overall λgc=1.047 B.
42,812 SNPs
Overall λgc=0.997 Figure S3. Genomic inflation factors (λgc) and percentages of SNPs having spurious association
(P<1x10-6), by minor allele frequency (MAF), when combining studies genotyped on slightly different Illumina BeadChips (HumanHap550 version 1 or version 3). Imputed SNPs, based on the union of genotyped SNPs available on either array, are shown for (A) European American subjects from PanScan versus CGEMS study subjects and (D) African American subjects from iControl subsets genotyped on different BeadChips. The number of imputed SNPs with
MAF>1% and the overall λgc are shown in each plot.
A.
34,435 SNPs
Overall λgc=1.029 B.
42,853 SNPs
Overall λgc=1.065 Figure S4. Genomic inflation factors (λgc) and percentages of SNPs having spurious association
(P<1x10-6), by minor allele frequency (MAF), when combining studies genotyped on either the
Illumina Human1M or Affymetrix 6.0 array. SNPs were imputed based on the union of genotyped SNPs available on either array in (A) European American and (B) African American subjects from SAGE (genotyped on Illumina 1M) versus the GAIN GWAS of Schizophrenia
(genotyped on Affymetrix 6.0). The number of imputed SNPs with MAF>1% and the overall λgc are shown in each plot.
A.
34,503 SNPs
Overall λgc=1.270
B.
43,035 SNPs
Overall λgc=1.269 Figure S5. Genomic inflation factors (λgc) and percentages of SNPs having spurious association
(P<1x10-6), by minor allele frequency (MAF), when combining studies genotyped on different
Illumina BeadChip arrays (Human 1M or HumanHap550K version 3). (A) European American subjects from the SAGE study were compared to PanScan study subjects, and (B) African
American control subjects from the SAGE study were compared to iControl subjects. Imputed
SNPs, based on the union of genotyped SNPs available on either array, are shown, following the removal of imputed SNPs with R2<0.3. The remaining number of imputed SNPs with MAF>1%
2 2 and R >0.3 are shown, along with the overall λgc after the R filtering.
A.
33,928 SNPs
Overall λgc=1.057 B.
42,064 SNPs
Overall λgc=1.080