<<

Supplementary Information (Figures and Tables) for:

An evolutionary medicine perspective on Neandertal extinction

Alexis P. Sullivan1, Marc de Manuel3, Tomas Marques-Bonet3,4,5, & George H. Perry1,2

Departments of 1Biology and 2Anthropology, Pennsylvania State University, University Park, PA 16802, USA

3Institut de Biologia Evolutiva (CSIC/UPF), Parque de Investigación Biomédica de Barcelona (PRBB), Barcelona, Catalonia 08003, Spain

4CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain

5Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain

Corresponding Author: George H. Perry E-mail: [email protected]

Supplemental Figure 1: Innate immune system permutation analyses – 10,000 sets of 73 randomly selected containing nonsynonymous SNPs

Supplemental Figure 2: MHC gene permutation analyses – 10,000 sets of 13 randomly selected genes containing nonsynonymous SNPs

Supplemental Figure 3: Significantly enriched categories (red) among top 1% ape diversity genes

Supplemental Table 1: A comparison of genome-wide nonsynonymous SNPs versus total (nonsynonymous + synonymous) SNPs between Neandertal and modern human populations

Supplemental Table 2: PolyPhen-2 predictions for genome-wide nonsynonymous SNPs – damaging versus not damaging – for Neandertal and modern human populations

Supplemental Table 3: List of Innate immune system genes

Supplemental Table 4: List of MHC genes

Supplemental Table 5: A comparison of nonsynonymous SNPs (benign + damaging) in innate immune system genes versus genome-wide genes (not including innate immune genes) between Neandertal and modern human populations

Supplemental Table 6: A comparison of nonsynonymous SNPs (benign + damaging) in MHC genes versus genome-wide genes (not including MHC genes) between Neandertal and modern human populations

Supplemental Table 7: Neandertal MAF comparison of nonsynonymous SNPs in MHC genes

Supplemental Table 8: A comparison of nonsynonymous SNP (benign + damaging) MAFs in MHC genes between Neandertal and modern human populations

Supplemental Table 9: List of top 1% ape diversity genes

Supplemental Table 10: Results of the top 1% ape diversity genes Gene Ontology analysis

Supplemental Table 11: A comparison of nonsynonymous SNPs (benign + damaging) in top 1% ape diversity genes between Neandertal and modern human populations

Supplemental Figure 1: Innate immune system gene permutation analyses - 10,000 sets of 73 randomly selected genes containing nonsynonymous SNPs Neandertal-European 800 Observed value

600

400

Hypothesis that observed ratio is 200 less than that expected by chance: Frequency of permutations P = 0.1944 0 0 0.5 1.0 1.5 2.0 2.5 Ratio Neandertal:European human nonsynonymous SNPs

Neandertal-African Neandertal-Asian 800 800 Observed value Observed value

600 600

400 400

Hypothesis that Hypothesis that observed ratio is observed ratio is 200 200 less than that less than that expected by chance: expected by chance: Frequency of permutations P = 0.2789 Frequency of permutations P = 0.1400 0 0 0 0.5 1.0 1.5 2.0 2.5 0 0.5 1.0 1.5 2.0 2.5 Ratio Neandertal:African human Ratio Neandertal:Asian human nonsynonymous SNPs nonsynonymous SNPs Supplemental Figure 2: MHC gene permutation analyses - 10,000 sets of 13 randomly selected genes containing nonsynonymous SNPs Neandertal-European 2000 Observed value

1500

1000

Hypothesis that observed ratio is 500 greater than that expected by chance:

Frequency of permutations P = 0.1556

0 0 2.0 4.0 6.0 8.0 Ratio Neandertal:European human nonsynonymous SNPs

Neandertal-African Neandertal-Asian 2000 2000 Observed value Observed value

1500 1500

1000 1000

Hypothesis that Hypothesis that observed ratio is observed ratio is 500 greater than that 500 greater than that expected by chance: expected by chance: Frequency of permutations P = 0.0222 Frequency of permutations P = 0.0759

0 0 0 2.0 4.0 6.0 8.0 0 2.0 4.0 6.0 8.0 Ratio Neandertal:African human Ratio Neandertal:Asian human nonsynonymous SNPs nonsynonymous SNPs Supplemental Figure 3: Significantly enriched gene ontology categories (red) among top 1% ape diversity genes Supplemental Table 1: A comparison of genome-wide nonsynonymous SNPs versus total (nonsynonymous + synonymous) SNPs between Neandertal and modern human populations Population Neandertal African European Asian Total Nonsynonymous SNPs 2469 6445 4561 4654 Total Synonymous SNPs 2472 8195 5692 5652 Total Nonsyn/Synon SNPs 4941 14640 10253 10306 Proportion Nonsyn:Total SNPs 0.4997 0.4402 0.4448 0.4516 Fisher's Exact Test P-value - 4.52E-13 2.29E-10 2.59E-08 Supplemental Table 2: PolyPhen-2 predictions for genome-wide nonsynonymous SNPs - damaging versus not damaging - for Neandertal and modern human populations Population Neandertal African European Asian Total Damaging Nonsyn. SNPs 1073 1878 1331 1380 Total Not-Damaging Nonsyn. SNPs 1396 4567 3230 3274 Total Nonsynonymous SNPs 2469 6445 4561 4654 Proportion Damaging SNPs:Total Nonsyn. SNPs 0.4346 0.2914 0.2918 0.2965 Fisher's Exact Test P-value - 2.20E-16 2.20E-16 2.20E-16 Supplemental Table 3: List of innate immune system genes Gene ID Description NLRP1 NOD-like receptors NLRP2 NOD-like receptors NLRP3 NOD-like receptors NLRP4 NOD-like receptors NLRP5 NOD-like receptors NLRP6 NOD-like receptors NLRP7 NOD-like receptors NLRP8 NOD-like receptors NLRP9 NOD-like receptors NLRP10 NOD-like receptors NLRP11 NOD-like receptors NLRP12 NOD-like receptors NLRP13 NOD-like receptors NLRP14 NOD-like receptors NOD1 NOD-like receptors NOD2 NOD-like receptors NLRC3 NOD-like receptors NLRC4 NOD-like receptors NLRC5 NOD-like receptors NLRX1 NOD-like receptors CIITA NOD-like receptors NAIP NOD-like receptors RIG-1 RIG-I-like receptors IFIH1 RIG-I-like receptors LGP2 RIG-I-like receptors TLR1 Toll-Like receptors TLR2 Toll-Like receptors TLR3 Toll-Like receptors TLR4 Toll-Like receptors TLR5 Toll-Like receptors TLR6 Toll-Like receptors TLR7 Toll-Like receptors TLR8 Toll-Like receptors TLR9 Toll-Like receptors TLR10 Toll-Like receptors CLEC7A C-type lectins CD209 C-type lectins CARD9 C-type lectins CLECL1 C-type lectins MR C-type lectins CD206 C-type lectins MRC1 C-type lectins CLEC16A C-type lectins IFI16 Cytosololic DNA sensors MNDA Cytosololic DNA sensors IFIX Cytosololic DNA sensors AIM2 Cytosololic DNA sensors MYD88 adaptors TRIF adaptors MAL adaptors TRAM adaptors IRAK4 adaptors IRAK1 adaptors C3 alternative pathway;classical pathways, Lectin pathway C5 alternative pathway;classical pathways, Lectin pathway C6 alternative pathway;classical pathways, Lectin pathway C7 alternative pathway;classical pathways, Lectin pathway C8A alternative pathway;classical pathways, Lectin pathway C9 alternative pathway;classical pathways, Lectin pathway CFB alternative pathway CFD alternative pathway CFP alternative pathway C2 classical pathways, Lectin pathway C1QA classical pathway C1QB classical pathway C1QC classical pathway C1R classical pathway C1S classical pathway C4A classical pathways, Lectin pathway C4B classical pathways, Lectin pathway MASP1 Lectin pathway MASP2 Lectin pathway MBL2 Lectin pathway Supplemental 4: List of MHC genes Gene ID HLA-A HLA-C HLA-DRA HLA-DRB5 HLA-DQA1 HLA-DQB2 HLA-DOB HLA-DMB HLA-DMA HLA-DOA HLA-DPA1 HLA-DPB1 HLA-DPB2 Supplemental Table 5: A comparison of nonsynonymous SNPs (benign + damaging) in innate immune system genes versus genome-wide genes (not including innate immune system genes) between Neandertal and modern human populations Population Neandertal African European Asian Total Immune Nonsynon. SNPs 16 52 41 45 Total Not-Immune Nonsynon. SNPs 2544 6654 4705 4783 Total Nonsynonymous SNPs 2560 6706 4746 4828 Fisher's Exact Test P-value - 0.4985 0.3294 0.1788 Supplemental Table 6: A comparison of nonsynonymous SNPs (benign + damaging) in MHC genes versus genome-wide genes (not including MHC genes) between Neandertal and modern human populations Population Neandertal African European Asian Total MHC Nonsynon. SNPs 17 9 13 9 Total Not-MHC Nonsynon. SNPs 2527 6645 4692 4774 Total Nonsynonymous SNPs 2560 6706 4746 4828 Fisher's Exact Test P-value - 6.58E-05 1.98E-02 1.59E-03 Supplemental Table 7: Neandertal MAF comparison of nonsynonymous SNPs in MHC genes Population Neandertal (MHC) Neandertal (Not-MHC) Total SNPs with MAF = 3 9 260 Total SNPs with MAF = 1 or 2 8 2283 Total Nonsynonymous SNPs 17 2527 Fisher's Exact Test P-value - 1.56E-05 Supplemental Table 8: A comparison of nonsynonymous SNP (benign + damaging) MAFs in MHC genes between Neandertal and modern human populations Population Neandertal African European Asian Total SNPs with MAF = 3 9 3 1 1 Total SNPs with MAF = 1 or 2 8 6 12 8 Total Nonsynonymous SNPs 17 9 13 9 Fisher's Exact Test P-value - 0.4291 0.0174 0.08733 Supplemental Table 9: List of top 1% ape diversity genes Gene ID LRRC38 VANGL1 FCRL5 CD1A CD1E SPTA1 OR6N2 C1orf186 CAPN8 ZP4 OR14A16 OR2T1 AP1S3 C3orf30 KLHL6 TMEM207 GP5 PROM1 SLC34A2 SPP1 LOC153328 LYRM4 OR2H1 MAS1L HLA-DRA HLA-DQB2 BET3L PKIB TAAR6 IYD PARK2 PKD1L1 GIMAP5 IDO2 OPRK1 NKAIN3 C9orf150 OR1L8 GLT6D1 OR13A1 ZNF488 SFXN4 C10orf122 OR52R1 OR51G1 OR5B12 GLYAT MS4A2 MMP27 FAM55B OR8D4 GPRC5A OAS1 GLT1D1 OR4K14 SERPINA9 C15orf54 USP50 AGBL1 OR4F6 GSG1L CES7 CLEC3A KIAA0513 OR1A2 SLFN13 C18orf26 ZNF30 RASSF2 KRTAP27-1 UMODL1 PKNOX1 PDXK Supplemental Table 10: Results of top 1% ape diversity genes Gene Ontology analysis

GO category subroot: name GO ID Observed # genes Expected # genes Raw P-value Adjusted P-value Gene ID HLA-DRA, SPTA1, GIMAP5, ZP4, HLA-DQB2, biological process: positive regulation of leukocyte activation GO:0002696 6 0.62 2.95E-05 0.0061 MS4A2 HLA-DRA, SPTA1, GIMAP5, ZP4, GP5, HLA- biological process: regulation of cell activation GO:0050865 7 0.99 4.62E-05 0.0061 DQB2, MS4A2 HLA-DRA, SPTA1, GIMAP5, ZP4, HLA-DQB2, biological process: positive regulation of cell activation GO:0050867 6 0.67 4.65E-05 0.0061 MS4A2 HLA-DRA, SPTA1, GIMAP5, ZP4, biological process: positive regulation of T cell activation GO:0050870 5 0.45 7.49E-05 0.0074 HLA-DQB2 HLA-DRA, SPTA1, GIMAP5, ZP4, HLA-DQB2, biological process: regulation of leukocyte activation GO:0002694 6 0.89 0.0002 0.0098 MS4A2 CD1E, HLA- DRA, AP1S3, HLA-DQB2, biological process: antigen processing and presentation GO:0019882 5 0.54 0.0002 0.0098 CD1A CD1E, HLA- DRA, SPP1, SPTA1, ZP4, KLHL6, OPRK1, HLA-DQB2, MS4A2, GIMAP5, AP1S3, OAS1, biological process: immune system process GO:0002376 14 4.92 0.0002 0.0098 CD1A, PKNOX1 HLA-DRA, SPTA1, GIMAP5, ZP4, biological process: positive regulation of lymphocyte activation GO:0051251 5 0.55 0.0002 0.0098 HLA-DQB2 HLA-DRA, SPTA1, GIMAP5, ZP4, biological process: regulation of T cell activation GO:0050863 5 0.61 0.0003 0.0131 HLA-DQB2 HLA-DRA, SPTA1, GIMAP5, ZP4, HLA-DQB2, biological process: T cell activation GO:0042110 6 1 0.0004 0.0158 PKNOX1

OR4K14, OR14A16, OR6N2, OR2T1, OR51G1, OR2H1, OR8D4, OR5B12, OR1A2, OR4F6, OR13A1, molecular function: activity GO:0004984 13 0.9 2.00E-12 1.42E-10 OR52R1, OR1L8

TAAR6, GPRC5A, OR2T1, OR51G1, OR2H1, OR8D4, OR5B12, OR1A2, OR4F6, OR4K14, OR14A16, OPRK1, OR6N2, OR13A1, OR52R1, molecular function: G- coupled receptor activity GO:0004930 17 2.33 3.83E-11 1.36E-09 MAS1L, OR1L8 TAAR6, GPRC5A, MS4A2, OR2T1, OR51G1, OR2H1, OR8D4, OR5B12, OR1A2, OR4F6, OR4K14, OR14A16, OPRK1, OR6N2, OR13A1, OR52R1, molecular function: transmembrane signaling receptor activity GO:0004888 18 3.88 1.42E-08 3.36E-07 MAS1L, OR1L8

TAAR6, GPRC5A, MS4A2, OR2T1, OR51G1, OR2H1, OR8D4, OR5B12, OR1A2, OR4F6, OR4K14, OR14A16, OPRK1, OR6N2, OR13A1, OR52R1, molecular function: signaling receptor activity GO:0038023 18 4.28 6.78E-08 1.20E-06 MAS1L, OR1L8

TAAR6, ZP4, GPRC5A, MS4A2, OR2T1, OR51G1, OR2H1, OR8D4, OR5B12, OR1A2, OR4F6, OR4K14, OR14A16, OPRK1, OR6N2, OR13A1, OR52R1, molecular function: molecular transducer activity GO:0060089 19 5.47 5.28E-07 6.25E-06 OR1L8, MAS1L

TAAR6, ZP4, GPRC5A, MS4A2, OR2T1, OR51G1, OR2H1, OR8D4, OR5B12, OR1A2, OR4F6, OR4K14, OR14A16, OPRK1, OR6N2, OR13A1, OR52R1, molecular function: signal transducer activity GO:0004871 19 5.47 5.28E-07 6.25E-06 OR1L8, MAS1L

TAAR6, GPRC5A, MS4A2, OR2T1, OR51G1, OR2H1, OR8D4, OR5B12, OR1A2, OR4F6, OR4K14, OR14A16, OPRK1, OR6N2, OR13A1, OR52R1, molecular function: receptor activity GO:0004872 18 5.18 1.21E-06 1.23E-05 MAS1L, OR1L8 HLA-DRA, HLA- molecular function GO:0032395 2 0.05 0.0008 0.0063 DQB2 molecular function: sodium ion binding GO:0031402 2 0.05 0.0008 0.0063 PDXK, SLC34A2 molecular function: alkali metal ion binding GO:0031420 2 0.07 0.0022 0.0156 PDXK, SLC34A2 HLA-DRA, VANGL1, MS4A2, OR51G1, OR2H1, LRRC38, OR8D4, OR1A2, CD1A, SLC34A2, NKAIN3, CD1E, OR4K14, IYD, GP5, C1orf186, FCRL5, MAS1L, TAAR6, GPRC5A, ZP4, SFXN4, UMODL1, GLT6D1, OR2T1, OR5B12, TMEM207, GIMAP5, OR4F6, PROM1, SPTA1, PKD1L1, OPRK1, OR14A16, C18orf26, HLA- DQB2, OR6N2, OR13A1, OR1L8, OR52R1, cellular component: intrinsic to membrane GO:0031224 41 20.16 6.33E-08 6.58E-06 GSG1L HLA-DRA, VANGL1, MS4A2, OR51G1, OR2H1, LRRC38, OR8D4, OR1A2, CD1A, SLC34A2, NKAIN3, CD1E, OR4K14, IYD, GP5, C1orf186, FCRL5, MAS1L, TAAR6, GPRC5A, ZP4, SFXN4, UMODL1, GLT6D1, OR2T1, OR5B12, TMEM207, GIMAP5, OR4F6, PROM1, PKD1L1, OPRK1, OR14A16, C18orf26, HLA- DQB2, OR6N2, OR13A1, OR1L8, OR52R1, cellular component: integral to membrane GO:0016021 40 19.8 1.43E-07 7.44E-06 GSG1L HLA-DRA, VANGL1, MS4A2, OR51G1, OR2H1, LRRC38, OR8D4, OR1A2, CD1A, SLC34A2, NKAIN3, CD1E, OR4K14, IYD, GP5, C1orf186, FCRL5, AP1S3, MAS1L, TAAR6, GPRC5A, ZP4, SFXN4, UMODL1, GLT6D1, OR2T1, OR5B12, TMEM207, GIMAP5, OR4F6, PKD1L1, PROM1, SPTA1, OPRK1, OR14A16, C18orf26, HLA- DQB2, OR6N2, OR13A1, OR1L8, cellular component: membrane part GO:0044425 42 22.95 9.37E-07 3.25E-05 OR52R1, HLA-DRA, MS4A2, OR51G1, OR2H1, LRRC38, OR8D4, OR1A2, CD1A, SLC34A2, NKAIN3, CD1E, OR4K14, IYD, GP5, FCRL5, MAS1L, TAAR6, GPRC5A, ZP4, UMODL1, OR2T1, OR5B12, OR4F6, PROM1, SPTA1, OR14A16, OPRK1, HLA- DQB2, OR6N2, OR13A1, cellular component: plasma membrane GO:0005886 32 17.53 8.87E-05 0.0015 OR52R1, OR1L8

HLA-DRA, MS4A2, OR51G1, OR2H1, LRRC38, OR8D4, OR1A2, CD1A, SLC34A2, NKAIN3, CD1E, OR4K14, IYD, GP5, FCRL5, MAS1L, TAAR6, GPRC5A, ZP4, UMODL1, OR2T1, OR5B12, OR4F6, PROM1, SPTA1, OR14A16, OPRK1, HLA- DQB2, OR6N2, OR13A1, cellular component: cell periphery GO:0071944 32 17.93 0.0001 0.0015 OR52R1, OR1L8 HLA-DRA, AP1S3, HLA- cellular component: trans-Golgi network membrane GO:0032588 3 0.1 0.0001 0.0015 DQB2 HLA-DRA, HLA- cellular component: MHC protein complex GO:0042611 3 0.09 8.02E-05 0.0015 DQB2, CD1A HLA-DRA, VANGL1, MS4A2, OR51G1, OR2H1, LRRC38, OR8D4, OR1A2, CD1A, SLC34A2, NKAIN3, CD1E, OR4K14, IYD, GP5, C1orf186, FCRL5, AP1S3, MAS1L, TAAR6, GPRC5A, ZP4, SFXN4, UMODL1, GLT6D1, OR2T2, OR5B12, TMEM207, GIMAP5, OR4F6, SERPINA9, PKD1L1, SPTA1, PROM1, OPRK1, OR14A16, C18orf26, HLA- DQB2, OR6N2, OR13A1, cellular component: membrane GO:0016020 43 29.9 0.0007 0.0091 OR1L8, HLA-DRA, HLA- cellular component - integral to lumenal side of endoplasmic reticulum membrane GO:0071556 2 0.05 0.001 0.0116 DQB2 HLA-DRA, HLA- cellular component - MHC class II protein complex GO:0042613 2 0.07 0.0021 0.0218 DQB2 Supplemental Table 11: A comparison of nonsynonymous SNPs (benign + damaging) in top 1% ape diversity genes between Neandertal and modern human populations Population Neandertal African European Asian Total Top 1% Nonsyn. SNPs 23 85 40 51 Total 99% Nonsyn. SNPs 1726 4721 3346 3476 Total Nonsynonymous SNPs 1749 4806 3386 3527 Fisher's Exact Test P-value - 0.2280 0.6894 0.8038