© 2014 Nature America, Inc. All rights reserved. addressed addressed to P.L.D. ( General Hospital, Boston, USA. Massachusetts, McGill University, Montreal, Québec, Canada. Cambridge, USA. Massachusetts, Rush University Medical Center, Chicago, Illinois, USA. of Mayo Neuroscience, Clinic, Florida, Jacksonville, USA. USA. Massachusetts, Boston, USA. Massachusetts, 1 expressed differentially these that and associ pathology AD independently with also ated was regions methylated these in differentially found of an were transcription the changes that AD, of these feature that early AD, to relation in DNA levels altered methylation exhibited genome human the in dinucleotides CpG rela in We 71 discrete that found of AD neuropathology. burden to the tion methylated differentially were that genome the of regions identify to effort discovery rigorous statistically a out carried AD studying when considered carefully to be has that age increasing with genome the throughout sites many at methylation associations robust returned yet not have and conflicting sometimes are AD of studies inhibitors HDAC with patients AD of treatment off-label in and AD of systems model in (HDAC) deacetylases histone of manipulation the from comes AD influence a or factor diagnosis risk disease a as such individual, given a of experiences life the with correlated be could loci certain in DNA methylation in and individual an individuals in time across over stable variable highly both be can CpG dinucleotides certain at levels methylation DNA that emerging is Evidence genes connect to a known AD gene susceptibility network. changes may have a role in the onset of AD given that we observed them in subjects presymptomatic and that six of the validated in AD: Furthermore, we functionally validated these CpG associations and identified the nearby genes whose RNA expression was altered AD variants. susceptibility We validated 11 of the methylated differentially regions in an independent set of 117 subjects. associated significantly with the burden of AD pathology, including CpGs in the relation to Alzheimer’s disease (AD). We found that the level of methylation at 71 of the 415,848 interrogated CpGs was We used a collection of 708 collected prospectively autopsied brains to assess the methylation state of the brain’s DNA in Jonathan Mill E Bradley Bernstein Steven G Younkin Towfique Raj YuLei Philip L De Jager methylation at Alzheimer’s disease: early alterations in brain DNA nature nature Received 5 May; accepted 16 July; published online 17 August 2014; Program Program in Translational Genomics, NeuroPsychiatric Institute for the Departments Neurosciences, of Neurology and Psychiatry, Brigham and Women’s Hospital, ANK1 8 NEUR , , Matthew L Eaton 3 . The most compelling evidence that the epigenome may may epigenome the that evidence compelling most . The , , OSCI CDH23 1 4 [email protected] – , 4 3 5 University University of Exeter Medical School, University of Exeter, Exeter, UK. EN , , Joseph Replogle & David A Bennett 6– 1 9 C – 6 . Furthermore, there is a gradual increase in in increase gradual a is there Furthermore, . 3 , , , , Fanggeng Zou E , , Gyan Srivastava

2 DIP2A 2 , Harvard Harvard Medical School, Boston, USA. Massachusetts, 12 advance online publication online advance 10 , 13 Genetic Genetic Analysis Platform, Broad Institute, Cambridge, USA. Massachusetts, , , Alex Meissner 3 , , , 4 9 RHBDF2 , 1 5 , , Brendan T Keenan , . To date, results of epigenomic epigenomic of results date, To . 2 ) ) or D.A.B. ( . This suggests that differences differences that suggests This . ANK1 12 14 Epigenomics Epigenomics Program, Broad Institute, Cambridge, USA. Massachusetts, 1 Harvard Harvard Stem Cell Institute, Harvard University, Cambridge, USA. Massachusetts, should Correspondence be – 6 , , 3 , , Moshe Szyf 8 RPL13 , , Wendy Brodeur 1 [email protected] , 9 3 Computer Computer Science and Artificial Intelligence Laboratory (CSAIL), Institute Massachusetts of Technology, , , Katie Lunnon 7 Department Department of Neurology, Mayo Clinic, Florida, Jacksonville, USA. 9 , , , 12 SERPINF1 , , 14 BIN1 , , Nilufer Ertekin-Taner 8

, 1 11 10– doi:10.1038/nn.378 , 3 , Charles , B Charles Epstein , , Jason Ernst 1 2 . . We and 10 4 - - , , Stacey Gabriel , 5 ). , , Jeremy Burgess SERPINF2 , To technically validate the nature of our data, we compared our our compared we data, our of nature the validate technically To AD of diagnosis pathologic a meeting of amyloid range pathology burden at a the time of death, with 60.8% of displayed subjects and cognitively declined subjects some study, the of course the Over population. older the of selection random a at were non-impaired the subjects cognitively study entry, we studied that Given death. of time the at donation brain include that aging of studies cohort prospective two (MAP), Project Aging and Memory the or (ROS) Study Order Religious the of part were subjects These glia. as such cells parenchymal other and populations neuronal ent differ of primarily composed tissue of piece a each profiled we from sample, matter gray the out dissected we that Given each individual. from obtained cortex prefrontal dorsolateral of profiles sample a and methylation beadset HumanMethylation450 These Illumina the using subjects. generated were 708 in dinucleotides CpG discrete 415,848 at measures methylation of consisted set data Our Description of subjects and data RESULTS AD network defined genetically susceptibility reported previously a to connected genes 3 Program Program in Medical and Population Genetics, Broad Institute, Cambridge, RHBDF2 6 3 5 , Institute Institute of Psychiatry, King’s College London, London, UK. 9 , , Cristin McCabe . . Our analyses suggest that these DNA methylation 12 6 ABCA7 , , , Julie A Schneider 10 7 , Lori , B Lori Chibnik , , High S Chai 6 , 7 , , Leonard C Schalkwyk and 1 11 3 . Department Department of Pharmacology and Therapeutics, and other loci BIN1 3 13 , , Anna Tang Department Department of Pathology, Massachusetts regions, which harbor known 8 6 Rush Rush Alzheimer’s Disease Center, , 7 , , Curtis Younkin 1 – 1 8 4 3 , ( , , Manolis Kellis

Supplementary Table 1a Supplementary 1 ,

4 t r a , 5 ,

6 6 C I Department Department , 3

, 9 e l ,

s ). ).  -

© 2014 Nature America, Inc. All rights reserved. significance; thus, significance; to enhance the clarity of the figure for the other , these two chromosomes are not included in the figure. Not all genes (–log( in black. The next red circle reports the results of the association of RNA expression level of these genes to a diagnosis of AD in the Mayo clinic data set alleles, susceptibility are highlighted in red letters. The subset of the genes with differential mRNA expression in AD in the Mayo clinic data set is shown colored circle reports the names of genes found within 50 kb of each associated CpG (light blue letters). The DNA methylation levels to NP burden. Similarly, the first red circle reports the −log( 0–20), we report the results for each of the 71 associated CpGs found in 60 independent methylated differentially regions (DMR) from the analysis relating given genomic segment (range, 0–200 probes per 100 kb). The blue circle reports the results of the DNA methylation scan: using a −log( centromere highlighted in red. The next circle (green) reports the density of CpG probes successfully sampled by the Illumina beadset that are present in a presents the physical position along each (in Mb). The cytogenetic bands of each chromosome are presented in the first circle, with the RNA data. Each sector of this diagram presents summary results of the three different analyses in a chromosome. The perimeter of this circular figure Figure 1 sub four these in subjects): AD two and non-impaired (two jects DNA in of four sub brain the same samples the from generated data sequence methylation DNA genome-wide to data Illumina-derived t r a  found in the associated regions are listed in the figure. For clarity, only a subset of genes are selected from loci significant in the discovery analysis.

chr13 P chr12 10

50 75 chr14 12 25

); ); range, 0–20). The central circle reports the set of validated CpGs Chromosomes 9 and 18 contain no CpG that meets a threshold of genome-wide

0

10 25

0 50

0

5 C I 75

Summary Summary of the genome-wide brain DNA methylation scan for NP burden and its validation using independent DNA methylation data and brain 75 0 10

50 0

25 0 25 chr15

e l

12 50

0

chr11

10 5 75

0 10 s

75 0

0

50 25 chr16 25 50

12 0 75

5 0

chr10 10 0 25 chr17

EP400 DDX54 RNF34 75 GPR 50

DDX47

C5A

EP400NL C1 50 HYL 75

DDX25 ZNF385A

ACAC

2or

0 S1 ABCD4 C IT DTX1

f52 chr19

25 OPZ1 GA

DIXDC1IL1

VWF B 25

HSPB2 5 TS 0RA TJ

CR P1

0 PA

YA

PI 50

DDB1 N3

TPN

OR VWCE B NT5M

A

NXN 0 12 RPL1

5 AOV

CRK RNF1 MY THRA

O1

CO

SERPINF1

1 CO

3

MR C

cg1172498 chr20

X4I1

X4NB

CSNK1D 41 25

10 SERPINF2

SPG7

chr8 0 VI1

CDH23 NP PA

LO

LM C4

RHBDF

cg16733298

FS

AL MED2 50

C1 TL3

75 OX GPX4

0or ALDH1

4

5 TBC1D1

cg03169557 2 SLC1 f1 6A1

0 0

4

7A7

6

GA

cg19803550

AB

SIX5 CNN2

50 SLC45A4 A CA chr21

cg05810363 7

25

cg23968456 LY

PD

FB

5 cg13076843 XO

CRL 46

25 AP3M2 S1

HCK PLA ANK1 0

0 T DNAJB6

chr22

IC OSLG

150 SHH 25

RBM33 cg00621289 DIP2 cg05066959 A

PODXL - - CLDN5

DD0 ST

12 50

5 MKLN1 0

0

10 UBE3C TNFRSF4 consistent with prior reports prior with consistent technologies, two the by generated methylation of levels estimated (mean we strong correlation found a very jects,

cg22962123 cg15821544 CDA chr7 cg25594100

GNA12 PINK1

75

P IQCE NCDN 25

DAGLB TF ) ) (range, 0–10) for the 71 CpGs in the replication analysis. The cream- DLL1 AP2E

PA RAC1 RS PACRG 50 2

BC 50

AR3 TTYH3

advance online publication online advance 25

TRAM2 75

MDC1

KIAA0319 NU 0

GRK AK2 100 F1

6

2

RGS14

chr1 IER3 15

SH3PXD2B 0 12

EP 5 KIAA014

FOXC AS

RNF14 PCBP1

12 1

5 15

1 0

1 BIN1 PRR7

10 NOSTRIN 0 17 FZD7 5

ABCA7 CAB39

chr6 75 200 SLC30A 1 ATG3 CCDC48 CCNL1

5 50 225 .

and

9

25 0 0

BIN1 17 25

5

15 50

0

regions, which harbor AD 12 75

nature nature

5

10 10

0 0

r

75 12

= 0.97) between the = between 0.97)

chr5 5 chr2 50 15

P 0 25 17 NEUR ) ) scale (range,

5

0 200 17 225

0 15 5

OSCI

12 25

0

10 50

5

75 75

50

0 10 25 12

0 15

17

0 EN 5

0

5 chr4 chr3 C E © 2014 Nature America, Inc. All rights reserved. significant inourdiscovery analysisandfoundinADsusceptibility lociarelistedinthe‘KnownADloci’. Chr, chromosome; pathoAD,pathologicdiagnosis ofAD. accounts forthetesting ofprobesgenome-wide.Listedareall genesfoundwithinasegment50kbupstream anddownstreamoftheassociated CpG.ThoseCpGsthatwere result ofalogisticregressionrelatingthe level ofmethylationagivenCpGtopathologicdiagnosisAD.Thethreshold ofgenome-widesignificanceis Model 3referstothesecondaryanalysis that includedavariablefortheestimatedproportionofneuronalcellsintissue andsurrogatevariables.TheADanalysisreportsthe The NPanalysisreportstheresultsofalinear regressionanalysisrelatingDNAmethylation leveltotheburdenofneuriticplaque.Model1 referstotheprimaryanalysisand cg02308560 cg22883290 cg05810363 cg05066959 cg03169557 cg19803550 cg22962123 cg16733298 cg15821544 cg23968456 cg11724984 CpG Table 1 target the be therefore may and region. given a in gene(s) expressed narrow CpGs differentially to methylated helped were differentially related it the and near genes region, change) the of this which down in a occurred that (transcriptional disease showing the effect by to region biological role methylated the meaningful differentially confirmed given further This a it subjects. of goals: non-AD two stage and accomplished AD in from strategy replicated obtained were mRNA that using the of 2 regions role the methylated validate functionally differentially to pathology significantly subjects. attempted of we set the AD 3, independent stage an replicated In in with 1 we stage 2, from CpGs correlate stage associated In levels Methods). methylation (Online regions which chromosomal for in screen methylation DNA a out carried ( stages three involves strategy analytic Our regions methylated differentially identify to study Discovery ( experiment our in quality data of assessments for lines profiled were that cell HapMap individuals from lymphoblastic and samples cortical our between profiles methylation DNA in differences more many found we expected, As subjects. older these of each of course life different very the despite in variation interindividual significant show not ( correlation of methylation levels for all possible subject pairs was 0.98 profiles across our subject population, we noted that the mean Pearson nature nature cg00621289 cg25594100 cg13076843 Supplementary Supplementary Fig. 1 Notably, when examining the nature of human cortical methylation

NEUR CpGs CpGs associated with amyloid burden: validated CpGs and CpGs in known AD loci 19 17 16 10 17 21 17 16 12 Chr 2 8 7 7 1 OSCI 127800646 121890864 74475270 41519308 89598950 27153605 19127132 43473840 73521631 47855916 74475294 Position 1071176 1637391 4786943 EN (bp) C ), suggesting that that majority of CpG sites did E

advance online publication online advance Supplementary Fig. 2 Fig. Supplementary NP burden Estimate model 1 2.19 4.41 2.95 2.69 4.86 4.36 2.75 3.52 4.97 3.02 3.15 2.35 1.7 3.5 3.06 3.06 × 10 3.73 3.73 × 10 3.68 × 10 7.13 × 10 1.12 1.12 × 10 1.17 × 10 3.97 × 10 3.99 3.99 × 10 1.04 1.04 × 10 6.48 × 10 2.54 × 10 1.68 × 10 5.24 × 10 4.76 × 10 P Fig. Fig. −8 −8 −7 −7 −10 −10 −14 −10 −11 −9 −8 −9 −8 −8 NP burden

). methylation levels levels methylation Estimate model 3 1 3.62 4.44 2.86 2.78 4.88 3.08 3.96 5.11 4.79 4.48 4.22 2.29 4.48 5.06 ). In stage 1, we we 1, stage In ). Discovery study

2.45 2.45 × 10 8.97 × 10 8.11 8.11 × 10 7.56 × 10 3.36 3.36 × 10 1.60 × 10 8.79 × 10 1.32 × 10 8.09 × 10 8.97 × 10 9.75 × 10 2.18 2.18 × 10 3.33 × 10 5.99 × 10 Known Known AD loci P −12 −8 −9 −13 −9 −13 −9 −11 −9 −11 −8 −11 −13 −8

technical or other artifacts. Of the 137 CpGs discovered in the the in discovered CpGs 137 the Of artifacts. cryptic other capture or may and do technical that confounders data known with methylation the correlate in not structure capture surrogate as that well as variables neurons of proportion the vari the captures that included able that analysis secondary a performed associated we CpGs, conservatively most the on only focus to However, sis. ( AD to related not was sample each in found neurons of the As proportion measured. was that CpG each of testing the for correct and hypotheses multiple of testing the for account to con strategy but simple, servative, this chose we unknown, currently is in genome the methylation of units independent functionally of number exact standard a on correction Bonferroni a ing impos by CpGs tested 415,848 all of testing the for accounts cance Table ( pathology NP of burden the with associated were AD of diagnosis a for criteria neuropathologic which of meet some pathology, NP of range a display individuals intact cognitively as subject, deceased a of brain the of state the captures better burden NP neuropathology. disease Alzheimer’s of measure a key (NP), plaques amyloid of neuritic burden the with correlates of methylation level whose CpGs autosomal we identified segment and were highly correlated in their level of methylation. methylation. chromosomal of level same their in the in correlated highly found were and were segment CpGs 71 these of Some con ( analysis more the secondary in servative significant remained CpGs 71 analysis, primary Estimate P 13.7 15.9 11.3 11.2 18.6 19 17.5 14.9 10.8 12.9 18.9 16.8 16.7 In the primary analysis of our cortical methylation profiles (stage 1), = 0.08), we did not include this as a term in the primary analy primary the in term a as this include not did we 0.08), = AD 9.32 1 and and 1.77 1.77 × 10 1.83 × 10 3.36 × 10 4.88 × 10 1.67 1.67 × 10 5.56 × 10 9.76 × 10 1.46 × 10 1.45 × 10 1.27 × 10 6.81 6.81 × 10 3.98 × 10 1.83 × 10 5.81 × 10 Supplementary Table 2 Table Supplementary P −8 −5 −7 −10 −6 −7 −7 −8 −7 −8 −6 −7 −5 −7 Estimate Braak score 3.60 0.96 2.76 3.45 1.52 5.13 3.58 2.47 1.62 3.40 1.51 2.02 4.09 3.02 Replication study 0.011 0.0067 7.93 7.93 × 10 6.48 × 10 6.05 × 10 2.79 2.79 × 10 2.30 × 10 2.08 × 10 8.27 × 10 5.44 × 10 5.61 5.61 × 10 4.95 × 10 4.19 × 10 2.99 × 10 Table P 1 and and −4 −4 −4 −4 −4 −4 −6 −6 −4 −4 −4 −4 ). This threshold of signifi of threshold This ). CNN2 BIN1 UBE2O AGPAT6 ANKRD11 PRPF8 PCNT FOXK1 UBE2O HOXA1 COQ7 SLC2A1 CDH23 RNF34, KDM2B Supplementary Table 2 Supplementary POLR2E AX747521 ANK1 RPL13 HOXA4 AK311383 AK291164 SERPINF2 AF070569 RADIL AX747521 HOXA7 HOXA5 P < 0.05. Given that the the that Given 0.05. < associated CpG Genes within50kbof , , , , , , , , , , DIP2A ITPRIPL2 , , , , ABCA7 , , , , , , , , TLCD2 AP5Z1 HOTAIRM1 C10orf105 AANAT AANAT P , , NKX6-3 , , , , FLJ32224 t r a <1.12×10 , , CPNE7 , , HOXA6 HOXA9 LOC100133311 SPG7 GPX4 , P , SERPINF1 , , , , 16 , , , < 1.20 × 10 × 1.20 < MIR22 , , CYGB CYGB BC035889 HOXA3 , , ( , , , , , HMHA1 1 MIR22HG KIAA0415

RHBDF2 RHBDF2 quantitative quantitative 7 , , , , , , , , , , . 137 CpGs CpGs 137 . JA429246 SNORD68 C I SBNO2 DQ655986 HOXA10 , , , , , , , , C10orf54 HOXA2 , PRCD PRCD , −7 WDR81 , , e l SMYD4 , which

, , , , ), ),

, −7 , , s , ). ). ,  ------, ;

© 2014 Nature America, Inc. All rights reserved. les: cg22883290 in the the in cg22883290 les: Table 2 strongly correlated with a key measure of AD neuropathology. that DNAcortical methylation of a large number of discrete regions is and correlated are regions genomic certain of levels methylation that model, they explained 28.7% of the variance in NP burden, suggesting one comprehensive in CpGs 71 all If consider burden. we NP in ance and all known AD variants and CR1 the exception of tion of variance explained by genetic variants associated with AD, with of the variance in NP burden. However, this is greater than the propor den: on average, each the 71 CpGs explained 5.0% (range = 3.7–9.7%) and effect. same the analysis captured our in probably and significant being as emerged methylation of levels correlated with CpGs neighboring three to up regions, 60 7 these of ( genome the differentially throughout discrete distributed regions 60 in methylated found were CpGs 71 the Altogether, is it that suggest analyses RNA our and genes, two to proximity close in found was CpG associated An significance. of threshold our met which cg13076843, on centered ( regions. transcribed actively of periphery the on found a conformation in largely was which 3 just a region in located be to appeared CpG associated the and cortex, prefrontal dorsolateral older healthy, in conformation transcribed the Overall, bins. 200-bp in derived was state chromatin The pathology. AD-related minimal with individuals older unimpaired healthy, in assessed as shown, is region the of state chromatin the segment, DNA this in found genes the of diagram the above Finally, | the using reported is (cg22883290) CpG best the of methylation DNA of level the with correlates CpG a given at level methylation DNA which to extent The point. a given at probes CpG of density the reports line blue vertical The analysis. this for significance of threshold the highlights line blue dotted horizontal The region. this in tested CpG one represents diamond Each studies. association genome-wide in susceptibility AD with the in cg22883290 around ( modest. was distributions two the between difference absolute the and overlapped, distributions two the However, subjects. control the of that from different significantly statistically was subjects AD of distribution the that We found samples. these in methylated completely is CpG the that indicates 1.0 of value green, light (control, criteria diagnostic red, (case, AD of diagnosis a neuropathologic having as classified were that subjects for CpG that at values methylation DNA of distribution the presenting histogram a smoothed is Shown significance. of of AD in 82% of the CpGs threshold that met our on with observed, a average, we diagnosis that levels methylation ( DMRs the from probes, AD-associated most ( associations. of distribution regional and CpGs 2 Figure t r a  The susceptibility variant rs744373 was moderately associated with with associated moderately was rs744373 variant susceptibility The ( region to AD in this association genetic the captures best from the index single nucleotide polymorphism (SNP), rs744373, that cg22883290 is located 5 kb from the 5 the in cg02308560 a

Notably, two of the 71 significantly associated CpGs ( ( CpGs associated significantly Individually,the of anyone Density 10 Supplementary Table 2 0 2 4 6 8 susceptibility allele explains just 1% of variance in NP burden NP in ofvariance 1% just explains allele susceptibility 0.55 P Table ) ) were found in that loci alle harbor known AD susceptibility

=2.16 C I

Extent of differences in methylation levels at associated associated at levels methylation in differences of Extent 0.60 n Methylation level = 460) and those subjects that did not meet these these meet not did that subjects those and = 460) 0.65 1 e l cg09448088 × ), were selected to illustrate the increase in increase the illustrate to selected ), were

0.70 10 RHBDF2 APOE –9 s 0.75 ABCA7

0.80 . For example, in our subjects, the well-validated BIN1

that is the target of the DMR, as its expression was altered relative to AD ( AD to relative altered was expression its as DMR, the of target the is that 0.85 BIN1 (beta = 3.62, 3.62, = (beta locus ) had a modest effect on the brain’s NP bur locus (beta = 4.44, 4.44, = (beta locus APOE DMR that has previously been associated associated been previously has that DMR b

MCF2L Density 12 10 0 2 4 6 8 n

′ 0.75 ε end of the = 263). A methylation A methylation = 263). 4 P account for 13.9% of the vari =7.84 c

( 0.80 ) Regional association plot plot association ) Regional a ) and ) and Methylation level a

0.85 cg11823178 , × b P 10 ) ) Two the of = 2.45 × 10 × 2.45 = P BIN1 ANK1 0.90 –10 = 9.00 × 10 × 9.00 =

Supplementary 0.95 gene and 92 kb

(

b

) 1.00

Fig. 2 Fig. Fig. Fig.

−12

Table −8

) 1 ) and and ) 19– c ): in in ): d c )

1 2 1 Observed (–logP) Observed (–logP)

9 2 8 1 10 10 - - - - . . , 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9

74,330 AD AD at any one associated probe was modest ( of AD. As noted above, the increased level of methylation in relation to methylated regions were more methylated in subjects with a diagnosis differentially the of 82% associations: these of direction the in zation is one criterion for a neuropathologic AD diagnosis. We noted a polari of association ( the CpGs associated with NP burden displayed at least some evidence ( significance of level genome-wide a at AD of diagnosis a with associated also were CpGs post-mortem, neuropathologic diagnosis of AD. 22 of the NP ary analysis correlating the level of methylation at these 71 CpGs with a process. disease the on and effects independent have that (genetic epigenetic) variation genomic of sources different by affected be therefore may AD of risk Overall, susceptibility. AD on effects ent SNPs in of regions, both and these to CpGs appeared have independ kb away, 25 is so, which at cg02308560, of methylation level the with burden NP with associated was of case In the methylation. CpG by could to disease SNP’sof the for mediation association test not formally therefore we and susceptibility, AD with associated not was = 4.37, beta covariate: a as rs744373 with (model burden NP to association the CpG of size effect the change meaningfully not did rs744373 for ing by adjust was not driven variant: the AD with pathology association the level of methylation at cg22883290 ( PRPSAP 127,660 Strongly transcribedregion Weakly transcribedregion Weak enhancer Active enhancer Strong promotor Inactive/poised promotor Active promotor Repetitive Heterochromatin Low signal PC/repressed To facilitate the interpretation of our results, we performed a second Strongly transcribedregion Weakly transcribedregion Weak enhancer Active enhancer Strong promotor Inactive/poised promotor Active promotor Repetitive Heterochromatin Low signal PC/repressed S 1 advance online publication online advance PHK1 74,390 127,720 P d = 4.91 × = 10 4.91 ) Regional association plot around the the around plot association ) Regional P < 0.001) with AD. This is not surprising, as NP burden UBE2O cg0624533 Chromosome 17position(kb) Chromosome 2position(kb) Table Table 74,450 127,780 AANA RHBDF −7 cg05810363 BIN1 2 ). In our data set of modest size, rs744373 rs744373 size, ). of In set modest our data 1 8 cg2288329 ). and BIN1 T RHBDF region 2 region 2 cg1307684 ABCA7 3 Supplementary Table Supplementary 2 gene appeared to be in an open, open, an in be to appeared gene , but had no association ( association no had but , 2 74,510 127,840 BIN1 0 P C = 0.0003). However, the CpG YG , the index SNP (rs3764650) (rs3764650) SNP index the , 3 PRCD

B nature nature ST6GALNAC2 Fig. 74,570 127,900 2 RHBDF2 ). NEUR r CYP27C

′ | to the gene, gene, the to 2 ST6GA- value. value. 0.5 0.5 LNAC1 1.0 1.0 ), and all of all and ), DMR, DMR, 0 0 - OSCI 74,630 associated 127,960 | | P r r 1 | | = 0.07) 0.07) = EN 0 1 2 3 4 5 6 7 8 0 1 2 3 4 5 6 7

C Illumina450 probes per kb per probes Illumina450 kb per probes Illumina450 E - - - - © 2014 Nature America, Inc. All rights reserved. diminished given the reduced sample size ( the subjects, of non-impaired In subset the asymptomatic. was that of pathology amyloid mulation of large accu demonstrated fraction non-impaired, older individuals studies imaging and neuropathological in well-documented time of death (no AD and no mild cognitive impairment). As has been those subjects who were deemed to be cognitively non-impaired at the to analysis NP the limited we AD, of process neurodegenerative the of effect an or cause a is regions of associated level the in methylation DNA increased the whether of question the explore to begin To methylation in alterations same the display subjects non-impaired Cognitively ( tion ABCA7 the Both profiled. are sizes sample larger 4 Fig. of most of size ( sets effect data two the across the consistent was CpGs that these found we CpG, 71 of set entire the evaluating when of association: evidence suggestive displayed CpGs other many much CpGs, significant a the to addition In screen’s and results. discov ery our of replication pathology robust AD observed we of size, sample measure smaller related, but different, a ( validated were study discovery the near from regions methylated region differentially the of 11 methylated differentially this same in significant were CpGs ( 12 analysis that found and analysis this in ( platform HumanMethylation450 Illumina same the using cortex tal staging) (Braak of pathology AD of measure collection quantitative different a with independent subjects 117 an in the CpGs evaluated associated we results, significantly our 71 of robustness the assess further To set sample CpG in an independent of Validation the associated nature nature not were effect, association’s the of magnitude the capture which Supplementary Table 1b Table Supplementary Table ), suggesting that most of these CpGs will be validated as as validated be will CpGs these of most that suggesting ), b a (

NEUR Proportion of probes P Table Table PC repressed 0. 0. 0. 0. 0. = 0.011) CpGs displayed suggestive evidence of replica of evidence suggestive displayed CpGs 0.011) = 0 1 2 3 4 5 1 ). OSCI

2 Low signal 1 4 Supplementary Fig. 3 Fig. Supplementary Strongly transcribedregion

); given that two of these CpGs were found in the the in found were CpGs these of two that given ); Heterochromatin . These subjects were profiled in a sample of fron of sample a in profiled were subjects These . Weakly transcribedregion Inactive/poised promoter 0.038 EN C Heterochromatin E Strong promoter Active enhancer Weak enhancer Active promoter

Repetitive PC/represse Active promoter advance online publication online advance Inactive poised promoter Low signal ). We imposed a Bonferroni correction correction Bonferroni a imposed We ). Repetitiv P 8 values for the CpG associations were for CpG the associations values × e d 10 Strong promoter –4 H3K36me3

Active enhancer Input ). Thus, despite the use of of use the despite Thus, ). n

BIN1 H3K9me3 Mark = 237), but the beta values, Weakly transcribed region Weak enhancer H3K27me3

( Strongly transcribed region H3K4me3 RHBDF2 9 P

× H3K9Ac = 0.0067) and the the and 0.0067) = 10 Top 71CpG Illumina 450K Supplementary Supplementary H3K4me1 –3 0.028 (

i. 2 Fig. s 25 , 2 6 d , a , ), ), - - - -

of whether altered DNA methylation contributes to the pathologic pathologic the to contributes methylation DNA altered whether of question The process. dementing of the stages later the to secondary stage of the disease. These DNA methylation changes are therefore not presymptomatic the in occurs and pathology AD of feature early an is study in our discovery that we DNAidentified altered methylation ( collection sample from the entire calculated values from the beta different significantly analyzed. were that dinucleotides CpG 415,848 all in found states chromatin of distribution ( plaques neuritic with associated was level methylation whose dinucleotides CpG associated 71 those at found states chromatin of distribution the compares histogram The found. were dinucleotides CpG interrogated the of each which in state chromatin ( marks. histone of complement a unique had state chromatin Each mark. histone particular a for immunoprecipitation after DNA of a segment in found sequences of abundance relative the displays graphically high) blue, low; (white, heatmap The examination. neuropathological on pathology minimal had and death of time the at non-impaired cognitively were that used were subjects MAP states. chromatin discrete 11 1 of to segment chromosomal each assigned we effort, Roadmap Epigenomic Health’s of Institute National the by generated data Using cortex. prefrontal dorsolateral the ( cortex. mid-frontal in found states chromatin 11 3 Figure available probes for each gene. was diagnosis, withnon-ADbeingthereference subjectgroup.Thesignificancethreshold Beta referstoeffectsizeinthelinearregression associatinggeneexpressionandAD AGPAT6 CYGB SPG7 WDR81 RNF34 UBE2O FOXK1 C10orf54 ANKRD11 COQ7 PCNT AP5Z1 SLC2A1 PRPF8 SMYD4 ANK1 SERPINF2 SERPINF1 RHBDF2 RPL13 DIP2A CDH23 Gene amyloid burden Table 2 It is derived from two MAP subjects who were cognitively non-impairedEpigenomics Roadmap team ( the dorsolateral prefrontal cortex, generated in collaboration with the of map chromatin a to relation in results our interpreted we CpGs, associated the of consequences functional the understand Tobetter Distribution of associated CpGs among different chromatin states neurodegenerative the of open. remains process epiphenomenon early an is or process P <0.0023given22 tests. Resultsareshownformeanexpression levelacross

Analyses Analyses of data transcriptional in loci associated with Distribution of CpGs associated ( associated CpGs of Distribution b ) We used the chromatin map in in map chromatin the ) We used Supplementary Table 3 Table Supplementary −0.02 −0.03 −0.05 −0.02 −0.02 −0.04 −0.17 −0.14 −0.08 0 0.04 0.02 0.07 0.02 0.03 0.05 0.02 0.03 0.2 0.22 0.07 0.58 Beta h ttp://www.roadm Mayo Clinicsubjects AD diagnosis P Table < 1.2 × 10 < 1.2 ). This suggests that the the that suggests This ). 1 a ) to the overall overall the ) to ) Chromatin map of map ) Chromatin apepigenomics.or a to identify the the identify to t r a −7 <0.0001 <0.0001 <0.0001 ) with NP among among NP ) with 0.9 0.62 0.54 0.39 0.30 0.28 0.28 0.25 0.24 0.22 0.15 0.15 0.13 0.13 0.065 0.0012 0.0008 0.0004 0.0001 P C I

e l

g /

s ). 

© 2014 Nature America, Inc. All rights reserved. Supplementary Table 6 Supplementary ( associations cant signifi had genes selected 21 the of 7 of expression of level the that 5 Table ( Mayo by Clinic the assembled individuals non-AD methylated regions (DMRs) in an set of independent 202 AD and 197 ( vicinity the in found genes of sion expres of level the assessing by AD in role their evaluated we stage, replication the in validated been have that CpGs 12 the on Focusing associations CpG the of validation Functional to CpG islands ( in different genic features or in different structures in defined relation or inactive in the healthy older brain. There were no enrichments noted appear to primarily affect genomic regions that are weakly transcribed brain may not be strongly altered by AD. Rather, methylation changes drive fundamental cellular processes of neurons and glia in the healthy chromatin that the architecturesuggest data of strong promoters that promoter profile in the reference chromatin map ( we found a strong under-representation in regions displaying a strong state ( ( enhancers was an enrichment of associated CpGs in regions predicted to be weak the 71 associated CpGs were found in chromatinevery state, but there this tissue. Using this reference map, we that observed at least some of in found elements regulatory putative and states transcriptional the 1 of 11 chromatin states ( ment of the genome was annotated as being in methods established and profiles histone-modification Using ination. exam post-mortem on pathology associated AD- minimal had and death of time the at gray.in shown are AD with associated be to known not connecting The meaning. no have colors the associations); epigenomic and (genetic AD in implicated genes by encoded 6 Fig. also (see process same the in DIP2A implicate and biology amyloid on SORL1 and PLD3 both of effects reported the with consistent are interconnections These (APP). precursor amyloid the to connects that allele, susceptibility AD a common with a gene to and allele AD susceptibility a has rare described recently the connects DIP2A Furthermore, left). (bottom network the of component amyloid the of elements different to connect SERPINF2 and SERPINF1 Notably, network. this in role a central has that susceptibility AD with associated genetically kinase tyrosine a protein PTK2B, to connects and green in figure the of top the at displayed is RHBDF2 example, For AD. with associated be to known not is that protein interactor common one for allowing analysis an of results the are Shown AD. to relative expressed differentially also were that DMRs in found genes seven the and genes) Mendelian and (susceptibility genes AD known among connectivity of extent the evaluated algorithm DAPPLE the data, interaction protein protein- Using genes. susceptibility AD known of a to network connect screen methylation 4 Figure t r a  SERPINF1

). The colored nodes are the proteins proteins the are nodes colored The ). P = ( 0.028)

C I ) that have RNA data from the temporal cortex. We found found We cortex. temporal the from data RNA have that ) Genes identified in our DNA DNA our in identified Genes and and P e l = 0.0098) or to be in a weakly transcribed chromatin chromatin transcribed weakly a in be to or 0.0098) = SERPINF2 Supplementary Fig. 5a Supplementary s Fig. Fig. 3 P < 0.0023) with AD in these data ( data these in AD with 0.0023) < b ): ): and Fig. Fig. 3 2 7 . ANK1 , each 200-bp seg 200-bp each , Supplementary Supplementary Supplementary TableSupplementary 4 PLD3 a ) ) that capture , , CDH23 gene that that gene SORL1 ± 50 kb) of these differentially differentially these of kb) 50 , b ). , , DIP2A - - , P SERPINF , , = 8 × 10 RHBDF2 DIP2A Supplementary Supplementary PLD3 ). ). Furthermore, 2 Table SORL1 −4 , , ). ). These RPL13 2 APO and and E - - , DSG Illumina platform, such as surveying only a fraction of the human human the of fraction a only surveying as such platform, Illumina the of result a as limitations certain has study our although Overall, DISCUSSION levels. RNA expression altered with genes seven the included that analysis network the of iteration the in significant ( ( studies genetic from derived network ceptibility RPL13 regions— methylated differentially the in found genes expressed of differentially the in AD. Several expression dated differentially methylated regions that also displayed vali alteredthe in RNAfound genes seven the adding after analysis the repeated a interactor, sharing common genes susceptibility of (proportion connectivity indirect ( connectivity direct of terms in both significant was genes from susceptibility genome-wide of association studies network and Mendelianthe AD that genes found we First, variation. rare of the latest results from genome-wide association studies and the studies data interaction protein protein- from derived network susceptibility AD an of existence the protein of pairs and interacting adjusts for gene and size, we reported c requires which Wemethod, this genes. used previously connectivity of these genes with the the evaluate to networkalgorithm DAPPLE ofthe used knownwe genes, AD AD susceptibility­validated well- to relation in genes seven these of role the evaluate Tofurther genes AD known with results our Integrating CLU P 2 APP = 0.0072) and indirect ( and indirect = 0.0072) PSEN1 PICALM INPP5D PSEN , , SERPINF1 advance online publication online advance 2 CD2AP RHBDF2 BIN1 and and PTK2B P 007 ( 0.037) = RIN3 SERPINF2 1 SERPINF 3 . We used an updated model that includes includes that model updated an We . used TREM P CD3 MEF2 RPL13 = 0.042) network connectivity remained remained connectivity network = 0.042) 3 PFKFB2 2 C 1 MAP3K2 Supplementary Fig. Supplementary —were connected to the AD sus AD the to connected —were NUP160 TREM EPHA1

ANK1 1 nature nature , , ANK1 Fig. Fig. HLA−DRB DIP2A NEUR P = 0.0072) and and 0.0072) = 4

). The direct direct The ). 6 o ). We then then We ). 5 , , expression expression OSCI RHBDF2 HLA−DRB EN C 1 E - - ,

© 2014 Nature America, Inc. All rights reserved. molecule as molecule well. Although little is known regarding the potential role as such genes, AD other Several in macrophages. and of infiltrating the microglia activation involved modulating cascade signaling the of element key a is that gene AD gene. this of outgrowth neurite to reduced leads ( AD in reduced Notably, amyloid machinery. amyloid on the of elements to effect connect also direct SERPINF2 a and to SERPINF1 of processing. related burden be the well to may relation pathology its NP processing, amyloid in SORL1 and sur cell a protein as receptor face function may DIP2A network. susceptibility AD the to connected not otherwise is that gene susceptibility AD reported known resolve to question. this conducted be to needs work experimental Further ogy. pathol AD of consequence early an are changes these that possible is it epigenome, the of plasticity the given causal: are changes methylation in observed the that state cannot we Thus, trait. a with tions associa reports only scan epigenome-wide our studies, genetic with as However, AD. of onset the in involved are changes methylation ( network susceptibility AD existing an ( genes methylated differentially the of six of connection ( subjects in non-impaired cognitively associations CpG the of presence the with along pathology, AD associations. by SNP driven not are tions associations genetic significant no found and subjects same the in trait same the with study association SNP a completed we genome-wide recently also AD tions with pathology: associa genetic by driven not are observed we that associations the variation can drive differences in DNA methylation for certain CpGs genetic although that, point the illustrate loci two These vicinity. its in found allele susceptibility of the independent was association this in the cg02308560 of expression the on integrate that effects independent have can CpG) and (SNP tion the of understanding our refine therefore results Our subjects. additional ( consistent was effect the of direction the analysis, replication small ( suggestive were results the although and, study, The bility locus and a recent report of enhanced BIN1 expression in AD AD suscepti our to study a links well-validated set data Mayo Clinic ( expression mRNA ( healthy brain older the in conformation chromatin transcribed poorly a healthy in be to likely the more were regions in associated the Instead, brain. older transcribed actively were that genes in found be to regions were in targeted methylated AD and ferentially were unlikely dif specific process: appear to genome-wide be of part a generalized, the and functionally methylation presymptomatic accumulation DNA of replicated, AD pathology. altered These changes do not between several associations found validated nonetheless we ylation), closely related chromatin marks (DNA two methylation and DNA hydroxymeth distinguish to inability array’s the and CpGs genome’s nature nature with is consistent PTK2B with of RHBDF2 connection the of ANK1, Table On the other hand, hand, other the On the to directly connects DIP2A map, network AD the at Looking The colocation of genetic susceptibility with CpG associations to to associations CpG with susceptibility genetic of colocation The BIN1 BIN1

Fig. 3 Fig. 1 SORL1 ), suggesting that the association is likely to validate with with validate to likely is association the that suggesting ), NEUR cg22883290 association was significant in our discovery discovery our in significant was association cg22883290 locus and suggest that different types of genomic varia genomic of types different that suggest and locus b OSCI susceptibility gene susceptibility ). Altered DNA methylation ( methylation DNA Altered ). Table Table ABCA7 BIN1 EN P 3 = 1.09 × 10 × 1.09 = 0 C and, given the putative role of PLD3 (ref. (ref. of PLD3 role putative the given and, 2 ANK1 E and influence AD susceptibility. Similarly, Similarly, susceptibility. AD influence and

), and its knockdown in an an in knockdown its and ), locus was associated to was locus associated AD pathology, and advance online publication online advance and and 2 CD33 −4 9 SERPINF1 and to indirectly 3 RHBDF2 Supplementary Table 3 Supplementary ) of the the of ) 2 , suggesting one potential effect effect one potential , suggesting and and Fig. Fig. 2 3 . Thus, our CpG associa CpG our Thus, . BIN1 EPHA1 connect to to connect mRNA expression was was expression mRNA Table 4 ), suggest that DNA that suggest ), gene in AD in the the in AD in gene P 1 = 0.0067) in the the in 0.0067) = , connect to this this to connect , PLD3 ) and enhanced enhanced and ) in vitro in PTK2B , , a recently

Fig. Fig. ) ) and the system system 4 , an an , ) to to ) 3 2 1 8 3 ------) . ,

RNA expression with AD ( AD with expression RNA CDH23 of assessment our in GFAP expression for term a adding that found we astrocytes, for marker surrogate a as GFAP expression using ple, some of alterations the to transcriptional cell Forcertain types. exam types cell ( expression mRNA model seems to account for the AD-associated differences in for an of using estimate an cells the RNA-based number of microglial in process. Consistent with adjusting the this, AD pathophysiological may be involved in the role of and macrophages microglia infiltrating text of AD ( but our data that suggest (EGF) factor growth TNF of release the in functions which protease, TACE/ADAM17 the of specificity that suggests also pathogens to responses immune systemic impairs in mice affects the normal release of TNF which releases TNF of transport TNF the known role of this in molecule myeloid cells: it is for necessary the goal for the development of AD therapies. AD of development the for goal the epigenome is remodeling a whether using systems, model fruitful and that contribute to to matin structure AD assess, pathophysiology aging chro of alterations the the map precisely more to need in clearly we brain, forward Going pathology. amyloid accumulated had but impairment, cognitive no displayed subjects while process, logic patho the in important early occur changes the epigenomic these that made observation also We organs. other or brain the in studies of human future design the to calibrate used be can our results hand, in sizes effect clear With studies. epidemiology epigenomic relevant other to broadly more epigenome brain’s the investigating for egy study at work. be may processes other, or unsuspected these of one than more that likely also Itis populations. cell cortical that alters circulation the systemic relative of abundance the different neurons are or lost; of is from there the cells immune a influx modest as such populations some as changes cortex the of populations cell overexpress that in change, of the such plaques vicinity as astrocytes neuritic activated in relation to a AD of fraction pathology: the constituent cells cortical occur that methylation in changes robust, but modest, these explain too is It time. given a early at to confidently differentiate between three possibilities affected that could are cells of number small a only and decades several over accumulates pathology AD as sample, cal represent the altered methylation state of a subset of cells in likely our corti most observed we that methylation DNA in changes The ies. capillar cortical microglia, from cells as endothelial and cells such immune peripheral cells other and glia, neurons, of types different mechanistically. dissected now be can that methylation in alteration a robust uncovered screen of case the in of mechanism, Regardless exact the effects. both of combination a or astrocytes these of an activation the by plaques, neuritic near caused astrocytes of number was increased methylation) DNA altered its presumably (and of alteration the whether distinguish not activation astrocyte with enhanced is GFAP expression Such cell-type adjustments using surrogate markers for different different for markers surrogate using adjustments cell-type Such Overall, the replication of our study’s results by an independent independent an by results study’s our of replication the Overall, many includes that architecture complex a has cortex human The 2 4 and its functional validation with RNA our strat with makes data validation and its functional RNA expression largely abrogated the association of association the abrogated largely RNA expression 37– Table 3 9 are crude analyses, but are helpful to begin to assign assign to begin to helpful are but analyses, crude are α , as well as that of other proteins such as epidermal epidermal as such proteins other of that as well as , α CDH23 2 RHBDF2 converting enzyme (TACE, also called ). Its connection to PTK2B further suggests that it suggests ). Itsto further PTK2B connection α Supplementary Table 7a Supplementary 3 from the cell surface 6 . Its exact role in AD is not clear at this point, point, this at clear not is AD in role exact Its . RHBDF2 ; the relative proportion of the constituent constituent the of proportion relative the ; may function in regulating the substrate substrate the regulating in function may Supplementary Table 7b Table Supplementary expression is increased in is expression the con increased CDH23 α 3 from the cell surface 3 CDH23 . . The absence of , our DNA methylation methylation DNA our , , b ). t r a RNA expression expression RNA 3 5 . . In vitro In ). Given that that Given ). 40 , ADAM17 4 C I 1 RHBDF2 RHBDF2 , we , can we CDH23 e l 3 work work 4 and s ), ),  ------

© 2014 Nature America, Inc. All rights reserved. 13. 12. 11. 10. 9. 8. 7. 6. 5. 4. 3. 2. 1. reprints/index.htm at online available is information permissions and Reprints The authors declare no competing interests.financial reviewed the critically manuscript. the study. P.L.D., D.A.B. and L.B.C. wrote the manuscript. All of the authors and analyzed RNA data from AD and non-AD brains. P.L.D. and D.A.B. designed analyzed the replication data. N.E.-T., J.B., H.S.C., C.Y., F.Z. and S.G.Y. provided L.Y. performed analyses on the resulting data. K.L., L.C.S. and J.M. generated and generated data from the samples. G.S., L.B.C., J.E., B.T.K., M.K., T.R., J.R. and C.M., A.T., W.B., S.G., C.B.E., B.E.B., A.M. and J.A.S. preparedcollected, and Equipment Grant from Alzheimer’s Research UK. was funded by US National Institutes of Health grant AG036039 to J.M. and an Buren Alzheimer’s Research Disease Program to N.E.-T., S.G.Y. and F.Z. This work Siragusa Foundation to N.E.-T., and the and Robert Clarice Smith and Abigail Van ES017155, KL2 RR024151, K25 AG041906-01). Support was also provided by the AG032990, R01 AG18023, RC2 AG036547, P30 AG10161, P50 AG016574, U01 of Health (R01 AG036042, R01AG036836, R01 AG17917,R01AG15819, R01 Support for this research was provided by grants from the US National Institutes participants of the ROS and MAP studies for their participation in studies.these and families who made this research possible. We also would like to thank the Brains for Dementia Research (Alzheimer Brain Bank UK) and the donors Dementia in the South London and Maudsley NHS Foundation Trust (SLaM), We thank the National Institute for Health (NIHR) Biomedical Research Unit in onlin Note: Any Supplementary Information and Source Data files are available in the the in versio available are references associated any and Methods M t r a  COM AUTH Acknowledgmen

et

Raj, T. Raj, S. Horvath, Bell, J.T.Bell, D.G. Hernandez, V.Haroutunian, & M.S. Beeri, S., and Akbarian, healthy of determinants Epigenetic S. Numata, K.M. Bakulski, L. Chouliaras, Gräff, J. of roles Multiple C.T. Chiu, & H.J. Kim, Z., Marinova, Y., Leng, D.M., Chuang, Y. Liu, A.P.Feinberg, late-onset in drift epigenetic Age-specific A. Schumacher, & B. Oelze, S.C., Wang, under natural selection. natural under tissue. blood population. (2012). ageing e1002629 healthy a in phenotypes age-related and age brain.human the inchronological age cognition. and aging brain diseased cortex. prefrontal human cortex. frontal human in controls normal Dis. Alzheimers J. cognitively and disease Alzheimer’s Aging patients. Alzheimer’sdisease of hippocampus the in hydroxymethylation brain. (2009). conditions. neurodegenerative in inhibition HDAC (2013). intermediary of genetic risk in rheumatoid arthritis. rheumatoid in risk genetic of intermediary index. mass body with covary and disease. Alzheimer’s e e version of the pape P h O n of the pape the of n E ods TI R R

C I 34 Nature et al. et t al. et CON NG et al. , 2091–2099 (2013) 2091–2099 , et al. et e l FI et al. et t al. et Alzheimer disease susceptibility loci: evidence for a protein network protein a for evidence loci: susceptibility disease Alzheimer pgnm-ie soito dt ipiae N mtyain s an as methylation DNA implicate data association Epigenome-wide An epigenetic blockade of cognitive functions in the neurodegenerating

TRIBUTI Genome Biol. Genome 483 Epigenome-wide scans identify differentially methylated regions for regions methylated differentially identify scans Epigenome-wide N et al. et l et al. et . t al. et A s Aging effects on DNA methylation modules in human brain and brain human in modules methylation DNA on effects Aging t al. et

N mtyain intrs n eeomn ad gn o the of aging and development in signatures methylation DNA , 222–226 (2012). 222–226 , NC 29 Personalized epigenomic signatures that are stable over time over stable are that signatures epigenomic Personalized r Genome-wide DNA methylation differences between late-onset between differences methylation DNA Genome-wide ts PLoS ONE PLoS . r , 571–588 (2012). 571–588 , IA . osset erae n lbl N mtyain and methylation DNA global in decrease Consistent itnt N mtyain hne hgl creae with correlated highly changes methylation DNA Distinct ON Am. J. Hum. Genet. Hum. J. Am. L Am. J. Hum. Genet. Hum. J. Am. I

13 N S T , R97 (2012). R97 , .

E 3 R , e2698 (2008). e2698 , Sci. Transl.Med. Sci. E STS JAMA Neurol. JAMA Hum. Mol. Genet.Mol. Hum.

90 90 , 720–726 (2012). 720–726 , , 260–272 (2012). 260–272 ,

rns Neurosci. Trends 70 Nat. Biotechnol. Nat.

2 , 711–718 (2013). 711–718 , , 49ra67 (2010). 49ra67 ,

20 http://www.nature.c , 1164–1172(2011). , LS Genet. PLoS

32 31 591–601 , , 142–147 , Neurobiol. online online om/

8 ,

17. 26. 25. 24. 23. 22. 21. 20. 19. 18. 16. 15. 14. 41. 40. 39. 38. 37. 36. 35. 34. 33. 32. 31. 30. 29. 28. 27.

Bennett, D.A., Wilson, R.S., Boyle, P.A.,Boyle, Schneider,Relation R.S., & Wilson,J.A. A.S. D.A., Buchman, Bennett, Sperling, R.A. Sperling, Schneider,neuropathology The D.A. Bennett, & S.E. Leurgans, Z., Arvanitakis, J.A., K. Lunnon, J.M. Shulman, J.C. Lambert, P. Hollingworth, S. Seshadri, A.C. Naj, L.B. Chibnik, Cognition S.E. Arnold, & J.A. Schneider, R.S., Wilson, D.A., Bennett, S., Negash, C. Bock, D.A. Bennett, Kraft, A.W.Kraft, Renner, N.A. Linnertz, C. J.E. Simpson, J. Cooper-Knock, Maretzky,T. D.R. McIlwain, O.M. Siggs, factor Tumornecrosis M. Y., Freeman, Christova, Taylor,& M., N. Zettl, C., Adrain, K.C. Roet, C. Cruchaga, M. Tanaka, J.C. Lambert, J. Chapuis, for states chromatin of characterization and Discovery M. Kellis, & J. Ernst, 200–208 (2009). impairment. 200–208 cognitive mild and disease Alzheimer probable of (2014). of pathology. disease. Alzheimer’s with associated CR1 disease. Alzheimer’s with associated (2011). 429–435 are CD2AP and CD33 disease. disease. Alzheimer’s(2011). late-onset with associated are decline. cognitive 72 of neuropathology to cognition in persons without cognitive impairment. (2011). Orders Study and Rush Memory And Aging Project. and neuropathology in aging: multidimensional perspectives from the Rush Religious technologies. Project. Aging and Memory 335 TACE. of activation and trafficking promote to iRhom2 requires signaling (2013). 11116–11135 axons. spinal sensory injured of sprouting regenerative improves that protein a SCARB2, identifies and cells ensheathing olfactory proregenerative disease. Alzheimer’s for proteins. superfamily TGF-beta with their binding of 1–analysis protein/follistatin-like follistatin-related for receptor candidate disease. Alzheimer’s for loci susceptibility pathology. tau modulating genome. human (2010). the of annotation systematic 7 Association disease. Alzheimer’s Aging-Alzheimer’s for guidelines on diagnostic on Institute workgroups National the from recommendations in APP/PS1 mice. APP/PS1 in phenotypes. 228 activation distinct induce astrocytes of stimulation tissues. brain human 32 genotype. APOE and pathology Alzheimer’s to relationship brain: disease. (2013). shedding. ectodomain dependent (2012). against protection 119 , 280–292 (2011). 280–292 , , 599–609 (2012). 599–609 , , 1795–1807 (2011). 1795–1807 , ANK1 , 1284–1294 (2013). 1284–1294 , (2012). 5769–5771 , (2012). 225–228 , advance online publication online advance J. Am. Med. Assoc. Med. Am. J. Nat. Rev. Neurol. Rev. Nat. et al. et n lhie’ dsae neuropathology. disease Alzheimer’s in et al. et JAMA Neurol. JAMA et al. et et al. et et al. et et al. et et al. t al. et et al. et et al. et et al. et et al. et al. et Nat. Biotechnol. Nat. et al. et et al. et Quantitative comparison of genome-wide DNA methylation mapping methylation DNA genome-wide of comparison Quantitative et al. et et al. et et al. et Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 and CD2AP,CD33 MS4A4/MS4A6E, at variants Common t al. et t al. et A multilevel screening strategy defines a molecular fingerprint of fingerprint molecular a defines strategy screening multilevel A Increased expression of BIN1 mediates Alzheimer genetic risk by risk genetic Alzheimer mediates BIN1 of expression Increased Attenuating astrocyte activation accelerates plaque pathogenesis plaque accelerates activation astrocyte Attenuating Genetic regulation of alpha-synuclein mRNA expression in various Cross-tissue methylomic profiling implicates cortical deregulation cortical implicates profiling methylomicCross-tissue t al. et iRhom2 controls the substrate selectivity of stimulated ADAM17- stimulated of selectivity substrate the controls iRhom2 iRhom2 is required for the secretion of mouse TNFalpha. mouse of secretion the for required is iRhom2 t al. et I2 ic-neatn poen hmlg ( A homolog 2 protein disco-interacting DIP2 Genome-wide analysis of genetic loci associated with Alzheimer with associated loci genetic of analysis Genome-wide Transient acidification and subsequent proinflammatory cytokine Ann. Neurol. Ann. t al. et Rare coding variants in the phospholipase D3 gene confer risk confer gene D3 phospholipase the in variants coding Rare FASEB J. FASEB CR1 is associated with amyloid plaque burden and age-related and burden plaque amyloid with associated is CR1 Genome-wide association study identifies variants at CLU and CLU at variants identifies study association Genome-wide Microarray analysis of the astrocyte transcriptome in the aging the in transcriptome astrocyte the of analysis Microarray Selected findings from the Religious Orders Study and Rush and Study Orders Religious the from findings Selected Toward defining the preclinical stages of Alzheimer’s disease: Alzheimer’s of stages preclinical the Towarddefining Listeria eei ssetblt fr lhie dsae ertc plaque neuritic disease Alzheimer for susceptibility Genetic Meta-analysis of 74,046 individuals identifies 11 new 11 identifies individuals 74,046 of Meta-analysis PLoS ONE PLoS omn ains t BA, SAAM44, EPHA1, MS4A6A/MS4A4E, ABCA7, at variants Common Ro2 euain f AE otos TNF-mediated controls TACE of regulation iRhom2 profiling in human neurodegenerative human in profiling expression Gene Nature

70

Mol. Psychiatry Mol. 8 n rsoss o LPS. to responses and J. Alzheimers Dis. Alzheimers J. , 1150–1157 (2013). 1150–1157 ,

, 518–530 (2012). 518–530 , 27 303

28

, 187–198 (2013). 187–198 , 69 505

4 , 1832–1840 (2010). 1832–1840 , , 1106–1114 (2010). 1106–1114 , Proc. Natl. Acad. Sci. USA Sci. Acad. Natl. Proc. , 560–569 (2011). 560–569 , , e7480 (2009). e7480 , , 550–554 (2014). 550–554 ,

Nat. Genet. Nat. Nat. Genet. Nat. 18 FEBS J. FEBS , 1225–1234 ,

33

a. Biotechnol. Nat. , S397–S403 (2013). S397–S403 , Curr. Alzheimer Res. nature nature a. Neurosci. Nat.

277

a. Genet. Nat. 41 45 Science , 1094–1099 (2009). 1094–1099 , , 1452–1458 (2013). 1452–1458 , , 4278–4289 (2010). 4278–4289 , (2013).

110 lhies Dement. Alzheimers NEUR

n. Neurol. Ann. a. Genet. Nat. Neurobiol. Aging Neurobiol. . Neurosci. J. , 11433–11438 , Drosophila . el Physiol. Cell. J. 335

28 43 17 OSCI

Ann. Neurol. 8 229–232 , 817–825 , 436–441 , XXX–YYY , , 336–340 Science EN i a is ) Blood

33 66 43 C E , , ,

© 2014 Nature America, Inc. All rights reserved. criteria. The NIA-Reagan criteria, whichonNationallikelihood theof based high AD Institute on Aging (NIA)-Reaganintegrates both the Consortium to were classifiedEstablish as having a pathologic diagnosis AD if they had intermediate or subjects Specifically,examination. post-mortem on determined is which AD, diagnosis, we also assessed for associations with a neuropathologic diagnosis of skewed, we used the square-root–transformed values in our statistical analyses. the five regions. Because the distribution of these average standardized counts is that same region, and then taking the average of the standardized counts across subject’sthe populationingregionsin thefives.d. by the of eachcount rawin score of neuritic plaque burden was then computed for each individual by divid the hippocampal CA1 sector. As in prior publications midfrontal,the temporal,middle inferiorandentorhinalparietal andcortices, was used to visualize neuritic plaques in tissue sections from five brain regions: performed across the US, as described previously thology in the brain at the time of death. Brain autopsies in ROS and MAP were neuriticofquantitativeplaques,den a measureamounttheof neuropa ADof Phenotypes. Preprocessed probe transcript levels were used in the BioConductordownstream ofanalysis. package lumi the using filtering probe and malization withbackground correction, variance stabilizing transformation, quantile nor sion data exported from Genome Studio software (Illumina) were preprocessed to diagnosis, age, sex and RNA Integrity Numbers (RIN). Rawacross probe-levelchips and platesexpres using a stratified approach to ensure balance with respect used for the transcriptome measurements of RNA samples that were lishedrandomized methods. Whole-genome DASL expression microarrays (Illumina) were Bioanalyzer,pubrespectively,2100 Agilentto according andRNAqueous kit (non-AD, ( pathologicAD withsubjects autopsiedfrombrainsof cortex temporal the frommeasured weretranscripts detail in described is methodology the obtained as part of a recently published brain expression GWAS (eGWAS), where before analysis. purity and degradation for tested and standard method extraction phenol-chloroform a using region brain dissected each of mg ~100 from isolated was by trainedsected specialists, snap-frozen and stored atGenomic °C.−80 DNA London-Neurodegenera re ( Bank Brain London Disease MRC Neurodegenerative the in archived individuals 117 from obtained (PFC) cortex HapMap reference, as previously described the usingimputation and genotypes Omni1-Quad Illumina or 6.0 GeneChip Genotypes were available from prior studies and were derived from Affymetrix previouslyfoundin published be literaturecohortscan regarding two the University Medical Center and Partners Healthcare. More detailed information aspects of these studies were approved by the Institutional Reviewteamlargeandofaphenotypes set were collected studies. identicalboth inAll Boards of Rush as they were designed to be combined, were maintained by a single investigative previousstudies in As 85%. Act donating their brains at the time of death. The overall autopsy rate exceeds exceeds 90%. Similarly, participants in both cohorts signed an Anatomical Gift annual detailed clinical evaluations and cognitive tests, and the rate of follow-up ment. All participants in ROS and MAP signed an informed consent agreeing to enrollof time dementiatheknownat of free andageyearsof 53 wereleastat who area Chicago the in retirementfromfacilitiesprimarily women and men enrollment.cohort,MAPTheestablished consists1997,in ofmore than1,600 states who were at least 55 years of age 12 andin groupsfree 40 offrom brothersknown and nuns dementiapriests, Catholic at 1,100 the than moretime of of Memory and Aging Project the Center: UniversityRushMedical at jects from two large, prospectively followed cohorts maintained by investigators Subjects and genotypes. ONLINE MET doi: a Registry for Alzheimer’s Disease (CERAD) estimates of neuritic plaque density search/MRC-London-Ne Toput associationsthe withneuritic plaque burden contextthein of anAD Temporal cortex expression levels for the autopsied Mayo Clinic subjects were prefrontal of samples uses analysis methylation DNA replication The 10.1038/nn.3786 n = 197). Total RNA extraction and QC were done using the Ambion Our primary phenotype of interest in this manuscript was the bur H ODS The analyses in this manuscript included deceased sub tive-Diseases-Brain-Bank.asp (MAP). The ROS cohort, established in 1994, consists 1 7 urodegenerative-Dise , we analyzed the ROS and MAP cohorts jointly,cohorts MAP and ROS the analyzed we , n = 202) and those with other brainpathologiesotherwith those and 202) = 44 Religious Orders StudyOrdersReligious , 4 5 2 . Briefly,. expression24,526 of levels http://www.kcl.a 3 . 42 , 4 1 3 7 . Bielschowsky silver stain ases-Brain-Bank/MRC- , a quantitative composite x ). All sampleswere All dis ). c.uk/iop/depts/cn/ (ROS) and the and (ROS) 46 42 , , 4 4 7 3 ------. .

Subject and probe quality control. ropathological phenotypes of each subject conducted by laboratory personnel who were blinded as to the clinical and neu color channel normalization and background removal. All data generation was procedures of the proprietary Illumina GenomeStudio software, recommended which the includesimplementing by file data a produces platform assay.The fordata generation by Illuminathe InfiniumHumanMethylation450 chip bead as measured by PicoGreen was used by the Broad Institute’s Genomics Platform (ICA). However, when applying these approaches to our data and comparingand data our approachesto However, applyingthese (ICA). when normalize our data, including confirmedwas also by our PCAanalysis. We evaluated different approaches to the use of two different thermocyclers during data generation. This batch effect N downstream analyses.for selected were samples 708 of total a check, quality subject’ ‘per our after Hence, efficiency.conversion bisulfite poor having for samples 40 additional an Weexcludedattempted. was assay Illumina the which for subjects 761 of total a from subjects 13 removed we criteria, these Using samples. the of all forautosomeprobes 50,000 of selection random a using performed was PCA ± we used principal component analysis (PCA) to select subjects that were within remove subjects with poor quality data from further consideration. Specifically, cleaning. The first step in this component of our quality control pipeline was to 415,848 autosomal CpGs for downstream analysis. try, such as our subjects. These CpGs were removed from consideration, leaving polymorphic CpGs with 0.01 analysisprevious a on Based occurs. base pairs upstream or downstream of the CpG site, where single base extension(MAF) frequency allele minor a with SNP a which in methylationthesample onourgivenremovedThus,datasize. we effect CpGs genotype. subject’s a majorhavenot shouldfrequencies by allele minor lowHowever, very SNPs with affected be could CpGs these of level Methylation probes. Using these criteria, we selected 470,913 out of a total of 485,577 tested probes that had detection represent the quality of the probes compared to background noise. We selected detection the used was to used quantitate the DNA. 16 Qiagen (cat. #51306) QIAamp DNA mini protocol. The Qubit 2.0 Fluorometer fully dissected from the white matter. DNA extraction was performed using the Disease Center. These sections were thawed on ice, and the gray matter was deceased caresubjects from the ROS and MAP studies based at the Rush Alzheimer’s sections of frozen dorsolateral prefrontal cortex were obtained from each of 761 E the neuropathology staff. by diagnosis) clinical (relativeto fashion blinded a in collected was data logic study.neuropathothe during All collected data clinical the to accesswithout as reported implemented was pathology, tangle neurofibrillary of staging Braak and also overlap with known SNP sites based on the 1000 Genomes database Genomes 1000 the on based sites SNP known with overlap also methylation levels. on sex of influence well-described the given model analytic our in gender for below,adjustednotedwe Furthermore,as downstreamanalysis. for list probe We implemented this recommendation and removed the 29,233 probes HumanMethylation450from our bead chip assay that meet this sequence match criterion. authorsInfiniumtheIllumina recommendprobesthediscarding from 29,233 chromosomeduringalignmentsequencesequenceusingBLAT sex the match nucleotides 50 of 47 which in probes that discarding recommends report recent a in gender to association strong showed probes these of chromosomes sex the with ­hybridize of the autosomes. onearefoundprobes, in 460,045470,913 these analysis.Of probes forfurther 3 s.d. from the mean of a principal component (PC) for PC1, PC2 and PC3. and PC2 PC1, forcomponent principal(PC) a of mean the from s.d. 3 xperimental protocol for ormalization of the data. We also evaluated the ‘per subject’ quality of the data after the initial CpG data In addition to cross-reactive probes, a substantial fraction of CpG probes CpG of fraction substantial a probes, cross-reactive to addition In cross- to predicted are probes some unique: are probes all not However, 4 8 Tee igoe ae ae y or-etfe neuropathologists board-certified by made are diagnoses These . P value criteria recommended by Illumina. These Illumina. byrecommended criteria value P < 0.01 for all samples to ensure the use of good quality We observed a strong batch effect in our data due to DN ≤ COMBAT MAF (EU) A extraction from post-mortem brain. For the initial quality check of the data, we µ 4 l of DNA at a concentration of 50 ng 9 4 , we found a total of 14,964 autosomal,14,964 of total a found we , 9 5 based on sequence alignment. Many alignment. sequence on based 0 and independent component analysis ≤ 0.99 in subjects of European ances nature nature ≥ 0.01 existed within 10 within existed 0.01 NEUR 4 9 . Specifically,. OSCI P 100-mg values EN µ C l 4 −1 9 E - - - - .

© 2014 Nature America, Inc. All rights reserved. mental batch and bisulfite conversion efficiency. A logistic regression was used used a linear model, adjusting for age at death, sex, study (ROS or MAP), experi differentiallymethylated regions associated with neuritic plaque pathology, we k missing Any details). for below (see approaches estimates of DNA methylation derived from whole genome bisulfite sequencing from 0 (no methylation) to 1 (100% methylation) and show good correlation to interpretablethan tain favorable statistical properties relative to valuesof(logisticallyM transformed methylation level measurement for the targeted CG site in a given sample ses, insteadwe used the D to identify the most conservatively associated subset of 71 CpG. analysis. However, we added it as a covariate in a secondary analysis that led us AD ( resulting measure was not significantly associated with a pathologic diagnosis of estimate of the proportion of NeuN combinations of purified profiles using nonlinear least-squares. This yielded an We used the data from the NeuN data. methylationDNA using sample brain each in neurons) (primarily cells packageR an mainused text,wethe in difference in the proportion of neuronal cells in our brain samples. As discussed cortex can change with AD, we evaluated a technique to account for the possible that the proportion of neurons and other cell types found in the human cerebral Accounting for differences in the proportion of cell types in our tissue. of validation. roundstwo with positives false minimize andnegativeresults false of risk the ciated with technical or demographic confounders. Thus, we prefer to minimize avoid over-correcting our data for variables that are not demonstrated in theto primary analysisbe to directly assoaddress the source of technical variation and to minimize false positives ( given our strategy of validating the CpG analysis with two rounds of validation tion to the neuritic amyloid trait (95% remain at a threshold of significance, they retained strongly suggestive evidence of associa other 67 CpGs that were significant in our primary analysis fluctuated below our significant regardless of the approach used and that, although the normalization), we found that the top 71 CpG in the primary analysis remained (including known technical confounders such as batch versus COMBAT and ICA used in our evaluation of normalization methods. appreciable percentage of the variance. These seven surrogate variables determined werethat 7 was a conservativethen number of components that explained an the average variance explained in the methylation matrix by each value of refined components as the surrogate variables. To set an optimal the independent component most correlated with component ated with component beta values populated only by methylation probes that were significantly associ componentindependent each for in used that to similar refinementstep a performed then assigned pendent components to the matrix of methylation beta values to infer the number of statistically inde at cally independent, non-Gaussian factors using an R package ments of disease pathophysiology that is not yet understood. knownconfounders. Thus,such ICA-derived relatedvariables be could eleto variablesthat capture structure withinof set adata and are not associated with normalizing our data. In particular, ICA includes adjustmentswithin forour data,many these unknownapproaches ( as batch number and age) that we find to be associatedthem withwith anprincipal approach componentsin which we adjust for given confounding variables (such nature nature foranalysis,ADthe using samethe covariates. As noted above, we considered, -nearest neighbor algorithm for ata analysis and statistical modeling. http://cran. ICA is a matrix factorizationintomatrixstatistimatrixseparatestechnique that a a is ICA To perform our methylome-wide association study (MWAS) and discover (MWAS)and study association methylome-wide our Toperform In our comparison of the results obtained using the two different approaches P = 0.08), and we therefore did not include it as a covariate in our primary k NEUR surrogatevariables representing r-project.org/web/pa β OSCI values reported by the Illumina platform for each probe as the β values i k . We then performed ICA on this sub-matrix and selected EN iteratively from 1 to 40. In each instance, all samples are C 5 E Fig. 3 . We. thereforeuse optedto 1 + k ), we opted to use the batch variable approach nuclei found in this package to create convex COMBAT = 100. ckages/fastICA/index.htm i + ( nuclei in each of our samples. However, the 1 β

values) ≤ 5 For our primary and secondary analy 2

to quantifyproportiontothe ofNeuN i

≤ and ICA) are overly conservative in k

k entries of the mixing matrix. Wematrix.mixing the ofentries β ) we created a sub-matrix of the the of sub-matrix a created we ) 5 values, they are less biologically 1 . Althoughvalues. haveM cer β P value was imputed using a using imputed was value < 5 × 10 β iSVA values, which rangewhichvalues, −5 fastICA l . We applied ICA ). As a result, and i , and used those (ref. k P , we explored value of the described 5 1) where 1) Given k , and + ------

measures obtained from the entire population (that is, the total cohort of 708 of cohort total the is, (thatpopulationentire the fromobtained measures estimate beta takinginto account standardthe error, were different from those testand the impaired subjects are a subset of the entire cohort and tested, using a one-samplethe beta estimate derived from the entire cohort. We used the fact that the non- CpG, whether the beta estimate from the non-impaired subset was differentcognitively non-impairedfrom at the time of death ( ( subjects of set full the fromobtained analysis association our ofresults comparedthe we conversion efficiency. probesIlluminaprovidesthatestimatethe of tobisulfitevalue median the ing tak by calculated is term conversionbisulfite The sites. control ten the from efficiency term used in the primary analysis is the median methylation estimate to used control for non-complete bisulfite conversion. The bisulfite conversion geted by type II probes), where we expect each CpG to be 100% methylated, are CpGstarcontrolprobesIlluminaby4and as I CpGs targeted sites type (6by version control probes, based on Illumina guidelines. Ten CpG sites designated reference genome. human hg19 the used we probes, CpG the of annotationFor analysis. our in autosomal415,848probes CpGtestedforgenome-widethe givensignificance wea Bonferroni used correction that yielded a hypotheses, multiple of testing the for Toaccount variables. demographic or or(2)surrogate variables that displayed nocorrelation withavailable technical but elected not to include, terms for (1) the proportion of neurons in each sample or learned from the sequence data (available at ingwere previouslyperformedas reported extracted from each of the cortical samples. Library construction and sequenc frontal cortex that is also sampled in our DNA methylation scan. predorsolateralChromatinthe from obtainedwere samples cases, was both examination.In neuropathologicalon pathology body lewy and vascular AD,of evidence mal subjects who were cognitively non-impaired at the time of death and had mini H3K4me3, H3K9ac and H3K9me3) was performed independently for two MAP ies targeting six different chromatin marks (H3K36me3, H3K27me3, H3K4me1, EpigenomicMapping Center, chromatinimmunoprecipitation antibod using C four subjects, the mean subject #MAP50403446, #ROS20963578, subject follows: as are WGBS) versus (Illumina replicates technical the of comparison the of sured by the two technologies was compared using a Student’s individual. In each subject, the estimated level of methylation at each CpG mea a random 50,000 autosomal CpG sites that are present in both data sets for each fromIllumina450k as well as bisulfite-sequencing, were compared by selecting sidered for further analysis. mismatches.coveredCpGs10%more byOnlythan with using custom software assembly.referencemade wereGRCh37 Subsequently, calls methylation CpG 36 base protocol Libraries were then purified and run on the Illumina HiSeq2000 using(Qiagen). EpitectkitBisulfite the treatments bisulfite ausing 5-h roundsof standardtwo before size-selected adaptor-ligatedwere fragmentsthe and fragments, the to previouslydescribed from ATDBio). Whole-genome bisulfite sequencing (WGBS) was performed as repaired, A-tailed and ligated with methylated paired-end adaptors (purchased mented to 100–500 bp using a Covaris S2 sonicator. DNA fragments were end- Array to perform whole genome bisulfite sequencing. Genomic DNA was frag we used the same DNA sample profiled usingwoman), the Illumina HumanMethylationa and man a of consists pair each non-impaired, two and AD with Bisulfite-sequencingdatageneration analysis.and subjects) from which the subset was drawn. hromatin state map. state hromatin g To assess whether changes in DNA methylation are an early feature of AD,featureof early anDNA methylationare in changes To whether assess The bisulfite conversion efficiency term is calculated using the bisulfite con Methylation profile from four samples (two with AD; two non-AD) generated (ref. 2.7 BSMap using aligned were libraries WGBS The ). The H3K36me3 and H3K27me3 data were from donor id 149 while the while 149 id donorfrom were data H3K27me3 and H3K36me3 The ). t n distribution, whetherresultsthe from subset,thisthe specifically = 708) to those obtained from the subset of subjects who were who subjects of subset the from obtained those to 708) = 5 4 . 5 4 5 . In short, Illumina genomicIlluminaDNAshort,In adaptors . wereadded 6 In collaboration with the Broad Institute’sRoadmap Broad the with collaboration In r , excluding duplicate, low-quality reads as well as reads = 0.972. r = 0.971; subject #ROS20214850, r = 0.975; subject #MAP5797875, subject 0.975; = 5 7 http://www.roadmapepigenomics. . Chromatin. state maps were then n P = 217) by assessing, for a given < 1.20 × 10 For fourindividuals (two ≥ doi: r 5× reads wereconreads 5× = 0.972. Across the −7 5 t 10.1038/nn.3786 as the threshold test. The results 5) to the hg19/ the to 5) r = 0.969; = ­ ------© 2014 Nature America, Inc. All rights reserved. 0.037). Once the seven genes that emerge from the RNAvalidationthatemergegenesthefunctionalfrom seven the Once0.037). ( Count Edge genetically defined AD network was significantly connected based on its Direct Our network. original the to comparisonfor distributions empirical generate tonetworks random these parametersonconnectivity networkfour evaluates to build random networks that mimic the structure of the original network and PPI networks, DAPPLE applies a within-degree node-label permutation strategy interactionsinvolving 12,793 proteins pair-wise high-confidence 169,810 contains which InWebdatabase, the from indirectnetworks ofconnected proteins usingofevidence physical interaction network with a cutoff of 2 interacting binding degrees. and DAPPLE creates direct and AD, including 25 late-onset AD GWAS genes, 3 early onset AD associated genes proteins discovered in genetic studies. We associated compiledAD known with a interact significantlylist study methylation DNA of our genes associated with DAPPLEtool online the using Pathway analysis. regions, where indicated. brain assessed the in tissue vascular and/or gliosis loss, neuronal for account to includedwere expression levels five these of all or Some (ILMN_1732799). OLIG2 GFAP following probes were used as covariates: the CNS, the in present types cell five main the for specific are that genes of levels expression the for corrected that analyses those For (RIN-RINmean)2. and RIN plate, PCR sex, death, at age including variables, biological or nical tech for correct to covariates adjusting implemented, was model regression explored the relation of mRNA levels to an AD diagnosis, a multivariable linear elsewhere reported been has samples Clinic R chromatin states to that of all 415,848 probes tested using a chi-square test. analysis,we compared thedistribution of the71associated CpGs across the11 optimal model for our tissue sample. ChromHMM the applying by trained the default settings of the otherdatawere sets from donor 112.Thedataid were dichotomized using the doi: N To assess whether certain chromatin states are enriched for associations in our A data and analysis. 10.1038/nn.3786 -center TREM1 for astrocytes (ILMN_1697176), astrocytes for for oligodendrocytes (ILMN_1727567) and (ILMN_1727567) oligodendrocytes for option to the already signal-extended bed files. The models were models The files. bed signal-extended already the to option (J. Replogle, D.A.B. & P.L.D., unpublished data). We produced a PPI . We selected a model with 11 distinct chromatin states as the the as states chromatin distinct 11 with model a selected We . P = 0.0072) and Seed Common Interactors Degrees Mean ( Mean Degrees InteractorsCommon Seed and 0.0072) = We constructed a protein-protein interaction (PPI) network BinarizeBed A detailed description of the RNA data from the Mayo LearnModel 59 to determine whether the genes identified in identified genes the whether determine to command of 6 omn wt dfut etns from settings default with command 0 .To assess the statistical significance of CD68 ENO2 44 for microglia (ILMN_2267914), microgliafor , 4 for neurons (ILMN_1765796), ChromHMM 5 . Given that the analysis here analysis the that Given . CD34 for endothelial cells endothelial for 5 8 except applying P = -

60. 59. 58. 57. 56. 55. 54. 52. 51. 50. 49. 48. 47. 46. 45. 44. 43. 42. Mean ( sis, both the Direct Edeg Count ( network is expanded and includes six of these genes. PPI theanalysis, Inthe consideredin be thisto genes of list the iterationto added areanalysis of the analy 53.

ae K. Lage, and discovery E.J. Rossin, chromatin-state automating ChromHMM: M. Kellis, & J. Ernst, Zhu, J. program. MAPping sequence M.J. Ziller, bisulfite genome whole BSMAP: W. Li, & Y. Xi, C.A. Gifford, Guintivano, J., Aryee, M.J. & Kaminsky, Z.A. A cell epigenotype specific model for model specific epigenotype cell A Kaminsky,Z.A. & M.J. Aryee, J., Guintivano, P. microarray Du, in effects batch Adjusting A. Rabinovic, & C. Li, W.E., Johnson, Y.A. Chen, variance-stabilizing Model-based D.A. Bennett, W.A. Kibbe, & W. Huber, P., Du, S.M., Lin, microarray.Illumina P.,processing Du, for pipeline a W.A.lumi: Kibbe, S.M. Lin, & F.Zou, M. Allen, Schneider,D.A., Bennett, findings Wilson,and & Overview Z. R.S. Arvanitakis, J.A., Bennett, D.A. ok C Aayig n itrrtn DA ehlto data. methylation DNA interpreting and Analysing C. Bock, methylation levels by microarray analysis. microarray by levels methylation methods. Bayes empirical (2007). using data expression (2013). implicated in genetic disorders. genetic in implicated 7 biology. underlying suggest and interact physically disease mediated characterization. cues. environmental and genome. Bioinformatics BMC cells. stem embryonic human (2012). 705–719 depression. major and region brain age, to application its and bias heterogeneity cellular brain of correction the Illumina Infinium HumanMethylation450 microarray. HumanMethylation450 Infinium Illumina studies. community-based two from data. microarray Illumina for transformation Bioinformatics variants. disease-associated human expression. gene study. orders religious the from Res. Alzheimer Curr. , e1001273 (2011). e1001273 , P = 0.042) measures remain significant. et al. et al. et t al. et Nature et al. et t al. et et al. et Genome-wide chromatin state transitions associated with developmental t al. et et al. et Brain expression genome-wide association study (eGWAS) identifies (eGWAS) study association genome-wide expression Brain et al. et oprsn f eavle n Mvle ehd fr quantifying for methods M-value and Beta-value of Comparison et al.

et al. et Novel late-onset Alzheimer disease loci variants associate with brain with associate variants loci disease Alzheimer late-onset Novel 24

hmn hnm-neatm ntok f rti complexes protein of network phenome-interactome human A Discovery of cross-reactive probes and polymorphic CpGs in the in CpGs polymorphic and probes cross-reactive of Discovery 500 Nat. Methods Nat. hrig dnmc N mtyain adcp o te human the of landscape methylation DNA dynamic a Charting Neurology , 1547–1548 (2008). 1547–1548 , Proteins encoded in genomic regions associated with immune- with associated regions genomic in encoded Proteins Transcriptional and epigenetic dynamics during specification of specification during dynamics epigenetic Transcriptionaland Overview and findings from the rush Memory and Aging Project. Neuropathology of older persons without cognitive impairment cognitive without persons older of Neuropathology

10 , 477–481 (2013). 477–481 , 9 , 646–663 (2012). 646–663 , , 232 (2009). 232 , Cell

79

Epigenetics 152 Cell

, 221–228 (2012). 221–228 , 9 P Curr. Alzheimer Res. Alzheimer Curr. Nat. Biotechnol. Nat. , 215–216 (2012). 215–216 , = 0.0072) and Common Interactors Degrees

, 642–654 (2013). 642–654 , 153 PLoS Genet. PLoS Neurology , 1149–1163 (2013). 1149–1163 ,

8 BMC Bioinformatics BMC , 290–302 (2013). 290–302 , Nucleic Acids Res. Acids Nucleic

66

8 25 , 1837–1844 (2006). 1837–1844 , , e1002707 (2012). e1002707 , nature nature , 309–316 (2007). 309–316 ,

9 , 628–645 (2012). 628–645 , Biostatistics Epigenetics a. e. Genet. Rev. Nat. NEUR

11

36 , 587 (2010). 587 ,

, e11 (2008). e11 ,

8 8 OSCI PLoS Genet. PLoS 118–127 , 203–209 , EN

13 C E - ,