© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. equally equally to this work. San Diego, California, USA. California, USA. 1 investigate to analysis RT-PCR quantitative single-cell of use the work through has Published limitations to sought cells. these address individual among heterogeneity potential the masks which cells, of ory in the specification of terminal differentiation versus long-lived mem role a with pathways and factors transcription many identified have studies such infections, microbial of course the during of lymphocytes specification fate the of T lymphocytes regulation molecular the of standing clearance pathogen after tissues (T cells T localization L-selectin as such receptors and anatomical capacity their proliferative and their CCR7, (CD62L) cytokine and homing of expression (T cells T (T memory cells T memory central subsets, two into interleukin the (IL-7R) of 7 expression low and KLRG1 receptor lectin-like tiated effector T cells are bycharacterized high expression of the killer memory cells that provide long-lived protection. Terminally differen minal cells effector that mediate acute host defense and self-renewing a to ter give rise naive T to lymphocytes responding infection, microbial response immune an During infection. lym microbial T to of phocytes responses the of hallmark a is fate cell of Heterogeneity and necessity of single-cell approaches. subsequently refined by epigenetic silencing of transcripts associated with memory lymphocytes, which highlights the power findings suggest a model for the of differentiation terminal effector cells initiated by an early burst of activity transcriptional and and identified previously unknown molecular determinants that controlled the fate of specification CD8 infection single-cell approach RNA-sequencing and analyzed individual CD8 immediate and durable protection. To elucidate the dynamic changes transcriptional that underlie this process, we applied a During microbial infection, responding CD8 John T Chang Bingfei Yu Boyko Kakaradov RNA sequencing CD8 Early and transcriptional epigenetic regulation of nature immunology nature Received 9 July 2016; accepted 13 January 2017; published online 20 February 2017; Department Department of Cellular and Molecular Medicine, University of California, San Diego, California, USA. Transcriptional profiling approaches have greatly advanced under 6 . However, most studies have been conducted on bulk populations RM in in vivo + 2 cells), do not circulate but instead remain in peripheral peripheral in remain instead but circulate not do cells), . . A third subset of memory cells, tissue-resident memory 3 , Ellen , J Ellen Wehrens 4 3 T cell revealed differentiation by single-cell , Division Division of Biological Sciences, University of California, San Diego, California, USA. 5 . Through comparison of the expression of CD8 . . Our analyses revealed a striking divergence transcriptional among cells that had undergone their first division 2 7 These These authors jointly directed this work. should Correspondence be addressed to G.W.Y. ( , 1 7 . Circulating memory T cells can be categorized categorized be can cells T memory Circulating . EM & Gene W Yeo 1 cells), distinguished by differences in their their in differences by distinguished cells), , 5

6 Department Department of Physiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore. , , Janilyn Arsenio aDV A NCE ONLINE PUBLIC ONLINE NCE 3 . 3 , , Justine Lopez 1 , 4 , 5 , 7 2 CM , + 6 T lymphocytes into differentiate subsets heterogeneous that together provide , Christella , E Christella Widjaja cells) and effector effector and cells) A TION 2 , , Stephanie H Kim

+ T - - - - -

In contrast, induction of the memory program seemed to be associated differentiation. memory-cell with associated determinants molecular program was subsequently refined by selective epigenetic repression of the peak of the adaptive immune response. That initial transcriptional entiating along the terminal effector pathway and was extinguished by differ cells in upregulated markedly was regulators, chromatin with that have been linked to effector and memory cell differentiation, along tion and fate specification. The expression of many transcriptionfunc effector metabolism, regulation, factorscell-cycle including functions, of hundreds of encoding products involved in diverse biological revealed two distinct subpopulations distinguished by their expression ity among that lymphocytes had undergone their first division, which heterogene transcriptional andsubstantial Weunexpected observed wide changes in CD8 individual study here, we applied a scRNA-seq approach to analyze transcriptome- T biological processes, including development diverse of understanding the advanced substantially has that tool ful of pathways. molecular and genes discovery unknown as-yet the precluded analysis for differentiation in involved products to known already genes of pre-selection identified, been have differentiation during cells individual in expression gene infection bacterial to ing patterns of the gene-expression CD8 single + H T lymphocytes sequentially throughout the course of a viral Single-cell RNA(scRNA-seq) power a as emerged has sequencing Single-cell 17 17 subset of helper T cells doi:10.1038/ni.368 2 , 6 , , Zhaoren He 2 , Elina , I Elina Zuniga 2 4 Department Department of Medicine, University of California, San Diego, Institute Institute for Genomic Medicine, University of California, 8 [email protected] 1 , , Stefan Aigner 3 n vivo in 9 , , Ananda W Goldrath , , and innate immune responses + T cells as differentiated they 7 . Although dynamic changes in in changes dynamic Although . ) ) or J.T.C. ( + T lymphocytes. Our 6 1 + These These authors contributed , , Patrick J Metz 8 T lymphocytes respond T lymphocytes , , the pathogenicity of the [email protected] ticles e l c i rt A 3 ,

1 0 ). . . In our in vivo 2

,

 - - - - - . © 2017 Nature America, Inc., part of Springer Nature. All rights reserved. clustering. ( clustering. T 7 cells, day 4 cells, day 1 cells, division cells, naive in as mice recipient wild-type of spleen the from obtained ( nodes. 1 Figure CD8 P14 activated infection, after 7 and 4 2, days LCMV.On of strain Armstrong the with mice host the of infection before d 1 mice recipient wild-type epitope of lymphocytic choriomeningitis virus (LCMV), into congenic immunodominant an recognize that receptors antigen cell T of sion P14 CD8 transferred infection microbial to responding phocytes CD8 individual in changes transcriptional investigate To CD8 of scRNA-seq RESULTS power and approaches.of necessity single-cell CD8 of specification fate the lying the tightly coupled transcriptional and epigenetic mechanisms under into insights new unexpected provide findings Together genes. these with more nuanced alterations in the expression levels of a few specific s e l c i rt A  point. time per experiments two least at of representative are Data plots’. ‘violin b

Naive b

) Single-cell expression of selected genes (right margin) encoding products associated with the differentiation of CD8 of differentiation the with associated products encoding margin) (right genes selected of expression ) Single-cell scRNA-seq analysis of CD8 of analysis scRNA-seq c ) Expression of selected genes among those in in those among genes selected of ) Expression Division a Cell isolation High Validation RNA-se Analysi 1 Day4 + + T lymphocytes (CD44 lymphocytes T + T cells differentiating differentiating T cells T lymphocytes, which have transgenic expres have transgenic which T lymphocytes, s q Expression Day 7 Sort CD8 P14 CD45.1 CD8 populations by flow cytometry Characterization ofmolecularmechanismsandregulatorsthedifferentiationCD8 expression transcript Quantify + T cells + + T lymphocytes responding to viral infection. ( infection. viral to responding T lymphocytes T T lymphocytes and highlight the the highlight and lymphocytes T + CM T cell + Low Adoptive Day –1 transfer T EM hi in vivo in ) were enriched from from enriched were ) in vivo in log(1 +TPM) Bcl II2 Foxo3 Prdm1 Mtor Tbx21 Foxo1 Eomes Irf4 II2ra Tnf Id Klrg1 Klf2 Ccr5 Cxcr II2rb Zeb2 Ifn CcI5 Id My Stat Gzmb Tcf7 Sel Lef1 II7 Batf CM Transform C 3 2 g r 1 c l 6 1 cells and T and cells Auto prep CD45.2 recipient 4 1 Single-cell 3 , we adoptively adoptively we , a (top row), with sorting before and on days 2, 4, 7 and 42 after infection for analysis of analysis for infection after 42 7 and 4, 2, days on and before sorting with row), (top b c

(vertical axes) by cells as in in as cells by axes) (vertical Stat4 Irf4 Ifng Foxo1 Batf

expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) + 10 15 20 10 15 20 12 16 10 12 + 0 5 0 5 0 4 8 0 2 4 6 8 0 4 8 LCMV Armstrong T lym T EM infectio Day cells (colors (above plot) match those in in those match plot) (above (colors cells differential expression, gene-set enrichment - - - 0 cell-type classifier tSNE, clustering, n

cDNA librarypreparatio Unsupervised: Supervised: Fig. 1b Fig. ( reads mapped uniquely 60–90% with populations, ( achieved ( libraries cDNA transcripts study. this of caveat a represents therefore can alter the magnitude and kinetics of the immune response immune the of kinetics and cells magnitude of the alter number can large a of transfer analysis; scRNA-seq for cells of transferred mice recipient into adoptively were cells P14 of numbers large infection), after 4 and 2 (days points time certain For analysis. our in included in a similar way; naive P14 CD8 P14 naive way; a in similar approach T infection, of 42 day magnetic-bead-based (CD44 On a cytometry. flow by by sorted were mice and infected of spleens We performed PCR amplification of full-length, polyadenylated polyadenylated full-length, of amplification PCR performed We at days2,4,7and42 Harvest SPandLN a Tbx21 Klf2 II2ra Gzmb CcI5 after infectio ) Experimental and analytical approaches. SP, spleen; LN, lymph SP, lymph LN, spleen; approaches. analytical and ) Experimental hi

expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) , c 10 10 10 16 10 14 18 12 CD62L 0 2 4 6 8 0 2 4 6 8 0 2 4 6 8 0 4 8 0 2 6 ). At least two technical replicates for each cell population population cell each for replicates technical two least At ). aDV upeetr Fg 1a Fig. Supplementary 1 b 3 (key), presented as transcripts per million reads (TPM) in (TPM) reads million per transcripts as presented (key), , followed by preparation and sequencing of single-cell single-cell of sequencing and preparation by followed , n n A hi NCE ONLINE PUBLIC ONLINE NCE ) ) cells and T Fig. 1 Fig. + 4 Tlymphocytes , 7 , 1 regulators andpathwaysfo Identification ofcandidat 1 a to enable the isolation of sufficient numbers numbers sufficient of isolation the enable to CD8 ). 10 × 10 × 10 ). the differentiationof

Tcf7 Klrg1 II2rb Id2 Cxcr3 scRNA-se +

expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) T lymphocytes EM 10 15 20 12 10 15 20 12 10 14 Naiv P14 CD8 0 4 8 4 0 5 0 5 0 2 6 0 8 c (CD44 Isolation of + ); left margin (brackets), hierarchical hierarchical (brackets), margin left ); D e T cells (CD44 T cells q 6 ), with slight variations among among variations slight with ), to 20 × 10 × 20 to + A Tcells ivision TION hi e CD62L r 1 nature immunology nature + Day T cells, by cells cells by T cells, 6 lo 4

Zeb2 Sell II7r Id3 Eomes lo reads per cell were were cell per reads CD62L

expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) ) cells were isolated 10 10 14 10 20 30 10 15 20 12 0 2 4 6 8 0 2 6 0 0 5 0 4 8 Day Supplementary Supplementary 7 hi ) were also ) also were T CM 1 2 and and T EM CM

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. Data are representative of two experiments. two of representative are Data with lower expression in Div1 in expression lower with Div1 cells, naive individual by Div1 in than cells Div1 in expression higher with genes indicate brackets row); per cell (one plot) ( cell. individual an represents symbol Each T and CD8 2 Figure previously to CD8 linked products the encoding at genes of subset Assessment a of cell. level per single-cell (TPM) reads million per transcript 1 analyses. in-depth further in duplicates of pairs 32 and samples sequencing unique 224 into separated libraries cell 288 single- we included measures, quality-control these undertaking ( effects batch of absence an and were on plates assessed separate sequencing to ensure reproducibility nature immunology nature b a We detected over 6,000 genes with a mean expression of at least least at of expression mean a with genes 6,000 over Wedetected + T cells (obtained as in in as (obtained T cells EM Div1 Div1 cells (green); inset, tSNE clustering analysis of division 1 cells in the plot below, identifying Div1 identifying below, plot the in 1 cells division of analysis clustering tSNE inset, (green); cells

Cells that have undergone their first division exhibit transcriptional heterogeneity. ( heterogeneity. transcriptional exhibit division first their undergone have that Cells ME Division TE M Naive Low + T cell differentiation by analyses of bulk populations populations of bulk by analyses differentiation T cell MEM 1 cells (bottom left). ( left). (bottom cells Div1 Expression

TE Div1 aDV Fig. 1 Fig. TE Day TE cells than in Div1 in than cells ME cells and Div1 and cells A T 4 M NCE ONLINE PUBLIC ONLINE NCE CM b ) among naive cells (gray), division 1 cells (red), day 4 cells (orange), day 7 cells (yellow), T (yellow), 7 cells day (orange), 4 cells day (red), 1 cells division (gray), cells naive ) among High Supplementary Fig. 1d Fig. Supplementary c – b e ) Differential expression of genes (one per column) by Div1 by column) per (one genes of expression ) Differential Day ) Expression of selected genes encoding products associated with the differentiation of CD8 of differentiation the with associated products encoding genes selected of ) Expression T MEM EM 7 MEM cells (key), grouped as genes with higher expression in Div1 in expression higher with genes as grouped (key), cells c e d cells ( cells Bach2 Fos Stat4 Ifng Foxo1 Batf expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) 10 12 A 10 12 14 10 15 20 10 12 14 16 10 10 12 0 2 4 6 8 0 2 4 6 8 0 5 0 2 4 6 8 0 2 4 6 8 0 2 4 6 8 TION d ) or with equal expression in Div1 in expression equal with ) or

– f ). After After ).

Cxcr3 II7r Stat5a II2ra Gzmb Bcat1 expression (TPM)

expression (TPM) 10 12 14 expression (TPM) expression (TPM) expression (TPM) expression (TPM) 10 12 14 0 2 4 6 8 10 10 12 14 16 10 12 14 10 12 0 2 4 6 8 2 4 6 8 0 2 4 6 8 0 2 4 6 8 0 2 4 6 8 0 findings demonstrated that the expression of genes previously previously genes of expression the that demonstrated findings T in expression high had genes encoding memory-associated products, such as as such products, memory-associated encoding genes Conversely, predominate. to known are cells effector differentiated ( infection after 7 and 4 as of and such with cells, the function differentiation effector associated previously their products encoding For genes example, process. in roles this with assigned consistent expression of patterns suggested TE cells than in Div 1 Div in than cells Batf , , Id2 and and Naive a

) tSNE analysis of 6,000 genes expressed by single single by expressed genes 6,000 of analysis ) tSNE Id2 II10rb Tbx21 Irf4 Hif1a Bcl11b TE expression (TPM) expression (TPM)

10 12 expression (TPM) expression (TPM) expression (TPM) expression (TPM) Gzmb 10 15 20 10 12 10 12 14 10 15 20 14 0 2 4 6 8 0 5 cells and Div1 and cells 8 0 2 4 6 8 0 1 2 3 4 5 6 7 8 5 0 2 4 6 0 Div1 MEM ME , had high expression in cells isolated at days days at isolated cells in expression high had , cells (top right) or lower expression in Div1 in expression lower or right) (top cells M TE Fig. 1b Fig. cells (top plot) and Div1 and plot) (top cells TE CM Div1 cells (red) and Div1 and (red) cells cells and T and cells TE TE MEM , cells than in Div 1 Div in than cells

Lef1 KIf2 Tcf12 Hk2 Ccr7 c

expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) terminally which at points time ), 16 10 12 14 10 20 30 40 50 10 12 10 15 20 10 10 12 14 cells ( cells 4 0 2 6 8 0 0 2 4 6 8 0 5 0 2 6 8 4 0 2 4 6 8 e EM ) (presented as in in as ) (presented cells ( cells CM MEM cells (purple), (purple), cells MEM MEM cells (blue). (blue). cells Tcf7 S1pr1

XcI1 Rheb Id3 Eomes s e l c i rt A expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) 1b Fig. cells (bottom (bottom cells 10 10 12 14 10 12 14 10 12 14 16 10 12 2 3 4 5 6 7 8 9 0 2 4 6 8 0 2 4 6 8 0 2 4 6 8 0 1 0 2 4 6 8 0 2 4 6 8 cells ( cells + Tcf7 T cells, T cells, Fig. 1 Fig. , c and and c ). These These ). ), ),

TE

c Il7r ). ).

 ,

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. ee edl dtce i te xetd ypoye ust b a by subsets lymphocyte expected the in detected readily products were memory-associated and effector- encoding as described experiments. two of representative are Data cell. r (linear), correlation Pearson furrowed. more much be can and functions’ linear ‘piecewise- complex are which boundaries, real the of estimates’ ‘stylized are lines of sets both error); Gaussian of assumption the with interval, Div1 between boundary’ ‘decision classifier’s early-state line, orange and cells, 7 effector day and cells memory between boundary’ ‘decision classifier’s fate lines, purple and black dashed axis): (vertical classifier final-fate of that versus axis) (horizontal classifier early-state the of 1 (memory)) in ( 4 cells. day effector-like or memory-like predicted between boundary ( T lymphocytes 4 CD8 day individual of fate the predict to classifiers fate and state ( histograms. density kernel as presented are ( classifier early-state the ( Fig. T n Div1 ( differentiation. of states intermediate in cells of identity the predict to 3 Figure s e l c i rt A  approach. single-cell = 0.71 ( = 0.71 f b a CM (teal; cells memory total from cells) = 48

) Prediction analysis of individual day 4 CD8 day individual of analysis ) Prediction , ) and day 7 effector and memory CD8 memory and 7 effector day ) and e c a e c , presented as the memory-score distribution (from 0 (effector) to 0 (effector) (from distribution memory-score the as , presented cells ( cells ) Classifier separation of Div1 of separation ) Classifier 1 MEM TE ). ( ). cells (red; (red; cells

cells and Div1 and cells r b = 0.78 ( = 0.78 Generation and application of ‘early-state’ and ‘fate’ classifiers classifiers ‘fate’ and ‘early-state’ of application and Generation P , n d = 2.2 × 10 = 2.2 = 48) and T and = 48) ) Cross-validated scores of division 1 CD8 division of scores ) Cross-validated Day 4cells n Div = 34) n P = 36 cells) ( cells) = 36 1 = 4.8 × 10 = 4.8 ME M −6 ; TE black and purple dashed lines indicate the the indicate lines dashed purple and black Div1 EM ). Each symbol ( symbol Each ). b cells (orange shaded area, 95% confidence confidence 95% area, shaded (orange cells ) and fate classifier ( classifier fate ) and cells ( cells TE Memory effector Day −8 a ) and day 7 effector cells (yellow; (yellow; cells 7 effector day ) and MEM n ); Spearman correlation (monotonic), (monotonic), correlation Spearman ); 7 = 48) ( = 48) f cells (blue; (blue; cells Memory score based on

+ ‘fate’ classifier T lymphocytes ( T lymphocytes 0. 0. 0. 0. 0. 1. b d a 0 2 4 6 8 0 , c 0. Memory score distribution Memory score distribution c ) (all cells obtained as in as obtained cells ) (all n 0 0 1. 0. 0. 0. 0. 0. , 0. 0. 0. 0. 0. 1. e = 96 cells), which include include which cells), = 96 e 0 0 2 4 6 8 0 2 4 6 8 0 ) Application of early- of ) Application , + f 6 4 2 0 8 6 4 2 0 T lymphocytes as assessed assessed as T lymphocytes d ) represents an individual individual an ) represents Memory scorebasedon ) were trained; results results trained; ) were .2 early ‘state’classifier n = 24 cells) from from cells) = 24 ‘Fate’ classifier 0. + 'Early-state 0 4 T lymphocytes T lymphocytes Cells Cells classifier Effector Memor Div Div d ) on which which ) on 1 1 .6 TE ME 8 y M ' 0. 10 10 1 8

+

.0

lymphocytes isolated at all time points after infection (as in in (as infection after points time all at isolated lymphocytes ( patterns manner, the gene-expression in an unbiased to visualize, analysis clustering embedding’) borhood neigh stochastic (‘t-distributed tSNE unsupervised Weperformed 1 CD8 division among heterogeneity Molecular infection. microbial to response in differentiation CD8 individual by expression exhibited gene in heterogeneity substantial the captured analysis ( populations bulk of analyses by identical as interpreted been have otherwise would that point time same the from derived cells individual among observed including genes, many of expression heterogeneous example, For populations. bulk of analyses by discernable previously not were that expression others ( others port, ribosome biogenesis, cell division and mRNA processing, among molecular and cellular processes involving transcription, trans diverse in involved products encoding those for enrichment showed ( 1) (division division cell first their undergone had that lymphocytes analysis revealed two distinct subpopulations among single CD8 single among subpopulations distinct two revealed analysis T and clusters; separate own their naive the however,near cell point grouped a time from each few cells formed 7 day and 4 day at cells most ( population each within neity ( clusters distinct In this tSNE analysis, naive cells, T gone their first cell division exhibited considerable transcriptional transcriptional considerable exhibited division cell first their gone CD8 that demonstrated data these Div1 in than (ref. Div1 in expression similar transcription factors with associated memory lymphocytes either had ( such with associated including cells, previously not molecules encoding genes to lymphocytes with that memory factors ing have associated several previously been cells effector of reprogramming terminal and metabolic and signaling molecules previously associated with the differentiation expression of genes encoding transcription factors, cytokine receptors that the revealed analysis set of ogy genes upregulated by Div1 Div1 in Div1 in expression higher having genes 27 and cells Div1 in expression higher having genes 903 sion of 930 genes Div1 distinguished respectively. (‘MEM’), populations memory-cell and (‘TE’) populations effector-cell terminal to larities ‘Div1 subpopulations ( to to T be 1 most similar seemed cells division CD8 effector 7 day and 4 day to similar most be to seemed cells 1 division of lation to ensure that all sorted cells had been activated markers, other than by cell-surface high expression of of the activation expression marker CD44 phenotypic their by not and peak) CFSE second (the CFSE dye division-tracking the of dilution their of basis Fig. 2 Fig. Supplementary Table 2 Table Supplementary Fig. 2 Fig. Notably, our single-cell analysis also revealed patterns of gene gene of patterns revealed also analysis single-cell our Notably, In contrast, Div1 contrast, In expres the in differences that revealed analysis Gene-expression 1 5 CM a a ) and and ) TE Supplementary Table 3 Supplementary ). We emphasize that the division 1 cells were isolated on the on the isolated were 1 cells division We). the that emphasize ). We provisionally designated these two division 1 (Div1) (Div1) 1 division two these designated provisionally We ). cell populations ( populations cell cells ( cells aDV Tbx21 6 Tcf7 MEM , including including , A + Btg1 T lymphocytes, whereas the other subpopulation of subpopulation other the whereas lymphocytes, T NCE ONLINE PUBLIC ONLINE NCE Fig. 2 Fig. , , ) ( ) Fig. 2 Fig. cells ( cells Gzmb Fig. 2 Fig. MEM , which encodes an anti-proliferative molecule an anti-proliferative encodes , which Fig. 1b Fig. TE b a cells had higher expression of genes encod genes of expression higher had cells and and ’ and ‘Div1 and ’ , , Eomes ), suggestive of unique molecular homoge molecular unique of suggestive ), ). Notably, however, many genes encoding encoding genes many however, Notably, ). e Il7r Id3 ) or had higher expression in Div1 in expression higher had or ) TE Fig. 2 Fig. , c , , , , Supplementary Table 2 Table Supplementary cells and Div1 and cells ). Thus, our single-cell transcriptomic transcriptomic single-cell our Thus, ). S1pr1 Il7r + , , Supplementary Table 1 Table Supplementary ). Moreover, ). Div1 T lymphocytes throughout their their throughout lymphocytes T Id3 a CM , , + ). Strikingly, unsupervised tSNE tSNE unsupervised Strikingly, ). MEM and and Il2ra TE T lymphocytes that had under had that lymphocytes T A and and cells and T TION cells from Div1 Fig. 2 Fig. ’ on the basis of their simi their of basis the on ’ , , Foxo1 Klf2 Sell

TE nature immunology nature a MEM CM ( cells than in Div1 in than cells , , ) of individual CD8 ) of individual in vivo ) ( ) Eomes Fig. 2 Fig. EM cells and cells naive cells cells ( cells TE Fig. 2 Fig. cells each formed + cells had higher higher had cells T cells . . One subpopu d MEM MEM ). Gene-ontol ). and and ), in addition addition in ), 6 c ( ). Similarly, Similarly, ). Lef1 ). Together Together ). Fig. 2 Fig. cells than than cells cells, with Irf4 , , Fig. Fig. TE TE Bach2 , was was , c cells cells cells MEM ). + + 1 T T 1 ). ). 4 ------

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. expression patterns of selected genes across inferred paths of differentiation for effector cells (orange), T (orange), cells effector for differentiation of paths inferred across genes selected of patterns expression T (yellow), 7 cells day (orange), CD8 all by row) per (one genes common 89 of Div1 in (as cells memory versus cells effector by differentially expressed (T T cells memory by expression their to relative Div1 4 Figure in as obtained cells (analyzing differentiation of stages intermediate subsequent, in cells of identity the predict systematically we could Wewhether to determine sought lations might represent cells that had to begun already diverge in fate. disparate gene-expression patterns suggested that these two subpopu had division first that their undergone cells of subpopulations distinct two that observation The differentiation of stages intermediate in cells of Identification lymphocytes. memory with associated previously products encoding many transcripts including discernible majority of differentially expressed genes upregulated by Div1 previously not was that heterogeneity nature immunology nature experiments. two of representative are Data trajectories). expression single-cell possible all from (bootstrapped interval confidence 95% areas, a c MEM TE Genes cells relative to their expression by Div1 by expression their to relative cells

cells (Div1 cells Identification of putative regulators of the differentiation of CD8 of differentiation the of regulators putative of Identification 815 (Div1 Single cells TE vs Div1 115 TE (T /Div1 TE MEM

vs

719 MEM

) T aDV MEM ). Pearson correlation, correlation, Pearson ). CM ) Low A in in vivo cells (green) and T and (green) cells NCE ONLINE PUBLIC ONLINE NCE Fig. Fig. 1 could be discerned by virtue of by virtue discerned be could ). We hypothesized that using using that Wehypothesized ). b Expression (log mean ratio)

+ TTE/ TMEM T lymphocyte populations of naive cells (gray), Div1 (gray), cells naive of populations T lymphocyte –10 MEM MEM –5 15 10 0 5 –10 Zscan2 cells (left) and those expressed differentially by terminally differentiated effector T cells (T T cells effector differentiated terminally by differentially expressed those and (left) cells Btg1 Malat1 Fmo1 cells) (right) (all cells obtained as in in as obtained cells (all (right) cells) EM r = 0.78 ( = 0.78 –5 High cells (purple) (one cell per column); left margin (brackets), hierarchical clustering. ( clustering. hierarchical (brackets), margin left column); per cell (one (purple) cells Expression (logmeanratio) A 7 TION , with the vast vast the with , 0 Div1 Mcm3 Cbx3

P TE = 3.6 × 10 = 3.6 1 5 / Div1 Dpy30 TE a ; ; T cells, MEM 1 0 TE Ybx1 Ddx39 /T - +

T lymphocytes. ( T lymphocytes. MEM Snrpa 5 Ezh2 −13 Sumo2 differentiation, whereas the fate classifier, trained on day 7 effector effector 7 day on trained classifier, fate the whereas differentiation, of states intermediate at cells 4 ‘future’ day on the into deployed was differentia of states intermediate ( tion in cells of identity the predict independently to approach computational ‘future-past’ a developed fate. memory long-lived a fate versus differentiated toward a terminally progressed as tion they of states differentia in intermediate cells us to enable identify would classifier; (‘fate’ populations effector-cell terminal and (T populations cell memory true on trained classifier; (‘early-state’ subpopulations division 1 two the on trained one classifiers, supervised distinct two ) against that of genes expressed differentially by Div1 by differentially expressed genes of that ) against 1 20 ). Each symbol represents an individual gene. ( gene. individual an represents symbol Each ). Using both the early-state classifier and fate classifier, we then then we classifier, fate and classifier early-state the both Using Fig. 3 Fig. d Ybx1 Snrpa1 Malat1 Ezh2 Ddx39 Batf Fig. Fig. expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) e a ). The early-state classifier, trained on division 1 cells, cells, 1 division on trained classifier, early-state The ). 10 11 10 12 14 16 10 12 14 16 22 24 26 28 30 32 34 36 38 10 15 20 25 10 15 20 ) Overlap (middle) of genes expressed differentially by differentially expressed genes of (middle) ) Overlap 5 6 7 8 9 Naive 0 2 4 6 8 0 2 4 6 8 0 5 0 5 4 1

). ( ). Div

b 1 TE ) Plotting of the difference in expression of genes genes of expression in difference the of ) Plotting Day cells (red), Div1 (red), cells 4 CM Day cells (purple) and T and (purple) cells 7

Day 42 MEM T T Zscan2 Sumo2 Mcm3 Fmo1 Dpy30 Cbx3 Effector EM CM cells (blue), day 4 cells 4 cells day (blue), cells expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) expression (TPM) 4. 4. 5. 5. 6. 6. 7. 7. 8. 3. 10 11 12 13 14 15 16 17 10 15 20 25 30 35 i. 3a Fig. 0 5 0 5 0 5 0 5 0 5 9 Naive 0 1 2 3 4 5 6 7 8 5 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 8 c CM EM ) Differential expression expression ) Differential Div cells (green); shaded shaded (green); cells 1 T and cells , ticles e l c i rt A b

TE Day ) and the other other the and ) cells versus versus cells 4

Day d Fig. 3c Fig.

) ) Temporal 7 TE EM

cells) cells) Day 42 cells) cells) , d ), ),  - -

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. PRC2 complex) in single CD8 single in complex) PRC2 5 Figure studies published of results the given validation, functional and (H3K27me3) Lys27 at H3 histone of ylation trimeth mediating by repression gene mediates that PRC2 complex cells. We selected effector differentiating in repression epigenetic of possibility ( expression in division, decrease rapid a first by followed the at expression in increase substantial a exhibit to tended pathway terminal-effector-cell the along differentiating cells and ( differentiation of state pre-effector a or state pre-memory a either in being as classified cells 4 day in regulators putative these encoding genes of expression the of basis the on differentiation cell memory- or terminal-effector-cell of pathways infer to able were we points, time of at cells us identity the intermediate to enabled predict ( points time all across regulators putative these encoding genes the of expression single-cell clustering by patterns expression temporal ( specification fate cell respectively. differentiation, memory-cell or differentiation terminal-effector-cell of regulators encoding genes identify to cells cells and effector cells or common only to Div1 ( sets both ( cells ory mem versus cells effector by terminal differentially expressed genes Div1 versus Div1 by differentially expressed genes 930 of set the Among in as obtained cells (analyzing cells memory in expression Div1 in sion by Div1 differentially expressed genes of set the between commonality for searching by differentiation cell We next sought to previously regulators unknown of identify CD8 CD8 of regulators of Identification states. subsequent or preceding the on trained classifiers binary supervised of use the through predicted readily be 4 Tables ( states cell pre-memory or effector prediction pre-terminal the of for genes of sets non-overlapping largely used ( they cells intermediate of identity the on agreed classifier fate and classifier early-state the Notably, cells. 4 day same the lyze ana to ‘past’ the into deployed was cells, memory 40 day and cells s e l c i rt A  ( *** and ( symbol CD62L percent indicate (left) quadrants in Numbers left. at results of summary CD62L percent indicate (left) areas outlined to adjacent Numbers ( left. at results of summary right, (bottom); in (as ( each. in cells percent indicate (left) areas outlined to adjacent Numbers left. at results of summary right, margin); (left infection after CD45.1 gated ( TNF ( infection. after Ezh2 gated of Ezh2 of intensity fluorescence mean indicate (left) areas outlined to adjacent Numbers left. at results of summary right, IL-2R on (gated 1 cells division CD8 1 (Div1) division gated of vivo in T and cells b–d Fig. 4 Fig.

The analysis described above yielded 89 putative regulators of CD8 ) or four ( four ) or Supplementary Fig. 2 Fig. Supplementary fl/fl g h . Numbers adjacent to outlined areas indicate percent Ezh2 percent indicate areas outlined to adjacent . Numbers ), among CD45.1 among ), ; left) of gated IL-2R gated of ; left) c Cd4

d P Ezh2 ). Because application of the ‘future-past’ binary classifiers classifiers binary ‘future-past’ the of application Because ). Ezh2 regulates effector CD8 effector regulates Ezh2 and and – < 0.001 (Student’s two-tailed two-tailed (Student’s < 0.001 k EM Supplementary Table 6 Table Supplementary Fig. Fig. 4a +/+ ) represents an individual mouse; small horizontal lines indicate the mean (+ s.e.m.). NS, not significant ( significant not NS, s.e.m.). (+ mean the indicate lines horizontal small mouse; individual an ) represents MEM Ezh2 cells (key) (presented as in in as (presented (key) cells MEM fl/fl f + or or – (P14) CD8 (P14) 5 h f Cd4 ). Thus, intermediate states of differentiation could could differentiation of states intermediate Thus, ). , , Ezh2 g k cells ( cells cells and by terminal effector cells relative to their their to relative cells effector terminal by and cells , , which encodes a catalytic subunit of subunit the Polycomb a catalytic , encodes which ) Flow cytometry of cells as in in as cells of cytometry ) Flow ) or three ( three ) or b +/+ ). ). We common genes only to Div1 selected next fl/fl (WT) or or (WT) + Supplementary Table 7 Supplementary (donor) CD8 (donor) Cd4 Supplementary Table 2 Table Supplementary + cells as in in as cells α Cre i hi + ) mice per genotype in each ( each in genotype per ) mice ). We observed that genes expressed in in expressed genes that observed We ). + T lymphocytes (obtained as in in as (obtained T lymphocytes CD62L IL-2R α Ezh2 P14 (CD45.1 P14 lo or IL-2R or fl/fl + α + ), only 115 genes were shared by shared were genes 115 only ), lo T lymphocyte differentiation. ( differentiation. T lymphocyte e cells, and the mean fluorescence intensity (right) of intracellular IFN- intracellular of (right) intensity fluorescence mean the and cells, hi and IL-2R and Cd4 , analyzing staining of the DNA-intercalating dye 7-AAD and the apoptosis marker annexin V (AnnV) at 3 or 5 d 3 or at V (AnnV) annexin marker apoptosis the and 7-AAD dye DNA-intercalating the of staining , analyzing CD62L t + TE Fig. 1 Fig. -test). Data are representative of two experiments ( experiments two of representative are Data -test). T cell differentiation T cell α Cre cells relative to their expres to relative their cells j ) Flow cytometry (left) of division 1, day 4 and day 7 CD8 day 4 and day 1, division of (left) cytometry ) Flow hi (KO) P14 CD8 P14 (KO) + cells), day 4 cells, day 7 cells, T 7 cells, day 4 cells, day cells), ) ) CD8 c lo ). ( ). Fig. 4 Fig. or IL-2R or e 1 α MEM 7 d after infection, showing the frequency of cells (left) with intracellular expression of IFN- of expression intracellular with (left) cells of frequency the showing infection, 7 d after 6 ). We visualized their their We ). visualized b , for further analysis analysis further for , lo ) and the set of 834 834 of set the and ) ) Flow cytometry of gated CD8 gated of cytometry ) Flow + CD62L T cells, followed by infection of the host mice with LCMV-Armstrong and analysis of cells 7 d cells of analysis and LCMV-Armstrong with mice host the of infection by followed T cells, cells and memory d ); this raised the the raised this ); Supplementary Supplementary α f lo hi – CD62L + Fig. 3 Fig. i cells among cells as in in as cells among cells Fig. 1 Fig. , T cells in the spleen of wild-type (CD45.2 wild-type of spleen the in T cells k ), or are pooled from two independent experiments ( experiments independent two from pooled are or ), hi lo Fig. Fig. Fig. 4 Fig. TE cells among CD8 among cells cells (left) or Ezh2 or (left) cells b hi f ), yet yet ), ) separated as naive cells, Div1 cells, naive as ) separated cells cells cells (key), assessing Ezh2 expression. ( expression. Ezh2 assessing (key), cells a ) Expression of of ) Expression + + 1 TE T T ). ). d - - - -

by days 5 and 7 after infection than were control cells ( cells control were than infection after 7 and 5 days by CD8 Ezh2-deficient Armstrong. (CD45.2 (CD45.1 ( Ezh2-deficient cell–specific T the from Ezh2 ( level mRNA at the that leled paral level protein the at differentiation during expression Ezh2 of IL-2R IL-2R ‘pre-effector’ putative Ezh2 of expression ( their in protein pattern bimodal a exhibited division CD8 that observed we ing, find that with Consistent cells. effector terminal of differentiation Div1 ( CD4 of function and differentiation the in for Ezh2 role a critical suggesting inflammatory cytokines relative to that of control cells ( cells control of that to relative cytokines inflammatory 3b Fig. Supplementary T cells by chromatin immunoprecipitation followed by sequencing sequencing by followed immunoprecipitation chromatin by cells T (KLRG1 mapped we H3K27me3 peaks derived from the analysis analysis, of naive, terminal scRNA-seq effector our by detected genes on ~6,000 the Focusing cells. effector terminal of differentiation the promote catalyze repressive H3K27me3 marks on a set of key genes and thereby Div1 in Ezh2 of expression high the that hypothesized we repression, transcriptional mediates complex PRC2 the Because determinants memory-associated of silencing Epigenetic cells. effector terminal of role a ings suggested for critical Ezh2 in the regulating differentiation ( an in cells, like’ ‘memory- not but cells, ‘effector-like’ early affected ‘preferentially’ ( infection after 5 day by apoptosis increased with ated ( CD8 of Ezh2-deficient Notably, ( cells of the effector absence Fig. 5i Fig. Eed Supplementary Fig. 3c Fig. Supplementary CM lo We next generated Ezh2 IL-2R cells and T and cells + , , MEM lee ( alleles T lymphocytes (as in in (as T lymphocytes α Suz12 + e T cells. ( T cells. + lo , along with genes encoding other components of the PRC2 PRC2 the of components other encoding genes with along , – Ezh2 , assessed after 3 d of 3 d of after , assessed α hi T cells T hi + + k CD62L hi cells ( cells (right) among the CD8 the among (right) cells i. 5 Fig. ) recipient mice that we subsequently infected with LCMV- ) recipient infected mice that we subsequently CD8 ) IL-7R and and cells (bottom right) or IL-2R or right) (bottom cells aDV and and , , a Eed EM Ezh2 ), two experiments with four ( four with experiments two ), A Supplementary Figs. 3e Figs. Supplementary Fig. Fig. 5 17– k α hi NCE ONLINE PUBLIC ONLINE NCE b cells (above plots) among CD8 among plots) (above cells + ) Flow cytometry (left) of cells treated as in in as treated cells of (left) cytometry ) Flow Set , , in vitro in Ezh2 cls ( cells T lo ), with high expression and low expression in in expression low and expression high with ), cells, respectively cells, 1 Suz12 ) CD8 ) 9 fl/fl ), had higher expression in Div1 in expression higher had ), . MEM a Ezh2 + ) are deleted by Cre recombinase expressed expressed recombinase Cre by deleted are ) ), ), suggestive of a role for Ezh2 in regulating the fl/fl + T cells as in in as T cells T cells to undergo activation or proliferation or proliferation T activation to cells undergo ) and exhibited an impaired ability to secrete to secrete ability an impaired ) and exhibited cells, Div1 cells, and and + model of CD8 of model , + Cd4 Cd4 ) recipient mice given adoptive transfer of transfer adoptive given mice ) recipient a d fl/fl T cells and total memory (CD44 memory total and cells T ) that had undergone their first cell division division cell first their undergone had ) that ). However, Ezh2 deficiency was associ was deficiency Ezh2 However, ). upeetr Fg 3a Fig. Supplementary + in vitro in γ Cd4 Set T cells that had undergone their first first their undergone had that cells T promoter) and adoptively transferred transferred adoptively and promoter) d Cre ( Fig. 5 Fig. α ) Flow cytometry (left) of naive cells, cells, naive of (left) cytometry ) Flow f ) or TNF ( TNF ) or hi (which encode components of the the of components encode (which o cnrl ( control or ) + Cre CD62L T cells were much less abundant abundant less much were cells T TE e e culture with IL-2 (top) or IL-15 IL-15 or (top) IL-2 with culture + 7 d , P14 mice (in which ; right, summary of results at left. left. at results of summary ; right, cells, day 4 cells, day 7 cells, T 7 cells, day 4 cells, day cells, A T lymphocytes. ( T lymphocytes. ( j P α ). ). ). TION Fig. Fig. 5 Fig. Fig. 5 < 0.05); * < 0.05); lo + CD62L events. ( events. and and g + lo b ). ( ). T cell differentiation cell T cells and ‘pre-memory’ ‘pre-memory’ and cells

, e c c h ) or three ( three ) or ) ) was not due to a failure ). ). Moreover, the kinetics nature immunology nature 4 + ) Flow cytometry (left) of (left) cytometry ) Flow T lymphocytes (as in in (as T lymphocytes ). Together these find Together these ). hi cells (top left). Each Each left). (top cells Ezh2 e P ) Absolute number number ) Absolute ≥ 0.05, ** ≥ 0.05, ) into wild-type wild-type into ) c fl/fl i ) Flow cytometry cytometry ) Flow ) Flow cytometry cytometry ) Flow TE d TE ) mice in each each in ) mice cells than in in than cells Cd4 lox Fig. 5 Fig. cells would would cells Fig. 5 Fig. i ; right, ; right, Fig. 5f Fig. P-flanked P-flanked +/+ hi γ P ( < 0.01 < 0.01 ) CD8 ) h f ) P14 P14 ) ) ) or e ) and and ) and and 20 , CM g a , 2 ); ); ). ). + 1 - - - -

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. lower , whereas those in memory cells exhibited exhibited cells memory to relative such coverage coverage, in H3K27me3 in losses considerable those whereas expression, with gene correlated lower that coverage H3K27me3 in gains significant ited (ChIP-seq) nature immunology nature i h e d a + IL-15 Day Day + + IL-2 CD45.1 Ezh2 expression (TPM) + 6 10 20 30 40 50

5 3 CD8 T cells (×10 ) Ezh2 0 0 1 2 3 4 7-AAD 10 10 10 10 10 10 10 10 10 10 10 10 0 2 4 6 10 7-AAD 2 10 10 10 10 10 10 10 10 2 CD8 0 2 4 6 0 2 4 6 0 10 . The promoter regions in terminal effector cells exhib cells effector terminal in regions promoter The . 0 2 4 6 0 2 4 6 WT 10 AnnV 10 0 63.5 Annexin 55.7 2 87.5 Naive 0 1,131 73.4 10 10 10 *** 2 4 2 WT 10 WT 10 10 Naive V 4 6 KO 4 20.7 14.6 20.7 20.5 10

12.3 10 2.4 8.7 8.6 6 Div1 IL-2R aDV 6 10 Eed expression (TPM) 10 10 6,791 Div1 0 44.4 56.0 0 2 4 6 8 60.7 85.4 0 A 10 f 10 NCE ONLINE PUBLIC ONLINE NCE

+ MEM 2

KO IFN-γ cells (%) α 2 10 KO 20 40 60 80 lo 10 0 4 24.2 18.7 4 22.9 32.3 10 21.6 14.4 Div1 IL-2R 10 4.1 8.9 Div1 6 6 WT 24,017 TE Cells (%) Cells (%) *** 100 100 Cells (%) Cells (%) α 20 40 60 80 20 40 60 80 hi 100 100 0 0 Day 20 40 60 80 20 40 60 80 AnnV KO AnnV 0 0 7-AAD Suz12 expression (TPM)

– *** AnnV 4 A 10 15 20 25

7-AAD NS – 0 5 15,391 TION AnnV 7-AAD – Day 4 NS

** – 7-AAD – AnnV –

AnnV 7-AAD – Day + 7-AAD – AnnV * 4 + *

*** IFN- MFI (×10 ) AnnV NS 7-AAD + γ + 7

7-AAD + AnnV 0 1 2 3 4 5 + + AnnV *** KO WT KO WT

+ NS + AnnV

+ T *** NS AnnV KO WT KO WT

+ CM WT Day 7,096 + - 7 *** T EM with lower expression in memory cells than in naive cells, 11 (1.75%) than in 11 cells naive (1.75%) cells, in lower memory with expression were naive 97 among cells, marked by (7.86%) 627 genes H3K27me3; in than cells effector terminal in expression lower with genes 1,234 ( cells in naive + IL-15 + IL-2 k j Day Day Div

KO Set expression (TPM) 10 15 20 25 0 5 1 7 4 1,151 CD62L T

10 10 10 10 10 10 10 10 10 10 CD62L CM 10 10 10 10 10 10 10 10 10 10 10 10 0 1 2 3 4 1 2 3 4 5 10 g 0 2 4 6 0 2 4 6 0 2 4 6 IL-2R 0 10

57.1 + CD8

10 TNF cells (%) 0 10 20 30 40 50 1 10 0 α Fig. 6a Fig. WT 10 2 18 2 39 WT 1 10 1,298 10 49.0 T 4 WT 3 EM 10 10 – 4 10 c 6 and b 72.7 *** 0 10 10 CD8 0 10 10 10 10 10 1 KO 10 1 2 3 4 5 10 KO Supplementary Tables 8 Supplementary Ezh2 2 2 10 38 58 3 10 KO 10 47 40.3 3 1

3 Ezh2 (10 MFI) 10 4 10 15 20 25 10 2 0 5 10 4 Div1 IL-2R 10 3 Naive 6 TNF MFI (×10 ) 3

lo hi hi lo 10 10,000 15,000

IL-2Rα CD62L IL-2Rα CD62L Div1 IL-2R 53 4 + + 5,000 CD8 cells (%) CD8 cells (%) 10 α lo Day Day Div 5 50 55 60 65 70 75 80 35 40 45 50 55 60 0 1 α hi c 7 4 Day Cells 10 10 10 10 10 hi hi hi 0 1 2 3 4

WT 4

CD62L cells (%) CD62L cells (%) CD62L cells (%) 10 WT WT Day 100 Ezh2 0 20 40 60 20 40 60 80 ticles e l c i rt A 0 1 2 3 4 5 0 0 7 IL-2R IL-2R 10

*** T WT 1 WT and WT CM *** ** 10 * T α α * 2 * hi lo KO EM KO CD62L CD62L KO KO 9 10 KO KO ): ): among 3 10 lo hi 4  © 2017 Nature America, Inc., part of Springer Nature. All rights reserved. ( processes) biological other in involved products encoding genes of sets in abundance to their (relative apoptosis and Wnt signaling, receptor binding, phosphoinositide binding, protein cytoskeletal tion, regula transcriptional involving processes in involved products ing encod those for enrichment showed Ezh2 by targeted genes of set Table 10 Supplementary ( H3K27me3 by marked genes 261 and Ezh2 by bound CD8 differentiating cells. memory of that for than might be more for crucial of the cells effector differentiation terminal silencing that epigenetic suggested This were by marked H3K27me3. s e l c i rt A  ( above initially described as Figure 6 Figure of three experiments ( experiments three of * Fig. Div1 in than 4 cells day in genes) 219 (Gain; expression higher or genes) 1,492 (Loss; expression ( ( scRNA-seq by detected genes 6,000 the of TSS) the around kb +1 to kb −1 (from regions promoter proximal at (key) both or (PolII) II ( cells. naive in than (Mem) cells memory or (Eff) cells effector in expression lower with ( gray). axis; vertical (right coverage of proportion or black) axis; vertical (left regions covered of quantification as ( set data scRNA-seq the in detected genes 6,000 the ( fates. those into differentiate cells naive as (blue) losses or (red) gains coverage as (key) presented are results (right); cells memory versus cells naive in or (left) cells effector terminal versus cells naive in (TSS), site start transcription the from (columns) distances various at (rows) genes various ( cells. effector terminal of differentiation e Supplementary Table11 Supplementary P

) Frequency of genes with a change in TPM among those with lower lower with those among TPM in a change with genes of ) Frequency b < 0.01 and ** and < 0.01 We next investigated binding of Ezh2 and H3K27me3 marks in in marks H3K27me3 and Ezh2 of binding investigated next We Covered regions/200 bp a 1 Genes 1,200 1,600

), with (Ezh2 bound) or without (Unbound) binding of Ezh2 (key). (key). Ezh2 of binding (Unbound) without or bound) (Ezh2 with ), –1,000 Base pairsfromTSS 400 800

b –800 terminal effector 0 ) Peak H3K27me3 coverage around the transcription start site for for site start transcription the around coverage H3K27me3 ) Peak –600 Increased epigenetic repression of genes expressed during the the during expressed genes of repression epigenetic Increased Naive –400 Naive vs –200 d 0 ) ChIP analysis of the binding of Ezh2 or RNA polymerase polymerase RNA or Ezh2 of binding the of analysis ) ChIP Effector 200 400 P 600 < 0.001 (Fisher’s exact test). Data are representative representative are Data test). exact (Fisher’s < 0.001 800 Memory 1,000 a +

– –1,000 T cells by ChIP-seq and found 1,564 genes genes 1,564 found and ChIP-seq by cells T TE –1 c Bse pairsfromTSS

peak coverage(%) –800 ) or two experiments ( experiments two ) or cells (horizontal axis; all cells obtained as in as obtained cells all axis; (horizontal cells –600 –0.5 H3K27me3 Naive vs memory

). Gene-ontology analysis revealed that the the that revealed analysis Gene-ontology ). –400 0 0.6 1.2 1.8 2.4 ). We observed that among cells obtained obtained cells among that We ). observed –200

0 0 Fig. Fig. 200 Coverage % 0.5 400 c Naive vs: 600

H3K27me3 1 1 800

), genes with expression that was was that expression with genes ), coverage (%) c 10 1,000 0 2 4 6 8 ) H3K27me3 coverage of genes genes of coverage ) H3K27me3 d

a Genes ** Ef ) H3K27me3 peak coverage in coverage peak ) H3K27me3 –1,000 f

–800 Both Pol ll Ezh Mem d NS 2 ,

e –600 Base pairsfromTSS ). –400 Expression: e Fig. Fig. Genes with –200 TPM change (%)

100 0 20 40 60 80 1 0 ), presented presented ), Ezh2 bound Unbound 200 Fig. 6 Fig. Loss 400 Fig. Fig.

* 600 Gain d

1 800 and and ). ).

1,000 - - with a critical role in differentiating memory CD8 memory in role differentiating a critical with as well as as such homing, and survival cell T control that encodes a branched-chain aminotransferase isoenzyme tion factors, such as as such factors, tion transcrip memory-associated encoding genes included targets Ezh2 the input DNA control ( of that than Ezh2 with association greater exhibited cells effector of differentiation the to not but cells memory of differentiation the to linked been have that products encoding genes many that revealed This analysis cells. of but cells memory not the differentiation during gets whose expression during decreased the differentiation of effector pathways. disparate these along progressing cells in Ezh2 of expression due in to cells, of part differential memory differentiation during of the cells but differentiation effector terminal not during the repressed selectively of were set being same that the genes possibility and distribution; bottom, of shifting no by indicated (as cells memory and top, by a rapid loss by day 4 after infection ( infection after 4 day by loss rapid a by followed division first the at increase early substantial a with by Ezh2, targeted genes effector memory-cell-associated of those differentiating resembled in cells genes these of patterns expression The Myc ( molecules ing ( including factors transcription encoding differentiation, those effector-cell with products associated encoding genes previously of patterns expression the assessed also cells. these in expression Ezh2 of levels distinct to owing resulted in part from the presence or of absence repression epigenetic in cells effector and terminal differentiating cells memory might have Thus, the unique expression patterns of memory-cell-associated Ezh2. genes by repression transcriptional H3K27me3-mediated for role a 12 Tables Supplementary such genes, as memory-cell-associated of number a including genes, of diminished mRNA many from expression and increased coverage with H3K27me3 associated was deficiency Ezh2 that observed we Il7r as such genes, memory-cell-associated for observed was expression T mature became cells the as expression increased or stable by followed division first a distinct expression pattern bycharacterized a modest increase at the revealed cells memory of in same genes differentiating those analysis contrast, In cells. effector of differentiation the in genes associated memory-cell- of these products for the role a possible suggested this ( decrease rapid a by followed division first the at increase like others, ing division; the first as ( T fate ‘decisions’cell as such signaling, terminal terminal effector cells (as indicated by shifting of distribution; of differentiation the to during more exhibit repression tended genes cell and memory-cell differentiation revealed that H3K27me3-marked terminal-effector- during genes unmarked and H3K27me3-marked Div1 in than cells 4 day in expression higher with those than binding Ezh2 Div1 in than cells 4 day in lower Fig. 7 Fig. We investigated that possibility by analyzing a subset of Ezh2 tar Ezh2 of subset a analyzing by possibility that Weinvestigated In parallel with our analysis of memory-cell-associated genes, we we genes, memory-cell-associated of analysis our with parallel In Foxo1 Eomes , , ), along inferred terminal-effector-cell or memory-cell pathways. or memory-cell ), along terminal-effector-cell inferred Lef1 b TE and and and and Supplementary Fig. 5a Fig. Supplementary and and , , cells ( cells Opa1 Tcf7 aDV Supplementary Table 10 Supplementary Tcf7 Supplementary Fig. 5b Fig. Supplementary Bcl2 A and and Il2ra , which encodes a regulator of mitochondrial fusion fusion mitochondrial of regulator a encodes which , NCE ONLINE PUBLIC ONLINE NCE Fig. 6 Fig. CM , underwent a rapid decrease in expression follow expression in decrease rapid a underwent , , that were not targeted by Ezh2 ( Ezh2 by targeted not were that , Smad2 Klf2 and and 23– cells or T or cells Tcf7 e 2 ). Comparison of the expression patterns of of patterns expression the of Comparison ). Fig. Fig. 7 5 ( and and ; metabolic regulators, such as such regulators, ; metabolic Akt1 , whose product has been linked to CD8 to linked been has product whose , Fig. 7e Fig. and and b Eomes; ) and metabolic regulators ( regulators metabolic and ) EM and 13 Eomes ) than during the differentiation of of differentiation the during than ) cells ( cells TE – h ), consistent with the proposal of of proposal the with consistent ), cells were more likely to exhibit exhibit to likely more were cells Supplementary TableSupplementary 10 , , A molecules that mediate TGF- mediate that molecules Supplementary Figs. 6 Figs. Supplementary TION , ). Several of these genes, such such genes, these of Several ). c Batf and Fig. 7 Fig. ). These findings raised the the raised findings These ). Fig. 7 Fig.

, , Id3 Irf4 nature immunology nature c ). A similar pattern of pattern similar A ). , underwent an , initial underwent i and and ); in contrast, these these contrast, in ); Klf2 + T lymphocytes Fig. 7 Fig. Tbx21 (refs. 27–29); 27–29); (refs. 2 6 Bcat1 ; ; and factors d Hif1a ). Finally, ). ), signal ), Fig. 7 Fig. Fig. 7 Fig. ). ). These Fig. Fig. 7 , which , which – 8 and and and and c 3 a a ); ); β + 0 - - - - , ,

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. expression patterns (as in in (as patterns expression CD8 to linked products ( (blue). H3K27me3 at H3K27me3 and Ezh2 of binding the of in as obtained cells in assessed (key), H3K27me3 by targeted) (Not marked not or (Targeted) marked T (effector, pathways differentiation inferred three the in products encoding genes of values) 7 Figure (ratio as in in as (ratio in Ezh2, by targeted genes cell-associated ( (right). regions or (left) peaks binding H3K27me3 of quantification as presented 2)), (1, each replicates biologic in genes individual for ( trajectories). expression single-cell possible all from (bootstrapped interval confidence 95% areas, shaded (key); cells nature immunology nature h d g c a Lef1 Bcl2 Klf2 Bach2 expression (TPM) expression (TPM) expression (TPM) expression (TPM) Ezh2 mediates the effector differentiation of CD8 of differentiation effector the mediates Ezh2 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 Naive 10 Naive Expression (TPM) 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 g H3K27me3 coverage (%) Not targeted Targeted Smad2 ) plotted against RNA expression (of all genes) in in genes) all (of expression RNA against ) plotted Eomes 0.5 1.5 1. 2. Bach2 –100 Foxo1 Tcf12 –60 –20 100 0 0 Tcf7 Klf2 20 60 Div1 Div1 ld3 Naive

–6 Day Day

4 4 Effector Effector –4 c Day Day T , CM d

Ezh2 7

+ 7

–2 differentiation ) Temporal expression patterns (time relative to infection as in in as infection to relative (time patterns ) Temporal expression Div1 T cell differentiation, targeted by Ezh2 ( Ezh2 by targeted differentiation, T cell Day 42 Day 42

c RNA (log ME 2 0 ) of selected genes associated with effector differentiation. Data are representative of two experiments. two of representative are Data differentiation. effector with associated genes selected ) of M fl/fl aDV Opa1 Eomes

S1pr1 Fos T expression (TPM) T Expression (TPM) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 TSS expression (TPM) EM EM Cd4 Naive 2 0 1 2 3 4 5 6 2.0 Naive expression (TPM) expression (TPM) 0.5 1.0 1.5 fold) 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 A 4 Naive NCE ONLINE PUBLIC ONLINE NCE

+/+ Div1 Div1 T CM 8 6 T T (WT) and and (WT) CM Day CM Day Effector differentiation 4

Expression (TPM) Div1 4 Day 0.5 1.0 2.0 Day 1.5 Eomes 10 TE Ezh2 7 Naive 7 Day 42 Day 42 Day i Smad2 Foxo1 Ezh2

fl/fl ll21r Akt1 expression (TPM) expression (TPM) , , 4 expression (TPM) expression (TPM) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 10 12 0. 3. 0.5 1.0 1.5 2.0 2.5 3.5 Naive T 0 2 4 6 8 Foxo1 Cd4 Naive EM 0 2 1 3 4 5 0 0 Map3k1 Foxp2 differentiation fl/fl

expression (TPM) expression (TPM) Div1 Naive Day 1 2 3 4 5 6 7 8 Div 0 1 2 3 4 5 6 7 8 Div1 Cre ME Cd4 1 7 A , , M CD8 TION Day Div1 Day Klf2

+ 4 4 Cre T lymphocytes through epigenetic repression. ( repression. epigenetic through T lymphocytes Day Day Day (KO) cells. ( cells. (KO) Ezh2 + and and 7 4 Genome position 7

T cells relative to that in in that to relative T cells

Day 42 Day Day 42 T EM c 7 ) or not ( not ) or Tcf7 fl/fl Day 42 Tcf12 ld3 b ll2ra Batf Naive expression (TPM) expression (TPM) 0 1 2 3 4 5 0 1 2 3 4 5 6 7 Cd4 Naive expression (TPM) expression (TPM) 1 1 2 3 6 0 2 3 4 5 6 7 0 4 5 7 8 Read depth Read depth (gray indicates input); arrowheads indicate binding peaks for Ezh2 (red) or (red) Ezh2 for peaks binding indicate arrowheads input); indicates (gray 10 20 10 20 0 0 4 8

ll7r 0 0 4 8 Div Div1 Input Cre expression (TPM) Chr3 Chr9 f d 1 Naive 0 1 2 3 4 5 6 7 8 9 ) H3K27me3 coverage in in coverage ) H3K27me3 ), in inferred paths of differentiation for effector cells, T cells, effector for differentiation of paths inferred in ),

CD8 Day Day 4 Div1 4

Day Ezh2 Day 118,480

+ 7 Day 7 T cells relative to that in in that to relative T cells Day 42 4 Day 42 52,300 Day H3K27me3 Fig. Fig. 7 Tcf7 Bcat1 lrf4 Dnmt1 Naive expression (TPM) expression (TPM) Ezh2 Day 42 Naive expression (TPM) expression (TPM) 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 0 2 3 5 6 1 4 0 1 3 4 6 7 8 2 5 1 ) of selected memory-cell-associated genes encoding encoding genes memory-cell-associated selected ) of CM Div1

Div 118,485 Foxo fl/fl 1 Eome or T or

f Day Day 4 1 Cd4 Peak number ×10 52,350

4 s 4 10 15 5 0 EM Day Day +/+ Kb

TTS 7 Ezh2 Kb

7 WT1 Figure Figure ; inferred as in in as ; inferred Day 42 Day 42 CD8

WT2 Read depth Read depth fl/fl 10 20 10 20 e a 0 0 4 8 0 0 4 8 Ezh2 + Myc Gzmb ) Expression (averaged normalized TPM TPM normalized (averaged ) Expression 1 H3K27me3 coverage (KO) Cd4 T cells (KO/WT). ( (KO/WT). T cells KO expression (TPM) expression (TPM) Chr1 100 150 200 Chr8 (horizontal axes). ( axes). (horizontal 15 20 25 10 14 Naive 10 12 50 K 1 4 0 6 8 5 2 72,319 0 1 5 0 +/+ fl/fl

Div O2 g 52,260

) H3K27me3 coverage of memory- of coverage ) H3K27me3 1 and and Cd4

Fig. 4 Fig. Day

4 H3K27me3 coverage(WT) 72,320 e 0 +/+ H3K27me3 coverag 0. ) H3K27me3 coverage (key) (key) coverage ) H3K27me3 Day 8 Ezh2

0 4 Peak region/bp ×10 7 52,270 CD8 d 0. 1. 1.

Day 42 (KO/WT) 0 5 5 0 ), classified as genes genes as classified ), 100 fl/fl WT1 72,321 + h Tbx21 52,280 Cd4 T cells. ( T cells. Hif1a ) H3K27me3 coverage coverage ) H3K27me3 Klf2 b expression (TPM) expression (TPM) –0. 1. 0. 0. 1. 2. 2. 3. 3. 4. Naive Tcf7 ) ChIP-seq analysis analysis ) ChIP-seq WT2 e 4 5 0 1 2 3 4 5 6 7 0 5 0 0 5 0 5 0 150 CM Cre ticles e l c i rt A Div cells and T and cells KO cells (two (two cells 1 Kb Kb Day K 1 200 i

) ) Temporal 4 O2

Day Density 1 ×10 1 ×10 1 ×10 1 ×10 7 Day 42 –20 –15 –10 –5 EM

 © 2017 Nature America, Inc., part of Springer Nature. All rights reserved. el, s t a be son ht T that shown been has it as cells, T whether elucidate and also question might this into insight additional provide probably should T T differentiating when unknown remains it but subsets, memory circulating both of progenitor common a represent might understood distinct molecularly T that appreciated been has it Although T as in as differentiating well cells, memory in those from distinct were that cells effector differentiating in sion expres of patterns suggested genes individual of ‘trajectories’ the of Visualization cells. memory or cells effector terminal of ferentiation dif the for pathways infer thereby and points time intermediate at ‘future-past’ binary classifiers enabled us to predict the identity of cells early states differentiation of The cellular these subsets. application of ory cells, we that hypothesized Div1 genes. specific few a of expression the in alterations nuanced more with associated be might program upregulation of hundreds of of genes, induction whereas the memory considerable involving burst transcriptional profound a with ciated asso were cells effector terminal of differentiation the in steps liest lower expression. These dichotomous patterns suggested that the ear Div1 Div1 Div1 by expression Div1 versus cells Div1 by differentially expressed genes of patterns expression the Moreover, cells. effector terminal of differentiation the in impor role tant but transient a have might factors transcription associated Div1 in expression Div1 in upregulated substantially were cells memory and cells effector both of differentiation the to linked tors fac transcription encoding genes Unexpectedly, and differentiation. metabolism translation, transcription, regulation, cell-cycle spanned Div1 by Div1 by expression higher had genes these of (97%) ity ‘Div1 called pro we visionally which identified, expressed subpopulations two genes the by of differentially hundreds with division, first their undergone had that cells among divergence transcriptional infection striking a viral revealed a of course the throughout CD8 of antigen-specific scRNA-seq ing CD8 of molecular specification of the regulation into insights new unknown gain and previously determinants discover molecular to sought we study, present the In DISCUSSION program. memory the of of chromatin regulators, followed by subsequent epigenetic upregulation repression the as well as fates, memory and effector the that promote products encoding genes of upregulation the includes that ity activ by that is a cells initiated rapid burst of transcriptional effector terminal of differentiation the for Together model a suggested mechanisms. findings these alternative by occur might during differentiation genes these of regulation the that suggested H3K27me3 by marked or Ezh2 by targeted not were genes effector-cell-associated ( proc differentiation the memory of during ess levels lower much at expressed were genes same s e l c i rt A 1 origin clonal common 0 EM On On the with basis and of effector similarities mem their molecular

cells diverge in fate. Future studies with more precise time points MEM MEM MEM cells were by also expressed Div1 cells. In contrast, the genes with higher expression by by expression higher with genes the contrast, In cells. 33 cells and encoded products with diverse functions that that functions diverse with products encoded and cells , 3 4 . Our data suggest the possibility that Div1 that possibility the suggest data Our . MEM MEM TE 31 cells were unique, in that the genes with higher higher with genes in the that were unique, cells cells had largely undetectable expression in in expression undetectable largely had cells , 3 TE 3 5 2 . el, hc sgetd ht memory-cell- that suggested which cells, , the ontology of these cells remains poorly poorly remains cells these of ontology the , cells’ and ‘Div1 and cells’ Fig. Fig. RM + cells are derived from Div1 from derived are cells TE T lymphocyte fates by perform by fates lymphocyte T RM 7 and Div1 i ). The observation that these these that observation The ). el ad T and cells + T cells derived sequentially sequentially derived T cells MEM CM TE CM cells, albeit with albeit cells, slightly cells and T and cells TE cells’. The vast major vast cells’. The n vivo in cells versus T versus cells cells relative to their their to relative cells MEM cells represented CM . Our analyses analyses Our . el sae a share cells TE CM EM cells than than cells MEM cells and and cells cells are are cells EM cells. cells. cells cells MEM TE ------

sion in Div1 in sion Div1 by differentially expressed differentiation by searching for commonality between the set of genes here. study our the in observed cells 1 division in heterogeneity transcriptional the mediating in division asymmetric signature transcriptional like effector- an with cells daughter to rise give division, asymmetric of T cells deficient in the atypical protein kinase PKC, a central regulator reported been has that pathways signaling kinase phosphatidylinositol-3-OH and mTOR Myc, the of distribution mitotic asymmetric the with sistent Div1 in programming metabolic Div1 of expression increased the in IFN- and IL-2 cytokines the via signaling promote could IL-2R receptors mitosis during cells daughter to unequally determinants certain apportion to lymphocytes T activated enables that mechanism conserved arily an evolution division, be asymmetric could factor One contributing question. open an remains division cell first the following observed tion of differentiating lymphocytes and was associated with more more with associated was and lymphocytes differentiating subpopula of tion distinct a in occur to seemed program memory the transcriptional of induction contrast, In lymphocytes. associated memory with transcripts of silencing epigenetic by refined and burst transcriptional by profound a and rapid initiated cells effector minal plasticity. of degree certain a cells effector on bivalent (H3K4me3 poised, a in remain might genes memory-cell-associated repressed that possible remains it However, genes. motion effector-cell-associated by into set program terminal-effector-cell-differentiation the enforce to serve might genes memory-cell-associated of repression the that suggested findings These Ezh2. by targeted selectively were effector terminal cells, transcription factors associated with the alternative differentiating memory fate In manner. state–specific cell T a in Ezh2 by differentially targeted be might fates alternative promote that factors transcription that demonstrating by observations those marks marks and repressive H3K27me3 subsets and during cell differentiation in their among deposition of permissive H3K4me3 differences substantial demonstrated have and CD8 memory and effector naive, of landscapes epigenetic process this in modifications histone and methylation DNA of importance the ing show studies published with appreciated, increasingly been has fate pathway signaling Notch the of components encoding those as well as molecules, pro-apoptotic encoding genes of repression H3K27me3-mediated through cells T CD8 effector human of survival and polyfunctionality the regulate CD8 in important regulators of functionally ofthe differentiation identifying approaches computational and experimental our of success the cells effector of terminal differentiation that candidate, showed We one metabolism. chromatin energy and proliferation, transcription, of structure, regulation spanned functions determinants molecular whose candidate 89 yielded approach This sets. sub memory-cell in expression their to relative subsets effector-cell We sought to identify previously unknown candidate regulators of regulators We candidate unknown previously to sought identify divergence transcriptional substantial the control factors Which In summary, our data suggest a model for the of differentiation ter A role for epigenetic regulation of determination of the CD8 + TE T cells. Consistent with our findings, Ezh2 has been shown to to shown been has Ezh2 findings, our with Consistent cells. T cells. Increased expression of genes encoding mediators of of mediators encoding genes of expression Increased cells. 1 1 . Asymmetric segregation of factors such as the cytokine cytokine the as such factors of segregation Asymmetric . aDV TE 37– A 42– cells and those expressed differentially by terminal- by differentially expressed those and cells Ezh2 NCE ONLINE PUBLIC ONLINE NCE α 3 4 n IFN- and 9 4 . Finally, the demonstration that activated CD8 activated that demonstration the Finally, . . Published studies have investigated the overall overall the investigated have studies Published . + H3K27me3 , encodes a product with a critical role in the the in role critical a with product a encodes , 4 γ 4 1 drn mitosis during R 0 . supports the possibility of a role for for role a of possibility the supports Il2ra + ) ) state MEM TE cells, moreover, would be con be would moreover, cells, A , , TION cells relative to their expres their to relative cells Stat5a 45 45 in in vivo , 4 , 4 6

6 and might thereby confer . . Our study has extended and and nature immunology nature , which demonstrated demonstrated , which 7 , Tbx21 11 , 3 6 , for example, example, for , observed in in observed γ and result result and + T cells cells T + T cell + + ------

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. 16. 15. 14. 13. 12. 11. 10. 9. 8. 7. 6. 5. 4. 3. 2. 1. reprints/index.htm R The authors declare no competing interests.financial J.A., C.E.W., Z.H., J.T.C. and G.W.Y. wrote the manuscript. J.A., C.E.W., S.A., P.J.M., B.Y., J.L. and S.H.K. performed experiments; and B.K., E.I.Z., B.Y. A.W.G., J.T.C. and G.W.Y. experiments designed and analyzed data; B.K., Z.H. and G.W.Y. performed computational analysis; J.A., C.E.W., E.J.W., Medical Institute (J.T.C.). AI113923 to E.I.Z.; DK007202 for C.E.W. and P.J.M.) and the Howard Hughes MH107367 to G.W.Y.; AI072117 and AI096852 to A.W.G; AI081923 and (DK093507, OD008469, and AI095277 to J.T.C.; NS075449, HG004659 and captures and sequencing. Supported by the US National Institutes of Health Core and Institute for Genomic Medicine Genomics Center for single-cell reading of the manuscript, and the Sanford Consortium Stem Genomics Cell We thank members of the Chang and Yeo laboratories for discussions and critical online version of the pape Note: Any Supplementary Information and Source Data files are available in the t the in available are references, and codes accession statements of including Methods, and data availability any associated M studies. future in approaches CD8 T lymphocytethe of fate and underscorespecification the control power and together necessity ofmechanisms single-cell epigenetic and transcriptional linked closely that suggest Together our findings genes. specific few a of expression the in increases gradual nuanced, nature immunology nature C AUTH A he pape he eprints and permissions information is available online at at online available is information permissions and eprints ck O

et

MPETI Blackledge, N.P., Rose, N.R. & Klose, R.J. Targeting Polycomb systems to regulate to systems TargetingPolycomb R.J. Klose, & N.P.,N.R. Blackledge, Rose, R. Roychoudhuri, J.P. Rouault, D. Ramskold, cell transgenic receptor cell T Initial J.T. Harty, & J.S. V.P., Haring, Badovinac, J.T. Chang, A.K. Shalek, J.T.Gaublomme, Treutlein,B. J. Arsenio, and effector of regulation Molecular A.W. Goldrath, & E.J. Wherry, J.T., Chang, profiling functional and Molecular R. Ahmed, & E. Hemby,Kersh, S.M., S., Kaech, J.A. Best, in specialists local cells: T memory Tissue-resident L.K. Mackay, & S.N. Mueller, of subsets Two A. Lanzavecchia, & M. Lipp, R., Forster, D., Lenig, F., Sallusto, N.S. Joshi, gene expression: modifications to a complex story. acomplex 16 to modifications expression: gene enhancers. to factors AP-1 of access J. EMBO cells. tumor circulating individual Immunity precursor frequency dictates critical aspects of the CD8 responses. immune cells. immune in splicing pathogenicity. RNA-seq. single-cell using analyses. gene-expression single-cell 15 by revealed immunity differentiation. cell T memory differentiation. cell T CD8 memory of formation. cell T memory and defence. immune functions. effector and potentials homing Nature distinct with lymphocytes T memory 27 CD8 no h , 643–649 (2015). 643–649 , (2014). 365–372 , (2007). 281–295 , O + ods T cell fates via the graded expression of T-bet .T-bet transcription of expression graded the via fates cell T wle R R C

r 401 N .

ON 11 G G FI 26 et al. et d , 708–712 (1999). 708–712 , t al. et t al. et t al. et , 1663–1670 (1992). 1663–1670 , gme , 827–841 (2007). 827–841 , et al. et et al. et TRIBUTI t al. et Cell N l t al. et . Transcriptional insights into the CD8 the into Transcriptionalinsights A Early specification of CD8 of specification Early Inflammation directs memory precursor and short-lived effector andshort-lived precursor memory directs Inflammation et al. et smerc lmhct dvso i te ntain f adaptive of initiation the in division lymphocyte T Asymmetric et al. et Reconstructing lineage hierarchies of the distal lung epithelium lung distal the of hierarchies lineage Reconstructing Nat. Rev. Immunol. Rev. Nat.

Single-cell transcriptomics reveals bimodality in expression and expression in bimodality reveals transcriptomics Single-cell N n 163 T1 a ebr f nw aiy f nirlfrtv genes. antiproliferative of family new a of member a BTG1, Science CIAL CIAL I ullnt mN-e fo snl-el ees f N and RNA of levels single-cell from mRNA-Seq Full-length ts r . Single-cell genomics unveils critical regulators of Th17 cell Th17 of regulators critical unveils genomics Single-cell BACH2 regulates CD8 regulates BACH2 , 1400–1412 (2015). 1400–1412 , ON

Nature Nature N S aDV 315 TERESTS Nat. Immunol. Nat. Nat. Immunol. Nat. , 1687–1691 (2007). 1687–1691 ,

498 A

Nat. Biotechnol. Nat. 509 NCE ONLINE PUBLIC ONLINE NCE Nat. Immunol. Nat.

, 236–240 (2013). 236–240 , Cell 16 , 371–375 (2014). 371–375 , , 79–89 (2016). 79–89 ,

111 +

lmhct fts uig adaptive during fates lymphocyte T + 15 14 T cell differentiation by controlling by differentiation cell T , 837–851 (2002). 837–851 , , 1104–1115 (2014). 1104–1115 , , 404–412 (2013). 404–412 ,

30

+ 17 + T cell response to infection to response cell T , 777–782 (2012). 777–782 , T cell response to infection. a. e. o. el Biol. Cell Mol. Rev. Nat. , 851–860 (2016). 851–860 , http://www.nature.c A online version of version online TION a. Immunol. Nat.

Immunity

om/ +

28. 27. 26. 25. 24. 23. 22. 21. 20. 19. 18. 17. 46. 45. 44. 43. 42. 41. 40. 39. 38. 37. 36. 35. 34. 33. 32. 31. 30. 29.

kn C.N. Skon, S.L. Schober, Cytosolic S.M. Hutson, & J.D. Powell, C.H., Drake, C.H., Patel, E.A., Ananieva, Tinoco,V.,Alcalde, R., Yang, Y., transforming Cell-intrinsic E.I. Zuniga, Sauer,& K. L.K. Mackay, factor- growth Transforming N. Zhang, & C. Ma, B. Yu, G.J. Windt, der van N. Manjunath, D.J. Tumes, I.H. Su, M. DuPage, Russ, B.E. Russ, Y. Araki, by methylation DNA J. Jacob, & R. Jaenisch, J., Altman, C., Beard, C., Chappell, B. Youngblood, rapid facilitates acetylation N.P.Histone Weng, & R. Wersto, M., Fann, Y., Araki, E. Zhao, P.J. Metz, K.C. Verbist, Pollizzi, K.N. W.H.Lin, J.T.Chang, O. Gaide, E.J. Wherry, relationships, Lineage C. Pannetier, & P. Kourilsky, Z., Garcia, C., Bouneaud, M.F.Callan, Willinger,& T.,A.J. Molecular T.,McMichael, Freeman, H., Hasegawa, T. Chtanova, M.D. Buck, TranscriptionH.D. ELF4 Lacorazza, factor & Yamada,M. T.,Mamonkin, C.S., Park, (2013). CD8 memory resident of establishment vivo. in and vitro in cells T memory of survival with correlates and cytokines by regulated is CD4 in metabolism branched chain aminotransferase (BCATc) regulates mTORC1 signaling and glycolytic vivo. in persistence fate. cell T memory tissue-resident control to IL-15 and (2015). population. T-cell memory CD8 of lymphocytes. T cytotoxic memory (2013). CD4 of plasticity signaling. (2015). activation. after identity cell T regulatory of maintenance during virus-specific CD8 virus-specific during Immunity CD8 memory of function and transcription gene of regulation state-based 176 expansion. cell T CD8 effector for critical is 1 methyltransferase DNA CD8 antigen-specific (2011). in PD-1 encodes that 180 B). granzyme and perforin targets: its and ( molecules effector of expression differential through response cell T CD8 memory robust and restriction. glycolysis via EZH2 and 194 Clambda/iota. kinase protein and Czeta kinase protein by specification lymphocytes. T activated CD8 dictates bifurcation. fate cell 34 partitioning of the transcription factor T-bet during T lymphocyte division. immunization. skin following subsets. cell T vivo. in cells T CD8 effector-memory and central- of capacities recall and homeostasis, cell T CD8 memory subsets. effector from memory central human distinguish signatures Immunol. J. datasets. microarray of collection comprehensive a using responses, cell T different programming. KLF2. and CD8 of homing and proliferation the controls rwh factor- growth (2015). differentiation. cell , 492–504 (2011). 492–504 , , 4562–4572 (2006). 4562–4572 , (2008). 8102–8108 , (2015). 2249–2259 , J. Exp. Med. Exp. J. + et al. et T cell memory development. memory cell T J. Immunol. J. et al. et J. Immunol. J. et al. et

t al. et et al. et 30 Cell t al. et t al. et et al. et et al. et

Epigenetic landscapes reveal transcription factors regulating CD8 regulating factors transcription reveal landscapes Epigenetic t al. et t al. et + 175 , 912–925 (2009). 912–925 , t al. et et al. et al. et T cell differentiation. cell T t al. et Polycomb group protein ezh2 controls actin polymerization and cell and polymerization actin controls ezh2 protein group Polycomb β t al. et t al. et t al. et

Cancer mediates effector T cell dysfunction by targeting microRNAs targeting by dysfunction cell T effector mediates Cancer et al. et Cell 121 Asymmetric PI3K signaling driving developmental and regenerative and developmental driving signaling PI3K Asymmetric signaling mediates virus-specific CD8 virus-specific mediates signaling eoewd aayi o hsoe ehlto rvas chromatin reveals methylation histone of analysis Genome-wide Nat. Immunol. Nat. t al. et t al. et euain f smerc iiin n CD8 and division asymmetric of Regulation omn lnl rgn f eta ad eiet eoy cells T memory resident and central of origin clonal Common Distinct epigenetic signatures delineate transcriptional programs transcriptional delineate signatures epigenetic Distinct Asymmetric proteasome segregation as a mechanism for unequal for mechanism a as segregation proteasome Asymmetric , 7837–7847 (2005). 7837–7847 , + Nat. Immunol. Nat. iohnra dnmc cnrl T el ae hog metabolic through fate cell T controls dynamics Mitochondrial Asymmetric inheritance of mTORC1 kinase activity during division

rncitoa dwrglto o Sp1 s eurd o the for required is S1pr1 of downregulation Transcriptional ieg rltosi ad rtcie muiy f eoy CD8 memory of immunity protective and relationship Lineage hle tp 1 n tp 2 cells. 2 type and 1 type helper T Metabolic maintenance of cell asymmetry following division in division following asymmetry cell of maintenance Metabolic 201 166 , 425–436 (2005). 425–436 , + -o tasrpin atr cmie ih h ctkns TGF- cytokines the with combine factors transcription T-box Expression of the transcription factor lung Kruppel-like factor Kruppel-like lung factor transcription the of Expression Identification of T cell-restricted genes, and signatures for andsignatures genes, Tcell-restricted of Identification h crmtnmdfig nye z2 s rtcl o the for critical is Ezh2 enzyme chromatin-modifying The Nat. Immunol. Nat. h plcm poen z2 euae dfeetain and differentiation regulates Ezh2 protein polycomb The

Immunity T cells. T et al. et

fetr ifrnito i nt rrqiie o gnrto of generation for prerequisite not is differentiation Effector 163 Cell Rep. Cell hoi vrs neto efre dmtyain f h locus the of demethylation enforces infection virus Chronic 175 , 579–590 (2005). 579–590 , , 63–76 (2016). 63–76 , , 3662–3667 (1999). 3662–3667 , + Nature Mitochondrial respiratory capacity is a critical regulator critical a is capacity respiratory Mitochondrial , 5895–5903 (2005). 5895–5903 , T cell differentiation. cell T rc Nt. cd Si USA Sci. Acad. Natl. Proc. J. Biol. Chem. Biol. J.

Nat. Med. Nat.

31

4 13

, 225–234 (2003). 225–234 , , 145–157 (2009). 145–157 , 532 10 J. Clin. Invest. Clin. J. , 2203–2218 (2015). 2203–2218 , (in the press). the (in , 618–626 (2009). 618–626 , Nat. Immunol. Nat. Nat. Immunol. Nat. , 389–393 (2016). 389–393 , Immunity

+ 21 cells. T , 647–653 (2015). 647–653 ,

289 + + T cells via the Kruppel-like factors Kruppel-like the via cells T cells. T

36

, 18793–18804 (2014). 18793–18804 , β Immunity 108 a. Immunol. Nat. inln i cntnl shaping constantly is signaling , 68–78 (2012). 68–78 ,

17 17 , 871–878 (2001). 871–878 , , 95–103 (2016). 95–103 , , 704–711 (2016). 704–711 , + Immunity cl dlto ad viral and deletion cell T Immunity Immunity Immunity

41 ticles e l c i rt A 112 + , 853–865 (2014). 853–865 , lmhct fate lymphocyte T

11013–11017 ,

14 43

35 39 1285–1293 , 42 , 1101–1111 , J. Immunol. J. J. Immunol. J. J. Immunol. J. 400–412 , 819–832 , , 227–238 , Immunity + T cells. T 11 + T β

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. and H3K27me3, wild-type CD8 wild-type H3K27me3, and H3K27me3. and Ezh2 of analysis ChIP-Seq Center. (IGM) Medicine Genomics for Institute UCSD the at or System Sequencing 100 HiSeq2500 the on 100) (paired-end single-end sequenced were libraries cDNA Single-cell (Illumina). System Sequencing MiSeq and Scientific), Fisher (Thermo Fluorometer 3.0 Qubit Technologies), (Agilent Bioanalyzer 2100 the librar on cDNA performed were single-cell ies the of measures control Quality instruc (Illumina). manufacturer’s tions the to according generated were libraries (cDNA) DNA complementary XT single-cell Nextera Illumina facturer’s instructions. manu the to according performed was amplification whole-transcriptome (Clontech) SMARTerchemistry chip. on performed were RT-PCR and Cell lysis imaged. manually were chip on captured cells exclude single to Viable cells. included dead was (Invitrogen) stain Live/dead chip. on capture cell C the onto loaded were 10 × 2.5 isolation, cell After simultaneously. cells RNA-sequencing. of up single to 96 amplification transcriptome and amplification C transcriptome Single-cell d. 3 additional an (PeproTech) for ng/ml) (15 IL-15 or U/ml) (10 IL-2 with cultured then were CD8 the using isolated were cells T After 1 (500 ng/ml). d peptide with gp33-41 of P14 activated CD8 activation, differentiation. and culture Cell recipient ( into mice transfer adoptive before (CFSE) ester succinimidyl diacetate 10 × 2 division, CD8 of isolation the For mice. ent 5 × 10 CD8 of isolation the For infection. after d 42 at ent mice at 7 d after infection and splenocytes and lymph nodes were harvested from were recipi isolated of mouse per units Splenocytes LCMV-Armstrong. 10 × 2 with later d 1 infection intraperitoneal by followed CD45.2 wild-type congenic into transferred adoptively were infection. and transfer viral Adoptive cell FACSCanto or Biosciences). FACSAria II, (BD C6, Accuri an on analyzed were samples All antibodies. intracellular with staining before lized and permeabi Services) Microscopy in fixed (Electron 4% paraformaldehyde then and antibodies surface with stained were cells °C; 37 at h 6 for (Sigma) A brefeldin of presence the in (KAVYNFATM) peptide (GenScript) gp33-41 of detection IFN- H-2D generate (NIH TetramerH-2D Core Facility) was conjugated to (Prozyme) streptavidin-PE to Biotinylated Technology. Cell and Biolegend from purchased were Flow Mito and Kit Detection Apoptosis V Annexin dilution. at 1:20 used was and Pharmingen BD from purchased was (11/Ezh2) Ezh2 to Antibody dilution. 1:100 at used was and Technologies Life from purchased IFN- V IL-2R (1M7), CD44 (2F1), KLRG1 (MEL-14), CD62L from Biolegend and were used at 1:100 dilution: CD8 cytometry. and flow Antibodies analysis. from experi infection and or was were used no For excluded no ments, blinding animals randomization old. weeks 6–8 male, were mice Recipient old. weeks P14 with crossed were and C57BL/6J were Wild-type conditions. mice All pathogen-free Diego. specific in San housed and bred California, of University the of Guidelines Use Mice. MET ONLINE nature immunology nature vitro in 1 β Single-Cell Auto Prep System (Fluidigm) was used to perform whole- perform to used was (Fluidigm) System Prep Auto Single-Cell 8.1/8.2 (KL16-133.18), IL-7R (KL16-133.18), 8.1/8.2 Ezh2 γ All animal work was approved by the Institutional Animal Care and and Care Animal Institutional the by approved was work animal All 4 (XMG1.2), TNF (MP6-XT22). Antibody to Granzyme B (GB11) was was (GB11) B Granzyme to Antibody (MP6-XT22). TNF (XMG1.2), n P14 CD8 = 24) and were harvested 2 d following LCMV infection. LCMV following d 2 harvested were and 24) = with 5 5 with fl/fl mice were obtained from the Jackson Laboratory. Laboratory. Jackson the from obtained were mice b 6 gp33 tetramer for flow cytometry analysis. For intracellular intracellular For analysis. cytometry flow for tetramer gp33 + P14 CD8 P14 H µ γ T cells per mouse were adoptively into transferred 24 recipi g/ml plate-bound anti-CD3 (145-2C11) and anti-CD28 anti-CD28 and (145-2C11) anti-CD3 plate-bound g/ml and TNF, CD8 ODS 5 to 2 × 10 × 2 to 1 Cd4 Single-Cell Auto Prep mRNA Array IFC for single- for IFC Array mRNA Prep Auto Single-Cell + T cells were first labeled with carboxyfluorescein carboxyfluorescein with labeled first were cells T Cre 6 mice. Donor mice were male or female, 6–8 6–8 female, or male were mice Donor mice. P14 CD8 P14 α Antibodies to the following were purchased to were purchased the following Antibodies + + Splenocytes obtained from P14 mice were were mice P14 from obtained Splenocytes T cells (from (from cells T (A7R34), T-bet (4B10), IRF4 (IRF4.3E4), (IRF4.3E4), IRF4 (4B10), T-bet (A7R34), T cells were stimulated + T cells that had undergone their first cell cell first their undergone had that cells T + T Isolation Kit (Miltenyi). CD8 (Miltenyi). Kit Isolation T + T cells sorted by flow cytometry cytometry flow by sorted cells T 5 × 10 For ChIP-seq analysis of Ezh2 Ezh2 of analysis ChIP-seq For + T cells at 4 d after infection, infection, after d 4 at cells T n 3 P14 CD45.1 = 4 mice) were activated activated were mice) 4 = α (53-6.7), CD45.1 (A20), α (PC61), V (PC61), ex ex vivo b 5 gp33 monomer monomer gp33 plaque-forming plaque-forming + recipient mice, mice, recipient + Ezh2 CD8 with LCMV α 2 (B20.1), 2 (B20.1), fl/fl + + T cells cells T T cells The The mice mice + ------

mRNA-stranded cDNA libraries were generated and sequenced on an Illumina by cytometry. and flow into mice sorted recipient transferred were adoptively 5 × 10 RNA-seq. Bulk-cell instrument. HighSeq4000 Illumina an on nt SE) (50 a reads on bp 250–650 to size by selected PippinPrep and to (Sage instrument sequenced Science), a depth of 30 million pooled, was library each of fmol 200 sample. per fmoles 200–600 yielded Amplification controls). or (IgG IPs) cycles 17 (H3K27me3 cycles 14 for amplified was and used was cells 500,000 coverage comparison of versus cells, wild-type chromatin Ezh2-deficient from for For IgG H3K27me3 the IPs, controls. for and 17 cycles Ezh2 IPs, 14 cycles and H3K27me3 for input controls II IPs, for 12 cycles pol 10 cycles used PCR to according by(NEB), the amplification manufacturer’sLibrary instructions. input controls (1%) using the NEB Next ChIP Preparation Library Reagent Set Sequence-indexed libraries were prepared from immunoprecipitated DNATechnologies). andLife (026102, IgG normal rabbit and Millipore) (12-371, IgG Millipore), mouse anti-RNA polymerase II (05-623, Millipore), mouse normal (07-449, rabbit anti-H3K27me3 Santa (H-80, Biotechnology), Cruz anti-Ezh2 10 × 1 from 3 and chromatin cells (IP), immunoprecipitation each hydro-shearing For E220 Covaris instrument. a using bp 300 of size average an to sheared facturer’s In were prepared brief, nuclear and extracts instructions. chromatin manu the to according (Millipore) kit ChIP EZ-Magna the using performed cells. 4 × 10 dead to exclude cytometry by flow for 4 d sorted and were X Bio Cell) (37.51, results in a dimensionality reduction that is invariant to reflection to invariant is that reduction dimensionality a in results and practice standard is transformations of composition 0.75 This = theta parameters). and 30 = perplexity (with 2 = D in visualize to bhSNE applied then only and D = 10, to down principal-component dimensionality the standard reduce to (PCA) applied analysis first we greater Therefore, cells all TPM). over 1 than expression mean a had which of over for (6,000 signal genes gave 22,000 cell each for data RNA-sequencing total our contrast, (approximately) ~40 = D is dimension meas original the cytometry where urements single-cell of millions visualize to used successfully been has and problem, crowding so-called the solves which objective non-convex (bhSNE) algorithm approximation (tSNE) embedding sto neighborhood t-distributed the chastic applied we plot, scatter 2-dimensional a in subtypes CD8 among expression gene of diversity visualization. the visualize and EXPR matrix expression cell-heterogeneity cell-by-gene the of and dimensionality the reduce reduction Dimensionality EXPR gene cell each for value TPM the Finally, gene. given a of transcripts tified quan all for values expression the summing by columns) = genes 22425 = G rows, = cells 288 = (C matrix expression cell-by-gene a into combined were Next, analysis. the per (TPM) outputsmillion transcript of for Kallisto all cells genome (mm10) STAR(v2.4.1b) by aligned and correction; bias out (v0.42.1) Kallisto by fastqc (v0.10.1) with metrics quality for checked were: reads RNA-seq raw cell, each For quantification. expression robust and (QC) control quality on focusing CD8 256 from data pre-processing. data RNA-seq Single-cell set. each in number gene the on depending analyses, gene-ontology 5 from ranged The distribution. Poisson a follows gene each of count read the that assumption the by calculated were genes of read-count expressed all genes in for total each sample. Differentially to to a gene according names and 1,000,000 was summed normalized Kallisto parameter: -1 200 –s 20 – single. The read count of each transcript derived from using GENCODE GRCm38.p4Kallisto transcriptomewith processed were as samples the bulk reference,The instrument. with the HighSeq4000 following g 4 c,g was natural log-transformed to yield a normalized expression value: value: expression normalized a yield to log-transformed natural was P14 wild-type CD8 = ln(1 + TPM + ln(1 = 4 8 ; poly-A and adaptor-trimmed with cutadapt (v1.8.1) cutadapt with ; and adaptor-trimmed poly-A µ 6 CD8 g of antibody were used. Antibodies used were as follows: rabbit rabbit follows: as were used Antibodies used. were antibody of g −5 5 o 5 to 2 + with default parameters for quality control and downstream T cells were in crosslinked 1% formaldehyde and ChIP was + 5 o ioain f CD8 of isolation For T cells were processed with a bioinformatics pipeline pipeline bioinformatics a with processed were cells T 0 −6 to a reference transcriptome (Gencode vM3) (Gencode transcriptome reference a to c,g for selection of differentially expressed genes for for genes expressed differentially of selection for ) 5 . + 4 T cells ( . tSNE is an unsupervised technique based on a on based technique unsupervised an is tSNE . P value threshold was Bonferroni-corrected and and Bonferroni-corrected was threshold value n = 4) or Ezh2-deficient CD8 + Single-cell mRNA sequencing sequencing mRNA Single-cell T cells at 4 d after infection, infection, after d 4 at cells T 5 1 to the reference mouse mouse reference the to 5 3 + via its Barnes-Hut Barnes-Hut its via T cells of different different of cells T doi: 10.1038/ni.3688 + T cells ( 4 9 ; quantified ; quantified 5 55– 4 8 8 . After After . with n c 5 and and = 8) 8 To . In . 4 7 6 - - - - - ,

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. In contrast, the ‘effector memory’ and ‘central memory’ lineages start from from start lineages memory’ ‘central and memory’ ‘effector the contrast, In Div1 the through Specifically, the ‘effector’data RT-qPCR of lineage courses starts time from similar the with work naive published population, of basis andthe on progresses a priori were determined orders These lineage. in a ordered particular lations each from cells population 50 and constructed all trajectories through the replacement cross-product of popuwith sampled we brief, In lineage. each for courses time differentiation hypothetical constructed we lineages, memory stand the temporal dynamics of expression for key genes along the effector and Temporal paths. through inferred lineage expression trajectories predictions. score ‘memory-ness’ their in agreement seeming their with contrast to computed was classifiers two the for natures score GINI by their from all G were The genes = signature for selected genes 22,425 each classifier analysis. this for redundant is and score ‘memory-ness’ the minus 1 is scores plotted and were shown to correlate ( mediate day 4 CD8 inter unseen previously provided both were their they on subpopulations, trained respective were classifier early-state and classifier fate the both After (ref. 90% remaining the on training while set test held-out 10% each on was computed curve (ROC) by combining ating the predictions characteristic tive training sets and evaluated using tenfold cross-validation. A receiver oper set for another early state classifier. Both classifiers were provided their respec and Div1 Div1 into T cells of daughter segregation observed Using newly the T cells. (T trees randomized unseen cells. Each classifier constructed an independent ensemble of extremely previously of ‘effector-ness’ or ‘memory-ness‘ the predict to used be can also subpopula butabove) described gene expression as cell (such the differential tions examined T the distinguish only not that signatures gene-expression Div1 ( subtypes division first our as such samples small two between even distributions, cumulative empirical the between difference largest the finds mate normality; and the Kolmogorov-Smirnov two-sample (KS2) test test rank-sum MWU) as known also (MWW; Mann-Whitney-Wilcoxon the expression: of heterogeneity for dropout to due bimodal be does not conform to the usual negative binomial distribution expression gene single-cell Since EXPR. matrix expression log-transformed all pairs of T cell sub-populations from two non-overlapping sets of rows in the results. interpretable more give to information extra the utilize which which are blind to the cell type labels, we also applied methods two supervised dimensionality reduction (PCA and tSNE) and hierarchical clustering methods of analysis data. gene expression Supervised SP_PIR_KEYWORDS, GOTERM_MF_FAT, categories. target for chosen were UP_SEQ_FEATURE GOTERM_BP_FAT, used were and threshold default and background Default analysis. for DAVID analysis. (GO) Gene-ontology green). and (purple cells T memory the of that to than (yellow) cells 7 day all of centroid the to space) tSNE (in closer Div1 remaining The T T all all and of (purple) cells centroid overall the to space) tSNE (in closer were they because Div1 proposed the Specifically, cells). ter (T T of cells effector clusters differentiated the of minally centroids the to proximity their to according terplot ( plot tSNE our in dots) (red 1 T cells of division clusters distinct two observed we Since two-dimensional population. cell T resulting corresponding its of stage the the by colored on was plot scatter point each reduction, dimensionality doi: n = 36) and Div1 and 36) = CM For the cell type classifier, we trained two T binary to cell classifiers identify For differential gene expression analysis, we performed this analysis between 10.1038/ni.3688 and T MEM EM Fig. 2 Fig. subpopulations after the first division, we built a second training ) ) populations, we built a training set for a fate classifier for CD8 6 MEM a TE 4 + ), we re-colored those cells distinctly for the inset scat inset the for distinctly cells those re-colored we ), . Using the terminally differentiated effector and memory and memory effector differentiated . Using terminally the 6 T cells. Their predicted ‘memory-ness’ scores were scatter- subpopulation, then onto day 4, and finally to day 7. 7. day to finally and 4, day onto then subpopulation, 5 TE . The surprisingly small overlap in gene-expression sig gene-expression in overlap small surprisingly . The ( EM cells ( cells n = 24). = cells (green) than to that of all day 7 cells (yellow). (yellow). 7 cells day of all that to than (green) cells 6 1 Fig. 2 Fig. , we used two non-parametric statistical tests tests statistical non-parametric two used we , 6 2 Generated gene lists were uploaded to to uploaded were lists gene Generated , which relies on a large sample to approxi to sample large a on relies which , a Fig. Fig. 3 , inset) remained red because they were were they because red remained , inset) MEM TE f cells (inset) were re-colored blue blue re-colored were (inset) cells cells) and memory T cells (T T cells memory and cells) ). ). For each T cell, its ‘effector-ness’ In contrast to the unsupervised 59 , 6 0 and can even To under 6 3 , , which MEM 6 CM 5 TE TE 7 ). ). + ------.

considered significant. Data are presented as the ratio of H3K27me3-marked of H3K27me3-marked ratio as the are presented Data significant. considered were bp 600 than greater changes coverage ChIP TSS absolute and 0.5 than In covered. significantly considered over input was coverage 3× read and any with region derived, was region TSS each of intensity way, read the In similar a sample. each for obtained reads of for both the input reads that were in located TSS regions and the total number normalized region, TSS the in located were that reads total the of sum the by and effector cells, respectively.memory Reads intensity from around TSS ( TSS around coverage cell naive deducting by calculated regions were calculated by BEDtools Intersect expressed 6,000 ( genes the for (TSS) site start transcriptional the around changes coverage analyze To values). (‘fold’ 4.0 as threshold input over enrichment ‘Homer Poisson by with command called histone were findPeaks’-style peaks ChIP default. as options other all and Fastx; outReadsUnmapped 1; limitOutSJoneRead 1; outFilterMultimapScoreRange None; 10; options: outSAMunmapped the following outFilterMultimapNmax with STAR with genome (v2.4.1b) reference to mm10 were mapped infection CD8 on analysis. data H3K27me3-coverage it. around area shaded a as interval confidence 95% the and presents segment line a with solid sample population for each expression plot time-series package software Seaborn a by summarized visually were trajectories bootstrapped These respectively. cells, Div1 the through naive, considered significant. considered ( by ChIP fied identi H3K27me3, or Ezh2 by targeted genes of comparisons for used was ( groups two involving comparisons for used ( expression differential significantly ( classifiers of by score to prediction memory supervised the assess significance analysis. Statistical samples. single-cell all across 1 = TPM expression mean had genes all that ensure to calculated was Div1 in that to relative cells 4 day in or loss of gain either expression with genes among total genes or Ezh2-bound In H3K27me3. by ≥ in of or Ezh2 targets H3K27me3 Ezh2 or H3K27me3 negatively correlated with TPM changes. To be included as Fisher’s test exact was genes to whether targeted then byperformed determine twofold. than more changed expression gene bulk if decreased or increased as marked were genes naive), vs. memory naive, vs. (effector comparisons of bulk gene expression TPM was from calculated data. single-cell In the two sets on gene expression in memory and effector differentiation ( and Ezh2 of H3K27me3 impact the In to quantify order cells. or effector cells memory from TSS around coverage cell naive deducting by calculated then regions were by calculated BEDtools ( genes expressed 6000 the of TSS of coverage the get to order In default. as options mand using input as background with following options: size 100; and all other were peaks by identified using the HOMER software package ‘findPeaks’ com 0 L 1000; minDist 200; size options: input following ‘findPeaks’ using with command, as background HOMER by called were regions ChIP H3K27me3 0.01. iterNorm normGC; command and the with ChIP HOMER input into data were converted tag then directory limi Fastx; and options as other all default. 1; outReadsUnmapped tOutSJoneRead 1; outFilterMultimapScoreRange 10; outFilterMultimapNmax None; genome with STAR (v2.4.1b) analysis. data ChIP Ezh2 expression. decreased with genes total over genes Fig. 3 Fig. 100-bp TSS region covered by Ezh2 peaks or or peaks Ezh2 by covered region TSS 100-bp f Fig. 6a Fig. Fig. 6 Fig. ). Pearson correlation was used to determine the genes with the most most the with genes the determine to used was correlation Pearson ). + T cells sorted at 8 d (effector cells) and 60 d (memory cells) after after cells) (memory d 60 and cells) (effector d 8 at sorted cells T 6 8 ‘makeTagDirectory’ with the following options: keepOne; tbp 1; 1; tbp keepOne; options: following the with ‘makeTagDirectory’ a Fig. 6c Fig. , ), the overlap of peaks and the 20 bins of 100 bp around TSS TSS around bp 100 of bins 20 the and peaks of overlap the ), d ), the overlap of peaks and the 20 bins of 100 bp around TSS TSS around bp 100 of bins 20 the and peaks of overlap the ), Figure 6 Figure Pearson correlation and Spearman correlation were used correlation and Spearman Pearson correlation , e ). Differences with an associated an associated with Differences ). ; and all other options as default. Ezh2 and PolII ChIP ChIP PolII and Ezh2 default. as options other all and MEM All ChIP data were mapped to mm10 reference reference mm10 to mapped were data ChIP All e 5 , data are presented as a percentage of unbound unbound of percentage a as presented are data , subpopulation, ending with T with ending subpopulation, 1 with the following options: outSAMunmapped Figures 6 Figures 6 Figure 6 Figure Data from H3K27me3 ChIP performed performed ChIP H3K27me3 from Data 7 TE . . The changes in coverage ( Fig. 4 Fig. cells. In In cells. and b 6 c ). Student’s unpaired unpaired Student’s ). 7 , absolute TPM changes greater , greater TPM changes absolute . . The coverage change was then

Fig. 5d- Fig. 7 ≥ , genes were required to have , were required genes 6 Figure Figure 600-bp TSS region covered covered region TSS 600-bp P 6 nature immunology nature , which links the average average the links which , value cutoff as 0.1% and and 0.1% as cutoff value Fig. Fig. 6 k P ). Fisher’s exact test test exact Fisher’s ). 7 value of <0.05 were of <0.05 value , normalized TPM TPM normalized , Fig. Fig. 6c EM b ) ) was calculated cells and T and cells Fig. Fig. 6 , e ), ), artificial t -test was was -test a ) ) were CM - - -

© 2017 Nature America, Inc., part of Springer Nature. All rights reserved. 57. 56. 55. 54. 53. 52. 51. 50. 49. 48. 47. codes accession GEO availability. Data nature immunology nature

Bendall, S.C. B. Becher, Amirel, A.D. Barnes-Hut-SNE. L. Maaten, t-SNE. using data high-dimensional Visualizing L.H.G.E. Maaten, C. Sequencing, Genome Mouse A. Dobin, accurate and Tse,Fast & D.N. Pachter,L. J.M., Zhang, V.,G.M., Ntranos, Kamath, Illumina pre-processing Wu,for & Software H. C.H. Huang, S.S., Khaleel, C., Chen, metagenomic of preprocessing and control Quality R. Edwards, & probabilistic R. Schmieder, Near-optimal L. Pachter, & P. Melsted, H., Pimentel, N.L., Bray, acrosshumanhematopoietica continuum. Immunol. Nat. revealsphenotypic heterogeneity leukemia.of Res. genome. mouse the of (2013). 15–21 counts. transcript-compatibility of clustering by Biol. Genome analysis RNA-seq single-cell (2014). sequences. read short sequencing next-generation datasets. quantification. RNA-seq

9 , 2579–2605 (2008). 2579–2605 , Bioinformatics t al. et t al. et et al. etal.

17

15 Source RNA-seq and ChIP-seq data sets are available with with available are sets data ChIP-seq and RNA-seq Source TR utaat nvra RAsq aligner. RNA-seq universal ultrafast STAR: , 112 (2016). 112 , Single-cell mass cytometry of differential immune and drug responses ihdmninl nlss f h mrn meod el system. cell myeloid murine the of analysis High-dimensional viSNE enables visualization of high dimensional single-cell data and , 1181–1189 (2014). 1181–1189 , GSE8940 Nature

Nat. Biotechnol. Nat. 27 , 863–864 (2011). 863–864 , arXIv

420 5 et al. et and and

, 520–562 (2002). 520–562 , https://arxiv.org/abs/1301.334 Initial sequencing and comparative analysis comparative and sequencing Initial GSE8903

Science 34 Nat.Biotechnol. , 525–527 (2016). 525–527 , 6

. 332 ore oe il Med. Biol. Code Source , 687–696, (2011).

31 , 545–552, (2013). Bioinformatics 2 J. Mach. Learn. Mach. J. (2013).

9 29 8 , ,

68. 67. 66. 65. 64. 63. 62. 61. 60. 59. 58.

una, .. ETos Te ws-ry ol o gnm faue analysis. feature genome for S. Heinz, tool Swiss-Army The BEDTools: A.R. Quinlan, M. Setty, Python. in learning machine Scikit-learn: A. F.V.Gramfor, Pedregosa, G; comparisons. some and fit of trees. randomized Extremely L. P.E.Wehenkel, Geurts, D.; goodness for statistics EDF M.A. Stephens, for approach nonparametric a patterns: consistent Finding R. Tibshirani, & J. Li, fold of J. Fan, estimation Moderated S. for package Anders, Bioconductor a edgeR: G.K. Smyth, & McCarthy,D.J. M.D., Robinson, & W. Huber, M.I., Love, of analysis Categorical E.W. Newell, & L. Maaten, der van M.T.,Y.,Wong, Cheng, (2006). Assoc. Stat. Am. J. (2013). 519–536 rm csrgltr eeet rqie fr arpae n B el identities. cell B and macrophage for Cell Mol. required elements cis-regulatory prime Bioinformatics. Protoc. Curr. data. single-cell Res. Learn. data. RNA-Seq in expression differential identifying analysis. overdispersion set (2010). 139–140 data. expression gene digital of analysis expression differential DESeq2. with (2014). data RNA-seq for dispersion and change nonlinear by soli-expression one-dimensional with embedding. stochastic heterogeneity cell T human et al. et

38 t al. et t al. et

12 Characterizing transcriptional heterogeneity through pathway and gene and pathway through heterogeneity transcriptional Characterizing , 576–589 (2010). 576–589 , , 2825–2830 (2011). 2825–2830 , ipe obntos f ieg-eemnn tasrpin factors transcription lineage-determining of combinations Simple Wishbone identifies bifurcating developmental trajectories from trajectories developmental bifurcating identifies Wishbone Nat. Biotechnol. Nat.

69 , 730–737 (1974). 730–737 , J. Immunol. J. Nat. Methods Nat.

47 , 11–34 (2014). 11–34 ,

34

196 , 637–645 (2016). 637–645 , , 924–932 (2016). 924–932 ,

13 , 241–244 (2016). 241–244 , Stat. Methods Med. Res. Med. Methods Stat. eoe Biol. Genome Mach. Learn. Mach. doi: Bioinformatics 10.1038/ni.3688

62 15 J. Mach. J. , 3–42 , 550 ,

22 26 , ,