© 2011 Nature America, Inc. All rights reserved. H.Y.C. ([email protected]). University of Oslo, Oslo, Norway. Genetics, Institute for Cancer Research, Oslo University Hospital Montebello, Oslo, Radiumhospitalet, Norway. of Medicine, Baltimore, Maryland, USA. Cancer Comprehensive Center, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA. Dartmouth Medical School, Hanover, New Hampshire, USA. Hill, North Carolina, USA. of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA. and Artificial Intelligence Laboratory, Institute Massachusetts of Technology, Cambridge, USA. Massachusetts, of Medicine, Stanford, California, USA. 1 transcriptional in implicated been have lncRNAs several to elements enhancer produce are ncRNAstranscribed also coding intergenic ncRNAs,(long nearby or orwithin lincRNAs), mayinterspersed be or they genes coding protein from isolation in located be in vary length from hundred several bases to infidelity tens of kb; lncRNAs may polymerase RNA of byproducts represent just or ncRNAs functional are these if However,uncertain remains it initiation RNAs (tiRNAs) and TSS-associated RNAs (TSSa-RNAs) ously named promoter-associated small RNAs (PASRs),vari transcription-been have which nucleotides, 200 to 20 from ranging ncRNAs (TSS), divergent transcription by RNA polymerase can generate small defense and regulation post-transcriptional genome in function (piRNAs), RNAs Piwi-associated and (miRNAs) microRNAs (siRNAs), RNAs interfering small including ncRNAs, regulatory Small transcribed. are (ncRNAs) RNAs coding expected Mammalian genomes are more pervasively transcribed than previously for depletion p53-dependent damage. expression regions the external Transcription Michael L Whitfield Anita Langerød Hugo M Horlings Tiffany Hung RNAs within promoterscell-cycle Extensive and coordinated transcription of noncoding Nature Ge Nature Received 19 August 2010; accepted 6 May 2011; published online 5 June 2011; Program Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA. Although evidence for function of IncRNAs as a group is lacking, lacking, is group a as IncRNAs of function for evidence Although

promoter promoters

that

1–

stimuli, DNA

4 markedly n

. . In many ofgenes, to addition protein-coding non types in etics encode

lncRNAs of

damage of

1

manner.

long ,

but 2

56 , , Yulei Wang 14 14 ADVANCE ONLINE PUBLICATION ONLINE ADVANCE

sensitized

, Anne-Lise , Børresen-Dale Anne-Lise 10 , putative cell-cycle

1 8 the

cancers noncoding 5 Department Department of Biochemistry 1, Hamamatsu University School of Medicine, Higashi-ku, Hamamatsu, Japan.

. Moreover, recent evidence suggest that active , , Nilay Shah

in induces 9

, , Manolis Kellis scope PANDA

cell-growth 16 These These authors contributed equally to this work. should Correspondence be addressed to D.J.W. or ([email protected])

lncRNAs, and

human genes

and 3 five 13 Life Life Technologies, Foster City, California, USA.

5–

3 RNAs interacts

Department Department of Biology,Developmental Stanford University School of Medicine, Stanford, California, USA. , , Michael F Lin are 7

. . Near transcriptional start sites

functions lncRNAs 11

to

control.

regulated fibroblasts

, , Christopher Umbricht

(lncRNAs)

many interrogate

4 with , 5

, , Yue Xiong from with of

10

the in such

Department Department of Pathology, Academic Medical Center, Amsterdam, The Netherlands.

to

within

12 RT-PCR–validated expression the

1 4 14 transcription

, apoptosis , 1 7

5 08

3 activity Department Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel , regulation. regulation. , , Ashley K Koegel

15 . lncRNAs lncRNAs . CDKN1A 16

samples , , Seung K Kim

, 1 7 6 . , , David J Wong 8–

are

by

1 by regulatory 1 - - .

promoter,

representing specific factor doi:10.1038/ng.84 not doxorubicin. 12 the genomic loci of cell-cycle genes may harbor other functional functional other harbor discovered. be to yet have may that ncRNAs genes cell-cycle of loci genomic the in genes as imprinting, regulate gene expression in studied lncRNAs, such as those involved in dosage compensation and Many formation. of well as induction heterochromatin well- through with sion correlation inverse an with leukemias human in the CDKN2B CCND1 of repression in resulting liposarcoma), in acetyltransferases, histone inhibits (translated which TLS protein RNA-binding the vates complex repressor of transcription lates of upstream kb 2 genes. cell-cycle of loci In genomic the the in are examples prime Two , , Pei Wang

periodic known. 4 HOTAIR The The Broad Institute, Cambridge, USA. Massachusetts, p15 19

NF-YA

, CCND1 2

oncogenic 2 elements , coding sequence, and and sequence, coding 0 1 transcription. A example second transcription. is an ncRNA antisense gene 13

. . trans and , 2 - CDKN2B , , Marc van de Vijver

AS1 1 , , Yojiro Kotake Here

2

and , diverse expression

Howard Howard Hughes Medical Institute, Stanford University School

16 These to

one 22– (encoding cyclin D1) promoter, an ncRNA transcribed 13 & Howard Y Chang (also known as as known (also 8

limit linc-p21

, , Yu Wang 2

12 we can

4 CCND1

such . Inspired by these examples, we hypothesized that we . hypothesized examples, by Inspired these stimuli, Department Department of Surgery, Johns Hopkins University School

1 - findings

8 AS1 perturbations.

use . This ncRNA binds to and allosterically acti allosterically and to binds ncRNA This .

expression 15 6 modulate CCND1 Lineberger Comprehensive Lineberger Cancer Comprehensive Center, University

Institute Institute for Clinical Medicine, Faculty of Medicine, lncRNA, during can transcriptionally silence silence transcriptionally can

, can regulate the activity of distantly located of located , distantly the can activity regulate an is induced by ionizing radiation and regu and radiation ionizing by induced is

stem

ultrahigh-density suggest 3 6–8 in in , , Benjamin Kong CDKN2B

p15AS the

, , Gavin D Grant

gene

cell cis of named 10

cell by forming a ribonucleoprotein ribonucleoprotein a forming by We pro-apoptotic

potentially , , Saraswati Sukumar differentiation

or or activity cis 1

- identify cycle, ,

AS1 2 2 PANDA ANRIL 1 , 9 16 , but other lincRNAs, such Department Department of Genetics, expression is increased increased is expression

in show array

) that overlaps with with overlaps that ) 2 widespread

5 , response s e l c i t r A Computer Computer Science

1

is 3 11 genes; 6 , 9

Sidney Sidney Kimmel

induced , that altered

14 or transcribed p15

Department Department of

DNA p15 directly as as directly

tiles

PANDA

to expres

roles 11 in

, a



- - -

© 2011 Nature America, Inc. All rights reserved. overlapped by a minimumthat of 50 bases transcripts and identified all clusters that clustered were arrays, 54 the of channels 108 all from transcripts significant statistically compiled Wethen bp. 50 least at of (peaks) regions contiguous detected and background above nals ( or carcinogenesis stimuli oncogenic stimuli, differentiation damage, DNA synchronization, cell-cycle through perturbed or altered were that cells human various RNAs from polyadenylated of pairs 54 ized formation tumor and response DNAdamage self-renewal, senescence, as such processes biological fundamental and include those that encode all known cyclins, CDKs and CDKIs ( to upstream 2 kb of genes to downstream each TSS from 53 cell-cycle CDKN2A at 25 across encompasses 5-nt kb resolution (which of locus the 9p21 interrogates that array tiling a created we genes, cell-cycle human of To systematically discover functional ncRNAs in the regulatory region Extensive RESULTS response. DNA damage the in function that lncRNAs of set specific a identify and transcription noncoding and choreographed a map of extensive reveal results Our samples that interrogate in distinct perturbations cell-growth control. We analyze a diverse of collection cells and tissue (CDKIs). inhibitors kinase dependent cyclin-dependent kinases (CDKs) and cyclin- cyclins, all encoding genes including genes, chromatin cell-cycle 56 of and TSSs the around landscape interro transcriptional to the gate microarray tiling resolution and also See arrays. 54 all across signals transcription average of window running 150-bp a represents output The window. 12-kb the across signal hybridization RNA average the calculated and TSS the at promoters cell-cycle all We aligned promoters. ( potential. coding protein no indicates 10 < CSF transcripts. predicted their and genes coding of exons the of CSF evolutionary average the of Graph analysis. (CSF) frequency substitution ( treatment. doxorubicin without or with fibroblasts lung fetal human in expressed peaks transcriptional across calculated input the to relative signal ChIP-chip average The ( peak from one of the 108 array channels. black boxes. Each bar represents a significant in red boxes. Known mRNA exons are shown in fibroblasts. The predicted transcripts are shown input at the and H3K4me3 ChIP-chip signals relative to the RNA hybridization intensity and H3K36me3 ncRNAs. ( strategy for systematic discovery of cell-cycle within cell-cycle genes. ( Figure 1 s e l c i t r A  intensity signal the Averaging samples. the of 10% at least in present e c Supplementary Table 2 Supplementary ) Transcriptional landscape of cell-cycle cell-cycle of landscape Transcriptional ) regions. transcribed the at state Chromatin ) A peak calling algorithm searched for statistically significant sig significant statistically for searched algorithm calling peak A ultrahigh- an create we study, this In S upeetr Tbe 1 Table Supplementary upplementary Figure 1 Figure upplementary

Identification of ncRNAs near and b (

) Representative tiling array data. noncoding CCNE1 p16 ), ), p14ARF locus in human fetal lung S a

upplementary upplementary transcription ) Flow chart of the and and ). . CDKN2B ). These genes are also critical for for critical also are genes These ). d ) Codon Codon ) T able 1 able

near ( p15

cell-cycle -

)), as well as from 10 kb kb 10 from as well as )), 25– 2 d b a 7 104 samples . Thus, we hybrid we Thus, . Peak calling peaks from CSF score H3K36me3 –100 Significant 104 samples Custom high density tiling H3K4me3 –50 100 150 200 250 RNA fro - Cancerandnormaltissue - Cellcycle–synchronizedcells - Cellcycle–arrestedcells - Cellstransducedwithoncogenes - ESCsanddifferentiatedcells Samples:

50 genes arrays 0 RNA mRNA m - Fig. 1 Introni Known exon New transcribedregion Identify candidate ncRNA c 28,416 RNA a peaks on all arrays Upstream ncRNAs - - Fig 1b

analysis confirmed that the chromatin confirmed analysis state at a majority of the newly This (ChIP-chip). array tiling custom our to fol hybridization by lowed immunoprecipitation chromatin performing by transcription formarked active were regions similarly transcribed 216 the whether transcription coding bodies, and starts respectively gene reflect which (H3K36me3), H3 histone of 36 lysine and (H3K4me3) 4 lysine on H3 histone of trimethylation 3 the of downstream located were (2%) 216 the of 5 and (‘intronic’), introns (‘upstream’),genes 40 of cell-cycle within were the 216 (19%) located 5 located were regions transcribed unidentified previously (79%) 216 the of one seventy hundred One nt). 50–1,494 3 Table a regions ( total of identified 216 transcribed discrete in is shown fibroblasts lung fetal genes cell-cycle 56 the of ( exons known with overlap not did that regions) a (with transcribed range of regions 14–189 216 transcribed regions. In each individual sample, transcribed we detected an average ofreported 73 of the previously detect can array tiling custom this that showing 56), the of (41 genes coding cell-cycle from of majority the transcription 1 exon detected procedure our selection, RNA 3 possible Despite abundance. script tran of estimate quantitative a produced peak a in probes all across Supplementary Fig. 1 Fig. Supplementary Genes actively transcribed by RNA polymerase II are marked by marked are II polymerase RNA by transcribed actively Genes potential coding No potential coding Increasing present in<10%samples Eliminate peaks Align allpeaks ′ ). The average transcript length was 234 nt (with a range of of range a (with nt 234 was length transcript average The ). end of of end 2 8 . These chromatin marks can be used to identify non identify to used be can marks chromatin These . e CDKN2A a RNA expression DVANCE ONLINE PUBLICATION ONLINE DVANCE 0.5 1.0 1.5 2.0 2.5 3.0 –10,000 0 Find ncRNAsthatchang in expressiondifferen 1 various conditions 216 RNAsunder biological states 4 . In a subset of our samples, we determined determined we samples, our of subset a In . ; an example of the the of example an ; –8,000 CCNE Fig 2,4 . –6,000 Distance fromTSS(bases) Fig. 1 Fig. t c ncRNAs e mRNAs –4,000 between ncRNAsandmRNA n FL3 ′ bias caused by polyadenylated polyadenylated by caused bias =36 expression relationship Infer functionbasedon b ). Across all 108 samples, we samples, 108 all Across ). –2,000 lncRNAs vs mRNA Fig

CCNE1 H3K4me3

H3K36me3 3 s

FL3+DOX 0 ′ ncRNAs Cell-cycle genes Nature Ge Nature of the TSS of the the of TSS the of mRNAs n =60 locus in human human in locus 2,000 Supplementary Supplementary s s Functional validation ChIP/input 0 siRNA assays of ncRNA of ncRNA Fig 5–7 4,000 H3K4me3 1.5 n H3K36me3 etics s s - - - © 2011 Nature America, Inc. All rights reserved. sion of these ncRNAs in order to infer possible biological functions. functions. biological possible infer to expres order in ncRNAs these of sion regulate that conditions biological the examined we Next, Expression TSS. immediately the around clustered not are and longer major are the ncRNAs TSS, these of the of ity bp 100 within located primarily are which TSSaRNAs, and RNAs tiny PASRs, described previously the unlike in 4–8 the region kb of Thus, upstream the TSS. transcription coding non of enrichment found we addition, In gene. protein-coding the of downstream immediately to the peak TSS corresponding exon 1 of to relative ( samples TSS their 108 all of loci coding protein- 56 at all signals RNA hybridization RNAsnoncoding (lncRNAs). We the aligned long as regions transcribed these to refered we Thereafter, browser. genome UCSC in the annotated as RNAs specific Cajal-body small or RNAs nucleolar small box RNAs, H/ACA nucleolar small box C/D miRNAs, pre- known intersect transcripts of none the ( genes protein-coding known not are scripts tran of the majority the that confirmed ysis anal BLAST potential. protein-coding have not do regions new the of most that gesting ( known genes ncRNAsand coding resembled protein- known of threshold the below well scores CSF had regions transcribed new the of 86% over However, scores. CSF high had exons annotated with coincided that regions genomes mammalian sequenced 21 across sequences protein-coding of signatures characteristic evolutionary for assess frequency to analysis substitution (CSF) codon a used we RNAs, noncoding or unknown exons protein-coding previously encode may scripts regions are actively transcribed. providing further evidence that these genomic ( binding II polymerase RNA and regions were enriched for H3K4me3 ( attributes against approximately 900 published genomic transcripts our query to analysisEpiGRAPH both in ( H3K36me3 and H3K4me3 enriched was transcripts defined cluster 1. See also change. ( ncRNA. Black indicates no significant expression cycle perturbation repressed transcription of the of the ncRNA. Green indicates that the cell the cell cycle perturbation induced transcription representing 108 conditions. Red indicates that of 216 predicted ncRNAs across 54 arrays, cycle perturbations. ( Figure 2 Nature Ge Nature new 216 the of changes expression the of matrix a assembled We Fig. upeetr Tbe 3 Table Supplementary o eemn wehr h 26 tran 216 the whether determine To

1 d

and and b ncRNA expression across diverse cell 3 ) Close up view of the ncRNAs in 2 0 9

Fig. 1 Fig. . As expected, the transcribed transcribed the expected, As . n patterns , the 216 putative transcribed transcribed putative 216 the , etics Supplementary Table 3 Table Supplementary e S ). As expected, we found a found we As expected, ). upplementary

a

) Hierarchical clustering of ADVANCE ONLINE PUBLICATION ONLINE ADVANCE

ncRNAs ). Furthermore, Furthermore, ). Fig. 1b T

suggest ables 2,3 P , c < 10 < P ). Using < 10 ), sug ),

specific . −7 −9 ), ), - - - - )

biological b a

functions - - - expression profiles of keratinocytes with knockdown of P63, which which P63, of knockdown with keratinocytes of profiles expression ( carcinogenesis and self-renewal in role a play may lncRNAs these of subset a that suggesting 216), of 35 as many (as normal to relative carcinomas breast ductal invasive 216) of and (40 pancreas fetal 152 day to relative (ESC) cells stem onic embry the were change expression twofold least most at with the transcripts had that samples The roles. functional have may regions transcribed the of subset large a that suggesting of one perturbations, least the at in array tiling the on detected expression in change twofold a least at had (43%) 92 regions, transcribed new 216 the Of ( samples and genes the tered ­transcribed regions across all 54 perturbations and hierarchically clus 216 transcriptionalunit s –2.0 Fold change Fig. 2 Fig. +2.0 a and and MYC overexpressioninkeratinocytes p63 knockdowninkeratinocytes Human ESCs Myc-RAS-I Breast carcinomas Fibroblasts inlowserum Ca E2F3-RAS-I Doxorubicin-treated fibroblasts Breast carcinomas 2+ Fig. 2 Fig. Supplementary Table 4 Table Supplementary -differentiated keratinocytes κ a B tumors κ B tumors ). Notably, lncRNA lncRNA Notably, ). s e l c i t r A upst upst upst upst upst upst int: upst upst upst upst upst upst upst dst: upst upst upst upst upst upst upst upst upst upst dst: int: int: upst upst upst upst int: upst upst upst upst int: int: upst int: upst upst int: CDKN2 CDK CDKN2 CDK5R CCNT CDKN1 CDKN2 CDK CDKN2A CDKN2A : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : CDKN1 CDK5R CNNM CDKN2 CCNC CCNYL CDKN1 CDK CDK5R CCNL CCNL CCNL CCNL CCNL CCNL CCNL CDK5R CDK5R CDKN1 CCNI CCNI CCNI CCNF CDK CDK CDK CDK5R CCNE2 CCNL CDK5R CCNL CCNL CDKN2 CDK

6 6 :143 :127 1 4 6 9 8 6 :602 :–688 :–662 :–639 C B 1 A D :–7794 :–1860 :–64 :–79 :–53 1 1 1 1 2 2 2 2 2 2 :–172 :–6760 3 :183 :142 :15 :1926 :1417 :–276 :–238 :–223 :–196 :–767 :–225 :–139 :–982 :–311 :–554 2 :–68 C 2 B A 2 1 1 A 1 1 C :39498 :43877 :–97 6 :–36 :–6045 :–6418 :–4410 :–4044 :–48 :–5717 :–5830 :–4845 :–219 :–281 :–803 6 8 3 9 3 1 8 0 2 1 0 7 3 4 8 3 1 0 0 2 6 7 7 ). ).  - -

© 2011 Nature America, Inc. All rights reserved. be functional in the cell cycle, in self-renewal and in cancer. in self-renewal cycle, cell in the functional be ncRNAs that these suggest conditions may and cellular ent biological lation of the dynamic expression patterns of these ncRNAs and differ response or are spliced together as exons of one transcript. High corre multiple adjacent ncRNAs are either coordinatelythat suggesting regulated in locus, a sharedgenomic same the from ncRNAs the of many ‘stemnessa be ( cluster’ GFP-RAS-I and cells differentiated MYC-RAS-I and P63-knockdown with nocytes kerati ESCs, in induced strongly are that lncRNAs of composed is ( samples all across pattern expression their fide bona lncRNAs of in not mRNAs only pattern these but also expression the RAS-I E2F3- MYC-RAS-I and ESCs the with clustered four of cancers the lncRNA profiles: on their based groups different two into split carcinomas breast inva human ductal sive primary eight addition, In lncRNAs. of majority the for expression of pattern inverse an had program, expression gene mRNA ESC-like the not do express which keratinocytes, transduced for embryonic and cancer stem cells. In contrast, the E2F3-RAS-I tered with that of ESCs ( Notably, the lncRNA expression profile of MYC-RAS-I expression program and acquire properties of cancer stem cells stem cancer of properties acquire and program expression gene mRNA ESC-like an activate keratinocytes human transduced MYC that showing together, clustered to controls relative oncogenes other oncogene the with transduced were that samples keratinocyte five from patterns Expression state. undifferentiated the in role a have may ncRNAs these that gesting that of sug with ESC, clustered differentiation, keratinocyte inhibits s e l c i t r A 

c a The 216 lncRNAs are divided into three main clusters based on based clusters main three into divided are lncRNAs 216 The Frequency int

has a dominant effect on ncRNA expression. MYC-RAS-I expression. ncRNA on effect dominant a has :CDKN2B

upst: 10 20 30 40 50 60 0 CNNM1 terms –1.0 :1926 human cancers. human upst: ncRNAs :–2645 –0.8 CCND2 upst: –0.6 κ upst:CNNM4:–9042 Pearson to B –0.4 CCND1 α upst: :–3840

κ –0.2 tumors, suggesting that these tumor models mimic tumor that models these tumors, suggesting CCNK:–2768 B

α dst:int :–4426 upst: :CCNG 0 cis tumors, and the other four clustered with the the with clustered four other the and tumors, CDKN2A 0.2 CDKN2B Fig. 2 Fig.

1 mRNA Fig. Fig. :381 upst: :8650 0.4 upst:upst:CCNF:–2817 0.6 CDKN1ACCNI

2 :–1721 b

), ), suggesting a shared lncRNA signature upst: :–6398 0.8 ). Notably, ). of composed is cluster each :–4845 upst: CCNC

MYC 1.0 CCNL2 :–5405 upst: κ

upst:CDK5R1:–1391 b B tumors, which we interpret to to interpret we which tumors, B

alone or in combination with with combination in or alone CNNM3 Frequency

int :–4410 200 250 : 100 150

CDKN2C:–9238 50

–1.0 0

Fig. Fig. :643 –0.8

Protein-RNA complexassembly and assembly Ribonucleoprotein complexbiogenesis RNA processing RNA splicing DNA recombination Regulation ofcellcycl Cell cyclephas Cell cycleproces M phas Mitosi Cell cyclecheckpoint Response toDNAdamage Cellular biosyntheticproces –0.6 Pearson to κ 2 s

B tumors relative to to relative tumors B –0.4 e ). Notably,). 1 cluster –0.2 e s κ 0 cis B

e 0.2 α ncRNA 0.4 cells clus s 0.6 0.8 κ κ 1.0 B B 3 α α 1 ------.

were also examined on our tiling array and calculated pairwise pairwise calculated and array tiling our on examined also were that samples 17 on arrays expression mRNA, whole-genome nearest conducted we the of expression the with correlated typically are lncRNAs gene-proximal whether To gene. determine coding nearby of including lncRNAs, Multiple and A gene Gene sets Ontology enriched are listed. Representative association. with the associated gene lncRNA set. A no black entry indicates significant set. A that blue the entry indicates gene Gene set Ontology is negatively with gene Gene the set Ontology associated gene lncRNA is positively ( gene sets 17 Processes across (rows) samples Biological Gene Ontology map Module of response. damage gene lncRNA versus sets (columns) in groups are for lncRNAs the enriched functional cell cycle and in DNA on the same locus 108 across ( samples. each between of the correlations 216 and ncRNAs transcripts nearby of Histogram Pearson transcripts. lncRNA with neighboring correlation 108 across ( samples. each between of the correlations 216 Pearson and ncRNAs the do patterns with not those of correlate the mRNAs in 3 Figure damage–related gene sets (UVA/UVB), several oncogenic signatures signatures oncogenic (UVA/UVB), gene sets several damage–related nega receptor tive, and up), negative BRCA DNA-prognosis BRCA1 overexpressed estrogen (BRCA sets gene cancer breast prognosis Cell or poor Brentani several included Cycle modules enriched addition, In Cell KEGG). Cycle example, (for sets cell-cycle–related collection) c2 (MSigDB Database Signatures Molecular from the states and clinical biological as well as pathways signaling and metabolic curated of sets with gene map module similar a constructed We cycle. cell the to related strongly still are lncRNAs these of patterns expression the gene, protein-coding neighboring their to expression in correlation to limited DNAhaving despite and Thus, response damage. splicing, RNA assembly, and biogenesis complex ribonucleoprotein bination, associated with biological processes including cell cycle, DNA recom ( terms Ontology Gene distinct with iden associated to are that lncRNAs tify biclustering performed and set gene Processes Ontology Gene Biological the versus set gene co-expression lncRNA each of 3 Fig. across the 17 samples ( that lncRNA with mRNAs correlated or that negatively are positively For each lncRNA, we a gene set defined co-expression as the group of approach a guilt-by-association using may regulate they that ways mRNA the relate with in ( transcript same the exons of are and/or other each regulate positively regulated, dinately coor be may ncRNAs neighboring positive, that was hypothesis our locus supporting same the of ncRNAs two between correlation ( 5 3 neighboring and lncRNAs of analysis (qRT-PCR) RT-PCR in function not ( expression mRNA may nearby or repress activate lncRNAs the of most mRNA, that protein-coding suggesting nearby the between and lncRNAs 216 anti-correlation the or of most correlation significant no was there Notably, wide. genome mRNA every versus lncRNA promoter cell- cycle each of patterns expression the between correlations Pearson P Fig. Supplementary Fig. 2 Fig. Supplementary ′

mRNAs in 34 additional samples confirmed these findings findings these confirmed samples additional 34 in mRNAs < rate A 0.05, <0.05). false that discovery the entry yellow indicates gene Given that expression of the 216 ncRNAs does not generally cor generally not does ncRNAs 216 the of expression that Given CCND1

biological

3 ). We then constructed a gene module map a module ). gene We constructed then c

). This analysis revealed multiple sets of lncRNAs that are are that lncRNAs of sets multiple revealed analysis This ). co-expression Functional associations of ( ncRNAs. associations Functional , have been shown to regulate the transcription of the the of transcription the regulate to shown been have , 3 3 a . This module map confirmed the enrichment for for enrichment the confirmed map module This . functions DVANCE ONLINE PUBLICATION ONLINE DVANCE b ) lncRNA expression patterns have a ) patterns expression positive lncRNA R > 0.5 or

map Fig. 3 Fig. ). In contrast, we found that the median median the that found we contrast, In ). cis , we further explored the genes and path genes the explored , we further

infers b R ). p15AS < 0.5, respectively) (

trans c ) Genes co-expressed with ) Genes co-expressed and the lncRNA upstream upstream lncRNA the and

a regulatory ) lncRNA expression ) expression lncRNA Fig. 3 Fig.

cis 3 2 Nature Ge Nature of the association of association the . Histogram . of Histogram a Supplementary ). Quantitative Quantitative ).

mechanisms cis mRNA n etics cis ′ and and to to 1 4 ------.

© 2011 Nature America, Inc. All rights reserved. with DNA damage response pathways ( pathways response damage DNA with progression. cancer and tion differentia cell of states different in expressed differentially are and cycle cell the in expression periodic have lncRNAs these of majority upst: normal to relative upst: including others, cancers whereas tissues, mammary breast metastatic the in repressed are upst: the from cancers breast ( tissues mammary normal metastatic five the distinguished readily tissues mammary normal five and cancers breast metastatic five in patterns expression lncRNA of clustering unsupervised addition, In ( differentiation during major regulated a are IncRNAs that these showed of ity 152 and 76 days at pancreas fetal and ESCs ( cycle cell the during pattern of phase cycle cell 74% peak of expression the lncRNAs with periodic the by confirmed human stimulation synchronized serum fibroblasts ( cycle cell the of phases ent at differ peaking expression periodic have IncRNA the of most that these of chronized in cell cycle progression by measure double thymidine block syn showed cells HeLa quantitative in Expression more conditions. different a across lncRNAs obtain to intronic) 7 and upstream (53 regions transcribed new 216 the of 60 for assays PCR qRT- designed we associations, functional inferred To these validate differentiation, Validation cell, stem (hematopoietic sets ( gene cell) stem neural stem-cell and MYC) (RAS, Nature Ge Nature 3 Fig. c a Our Our co-expression maps predicted associations of IncRNAsseveral Based on expression pattern of protein-coding genes M/G1 G2/M G1/S CCNL1 CDKN2B G2

). In support of this finding, doxorubicin-treated human human doxorubicin-treated finding, this of support In ). 60 ncRNAs 2 0 ESC n

Based onflowcytometry of G S :−2,767 and int: Hours postdouble etics thymidine block

6 4 :−2,817 and int: and :−2,817 ncRNA Fetal pancreas –2.0 d76 Fold change

M 2 cancer Supplementary Fig. 4 Fig. Supplementary

1 8 d152 ADVANCE ONLINE PUBLICATION ONLINE ADVANCE 1 0 +2.0

G1 expression 1 2

int: upst: upst: upst: upst: int: upst: upst: and 4 S CCNG1 ARF Fig. 4 Fig. CDKN1A CDKN1 CDK9 CCNL CCNG2 CDKN1 CDKN1 Fig. 4 Fig.

:+4530 DNA Fig. 4 Fig. upst: RAD2 int: upst: CDC2 CCNF E2F1 ARF :+38 :–9782 1 CCNG :–2767 C :–2953 d A A CDKN1 CCNL :–1017 1 0 : –9569 : –4845 ). Some of the lncRNAs, including including lncRNAs, of the Some ). 1

a 1 in

:+4,517, are induced. Thus, the the Thus, induced. are :+4,517, :+381 1 b ) damage :–2767 :+885 ( A –2.0 3 :–4845 ). Next, comparison of human human of comparison Next, ).

4 Fold change cell . Parallel analysis in primary primary in analysis Parallel . Noncoding Protein-coding Fig. 3 Fig. ). d

cycle, Supplementary Table 3

60 ncRNAs +2.0

response c Normal breas and and b Unassigned

ESC CDKN1A G2/M M/G1 G1/S

Supplementary Supplementary

t 2 0 Metastatic ducta Fig. 4 Fig. :−4,845, :−4,845, carcinom –8. 6 4 serum readdition Fold change + 0 Hours pos c ). ). ), 1 8 a - - - -

8. l 1 2 0 t upst: upst: int upst: int upst: upst: 13 exons). Rapid amplification of the 5 of the amplification Rapid exons). 13 to compared ­potential of score CSF the cross with by caused not hybridization was array tiling the by detected expression in change the that confirming change, significantly not did expression that of conserved pseudogene predicted computationally evolutionarily is and tags ( sequence expressed tated the analysis. ther the PANDA at lncRNAs these of CDKN1A relevance functional the investigate To PANDA the from DNA the in response damage suggest that the lncRNAs in the ( Upst: than magnitude higher substantially of upstream IncRNAs were previously on detected the tiling array ( surrounding the induced or repressed by doxorubicin, and all five of markedly the either tested were lncRNAs lncRNAs the of Most course. time 24-h a over tive RT-PCR in human fetal lung fibroblasts treated with doxorubicin upst: as well (as pathway damage DNA the in predicted lncRNAs 60 of changes expression the measured we to DNA lncRNAs responsive damage, may be these whether confirm 2 6 Fig. 5 Supplementary Fig. 5 Fig. Supplementary : : AR CCNG 2 0 CCNG CDK CCNL1 CDKN1C CDKN1A F CDKN1A PANDA :+4530 4 CDKN1A 1 9 c :+381 :–9782 2 ). These variations in expression patterns within the same locus :–2767 :–2953 (P21 associated ncRNA DNA damage activated), for fur for activated), damage DNA ncRNA associated (P21 : :–1017 : –4845

E2F1 CDC20 CCNF upst: int RAD21 upst: locus, we selected upst: we selected locus, a : CCNG

CDKN1A CCNL1 long was induced specifically by DNA damage, whereas 1 S, onie wt a lse o peiul anno previously of cluster a with coincides TSS, :+381 PANDA :−4,845 is induced up to 40-fold upon DNA damage damage DNA upon 40-fold to up induced is :−4,845 :–2767 :–4845

CDKN1A ncRNA PANDA LAP3 CDKN1A is located approximately 5 kb upstream of of upstream kb 5 approximately located is LAP3 5 located were ncRNAs 12 those of Notably, 2 upst: gene ( qRT-PCR by on and ncRNAs array 216 tiling the the of 12 in change two fold least at showed fibroblasts lung fetal samples. cancer to compared breast epithelium breast normal ( human to ESCs D, compared fetal pancreas. day. ( (orange). in the with expression known periodic cell cycle of cell and sorting genes activated expression by cycle were phases confirmed fluorescence- in HeLa cells ( cell cycle progression synchronized during (blue) ( expression. for samples lncRNA biological independent TaqMan and probes used them to interrogate and Wehuman cancers. custom generated ESC differentiation cell cycle progression, 4 Figure doxorubicin-treated fibroblasts. In order to to order In fibroblasts. doxorubicin-treated in only one expressed of the 108 the samples, was it because lncRNAs 216 the in included by was induced but also doxorubicin was not the at ( bicin CDKN1A ). Although the the Although ). d

′ involved ) Differential expression of in lncRNAs expression ) Differential of the TSS of the canonical p53 target target p53 canonical the of TSS the of TSS were induced, including the three that ( , 9.3, indicated very low protein-coding protein-coding low very indicated 9.3, , upeetr Fg 6 Fig. Supplementary (with a CSF range of 117–1,343 for its its for 117–1,343 of range CSF a (with CDKN1A CDKN1A CDKN1A Fig. 5 Fig. CDKN1A

are induced more rapidly and with with and rapidly more induced are Validated expression Validated of in ncRNAs expression CDKN1A c mRNA, were induced by doxoru by induced were mRNA, ) Regulated expression of in ) lncRNAs expression Regulated

a in , a b a CDKN1A ). In addition, a third lncRNA lncRNA third a addition, In ).

) and foreskin fibroblasts ) ( and fibroblasts foreskin ) Periodic expression of ) lncRNAs expression Periodic the :−4,845), and, similar to the the to similar and, :−4,845), ′ CDKN1A CDKN1A and 3 and (upst: locus, upst: locus, locus may play distinct roles PANDA

DNA-damage :−4,845, hereafter termed termed hereafter :−4,845, LAP3 Fig. Fig. 5 ′ complementary DNA complementary CDKN1A upon DNA damage. damage. DNA upon protein, p21. protein, :−800) by quantita by :−800) , qRT-PCR showed showed qRT-PCR , b locus intersects a intersects locus s e l c i t r A ). ). Notably, several ). Furthermore, Furthermore, ). CDKN1A :−1,210 and and :−1,210

response Fig. Fig.

b :−800, :−800, LAP3 ). ). Cell 2 ). ).  - - - - -

© 2011 Nature America, Inc. All rights reserved. in metastatic ductal carcinomas but not in normal breast tissue tissue breast normal in not but carcinomas ductal metastatic in of induction induce to syndrome ability the Li-Fraumeni in induce to ability the abrogated observed mutant, (p.Arg273His) damage-inducible DNA ( PANDA of expression mutant—restored (p.Val272Cys) p53 carc 9a Fig. sequencing DNA in 2–11 exon by mutation determined as inactivating harboring tumors breast primary p21. of transcript on effect on ( of induction the damage inhibited DNA before p53 of knockdown co-regulation. siRNA-mediated p53 of Indeed, possibility the supports which site, binding site CDKN1A binding p53 the of the upstream confirmed immediately analysis regulates ChIP-chip also p53 expression. whether asked we response, damage to antisense scribed 5 a is LAP3 from transcribed divergently is that transcript a 1.5-kb and revealed RNA (RACE) ends blot analysis s e l c i t r A  ( Supplementary Fig. 9b Fig. Supplementary Fig. 5 Fig. e ChIP-chip a b Because p53 is a positive regulator of of regulator positive a is p53 Because +Dox Fold siRNA/siCtrl –Dox ∆ 60 ncRNAs RNA RNA CDKN1A 0.2 0.4 0.6 0.8 1.0 1.2 1.4 i 0 noma cells by wild-type p53—but not the loss-of-function loss-of-function the not p53—but wild-type by cells noma 4 2 ′ Hours post-dox P53 PANDA pseudogene ( -capped and polyadenylated non-spliced lncRNA that is tran is that lncRNA non-spliced polyadenylated and -capped siCTRL e 100 100 ). Further, complementation of p53-null H1299 lung lung H1299 p53-null of complementation Further, ). and and 0 2 0 0 PANDA 1 8 are diametrically situated 2.5 kb inte from this situated are diametrically

siTP53 2 6 . In contrast, RNA interference of of interference RNA contrast, In . Supplementary Fig. 8 Fig. Supplementary level shows a trend of lower expression in human human in expression lower of trend a shows level 2 0 CDKN1A * –9,569 PANDA

siCDKN1A 4 expression, indicating that PANDAthat a is not linked indicating expression, –800 –9,56 –1,21 –4,84 +88 Fig. 5 5 CDKN1A 9 0 5 PANDA c without concordant concordant without Fold RNA expression nor is PANDA expression dependent on on dependent expression PANDA is nor ∆ –5,830 d 10 15 20 25 30 35 40 Fig. 5 Fig. 0 5 ). and LAP3 CDKN1A PANDA 8 0 PANDA CDKN1A –4,845 Hours post-dox CDKN1A ( . Supplementary Supplementary Fig. 7 f Fig. 5 Fig. CDKN1A ). Notably, a gain-of-function p53 p53 gain-of-function a Notably, ). CDKN1A predicted 16 RAC ( PANDA –4,845 by 70% 24 h after DNA damage damage DNA after h 24 70% by 20 E Promoter ), which is similar to its effect effect its to similar is which ), f f ). We also observed selective selective observed We also ). 24

Do Normalized expression TSS ( TSS 10 15 10 14 5 2 6 ( ) x CDKN1A , antisense of the predicted predicted the of antisense , PANDA 28 but selectively preserved preserved selectively but PANDA CDKN1 Empty vector + – + – + – + – +885 CDKN1 –9,569 –800 –1,210 –2,230 Fig. 5 Fig. –1,902 CDKN1A

) –1,210 3 A A 2,000- d 6 ( CDKN1A p53 Exon1 500- –800 a during the DNA the during Supplementary Supplementary ) ). ). Thus, 3 5 . . V272C PANDA expression expression r p53

1k +885 vening p53 vening

b +1,420 PANDA had no no had PANDA PANDA R273H p53 (LF) TP53 and 3 7 - ,

the other hand, expression of neither neither of expression hand, other the ( also See (FL3). fibroblasts human normal CDKN1A the from transcribed lncRNAs of A cluster treatment. doxorubicin of course h time a 24 across repression or induction coordinate shows lncRNAs of also array. See tiling upst: the of upstream immediately site binding p53 the confirmed input to relative signal ChIP-chip p53 the bottom, the At point. time single one in transcripts DNA-damage–inducible all observe not did we that Note fibroblasts. lung fetal human ng/ml) (200 treated (dox) h doxorubicin 24 in or control the in intensity hybridization RNA of examples are tracks two middle the In ( 5 Figure primary human lung fibroblasts specifically retrieved endogenous endogenous retrieved specifically fibroblasts lung human primary interaction. Immunoprecipitation of NF-YA from doxorubicin-treated (refs. LSD1 or EZH2 as lncRNAs, such other bind can that complexes modification matin induced by DNA damage ( specifically retrieved NF-YA from cellular lysates of human fibroblasts 1.2-kb a not but RNA,PANDA transcribed chromatography RNA function. NF-YA NF-YA factor scription tran the for site binding the by genes target p53 other from guished apoptosis. delimits of induction CDKN1A p53-mediated with parallel in that suggest data these DNA-damage–induced to fibroblasts sitized not shown; scripts within the cells ( marker of apoptosis, revealed PARP cleavage only in ( down resulted in fivefold to sevenfold increased TUNEL-positive cells contrast, In TUNEL. by measured as apoptosis to no little had siRNA control to exposed cells FL3 in treatment bicin apoptosis not but arrest p21 of independently acts that effector P53 by affected tors of apoptosis, including ( PANDA death’ ‘cell terms Ontology ( ‘apoptosis’ and Gene 0.04) the as such apoptosis, in PANDA by twofold least at repressed were genes 193 and induced were genes 224 that showed ( knockdown the lowing siRNAs targeting custom with (FL3) fibroblasts fetal human We transduced response. selectively retains the ability to induce PANDA. induce to ability the retains selectively (p.Arg273His), p53 allele, Li-Fraumeni a gain-of-function whereas induction, restore to fails mutant loss-of-function (p.Val272Cys) p53 The of induction damage DNA restores cells H1299 student’s by determined Mean of induction Doxorubicin a d Fig. ) At the top is a map of all detected transcripts at the CDKN1A promoter. CDKN1A at the transcripts ) is a top At detected of the all map ) RNA blot of of blot ) RNA Next, we addressed whether whether addressed we Next, Core promoters of cell death genes downstream of p53 are distin are p53 of downstream genes death cell of promoters Core G1 p53-dependent triggers fibroblasts human in damage DNA CDKN1A CDKN1A

6e ± Fig. Fig. 6 s.d. are shown; * shown; are s.d.

, depletion induced several genes encoding canonical activa canonical encoding genes several induced depletion knockdown were significantly enriched for those involved involved those for enriched significantly were knockdown locus over a 24 h time course after doxorubicin treatment of treatment doxorubicin after course h time a 24 over locus f ncRNAs at the the at ncRNAs ). Immunoblot analysis of PARP, a caspase substrate and and substrate caspase PARP, a of analysis Immunoblot ). for cell cycle arrest, p53-mediated induction of of induction p53-mediated arrest, cycle cell for g Supplementary Fig. 10 Supplementary PANDA locus are induced. ( induced. are locus ). ). In contrast, six additional siRNAs targeting other tran :−4,845 closely matches the predicted transcript on the the on transcript predicted the matches closely :−4,845 PANDA CDKN1A a DVANCE ONLINE PUBLICATION ONLINE DVANCE CDKN1A PANDA PANDA S depletion ( depletion upplementary Figure 7 Figure upplementary confirms a transcript size of 1.5 kb. ( kb. 1.5 of size a transcript confirms 42,43), or p21, illustrating the specificity of the the of specificity the illustrating p21, or 42,43), P CDKN1A t P PANDA TSS after DNA damage. The RACE clone of clone RACE The damage. DNA after TSS 4 -test. ( -test. < 0.03) ( 0.03) < < 0.05 relative to siCTRL (control siRNA) siRNA) (control siCTRL to relative < 0.05 0 38 , and we reasoned that that reasoned we and , Fig. 7 Fig. 6 Fig. and then applied doxorubicin for 24 h fol doxorubicin applied and then , 3 promoter had no effect on apoptosis (data knockdown ( knockdown APAF1 9 . Consistent with this finding, doxoru finding, this with Consistent . f requires p53 but not CDKN1A. CDKN1A. not but p53 requires ) Expression of wild-type p53 in p53-null p53-null in p53 wild-type of ) Expression c locus are induced by DNA damage. damage. DNA by induced are locus a Fig. 6 Fig. ) Expression of transcripts from the the from transcripts of ) Expression a ). PANDA did not retrieve other chro ). Global gene expression analysis analysis expression gene Global ). Fig. 6 Fig. PANDA , , ). ). Thus, PANDA sen knockdown BIK d CDKN1A ), suggesting that that suggesting ), S b , , upplementary Figure 6 Figure upplementary FAS ). qRT-PCR confirmed that that confirmed qRT-PCR ). Fig. 6 Fig. . . ( 4 affects the DNA damage damage DNA the affects 1 using purified, purified, using CDKN1A b CDKN1A and ) Quantitative RT-PCR ) Quantitative LacZ

apoptosis. Altogether, Altogether, apoptosis. b

itself nor nor itself ). Genes induced by induced Genes ). LRDD PANDA Nature Ge Nature mRNA fragment, fragment, mRNA . PANDA and and PANDA ( e PANDA Fig. Fig. 6 PANDA ) may affect affect may TP53 -depleted

PANDA knock n vitro in . n

c etics ). ). On . was was

is a is P < ------

© 2011 Nature America, Inc. All rights reserved. are needed that not only identify ncRNAs but also provide insight insight provide also but ncRNAs identify only not that needed are and enhancers, promoters as gene well as from stand-alone produced intergenic be loci can RNAs noncoding long malian genomes is transcribed. In addition to small noncoding RNAs, Recent studies have revealed that a surprisingly large fraction of mam DISCUSSION apoptotic gene expression program ( PANDA mediatearrest,whereas to p21 NF-YA impedes of activation cally to mediate cell cycle arrest and survival. ated transcription at to chromatin. These data suggest that DNA damage activates p53-medi Thus, PANDA binding to NF-YA may evict or prevent NF-YA binding required in forpart cell death bytriggered loss of PANDA ( genes and apoptosis as measured by TUNEL, indicating that NF-YA is of NF-YA and PANDA substantially attenuated induction of apoptotic known as (also including PANDA NF-YAgenes, of target at increased substantially occupancy p53-responsivethe promoter of PANDA ( treatment. doxorubicin h of 24 after student’s by determined siCTRL to compared sample siPANDA bar, 20 Scale treatment. doxorubicin h of 28 after fibroblasts FL3 TP53 ( cells). FL3 (in treatment doxorubicin h of 28 after siRNA control to relative siPANDA with induction revealed ( cells. FL3 in treatment doxorubicin h of 24 after siRNA control to relative siPANDA * graphs. bar all in shown target specifically siRNAs Custom (FL3). fibroblasts human in doxorubicin 6 Figure Nature Ge Nature into their potential Usingfunction. biological an ultrahigh-resolution d a

Dox RNA (fold) Fold ∆ dox/ctrl Fold ∆ dox/ctrl 1.0 2.0 3.0 4.0 5.0 in FL3 cells revealed no reduction in expression with siPANDA relative to control siRNA. ( siRNA. control to relative siPANDA with expression in reduction no revealed cells FL3 in 0.5 1.0 1.5 2.0 2.5 3.0 10 12 14 16 18 2 4 6 8 0 0

PANDA Fig. 7 – – CCNB1 Ctrl Ctrl CDKN1A n Ctrl siPANDA TP53 etics A siPANDA A + b NOXA lncRNA regulates the apoptotic response to DNA damage. ( damage. DNA to response apoptotic the regulates lncRNA

). NF-YA is a nuclear transcription factor that activates B B , ,

FAS C C ADVANCE ONLINE PUBLICATION ONLINE ADVANCE CDKN1A B A ) ) ( * , , – + P Fig. 7 BBC3 < 0.05 compared to siCTRL for all panels determined by Student’s Student’s by determined panels all for siCTRL to compared < 0.05 LAP3 PANDA siPANDA e c (also known as as known (also and Merge TUNEL DAPI ). ). Moreover, concomitant knockdown * FAS – + PANDA upon DNA damage Fig. siCTRL C * 8 + ). CDKN1A thatsynergisti functions 14 PUMA , 15 b , 1 7 siCtrl . New approaches mRNA produces

) and and ) 4 0 . Depletion . Depletion t Fig. 7d siPANDA -test. ( -test. siPANDA B A PMAIP1 µ m. ( m. g , ) Protein blot of PARP cleavage in control and PANDA siRNA FL3 fibroblasts fibroblasts FL3 PANDA siRNA and control in PARP of cleavage blot ) Protein e f ) Quantification of three independent TUNEL assays. assays. TUNEL independent three of ) Quantification ). ). - - -

PANDA RNA processing Programmed celldeath Apoptosis assembly Protein-DNA complex Nucleosome assembly DNA packaging a ) siRNA knockdown of PANDA in the presence of DNA damage with with damage DNA of presence PANDA the of in knockdown ) siRNA In addition, the ncRNA product may modulate neighboring gene gene neighboring in expression modulate may product ncRNA the addition, In interference). transcriptional called (so binding with factor transcription compete or sequences regulatory over chromatin compacted open may transcription of act The ways. of variety a in activity gene oncogenic perturbation. defined with cells of those to states promoter their of similarity the authentictor p63. humanLikewise, tumorson based can be classified MYC and epithelial progenitors depleted of the differentiation regula by induced tumors ESCs, among states promoter of similarity a fied promoter states among distinct perturbations. For instance, we identi in these diverse similarities samples cell cycle unexpected highlighted amongst profiles lncRNA of comparison Second, sample. biological anyone in transcribed are these of 73 only average, on identified, we regions transcribed noncoding new 216 the Of surveyed. were condi tions few a only if missed been have may and conditions specific highly in only induced are transcripts noncoding the of many First, rapid and economical fashion is advantageous a for at in least two reasons. samples biological diverse and numerous interrogate to ability The perturbations. diverse represent that samples 108 in promoters tiling array, we interrogated the transcriptional landscape at cell-cycle Noncoding transcription through regulatory elements may affect affect may elements regulatory through transcription Noncoding with no discernable effect on the the on effect discernable no with f ( ( P c P =0.01) ) Quantitative RT-PCR of canonical apoptosis pathway genes genes pathway apoptosis RT-PCR canonical of ) Quantitative % TUNEL positive =0.03) 10 15 20 25 30 35 0 5 ( ( P P =0.0001) =0.03) –2.0 cis

Ctrl Fold change ( ( t P P e -test. ( -test. 21 =0.01) =0.04) ) TUNEL immunofluorescence of control and siPANDA siPANDA and control of immunofluorescence ) TUNEL * , 4

siPANDA A 4 , affect distantly located genes in in genes located distantly affect , +2.0 * b B ) Heat map of gene expression changes with with changes expression gene of map ) Heat

C * c d

) Quantitative RT-PCR of ) Quantitative Fold ∆ dox/ctrl Fold ∆ dox/ctrl 10 15 20 25 0 2 4 6 8 0 5 LAP3 g Tubulin

Parp Ctrl Ctrl APAF1 siPANDA siPANDA * * A FAS mRNA. Mean Mean mRNA. A * * Ctrl B B P * *

< 0.05 for each each for < 0.05 C C B A siPANDA s e l c i t r A Fold ∆ dox/ctrl Fold ∆ dox/ctrl 15 25 35 45 10 30 50 70 0 5 0 trans CDKN1A ± s.d. are are s.d. C Ctrl Ctrl 2 LRDD siPANDA siPANDA 2 * * BIK or even even or A A 85 kDa 116 kDa and and * B * B * C C *  - - - © 2011 Nature America, Inc. All rights reserved. cell survival by impeding the apoptotic gene expression program. program. of the ability the by tumors: exploited apparently be can expression linkage This gene apoptotic the impeding by survival cell arrest, cycle cell mediate to inhibitor cycle cell a Whereas promoters. gene target occupying NF-YA from factor transcription the sequestering by genes apoptotic of parallel. in occurs p53 by induction their that indicating damage, DNA to response other’s the on effect of depletion contrast, In show that than to antisense transcribed named we which lncRNAs, these of One damage. DNA by induced are itself, mRNA TSS. At the the of downstream kb 2 and upstream kb 10 within lncRNAs four of average an and lncRNA detected one least at have loci gene coding pathways same ( biological the in function that transcripts multiple for code that units genomic to modulate the chromatin state promoters.of cell-cycle promoterscycle may a be form ofanticipation regulatory or feedback in gene regulation in tiation. These findings suggest that cell-cycle ncRNAs may participate cancers, andhuman are in regulated and in cycle response cell to the DNA in damage expressed or ESC differendifferentially these are ncRNAs of many that validated further analysis qRT-PCR prognosis. cancer with associated been have and differentiation, cell stem and DNA cycle, pathways, cell including damage response biological cific (and presumably co-regulated) with promoter ncRNAs function inco-expressed genes The spe genome. the in elsewhere located genes with mRNA neighboring but their with are rather correlated in expression ncRNAs are norpositively neither negatively correlated in expression expression profiles across our samples. We found that most promoter els of ncRNAs and cognate mRNAs, we compared ncRNA and mRNA these different mechanisms predict distinct relationships between lev as serve a target for regulation by small regulatory RNAs in as is panel this for legend The shown. is see ( the in region downstream of promoters on sites NF-YA target known at ChIP-qPCR siPANDA. or siCTRL with nucleofected ( bottom. the at shown as NF-YA, of immunoprecipitation confirms qRT-PCR.by Immunoblot measured as retrieves specifically lysates FL3 doxorubicin-treated NF-YA from ( analysis. immunoblot by retrieved the of 7 Figure s e l c i t r A  P53 mutant p.Arg273His to gain-of-function Li-Fraumeni selectively d ) Concomitant knockdown of NF-YA attenuates induction of apoptotic genes by PANDA depletion, as measured by qRT-PCR. For knockdown efficiency efficiency qRT-PCR.by knockdown For measured as PANDA by depletion, genes apoptotic of induction NF-YA of attenuates knockdown ) Concomitant PANDA Our results suggest that the is organized into into organized is genome human the that suggest results Our a S upplementary Figure 11 Figure upplementary CDKN1A Input

PANDA from doxorubicin-treated FL3 cell lysates. We visualized We visualized lysates. cell FL3 doxorubicin-treated from PANDA

CDKN1A Beads . Loss-of-function and complementation experiments experiments and complementation . Loss-of-function regulates transcription factor NF-YA. ( NF-YA. factor transcription regulates LacZ induction during DNA damage is p53 dependent. DNA is dependent. during p53 damage induction PANDA CCNB1 trans promoter, five lncRNAs, similar to the PANDA FAS CDKN1A . In addition, noncoding transcription of cell- LSD1 EZH2 p21 NFYA CDKN1A Fig. Fig. , , , is a non-spliced 1.5-kb ncRNA that is is that ncRNA 1.5-kb non-spliced a is , . . ( promoter lacking the NF-YA motif. Mean Mean NF-YA motif. the lacking promoter FAS e ) Concomitant knockdown of NF-YA rescues apoptosis induced by PANDA depletion. Quantification of TUNEL staining staining TUNEL of Quantification PANDA by depletion. induced apoptosis NF-YA of rescues knockdown ) Concomitant c WB:NF-YA 8 ) ChIP of NF-YA in FL3 fibroblasts fibroblasts NF-YA FL3 of in ) ChIP , , ). ). Forty nine of protein- 56 cell-cycle NOXA and is induced with faster kinetics kinetics and faster with is induced or depletion of of depletion or b

PANDA Retrieved RNA (% input) , , 20 10 15 BBC3 0 5 d . Input GAPDH α IgG b -NF-YA ( inhibits the expression expression the inhibits ) Immunoprecipitation of ) Immunoprecipitation PUMA IgG CDKN1A PANDA a PANDA ) or a control a control ) or ) RNA chromatography chromatography ) RNA

PANDA n.d. NFYA 45 , 4 promotes promotes 6 CDKN1A . Because . Because encodes encodes PANDA c had no no had

ChIP/input (%) 10 12

0 2 4 6 8

± siCTRL

s.d. are shown in all bar graphs. * graphs. bar all in shown are s.d. ,

- - - IgG

CCNB1

NF-YA siPANDA mediates gene silencing through recruitment of hnRPK. of recruitment through silencing gene mediates linc-p21 and NF-YA, through apoptosis PANDA blocks arrest, cycle cell mediates CDKN1A linc-p21. PANDA and transcripts noncoding as well the at binding p53 damage, DNA After response. damage DNA the coordinating 8 Figure sequences’ ‘genes’ ‘regulatory and between boundary blurring the shows that example vivid a provide findings ( response DNA damage the regulate to cooperate that transcripts noncoding and coding multiple of expression the drives repression of upstream kb 15 lincRNA-p21 located RNA noncoding susceptibility. cancer human for explanations new like lncRNAs Thus, carcinomas. ductal static in meta observed was which similarly from arrest, cycle cell survival retain *

gene repression IgG

p53-mediated Intriguingly, a recent study identified a distinct long intergenic intergenic long distinct a identified study recent a Intriguingly,

d NF-YA linc-p21 Relative expression CDKN1A 11 PANDA 0.1 0.2 0.3 0.4 0.5 0.6 1 3 5 7 9

0 Model of coding and noncoding transcripts at the the at transcripts noncoding and coding of Model APAF1 hnRPK 2 siCTRL 4 IgG . Thus, the regulatory sequence upstream of of upstream sequence regulatory the Thus, . , that is induced by p53 and mediates p53-dependent gene siPANDA siCTRL FAS locus coordinately activates transcription of of transcription activates coordinately locus induction instead of instead induction p53 NF-YA siPANDA a *

DVANCE ONLINE PUBLICATION ONLINE DVANCE IgG BIK NF-YA 0 1 2 3 4 5 siNFYA siPANDA+siNFYA LRDD

siCTRL IgG P NOXA < 0.05 determined by Student’s Student’s by determined < 0.05

Apoptosi NF-YA PANDA siPANDA NOXA NF-YA IgG * NF-YA s CDKN1A 0.2 0.4 0.6 0.8 PUM 0 A siCTRL IgG p53 PUM 4

7 NF-YA . siPANDA in effect uncouples cell uncouples in effect * e IgG A

PANDA % TUNEL positive

10 15 20 25

0 5 NF-YA CDKN1A Nature Ge Nature Cell cyclearrest 0.1 0.2 0.3 0.4 0.5 0.6 2 1 0 CDKN1A Fig. Fig. CDKN1 may provide provide may CDKN1A p21 siCTRL IgG Control t CDKN1A 8 * -test. -test. 4 3 , named named , NF-YA ). These These ). A siPANDA n locus locus etics IgG as

NF-YA -

© 2011 Nature America, Inc. All rights reserved. 9. 8. 7. 6. 5. 4. 3. 2. 1. reprints/index.html. http://www.nature.com/ at online available is information permissions and Reprints Published online at http://www.nature.com/naturegenetics/. The authors declare no competing interests.financial T.H. and D.J.W. with input from all co-authors. M.v.d.V., A.-L.B.-D., S.S., M.L.W. and Y.X. The manuscript was prepared by H.Y.C., samples or reagents: A.K.K., Y.K., G.D.G., H.M.H., N.S., C.U., P.W., A.L., S.K.K., M.F.L. and M.K. contributed CSF analysis. The following authors contributed Yulei Wang, Yu Wang and B.K. conducted high-throughput TaqMan RT-PCRs. experiments. T.H. performed the experiments and the computational analysis. H.Y.C. and D.J.W. initiated the project. H.Y.C., D.J.W. and T.H. the designed (DoD) National Defense Science & Engineering Graduate Fellowship Foundation(NDSEG). (NSF) Graduate Research Fellowship and the DepartmentInstitute. of Defense T.H. is supported by the Stanford Graduate SocietyFellowship, (H.Y.C.). the NationalH.Y.C. Science is an Early Career Scientist ofDiabetes the HowardResearch HughesFoundation Medical (S.K.K. and H.Y.C.) and the AmericanInstitute Cancer (NCI) (R01-CA118750 to H.Y.C. and R01-CA130795 to andM.L.W.), Skin Diseases the(NIAMS) Juvenile (K08-AR054615 to D.J.W.), NIH/NationalNational CancerInstitutes of Health (NIH)/National Institute of employeesArthritis andof MusculoskeletalLife Technologies. This work was supported byreading grants of fromthe themanuscript US and P. Khavari for reagents.We Y.W., thank J. Rinn, B.K. M. andGuttman Yu and WangA. Regev forare discussions, L. Attardi for careful Note: Supplementary information is available on the been deposited with GenBank under the accession number JF803844. Expression Omnibus (GSE28631). Sequence for codes. Accessionhuman online the at http://www.nature.com/naturegenetics/. paper the of in ­version available are references associated any and Methods M URLs. and likely other ncRNAs emanating from regulatory sequences. medicine. Future studies are needed to pinpoint the functions of these of promoters, which may of be in use cancer andbiology regenerative expression provides a convenient means of tracking the chromatin state ers that may play diverse functions. At a minimum, promoter ncRNA Nature Ge Nature 12. 11. 10. COMPETING FINANCIAL INTERESTS COMPETING AUTH A ckn

ethods Our study provides an initial catalog of lncRNAs in cell-cycle promot

el, A.C. Seila, P.Kapranov, silencing. RNA in players key proteins: Argonaute M.J. Simard, & G. Hutvagner, regulation. gene in role big a with RNAs small MicroRNAs: G.J. Hannon, & L. He, Mattick, J.S. & Makunin, I.V. Small regulatory RNAs in mammals. Carninci,P. Non-coding RNAtranscription: turning onneighbours. G.A. Calin, Carninci, P. Bertone, P. Ponjavic, J., Ponting, C.P. & Lunter, G. Functionality or transcriptional noise? Evidence Core, L.J., Waterfall, J.J. & Lis, J.T. Nascent RNA sequencing reveals widespread R.J. Taft,pausing for selectionfornoncodingwithinlong RNAs. anddivergent initiation humanatpromoters. Genet. Nat. (2008). 1849–1851 transcription. pervasive Biol. Cell Mol. Rev. Nat. Genet. Rev. Nat. 14 (2008). 1023–1024 carcinomas. and leukemias (2005). 1559–1563 arrays. tiling Spec No 1, R121–R132 (2005). R121–R132 1, No Spec o O Genomica, Genomica, wl R R C e O n t al. et dgm et al. et al. NTRIBUTI

t al. et etics t al. et 41 et al. et Science , 572–578 (2009). 572–578 , The transcriptional landscape of the mammalian genome. iy Ns soitd ih rncito sat ie i animals. in sites start transcription with associated RNAs Tiny Global identification of human transcribed sequences with genome

e lrcnevd ein ecdn nRA ae lee i human in altered are ncRNAs encoding regions Ultraconserved 5 iig n mcora dt ae vial a Gene at available are data microarray and Tiling iegn tasrpin rm cie promoters. active from transcription Divergent RNA maps reveal new RNA classes and a possible function for function possible a and classes RNA new reveal maps RNA nts , 522–531 (2004). 522–531 ,

http://ge

306 ADVANCE ONLINE PUBLICATION ONLINE ADVANCE Science O

9 NS , 2242–2246 (2004). 2242–2246 , , 22–32 (2008). 22–32 , Cancer Cell Cancer nomica.weizmann.ac.i

316 , 1484–1488 (2007). 1484–1488 ,

12 GenomeRes. , 215–229 (2007). 215–229 , Science N

a 322 t u r

e , 1845–1848, (2008). 17

G

, 556–565(2007).,

e l n / e . PANDA

t i c Nat.Cell Biol. s Hum. Mol. Genet. website. Science Science RNA has

322 309

10 - , , ,

47. 46. 45. 44. 43. 42. 41. 40. 39. 38. 37. 36. 35. 34. 27. 26. 25. 24. 23. 22. 21. 20. 19. 18. 17. 16. 15. 14. 13. 33. 32. 31. 30. 29. 28.

Mattick, J.S. Challenging the dogma: the hidden layer of non-protein-coding RNAs non-protein-coding of layer hidden the dogma: the Challenging J.S. Mattick, J.C. Schwartz, RNA- for required is RNA Promoter-associated K.V. Morris, & D. Kim, J., Han, Kanhere, A. Tsai,M.C. A.M. Khalil, chromatography. RNA RNase-assisted p53 J.F. Caceres, the & G. of Michlewski, Regulation B.M. Emerson, & C.M. Murawsky, J.M., Morachis, a triggers damage DNA G.M. Wahl, & K. Clarkin, S.P., Linke, A., Leonardo, Di Taylor,G.R. A., Stark, W.R.Agarwal, & M.L., Agarwal, K.P.Olive, S. Geisler, Wei,C.L. Whitfield, M.L. ono, .. Wle, .. yln ad el yl checkpoints. cycle cell and Cyclins C.L. Walker, & D.G. Johnson, and kinases, cyclin-dependent cyclins, of alterations Genetic G. of Peters, & M. Hall, regulators negative and positive inhibitors: CDK J.M. Roberts, & C.J. Sherr, Huarte, M. R.A. Gupta, J.L. Rinn, Lee, J.T. Lessons from X- inactivation: long ncRNA as guides and tethers W.Yu, E. Pasmant, X. Wang, F. Santa, De T.K.Kim, S. Katayama, M. Guttman, II. polymerase RNA by initiation of fidelity the and Transcriptionalnoise K. Struhl, Subramanian, A. Subramanian, conditional showing map module A Regev,A. & Koller,D. N., Friedman, E., Segal, Wong, D.J. to method genomics comparative a PhyloCSF: M. Kellis, & I. Jungreis, M., Lin, software user-friendly T.EpiGRAPH: Lengauer, & J. Buch, K., Halachev, C., Bock, structure. chromatin of views H.Y.Genome-wide Chang, & O.J. Rando, Cdk inhibitors in human cancer. human in inhibitors Cdk progression. G1-phase response. p53 the in repression gene metastasis. cancer promote human epigenome. the to RNA. Res. Cancer ANRIL INK4 transcription. inhibit to enhancers. overlap Nature Science mammals. in RNAs non-coding large in complex organisms. complex in Biol. Mol. Struct. Nat. 104 cells. human in silencing gene transcriptional directed complex-2. repressive polycomb (2010). with interact and complexes. USA expression. gene affect and complexes chromatin-modifying (2010). 1673–1678 promoters. (2010). 135–147 core diverse structurally by response transcriptional normal in Cip1 of fibroblasts. induction human long-term and arrest G1 p53-dependent prolonged fibroblasts. human in arrest growth reversible mediates and checkpoints cycle cell G1 the and syndrome. cancer. Res. breast Cancer advanced locally in doxorubicin with treatment to response the on genome. tumors. inexpression their and cycle (2005). 15545–15550 profiles. expression genome-wide interpreting for cancer. in modules expression of activity cells. stem 2010). August (18 doi:10.1038/npre.2010.4784.1 online, Pharmacol. Toxicol.Pharmacol. Biol. Mol. Struct. Nat. itnus poencdn ad o-oig regions. non-coding and protein-coding distinguish (2009). data. (epi)genomic of prediction and analysis statistical for Biochem. , 12422–12427 (2007). 12422–12427 ,

/ 106 ARF Nature a atsne ocdn RA hs epeso ccutr with coclusters expression whose RNA noncoding antisense an ,

et al. et 465 HOX

309 Cell

, 11667–11672 (2009). 11667–11672 , et al. et et al. et locus, in a melanoma-neural system tumor family: identification of identification family: tumor system melanoma-neural a in locus, 78 et al. et et al. et et al. et Cell et al. et et al. et al. , 182–187 (2010). 182–187 , Proc. Natl. Acad. Sci. USA Sci. Acad. Natl. Proc. Cell Stem Cell Stem Cell Science

loci by noncoding RNAs. noncoding by loci t al. et et al. Epigenetic silencing of tumour suppressor gene suppressor tumour of silencing Epigenetic

, 1564–1566 (2005). 1564–1566 , 61 67 t al. et (2009). 245–271 , t al. et t al. et 451 et al. et 124 t al. et A global map of p53 transcription-factor binding sites in the human the in sites binding transcription-factor p53 of map global A

Induced ncRNAs allosterically modify RNA-binding proteins in cis in proteins RNA-binding modify allosterically ncRNAs Induced etal. t al. et t al. et Widespread transcription at neuronal activity-regulated enhancers. activity-regulated neuronal at transcription Widespread 119 (2001). 2505–2512 , , 3963–3969 (2007). 3963–3969 , Functional demarcation of active and silent chromatin domains in domains chromatin silent and active of demarcation Functional Long noncoding RNA as modular scaffold of histone modification histone of scaffold modular as RNA noncoding Long Module map of stem cell genes guides creation of epithelial cancer Mutant A large intergenic noncoding RNA induced by p53 mediates global Short RNAs are transcribed from repressed polycomb target genes et al. et Influence of Influence , 202–206 (2008). 202–206 , , 207–219 (2006). 207–219 , og o-oig RNA non-coding Long ay ua lre negnc ocdn RA ascae with associate RNAs noncoding intergenic large human Many hrceiain f gr-ie eein icuig h entire the including deletion, germ-line a of Characterization Chromatin signature reveals over a thousand highly conserved highly thousand a over reveals signature Chromatin Genes Dev. Genes

Genes Dev. Genes lre rcin f xrgnc N pl I rncito sites transcription II pol RNA extragenic of fraction large A PLoS Biol. PLoS

, 847–860 (2004). 847–860 , Identification of genes periodically expressed in the human cell 39 329 nies tasrps r tres o atvtn sal RNAs. small activating for targets are transcripts Antisense nies tasrpin n h mmain transcriptome. mammalian the in transcription Antisense Gene set enrichment analysis: a knowledge-based approach knowledge-based a analysis: enrichment set Gene

, 295–312 (1999). 295–312 , Genes Dev. Genes Bioessays 15 14 Nature , 689–693 (2010). 689–693 , p53 , 842–848 (2008). 842–848 , , 103–105 (2007). 103–105 ,

2 Nature , 333–344 (2008). 333–344 , gain of function in two mouse models of Li-Fraumeni of models mouse two in function of gain

8 23 8 454 , e1000384 (2010). e1000384 , , 2540–2551 (1994). 2540–2551 ,

TP53 Adv. Cancer Res. Cancer Adv. 25 , 1831–1842 (2009). 1831–1842 ,

13 , 126–130 (2008). 126–130 , 464 , 930–939 (2003). 930–939 , Mol. Biol. Cell Biol. Mol. Nature , 1501–1512 (1999). 1501–1512 , ee leain ad -rB2 expression c-erbB-2 and alterations gene Cell Cell

92 , 1071–1076 (2010). 1071–1076 , Nat. Genet. Nat.

, 8493–8497 (1995). 8493–8497 , HOTAIR 142 129

458 , 409–419 (2010). 409–419 , , 1311–1323 (2007). 1311–1323 , , 223–227 (2009). 223–227 , Proc. Natl. Acad. Sci. USA Sci. Acad. Natl. Proc.

ergas hoai sae to state chromatin reprograms 68

13 aue Precedings Nature

, 67–108 (1996). 67–108 , 36 p53 , 1977–2000 (2002).1977–2000 , Proc. Natl. Acad. Sci. USA Sci. Acad. Natl. Proc. o. Cell Mol. , 1090–1098 (2004). 1090–1098 , controls both the G2/M the both controls s e l c i t r A Genome Biol. Genome Proc. Natl. Acad. Sci. Acad. Natl. Proc. p15 by its antisense its by

ee Dev. Genes 38 675–688 , nu Rev. Annu. Annu. Rev. Annu. published

RNA 10 , R14 ,

ARF 102

16 24  , , , .

© 2011 Nature America, Inc. All rights reserved. which has also been applied to assess new transcribed regions in mouse in regions transcribed new assess to applied been also has which method (CSF) frequencies substitution codon the using genomes lian their in alignments signatures with orthologous regions in 20 evolutionary other sequenced placental the mamma evaluated we regions, transcribed new potential. protein-coding Measuring ‘genegene, dst)’; of mRNA’;nearest symbol ‘distance from TSS’. of protein–coding cell-cycle gene, int; downstream of proteincell-cycle coding protein-coding gene, upst; exon of cell-cycle cell-cycle protein–coding of gene, exon;TSS intron of (upstream location ‘genomic follows: as annotated were Transcripts analysis. further for considered were samples the of 10% of mum samples were clustered using Galaxy array 55 all from calls Peak units. transcriptional significant considered were NimbleGen) with a window size of 50. Peaks with a peak score greater than ten each array were subjected to peak calling using the NimbleScan program (Roche calling. Peak protocol. NimbleGen (RETROscript Kit, Ambion), labeled and hybridized according to the standard transcribed reverse Ambion), Kit, (MessageAmp amplified RNA was Briefly, the In addition, including those encoding cyclins, CDKs and CDKIs ( 10 kb upstream to 2 kb downstream of each TSS from 53 other cell-cycle genes, encompasses (which region 9p21 the of kb 25 across resolution 5-bp at designed was NimbleGen) hybridization. RNA and design array Tiling ONLINE Nature Ge Nature performs a performs comparison two between statistical empirical codon models Briefly, CSF regions. and noncoding in coding known substitutions of similar frequency relative the on based codon alignment, its all within observed considering substitutions genome the in region any for score a produces CSF doxorubicin (Sigma)-treated human fetal lung fibroblasts, polyA was selected was polyA human selected lung fetal fibroblasts, (Sigma)-treated doxorubicin the FirstChoice RLM-RACE Kit (Ambion). RNA was extracted from 200 ng/ml PANDA CO2. 5% in °C at 37 (Gibco) streptomycin penicillin- 1% and (Hyclone) FBS 10% in cultured were CRL2091) (ATCC fibroblasts foreskin or AG04393) (Coriell FL3 fibroblasts lung fetal Human methods. standard using isolation RNA for processed receipt, and of washed h minced, 24 within processed were pancreata fetal Staged Washington. of University Laboratory, Research Defects Birth the from obtained were ata described as are Program Autopsy Rapid University Hopkins Johns the from tumors breast metastatic and tissues breast normal Institute. Cancer Institute Cancer Netherlands The from Netherlands tumors breast Human primary and University Hopkins Johns University, Stanford of boards review institutional from approval obtained we and tion, cells. and samples Tissue testing ( hypothesis multiple for account to used was discovery false calculation a (FDR) and rate distribution, hypergeometric the by determined was Database Signatures ( c2 Molecular (MSigDB collection) the from states clinical and biological and sets gene ( Processes Biological Ontology Gene with sets gene of ncRNA map these module a generated then we associations, functional Todetermine ncRNA. that with 0.5 a than less or had than that greater was genes that correlation Pearson protein-coding the of sets gene defined then we ncRNA, each For URLs). (see Genomica program the using visualized and clustered map was This samples. 17 different across on expression based combinations pairwise all for correlations Pearson the by computing genes protein-coding map analysis. Module conservation sequence of level for overall the controlling while explanation is a better model protein-coding the whether quantifies that ratio a likelihood and reports regions, noncoding on based other the and regions coding known of alignments from estimated P Fig. 3 Fig. < 0.05, FDR < 0.05). < FDR 0.05, < c ) and with curated gene sets of metabolic and signaling pathways pathways signaling and metabolic of sets gene curated with and ) cloning analysis. and sequence

METHODS n Robust multichip average normalized single channel data from from data channel single normalized average multichip Robust etics HOXA CDKN2A We a map generated module of the ncRNAs the versus and and Supplementary Fig. 4 Supplementary Informed consent was obtained for tissue dona tissue for obtained was consent Informed HOXD 3 0 , , . P14ARF loci were placed on the array as a control. a control. as array on the placed were loci 2 , 4 8 , , and only transcripts present in a mini To assess the coding potential of the the of potential coding the Toassess 3 ′ and and and 5 A custom tiling array (Roche (Roche array tiling custom A ) 3 ′ CDKN2B 3 RACE was using performed . . The Supplementary Supplementary Table 1 2 3 . Human fetal pancre fetal Human . P value of enrichment ), as well as from from as well as ), 30 5 5 1 2 , , , one 49 and , 1 5 4 0 ). ). - - - - , .

as an endogenous control for normalization across different samples. different across normalization for control endogenous an as used PPIA was Technologies). (Life System PCR Real-Time Fast 7900HT the replicates were run for each gene for each sample in a format 384-well plate on ufacturer’s TaqMan PreAmp Master Mix Kit Protocol (Life Technologies). Two 14-cycle PCR amplification followed by real-time PCR reaction using the man to cDNA a The Transcription resulting was Kit subjected Technologies). (Life Reverse cDNA Capacity High the using RNA total of ng 50 from generated these ncRNAs across different conditions, complementary DNAs (cDNA) were of profiling expression For gene sequences. or genomic transcripts nontarget genome-wide and assays designed between reactivity cross potential predict to matrix alignment TaqMana all using position-specific were verified assays the using regions transcribed new ‘single-exon’ 219 design mode. The and transcript specificity genome of the specificity of 60 targeting developed was assays. ncRNA custom TaqMan to (Stratagene)). Green SYBR Mix RT-PCR Step Master normalized and Assays RT-PCR for Expression Strand-specific GAPDH. Gene TaqMan using Biosystems) (Applied Mix RT-PCR Master One-Step the using performed was RNA total was DNA genomic of ng 50–250 and using RT-PCR (Ambion). DNA-free TURBO (Qiagen), using eliminated Kit Mini RNeasy the and (Invitrogen) RT-PCR. protocol. manufacturer’s standard the to according performed was RLM-RACE and (Ambion) MAG kit Poly(A)Purist the using overnight. We washed 10 10 We washed overnight. 2 and 89879), (Pierce columns spin into micro were transferred r.p.m.for 15 at min 13,000 Supernatants trifuged cen and combined were lysates cytoplasmic and Nuclear H). setting power s (30 30 s on, off, Sonicator a Diagenode for 7 using min sonicated and pestle motorized a using s 5 for mM dounced 0.5 were lysates MgCl2, Nuclear PMSF). mM mM DTT,1 4 KCl, M 0.42 glycerol, 10% 7.5, pH HEPES mM (20 as C vortex frequent with min 20 for Buffer pellet of the to added A was volume Buffer of used was equal an Next, collected. was (cytosol) supernatant the and r.p.m., 2,000 at min 3 for centrifuged were Lysates ice. on min 5 for mM (10 A Buffer 15 min on ice at of 150 r.p.m. NP-40 of to 0.25% was concentration added a final volume 2× for with DTT, PMSF) mM mM 1 0.5 KCl, mM 10 MgCl2, mM 1.5 lysed 7.5, pH HEPES After were min. 5 cells for glycine washes, M PBS 0.125 of two addition the by formaldehyde followed 1% min, with 10 for crosslinked and trypsinized h, 16 for doxorubicin immunoprecipitation. RNA antifade reagent with DAPI (Invitrogen) and imaged at 20× magnification. 1 hour at 37 °C. Slides were then washed with PBS and mounted in Prolong methanolGold at −20 °C for 10 min and incubatedslides with(Lab-Tek), the treatedTUNEL with labeling200 ng/ml doxorubicinmixture (Sigma)for for 24 h, fixedKit, TMRwith Red (Roche). Human fetal lung fibroblasts were cultured on chamber TUNEL. ( 5 Table of ON-TARGETPlus siRNAs (Dharmacon) targeting RNA interference. H-209). Biotech NF-YAanti Cruz (Santa and Biotech) LSD1 Cruz (Santa p21 anti anti AC22), Signal (Cell EZH2 anti ab6046), (ab17721), (Abcam anti-B-tubulin using 9542), Signal performed were (Cell anti-PARP blots Protein ab28). (Abcam anti-p53 and ab9050) (Abcam anti-H3K35me3 ab8580), (Abcam anti-H3K4me3 assays: cipitation Antibodies. (Agilent). Kit Labeling length full with generated were Probes protocol. manufacturer’s standard the following (Ambion) Kit and PolyA Purist Mag (Ambion). RNA blot was performed using NorthernMax RNAblot. Supplementary Table 5 Supplementary ). Validated siRNAs for mRNAs were obtained from Ambion Ambion from obtained were mRNAs for siRNAs Validated ). TUNEL assays were performed using the Total RNA was extracted from cells using the TRIzol reagent reagent TRIzol the using cells from extracted was RNA Total We obtained 5 5 We obtained The following antibodies were used for chromatin immunopre chromatin for used were antibodies following The Human fetal lung fibroblasts were transfected with 50 with nM were Human transfected lung fibroblasts fetal ). µ µ PANDA l of Protein A/G UltraLink Resin (Pierce 53132) 53132) (Pierce Resin UltraLink A/G Protein of l g of polyA RNA using an RNeasy Kit (QIAGEN) (QIAGEN) Kit RNeasy an using RNA polyA of g Ten million cells were treated with 200 ng/ml ng/ml 200 with treated were cells Tenmillion A panel of TaqMan custom ncRNA assays assays ncRNA custom TaqMan of panel A using the Prime-It RmT Random Primer Primer RmTRandom Prime-It the using µ g of antibody was added and incubated incubated and added was g antibody of PANDA was performed using the One- the using was performed insitu PANDA CellDeath Detection doi:10.1038/ng848 ( Supplementary - - -

© 2011 Nature America, Inc. All rights reserved. for 15 min. The nuclear pellet was resuspended in 1 ml RIP buffer (150 mM mM (150 buffer RIP ml 1 in 2,500 resuspended at was pellet nuclear The centrifugation min. 15 for by pelleted were Nuclei min. 20 for ice on X-100) Triton 4% MgCl2; mM 20 7.5; pH Tris-HCl mM 40 sucrose; M (1.28 buffer 200 in incubated were h) (16 cells 2 and (Ambion) cocktail A/T1 RNase of 20 min for before conjugation to °C) RNAsebeads. digestion was with performed 5 (22–25 temperature room to shifted and (Ambion) buffer structure RNA °C (90 for was RNAfolded 2 with for ice reaction. min, 2 supplied min), RNA( of pmols 6 tions: raphy RNA RNAse chromatography. mediated Kit Mini (QIAGEN). digest DNAse RNEasy column on the the with (QIAGEN) using extracted was RNA of and ml 1 sample, the added We to °C. 45 TRIzol at h 1 for 5 incubated plus and PMSF) (Ambion) mM K 1 and Proteinase DTT mM 0.5 deoxycholate, sodium 0.5% 40, NP- 1% SDS, 0.1% EDTA, mM 1 NaCl, mM 150 7.4, pH Tris mM (50 RIPA 200 in resuspended were Beads PMSF). mM 1 and DTT mM 0.5 deoxycholate, sodium 0.5% NP-40, 1% SDS, 0.1% EDTA, mM 1 NaCl, M 1 times with RIP wash buffer and two times with 1 M RIPA (50 mM Tris pH 7.4, four washed were Samples °C. 4 at h 1 for reaction immunoprecipitation the glycerol, 10% 7.9, pH TrisHcl MgCl mM mM 5 KCl, mM (50 100 buffer wash RIP with times three doi:10.1038/ng848 Cellular lysates were prepared as follows: 10 million doxorubicin treated treated doxorubicin million 10 follows: as prepared were lysates Cellular 4 1 was performed as previously described with the following modifica following the with described previously as performed was PANDA 2 , 10 mM B-me and 0.1% NP-40) and added it to to it added and NP-40) 0.1% and B-me mM 10 , or a 1.2-kb fragment of of fragment 1.2-kb a or µ l PBS, 600 600 l PBS, µ RNAse mediated RNA RNAse mediated chromatog l of RNase V1 (Ambion). V1 RNase of l µ l H20 and 200 and 200 l H20 LacZ ) were used per per used were ) µ l nuclear lysis lysis l nuclear µ l 150 mM mM 150 l µ µ g - - l l

53. 52. 51. 50. 49. 48. CCAAT motif ( consensus NF-YA the surround to designed were BAX and Morachis from obtained were sites binding YA described ously (ChIP). immunoprecipitation Chromatin at 18,000 by centrifugation pelleted were debris nuclei were sheared using a motorized douncer for 5 s. Nuclear membrane and inhibitor (Roche Complete Protease Inhibitor Cocktail Tablets)). Resuspended KCl, 25 mM Tris pH 0.5 mM 7.4, DTT, 1 NP40, mM 0.5% PMSF and protease Supplementary Table 5 Supplementary

in JL, ode C, ldtn, .. Bon PO & hn, .. Anatomic H.Y. Chang, & P.O. Brown, H.B., Gladstone, C., Bondre, J.L., Rinn, M.J. Vijver, de van Kosiol, C., Holmes, I. & Goldman, N. An empirical codon model for protein sequence scalability and Performance M. Kellis, & M.D. M.F.,Rasmussen, Lin, A.N., Deoras, Lin, M.F. perform to galaxy Using A. Nekrutenko, & D. Blankenberg, I., Schenck, Taylor,J., demarcation by positional variation in fibroblast gene expression programs. expression gene fibroblast in variation Genet. PLoS positional by demarcation cancer. breast evolution. 12 in identification gene genomes. comparative for metrics discriminative of genomes. fly 12 using (2007). 10.5 Unit ag-cl itrcie aa analyses. data interactive large-scale et al. PLOS Comput. Biol. Comput. PLOS Mol. Biol. Evol. Biol. Mol.

2 5 Revisiting the protein-coding gene catalog of , e119 (2006). e119 , 3 N. Engl. J. Med. J. Engl. N. . qPCR primers for FAS and CCNB1 and FAS-control NF- FAS-control and CCNB1 FASand for primers qPCR . et al. et Genome Res. Genome ). A gene-expression signature as a predictor of survival in survival of predictor a as signature gene-expression A

24 , 1464–1479 (2007). 1464–1479 ,

4

347 , e1000067 (2008). e1000067 ,

, 1999–2009 (2002). 1999–2009 , 17 ur Poo. Bioinformatics Protoc. Curr. , 1823–1836 (2007). 1823–1836 , ChIP was performed as previ as performed was ChIP g for 10 min. 10 for et al. et 4 0 Drosophila melanogaster Nature Ge Nature Primers for PUMA PUMA for Primers hpe 10, Chapter Drosophila n etics -