Downloaded from rnajournal.cshlp.org on October 3, 2021 - Published by Cold Spring Harbor Laboratory Press
REPORT
FUS regulates genes coding for RNA-binding proteins in neurons by binding to their highly conserved introns
TADASHI NAKAYA,1 PANAGIOTIS ALEXIOU,1,3 MANOLIS MARAGKAKIS,1,3 ALEXANDRA CHANG,1 and ZISSIMOS MOURELATOS1,2,4 1Department of Pathology and Laboratory Medicine, Division of Neuropathology, Perelman School of Medicine, and 2PENN Genome Frontiers Institute, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
ABSTRACT Dominant mutations and mislocalization or aggregation of Fused in Sarcoma (FUS), an RNA-binding protein (RBP), cause neuronal degeneration in Amyotrophic Lateral Sclerosis (ALS) and Frontotemporal Lobar Degeneration (FTLD), two incurable neurological diseases. However, the function of FUS in neurons is not well understood. To uncover the impact of FUS in the neuronal transcriptome, we used high-throughput sequencing of immunoprecipitated and cross-linked RNA (HITS–CLIP) of FUS in human brains and mouse neurons differentiated from embryonic stem cells, coupled with RNA-seq and FUS knockdowns. We report conserved neuronal RNA targets and networks that are regulated by FUS. We find that FUS regulates splicing of genes coding for RBPs by binding to their highly conserved introns. Our findings have important implications for understanding the impact of FUS in neurodegenerative diseases and suggest that perturbations of FUS can impact the neuronal transcriptome via perturbations of RBP transcripts. Keywords: FUS; ALS; FTLD; splicing; CLIP; conserved intron
INTRODUCTION not only to mRNAs with altered 3′ UTRs but also to the for- mation of mRNAs coding for proteins with different C-ter- Pre-mRNAs transcribed by RNA polymerase II are extensive- mini (Tian et al. 2005; Wang et al. 2008a). After export of ly modified in the nucleus, often cotranscriptionally, before the mature mRNA to the cytoplasm, the pioneer round of they are exported to the cytoplasm, where they function as translation functions as a quality control mechanism to dis- mRNAs to direct protein synthesis (Dreyfuss et al. 2002; tinguish mature, properly spliced mRNAs from misspliced Ibrahim et al. 2012). In the nucleus, the 5′-end of pre- mRNAs (Schoenberg and Maquat 2012). The translating ri- mRNA is capped, introns are removed, and exons are ligated bosomes remove protein marks (known as the exon–junction together by the spliceosome assisted by numerous auxiliary – – ′ complex, EJC ) that were deposited by the spliceosome in factors including RBPs, and the 3 -end is polyadenylated the vicinity of exon–exon junctions of mRNAs, and subse- (Ibrahim et al. 2012). Alternative splicing (AS) and alterna- quent rounds of translation ensue to generate proteins. In tive polyadenylation (APA) are prevalent and very important the case of misspliced mRNAs that contain retained introns, processes that generate tremendous transcriptome diversity ribosomes typically terminate in premature termination co- from a set number of genes (Wang et al. 2008a). AS is regu- dons (PTCs) found within the retained intron; the down- lated by many trans factors, notably RBPs that belong to the stream EJC is not removed and recruits factors that will heterogeneous Ribonucleoprotein Particle (hnRNP) and ar- degrade the mRNA in a process known as nonsense-mediat- ginine and serine-rich (SR) family of proteins that bind to ed decay (NMD) (Schoenberg and Maquat 2012). cis elements on pre-mRNAs and influence the composition FUS (also known as Translocated in Liposarcoma –TLS–, of mRNAs by promoting inclusion or exclusion of exons pigpen, and hnRNP P2) is a ubiquitously expressed RBP (Ibrahim et al. 2012). APA is also widespread and is con- that contains QGSY-rich (“prion-like”), Gly-rich, RNA rec- trolled by polyadenylation signals (PAS) and other cis ele- ognition motif (RRM), zinc-finger (ZnF), and arginine-gly- ments and by many trans-binding factors (Di Giammartino cine-rich (RGG) domains (Lagier-Tourenne et al. 2010). et al. 2011; Proudfoot 2011; Berg et al. 2012), and may lead FUS is homologous to TAF15 (TBP-associated factor of RNA Polymerase II of 68-KD/15) and to EWS (Ewing sar- 3These authors contributed equally to this work. coma breakpoint region 1, –EWSR1) and together constitute 4 Corresponding author the FET family of RBPs that are predominantly nuclear but E-mail [email protected] Article published online ahead of print. Article and publication date are at also shuttle to the cytoplasm and have functions in transcrip- http://www.rnajournal.org/cgi/doi/10.1261/rna.037804.112. tion, pre-mRNA splicing and RNA processing and metabo-
498 RNA (2013), 19:498–509. Published by Cold Spring Harbor Laboratory Press. Copyright © 2013 RNA Society. Downloaded from rnajournal.cshlp.org on October 3, 2021 - Published by Cold Spring Harbor Laboratory Press
FUS regulation of neuronal RBP transcripts lism (Kovar 2011). In vitro splicing assays implicated FUS have shown that TDP-43 affects RNA splicing and regulates, in the regulation of alternative splicing (Lerga et al. 2001) among others, RNAs with neuronal functions and long in- and FUS was identified in purified human spliceosomes trons (Polymenidou et al. 2011; Tollervey et al. 2011). FUS assembled in vitro on minimal pre-mRNAs (Rappsilber et al. RNA targets were recently reported for FLAG-tagged FUS 2002; Zhou et al. 2002). FUS regulates transcription of RNA whose expression was enforced in 293T cells (Hoell et al. polymerase II genes (Wang et al. 2008b; Tan et al. 2012) and 2011), revealing prominent binding of FUS to introns and also inhibits RNA polymerase III transcription (Tan and highlighting its function in splicing. While our manuscript Manley 2010). was under preparation and revision, RNA targets for endog- ALS is a motor neuron degenerative disease and in ∼20% of enous FUS in mouse brains and human autopsy brains were cases is part of a more extensive neurodegenerative process reported, revealing that pre-mRNAs constituted the major that includes FTLD (Ibrahim et al. 2012). Dysregulation of class of RNA targets for FUS (Ishigaki et al. 2012; Lagier- RBPs has recently emerged as a prominent pathogenic mech- Tourenne et al. 2012; Rogelj et al. 2012). These studies re- anism underlying ALS following the discovery of cytoplasmic vealed a role for FUS in splicing regulation along nascent mislocalization and aggregation of the RBP known as TAR transcripts (Ishigaki et al. 2012; Lagier-Tourenne et al. RNA/DNA-binding protein of 43 kDa (TDP-43, TARDBP) 2012; Rogelj et al. 2012). Interestingly, no significant overlap in afflicted neurons from most patients with sporadic ALS between the binding sites or splicing changes was seen be- (Arai et al. 2006; Neumann et al. 2006; Lee et al. 2012). tween FUS and TDP-43, although both proteins were found TDP-43 associates with the mRNA splicing and translational to bind and regulate genes with long introns and also genes machinery (Freibaum et al. 2010). Dominant mutations of important for neuronal development (Lagier-Tourenne the TARDBP gene were later found in familial ALS cases et al. 2012; Rogelj et al. 2012). However, the degree of FUS (Kabashi et al. 2008; Sreedharan et al. 2008; Van Deerlin target conservation and the magnitude of direct vs. indirect et al. 2008), and aggregation of TDP-43 is found in most cases effects of FUS in the neuronal transcriptome are not known. of hippocampal sclerosis associated with aging (Nelson et al. Here we report conserved neuronal RNA targets and net- 2011) and in many cases of FTLD (Sieben et al. 2012). The works that are regulated by FUS. Interestingly, we find that role of RBPs in ALS solidified with the discovery that domi- FUS regulates splicing of genes coding for RBPs by binding nant mutations of FUS also cause familial ALS (Kwiatkowski to their highly conserved introns. These results have impor- et al. 2009; Vance et al. 2009). Most FUS disease mutants cause tant implications for understanding the impact of FUS in disruption in the nucleocytoplasmic shuttling of FUS, leading neurodegeneration and suggest that perturbations of FUS to increased cytoplasmic levels of FUS (Dormann et al. 2010). may have widespread effects in the neuronal transcriptome Mutations in the genes coding for the other two FET proteins, via perturbations of RBP transcripts. TAF15 and EWS, are also found in familial and sporadic ALS cases (Couthouis et al. 2011, 2012; Ticozzi et al. 2011). RESULTS AND DISCUSSION Furthermore, prominent cytoplasmic accumulation and ag- gregation of FUS, TAF15, and EWS are found in a subset of To identify the in vivo RNA-binding sites for FUS, we per- FTLD cases, further supporting the central role of FUS and formed HITS–CLIP (Supplemental Fig. S1A) using temporal of other FET proteins in neurodegeneration (Mackenzie and lobe cortices (surgical specimens) from three unrelated nor- Neumann 2011; Neumann et al. 2011). mal human brains. We first tested the specificity of the anti- FUS, along with other RBPs that cause or are associated FUS antibody, A300-302A; as shown in Figure 1, the anti- with ALS and FTLD, have prion-like domains that under body recognized a single protein migrating at ∼66 kDa on physiological conditions may oligomerize (Kato et al. 2012), immunoblots of human brain and mouse neurons whose but under pathological conditions promote aggregation and identity was confirmed to be FUS by immunoprecipitation neurotoxicity (Cushman et al. 2010; King et al. 2012) as orig- and mass spectrometry (data not shown). Furthermore, inally proposed for the pathogenesis of prion diseases (Aguzzi A300-302A has been used successfully to identify pathologi- and Rajendran 2009; Prusiner 2012). An important unan- cal FUS in FTLD cases (Neumann et al. 2011). Following UV swered question is whether neurodegeneration is caused by cross-linking, immunoprecipitation of FUS was performed mislocalization and aggregation of RBPs that effectively re- under stringent conditions and after ligation of radiolabeled duce their availability to function in normal RNA metabo- 3′ adaptor to RNA cross-linked to FUS, the reactions were lism (loss-of-function), or from a gain-of-function of the separated by NuPAGE, blotted on nitrocellulose membranes, aggregated and mislocalized RBPs, leading, for example to in- and visualized by autoradiography and immunoblotting, de- appropriate interactions with cytoplasmic RNAs, or most monstrating robust and specific FUS–RNA complexes (Fig. likely from a combination of both mechanisms (Lee et al. 1A; Supplemental Fig. S1B). cDNA libraries were prepared 2012). from RNA (CLIP tags) extracted from the membranes and Identifying the RNA targets that are bound by RBPs in vivo sequenced by Illumina GIIe analyzer, generating a total of is an essential first step toward elucidation of their functions. 7,324,501 reads from the three FUS CLIPs that mapped to TDP-43 RNA interaction maps in mouse and human brains the human genome.
www.rnajournal.org 499 Downloaded from rnajournal.cshlp.org on October 3, 2021 - Published by Cold Spring Harbor Laboratory Press
Nakaya et al.
A CLIP (Human Brain 3) B S1C,D). Approximately 60% of CLIP peak-tags (see Materials and Methods) FUS NRS Lysate FUS NRS mapped to introns, whereas 13% map- kDa kDa 70 ped to exons, 30% to transcripts aris- 138 138 60 Brain1 50 Brain2 ing from intergenic regions, and 3% to 40 Brain3 noncoding genes (Fig. 1B). The over-re- 66 FUS 66 30 presentation of intronic CLIP reads in- 20 % of CLIP peak % of CLIP dicates that FUS binds pre-mRNAs and 48 * 48 10 0 is consistent with the predominantly 35.5 35.5 nuclear localization of FUS and its role 25 25 in nuclear RNA processing events such 17 17 as pre-mRNA splicing. Prior SELEX (Systematic Evolution of Immunoblot Autoradiography Ligands by Exponential Enrichement) experiments using a recombinant, bac- C terially expressed, GST-FUS protein DMEM+FCS+LIF ADFNK ADFNK + RA ADFNB + GDNF identified a GGUG motif in 39 of 72 se- Day 0 Day 2 Day 5 Day 6 Day 12 quences selected after three rounds of
Mouse ES cells Embryonic bodies (EBs) Dispersed Neurons SELEX (Lerga et al. 2001). RNAs con-
Dispersion of EBs HITS-CLIP taining this motif bound to GST-FUS or to GST fusions of isolated RRM D and RGG domains of FUS (Lerga et al. CLIP (Mouse Neurons) E 70 2001). However, RNAs containing CU Lysate 60 Neurons #1 sequences also bound to the RRM of
#1 #2 NRS FUS #1 FUS #2 NRS FUS #1 FUS #2 50 Neurons #2 kDa kDa 40 FUS, although with lower apparent af- 138 138 30 finity (Lerga et al. 2001). Subsequent 20 % of CLIP peak % of CLIP NMR and binding studies of individual 10 FUS domains showed that binding of 66 FUS 66 0 GGUGwasmediatedbytheZnFdomain 48 * 48 of FUS, while the RRM domain of FUS 35.5 35.5 failed to bind GGUG containing RNAs F Human (696) – 25 25 (Iko et al. 2004). PAR CLIP (Photo- Mouse activatable-Ribonucleoside-Enhanced 17 17 121 (131) Crosslinking and Immunoprecipita- Immunoblot Autoradiography tion) of epitope-tagged FUS with en- forced expression in 293T cells failed – FIGURE 1. FUS HITS CLIP in human brains and mouse neurons. (A) FUS CLIP from one hu- to reveal a linear motif, but revealed a man brain (CLIPs from the other two brains shown in Supplemental Fig. S1B). Red line indicates FUS cross-linked to RNA ligated to radiolabled 3′ adaptor; RNA was isolated from FUS–RNA degenerate structural motif found in a complexes and sequenced. (NRS) Nonimmune serum (negative control); (∗) antibody bands. portion of FUS-binding sites (Hoell (B) Genomic distribution of FUS CLIP peaks from three human brains. (C) Schematic of gener- et al. 2011). A well-defined FUS-bind- ation of neurons from mouse Embryonic Stem (ES) cells. (D) FUS CLIPs from biological repli- ing motif was also not found in the cates of mouse neurons. Red line indicates FUS cross-linked to RNA ligated to a radiolabed 3′ adaptor that was used for sequencing. (E) Genomic distribution of mouse neuron CLIP peaks. studies of Ishigaki et al. (2012). A two- (F) Overlap between consistently highly bound human and mouse FUS RNA targets. fold enrichment of a GGU-contain- ing motif was found by Rogelj et al. (2012) at cross-linking sites, but, over- To verify the reproducibility and significance of the results all, FUS-binding sites showed little sequence specificity and to avoid possible confounds in the data sets introduced (Rogelj et al. 2012). by Illumina sequencing artifacts, all bioinformatics analyses We decided to perform exhaustive searches for possible were performed independently for each CLIP sample and FUS-binding motifs in our human CLIP data (and mouse all conclusions were independently validated by all CLIP CLIP data, see below) by estimating the enrichment of every samples. possible hexamer within the highest scored 5000 peaks of The correlation of gene targeting by FUS among all pairwise each sample (see Materials and Methods) that overlap combinations of the three human brains (see Materials and UCSC annotated genes. The enrichment was calculated inde- Methods) was very high (R > 0.95), indicating that FUS pendently for each sample against a random background us- HITS–CLIP was highly reproducible (Supplemental Fig. ing Fischer’s exact test (Vourekas et al. 2012). The random
500 RNA, Vol. 19, No. 4 Downloaded from rnajournal.cshlp.org on October 3, 2021 - Published by Cold Spring Harbor Laboratory Press
FUS regulation of neuronal RBP transcripts background was created by selecting one random peak of TABLE 1. Gene Ontology term analysis for FUS targets equal size to each real peak. To create a background that closely resembled the real set, random peaks were restrained GO term Description P-value to avoid overlap with any real peaks and were selected from Process − genes that contained at least one real peak. The ranked lists GO:0022610 Biological adhesion 5.17 × 10 13 − of all hexamers for all samples were combined independent- GO:0007155 Cell adhesion 5.17 × 10 13 −10 ly per each CLIP library and were assigned P-values using GO:0042391 Regulation of membrane 2.13 × 10 potential − the Rank Products method (Breitling et al. 2004). All hexa- GO:0016337 Cell–cell adhesion 1.93 × 10 9 − mers that were significantly (P-value < 0.01) over-represent- GO:0060079 Regulation of excitatory 7.12 × 10 9 ed in all replicates and had a false-positive percent lower than postsynaptic membrane potential 0.01 based on the rank product were identified (Supplemental −8 Table 1) and were found to represent ∼20% of FUS-bind- GO:0060078 Regulation of postsynaptic 1.94 × 10 membrane potential ∼ − ing sites. However, the majority ( 80%) of RNA-binding GO:0051899 Membrane depolarization 2.62 × 10 8 − sites for FUS do not contain defined motifs, consistent GO:0043113 Receptor clustering 6.55 × 10 8 − with the low-sequence specificity of FUS toward RNA GO:0007156 Homophilic cell adhesion 1.18 × 10 7 −7 binding (Ishigaki et al. 2012; Rogelj et al. 2012) and highlight- GO:0072657 Protein localization in 1.59 × 10 membrane − ing the need for experimental determination of FUS- GO:0006873 Cellular ion homeostasis 1.71 × 10 7 − binding sites. GO:0008038 Neuron recognition 3.38 × 10 7 − We reasoned that conservation of FUS-binding sites is very GO:0055082 Cellular chemical homeostasis 4.16 × 10 7 −7 likely indicative of transcripts that are functionally regulated GO:0010975 Regulation of neuron projection 5.43 × 10 development by FUS. To identify conserved neuronal targets for FUS, we −7 – GO:0050801 Ion homeostasis 9.46 × 10 performed HITS CLIP in mouse neurons differentiated Function − from embryonic stem cells (Fig. 1C, Supplemental Fig. S2A, GO:0050839 Cell-adhesion molecule binding 3 × 10 7 − B). The purity of the neurons was ∼90% (Supplemental Fig. GO:0008066 Glutamate receptor activity 5.92 × 10 7 Component S2A) and two biological replicates of FUS HITS–CLIP (Fig. − GO:0044456 Synapse part 4.35 × 10 12 − 1D) generated a total of 53,408,386 reads (using Illumina GO:0031224 Intrinsic to membrane 2.68 × 10 11 − GIIx analyzer) that were mapped to the mouse genome. GO:0005886 Plasma membrane 4.41 × 10 10 − The correlation coefficient of gene targeting between the GO:0014069 Postsynaptic density 8.3 × 10 10 − GO:0031225 Anchored to membrane 3.73 × 10 9 two mouse samples (see Materials and Methods) was 0.94, in- − GO:0044425 Membrane part 1.17 × 10 8 dicating that mouse FUS CLIP was highly reproducible −8 GO:0043005 Neuron projection 3.52 × 10 − (Supplemental Fig. S2C,D). The genomic distribution of GO:0016021 Integral to membrane 3.8 × 10 8 − mouse FUS CLIP peak-tags was similar to that of human GO:0044459 Plasma membrane part 3.91 × 10 8 − ∼ ∼ GO:0030054 Cell junction 4.1 × 10 8 CLIP peak-tags; 60% mapped to introns, 10% to exons, − GO:0033267 Axon part 3.08 × 10 7 ∼30% to intergenic regions, and ∼1.5% to noncoding genes − GO:0097060 Synaptic membrane 3.29 × 10 7 (Fig. 1E), as were the over-represented hexamers that were identified for mouse FUS binding (Supplemental Table 2), in- dicating similar functions of FUS between human and mouse neural tissues. works of genes that are vital for neuronal maintenance, Next, we determined the most consistently targeted human development, and functioning. and mouse RNAs by ranking the genes that contained FUS To analyze the impact of FUS on its RNA targets, we exam- peak-tags in all CLIP experiments; we identified 863 human ined the transcriptome of mouse neurons after knockdown genes, 696 of which have a mouse homolog (Supplemental of FUS. Biological triplicates of mouse ES cell-derived neu- Table 3). We also identified 156 mouse genes, 131 of which rons were transfected with siRNAs against FUS or control have a human homolog (Supplemental Table 4). The higher siRNAs (Fig. 2A), and total RNA and protein were extracted. number of targets identified from the human samples is not The efficiency of knockdown was determined by quantitative surprising given that the human brain is a very complex tissue (q)RT-PCR and immunoblotting and revealed significant with multiple cells types, including various types of neurons reductions in the level of FUS mRNA (Fig. 2B) and pro- and glial cells, whereas the mouse samples represented highly tein (Fig. 2C). We then performed directional RNA-seq pure neurons. Strikingly, >90% of the highly FUS-targeted (Vourekas et al. 2012), generating a total of 61,304,641 reads mouse transcripts are the same in humans (Fig. 1F), indicat- from the three FUS knockdowns and 59,339,986 from the ing that there is a conserved group of neuronal RNAs that are three control knockdowns that mapped to the mouse genome targeted by FUS. Gene Ontology (GO) term analysis (see (Supplemental Fig. S2E). It is important to note that by using Table 1) revealed enrichment of genes controlling synaptic, total RNA instead of poly(A)-selected RNA for RNA-seq we cell adhesion, and neuronal projection and recognition pro- were able to probe not only exons, but also introns of tran- cesses, implicating FUS in the regulation of interrelated net- scripts. The correlation coefficient (R) of normalized gene
www.rnajournal.org 501 Downloaded from rnajournal.cshlp.org on October 3, 2021 - Published by Cold Spring Harbor Laboratory Press
Nakaya et al.
A By examining gene expression mea- DMEM+FCS+LIF ADFNK ADFNK + RA ADFNB + GDNF sured by RNA-Seq (see Materials and Methods), we identified 94 significantly Day 0 Day 2 Day 5 Day 6 Day 9 Day 10 Day 12 (P-value < 0.01) up-regulated and 52
Mouse ES cells Embryonic bodies (EBs) Dispersed Neurons significantly (P-value < 0.01) down- regulated genes upon FUS knockdown
Dispersion of EBs Transfection of RNA-seq (Fig. 2D; Supplemental Table 5). The siRNA small number of differentially expressed BC genes indicates that transcriptional reg- qRT-PCR Western blot ulation is probably not one of the princi- Control siRNA FUS siRNA pal functions of FUS in neurons and 120 #1 #2 #3 #1 #2 #3 100 is consistent with the limited changes FUS 80 seen by microarrays (Rogelj et al. 2012). 28.8 ± 4.27 % GAPDH 60 p<0.005 As previously reported (Lagier- 40
% of control 120 Tourenne et al. 2012), many FUS CLIP 20 100 tags were found in genes containing 0 36.7 ± 7.65 % 80 Control siRNA FUS siRNA p<0.005 long introns, such as Csmd1, Nrxn3, 60 Nlgn1, Nkain2, Smyd3, and Kcnip4 40
% of control 20 (Supplemental Fig. S3). Comparison of 0 our mouse FUS CLIP data with those re- Control siRNA FUS siRNA ported in Lagier-Tourenne et al. (2012) D E showed that they are highly correlated – 15 Genes 15 Exons (R = 0.77 0.81). However, by RNA-seq and qRT-PCR analyses we did not detect alterations in the expression levels of 10 10 genes harboring long introns in mouse neuronal cells after FUS knockdown
glog (Control) 5 glog (Control) 5 (Supplemental Fig. S3). We performed RNA-seq 2 d after siRNA knockdowns, 0 0 as we could achieve maximum FUS 0 5 10 15 0 5 10 15 (mRNA and protein) knockdown in glog (Knockdown) glog (Knockdown) mouse neurons at the shortest possible time in order to capture early events FIGURE 2. Reduction in FUS protein levels leads to transcript and splicing alterations in mouse that may be related directly to FUS neurons. (A) Experimental scheme. (B) mRNA levels of FUS after knockdown assayed by qRT- PCR from three independent experiments; GAPDH was used for normalization. (C) Protein lev- activities. In contrast, Lagier-Tourenne els of FUS after knockdown quantified by Western blots from three independent experiments; et al. (2012) used prolonged FUS knock- GAPDH was used for normalization. (D,E) Scatter plots of differentially regulated genes (D) downs in mouse brains and human or exons (E) identified by RNA-seq analysis from three FUS knockdown experiments compared neurons and different approaches (anti- with controls. A total of 94 genes were up-regulated (blue) and 52 were down-regulated (red) upon FUS knockdown (D). Differentially expressed genes were identified by Student’s t-test (n sense oligonucleotides for knockdowns =3;P < 0.01) on the gene expression levels of CTRL and knockdowns (D). Exon inclusion was in mouse brains and lentiviral shRNAs up-regulated for 631 exons (blue) and down-regulated for 437 exons (red) (E). The number of for knockdowns in human neurons reads that mapped on the exon, per exon kilobase (kb), was normalized using the per sample derived from ES cells). The difference gene expression level upper quartile normalization factor (see Materials and Methods). Differential inclusion of exons was determined by Student’s t-test (n =3;P < 0.01). in the time course of the knockdowns and in the experimental systems used likely explains the differences between expression between the three FUS and three control RNA-seq our results and those of Lagier-Tourenne et al. (2012). experiments (see Materials and Methods) ranged from 0.73 Next we examined the effect of FUS knockdown in pre- to 0.97, indicating that the high reproducibility of the RNA- mRNA splicing by assessing exon expression changes. We seq experiments under all experimental conditions (Supple- found that FUS knockdown increased the expression of 631 mental Fig. S2F). The availability of both FUS CLIP and exons and decreased the expression of 437 exons in the mouse RNA-seq data from mouse neuron samples allowed us to ex- neuronal transcriptome (Fig. 2E; Supplemental Table 6), in- amine the correlation between FUS binding and RNA ex- dicating a role of FUS in regulating splicing of many tran- − pression levels. No correlation (R2 <10 4) was observed, scripts. Of the changed exons, 54.4% are constitutive exons indicating that the abundance of FUS-binding sites is not af- and 45.6% are alternative exons, suggesting that FUS has a ge- fected by RNA expression levels (see Materials and Methods). neral role in splicing and that the effects seen may reflect
502 RNA, Vol. 19, No. 4 Downloaded from rnajournal.cshlp.org on October 3, 2021 - Published by Cold Spring Harbor Laboratory Press
FUS regulation of neuronal RBP transcripts sensitivity of these exons to reduced FUS levels and to secon- documented for a number of RBPs, including hnRNPs dary effects mediated by FUS regulating other RBPs, as de- (Huelga et al. 2012) and TDP-43 (Polymenidou et al. 2011; tailed below. These splicing alterations are unlikely to reflect Tollervey et al. 2011). Other splicing regulators, such as Fox- experimental variations because we performed FUS knock- 1 (Ataxin2-binding protein 1, A2BP1), autoregulate their al- downs in biological triplicates, and the splicing changes that ternative splicing to generate dominant negative isoforms we report are seen in all samples. (Damianov and Black 2010), and in other instances RBPs To determine whether the changes in splicing were directly autoregulate alternative polyadenylation of their mRNAs related to FUS binding, we evaluated the presence of FUS- (Al-Ahmadi et al. 2009). binding sites within a region including the exon and 2 kb up- We noticed that for many conserved introns, conservation stream and downstream. We found that 34.5% of unaffected was not limited close to splice sites, but extended throughout exons have FUS CLIP reads associated with this region and, most of the intron. The extent of intron conservation across in contrast, 42.4% of changed exons have reads associated the entire human and mouse genome and their relationship with them. These findings indicate that intronic sequences to FUS has not been previously addressed. We used PhyloP around the exons whose expression levels change after FUS vertebrate phylogenetic conservation to determine that the av- knockdown are significantly (Fisher exact test, P-value < erage conservation score distribution of human and mouse − 10 7) more targeted by FUS. exons has a peak from score 500 to 2000. We excluded from However, the impact of FUS in splicing is likely more this analysis regions within introns that give rise to noncoding widespread than the one uncovered by our RNA-Seq exper- RNAs (such as miRNAs or snoRNAs) and introns that were iments because of the stringent criteria that we used to define smaller than 200 nucleotides that might also give rise to as changed exons and limitations of RNA-Seq (Ozsolak and yet unknown small noncoding RNAs, such as mirtrons. Milos 2011). Furthermore, we did not identify FUS-binding Introns having an average conservation score of 500 or higher sites for the remainder 57.6% of changed exons, suggesting were considered highly conserved. GO term analysis revealed that either CLIP failed to identify all FUS-binding sites or that genes containing at least one highly conserved intron were that the observed changes in splicing of these exons were enriched for RBPs and for proteins related to splicing, mRNA an indirect consequence of FUS knockdown. We reasoned metabolic processes, and nucleotide and small molecule bind- that such indirect effects might be mediated by other RBPs ing (Fig. 3B). A subset of RBP introns that are highly targeted targeted by FUS. In fact, during the analysis of FUS CLIPs by FUS are also highly conserved (Fig. 3C). FUS-targeted RBP we noticed that RNA-binding sites for FUS were often found introns are also more commonly conserved compared with within highly conserved introns of genes coding for RBPs. the rest of RBP introns: Of 6990 annotated human RBP in- One such example is the FUS gene itself; as shown in trons, 303 are highly conserved and 184 are bound by FUS Supplemental Figure S4, FUS binds predominantly to the (Fig. 3D; Supplemental Table 7). Of 6256 annotated mouse two conserved introns between exon 6 and exon 8 of human RBP introns, 228 are highly conserved and 87 are bound by and mouse FUS. Another prominent FUS target is the con- FUS (Fig. 3D; Supplemental Table 8). To determine whether served intron of the gene coding for the small nuclear ribonu- FUS regulates levels of conserved introns we compared by cleoprotein 70 kDa polypeptide (snRNP70, U1-70K) (Fig. RNA-seq the levels of all introns in mouse neurons after con- 3A), which is an essential component of the U1 snRNP trol or FUS knockdown and found that FUS depletion leads to that is required for recognition of the 5′ splice site during up-regulation of conserved intron levels (Fig. 3E). spliceosome assembly (Kohtz et al. 1994; Cho et al. 2011). To further explore the effect of FUS in transcripts with con- Most intronic sequences are evolutionary neutral and are served introns, we analyzed the effects of FUS knockdown and not conserved (Castresana 2002). However, conserved in- FUS overexpression in the splicing of snRNP70 in mouse neu- tronic sequences with an average length of ∼100 nt often rons. The intron between exons 7 and 8 of mouse and human flank alternatively spliced exons, suggesting that conserved snRNP70 is conserved and highly targeted by FUS (Fig. 3A,B), intronic cis elements regulate alternative splicing (Sorek and several transcripts that retain the conserved intron are and Ast 2003; Sugnet et al. 2006). Highly conserved genomic found in EST databases (data not shown). We used specific regions were previously reported as ultraconserved elements primers to quantify by qRT-PCR transcript “a,” containing (Bejerano et al. 2004) and ∼90 elements were reported within the properly spliced exons 7 and 8 of mouse snRNP70, and introns of genes coding for RNA-binding proteins and, in transcript “b,” which retains the conserved intron, from particular, RBPs involved in splicing (Bejerano et al. 2004; mouse neurons (Fig. 4A) after control or FUS knockdown. Ni et al. 2007). An intriguing property of some of the con- We found a statistically significant increase of transcript “b” served introns of genes coding for RBPs is that their inclusion containing the conserved intron (6.66 ± 0.15% of transcript generates mRNAs that contain PTCs that trigger NMD (Ni “a” in control, while 12.1 ± 1.89% in FUS knockdown); and et al. 2007; Saltzman et al. 2008). Intron inclusion in these reciprocal decrease of transcript “a” that does not contain cases serves as an autoregulatory and cross-regulatory mech- the conserved intron (67.6 ± 7.61% of control) when FUS anism to control RBP levels (Ni et al. 2007). To date, exam- protein levels were reduced (Fig. 4B). Next, we transduced ples of auto- and cross-regulation of splicing has been mouse neurons with a lentivirus expressing wild-type human
www.rnajournal.org 503 Downloaded from rnajournal.cshlp.org on October 3, 2021 - Published by Cold Spring Harbor Laboratory Press
Nakaya et al.
A B
Mouse Snrnp70 – Chromosome 7 Human brains Description P-value GO:0003723 RNA binding 2.50E-10 5 kb GO:0000166 nucleotide binding 1.75E-06 GO:0036094 small molecule binding 3.87E-06 252 GO:0016071 mRNA metabolic process 8.38E-06 CLIP-Mouse GO:0043484 regulation of RNA splicing 1.15E-04 Neurons 1 Mouse neurons 1 GO:0003723 RNA binding 4.59E-06 124 GO:0036094 small molecule binding 9.38E-04
CLIP-Mouse Neurons 1 C 0.004 1 0.003 RNA-seq 10 FUS-KO 1 0.002 Conserved Introns Exon 8 Exon 7 0.001 RNA-seq 10 FUS-CTRL 1 0.000 -500 0 500 1000 1500 Snrnp70 3’- -5’ Conservation score +2.1 _ Mammalian 0 - Conservation _ D GO: RNA Binding -3.3 (–) Conserved Intron Introns Human Mouse
All 6,990 6,526
Conserved 303 228 Human SNRNP70 – Chromosome 19 FUS-bound 184 87 conserved 10 kb
135 CLIP- E Brain 1 1 51 CLIP-