Hnrnp L Represses Cryptic Exons 2 3 Sean P
Total Page:16
File Type:pdf, Size:1020Kb
Downloaded from rnajournal.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 1 HnRNP L represses cryptic exons 2 3 Sean P. McClory1, Kristen W. Lynch1, Jonathan P. Ling2 4 5 1Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, PA 19104-6059, USA 6 2Department of Neuroscience, The Johns Hopkins University School of Medicine, Baltimore, MD 21205-2196, USA 7 8 Abstract 9 The fidelity of RNA splicing is regulated by a network of splicing enhancers and repressors, although 10 the rules that govern this process are not yet fully understood. One mechanism that contributes to 11 splicing fidelity is the repression of nonconserved cryptic exons by splicing factors that recognize 12 dinucleotide repeats. We previously identified that TDP-43 and PTBP1/PTBP2 are capable of repressing 13 cryptic exons utilizing UG and CU repeats, respectively. Here we demonstrate that hnRNP L (HNRNPL) 14 also represses cryptic exons by utilizing exonic CA repeats, particularly near the 5’SS. We hypothesize 15 that hnRNP L regulates CA repeat repression for both cryptic exon repression and developmental 16 processes such as T cell differentiation. 17 18 Introduction 19 RNA splicing in higher eukaryotes helps produce the transcriptional diversity that is required for 20 cellular differentiation. This robust process is governed by RNA-binding splicing factors, many of which 21 bind to degenerate consensus sequences (Fu and Ares, 2014; Wahl et al., 2009). Given this limited 22 specificity, how is the spliceosome able to distinguish proper exons from similar intronic sequences? 23 Most introns contain cryptic “pseudoexon” sequences that appear to have all the appropriate splicing 24 motifs, but are never incorporated into mature RNA transcripts (Dhir and Buratti, 2010). We previously 25 reported that certain pseudoexons are flanked by repetitive dinucleotide sequences, and that the 26 repression of these cryptic exons is mediated by splicing factors that recognize said dinucleotide repeats 27 (Ling et al., 2016, 2015). Repression of cryptic exons may help maintain splicing fidelity, given that the Downloaded from rnajournal.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 28 sequences of cryptic exons are not conserved between species and thus appear to have arisen 29 stochastically. 30 Transactivation response element DNA-binding protein 43 (TDP-43, TARDBP), polypyrimidine tract- 31 binding protein 1 (PTB, PTBP1) and polypyrimidine tract-binding protein 2 (nPTB, PTBP2) were identified 32 as splicing factors that mediated cryptic exon repression. Specifically, TDP-43 binds to UG repeats (Jeong 33 et al., 2017; Ling et al., 2015; Sun et al., 2017) while PTBP1 and PTBP2 bind to CU repeats (Ling et al., 34 2016). Other dinucleotide repeats exist in and around cryptic exons, however, aside from UG and CU. 35 This led us to search for other splicing factors that might repress cryptic exons using other dinucleotide 36 sequences. Here, we demonstrate that heterogeneous nuclear ribonuclear protein L (hnRNP L, HNRNPL) 37 is a cryptic exon repressor that binds to CA repeats. 38 HnRNP L was first identified as a member of the heterogeneous nuclear ribonucleoprotein (HNRNP) 39 family (Pinol-Roma et al., 1989) and has a homolog, hnRNP LL (HNRNPLL). Among the HNRNP family 40 members, hnRNP L and hnRNP LL share closest homology with PTBP1 and PTBP2 (Blatter et al., 2015; 41 Busch and Hertel, 2012; Ghetti et al., 1992). HnRNP L was shown to play a role in RNA splicing (Hui et al., 42 2003a) soon after its discovery and like many other splicing factors, autoregulates the levels of its own 43 transcript (Rossbach et al., 2009). HnRNPL was a leading candidate for cryptic exon repression because 44 strong experimental consensus has indicated that hnRNP L binds to CA dinucleotide repeats (Hui et al., 45 2005, 2003b; Smith et al., 2013). This preference for CA repeats was further confirmed by UV cross- 46 linking immunoprecipitation (CLIP) (Rossbach et al., 2014; Shankarling et al., 2014) and crystal structure 47 analysis (W. Zhang et al., 2013). CA repeats are among the most common repeats in the human genome 48 (Lander et al., 2001; Sawaya et al., 2013; Subramanian et al., 2003). Furthermore, hnRNP L has been 49 reported to act as a splicing repressor in certain contexts (Hui et al., 2005; Hung et al., 2008; Motta- 50 Mena et al., 2010; Rothrock et al., 2005), with several reports suggesting that hnRNP L interferes with 51 appropriate assembly of the spliceosome at the 5’ splice site (5’SS) (Chiou et al., 2013; Heiner et al., Downloaded from rnajournal.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 52 2010; Hui et al., 2005; Loh et al., 2017). Together, these studies provided strong rationale to study 53 hnRNP L as a cryptic exon repressor candidate. 54 55 Results and Discussion 56 In order to determine whether hnRNP L is a cryptic exon repressor, we analyzed RNA-seq datasets 57 from hnRNP L knockdown in three human cell lines: JSL1, HepG2, and K562 (Cole et al., 2015; Sloan et 58 al., 2016; Sundararaman et al., 2016), as well as RNA-seq datasets from genetic knockout of Hnrnpl in 59 mouse fetal liver hematopoietic stem cells (HSC) (Gaudreau et al., 2016). Applying the bioinformatics 60 pipeline previously used to identify TDP-43 and PTBP1/PTBP2 cryptic exons (Ling et al., 2016, 2015), we 61 analyzed the above datasets for novel splice sites and found many cryptic cassette exons, cryptic 62 alternative splice site usage (exon extensions), cryptic terminal exons and cryptic polyadenylation sites 63 (3’ end exons) that were only observed after hnRNP L depletion (Fig. 1A-E, Fig. S1, Supplemental Excel 64 File). Importantly, we validated by RT-PCR use of many of the cryptic exons in JSL1 cells depleted of 65 hnRNP L protein (Fig. 1F-H, Fig. S2). We were also able to identify cryptic cassette exons, cryptic exon 66 extensions and cryptic 3’ end exons in the mouse knockout datasets (Fig. 1I-K). As expected, there was 67 no overlap in cryptic exons between mouse and human, consistent with the lack of conservation of 68 these elements. 69 Next, we analyzed hnRNP L knockdown in the HepG2 and K562 cell lines utilizing RNA-seq datasets 70 from the ENCODE shRNA library. Although we found that 58% (49/84) of cryptic exons were shared 71 between at least two cell types, 42% (35/84) of cryptic exons were unique to a single cell type (Fig. S3). 72 Notably, much of this cell-type specificity appears to be due to the cell-type specific expression of the 73 gene (Supplemental Excel File). These results are analogous to our previous observations with tissue- 74 specific cryptic exons associated with TDP-43 loss of function (Jeong et al., 2017). Downloaded from rnajournal.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 75 Further inspection of cryptic exons and their flanking sequences confirmed the presence of 76 numerous CA repeat clusters. Interestingly, aligning cryptic cassette exons to the 3’SS and 5’SS revealed 77 that most CA repeats are exonic and proximal to the 5’SS (Fig. 2A, 2B). We confirmed the functionality of 78 these exonic CAs in at least one case in which they were well clustered by transfection of JSL1 cells with 79 an antisense morpholino oligo (AMO) complementary to 7 CA dinucleotides located upstream of the 80 cryptic 5’ss in SH3BGRL (Fig. 3A). Blocking of these 7 CAs by AMO resulted in a level of exon inclusion 81 that mirrors that of hnRNP L depletion by translation blocking AMO (Fig. 3B). In contrast to the pattern 82 of CA dinucleotides in the hnRNP L cryptic exons, analysis for TDP-43 and PTBP1/2 cryptic exons 83 revealed that each dinucleotide repeat had different frequency distributions. TDP-43 cryptic exons had 84 primarily intronic UG repeats that were most commonly downstream of the 5’SS, with some repeats 85 upstream of the 3’SS. In contrast, PTBP1 and PTBP2 cryptic exons had mostly intronic CU repeats 86 upstream of the 3’SS but also some exonic repeats as well (Fig. 2B). 87 In general recognition and splicing of exons involves the stepwise assembly of various U snRNP 88 components of the spliceosome with the pre-mRNA substrate (Fig. 4A, right). Regulation of any of these 89 steps can alter the efficiency of splicing (Fu and Ares, 2014; Wahl et al., 2009). Interestingly, previous 90 work has shown that hnRNP L represses exon inclusion through several distinct mechanisms. Binding of 91 hnRNP L in the intron immediately proximal to the 5’ SS results in repression, presumably through steric 92 hindrance of U1 recruitment (Hui et al., 2005; Hung et al., 2008). By contrast, in the case of CD45 exon 4, 93 binding of hnRNP L within the exon functions to recruit hnRNP A1, remodel the U1 snRNP association 94 with the 5’SS, and thereby inhibit association of U6 snRNP (Chiou et al., 2013). Importantly, we showed 95 that a hallmark of the U1-remodeling mechanism of hnRNP L repression is a relatively strong association 96 of the snRNA component of the U1 snRNP with the 5’SS. Specifically, exons for which the 5’SS:U1 snRNA 97 base paring had a minimal free energy (MFE) of -7 or lower (more negative) were prone to repression 98 when hnRNP L bound to the exon (Chiou et al., 2013). Downloaded from rnajournal.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 99 Given our observation that hnRNP L cryptic exons have a high frequency of CA repeats near the 5’SS 100 (Fig.