THE JOURNAL OF BIOLOGICAL CHEMISTRY VOL. 284, NO. 1, pp. 630–639, January 2, 2009 © 2009 by The American Society for Biochemistry and Molecular Biology, Inc. Printed in the U.S.A.

Dimerization and Binding Specificity of the U2AF Homology Motif of the Splicing Factor Puf60*□S Received for publication, July 16, 2008, and in revised form, October 7, 2008 Published, JBC Papers in Press, October 29, 2008, DOI 10.1074/jbc.M805395200 Lorenzo Corsini‡, Michael Hothorn‡1, Gunter Stier‡, Vladimir Rybin‡, Klaus Scheffzek‡, Toby J. Gibson‡, and Michael Sattler‡§¶2 From the ‡Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany, the §Institute of Structural Biology, Helmholtz Zentrum Mu¨nchen, Ingolsta¨dter Landstrasse 1, 85764 Neuherberg, Germany, and the ¶Munich Center for Integrated Protein Science and Chair Biomolecular NMR, Department Chemie, Technische Universita¨tMu¨nchen, Lichtenbergstrasse 4, 85747 Garching, Germany Downloaded from PUF60 is an essential splicing factor functionally related and splicing initiation U2 snRNP is brought to the 3Ј splice site. This homologous to U2AF65. Its C-terminal domain belongs to the involves base pairing of the U2 RNA to the branch site RNA (9) family of U2AF (U2 auxiliary factor) homology motifs (UHM), a and localization of the SF3b subunit p14 near the branch point subgroup of RNA recognition motifs that bind to tryptophan- adenosine by an interaction with the N terminus of the U2 containing linear peptide motifs (UHM ligand motifs, ULMs) in snRNP component SF3b155 (10–12). Initial contacts between http://www.jbc.org/ several nuclear . Here, we show that the Puf60 UHM is the U2 snRNP and the pre-mRNA are mediated by the N ter- mainly monomeric in physiological buffer, whereas its dimer- minus of SF3b155 binding to U2AF65 and displacing SF1 from ization is induced upon the addition of SDS. The crystal struc- the branch point sequence (13). ture of PUF60-UHM at 2.2 A˚ resolution, NMR data, and muta- The third RNA recognition motif (RRM)3 of U2AF65 and the tional analysis reveal that the dimer interface is mediated by RRM of U2AF35 share distinct sequence features that are not electrostatic interactions involving a flexible loop. Using gluta- found in canonical RRMs and mediate binding to tryptophan- at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 thione S-transferase pulldown experiments, isothermal titra- containing peptide sequences in cognate splicing factors (14– tion calorimetry, and NMR titrations, we find that Puf60-UHM 19). These noncanonical RRMs form a subgroup of RRMs binds to ULM sequences in the splicing factors SF1, U2AF65, and called U2AF homology motif (UHM) (20). UHMs in other SF3b155. Compared with U2AF65-UHM, Puf60-UHM has dis- splicing factors have also been shown to bind to short trypto- tinct binding preferences to ULMs in the N terminus of phan-containing linear motifs in U2AF65, SF1, and SF3b155 SF3b155. Our data suggest that the functional cooperativity (21–24), which are thus called UHM ligand motifs (ULM) (23). between U2AF65 and Puf60 may involve simultaneous interac- High resolution structures of proteinpeptide complexes tions of the two proteins with SF3b155. involving U2AF35-UHM/U2AF65-ULM (14), U2AF65-UHM/ SF1-ULM (15), and the UHM of SPF45 (splicing factor 45 kDa) bound to an ULM in SF3b155 (23) all share a very similar mode Pre-mRNA splicing is a stepwise process initiated by the rec- of molecular recognition, suggesting that UHMs in other pro- ognition of sequence elements at the splice site by specific splic- teins might bind similar linear motifs as well. Because the ULM ing factors (1). The branch point sequence is recognized by consensus motif is rather short ((K/R)4–6X0–1W(D/E/N/Q)1–2), splicing factor SF1 (2, 3), whereas the polypyrimidine tract and its mere presence in a protein sequence cannot unambigu- the 3Ј splice site AG-dinucleotide are bound by the het- ously identify a functional ULM. Candidate ULMs are found erodimer U2AF65-U2AF35 (4–7). Although SF1 alone interacts in many proteins, but a biological function has so far been only weakly with the branch point sequence, this interaction is assigned only to those in U2AF65 (17, 25), SF1 (16, 23), and stabilized significantly by U2AF65, which binds simultaneously SF3b155 (13, 23). to SF1 and to the polypyrimidine tract (8). In the next step of PUF60 (poly-U-binding factor 60 kDa, also called FIR, Hfp, Ro-bp1) is a splicing factor homologous to and complementary in function to U2AF65. Similarly to U2AF65, its domain struc- * This work was supported by Deutsche Forschungsgemeinschaft Grant Sa ture consists of a predicted intrinsically unstructured N termi- 823/5 and European Commission 3D Repertoire Grant LSHG-CT-2005- 512028, and funds from the Peter and Traudl Engelhorn Foundation (to nus, two central RRM domains, and a C-terminal UHM. The M. H.). The costs of publication of this article were defrayed in part by the UHM domain is special in that it has been reported to mediate payment of page charges. This article must therefore be hereby marked homodimerization of Puf60 in SDS-PAGE (21). Full-length “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indi- cate this fact. Puf60 was found to interact with itself in yeast two-hybrid anal- □S The on-line version of this article (available at http://www.jbc.org) contains yses, suggesting that the oligomerization detected in SDS- supplemental Fig. S1–S5. PAGE also occurs under physiological conditions (26, 27). Theatomiccoordinatesandstructurefactors(code3DXB)havebeendepositedin the , Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/). 3 The abbreviations used are: RRM, RNA recognition motif; UHM, U2AF 1 Present address: Plant Biology Laboratory, The Salk Institute, La Jolla, CA homology motif; ULM, UHM ligand motif; GST, glutathione S-transferase; 92037. HSQC, heteronuclear single quantum coherence; NOE, nuclear Overhauser 2 To whom correspondence should be addressed. Tel.: 49-89-289-13418; Fax: effect; AUC, analytical ultracentrifugation; Trx, thioredoxin A; ITC, isother- 49-89-289-13869; E-mail: [email protected]. mal titration calorimetry.

630 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 284•NUMBER 1•JANUARY 2, 2009 Structure and Function of Puf60 UHM

Puf60 was discovered as a poly-U RNA-binding protein tionally purified by size exclusion chromatography on a Super- required to reconstitute splicing in depleted nuclear extracts. dexTM 75 16/60 prep grade column. Its function is enhanced by the presence of U2AF65, but not by NMR—All of the NMR spectra were recorded at 300 K on a the small U2AF subunit, U2AF35 (21). Puf60 and U2AF65 can Bruker DRX500 spectrometer, processed with NMRPipe (32), interact in vitro and in yeast cells (21, 26, 27). It was recently and analyzed with NMRView (33). Backbone 1H, 15N, and 13C demonstrated that Puf60 and U2AF65 mutually enhance their resonances were assigned with standard triple resonance affinity for binding polypyrimidine tract RNA in a cooperative experiments (34). 15N relaxation data were recorded as fashion. Moreover, the ratio of U2AF65 to Puf60 can directly described (35). Dissociation constants were derived from influence selective inclusion or skipping of alternatively spliced chemical shift displacements in HSQC spectra upon the addi- exons in several (28). The function of Puf60 in splicing is tion of ligands as described (36) (see supplemental data). thus closely linked to the function of U2AF65. Crystallization and Data Collection—For crystallization, the In addition to its role in alternative splicing, Puf60 also con- chimeric thioredoxin-Puf60(460–559) fusion protein was con- trols human c-myc expression. Under the synonym FIR centrated to about 70 mg/ml in 20 mM Tris (pH 7.0), 150 mM

(FBP-interacting repressor), Puf60 was reported to interact NaCl, 5 mM ␤-mercaptoethanol. The crystals were grown by Downloaded from with and inhibit the transcription factor FBP (FUSE (far vapor diffusion from hanging drops composed of 1 ␮l of protein ␮ upstream sequence element)- binding-protein), an activator of solution and 1 l of crystallization buffer (1.4 M (NH4)2SO4,50 c-myc promoters (29). Probably because of a similar mecha- mM potassium formate) suspended over 1 ml of the latter as nism, mutations in the Drosophila homolog of Puf60, Hfp (Half reservoir solution. The crystals grew to sizes of about 100 ϫ

Pint), lead to increased expression of d-myc genes, thus nega- 100 ϫ 500 ␮m and were cryoprotected by serial transfer into a http://www.jbc.org/

tively regulating cell cycle progression (30). Hfp mutations also solution containing 20% (v/v) ethylene glycol, 1.5 M (NH4)2SO4, lead to aberrant splicing of specific mRNAs in Drosophila ova- 50 mM potassium formate). Diffraction data were recorded at ries (31). Similar to its mammalian ortholog Puf60, Hfp is thus a beam-line PX01 of the Swiss Light Source (Villigen, Switzerland). regulator both of transcription and of alternative splicing. Data processing and scaling was carried out with XDS (37). Here, we report that Puf60 UHM is mainly monomeric under Structure Determination and Refinement—The structure of at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 physiological conditions, whereas it dimerizes upon the addi- the thioredoxin-Puf60 fusion protein was solved by molecular tion of SDS. The crystal structure of PUF60-UHM and muta- replacement as implemented in PHASER (38). The structure of tional analysis reveal that the dimerization is entirely mediated E. coli thioredoxin (Protein Data Bank code 2TRX) and a by electrostatic interactions. NMR relaxation data show that homology model of Puf60-UHM generated with MODELLER the dimer interface involves a loop that is highly flexible in (39) based on the structure of free SPF45-UHM (Protein Data solution. Furthermore, we show that PUF60-UHM binds to Bank code 2PE8) as a template were used as search models. The ULM sequences in U2AF65, SF1, and SF3b155. The UHMs in solution comprises eight Trx-Puf60-UHM monomers that PUF60 and U2AF65 show preferences for binding to different were refined in alternating cycles of model correction in COOT ULMs in the N terminus of SF3b155. We propose that PUF60 (40), and restrained refinement as implemented in REFMAC and U2AF65 may cooperatively recruit U2 snRNP by simulta- (41) and PHENIX.REFINE (Ref. 42; see Table 1 for structural neously binding to SF3b155. statistics). Structures were visualized with PYMOL (DeLano Scientific LLC, San Carlos, CA). The eight UHM domains in the EXPERIMENTAL PROCEDURES unit cell of the crystal structure can be superimposed onto a Protein Preparation—Recombinant Puf60-UHM (residues reported solution structure of Puf60-UHM (Protein Data Bank 460–559, wild type, and mutants), thioredoxin-Puf60-UHM, code 2DNY) with root mean square deviations of 0.9–1.1 Å SPF45-UHM(301–401), U2AF35-UHM (residues 38–152), and over 90 of 100 C␣ atoms. The solution structure, however, does U2AF65-UHM (residues 369–475) were expressed from mod- not indicate dimerization of the Puf60-UHM.

ified pET9d vectors with a noncleavable N-terminal His6 tag. GST Pulldown Experiments—GST-tagged ULMs (1 nmol) 65 ␮ U2AF (85–112), SF1(1–25), SF3b155 (1–424, 194–229, 210– were mixed with 3 nmol of His6-tagged UHMs in 150 lof 251, 229–269, 284–307, 317–357, wild type, and mutants), and phosphate-buffered saline supplemented with 2 mM ␤-mercap- Prp16 (1–314, 201–238, wild type, and mutants) were toethanol and 0.1% (w/v) Igepal CA-630 at 22 °C and mixed expressed from modified pET9d vectors with tobacco etch vigorously for 1 h. For GST precipitation, 8 ␮l of glutathione-

virus protease cleavable, N-terminal His6, and glutathione Sepharose 4B (Amersham Biosciences) pre-equilibrated in S-transferase (GST) tags. Unlabeled proteins were expressed in phosphate-buffered saline were added and mixed vigorously for Escherichia coli BL21(DE3)pLysS in LB medium. All of the pro- 30 min. The beads were washed three times for 1 min in the teins were purified with nickel-nitrilotriacetic acid-agarose buffer described above and analyzed by SDS-PAGE. Western (Qiagen) under standard conditions and buffer exchanged to blotting was carried out with ␣-Puf60 antibody (Abcam 22819). phosphate-buffered saline. For the preparation of ULM pep- tides for NMR titrations see the supplemental data. Isotopically RESULTS 13C- and/or 15N-labeled proteins were expressed in minimal Puf60-UHM Is Mainly Monomeric in Physiological Buffer— 13 (M9) medium supplemented with C-D-glucose and/or Puf60 interacts with itself in yeast two-hybrid analyses (26, 27), 15 NH4Cl. NMR samples were concentrated to 0.3–1.0 mM in 20 and its C-terminal UHM domain has been shown to form ␤ mM Na2PO4 buffer (pH 6.8), 150 mM NaCl, and 5 mM -mer- dimers resistant to denaturing SDS-PAGE (21). We used NMR captoethanol. The samples used for crystallization were addi- spectroscopy to characterize the oligomerization state of

JANUARY 2, 2009•VOLUME 284•NUMBER 1 JOURNAL OF BIOLOGICAL CHEMISTRY 631 Structure and Function of Puf60 UHM

TABLE 1 RRM2 and the C-terminal UHM had a tendency to dimerize Summary of crystallographic analysis without DNA or SDS. Our AUC data indicate that this con- The values in parentheses indicate the highest resolution shell. struct is largely monomeric as well (Fig. 1D, solid black line). Data collection Taken together, these data demonstrate that Puf60-UHM (in Space group P212121 (19) the absence of SDS) and Puf60 RRM1-RRM2-UHM (in the Unit cell dimensions a ϭ 75.12; b ϭ 89.42; c ϭ 299.39; ␣ ϭ ␤ ϭ ␥ ϭ 90° absence of DNA or SDS) are monomeric in solution. Therefore, Wavelength (Å) 1.006 the UHM dimerization observed in denaturing and reducing Resolution range (Å) 49.75–2.20 (2.33–2.20) Unique reflections 102,915 (16265) SDS-PAGE (21) is presumably induced by the experimental Redundancy 6.1 (5.8) conditions. ϽIϾ/␴(I)a 14.54 (3.21) a The Three-dimensional Structure of the PUF60-UHM—Next, Rmeas (%) 10.4 (60.6) Completeness (%) 99.8 (98.9) we determined the crystal structure of Puf60-UHM at 2.2 Å Refinement resolution. Diffracting crystals could only be obtained using a Resolution range (Å) 49.75–2.20 b fusion protein, in which E. coli thioredoxin A (Trx) is con- Rwork/Rfree 0.211/0.271 No. of atoms nected to the N terminus of Puf60-UHM via a short linker Downloaded from Protein 13,138 Water 1209 sequence (49). We confirmed that the Trx-UHM construct Mean B-factors (Å2)c dimerizes in SDS-PAGE similarly to what is seen for the UHM Thioredoxin domains 22.44 Puf60-UHM domains 20.13 alone (data not shown). Water 35.21 The asymmetric unit consists of eight Trx-Puf60 fusion pro-

Root mean square deviations teins arranged in a doughnut shape. Eight Trx molecules are http://www.jbc.org/ Bond lengths (Å) 0.006 Angles (°) 1.012 stacked in two layers in the center of the doughnut, surrounded Ramachandrand by a ring of eight PUF60-UHM domains (Fig. 2A). Consistent Most favored (# (%)) 1387 (92.3%) Additionally allowed (# (%)) 106 (7.1%) with the NMR secondary chemical shifts, Puf60-UHM adopts a Generously allowed (# (%)) 2 (0.1%) ␤1-␣A-␤2-␤3-␣B-␤4-␣C secondary structure. A central four- Disallowed (# (%)) 7 (0.5%) ␤ ␣ ␣ a stranded -sheet is sandwiched by helices A and Bonone As defined in XDS (37). at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 b As defined in REFMAC (41). side and helix ␣C on the other side (Fig. 2B). As seen in other c Residual isotropic B-factor after TLS refinement. UHM structures (14, 23), Puf60-UHM has an additional strand d As defined in PROCHECK (59). ␤3Ј adjacent to ␤4, which forms a ␤-hairpin extension to the central four-stranded ␤-sheet. The ␤3Ј strand comprises the

Puf60-UHM in solution (50 mM Pi, pH 7.0, 150 mM NaCl, 5 mM conserved Arg-Xaa-Phe motif (RWF 535–537 in Puf60), which dithiothreitol). NMR secondary chemical shifts (Fig. 1A) show plays a crucial role in ULM binding in all known UHMULM that Puf60-UHM adopts the typical ␤1-␣A-␤2-␤3-␣B-␤4-␣C complexes (14, 15, 23). A unique structural feature of Puf60- topology found for all RRMs and UHMs (20, 43). The overall UHM is the presence of unusually long ␤2 and ␤3 strands, ␶ ␤ ␤ rotational correlation time ( c) of Puf60-UHM was calculated which form a -hairpin that protrudes out of the -sheet (Fig. 15 ␤ ␤ from the ratio of the trimmed mean N longitudinal (T1) and 2B). In solution, the acidic 2- 3 loop is flexible, as indicated by transverse (T2) relaxation times of residues with heteronuclear low heteronuclear NOE values, which drop to a minimum of 1H-15N NOE values above 0.65 (Fig. 1B) (44, 45). The average 0.12 for Gly504 (Fig. 1B). In contrast, an average heteronuclear 15 N T1/T2 ratio for these residues is 7.4 (Fig. 1C), corresponding NOE of 0.74 for residues 462–501 and 512–559 indicates the ␶ to a c of 9.7 ns. However, at 50 MHz Larmor frequency and absence of internal motion on subnanosecond time scales. ␶ ϭ ϭ 297 K, c values of 8.3 ns (T1/T2 5.7) and 15.8 ns (T1/T2 Dimerization Interface—Because Puf60-UHM crystallizes at 17.7) would be expected for a 13-kDa monomer and a 26-kDa concentrations above the dimerization constant of 3–4 mM (1.5 UHM domain dimer, respectively (46, 47) (gray lines in Fig. 1C). mM in the mother liquor, 32 mM in the crystal lattice), we Thus, the observed relaxation times indicate the presence of a expected to detect a dimeric UHM in the crystal. Analysis with mainly monomeric rather than a dimeric form of Puf60-UHM. PISA (50) shows that each of the eight Puf60-UHM domains

The slightly increased T1/T2 ratio, compared with what is contacts three other Puf60-UHM domains, two in the same and expected for a pure monomer, might result from some nonspe- one in a symmetry-related asymmetric unit. One of the UHM-

cific aggregation, because the T1/T2 ratio of Puf60-UHM lack- UHM interfaces within an asymmetric unit is composed of ϭ ␶ ϭ ing an N-terminal His tag (T1/T2 4.4, c 7.0 ns; data not charged interactions between the residues EEE (505–507) in shown) is consistent with a monomeric protein. the ␤2-␤3 loop of one protein monomer and Arg467, Arg540/ To further investigate the oligomerization state of the UHM, Lys541 in the adjacent strands ␤1 and ␤4 of the other dimer we used sedimentation velocity analytical ultracentrifugation subunit, respectively (Fig. 2C). As shown in the electrostatic (AUC). The AUC data also indicate a largely monomeric state surface representation (Fig. 2C, right panel), the residues of the UHM domain (Fig. 1D, solid gray line), whereas partial Arg467, Arg540, and Lys541 form a positively charged surface, dimerization is observed at higher protein concentrations (Fig. which is contacted by the negatively charged acidic ␤2-␤3 loop. 1D, dotted gray line). By fitting the AUC data to a monomer- The electrostatic interactions involve the tips of the long side dimer equilibrium model, the dimerization constant is esti- chains of arginine/lysine, which contact glutamate/aspartate ϭ ␤ ␤ mated to be Kdimer 3–4 mM. The two central RRM domains residues in the mobile 2- 3 loop. Of the six salt bridges that of Puf60 were reported to dimerize in the presence of DNA (48). can be formed, electron density is visible for a maximum of four We therefore tested whether a construct comprising RRM1- contacts in any of the putative dimer interfaces in the asymmet-

632 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 284•NUMBER 1•JANUARY 2, 2009 Structure and Function of Puf60 UHM

K541A, R467A, and RK540– ∆δ(C ) -∆δ(C ) secondary chemical shift 10 α β 541AAϩR467A). The structural integrity of the E505A/E506A/ E507A mutant was confirmed by

p.p.m. comparison of the HSQC spectra (supplemental Fig. S1). The integ-

-10 rity of the other mutants was con- 460 β1 αA β2 β3 αB β4 αC 559 ESTVMVLRNMVDPKDIDDDLEGEVTEECGKFGAVNRVIIYQEKQGEEEDAEIIVKIFVEFSIASETHKAIQALNGRWFAGRKVVAEVYDQERFDNSDLSA firmed by one-dimensional NMR (data not shown). {1H}-15N NOE In denaturing SDS-PAGE, Puf60- 1.0 UHM wild type (12.6 kDa), E501A/ 0.8 K502A/Q503A, R540A/K541A, and 0.6 R467A run at an apparent molecu-

0.4 lar mass of 28 kDa (Fig. 2D, lanes 1, Downloaded from 0.2 2, 5, and 6, respectively), as expected 0 for a dimer. In contrast, the mutants E505A/E506A/E507A (lane 3), 1 15 D508A/A509A/E510A (lane 4), and 20 T1/T2 of residues with { H}- N NOE > 0.65

rigid dimer R540A/K541AϩR467A (lane 7) run http://www.jbc.org/

15 at lower molecular masses, indicat- ing that their dimerization is impaired. 10 These findings indicate that the dimerization of Puf60 UHM 5 at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 monomer involves the acidic residues 505EEEDAE510 in the flexible ␤2-␤3 0 loop and the basic residues Arg467 and Arg540-Lys541. Salt bridges and electrostatic contacts between these regions thus mediate dimerization 0.6 12.4 +/- 0.6 kDa Puf60(UHM) 29 µM of Puf60-UHM in the presence of 0.5 theor. 12.6 kDa SDS and presumably also contrib- Puf60(UHM) 377 µM ute to the small population of 0.4 47.5 +/- 2.5 kDa theor. 12.6 kDa dimeric species in physiological Puf60(RRM1+2, UHM) buffers (Fig. 1D). theor. 48.2kDa (11 µM) 0.3 To confirm that the observed bands indeed correspond to dimer- 0.2 ization of the UHM in SDS-PAGE 0.1 and that the observed positions of the bands do not fortuitously appear 0 at unusual positions, we mixed 0 25 50 75 100 125 150 recombinant, purified ZZ-tagged wild type UHM (28.4/56.8 kDa for FIGURE 1. NMR analysis of Puf60-UHM. All of the experiments were recorded at 297 , on a Bruker DRX500 monomer/dimer; Fig. 2E, lane 1) spectrometer using standard experiments (33). A, secondary chemical shifts ⌬␦(13C␣-13C␤) reveal secondary with untagged wild type UHM structure as indicated below the graph with the primary sequence of Puf60-UHM. B, 1H-15N heteronuclear NOE 15 1 15 Ͼ (12.6/25.2 kDa; Fig. 2E, lane 6). of Puf60-UHM. C, ratio of N T1 and T2 relaxation times for residues with H- N 0.65. The gray horizontal lines depict expected average values for a 12.6-kDa monomer and a 25-kDa dimer at 297 K and 50.68 MHz Larmor Because the protein species at 41.2 frequency, calculated as in Refs. 44 and 45. D, analytical ultracentrifugation analysis of Puf60-UHM (solid gray kDa in lanes 2 and 3 is not contained line, concentration 29 ␮M; dotted gray, 377 ␮M) and a construct comprising the two central RRM domains and the UHM of Puf60 (black line,11␮M). The expected theoretical molecular masses are given on the right. in either pure ZZ-tagged UHM (lane 1) or untagged UHM (lane 6), ric unit. Notably, the combinations of charged residues the appearance of a mixed dimer species of the type involved in direct salt bridges vary for the different dimer inter- ZZ-UHMUHM at 41.2 kDa (lanes 2 and 3) proves the forma- faces in the asymmetric unit. tion of a mixed dimer. The UHM mutant E505A/E506A/E507A To determine which residues are involved in the dimeriza- does not form the mixed dimer species, confirming that the tion of Puf60 in SDS-PAGE, we introduced amino acid changes mutations impair the dimerization of the UHM in SDS-PAGE for the residues in the ␤2-␤3 loop (E501A/K502A/Q503A, (Fig. 2E, lanes 4, 5, and 7). E505A/E506A/E507A, and D508A/A509A/E510A) and of the SDS Induces the Dimerization of Wild Type Puf60 UHM—To positively charged residues that contact the acidic loop (R540A/ gain some insight into the molecular basis of the SDS-induced

JANUARY 2, 2009•VOLUME 284•NUMBER 1 JOURNAL OF BIOLOGICAL CHEMISTRY 633 Structure and Function of Puf60 UHM

of Puf60-UHM. However, interme- diate SDS concentrations (Ͼ0.02% (w/v) SDS, ϳ3-fold molar excess) induce dimerization of the wild type protein, as indicated by the appear- ance of new signals in the NMR spectra that have 15N NMR relax- ation properties expected for a dimer (supplemental Fig. S3). With further increasing SDS concentra- tions, both Puf60-UHM wild type as well as the mutant protein are dena- tured (Fig. 3B).

Binding of Puf60 UHM to Tan- Downloaded from dem ULMs—The dimerization pro- pensity of Puf60-UHM opens the possibility that simultaneous bind- ing of two UHM domains to two

adjacent ULMs on the same peptide http://www.jbc.org/ chain (tandem ULM motif) could cooperatively induce the dimeriza- tion also in the absence of SDS. Based on the distance of the ULM- binding sites of the Puf60 homo- at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 dimer in the crystal structure, we estimated that the ULMs should be separated by a minimum of 15–20 residues in an extended conforma- tion. We identified evolutionarily conserved tandem ULMs in intrin- sically disordered regions of several proteins with the program SIRW (51). Of these, tandem ULMs in SF3b155 (194–229, 210–251 and 229–269) and in the nuclear RNA helicase Prp16 (201–238) (Fig. 4, A FIGURE 2. Crystal structure of Puf60-UHM. A, the asymmetric unit consists of eight Trx-Puf60 fusion proteins arranged in a doughnut shape. The Trx domains are shown in green, and the Puf60-UHM domains are shown in and B) were tested experimentally different colors. Trx and Puf60 domains with the same number constitute the same peptide chain. B, eight for binding to Puf60-UHM. Puf60-UHM domains in the asymmetric unit of the crystal structure are superimposed and shown in a cartoon Using Western blot detected GST representation. The putative ULM-binding site (based on known UHMULM complex structures) is highlighted by a red sphere. C, putative Puf60-UHM homodimer. Left panel, cartoon representation. The side chains of pulldown experiments, we found residues discussed in the text are shown and labeled, and salt bridges are indicated by dashed lines. Right panel, that the tandem ULM sequence of surface electrostatics representation of the dimer shown on the left, in the same orientation. D, mutational analysis of the dimerization interface. Denaturing, Coomassie-stained SDS-PAGE of bacterially expressed and SF3b155 (194–229) binds Puf60- purified Puf60-UHM mutants as denoted on top of the lanes. E, dimerization of Puf60-UHM in SDS-PAGE. UHM (supplemental Fig. S4A). Purified, ZZ-tagged UHM was analyzed on Coomassie-stained SDS-PAGE in the absence and presence of However, ITC (supplemental Fig. increasing amounts of untagged UHM (wild type (wt) and E505A/E506A/E507A mutant), as indicated above the lanes. theor.MW, theoretical molecular mass. S4, C and D) and NMR (supplemen- tal Fig. S5A) data show that the dimerization of Puf60, we monitored the NMR signals of 15N- binding of Puf60-UHM to SF3b155 (194–229) is mediated by labeled, wild type and mutant (E505A/E506A/E507A) Puf60- the ULM around Trp200 and that the ULM at Trp218 does not UHM in a series of 1H-15N correlation spectra upon titration of contribute to the binding cooperatively. NMR titrations reveal increasing amounts of SDS (Fig. 3 and supplemental Figs. S2 a weak interaction of Puf60-UHM with Prp16 (201–238), which and S3). Wild type and mutant UHM bind SDS in a 1:1 ratio and was not detected in the GST pulldown experiments. However, Ϯ Ϯ ␮ 215 230 with a Kd of 24 4 and 45 6 M, respectively (supplemental the two ULMs (Trp and Trp ) in Prp16 do not mutually Fig. S2A). Surprisingly, the binding site overlaps with the puta- enhance each other’s binding cooperatively (supplemental Fig. tive ULM-binding site of Puf60-UHM (supplemental Fig. S2B). S5B). Thus, dimerization of Puf60-UHM is not induced upon 15 The N T1/T2 ratio of SDS-bound Puf60-UHM (0.07% (w/v) binding to these tandem ULMs in the absence of SDS. ULM SDS/2.4 mM/8-fold molar excess) indicates a monomeric pro- binding in the presence of SDS (350 ␮M to 1.4 mM) was not ϭ ␶ ϭ tein (T1/T2 4.6, c 7.2 ns; supplemental Fig. S2C). Thus, a observed in GST pulldown experiments (data not shown). This simple equimolar binding of SDS does not induce dimerization is consistent with the observation that the SDS interaction

634 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 284•NUMBER 1•JANUARY 2, 2009 Structure and Function of Puf60 UHM

but reproducible (supplemental Fig. S4B) and has been described black: 0% SDS black: 0% SDS previously (18). red: 0.02% SDS red: 0.02% SDS ULM-binding Site Mapping on G486 G486 Puf60-UHM—We used NMR titra- tions to map the ULM-binding site onto the structure of Puf60-UHM. Unlabeled peptides comprising the L532 L532 G534 G534 ULMs in SF1(1–25), SF3b155(194– N468 K489 N468 K489 229), SF3b155(317–357), and 65 R535 A531 R535 A531 U2AF (85–112) were titrated to W536 W536 15N-labeled Puf60-UHM, and R467 R467 chemical shift perturbations were

1 15 Downloaded from F537 F537 monitored in H- N correlation

I497 V543 I497 V543 spectra (Fig. 5A). Mapping the chemical shift perturbations onto V542 V542 the crystal structure of Puf60-UHM

10 1 H 7 10 1 H 7 shows that for all four ULMs, the

strongest chemical shift perturba- http://www.jbc.org/ tions cluster around the ␤1-␣A ␣ n SDS loop, helix A and around the RWF motif (part of helix ␣B and strands wild type : UHM + SDS UHM*SDS unfolded UHM ␤3Ј and ␤4) of Puf60-UHM (Fig. 5B). The ULM interaction interface at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 n SDS of Puf60-UHM is thus analogous to the interfaces of the U2AF35- (UHM*SDS)2*SDSn UHMU2AF65-ULM (20), the n SDS U2AF65-UHMSF1-ULM (15), and the SPF45SF3b155(330–342) (23) : UHM + SDS UHM*SDS unfolded UHM complexes, indicating a similar mode of molecular recognition. FIGURE 3. SDS binding of Puf60-UHM. A, overlays of HSQC spectra of 15N-labeled Puf60-UHM wild type (left) and triple mutant E505A/E506A/E507A (right) at SDS concentrations between 0% (black) and 0. 02% Structural Basis for ULM (3-fold molar excess, red). B, proposed model for the interaction between Puf60-UHM and SDS. In both Specificity—To further character- wild type and mutant UHM, a single SDS molecule can bind to the hydrophobic ULM-binding pocket (supplemental Fig. 2B). SDS-bound wild type Puf60-UHM, but not E505A/E506A/E507A, can dimerize at ize the binding specificity of intermediate SDS concentrations. High SDS concentrations lead to unfolding of wild type and mutant Puf60-UHM for distinct ULMs we UHM domain (supplemental Fig. 3). compared its structure with the structures of the UHMULM com- maps to the canonical ULM-binding site of Puf60-UHM (sup- plexes of U2AF65SF1 (Protein Data Bank code 1O0P) and plemental Fig. S2B) and thus that SDS and ULM binding are SPF45SF3b155 (Protein Data Bank code 2PEH). As shown in competitive. Fig. 5C, the ULM-binding region of Puf60-UHM, defined by Distinct ULM Binding Properties of Puf60 and U2AF65—GST the NMR titrations (Fig. 5B), is structurally more similar to pulldown experiments show that Puf60-UHM binds to ULMs SPF45 than to U2AF65. Helix ␣A in U2AF65-UHM is N-ter- in SF1, U2AF65, and SF3b155 (supplemental Fig. S4A). We minally extended by four additional residues compared with quantified the affinities of Puf60-UHM for these ULMs by ITC the ␣A helices in Puf60-UHM and SPF45-UHM. As a conse- (Fig. 4C, supplemental Fig. 4C, and Table 2). Whereas Puf60- quence, the conformation of the ␤1-␣A loops in U2AF65- ϭ Ϯ UHM binds to SF1 with a dissociation constant Kd 20.8 5.2 UHM differs considerably from Puf60-UHM or SPF45- ␮ 65 ϭ M, its binding to U2AF -ULM is almost 8-fold stronger (Kd UHM. It is likely that these differences, in combination with 200 2.7 Ϯ 0.2 ␮M). The binding to the ULM comprising Trp in amino acid variations in the ULM sequences (Fig. 4A), deter- SF3b155(194–229) is ϳ5-fold stronger than to the ULM mine the specificity of the UHMULM complexes. For exam- 338 ϭ Ϯ Ϯ around Trp in SF3b155(317–357) (Kd 1.2 0.1 and 5.6 ple, SF1 has a longer stretch of positively charged residues 0.6 ␮M, respectively). preceding the ULM-tryptophan than the SF3b155 ULMs. In The ULM binding preferences of Puf60-UHM are distinct the U2AF65SF1 structure, this region contacts the highly from those of U2AF65-UHM (supplemental Fig. S4B and Table negatively charged helix ␣A of U2AF65-UHM (10 Glu/Asp 2). As reported previously, U2AF65-UHM preferentially binds residues). Because the length helix ␣A of Puf60 is shorter and the ULMs in SF1 (8, 15) and SF3b155(317–357) (13, 18, 24). Of because it is less negatively charged (5 Glu/Asp residues), the ULMs in SF3b155, U2AF65-UHM binds to the ULM ionic interactions involving these residues should contribute 338 ϭ ␮ around Trp (Kd 6 M (18)) with higher affinity than to less to the ULM binding by Puf60-UHM. A second specific- 200 ϭ ␮ the one at Trp (Kd 16 M (18)). This preference is weak ity-mediating region in the SF1 and SF3b155 ULMs involves

JANUARY 2, 2009•VOLUME 284•NUMBER 1 JOURNAL OF BIOLOGICAL CHEMISTRY 635 Structure and Function of Puf60 UHM

U2AF65 W92 LIRSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQA SF1 W22 NATPLDFPSKKRKRSRWNQDTMEQKTVIPGMPTVIPP SF3b155 W200 VNGAAASQPPSKRKRRWDQTADQTPGATPKKLSSWDQ W218 QTADQTPGATPKKLSSWDQAETPGHTPSLRWDETPGR W232 SSWDQAETPGHTPSLRWDETPGRAKGSETPGATPGSK W254 RAKGSETPGATPGSKIWDPTPSHTPAGAATPGRGDTP W293 ATPGHGGATSSARKNRWDETPKTERDTPGHGSGWAET W310 DETPKTERDTPGHGSGWAETPRTDRGGDSIGETPTPG W338 SIGETPTPGASKRKSRWDETPASQMGGSTPVLTPGKT Prp16 W215 SPRHRPKDAATPSRSTWEEEDSGYGSSRRSQWESPSP W230 TWEEEDSGYGSSRRSQWESPSPTPSYRDSERSHRLST B 229 269 210 251 181 194 LKVVNGAAASQPPSKRKRRWDQTADQTPGATPKKLSSWDQAETPGHTPSLR229 WDETPGRAKGSETPGATPGSKIWDPTPSHTPAGAATPGR W200 W218 W232 W254 Downloaded from 284 307 357 360 GDTPGHATPGHGGATSSARKNRWDETPKTERDTPGHGSGWAETPRTDRGGDSIGETPTPGASKRKSR317 WDETPASQMGGSTPVLTPGKTPI W293 W310 W338

C Time (min) Time (min) Time (min) Time (min) 0 20 40 60 80 100 120 140 160 0 20 40 60 80 100 120 140 160 -10 0 10 20 30 40 50 60 70 80 90 100 0 20 40 60 80 100 120 140 160 http://www.jbc.org/ 0.0 0.0 0.0 0.0

-0.1 -0.5 -0.5

-0.2 -1.0 -0.2 µcal/sec

-1.0 -0.3 -1.5 0 -0.4 at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 0 0 -2 -2 -2 -4 -4 -2 j

-6 -6 -4 -8 -8 -4 -10 -6 -10

kcal/mole of injectant -12 -12 -8 0.0 0.5 1.0 1.5 2.0 2.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0.0 0.5 1.0 1.5 2.0 0.0 0.5 1.0 1.5 2.0 Molar Ratio Molar Ratio Molar Ratio Molar Ratio

FIGURE 4. Binding of Puf60-UHM to various ULMs. A, sequence alignment of ULMs. The conserved tryptophans are aligned and colored in yellow, basic residues preceding the tryptophan are highlighted in blue, and [N,Q,D,E]-type residues directly succeeding the tryptophan are in orange, and potentially phosphorylated Thr-Pro or Ser-Pro repeats are in purple. SF1-Ser20 (inhibits binding to U2AF65 when phosphorylated (58)) and serines in analogous positions in other ULMs are colored in green. B, partial sequence of the N-terminal domain of SF3b155. The peptides used in GST pulldown experiments are highlighted. Same color code as in A. C, isothermal titration calorimetry of Puf60-UHM with ULM peptides of SF1, SF3b155, and U2AF65. The dissociation constants (Kd) are indicated. For SF3b155(194–229) the W218A mutant peptide was used, in which a minor binding site is removed (see supplemental Fig. 4, C and D).

TABLE 2 ULM interactions of Puf60-UHM and U2AF65-UHM Shown are the affinities of Puf60 for various ULMs, measured by ITC. ϩ, interaction detected qualitatively, not quantified (see supplemental Fig. S4B, lane 4). SF1(1–25) SF3b155(194–229) SF3b155(317–357) U2AF65(85–112)

Puf60-UHM 20.8 Ϯ 5.2 ␮M 1.0 Ϯ 0.2 ␮M 5.6 Ϯ 0.6 ␮M 2.7 Ϯ 0.2 ␮M a b c c U2AF65 50 nM ;23nM 16 ␮M 6.0 ␮M ϩ a Filter binding (15). b ITC (18). c Trp fluorescence (18). the residue C-terminal of the tryptophan. The aspartate which may contribute to the weaker interaction of Puf60- flanking Trp338 in SF3b155 forms a salt bridge with the argi- UHM with SF1-ULM. nine in the SPF45-UHM RYF motif (23). Because of its struc- tural similarities to the UHM of Puf60, we speculate that DISCUSSION analogous interactions might stabilize the complexes of Puf60 was repeatedly found to interact with itself in yeast Puf60-UHM with the SF3b155-ULMs around tryptophans two-hybrid assays (26, 27), and the Puf60 UHM domain was 200 and 338. Because SF1 has an asparagine instead of aspar- reported to be necessary and sufficient for the dimerization of tate at this position, a similar salt bridge cannot be formed, Puf60 in SDS-PAGE (21). Our analytical ultracentrifugation

636 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 284•NUMBER 1•JANUARY 2, 2009 Structure and Function of Puf60 UHM Downloaded from http://www.jbc.org/ at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016

FIGURE 5. NMR chemical shift perturbation of Puf60-UHM/ULM interactions. A, 1H,15N HSQC spectra of free 15N-labeled Puf60-UHM (black) and upon addition of ULM peptides derived from SF1, SF3b155, and U2AF65 (red). B, ribbon representation of the Puf60-UHM structure colored according to the extent of chemical shift perturbation induced by addition of the peptide ligands indicated above (white, no perturbation; red, strong perturbation). C, superposition of the Puf60-UHM structure (blue) onto the structures of SPF45-UHM/SF3b-ULM (left panel, gray; Protein Data Bank code 2PEH) and U2AF65-UHM/SF1-ULM (right panel, gray; Protein Data Bank code 1O0P). and NMR data show that the UHM domain is mainly mono- The flexibility of the ␤2-␤3 loop in solution (indicated by the meric in physiological buffer, whereas SDS is required for NMR relaxation data) and the variability of the electrostatic dimerization. A crystal structure and mutational analysis reveal contacts seen in the crystal structure suggest that the dimer a dimer interface, which is stabilized by electrostatic interac- interface is dynamic. The electrostatic nature of the dimer tions and involves the acidic ␤2-␤3 loop of one subunit and interface presumably contributes to the stability of the basic residues (Lys467, Arg540, and Lys541)inthe␤-sheet surface Puf60-UHM dimer in SDS-PAGE (21) (Fig. 2, D and E). of the other subunit of the dimer. The acidic ␤2-␤3 loop is Because the dimerization interface is stabilized by electro- conserved in all higher eukaryotic orthologs of Puf60 but is static contacts, the SDS alkyl chains might not be able to distinct in other UHM or RRM domains (20). This suggests that energetically favor the solvation of the UHM monomer. Puf60 orthologs may have a similar dimerization mode, which We found that a longer construct, comprising the two central is unique for Puf60 and not found in other UHMs. RRM domains and the UHM, is also largely monomeric in the

JANUARY 2, 2009•VOLUME 284•NUMBER 1 JOURNAL OF BIOLOGICAL CHEMISTRY 637 Structure and Function of Puf60 UHM

ous and potentially cooperative recruitment of SF3b155 to the N N Ј RRM 3 splice site. Recently, it was reported that the UHM domain of the kinase RRM RRM KIS strongly binds to SF1 (similar to U2AF65-UHM) (22). It also RRM UHM binds to ULMs in the N terminus of SF3b155 and prefers Trp200 W W W W HEAT repeats 338 UHM W over Trp (similar to Puf60-UHM) (57). The distinct binding W W338 65 W preferences of the Puf60, KIS, and U2AF UHMs suggest that W200 SF3b155 UHM-ULM interactions have evolved to achieve some binding N selectivity. Thus, a given ULM cannot be classified as strong or FIGURE 6. Puf60-UHM preferentially binds to the N terminus of SF3b155 at the ULM around Trp200, whereas U2AF65-UHM shows preferential weak but might bind with differential affinity to each UHM. binding to the ULM at Trp338 (18). The domain structure of the proteins is Our data provide molecular insights into the intricate network shown. of UHM-ULM interactions. Structure-based analysis allows the design of mutations in ULM and/or UHM sequences for mod-

ulating this network and studying its role in the regulation of Downloaded from absence of SDS. Crichlow et al. (48) report a weak tendency of splicing in vivo. the two central RRM domains to dimerize in the presence of single-stranded DNA. Therefore, it is likely that the oligomer- ization of Puf60 observed in yeast two-hybrid experiments Acknowledgments—We thank the staff at beam line PX01 of the Swiss Light Source (Villigen, Switzerland) for technical assistance during involves the UHM and the two central RRM domains of Puf60 data collection; Christoph Mu¨ller and Stephen Cusack for sharing http://www.jbc.org/ and additionally requires binding of a ligand. Potentially, beam-time; and Juan Valca´rcel and Rebecca Wade for discussions ligand-induced dimerization of Puf60-UHM could involve and critical reading of the manuscript. binding of tandem ULMs. However, our experiments with the tandem ULM motifs in SF3b155 and Prp16 did not provide evidence for such a mechanism. REFERENCES at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 Detergent-induced oligomerization has been reported for 1. Brow, D. A. (2002) Annu. Rev. Genet. 36, 333–360 several membrane-associated proteins (52–56). No experimen- 2. Berglund, J. A., Chua, K., Abovich, N., Reed, R., and Rosbash, M. (1997) tal evidence for a functional role of the SDS-induced dimeriza- Cell 89, 781–787 tion of Puf60-UHM is known. However, it is possible that deter- 3. Liu, Z., Luyten, I., Bottomley, M. J., Messias, A. C., Houngninou-Molango, gent-induced (or lipid-induced) dimerization might play a role S., Sprangers, R., Zanier, K., Kramer, A., and Sattler, M. (2001) Sciences (N. Y.) 294, 1098–1102 for the molecular functions of Puf60. Alternatively, SDS may 4. Singh, R., Valcarcel, J., and Green, M. R. (1995) Sciences (N. Y.) 268, resemble a putative, as yet unknown ligand of Puf60. 1173–1176 Puf60 was reported to interact directly with U2AF65 (26, 27). 5. Wu, S., Romfo, C. M., Nilsen, T. W., and Green, M. R. (1999) Nature 402, Our data provide a rationale for how the two proteins interact 832–835 and suggest that a minimal binding interface involves the ULM 6. Merendino, L., Guth, S., Bilbao, D., Martinez, C., and Valcarcel, J. (1999) 65 Nature 402, 838–841 sequence of U2AF and the UHM domain of Puf60. Note that 7. Zorio, D. A., and Blumenthal, T. (1999) Nature 402, 835–838 65 binding of Puf60 to the U2AF ULM can only occur if this 8. Berglund, J. A., Abovich, N., and Rosbash, M. (1998) Genes Dev. 12, ULM is not already bound by U2AF35-UHM, which has a sig- 858–867 nificantly higher affinity. Thus, for this interaction to occur in 9. Query, C. C., Moore, M. J., and Sharp, P. A. (1994) Genes Dev. 8, 587–597 vivo, there should be a population of U2AF65 molecules that is 10. Query, C. C., McCaw, P. S., and Sharp, P. A. (1997) Mol. Cell. Biol. 17, 35 2944–2953 not bound to U2AF in the nucleus. 11. Will, C. L., Schneider, C., MacMillan, A. M., Katopodis, N. F., Neubauer, It was shown in pulldown experiments from nuclear extract G., Wilm, M., Luhrmann, R., and Query, C. C. (2001) EMBO J. 20, that Puf60 associates with SF3b155 (28). We suggest that this 4536–4546 interaction likely involves direct binding of Puf60-UHM to 12. Schellenberg, M. J., Edwards, R. A., Ritchie, D. B., Kent, O. A., Golas, ULM sequences in the N terminus of SF3b155. Interestingly, M. M., Stark, H., Luhrmann, R., Glover, J. N., and MacMillan, A. M. (2006) 103, the Puf60-UHM and U2AF65-UHM have distinct binding affin- Proc. Natl. Acad. Sci. U. S. A. 1266–1271 13. Gozani, O., Potashkin, J., and Reed, R. (1998) Mol. Cell. Biol. 18, ities for ULMs. Puf60-UHM binds only weakly to SF1-ULM, 4752–4760 65 whereas this ULM strongly interacts with U2AF -UHM. Fur- 14. Kielkopf, C. L., Rodionova, N. A., Green, M. R., and Burley, S. K. (2001) thermore, Puf60-UHM has a stronger affinity to SF3b155(194– Cell 106, 595–605 229) than to SF3b155(317–357), whereas the opposite is found 15. Selenko, P., Gregorovic, G., Sprangers, R., Stier, G., Rhani, Z., Kra¨mer, A., for U2AF65-UHM (Table 2). The affinity differences of these and Sattler, M. (2003) Mol. Cell 11, 965–976 16. Rain, J. C., Rafi, Z., Rhani, Z., Legrain, P., and Kra¨mer, A. (1998) RNA two UHM domains are rationalized by comparing structural (N. Y.) 4, 551–565 models of these interactions. As shown in Fig. 5C, the ULM- 17. Rudner, D. Z., Kanaar, R., Breger, K. S., and Rio, D. C. (1998) Mol. Cell. Biol. binding region of the two UHMs is significantly different, 18, 1765–1773 which may be linked to the distinct binding preferences. 18. Thickman, K. R., Swenson, M. C., Kabogo, J. M., Gryczynski, Z., and Our biochemical data imply that Puf60 and U2AF65 can bind Kielkopf, C. L. (2006) J. Mol. Biol. 356, 664–683 19. Cass, D. M., and Berglund, J. A. (2006) Biochemistry 45, 10092–10101 to the N terminus of SF3b155 simultaneously and noncompeti- 20. Kielkopf, C. L., Lucke, S., and Green, M. R. (2004) Genes Dev. 18, tively (Fig. 6). The mutual enhancement of splicing activation 1513–1526 by these two splicing factors (28) could thus involve simultane- 21. Page-McCaw, P. S., Amonlirdviman, K., and Sharp, P. A. (1999) RNA

638 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 284•NUMBER 1•JANUARY 2, 2009 Structure and Function of Puf60 UHM

(N. Y.) 5, 1548–1560 roni, L. C., and Read., R. J. (2007) J. Appl. Crystallogr. 40, 658–674 22. Manceau, V., Swenson, M., Le Caer, J. P., Sobel, A., Kielkopf, C. L., and 39. Sali, A., and Blundell, T. L. (1993) J. Mol. Biol. 234, 779–815 Maucuer, A. (2006) FEBS J. 273, 577–587 40. Emsley, P., and Cowtan, K. (2004) Acta Crystallogr. 60, 2126–2132 23. Corsini, L., Bonnal, S., Basquin, J., Hothorn, M., Scheffzek, K., Valcarcel, J., 41. Murshudov, G. N., Vagin, A. A., and Dodson, E. J. (1997) Acta Crystallogr. and Sattler, M. (2007) Nat. Struct. Mol. Biol. 14, 620–629 53, 240–255 24. Spadaccini, R., Reidt, U., Dybkov, O., Will, C., Frank, R., Stier, G., Corsini, 42. Adams, P. D., Grosse-Kunstleve, R. W., Hung, L. W., Ioerger, T. R., Mc- L., Wahl, M. C., Luhrmann, R., and Sattler, M. (2006) RNA (N. Y.) 12, Coy, A. J., Moriarty, N. W., Read, R. J., Sacchettini, J. C., Sauter, N. K., and 410–425 Terwilliger, T. C. (2002) Acta Crystallogr. 58, 1948–1954 25. Zhang, M., Zamore, P. D., Carmo-Fonseca, M., Lamond, A. I., and Green, 43. Maris, C., Dominguez, C., and Allain, F. H. (2005) FEBS J. 272, 2118–2131 M. R. (1992) Proc. Natl. Acad. Sci. U. S. A. 89, 8769–8773 44. Kay, L. E., Torchia, D. A., and Bax, A. (1989) Biochemistry 28, 8972–8979 26. Poleev, A., Hartmann, A., and Stamm, S. (2000) Eur. J. Biochem. 267, 45. Tjandra, N., Wingfield, P., Stahl, S., and Bax, A. (1996) J. Biomol. NMR 8, 4002–4010 273–284 27. Rual, J. F., Venkatesan, K., Hao, T., Hirozane-Kishikawa, T., Dricot, A., Li, 46. Fushman, D., Weisemann, R., Thu¨ring, H., and Ru¨terjans, H. (1994) J. Bi- N., Berriz, G. F., Gibbons, F. D., Dreze, M., Ayivi-Guedehoussou, N., Klit- omol. NMR 4, 61–78 gord, N., Simon, C., Boxem, M., Milstein, S., Rosenberg, J., Goldberg, D. S., 47. Daragan, V., and Mayo, K. H. (1997) Prog. Nuclear Magn. Reson. Spectros-

Zhang, L. V., Wong, S. L., Franklin, G., Li, S., Albala, J. S., Lim, J., Fraugh- Downloaded from copy 31, 63–105 ton, C., Llamosas, E., Cevik, S., Bex, C., Lamesch, P., Sikorski, R. S., Van- 48. Crichlow, G. V., Zhou, H., Hsiao, H. H., Frederick, K. B., Debrosse, M., denhaute, J., Zoghbi, H. Y., Smolyar, A., Bosak, S., Sequerra, R., Doucette- Yang, Y., Folta-Stogniew, E. J., Chung, H. J., Fan, C., De la Cruz, E. M., Stamm, L., Cusick, M. E., Hill, D. E., Roth, F. P., and Vidal, M. (2005) Levens, D., Lolis, E., and Braddock, D. (2008) EMBO J. 27, 277–289 Nature 437, 1173–1178 49. Corsini, L., Hothorn, M., Scheffzek, K., Sattler, M., and Stier, G. (2008) 28. Hastings, M. L., Allemand, E., Duelli, D. M., Myers, M. P., and Krainer, Protein Sci., in press

A. R. (2007) PLoS ONE 2, e538 http://www.jbc.org/ 50. Krissinel, E., and Henrick, K. (2007) J. Mol. Biol. 372, 774–797 29. Liu, J., He, L., Collins, I., Ge, H., Libutti, D., Li, J., Egly, J. M., and Levens, D. Nucleic Acids Res. 31, (2000) Mol. Cell 5, 331–341 51. Ramu, C. (2003) 3771–3774 30. Quinn, L. M., Dickins, R. A., Coombe, M., Hime, G. R., Bowtell, D. D., and 52. Chung, S. H., Weiss, R. S., Frese, K. K., Prasad, B. V., and Javier, R. T. (2008) Richardson, H. (2004) Development (Camb.) 131, 1411–1423 Oncogene 27, 1412–1420 31. Van Buskirk, C., and Schupbach, T. (2002) Dev. Cell 2, 343–353 53. Forouhar, F., Huang, W. N., Liu, J. H., Chien, K. Y., Wu, W. G., and Hsiao, 32. Delaglio, F., Grzesiek, S., Vuister, G. W., Zhu, G., Pfeifer, J., and Bax, A. C. D. (2003) J. Biol. Chem. 278, 21980–21988 (1995) J. Biomol. NMR 6, 277–293 54. Bhakdi, S., Fussle, R., and Tranum-Jensen, J. (1981) Proc. Natl. Acad. Sci. at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 33. Johnson, B. A., and Blevins, R. A. (1994) J. Biomol. NMR 4, 603–614 U. S. A. 78, 5475–5479 34. Sattler, M., Schleucher, J., and Griesinger, C. (1999) Prog. NMR Spectrosc. 55. Forti, S., and Menestrina, G. (1989) Eur. J. Biochem. 181, 767–773 34, 93–158 56. Ramachandran, R., Tweten, R. K., and Johnson, A. E. (2004) Nat. Struct. 35. Farrow, N. A., Muhandiram, R., Singer, A. U., Pascal, S. M., Kay, C. M., Mol. Biol. 11, 697–705 Gish, G., Shoelson, S. E., Pawson, T., Forman-Kay, J. D., and Kay, L. E. 57. Manceau, V., Kielkopf, C. L., Sobel, A., and Maucuer, A. (2008) J. Mol. Biol. (1994) Biochemistry 33, 5984–6003 381, 748–762 36. Amezcua, C. A., Harper, S. M., Rutter, J., and Gardner, K. H. (2002) Struc- 58. Wang, X., Bruderer, S., Rafi, Z., Xue, J., Milburn, P. J., Kramer, A., and ture 10, 1349–1361 Robinson, P. J. (1999) EMBO J. 18, 4549–4559 37. Kabsch, W. (1993) J. Appl. Crystallogr. 26, 795–800 59. Laskowski, R. A., MacArthur, M. W., Moss, D. S., and Thornton, J. M. 38. McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Sto- (1993) J. Appl. Crystallogr. 26, 283–291

JANUARY 2, 2009•VOLUME 284•NUMBER 1 JOURNAL OF BIOLOGICAL CHEMISTRY 639 Supplemental Data Dimerization and protein binding specificity of the U2AF Homology Motif (UHM) of the splicing factor Puf60 Lorenzo Corsini1, Michael Hothorn1,4, Gunter Stier1, Vladimir Rybin1, Klaus Scheffzek1 Toby J. Gibson1 and Michael Sattler1,2,3,5

chemical shift change at the ligand concentration

Supplementary Fig. 1 - Structural integrity of L and the total protein concentration P, and Δδmax Puf60-UHM (EEE505-507AAA). is the chemical shift change at saturation. Supplementary Fig. 2 - SDS binding by Puf60- Averages and standard deviations of three fitted UHM. Kd values for each ligand are given in the text and Supplementary Fig. 3 - SDS induces the in figures. dimerization of wild type Puf60-UHM. Supplementary Fig. 4 - Binding of the UHMs of Supplementary Data Puf60 and U2AF65 to various ULMs. SDS induces the dimerization of wild type Supplementary Fig. 5 - Non-cooperative binding Puf60 UHM. To analyze the SDS-induced of Puf60-UHM to tandem ULMs. dimerization of Puf60 in further detail, we titrated Supplementary Fig. 6 - Binding of the UHM SDS to 15N-labelled, wild type and mutant domains of U2AF35 and SPF45 to various (EEE505-507AAA) Puf60-UHM and monitored ULMs. the NMR signals in a series of 1H,15N correlation spectra (Fig. 3A). Surprisingly, wild type and Supplementary Methods mutant UHM bind SDS in a 1:1 ratio and with a NMR titrations. Peptides for NMR titrations Kd of 24 ± 4 μM and 45 ± 6 μM, respectively were cleaved from the His6-GST-tags with TEV (Supplementary Fig. 2A). Although EEE505- protease and the tags were removed with a second 507AAA binds SDS weaker than wild type, the Ni-NTA purification step. The peptides were binding sites coincide, as the same peaks shift in further purified with a cation exchange SP- both cases (Fig. 3A). Mapping the chemical shift Sepharose column (HiTrapTM), followed by perturbations onto the structural model of Puf60- desalting and concentration with reversed-phase UHM shows that helices αA and αB, as well as disposable columns (Supelco Discovery C18). the β3’-strand and the β3’-β4 loop are mostly After elution with 80/19.9/0.1 (v/v/v) affected (Supplementary Fig. 2B, left panel). acetonitrile/H2O/trifluoro acetic acid the solution Comparison with the structure of SPF45-UHM was dried in a speed-vac before resuspension in and SF3b155 (330-342) (21) (Supplementary Fig. PBS buffer. 2B, right panel) shows that SDS binds into the NMR titrations were performed by adding ULM binding pocket of Puf60-UHM. 15 increasing amounts of ULM peptides from 7-15 The N T1 / T2 ratio of SDS-bound Puf60- 15 mM stock solutions in PBS to N-labelled Puf60- UHM (0.07% SDS / 2.4 mM / 8-fold molar 1 15 UHM (310 μM) and monitored using H, N excess) shows that it tumbles with a slightly HSQC experiments. shorter correlation time than Puf60-UHM in the

For each titration, if at least 3 peaks were absence of SDS (T1 / T2 = 4.6, τc = 7.2 ns; found to shift clearly in the fast exchange regime, Supplementary Fig. 2C). Thus, a simple equimolar a non-linear regression fit against the following binding of SDS does not induce dimerization of equation was used to obtain the corresponding Kd. Puf60-UHM. However, at an SDS-concentration 2 Δδ = Δδmax x { (L+P+Kd) – [(L+P+Kd) – (4 x L x of 850 μM (0.02% SDS, 2.9-fold molar excess) a 0.5 P)] } / (2 x P), where second set of NMR signals emerges (black arrows Δδ=sqrt{Δδ(1H)2+(0.2*Δδ(15N))2} is the measured in Supplementary Fig. 3A, upper panels; see

1 Supplementary Fig. 3C for the full spectra). The intensity of these peaks is maximal at an SDS- Binding to tandem ULMs. To test whether the concentration of 2.9 mM (0.1%, 10-fold molar binding of Puf60-UHM to SF3b155 (194-229) is excess) and vanishes at SDS-concentrations above cooperative, we separately mutated the two 16 mM (0.5%, 55-fold molar excess). These tryptophans in the tandem ULM peptide (W200 transiently appearing NMR signals have average and W218) to alanine and compared the binding 15 N relaxation times of T1/T2 999 ms/40 ms, affinities of the two mutant peptides to the wild corresponding to the relaxation rates expected for type. If the binding was cooperative, the Gibbs a UHM-dimer (218 residues, 297 K, 50.68 MHz: free energies of the single binding events would T1/T2 ~859 ms/48 ms). Thus, the dimerization of add and the Kd of the wild type should be PUF60-UHM is induced at SDS concentrations significantly lower than for the two (single) between 850 μM and 16 mM. tryptophan mutants. ITC indicates that Puf60- A third set of signals emerges at an SDS UHM binds wild type SF3b155(194-229) at two concentration of 0.02% and shows increasing sites with highly different affinities (1.0 +/- 0.2 intensity up to an SDS concentration of 1.0% (red μM and 178 +/- 107 μM, Supplementary Fig. 4C). arrows in Supplementary Fig. 3B). These signals The mutant peptide W218A has a single binding have average T1/T2 of 790 ms/291 ms (data not site with a Kd of 1.2 +/- 0.1 μM (Supplementary shown), indicative of fast tumbling typical of an Fig. 4D), which indicates that the ULM around unfolded peptide chain. Apparently, in the W200, which has more basic residues than the presence of SDS, the UHM dimer coexists with ULM around W218 (Fig. 4A), mediates the the unfolded protein. binding of the wild type peptide SF3b155(194- In contrast, the mutant EEE505-507AAA, 229). which does not dimerize in SDS-PAGE, unfolds NMR titrations indicate that the dissociation upon addition of SDS, without forming a constants for the wild type and the W218A mutant intermediate dimeric species (Supplementary Fig. are similar within experimental error (0.89±0.41 3A, and B, lower panels). Whereas the wild type μM and 0.78±0.36 μM , respectively), whereas the

UHM retains a significant population of folded W200A mutant has a Kd of 83±40 μM dimeric species up to an SDS concentration of 3.6 (Supplementary Fig. 5A). As the ITC results mM (0.10%, 12.4-fold molar excess, predicted, this indicates that the ULM at W200 is Supplementary Fig. 3 A and B, upper panels), the the main interacting region, and that the second mutant is unstructured in SDS concentrations ULM around W218 does not contribute above 2.4 mM (0.07%, 8-fold molar excess). Thus, cooperatively to the binding. the mutant is less stable to SDS-denaturation than The two putative ULMs in Prp16 have low the wild type protein, indicating that the similarity to the ULM consensus when compared dimerization of the wild type protein might to the established ones (Fig. 4A). Consistently, no increase its stability against SDS denaturation. interaction of Puf60-UHM and Prp16(201-238) The melting point of the mutant Puf60-UHM was found in GST-pulldown experiments (61°C) is 5°C lower than the melting point of the (Supplementary Fig. 4A). Prp16(201-238) binds to wild type (data not shown). This indicates that, Puf60-UHM with a Kd of 207±108 μM in NMR even though the β2-β3 loop is highly flexible in titrations (Supplementary Fig. 5B). The Prp16 solution, it contributes to the stability of Puf60- W215A and W230A mutants have Kd values of UHM in the absence of SDS. 594±119 μM and 524±382 μM, respectively, Taken together, these data indicate that, at indicating that both tryptophans are involved in low concentrations, SDS forms a 1:1 complex and binding Puf60-UHM (Supplementary Fig. 5B). binds to the ULM binding site of monomeric Taking into account that the wild type sequence Puf60-UHM. Wild type Puf60-UHM, but not the contains two ULM sites per peptide, we multiplied mutant EEE505-507AAA, dimerizes at the measured peptide concentration by two to intermediate SDS concentrations, as shown estimate an average Kd per ULM. This yields Kd, schematically in Fig. 3B. At higher concentrations, average = 659±112 μM. Since the addition of the SDS denatures Puf60-UHM wild type as well as Gibbs free energies of the two binding sites the mutant protein.

2 interacting with one molecule would roughly the intrinsically unstructured N-terminus of Prp16 correspond to a multiplication of the Kd values, the (Supplementary Fig. 4A, lane 10 and 4B, lane 7). data do not support a cooperative binding of the However, this binding does not involve the tandem ULMs in Prp16 to a Puf60-UHM dimer. tandem-ULM containing region 201-238 We conclude that binding of the tandem (Supplementary Fig. 4A, lane 9 and 4B lane 6). ULM sequences in SF3b155 or Prp16 is not The sequence of Prp16 comprising residues 1-200 sufficient to induce the dimerization of Puf60- and 239-314 contains four tryptophan residues UHM in solution in the absence of SDS. with weak similarity to ULM sequences, and we suppose that one of these putative ULM sequences ULM binding specificity. GST-pulldown mediates binding to the two UHM domains. experiments show binding of Puf60-UHM to SF1 (1-25, Supplementary Fig. 4A, lane 4), To better compare our results with previous SF3b155(194-229) (lane 5), SF3b155 (317-357, literature, we characterized the interactions lane 8) and U2AF65 (85-112, lane 13). A summary between the ULMs of SF1, U2AF65 and SF3b155 of the interactions of Puf60-UHM to various and the UHMs of U2AF35, and SPF45. As shown ULMs is given in Supplementary Table 1. in Supplementary Fig. 6, U2AF35-UHM binds We also compared ULM binding of Puf60- strongly to U2AF65-ULM (compare lanes 3 to 1 UHM with the interaction of U2AF65-UHM to the and 2), but also reproducibly shows weak binding same ULMs. As reported previously, U2AF65- affinity to SF1 and SF3b155 in GST-pulldowns UHM preferentially binds the ULMs in SF1 (lanes 4-6). SPF45-UHM binds to U2AF65-ULM (Supplementary Fig. 4B, lane 5) (8,15), and (compare lane 10 to 8 and 9), SF1-ULM (lane 13), SF3b155(317-357) (lane 15) (13,22,55). U2AF65- and the ULM at SF3b155-W338 (lanes 11-12) as UHM also can mediate an “intramolecular” described by Corsini et al. (21). Supplementary interaction to the ULM in its own N-terminus (85- Table 1 summarizes all of the UHM/ULM 112) (lane 4). The preference of U2AF65-UHM for interactions that were probed in this article, the ULM around W338 in the SF3b155 N- involving the splicing factors Puf60, U2AF65, terminus over the one at W200 (Supplementary U2AF35, SPF45, SF1, and SF3b155. Fig. 4B, compare lanes 11 and 15) is weak but reproducible, and has been described previously (55). Our GST-pulldown experiments also show binding of both Puf60-UHM and U2AF65-UHM to

3

Supplementary Table 1

U2AF65 U2AF35 SPF45 Puf60- UHM UHM UHM UHM

GST-pulldown NMR-titration ITC (μM) U2AF65-ULM (85-112) ++ a ++ f + f + a binding, i.e. c 2.7±0.2 μM e a f f a c e SF1-ULM (1-25) ++ 0/+ + + Kd = 6.5±1.8μM 20.8±5.2 μM a a c,d e, g SF3b155 (194-229) - + Kd = 0.9±0.4 μM 1.0 ± 0.2 μM b d g W218A + Kd = 0.8±0.4 μM 1.2 ± 0.1 μM b d W200A -- Kd = 83±40 μM n.d. SF3b155 (210-251) - a - a comprises W218 and W232 SF3b155 (229-269) - a - a conprises W232 and W254 SF3b155 (284-307) - a conprises W293 SF3b155 (317-357) - a 0/+ f + f + a binding, i.e. c 5.6 ± 0.6 μM e conprises W338 SF3b155-1-424 + a a a d Prp16 (201-238) - - Kd = 256±110μM d W230A Kd = 378±166μM d W215A Kd = 688±314μM Prp16-1-314 + a + a

Supplementary Table 1: Network of UHM/ULM interactions as detected by GST-pulldown experiments, NMR titrations, and ITC. Legend: ++ strong binding; + binding; 0/+ weak binding; 0 no binding; no entry: not determined. Kd values were derived by HSQC-titrations if 3 or more peaks shifted in the fast exchange regime. Intermediate a exchange (i.e.): strong binding detected by NMR titration, but Kd values could not be fitted. Supplementary Fig. 4A or B. b data not shown. c Fig. 5A. d Supplementary Fig. 5. e Fig. 4C. f Supplementary Fig. 6. g Supplementary Fig. 4C/D.

4 Supplementary Figure 1 - Structural integrity of Puf60-UHM (EEE505-507AAA)

102 black: wt red: EEE505-507AAA 501 510 107 ...EKQGEEEDAE... G504 boxes: mutated residues

112

E505

N 117 Q503 sc. D508 15 n.a. E506 E510 122 n.a. A509 E507 I511 Q503

127 D476

K502 132 10.1 9.6 9.1 8.6 8.1 7.6 7.1 6.6 6.1 5.6 1H

Supplementary Fig. 1: Overlay of the HSQC spectra of wild type and EEE505-507AAA Puf60-UHM in the absence of SDS in black and red, respectively. Peaks in the wild type spectrum originating from EEE505-507 are boxed. Peaks originating from residues close to the mutation site are labeled.

5 Supplementary Fig. 2 - SDS binding by Puf60-UHM

A 0,25

EEE 0,2 wt 505-507 AAA R467 0,15 W536 V543 p.p.m. 0,1 regression R467 W536 0,05 V543

0 0,00 1,00 2,00 3,00 4,00 5,00 6,00 [SDS] / [Puf60-UHM] B αC αC αB αB

β4 αA β1 F537 β2β3 β4 β3 αA β1 W338 β3’ β2 β3’

W536 Y376 F377 chemical shift perturbation

E505 E507 SF3b155 330-342 E506 Puf60-UHM SPF45-UHM shifts upon SDS binding complexed to SF3b155

C 20 rigid dimer

15

2 free

/T + 0.07% SDS 1 10 T

5 monomer

0 460 470 480 490 500 510 520 530 540 550 559 residue #

Supplementary Fig. 2: (A) Chemical shift perturbation of residues as depicted on the right was plotted over the [SDS]/[Puf60-UHM] concentration ratio (diamonds, triangles and circles). The lines show non-linear regressions to estimate the Kd, as described in Materials and Methods. Wild type and EEE505-507AAA UHM bind SDS in a 1:1 ratio with Kd values of 24 +/- 4 µM and 45 +/- 6 µM, respectively. (B) Chemical shift perturbations upon SDS binding mapped onto the crystal structure of Puf60-UHM cluster in the region where ULM binding is expected. For comparison, the structure of SPF45-UHM 15 (green/blue) bound to a ULM in SF3b155(330-342, orange) is shown on the right. (C) N T1/T2 ratios of Puf60-UHM free and in presence of 0.07% SDS. The horizontal black lines indicate expected T1/T2 ratios for a spherical protein monomer of 109 residues and a 218 residue dimer at 297K and 50.68 MHz Larmor frequency. The T1/T2 ratio of SDS-bound Puf60-UHM is lower than what would be expected for a rigid, spherical 109 residue protein, probably β β because the His6-tag and the 2- 3 loop (~17 residues) are flexible.

6 Supplementary Fig. 3 - SDS induces the dimerization of wild type Puf60-UHM. SDS-PAGE A SDS added 0% 0.02% 0.07% 0.1% 0.47% 1.0% T 1 604 122 A509 T2 170 941 wt 518 39 L479 87 124 E524 510 104

T 7.4 7.0 1 539 122 A509 T EEE 2 134 505-507 598 L479 AAA 83 124 E524 490 82 B 7.4 7.0 7.4 7.0 7.4 7.0 7.4 7.0 7.4 7.0 T 109 1 T1 T T 2 686 2 644 wt 170 520 311 G481 103 111 685 701 87 193

T 8.6 8.2 109 1 T1 T T 2 688 2 EEE 688 120 505-507 314 547 AAA G481 86 111 608 185

8.6 8.2 8.6 8.2 8.6 8.2 8.6 8.2 8.6 8.2 C Puf60-UHM wild type (300µM) Puf60-UHM EEE505-507AAA (300µM) 102 black: 0.014%SDS black: 0.014%SDS

106 red: 0.07% SDS red: 0.07% SDS blue: 1.0% SDS blue: 0.5% SDS

110 G481 G481

114 N

15 118

E480 122 A509 A509

L479 E524 L479 126 E524

130

10.1 9.1 8.1 7.1 6.1 10.1 9.1 8.1 7.1 6.1 1H 1H

Supplementary Fig. 3: (A) Upper panel: Regions of HSQC spectra of wild type Puf60-UHM recorded at the SDS concentrations indicated on top of the graphs. At an SDS concentration of 0.07%, 15N T1 and T2 were measured, as indicated next to the NMR signals in red and in green, respectively. The black arrows mark signals arising from the SDS-induced Puf60-UHM dimer. Lower panel: a corresponding SDS titration experiment as shown in the upper panel, but with the Puf60-UHM EEE505-507AAA mutant. (B) Same in (A), however, in this spectral region, no transiently appearing signals are observed during the SDS titration. The 15N T1 and T2 relaxation times of peaks marked by red arrows are indicative of fast tumbling typical of an unfolded peptide chain. (C) Overlay of HSQC spectra of Puf60-UHM wild type or EEE505-507AAA at the SDS concentrations depicted in the legend. Red boxes indicate peaks that appear at 0.02% SDS, have maximal intensity at 0.1% SDS and vanish at concentrations above 0.5%. Blue circles indicate peaks that shift at SDS-concentrations between 0.07% and 1.0%. Peaks arising from residues discussed in the Text and shown in (A) or (B) are labeled.

7 Supplementary Fig. 4 - Binding of the UHMs of Puf60 and U2AF65 to various ULMs.

A B

U2AF65 -UHM Puf60 -UHM GST- GST-fusions GST-fusions fusions (85-112) 65 (85-112) 65 10% input GSH-resin GST (201-238) Prp16 (1-314) Prp16 SF1(1-25) 10% input GSH-resin GST SF3b155(194-229) SF3b155(210-251) SF3b155(229-269) SF3b155(284-307) SF3b155(317-357) SF3b155(1-424) kDa U2AF GSH beads only GST kDa 10% input UHM SF1 (1-25) SF3b155 (194-229) SF3b155 (210-251) SF3b155 (229-269) SF3b155 (317-357) (201-238) Prp16 (1-314) Prp16 GSH beads only GST U2AF 96- 66- 66-

45- Ponceau S 45- 32- protein staining 32- 22- 14- α-Puf60 WB 14- - U2AF65-UHM 1 2 3 4 5 6 7 8 9 10 11 12 13 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 C D Puf60-UHM Puf60-UHM + SF3b155 (194-229, wt) + SF3b155 (194-229, W218A)

Time (min) Time (min) 0 20 40 60 80 100 120 140 160 0 20 40 60 80 100 120 140 160

0.0 0.0

-0.5 -0.5 µcal/sec µcal/sec

-1.0 -1.0

0 0 -2 -2 -4 -4 -6 -6 -8 -8 -10 -10 kcal/mole of injectant of kcal/mole -12 injectant of kcal/mole -12

0.0 0.5 1.0 1.5 2.0 2.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Molar Ratio Molar Ratio

Kd1=1.0 +/- 0.2 µM Kd=1.2 +/- 0.1 µM

Kd2=178 +/- 102 µM

Supplementary Fig. 4: (A) GST-pulldown of Puf60-UHM with several GST-tagged ULMs. Detection of Puf60-UHM was achieved by western blotting, as the much higher concentrations of Puf60-UHM required for coomassie staining induce dimerization of Puf60-UHM in SDS-PAGE. (B) GST-pulldown of U2AF65-UHM with several GST-tagged ULMs. The band at 32 kDa in lane 1 originates from residual GST, which was not fully separated from U2AF65-UHM during purification. Detection was achieved by coomassie staining, as there is no commercially available antibody for the UHM domain of U2AF65. (C) ITC of Puf60-UHM and SF3b155 (194-229) wild type. The curve can not be fitted by a single binding-site-model, indicating that both ULM sequences in SF3b155 (194-229) participate in the binding. Fitted dissociation constants for both binding sites are indicated below the graph. (D) ITC of Puf60-UHM and SF3b155 (194-229) W218A. In contrast to the curve in (C), this curve is clearly fittable by a single binding-site model. Thus, mutation of the weaker ULM impairs Puf60-UHM binding to the second binding site on the SF3b155 (194-229) peptide, but does not significantly affect binding of the stronger ULM (around W200). This indicates that the binding of Puf60-UHM to SF3b155 (194-229) is non-cooperative. 8 Supplementary Fig. 5 - Non-cooperative binding of Puf60-UHM to tandem ULMs. A 103 Puf60-UHM + SF3b155 194-229 wt (5.3eq, black) V543 60 W200A (7.9eq, red) 50 W218A (4.8eq, green) +w t 40 +W200A +W218A 30 Hz 20

10

0 0.0 1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0 [ULM]/[UHM] ratio N/ppm 15

I497 I497 60

50 V543 +w t 40 +W200A +W218A 30 Hz

20

10

133 0 0.0 1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0 [ULM]/[UHM] ratio 10.5 1H/ppm 6.0 B

103 G539 Puf60-UHM + Prp16 201-238 G539

wt (2.9 eq, black) 60 W215A (6.5 eq, red) 50 W230A (7.6 eq, green) +w t x 2 40 +W215A +W230A

Hz 30

20

10

0 0.00 2.00 4.00 6.00 8.00 10.00 12.00 14.00 16.00 [ULM]/[UHM] ratio N/ppm 15 F537 60

50 F537 40 +w t x 2 +W215A 30 Hz +W230A

20

10

0 133 0.00 2.00 4.00 6.00 8.00 10.00 12.00 14.00 16.00 [ULM]/[UHM] ratio 10.5 1H/ppm 6.0 Supplementary Fig. 5: HSQC titrations of 15N labelled Puf60-UHM with unlabelled wt and mutant tandem ULM peptides. (A) Left: overlay of “titration end-point” HSQC spectra of Puf60-UHM upon addition of saturating amounts of SF3b155 194-229 wild type (black), W200A (red) and W218A (green). On the right, the chemical shift perturbation at various titration points is plottet against the molar ratio of peptide/UHM for V543 (upper graph) and I497(lower

graph). The black line depicts the result of the Kd fitting. Average Kds fitted for three peaks in the fast exchange regime for SF3b155(194-229) wild type, W200A and W218A are 0.89+/-0.41µM, 83+/-40µM, and 0.78+/-0.36µM, respectively. Error bars indicate the uncertainties of the measured chemical shifts expressed as the digital resolu- tion of the twice zero-filled spectrum. (B) Left: overlay of HSQC spectra of Puf60-UHM upon addition of 2.9 molar equivalents of Prp16 201-238 wild type (blue), 6.5 eqivalents of W215A (red) and 7.6 equivalents of W230A (green). On the right, the chemical shift pertur- bation at various titration points is plottet against the molar ratio of peptide/UHM for G539 (upper graph) and

F537(lower graph). The black line depicts the result of the Kd fitting. The average Kds fitted for three peaks in the fast exchange regime for Prp16(301-348) wild type, W215A, and W230A are 207+/-108 µM, 594+/-119 µM, and µ 524+/-382 M, respectively. Fitting the Kd of wild type with the peptide concentration multiplied by two yields an µ average Kd per ULM of 659+/-112 M. Error bars indicate the uncertainties of the measured chemical shifts expressed as the digital resolution of the twice zero-filled spectrum. 9 Supplementary Fig.6: Fig.Supplementary oftheUHMdomainsU2AF35andSPF45to 6-Binding various ULMs GST-tagged ULMpeptides. kDa 32- 14- 22- 1 234567891011121314 GSH-resin U2AF GST 35 GST-U2AF65(85-112) SPF45-UHM (301-401) -UHM (38-152) GST-SF3b155(317-357 GST-SF3b155(330-341)

Coomassie stainedGST-pulldown ofU2AF GST-SF1(1-25) 10% input U2AF35-UHM GSH-resin GST GST-U2AF65(85-112) GST-SF3b155(317-357 GST-SF3b155(330-341) GST-SF1(1-25) 10% input SPF45-UHM - UHM 10 35 -UHM andSPF45-UHMwithvarious Dimerization and Protein Binding Specificity of the U2AF Homology Motif of the Splicing Factor Puf60 Lorenzo Corsini, Michael Hothorn, Gunter Stier, Vladimir Rybin, Klaus Scheffzek, Toby J. Gibson and Michael Sattler J. Biol. Chem. 2009, 284:630-639. doi: 10.1074/jbc.M805395200 originally published online October 29, 2008 Downloaded from Access the most updated version of this article at doi: 10.1074/jbc.M805395200

Alerts: • When this article is cited • When a correction for this article is posted http://www.jbc.org/ Click here to choose from all of JBC's e-mail alerts

Supplemental material: http://www.jbc.org/content/suppl/2008/10/29/M805395200.DC1.html

at Helmholtz Zentrum Muenchen - Zentralbibliothek on April 11, 2016 This article cites 58 references, 14 of which can be accessed free at http://www.jbc.org/content/284/1/630.full.html#ref-list-1