View metadata, citation and similar papers at core.ac.uk brought to you by CORE

provided by Elsevier - Publisher Connector Structure 14, 1741–1753, December 2006 ª2006 Elsevier Ltd All rights reserved DOI 10.1016/j.str.2006.09.013 The C-Terminal SH3 Domain of CRKL as a Dynamic Dimerization Module Transiently Exposing a Nuclear Export Signal

Maria Harkiolaki,1 Robert J.C. Gilbert,2,3 variant, c-Crk I, was acquired by the avian retroviruses E. Yvonne Jones,3 and Stephan M. Feller1,* CT10 and ASV1 from their hosts (Mayer et al., 1988; Tsu- 1 Cancer Research UK Cell Signalling Group chie et al., 1989). Weatherall Institute of Molecular Medicine v-Crk and c-Crk I each contain a single SH2 and SH3 University of Oxford domain. Their respective SH2 domains are known to Oxford OX3 9DS bind to specific phosphotyrosyl(pTyr)-containing epi- United Kingdom topes, which encompass the core motif pTyr-x-x-Pro 2 Oxford Centre for Molecular Sciences (Songyang et al., 1993), while the SH3 domains bind Central Chemistry Laboratory preferentially to proline-rich motifs with the core con- University of Oxford sensus Pro-x-x-Pro-x-Lys (Knudsen et al., 1995; Wu South Parks Road et al., 1995). The c-Crk II is about 12 kDa larger Oxford OX1 3QH than c-Crk I and contains an additional C-terminal SH3 United Kingdom domain (SH3C), which is separated from the first SH3 3 Division of Structural Biology (SH3N) by a linker region of approximately 55 amino Cancer Research UK Receptor Structure Group acids. The same domain arrangement is also present Henry Wellcome Building of Genomic Medicine in Crk-like (CRKL), a homologous adaptor protein, which Oxford OX3 7BN is the product of a distinct gene (ten Hoeve et al., 1993). United Kingdom CRKL is a major substrate of Bcr-Abl, an oncogenic ki- nase intimately linked to the development of human chronic myelogenous leukemia (CML) (Oda et al., 1994; Summary ten Hoeve et al., 1994). c-Crk II and CRKL are known to be involved in the signaling cascades initiated by sev- CRKL plays essential roles in cell signaling. It con- eral cell-surface receptors, including growth-factor- sists of an N-terminal SH2 domain followed by two activated kinases and integrins (reviewed in Buday SH3 domains. SH2 and SH3N bind to signaling pro- [1999]; Furge et al. [2000]; Feller [2001], and Chodnie- teins, but the function of the SH3C domain has re- wicz and Klemke [2004]). Targeted disruption of the mained largely enigmatic. We show here that the CRKL gene has shown that CRKL also plays an impor- SH3C of CRKL forms homodimers in protein crystals tant role in neural crest development (Guris et al., and in solution. Evidence for dimer formation of full- 2001; Moon et al., 2006; Guris et al., 2006). length CRKL is also presented. In the SH3C dimer, No signaling partners or binding motifs have emerged a nuclear export signal (NES) is mostly buried under for the SH3C domains of c-Crk II and CRKL (67% se- the domain surface. The same is true for a monomeric quence identity) from peptide library, phage display, SH3C obtained under different crystallization condi- and other screens. However, several reports in the liter- tions. Interestingly, partial SH3 unfolding, such as oc- ature indicate that the SH3C is nevertheless functionally curs upon dimer/monomer transition, produces a important. Mutation or deletion of the SH3C domain of c- fully-accessible NES through translocation of a single Crk II massively increases the phosphorylation of the b strand. Our results document the existence of an large docking protein p130Cas in transfected rat 3Y1 fi- SH3 domain dimer formed through exchange of the broblasts (Ogawa et al., 1994), indicating an inhibitory first SH3 domain b strand and suggest that partial un- role for the SH3C in this signal transduction pathway. folding of the SH3C is important for the relay of infor- Conversely, it has been shown that a mutant cell line in- mation in vivo. capable of oncogenic transformation and EGF receptor signaling contains a c-Crk II protein with mutations in the Introduction SH3C domain (Kizaka-Kondoh et al., 1996). Later stud- ies showed adverse effects on FAK, c-Abl, and Adaptor are important in cellular signaling. By Dock180/Elmo/Rac signaling upon forced expression definition, they lack a catalytic domain and act primarily of c-Crk II proteins with SH3C domain mutations (Zvara as organizers of multiprotein complexes. Adaptors are et al., 2001; Akakura et al., 2005; Reichman et al., 2005). typically composed of two or more protein interaction However, in none of these reports did a detailed picture domains and/or multiple binding sites for such domains. of the molecular action of the c-Crk II SH3C domain The importance of adaptor proteins for cell signaling emerge, which would have provided clear indications was first recognized through the cloning of the retroviral for the functionality of the homologous CRKL SH3C. A protein CT10 regulator of kinase (v-Crk), an oncogenic single report in the literature has so far directly implied adaptor that transforms avian fibroblasts and induces a function for the CRKL SH3C domain. Felschow et al. sarcomas in chicken (Mayer et al., 1988). Subsequently, reported the binding of a cytoplasmic region from the two equivalent cellular proteins that are splice variants cell-surface molecule CD34 to the CRKL SH3C domain of a single gene, c-Crk I and c-Crk II, were discovered (Felschow et al., 2001). The proposed binding epitope (Reichman et al., 1992; Matsuda et al., 1992). The shorter was a short amino acid stretch in the juxtamembrane re- gion of CD34, but the interaction was not studied further. There is also compelling evidence for a function of the *Correspondence: [email protected] C-terminal SH3 domains of Crk family proteins in Structure 1742

Figure 1. SH3 Domains of Human Small SH2/SH3 Adaptors Sequence alignment of SH3 domains (A). Secondary structure elements of the CRKL SH3C monomer and dimer are marked at the top and feature along corresponding residue sections. Red stars mark residues of the NES present in Crk II and CRKL. Bold-type face or colors were manually assigned to individual residues. Red-colored residues are conserved in at least 70% (12 of 17); green residues are conserved throughout all SH3 domain sequences. Bold letters indicate residues that are conserved in most SH3 domains but not in the C-terminal SH3 domains of CRKL and c- Crk II. Additionally, unique cysteine residues in the C-terminal SH3 domains of CRKL and c-Crk II are emphasized by yellow letters on blue back- ground. Within the consensus box, an asterisk indicates identity, a double-point conservation, and a single-point semiconservation of residues across the aligned sequences. Phylogenetic tree of the selected SH3 domains (B), highlighting the close relation between Crk II and CRKL SH3C on a fairly distinct branch, which diverges considerably from other SH3 domains of small SH2/SH3 adaptors.

nucleocytoplasmic shuttling. These domains contain a The alignment of all SH3 domains from small human LALEVGEL/IVKV motif conserved from man to Xenopus, SH2/SH3 adaptors clearly shows that the SH3C do- which acts as a nuclear export sequence (NES) and mains of CRKL and c-Crk II differ in several amino acids, binds to Crm1 (Smith et al., 2002). A mutant c-Crk II pro- which are otherwise conserved in all or most SH3s ana- tein lacking an intact NES promotes apoptosis, presum- lyzed here (Figure 1A, diverging amino acids within the ably through an SH2 domain interaction with the nuclear CRKL and c-Crk II SH3C domains in bold black). Gener- tyrosine kinase Wee1 (Smith et al., 2000). Key residues ation of a phylogenetic tree for the SH2/SH3 adaptor of this NES are highlighted in Figure 1A. CRKL has mul- SH3 domains accordingly shows that the CRKL and tiple functions in the cytoplasm of cells (Feller, 2001) but c-Crk II SH3C domains are particularly dissimilar from has also been repeatedly reported to form heterodimers all other SH3s (Figure 1B). with the transcription factor STAT5 (Oda et al., 1998; In this study, we have initially analyzed the putative Ozaki et al., 1998; Ota et al., 1998; Fish et al., 1999; Rho- binding motif of CD34 for the CRKL SH3C domain with des et al., 2000). The maintenance of appropriate tem- isothermal titration calorimetry. Our results suggest poral and spatial CRKL concentrations in different that the previously proposed CD34 motif does not bind subcellular compartments, for example by nucleocyto- directly to the CRKL SH3C domain. Subsequently, puri- plasmic shuttling, is therefore likely to be crucial for its fied CRKL SH3C domain protein was crystallized, and its actions. atomic structure was analyzed by X-ray diffraction. In CRKL SH3C Dimer/Monomer Transition Exposes NES 1743

addition, CRKL SH3C and full-length CRKL were studied of multiple precipitants as well as a cryoprotectant, all by analytical ultracentrifugation (AUC) and gel filtration at extreme concentrations far beyond physiological chromatography. These studies showed that the CRKL levels. As such, this reservoir solution appears to pro- SH3C, as well as full-length CRKL, have the ability to mote dimer dissociation while stabilizing the monomer form homodimers and that the previously reported selectively. NES is likely to be fully accessible only during the trans- A CRKL SH3C construct encoding for the C-terminal location of a single b strand at the N-terminal end of the 66 amino acids of CRKL (molecular weight of 7.4 kDa) SH3 domain fold. Such translocation takes place during was used to express the domain. The first CRKL SH3C the interchange between dimeric and monomeric states structure was solved by molecular replacement using of the native domain, both of which we have captured a P41212 crystal that contained one SH3 domain per through crystallization. asymmetric unit. The visible structure comprised of 60 residues, and the final model was refined against ˚ Results and Discussion 99.6% complete 2.8 A resolution data to an Rwork of 27.8 (Rfree = 31.7) and good stereochemistry (all residues CRKL Association with CD34 in favored or allowed regions of the Ramachandran plot) Based upon the results of Fackler and colleagues (Fel- (Table 1). The monomeric CRKL SH3C exhibits the typi- schow et al., 2001), we investigated the binding of the cal SH3 b-barrel fold (see Supplemental Data, Figure S1, purified CRKL SH3C to the implicated epitope of available with this article online). It features five CD34. For this, peptides corresponding to the juxta- b strands, b1–b5 (residues 4–6, 26–32, 38–41, 48–51, membrane region in the cytoplasmic tail of CD34 were and 55–57) as well as a 310 helix of residues 52–54 (Fig- synthesized, and their binding analyzed by isothermal ti- ure 1A). The RT loop, present between b1 and b2, is sta- tration calorimetry (ITC), by using a previously described bilized at its ends by a b bridge between Arg10 and set up suitable for quantification of SH3 domain interac- Leu22 and is locked onto the b barrel through mostly hy- tions (Harkiolaki et al., 2003; Lewitzky et al., 2004). We drophobic interactions as well as an additional b bridge were unable to detect even substandard binding of the between Ala19 and Leu49 from b4. The n-Src loop fea- CD34-derived motifs to the CRKL SH3C domain by ITC tures modestly between b2 and b3 spanning from (M. Lewitzky, personal communication). Several expla- Asn34 to Gln38. No regular secondary structure is nations for this are possible: the interaction of CD34 evident in either loop, in line with all structurally charac- and CRKL could occur indirectly, i.e. via a bridging pro- terized SH3 domains to date. tein, or it may require a conformational arrangement of The second CRKL SH3C structure was solved by mo- CD34, which only exists in the native full-length CD34 lecular replacement, by using the monomeric structure protein. Alternatively, the direct binding affinity may be as a model, and was present in a P6122 crystal with an very low, and additional components may lead to a sta- intertwined SH3 dimer per asymmetric unit (124 resi- bilization of the complex in vivo. Very low affinity interac- dues in total). The atomic model of the dimer was refined ˚ tions of SH3 domains with a Kd in the millimolar range against a 99.7% complete 2.5 A dataset to an Rwork of can be biologically important (Vaynberg et al., 2005) 26.3 (Rfree = 30.9). The architecture of the dimer is prac- but are not easily detected with ITC. tically identical to that of individual monomers for the In the absence of a clearly defined direct binding C-terminal bulk of the structure. However, each domain partner of the CRKL SH3C domain, the isolated SH3C in the dimer looses its RT loop as it extends its N-termi- domain was expressed, purified, and crystallized in the nal and positions its b1 onto the canonical b1 position of hope that the resulting atomic structure would provide the neighboring monomer, while the latter adopts the us with insights into the functionality of this domain. Fur- same fold on its partner (Figures 2A–2C). There is little thermore, atomic structures of the native domain would change in the architecture of each of the partners in reveal the precise architecture and accessibility of the the dimer compared to the monomer structure (Fig- NES site on CRKL and hence increase our understanding ure 2D), with an rmsd of 0.87 A˚ upon superposition of of protein associations involved in nuclear trafficking. Ca atoms over the bulk of the domain excluding some terminal residues, nine RT loop residues, where cross- The Atomic Structures of CRKL SH3C over occurs, and four residues corresponding to the The atomic structures discussed have been determined b hairpin between b3 and b4. Strand exchange between through X-ray crystallography. We tested two crystal domains further promotes the formation of a new forms, which contained different association states of b strand (denoted b* throughout the text and related fig- the native CRKL SH3C, namely a monomer and a homo- ures) between residues of the former RT loops. The con- dimer. In both cases, the stock protein solution was the served secondary structure features within each domain same. This disparity of native conformations was in in the dimer are five b strands (b1 residues 2–6, b* 13–15, agreement with biochemical data indicating that the b2–b4 27–33, 38–43, and 46–51, respectively), with b5 full-length CRKL can also form homodimers in solution reduced to a b bridge at Lys56. Each subunit can be (discussed below). Stabilization of dimer versus mono- superimposed onto its partner within the dimer with an ˚ mer and vice versa could have been promoted by the overall rmsd of 0.41 A for its core Ca atoms (residues reservoir solution used. Crystals of the SH3C domain 3–33 and 39–58 as terminal residues were excluded as a dimer were formed under what could be considered along with five residues of the nSrc loop that exhibit as near-physiological conditions, with a mere 5% of pre- some variation between subunits). cipitant at an ionic strength of w250 mM (for details see Within the crystalline arrangement of the dimer, Experimental Procedures). However, crystals of CRKL symmetry-imposed association promotes tetramer for- SH3C as a monomer were formed under the influence mation. Two intermolecular disulphide bonds between Structure 1744

Table 1. Data Collection and Refinement Statistics The Putative Ligand-Binding Surface of CRKL SH3C To date, despite considerable efforts, there are no bind- Monomer Dimer ing partners identified for CRKL SH3C. It is interesting Data Collection nevertheless to identify outstanding features present in Resolution (A˚ )a 30–2.80 (2.90–2.80) 30–2.50 (2.59–2.50) its putative binding site. Three representative structures were superimposed onto the domain to gain insight into Space group P41212P6122 Cell dimensions (A˚ ) a = b = 53.28, a = b = 71.32, its potential binding mode: Crk SH3N, which binds c = 51.19 c = 146.43 PxxPxK/R motifs (1CKA.pdb and 1CKB.pdb), Grb2 Cell angles () a = b = g =90 a = b = 90, g = 120 SH3N binding PxxPxR, and Mona/Gads SH3C as a rep- Solvent content (%) 54.91 68.42 resentative of the atypical RxxK binding SH3 domains Wavelength (A˚ ) 1.77 1.54 Total data 170,871 361,319 (1OEB.pdb). Compared to these domains, CRKL SH3C Unique data 2,035 8,176 lacks the strong negative potential to bind positively Completeness (%) 99.6 (99.5) 99.7 (100) charged residues, such as arginine or lysine, as it misses I/s(I) 34.3 (3.2) 16.74 (1.09) at least two of the charged residues involved, so it is un- b Rmerge (%) 10.1 (99.9) 15.6 (99.9) likely it will engage in charged interactions of this nature Refinement although some residual negative potential still persists Number of unique 2,015 8,123 in that location (Figures 3A and 3B). Remarkably, the ab- reflections used sence of a tryptophan in position 38, which appears c d Rwork (Rfree) 28.7 (35.33) 26.27 (30.90) strictly conserved in other SH3 domains (Figure 1A), re- Rmsd bonds (A˚ ) 0.009 0.014 sults in a relative depression of the binding groove and Rmsd angles (deg) 1.670 1.83 the loss of hydrophobic stacking potential (Figure 1B). e Average B main 81.22 55.40 Although some stacking potential remains (a histidine chain (A˚ 2) Rmsd main-chain 3.65 2.99 where tyrosine normally resides and a conserved pro- bond B (A˚ 2) line), it is unlikely it will promote strong binding with in- Average Be side 84.33 59.41 coming prolines of a PxxP motif if the ligand orients itself chain (A˚ 2) in a canonical fashion. Strikingly, the N-terminal Arg10 Rmsd side-chain 4.81 5.03 produces an impressive pool of positive charges that ˚ 2 bond B (A ) appears to dominate the midregion of the canonical Number of atoms 474 (60) 978 (124) (residues) per binding grove (Figure 3B). It would be reasonable then asymmetric unit to conclude that a binding partner would need to com- plement this with negatively charged residues, in which Model Quality (Ramachandran Plot)f case we are looking into an entirely novel motif that has Residues in most 58.8 88.5 to be yet identified. Dimerization should not affect bind- favored regions (%) ing as crossover takes place at the tip of the RT loop, Residues in additional 35.3 9.6 allowed regions (%) which, in itself, remains largely in place with regards to Residues in generously 5.9 1.9 the canonical binding groove (Figure 3C). A projected allowed regions (%) full-length CRKL dimer, however, could assist in signal Number of individual 12 modification within signaling networks through local domains/asymmetric amplification of the CRKL modules (see Supplemental unit Data). Number of residues 60 (472) 124 (978) (number of atoms)/ asymmetric unit Further Evidence of Native Association States for CRKL and Its SH3C Domain a Values in parentheses correspond to the highest resolution shell. Sedimentation equilibrium analytical ultracentrifugation b Rmerge= SjSh (jIj.h 2 j)/SjSh(), where h is the unique reflec- (AUC) of the CRKL SH3C under near physiological con- tion index, Ij.h is the intensity of the symmetry related reflection, and ditions (50 mM HEPES [pH 7.5], 100 mM NaCl) over is the mean intensity. a range of concentrations indicated that it is a dimer in c R=ShjjFojh 2 /jFcjhj/ShjFojh, where h defines the unique reflec- solution, with further higher-molecular-weight assem- tions. blies promoted by increased concentrations (Figure 4A). d Calculated on a random 5% of the data. e B=8p2 / 3, where is the mean square atomic dis- Extrapolation to infinite dilution over the first four data placement. points gives a molecular weight of 15,471 6 837 Da, f Values from PROCHECK (Laskowski et al., 1993). which is in agreement with the theoretical molecular weight (14,890 Da) of a CRKL SH3C dimer (the last data point was excluded due to crowding effects). Sed- imentation velocity AUC revealed a sedimentation neighboring dimers result in the formation of covalently boundary with several peaks (Figure 4B), which could bonded tetramers. Consequently, respective b ladders be deconvoluted into three species with sedimentation overlap (cross-stagger) resulting in the burial of a total coefficients (s) of 1.4S, 1.9S, and 3.0S (S = Svedberg of 1033 A˚ 2 per dimer. Although this may be an artifact unit). The major peak was present at 1.9S, followed by promoted through crystal packing, higher order associ- one at 3.0S, while a peak at 1.4S represented a minor ations such as these might be involved in recruitment species. The sedimentation coefficients calculated and signal amplification in the context of localized oxi- from the crystal structures of the CRKL domain mono- dative events (see Supplemental Data for further details mer, dimer, and tetramer are estimated at 1.31S– and discussion and Figure S4). 1.56S, 1.80S–2.04S, and 2.84S–3.22S, respectively, CRKL SH3C Dimer/Monomer Transition Exposes NES 1745

Figure 2. The CRKL SH3C Dimer Schematic representation of the CRKL SH3C dimer (A), where the two polypeptide chains are colored green and red, respectively, with all b strands numbered from N to C terminus and a second view of the same structure (B) produced through a 90 clockwise rotation around the x axis of (A). The second orientation (B) is also viewed as a space-filling model (C). (D) Ca models of the SH3C dimer and superimposed mono- mers. The two polypeptide chains in the dimer are colored green and red, respectively, while the monomers are in gray. The structures were superimposed in LSQKAB (Kabsch, 1976) with equivalences of core SH3 fold residues across species. Orientation same as in (A).

allowing for varying degrees of solvation. Using the periment was 138 mM (1.03 mg/ml) and therefore a KD structure derived estimates for s, the three peaks in for the monomer-dimer interaction of 6.1 mM can be the sedimentation profile can be identified with the mo- estimated. nomeric, dimeric, and tetrameric populations of the The experiment was repeated in the presence of CRKL domain, where the monomeric form is the minor 50 mM DTT to confirm the identity of the 3.0S peak as species. In order to estimate the concentrations of the SH3C tetramer. In this case, the monomeric and di- each species observed in the c(s) profile, the area of meric peaks persisted (Figure 4B), but the tetrameric each equivalent fitted Gaussian peak was calculated peak was much reduced if not absent; this is in line as described in the Experimental Procedures. This indi- with the stabilization of the tetrameric form by disul- cated that of the peaks observed for CRKL SH3C, 11% phide bonds. Both in the presence and absence of of the protein was monomeric, 56% dimeric, and 5% tet- DTT, the sedimentation profiles showed a long tail to- rameric. The concentration of the sample used in the ex- ward higher s values; this likely arises from the formation Structure 1746

Figure 3. Comparative Surface Representations of the CRKL SH3C Monomer and a Representative SH3 Domain Exhibiting the Canonical SH3 Binding (A) Electrostatic surface representation of c-Crk II SH3N in complex with a peptide of the canonical PxxPxK motif (1CKA.pdb). Trp169 (the typ- ically conserved Trp, missing in CRKLSH3C) is labeled. (B) Electrostatic surface representation of the CRKL SH3C monomer in divergent stereo with representative loops and binding site residues la- beled. SH3 domains in both (A) and (B) are in exactly the same orientation after superposition for comparison purposes. (C) Surface representation of the CRKL SH3C dimer with normalized electrostatic potential in agreement with the monomeric structure. of transitory associations into higher-order assemblies 21,000 rpm, 44,001 6 236 Da. This pattern of apparent consistent with side-on intermolecular interactions be- molecular weight loss with increasing speed is indica- tween strands. tive of the presence of high molecular weight species Full-length recombinant CRKL was also used for both (aggregates) that are progressively removed from solu- sedimentation equilibrium and velocity experiments. A tion as the speed of the rotor is increased. In line with fit to a single-species model is shown in Figure 4A for the this, the velocity profile of full-length CRKL contains equilibrium experiment. The full-length CRKL gave the a number of peaks, indicative of the presence of a num- following molecular weights: at 12,000 rpm, 75,961 6 ber of discrete species of increasing molecular weight in 739 Da; at 15,000 rpm, 63,372 6 485 Da; and at the sample (Figure 4B). The positioning of these peaks, CRKL SH3C Dimer/Monomer Transition Exposes NES 1747

Figure 4. Biochemical Characterization of the Distribution of Associative Species in Solution and Corroborating Thermal-Parameter Distribution within Crystalline Arrangements Sedimentation equilibrium of CRKL domain and full-length protein (A). Data for the SH3 domain uses the left-hand y axis and bottom x axis where the apparent whole-cell molecular weight (kDa) is plotted against sample concentration (mg/ml) (green circles). The lower four concentrations were fitted with a straight line (orange) giving the molecular weight at infinite dilution where the line cuts the y axis. The right-hand y axis and top x axis represent values relating to the full-length CRKL. The interference optics sample equilibrium distribution (fringe shift versus radial cm) is shown (blue symbols) fitted with a single-species equation with ULTRASPIN (Altamirano et al., 2001) (red line). Also shown is the sedimentation velocity of CRKL SH3C domain and full-length protein (B). Orange squares represent the c(s) profile for the CRKL SH3C domain. The Gaussian fits to the first three species in the profile (s = 1.4S, 1.9S, and 3.0S) are in green. These correspond to the monomer, dimer, and tetramer, structures of which are shown on the right along with their corresponding range of calculated sedimentation coefficients (protein backbone drawn as a ribbon with hydrodynamic beads calculated by using AtoB [Byron, 1997] drawn on top). Red circles represent the c(s) profile for the CRKL domain with 1 mM DTT. In blue are the Gaussian fits to the remaining (after DTT addition) two species (monomer and dimer). The c(s) profile for full-length CRKL is depicted by magenta crosses and displays a complex distribution of sedimenting species. Fractions from size-exclusion chromatography (C) of HeLa extracts after SDS-PAGE separation, western blotting, and ELC detection of CRKL on blue light-sensitive autoradiography film. Ca tem- perature factor plot (D) of polypeptide chains in the monomeric (in red) and dimeric (in blue and green) folds. Absolute mean square atomic dis- placements are dependent on data resolution across crystals, and in this case, values reflecting the dimeric SH3C residues have been normal- ized to the monomeric structure by addition of a constant value (20) for the purposes of relative distribution comparisons. Secondary structure features are labeled and enclosing the residue numbers involved in their formation, while double headed arrows indicate the stretches associ- ated with the three loops present in an SH3 fold (RT, nSrc and distal loop, respectively).

at 1.9S, 2.5S, 3.5S, 4.6S, and 7.5S are consistent with the from solution and the apparent whole-cell molecular presence in the sample of monomeric, dimeric, and weight, measured by fitting the equilibrium profile of higher assemblies of full-length CRKL. As the speed of a dynamic system such as this to a single-species the rotor is increased, the heavier species are pelletted model, falls. Structure 1748

Size-exclusion chromatography estimates a molecu- the homologous site is a distinct possibility. Further- lar radius of 33 A˚ for both the full-length recombinant more, the 3D profile of this projected NES on CRKL CRKL as well as the native CRKL derived from adherent SH3C provides valuable insight to the atomic nature of HeLa cells (Figure 4C), which, assuming an overall glob- c-Crk II binding to Crm1. ular structure, is equivalent to an apparent molecular The key residues of the proposed NES (Leu20, Val24, weight of 55 kDa (the theoretical MW of a CRKL mono- Val 28, and Val30) are mostly solvent inaccessible in the mer is 33.8 kDa). The multidomain CRKL (SH2-SH3N- native CRKL structure, be that a dimer or a monomer SH3C) is likely to conform to a beads-on-a-string archi- (Figure 5). Leu20 is completely buried within the hydro- tecture where the three domains are linked in tandem phobic core of the domain underneath the b sheet com- but do not structurally interfere with each other. It is con- prised of b2–b4 and is not likely therefore to be directly ceivable that such a protein could be delayed as it tra- involved in associations with Crm1. Val24 is clearly lo- verses the chromatography matrix, thus appearing cated on the domain surface. It is the last residue to smaller than expected, as terminal domains sample maintain an orderly bonding pattern with b1 as it pro- the available channels that would have been inaccessi- ceeds toward the periphery of the SH3. Val28 and ble to a typical single-fold protein of similar molecular Val30 are both buried within the hydrophobic core of weight. Taking therefore into account the relative uncer- the domain, locked onto position also by b1 that com- tainty of this technique, these results appear to imply pletely obscures both residues from any incoming bind- that CRKL, either as a recombinant construct expressed ing partners and locks them well within the innermost in bacteria or as endogenous protein derived from a hu- core of the domain. However, both of these residues man cell line, is a dimer in solution (see Supplemental are presumed to be intimately involved with Crm1 bind- Data and Figure S3 for further biochemical characteriza- ing because mutation of either results in the abrogation tion by crosslinking). of Crm1 binding to c-Crk II without any apparent loss of The atomic structures available provide additional in- solubility (Smith et al., 2002). sight into the origins of dynamic states of the domain The 2.5 and 2.8 A˚ SH3 folds presented herein conform fold in CRKL SH3C. By plotting the distribution of Ca to the archetypal SH3 fold present in all well docu- temperature factors B (B =8p2 / 3, where is mented structural studies of such domains, thus they the mean square atomic displacement), a clear differ- appear to be genuinely stable states for the domain ar- ence between the monomer and respective subunits chitecture. However, the presence of two alternate as- within the dimer becomes apparent (Figure 4D). While sociation states may provide a viable hypothesis for loops in both structures appear to retain a rather high vi- the character of the associations of c-Crk II with Crm1 brational profile overall, compared to the core structural in the process of nuclear export. According to the bulk elements, the dimer is considerably less flexible in the of our biochemical data, CRKL, and specifically its RT region. The RT loop, being the feature that un- SH3C domain, exists as a dimer in solution. However, dergoes restructuring in the transition between mono- we were able to favor monomer formation in the crystal- mer and dimer, appears thus to have a pivotal role in lization medium, an event that must give rise to, or in- dimer/monomer interconversion. As such, it could ac- deed result from, a transiently exposed NES on each count for the propensity of the monomer for partial un- participating subunit. The projected NES motif of folding, while at the same time, its rebirth as part of CRKL, and by inference the NES of c-Crk II, could be a b ladder in the dimer (b*-b*) might promote the overall sampled by environmental factors during structure fluc- stability of the respective dimeric architecture. tuations inherent to the nature of this domain or become accessible upon and because of association with Crm1. Association States of CRKL and Their Implication If it is assumed that the two proteins can be found in the in Nuclear Trafficking immediate environment of each other, Crm1 might ini- The SH3C domain of CRKL has yet to be fully character- tially associate with CRKL through residues clearly ac- ized as to the nature of interactions it experiences and cessible, such as Val24 or Ile27 (the latter of which, ac- the pathways in which it is implicated. A recent report cording to Smith et al., is required for binding), and in on the closely homologous c-Crk II (93% residue homol- essence ‘‘unroll’’ the SH3C domain N-terminal region, ogy between SH3C domains) provided evidence for di- fully exposing the complementary NES site. In this rect association to Crm1 through an NES located on model, Val20 would be essential for core stability be- its SH3C domain (Smith et al., 2002), in compliance cause it locks b2 under the curved b sheet formed by with the generic F-x2-3-F-x2-3-F-x-F (F = L,I,V,F,M and b3–b5 thus maintaining the core fold of the domain. x is any amino acid) motif (la Cour et al., 2004). c-Crk II The NMR solution structure of the highly homologous can thus be shuttled out of the nucleus as a preventative c-Crk II SH3C as a monomer was published while this measure against initiation of proapoptotic signals gen- report was being reviewed (Muralidharan et al., 2006). erated through its association with the kinase Wee1. It is largely in agreement with our findings and was The same motif is present in CRKL SH3C, which implies reported along with complementary equilibrium dena- that CRKL might also be involved in nuclear shuttling in turation measurements and fluorescence-detected a similar fashion, possibly within the context of its own stopped-flow experiments that provided thermody- distinct signaling tasks, for example the well established namic measurements of the SH3C folding process. It heterodimerization with transcription factor STAT5. Fur- was therein established that low nanomolar affinity of ther experiments will need to confirm whether the CRKL Crm1 would sufficiently compensate for the free-energy SH3C NES is indeed functional; however, based on se- loss incurred during domain unfolding as seen elsewhere quence identity (see Supplemental Data, Figure S2), it (Petosa et al., 2004) and would appear to validate our de- is reasonable to assume that a fully functional NES on ductions based on the atomic resolution data available. CRKL SH3C Dimer/Monomer Transition Exposes NES 1749

Figure 5. The CRKL SH3C Dimer and the Po- sition of Its NES Surface representations of the CRKL SH3C dimer in two orientations with the two do- mains in tandem on the x-y plane (A) and the same rotated by 30 counterclockwise along the y axis (B). The two views are given in two panels each where both of the domains are drawn as surface representations of green or red (top), respectively, or one do- main is seen as an atomic model for clarity (bottom). Accessible surfaces corresponding to residues of the NES are colored yellow. Residues Ile27 and Phe58 appear continuous due to their close proximity.

CRKL SH3C, SH3 Dimers, Domain Swapping, of its residues can be exposed without overall structural and Hidden Motifs compromise (Sadqi et al., 1999) (see Supplemental Data SH3 domain homo- and heterodimerization has been re- for further discussion). ported in atomic detail on a number of occasions and In general, domain swapping has been documented encompasses a range of molecules involved in cell sig- repeatedly and found implicated in activation and regu- naling (Figure 6) (see Supplemental Data for a more ex- lation of protein function as well as increased thermo- tensive discussion). Additionally, the plasticity of the stability (Rousseau et al., 2003). It has also been sug- SH3 fold has been convincingly exhibited through struc- gested that certain folds display a propensity for ture-informed mutagenesis on PSD-95, whereupon domain swapping, which is further ensured through a five proline linker results in preferential dimer forma- the positioning of key residues (most commonly pro- tion between domains (McGee et al., 2001). Folding ki- lines) in the exchange region (Ding et al., 2006). netic analyses of the Src SH3 domain, after point muta- Ultimately, the prospect arises that the observed two- tions along its length, confirmed that within an SH3 state equilibrium we witness in the case of CRKL SH3C domain strands b1 and b2, the RT loop and the 310 helix is only the byproduct of a local unfolding phenomenon can unfold without compromising the SH3 fold overall that aims to expose a hidden NES. As such, this behav- (Riddle et al., 1999). Furthermore, NMR studies of the ior can be indicative of a potentially common yet pres- native state of a-spectrin SH3 domain confirm that any ently under-studied structural promiscuity effecting Structure 1750

Figure 6. Examples of Known SH3 Dimers Ribbon representations of SH3 fold dimers (partner subunit polypeptide chains in green and red, respectively) of Vav and Grb2 (A) (Nishida et al., 2001), Grap2 (Mona/Gads) (B) (Harkiolaki et al., 2003), KorB-C (C), p47phox (D) (Delbruck et al., 2002), and Eps8 (E) (Kishan et al., 1997). The yellow circle in (B) represents a crucial divalent metal ion.

novel functional characteristics on known proteins. A was constructed by using the ‘‘neighbor-joining’’ algorithm (Saitou well documented example of such propensity is the and Nei, 1987) with ‘‘number of differences’’ as method for calcula- structurally unrelated focal adhesion targeting (FAT) do- tion of distances between the sequences and ‘‘complete deletion’’ for treating gaps in the alignment. main of focal adhesion kinase (FAK), whose four-helix bundle assembles as either a monomer or a dimer (Arold SH3 Domain Expression and Purification et al., 2002; McGee et al., 2001). In both, tyrosine 926 be- The pGEX-CRKL SH3C construct was a kind gift of Brian J. Druker comes available to kinase action only upon partial un- and generated as described previously (Heaney et al., 1997). This folding of the domain, an event that is ultimately also re- construct was used to transform E. coli BL21 (DE3) cells, which sponsible for dimer formation (Dixon et al., 2004). were induced to express at OD600 1.4 with 0.1 mM IPTG in terrific broth at 37C for 4 hr with shaking. The bacterial culture was there- The function of the CRKL SH3C domain has been un- after centrifuged, and the pellets resuspended and sonicated in clear for a long time. Here, we provide evidence, both 50 mM Tris (pH 7.5) with, 100 mM NaCl, 13 Complete protease inhib- crystallographic and biochemical, for alternate associa- itor cocktail (Roche) and 0.2 mM phenylmethylsulfonyl fluoride tion states that could promote nuclear export through (PMSF). The lysates were further centrifuged, and the clear superna- the dislocation of a single SH3C domain b strand. In do- tant applied to GSH-Sepharose beads (Amersham Biosciences) and ing so, a plausible model of CRKL targeting by the nu- allowed to bind overnight at 4 C on a nutator. The loaded resin was then packed in a chromatography column, washed with three times clear protein export complex Crm1/Ran$GTP emerges with ten column volumes of 50 mM Tris (pH 7.5), 100 mM NaCl, and that accounts for the inaccessibility of a resident NES incubated with five units of Thrombin per milligram of immobilized site. This model can hereafter be further tested by muta- GST-CRKL SH3C overnight at ambient temperature while nutating. tional analysis and binding studies. By and large the The cleaved domain solution was thereafter drained from the col- most intriguing structural aspect of this study appears umn, concentrated, and further purified through FPLC size-exclu- to be the dynamic structural state inherent in this subset sion chromatography (Sephadex 75 16/60; Amersham Biosciences) into 50 mM HEPES (pH 7.5), 100 mM NaCl. of SH3 domains (and possibly other SH3 domain rela- tives). As such, the fluid architecture documented here Crystallization and Data Collection could reflect a whole new physiological role for SH3 The purified CRKL SH3C was concentrated to 43 mg/ml and used in domains other than the prescribed surface-docking sitting drop vapor diffusion crystallization screens. Two types of dif- function. fraction quality crystals were produced. Crystals of the domain as a monomer (roughly cubical, with dimensions of 0.2 3 0.2 3 0.2 mm3) were formed under equilibrium conditions of 1.7 M ammo- Experimental Procedures nium soleplate, 25.5% w/v PEG-8000, 0.085 M sodium cacodylate (pH 6.5), and 15% glycerol (Crystal Screen Cryo, Hampton Research) Primary Structure Alignment and Phylogenetic Tree for SH3 within 2 days at room temperature. These were vitrified through Domains of CRKL and Related Small Adaptor Proteins transfer directly into a nitrogen stream (100 K). Diffraction data to Sequences of human full-length proteins were retrieved from the 3.0 A˚ were collected on BM14 at the European Synchrotron Radia- NCBI Entrez Protein Database. Limits of individual SH3 domains tion Facility, Grenoble, France. Crystals of the dimeric form (rod were determined by a Pfam database HMM search (http://pfam. like, with dimensions of 0.4 3 0.2 3 0.2 mm3) were formed under janelia.org/)(Bateman et al., 2004). Subsequent analyzes were per- equilibrium conditions of 0.01 M MgCl2, 5% w/v PEG-8000, 0.2 M formed with version 3.1 of the ‘‘Molecular Evolutionary Genetics KCl, 0.05 M MES (pH 5.6) (Natrix, Hampton Research) within Analysis’’ (MEGA) package (http://www.megasoftware.net/)(Kumar a week at room temperature. Crystals were cryoprotected through et al., 2004). The initial alignment was created with the incorporated transfer into mother liquor containing 25% glycerol. Diffraction ˚ ClustalW algorithm (Thompson et al., 1994). The phylogenetic tree data were collected to 2.5 A on a CuKa rotating anode (Rigaku CRKL SH3C Dimer/Monomer Transition Exposes NES 1751

MicroMax-007, 40 mA, 20 kV) with osmic-blue mirrors and a 30 cm (RRSWSPTGER and NRRSWSPTGERLGEDPYYT) were synthesized imaging plate (MAR research). Data were integrated and scaled and purified by HPLC in the Cancer Research UK Protein and Pep- with DENZO and SCALEPACK (Otwinowski, 1993; Otwinowski and tide Chemistry Group by Nicola O’Reilly and coworkers. Minor, 1997)(Table 1). Extraction of Cytosolic HeLa Proteins Structure Solution and Refinement HeLa cells were grown as adherent culture in excess DMEM supple- Data collected from type I crystals, formed under conditions of ex- mented with 10% FBS, 100 U/ml penicillin, and 100 mg/ml strepto- treme ionic strength, were used in molecular replacement phasing mycin in a humidified atmosphere at 37 C with 10% CO2. Before har- trials with various existing SH3 domains, and a solution was ob- vesting, cells were washed twice with chilled PBS, twice with chilled tained in AMoRe (Navaza, 1994) with the SH3 domain of p47phox hypotonic lysis buffer (HLB; 10 mM Tris HCl [pH 7.5], 10 mM KCl,

(1UEC) (Yuzawa et al., 2004). The resulting model was rebuilt in the 1 mM EDTA, 1 mM EGTA, 2 mM MgCl2), and once with HLB contain- program O (Jones et al., 1991) and refined with CNS (Brunger ing 13 protease inhibitor cocktail (169748, Roche), 0.7 mg/ml pep- et al., 1998). Refinement and model quality statistics are summa- statin A, 13 phosphatase inhibitor cocktails 1 and 2 (P2850 and rized in Table 1. The refined CrkL-SH3C monomer model was there- P5726, SigmaAldrich), and 1 mM DTT. Cells were left sitting on ice after used as a molecular replacement model for the data collected to swell for 10 min and then scraped and dounce homogenized on a type II crystal with AMoRe. Two molecules were successfully (20 strokes with a small clearance pestle). The homogenate was clar- placed in the asymmetric unit, and several rounds of rebuilding ified by a 30 min spin with 100,000 3 gat2C. The supernatant was and refinement followed, resulting in the atomic model of an inter- collected, leaving the lipid layer on top behind, snap frozen in liquid twined domain homodimer. Refinement and model quality statistics nitrogen and stored at 280C until further use. The protein concen- are given in Table 1. tration of the S100 fraction was determined by Bradford assay.

Structure and Surface Representations Molecular Size-Exclusion Chromatography by FPLC Atomic structure representations were created with BOBSCRIPT Purified recombinant full-length CRKL was loaded on a pre- (Esnouf, 1997, 1999) and rendered with Raster3D (Merritt and Mur- equilibrated HiLoad Superdex 75 16/60 (Amersham Biosciences) in phy, 1994). Surface representations were done with GRASP (Nich- 50 mM HEPES (pH 7.5), 100 mM NaCl and run at a flow rate of olls et al., 1991). 1 ml/min. The CRKL protein elutes at an experimental molecular weight of 58 kDa as opposed to the theoretical 33.8 kDa of the mono- Analytical Ultracentrifugation mer. In identical fashion, HeLa cell extracts were run on the same Sedimentation equilibrium experiments were performed in a Beck- column. The collected 0.5 ml fractions were separated by SDS- man Optima XL-I analytical ultracentrifuge, as previously described PAGE, and the resolved proteins transferred onto a Hybond-P mem- (Sutton et al., 2004). Briefly, CRKL SH3C in 50 mM HEPES (pH 7.0), brane (Amersham Biosciences) with a semidry transfer cell. The 50 mM NaCl was used at concentrations ranging from 0.75 to membrane was thereafter blocked with TBS-T (20 mM Tris HCl [pH 12 mg/ml and centrifuged to equilibrium at 12,000, 15,000, and 7.5], 100 mM NaCl, 0.1% Tween, 5% nonfat milk) for 1 hr and incu- 21,000 rpm, respectively, at 20C, and imaged by using interference bated overnight with a 1:200 diluted rabbit anti-c-Crk II (H53, sc- optics. The sample distributions measured at equilibrium were fitted 9004; Santa Cruz) or rabbit anti-CRKL (sc-319). Membranes were with the program ULTRASPIN (Altamirano et al., 2001) by using then washed with TBS-T, incubated with HRP-coupled donkey a single-species equation. Any nonideal behavior, such as self- anti-rabbit IgG (Jackson ImmunoResearch Laboratories), washed association, then manifests itself as increasing apparent whole- again, and finally incubated with ECL detection reagent before cell molecular-weight-average weights (Mw;app) with increasing autoradiography. concentration. Sedimentation velocity experiments used the same sample preparation; data were collected by using interference op- Supplemental Data tics and were analyzed with SEDFIT (Schuck and Rossmanith, Supplemental Data include a graphical representation of the CRKL 2000) in the c(s) mode for the calculation of sedimentation coeffi- SH3C monomer, along with a primary amino acid sequence align- cient distributions. Single species display a Gaussian distribution ment of CRKL SH3C and its homolog c-CrkII SH3C. These are in sedimentation coefficient due to the effects of diffusion, about followed by a section on the possible implications the reported a central point that is the sedimentation coefficient of the species. dimerization in the molecular interactions CRKL partakes in, such The Gaussian distributions were fitted with the program PROFIT, as loss of phosphorylation potential and signal amplification. Addi- and the areas under individual peaks within the distribution were tional experimental data supporting the persistence of a native calculated with the equation dimer of the full-length CRKL in solution are supplied through cross- pffiffiffiffiffiffi linking experiments accompanied by a figure of SDS-PAGE analysis A = sa 2p; of the relevant results followed by a full description of the experi- where A is the area, s the half width, and a the height of each peak. mental procedure employed. Supplemental Data further include an Bead models were calculated for monomeric, dimeric, and tetra- extensive section on the prevalence and physiological importance meric structures of the CRKL domain by using AtoB (Byron, 1997), of SH3 dimers studied to date accompanied by a section on previ- with sedimentation coefficients being calculated for those bead ously published molecular simulations that aim to clarify the ener- models with SOLPRO (Garcia de la Torre et al., 1999). The theoretical getics behind the monomer-dimer interconversion the authors sug- sedimentation coefficients were used to calculate a range of likely gest in the main manuscript. Finally, the tetramer/dimer, formed via values for those models in solution, allowing for hydration between disulphide bonds through crystal contacts, is discussed in depth. This was shown to coexist with the reported monomers/dimers in 0.1 and 0.5 g H2O/g protein with the equation (Sutton et al., 2004) solution through analytical ultracentrifugation and could be an indi- d 1=3 cation of cell-signaling regulation induced through oxidative stress. s = s 1+ ; d 0 vr A graphical representation of the higher-order associations present in the crystalline arrangements studied is also supplied. The Supple- where sd is the sedimentation coefficient allowing for hydration of d g mental Data are available at http://www.structure.org/cgi/content/ H2O/ g protein, s0 is the sedimentation coefficient calculated by full/14/12/1741/DC1/. SOLPRO, v is the partial specific volume of the protein, and r is the density of the buffer. The range between a hydration of 0.1 g/g Acknowledgments and one of 0.5 g/g was then quoted as a likely range for the sedimen- tation coefficient of the modeled species. Crystallization was performed at the MRC funded Oxford Protein Production Facility (OPPF). We would like to thank Marc Lewitzky Isothermal Titration Calorimetry for help with ITC measurements and the production of Figure 1, Experiments were performed essentially as described previously Tom Walter for help with crystallization, and Karl Harlos for help (Harkiolaki et al., 2003; Lewitzky et al., 2004). The C-terminally with in-house data collection. The analytical ultracentrifugation ex- amidated CD34 juxtamembrane domain peptides analyzed periments were performed at the analytical ultra centrifugation Structure 1752

facility established by the Biotechnology and Biological Sciences Furge, K.A., Zhang, Y.W., and Vande Woude, G.F. (2000). Met recep- Research Council and the Wellcome Trust in the Glycobiology Insti- tor tyrosine kinase: enhanced signaling through adapter proteins. tute of the University of Oxford and managed by Russell Wallis. This Oncogene 19, 5582–5589. work has been funded by Cancer Research UK. E.Y.J. is a Cancer Garcia de la Torre, J., Harding, S.E., and Carrasco, B. (1999). Calcu- Research UK Principal Research Fellow, and R.J.C.G. is a Royal So- lation of NMR relaxation, covolume, and scattering-related proper- ciety University Research Fellow. ties of bead models using the SOLPRO computer program. Eur. Bio- phys. J. 28, 119–132. Received: June 23, 2006 Guris, D.L., Fantes, J., Tara, D., Druker, B.J., and Imamoto, A. (2001). Revised: September 26, 2006 Mice lacking the homologue of the human 22q11.2 gene CRKL phe- Accepted: September 27, 2006 nocopy neurocristopathies of DiGeorge syndrome. Nat. Genet. 27, Published: December 12, 2006 293–298. Guris, D.L., Duester, G., Papaioannou, V.E., and Imamoto, A. (2006). References Dose-dependent interaction of Tbx1 and Crkl and locally aberrant RA signaling in a model of del22q11 syndrome. Dev. Cell 10, 81–92. Akakura, S., Kar, B., Singh, S., Cho, L., Tibrewal, N., Sanokawa-Aka- Harkiolaki, M., Lewitzky, M., Gilbert, R.J., Jones, E.Y., Bourette, kura, R., Reichman, C., Ravichandran, K.S., and Birge, R.B. (2005). R.P., Mouchiroud, G., Sondermann, H., Moarefi, I., and Feller, S.M. C-terminal SH3 domain of CrkII regulates the assembly and function (2003). Structural basis for SH3 domain-mediated high-affinity bind- of the DOCK180/ELMO Rac-GEF. J. Cell. Physiol. 204, 344–351. ing between Mona/Gads and SLP-76. EMBO J. 22, 2571–2582. Altamirano, M.M., Woolfson, A., Donda, A., Shamshiev, A., Briseno- Heaney, C., Kolibaba, K., Bhat, A., Oda, T., Ohno, S., Fanning, S., and Roa, L., Foster, N.W., Veprintsev, D.B., De Libero, G., Fersht, A.R., Druker, B.J. (1997). Direct binding of CRKL to BCR-ABL is not re- and Milstein, C. (2001). Ligand-independent assembly of recombi- quired for BCR-ABL transformation. Blood 89, 297–306. nant human CD1 by using oxidative refolding chromatography. Jones, T.A., Zou, J.Y., Cowan, S.W., and Kjeldgaard, M. (1991). Im- Proc. Natl. Acad. Sci. USA 98, 3288–3293. proved methods for building protein models in electron density Arold, S.T., Hoellerer, M.K., and Noble, M.E. (2002). The structural maps and the location of errors in these models. Acta Crystallogr. basis of localization and signaling by the focal adhesion targeting A 47 (Pt 2), 110–119. domain. Structure 10, 319–327. Kabsch, W. (1976). A solution for the best rotation to relate two sets Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths- of vectors. Acta Crystallogr. A 32, 922–923. Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L., Kishan, K.V., Scita, G., Wong, W.T., Di Fiore, P.P., and Newcomer, et al. (2004). The Pfam protein families database. Nucleic Acids M.E. (1997). The SH3 domain of Eps8 exists as a novel intertwined Res. 32, D138–D141. dimer. Nat. Struct. Biol. 4, 739–743. Brunger, A.T., Adams, P.D., Clore, G.M., DeLano, W.L., Gros, P., Kizaka-Kondoh, S., Matsuda, M., and Okayama, H. (1996). CrkII sig- Grosse-Kunstleve, R.W., Jiang, J.S., Kuszewski, J., Nilges, M., nals from epidermal growth factor receptor to Ras. Proc. Natl. Acad. Pannu, N.S., et al. (1998). Crystallography & NMR system: a new Sci. USA 93, 12177–12182. software suite for macromolecular structure determination. Acta Crystallogr. D Biol. Crystallogr. 54, 905–921. Knudsen, B.S., Zheng, J., Feller, S.M., Mayer, J.P., Burrell, S.K., Cowburn, D., and Hanafusa, H. (1995). Affinity and specificity re- Buday, L. (1999). Membrane-targeting of signalling molecules by quirements for the first Src homology 3 domain of the Crk proteins. SH2/SH3 domain-containing adaptor proteins. Biochim. Biophys. EMBO J. 14, 2191–2198. Acta 1422, 187–204. Kumar, S., Tamura, K., and Nei, M. (2004). MEGA3: integrated soft- Byron, O. (1997). Construction of hydrodynamic bead models from ware for molecular evolutionary genetics analysis and sequence high-resolution X-ray crystallographic or nuclear magnetic reso- alignment. Brief. Bioinform. 5, 150–163. nance data. Biophys. J. 72, 408–415. la Cour, T., Kiemer, L., Molgaard, A., Gupta, R., Skriver, K., and Bru- Chodniewicz, D., and Klemke, R.L. (2004). Regulation of integrin- nak, S. (2004). Analysis and prediction of leucine-rich nuclear export mediated cellular responses through assembly of a CAS/Crk scaf- signals. Protein Eng. Des. Sel. 17, 527–536. fold. Biochim. Biophys. Acta 1692, 63–76. Laskowski, P.A., MacArthur, M.W., Moss, D.S., and Thornton, J.M. Delbruck, H., Ziegelin, G., Lanka, E., and Heinemann, U. (2002). An (1993). PROCHECK: a program to check the stereochemical quality Src homology 3-like domain is responsible for dimerization of the re- of protein structures. J. App. Cryst. 26, 283–291. pressor protein KorB encoded by the promiscuous IncP plasmid Lewitzky, M., Harkiolaki, M., Domart, M.C., Jones, E.Y., and Feller, RP4. J. Biol. Chem. 277, 4191–4198. S.M. (2004). Mona/Gads SH3C binding to hematopoietic progenitor Ding, F., Prutzman, K.C., Campbell, S.L., and Dokholyan, N.V. (2006). kinase 1 (HPK1) combines an atypical SH3 binding motif, R/KXXK, Topological determinants of protein domain swapping. Structure 14, with a classical PXXP motif embedded in a polyproline type II 5–14. (PPII) helix. J. Biol. Chem. 279, 28724–28732. Dixon, R.D., Chen, Y., Ding, F., Khare, S.D., Prutzman, K.C., Schaller, Matsuda, M., Tanaka, S., Nagata, S., Kojima, A., Kurata, T., and Shi- M.D., Campbell, S.L., and Dokholyan, N.V. (2004). New insights into buya, M. (1992). Two species of human CRK cDNA encode proteins FAK signaling and localization based on detection of a FAT domain with distinct biological activities. Mol. Cell. Biol. 12, 3482–3489. folding intermediate. Structure 12, 2161–2171. Mayer, B.J., Hamaguchi, M., and Hanafusa, H. (1988). A novel viral Esnouf, R.M. (1997). An extensively modified version of MolScript oncogene with structural similarity to phospholipase C. Nature that includes greatly enhanced coloring capabilities. J. Mol. Graph. 332, 272–275. Model. 15, 132–134, 112–113. McGee, A.W., Dakoji, S.R., Olsen, O., Bredt, D.S., Lim, W.A., and Esnouf, R.M. (1999). Further additions to MolScript version 1.4, in- Prehoda, K.E. (2001). Structure of the SH3-guanylate kinase module cluding reading and contouring of electron-density maps. Acta from PSD-95 suggests a mechanism for regulated assembly of Crystallogr. D Biol. Crystallogr. 55, 938–940. MAGUK scaffolding proteins. Mol. Cell 8, 1291–1301. Feller, S.M. (2001). Crk family adaptors-signalling complex forma- Merritt, E.A., and Murphy, M.E. (1994). Raster3D version 2.0. A pro- tion and biological roles. Oncogene 20, 6348–6371. gram for photorealistic molecular graphics. Acta Crystallogr. D Biol. Felschow, D.M., McVeigh, M.L., Hoehn, G.T., Civin, C.I., and Fackler, Crystallogr. 50, 869–873. M.J. (2001). The adapter protein CrkL associates with CD34. Blood Moon, A.M., Guris, D.L., Seo, J.H., Li, L., Hammond, J., Talbot, A., 97, 3768–3775. and Imamoto, A. (2006). Crkl deficiency disrupts fgf8 signaling in Fish, E.N., Uddin, S., Korkmaz, M., Majchrzak, B., Druker, B.J., and a mouse model of 22q11 deletion syndromes. Dev. Cell 10, 71–80. Platanias, L.C. (1999). Activation of a CrkL-stat5 signaling complex Muralidharan, V., Dutta, K., Cho, J., Vila-Perello, M., Raleigh, D.P., by type I . J. Biol. Chem. 274, 571–573. Cowburn, D., and Muir, T.W. (2006). Solution structure and folding CRKL SH3C Dimer/Monomer Transition Exposes NES 1753

characteristics of the C-terminal SH3 domain of c-Crk-II. Biochemis- Smith, J.J., Evans, E.K., Murakami, M., Moyer, M.B., Moseley, M.A., try 45, 8874–8884. Woude, G.V., and Kornbluth, S. (2000). Wee1-regulated apoptosis Navaza, J. (1994). AMoRe: an automated package for molecular re- mediated by the crk adaptor protein in Xenopus egg extracts. placement. Acta Crystallogr. A 50, 157–163. J. Cell Biol. 151, 1391–1400. Nicholls, A., Sharp, K.A., and Honig, B. (1991). Protein folding and Smith, J.J., Richardson, D.A., Kopf, J., Yoshida, M., Hollingsworth, association: insights from the interfacial and thermodynamic prop- R.E., and Kornbluth, S. (2002). Apoptotic regulation by the Crk erties of hydrocarbons. Proteins 11, 281–296. adapter protein mediated by interactions with Wee1 and Crm1/ex- portin. Mol. Cell. Biol. 22, 1412–1423. Nishida, M., Nagata, K., Hachimori, Y., Horiuchi, M., Ogura, K., Man- Songyang, Z., Shoelson, S.E., Chaudhuri, M., Gish, G., Pawson, T., diyan, V., Schlessinger, J., and Inagaki, F. (2001). Novel recognition Haser, W.G., King, F., Roberts, T., Ratnofsky, S., Lechleider, R.J., mode between Vav and Grb2 SH3 domains. EMBO J. 20, 2995–3007. et al. (1993). SH2 domains recognize specific phosphopeptide se- Oda, T., Heaney, C., Hagopian, J.R., Okuda, K., Griffin, J.D., and quences. Cell 72, 767–778. Druker, B.J. (1994). Crkl is the major tyrosine-phosphorylated pro- Sutton, G., Fry, E., Carter, L., Sainsbury, S., Walter, T., Nettleship, J., tein in neutrophils from patients with chronic myelogenous leuke- Berrow, N., Owens, R., Gilbert, R., Davidson, A., et al. (2004). The mia. J. Biol. Chem. 269, 22925–22928. nsp9 replicase protein of SARS-coronavirus, structure and func- Oda, A., Sawada, K., Druker, B.J., Ozaki, K., Takano, H., Koizumi, K., tional insights. Structure 12, 341–353. Fukada, Y., Handa, M., Koike, T., and Ikeda, Y. (1998). Erythropoietin ten Hoeve, J., Morris, C., Heisterkamp, N., and Groffen, J. (1993). induces tyrosine phosphorylation of Jak2, STAT5A, and STAT5B in Isolation and chromosomal localization of CRKL, a human crk-like primary cultured human erythroid precursors. Blood 92, 443–451. gene. Oncogene 8, 2469–2474. Ogawa, S., Toyoshima, H., Kozutsumi, H., Hagiwara, K., Sakai, R., ten Hoeve, J., Arlinghaus, R.B., Guo, J.Q., Heisterkamp, N., and Tanaka, T., Hirano, N., Mano, H., Yazaki, Y., and Hirai, H. (1994). Groffen, J. (1994). Tyrosine phosphorylation of CRKL in Philadel- The C-terminal SH3 domain of the mouse c-Crk protein negatively phia+ leukemia. Blood 84, 1731–1736. regulates tyrosine-phosphorylation of Crk associated p130 in rat Thompson, J.D., Higgins, D.G., and Gibson, T.J. (1994). CLUSTAL 3Y1 cells. Oncogene 9, 1669–1678. W: improving the sensitivity of progressive multiple sequence align- Ota, J., Kimura, F., Sato, K., Wakimoto, N., Nakamura, Y., Nagata, N., ment through sequence weighting, position-specific gap penalties Suzu, S., Yamada, M., Shimamura, S., and Motoyoshi, K. (1998). As- and weight matrix choice. Nucleic Acids Res. 22, 4673–4680. sociation of CrkL with STAT5 in hematopoietic cells stimulated by Tsuchie, H., Chang, C.H., Yoshida, M., and Vogt, P.K. (1989). A newly granulocyte-macrophage colony-stimulating factor or erythropoie- isolated avian sarcoma virus, ASV-1, carries the crk oncogene. tin. Biochem. Biophys. Res. Commun. 252, 779–786. Oncogene 4, 1281–1284. Otwinowski, Z. (1993). Proceedings of the CCP4 Study Weekend: Vaynberg, J., Fukuda, T., Chen, K., Vinogradova, O., Velyvis, A., Tu, Data Collection and Processing (Daresbury, UK: SERC Daresbury Y., Ng, L., Wu, C., and Qin, J. (2005). Structure of an ultraweak pro- Laboratory). tein-protein complex and its crucial role in regulation of cell mor- Otwinowski, Z., and Minor, W. (1997). Processing of X-ray diffraction phology and motility. Mol. Cell 17, 513–523. data collected in oscillation mode. Macromolecular Crystallogr. 276, Wu, X., Knudsen, B., Feller, S.M., Zheng, J., Sali, A., Cowburn, D., 307–326. Hanafusa, H., and Kuriyan, J. (1995). Structural basis for the specific Ozaki, K., Oda, A., Wakao, H., Rhodes, J., Druker, B.J., Ishida, A., interaction of lysine-containing proline-rich peptides with the Wakui, M., Okamoto, S., Morita, K., Handa, M., et al. (1998). Throm- N-terminal SH3 domain of c-Crk. Structure 3, 215–226. bopoietin induces association of Crkl with STAT5 but not STAT3 in Yuzawa, S., Ogura, K., Horiuchi, M., Suzuki, N.N., Fujioka, Y., human platelets. Blood 92, 4652–4662. Kataoka, M., Sumimoto, H., and Inagaki, F. (2004). Solution structure Petosa, C., Schoehn, G., Askjaer, P., Bauer, U., Moulin, M., Steuer- of the tandem Src homology 3 domains of p47phox in an auto- wald, U., Soler-Lopez, M., Baudin, F., Mattaj, I.W., and Muller, inhibited form. J. Biol. Chem. 279, 29752–29760. C.W. (2004). Architecture of CRM1/Exportin1 suggests how cooper- Zvara, A., Fajardo, J.E., Escalante, M., Cotton, G., Muir, T., Kirsch, ativity is achieved during formation of a nuclear export complex. K.H., and Birge, R.B. (2001). Activation of the focal adhesion kinase Mol. Cell 16, 761–775. signaling pathway by structural alterations in the carboxyl-terminal Reichman, C.T., Mayer, B.J., Keshav, S., and Hanafusa, H. (1992). region of c-Crk II. Oncogene 20, 951–961. The product of the cellular crk gene consists primarily of SH2 and SH3 regions. Cell Growth Differ. 3, 451–460. Reichman, C., Singh, K., Liu, Y., Singh, S., Li, H., Fajardo, J.E., Fiser, Accession Numbers A., and Birge, R.B. (2005). Transactivation of Abl by the Crk II adapter The coordinates and structure factors of the monomeric and dimeric protein requires a PNAY sequence in the Crk C-terminal SH3 do- CRKL SH3C structures have been deposited in the Protein Data main. Oncogene 24, 8187–8199. Bank with ID codes of 2BZX and 2BZY, respectively. Rhodes, J., York, R.D., Tara, D., Tajinda, K., and Druker, B.J. (2000). CrkL functions as a nuclear adaptor and transcriptional activator in Bcr-Abl-expressing cells. Exp. Hematol. 28, 305–310. Riddle, D.S., Grantcharova, V.P., Santiago, J.V., Alm, E., Ruczinski, I., and Baker, D. (1999). Experiment and theory highlight role of na- tive state topology in SH3 folding. Nat. Struct. Biol. 6, 1016–1024. Rousseau, F., Schymkowitz, J.W., and Itzhaki, L.S. (2003). The un- folding story of three-dimensional domain swapping. Structure 11, 243–251. Sadqi, M., Casares, S., Abril, M.A., Lopez-Mayorga, O., Conejero- Lara, F., and Freire, E. (1999). The native state conformational en- semble of the SH3 domain from alpha-spectrin. Biochemistry 38, 8899–8906. Saitou, N., and Nei, M. (1987). The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406–425. Schuck, P., and Rossmanith, P. (2000). Determination of the sedi- mentation coefficient distribution by least-squares boundary model- ing. Biopolymers 54, 328–341.