View metadata, citation and similar papers at core.ac.uk brought to you by CORE
provided by Elsevier - Publisher Connector Structure Article
Stereochemical Preferences Modulate Affinity and Selectivity among Five PDZ Domains that Bind CFTR: Comparative Structural and Sequence Analyses
Jeanine F. Amacher,1 Patrick R. Cushing,1 Lionel Brooks 3rd,2 Prisca Boisguerin,3,4 and Dean R. Madden1,* 1Department of Biochemistry, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA 2Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA 3Institute of Medical Immunology, Charite´ , 10115 Berlin, Germany 4Centre de Recherches de Biochimie Macromoleculaire, CRBM-CNRS, UMR-5237, UM1-UM2, University of Montpellier, Department of Molecular Biophysics and Therapeutics, 34293 Montpellier, France *Correspondence: [email protected] http://dx.doi.org/10.1016/j.str.2013.09.019
SUMMARY binding cleft is the core of the interaction, conferring critical target specificity. As a result, the underlying sequence:affinity PDZ domain interactions are involved in signaling relationships are essential to understand connectivity in PDZ- and trafficking pathways that coordinate crucial mediated protein networks. cellular processes. Alignment-based PDZ binding To identify the binding preferences of individual domains, early motifs identify the few most favorable residues at studies compared the sequences of known binding partners and highlighted the importance of the amino acids at the P0 (extreme certain positions along the peptide backbone. How- 2 ever, sequences that bind the CAL (CFTR-associated C terminus) and P positions (e.g., Songyang et al., 1997). High- throughput screens of phage-display and peptide-array libraries ligand) PDZ domain reveal only a degenerate motif have since revealed more complex ‘‘motifs’’ involving varying that overpredicts the true number of high-affinity in- combinations of up to seven C-terminal residues, consistent teractors. Here, we combine extended peptide-array with stereochemical interactions observed in individual com- motif analysis with biochemical techniques to show plexes (Doyle et al., 1996; Laura et al., 2002; Schultz et al., that non-motif ‘‘modulator’’ residues influence CAL 1998; Skelton et al., 2003; Stiffler et al., 2007; Tonikian et al., binding. The crystallographic structures of 13 CAL: 2008). Furthermore, although few motifs have been investigated peptide complexes reveal defined, but accommoda- beyond the P 6 position, biochemical and structural studies ting stereochemical environments at non-motif posi- have identified affinity contributions and stereochemical con- 10 tions, which are reflected in modulator preferences tacts extending as far as the P position (reviewed in Luck uncovered by multisequence substitutional arrays. et al., 2012). These preferences facilitate the identification of Even with extended motifs, PDZ binding motifs on average constrain fewer than four of the potential interacting residues high-affinity CAL binding sequences and differen- and often accommodate similar residues at a given position (Da- tially affect CAL and NHERF PDZ binding. As a result, vey et al., 2012). As a result, PDZ domains are frequently promis- they also help determine the specificity of a PDZ cuous, binding multiple target proteins and sharing targets with domain network that regulates the trafficking of other PDZ domains. This is illustrated by a set of PDZ proteins CFTR at the apical membrane. that regulate the intracellular trafficking and localization of the cystic fibrosis transmembrane conductance regulator (CFTR): the CFTR-associated ligand (CAL) and the NHE3 regulatory fac- INTRODUCTION tor proteins NHERF1 and NHERF2 (Cheng et al., 2002; Guerra et al., 2005; Wolde et al., 2007). Consistent with shared target If each residue is uniquely specified, a decapeptide can encode specificity, sequence alignments revealed overlapping motifs more than 1013 distinct sequences, providing a versatile and at the P0 and P 2 positions, and sequence optimization in the specific mechanism to encode protein:protein interactions. four C-terminal residues led to only 10-fold selectivity for the Indeed, domains recognizing short linear motif (SLiM) peptides CAL PDZ (CALP) domain (Vouilleme et al., 2010). Using a pep- are found throughout the human genome, including the abun- tide-array approach to extend the sequence iteratively toward dant family of PDZ domains first recognized in the proteins the N terminus, we engineered a CALP inhibitor (iCAL36; PSD-95, Dlg, and ZO-1 (Davey et al., 2012; Harris and Lim, sequence ANSRWPTSII) with 170-fold selectivity (Cushing 2001; Lee and Zheng, 2010; Nourry et al., 2003). PDZ target in- et al., 2010; Vouilleme et al., 2010). teractions are governed by a variety of constraints, including Here we characterize the interaction of the CAL PDZ domain local concentration, multidentate binding interfaces, and interac- with these N-terminal (‘‘upstream,’’ i.e., P 4 to P 9) binding deter- tions with additional proteins (Luck et al., 2012). Nevertheless, minants and with the P 1 and P 3 side chains. Although not the recognition of a generally C-terminal peptide by the PDZ captured by sequence alignment techniques, they act as binding
82 Structure 22, 82–93, January 7, 2014 ª2014 Elsevier Ltd All rights reserved Structure Affinity Modulators in CAL PDZ Binding
AB Figure 1. Sequence Determinants of CAL PDZ Binding and CAL/NHERF Specificity (A) WebLogo analysis (Crooks et al., 2004) of the top 100 sequences that bound CALP in a 6223HumLib peptide array experiment reveals clear preferences at the P0 and P 2 positions, similar to the short, degenerate binding motifs seen with the NHERF PDZ domains (Figure S1A). The C-terminal residue (P0) position is labeled 0; adjacent residues are 1, 2, etc. (B) For each PDZ domain, free energies of binding (kcal/mol) were calculated from the binding affinities determined by FP for ten different iCAL36/CFTR chimeric peptide sequences (Table 1). The number of intersection points in pairwise free-energy plots (Figure S2) were tabulated to reveal rank-order exchanges as a function of sequence.
‘‘modulators’’ for each domain, exerting a modest influence indi- position (e.g., S > E > T for N1P2 and E > S > D for N2P2) are slightly vidually, but a powerful impact collectively. Crystallographic anal- stronger than those observed for CALP. At least in the case of the ysis of 13 new CALP:peptide complexes spanning a wide range of N2P2 domain, the presence of acidic residues is broadly consis- affinities reveals an accommodating pharmacophore binding tent with a P 3 preference for Asp reported in earlier phage- model. In-depth stereochemical and substitutional array analysis display studies (Tonikian et al., 2008). However, as for CALP, no provides clues to the identification of preferences that remain hid- strong signatures were observed upstream of P 3. den from alignment motifs. These modulatory effects facilitate the identification of high-affinity binding sequences and contribute to Chimeric Peptides Reveal Positive and Negative Affinity inhibitor specificity in the context of CFTR trafficking. Modulators Although our experience suggested that residues outside the RESULTS P0/P 2 motif positions significantly enhanced binding selectivity of iCAL36 for CAL versus NHERF domains, we had not focused Sequence Alignments Identify only Core Binding Motifs on CALP affinity per se, which changed relatively little in the later for CAL and NHERF stages of iCAL36 sequence engineering (Vouilleme et al., 2010). Based on the demonstrated contributions of upstream residues Given the lack of upstream binding motifs, we decided to explore to the selectivity of our engineered iCAL36 peptide, we attempted the free-energy contributions of non-motif residues using nested to identify extended binding motifs for the CAL and NHERF PDZ sets of chimeric peptides based on reference sequences with domains. We performed WebLogo analysis of all ten C-terminal both high (iCAL36; Ki =23mM) and low (CFTR; Ki = 420 mM) positions in peptides identified by screening a 6223HumLib affinity for the CALP domain (Amacher et al., 2013). Using fluo- array (Cushing et al., 2008) with CALP or with the NHERF1 and rescence polarization (FP) competition experiments, we deter-
NHERF2 PDZ1 and PDZ2 domains (N1P1, N1P2, N2P2, and mined Ki values for each of the chimeric peptides binding to N2P2, respectively). However, the results (Figure 1A; Figure S1A CALP (Table 1). CALP has modest preferences for an 9ANSR 6 9 6 available online) did not highlight any significant upstream sequence over TEEE (iCAL36VQDTRL versus CFTR), Trp over 0 5 preferences outside the previously examined core residues P Val at P (iCAL36QDTRL versus iCAL36VQDTRL), and Gln over Pro 3 4 to P (Roberts et al., 2012; Vouilleme et al., 2010). at P (iCAL36 versus iCAL36Q 4). A positive CALP preference for Weak secondary CALP preferences are observed at a few aP 1 Arg is also suggested by the 5-fold higher affinity of the 3 positions (e.g., S > E > V at P ; Figure 1A). To test the robust- iCAL36TRL sequenced compared to iCAL36L and the approxi- ness of such signatures, we calculated motifs using a varying mate equivalence of the Ser/Thr motif at P 2 (Vouilleme et al., number of input sequences, and observed that both the rank 2010). However, the most significant effect is seen at P 3. 5 3 5 3 order and stereochemical nature of the residues changed. For Replacing WPT with VQD in iCAL36VQD 3 dramatically 3 5 example, at P , the top ten CALP-binding sequences yielded reduces CALP affinity (Ki > 1 mM). Because neither the P nor the pattern E > S > T, whereas the top 50 sequences yielded P 4 substitutions account for such a large effect, we synthesized 3 S>I>Q(Figure S1B). Furthermore, while the top ten CALP-bind- a CFTR peptide with a single substitution at P . CFTRT 3 1 ing sequences show modest signals, e.g., for Arg at P or Ser at (TEEEVQTTRL; Ki =18mM) binds 23-fold more tightly than the P 7, these grow progressively weaker or shift with the addition of native CFTR sequence (Table 1). more sequences (Figures 1A and S1B). To evaluate the possibil- To test the hypothesis that non-motif residues also affect ity of correlated binding requirements across multiple ligand po- NHERF1 and NHERF2 affinities, we measured the binding con- sitions, we reprobed our top 100 sequences using the MUltiple stants of the chimeric peptides for each of the four PDZ domains Specificity Identifier (MUSI) program, which uses both single (N1P1, N1P2, N2P2, and N2P2, respectively) in FP displacement and multiple position weight matrices to determine specificity assays (Table 1). The NHERF PDZ domains bind the CFTR and patterns (Kim et al., 2012a). Overall, the results again confirm iCAL36 peptides with high affinity for CFTR (Ki <2mM) and low the highly degenerate CALP binding motif originally seen at the affinity for iCAL36 (Ki > 1 mM; Amacher et al., 2013). Among 0 2 1 P and P positions (Figure S1C). these domains, substitutions at P (iCAL36TRL versus iCAL36L), 3 4 6 9 Similar patterns are observed for the NHERF domains (Fig- P (CFTRT 3), P (iCAL36Q 4), and P P (CFTR versus 3 ure S1A). In some cases, secondary preferences seen at the P iCAL36VQDTRL) each affect individual Ki values.
Structure 22, 82–93, January 7, 2014 ª2014 Elsevier Ltd All rights reserved 83 Structure Affinity Modulators in CAL PDZ Binding
Table 1. FP Binding Affinities of CALP and NHERF PDZ Domains for iCAL36/CFTR Chimeric Peptides
Ki (mM) Peptidea Sequenceb CALP CALP-E317A N1P1 N1P2 N2P1 N2P2 iCAL36TRL ANSRWPTTRL 4.5 ± 0.8 2.2 ± 1.2 8.0 ± 1.4 110 ± 10 8.7 ± 3.1 16.5 ± 3.9 iCAL36Q 4 ANSRWQTSII 14.8 ± 3.5 ND 280 ± 100 >1,000 300 ± 100 450 ± 150
CFTRT 3 TEEEVQTTRL 18 ± 5.5 15.1 ± 2.0 0.76 ± 0.51 4.6 ± 1.6 1.3 ± 0.4 0.45 ± 0.25 iCAL36c ANSRWPTSII 22.6 ± 8.0 15.1 ± 2.1 >1,000 >1,000 >1,000 >1,000 c iCAL36L ANSRWPTSIL 23.6 ± 2.2 ND 230 ± 100 >1,000 76.8 ± 1.6 530 ± 120 iCAL36QDTRL ANSRWQDTRL 56.2 ± 5.6 16.2 ± 3.5 1.5 ± 0.2 6.7 ± 2.4 3.1 ± 1.4 1.1 ± 0.6 iCAL36VQDTRL ANSRVQDTRL 230 ± 30 140 ± 40 2.5 ± 0.3 6.5 ± 2.3 3.8 ± 1.4 0.94 ± 0.42 CFTRc TEEEVQDTRL 420 ± 80 64.7 ± 1.8 0.47 ± 0.2 1.8 ± 0.5 0.83 ± 0.33 0.07 ± 0.05 c CFTRI TEEEVQDTRI 490 ± 120 ND 10.7 ± 2.8 39.8 ± 1.0 19.1 ± 2.9 2.5 ± 0.43 iCAL36VQD 3 ANSRVQDSII >1,000 770 ± 50 >1,000 >1,000 >1,000 420 ± 120 ND, value was not measured. aSubstitutions noted in subscript; the final substituted position is C terminal unless numbered. bBold indicates residue(s) from the CFTR sequence; non-bold indicates residues from iCAL36. cValues previously reported in Amacher et al., 2013.
Finally, to explore the potential of these non-motif differences IPDZ values, several side chain atoms are occluded at least to contribute to PDZ inhibitor selectivity, we developed an 50% upon CALP docking (atoms colored white in Figure 2D). approach to identify substitutions that switched rank order of Of course, contact mapping cannot by itself distinguish affinity. We first plotted the binding free energies for the series between favorable and unfavorable binding relationships. It of peptides along a vertical axis for each domain. When placed also does not capture long-range (e.g., electrostatic) interactions side-by-side for a given pair of PDZ domains, with straight line or shifts in the free energy of the unbound state. Nevertheless, connections between the values for each peptide, rank-order the observed steric interactions are consistent with evidence switches appear as points of intersection (Figure S2). Compared that non-motif residues can affect CALP affinity. to each NHERF PDZ domain, the CALP affinities exhibit 19–25 pairwise affinity crossovers (Figure 1B) that presumably account Defined CALP Binding Sites Interact with Each Residue for the engineered selectivity of iCAL36. Thus, although the CAL Position and NHERF binding motifs fail to highlight preferences at P-1, Given the characteristically shallow peptide-binding pocket and P-3 through P-9, these residues are capable of ‘‘modulating’’ of the CALP domain, we wished to determine whether our both the affinity and selectivity of peptides for these PDZ substituted peptides adopt significantly variable conformations, domains. Furthermore, despite high sequence identity (58%– which could complicate stereochemical analysis and obscure 72%) between the NHERF PDZ domains, pairwise comparisons modulator preferences in sequence alignments. To address reveal two to six crossovers (Figure 1B), suggesting potential this point, we crystallized and solved the structures of CALP non-motif contributions to selectivity even in this tightly related bound to four chimeric iCAL36/CFTR peptides, selected to cover cluster. a range of affinities: iCAL36TRL (Ki = 4.5 mM), iCAL36Q 4 (Ki = 14.8 mM), iCAL36QDTRL (Ki = 56.2 mM), and iCAL36VQD 3 (Ki > All Six C-Terminal Positions Form Side-Chain- 1 mM). To further explore sequence and conformational space, Dependent Interactions we also crystallized CALP bound to two additional low-affinity Despite the absence of motif signatures at four of the six resi- binding partners with unrelated sequences. HPV E6 viral onco- dues from P0 to P 5, each residue directly contacts the surface protein C-terminal peptides have been studied in complex of the CALP binding site, as illustrated in Figure 2A for one of with other PDZ domains with nuclear magnetic resonance and the two protomers in our previously published CALP:iCAL36 X-ray crystallography (Charbonnier et al., 2011; Liu et al., 2007; structure (Amacher et al., 2013). Corresponding contact sur- Mischo et al., 2013; Zhang et al., 2007). Here, we generated faces on the peptide are highlighted by comparison of the sol- CALP complexes with the decameric E6 C-terminal peptides vent-accessible surface of the bound peptide in the absence from HPV16 (SSRTRRETQL) and HPV18 (RLQRRRETQV; Jeong
(Figure 2C) and presence of the CALP domain (Figure 2D). et al., 2007; Pim et al., 2012), with Ki values of 340 ± 70 mM and The non-motif contacts also represent potentially significant 490 ± 20 mM, respectively. stereochemical constraints. The fraction of solvent-accessible Structure determination and refinement protocols were de- surface area (SASA) buried serves as an interaction index (IPDZ) signed to avoid phase bias (Table S1; Figures S3 and S4). Final that gauges the extent of side-chain engagement at each posi- data and refinement statistics are shown in Table 2. In general, tion along the backbone (Figure 2B). At the non-motif position the iCAL-based structures are very similar. When the PDZ back- 5 P ,IPDZ values (0.87 and 0.63 for the A and B protomers, bone atoms of each structure are aligned to the CALP:iCAL36 B- respectively) are comparable to those for the motif residue P-2 protomer, they exhibit an average root-mean-square deviation ˚ ˚ (0.73 and 0.64). The other side chains exhibit IPDZ values ranging (rmsd) value of 0.34 ± 0.09 A, with a maximum of only 0.52 A from 0.26 to 0.60, and even for residues with lower side chain (B-protomer of CALP:iCAL36TRL). Visually, the superposition of
84 Structure 22, 82–93, January 7, 2014 ª2014 Elsevier Ltd All rights reserved Structure Affinity Modulators in CAL PDZ Binding
A -5 -6 B can extend previous models (e.g., Kang et al., 2003) to define P P -5 0 5 -8 a pharmacophore model in which positions P to P each -4 P 0 5 P P -7 -4 interact with cognate sites S to S (Figure 3C). According to -9 this model, each CALP site exhibits a characteristic set of stereo- P -3 -2 chemical constraints that favor the binding of a corresponding P P -3 -2 subset of complementary amino acid(s). If the constraints are -1 iCAL36 B-protomer stringent, they may be present in enough sequences to appear P 0 P -1 iCAL36 A-protomer Position from C-terminus as part of a motif. If they are more accommodating, they may 0 not, but may still contribute to affinity as modulators. 0 0.2 0.4 0.6 0.8 IPDZ Positional SubAna Analysis Confirms Multiple -6 Modulatory Residues C -5 R D W N -8 As a further test of the idea that each peptide side chain contacts -4 -7 P S a defined stereochemical environment relatively independent -2 S of sequence context, we performed substitutional analysis A-9 -3 (SubAna) using peptide arrays derived from a variety of starting T sequences. Each SubAna array contains a single peptide -1 135° 0 I 135° sequence with all 20 natural L-amino acids individually I substituted at each position along the chain (Boisguerin et al., 2007). We chose to use SubAnas based on ten starting se- Figure 2. Six Peptide Positions Interact with the CALP Binding Cleft quences that encompassed a number of different CALP-binding (A) The van der Waals surface of the CALP protein is shown together with the attributes (sequences are described in detail in Figure S5). bound iCAL36 peptide (stick figure), with individual residues labeled and colored by position. The CALP surface (cyan) is colored at contact sites Except for the artificial iCAL36 sequence, these peptides are according to the closest peptide residue: P0 Ile (red), P 1 Ile (purple), P 2 Ser derived from endogenous proteins, include known interacting (gray), P 3 Thr (orange), P -4 Pro (yellow), P 5 Trp (blue), P 6 Arg (green), P 7 partners (Cheng et al., 2002; Wente et al., 2005), cover a wide Ser (black), P 8 Asn (pink), and P 9 Ala (forest green). range of affinities for CALP (Cushing et al., 2008), and provide 2 (B) The peptide side-chain interaction index (IPDZ) is shown as a function of a variety of binding motif residue combinations (S/T at the P residue position for the peptides interacting with the A- (gray), and B- (black) position, I/L/V at the P0 position) for comparison. Furthermore, CALP protomers of the asymmetric unit. while the amount of protein bound is measured at a single con- (C and D) The van der Waals surface of the bound conformation of the iCAL36 peptide is shown in the same perspective as in (A, left) and following rotation by centration under nonequilibrium conditions, we can qualitatively 135 around the vertical axis (right). The surface is colored by residue as in (A). compare trends across arrays (Boisguerin et al., 2011). To
To illustrate the surface contacted by CALP, individual atoms with IPDZ values compare positional preferences across the full set of peptides, > 0.5 are colored white in (D), involving residues P0 P 5. we separated each SubAna array into six strips, corresponding to the residues from P0 to P 5, and then aligned all of the strips these CALP domains (Figure 3A) confirms that the framework of for P0, all of the strips for P 1, etc. The resulting collage (Figure 4) the binding cleft is largely independent of ligand sequence. shows the variability of side chain preference(s) at a given posi- This domain superposition also reveals strong conformational tion. Consistent selective preferences at the P0 and P 2 posi- similarity of the peptides. We first compared the complexes tions reflect their status as motif residues, largely independent formed by derivatives of the iCAL sequence, including the of peptide context. chimeric peptides described above, as well as earlier structures At the other positions, a more heterogeneous picture of iCAL36 and iCAL36L (Amacher et al., 2013). Despite widely emerges, consistent with the lack of motif constraints observed varying binding affinities, all peptide co-complexes reflect previously (Figure 1B). A residue may be favored for certain canonical Class I PDZ domain-target interactions: the P0 residue sequences, and disfavored for others. For example, a P 4 Phe with the carboxylate-binding loop, the P 2 residue with His349, residue appears to be strongly favored for CFTR (row 7) and and conserved b sheet interactions (Figure 3B). Pairwise com- other sequences, while disfavored for VIPR2 (row 9). However, parisons of the peptides reveal Ca variations less than 1.1 A˚ for despite this variability, certain trends are observed that apply the P0 to P 3 positions, and up to 4.4 A˚ for the P 4 and P 5 to most or all sequences. For example, our positional arrays positions. These latter variations may partially reflect conforma- confirm positive preferences for a P 1 Arg and for a P 5 Trp tional changes due to differential crystal packing (Figure S3), but residue. They also reveal negative preferences, including P 3 the side chains at each position still contact relatively similar sur- Asp (Figure 4), consistent with the affinity difference observed faces on CALP. Thus, even the substitution of half of the residues between CFTR and CFTRT 3 (Table 1). CALP also appears to in the peptide has only a limited effect on the stereochemical disfavor a Gly or Pro at P 1 and Pro at P 3. environment experienced by the side chain at a given position along the backbone. Single Substitutions at P–5 Modulate Binding Affinity Comparison of iCAL36 with the HPV E6 peptides also revealed To investigate the stereochemistry of preferential accommoda- similar peptide backbone conformations, despite very low tion of a modulator residue, we focused on the impact of single sequence similarity (Figure 3C). Thus, in all of our structures, residue substitutions in iCAL36 at the P 5 position. Competition the side chains at a given peptide position are oriented similarly, FP experiments confirm a range of affinities for P 5-substituted contacting a common CALP local environment. As a result, we peptides (Table 3). The Leu- and Tyr-substituted peptides bind
Structure 22, 82–93, January 7, 2014 ª2014 Elsevier Ltd All rights reserved 85 Structure Affinity Modulators in CAL PDZ Binding
Table 2. Data Collection and Refinement Statistics for CALP Co-Crystals with iCAL36/CFTR Chimeric Peptides
iCAL36Q 4 iCAL36TRL iCAL36VQD 3 iCAL36QDTRL iCAL36QDTRL HPV16 E6 HPV18 E6 CALP CALP CALP CALP CALP-E317A CALP CALP Data collection
Space group (number) P 21 21 21 (19) P 1 (1) P 21 (4) P 63 2 2 (182) P 63 2 2 (182) P 21 (4) P 21 (4) Unit cell dimensions a,b,c (A˚ ) 36.0,47.7,98 30.8,50.2,55.2 36.3,48.8,54.9 61.6,61.6,97.6 61.8,61.8,97.4 33.1,48.4,52 33.2,48.0,52.9 a,b,g ( ) 90,90,90 68.8,75.8,87.9 90,92.8,90 90,90,120 90,90,120 90,101.6,90 90,102.0,90 Resolution (A˚ )a 19.9–1.48 19.3–1.75 19.6–1.90 19.7–1.50 19.1–1.80 19.4–1.80 19.3–1.34 (1.58–1.48) (1.79–1.7) (2.03–1.9) (1.58–1.5) (1.85–1.8) (1.90–1.8) (1.42–1.34) b Rsym (%) 5.9 (45.0) 4.5 (37.2) 6.5 (36.0) 4.9 (61.5) 11.0 (61.7) 5.9 (52.3) 3.8 (26.5) c I/sI 34.60 (6.93) 15.48 (2.60) 15.58 (3.59) 49.46 (6.32) 26.01 (6.53) 20.37 (3.16) 36.69 (7.53) Completeness (%) 99.7 (99.2) 94.8 (91.1) 99.6 (99.5) 99.9 (100.0) 99.9 (100.0) 99.4 (98.1) 94.2 (77.8) Refinement Total no. of reflections 28,816 30,585 15,162 18,179 10,780 14,953 34,482 Reflections in test set 1,460 1,493 763 920 560 758 1,729 d e Rwork /Rfree (%) 18.1/20.9 17.2 /22.1 17.9/22.7 19.7/22.1 18.1/22.1 18.3/22.3 17.2/18.1 Number of atoms Protein 1,467 2,927 1,426 774f 772f 1,431 1,507 Water 180 267 141 69 69 147 258 Ramachandran plotg (%) 96.8/3.2/0/0 95.6/4.4/0/0 98.0/2.0/0/0 97.3/2.7/0/0 100/0/0/0 96.6/3.4/0/0 97.4/2.6/0/0 ˚ 2 Bav (A ) Protein 13.27 18.96 22.38 19.36 16.32 23.40 9.86 Solvent 24.62 28.73 31.27 30.03 27.43 31.13 23.10 Bond length rmsd (A˚ ) 0.006 0.006 0.007 0.007 0.008 0.007 0.007 Bond angle rmsd ( ) 1.078 1.006 1.097 1.169 1.203 1.064 1.167 a ValuesP in parenthesesP areP forP data in the highest-resolution shell. b Rsym = h ijIðhÞ IiðhÞj= h iIiðhÞ , where Ii(h) and I(h) values are the i-th and mean measurements of the intensity of reflection h. c = j ð + Þ ð Þj= SigAnoPF -F - s. P d Rwork = kFobsjh jFcalckh= jFobsjh , h e {working set}. e Rfree is calculated as Rwork for the reflections h e {test set}. fIncluding sulfocysteine at position 319 (see Figure S4). gCore/allowed/generously allowed/disallowed.