B doi:10.1016/S0022-2836(02)00801-X available online at http://www.idealibrary.com on w J. Mol. Biol. (2002) 322, 645–652

The Backbone Conformational Entropy of Folding: Experimental Measures from Atomic Force Microscopy

James B. Thompson1, Helen G. Hansma1, Paul K. Hansma1* and Kevin W. Plaxco2,3*

1Department of Physics The energy dissipated during the atomic force microscopy-based University of California, Santa mechanical unfolding and extension of is typically an order of Barbara, CA 93106-9510, USA magnitude greater than their folding free energy. The vast majority of the “excess” energy dissipated is thought to arise due to backbone confor- 2Department of Chemistry and mational entropy losses as the solvated, random-coil unfolded state is Biochemistry, University of stretched into an extended, low-entropy conformation. We have investi- California, Santa Barbara, CA gated this hypothesis in light of recent measurements of the energy dis- 93106-9510, USA sipated during the mechanical unfolding of “polyproteins” comprised of 3Interdepartmental Program in multiple, homogeneous domains. Given the assumption that backbone Biomolecular Science and conformational entropy losses account for the vast majority of the energy Engineering, University of dissipated (an assumption supported by numerous lines of experimental California, Santa Barbara, CA evidence), we estimate that ,19(^2) J/(mol K residue) of entropy is lost 93106-9510, USA during the extension of three mechanically stable b-sheet polyproteins. If, as suggested by measured peak-to-peak extension distances, pulling proceeds to near completion, this estimate corresponds to the absolute backbone conformational entropy of the unfolded state. As such, it is exceedingly close to previous theoretical and semi-empirical estimates that place this value at ,20 J/(mol K residue). The estimated backbone conformational entropy lost during the extension of two helical poly- proteins, which, in contrast to the mechanically stable b-sheet polyproteins, rupture at very low applied forces, is three- to sixfold less. Either previous estimates of the backbone conformational entropy are significantly in error, or the reduced mechanical strength of the helical proteins leads to the rupture of a subsequent domain before full extension (and thus complete entropy loss) is achieved. q 2002 Elsevier Science Ltd. All rights reserved *Corresponding authors Keywords: configurational entropy; worm-like chain

Introduction tain a wealth of information regarding the mechanical properties, folding kinetics and Atomic force microscopy (AFM) and optical thermodynamics of proteins, some of which has tweezers have been used to characterize the not been explored in detail. Here, for example, we mechanical properties and force-induced unfold- demonstrate that AFM-unfolding may provide an ing of a wide range of proteins.1–15 When applied experimental means of measuring the backbone to a protein comprised of multiple independent conformational entropy of an unfolded protein. folding units (domains), these pulling experiments The area under each peak in AFM-unfolding typically produce a saw-toothed force versus exten- force–extension curves represents the energy sion curve as the domains sequentially unfold and required to unfold a domain and to pull the extend. These pioneering pulling experiments con- unfolded state into an extended conformation (Figure 1). The energy dissipated during such a Abbreviations used: AFM, atomic force microscopy; mechanical unfolding event is, however, an order Ig, immunoglobulin domain. of magnitude greater than the typical free energy E-mail addresses of the corresponding authors: of folding of a single-domain protein. The dis- [email protected]; [email protected] crepancy is thought to arise due to the additional

0022-2836/02/$ - see front matter q 2002 Elsevier Science Ltd. All rights reserved 646 Backbone Conformational Entropy of Folding

Figure 1. A diagram of the force- induced unfolding of a multi- domain protein. The area under the force–extension curve represents the energy dissipated during the pulling process. While a number of factors may contribute to this energy, the excellent fit of force– extension curves to the expected behavior of an inert, worm-like chain2,5,8 suggests that the loss of conformational entropy dominates. If this is true and the extension between subsequent rupture events is near completion (i.e. that the maximally extended state adopts one or a few conformations) the energy dissipated then provides an experimental means of determining the backbone conformational entropy change associated with . energy input required to extend the initially highly The five suitable polyproteins represent both disordered, random-coil unfolded state into one or predominantly sheet and predominantly helical a few near-fully extended conformations, a structures. Fernandez, Clarke and co-workers hypothesis supported by the observation that have used protein engineering to create homo- force–extension curves are well fit by models of geneous polyproteins consisting of multiple the entropic cost of extending an inert, worm-like repeats of identical, predominantly b-sheet chain.2,5,8 Here, we investigate this hypothesis in immunoglobulin (Ig) domains.6,7,9 Bustamante, the light of previous estimates of the backbone con- Dahlquist and co-workers, in turn, have used formational entropy of an unfolded polypeptide. solid-phase coupling to convert monomeric, helical T4 lysozyme into a covalent, disulfide-bonded polyprotein.10 Gaub and co-workers have engi- Results neered a multi-domain protein comprised of four homologous, structurally identical, predominantly Early protein pulling studies focused on natu- helical a-spectrin domains.4 Because each of the rally occurring multidomain proteins such as titin, well-characterized domains comprising them is tenascin and fibronectin.1–3,11 These proteins are structurally (and, with the exception of the comprised of multiple copies of two or three struc- a-spectrin polyprotein, mechanically and thermo- turally distinct domain types and thus their force– dynamically) identical with its neighbors, these extension curves reflect contributions from systems are ideally suited for detailed studies of structural units that vary in terms of both unfold- folding thermodynamics. ing free energy and extension length. This hetero- Fernandez, Clarke and co-workers have syn- geneity is evident in the observation that thesized and characterized the folding and sequential rupture peaks occur at increasing force mechanical unfolding of polyproteins comprised as the mechanically weakest domains break prior of either the 27th or 28th Ig domains (I27, I28) to mechanically stronger domains (e.g. see Figure from the human muscle protein titin.6,9 These 89 1A of Li et al.9). Because it is generally not possible residue, predominantly b-sheet domains rupture to correlate the individual features in a force– at applied forces of 204(^16) pN and extension curve with the unfolding and extension 257(^27) pN, respectively, reflecting their of a specific domain, it is difficult to account relatively high mechanical stability. The force– quantitatively for the energy dissipated in the extension curves observed upon their unfolding mechanical unfolding of these proteins. In order to during mechanical pulling are well fit by models circumvent this complication, several groups have of the entropy lost as a worm-like chain is employed protein engineering to produce “poly- extended.6,9 We have integrated the worm-like proteins” comprised of multiple copies of similar chain model and found that the unfolding and or identical domains.4,6,7,9,10,13,14 For these homo- extension these domains dissipates 390(^30) kJ/ geneous polyproteins, sequential rupture peaks mol and 460(^30) kJ/mol, respectively. If these occur at an approximately constant force, demon- energies arise predominantly due to chain entropy strating that their structural homogeneity trans- restrictions that are distributed evenly among the lates into mechanical homogeneity (e.g. see force-hidden residues in the domain (both con- Figures 1B and C, and 2 of Li et al.9). Five of these tour-length changes and inspection of the Ig struc- polyproteins are comprised of structurally well- ture suggest 75 residues are liberated upon characterized domains that rupture at measurably unfolding7), we find they correspond to confor- high forces. We have focused our investigations mational entropies of 17.4(^1.3) J/(mol K residue) on these well-defined systems. and 20.6(^1.3) J/(mol K residue), respectively. Backbone Conformational Entropy of Folding 647

Fernandez and co-workers have investigated the ordered extended state requires that the poly- issue of “force-hidden” residues by constructing peptide chain is pulled to nearly full extension. several mutant I27 polyproteins containing Consistent with this argument, the reported peak- unstructured penta-glycine loops.7 In two of the to-peak extension of the I27 domain, 24(^1) nm,6 three mutant proteins, the loops are inserted in is quite close to the ,26 nm we calculate for full inter-domain linker regions that are force-bearing extension. Similarly, the reported change in exten- and are thus extended prior to the unfolding of sion length upon the insertion of five glycine the first domain. Consistent with this suggestion, residues, ,1.9 nm,7 is consistent with the calcu- these insertions alter neither the contour length lated full extension of ,1.8 nm. If, however, the change nor the energy dissipated upon the exten- unfolded domain has not been pulled to full sion of individual domains.7 The third mutation extension prior to the rupture of the next (I27 < 75), in contrast, inserts a loop within the domain, not all of the conformational entropy force-hidden region of the domain’s structure, will have been pulled out of the backbone thus thus increasing both the protein’s pulling-induced lowering estimates of the entropy’s magnitude. change in contour length by the ,1.8 nm expected Moreover, because force increases very steeply for a five residue insertion7. The insertion also near full extension, pulling that is incomplete by increases the observed energy dissipation. The even 10–20% will significantly reduce the esti- area under the force–extension curve for a single mated total entropy loss. The low rupture forces domain of this mutant is 420(^30) kJ/mol. If we of T4 lysozyme and a-spectrin could give rise to assume this energy arises solely due to exten- such incomplete extension: because of their sion-linked reduction of the average confor- reduced mechanical strength, the second domain mational entropy of each of the 80 force-hidden may rupture prior to the complete extension of residues in the protein, we find it corresponds the first unfolded domain. Unfortunately, how- to an entropy of 17.6(^1.3) J/(mol K residue). ever, peak-to-peak extension distances of suffi- The difference in energy dissipation between cient accuracy to test this hypothesis have not the wild-type I27 and the I27<75 mutant is been reported. For example, while the reported 28(^3) kJ/mol and corresponds to an entropy ,30 nm average extension length of T4 change of 18.8(^2.0) J/(mol K) for each of the lysozyme10 is, as predicted, rather less than the five additional glycine residues. expected full extension of ,34 nm, it is difficult T4 lysozyme is a highly stable, predominantly to determine whether this difference is significant a-helical protein. Bustamante, Dahlquist and relative to the experiment’s likely confidence co-workers have used solid-phase synthesis to intervals. Achieving peak-to-peak distance covalently couple a two-cystine variant of the measurements of the requisite accuracy and pre- protein into a homogeneous polyprotein.10 The cision should prove a fruitful area for future mechanical stability of the T4 lysozyme poly- AFM studies. protein is quite low, exhibiting an average rupture If the reduced energy dissipation associated with force of only 64(^16) pN.10 The energy dissipated the helical proteins arises due to incomplete pull- during the mechanical unfolding and extension ing, any entropy not “pulled out” during the of a single “domain” in this polyprotein is, at sequential unfolding of these less stable domains 207(^50) kJ/mol, also rather low and corresponds should pull out after all of the domains have to a change in backbone conformational entropy unfolded and the entire polypeptide is stretched of only 6.7(^1.6) J/(mol K) for each of the 103 to ,200 pN. Thus, integration of the total energy residues between the cystine coupling sites. dissipated by the extension of the entire construct The polyprotein comprised of the predominantly should provide a means of calculating the confor- helical a-spectrin domain exhibits reduced mational entropy of the unfolded, helical poly- mechanical stability. This three-helix bundle pro- proteins. Unfortunately, however, at least two tein ruptures at an average force of approximately issues render this approach problematic. First, 30 pN, liberating 106 force-hidden residues.4 This non-specific adsorption contributes significantly to relatively low rupture force corresponds to a sig- the total energy dissipated during the pulling of nificant reduction in the energy required to unfold many proteins10 and would “contaminate” the and extend the polyprotein: the energy dissipated total energy dissipated (there is no evidence that it in mechanical unfolding experiments is only similarly affects the unfolding of any one domain 100(^10) kJ/mol, corresponding to an extension- after the first domain ruptures).9 Second, each induced change in backbone entropy of only individual rupture peak represents a known 3.2(^0.3) J/(mol K) for each of the force-hidden number of residues. Given the random nature of residues in the domain. substrate and tip attachment, however, it is diffi- Why is the putative entropy change upon cult to ascertain the total number of amino acid unfolding and extension of the helical polyproteins residues being “liberated” as the full-length con- so much lower than that of the predominantly struct is extended. Without knowledge of this b-sheet polyproteins? The energy dissipated is number, the entropy loss per residue cannot be associated with the entropic difference between determined accurately. Addressing these issues the unfolded, random-coil state and the highly may also prove a fruitful area for future AFM ordered, extended state. Achieving a highly pulling studies. 648 Backbone Conformational Entropy of Folding

Discussion high as to cause significant bond distortion. (Alternatively, the seeming coincidence that com- Given the assumption that backbone confor- plete extension is achieved immediately prior to mational entropy losses account for the vast significant bond distortion occurs could reflect majority of the energy dissipated during mechani- evolutionary pressures aimed at optimizing the cal unfolding, we estimate backbone confor- mechanico-chemical properties of these natural ^ mational entropy losses of ,19( 2) J/ “shock absorbers”.) This, in turn, predicts that (mol K residue) for the three predominantly pulling speeds sufficient to rupture proteins at b-sheet polyproteins that rupture at high pulling forces significantly in excess of 200 pN will pro- forces. If pulling proceeds to near full extension duce contour lengths greater than those observed (i.e. to near zero entropy), this estimate corre- at lower forces and thus lead to increases in bond sponds to the absolute backbone conformational distortion energy and the consequent “over- entropy of the unfolded state. As such, it is exceed- estimation” of the backbone entropy. Conversely, ingly close to previous theoretical and semi-empiri- we predict that when pulling speeds of sufficient 16 – 25 cal estimates of this value. The estimated rapidity are achieved that the mechanically less backbone conformational entropy lost during the stable polyproteins rupture ,200 pN, they too extension of two predominantly helical proteins, will reach full extension and will exhibit entropy which rupture at much lower force levels, is sig- losses of ,19 J/(mol K residue). While extra- nificantly less than would be expected if full exten- polation of the existing force–extension curves for sion is achieved. Either previous estimates of the T4 lysozyme and a-spectrin to 200 pN support backbone conformational entropy are significantly this suggestion (our unpublished results), the in error or, as suggested above, the entropic short- available data do not allow us to rigorously test fall arises due to the incomplete extension of these these predictions. mechanically less stable proteins. These estimates of the conformational entropy losses associated with the extension of poly- Other sources of energy dissipation proteins rely critically on the assumption that the In order to assign the energy dissipated during area under the force–distance curve corresponds pulling to the conformational entropy of the entirely to the entropic cost of extending a random- unfolded state, the analysis presented here relies coil polypeptide. While there are several reasons on the assumption that overcoming the confor- why this assumption may fail, we believe that it is mational entropy of the unfolded backbone is the a reasonable approximation and that the close cor- only significant energetic contributor. While the respondence between these results and previous, excellent fit of the worm-like chain model to theoretical and semi-empirical estimates of back- force–extension curves supports this assumption bone conformational entropy is not coincidental. strongly, several other potentially significant effects Here, we discuss this matter in detail. may contribute to the energy dissipated during unfolding and extension. These include irreversible Pulling speed dependence work, residual unfolded state structure, the energy stored in force-induced bond distortions and the It is well established that the force required to activation energy of the unfolding process. We dis- rupture a domain (and thus the area under a cuss below evidence suggesting that these effects force–extension curve) depends on the speed with do not alter the conclusions described here signifi- which the protein is pulled.3,6,26 How, then, do we cantly. Perhaps critically, we note that each of account for this in terms of conformational entropy these putative effects would add to the energy dis- contributions? Data on the helical polyproteins sipated and would thus lead to the overestimation, suggest that pulling speeds slow enough to induce rather than the possible underestimate noted for rupture at applied forces significantly below the helical polyproteins, of the entropic cost of 200 pN unfold the subsequent domain before full backbone extension. extension (and thus complete loss of conformation- al entropy) is achieved. But what of pulling speeds Irreversible work so rapid that rupture forces are greatly increased? As the applied force increases, the distortion of Because pulling is a non-equilibrium process, it covalent bonds becomes a significant contributor is possible that some of the area under the force– to energy dissipated during extension, thus leading extension curve is due to the occurrence of to an overestimate of the entropic cost of pulling. irreversible work. This could include the viscous Nevertheless, the calculations described below drag of the protein and cantilever. Control experi- indicate that the energy stored in bond distortions ments with bare cantilevers suggest that the irre- at 250 pN is at most a small percentage of the total versible work performed by the cantilever is energy dissipated. We recognize our conclusions insignificant relative to the energy dissipated are thus on the basis of a fortuitous coincidence: at during pulling-induced unfolding (J.B.T., unpub- currently employed pulling speeds the b-sheet lished results). The reversibility of an AFM-protein polyproteins rupture at a force that is sufficiently unfolding experiment is demonstrated also by high to ensure near complete extension but not so experiments in which a pulling event is halted in Backbone Conformational Entropy of Folding 649 mid-extension and the extension force is relaxed will include the unfolding activation energy. The (J.B.T., unpublished results). The lack of measur- activation energy of pulling-induced unfolding able hysterisis in such an experiment demonstrates has been estimated for a number of domain types that the process is well approximated as reversible, and is thought to be an order of magnitude less and that irreversible work does not contribute sig- than the observed energy dissipation.7,12,30 More- nificantly to the energy dissipated. over, at finite pulling speeds, much of this acti- vation energy is supplied by random thermal Energy stored in bond-distortions fluctuations (which accounts for the pulling-speed dependence of rupture force).3,26 It is estimated If the pulling force is sufficiently great, enthalpy that, at currently employed pulling speeds, the changes will occur due to bond length and angle unfolding activation energy is significantly less distortions. In a multi-domain protein, however, than that required in the absence of applied force the unfolding of the next domain provides a (D. Makarov, personal communication).10,30 Ignor- natural “weak link” that prevents pulling forces ing the activation energy thus leads to a ,10% from becoming greater than the rupture force. At a overestimation of the backbone conformational rupture force of 250 pN, the length of a typical entropy. carbon–carbon single bond (force constant On the basis of these arguments (and the excel- ,200 kJ/(mol nm)27) is distorted by only ,0.5%, lent fit of force–extension curves to the worm-like corresponding to a ,20 J/mol change in bond chain model), we believe that the assumption that energy (our unpublished results). For a 100 residue the energy dissipated during pulling experiments domain (three bonds/residue) this distortion corresponds to the entropic cost of extending the equates to ,6 kJ/mol, two orders of magnitude random-coil unfolded state is reasonably accurate. less than the energy dissipated in the relevant pull- To the extent that this assumption does not hold, ing experiments. Thus force-induced bond distor- we believe the estimate of the change in confor- tions would lead to no more than a 1–2% mational entropy described above will be, at most, overestimation of the entropic cost of pulling. of the order of a 10% overestimate.

Residual interactions in the denatured state Comparison with theory Our analysis requires that the unfolded state is a While these estimates of the entropy associated fully hydrated random-coil conformation lest with the mechanical extension of proteins could changes in solvent entropy or the energy required be coincidental (i.e. the non-entropic sources of dis- to disrupt favorable enthalpic interactions contri- sipated energy could be offset by incomplete exten- bute to the putative entropic cost of extension. sion so as to produce the observed value When unfolded in water, however, the denatured fortuitously), the above arguments suggest that states of a number of proteins have been reported the approach is well founded. To the extent that it to contain residual structure.21,24,28 Fortunately, is well founded, the pulling experiments per- several lines of evidence suggest that this putative formed using predominantly b-sheet polyproteins structure will not contribute significantly to the provides a direct measurement of the backbone energy dissipated during pulling. For example, conformational entropy of the unfolded state. Con- simple point mutations or the presence of even sistent with this suggestion, the observed moderate concentrations of denaturants abolish 19(^2) J/(mol K residue) average entropy loss residual denatured state structure and cause agrees relatively well with previous computational the polypeptide to adopt a random-coil and semi-empirical estimates of this fundamental configuration.29 Critically, calorimetric studies thermodynamic parameter. demonstrate that the unfolded state populated in Direct, experimental estimates of the backbone water (where the residual structure is sometimes conformational entropy of the unfolded state have observed) is thermodynamically equivalent to remained elusive. In the past, three approaches these fully hydrated, random-coil configurations.23 have been used to evaluate this term: (1) purely This implies that, if residual denatured state struc- computational methods aimed at enumerating the ture is populated, the entropic cost or gain (of conformations accessible to an unfolded both solvent and chain) associated with its disrup- polypeptide;31 (2) semi-empirical dissections tion must coincide with compensatory enthalpic aimed at subtracting the contributions of con- changes. The disruption of this putative residual founding factors from experimentally observed denatured state structure thus cannot contribute folding entropies;22,23 and (3) model-dependent significantly to the energy dissipated during interpretation of NMR relaxation data.21,24,25 While extension. these estimates range from 12 J/(mol K residue) to 30 J/(mol K residue), the majority cluster around , Activation energy of unfolding the median value of 20 J(/mol K residue) and are thus in excellent agreement with the experimental At any non-zero pulling speed, the force- values reported here. induced unfolding of a protein only approximates Computational approaches to the determination a reversible process and thus the energy dissipated of backbone conformational entropy date back 650 Backbone Conformational Entropy of Folding some five decades. In 1951 Pauling & Cory were with nanosecond to picosecond bond vector among the first to attempt to enumerate the motions along the backbone. Alexandrescu and number of conformations a polypeptide can co-workers report that, depending on the model adopt, arriving by inspection at a value corre- employed, these motions appear to account for sponding to 72 conformations/residue (compar- 13 J/(mol K residue), 18 J/(mol K residue) or 23 J/ able to a backbone entropy of 36 J/(mol K) if all (mol K residue) lost during the folding of conformations are populated equally).16,17 In 1955 S-peptide.25 Kay and co-workers report that these Schellman critically reviewed this analysis and motions account for 15–20 J/(mol K residue) of proposed that the conformational entropy of the conformational entropy in unstructured residues polypeptide backbone must be significantly less in the unfolded states of drkSH3 and staphylo- than 30 J/(mol K residue) and significantly greater coccal nuclease.21,24 In contrast, these motions than 12 J(/mol K residue).17 In 1965 Nemethy & account for only 6–12 J/(mol K residue) of confor- Scheraga, in turn, used early computer simulations mational entropy for residues thought to partici- to estimate that glycine can adopt 21 distinct con- pate in residual denatured-state structure in these formations and non-glycine residues seven, corre- same proteins. While the latter values are signifi- sponding to conformational entropies of 25 J/ cantly lower than the entropy estimate reported (mol K) and 16 J/(mol K), respectively, if these con- here, force-induced extension could easily disrupt formations are populated reasonably equally.18 this structure and thus abolish the discrepancy. Wang & Purisima have more recently employed The discrepancy could alternatively arise from the more detailed simulation methods to estimate the contribution of slower than picosecond motions to backbone conformational entropy associated with the backbone conformational entropy of these the folding of a residue at the center of a helix at residues. 21 J/(mol K).20 Freire and co-workers have simi- larly employed detailed simulations to estimate the conformational entropy of glycine and used an Conclusions experimental measure of the difference in confor- mational entropy between glycine and the other While theoretical and semi-empirical estimates amino acids to estimate conformational entropies.19 of the backbone conformational entropy of the They derive a value of 16.9(^0.2) J/(mol K resi- unfolded state vary widely, most cluster around due) for the conformational entropy of alanine the median value of ,20 J/(mol K residue). The in an unfolded polypeptide. Yang & Honig, in energy dissipated during mechanical unfolding contrast, have used computational methods to and extension of three mechanically stable, pre- estimate that the entropic cost of fixing alanine dominantly b-sheet polyproteins corresponds dihedral angles into a helical arrangement is closely to this theoretically predicted entropic cost. ,28 J/(mol K residue)32 but we observe no evi- These results offer support for both previous theor- dence in support of this rather higher estimate. etical and semi-empirical estimates of the backbone A number of groups have used semi-empirical conformational entropy loss upon folding and of methods to distinguish the backbone confor- the previously hypothesized contribution of this mational entropy contribution from other com- conformational entropy to the energy dissipated ponents of folding thermodynamics. Provided the during mechanical unfolding. folded state equates to a complete loss in backbone entropy, this number provides an estimate of the backbone conformational entropy of the unfolded Materials and Methods state. Privilov and others have noted that at high In order to determine the energy dissipated during a temperatures the entropy and enthalpy of pulling event, we presume a single unfolded domain in hydration are expected to vanish. By extrapolating isolation with a force versus extension curve that is thermodynamic parameters determined at described by the worm-like chain model. Mean rupture moderate temperatures into this regime they esti- force, persistence length and contour length were taken mate that the backbone conformational entropy of as reported in the relevant experimental literature. The folding is ,15 J/(mol K residue) at 393 K.26 Zhang energy dissipated during the rupture and extension of et al. have similarly used computational decompo- the single domain was then determined by integrating sition to estimate that the average backbone confor- the worm-like chain model from the relaxed state up to mational entropy of folding at room temperature is the experimentally determined average rupture force using the Marko & Siggia interpolation formula:33 22(^2) J/(mol K residue).22 ! More recently, several groups used simple 22 kBT Rz 4Rz models of bond vector motion to correlate NMR- FðRzÞ¼ 1 2 21 þ ð1Þ 4L L L derived order parameters with the backbone con- P C C formational entropy of folded and unfolded where F(R ) is the force observed at extension length R , drkSH3, staphylococcal nuclease and the S-peptide z z and LP and LC the persistence and contour lengths, of ribonuclease. These calculations, which rely respectively. This approximation of the worm-like critically on the validity of the models used to esti- chain model has been adopted almost universally as a mate order parameters, provide a means of means of describing force–extension curves of the measuring the conformational entropy associated type described here.1–15 We have employed this Backbone Conformational Entropy of Folding 651 approximation because it provides a ready means of titin measured by atomic fore microscopy. Biophys. J. determining average energy dissipation (and confidence 75, 3008–3014. intervals) given the previously reported data upon 4. Reif, M., Pascual, J., Saraste, M. & Gaub, H. E. (1999). which our analysis relies. Admittedly, this approach is Single molecule force spectroscopy of spectrin less satisfactory than direct, model-free integration of repeats: low unfolding forces in helix bundle. J. Mol. the observed force–extension curves. It is, nevertheless, Biol. 286, 553–561. sufficiently accurate for the analysis at hand; although 5. Fisher, T. E., Oberhauser, A. F., Carrion-Vazquez, M., equation (1) is only an approximation for the worm-like Marszalek, P. E. & Fernandez, J. M. (1999). The chain34 (which itself is an approximate, continuous study of protein mechanics with the atomic force model of the polypeptide), integrated worm-like chain microscope. Trends Biochem. Sci. 24, 379–384. fits produce energies effectively identical with those 6. Carrion-Vazquez, M., Oberhauser, A. F., Fowler, S. B., obtained by directly integrating the force–extension Marszalek, P. E., Broedel, S. E., Clarke, J. & data (our unpublished results). All temperatures were Fernandez, J. M. (1999). Mechanical and chemical assumed to be 298 K. unfolding of a single protein: a comparison. Proc. The average distance between rupture peaks (which is Natl Acad. Sci. USA, 96, 3694–3699. not equivalent to the change in contour length) has been 7. Carrion-Vazquez, M., Marszalek, P. E., Oberhauser, < reported for the I27, I27 75 and T4 lysozyme A. F. & Fernandez, J. M. (1999). Atomic force 6,7,10 polyproteins. For these three polyproteins, we calcu- microscopy captures length phenotypes in single lated the expected extension length, DE, using the proteins. Proc. Natl Acad. Sci. USA, 96, 11288–11292. formula: 8. Carrion-Vazquez, M., Oberhauser, A. F., Fisher, T. E., Marszalek, P. E., Li, H. B. & Fernandez, J. M. (2000). D ¼ N ð : = Þ 2 D ð Þ E fh 0 36 nm residue N 2 Mechanical design of proteins-studied by single- molecule force spectroscopy and protein engineer- where N is the number of force-hidden residues fh Prog. Biophys. Mol. Biol. 74 (residues not under tension prior to domain rupture) ing. , 63–91. 9. Li, H. B., Oberhauser, A. F., Fowler, S. B., Clarke, J. & liberated upon unfolding and DN is the distance between the first and last force-hidden residues in the Fernandez, J. M. (2000). Atomic force microscopy (measured using PDB file 1TIT or taken from the reveals the mechanical design of a modular protein. literature10). N values were taken from the literature. Proc. Natl Acad. Sci. USA, 97, 6527–6531. fh 10. Yang, G. L., Cecconi, C., Baase, W. A., Vetter, I. R., For I27, DN is measured at ,0.8 nm and for the mutant I27<75 (where the relevant distance is that between the Breyer, W. A., Haack, J. A. et al. (2000). Solid-state ends of the polyglycine insert) it is assumed to be synthesis and mechanical unfolding of polymers of ,0 nm. The native-state separation between the linkage T4 lysozyme. Proc. Natl Acad. Sci. USA, 97, 139–144. sites in the T4 lysozyme polyprotein is 3.2 nm.10 The 11. Oberhauser, A. F., Marszadek, P. E., Erickson, H. P. & factor 0.36 nm/residue reflects the translation per amino Fernandez, J. M. (1998). The molecular elasticity of acid residue in a fully extended polypeptide.35 Due to tenascin, an extracellular matrix protein. Nature, 313, mild steric clashes, a fully extended polypeptide chain 181–185. is rarely observed in native proteins (i.e. in the absence 12. Craig, D., Krammer, A., Schulten, K. & Vogel, V. of applied force). At an applied force of 200 pN, how- (2001). Comparison of the early stages of forced ever, simulations with the CHARMM potential27 suggest unfolding for fibronectin type III modules. Proc. Natl this value is reasonable (our unpublished results). Acad. Sci. USA, 98, 5590–5595. 13. Best, R. B., Li, B., Steward, A., Daggett, V. & Clarke, J. (2001). Can non-mechanical proteins withstand force? Stretching barnase by atomic force microscopy and simulation. Biophys. J. 81, Acknowledgements 2344–2356. 14. Oroudjev, E., Soares, J., Arcidiacono, S., Thompson, The authors acknowledge critical commentary J. B., Fossey, S. A. & Hansma, H. G. (2002). Segmen- provided by Dmitrii Makarov, Rohit Pappu, Trevor ted nanofibers of spider dragline silk: atomic force Creamer, Hue Sun Chan and Tanniemola microscopy and single-molecule force spectroscopy. Liverpool, and thank Mario Viani and Hong Bin Proc. Natl Acad. Sci. USA, 99, 6460–6465. Li for providing access to unpublished data. 15. Thompson, J. B., Kindt, J. H., Drake, B., Hansma, Portions of this work were supported by NIH H. G., Morse, D. E. & Hansma, P. K. (2001). Bone grant R01GM62868-01A2 (to K.W.P.). indentation recovery time correlates with bond reforming time. Nature, 414, 773–776. 16. Pauling, L. & Corey, R. (1951). Configurations of polypeptide chains with favored orientations around References single bonds: two new pleated sheets. Proc. Natl 1. Kellermayer, M. S., Smith, S. B., Granzier, H. L. & Acad. Sci. USA, 37, 729–740. Bustamante, C. (1997). Folding–unfolding transitions 17. Schellman, J. A. (1955). The stability of hydrogen- in single titin molecules characterized with laser bonded peptide structure in aqueous solution. Trav. tweezers. Science, 276, 112–116. Lab. Carlsberg Ser. Chim. 29, 230–259. 2. Reif, M., Gautel, M., Oesterhelt, F., Fernandez, J. M. 18. Nemethy, G. & Scheraga, H. A. (1965). Theoretical & Gaub, H. E. (1997). Reversible unfolding of indi- determination of sterically allowed conformations of vidual titin immunoglobulin domains by AFM. a polypeptide chain by a computer method. Science, 276, 1109–1112. Biopolymers, 3, 155–184. 3. Reif, M., Gautel, M., Schemmel, A. & Gaub, H. E. 19. D’Aquino, J. A., Gomez, J., Hilser, V. J., Lee, K. H., (1998). The mechanical stability of immunoglobin Amzel, L. M. & Freire, E. (1996). The magnitude of and fibronectin III domains in the muscle protein the backbone conformational entropy change in 652 Backbone Conformational Entropy of Folding

protein folding. Proteins: Struct. Funct. Genet. 25, 27. Brooks, B. R., Bruccoleri, R. E., Olafson, B. D., States, 143–156. D. J., Swaminathan, S. & Karplus, M. (1983). 20. Wang, J. & Purisima, E. (1996). Analysis of thermo- CHARMM—a program for macromolecular energy, dynamic determinants in helix propensities of non- minimization, and dynamics calculations. J. Comput. polar amino acids through a novel free energy Chem. 4, 187–217. calculation. J. Am. Chem. Soc. 118, 995–1001. 28. Dyson, H. J. & Wright, P. E. (1998). Equilibrium NMR 21. Yang, D. & Kay, L. E. (1996). Contributions to confor- studies of unfolded and partially folded proteins. mational entropy arising from bond vector Nature Struct. Biol. 5, 499–503. fluctuations measured from NMR-derived order 29. Millett, I. S., Doniach, S. & Plaxco, K. W. (2002). parameters: application to protein folding. J. Mol. Towards a taxonomy of the denatured state: small Biol. 263, 369–382. angle scattering studies of unfolded proteins. Advan. 22. Zhang, C., Cornette, J. L. & Delisi, C. (1997). Consist- Protein Chem. In press. ency in structural energetics of protein folding and 30. Makarov, D. E., Hansma, P. K. & Metiu, H. (2001). peptide recognition. Protein Sci. 6, 1057–1064. Kinetic Monte Carlo simulation of titin unfolding. 23. Privalov, P. L. (1997). Thermodynamics of protein J. Chem. Phys. 114, 9663–9673. folding. J. Chem. Thermodynam. 29, 447–474. 31. Creamer, T. P. (2001). Conformational entropy in pro- 24. Yang, D., Mok, Y.-K., Forman-Kay, J. D., Farrow, N. A. tein folding. In Protein Folding and Stability (Murphy, & Kay, L. E. (1997). Contributions to protein entropy K. P., ed.), 2nd edit., pp. 117–132, Humana Press, and heat capacity from bond vector motions Totowa, NJ. measured by NMR spin relaxation. J. Mol. Biol. 272, 32. Yang, A. S. & Honig, B. (1995). Free-energy determi- 790–804. nants of secondary structure formation 1. Alpha- 25. Alexandrescu, A. T., Rathgeb-Szabo, K., Rumpel, K., helicies. J. Mol. Biol. 252, 351–365. Jahnke, W., Schulthess, T. & Kammerer, R. A. (1998). 33. Bustamante, C., Marko, J. F., Siggia, E. D. & Smith, S. N-15 backbone dynamics of the S-peptide from ribo- (1994). Entropic elasticity of l-phage DNA. Science, nuclease A in its free and S-protein bound forms: 265, 1599–1600. toward a site-specific analysis of entropy changes 34. Bouchiat, C., Wang, M. D., Allemand, J. F., Strick, T., upon folding. Protein Sci. 7, 389–402. Block, S. M. & Croquette, V. (1999). Estimating the 26. Viani, M. B., Schaffer, T. E., Paloczi, G. T., Pietrasanta, persistence length of a worm-like chain molecule L., Smith, B. L., Thompson, J. B. et al. (1999). Fast from force–extension measurements. Biophys. J. 76, imaging and fast force spectroscopy of single bio- 409–413. polymers with a new atomic force microscope 35. Pauling, L. & Corey, R. B. (1951). The pleated sheet, a designed for small cantilevers. Rev. Sci. Instrum. 70, new layer configuration of polypeptide chains. Proc. 4300–4303. Natl Acad. Sci. USA, 37, 251–256.

Edited by P. Wright

(Received 5 April 2002; received in revised form 17 June 2002; accepted 30 July 2002)