The Nature of Protein Folding Pathways S
Total Page:16
File Type:pdf, Size:1020Kb
PERSPECTIVE PERSPECTIVE The nature of protein folding pathways S. Walter Englander1 and Leland Mayne Johnson Research Foundation, Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104 Edited by Alan R. Fersht, Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom, and approved September 23, 2014 (received for review June 24, 2014) How do proteins fold, and why do they fold in that way? This Perspective integrates earlier and more recent advances over the 50-y history of the protein folding problem, emphasizing unambiguously clear structural information. Experimental results show that, contrary to prior belief, proteins are multistate rather than two-state objects. They are composed of separately cooperative foldon building blocks that can be seen to repeatedly unfold and refold as units even under native conditions. Similarly, foldons are lost as units when proteins are destabilized to produce partially unfolded equilibrium molten globules. In kinetic folding, the inherently cooperative nature of foldons predisposes the thermally driven amino acid-level search to form an initial foldon and subsequent foldons in later assisted searches. The small size of foldon units, ∼20 residues, resolves the Levinthal time-scale search problem. These microscopic-level search processes can be identified with the disordered multitrack search envisioned in the “new view” model for protein folding. Emergent macroscopic foldon–foldon interactions then collectively provide the structural guidance and free energy bias for the ordered addition of foldons in a stepwise pathway that sequentially builds the native protein. These conclusions reconcile the seemingly opposed new view and defined pathway models; the two models account for different stages of the protein folding process. Additionally, these observations answer the “how” and the “why” questions. The protein folding pathway depends on the same foldon units and foldon–foldon interactions that construct the native structure. protein folding | hydrogen exchange | protein structure Proteins must fold to their active native state unique native state through multiple un- Kinetic folding intermediates seemed to form when they emerge from the ribosome and predictable routes and intermediate con- asynchronously over a range of time scales. when they repeatedly unfold and refold during formations. Another prominent inference Equilibrium analogs of folding intermediates their lifetime (1, 2). The folding process is configured the Anfinsen thermodynamic called molten globules yielded mixed results, difficult (3, 4) and potentially dangerous (5). hypothesis (6) in terms of a funnel-shaped sometimes agreeing with kinetic folding in- Biological health depends on its success and energy landscape diagram (Fig. 1B), which formation and sometimes not. Baldwin’sar- disease on its failure. However, more than 50 y pictures that proteins must fold energeti- ticle served to alert the experimental protein after the formative demonstration that protein cally downhill (the Z axis) and shrink in folding community to the new view of het- folding is a straightforward biophysical pro- conformational extent (the generalized XY erogeneous folding and helped to establish cess (6), there is not general agreement on plane) as they go (9, 12–14). To fill out the the current paradigm of a multipath fun- the overarching questions of how proteins fold landscape picture, classical rate-determining neled energy landscape. and why they fold in that way. Given this kinetic barriers are often replaced by qual- The distinction between the classical view uncertainty, one is not sure how to even think itative concepts such as ruggedness, frus- of a more or less single pathway through about many related biophysical and biological tration, and traps, and major species by defined intermediates and the disordered problems. deep wells, all forming a kind of metalan- many-pathway new view has broad signifi- Early in the history of the folding field, guage known as “energy landscape theory” cance for the understanding of protein bio- experimentalists simply assumed that proteins (10, 15–17). The graphic funnel picture is physics and biological function. The question fold through distinct intermediate states in a a generic representation, independent of could be resolved by determining experimen- distinct pathway (Fig. 1A), as seen for a classical structural and thermodynamic detail and tally the structure of the intermediate forms biochemical pathways. Following Anfinsen’s equally applicable to any protein, RNA, or that bridge between unfolded and native demonstration that proteins can fold all by other compact polymer. Although it pro- states in real proteins, but this effort has themselves without outside help (6), Levinthal vides no constraints that would exclude turned out to be exceptionally difficult. The perceived that no undirected folding process any realistic folding scenario, even a defined usual methods, crystallography and NMR, wouldbeabletofindthenativestructureby pathway model, it has been widely inter- cannot define partial structures that form and random searching through the vast number of preted to require that proteins fold through decay in less than 1 s. Experimentalists have structural options (3, 4). Proteins must solve many independent pathways. been forced to depend on spectroscopic the problem, he believed, by folding through R. L. Baldwin took up the challenge and methods (fluorescence, CD, IR) that can fol- predetermined pathways, although one had led the field in a multiyear effort to exper- low kinetic folding in real time but are blind no clue how or why that should occur. imentally define kinetic folding intermediates to the specifics of structure and so allow the A realization of the inability to equilibrate and pathways (18–20). In a thoughtful pro- to a common structure (3, 4) and the en- tein folding review 20 y ago, Baldwin consid- Author contributions: S.W.E. and L.M. wrote the paper. semble nature of partially folded forms led ered the disparate insights available at the The authors declare no conflict of interest. the theoretical community to a very differ- time from both theory and experiment This article is a PNAS Direct Submission. “ ” – ent more statistical new view (7 11). It (21). He highlighted uncertainties in the ex- 1To whom correspondence should be addressed. Email: engl@mail. was inferred that proteins must fold to their perimental evidence for classical pathways. med.upenn.edu. www.pnas.org/cgi/doi/10.1073/pnas.1411798111 PNAS | November 11, 2014 | vol. 111 | no. 45 | 15873–15880 Downloaded by guest on September 29, 2021 A B Collapse encounter a sizeable kinetic barrier and so reach significant population. TS Entropy Initial results obtained for cytochrome c (Cyt c; Fig. 2) showed that approximately half of the molecules form their sequentially remote but structurally contiguous N- and C-terminal helical segments early (12 ms), Molten globule suggesting the formation of a specific on- U search states Q pathway native-like intermediate. However, Energy Baldwin’s review (21) emphasized the asyn- Transition state chrony in these kinetic results; some of the Folding molecules protect their N- and C-terminal intermediates helices early, whereas others do so at later N 1.0 times, along with other regions. Other proteins Native similarly studied have often yielded analogous Fig. 1. (A) The classical view of a defined folding pathway, and (B) the new view of multiple routes through a funneled results. This heterogeneousbehaviorconflicts landscape. Reprinted with permission from ref. 13. Dashed line in A illustrates the insertion of an optional error-dependent with a well-defined sequential pathway model kinetic barrier, which can affect some population fraction and not others and thus mimic multipathway folding. but seems more consistent with the new view of different routes, rates, and traps. possibility of alternative folding mechanisms. energy space can be determined by HX la- One now knows that the heterogeneous Theorists have attempted to avoid these dif- beling, sulfhydryl labeling, and NMR meth- folding seen in kinetic HX pulse labeling ficulties by simulating the folding process in ods. Partially unfolded molten globules can experiments can be due to previously un- recognized experimental issues. One prob- computers. Theory-based computer simula- be labeled in a structure-sensitive way by lem concerns the tendency of refolding tions can be remarkably powerful. For exam- hydrogen–deuterium (H-D) exchange and proteins to transiently aggregate, especially ple, one can compute the path of a multiton analyzed later by site-resolved NMR in the at the high concentrations used to facilitate rocket through 150 million miles of free reformed native state or directly by mass the preparation of samples for NMR analy- space to a pinpoint landing on Mars. The spectrometry. sis (25, 26). Another unexpected effect was equations that govern space flight are known Theseadvancesnowmakeitpossibleto revealed in a sophisticated analysis of the precisely (22), computer power is ample, and determine the structure and properties of HX pulse labeling experiment, which showed the track to be controlled is clear. Computing intermediate protein folding states and their that intermediates populated during kinetic the structural journey of minuscule protein pathway connections and so place the study folding may repeatedly