Theoretical Perspectives on Biological Machines
Mauro L. Mugnai∗ Department of Chemistry, University of Texas, Austin, TX 78712 Changbong Hyeon† Korea Institute for Advanced Study, Seoul 02455, Republic of Korea Michael Hinczewski‡ Department of Physics, Case Western Reserve University, OH 44106 D. Thirumalai§ Department of Chemistry, University of Texas, Austin, TX 78712
Many biological functions are executed by molecular machines, which like man made motors consume energy and convert it into mechanical work. Biological machines have evolved to transport cargo, facilitate folding of proteins and RNA, remodel chromatin and replicate DNA. A common aspect of these machines is that their functions are driven by fuel provided by hydrolysis of ATP or GTP, thus driving them out of equilibrium. It is a challenge to provide a general framework for understanding the functions of bi- ological machines, such as molecular motors (kinesin, dynein, and myosin), molecular chaperones, and helicases. Using these machines, whose structures have little resem- blance to one another, as prototypical examples, we describe a few general theoretical methods that have provided insights into their functions. Although the theories rely on coarse-graining of these complex systems they have proven useful in not only accounting for many in vitro experiments but also address questions such as how the trade-off be- tween precision, energetic costs and optimal performances are balanced. However, many complexities associated with biological machines will require one to go beyond current theoretical methods. We point out that simple point mutations in the enzyme could drastically alter functions, making the motors bi-directional or result in unexpected dis- eases or dramatically restrict the capacity of molecular chaperones to help proteins fold. These examples are reminders that while the search for principles of generality in biology is intellectually stimulating, one also ought to keep in mind that molecular details must be accounted for to develop a deeper understanding of processes driven by biological machines. Going beyond generic descriptions of in vitro behavior to making genuine understanding of in vivo functions will likely remain a major challenge for some time to come. In this context, the combination of careful experiments and the use of physics and physical chemistry principles will be useful in elucidating the rules governing the workings of biological machines.
CONTENTS IV. Stochastic Kinetic Models 7 A. The Master Equation 8 I. Introduction 2 B. Thermodynamics 9 C. Rate of Entropy Production 10 arXiv:1908.11323v1 [physics.bio-ph] 29 Aug 2019 II. Structures 4 D. Fluctuation Theorems 10 A. Molecular motors 4 B. Catalytic Cycle 5 V. Molecular Motors – Models without Detachment 12 A. Molecular Motors are Processive Enzymes 12 III. Biological machines are active systems. 6 B. Processive Motor Velocity 13 C. Periodic Lattice Model 13 1. One-state Models 14 2. Multi-state, Uni-cycle Models 15 3. Multi-cycle Models 17 ∗ [email protected] D. Additional remark on non-equilibrium nature of † [email protected] motors 18 ‡ [email protected] E. Applications 19 § [email protected] 1. Myosin V and Kinesin-1 19 2
2. Dynein – an erratic motor 20
VI. Molecular Motors – Models with Detachment 20 A. Velocity Distribution 21 B. Alternate Models with Detachment 22
VII. Polymer Physics-based Approaches Incorporating Structural Features Into Kinetic Theories 23
VIII. Simulations using Coarse-Grained Models 28
IX. Cost-precision Trade-off and Efficiency of Molecular Motors 30 1. Cost-precision trade-off and its physical bound of molecular motors 30 2. Transport Efficiency 31
X. Molecular chaperones 33
XI. Universal characteristics of Helicases 36 FIG. 1 Illustration of the complexity of transportation of melanosomes, which are vesicles containing melanin. Both !1 XII. Discussion 38 dynein and kinesin-2 compete for the same binding site on A. Sometimes Details Matter 38 dynactin mediated by p150Glued. Depending on the func- B. Efficiency and optimality 38 tion (aggregation of melanosomes or their dispersion through- C. Specificity versus Promiscuity 39 out the cell) one or the other wins. When melanosomes are D. Biological complexity 39 dispersed in the cell they are transported by kinesin-2 and XIII. A Final remark 39 myosin V whereas when they aggregate dynein moves the cargo. Figure extracted from Soldati and Schliwa (2006). Acknowledgments 40
References 40 and abiotic systems. Because of functional demands in living systems, which also includes adaptation to chang- I. INTRODUCTION ing environmental conditions, it is virtually impossible to fully describe biology without evolutionary considera- The opening sentence of a perspective by Bustamante tions. Although not the focus of our perspective, the role (Bustamante et al., 2011) on the workings of nucleic acid of evolutionary constraints must also be integrated with translocases begins with the quote, “The operative indus- physical models in order to discover general principles try of Nature is so prolific that machines will be eventu- governing the functions of biological machines. ally found not only unknown to us but also unimaginable What are the characteristics of biological machines? by our mind”, attributed to Marcello Malpighi, who is First, there are many varieties of machines, all of which considered the founder of microscopic anatomy, histol- should be thought of as enzymes, which consume energy ogy, and embryology. This statement, made over three and perform mechanical work to carry out specific tasks. centuries ago, is even more relevant today. It is a re- A few of the machines that we consider here are kinesin, minder that mechanical forces must play a fundamental myosin, and dynein, which are collectively referred to as role in biology. In modern times this vast subject falls molecular motors (Belyy et al., 2014a; Block, 2007; Reck- under the growing field of mechanobiology. The perva- Peterson et al., 2018; Sun and Goldman, 2011a; Vale sive role of mechanics in living systems controls motil- and Milligan, 2000). These cytoskeletal motors trans- ity on all length scales, from motion of a single cell on port cargo by walking, almost always unidirectionally, a substrate and collections of cells to dynamics at the on filamentous actin and microtubules (MT). Under in molecular level. Cooperative interactions between var- vivo conditions the motors cooperate or there could be a ious modules at the molecular level is thought to con- tug-of-war in the process of transport (see Fig. 1) (Gross trol functions at the mesoscale. A number of complex et al., 2002; Levi et al., 2006). To illustrate the diversity dynamical processes such as transcription, translation, of motor-like functions and point out certain emerging transport of vesicles and organelles, folding of proteins general principles, we also provide theoretical descrip- and RNA, and chromosome segregation control the sus- tions for the functions of molecular chaperones, which tenance and growth of cells. At some level all these bio- assist in the folding of proteins and ribozymes (RNA en- logically important processes involve molecular machines, zymes) that cannot do so spontaneously, and helicases whose ability to perform their functions, often but always with multiple functions that includes separation of dou- with high efficiency, in a noisy crowded environment is ble stranded (ds) DNA strands. Both molecular chap- truly remarkable. It is worth remembering that the abil- erones and helicases bear no structural resemblance or ity to execute a variety of functions distinguishes living sequence similarity to molecular motors. Nevertheless, 3 we are convinced that by comparing the characteristics of these seemingly unrelated machines, integrating the- ory and experiments, unifying themes and differences be- tween them could be elucidated. Second, all these motors and others not covered here (for example polymerases, ribosomes, and packaging motors) are all multidomain proteins, whose architectures are spectacularly different. Despite the vastly different sequences, structures, and evolutionary origins it is indeed the case that all of these machines utilize some form of chemical energy (gener- ated by ATP or GTP hydrolysis) in order to amplify the small local conformational changes through their struc- tural linkages to facilitate large conformational changes for functional purposes. Such conformational amplifica- tions are examples of remarkable allostery or action at a distance, and could also be couched in terms of in- FIG. 2 Significant physical timescales associated with myosin V dynamics, from the coarse-grained polymer theory model formation transfer between structural subunits that are of Hinczewski et al. (2013). spatially well separated, which in some cases (dynein for example (Bhabha et al., 2014)) can be as large as 25nm. A large number of experiments have unveiled many of the details of how these machines move by converting ular dynamics simulations have been performed to get a chemical energy into mechanical work (see for example molecular picture of certain aspects of the functions of (Hartman et al., 2011; Spudich and Sivaramakrishnan, motors (Hwang et al., 2008, 2017) and other machines 2010) (De La Cruz and Ostap, 2004; Holzbaur and Gold- (see for example (Elber and West, 2010; Ma et al., 2000; man, 2010; Sweeney and Houdusse, 2010). A remarkable Stan et al., 2005), the current limitations of such ap- number of experimental methods have been developed proaches prevent them from making direct contact with to address various aspects of biological machines. These experiments. Improvements in the development of ac- include, but are not restricted to, ensemble experiments curate energy functions (also known as force fields) and (stopped flow and fluorescent labeling) that provide the enhancement in computer capacity to enable simulations much needed data on ATP hydrolysis and ADP release for long times will in the future bridge the gap between rates, single molecule experiments that yield dwell time what is currently possible and what is needed for realistic distributions in molecular motors, processivity and veloc- description of molecular machines. ity as a function of external loads in motors and helicases. The actions of molecular machines, like all biological In addition, a combination of ensemble experiments and processes, are complicated, involving cooperative dynam- determination of structures using X-ray crystallography ics on multiple time scales. This is illustrated using the and cryo-EM experiments has produced a vivid picture of typical time scales involved in a single step of myosin V the molecular basis of chaperone function. Of particular (Fig. 2). In many problems in physics it is the hoped that note are optical trap experiments, which have been used simple models, that capture the essence of the problem to obtain mean motor velocity as a function of a resis- can be created, which be can be solved, in order to ob- tive force and ATP concentration in kinesin, myosin, and tain insights into highly non-trivial systems. This strat- dynein, and the dependence of velocity and processivity egy is hard to implement for biological machines because in a number of helicases. No point would be served in the interaction energies are highly heterogeneous (like in reviewing the detailed results from these experiments as spin glasses), thus making it hard to construct a reason- there are many articles that the interested reader might able coarse-graining strategy. Nevertheless, in order to consult. make progress one has to devise tractable coarse-grained Our focus here is to describe theoretical approaches models, which can be either simulated or solved ana- that are rooted in a number of areas in physics in order to lytically (at least approximately). The efficacy of such understand principally outcomes of in vitro experiments. approaches can then be assessed by direct comparisons The theoretical approaches in these studies were devel- with experimental results, and their abilities to provide oped, and continue to be the focus of current research, insights into the mechanisms of their functions. Remark- in order to quantitatively explain the experimental ob- ably, inspired by advances in experiments in the last servations, and shed light on new puzzles that seem to decade, several theoretical models have been proposed, arise with ever improving advances in experimental tech- which have greatly contributed to our understanding of niques. Perhaps, the most detailed view of how biological molecular machines. These developments have occurred machines operate might be obtained from molecular dy- in different contexts, which belie the underlying unifying namics simulations. Although atomically detailed molec- principles. To bring these issues to the fore, we pro- 4 vide our perspectives on the application of these models, which should be thought of as coarse-grained network a b c d GroEL-GroES models or their generalizations. Such models have been Kinesin Dynein created for explaining not only the motility of motors, 23 nm but also the functions of molecular chaperones, and he- Myosin V licases. We focus on the applications of these theoretical ideas in order to account for the functions of these in- 14 nm trinsically non-equilibrium systems. Collectively, these approaches show that, by examining in detail the work- e Helicase ings of many machines, universal principles, both at the 8 nm 20 nm conceptual and practical levels, might emerge.
The literature on the functions of biological machines 10 nm is vast. Therefore, we restricted ourselves to only a few 35 nm topics that focus on theoretical approaches, which are FIG. 3 Schematic representations of the structures of six bio- sufficiently general that they can be applied to an ar- logical machines. Molecular motors, like conventional kinesin ray of problems in the field. Rather than describe many (a), dynein (b), and myosin V (c), show great variations in results in detail, we walk the the reader through a se- their structures and sizes. The motor heads in kinesin and lection of theoretical methods, which is necessarily bi- myosin V, shown in dark blue, bind directly to the micro- ased, so that she or he can access the literature read- tubule and actin, respectively (not shown) but in dynein the microtubule binding domain (light blue) is separated from the ily. Before getting to the details of this review we will motor domain (hexameric ring structures) by nearly twenty be remiss if we did not point out one prescient mono- five nanometers. (d) Structure of the GroEL in the symmet- graph (Howard, 2001) and two forward looking reviews ric state (Protein Data Bank (PDB) code 4PKO (Fei et al., (J¨ulicher et al., 1997; Kolomeisky and Fisher, 2007). The 2014)), which is the functional state in the presence of mis- monograph by Howard, written nearly twenty years ago, folded proteins. The blue and aqua colors represent the seven covers all aspects of cytoskeletal motors and is a land- fold symmetric GroEL rings. The red and the pink correspond mark in this field. It provides conceptual and practi- to the co-chaperonin GroES. (e) Structure of the DnaB he- licase with six subunits assembled as a ring. The figure was cal guidelines needed to understand the fundamentals created using the coordinates in the PDB code 4ESV (It- of motor mechanics. Questions of generality, such as sathitphaisarn et al., 2012). Structures (a)-(c) are adapted describing movement of a generic motor either in iso- from (Vale, 2003a). lation or as a collection, were addressed using princi- pally the Brownian ratchet model by Julicher, Adjari, and Prost (J¨ulicher et al., 1997), whereas Kolomeisky and myosin V share some structural similarities. and Fisher (Kolomeisky and Fisher, 2007) summarized the development and practical applications of stochas- tic kinetic models. Here, our focus is on the more re- A. Molecular motors cent developments and applications of physical princi- ples that have started to provide a unified perspective Let us first consider the three motors whose structures for not only molecular motors but a large of class of are schematically shown in Fig. 3 (a-c). Although there machines with vastly different functions. These new in- are differences between them, the parts list is roughly sights have become possible by expanding the scope of the same. These motors are dimeric. The nucleotide traditional stochastic chemical kinetics models for mo- binding sites are in the two motor heads, which are con- tors, helicases, and molecular chaperones, incorporation nected through a mobile linker (the lever arm in myosin) of polymer physics concepts to account for the architec- to a tail domain involved in dimerization and cargo- ture of motors, and use of coarse-grained models in sim- binding. Changes due to nucleotide binding and hydrol- ulating the stepping kinetics of motors and the dynamics ysis in the motor heads result in conformational changes of large scale allosteric transitions. in the linker that propel the motor on the cytoskele- tal filaments. For the motors to be processive, which means they take multiple steps before disengaging from II. STRUCTURES the track, one head has to be bound till the detached head rebinds to a site on the track. This involves communi- The structures of the biological machines that we con- cation between the heads and is referred to as gating, sider in our perspective (kinesin, myosin, dynein, the the origin of which is still not fully understood at the E. Coli. chaperonin machinery GroEL/GroES, and he- molecular level. licases) are shown in Fig. 3. Inspection of the figures Kinesin: There are at least forty five members be- shows that there are considerable variations in the ar- longing to the kinesin superfamily in mouse and human chitectures, although kinesin-1 (or conventional kinesin) genomes (Hirokawa et al., 2009). They are all micro- 5 tubule (MT) bound motors, which walk unidirectionally Chaperonins: The beautiful and unusual structure towards the plus end of the MT (for example kinesin-1) with seven fold symmetry of the E. Coli. chaperonin or to the minus end (Ncd motor). Kinesin-1 takes pre- machinery, consisting of a complex between GroEL and cisely 8.2 nm steps, which corresponds to the distance GroES, resembles an American football (Fig. 3d). The between two adjacent α/β tubulin dimers, which are the GroEL/GroES machine helps in the folding of recalci- building blocks of the MT. The size of kinesin-1 is a few trant proteins that do not fold spontaneously. The struc- nanometers whereas the length of the stalk is in the range ture in Fig. 3d (reported in (Fei et al., 2014)) is the func- of 30-40 nm. The kinesin-1 velocity depends on the con- tional state of this stochastic machine that is one of the centration of ATP, saturating at high values. The motor populated states during the catalytic cycle of GroEL in moves towards the plus end of the MT with maximal ve- the presence of misfolded substrate proteins (SPs). There locity of approximately 800 nm/s (Visscher et al., 1999), are two chambers in which the SPs could be sequestered. and is capable of resisting forces on the order of 7 pN The volume of each of the chambers in the structure in 3 (Carter and Cross, 2005; Visscher et al., 1999) (kBT = Fig. 3e is about 185,000 A˚ , which is about twice the vol- 4.1 pN nm where kB is the Boltzmann constant and T is ume in the relaxed structure in the absence of nucleotides the temperature).· and GroES, the co-chaperonin. The spectacular change in the chamber volume that occurs during the catalytic Myosin: The number of genes encoding for myosin mo- cycle, with accompanying alterations in the chemical na- tors in roughly thirty five (Sellers, 2000). The superfam- ture of the cavity interior, changing from hydrophobic ily of the actin bound myosins are divided into fifteen to polar, is the mechanism of annealing action of this classes (Hartman and Spudich, 2012; Sellers, 2000). With machine (Todd et al., 1996). the exception of myosin VI all other members of this fam- Helicases: Helicases, which are involved in all aspects ily walk towards the plus end of actin. The structure of of nucleic acid metabolism (Lohman, 1992; Lohman and myosin V (Fig. 3) shows that the motor heads are con- Bjornson, 1996), function by coupling nucleoside triphos- nected to the lever arms, which are made up of six IQ phate (NTP) hydrolysis, to either translocate on single motifs. The lever arm is 36 nm-long stiff unit (the strand nucleic acids (ssNAs) or unwind double-stranded persistence length exceeds≈ 100 nm), whose size is com- (ds) DNA. Depending on their sequences they are clas- mensurate with half the helical repeat length of F-actin. sified into six super families (SFs). The structural di- The maximal velocity of myosin V, whose motor head is versity of helicases can be appreciated by noticing that larger than kinesin-1, is roughly 500 nm/s (Baker et al., sequences classified under the SF1 and SF2 families are 2004) with a stall force between 2-3 pN (Kad et al., 2008; non-ring forming whereas those in SF3-SF6 form ring Mehta et al., 1999; Uemura et al., 2004a; Veigel et al., structures. The hexametric structure (Itsathitphaisarn 2002). et al., 2012) of the replicative helices (DnaB) from bacte- Dynein: Cytoplasmic dynein, discovered over fifty ria, which does belong to the AAA+ family, is shown in years ago (Gibbons and Rowe, 1965) and pictured in Fig. Fig. 3d. However, unlike GroEL (see Fig. 3d) DnaB has 3b, walks somewhat erratically with a broad step-size the expected crystallographically allowed six fold symme- distribution (DeWitt et al., 2012; Reck-Peterson et al., try. How the NTP chemistry is coupled to translocation 2006a) on the MT towards the minus end. The struc- and unwinding remains an outstanding unsolved theoret- tural features of dynein are different when compared with ical problem. other cytoskeletal motors (see Fig. 3). First, the motor head belongs to the class of AAA+ family, which means dynein must have evolved from a different lineage com- B. Catalytic Cycle pared with myosin and kinesin. Second, other AAA+ enzymes, such as bacterial chaperonin GroEL (Fig.3d) All machines undergo a catalytic cycle, not unlike and protein degradation machines, are oligomeric assem- man-made motors, in which fuel, usually in the form of blies. In contrast, the hexameric ring that constitutes ATP, bound to a nucleotide binding site (or sites) is hy- the motor domain assembles from a single polypeptide drolyzed. These events trigger conformational changes chain! Third, the size of the dynein motor head is sig- that produce motion, which in motors and helicases re- nificantly larger than those of kinesin and myosin. The sults in stepping on the polar tracks or translocation length of the motor head of dynein along its longest axis on single stranded nucleic acids. In chaperones the nu- is about 25 nm, which is in contrast to the diameter of cleotide chemistry is linked to conformational changes, kinesin, which is only about 5 nm. Finally, although which in turn perform work on the protein or ribozyme there are six nucleotide binding sites in dynein, hydroly- to be folded. The link between ATPase cycle and func- sis in only two (perhaps three) are relevant for its motil- tion is somewhat different in GroEL, which we describe ity. The velocity of dynein at 1 mM ATP is roughly 100 below. The catalytic cycle for myosin V in the simplest nm/s (Reck-Peterson et al., 2006b) with a stall force∼ that form, which suffices for our purposes, is reproduced in is 7pN (Gennerich et al., 2007; Toba et al., 2006). Fig. 4. There are four crucial steps (see Fig. 4). In ≈ 6
2016; Gladrow et al., 2016). These are the key features characterizing the out-of-equilibrium nature of biologi- cal machines driven by non-conservative forces. Conse- quently, molecular machines can be thought of as active systems, and hence a note on diffusive motion is war- ranted. Chemical free energy released upon hydrolysis of ATP or GTP is the driving force for directed motion of molecular motors. However, if the energy source is not explicitly modeled in the description of the associated dynamics, molecular motors could be regarded as self- propelled active particles. To our knowledge such an ap- proach has not been pursued to calculate experimentally measurable quantities, such as the force-velocity relation or the distribution of run lengths of motors. The dynamics of active particles is fundamentally dif- ferent from passive particles at equilibrium or moving FIG. 4 A simple representation of the catalytic cycle of under the influence of a conservative external field. For myosin V describing the stepping of the trailing head towards example, the mean drift velocity of a spherical colloidal the plus end of actin, which is the dominant pathway in the absence of resisting force. The main text describes the de- particle with a drag coefficient γ, and charge q subject to tails. As described later in this perspective there are four a constant electric field E is VD = qE/γ. This relation other pathways that have to be accounted for in order to is readily obtained by balancing the Stokes force (γVD) produce a quantitative theory of the stepping kinetics of this and the force exerted by the field (qE). The diffusion motor. The figure is reproduced from (Vale, 2003b) constant is D = kBT/γ, which is the Stokes-Einstein re- lation. Hence, the distribution of the particle position is given by the probability density, the initial state, ATP binds to the trailing head (TH), (x x V t)2 with ADP in the (LH). The premature release of ADP P (x, t) exp − 0 − D . (1) from the LH is slowed by rearward tension, which is an ∼ − 4Dt example of gating. Upon ATP binding to the TH the Starting from an initial position x0, the particle moves on interaction with F-actin is weakened, resulting in its de- average at velocity VD and the distribution of positions tachment from F-actin. During the diffusive search by 2 spreads with time as (δx) = 2(kBT/γ)t. Of particular the TH for the forward binding site, ATP is hydrolyzed, note is that D is definedh independentlyi of the particle producing ADP and the inorganic phosphate, P . In this i velocity VD, so that the magnitude of D is not altered by state, the TH binds to F-actin after which Pi is released increasing the field strength. Conversely, the velocity of from the new LH followed by ADP release from the TH, the particle does not depend on the ambient temperature and the cycle continues till the processive run ends. Of T . Thus, VD and D are mutually independent of one course, this simple description is incomplete because the another. rates for nucleotide binding vary depending on the nu- In stark contrast, for an active particle, whose motion cleotide concentration. Nevertheless, this simple reaction is powered by the internal fuel, the diffusivity is no longer cycle, whose main features hold for all myosin motors, is independent from the driving velocity. The effective sufficient to nearly quantitatively characterize many ex- 2 diffusion constant, Deff, defined as limt→∞ (δx) /2t, perimental observables (see below). The catalytic cycle depends on a set of non-thermal parametersh andi vio- for kinesin is similar except that the interaction between lates the FDT (Liu et al., 2011; Tailleur and Cates, the MT and kinesin is weakest if the motor head con- 2008). For transport motors exhibiting one-dimensional tains ADP. All other nucleotide states (the no nucleotide movement along the cytoskeletal filament, both V = apo, ATP bound state, the state with ADP and P ) bind 2 i limt→∞ d x(t) /dt and D = (1/2) limt→∞ d δx(t) /dt strongly to the MT. can be measuredh i directly using a given timeh tracei of a motor, which can be measured using single molecule opti- cal tweezer experiments. Indeed, such measurements on III. BIOLOGICAL MACHINES ARE ACTIVE SYSTEMS. kinesin-1 (Visscher et al., 1999) were used to obtain the velocity (V ) and diffusion coefficient (D) from the global Biological machines are non-equilibrium systems that analysis of the stepping trajectories. These values were are driven by non-conservative forces require constant used (Visscher et al., 1999) to estimate the randomness supply of energy. Therefore, it is not surprising that parameter r = 2D/d0V where d0( 8.2 nm) is the step detailed balance (DB) relation and the fluctuation- size of kinesin-1. When the diffusion≈ constant D is cal- dissipation theorem (FDT) are violated (Battle et al., culated from the randomness parameter with the knowl- 7 edge of V and d0, and D is compared with V measured ments. However, typically the number of observables are under the same conditions, it can be shown that D in- very few, making matching the predictions of the SKM creases monotonically with V (Hwang and Hyeon, 2017) to experiments is difficult. Furthermore, it is often dif- in contrast to passive diffusion. This result is a con- ficult to interpret the physical meaning of the extracted sequence of the enzyme catalytic turnover (Hwang and parameters in terms of the underlying motor architecture Hyeon, 2017). As already discussed the effective diffu- and the underlying biochemical cycle. As a consequence, sivity D for kinesin-1, which is an active particle whose it is necessary to strike a balance between the “minimal- dynamics is fueled by ATP hydrolysis free energy, does ity” of a model, which reduces the risk of over-fitting not obey the FDT. This illustration shows that these en- and increases the generality of SKMs and the predic- ergy consuming enzymes operate out of equilibrium. tive power, and the “comprehensiveness” of the network considered. This comes at some risk; for instance, one may be tempted to neglect certain transitions that are IV. STOCHASTIC KINETIC MODELS “fast” compared to others, and therefore are not expected to contribute to the overall phenomenology of the ma- In this section we discuss the development of stochas- chine. In addition, it may seem reasonable to ignore off- tic kinetic models (SKMs) as a means of describing the pathway, slow and rare transitions. On the other hand, in function of different types of molecular machines. As order to understand the function of molecular machines, stated above and shown explicitly in Fig. 4 for myosin experimentalists often probe their response to changes in V, all molecular machines go through cycles during which the environment (for instance modifying the ATP con- they hydrolyze ATP (or GTP) and perform a some func- centration), or by applying mechanical perturbations and tion. Bulk-kinetics, single-molecule experiments, and studying the response of the motor. As the environment structural studies have shown that the intermediates ex- or perturbations change, the nature of the “rate-limiting” plored by the molecular machines during the cycle have step as well as the likelihood of alternative stepping path- distinguishable structural or kinetic features and have ways might change. Therefore, a simplified description provided the rate for going from one state to the next. that ignores certain intermediate states might only work The temporal resolution in these studies is on the order for a limited set of experimental conditions, thereby re- of milliseconds [although new techniques enable the ex- ducing the number of measurements that can be used to ploration of time-scales as fast as 55 µs (Isojima et al., train the parameters or to falsify the predictions. Finally, 2016)]. It follows that the dynamics≈ of the molecular we note en passant that the determination of an appro- machine can be described as a discrete-state (the dis- priate catalytic cycle is predicated on a few experimental tinguishable intermediates should be experimentally ob- observables only, and it may not be complete enough for served), continuous-time Markov model – a framework all the states that a machine might sample during its that falls under the rubric of the “master equation.” In function. this model, the cycle of a molecular machine is reduced to a network of states connected by edges that identify the available transitions between the intermediates. Thus, by solving the dynamics associated with an appropriate network it is possible, in principle, to account for exper- imental observables in terms of the underlying network Another issue with SKMs is that structural informa- parameters. tion is incorporated in the model only in a very approxi- SKMs have a number of appealing features: (i) they mate way, normally by introducing a parameter that al- have a direct connection with the biochemistry associ- lows the rate to depend on the load applied according ated with the molecular machines; (ii) the rates that to the Bell model (Bell, 1978). This reduces the capa- constitute the model are measurable; (iii) it is some- bility of the model to incorporate the wealth of experi- times possible to find an analytical solution, and (iv) mental structural information, and decreases the predic- more complex networks may be solved numerically; (v) tive power of the proposed paradigm. However, in some finally, SKMs can be directly related with thermodynam- cases a tractable analytical theory that incorporates lever ics, which makes the model instructive and appeals to the arm structural information into a kinetic model is possi- interests, and intuition of a vast scientific community. ble (Hinczewski et al., 2013), as discussed in more detail Nevertheless, SKMs also have some shortcomings: first below. Despite these limitations, the framework underly- and foremost, they often have a large number of param- ing SKMs not only provides a convenient way to analyze eters. For instance, a simple model for a molecular ma- experiments, but also has been used to address concep- chine performing mechanical work against a fixed load tual questions related to the efficiency of biological ma- and described by a single-cycle network with N interme- chines. For these reasons, the theory underlying SKMs diate states has 4N 2 independent parameters, which is an integral part of describing the function of molecular have to be determined− by fitting to appropriate experi- motors. 8
A. The Master Equation
We assume that the system is described by N distin- guishable intermediate states. The probability of being in state i at time t is given by pi(t), and the time evolution of this probability is governed by the master equation,
dpi(t) X X = p (t)w p (t)w , (2) dt j ji − i ij j j where wij 0 is the rate for the transition i j, ≥ P → subject to the constraint i pi(t) = 1. Because of mi- croscopic reversibility, if w = 0 then w = 0. As ij 6 ji 6 t , the probabilities pi(t) become independent of time.→ ∞ This stationary solution of the master equation eq describes an equilibrium system (pi ) if detailed balance holds, that is if across all the edges of the network the eq eq net flux is zero: pi wij pj wji = ∆Jij = 0. However, dp /dt = 0 is also satisfied− by the less stringent relation FIG. 5 Example of kinetic network. The network has 6 states i and 7 edges [see panel (a)]. Panel (b): the partial graphs ori- P (pssw pssw ) = P ∆J = 0. Under these con- j i ij − j ji j ij ented in order to extract the stationary probability of state ditions the system could be in a non-equilibrium steady 1. Panel (c): There are three cycle fluxes, F (upper three state (NESS). An isolated system is expected to reach figures), B (bottom three), and D (central figure). From the an equilibrium state, whereas coupling with an external figure we extract the values of Σν in Eq. 4 for the three cy- energy source enables the creation and persistence of a cles. These are given by the product of the rates flowing to- NESS. wards the cycle [see T. L. Hill (Hill, 2005a) for details]: ΣF = From a mathematical standpoint, given a kinetic net- w32w45 + w43w32 + w34w45;ΣB = w12w65 + w61w12 + w16w65; Σ = 1. work described by the master equation, the stationary D ss eq probabilities pi (or pi ) can be obtained using graph- theoretical arguments. The details may be found in the where Aν (or βAν ) has been termed the action func- works of Hill (Hill, 1966, 2005a; Hill and Chen, 1975). tional (Lebowitz and Spohn, 1999) or affinity (Schnaken- Briefly, one may construct the set of partial diagrams berg, 1976) of the cycle ν. Note that the net direc- such that all the states are visited but no cycles are tion of completion of cycle ν is given by the sign of formed (see Fig. 5b). The stationary probability of be- ∆Jν = Jν+ Jν− . It is easy to show that (Hill and ing in state i is proportional to the sum of all the par- Simmons, 1976),− tial graphs oriented in such a way that the fluxes con- verge towards state i, or σi. The normalization factor is ∆Jν Aν 0. (6) P ≥ Σ = i σi, so that the stationary probability is, The above equality holds only when Jν,+ = Jν,−. In ss σi pi = . (3) other words, ∆Jν and Aν have the same sign, which Σ means that the value of the affinity dictates the direc- The stationary flux along one edge may be written as tion of the cycle (Hill and Simmons, 1976). a sum of contributions from all the cycle fluxes of the In order to identify the physical interpretation of system. Each cycle can be completed in two directions: these mathematical identities we need to connect the one, counterclockwise, labeled as “+”; the other, clock- rates with thermodynamic quantities, such as energy, wise, termed “-”. The cycle fluxes in the “+” and “-” entropy, and the chemical potential. Following three directions are given by the following relationships (Hill, different approaches we show that the affinity (Eq. 5) 2005a), is related to the entropy produced during a cycle. The first two strategies involve the Shannon entropy (Liepelt Σν Πν± J ± = , (4) ν Σ and Lipowsky, 2007b; Lipowsky and Liepelt, 2008; Schnakenberg, 1976) and fluctuation theorems (Crooks, where Σν is a combination of rates that are specific for 1998; Seifert, 2005a,b, 2012), and they will be dis- cycle ν,Πν± is the products of the rates of the cycle per- cussed without specifically referring to the experimental formed in the “+” (Πν+ ) or “-” (Πν− ) direction, and the hallmarks for motor velocity described in the previous denominator Σ is defined above (see Fig. 5c). It follows sections. The last method, which is based on estab- that, lishing a connection between pseudo-first-order rate J + Π + constants and the chemical potential (Hill, 2005a), will ν = ν = eβAν , (5) Jν− Πν− be developed in the context of molecular motors. 9
At equilibrium, the rate of hydrolyzing and synthesizing ATP is the same. However, this is not what happens in a cell, where B. Thermodynamics the concentration of ATP is maintained under homeo- static control (Wang et al., 2017) far away from equilib- Molecular machines are enzymes (E) that catalyze the rium; typical concentrations are [ATP] 1mM, [ADP] chemical transformation of substrate molecules (S P ), 10µM, [P ] 1mM (Howard, 2001), resulting≈ in a chem-≈ → i ≈ which in general can be described by the Michaelis- ical potential X = ∆µhyd of 25 (Howard, 2001), Menten kinetics (E + S ES E + P ). Of course, which makes ATP− hydrolysis a≈ spontaneous reaction → the process is reversible, and the enzyme also catalyzes (∆µhyd < 0). This means that the enzymatic cycle of the the reverse reaction – the transformation of P into S. molecular machine will be driven in the direction that In the case of molecular machines, the substrate is ATP, consumes ATP, and ATP hydrolysis provides the driv- and the products of ATP hydrolysis are ADP and or- ing force that enables the molecular machine to perform thophosphate (Pi). (Some molecular machines catalyze work. Without accounting explicitly for the whole cellu- the hydrolysis of GTP; mutatis mutandis our considera- lar apparatus involved in dictating and maintaining the tions do not change.) set-point ATP level, in the simplest theoretical model the We imagine the following experimental setup: a molec- system is assumed to be in contact with some devices re- ular machine operates in a solution containing ATP, ferred to as “chemostats” capable of maintaining the ini- ADP, and Pi. The chemical potentials of these three tial concentrations of ATP, ADP, and Pi (Lipowsky and species are given by (Hill, 2005b), Liepelt, 2008; Lipowsky et al., 2009; Qian and Beard, 2005; Seifert, 2011b). After each cycle the chemostat 0 µATP = µATP + kBT ln [ATP] removes the products from solution and replenishes the 0 substrates, thereby ensuring that after each enzymatic µADP = µADP + kBT ln [ADP] cycle the initial condition is reset, which enables the cre- µ = µ0 + k T ln [P ] Pi Pi B i (7) ation of a NESS for t . → ∞ X = ∆µhyd = µATP µADP µPi = The system is also in contact with a thermal reser- − − − Keq[ATP] voir with which it exchanges heat (Lipowsky and Liepelt, = kBT ln , 2008; Lipowsky et al., 2009; Seifert, 2011b). We assume [ADP][Pi] that the thermal reservoir operates “quasi-statically” on 0 the time-scale of the heat exchange, and the combination where µc (µc ) is the (standard) chemical potential of species c,[c] is the concentration in solution, and Keq of the system and the thermal reservoir is energetically 4.9 105 M (Howard, 2001) is the equilibrium constant≈ isolated. Because after a cycle the motor and the solu- for ATP· hydrolysis. (Note that we should use activities tion have not changed, the entropy produced is equal to ac in Eq. 7 instead of concentrations [c]; throughout the the ratio between the heat absorbed by the thermal reser- review we will assume that a [c].) At equilibrium, voir, Q, and the constant temperature T . We exclude pV c ≈ ([ADP][Pi]/[ATP])Eq = Keq, and ∆µhyd = 0. On the work from our formulation, but we include the possibility other hand, if [ATP]K [ADP][P ], then ∆µ < 0, that the molecular machine performs work against a fixed eq i hyd and the hydrolysis of ATP is a spontaneous reaction. load, so that for a forward (backward) step Wmech = fd0 We exclude un-catalyzed hydrolysis/synthesis of ATP, as (Wmech = fd0), where the choice of the sign implies − it occurs over time-scales beyond our interest (Hulett, that a positive force opposes forward movement. Note 1970); as a consequence, in the absence of molecular ma- that the load f is assumed to be clamped, so at every chines the concentrations [ATP], [ADP], and [Pi] are con- step, regardless of the position of the motor, the cycle is stant. The presence of the molecular machine does not repeated under identical conditions. alter the equilibrium features of ATP hydrolysis (namely, The energy X = ∆µhyd > 0 consumed during a cycle − K ), however it accelerates the rate of ATP synthe- is partitioned into work performed ( Wmech) and heat eq − sis/hydrolysis. Let the initial concentrations of ATP, released ( Q), that is, − ADP, and P make ATP hydrolysis a spontaneous re- i ∆µ + Q + W = 0, (8) action. After each catalytic cycle the enzyme attains hyd mech the same conformation that it had at the start. How- implying that the total change of entropy of the system ever, the solution conditions have changed as a substrate plus environment over one cycle is given by, (ATP) has been consumed and products (ADP and P ) i Q ∆µ fd have been created. Thus, in the presence of the molecular ∆S = = − hyd − 0 0. (9) machine the solution approaches the equilibrium ratio of T T ≥ concentrations of ATP, ADP, and Pi, and monitoring the (For backward steps, the sign in front of fd0 is the oppo- dynamics under these conditions corresponds to study- site). The inequality is due to the second law of thermo- ing the time-dependent relaxation towards equilibrium. dynamics, which states that we should to find an increase 10 in entropy during the cycle because the combination of Consider now a cyclic network of N states; the only system and environment is isolated. Eq. (9) leads to the transitions allowed from state i are i i + 1 and i observation that the maximum amount of force that the i 1. (The cyclic nature of the network implies periodic motor can resist is given by, boundary− conditions, and so state i = 0 is the same as P state i = N). In a NESS, the condition Jij = 0 ∆µhyd j fmax = − . (10) becomes ∆J = ∆J , so that the net flux across d i,i+1 i−1,i 0 all the edges is the same and equal to the net cycle flux, A number of authors have discussed variations of ∆J = J+ J−. It follows that, this thermodynamic framework. In particular, Seifert − N−1 elucidated the contribution of the thermodynamic diS Y wi,i+1 ∆JA = k ∆J ln = (15) contribution of chemostats (Seifert, 2011b); Qian and dt B w T i=0 i+1,i Beard discussed a network of reactions in which some metabolites are clamped, while other are introduced at where wi,i+1 and wi+1,i are the forward and backward rates, respectively. The term A = k T ln QN−1 wi,i+1 is a constant rate (Qian and Beard, 2005). B i=0 wi+1,i the affinity (see Eq. 5), which Hill refers to as a ther- modynamic force driving the system out of equilibrium and imposing a NESS (Hill, 2005a). Liepelt and Lipowky C. Rate of Entropy Production identified the thermodynamic force with the entropy pro- duction (Liepelt and Lipowsky, 2007b); if the rate of Let the entropy be, completing a cycle is ∆J > 0 (which implies that the cy- X S = k p ln p , (11) cle is preferentially completed in the forward, or “+”, di- − B i i i rection), one can identify the entropy ∆S produced over one cycle as, where the sum extends over all the available conforma- N−1 tions of the system. Let us take a time derivative of T diS Y wi,i+1 T ∆S = = k T ln = A 0. (16) S, and after imposing the conservation of probability ∆J dt B w ≥ P i=0 i+1,i ( i dpi/dt = 0) condition, and plugging in the master equation we get, The argument can be extended to the case in which the dS kinetic network is characterized by multiple cycles (Lie- X X −1 = kB (piwij pjwji) ln pi. (12) pelt and Lipowsky, 2007b); if ∆J is the average time dt − ν i j for completing a cycle ν, and ∆Sν is the entropy pro- Note that in a NESS, P (p w p w ) = P J = duced with that cycle, we can write, j i ij − j ji j ij 0, and as a consequence dS/dt = 0, as expected. We diS X = J ∆S 0. (17) first symmetrize the result with respect to the indexes i dt ν ν ≥ and j, and then add and subtract 1/2k P P (p w ν B i j i ij − pjwji) ln(wij/wji), which leads to (Lipowsky and Liepelt, Equation 6 indicates that each term in the summation is 2008; Schnakenberg, 1976), non-negative (Hill and Simmons, 1976). To summarize, in this section we showed how the dS 1 X X piwij 1 X X wij = k J ln k J ln . action functional (or affinity) can be identified with the dt B 2 ij p w − B 2 ij w i j j ji i j ji entropy produced. We now present a different argument (13) based upon fluctuation theorems leading to the same The first of the two terms on the r.h.s. of Eq. 13 is always conclusions. non-negative, and it has been identified as the rate of en- tropy production, diS/dt (Lipowsky and Liepelt, 2008; Schnakenberg, 1976). We note parenthetically that some authors (Hill and Simmons, 1976) do not include the fac- D. Fluctuation Theorems tor 1/2 as they interpret the summation to be carried out over the edges of the network, and not over the states. Let the probability density of taking a forward path of n steps be (i , t ; i , t ; ; i , τ), in which the system Because in a NESS dS/dt = 0, it follows that the second PF 0 0 1 1 ··· n term of the r.h.s of Eq. 13 is the rate of entropy out-flux, is in state i0 at t0, transitions to state i1 at t1 and so on and, until it reaches state in at time τ (see Fig. 6a). We use the subscript “F ” to indicate that the transitions are diS 1 X X piwij = k J ln = completed forward in time. Using the Markov property, dt B 2 ij p w i j j ji we can rewrite this joint probability as (Crooks, 1998), (14) 1 X X wij = k J ln 0. F (i0, t0; i1, t1; ; in, τ) = B 2 ij w ≥ P ··· (18) i j ji = p (t )P (i , t i , t ) P (i , τ i − , t − ). i0 0 1 1| 0 0 ··· n | n 1 n 1 11
ss pin and because each path could be performed forward or backward in time, the normalization factor = ˜ . Following Seifert (Seifert, 2005b), we define R toN be theN logarithm of the ratio between the probability for for- ward and backward paths, and for the sake of simplicity we consider only cyclical pathways, in which i0 = in. Therefore,
n−1 F Y wiα,iα+1 R = ln P = ln . (21) w PB α=0 iα+1,iα Note that the time-dependent terms in the joint proba- bilities cancel out, so we neglect them in the following. (In alternative, we may consider the time at which these jumps occur to be fixed.) Here, R has a structure anal- ogous to that of the affinities introduced in the previous sections. More precisely, given a set of states visited we FIG. 6 Path in state space executed forward (a) and back- can rewrite, ward (b) in time. The starting point is the dot, waiting times X at a fixed state are in black, transitions are in blue. R = nν βAν , (22) ν
where nν is the net number of times the cycle ν has Assuming that the system is in a steady state at t = been completed during the n-step path associated with ss t0, we replace pi0 = pi0 . The conditional probabili- R. In order to justify this expression, note that every −Wi(tj −ti) ties are given by P (j, tj i, ti) = wije , where time that during the path a cycle is not completed, the W = P w . The conditional| probability P (j, t i, t ) path re-traces it self and those branch-like excursions do i j ij j| i is the product of two probabilities. The first is wij/Wi, not contribute to R. If we average over all paths that the probability of making a transition to state j among complete a cycle of length n, we find the following se- all the other possibilities when starting in state i. The quence of identities (Seifert, 2005b), −Wi(tj −ti) second is Wie , the probability that this transi- X X e−R = P e−R = P = 1. (23) tion happens after time (tj ti). The reason for the Wi h i F B in the exponent is that the− mean waiting time to transi- paths paths −1 tion from state i to any other state is Wi , independent Using Jensen’s inequality, e−R e−hRi, leads to, of state j. This independence is a well-known property h i ≥ of escape times in Markov networks, which may be com- R 0. (24) h i ≥ puted through well-known methods (Van Kampen, 2007). From Eq. 22 we obtain, Therefore, the joint probability density becomes (Seifert, X 2012), R = n βA 0. (25) h i h ν i ν ≥ ν F (i0, t0; i1, t1; ; in, τ) = P ··· We now introduce an Arrhenius-type relationship be- n−1 (19) tween the rates, w /w = e−β∆Fij , where ∆F = −1 ss Y −Wiα (tα+1−tα) ij ji ij = pi0 wiα,iα+1 e , N Fj Fi is the free energy difference between state i and α=0 state− j. The free energy difference accounts for three where tn = τ and (t0, τ) is a normalization factor to contributions: (i) the intrinsic free energy of a state may P N R τ R τ R τ ensure that dt1 dt2 dtn F = 1. change; (ii) the motor may bind/release ATP, ADP, and i0,i1,·,in t0 t1 ··· tn−1 P The probability of completing the time-reversed path Pi; (iii) the motor may perform work against an exter- (Fig. 6b) is, nal load. After a cycle, the motor returns to the initial state, and as a consequence the intrinsic free energy does (i , τ,˜ i − , t˜ − ; ; i , t˜ ) = not carry any contribution to the cycle. The hydrolysis PB n n 1 n 1 ··· 0 0 N−1 of ATP contributed with the release of energy equal to − Y − ˜ −˜ (20) = ˜ 1pss w e Wiα (tα tα+1), ∆µhyd, and the work performed per each displacement N in iα+1,iα − α=0 of size d0 against a load f is equal to fd0. It follows that (Lipowsky and Liepelt, 2008), − in which we started in state in at time t˜n = τ tn and ∆Sν − Y kij −nν,ATPβ∆µh−βflν retrace the forward path until we reach state i0 at time = e = e kB , (26) kji t˜ = τ t . Note that the initial probability is now |i,ji∈ν 0 − 0 12 where, following the notation of Lipowsky and Lie- forward while the others hold tight onto the CF. From pelt (Lipowsky and Liepelt, 2008), i, j corresponds to a here on, we focus our discussion exclusively on dimeric directed edge from state i to j, and| thei sum is extended processive motors made of two identical enzymes of the over all the directed edges belonging to cycle ν. Here, same family, which we refer to as “heads.” With this nν,ATP is the net number of ATP hydrolyzed during cy- we may identify the leading head (LH) as the one that cle ν, and lν is the net displacement along the track (a is in front of the dimeric complex, while the other is multiple of d0). The last identity follows from Eq. 9, and the trailing head (TH). We reserve the word “motor” from Eq. 25 gives, for the motile construct, which is a dimer in the case of myosin V (Mehta et al., 1999), VI (Rock et al., 2001), X k R = n ∆S 0. (27) and X (Sun et al., 2010), conventional kinesin (kinesin- Bh i h ν i ν ≥ ν 1) (Howard et al., 1989), and cytoplasmic dynein (Reck- Peterson et al., 2006b). In the well-accepted hand-over-hand stepping mecha- V. MOLECULAR MOTORS – MODELS WITHOUT ¨ DETACHMENT nism (Okten et al., 2004; Sun et al., 2010; Toba et al., 2006; Yildiz et al., 2003, 2004a,b), the TH of the motor In this section we develop models of molecular motors detaches from the filament, overcomes the bound head, with increasing complexity, from one-state models, and advances the motor by reaching a forward target to uni-cyclic models, ending with multi-cycle kinetic binding site (TBS). After the step is completed, the role networks. We show that experimental evidence and of the two heads is reversed, and the process is repeated thermodynamic insights suggest that the introduction identically, until the motor detaches from the CF. A few of multiple cycles creates models that are physically key ingredients are necessary for processive, hand-over- more sensible. We discuss more in detail the efficiency hand movement to occur. First, in order to prevent pre- of the motors, and we conclude the section by describing mature end of the processive run, the head that remains models that account for the detachment of the molecular bound to the track during a step should not dissociate motor from the track. from the CF at least until the detached head reaches the TBS. Second, the two heads should go through their cat- alytic cycles out-of-phase (Block, 2007), so that the TH is more likely to detach from CF than the LH. This re- A. Molecular Motors are Processive Enzymes quirement necessarily implies that the two heads ought to communicate with one another through action at a The catalytic cycle of kinesin, dynein, and myosin pro- distance, which may be viewed as a complex form of al- ceeds via multiple steps, in which ATP binding, hydroly- lostery (Thirumalai et al., 2019). Third, during a step sis, and release of ADP and Pi lead to substantial changes a combination of conformational transitions in the two in the motor conformation. The presence of the cy- heads biases the diffusive motion of the free head towards toskeletal filament, CF (F-actin in the case of myosin, the TBS unless a large resistive force is applied. microtubules for dynein and kinesin), increases the rate Meeting the first two conditions requires specific fea- of ATP hydrolysis (De la Cruz et al., 2001, 1999; Hack- tures of the hydrolytic cycles that the two heads must ney, 1988; Homma and Ikebe, 2005; Ori-McKenney et al., undergo. For instance, if the rate-limiting step occurs 2010). When the structural changes are properly recti- in a CF-bound conformation, the chances of prematurely fied under the specially designed interactions with the ending the processive run are diminished. In addition, appropriate CFs, the conformational fluctuations in M the interaction between the two heads is believed to be are transduced to a predominantly uni-directional motion key in ensuring that the TH steps first, and that the along the track, and generates mechanical forces against LH is unlikely to detach during a step. This is possi- an external load. The free energy source necessary to ble if the interplay between the two heads slows down perform this movement (or work) is provided by the hy- (or “gates”) some of the steps of the LH catalytic cycle, drolysis of ATP. or accelerates them in the trailing head (also referred to Enzymes that work in conjunction with substrates (as as “gating”) (Block, 2007; Hancock, 2016; Sweeney and is the case for Ms and CFs) are said to be “processive” Houdusse, 2010). The structural bias towards the TBS is if they perform multiple catalytic cycles without fully provided by a conformational transition of the CF-bound disengaging from the CFs (Schnitzer and Block, 1995). motor that projects forward the stepping head. For Processive motors move long distances along the CFs by the three classes of motors mentioned before, the lever hydrolyzing one ATP molecule per step without dissoci- arm swing in myosin, the neck-linker docking in kinesin, ating from the CF. In general (although with some fasci- and the bent straight transition of the linker domain in nating exceptions (Inoue et al., 2002; Post et al., 2002)) dynein provide→ the requisite forward bias. It is remark- processive movement requires the cooperation of mul- able that these structural transitions, induced by ATP tiple enzymes: some members of the ensemble proceed binding and hydrolysis, which occur on relatively small 13 length scales are amplified by responses in other struc- tal value for the Michaelis constant is KM & 50 µM, tural elements. The terms “power stroke” (Dominguez and for [ATP] & 1 mM the velocity of kinesin is satu- et al., 1998) and “Brownian ratchet” (Rice et al., 1999) rated to its maximum value Vmax = d0kcat 8 nm/10 have been used to schematize this forward bias, depend- ms = 0.8 µm/s (Kolomeisky and Fisher, 2007;≈ Visscher ing on whether the emphasis is placed on the mechanistic et al., 1999). Similar single-molecule studies have been (order order) or stochastic (disorder order) nature of conducted on other motors, showing that the Michealis- the conformational→ transitions. → Menten scheme provides a good paradigm to rationalize In the simplest cases, the step-size of a motor is the ATP concentration dependence of velocity. However, commensurate with the filament repeat – for motors the parameters of the fit depend on the motor: for myosin moving along the MT, d0 8.2 nm, whereas d0 36 nm V it was found that Vmax 0.55 µm/s, with KM 163 µ for actin-based motors (myosins).≈ However,≈ not all M (Baker et al., 2004), whereas≈ the slower myosin≈ VI has motors walk precisely by following this rule: myosin Vmax 0.16 µm/s and KM 274 µM [fit of Eq. 28 to VI and dynein, for instance, display a broad step-size data from≈ Elting et al. (2011)].≈ distribution, in which hand-over-hand steps are inter- Experiments employing optical tweezers have shown twined with inchworm-like movements and frequent that external loads affect the ATP chemistry in the cat- backward steps (Nishikawa et al., 2010; Reck-Peterson alytic site as well as the motility of the motor (Block et al., 2006a). Of course, the variability in the step et al., 2003; Oguchi et al., 2008; Veigel et al., 2005; Viss- sizes is also a consequence of the architecture of the cher et al., 1999). Thus, the resulting force-velocity-ATP motor. Thus, both the structural design of the motor relationship has been a landmark measurement, which is and the coupling to catalytic cycles determine not only used as a constraint to decipher the mechanism under- the stepping kinetics but also the precision of the step lying kinesin motility, and to construct the appropriate sizes. SKMs. Incorporation of the effect of load into MM model was suggested by making the parameters kcat and KM force-dependent (Schnitzer et al., 2000),
B. Processive Motor Velocity d k (F )[ATP] V (F, [ATP]) = 0 cat (29) kcat(F )/kb(F ) + [ATP] A variety of single-molecule techniques are able to inform on the motile properties of molecular motors. o F δcat/kB T where kcat(F ) = kcat/(pcat + (1 pcat)e ) and The motor or the track may be labeled with a fluores- o F δb/k−B T kb(F ) = kb /(pb + (1 pb)e ). The fit of the cent (Nishikawa et al., 2010; Yildiz et al., 2003) or re- F -dependent velocity− data of motor using Eq.29 is fractive (Isojima et al., 2016; Mickolajczyk et al., 2015) reasonable as long as the magnitude of load (F ) is small probe. Monitoring the time-dependent changes in the (Schnitzer et al., 2000). However, the conventional MM location of the such probes enables the determination model has an intrinsic drawback when the external load of the velocity of the motor as a function of nucleotide approaches the stall force value ( 7 pN for kinesin) concentration. Alternatively, optical trapping techniques and becomes greater than the stall≈ force. An increase may be used to follow the displacement of the motor or of load ought to induce backward stepping (V < 0); yet the filament, and to investigate the motor response to no modification of Eq.28 produces a negative velocity! the external load (Block et al., 1990; Finer et al., 1994; One possible fix for the failure of the MM model at Howard et al., 1989). Much of our current knowledge large load is to permit in the model a reverse reaction about kinesin comes from single molecule experiments current, which is realized by rendering every elementary (Block et al., 1990; Howard et al., 1989), which were de- reaction step within the kinesin cycle “reversible.” We veloped approximately at the time of the discovery of ki- discuss later how such models have been constructed in nesin (Brady, 1985; Vale et al., 1985). It has been shown the context of Markov jump processes. that each step of kinesin motors along MTs is tightly coupled with hydrolysis of one ATP molecule, that is, a single event of ATP hydrolysis by kinesin leads to 8 nm step along MTs (Schnitzer and Block, 1997). Thus,∼ C. Periodic Lattice Model within the Michaelis Menten (MM) scheme, the velocity of kinesin is associated with enzyme turnover rate is, In a molecular motor (i) binding, release, and chemi-
kcat[ATP] cal transformations of ATP, ADP and Pi from the mo- V = d0 , (28) tor head domain, and (ii) the advancement along the KM + [ATP] track occur over much slower times (ms - s) compared where d0 = 8 nm is the step size of the kinesin, kcat to the fluctuations of molecular conformations (ns - µs). and KM are the rate of catalysis and the Michaelis con- The time-scale separation enables the construction of a stant, respectively. For kinesin-1, the typical experimen- Markov jump processes in which the state of the system 14 is defined by two coordinates: (i) one of the N intermedi- u = u*[ATP] ates defining the chemical state of the substrate, and (ii) the location l along the track. The probability density ⋯1l−1 1l 1l+1⋯ of the i-th state (i = 1, 2,...,N) at filament site l obeys w w⇌ ⇌ the master equation, = *[ADP][Pi] (a) l+1 u u u dPi(l, t) X X 0 N 1 = 1*[ATP] = [wj,l→i,l0 Pj(l , t) wi,l→j,l0 Pi(l, t)] 1 dt − ⇌w l0=l−1 j w = w*[ADP][P ] 2 1 ⇌1 i (30) N 2 P P 0 uN w w where l i Pi(l, t) = 1, and wj,l→i,l is the rate for go- −1 N 3 u2 ⇌ ing from state j in location l to state i on F site l0 = l, l 1. ⇌ ± Under the assumption that the track is infinite and pe- N w riodic, that is that the rates do not depend on l, one − 1 4 3 ⋯ u may sum the r.h.s and the l.h.s of the master equation 4 ⇌3 and eliminate l. As a result, we obtain the master equa- (b) tion in Eq. 2, and we can use the theoretical framework presented before. FIG. 7 Example of simple kinetic networks. Panel (a): the Periodic one-dimensional models, which were orig- network has only one statte, N = 1. The forward rate is given by u = u∗[ATP], the backward rate is w = w∗[ADP][P ]. Both inally suggested by Derrida (Derrida, 1983) in order i forward and backward rates are pseudo-first-order, with units to study the mean velocity and diffusion constant of mM−1s−1 and mM−2s−1, respectively. Panel (b): multi-state random systems, have been widely adopted in describing model. Here, the rates u1 and w1 are arbitrarily chosen to the dynamics of molecular motors. In particular, depend on [ATP] and [ADP][Pi], respectively. Panel (c): a Fisher and Kolomeisky (Fisher and Kolomeisky, 1999a, cycle corresponding to the multi-state model. 2001; Kolomeisky and Fisher, 2007) have pioneered this approach and specifically used the reversible ki- netic model to describe the motility of kinesin-1 and backward rates is given by, myosin V. Numerous other studies have used vari- ∗ u u [ATP] βX ants of these models to discuss the thermodynamic (a) = ∗ = e features of stochastic motors. We review the main w w [ADP][Pi] ∗ (32) successes and shortcomings of these models in the con- u (b) = Keq, text of molecular motors as their complexity is increased. w∗
where we recall that X = ∆µhyd. The average, station- ary velocity is given by the− step-size times the difference between the forward and backward rates, 1. One-state Models v = d (u w) = d w eβX 1 . (33) 0 − 0 − In the simplest possible model, the motor is defined by The crucial insight from this expression pertains to a single (N = 1) chemical state at each site along the the interplay between the non-equilibrium steady-state track (Fig. 7a). The rate of forward stepping, u, depends (NESS) homeostatically maintained by the cell and on the concentration of ATP. Microscopic reversibility molecular motor function: in the cytosol X > 0, thus demands that the backward transition is possible as well, v > 0; at equilibrium, X = ∆µhyd = 0, implying that which is then nominally associated with ATP synthesis. v = 0. The rates of forward and backward stepping are modeled In the absence of applied load, the motor moves with- as pseudo-first order processes, out performing any work. In this case, the entire free en- ergy extracted from the hydrolysis of ATP is dissipated ∗ ∗ u = u [ATP], w = w [ADP][Pi], (31) into heat, Q = ∆µhyd. From Eq. 15, the rate of heat dissipation per step− is given by, ∗ ∗ leading to u/w = u /w [ATP]/([ADP][Pi]). In equi- d S u · T i = Q˙ = k T (u v) ln = (u v)( ∆µ ) 0. librium, the probability of going forward is identical to dt B − v − − hyd ≥ the probability of backward stepping, which implies that (34) (u/w) = 1. From Eq. 7, at equilibrium ∆µ = 0 Clearly, Q˙ 0, and the equality holds in equilibrium, Eq hyd ≥ and ([ADP][Pi]/[ATP])Eq = Keq. As a consequence, us- where u = v. If instead the motor is subjected to a ing Eq. 7 we obtain that the ratio between forward and force, f, opposing its movement, at each step the motor 15 performs mechanical work W = fd0. In this case, the free is zero when f = 0 and at the stall force (where v = 0), ∗ energy released by ATP hydrolysis is partitioned between thus for some force 0 < f < Fs it reaches a maximum ∗ work and heat, Q = ∆µhyd W . In order to account for P . The force-velocity curve, power output, efficiency, the presence of applied− load,− the rates may be modified and efficiency at maximum power (η∗) have been used to using the Bell model (Fisher and Kolomeisky, 1999b), compare ratchet-like motors (θ = 1) with power-stroke- driven (θ = 0). Given a value of the driving force X, ∗ −βθfd (a) u(F ) = u [ATP]e 0 , power strokes result in larger velocity at fixed F , and ∗ β(1−θ)fd0 display higher efficiencies and exert more power at a given (b) w(F ) = w [ADP][Pi]e . (35) u velocity (Wagoner and Dill, 2016). Furthermore, both (c) = eβ(X−fd0) ∗ ∗ w η and P at fixed X increase as θ 0 (Schmiedl and Seifert, 2008). Finally, from Eq. 15 we→ write, Here, the coefficient θ indicates how the load is parti- tioned between the forward and backward steps, and it diS u T = Q˙ = (u w) ln = (u w)( ∆µ fd ), identifies the position of the transition state along the dt − w − − hyd − 0 direction of motion (Fisher and Kolomeisky, 2001). For (39) θ = 0, the forward step is unaffected by force, whereas where we used Eq. 35c and Eq. 36. Note that, again, Q˙ 0, and the equality holds at equilibrium (u = w). backward steps are insensitive to load if θ = 1. These two ≥ extreme cases have been referred to as “power stroke” One-state models are appealing because of their and “ratchet”, respectively (Howard, 2006). In a “power simplicity; for instance, they have only two fitting pa- stroke”, the chemical step (crossing the transition state) rameters with straightforward physical interpretations: ∗ occurs before the movement forward of the motor. In w sets the time-scale (see Eq. 33), and θ establishes the contrast, in the case of a “ratchet” the motor fluctuates location of the transition state during a step. However, between the sites l and l + 1 until it is captured in the these models fail in reproducing the dependence of the forward site by ATP hydrolysis. In the presence of back- velocity on ATP concentration observed in experiments ward load the velocity becomes, (Eq. 28). According to Eq. 33 v grows without saturat- ing as [ATP] increases, which is not physical. In order β(1−θ)fd0 βX−βfd0 v = d0w(0)e e 1 . (36) to solve this problem, one must use kinetic models with − N > 1. It is clear that the motor stalls under a resistive load of,
X ∆µhyd fstall = = , (37) d0 − d0 2. Multi-state, Uni-cycle Models at which the mean motor velocity becomes v = 0. It is Figure 7b shows a kinetic model for molecular motors clear that fstall = fmax for this model. Using Eq. 37 to- with N > 1 intermediates. Assuming that the filament is gether with the value of ∆µhyd in the cell and the average periodic, we can drop the label l and construct an equiv- step-size of a motor, one can predict the value of fstall: for alent uni-cycle in Fig. 7c, where the rate uN is associated kinesin fstall 12.5pN, for myosin V fstall 2.8nm. In ≈ ≈ with a forward step, and w1 with a backward displace- the case of myosin V, fstall approximates the value of the ment. From Eq. 4 we obtain the following relationships, stall force measured experimentally; for kinesin, the fmax is about twice as large as the stall force. Note also that QN u QN w (a) J + = i=1 i ,J − = i=1 i , according to Eq. 37, Fs depends on [ATP]; a few experi- Σ( u, w ) Σ( u, w ) ments have suggested that this is not the case for a vari- { } { } QN u J + ety of motors (Carter and Cross, 2005; Gennerich et al., (b) i=1 i = = eβX , QN J − 2007; Nishiyama et al., 2002; Toba et al., 2006; Uemura j=1 wj (40) et al., 2004a), although other studies have observed such ∗ QN u ui adependence (Visscher et al., 1999). The independence (c) 1 i>1 = K , ∗ QN eq of Fs on [ATP] might be reasonable because each head w1 j>1 wj has only one binding pocket for ATP, and hence [ATP] should not determine Fs. The relationship v(f, ∆µ) al- lows one also to compute the power output of the motor, where J + and J − are the fluxes to complete a cycle defined as P = vF , and the motor efficiency, in the clockwise and counterclockwise direction, respec- tively, and Σ( u, w ) is a function of all the rates (see fd η(f, X) = 0 , (38) Eq. 3). The connection{ } with thermodynamics is provided X by Eq. 40b, which is equivalent to Eq. 32a; Eq. 40c holds that is the ratio of the work produced over the energy because in equilibrium X = 0 (see Eq. 32b for the N = 1 released by the hydrolysis of ATP. Note that the power case). For the case of uni-cyclic network models, the net 16 steady state flux (∆J = J + J −) along the cycle is ob- Following Eq. 15, the rate of heat released during a − ss ss tained by calculating ∆J = wi,i+1pi wi+1,ipi+1 at any cycle (Hill, 2005a; Qian, 2007; Qian and Beard, 2005; edge between the two neighboring chemical− states along Toyabe et al., 2010; Zimmermann and Seifert, 2012) is, the cycle. Therefore, the velocity, which is obtained as uN + the flux across edge N 1, is given by, J Q˙ = ∆JkBT ln = ∆J( ∆µhydr fd0) = E˙ +W˙ 0, w1 J − − − ≥ (46) QN ˙ − βX i=1 wi βX where E is the rate of free energy expended per cy- v = d0∆J = d0J (e 1) = d0 (e 1), ˙ − Σ( u, w ) − cle. Note that in uni-cyclic network models, W = { } (41) ∆J(f)F d0 = 0 either at f = 0 or if the motor is subject − The similarity between Eq. 33 and Eq. 41 is clear: both to an opposing stall force (f = Fs), where ∆J(Fs) = 0; of the expressions relate the velocity to the exponential thus the work production (W˙ ) is a non-monotonic func- of the free energy released upon the hydrolysis of ATP, tion of f, whereas E˙ and Q˙ decrease monotonically with and in both cases at equilibrium the motor does not F . move. However, the expression in Eq. 41 saturates at Although the conventional N-state unicyclic mod- large [ATP], in agreement with experiments (see Eq. 28). els (Fisher and Kolomeisky, 1999a, 2001; Kolomeisky and Compared to the case of a single chemical state, new Fisher, 2003) appear successful in describing the motil- load distribution factors are necessary for each transition ity and thermodynamics of molecular motors, unicyclic rate, i.e. models encounter two serious problems, especially when the molecular motor is stalled or starts taking backward −βθ+fd βθ−fd ui(f) = ui(0)e i , wi(f) = wi(0)e i . (42) steps at large hindering loads (Astumian and Bier, 1996; Hyeon et al., 2009; Liepelt and Lipowsky, 2007a). First, PN + the backward step in the unicyclic network, by construc- The distribution factors obey the relationship i=1(θi + − θi ) = 1 (Fisher and Kolomeisky, 1999b). It follows that, tion, is produced by a reversal of the forward cycle, which implies that the backward step is always realized via the N−1 Q + synthesis of ATP from ADP and Pi. This is the case i=0 ui(f) J (f) βX−βfd0 N−1 = − = e , (43) for rotary ATP-synthases, which function as transducers Q w (f) J (f) i=0 i of electrochemical potential into the synthesis of ATP in et al. and given ∆J(f) = J +(f) J −(f), the force-velocity the mitochondria (Alberts , 2008), and depending relation is, − on the sign of ∆µhyd and on environmental conditions (e.g. the presence and strength of a proton gradient QN and applied load), these spectacular motors can reverse wi(f) v(f) = d ∆J(f) = d i=1 eβX−βfd0 1 . (44) their function (Itoh et al., 2004; Turina et al., 2003). Al- 0 0 Σ(f) − though ATP hydrolysis is reversible for linear molecular Although v(F ) is a complicated function of all the pa- motors (Bagshaw and Trentham, 1973; Hackney, 2005), rameters (Fisher and Kolomeisky, 1999b; Wagoner and back-stepping has not been associated with ATP synthe- Dill, 2016), the expression for a theoretical estimation of sis. Rather they are associated with ATP-independent the stall force is the same as the one found for N = 1, “slippage” or coupled to fuel consumption just as for- given by Eq. 37, and it is again equal to f in Eq. 10. ward stepping (Carter and Cross, 2005; Clancy et al., max et al. et al. In principle, one could design kinetic networks with an 2011; Gebhardt , 2006; Ikezaki , 2013). This arbitrary number of intermediates. However, because the requires an update of the kinetic network. number of free parameters becomes 4N 2, it is desir- In addition, the unicyclic network models lead to ˙ able to establish the minimal N that enables− an accu- Q = 0 under stall conditions (Eq. 46), which contradicts rate description of the experimental data as accurately physical reality. For example, an idling car still burns d 2 2 fuel and dissipates heat (Q˙ = 0)! There certainly exist as possible. Let D = limt→∞ dt ( x(t) x(t) ) be 6 d h i − h i fundamental differences between molecular world and the dispersion, and v = limt→∞ dt x is the velocity; the randomness parameter for a quantityh i x is defined as, macroscopic counterpart in that the former is subject to a large degree of fluctuations, which permit the reverse 2D process of ATP hydrolysis (i.e., synthesis) or negative r = . (45) heat dissipation for a sub-ensemble of the entire realiza- d0v tions. Yet, the mean of entropy production Q/T˙ is still It can be shown that N 1/r (Kolomeisky and Fisher, bound to be positive as demanded by thermodynamics, 2007; Koza, 2002), and therefore≥ r−1 sets a lower bound and the probability of a local violation of this principle to N that is needed to account for measurements. Note becomes vanishingly small as the system size grows. To that r is a function of ATP concentration and external ameliorate the aforementioned physically problematic load. interpretation, associated with the backward step 17
adjacent chemical states are given as
125D T 187 a hyd 126 k (f) = ko e−θfd0/kB T 188 127 [T] 25 25 189 128 o (1−θ)fd0/kB T 190 D T D k52(f) = k e 129 T 52 191 130 k (f) = 2ko (1 + eχij fd0/kB T )−1 for ij = 25 or 52192 (47) 131 ij ij 6 193 132 194 133 For the sake of simplicity, it is assumed that (2) 195(5), 134D T 196 b 125135 the step associated with the switching of the trailing197 and187 hyd 126136 leading head positions (the yellow arrow in Fig.8b),198 obeys188 137 199 127 [T] the Bell-like force dependence. When f > 0, the force189 D 138 200 D 128T T 190 129139 exerted on the motor hinders the forward step and201 en-191 140 202 130 [T] hances the backward step. Other steps (ij = 25 or 52)192 141 203 131 6 193 142 are abolished (kij 0) at large f(> 0), which204 corre- Analysis ofhyd kinesin-1 based onD the 6-state double cycle kinetic model. Schematics of the 6-state double cycle kinetic network for hand-over-hand dynamics of 132143 T Fig. 1. A. → 205 194 kinesin-1, where T , D, and „ denote ATP-, ADP-bound, and apo state, respectively.sponds Through binding to the [(1) (2)] physical and hydrolysis situation of ATP [(2) (5)], where and release the of ADP large [(5) (6)], external 133 æ æ æ 195 144 kinesin moves forward in the -cycle [(1) (2) (5) (6) (1)], where it takes a backstep in the -cycle [(4) (5) (2) (3) (4)]. The arrows in the 206 134 F æ æ æ æ force impedes theB chemicalæ æ stepsæ suchæ as ATP binding,196 145 figure depict the direction of reaction currents. In both cycles, each chemical step is reversible and the transition rate from the state (i) to (j) is given by kij . B. Reaction 207 135 current J , J , and J as a function of load. The three cartoons illustrate the amount of current along the and cycles at each value of load. f<0 and f>0 correspond 197 146 F B hydrolysis, andF ADPB release by deforming the confor-208 c to the assisting and hindering load, respectively. C. The ratio between the forward and backward fluxes J /J as function of f at fixed [ATP]. The stall forces, determined at 125 136147 F B 187 209 198 J /J =1(dashed line), are narrowly distributed between f =6 8 pN. (Dmation–H) V , D, Q˙ , ofW˙ , E molecular˙ as a function of f and motor. [ATP]. The whiteThe dashed model lines depict consists the locus of of two F B ≠ 126 137148 [ATP]-dependent stall force, and the dashed lines in magenta indicate the condition of [ATP] = 2 mM. I. Dependences of E˙ , Q˙ , W˙ on f at [ATP] = 2 mM. The188 star symbol in D 210 199 138149 indicates the cellular condition of f 1 pN and [ATP] 1 mM, which will be discussedsub-cycles later. ((1) (2) (5) (6) (1)) and211 200 127 ¥ ¥ F 189 B 128 139150 ((2) (3) (4) (5) (2)). At small190 (f 0)212 or as-201 140151 ≈ 213 202 129 motor e ciency, if any, that is minimized. To evaluatesisting, hindering force load. (f < The0), backstep which in renders unicyclick reaction25191 schemes,k52, the reac- 141152 Q Q 214 203 Q˙ D V 130 142153 one should know , , and of the system (see Eq.2), whichtion currentby construction, is mainly is produced formed by a reversal along of the the192 forwardcounterclockwise cycle 215 204 Fig.can 1. beAnalysis gained of by kinesin-1 building based a proper on the 6-state kinetic double network cycle model kinetic model. that A.(15Schematics), which of implies the 6-state that double+ the cycle backstep kinetic network is always for hand-over-hand realized via dynamics 131 143154 direction of cycle ( ); in contrast,193 at large hindering216 205 of kinesin-1, where T , D, and „ denote ATP-, ADP-bound, and apo state, respectively. Through binding ((1) (2)) and hydrolysis of ATP ((2) (5)), and release of ADP 155 can delineate the dynamical characteristics of the system (11). the synthesisF of ATPæ fromF ADP and Pi. Moreæ importantly, 217 132 144 ((5) (6)), kinesin moves forward in the -cycle ((1) (2) (5) (6) (1)), where it takes˙ a backstep in the -cycle ((4) (5) 194(2) (3) (4)). The 206 156 Ouræ analysis on motors will showF that is aæ complexæ functionæ forceæ in (f calculating > 0) aboveQ from the kinetic stallB network, condition theæ unicyclicæ (f =æ reactionfs),æ the218 cur- 145f