Arxiv:2009.14354V2 [Cond-Mat.Stat-Mech] 2 Jan 2021
Total Page:16
File Type:pdf, Size:1020Kb
Skewed Thermodynamic Geometry and Optimal Free Energy Estimation Steven Blaber∗ and David A. Sivaky Dept. of Physics, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada (Dated: January 5, 2021) Free energy differences are a central quantity of interest in physics, chemistry, and biology. We develop design principles that improve the precision and accuracy of free energy estimators, which has potential applications to screening for targeted drug discovery. Specifically, by exploiting the connection between the work statistics of time-reversed protocol pairs, we develop near-equilibrium approximations for moments of the excess work and analyze the dominant contributions to the precision and accuracy of standard nonequilibrium free-energy estimators. Within linear response, minimum-dissipation protocols follow geodesics of the Riemannian metric induced by the Stokes' friction tensor. We find the next-order contribution arises from the rank-3 supra-Stokes' tensor that skews the geometric structure such that minimum-dissipation protocols follow geodesics of a generalized cubic Finsler metric. Thus, near equilibrium the supra-Stokes' tensor determines the leading-order contribution to the bias of bidirectional free-energy estimators. I. INTRODUCTION (equaling the dissipation), and hence the error, can be reduced by following a different path through control- parameter space and varying the velocity along the path Free energy differences determine the equilibrium 14{16 phases of thermodynamic systems as well as the relative while keeping the protocol duration fixed. Near equi- reaction rates and binding affinities of chemical species.1 librium, this can be mapped onto the thermodynamic- Computational and experimental techniques which accu- geometry framework: for the mean-work estimator, the rately and precisely predict free energy differences are bias and variance from finite-time protocols is improved therefore highly desirable. One important application is by following geodesics of a thermodynamic metric, the in pharmaceutical drug discovery, where computation of force-variance (FV) metric, a Riemannian metric de- fined by the covariance matrix of conjugate-force fluctu- free energy differences can aid in the identification and 17{21 design of ligands for targeted protein binding.2{5 Current ations; similarly, for BAR, the variance (but not the methods rely primarily on costly and time-consuming ex- bias) is minimized if the protocol follows geodesics of the FV metric.22 This has been used to improve the precision perimentation, which can be reduced through screening 23{26 with efficient computational techniques.1{8 of calculated binding potentials of mean force. Nonequilbrium measurements of free energy differences The FV metric only minimizes the variance of the are often estimated by measuring the work incurred from mean-work estimator and BAR if the relaxation time a protocol (a dynamic variation in control parameters) is independent of the control parameters. The friction- 27,28 that drives the system between control-parameter end- tensor metric, the product of the covariance and inte- points corresponding to each state. A unidirectional es- gral relaxation time of conjugate-force fluctuations (19), timator calculates the free energy difference using only is more general than the FV and provides a relatively sim- the work done by a protocol driving from initial to final ple prescription for reducing excess work in more general states (forward protocol), while a bidirectional estimator near-equilibrium processes, where minimum-dissipation also uses a protocol that drives from final to initial states protocols follow geodesics of the Riemannian metric in- (reverse protocol). The mean-work estimator9 equates duced by the friction tensor. For common unidirectional the mean work with the free energy difference and yields estimators near equilibrium, the bias and variance are a biased estimate for any non-quasistatic (finite-speed) proportional to the first moment (mean) of the excess protocol. The Jarzynski estimator for free energy differ- work, and hence minimum-dissipation protocols defined ences (derived from the Jarzynski equality relating the by the friction tensor minimize both the bias and vari- exponentially averaged work to the free energy change10) ance. is unbiased for a large number of samples. The mean- We show that for bidirectional estimators, the variance work and Jarzynski estimators can be used as either uni- is proportional to the sum of the second moments of the arXiv:2009.14354v2 [cond-mat.stat-mech] 2 Jan 2021 or bidirectional estimators; however, if bidirectional data excess work from forward and reverse protocols (26a), is available the maximum log-likelihood estimate is Ben- while the bias is proportional to the difference of the nett's acceptance ratio (BAR)11 which yields (for a large first moments (27a).29 With this in mind, we extend the number of samples) the minimum variance of any unbi- thermodynamic-geometry framework to higher-order mo- ased estimator.12,13 ments of the excess work and beyond the leading-order Regardless of the estimator, non-quasistatic control- friction-tensor approximation. We find that for bidi- parameter protocols result in excess work (work in ex- rectional estimators near equilibrium, minimum-variance cess of the free energy difference), which increases the protocols follow geodesics of the Riemannian metric in- bias and variance (decreases the accuracy and precision, duced by the friction tensor, while minimum-bias pro- respectively) of free energy estimates. This excess work tocols follow geodesics of a cubic Finsler metric. For 2 the simple model system of a Brownian particle in a In analogy with fluid dynamics, this rank-two tensor is quadratic trap with time-dependent stiffness (a breathing the Stokes' friction, since it produces a drag force that de- harmonic trap), the minimum-variance and minimum- pends linearly on velocity. We denote the Stokes' friction bias protocols can improve variance by a factor of 3 4 with superscript (1) since it is the leading-order contri- and bias by over a factor of 10 (Fig. 2). − bution to dissipation.27 Following parallel arguments, we approximate the third moment of the excess-work distribution as II. DERIVATION 3 d W Λ 3 (2) ex _ _ _ h i 2 ζijk[λ(t)]λi(t)λj(t)λk(t) ; (6) Consider a system in the canonical ensemble, in dt ≈ β thermal equilibrium with a heat bath at temper- for the rank-three tensor ature T . The probability distribution over mi- crostates x at control parameters λ is π(x λ) = ζ(2)[λ(t)] (7) β F λ E x; λ E x; λ j ijk ≡ exp [ ( ) ( )], with energy ( ) and free en- Z 1 Z 1 − P 2 00 000 00 000 ergy F (λ) kBT ln x exp [ βE(x; λ)]. Here β β dt dt δf (0)δf (t )δf (t ) : −1 ≡ − − ≡ i j k λ(t) (kBT ) for Boltzmann's constant kB. We define the − 0 0 h i work done in a single realization of an external agent changing the control parameters λ according to a proto- (The factor of three in (6) results from grouping the in- R t 0 _ 0 dex permutations ijk; jik; kij into one term, using the col Λ as W dt fiλi(t ), which implies the average f g ≡ − 0 invariance of the sum under exchange of indices, e.g., excess work (Wex W ∆F ) is (2) (2) _ _ _ _ _ _ ≡ − ζijkλiλjλk = ζjikλiλjλk.) We call the rank-three tensor Z t [Eq. (7)] the supra-Stokes' tensor and index it by super- 0 _ 0 Wex Λ = dt δfi Λλi(t ) ; (1) script (2), as it corresponds to the leading-order correc- h i − 0 h i tion to dissipation beyond the Stokes' friction (15). where we adopt the Einstein summation convention of For fourth and higher moments, Appendix B shows implied summation over repeated indices. A dot denotes that the integration bounds must remain finite since the the time derivative λ_ i dλi=dt, fi @λ U is the gen- n-time covariance functions do not decay to zero, so there ≡ ≡ − i eralized force conjugate to λi, and δfi fi fi eq is the is no clear analogy to frictional drag forces. Nevertheless, difference from the equilibrium average.≡ Angle−h i brackets we can still approximate them by Λ denote an average over the nonequilibrium ensem- h· · · i n n ble of system responses to control-parameter protocol Λ. d Wex Λ (n−1) Y h i n ν ···ν [λ(t); t] λ_ ν (t) ; (8) The time derivative of the second moment of the dt ≈ C 1 n i excess-work distribution is i=1 where we use index notation ν ; ν ; ν instead of i; j; k, d W 2 Z t 1 2 3 ex Λ _ 0 0 _ 0 and define integral n-time covariance··· functions h i = 2λi(t) dt δfi(t)δfj(t ) Λ λj(t ) : (2) dt 0 h i (n−1) ν ···ν [λ(t); t] (9) For a sufficiently slow protocol, we replace the nonequi- C 1 n ≡ n t * n + librium average Λ with the equilibrium average Y Z Y h· · · i ( β)n dt δf (0)δf (t ) : λ(t) at fixed control parameters λ(t): i ν1 νj j − 0 h· · · i i=2 j=2 λ(t) 2 Z t d Wex Λ 00 00 h i 2λ_ i(t)λ_ j(t) dt δfi(0)δfj(t ) ; (3) This implies that higher-order moments of the excess dt ≈ h iλ(t) 0 work are higher order in control-parameter velocity and where we used the stationarity of the equilibrium aver- are therefore smaller for slow protocols. The approxima- age, defined t00 t0 t, and assumed smooth protocols tions of (4), (6), and (8) are the leading-order contribu- to expand the control-parameter≡ − velocity to zeroth or- tions to each moment of the excess work.