Parallel Tempering: Theory, Applications, and New Perspectives PCCP

INVITED ARTICLE www.rsc.org/pccp Parallel tempering: Theory, applications, and new perspectives PCCP David J. Earlab and Michael W. Deema a Departments of Bioengineering and Physics & Astronomy, Rice University, 6100 Main Street MS142, Houston, Texas, 77005, USA. E-mail: [email protected] b Rudolf Peierls Centre for Theoretical Physics, Oxford University, 1 Keble Road, Oxford, UK OX1 3NP. E-mail: [email protected] Received 13th July 2005, Accepted 11th August 2005 First published as an Advance Article on the web 5th September 2005 We review the history of the parallel tempering simulation method. From its origins in data analysis, the parallel tempering method has become a standard workhorse of physicochemical simulations. We discuss the theory behind the method and its various generalizations. We mention a selected set of the many applications that have become possible with the introduction of parallel tempering, and we suggest several promising avenues for future research. 1. Introduction is also the case that parallel tempering can make efficient use of large CPU clusters, where different replicas can be run in The origins of the parallel tempering, or replica exchange, parallel. An additional benefit of the parallel tempering meth- simulation technique can be traced to a 1986 paper by Swend- 1 od is the generation of results for a range of temperatures, sen and Wang. In this paper, a method of replica Monte Carlo which may also be of interest to the investigator. It is now was introduced in which replicas of a system of interest were widely appreciated that parallel tempering is a useful and simulated at a series of temperatures. Replicas at adjacent powerful computational method. temperatures undergo a partial exchange of configuration One of the debated issues in parallel tempering regards the information. The more familiar form of parallel tempering details of the exchange, or swapping, of configurations between with complete exchange of configuration information was 2 replicas. Pertinent questions include how many different repli- formulated by Geyer in 1991. Initially, applications of the cas and at what temperatures to use, how frequently swaps new method were limited to problems in statistical physics. should be attempted, and the relative computational effort to However, following Hansmann’s use of the method in Monte expend on the different replicas. Another emerging issue is how Carlo simulations of a biomolecule,3 Falcioni and Deem’s use 4 top swapffiffiffiffi only part of the system, so as to overcome the growth of parallel tempering for X-ray structure determination, and as N of the number of replicas required to simulate a system Okamoto and Sugita’s formulation of a molecular dynamics 5 of size N. We address these points of controversy in this review. version of parallel tempering, the use of parallel tempering in The widespread use of parallel tempering in the simulation fields spanning physics, chemistry, biology, engineering and field has led to the emergence of a number of new issues. It has materials science rapidly increased. become clear that temperature may not always be the best The general idea of parallel tempering is to simulate M parameter to temper, and parallel tempering can be conducted replicas of the original system of interest, each replica typically in the canonical ensemble, and usually each replica at a different temperature. The high temperature systems are gen- erally able to sample large volumes of phase space, whereas low temperature systems, whilst having precise sampling in a local region of phase space, may become trapped in local energy minima during the timescale of a typical computer simulation. Parallel tempering achieves good sampling by allowing the systems at different temperatures to exchange complete configurations. Thus, the inclusion of higher temperature systems ensures that the lower temperature systems can access a representative set of low-temperature regions of phase space. This concept is illustrated in Fig. 1. Simulation of M replicas, rather than one, requires on the order of M times more computational effort. This ‘extra expense’ of parallel tempering is one of the reasons for the initially slow adoption of the method. Eventually, it became clear that a parallel tempering simulation is more than 1/M Fig. 1 2-D representation of phase space. A simulation at lower times more efficient than a standard, single-temperature Monte temperatures can become trapped in a non-representative sample of Carlo simulation. This increased efficiency derives from allow- the low free energy minima (shaded regions). At higher temperatures, a ing the lower temperature systems to sample regions of phase simulation can sample more of phase space (light plus shaded areas). Configuration swaps between the lower and higher temperature sys- space that they would not have been able to access had regular 10.1039/b509983h tems allow the lower temperature systems to escape from one region of sampling been conducted for a single-temperature simulation phase space where they were effectively ‘stuck’ and to sample a DOI: that was M times as long. While not essential to the method, it representative set of the low free energy minima. 3910 Phys.Chem.Chem.Phys.,2005, 7 , 3910–3916 This journal is & The Owner Societies 2005 with order parameters other than temperature, such as pair potentials or chemical potentials. Of interest is how to choose the order parameter whose swapping will give the most efficient equilibration. It has also become apparent that multi-dimensional parallel tempering is possible. That is, swapping between a number of parameters in the same simulation, in a multi- dimensional space of order parameters, is feasible and some- times advised. The improvement in sampling resulting from the use of parallel tempering has revealed deficiencies in some of the most popular force fields used for atomistic simulations, and it would seem that the use of parallel tempering will be essential in tests of new and improved force fields. Fig. 2 Schematic representation of parallel tempering swaps between Parallel tempering can be combined with most other simula- adjacent replicas at different temperatures. In-between the swaps, tion methods, as the exchanges, if done correctly, maintain the several constant-temperature Monte Carlo moves are performed. detailed balance or balance condition of the underlying simulation. Thus, there is almost an unlimited scope for the where C is the heat capacity at constant volume, which is utilization of the method in computer simulation. This leads v assumed to be constant in the temperature range between b to intriguing possibilities, such as combining parallel tempering i and b . Simply put, the acceptance rate for the trials depends on with quantum methods. j the likelihood that the system sampling the higher temperature happens to be in a region of phase space that is important at 2. Theory the lower temperature. This theoretical analysis of the accep- 2.1 Theory of Monte Carlo parallel tempering tance rates becomes useful when considering the optimal choice of temperatures for a parallel tempering simulation In a typical parallel tempering simulation we have M replicas, (see section 2.3). each in the canonical ensemble, and each at a different temperature, Ti. In general T1 o T2 o ... o TM, and T1 is normally the temperature of the system of interest. Since the 2.2 Theory of molecular dynamics parallel tempering replicas do not interact energetically, the partition function of In Monte Carlo implementations of parallel tempering, we this larger ensemble is given by need only consider the positions of the particles in the simulation. In molecular dynamics, we must also take into account YM q R Q ¼ i drN exp½b UðrN Þ; ð1Þ the momenta of all the particles in the system. Sugita and N! i i i i¼1 Okamoto proposed a parallel tempering molecular dynamics method in which after an exchange, the new momenta for N 3/2 (i) where qi ¼ Pj¼1(2pmjkBTi) comes from integrating out the replica i, p 0, should be determined as N momenta, mj is the mass of atom j, ri specifies the positions of rffiffiffiffiffiffiffiffiffiffi the N particles in system i, b ¼ 1/(k T ) is the reciprocal 0 T i B i pðiÞ ¼ newpðiÞ; ð4Þ temperature, and U is the potential energy, or the part of the Told Hamiltonian that does not involve the momenta. If the prob- (i) ability of performing a swap move is equal for all conditions, where p are the old momenta for replica i, and Told and Tnew exchanges between ensembles i and j are accepted with the are the temperatures of the replica before and after the swap, 5 probability respectively. This procedure ensures that the average kinetic energy remains equal to 3Nk T. The acceptance criterion for N N 2 B A ¼ min{1, exp[þ(bi À bj)(U(ri ) À U(rj ))]}. (2) an exchange remains the same as for the MC implementation Swaps are normally attempted between systems with adjacent (eqn (2)) and satisfies detailed balance. When doing parallel tempering molecular dynamics, one temperatures, j ¼ i þ 1. Parallel tempering is an exact method in statistical me- must take care in the interpretation of the results. A parallel chanics, in that it satisfies the detailed balance or balance tempering exchange is an ‘unphysical’ move, and so one cannot condition,6 depending on the implementation. This is an im- draw conclusions about dynamics. That is, when using parallel portant advantage of parallel tempering over simulated anneal- tempering molecular dynamics, one is only really doing a form ing, as ensemble averages cannot be defined in the latter of sampling and not ‘true’ molecular dynamics. method. Parallel tempering is complementary to any set of Monte Carlo moves for a system at a single temperature, and 2.3 Optimal choice of temperatures such single-system moves are performed between each attempted swap.

Parallel Tempering: Theory, Applications, and New Perspectives PCCP

Parallel-Tempered Stochastic Gradient Hamiltonian Monte Carlo for Approximate Multimodal Posterior Sampling

Chapter 2 Monte Carlo Simulations

LAMMPS – an Object Oriented Scientific Application

Desmond Users Guide Release 3.4.0 / 0.7

Hybrid Parallel Tempering and Simulated Annealing Method

The American Ceramic Society 38Th International Conference

Comparing Monte Carlo Methods for Finding Ground States of Ising Spin

Multiscale Implementation of Infinite-Swap Replica Exchange Molecular Dynamics

PLUMED Tutorial

Conditions for Rapid Mixing of Parallel and Simulated Tempering on Multimodal Distributions

Qualitative Properties of Parallel Tempering and Its Infinite Swapping

Compare LAAMPS and HOOMD