Absolute Binding Free Energy Calculations Using Molecular Dynamics Simulations with Restraining Potentials
Total Page:16
File Type:pdf, Size:1020Kb
2798 Biophysical Journal Volume 91 October 2006 2798–2814 Absolute Binding Free Energy Calculations Using Molecular Dynamics Simulations with Restraining Potentials Jiyao Wang,* Yuqing Deng,y and BenoıˆtRoux*y *Institute of Molecular Pediatric Sciences, Gordon Center for Integrative Science, University of Chicago, Chicago, Illinois; and yBioscience Division, Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, Illinois ABSTRACT The absolute (standard) binding free energy of eight FK506-related ligands to FKBP12 is calculated using free energy perturbation molecular dynamics (FEP/MD) simulations with explicit solvent. A number of features are implemented to improve the accuracy and enhance the convergence of the calculations. First, the absolute binding free energy is decomposed into sequential steps during which the ligand-surrounding interactions as well as various biasing potentials restraining the translation, orientation, and conformation of the ligand are turned ‘‘on’’ and ‘‘off.’’ Second, sampling of the ligand conformation is enforced by a restraining potential based on the root mean-square deviation relative to the bound state conformation. The effect of all the restraining potentials is rigorously unbiased, and it is shown explicitly that the final results are independent of all artificial restraints. Third, the repulsive and dispersive free energy contribution arising from the Lennard-Jones interactions of the ligand with its surrounding (protein and solvent) is calculated using the Weeks-Chandler-Andersen separation. This separation also improves convergence of the FEP/MD calculations. Fourth, to decrease the computational cost, only a small number of atoms in the vicinity of the binding site are simulated explicitly, while all the influence of the remaining atoms is incorporated implicitly using the generalized solvent boundary potential (GSBP) method. With GSBP, the size of the simulated FKBP12/ligand systems is significantly reduced, from ;25,000 to 2500. The computations are very efficient and the statistical error is small (;1 kcal/mol). The calculated binding free energies are generally in good agreement with available experimental data and previous calculations (within ;2 kcal/mol). The present results indicate that a strategy based on FEP/MD simulations of a reduced GSBP atomic model sampled with conformational, translational, and orientational restraining potentials can be computationally inexpensive and accurate. INTRODUCTION Molecular recognition phenomena involving the association based on FEP/MD simulations for protein-ligand interac- of ligands to macromolecules with high affinity and spec- tions could become a useful tool in drug discovery and ificity play a key role in biology (1–3). Although the fun- optimization (15–22). Nonetheless, despite outstanding de- damental microscopic interactions giving rise to bimolecular velopments in simulation methodologies (23), carrying out association are relatively well understood, designing com- FEP/MD calculations of large macromolecular assemblies putational schemes to accurately calculate absolute binding surrounded by explicit solvent molecules often remains free energies remains very challenging. Computational ap- computationally prohibitive. For this reason, it is necessary proaches currently used for screening large databases of to seek ways to decrease the computational cost of FEP/MD compounds to identify potential lead drug molecules must calculations while keeping them accurate. rely on very simplified approximations to achieve the needed To simulate accurately the behavior of molecules, one computational efficiency (4). Nonetheless, the calculated must be able to account for the thermal fluctuations and free energies ought to be very accurate to have any predictive the environment-mediated interactions arising in diverse value. Furthermore, the importance of solvation in scoring and complex systems (e.g., a protein binding site or bulk ligands in molecular docking has been stressed previously solution). In FEP/MD simulations, the computational cost is (5). generally dominated by the treatment of solvent molecules. In principle, free energy perturbation molecular dynamics Computational approaches at different level of complexity (FEP/MD) simulations based on atomic models are the most and sophistication have been used to describe the influence powerful and promising approaches to estimate binding free of solvent on biomolecular systems (24). Those range from energies of ligands to macromolecules (6–11). Indeed, test MD simulations based on all-atom models in which the calculations have shown that FEP/MD simulations can be solvent is treated explicitly (10,25), to Poisson-Boltzmann more reliable than simpler scoring schemes to compute rela- (PB) continuum electrostatic models in which the influence tive binding affinities in important biological systems (12,13), of the solvent is incorporated implicitly (24,26). There are also and that it can naturally handle the influence of solvent and semianalytical approximations to continuum electrostatics, dynamic flexibility (14). There is a hope that calculations such as generalized Born (27–31), as well as empirical treat- ments based on solvent-exposed surface area (32–40). How- ever, even though such approximations are computationally Submitted March 1, 2006, and accepted for publication June 27, 2006. convenient, they are often of unknown validity when they Address reprint requests to Prof. Benoıˆt Roux, Tel.: 773-834-3557; E-mail: are applied to a new situation. [email protected]. Ó 2006 by the Biophysical Society 0006-3495/06/10/2798/17 $2.00 doi: 10.1529/biophysj.106.084301 Binding Free Energy Calculations 2799 An intermediate approach, which combines some aspects ferent computational strategies to estimate binding free ener- of both explicit and implicit solvent treatments (41–43), con- gies (62–65). This study is part of an ongoing collaborative sists in simulating a small number of explicit solvent mole- effort involving two other groups (Pande (63) and J. A. cules in the vicinity of a region of interest, while representing McCammon, personal communication, 2005) with the goal the influence of the surrounding solvent with an effective of comparing the results of calculations based on different solvent-boundary potential (41–50). Such an approximation treatments and approximations but using the same force field is an attractive strategy to decrease the computational cost of (AMBER). Pande and co-workers (63) and Shirts (64) MD/FEP computations because binding specificity is often carried out extensive all-atom free energy perturbation (FEP) dominated by local interactions in the vicinity of the ligand molecular dynamics (MD) simulations. With the same system, while the remote regions of the receptor contribute only in an J. M. Swanson and J. A. McCammon (personal communica- average manner. The method used in this study is called the tion, 2005) used molecular mechanics/Poisson-Boltzmann- generalized solvent boundary potential (GSBP) (43). GSBP and-surface-area (MM-PBSA), a popular approach that relies includes both the solvent-shielded static field from the on a mixed scheme combining configurations sampled from distant atoms of the macromolecule and the reaction field molecular dynamics (MD) simulations with explicit solvent, from the dielectric response of the solvent acting on the with free energy estimators based on an implicit continuum atoms of the simulation region. GSBP is a generalization of solvent model (66). spherical solvent boundary potential, which was designed to In the next two sections, the theoretical formulation and simulate a solute in bulk water (41). In the GSBP method, all the computational details are given. Then, all the results of atoms in the inner region belonging to ligand, macromole- the computations are presented and discussed in the follow- cule, or solvent can undergo explicit dynamics, whereas the ing section. The article ends with a brief conclusion sum- influence of the macromolecular and solvent atoms outside marizing the main points. the inner region are included implicitly. It is also possible to reduce the computational cost of FEP/ MD simulations and even improve their accuracy by using METHODS a number of additional features. For example, the Weeks Chandler Andersen (WCA) separation of the Lennard-Jones Theoretical formulation potential can be used to efficiently calculate the free energy The theoretical formulation for the equilibrium binding constant used here contribution arising from the repulsive and dispersive inter- was previously elaborated in Deng and Roux (52). Briefly, the equilibrium actions (51,52). Furthermore, biasing potentials restraining binding constant Kb for the process corresponding to the association of a the translation, orientation, and conformation of the ligand ligand L to a protein P, L 1 P LP, can be expressed as R R can help enhance the convergence of the calculations (17,21, ÿ dðLÞ dðXÞe bU 22,53,54). Such a procedure can provide correct results as K ¼ R site R ; (1) b ðLÞ ðr ÿ rÃÞ ðXÞ ÿbU long as the effect of all the restraining potentials is rigorously bulk d d L d e taken into account and unbiased. Combining these elements where L represents the coordinates of the ligand (only a single ligand needs yields the present computational strategy, which consists in to be considered at low concentration), X represents the coordinate of the [ FEP/MD simulations of a reduced GSBP atomic model with solvent and the protein,