Nucleic Acids: Theory and Computer Simulation, Y2K David L Beveridge* and Kevin J Mcconnell†
Total Page:16
File Type:pdf, Size:1020Kb
Sba213.qxd 03/27/2000 12:45 Page 182 182 Nucleic acids: theory and computer simulation, Y2K David L Beveridge* and Kevin J McConnell† Molecular dynamics simulations on DNA and RNA that include dependent upon judicious approximations of the underly- solvent are now being performed under realistic environmental ing empirical force-field, simulation protocols and system conditions of water activity and salt. Improvements to force- preparation. Run lengths of even nanoseconds, a consider- fields and treatments of long-range interactions have able recent technical accomplishment, are still not long significantly increased the reliability of simulations. New studies enough to access many nucleic acid phenomena of inter- of sequence effects, axis bending, solvation and est; however, much can and has been done. In this review, conformational transitions have appeared. we describe the recent literature on all-atom MD applied to uncomplexed DNA and RNA systems, with an empha- Addresses sis on studies published during 1998–2000, that serve to Chemistry Department and Molecular Biophysics Program, Wesleyan document simulation protocols and provide validations of University, Middletown, CT 06459, USA the current level of accuracy of the simulations. *e-mail: [email protected] †e-mail: [email protected] This review builds, in particular, on the most recent Current Opinion in Structural Biology 2000, 10:182–196 Current Opinion in Structural Biology review on this subject, 0959-440X/00/$ — see front matter that by Auffinger and Westhof [2], as well as a number of © 2000 Elsevier Science Ltd. All rights reserved. others [3–6]. A multi-author treatise on nucleic acid struc- ture edited by Neidle [7•] contains a number of valuable Abbreviations FDPB finite difference Poisson Boltzmann reviews, particularly one on MD by Miller et al. [8] and a •• GB generalized Born current overview by Lavery and Zakrzewska [9 ] on the MD molecular dynamics defining parameters of nucleic acid structure, in which NDB Nucleic Acid Database conflicted issues surrounding the definition of global and NOESY nuclear Overhauser enhancement spectroscopy local axis frames are resolved [10•]. Methods extending PB Poisson Boltzmann PME particle mesh Ewald MD to longer time frames are of considerable importance QM quantum mechanics to the field, but are beyond the scope of this review (for rmsd root mean square deviation information on this topic, see [11–13]). Schlick et al. [14] SA solvent accessibility have recently reviewed related algorithmic challenges. Introduction Methodology and validation studies The biological functions of nucleic acids — information Nucleic acids are an especially challenging problem to storage, replication and transcription, as well as novel cat- treat with MD simulation. DNA and RNA sequences are alytic events — are ultimately linked to the chemical polyelectrolytes at neutral and physiological pH, with a elements of the base pair sequence and the biophysical multiplicity of backbone phosphate groups fully ionized. elements of energetics, structure and molecular motion. Thus, electrostatic charge–charge interactions can be X-ray diffraction, NMR spectroscopy, fluorescence depo- expected to be a major component of the forces determin- larization, gel electrophoresis, microcalorimetry and other ing the structures and driving the MD modeling. techniques are used in experiments to probe the bio- Following Coulomb’s law, charge–charge interactions fall physics of nucleic acids. The next step, interpreting the off slowly with distance and are most susceptible to error experiments, requires the formulation of molecular models when numerical truncation is applied. In addition, nucleic consistent with observed results. The first and most acids have a generally larger surface to volume ratio than instinctive source of molecular models is chemical intu- proteins, making solvation effects generally more signifi- ition; however, competing forces determine almost all cant and, in some instances, a dominant effect. Thus, an phenomena and one usually cannot reliably intuit the accurate theoretical treatment of both long-range electro- resultant of opposing terms as the magnitudes are not well static interactions and solvent effects is necessary for known. Computational modeling via molecular simulation, proper MD modeling of nucleic acids. in which some quantification of the relative contributions of various terms is provided using methods derived from Structures proposed, from 1953 to the present, for B-form theoretical physical chemistry [1], has thus come to assume DNA, as derived from experimental data, are shown in an increasingly important role. The most rigorous method Figure 1. A number of characterization studies of MD model- for modeling macromolecules in solution with an all-atom ing of nucleic acids have focused on a DNA oligonucleotide representation of solute and solvent molecules is molecu- duplex with the sequence d(CGCGAATTCGCG). lar dynamics (MD) computer simulation. In principle, MD Dickerson and co-workers [15] reported a crystal structure provides a complete microscopic description of the atomic for this sequence in 1980, the first single crystal structure structure and motions of macromolecules and solvent. In of a Watson–Crick double helix (Figure 1c). This sequence practice, MD modeling is a work in progress, with results contains the recognition site for the EcoRI restriction Sba213.qxd 03/27/2000 12:45 Page 183 Theory and computer simulation of nucleic acids Beveridge and McConnell 183 Figure 1 Fifty years of DNA structures. (a) Watson and Crick’s proposed structure, 1953 [159]. (b) Structure of canonical B-form DNA obtained using fiber diffraction in 1976 [160]. (c) X-ray crystal structure of the EcoRI DNA dodecamer determined in 1980 [15]. (d) Structure of the EcoRI DNA dodecamer obtained from NMR refinement, 2000 (PH Bolton et al., unpublished data). endonuclease and is henceforth referred to in this review reviewed by Sagui and Darden [26]. An independent con- as the ‘EcoRI dodecamer’. Structures of many other DNA firmation of the performance of PME versus P3M is and RNA sequences have now been reported from crystal- provided by Deserno and Holm [27]. At this point, most lography [16] and serve as a basis for comparison with MD simulations utilize the PME or force-shifted method results from MD modeling. An ongoing issue is the extent for treating long-range interactions, rather than using trun- to which these structures are influenced by crystal packing cated potentials. The latter is known to produce artifacts at forces [17,18] and, thus, may not be good indicators of the the cut-offs [28], although it has also been demonstrated structure of DNA or RNA in solution. Also, nucleic acid that feathering the potentials can minimize the problem structural indices from experiment tend to be broadly dis- [25,29,30•,31,32••]. The Ewald model is strictly applicable tributed [19,20] and issues not only of static, but also of to treatments of crystalline solids, but is used to model solu- dynamical structure must be taken into consideration. tions by making the simulation cell large and mostly solvent, net neutral and positioning the solute at the center. Comprehensive reviews of the literature on MD modeling of DNA and RNA pre-1995 are available [21]. This earli- The first MD calculations on DNA, as well as proteins, er work is now of primary interest as examples of the using PME showed significantly improved dynamical sta- difficulties in making a proper MD model of nucleic acids bility; however, solution models developed on the basis of using ‘first generation’ force-fields, distance-dependent PME could be subject to spurious long-range correlations dielectric screening models of solvent and the truncation caused by the quasi-crystalline boundary conditions. of long-range interactions. The development of the parti- Darden et al. [33] studied possible artifacts in the solvation cle mesh Ewald (PME) method [22] for long-range forces free energy of small ions using PME compared with using rendered the classic Ewald treatment of periodic bound- cluster models with spherical cut-offs. Significant differ- ary conditions tractable for general purpose MD. In 1995 ences were found and attributed to the effect of the came the publication of three new complete all-atom vacuum/solvent interface in cluster models. Hunenberger force-fields [23–25], including a full set of parameters for and McCammon [34,35••] described potential artifacts in nucleic acids, designed particularly for simulations includ- conformational preferences using PME if the system is not ing explicit consideration of solvent — termed ‘second neutral or exhibits large effective charge separation. generation’ force-fields [23]. Here, we provide an update Examples of potential PME problems concerning the sol- on developments in each of these areas and on progress vation free energy of a spherical ion and the potential of towards a simplified treatment of solvent on the basis of mean force between two spherical ions were provided, continuum electrostatics. both examples having potential implications for MD on DNA and RNA using PME. Bogusz et al. [36•] have Particle mesh Ewald treatment of long-range interactions described a method to remove artifacts of energy and pres- Advances in the Ewald and PME methodology, including sure caused by the use of the PME method for systems consideration of the improved performance of the parti- with a net charge using net charge correction