20 Years of Polarizable Force Field Development

Total Page:16

File Type:pdf, Size:1020Kb

Load more

2 0 Y E A R S O F P O L A R I Z A B L E
F O R C E F I E L D D E V E LO P M E N T

for biomolecular systems

thor van heesch

Supervised by Daan P. Geerke and Paola Gori-Giorg

1

contents

2

contents

12

Introduction

3448

Force fields: Basics, Caveats and Extensions 2.1 The Classical Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2 The caveats of point-charge electrostatics . . . . . . . . . . . . . . . . . 2.3 Physical phenomenon of polarizability . . . . . . . . . . . . . . . . . . 10 2.4 Common implementation methods of electronic polarization . . . . . 11 2.5 Accounting for anisotropic interactions . . . . . . . . . . . . . . . . . . 14

3

Knitting the reviews and perspectives together

17

3.1 Polarizability: A smoking gun? . . . . . . . . . . . . . . . . . . . . . . . 17 3.2 New branches of electronic polarization . . . . . . . . . . . . . . . . . 18 3.3 Descriptions of electrostatics . . . . . . . . . . . . . . . . . . . . . . . . 19 3.4 Solvation and polarization . . . . . . . . . . . . . . . . . . . . . . . . . . 20 3.5 The rise of new challenges . . . . . . . . . . . . . . . . . . . . . . . . . . 21 3.6 Parameterization or polarization? . . . . . . . . . . . . . . . . . . . . . . 22 3.7 Enough response: how far away? . . . . . . . . . . . . . . . . . . . . . . 23 3.8 The last perspectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 3.9 A new hope: the next-generation force fields . . . . . . . . . . . . . . . 29

4

Learning with machines

29

4.1 Replace the functional form with machine learned force fields . . . . . 31 4.2 A different take on polarizable force fields . . . . . . . . . . . . . . . . 33 4.3 Are transferable parameters an universal requirement? . . . . . . . . . 33 4.4 The difference between derivation and prediction . . . . . . . . . . . . 34 4.5 From small molecules to long range interactions . . . . . . . . . . . . . 36 4.6 Enough knowledge to fold a protein? . . . . . . . . . . . . . . . . . . . 37 4.7 Boltzmann generators, a not so hypothetical machine anymore . . . . 38

5

Summary: The Red Thread

41

introduction

3

abstract

In this literature study we aimed to answer the following question: What has changed in the outlook on polarizable force field development during the last 20 years? The theory, history, methods, and applications of polarizable force fields have been discussed to address this question. This investigation showed that the quality of the force field potential is detrimental for any type of atomistic sampling technique. If the underlying energy function contains flaws, these flaws are one way or another embedded in the fast amount of data we seek to understand to reproduce a wide range of quantifiable observables accurately. Subsequently, we identified two key challenges associated with the development of polarizable force fields: First, how to determine transferable and accurate parameters sets, and second, how to advance the underlying physical model without computational overhead. After twenty year the classical avenue of force field development has the modeling community finally made the transition towards a general acceptance of the need to develop more physically sound models. During the last 5 years machine learning techniques emerged to provide new means to remove bottlenecks in the current process towards the development of an accurate force field potential.

1 introduction

Next-generation atomistic force fields include polarization effects for the simulation of biomolecular systems. How this apparent change happened is a different story, to explain this generational transition we begin our study in the late 1950s. Physicists Bernie Alder and Thomas Wainwright were the first to translate digital computation into the study of many particle systems.[1] Eventually, the offspring of their research brought the simulation method called molecular dynamics (MD) to reality. Currently, classical MD simulation methods are being applied to study a multitude of physical, chemical, and biological systems, ranging from pure liquids to large complex systems such as proteins and cell membranes. [2, 3] As a result atomistic simulations have become an important tool to understand fundamental processes of biological systems.
Since the pioneering work of Alder and Wainwright, computing performance has increased by more than trillion fold.[4] This rapid development of digital machines lead to the expansion of system sizes and increase of timescales. As this was not the only advancement, since technological advancement inspired in an equal manner the drive to search for faster, more efficient and accurate underlying physical models for our simulation methods. In response a diverse set of atomistic simulations methods developed (co-)independently for the simulation of electrolytes, ionic liquids, metal organic frame works, biomolecular systems, and other types of nano-materials.
In this study we will keep our attention focused on the simulation of biomolecular systems. For this particular simulation field, the inclusion of explicit polarization effect has dominated the evolutionary process towards obtaining an improved description of the systems under investigation. To understand the reasons why the biomolecular simulation community chose to include polarization effects into the

force fields: basics, caveats and extensions

4

atomistic model, we will let ourselves get inspired by the following remarkable observation: The number of reviews that specifically address recent developments in polarizable force fields is numerous: [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16]. This (incomplete) selection of publications account together for more than 2000 citations according to Google Scholar. The period over which these reviews span, from 2001 till the present, amounts to almost two decades. Therefore, it might be fruitful to ask ourselves the following question: what has changed in the outlook on polarizable force field development during the last 20 years?
First, this approach will allow us to assess the progress made and may show us the edge of where polarizable force fields are right now. Second, this method could expose whether there are persistent factors that hamper our progress towards a polarizable force field for general use in biomolecular simulation. After the evaluation of this question we will discuss the open ends and look forward to how these new challenges can be solved. This will include the exploration of how Machine Learning (ML) methods and novel sampling techniques can aid towards the development of general use biomolecular force fields. Finally, we will summarise these efforts and provide an outlook on the current status of polarizable force field development for biomolecular simulations.

2 force fields: basics, caveats and extensions

Before we will proceed with an analysis of the aforementioned reviews we will provide the necessary background information to understand what is about to be discussed. The next sections will therefore explain the basics behind force fields together with the caveats of choosing a model that is based on fixed point charges, i.e., a non-polarizable force field. The first section discusses the components of the potential-energy function in terms of bonded and non-bonded interactions and seemingly continues with the caveats related to this classical approach. This next section will focus on the methods used to refine the model for its intent and purpose. Finally, the contributions of anisotropic charge distributions are discussed in the last section.

2.1 The Classical Approach

In molecular dynamics (MD) simulations are the atoms more often than not treated as point-like particles. Their reciprocal interactions in combination with the dynamic equations of motion determine how the system will evolve over time. This simple approach is well suited to simulate the collective behavior of atoms in molecular structures and ensembles. Moreover, when a few assumptions are set aside, this level of theory can determine both the micro- and macroscopic properties of the respective system within the line of expectations. [17]
However, there is still plenty of room for uncertainty to develop during the course of the simulation. The biggest assumption that makes atomistic simulations to some extent a conjecture is the lack of an explicit expressions for electrons. In addition, the lack hereof assumes that the system is always in the electronic ground state.

force fields: basics, caveats and extensions

5

As it where, these factors seem like a major shortcoming for creating a faithful molecular model. But, the simplicity of the atomistic model is also its greatest strength. Accounting for the electronic behavior implicitly allows for long simulation times up to the regime of µs, while maintaining relative low computational cost.[18, 19, 20] Given the reason for treating atoms as simple particles, how do we resolve the need for inclusion of fundamental electrostatic effects without turning to on-the-fly quantum mechanical calculations?
Atomistic simulation methods have developed an implicit way to account for the existence and effects of electrons. The electronic energy is formulated as a parametric function of the nuclear coordinates and corresponding parameters are subsequently fitted to experimental or higher level computational data. Such a parameterised potential that describes all forces in the system is called a force field. This description allows us to think more pragmatic about what chemistry conveys in molecular simulations. Simply put, chemistry becomes the knowing of the energy as a function of nuclear coordinates and molecular properties become the knowing of how the energy changes upon adding a perturbation to the system. [17] Determining the physics using this approach is justified in classical mechanics, because this is the basis of methods that give access to the Boltzmann weighted ensembles from which macroscopic properties of the system directly follow. [21] Still, there are numerous ways to construction of such energy functions. In the following section we will therefore discuss the most important components for the construction of a force field.

2.1.1 An atomistic-potential energy function

Figure 1: Schematic illustration of the terms in a classical fixed-charge force field, i.e. bond stretching (Ebond), bond-angle bending (Eangle), dihedral- angle torsion (Etorsion), and improper dihedral-angle bending (Eimproper) as well as van der Waals (EvdW) and electrostatic (Eele) interactions. Figure courtesy to ref. [22]

The majority of force fields describe the interactions between the atoms in the system via a potential energy function of the atomic coordinates. The total molecular potential energy is then composed of many terms and its exact form is unknown. Today classical biomolecular force fields use essentially the same gross approximation for the potential energy as proposed by Levitt and Lifson in 1969: [23]

E

pot(~r) = Espr(~r) + Eang(~r) + Edih(~r) + Eimp(~r) + EvdW(~r) + Eele(~r)

(1)
Here is Epot the potential-energy function of the system and ~r are the coordinates of all atoms in the system. Espr, Eang, Edih, Eimp are the bonded terms

force fields: basics, caveats and extensions

6

for bond stretching, bond-angle bending, dihedral-angle torsion, and improper dihedral-angle bending (or out-of-plane distortions) in the description of molecules. Together these terms account for all the energy contributions due to covalent interactions: Ecov = Espr(~r) + Eang(~r) + Edih(~r) + Eimp(~r). The interactions between atoms that are not directly connected via covalent bonds and bond angles consist

  • of the potential-energy term for all nonbonded interactions: Enb = EvdW + Eele
  • .

Where the van der Waals term, EvdW, accounts for dispersion and core-core repulsion, and the latter term, Eele, for the electrostatic interactions of the system. Together with the potential-energy terms for the covalent interactions this again sums up to the full potential-energy expression of the force field:

Epot = Ecov + Enb

(2)
See figure 1 for a schematic illustration of the terms in a classical fixed-charged force field.
Without question, additional terms can be added to this general expression of the Epot, e.g., in the early days hydrogen-bonding interaction terms, or cross-terms that describe coupling between the first three terms of the covalent interactions and other types of restraining potential-energy functions could be utilized to prevent the system from drifting away. [17] However, these extra descriptors are usually correction terms to account for omissions made in the other terms to still obtain realistic free energy profiles, instead of improving the description of the other terms. Despite the major importance of all the electronic effects, the partial charges of the atoms are fixed during simulations that are based upon classical force fields. Historically, this choice was made due to the high computational cost and added complexity associated with a more physics based force fields. [24]
Starting from the static decision the continuous development of fixed-charge force fields has actually shown to be surprisingly successful. A recent publication by S. Riniker illustrates the development of this class of force fields and provides an overview on the major force-field families, with detailed discussions about the different covalent and nonbonded force-field terms, parametrization strategies and the historic background of these models. [22] Therefore, the aspects mentioned above revolving around non-additive force fields will only be touched upon in this literature study. Instead, we will mainly focus on the effects and importance of extending the description of the classical electrostatics in biomolecular simulations.

2.1.2 Van der Waals potential

In (almost) all fixed-charged force-fields, the van der Waals interactions are described using a 12-6 Lennard-Jones (LJ) functional form

C12(i, j) C6(i, j)

EvijdW(rij) =

(3)

12 ij

r

r6ij

with C12(i, j) = 4ꢀijij)12, C6(i, j) = 4ꢀijij)6, where rij is the distance between atoms i and j, and where ꢀij denotes the depth of the attractive well and σij the inter-particle distance where the potential changes sign.[25, 26]
The function of van der Waals energy, Evdw, is to describe the repulsion or attraction between atoms that are not directly bonded. Often the Evdw is interpreted

force fields: basics, caveats and extensions

7

as the non-polar part of the interaction not related to electrostatic energy due to charges.[17] Hence, the atoms in the system are grouped into atom types, which all have respective LJ parameters and typically represent atoms in a specific chemical environment, for example, a nitrogen in an amine group or oxygen in a hydroxylgroup, and so forth.
However, it should be noted that this widely used non-bonded term also has some
(major) flaws embedded. Recently Frenkel et. al noted that over the course of time the LJ potential, which originally has been derived to describe the cohesive energy of crystals of noble gases, is used so often for systems where it is not expected to be particularly realistic that the disadvantages of the LJ potential have become very relevant these days.[27] The implications of the arguments made against the LJ 12-6 potential can be summarised as follows: the LJ 12-6 potential is anything but a welldefined standard and, in particular for proteins and nano-colloids it is not a good model. Mainly, because the range of attraction is too large compared to the effective diameter.
To top it off, Frenkel et al. argue there is not obvious indication of why a truncated
LJ 12-6 potential should have any special merits that outweigh its disadvantages.[27] Instead, they argue that no such advantage exists and present a simple model pair potential to unify all the different unambiguous truncation methods used in the past. In contrast to all the shifted, truncated and interpolated LJ 12-6 models, once a cut-off distance, rc, is specified the model is uniquely defined.1 If required, their class of LJ-like potentials could be used for numerical studies of systems of particles with short-ranged attraction. For example, a set of thermodynamic and transport properties is reported for the cases rc=2.0 (“atomic liquid”) and rc=1.2 (“colloidal suspension”).[27] In short, the main message is that we should be careful with continuing trends based old habits during the construction of next-generation force fields.

2.1.3 Electrostatic energy

The electrostatic interaction of the nonbonded interaction should account for the internal (re)distribution of the electrons, creating positive and negative parts of the molecule. [10] At the lowest order of approximation, this electronic behaviour can be modeled by assigning a bond dipole to the bond or, more conventionally, partial charges to each atom. For example, in classical fixed-charge force fields, this pairwise Coulomb interactions between the point charges of atoms i and j is considered as

qiqj

1

Eeijle(rij) =

(4)

4πε0ε1 rij

where q is the partial charge, ε0 the electric constant, ε1 the background dielectric permittivity, and rij the distance between atoms i and j.[22] The classical electrostatic interactions can be viewed as the remaining long-range part, (1r ), of the quantum mechanical electrostatic interactions, after all short-range contributions like bonds, repulsion and dispersion are removed. Still, obtaining a good description of the electrostatic interaction between molecules (or between different parts of the same molecule) is one of the big challenges in force field developments. [10] The

1 The united model potential vanishes quadratically at a cut-off distance rc and represents now specific substance. The authors stress that their model rather represent generic models. However, the authors

mention, I cite, "very often the ubiquitious LJ 12-6 potential is used in exactly the same way."[27]

force fields: basics, caveats and extensions

8

predictive ability of MD simulations relies on the accuracy of the underlying force field.

2.2 The caveats of point-charge electrostatics

From equation 3 it is evident that not all types of electrostatic interactions are accounted for, and as such the deficiencies in the electrostatic potential that result from the fixed partial atomic charge approximation have been widely acknowledged over the past decade.[11, 10, 28, 12, 29, 9, 13, 30] Both Jensen and Hagler independently made special efforts to summarize all the components that currently are unaccounted for in most “standard” force fields and are likely required to achieve experimental accuracy in (bio)molecular force fields.[17, 31] The following 8 points summarize the combined efforts in addressing the caveats associated with fixedpoint charge force fields:

i) Electronic polarization effects are not account for, as fixed-point charge models only two-body interactions are included. However, for polar bodies the three-body contribution is quite significant, perhaps 10–20% of the two-body term. [32] Moreover, the response of the molecular dipole moments to variances in dielectric conditions are not taken into account by the fixed charge approximation models.[33, 34]

ii) The geometric dependence of charge and higher order multi-poles is omitted. [35,
36] Studies have shown that both partial atomic charges and higher-order electric moments have a significant dependence on the geometry. Consequently, these quantities cannot fulfill the requirement of transferability in parameter sets.

iii) The effects and energetics associated with charge transfer (CT) that occur between compounds, as for example in H-bonded complexes, are not accounted for in additive force fields. [37, 38, 39]

iv) The partial charge model cannot correctly model the electrostatics surrounding a molecule, resulting in errors in the electrostatic potential ranging between 10 to 20 kJ/mol.[17]

From section 2.1.2 we already concluded that the use of the standard LJ potential is not really justified. In addition, the following points will specifically address caveats hidden in the van der Waals potential and why these components should be accounted for or revisited.

v) The r−6 dispersion is an approximation omitting higher order terms, which have been indicated to be important for close packed systems such as globular proteins. [40, 41]

Recommended publications
  • Melissa Gajewski & Jonathan Mane

    Melissa Gajewski & Jonathan Mane

    Melissa Gajewski & Jonathan Mane Molecular Mechanics (MM) Methods Force fields & potential energy calculations Example using noscapine Molecular Dynamics (MD) Methods Ensembles & trajectories Example using 18-crown-6 Quantum Mechanics (QM) Methods Schrödinger’s equation Semi-empirical (SE) Wave Functional Theory (WFT) Density Functional Theory (DFT) Hybrid QM/MM & MD Methods Comparison of hybrid methods 2 3 Molecular Mechanics (MM) Methods Force fields & potential energy calculations Example using noscapine Molecular Dynamics (MD) Methods Ensembles & trajectories Example using 18-crown-6 Quantum Mechanics (QM) Methods Schrödinger’s equation Semi-empirical (SE) Wave Functional Theory (WFT) Density Functional Theory (DFT) Hybrid QM/MM & MD Methods Comparison of hybrid methods 4 Useful for all system size ◦ Small molecules, proteins, material assemblies, surface science, etc … ◦ Based on Newtonian mechanics (classical mechanics) d F = (mv) dt ◦ The potential energy of the system is calculated using a force field € 5 An atom is considered as a single particle Example: H atom Particle variables: ◦ Radius (typically van der Waals radius) ◦ Polarizability ◦ Net charge Obtained from experiment or QM calculations ◦ Bond interactions Equilibrium bond lengths & angles from experiment or QM calculations 6 All atom approach ◦ Provides parameters for every atom in the system (including hydrogen) Ex: In –CH3 each atom is assigned a set of data MOLDEN MOLDEN (Radius, polarizability, netMOLDENMOLDENMOLDENMOLDENMOLDEN charge,
  • Tinker-HP : Accelerating Molecular Dynamics Simulations of Large Complex Systems with Advanced Point Dipole Polarizable Force Fields Using Gpus and Multi-Gpus Systems

    Tinker-HP : Accelerating Molecular Dynamics Simulations of Large Complex Systems with Advanced Point Dipole Polarizable Force Fields Using Gpus and Multi-Gpus Systems

    Tinker-HP : Accelerating Molecular Dynamics Simulations of Large Complex Systems with Advanced Point Dipole Polarizable Force Fields using GPUs and Multi-GPUs systems Olivier Adjoua,y Louis Lagardère*,y,z Luc-Henri Jolly,z Arnaud Durocher,{ Thibaut Very,x Isabelle Dupays,x Zhi Wang,k Théo Jaffrelot Inizan,y Frédéric Célerse,y,? Pengyu Ren,# Jay W. Ponder,k and Jean-Philip Piquemal∗,y,# ySorbonne Université, LCT, UMR 7616 CNRS, F-75005, Paris, France zSorbonne Université, IP2CT, FR2622 CNRS, F-75005, Paris, France {Eolen, 37-39 Rue Boissière, 75116 Paris, France xIDRIS, CNRS, Orsay, France kDepartment of Chemistry, Washington University in Saint Louis, USA ?Sorbonne Université, CNRS, IPCM, Paris, France. #Department of Biomedical Engineering, The University of Texas at Austin, USA E-mail: [email protected],[email protected] arXiv:2011.01207v4 [physics.comp-ph] 3 Mar 2021 Abstract We present the extension of the Tinker-HP package (Lagardère et al., Chem. Sci., 2018,9, 956-972) to the use of Graphics Processing Unit (GPU) cards to accelerate molecular dynamics simulations using polarizable many-body force fields. The new high-performance module allows for an efficient use of single- and multi-GPUs archi- 1 tectures ranging from research laboratories to modern supercomputer centers. After de- tailing an analysis of our general scalable strategy that relies on OpenACC and CUDA , we discuss the various capabilities of the package. Among them, the multi-precision possibilities of the code are discussed. If an efficient double precision implementation is provided to preserve the possibility of fast reference computations, we show that a lower precision arithmetic is preferred providing a similar accuracy for molecular dynamics while exhibiting superior performances.
  • Multiscale Friction Simulation of Dry Polymer Contacts: Reaching Experimental Length Scales by Coupling Molecular Dynamics and Contact Mechanics

    Multiscale Friction Simulation of Dry Polymer Contacts: Reaching Experimental Length Scales by Coupling Molecular Dynamics and Contact Mechanics

    Multiscale Friction Simulation of Dry Polymer Contacts: Reaching Experimental Length Scales by Coupling Molecular Dynamics and Contact Mechanics Daniele Savio ( [email protected] ) Freudenberg Technology Innovation SE & Co. KG https://orcid.org/0000-0003-1908-2379 Jannik Hamann Fraunhofer Institute for Mechanics of Materials: Fraunhofer-Institut fur Werkstoffmechanik IWM Pedro A. Romero Freudenberg Technology Innovation SE & Co. KG Christoph Klingshirn Freudenberg FST GmbH Ravindrakumar Bactavatchalou Freudenberg Technology Innovation SE & Co. KG Martin Dienwiebel Fraunhofer Institute for Mechanics of Materials: Fraunhofer-Institut fur Werkstoffmechanik IWM Michael Moseler Fraunhofer Institute for Mechanics of Materials: Fraunhofer-Institut fur Werkstoffmechanik IWM Research Article Keywords: Coupling Molecular Dynamics, Contact Mechanics, Multiscale Friction Posted Date: February 23rd, 2021 DOI: https://doi.org/10.21203/rs.3.rs-223573/v1 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Multiscale friction simulation of dry polymer contacts: reaching experimental length scales by coupling molecular dynamics and contact mechanics Daniele Savio1*, Jannik Hamann2, Pedro A. Romero1, Christoph Klingshirn3, Ravindrakumar Bactavatchalou1, Martin Dienwiebel2,4, Michael Moseler2,5 1 Freudenberg Technology Innovation SE & Co. KG, Weinheim, Germany 2 µTC Microtribology Center, Fraunhofer Institute for Mechanics of Materials IWM, Freiburg, Germany 3 Freudenberg FST GmbH, Weinheim, Germany 4 Karlsruhe Institute of Technology, Institute for Applied Materials, IAM-CMS, Karlsruhe, Germany 5 Institute of Physics, University of Freiburg, Freiburg, Germany *Corresponding author: [email protected] Abstract This work elucidates friction in Poly-Ether-Ether-Ketone (PEEK) sliding contacts through multiscale simulations. At the nanoscale, non-reactive classical molecular dynamics (MD) simulations of dry and water-lubricated amorphous PEEK-PEEK interfaces are performed.
  • Chem3d 17.0 User Guide Chem3d 17.0

    Chem3d 17.0 User Guide Chem3d 17.0

    Chem3D 17.0 User Guide Chem3D 17.0 Table of Contents Recent Additions viii Chapter 1: About Chem3D 1 Additional computational engines 1 Serial numbers and technical support 3 About Chem3D Tutorials 3 Chapter 2: Chem3D Basics 5 Getting around 5 User interface preferences 9 Background settings 10 Sample files 10 Saving to Dropbox 10 Chapter 3: Basic Model Building 12 Default settings 12 Selecting a display mode 12 Using bond tools 13 Using the ChemDraw panel 15 Using other 2D drawing packages 15 Building from text 16 Adding fragments 18 Selecting atoms and bonds 18 Atom charges 21 Object position 23 Substructures 24 Refining models 27 Copying and printing 29 Finding structures online 32 Chapter 4: Displaying Models 35 © Copyright 1998-2017 PerkinElmer Informatics Inc., All rights reserved. ii Chem3D 17.0 Display modes 35 Atom and bond size 37 Displaying dot surfaces 38 Serial numbers 38 Displaying atoms 39 Atom symbols 40 Rotating models 41 Atom and bond properties 44 Showing hydrogen bonds 45 Hydrogens and lone pairs 46 Translating models 47 Scaling models 47 Aligning models 47 Applying color 49 Model Explorer 52 Measuring molecules 59 Comparing models by overlay 62 Molecular surfaces 63 Using stereo pairs 72 Stereo enhancement 72 Setting view focus 73 Chapter 5: Building Advanced Models 74 Dummy bonds and dummy atoms 74 Substructures 75 Bonding by proximity 78 Setting measurements 78 Atom and building types 81 Stereochemistry 85 © Copyright 1998-2017 PerkinElmer Informatics Inc., All rights reserved. iii Chem3D 17.0 Building with Cartesian
  • “One Ring to Rule Them All”

    “One Ring to Rule Them All”

    OpenMM library “One ring to rule them all” https://simtk.org/home/openmm CHARMM * Abalone Presto * ACEMD NAMD * ADUN * Ascalaph Gromacs AMBER * COSMOS ??? HOOMD * Desmond * Culgi * ESPResSo Tinker * GROMOS * GULP * Hippo LAMMPS * Kalypso MD * LPMD * MacroModel * MDynaMix * MOLDY * Materials Studio * MOSCITO * ProtoMol The OpenM(olecular)M(echanics) API * RedMD * YASARA * ORAC https://simtk.org/home/openmm * XMD Goals of OpenMM l Complete library l provide what is most needed l easy to learn and use l Fast, general, extensible l optimize for speed l hide hardware specifics l support new hardware l add new force fields, integration methods etc. l Available APIs in C++, Fortran95, Python l Advantages l Object oriented l Modular l Extensible l Disadvantages l Programs written in other languages need to invoke it through a layer of C++ code Features • Electrostatics: cut-off, Ewald, PME • Implicit solvent • Integrators: Verlet, Langevin, Brownian, custom • Thermostat: Andersen • Barostat: MonteCarlo • Others: energy minimization, virtual sites 5 The OpenMM Architecture Public Interface OpenMM Public API Platform Independent Code OpenMM Implementation Layer Platform Abstraction Layer OpenMM Low Level API Computational Kernels CUDA/OpenCL/Brook/MPI etc. Public API Classes (1) l System l A collection of interaction particles l Defines the mass of each particle (needed for integration) l Specifies distance restraints l Contains a list of Force objects that define the interactions l OpenMMContext l Contains all state information for a particular simulation − Positions, velocities, other parmeters Public API Classes (2) l Force l Anything that affects the system's behavior: forces, barostats, thermostats, etc. l A Force may: − Apply forces to particles − Contribute to the potential energy − Define adjustable parameters − Modify positions, velocities, and parameters at the start of each time step l Existing Force subclasses − HarmonicBond, HarmonicAngle, PeriodicTorsion, RBTorsion, Nonbonded, GBSAOBC, AndersenThermostat, CMMotionRemover..
  • Organic & Biomolecular Chemistry

    Organic & Biomolecular Chemistry

    Organic & Biomolecular Chemistry Accepted Manuscript This is an Accepted Manuscript, which has been through the Royal Society of Chemistry peer review process and has been accepted for publication. Accepted Manuscripts are published online shortly after acceptance, before technical editing, formatting and proof reading. Using this free service, authors can make their results available to the community, in citable form, before we publish the edited article. We will replace this Accepted Manuscript with the edited and formatted Advance Article as soon as it is available. You can find more information about Accepted Manuscripts in the Information for Authors. Please note that technical editing may introduce minor changes to the text and/or graphics, which may alter content. The journal’s standard Terms & Conditions and the Ethical guidelines still apply. In no event shall the Royal Society of Chemistry be held responsible for any errors or omissions in this Accepted Manuscript or any consequences arising from the use of any information it contains. www.rsc.org/obc Page 1 of 6 CREATED USING THE RSC ARTICLEOrganic TEMPLATE & Biomolecular (VER. 3.0 OOo) - SEE WWW.RSC.ORG/ELECTRONICFILESChemistry FOR DETAILS www.rsc.org/xxxxxx | XXXXXXXX Expanding DP4: Application to drug compounds and automation Kristaps Ermanis,a Kevin E. B. Parkes,b Tatiana Agbackb and Jonathan M. Goodman*c Received (in XXX, XXX) Xth XXXXXXXXX 200X, Accepted Xth XXXXXXXXX 200X First published on the web Xth XXXXXXXXX 200X DOI: 10.1039/b00000000x The DP4 parameter, which provides a confidence level for NMR assignment, has been widely used to help assign the structures of many stereochemically-rich molecules.
  • Open Source Molecular Modeling

    Open Source Molecular Modeling

    Accepted Manuscript Title: Open Source Molecular Modeling Author: Somayeh Pirhadi Jocelyn Sunseri David Ryan Koes PII: S1093-3263(16)30118-8 DOI: http://dx.doi.org/doi:10.1016/j.jmgm.2016.07.008 Reference: JMG 6730 To appear in: Journal of Molecular Graphics and Modelling Received date: 4-5-2016 Accepted date: 25-7-2016 Please cite this article as: Somayeh Pirhadi, Jocelyn Sunseri, David Ryan Koes, Open Source Molecular Modeling, <![CDATA[Journal of Molecular Graphics and Modelling]]> (2016), http://dx.doi.org/10.1016/j.jmgm.2016.07.008 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. Open Source Molecular Modeling Somayeh Pirhadia, Jocelyn Sunseria, David Ryan Koesa,∗ aDepartment of Computational and Systems Biology, University of Pittsburgh Abstract The success of molecular modeling and computational chemistry efforts are, by definition, de- pendent on quality software applications. Open source software development provides many advantages to users of modeling applications, not the least of which is that the software is free and completely extendable. In this review we categorize, enumerate, and describe available open source software packages for molecular modeling and computational chemistry. 1. Introduction What is Open Source? Free and open source software (FOSS) is software that is both considered \free software," as defined by the Free Software Foundation (http://fsf.org) and \open source," as defined by the Open Source Initiative (http://opensource.org).
  • Atomistic Modeling of DNA and Protein Structures

    Atomistic Modeling of DNA and Protein Structures

    Atomistic Modeling of DNA and Protein Structures George C. Schatz Introduction In this lecture we introduce one of the most commonly used computational tools for studying protein and DNA structure, the use of empirical potential energy functions, and we describe their use in molecular mechanics and molecular dynamics calculations. As an application, we present results of a molecular mechanics study of the mechanical properties (mechanical pulling) of a small protein. We also describe modeling of DNA hairpin structures. References: Computer Simulation of Liquids, M. P. Allen and D. J. Tildesley, Clarendon Press, Oxford, 1987; Introduction to Modern Statistical Mechanics, D. Chandler, Oxford, New York, 1987; Understanding Molecular Simulations: from Algorithms to Applications, D. Frenkel and B. Smit, Academic Press; San Diego, 1996. Molecular Modeling: Principles and Applications, (2nd ed) Andrew Leach, Prentice Hall, Englewood Cliffs, 2001. What is a force field? This is the potential energy function V(R) that determines the interactions between the atoms in a molecule or solid. The derivatives of this function (or more technically the gradients) give the forces that these atoms exert against each other. Thus Newton’s equations say that F=ma, where the force F is related to V via F=-dV/dR, m is the mass and a is the acceleration. A simple example of a force field is a Lennard-Jones, or 6-12 potential. This describes the interaction of two argon atoms pretty well, and it is often used to describe the interaction of two methane molecules or even, to a lesser degree of rigor, two water molecules. Lennard-Jones potential 7 6 5 12 6 ⎛⎞⎡⎤σσ ⎡⎤ 4 V4=ε − 3 ⎜⎟⎢⎥ ⎢⎥ RR energy/well depth ⎝⎠⎣⎦ ⎣⎦ 2 1 0 -1 0.80 1.00 1.20 1.40 1.60 1.80 2.00 r/σ For a liquid composed of argon, or methane, one can approximate that the overall potential is simply the sum of all pair-wise interactions: ⎛⎞12 6 ⎡ σσij⎤⎡⎤ ij V4=ε⎜⎟⎢ ⎥⎢⎥ − ∑ ij ⎜⎟RR ij ⎝⎠⎣⎢ ij⎦⎣⎦⎥⎢⎥ ij This neglects three-body and higher interactions, which works pretty well for argon, but is a problem for more complex materials.
  • EMERSON Centernewsletter

    EMERSON Centernewsletter

    EMERSON CENTER Newsletter A Publication of the Cherry L. Emerson Center for Scientific Computation, Emory University http://www.emerson.emory.edu Volume 8, September 15, 2001 COMPUTATIONAL CHEMIST JAMES KINDT Inside This Issue . JOINS EMERSON CENTER ✾ New Professor & User to EC (p. 1) ✾ The Emerson Center welcomes Dr. James Kindt, who has recently joined the Department New EC Equipment Purchase of Chemistry as an Assistant Professor and a new subscriber to the Emerson Center. His (p. 1) office and laboratory are located in the Emerson ✾ Emerson Center Fee Change (p. 1) Center on the 5th floor of the Cherry Logan ✾ ECEC Meeting Minutes (p. 1) Emerson Hall. Prof. Kindt’s specialty is ✾ computational chemistry, in particular, modeling EC Visiting Fellowship Announce- of biomolecular systems. He says that his primary ment & Letters from Fellows (p. 2) research goal will be to explore, by computer ✾ Research Reports from Emerson simulation and theory, how lipid composition and Ctr. (p. 3) molecular structure contribute to the organization ✾ Technical Specs on EC Hardware and structure of lipid bilayers. He will be using (p. 4) Emerson Center’s computational and graphic ✾ facilities extensively for his research. He recently Emerson Center Contact Info received a prestigious New Faculty Award from (p. 4) the Camille and Henry Dreyfus Foundation. Prof. Professor James T. Kindt Kindt received his PhD from Yale University and was a postdoctoral fellow at UCLA before joining Emory. Please refer to page 3 of this newsletter for one of Prof. Kindt’s research proposals using the Emerson Center facilities. New Biomolecular Modeling & Visualization Facility at EC In the News In response to popular demand from the biomolecular modeling community at Emory and with grant support from the National Science Foundation, the Emerson Center ◆ Emerson Center Fee recently established its new Biomolecular Modeling and Visualization Facility.
  • Simulations of the Role of Water in the Protein- Folding Mechanism

    Simulations of the Role of Water in the Protein- Folding Mechanism

    Simulations of the role of water in the protein- folding mechanism Young Min Rhee*, Eric J. Sorin*, Guha Jayachandran†, Erik Lindahl‡, and Vijay S. Pande*‡§ Departments of *Chemistry, †Computer Science, and ‡Structural Biology, Stanford University, Stanford, CA 94305 Edited by Harold A. Scheraga, Cornell University, Ithaca, NY, and approved March 15, 2004 (received for review November 26, 2003) There are many unresolved questions regarding the role of water ments. To address this question, it is natural to look to computer in protein folding. Does water merely induce hydrophobic forces, simulations of protein folding by using explicit solvation models or does the discrete nature of water play a structural role in for direct comparisons to both experimental and implicit solva- folding? Are the nonadditive aspects of water important in deter- tion simulation results. mining the folding mechanism? To help to address these questions, However, even with advances in computational methodologies we have performed simulations of the folding of a model protein in recent years, a folding kinetics simulation in atomistic detail (BBA5) in explicit solvent. Starting 10,000 independent trajectories remains a demanding task (4), and there have been only a few from a fully unfolded conformation, we have observed numerous reports of such studies (5, 6). Moreover, these studies have been folding events, making this work a comprehensive study of the confined to the use of implicit and indirect models of solvation kinetics of protein folding starting from the unfolded state and to reduce the required central processing unit (CPU) time and reaching the folded state and with an explicit solvation model and storage requirements (5).
  • Simulations of the Role of Water in the Protein- Folding Mechanism

    Simulations of the Role of Water in the Protein- Folding Mechanism

    Simulations of the role of water in the protein- folding mechanism Young Min Rhee*, Eric J. Sorin*, Guha Jayachandran†, Erik Lindahl‡, and Vijay S. Pande*‡§ Departments of *Chemistry, †Computer Science, and ‡Structural Biology, Stanford University, Stanford, CA 94305 Edited by Harold A. Scheraga, Cornell University, Ithaca, NY, and approved March 15, 2004 (received for review November 26, 2003) There are many unresolved questions regarding the role of water ments. To address this question, it is natural to look to computer in protein folding. Does water merely induce hydrophobic forces, simulations of protein folding by using explicit solvation models or does the discrete nature of water play a structural role in for direct comparisons to both experimental and implicit solva- folding? Are the nonadditive aspects of water important in deter- tion simulation results. mining the folding mechanism? To help to address these questions, However, even with advances in computational methodologies we have performed simulations of the folding of a model protein in recent years, a folding kinetics simulation in atomistic detail (BBA5) in explicit solvent. Starting 10,000 independent trajectories remains a demanding task (4), and there have been only a few from a fully unfolded conformation, we have observed numerous reports of such studies (5, 6). Moreover, these studies have been folding events, making this work a comprehensive study of the confined to the use of implicit and indirect models of solvation kinetics of protein folding starting from the unfolded state and to reduce the required central processing unit (CPU) time and reaching the folded state and with an explicit solvation model and storage requirements (5).
  • Folding@Home: Lessons from Eight Years of Volunteer Distributed Computing

    Folding@Home: Lessons from Eight Years of Volunteer Distributed Computing

    Folding@home: Lessons From Eight Years of Volunteer Distributed Computing Adam L. Beberg1, Daniel L. Ensign2, Guha Jayachandran1, Siraj Khaliq1, Vijay S. Pande2 1Computer Science Dept, Stanford University 2Chemistry Department, Stanford University {beberg@cs., densign@, guha@cs., siraj@cs., pande@}stanford.edu Abstract Volunteer computing is a popular term for distributed computing where the work is done on Accurate simulation of biophysical processes computers recruited from people on the Internet, not on requires vast computing resources. Folding@home is a machines under the control and management of the distributed computing system first released in 2000 to project. This approach can provide the necessary provide such resources needed to simulate protein computational power for bridging the time scale gap, folding and other biomolecular phenomena. Now but it introduces many new problems with respect to operating in the range of 5 PetaFLOPS sustained, it security, reliability, and coordination. provides more computing power than can typically be The architecture of Stanford University’s gathered and operated locally due to cost, physical Folding@home project [2] [3] additionally involves a space, and electrical/cooling load. This paper combination of load balancing, result feedback, and describes the architecture and operation of redundancy that are only required on volunteer systems Folding@home, along with some lessons learned over of this scale. the lifetime of the project. 2. Related work 1. Introduction The Distributed Computing System (DCS) [4] [5] (1970-77) represents the first work where computers Protein folding is a fundamental process of had the capabilities to do distributed computing on a biology in which proteins self-assemble or "fold." network scale, and tackled all the main issues we Many diseases (such as Alzheimer's, Parkinson's, and continue to face with distributed computing today.