University of Southampton Research Repository Eprints Soton

Total Page:16

File Type:pdf, Size:1020Kb

University of Southampton Research Repository Eprints Soton University of Southampton Research Repository ePrints Soton Copyright © and Moral Rights for this thesis are retained by the author and/or other copyright owners. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This thesis cannot be reproduced or quoted extensively from without first obtaining permission in writing from the copyright holder/s. The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the copyright holders. When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given e.g. AUTHOR (year of submission) "Full thesis title", University of Southampton, name of the University School or Department, PhD Thesis, pagination http://eprints.soton.ac.uk University of Southampton Faculty of Natural and Environmental Sciences Using linear-scaling DFT for biomolecular simulations Christopher James Pittock Thesis for the degree of Doctor of Philosophy September 2013 university of southampton ABSTRACT faculty of natural and environmental sciences Computational Chemistry Doctor of Philosophy Using linear-scaling DFT for biomolecular simulations by Christopher James Pittock In the drug discovery process, there are multiple factors that make a successful can- didate other than whether it antagonises a chosen active site, or performs allosteric regulation. Each test candidate is profiled by its absorption into the bloodstream, distribution throughout the organism, its products of metabolism, method of excre- tion, and overall toxicity; summarised as ADMET. There are currently methods to calculate and predict such properties, but the majority of these involve rule-based, empirical approaches that run the risk of lacking accuracy as one's search of chemical space ventures into the more novel. The lack of experimental data on organometallic systems also means that some of these methods refuse to predict properties on them outright, losing the opportunity to exploit this relatively untapped area that holds promise for new antibacterial and antineoplastic pharmaceutical compounds. Using the more transferable and definitive quantum mechanical (QM) approach to drug dis- covery is desirable, but the computational cost of conventional Hartree-Fock (HF) and Density Functional Theory (DFT) calculations are too high. Using the linear-scaling DFT program, onetep, we aim to exploit the benefits of DFT in calculations with much larger fragments of, and in some cases entire biomolecules, in order to demon- strate calculations which could ultimately be used in developing more accurate methods of profiling drug candidates, with a computational cost that albeit still high, is now feasible with the provision of modern supercomputers. In this thesis, we first use linear-scaling DFT methods to address the lack of elec- tron polarisation and charge transfer effects in energy calculations using a molecular mechanics forcefield. Multiple DFT calculations are performed on molecular dynam- ics (MD) snapshots of small molecules in a waterbox, with the aim of computing a MM!QM correction term, which can be applied to a forcefield binding free energy approach (such as thermodynamic integration) which will process a far greater number of MD snapshots. As a result, one will obtain the precision from processing very large numbers of MD snapshots of biomolecular systems, but the accuracy of QM. To im- prove efficiency of the QM phase of the overall method, we use electrostatic embedding i to model the regions of the waterbox that are far from the solute, yet are still important to include. As this is a relatively new module in onetep, we present validation data prior to its use in the main work. Secondly, we validate different methods of calculating the pKa of a wide variety of molecules: from small, organic compounds, to the organometallic cisplatin, with the ultimate goal being of such calculations to eventually address questions such as, as- suming oral intake, where in the gastrointestinal tract will a drug molecule be absorbed into the bloodstream, and how much of the original dose will be absorbed. These calcu- lations are then scaled up significantly to examine the potential of using linear-scaling DFT to calculate the pKa of specific residues in proteins. This is performed with a 305-atom tryptophan cage, the 814-atom Ovomucoid Silver Pheasant Third Domain (OMSVP3) and a 2346-atom section of the T99A/M102Q T4-lysozyme mutant. We also highlight the challenges in calculating protein pKa. Finally, we study the hydrogen-abstraction reaction between cyclohexene and cyto- chrome P450cam, through onetep single-point energy calculations of a 10-snapshot adiabatic reaction profile generated by the Mulholland Group (University of Bristol). Following this, the LST and QST methods of determining the transition state (avail- able through onetep) are used, with the aims of determining the importance of the protein surrounding the active site in regards to the activation energy and structural geometry of the calculated transition state. The LST and QST methods are also valid- ated, through modelling of the SN2 reaction between fluoride and chloromethane. The aim of this part of our work is to eventually assist in developing a metabolism (and toxicity) model of the different isoforms of cytochrome P450. Overall, this thesis aims to highlight not only the capability of linear-scaling DFT in becoming an important part of biomolecular simulation, but also the challenges that one will face upon scaling up calculations that were previously simple to perform, based on the small size of the system being modelled. ii Contents Abstract . .i Declaration of authorship . xvii Acknowledgements . xviii Glossary . xx 1 Computational Theory 1 1.1 Molecular Mechanics . .1 1.1.1 Origin of AMBER forcefield parameters . 11 1.1.2 GAFF . 14 1.1.3 AMBER energy using the sander module . 16 1.2 Quantum Mechanics . 18 1.2.1 The wavefunction and the Schr¨odingerEquation . 18 1.2.2 Born-Oppenheimer Approximation . 20 1.2.3 Many-electron wavefunctions . 22 1.3 Density Functional Theory . 26 1.3.1 Hohenberg-Kohn Proof . 28 1.3.2 Kohn-Sham DFT . 30 1.3.3 Exchange-correlation functionals . 32 1.4 Linear-scaling DFT methods . 36 1.4.1 Density matrix reformulation of Kohn-Sham DFT . 36 1.4.2 Principle of linear-scaling DFT approaches . 36 1.4.3 ONETEP .............................. 39 1.5 Basis sets . 42 1.5.1 Gaussian Type Orbital (GTO) basis sets . 42 1.5.2 Basis Set Superposition Error . 45 1.5.3 Plane wave basis set . 48 1.5.4 Periodic cardinal sine (psinc) basis set (used in ONETEP)... 51 1.6 Solvent models . 54 1.6.1 Explicit - MM . 54 1.6.2 Implicit - MM . 56 1.6.3 Implicit - QM . 60 1.7 Geometry Optimisation . 64 1.7.1 Steepest Descent . 65 iii 1.7.2 Conjugate gradients . 66 1.7.3 Newton-Raphson and BFGS . 68 1.8 Transition state . 71 1.8.1 Adiabatic (Drag) method . 72 1.8.2 Nudged Elastic Band (NEB) . 73 1.8.3 LST/QST optimisation . 75 2 Use of electrostatic embedding for QM-corrected water-solute inter- actions 77 2.1 Introduction . 77 2.1.1 Free energy of binding methods . 77 2.1.2 QM/MM systems . 81 2.1.3 Electrostatic embedding module within ONETEP . 83 2.1.4 Aims and objectives . 86 2.2 Original calculation parameters . 86 2.2.1 Molecular Mechanics/Molecular Dynamics . 86 2.2.2 ONETEP . 88 2.3 Validation of electrostatic embedding module . 89 2.3.1 Charge spilling . 89 2.3.2 Calibration of electrostatic embedding . 91 2.4 Calculation of energies of solvation . 98 2.4.1 Calculation parameters . 99 2.4.2 Results . 101 2.5 Conclusion . 104 3 Calculating pKa using linear-scaling DFT and an implicit solvation model 107 3.1 Introduction . 107 3.1.1 The importance of pKa ...................... 107 3.1.2 Overview of current pKa calculation methods . 108 3.1.3 Aims and objectives . 111 3.2 Theoretical calculation of pKa ...................... 112 3.2.1 Empirical . 112 3.2.2 Ab initio . 115 3.3 Original calculation parameters . 119 iv 3.3.1 Single-point energy . 121 3.3.2 Geometry optimisation . 121 3.3.3 Molecular Dynamics (MD) . 122 3.3.4 Implicit solvation . 122 3.3.5 Phonons . 123 3.4 Test set of 36 small drug-like molecules . 123 3.4.1 Na¨ıve and proton-exchange method . 125 3.4.2 Cluster-continuum method . 127 3.4.3 Investigating probable sources of error . 129 3.5 Calibration through linear regression . 135 3.6 Increasing the scope of pKa calculation: Small and medium-sized molecules137 3.6.1 More challenging small molecules . 137 3.6.2 Deprotonation of L-histidine . 141 3.6.3 Hydrolysis products of cisplatin . 143 3.7 Increasing the scope of pKa calculation: Protein pKa .......... 147 3.7.1 Tryptophan cage . 150 3.7.2 Ovomucoid Silver Pheasant Third Domain (OMSVP3) . 152 3.7.3 Effects of truncation of lysozyme on pKa of Glu108 and Arg145 153 3.8 Conclusions . 155 3.8.1 Polyprotic pKa . 158 Tables . 164 4 Transition state modelling of the CYP101 enzyme, using large QM regions 171 4.1 Introduction . 171 4.1.1 The significance of CYP . 171 4.1.2 Transition state calculations of CYP . 175 4.1.3 Aims and objectives . 177 4.2 Original calculation parameters . 180 4.3 Validation work . 181 4.3.1 Calculation method for single-point energy of Compound I . 181 − 4.3.2 LST/QST transition state in SN2 reaction between CH3Cl and F 184 4.4 Calculating the energy profile of the CYP101-cyclohexene hydrogen- abstraction process . 188 v 4.5 Transition state search for the CYP101-cyclohexene hydrogen-abstraction process .
Recommended publications
  • GPAW, Gpus, and LUMI
    GPAW, GPUs, and LUMI Martti Louhivuori, CSC - IT Center for Science Jussi Enkovaara GPAW 2021: Users and Developers Meeting, 2021-06-01 Outline LUMI supercomputer Brief history of GPAW with GPUs GPUs and DFT Current status Roadmap LUMI - EuroHPC system of the North Pre-exascale system with AMD CPUs and GPUs ~ 550 Pflop/s performance Half of the resources dedicated to consortium members Programming for LUMI Finland, Belgium, Czechia, MPI between nodes / GPUs Denmark, Estonia, Iceland, HIP and OpenMP for GPUs Norway, Poland, Sweden, and how to use Python with AMD Switzerland GPUs? https://www.lumi-supercomputer.eu GPAW and GPUs: history (1/2) Early proof-of-concept implementation for NVIDIA GPUs in 2012 ground state DFT and real-time TD-DFT with finite-difference basis separate version for RPA with plane-waves Hakala et al. in "Electronic Structure Calculations on Graphics Processing Units", Wiley (2016), https://doi.org/10.1002/9781118670712 PyCUDA, cuBLAS, cuFFT, custom CUDA kernels Promising performance with factor of 4-8 speedup in best cases (CPU node vs. GPU node) GPAW and GPUs: history (2/2) Code base diverged from the main branch quite a bit proof-of-concept implementation had lots of quick and dirty hacks fixes and features were pulled from other branches and patches no proper unit tests for GPU functionality active development stopped soon after publications Before development re-started, code didn't even work anymore on modern GPUs without applying a few small patches Lesson learned: try to always get new functionality to the
    [Show full text]
  • Free and Open Source Software for Computational Chemistry Education
    Free and Open Source Software for Computational Chemistry Education Susi Lehtola∗,y and Antti J. Karttunenz yMolecular Sciences Software Institute, Blacksburg, Virginia 24061, United States zDepartment of Chemistry and Materials Science, Aalto University, Espoo, Finland E-mail: [email protected].fi Abstract Long in the making, computational chemistry for the masses [J. Chem. Educ. 1996, 73, 104] is finally here. We point out the existence of a variety of free and open source software (FOSS) packages for computational chemistry that offer a wide range of functionality all the way from approximate semiempirical calculations with tight- binding density functional theory to sophisticated ab initio wave function methods such as coupled-cluster theory, both for molecular and for solid-state systems. By their very definition, FOSS packages allow usage for whatever purpose by anyone, meaning they can also be used in industrial applications without limitation. Also, FOSS software has no limitations to redistribution in source or binary form, allowing their easy distribution and installation by third parties. Many FOSS scientific software packages are available as part of popular Linux distributions, and other package managers such as pip and conda. Combined with the remarkable increase in the power of personal devices—which rival that of the fastest supercomputers in the world of the 1990s—a decentralized model for teaching computational chemistry is now possible, enabling students to perform reasonable modeling on their own computing devices, in the bring your own device 1 (BYOD) scheme. In addition to the programs’ use for various applications, open access to the programs’ source code also enables comprehensive teaching strategies, as actual algorithms’ implementations can be used in teaching.
    [Show full text]
  • D6.1 Report on the Deployment of the Max Demonstrators and Feedback to WP1-5
    Ref. Ares(2020)2820381 - 31/05/2020 HORIZON2020 European Centre of Excellence ​ Deliverable D6.1 Report on the deployment of the MaX Demonstrators and feedback to WP1-5 D6.1 Report on the deployment of the MaX Demonstrators and feedback to WP1-5 Pablo Ordejón, Uliana Alekseeva, Stefano Baroni, Riccardo Bertossa, Miki Bonacci, Pietro Bonfà, Claudia Cardoso, Carlo Cavazzoni, Vladimir Dikan, Stefano de Gironcoli, Andrea Ferretti, Alberto García, Luigi Genovese, Federico Grasselli, Anton Kozhevnikov, Deborah Prezzi, Davide Sangalli, Joost VandeVondele, Daniele Varsano, Daniel Wortmann Due date of deliverable: 31/05/2020 Actual submission date: 31/05/2020 Final version: 31/05/2020 Lead beneficiary: ICN2 (participant number 3) Dissemination level: PU - Public www.max-centre.eu 1 HORIZON2020 European Centre of Excellence ​ Deliverable D6.1 Report on the deployment of the MaX Demonstrators and feedback to WP1-5 Document information Project acronym: MaX Project full title: Materials Design at the Exascale Research Action Project type: European Centre of Excellence in materials modelling, simulations and design EC Grant agreement no.: 824143 Project starting / end date: 01/12/2018 (month 1) / 30/11/2021 (month 36) Website: www.max-centre.eu Deliverable No.: D6.1 Authors: P. Ordejón, U. Alekseeva, S. Baroni, R. Bertossa, M. ​ Bonacci, P. Bonfà, C. Cardoso, C. Cavazzoni, V. Dikan, S. de Gironcoli, A. Ferretti, A. García, L. Genovese, F. Grasselli, A. Kozhevnikov, D. Prezzi, D. Sangalli, J. VandeVondele, D. Varsano, D. Wortmann To be cited as: Ordejón, et al., (2020): Report on the deployment of the MaX Demonstrators and feedback to WP1-5. Deliverable D6.1 of the H2020 project MaX (final version as of 31/05/2020).
    [Show full text]
  • Introducing ONETEP: Linear-Scaling Density Functional Simulations on Parallel Computers Chris-Kriton Skylaris,A) Peter D
    THE JOURNAL OF CHEMICAL PHYSICS 122, 084119 ͑2005͒ Introducing ONETEP: Linear-scaling density functional simulations on parallel computers Chris-Kriton Skylaris,a) Peter D. Haynes, Arash A. Mostofi, and Mike C. Payne Theory of Condensed Matter, Cavendish Laboratory, Madingley Road, Cambridge CB3 0HE, United Kingdom ͑Received 29 September 2004; accepted 4 November 2004; published online 23 February 2005͒ We present ONETEP ͑order-N electronic total energy package͒, a density functional program for parallel computers whose computational cost scales linearly with the number of atoms and the number of processors. ONETEP is based on our reformulation of the plane wave pseudopotential method which exploits the electronic localization that is inherent in systems with a nonvanishing band gap. We summarize the theoretical developments that enable the direct optimization of strictly localized quantities expressed in terms of a delocalized plane wave basis. These same localized quantities lead us to a physical way of dividing the computational effort among many processors to allow calculations to be performed efficiently on parallel supercomputers. We show with examples that ONETEP achieves excellent speedups with increasing numbers of processors and confirm that the time taken by ONETEP as a function of increasing number of atoms for a given number of processors is indeed linear. What distinguishes our approach is that the localization is achieved in a controlled and mathematically consistent manner so that ONETEP obtains the same accuracy as conventional cubic-scaling plane wave approaches and offers fast and stable convergence. We expect that calculations with ONETEP have the potential to provide quantitative theoretical predictions for problems involving thousands of atoms such as those often encountered in nanoscience and biophysics.
    [Show full text]
  • Natural Bond Orbital Analysis in the ONETEP Code: Applications to Large Protein Systems Louis P
    WWW.C-CHEM.ORG FULL PAPER Natural Bond Orbital Analysis in the ONETEP Code: Applications to Large Protein Systems Louis P. Lee,*[a] Daniel J. Cole,[a] Mike C. Payne,[a] and Chris-Kriton Skylaris[b] First principles electronic structure calculations are typically Generalized Wannier Functions of ONETEP to natural atomic performed in terms of molecular orbitals (or bands), providing a orbitals, NBO analysis can be performed within a localized straightforward theoretical avenue for approximations of region in such a way that ensures the results are identical to an increasing sophistication, but do not usually provide any analysis on the full system. We demonstrate the capabilities of qualitative chemical information about the system. We can this approach by performing illustrative studies of large derive such information via post-processing using natural bond proteins—namely, investigating changes in charge transfer orbital (NBO) analysis, which produces a chemical picture of between the heme group of myoglobin and its ligands with bonding in terms of localized Lewis-type bond and lone pair increasing system size and between a protein and its explicit orbitals that we can use to understand molecular structure and solvent, estimating the contribution of electronic delocalization interactions. We present NBO analysis of large-scale calculations to the stabilization of hydrogen bonds in the binding pocket of with the ONETEP linear-scaling density functional theory package, a drug-receptor complex, and observing, in situ, the n ! p* which we have interfaced with the NBO 5 analysis program. In hyperconjugative interactions between carbonyl groups that ONETEP calculations involving thousands of atoms, one is typically stabilize protein backbones.
    [Show full text]
  • Metalloboranes from First-Principles Calculations: a Candidate for High-Density Hydrogen Storage
    Metalloboranes from first-principles calculations: A candidate for high-density hydrogen storage A. R. Akbarzadeh, D. Vrinceanu, and C.J. Tymczak Department of Physics, Texas Southern University, Houston, Texas 77004, USA Abstract Using first principles calculations, we show the high hydrogen storage capacity of a new class of compounds, metalloboranes. Metalloboranes are transition metal (TM) and borane compounds that obey a novel-bonding scheme. We have found that the transition metal atoms can bind up to 10 H2-molecules with an average binding energy of 30 kJ/mole of H2, which lies favorably within the reversible adsorption range. Among the first row TM atoms, Sc and Ti are found to be the optimum in maximizing the H2 storage on the metalloborane cluster. Additionally, being ionically bonded to the borane molecule, the TMs do not suffer from the aggregation problem, which has been the biggest hurdle for the success of TM- decorated graphitic materials for hydrogen storage. Furthermore, since the borane 6-atom ring has identical bonding properties as carbon rings it is possible to link the metalloboranes into metal organic frameworks (MOF’s), which are thus able to adsorb hydrogen via Kubas interaction as well as the well-known van der Waals interaction. Finally, we construct a simple Monte-Catlo algorithm for Hydrogen uptake and show that Titanium metalloboranes in a MOF5 structure can absorb up to 11.5% hydrogen per weight at 100 bar of pressure. I. Introduction Due to the geographical distribution and supply limitation of fossil fuel resources and the increasing perceived negative impact of carbon dioxide emission in the environment, it is advantageous for the societies to take action in the search and implementation of alternative energy systems.
    [Show full text]
  • PDF Hosted at the Radboud Repository of the Radboud University Nijmegen
    PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a publisher's version. For additional information about this publication click this link. http://hdl.handle.net/2066/19078 Please be advised that this information was generated on 2021-09-23 and may be subject to change. Computational Chemistry Metho ds Applications to Racemate Resolution and Radical Cation Chemistry ISBN Computational Chemistry Metho ds Applications to Racemate Resolution and Radical Cation Chemistry een wetenschapp elijkeproeve op het gebied van de Natuurwetenschapp en Wiskunde en Informatica Pro efschrift ter verkrijging van de graad van do ctor aan de KatholiekeUniversiteit Nijmegen volgens b esluit van het College van Decanen in het op enbaar te verdedigen op dinsdag januari des namiddags om uur precies do or Gijsb ert Schaftenaar geb oren op augustus te Harderwijk Promotores Prof dr ir A van der Avoird Prof dr E Vlieg Copromotor Prof dr RJ Meier Leden manuscriptcommissie Prof dr G Vriend Prof dr RA de Gro ot Dr ir PES Wormer The research rep orted in this thesis was nancially supp orted by the Dutch Or ganization for the Advancement of Science NWO and DSM Contents Preface Intro duction Intro duction Chirality Metho ds for obtaining pure enantiomers Racemate Resolution via diastereomeric salt formation Rationalization of diastereomeric salt formation Computational metho ds for mo deling the lattice energy Molecular Mechanics Quantum Chemical
    [Show full text]
  • Dmol Guide to Select a Dmol3 Task 1
    DMOL3 GUIDE MATERIALS STUDIO 8.0 Copyright Notice ©2014 Dassault Systèmes. All rights reserved. 3DEXPERIENCE, the Compass icon and the 3DS logo, CATIA, SOLIDWORKS, ENOVIA, DELMIA, SIMULIA, GEOVIA, EXALEAD, 3D VIA, BIOVIA and NETVIBES are commercial trademarks or registered trademarks of Dassault Systèmes or its subsidiaries in the U.S. and/or other countries. All other trademarks are owned by their respective owners. Use of any Dassault Systèmes or its subsidiaries trademarks is subject to their express written approval. Acknowledgments and References To print photographs or files of computational results (figures and/or data) obtained using BIOVIA software, acknowledge the source in an appropriate format. For example: "Computational results obtained using software programs from Dassault Systèmes Biovia Corp.. The ab initio calculations were performed with the DMol3 program, and graphical displays generated with Materials Studio." BIOVIA may grant permission to republish or reprint its copyrighted materials. Requests should be submitted to BIOVIA Support, either through electronic mail to [email protected], or in writing to: BIOVIA Support 5005 Wateridge Vista Drive, San Diego, CA 92121 USA Contents DMol3 1 Setting up a molecular dynamics calculation20 Introduction 1 Choosing an ensemble 21 Further Information 1 Defining the time step 21 Tasks in DMol3 2 Defining the thermostat control 21 Energy 3 Constraints during dynamics 21 Setting up the calculation 3 Setting up a transition state calculation 22 Dynamics 4 Which method to use?
    [Show full text]
  • Density Functional Theory (DFT)
    Herramientas mecano-cuánticas basadas en DFT para el estudio de moléculas y materiales en Materials Studio 7.0 Javier Ramos Biophysics of Macromolecular Systems group (BIOPHYM) Departamento de Física Macromolecular Instituto de Estructura de la Materia – CSIC [email protected] Webinar, 26 de Junio 2014 Anteriores webinars Como conseguir los videos y las presentaciones de anteriores webminars: Linkedin: Grupo de Química Computacional http://www.linkedin.com/groups/Química-computacional-7487634 Índice Density Functional Theory (DFT) The Jacob’s ladder DFT modules in Maretials Studio DMOL3, CASTEP and ONETEP XC functionals Basis functions Interfaces in Materials Studio Tasks Properties Example: n-butane conformations Density Functional Theory (DFT) DFT is built around the premise that the energy of an electronic system can be defined in terms of its electron probability density (ρ). (Hohenberg-Kohn Theorem) E 0 [ 0 ] Te [ 0 ] E ne [ 0 ] E ee [ 0 ] (easy) Kinetic Energy for ????? noninteracting (r )v (r ) dr electrons(easy) 1 E[]()()[]1 r r d r d r E e e2 1 2 1 2 X C r12 Classic Term(Coulomb) Non-classic Kohn-Sham orbitals Exchange & By minimizing the total energy functional applying the variational principle it is Correlation possible to get the SCF equations (Kohn-Sham) The Jacob’s Ladder Accurate form of XC potential Meta GGA Empirical (Fitting to Non-Empirical Generalized Gradient Approx. atomic properties) (physics rules) Local Density Approximation DFT modules in Materials Studio DMol3: Combine computational speed with the accuracy of quantum mechanical methods to predict materials properties reliably and quickly CASTEP: CASTEP offers simulation capabilities not found elsewhere, such as accurate prediction of phonon spectra, dielectric constants, and optical properties.
    [Show full text]
  • Compact Orbitals Enable Low-Cost Linear-Scaling Ab Initio Molecular Dynamics for Weakly-Interacting Systems Hayden Scheiber,1, A) Yifei Shi,1 and Rustam Z
    Compact orbitals enable low-cost linear-scaling ab initio molecular dynamics for weakly-interacting systems Hayden Scheiber,1, a) Yifei Shi,1 and Rustam Z. Khaliullin1, b) Department of Chemistry, McGill University, 801 Sherbrooke St. West, Montreal, QC H3A 0B8, Canada Today, ab initio molecular dynamics (AIMD) relies on the locality of one-electron density matrices to achieve linear growth of computation time with systems size, crucial in large-scale simulations. While Kohn-Sham orbitals strictly localized within predefined radii can offer substantial computational advantages over density matrices, such compact orbitals are not used in AIMD because a compact representation of the electronic ground state is difficult to find. Here, a robust method for maintaining compact orbitals close to the ground state is coupled with a modified Langevin integrator to produce stable nuclear dynamics for molecular and ionic systems. This eliminates a density matrix optimization and enables first orbital-only linear-scaling AIMD. An application to liquid water demonstrates that low computational overhead of the new method makes it ideal for routine medium-scale simulations while its linear-scaling complexity allows to extend first- principle studies of molecular systems to completely new physical phenomena on previously inaccessible length scales. Since the unification of molecular dynamics and den- LS methods restrict their use in dynamical simulations sity functional theory (DFT)1, ab initio molecular dy- to very short time scales, systems of low dimensions, namics (AIMD) has become an important tool to study and low-quality minimal basis sets6,18–20. On typical processes in molecules and materials. Unfortunately, the length and time scales required in practical and accurate computational cost of the conventional Kohn-Sham (KS) AIMD simulations, LS DFT still cannot compete with DFT grows cubically with the number of atoms, which the straightforward low-cost cubically-scaling KS DFT.
    [Show full text]
  • Density Functional Theory and Nuclear Quantum Effects
    1 Grassmann manifold, gauge, and quantum chemistry Lin Lin Department of Mathematics, UC Berkeley; Lawrence Berkeley National Laboratory Flatiron Institute, April 2019 2 Stiefel manifold • × > ℂ • Stiefel manifold (1935): orthogonal vectors in × ℂ , = | = ∗ ∈ ℂ 3 Grassmann manifold • Grassmann manifold (1848): set of dimensional subspace in ℂ , = , / : dimensional unitary group (i.e. the set of unitary matrices in × ) ℂ • Any point in , can be viewed as a representation of a point in , 4 Example (in optimization) min ( ) . = ∗ • = , . invariant to the choice of basis Optimization on a Grassmann manifold. ∈ ⇒ • The representation is called the gauge in quantum physics / chemistry. Gauge invariant. 5 Example (in optimization) min ( ) . = ∗ • Otherwise, if ( ) is gauge dependent Optimization on a Stiefel manifold. ⇒ • [Edelman, Arias, Smith, SIMAX 1998] many works following 6 This talk: Quantum chemistry • Time-dependent density functional theory (TDDFT) • Kohn-Sham density functional theory (KSDFT) • Localization. • Gauge choice is the key • Used in community software packages: Quantum ESPRESSO, Wannier90, Octopus, PWDFT etc 7 Time-dependent density functional theory Parallel transport gauge 8 Time-dependent density functional theory [Runge-Gross, PRL 1984] (spin omitted, : number of electrons) , = , , , = 1, … , , , = ( ,) ( , ) ′ ∗ ′ � Hamiltonian =1 1 , = + ( ) + [ ] 2 − Δ 9 Density matrix • Key quantity throughout the talk × • (t) = , … , . : number of discretized grid points to represent each Ψ 1 ∈ ℂ = ∗ Ψ Ψ = 10 Gauge invariance • Unitary rotation = , = ∗ = Φ Ψ= ( ) = ∗ ∗ ∗ ∗ ΦΦ Ψ Ψ ΨΨ • Propagation of the density matrix, von Neumann equation (quantum Liouville equation) = , = , ( ) , − 11 Gauge choice affects the time step Extreme case (boring Hamiltonian) ≡ 0 Initial condition 0 = 0 0 Exact solution = (0) Small time step − = (0) (Formally) arbitrarily large time step 12 P v.s.
    [Show full text]
  • EPSRC Service Level Agreement with STFC for Computational Science Support
    CoSeC Computational Science Centre for Research Communities EPSRC Service Level Agreement with STFC for Computational Science Support FY 2016/17 Report and Update on FY 2017/18 Work Plans This document contains the 2016/17 plans, 2016/17 summary reports, and 2017/18 plans for the programme in support of CCP and HEC communities delivered by STFC and funded by EPSRC through a Service Level Agreement. Notes in blue are in-year updates on progress to the tasks included in the 2016/17 plans. Text highlighted in yellow shows changes to the draft 2017/18 plans that we submitted in January 2017. Contents CCP5 – Computer Simulation of Condensed Phases .......................................................................... 4 CCP5 – 2016 / 17 Plans (1 April 2016 – 31 March 2017) ...................................................... 4 CCP5 – Summary Report (1 April 2016 – 31 March 2017) .................................................... 7 CCP5 –2017 / 18 Plans (1 April 2017 – 31 March 2018) ....................................................... 8 CCP9 – Electronic Structure of Solids .................................................................................................. 9 CCP9 – 2016 / 17 Plans (1 April 2016 – 31 March 2017) ...................................................... 9 CCP9 – Summary Report (1 April 2016 – 31 March 2017) .................................................. 11 CCP9 – 2017 / 18 Plans (1 April 2017 – 31 March 2018) .................................................... 12 CCP-mag – Computational Multiscale
    [Show full text]