)ORULGD6WDWH8QLYHUVLW\/LEUDULHV
2020 Machine Learned Force Fields Cole Nathaniel Sheridan
Follow this and additional works at DigiNole: FSU's Digital Repository. For more information, please contact [email protected] dッ」オsゥァョ@eョカ・ャッー・@idZ@dceeXVYRMXPQcMTTWTMadVWMSfbdWVTTPTec
THE FLORIDA STATE UNIVERSITY COLLEGE OF ARTS & SCIENCES
MACHINE LEARNED FORCE FIELDS
By
COLE SHERIDAN
A Thesis submitted to the Department of Scientific Computing in partial fulfillment of the requirements for graduation with Honors in the Major
Degree Awarded: Fall, 2020
dッ」オsゥァョ@eョカ・ャッー・@idZ@dceeXVYRMXPQcMTTWTMadVWMSfbdWVTTPTec
The members of the Defense Committee approve the thesis of Cole Sheridan defended on November 20, 2020.
Dr. Chen Huang Thesis Director
Dr. Wei Yang Outside Committee Member
Dr. Albert DePrince Committee Member
Dr. Xiaoqiang Wang Committee Member
1
dッ」オsゥァョ@eョカ・ャッー・@idZ@dceeXVYRMXPQcMTTWTMadVWMSfbdWVTTPTec
Acknowledgements
I would like to express my sincerest thanks to Dr. Chen Huang, for his continuous support and encouragement, and his exceptional leadership throughout the course of this research. Without his assistance, this project may have never been realized. It has been an honor and a pleasure to do research with you throughout my undergraduate career. I would also like to express my gratitude to my defense committee, Dr. Chen Huang, Dr. Xiaoqiang Wang, Dr. Wei Yang, and Dr. Albert De Prince. While the circumstances surrounding the time of this prospectus may have caused difficulties in meeting, the continued support throughout this endeavor is greatly appreciated. Furthermore, a sincere thanks to Florida State University (FSU) and the FSU Department of Scientific Computing for providing the foundational education and resources required to complete this research project. Finally, a big thank you to my family and friends, who helped provide emotional support throughout these trying times.
i
dッ」オsゥァョ@eョカ・ャッー・@idZ@dceeXVYRMXPQcMTTWTMadVWMSfbdWVTTPTec
Table of Contents
List of Figures ……………………………………..……………………………………………. iii List of Tables ……………………………………..……………………………………………. iv Abstract ………………………………………………………………………………………… v Chapter 1. Introduction to Density Functional Theory 1 1.1 Density Functional Theory ……………………………………………………………. 1 1.2 Gaussian Basis and Force Calculation ……………………………………………….… 2 1.3 Force Calculation …………………..………………………………………...………..… 3 1.4 Introduction to NWChem Program ..…………………………………………………..… 4 Chapter 2. Introduction to Molecular Dynamics 7 2.1 Basics of Molecular Dynamics Simulation ………………………………………….…. 7 2.2 Force Fields ……………………………………………………………….…………. 9 Chapter 3. Introduction to Neural Networks 11 3.1 Background ………………….……………………………………………………….. 11 3.2 ANN Force Fields ……………………………...……………………………………... 14 Chapter 4. Machine-Learned Force Fields for C-X Molecules and Radicals 17
4.1 Neural Networks for Describing Heterogeneous Molecular Systems ...... ………… 17 4.2 Training Set …………………………………………………………………………… 18 4.3 Training the Neural Network …………………………………………………………… 22 Chapter 5. Machine-Learning for A Simple Three-Dimensional Molecule 27 5.1 Neural Networks for Describing Three-Dimensional Molecules……………...... ….... 27 5.2 Application to Cyanopolyyne …………………………...... ……………………...... 28 5.3 Results And Discussion ………………….……………...... ………………………... 30 Chapter 6 Conclusion 33
References 34
ii
dッ」オsゥァョ@eョカ・ャッー・@idZ@dceeXVYRMXPQcMTTWTMadVWMSfbdWVTTPTec
List of Figures
Figure 1.1 NWChem input file …………………………………………………….……………. 4 Figure 2.1 Flowchart of processes for performing molecular dynamics simulations …………. 7 Figure 3.1 Python code for a neural network using matrices ………….…………………..….... 14 Figure 4.1 ANN for diatomic molecule ……………………………………………………….. 17 Figure 4.2 Python code for reproducing NWChem input files ………………………………… 19
Figure 4.3 Job submission script for C2 …….……………………………….………………… 20 Figure 4.4 Graph of DFT energy vs bond length …………………………….………………… 21 Figure 4.5 Graph of partial charge vs bond length …………………………………………… 21 Figure 4.6 Outputs of the original neural network for DFT energies ……………………… 23 Figure 4.7 Outputs of the original neural network for partial charges ……………………… 23 Figure 4.8 Error from different initialization seeds for ANNs for two hidden layers ………..… 24 Figure 4.9 Error from different initialization seeds for ANNs for three hidden layers ……..… 25 Figure 4.10 Errors in calculation from different ANN sizes ………………………………… 26 Figure 5.1 Structure of a cyanopolyyne molecule …………………………….……………… 28 Figure 5.2 Results of training in with three MD points ………………………………………… 31 Figure 5.3 Results of training in with 30 MD points ………………………………………… 31 Figure 5.4 Results of training in with 110 MD points ………………………………………… 32
iii
dッ」オsゥァョ@eョカ・ャッー・@idZ@dceeXVYRMXPQcMTTWTMadVWMSfbdWVTTPTec
List of Tables
Figure 5.1 Input of cyanopolyyne molecule at the time step 90 ………..………….…………… 28
iv
dッ」オsゥァョ@eョカ・ャッー・@idZ@dceeXVYRMXPQcMTTWTMadVWMSfbdWVTTPTec
Abstract
Studying molecules through their potential energy surfaces using molecular dynamics have greatly advanced our understanding of chemical and physical processes in many exciting systems. To perform these simulations, we need atomic forces. Accurate atomic forces can be calculated using ab initio methods (such as the density functional theory). However, simulations based on ab initio methods are computationally costly. To reduce the cost, force fields that describe the interaction between atoms are often used. The accuracy of the simulations is then determined by the accuracy of force fields. Machine learning has become a promising method to generate force fields with an accuracy close to ab initio methods. Especially, artificial neural networks (ANN) have been shown to be an efficient and adaptable method for generating accurate force fields. One prominent challenge for ANN force fields is to represent different chemical elements in heterogeneous systems. In this thesis, we develop a new ANN force field that is capable of recognizing different chemical elements by using atomic numbers as additional descriptors. Our new method is demonstrated by generating ANN force fields for C-X systems in a one-dimensional space, where X stands for H, O, N, and C atoms. Afterwards, ANN methods are used to calculate the molecular force fiend of a cyanopolyyne molecule in three-dimensional space based on the partial energy of each atom in the molecule.
v
dッ」オsゥァョ@eョカ・ャッー・@idZ@dceeXVYRMXPQcMTTWTMadVWMSfbdWVTTPTec
Chapter 1. Introduction to Density Functional Theory
1.1 Density Functional Theory To solve quantum systems, we often adopt the Born-Oppenheimer approximation, meaning that the nuclei and electrons of molecules in the system are treated as separate cases due to the difference in mass between the nuclei and the electrons [1]. The system’s time-independent wave function can then be solved as
which gives