GPU: Between Performance, Correctness and Sustainability
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Wannier90 As a Community Code: New
IOP Journal of Physics: Condensed Matter Journal of Physics: Condensed Matter J. Phys.: Condens. Matter J. Phys.: Condens. Matter 32 (2020) 165902 (25pp) https://doi.org/10.1088/1361-648X/ab51ff 32 Wannier90 as a community code: new 2020 features and applications © 2020 IOP Publishing Ltd Giovanni Pizzi1,29,30 , Valerio Vitale2,3,29 , Ryotaro Arita4,5 , Stefan Bl gel6 , Frank Freimuth6 , Guillaume G ranton6 , JCOMEL ü é Marco Gibertini1,7 , Dominik Gresch8 , Charles Johnson9 , Takashi Koretsune10,11 , Julen Ibañez-Azpiroz12 , Hyungjun Lee13,14 , 165902 Jae-Mo Lihm15 , Daniel Marchand16 , Antimo Marrazzo1 , Yuriy Mokrousov6,17 , Jamal I Mustafa18 , Yoshiro Nohara19 , 4 20 21 G Pizzi et al Yusuke Nomura , Lorenzo Paulatto , Samuel Poncé , Thomas Ponweiser22 , Junfeng Qiao23 , Florian Thöle24 , Stepan S Tsirkin12,25 , Małgorzata Wierzbowska26 , Nicola Marzari1,29 , Wannier90 as a community code: new features and applications David Vanderbilt27,29 , Ivo Souza12,28,29 , Arash A Mostofi3,29 and Jonathan R Yates21,29 Printed in the UK 1 Theory and Simulation of Materials (THEOS) and National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne (EPFL), CH-1015 CM Lausanne, Switzerland 2 Cavendish Laboratory, Department of Physics, University of Cambridge, Cambridge CB3 0HE, United Kingdom 10.1088/1361-648X/ab51ff 3 Departments of Materials and Physics, and the Thomas Young Centre for Theory and Simulation of Materials, Imperial College London, London SW7 2AZ, United Kingdom 4 RIKEN -
Accelerating Molecular Docking by Parallelized Heterogeneous Computing – a Case Study of Performance, Quality of Results
solis vasquez, leonardo ACCELERATINGMOLECULARDOCKINGBY PARALLELIZEDHETEROGENEOUSCOMPUTING–A CASESTUDYOFPERFORMANCE,QUALITYOF RESULTS,ANDENERGY-EFFICIENCYUSINGCPUS, GPUS,ANDFPGAS Printed and/or published with the support of the German Academic Exchange Service (DAAD). ACCELERATINGMOLECULARDOCKINGBYPARALLELIZED HETEROGENEOUSCOMPUTING–ACASESTUDYOF PERFORMANCE,QUALITYOFRESULTS,AND ENERGY-EFFICIENCYUSINGCPUS,GPUS,ANDFPGAS at the Computer Science Department of the Technische Universität Darmstadt submitted in fulfilment of the requirements for the degree of Doctor of Engineering (Dr.-Ing.) Doctoral thesis by Solis Vasquez, Leonardo from Lima, Peru Reviewers Prof. Dr.-Ing. Andreas Koch Prof. Dr. Christian Plessl Date of the oral exam October 14, 2019 Darmstadt, 2019 D 17 Solis Vasquez, Leonardo: Accelerating Molecular Docking by Parallelized Heterogeneous Computing – A Case Study of Performance, Quality of Re- sults, and Energy-Efficiency using CPUs, GPUs, and FPGAs Darmstadt, Technische Universität Darmstadt Date of the oral exam: October 14, 2019 Please cite this work as: URN: urn:nbn:de:tuda-tuprints-92886 URL: https://tuprints.ulb.tu-darmstadt.de/id/eprint/9288 This document is provided by TUprints, The Publication Service of the Technische Universität Darmstadt https://tuprints.ulb.tu-darmstadt.de This work is licensed under a Creative Commons “Attribution-ShareAlike 4.0 In- ternational” license. ERKLÄRUNGENLAUTPROMOTIONSORDNUNG §8 Abs. 1 lit. c PromO Ich versichere hiermit, dass die elektronische Version meiner Disserta- tion mit der schriftlichen Version übereinstimmt. §8 Abs. 1 lit. d PromO Ich versichere hiermit, dass zu einem vorherigen Zeitpunkt noch keine Promotion versucht wurde. In diesem Fall sind nähere Angaben über Zeitpunkt, Hochschule, Dissertationsthema und Ergebnis dieses Versuchs mitzuteilen. §9 Abs. 1 PromO Ich versichere hiermit, dass die vorliegende Dissertation selbstständig und nur unter Verwendung der angegebenen Quellen verfasst wurde. -
Universid Ade De Sã O Pa
A hardware/software codesign for the chemical reactivity of BRAMS Carlos Alberto Oliveira de Souza Junior Dissertação de Mestrado do Programa de Pós-Graduação em Ciências de Computação e Matemática Computacional (PPG-CCMC) UNIVERSIDADE DE SÃO PAULO DE SÃO UNIVERSIDADE Instituto de Ciências Matemáticas e de Computação Instituto Matemáticas de Ciências SERVIÇO DE PÓS-GRADUAÇÃO DO ICMC-USP Data de Depósito: Assinatura: ______________________ Carlos Alberto Oliveira de Souza Junior A hardware/software codesign for the chemical reactivity of BRAMS Master dissertation submitted to the Instituto de Ciências Matemáticas e de Computação – ICMC- USP, in partial fulfillment of the requirements for the degree of the Master Program in Computer Science and Computational Mathematics. FINAL VERSION Concentration Area: Computer Science and Computational Mathematics Advisor: Prof. Dr. Eduardo Marques USP – São Carlos August 2017 Ficha catalográfica elaborada pela Biblioteca Prof. Achille Bassi e Seção Técnica de Informática, ICMC/USP, com os dados fornecidos pelo(a) autor(a) Souza Junior, Carlos Alberto Oliveira de S684h A hardware/software codesign for the chemical reactivity of BRAMS / Carlos Alberto Oliveira de Souza Junior; orientador Eduardo Marques. -- São Carlos, 2017. 109 p. Dissertação (Mestrado - Programa de Pós-Graduação em Ciências de Computação e Matemática Computacional) -- Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo, 2017. 1. Hardware. 2. FPGA. 3. OpenCL. 4. Codesign. 5. Heterogeneous-computing. I. Marques, Eduardo, orient. II. Título. Carlos Alberto Oliveira de Souza Junior Um coprojeto de hardware/software para a reatividade química do BRAMS Dissertação apresentada ao Instituto de Ciências Matemáticas e de Computação – ICMC-USP, como parte dos requisitos para obtenção do título de Mestre em Ciências – Ciências de Computação e Matemática Computacional. -
EPSRC Service Level Agreement with STFC for Computational Science Support
CoSeC Computational Science Centre for Research Communities EPSRC Service Level Agreement with STFC for Computational Science Support FY 2016/17 Report and Update on FY 2017/18 Work Plans This document contains the 2016/17 plans, 2016/17 summary reports, and 2017/18 plans for the programme in support of CCP and HEC communities delivered by STFC and funded by EPSRC through a Service Level Agreement. Notes in blue are in-year updates on progress to the tasks included in the 2016/17 plans. Text highlighted in yellow shows changes to the draft 2017/18 plans that we submitted in January 2017. Contents CCP5 – Computer Simulation of Condensed Phases .......................................................................... 4 CCP5 – 2016 / 17 Plans (1 April 2016 – 31 March 2017) ...................................................... 4 CCP5 – Summary Report (1 April 2016 – 31 March 2017) .................................................... 7 CCP5 –2017 / 18 Plans (1 April 2017 – 31 March 2018) ....................................................... 8 CCP9 – Electronic Structure of Solids .................................................................................................. 9 CCP9 – 2016 / 17 Plans (1 April 2016 – 31 March 2017) ...................................................... 9 CCP9 – Summary Report (1 April 2016 – 31 March 2017) .................................................. 11 CCP9 – 2017 / 18 Plans (1 April 2017 – 31 March 2018) .................................................... 12 CCP-mag – Computational Multiscale -
Arxiv:1508.02735V1 [Cond-Mat.Mtrl-Sci] 11 Aug 2015
A critical look at methods for calculating charge transfer couplings fast and accurately Pablo Ramos, Marc Mankarious, and Michele Pavanello∗ Department of Chemistry, Rutgers University, Newark, NJ 07102, USA E-mail: [email protected] Abstract We present here a short and subjective review of methods for calculating charge trans- fer couplings. Although we mostly focus on Density Functional Theory, we discuss a small subset of semiempirical methods as well as the adiabatic-to-diabatic transformation methods typically coupled with wavefunction-based electronic structure calculations. In this work, we will present the reader with a critical assessment of the regimes that can be modelled by the various methods – their strengths and weaknesses. In order to give a feeling about the practical aspects of the calculations, we also provide the reader with a practical protocol for running coupling calculations with the recently developed FDE-ET method. arXiv:1508.02735v1 [cond-mat.mtrl-sci] 11 Aug 2015 ∗To whom correspondence should be addressed 1 Contents 1 Introduction 3 2 DFT Based Methods 5 2.1 The Frozen Density Embedding formalism . 5 2.1.1 FDE-ET method . 9 2.1.2 Distance dependence of the electronic coupling . 12 2.1.3 Hole transfer in DNA oligomers . 14 2.2 Constrained Density Functional Theory Applied to Electron Transfer Simulations . 16 2.3 Fragment Orbital DFT . 18 2.3.1 Hole transfer rates on DNA hairpins . 20 2.3.2 The Curious Case of Sulfite Oxidase . 20 2.4 Ultrafast computations of the electronic couplings: The AOM method . 22 2.5 Note on orthogonality . -
Interoperability Between Electronic Structure Codes with the Quippy Toolkit
Interoperability between electronic structure codes with the quippy toolkit James Kermode Department of Physics King’s College London 6 September 2012 Outline Overview of libAtoms and QUIP Scripting interfaces Overview of quippy capabilities Live demo! Summary and Conclusions Outline Overview of libAtoms and QUIP Scripting interfaces Overview of quippy capabilities Live demo! Summary and Conclusions I The QUIP (QUantum mechanics and Interatomic Potentials) package, built on top of libAtoms, implements a wide variety of interatomic potentials and tight binding quantum mechanics, and is also able to call external packages. I Various hybrid combinations are also supported in the style of QM/MM, including ‘Learn on the Fly’ scheme (LOTF).2 3 I quippy is a Python interface to libAtoms and QUIP. The libAtoms, QUIP and quippy packages 1 I The libAtoms package is a software library written in Fortran 95 for the purposes of carrying out molecular dynamics simulations. 1http://www.libatoms.org 2Csányi et al., PRL (2004) 3http://www.jrkermode.co.uk/quippy I Various hybrid combinations are also supported in the style of QM/MM, including ‘Learn on the Fly’ scheme (LOTF).2 3 I quippy is a Python interface to libAtoms and QUIP. The libAtoms, QUIP and quippy packages 1 I The libAtoms package is a software library written in Fortran 95 for the purposes of carrying out molecular dynamics simulations. I The QUIP (QUantum mechanics and Interatomic Potentials) package, built on top of libAtoms, implements a wide variety of interatomic potentials and tight binding quantum mechanics, and is also able to call external packages. 1http://www.libatoms.org 2Csányi et al., PRL (2004) 3http://www.jrkermode.co.uk/quippy 3 I quippy is a Python interface to libAtoms and QUIP. -
Surface Functionalization of H-Terminated Diamond with NH and NH2 on the (1, 0, 0), (1, 1, 0) and (1, 1, 1) Surface Planes Using Density Functional Theory
Uppsala University Bachelor project, 15hp Surface functionalization of H-terminated diamond with NH and NH2 on the (1; 0; 0), (1; 1; 0) and (1; 1; 1) surface planes using Density Functional Theory Sara Kjellberg Supervised by Prof. Karin Larsson October 5, 2017 Abstract The adsorption energy of aminated diamond surfaces on the (1; 1; 1), (1; 1; 0) and (1; 0; 0) diamond surfaces have been calculated with density functional theory (DFT). The resulting values show that bridging is favoured before on-top nitrogen at low coverage for all the low index planes. The trend is even more pronounced as the coverage increases, most likely due to sterical hindrance. As expected, the imidogen (NH) and amidogen (NH2) groups are bonded to the surface planes in the order (1; 0; 0) > (1; 1; 1) > (1; 1; 0) as decrease in energy is the highest on the (1; 0; 0) plane and lowest on the (1 010) plane. The effect of sterical hindrance is largest at the (1; 1; 0) surface while the (1; 0; 0) surface is almost unaffected by neighbouring functional groups. Also, double bonded on-top imidogen (NH) is the least stable group at the (1; 0; 0) surface at low coverage while it is almost as stable as bridging NH when having a full mono-layer. Moreover, on-top imidogen (NH) was highly distorted and forced a change in surface geometry 1 Contents 1 Goal 3 2 Introduction 3 2.1 Background . .3 3 Theory 4 3.1 Surfaces of diamond . .4 3.1.1 General surface theory . -
Trends in Atomistic Simulation Software Usage [1.3]
A LiveCoMS Perpetual Review Trends in atomistic simulation software usage [1.3] Leopold Talirz1,2,3*, Luca M. Ghiringhelli4, Berend Smit1,3 1Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingenierie Chimiques, Valais, École Polytechnique Fédérale de Lausanne, CH-1951 Sion, Switzerland; 2Theory and Simulation of Materials (THEOS), Faculté des Sciences et Techniques de l’Ingénieur, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland; 3National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland; 4The NOMAD Laboratory at the Fritz Haber Institute of the Max Planck Society and Humboldt University, Berlin, Germany This LiveCoMS document is Abstract Driven by the unprecedented computational power available to scientific research, the maintained online on GitHub at https: use of computers in solid-state physics, chemistry and materials science has been on a continuous //github.com/ltalirz/ rise. This review focuses on the software used for the simulation of matter at the atomic scale. We livecoms-atomistic-software; provide a comprehensive overview of major codes in the field, and analyze how citations to these to provide feedback, suggestions, or help codes in the academic literature have evolved since 2010. An interactive version of the underlying improve it, please visit the data set is available at https://atomistic.software. GitHub repository and participate via the issue tracker. This version dated August *For correspondence: 30, 2021 [email protected] (LT) 1 Introduction Gaussian [2], were already released in the 1970s, followed Scientists today have unprecedented access to computa- by force-field codes, such as GROMOS [3], and periodic tional power. -
Porting the DFT Code CASTEP to Gpgpus
Porting the DFT code CASTEP to GPGPUs Toni Collis [email protected] EPCC, University of Edinburgh CASTEP and GPGPUs Outline • Why are we interested in CASTEP and Density Functional Theory codes. • Brief introduction to CASTEP underlying computational problems. • The OpenACC implementation http://www.nu-fuse.com CASTEP: a DFT code • CASTEP is a commercial and academic software package • Capable of Density Functional Theory (DFT) and plane wave basis set calculations. • Calculates the structure and motions of materials by the use of electronic structure (atom positions are dictated by their electrons). • Modern CASTEP is a re-write of the original serial code, developed by Universities of York, Durham, St. Andrews, Cambridge and Rutherford Labs http://www.nu-fuse.com CASTEP: a DFT code • DFT/ab initio software packages are one of the largest users of HECToR (UK national supercomputing service, based at University of Edinburgh). • Codes such as CASTEP, VASP and CP2K. All involve solving a Hamiltonian to explain the electronic structure. • DFT codes are becoming more complex and with more functionality. http://www.nu-fuse.com HECToR • UK National HPC Service • Currently 30- cabinet Cray XE6 system – 90,112 cores • Each node has – 2×16-core AMD Opterons (2.3GHz Interlagos) – 32 GB memory • Peak of over 800 TF and 90 TB of memory http://www.nu-fuse.com HECToR usage statistics Phase 3 statistics (Nov 2011 - Apr 2013) Ab initio codes (VASP, CP2K, CASTEP, ONETEP, NWChem, Quantum Espresso, GAMESS-US, SIESTA, GAMESS-UK, MOLPRO) GS2NEMO ChemShell 2%2% SENGA2% 3% UM Others 4% 34% MITgcm 4% CASTEP 4% GROMACS 6% DL_POLY CP2K VASP 5% 8% 19% http://www.nu-fuse.com HECToR usage statistics Phase 3 statistics (Nov 2011 - Apr 2013) 35% of the Chemistry software on HECToR is using DFT methods. -
HPC Issues for DFT Calculations
HPC Issues for DFT Calculations Adrian Jackson EPCC Scientific Simulation • Simulation fast becoming 4 th pillar of science – Observation, Theory, Experimentation, Simulation • Explore universe through simulation rather than experimentation – Test theories – Predict or validate experiments – Simulate “untestable” science • Reproduce “real world” in computers – Generally simplified – Dimensions and timescales restricted – Simulation of scientific problem or environment – Input of real data – Output of simulated data – Parameter space studies – Wide range of approaches http://www.nu-fuse.com Reduce runtime • Serial code optimisations – Reduce runtime through efficiencies – Unlikely to produce required savings • Upgrade hardware – 1965: Moore’s law predicts growth in complexity of processors – Doubling of CPU performance – Performance often improved through on chip parallelism http://www.nu-fuse.com Parallel Background • Why not just make a faster chip? – Theoretical • Physical limitations to size and speed of a single chip • Capacitance increases with complexity • Speed of light, size of atoms, dissipation of heat • The power used by a CPU core is proportional to Clock Frequency x Voltage 2 • Voltage reduction vs Clock speed for power requirements – Voltages become too small for “digital” differences – Practical • Developing new chips is incredibly expensive • Must make maximum use of existing technology http://www.nu-fuse.com Parallel Systems • Different types of parallel systems P M – Shared memory P M P M – Distributed memory P M Interconnect -
Lawrence Berkeley National Laboratory Recent Work
Lawrence Berkeley National Laboratory Recent Work Title From NWChem to NWChemEx: Evolving with the Computational Chemistry Landscape. Permalink https://escholarship.org/uc/item/4sm897jh Journal Chemical reviews, 121(8) ISSN 0009-2665 Authors Kowalski, Karol Bair, Raymond Bauman, Nicholas P et al. Publication Date 2021-04-01 DOI 10.1021/acs.chemrev.0c00998 Peer reviewed eScholarship.org Powered by the California Digital Library University of California From NWChem to NWChemEx: Evolving with the computational chemistry landscape Karol Kowalski,y Raymond Bair,z Nicholas P. Bauman,y Jeffery S. Boschen,{ Eric J. Bylaska,y Jeff Daily,y Wibe A. de Jong,x Thom Dunning, Jr,y Niranjan Govind,y Robert J. Harrison,k Murat Keçeli,z Kristopher Keipert,? Sriram Krishnamoorthy,y Suraj Kumar,y Erdal Mutlu,y Bruce Palmer,y Ajay Panyala,y Bo Peng,y Ryan M. Richard,{ T. P. Straatsma,# Peter Sushko,y Edward F. Valeev,@ Marat Valiev,y Hubertus J. J. van Dam,4 Jonathan M. Waldrop,{ David B. Williams-Young,x Chao Yang,x Marcin Zalewski,y and Theresa L. Windus*,r yPacific Northwest National Laboratory, Richland, WA 99352 zArgonne National Laboratory, Lemont, IL 60439 {Ames Laboratory, Ames, IA 50011 xLawrence Berkeley National Laboratory, Berkeley, 94720 kInstitute for Advanced Computational Science, Stony Brook University, Stony Brook, NY 11794 ?NVIDIA Inc, previously Argonne National Laboratory, Lemont, IL 60439 #National Center for Computational Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6373 @Department of Chemistry, Virginia Tech, Blacksburg, VA 24061 4Brookhaven National Laboratory, Upton, NY 11973 rDepartment of Chemistry, Iowa State University and Ames Laboratory, Ames, IA 50011 E-mail: [email protected] 1 Abstract Since the advent of the first computers, chemists have been at the forefront of using computers to understand and solve complex chemical problems. -
Quantum Chemical Calculations of NMR Parameters
Quantum Chemical Calculations of NMR Parameters Tatyana Polenova University of Delaware Newark, DE Winter School on Biomolecular NMR January 20-25, 2008 Stowe, Vermont OUTLINE INTRODUCTION Relating NMR parameters to geometric and electronic structure Classical calculations of EFG tensors Molecular properties from quantum chemical calculations Quantum chemistry methods DENSITY FUNCTIONAL THEORY FOR CALCULATIONS OF NMR PARAMETERS Introduction to DFT Software Practical examples Tutorial RELATING NMR OBSERVABLES TO MOLECULAR STRUCTURE NMR Spectrum NMR Parameters Local geometry Chemical structure (reactivity) I. Calculation of experimental NMR parameters Find unique solution to CQ, Q, , , , , II. Theoretical prediction of fine structure constants from molecular geometry Classical electrostatic model (EFG)- only in simple ionic compounds Quantum mechanical calculations (Density Functional Theory) (EFG, CSA) ELECTRIC FIELD GRADIENT (EFG) TENSOR: POINT CHARGE MODEL EFG TENSOR IS DETERMINED BY THE COMBINED ELECTRONIC AND NUCLEAR WAVEFUNCTION, NO ANALYTICAL EXPRESSION IN THE GENERAL CASE THE SIMPLEST APPROXIMATION: CLASSICAL POINT CHARGE MODEL n Zie 4 V2,k = 3 Y2,k ()i,i i=1 di 5 ATOMS CONTRIBUTING TO THE EFG TENSOR ARE TREATED AS POINT CHARGES, THE RESULTING EFG TENSOR IS THE SUM WITH RESPECT TO ALL ATOMS VERY CRUDE MODEL, WORKS QUANTITATIVELY ONLY IN SIMPLEST IONIC SYSTEMS, BUT YIELDS QUALITATIVE TRENDS AND GENERAL UNDERSTANDING OF THE SYMMETRY AND MAGNITUDE OF THE EXPECTED TENSOR ELECTRIC FIELD GRADIENT (EFG) TENSOR: POINT CHARGE MODEL n Zie 4 V2,k = 3 Y2,k ()i,i i=1 di 5 Ze V = ; V = 0; V = 0 2,0 d 3 2,±1 2,±2 2Ze V = ; V = 0; V = 0 2,0 d 3 2,±1 2,±2 3 Ze V = ; V = 0; V = 0 2,0 2 d 3 2,±1 2,±2 V2,0 = 0; V2,±1 = 0; V2,±2 = 0 MOLECULAR PROPERTIES FROM QUANTUM CHEMICAL CALCULATIONS H = E See for example M.