Optimizing the Performance of Reactive Molecular Dynamics Simulations for Multi-Core Architectures
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Bond Order Potential for Fesi
Bond order potential for FeSi P. S¨ule∗ Research Institute for Technical Physics and Material Science, Konkoly Thege u. 29-33, Budapest, Hungary, [email protected],www.mfa.kfki.hu/∼sule, (Dated: October 12, 2018) A new parameter set has been derived for FeSi using the Albe-Erhart-type bond order poten- tial (BOP) and the PONTIFIX code for fitting the parameters on a large training set of various polymorphs. Ab initio calculations are also carried out to study the relative stability of various polymorphs and to use the obtained data in the training set. The original BOP formalism was unable to account for the correct energetic relationship between the B20 (ε-FeSi) and B2 (CsCl) phases and notoriusly slightly favors incorrectly the B2 polymorph. In order to correct this improper behavior the BOP potential has been extended by a Columbic term (BOP+C) in order to account for the partial ionic character of FeSi. Using this potential we are able to account for the correct phase order between the most stable B20 and B2 (CsCl) polymorphs when the net atomic charges are properly set. Although this brings in a new somewhat uncertain parameter (the net charges) one can adjust properly the BOP+C potential for specific problems. To demonstrate this we study under high pressure the B2 phase which becomes more stable vs. B20 as it is found experimentally and expected to be taken place in the Earth mantle. The obtained BOP has also been tested for the metallic and semiconducting disilicides (α-FeSi2 and β-FeSi2) and for the Si/β-FeSi2 heterostruc- ture. -
Ushering in a New Era: Argonne National Laboratory & Aurora
Ushering in a New Era Argonne National Laboratory’s Aurora System April 2015 ANL Selects Intel for World’s Biggest Supercomputer 2-system CORAL award extends IA leadership in extreme scale HPC Aurora Argonne National Laboratory >180PF Trinity NNSA† April ‘15 Cori >40PF NERSC‡ >30PF July ’14 + Theta Argonne National Laboratory April ’14 >8.5PF >$200M ‡ Cray* XC* Series at National Energy Research Scientific Computing Center (NERSC). † Cray XC Series at National Nuclear Security Administration (NNSA). 2 The Most Advanced Supercomputer Ever Built An Intel-led collaboration with ANL and Cray to accelerate discovery & innovation >180 PFLOPS (option to increase up to 450 PF) 18X higher performance† >50,000 nodes Prime Contractor 13MW >6X more energy efficient† 2018 delivery Subcontractor Source: Argonne National Laboratory and Intel. †Comparison of theoretical peak double precision FLOPS and power consumption to ANL’s largest current system, MIRA (10PFs and 4.8MW) 3 Aurora | Science From Day One! Extreme performance for a broad range of compute and data-centric workloads Transportation Biological Science Renewable Energy Training Argonne Training Program on Extreme- Scale Computing Aerodynamics Biofuels / Disease Control Wind Turbine Design / Placement Materials Science Computer Science Public Access Focus Areas Focus US Industry and International Co-array Fortran Batteries / Solar Panels New Programming Models 4 Aurora | Built on a Powerful Foundation Breakthrough technologies that deliver massive benefits Compute Interconnect File System 3rd Generation 2nd Generation Intel® Xeon Phi™ Intel® Omni-Path Intel® Lustre* Architecture Software >17X performance† >20X faster† >3X faster† FLOPS per node >500 TB/s bi-section bandwidth >1 TB/s file system throughput >12X memory bandwidth† >2.5 PB/s aggregate node link >5X capacity† bandwidth >30PB/s aggregate >150TB file system capacity in-package memory bandwidth Integrated Intel® Omni-Path Architecture Processor code name: Knights Hill Source: Argonne National Laboratory and Intel. -
Roofline Model Toolkit: a Practical Tool for Architectural and Program
Roofline Model Toolkit: A Practical Tool for Architectural and Program Analysis Yu Jung Lo, Samuel Williams, Brian Van Straalen, Terry J. Ligocki, Matthew J. Cordery, Nicholas J. Wright, Mary W. Hall, and Leonid Oliker University of Utah, Lawerence Berkeley National Laboratory {yujunglo,mhall}@cs.utah.edu {swwilliams,bvstraalen,tjligocki,mjcordery,njwright,loliker}@lbl.gov Abstract. We present preliminary results of the Roofline Toolkit for multicore, manycore, and accelerated architectures. This paper focuses on the processor architecture characterization engine, a collection of portable instrumented micro benchmarks implemented with Message Passing Interface (MPI), and OpenMP used to express thread-level paral- lelism. These benchmarks are specialized to quantify the behavior of dif- ferent architectural features. Compared to previous work on performance characterization, these microbenchmarks focus on capturing the perfor- mance of each level of the memory hierarchy, along with thread-level parallelism, instruction-level parallelism and explicit SIMD parallelism, measured in the context of the compilers and run-time environments. We also measure sustained PCIe throughput with four GPU memory managed mechanisms. By combining results from the architecture char- acterization with the Roofline model based solely on architectural spec- ifications, this work offers insights for performance prediction of current and future architectures and their software systems. To that end, we in- strument three applications and plot their resultant performance on the corresponding Roofline model when run on a Blue Gene/Q architecture. Keywords: Roofline, Memory Bandwidth, CUDA Unified Memory 1 Introduction The growing complexity of high-performance computing architectures makes it difficult for users to achieve sustained application performance across different architectures. -
Development of Interatomic Potentials in the Tersoff-Albe Formalism For
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Helsingin yliopiston digitaalinen arkisto M.Sc. thesis Computational physics Development of interatomic potentials in the Tersoff-Albe formalism for metal compounds Jesper Byggmästar 2016 Supervisors: Morten Nagel Krister Henriksson Examiners: Krister Henriksson Kai Nordlund UNIVERSITY OF HELSINKI DEPARTMENT OF PHYSICS PL 64 (Gustaf Hällströmin katu 2) 00014 Helsingin yliopisto HELSINGIN YLIOPISTO | HELSINGFORS UNIVERSITET | UNIVERSITY OF HELSINKI Tiedekunta/Osasto | Fakultet/Sektion | Faculty Laitos | Institution | Department Faculty of Science Department of Physics Tekij¨a | F¨orfattare | Author Jesper Byggm¨astar Ty¨on nimi | Arbetets titel | Title Development of interatomic potentials in the Tersoff-Albe formalism for metal compounds Oppiaine | L¨aro¨amne | Subject Fysik Ty¨on laji | Arbetets art | Level Aika | Datum | Month and year Sivum¨a¨ar¨a | Sidoantal | Number of pages M.Sc. 08/2016 57 Tiivistelm¨a | Referat | Abstract Interatomic potentials are used to describe the motion of the individual atoms in atomistic simula- tions. An accurate treatment of the interatomic forces in a system of atoms requires heavy quantum mechanical calculations, which are not computationally feasible in large-scale simulations. Intera- tomic potentials are computationally more efficient analytical functions used for calculating the potential energy of a system of atoms, allowing simulations of larger systems or longer time scales than in quantum mechanical simulations. The interatomic potential functions must be fitted to known properties of the material the poten- tial describes. Developing a potential for a specific material typically involves fitting a number of parameters included in the functional form, against a database of important material properties, such as cohesive, structural, and elastic properties of the relevant crystal structures. -
Characterizing and Understanding HPC Job Failures Over the 2K-Day Life of IBM Bluegene/Q System
Characterizing and Understanding HPC Job Failures over The 2K-day Life of IBM BlueGene/Q System Sheng Di,∗ Hanqi Guo,∗ Eric Pershey,∗ Marc Snir,y Franck Cappello∗y ∗Argonne National Laboratory, IL, USA [email protected], [email protected], [email protected], [email protected] yUniversity of Illinois at Urbana-Champaign, IL, USA [email protected] Abstract—An in-depth understanding of the failure features of Blue Joule (UK), also adopt the same Blue Gene/Q system HPC jobs in a supercomputer is critical to the large-scale system architecture and they are still in operation. maintenance and improvement of the service quality for users. In Studying the job failure features in a large-scale system is this paper, we investigate the features of hundreds of thousands of jobs in one of the most powerful supercomputers, the IBM nontrivial in that it involves numerous messages logged from Blue Gene/Q Mira, based on 2001 days of observations with a across multiple data sources and many messages are heavily total of over 32.44 billion core-hours. We study the impact of the duplicated [2]. In our work, we performed a joint analysis system’s events on the jobs’ execution in order to understand the by leveraging four different data sources: the reliability, avail- system’s reliability from the perspective of jobs and users. The ability, and serviceability (RAS) log; task execution log; job characterization involves a joint analysis based on multiple data sources, including the reliability, availability, and serviceability scheduling log; and I/O behavior log. The RAS log is the most (RAS) log; job scheduling log; the log regarding each job’s important system log related to system reliability such as node physical execution tasks; and the I/O behavior log. -
June 2012 | TOP500 Supercomputing Sites
PROJECT LISTS STATISTICS RESOURCES NEWS CONTACT SUBMISSIONS LINKS HOME Home Lists June 2012 MANNHEIM, Germany; BERKELEY, Calif.; and KNOXVILLE, Tenn.—For the first time since November 2009, a United Contents States supercomputer sits atop the TOP500 list of the world’s top supercomputers. Named Sequoia, the IBM BlueGene/Q system installed at the Department of Energy’s Lawrence Livermore National Laboratory achieved an impressive 16.32 Release petaflop/s on the Linpack benchmark using 1,572,864 cores. Top500 List Sequoia is also one of the most energy efficient systems on Press Release (PDF) the list, which will be released Monday, June 18, at the 2012 Press Release International Supercomputing Conference in Hamburg, Germany. This will mark the 39th edition of the list, which is List highlights compiled twice each year. Performance Development On the latest list, Fujitsu’s “K Computer” installed at the RIKEN Related Files Advanced Institute for Computational Science (AICS) in Kobe, Japan, is now the No. 2 system with 10.51 Pflop/s on the TOP500 List (XML) Linpack benchmark using 705,024 SPARC64 processing TOP500 List (Excel) A 1.044 persone piace cores. The K Computer held the No. 1 spot on the previous TOP500 Poster Mi piace two lists. questo elemento. Di' che Poster in PDF piace anche a te, prima di The new Mira supercomputer, an IBM BlueGene/Q system at tutti i tuoi amici. Argonne National Laboratory in Illinois, debuted at No. 3, with Drilldown 8.15 petaflop/s on the Linpack benchmark using 786,432 Performance Development cores. The other U.S. -
Molecular Dynamics Simulations of Deuterium-Beryllium Interactions Under Fusion Reactor Conditions
Master's Thesis Molecular dynamics simulations of deuterium-beryllium interactions under fusion reactor conditions Elnaz Safi 2014 Supervisors: Doctor Carolina Bj¨orkas Ane Lasa (M.Sc) Examiners: Professor Kai Nordlund Doctor Carolina Bj¨orkas UNIVERSITY OF HELSINKI DEPARTMENT OF PHYSICS P.O. Box 64 (Gustaf H¨allstr¨omin katu 2a) 00014 University of Helsinki Tiedekunta – Fakultet – Faculty Laitos – Institution– Department Faculty of Science Department of Physics Tekijä – Författare – Author Elnaz Safi Työn nimi – Arbetets titel – Title Molecular dynamics simulations of deuterium-beryllium interactions under fusion reactor conditions Oppiaine – Läroämne – Subject Computational Nanoscience and Matrials Physics Työn laji – Arbetets art – Level Aika – Datum – Month and year Sivumäärä – Sidoantal – Number of pages Master's Thesis March 2014 43 Tiivistelmä – Referat – Abstract Beryllium (Be) is a strong candidate as plasma-facing material for the main wall of future fusion reactors. Thus, its erosion plays a key role in predicting the reactor's life-time and viability. MD simulations can be a powerful tool to study Be behavior under high plasma particle flux. In this work, beryllium sputtering due to D bombardment is studied using MD simulations. We have analyzed the fundamental mechanisms for Be erosion considering some important parameters that influence the outcome, such as particle flux and surface temperature. It is shown that the Be erosion yield is strongly dependent on the surface temperature and its dependency on the particle flux is negligible. We also show that different species of Be molecules can be sputtered from its surface, mainly due to swift chemical sputtering mechanism. Avainsanat – Nyckelord – Keywords MD simulations, Be, Plasma-facing materials Ohjaaja tai ohjaajat – Handledare – Supervisor or supervisors Doctor Carolina Björkas and Ane Lasa (M. -
Supercomputers – Prestige Objects Or Crucial Tools for Science and Industry?
Supercomputers – Prestige Objects or Crucial Tools for Science and Industry? Hans W. Meuer a 1, Horst Gietl b 2 a University of Mannheim & Prometeus GmbH, 68131 Mannheim, Germany; b Prometeus GmbH, 81245 Munich, Germany; This paper is the revised and extended version of the Lorraine King Memorial Lecture Hans Werner Meuer was invited by Lord Laird of Artigarvan to give at the House of Lords, London, on April 18, 2012. Keywords: TOP500, High Performance Computing, HPC, Supercomputing, HPC Technology, Supercomputer Market, Supercomputer Architecture, Supercomputer Applications, Supercomputer Technology, Supercomputer Performance, Supercomputer Future. 1 e-mail: [email protected] 2 e-mail: [email protected] 1 Content 1 Introduction ..................................................................................................................................... 3 2 The TOP500 Supercomputer Project ............................................................................................... 3 2.1 The LINPACK Benchmark ......................................................................................................... 4 2.2 TOP500 Authors ...................................................................................................................... 4 2.3 The 39th TOP500 List since 1993 .............................................................................................. 5 2.4 The 39th TOP10 List since 1993 ............................................................................................... -
An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers
ORNL/TM-2020/1561 An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers Hyogi Sim Awais Khan Sudharshan S. Vazhkudai Approved for public release. Distribution is unlimited. August 11, 2020 DOCUMENT AVAILABILITY Reports produced after January 1, 1996, are generally available free via US Department of Energy (DOE) SciTech Connect. Website: www.osti.gov/ Reports produced before January 1, 1996, may be purchased by members of the public from the following source: National Technical Information Service 5285 Port Royal Road Springfield, VA 22161 Telephone: 703-605-6000 (1-800-553-6847) TDD: 703-487-4639 Fax: 703-605-6900 E-mail: [email protected] Website: http://classic.ntis.gov/ Reports are available to DOE employees, DOE contractors, Energy Technology Data Ex- change representatives, and International Nuclear Information System representatives from the following source: Office of Scientific and Technical Information PO Box 62 Oak Ridge, TN 37831 Telephone: 865-576-8401 Fax: 865-576-5728 E-mail: [email protected] Website: http://www.osti.gov/contact.html This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal lia- bility or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or rep- resents that its use would not infringe privately owned rights. Refer- ence herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise, does not nec- essarily constitute or imply its endorsement, recommendation, or fa- voring by the United States Government or any agency thereof. -
ALCF Newsbytes Argonne Leadership Computing Facility Argonne National Laboratory
ALCF Newsbytes Argonne Leadership Computing Facility Argonne National Laboratory Volume 2, Issue 3 | Summer 2012 Table of Contents Program Awards 247 Million Hours Program Awards 247 Million Hours .......... 1 of ALCF Supercomputer Time Mira Ranks Third on TOP500 .................... 2 Rapid Procedure Unites Quantum Chemistry with Artificial Intelligence ........ 4 Nine research projects have been awarded 247 million Lithium-Air: The “Mount Everest” processor hours of computing time at the Argonne of Batteries for Electric Vehicles ............... 5 Spotlight: Steve Crusan ............................. 6 Leadership Computing Facility (ALCF) through the ALCF Systems Shine on Green500 ............. 7 U.S. Department of Energy’s (DOE) initiative—the ASCR Mira Shares Graph 500 Top Spot ............... 8 Leadership Computing Challenge (ALCC). Chosen through a Leap to Petascale Workshop in Review ..... 8 peer-review process, the projects selected reflect areas of special interest to the DOE: energy, national emergencies, and broadening the community of researchers capable of Events of Interest Argonne’s Energy Showcase using leadership-class computing resources. Saturday, September 15, 2012 9 a.m. – 4 p.m. Argonne National Laboratory will open its gates to the community for a day of discovery ALCC awards of compute time on Intrepid (the ALCF’s Blue Gene/P) become available and fun for the whole family. The event is free in July for the following recipients: and open to the public. Advance registration is required as attendance will be limited. -
Hybrid Bond-Order Potential for Silicon Suleiman Oloriegbe Clemson University, [email protected]
Clemson University TigerPrints All Dissertations Dissertations 12-2008 Hybrid Bond-Order Potential for Silicon Suleiman Oloriegbe Clemson University, [email protected] Follow this and additional works at: https://tigerprints.clemson.edu/all_dissertations Part of the Physical Chemistry Commons Recommended Citation Oloriegbe, Suleiman, "Hybrid Bond-Order Potential for Silicon" (2008). All Dissertations. 328. https://tigerprints.clemson.edu/all_dissertations/328 This Dissertation is brought to you for free and open access by the Dissertations at TigerPrints. It has been accepted for inclusion in All Dissertations by an authorized administrator of TigerPrints. For more information, please contact [email protected]. HYBRID BOND-ORDER POTENTIAL FOR SILICON _____________________________________ A Dissertation Presented to the Graduate School of Clemson University ___________________________________ In Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy Chemistry _____________________________________ by Suleiman Yahaya Oloriegbe December 2008 _____________________________________ Accepted by: Dr. Steven J. Stuart, Committee Chair Dr. Brian Dominy Dr. Jason McNeill Dr. Robert Latour ABSTRACT A new hybrid bond-order potential for silicon is developed. The functional form of the potential is derived from hybrid of expressions from empirical bond-order formalism and first principles approximations. The total energy is expressed as the sum of attractive, repulsive and promotion energies. By introducing a screening function derived from approximations to first principles expressions, the potential is made long-ranged by allowing covalent interactions beyond the first nearest neighbor shell of atoms in agreement with quantum mechanical descriptions of the bonding in silicon. Environment- dependent promotion energy is introduced that accurately accounts for energetic interactions due to changes in hybridization state of atoms during chemical bonding. -
Elastic Constants of Graphene: Comparison of Empirical Potentials and DFT Calculations
Elastic constants of graphene: Comparison of empirical potentials and DFT calculations Irina V. Lebedevaa,∗, Alexander S. Minkinb, Andrey M. Popovc, Andrey A. Knizhnikd aNano-Bio Spectroscopy Group and ETSF, Universidad del Pa´ısVasco, CFM CSIC-UPV/EHU, Avenida de Tolosa 72, San Sebastian 20018, Spain bNational Research Centre “Kurchatov Institute”, Kurchatov Square 1, Moscow 123182, Russia cInstitute for Spectroscopy of Russian Academy of Sciences, Fizicheskaya Street 5, Troitsk, Moscow 108840, Russia dKintech Lab Ltd., 3rd Khoroshevskaya Street 12, Moscow 123298, Russia Abstract The capacity of popular classical interatomic potentials to describe elastic properties of graphene is tested. The Tersoff poten- tial, Brenner reactive bond-order potentials REBO-1990, REBO-2000, REBO-2002 and AIREBO as well as LCBOP, PPBE-G, ReaxFF-CHO and ReaxFF-C2013 are considered. Linear and non-linear elastic response of graphene under uniaxial stretching is investigated by static energy calculations. The Young’s modulus, Poisson’s ratio and high-order elastic moduli are verified against the reference data available from experimental measurements and ab initio studies. The density functional theory calculations are performed to complement the reference data on the effective Young’s modulus and Poisson’s ratio at small but finite elongations. It is observed that for all the potentials considered, the elastic energy deviates remarkably from the simple quadratic dependence already at elongations of several percent. Nevertheless, LCBOP provides the results consistent with the reference data and thus realistically describes in-plane deformations of graphene. Reasonable agreement is also observed for the computationally cheap PPBE-G potential. REBO-2000, AIREBO and REBO-2002 give a strongly non-linear elastic response with a wrong sign of the third-order elastic modulus and the corresponding results are very far from the reference data.