IBM Blue Gene: Architecture and Software Overview
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
2017 HPC Annual Report Team Would Like to Acknowledge the Invaluable Assistance Provided by John Noe
sandia national laboratories 2017 HIGH PERformance computing The 2017 High Performance Computing Annual Report is dedicated to John Noe and Dino Pavlakos. Building a foundational framework Editor in high performance computing Yasmin Dennig Contributing Writers Megan Davidson Sandia National Laboratories has a long history of significant contributions to the high performance computing Mattie Hensley community and industry. Our innovative computer architectures allowed the United States to become the first to break the teraflop barrier—propelling us to the international spotlight. Our advanced simulation and modeling capabilities have been integral in high consequence US operations such as Operation Burnt Frost. Strong partnerships with industry leaders, such as Cray, Inc. and Goodyear, have enabled them to leverage our high performance computing capabilities to gain a tremendous competitive edge in the marketplace. Contributing Editor Laura Sowko As part of our continuing commitment to provide modern computing infrastructure and systems in support of Sandia’s missions, we made a major investment in expanding Building 725 to serve as the new home of high performance computer (HPC) systems at Sandia. Work is expected to be completed in 2018 and will result in a modern facility of approximately 15,000 square feet of computer center space. The facility will be ready to house the newest National Nuclear Security Administration/Advanced Simulation and Computing (NNSA/ASC) prototype Design platform being acquired by Sandia, with delivery in late 2019 or early 2020. This new system will enable continuing Stacey Long advances by Sandia science and engineering staff in the areas of operating system R&D, operation cost effectiveness (power and innovative cooling technologies), user environment, and application code performance. -
Safety and Security Challenge
SAFETY AND SECURITY CHALLENGE TOP SUPERCOMPUTERS IN THE WORLD - FEATURING TWO of DOE’S!! Summary: The U.S. Department of Energy (DOE) plays a very special role in In fields where scientists deal with issues from disaster relief to the keeping you safe. DOE has two supercomputers in the top ten supercomputers in electric grid, simulations provide real-time situational awareness to the whole world. Titan is the name of the supercomputer at the Oak Ridge inform decisions. DOE supercomputers have helped the Federal National Laboratory (ORNL) in Oak Ridge, Tennessee. Sequoia is the name of Bureau of Investigation find criminals, and the Department of the supercomputer at Lawrence Livermore National Laboratory (LLNL) in Defense assess terrorist threats. Currently, ORNL is building a Livermore, California. How do supercomputers keep us safe and what makes computing infrastructure to help the Centers for Medicare and them in the Top Ten in the world? Medicaid Services combat fraud. An important focus lab-wide is managing the tsunamis of data generated by supercomputers and facilities like ORNL’s Spallation Neutron Source. In terms of national security, ORNL plays an important role in national and global security due to its expertise in advanced materials, nuclear science, supercomputing and other scientific specialties. Discovery and innovation in these areas are essential for protecting US citizens and advancing national and global security priorities. Titan Supercomputer at Oak Ridge National Laboratory Background: ORNL is using computing to tackle national challenges such as safe nuclear energy systems and running simulations for lower costs for vehicle Lawrence Livermore's Sequoia ranked No. -
The Artisanal Nuke, 2014
The Artisanal Nuke Mary C. Dixon US Air Force Center for Unconventional Weapons Studies Maxwell Air Force Base, Alabama THE ARTISANAL NUKE by Mary C. Dixon USAF Center for Unconventional Weapons Studies 125 Chennault Circle Maxwell Air Force Base, Alabama 36112-6427 July 2014 Disclaimer The opinions, conclusions, and recommendations expressed or implied in this publication are those of the author and do not necessarily reflect the views of the Air University, Air Force, or Department of Defense. ii Table of Contents Chapter Page Disclaimer ................................................................................................... ii Table of Contents ....................................................................................... iii About the Author ......................................................................................... v Acknowledgements ..................................................................................... vi Abstract ....................................................................................................... ix 1 Introduction .............................................................................................. 1 2 Background ............................................................................................ 19 3 Stockpile Stewardship ........................................................................... 27 4 Opposition & Problems ......................................................................... 71 5 Milestones & Accomplishments .......................................................... -
Technical Issues in Keeping the Nuclear Stockpile Safe, Secure, and Reliable
Technical Issues in Keeping the Nuclear Stockpile Safe, Secure, and Reliable Marvin L. Adams Texas A&M University Sidney D. Drell Stanford University ABSTRACT The United States has maintained a safe, secure, and reliable nuclear stockpile for 16 years without relying on underground nuclear explosive tests. We argue that a key ingredient of this success so far has been the expertise of the scientists and engineers at the national labs with the responsibility for the nuclear arsenal and infrastructure. Furthermore for the foreseeable future this cadre will remain the critical asset in sustaining the nuclear enterprise on which the nation will rely, independent of the size of the stockpile or the details of its purpose, and to adapt to new requirements generated by new findings and changes in policies. Expert personnel are the foundation of a lasting and responsive deterrent. Thus far, the United States has maintained a safe, secure, and reliable nuclear stockpile by adhering closely to the designs of existing “legacy” weapons. It remains to be determined whether this is the best course to continue, as opposed to introducing new designs (as envisaged in the original RRW program), re-using previously designed and tested components, or maintaining an evolving combination of new, re-use, and legacy designs. We argue the need for a stewardship program that explores a broad spectrum of options, includes strong peer review in its assessment of options, and provides the necessary flexibility to adjust to different force postures depending on evolving strategic developments. U.S. decisions and actions about nuclear weapons can be expected to affect the nuclear policy choices of other nations – non-nuclear as well as nuclear – on whose cooperation we rely in efforts to reduce the global nuclear danger. -
A Comparison of the Current Top Supercomputers
A Comparison of the Current Top Supercomputers Russell Martin Xavier Gallardo Top500.org • Started in 1993 to maintain statistics on the top 500 performing supercomputers in the world • Started by Hans Meuer, updated twice annually • Uses LINPACK as main benchmark November 2012 List 1. Titan, Cray, USA 2. Sequoia, IBM, USA 3. K Computer, Fujitsu, Japan 4. Mira, IBM, USA 5. JUQueen, IBM, Germany 6. SuperMUC, IBM, Germany Performance Trends http://top500.org/statistics/perfdevel/ Trends • 84.8% use 6+ processor cores • 76% Intel, 12% AMD Opteron, 10.6% IBM Power • Most common Interconnects - Infiniband, Gigabit Ethernet • 251 in USA, 123 in Asia, 105 in Europe • Current #1 Computer • Oak Ridge National Laboratory Oak Ridge, Tennessee • Operational Oct 29, 2012 • Scientific research • AMD Opteron / NVIDIA Tesla • 18688 Nodes, 560640 Cores • Cray Gemini Interconnect • Theoretical 27 PFLOPS Sequoia • Ranked #2 • Located at the Lawrence Livermore National Library in Livermore, CA • Designed for "Uncertainty Quantification Studies" • Fully classified work starting April 17 2013 Sequoia • Cores: 1572864 • Nodes: 98304 • RAM: 1572864 GB • Interconnect: Custom 5D Torus • Max Performance: 16324.8 Tflops • Theoretical Max: 20132.7 Tflops • Power Consumption: 7890 kW Sequoia • IBM BlueGene/Q Architecture o Started in 1999 for protein folding o Custom SoC Layout 16 PPC Cores per Chip, each 1.6Ghz, 0.8V • Built for OpenMP and POSIX programs • Automatic SIMD exploitation Sequoia • #3 on Top 500, #1 June 2011 • RIKEN Advanced Institute for Computational -
Performance Tuning of Graph500 Benchmark on Supercomputer Fugaku
Performance tuning of Graph500 benchmark on Supercomputer Fugaku Masahiro Nakao (RIKEN R-CCS) Outline Graph500 Benchmark Supercomputer Fugaku Tuning Graph500 Benchmark on Supercomputer Fugaku 2 Graph500 https://graph500.org Graph500 has started since 2010 as a competition for evaluating performance of large-scale graph processing The ranking is updated twice a year (June and November) Fugaku won the awards twice in 2020 One of kernels in Graph500 is BFS (Breadth-First Search) An artificial graph called the Kronecker graph is used Some vertices are connected to many other vertices while numerous others are connected to only a few vertices Social network is known to have a similar property 3 Overview of BFS BFS Input:graph and root Output:BFS tree Data structure and BFS algorithm are free 4 Hybrid-BFS [Beamer, 2012] Scott Beamer et al. Direction-optimizing breadth-first search, SC ’12 It is suitable for small diameter graphs used in Graph500 Perform BFS while switching between Top-down and Bottom-up In the middle of BFS, the number of vertices being visited increases explosively, so it is inefficient in only Top-down Top-down Bottom-up 0 0 1 1 1 1 1 1 Search for unvisited vertices Search for visited vertices from visited vertices from unvisited vertices 5 2D Hybrid-BFS [Beamer, 2013] Scott Beamer, et. al. Distributed Memory Breadth- First Search Revisited: Enabling Bottom-Up Search. IPDPSW '13. Distribute the adjacency matrix to a 2D process grid (R x C) Communication only within the column process and within the row process Allgatherv, Alltoallv, -
The Blue Gene/Q Compute Chip
The Blue Gene/Q Compute Chip Ruud Haring / IBM BlueGene Team © 2011 IBM Corporation Acknowledgements The IBM Blue Gene/Q development teams are located in – Yorktown Heights NY, – Rochester MN, – Hopewell Jct NY, – Burlington VT, – Austin TX, – Bromont QC, – Toronto ON, – San Jose CA, – Boeblingen (FRG), – Haifa (Israel), –Hursley (UK). Columbia University . University of Edinburgh . The Blue Gene/Q project has been supported and partially funded by Argonne National Laboratory and the Lawrence Livermore National Laboratory on behalf of the United States Department of Energy, under Lawrence Livermore National Laboratory Subcontract No. B554331 2 03 Sep 2012 © 2012 IBM Corporation Blue Gene/Q . Blue Gene/Q is the third generation of IBM’s Blue Gene family of supercomputers – Blue Gene/L was announced in 2004 – Blue Gene/P was announced in 2007 . Blue Gene/Q – was announced in 2011 – is currently the fastest supercomputer in the world • June 2012 Top500: #1,3,7,8, … 15 machines in top100 (+ 101-103) – is currently the most power efficient supercomputer architecture in the world • June 2012 Green500: top 20 places – also shines at data-intensive computing • June 2012 Graph500: top 2 places -- by a 7x margin – is relatively easy to program -- for a massively parallel computer – and its FLOPS are actually usable •this is not a GPGPU design … – incorporates innovations that enhance multi-core / multi-threaded computing • in-memory atomic operations •1st commercial processor to support hardware transactional memory / speculative execution •… – is just a cool chip (and system) design • pushing state-of-the-art ASIC design where it counts • innovative power and cooling 3 03 Sep 2012 © 2012 IBM Corporation Blue Gene system objectives . -
MT-Lib: a Topology-Aware Message Transfer Library for Graph500 on Supercomputers Xinbiao Gan
IEEE TRANSACTIONS ON JOURNAL NAME, MANUSCRIPT ID 1 MT-lib: A Topology-aware Message Transfer Library for Graph500 on Supercomputers Xinbiao Gan Abstract—We present MT-lib, an efficient message transfer library for messages gather and scatter in benchmarks like Graph500 for Supercomputers. Our library includes MST version as well as new-MST version. The MT-lib is deliberately kept light-weight, efficient and friendly interfaces for massive graph traverse. MST provides (1) a novel non-blocking communication scheme with sending and receiving messages asynchronously to overlap calculation and communication;(2) merging messages according to the target process for reducing communication overhead;(3) a new communication mode of gathering intra-group messages before forwarding between groups for reducing communication traffic. In MT-lib, there are (1) one-sided message; (2) two-sided messages; and (3) two-sided messages with buffer, in which dynamic buffer expansion is built for messages delivery. We experimented with MST and then testing Graph500 with MST on Tianhe supercomputers. Experimental results show high communication efficiency and high throughputs for both BFS and SSSP communication operations. Index Terms—MST; Graph500; Tianhe supercomputer; Two-sided messages with buffer —————————— —————————— 1 INTRODUCTION HE development of high-performance supercompu- large graphs, a new benchmark called the Graph500 was T ting has always been a strategic goal for many coun- proposed in 2010[3]. Differently from the Top 500 used to tries [1,2]. Currently, exascale computing poses severe ef- supercomputers metric with FLOPS (Floating Point Per ficiency and stability challenges. Second) for compute-intensive su-percomputing applica- Large-scale graphs have applied to many seminars from tions. -
Supercomputers – Prestige Objects Or Crucial Tools for Science and Industry?
Supercomputers – Prestige Objects or Crucial Tools for Science and Industry? Hans W. Meuer a 1, Horst Gietl b 2 a University of Mannheim & Prometeus GmbH, 68131 Mannheim, Germany; b Prometeus GmbH, 81245 Munich, Germany; This paper is the revised and extended version of the Lorraine King Memorial Lecture Hans Werner Meuer was invited by Lord Laird of Artigarvan to give at the House of Lords, London, on April 18, 2012. Keywords: TOP500, High Performance Computing, HPC, Supercomputing, HPC Technology, Supercomputer Market, Supercomputer Architecture, Supercomputer Applications, Supercomputer Technology, Supercomputer Performance, Supercomputer Future. 1 e-mail: [email protected] 2 e-mail: [email protected] 1 Content 1 Introduction ..................................................................................................................................... 3 2 The TOP500 Supercomputer Project ............................................................................................... 3 2.1 The LINPACK Benchmark ......................................................................................................... 4 2.2 TOP500 Authors ...................................................................................................................... 4 2.3 The 39th TOP500 List since 1993 .............................................................................................. 5 2.4 The 39th TOP10 List since 1993 ............................................................................................... -
An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers
ORNL/TM-2020/1561 An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers Hyogi Sim Awais Khan Sudharshan S. Vazhkudai Approved for public release. Distribution is unlimited. August 11, 2020 DOCUMENT AVAILABILITY Reports produced after January 1, 1996, are generally available free via US Department of Energy (DOE) SciTech Connect. Website: www.osti.gov/ Reports produced before January 1, 1996, may be purchased by members of the public from the following source: National Technical Information Service 5285 Port Royal Road Springfield, VA 22161 Telephone: 703-605-6000 (1-800-553-6847) TDD: 703-487-4639 Fax: 703-605-6900 E-mail: [email protected] Website: http://classic.ntis.gov/ Reports are available to DOE employees, DOE contractors, Energy Technology Data Ex- change representatives, and International Nuclear Information System representatives from the following source: Office of Scientific and Technical Information PO Box 62 Oak Ridge, TN 37831 Telephone: 865-576-8401 Fax: 865-576-5728 E-mail: [email protected] Website: http://www.osti.gov/contact.html This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal lia- bility or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or rep- resents that its use would not infringe privately owned rights. Refer- ence herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise, does not nec- essarily constitute or imply its endorsement, recommendation, or fa- voring by the United States Government or any agency thereof. -
Report Is Available on the UCS Website At
Making Smart Security Choices The Future of the U.S. Nuclear Weapons Complex Making Smart SecurityChoices The Future of the U.S. Nuclear Weapons Complex Lisbeth Gronlund Eryn MacDonald Stephen Young Philip E. Coyle III Steve Fetter OCTOBER 2013 Revised March 2014 ii UNION OF CONCERNED SCIENTISTS © 2013 Union of Concerned Scientists All rights reserved Lisbeth Gronlund is a senior scientist and co-director of the Union of Concerned Scientists (UCS) Global Security Program. Eryn MacDonald is an analyst in the UCS Global Security Program. Stephen Young is a senior analyst in the UCS Global Security Program. Philip E. Coyle III is a senior science fellow at the Center for Arms Control and Non-Proliferation. Steve Fetter is a professor in the School of Public Policy at the University of Maryland. The Union of Concerned Scientists puts rigorous, independent science to work to solve our planet’s most pressing problems. Joining with citizens across the country, we combine technical analysis and effective advocacy to create innovative, practical solutions for a healthy, safe, and sustainable future. More information about UCS and the Global Security Program is available on the UCS website at www.ucsusa.org/nuclear_weapons_and_global_security. The full text of this report is available on the UCS website at www.ucsusa.org/smartnuclearchoices. DESIGN & PROductiON DG Communications/www.NonprofitDesign.com COVER image Department of Defense/Wikimedia Commons Four B61 nuclear gravity bombs on a bomb cart at Barksdale Air Force Base in Louisiana. Printed on recycled paper. MAKING SMART SECURITY CHOICES iii CONTENTS iv Figures iv Tables v Acknowledgments 1 Executive Summary 4 Chapter 1. -
Looking Back on Supercomputer Fugaku Development Project
Special Contribution Looking Back on Supercomputer Fugaku Development Project Yutaka Ishikawa RIKEN Center for Computational Science Project Leader, Flagship 2020 Project For the purpose of leading the resolution of various social and scientific issues surrounding Japan and contributing to the promotion of science and technology, strengthening industry, and building a safe and secure nation, RIKEN (The Institute of Physical and Chemical Research), and Fujitsu have worked on the research and development of the supercomputer Fugaku (here- after, Fugaku) since 2014. The installation of the hardware was completed in May 2020 and, in June, the world’s top performance was achieved in terms of the results of four benchmarks: Linpack, HPCG, Graph500, and HPL-AI. At present, we are continuing to make adjustments ahead of public use in FY2021. This article looks back on the history of the Fugaku development project from the preparation period and the two turning points that had impacts on the entire project plan. the system: system power consumption of 30 to 40 MW, 1. Introduction and 100 times higher performance than the K com- The supercomputer Fugaku (hereafter, Fugaku) is puter in some actual applications. one of the outcomes of the Flagship 2020 Project (com- This article looks back on the history of the Post-K monly known as the Post-K development), initiated by project since the preparation period and the two turn- the Ministry of Education, Culture, Sports, Science and ing points that had impacts on the entire project plan. Technology (MEXT) starting in FY2014. This project consists of two parts: development of a successor to 2.