Highlights of the 55Th TOP500 List

ISC 2020, Frankfurt, Highlights of June 22, 2020 the 55th Erich TOP500 List Strohmaier ISC20 TOP500 AWARDS • TOP500: Europe #1, #3, #2, #1 • HPCG #1 • HPL-AI #1 • Green500 #1 HPC5 - PowerEdge C4140, Xeon Gold 6252 24C 2.1GHz, NVIDIA Tesla V100, Mellanox HDR Infiniband Eni S.p.A., Italy is ranked ________ No. 1 in Europe ________ among the World’s TOP500 Supercomputers with 35.45 Pflop/s Linpack Performance in the 55th TOP500 List published at the ISC 2020 Digital on June 22nd, 2020. Erich Strohmaier Jack Dongarra Horst Simon Martin Meuer NERSC/Berkeley Lab University of Tennessee NERSC/Berkeley Lab Prometeus Sierra - IBM Power System AC922, IBM POWER9 22C 3.1GHz, NVIDIA Volta GV100, Dual- rail Mellanox EDR Infiniband DOE/NNSA/LLNL, United States is ranked ________ No. 3 ________ among the World’s TOP500 Supercomputers with 94.64 Pflop/s Linpack Performance in the 55th TOP500 List published at the ISC 2020 Digital Conference on June 22nd, 2020. Erich Strohmaier Jack Dongarra Horst Simon Martin Meuer NERSC/Berkeley Lab University of Tennessee NERSC/Berkeley Lab Prometeus Summit - IBM Power System AC922, IBM POWER9 22C 3.07GHz, NVIDIA Volta GV100, Dual-rail Mellanox EDR Infiniband DOE/SC/Oak Ridge National Laboratory, United States is ranked ________ No. 2 ________ among the World’s TOP500 Supercomputers with 148.6 Pflop/s Linpack Performance in the 55th TOP500 List published at the ISC 2020 Digital Conference on June 22nd, 2020. Erich Strohmaier Jack Dongarra Horst Simon Martin Meuer NERSC/Berkeley Lab University of Tennessee NERSC/Berkeley Lab Prometeus Supercomputer Fugaku - A64FX 48C 2.2GHz, Tofu interconnect D RIKEN Center for Computational Science, Japan is ranked ________ No. 1 ________ among the World’s TOP500 Supercomputers with 415.53 Pflop/s Linpack Performance in the 55th TOP500 List published at the ISC 2020 Digital Conference on June 22nd, 2020. Erich Strohmaier Jack Dongarra Horst Simon Martin Meuer NERSC/Berkeley Lab University of Tennessee NERSC/Berkeley Lab Prometeus JUNE 2020 PRESENTED AT SYSTEM ACHIEVED NUMBER 1 Fugaku 13.4 Riken R-CCS Pﬂop/s Riken Center for Computational Science JAPAN JACK DONGARRA MICHAEL HEROUX PIOTR LUSZCZEK IN COLLABORATION WITH SPONSORED BY JUNE 2020 NUMBER 1 SYSTEM Fugaku ACHIEVED 1.42 Eﬂop/s Riken R-CCS Riken Center for Computational Science 1 JAPAN Jack Dongarra Piotr Luszczek PRESENTED AT MN-3 - MN-Core Server, Xeon 8260M 24C 2.4GHz, MN-Core, RoCEv2/MN-Core DirectConnect Preferred Networks, Japan is ranked ________ No. 1 in the Green500 ________ among the World’s TOP500 Supercomputers with 21.11 GFlops/Watt Linpack Power-Efficiency on the Green500 List published at ISC 2020 Digital Conference, June 22nd, 2020 ISC20 TOP500 TOPICS • A new and exciting #1 • Exaflops on the HPL-AI • A renewed TOP10 • Green500 shows Progress • Low Market Turnover • Research Market vs Commercial Market Power # Site Manufacturer Computer Country Cores Rmax ST [Pflops] [MW] RIKEN 41 LIST: THEFugaku TOP10 1 Fujitsu Supercomputer Fugaku, Japan 7,299,072 415.5 28.3 Center for Computational Science A64FX 48C 2.2GHz, Tofu interconnect D Summit Oak Ridge 2 IBM IBM Power System, USA 2,414,592 148.6 10.1 National Laboratory P9 22C 3.07GHz, MellAnox EDR, NVIDIA GV100 Lawrence Livermore Sierra 3 IBM USA 1,572,480 94.6 7.4 National Laboratory IBM Power System, P9 22C 3.1GHz, MellAnox EDR, NVIDIA GV100 Sunway TaihuLight National Supercomputing 4 NRCPC NRCPC Sunway SW26010, China 10,649,600 93.0 15.4 Center in Wuxi 260C 1.45GHz National University of Tianhe-2A 5 NUDT China 4,981,760 61.4 18.5 Defense Technology ANUDT TH-IVB-FEP, Xeon 12C 2.2GHz, MAtrix-2000 HPC5 6 Eni S.p.A Dell EMC PowerEdge C4140, Italy 669,760 35.5 2.25 Xeon 24C 2.1GHz, NVIDIA T. V100, MellAnox HDR Selene 7 NVIDIA Corporation NVIDIA DGX A100 SuperPOD, USA 277,760 27.6 1.34 AMD 64C 2.25GHz, NVIDIA A100, MellAnox HDR Texas Advanced Computing Frontera 8 Dell Dell C6420, USA 448,448 23.5 Center / Univ. of Texas Xeon PlAtinum 8280 28C 2.7GHz, MellAnox HDR Marconi-100 9 CINECA IBM IBM Power System AC922, Italy 347,776 21.6 1.98 P9 16C 3GHz, NvidiA VoltA V100, MellAnox EDR Piz Daint Swiss National Supercomputing 10 Cray Cray XC50, Switzerland 387,872 21.2 2.38 Centre (CSCS) Xeon E5 12C 2.6GHz, NVIDIA TeslA P100, Aries HPCG HPCG/ HPCG/ # T Site Manufacturer Computer Country Rmax ST [Pflop/s] [Pflop/s] Peak HPL 41 LIST: FugakuTHE TOP10 1 1 RIKEN-CCS Fujitsu Supercomputer Fugaku, Japan 13.400 415.5 2.6% 3.2% A64FX 48C 2.2GHz, Tofu interconnect D Summit 2 2 Oak Ridge National Laboratory IBM IBM Power System, USA 2.926 148.6 1.5% 2.0% P9 22C 3.07 GHz, VoltA GV100, EDR Lawrence Livermore Sierra 3 3 IBM USA 1.796 94.6 1.4% 1.9% National Laboratory IBM Power System, P9 22C 3.1 GHz, VoltA GV100, EDR HPC5 4 6 Eni S.p.A Dell EMC PowerEdge C4140, Italy 0.860 35.5 1.7% 2.4% Xeon 24C 2.1GHz, NVIDIA V100, Mellanox HDR Trinity Los Alamos NL / 5 11 Cray Cray XC40, USA 0.546 20.2 1.3% 2.7% Sandia NL Intel Xeon Phi 7250 68C 1.4GHz, Aries Selene 6 7 NVIDIA Corporation NVIDIA DGX A100 SuperPOD, USA 0.509 27.6 1.5% 1.9% AMD 64C 2.25GHz, NVIDIA A100, MellAnox HDR National Institute of Advanced AI Bridging Cloud Infrastructure (ABCI) 7 12 Industrial Science and Fujitsu PRIMERGY CX2550 M4, Japan 0.509 19.9 1.6% 2.6% Technology Xeon Gold 20C 2.4GHz, IB-EDR, NVIDIA V100 Piz Daint Swiss National Supercomputing 8 10 Cray Cray XC50, Switzerland 0.497 21.2 1.8% 2.4% Centre (CSCS) Xeon E5 12C 2.6GHz, Aries, NVIDIA TeslA P100 National Supercomputing Sunway TaihuLight 9 4 NRCPC China 0.481 93.0 0.4% 0.5% Center in Wuxi NRCPC Sunway SW26010, 260C 1.45GHz Nurion Korea Institute of Science and 10 18 Cray Cray CS500, South Korea 0.392 13.9 1.5% 2.8% Technology Information Intel Xeons Phi 7250 68C 1.4 GHz, OmniPath MOST ENERGY EFFICIENT ARCHITECTURES Interconnect Accelerator Rmax/ Computer Power RoCEv2/MN-Core MN-3, Preferred Network MN-Core Server Xeon 24C 2.4GHz MN-Core 21.1 DirectConnect AMD Zen-2 64C Selene, NVIDIA DGX A100 SuperPOD Mellanox HDR NVIDIA A100 20.5 2.25GHz NA-1, ZettaScaler-2.2 Xeon 16C 1.3GHz Infiniband EDR PEZY-SC2 *18.4 Fujitsu A64FX A64FX Prototype, Fujitsu A64FX Tofu Interconnect D - 16.9 48C 2GHz AiMOS, IBM Power System AC922 POWER9 20C 3.45GHz Mellanox EDR Volta GV100 16.3 HPC5, Dell PowerEdge C4140 Xeon 24C 2.1GHz Mellanox HDR Tesla V100 SXM2 15.7 Satori, IBM Power System AC922 POWER9 20C 2.4GHz Mellanox EDR Tesla V100 SXM2 15.6 Summit, IBM Power System AC922 POWER9 22C 3.07GHz Mellanox EDR Volta GV100 14.7 Supercomputer Fugaku, Fujitsu A64FX 48C 2.2GHz Tofu interconnect D - 14.7 Marconi-100, IBM Power System Power9 16C 3.0GHz Mellanox EDR Volta GV100 14.6 * Efficiency based on Power optimized HPL runs of equal size to TOP500 run. [Gflops/Watt] ENERGY EFFICIENCY 25 MN-Core Max-Efficiency Fujitsu A64FX /W] ZettaScaler-2.2 20 TsuBame 3.0 Gflops 15 DGX SaturnV ZettaScaler-1.6 c 10 DGX SaturnV AMD FirePro TsuBame KFC /Power [ /Power Volta Mic NVIDIA K20x – K80 5 BlueGene/Q Cell Linpack TOP500 Average 0 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 PERFORMANCE DEVELOPMENT 100 Eflop/s 1.00E+11 June 2013 10 Eflop/s 2.22 EFlop/s 1 Eflop/s 1.00E+09 416 PFlop/s 100 Pflop/s SUM 1.00E+0710 Pflop/s 1 Pflop/s 100 Tflop/s 1.23 PFlop/s 1.00E+05 N=1 10 Tflop/s 1.00E+031 Tflop/s 1.17 TFlop/s 100 Gflop/s N=500 59.7 GFlop/s 1.00E+0110 Gflop/s June 2008 1 Gflop/s 1.00E-01100 Mflop/s 422 MFlop/s 19941996199820002002200420062008201020122014201620182020 100 150 200 250 300 350 50 0 1993 1994 1995 1996 1997 1998 RATE REPLACEMENT 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 51 2019 2020 PERFORMANCE FRACTION OF THE TOP5 SYSTEMS 20% 18% 16% 14% 1 12% 2 10% 3 8% 6% 4 4% 5 2% 0% 19941996199820002002200420062008201020122014201620182020 PERFORMANCE FRACTION OF THE TOP5 SYSTEMS r 20% A -2 18% L r Simulato / Fugaku 16% TianheTaiHuLigHt EartH 14% -Compute 1 d BlueGene K 12% 2 10% NWT ASCI Re 3 8% 6% 4 4% 5 2% 0% 19941996199820002002200420062008201020122014201620182020 COUNTRIES 500 China 400 Korea, South Italy 300 Canada 200 France United Kingdom 100 Germany 0 Japan United States 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017 2019 COUNTRIES / PERFORMANCE SHARE Switzerland, 1% Ireland, other, 126, 6% United States Netherlands, 1% 23, 1% China United Canada, 1% Kingdom, 1% Japan Germany, 3% United States, Italy 29% France, 4% France Germany Italy, 4% Japan, 24% China, 25% United Kingdom Canada Netherlands 100 200 300 400 500 0 1993 1995 1997 1999 2001 PRODUCERS 2003 2005 2007 2009 2011 2013 2015 2017 2019 USA Japan Europe China Russia VENDORS / SYSTEM SHARE Others, 33, 7% Penguin, 6, 1% Lenovo Nvidia, 7, 1% HPE/Cray Huawei, 7, 1% Sugon Dell EMC, 10, 2% Lenovo, 180, Inspur 36% IBM, 12, 2% Atos Fujitsu, 13, 3% Fujitsu IBM Atos, 26, 5% Sugon, HPE/Cray, 74, Dell EMC Inspur, 64, 13% 68, 15% Huawei 14% # of systems, % of 500 VENDORS / PERFORMANCE SHARE Others, 216, 10% Nvidia, Lenovo, 358, Lenovo 60, 3% Penguin, 14, 1% 16% HPE/Cray Huawei, 11, 0% Sugon Dell EMC, 85, 4% Inspur HPE/Cray, 317, IBM, 346, 14% Atos 16% Fujitsu IBM Fujitsu, 487, Sugon, 116, 5% 22% Dell EMC Inspur, 121, 5% Huawei Atos, 90, 4% Sum of Pflop/s, % of whole list RESEARCH / COMMERCIAL MARKETS • Markets for scientific computing and for commercial data processing are very different.

Highlights of the 55Th TOP500 List

Interconnect Your Future Enabling the Best Datacenter Return on Investment

Linpack Evaluation on a Supercomputer with Heterogeneous Accelerators

Tsubame 2.5 Towards 3.0 and Beyond to Exascale

Performance Analysis on Energy Efficient High

Science-Driven Development of Supercomputer Fugaku

Towards Exascale Computing

Computational PHYSICS Shuai Dong

FCMSSR Meeting 2018-01 All Slides

Jack Dongarra: Supercomputing Expert and Mathematical Software Specialist

The Sunway Taihulight Supercomputer: System and Applications

Germany's Top500 Businesses

BDEC2 Poznan Japanese HPC Infrastructure Update