<<

ISC 2020, Frankfurt, Highlights of June 22, 2020 the 55th Erich TOP500 List Strohmaier ISC20 TOP500 AWARDS • TOP500: Europe #1, #3, #2, #1 • HPCG #1 • HPL-AI #1 • #1

HPC5 - PowerEdge C4140, Gold 6252 24C 2.1GHz, Tesla V100, Mellanox HDR Infiniband Eni S.p.A., is ranked ______No. 1 in Europe ______

among the World’s TOP500 with 35.45 Pflop/s Linpack Performance in the 55th TOP500 List published at the ISC 2020 Digital on June 22nd, 2020.

Erich Strohmaier Horst Simon Martin Meuer NERSC/Berkeley Lab NERSC/Berkeley Lab Prometeus

Sierra - IBM Power System AC922, IBM POWER9 22C 3.1GHz, NVIDIA Volta GV100, Dual- rail Mellanox EDR Infiniband DOE/NNSA/LLNL, is ranked ______No. 3 ______

among the World’s TOP500 Supercomputers with 94.64 Pflop/s Linpack Performance in the 55th TOP500 List published at the ISC 2020 Digital Conference on June 22nd, 2020.

Erich Strohmaier Jack Dongarra Horst Simon Martin Meuer NERSC/Berkeley Lab University of Tennessee NERSC/Berkeley Lab Prometeus

Summit - IBM Power System AC922, IBM POWER9 22C 3.07GHz, NVIDIA Volta GV100, Dual-rail Mellanox EDR Infiniband DOE/SC/Oak Ridge National Laboratory, United States is ranked ______No. 2 ______

among the World’s TOP500 Supercomputers with 148.6 Pflop/s Linpack Performance in the 55th TOP500 List published at the ISC 2020 Digital Conference on June 22nd, 2020.

Erich Strohmaier Jack Dongarra Horst Simon Martin Meuer NERSC/Berkeley Lab University of Tennessee NERSC/Berkeley Lab Prometeus

Supercomputer - A64FX 48C 2.2GHz, Tofu interconnect D RIKEN Center for , is ranked ______No. 1 ______

among the World’s TOP500 Supercomputers with 415.53 Pflop/s Linpack Performance in the 55th TOP500 List published at the ISC 2020 Digital Conference on June 22nd, 2020.

Erich Strohmaier Jack Dongarra Horst Simon Martin Meuer NERSC/Berkeley Lab University of Tennessee NERSC/Berkeley Lab Prometeus JUNE 2020 PRESENTED AT

SYSTEM ACHIEVED

NUMBER 1 Fugaku 13.4 Riken R-CCS Pflop/s Riken Center for Computational Science JAPAN

JACK DONGARRA MICHAEL HEROUX PIOTR LUSZCZEK

IN COLLABORATION WITH SPONSORED BY JUNE 2020

NUMBER 1 SYSTEM Fugaku ACHIEVED 1.42 Eflop/s Riken R-CCS Riken Center for Computational Science 1 JAPAN

Jack Dongarra Piotr Luszczek

PRESENTED AT

MN-3 - MN-Core Server, Xeon 8260M 24C 2.4GHz, MN-Core, RoCEv2/MN-Core DirectConnect Preferred Networks, Japan is ranked ______No. 1 in the Green500 ______

among the World’s TOP500 Supercomputers

with 21.11 GFlops/Watt Linpack Power-Efficiency

on the Green500 List published at ISC 2020 Digital Conference, June 22nd, 2020

ISC20 TOP500 TOPICS • A new and exciting #1 • Exaflops on the HPL-AI • A renewed TOP10 • Green500 shows Progress • Low Market Turnover • Research Market vs Commercial Market Power # Site Manufacturer Country Cores Rmax ST [Pflops] [MW] RIKEN 41 LIST: THEFugaku TOP10 1 Supercomputer Fugaku, Japan 7,299,072 415.5 28.3 Center for Computational Science A64FX 48C 2.2GHz, Tofu interconnect D Oak Ridge 2 IBM IBM Power System, USA 2,414,592 148.6 10.1 National Laboratory P9 22C 3.07GHz, Mellanox EDR, NVIDIA GV100 Lawrence Livermore 3 IBM USA 1,572,480 94.6 7.4 National Laboratory IBM Power System, P9 22C 3.1GHz, Mellanox EDR, NVIDIA GV100 TaihuLight National Supercomputing 4 NRCPC NRCPC Sunway SW26010, 10,649,600 93.0 15.4 Center in Wuxi 260C 1.45GHz National University of Tianhe-2A 5 NUDT China 4,981,760 61.4 18.5 Defense Technology ANUDT TH-IVB-FEP, Xeon 12C 2.2GHz, Matrix-2000 HPC5 6 Eni S.p.A Dell EMC PowerEdge C4140, Italy 669,760 35.5 2.25 Xeon 24C 2.1GHz, NVIDIA T. V100, Mellanox HDR Selene 7 NVIDIA Corporation NVIDIA DGX A100 SuperPOD, USA 277,760 27.6 1.34 AMD 64C 2.25GHz, NVIDIA A100, Mellanox HDR Texas Advanced Frontera 8 Dell Dell C6420, USA 448,448 23.5 Center / Univ. of Texas Xeon Platinum 8280 28C 2.7GHz, Mellanox HDR Marconi-100 9 CINECA IBM IBM Power System AC922, Italy 347,776 21.6 1.98 P9 16C 3GHz, Nvidia Volta V100, Mellanox EDR Piz Daint Swiss National Supercomputing 10 Cray XC50, 387,872 21.2 2.38 Centre (CSCS) Xeon E5 12C 2.6GHz, P100, Aries HPCG HPCG/ HPCG/ # T Site Manufacturer Computer Country Rmax ST [Pflop/s] [Pflop/s] Peak HPL 41 LIST: FugakuTHE TOP10 1 1 RIKEN-CCS Fujitsu Supercomputer Fugaku, Japan 13.400 415.5 2.6% 3.2% A64FX 48C 2.2GHz, Tofu interconnect D Summit 2 2 Oak Ridge National Laboratory IBM IBM Power System, USA 2.926 148.6 1.5% 2.0% P9 22C 3.07 GHz, Volta GV100, EDR Lawrence Livermore Sierra 3 3 IBM USA 1.796 94.6 1.4% 1.9% National Laboratory IBM Power System, P9 22C 3.1 GHz, Volta GV100, EDR HPC5 4 6 Eni S.p.A Dell EMC PowerEdge C4140, Italy 0.860 35.5 1.7% 2.4% Xeon 24C 2.1GHz, NVIDIA V100, Mellanox HDR Trinity Los Alamos NL / 5 11 Cray Cray XC40, USA 0.546 20.2 1.3% 2.7% Sandia NL 7250 68C 1.4GHz, Aries Selene 6 7 NVIDIA Corporation NVIDIA DGX A100 SuperPOD, USA 0.509 27.6 1.5% 1.9% AMD 64C 2.25GHz, NVIDIA A100, Mellanox HDR National Institute of Advanced AI Bridging Cloud Infrastructure (ABCI) 7 12 Industrial Science and Fujitsu CX2550 M4, Japan 0.509 19.9 1.6% 2.6% Technology Xeon Gold 20C 2.4GHz, IB-EDR, NVIDIA V100 Piz Daint Swiss National Supercomputing 8 10 Cray Cray XC50, Switzerland 0.497 21.2 1.8% 2.4% Centre (CSCS) Xeon E5 12C 2.6GHz, Aries, NVIDIA Tesla P100 National Supercomputing Sunway TaihuLight 9 4 NRCPC China 0.481 93.0 0.4% 0.5% Center in Wuxi NRCPC Sunway SW26010, 260C 1.45GHz Nurion Korea Institute of Science and 10 18 Cray Cray CS500, 0.392 13.9 1.5% 2.8% Technology Information Intel Phi 7250 68C 1.4 GHz, OmniPath MOST ENERGY EFFICIENT ARCHITECTURES

Interconnect Accelerator Rmax/ Computer Power RoCEv2/MN-Core MN-3, Preferred Network MN-Core Server Xeon 24C 2.4GHz MN-Core 21.1 DirectConnect AMD -2 64C Selene, NVIDIA DGX A100 SuperPOD Mellanox HDR NVIDIA A100 20.5 2.25GHz NA-1, ZettaScaler-2.2 Xeon 16C 1.3GHz Infiniband EDR PEZY-SC2 *18.4 Fujitsu A64FX A64FX Prototype, Fujitsu A64FX Tofu Interconnect D - 16.9 48C 2GHz AiMOS, IBM Power System AC922 POWER9 20C 3.45GHz Mellanox EDR Volta GV100 16.3 HPC5, Dell PowerEdge C4140 Xeon 24C 2.1GHz Mellanox HDR Tesla V100 SXM2 15.7 Satori, IBM Power System AC922 POWER9 20C 2.4GHz Mellanox EDR Tesla V100 SXM2 15.6 Summit, IBM Power System AC922 POWER9 22C 3.07GHz Mellanox EDR Volta GV100 14.7 Supercomputer Fugaku, Fujitsu A64FX 48C 2.2GHz Tofu interconnect D - 14.7 Marconi-100, IBM Power System Power9 16C 3.0GHz Mellanox EDR Volta GV100 14.6 * Efficiency based on Power optimized HPL runs of equal size to TOP500 run. [Gflops/Watt] ENERGY EFFICIENCY

25 MN-Core Max-Efficiency Fujitsu A64FX

/W] ZettaScaler-2.2 20 3.0 Gflops 15 DGX SaturnV ZettaScaler-1.6 c 10 DGX SaturnV AMD FirePro Tsubame KFC

/Power [ /Power Volta Mic NVIDIA K20x – K80 5 BlueGene/Q

Linpack TOP500 Average 0 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 PERFORMANCE DEVELOPMENT

100 Eflop/s 1.00E+11 June 2013 10 Eflop/s 2.22 EFlop/s 1 Eflop/s 1.00E+09 416 PFlop/s 100 Pflop/s SUM 1.00E+0710 Pflop/s 1 Pflop/s 100 Tflop/s 1.23 PFlop/s 1.00E+05 N=1 10 Tflop/s 1.00E+031 Tflop/s 1.17 TFlop/s 100 Gflop/s N=500 59.7 GFlop/s 1.00E+0110 Gflop/s June 2008 1 Gflop/s 1.00E-01100 Mflop/s 422 MFlop/s 19941996199820002002200420062008201020122014201620182020 REPLACEMENT RATE

350 300 250 200 150 100

50 51 0 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 PERFORMANCE FRACTION OF THE TOP5 SYSTEMS 20% 18% 16% 14% 1 12% 2 10% 3 8% 6% 4 4% 5 2% 0% 19941996199820002002200420062008201020122014201620182020 PERFORMANCE FRACTION OF THE TOP5 SYSTEMS r 20% A -2 18% L r Simulato / Fugaku 16% TianheTaihuLight Earth 14% -Compute 1 d BlueGene K 12% 2 10% NWT ASCI Re 3 8% 6% 4 4% 5 2% 0% 19941996199820002002200420062008201020122014201620182020 COUNTRIES

500 China

400 Korea, South Italy 300 200 100 0 Japan United States 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017 2019 COUNTRIES / PERFORMANCE SHARE

Switzerland, 1% Ireland, other, 126, 6% United States , 1% 23, 1% China United Canada, 1% Kingdom, 1% Japan Germany, 3% United States, Italy 29% France, 4% France Germany Italy, 4% Japan, 24% China, 25% United Kingdom Canada Netherlands PRODUCERS

500 400 China 300

200 Europe

100 Japan 0 USA 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017 2019 VENDORS / SYSTEM SHARE

Others, 33, 7% Penguin, 6, 1% Nvidia, 7, 1% HPE/Cray Huawei, 7, 1% Dell EMC, 10, 2% Lenovo, 180, 36% IBM, 12, 2% Atos Fujitsu, 13, 3% Fujitsu IBM Atos, 26, 5% Sugon, HPE/Cray, 74, Dell EMC Inspur, 64, 13% 68, 15% Huawei 14%

# of systems, % of 500 VENDORS / PERFORMANCE SHARE

Others, 216, 10% Nvidia, Lenovo, 358, Lenovo 60, 3% Penguin, 14, 1% 16% HPE/Cray Huawei, 11, 0% Sugon Dell EMC, 85, 4% Inspur HPE/Cray, 317, IBM, 346, 14% Atos 16% Fujitsu IBM Fujitsu, 487, Sugon, 116, 5% 22% Dell EMC Inspur, 121, 5% Huawei Atos, 90, 4%

Sum of Pflop/s, % of whole list RESEARCH / COMMERCIAL MARKETS • Markets for scientific computing and for commercial data processing are very different. • Extract proper sub-samples for these markets from the full TOP500 list – TOP100 Research and Academic installations – TOP100 Industry (and Vendor) installations • Could try to separate out Industry installations but difficult to do – Ignore "Government, Classified, Others" for now – 100 works reasonably well, more might become tricky MARKET SEGMENTS

500 Others

400 Vendor Industry 300 Government 200 Classified 100 Academic Research 0 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017 2019 COUNTRIES / SYSTEM SHARE

United France, Others, States, 3% 10% 7% Japan, Others, United 2% States, United 21% 32% Kingdom, 7% France, China, 9% China, 78% 6% Germany , 11% Japan, 14% Research Commercial COUNTRIES / PERFORMANCE SHARE

United United Others, Kingdom, States, 10% 2% 16% France, 3% United Germany Others, States, 23% , 5% 32% France, 7% Japan, Japan, China, 36% 2% 52% China, 12% Research Commercial VENDORS / SYSTEM SHARE

Cray, 1% HPE, 3% IBM, 2% Others, Atos, 12% 3% Fujitsu, 1% Cray, Others, Inspur, 22% 24% 1% Sugon, HPE, 16% Inspur, Lenovo, 2% 29% 33% Atos, Sugon, Lenovo, Fujitsu, 13% 16% 6% 7% IBM, 9% Research Commercial VENDORS / PERFORMANCE SHARE

HPE, 6% Cray, 1% Cray, Atos, 3% 11% IBM, 6% HPE, 5% Inspur, Fujitsu, 0% Others, Atos, Others, 1% Sugon, 19% 5% 31% 0%Lenovo, 3% IBM, 23% Lenovo, 23% Fujitsu, Inspur, 34% 19%

Sugon, 10% Research Commercial CHIPS / SYSTEM SHARE

Power / AMD / ARM, ARM, 2% Others, Others, Power / NVIDIA, NVIDIA, 3% Intel 1% 1% NVIDIA, AMD, 5% 1% Phi, 1% 1% 6% Power, 2% Intel Phi, 11% Intel, Intel, 54% Intel / NVIDIA, 44% 51%

Intel / NVIDIA, 17%

Research Commercial CHIPS / PERFORMANCE SHARE

AMD / Others, ARM, Others, Power / NVIDIA, 1% 1% 11% NVIDIA, 8% 5% Intel, Intel Phi, 17% 1% Intel, Intel / NVIDIA, 8% 36% ARM, 31% Intel Phi, 8% Power / Power, AMD, 2% NVIDIA, Intel / NVIDIA, 48% 21% 2% Research Commercial SC20 TOP500 HIGHLIGHTS • Fugaku is the new #1 in the TOP500 • It measured at over 1 Exaflop on the HPL-AI in reduced precision • TOP10 has four new systems • Overall turn-over in the list is at a record low • TOP100 Research System and Commercial Systems show very different markets