Beyond Earth Simulator

Beyond Earth Simulator International Computing for the Atmospheric Sciences Symposium (iCAS2017) 海洋研究開発機構 September 11, 2017 地球情報基盤センターMakoto Tsukakoshi Information 情報システム部Systems Department Center for Earth Information Science and塚越 Technology眞 JAMSTEC Japan Agency for Marine-Earth Science and Technology 2 The main seven research and development issues during the third mid-term plan During the third mid-term plan, we set and address the seven research and development issues with all our strength due to promote strategic and focused research and development based on the national and social needs. Exploring untapped Ocean drilling – submarine resources Getting to know the Earth from beneath the seabed Detecting signals of global Information Science - environmental Predicting the Earth's change future by simulations Understanding seismogenic zones, and contributing to disaster mitigation Marine Bioscience - Exploring the unknown Construction of research base extreme biosphere to solve to spawn the ocean frontier the mystery of life 3 Typhoon-marine interaction using nonstatic atmospheric wave ocean coupling model Typhoon Vera in 1959 by Kazuhisa Tsuboki (2015) Typhoon-Ocean Interaction Study Using the Coupled Atmosphere-Ocean Non-hydrostatic Model: With Careful Consideration of Upper Outflow Layer Clouds of Typhoon 4 • Earth Simulator ： - Developed by NASDA, JAERI and JAMSTEC (563.4* Oku Yen) - Operation started on March 2002 and immediately recognized as #1 supercomputer - “It will keep #1 for 2 years in peak and 5 years in effective performance” by H. Miyoshi *inclusing facilities and building 5 Earth Simulator has changed its role when “K computer” is completed Council for Science and Technology Policy (CSTP) Report on Nov 25, 2005: After the completion of Leading Edge High Performance Supercomputer (i.e. K-Computer), other existing supercomputers should carry their roles in each fields, according to their purpose and performance , for example, Earth Simulator, should responsible for Marine and Earth Science on the base of accumulated application software assets, and thus they should meet growing demand for HPC in our country. 既存のスーパーコンピュータは、本プロジェクトで開発する計算機システムの完成後も、例えば地球シミュレータについては、これまでに蓄積してきたアプリケーションソフトウェアを活用して効果的・効率的に海洋地球科学分野を主とした計算を担当するなど、それぞれが得意とする各分野を中心として、目的や性能に応じた計算に応するとの役割分担の下に、今後も増大が予想される我が国の計算機需要に応えることとしている。 HPCI Strategy - MEXT ES K-computer Operated as national infrastructure thorugh HPCI* ＊ “Innovative High Performance Computing Infrastructure” initiative ES 6 3 Generation of Earth Simulator since 2002 System for prediction and creation of Marine-Earth Science data and information for Society Proof of Feasibility of Utilize Simulation, Prediction and data for Society Foundation of Marine- Earth Simulation Simulation and Prediction Challenge new subjects in Marine-Earth Science 10 PFLOPS Flagship supercomputer System for Marine-Earth Science June 2015 for All Science Earth Simulator Peak/Effective Performance 10x 1.3PFLOPS / 320TB 1 PFLOPS “Computenik” (Jack Donguarra, NY Times, April 20, 2002) March 2009 ３ Earth Simulator (ES2) Peak Performance x 131TFLOPS / 20TB March 2002 100 TFLOPS Earth Simulator (1st) 40TFLOPS / 10TB About 1.5MW* 8x to 10x Effective Performance based on Application Programs* 10 TFLOPS About 3MW* * Single Program Turn Around Time Performance: 2.1x to 2.8x Number of Programs: 4x = System Throughput 8x to 10 x 1 TFLOPS About 5MW* * Effective Power Consumption 7 JAMSTEC Information System Outline SINET JAMSTEC backbone network Science Information NETwork 4 10Gbps ×2 （1G～10Gbps） User terminals Earth Simulator backbone Earth Simulator (NEC SX-ACE) Visualization Server 40Gbps ×2 Service servers Mass Storage System Large Scale Shared Memory User terminals Server （SGI UV2000) Super computer systems (HPE Apollo 6000, 380 node in 2018) (ICE-X, UV2000, SX-9) 8 Earth Simulator Outline Earth Simulator JAMSTEC Backbone network Main Computer Storage Computing Nodes (5120) + Internode Frontend Server Network /home 87.6 TB 4 servers Model: NEC SX-ACE /data Total Performance 4.7 PB - Number of nodes 5,120 - Peak Performance 1.31 PFLOPS - Memory BandwidthBandwidth 1.31PB/s - Memory size 320 TB Node Performance Work - Number of CPU 1(4cores) 13.5 PB - Peak Performance 256 GFLOPS Large Scale Shared (64GFLOPS x 4cores) Double Precision Memory Server（UV2000) 17PB - Memory Bandwidth 256 GB/s Pre and Post System Mass Storage System - Memory size 64 GB 9 Benchmark Test showed ES’ｓ value The same #CPU (sockets) DOUBLES the performance with ASIS code. 2.8X with further tuning. (Performance Ratio vs. ES2) Both ES2 and SX-ACE deploy same #CPU This means : Effective Throughput Performance is > 10x (TAT – 2 to 2.8x, #of jobs 4x) Average (TUNE) 2.8 Tuning Indeed, FLOPs counts increased by 12x 260 EFLOPs(executed in FY2012) to Average (ASIS) 2.1 3,119EFLOPs(executed in FY2016) 温暖化MIROC 温暖化MIROC 全球大気NICAM Specfem3D地震 RSGDX地震ミクロ大気MSSG 海洋ODA/同化 10 Effective Resource Comparison with K-computer K Computer Usage in FY2015 Atomic Energy Mathematics Fusion (1.7%) (1.0%) Earth Material Science Particle Physics Engineering Biology Science Chemistry Life Science Astronomy (24.3%) (22.0%) （18.8%） (16.4%) (15.5%) Others (0.3%) ~2/3(estimation) Equivalent to 28% of Next Earth Simulator Earth ~28% Science 90.3% Earth Simulator in FY2016 11 Earth Simulator Resource Allocation/Usage（FY2016） Industry & Innovation 3.7% 0.7% External JAMSTEC Math Sci & 5.4% proposed proposed Engineering Life Science project (27) 20% project (23) 40% Earth Internal, Earthquake, 20.3% Tsunami 30% 70.0% Environmental 10% Strategic project with Change Government special support (6) contract (7) Fee based usage(8) Earth Simulator Usage by Earth Simulator Resource Application [node hour] Allocation (71 projects) [node hour] Outside Japan Industry Government Agency University Institutes using Earth Simulator 12 Earth Simulator Operation Highlights • Stable Operation in FY2016 • Power Efficiency - Availability 99.86%, - Improved 30% from the previous ES2(NEC - Utilization 89.07% SX-9) - Total 3,119EFLOPs * executed in FY2016 - Continued effort to optimized the cooling “ ” (7.5% of total peak performance : Total Executed FLOPs/Total Power* *:including facilities): 24x366xall nodes) 171TFLOPs/kwh in 2016 - improved 9.2% from 2015 *Using 2016 Yellowstone CPU hours/24x366 (81%), 3,119EF requires 2.3% of peak performance on Cheyanne 13 (proposal) Use actual FLOPs per watt to evaluate total energy efficiency of computing centers Efficiency should be “actually achieved Performance/Total energy consumed” Peak or Linpack Performance : Don’t represent real work loads (“Green500”) HPCG or other Benchmark suites : Better but do not fit each center’s AP spectrum (each centers should have been optimized to its own application work loads.) Benchmarks show only capability. Do not represent the center’s operational efforts. Power consumption must be actual and total include cooling. Evaluate the energy efficiency by “Executed FLOPs per watt” ! Where, the energy is measured total inclusive amount. (Example) In 2016 (JAN-DEC), Earth Simulator (Entire System including cooling facilities) has actually achieved 178,956 GFLOPS/Kwh (7.6% of theoretical peak x 100% non- stop operation) Using HPCG/peak performance ratio (5.2%) and annual GWh from the annual report 2014, efficiency of “K-Computer” is estimated 109,221 GFLOPS/Kwh 14 Goal: “Marine-Earth Informatics” 1. Produce large and accurate data data from observation, experimentation, simulation data of various form (structured/unstructured, images, texts) • Building large and precise data sets by accurate observation and simulation applying the latest data handling method. 2. Create information from large and diverse data • Development of a computing method to deal with large data with high performance • Applying ML, pattern recognition, analysis of precursor characteristic of rare events 3. Present new application and use of information • New use case of information for government and industry • New application area 4. Develop ICT technology to handle large and complex data • Integrating Simulation and Data Analysis • Interactive real time visualization • Leading-edge High performance platforms and devices 15 database for Policy Decision making for Future climate change (d4PDF) • d4PDF is a database including both historical and +4K future climate simulation results, and provides probabilistic Information on climate change in extreme events by high- resolution large ensemble simulations with a 60km AGCM and 20km NHRCM. • 2.6M node hours* and 2PB output on Earth Simulator in 2015 by MRI and MIROC team. *9% of entire resources in 2015 MRI-AGCM Historical Simulations 60km 6,000(AGCM)+3,000(RCM) years ・・ 100 members (AGCM) 60 years (1950-2010) ・50 members (NHRCM) downscaling (SST perturbations representing observational error) +4K Future Climate Simulations 5,400 years (AGCM+RCM) ・・ 90 members (AGCM) 60 years (1950-2010) ・90 members (NHRCM) (6 ΔSST from 6 CMIP5 models x 15 SST perturbations) Histogram of daily mean precipitation at Tokyo (60km AGCM) (a) Present day (b) Changes in the +4K world MRI-NHRCM 10yr Frequency(%) 20km 100yr Ratio of freq. (+4K/present) freq. of Ratio Daily precipitation (mm/day) Daily precipitation (mm/day) Shiogama,H. (2016) Database for Policy Decision making for Future climate change (d4PDF) 16 Four-dimensional Variational Ocean ReAnalysis for the Western North Pacific over 30 years (FORA-WNP30) •First-ever dataset covering the western North Pacific (15-65N, 117E -160W) over the last three decades

Beyond Earth Simulator

User's Manual

DMI-HIRLAM on the NEC SX-6

Performance Modeling the Earth Simulator and ASCI Q

Project Aurora 2017 Vector Inheritance

2020 Global High-Performance Computing Product Leadership Award

Hardware Technology of the Earth Simulator 1

Shared-Memory Vector Systems Compared

Recent Supercomputing Development in Japan

Hardware-Oblivious SIMD Parallelism for In-Memory Column-Stores

Performance Evaluation of Supercomputers Using HPCC and IMB Benchmarks

Optimizing Sparse Matrix-Vector Multiplication in NEC SX-Aurora Vector Engine

Supercomputers – Prestige Objects Or Crucial Tools for Science and Industry?