Arm HPC Ecosystem Hardware, Software and Tools
Total Page:16
File Type:pdf, Size:1020Kb
Arm HPC Ecosystem Hardware, Software and tools Srinath Vadlamani, Field Application Engineer SEA, April 8, 2019 Arm Technology Already Connects the World Arm is ubiquitous Partnership is key Choice is good 21 billion chips sold by We design IP, not One size is not always the partners in 2017 alone manufacture chips best fit for all Mobile/Embedded/IoT/ Partners build products HPC is a great fit for Automotive/Server/GPUs for their target markets co-design and collaboration 2 © 2019 Arm Limited Armv8-A Architecture Evolution RISC architecture § Only have 32 bits available for encoding all instructions § Supports the development of efficient implementations 64-bit capable since 2012 § Known as AArch64 (or AArch32 when run in a 32-bit mode) § 128-bit vector unit (aka NEON Advanced SIMD) • AArch64 execution state • Atomic memory ops • Half-precision float • Pointer authentication • A64 instruction set • Type2 hypervisor support • RAS support • Nested virtualization 3 © 2019 Arm Limited • Statistical profiling • Complex float Arm business model Business Development Arm develops technology that is licensed to Arm licenses technology to Partner semiconductor companies. Technology Arm receives an SemiCo upfront license fee Partner License Fee and a royalty on every chip that contains its technology. Per-Chip Royalty Partner OEM develops Customer chips OEM sells consumer products 4 © 2019 Arm Limited CPU Engagement Models With Arm Core License Architecture License Partner licenses complete Partner designs complete CPU microarchitecture design microarchitecture from scratch § Wide choices available • Clean room – no reference to Arm § Many different A, R & M products core designs Standard CPU Proprietary CPU CPU differentiation through: Freedom to develop any design § Flexible configuration options • Must conform to the rules & § Wide implementation envelope with Core license Architecture license programmers model of a given different process technologies architecture variant • Must pass Arm architecture Range of licensing & engagement validation to preserve software models possible compatibility Long term strategic investment 5 © 2019 Arm Limited HPC on Arm – What’s new in 2018/19 Powerful hardware for now and future • Marvell ThunderX2 now GA • Fujitsu announced details of A64FX (with SVE) for Post-K • Arm announces Neoverse brand for infrastructure and core IP roadmap (Ares, Zeus, Poseidon) with each generation delivering 30% perf boost. N1 platform details announced. Mature toolchains and ISV Software • Three mature toolchains available –Arm Commercial, GNU and Cray CE • ISVs start porting to Arm – Altair RADIOSS, ANSYS Fluent and LS-DYNA Deployments • New deployments across the EU and USA • USA - Sandia Astra (Top 500), Comanche Clusters • EU – Catalyst and Isambard in UK, GENCI and Dibona (MontBlanc 3) in France 6 © 2019 Arm Limited Arm Hardware for Infrastrucutre (including HPC) AWS Graviton by Amazon 8 © 2019 Arm Limited AWS Graviton by Amazon 9 © 2019 Arm Limited Huawei unveils KunPeng 920 CPU and TaiShan Servers Industry’s Highest Performance 2.6GHz 64-cores 7nm based ARM v8 Server SoC & Servers Big Data, Distributed Storage and Arm-Native applications TaiShan 2280 Balanced Server TaiShan 5280/5290 Storage Server TaiShan X6000 High-Density Server “Use ARM-based CPU in areas like cloud and servers where they are better.” – William XU, Chief Strategy Marketing Officer, Huawei 10 © 2019 Arm Limited The Cloud to Edge Infrastructure Foundation for a World of 1T Intelligent Devices Broad SoC system design options within Arm Ecosystem Arm IP High performance CPUs Arm Architectural design Data plane CPUs CMN Fabric Custom Arm High performance CPU Other IP Custom Fabric & IP Accelerators ML, on-die FPGA Networking, security, encryption Video, Custom Memory DDR, HBM, Flash, Storage Class memory IO PCIe, CCIX, 100G+ ethernet Foundry TSMC 7FF, Samsung 7LPP, UMC Common Software Platform and Ecosystem Arm Architecture v8.x-A © 2018 Arm Limited Arm IP : Commitment to Infrastructure segment 5nm 7nm+ Poseidon Platform 7nm Zeus Platform 16nm Ares (N1) Cosmos Platform 2021 Platform 2020 (A72, A75)~30% per Gen Faster Performance & New Features 2019 Today 13 © 2019 Arm Limited Neoverse N1 platform Accelerating the transformation to a scalable cloud to edge infrastructure Revolutionary compute performance Platform features specific to infrastructure Extreme range of scale and diversity of compute 14 Confidential © 2018 Arm Limited Neoverse N1 platform: Revolutionary compute performance 2.5x NGINX 1.7x 2.5x Java* MemcacheD Improved cloud to edge TCO through revolutionary workload performance 15 Confidential © 2018 Arm Limited Data shown for Neoverse N1 has been collected/projected from an array of platforms, and relative to Cortex A72 ”Cosmos” *Based on an industry standard Java-based benchmark Arm Hardware for HPC Arm Architecture Partner SoC for HPC Available or Announced in 2018-19 17 © 2019 Arm Limited HPC Software Ecosystem Arm HPC Ecosystem – Overview Job schedulers HPC Applications: and Resource Open-source, Owned, and Commercial ISV codes User-space Management: utilities, scripting, SLURM, IBM LSF, App/ISA specific optimizations, optimized libs and intrinsics: container, and Altair PBS Pro, Arm PL, BLAS, FFTW, etc. other packages: etc. Singularity, HPEBright, CMU, Parallelism HPC Programming Debug and Filesystems: Openstack, Tools: Management Cluster standards: Languages: performance BeeGFS, OpenHPC, OpenMP Fortran, C, C++ analysis tools: LUSTRE, ZFS, Python, NumPy, (omp / gomp), via Arm Forge, SciPy, etc. MPI, SHMEM GNU, LLVM, Arm Rogue Wave, HDFS, GPFS (see below) & OEMs TAU, etc. xCat , Communication Stacks and run-times: Warewulf Silicon Suppliers: Mellanox IB/OFED/HPC-X, OpenMPI, MPICH, MVAPICH2, OpenSHMEM, OpenUCX, HPE MPI Marvell, Fujitsu, Mellanox Linux OS Distro of choice: RHEL, SUSE, CENTOS,… OEM/ODM’s: Cray, HPE, ATOS-Bull, Fujitsu, Gigabyte, Inventec, Foxconn Arm Server Ready Platform: Standard OS compatible FW and RAS features 19 © 2019 Arm Limited Common HPC applications now available GROMACS LAMMPS CESM2 MrBayes Bowtie NAMD AMBER Paraview SIESTA UM Quantum WRF VASP MILC GEANT4 ESPRESSO OpenFOAM GAMESS VisIT DL-Poly NEMO BLAST NWCHEM Abinit BWA QMCPACK Build recipes online at https://gitlab.com/arm-hpc/packages/wikis/home 20 © 2019 ArmChem Limited /Phys Weather CFD Visualization Genomics ISVs codes on Arm Porting underway Available 21 © 2019 Arm Limited : Typical HPC packages available for Arm Functional Areas Components include OpenHPC is a community effort to provide a common, Base OS CentOS 7.5, SLES 12 SP3 verified set of open source packages for HPC Administrative Conman, Ganglia, Lmod, LosF, Nagios, pdsh, pdsh- Tools mod-slurm, prun, EasyBuild, ClusterShell, mrsh, deployments Genders, Shine, test-suite Provisioning Warewulf Arm and partners actively involved: Resource Mgmt. SLURM, Munge • Arm is a silver member of OpenHPC I/O Services Lustre client (community version) • Numerical/Scientific Boost, GSL, FFTW, Metis, PETSc, Trilinos, Hypre, Linaro is on Technical Steering Committee Libraries SuperLU, SuperLU_Dist,Mumps, OpenBLAS, • Arm-based machines in the OpenHPC build Scalapack, SLEPc, PLASMA, ptScotch I/O Libraries HDF5 (pHDF5), NetCDF (including C++ and Fortran infrastructure interfaces), Adios Compiler Families GNU (gcc, g++, gfortran), LLVM Status: 1.3.6 release out now MPI Families OpenMPI, MPICH Development Tools Autotools (autoconf, automake, libtool), Cmake, • Packages built on Armv8-A for CentOS and SUSE Valgrind,R, SciPy/NumPy, hwloc Performance Tools PAPI, IMB, pdtoolkit, TAU, Scalasca, Score-P, SIONLib 22 © 2019 Arm Limited Arm HPC Ecosystem website: www.arm.com/hpc Starting point for developers and end-users of Arm for HPC Latest events, news, blogs, and collateral including whitepapers, webinars, and presentations Links to HPC open-source & commercial SW packages Guides for porting HPC applications Quick-start guides to Arm tools Links to community collaboration sites Curated and moderated by Arm 23 © 2019 Arm Limited Arm HPC Community: community.arm.com/tools/hpc/ HPC Community-driven Content Blogs By Arm and our HPC community Calendar of upcoming events such as workshops and weBinars HPC Forum with questions & posts curated and moderated By Arm HPC technical specialists Ask, answer, share progress and expertise 24 © 2019 Arm Limited Arm HPC Packages wiki www.gitlab.com/arm-hpc/packages/wikis • Dynamic list of common HPC packages • Status and porting recipes • Community driven • Anyone can join and contribute • Provides focus for porting progress • Allows developers to share and learn 25 © 2019 Arm Limited Open source libraries for helping increase performance Arm Optimized Routines Perf-libs-tools https://github.com/ARM-software/optimized-routines https://github.com/ARM-software/perf-libs-tools These routines provide high performing Understanding an application’s needs for versions of many math.h functions BLAS, LAPACK and FFT calls • Algorithmically better performance than • Used in conjunction with Arm Performance standard library calls Libraries can generate logging info to help profile • No loss of accuracy applications for specific case breakdowns SLEEF library https://github.com/shibatch/sleef/ Vectorized math.h functions • Provided as an option for use in Arm Compiler Example visualization: DGEMM cases called 26 © 2019 Arm Limited Arm HPC deployments Deployments 28 © 2019 Arm Limited Arm Supercomputer Makes Top500 List! “Astra, the world’s fastest Arm-based supercomputer according to the