Readthedocs-Breathe Documentation Release 1.0.0
Total Page:16
File Type:pdf, Size:1020Kb
ReadTheDocs-Breathe Documentation Release 1.0.0 Thomas Edvalson Sep 12, 2019 CONTENTS 1 Going to 11: Amping Up the Programming-Language Run-Time Foundation3 2 Solid Compilation Foundation and Language Support5 2.1 Quick Start Guide............................................5 2.1.1 Current Release Notes.....................................5 2.1.2 Installation Guide........................................5 2.1.3 Programming Guide......................................6 2.1.4 ROCm GPU Tunning Guides..................................6 2.1.5 GCN ISA Manuals.......................................7 2.1.6 ROCm API References.....................................7 2.1.7 ROCm Tools..........................................8 2.1.8 ROCm Libraries........................................9 2.1.9 ROCm Compiler SDK..................................... 10 2.1.10 ROCm System Management.................................. 11 2.1.11 ROCm Virtualization & Containers.............................. 11 2.1.12 Remote Device Programming................................. 11 2.1.13 Deep Learning on ROCm.................................... 12 2.1.14 System Level Debug...................................... 12 2.1.15 Tutorial............................................. 12 2.1.16 ROCm Glossary......................................... 12 2.2 Current Release Notes.......................................... 12 2.2.1 Hotfix release ROCm 2.7.1................................... 12 2.2.1.1 Defect fixed in ROCm 2.7.1.............................. 12 2.2.1.2 Upgrading from ROCm 2.7 to 2.7.1.......................... 13 2.2.2 New features and enhancements in ROCm 2.7......................... 13 2.2.2.1 [rocFFT] Real FFT Functional............................ 13 2.2.2.2 rocRand Enhancements and Optimizations...................... 13 2.2.2.3 RAS.......................................... 13 2.2.2.4 ROCm-SMI enhancements.............................. 13 2.2.3 New features and enhancements in ROCm 2.6......................... 13 2.2.3.1 ROCmInfo enhancements............................... 13 2.2.3.2 [Thrust] Functional Support on Vega20........................ 14 2.2.3.3 MIGraphX v0.3.................................... 14 2.2.3.4 MIOpen 2.0...................................... 14 2.2.3.5 Bloat16 software support in rocBLAS/Tensile.................... 14 2.2.3.6 AMD Infinity Fabric™ Link enablement....................... 15 2.2.3.7 ROCm-smi features and bug fixes........................... 15 2.2.3.8 ROCm-smi-lib enhancements............................. 15 2.2.3.9 RCCL2 Enablement.................................. 15 2.2.3.10 rocFFT enhancements................................. 15 i 2.2.4 New features and enhancements in ROCm 2.5......................... 15 2.2.4.1 UCX 1.6 support.................................... 15 2.2.4.2 BFloat16 GEMM in rocBLAS/Tensile........................ 15 2.2.4.3 ROCm-SMI enhancements.............................. 16 2.2.4.4 [PyTorch] multi-GPU functional support (CPU aggregation/Data Parallel)..... 16 2.2.4.5 rocSparse optimization on Radeon Instinct MI50 and MI60............. 16 2.2.4.6 [Thrust] Preview.................................... 16 2.2.4.7 Support overlapping kernel execution in same HIP stream.............. 16 2.2.4.8 AMD Infinity Fabric™ Link enablement....................... 16 2.2.5 New features and enhancements in ROCm 2.4......................... 16 2.2.5.1 TensorFlow 2.0 support................................ 16 2.2.5.2 AMD Infinity Fabric™ Link enablement....................... 16 2.2.6 New features and enhancements in ROCm 2.3......................... 17 2.2.6.1 Mem usage per GPU................................. 17 2.2.6.2 MIVisionX, v1.1 - ONNX............................... 17 2.2.6.3 MIGraphX, v0.2.................................... 17 2.2.6.4 MIOpen, v1.8 - 3d convolutions and int8....................... 17 2.2.6.5 Caffe2 - mGPU support................................ 17 2.2.6.6 rocTracer library, ROCm tracing API for collecting runtimes API and asynchronous GPU activity traces.................................. 17 2.2.6.7 BLAS - Int8 GEMM performance, Int8 functional and performance......... 17 2.2.6.8 Prioritized L1/L2/L3 BLAS (functional)....................... 17 2.2.6.9 BLAS - tensile optimization.............................. 18 2.2.6.10 MIOpen Int8 support................................. 18 2.2.7 New features and enhancements in ROCm 2.2......................... 18 2.2.7.1 rocSparse Optimization on Vega20.......................... 18 2.2.7.2 DGEMM and DTRSM Optimization......................... 18 2.2.7.3 Caffe2......................................... 18 2.2.8 New features and enhancements in ROCm 2.1......................... 18 2.2.8.1 RocTracer v1.0 preview release – ‘rocprof’ HSA runtime tracing and statistics sup- port -.......................................... 18 2.2.8.2 Improvements to ROCM-SMI tool -......................... 18 2.2.8.3 DGEMM Optimizations -............................... 18 2.2.9 New features and enhancements in ROCm 2.0......................... 18 2.2.9.1 Adds support for RHEL 7.6 / CentOS 7.6 and Ubuntu 18.04.1............ 19 2.2.9.2 Adds support for Vega 7nm, Polaris 12 GPUs.................... 19 2.2.9.3 Introduces MIVisionX................................. 19 2.2.9.4 Improvements to ROCm Libraries.......................... 19 2.2.9.5 MIOpen........................................ 19 2.2.9.6 Tensorflow multi-gpu and Tensorflow FP16 support for Vega 7nm.......... 19 2.2.9.7 PyTorch/Caffe2 with Vega 7nm Support....................... 19 2.2.9.8 Improvements to ROCProfiler tool.......................... 19 2.2.9.9 Support for hipStreamCreateWithPriority....................... 19 2.2.9.10 OpenCL 2.0 support.................................. 20 2.2.9.11 Improved Virtual Addressing (48 bit VA) management for Vega 10 and later GPUs. 20 2.2.9.12 Kubernetes support.................................. 20 2.2.9.13 Removed features................................... 20 2.2.10 New features and enhancements in ROCm 1.9.2........................ 20 2.2.10.1 RDMA(MPI) support on Vega 7nm.......................... 20 2.2.10.2 Improvements to HCC................................. 20 2.2.10.3 Improvements to ROCProfiler tool.......................... 20 2.2.10.4 Critical bug fixes.................................... 20 2.2.11 New features and enhancements in ROCm 1.9.1........................ 20 2.2.11.1 Added DPM support to Vega 7nm........................... 20 ii 2.2.11.2 Fix for ‘ROCm profiling’ “Version mismatch between HSA runtime and libhsa- runtime-tools64.so.1” error.............................. 21 2.2.12 New features and enhancements in ROCm 1.9.0........................ 21 2.2.12.1 Preview for Vega 7nm................................. 21 2.2.12.2 System Management Interface............................ 21 2.2.12.3 Improvements to HIP/HCC.............................. 21 2.2.12.4 Preview for rocprof Profiling Tool........................... 21 2.2.12.5 Preview for rocr Debug Agent rocr_debug_agent................... 21 2.2.12.6 New distribution support............................... 21 2.2.12.7 ROCm 1.9 is ABI compatible with KFD in upstream Linux kernels.......... 22 2.3 ROCm Installation Guide........................................ 22 2.3.1 Are You Ready to ROCK?................................... 22 2.3.2 Current ROCm Version: 2.7.1................................. 22 2.3.3 The latest ROCm platform - ROCm 2.7............................ 22 2.3.4 Hardware Support........................................ 23 2.3.4.1 Supported GPUs.................................... 24 2.3.4.2 Supported CPUs.................................... 24 2.3.4.3 Not supported or limited support under ROCm.................... 25 2.3.4.3.1 Limited support............................... 25 2.3.4.3.2 Not supported................................ 26 2.3.5 Supported Operating Systems - New operating systems available............... 26 2.3.5.1 ROCm support in upstream Linux kernels...................... 26 2.3.6 Installing from AMD ROCm repositories........................... 27 2.3.6.1 ROCm Binary Package Structure........................... 27 2.3.6.2 Ubuntu Support - installing from a Debian repository................ 29 2.3.6.2.1 First make sure your system is up to date.................. 29 2.3.6.2.2 Add the ROCm apt repository....................... 29 2.3.6.2.3 Install.................................... 30 2.3.6.2.4 Next set your permissions.......................... 30 2.3.6.2.5 Test basic ROCm installation........................ 30 2.3.6.2.6 Performing an OpenCL-only Installation of ROCm............ 30 2.3.6.2.7 How to uninstall from Ubuntu 16.04 or Ubuntu 18.04........... 31 2.3.6.2.8 Installing development packages for cross compilation.......... 31 2.3.6.2.9 Using Debian-based ROCm with upstream kernel drivers......... 31 2.3.6.3 CentOS/RHEL 7 (7.6) Support............................ 31 2.3.6.3.1 Preparing RHEL 7 (7.6) for installation.................. 31 2.3.6.3.2 Install and setup Devtoolset-7........................ 32 2.3.6.3.3 Prepare CentOS/RHEL (7.6) for DKMS Install.............. 32 2.3.6.3.4 Installing ROCm on the system....................... 32 2.3.6.3.5 Set up permissions............................. 32 2.3.6.3.6 Test basic ROCm installation........................ 33 2.3.6.3.7 Performing an OpenCL-only Installation of ROCm............ 33 2.3.6.3.8 Compiling applications using HCC, HIP, and other ROCm software... 33 2.3.6.3.9 How to uninstall ROCm from CentOS/RHEL 7.6............. 33 2.3.6.3.10 Installing