ReadTheDocs-Breathe Documentation Release 1.0.0

Thomas Edvalson

Sep 12, 2019

CONTENTS

1 Going to 11: Amping Up the Programming-Language Run-Time Foundation

2 Solid Compilation Foundation and Language Support
    2.1 Quick Start Guide
    2.2 Current Release Notes
    2.3 ROCm Installation Guide
    2.4 Programming Guide
    2.5 ROCm GPU Tuning Guides
    2.6 GCN ISA Manuals
    2.7 ROCm API References
    2.8 ROCm Tools
    2.9 ROCm Libraries
    2.10 ROCm Compiler SDK
    2.11 ROCm System Management
    2.12 ROCm Virtualization & Containers
    2.13 Remote Device Programming
    2.14 Deep Learning on ROCm
    2.15 System Level Debug
    2.16 Tutorial
    2.17 ROCm Glossary

Index


We are excited to present ROCm, the first open-source HPC/Hyperscale-class platform for GPU computing that is also programming-language independent. We are bringing the UNIX philosophy of choice, minimalism, and modularity to GPU computing. The ROCm foundation lets you choose or even develop tools and a language runtime for your application. ROCm is built for scale; it supports multi-GPU computing in and out of server-node communication through RDMA, and it simplifies the stack when the driver directly incorporates RDMA peer-sync support. ROCm has a rich system runtime with the critical features that large-scale application, compiler, and language-runtime development requires.


CHAPTER ONE

GOING TO 11: AMPING UP THE PROGRAMMING-LANGUAGE RUN-TIME FOUNDATION

The ROCr System Runtime is language independent and makes heavy use of the Heterogeneous System Architecture (HSA) Runtime API. This approach provides a rich foundation to execute programming languages such as HCC C++ and HIP, the Khronos Group’s OpenCL, and Continuum’s Anaconda Python.

Important features include the following:

• Multi-GPU coarse-grain shared virtual memory
• Process concurrency and preemption
• Large memory allocations
• HSA signals and atomics
• User-mode queues and DMA
• Standardized loader and code-object format
• Dynamic and offline-compilation support
• Peer-to-peer multi-GPU operation with RDMA support
• Profiler trace and event-collection API


• Systems-management API and tools

CHAPTER TWO

SOLID COMPILATION FOUNDATION AND LANGUAGE SUPPORT

• LLVM compiler foundation
• HCC C++ and HIP for application portability
• GCN assembler and disassembler

The frontiers of what you can accomplish with ROCm are vast and uncharted. We look forward to working with you to improve the platform so you can use it more effectively in your own projects. Our efforts have opened the door to unique heterogeneous-computing applications that explore what this growing technology can do.

2.1 Quick Start Guide

2.1.1 Current Release Notes

Release Notes The release notes for the latest version of ROCm.

2.1.2 Installation Guide

Installing from AMD ROCm repositories This guide discusses how to install and check for correct operation of ROCm using the AMD ROCm repository.

Installing from a Debian repository This guide discusses how to install and check for correct operation of ROCm using the Debian repository on Ubuntu.

Installing from a yum repository This guide describes how to install and check for correct operation of ROCm using yum on RHEL and CentOS 7.5.

Getting ROCm source code This guide discusses how to modify the open-source code base and rebuild the components of the latest ROCm version.

Installing ROCk-Kernel only This guide discusses how to install only the ROCm kernel driver (ROCk) on the system.


FAQ on Installation This section provides answers to frequently asked questions about the installation steps and issues faced during installation.

2.1.3 Programming Guide

This guide provides a detailed discussion of the ROCm programming model and programming interface. It describes the hardware implementation and provides guidance on how to achieve maximum performance. The appendices include a list of all ROCm-enabled devices, a detailed description of all extensions to the C language, listings of supported mathematical functions, C++ features supported in host and device code, and technical specifications of various devices, and they conclude by introducing the low-level driver API.

ROCm Languages This guide provides information on the different ROCm languages. The ROCm stack offers multiple programming-language choices, which are covered in this section.

HC Programming Guide This guide provides a detailed discussion of Heterogeneous Compute (HC) programming: installation requirements, installation methods for various platforms, and how to build it from source.

HC Best Practices This section covers working with HCC in detail: building programs, built-in macros, the HCC profiler mode, and the API documentation.

HIP Programming Guide This guide provides a detailed discussion of HIP programming: installation requirements, installation methods for various platforms, and how to build it from source.

HIP Best Practices This section provides details on HIP porting, debugging, known bugs, the FAQ, and other aspects of HIP.

OpenCL Programming Guide This guide provides a detailed discussion of the OpenCL architecture, the AMD implementation, profiling, and other aspects of OpenCL.

OpenCL Best Practices This section provides information on performance and optimization for various device types, such as GCN devices.

2.1.4 ROCm GPU Tuning Guides

GFX7 Tuning Guide – In-Progress


GFX8 Tuning Guide – In-Progress

Vega Tuning Guide – In-Progress

2.1.5 GCN ISA Manuals

GCN 1.1 This section covers the ISA manual for Hawaii (Sea Islands series instruction set architecture).

GCN 2.0 This section covers the ISA manual for Fiji and Polaris (AMD Accelerated Parallel Processing technology).

Vega This section covers the “Vega” instruction set architecture, program organization, the mode register, and more.

Inline GCN ISA Assembly Guide This section covers various concepts of AMDGCN assembly: DS permute instructions, passing parameters to a kernel, and GPR counting.

2.1.6 ROCm API References

The ROCm API references are listed here for users.

ROCr System Runtime API Details of the ROCr System Runtime API are listed here.

HCC Language Runtime API Details of the HCC Language Runtime API are listed here.

HIP Language Runtime API Details of the HIP Language Runtime API are listed here.

HIP Math API The HIP math APIs are listed here with sample working classes.

Thrust API Documentation Here you can find details on the installation and use of the Thrust library, along with the Thrust API list.


Math Library APIs The HIP math APIs, along with the hcRNG, clBLAS, and clSPARSE APIs.

Deep Learning APIs The MIOpen API and MIOpenGEMM API are listed here.

2.1.7 ROCm Tools

HCC A complete description of the Heterogeneous Compute Compiler is documented here.

GCN Assembler and Disassembler This section provides details on the GCN assembler and disassembler.

GCN Assembler Tools This section documents information related to the AMDGPU ISA assembler.

ROCm-GDB Complete documentation of the ROCm-GDB tool is provided here: installation, build steps, how the debugger works, and the related API.

ROCm-Profiler This section gives details on the Radeon Compute Profiler, a performance analysis tool, and how to clone and use it.

ROCm-Tracer This section gives details on ROCm Tracer, which provides a generic profiler, independent of any specific runtime, to trace API calls and asynchronous activity. It covers the library source tree and the steps to build and run the tests.

CodeXL This section provides details on CodeXL, a comprehensive tool suite, including documentation of installation, builds, and other details related to CodeXL.

GPUperfAPI This section provides details on the GPU Performance API: how to clone it, system requirements, and the source code directory layout.

ROCm Binary Utilities – In-progress


2.1.8 ROCm Libraries

rocFFT This section provides details on rocFFT, a software library for computing fast Fourier transforms (FFT) written in HIP. In addition to AMD GPU devices, it can also be compiled with the CUDA compiler using HIP tools to run on Nvidia GPU devices.

rocBLAS This section provides details on rocBLAS, a library for BLAS on ROCm. rocBLAS is implemented in the HIP programming language and optimized for AMD's latest discrete GPUs.

hipBLAS This section provides details on hipBLAS, a BLAS marshalling library with multiple supported backends. hipBLAS exports an interface that does not require the client to change; it currently supports rocBLAS and cuBLAS as backends.

hcRNG This section provides details on hcRNG, a software library that implements uniform random number generators targeting AMD heterogeneous hardware via the HCC compiler runtime.

hipeigen This section provides details on Eigen, a C++ template library that provides linear algebra for matrices, vectors, numerical solvers, and related algorithms.

clFFT This section provides details on clFFT, a software library containing FFT functions written in OpenCL. clFFT also supports running on CPU devices to facilitate debugging and heterogeneous programming.

clBLAS This section provides details on clBLAS, which makes it easier for developers to utilize the inherent performance and power-efficiency benefits of heterogeneous computing.

clSPARSE This section provides details on clSPARSE, an OpenCL library that implements sparse linear algebra routines.

clRNG This section provides details on clRNG, a library for uniform random number generation in OpenCL.

hcFFT This section provides details on hcFFT, which hosts the HCC-based FFT library and targets GPU acceleration of FFT routines on AMD devices.

Tensile


This section provides details on Tensile, a tool for creating a benchmark-driven backend library for GEMMs and N-dimensional tensor contractions that multiplies two multi-dimensional objects together on a GPU.

rocALUTION This section provides details on rocALUTION, a sparse linear algebra library focused on exploring fine-grained parallelism and targeting modern processors and accelerators, including multi/many-core CPU and GPU platforms. It can be seen as middleware between different parallel backends and application-specific packages.

rocSPARSE This section provides details on rocSPARSE, a library of basic linear algebra subroutines for sparse matrices and vectors written in HIP for GPU devices. It is designed to be used from C and C++ code.

rocThrust This section provides details on rocThrust, a parallel algorithm library.

hipCUB This section provides details on hipCUB, a thin wrapper library on top of rocPRIM or CUB. It enables developers to port projects that use the CUB library to the HIP layer and run them on AMD hardware.

ROCm SMI Library This section provides details on the ROCm SMI library. The ROCm System Management Interface (ROCm SMI) library is part of the Radeon Open Compute ROCm software stack. It is a C library for Linux that provides a user-space interface for applications to monitor and control GPU applications.

RCCL This section provides details on the ROCm Communications Collectives Library (RCCL), a stand-alone library of standard collective communication routines for GPUs, implementing all-reduce, all-gather, reduce, broadcast, and reduce-scatter.

2.1.9 ROCm Compiler SDK

GCN Native ISA LLVM Code Generator This section provides a complete description of the LLVM code generator: introduction, code object, code conventions, source languages, and more.

ROCm Code Object Format This section describes the application binary interface (ABI) provided by AMD's implementation of the HSA runtime. It also provides details on kernels, AMD queues, and signals.

ROCm Device Library This section provides an overview of the ROCm Device Library, along with information on building and testing it.


ROCr Runtime This section covers the user-mode API interfaces and libraries necessary for host applications to launch compute kernels on available HSA ROCm kernel agents. Installation and infrastructure details related to ROCr can be found here.

2.1.10 ROCm System Management

ROCm-SMI A complete guide to using and working with the ROCm System Management Interface (rocm-smi) tool.

SYSFS Interface This section provides information on the sysfs file structure, with details on the system-related files captured in sysfs.

KFD Topology KFD topology is the sysfs file structure that describes AMD GPU-related information such as nodes, memory, caches, and IO-links.

2.1.11 ROCm Virtualization & Containers

PCIe Passthrough on KVM PCIe passthrough on KVM is described here. The KVM-based instructions assume a headless host with an input/output memory management unit (IOMMU) to pass peripheral devices, such as a GPU, to guest virtual machines. More information can be found here.

ROCm-Docker A framework for building the software layers defined in the Radeon Open Compute Platform into portable Docker images. Detailed information related to ROCm-Docker can be found here.

2.1.12 Remote Device Programming

ROCmRDMA ROCmRDMA is the solution designed to allow third-party kernel drivers to utilize DMA access to GPU memory. Complete information related to ROCmRDMA is documented here.

UCX This section gives information related to UCX: how to install it, how to run it, and much more.

MPI This section gives information related to MPI.

IPC This section gives information related to IPC.


2.1.13 Deep Learning on ROCm

This section provides details on ROCm Deep Learning concepts.

Porting from cuDNN to MIOpen The porting guide highlights the key differences between the current cuDNN and MIOpen APIs.

Deep Learning Framework support for ROCm This section provides a detailed chart of the frameworks supported by ROCm, with repository details.

Tutorials Tutorials on the different deep-learning frameworks are documented here.

2.1.14 System Level Debug

ROCm Language & System Level Debug, Flags and Environment Variables This section covers various system-level debug options and commands for issues faced while using ROCm.

2.1.15 Tutorial

This section provides details on a few HIP concepts and other topics.

2.1.16 ROCm Glossary

The ROCm glossary highlights key concepts and explains how they work.

2.2 Current Release Notes

2.2.1 Hotfix release ROCm 2.7.1

This release is a hotfix for ROCm 2.7 and addresses the defect mentioned below. The features and enhancements described for ROCm 2.7 remain relevant to ROCm release 2.7.1 as well.

2.2.1.1 Defect fixed in ROCm 2.7.1

rocprofiler --hiptrace and --hsatrace fails to load roctracer library

In ROCm 2.7.1, the defect in which rocprofiler --hiptrace and --hsatrace failed to load the roctracer library has been fixed. To generate traces, also provide the directory path using the parameter -d <$directoryPath>, for example:

/opt/rocm/bin/rocprof --hsa-trace -d $PWD/traces /opt/rocm/hip/samples/0_Intro/bit_extract/bit_extract

All traces and results will be saved under the $PWD/traces path.


2.2.1.2 Upgrading from ROCm 2.7 to 2.7.1

To upgrade, please remove 2.7 completely as specified here or here, and install 2.7.1 as per the instructions here.

Other notes

To use rocprofiler features, the following steps need to be completed before using rocprofiler:

Step-1: Install roctracer

Ubuntu 16.04 or Ubuntu 18.04:

sudo apt install roctracer-dev

CentOS/RHEL 7.6:

sudo yum install roctracer-dev

Step-2: Add /opt/rocm/roctracer/lib to LD_LIBRARY_PATH
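Putting the two steps together, a minimal shell sketch (Ubuntu shown; the library path is the one given in Step-2, and appending to an existing LD_LIBRARY_PATH is an assumption about the target environment):

# Step-1: install the roctracer development package (Ubuntu 16.04 / 18.04)
sudo apt install roctracer-dev
# Step-2: make the roctracer library visible to the dynamic loader
export LD_LIBRARY_PATH=/opt/rocm/roctracer/lib:$LD_LIBRARY_PATH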

2.2.2 New features and enhancements in ROCm 2.7

2.2.2.1 [rocFFT] Real FFT Functional

Improved real/complex 1D even-length transforms of unit stride. Performance improvements of up to 4.5x are observed. Large problem sizes should see approximately 2x.

2.2.2.2 rocRand Enhancements and Optimizations

• Added support for new data types: uchar, ushort, half.
• Improved performance on “Vega 7nm” chips, such as the MI50.
• mtgp32 uniform double performance changed due to generation-algorithm standardization. Better-quality random numbers are now generated, with a 30% decrease in performance.
• Up to 5% performance improvements for other algorithms.

2.2.2.3 RAS

Added support for RAS on Radeon Instinct MI50, including:

• Memory error detection
• Memory error detection counter

2.2.2.4 ROCm-SMI enhancements

Added ROCm-SMI CLI and LIB support for FW version, compute running processes, utilization rates, utilization counter, link error counter, and unique ID.

2.2.3 New features and enhancements in ROCm 2.6

2.2.3.1 ROCmInfo enhancements

ROCmInfo was extended to do the following: for ROCr API call errors, including initialization, determine whether the error can be explained by one of the following (a minimal rocminfo invocation is sketched after this list):


• ROCk (the driver) is not loaded / available
• The user does not have membership in the appropriate group - “video”
• If neither of the above, print the error string that is mapped to the returned error code
• If no error string is available, print the error code in hex
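A minimal check that exercises these diagnostics, assuming the standard install location under /opt/rocm/bin and the “video” group mentioned above:

/opt/rocm/bin/rocminfo      # lists the HSA agents on success, or one of the errors described above
groups | grep video         # verify membership in the "video" group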

2.2.3.2 [Thrust] Functional Support on Vega20

ROCm 2.6 contains the first official release of rocThrust and hipCUB. rocThrust is a port of Thrust, a parallel algorithm library. hipCUB is a port of CUB, a reusable software component library. Thrust/CUB has been ported to the HIP/ROCm platform to use the rocPRIM library. The HIP-ported library works on HIP/ROCm platforms.

Note: The rocThrust and hipCUB libraries replace hip-thrust, i.e. hip-thrust has been separated into two libraries, rocThrust and hipCUB. Existing hip-thrust users are encouraged to port their code to rocThrust and/or hipCUB. hip-thrust will be removed from official distribution later this year.

2.2.3.3 MIGraphX v0.3

The MIGraphX optimizer adds support for reading models frozen from the Tensorflow framework. Further details and an example usage are at https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki/Getting-started:-using-the-new-features-of-MIGraphX-0.3

2.2.3.4 MIOpen 2.0

• This release contains several new features, including an immediate mode for selecting convolutions, bfloat16 support, and new layers, modes, and algorithms.
• MIOpenDriver, a tool for benchmarking and developing kernels, is now shipped with MIOpen. bfloat16, now supported in HIP, requires an updated rocBLAS as the GEMM backend.
• The immediate mode API now provides the ability to quickly obtain a convolution kernel.
• MIOpen now contains HIP source kernels and implements the ImplicitGEMM kernels. This is a new feature and is currently disabled by default. Use the environment variable “MIOPEN_DEBUG_CONV_IMPLICIT_GEMM=1” to activate this feature (see the sketch after this list). ImplicitGEMM requires a HIP version of at least 1.5.9211.
• A new “loss” category of layers has been added, of which CTC loss is the first. See the API reference for more details.
• 2.0 is the last release of active support for gfx803 architectures. In future releases, MIOpen will not actively debug and develop new features specifically for gfx803.
• System Find-Db in-memory cache is disabled by default. Please see the build instructions to enable this feature.

Additional documentation can be found here
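A sketch of how the ImplicitGEMM switch can be enabled for a single run; the application name is hypothetical, and the variable name is the one quoted above:

MIOPEN_DEBUG_CONV_IMPLICIT_GEMM=1 ./my_miopen_app   # hypothetical MIOpen-based application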

2.2.3.5 Bfloat16 software support in rocBLAS/Tensile

Added mixed precision bfloat16/IEEE f32 to gemm_ex. The input and output matrices are bfloat16. All arithmetic is in IEEE f32.


2.2.3.6 AMD Infinity Fabric™ Link enablement

The ability to connect four Radeon Instinct MI60 or Radeon Instinct MI50 boards in two hives or two Radeon Instinct MI60 or Radeon Instinct MI50 boards in four hives via AMD Infinity Fabric™ Link GPU interconnect technology has been added.

2.2.3.7 ROCm-smi features and bug fixes

• mGPU & Vendor check
• Fix clock printout if DPM is disabled
• Fix finding marketing info on CentOS
• Clarify some error messages

2.2.3.8 ROCm-smi-lib enhancements

• Documentation updates
• Improvements to *name_get functions

2.2.3.9 RCCL2 Enablement

RCCL2 supports collective intranode communication using PCIe, Infinity Fabric™, and pinned host memory, as well as internode communication using Ethernet (TCP/IP sockets) and InfiniBand/RoCE (InfiniBand Verbs). Note: For InfiniBand/RoCE, RDMA is not currently supported.

2.2.3.10 rocFFT enhancements

• Added: Debian package with FFT test, benchmark, and sample programs
• Improved: hipFFT interfaces
• Improved: rocFFT CPU reference code, plan generation code, and logging code

Features and enhancements introduced in previous versions of ROCm can be found in version_history.md

2.2.4 New features and enhancements in ROCm 2.5

2.2.4.1 UCX 1.6 support

Support for UCX version 1.6 has been added.

2.2.4.2 BFloat16 GEMM in rocBLAS/Tensile

Software support for BFloat16 on Radeon Instinct MI50 and MI60 has been added. This includes:

• Mixed-precision GEMM with BFloat16 input and output matrices, and all arithmetic in IEEE 32-bit
• Input matrix values are converted from BFloat16 to IEEE 32-bit; all arithmetic and accumulation is IEEE 32-bit. Output values are rounded from IEEE 32-bit to BFloat16
• Accuracy should be correct to 0.5 ULP


2.2.4.3 ROCm-SMI enhancements

CLI support for querying the memory size, driver version, and firmware version has been added to ROCm-smi.

2.2.4.4 [PyTorch] multi-GPU functional support (CPU aggregation/Data Parallel)

Multi-GPU support is enabled in PyTorch using Dataparallel path for versions of PyTorch built using the 06c8aa7a3bbd91cda2fd6255ec82aad21fa1c0d5 commit or later.

2.2.4.5 rocSparse optimization on Radeon Instinct MI50 and MI60

This release includes performance optimizations for csrsv routines in the rocSparse library.

2.2.4.6 [Thrust] Preview

Preview release for early adopters. rocThrust is a port of thrust, a parallel algorithm library. Thrust has been ported to the HIP/ROCm platform to use the rocPRIM library. The HIP ported library works on HIP/ROCm platforms. Note: This library will replace thrust in a future release. The package for rocThrust (this library) currently conflicts with version 2.5 package of thrust. They should not be installed together.

2.2.4.7 Support overlapping kernel execution in same HIP stream

HIP API has been enhanced to allow independent kernels to run in parallel on the same stream.

2.2.4.8 AMD Infinity Fabric™ Link enablement

The ability to connect four Radeon Instinct MI60 or Radeon Instinct MI50 boards in one hive via AMD Infinity Fabric™ Link GPU interconnect technology has been added. Features and enhancements introduced in previous versions of ROCm can be found in version_history.md

2.2.5 New features and enhancements in ROCm 2.4

2.2.5.1 TensorFlow 2.0 support

ROCm 2.4 includes the enhanced compilation toolchain and a set of bug fixes to support TensorFlow 2.0 features natively

2.2.5.2 AMD Infinity Fabric™ Link enablement

ROCm 2.4 adds support to connect two Radeon Instinct MI60 or Radeon Instinct MI50 boards via AMD Infinity Fabric™ Link GPU interconnect technology.


2.2.6 New features and enhancements in ROCm 2.3

2.2.6.1 Mem usage per GPU

Per-GPU memory usage has been added to rocm-smi. Display information on used/total memory for VRAM, visible VRAM, and GTT via the --showmeminfo flag.
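A usage sketch, assuming the standard install path and that the memory-type arguments match the names above (VRAM, visible VRAM, GTT):

/opt/rocm/bin/rocm-smi --showmeminfo vram vis_vram gtt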

2.2.6.2 MIVisionX, v1.1 - ONNX

ONNX parser changes to adjust to new file formats

2.2.6.3 MIGraphX, v0.2

MIGraphX 0.2 supports the following new features:

• New Python API
• Support for additional ONNX operators and fixes that now enable a large set of ImageNet models
• Support for RNN operators
• Support for multi-stream execution
• [Experimental] Support for Tensorflow frozen protobuf files

See Getting-started:-using-the-new-features-of-MIGraphX-0.2 for more details.

2.2.6.4 MIOpen, v1.8 - 3d convolutions and int8

• This release contains full 3-D convolution support and int8 support for inference.
• Additionally, there are major updates in the performance database for major models, including those found in Torchvision.

See: MIOpen releases

2.2.6.5 Caffe2 - mGPU support

Multi-gpu support is enabled for Caffe2.

2.2.6.6 rocTracer library, a ROCm tracing API for collecting runtime API and asynchronous GPU activity traces

HIP/HCC domains support is introduced in rocTracer library.

2.2.6.7 BLAS - Int8 GEMM performance, Int8 functional and performance

Introduces support and performance optimizations for Int8 GEMM, implements TRSV support, and includes improvements and optimizations with Tensile.

2.2.6.8 Prioritized L1/L2/L3 BLAS (functional)

Functional implementation of BLAS L1/L2/L3 functions


2.2.6.9 BLAS - tensile optimization

Improvements and optimizations with Tensile

2.2.6.10 MIOpen Int8 support

Support for int8

2.2.7 New features and enhancements in ROCm 2.2

2.2.7.1 rocSparse Optimization on Vega20

Cache usage optimizations for csrsv (sparse triangular solve), coomv (SpMV in COO format) and ellmv (SpMV in ELL format) are available.

2.2.7.2 DGEMM and DTRSM Optimization

Improved DGEMM performance for reduced matrix sizes (k=384, k=256)

2.2.7.3 Caffe2

Added support for multi-GPU training

2.2.8 New features and enhancements in ROCm 2.1

2.2.8.1 RocTracer v1.0 preview release – ‘rocprof’ HSA runtime tracing and statistics support

Supports HSA API tracing and HSA asynchronous GPU activity, including kernel execution and memory copy

2.2.8.2 Improvements to ROCM-SMI tool -

Added support to show real-time PCIe bandwidth usage via the -b/--showbw flag
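A usage sketch of the new flag, assuming the standard install path:

/opt/rocm/bin/rocm-smi -b      # equivalently: /opt/rocm/bin/rocm-smi --showbw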

2.2.8.3 DGEMM Optimizations -

Improved DGEMM performance for large square and reduced matrix sizes (k=384, k=256)

2.2.9 New features and enhancements in ROCm 2.0

Features and enhancements introduced in previous versions of ROCm can be found in version_history.md


2.2.9.1 Adds support for RHEL 7.6 / CentOS 7.6 and Ubuntu 18.04.1

2.2.9.2 Adds support for Vega 7nm, Polaris 12 GPUs

2.2.9.3 Introduces MIVisionX

A comprehensive set of computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit.

2.2.9.4 Improvements to ROCm Libraries

• rocSPARSE & hipSPARSE
• rocBLAS with improved DGEMM efficiency on Vega 7nm

2.2.9.5 MIOpen

• This release contains general bug fixes and an updated performance database
• Group convolutions backwards weights performance has been improved
• RNNs now support fp16

2.2.9.6 Tensorflow multi-gpu and Tensorflow FP16 support for Vega 7nm

• TensorFlow v1.12 is enabled with fp16 support

2.2.9.7 PyTorch/Caffe2 with Vega 7nm Support

• fp16 support is enabled
• Several bug fixes and performance enhancements
• Known Issue: breaking changes are introduced in ROCm 2.0 which are not addressed upstream yet. Meanwhile, please continue to use the ROCm fork at https://github.com/ROCmSoftwarePlatform/pytorch

2.2.9.8 Improvements to ROCProfiler tool

• Support for Vega 7nm

2.2.9.9 Support for hipStreamCreateWithPriority

• Creates a stream with the specified priority. It creates a stream on which enqueued kernels have a different priority for execution compared to kernels enqueued on normal priority streams. The priority could be higher or lower than normal priority streams.


2.2.9.10 OpenCL 2.0 support

• ROCm 2.0 introduces full support for kernels written in the OpenCL 2.0 C language on certain devices and systems. Applications can detect this support by calling the “clGetDeviceInfo” query function with the “param_name” argument set to “CL_DEVICE_OPENCL_C_VERSION”. In order to make use of OpenCL 2.0 C language features, the application must include the option “-cl-std=CL2.0” in the options passed to the runtime API calls responsible for compiling or building device programs. The complete specification for the OpenCL 2.0 C language can be obtained using the following link: https://www.khronos.org/registry/OpenCL/specs/opencl-2.0-openclc.pdf

2.2.9.11 Improved Virtual Addressing (48 bit VA) management for Vega 10 and later GPUs

• Fixes Clang AddressSanitizer and potentially other 3rd-party memory debugging tools with ROCm
• Small performance improvement on workloads that do a lot of memory management
• Removes virtual address space limitations on systems with more VRAM than system memory

2.2.9.12 Kubernetes support

2.2.9.13 Removed features

• HCC: removed support for C++AMP

2.2.10 New features and enhancements in ROCm 1.9.2

2.2.10.1 RDMA(MPI) support on Vega 7nm

• Support ROCnRDMA based on Mellanox InfiniBand.

2.2.10.2 Improvements to HCC

• Improved link time optimization.

2.2.10.3 Improvements to ROCProfiler tool

• General bug fixes and implemented versioning APIs.

2.2.10.4 Critical bug fixes

2.2.11 New features and enhancements in ROCm 1.9.1

2.2.11.1 Added DPM support to Vega 7nm

Dynamic power management (DPM) is enabled on Vega 7nm.


2.2.11.2 Fix for ‘ROCm profiling’ “Version mismatch between HSA runtime and libhsa-runtime-tools64.so.1” error

2.2.12 New features and enhancements in ROCm 1.9.0

2.2.12.1 Preview for Vega 7nm

• Enables developer preview support for Vega 7nm

2.2.12.2 System Management Interface

• Adds support for the ROCm SMI (System Management Interface) library, which provides monitoring and management capabilities for AMD GPUs.

2.2.12.3 Improvements to HIP/HCC

• Support for gfx906
• Added deprecation warning for C++AMP. This will be the last version of HCC supporting C++AMP.
• Improved optimization for global address space pointers passing into a GPU kernel
• Fixed several race conditions in the HCC runtime
• Performance tuning to the unpinned copy engine
• Several codegen enhancement fixes in the compiler backend

2.2.12.4 Preview for rocprof Profiling Tool

Developer preview (alpha) of profiling tool ‘rpl_run.sh’, cmd-line front-end for rocProfiler, enables:
• Cmd-line tool for dumping public per kernel perf-counters/metrics and kernel timestamps
• Input file with counters list and kernels selecting parameters
• Multiple counters groups and app runs supported
• Output results in CSV format
The tool location is: /opt/rocm/bin/rpl_run.sh

2.2.12.5 Preview for rocr Debug Agent rocr_debug_agent

The ROCr Debug Agent is a library that can be loaded by the ROCm Platform Runtime to provide the following functionality:
• Print the state of wavefronts that report a memory violation or execute a “s_trap 2” instruction.
• Allows SIGINT (ctrl c) or SIGTERM (kill -15) to print the wavefront state of aborted GPU dispatches.
• It is enabled on Vega10 GPUs on ROCm 1.9.
The ROCm 1.9 release will install the ROCr Debug Agent library at /opt/rocm/lib/librocr_debug_agent64.so

2.2.12.6 New distribution support

• Binary package support for Ubuntu 18.04


2.2.12.7 ROCm 1.9 is ABI compatible with KFD in upstream Linux kernels.

Upstream Linux kernels support the following GPUs in these releases:
• 4.17: Fiji, Polaris 10, Polaris 11
• 4.18: Fiji, Polaris 10, Polaris 11, Vega10
Some ROCm features are not available in the upstream KFD:
• More system memory available to ROCm applications
• Interoperability between graphics and compute
• RDMA
• IPC
To try ROCm with an upstream kernel, install ROCm as normal, but do not install the rock-dkms package. Also add a udev rule to control /dev/kfd permissions:

echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"' | sudo tee /etc/udev/rules.d/70-kfd.rules

2.3 ROCm Installation Guide

2.3.1 Are You Ready to ROCK?

The ROCm Platform brings a rich foundation to advanced computing by seamlessly integrating the CPU and GPU with the goal of solving real-world problems. This software enables the high-performance operation of AMD GPUs for computation-oriented tasks in the Linux operating system.

2.3.2 Current ROCm Version: 2.7.1

2.3.3 The latest ROCm platform - ROCm 2.7

The latest supported version of the drivers, tools, libraries and source code for the ROCm platform have been released and are available from the following GitHub repositories:
• ROCm Core Components
– ROCk Kernel Driver
– ROCr Runtime
– ROCt Thunk Interface
• ROCm Support Software
– ROCm SMI
– ROCm cmake
– rocminfo
– ROCm Bandwidth Test
• ROCm Development Tools
– HCC compiler
– HIP
– ROCm Device Libraries
– ROCm OpenCL, which is created from the following components:
* ROCm OpenCL Runtime
* ROCm OpenCL Driver
* The ROCm OpenCL compiler, which is created from the following components:
· ROCm LLVM OCL
· ROCm LLVM HCC
· ROCm Clang
· ROCm lld OCL
· ROCm lld HCC
· ROCm Device Libraries
– ROCm Clang-OCL Kernel Compiler
– Asynchronous Task and Memory Interface (ATMI)
– ROCr Debug Agent
– ROCm Code Object Manager
– ROC Profiler
– ROC Tracer
– Radeon Compute Profiler
– Example Applications:
* HCC Examples
* HIP Examples
• ROCm Libraries
– rocBLAS
– hipBLAS
– rocFFT
– rocRAND
– rocSPARSE
– hipSPARSE
– rocALUTION
– MIOpenGEMM
– MIOpen
– rocThrust
– ROCm SMI Lib
– RCCL
– MIVisionX
– hipCUB

2.3.4 Hardware Support

ROCm is focused on using AMD GPUs to accelerate computational tasks such as machine learning, engineering workloads, and scientific computing. In order to focus our development efforts on these domains of interest, ROCm supports a targeted set of hardware configurations which are detailed further in this section.


2.3.4.1 Supported GPUs

Because the ROCm Platform has a focus on particular computational domains, we offer official support for a selection of AMD GPUs that are designed to offer good performance and price in these domains. ROCm officially supports AMD GPUs that use the following chips:
• GFX8 GPUs
– “Fiji” chips, such as on the AMD Radeon R9 Fury X and Radeon Instinct MI8
– “Polaris 10” chips, such as on the AMD Radeon RX 580 and Radeon Instinct MI6
– “Polaris 11” chips, such as on the AMD Radeon RX 570 and Radeon Pro WX 4100
– “Polaris 12” chips, such as on the AMD Radeon RX 550 and Radeon RX 540
• GFX9 GPUs
– “Vega 10” chips, such as on the AMD Radeon RX Vega 64 and Radeon Instinct MI25
– “Vega 7nm” chips, such as on the Radeon Instinct MI50, Radeon Instinct MI60 or AMD Radeon VII
ROCm is a collection of software ranging from drivers and runtimes to libraries and developer tools. Some of this software may work with more GPUs than the “officially supported” list above, though AMD does not make any official claims of support for these devices on the ROCm software platform. The following GPUs are enabled in the ROCm software, though full support is not guaranteed:
• GFX8 GPUs
– “Polaris 11” chips, such as on the AMD Radeon RX 570 and Radeon Pro WX 4100
– “Polaris 12” chips, such as on the AMD Radeon RX 550 and Radeon RX 540
• GFX7 GPUs
– “Hawaii” chips, such as the AMD Radeon R9 390X and FirePro W9100
As described in the next section, GFX8 GPUs require PCI Express 3.0 (PCIe 3.0) with support for PCIe atomics. This requires both CPU and motherboard support. GFX9 GPUs, by default, also require PCIe 3.0 with support for PCIe atomics, but they can operate in most cases without this capability.
The integrated GPUs in AMD APUs are not officially supported targets for ROCm. As described below, “Carrizo”, “Bristol Ridge”, and “Raven Ridge” APUs are enabled in our upstream drivers and the ROCm OpenCL runtime. However, they are not enabled in our HCC or HIP runtimes, and may not work due to motherboard or OEM hardware limitations. As such, they are not yet officially supported targets for ROCm. For a more detailed list of hardware support, please see the following documentation.

2.3.4.2 Supported CPUs

As described above, GFX8 GPUs require PCIe 3.0 with PCIe atomics in order to run ROCm. In particular, the CPU and every active PCIe point between the CPU and GPU require support for PCIe 3.0 and PCIe atomics. The CPU root must indicate PCIe AtomicOp Completion capabilities and any intermediate switch must indicate PCIe AtomicOp Routing capabilities. Current CPUs which support PCIe Gen3 + PCIe Atomics are:
• AMD Ryzen CPUs
• The CPUs in AMD Ryzen APUs
• AMD Ryzen Threadripper CPUs
• AMD EPYC CPUs
• Intel Xeon E7 v3 or newer CPUs
• Intel Xeon E5 v3 or newer CPUs
• Intel Xeon E3 v3 or newer CPUs
• Intel Core i7 v4, Core i5 v4, Core i3 v4 or newer CPUs (i.e. Haswell family or newer)
• Some Ivy Bridge-E systems
Beginning with ROCm 1.8, GFX9 GPUs (such as Vega 10) no longer require PCIe atomics. We have similarly opened up more options for the number of PCIe lanes. GFX9 GPUs can now be run on CPUs without PCIe atomics and on older PCIe generations, such as PCIe 2.0. This is not supported on GPUs below GFX9, e.g. GFX8 cards in the Fiji and Polaris families.
If you are using any PCIe switches in your system, please note that PCIe atomics are only supported on some switches, such as Broadcom PLX. When you install your GPUs, make sure you install them in a PCIe 3.0 x16, x8, x4, or x1 slot attached either directly to the CPU's root I/O controller or to a PCIe switch directly attached to the CPU's root I/O controller. In our experience, many issues stem from consumer motherboards that provide physical x16 connectors which are electrically connected as, e.g., PCIe 2.0 x4, PCIe slots connected via the southbridge PCIe I/O controller, or PCIe slots connected through a PCIe switch that does not support PCIe atomics.
If you attempt to run ROCm on a system without proper PCIe atomic support, you may see an error in the kernel log (dmesg):

kfd: skipped device 1002:7300, PCI rejects atomics

Experimental support for our Hawaii (GFX7) GPUs (Radeon R9 290, R9 390, FirePro W9100, S9150, S9170) does not require or take advantage of PCIe Atomics. However, we still recommend that you use a CPU from the list provided above for compatibility purposes.

2.3.4.3 Not supported or limited support under ROCm

2.3.4.3.1 Limited support

• ROCm 2.7.x should support PCIe 2.0 enabled CPUs such as the AMD Opteron, Phenom, Phenom II, Athlon, Athlon X2, Athlon II and older Intel Xeon and Intel Core Architecture and Pentium CPUs. However, we have done very limited testing on these configurations, since our test farm has been catering to the CPUs listed above. This is where we need community support. If you find problems on such setups, please report these issues.
• Thunderbolt 1, 2, and 3 enabled breakout boxes should now be able to work with ROCm. Thunderbolt 1 and 2 are PCIe 2.0 based, and thus are only supported with GPUs that do not require PCIe 3.0 atomics (e.g. Vega 10). However, we have done no testing on this configuration and would need community support due to limited access to this type of equipment.
• AMD “Carrizo” and “Bristol Ridge” APUs are enabled to run OpenCL, but do not yet support HCC, HIP, or our libraries built on top of these compilers and runtimes.
– As of ROCm 2.1, “Carrizo” and “Bristol Ridge” require the use of upstream kernel drivers.
– In addition, various “Carrizo” and “Bristol Ridge” platforms may not work due to OEM and ODM choices when it comes to key configuration parameters such as inclusion of the required CRAT tables and IOMMU configuration parameters in the system BIOS.
– Before purchasing such a system for ROCm, please verify that the BIOS provides an option for enabling IOMMUv2 and that the system BIOS properly exposes the correct CRAT table. Inquire with your vendor about the latter.


• AMD “Raven Ridge” APUs are enabled to run OpenCL, but do not yet support HCC, HIP, or our libraries built on top of these compilers and runtimes.
– As of ROCm 2.1, “Raven Ridge” requires the use of upstream kernel drivers.
– In addition, various “Raven Ridge” platforms may not work due to OEM and ODM choices when it comes to key configuration parameters such as inclusion of the required CRAT tables and IOMMU configuration parameters in the system BIOS.
– Before purchasing such a system for ROCm, please verify that the BIOS provides an option for enabling IOMMUv2 and that the system BIOS properly exposes the correct CRAT table. Inquire with your vendor about the latter.

2.3.4.3.2 Not supported

• “Tonga”, “Iceland”, “Vega M”, and “Vega 12” GPUs are not supported in ROCm 2.7.x
• We do not support GFX8-class GPUs (Fiji, Polaris, etc.) on CPUs that do not have PCIe 3.0 with PCIe atomics.
– As such, we do not support AMD Carrizo and Kaveri APUs as hosts for such GPUs.
– Thunderbolt 1 and 2 enabled GPUs are not supported by GFX8 GPUs on ROCm. Thunderbolt 1 & 2 are based on PCIe 2.0.

2.3.5 Supported Operating Systems - New operating systems available

The ROCm 2.7.x platform supports the following operating systems:
• Ubuntu 16.04.5 (Kernel 4.15) and 18.04.2 (Kernel 4.18)
• CentOS 7.6 (using devtoolset-7 runtime support)
• RHEL 7.6 (using devtoolset-7 runtime support)

2.3.5.1 ROCm support in upstream Linux kernels

As of ROCm 1.9.0, the ROCm user-level software is compatible with the AMD drivers in certain upstream Linux kernels. As such, users have the option of either using the ROCK kernel driver that is part of AMD's ROCm repositories, or using the upstream driver and only installing ROCm user-level utilities from AMD's ROCm repositories.
These releases of the upstream Linux kernel support the following GPUs in ROCm:
• 4.17: Fiji, Polaris 10, Polaris 11
• 4.18: Fiji, Polaris 10, Polaris 11, Vega10
• 4.20: Fiji, Polaris 10, Polaris 11, Vega10, Vega 7nm
The upstream driver may be useful for running ROCm software on systems that are not compatible with the kernel driver available in AMD's repositories. For users that have the option of using either AMD's or the upstream driver, there are various tradeoffs to take into consideration:


Using AMD's rock-dkms package
Pros:
• More GPU features, and they are enabled earlier
• Tested by AMD on supported distributions
• Supported GPUs enabled regardless of kernel version
• Includes the latest GPU firmware
Cons:
• May not work on all Linux distributions or versions
• Not currently supported on kernels newer than 4.18

Using the upstream kernel driver
Pros:
• Includes the latest Linux kernel features
• May work on other distributions and with custom kernels
Cons:
• Features and hardware support vary depending on kernel version
• Limits the GPU's usage of system memory to 3/8 of system memory
• IPC and RDMA capabilities not yet enabled
• Not tested by AMD to the same level as the rock-dkms package
• Does not include the most up-to-date firmware

2.3.6 Installing from AMD ROCm repositories

AMD hosts both Debian and RPM repositories for the ROCm 2.7.x packages at this time. The packages in the Debian repository have been signed to ensure package integrity.

2.3.6.1 ROCm Binary Package Structure

ROCm is a collection of software ranging from drivers and runtimes to libraries and developer tools. In AMD's package distributions, these software projects are provided as separate packages. This allows users to install only the packages they need, if they do not wish to install all of ROCm. These packages will install most of the ROCm software into /opt/rocm/ by default.
The packages for each of the major ROCm components are:
• ROCm Core Components
– ROCk Kernel Driver: rock-dkms
– ROCr Runtime: hsa-rocr-dev, hsa-ext-rocr-dev
– ROCt Thunk Interface: hsakmt-roct, hsakmt-roct-dev
• ROCm Support Software
– ROCm SMI: rocm-smi
– ROCm cmake: rocm-cmake
– rocminfo: rocminfo
– ROCm Bandwidth Test: rocm_bandwidth_test
• ROCm Development Tools
– HCC compiler: hcc
– HIP: hip_base, hip_doc, hip_hcc, hip_samples
– ROCm Device Libraries: rocm-device-libs
– ROCm OpenCL: rocm-opencl, rocm-opencl-devel (on RHEL/CentOS), rocm-opencl-dev (on Ubuntu)
– ROCm Clang-OCL Kernel Compiler: rocm-clang-ocl
– Asynchronous Task and Memory Interface (ATMI): atmi
– ROCr Debug Agent: rocr_debug_agent
– ROCm Code Object Manager: comgr
– ROC Profiler: rocprofiler-dev
– ROC Tracer: roctracer-dev
– Radeon Compute Profiler: rocm-profiler
• ROCm Libraries
– rocBLAS: rocblas
– hipBLAS: hipblas
– rocFFT: rocfft
– rocRAND: rocrand
– rocSPARSE: rocsparse
– hipSPARSE: hipsparse
– rocALUTION: rocalution
– MIOpenGEMM: miopengemm
– MIOpen: MIOpen-HIP (for the HIP version), MIOpen-OpenCL (for the OpenCL version)
– ROCm SMI Lib: rocm_smi_lib64
– RCCL: rccl
– MIVisionX: mivisionx
– rocThrust: rocThrust
– hipCUB: hipCUB
To make it easier to install ROCm, the AMD binary repositories provide a number of meta-packages that will automatically install multiple other packages. For example, rocm-dkms is the primary meta-package used to install most of the base technology needed for ROCm to operate. It will install the rock-dkms kernel driver and another meta-package (rocm-dev) which installs most of the user-land ROCm core components, support software, and development tools. The rocm-utils meta-package will install useful utilities that, while not required for ROCm to operate, may still be beneficial to have. Finally, the rocm-libs meta-package will install some (but not all) of the libraries that are part of ROCm.
The chain of software installed by these meta-packages is illustrated below:

rocm-dkms
|-- rock-dkms
\-- rocm-dev
    |-- hsa-rocr-dev
    |-- hsa-ext-rocr-dev
    |-- hsakmt-roct
    |-- hsakmt-roct-dev
    |-- rocm-cmake
    |-- rocm-device-libs
    |-- hcc
    |-- hip_base
    |-- hip_doc
    |-- hip_hcc
    |-- hip_samples
    |-- rocm-smi
    |-- hsa-amd-aqlprofile
    |-- comgr
    |-- rocr_debug_agent
    \-- rocm-utils
        |-- rocminfo
        \-- rocm-clang-ocl  # This will cause OpenCL to be installed

rocm-libs
|-- rocalution
|-- hipblas
|-- rocblas
|-- rocfft
|-- rocrand
|-- hipsparse
\-- rocsparse

These meta-packages are not required but may be useful to make it easier to install ROCm on most systems. Some users may want to skip certain packages. For instance, a user that wants to use the upstream kernel drivers (rather than those supplied by AMD) may want to skip the rocm-dkms and rock-dkms packages, and instead directly install rocm-dev. Similarly, a user that only wants to install OpenCL support instead of HCC and HIP may want to skip the rocm-dkms and rocm-dev packages. Instead, they could directly install rock-dkms, rocm-opencl, and rocm-opencl-dev and their dependencies.

2.3.6.2 Ubuntu Support - installing from a Debian repository

The following directions show how to install ROCm on supported Debian-based systems such as Ubuntu 18.04. These directions may not work as written on unsupported Debian-based distributions. For example, newer versions of Ubuntu may not be compatible with the rock-dkms kernel driver. As such, users may want to skip the rocm-dkms and rock-dkms packages, as described above, and instead use the upstream kernel driver.

2.3.6.2.1 First make sure your system is up to date

sudo apt update
sudo apt dist-upgrade
sudo apt install libnuma-dev
sudo reboot

2.3.6.2.2 Add the ROCm apt repository

For Debian-based systems like Ubuntu, configure the Debian ROCm repository as follows:

wget -qO - http://repo.radeon.com/rocm/apt/debian/rocm.gpg.key | sudo apt-key add -
echo 'deb [arch=amd64] http://repo.radeon.com/rocm/apt/debian/ xenial main' | sudo tee /etc/apt/sources.list.d/rocm.list


The GPG key might change, so it may need to be updated when installing a new release. If key signature verification fails during an update, please re-add the key from the ROCm apt repository. The current rocm.gpg.key is not available in a standard key ring distribution, but has the following sha1sum hash:

f7f8147431c75e505c58a6f3a3548510869357a6  rocm.gpg.key

2.3.6.2.3 Install

Next, update the apt repository list and install the rocm-dkms meta-package:

sudo apt update
sudo apt install rocm-dkms

2.3.6.2.4 Next set your permissions

Users will need to be in the video group in order to have access to the GPU. As such, you should ensure that your user account is a member of the video group prior to using ROCm. You can find which groups you are a member of with the following command:

groups

To add yourself to the video group you will need the sudo password and can use the following command:

sudo usermod -a -G video $LOGNAME

You may want to ensure that any future users you add to your system are put into the “video” group by default. To do that, you can run the following commands:

echo 'ADD_EXTRA_GROUPS=1' | sudo tee -a /etc/adduser.conf
echo 'EXTRA_GROUPS=video' | sudo tee -a /etc/adduser.conf

Once complete, reboot your system.

2.3.6.2.5 Test basic ROCm installation

After rebooting the system run the following commands to verify that the ROCm installation was successful. If you see your GPUs listed by both of these commands, you should be ready to go!

/opt/rocm/bin/rocminfo
/opt/rocm/opencl/bin/x86_64/clinfo

Note that, to make running ROCm programs easier, you may wish to put the ROCm binaries in your PATH.

echo 'export PATH=$PATH:/opt/rocm/bin:/opt/rocm/profiler/bin:/opt/rocm/opencl/bin/x86_64' | sudo tee -a /etc/profile.d/rocm.sh

If you have an install issue please read this FAQ.

2.3.6.2.6 Performing an OpenCL-only Installation of ROCm

Some users may want to install a subset of the full ROCm installation. In particular, if you are trying to install on a system with a limited amount of storage space, or which will only run a small collection of known applications, you may want to install only the packages that are required to run OpenCL applications. To do that, you can run the following installation command instead of the command to install rocm-dkms:

sudo apt-get install dkms rock-dkms rocm-opencl-dev

2.3.6.2.7 How to uninstall from Ubuntu 16.04 or Ubuntu 18.04

To uninstall the ROCm packages installed in the above directions, you can execute:

sudo apt autoremove rocm-dkms rocm-dev rocm-utils

2.3.6.2.8 Installing development packages for cross compilation

It is often useful to develop and test on different systems. For example, some development or build systems may not have an AMD GPU installed. In this scenario, you may prefer to avoid installing the ROCK kernel driver to your development system. In this case, install the development subset of packages:

sudo apt update
sudo apt install rocm-dev

Note: To execute ROCm-enabled applications, you will require a system with the full ROCm driver stack installed.

2.3.6.2.9 Using Debian-based ROCm with upstream kernel drivers

As described in the above section about upstream Linux kernel support, users may want to try installing ROCm user-level software without installing AMD's custom ROCK kernel driver. Users who do want to use upstream kernels can run the following commands instead of installing rocm-dkms:

sudo apt update
sudo apt install rocm-dev
echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"' | sudo tee /etc/udev/rules.d/70-kfd.rules

2.3.6.3 CentOS/RHEL 7 (7.6) Support

The following directions show how to install ROCm on supported RPM-based systems such as CentOS 7.6. These directions may not work as written on unsupported RPM-based distributions. For example, Fedora may work but may not be compatible with the rock-dkms kernel driver. As such, users may want to skip the rocm-dkms and rock-dkms packages, as described above, and instead use the upstream kernel driver. Support for CentOS/RHEL 7 was added in ROCm 1.8, but ROCm requires a special runtime environment provided by the RHEL Software Collections and additional dkms support packages to properly install and run.

2.3.6.3.1 Preparing RHEL 7 (7.6) for installation

RHEL is a subscription-based operating system, and you must enable several external repositories to enable installation of the devtoolset-7 environment and the DKMS support files. These steps are not required for CentOS.


First, the subscription for RHEL must be enabled and attached to a pool id. Please see the Obtaining an RHEL image and license page for instructions on registering your system with the RHEL subscription server and attaching to a pool id. Second, enable the following repositories:

sudo subscription-manager repos --enable rhel-server-rhscl-7-rpms
sudo subscription-manager repos --enable rhel-7-server-optional-rpms
sudo subscription-manager repos --enable rhel-7-server-extras-rpms

Third, enable additional repositories by downloading and installing the epel-release-latest-7 repository RPM:

sudo rpm -ivh https://dl.fedoraproject.org/pub/epel/epel-release-latest-7.noarch.rpm

2.3.6.3.2 Install and setup Devtoolset-7

To setup the Devtoolset-7 environment, follow the instructions on this page: https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ Note that devtoolset-7 is a Software Collections package, and it is not supported by AMD.

2.3.6.3.3 Prepare CentOS/RHEL (7.6) for DKMS Install

Installing kernel drivers on CentOS/RHEL 7.6 requires the dkms tool to be installed:

sudo yum install -y epel-release
sudo yum install -y dkms kernel-headers-`uname -r` kernel-devel-`uname -r`

2.3.6.3.4 Installing ROCm on the system

It is recommended to remove previous ROCm installations before installing the latest version to ensure a smooth installation. At this point ROCm can be installed on the target system. Create a /etc/yum.repos.d/rocm.repo file with the following contents:

[ROCm]
name=ROCm
baseurl=http://repo.radeon.com/rocm/yum/rpm
enabled=1
gpgcheck=0

The repo's URL should point to the location of the repository's repodata database. Install ROCm components using these commands:

sudo yum install rocm-dkms

The rock-dkms component should be installed and the /dev/kfd device should be available on reboot.

2.3.6.3.5 Set up permissions

Ensure that your user account is a member of the “video” or “wheel” group prior to using the ROCm driver. You can find which groups you are a member of with the following command:


groups

To add yourself to the video (or wheel) group you will need the sudo password and can use the following command:

sudo usermod -a -G video $LOGNAME

You may want to ensure that any future users you add to your system are put into the “video” group by default. To do that, you can run the following commands:

echo 'ADD_EXTRA_GROUPS=1' | sudo tee -a /etc/adduser.conf
echo 'EXTRA_GROUPS=video' | sudo tee -a /etc/adduser.conf

Current release supports CentOS/RHEL 7.6. If users want to update the OS version, they should completely remove ROCm packages before updating to the latest version of the OS, to avoid DKMS related issues. Once complete, reboot your system.

2.3.6.3.6 Test basic ROCm installation

After rebooting the system run the following commands to verify that the ROCm installation was successful. If you see your GPUs listed by both of these commands, you should be ready to go!

/opt/rocm/bin/rocminfo
/opt/rocm/opencl/bin/x86_64/clinfo

Note that, to make running ROCm programs easier, you may wish to put the ROCm binaries in your PATH.

echo 'export PATH=$PATH:/opt/rocm/bin:/opt/rocm/profiler/bin:/opt/rocm/opencl/bin/x86_64' | sudo tee -a /etc/profile.d/rocm.sh

If you have an install issue please read this FAQ.

2.3.6.3.7 Performing an OpenCL-only Installation of ROCm

Some users may want to install a subset of the full ROCm installation. In particular, if you are trying to install on a system with a limited amount of storage space, or which will only run a small collection of known applications, you may want to install only the packages that are required to run OpenCL applications. To do that, you can run the following installation command instead of the command to install rocm-dkms:

sudo yum install rock-dkms rocm-opencl-devel

2.3.6.3.8 Compiling applications using HCC, HIP, and other ROCm software

To compile applications or samples, please use gcc-7.2 provided by the devtoolset-7 environment. To do this, compile all applications after running this command:

scl enable devtoolset-7 bash

2.3.6.3.9 How to uninstall ROCm from CentOS/RHEL 7.6

To uninstall the ROCm packages installed by the above directions, you can execute:


sudo yum autoremove rocm-dkms rock-dkms

2.3.6.3.10 Installing development packages for cross compilation

It is often useful to develop and test on different systems. For example, some development or build systems may not have an AMD GPU installed. In this scenario, you may prefer to avoid installing the ROCK kernel driver to your development system. In this case, install the development subset of packages:

sudo yum install rocm-dev

Note: To execute ROCm-enabled applications, you will require a system with the full ROCm driver stack installed.

2.3.6.3.11 Using ROCm with upstream kernel drivers

As described in the above section about upstream Linux kernel support, users may want to try installing ROCm user-level software without installing AMD's custom ROCK kernel driver. Users who do want to use upstream kernels can run the following commands instead of installing rocm-dkms:

sudo yum install rocm-dev
echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"' | sudo tee /etc/udev/rules.d/70-kfd.rules

2.3.7 Known issues / workarounds

2.3.7.1 rocprofiler --hiptrace and --hsatrace fail to load the roctracer library

In ROCm 2.7, rocprofiler --hiptrace and --hsatrace fail to load the roctracer library. The workaround is to create two links under ‘/opt/rocm’:

$ ln -s /opt/rocm/roctracer/lib/libroctracer64.so /opt/rocm/rocprofiler/lib/libroctracer64.so
$ ln -s /opt/rocm/roctracer/tool/libtracer_tool.so /opt/rocm/rocprofiler/tool/libtracer_tool.so

2.3.7.2 rocFFT unit tests - memory access fault

Known failure with some power-of-2 size transforms in 1D real FFTs. This issue has been fixed in the master branch of the public rocFFT repository: https://github.com/ROCmSoftwarePlatform/rocFFT

2.3.8 Closed source components

The ROCm platform relies on a few closed source components to provide functionality such as HSA image support. These components are only available through the ROCm repositories, and they will either be deprecated or become open source components in the future. These components are made available in the following packages: • hsa-ext-rocr-dev


2.3.9 Getting ROCm source code

ROCm is built from open source software. As such, it is possible to make modifications to the various components of ROCm by downloading the source code, making modifications to it, and rebuilding the components. The source code for ROCm components can be cloned from each of the GitHub repositories using git. In order to make it easier to download the correct versions of each of these tools, this ROCm repository contains a repo manifest file, default.xml. Interested users can thus use this manifest file to download the source code for all of the ROCm software.

2.3.9.1 Installing repo

Google's repo tool allows you to manage multiple git repositories simultaneously. You can install it by executing the following example commands:

mkdir -p ~/bin/
curl https://storage.googleapis.com/git-repo-downloads/repo > ~/bin/repo
chmod a+x ~/bin/repo

Note that you can choose a different folder to install repo into if you desire. ~/bin/ is simply used as an example.

2.3.9.2 Downloading the ROCm source code

The following example shows how to use the repo binary downloaded above to download all of the ROCm source code. If you chose a directory other than ~/bin/ to install repo, you should use that directory below.

mkdir -p ~/ROCm/
cd ~/ROCm/
~/bin/repo init -u https://github.com/RadeonOpenCompute/ROCm.git -b roc-2.7.0
repo sync

This will cause repo to download all of the open source code associated with this ROCm release. You may want to ensure that you have ssh-keys configured on your machine for your GitHub ID.

2.3.9.3 Building the ROCm source code

Each ROCm component repository contains directions for building that component. As such, you should go to the repository you are interested in building to find how to build it. That said, AMD also offers a project that demonstrates how to download, build, package, and install ROCm software on various distributions. The scripts here may be useful for anyone looking to build ROCm components.

2.3.10 Deprecation Notice

2.3.10.1 HCC

AMD is deprecating HCC to put more focus on HIP development and on other languages supporting heterogeneous compute. We will no longer develop any new features in HCC and we will stop maintaining HCC after its final release, which is planned for the end of 2019. If your application was developed with the hc C++ API, we would encourage you to transition it to other languages supported by AMD, such as HIP or OpenCL. HIP and the hc language share the same compiler technology, so many hc kernel language features (including inline assembly) are also available through the HIP compilation path.


2.3.10.2 hipThrust

hip-thrust has been removed in ROCm 2.7.

2.3.11 Final notes

• OpenCL Runtime and Compiler will be submitted to the Khronos Group for conformance testing prior to its final release.

2.4 Programming Guide

2.4.1 ROCm Languages

2.4.1.1 ROCm, Lingua Franca, C++, OpenCL and Python

The open-source ROCm stack offers multiple programming-language choices. The goal is to give you a range of tools to help solve the problem at hand. Here, we describe some of the options and how to choose among them.

2.4.1.2 HCC: Heterogeneous Compute Compiler

HCC: An open source C++ compiler for heterogeneous devices. This repository hosts the HCC compiler implementation project. The goal is to implement a compiler that takes a program conforming to a parallel programming standard such as HC or C++17 Parallel STL and transforms it into the AMD GCN ISA.

2.4.1.2.1 Deprecation Notice

AMD is deprecating HCC to put more focus on HIP development and on other languages supporting heterogeneous compute. We will no longer develop any new features in HCC and we will stop maintaining HCC after its final release, which is planned for June 2019. If your application was developed with the hc C++ API, we would encourage you to transition it to other languages supported by AMD, such as HIP or OpenCL. HIP and the hc language share the same compiler technology, so many hc kernel language features (including inline assembly) are also available through the HIP compilation path. The project is based on LLVM+CLANG. For more information, please visit HCCguide.

2.4.1.3 HIP: Heterogeneous-Computing Interface for Portability

What is Heterogeneous-Computing Interface for Portability (HIP)? It's a C++ dialect designed to ease conversion of Cuda applications to portable C++ code. It provides a C-style API and a C++ kernel language. The C++ interface can use templates and classes across the host/kernel boundary. The Hipify tool automates much of the conversion work by performing a source-to-source transformation from Cuda to HIP. HIP code can run on AMD hardware (through the HCC compiler) or Nvidia hardware (through the NVCC compiler) with no performance loss compared with the original Cuda code.
Programmers familiar with other GPGPU languages will find HIP very easy to learn and use. AMD platforms implement this language using the HC dialect described above, providing similar low-level control over the machine.


2.4.1.3.1 When to Use HIP

Use HIP when converting Cuda applications to portable C++ and for new projects that require portability between AMD and Nvidia. HIP provides a C++ development language and access to the best development tools on both platforms.

2.4.1.4 OpenCL™: Open Compute Language

What is OpenCL ? It’s a framework for developing programs that can execute across a wide variety of heterogeneous platforms. AMD, Intel and Nvidia GPUs support version 1.2 of the specification, as do CPUs and other devices (including FPGAs and DSPs). OpenCL provides a C run-time API and C99-based kernel language.

2.4.1.4.1 When to Use OpenCL

Use OpenCL when you have existing code in that language and when you need portability to multiple platforms and devices. It runs on Windows, Linux and Mac OS, as well as a wide variety of hardware platforms (described above).

2.4.1.5 Anaconda Python With Numba

What is Anaconda ? It’s a modern open-source analytics platform powered by Python. Continuum Analytics, a ROCm platform partner, is the driving force behind it. Anaconda delivers high-performance capabilities including acceleration of HSA APUs, as well as ROCm-enabled discrete GPUs via Numba. It gives superpowers to the people who are changing the world.

2.4.1.5.1 Numba

Numba gives you the power to speed up your applications with high-performance functions written directly in Python. Through a few annotations, you can just-in-time compile array-oriented and math-heavy Python code to native machine instructions—offering performance similar to that of C, C++ and Fortran—without having to switch languages or Python interpreters. Numba works by generating optimized machine code using the LLVM compiler infrastructure at import time, run time or statically (through the included Pycc tool). It supports Python compilation to run on either CPU or GPU hardware and is designed to integrate with Python scientific software stacks, such as NumPy. • Anaconda® with Numba acceleration

2.4.1.5.2 When to Use Anaconda

Use Anaconda when you’re handling large-scale data-analytics, scientific and engineering problems that require you to manipulate large data arrays.

2.4.1.6 Wrap-Up

From a high-level perspective, ROCm delivers a rich set of tools that allow you to choose the best language for your application.
• HCC (Heterogeneous Compute Compiler) supports HC dialects
• HIP is a run-time library that layers on top of HCC (for AMD ROCm platforms; for Nvidia, it uses the NVCC compiler)


• The following will soon offer native compiler support for the GCN ISA:
– OpenCL 1.2+
– Anaconda (Python) with Numba
All are open-source projects, so you can employ a fully open stack from the language down to the metal. AMD is committed to providing an open ecosystem that gives developers the ability to choose; we are excited about innovating quickly using open source and about interacting closely with our developer community. More to come soon!


2.4.1.6.1 Table Comparing Syntax for Different Compute APIs

Term | CUDA | HIP | HC | C++AMP | OpenCL
Device | int deviceId | int deviceId | hc::accelerator | concurrency::accelerator | cl_device
Queue | cudaStream_t | hipStream_t | hc::accelerator_view | concurrency::accelerator_view | cl_command_queue
Event | cudaEvent_t | hipEvent_t | hc::completion_future | concurrency::completion_future | cl_event
Memory | void * | void * | void *; hc::array; hc::array_view | concurrency::array; concurrency::array_view | cl_mem
 | grid | grid | extent | extent | NDRange
 | block | block | tile | tile | work-group
 | thread | thread | thread | thread | work-item
 | warp | warp | wavefront | N/A | sub-group
Thread index | threadIdx.x | hipThreadIdx_x | t_idx.local[0] | t_idx.local[0] | get_local_id(0)
Block index | blockIdx.x | hipBlockIdx_x | t_idx.tile[0] | t_idx.tile[0] | get_group_id(0)
Block dim | blockDim.x | hipBlockDim_x | t_ext.tile_dim[0] | t_idx.tile_dim0 | get_local_size(0)
Grid dim | gridDim.x | hipGridDim_x | t_ext[0] | t_ext[0] | get_global_size(0)
Device Function | __device__ | __device__ | [[hc]] (detected automatically in many cases) | restrict(amp) | Implied in device compilation
Host Function | __host__ (default) | __host__ (default) | [[cpu]] (default) | restrict(cpu) (default) | Implied in host compilation
Host + Device Function | __host__ __device__ | __host__ __device__ | [[hc]] [[cpu]] | restrict(amp,cpu) | No equivalent
Kernel Launch | <<< >>> | hipLaunchKernel | hc::parallel_for_each | concurrency::parallel_for_each | clEnqueueNDRangeKernel
Global Memory | __global__ | __global__ | Unnecessary/Implied | Unnecessary/Implied | __global
Group Memory | __shared__ | __shared__ | tile_static | tile_static | __local
Constant | __constant__ | __constant__ | Unnecessary/Implied | Unnecessary/Implied | __constant
 | __syncthreads | __syncthreads | tile_static.barrier() | t_idx.barrier() | barrier(CLK_LOCAL_MEM_FENCE)
Atomic Builtins | atomicAdd | atomicAdd | hc::atomic_fetch_add | concurrency::atomic_fetch_add | atomic_add
Precise Math | cos(f) | cos(f) | hc::precise_math::cos(f) | concurrency::precise_math::cos(f) | cos(f)
Fast Math | __cos(f) | __cos(f) | hc::fast_math::cos(f) | concurrency::fast_math::cos(f) | native_cos(f)
Vector | float4 | float4 | hc::short_vector::float4 | concurrency::graphics::float_4 | float4

2.4.1.6.2 Notes

1. For HC and C++AMP, assume a captured _tiled_ext_ named “t_ext” and a captured _extent_ named “ext”. These languages use captured variables to pass information to the kernel rather than using special built-in functions, so the exact variable name may vary.
2. The indexing functions (starting with thread-index) show the terminology for a 1D grid. Some APIs use reverse order of xyz / 012 indexing for 3D grids.
3. HC allows tile dimensions to be specified at runtime, while C++AMP requires that tile dimensions be specified at compile-time. Thus the hc syntax for tile dims is t_ext.tile_dim[0] while C++AMP is t_ext.tile_dim0.
4. From ROCm version 2.0 onwards, C++AMP is no longer available in HCC.

2.4.2 HC Programming Guide

What is the Heterogeneous Compute (HC) API? It's a C++ dialect with extensions to launch kernels and manage accelerator memory. It closely tracks the evolution of C++ and will incorporate parallelism and concurrency features as the C++ standard does. For example, HC includes early support for the C++17 Parallel STL. At the recent ISO C++ meetings in Kona and Jacksonville, the committee was excited about enabling the language to express all forms of parallelism, including multicore CPU, SIMD and GPU. We'll be following these developments closely, and you'll see HC move quickly to include standard C++ capabilities.
The Heterogeneous Compute Compiler (HCC) provides two important benefits:
Ease of development
• A full C++ API for managing devices, queues and events
• C++ data containers that provide type safety, multidimensional-array indexing and automatic data management
• C++ kernel-launch syntax using parallel_for_each plus C++11 lambda functions
• A single-source C++ programming environment: the host and device code can be in the same source file and use the same C++ language; templates and classes work naturally across the host/device boundary
• HCC generates both host and device code from the same compiler, so it benefits from a consistent view of the source code using the same Clang-based language parser
Full control over the machine
• Access AMD scratchpad memories (“LDS”)
• Fully control data movement, prefetch and discard
• Fully control asynchronous kernel launch and completion
• Get device-side dependency resolution for kernel and data commands (without host involvement)
• Obtain HSA agents, queues and signals for low-level control of the architecture using the HSA Runtime API
• Use direct-to-ISA compilation
When to Use HC
Use HC when you're targeting the AMD ROCm platform: it delivers a single-source, easy-to-program C++ environment without compromising performance or control of the machine.
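
To make the parallel_for_each launch syntax described above concrete, here is a minimal sketch (not from the original text) that squares a vector with hc::array_view; the names and sizes are illustrative only:

#include <hc.hpp>
#include <vector>

int main()
{
    const int N = 1024;
    std::vector<float> data(N, 2.0f);

    // array_view wraps host memory and manages copies to and from the device.
    hc::array_view<float, 1> av(N, data);

    // Launch one work-item per element of the extent.
    hc::parallel_for_each(av.get_extent(), [=](hc::index<1> idx) [[hc]] {
        av[idx] = av[idx] * av[idx];
    });

    // Accessing the array_view on the host synchronizes with the kernel.
    av.synchronize();
    return 0;
}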


2.4.2.1 HC Best Practices

HC comes with two header files as of now:
• hc.hpp : Main header file for HC
• hc_math.hpp : Math functions for HC
Most HC APIs are stored under the “hc” namespace, and the class names are the same as their counterparts in the C++AMP “Concurrency” namespace. Users of C++AMP should find it easy to switch from C++AMP to HC.

C++AMP | HC
Concurrency::accelerator | hc::accelerator
Concurrency::accelerator_view | hc::accelerator_view
Concurrency::extent | hc::extent
Concurrency::index | hc::index
Concurrency::completion_future | hc::completion_future
Concurrency::array | hc::array
Concurrency::array_view | hc::array_view

2.4.2.2 HC-specific features

• relaxed rules in operations allowed in kernels
• new syntax of tiled_extent and tiled_index
• dynamic group segment memory allocation
• true asynchronous kernel launching behavior
• additional HSA-specific APIs

2.4.2.3 Differences between HC API and C++ AMP

Despite HC and C++ AMP sharing many similar program constructs (e.g. parallel_for_each, array, array_view, etc.), there are several significant differences between the two APIs.

Support for explicit asynchronous parallel_for_each
In C++ AMP, parallel_for_each appears as a synchronous function call in a program (i.e. the host waits for the kernel to complete); however, the compiler may optimize it to execute the kernel asynchronously, and the host would then synchronize with the device on the first access of the data modified by the kernel. For example, if a parallel_for_each writes to an array_view, the first access to this array_view on the host after the parallel_for_each would block until the parallel_for_each completes.
HC supports the automatic synchronization behavior as in C++ AMP. In addition, HC's parallel_for_each supports explicit asynchronous execution. It returns a completion_future (similar to C++ std::future) object that other asynchronous operations can synchronize with, which provides better flexibility for task-graph construction and enables more precise control over optimization (a small sketch appears at the end of this section).

Annotation of device functions
C++ AMP uses the restrict(amp) keyword to annotate functions that run on the device.

void foo() restrict(amp) { .. }
...
parallel_for_each(..., [=]() restrict(amp) { foo(); });

HC uses a function attribute ([[hc]] or __attribute__((hc)) ) to annotate a device function.


void foo() [[hc]] { .. }
...
parallel_for_each(..., [=]() [[hc]] { foo(); });

The [[hc]] annotation for the kernel function called by parallel_for_each is optional, as it is automatically annotated as a device function by the hcc compiler. The compiler also supports partial automatic [[hc]] annotation for functions that are called by other device functions within the same source file. Since bar is called by foo, which is a device function, the hcc compiler will automatically annotate bar as a device function:

void bar() { ... }
void foo() [[hc]] { bar(); }

Dynamic tile size
C++ AMP doesn't support dynamic tile size. The size of each tile dimension has to be a compile-time constant specified as template arguments to the tiled_extent object:

extent<2> ex(x, y);
// create a tile extent of 8x8 from the extent object;
// note that the tile dimensions have to be constant values
tiled_extent<8,8> t_ex(ex);
parallel_for_each(t_ex, [=](tiled_index<8,8> t_id) restrict(amp) { ... });

HC supports both static and dynamic tile size:

extent<2> ex(x, y);
// create a tiled extent from dynamically calculated values;
// note that the tiled_extent template takes the rank instead of the dimensions
tx = test_x ? tx_a : tx_b;
ty = test_y ? ty_a : ty_b;
tiled_extent<2> t_ex(ex, tx, ty);
parallel_for_each(t_ex, [=](tiled_index<2> t_id) [[hc]] { ... });

Support for memory pointer
C++ AMP doesn't support lambda capture of a memory pointer into a GPU kernel. HC supports capturing a memory pointer by a GPU kernel:

// allocate GPU memory through the HSA API
int* gpu_pointer;
hsa_memory_allocate(..., &gpu_pointer);
...
parallel_for_each(ext, [=](index<1> i) [[hc]] { gpu_pointer[i[0]]++; });

For HSA APUs that support system-wide shared virtual memory, a GPU kernel can directly access system memory allocated by the host:

int* cpu_memory = (int*) malloc(...);
...
parallel_for_each(ext, [=](index<1> i) [[hc]] { cpu_memory[i[0]]++; });
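
As a small sketch of the explicit asynchronous launch mentioned at the start of this section (illustrative only, reusing an hc::array_view named av as in the earlier examples), HC's parallel_for_each returns an hc::completion_future that the host can wait on explicitly:

// Launch asynchronously; the host is free to do other work until cf.wait().
hc::completion_future cf = hc::parallel_for_each(av.get_extent(),
    [=](hc::index<1> idx) [[hc]] {
        av[idx] += 1.0f;
    });

// ... other host-side work ...

cf.wait();   // explicit synchronization with the kernel's completion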

2.4.3 HIP Programming Guide

What is this repository for?
HIP allows developers to convert CUDA code to portable C++. The same source code can be compiled to run on NVIDIA or AMD GPUs. Key features include:
• HIP is very thin and has little or no performance impact over coding directly in CUDA or hcc “HC” mode.


• HIP allows coding in a single-source C++ programming language, including features such as templates, C++11 lambdas, classes, namespaces, and more.
• HIP allows developers to use the “best” development environment and tools on each target platform.
• The “hipify” tool automatically converts source from CUDA to HIP.
• Developers can specialize for the platform (CUDA or hcc) to tune for performance or handle tricky cases.
New projects can be developed directly in the portable HIP C++ language and can run on either NVIDIA or AMD platforms. Additionally, HIP provides porting tools which make it easy to port existing CUDA codes to the HIP layer, with no loss of performance as compared to the original CUDA application. HIP is not intended to be a drop-in replacement for CUDA, and developers should expect to do some manual coding and performance tuning work to complete the port.

Repository branches: The HIP repository maintains several branches. The branches that are of importance are:
• master branch: This is the stable branch. All stable releases are based on this branch.
• developer-preview branch: This is the branch where new features still under development are visible. While this may be of interest to many, it should be noted that this branch and the features under development might not be stable.

Release tagging: HIP releases are typically of two types. The tag naming convention is different for both types of releases to help differentiate them.
• release_x.yy.zzzz: These are the stable releases based on the master branch. This type of release is typically made once a month.
• preview_x.yy.zzzz: These denote pre-release code and are based on the developer-preview branch. This type of release is typically made once a week.

More Info
HIP provides a C++ syntax that is suitable for compiling most code that commonly appears in compute kernels, including classes, namespaces, operator overloading, templates and more. Additionally, it defines other language features designed specifically to target accelerators, such as the following:
• A kernel-launch syntax that uses standard C++, resembles a function call and is portable to all HIP targets
• Short-vector headers that can serve on a host or a device
• Math functions resembling those in the “math.h” header included with standard C++ compilers
• Built-in functions for accessing specific GPU hardware capabilities
This section describes the built-in variables and functions accessible from the HIP kernel. It's intended for readers who are familiar with Cuda kernel syntax and want to understand how HIP is different.
• HIP-GUIDE

2.4.3.1 HIP Best Practices

• HIP-IN
• HIP-FAQ
• Kernel_language
• HIP Runtime API (Doxygen)
• HIP-porting-guide
• hip-p


• hip_profiling
• HIP_Debugging
• HIP-terminology
• HIP-Term2
• hipify-clang
• HIP-bug
Supported CUDA APIs:
• hipporting-driver-
• CUDAAPIHIP
• CUDAAPIHIPTEXTURE
• cuComplex API
• cuBLAS
• cuRAND
• cuDNN
• cuFFT
• cuSPARSE
• Developer/CONTRIBUTING Info
• Release Notes

Simple Example
The HIP API includes functions such as hipMalloc, hipMemcpy, and hipFree. Programmers familiar with CUDA will also be able to quickly learn and start coding with the HIP API. Compute kernels are launched with the “hipLaunchKernel” macro call. Here is a simple example showing a snippet of HIP API code:

hipMalloc(&A_d, Nbytes);
hipMalloc(&C_d, Nbytes);
hipMemcpy(A_d, A_h, Nbytes, hipMemcpyHostToDevice);

const unsigned blocks = 512;
const unsigned threadsPerBlock = 256;
hipLaunchKernel(vector_square,                          /* compute kernel */
                dim3(blocks), dim3(threadsPerBlock),
                0 /* dynamic shared */, 0 /* stream */, /* launch config */
                C_d, A_d, N);                           /* arguments to the compute kernel */

hipMemcpy(C_h, C_d, Nbytes, hipMemcpyDeviceToHost);

The HIP kernel language defines builtins for determining grid and block coordinates, math functions, short vectors, atomics, and timer functions. It also specifies additional defines and keywords for function types, address spaces, and optimization controls. (See the Kernel_language for a full description.) Here's an example of defining a simple 'vector_square' kernel:

template <typename T>
__global__ void vector_square(T *C_d, const T *A_d, size_t N)
{
    size_t offset = (hipBlockIdx_x * hipBlockDim_x + hipThreadIdx_x);
    size_t stride = hipBlockDim_x * hipGridDim_x;

    for (size_t i = offset; i < N; i += stride) {
        C_d[i] = A_d[i] * A_d[i];
    }
}


The HIP Runtime API code and compute kernel definition can exist in the same source file - HIP takes care of generating host and device code appropriately.

HIP Portability and Compiler Technology
HIP C++ code can be compiled with either of the following:
• On the NVIDIA CUDA platform, HIP provides a header file which translates from the HIP runtime APIs to CUDA runtime APIs. The header file contains mostly inlined functions and thus has very low overhead - developers coding in HIP should expect the same performance as coding in native CUDA. The code is then compiled with nvcc, the standard C++ compiler provided with the CUDA SDK. Developers can use any tools supported by the CUDA SDK, including the CUDA profiler and debugger.
• On the AMD ROCm platform, HIP provides a header and runtime library built on top of the hcc compiler. The HIP runtime implements HIP streams, events, and memory APIs, and is an object library that is linked with the application. The source code for all headers and the library implementation is available on GitHub. HIP developers on ROCm can use AMD's CodeXL for debugging and profiling.
Thus HIP source code can be compiled to run on either platform. Platform-specific features can be isolated to a specific platform using conditional compilation. Thus HIP provides source portability to either platform. HIP provides the hipcc compiler driver which will call the appropriate toolchain depending on the desired platform.

Examples and Getting Started:
• A sample and blog that uses hipify to convert a simple app from CUDA to HIP:
cd samples/01_Intro/square
# follow README / blog steps to hipify the application.
• A sample and blog demonstrating platform specialization:
cd samples/01_Intro/bit_extract
make
• Guide to Porting a New Cuda Project

More Examples
The GitHub repository HIP-Examples contains a hipified version of the popular Rodinia benchmark suite. The README with the procedures and tips the team used during this porting effort is here: Rodinia Porting Guide

Tour of the HIP Directories
include:
• hip_runtime_api.h : Defines HIP runtime APIs and can be compiled with many standard Linux compilers (hcc, GCC, ICC, CLANG, etc), in either C or C++ mode.
• hip_runtime.h : Includes everything in hip_runtime_api.h PLUS hipLaunchKernel and syntax for writing device kernels and device functions. hip_runtime.h can only be compiled with hcc.
• hcc_detail/ , nvcc_detail/ : Implementation details for specific platforms. HIP applications should not include these files directly.
• hcc.h : Includes interop APIs for HIP and HCC
bin: Tools and scripts to help with hip porting
– hipify : Tool to convert CUDA code to portable CPP. Converts CUDA APIs and kernel builtins.


– hipcc : Compiler driver that can be used to replace nvcc in existing CUDA code. hipcc will call nvcc or hcc depending on platform, and include appropriate platform-specific headers and libraries.
– hipconfig : Print HIP configuration (HIP_PATH, HIP_PLATFORM, CXX config flags, etc)
– hipexamine.sh : Script to scan a directory, find all code, and report statistics on how much can be ported with HIP (and identify likely features not yet supported)
doc: Documentation - markdown and doxygen info

2.4.4 OpenCL Programming Guide

• Opencl-Programming-Guide

2.4.4.1 OpenCL Best Practices

• Optimization-Opencl

2.5 ROCm GPU Tuning Guides

2.5.1 GFX7 Tuning Guide

2.5.2 GFX8 Tuning Guide

2.5.3 Vega Tuning Guide

2.6 GCN ISA Manuals

2.6.1 GCN 1.1

ISA Manual for Hawaii pdf

2.6.2 GCN 2.0

ISA Manual for Fiji and Polaris pdf

2.6.3 Vega

• testdocbook

2.6.4 Inline GCN ISA Assembly Guide

2.6.4.1 The Art of AMDGCN Assembly: How to Bend the Machine to Your Will

The ability to write code in assembly is essential to achieving the best performance for a GPU program. In a previous blog we described how to combine several languages in a single program using ROCm and Hsaco. This article explains


how to produce Hsaco from assembly code and also takes a closer look at some new features of the GCN architecture. I'd like to thank Ilya Perminov of Luxsoft for co-authoring this blog post.

Programs written for GPUs should achieve the highest performance possible. Even carefully written ones, however, won't always employ 100% of the GPU's capabilities. Some reasons are the following:
• The program may be written in a high level language that does not expose all of the features available on the hardware.
• The compiler is unable to produce optimal ISA code, either because the compiler needs to 'play it safe' while adhering to the semantics of a language or because the compiler itself is generating un-optimized code.

Consider a program that uses one of GCN's new features (source code is available on GitHub). Recent hardware architecture updates—DPP and DS Permute instructions—enable efficient data sharing between wavefront lanes. To become more familiar with the instruction set, review the GCN ISA Reference Guide. Note: the assembler is currently experimental; some of the syntax we describe may change.

2.6.4.2 DS Permute Instructions

Two new instructions, ds_permute_b32 and ds_bpermute_b32, allow VGPR data to move between lanes on the basis of an index from another VGPR. These instructions use LDS hardware to route data between the 64 lanes, but they don’t write to LDS memory. The difference between them is what to index: the source-lane ID or the destination-lane ID. In other words, ds_permute_b32 says “put my lane data in lane i,” and ds_bpermute_b32 says “read data from lane i.” The GCN ISA Reference Guide provides a more formal description. The test kernel is simple: read the initial data and indices from memory into GPRs, do the permutation in the GPRs and write the data back to memory. An analogous OpenCL kernel would have this form:

__kernel void hello_world(__global const uint * in, __global const uint * index, __global uint * out)
{
    size_t i = get_global_id(0);
    out[i] = in[ index[i] ];
}
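The permute/bpermute naming can be confusing, so here is a small host-side C++ sketch (purely illustrative, not part of the sample) that models both instructions on a 64-lane wavefront. Note that on the real hardware the index register holds a byte offset (lane * 4) because VGPRs are dwords; the sketch uses plain lane numbers for clarity.

#include <array>
#include <cstdint>
#include <cstdio>

constexpr int kLanes = 64;
using Wave = std::array<uint32_t, kLanes>;

// ds_bpermute_b32: "read data from lane i"  ->  dst[lane] = src[index[lane]]
Wave bpermute(const Wave &src, const Wave &index) {
    Wave dst{};
    for (int lane = 0; lane < kLanes; ++lane)
        dst[lane] = src[index[lane] % kLanes];
    return dst;
}

// ds_permute_b32: "put my lane data in lane i"  ->  dst[index[lane]] = src[lane]
Wave permute(const Wave &src, const Wave &index) {
    Wave dst{};
    for (int lane = 0; lane < kLanes; ++lane)
        dst[index[lane] % kLanes] = src[lane];
    return dst;
}

int main() {
    Wave data{}, idx{};
    for (int lane = 0; lane < kLanes; ++lane) {
        data[lane] = 100 + lane;          // this lane's value
        idx[lane]  = (lane + 1) % kLanes; // rotate-by-one index pattern
    }
    Wave g = bpermute(data, idx); // gather: g[lane] == data[(lane + 1) % 64]
    Wave s = permute(data, idx);  // scatter: s[(lane + 1) % 64] == data[lane]
    printf("bpermute lane 0 reads %u; permute writes %u into lane 1\n", g[0], s[1]);
    return 0;
}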

2.6.4.3 Passing Parameters to a Kernel

Formal HSA arguments are passed to a kernel using a special read-only memory segment called kernarg. Before a wavefront starts, the base address of the kernarg segment is written to an SGPR pair. The memory layout of variables in kernarg must employ the same order as the list of kernel formal arguments, starting at offset 0, with no padding between variables—except to honor the requirements of natural alignment and any align qualifier. The example host program must create the kernarg segment and fill it with the buffer base addresses. The HSA host code might look like the following:

/*
 * This is the host-side representation of the kernel arguments that the
 * simplePermute kernel expects.
 */
struct simplePermute_args_t {
    uint32_t * in;
    uint32_t * index;
    uint32_t * out;
};

/*
 * Allocate the kernel-argument buffer from the correct region.
 */
hsa_status_t status;
simplePermute_args_t * args = NULL;
status = hsa_memory_allocate(kernarg_region, sizeof(simplePermute_args_t), (void**)(&args));
assert(HSA_STATUS_SUCCESS == status);
aql->kernarg_address = args;

/*
 * Write the args directly to the kernargs buffer;
 * the code assumes that memory is already allocated for the
 * buffers that in_ptr, index_ptr and out_ptr point to
 */
args->in = in_ptr;
args->index = index_ptr;
args->out = out_ptr;
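As a quick check of the layout rules above, the following stand-alone C++ sketch (an illustration, not part of the sample) verifies that the three 64-bit pointer arguments pack to offsets 0, 8 and 16 with no padding, which is why the kernel descriptor later sets kernarg_segment_byte_size = 24:

#include <cstddef>
#include <cstdint>

// Host-side mirror of the kernel's formal arguments (is_ptr64 = 1, so each
// pointer occupies 8 bytes and is naturally aligned to 8 bytes on a 64-bit host).
struct simplePermute_args_t {
    uint32_t *in;
    uint32_t *index;
    uint32_t *out;
};

static_assert(offsetof(simplePermute_args_t, in)    == 0,  "first arg starts at offset 0");
static_assert(offsetof(simplePermute_args_t, index) == 8,  "second arg follows with no padding");
static_assert(offsetof(simplePermute_args_t, out)   == 16, "third arg follows with no padding");
static_assert(sizeof(simplePermute_args_t) == 24, "matches kernarg_segment_byte_size");

int main() { return 0; }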

The host program should also allocate memory for the in, index and out buffers. In the GitHub repository, all the run-time-related stuff is hidden in the Dispatch and Buffer classes, so the sample code looks much cleaner:

// Create Kernarg segment
if (!AllocateKernarg(3 * sizeof(void*))) { return false; }

// Create buffers
Buffer *in, *index, *out;
in = AllocateBuffer(size);
index = AllocateBuffer(size);
out = AllocateBuffer(size);

// Fill Kernarg memory
Kernarg(in);    // Add base pointer to "in" buffer
Kernarg(index); // Append base pointer to "index" buffer
Kernarg(out);   // Append base pointer to "out" buffer

Initial Wavefront and Register State

To launch a kernel on real hardware, the runtime needs information about the kernel, such as:
• The LDS size
• The number of GPRs
• Which registers need initialization before the kernel starts

All this data resides in the amd_kernel_code_t structure. A full description of the structure is available in the AMDGPU-ABI specification. This is what it looks like in source code:

.hsa_code_object_version 2,0
.hsa_code_object_isa 8,0,3,"AMD","AMDGPU"

.text
.p2align 8
.amdgpu_hsa_kernel hello_world

hello_world:
  .amd_kernel_code_t
     enable_sgpr_kernarg_segment_ptr = 1
     is_ptr64 = 1
     compute_pgm_rsrc1_vgprs = 1
     compute_pgm_rsrc1_sgprs = 0
     compute_pgm_rsrc2_user_sgpr = 2
     kernarg_segment_byte_size = 24
     wavefront_sgpr_count = 8
     workitem_vgpr_count = 5
  .end_amd_kernel_code_t

  s_load_dwordx2   s[4:5], s[0:1], 0x10
  s_load_dwordx4   s[0:3], s[0:1], 0x00
  v_lshlrev_b32    v0, 2, v0
  s_waitcnt        lgkmcnt(0)
  v_add_u32        v1, vcc, s2, v0
  v_mov_b32        v2, s3
  v_addc_u32       v2, vcc, v2, 0, vcc
  v_add_u32        v3, vcc, s0, v0
  v_mov_b32        v4, s1
  v_addc_u32       v4, vcc, v4, 0, vcc
  flat_load_dword  v1, v[1:2]
  flat_load_dword  v2, v[3:4]
  s_waitcnt        vmcnt(0) & lgkmcnt(0)
  v_lshlrev_b32    v1, 2, v1
  ds_bpermute_b32  v1, v1, v2
  v_add_u32        v3, vcc, s4, v0
  v_mov_b32        v2, s5
  v_addc_u32       v4, vcc, v2, 0, vcc
  s_waitcnt        lgkmcnt(0)
  flat_store_dword v[3:4], v1
  s_endpgm

Currently, a programmer must manually set all non-default values to provide the necessary information. Hopefully, this situation will change with new updates that bring automatic register counting and possibly a new syntax to fill that structure. Before the start of every wavefront execution, the GPU sets up the register state on the basis of the enable_sgpr_* and enable_vgpr_* flags. VGPR v0 is always initialized with a work-item ID in the x dimension. Registers v1 and v2 can be initialized with work-item IDs in the y and z dimensions, respectively. Scalar GPRs can be initialized with a work-group ID and work-group count in each dimension, a dispatch ID, and pointers to kernarg, the aql packet, the aql queue, and so on. Again, the AMDGPU-ABI specification contains a full list in the section on initial register state. For this example, a 64-bit base kernarg address will be stored in the s[0:1] registers (enable_sgpr_kernarg_segment_ptr = 1), and the work-item thread ID will occupy v0 (by default). Below is the scheme showing the initial state for our kernel:

initial_state

2.6.4.4 The GPR Counting

The next amd_kernel_code_t fields are obvious: is_ptr64 = 1 says we are in 64-bit mode, and kernarg_segment_byte_size = 24 describes the kernarg segment size. The GPR counting is less straightforward, however. The workitem_vgpr_count holds the number of vector registers that each work item uses, and wavefront_sgpr_count holds the number of scalar registers that a wavefront uses. The code above employs v0–v4, so workitem_vgpr_count = 5. But wavefront_sgpr_count = 8 even though the code only shows s0–s5, since the special registers VCC, FLAT_SCRATCH and XNACK are physically stored as part of the wavefront's SGPRs in the highest-numbered SGPRs. In this example, FLAT_SCRATCH and XNACK are disabled, so VCC has only two additional registers. In current GCN3 hardware, VGPRs are allocated in groups of 4 registers and SGPRs in groups of 16. Previous generations (GCN1 and GCN2) have a VGPR granularity of 4 registers and an SGPR granularity of 8 registers. The fields compute_pgm_rsrc1_*gprs contain a device-specific number for each register-block type to allocate for a wavefront. As we said previously, future updates may enable automatic counting, but for now you can use the following formulas for all three GCN GPU generations:

compute_pgm_rsrc1_vgprs = (workitem_vgpr_count - 1) / 4
compute_pgm_rsrc1_sgprs = (wavefront_sgpr_count - 1) / 8
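Plugging the example's register usage into these formulas reproduces the values written into the .amd_kernel_code_t block above. A tiny sketch of the arithmetic:

#include <cstdio>

int main() {
    const unsigned workitem_vgpr_count  = 5; // v0..v4
    const unsigned wavefront_sgpr_count = 8; // s0..s5 plus 2 SGPRs backing VCC

    // Fields of amd_kernel_code_t (register-block counts, zero-based):
    const unsigned compute_pgm_rsrc1_vgprs = (workitem_vgpr_count  - 1) / 4; // = 1
    const unsigned compute_pgm_rsrc1_sgprs = (wavefront_sgpr_count - 1) / 8; // = 0

    printf("compute_pgm_rsrc1_vgprs = %u, compute_pgm_rsrc1_sgprs = %u\n",
           compute_pgm_rsrc1_vgprs, compute_pgm_rsrc1_sgprs);
    return 0;
}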

Now consider the corresponding assembly:

// initial state:
//   s[0:1] - kernarg base address
//   v0     - workitem id

s_load_dwordx2   s[4:5], s[0:1], 0x10   // load out_ptr into s[4:5] from kernarg
s_load_dwordx4   s[0:3], s[0:1], 0x00   // load in_ptr into s[0:1] and index_ptr into s[2:3] from kernarg
v_lshlrev_b32    v0, 2, v0              // v0 *= 4;
s_waitcnt        lgkmcnt(0)             // wait for memory reads to finish

// compute address of corresponding element of index buffer
// i.e. v[1:2] = &index[workitem_id]
v_add_u32        v1, vcc, s2, v0
v_mov_b32        v2, s3
v_addc_u32       v2, vcc, v2, 0, vcc

// compute address of corresponding element of in buffer
// i.e. v[3:4] = &in[workitem_id]
v_add_u32        v3, vcc, s0, v0
v_mov_b32        v4, s1
v_addc_u32       v4, vcc, v4, 0, vcc

flat_load_dword  v1, v[1:2]             // load index[workitem_id] into v1
flat_load_dword  v2, v[3:4]             // load in[workitem_id] into v2
s_waitcnt        vmcnt(0) & lgkmcnt(0)  // wait for memory reads to finish

// v1 *= 4; ds_bpermute_b32 uses a byte offset and registers are dwords
v_lshlrev_b32    v1, 2, v1

// perform permutation
//   temp[thread_id] = v2
//   v1 = temp[v1]
// effectively we get v1 = in[index[thread_id]]
ds_bpermute_b32  v1, v1, v2

// compute address of corresponding element of out buffer
// i.e. v[3:4] = &out[workitem_id]
v_add_u32        v3, vcc, s4, v0
v_mov_b32        v2, s5
v_addc_u32       v4, vcc, v2, 0, vcc

s_waitcnt        lgkmcnt(0)             // wait for permutation to finish

// store final value in out buffer, i.e. out[workitem_id] = v1
flat_store_dword v[3:4], v1
s_endpgm


2.6.4.5 Compiling GCN ASM Kernel Into Hsaco

The next step is to produce a Hsaco from the ASM source. LLVM has added support for the AMDGCN assembler, so you can use Clang to do all the necessary magic:

clang -x assembler -target amdgcn--amdhsa -mcpu=fiji -c -o test.o asm_source.s
clang -target amdgcn--amdhsa test.o -o test.co

The first command assembles an object file from the assembly source, and the second one links everything (you could have multiple source files) into a Hsaco. Now, you can load and run kernels from that Hsaco in a program. The GitHub examples use CMake to automatically compile ASM sources. In a future post we will cover DPP, another GCN cross-lane feature that allows vector instructions to grab operands from a neighboring lane.

2.7 ROCm API References

2.7.1 ROCr System Runtime API

• ROCr-API

2.7.2 HCC Language Runtime API

• HCC-API

2.7.3 HIP Language Runtime API

• HIP-API

2.7.4 HIP Math API

• HIP-MATH

2.7.5 Math Library APIs

• hcRNG • clBLAS • clSPARSE_api

2.7.6 Deep Learning APIs

• MIOpen API • MIOpenGEMM API


2.8 ROCm Tools

2.8.1 HCC: Heterogeneous Compute Compiler

HCC: An open source C++ compiler for heterogeneous devices. This repository hosts the HCC compiler implementation project. The goal is to implement a compiler that takes a program conforming to a parallel programming standard such as HC or C++17 Parallel STL and transforms it into AMD GCN ISA.
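As a brief illustration of the kind of code HCC compiles down to GCN ISA, here is a SAXPY written against the hc C++ API (a minimal sketch; it assumes the hc.hpp header and the [[hc]] lambda attribute described in the HCC guide):

#include <hc.hpp>
#include <vector>
#include <cstdio>

int main() {
    const int N = 1024;
    std::vector<float> x(N, 1.0f), y(N, 2.0f);
    const float a = 3.0f;

    // array_views wrap host memory and manage transfers to the accelerator.
    hc::array_view<float, 1> av_x(N, x);
    hc::array_view<float, 1> av_y(N, y);

    // The [[hc]] attribute marks the lambda for device compilation.
    hc::parallel_for_each(av_y.get_extent(), [=](hc::index<1> i) [[hc]] {
        av_y[i] = a * av_x[i] + av_y[i];
    });

    av_y.synchronize();          // copy results back into the host vector
    printf("y[0] = %f\n", y[0]); // expect 5.0
    return 0;
}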

2.8.1.1 Deprecation Notice

AMD is deprecating HCC to put more focus on HIP development and on other languages supporting heterogeneous compute. We will no longer develop any new features in HCC and we will stop maintaining HCC after its final release, which is planned for June 2019. If your application was developed with the hc C++ API, we encourage you to transition it to other languages supported by AMD, such as HIP or OpenCL. HIP and the hc language share the same compiler technology, so many hc kernel language features (including inline assembly) are also available through the HIP compilation path. The project is based on LLVM+Clang. For more information, please visit the HCCguide.

2.8.2 GCN Assembler and Disassembler

2.8.2.1 The Art of AMDGCN Assembly: How to Bend the Machine to Your Will

The ability to write code in assembly is essential to achieving the best performance for a GPU program. In a previous blog we described how to combine several languages in a single program using ROCm and Hsaco. This article explains how to produce Hsaco from assembly code and also takes a closer look at some new features of the GCN architecture. I'd like to thank Ilya Perminov of Luxsoft for co-authoring this blog post.

Programs written for GPUs should achieve the highest performance possible. Even carefully written ones, however, won't always employ 100% of the GPU's capabilities. Some reasons are the following:
• The program may be written in a high level language that does not expose all of the features available on the hardware.
• The compiler is unable to produce optimal ISA code, either because the compiler needs to 'play it safe' while adhering to the semantics of a language or because the compiler itself is generating un-optimized code.

Consider a program that uses one of GCN's new features (source code is available on GitHub). Recent hardware architecture updates—DPP and DS Permute instructions—enable efficient data sharing between wavefront lanes. To become more familiar with the instruction set, review the GCN ISA Reference Guide. Note: the assembler is currently experimental; some of the syntax we describe may change.

2.8.2.2 DS Permute Instructions

Two new instructions, ds_permute_b32 and ds_bpermute_b32, allow VGPR data to move between lanes on the basis of an index from another VGPR. These instructions use LDS hardware to route data between the 64 lanes, but they don’t write to LDS memory. The difference between them is what to index: the source-lane ID or the destination-lane ID. In other words, ds_permute_b32 says “put my lane data in lane i,” and ds_bpermute_b32 says “read data from lane i.” The GCN ISA Reference Guide provides a more formal description. The test kernel is simple: read the initial data and indices from memory into GPRs, do the permutation in the GPRs and write the data back to memory. An analogous OpenCL kernel would have this form:


__kernel void hello_world(__global const uint * in, __global const uint * index, __global uint * out)
{
    size_t i = get_global_id(0);
    out[i] = in[ index[i] ];
}

2.8.2.3 Passing Parameters to a Kernel

Formal HSA arguments are passed to a kernel using a special read-only memory segment called kernarg. Before a wavefront starts, the base address of the kernarg segment is written to an SGPR pair. The memory layout of variables in kernarg must employ the same order as the list of kernel formal arguments, starting at offset 0, with no padding between variables—except to honor the requirements of natural alignment and any align qualifier. The example host program must create the kernarg segment and fill it with the buffer base addresses. The HSA host code might look like the following:

/*
 * This is the host-side representation of the kernel arguments that the
 * simplePermute kernel expects.
 */
struct simplePermute_args_t {
    uint32_t * in;
    uint32_t * index;
    uint32_t * out;
};

/*
 * Allocate the kernel-argument buffer from the correct region.
 */
hsa_status_t status;
simplePermute_args_t * args = NULL;
status = hsa_memory_allocate(kernarg_region, sizeof(simplePermute_args_t), (void**)(&args));
assert(HSA_STATUS_SUCCESS == status);
aql->kernarg_address = args;

/*
 * Write the args directly to the kernargs buffer;
 * the code assumes that memory is already allocated for the
 * buffers that in_ptr, index_ptr and out_ptr point to
 */
args->in = in_ptr;
args->index = index_ptr;
args->out = out_ptr;

The host program should also allocate memory for the in, index and out buffers. In the GitHub repository, all the run-time-related stuff is hidden in the Dispatch and Buffer classes, so the sample code looks much cleaner:

// Create Kernarg segment
if (!AllocateKernarg(3 * sizeof(void*))) { return false; }

// Create buffers
Buffer *in, *index, *out;
in = AllocateBuffer(size);
index = AllocateBuffer(size);
out = AllocateBuffer(size);

// Fill Kernarg memory
Kernarg(in);    // Add base pointer to "in" buffer
Kernarg(index); // Append base pointer to "index" buffer
Kernarg(out);   // Append base pointer to "out" buffer

Initial Wavefront and Register State

To launch a kernel on real hardware, the runtime needs information about the kernel, such as:
• The LDS size
• The number of GPRs
• Which registers need initialization before the kernel starts

All this data resides in the amd_kernel_code_t structure. A full description of the structure is available in the AMDGPU-ABI specification. This is what it looks like in source code:

.hsa_code_object_version 2,0
.hsa_code_object_isa 8,0,3,"AMD","AMDGPU"

.text
.p2align 8
.amdgpu_hsa_kernel hello_world

hello_world:
  .amd_kernel_code_t
     enable_sgpr_kernarg_segment_ptr = 1
     is_ptr64 = 1
     compute_pgm_rsrc1_vgprs = 1
     compute_pgm_rsrc1_sgprs = 0
     compute_pgm_rsrc2_user_sgpr = 2
     kernarg_segment_byte_size = 24
     wavefront_sgpr_count = 8
     workitem_vgpr_count = 5
  .end_amd_kernel_code_t

  s_load_dwordx2   s[4:5], s[0:1], 0x10
  s_load_dwordx4   s[0:3], s[0:1], 0x00
  v_lshlrev_b32    v0, 2, v0
  s_waitcnt        lgkmcnt(0)
  v_add_u32        v1, vcc, s2, v0
  v_mov_b32        v2, s3
  v_addc_u32       v2, vcc, v2, 0, vcc
  v_add_u32        v3, vcc, s0, v0
  v_mov_b32        v4, s1
  v_addc_u32       v4, vcc, v4, 0, vcc
  flat_load_dword  v1, v[1:2]
  flat_load_dword  v2, v[3:4]
  s_waitcnt        vmcnt(0) & lgkmcnt(0)
  v_lshlrev_b32    v1, 2, v1
  ds_bpermute_b32  v1, v1, v2
  v_add_u32        v3, vcc, s4, v0
  v_mov_b32        v2, s5
  v_addc_u32       v4, vcc, v2, 0, vcc
  s_waitcnt        lgkmcnt(0)
  flat_store_dword v[3:4], v1
  s_endpgm

Currently, a programmer must manually set all non-default values to provide the necessary information. Hopefully, this situation will change with new updates that bring automatic register counting and possibly a new syntax to fill that structure. Before the start of every wavefront execution, the GPU sets up the register state on the basis of the enable_sgpr_* and enable_vgpr_* flags. VGPR v0 is always initialized with a work-item ID in the x dimension. Registers v1 and v2 can be initialized with work-item IDs in the y and z dimensions, respectively. Scalar GPRs can be initialized with a work-group ID and work-group count in each dimension, a dispatch ID, and pointers to kernarg, the aql packet, the aql queue, and so on. Again, the AMDGPU-ABI specification contains a full list in the section on initial register state. For this example, a 64-bit base kernarg address will be stored in the s[0:1] registers (enable_sgpr_kernarg_segment_ptr = 1), and the work-item thread ID will occupy v0 (by default). Below is the scheme showing the initial state for our kernel:

initial_state

2.8.2.4 The GPR Counting

The next amd_kernel_code_t fields are obvious: is_ptr64 = 1 says we are in 64-bit mode, and kernarg_segment_byte_size = 24 describes the kernarg segment size. The GPR counting is less straightforward, however. The workitem_vgpr_count holds the number of vector registers that each work item uses, and wavefront_sgpr_count holds the number of scalar registers that a wavefront uses. The code above employs v0–v4, so workitem_vgpr_count = 5. But wavefront_sgpr_count = 8 even though the code only shows s0–s5, since the special registers VCC, FLAT_SCRATCH and XNACK are physically stored as part of the wavefront's SGPRs in the highest-numbered SGPRs. In this example, FLAT_SCRATCH and XNACK are disabled, so VCC has only two additional registers. In current GCN3 hardware, VGPRs are allocated in groups of 4 registers and SGPRs in groups of 16. Previous generations (GCN1 and GCN2) have a VGPR granularity of 4 registers and an SGPR granularity of 8 registers. The fields compute_pgm_rsrc1_*gprs contain a device-specific number for each register-block type to allocate for a wavefront. As we said previously, future updates may enable automatic counting, but for now you can use the following formulas for all three GCN GPU generations:

compute_pgm_rsrc1_vgprs = (workitem_vgpr_count - 1) / 4
compute_pgm_rsrc1_sgprs = (wavefront_sgpr_count - 1) / 8

Now consider the corresponding assembly:

// initial state:
//   s[0:1] - kernarg base address
//   v0     - workitem id

s_load_dwordx2   s[4:5], s[0:1], 0x10   // load out_ptr into s[4:5] from kernarg
s_load_dwordx4   s[0:3], s[0:1], 0x00   // load in_ptr into s[0:1] and index_ptr into s[2:3] from kernarg
v_lshlrev_b32    v0, 2, v0              // v0 *= 4;
s_waitcnt        lgkmcnt(0)             // wait for memory reads to finish

// compute address of corresponding element of index buffer
// i.e. v[1:2] = &index[workitem_id]
v_add_u32        v1, vcc, s2, v0
v_mov_b32        v2, s3
v_addc_u32       v2, vcc, v2, 0, vcc

// compute address of corresponding element of in buffer
// i.e. v[3:4] = &in[workitem_id]
v_add_u32        v3, vcc, s0, v0
v_mov_b32        v4, s1
v_addc_u32       v4, vcc, v4, 0, vcc

flat_load_dword  v1, v[1:2]             // load index[workitem_id] into v1
flat_load_dword  v2, v[3:4]             // load in[workitem_id] into v2
s_waitcnt        vmcnt(0) & lgkmcnt(0)  // wait for memory reads to finish

// v1 *= 4; ds_bpermute_b32 uses a byte offset and registers are dwords
v_lshlrev_b32    v1, 2, v1

// perform permutation
//   temp[thread_id] = v2
//   v1 = temp[v1]
// effectively we get v1 = in[index[thread_id]]
ds_bpermute_b32  v1, v1, v2

// compute address of corresponding element of out buffer
// i.e. v[3:4] = &out[workitem_id]
v_add_u32        v3, vcc, s4, v0
v_mov_b32        v2, s5
v_addc_u32       v4, vcc, v2, 0, vcc

s_waitcnt        lgkmcnt(0)             // wait for permutation to finish

// store final value in out buffer, i.e. out[workitem_id] = v1
flat_store_dword v[3:4], v1
s_endpgm

2.8.2.5 Compiling GCN ASM Kernel Into Hsaco

The next step is to produce a Hsaco from the ASM source. LLVM has added support for the AMDGCN assembler, so you can use Clang to do all the necessary magic:

clang -x assembler -target amdgcn--amdhsa -mcpu=fiji -c -o test.o asm_source.s

clang -target amdgcn--amdhsa test.o -o test.co

The first command assembles an object file from the assembly source, and the second one links everything (you could have multiple source files) into a Hsaco. Now, you can load and run kernels from that Hsaco in a program. The GitHub examples use CMake to automatically compile ASM sources. In a future post we will cover DPP, another GCN cross-lane feature that allows vector instructions to grab operands from a neighboring lane.

2.8.3 GCN Assembler Tools

2.8.3.1 Overview

This repository contains the following useful items related to the AMDGPU ISA assembler:
• amdphdrs: utility to convert ELF produced by llvm-mc into AMD Code Object (v1)
• examples/asm-kernel: example of AMDGPU kernel code
• examples/gfx8/ds_bpermute: transfer data between lanes in a wavefront with ds_bpermute_b32
• examples/gfx8/dpp_reduce: calculate prefix sum in a wavefront with DPP instructions
• examples/gfx8/s_memrealtime: use the s_memrealtime instruction to create a delay
• examples/gfx8/s_memrealtime_inline: inline assembly in an OpenCL kernel version of s_memrealtime
• examples/api/assemble: use the LLVM API to assemble a kernel
• examples/api/disassemble: use the LLVM API to disassemble a stream of instructions
• bin/sp3_to_mc.pl: script to convert some AMD sp3 legacy assembler syntax into LLVM MC
• examples/sp3: examples of sp3 convertible code

At the time of this writing (February 2016), an LLVM trunk build and the latest ROCR runtime are needed. LLVM trunk (May or later) now uses lld as the linker and produces AMD Code Object (v2).

2.8.3.2 Building

A top-level CMakeLists.txt is provided to build everything included. The following CMake variables should be set:
• HSA_DIR (default /opt/hsa/bin): path to the ROCR Runtime
• LLVM_DIR: path to the LLVM build directory

To build everything, create a build directory and run cmake and make:

mkdir build
cd build
cmake -DLLVM_DIR=/srv/git/llvm.git/build ..
make

Examples that require clang will only be built if clang is built as part of llvm.

2.8.3.3 Use cases

Assembling to code object with llvm-mc from command line

The following llvm-mc command line produces ELF object asm.o from assembly source asm.s:

llvm-mc -arch=amdgcn -mcpu=fiji -filetype=obj -o asm.o asm.s

Assembling to raw instruction stream with llvm-mc from command line

It is possible to extract the contents of the .text section after assembling to a code object:

llvm-mc -arch=amdgcn -mcpu=fiji -filetype=obj -o asm.o asm.s
objdump -h asm.o | grep .text | awk '{print "dd if='asm.o' of='asm' bs=1 count=$[0x" $3 "] skip=$[0x" $6 "]"}' | bash

Disassembling code object from command line

The following command line may be used to dump the contents of a code object:

llvm-objdump -disassemble -mcpu=fiji asm.o

This includes a text disassembly of the .text section.

Disassembling raw instruction stream from command line

The following command line may be used to disassemble a raw instruction stream (without ELF structure):

hexdump -v -e '/1 "0x%02X "' asm | llvm-mc -arch=amdgcn -mcpu=fiji -disassemble

Here, hexdump is used to display the contents of the file in hexadecimal (0x.. form), which is then consumed by llvm-mc.


2.8.3.4 Assembling source into code object using LLVM API

Refer to examples/api/assemble.

2.8.3.5 Disassembling instruction stream using LLVM API

Refer to examples/api/disassemble.

Using amdphdrs

Note that normally the standard lld and Code Object version 2 should be used, which is closer to the standard ELF format. amdphdrs (now obsolete) is a complementary utility that can be used to produce AMDGPU Code Object version 1. For example, given assembly source in asm.s, the following will assemble it and link using amdphdrs:

llvm-mc -arch=amdgcn -mcpu=fiji -filetype=obj -o asm.o asm.s
amdphdrs asm.o asm.co

2.8.3.6 Differences between LLVM AMDGPU Assembler and AMD SP3 assembler

Macro support

SP3 supports a proprietary set of macros/tools. The sp3_to_mc.pl script attempts to translate them into GAS syntax understood by llvm-mc.

flat_atomic_cmpswap instruction has a 32-bit destination

LLVM AMDGPU:

flat_atomic_cmpswap v7, v[9:10], v[7:8]

SP3:

flat_atomic_cmpswap v[7:8], v[9:10], v[7:8]

Atomic instructions that return a value should have the glc flag explicitly

LLVM AMDGPU:

flat_atomic_swap_x2 v[0:1], v[0:1], v[2:3] glc

SP3:

flat_atomic_swap_x2 v[0:1], v[0:1], v[2:3]

2.8.3.7 References

• LLVM Use Guide for AMDGPU Back-End • AMD ISA Documents – AMD GCN3 Instruction Set Architecture (2016) – AMD_Southern_Islands_Instruction_Set_Architecture


2.8.4 ROC Profiler

ROC profiler library. Profiling with perf-counters and derived metrics. The library supports GFX8/GFX9.

HW-specific low-level performance analysis interface for profiling of GPU compute applications. The profiling includes HW performance counters with complex performance metrics and HW traces.

Metrics

The link to the profiler default metrics XML specification.

Profiling tool 'rocprof':
• Cmd-line tool for dumping public per-kernel perf-counters/metrics and kernel timestamps
• Input file with counters list and kernels selecting parameters
• Multiple counters groups and app runs supported
• Kernel execution is serialized
• HSA API/activity stats and tracing
• Output results in CSV and JSON chrome tracing formats

Download

To clone ROC Profiler from GitHub use the following command:

git clone https://github.com/ROCm-Developer-Tools/rocprofiler

The library source tree:
• bin
  – rocprof - Profiling tool run script
• doc - Documentation
• inc/rocprofiler.h - Library public API
• src - Library sources
  – core - Library API sources
  – util - Library utils sources
  – xml - XML parser
• test - Library test suite
  – tool - Profiling tool
    * tool.cpp - tool sources
    * metrics.xml - metrics config file
  – ctrl - Test control
  – util - Test utils
  – simple_convolution - Simple convolution test kernel

Build

Build environment:


export CMAKE_PREFIX_PATH=:
export CMAKE_BUILD_TYPE=     # release by default
export CMAKE_DEBUG_TRACE=1   # to enable debug tracing

To build with the currently installed ROCm, and install to /opt/rocm/rocprofiler:

cd ../rocprofiler
mkdir build
cd build
export CMAKE_PREFIX_PATH=/opt/rocm/include/hsa:/opt/rocm
cmake ..
make
make install

Internal 'simple_convolution' test run script:

cd ../rocprofiler/build
./run.sh

To enable error message logging to '/tmp/rocprofiler_log.txt':

export ROCPROFILER_LOG=1

To enable verbose tracing:

export ROCPROFILER_TRACE=1

Profiling Tool 'rocprof' Usage

The following shows the command-line usage of the 'rocprof' tool:

rocprof [-h] [--list-basic] [--list-derived] [-i <input file>] [-o <output CSV file>]

Options:
  -h - this help
  --verbose - verbose mode, dumping all base counters used in the input metrics
  --list-basic - to print the list of basic HW counters
  --list-derived - to print the list of derived metrics with formulas
  -i <.txt|.xml file> - input file

Input file .txt format, automatically rerun application for every pmc line:

# Perf counters group 1
pmc : Wavefronts VALUInsts SALUInsts SFetchInsts FlatVMemInsts LDSInsts FlatLDSInsts GDSInsts VALUUtilization FetchSize
# Perf counters group 2
pmc : VALUUtilization,WriteSize L2CacheHit
# Filter by dispatches range, GPU index and kernel names
# supported range formats: "3:9", "3:", "3"
range: 1:4
gpu: 0 1 2 3
kernel: simple Pass1 simpleConvolutionPass2

Input file .xml format, for a single profiling run:

# Metrics list definition, also the form ":" can be used
# All defined metrics can be found in the 'metrics.xml'
# There are basic metrics for raw HW counters and high-level metrics for derived counters
# Filter by dispatches range, GPU index and kernel names

-o - output CSV file [.csv]

The output CSV file columns, in order:
• Index - kernel dispatch order index
• KernelName - the dispatched kernel name
• gpu-id - GPU id the kernel was submitted to
• queue-id - the ROCm queue unique id the kernel was submitted to
• queue-index - the ROCm queue write index for the submitted AQL packet
• tid - system application thread id which submitted the kernel
• grd - the kernel's grid size
• wgr - the kernel's work group size
• lds - the kernel's LDS memory size
• scr - the kernel's scratch memory size
• vgpr - the kernel's VGPR size
• sgpr - the kernel's SGPR size
• fbar - the kernel's barriers limitation
• sig - the kernel's completion signal
• . . . - the columns with the counters values per kernel dispatch
• DispatchNs/BeginNs/EndNs/CompleteNs - timestamp columns if time-stamping was enabled

-d - directory where the profiler stores profiling data, including thread traces [/tmp]
The data directory is removed automatically if it matches the temporary one, which is the default.

-t - to change the temporary directory [/tmp]
By changing the temporary directory you can prevent removing the profiling data from /tmp, or enable removing it from a directory other than '/tmp'.

--basenames - to turn on/off truncating of the kernel full function names to the base ones [off]

--timestamp - to turn on/off the kernel dispatch timestamps, dispatch/begin/end/complete [off]
Four kernel timestamps in nanoseconds are reported:
• DispatchNs - the time when the kernel AQL dispatch packet was written to the queue
• BeginNs - the kernel execution begin time
• EndNs - the kernel execution end time
• CompleteNs - the time when the completion signal of the AQL dispatch packet was received

--ctx-limit - maximum number of outstanding contexts [0 - unlimited]

--heartbeat - to print progress heartbeats [0 - disabled]

--stats - generate kernel execution stats, file .stats.csv

--hip-trace - to trace HIP: generates API execution stats/trace and a JSON file viewable in chrome tracing. The 'HCC_HOME' env var is required to be set to where 'hcc' is installed.

--hsa-trace - to trace HSA: generates API execution stats/trace and a JSON file viewable in chrome tracing. Generated files: .hsa_stats.txt .json. The traced API list can be set by input .txt or .xml files.

Input .txt:
hsa: hsa_queue_create hsa_amd_memory_pool_allocate

Input .xml:

Configuration file:
You can set your parameter default preferences in the configuration file 'rpl_rc.xml'. The search path sequence: .:/home/evgeny: The configuration file is looked for first in the current directory, then in your home directory, and then in the package directory. Configurable options: 'basenames', 'timestamp', 'ctx-limit', 'heartbeat'. An example of 'rpl_rc.xml':

2.8.5 ROCr Debug Agent

The ROCr Debug Agent is a library that can be loaded by the ROCm Platform Runtime to provide the following functionality:
• Print the state of wavefronts that report a memory violation or upon executing an s_trap 2 instruction.
• Allows SIGINT (ctrl c) or SIGTERM (kill -15) to print the wavefront state of aborted GPU dispatches.
• It is enabled on Vega10 GPUs on ROCm 2.7.

2.8.5.1 Usage

To use the ROCr Debug Agent set the following environment variable:

export HSA_TOOLS_LIB=librocr_debug_agent64.so

This will use the ROCr Debug Agent library installed at /opt/rocm/lib/librocr_debug_agent64.so by default since the ROCm installation adds /opt/rocm/lib to the system library path. To use a different version set the LD_LIBRARY_PATH, for example:

export LD_LIBRARY_PATH=/path_to_directory_containing_librocr_debug_agent64.so

To display the machine code instructions of wavefronts, together with the source text location, the ROCr Debug Agent uses the llvm-objdump tool. Ensure that a version that supports AMD GCN GPUs is on your $PATH. For example, for ROCm 2.7:

export PATH=/opt/rocm/opencl/bin/x86_64/:$PATH

Execute your application. If the application encounters a GPU error it will display the wavefront state of the GPU to stdout. Possible error states include:
• The GPU executes a memory instruction that causes a memory violation. This is reported as an XNACK error state.
• Queue error.
• The GPU executes an S_TRAP instruction. The __builtin_trap() language builtin can be used to generate an S_TRAP.
• A SIGINT (ctrl c) or SIGTERM (kill -15) signal is sent to the application while executing GPU code. Enabled by the ROCM_DEBUG_ENABLE_LINUX_SIGNALS environment variable.

For example, a sample printout for a GPU memory fault is:

Memory access fault by GPU agent: AMD gfx900
Node: 1
Address: 0x18DB4xxx (page not present; write access to a read-only page;)

64 wavefront(s) found in XNACK error state @PC: 0x0000001100E01310
printing the first one:

EXEC: 0xFFFFFFFFFFFFFFFF STATUS: 0x00412460 TRAPSTS: 0x30000000 M0: 0x00001010

s0: 0x00C00000 s1: 0x80000010 s2: 0x10000000 s3: 0x00EA4FAC s4: 0x17D78400 s5: 0x00000000 s6: 0x01039000 s7: 0x00000000 s8: 0x00000000 s9: 0x00000000 s10: 0x17D78400 s11: 0x04000000 s12: 0x00000000 s13: 0x00000000 s14: 0x00000000 s15: 0x00000000 s16: 0x0103C000 s17: 0x00000000 s18: 0x00000000 s19: 0x00000000 s20: 0x01037060 s21: 0x00000000 s22: 0x00000000 s23: 0x00000011 s24: 0x00004000 s25: 0x00010000 s26: 0x04C00000 s27: 0x00000010 s28: 0xFFFFFFFF s29: 0xFFFFFFFF s30: 0x00000000 s31: 0x00000000

Lane 0x0 v0: 0x00000003 v1: 0x18DB4400 v2: 0x18DB4400 v3: 0x00000000 v4: 0x00000000 v5: 0x00000000 v6: 0x00700000 v7: 0x00800000 Lane 0x1 v0: 0x00000004 v1: 0x18DB4400 v2: 0x18DB4400 v3: 0x00000000 v4: 0x00000000 v5: 0x00000000 v6: 0x00700000 v7: 0x00800000 Lane 0x2 v0: 0x00000005 v1: 0x18DB4400 v2: 0x18DB4400 v3: 0x00000000 v4: 0x00000000 v5: 0x00000000 v6: 0x00700000 v7: 0x00800000 Lane 0x3 v0: 0x00000006 v1: 0x18DB4400 v2: 0x18DB4400 v3: 0x00000000 v4: 0x00000000 v5: 0x00000000 v6: 0x00700000 v7: 0x00800000

. . .

Lane 0x3C v0: 0x0000001F v1: 0x18DB4400 v2: 0x18DB4400 v3: 0x00000000 v4: 0x00000000 v5: 0x00000000 v6: 0x00700000 v7: 0x00800000 Lane 0x3D v0: 0x00000020 v1: 0x18DB4400 v2: 0x18DB4400 v3: 0x00000000 v4: 0x00000000 v5: 0x00000000 v6: 0x00700000 v7: 0x00800000 Lane 0x3E v0: 0x00000021 v1: 0x18DB4400 v2: 0x18DB4400 v3: 0x00000000 v4: 0x00000000 v5: 0x00000000 v6: 0x00700000 v7: 0x00800000 Lane 0x3F v0: 0x00000022 v1: 0x18DB4400 v2: 0x18DB4400 v3: 0x00000000 v4: 0x00000000 v5: 0x00000000 v6: 0x00700000 v7: 0x00800000

Faulty Code Object:

/tmp/ROCm_Tmp_PID_5764/ROCm_Code_Object_0: file format ELF64--hsacobj

Disassembly of section .text:
the_kernel:
; /home/qingchuan/tests/faulty_test/vector_add_kernel.cl:12
; d[100000000] = ga[gid & 31];
    v_mov_b32_e32   v1, v2                  // 0000000012F0: 7E020302
    v_mov_b32_e32   v4, v3                  // 0000000012F4: 7E080303
    v_add_i32_e32   v1, vcc, s10, v1        // 0000000012F8: 3202020A
    v_mov_b32_e32   v5, s22                 // 0000000012FC: 7E0A0216
    v_addc_u32_e32  v4, vcc, v4, v5, vcc    // 000000001300: 38080B04
    v_mov_b32_e32   v2, v1                  // 000000001304: 7E040301
    v_mov_b32_e32   v3, v4                  // 000000001308: 7E060304
    s_waitcnt       lgkmcnt(0)              // 00000000130C: BF8CC07F
    flat_store_dword v[2:3], v0             // 000000001310: DC700000 00000002
; /home/qingchuan/tests/faulty_test/vector_add_kernel.cl:13
; }
    s_endpgm                                // 000000001318: BF810000

Faulty PC offset: 1310

Aborted (core dumped)
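A wavefront trap can also be triggered deliberately from HIP code. The following minimal sketch (hypothetical test code, not part of the ROCr Debug Agent) executes __builtin_trap() in a kernel; with HSA_TOOLS_LIB set as described above, the agent prints the wavefront state in the same way as the fault dump shown here:

#include <hip/hip_runtime.h>
#include <cstdio>

__global__ void trap_kernel(int /*unused*/) {
    // Every active wavefront executes s_trap here; the debug agent reports its state.
    __builtin_trap();
}

int main() {
    hipLaunchKernelGGL(trap_kernel, dim3(1), dim3(64), 0, 0, 0);
    hipDeviceSynchronize(); // the agent dumps the wavefront state; the process then aborts
    printf("not reached if the process was aborted\n");
    return 0;
}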

2.8.5.2 Options

2.8.5.2.1 Dump Output

By default the wavefront dump is sent to stdout. To save it to a file use:

export ROCM_DEBUG_WAVE_STATE_DUMP=file

This will create a file called ROCm_Wave_State_Dump in the code object directory (see below). To return to the default stdout use either of the following:

export ROCM_DEBUG_WAVE_STATE_DUMP=stdout
unset ROCM_DEBUG_WAVE_STATE_DUMP

2.8.5.2.2 Linux Signal Control

The following environment variable can be used to enable dumping wavefront states when SIGINT (ctrl c) or SIGTERM (kill -15) is sent to the application:

export ROCM_DEBUG_ENABLE_LINUX_SIGNALS=1

Either of the following will disable this behavior:

export ROCM_DEBUG_ENABLE_LINUX_SIGNALS=0
unset ROCM_DEBUG_ENABLE_LINUX_SIGNALS


2.8.5.2.3 Code Object Saving

When the ROCr Debug Agent is enabled, each GPU code object loaded by the ROCm Platform Runtime will be saved in a file in the code object directory. By default the code object directory is /tmp/ROCm_Tmp_PID_XXXX/ where XXXX is the application process ID. The code object directory can be specified using the following environment variable:

export ROCM_DEBUG_SAVE_CODE_OBJECT=code_object_directory

This will use the path /code_object_directory. Loaded code objects will be saved in files named ROCm_Code_Object_N, where N is a unique integer, starting at 0, indicating the order in which the code objects were loaded. If the default code object directory is used, then each saved code object file will be deleted when it is unloaded by the ROCm Platform Runtime, and the complete code object directory will be deleted when the application exits normally. If a code object directory path is specified, then neither the saved code objects nor the code object directory will be deleted. To return to using the default code object directory use:

unset ROCM_DEBUG_SAVE_CODE_OBJECT

2.8.5.2.4 Logging

By default ROCr Debug Agent logging is disabled. It can be enabled to display to stdout using:

export ROCM_DEBUG_ENABLE_AGENTLOG=stdout

Or to a file using:

export ROCM_DEBUG_ENABLE_AGENTLOG=

This will write to the file _AgentLog_PID_XXXX.log. To disable logging use:

unset ROCM_DEBUG_ENABLE_AGENTLOG

2.8.6 ROCm-GDB

The ROCm-GDB is being revised to work with the ROCr Debug Agent to support debugging GPU kernels on Radeon Open Compute platforms (ROCm) and will be available in an upcoming release.

2.8.7 Radeon Compute Profiler

The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to discover bottlenecks in the application and to find ways to optimize the application’s performance. RCP was formerly delivered as part of CodeXL with the executable name “CodeXLGpuProfiler”. Prior to its inclusion in CodeXL, it was known as “sprofile” and was part of the AMD APP Profiler product.


2.8.7.1 Major Features

• Measure the execution time of an OpenCL™ or ROCm/HSA kernel.
• Query the hardware performance counters on an AMD Radeon graphics card.
• Use the CXLActivityLogger API to trace and measure the execution of segments in the program.
• Display the IL/HSAIL and ISA (hardware disassembly) code of OpenCL™ kernels.
• Calculate kernel occupancy information, which estimates the number of in-flight wavefronts on a compute unit as a percentage of the theoretical maximum number of wavefronts that the compute unit can support (a simplified sketch of this calculation follows this list).
• When used with CodeXL, all profiler data can be visualized in a user-friendly graphical user interface.
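The occupancy estimate mentioned above amounts to taking the most restrictive of several per-resource wavefront limits. The following sketch is a simplified illustration only; the hardware limits and the kernel's resource usage below are assumptions chosen for the example, not values queried from a device or reported by RCP:

#include <algorithm>
#include <cstdio>

int main() {
    // Illustrative hardware limits (assumptions for this sketch):
    const unsigned max_waves_per_simd = 10;        // wave slots per SIMD
    const unsigned simds_per_cu       = 4;
    const unsigned vgprs_per_simd     = 256;
    const unsigned sgprs_per_simd     = 800;
    const unsigned lds_per_cu_bytes   = 64 * 1024;

    // Illustrative kernel resource usage (assumptions):
    const unsigned vgprs_per_wave  = 40;
    const unsigned sgprs_per_wave  = 48;
    const unsigned lds_per_group   = 8 * 1024;     // bytes per work-group
    const unsigned waves_per_group = 4;            // 256 work-items / 64-wide waves

    // Per-SIMD wave limits imposed by each resource:
    const unsigned by_vgpr = vgprs_per_simd / vgprs_per_wave;
    const unsigned by_sgpr = sgprs_per_simd / sgprs_per_wave;
    const unsigned by_lds  = (lds_per_cu_bytes / lds_per_group) * waves_per_group / simds_per_cu;

    const unsigned active = std::min({max_waves_per_simd, by_vgpr, by_sgpr, by_lds});
    printf("estimated occupancy: %u%% (%u of %u waves per SIMD)\n",
           active * 100 / max_waves_per_simd, active, max_waves_per_simd);
    return 0;
}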

2.8.7.2 What’s New

• Version 5.6 (12/21/18)
  – Adds support for additional GPUs and APUs.
  – Adds support for extracting kernel ISA for OpenCL kernels running on the ROCm stack.
  – Additional Trace summary HTML pages generated for both OpenCL and ROCm/HSA traces.
  – ROCm/HSA: Support for ROCm 2.0.
  – ROCm/HSA: Use the new librocprofiler64.so rather than the deprecated libhsa-runtime-tools64.so library for performance counter collection and kernel dispatch timing data.
    * Performance counter collection for GFX IPv7 based GPUs is no longer supported on ROCm.
  – ROCm/HSA: There is no longer any difference between using --hsatrace and --hsaaqlpackettrace. Identical data will be produced by both profiling modes.
  – OpenCL: Fixed several issues with profiling on Linux systems with the amdgpu-pro driver installed if both "PAL" and "legacy" OpenCL support was installed by the driver.
  – Fixed an issue where an incorrect kernel count was displayed in the kernel occupancy output file.
  – Fixed a potential race condition or hang when profiling a plaidML application.
  – Fixed an issue when specifying a relative path to a counter file (--counterfile).
  – 32-bit Linux binaries have been removed, as the OpenCL Linux driver no longer supports 32-bit OpenCL libraries.

2.8.7.3 System Requirements

• An AMD Radeon GCN-based GPU or APU
• Radeon Software Adrenalin Edition 18.8.1 or later (Driver Packaging Version 18.30 or later)
• ROCm 2.0. See the system requirements for ROCm: https://rocm.github.io/install.html and https://rocm.github.io/hardware.html
  – For use with ROCm, please make sure that the optional ROCm package "rocprofiler-dev" is installed.
    * Ubuntu: sudo apt install rocprofiler-dev
    * CentOS: sudo yum install rocprofiler-dev
• Windows 7, 8.1, and 10
  – For Windows, the Visual C++ Redistributable for Visual Studio 2015 is required. It can be downloaded from https://www.microsoft.com/en-us/download/details.aspx?id=48145
• Ubuntu (16.04 and later) and CentOS/RHEL (7 and later) distributions

2.8.7.4 Cloning the Repository

To clone the RCP repository, execute the following git command:

git clone https://github.com/GPUOpen-Tools/RCP.git

After cloning the repository, please run the following python script to retrieve the required dependencies (see BUILD.md at https://github.com/GPUOpen-Tools/RCP/blob/3a49405a1500067c49d181844ec90aea606055bb/BUILD.md for more information):

python Scripts/UpdateCommon.py

UpdateCommon.py has replaced the use of git submodules in the CodeXL repository.

2.8.7.5 Source Code Directory Layout

• Build – contains both Linux and Windows build-related files
• docs – contains documentation sources
• Scripts – scripts to use to clone/update dependent repositories
• Src/CLCommon – contains source code shared by the various OpenCL™ agents
• Src/CLOccupancyAgent – contains source code for the OpenCL™ agent which collects kernel occupancy information
• Src/CLProfileAgent – contains source code for the OpenCL™ agent which collects hardware performance counters
• Src/CLTraceAgent – contains source code for the OpenCL™ agent which collects application trace information
• Src/Common – contains source code shared by all of RCP
• Src/DeviceInfo – builds a lib containing the Common/Src/DeviceInfo code (Linux only)
• Src/HSAFdnCommon – contains source code shared by the various ROCm agents
• Src/HSAFdnPMC – contains source code for the ROCm agent which collects hardware performance counters
• Src/HSAFdnTrace – contains source code for the ROCm agent which collects application trace information
• Src/HSAUtils – builds a lib containing the Common ROCm code (Linux only)
• Src/MicroDLL – contains source code for API interception (Windows only)
• Src/PreloadXInitThreads – contains source code for a library that calls XInitThreads (Linux only)
• Src/ProfileDataParser – contains source code for a library that can be used to parse profiler output data files
• Src/VersionInfo – contains version info resource files
• Src/sanalyze – contains source code used to analyze and summarize profiler data
• Src/sprofile – contains source code for the main profiler executable


2.8.7.6 Documentation

The documentation for the Radeon Compute Profiler can be found in each GitHub release. In the release RadeonComputeProfiler-v*.zip file or RadeonComputeProfiler-v*.tgz file, there will be a “docs” directory. Simply open the index.html file in a web browser to view the documentation. The documentation is hosted publicly at: http://radeon-compute-profiler-rcp.readthedocs.io/en/latest/

2.8.7.7 Why version 5.x?

Although the Radeon Compute Profiler is a newly-branded tool, the technology contained in it has been around for several years. RCP has its roots in the AMD APP Profiler product, which progressed from version 1.x to 3.x. Then the profiler was included in CodeXL, and the codebase was labelled as version 4.x. Now that RCP is being pulled out of CodeXL and into its own codebase again, we’ve bumped the version number up to 5.x.

2.8.7.8 Known Issues

• For the OpenCL Profiler
  – When collecting performance counters on Linux, the current user must have write access to /sys/class/drm/card<N>/device/power_dpm_force_performance_level, where <N> is the index of the card in question. By default this file is only modifiable by root, so the profiler would have to be run as root in order for it to modify this file. It is possible to modify the permissions for the file instead so that it can be written by unprivileged users. The following command will achieve this. Note, however, that changing the permissions on a system file like this could circumvent security. Also, on multi-GPU systems, you may have to replace "card0" with the appropriate card number. Permissions on this file may be reset when rebooting the system:

    sudo chmod ugo+w /sys/class/drm/card0/device/power_dpm_force_performance_level

• For the ROCm Profiler
  – API Trace and Perf Counter data may be truncated or missing if the application being profiled does not call hsa_shut_down
  – Kernel occupancy information will only be written to disk if the application being profiled calls hsa_shut_down
  – When collecting a trace for an application that performs memory transfers using hsa_amd_memory_async_copy, if the application asks for the data transfer timestamps directly, it will not get correct timestamps. The profiler will show the correct timestamps, however.
  – When collecting an aql packet trace, if the application asks for the kernel dispatch timestamps directly, it will not get correct timestamps. The profiler will show the correct timestamps, however.
  – When the rocm-profiler package (.deb or .rpm) is installed along with rocm, it may not be able to generate the default single-pass counter files. If you do not see counter files in /opt/rocm/profiler/counterfiles, you can generate them manually with this command: "sudo /opt/rocm/profiler/bin/CodeXLGpuProfiler --list --outputfile /opt/rocm/profiler/counterfiles/counters --maxpassperfile 1"
  – When collecting a trace for an application with ROCm 2.0, extra HSA API calls not made by the application itself will appear in the trace. These originate from the rocprofiler library which is now used by RCP to collect kernel dispatch timestamps. This issue may go away in future ROCm versions.

2.8.8 ROC Tracer

ROC-tracer library, Runtimes Generic Callback/Activity APIs. The goal of the implementation is to provide a generic, runtime-independent profiler to trace API and asynchronous activity.

The API provides functionality for registering the runtimes' API callbacks and asynchronous activity records pool support.

The library source tree:
• inc/roctracer.h - Library public API
• src - Library sources
  – core - Library API sources
  – util - Library utils sources
• test - test suite
  – MatrixTranspose - test based on the HIP MatrixTranspose sample

Documentation
- API description: inc/roctracer.h
- Code example: test/MatrixTranspose/MatrixTranspose.cpp

To build and run test

- ROCm-2.3 or higher is required

cd

- Clone the development branch of roctracer:

git clone -b amd-master https://github.com/ROCmSoftwarePlatform/roctracer

- Set the environment:

export HIP_PATH=/opt/rocm/HIP
export HCC_HOME=/opt/rocm/hcc/
export CMAKE_PREFIX_PATH=/opt/rocm

- Build ROCtracer:

export CMAKE_BUILD_TYPE=   # release by default
cd /roctracer && mkdir build && cd build && cmake -DCMAKE_INSTALL_PREFIX=/opt/rocm .. && make -j

- To build and run the test:

make mytest
run.sh

- To install:

make install
or
make package && dpkg -i *.deb

2.8.9 CodeXL

CodeXL is a comprehensive tool suite that enables developers to harness the benefits of GPUs and APUs. It includes powerful GPU debugging, comprehensive GPU profiling, and static OpenCL™, OpenGL®, Vulkan® and DirectX® kernel/shader analysis capabilities, enhancing accessibility for software developers to enter the era of heterogeneous computing. CodeXL is available as a standalone user interface application for Windows® and Linux®. Please see the CodeXL GitHub repository for more information.


2.8.10 GPUPerfAPI

The GPU Performance API (GPUPerfAPI, or GPA) is a powerful library, providing access to GPU Performance Counters. It can help analyze the performance and execution characteristics of applications using a Radeon™ GPU. This library is used by Radeon Compute Profiler and CodeXL as well as several third-party tools. Please see the GPA GitHub repository for more information.

2.8.11 ROCm Binary Utilities

Documentation needs to be updated.

2.8.12 MIVisionX

The MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX delivers a highly optimized open source implementation of the Khronos OpenVX™ and OpenVX™ Extensions along with a Convolution Neural Net Model Compiler & Optimizer supporting ONNX and Khronos NNEF™ exchange formats. The toolkit allows for rapid prototyping and deployment of optimized workloads on a wide range of computer hardware, including small embedded x86 CPUs, APUs, discrete GPUs, and heterogeneous servers.
• AMD OpenVX
• AMD OpenVX Extensions
  – Loom 360 Video Stitch Library
  – Neural Net Library
  – OpenCV Extension
  – WinML Extension
• Applications
• Neural Net Model Compiler and Optimizer
• Samples
• Toolkit
• Utilities
  – Inference Generator
  – Loom Shell
  – RunCL
  – RunVX
• Prerequisites
• Build and Install MIVisionX
• Verify the Installation
• Docker
• Release Notes

AMD OpenVX [amd_openvx] is a highly optimized open source implementation of the Khronos OpenVX computer vision specification. It allows for rapid prototyping as well as fast execution on a wide range of computer hardware, including small embedded x86 CPUs and large discrete GPUs. The OpenVX framework provides a mechanism for 3rd party vendors to add new vision functions to OpenVX. This project has the below-mentioned OpenVX modules and utilities to extend the amd_openvx project, which contains the AMD OpenVX Core Engine.
• amd_loomsl: AMD Radeon Loom stitching library for live 360 degree video applications.

• amd_nn: OpenVX neural network module


• amd_opencv: OpenVX module that implements a mechanism to access OpenCV functionality as OpenVX kernels
• amd_winml: WinML extension that allows developers to import a pre-trained ONNX model into an OpenVX graph and add hundreds of different pre & post processing vision/generic/user-defined functions, available in OpenVX and OpenCV interop, to the input and output of the neural net model. This allows developers to build an end to end application for inference.

MIVisionX has a number of applications built on top of the OpenVX modules; it uses AMD optimized libraries to build applications which can be used to prototype or used as models to develop a product.
• Cloud Inference Application: This sample application does inference using a client-server system.
• Digit Test: This sample application is used to recognize hand written digits.
• MIVisionX OpenVX Classification: This sample application shows how to run supported pre-trained caffe models with the MIVisionX RunTime.
• MIVisionX WinML Classification: This sample application shows how to run supported ONNX models with the MIVisionX RunTime on Windows.
• MIVisionX WinML YoloV2: This sample application shows how to run tiny yolov2 (20 classes) with the MIVisionX RunTime on Windows.
• External Applications


Neural Net Model Compiler & Optimizer

model_compiler converts pre-trained neural net models to MIVisionX runtime code for optimized inference.

MIVisionX samples using OpenVX and OpenVX extension libraries

MIVisionX samples using runvx with GDF:

skintonedetect.gdf

usage: runvx skintonedetect.gdf

canny.gdf

usage: runvx canny.gdf

skintonedetect-LIVE.gdf (using a live camera)

usage: runvx -frames:live skintonedetect-LIVE.gdf

canny-LIVE.gdf (using a live camera)

usage: runvx -frames:live canny-LIVE.gdf

OpenCV_orb-LIVE.gdf (using a live camera)

usage: runvx -frames:live OpenCV_orb-LIVE.gdf

Note: More samples available on GitHub MIVisionX Toolkit, is a comprehensive set of help tools for neural net creation, development, training, and deploy- ment. The Toolkit provides you with helpful tools to design, develop, quantize, prune, retrain, and infer your neural network work in any framework. The Toolkit is designed to help you deploy your work to any AMD or 3rd party hardware, from embedded to servers. MIVisionX provides you with tools for accomplishing your tasks throughout the whole neural net life-cycle, from creating a model to deploying them for your target platforms. • inference_generator: generate inference library from pre-trained CAFFE models • loom_shell: an interpreter to prototype 360 degree video stitching applications using a script • RunVX: command-line utility to execute OpenVX graph described in GDF text file


• RunCL: command-line utility to build, execute, and debug OpenCL programs
Hardware prerequisites:
• CPU: SSE4.1 or above CPU, 64-bit
• GPU: GFX7 or above [optional]
• APU: Carrizo or above [optional]
Note: Some modules in MIVisionX can be built for CPU only. To take advantage of advanced features and modules, we recommend using AMD GPUs or AMD APUs.
Windows prerequisites:
• Windows 10
• Windows SDK
• Visual Studio 2017
• Install the latest drivers and OpenCL SDK
• OpenCV
  – Set the OpenCV_DIR environment variable to the OpenCV/build folder
  – Add %OpenCV_DIR%\x64\vc14\bin or %OpenCV_DIR%\x64\vc15\bin to your PATH
Linux prerequisites:
• Install ROCm
• ROCm CMake, MIOpenGEMM & MIOpen for Neural Net Extensions (vx_nn)
• CMake 2.8 or newer
• Qt Creator for the Cloud Inference Client
• Protobuf for the inference generator & model compiler
  – install libprotobuf-dev and protobuf-compiler, needed for vx_nn
• OpenCV
  – Set the OpenCV_DIR environment variable to the OpenCV/build folder
• FFMPEG - optional
  – FFMPEG is required for the amd_media & mv_deploy modules
For the convenience of the developer, we provide a setup script, MIVisionX-setup.py, which installs all the dependencies required by this project. The setup script creates a deps folder and installs all the prerequisites; it only needs to be executed once. If the -d (directory) option is not given, the script installs the deps folder in the '~/' directory by default, otherwise in the user-specified folder.
Prerequisites for the setup script:
• Ubuntu 16.04/18.04 or CentOS 7.5/7.6
• ROCm supported hardware
• ROCm
usage:

python MIVisionX-setup.py --directory [setup directory - optional]
                          --installer [package management tool - optional (default: apt-get) [options: Ubuntu: apt-get; CentOS: yum]]
                          --miopen [MIOpen version - optional (default: 1.8.1)]
                          --ffmpeg [FFMPEG installation - optional (default: no) [options: install ffmpeg - yes]]


Note: use --installer yum for CentOS.
Windows installers:
• MIVisionX-installer.msi: MIVisionX
• MIVisionX_WinML-installer.msi: MIVisionX for WinML
Windows build:
• Install the OpenCL SDK
• Install OpenCV with/without contrib to support camera capture, image display, & OpenCV extensions
  – Set the OpenCV_DIR environment variable to the OpenCV/build folder
  – Add %OpenCV_DIR%\x64\vc14\bin or %OpenCV_DIR%\x64\vc15\bin to your PATH
• Use MIVisionX.sln to build for the x64 platform
NOTE: vx_nn is not supported on Windows in this release.
Linux package install prerequisites:
• Ubuntu 16.04/18.04 or CentOS 7.5/7.6
• ROCm supported hardware
• ROCm

sudo apt-get install mivisionx
sudo yum install mivisionx

Note:
• vx_winml is not supported on Linux
• source code will not be available with an apt-get/yum install
• executables are placed in /opt/rocm/mivisionx/bin and libraries in /opt/rocm/mivisionx/lib
• OpenVX and module header files are placed in /opt/rocm/mivisionx/include
• model compiler, toolkit, & samples are placed in /opt/rocm/mivisionx
• the package (.deb & .rpm) install requires OpenCV v3.4.0 to execute the AMD OpenCV extensions
To build from source:
• Install ROCm
• Use the commands below to set up and build MIVisionX

git clone https://github.com/GPUOpen-ProfessionalCompute-Libraries/MIVisionX.git
cd MIVisionX
python MIVisionX-setup.py --directory [setup directory - optional]
                          --installer [package management tool - optional (default: apt-get) [options: Ubuntu: apt-get; CentOS: yum]]
                          --miopen [MIOpen version - optional (default: 1.8.1)]
                          --ffmpeg [FFMPEG installation - optional (default: no) [options: install ffmpeg - yes]]

Note: Use --installer yum for CentOS.

mkdir build
cd build
cmake ../
make -j8
sudo make install

Note:


• vx_winml is not supported on Linux
• the installer copies all executables into /opt/rocm/mivisionx/bin and libraries into /opt/rocm/mivisionx/lib
• the installer also copies all the OpenVX and module header files into the /opt/rocm/mivisionx/include folder
To build vx_nn from source:
• Install ROCm
• git clone, build and install the other ROCm projects (using cmake and make install) in the order below for vx_nn.

– rocm-cmake
– MIOpenGEMM
– MIOpen
  – make sure to use the -DMIOPEN_BACKEND=OpenCL option with cmake
• install protobuf
• install OpenCV
• install FFMPEG n4.0.4 - optional
• build and install (using cmake and make install)
  – executables will be placed in the bin folder
  – libraries will be placed in the lib folder
  – the installer will copy all executables into /opt/rocm/mivisionx/bin and libraries into /opt/rocm/lib
  – the installer also copies all the OpenVX and module header files into the /opt/rocm/mivisionx/include folder
• add the installed library path to the LD_LIBRARY_PATH environment variable (default /opt/rocm/mivisionx/lib)
• add the installed executable path to the PATH environment variable (default /opt/rocm/mivisionx/bin)
• The installer copies all executables into /opt/rocm/mivisionx/bin and libraries into /opt/rocm/mivisionx/lib
• The installer also copies all the OpenVX and OpenVX module header files into the /opt/rocm/mivisionx/include folder
• Apps, Samples, Documents, Model Compiler and Toolkit are placed into /opt/rocm/mivisionx
• Run samples to verify the installation
  – Canny Edge Detection


export PATH=$PATH:/opt/rocm/mivisionx/bin
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/rocm/mivisionx/lib
runvx /opt/rocm/mivisionx/samples/gdf/canny.gdf

Note: More samples are available here.
MIVisionX provides developers with docker images for Ubuntu 16.04, Ubuntu 18.04, CentOS 7.5, & CentOS 7.6. Using docker images, developers can quickly prototype and build applications without having to be locked into a single system setup or lose valuable time figuring out the dependencies of the underlying software.
• Ubuntu 16.04
• Ubuntu 18.04
• CentOS 7.5
• CentOS 7.6
Prerequisites
• Ubuntu 16.04/18.04
• ROCm supported hardware
Step 1 - Install rocm-dkms

sudo apt update
sudo apt dist-upgrade
sudo apt install libnuma-dev
sudo reboot

wget -qO - http://repo.radeon.com/rocm/apt/debian/rocm.gpg.key | sudo apt-key add -
echo 'deb [arch=amd64] http://repo.radeon.com/rocm/apt/debian/ xenial main' | sudo tee /etc/apt/sources.list.d/rocm.list
sudo apt update
sudo apt install rocm-dkms
sudo reboot

Step 2 - Setup Docker

sudo apt-get install curl
sudo curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -
sudo add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable"
sudo apt-get update
apt-cache policy docker-ce
sudo apt-get install -y docker-ce
sudo systemctl status docker

Step 3 - Get Docker Image

sudo docker pull mivisionx/ubuntu-16.04

Step 4 - Run the docker image

sudo docker run -it --device=/dev/kfd --device=/dev/dri --cap-add=SYS_RAWIO --device=/dev/mem --group-add video --network host mivisionx/ubuntu-16.04

• Optional: Map a localhost directory on the docker image
  – option to map the localhost directory with trained caffe models to be accessed on the docker image.


  – usage: -v {LOCAL_HOST_DIRECTORY_PATH}:{DOCKER_DIRECTORY_PATH}

sudo docker run -it -v /home/:/root/hostDrive/ --device=/dev/kfd --device=/dev/dri --cap-add=SYS_RAWIO --device=/dev/mem --group-add video --network host mivisionx/ubuntu-16.04

Note: Display option with docker
• Using host display

xhost +local:root
sudo docker run -it --device=/dev/kfd --device=/dev/dri --cap-add=SYS_RAWIO --device=/dev/mem --group-add video --network host --env DISPLAY=unix$DISPLAY --privileged --volume $XAUTH:/root/.Xauthority --volume /tmp/.X11-unix/:/tmp/.X11-unix mivisionx/ubuntu-16.04:latest

• Test display with a MIVisionX sample

export PATH=$PATH:/opt/rocm/mivisionx/bin
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/rocm/mivisionx/lib
runvx /opt/rocm/mivisionx/samples/gdf/canny.gdf

• Package (.deb & .rpm) install requires OpenCV v3.4.0 to execute AMD OpenCV extensions
Tested configurations:
• Windows 10
• Linux: Ubuntu - 16.04/18.04 & CentOS - 7.5/7.6
• ROCm: rocm-dkms - 2.6.22
• rocm-cmake - github master: ac45c6e
• MIOpenGEMM - 1.1.5
• MIOpen - 2.0.0
• Protobuf - V3.5.2
• OpenCV - 3.4.0
• Dependencies for all the above packages

2.9 ROCm Libraries

2.9.1 rocBLAS

Please refer to the rocBLAS GitHub link. rocBLAS is a BLAS implementation on top of AMD's Radeon Open Compute (ROCm) runtime and toolchains. rocBLAS is implemented in the HIP programming language and optimized for AMD's latest discrete GPUs.

2.9.1.1 Prerequisites

• A ROCm enabled platform, more information here.
• Base software stack, which includes:
  – HIP


2.9.1.2 Installing pre-built packages

Download pre-built packages either from ROCm's package servers or by clicking the GitHub releases tab and manually downloading (which could be newer). Release notes are available for each release on the releases tab.

sudo apt update && sudo apt install rocblas

2.9.1.3 Quickstart rocBLAS build

Bash helper build script (Ubuntu only) The root of this repository has a helper bash script install.sh to build and install rocBLAS on Ubuntu with a single command. It does not take a lot of options and hard-codes configuration that can be specified through invoking cmake directly, but it’s a great way to get started quickly and can serve as an example of how to build/install. A few commands in the script need sudo access, so it may prompt you for a password.

./install.sh -h    -- shows help
./install.sh -id   -- build library, build dependencies and install (-d flag only needs to be passed once on a system)

2.9.1.4 Manual build (all supported platforms)

If you use a distro other than Ubuntu, or would like more control over the build process, the rocBLAS wiki has helpful information on how to configure cmake and manually build.
Functions supported: a list of exported functions from rocBLAS can be found on the wiki.

2.9.1.5 rocBLAS interface examples

In general, the rocBLAS interface is compatible with the CPU-oriented Netlib BLAS and the cuBLAS-v2 API, with the explicit exception that traditional BLAS interfaces do not accept handles. The cuBLAS cublasHandle_t is replaced with rocblas_handle everywhere. Thus, porting a CUDA application that originally calls the cuBLAS API to a HIP application calling the rocBLAS API should be relatively straightforward. For example, the rocBLAS SGEMV interface is:

2.9.1.6 GEMV API

rocblas_status
rocblas_sgemv(rocblas_handle handle,
              rocblas_operation trans,
              rocblas_int m, rocblas_int n,
              const float* alpha,
              const float* A, rocblas_int lda,
              const float* x, rocblas_int incx,
              const float* beta,
              float* y, rocblas_int incy);
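As a hedged illustration of how this prototype is typically driven from HIP host code (a sketch that assumes a ROCm install with the hip and rocblas headers on the include path; the matrix contents and sizes are placeholders and error checking is omitted):

#include <hip/hip_runtime.h>
#include <rocblas.h>
#include <vector>

int main()
{
    const rocblas_int m = 3, n = 2, lda = m;
    const float alpha = 1.0f, beta = 0.0f;

    // Column-major A (m x n), x (n), y (m) on the host
    std::vector<float> hA(lda * n, 1.0f), hx(n, 1.0f), hy(m, 0.0f);

    float *dA, *dx, *dy;
    hipMalloc(&dA, sizeof(float) * hA.size());
    hipMalloc(&dx, sizeof(float) * hx.size());
    hipMalloc(&dy, sizeof(float) * hy.size());
    hipMemcpy(dA, hA.data(), sizeof(float) * hA.size(), hipMemcpyHostToDevice);
    hipMemcpy(dx, hx.data(), sizeof(float) * hx.size(), hipMemcpyHostToDevice);
    hipMemcpy(dy, hy.data(), sizeof(float) * hy.size(), hipMemcpyHostToDevice);

    rocblas_handle handle;
    rocblas_create_handle(&handle);

    // y := alpha * A * x + beta * y
    rocblas_sgemv(handle, rocblas_operation_none, m, n,
                  &alpha, dA, lda, dx, 1, &beta, dy, 1);

    hipMemcpy(hy.data(), dy, sizeof(float) * hy.size(), hipMemcpyDeviceToHost);

    rocblas_destroy_handle(handle);
    hipFree(dA); hipFree(dx); hipFree(dy);
    return 0;
}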


2.9.1.7 Batched and strided GEMM API

rocBLAS GEMM can process matrices in batches with regular strides. There are several permutations of these APIs; the following is an example that takes everything:

rocblas_status rocblas_sgemm_strided_batched(
    rocblas_handle handle,
    rocblas_operation transa, rocblas_operation transb,
    rocblas_int m, rocblas_int n, rocblas_int k,
    const float* alpha,
    const float* A, rocblas_int ls_a, rocblas_int ld_a, rocblas_int bs_a,
    const float* B, rocblas_int ls_b, rocblas_int ld_b, rocblas_int bs_b,
    const float* beta,
    float* C, rocblas_int ls_c, rocblas_int ld_c, rocblas_int bs_c,
    rocblas_int batch_count )

rocBLAS assumes matrices A and vectors x, y are allocated in GPU memory space and filled with data. Users are responsible for copying data from/to the host and device memory. HIP provides memcpy-style APIs to facilitate data management.
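The call shape, sketched against the prototype reproduced above (a fragment, not a complete program: handle is assumed created, and dA, dB, dC are assumed to be device buffers already holding batch_count column-major matrices packed back to back):

// batch_count independent GEMMs: C[i] = alpha * A[i] * B[i] + beta * C[i]
const rocblas_int m = 64, n = 64, k = 64;
const rocblas_int lda = m, ldb = k, ldc = m;
const rocblas_int ls_a = 1, ls_b = 1, ls_c = 1;                   // element strides, per the prototype above
const rocblas_int bs_a = lda * k, bs_b = ldb * n, bs_c = ldc * n; // stride between consecutive matrices
const rocblas_int batch_count = 10;
const float alpha = 1.0f, beta = 0.0f;

rocblas_sgemm_strided_batched(handle,
                              rocblas_operation_none, rocblas_operation_none,
                              m, n, k, &alpha,
                              dA, ls_a, lda, bs_a,
                              dB, ls_b, ldb, bs_b,
                              &beta,
                              dC, ls_c, ldc, bs_c,
                              batch_count);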

2.9.1.8 Asynchronous API

Except for a few routines (like TRSM) that allocate memory internally, which prevents asynchronicity, most of the library routines (like BLAS-1 SCAL, BLAS-2 GEMV, BLAS-3 GEMM) operate asynchronously with respect to the CPU, meaning these library functions return immediately. For more information regarding the rocBLAS library and the corresponding API documentation, refer to rocBLAS.
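Because most routines only enqueue work and return, the host must synchronize before reading results. A hedged sketch (handle, n, alpha, the device vector dx, and the host vector hx are assumed to exist as in the earlier sketches):

// The scal is enqueued and the call returns immediately
rocblas_sscal(handle, n, &alpha, dx, 1);

// Either copy the result back with a blocking hipMemcpy (with the default null
// stream this serializes after the enqueued work) ...
hipMemcpy(hx.data(), dx, sizeof(float) * n, hipMemcpyDeviceToHost);

// ... or synchronize explicitly on the stream attached to the handle
hipStream_t stream;
rocblas_get_stream(handle, &stream);
hipStreamSynchronize(stream);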

2.9.1.9 API

This section provides details of the library API

2.9.1.9.1 Types

2.9.1.9.1.1 Definitions

2.9.1.9.1.2 rocblas_int typedef int32_t rocblas_int To specify whether int32 or int64 is used.

2.9.1.9.1.3 rocblas_long typedef int64_t rocblas_long

2.9.1.9.1.4 rocblas_float_complex typedef float2 rocblas_float_complex


2.9.1.9.1.5 rocblas_double_complex typedef double2 rocblas_double_complex

2.9.1.9.1.6 rocblas_half typedef uint16_t rocblas_half

2.9.1.9.1.7 rocblas_half_complex typedef float2 rocblas_half_complex

2.9.1.9.1.8 rocblas_handle typedef struct _rocblas_handle *rocblas_handle

2.9.1.9.1.9 Enums

Enumeration constants have numbering that is consistent with CBLAS, ACML and most standard C BLAS libraries.

2.9.1.9.1.10 rocblas_operation

enum rocblas_operation
Used to specify whether the matrix is to be transposed or not. Parameter constants; numbering is consistent with CBLAS, ACML and most standard C BLAS libraries.
Values:
rocblas_operation_none = 111 Operate with the matrix.
rocblas_operation_transpose = 112 Operate with the transpose of the matrix.
rocblas_operation_conjugate_transpose = 113 Operate with the conjugate transpose of the matrix.

2.9.1.9.1.11 rocblas_fill enum rocblas_fill Used by the Hermitian, symmetric and triangular matrix routines to specify whether the upper or lower triangle is being referenced. Values: rocblas_fill_upper = 121 Upper triangle. rocblas_fill_lower = 122 Lower triangle.


rocblas_fill_full = 123

2.9.1.9.1.12 rocblas_diagonal enum rocblas_diagonal It is used by the triangular matrix routines to specify whether the matrix is unit triangular. Values: rocblas_diagonal_non_unit = 131 Non-unit triangular. rocblas_diagonal_unit = 132 Unit triangular.

2.9.1.9.1.13 rocblas_side enum rocblas_side Indicates the side matrix A is located relative to matrix B during multiplication. Values: rocblas_side_left = 141 Multiply general matrix by symmetric, Hermitian or triangular matrix on the left. rocblas_side_right = 142 Multiply general matrix by symmetric, Hermitian or triangular matrix on the right. rocblas_side_both = 143

2.9.1.9.1.14 rocblas_status enum rocblas_status rocblas status codes definition Values: rocblas_status_success = 0 success rocblas_status_invalid_handle = 1 handle not initialized, invalid or null rocblas_status_not_implemented = 2 function is not implemented rocblas_status_invalid_pointer = 3 invalid pointer parameter rocblas_status_invalid_size = 4 invalid size parameter rocblas_status_memory_error = 5 failed internal memory allocation, copy or dealloc rocblas_status_internal_error = 6 other internal library failure


2.9.1.9.1.15 rocblas_datatype

enum rocblas_datatype Indicates the precision width of data stored in a blas type. Values: rocblas_datatype_f16_r = 150 rocblas_datatype_f32_r = 151 rocblas_datatype_f64_r = 152 rocblas_datatype_f16_c = 153 rocblas_datatype_f32_c = 154 rocblas_datatype_f64_c = 155 rocblas_datatype_i8_r = 160 rocblas_datatype_u8_r = 161 rocblas_datatype_i32_r = 162 rocblas_datatype_u32_r = 163 rocblas_datatype_i8_c = 164 rocblas_datatype_u8_c = 165 rocblas_datatype_i32_c = 166 rocblas_datatype_u32_c = 167

2.9.1.9.1.16 rocblas_pointer_mode

enum rocblas_pointer_mode
Indicates whether the scalar pointer arguments (such as alpha and beta) are device pointers or host pointers.
Values:
rocblas_pointer_mode_host = 0
rocblas_pointer_mode_device = 1
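A short, hedged sketch of how the two modes change where the scalars live (handle is assumed created, dx is assumed to be a device vector of at least n floats; host pointer mode is commonly the default):

// Host pointer mode: alpha is read from host memory when the call is made
float h_alpha = 2.0f;
rocblas_set_pointer_mode(handle, rocblas_pointer_mode_host);
rocblas_sscal(handle, n, &h_alpha, dx, 1);

// Device pointer mode: alpha is read from device memory, which avoids a
// host round trip when a previous kernel produced the scalar
float *d_alpha;
hipMalloc(&d_alpha, sizeof(float));
hipMemcpy(d_alpha, &h_alpha, sizeof(float), hipMemcpyHostToDevice);
rocblas_set_pointer_mode(handle, rocblas_pointer_mode_device);
rocblas_sscal(handle, n, d_alpha, dx, 1);
hipFree(d_alpha);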

2.9.1.9.1.17 rocblas_layer_mode enum rocblas_layer_mode Indicates if layer is active with bitmask. Values: rocblas_layer_mode_none = 0b0000000000 rocblas_layer_mode_log_trace = 0b0000000001 rocblas_layer_mode_log_bench = 0b0000000010 rocblas_layer_mode_log_profile = 0b0000000100


2.9.1.9.1.18 rocblas_gemm_algo

enum rocblas_gemm_algo
Specifies which GEMM algorithm to use.
Values:
rocblas_gemm_algo_standard = 0b0000000000

2.9.1.9.2 Functions

2.9.1.9.2.1 Level 1 BLAS

2.9.1.9.2.2 rocblas_scal()

rocblas_status rocblas_dscal(rocblas_handle handle, rocblas_int n, const double *alpha, double *x, rocblas_int incx)
rocblas_status rocblas_sscal(rocblas_handle handle, rocblas_int n, const float *alpha, float *x, rocblas_int incx)
BLAS Level 1 API.
scal scales the vector x[i] with scalar alpha, for i = 1, ..., n

x := alpha * x ,

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] alpha: specifies the scalar alpha. • [inout] x: pointer storing vector x on the GPU. • [in] incx: specifies the increment for the elements of x.
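For example, a hedged fragment that scales a device vector in place (handle is assumed to be a valid rocblas_handle and dx a device buffer of at least n floats, set up as in the SGEMV sketch earlier):

const rocblas_int n = 1024;
const float alpha = 0.5f;
// x := 0.5 * x, computed on the GPU
rocblas_sscal(handle, n, &alpha, dx, 1);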

2.9.1.9.2.3 rocblas_copy()

rocblas_status rocblas_dcopy(rocblas_handle handle, rocblas_int n, const double *x, rocblas_int incx, double *y, rocblas_int incy)
rocblas_status rocblas_scopy(rocblas_handle handle, rocblas_int n, const float *x, rocblas_int incx, float *y, rocblas_int incy)
BLAS Level 1 API.
copy copies the vector x into the vector y, for i = 1, ..., n

y := x,

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU.


• [in] incx: specifies the increment for the elements of x. • [out] y: pointer storing vector y on the GPU. • [in] incy: rocblas_int specifies the increment for the elements of y.

2.9.1.9.2.4 rocblas_dot()

rocblas_status rocblas_ddot(rocblas_handle handle, rocblas_int n, const double *x, rocblas_int incx, const double *y, rocblas_int incy, double *result)
rocblas_status rocblas_sdot(rocblas_handle handle, rocblas_int n, const float *x, rocblas_int incx, const float *y, rocblas_int incy, float *result)
BLAS Level 1 API.
dot(u) performs the dot product of vectors x and y

result=x * y;

dotc perform dot product of complex vector x and complex y

result= conjugate (x) * y;

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y. • [inout] result: store the dot product. either on the host CPU or device GPU. return is 0.0 if n <= 0.

2.9.1.9.2.5 rocblas_swap()

rocblas_status rocblas_sswap(rocblas_handle handle, rocblas_int n, float *x, rocblas_int incx, float *y, rocblas_int incy)
BLAS Level 1 API.
swap interchanges vectors x[i] and y[i], for i = 1, ..., n

y := x; x :=y

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [inout] x: pointer storing vector x on the GPU. • [in] incx: specifies the increment for the elements of x. • [inout] y: pointer storing vector y on the GPU. • [in] incy: rocblas_int specifies the increment for the elements of y.


rocblas_status rocblas_dswap(rocblas_handle handle, rocblas_int n, double *x, rocblas_int incx, double *y, rocblas_int incy)

2.9.1.9.2.6 rocblas_axpy()

rocblas_status rocblas_daxpy(rocblas_handle handle, rocblas_int n, const double *alpha, const double *x, rocblas_int incx, double *y, rocblas_int incy)
rocblas_status rocblas_saxpy(rocblas_handle handle, rocblas_int n, const float *alpha, const float *x, rocblas_int incx, float *y, rocblas_int incy)
rocblas_status rocblas_haxpy(rocblas_handle handle, rocblas_int n, const rocblas_half *alpha, const rocblas_half *x, rocblas_int incx, rocblas_half *y, rocblas_int incy)
BLAS Level 1 API.
axpy computes y := alpha * x + y

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] alpha: specifies the scalar alpha. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of x. • [out] y: pointer storing vector y on the GPU. • [inout] incy: rocblas_int specifies the increment for the elements of y.
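For example, a hedged fragment (handle assumed created; dx and dy are assumed to be device vectors of at least n floats):

const rocblas_int n = 1024;
const float alpha = 2.0f;
// y := 2.0 * x + y
rocblas_saxpy(handle, n, &alpha, dx, 1, dy, 1);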

2.9.1.9.2.7 rocblas_asum()

rocblas_status rocblas_dasum(rocblas_handle handle, rocblas_int n, const double *x, rocblas_int incx, double *result)
rocblas_status rocblas_sasum(rocblas_handle handle, rocblas_int n, const float *x, rocblas_int incx, float *result)
BLAS Level 1 API.
asum computes the sum of the magnitudes of elements of a real vector x, or the sum of magnitudes of the real and imaginary parts of elements if x is a complex vector

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y. • [inout] result: store the asum product. either on the host CPU or device GPU. return is 0.0 if n, incx<=0.


2.9.1.9.2.8 rocblas_nrm2()

rocblas_status rocblas_dnrm2(rocblas_handle handle, rocblas_int n, const double *x, rocblas_int incx, double *result)
rocblas_status rocblas_snrm2(rocblas_handle handle, rocblas_int n, const float *x, rocblas_int incx, float *result)
BLAS Level 1 API.
nrm2 computes the Euclidean norm of a real or complex vector:
  result := sqrt( x'*x )   for a real vector
  result := sqrt( x**H*x ) for a complex vector

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y. • [inout] result: store the nrm2 product. either on the host CPU or device GPU. return is 0.0 if n, incx<=0.

2.9.1.9.2.9 rocblas_iamax()

rocblas_status rocblas_idamax(rocblas_handle handle, rocblas_int n, const double *x, rocblas_int incx, rocblas_int *result)
rocblas_status rocblas_isamax(rocblas_handle handle, rocblas_int n, const float *x, rocblas_int incx, rocblas_int *result)
BLAS Level 1 API.
amax finds the first index of the element of maximum magnitude of real vector x, or of the sum of the magnitudes of the real and imaginary parts of elements if x is a complex vector

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y. • [inout] result: store the amax index. either on the host CPU or device GPU. return is 0.0 if n, incx<=0.

2.9.1.9.2.10 rocblas_iamin()

rocblas_status rocblas_idamin(rocblas_handle handle, rocblas_int n, const double *x, rocblas_int incx, rocblas_int *result)
rocblas_status rocblas_isamin(rocblas_handle handle, rocblas_int n, const float *x, rocblas_int incx, rocblas_int *result)
BLAS Level 1 API.
amin finds the first index of the element of minimum magnitude of real vector x, or of the sum of the magnitudes of the real and imaginary parts of elements if x is a complex vector


Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y. • [inout] result: store the amin index. either on the host CPU or device GPU. return is 0.0 if n, incx<=0.

2.9.1.9.2.11 Level 2 BLAS

2.9.1.9.2.12 rocblas_gemv()

rocblas_status rocblas_dgemv(rocblas_handle handle, rocblas_operation trans, rocblas_int m, rocblas_int n, const double *alpha, const double *A, rocblas_int lda, const double *x, rocblas_int incx, const double *beta, double *y, rocblas_int incy)
rocblas_status rocblas_sgemv(rocblas_handle handle, rocblas_operation trans, rocblas_int m, rocblas_int n, const float *alpha, const float *A, rocblas_int lda, const float *x, rocblas_int incx, const float *beta, float *y, rocblas_int incy)
BLAS Level 2 API.
xGEMV performs one of the matrix-vector operations

y := alpha*A*x + beta*y, or
y := alpha*A**T*x + beta*y, or
y := alpha*A**H*x + beta*y,

where alpha and beta are scalars, x and y are vectors and A is an m by n matrix.

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] trans: rocblas_operation • [in] m: rocblas_int • [in] n: rocblas_int • [in] alpha: specifies the scalar alpha. • [in] A: pointer storing matrix A on the GPU. • [in] lda: rocblas_int specifies the leading dimension of A. • [in] x: pointer storing vector x on the GPU. • [in] incx: specifies the increment for the elements of x. • [in] beta: specifies the scalar beta. • [out] y: pointer storing vector y on the GPU. • [in] incy: rocblas_int specifies the increment for the elements of y.


2.9.1.9.2.13 rocblas_trsv()

rocblas_status rocblas_dtrsv(rocblas_handle handle, rocblas_fill uplo, rocblas_operation transA, rocblas_diagonal diag, rocblas_int m, const double *A, rocblas_int lda, double *x, rocblas_int incx)
rocblas_status rocblas_strsv(rocblas_handle handle, rocblas_fill uplo, rocblas_operation transA, rocblas_diagonal diag, rocblas_int m, const float *A, rocblas_int lda, float *x, rocblas_int incx)
BLAS Level 2 API.
trsv solves

A*x = alpha*b or A**T*x = alpha*b,

where x and b are vectors and A is a triangular matrix. The vector x is overwritten on b.

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] uplo: rocblas_fill. rocblas_fill_upper: A is an upper triangular matrix. rocblas_fill_lower: A is a lower triangular matrix. • [in] transA: rocblas_operation • [in] diag: rocblas_diagonal. rocblas_diagonal_unit: A is assumed to be unit triangular. rocblas_diagonal_non_unit: A is not assumed to be unit triangular. • [in] m: rocblas_int m specifies the number of rows of b. m >= 0. • [in] alpha: specifies the scalar alpha. • [in] A: pointer storing matrix A on the GPU, of dimension ( lda, m ) • [in] lda: rocblas_int specifies the leading dimension of A. lda = max( 1, m ). • [in] x: pointer storing vector x on the GPU. • [in] incx: specifies the increment for the elements of x.

2.9.1.9.2.14 rocblas_ger()

rocblas_status rocblas_dger(rocblas_handle handle, rocblas_int m, rocblas_int n, const double *alpha, const double *x, rocblas_int incx, const double *y, rocblas_int incy, double *A, rocblas_int lda)
rocblas_status rocblas_sger(rocblas_handle handle, rocblas_int m, rocblas_int n, const float *alpha, const float *x, rocblas_int incx, const float *y, rocblas_int incy, float *A, rocblas_int lda)
BLAS Level 2 API.
xHE(SY)MV performs the matrix-vector operation:

y := alpha*A*x + beta*y,

where alpha and beta are scalars, x and y are n element vectors and A is an n by n Hermitian(Symmetric) matrix. BLAS Level 2 API Parameters


• [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] uplo: rocblas_fill. specifies whether the upper or lower • [in] n: rocblas_int. • [in] alpha: specifies the scalar alpha. • [in] A: pointer storing matrix A on the GPU. • [in] lda: rocblas_int specifies the leading dimension of A. • [in] x: pointer storing vector x on the GPU. • [in] incx: specifies the increment for the elements of x. • [in] beta: specifies the scalar beta. • [out] y: pointer storing vector y on the GPU. • [in] incy: rocblas_int specifies the increment for the elements of y. xGER performs the matrix-vector operations

A := A + alpha*x*y**T

where alpha is a scalar, x and y are vectors, and A is an m by n matrix.

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] m: rocblas_int • [in] n: rocblas_int • [in] alpha: specifies the scalar alpha. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of x. • [in] y: pointer storing vector y on the GPU. • [in] incy: rocblas_int specifies the increment for the elements of y. • [inout] A: pointer storing matrix A on the GPU. • [in] lda: rocblas_int specifies the leading dimension of A.

2.9.1.9.2.15 rocblas_syr()

rocblas_status rocblas_dsyr(rocblas_handle handle, rocblas_fill uplo, rocblas_int n, const double *alpha, const double *x, rocblas_int incx, double *A, rocblas_int lda)
rocblas_status rocblas_ssyr(rocblas_handle handle, rocblas_fill uplo, rocblas_int n, const float *alpha, const float *x, rocblas_int incx, float *A, rocblas_int lda)
BLAS Level 2 API.
xSYR performs the matrix-vector operations

A := A + alpha*x*x**T

where alpha is a scalar, x is a vector, and A is an n by n symmetric matrix.


Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int • [in] alpha: specifies the scalar alpha. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of x. • [inout] A: pointer storing matrix A on the GPU. • [in] lda: rocblas_int specifies the leading dimension of A.

2.9.1.9.2.16 Level 3 BLAS

2.9.1.9.2.17 rocblas_trtri_batched()

rocblas_status rocblas_dtrtri_batched(rocblas_handle handle, rocblas_fill uplo, rocblas_diagonal diag, rocblas_int n, const double *A, rocblas_int lda, rocblas_int stride_a, double *invA, rocblas_int ldinvA, rocblas_int bsinvA, rocblas_int batch_count)
rocblas_status rocblas_strtri_batched(rocblas_handle handle, rocblas_fill uplo, rocblas_diagonal diag, rocblas_int n, const float *A, rocblas_int lda, rocblas_int stride_a, float *invA, rocblas_int ldinvA, rocblas_int bsinvA, rocblas_int batch_count)
BLAS Level 3 API.
trtri computes the inverse of a matrix A

inv(A);

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] uplo: rocblas_fill. specifies whether the upper ‘rocblas_fill_upper’ or lower ‘rocblas_fill_lower’ • [in] diag: rocblas_diagonal. = ‘rocblas_diagonal_non_unit’, A is non-unit triangular; = ‘rocblas_diagonal_unit’, A is unit triangular; • [in] n: rocblas_int. • [in] A: pointer storing matrix A on the GPU. • [in] lda: rocblas_int specifies the leading dimension of A. • [in] stride_a: rocblas_int “batch stride a”: stride from the start of one “A” matrix to the next •

2.9.1.9.2.18 rocblas_trsm()

rocblas_status rocblas_dtrsm(rocblas_handle handle, rocblas_side side, rocblas_fill uplo, rocblas_operation transA, rocblas_diagonal diag, rocblas_int m, rocblas_int n, const double *alpha, const double *A, rocblas_int lda, double *B, rocblas_int ldb)


rocblas_status rocblas_strsm(rocblas_handle handle, rocblas_side side, rocblas_fill uplo, rocblas_operation transA, rocblas_diagonal diag, rocblas_int m, rocblas_int n, const float *alpha, const float *A, rocblas_int lda, float *B, rocblas_int ldb)
BLAS Level 3 API.
trsm solves

op(A)*X = alpha*B or X*op(A) = alpha*B,

where alpha is a scalar, X and B are m by n matrices, A is a triangular matrix and op(A) is one of

op( A ) = A or op( A ) = A^T or op( A ) = A^H.

The matrix X is overwritten on B.

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] side: rocblas_side. rocblas_side_left: op(A)*X = alpha*B. rocblas_side_right: X*op(A) = alpha*B. • [in] uplo: rocblas_fill. rocblas_fill_upper: A is an upper triangular matrix. rocblas_fill_lower: A is a lower triangular matrix. • [in] transA: rocblas_operation. transB: op(A) = A. rocblas_operation_transpose: op(A) = A^T. rocblas_operation_conjugate_transpose: op(A) = A^H. • [in] diag: rocblas_diagonal. rocblas_diagonal_unit: A is assumed to be unit triangular. rocblas_diagonal_non_unit: A is not assumed to be unit triangular. • [in] m: rocblas_int. m specifies the number of rows of B. m >= 0. • [in] n: rocblas_int. n specifies the number of columns of B. n >= 0. • [in] alpha: alpha specifies the scalar alpha. When alpha is &zero then A is not referenced and B need not be set before entry. • [in] A: pointer storing matrix A on the GPU. of dimension ( lda, k ), where k is m when rocblas_side_left and is n when rocblas_side_right only the upper/lower triangular part is accessed. • [in] lda: rocblas_int. lda specifies the first dimension of A. if side = rocblas_side_left, lda >= max( 1, m ), if side = rocblas_side_right, lda >= max( 1, n ). •

2.9.1.9.2.19 rocblas_gemm()

rocblas_status rocblas_dgemm(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const double *alpha, const double *A, rocblas_int lda, const double *B, rocblas_int ldb, const double *beta, double *C, rocblas_int ldc)
rocblas_status rocblas_sgemm(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const float *alpha, const float *A, rocblas_int lda, const float *B, rocblas_int ldb, const float *beta, float *C, rocblas_int ldc)


rocblas_status rocblas_hgemm(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const rocblas_half *alpha, const rocblas_half *A, rocblas_int lda, const rocblas_half *B, rocblas_int ldb, const rocblas_half *beta, rocblas_half *C, rocblas_int ldc)
BLAS Level 3 API.
xGEMM performs one of the matrix-matrix operations

C = alpha*op( A )*op( B ) + beta*C,

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B and C are matrices, with op( A ) an m by k matrix, op( B ) a k by n matrix and C an m by n matrix.

Parameters
• [in] handle: rocblas_handle, handle to the rocblas library context queue.
• [in] transA: rocblas_operation, specifies the form of op( A )
• [in] transB: rocblas_operation, specifies the form of op( B )
• [in] m: rocblas_int, number of rows of matrices op( A ) and C
• [in] n: rocblas_int, number of columns of matrices op( B ) and C
• [in] k: rocblas_int, number of columns of matrix op( A ) and number of rows of matrix op( B )
• [in] alpha: specifies the scalar alpha.
• [in] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int, specifies the leading dimension of A.
• [in] B: pointer storing matrix B on the GPU.
• [in] ldb: rocblas_int, specifies the leading dimension of B.
• [in] beta: specifies the scalar beta.
• [inout] C: pointer storing matrix C on the GPU.
• [in] ldc: rocblas_int, specifies the leading dimension of C.
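A hedged host-side sketch of a single-precision GEMM call (column-major device matrices dA (m x k), dB (k x n) and dC (m x n) are assumed already allocated and filled, and handle already created):

const rocblas_int m = 128, n = 128, k = 128;
const rocblas_int lda = m, ldb = k, ldc = m;
const float alpha = 1.0f, beta = 0.0f;

// C := alpha * op(A) * op(B) + beta * C, with op = identity here
rocblas_sgemm(handle,
              rocblas_operation_none, rocblas_operation_none,
              m, n, k,
              &alpha, dA, lda, dB, ldb,
              &beta,  dC, ldc);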

2.9.1.9.2.20 rocblas_gemm_strided_batched() rocblas_status rocblas_dgemm_strided_batched(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_intn , rocblas_intk , const dou- ble *alpha, const double *A, rocblas_int lda, rocblas_int stride_a, const double *B, rocblas_int ldb, rocblas_int stride_b, const dou- ble *beta, double *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count)


rocblas_status rocblas_sgemm_strided_batched(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_intn , rocblas_intk , const float *al- pha, const float *A, rocblas_int lda, rocblas_int stride_a, const float *B, rocblas_int ldb, rocblas_int stride_b, const float *beta, float *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count) rocblas_status rocblas_hgemm_strided_batched(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_intm , rocblas_intn , rocblas_intk , const rocblas_half *alpha, const rocblas_half *A, rocblas_int lda, rocblas_int stride_a, const rocblas_half *B, rocblas_int ldb, rocblas_int stride_b, const rocblas_half *beta, rocblas_half *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count) BLAS Level 3 API. xGEMM_STRIDED_BATCHED performs one of the strided batched matrix-matrix operations

C[i*stride_c] = alpha*op( A[i*stride_a] )*op( B[i*stride_b] ) + beta*C[i*stride_c], for i in [0, batch_count-1]

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B and C are strided batched matrices, with op( A ) an m by k by batch_count strided_batched matrix, op( B ) an k by n by batch_count strided_batched matrix and C an m by n by batch_count strided_batched matrix.

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] transA: rocblas_operation specifies the form of op( A ) • [in] transB: rocblas_operation specifies the form of op( B ) • [in] m: rocblas_int. matrix dimention m. • [in] n: rocblas_int. matrix dimention n. • [in] k: rocblas_int. matrix dimention k. • [in] alpha: specifies the scalar alpha. • [in] A: pointer storing strided batched matrix A on the GPU. • [in] lda: rocblas_int specifies the leading dimension of “A”. • [in] stride_a: rocblas_int stride from the start of one “A” matrix to the next • [in] B: pointer storing strided batched matrix B on the GPU. • [in] ldb: rocblas_int specifies the leading dimension of “B”. • [in] stride_b: rocblas_int stride from the start of one “B” matrix to the next


• [in] beta: specifies the scalar beta. • [inout] C: pointer storing strided batched matrix C on the GPU. • [in] ldc: rocblas_int specifies the leading dimension of “C”. • [in] stride_c: rocblas_int stride from the start of one “C” matrix to the next • [in] batch_count: rocblas_int number of gemm operations in the batch

2.9.1.9.2.21 rocblas_gemm_kernel_name() rocblas_status rocblas_dgemm_kernel_name(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_intm , rocblas_intn , rocblas_intk , const double *alpha, const double *A, rocblas_int lda, rocblas_int stride_a, const double *B, rocblas_int ldb, rocblas_int stride_b, const double *beta, double *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count) rocblas_status rocblas_sgemm_kernel_name(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_intm , rocblas_int n, rocblas_intk , const float *alpha, const float *A, rocblas_int lda, rocblas_int stride_a, const float *B, rocblas_int ldb, rocblas_int stride_b, const float *beta, float *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count) rocblas_status rocblas_hgemm_kernel_name(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_intm , rocblas_int n, rocblas_intk , const rocblas_half *alpha, const rocblas_half *A, rocblas_int lda, rocblas_int stride_a, const rocblas_half *B, rocblas_int ldb, rocblas_int stride_b, const rocblas_half *beta, rocblas_half *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count)

2.9.1.9.2.22 rocblas_geam()

rocblas_status rocblas_dgeam(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, const double *alpha, const double *A, rocblas_int lda, const double *beta, const double *B, rocblas_int ldb, double *C, rocblas_int ldc)
rocblas_status rocblas_sgeam(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, const float *alpha, const float *A, rocblas_int lda, const float *beta, const float *B, rocblas_int ldb, float *C, rocblas_int ldc)
BLAS Level 3 API.
xGEAM performs one of the matrix-matrix operations

C = alpha*op( A ) + beta*op( B ),

where op( X ) is one of


op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B and C are matrices, with op( A ) an m by n matrix, op( B ) an m by n matrix, and C an m by n matrix.

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] transA: rocblas_operation specifies the form of op( A ) • [in] transB: rocblas_operation specifies the form of op( B ) • [in] m: rocblas_int. • [in] n: rocblas_int. • [in] alpha: specifies the scalar alpha. • [in] A: pointer storing matrix A on the GPU. • [in] lda: rocblas_int specifies the leading dimension of A. • [in] beta: specifies the scalar beta. • [in] B: pointer storing matrix B on the GPU. • [in] ldb: rocblas_int specifies the leading dimension of B. • [inout] C: pointer storing matrix C on the GPU. • [in] ldc: rocblas_int specifies the leading dimension of C.

2.9.1.9.2.23 BLAS Extensions

2.9.1.9.2.24 rocblas_gemm_ex()

rocblas_status rocblas_gemm_ex(rocblas_handle handle, rocblas_operation trans_a, rocblas_operation trans_b, rocblas_int m, rocblas_int n, rocblas_int k, const void *alpha, const void *a, rocblas_datatype a_type, rocblas_int lda, const void *b, rocblas_datatype b_type, rocblas_int ldb, const void *beta, const void *c, rocblas_datatype c_type, rocblas_int ldc, void *d, rocblas_datatype d_type, rocblas_int ldd, rocblas_datatype compute_type, rocblas_gemm_algo algo, int32_t solution_index, uint32_t flags, size_t *workspace_size, void *workspace)


2.9.1.9.2.25 rocblas_gemm_strided_batched_ex() rocblas_status rocblas_gemm_strided_batched_ex(rocblas_handle handle, rocblas_operation trans_a, rocblas_operation trans_b, rocblas_int m, rocblas_intn , rocblas_intk , const void *alpha, const void *a, rocblas_datatype a_type, rocblas_int lda, rocblas_long stride_a, const void *b, rocblas_datatype b_type, rocblas_int ldb, rocblas_long stride_b, const void *beta, const void *c, rocblas_datatype c_type, rocblas_int ldc, rocblas_long stride_c, void *d, rocblas_datatype d_type, rocblas_int ldd, rocblas_long stride_d, rocblas_int batch_count, rocblas_datatype compute_type, rocblas_gemm_algo algo, int32_t solution_index, uint32_t flags, size_t *workspace_size, void *workspace) BLAS EX API. GEMM_EX performs one of the matrix-matrix operations

D = alpha*op( A )*op( B ) + beta*C,

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B, C, and D are matrices, with op( A ) an m by k matrix, op( B ) a k by n matrix and C and D are m by n matrices.

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] transA: rocblas_operation specifies the form of op( A ) • [in] transB: rocblas_operation specifies the form of op( B ) • [in] m: rocblas_int. matrix dimension m • [in] n: rocblas_int. matrix dimension n • [in] k: rocblas_int. matrix dimension k • [in] alpha: const void * specifies the scalar alpha. Same datatype as compute_type. • [in] a: void * pointer storing matrix A on the GPU. • [in] a_type: rocblas_datatype specifies the datatype of matrix A • [in] lda: rocblas_int specifies the leading dimension of A. • [in] b: void * pointer storing matrix B on the GPU. • [in] b_type: rocblas_datatype specifies the datatype of matrix B • [in] ldb: rocblas_int specifies the leading dimension of B. • [in] beta: const void * specifies the scalar beta. Same datatype as compute_type.


• [in] c: void * pointer storing matrix C on the GPU. • [in] c_type: rocblas_datatype specifies the datatype of matrix C • [in] ldc: rocblas_int specifies the leading dimension of C. • [out] d: void * pointer storing matrix D on the GPU. • [in] d_type: rocblas_datatype specifies the datatype of matrix D • [in] ldd: rocblas_int specifies the leading dimension of D. • [in] compute_type: rocblas_datatype specifies the datatype of computation • [in] algo: rocblas_gemm_algo enumerant specifying the algorithm type. • [in] solution_index: int32_t reserved for future use • [in] flags: uint32_t reserved for future use •
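As a hedged sketch of driving the extended interface, following the rocblas_gemm_ex prototype shown in 2.9.1.9.2.24 for this release: it reuses handle, m, n, k, lda, ldb, ldc, alpha, beta, dA, dB and dC from the GEMM sketch above, keeps everything in f32, and additionally assumes a separate device output matrix dD with leading dimension ldd.

const rocblas_int ldd = ldc;
size_t workspace_size = 0;             // workspace handling per the prototype above

// D := alpha * op(A) * op(B) + beta * C, all operands in rocblas_datatype_f32_r
rocblas_gemm_ex(handle,
                rocblas_operation_none, rocblas_operation_none,
                m, n, k,
                &alpha,
                dA, rocblas_datatype_f32_r, lda,
                dB, rocblas_datatype_f32_r, ldb,
                &beta,
                dC, rocblas_datatype_f32_r, ldc,
                dD, rocblas_datatype_f32_r, ldd,
                rocblas_datatype_f32_r,    // compute_type
                rocblas_gemm_algo_standard,
                0,                         // solution_index (reserved)
                0,                         // flags (reserved)
                &workspace_size, nullptr); // no user-supplied workspace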

2.9.1.9.2.26 Build Information

2.9.1.9.2.27 rocblas_get_version_string() rocblas_status rocblas_get_version_string(char *buf, size_t len) BLAS EX API. GEMM_STRIDED_BATCHED_EX performs one of the strided_batched matrix-matrix operations

D[i*stride_d] = alpha*op( A[i*stride_a] )*op( B[i*stride_b] ) + beta*C[i*stride_c], for i in [0, batch_count-1]

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B, C, and D are strided_batched matrices, with op( A ) an m by k by batch_count strided_batched matrix, op( B ) a k by n by batch_count strided_batched matrix and C and D are m by n by batch_count strided_batched matrices. The strided_batched matrices are multiple matrices separated by a constant stride. The number of matrices is batch_count.

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] transA: rocblas_operation specifies the form of op( A ) • [in] transB: rocblas_operation specifies the form of op( B ) • [in] m: rocblas_int. matrix dimension m • [in] n: rocblas_int. matrix dimension n • [in] k: rocblas_int. matrix dimension k


• [in] alpha: const void * specifies the scalar alpha. Same datatype as compute_type. • [in] a: void * pointer storing matrix A on the GPU. • [in] a_type: rocblas_datatype specifies the datatype of matrix A • [in] lda: rocblas_int specifies the leading dimension of A. • [in] stride_a: rocblas_long specifies stride from start of one “A” matrix to the next • [in] b: void * pointer storing matrix B on the GPU. • [in] b_type: rocblas_datatype specifies the datatype of matrix B • [in] ldb: rocblas_int specifies the leading dimension of B. • [in] stride_b: rocblas_long specifies stride from start of one “B” matrix to the next • [in] beta: const void * specifies the scalar beta. Same datatype as compute_type. • [in] c: void * pointer storing matrix C on the GPU. • [in] c_type: rocblas_datatype specifies the datatype of matrix C • [in] ldc: rocblas_int specifies the leading dimension of C. • [in] stride_c: rocblas_long specifies stride from start of one “C” matrix to the next • [out] d: void * pointer storing matrix D on the GPU. • [in] d_type: rocblas_datatype specifies the datatype of matrix D • [in] ldd: rocblas_int specifies the leading dimension of D. • [in] stride_d: rocblas_long specifies stride from start of one “D” matrix to the next • [in] batch_count: rocblas_int number of gemm operations in the batch • [in] compute_type: rocblas_datatype specifies the datatype of computation • [in] algo: rocblas_gemm_algo enumerant specifying the algorithm type. • [in] solution_index: int32_t reserved for future use • [in] flags: uint32_t reserved for future use •

2.9.1.9.2.28 Auxiliary

2.9.1.9.2.29 rocblas_pointer_to_mode()

rocblas_pointer_mode rocblas_pointer_to_mode(void *ptr)
Indicates whether the pointer is on the host or device. Currently the HIP API can only recognize whether the input ptr is on the device; it cannot recognize whether it is on the host.

2.9.1.9.2.30 rocblas_create_handle() rocblas_status rocblas_create_handle(rocblas_handle *handle)


2.9.1.9.2.31 rocblas_destroy_handle()

rocblas_status rocblas_destroy_handle(rocblas_handle handle)

2.9.1.9.2.32 rocblas_add_stream() rocblas_status rocblas_add_stream(rocblas_handle handle, hipStream_t stream)

2.9.1.9.2.33 rocblas_set_stream()

rocblas_status rocblas_set_stream(rocblas_handle handle, hipStream_t stream)
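A hedged sketch of attaching a HIP stream to a handle so that subsequent rocBLAS calls are enqueued on that stream:

#include <hip/hip_runtime.h>
#include <rocblas.h>

int main()
{
    rocblas_handle handle;
    rocblas_create_handle(&handle);

    hipStream_t stream;
    hipStreamCreate(&stream);
    rocblas_set_stream(handle, stream);   // later rocBLAS calls on this handle use this stream

    // ... enqueue rocBLAS work here, then wait for it to finish
    hipStreamSynchronize(stream);

    rocblas_destroy_handle(handle);
    hipStreamDestroy(stream);
    return 0;
}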

2.9.1.9.2.34 rocblas_get_stream()

rocblas_status rocblas_get_stream(rocblas_handle handle, hipStream_t *stream)

2.9.1.9.2.35 rocblas_set_pointer_mode()

rocblas_status rocblas_set_pointer_mode(rocblas_handle handle, rocblas_pointer_mode pointer_mode)

2.9.1.9.2.36 rocblas_get_pointer_mode() rocblas_status rocblas_get_pointer_mode(rocblas_handle handle, rocblas_pointer_mode *pointer_mode)

2.9.1.9.2.37 rocblas_set_vector()

rocblas_status rocblas_set_vector(rocblas_int n, rocblas_int elem_size, const void *x, rocblas_int incx, void *y, rocblas_int incy)

2.9.1.9.2.38 rocblas_get_vector()

rocblas_status rocblas_get_vector(rocblas_int n, rocblas_int elem_size, const void *x, rocblas_int incx, void *y, rocblas_int incy)

2.9.1.9.2.39 rocblas_set_matrix() rocblas_status rocblas_set_matrix(rocblas_int rows, rocblas_int cols, rocblas_int elem_size, const void *a, rocblas_int lda, void *b, rocblas_int ldb)

2.9.1.9.2.40 rocblas_get_matrix() rocblas_status rocblas_get_matrix(rocblas_int rows, rocblas_int cols, rocblas_int elem_size, const void *a, rocblas_int lda, void *b, rocblas_int ldb)
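These helpers mirror the classic BLAS set/get utilities. A hedged sketch, assuming they follow the usual convention that the first buffer is the source and the second the destination (host to device for set, device to host for get):

#include <hip/hip_runtime.h>
#include <rocblas.h>
#include <vector>

int main()
{
    const rocblas_int rows = 4, cols = 3, ld = 4;
    std::vector<float> hA(ld * cols, 1.0f);   // column-major host matrix
    float *dA;
    hipMalloc(&dA, sizeof(float) * hA.size());

    // host -> device
    rocblas_set_matrix(rows, cols, sizeof(float), hA.data(), ld, dA, ld);
    // ... operate on dA with rocBLAS ...
    // device -> host
    rocblas_get_matrix(rows, cols, sizeof(float), dA, ld, hA.data(), ld);

    hipFree(dA);
    return 0;
}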


2.9.1.10 All API namespace rocblas

Functions

void reinit_logs()

file rocblas-auxiliary.h
#include
#include "rocblas-types.h"
rocblas-auxiliary.h provides auxiliary functions in rocblas

Defines

_ROCBLAS_AUXILIARY_H_

Functions

rocblas_pointer_mode rocblas_pointer_to_mode(void *ptr)
Indicates whether the pointer is on the host or device. Currently the HIP API can only recognize whether the input ptr is on the device; it cannot recognize whether it is on the host.
rocblas_status rocblas_create_handle(rocblas_handle *handle)
rocblas_status rocblas_destroy_handle(rocblas_handle handle)
rocblas_status rocblas_add_stream(rocblas_handle handle, hipStream_t stream)
rocblas_status rocblas_set_stream(rocblas_handle handle, hipStream_t stream)
rocblas_status rocblas_get_stream(rocblas_handle handle, hipStream_t *stream)
rocblas_status rocblas_set_pointer_mode(rocblas_handle handle, rocblas_pointer_mode pointer_mode)
rocblas_status rocblas_get_pointer_mode(rocblas_handle handle, rocblas_pointer_mode *pointer_mode)
rocblas_status rocblas_set_vector(rocblas_int n, rocblas_int elem_size, const void *x, rocblas_int incx, void *y, rocblas_int incy)
rocblas_status rocblas_get_vector(rocblas_int n, rocblas_int elem_size, const void *x, rocblas_int incx, void *y, rocblas_int incy)
rocblas_status rocblas_set_matrix(rocblas_int rows, rocblas_int cols, rocblas_int elem_size, const void *a, rocblas_int lda, void *b, rocblas_int ldb)
rocblas_status rocblas_get_matrix(rocblas_int rows, rocblas_int cols, rocblas_int elem_size, const void *a, rocblas_int lda, void *b, rocblas_int ldb)

file rocblas-functions.h
#include "rocblas-types.h"
rocblas_functions.h provides Basic Linear Algebra Subprograms of Level 1, 2 and 3, using HIP optimized for AMD HCC-based GPU hardware. This library can also run on CUDA-based NVIDIA GPUs. This file exposes the C89 BLAS interface.

Defines

_ROCBLAS_FUNCTIONS_H_


Functions

rocblas_status rocblas_sscal(rocblas_handle handle, rocblas_intn , const float *alpha, float *x, rocblas_int incx) BLAS Level 1 API. scal scal the vector x[i] with scalar alpha, for i = 1 , . . . , n

x := alpha * x ,

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] alpha: specifies the scalar alpha. • [inout] x: pointer storing vector x on the GPU. • [in] incx: specifies the increment for the elements of x.

rocblas_status rocblas_dscal(rocblas_handle handle, rocblas_intn , const double *alpha, double *x, rocblas_int incx) rocblas_status rocblas_scopy(rocblas_handle handle, rocblas_intn , const float *x, rocblas_int incx, float *y, rocblas_int incy) BLAS Level 1 API. copy copies the vector x into the vector y, for i = 1 , . . . , n

y := x,

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: specifies the increment for the elements of x. • [out] y: pointer storing vector y on the GPU. • [in] incy: rocblas_int specifies the increment for the elements of y.

rocblas_status rocblas_dcopy(rocblas_handle handle, rocblas_intn , const double *x, rocblas_int incx, double *y, rocblas_int incy) rocblas_status rocblas_sdot(rocblas_handle handle, rocblas_intn , const float *x, rocblas_int incx, const float *y, rocblas_int incy, float *result) BLAS Level 1 API. dot(u) perform dot product of vector x and y

result=x * y;

dotc perform dot product of complex vector x and complex y

result= conjugate (x) * y;

Parameters


• [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y. • [inout] result: store the dot product. either on the host CPU or device GPU. return is 0.0 if n <= 0.

rocblas_status rocblas_ddot(rocblas_handle handle, rocblas_intn , const double *x, rocblas_int incx, const double *y, rocblas_int incy, double *result) rocblas_status rocblas_sswap(rocblas_handle handle, rocblas_intn , float *x, rocblas_int incx, float *y, rocblas_int incy) BLAS Level 1 API. swap interchange vector x[i] and y[i], for i = 1 , . . . , n

y := x; x :=y

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [inout] x: pointer storing vector x on the GPU. • [in] incx: specifies the increment for the elements of x. • [inout] y: pointer storing vector y on the GPU. • [in] incy: rocblas_int specifies the increment for the elements of y.

rocblas_status rocblas_dswap(rocblas_handle handle, rocblas_intn , double *x, rocblas_int incx, double *y, rocblas_int incy) rocblas_status rocblas_haxpy(rocblas_handle handle, rocblas_intn , const rocblas_half *al- pha, const rocblas_half *x, rocblas_int incx, rocblas_half *y, rocblas_int incy) BLAS Level 1 API. axpy compute y := alpha * x + y

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] alpha: specifies the scalar alpha. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of x. • [out] y: pointer storing vector y on the GPU. • [inout] incy: rocblas_int specifies the increment for the elements of y.

rocblas_status rocblas_saxpy(rocblas_handle handle, rocblas_intn , const float *alpha, const float *x, rocblas_int incx, float *y, rocblas_int incy)


rocblas_status rocblas_daxpy(rocblas_handle handle, rocblas_intn , const double *alpha, const double *x, rocblas_int incx, double *y, rocblas_int incy) rocblas_status rocblas_sasum(rocblas_handle handle, rocblas_intn , const float *x, rocblas_int incx, float *result) BLAS Level 1 API. asum computes the sum of the magnitudes of elements of a real vector x, or the sum of magnitudes of the real and imaginary parts of elements if x is a complex vector

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y. • [inout] result: store the asum product. either on the host CPU or device GPU. return is 0.0 if n, incx<=0.

rocblas_status rocblas_dasum(rocblas_handle handle, rocblas_intn , const double *x, rocblas_int incx, double *result) rocblas_status rocblas_snrm2(rocblas_handle handle, rocblas_intn , const float *x, rocblas_int incx, float *result) BLAS Level 1 API. nrm2 computes the euclidean norm of a real or complex vector := sqrt( x’*x ) for real vector := sqrt( x**H*x ) for complex vector

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y. • [inout] result: store the nrm2 product. either on the host CPU or device GPU. return is 0.0 if n, incx<=0.

rocblas_status rocblas_dnrm2(rocblas_handle handle, rocblas_intn , const double *x, rocblas_int incx, double *result) rocblas_status rocblas_isamax(rocblas_handle handle, rocblas_intn , const float *x, rocblas_int incx, rocblas_int *result) BLAS Level 1 API. amax finds the first index of the element of maximum magnitude of real vector x or the sum of magnitude of the real and imaginary parts of elements if x is a complex vector

Parameters • [in] handle: rocblas_handle. handle to the rocblas library context queue. • [in] n: rocblas_int. • [in] x: pointer storing vector x on the GPU. • [in] incx: rocblas_int specifies the increment for the elements of y.


• [inout] result: stores the amax index, either on the host CPU or on the device GPU. The result is 0 if n <= 0 or incx <= 0.

rocblas_status rocblas_idamax(rocblas_handle handle, rocblas_int n, const double *x, rocblas_int incx, rocblas_int *result)

rocblas_status rocblas_isamin(rocblas_handle handle, rocblas_int n, const float *x, rocblas_int incx, rocblas_int *result)
BLAS Level 1 API. amin finds the first index of the element of minimum magnitude of a real vector x, or of the minimum sum of the magnitudes of the real and imaginary parts of the elements if x is a complex vector

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] n: rocblas_int.
• [in] x: pointer storing vector x on the GPU.
• [in] incx: rocblas_int specifies the increment for the elements of x.
• [inout] result: stores the amin index, either on the host CPU or on the device GPU. The result is 0 if n <= 0 or incx <= 0.
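As a hedged illustration of the reduction-style routines above (asum, nrm2, amax, amin), the sketch below calls rocblas_isamax with the result returned to a host variable under the default host pointer mode; the data values are invented for the example.

#include <vector>
#include <hip/hip_runtime.h>
#include "rocblas.h"

int main()
{
    rocblas_int n = 5;
    std::vector<float> hx = {0.5f, -3.0f, 2.0f, -1.0f, 0.0f};

    float *dx;
    hipMalloc(&dx, n * sizeof(float));
    hipMemcpy(dx, hx.data(), n * sizeof(float), hipMemcpyHostToDevice);

    rocblas_handle handle;
    rocblas_create_handle(&handle);

    // With the default host pointer mode the index is written to a host variable.
    rocblas_int index = 0;
    rocblas_isamax(handle, n, dx, 1, &index);
    // The index follows the 1-based BLAS convention; here it points at -3.0f.

    rocblas_destroy_handle(handle);
    hipFree(dx);
    return 0;
}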

rocblas_status rocblas_idamin(rocblas_handle handle, rocblas_int n, const double *x, rocblas_int incx, rocblas_int *result)

rocblas_status rocblas_sgemv(rocblas_handle handle, rocblas_operation trans, rocblas_int m, rocblas_int n, const float *alpha, const float *A, rocblas_int lda, const float *x, rocblas_int incx, const float *beta, float *y, rocblas_int incy)
BLAS Level 2 API. xGEMV performs one of the matrix-vector operations

y := alpha*A*x + beta*y, or
y := alpha*A**T*x + beta*y, or
y := alpha*A**H*x + beta*y,

where alpha and beta are scalars, x and y are vectors and A is an m by n matrix.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] trans: rocblas_operation
• [in] m: rocblas_int
• [in] n: rocblas_int
• [in] alpha: specifies the scalar alpha.
• [in] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int specifies the leading dimension of A.
• [in] x: pointer storing vector x on the GPU.
• [in] incx: specifies the increment for the elements of x.
• [in] beta: specifies the scalar beta.


• [out] y: pointer storing vector y on the GPU.
• [in] incy: rocblas_int specifies the increment for the elements of y.
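A minimal sketch of the single-precision GEMV documented above, computing y := alpha*A*x + beta*y for a small column-major matrix; the dimensions and values are illustrative assumptions and error handling is omitted.

#include <vector>
#include <hip/hip_runtime.h>
#include "rocblas.h"

int main()
{
    rocblas_int m = 3, n = 2;
    float alpha = 1.0f, beta = 0.0f;
    // A is m x n, stored column-major with lda = m
    std::vector<float> hA = {1, 2, 3,  4, 5, 6};
    std::vector<float> hx = {1, 1};
    std::vector<float> hy(m, 0.0f);

    float *dA, *dx, *dy;
    hipMalloc(&dA, hA.size() * sizeof(float));
    hipMalloc(&dx, hx.size() * sizeof(float));
    hipMalloc(&dy, hy.size() * sizeof(float));
    hipMemcpy(dA, hA.data(), hA.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dx, hx.data(), hx.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dy, hy.data(), hy.size() * sizeof(float), hipMemcpyHostToDevice);

    rocblas_handle handle;
    rocblas_create_handle(&handle);

    // y := alpha * A * x + beta * y (no transpose)
    rocblas_sgemv(handle, rocblas_operation_none, m, n, &alpha, dA, m, dx, 1, &beta, dy, 1);

    hipMemcpy(hy.data(), dy, hy.size() * sizeof(float), hipMemcpyDeviceToHost);

    rocblas_destroy_handle(handle);
    hipFree(dA); hipFree(dx); hipFree(dy);
    return 0;
}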

rocblas_status rocblas_dgemv(rocblas_handle handle, rocblas_operation trans, rocblas_int m, rocblas_int n, const double *alpha, const double *A, rocblas_int lda, const double *x, rocblas_int incx, const double *beta, double *y, rocblas_int incy)

rocblas_status rocblas_strsv(rocblas_handle handle, rocblas_fill uplo, rocblas_operation transA, rocblas_diagonal diag, rocblas_int m, const float *A, rocblas_int lda, float *x, rocblas_int incx)
BLAS Level 2 API. trsv solves

A*x = alpha*b or A**T*x = alpha*b,

where x and b are vectors and A is a triangular matrix. The vector x is overwritten on b.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] uplo: rocblas_fill. rocblas_fill_upper: A is an upper triangular matrix. rocblas_fill_lower: A is a lower triangular matrix.
• [in] transA: rocblas_operation
• [in] diag: rocblas_diagonal. rocblas_diagonal_unit: A is assumed to be unit triangular. rocblas_diagonal_non_unit: A is not assumed to be unit triangular.
• [in] m: rocblas_int. m specifies the number of rows of b. m >= 0.
• [in] alpha: specifies the scalar alpha.
• [in] A: pointer storing matrix A on the GPU, of dimension ( lda, m )
• [in] lda: rocblas_int specifies the leading dimension of A. lda = max( 1, m ).
• [in] x: pointer storing vector x on the GPU.
• [in] incx: specifies the increment for the elements of x.

rocblas_status rocblas_dtrsv(rocblas_handle handle, rocblas_fill uplo, rocblas_operation transA, rocblas_diagonal diag, rocblas_int m, const double *A, rocblas_int lda, double *x, rocblas_int incx)

rocblas_status rocblas_sger(rocblas_handle handle, rocblas_int m, rocblas_int n, const float *alpha, const float *x, rocblas_int incx, const float *y, rocblas_int incy, float *A, rocblas_int lda)
BLAS Level 2 API. xHE(SY)MV performs the matrix-vector operation:

y := alpha*A*x + beta*y,

where alpha and beta are scalars, x and y are n element vectors and A is an n by n Hermitian(Symmetric) matrix. BLAS Level 2 API


Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] uplo: rocblas_fill. specifies whether the upper or lower
• [in] n: rocblas_int.
• [in] alpha: specifies the scalar alpha.
• [in] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int specifies the leading dimension of A.
• [in] x: pointer storing vector x on the GPU.
• [in] incx: specifies the increment for the elements of x.
• [in] beta: specifies the scalar beta.
• [out] y: pointer storing vector y on the GPU.
• [in] incy: rocblas_int specifies the increment for the elements of y.

xGER performs the matrix-vector operations

A := A + alpha*x*y**T

where alpha is a scalar, x and y are vectors, and A is an m by n matrix.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] m: rocblas_int
• [in] n: rocblas_int
• [in] alpha: specifies the scalar alpha.
• [in] x: pointer storing vector x on the GPU.
• [in] incx: rocblas_int specifies the increment for the elements of x.
• [in] y: pointer storing vector y on the GPU.
• [in] incy: rocblas_int specifies the increment for the elements of y.
• [inout] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int specifies the leading dimension of A.

rocblas_status rocblas_dger(rocblas_handle handle, rocblas_int m, rocblas_int n, const double *alpha, const double *x, rocblas_int incx, const double *y, rocblas_int incy, double *A, rocblas_int lda)

rocblas_status rocblas_ssyr(rocblas_handle handle, rocblas_fill uplo, rocblas_int n, const float *alpha, const float *x, rocblas_int incx, float *A, rocblas_int lda)
BLAS Level 2 API. xSYR performs the matrix-vector operations

A := A + alpha*x*x**T

where alpha is a scalar, x is a vector, and A is an n by n symmetric matrix.

Parameters


• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] n: rocblas_int
• [in] alpha: specifies the scalar alpha.
• [in] x: pointer storing vector x on the GPU.
• [in] incx: rocblas_int specifies the increment for the elements of x.
• [inout] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int specifies the leading dimension of A.

rocblas_status rocblas_dsyr(rocblas_handle handle, rocblas_fill uplo, rocblas_int n, const double *alpha, const double *x, rocblas_int incx, double *A, rocblas_int lda)

rocblas_status rocblas_strtri(rocblas_handle handle, rocblas_fill uplo, rocblas_diagonal diag, rocblas_int n, const float *A, rocblas_int lda, float *invA, rocblas_int ldinvA)
BLAS Level 3 API. trtri computes the inverse of a matrix A, namely invA, and writes the result into invA.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] uplo: rocblas_fill. specifies whether the upper 'rocblas_fill_upper' or lower 'rocblas_fill_lower'; if rocblas_fill_upper, the lower part of A is not referenced; if rocblas_fill_lower, the upper part of A is not referenced
• [in] diag: rocblas_diagonal. 'rocblas_diagonal_non_unit': A is non-unit triangular; 'rocblas_diagonal_unit': A is unit triangular
• [in] n: rocblas_int. size of matrix A and invA
• [in] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int specifies the leading dimension of A.

rocblas_status rocblas_dtrtri(rocblas_handle handle, rocblas_fill uplo, rocblas_diagonal diag, rocblas_int n, const double *A, rocblas_int lda, double *invA, rocblas_int ldinvA)

rocblas_status rocblas_strtri_batched(rocblas_handle handle, rocblas_fill uplo, rocblas_diagonal diag, rocblas_int n, const float *A, rocblas_int lda, rocblas_int stride_a, float *invA, rocblas_int ldinvA, rocblas_int bsinvA, rocblas_int batch_count)
BLAS Level 3 API. trtri computes the inverse of a matrix A

inv(A);

Parameters


• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] uplo: rocblas_fill. specifies whether the upper 'rocblas_fill_upper' or lower 'rocblas_fill_lower'
• [in] diag: rocblas_diagonal. 'rocblas_diagonal_non_unit': A is non-unit triangular; 'rocblas_diagonal_unit': A is unit triangular
• [in] n: rocblas_int.
• [in] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int specifies the leading dimension of A.
• [in] stride_a: rocblas_int "batch stride a": stride from the start of one "A" matrix to the next

rocblas_status rocblas_dtrtri_batched(rocblas_handle handle, rocblas_fill uplo, rocblas_diagonal diag, rocblas_int n, const double *A, rocblas_int lda, rocblas_int stride_a, double *invA, rocblas_int ldinvA, rocblas_int bsinvA, rocblas_int batch_count)

rocblas_status rocblas_strsm(rocblas_handle handle, rocblas_side side, rocblas_fill uplo, rocblas_operation transA, rocblas_diagonal diag, rocblas_int m, rocblas_int n, const float *alpha, const float *A, rocblas_int lda, float *B, rocblas_int ldb)
BLAS Level 3 API. trsm solves

op(A)*X = alpha*B or X*op(A) = alpha*B,

where alpha is a scalar, X and B are m by n matrices, A is a triangular matrix and op( A ) is one of

op( A ) = A or op( A ) = A^T or op( A ) = A^H.

The matrix X is overwritten on B.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] side: rocblas_side. rocblas_side_left: op(A)*X = alpha*B. rocblas_side_right: X*op(A) = alpha*B.
• [in] uplo: rocblas_fill. rocblas_fill_upper: A is an upper triangular matrix. rocblas_fill_lower: A is a lower triangular matrix.
• [in] transA: rocblas_operation. rocblas_operation_none: op(A) = A. rocblas_operation_transpose: op(A) = A^T. rocblas_operation_conjugate_transpose: op(A) = A^H.
• [in] diag: rocblas_diagonal. rocblas_diagonal_unit: A is assumed to be unit triangular. rocblas_diagonal_non_unit: A is not assumed to be unit triangular.
• [in] m: rocblas_int. m specifies the number of rows of B. m >= 0.
• [in] n: rocblas_int. n specifies the number of columns of B. n >= 0.
• [in] alpha: alpha specifies the scalar alpha. When alpha is zero then A is not referenced and B need not be set before entry.


• [in] A: pointer storing matrix A on the GPU, of dimension ( lda, k ), where k is m when rocblas_side_left and is n when rocblas_side_right; only the upper/lower triangular part is accessed.
• [in] lda: rocblas_int. lda specifies the first dimension of A. if side = rocblas_side_left, lda >= max( 1, m ); if side = rocblas_side_right, lda >= max( 1, n ).
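A hedged sketch of the single-precision TRSM above, solving A*X = alpha*B for a small lower-triangular A; the matrix contents are invented for illustration and no status checking is shown.

#include <vector>
#include <hip/hip_runtime.h>
#include "rocblas.h"

int main()
{
    rocblas_int m = 2, n = 1;
    float alpha = 1.0f;
    // Lower-triangular A (2x2, column-major): [2 0; 1 3]
    std::vector<float> hA = {2, 1, 0, 3};
    // Right-hand side B (2x1); the solution X overwrites B
    std::vector<float> hB = {4, 7};

    float *dA, *dB;
    hipMalloc(&dA, hA.size() * sizeof(float));
    hipMalloc(&dB, hB.size() * sizeof(float));
    hipMemcpy(dA, hA.data(), hA.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dB, hB.data(), hB.size() * sizeof(float), hipMemcpyHostToDevice);

    rocblas_handle handle;
    rocblas_create_handle(&handle);

    rocblas_strsm(handle, rocblas_side_left, rocblas_fill_lower,
                  rocblas_operation_none, rocblas_diagonal_non_unit,
                  m, n, &alpha, dA, m, dB, m);

    hipMemcpy(hB.data(), dB, hB.size() * sizeof(float), hipMemcpyDeviceToHost);
    // hB now holds X = A^{-1} * B, i.e. {2, 5/3}

    rocblas_destroy_handle(handle);
    hipFree(dA); hipFree(dB);
    return 0;
}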

rocblas_status rocblas_dtrsm(rocblas_handle handle, rocblas_side side, rocblas_fill uplo, rocblas_operation transA, rocblas_diagonal diag, rocblas_int m, rocblas_int n, const double *alpha, const double *A, rocblas_int lda, double *B, rocblas_int ldb)

rocblas_status rocblas_hgemm(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const rocblas_half *alpha, const rocblas_half *A, rocblas_int lda, const rocblas_half *B, rocblas_int ldb, const rocblas_half *beta, rocblas_half *C, rocblas_int ldc)
BLAS Level 3 API. xGEMM performs one of the matrix-matrix operations

C = alpha*op( A )*op( B ) + beta*C,

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B and C are matrices, with op( A ) an m by k matrix, op( B ) a k by n matrix and C an m by n matrix.

Parameters
• [in] handle: rocblas_handle, handle to the rocblas library context queue.
• [in] transA: rocblas_operation, specifies the form of op( A )
• [in] transB: rocblas_operation, specifies the form of op( B )
• [in] m: rocblas_int, number of rows of matrices op( A ) and C
• [in] n: rocblas_int, number of columns of matrices op( B ) and C
• [in] k: rocblas_int, number of columns of matrix op( A ) and number of rows of matrix op( B )
• [in] alpha: specifies the scalar alpha.
• [in] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int, specifies the leading dimension of A.
• [in] B: pointer storing matrix B on the GPU.
• [in] ldb: rocblas_int, specifies the leading dimension of B.
• [in] beta: specifies the scalar beta.
• [inout] C: pointer storing matrix C on the GPU.
• [in] ldc: rocblas_int, specifies the leading dimension of C.
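The following is a minimal sketch of GEMM using the single-precision variant declared below (rocblas_sgemm), multiplying two small column-major matrices; sizes and values are illustrative assumptions and error checking is omitted.

#include <vector>
#include <hip/hip_runtime.h>
#include "rocblas.h"

int main()
{
    // C (m x n) = alpha * A (m x k) * B (k x n) + beta * C, column-major
    rocblas_int m = 2, n = 2, k = 2;
    float alpha = 1.0f, beta = 0.0f;
    std::vector<float> hA = {1, 3, 2, 4};   // A = [1 2; 3 4]
    std::vector<float> hB = {5, 7, 6, 8};   // B = [5 6; 7 8]
    std::vector<float> hC(m * n, 0.0f);

    float *dA, *dB, *dC;
    hipMalloc(&dA, hA.size() * sizeof(float));
    hipMalloc(&dB, hB.size() * sizeof(float));
    hipMalloc(&dC, hC.size() * sizeof(float));
    hipMemcpy(dA, hA.data(), hA.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dB, hB.data(), hB.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dC, hC.data(), hC.size() * sizeof(float), hipMemcpyHostToDevice);

    rocblas_handle handle;
    rocblas_create_handle(&handle);

    rocblas_sgemm(handle, rocblas_operation_none, rocblas_operation_none,
                  m, n, k, &alpha, dA, m, dB, k, &beta, dC, m);

    hipMemcpy(hC.data(), dC, hC.size() * sizeof(float), hipMemcpyDeviceToHost);

    rocblas_destroy_handle(handle);
    hipFree(dA); hipFree(dB); hipFree(dC);
    return 0;
}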


rocblas_status rocblas_sgemm(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const float *alpha, const float *A, rocblas_int lda, const float *B, rocblas_int ldb, const float *beta, float *C, rocblas_int ldc)

rocblas_status rocblas_dgemm(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const double *alpha, const double *A, rocblas_int lda, const double *B, rocblas_int ldb, const double *beta, double *C, rocblas_int ldc)

rocblas_status rocblas_hgemm_strided_batched(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const rocblas_half *alpha, const rocblas_half *A, rocblas_int lda, rocblas_int stride_a, const rocblas_half *B, rocblas_int ldb, rocblas_int stride_b, const rocblas_half *beta, rocblas_half *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count)
BLAS Level 3 API. xGEMM_STRIDED_BATCHED performs one of the strided batched matrix-matrix operations

C[i*stride_c] = alpha*op( A[i*stride_a] )*op( B[i*stride_b] ) + beta*C[i*stride_c], for i in [0, batch_count-1]

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B and C are strided batched matrices, with op( A ) an m by k by batch_count strided_batched matrix, op( B ) a k by n by batch_count strided_batched matrix and C an m by n by batch_count strided_batched matrix.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] transA: rocblas_operation specifies the form of op( A )
• [in] transB: rocblas_operation specifies the form of op( B )
• [in] m: rocblas_int. matrix dimension m.
• [in] n: rocblas_int. matrix dimension n.
• [in] k: rocblas_int. matrix dimension k.
• [in] alpha: specifies the scalar alpha.
• [in] A: pointer storing strided batched matrix A on the GPU.
• [in] lda: rocblas_int specifies the leading dimension of "A".
• [in] stride_a: rocblas_int stride from the start of one "A" matrix to the next
• [in] B: pointer storing strided batched matrix B on the GPU.


• [in] ldb: rocblas_int specifies the leading dimension of "B".
• [in] stride_b: rocblas_int stride from the start of one "B" matrix to the next
• [in] beta: specifies the scalar beta.
• [inout] C: pointer storing strided batched matrix C on the GPU.
• [in] ldc: rocblas_int specifies the leading dimension of "C".
• [in] stride_c: rocblas_int stride from the start of one "C" matrix to the next
• [in] batch_count: rocblas_int number of gemm operations in the batch
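A hedged sketch of the strided batched GEMM parameters above, using the single-precision variant declared below; two small matrices are packed back to back in memory, and the strides, sizes and fill values are arbitrary choices for the example.

#include <vector>
#include <hip/hip_runtime.h>
#include "rocblas.h"

int main()
{
    // Two independent 2x2 GEMMs laid out contiguously (stride = one matrix)
    rocblas_int m = 2, n = 2, k = 2, batch_count = 2;
    rocblas_int lda = m, ldb = k, ldc = m;
    rocblas_int stride_a = m * k, stride_b = k * n, stride_c = m * n;
    float alpha = 1.0f, beta = 0.0f;

    std::vector<float> hA(stride_a * batch_count, 1.0f);
    std::vector<float> hB(stride_b * batch_count, 2.0f);
    std::vector<float> hC(stride_c * batch_count, 0.0f);

    float *dA, *dB, *dC;
    hipMalloc(&dA, hA.size() * sizeof(float));
    hipMalloc(&dB, hB.size() * sizeof(float));
    hipMalloc(&dC, hC.size() * sizeof(float));
    hipMemcpy(dA, hA.data(), hA.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dB, hB.data(), hB.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dC, hC.data(), hC.size() * sizeof(float), hipMemcpyHostToDevice);

    rocblas_handle handle;
    rocblas_create_handle(&handle);

    rocblas_sgemm_strided_batched(handle, rocblas_operation_none, rocblas_operation_none,
                                  m, n, k, &alpha,
                                  dA, lda, stride_a,
                                  dB, ldb, stride_b,
                                  &beta, dC, ldc, stride_c,
                                  batch_count);

    hipMemcpy(hC.data(), dC, hC.size() * sizeof(float), hipMemcpyDeviceToHost);

    rocblas_destroy_handle(handle);
    hipFree(dA); hipFree(dB); hipFree(dC);
    return 0;
}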

rocblas_status rocblas_sgemm_strided_batched(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const float *alpha, const float *A, rocblas_int lda, rocblas_int stride_a, const float *B, rocblas_int ldb, rocblas_int stride_b, const float *beta, float *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count)

rocblas_status rocblas_dgemm_strided_batched(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const double *alpha, const double *A, rocblas_int lda, rocblas_int stride_a, const double *B, rocblas_int ldb, rocblas_int stride_b, const double *beta, double *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count)

rocblas_status rocblas_hgemm_kernel_name(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const rocblas_half *alpha, const rocblas_half *A, rocblas_int lda, rocblas_int stride_a, const rocblas_half *B, rocblas_int ldb, rocblas_int stride_b, const rocblas_half *beta, rocblas_half *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count)

rocblas_status rocblas_sgemm_kernel_name(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const float *alpha, const float *A, rocblas_int lda, rocblas_int stride_a, const float *B, rocblas_int ldb, rocblas_int stride_b, const float *beta, float *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count)

rocblas_status rocblas_dgemm_kernel_name(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, rocblas_int k, const double *alpha, const double *A, rocblas_int lda, rocblas_int stride_a, const double *B, rocblas_int ldb, rocblas_int stride_b, const double *beta, double *C, rocblas_int ldc, rocblas_int stride_c, rocblas_int batch_count)


rocblas_status rocblas_sgeam(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, const float *alpha, const float *A, rocblas_int lda, const float *beta, const float *B, rocblas_int ldb, float *C, rocblas_int ldc)
BLAS Level 3 API. xGEAM performs one of the matrix-matrix operations

C = alpha*op( A ) + beta*op( B ),

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B and C are matrices, with op( A ) an m by n matrix, op( B ) an m by n matrix, and C an m by n matrix.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] transA: rocblas_operation specifies the form of op( A )
• [in] transB: rocblas_operation specifies the form of op( B )
• [in] m: rocblas_int.
• [in] n: rocblas_int.
• [in] alpha: specifies the scalar alpha.
• [in] A: pointer storing matrix A on the GPU.
• [in] lda: rocblas_int specifies the leading dimension of A.
• [in] beta: specifies the scalar beta.
• [in] B: pointer storing matrix B on the GPU.
• [in] ldb: rocblas_int specifies the leading dimension of B.
• [inout] C: pointer storing matrix C on the GPU.
• [in] ldc: rocblas_int specifies the leading dimension of C.

rocblas_status rocblas_dgeam(rocblas_handle handle, rocblas_operation transa, rocblas_operation transb, rocblas_int m, rocblas_int n, const double *alpha, const double *A, rocblas_int lda, const double *beta, const double *B, rocblas_int ldb, double *C, rocblas_int ldc)

rocblas_status rocblas_gemm_ex(rocblas_handle handle, rocblas_operation trans_a, rocblas_operation trans_b, rocblas_int m, rocblas_int n, rocblas_int k, const void *alpha, const void *a, rocblas_datatype a_type, rocblas_int lda, const void *b, rocblas_datatype b_type, rocblas_int ldb, const void *beta, const void *c, rocblas_datatype c_type, rocblas_int ldc, void *d, rocblas_datatype d_type, rocblas_int ldd, rocblas_datatype compute_type, rocblas_gemm_algo algo, int32_t solution_index, uint32_t flags, size_t *workspace_size, void *workspace)


rocblas_status rocblas_gemm_strided_batched_ex(rocblas_handle handle, rocblas_operation trans_a, rocblas_operation trans_b, rocblas_int m, rocblas_int n, rocblas_int k, const void *alpha, const void *a, rocblas_datatype a_type, rocblas_int lda, rocblas_long stride_a, const void *b, rocblas_datatype b_type, rocblas_int ldb, rocblas_long stride_b, const void *beta, const void *c, rocblas_datatype c_type, rocblas_int ldc, rocblas_long stride_c, void *d, rocblas_datatype d_type, rocblas_int ldd, rocblas_long stride_d, rocblas_int batch_count, rocblas_datatype compute_type, rocblas_gemm_algo algo, int32_t solution_index, uint32_t flags, size_t *workspace_size, void *workspace)
BLAS EX API. GEMM_EX performs one of the matrix-matrix operations

D = alpha*op( A )*op( B ) + beta*C,

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B, C, and D are matrices, with op( A ) an m by k matrix, op( B ) a k by n matrix and C and D are m by n matrices.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] transA: rocblas_operation specifies the form of op( A )
• [in] transB: rocblas_operation specifies the form of op( B )
• [in] m: rocblas_int. matrix dimension m
• [in] n: rocblas_int. matrix dimension n
• [in] k: rocblas_int. matrix dimension k
• [in] alpha: const void * specifies the scalar alpha. Same datatype as compute_type.
• [in] a: void * pointer storing matrix A on the GPU.
• [in] a_type: rocblas_datatype specifies the datatype of matrix A
• [in] lda: rocblas_int specifies the leading dimension of A.
• [in] b: void * pointer storing matrix B on the GPU.
• [in] b_type: rocblas_datatype specifies the datatype of matrix B
• [in] ldb: rocblas_int specifies the leading dimension of B.
• [in] beta: const void * specifies the scalar beta. Same datatype as compute_type.
• [in] c: void * pointer storing matrix C on the GPU.


• [in] c_type: rocblas_datatype specifies the datatype of matrix C
• [in] ldc: rocblas_int specifies the leading dimension of C.
• [out] d: void * pointer storing matrix D on the GPU.
• [in] d_type: rocblas_datatype specifies the datatype of matrix D
• [in] ldd: rocblas_int specifies the leading dimension of D.
• [in] compute_type: rocblas_datatype specifies the datatype of computation
• [in] algo: rocblas_gemm_algo enumerant specifying the algorithm type.
• [in] solution_index: int32_t reserved for future use
• [in] flags: uint32_t reserved for future use

rocblas_status rocblas_get_version_string(char *buf, size_t len)

BLAS EX API. GEMM_STRIDED_BATCHED_EX performs one of the strided_batched matrix-matrix operations

D[i*stride_d] = alpha*op( A[i*stride_a] )*op( B[i*stride_b] ) + beta*C[i*stride_c], for i in [0, batch_count-1]

where op( X ) is one of

op( X ) = X or op( X ) = X**T or op( X ) = X**H,

alpha and beta are scalars, and A, B, C, and D are strided_batched matrices, with op( A ) an m by k by batch_count strided_batched matrix, op( B ) a k by n by batch_count strided_batched matrix and C and D are m by n by batch_count strided_batched matrices. The strided_batched matrices are multiple matrices separated by a constant stride. The number of matrices is batch_count.

Parameters
• [in] handle: rocblas_handle. handle to the rocblas library context queue.
• [in] transA: rocblas_operation specifies the form of op( A )
• [in] transB: rocblas_operation specifies the form of op( B )
• [in] m: rocblas_int. matrix dimension m
• [in] n: rocblas_int. matrix dimension n
• [in] k: rocblas_int. matrix dimension k
• [in] alpha: const void * specifies the scalar alpha. Same datatype as compute_type.
• [in] a: void * pointer storing matrix A on the GPU.
• [in] a_type: rocblas_datatype specifies the datatype of matrix A
• [in] lda: rocblas_int specifies the leading dimension of A.
• [in] stride_a: rocblas_long specifies stride from start of one "A" matrix to the next


• [in] b: void * pointer storing matrix B on the GPU.
• [in] b_type: rocblas_datatype specifies the datatype of matrix B
• [in] ldb: rocblas_int specifies the leading dimension of B.
• [in] stride_b: rocblas_long specifies stride from start of one "B" matrix to the next
• [in] beta: const void * specifies the scalar beta. Same datatype as compute_type.
• [in] c: void * pointer storing matrix C on the GPU.
• [in] c_type: rocblas_datatype specifies the datatype of matrix C
• [in] ldc: rocblas_int specifies the leading dimension of C.
• [in] stride_c: rocblas_long specifies stride from start of one "C" matrix to the next
• [out] d: void * pointer storing matrix D on the GPU.
• [in] d_type: rocblas_datatype specifies the datatype of matrix D
• [in] ldd: rocblas_int specifies the leading dimension of D.
• [in] stride_d: rocblas_long specifies stride from start of one "D" matrix to the next
• [in] batch_count: rocblas_int number of gemm operations in the batch
• [in] compute_type: rocblas_datatype specifies the datatype of computation
• [in] algo: rocblas_gemm_algo enumerant specifying the algorithm type.
• [in] solution_index: int32_t reserved for future use
• [in] flags: uint32_t reserved for future use

file rocblas-types.h
rocblas-types.h defines data types used by rocblas

Defines

_ROCBLAS_TYPES_H_

Typedefs

typedef int32_t rocblas_int
 To specify whether int32 or int64 is used.

typedef int64_t rocblas_long

typedef float2 rocblas_float_complex

typedef double2 rocblas_double_complex

typedef uint16_t rocblas_half

typedef float2 rocblas_half_complex

typedef struct _rocblas_handle *rocblas_handle


Enums

enum rocblas_operation
 Used to specify whether the matrix is to be transposed or not. parameter constants. numbering is consistent with CBLAS, ACML and most standard C BLAS libraries
 Values:
 rocblas_operation_none = 111: Operate with the matrix.
 rocblas_operation_transpose = 112: Operate with the transpose of the matrix.
 rocblas_operation_conjugate_transpose = 113: Operate with the conjugate transpose of the matrix.

enum rocblas_fill
 Used by the Hermitian, symmetric and triangular matrix routines to specify whether the upper or lower triangle is being referenced.
 Values:
 rocblas_fill_upper = 121: Upper triangle.
 rocblas_fill_lower = 122: Lower triangle.
 rocblas_fill_full = 123

enum rocblas_diagonal
 It is used by the triangular matrix routines to specify whether the matrix is unit triangular.
 Values:
 rocblas_diagonal_non_unit = 131: Non-unit triangular.
 rocblas_diagonal_unit = 132: Unit triangular.

enum rocblas_side
 Indicates the side matrix A is located relative to matrix B during multiplication.
 Values:
 rocblas_side_left = 141: Multiply general matrix by symmetric, Hermitian or triangular matrix on the left.
 rocblas_side_right = 142: Multiply general matrix by symmetric, Hermitian or triangular matrix on the right.
 rocblas_side_both = 143

enum rocblas_status
 rocblas status codes definition
 Values:
 rocblas_status_success = 0: success


 rocblas_status_invalid_handle = 1: handle not initialized, invalid or null
 rocblas_status_not_implemented = 2: function is not implemented
 rocblas_status_invalid_pointer = 3: invalid pointer parameter
 rocblas_status_invalid_size = 4: invalid size parameter
 rocblas_status_memory_error = 5: failed internal memory allocation, copy or dealloc
 rocblas_status_internal_error = 6: other internal library failure

enum rocblas_datatype
 Indicates the precision width of data stored in a blas type.
 Values:
 rocblas_datatype_f16_r = 150
 rocblas_datatype_f32_r = 151
 rocblas_datatype_f64_r = 152
 rocblas_datatype_f16_c = 153
 rocblas_datatype_f32_c = 154
 rocblas_datatype_f64_c = 155
 rocblas_datatype_i8_r = 160
 rocblas_datatype_u8_r = 161
 rocblas_datatype_i32_r = 162
 rocblas_datatype_u32_r = 163
 rocblas_datatype_i8_c = 164
 rocblas_datatype_u8_c = 165
 rocblas_datatype_i32_c = 166
 rocblas_datatype_u32_c = 167

enum rocblas_pointer_mode
 Indicates the pointer is device pointer or host pointer.
 Values:
 rocblas_pointer_mode_host = 0
 rocblas_pointer_mode_device = 1

enum rocblas_layer_mode
 Indicates if layer is active with bitmask.
 Values:
 rocblas_layer_mode_none = 0b0000000000
 rocblas_layer_mode_log_trace = 0b0000000001


 rocblas_layer_mode_log_bench = 0b0000000010
 rocblas_layer_mode_log_profile = 0b0000000100

enum rocblas_gemm_algo
 Indicates if layer is active with bitmask.
 Values:
 rocblas_gemm_algo_standard = 0b0000000000

file rocblas.h
#include "rocblas-export.h" #include "rocblas-version.h" #include "rocblas-types.h" #include "rocblas-auxiliary.h" #include "rocblas-functions.h"
rocblas.h includes other *.h and exposes a common interface

Defines

_ROCBLAS_H_

file buildinfo.cpp
#include "definitions.h" #include "rocblas-types.h" #include "rocblas-functions.h" #include "rocblas-version.h"

Defines

TO_STR2(x)
TO_STR(x)
VERSION_STRING

Functions

rocblas_status rocblas_get_version_string(char *buf, size_t len)



file handle.cpp
#include "handle.h"

Functions

static void open_log_stream(const char *environment_variable_name, std::ostream *&log_os, std::ofstream &log_ofs)
 Logging function.


open_log_stream opens stream log_os for logging. If the environment variable with name environment_variable_name is not set, then log_os streams to std::cerr. Else a file is opened at the full logfile path contained in the environment variable. If opening the file succeeds, stream to the file, else stream to std::cerr.

Parameters
• [out] log_os: std::ostream*& Output stream. Streams to std::cerr if environment_variable_name is not set, else set to stream to log_ofs
• [in] environment_variable_name: const char* Name of environment variable that contains the full logfile path.
• [out] log_ofs: std::ofstream& Output file stream. If log_ofs->is_open()==true, then log_os will stream to log_ofs. Else it will stream to std::cerr.

file rocblas_auxiliary.cpp
#include "definitions.h" #include "rocblas-types.h" #include "handle.h" #include "logging.h" #include "utility.h" #include "rocblas_unique_ptr.hpp" #include "rocblas-auxiliary.h"

Functions

rocblas_pointer_mode rocblas_pointer_to_mode(void *ptr)
 indicates whether the pointer is on the host or device. Currently the HIP API can only recognize whether the input ptr is on the device or not; it cannot recognize whether it is on the host or not.

rocblas_status rocblas_get_pointer_mode(rocblas_handle handle, rocblas_pointer_mode *mode)

rocblas_status rocblas_set_pointer_mode(rocblas_handle handle, rocblas_pointer_mode mode)

rocblas_status rocblas_create_handle(rocblas_handle *handle)

rocblas_status rocblas_destroy_handle(rocblas_handle handle)

rocblas_status rocblas_set_stream(rocblas_handle handle, hipStream_t stream_id)

rocblas_status rocblas_get_stream(rocblas_handle handle, hipStream_t *stream_id)

__global__ void copy_void_ptr_vector_kernel(rocblas_int n, rocblas_int elem_size, const void *x, rocblas_int incx, void *y, rocblas_int incy)

rocblas_status rocblas_set_vector(rocblas_int n, rocblas_int elem_size, const void *x_h, rocblas_int incx, void *y_d, rocblas_int incy)

rocblas_status rocblas_get_vector(rocblas_int n, rocblas_int elem_size, const void *x_d, rocblas_int incx, void *y_h, rocblas_int incy)

__global__ void copy_void_ptr_matrix_kernel(rocblas_int rows, rocblas_int cols, size_t elem_size, const void *a, rocblas_int lda, void *b, rocblas_int ldb)

rocblas_status rocblas_set_matrix(rocblas_int rows, rocblas_int cols, rocblas_int elem_size, const void *a_h, rocblas_int lda, void *b_d, rocblas_int ldb)

rocblas_status rocblas_get_matrix(rocblas_int rows, rocblas_int cols, rocblas_int elem_size, const void *a_d, rocblas_int lda, void *b_h, rocblas_int ldb)
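A short hedged sketch of the auxiliary copy helpers listed above, staging a host vector to the device and back with rocblas_set_vector and rocblas_get_vector; the element count is an arbitrary choice for the example.

#include <vector>
#include <hip/hip_runtime.h>
#include "rocblas.h"

int main()
{
    rocblas_int n = 8;
    std::vector<float> h_src(n, 1.0f), h_dst(n, 0.0f);

    float *d_vec;
    hipMalloc(&d_vec, n * sizeof(float));

    // Host -> device copy with unit increments on both sides
    rocblas_set_vector(n, sizeof(float), h_src.data(), 1, d_vec, 1);
    // Device -> host copy back
    rocblas_get_vector(n, sizeof(float), d_vec, 1, h_dst.data(), 1);

    hipFree(d_vec);
    return 0;
}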

Variables

constexpr size_t VEC_BUFF_MAX_BYTES = 1048576
constexpr rocblas_int NB_X = 256


constexpr size_t MAT_BUFF_MAX_BYTES = 1048576
constexpr rocblas_int MATRIX_DIM_X = 128
constexpr rocblas_int MATRIX_DIM_Y = 8

file status.cpp
#include "rocblas.h" #include "status.h"

Functions

rocblas_status get_rocblas_status_for_hip_status(hipError_t status)

dir ROCm_Libraries/rocBLAS
dir ROCm_Libraries
dir ROCm_Libraries/rocBLAS/src
dir ROCm_Libraries/rocBLAS/src/src

2.9.2 hipBLAS

2.9.2.1 Introduction

Please refer to the hipBLAS GitHub repository for the source.
hipBLAS is a BLAS marshalling library, with multiple supported backends. It sits between the application and a 'worker' BLAS library, marshalling inputs into the backend library and marshalling results back to the application. hipBLAS exports an interface that does not require the client to change, regardless of the chosen backend. Currently, hipBLAS supports rocBLAS and cuBLAS as backends.

2.9.2.1.1 Installing pre-built packages

Download pre-built packages either from ROCm's package servers or by clicking the github releases tab and manually downloading, which could be newer. Release notes are available for each release on the releases tab.

sudo apt update && sudo apt install hipblas

2.9.2.1.2 Quickstart hipBLAS build

Bash helper build script (Ubuntu only) The root of this repository has a helper bash script install.sh to build and install hipBLAS on Ubuntu with a single command. It does not take a lot of options and hard-codes configuration that can be specified through invoking cmake directly, but it’s a great way to get started quickly and can serve as an example of how to build/install. A few commands in the script need sudo access, so it may prompt you for a password.

./install.sh -h  -- shows help
./install.sh -id -- build library, build dependencies and install (-d flag only needs to be passed once on a system)

Manual build (all supported platforms) If you use a distro other than Ubuntu, or would like more control over the build process, the hipBLAS build wiki has helpful information on how to configure cmake and manually build.


2.9.2.2 Build

2.9.2.2.1 Dependencies For Building Library

CMake 3.5 or later
The build infrastructure for hipBLAS is based on CMake v3.5. This is the version of cmake available on ROCm supported platforms. If you are on a headless machine without the X Window System, we recommend using ccmake; if you have access to X, we recommend using cmake-gui.

Install one-liners for cmake:

Ubuntu: sudo apt install cmake-qt-gui
Fedora: sudo dnf install cmake-gui

2.9.2.2.2 Build Library Using Script (Ubuntu only)

The root of this repository has a helper bash script install.sh to build and install hipBLAS on Ubuntu with a single command. It does not take a lot of options and hard-codes configuration that can be specified through invoking cmake directly, but it’s a great way to get started quickly and can serve as an example of how to build/install. A few commands in the script need sudo access, so it may prompt you for a password.

./install.sh -h  -- shows help
./install.sh -id -- build library, build dependencies and install (-d flag only needs to be passed once on a system)

2.9.2.2.3 Build Library Using Individual Commands

mkdir -p [HIPBLAS_BUILD_DIR]/release
cd [HIPBLAS_BUILD_DIR]/release
# Default install location is in /opt/rocm, define -DCMAKE_INSTALL_PREFIX= to specify other
# Default build config is 'Release', define -DCMAKE_BUILD_TYPE= to specify other
CXX=/opt/rocm/bin/hcc ccmake [HIPBLAS_SOURCE]
make -j$(nproc)
sudo make install # sudo required if installing into system directory such as /opt/rocm

2.9.2.2.4 Build Library + Tests + Benchmarks + Samples Using Individual Commands

The repository contains source for clients that serve as samples, tests and benchmarks. Clients source can be found in the clients subdir.

Dependencies (only necessary for hipBLAS clients)

The hipBLAS samples have no external dependencies, but our unit test and benchmarking applications do. These clients introduce the following dependencies:
1. boost
2. lapack
• lapack itself brings a dependency on a fortran compiler


3. googletest

Linux distros typically have an easy installation mechanism for boost through the native package manager.

Ubuntu: sudo apt install libboost-program-options-dev
Fedora: sudo dnf install boost-program-options

Unfortunately, googletest and lapack are not as easy to install. Many distros do not provide a googletest package with pre-compiled libraries, and the lapack packages do not have the necessary cmake config files for cmake to configure linking the cblas library. hipBLAS provides a cmake script that builds the above dependencies from source. This is an optional step; users can provide their own builds of these dependencies and help cmake find them by setting the CMAKE_PREFIX_PATH definition. The following is a sequence of steps to build dependencies and install them to the cmake default /usr/local. (optional, one time only)

mkdir -p [HIPBLAS_BUILD_DIR]/release/deps
cd [HIPBLAS_BUILD_DIR]/release/deps
ccmake -DBUILD_BOOST=OFF [HIPBLAS_SOURCE]/deps # assuming boost is installed through package manager as above
make -j$(nproc) install

Once dependencies are available on the system, it is possible to configure the clients to build. This requires a few extra cmake flags to the library cmake configure script. If the dependencies are not installed into system defaults (like /usr/local), you should pass the CMAKE_PREFIX_PATH to cmake to help find them.

-DCMAKE_PREFIX_PATH=""

# Default install location is in /opt/rocm, use -DCMAKE_INSTALL_PREFIX= to specify other
CXX=/opt/rocm/bin/hcc ccmake -DBUILD_CLIENTS_TESTS=ON -DBUILD_CLIENTS_BENCHMARKS=ON [HIPBLAS_SOURCE]
make -j$(nproc)
sudo make install # sudo required if installing into system directory such as /opt/rocm

2.9.2.2.5 Common build problems

• Issue: HIP (/opt/rocm/hip) was built using hcc 1.0.xxx-xxx-xxx-xxx, but you are using /opt/rocm/hcc/hcc with version 1.0.yyy-yyy-yyy-yyy from hipcc (version does not match). Please rebuild HIP including cmake or update HCC_HOME variable.
Solution: Download HIP from github, use hcc to build it from source, and then use the built HIP instead of the one in /opt/rocm/hip, or simply overwrite the /opt/rocm/hip location with the new build.
• Issue: For Carrizo - HCC RUNTIME ERROR: Fail to find compatible kernel
Solution: Add the following to the cmake command when configuring: -DCMAKE_CXX_FLAGS="--amdgpu-target=gfx801"
• Issue: For MI25 (Vega10 Server) - HCC RUNTIME ERROR: Fail to find compatible kernel
Solution: export HCC_AMDGPU_TARGET=gfx900


2.9.2.3 Running

2.9.2.3.1 Notice

Before reading this Wiki, it is assumed hipBLAS with the client applications has been successfully built as described in Build hipBLAS libraries and verification code.

Samples

cd [BUILD_DIR]/clients/staging
./example-sscal

Example code that calls hipBLAS: see also the blog post Example C code calling hipBLAS routine.

Unit tests

Run tests with the following:

cd [BUILD_DIR]/clients/staging
./hipblas-test

To run specific tests, use --gtest_filter=match where match is a ':'-separated list of wildcard patterns (called the positive patterns) optionally followed by a '-' and another ':'-separated pattern list (called the negative patterns). For example, run gemv tests with the following:

cd [BUILD_DIR]/clients/staging
./hipblas-test --gtest_filter=*gemv*

Functions supported

A list of exported functions from hipblas can be found on the wiki.

Platform: rocBLAS or cuBLAS

hipBLAS is a marshalling library, so it runs with either rocBLAS or cuBLAS configured as the backend BLAS library, chosen at cmake configure time.

2.9.2.3.2 hipBLAS interface examples

The hipBLAS interface is compatible with rocBLAS and cuBLAS-v2 APIs. Porting a CUDA application which originally calls the cuBLAS API to an application calling hipBLAS API should be relatively straightforward. For example, the hipBLAS SGEMV interface is

2.9.2.3.3 GEMV API

hipblasStatus_t hipblasSgemv( hipblasHandle_t handle,
                              hipblasOperation_t trans,
                              int m, int n,
                              const float *alpha,
                              const float *A, int lda,
                              const float *x, int incx,
                              const float *beta,
                              float *y, int incy );


2.9.2.3.4 Batched and strided GEMM API

hipBLAS GEMM can process matrices in batches with regular strides. There are several permutations of these APIs; the following is an example that takes everything:

hipblasStatus_t hipblasSgemmStridedBatched( hipblasHandle_t handle,
                                            hipblasOperation_t transa,
                                            hipblasOperation_t transb,
                                            int m, int n, int k,
                                            const float *alpha,
                                            const float *A, int lda, long long bsa,
                                            const float *B, int ldb, long long bsb,
                                            const float *beta,
                                            float *C, int ldc, long long bsc,
                                            int batchCount);

hipBLAS assumes matrices A and vectors x, y are allocated in GPU memory space filled with data. Users are responsible for copying data from/to the host and device memory.
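As a hedged end-to-end sketch of the hipBLAS interface above, the following allocates device buffers, calls hipblasSgemv through the marshalling layer, and copies the result back; the sizes and data are illustrative assumptions and error checking is omitted.

#include <vector>
#include <hip/hip_runtime.h>
#include "hipblas.h"

int main()
{
    int m = 3, n = 2;
    float alpha = 1.0f, beta = 0.0f;
    std::vector<float> hA = {1, 2, 3, 4, 5, 6};  // column-major m x n
    std::vector<float> hx = {1, 1};
    std::vector<float> hy(m, 0.0f);

    float *dA, *dx, *dy;
    hipMalloc(&dA, hA.size() * sizeof(float));
    hipMalloc(&dx, hx.size() * sizeof(float));
    hipMalloc(&dy, hy.size() * sizeof(float));
    hipMemcpy(dA, hA.data(), hA.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dx, hx.data(), hx.size() * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dy, hy.data(), hy.size() * sizeof(float), hipMemcpyHostToDevice);

    hipblasHandle_t handle;
    hipblasCreate(&handle);

    // y := alpha * A * x + beta * y, dispatched to rocBLAS or cuBLAS by hipBLAS
    hipblasSgemv(handle, HIPBLAS_OP_N, m, n, &alpha, dA, m, dx, 1, &beta, dy, 1);

    hipMemcpy(hy.data(), dy, hy.size() * sizeof(float), hipMemcpyDeviceToHost);

    hipblasDestroy(handle);
    hipFree(dA); hipFree(dx); hipFree(dy);
    return 0;
}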

2.9.3 rocRAND

The rocRAND project provides functions that generate pseudo-random and quasi-random numbers. The rocRAND library is implemented in the HIP programming language and optimised for AMD's latest discrete GPUs. It is designed to run on top of AMD's Radeon Open Compute ROCm runtime, but it also works on CUDA enabled GPUs.
Additionally, the project includes a wrapper library called hipRAND which allows users to easily port CUDA applications that use the cuRAND library to the HIP layer. In a ROCm environment hipRAND uses rocRAND; in a CUDA environment cuRAND is used instead.
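A minimal hedged sketch of the rocRAND host-side C API (generator creation, uniform generation, teardown); the XORWOW engine, the seed, and the buffer size are arbitrary choices for the example, and the header path may differ between rocRAND versions.

#include <vector>
#include <hip/hip_runtime.h>
#include <rocrand.h>   // newer packages may install this as <rocrand/rocrand.h>

int main()
{
    const size_t n = 1024;

    float *d_out;
    hipMalloc(&d_out, n * sizeof(float));

    // Create a pseudo-random generator (XORWOW engine here) and seed it
    rocrand_generator gen;
    rocrand_create_generator(&gen, ROCRAND_RNG_PSEUDO_XORWOW);
    rocrand_set_seed(gen, 1234ULL);

    // Fill the device buffer with uniformly distributed floats
    rocrand_generate_uniform(gen, d_out, n);

    std::vector<float> h_out(n);
    hipMemcpy(h_out.data(), d_out, n * sizeof(float), hipMemcpyDeviceToHost);

    rocrand_destroy_generator(gen);
    hipFree(d_out);
    return 0;
}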

2.9.3.1 Supported Random Number Generators

• XORWOW
• MRG32k3a
• Mersenne Twister for Graphic Processors (MTGP32)
• Philox (4x32, 10 rounds)
• Sobol32

2.9.3.2 Requirements

• Git
• cmake (3.0.2 or later)
• C++ compiler with C++11 support
• For AMD platforms:
  – ROCm (1.7 or later)
  – HCC compiler, which must be set as C++ compiler on ROCm platform.
• For CUDA platforms:
  – HIP (hcc is not required)


  – Latest CUDA SDK
Optional:
• GTest (required only for tests; building tests is enabled by default)
  – Use GTEST_ROOT to specify GTest location (also see FindGTest)
  – Note: If GTest is not already installed, it will be automatically downloaded and built
• TestU01 (required only for crush tests)
  – Use TESTU01_ROOT_DIR to specify TestU01 location
  – Note: If TestU01 is not already installed, it will be automatically downloaded and built
• Fortran compiler (required only for Fortran wrapper)
  – gfortran is recommended.
• Python 2.7+ or 3.5+ (required only for Python wrapper)
If some dependencies are missing, the cmake script automatically downloads, builds and installs them. Setting the DEPENDENCIES_FORCE_DOWNLOAD option to ON forces the script not to use system-installed libraries, and to download all dependencies.

2.9.3.3 Build and Install

git clone https://github.com/ROCmSoftwarePlatform/rocRAND.git

# Go to rocRAND directory, create and go to build directory
cd rocRAND; mkdir build; cd build

# Configure rocRAND, setup options for your system
# Build options: BUILD_TEST, BUILD_BENCHMARK (off by default), BUILD_CRUSH_TEST (off by default)
#
# ! IMPORTANT !
# On ROCm platform set C++ compiler to HCC. You can do it by adding 'CXX=' or just
# 'CXX=hcc' before 'cmake', or setting cmake option 'CMAKE_CXX_COMPILER' to path to the HCC compiler.
#
[CXX=hcc] cmake -DBUILD_BENCHMARK=ON ../. # or cmake-gui ../.

# Build
# For ROCM-1.6, if a HCC runtime error is caught, consider setting
# HCC_AMDGPU_TARGET= in front of make as a workaround
make -j4

# Optionally, run tests if they're enabled
ctest --output-on-failure

# Install
[sudo] make install

Note: An existing gtest library in the system (especially static gtest libraries built with other compilers) may cause build failure; if errors are encountered with an existing gtest library or other dependencies, the DEPENDENCIES_FORCE_DOWNLOAD flag can be passed to cmake, as mentioned before, to help solve the problem.
Note: To disable inline assembly optimisations in rocRAND (for both the host library and the device functions provided in rocrand_kernel.h) set cmake option ENABLE_INLINE_ASM to OFF.


2.9.3.4 Running Unit Tests

# Go to rocRAND build directory
cd rocRAND; cd build

# To run all tests
ctest

# To run unit tests
./test/

2.9.3.5 Running Benchmarks

# Go to rocRAND build directory
cd rocRAND; cd build

# To run benchmark for generate functions:
# engine -> all, xorwow, mrg32k3a, mtgp32, philox, sobol32
# distribution -> all, uniform-uint, uniform-float, uniform-double, normal-float, normal-double,
#                 log-normal-float, log-normal-double, poisson
# Further option can be found using --help
./benchmark/benchmark_rocrand_generate --engine --dis

# To run benchmark for device kernel functions:
# engine -> all, xorwow, mrg32k3a, mtgp32, philox, sobol32
# distribution -> all, uniform-uint, uniform-float, uniform-double, normal-float, normal-double,
#                 log-normal-float, log-normal-double, poisson, discrete-poisson, discrete-custom
# further option can be found using --help
./benchmark/benchmark_rocrand_kernel --engine --dis

# To compare against cuRAND (cuRAND must be supported):
./benchmark/benchmark_curand_generate --engine --dis
./benchmark/benchmark_curand_kernel --engine --dis

2.9.3.6 Running Statistical Tests

# Go to rocRAND build directory
cd rocRAND; cd build

# To run "crush" test, which verifies that generated pseudorandom
# numbers are of high quality:
# engine -> all, xorwow, mrg32k3a, mtgp32, philox
./test/crush_test_rocrand --engine

# To run Pearson Chi-squared and Anderson-Darling tests, which verify
# that distribution of random number agrees with the requested distribution:
# engine -> all, xorwow, mrg32k3a, mtgp32, philox, sobol32
# distribution -> all, uniform-float, uniform-double, normal-float, normal-double,
#                 log-normal-float, log-normal-double, poisson
./test/stat_test_rocrand_generate --engine --dis

2.9.3.7 Documentation

# go to rocRAND doc directory
cd rocRAND; cd doc

# run doxygen


doxygen Doxyfile

# open html/index.html

2.9.3.8 Wrappers

• C++ wrappers for host API of rocRAND and hipRAND are in files rocrand.hpp and hiprand.hpp.
• Fortran wrappers.
• Python wrappers: rocRAND and hipRAND.

Support

Bugs and feature requests can be reported through the issue tracker.

2.9.4 rocFFT

rocFFT is a software library for computing Fast Fourier Transforms (FFT) written in HIP. It is part of AMD's software ecosystem based on ROCm. In addition to AMD GPU devices, the library can also be compiled with the CUDA compiler using HIP tools for running on Nvidia GPU devices.
The rocFFT library:
• Provides a fast and accurate platform for calculating discrete FFTs.
• Supports single and double precision floating point formats.
• Supports 1D, 2D, and 3D transforms.
• Supports computation of transforms in batches.
• Supports real and complex FFTs.
• Supports lengths that are any combination of powers of 2, 3, 5.

2.9.4.1 API design

Please refer to the rocFFT API documentation for current documentation. Work in progress.

2.9.4.2 Installing pre-built packages

Download pre-built packages either from ROCm's package servers or by clicking the github releases tab and manually downloading, which could be newer. Release notes are available for each release on the releases tab.

sudo apt update && sudo apt install rocfft

2.9.4.3 Quickstart rocFFT build

Bash helper build script (Ubuntu only)
The root of this repository has a helper bash script install.sh to build and install rocFFT on Ubuntu with a single command. It does not take a lot of options and hard-codes configuration that can be specified through invoking cmake directly, but it's a great way to get started quickly and can serve as an example of how to build/install. A few commands in the script need sudo access, so it may prompt you for a password.

* ./install -h -- shows help
* ./install -id -- build library, build dependencies and install globally (-d flag only needs to be specified once on a system)
* ./install -c --cuda -- build library and clients for cuda backend into a local directory

Manual build (all supported platforms)
If you use a distro other than Ubuntu, or would like more control over the build process, the rocfft build wiki has helpful information on how to configure cmake and manually build.

2.9.4.4 Manual build (all supported platforms)

If you use a distro other than Ubuntu, or would like more control over the build process, the rocfft build wiki has helpful information on how to configure cmake and manually build.

Library and API Documentation

Please refer to the Library documentation for current documentation.

2.9.4.5 Example

The following is a simple example code that shows how to use rocFFT to compute a 1D single precision 16-point complex forward transform.

#include <iostream>
#include <vector>
#include "hip/hip_runtime_api.h"
#include "hip/hip_vector_types.h"
#include "rocfft.h"

int main()
{
        // rocFFT gpu compute
        // ========================================

        size_t N = 16;
        size_t Nbytes = N * sizeof(float2);

        // Create HIP device buffer
        float2 *x;
        hipMalloc(&x, Nbytes);

        // Initialize data
        std::vector<float2> cx(N);
        for (size_t i = 0; i < N; i++)
        {
                cx[i].x = 1;
                cx[i].y = -1;
        }

        // Copy data to device
        hipMemcpy(x, cx.data(), Nbytes, hipMemcpyHostToDevice);

        // Create rocFFT plan
        rocfft_plan plan = NULL;
        size_t length = N;
        rocfft_plan_create(&plan, rocfft_placement_inplace,
                           rocfft_transform_type_complex_forward,
                           rocfft_precision_single, 1, &length, 1, NULL);

        // Execute plan
        rocfft_execute(plan, (void**) &x, NULL, NULL);

        // Wait for execution to finish
        hipDeviceSynchronize();

        // Destroy plan
        rocfft_plan_destroy(plan);

        // Copy result back to host
        std::vector<float2> y(N);
        hipMemcpy(y.data(), x, Nbytes, hipMemcpyDeviceToHost);

        // Print results
        for (size_t i = 0; i < N; i++)
        {
                std::cout << y[i].x << ", " << y[i].y << std::endl;
        }

        // Free device buffer
        hipFree(x);

        return 0;
}

2.9.4.6 API

This section provides details of the library API

2.9.4.6.1 Types

There are a few data structures that are internal to the library. The pointer types to these structures are given below. The user would need to use these types to create handles and pass them between different library functions.

typedef struct rocfft_plan_t *rocfft_plan
 Pointer type to plan structure. This type is used to declare a plan handle that can be initialized with rocfft_plan_create.

typedef struct rocfft_plan_description_t *rocfft_plan_description
 Pointer type to plan description structure. This type is used to declare a plan description handle that can be initialized with rocfft_plan_description_create.

typedef struct rocfft_execution_info_t *rocfft_execution_info
 Pointer type to execution info structure. This type is used to declare an execution info handle that can be initialized with rocfft_execution_info_create.

2.9.4.6.2 Library Setup and Cleanup

The following functions deal with initialization and cleanup of the library.

rocfft_status rocfft_setup()
 Library setup function, called once in program before start of library use.


rocfft_status rocfft_cleanup()
 Library cleanup function, called once in program after end of library use.

2.9.4.6.3 Plan

The following functions are used to create and destroy plan objects.

rocfft_status rocfft_plan_create(rocfft_plan *plan, rocfft_result_placement placement, rocfft_transform_type transform_type, rocfft_precision precision, size_t dimensions, const size_t *lengths, size_t number_of_transforms, const rocfft_plan_description description)
 Create an FFT plan. This API creates a plan, which the user can execute subsequently. This function takes many of the fundamental parameters needed to specify a transform. The parameters are self explanatory. The dimensions parameter can take a value of 1, 2 or 3. The 'lengths' array specifies the size of data in each dimension. Note that lengths[0] is the size of the innermost dimension, lengths[1] is the next higher dimension and so on. The 'number_of_transforms' parameter specifies how many transforms (of the same kind) need to be computed. By specifying a value greater than 1, a batch of transforms can be computed with a single API call. Additionally, a handle to a plan description can be passed for more detailed transforms. For simple transforms, this parameter can be set to null ptr.

Parameters
• [out] plan: plan handle
• [in] placement: placement of result
• [in] transform_type: type of transform
• [in] precision: precision
• [in] dimensions: dimensions
• [in] lengths: dimensions sized array of transform lengths
• [in] number_of_transforms: number of transforms
• [in] description: description handle created by rocfft_plan_description_create; can be null ptr for simple transforms

rocfft_status rocfft_plan_destroy(rocfft_plan plan)
 Destroy an FFT plan. This API frees the plan. This function destructs a plan after it is no longer needed.
 Parameters
 • [in] plan: plan handle

The following functions are used to query for information after a plan is created.

rocfft_status rocfft_plan_get_work_buffer_size(const rocfft_plan plan, size_t *size_in_bytes)
 Get work buffer size. This is one of plan query functions to obtain information regarding a plan. This API gets the work buffer size.
 Parameters
 • [in] plan: plan handle
 • [out] size_in_bytes: size of needed work buffer in bytes

rocfft_status rocfft_plan_get_print(const rocfft_plan plan)
 Print all plan information. This is one of plan query functions to obtain information regarding a plan. This API prints all plan info to stdout to help user verify plan specification.
 Parameters
 • [in] plan: plan handle
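A hedged sketch tying the plan query above to execution: the work buffer size is queried with rocfft_plan_get_work_buffer_size and, if non-zero, a user-allocated buffer is attached through an execution info handle (rocfft_execution_info_create and rocfft_execution_info_set_work_buffer are assumed from the rocFFT execution info API); the transform parameters mirror the earlier 16-point example and error checking is omitted.

#include <vector>
#include "hip/hip_runtime_api.h"
#include "hip/hip_vector_types.h"
#include "rocfft.h"

int main()
{
    rocfft_setup();

    size_t N = 16;
    float2 *x;
    hipMalloc(&x, N * sizeof(float2));

    std::vector<float2> cx(N);
    for (size_t i = 0; i < N; i++) { cx[i].x = 1.0f; cx[i].y = -1.0f; }
    hipMemcpy(x, cx.data(), N * sizeof(float2), hipMemcpyHostToDevice);

    rocfft_plan plan = NULL;
    size_t length = N;
    rocfft_plan_create(&plan, rocfft_placement_inplace,
                       rocfft_transform_type_complex_forward,
                       rocfft_precision_single, 1, &length, 1, NULL);

    // Ask the plan how much scratch space it needs
    size_t work_size = 0;
    rocfft_plan_get_work_buffer_size(plan, &work_size);

    // Attach a user-allocated work buffer through an execution info handle
    rocfft_execution_info info = NULL;
    rocfft_execution_info_create(&info);
    void *work_buf = NULL;
    if (work_size > 0)
    {
        hipMalloc(&work_buf, work_size);
        rocfft_execution_info_set_work_buffer(info, work_buf, work_size);
    }

    // In-place execution controlled by the execution info handle
    rocfft_execute(plan, (void**) &x, NULL, info);
    hipDeviceSynchronize();

    rocfft_execution_info_destroy(info);
    if (work_buf) hipFree(work_buf);
    rocfft_plan_destroy(plan);
    hipFree(x);
    rocfft_cleanup();
    return 0;
}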

2.9.4.6.4 Plan description

Most of the time, rocfft_plan_create() is all that is needed to fully specify a transform, and the description object can be skipped. But when a transform specification has more details, a description object needs to be created and set up, and its handle passed to rocfft_plan_create(). The functions below can be used to manage the plan description in order to specify more transform details. The plan description object can be safely deleted after the call to rocfft_plan_create().

rocfft_status rocfft_plan_description_create(rocfft_plan_description *description)
Create a plan description. This API creates a plan description with which the user can set more plan properties.
Parameters
• [out] description: plan description handle

rocfft_status rocfft_plan_description_destroy(rocfft_plan_description description)
Destroy a plan description. This API frees the plan description.
Parameters
• [in] description: plan description handle

rocfft_status rocfft_plan_description_set_data_layout(rocfft_plan_description description, rocfft_array_type in_array_type, rocfft_array_type out_array_type, const size_t *in_offsets, const size_t *out_offsets, size_t in_strides_size, const size_t *in_strides, size_t in_distance, size_t out_strides_size, const size_t *out_strides, size_t out_distance)
Set data layout. This is one of the plan description functions to specify optional additional plan properties using the description handle. This API specifies the layout of buffers. It can be used to specify input and output array types; not all combinations of array types are supported, and an error code will be returned for unsupported cases. Additionally, input and output buffer offsets can be specified. The function can be used to specify a custom layout of data, with the ability to specify the stride between consecutive elements in all dimensions. Also, the distance between transform data members can be specified. The library will choose appropriate defaults if offsets/strides are set to NULL and/or distances are set to 0.

Parameters
• [in] description: description handle
• [in] in_array_type: array type of input buffer
• [in] out_array_type: array type of output buffer
• [in] in_offsets: offsets, in element units, to start of data in input buffer
• [in] out_offsets: offsets, in element units, to start of data in output buffer
• [in] in_strides_size: size of in_strides array (must be equal to transform dimensions)
• [in] in_strides: array of strides, in each dimension, of input buffer; if set to NULL, the library chooses defaults
• [in] in_distance: distance between start of each data instance in input buffer
• [in] out_strides_size: size of out_strides array (must be equal to transform dimensions)
• [in] out_strides: array of strides, in each dimension, of output buffer; if set to NULL, the library chooses defaults
• [in] out_distance: distance between start of each data instance in output buffer

2.9.4.6.5 Execution

The following details the execution function. After a plan has been created, it can be used to compute a transform on specified data. Aspects of the execution can be controlled and any useful information returned to the user.

rocfft_status rocfft_execute(const rocfft_plan plan, void *in_buffer[], void *out_buffer[], rocfft_execution_info info)
Execute an FFT plan. This API executes an FFT plan on buffers given by the user. If the transform is in-place, only the input buffer is needed and the output buffer parameter can be set to NULL. For transforms that are not in-place, output buffers have to be specified. Note that both input and output buffers are arrays of pointers; this facilitates passing planar buffers where the real and imaginary parts are in two separate buffers. For the default interleaved format, a unit-sized array holding the pointer to the input/output buffer needs to be passed. The final parameter in this function is an execution_info handle. This parameter serves as a way for the user to control execution, as well as for the library to pass any execution-related information back to the user.

Parameters
• [in] plan: plan handle
• [inout] in_buffer: array (of size 1 for interleaved data, of size 2 for planar data) of input buffers
• [inout] out_buffer: array (of size 1 for interleaved data, of size 2 for planar data) of output buffers; can be NULL for in-place result placement
• [in] info: execution info handle created by rocfft_execution_info_create

2.9.4.6.6 Execution info

The execution API rocfft_execute() takes a rocfft_execution_info parameter. This parameter needs to be created and set up by the user and passed to the execution API. The execution info handle encapsulates information such as the execution mode and a pointer to any work buffer. It can also hold information that is a side effect of execution, such as event objects. The following functions deal with managing the execution info object. Note that the set functions below need to be called before execution and the get functions after execution.

rocfft_status rocfft_execution_info_create(rocfft_execution_info *info)
Create execution info.


This API creates an execution info with which the user can control plan execution and retrieve execution information.
Parameters
• [out] info: execution info handle

rocfft_status rocfft_execution_info_destroy(rocfft_execution_info info)
Destroy an execution info. This API frees the execution info.
Parameters
• [in] info: execution info handle

rocfft_status rocfft_execution_info_set_work_buffer(rocfft_execution_info info, void *work_buffer, size_t size_in_bytes)
Set work buffer in execution info. This is one of the execution info functions to specify optional additional information to control execution. This API specifies the work buffer needed. It has to be called before the call to rocfft_execute(). When a non-zero value is obtained from rocfft_plan_get_work_buffer_size(), the library needs a work buffer to compute the transform. In this case, the user has to allocate the work buffer and pass it to the library via this API.

Parameters
• [in] info: execution info handle
• [in] work_buffer: work buffer
• [in] size_in_bytes: size of work buffer in bytes

rocfft_status rocfft_execution_info_set_stream(rocfft_execution_info info, void *stream)
Set stream in execution info. This is one of the execution info functions to specify optional additional information to control execution. This API specifies the compute stream, which is the underlying device queue/stream where the library computations are inserted. It has to be called before the call to rocfft_execute(). The library assumes the user has created such a stream in the program and merely assigns work to the stream.
Parameters
• [in] info: execution info handle
• [in] stream: underlying compute stream
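Putting the execution pieces together, the sketch below (an illustrative example, not part of the official documentation; the helper name run_plan and the assumption of an interleaved, in-place plan are ours) attaches a user stream and an optional work buffer to an execution info object and then executes the plan:

#include <hip/hip_runtime_api.h>
#include <rocfft.h>

/* d_data is assumed to be a device buffer holding interleaved complex data
   that matches the plan created earlier. */
void run_plan(rocfft_plan plan, void *d_data)
{
    hipStream_t stream;
    hipStreamCreate(&stream);

    rocfft_execution_info info;
    rocfft_execution_info_create(&info);
    rocfft_execution_info_set_stream(info, stream);    // must be set before rocfft_execute

    size_t work_bytes = 0;
    rocfft_plan_get_work_buffer_size(plan, &work_bytes);
    void *work_buffer = NULL;
    if (work_bytes > 0) {                               // plan needs a user-provided work buffer
        hipMalloc(&work_buffer, work_bytes);
        rocfft_execution_info_set_work_buffer(info, work_buffer, work_bytes);
    }

    void *in[1] = { d_data };                           // unit-sized array for interleaved data
    rocfft_execute(plan, in, NULL, info);               // NULL output => in-place result

    hipStreamSynchronize(stream);                       // wait for the asynchronous transform
    if (work_buffer) hipFree(work_buffer);
    rocfft_execution_info_destroy(info);
    hipStreamDestroy(stream);
}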

2.9.4.6.7 Enumerations

This section provides all the enumerations used.

enum rocfft_status
rocFFT status/error codes. Values:
• rocfft_status_success
• rocfft_status_failure
• rocfft_status_invalid_arg_value
• rocfft_status_invalid_dimensions
• rocfft_status_invalid_array_type
• rocfft_status_invalid_strides
• rocfft_status_invalid_distance
• rocfft_status_invalid_offset

enum rocfft_transform_type
Type of transform. Values:
• rocfft_transform_type_complex_forward
• rocfft_transform_type_complex_inverse
• rocfft_transform_type_real_forward
• rocfft_transform_type_real_inverse

enum rocfft_precision
Precision. Values:
• rocfft_precision_single
• rocfft_precision_double

enum rocfft_result_placement
Result placement. Values:
• rocfft_placement_inplace
• rocfft_placement_notinplace

enum rocfft_array_type
Array type. Values:
• rocfft_array_type_complex_interleaved
• rocfft_array_type_complex_planar
• rocfft_array_type_real
• rocfft_array_type_hermitian_interleaved
• rocfft_array_type_hermitian_planar

enum rocfft_execution_mode
Execution mode. Values:
• rocfft_exec_mode_nonblocking
• rocfft_exec_mode_nonblocking_with_flush
• rocfft_exec_mode_blocking


2.9.5 rocSPARSE

2.9.5.1 Introduction

rocSPARSE is a library that contains basic linear algebra subroutines for sparse matrices and vectors, written in HIP for GPU devices. It is designed to be used from C and C++ code. The functionality of rocSPARSE is organized in the following categories:
• Sparse Auxiliary Functions describe available helper functions that are required for subsequent library calls.
• Sparse Level 1 Functions describe operations between a vector in sparse format and a vector in dense format.
• Sparse Level 2 Functions describe operations between a matrix in sparse format and a vector in dense format.
• Sparse Level 3 Functions describe operations between a matrix in sparse format and multiple vectors in dense format.
• Preconditioner Functions describe manipulations on a matrix in sparse format to obtain a preconditioner.
• Sparse Conversion Functions describe operations on a matrix in sparse format to obtain a different matrix format.
The code is open and hosted here: https://github.com/ROCmSoftwarePlatform/rocSPARSE

2.9.5.2 Device and Stream Management

hipSetDevice() and hipGetDevice() are HIP device management APIs. They are NOT part of the rocSPARSE API.

2.9.5.2.1 Asynchronous Execution

All rocSPARSE library functions, unless otherwise stated, are non-blocking and executed asynchronously with respect to the host. They may return before the actual computation has finished. To force synchronization, hipDeviceSynchronize() or hipStreamSynchronize() can be used. This ensures that all previously executed rocSPARSE functions on the device / this particular stream have completed.
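As a small illustrative sketch (the helper name wait_for_rocsparse is ours), synchronizing on the stream that is bound to the handle can be done as follows:

#include <hip/hip_runtime_api.h>
#include <rocsparse.h>

/* Block the host until all rocSPARSE work queued on the handle's stream has finished. */
void wait_for_rocsparse(rocsparse_handle handle)
{
    hipStream_t stream;
    rocsparse_get_stream(handle, &stream);   // stream currently bound to the handle
    hipStreamSynchronize(stream);            // host waits for all work on that stream
}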

2.9.5.2.2 HIP Device Management

Before a HIP kernel invocation, users need to call hipSetDevice() to set a device, e.g. device 1. If users do not explicitly call it, the system by default sets it to device 0. Unless users explicitly call hipSetDevice() to set another device, their HIP kernels are always launched on device 0. This is a HIP (and CUDA) device management approach and has nothing to do with rocSPARSE. rocSPARSE honors this approach and assumes users have already set the device before a rocSPARSE routine call. Once users set the device, they create a handle with rocsparse_create_handle(). Subsequent rocSPARSE routines take this handle as an input parameter. rocSPARSE ONLY queries (by hipGetDevice()) the user's device; rocSPARSE does NOT set the device for users. If rocSPARSE does not see a valid device, it returns an error message. It is the users' responsibility to provide a valid device to rocSPARSE and ensure device safety. Users CANNOT switch devices between rocsparse_create_handle() and rocsparse_destroy_handle(). If users want to change devices, they must destroy the current handle and create another rocSPARSE handle.
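A minimal sketch of the order described above (device id 1 is only an example):

#include <hip/hip_runtime_api.h>
#include <rocsparse.h>

int main(void)
{
    hipSetDevice(1);                      // select the device before any rocSPARSE call

    rocsparse_handle handle;
    rocsparse_create_handle(&handle);     // the handle is bound to the currently set device

    // ... rocSPARSE routines using this handle run on device 1 ...

    rocsparse_destroy_handle(handle);     // destroy before switching to another device
    return 0;
}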


2.9.5.2.3 HIP Stream Management

HIP kernels are always launched in a queue (also known as a stream). If users do not explicitly specify a stream, the system provides a default stream, maintained by the system. Users cannot create or destroy the default stream. However, users can freely create new streams (with hipStreamCreate()) and bind them to the rocSPARSE handle. HIP kernels are invoked in rocSPARSE routines. The rocSPARSE handle is always associated with a stream, and rocSPARSE passes its stream to the kernels inside the routine. One rocSPARSE routine only takes one stream in a single invocation. If users create a stream, they are responsible for destroying it.

2.9.5.2.4 Multiple Streams and Multiple Devices

If the system under test has multiple HIP devices, users can run multiple rocSPARSE handles concurrently, but they cannot run a single rocSPARSE handle on different discrete devices. Each handle is associated with a particular device, and a new handle should be created for each additional device.

2.9.5.3 Building and Installing

2.9.5.3.1 Installing from AMD ROCm repositories

rocSPARSE can be installed from AMD ROCm repositories by

sudo apt install rocsparse

2.9.5.3.2 Building rocSPARSE from Open-Source repository

Download rocSPARSE The rocSPARSE source code is available at the rocSPARSE github page. Download the master branch using:

git clone -b master https://github.com/ROCmSoftwarePlatform/rocSPARSE.git
cd rocSPARSE

Note that if you want to contribute to rocSPARSE, you will need to checkout the develop branch instead of the master branch. Below are steps to build different packages of the library, including dependencies and clients. It is recommended to install rocSPARSE using the install.sh script.

2.9.5.3.3 Using install.sh to build dependencies + library

The following table lists common uses of install.sh to build dependencies + library.

2.9.5.3.4 Using install.sh to build dependencies + library + client

The client contains example code, unit tests and benchmarks. Common uses of install.sh to build them are listed in the table below.


Command              Description
./install.sh -h      Print help information.
./install.sh -dc     Build dependencies, library and client in your local directory. The -d flag only needs to be used once. For subsequent invocations of install.sh it is not necessary to rebuild the dependencies.
./install.sh -c      Build library and client in your local directory. It is assumed dependencies are available.
./install.sh -idc    Build library, dependencies and client, then build and install the rocSPARSE package in /opt/rocm/rocsparse. You will be prompted for sudo access. This will install for all users.
./install.sh -ic     Build library and client, then build and install the rocSPARSE package in /opt/rocm/rocsparse. You will be prompted for sudo access. This will install for all users.

2.9.5.3.5 Using individual commands to build rocSPARSE

CMake 3.5 or later is required in order to build rocSPARSE. The rocSPARSE library contains both host and device code; therefore, the HCC compiler must be specified during the CMake configuration process. rocSPARSE can be built using the following commands:

# Create and change to build directory
mkdir -p build/release ; cd build/release

# Default install path is /opt/rocm, use -DCMAKE_INSTALL_PREFIX= to adjust it
CXX=/opt/rocm/bin/hcc cmake ../..

# Compile rocSPARSE library
make -j$(nproc)

# Install rocSPARSE to /opt/rocm
sudo make install

Boost and GoogleTest are required in order to build the rocSPARSE client. rocSPARSE with dependencies and client can be built using the following commands:

# Install boost on Ubuntu
sudo apt install libboost-program-options-dev
# Install boost on Fedora
sudo dnf install boost-program-options

# Install googletest
mkdir -p build/release/deps ; cd build/release/deps
cmake -DBUILD_BOOST=OFF ../../../deps
sudo make -j$(nproc) install

# Change to build directory
cd ..



# Configure rocSPARSE
# Build options:
#   BUILD_CLIENTS_TESTS      - build tests (OFF)
#   BUILD_CLIENTS_BENCHMARKS - build benchmarks (OFF)
#   BUILD_CLIENTS_SAMPLES    - build examples (ON)
#   BUILD_VERBOSE            - verbose output (OFF)
#   BUILD_SHARED_LIBS        - build rocSPARSE as a shared library (ON)

# Default install path is /opt/rocm, use -DCMAKE_INSTALL_PREFIX= to adjust it
CXX=/opt/rocm/bin/hcc cmake ../.. -DBUILD_CLIENTS_TESTS=ON \
                                  -DBUILD_CLIENTS_BENCHMARKS=ON \
                                  -DBUILD_CLIENTS_SAMPLES=ON \
                                  -DBUILD_VERBOSE=OFF \
                                  -DBUILD_SHARED_LIBS=ON

# Compile rocSPARSE library
make -j$(nproc)

# Install rocSPARSE to /opt/rocm
sudo make install

2.9.5.3.6 Common build problems

1. Issue: HIP (/opt/rocm/hip) was built using hcc 1.0.xxx-xxx-xxx-xxx, but you are using /opt/rocm/bin/hcc with version 1.0.yyy-yyy-yyy-yyy from hipcc (version mismatch). Please rebuild HIP including cmake or update HCC_HOME variable.
   Solution: Download HIP from GitHub, use hcc to build it from source, and then use the built HIP instead of /opt/rocm/hip.
2. Issue: For Carrizo - HCC RUNTIME ERROR: Failed to find compatible kernel
   Solution: Add the following to the cmake command when configuring: -DCMAKE_CXX_FLAGS="--amdgpu-target=gfx801"
3. Issue: For MI25 (Vega10 Server) - HCC RUNTIME ERROR: Failed to find compatible kernel
   Solution: export HCC_AMDGPU_TARGET=gfx900
4. Issue: Could not find a package configuration file provided by "ROCM" with any of the following names: ROCMConfig.cmake, rocm-config.cmake
   Solution: Install the ROCm cmake modules.

2.9.5.4 Unit tests

To run unit tests, rocSPARSE has to be built with option -DBUILD_CLIENTS_TESTS=ON.

# Go to rocSPARSE build directory
cd rocSPARSE; cd build/release

# Run all tests
./clients/staging/rocsparse-test


2.9.5.5 Benchmarks

To run benchmarks, rocSPARSE has to be built with option -DBUILD_CLIENTS_BENCHMARKS=ON.

# Go to rocSPARSE build directory
cd rocSPARSE/build/release

# Run benchmark, e.g.
./clients/staging/rocsparse-bench -f hybmv --laplacian-dim 2000 -i 200

2.9.5.6 Storage Formats

2.9.5.6.1 COO storage format

The Coordinate (COO) storage format represents an m × n matrix by

m            number of rows (integer).
n            number of columns (integer).
nnz          number of non-zero elements (integer).
coo_val      array of nnz elements containing the data (floating point).
coo_row_ind  array of nnz elements containing the row indices (integer).
coo_col_ind  array of nnz elements containing the column indices (integer).

The COO matrix is expected to be sorted by row indices and column indices per row. Furthermore, each pair of indices should appear only once. Consider the following 3 × 5 matrix and the corresponding COO structures, with m = 3, n = 5 and nnz = 8, using zero-based indexing:

$$A = \begin{pmatrix} 1.0 & 2.0 & 0.0 & 3.0 & 0.0 \\ 0.0 & 4.0 & 5.0 & 0.0 & 0.0 \\ 6.0 & 0.0 & 0.0 & 7.0 & 8.0 \end{pmatrix}$$

where

coo_val[8]     = {1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0}
coo_row_ind[8] = {0, 0, 0, 1, 1, 2, 2, 2}
coo_col_ind[8] = {0, 1, 3, 1, 2, 0, 3, 4}
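To make the ordering requirement concrete, the following small, self-contained C sketch (ours, not from the library) writes the example above out as host arrays and checks that they are sorted by row and by column within each row:

#include <stdio.h>

int main(void)
{
    /* The 3 x 5 example matrix above in COO form, zero-based indexing. */
    const int   nnz            = 8;
    const float coo_val[8]     = {1.0f, 2.0f, 3.0f, 4.0f, 5.0f, 6.0f, 7.0f, 8.0f};
    const int   coo_row_ind[8] = {0, 0, 0, 1, 1, 2, 2, 2};
    const int   coo_col_ind[8] = {0, 1, 3, 1, 2, 0, 3, 4};

    /* Check the ordering requirement: sorted by row, and by column within each row. */
    for (int i = 1; i < nnz; ++i) {
        int ok = (coo_row_ind[i] > coo_row_ind[i - 1]) ||
                 (coo_row_ind[i] == coo_row_ind[i - 1] &&
                  coo_col_ind[i] > coo_col_ind[i - 1]);
        if (!ok) {
            printf("entry %d violates the COO ordering requirement\n", i);
            return 1;
        }
    }
    printf("COO arrays are correctly ordered (%d non-zeros, first value %.1f)\n",
           nnz, coo_val[0]);
    return 0;
}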

2.9.5.6.2 CSR storage format

The Compressed Sparse Row (CSR) storage format represents an m × n matrix by

m            number of rows (integer).
n            number of columns (integer).
nnz          number of non-zero elements (integer).
csr_val      array of nnz elements containing the data (floating point).
csr_row_ptr  array of m+1 elements that point to the start of every row (integer).
csr_col_ind  array of nnz elements containing the column indices (integer).

The CSR matrix is expected to be sorted by column indices within each row. Furthermore, each pair of indices should appear only once. Consider the following 3 × 5 matrix and the corresponding CSR structures, with m = 3, n = 5 and nnz = 8, using one-based indexing:

$$A = \begin{pmatrix} 1.0 & 2.0 & 0.0 & 3.0 & 0.0 \\ 0.0 & 4.0 & 5.0 & 0.0 & 0.0 \\ 6.0 & 0.0 & 0.0 & 7.0 & 8.0 \end{pmatrix}$$

where

csr_val[8]     = {1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0}
csr_row_ptr[4] = {1, 4, 6, 9}
csr_col_ind[8] = {1, 2, 4, 2, 3, 1, 4, 5}

2.9.5.6.3 ELL storage format

The Ellpack-Itpack (ELL) storage format represents an m × n matrix by

m            number of rows (integer).
n            number of columns (integer).
ell_width    maximum number of non-zero elements per row (integer).
ell_val      array of m times ell_width elements containing the data (floating point).
ell_col_ind  array of m times ell_width elements containing the column indices (integer).

The ELL matrix is assumed to be stored in column-major format. Rows with fewer than ell_width non-zero elements are padded with zeros (ell_val) and -1 (ell_col_ind). Consider the following 3 × 5 matrix and the corresponding ELL structures, with m = 3, n = 5 and ell_width = 3, using zero-based indexing:

$$A = \begin{pmatrix} 1.0 & 2.0 & 0.0 & 3.0 & 0.0 \\ 0.0 & 4.0 & 5.0 & 0.0 & 0.0 \\ 6.0 & 0.0 & 0.0 & 7.0 & 8.0 \end{pmatrix}$$

where

ell_val[9]     = {1.0, 4.0, 6.0, 2.0, 5.0, 7.0, 3.0, 0.0, 8.0}
ell_col_ind[9] = {0, 1, 0, 1, 2, 3, 3, -1, 4}

2.9.5.6.4 HYB storage format

The Hybrid (HYB) storage format represents an m × n matrix by

m            number of rows (integer).
n            number of columns (integer).
nnz          number of non-zero elements of the COO part (integer).
ell_width    maximum number of non-zero elements per row of the ELL part (integer).
ell_val      array of m times ell_width elements containing the ELL part data (floating point).
ell_col_ind  array of m times ell_width elements containing the ELL part column indices (integer).
coo_val      array of nnz elements containing the COO part data (floating point).
coo_row_ind  array of nnz elements containing the COO part row indices (integer).
coo_col_ind  array of nnz elements containing the COO part column indices (integer).

The HYB format is a combination of the ELL and COO sparse matrix formats. Typically, the regular part of the matrix is stored in ELL storage format and the irregular part of the matrix is stored in COO storage format. Three different partitioning schemes can be applied when converting a CSR matrix to a matrix in HYB storage format. For further details on the partitioning schemes, see rocsparse_hyb_partition.

2.9.5.7 Types

2.9.5.7.1 rocsparse_handle typedef struct _rocsparse_handle *rocsparse_handle Handle to the rocSPARSE library context queue. The rocSPARSE handle is a structure holding the rocSPARSE library context. It must be initialized using rocsparse_create_handle() and the returned handle must be passed to all subsequent library function calls. It should be destroyed at the end using rocsparse_destroy_handle().


2.9.5.7.2 rocsparse_mat_descr typedef struct _rocsparse_mat_descr *rocsparse_mat_descr Descriptor of the matrix. The rocSPARSE matrix descriptor is a structure holding all properties of a matrix. It must be initialized using rocsparse_create_mat_descr() and the returned descriptor must be passed to all subsequent library calls that involve the matrix. It should be destroyed at the end using rocsparse_destroy_mat_descr().

2.9.5.7.3 rocsparse_mat_info

typedef struct _rocsparse_mat_info *rocsparse_mat_info
Info structure to hold all matrix meta data. The rocSPARSE matrix info is a structure holding all matrix information that is gathered during analysis routines. It must be initialized using rocsparse_create_mat_info() and the returned info structure must be passed to all subsequent library calls that require additional matrix information. It should be destroyed at the end using rocsparse_destroy_mat_info().

2.9.5.7.4 rocsparse_hyb_mat

typedef struct _rocsparse_hyb_mat *rocsparse_hyb_mat
HYB matrix storage format. The rocSPARSE HYB matrix structure holds the HYB matrix. It must be initialized using rocsparse_create_hyb_mat() and the returned HYB matrix must be passed to all subsequent library calls that involve the matrix. It should be destroyed at the end using rocsparse_destroy_hyb_mat().

2.9.5.7.5 rocsparse_action enum rocsparse_action Specify where the operation is performed on. The rocsparse_action indicates whether the operation is performed on the full matrix, or only on the sparsity pattern of the matrix. Values: rocsparse_action_symbolic = 0 Operate only on indices. rocsparse_action_numeric = 1 Operate on data and indices.

2.9.5.7.6 rocsparse_hyb_partition enum rocsparse_hyb_partition HYB matrix partitioning type. The rocsparse_hyb_partition type indicates how the hybrid format partitioning between COO and ELL storage formats is performed. Values:


rocsparse_hyb_partition_auto = 0 automatically decide on ELL nnz per row. rocsparse_hyb_partition_user = 1 user given ELL nnz per row. rocsparse_hyb_partition_max = 2 max ELL nnz per row, no COO part.

2.9.5.7.7 rocsparse_index_base

enum rocsparse_index_base
Specify the matrix index base. The rocsparse_index_base indicates the index base of the indices. For a given rocsparse_mat_descr, the rocsparse_index_base can be set using rocsparse_set_mat_index_base(). The current rocsparse_index_base of a matrix can be obtained by rocsparse_get_mat_index_base(). Values:
• rocsparse_index_base_zero = 0: zero-based indexing.
• rocsparse_index_base_one = 1: one-based indexing.

2.9.5.7.8 rocsparse_matrix_type

enum rocsparse_matrix_type
Specify the matrix type. The rocsparse_matrix_type indicates the type of a matrix. For a given rocsparse_mat_descr, the rocsparse_matrix_type can be set using rocsparse_set_mat_type(). The current rocsparse_matrix_type of a matrix can be obtained by rocsparse_get_mat_type(). Values:
• rocsparse_matrix_type_general = 0: general matrix type.
• rocsparse_matrix_type_symmetric = 1: symmetric matrix type.
• rocsparse_matrix_type_hermitian = 2: hermitian matrix type.
• rocsparse_matrix_type_triangular = 3: triangular matrix type.

2.9.5.7.9 rocsparse_fill_mode enum rocsparse_fill_mode Specify the matrix fill mode. The rocsparse_fill_mode indicates whether the lower or the upper part is stored in a sparse triangular matrix. For a given rocsparse_mat_descr, the rocsparse_fill_mode can be set using rocsparse_set_mat_fill_mode(). The current rocsparse_fill_mode of a matrix can be obtained by rocsparse_get_mat_fill_mode().


Values: rocsparse_fill_mode_lower = 0 lower triangular part is stored. rocsparse_fill_mode_upper = 1 upper triangular part is stored.

2.9.5.7.10 rocsparse_diag_type

enum rocsparse_diag_type
Indicates if the diagonal entries are unity. The rocsparse_diag_type indicates whether the diagonal entries of a matrix are unity or not. If rocsparse_diag_type_unit is specified, all present diagonal values will be ignored. For a given rocsparse_mat_descr, the rocsparse_diag_type can be set using rocsparse_set_mat_diag_type(). The current rocsparse_diag_type of a matrix can be obtained by rocsparse_get_mat_diag_type(). Values:
• rocsparse_diag_type_non_unit = 0: diagonal entries are non-unity.
• rocsparse_diag_type_unit = 1: diagonal entries are unity.

2.9.5.7.11 rocsparse_operation enum rocsparse_operation Specify whether the matrix is to be transposed or not. The rocsparse_operation indicates the operation performed with the given matrix. Values: rocsparse_operation_none = 111 Operate with matrix. rocsparse_operation_transpose = 112 Operate with transpose. rocsparse_operation_conjugate_transpose = 113 Operate with conj. transpose.

2.9.5.7.12 rocsparse_pointer_mode enum rocsparse_pointer_mode Indicates if the pointer is device pointer or host pointer. The rocsparse_pointer_mode indicates whether scalar values are passed by reference on the host or device. The rocsparse_pointer_mode can be changed by rocsparse_set_pointer_mode(). The currently used pointer mode can be obtained by rocsparse_get_pointer_mode(). Values: rocsparse_pointer_mode_host = 0 scalar pointers are in host memory.


rocsparse_pointer_mode_device = 1 scalar pointers are in device memory.

2.9.5.7.13 rocsparse_analysis_policy

enum rocsparse_analysis_policy
Specify policy in analysis functions. The rocsparse_analysis_policy specifies whether gathered analysis data should be re-used or not. If meta data from a previous call, e.g. rocsparse_csrilu0_analysis(), is available, it can be re-used for subsequent calls to e.g. rocsparse_csrsv_analysis() and greatly improve performance of the analysis function. Values:
• rocsparse_analysis_policy_reuse = 0: try to re-use meta data.
• rocsparse_analysis_policy_force = 1: force re-building meta data.

2.9.5.7.14 rocsparse_solve_policy enum rocsparse_solve_policy Specify policy in triangular solvers and factorizations. This is a placeholder. Values: rocsparse_solve_policy_auto = 0 automatically decide on level information.

2.9.5.7.15 rocsparse_layer_mode enum rocsparse_layer_mode Indicates if layer is active with bitmask. The rocsparse_layer_mode bit mask indicates the logging characteristics. Values: rocsparse_layer_mode_none = 0x0 layer is not active. rocsparse_layer_mode_log_trace = 0x1 layer is in logging mode. rocsparse_layer_mode_log_bench = 0x2 layer is in benchmarking mode.

2.9.5.7.16 rocsparse_status enum rocsparse_status List of rocsparse status codes definition. This is a list of the rocsparse_status types that are used by the rocSPARSE library.


Values:
• rocsparse_status_success = 0: success.
• rocsparse_status_invalid_handle = 1: handle not initialized, invalid or null.
• rocsparse_status_not_implemented = 2: function is not implemented.
• rocsparse_status_invalid_pointer = 3: invalid pointer parameter.
• rocsparse_status_invalid_size = 4: invalid size parameter.
• rocsparse_status_memory_error = 5: failed memory allocation, copy, dealloc.
• rocsparse_status_internal_error = 6: other internal library failure.
• rocsparse_status_invalid_value = 7: invalid value parameter.
• rocsparse_status_arch_mismatch = 8: device arch is not supported.
• rocsparse_status_zero_pivot = 9: encountered zero pivot.

2.9.5.8 Logging

Three different environment variables can be set to enable logging in rocSPARSE: ROCSPARSE_LAYER, ROCSPARSE_LOG_TRACE_PATH and ROCSPARSE_LOG_BENCH_PATH. ROCSPARSE_LAYER is a bit mask, where several logging modes can be combined as follows:

ROCSPARSE_LAYER unset       logging is disabled.
ROCSPARSE_LAYER set to 1    trace logging is enabled.
ROCSPARSE_LAYER set to 2    bench logging is enabled.
ROCSPARSE_LAYER set to 3    trace logging and bench logging are enabled.

When logging is enabled, each rocSPARSE function call will write the function name as well as the function arguments to the logging stream. The default logging stream is stderr. If the user sets the environment variable ROCSPARSE_LOG_TRACE_PATH to the full path name for a file, the file is opened and trace logging is streamed to that file. If the user sets the environment variable ROCSPARSE_LOG_BENCH_PATH to the full path name for a file, the file is opened and bench logging is streamed to that file. If the file cannot be opened, logging output is streamed to stderr. Note that performance will degrade when logging is enabled. By default, the environment variable ROCSPARSE_LAYER is unset and logging is disabled.
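As a hedged sketch (it is our assumption that setting the variables from within the process before the first library call behaves the same as exporting them in the shell, e.g. export ROCSPARSE_LAYER=3), logging could be enabled programmatically like this:

#include <stdlib.h>
#include <rocsparse.h>

int main(void)
{
    /* Enable trace + bench logging (bit mask 1 | 2 = 3) and redirect both to files. */
    setenv("ROCSPARSE_LAYER", "3", 1);
    setenv("ROCSPARSE_LOG_TRACE_PATH", "rocsparse_trace.log", 1);
    setenv("ROCSPARSE_LOG_BENCH_PATH", "rocsparse_bench.log", 1);

    rocsparse_handle handle;
    rocsparse_create_handle(&handle);     // subsequent calls are logged
    rocsparse_destroy_handle(handle);
    return 0;
}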

2.9.5.9 Sparse Auxiliary Functions

This module holds all sparse auxiliary functions.


The functions that are contained in the auxiliary module describe all available helper functions that are required for subsequent library calls.

2.9.5.9.1 rocsparse_create_handle() rocsparse_status rocsparse_create_handle(rocsparse_handle *handle) Create a rocsparse handle. rocsparse_create_handle creates the rocSPARSE library context. It must be initialized before any other rocSPARSE API function is invoked and must be passed to all subsequent library function calls. The handle should be destroyed at the end using rocsparse_destroy_handle().

Parameters • [out] handle: the pointer to the handle to the rocSPARSE library context. Return Value • rocsparse_status_success: the initialization succeeded. • rocsparse_status_invalid_handle: handle pointer is invalid. • rocsparse_status_internal_error: an internal error occurred.

2.9.5.9.2 rocsparse_destroy_handle()

rocsparse_status rocsparse_destroy_handle(rocsparse_handle handle) Destroy a rocsparse handle. rocsparse_destroy_handle destroys the rocSPARSE library context and releases all resources used by the rocSPARSE library.

Parameters • [in] handle: the handle to the rocSPARSE library context. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: handle is invalid. • rocsparse_status_internal_error: an internal error occurred.
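A short sketch (ours) of the usual status-checking pattern around these two calls:

#include <stdio.h>
#include <rocsparse.h>

int main(void)
{
    rocsparse_handle handle;
    rocsparse_status status = rocsparse_create_handle(&handle);
    if (status != rocsparse_status_success) {
        fprintf(stderr, "rocsparse_create_handle failed (status %d)\n", (int)status);
        return 1;
    }

    /* ... use the handle ... */

    status = rocsparse_destroy_handle(handle);
    if (status != rocsparse_status_success) {
        fprintf(stderr, "rocsparse_destroy_handle failed (status %d)\n", (int)status);
        return 1;
    }
    return 0;
}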

2.9.5.9.3 rocsparse_set_stream()

rocsparse_status rocsparse_set_stream(rocsparse_handle handle, hipStream_t stream)
Specify user defined HIP stream. rocsparse_set_stream specifies the stream to be used by the rocSPARSE library context and all subsequent function calls.

Example: This example illustrates how a user-defined stream can be used in rocSPARSE.


// Create rocSPARSE handle
rocsparse_handle handle;
rocsparse_create_handle(&handle);

// Create stream
hipStream_t stream;
hipStreamCreate(&stream);

// Set stream to rocSPARSE handle
rocsparse_set_stream(handle, stream);

// Do some work
// ...

// Clean up
rocsparse_destroy_handle(handle);
hipStreamDestroy(stream);

Parameters • [inout] handle: the handle to the rocSPARSE library context. • [in] stream: the stream to be used by the rocSPARSE library context. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: handle is invalid.

2.9.5.9.4 rocsparse_get_stream()

rocsparse_status rocsparse_get_stream(rocsparse_handle handle, hipStream_t *stream)
Get current stream from library context. rocsparse_get_stream gets the rocSPARSE library context stream which is currently used for all subsequent function calls.

Parameters • [in] handle: the handle to the rocSPARSE library context. • [out] stream: the stream currently used by the rocSPARSE library context. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: handle is invalid.

2.9.5.9.5 rocsparse_set_pointer_mode() rocsparse_status rocsparse_set_pointer_mode(rocsparse_handle handle, rocsparse_pointer_mode pointer_mode) Specify pointer mode. rocsparse_set_pointer_mode specifies the pointer mode to be used by the rocSPARSE library context and all subsequent function calls. By default, all values are passed by reference on the host. Valid pointer modes are rocsparse_pointer_mode_host or rocsparse_pointer_mode_device.


Parameters • [in] handle: the handle to the rocSPARSE library context. • [in] pointer_mode: the pointer mode to be used by the rocSPARSE library context. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: handle is invalid.

2.9.5.9.6 rocsparse_get_pointer_mode() rocsparse_status rocsparse_get_pointer_mode(rocsparse_handle handle, rocsparse_pointer_mode *pointer_mode) Get current pointer mode from library context. rocsparse_get_pointer_mode gets the rocSPARSE library context pointer mode which is currently used for all subsequent function calls.

Parameters • [in] handle: the handle to the rocSPARSE library context. • [out] pointer_mode: the pointer mode that is currently used by the rocSPARSE library context. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: handle is invalid.

2.9.5.9.7 rocsparse_get_version() rocsparse_status rocsparse_get_version(rocsparse_handle handle, int *version) Get rocSPARSE version. rocsparse_get_version gets the rocSPARSE library version number. • patch = version % 100 • minor = version / 100 % 1000 • major = version / 100000

Parameters • [in] handle: the handle to the rocSPARSE library context. • [out] version: the version number of the rocSPARSE library. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: handle is invalid.
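A small sketch (ours) that applies the decoding above:

#include <stdio.h>
#include <rocsparse.h>

int main(void)
{
    rocsparse_handle handle;
    rocsparse_create_handle(&handle);

    int version = 0;
    rocsparse_get_version(handle, &version);
    printf("rocSPARSE %d.%d.%d\n",
           version / 100000,        /* major */
           version / 100 % 1000,    /* minor */
           version % 100);          /* patch */

    rocsparse_destroy_handle(handle);
    return 0;
}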


2.9.5.9.8 rocsparse_get_git_rev()

rocsparse_status rocsparse_get_git_rev(rocsparse_handle handle, char *rev) Get rocSPARSE git revision. rocsparse_get_git_rev gets the rocSPARSE library git commit revision (SHA-1).

Parameters • [in] handle: the handle to the rocSPARSE library context. • [out] rev: the git commit revision (SHA-1). Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: handle is invalid.

2.9.5.9.9 rocsparse_create_mat_descr()

rocsparse_status rocsparse_create_mat_descr(rocsparse_mat_descr *descr)
Create a matrix descriptor. rocsparse_create_mat_descr creates a matrix descriptor. It initializes rocsparse_matrix_type to rocsparse_matrix_type_general and rocsparse_index_base to rocsparse_index_base_zero. It should be destroyed at the end using rocsparse_destroy_mat_descr().

Parameters • [out] descr: the pointer to the matrix descriptor. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: descr pointer is invalid.

2.9.5.9.10 rocsparse_destroy_mat_descr()

rocsparse_status rocsparse_destroy_mat_descr(rocsparse_mat_descr descr)
Destroy a matrix descriptor. rocsparse_destroy_mat_descr destroys a matrix descriptor and releases all resources used by the descriptor.

Parameters • [in] descr: the matrix descriptor. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: descr is invalid.


2.9.5.9.11 rocsparse_copy_mat_descr()

rocsparse_status rocsparse_copy_mat_descr(rocsparse_mat_descr dest, const rocsparse_mat_descr src)
Copy a matrix descriptor. rocsparse_copy_mat_descr copies a matrix descriptor. Both source and destination matrix descriptors must be initialized prior to calling rocsparse_copy_mat_descr.

Parameters • [out] dest: the pointer to the destination matrix descriptor. • [in] src: the pointer to the source matrix descriptor. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: src or dest pointer is invalid.

2.9.5.9.12 rocsparse_set_mat_index_base()

rocsparse_status rocsparse_set_mat_index_base(rocsparse_mat_descr descr, rocsparse_index_base base)
Specify the index base of a matrix descriptor. rocsparse_set_mat_index_base sets the index base of a matrix descriptor. Valid options are rocsparse_index_base_zero or rocsparse_index_base_one.

Parameters • [inout] descr: the matrix descriptor. • [in] base: rocsparse_index_base_zero or rocsparse_index_base_one. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: descr pointer is invalid. • rocsparse_status_invalid_value: base is invalid.
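For example, the following sketch (ours) creates a descriptor and switches it to one-based indexing before handing it to subsequent routines:

#include <rocsparse.h>

int main(void)
{
    rocsparse_mat_descr descr;
    rocsparse_create_mat_descr(&descr);                            // general type, zero-based by default
    rocsparse_set_mat_index_base(descr, rocsparse_index_base_one); // e.g. for data coming from Fortran

    /* ... pass descr to level 2/3 routines ... */

    rocsparse_destroy_mat_descr(descr);
    return 0;
}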

2.9.5.9.13 rocsparse_get_mat_index_base() rocsparse_index_base rocsparse_get_mat_index_base(const rocsparse_mat_descr descr) Get the index base of a matrix descriptor. rocsparse_get_mat_index_base returns the index base of a matrix descriptor.

Return rocsparse_index_base_zero or rocsparse_index_base_one. Parameters • [in] descr: the matrix descriptor.


2.9.5.9.14 rocsparse_set_mat_type()

rocsparse_status rocsparse_set_mat_type(rocsparse_mat_descr descr, rocsparse_matrix_type type)
Specify the matrix type of a matrix descriptor. rocsparse_set_mat_type sets the matrix type of a matrix descriptor. Valid matrix types are rocsparse_matrix_type_general, rocsparse_matrix_type_symmetric, rocsparse_matrix_type_hermitian or rocsparse_matrix_type_triangular.

Parameters
• [inout] descr: the matrix descriptor.
• [in] type: rocsparse_matrix_type_general, rocsparse_matrix_type_symmetric, rocsparse_matrix_type_hermitian or rocsparse_matrix_type_triangular.
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_pointer: descr pointer is invalid.
• rocsparse_status_invalid_value: type is invalid.

2.9.5.9.15 rocsparse_get_mat_type() rocsparse_matrix_type rocsparse_get_mat_type(const rocsparse_mat_descr descr) Get the matrix type of a matrix descriptor. rocsparse_get_mat_type returns the matrix type of a matrix descriptor.

Return rocsparse_matrix_type_general, rocsparse_matrix_type_symmetric, rocsparse_matrix_type_hermitian or rocsparse_matrix_type_triangular. Parameters • [in] descr: the matrix descriptor.

2.9.5.9.16 rocsparse_set_mat_fill_mode() rocsparse_status rocsparse_set_mat_fill_mode(rocsparse_mat_descr descr, rocsparse_fill_mode fill_mode) Specify the matrix fill mode of a matrix descriptor. rocsparse_set_mat_fill_mode sets the matrix fill mode of a matrix descriptor. Valid fill modes are rocsparse_fill_mode_lower or rocsparse_fill_mode_upper.

Parameters • [inout] descr: the matrix descriptor. • [in] fill_mode: rocsparse_fill_mode_lower or rocsparse_fill_mode_upper. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: descr pointer is invalid. • rocsparse_status_invalid_value: fill_mode is invalid.


2.9.5.9.17 rocsparse_get_mat_fill_mode() rocsparse_fill_mode rocsparse_get_mat_fill_mode(const rocsparse_mat_descr descr) Get the matrix fill mode of a matrix descriptor. rocsparse_get_mat_fill_mode returns the matrix fill mode of a matrix descriptor.

Return rocsparse_fill_mode_lower or rocsparse_fill_mode_upper. Parameters • [in] descr: the matrix descriptor.

2.9.5.9.18 rocsparse_set_mat_diag_type() rocsparse_status rocsparse_set_mat_diag_type(rocsparse_mat_descr descr, rocsparse_diag_type diag_type) Specify the matrix diagonal type of a matrix descriptor. rocsparse_set_mat_diag_type sets the matrix diagonal type of a matrix descriptor. Valid diagonal types are rocsparse_diag_type_unit or rocsparse_diag_type_non_unit.

Parameters • [inout] descr: the matrix descriptor. • [in] diag_type: rocsparse_diag_type_unit or rocsparse_diag_type_non_unit. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: descr pointer is invalid. • rocsparse_status_invalid_value: diag_type is invalid.

2.9.5.9.19 rocsparse_get_mat_diag_type() rocsparse_diag_type rocsparse_get_mat_diag_type(const rocsparse_mat_descr descr) Get the matrix diagonal type of a matrix descriptor. rocsparse_get_mat_diag_type returns the matrix diagonal type of a matrix descriptor.

Return rocsparse_diag_type_unit or rocsparse_diag_type_non_unit. Parameters • [in] descr: the matrix descriptor.

2.9.5.9.20 rocsparse_create_hyb_mat() rocsparse_status rocsparse_create_hyb_mat(rocsparse_hyb_mat *hyb) Create a HYB matrix structure. rocsparse_create_hyb_mat creates a structure that holds the matrix in HYB storage format. It should be destroyed at the end using rocsparse_destroy_hyb_mat().

Parameters


• [inout] hyb: the pointer to the hybrid matrix. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: hyb pointer is invalid.

2.9.5.9.21 rocsparse_destroy_hyb_mat() rocsparse_status rocsparse_destroy_hyb_mat(rocsparse_hyb_mat hyb) Destroy a HYB matrix structure. rocsparse_destroy_hyb_mat destroys a HYB structure.

Parameters • [in] hyb: the hybrid matrix structure. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: hyb pointer is invalid. • rocsparse_status_internal_error: an internal error occurred.

2.9.5.9.22 rocsparse_create_mat_info() rocsparse_status rocsparse_create_mat_info(rocsparse_mat_info *info) Create a matrix info structure. rocsparse_create_mat_info creates a structure that holds the matrix info data that is gathered during the analysis routines available. It should be destroyed at the end using rocsparse_destroy_mat_info().

Parameters • [inout] info: the pointer to the info structure. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_pointer: info pointer is invalid.

2.9.5.9.23 rocsparse_destroy_mat_info() rocsparse_status rocsparse_destroy_mat_info(rocsparse_mat_info info) Destroy a matrix info structure. rocsparse_destroy_mat_info destroys a matrix info structure.

Parameters • [in] info: the info structure. Return Value • rocsparse_status_success: the operation completed successfully.


• rocsparse_status_invalid_pointer: info pointer is invalid. • rocsparse_status_internal_error: an internal error occurred.

2.9.5.10 Sparse Level 1 Functions

The sparse level 1 routines describe operations between a vector in sparse format and a vector in dense format. This section describes all rocSPARSE level 1 sparse linear algebra functions.

2.9.5.10.1 rocsparse_axpyi()

rocsparse_status rocsparse_saxpyi(rocsparse_handle handle, rocsparse_int nnz, const float *alpha, const float *x_val, const rocsparse_int *x_ind, float *y, rocsparse_index_base idx_base)
rocsparse_status rocsparse_daxpyi(rocsparse_handle handle, rocsparse_int nnz, const double *alpha, const double *x_val, const rocsparse_int *x_ind, double *y, rocsparse_index_base idx_base)
Scale a sparse vector and add it to a dense vector. rocsparse_axpyi multiplies the sparse vector x with the scalar α and adds the result to the dense vector y, such that

$$y := y + \alpha \cdot x$$

for(i = 0; i < nnz; ++i)
{
    y[x_ind[i]] = y[x_ind[i]] + alpha * x_val[i];
}

Note: This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.
Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] nnz: number of non-zero entries of vector x.
• [in] alpha: scalar α.
• [in] x_val: array of nnz elements containing the values of x.
• [in] x_ind: array of nnz elements containing the indices of the non-zero values of x.
• [inout] y: array of values in dense format.
• [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one.
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_value: idx_base is invalid.
• rocsparse_status_invalid_size: nnz is invalid.
• rocsparse_status_invalid_pointer: alpha, x_val, x_ind or y pointer is invalid.
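A self-contained sketch (ours; the vector sizes and values are arbitrary) of calling the single-precision variant with host data copied to the device and the scalar passed in the default host pointer mode:

#include <hip/hip_runtime_api.h>
#include <rocsparse.h>

int main(void)
{
    rocsparse_handle handle;
    rocsparse_create_handle(&handle);

    const rocsparse_int nnz        = 3;
    const float         h_x_val[3] = {1.0f, 2.0f, 3.0f};
    const rocsparse_int h_x_ind[3] = {0, 3, 5};
    float               h_y[8]     = {0};
    const float         alpha      = 2.0f;   // host scalar (default pointer mode)

    float         *d_x_val, *d_y;
    rocsparse_int *d_x_ind;
    hipMalloc((void **)&d_x_val, sizeof(h_x_val));
    hipMalloc((void **)&d_x_ind, sizeof(h_x_ind));
    hipMalloc((void **)&d_y,     sizeof(h_y));
    hipMemcpy(d_x_val, h_x_val, sizeof(h_x_val), hipMemcpyHostToDevice);
    hipMemcpy(d_x_ind, h_x_ind, sizeof(h_x_ind), hipMemcpyHostToDevice);
    hipMemcpy(d_y,     h_y,     sizeof(h_y),     hipMemcpyHostToDevice);

    // y := y + 2.0 * x
    rocsparse_saxpyi(handle, nnz, &alpha, d_x_val, d_x_ind, d_y,
                     rocsparse_index_base_zero);
    hipDeviceSynchronize();   // the call is asynchronous

    hipFree(d_x_val); hipFree(d_x_ind); hipFree(d_y);
    rocsparse_destroy_handle(handle);
    return 0;
}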


2.9.5.10.2 rocsparse_doti()

rocsparse_status rocsparse_sdoti(rocsparse_handle handle, rocsparse_int nnz, const float *x_val, const rocsparse_int *x_ind, const float *y, float *result, rocsparse_index_base idx_base)
rocsparse_status rocsparse_ddoti(rocsparse_handle handle, rocsparse_int nnz, const double *x_val, const rocsparse_int *x_ind, const double *y, double *result, rocsparse_index_base idx_base)
Compute the dot product of a sparse vector with a dense vector. rocsparse_doti computes the dot product of the sparse vector x with the dense vector y, such that

$$\mathrm{result} := y^{T} x$$

for(i = 0; i < nnz; ++i)
{
    result += x_val[i] * y[x_ind[i]];
}

Note: This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.
Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] nnz: number of non-zero entries of vector x.
• [in] x_val: array of nnz values.
• [in] x_ind: array of nnz elements containing the indices of the non-zero values of x.
• [in] y: array of values in dense format.
• [out] result: pointer to the result, can be host or device memory.
• [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one.
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_value: idx_base is invalid.
• rocsparse_status_invalid_size: nnz is invalid.
• rocsparse_status_invalid_pointer: x_val, x_ind, y or result pointer is invalid.
• rocsparse_status_memory_error: the buffer for the dot product reduction could not be allocated.
• rocsparse_status_internal_error: an internal error occurred.

2.9.5.10.3 rocsparse_gthr()

rocsparse_status rocsparse_sgthr(rocsparse_handle handle, rocsparse_int nnz, const float *y, float *x_val, const rocsparse_int *x_ind, rocsparse_index_base idx_base)
rocsparse_status rocsparse_dgthr(rocsparse_handle handle, rocsparse_int nnz, const double *y, double *x_val, const rocsparse_int *x_ind, rocsparse_index_base idx_base)
Gather elements from a dense vector and store them into a sparse vector. rocsparse_gthr gathers the elements that are listed in x_ind from the dense vector y and stores them in the sparse vector x.

for(i = 0; i < nnz; ++i)
{
    x_val[i] = y[x_ind[i]];
}

Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Parameters • [in] handle: handle to the rocsparse library context queue. • [in] nnz: number of non-zero entries of 푥. • [in] y: array of values in dense format. • [out] x_val: array of nnz elements containing the values of 푥. • [in] x_ind: array of nnz elements containing the indices of the non-zero values of 푥. • [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: the library context was not initialized. • rocsparse_status_invalid_value: idx_base is invalid. • rocsparse_status_invalid_size: nnz is invalid. • rocsparse_status_invalid_pointer: y, x_val or x_ind pointer is invalid.

2.9.5.10.4 rocsparse_gthrz()

rocsparse_status rocsparse_sgthrz(rocsparse_handle handle, rocsparse_int nnz, float *y, float *x_val, const rocsparse_int *x_ind, rocsparse_index_base idx_base)
rocsparse_status rocsparse_dgthrz(rocsparse_handle handle, rocsparse_int nnz, double *y, double *x_val, const rocsparse_int *x_ind, rocsparse_index_base idx_base)
Gather and zero out elements from a dense vector and store them into a sparse vector. rocsparse_gthrz gathers the elements that are listed in x_ind from the dense vector y and stores them in the sparse vector x. The gathered elements in y are replaced by zero.

for(i = 0; i < nnz; ++i)
{
    x_val[i]    = y[x_ind[i]];
    y[x_ind[i]] = 0;
}


Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Parameters • [in] handle: handle to the rocsparse library context queue. • [in] nnz: number of non-zero entries of 푥. • [inout] y: array of values in dense format. • [out] x_val: array of nnz elements containing the non-zero values of 푥. • [in] x_ind: array of nnz elements containing the indices of the non-zero values of 푥. • [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: the library context was not initialized. • rocsparse_status_invalid_value: idx_base is invalid. • rocsparse_status_invalid_size: nnz is invalid. • rocsparse_status_invalid_pointer: y, x_val or x_ind pointer is invalid.

2.9.5.10.5 rocsparse_roti()

rocsparse_status rocsparse_sroti(rocsparse_handle handle, rocsparse_int nnz, float *x_val, const rocsparse_int *x_ind, float *y, const float *c, const float *s, rocsparse_index_base idx_base)
rocsparse_status rocsparse_droti(rocsparse_handle handle, rocsparse_int nnz, double *x_val, const rocsparse_int *x_ind, double *y, const double *c, const double *s, rocsparse_index_base idx_base)
Apply Givens rotation to a dense and a sparse vector. rocsparse_roti applies the Givens rotation matrix G to the sparse vector x and the dense vector y, where

$$G = \begin{pmatrix} c & s \\ -s & c \end{pmatrix}$$

for(i = 0; i < nnz; ++i)
{
    x_tmp = x_val[i];
    y_tmp = y[x_ind[i]];

    x_val[i]    = c * x_tmp + s * y_tmp;
    y[x_ind[i]] = c * y_tmp - s * x_tmp;
}

Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Parameters • [in] handle: handle to the rocsparse library context queue. • [in] nnz: number of non-zero entries of 푥.


• [inout] x_val: array of nnz elements containing the non-zero values of 푥. • [in] x_ind: array of nnz elements containing the indices of the non-zero values of 푥. • [inout] y: array of values in dense format. • [in] c: pointer to the cosine element of 퐺, can be on host or device. • [in] s: pointer to the sine element of 퐺, can be on host or device. • [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: the library context was not initialized. • rocsparse_status_invalid_value: idx_base is invalid. • rocsparse_status_invalid_size: nnz is invalid. • rocsparse_status_invalid_pointer: c, s, x_val, x_ind or y pointer is invalid.

2.9.5.10.6 rocsparse_sctr()

rocsparse_status rocsparse_ssctr(rocsparse_handle handle, rocsparse_int nnz, const float *x_val, const rocsparse_int *x_ind, float *y, rocsparse_index_base idx_base)
rocsparse_status rocsparse_dsctr(rocsparse_handle handle, rocsparse_int nnz, const double *x_val, const rocsparse_int *x_ind, double *y, rocsparse_index_base idx_base)
Scatter elements from a sparse vector into a dense vector. rocsparse_sctr scatters the elements that are listed in x_ind from the sparse vector x into the dense vector y. Indices of y that are not listed in x_ind remain unchanged.

for(i = 0; i < nnz; ++i)
{
    y[x_ind[i]] = x_val[i];
}

Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Parameters • [in] handle: handle to the rocsparse library context queue. • [in] nnz: number of non-zero entries of 푥. • [in] x_val: array of nnz elements containing the non-zero values of 푥. • [in] x_ind: array of nnz elements containing the indices of the non-zero values of x. • [inout] y: array of values in dense format. • [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: the library context was not initialized.


• rocsparse_status_invalid_value: idx_base is invalid. • rocsparse_status_invalid_size: nnz is invalid. • rocsparse_status_invalid_pointer: x_val, x_ind or y pointer is invalid.

2.9.5.11 Sparse Level 2 Functions

This module holds all sparse level 2 routines. The sparse level 2 routines describe operations between a matrix in sparse format and a vector in dense format.

2.9.5.11.1 rocsparse_coomv()

rocsparse_status rocsparse_scoomv(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const float *alpha, const rocsparse_mat_descr descr, const float *coo_val, const rocsparse_int *coo_row_ind, const rocsparse_int *coo_col_ind, const float *x, const float *beta, float *y)
rocsparse_status rocsparse_dcoomv(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const double *alpha, const rocsparse_mat_descr descr, const double *coo_val, const rocsparse_int *coo_row_ind, const rocsparse_int *coo_col_ind, const double *x, const double *beta, double *y)
Sparse matrix vector multiplication using COO storage format. rocsparse_coomv multiplies the scalar α with a sparse m × n matrix, defined in COO storage format, and the dense vector x, and adds the result to the dense vector y that is multiplied by the scalar β, such that

$$y := \alpha \cdot op(A) \cdot x + \beta \cdot y,$$

with

$$op(A) = \begin{cases} A, & \text{if trans == rocsparse\_operation\_none} \\ A^{T}, & \text{if trans == rocsparse\_operation\_transpose} \\ A^{H}, & \text{if trans == rocsparse\_operation\_conjugate\_transpose} \end{cases}$$

The COO matrix has to be sorted by row indices. This can be achieved by using rocsparse_coosort_by_row().

for(i = 0; i < m; ++i)
{
    y[i] = beta * y[i];
}

for(i = 0; i < nnz; ++i)
{
    y[coo_row_ind[i]] += alpha * coo_val[i] * x[coo_col_ind[i]];
}

Note: This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.
Note: Currently, only trans == rocsparse_operation_none is supported.
Parameters


• [in] handle: handle to the rocsparse library context queue.
• [in] trans: matrix operation type.
• [in] m: number of rows of the sparse COO matrix.
• [in] n: number of columns of the sparse COO matrix.
• [in] nnz: number of non-zero entries of the sparse COO matrix.
• [in] alpha: scalar α.
• [in] descr: descriptor of the sparse COO matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] coo_val: array of nnz elements of the sparse COO matrix.
• [in] coo_row_ind: array of nnz elements containing the row indices of the sparse COO matrix.
• [in] coo_col_ind: array of nnz elements containing the column indices of the sparse COO matrix.
• [in] x: array of n elements (op(A) = A) or m elements (op(A) = A^T or op(A) = A^H).
• [in] beta: scalar β.
• [inout] y: array of m elements (op(A) = A) or n elements (op(A) = A^T or op(A) = A^H).
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, alpha, coo_val, coo_row_ind, coo_col_ind, x, beta or y pointer is invalid.
• rocsparse_status_arch_mismatch: the device is not supported.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.
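A complete sketch (ours) that multiplies the 3 × 5 COO example matrix from the storage format section with a vector of ones (error checking omitted for brevity):

#include <hip/hip_runtime_api.h>
#include <rocsparse.h>

int main(void)
{
    rocsparse_handle handle;
    rocsparse_create_handle(&handle);
    rocsparse_mat_descr descr;
    rocsparse_create_mat_descr(&descr);            // general type, zero-based indexing

    const rocsparse_int m = 3, n = 5, nnz = 8;
    const float         h_val[8] = {1, 2, 3, 4, 5, 6, 7, 8};
    const rocsparse_int h_row[8] = {0, 0, 0, 1, 1, 2, 2, 2};
    const rocsparse_int h_col[8] = {0, 1, 3, 1, 2, 0, 3, 4};
    const float         h_x[5]   = {1, 1, 1, 1, 1};
    float               h_y[3]   = {0, 0, 0};
    const float alpha = 1.0f, beta = 0.0f;

    float *d_val, *d_x, *d_y;
    rocsparse_int *d_row, *d_col;
    hipMalloc((void **)&d_val, sizeof(h_val));
    hipMalloc((void **)&d_row, sizeof(h_row));
    hipMalloc((void **)&d_col, sizeof(h_col));
    hipMalloc((void **)&d_x,   sizeof(h_x));
    hipMalloc((void **)&d_y,   sizeof(h_y));
    hipMemcpy(d_val, h_val, sizeof(h_val), hipMemcpyHostToDevice);
    hipMemcpy(d_row, h_row, sizeof(h_row), hipMemcpyHostToDevice);
    hipMemcpy(d_col, h_col, sizeof(h_col), hipMemcpyHostToDevice);
    hipMemcpy(d_x,   h_x,   sizeof(h_x),   hipMemcpyHostToDevice);
    hipMemcpy(d_y,   h_y,   sizeof(h_y),   hipMemcpyHostToDevice);

    // y := alpha * A * x + beta * y
    rocsparse_scoomv(handle, rocsparse_operation_none, m, n, nnz, &alpha, descr,
                     d_val, d_row, d_col, d_x, &beta, d_y);

    hipMemcpy(h_y, d_y, sizeof(h_y), hipMemcpyDeviceToHost);   // row sums: {6, 9, 21}

    hipFree(d_val); hipFree(d_row); hipFree(d_col); hipFree(d_x); hipFree(d_y);
    rocsparse_destroy_mat_descr(descr);
    rocsparse_destroy_handle(handle);
    return 0;
}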

2.9.5.11.2 rocsparse_csrmv_analysis()

rocsparse_status rocsparse_scsrmv_analysis(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info)

rocsparse_status rocsparse_dcsrmv_analysis(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info)

Sparse matrix vector multiplication using CSR storage format.

rocsparse_csrmv_analysis performs the analysis step for rocsparse_scsrmv() and rocsparse_dcsrmv(). It is expected that this function will be executed only once for a given matrix and particular operation type. The gathered analysis meta data can be cleared by rocsparse_csrmv_clear().

Note If the matrix sparsity pattern changes, the gathered information will become invalid.


Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] trans: matrix operation type.
• [in] m: number of rows of the sparse CSR matrix.
• [in] n: number of columns of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] descr: descriptor of the sparse CSR matrix.
• [in] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [out] info: structure that holds the information collected during the analysis step.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, csr_val, csr_row_ptr, csr_col_ind or info pointer is invalid.
• rocsparse_status_memory_error: the buffer for the gathered information could not be allocated.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.
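A minimal sketch of the intended call pattern, assuming handle, descr and the device CSR arrays already exist: the analysis is run once per matrix and operation type, and the resulting info is then reused by every subsequent csrmv call (see the example in the next section).

// Create the info structure that will hold the analysis meta data
rocsparse_mat_info info;
rocsparse_create_mat_info(&info);

// One-time analysis for this matrix and operation type
rocsparse_scsrmv_analysis(handle,
                          rocsparse_operation_none,
                          m, n, nnz,
                          descr,
                          csr_val, csr_row_ptr, csr_col_ind,
                          info);

// info can now be passed to repeated rocsparse_scsrmv() calls on the same matrix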

2.9.5.11.3 rocsparse_csrmv()

rocsparse_status rocsparse_scsrmv(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const float *alpha, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, const float *x, const float *beta, float *y)

rocsparse_status rocsparse_dcsrmv(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const double *alpha, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, const double *x, const double *beta, double *y)

Sparse matrix vector multiplication using CSR storage format.


rocsparse_csrmv multiplies the scalar α with a sparse m × n matrix, defined in CSR storage format, and the dense vector x and adds the result to the dense vector y that is multiplied by the scalar β, such that

y := \alpha \cdot op(A) \cdot x + \beta \cdot y,

with

op(A) = \begin{cases} A, & \text{if trans == rocsparse\_operation\_none} \\ A^T, & \text{if trans == rocsparse\_operation\_transpose} \\ A^H, & \text{if trans == rocsparse\_operation\_conjugate\_transpose} \end{cases}

The info parameter is optional and contains information collected by rocsparse_scsrmv_analysis() or rocsparse_dcsrmv_analysis(). If present, the information will be used to speed up the csrmv computation. If info == NULL, the general csrmv routine will be used instead.

for(i = 0; i < m; ++i)
{
    y[i] = beta * y[i];

    for(j = csr_row_ptr[i]; j < csr_row_ptr[i+1]; ++j)
    {
        y[i] = y[i] + alpha * csr_val[j] * x[csr_col_ind[j]];
    }
}

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Note Currently, only trans == rocsparse_operation_none is supported.

Example This example performs a sparse matrix vector multiplication in CSR format using additional meta data to improve performance.

// Create matrix info structure
rocsparse_mat_info info;
rocsparse_create_mat_info(&info);

// Perform analysis step to obtain meta data
rocsparse_scsrmv_analysis(handle,
                          rocsparse_operation_none,
                          m, n, nnz,
                          descr,
                          csr_val, csr_row_ptr, csr_col_ind,
                          info);

// Compute y = Ax
rocsparse_scsrmv(handle,
                 rocsparse_operation_none,
                 m, n, nnz,
                 &alpha,
                 descr,
                 csr_val, csr_row_ptr, csr_col_ind,
                 info,
                 x, &beta, y);

// Do more work
// ...

// Clean up
rocsparse_destroy_mat_info(info);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] trans: matrix operation type.
• [in] m: number of rows of the sparse CSR matrix.
• [in] n: number of columns of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] alpha: scalar α.
• [in] descr: descriptor of the sparse CSR matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [in] info: information collected by rocsparse_scsrmv_analysis() or rocsparse_dcsrmv_analysis(), can be NULL if no information is available.
• [in] x: array of n elements (op(A) == A) or m elements (op(A) == A^T or op(A) == A^H).
• [in] beta: scalar β.
• [inout] y: array of m elements (op(A) == A) or n elements (op(A) == A^T or op(A) == A^H).

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, alpha, csr_val, csr_row_ptr, csr_col_ind, x, beta or y pointer is invalid.
• rocsparse_status_arch_mismatch: the device is not supported.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.


2.9.5.11.4 rocsparse_csrmv_clear()

rocsparse_status rocsparse_csrmv_clear(rocsparse_handle handle, rocsparse_mat_info info)

Sparse matrix vector multiplication using CSR storage format.

rocsparse_csrmv_clear deallocates all memory that was allocated by rocsparse_scsrmv_analysis() or rocsparse_dcsrmv_analysis(). This is especially useful if memory is an issue and the analysis data is not required anymore for further computation, e.g. when switching to another sparse matrix format.

Note Calling rocsparse_csrmv_clear is optional. All allocated resources will be cleared when the opaque rocsparse_mat_info struct is destroyed using rocsparse_destroy_mat_info().

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [inout] info: structure that holds the information collected during analysis step.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_pointer: info pointer is invalid.
• rocsparse_status_memory_error: the buffer for the gathered information could not be deallocated.
• rocsparse_status_internal_error: an internal error occurred.
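For illustration, assuming info was previously filled by rocsparse_scsrmv_analysis(), the meta data can be released early while keeping the info structure itself usable:

// Free only the csrmv analysis data; info itself stays valid
rocsparse_csrmv_clear(handle, info);

// Destroy the info structure once it is no longer needed at all
rocsparse_destroy_mat_info(info);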

2.9.5.11.5 rocsparse_ellmv()

rocsparse_status rocsparse_sellmv(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int n, const float *alpha, const rocsparse_mat_descr descr, const float *ell_val, const rocsparse_int *ell_col_ind, rocsparse_int ell_width, const float *x, const float *beta, float *y)

rocsparse_status rocsparse_dellmv(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int n, const double *alpha, const rocsparse_mat_descr descr, const double *ell_val, const rocsparse_int *ell_col_ind, rocsparse_int ell_width, const double *x, const double *beta, double *y)

Sparse matrix vector multiplication using ELL storage format.

rocsparse_ellmv multiplies the scalar α with a sparse m × n matrix, defined in ELL storage format, and the dense vector x and adds the result to the dense vector y that is multiplied by the scalar β, such that

y := \alpha \cdot op(A) \cdot x + \beta \cdot y,

with

op(A) = \begin{cases} A, & \text{if trans == rocsparse\_operation\_none} \\ A^T, & \text{if trans == rocsparse\_operation\_transpose} \\ A^H, & \text{if trans == rocsparse\_operation\_conjugate\_transpose} \end{cases}


for(i = 0; i < m; ++i)
{
    y[i] = beta * y[i];

    for(p = 0; p < ell_width; ++p)
    {
        idx = p * m + i;

        if((ell_col_ind[idx] >= 0) && (ell_col_ind[idx] < n))
        {
            y[i] = y[i] + alpha * ell_val[idx] * x[ell_col_ind[idx]];
        }
    }
}

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Note Currently, only trans == rocsparse_operation_none is supported.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] trans: matrix operation type.
• [in] m: number of rows of the sparse ELL matrix.
• [in] n: number of columns of the sparse ELL matrix.
• [in] alpha: scalar α.
• [in] descr: descriptor of the sparse ELL matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] ell_val: array that contains the elements of the sparse ELL matrix. Padded elements should be zero.
• [in] ell_col_ind: array that contains the column indices of the sparse ELL matrix. Padded column indices should be -1.
• [in] ell_width: number of non-zero elements per row of the sparse ELL matrix.
• [in] x: array of n elements (op(A) == A) or m elements (op(A) == A^T or op(A) == A^H).
• [in] beta: scalar β.
• [inout] y: array of m elements (op(A) == A) or n elements (op(A) == A^T or op(A) == A^H).

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or ell_width is invalid.
• rocsparse_status_invalid_pointer: descr, alpha, ell_val, ell_col_ind, x, beta or y pointer is invalid.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.
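As an illustrative sketch of the ELL layout and the corresponding call, assuming handle, descr and the device vectors x and y already exist, the 3 × 5 matrix used in the CSR examples of this section would be stored column-major with ell_width = 3 and padded with 0 / -1:

rocsparse_int m = 3;
rocsparse_int n = 5;
rocsparse_int ell_width = 3;

// Column-major ELL arrays (idx = p * m + i), shown here as host data;
// they must reside in device memory when the call is made.
// ell_col_ind[ell_width * m] = {0, 1, 0,  1, 2, 3,  3, -1, 4};
// ell_val[ell_width * m]     = {1, 4, 6,  2, 5, 7,  3,  0, 8};

float alpha = 1.0f;
float beta  = 0.0f;

// y = alpha * A * x + beta * y with A in ELL format
rocsparse_sellmv(handle,
                 rocsparse_operation_none,
                 m, n,
                 &alpha,
                 descr,
                 ell_val, ell_col_ind, ell_width,
                 x, &beta, y);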


2.9.5.11.6 rocsparse_hybmv()

rocsparse_status rocsparse_shybmv(rocsparse_handle handle, rocsparse_operation trans, const float *alpha, const rocsparse_mat_descr descr, const rocsparse_hyb_mat hyb, const float *x, const float *beta, float *y)

rocsparse_status rocsparse_dhybmv(rocsparse_handle handle, rocsparse_operation trans, const double *alpha, const rocsparse_mat_descr descr, const rocsparse_hyb_mat hyb, const double *x, const double *beta, double *y)

Sparse matrix vector multiplication using HYB storage format.

rocsparse_hybmv multiplies the scalar α with a sparse m × n matrix, defined in HYB storage format, and the dense vector x and adds the result to the dense vector y that is multiplied by the scalar β, such that

y := \alpha \cdot op(A) \cdot x + \beta \cdot y,

with

op(A) = \begin{cases} A, & \text{if trans == rocsparse\_operation\_none} \\ A^T, & \text{if trans == rocsparse\_operation\_transpose} \\ A^H, & \text{if trans == rocsparse\_operation\_conjugate\_transpose} \end{cases}

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Note Currently, only trans == rocsparse_operation_none is supported.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] trans: matrix operation type.
• [in] alpha: scalar α.
• [in] descr: descriptor of the sparse HYB matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] hyb: matrix in HYB storage format.
• [in] x: array of n elements (op(A) == A) or m elements (op(A) == A^T or op(A) == A^H).
• [in] beta: scalar β.
• [inout] y: array of m elements (op(A) == A) or n elements (op(A) == A^T or op(A) == A^H).

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: hyb structure was not initialized with valid matrix sizes.
• rocsparse_status_invalid_pointer: descr, alpha, hyb, x, beta or y pointer is invalid.
• rocsparse_status_invalid_value: hyb structure was not initialized with a valid partitioning type.
• rocsparse_status_arch_mismatch: the device is not supported.
• rocsparse_status_memory_error: the buffer could not be allocated.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.
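As an illustrative sketch, assuming handle, descr and the device vectors x and y exist and that the HYB matrix has been populated (for instance by converting a CSR matrix with rocsparse_scsr2hyb()), the multiplication reduces to a single call:

// Create an opaque HYB matrix structure
rocsparse_hyb_mat hyb;
rocsparse_create_hyb_mat(&hyb);

// ... populate hyb here, e.g. via rocsparse_scsr2hyb() ...

float alpha = 1.0f;
float beta  = 0.0f;

// y = alpha * A * x + beta * y with A in HYB format
rocsparse_shybmv(handle,
                 rocsparse_operation_none,
                 &alpha,
                 descr,
                 hyb,
                 x, &beta, y);

// Clean up
rocsparse_destroy_hyb_mat(hyb);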

2.9.5.11.7 rocsparse_csrsv_zero_pivot()

rocsparse_status rocsparse_csrsv_zero_pivot(rocsparse_handle handle, const rocsparse_mat_descr descr, rocsparse_mat_info info, rocsparse_int *position)

Sparse triangular solve using CSR storage format.

rocsparse_csrsv_zero_pivot returns rocsparse_status_zero_pivot if either a structural or numerical zero has been found during the rocsparse_scsrsv_solve() or rocsparse_dcsrsv_solve() computation. The first zero pivot j at A_{j,j} is stored in position, using the same index base as the CSR matrix. position can be in host or device memory. If no zero pivot has been found, position is set to -1 and rocsparse_status_success is returned instead.

Note rocsparse_csrsv_zero_pivot is a blocking function. It might influence performance negatively.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] descr: descriptor of the sparse CSR matrix.
• [in] info: structure that holds the information collected during the analysis step.
• [inout] position: pointer to zero pivot j, can be in host or device memory.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_pointer: info or position pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_zero_pivot: zero pivot has been found.
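For illustration, the typical check pattern after a triangular solve, assuming handle, descr and info are those used for the solve:

// Query the first zero pivot reported by the preceding csrsv solve
rocsparse_int position;
if(rocsparse_status_zero_pivot == rocsparse_csrsv_zero_pivot(handle, descr, info, &position))
{
    printf("Zero pivot found at A(%d,%d)\n", position, position);
}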

2.9.5.11.8 rocsparse_csrsv_buffer_size()

rocsparse_status rocsparse_scsrsv_buffer_size(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, size_t *buffer_size)

rocsparse_status rocsparse_dcsrsv_buffer_size(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, size_t *buffer_size)

Sparse triangular solve using CSR storage format.


rocsparse_csrsv_buffer_size returns the size of the temporary storage buffer that is required by rocsparse_scsrsv_analysis(), rocsparse_dcsrsv_analysis(), rocsparse_scsrsv_solve() and rocsparse_dcsrsv_solve(). The temporary storage buffer must be allocated by the user. The size of the temporary storage buffer is identical to the size returned by rocsparse_scsrilu0_buffer_size() and rocsparse_dcsrilu0_buffer_size() if the matrix sparsity pattern is identical. The user allocated buffer can thus be shared between subsequent calls to those functions.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] trans: matrix operation type.
• [in] m: number of rows of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] descr: descriptor of the sparse CSR matrix.
• [in] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [out] info: structure that holds the information collected during the analysis step.
• [out] buffer_size: number of bytes of the temporary storage buffer required by rocsparse_scsrsv_analysis(), rocsparse_dcsrsv_analysis(), rocsparse_scsrsv_solve() and rocsparse_dcsrsv_solve().

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, csr_val, csr_row_ptr, csr_col_ind, info or buffer_size pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.
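To illustrate the buffer sharing mentioned above, a sketch that sizes one temporary buffer for both the triangular solve and the csrilu0 routines on the same matrix (handle, descr, info and the device CSR arrays are assumed to exist):

size_t buffer_size_sv;
size_t buffer_size_ilu0;

// Both queries refer to the same sparsity pattern, so the sizes match and
// a single buffer can be shared between csrsv and csrilu0 calls.
rocsparse_dcsrsv_buffer_size(handle, rocsparse_operation_none, m, nnz, descr,
                             csr_val, csr_row_ptr, csr_col_ind, info, &buffer_size_sv);
rocsparse_dcsrilu0_buffer_size(handle, m, nnz, descr,
                               csr_val, csr_row_ptr, csr_col_ind, info, &buffer_size_ilu0);

void* temp_buffer;
hipMalloc(&temp_buffer, max(buffer_size_sv, buffer_size_ilu0));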

2.9.5.11.9 rocsparse_csrsv_analysis()

rocsparse_status rocsparse_scsrsv_analysis(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, rocsparse_analysis_policy analysis, rocsparse_solve_policy solve, void *temp_buffer)

rocsparse_status rocsparse_dcsrsv_analysis(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, rocsparse_analysis_policy analysis, rocsparse_solve_policy solve, void *temp_buffer)

Sparse triangular solve using CSR storage format.

rocsparse_csrsv_analysis performs the analysis step for rocsparse_scsrsv_solve() and rocsparse_dcsrsv_solve(). It is expected that this function will be executed only once for a given matrix and particular operation type. The analysis meta data can be cleared by rocsparse_csrsv_clear().

rocsparse_csrsv_analysis can share its meta data with rocsparse_scsrilu0_analysis() and rocsparse_dcsrilu0_analysis(). Selecting the rocsparse_analysis_policy_reuse policy can greatly reduce the cost of computing this meta data. However, the user needs to make sure that the sparsity pattern remains unchanged. If this cannot be assured, rocsparse_analysis_policy_force has to be used.

Note If the matrix sparsity pattern changes, the gathered information will become invalid.

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] trans: matrix operation type.
• [in] m: number of rows of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] descr: descriptor of the sparse CSR matrix.
• [in] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [out] info: structure that holds the information collected during the analysis step.
• [in] analysis: rocsparse_analysis_policy_reuse or rocsparse_analysis_policy_force.
• [in] solve: rocsparse_solve_policy_auto.
• [in] temp_buffer: temporary storage buffer allocated by the user.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, csr_row_ptr, csr_col_ind, info or temp_buffer pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.


2.9.5.11.10 rocsparse_csrsv_solve()

rocsparse_status rocsparse_scsrsv_solve(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int nnz, const float *alpha, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, const float *x, float *y, rocsparse_solve_policy policy, void *temp_buffer)

rocsparse_status rocsparse_dcsrsv_solve(rocsparse_handle handle, rocsparse_operation trans, rocsparse_int m, rocsparse_int nnz, const double *alpha, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, const double *x, double *y, rocsparse_solve_policy policy, void *temp_buffer)

Sparse triangular solve using CSR storage format.

rocsparse_csrsv_solve solves a sparse triangular linear system of a sparse m × m matrix, defined in CSR storage format, a dense solution vector y and the right-hand side x that is multiplied by α, such that

op(A) \cdot y = \alpha \cdot x,

with

op(A) = \begin{cases} A, & \text{if trans == rocsparse\_operation\_none} \\ A^T, & \text{if trans == rocsparse\_operation\_transpose} \\ A^H, & \text{if trans == rocsparse\_operation\_conjugate\_transpose} \end{cases}

rocsparse_csrsv_solve requires a user allocated temporary buffer. Its size is returned by rocsparse_scsrsv_buffer_size() or rocsparse_dcsrsv_buffer_size(). Furthermore, analysis meta data is required. It can be obtained by rocsparse_scsrsv_analysis() or rocsparse_dcsrsv_analysis(). rocsparse_csrsv_solve reports the first zero pivot (either numerical or structural zero). The zero pivot status can be checked by calling rocsparse_csrsv_zero_pivot(). If rocsparse_diag_type == rocsparse_diag_type_unit, no zero pivot will be reported, even if A_{j,j} = 0 for some j.

Note The sparse CSR matrix has to be sorted. This can be achieved by calling rocsparse_csrsort().

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Note Currently, only trans == rocsparse_operation_none is supported.

Example Consider the lower triangular m × m matrix L, stored in CSR storage format with unit diagonal. The following example solves L · y = x.

// Create rocSPARSE handle
rocsparse_handle handle;
rocsparse_create_handle(&handle);

// Create matrix descriptor
rocsparse_mat_descr descr;
rocsparse_create_mat_descr(&descr);
rocsparse_set_mat_fill_mode(descr, rocsparse_fill_mode_lower);
rocsparse_set_mat_diag_type(descr, rocsparse_diag_type_unit);

// Create matrix info structure
rocsparse_mat_info info;
rocsparse_create_mat_info(&info);

// Obtain required buffer size
size_t buffer_size;
rocsparse_dcsrsv_buffer_size(handle, rocsparse_operation_none, m, nnz, descr,
                             csr_val, csr_row_ptr, csr_col_ind, info, &buffer_size);

// Allocate temporary buffer
void* temp_buffer;
hipMalloc(&temp_buffer, buffer_size);

// Perform analysis step
rocsparse_dcsrsv_analysis(handle, rocsparse_operation_none, m, nnz, descr,
                          csr_val, csr_row_ptr, csr_col_ind, info,
                          rocsparse_analysis_policy_reuse, rocsparse_solve_policy_auto,
                          temp_buffer);

// Solve Ly = x
rocsparse_dcsrsv_solve(handle, rocsparse_operation_none, m, nnz, &alpha, descr,
                       csr_val, csr_row_ptr, csr_col_ind, info,
                       x, y, rocsparse_solve_policy_auto, temp_buffer);

// No zero pivot should be found, with L having unit diagonal

// Clean up
hipFree(temp_buffer);
rocsparse_destroy_mat_info(info);
rocsparse_destroy_mat_descr(descr);
rocsparse_destroy_handle(handle);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] trans: matrix operation type.
• [in] m: number of rows of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] alpha: scalar α.
• [in] descr: descriptor of the sparse CSR matrix.
• [in] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [in] info: structure that holds the information collected during the analysis step.
• [in] x: array of m elements, holding the right-hand side.
• [out] y: array of m elements, holding the solution.
• [in] policy: rocsparse_solve_policy_auto.
• [in] temp_buffer: temporary storage buffer allocated by the user.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, alpha, csr_val, csr_row_ptr, csr_col_ind, x or y pointer is invalid.
• rocsparse_status_arch_mismatch: the device is not supported.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.

2.9.5.11.11 rocsparse_csrsv_clear()

rocsparse_status rocsparse_csrsv_clear(rocsparse_handle handle, const rocsparse_mat_descr descr, rocsparse_mat_info info)

Sparse triangular solve using CSR storage format.

rocsparse_csrsv_clear deallocates all memory that was allocated by rocsparse_scsrsv_analysis() or rocsparse_dcsrsv_analysis(). This is especially useful if memory is an issue and the analysis data is not required for further computation, e.g. when switching to another sparse matrix format. Calling rocsparse_csrsv_clear is optional. All allocated resources will be cleared when the opaque rocsparse_mat_info struct is destroyed using rocsparse_destroy_mat_info().


Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] descr: descriptor of the sparse CSR matrix.
• [inout] info: structure that holds the information collected during the analysis step.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_pointer: info pointer is invalid.
• rocsparse_status_memory_error: the buffer holding the meta data could not be deallocated.
• rocsparse_status_internal_error: an internal error occurred.
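A short sketch, assuming the analysis data in info is no longer needed (for example before converting the matrix to another storage format):

// Release the csrsv analysis data early; info remains valid for later reuse
rocsparse_csrsv_clear(handle, descr, info);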

2.9.5.12 Sparse Level 3 Functions

This module holds all sparse level 3 routines. The sparse level 3 routines describe operations between a matrix in sparse format and multiple vectors in dense format that can also be seen as a dense matrix.

2.9.5.12.1 rocsparse_csrmm()

rocsparse_status rocsparse_scsrmm(rocsparse_handle handle, rocsparse_operation trans_A, rocsparse_operation trans_B, rocsparse_int m, rocsparse_int n, rocsparse_int k, rocsparse_int nnz, const float *alpha, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, const float *B, rocsparse_int ldb, const float *beta, float *C, rocsparse_int ldc)

rocsparse_status rocsparse_dcsrmm(rocsparse_handle handle, rocsparse_operation trans_A, rocsparse_operation trans_B, rocsparse_int m, rocsparse_int n, rocsparse_int k, rocsparse_int nnz, const double *alpha, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, const double *B, rocsparse_int ldb, const double *beta, double *C, rocsparse_int ldc)

Sparse matrix dense matrix multiplication using CSR storage format.

rocsparse_csrmm multiplies the scalar α with a sparse m × k matrix A, defined in CSR storage format, and the dense k × n matrix B and adds the result to the dense m × n matrix C that is multiplied by the scalar β, such that

C := \alpha \cdot op(A) \cdot op(B) + \beta \cdot C,

with

op(A) = \begin{cases} A, & \text{if trans\_A == rocsparse\_operation\_none} \\ A^T, & \text{if trans\_A == rocsparse\_operation\_transpose} \\ A^H, & \text{if trans\_A == rocsparse\_operation\_conjugate\_transpose} \end{cases}

and

op(B) = \begin{cases} B, & \text{if trans\_B == rocsparse\_operation\_none} \\ B^T, & \text{if trans\_B == rocsparse\_operation\_transpose} \\ B^H, & \text{if trans\_B == rocsparse\_operation\_conjugate\_transpose} \end{cases}

for(i = 0; i < ldc; ++i)
{
    for(j = 0; j < n; ++j)
    {
        C[i][j] = beta * C[i][j];

        for(k = csr_row_ptr[i]; k < csr_row_ptr[i+1]; ++k)
        {
            C[i][j] += alpha * csr_val[k] * B[csr_col_ind[k]][j];
        }
    }
}

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Note Currently, only trans_A == rocsparse_operation_none is supported.

Example This example multiplies a CSR matrix with a dense matrix.

//     1 2 0 3 0
// A = 0 4 5 0 0
//     6 0 0 7 8

rocsparse_int m   = 3;
rocsparse_int k   = 5;
rocsparse_int nnz = 8;

csr_row_ptr[m+1] = {0, 3, 5, 8};             // device memory
csr_col_ind[nnz] = {0, 1, 3, 1, 2, 0, 3, 4}; // device memory
csr_val[nnz]     = {1, 2, 3, 4, 5, 6, 7, 8}; // device memory

// Set dimension n of B
rocsparse_int n = 64;

// Allocate and generate dense matrix B
std::vector<float> hB(k * n);
for(rocsparse_int i = 0; i < k * n; ++i)
{
    hB[i] = static_cast<float>(rand()) / RAND_MAX;
}

// Copy B to the device
float* B;
hipMalloc((void**)&B, sizeof(float) * k * n);
hipMemcpy(B, hB.data(), sizeof(float) * k * n, hipMemcpyHostToDevice);

// alpha and beta
float alpha = 1.0f;
float beta  = 0.0f;

// Allocate memory for the resulting matrix C
float* C;
hipMalloc((void**)&C, sizeof(float) * m * n);

// Perform the matrix multiplication
rocsparse_scsrmm(handle,
                 rocsparse_operation_none,
                 rocsparse_operation_none,
                 m, n, k, nnz,
                 &alpha,
                 descr,
                 csr_val, csr_row_ptr, csr_col_ind,
                 B, k,
                 &beta,
                 C, m);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] trans_A: matrix A operation type.
• [in] trans_B: matrix B operation type.
• [in] m: number of rows of the sparse CSR matrix A.
• [in] n: number of columns of the dense matrix op(B) and C.
• [in] k: number of columns of the sparse CSR matrix A.
• [in] nnz: number of non-zero entries of the sparse CSR matrix A.
• [in] alpha: scalar α.
• [in] descr: descriptor of the sparse CSR matrix A. Currently, only rocsparse_matrix_type_general is supported.
• [in] csr_val: array of nnz elements of the sparse CSR matrix A.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix A.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix A.
• [in] B: array of dimension ldb × n (op(B) == B) or ldb × k (op(B) == B^T or op(B) == B^H).
• [in] ldb: leading dimension of B, must be at least max(1, k) (op(A) == A) or max(1, m) (op(A) == A^T or op(A) == A^H).
• [in] beta: scalar β.
• [inout] C: array of dimension ldc × n.
• [in] ldc: leading dimension of C, must be at least max(1, m) (op(A) == A) or max(1, k) (op(A) == A^T or op(A) == A^H).


Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n, k, nnz, ldb or ldc is invalid.
• rocsparse_status_invalid_pointer: descr, alpha, csr_val, csr_row_ptr, csr_col_ind, B, beta or C pointer is invalid.
• rocsparse_status_arch_mismatch: the device is not supported.
• rocsparse_status_not_implemented: trans_A != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.

2.9.5.13 Preconditioner Functions

This module holds all sparse preconditioners. The sparse preconditioners describe manipulations on a matrix in sparse format to obtain a sparse preconditioner matrix.

2.9.5.13.1 rocsparse_csrilu0_zero_pivot()

rocsparse_status rocsparse_csrilu0_zero_pivot(rocsparse_handle handle, rocsparse_mat_info info, rocsparse_int *position)

Incomplete LU factorization with 0 fill-ins and no pivoting using CSR storage format.

rocsparse_csrilu0_zero_pivot returns rocsparse_status_zero_pivot if either a structural or numerical zero has been found during the rocsparse_scsrilu0() or rocsparse_dcsrilu0() computation. The first zero pivot j at A_{j,j} is stored in position, using the same index base as the CSR matrix. position can be in host or device memory. If no zero pivot has been found, position is set to -1 and rocsparse_status_success is returned instead.

Note rocsparse_csrilu0_zero_pivot is a blocking function. It might influence performance negatively.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] info: structure that holds the information collected during the analysis step.
• [inout] position: pointer to zero pivot j, can be in host or device memory.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_pointer: info or position pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_zero_pivot: zero pivot has been found.


2.9.5.13.2 rocsparse_csrilu0_buffer_size()

rocsparse_status rocsparse_scsrilu0_buffer_size(rocsparse_handle handle, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, size_t *buffer_size)

rocsparse_status rocsparse_dcsrilu0_buffer_size(rocsparse_handle handle, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, size_t *buffer_size)

Incomplete LU factorization with 0 fill-ins and no pivoting using CSR storage format.

rocsparse_csrilu0_buffer_size returns the size of the temporary storage buffer that is required by rocsparse_scsrilu0_analysis(), rocsparse_dcsrilu0_analysis(), rocsparse_scsrilu0() and rocsparse_dcsrilu0(). The temporary storage buffer must be allocated by the user. The size of the temporary storage buffer is identical to the size returned by rocsparse_scsrsv_buffer_size() and rocsparse_dcsrsv_buffer_size() if the matrix sparsity pattern is identical. The user allocated buffer can thus be shared between subsequent calls to those functions.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] descr: descriptor of the sparse CSR matrix.
• [in] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [out] info: structure that holds the information collected during the analysis step.
• [out] buffer_size: number of bytes of the temporary storage buffer required by rocsparse_scsrilu0_analysis(), rocsparse_dcsrilu0_analysis(), rocsparse_scsrilu0() and rocsparse_dcsrilu0().

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, csr_val, csr_row_ptr, csr_col_ind, info or buffer_size pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.


2.9.5.13.3 rocsparse_csrilu0_analysis()

rocsparse_status rocsparse_scsrilu0_analysis(rocsparse_handle handle, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, rocsparse_analysis_policy analysis, rocsparse_solve_policy solve, void *temp_buffer)

rocsparse_status rocsparse_dcsrilu0_analysis(rocsparse_handle handle, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, rocsparse_analysis_policy analysis, rocsparse_solve_policy solve, void *temp_buffer)

Incomplete LU factorization with 0 fill-ins and no pivoting using CSR storage format.

rocsparse_csrilu0_analysis performs the analysis step for rocsparse_scsrilu0() and rocsparse_dcsrilu0(). It is expected that this function will be executed only once for a given matrix and particular operation type. The analysis meta data can be cleared by rocsparse_csrilu0_clear().

rocsparse_csrilu0_analysis can share its meta data with rocsparse_scsrsv_analysis() and rocsparse_dcsrsv_analysis(). Selecting the rocsparse_analysis_policy_reuse policy can greatly reduce the cost of computing this meta data. However, the user needs to make sure that the sparsity pattern remains unchanged. If this cannot be assured, rocsparse_analysis_policy_force has to be used.

Note If the matrix sparsity pattern changes, the gathered information will become invalid.

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] descr: descriptor of the sparse CSR matrix.
• [in] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [out] info: structure that holds the information collected during the analysis step.
• [in] analysis: rocsparse_analysis_policy_reuse or rocsparse_analysis_policy_force.
• [in] solve: rocsparse_solve_policy_auto.
• [in] temp_buffer: temporary storage buffer allocated by the user.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, csr_val, csr_row_ptr, csr_col_ind, info or temp_buffer pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.

2.9.5.13.4 rocsparse_csrilu0()

rocsparse_status rocsparse_scsrilu0(rocsparse_handle handle, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, rocsparse_solve_policy policy, void *temp_buffer)

rocsparse_status rocsparse_dcsrilu0(rocsparse_handle handle, rocsparse_int m, rocsparse_int nnz, const rocsparse_mat_descr descr, double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_mat_info info, rocsparse_solve_policy policy, void *temp_buffer)

Incomplete LU factorization with 0 fill-ins and no pivoting using CSR storage format.

rocsparse_csrilu0 computes the incomplete LU factorization with 0 fill-ins and no pivoting of a sparse m × m CSR matrix A, such that

A \approx LU

rocsparse_csrilu0 requires a user allocated temporary buffer. Its size is returned by rocsparse_scsrilu0_buffer_size() or rocsparse_dcsrilu0_buffer_size(). Furthermore, analysis meta data is required. It can be obtained by rocsparse_scsrilu0_analysis() or rocsparse_dcsrilu0_analysis(). rocsparse_csrilu0 reports the first zero pivot (either numerical or structural zero). The zero pivot status can be obtained by calling rocsparse_csrilu0_zero_pivot().

Note The sparse CSR matrix has to be sorted. This can be achieved by calling rocsparse_csrsort().

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Example Consider the sparse m × m matrix A, stored in CSR storage format. The following example computes the incomplete LU factorization M ≈ LU and solves the preconditioned system My = x.

// Create rocSPARSE handle
rocsparse_handle handle;
rocsparse_create_handle(&handle);

// Create matrix descriptor for M
rocsparse_mat_descr descr_M;
rocsparse_create_mat_descr(&descr_M);

// Create matrix descriptor for L
rocsparse_mat_descr descr_L;
rocsparse_create_mat_descr(&descr_L);
rocsparse_set_mat_fill_mode(descr_L, rocsparse_fill_mode_lower);
rocsparse_set_mat_diag_type(descr_L, rocsparse_diag_type_unit);

// Create matrix descriptor for U
rocsparse_mat_descr descr_U;
rocsparse_create_mat_descr(&descr_U);
rocsparse_set_mat_fill_mode(descr_U, rocsparse_fill_mode_upper);
rocsparse_set_mat_diag_type(descr_U, rocsparse_diag_type_non_unit);

// Create matrix info structure
rocsparse_mat_info info;
rocsparse_create_mat_info(&info);

// Obtain required buffer size
size_t buffer_size_M;
size_t buffer_size_L;
size_t buffer_size_U;
rocsparse_dcsrilu0_buffer_size(handle, m, nnz, descr_M,
                               csr_val, csr_row_ptr, csr_col_ind, info, &buffer_size_M);
rocsparse_dcsrsv_buffer_size(handle, rocsparse_operation_none, m, nnz, descr_L,
                             csr_val, csr_row_ptr, csr_col_ind, info, &buffer_size_L);
rocsparse_dcsrsv_buffer_size(handle, rocsparse_operation_none, m, nnz, descr_U,
                             csr_val, csr_row_ptr, csr_col_ind, info, &buffer_size_U);

size_t buffer_size = max(buffer_size_M, max(buffer_size_L, buffer_size_U));

// Allocate temporary buffer
void* temp_buffer;
hipMalloc(&temp_buffer, buffer_size);

// Perform analysis steps, using rocsparse_analysis_policy_reuse to improve
// computation performance
rocsparse_dcsrilu0_analysis(handle, m, nnz, descr_M,
                            csr_val, csr_row_ptr, csr_col_ind, info,
                            rocsparse_analysis_policy_reuse, rocsparse_solve_policy_auto,
                            temp_buffer);
rocsparse_dcsrsv_analysis(handle, rocsparse_operation_none, m, nnz, descr_L,
                          csr_val, csr_row_ptr, csr_col_ind, info,
                          rocsparse_analysis_policy_reuse, rocsparse_solve_policy_auto,
                          temp_buffer);
rocsparse_dcsrsv_analysis(handle, rocsparse_operation_none, m, nnz, descr_U,
                          csr_val, csr_row_ptr, csr_col_ind, info,
                          rocsparse_analysis_policy_reuse, rocsparse_solve_policy_auto,
                          temp_buffer);

// Check for zero pivot
rocsparse_int position;
if(rocsparse_status_zero_pivot == rocsparse_csrilu0_zero_pivot(handle, info, &position))
{
    printf("A has structural zero at A(%d,%d)\n", position, position);
}

// Compute incomplete LU factorization
rocsparse_dcsrilu0(handle, m, nnz, descr_M,
                   csr_val, csr_row_ptr, csr_col_ind, info,
                   rocsparse_solve_policy_auto, temp_buffer);

// Check for zero pivot
if(rocsparse_status_zero_pivot == rocsparse_csrilu0_zero_pivot(handle, info, &position))
{
    printf("U has structural and/or numerical zero at U(%d,%d)\n", position, position);
}

// Solve Lz = x
rocsparse_dcsrsv_solve(handle, rocsparse_operation_none, m, nnz, &alpha, descr_L,
                       csr_val, csr_row_ptr, csr_col_ind, info,
                       x, z, rocsparse_solve_policy_auto, temp_buffer);

// Solve Uy = z
rocsparse_dcsrsv_solve(handle, rocsparse_operation_none, m, nnz, &alpha, descr_U,
                       csr_val, csr_row_ptr, csr_col_ind, info,
                       z, y, rocsparse_solve_policy_auto, temp_buffer);

// Clean up
hipFree(temp_buffer);
rocsparse_destroy_mat_info(info);
rocsparse_destroy_mat_descr(descr_M);
rocsparse_destroy_mat_descr(descr_L);
rocsparse_destroy_mat_descr(descr_U);
rocsparse_destroy_handle(handle);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] descr: descriptor of the sparse CSR matrix.
• [inout] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [in] info: structure that holds the information collected during the analysis step.
• [in] policy: rocsparse_solve_policy_auto.
• [in] temp_buffer: temporary storage buffer allocated by the user.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, csr_val, csr_row_ptr or csr_col_ind pointer is invalid.
• rocsparse_status_arch_mismatch: the device is not supported.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: trans != rocsparse_operation_none or rocsparse_matrix_type != rocsparse_matrix_type_general.

2.9.5.13.5 rocsparse_csrilu0_clear()

rocsparse_status rocsparse_csrilu0_clear(rocsparse_handle handle, rocsparse_mat_info info)

Incomplete LU factorization with 0 fill-ins and no pivoting using CSR storage format.

rocsparse_csrilu0_clear deallocates all memory that was allocated by rocsparse_scsrilu0_analysis() or rocsparse_dcsrilu0_analysis(). This is especially useful if memory is an issue and the analysis data is not required for further computation.

Note Calling rocsparse_csrilu0_clear is optional. All allocated resources will be cleared when the opaque rocsparse_mat_info struct is destroyed using rocsparse_destroy_mat_info().

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [inout] info: structure that holds the information collected during the analysis step.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_pointer: info pointer is invalid.
• rocsparse_status_memory_error: the buffer holding the meta data could not be deallocated.
• rocsparse_status_internal_error: an internal error occurred.
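For illustration, assuming info holds csrilu0 analysis data that is no longer required:

// Free the csrilu0 analysis data explicitly ...
rocsparse_csrilu0_clear(handle, info);

// ... or simply rely on the final destruction of the info structure
rocsparse_destroy_mat_info(info);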


2.9.5.14 Sparse Conversion Functions

This module holds all sparse conversion routines. The sparse conversion routines describe operations on a matrix in sparse format to obtain a matrix in a different sparse format.

2.9.5.14.1 rocsparse_csr2coo()

rocsparse_status rocsparse_csr2coo(rocsparse_handle handle, const rocsparse_int *csr_row_ptr, rocsparse_int nnz, rocsparse_int m, rocsparse_int *coo_row_ind, rocsparse_index_base idx_base)

Convert a sparse CSR matrix into a sparse COO matrix.

rocsparse_csr2coo converts the CSR array containing the row offsets, that point to the start of every row, into a COO array of row indices.

Note It can also be used to convert a CSC array containing the column offsets into a COO array of column indices.

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Example This example converts a CSR matrix into a COO matrix.

//     1 2 0 3 0
// A = 0 4 5 0 0
//     6 0 0 7 8

rocsparse_int m   = 3;
rocsparse_int n   = 5;
rocsparse_int nnz = 8;

csr_row_ptr[m+1] = {0, 3, 5, 8};             // device memory
csr_col_ind[nnz] = {0, 1, 3, 1, 2, 0, 3, 4}; // device memory
csr_val[nnz]     = {1, 2, 3, 4, 5, 6, 7, 8}; // device memory

// Allocate COO matrix arrays
rocsparse_int* coo_row_ind;
rocsparse_int* coo_col_ind;
float* coo_val;

hipMalloc((void**)&coo_row_ind, sizeof(rocsparse_int) * nnz);
hipMalloc((void**)&coo_col_ind, sizeof(rocsparse_int) * nnz);
hipMalloc((void**)&coo_val, sizeof(float) * nnz);

// Convert the csr row offsets into coo row indices
rocsparse_csr2coo(handle, csr_row_ptr, nnz, m, coo_row_ind, rocsparse_index_base_zero);

// Copy the column and value arrays
hipMemcpy(coo_col_ind, csr_col_ind, sizeof(rocsparse_int) * nnz, hipMemcpyDeviceToDevice);
hipMemcpy(coo_val, csr_val, sizeof(float) * nnz, hipMemcpyDeviceToDevice);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] m: number of rows of the sparse CSR matrix.
• [out] coo_row_ind: array of nnz elements containing the row indices of the sparse COO matrix.
• [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or nnz is invalid.
• rocsparse_status_invalid_pointer: csr_row_ptr or coo_row_ind pointer is invalid.
• rocsparse_status_arch_mismatch: the device is not supported.

2.9.5.14.2 rocsparse_coo2csr()

rocsparse_status rocsparse_coo2csr(rocsparse_handle handle, const rocsparse_int *coo_row_ind, rocsparse_int nnz, rocsparse_int m, rocsparse_int *csr_row_ptr, rocsparse_index_base idx_base)

Convert a sparse COO matrix into a sparse CSR matrix.

rocsparse_coo2csr converts the COO array containing the row indices into a CSR array of row offsets, that point to the start of every row. It is assumed that the COO row index array is sorted.

Note It can also be used to convert a COO array containing the column indices into a CSC array of column offsets, that point to the start of every column. In that case, it is assumed that the COO column index array is sorted instead.

Note This function is non-blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.

Example This example converts a COO matrix into a CSR matrix.

//     1 2 0 3 0
// A = 0 4 5 0 0
//     6 0 0 7 8

rocsparse_int m   = 3;
rocsparse_int n   = 5;
rocsparse_int nnz = 8;

coo_row_ind[nnz] = {0, 0, 0, 1, 1, 2, 2, 2}; // device memory
coo_col_ind[nnz] = {0, 1, 3, 1, 2, 0, 3, 4}; // device memory
coo_val[nnz]     = {1, 2, 3, 4, 5, 6, 7, 8}; // device memory

// Allocate CSR matrix arrays
rocsparse_int* csr_row_ptr;
rocsparse_int* csr_col_ind;
float* csr_val;

hipMalloc((void**)&csr_row_ptr, sizeof(rocsparse_int) * (m + 1));
hipMalloc((void**)&csr_col_ind, sizeof(rocsparse_int) * nnz);
hipMalloc((void**)&csr_val, sizeof(float) * nnz);

// Convert the coo row indices into csr row offsets
rocsparse_coo2csr(handle, coo_row_ind, nnz, m, csr_row_ptr, rocsparse_index_base_zero);

// Copy the column and value arrays
hipMemcpy(csr_col_ind, coo_col_ind, sizeof(rocsparse_int) * nnz, hipMemcpyDeviceToDevice);
hipMemcpy(csr_val, coo_val, sizeof(float) * nnz, hipMemcpyDeviceToDevice);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] coo_row_ind: array of nnz elements containing the row indices of the sparse COO matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] m: number of rows of the sparse CSR matrix.
• [out] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one.

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or nnz is invalid.
• rocsparse_status_invalid_pointer: coo_row_ind or csr_row_ptr pointer is invalid.


2.9.5.14.3 rocsparse_csr2csc_buffer_size()

rocsparse_status rocsparse_csr2csc_buffer_size(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_action copy_values, size_t *buffer_size)

Convert a sparse CSR matrix into a sparse CSC matrix.

rocsparse_csr2csc_buffer_size returns the size of the temporary storage buffer required by rocsparse_scsr2csc() and rocsparse_dcsr2csc(). The temporary storage buffer must be allocated by the user.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] n: number of columns of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [in] copy_values: rocsparse_action_symbolic or rocsparse_action_numeric.
• [out] buffer_size: number of bytes of the temporary storage buffer required by rocsparse_scsr2csc() and rocsparse_dcsr2csc().

Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: csr_row_ptr, csr_col_ind or buffer_size pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.

2.9.5.14.4 rocsparse_csr2csc()

rocsparse_status rocsparse_scsr2csc(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, float *csc_val, rocsparse_int *csc_row_ind, rocsparse_int *csc_col_ptr, rocsparse_action copy_values, rocsparse_index_base idx_base, void *temp_buffer)

rocsparse_status rocsparse_dcsr2csc(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, double *csc_val, rocsparse_int *csc_row_ind, rocsparse_int *csc_col_ptr, rocsparse_action copy_values, rocsparse_index_base idx_base, void *temp_buffer)


Convert a sparse CSR matrix into a sparse CSC matrix.

rocsparse_csr2csc converts a CSR matrix into a CSC matrix. rocsparse_csr2csc can also be used to convert a CSC matrix into a CSR matrix. copy_values decides whether csc_val is being filled during conversion (rocsparse_action_numeric) or not (rocsparse_action_symbolic). rocsparse_csr2csc requires an extra temporary storage buffer that has to be allocated by the user. The storage buffer size can be determined by rocsparse_csr2csc_buffer_size().

Note The resulting matrix can also be seen as the transpose of the input matrix. Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Example This example computes the transpose of a CSR matrix.

//     1 2 0 3 0
// A = 0 4 5 0 0
//     6 0 0 7 8

rocsparse_int m_A   = 3;
rocsparse_int n_A   = 5;
rocsparse_int nnz_A = 8;

csr_row_ptr_A[m+1] = {0, 3, 5, 8};             // device memory
csr_col_ind_A[nnz] = {0, 1, 3, 1, 2, 0, 3, 4}; // device memory
csr_val_A[nnz]     = {1, 2, 3, 4, 5, 6, 7, 8}; // device memory

// Allocate memory for transposed CSR matrix
rocsparse_int m_T   = n_A;
rocsparse_int n_T   = m_A;
rocsparse_int nnz_T = nnz_A;

rocsparse_int* csr_row_ptr_T;
rocsparse_int* csr_col_ind_T;
float* csr_val_T;

hipMalloc((void**)&csr_row_ptr_T, sizeof(rocsparse_int) * (m_T + 1));
hipMalloc((void**)&csr_col_ind_T, sizeof(rocsparse_int) * nnz_T);
hipMalloc((void**)&csr_val_T, sizeof(float) * nnz_T);

// Obtain the temporary buffer size
size_t buffer_size;
rocsparse_csr2csc_buffer_size(handle,
                              m_A,
                              n_A,
                              nnz_A,
                              csr_row_ptr_A,
                              csr_col_ind_A,
                              rocsparse_action_numeric,
                              &buffer_size);

// Allocate temporary buffer
void* temp_buffer;
hipMalloc(&temp_buffer, buffer_size);

rocsparse_scsr2csc(handle,
                   m_A,
                   n_A,
                   nnz_A,
                   csr_val_A,
                   csr_row_ptr_A,
                   csr_col_ind_A,
                   csr_val_T,
                   csr_col_ind_T,
                   csr_row_ptr_T,
                   rocsparse_action_numeric,
                   rocsparse_index_base_zero,
                   temp_buffer);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] n: number of columns of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] csr_val: array of nnz elements of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [out] csc_val: array of nnz elements of the sparse CSC matrix.
• [out] csc_row_ind: array of nnz elements containing the row indices of the sparse CSC matrix.
• [out] csc_col_ptr: array of n+1 elements that point to the start of every column of the sparse CSC matrix.
• [in] copy_values: rocsparse_action_symbolic or rocsparse_action_numeric.
• [in] idx_base: rocsparse_index_base_zero or rocsparse_index_base_one.
• [in] temp_buffer: temporary storage buffer allocated by the user, size is returned by rocsparse_csr2csc_buffer_size().
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: csr_val, csr_row_ptr, csr_col_ind, csc_val, csc_row_ind, csc_col_ptr or temp_buffer pointer is invalid.
• rocsparse_status_arch_mismatch: the device is not supported.
• rocsparse_status_internal_error: an internal error occurred.


2.9.5.14.5 rocsparse_csr2ell_width()

rocsparse_status rocsparse_csr2ell_width(rocsparse_handle handle, rocsparse_int m, const rocsparse_mat_descr csr_descr, const rocsparse_int *csr_row_ptr, const rocsparse_mat_descr ell_descr, rocsparse_int *ell_width)
Convert a sparse CSR matrix into a sparse ELL matrix. rocsparse_csr2ell_width computes the maximum of the per row non-zero elements over all rows, the ELL width, for a given CSR matrix.

Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.
Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] csr_descr: descriptor of the sparse CSR matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] ell_descr: descriptor of the sparse ELL matrix. Currently, only rocsparse_matrix_type_general is supported.
• [out] ell_width: pointer to the number of non-zero elements per row in ELL storage format.
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m is invalid.
• rocsparse_status_invalid_pointer: csr_descr, csr_row_ptr, or ell_width pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: rocsparse_matrix_type != rocsparse_matrix_type_general.

2.9.5.14.6 rocsparse_csr2ell()

rocsparse_status rocsparse_scsr2ell(rocsparse_handle handle, rocsparse_int m, const rocsparse_mat_descr csr_descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, const rocsparse_mat_descr ell_descr, rocsparse_int ell_width, float *ell_val, rocsparse_int *ell_col_ind)

rocsparse_status rocsparse_dcsr2ell(rocsparse_handle handle, rocsparse_int m, const rocsparse_mat_descr csr_descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, const rocsparse_mat_descr ell_descr, rocsparse_int ell_width, double *ell_val, rocsparse_int *ell_col_ind)

Convert a sparse CSR matrix into a sparse ELL matrix.


rocsparse_csr2ell converts a CSR matrix into an ELL matrix. It is assumed that ell_val and ell_col_ind are allocated. The allocation size is the number of rows times the number of ELL non-zero elements per row, such that nnz_ELL = m * ell_width. The number of ELL non-zero elements per row is obtained by rocsparse_csr2ell_width().

Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Example This example converts a CSR matrix into an ELL matrix.

//     1 2 0 3 0
// A = 0 4 5 0 0
//     6 0 0 7 8

rocsparse_int m   = 3;
rocsparse_int n   = 5;
rocsparse_int nnz = 8;

csr_row_ptr[m+1] = {0, 3, 5, 8};             // device memory
csr_col_ind[nnz] = {0, 1, 3, 1, 2, 0, 3, 4}; // device memory
csr_val[nnz]     = {1, 2, 3, 4, 5, 6, 7, 8}; // device memory

// Create ELL matrix descriptor
rocsparse_mat_descr ell_descr;
rocsparse_create_mat_descr(&ell_descr);

// Obtain the ELL width
rocsparse_int ell_width;
rocsparse_csr2ell_width(handle, m, csr_descr, csr_row_ptr, ell_descr, &ell_width);

// Compute ELL non-zero entries
rocsparse_int ell_nnz = m * ell_width;

// Allocate ELL column and value arrays
rocsparse_int* ell_col_ind;
hipMalloc((void**)&ell_col_ind, sizeof(rocsparse_int) * ell_nnz);

float* ell_val;
hipMalloc((void**)&ell_val, sizeof(float) * ell_nnz);

// Format conversion
rocsparse_scsr2ell(handle,
                   m,
                   csr_descr,
                   csr_val,
                   csr_row_ptr,
                   csr_col_ind,
                   ell_descr,
                   ell_width,
                   ell_val,
                   ell_col_ind);

Parameters


• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] csr_descr: descriptor of the sparse CSR matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] csr_val: array containing the values of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array containing the column indices of the sparse CSR matrix.
• [in] ell_descr: descriptor of the sparse ELL matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] ell_width: number of non-zero elements per row in ELL storage format.
• [out] ell_val: array of m times ell_width elements of the sparse ELL matrix.
• [out] ell_col_ind: array of m times ell_width elements containing the column indices of the sparse ELL matrix.
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m or ell_width is invalid.
• rocsparse_status_invalid_pointer: csr_descr, csr_val, csr_row_ptr, csr_col_ind, ell_descr, ell_val or ell_col_ind pointer is invalid.
• rocsparse_status_not_implemented: rocsparse_matrix_type != rocsparse_matrix_type_general.

2.9.5.14.7 rocsparse_ell2csr_nnz()

rocsparse_status rocsparse_ell2csr_nnz(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, const rocsparse_mat_descr ell_descr, rocsparse_int ell_width, const rocsparse_int *ell_col_ind, const rocsparse_mat_descr csr_descr, rocsparse_int *csr_row_ptr, rocsparse_int *csr_nnz)
Convert a sparse ELL matrix into a sparse CSR matrix. rocsparse_ell2csr_nnz computes the total CSR non-zero elements and the CSR row offsets, that point to the start of every row of the sparse CSR matrix, for a given ELL matrix. It is assumed that csr_row_ptr has been allocated with size m + 1.

Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished.
Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse ELL matrix.
• [in] n: number of columns of the sparse ELL matrix.
• [in] ell_descr: descriptor of the sparse ELL matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] ell_width: number of non-zero elements per row in ELL storage format.
• [in] ell_col_ind: array of m times ell_width elements containing the column indices of the sparse ELL matrix.
• [in] csr_descr: descriptor of the sparse CSR matrix. Currently, only rocsparse_matrix_type_general is supported.
• [out] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [out] csr_nnz: pointer to the total number of non-zero elements in CSR storage format.
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or ell_width is invalid.
• rocsparse_status_invalid_pointer: ell_descr, ell_col_ind, csr_descr, csr_row_ptr or csr_nnz pointer is invalid.
• rocsparse_status_not_implemented: rocsparse_matrix_type != rocsparse_matrix_type_general.
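No example accompanies this function, so the following minimal sketch illustrates how the signature above can be used to size the CSR arrays. The handle, the two matrix descriptors and the device array ell_col_ind are assumed to exist already, and csr_nnz is read back on the host assuming the default host pointer mode.

// Minimal sketch (assumptions: handle, ell_descr, csr_descr and the device
// array ell_col_ind exist; m, n and ell_width describe the ELL matrix).
rocsparse_int* csr_row_ptr;
hipMalloc((void**)&csr_row_ptr, sizeof(rocsparse_int) * (m + 1));

rocsparse_int csr_nnz;
rocsparse_ell2csr_nnz(handle,
                      m,
                      n,
                      ell_descr,
                      ell_width,
                      ell_col_ind,
                      csr_descr,
                      csr_row_ptr,
                      &csr_nnz);

// csr_nnz can now be used to allocate csr_col_ind and csr_val for the
// subsequent ELL to CSR conversion step.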

2.9.5.14.8 rocsparse_ell2csr()

rocsparse_status rocsparse_csr2csc_buffer_size(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_action copy_values, size_t *buffer_size)
Convert a sparse CSR matrix into a sparse CSC matrix. rocsparse_csr2csc_buffer_size returns the size of the temporary storage buffer required by rocsparse_scsr2csc() and rocsparse_dcsr2csc(). The temporary storage buffer must be allocated by the user.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] n: number of columns of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [in] copy_values: rocsparse_action_symbolic or rocsparse_action_numeric.
• [out] buffer_size: number of bytes of the temporary storage buffer required by rocsparse_csr2csc().
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: csr_row_ptr, csr_col_ind or buffer_size pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.

2.9.5.14.9 rocsparse_csr2hyb()

rocsparse_status rocsparse_scsr2hyb(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, const rocsparse_mat_descr descr, const float *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_hyb_mat hyb, rocsparse_int user_ell_width, rocsparse_hyb_partition partition_type)

rocsparse_status rocsparse_dcsr2hyb(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, const rocsparse_mat_descr descr, const double *csr_val, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, rocsparse_hyb_mat hyb, rocsparse_int user_ell_width, rocsparse_hyb_partition partition_type)

Convert a sparse CSR matrix into a sparse HYB matrix. rocsparse_csr2hyb converts a CSR matrix into a HYB matrix. It is assumed that hyb has been initialized with rocsparse_create_hyb_mat().

Note This function requires a significant amount of storage for the HYB matrix, depending on the matrix structure. Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Example This example converts a CSR matrix into a HYB matrix using user defined partitioning.

// Create HYB matrix structure
rocsparse_hyb_mat hyb;
rocsparse_create_hyb_mat(&hyb);

// User defined ell width
rocsparse_int user_ell_width = 5;

// Perform the conversion
rocsparse_scsr2hyb(handle,
                   m,
                   n,
                   descr,
                   csr_val,
                   csr_row_ptr,
                   csr_col_ind,
                   hyb,
                   user_ell_width,
                   rocsparse_hyb_partition_user);

// Do some work

// Clean up
rocsparse_destroy_hyb_mat(hyb);


Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] n: number of columns of the sparse CSR matrix.
• [in] descr: descriptor of the sparse CSR matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] csr_val: array containing the values of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array containing the column indices of the sparse CSR matrix.
• [out] hyb: sparse matrix in HYB format.
• [in] user_ell_width: width of the ELL part of the HYB matrix (only required if partition_type == rocsparse_hyb_partition_user).
• [in] partition_type: rocsparse_hyb_partition_auto (recommended), rocsparse_hyb_partition_user or rocsparse_hyb_partition_max.
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or user_ell_width is invalid.
• rocsparse_status_invalid_value: partition_type is invalid.
• rocsparse_status_invalid_pointer: descr, hyb, csr_val, csr_row_ptr or csr_col_ind pointer is invalid.
• rocsparse_status_memory_error: the buffer for the HYB matrix could not be allocated.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: rocsparse_matrix_type != rocsparse_matrix_type_general.

2.9.5.14.10 rocsparse_create_identity_permutation()

rocsparse_status rocsparse_create_identity_permutation(rocsparse_handle handle, rocsparse_int n, rocsparse_int *p)
Create the identity map. rocsparse_create_identity_permutation stores the identity map in p, such that p = 0:1:(n-1).

for(i = 0; i < n; ++i)
{
    p[i] = i;
}

Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Example The following example creates an identity permutation.


rocsparse_int size = 200;

// Allocate memory to hold the identity map
rocsparse_int* perm;
hipMalloc((void**)&perm, sizeof(rocsparse_int) * size);

// Fill perm with the identity permutation
rocsparse_create_identity_permutation(handle, size, perm);

Parameters • [in] handle: handle to the rocsparse library context queue. • [in] n: size of the map p. • [out] p: array of n integers containing the map. Return Value • rocsparse_status_success: the operation completed successfully. • rocsparse_status_invalid_handle: the library context was not initialized. • rocsparse_status_invalid_size: n is invalid. • rocsparse_status_invalid_pointer: p pointer is invalid.

2.9.5.14.11 rocsparse_csrsort_buffer_size()

rocsparse_status rocsparse_csrsort_buffer_size(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const rocsparse_int *csr_row_ptr, const rocsparse_int *csr_col_ind, size_t *buffer_size)
Sort a sparse CSR matrix. rocsparse_csrsort_buffer_size returns the size of the temporary storage buffer required by rocsparse_csrsort(). The temporary storage buffer must be allocated by the user.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] n: number of columns of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [in] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [out] buffer_size: number of bytes of the temporary storage buffer required by rocsparse_csrsort().
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: csr_row_ptr, csr_col_ind or buffer_size pointer is invalid.

2.9.5.14.12 rocsparse_csrsort()

rocsparse_status rocsparse_csrsort(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const rocsparse_mat_descr descr, const rocsparse_int *csr_row_ptr, rocsparse_int *csr_col_ind, rocsparse_int *perm, void *temp_buffer)
Sort a sparse CSR matrix. rocsparse_csrsort sorts a matrix in CSR format. The sorted permutation vector perm can be used to obtain a sorted csr_val array. In this case, perm must be initialized as the identity permutation, see rocsparse_create_identity_permutation(). rocsparse_csrsort requires an extra temporary storage buffer that has to be allocated by the user. The buffer size can be determined by rocsparse_csrsort_buffer_size().

Note perm can be NULL if a sorted permutation vector is not required. Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Example The following example sorts a 3 × 3 CSR matrix.

//     1 2 3
// A = 4 5 6
//     7 8 9
rocsparse_int m   = 3;
rocsparse_int n   = 3;
rocsparse_int nnz = 9;

csr_row_ptr[m+1] = {0, 3, 6, 9};                // device memory
csr_col_ind[nnz] = {2, 0, 1, 0, 1, 2, 0, 2, 1}; // device memory
csr_val[nnz]     = {3, 1, 2, 4, 5, 6, 7, 9, 8}; // device memory

// Create permutation vector perm as the identity map
rocsparse_int* perm;
hipMalloc((void**)&perm, sizeof(rocsparse_int) * nnz);
rocsparse_create_identity_permutation(handle, nnz, perm);

// Allocate temporary buffer
size_t buffer_size;
void* temp_buffer;
rocsparse_csrsort_buffer_size(handle, m, n, nnz, csr_row_ptr, csr_col_ind, &buffer_size);
hipMalloc(&temp_buffer, buffer_size);

// Sort the CSR matrix
rocsparse_csrsort(handle, m, n, nnz, descr, csr_row_ptr, csr_col_ind, perm, temp_buffer);

// Gather sorted csr_val array
float* csr_val_sorted;
hipMalloc((void**)&csr_val_sorted, sizeof(float) * nnz);
rocsparse_sgthr(handle, nnz, csr_val, csr_val_sorted, perm, rocsparse_index_base_zero);

// Clean up
hipFree(temp_buffer);
hipFree(perm);
hipFree(csr_val);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse CSR matrix.
• [in] n: number of columns of the sparse CSR matrix.
• [in] nnz: number of non-zero entries of the sparse CSR matrix.
• [in] descr: descriptor of the sparse CSR matrix. Currently, only rocsparse_matrix_type_general is supported.
• [in] csr_row_ptr: array of m+1 elements that point to the start of every row of the sparse CSR matrix.
• [inout] csr_col_ind: array of nnz elements containing the column indices of the sparse CSR matrix.
• [inout] perm: array of nnz integers containing the unsorted map indices, can be NULL.
• [in] temp_buffer: temporary storage buffer allocated by the user, size is returned by rocsparse_csrsort_buffer_size().
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: descr, csr_row_ptr, csr_col_ind or temp_buffer pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.
• rocsparse_status_not_implemented: rocsparse_matrix_type != rocsparse_matrix_type_general.

2.9.5.14.13 rocsparse_coosort_buffer_size()

rocsparse_status rocsparse_coosort_buffer_size(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, const rocsparse_int *coo_row_ind, const rocsparse_int *coo_col_ind, size_t *buffer_size)
Sort a sparse COO matrix. rocsparse_coosort_buffer_size returns the size of the temporary storage buffer that is required by rocsparse_coosort_by_row() and rocsparse_coosort_by_column(). The temporary storage buffer has to be allocated by the user.

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse COO matrix.
• [in] n: number of columns of the sparse COO matrix.
• [in] nnz: number of non-zero entries of the sparse COO matrix.
• [in] coo_row_ind: array of nnz elements containing the row indices of the sparse COO matrix.
• [in] coo_col_ind: array of nnz elements containing the column indices of the sparse COO matrix.
• [out] buffer_size: number of bytes of the temporary storage buffer required by rocsparse_coosort_by_row() and rocsparse_coosort_by_column().
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: coo_row_ind, coo_col_ind or buffer_size pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.

2.9.5.14.14 rocsparse_coosort_by_row()

rocsparse_status rocsparse_coosort_by_row(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, rocsparse_int *coo_row_ind, rocsparse_int *coo_col_ind, rocsparse_int *perm, void *temp_buffer)
Sort a sparse COO matrix by row. rocsparse_coosort_by_row sorts a matrix in COO format by row. The sorted permutation vector perm can be used to obtain a sorted coo_val array. In this case, perm must be initialized as the identity permutation, see rocsparse_create_identity_permutation(). rocsparse_coosort_by_row requires an extra temporary storage buffer that has to be allocated by the user. The buffer size can be determined by rocsparse_coosort_buffer_size().

Note perm can be NULL if a sorted permutation vector is not required. Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Example The following example sorts a 3 × 3 COO matrix by row indices.

//     1 2 3
// A = 4 5 6
//     7 8 9
rocsparse_int m   = 3;
rocsparse_int n   = 3;
rocsparse_int nnz = 9;

coo_row_ind[nnz] = {0, 1, 2, 0, 1, 2, 0, 1, 2}; // device memory
coo_col_ind[nnz] = {0, 0, 0, 1, 1, 1, 2, 2, 2}; // device memory
coo_val[nnz]     = {1, 4, 7, 2, 5, 8, 3, 6, 9}; // device memory

// Create permutation vector perm as the identity map
rocsparse_int* perm;
hipMalloc((void**)&perm, sizeof(rocsparse_int) * nnz);
rocsparse_create_identity_permutation(handle, nnz, perm);

// Allocate temporary buffer
size_t buffer_size;
void* temp_buffer;
rocsparse_coosort_buffer_size(handle, m, n, nnz, coo_row_ind, coo_col_ind, &buffer_size);
hipMalloc(&temp_buffer, buffer_size);

// Sort the COO matrix
rocsparse_coosort_by_row(handle, m, n, nnz, coo_row_ind, coo_col_ind, perm, temp_buffer);

// Gather sorted coo_val array
float* coo_val_sorted;
hipMalloc((void**)&coo_val_sorted, sizeof(float) * nnz);
rocsparse_sgthr(handle, nnz, coo_val, coo_val_sorted, perm, rocsparse_index_base_zero);

// Clean up
hipFree(temp_buffer);
hipFree(perm);
hipFree(coo_val);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse COO matrix.
• [in] n: number of columns of the sparse COO matrix.
• [in] nnz: number of non-zero entries of the sparse COO matrix.
• [inout] coo_row_ind: array of nnz elements containing the row indices of the sparse COO matrix.
• [inout] coo_col_ind: array of nnz elements containing the column indices of the sparse COO matrix.
• [inout] perm: array of nnz integers containing the unsorted map indices, can be NULL.
• [in] temp_buffer: temporary storage buffer allocated by the user, size is returned by rocsparse_coosort_buffer_size().
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: coo_row_ind, coo_col_ind or temp_buffer pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.

2.9.5.14.15 rocsparse_coosort_by_column()

rocsparse_status rocsparse_coosort_by_column(rocsparse_handle handle, rocsparse_int m, rocsparse_int n, rocsparse_int nnz, rocsparse_int *coo_row_ind, rocsparse_int *coo_col_ind, rocsparse_int *perm, void *temp_buffer)
Sort a sparse COO matrix by column. rocsparse_coosort_by_column sorts a matrix in COO format by column. The sorted permutation vector perm can be used to obtain a sorted coo_val array. In this case, perm must be initialized as the identity permutation, see rocsparse_create_identity_permutation(). rocsparse_coosort_by_column requires an extra temporary storage buffer that has to be allocated by the user. The buffer size can be determined by rocsparse_coosort_buffer_size().

Note perm can be NULL if a sorted permutation vector is not required. Note This function is non blocking and executed asynchronously with respect to the host. It may return before the actual computation has finished. Example The following example sorts a 3 × 3 COO matrix by column indices.

//     1 2 3
// A = 4 5 6
//     7 8 9
rocsparse_int m   = 3;
rocsparse_int n   = 3;
rocsparse_int nnz = 9;

coo_row_ind[nnz] = {0, 0, 0, 1, 1, 1, 2, 2, 2}; // device memory
coo_col_ind[nnz] = {0, 1, 2, 0, 1, 2, 0, 1, 2}; // device memory
coo_val[nnz]     = {1, 2, 3, 4, 5, 6, 7, 8, 9}; // device memory

// Create permutation vector perm as the identity map
rocsparse_int* perm;
hipMalloc((void**)&perm, sizeof(rocsparse_int) * nnz);
rocsparse_create_identity_permutation(handle, nnz, perm);

// Allocate temporary buffer
size_t buffer_size;
void* temp_buffer;
rocsparse_coosort_buffer_size(handle, m, n, nnz, coo_row_ind, coo_col_ind, &buffer_size);
hipMalloc(&temp_buffer, buffer_size);

// Sort the COO matrix
rocsparse_coosort_by_column(handle, m, n, nnz, coo_row_ind, coo_col_ind, perm, temp_buffer);

// Gather sorted coo_val array
float* coo_val_sorted;
hipMalloc((void**)&coo_val_sorted, sizeof(float) * nnz);
rocsparse_sgthr(handle, nnz, coo_val, coo_val_sorted, perm, rocsparse_index_base_zero);

// Clean up
hipFree(temp_buffer);
hipFree(perm);
hipFree(coo_val);

Parameters
• [in] handle: handle to the rocsparse library context queue.
• [in] m: number of rows of the sparse COO matrix.
• [in] n: number of columns of the sparse COO matrix.
• [in] nnz: number of non-zero entries of the sparse COO matrix.
• [inout] coo_row_ind: array of nnz elements containing the row indices of the sparse COO matrix.
• [inout] coo_col_ind: array of nnz elements containing the column indices of the sparse COO matrix.
• [inout] perm: array of nnz integers containing the unsorted map indices, can be NULL.
• [in] temp_buffer: temporary storage buffer allocated by the user, size is returned by rocsparse_coosort_buffer_size().
Return Value
• rocsparse_status_success: the operation completed successfully.
• rocsparse_status_invalid_handle: the library context was not initialized.
• rocsparse_status_invalid_size: m, n or nnz is invalid.
• rocsparse_status_invalid_pointer: coo_row_ind, coo_col_ind or temp_buffer pointer is invalid.
• rocsparse_status_internal_error: an internal error occurred.

2.9.6 hipSPARSE hipSPARSE is a SPARSE marshalling library, with multiple supported backends. It sits between the application and a ‘worker’ SPARSE library, marshalling inputs into the backend library and marshalling results back to the application. hipSPARSE exports an interface that does not require the client to change, regardless of the chosen backend.


Currently, hipSPARSE supports rocSPARSE and cuSPARSE as backends. Refer to the hipSPARSE wiki page for more details.

2.9.6.1 Installing pre-built packages

Download pre-built packages either from ROCm's package servers or by clicking the GitHub releases tab and manually downloading (which could be newer). Release notes are available for each release on the releases tab.
• sudo apt update && sudo apt install hipsparse

2.9.6.2 Quickstart hipSPARSE build

Bash helper build script (Ubuntu only) The root of this repository has a helper bash script install.sh to build and install hipSPARSE on Ubuntu with a single command. It does not take a lot of options and hard-codes configuration that can be specified through invoking cmake directly, but it’s a great way to get started quickly and can serve as an example of how to build/install. A few commands in the script need sudo access, so it may prompt you for a password.

./install.sh -h   -- shows help
./install.sh -id  -- build library, build dependencies and install (-d flag only needs to be passed once on a system)

2.9.6.3 Manual build (all supported platforms)

If you use a distro other than Ubuntu, or would like more control over the build process, the hipSPARSE build wiki has helpful information on how to configure cmake and manually build.

2.9.6.4 Functions supported

A list of functions exported from hipSPARSE can be found on the wiki.

2.9.6.5 hipSPARSE interface examples

2.9.6.5 hipSPARSE interface examples The hipSPARSE interface is compatible with the rocSPARSE and cuSPARSE-v2 APIs. Porting a CUDA application that originally calls the cuSPARSE API to an application calling the hipSPARSE API should be relatively straightforward. For example, the hipSPARSE SCSRMV interface is

2.9.6.6 CSRMV API

hipsparseStatus_t hipsparseScsrmv(hipsparseHandle_t handle,
                                  hipsparseOperation_t transA,
                                  int m,
                                  int n,
                                  int nnz,
                                  const float* alpha,
                                  const hipsparseMatDescr_t descrA,
                                  const float* csrValA,
                                  const int* csrRowPtrA,
                                  const int* csrColIndA,
                                  const float* x,
                                  const float* beta,
                                  float* y);

hipSPARSE assumes matrix A and vectors x, y are allocated in GPU memory space filled with data. Users are responsible for copying data from/to the host and device memory.
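As a hedged illustration (not part of the official hipSPARSE documentation), a call matching the prototype above could look as follows. The handle, the matrix descriptor descrA and the device arrays dCsrVal, dCsrRowPtr, dCsrColInd, dX and dY are placeholder names that are assumed to have been created and populated by the application beforehand.

// Minimal sketch of y = alpha * A * x + beta * y on the device.
float alpha = 1.0f;
float beta  = 0.0f;

hipsparseStatus_t status = hipsparseScsrmv(handle,
                                           HIPSPARSE_OPERATION_NON_TRANSPOSE,
                                           m, n, nnz,
                                           &alpha,
                                           descrA,
                                           dCsrVal,
                                           dCsrRowPtr,
                                           dCsrColInd,
                                           dX,
                                           &beta,
                                           dY);
// status can be compared against HIPSPARSE_STATUS_SUCCESS for error handling.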


2.9.7 rocALUTION

2.9.7.1 Introduction

2.9.7.2 Overview

rocALUTION is a sparse linear algebra library with focus on exploring fine-grained parallelism, targeting modern processors and accelerators including multi/many-core CPU and GPU platforms. The main goal of this package is to provide a portable library for iterative sparse methods on state of the art hardware. rocALUTION can be seen as middle-ware between different parallel backends and application specific packages.
The major features and characteristics of the library are
• Various backends
  – Host - fallback backend, designed for CPUs
  – GPU/HIP - accelerator backend, designed for HIP capable AMD GPUs
  – OpenMP - designed for multi-core CPUs
  – MPI - designed for multi-node and multi-GPU configurations
• Easy to use
  The syntax and structure of the library provide easy learning curves. With the help of the examples, anyone can try out the library - no knowledge in HIP, OpenMP or MPI programming required.
• No special hardware requirements
  There are no hardware requirements to install and run rocALUTION. If a GPU device and HIP is available, the library will use them.
• Variety of iterative solvers
  – Fixed-Point iteration - Jacobi, Gauss-Seidel, Symmetric-Gauss Seidel, SOR and SSOR
  – Krylov subspace methods - CR, CG, BiCGStab, BiCGStab(l), GMRES, IDR, QMRCGSTAB, Flexible CG/GMRES
  – Mixed-precision defect-correction scheme
  – Chebyshev iteration
  – Multiple MultiGrid schemes, geometric and algebraic
• Various preconditioners
  – Matrix splitting - Jacobi, (Multi-colored) Gauss-Seidel, Symmetric Gauss-Seidel, SOR, SSOR
  – Factorization - ILU(0), ILU(p) (based on levels), ILU(p,q) (power(q)-pattern method), Multi-Elimination ILU (nested/recursive), ILUT (based on threshold) and IC(0)
  – Approximate Inverse - Chebyshev matrix-valued polynomial, SPAI, FSAI and TNS
  – Diagonal-based preconditioner for Saddle-point problems
  – Block-type of sub-preconditioners/solvers
  – Additive Schwarz and Restricted Additive Schwarz
  – Variable type preconditioners
• Generic and robust design
  rocALUTION is based on a generic and robust design allowing expansion in the direction of new solvers and preconditioners and support for various hardware types. Furthermore, the design of the library allows the use of all solvers as preconditioners in other solvers. For example you can easily define a CG solver with a Multi-Elimination preconditioner, where the last-block is preconditioned with another Chebyshev iteration method which is preconditioned with a multi-colored Symmetric Gauss-Seidel scheme. A minimal solver/preconditioner sketch is shown after this list.


• Portable code and results
  All code based on rocALUTION is portable and independent of HIP or OpenMP. The code will compile and run everywhere. All solvers and preconditioners are based on a single source code, which delivers portable results across all supported backends (variations are possible due to different rounding modes on the hardware). The only difference which you can see for a hardware change is the performance variation.
• Support for several sparse matrix formats
  Compressed Sparse Row (CSR), Modified Compressed Sparse Row (MCSR), Dense (DENSE), Coordinate (COO), ELL, Diagonal (DIA), Hybrid format of ELL and COO (HYB).
The code is open-source under the MIT license and hosted here: https://github.com/ROCmSoftwarePlatform/rocALUTION
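The sketch below illustrates the solver composition described above: a CG solver combined with a preconditioner on a matrix read from file. It is a minimal sketch, not authoritative reference code; the classes and methods used (CG, Jacobi, LocalMatrix, LocalVector, ReadFileMTX, Allocate, SetOperator, SetPreconditioner, Build, Solve) are assumed to behave as in the examples shipped with the library and the API section later in this chapter, and gr_30_30.mtx is the Laplace matrix used in the Simple Test section below.

#include <rocalution.hpp>   // assumed header name, as in the shipped examples

using namespace rocalution;

int main(int argc, char* argv[])
{
    init_rocalution();

    // Operator and vectors
    LocalMatrix<double> mat;
    LocalVector<double> rhs, x;

    mat.ReadFileMTX("gr_30_30.mtx");
    rhs.Allocate("rhs", mat.GetM());
    x.Allocate("x", mat.GetN());
    rhs.Ones();
    x.Zeros();

    // CG solver preconditioned with Jacobi
    CG<LocalMatrix<double>, LocalVector<double>, double> ls;
    Jacobi<LocalMatrix<double>, LocalVector<double>, double> p;

    ls.SetOperator(mat);
    ls.SetPreconditioner(p);
    ls.Build();
    ls.Solve(rhs, &x);

    ls.Clear();
    stop_rocalution();
    return 0;
}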

2.9.7.3 Building and Installing

2.9.7.4 Installing from AMD ROCm repositories

TODO, not yet available

2.9.7.5 Building rocALUTION from Open-Source repository

2.9.7.6 Download rocALUTION

The rocALUTION source code is available at the rocALUTION github page. Download the master branch using:

git clone -b master https://github.com/ROCmSoftwarePlatform/rocALUTION.git
cd rocALUTION

Note that if you want to contribute to rocALUTION, you will need to checkout the develop branch instead of the master branch. See rocalution_contributing for further details. Below are steps to build different packages of the library, including dependencies and clients. It is recommended to install rocALUTION using the install.sh script.

2.9.7.7 Using install.sh to build dependencies + library

The following table lists common uses of install.sh to build dependencies + library. Accelerator support via HIP and OpenMP will be enabled by default, whereas MPI is disabled.


Command              Description
./install.sh -h      Print help information.
./install.sh -d      Build dependencies and library in your local directory. The -d flag only needs to be used once. For subsequent invocations of install.sh it is not necessary to rebuild the dependencies.
./install.sh         Build library in your local directory. It is assumed dependencies are available.
./install.sh -i      Build library, then build and install rocALUTION package in /opt/rocm/rocalution. You will be prompted for sudo access. This will install for all users.
./install.sh --host  Build library in your local directory without HIP support. It is assumed dependencies are available.
./install.sh --mpi   Build library in your local directory with HIP and MPI support. It is assumed dependencies are available.

2.9.7.8 Using install.sh to build dependencies + library + client

The client contains example code, unit tests and benchmarks. Common uses of install.sh to build them are listed in the table below.

Command              Description
./install.sh -h      Print help information.
./install.sh -dc     Build dependencies, library and client in your local directory. The -d flag only needs to be used once. For subsequent invocations of install.sh it is not necessary to rebuild the dependencies.
./install.sh -c      Build library and client in your local directory. It is assumed dependencies are available.
./install.sh -idc    Build library, dependencies and client, then build and install rocALUTION package in /opt/rocm/rocalution. You will be prompted for sudo access. This will install for all users.
./install.sh -ic     Build library and client, then build and install rocALUTION package in /opt/rocm/rocalution. You will be prompted for sudo access. This will install for all users.

2.9.7.9 Using individual commands to build rocALUTION

CMake 3.5 or later is required in order to build rocALUTION. rocALUTION can be built with cmake using the following commands:

# Create and change to build directory
mkdir -p build/release ; cd build/release

# Default install path is /opt/rocm, use -DCMAKE_INSTALL_PREFIX= to adjust it
# Configure rocALUTION
# Build options:
#   SUPPORT_HIP    - build rocALUTION with HIP support (ON)
#   SUPPORT_OMP    - build rocALUTION with OpenMP support (ON)
#   SUPPORT_MPI    - build rocALUTION with MPI (multi-node) support (OFF)
#   BUILD_SHARED   - build rocALUTION as shared library (ON, recommended)
#   BUILD_EXAMPLES - build rocALUTION examples (ON)
cmake ../.. -DSUPPORT_HIP=ON \
            -DSUPPORT_MPI=OFF \
            -DSUPPORT_OMP=ON

# Compile rocALUTION library
make -j$(nproc)

# Install rocALUTION to /opt/rocm
sudo make install

GoogleTest is required in order to build the rocALUTION client. rocALUTION with dependencies and client can be built using the following commands:

# Install googletest
mkdir -p build/release/deps ; cd build/release/deps
cmake ../../../deps
sudo make -j$(nproc) install

# Change to build directory
cd ..

# Default install path is /opt/rocm, use -DCMAKE_INSTALL_PREFIX= to adjust it
cmake ../.. -DBUILD_CLIENTS_TESTS=ON \
            -DBUILD_CLIENTS_SAMPLES=ON

# Compile rocALUTION library
make -j$(nproc)

# Install rocALUTION to /opt/rocm
sudo make install

The compilation process produces the shared library file librocalution.so and, if HIP support is enabled, librocalution_hip.so. Ensure that the library objects can be found in your library path. If you do not copy the library to a specific location, you can add the path under Linux via the LD_LIBRARY_PATH variable.

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:

2.9.7.10 Common build problems

1. Issue: HIP (/opt/rocm/hip) was built using hcc 1.0.xxx-xxx-xxx-xxx, but you are using /opt/rocm/bin/hcc with version 1.0.yyy-yyy-yyy-yyy from hipcc (version mismatch). Please rebuild HIP including cmake or update HCC_HOME variable.
   Solution: Download HIP from github and use hcc to build from source and then use the built HIP instead of /opt/rocm/hip.

2. Issue: For Carrizo - HCC RUNTIME ERROR: Failed to find compatible kernel
   Solution: Add the following to the cmake command when configuring: -DCMAKE_CXX_FLAGS="--amdgpu-target=gfx801"

3. Issue: For MI25 (Vega10 Server) - HCC RUNTIME ERROR: Failed to find compatible kernel
   Solution: export HCC_AMDGPU_TARGET=gfx900

4. Issue: Could not find a package configuration file provided by "ROCM" with any of the following names: ROCMConfig.cmake, rocm-config.cmake
   Solution: Install ROCm cmake modules

5. Issue: Could not find a package configuration file provided by "ROCSPARSE" with any of the following names: ROCSPARSE.cmake, rocsparse-config.cmake
   Solution: Install rocSPARSE

6. Issue: Could not find a package configuration file provided by "ROCBLAS" with any of the following names: ROCBLAS.cmake, rocblas-config.cmake
   Solution: Install rocBLAS

2.9.7.11 Simple Test

You can test the installation by running a CG solver on a Laplace matrix. After compiling the library you can perform the CG solver test by executing

cd rocALUTION/build/release/examples
wget ftp://math.nist.gov/pub/MatrixMarket2/Harwell-Boeing/laplace/gr_30_30.mtx.gz
gzip -d gr_30_30.mtx.gz

./clients/staging/cg gr_30_30.mtx

For more information regarding the rocALUTION library and the corresponding API documentation, refer to the rocALUTION documentation.

2.9.7.12 API

This section provides details of the library API

2.9.7.12.1 Host Utility Functions

template <typename DataType> void rocalution::allocate_host(int size, DataType **ptr)
Allocate buffer on the host. allocate_host allocates a buffer on the host.

Parameters
• [in] size: number of elements the buffer needs to be allocated for
• [out] ptr: pointer to the position in memory where the buffer should be allocated, it is expected that *ptr == NULL
Template Parameters
• DataType: can be char, int, unsigned int, float, double, std::complex<float> or std::complex<double>.

template <typename DataType> void rocalution::free_host(DataType **ptr)
Free buffer on the host. free_host deallocates a buffer on the host. *ptr will be set to NULL after successful deallocation.

Parameters
• [inout] ptr: pointer to the position in memory where the buffer should be deallocated, it is expected that *ptr != NULL
Template Parameters
• DataType: can be char, int, unsigned int, float, double, std::complex<float> or std::complex<double>.

template <typename DataType> void rocalution::set_to_zero_host(int size, DataType *ptr)
Set a host buffer to zero. set_to_zero_host sets a host buffer to zero.

Parameters
• [in] size: number of elements
• [inout] ptr: pointer to the host buffer
Template Parameters
• DataType: can be char, int, unsigned int, float, double, std::complex<float> or std::complex<double>.
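A minimal sketch combining the three host utility functions documented above; the include is assumed to be the same header used in the other examples in this section.

#include <rocalution.hpp>   // assumed header, as in the examples below

using namespace rocalution;

int main(void)
{
    int size = 100;
    double* buffer = NULL;

    allocate_host(size, &buffer);     // *ptr == NULL is expected on entry
    set_to_zero_host(size, buffer);   // zero-initialize the buffer

    // ... use buffer ...

    free_host(&buffer);               // buffer is set to NULL afterwards
    return 0;
}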

double rocalution::rocalution_time(void) Return current time in microseconds.

2.9.7.12.2 Backend Manager

int rocalution::init_rocalution(int rank = -1, int dev_per_node = 1)
Initialize rocALUTION platform. init_rocalution defines a backend descriptor with information about the hardware and its specifications. All objects created after that contain a copy of this descriptor. If the specifications of the global descriptor are changed (e.g. set different number of threads) and new objects are created, only the new objects will use the new configurations.
For control, the library provides the following functions
• set_device_rocalution() is a unified function to select a specific device. If you have compiled the library with a backend and for this backend there are several available devices, you can use this function to select a particular one. This function has to be called before init_rocalution().
• set_omp_threads_rocalution() sets the number of OpenMP threads. This function has to be called after init_rocalution().

Example


#include <rocalution.hpp>

using namespace rocalution;

int main(int argc, char* argv[])
{
    init_rocalution();

    // ...

    stop_rocalution();

    return 0;
}

Parameters
• [in] rank: specifies MPI rank when multi-node environment
• [in] dev_per_node: number of accelerator devices per node, when in multi-GPU environment

int rocalution::stop_rocalution(void)
Shutdown rocALUTION platform. stop_rocalution shuts down the rocALUTION platform.

void rocalution::set_device_rocalution(int dev)
Set the accelerator device. set_device_rocalution lets the user select the accelerator device that is supposed to be used for the computation.

Parameters
• [in] dev: accelerator device ID for computation

void rocalution::set_omp_threads_rocalution(int nthreads)
Set number of OpenMP threads. The number of threads which rocALUTION will use can be set with set_omp_threads_rocalution or by the global OpenMP environment variable (for Unix-like OS this is OMP_NUM_THREADS). During the initialization phase, the library provides affinity thread-core mapping:
• If the number of cores (including SMT cores) is greater or equal than two times the number of threads, then all the threads can occupy every second core ID (e.g. 0, 2, 4, ...). This is to avoid having two threads working on the same physical core, when SMT is enabled.
• If the number of threads is less or equal to the number of cores (including SMT), and the previous clause is false, then the threads can occupy every core ID (e.g. 0, 1, 2, 3, ...).
• If none of the above criteria is matched, then the default thread-core mapping is used (typically set by the OS).

Note The thread-core mapping is available only for Unix-like OS.
Note The user can disable the thread affinity by calling set_omp_affinity_rocalution(), before initializing the library (i.e. before init_rocalution()).
Parameters
• [in] nthreads: number of OpenMP threads
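A minimal sketch of the required call order (the thread count is only an illustrative value):

init_rocalution();
set_omp_threads_rocalution(4);   // must be called after init_rocalution()

// ... build operators, vectors and solvers ...

stop_rocalution();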

214 Chapter 2. Solid Compilation Foundation and Language Support ReadTheDocs-Breathe Documentation, Release 1.0.0

void rocalution::set_omp_affinity_rocalution(bool affinity)
Enable/disable OpenMP host affinity. set_omp_affinity_rocalution enables / disables OpenMP host affinity.

Parameters
• [in] affinity: boolean to turn on/off OpenMP host affinity

void rocalution::set_omp_threshold_rocalution(int threshold)
Set OpenMP threshold size. Whenever you want to work on a small problem, you might observe that the OpenMP host backend is (slightly) slower than using no OpenMP. This is mainly attributed to the small amount of work which every thread should perform and the large overhead of forking/joining threads. This can be avoided by the OpenMP threshold size parameter in rocALUTION. The default threshold is set to 10000, which means that all matrices under (and equal to) this size will use only one thread (disregarding the number of OpenMP threads set in the system). The threshold can be modified with set_omp_threshold_rocalution.

Parameters
• [in] threshold: OpenMP threshold size

void rocalution::info_rocalution(void)
Print info about rocALUTION. info_rocalution prints information about the rocALUTION platform.

void rocalution::info_rocalution(const struct Rocalution_Backend_Descriptor backend_descriptor)
Print info about specific rocALUTION backend descriptor. info_rocalution prints information about the rocALUTION platform of the specific backend descriptor.

Parameters
• [in] backend_descriptor: rocALUTION backend descriptor

void rocalution::disable_accelerator_rocalution(bool onoff = true)
Disable/Enable the accelerator. If you want to disable the accelerator (without re-compiling the code), you need to call disable_accelerator_rocalution before init_rocalution().

Parameters
• [in] onoff: boolean to turn on/off the accelerator
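A minimal sketch of the required call order when disabling the accelerator:

disable_accelerator_rocalution();   // must precede init_rocalution()
init_rocalution();

// ... all objects now stay on the host backend ...

stop_rocalution();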

void rocalution::_rocalution_sync(void) Sync rocALUTION. _rocalution_sync blocks the host until all active asynchronous transfers are completed.

2.9.7.12.3 Base Rocalution

template <typename ValueType> class BaseRocalution : public rocalution::RocalutionObj
Base class for all operators and vectors.


Template Parameters
• ValueType: can be int, float, double, std::complex<float> and std::complex<double>

Subclassed by rocalution::Operator< ValueType >, rocalution::Vector< ValueType >

virtual void rocalution::BaseRocalution::MoveToAccelerator(void) = 0
Move the object to the accelerator backend.

virtual void rocalution::BaseRocalution::MoveToHost(void) = 0
Move the object to the host backend.

virtual void rocalution::BaseRocalution::MoveToAcceleratorAsync(void)
Move the object to the accelerator backend with async move.

virtual void rocalution::BaseRocalution::MoveToHostAsync(void)
Move the object to the host backend with async move.

virtual void rocalution::BaseRocalution::Sync(void)
Sync (the async move)

virtual void rocalution::BaseRocalution::CloneBackend(const BaseRocalution<ValueType> &src)
Clone the Backend descriptor from another object. With CloneBackend, the backend can be cloned without copying any data. This is especially useful, if several objects should reside on the same backend, but keep their original data.

Example

LocalVector<ValueType> vec;
LocalMatrix<ValueType> mat;

// Allocate and initialize vec and mat
// ...

LocalVector<ValueType> tmp;
// By cloning backend, tmp and vec will have the same backend as mat
tmp.CloneBackend(mat);
vec.CloneBackend(mat);

// The following matrix vector multiplication will be performed on the backend
// selected in mat
mat.Apply(vec, &tmp);

Parameters
• [in] src: Object, where the backend should be cloned from.

virtual void rocalution::BaseRocalution::Info(void) const = 0
Print object information. Info can print object information about any rocALUTION object. This information consists of object properties and backend data.

Example

mat.Info(); vec.Info();


virtual void rocalution::BaseRocalution::Clear(void) = 0 Clear (free all data) the object.

2.9.7.12.4 Operator

template <typename ValueType> class Operator : public rocalution::BaseRocalution<ValueType>
Operator class. The Operator class defines the generic interface for applying an operator (e.g. matrix or stencil) from/to global and local vectors.

Template Parameters
• ValueType: can be int, float, double, std::complex<float> and std::complex<double>

Subclassed by rocalution::GlobalMatrix< ValueType >, rocalution::LocalMatrix< ValueType >, rocalution::LocalStencil< ValueType >

virtual IndexType2 rocalution::Operator::GetM(void) const = 0
Return the number of rows in the matrix/stencil.

virtual IndexType2 rocalution::Operator::GetN(void) const = 0
Return the number of columns in the matrix/stencil.

virtual IndexType2 rocalution::Operator::GetNnz(void) const = 0
Return the number of non-zeros in the matrix/stencil.

virtual int rocalution::Operator::GetLocalM(void) const
Return the number of rows in the local matrix/stencil.

virtual int rocalution::Operator::GetLocalN(void) const
Return the number of columns in the local matrix/stencil.

virtual int rocalution::Operator::GetLocalNnz(void) const
Return the number of non-zeros in the local matrix/stencil.

virtual int rocalution::Operator::GetGhostM(void) const
Return the number of rows in the ghost matrix/stencil.

virtual int rocalution::Operator::GetGhostN(void) const
Return the number of columns in the ghost matrix/stencil.

virtual int rocalution::Operator::GetGhostNnz(void) const
Return the number of non-zeros in the ghost matrix/stencil.

virtual void rocalution::Operator::Apply(const LocalVector<ValueType> &in, LocalVector<ValueType> *out) const
Apply the operator, out = Operator(in), where in and out are local vectors.

virtual void rocalution::Operator::ApplyAdd(const LocalVector<ValueType> &in, ValueType scalar, LocalVector<ValueType> *out) const
Apply and add the operator, out += scalar * Operator(in), where in and out are local vectors.

virtual void rocalution::Operator::Apply(const GlobalVector<ValueType> &in, GlobalVector<ValueType> *out) const
Apply the operator, out = Operator(in), where in and out are global vectors.


virtual void rocalution::Operator::ApplyAdd(const GlobalVector<ValueType> &in, ValueType scalar, GlobalVector<ValueType> *out) const
Apply and add the operator, out += scalar * Operator(in), where in and out are global vectors.
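A minimal sketch of Apply and ApplyAdd; mat, x and y are assumed to be an already allocated LocalMatrix<double> and two LocalVector<double> objects with matching dimensions.

mat.Apply(x, &y);            // y = mat * x
mat.ApplyAdd(x, 2.0, &y);    // y += 2.0 * (mat * x)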

2.9.7.12.5 Vector

template <typename ValueType> class Vector : public rocalution::BaseRocalution<ValueType>
Vector class. The Vector class defines the generic interface for local and global vectors.

Template Parameters
• ValueType: can be int, float, double, std::complex<float> and std::complex<double>

Subclassed by rocalution::LocalVector< int >, rocalution::GlobalVector< ValueType >, rocalution::LocalVector< ValueType >

virtual IndexType2 rocalution::Vector::GetSize(void) const = 0
Return the size of the vector.

virtual int rocalution::Vector::GetLocalSize(void) const
Return the size of the local vector.

virtual int rocalution::Vector::GetGhostSize(void) const
Return the size of the ghost vector.

virtual bool rocalution::Vector::Check(void) const = 0
Perform a sanity check of the vector. Checks, if the vector contains valid data, i.e. if the values are not infinity and not NaN (not a number).

Return Value
• true: if the vector is ok (empty vector is also ok).
• false: if there is something wrong with the values.

virtual void rocalution::Vector::Zeros(void) = 0
Set all values of the vector to 0.

virtual void rocalution::Vector::Ones(void) = 0
Set all values of the vector to 1.

virtual void rocalution::Vector::SetValues(ValueType val) = 0
Set all values of the vector to given argument.

virtual void rocalution::Vector::SetRandomUniform(unsigned long long seed, ValueType a = static_cast<ValueType>(-1), ValueType b = static_cast<ValueType>(1)) = 0
Fill the vector with random values from interval [a,b].

virtual void rocalution::Vector::SetRandomNormal(unsigned long long seed, ValueType mean = static_cast<ValueType>(0), ValueType var = static_cast<ValueType>(1)) = 0
Fill the vector with random values from normal distribution.
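A minimal sketch of the value-setting and sanity-check methods documented above; vec is assumed to be an already allocated LocalVector<double>.

vec.Zeros();                              // all entries 0
vec.Ones();                               // all entries 1
vec.SetValues(4.2);                       // all entries 4.2
vec.SetRandomUniform(1234ULL, -1.0, 1.0); // uniform random values in [-1, 1]

if(vec.Check() == false)
{
    // invalid entries (inf / NaN) detected
}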


virtual void rocalution::Vector::ReadFileASCII(const std::string filename) = 0 Read vector from ASCII file. Read a vector from ASCII file.

Example

LocalVector vec; vec.ReadFileASCII("my_vector.dat");

Parameters • [in] filename: name of the file containing the ASCII data. virtual void rocalution::Vector::WriteFileASCII(const std::string filename) const = 0 Write vector to ASCII file. Write a vector to ASCII file.

Example

LocalVector vec;

// Allocate and fill vec //...

vec.WriteFileASCII("my_vector.dat");

Parameters • [in] filename: name of the file to write the ASCII data to. virtual void rocalution::Vector::ReadFileBinary(const std::string filename) = 0 Read vector from binary file. Read a vector from binary file. For details on the format, see WriteFileBinary().

Example

LocalVector vec; vec.ReadFileBinary("my_vector.bin");

Parameters • [in] filename: name of the file containing the data. virtual void rocalution::Vector::WriteFileBinary(const std::string filename) const = 0 Write vector to binary file. Write a vector to binary file. The binary format contains a header, the rocALUTION version and the vector data as follows

// Header
out << "#rocALUTION binary vector file" << std::endl;

// rocALUTION version
out.write((char*)&version, sizeof(int));

// Vector data
out.write((char*)&size, sizeof(int));
out.write((char*)vec_val, size * sizeof(double));

Note Vector values array is always stored in double precision (e.g. double or std::complex<double>). Example

LocalVector vec;

// Allocate and fill vec //...

vec.WriteFileBinary("my_vector.bin");

Parameters • [in] filename: name of the file to write the data to. virtual void rocalution::Vector::CopyFrom(const LocalVector &src) Copy vector from another vector. CopyFrom copies values from another vector.

Note
This function allows cross platform copying. One of the objects could be allocated on the accelerator backend.

Example

LocalVector<ValueType> vec1, vec2;

// Allocate and initialize vec1 and vec2
// ...

// Move vec1 to accelerator
// vec1.MoveToAccelerator();

// Now, vec1 is on the accelerator (if available)
// and vec2 is on the host

// Copy vec1 to vec2 (or vice versa) will move data between host and
// accelerator backend
vec1.CopyFrom(vec2);

Parameters
• [in] src: Vector, where values should be copied from.

virtual void rocalution::Vector::CopyFrom(const GlobalVector &src)
Copy vector from another vector. CopyFrom copies values from another vector.

Note
This function allows cross platform copying. One of the objects could be allocated on the accelerator backend.

Example

LocalVector<ValueType> vec1, vec2;

// Allocate and initialize vec1 and vec2
// ...

// Move vec1 to accelerator
// vec1.MoveToAccelerator();

// Now, vec1 is on the accelerator (if available)
// and vec2 is on the host

// Copy vec1 to vec2 (or vice versa) will move data between host and
// accelerator backend
vec1.CopyFrom(vec2);

Parameters
• [in] src: Vector, where values should be copied from.

virtual void rocalution::Vector::CopyFromAsync(const LocalVector &src)
Async copy from another local vector.

virtual void rocalution::Vector::CopyFromFloat(const LocalVector<float> &src)
Copy values from another local float vector.

virtual void rocalution::Vector::CopyFromDouble(const LocalVector &src)
Copy values from another local double vector.

virtual void rocalution::Vector::CopyFrom(const LocalVector &src, int src_offset, int dst_offset, int size)
Copy vector from another vector with offsets and size. CopyFrom copies values with specific source and destination offsets and sizes from another vector.

Note
This function allows cross platform copying. One of the objects could be allocated on the accelerator backend.

Parameters
• [in] src: Vector, where values should be copied from.
• [in] src_offset: source offset.
• [in] dst_offset: destination offset.
• [in] size: number of entries to be copied.

virtual void rocalution::Vector::CloneFrom(const LocalVector &src)
Clone the vector. CloneFrom clones the entire vector, with data and backend descriptor, from another Vector.

Example

LocalVector<ValueType> vec;

// Allocate and initialize vec (host or accelerator)
// ...

LocalVector<ValueType> tmp;

// By cloning vec, tmp will have identical values and will be on the same
// backend as vec
tmp.CloneFrom(vec);

Parameters
• [in] src: Vector to clone from.

virtual void rocalution::Vector::CloneFrom(const GlobalVector &src)
Clone the vector. CloneFrom clones the entire vector, with data and backend descriptor, from another Vector.

Example

LocalVector<ValueType> vec;

// Allocate and initialize vec (host or accelerator)
// ...

LocalVector<ValueType> tmp;

// By cloning vec, tmp will have identical values and will be on the same
// backend as vec
tmp.CloneFrom(vec);

Parameters
• [in] src: Vector to clone from.

virtual void rocalution::Vector::AddScale(const LocalVector &x, ValueType alpha)
Perform vector update of type this = this + alpha * x.

virtual void rocalution::Vector::AddScale(const GlobalVector &x, ValueType alpha)
Perform vector update of type this = this + alpha * x.

virtual void rocalution::Vector::ScaleAdd(ValueType alpha, const LocalVector &x)
Perform vector update of type this = alpha * this + x.

virtual void rocalution::Vector::ScaleAdd(ValueType alpha, const GlobalVector &x)
Perform vector update of type this = alpha * this + x.

virtual void rocalution::Vector::ScaleAddScale(ValueType alpha, const LocalVector &x, ValueType beta)
Perform vector update of type this = alpha * this + x * beta.

virtual void rocalution::Vector::ScaleAddScale(ValueType alpha, const GlobalVector &x, ValueType beta)
Perform vector update of type this = alpha * this + x * beta.

virtual void rocalution::Vector::ScaleAddScale(ValueType alpha, const LocalVector &x, ValueType beta, int src_offset, int dst_offset, int size)
Perform vector update of type this = alpha * this + x * beta with offsets.

virtual void rocalution::Vector::ScaleAddScale(ValueType alpha, const GlobalVector &x, ValueType beta, int src_offset, int dst_offset, int size)
Perform vector update of type this = alpha * this + x * beta with offsets.

virtual void rocalution::Vector::ScaleAdd2(ValueType alpha, const LocalVector &x, ValueType beta, const LocalVector &y, ValueType gamma)
Perform vector update of type this = alpha * this + x * beta + y * gamma.

virtual void rocalution::Vector::ScaleAdd2(ValueType alpha, const GlobalVector &x, ValueType beta, const GlobalVector &y, ValueType gamma)
Perform vector update of type this = alpha * this + x * beta + y * gamma.

virtual void rocalution::Vector::Scale(ValueType alpha) = 0
Perform vector scaling this = alpha * this.

virtual ValueType rocalution::Vector::Dot(const LocalVector &x) const
Compute dot (scalar) product, return = this^T x.

virtual ValueType rocalution::Vector::Dot(const GlobalVector &x) const
Compute dot (scalar) product, return = this^T x.

virtual ValueType rocalution::Vector::DotNonConj(const LocalVector &x) const
Compute non-conjugate dot (scalar) product, return = this^T x.

virtual ValueType rocalution::Vector::DotNonConj(const GlobalVector &x) const
Compute non-conjugate dot (scalar) product, return = this^T x.

virtual ValueType rocalution::Vector::Norm(void) const = 0
Compute the L2 norm of the vector, return = sqrt(this^T this).

virtual ValueType rocalution::Vector::Reduce(void) const = 0
Reduce the vector.

virtual ValueType rocalution::Vector::Asum(void) const = 0
Compute the sum of the absolute values of the vector, return = sum(|this|).

virtual int rocalution::Vector::Amax(ValueType &value) const = 0
Compute the absolute maximum of the vector, return = index(max(|this|)).

virtual void rocalution::Vector::PointWiseMult(const LocalVector &x)
Perform point-wise (element-wise) multiplication, this = this * x.

virtual void rocalution::Vector::PointWiseMult(const GlobalVector &x)
Perform point-wise (element-wise) multiplication, this = this * x.

virtual void rocalution::Vector::PointWiseMult(const LocalVector &x, const LocalVector &y)
Perform point-wise (element-wise) multiplication, this = x * y.

virtual void rocalution::Vector::PointWiseMult(const GlobalVector &x, const GlobalVector &y)
Perform point-wise (element-wise) multiplication, this = x * y.

virtual void rocalution::Vector::Power(double power) = 0
Perform power operation on the vector.
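
A short sketch combining the update and reduction routines above on two host-side, double-precision LocalVector objects of equal size:

LocalVector<double> x, y;

x.Allocate("x", 100);
y.Allocate("y", 100);

x.Ones();
y.SetValues(2.0);

// x = x + 0.5 * y
x.AddScale(y, 0.5);

// x = 0.25 * x + y
x.ScaleAdd(0.25, y);

double dot  = x.Dot(y);    // scalar product x^T y
double nrm2 = x.Norm();    // L2 norm of x
double asum = x.Asum();    // sum of absolute values of x

// x = x * y (element-wise)
x.PointWiseMult(y);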


2.9.7.12.6 Local Matrix

template <typename ValueType>
class LocalMatrix : public rocalution::Operator
LocalMatrix class. A LocalMatrix is called local, because it will always stay on a single system. The system can contain several CPUs via UMA or NUMA memory system or it can contain an accelerator.

Template Parameters
• ValueType: can be int, float, double, std::complex<float> or std::complex<double>

unsigned int rocalution::LocalMatrix::GetFormat(void) const
Return the matrix format id (see matrix_formats.hpp).

bool rocalution::LocalMatrix::Check(void) const
Perform a sanity check of the matrix. Checks if the matrix contains valid data, i.e. if the values are not infinity and not NaN (not a number) and if the structure of the matrix is correct (e.g. indices cannot be negative, CSR and COO matrices have to be sorted, etc.).

Return Value
• true: if the matrix is ok (an empty matrix is also ok).
• false: if there is something wrong with the structure or values.

void rocalution::LocalMatrix::AllocateCSR(const std::string name, int nnz, int nrow, int ncol)
void rocalution::LocalMatrix::AllocateBCSR(void)
void rocalution::LocalMatrix::AllocateMCSR(const std::string name, int nnz, int nrow, int ncol)
void rocalution::LocalMatrix::AllocateCOO(const std::string name, int nnz, int nrow, int ncol)
void rocalution::LocalMatrix::AllocateDIA(const std::string name, int nnz, int nrow, int ncol, int ndiag)
void rocalution::LocalMatrix::AllocateELL(const std::string name, int nnz, int nrow, int ncol, int max_row)
void rocalution::LocalMatrix::AllocateHYB(const std::string name, int ell_nnz, int coo_nnz, int ell_max_row, int nrow, int ncol)
void rocalution::LocalMatrix::AllocateDENSE(const std::string name, int nrow, int ncol)
Allocate a local matrix with name and sizes. The local matrix allocation functions require a name for the object (for information purposes only) and the corresponding number of non-zero elements, number of rows and number of columns. Depending on the matrix format, additional parameters are required.

Example

LocalMatrix<ValueType> mat;

mat.AllocateCSR("my CSR matrix", 456, 100, 100);
mat.Clear();

mat.AllocateCOO("my COO matrix", 200, 100, 100);
mat.Clear();

void rocalution::LocalMatrix::SetDataPtrCOO(int **row, int **col, ValueType **val, std::string name, int nnz, int nrow, int ncol)
void rocalution::LocalMatrix::SetDataPtrCSR(int **row_offset, int **col, ValueType **val, std::string name, int nnz, int nrow, int ncol)
void rocalution::LocalMatrix::SetDataPtrMCSR(int **row_offset, int **col, ValueType **val, std::string name, int nnz, int nrow, int ncol)
void rocalution::LocalMatrix::SetDataPtrELL(int **col, ValueType **val, std::string name, int nnz, int nrow, int ncol, int max_row)
void rocalution::LocalMatrix::SetDataPtrDIA(int **offset, ValueType **val, std::string name, int nnz, int nrow, int ncol, int num_diag)
void rocalution::LocalMatrix::SetDataPtrDENSE(ValueType **val, std::string name, int nrow, int ncol)
Initialize a LocalMatrix on the host with externally allocated data. The SetDataPtr functions have direct access to the raw data via pointers. Already allocated data can be set by passing their pointers.

Note
Setting the data pointers will leave the original pointers empty (set to NULL).

Example

// Allocate a CSR matrix
int* csr_row_ptr   = new int[100 + 1];
int* csr_col_ind   = new int[345];
ValueType* csr_val = new ValueType[345];

// Fill the CSR matrix
// ...

// rocALUTION local matrix object
LocalMatrix<ValueType> mat;

// Set the CSR matrix data; csr_row_ptr, csr_col_ind and csr_val pointers
// become invalid
mat.SetDataPtrCSR(&csr_row_ptr, &csr_col_ind, &csr_val, "my_matrix", 345, 100, 100);

void rocalution::LocalMatrix::LeaveDataPtrCOO(int **row, int **col, ValueType **val)
void rocalution::LocalMatrix::LeaveDataPtrCSR(int **row_offset, int **col, ValueType **val)
void rocalution::LocalMatrix::LeaveDataPtrMCSR(int **row_offset, int **col, ValueType **val)
void rocalution::LocalMatrix::LeaveDataPtrELL(int **col, ValueType **val, int &max_row)
void rocalution::LocalMatrix::LeaveDataPtrDIA(int **offset, ValueType **val, int &num_diag)
void rocalution::LocalMatrix::LeaveDataPtrDENSE(ValueType **val)
Leave a LocalMatrix to host pointers. The LeaveDataPtr functions have direct access to the raw data via pointers. A LocalMatrix object can leave its raw data to host pointers. This will leave the LocalMatrix empty.

Example

// rocALUTION CSR matrix object
LocalMatrix<ValueType> mat;

// Allocate the CSR matrix
mat.AllocateCSR("my_matrix", 345, 100, 100);

// Fill CSR matrix
// ...

int* csr_row_ptr   = NULL;
int* csr_col_ind   = NULL;
ValueType* csr_val = NULL;

// Get (steal) the data from the matrix, this will leave the local matrix
// object empty
mat.LeaveDataPtrCSR(&csr_row_ptr, &csr_col_ind, &csr_val);

void rocalution::LocalMatrix::Zeros(void)
Set all matrix values to zero.

void rocalution::LocalMatrix::Scale(ValueType alpha)
Scale all values in the matrix.

void rocalution::LocalMatrix::ScaleDiagonal(ValueType alpha)
Scale the diagonal entries of the matrix with alpha; all diagonal elements must exist.

void rocalution::LocalMatrix::ScaleOffDiagonal(ValueType alpha)
Scale the off-diagonal entries of the matrix with alpha; all diagonal elements must exist.

void rocalution::LocalMatrix::AddScalar(ValueType alpha)
Add a scalar to all matrix values.

void rocalution::LocalMatrix::AddScalarDiagonal(ValueType alpha)
Add alpha to the diagonal entries of the matrix; all diagonal elements must exist.

void rocalution::LocalMatrix::AddScalarOffDiagonal(ValueType alpha)
Add alpha to the off-diagonal entries of the matrix; all diagonal elements must exist.
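
A brief sketch of these scaling routines on an already allocated and filled double-precision matrix; mat is assumed to hold valid data with a fully populated diagonal:

LocalMatrix<double> mat;

// Allocate and fill mat (all diagonal entries present)
// ...

mat.Scale(2.0);                 // A := 2 * A
mat.ScaleDiagonal(0.5);         // halve the diagonal entries only
mat.AddScalarDiagonal(1.0);     // shift the diagonal entries by 1
mat.AddScalarOffDiagonal(-0.1); // shift only the off-diagonal entries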


void rocalution::LocalMatrix::ExtractSubMatrix(int row_offset, int col_offset, int row_size, int col_size, LocalMatrix *mat) const
Extract a sub-matrix with row/col_offset and row/col_size.

void rocalution::LocalMatrix::ExtractSubMatrices(int row_num_blocks, int col_num_blocks, const int *row_offset, const int *col_offset, LocalMatrix ***mat) const
Extract an array of non-overlapping sub-matrices (row/col_num_blocks define the blocks for rows/columns; row/col_offset have sizes col/row_num_blocks+1, where [i+1]-[i] defines the i-th size of the sub-matrix).

void rocalution::LocalMatrix::ExtractDiagonal(LocalVector *vec_diag) const
Extract the diagonal values of the matrix into a LocalVector.

void rocalution::LocalMatrix::ExtractInverseDiagonal(LocalVector *vec_inv_diag) const
Extract the inverse (reciprocal) diagonal values of the matrix into a LocalVector.

void rocalution::LocalMatrix::ExtractU(LocalMatrix *U, bool diag) const
Extract the upper triangular matrix.

void rocalution::LocalMatrix::ExtractL(LocalMatrix *L, bool diag) const
Extract the lower triangular matrix.

void rocalution::LocalMatrix::Permute(const LocalVector &permutation)
Perform (forward) permutation of the matrix.

void rocalution::LocalMatrix::PermuteBackward(const LocalVector &permutation)
Perform (backward) permutation of the matrix.

void rocalution::LocalMatrix::CMK(LocalVector *permutation) const
Create a permutation vector for CMK reordering of the matrix. The Cuthill-McKee ordering minimizes the bandwidth of a given sparse matrix.

Example

LocalVector<int> cmk;

mat.CMK(&cmk);
mat.Permute(cmk);

Parameters
• [out] permutation: permutation vector for CMK reordering

void rocalution::LocalMatrix::RCMK(LocalVector *permutation) const
Create a permutation vector for reverse CMK reordering of the matrix. The Reverse Cuthill-McKee ordering minimizes the bandwidth of a given sparse matrix.

Example

LocalVector<int> rcmk;

mat.RCMK(&rcmk);
mat.Permute(rcmk);

Parameters
• [out] permutation: permutation vector for reverse CMK reordering

void rocalution::LocalMatrix::ConnectivityOrder(LocalVector *permutation) const
Create a permutation vector for connectivity reordering of the matrix. Connectivity ordering returns a permutation that sorts the matrix by non-zero entries per row.

Example

LocalVector<int> conn;

mat.ConnectivityOrder(&conn);
mat.Permute(conn);

Parameters
• [out] permutation: permutation vector for connectivity reordering

void rocalution::LocalMatrix::MultiColoring(int &num_colors, int **size_colors, LocalVector *permutation) const
Perform multi-coloring decomposition of the matrix. The multi-coloring algorithm builds a permutation (coloring of the matrix) such that no two adjacent nodes in the sparse matrix have the same color.

Example

LocalVector<int> mc;
int num_colors;
int* block_colors = NULL;

mat.MultiColoring(num_colors, &block_colors, &mc);
mat.Permute(mc);

Parameters
• [out] num_colors: number of colors
• [out] size_colors: pointer to an array that holds the number of nodes for each color
• [out] permutation: permutation vector for multi-coloring reordering

void rocalution::LocalMatrix::MaximalIndependentSet(int &size, LocalVector *permutation) const
Perform maximal independent set decomposition of the matrix. The maximal independent set algorithm finds a set of maximal size that contains elements that do not depend on other elements in this set.

Example

LocalVector<int> mis;
int size;

mat.MaximalIndependentSet(size, &mis);
mat.Permute(mis);

Parameters
• [out] size: number of independent sets
• [out] permutation: permutation vector for maximal independent set reordering

void rocalution::LocalMatrix::ZeroBlockPermutation(int &size, LocalVector *permutation) const
Return a permutation for saddle-point problems (zero diagonal entries). For saddle-point problems, i.e. matrices with zero diagonal entries, the zero block permutation maps all zero-diagonal elements to the last block of the matrix.

Example

LocalVector<int> zbp;
int size;

mat.ZeroBlockPermutation(size, &zbp);
mat.Permute(zbp);

Parameters
• [out] size:
• [out] permutation: permutation vector for zero block permutation

void rocalution::LocalMatrix::ILU0Factorize(void)
Perform ILU(0) factorization.

void rocalution::LocalMatrix::LUFactorize(void)
Perform LU factorization.

void rocalution::LocalMatrix::ILUTFactorize(double t, int maxrow)
Perform ILU(t,m) factorization based on a threshold and a maximum number of elements per row.

void rocalution::LocalMatrix::ILUpFactorize(int p, bool level = true)
Perform ILU(p) factorization based on power.

void rocalution::LocalMatrix::LUAnalyse(void)
Analyse the structure (level-scheduling).

void rocalution::LocalMatrix::LUAnalyseClear(void)
Delete the analysed data (see LUAnalyse()).

void rocalution::LocalMatrix::LUSolve(const LocalVector &in, LocalVector *out) const
Solve LU out = in; if a level-scheduling algorithm is provided, the graph traversal is performed in parallel.

void rocalution::LocalMatrix::ICFactorize(LocalVector *inv_diag)
Perform IC(0) factorization.

void rocalution::LocalMatrix::LLAnalyse(void)
Analyse the structure (level-scheduling).

void rocalution::LocalMatrix::LLAnalyseClear(void)
Delete the analysed data (see LLAnalyse()).

void rocalution::LocalMatrix::LLSolve(const LocalVector &in, LocalVector *out) const
Solve LL^T out = in; if a level-scheduling algorithm is provided, the graph traversal is performed in parallel.

void rocalution::LocalMatrix::LLSolve(const LocalVector &in, const LocalVector &inv_diag, LocalVector *out) const
Solve LL^T out = in; if a level-scheduling algorithm is provided, the graph traversal is performed in parallel.
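
A minimal sketch of the in-place ILU(0) workflow described above, applied as a single triangular sweep on a host matrix; mat, rhs and x are assumed to be allocated double-precision objects of matching size (in a real application these routines are usually driven through the preconditioner classes instead):

LocalMatrix<double> mat;
LocalVector<double> rhs, x;

// Allocate and fill mat and rhs, allocate x
// ...

// Factorize A ~= L*U in place (the sparsity pattern of A is kept)
mat.ILU0Factorize();

// Optional: build level-scheduling information for parallel triangular sweeps
mat.LUAnalyse();

// Apply the factorization: solve L*U * x = rhs
mat.LUSolve(rhs, &x);

// Release the analysis data when done
mat.LUAnalyseClear();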


void rocalution::LocalMatrix::LAnalyse(bool diag_unit = false)
Analyse the structure (level-scheduling), L-part.

• diag_unit == true: the diag is 1;
• diag_unit == false: the diag is 0;

void rocalution::LocalMatrix::LAnalyseClear(void)
Delete the analysed data (see LAnalyse()), L-part.

void rocalution::LocalMatrix::LSolve(const LocalVector &in, LocalVector *out) const
Solve L out = in; if a level-scheduling algorithm is provided, the graph traversal is performed in parallel.

void rocalution::LocalMatrix::UAnalyse(bool diag_unit = false)
Analyse the structure (level-scheduling), U-part.

• diag_unit == true: the diag is 1;
• diag_unit == false: the diag is 0;

void rocalution::LocalMatrix::UAnalyseClear(void)
Delete the analysed data (see UAnalyse()), U-part.

void rocalution::LocalMatrix::USolve(const LocalVector &in, LocalVector *out) const
Solve U out = in; if a level-scheduling algorithm is provided, the graph traversal is performed in parallel.

void rocalution::LocalMatrix::Householder(int idx, ValueType &beta, LocalVector *vec) const
Compute the Householder vector.

void rocalution::LocalMatrix::QRDecompose(void)
QR decomposition.

void rocalution::LocalMatrix::QRSolve(const LocalVector &in, LocalVector *out) const
Solve QR out = in.

void rocalution::LocalMatrix::Invert(void)
Matrix inversion using QR decomposition.

void rocalution::LocalMatrix::ReadFileMTX(const std::string filename)
Read matrix from MTX (Matrix Market Format) file.

Example

LocalMatrix<ValueType> mat;
mat.ReadFileMTX("my_matrix.mtx");

Parameters
• [in] filename: name of the file containing the MTX data.

void rocalution::LocalMatrix::WriteFileMTX(const std::string filename) const
Write matrix to MTX (Matrix Market Format) file.

Example

LocalMatrix<ValueType> mat;

// Allocate and fill mat
// ...

mat.WriteFileMTX("my_matrix.mtx");

Parameters
• [in] filename: name of the file to write the MTX data to.

void rocalution::LocalMatrix::ReadFileCSR(const std::string filename)
Read matrix from CSR (rocALUTION binary format) file. For details on the format, see WriteFileCSR().

Example

LocalMatrix<ValueType> mat;
mat.ReadFileCSR("my_matrix.csr");

Parameters
• [in] filename: name of the file containing the data.

void rocalution::LocalMatrix::WriteFileCSR(const std::string filename) const
Write CSR matrix to binary file. The binary format contains a header, the rocALUTION version and the matrix data as follows:

// Header
out << "#rocALUTION binary csr file" << std::endl;

// rocALUTION version
out.write((char*)&version, sizeof(int));

// CSR matrix data
out.write((char*)&m, sizeof(int));
out.write((char*)&n, sizeof(int));
out.write((char*)&nnz, sizeof(int));
out.write((char*)csr_row_ptr, (m + 1) * sizeof(int));
out.write((char*)csr_col_ind, nnz * sizeof(int));
out.write((char*)csr_val, nnz * sizeof(double));

Note
The matrix values array is always stored in double precision (e.g. double or std::complex<double>).

Example

LocalMatrix<ValueType> mat;

// Allocate and fill mat
// ...

mat.WriteFileCSR("my_matrix.csr");

Parameters
• [in] filename: name of the file to write the data to.

void rocalution::LocalMatrix::CopyFrom(const LocalMatrix &src)
Copy matrix from another LocalMatrix. CopyFrom copies values and structure from another local matrix. Source and destination matrices should be in the same format.

Note
This function allows cross platform copying. One of the objects could be allocated on the accelerator backend.

Example

LocalMatrix<ValueType> mat1, mat2;

// Allocate and initialize mat1 and mat2
// ...

// Move mat1 to accelerator
// mat1.MoveToAccelerator();

// Now, mat1 is on the accelerator (if available)
// and mat2 is on the host

// Copy mat1 to mat2 (or vice versa) will move data between host and
// accelerator backend
mat1.CopyFrom(mat2);

Parameters
• [in] src: Local matrix where values and structure should be copied from.

void rocalution::LocalMatrix::CopyFromAsync(const LocalMatrix &src)
Async copy of the matrix (values and structure) from another LocalMatrix.

void rocalution::LocalMatrix::CloneFrom(const LocalMatrix &src)
Clone the matrix. CloneFrom clones the entire matrix, including values, structure and backend descriptor, from another LocalMatrix.

Example

LocalMatrix<ValueType> mat;

// Allocate and initialize mat (host or accelerator)
// ...

LocalMatrix<ValueType> tmp;

// By cloning mat, tmp will have identical values and structure and will be on
// the same backend as mat
tmp.CloneFrom(mat);

Parameters
• [in] src: LocalMatrix to clone from.

void rocalution::LocalMatrix::UpdateValuesCSR(ValueType *val)
Update CSR matrix entries only; the structure will remain the same.

void rocalution::LocalMatrix::CopyFromCSR(const int *row_offsets, const int *col, const ValueType *val)
Copy (import) a CSR matrix described in three arrays (offsets, columns, values). The object data has to be allocated (call AllocateCSR() first).

void rocalution::LocalMatrix::CopyToCSR(int *row_offsets, int *col, ValueType *val) const
Copy (export) a CSR matrix described in three arrays (offsets, columns, values). The output arrays have to be allocated.

void rocalution::LocalMatrix::CopyFromCOO(const int *row, const int *col, const ValueType *val)
Copy (import) a COO matrix described in three arrays (rows, columns, values). The object data has to be allocated (call AllocateCOO() first).

void rocalution::LocalMatrix::CopyToCOO(int *row, int *col, ValueType *val) const
Copy (export) a COO matrix described in three arrays (rows, columns, values). The output arrays have to be allocated.

void rocalution::LocalMatrix::CopyFromHostCSR(const int *row_offset, const int *col, const ValueType *val, const std::string name, int nnz, int nrow, int ncol)
Allocate and copy (import) a host CSR matrix. If the CSR matrix data pointers are only accessible as constant, the user can create a LocalMatrix object and pass const CSR host pointers. The LocalMatrix will then be allocated and the data will be copied to the corresponding backend, where the original object was located.

Parameters
• [in] row_offset: CSR matrix row offset pointers.
• [in] col: CSR matrix column indices.
• [in] val: CSR matrix values array.
• [in] name: Matrix object name.
• [in] nnz: Number of non-zero elements.
• [in] nrow: Number of rows.
• [in] ncol: Number of columns.

void rocalution::LocalMatrix::CreateFromMap(const LocalVector &map, int n, int m)
Create a restriction matrix operator based on an int vector map.

void rocalution::LocalMatrix::CreateFromMap(const LocalVector &map, int n, int m, LocalMatrix *pro)
Create a restriction and prolongation matrix operator based on an int vector map.

void rocalution::LocalMatrix::ConvertToCSR(void)
Convert the matrix to CSR structure.

void rocalution::LocalMatrix::ConvertToMCSR(void)
Convert the matrix to MCSR structure.

void rocalution::LocalMatrix::ConvertToBCSR(void)
Convert the matrix to BCSR structure.

void rocalution::LocalMatrix::ConvertToCOO(void)
Convert the matrix to COO structure.


void rocalution::LocalMatrix::ConvertToELL(void)
Convert the matrix to ELL structure.

void rocalution::LocalMatrix::ConvertToDIA(void)
Convert the matrix to DIA structure.

void rocalution::LocalMatrix::ConvertToHYB(void)
Convert the matrix to HYB structure.

void rocalution::LocalMatrix::ConvertToDENSE(void)
Convert the matrix to DENSE structure.

void rocalution::LocalMatrix::ConvertTo(unsigned int matrix_format)
Convert the matrix to the specified matrix ID format.

void rocalution::LocalMatrix::SymbolicPower(int p)
Perform symbolic computation (structure only) of |this|^p.

void rocalution::LocalMatrix::MatrixAdd(const LocalMatrix &mat, ValueType alpha = static_cast<ValueType>(1), ValueType beta = static_cast<ValueType>(1), bool structure = false)
Perform matrix addition, this = alpha*this + beta*mat;

• if structure == false, the sparsity pattern of the matrix is not changed;
• if structure == true, a new sparsity pattern is computed.

void rocalution::LocalMatrix::MatrixMult(const LocalMatrix &A, const LocalMatrix &B)
Multiply two matrices, this = A * B.

void rocalution::LocalMatrix::DiagonalMatrixMult(const LocalVector &diag)
Multiply the matrix with a diagonal matrix (stored in a LocalVector), as DiagonalMatrixMultR().

void rocalution::LocalMatrix::DiagonalMatrixMultL(const LocalVector &diag)
Multiply the matrix with a diagonal matrix (stored in a LocalVector), this = diag*this.

void rocalution::LocalMatrix::DiagonalMatrixMultR(const LocalVector &diag)
Multiply the matrix with a diagonal matrix (stored in a LocalVector), this = this*diag.

void rocalution::LocalMatrix::Gershgorin(ValueType &lambda_min, ValueType &lambda_max) const
Compute the spectrum approximation with the Gershgorin circle theorem.

void rocalution::LocalMatrix::Compress(double drop_off)
Delete all entries in the matrix for which abs(a_ij) <= drop_off; the diagonal elements are never deleted.

void rocalution::LocalMatrix::Transpose(void)
Transpose the matrix.

void rocalution::LocalMatrix::Sort(void)
Sort the matrix indices.
• For CSR matrices, column values are sorted.
• For COO matrices, row indices are sorted.
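
A short sketch combining the import, conversion and product routines above. The names csr_row_ptr, csr_col_ind and csr_val are placeholders for host CSR arrays describing a valid 100x100 matrix with 345 non-zeros that is only available through const pointers:

// csr_row_ptr, csr_col_ind and csr_val are assumed to be set up elsewhere

LocalMatrix<double> A, B, C;

// Import a host CSR matrix from const pointers; A is allocated internally
A.CopyFromHostCSR(csr_row_ptr, csr_col_ind, csr_val, "A", 345, 100, 100);

// Switch storage format if it suits the access pattern better, then back
A.ConvertToELL();
A.ConvertToCSR();

B.CloneFrom(A);

// C = A * B
C.MatrixMult(A, B);

// C = 1 * C + 0.5 * A, computing a new sparsity pattern
C.MatrixAdd(A, 1.0, 0.5, true);

// Cheap spectrum bounds via Gershgorin circles
double lambda_min, lambda_max;
C.Gershgorin(lambda_min, lambda_max);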


void rocalution::LocalMatrix::Key(long int &row_key, long int &col_key, long int &val_key) const
Compute a unique hash key for the matrix arrays. Typically, it is hard to compare whether two matrices have the same structure (and values). To do so, rocALUTION provides a keying function that generates three keys, for the row index, column index and values array.

Parameters
• [out] row_key: row index array key
• [out] col_key: column index array key
• [out] val_key: values array key

void rocalution::LocalMatrix::ReplaceColumnVector(int idx, const LocalVector &vec)
Replace a column vector of a matrix.

void rocalution::LocalMatrix::ReplaceRowVector(int idx, const LocalVector &vec)
Replace a row vector of a matrix.

void rocalution::LocalMatrix::ExtractColumnVector(int idx, LocalVector *vec) const
Extract values from a column of a matrix to a vector.

void rocalution::LocalMatrix::ExtractRowVector(int idx, LocalVector *vec) const
Extract values from a row of a matrix to a vector.

void rocalution::LocalMatrix::AMGConnect(ValueType eps, LocalVector *connections) const
Strong couplings for aggregation-based AMG.

void rocalution::LocalMatrix::AMGAggregate(const LocalVector &connections, LocalVector *aggregates) const
Plain aggregation - modification of a greedy aggregation scheme from Vanek (1996).

void rocalution::LocalMatrix::AMGSmoothedAggregation(ValueType relax, const LocalVector &aggregates, const LocalVector &connections, LocalMatrix *prolong, LocalMatrix *restrict) const
Interpolation scheme based on smoothed aggregation from Vanek (1996).

void rocalution::LocalMatrix::AMGAggregation(const LocalVector &aggregates, LocalMatrix *prolong, LocalMatrix *restrict) const
Aggregation-based interpolation scheme.

void rocalution::LocalMatrix::RugeStueben(ValueType eps, LocalMatrix *prolong, LocalMatrix *restrict) const
Ruge-Stueben coarsening.

void rocalution::LocalMatrix::FSAI(int power, const LocalMatrix *pattern)
Factorized Sparse Approximate Inverse assembly for a given system matrix power pattern or external sparsity pattern.

void rocalution::LocalMatrix::SPAI(void)
SParse Approximate Inverse assembly for a given system matrix pattern.
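
A minimal sketch of the keying function described above, used to check whether two already assembled matrices can be treated as identical; mat1 and mat2 are assumed to hold valid data:

LocalMatrix<double> mat1, mat2;

// Allocate and fill mat1 and mat2
// ...

long int row_key1, col_key1, val_key1;
long int row_key2, col_key2, val_key2;

mat1.Key(row_key1, col_key1, val_key1);
mat2.Key(row_key2, col_key2, val_key2);

// Matching keys indicate identical row, column and values arrays
if(row_key1 == row_key2 && col_key1 == col_key2 && val_key1 == val_key2)
{
    // mat1 and mat2 can be treated as the same matrix
}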


void rocalution::LocalMatrix::InitialPairwiseAggregation(ValueType beta, int &nc, LocalVector *G, int &Gsize, int **rG, int &rGsize, int ordering) const
Initial pairwise aggregation scheme.

void rocalution::LocalMatrix::InitialPairwiseAggregation(const LocalMatrix &mat, ValueType beta, int &nc, LocalVector *G, int &Gsize, int **rG, int &rGsize, int ordering) const
Initial pairwise aggregation scheme for split matrices.

void rocalution::LocalMatrix::FurtherPairwiseAggregation(ValueType beta, int &nc, LocalVector *G, int &Gsize, int **rG, int &rGsize, int ordering) const
Further pairwise aggregation scheme.

void rocalution::LocalMatrix::FurtherPairwiseAggregation(const LocalMatrix &mat, ValueType beta, int &nc, LocalVector *G, int &Gsize, int **rG, int &rGsize, int ordering) const
Further pairwise aggregation scheme for split matrices.

void rocalution::LocalMatrix::CoarsenOperator(LocalMatrix *Ac, int nrow, int ncol, const LocalVector &G, int Gsize, const int *rG, int rGsize) const
Build the coarse operator for the pairwise aggregation scheme.

2.9.7.12.7 Local Stencil

template <typename ValueType>
class LocalStencil : public rocalution::Operator
LocalStencil class. A LocalStencil is called local, because it will always stay on a single system. The system can contain several CPUs via UMA or NUMA memory system or it can contain an accelerator.

Template Parameters
• ValueType: can be int, float, double, std::complex<float> or std::complex<double>

rocalution::LocalStencil::LocalStencil(unsigned int type)
Initialize a local stencil with a type.

int rocalution::LocalStencil::GetNDim(void) const
Return the dimension of the stencil.

void rocalution::LocalStencil::SetGrid(int size)
Set the stencil grid size.


2.9.7.12.8 Global Matrix

template <typename ValueType>
class GlobalMatrix : public rocalution::Operator
GlobalMatrix class. A GlobalMatrix is called global, because it can stay on a single node or on multiple nodes in a network. For this type of communication, MPI is used.

Template Parameters
• ValueType: can be int, float, double, std::complex<float> or std::complex<double>

rocalution::GlobalMatrix::GlobalMatrix(const ParallelManager &pm)
Initialize a global matrix with a parallel manager.

virtual bool rocalution::GlobalMatrix::Check(void) const
Return true if the matrix is ok (an empty matrix is also ok) and false if there is something wrong with the structure or some of the values are NaN.

void rocalution::GlobalMatrix::AllocateCSR(std::string name, int local_nnz, int ghost_nnz)
Allocate a CSR matrix.

void rocalution::GlobalMatrix::AllocateCOO(std::string name, int local_nnz, int ghost_nnz)
Allocate a COO matrix.

void rocalution::GlobalMatrix::SetParallelManager(const ParallelManager &pm)
Set the parallel manager of the global matrix.

void rocalution::GlobalMatrix::SetDataPtrCSR(int **local_row_offset, int **local_col, ValueType **local_val, int **ghost_row_offset, int **ghost_col, ValueType **ghost_val, std::string name, int local_nnz, int ghost_nnz)
Initialize a CSR matrix on the host with externally allocated data.

void rocalution::GlobalMatrix::SetDataPtrCOO(int **local_row, int **local_col, ValueType **local_val, int **ghost_row, int **ghost_col, ValueType **ghost_val, std::string name, int local_nnz, int ghost_nnz)
Initialize a COO matrix on the host with externally allocated data.

void rocalution::GlobalMatrix::SetLocalDataPtrCSR(int **row_offset, int **col, ValueType **val, std::string name, int nnz)
Initialize a CSR matrix on the host with externally allocated local data.

void rocalution::GlobalMatrix::SetLocalDataPtrCOO(int **row, int **col, ValueType **val, std::string name, int nnz)
Initialize a COO matrix on the host with externally allocated local data.

void rocalution::GlobalMatrix::SetGhostDataPtrCSR(int **row_offset, int **col, ValueType **val, std::string name, int nnz)
Initialize a CSR matrix on the host with externally allocated ghost data.

void rocalution::GlobalMatrix::SetGhostDataPtrCOO(int **row, int **col, ValueType **val, std::string name, int nnz)
Initialize a COO matrix on the host with externally allocated ghost data.

void rocalution::GlobalMatrix::LeaveDataPtrCSR(int **local_row_offset, int **local_col, ValueType **local_val, int **ghost_row_offset, int **ghost_col, ValueType **ghost_val)


Leave a CSR matrix to host pointers.

void rocalution::GlobalMatrix::LeaveDataPtrCOO(int **local_row, int **local_col, ValueType **local_val, int **ghost_row, int **ghost_col, ValueType **ghost_val)
Leave a COO matrix to host pointers.

void rocalution::GlobalMatrix::LeaveLocalDataPtrCSR(int **row_offset, int **col, ValueType **val)
Leave a local CSR matrix to host pointers.

void rocalution::GlobalMatrix::LeaveLocalDataPtrCOO(int **row, int **col, ValueType **val)
Leave a local COO matrix to host pointers.

void rocalution::GlobalMatrix::LeaveGhostDataPtrCSR(int **row_offset, int **col, ValueType **val)
Leave a CSR ghost matrix to host pointers.

void rocalution::GlobalMatrix::LeaveGhostDataPtrCOO(int **row, int **col, ValueType **val)
Leave a COO ghost matrix to host pointers.

void rocalution::GlobalMatrix::CloneFrom(const GlobalMatrix &src)
Clone the entire matrix (values, structure and backend descriptor) from another GlobalMatrix.

void rocalution::GlobalMatrix::CopyFrom(const GlobalMatrix &src)
Copy the matrix (values and structure) from another GlobalMatrix.

void rocalution::GlobalMatrix::ConvertToCSR(void)
Convert the matrix to CSR structure.

void rocalution::GlobalMatrix::ConvertToMCSR(void)
Convert the matrix to MCSR structure.

void rocalution::GlobalMatrix::ConvertToBCSR(void)
Convert the matrix to BCSR structure.

void rocalution::GlobalMatrix::ConvertToCOO(void)
Convert the matrix to COO structure.

void rocalution::GlobalMatrix::ConvertToELL(void)
Convert the matrix to ELL structure.

void rocalution::GlobalMatrix::ConvertToDIA(void)
Convert the matrix to DIA structure.

void rocalution::GlobalMatrix::ConvertToHYB(void)
Convert the matrix to HYB structure.

void rocalution::GlobalMatrix::ConvertToDENSE(void)
Convert the matrix to DENSE structure.

void rocalution::GlobalMatrix::ConvertTo(unsigned int matrix_format)
Convert the matrix to the specified matrix ID format.

void rocalution::GlobalMatrix::ReadFileMTX(const std::string filename)
Read matrix from MTX (Matrix Market Format) file.

void rocalution::GlobalMatrix::WriteFileMTX(const std::string filename) const
Write matrix to MTX (Matrix Market Format) file.

void rocalution::GlobalMatrix::ReadFileCSR(const std::string filename)
Read matrix from CSR (rocALUTION binary format) file.

void rocalution::GlobalMatrix::WriteFileCSR(const std::string filename) const
Write matrix to CSR (rocALUTION binary format) file.

void rocalution::GlobalMatrix::Sort(void)
Sort the matrix indices.

void rocalution::GlobalMatrix::ExtractInverseDiagonal(GlobalVector *vec_inv_diag) const
Extract the inverse (reciprocal) diagonal values of the matrix into a GlobalVector.

void rocalution::GlobalMatrix::Scale(ValueType alpha)
Scale all the values in the matrix.

void rocalution::GlobalMatrix::InitialPairwiseAggregation(ValueType beta, int &nc, LocalVector *G, int &Gsize, int **rG, int &rGsize, int ordering) const
Initial pairwise aggregation scheme.

void rocalution::GlobalMatrix::FurtherPairwiseAggregation(ValueType beta, int &nc, LocalVector *G, int &Gsize, int **rG, int &rGsize, int ordering) const
Further pairwise aggregation scheme.

void rocalution::GlobalMatrix::CoarsenOperator(GlobalMatrix *Ac, ParallelManager *pm, int nrow, int ncol, const LocalVector &G, int Gsize, const int *rG, int rGsize) const
Build the coarse operator for the pairwise aggregation scheme.
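
A rough sketch of the MPI-side setup for a GlobalMatrix. The file names pm.dat and A.mtx are placeholders, the parallel manager file is assumed to match the matrix distribution, and the init_rocalution call with a rank argument follows the usual rocALUTION multi-node examples (details may differ between releases):

#include <rocalution.hpp>
#include <mpi.h>

using namespace rocalution;

int main(int argc, char* argv[])
{
    MPI_Init(&argc, &argv);
    MPI_Comm comm = MPI_COMM_WORLD;

    int rank;
    MPI_Comm_rank(comm, &rank);

    // Rank-aware platform initialization
    init_rocalution(rank);

    // Describe the distribution of rows across ranks
    ParallelManager pm;
    pm.SetMPICommunicator(&comm);
    pm.ReadFileASCII("pm.dat");

    // Global operator and vectors attached to the same parallel manager
    GlobalMatrix<double> A(pm);
    GlobalVector<double> x(pm), rhs(pm);

    A.ReadFileMTX("A.mtx");
    x.Allocate("x", pm.GetGlobalSize());
    rhs.Allocate("rhs", pm.GetGlobalSize());

    // ... set up a solver and solve A x = rhs ...

    stop_rocalution();
    MPI_Finalize();
    return 0;
}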

2.9.7.12.9 Local Vector

template <typename ValueType>
class LocalVector : public rocalution::Vector
LocalVector class. A LocalVector is called local, because it will always stay on a single system. The system can contain several CPUs via UMA or NUMA memory system or it can contain an accelerator.

Template Parameters
• ValueType: can be int, float, double, std::complex<float> or std::complex<double>

void rocalution::LocalVector::Allocate(std::string name, IndexType2 size)
Allocate a local vector with name and size. The local vector allocation function requires a name for the object (for information purposes only) and the corresponding size description for the vector object.

Example

LocalVector<ValueType> vec;

vec.Allocate("my vector", 100);
vec.Clear();


Parameters
• [in] name: object name
• [in] size: number of elements in the vector

void rocalution::LocalVector::SetDataPtr(ValueType **ptr, std::string name, int size)
Initialize a LocalVector on the host with externally allocated data. SetDataPtr has direct access to the raw data via pointers. Already allocated data can be set by passing the pointer.

Note
Setting the data pointer will leave the original pointer empty (set to NULL).

Example

// Allocate vector
ValueType* ptr_vec = new ValueType[200];

// Fill vector
// ...

// rocALUTION local vector object
LocalVector<ValueType> vec;

// Set the vector data, ptr_vec will become invalid
vec.SetDataPtr(&ptr_vec, "my_vector", 200);

void rocalution::LocalVector::LeaveDataPtr(ValueType **ptr)
Leave a LocalVector to host pointers. LeaveDataPtr has direct access to the raw data via pointers. A LocalVector object can leave its raw data to a host pointer. This will leave the LocalVector empty.

Example

// rocALUTION local vector object
LocalVector<ValueType> vec;

// Allocate the vector
vec.Allocate("my_vector", 100);

// Fill vector
// ...

ValueType* ptr_vec = NULL;

// Get (steal) the data from the vector, this will leave the local vector
// object empty
vec.LeaveDataPtr(&ptr_vec);

ValueType &rocalution::LocalVector::operator[](int i)
Access operator (only for host data). The elements in the vector can be accessed via the [] operator when the vector is allocated on the host.

Return
value at index i

Example

// rocALUTION local vector object
LocalVector<ValueType> vec;

// Allocate vector
vec.Allocate("my_vector", 100);

// Initialize vector with 1
vec.Ones();

// Set even elements to -1
for(int i = 0; i < vec.GetSize(); i += 2)
{
    vec[i] = -1;
}

Parameters
• [in] i: access data at index i

const ValueType &rocalution::LocalVector::operator[](int i) const
Access operator (only for host data). The elements in the vector can be accessed via the [] operator when the vector is allocated on the host.

Return
value at index i

Example

// rocALUTION local vector object
LocalVector<ValueType> vec;

// Allocate vector
vec.Allocate("my_vector", 100);

// Initialize vector with 1
vec.Ones();

// Set even elements to -1
for(int i = 0; i < vec.GetSize(); i += 2)
{
    vec[i] = -1;
}

Parameters
• [in] i: access data at index i

void rocalution::LocalVector::CopyFromPermute(const LocalVector &src, const LocalVector &permutation)
Copy a vector under permutation (forward permutation).

void rocalution::LocalVector::CopyFromPermuteBackward(const LocalVector &src, const LocalVector &permutation)
Copy a vector under permutation (backward permutation).

void rocalution::LocalVector::CopyFromData(const ValueType *data)
Copy (import) vector data that is described in one array (values). The object data has to be allocated first with Allocate(), using the corresponding size of the data.

Parameters
• [in] data: data to be imported.

void rocalution::LocalVector::CopyToData(ValueType *data) const
Copy (export) vector data that is described in one array (values). The output array has to be allocated first, using the corresponding size of the data. The size can be obtained by GetSize().

Parameters
• [out] data: exported data.

void rocalution::LocalVector::Permute(const LocalVector &permutation)
Perform in-place (forward) permutation of the vector.

void rocalution::LocalVector::PermuteBackward(const LocalVector &permutation)
Perform in-place (backward) permutation of the vector.

void rocalution::LocalVector::Restriction(const LocalVector &vec_fine, const LocalVector &map)
Restriction operator based on a restriction mapping vector.

void rocalution::LocalVector::Prolongation(const LocalVector &vec_coarse, const LocalVector &map)
Prolongation operator based on a restriction mapping vector.

void rocalution::LocalVector::SetIndexArray(int size, const int *index)
Set the index array.

void rocalution::LocalVector::GetIndexValues(ValueType *values) const
Get indexed values.

void rocalution::LocalVector::SetIndexValues(const ValueType *values)
Set indexed values.

void rocalution::LocalVector::GetContinuousValues(int start, int end, ValueType *values) const
Get continuously indexed values.

void rocalution::LocalVector::SetContinuousValues(int start, int end, const ValueType *values)
Set continuously indexed values.

void rocalution::LocalVector::ExtractCoarseMapping(int start, int end, const int *index, int nc, int *size, int *map) const
Extract the coarse boundary mapping.

void rocalution::LocalVector::ExtractCoarseBoundary(int start, int end, const int *index, int nc, int *size, int *boundary) const
Extract the coarse boundary index.
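
A small sketch of the raw import/export routines above on a host-side, double-precision vector; buf and out are hypothetical host arrays:

LocalVector<double> vec;
vec.Allocate("vec", 100);

double buf[100];
for(int i = 0; i < 100; ++i)
{
    buf[i] = static_cast<double>(i);
}

// Import: copy the raw array into the (already allocated) vector
vec.CopyFromData(buf);

// ... operate on vec ...
vec.Scale(2.0);

// Export: copy the vector back into a host array of size GetSize()
double out[100];
vec.CopyToData(out);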

2.9.7.12.10 Global Vector

template <typename ValueType>
class GlobalVector : public rocalution::Vector
GlobalVector class. A GlobalVector is called global, because it can stay on a single node or on multiple nodes in a network. For this type of communication, MPI is used.

Template Parameters
• ValueType: can be int, float, double, std::complex<float> or std::complex<double>

rocalution::GlobalVector::GlobalVector(const ParallelManager &pm)
Initialize a global vector with a parallel manager.

virtual void rocalution::GlobalVector::Allocate(std::string name, IndexType2 size)
Allocate a global vector with name and size.

void rocalution::GlobalVector::SetParallelManager(const ParallelManager &pm)
Set the parallel manager of a global vector.

ValueType &rocalution::GlobalVector::operator[](int i)
Access operator (only for host data).

const ValueType &rocalution::GlobalVector::operator[](int i) const
Access operator (only for host data).

void rocalution::GlobalVector::SetDataPtr(ValueType **ptr, std::string name, IndexType2 size)
Initialize the local part of a global vector with externally allocated data.

void rocalution::GlobalVector::LeaveDataPtr(ValueType **ptr)
Get a pointer to the data from the local part of a global vector and free the global vector object.

void rocalution::GlobalVector::Restriction(const GlobalVector &vec_fine, const LocalVector &map)
Restriction operator based on a restriction mapping vector.

void rocalution::GlobalVector::Prolongation(const GlobalVector &vec_coarse, const LocalVector &map)
Prolongation operator based on a restriction mapping vector.

2.9.7.12.11 Parallel Manager

class ParallelManager : public rocalution::RocalutionObj
Parallel Manager class. The parallel manager class handles the communication and the mapping of the global operators. Each global operator and vector needs to be initialized with a valid parallel manager in order to perform any operation. For many distributed simulations, the underlying operator is already distributed. This information needs to be passed to the parallel manager.

void rocalution::ParallelManager::SetMPICommunicator(const void *comm)
Set the MPI communicator.

void rocalution::ParallelManager::Clear(void)
Clear all allocated resources.

IndexType2 rocalution::ParallelManager::GetGlobalSize(void) const
Return the global size.

int rocalution::ParallelManager::GetLocalSize(void) const
Return the local size.

int rocalution::ParallelManager::GetNumReceivers(void) const
Return the number of receivers.

int rocalution::ParallelManager::GetNumSenders(void) const
Return the number of senders.

int rocalution::ParallelManager::GetNumProcs(void) const
Return the number of involved processes.

void rocalution::ParallelManager::SetGlobalSize(IndexType2 size)
Initialize the global size.

void rocalution::ParallelManager::SetLocalSize(int size)
Initialize the local size.

void rocalution::ParallelManager::SetBoundaryIndex(int size, const int *index)
Set all boundary indices of this rank's process.

void rocalution::ParallelManager::SetReceivers(int nrecv, const int *recvs, const int *recv_offset)
Set the number of processes the current process is receiving data from, the array of those processes, and the offsets where the boundary for process 'receiver' starts.

void rocalution::ParallelManager::SetSenders(int nsend, const int *sends, const int *send_offset)
Set the number of processes the current process is sending data to, the array of those processes, and the offsets where the ghost part for process 'sender' starts.

void rocalution::ParallelManager::LocalToGlobal(int proc, int local, int &global)
Map a local index to a global index.

void rocalution::ParallelManager::GlobalToLocal(int global, int &proc, int &local)
Map a global index to a local index.

bool rocalution::ParallelManager::Status(void) const
Check the sanity status of the parallel manager.

void rocalution::ParallelManager::ReadFileASCII(const std::string filename)
Read a file that contains all relevant parallel manager data.

void rocalution::ParallelManager::WriteFileASCII(const std::string filename) const
Write a file that contains all relevant parallel manager data.
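
A rough sketch of a manual ParallelManager setup on each rank. All lowercase names (global_size, local_size, num_boundary, boundary_index, num_recv, recv_ranks, recv_offsets, num_send, send_ranks, send_offsets) are placeholders that are assumed to be derived from the existing distribution of the operator:

MPI_Comm comm = MPI_COMM_WORLD;

ParallelManager pm;
pm.SetMPICommunicator(&comm);

pm.SetGlobalSize(global_size);
pm.SetLocalSize(local_size);

// Rows of this rank that are coupled to other ranks
pm.SetBoundaryIndex(num_boundary, boundary_index);

// Communication pattern: who we receive from / send to, with offsets
pm.SetReceivers(num_recv, recv_ranks, recv_offsets);
pm.SetSenders(num_send, send_ranks, send_offsets);

// Sanity check before attaching global operators/vectors
if(pm.Status() == false)
{
    // inconsistent setup
}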

2.9.7.12.12 Solvers

template class Solver : public rocalution::RocalutionObj Base class for all solvers and preconditioners. Most of the solvers can be performed on linear operators LocalMatrix, LocalStencil and GlobalMatrix - i.e. the solvers can be performed locally (on a shared memory system) or in a distributed manner (on a cluster) via MPI. The only exception is the AMG (Algebraic Multigrid) solver which has two versions (one for LocalMatrix and one for GlobalMatrix class). The only pure local solvers (which do not support global/MPI operations) are the mixed-precision defect-correction solver and all direct solvers. All solvers need three template parameters - Operators, Vectors and Scalar type. The Solver class is purely virtual and provides an interface for


• SetOperator() to set the operator 퐴, i.e. the user can pass the matrix here. • Build() to build the solver (including preconditioners, sub-solvers, etc.). The user needs to specify the operator first before calling Build(). • Solve() to solve the system 퐴푥 = 푏. The user needs to pass a right-hand-side 푏 and a vector 푥, where the solution will be obtained. • Print() to show solver information. • ReBuildNumeric() to only re-build the solver numerically (if possible). • MoveToHost() and MoveToAccelerator() to offload the solver (including preconditioners and sub-solvers) to the host/accelerator.
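A minimal sketch of this SetOperator()/Build()/Solve() workflow is shown below, using CG as a concrete solver on a LocalMatrix. The matrix file name and the I/O and allocation helpers (ReadFileMTX(), Allocate(), Zeros(), Ones(), GetM(), GetN()) are assumptions used only for illustration.

#include <rocalution/rocalution.hpp> // header path assumed

using namespace rocalution;

int main() {
    init_rocalution(); // assumed library start-up call

    LocalMatrix<double> A;
    LocalVector<double> x, b;

    A.ReadFileMTX("matrix.mtx"); // assumed I/O helper and file name
    x.Allocate("x", A.GetN());
    b.Allocate("b", A.GetM());
    x.Zeros();
    b.Ones();

    CG<LocalMatrix<double>, LocalVector<double>, double> ls;

    ls.SetOperator(A); // pass the operator A
    ls.Build();        // build the solver
    ls.Solve(b, &x);   // solve A x = b
    ls.Print();        // show solver information

    ls.Clear();
    stop_rocalution();
    return 0;
}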

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

Subclassed by rocalution::DirectLinearSolver< OperatorType, VectorType, ValueType >, rocalu- tion::IterativeLinearSolver< OperatorType, VectorType, ValueType >, rocalution::Preconditioner< Oper- atorType, VectorType, ValueType > void rocalution::Solver::SetOperator(const OperatorType &op) Set the Operator of the solver. void rocalution::Solver::ResetOperator(const OperatorType &op) Reset the operator; see ReBuildNumeric() virtual void rocalution::Solver::Print(void) const = 0 Print information about the solver. virtual void rocalution::Solver::Solve(const VectorType &rhs, VectorType *x) = 0 Solve Operator x = rhs. void rocalution::Solver::SolveZeroSol(const VectorType &rhs, VectorType *x) Solve Operator x = rhs, setting initial x = 0. void rocalution::Solver::Clear(void) Clear (free all local data) the solver. void rocalution::Solver::Build(void) Build the solver (data allocation, structure and numerical computation) void rocalution::Solver::BuildMoveToAcceleratorAsync(void) Build the solver and move it to the accelerator asynchronously. void rocalution::Solver::Sync(void) Synchronize the solver. void rocalution::Solver::ReBuildNumeric(void) Rebuild the solver only with numerical computation (no allocation or data structure computation) void rocalution::Solver::MoveToHost(void) Move all data (i.e. move the solver) to the host. void rocalution::Solver::MoveToAccelerator(void) Move all data (i.e. move the solver) to the accelerator.

void rocalution::Solver::Verbose(int verb = 1) Provide verbose output of the solver.

• verb = 0 -> no output • verb = 1 -> print info about the solver (start, end); • verb = 2 -> print (iter, residual) via iteration control; template class IterativeLinearSolver : public rocalution::Solver Base class for all linear iterative solvers. The iterative solvers are controlled by an iteration control object, which monitors the convergence properties of the solver, i.e. maximum number of iterations, relative tolerance, absolute tolerance and divergence tolerance. The iteration control can also record the residual history and store it in an ASCII file. • Init(), InitMinIter(), InitMaxIter() and InitTol() initialize the solver and set the stopping criteria. • RecordResidualHistory() and RecordHistory() start the recording of the residual and write it into a file. • Verbose() sets the level of verbose output of the solver (0 - no output, 2 - detailed output, including residual and iteration information). • SetPreconditioner() sets the preconditioning. All iterative solvers are controlled based on

• Absolute stopping criteria, when |푟푘|퐿푝 ≤ 휖푎푏푠

• Relative stopping criteria, when |푟푘|퐿푝 /|푟1|퐿푝 ≤ 휖푟푒푙

• Divergence stopping criteria, when |푟푘|퐿푝 /|푟1|퐿푝 ≥ 휖푑푖푣 • Maximum number of iteration 푁, when 푘 = 푁

where 푘 is the current iteration, 푟푘 the residual for the current iteration 푘 (i.e. 푟푘 = 푏 − 퐴푥푘) and 푟1 the starting residual (i.e. 푟1 = 푏 − 퐴푥푖푛푖푡). In addition, the minimum number of iterations 푀 can be specified. In this case, the solver will not stop iterating before 푘 ≥ 푀.

The 퐿푝 norm is used for the computation, where 푝 could be 1, 2 and ∞. The norm computation can be set with SetResidualNorm() with 1 for 퐿1, 2 for 퐿2 and 3 for 퐿∞. For the computation with 퐿∞, the index of the maximum value can be obtained with GetAmaxResidualIndex(). If this function is called and 퐿∞ was not selected, it will return -1. The reached criterion can be obtained with GetSolverStatus(), returning • 0, if no criterion has been reached yet • 1, if the absolute tolerance has been reached • 2, if the relative tolerance has been reached • 3, if the divergence tolerance has been reached • 4, if the maximum number of iterations has been reached
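A short sketch of these controls follows, assuming the CG solver and the A, b, x objects from the previous sketch; the wrapper function name is hypothetical.

#include <iostream>
// Assumes <rocalution/rocalution.hpp> (path assumed) is included and
// 'using namespace rocalution;' as in the previous sketch.
void solve_with_controls(LocalMatrix<double>& A,
                         LocalVector<double>& b,
                         LocalVector<double>& x) {
    CG<LocalMatrix<double>, LocalVector<double>, double> ls;

    ls.SetOperator(A);
    ls.InitTol(1e-10, 1e-8, 1e+8); // absolute, relative and divergence tolerance
    ls.InitMaxIter(1000);          // stop after at most 1000 iterations
    ls.SetResidualNorm(2);         // use the L2 norm of the residual

    ls.Build();
    ls.Solve(b, &x);

    // 0 = no criterion, 1 = abs. tol, 2 = rel. tol, 3 = divergence, 4 = max iterations
    std::cout << "status "      << ls.GetSolverStatus()
              << " iterations " << ls.GetIterationCount()
              << " residual "   << ls.GetCurrentResidual() << std::endl;

    ls.Clear();
}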

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>


Subclassed by rocalution::BaseMultiGrid< OperatorType, VectorType, ValueType >, rocalution::BiCGStab< OperatorType, VectorType, ValueType >, rocalution::BiCGStabl< OperatorType, VectorType, ValueType >, rocalution::CG< OperatorType, VectorType, ValueType >, rocalution::Chebyshev< OperatorType, Vec- torType, ValueType >, rocalution::CR< OperatorType, VectorType, ValueType >, rocalution::FCG< Opera- torType, VectorType, ValueType >, rocalution::FGMRES< OperatorType, VectorType, ValueType >, rocalu- tion::FixedPoint< OperatorType, VectorType, ValueType >, rocalution::GMRES< OperatorType, VectorType, ValueType >, rocalution::IDR< OperatorType, VectorType, ValueType >, rocalution::QMRCGStab< Opera- torType, VectorType, ValueType > void rocalution::IterativeLinearSolver::Init(double abs_tol, double rel_tol, double div_tol, int max_iter) Initialize the solver with absolute/relative/divergence tolerance and maximum number of iterations. void rocalution::IterativeLinearSolver::Init(double abs_tol, double rel_tol, double div_tol, int min_iter, int max_iter) Initialize the solver with absolute/relative/divergence tolerance and minimum/maximum number of iterations. void rocalution::IterativeLinearSolver::InitMinIter(int min_iter) Set the minimum number of iterations. void rocalution::IterativeLinearSolver::InitMaxIter(int max_iter) Set the maximum number of iterations. void rocalution::IterativeLinearSolver::InitTol(double abs, double rel, double div) Set the absolute/relative/divergence tolerance. void rocalution::IterativeLinearSolver::SetResidualNorm(int resnorm) Set the residual norm to 퐿1, 퐿2 or 퐿∞ norm.

• resnorm = 1 -> 퐿1 norm

• resnorm = 2 -> 퐿2 norm

• resnorm = 3 -> 퐿∞ norm

void rocalution::IterativeLinearSolver::RecordResidualHistory(void) Record the residual history. void rocalution::IterativeLinearSolver::RecordHistory(const std::string filename) const Write the history to file. void rocalution::IterativeLinearSolver::Verbose(int verb = 1) Set the solver verbosity output. void rocalution::IterativeLinearSolver::Solve(const VectorType &rhs, VectorType *x) Solve Operator x = rhs. void rocalution::IterativeLinearSolver::SetPreconditioner(Solver &precond) Set a preconditioner of the linear solver. int rocalution::IterativeLinearSolver::GetIterationCount(void) Return the iteration count. double rocalution::IterativeLinearSolver::GetCurrentResidual(void) Return the current residual. int rocalution::IterativeLinearSolver::GetSolverStatus(void) Return the current status.


int rocalution::IterativeLinearSolver::GetAmaxResidualIndex(void) Return absolute maximum index of residual vector when using 퐿∞ norm. template class FixedPoint : public rocalution::IterativeLinearSolver Fixed-Point Iteration Scheme. The Fixed-Point iteration scheme is based on additive splitting of the matrix 퐴 = 푀 + 푁. The scheme reads

x_{k+1} = M^{-1} (b - N x_k).

It can also be reformulated as a weighted defect correction scheme

x_{k+1} = x_k - \omega M^{-1} (A x_k - b).

The inversion of 푀 can be performed by preconditioners (Jacobi, Gauss-Seidel, ILU, etc.) or by any type of solvers.

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>
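A small, hypothetical sketch of a relaxed (weighted) fixed-point iteration follows, using the Jacobi preconditioner in the role of M^{-1}; the wrapper function name and the relaxation value are illustrative.

// Assumes <rocalution/rocalution.hpp> (path assumed) is included and
// 'using namespace rocalution;' as in the earlier sketches.
void solve_fixed_point(LocalMatrix<double>& A,
                       LocalVector<double>& b,
                       LocalVector<double>& x) {
    FixedPoint<LocalMatrix<double>, LocalVector<double>, double> fp;
    Jacobi<LocalMatrix<double>, LocalVector<double>, double> p;

    fp.SetOperator(A);
    fp.SetPreconditioner(p); // M^{-1} is applied by the preconditioner
    fp.SetRelaxation(0.8);   // relaxation parameter omega (illustrative value)
    fp.InitMaxIter(500);

    fp.Build();
    fp.Solve(b, &x);
    fp.Clear();
}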

void rocalution::FixedPoint::SetRelaxation(ValueType omega) Set relaxation parameter 휔. template class MixedPrecisionDC : public rocalution::IterativeLinearSolver Mixed-Precision Defect Correction Scheme. The Mixed-Precision solver is based on a defect-correction scheme. The current implementation of the library uses host-based correction in double precision and accelerator computation in single precision. The solver implements the scheme

x_{k+1} = x_k + A^{-1} r_k,

where the computation of the residual 푟푘 = 푏 − 퐴푥푘 and the update 푥푘+1 = 푥푘 + 푑푘 are performed on the host in double precision. The computation of the residual system 퐴푑푘 = 푟푘 is performed on the accelerator in single precision. In addition to the setup functions of the iterative solver, the user needs to specify the inner solver for 퐴푑푘 = 푟푘.

Template Parameters • OperatorTypeH: - can be LocalMatrix • VectorTypeH: - can be LocalVector • ValueTypeH: - can be double • OperatorTypeL: - can be LocalMatrix • VectorTypeL: - can be LocalVector • ValueTypeL: - can be float

void rocalution::MixedPrecisionDC::Set(Solver &Solver_L) Set the inner solver for 퐴푑푘 = 푟푘.
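A hypothetical sketch of this setup is shown below: the outer defect-correction loop runs in double precision and the inner solver (CG assumed here) in single precision; the wrapper function name is illustrative.

// Assumes <rocalution/rocalution.hpp> (path assumed) is included and
// 'using namespace rocalution;' as in the earlier sketches.
void solve_mixed_precision(LocalMatrix<double>& A,
                           LocalVector<double>& b,
                           LocalVector<double>& x) {
    MixedPrecisionDC<LocalMatrix<double>, LocalVector<double>, double,
                     LocalMatrix<float>, LocalVector<float>, float> mp;
    CG<LocalMatrix<float>, LocalVector<float>, float> inner;

    mp.SetOperator(A); // double-precision operator
    mp.Set(inner);     // inner solver for A d_k = r_k, run in single precision
    mp.Build();
    mp.Solve(b, &x);   // rhs and solution stay in double precision
    mp.Clear();
}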


template class Chebyshev : public rocalution::IterativeLinearSolver Chebyshev Iteration Scheme. The Chebyshev Iteration scheme (also known as acceleration scheme) is similar to the CG method but requires the minimum and maximum eigenvalues of the operator.

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

void rocalution::Chebyshev::Set(ValueType lambda_min, ValueType lambda_max) Set the minimum and maximum eigenvalues of the operator. template class BiCGStab : public rocalution::IterativeLinearSolver Bi-Conjugate Gradient Stabilized Method. The Bi-Conjugate Gradient Stabilized method is a variation of CGS and solves sparse (non) symmetric linear systems 퐴푥 = 푏. SAAD

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex template class BiCGStabl : public rocalution::IterativeLinearSolver Bi-Conjugate Gradient Stabilized (l) Method. The Bi-Conjugate Gradient Stabilized (l) method is a generalization of BiCGStab for solving sparse (non) sym- metric linear systems 퐴푥 = 푏. It minimizes residuals over 푙-dimensional Krylov subspaces. The degree 푙 can be set with SetOrder(). bicgstabl

Template Parameters • OperatorType: - can be LocalMatrix or GlobalMatrix • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex void rocalution::BiCGStabl::SetOrder(int l) Set the order. template class CG : public rocalution::IterativeLinearSolver Conjugate Gradient Method. The Conjugate Gradient method is the best known iterative method for solving sparse symmetric positive definite (SPD) linear systems 퐴푥 = 푏. It is based on orthogonal projection onto the Krylov subspace 풦푚(푟0, 퐴), where 푟0 is the initial residual. The method can be preconditioned, where the approximation should also be SPD. SAAD


Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> template class CR : public rocalution::IterativeLinearSolver Conjugate Residual Method. The Conjugate Residual method is an iterative method for solving sparse symmetric semi-positive definite linear systems 퐴푥 = 푏. It is a Krylov subspace method and differs from the much more popular Conjugate Gradient method in that the system matrix is not required to be positive definite. The method can be preconditioned, where the approximation should also be SPD or semi-positive definite. SAAD

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> template class FCG : public rocalution::IterativeLinearSolver Flexible Conjugate Gradient Method. The Flexible Conjugate Gradient method is an iterative method for solving sparse symmetric positive definite linear systems 퐴푥 = 푏. It is similar to the Conjugate Gradient method, with the only difference that it allows the preconditioner 푀−1 to be a non-constant operator. This can be especially helpful if the operation 푀−1푥 is the result of another iterative process and not a constant operator. fcg

Template Parameters • OperatorType: - can be LocalMatrix or GlobalMatrix • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex template class GMRES : public rocalution::IterativeLinearSolver Generalized Minimum Residual Method. The Generalized Minimum Residual method (GMRES) is a projection method for solving sparse (non) symmet- ric linear systems 퐴푥 = 푏, based on restarting technique. The solution is approximated in a Krylov subspace 풦 = 풦푚 and ℒ = 퐴풦푚 with minimal residual, where 풦푚 is the 푚-th Krylov subspace with 푣1 = 푟0/||푟0||2. SAAD The Krylov subspace basis size can be set using SetBasisSize(). The default size is 30.

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>


void rocalution::GMRES::SetBasisSize(int size_basis) Set the size of the Krylov subspace basis. template class FGMRES : public rocalution::IterativeLinearSolver Flexible Generalized Minimum Residual Method. The Flexible Generalized Minimum Residual method (FGMRES) is a projection method for solving sparse (non) symmetric linear systems 퐴푥 = 푏. It is similar to the GMRES method, with the only difference that FGMRES is based on a window shifting of the Krylov subspace, which allows the preconditioner 푀−1 to be a non-constant operator. This can be especially helpful if the operation 푀−1푥 is the result of another iterative process and not a constant operator. SAAD The Krylov subspace basis size can be set using SetBasisSize(). The default size is 30.
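A small sketch of adjusting the Krylov basis (restart) size on GMRES follows; the same call is available on FGMRES. The wrapper function name and the basis size are illustrative.

// Assumes <rocalution/rocalution.hpp> (path assumed) is included and
// 'using namespace rocalution;' as in the earlier sketches.
void solve_gmres(LocalMatrix<double>& A,
                 LocalVector<double>& b,
                 LocalVector<double>& x) {
    GMRES<LocalMatrix<double>, LocalVector<double>, double> ls;

    ls.SetOperator(A);
    ls.SetBasisSize(50); // enlarge the Krylov basis from the default of 30
    ls.Build();
    ls.Solve(b, &x);
    ls.Clear();
}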

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex void rocalution::FGMRES::SetBasisSize(int size_basis) Set the size of the Krylov subspace basis. template class IDR : public rocalution::IterativeLinearSolver Induced Dimension Reduction Method. The Induced Dimension Reduction method is a Krylov subspace method for solving sparse (non) symmetric linear systems 퐴푥 = 푏. IDR(s) generates residuals in a sequence of nested subspaces. IDR1 IDR2 The dimension of the shadow space can be set by SetShadowSpace(). The default size of the shadow space is 4.

Template Parameters • OperatorType: - can be LocalMatrix, GlobalMatrix or LocalStencil • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex void rocalution::IDR::SetShadowSpace(int s) Set the size of the Shadow Space. void rocalution::IDR::SetRandomSeed(unsigned long long seed) Set random seed for ONB creation (seed must be greater than 0) template class QMRCGStab : public rocalution::IterativeLinearSolver Quasi-Minimal Residual Conjugate Gradient Stabilized Method. The Quasi-Minimal Residual Conjugate Gradient Stabilized method is a variant of the Krylov subspace BiCGStab method for solving sparse (non) symmetric linear systems 퐴푥 = 푏. qmrcgstab

Template Parameters • OperatorType: - can be LocalMatrix or GlobalMatrix • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>


template class BaseMultiGrid : public rocalution::IterativeLinearSolver Base class for all multigrid solvers Trottenberg2003.

Template Parameters • OperatorType: - can be LocalMatrix or GlobalMatrix • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

Subclassed by rocalution::BaseAMG< OperatorType, VectorType, ValueType >, rocalution::MultiGrid< OperatorType, VectorType, ValueType > void rocalution::BaseMultiGrid::SetSolver(Solver &solver) Set the coarse grid solver. void rocalution::BaseMultiGrid::SetSmoother(IterativeLinearSolver **smoother) Set the smoother for each level. void rocalution::BaseMultiGrid::SetSmootherPreIter(int iter) Set the number of pre-smoothing steps. void rocalution::BaseMultiGrid::SetSmootherPostIter(int iter) Set the number of post-smoothing steps. virtual void rocalution::BaseMultiGrid::SetRestrictOperator(OperatorType **op) = 0 Set the restriction operator for each level. virtual void rocalution::BaseMultiGrid::SetProlongOperator(OperatorType **op) = 0 Set the prolongation operator for each level. virtual void rocalution::BaseMultiGrid::SetOperatorHierarchy(OperatorType **op) = 0 Set the operator for each level. void rocalution::BaseMultiGrid::SetScaling(bool scaling) Enable/disable scaling of intergrid transfers. void rocalution::BaseMultiGrid::SetHostLevels(int levels) Force computation of coarser levels on the host backend. void rocalution::BaseMultiGrid::SetCycle(unsigned int cycle) Set the MultiGrid Cycle (default: Vcycle) void rocalution::BaseMultiGrid::SetKcycleFull(bool kcycle_full) Set the MultiGrid Kcycle on all levels or only on finest level. void rocalution::BaseMultiGrid::InitLevels(int levels) Set the depth of the multigrid solver. template class MultiGrid : public rocalution::BaseMultiGrid MultiGrid Method. The MultiGrid method can be used with external data, such as externally computed restriction, prolongation and operator hierarchy. The user needs to pass all this information for each level and for its construction. This includes smoothing step, prolongation/restriction, grid traversing and coarse grid solver. This data needs to be passed to the solver. Trottenberg2003


• Restriction and prolongation operations can be performed in two ways, based on Restriction() and Prolongation() of the LocalVector class, or by matrix-vector multiplication. This is configured by a set function. • Smoothers can be any iterative linear solver. Valid options are Jacobi, Gauss-Seidel, ILU, etc. using a FixedPoint iteration scheme with a pre-defined number of iterations. The smoothers could also be a solver such as CG, BiCGStab, etc. • The coarse grid solver can be any iterative linear solver. The class also provides mechanisms to specify where the coarse grid solver is performed, on the host or on the accelerator. The coarse grid solver can be preconditioned.

• Grid scaling based on an 퐿2 norm ratio. • Operator matrices need to be passed on each grid level.

Template Parameters • OperatorType: - can be LocalMatrix or GlobalMatrix • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

template class BaseAMG : public rocalution::BaseMultiGrid Base class for all algebraic multigrid solvers. The Algebraic MultiGrid solver is based on the BaseMultiGrid class. The coarsening is obtained by different aggregation techniques. The smoothers can be constructed inside or outside of the class. All parameters in the Algebraic MultiGrid class can be set externally, including smoothers and coarse grid solver.

Template Parameters • OperatorType: - can be LocalMatrix or GlobalMatrix • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

Subclassed by rocalution::GlobalPairwiseAMG< OperatorType, VectorType, ValueType >, rocalu- tion::PairwiseAMG< OperatorType, VectorType, ValueType >, rocalution::RugeStuebenAMG< OperatorType, VectorType, ValueType >, rocalution::SAAMG< OperatorType, VectorType, ValueType >, rocalution::UAAMG< OperatorType, VectorType, ValueType > void rocalution::BaseAMG::ClearLocal(void) Clear all local data. void rocalution::BaseAMG::BuildHierarchy(void) Create AMG hierarchy. void rocalution::BaseAMG::BuildSmoothers(void) Create AMG smoothers. void rocalution::BaseAMG::SetCoarsestLevel(int coarse_size) Set coarsest level for hierarchy creation. void rocalution::BaseAMG::SetManualSmoothers(bool sm_manual) Set flag to pass smoothers manually for each level. void rocalution::BaseAMG::SetManualSolver(bool s_manual) Set flag to pass coarse grid solver manually.
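A hypothetical sketch of using an algebraic multigrid solver from this family (SAAMG assumed) follows, with the coarsening controlled by SetCoarsestLevel() and the default smoothers and coarse grid solver; the wrapper function name and the numeric values are illustrative.

// Assumes <rocalution/rocalution.hpp> (path assumed) is included and
// 'using namespace rocalution;' as in the earlier sketches.
void solve_amg(LocalMatrix<double>& A,
               LocalVector<double>& b,
               LocalVector<double>& x) {
    SAAMG<LocalMatrix<double>, LocalVector<double>, double> ls;

    ls.SetOperator(A);
    ls.SetCoarsestLevel(300);     // stop coarsening below roughly 300 unknowns
    ls.SetManualSmoothers(false); // let the solver construct default smoothers
    ls.SetManualSolver(false);    // and a default coarse grid solver
    ls.InitMaxIter(100);

    ls.Build(); // builds the hierarchy, the smoothers and the coarse grid solver
    ls.Solve(b, &x);
    ls.Clear();
}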


void rocalution::BaseAMG::SetDefaultSmootherFormat(unsigned int op_format) Set the smoother operator format. void rocalution::BaseAMG::SetOperatorFormat(unsigned int op_format) Set the operator format. int rocalution::BaseAMG::GetNumLevels(void) Returns the number of levels in hierarchy. template class UAAMG : public rocalution::BaseAMG Unsmoothed Aggregation Algebraic MultiGrid Method. The Unsmoothed Aggregation Algebraic MultiGrid method is based on unsmoothed aggregation based interpo- lation scheme. stuben

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex void rocalution::UAAMG::SetCouplingStrength(ValueType eps) Set coupling strength. void rocalution::UAAMG::SetOverInterp(ValueType overInterp) Set over-interpolation parameter for aggregation. template class SAAMG : public rocalution::BaseAMG Smoothed Aggregation Algebraic MultiGrid Method. The Smoothed Aggregation Algebraic MultiGrid method is based on smoothed aggregation based interpolation scheme. vanek

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> void rocalution::SAAMG::SetCouplingStrength(ValueType eps) Set coupling strength. void rocalution::SAAMG::SetInterpRelax(ValueType relax) Set the relaxation parameter. template class RugeStuebenAMG : public rocalution::BaseAMG Ruge-Stueben Algebraic MultiGrid Method. The Ruge-Stueben Algebraic MultiGrid method is based on the classic Ruge-Stueben coarsening with direct interpolation. The solver provides high efficiency in terms of solver complexity (i.e. number of iterations). However, its building step is usually more expensive and it requires more memory. stuben

Template Parameters • OperatorType: - can be LocalMatrix


• VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> void rocalution::RugeStuebenAMG::SetCouplingStrength(ValueType eps) Set coupling strength. template class PairwiseAMG : public rocalution::BaseAMG Pairwise Aggregation Algebraic MultiGrid Method. The Pairwise Aggregation Algebraic MultiGrid method is based on a pairwise aggregation matching scheme. It delivers a very efficient building phase and is well suited for Poisson-like equations. It typically requires a K-cycle in the solving phase to keep the number of iterations low. pairwiseamg

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> void rocalution::PairwiseAMG::SetBeta(ValueType beta) Set beta for pairwise aggregation. void rocalution::PairwiseAMG::SetOrdering(unsigned int ordering) Set re-ordering for aggregation. void rocalution::PairwiseAMG::SetCoarseningFactor(double factor) Set target coarsening factor. template class GlobalPairwiseAMG : public rocalution::BaseAMG Pairwise Aggregation Algebraic MultiGrid Method (multi-node) The Pairwise Aggregation Algebraic MultiGrid method is based on a pairwise aggregation matching scheme. It delivers a very efficient building phase and is well suited for Poisson-like equations. It typically requires a K-cycle in the solving phase to keep the number of iterations low. This version has multi-node support. pairwiseamg

Template Parameters • OperatorType: - can be GlobalMatrix • VectorType: - can be GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex void rocalution::GlobalPairwiseAMG::SetBeta(ValueType beta) Set beta for pairwise aggregation. void rocalution::GlobalPairwiseAMG::SetOrdering(const _aggregation_ordering ordering) Set re-ordering for aggregation. void rocalution::GlobalPairwiseAMG::SetCoarseningFactor(double factor) Set target coarsening factor. template

class DirectLinearSolver : public rocalution::Solver Base class for all direct linear solvers. The library provides three direct methods - LU, QR and Inversion (based on QR decomposition). The user can pass a sparse matrix; internally it will be converted to dense and then the selected method will be applied. Because the matrix is converted to a dense format, these methods are not efficient and should be used only for very small matrices.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

Subclassed by rocalution::Inversion< OperatorType, VectorType, ValueType >, rocalution::LU< OperatorType, VectorType, ValueType >, rocalution::QR< OperatorType, VectorType, ValueType > template class Inversion : public rocalution::DirectLinearSolver Matrix Inversion. Full matrix inversion based on QR decomposition.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex template class LU : public rocalution::DirectLinearSolver LU Decomposition. Lower-Upper Decomposition factors a given square matrix into lower and upper triangular matrix, such that 퐴 = 퐿푈.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex template class QR : public rocalution::DirectLinearSolver QR Decomposition. The QR Decomposition decomposes a given matrix into 퐴 = 푄푅, such that 푄 is an orthogonal matrix and 푅 an upper triangular matrix.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>


2.9.7.12.13 Preconditioners

template class Preconditioner : public rocalution::Solver Base class for all preconditioners.

Template Parameters • OperatorType: - can be LocalMatrix or GlobalMatrix • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>
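As a sketch of how a preconditioner is attached to an iterative solver, the hypothetical fragment below pairs CG with an ILU(0) preconditioner via SetPreconditioner(); the wrapper function name is illustrative.

// Assumes <rocalution/rocalution.hpp> (path assumed) is included and
// 'using namespace rocalution;' as in the earlier sketches.
void solve_precond_cg(LocalMatrix<double>& A,
                      LocalVector<double>& b,
                      LocalVector<double>& x) {
    CG<LocalMatrix<double>, LocalVector<double>, double> ls;
    ILU<LocalMatrix<double>, LocalVector<double>, double> p;

    p.Set(0);                // ILU(0): no additional fill-in levels
    ls.SetOperator(A);
    ls.SetPreconditioner(p); // the preconditioner is built together with the solver
    ls.Build();
    ls.Solve(b, &x);
    ls.Clear();
}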

Subclassed by rocalution::AIChebyshev< OperatorType, VectorType, ValueType >, rocalution::AS< OperatorType, VectorType, ValueType >, rocalution::BlockJacobi< OperatorType, VectorType, Val- ueType >, rocalution::BlockPreconditioner< OperatorType, VectorType, ValueType >, rocalu- tion::DiagJacobiSaddlePointPrecond< OperatorType, VectorType, ValueType >, rocalution::FSAI< Opera- torType, VectorType, ValueType >, rocalution::GS< OperatorType, VectorType, ValueType >, rocalution::IC< OperatorType, VectorType, ValueType >, rocalution::ILU< OperatorType, VectorType, ValueType >, rocalu- tion::ILUT< OperatorType, VectorType, ValueType >, rocalution::Jacobi< OperatorType, VectorType, Value- Type >, rocalution::MultiColored< OperatorType, VectorType, ValueType >, rocalution::MultiElimination< OperatorType, VectorType, ValueType >, rocalution::SGS< OperatorType, VectorType, ValueType >, rocalu- tion::SPAI< OperatorType, VectorType, ValueType >, rocalution::TNS< OperatorType, VectorType, ValueType >, rocalution::VariablePreconditioner< OperatorType, VectorType, ValueType > template class AIChebyshev : public rocalution::Preconditioner Approximate Inverse - Chebyshev Preconditioner. The Approximate Inverse - Chebyshev Preconditioner is an inverse matrix preconditioner with values from a linear combination of matrix-valued Chebyshev polynomials. chebpoly

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex void rocalution::AIChebyshev::Set(int p, ValueType lambda_min, ValueType lambda_max) Set order, min and max eigenvalues. template class FSAI : public rocalution::Preconditioner Factorized Approximate Inverse Preconditioner. The Factorized Sparse Approximate Inverse preconditioner computes a direct approximation of 푀 −1 by min- −1 imizing the Frobenius norm ||퐼퐺퐿||퐹 , where 퐿 denotes the exact lower triangular part of 퐴 and 퐺 := 푀 . The FSAI preconditioner is initialized by 푞, based on the sparsity pattern of |퐴푞|. However, it is also possible to supply external sparsity patterns in form of the LocalMatrix class. kolotilina

Note The FSAI preconditioner is only suited for symmetric positive definite matrices. Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector


• ValueType: - can be float, double, std::complex<float> or std::complex<double>

void rocalution::FSAI::Set(int power) Set the power of the system matrix sparsity pattern. void rocalution::FSAI::Set(const OperatorType &pattern) Set an external sparsity pattern. void rocalution::FSAI::SetPrecondMatrixFormat(unsigned int mat_format) Set the matrix format of the preconditioner. template class SPAI : public rocalution::Preconditioner SParse Approximate Inverse Preconditioner. The SParse Approximate Inverse algorithm is an explicitly computed preconditioner for general sparse linear systems. In its current implementation, only the sparsity pattern of the system matrix is supported. The SPAI computation is based on the minimization of the Frobenius norm ||AM - I||_F. grote

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

void rocalution::SPAI::SetPrecondMatrixFormat(unsigned int mat_format) Set the matrix format of the preconditioner. template class TNS : public rocalution::Preconditioner Truncated Neumann Series Preconditioner. The Truncated Neumann Series (TNS) preconditioner is based on M^{-1} = K^T D^{-1} K, where K = (I - L D^{-1} + (L D^{-1})^2), with the diagonal D of A and the strictly lower triangular part L of A. The preconditioner can be computed in two forms - explicitly and implicitly. In the implicit form, the full construction of M is performed via matrix-matrix operations, whereas in the explicit form, the application of the preconditioner is based on matrix-vector operations only. The matrix format for the stored matrices can be specified.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> void rocalution::TNS::Set(bool imp) Set implicit (true) or explicit (false) computation. void rocalution::TNS::SetPrecondMatrixFormat(unsigned int mat_format) Set the matrix format of the preconditioner. template class AS : public rocalution::Preconditioner Additive Schwarz Preconditioner. The Additive Schwarz preconditioner relies on a preconditioning technique, where the linear system 퐴푥 = 푏 can be decomposed into small sub-problems based on A_i = R_i^T A R_i, where R_i are restriction operators. Those


restriction operators produce sub-matrices which overlap. This leads to contributions from two preconditioners on the overlapped area which are scaled by 1/2. RAS

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

Subclassed by rocalution::RAS< OperatorType, VectorType, ValueType > void rocalution::AS::Set(int nb, int overlap, Solver **preconds) Set number of blocks, overlap and array of preconditioners. template class RAS : public rocalution::AS Restricted Additive Schwarz Preconditioner. The Restricted Additive Schwarz preconditioner relies on a preconditioning technique, where the linear system 퐴푥 = 푏 can be decomposed into small sub-problems based on A_i = R_i^T A R_i, where R_i are restriction operators. The RAS method is a mixture of block Jacobi and the AS scheme. In this case, the sub-matrices contain overlapped areas from other blocks, too. RAS

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

template class BlockJacobi : public rocalution::Preconditioner Block-Jacobi Preconditioner. The Block-Jacobi preconditioner is designed to wrap any local preconditioner and apply it in a global block fashion locally on each interior matrix.

Template Parameters • OperatorType: - can be GlobalMatrix • VectorType: - can be GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

void rocalution::BlockJacobi::Set(Solver<LocalMatrix<ValueType>, LocalVector<ValueType>, ValueType> &precond) Set local preconditioner. template class BlockPreconditioner : public rocalution::Preconditioner Block-Preconditioner. When handling vector fields, typically one can try to use different preconditioners and/or solvers for the different blocks. For such problems, the library provides a block-type preconditioner. This preconditioner builds the


following block-type matrix

P = \begin{pmatrix} A_d & 0 & \cdots & 0 \\ B_1 & B_d & \cdots & 0 \\ \vdots & & \ddots & \\ Z_1 & Z_2 & \cdots & Z_d \end{pmatrix}

The solution of 푃 can be performed in two ways. It can be solved by block-lower-triangular sweeps with inversion of the blocks 퐴푑 . . . 푍푑 and with a multiplication of the corresponding blocks. This is set by SetLSolver() (which is the default solution scheme). Alternatively, it can be used only with an inverse of the diagonal 퐴푑 . . . 푍푑 (Block-Jacobi type) by using SetDiagonalSolver().

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

void rocalution::BlockPreconditioner::Set(int n, const int *size, Solver **D_solver) Set number, size and diagonal solver. void rocalution::BlockPreconditioner::SetDiagonalSolver(void) Set diagonal solver mode. void rocalution::BlockPreconditioner::SetLSolver(void) Set lower triangular sweep mode. void rocalution::BlockPreconditioner::SetExternalLastMatrix(const OperatorType &mat) Set external last block matrix. void rocalution::BlockPreconditioner::SetPermutation(const LocalVector &perm) Set permutation vector. template class Jacobi : public rocalution::Preconditioner Jacobi Method. The Jacobi method is for solving a diagonally dominant system of linear equations 퐴푥 = 푏. It solves for each diagonal element iteratively until convergence, such that

x_i^{(k+1)} = (1 - \omega) x_i^{(k)} + \frac{\omega}{a_{ii}} \left( b_i - \sum_{j=1}^{i-1} a_{ij} x_j^{(k)} - \sum_{j=i+1}^{n} a_{ij} x_j^{(k)} \right)

Template Parameters • OperatorType: - can be LocalMatrix or GlobalMatrix • VectorType: - can be LocalVector or GlobalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

template class GS : public rocalution::Preconditioner Gauss-Seidel / Successive Over-Relaxation Method.


The Gauss-Seidel / SOR method is for solving a system of linear equations 퐴푥 = 푏. It approximates the solution iteratively with

x_i^{(k+1)} = (1 - \omega) x_i^{(k)} + \frac{\omega}{a_{ii}} \left( b_i - \sum_{j=1}^{i-1} a_{ij} x_j^{(k+1)} - \sum_{j=i+1}^{n} a_{ij} x_j^{(k)} \right),

with 휔 ∈ (0, 2).

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> template class SGS : public rocalution::Preconditioner Symmetric Gauss-Seidel / Symmetric Successive Over-Relaxation Method. The Symmetric Gauss-Seidel / SSOR method is for solving a system of linear equations 퐴푥 = 푏. It approximates the solution iteratively.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex template class ILU : public rocalution::Preconditioner Incomplete LU Factorization based on levels. The Incomplete LU Factorization based on levels computes a sparse lower and sparse upper triangular matrix such that 퐴 = 퐿푈 − 푅.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> void rocalution::ILU::Set(int p, bool level = true) Initialize ILU(p) factorization based on power. SAAD • level = true build the structure based on levels • level = false build the structure only based on the power(p+1) template class ILUT : public rocalution::Preconditioner Incomplete LU Factorization based on threshold. The Incomplete LU Factorization based on threshold computes a sparse lower and sparse upper triangular matrix such that 퐴 = 퐿푈 − 푅. Fill-in values are dropped depending on a threshold and the maximal number of fill-ins per row. SAAD


Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex void rocalution::ILUT::Set(double t) Set drop-off threshold. void rocalution::ILUT::Set(double t, int maxrow) Set drop-off threshold and maximum fill-ins per row. template class IC : public rocalution::Preconditioner Incomplete Cholesky Factorization without fill-ins. The Incomplete Cholesky Factorization computes a sparse lower triangular matrix such that 퐴 = 퐿퐿푇 − 푅. Additional fill-ins are dropped and the sparsity pattern of the original matrix is preserved.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex template class VariablePreconditioner : public rocalution::Preconditioner Variable Preconditioner. The Variable Preconditioner can hold a selection of preconditioners. Thus, any type of preconditioners can be combined. As example, the variable preconditioner can combine Jacobi, GS and ILU – then, the first iteration of the iterative solver will apply Jacobi, the second iteration will apply GS and the third iteration will apply ILU. After that, the solver will start again with Jacobi, GS, ILU.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex void rocalution::VariablePreconditioner::SetPreconditioner(int n, Solver **precond) Set the preconditioner sequence. template class MultiColored : public rocalution::Preconditioner Base class for all multi-colored preconditioners.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>


Subclassed by rocalution::MultiColoredILU< OperatorType, VectorType, ValueType >, rocalu- tion::MultiColoredSGS< OperatorType, VectorType, ValueType > void rocalution::MultiColored::SetPrecondMatrixFormat(unsigned int mat_format) Set a specific matrix type of the decomposed block matrices. void rocalution::MultiColored::SetDecomposition(bool decomp) Set if the preconditioner should be decomposed or not. template class MultiColoredSGS : public rocalution::MultiColored Multi-Colored Symmetric Gauss-Seidel / SSOR Preconditioner. The Multi-Colored Symmetric Gauss-Seidel / SSOR preconditioner is based on the splitting of the original matrix. Higher parallelism in solving the forward and backward substitution is obtained by performing a multi- colored decomposition. Details on the Symmetric Gauss-Seidel / SSOR algorithm can be found in the SGS preconditioner.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

Subclassed by rocalution::MultiColoredGS< OperatorType, VectorType, ValueType > void rocalution::MultiColoredSGS::SetRelaxation(ValueType omega) Set the relaxation parameter for the SOR/SSOR scheme. template class MultiColoredGS : public rocalution::MultiColoredSGS Multi-Colored Gauss-Seidel / SOR Preconditioner. The Multi-Colored Gauss-Seidel / SOR preconditioner is based on the splitting of the original matrix. Higher parallelism in solving the forward substitution is obtained by performing a multi-colored decomposition. Details on the Gauss-Seidel / SOR algorithm can be found in the GS preconditioner.

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex template class MultiColoredILU : public rocalution::MultiColored Multi-Colored Incomplete LU Factorization Preconditioner. Multi-Colored Incomplete LU Factorization based on the ILU(p) factorization with a power(q)-pattern method. This method provides a higher degree of parallelism of forward and backward substitution compared to the standard ILU(p) preconditioner. Lukarski2012

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex

void rocalution::MultiColoredILU::Set(int p) Initialize a multi-colored ILU(p, p+1) preconditioner. void rocalution::MultiColoredILU::Set(int p, int q, bool level = true) Initialize a multi-colored ILU(p, q) preconditioner. level = true will perform the factorization with levels level = false will perform the factorization only on the power(q)-pattern template class MultiElimination : public rocalution::Preconditioner Multi-Elimination Incomplete LU Factorization Preconditioner. The Multi-Elimination Incomplete LU preconditioner is based on the following decomposition

A = \begin{pmatrix} D & F \\ E & C \end{pmatrix} = \begin{pmatrix} I & 0 \\ E D^{-1} & I \end{pmatrix} \times \begin{pmatrix} D & F \\ 0 & \hat{A} \end{pmatrix},

where \hat{A} = C - E D^{-1} F. To make the inversion of D easier, we permute the preconditioning before the factorization with a permutation P to obtain only diagonal elements in D. The permutation here is based on a maximal independent set. This procedure can be applied to the block matrix \hat{A}, in this way we can perform the factorization recursively. In the last level of the recursion, we need to provide a solution procedure. By the design of the library, this can be any kind of solver. SAAD

Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double> int rocalution::MultiElimination::GetSizeDiagBlock(void) const Returns the size of the first (diagonal) block of the preconditioner. int rocalution::MultiElimination::GetLevel(void) const Return the depth of the current level. void rocalution::MultiElimination::Set(Solver &AA_Solver, int level, double drop_off = 0.0) Initialize (recursively) ME-ILU with level (depth of recursion) AA_Solvers - defines the last-block solver drop_off - defines drop-off tolerance void rocalution::MultiElimination::SetPrecondMatrixFormat(unsigned int mat_format) Set a specific matrix type of the decomposed block matrices. template class DiagJacobiSaddlePointPrecond : public rocalution::Preconditioner Diagonal Preconditioner for Saddle-Point Problems. Consider the following saddle-point problem

A = \begin{pmatrix} K & F \\ E & 0 \end{pmatrix}.

For such problems we can construct a diagonal Jacobi-type preconditioner of type

P = \begin{pmatrix} K & 0 \\ 0 & S \end{pmatrix},

with S = E D^{-1} F, where D are the diagonal elements of K. The matrix S is fully constructed (via sparse matrix-matrix multiplication). The preconditioner needs to be initialized with two external solvers/preconditioners - one for the matrix K and one for the matrix S.


Template Parameters • OperatorType: - can be LocalMatrix • VectorType: - can be LocalVector • ValueType: - can be float, double, std::complex<float> or std::complex<double>

void rocalution::DiagJacobiSaddlePointPrecond::Set(Solver &K_Solver, Solver &S_Solver) Initialize solver for 퐾 and 푆.

2.9.8 Tensile

2.9.8.1 Introduction

Tensile is a tool for creating a benchmark-driven backend library for GEMMs, GEMM-like problems (such as batched GEMM), N-dimensional tensor contractions, and anything else that multiplies two multi-dimensional objects together on an AMD GPU.

Overview for creating a custom TensileLib backend library for your application:

1. Install the PyYAML and cmake dependencies (mandatory), git clone and cd Tensile.
2. Create a benchmark config.yaml file in ./Tensile/Configs/.
3. Run the benchmark. After the benchmark finishes, Tensile will dump 4 directories: 1 & 2 are about benchmarking, while 3 & 4 are the results summarized from the viewpoint of your library (such as rocBLAS).
   1_BenchmarkProblems: has all the problem descriptions and executables generated during benchmarking, where you can re-launch the executables to reproduce results.
   2_BenchmarkData: has the raw performance results.
   3_LibraryLogic: has the optimal kernel configuration yaml files and Winner*.csv. Usually rocBLAS takes the yaml files from this folder.
   4_LibraryClient: has a client executable, so you can launch it from a library viewpoint.
4. Add the Tensile library to your application's CMake target. The Tensile library will be written, compiled and linked to your application at application compile time. It consists of:
• GPU kernels, written in HIP, OpenCL, or AMD GCN assembly.
• Solution classes which enqueue the kernels.
• APIs which call the fastest solution for a problem.

2.9.8.1.1 Quick Example (Ubuntu):

sudo apt-get install python-yaml
mkdir Tensile
cd Tensile
git clone https://github.com/ROCmSoftwarePlatform/Tensile repo
cd repo
git checkout master
mkdir build
cd build
python ../Tensile/Tensile.py ../Tensile/Configs/test_sgemm.yaml ./

After about 10 minutes of benchmarking, Tensile will print out the path to the client you can run.

./4_LibraryClient/build/client -h
./4_LibraryClient/build/client --sizes 5760 5760 1 5760

2.9.8.2 Benchmark Config example

Tensile uses an incremental and “programmable” benchmarking protocol.

2.9.8.2.1 Example Benchmark config.yaml as input file to Tensile

GlobalParameters:
  PrintLevel: 1
  ForceRedoBenchmarkProblems: False
  ForceRedoLibraryLogic: True
  ForceRedoLibraryClient: True
  CMakeBuildType: Release
  EnqueuesPerSync: 1
  SyncsPerBenchmark: 1
  LibraryPrintDebug: False
  NumElementsToValidate: 128
  ValidationMaxToPrint: 16
  ValidationPrintValids: False
  ShortNames: False
  MergeFiles: True
  PlatformIdx: 0
  DeviceIdx: 0
  DataInitTypeAB: 0

BenchmarkProblems:
  - # sgemm NN
    - # ProblemType
      OperationType: GEMM
      DataType: s
      TransposeA: False
      TransposeB: False
      UseBeta: True
      Batched: True

    - # BenchmarkProblemSizeGroup
      InitialSolutionParameters:
      BenchmarkCommonParameters:
        - ProblemSizes:
          - Range: [ [5760], 0, [1], 0 ]
        - LoopDoWhile: [False]
        - NumLoadsCoalescedA: [-1]
        - NumLoadsCoalescedB: [1]
        - WorkGroupMapping: [1]
      ForkParameters:
        - ThreadTile:
          - [ 8, 8 ]
          - [ 4, 8 ]
          - [ 4, 4 ]
        - WorkGroup:
          - [ 8, 16, 1 ]
          - [ 16, 16, 1 ]
        - LoopTail: [False, True]
        - EdgeType: ["None", "Branch", "ShiftPtr"]
        - DepthU: [8, 16]
        - VectorWidth: [1, 2, 4]
      BenchmarkForkParameters:
      JoinParameters:
        - MacroTile
      BenchmarkJoinParameters:
      BenchmarkFinalParameters:
        - ProblemSizes:
          - Range: [ [5760], 0, [1], 0 ]

LibraryLogic:

LibraryClient:

2.9.8.2.2 Structure of config.yaml

Top level data structure whose keys are Parameters, BenchmarkProblems, LibraryLogic and LibraryClient. • Parameters contains a dictionary storing global parameters used for all parts of the benchmarking. • BenchmarkProblems contains a list of dictionaries representing the benchmarks to conduct; each element, i.e. dictionary, in the list is for benchmarking a single ProblemType. The keys for these dictionaries are Problem- Type, InitialSolutionParameters, BenchmarkCommonParameters, ForkParameters, BenchmarkForkPa- rameters, JoinParameters, BenchmarkJoinParameters and BenchmarkFinalParameters. See Benchmark Protocol for more information on these steps. • LibraryLogic contains a dictionary storing parameters for analyzing the benchmark data and designing how the backend library will select which Solution for certain ProblemSizes. • LibraryClient contains a dictionary storing parameters for actually creating the library and creating a client which calls into the library.

2.9.8.2.3 Global Parameters

• Name: Prefix to add to API function names; typically name of device. • MinimumRequiredVersion: Which version of Tensile is required to interpret this yaml file • RuntimeLanguage: Use HIP or OpenCL runtime. • KernelLanguage: For OpenCL runtime, kernel language must be set to OpenCL. For HIP runtime, kernel language can be set to HIP or assembly (gfx803, gfx900). • PrintLevel: 0=Tensile prints nothing, 1=prints some, 2=prints a lot. • ForceRedoBenchmarkProblems: False means don’t redo a benchmark phase if results for it already exist. • ForceRedoLibraryLogic: False means don’t re-generate library logic if it already exist.


• ForceRedoLibraryClient: False means don’t re-generate library client if it already exist. • CMakeBuildType: Release or Debug • EnqueuesPerSync: Num enqueues before syncing the queue. • SyncsPerBenchmark: Num queue syncs for each problem size. • LibraryPrintDebug: True means Tensile solutions will print kernel enqueue info to stdout • NumElementsToValidate: Number of elements to validate; 0 means no validation. • ValidationMaxToPrint: How many invalid results to print. • ValidationPrintValids: True means print validation comparisons that are valid, not just invalids. • ShortNames: Convert long kernel, solution and files names to short serial ids. • MergeFiles: False means write each solution and kernel to its own file. • PlatformIdx: OpenCL platform id. • DeviceIdx: OpenCL or HIP device id. • DataInitType[AB,C]: Initialize validation data with 0=0’s, 1=1’s, 2=serial, 3=random. • KernelTime: Use kernel time reported from runtime rather than api times from cpu clocks to compare kernel performance. The exhaustive list of global parameters and their defaults is stored in Common.py.

2.9.8.2.4 Problem Type Parameters

• OperationType: GEMM or TensorContraction. • DataType: s, d, c, z, h • UseBeta: False means library/solutions/kernel won’t accept a beta parameter; thus beta=0. • UseInitialStrides: False means data is contiguous in memory. • HighPrecisionAccumulate: For tmpC += a*b, use twice the precision for tmpC as for DataType. Not yet implemented. • ComplexConjugateA: True or False; ignored for real precision. • ComplexConjugateB: True or False; ignored for real precision. For OperationType=GEMM only: • TransposeA: True or False. • TransposeB: True or False. • Batched: True (False has been deprecated). For OperationType=TensorContraction only (showing batched gemm NT: C[ijk] = Sum[l] A[ilk] * B[jlk]) • IndexAssignmentsA: [0, 3, 2] • IndexAssignmentsB: [1, 3, 2] • NumDimensionsC: 3.

2.9.8.2.5 Solution / Kernel Parameters

See: Kernel Parameters.


2.9.8.2.6 Defaults

Because of the flexibility / complexity of the benchmarking process and, therefore, of the config.yaml files, Tensile has a default value for every parameter. If you neglect to put LoopUnroll anywhere in your benchmark, rather than crashing or complaining, Tensile will put the default LoopUnroll options into the default phase (common, fork, join...). This guarantees ease of use and, more importantly, backward compatibility; every time we add a new possible solution parameter, you don't necessarily need to update your configs; we'll have a default figured out for you. However, this may cause some confusion. If your config forks 2 parameters, but you see that 3 were forked during benchmarking, that's because you didn't specify the 3rd parameter anywhere, so Tensile stuck it in its default phase, which was forking (for example). Also, specifying ForkParameters: and leaving it empty isn't the same as leaving ForkParameters out of your config. If you leave ForkParameters out of your config, Tensile will add a ForkParameters step and put the default parameters into it (unless you put all the parameters elsewhere), but if you specify ForkParameters and leave it empty, then you won't fork anything. Therefore, it is safest to specify all parameters in your config.yaml files; that way you'll guarantee the behavior you want. See /Tensile/Common.py for the current list of parameters.

2.9.8.3 Benchmark Protocol

2.9.8.3.1 Old Benchmark Architecture was Intractable

The benchmarking strategy from version 1 was vanilla flavored brute force: (8 WorkGroups)* (12 ThreadTiles)* (4 NumLoadsCoalescedAs)* (4 NumLoadsCoalescedBs)* (3 LoopUnrolls)* (5 BranchTypes)* . . . *(1024 ProblemSizes)=23,592,960 is a multiplicative series which grows very quickly. Adding one more boolean parameter doubles the number of kernel enqueues of the benchmark.

2.9.8.3.2 Incremental Benchmark is Faster

Tensile version 2 allows the user to manually interrupt the multiplicative series with “additions” instead of “multiplies”, i.e., (8 WorkGroups)* (12 ThreadTiles)+ (4 NumLoadsCoalescedAs)* (4 NumLoadsCoalescedBs)* (3 LoopUnrolls)+ (5 BranchTypes)* . . . +(1024 ProblemSizes)=1,151 is a dramatically smaller number of enqueues. Now, adding one more boolean parameter may only add on 2 more enqueues.

2.9.8.3.3 Phases of Benchmark

To make Tensile's programmability more manageable for the user and developer, the benchmarking protocol has been split up into several steps encoded in a config.yaml file. The below sections reference the following config.yaml. Note that this config.yaml has been created to be a simple illustration and does not represent an actual good benchmark protocol. See the configs included in the repository (/Tensile/Configs) for examples of good benchmarking configs.

BenchmarkProblems:
  - # sgemm
    - # Problem Type
      OperationType: GEMM
      Batched: True
    - # Benchmark Size-Group
      InitialSolutionParameters:
        - WorkGroup: [ [ 16, 16, 1 ] ]
        - NumLoadsCoalescedA: [ 1 ]
        - NumLoadsCoalescedB: [ 1 ]
        - ThreadTile: [ [ 4, 4 ] ]

      BenchmarkCommonParameters:
        - ProblemSizes:
          - Range: [ [512], [512], [1], [512] ]
        - EdgeType: ["Branch", "ShiftPtr"]
          PrefetchGlobalRead: [False, True]

      ForkParameters:
        - WorkGroup: [ [8, 32, 1], [16, 16, 1], [32, 8, 1] ]
          ThreadTile: [ [2, 8], [4, 4], [8, 2] ]

      BenchmarkForkParameters:
        - ProblemSizes:
          - Exact: [ 2880, 2880, 1, 2880 ]
        - NumLoadsCoalescedA: [ 1, 2, 4, 8 ]
        - NumLoadsCoalescedB: [ 1, 2, 4, 8 ]

      JoinParameters:
        - MacroTile

      BenchmarkJoinParameters:
        - LoopUnroll: [8, 16]

      BenchmarkFinalParameters:
        - ProblemSizes:
          - Range: [ [16, 128], [16, 128], [1], [256] ]

2.9.8.3.4 Initial Solution Parameters

A solution comprises roughly 20 parameters, and all of them are needed to create a kernel. So, during the first benchmark step, which determines which WorkGroupShape is fastest, what are the other 19 solution parameters used to describe the kernels being benchmarked? That's what InitialSolutionParameters are for. The solutions used for benchmarking WorkGroupShape will use the parameters from InitialSolutionParameters. The user must choose good default solution parameters in order to correctly identify subsequent optimal parameters.

2.9.8.3.5 Problem Sizes

Each step of the benchmark can override which problem sizes will be benchmarked. A ProblemSizes entry of type Range is a list whose length is the number of indices in the ProblemType. A GEMM ProblemSizes must have 3 elements, while a batched-GEMM ProblemSizes must have 4 elements. So, for a ProblemType of C[ij] = Sum[k] A[ik]*B[jk], the ProblemSizes elements represent [SizeI, SizeJ, SizeK]. For each index, there are 5 ways of specifying the sizes of that index:

1. [1968]
   • Benchmark only size 1968; n = 1.
2. [16, 1920]
   • Benchmark sizes 16 to 1968 using the default step size (=16); n = 123.
3. [16, 32, 1968]
   • Benchmark sizes 16 to 1968 using a step size of 32; n = 61.
4. [64, 32, 16, 1968]
   • Benchmark sizes from 64 to 1968 with a step size of 32. Also, increase the step size by 16 each iteration.
   • This causes fewer sizes to be benchmarked when the sizes are large, and more benchmarks where the sizes are small; this is typically the desired behavior.
   • n = 16 (64, 96, 144, 208, 288, 384, 496, 624, 768, 928, 1104, 1296, 1504, 1728, 1968). The stride at the beginning is 32, but the stride at the end is 256.
5. 0
   • The size of this index is whatever size index 0 is. For a 3-dimensional ProblemType, this allows benchmarking only a 2-dimensional or 1-dimensional slice of problem sizes.

Here are a few examples of valid ProblemSizes for 3D GEMMs (a short sketch of how these entries expand follows the examples):

Range: [ [16, 128], [16, 128], [16, 128] ]            # n = 512
Range: [ [16, 128], 0, 0 ]                            # n = 8
Range: [ [16, 16, 16, 5760], 0, [1024, 1024, 4096] ]  # n = 108
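The following is a rough sketch of how a single index entry expands into concrete sizes (illustrative Python only, not Tensile's source; the 4-number form grows its stride every iteration):

def expand_index_range(spec, default_step=16):
    """Expand one index entry of a ProblemSizes Range into a list of sizes.

    Sketch of the rules described above (not Tensile code):
      [size]                      -> just that size
      [min, max]                  -> min..max with the default step
      [min, step, max]            -> min..max with a fixed step
      [min, step, step_incr, max] -> min..max, step grows by step_incr
    """
    if len(spec) == 1:
        return [spec[0]]
    if len(spec) == 2:
        lo, hi, step, incr = spec[0], spec[1], default_step, 0
    elif len(spec) == 3:
        lo, step, hi, incr = spec[0], spec[1], spec[2], 0
    else:
        lo, step, incr, hi = spec
    sizes = []
    current = lo
    while current <= hi:
        sizes.append(current)
        current += step
        step += incr
    return sizes

print(expand_index_range([1968]))              # [1968]
print(expand_index_range([16, 32, 1968])[:4])  # [16, 48, 80, 112] ...
print(expand_index_range([64, 32, 16, 1968]))  # 64, 96, 144, 208, ..., 1968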

2.9.8.3.6 Benchmark Common Parameters

During this first phase of benchmarking, we examine parameters which will be the same for all solutions for this ProblemType. During each step of benchmarking, there is only 1 winner. In the above example we are benchmarking the dictionary {EdgeType: [Branch, ShiftPtr], PrefetchGlobalRead: [False, True]}; therefore, this benchmark step generates 4 solution candidates, and the winner will be the fastest EdgeType/PrefetchGlobalRead combination. Assuming the winner is ET=SP and PGR=T, then all solutions for this ProblemType will have ET=SP and PGR=T. Also, once a parameter has been determined, all subsequent benchmarking steps will use this determined parameter rather than pulling values from InitialSolutionParameters. Because the common parameters will apply to all kernels, they are typically the parameters which are compiler-dependent or hardware-dependent rather than being tile-dependent.

2.9.8.3.7 Fork Parameters

If we continued to determine every parameter in the above manner, we'd end up with a single fastest solution for the specified ProblemSizes; we usually desire multiple different solutions, with varying parameters, which may be fastest for different groups of ProblemSizes. One simple example of this is that small tiles are fastest for small problem sizes and large tiles are fastest for large problem sizes. Therefore, we allow "forking" parameters; this means keeping multiple winners after each benchmark step. In the above example we fork {WorkGroup: [...], ThreadTile: [...]}. This means that in subsequent benchmarking steps, rather than having one winning parameter, we'll have one winning parameter per fork permutation; we'll have 9 winners.

2.9.8.3.8 Benchmark Fork Parameters

When we benchmark the fork parameters, we retain one winner per permutation. Therefore, we first determine the fastest NumLoadsCoalescedA for each of the WG,TT permutations, then we determine the fastest NumLoadsCoalescedB for each permutation.


2.9.8.3.9 Join Parameters

After determining the fastest parameters for all the forked solution permutations, we have the option of reducing the number of winning solutions. When a parameter is listed in the JoinParameters section, that means that, of the kept winning solutions, each will have a different value for that parameter. Listing more parameters to join results in more winners being kept, while having a JoinParameters section with no parameters listed results in only 1 fastest solution. In our example we join over the MacroTile (work-group x thread-tile). After forking tiles, there were 9 solutions that we kept. After joining MacroTile, we'll only keep five: 16x256, 32x128, 64x64, 128x32 and 256x16. The solutions that are kept are chosen based on their performance during the last BenchmarkForkParameters benchmark or, if there weren't any, JoinParameters will conduct a benchmark of all solution candidates and then choose the fastest.
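A quick way to see where the five surviving MacroTiles come from is to enumerate the 3x3 fork from the example config above (plain arithmetic, not Tensile code; the LocalSplitU component of WorkGroup is ignored here):

from itertools import product

work_groups  = [(8, 32), (16, 16), (32, 8)]   # WorkGroup [dim0, dim1] from ForkParameters
thread_tiles = [(2, 8), (4, 4), (8, 2)]       # ThreadTile [dim0, dim1]

# MacroTile = WorkGroup * ThreadTile, per dimension.
macro_tiles = {(wg[0] * tt[0], wg[1] * tt[1])
               for wg, tt in product(work_groups, thread_tiles)}

print(sorted(macro_tiles))
# [(16, 256), (32, 128), (64, 64), (128, 32), (256, 16)]
# 9 forked winners collapse to 5 distinct MacroTiles after joining.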

2.9.8.3.10 Benchmark Join Parameters

After narrowing the list of fastest solutions through joining, you can continue to benchmark parameters, keeping one winning parameter per solution permutation.

2.9.8.3.11 Benchmark Final Parameters

After all the parameter benchmarking has been completed and the final list of fastest solutions has been assembled, we can benchmark all the solutions over a large set of ProblemSizes. This benchmark represents the final output of benchmarking; it produces a .csv file where the rows are all the problem sizes and the columns are all the solutions. This is the information which gets analysed to produce the library logic.

2.9.8.4 Contributing

We'd love your help, but...

1. Never check in a tab character; use 4 spaces.
2. Follow the coding style of the file you're editing.
3. Make pull requests against the develop branch.
4. Rebase your develop branch against the ROCmSoftwarePlatform::Tensile::develop branch right before pull-requesting.
5. In your pull request, state what you tested (which OS, what drivers, what devices, which config.yaml's) so we can ensure that your changes haven't broken anything.

2.9.8.5 Dependencies

2.9.8.5.1 CMake

• CMake 2.8

2.9.8.5.2 Python

(One time only)
• Ubuntu: sudo apt install python2.7 python-yaml
• CentOS: sudo yum install python PyYAML


• Fedora: sudo dnf install python PyYAML

2.9.8.5.3 Compilers

• For Tensile_BACKEND = OpenCL1.2 (untested)
  – Visual Studio 14 (2015). (VS 2012 may also be supported; C++11 should no longer be required by Tensile. Need to verify.)
  – GCC 4.8 and above
• For Tensile_BACKEND = HIP
  – Public ROCm

2.9.8.6 Installation

Tensile can be installed via:

1. Download the repo and don't install; install the PyYAML dependency manually and call the python scripts manually:

git clone https://github.com/ROCmSoftwarePlatform/Tensile.git
python Tensile/Tensile/Tensile.py your_custom_config.yaml your_benchmark_path

2. Install the develop branch directly from the repo using pip:

pip install git+https://github.com/ROCmSoftwarePlatform/Tensile.git@develop
tensile your_custom_config.yaml your_benchmark_path

3. Download the repo and install manually (deprecated):

git clone https://github.com/ROCmSoftwarePlatform/Tensile.git
cd Tensile
sudo python setup.py install
tensile your_custom_config.yaml your_benchmark_path

2.9.8.7 Kernel Parameters

2.9.8.7.1 Solution / Kernel Parameters

• LoopDoWhile: True=DoWhile loop, False=While or For loop
• LoopTail: Additional loop with LoopUnroll=1.
• EdgeType: Branch, ShiftPtr or None
• WorkGroup: [dim0, dim1, LocalSplitU]
• ThreadTile: [dim0, dim1]
• GlobalSplitU: Split up summation among work-groups to create more concurrency. This option launches a kernel to handle the beta scaling, then a second kernel where the writes to global memory are atomic.
• PrefetchGlobalRead: True means the outer loop should prefetch global data one iteration ahead.
• PrefetchLocalRead: True means the inner loop should prefetch lds data one iteration ahead.
• WorkGroupMapping: In what order work-groups will compute C; affects caching.
• LoopUnroll: How many iterations to unroll the inner loop; helps loading coalesced memory.
• MacroTile: Derived from WorkGroup*ThreadTile.
• DepthU: Derived from LoopUnroll*SplitU.
• NumLoadsCoalescedA,B: Number of loads from A in the coalesced dimension.
• GlobalReadCoalesceGroupA,B: True means adjacent threads map to adjacent global read elements (but, if transposing data, the write to lds is scattered).
• GlobalReadCoalesceVectorA,B: True means vector components map to adjacent global read elements (but, if transposing data, the write to lds is scattered).
• VectorWidth: Thread tile elements are contiguous for faster memory accesses. For example, VW=4 means a thread will read a float4 from memory rather than 4 non-contiguous floats.
• KernelLanguage: Whether kernels should be written in source code (HIP, OpenCL) or assembly (gfx803, gfx900, ...).

The exhaustive list of solution parameters and their defaults is stored in Common.py.

2.9.8.7.2 Kernel Parameters Affect Performance

The kernel parameters affect many aspects of performance. Changing a parameter may help address one performance bottleneck but worsen another. That is why searching through the parameter space is vital to discovering the fastest kernel for a given problem.


2.9.8.7.3 How N-Dimensional Tensor Contractions Are Mapped to Finite-Dimensional GPU Kernels

For a traditional GEMM, the 2-dimensional output, C[i,j], is mapped to launching a 2-dimensional grid of work groups, each of which has a 2-dimensional grid of work items; one dimension belongs to i and one dimension belongs to j. The 1-dimensional summation is represented by a single loop within the kernel body.

2.9.8.7.4 Special Dimensions: D0, D1 and DU

To handle arbitrary dimensionality, Tensile begins by determining 3 special dimensions: D0, D1 and DU. D0 and D1 are the free indices of A and B (one belongs to A and one to B) which have the shortest strides. This allows the inner-most loops to read from A and B the fastest via coalescing. In a traditional GEMM, every matrix has a dimension with a shortest stride of 1, but Tensile doesn't make that assumption. Of these two dimensions, D0 is the dimension which has the shortest tensor C stride, which allows for fast writing. DU represents the summation index with the shortest combined stride (stride in A + stride in B); it becomes the innermost loop, which gets "U"nrolled. This assignment is also meant to ensure fast reading in the innermost summation loop. There can be multiple summation indices (i.e. embedded loops), and DU will be iterated over in the innermost loop.
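The selection rule can be sketched roughly as follows (an illustrative sketch only, with made-up index names and strides; this is not Tensile's implementation):

def pick_special_dims(free_a, free_b, summation, strides_a, strides_b, strides_c):
    """Illustrative sketch of the D0/D1/DU selection rule described above.

    free_a / free_b: free index names appearing in A / B respectively.
    summation:       summation index names (appear in both A and B).
    strides_*:       dict mapping index name -> stride in that tensor.
    """
    # Candidate D0/D1: the free index of A and of B with the shortest stride.
    fast_a = min(free_a, key=lambda i: strides_a[i])
    fast_b = min(free_b, key=lambda i: strides_b[i])
    # D0 is whichever of the two has the shorter stride in C (fast writes).
    d0, d1 = sorted((fast_a, fast_b), key=lambda i: strides_c[i])
    # DU: the summation index with the shortest combined stride in A and B.
    du = min(summation, key=lambda i: strides_a[i] + strides_b[i])
    return d0, d1, du

# Plain NT GEMM, C[i,j] = Sum[l] A[i,l] * B[j,l], with illustrative strides:
print(pick_special_dims(
    free_a={"i"}, free_b={"j"}, summation={"l"},
    strides_a={"i": 1, "l": 1024},
    strides_b={"j": 1, "l": 1024},
    strides_c={"i": 1, "j": 1024}))   # ('i', 'j', 'l')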

2.9.8.7.5 GPU Kernel Dimension

OpenCL allows for a 3-dimensional grid of work-groups, and each work-group can be a 3-dimensional grid of work-items. Tensile assigns D0 to be dimension 0 of the work-group and work-item grids; it assigns D1 to be dimension 1 of the work-group and work-item grids. All other free or batch dimensions are flattened down into the final dimension 2 of the work-group and work-item grids. Within the GPU kernel, dimension 2 is reconstituted back into whatever dimensions it represents.

2.9.8.8 Languages

2.9.8.8.1 Tensile Benchmarking is Python

The benchmarking module, Tensile.py, is written in Python. The Python scripts write the solutions, kernels, CMake files and all other C/C++ files used for benchmarking.

2.9.8.8.2 Tensile Library

The Tensile API, Tensile.h, is confined to C89 so that it will be usable by most software. The code behind the API is allowed to be C++11.

2.9.8.8.3 Device Languages

The device languages Tensile supports for the GPU kernels are:
• OpenCL 1.2
• HIP
• Assembly
  – gfx803
  – gfx900

2.9.8.9 Library Logic

Running the LibraryLogic phase of benchmarking analyses the benchmark data and encodes a mapping for each problem type. For each problem type, it maps problem sizes to the best solution (i.e. kernel). When you build Tensile.lib, you point the TensileCreateLibrary function to a directory where your library logic yaml files are.

2.9.8.10 Problem Nomenclature

2.9.8.10.1 Example Problems

• Standard GEMM has 4 variants (2 free indices (i, j) and 1 summation index l):
  1. NN (N: nontranspose): C[i,j] = Sum[l] A[i,l] * B[l,j]
  2. NT (T: transpose): C[i,j] = Sum[l] A[i,l] * B[j,l]
  3. TN: C[i,j] = Sum[l] A[l,i] * B[l,j]
  4. TT: C[i,j] = Sum[l] A[l,i] * B[j,l]
• C[i,j,k] = Sum[l] A[i,l,k] * B[l,j,k] (batched-GEMM; 2 free indices, 1 batch index k and 1 summation index l)
• C[i,j] = Sum[k,l] A[i,k,l] * B[j,l,k] (2D summation)
• C[i,j,k,l,m] = Sum[n] A[i,k,m,l,n] * B[j,k,l,n,m] (GEMM with 3 batched indices)
• C[i,j,k,l,m] = Sum[n,o] A[i,k,m,o,n] * B[j,m,l,n,o] (4 free indices, 2 summation indices and 1 batched index)
• C[i,j,k,l] = Sum[m,n] A[i,j,m,n,l] * B[m,n,k,j,l] (batched image convolution mapped to a 7D tensor contraction)
• and even crazier

2.9.8.10.2 Nomenclature

The indices describe the dimensionality of the problem being solved. A GEMM operation takes 2 2-dimensional matrices as input (totaling 4 input dimensions) and contracts them along one dimension (which cancels out 2 of the dimensions), resulting in a 2-dimensional result. Whenever an index shows up in multiple tensors, those tensors must be the same size along that dimension, but they may have different strides. There are 3 categories of indices/dimensions that Tensile deals with: free, batch and bound.

Free Indices

Free indices are the indices of tensor C which come in pairs; one of the pair shows up in tensor A while the other shows up in tensor B. In the really crazy example above, i/j/k/l are the 4 free indices of tensor C. Indices i and k come from tensor A and indices j and l come from tensor B.

Batch Indices


Batch indices are the indices of tensor C which show up in both tensor A and tensor B. For example, the difference between the GEMM example and the batched-GEMM example above is the additional index. In the batched-GEMM example, the index k is the batch index, which batches together multiple independent GEMMs.

Bound/Summation Indices

The final type of indices are called bound indices or summation indices. These indices do not show up in tensor C; they show up in the summation symbol (Sum[k]) and in tensors A and B. It is along these indices that we perform the inner products (pairwise multiply then sum).
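To make the three categories concrete, the batched-GEMM example C[i,j,k] = Sum[l] A[i,l,k] * B[j,l,k] can be written with numpy's einsum (illustration only; numpy is assumed to be available):

import numpy as np

I, J, K, L = 4, 5, 3, 6          # i, j: free; k: batch; l: bound/summation
A = np.random.rand(I, L, K)      # A[i, l, k]
B = np.random.rand(J, L, K)      # B[j, l, k]

# Free indices i, j survive into C; the batch index k appears everywhere;
# the bound index l appears only on the right-hand side and is summed away.
C = np.einsum("ilk,jlk->ijk", A, B)
print(C.shape)                   # (4, 5, 3)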

2.9.8.10.3 Limitations

Problems supported by Tensile must meet the following condition: there must be at least one pair of free indices.

2.9.8.11 Tensile.lib

After running the benchmark and generating library config files, you're ready to add Tensile.lib to your project. Tensile provides a TensileCreateLibrary function, which can be called as follows:

set( Tensile_BACKEND "HIP")
set( Tensile_LOGIC_PATH "~/LibraryLogic" CACHE STRING "Path to Tensile logic.yaml files")
option( Tensile_MERGE_FILES "Tensile to merge kernels and solutions files?" OFF)
option( Tensile_SHORT_NAMES "Tensile to use short file/function names? Use if compiler complains they're too long." OFF)
option( Tensile_PRINT_DEBUG "Tensile to print runtime debug info?" OFF)

find_package(Tensile) # use if Tensile has been installed

TensileCreateLibrary(
  ${Tensile_LOGIC_PATH}
  ${Tensile_BACKEND}
  ${Tensile_MERGE_FILES}
  ${Tensile_SHORT_NAMES}
  ${Tensile_PRINT_DEBUG}
  Tensile_ROOT ${Tensile_ROOT} # optional; use if Tensile has not been installed
  )
target_link_libraries( TARGET Tensile )

TODO: Where is the Tensile include directory?

2.9.8.12 Versioning

Tensile follows semantic versioning practices, i.e. Major.Minor.Patch, in BenchmarkConfig.yaml files, LibraryConfig.yaml files and in cmake find_package. Tensile is compatible with a "MinimumRequiredVersion" if Tensile.Major == MRV.Major and Tensile.Minor.Patch >= MRV.Minor.Patch.
• Major: Tensile increments the major version if the public API changes, or if either the benchmark.yaml or library-config.yaml files change format in a non-backwards-compatible manner.
• Minor: Tensile increments the minor version when new kernel, solution or benchmarking features are introduced in a backwards-compatible manner.
• Patch: Bug fixes or minor improvements.
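A minimal sketch of the compatibility rule above (illustrative only, not Tensile's code):

def tensile_satisfies(tensile_version, minimum_required):
    """Return True if tensile_version is compatible with minimum_required.

    Rule from the text: major versions must match exactly, and the
    (minor, patch) pair must be >= the required (minor, patch).
    """
    t_major, t_minor, t_patch = (int(x) for x in tensile_version.split("."))
    m_major, m_minor, m_patch = (int(x) for x in minimum_required.split("."))
    return t_major == m_major and (t_minor, t_patch) >= (m_minor, m_patch)

print(tensile_satisfies("4.3.0", "4.2.1"))   # True
print(tensile_satisfies("5.0.0", "4.2.1"))   # False (major version changed)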


2.9.9 rocThrust

HIP back-end for Thrust (alpha release)

2.9.9.1 Introduction

Thrust is a parallel algorithm library. This library has been ported to the HIP/ROCm platform, where it uses the rocPRIM library. The HIP-ported library works on HIP/ROCm platforms. Currently there is no CUDA backend in place.

2.9.9.2 Requirements

Software:
• Git
• CMake (3.5.1 or later)
• AMD ROCm platform (1.8.0 or later)
  – Including the HCC compiler, which must be set as the C++ compiler on the ROCm platform.
• rocPRIM library
  – It will be automatically downloaded and built by the CMake script.

Optional:
• GTest
  – Required only for tests. Building tests is enabled by default.
  – It will be automatically downloaded and built by the CMake script.

2.9.9.3 Hardware

Visit the following link for ROCm hardware requirements:

2.9.9.4 Build And Install

For build and install:

git clone https://github.com/ROCmSoftwarePlatform/rocThrust

# Go to the rocThrust directory, create and go to the build directory.
cd rocThrust; mkdir build; cd build

# Configure rocThrust, setup options for your system.
# Build options:
#   BUILD_TEST - ON by default
#
# ! IMPORTANT !
# On the ROCm platform set the C++ compiler to HCC. You can do it by adding
# 'CXX=<path-to-hcc>' before 'cmake' or by setting the cmake option
# 'CMAKE_CXX_COMPILER' to the path to the HCC compiler.
#
[CXX=hcc] cmake ../. # or cmake-gui ../.

# Build
make -j4

# Optionally, run tests if they're enabled.
ctest --output-on-failure

# Package
make package

# Install
[sudo] make install

2.9.9.5 Using rocThrust In A Project

The recommended way of including rocThrust in a CMake project is by using its package configuration files.

# On ROCm rocThrust requires rocPRIM
find_package(rocprim REQUIRED CONFIG PATHS "/opt/rocm/rocprim")

# "/opt/rocm" - default install prefix
find_package(rocthrust REQUIRED CONFIG PATHS "/opt/rocm/rocthrust")

...

# includes rocThrust headers and roc::rocprim_hip target
target_link_libraries(<your_target> rocthrust)

2.9.9.6 Running Unit Tests

# Go to the rocThrust build directory
cd rocThrust; cd build

# To run all tests
ctest

# To run unit tests for rocThrust
./test/<unit-test-name>

2.9.9.7 Documentation

Documentation is available here.

2.9.9.8 Support

Bugs and feature requests can be reported through the issue tracker.

2.9.10 ROCm SMI library

The ROCm System Management Interface Library, or ROCm SMI library, is part of the Radeon Open Compute ROCm software stack. It is a C library for Linux that provides a user space interface for applications to monitor and control GPU applications.

2.9.10.1 Important note about Versioning and Backward Compatibility

The ROCm SMI library is currently under development, and therefore subject to change either at the ABI or API level. The intention is to keep the API as stable as possible even while in development, but in some cases we may need to break backwards compatibility in order to ensure future stability and usability. Following Semantic Versioning rules, while the ROCm SMI library is in a high state of change, the major version will remain 0, and backward compatibility is not ensured. Once new development has leveled off, the major version will become greater than 0, and backward compatibility will be enforced between major versions.

2.9.10.2 Building ROCm SMI

2.9.10.2.1 Additional Required software for building

In order to build the ROCm SMI library, the following components are required. Note that the software versions listed are what was used in development; earlier versions are not guaranteed to work:
• CMake (v3.5.0)
• g++ (5.4.0)
In order to build the latest documentation, the following are required:
• Doxygen (1.8.11)
• latex (pdfTeX 3.14159265-2.6-1.40.16)
The source code for ROCm SMI is available on Github. After the ROCm SMI library git repository has been cloned to a local Linux machine, building the library is achieved by following the typical CMake build sequence. Specifically,

$ mkdir -p build
$ cd build
$ cmake <path to the ROCm SMI library source root>
$ make

The built library will appear in the build folder.

2.9.10.3 Building the Documentation

The documentation PDF file can be built with the following steps (continued from the steps above):

$ make doc
$ cd latex
$ make

The reference manual, refman.pdf will be in the latex directory upon a successful build.

2.9.10.4 Building the Tests

In order to verify the build and capability of ROCm SMI on your system and to see an example of how ROCm SMI can be used, you may build and run the tests that are available in the repo. To build the tests, follow these steps:


# Set environment variables used in CMakeLists.txt file
$ ROCM_DIR=<path to the ROCm installation>
$ mkdir <location for the test build>
$ cd <location for the test build>
$ cmake -DROCM_DIR=<path to the ROCm installation> <ROCm SMI source root>/tests/rocm_smi_test
$ make

To run the test, execute the program rsmitst that is built from the steps above. Make sure ROCm SMI library is in your library search path when executing the test program.

2.9.10.5 Usage Basics

2.9.10.5.1 Device Indices

Many of the functions in the library take a “device index”. The device index is a number greater than or equal to 0, and less than the number of devices detected, as determined by rsmi_num_monitor_devices(). The index is used to distinguish the detected devices from one another. It is important to note that a device may end up with a different index after a reboot, so an index should not be relied upon to be constant over reboots.

2.9.10.5.2 Hello ROCm SMI

The only required ROCm-SMI call for any program that wants to use ROCm-SMI is the rsmi_init() call. This call initializes some internal data structures that will be used by subsequent ROCm-SMI calls. When ROCm-SMI is no longer being used, rsmi_shut_down() should be called. This provides a way to do any releasing of resources that ROCm-SMI may have held. In many cases, this may have no effect, but may be necessary in future versions of the library. A simple “Hello World” type program that displays the device ID of detected devices would look like this:

#include <stdint.h>
#include "rocm_smi/rocm_smi.h"

int main() {
  rsmi_status_t ret;
  uint32_t num_devices;
  uint64_t dev_id;

  // We will skip return code checks for this example, but it
  // is recommended to always check this as some calls may not
  // apply for some devices or ROCm releases

  ret = rsmi_init(0);
  ret = rsmi_num_monitor_devices(&num_devices);

  for (int i = 0; i < num_devices; ++i) {
    ret = rsmi_dev_id_get(i, &dev_id);
    // dev_id holds the device ID of device i, upon a
    // successful call
  }
  ret = rsmi_shut_down();
  return 0;
}


2.9.11 RCCL

ROCm Communication Collectives Library

2.9.11.1 Introduction

RCCL (pronounced “Rickle”) is a stand-alone library of standard collective communication routines for GPUs, implementing all-reduce, all-gather, reduce, broadcast, and reduce-scatter. It has been optimized to achieve high bandwidth on platforms using PCIe, xGMI as well as networking using InfiniBand Verbs or TCP/IP sockets. RCCL supports an arbitrary number of GPUs installed in a single node, and can be used in either single- or multi-process (e.g., MPI) applications. Multi node support is planned for a future release. The collective operations are implemented using ring algorithms and have been optimized for throughput and latency. For best performance, small operations can be either batched into larger operations or aggregated through the API.

2.9.11.2 Requirements

• ROCm supported GPUs
• ROCm stack installed on the system (HIP runtime & HCC)
• For building and running the unit tests, chrpath will need to be installed on your machine first. (sudo apt-get install chrpath)

2.9.11.3 Quickstart RCCL Build

RCCL directly depends on the HIP runtime & HCC C++ compiler, which are part of the ROCm software stack. In addition, HC Direct Function call support needs to be present on your machine. There are binaries for hcc and HIP that need to be installed to get HC Direct Function call support. These binaries are currently packaged with roc-master, and will be included in ROCm 2.4.

The root of this repository has a helper script 'install.sh' to build and install RCCL on Ubuntu with a single command. It does not take a lot of options and hard-codes configuration that can be specified through invoking cmake directly, but it's a great way to get started quickly and can serve as an example of how to build/install.

• ./install.sh – builds the library, including unit tests
• ./install.sh -i – builds and installs the library to /opt/rocm/rccl; the installation path can be changed with the --prefix argument (see below)
• ./install.sh -h – shows help
• ./install.sh -t – builds the library, including unit tests
• ./install.sh -r – runs unit tests (must be already built)
• ./install.sh -p – builds the RCCL package
• ./install.sh --prefix <path> – specify a custom path to install RCCL to (default: /opt/rocm)

2.9.11.4 Manual build

2.9.11.4.1 To build the library :


$ git clone https://github.com/ROCmSoftwarePlatform/rccl.git
$ cd rccl
$ mkdir build
$ cd build
$ CXX=/opt/rocm/bin/hcc cmake -DCMAKE_INSTALL_PREFIX=$PWD/rccl-install ..
$ make -j 8

You may substitute a path of your own choosing for CMAKE_INSTALL_PREFIX. Note: ensure rocm-cmake is installed, apt install rocm-cmake.

2.9.11.5 To build the RCCL package and install package :

Assuming you have already cloned this repository and built the library as shown in the previous section:

$ cd rccl/build
$ make package
$ sudo dpkg -i *.deb

RCCL package install requires sudo/root access because it creates a directory called “rccl” under /opt/rocm/. This is an optional step and RCCL can be used directly by including the path containing librccl.so.

2.9.11.6 Tests

There are unit tests implemented with the Googletest framework in RCCL, which are currently a work-in-progress. To invoke the unit tests, go to the rccl-install folder, then the test/ subfolder, and execute the appropriate unit test executable(s). Several notes for running the unit tests:
• The LD_LIBRARY_PATH environment variable will need to be set to include /path/to/rccl-install/lib/ in order to run the unit tests.
• The HSA_FORCE_FINE_GRAIN_PCIE environment variable will need to be set to 1 in order to run the unit tests.
An example call to the unit tests:

$ LD_LIBRARY_PATH=rccl-install/lib/ HSA_FORCE_FINE_GRAIN_PCIE=1 rccl-install/test/UnitTests

There are also other performance and error-checking tests for RCCL. These are maintained separately here. See the rccl-tests README for more information on how to build and run those tests.

2.9.11.7 Library and API Documentation

Please refer to the Library documentation for current documentation.

2.9.11.8 Copyright

All source code and accompanying documentation is copyright (c) 2015-2018, NVIDIA CORPORATION. All rights reserved. All modifications are copyright (c) 2019 Advanced Micro Devices, Inc. All rights reserved.


2.9.12 hipCUB

hipCUB is a thin wrapper library on top of rocPRIM or CUB. It enables developers to port projects that use the CUB library to the HIP layer and to run them on AMD hardware. In a ROCm environment hipCUB uses the rocPRIM library as the backend; on CUDA platforms it uses CUB instead.

2.9.12.1 Requirements

• Git
• CMake (3.5.1 or later)
• For AMD GPUs:
  – AMD ROCm platform (1.8.0 or later)
    * Including the HCC compiler, which must be set as the C++ compiler on the ROCm platform.
  – rocPRIM library
    * It will be automatically downloaded and built by the CMake script.
• For NVIDIA GPUs:
  – CUDA Toolkit
  – CUB library (automatically downloaded by the CMake script)

Optional:
• GTest
  – Required only for tests. Building tests is enabled by default.
  – It will be automatically downloaded and built by the CMake script.

2.9.12.2 Build And Install

git clone https://github.com/ROCmSoftwarePlatform/hipCUB.git

# Go to the hipCUB directory, create and go to the build directory.
cd hipCUB; mkdir build; cd build

# Configure hipCUB, setup options for your system.
# Build options:
#   BUILD_TEST - ON by default
#
# ! IMPORTANT !
# On the ROCm platform set the C++ compiler to HCC. You can do it by adding
# 'CXX=<path-to-hcc>' before 'cmake' or by setting the cmake option
# 'CMAKE_CXX_COMPILER' to the path to the HCC compiler.
#
[CXX=hcc] cmake ../. # or cmake-gui ../.

# Build
make -j4

# Optionally, run tests if they're enabled.
ctest --output-on-failure

# Package
make package

# Install
[sudo] make install

2.9.12.3 Using hipCUB In A Project

The recommended way of including hipCUB in a CMake project is by using its package configuration files.

# On ROCm hipCUB requires rocPRIM
find_package(rocprim REQUIRED CONFIG PATHS "/opt/rocm/rocprim")

# "/opt/rocm" - default install prefix
find_package(hipcub REQUIRED CONFIG PATHS "/opt/rocm/hipcub")

...

# On ROCm: includes hipCUB headers and roc::rocprim_hip target
# On CUDA: includes only hipCUB headers, user has to include CUB directory
target_link_libraries(<your_target> hip::hipcub)

Include only the main header file:

#include <hipcub/hipcub.hpp>

CUB or rocPRIM headers are included by hipCUB depending on the current HIP platform.

2.9.12.4 Running Unit Tests

# Go to the hipCUB build directory
cd hipCUB; cd build

# To run all tests
ctest

# To run unit tests for hipCUB
./test/hipcub/<unit-test-name>

2.9.12.5 Documentation

# Go to the hipCUB doc directory
cd hipCUB; cd doc

# Run doxygen
doxygen Doxyfile

# Open html/index.html


2.9.12.6 Support

Bugs and feature requests can be reported through the issue tracker.

2.9.12.7 Contributions and License

Contributions of any kind are most welcome! More details are found at CONTRIBUTING and LICENSE.

2.9.13 Deprecated Libraries

2.9.13.1 hCRNG

hcRNG has been deprecated and has been replaced by rocRAND. The hcRNG library is an implementation of uniform random number generators targeting AMD heterogeneous hardware via the HCC compiler runtime. The computational resources of the underlying AMD heterogeneous compute are exposed and exploited through the HCC C++ frontend. Refer here for more details on the HCC compiler. For more information, please refer HCRG

2.9.13.2 hipeigen

Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms. For more information, please refer HIPE

2.9.13.3 clFFT

clFFT is a software library containing FFT functions written in OpenCL. In addition to GPU devices, the library also supports running on CPU devices to facilitate debugging and heterogeneous programming. For more information, please refer CLFF

2.9.13.4 clBLAS

This repository houses the code for the OpenCL™ BLAS portion of clMath. The complete set of BLAS level 1, 2 & 3 routines is implemented. Please see Netlib BLAS for the list of supported routines. In addition to GPU devices, the library also supports running on CPU devices to facilitate debugging and multicore programming. APPML 1.12 is the most current generally available pre-packaged binary version of the library available for download for both Linux and Windows platforms.

The primary goal of clBLAS is to make it easier for developers to utilize the inherent performance and power efficiency benefits of heterogeneous computing. clBLAS interfaces do not hide nor wrap OpenCL interfaces, but rather leave OpenCL state management to the control of the user to allow for maximum performance and flexibility. The clBLAS library does generate and enqueue optimized OpenCL kernels, relieving the user from the task of writing, optimizing and maintaining kernel code themselves. For more information, please refer CLB


2.9.13.5 clSPARSE

clSPARSE is an OpenCL™ library implementing sparse linear algebra routines. This project is a result of a collaboration between AMD Inc. and Vratis Ltd. For more information, please refer CLS

2.9.13.6 clRNG

A library for uniform random number generation in OpenCL. Streams of random numbers act as virtual random number generators. They can be created on the host computer in unlimited numbers, and then used either on the host or on computing devices by work items to generate random numbers. Each stream also has equally-spaced substreams, which are occasionally useful. The API is currently implemented for four different RNGs, namely the MRG31k3p, MRG32k3a, LFSR113 and Philox-4×32-10 generators. For more information, please refer CLR

2.9.13.7 hcFFT

hcFFT has been deprecated and has been replaced by rocFFT For more information, please refer HCF

2.10 ROCm Compiler SDK

2.10.1 GCN Native ISA LLVM Code Generator

• ROCm-Native-ISA

2.10.2 ROCm Code Object Format

• ROCm-Codeobj-format

2.10.3 ROCm Device Library

2.10.3.1 OVERVIEW

This repository contains the following libraries:

Name     Comments                                         Dependencies
irif     Interface to LLVM IR
ocml     Open Compute Math library (ocml)                 irif
oclc     Open Compute library controls (documentation)
ockl     Open Compute Kernel library                      irif
opencl   OpenCL built-in library                          ocml, ockl
hc       Heterogeneous Compute built-in library           ocml, ockl

All libraries are compiled to LLVM Bitcode which can be linked. Note that libraries use specific AMDGPU intrinsics.


2.10.3.2 BUILDING

To build it, use the RadeonOpenCompute LLVM/LLD/Clang. The default branch on these repositories is "amd-common", which may contain AMD-specific code not yet upstreamed.

git clone git@github.com:RadeonOpenCompute/llvm.git llvm_amd-common
cd llvm_amd-common/tools
git clone git@github.com:RadeonOpenCompute/lld.git lld
git clone git@github.com:RadeonOpenCompute/clang.git clang
cd ..
mkdir -p build
cd build
cmake \
  -DCMAKE_BUILD_TYPE=Release \
  -DCMAKE_INSTALL_PREFIX=/opt/rocm/llvm \
  -DLLVM_TARGETS_TO_BUILD="AMDGPU;X86" \
  ..

Testing also requires the amdhsacod utility from the ROCm Runtime.

Use an out-of-source CMake build and create a separate directory to run CMake. The following build steps are performed:

mkdir -p build
cd build
export LLVM_BUILD=... (path to LLVM build)
CC=$LLVM_BUILD/bin/clang cmake -DLLVM_DIR=$LLVM_BUILD -DAMDHSACOD=$HSA_DIR/bin/x86_64/amdhsacod ..
make

It is also possible to use a compiler that only has the AMDGPU target enabled if you build prepare-builtins separately with the host compiler and pass an explicit target option to CMake:

export LLVM_BUILD=... (path to LLVM build)
# Build prepare-builtins
cd utils
mkdir build
cd build
cmake -DLLVM_DIR=$LLVM_BUILD ..
make
# Build bitcode libraries
cd ../..
mkdir build
cd build
CC=$LLVM_BUILD/bin/clang cmake -DLLVM_DIR=$LLVM_BUILD -DAMDHSACOD=$HSA_DIR/bin/x86_64/amdhsacod \
  -DCMAKE_C_FLAGS="-target amdgcn--amdhsa" -DCMAKE_CXX_FLAGS="-target amdgcn--amdhsa" \
  -DPREPARE_BUILTINS=`cd ../utils/build/prepare-builtins/; pwd`/prepare-builtins ..

To install artifacts: make install
To run offline tests: make test
To create packages for the library: make package


2.10.3.3 USING BITCODE LIBRARIES

The bitcode libraries should be linked to user bitcode (obtained from source) before final code generation with llvm-link or the -mlink-bitcode-file option of clang. For OpenCL, the list of bitcode libraries includes opencl, its dependencies (ocml, ockl, irif) and the oclc control libraries selected according to the OpenCL compilation mode. Assuming that the build of this repository was done in /srv/git/ROCm-Device-Libs/build, the following command line shows how to compile a simple OpenCL source test.cl into a code object test.so:

clang -x cl -Xclang -finclude-default-header \
  -target amdgcn--amdhsa -mcpu=fiji \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/opencl/opencl.amdgcn.bc \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/ocml/ocml.amdgcn.bc \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/ockl/ockl.amdgcn.bc \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/oclc/oclc_correctly_rounded_sqrt_off.amdgcn.bc \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/oclc/oclc_daz_opt_off.amdgcn.bc \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/oclc/oclc_finite_only_off.amdgcn.bc \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/oclc/oclc_isa_version_803.amdgcn.bc \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/oclc/oclc_unsafe_math_off.amdgcn.bc \
  -Xclang -mlink-bitcode-file -Xclang /srv/git/ROCm-Device-Libs/build/irif/irif.amdgcn.bc \
  test.cl -o test.so

2.10.3.4 TESTING

Currently all tests are offline:
• OpenCL source is compiled to LLVM bitcode
• Test bitcode is linked to library bitcode with llvm-link
• The Clang OpenCL compiler is run on the resulting bitcode, producing a code object.
• The resulting code object is passed to llvm-objdump and amdhsacod -test.
The output of the tests (which includes AMDGPU disassembly) can be displayed by running ctest -VV in the build directory. Tests for OpenCL conformance kernels can be enabled by specifying -DOCL_CONFORMANCE_HOME=<path> to CMake, for example:

cmake ... -DOCL_CONFORMANCE_HOME=/srv/hsa/drivers/opencl/tests/extra/hsa/ocl/conformance/1.2

2.10.4 ROCr Runtime

For the GitHub repository of the ROCr Runtime, check Here


2.10.4.1 HSA Runtime API and runtime for ROCm

This repository includes the user-mode API interfaces and libraries necessary for host applications to launch compute kernels to available HSA ROCm kernel agents. Reference source code for the core runtime is also available.

Initial target platform requirements:
• CPU: Intel Haswell or newer; Core i5, Core i7, Xeon E3 v4 & v5; Xeon E5 v3
• GPU: Fiji ASIC (AMD R9 Nano, R9 Fury and R9 Fury X)
• GPU: Polaris ASIC (AMD RX480)

2.10.4.2 Source code

The HSA core runtime source code for the ROCR runtime is located in the src subdirectory. Please consult the associated README.md file for contents and build instructions.

2.10.4.3 Binaries for Ubuntu & Fedora and installation instructions

Pre-built binaries are available for installation from the ROCm package repository. For ROCR, they include:

Core runtime package:
• HSA include files to support application development on the HSA runtime for the ROCR runtime
• A 64-bit version of AMD's HSA core runtime for the ROCR runtime

Runtime extension package:
• A 64-bit version of AMD's runtime tools library
• A 64-bit version of AMD's runtime image library, which supports the HSAIL image implementation only.

The contents of these packages are installed in /opt/rocm/hsa and /opt/rocm by default. The core runtime package depends on the hsakmt-roct-dev package. Installation instructions can be found in the ROCm Documentation.

2.10.4.4 Infrastructure

The HSA runtime is a thin, user-mode API that exposes the necessary interfaces to access and interact with graphics hardware driven by the AMDGPU driver set and the ROCK kernel driver. Together they enable programmers to directly harness the power of AMD discrete graphics devices by allowing host applications to launch compute kernels directly to the graphics hardware. The capabilities expressed by the HSA Runtime API are:
• Error handling
• Runtime initialization and shutdown
• System and agent information
• Signals and synchronization
• Architected dispatch
• Memory management
The HSA runtime fits into a typical software architecture stack.


The HSA runtime provides direct access to the graphics hardware to give the programmer more control of the execution. An example of low-level hardware access is the support of one or more user mode queues, which provides programmers with a low-latency kernel dispatch interface, allowing them to develop customized dispatch algorithms specific to their application.

The HSA Architected Queuing Language is an open standard, defined by the HSA Foundation, specifying the packet syntax used to control supported AMD/ATI Radeon (c) graphics devices. The AQL language supports several packet types, including packets that can command the hardware to automatically resolve inter-packet dependencies (barrier AND & barrier OR packets), kernel dispatch packets and agent dispatch packets.

In addition to user mode queues and AQL, the HSA runtime exposes various virtual address ranges that can be accessed by one or more of the system's graphics devices, and possibly the host. The exposed virtual address ranges support either fine-grained or coarse-grained access. Updates to memory in a fine-grained region are immediately visible to all devices that can access it, but only one device can have access to a coarse-grained allocation at a time. Ownership of a coarse-grained region can be changed using the HSA runtime memory APIs, but this transfer of ownership must be explicitly done by the host application.

Programmers should consult the HSA Runtime Programmer's Reference Manual for a full description of the HSA Runtime APIs, AQL and the HSA memory policy.

2.10.4.5 Sample

The simplest way to check if the kernel, runtime and base development environment are installed correctly is to run a simple sample. A modified version of the vector_copy sample was taken from the HSA-Runtime-AMD repository and added to the ROCR repository to facilitate this. Build the sample and run it, using this series of commands:

cd ROCR-Runtime/sample && make && ./vector_copy

If the sample runs without generating errors, the installation is complete.

2.10.4.6 Known issues

• Each HSA process creates an internal DMA queue, but there is a system-wide limit of four DMA queues. When the limit is reached, HSA processes will use internal kernels for copies.

Disclaimer

The information contained herein is for informational purposes only, and is subject to change without notice. While every precaution has been taken in the preparation of this document, it may contain technical inaccuracies, omissions and typographical errors, and AMD is under no obligation to update or otherwise correct this information. Advanced Micro Devices, Inc. makes no representations or warranties with respect to the accuracy or completeness of the contents of this document, and assumes no liability of any kind, including the implied warranties of noninfringement, merchantability or fitness for particular purposes, with respect to the operation or use of AMD hardware, software or other products described herein. No license, including implied or arising by estoppel, to any intellectual property rights is granted by this document. Terms and limitations applicable to the purchase or use of AMD's products are as set forth in a signed agreement between the parties or in AMD's Standard Terms and Conditions of Sale. AMD, the AMD Arrow logo, and combinations thereof are trademarks of Advanced Micro Devices, Inc. Other product names used in this publication are for identification purposes only and may be trademarks of their respective companies. Copyright (c) 2014-2017 Advanced Micro Devices, Inc. All rights reserved.


2.11 ROCm System Management

2.11.1 ROCm-SMI

ROCm System Management Interface

This repository includes the rocm-smi tool. This tool exposes functionality for clock and temperature management of your ROCm-enabled system.

Installation

You may find rocm-smi at the following location after installing the rocm package:

/opt/rocm/bin/rocm-smi

Alternatively, you may clone this repository and run the tool directly.

Version

The SMI will report a "version", which is the version of the kernel installed:

AMD ROCm System Management Interface v$(uname)

For ROCk installations, this will be the AMDGPU module version (e.g. 5.0.71). For non-ROCk or monolithic ROCk installations, this will be the kernel version, which will be equivalent to the following bash command:

$(uname -a) | cut -d ' ' -f 3)

Usage

For detailed and up-to-date usage information, we recommend consulting the help:

/opt/rocm/bin/rocm-smi -h

For convenience purposes, following is a quick excerpt:

usage: rocm-smi [-h] [-d DEVICE [DEVICE ...]] [-i] [-v] [--showhw] [-t] [-c] [-g] [-f] [-p] [-P] [-o] [-m] [-M] [-l] [-s] [-u] [--showmemuse] [-b] [--showreplaycount] [-S] [--showvoltage] [--showrasinfo BLOCK [BLOCK ...]] [--showfwinfo [BLOCK [BLOCK ...]]] [--showproductname] [-a] [--showmeminfo TYPE [TYPE ...]] [--showdriverversion] [--showuniqueid] [--showserial] [--showpids] [--showxgmierr] [--alldevices] [-r] [--setsclk LEVEL [LEVEL ...]] [--setmclk LEVEL [LEVEL ...]] [--setpcie LEVEL [LEVEL ...]] [--setslevel SCLKLEVEL SCLK SVOLT] [--setmlevel MCLKLEVEL MCLK MVOLT] [--resetfans] [--setfan LEVEL] [--setperflevel LEVEL] [--setoverdrive %] [--setmemoverdrive %] [--setpoweroverdrive WATTS] [--resetpoweroverdrive] [--setprofile SETPROFILE] [--resetprofile] [--rasenable BLOCK ERRTYPE] [--rasdisable BLOCK ERRTYPE] [--rasinject BLOCK] [--gpureset] [--resetxgmierr] [--load FILE | --save FILE] [--autorespond RESPONSE] [--loglevel ILEVEL] [--json]

AMD ROCm System Management Interface

optional arguments:

-h, --help                show this help message and exit
--load FILE               Load Clock, Fan, Performance and Profile settings
--save FILE               Save Clock, Fan, Performance and Profile settings
-d DEVICE [DEVICE ...], --device DEVICE [DEVICE ...]
                          Execute command on specified device
-i, --showid              Show GPU ID


-v, --showvbios           Show VBIOS version
--showhw                  Show Hardware details
-t, --showtemp            Show current temperature
-c, --showclocks          Show current clock frequencies
-g, --showgpuclocks       Show current GPU clock frequencies
-f, --showfan             Show current fan speed
-p, --showperflevel       Show current PowerPlay Performance Level
-P, --showpower           Show current GPU ASIC power consumption
-o, --showoverdrive       Show current OverDrive level
-m, --showmemoverdrive    Show current GPU Memory Clock OverDrive level
-M, --showmaxpower        Show maximum graphics package power this GPU will consume
-l, --showprofile         Show Compute Profile attributes
-s, --showclkfrq          Show supported GPU and Memory Clock
-u, --showuse             Show current GPU use
--showmemuse              Show current GPU memory used
-b, --showbw              Show estimated PCIe use
--showreplaycount         Show PCIe Replay Count
-S, --showclkvolt         Show supported GPU and Memory Clocks and Voltages
--showvoltage             Show current GPU voltage
--showrasinfo BLOCK [BLOCK ...]
                          Show RAS enablement information and error counts for the specified block(s)
--showfwinfo [BLOCK [BLOCK ...]]
                          Show FW information
--showproductname         Show SKU/Vendor name
-a, --showallinfo         Show all SMI-supported values
--showmeminfo TYPE [TYPE ...]
                          Show Memory usage information for given block(s) TYPE
--showdriverversion       Show kernel driver version
--showuniqueid            Show GPU's Unique ID
--showserial              Show GPU's Serial Number
--showpids                Show current running KFD PIDs
--showxgmierr             Show XGMI error information since last read
--alldevices              Execute command on non-AMD devices as well as AMD devices
-r, --resetclocks         Reset clocks and OverDrive to default
--setsclk LEVEL [LEVEL ...]
                          Set GPU Clock Frequency Level(s) (requires manual Perf level)
--setmclk LEVEL [LEVEL ...]
                          Set GPU Memory Clock Frequency Level(s) (requires manual Perf level)
--setpcie LEVEL [LEVEL ...]
                          Set PCIE Clock Frequency Level(s) (requires manual Perf level)
--setslevel SCLKLEVEL SCLK SVOLT
                          Change GPU Clock frequency (MHz) and Voltage (mV) for a specific Level
--setmlevel MCLKLEVEL MCLK MVOLT
                          Change GPU Memory clock frequency (MHz) and Voltage (mV) for a specific Level
--resetfans               Reset fans to automatic (driver) control
--setfan LEVEL            Set GPU Fan Speed Level
--setperflevel LEVEL      Set PowerPlay Performance Level
--setoverdrive %          Set GPU OverDrive level
--setmemoverdrive %       Set GPU Memory Overclock OverDrive level (requires manual|high Perf level)
--setpoweroverdrive WATTS Set the maximum GPU power using Power OverDrive in Watts
--resetpoweroverdrive     Set the maximum GPU power back to the device default state
--setprofile SETPROFILE   Specify Power Profile level (#) or a quoted string of CUSTOM Profile attributes "# # # #..." (requires manual Perf level)
--resetprofile            Reset Power Profile back to default
--rasenable BLOCK ERRTYPE Enable RAS for specified block and error type
--rasdisable BLOCK ERRTYPE
                          Disable RAS for specified block and error type
--rasinject BLOCK         Inject RAS poison for specified block (ONLY WORKS ON UNSECURE BOARDS)
--gpureset                Reset specified GPU (One GPU must be specified)
--autorespond RESPONSE    Response to automatically provide for all prompts (NOT RECOMMENDED)


--loglevel ILEVEL         How much output will be printed for what the program is doing; one of debug/info/warning/error/critical
--json                    Print output in JSON format

Detailed Option Descriptions

--setsclk/--setmclk # [# # ...]: This allows you to set a mask for the levels. For example, if a GPU has 8 clock levels, you can set a mask to use levels 0, 5, 6 and 7 with --setsclk 0 5 6 7. This will only use the base level and the top 3 clock levels. This will allow you to keep the GPU at the base level when there is no GPU load, and at the top 3 levels when the GPU load increases.

--setfan LEVEL: This sets the fan speed to a value ranging from 0 to 255 (not from 0-100%). If the level ends with a %, the fan speed is calculated as pct*maxlevel/100 (maxlevel is usually 255, but is determined by the ASIC).

NOTE:

While the hardware is usually capable of overriding this value when required, it is recommended to not set the fan level lower than the default value for extended periods of time

--setperflevel LEVEL: This lets you use the pre-defined Performance Level values, which can include:
  auto (Automatically change PowerPlay values based on GPU workload)
  low (Keep PowerPlay values low, regardless of workload)
  high (Keep PowerPlay values high, regardless of workload)
  manual (Only use values defined in sysfs values)

--setoverdrive/--setmemoverdrive #: DEPRECATED IN NEWER KERNEL VERSIONS (use --setslevel/--setmlevel instead). This sets the percentage above maximum for the max Performance Level. For example, --setoverdrive 20 will increase the top sclk level by 20%. If the maximum sclk level is 1000MHz, then --setoverdrive 20 will increase the maximum sclk to 1200MHz.

--setpoweroverdrive/--resetpoweroverdrive #: This allows users to change the maximum power available to a GPU package. The input value is in Watts. This limit is enforced by the hardware, and some cards allow users to set it to a higher value than the default that ships with the GPU. This Power OverDrive mode allows the GPU to run at higher frequencies for longer periods of time, though this may mean the GPU uses more power than it is allowed to use per power supply specifications. Each GPU has a model-specific maximum Power OverDrive that it will take; attempting to set a higher limit than that will cause this command to fail.

--setprofile SETPROFILE: The Compute Profile accepts 1 or n parameters, either the Profile to select (see --showprofile for a list of preset Power Profiles) or a quoted string of values for the CUSTOM profile. NOTE: These values can vary based on the ASIC, and may include:
  SCLK_PROFILE_ENABLE - Whether or not to apply the 3 following SCLK settings (0=disable, 1=enable). NOTE: This is a hidden field. If set to 0, the following 3 values are displayed as '-'
  SCLK_UP_HYST - Delay before sclk is increased (in milliseconds)
  SCLK_DOWN_HYST - Delay before sclk is decreased (in milliseconds)
  SCLK_ACTIVE_LEVEL - Workload required before sclk levels change (in %)
  MCLK_PROFILE_ENABLE - Whether or not to apply the 3 following MCLK settings (0=disable, 1=enable). NOTE: This is a hidden field. If set to 0, the following 3 values are displayed as '-'
  MCLK_UP_HYST - Delay before mclk is increased (in milliseconds)
  MCLK_DOWN_HYST - Delay before mclk is decreased (in milliseconds)
  MCLK_ACTIVE_LEVEL - Workload required before mclk levels change (in %)
  BUSY_SET_POINT - Threshold for raw activity level before levels change
  FPS - Frames Per Second
  USE_RLC_BUSY - When set to 1, DPM is switched up as long as the RLC busy message is received
  MIN_ACTIVE_LEVEL - Workload required before levels change (in %)

Note: When a compute queue is detected, these values will be automatically applied to the system. Compute Power Profiles are only applied when the Performance Level is set to "auto". The CUSTOM Power Profile is only applied when the Performance Level is set to "manual", so using this flag will automatically set the performance level to "manual". It is not possible to modify the non-CUSTOM Profiles; these are hard-coded by the kernel.


-P, --showpower: Show Average Graphics Package power consumption. "Graphics Package" refers to the GPU plus any HBM (High-Bandwidth Memory) modules, if present.

-M, --showmaxpower: Show the maximum Graphics Package power that the GPU will attempt to consume. This limit is enforced by the hardware.

--loglevel: This will allow the user to set a logging level for the SMI's actions. Currently this is only implemented for sysfs writes, but can easily be expanded upon in the future to log other things from the SMI.

--showmeminfo: This allows the user to see the amount of used and total memory for a given block (vram, vis_vram, gtt). It returns the number of bytes used and the total number of bytes for each block. 'all' can be passed as a field to return all blocks; otherwise a quoted string is used for multiple values (e.g. "vram vis_vram"). vram refers to the Video RAM, or graphics memory, on the specified device. vis_vram refers to Visible VRAM, which is the CPU-accessible video memory on the device. gtt refers to the Graphics Translation Table.

-b, --showbw: This shows an approximation of the number of bytes received and sent by the GPU over the last second through the PCIe bus. Note that this will not work for APUs since data for the GPU portion of the APU goes through the memory fabric and does not 'enter/exit' the chip via the PCIe interface, thus no accesses are generated, and the performance counters can't count accesses that are not generated. NOTE: It is not possible to easily grab the size of every packet that is transmitted in real time, so the kernel estimates the bandwidth by taking the maximum payload size (mps), which is the max size that a PCIe packet can be, and multiplying it by the number of packets received and sent. This means that the SMI will report the maximum estimated bandwidth; the actual usage could be (and likely will be) less.

--showrasinfo: This shows the RAS information for a given block. This includes enablement of the block (currently GFX, SDMA and UMC are the only supported blocks) and the number of errors: ue - Uncorrectable errors; ce - Correctable errors.

Clock Type Descriptions

DCEFCLK - DCE (Display)
FCLK - Data fabric (VG20 and later) - Data flow from XGMI, Memory, PCIe
SCLK - GFXCLK (Graphics core)

Note: SOCCLK split from SCLK as of Vega10. Pre-Vega10 they were both controlled by SCLK

MCLK - GPU Memory (VRAM)
PCLK - PCIe bus

Note: This gives 2 speeds, PCIe Gen1 x1 and the highest available based on the hardware

SOCCLK - System clock (VG10 and later) - Data Fabric (DF), MM HUB, AT HUB, SYSTEM HUB, OSS, DFD. Note: DF split from SOCCLK as of Vega20; pre-Vega20 they were both controlled by SOCCLK.

--gpureset: This flag will attempt to reset the GPU for a specified device. This will invoke the GPU reset through the kernel debugfs file amdgpu_gpu_recover. Note that a GPU reset will not always work, depending on the manner in which the GPU is hung.

--showdriverversion: This flag will print out the AMDGPU module version for amdgpu-pro or ROCK kernels. For other kernels, it will simply print out the name of the kernel (uname).

--showserial: This flag will print out the serial number for the graphics card. NOTE: This is currently only supported on Vega20 server cards that support it. Consumer cards and cards older than Vega20 will not support this feature.

--showproductname: This uses the pci.ids file to print out more information regarding the GPUs on the system. 'update-pciids' may need to be executed on the machine to get the latest PCI ID snapshot, as certain newer GPUs will not be present in the stock pci.ids file, and the file may even be absent on certain OS installation types.

OverDrive settings
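For example (a sketch only; the /opt/rocm/bin install path is an assumption and output formats vary by release), the monitoring flags above can be combined as follows:

/opt/rocm/bin/rocm-smi --showmeminfo "vram vis_vram"   # used/total bytes per memory block
/opt/rocm/bin/rocm-smi -b                              # approximate PCIe traffic over the last second
/opt/rocm/bin/rocm-smi --showrasinfo gfx               # RAS enablement and error counts (block name spelling may vary)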


For OverDrive functionality, the OverDrive bit (bit 14) must be enabled (by default, the OverDrive bit is disabled on the ROCK and upstream kernels). This can be done by setting amdgpu.ppfeaturemask accordingly in the kernel parameters, or by changing the default value inside amdgpu_drv.c (if building your own kernel). As an example, if the ppfeaturemask is set to 0xffffbfff (11111111111111111011111111111111), then enabling the OverDrive bit would make it 0xffffffff (11111111111111111111111111111111).

Testing changes

After making changes to the SMI, run the test script to ensure that all functionality remains intact before uploading the patch. This can be done using:
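A quick way to sanity-check the mask (a sketch; the sysfs parameter path is standard for the amdgpu module, but the example mask value shown is only illustrative) is:

cat /sys/module/amdgpu/parameters/ppfeaturemask   # current PowerPlay feature mask
# Example kernel parameter enabling the OverDrive bit (set in /etc/default/grub, then run update-grub):
#   amdgpu.ppfeaturemask=0xffffffff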

./test-rocm-smi.sh /opt/rocm/bin/rocm-smi

The test can run all flags for the SMI, or specific flags can be tested with the -s option. Any new functionality added to the SMI should have a corresponding test added to the test script.

Disclaimer

The information contained herein is for informational purposes only, and is subject to change without notice. While every precaution has been taken in the preparation of this document, it may contain technical inaccuracies, omissions and typographical errors, and AMD is under no obligation to update or otherwise correct this information. Advanced Micro Devices, Inc. makes no representations or warranties with respect to the accuracy or completeness of the contents of this document, and assumes no liability of any kind, including the implied warranties of noninfringement, merchantability or fitness for particular purposes, with respect to the operation or use of AMD hardware, software or other products described herein. No license, including implied or arising by estoppel, to any intellectual property rights is granted by this document. Terms and limitations applicable to the purchase or use of AMD's products are as set forth in a signed agreement between the parties or in AMD's Standard Terms and Conditions of Sale. AMD, the AMD Arrow logo, and combinations thereof are trademarks of Advanced Micro Devices, Inc. Other product names used in this publication are for identification purposes only and may be trademarks of their respective companies. Copyright (c) 2014-2017 Advanced Micro Devices, Inc. All rights reserved.

2.11.1.1 Programming ROCm-SMI

2.11.2 SYSFS Interface

2.11.2.1 Naming and data format standards for sysfs files

The libsensors library offers an interface to the raw sensors data through the sysfs interface. Since lm-sensors 3.0.0, libsensors is completely chip-independent. It assumes that all the kernel drivers implement the standard sysfs interface described in this document. This makes adding or updating support for any given chip very easy, as libsensors, and applications using it, do not need to be modified. Note that motherboards vary widely in their connections to sensor chips. There is no standard that ensures how a sensor is connected to the CPU; because external resistor values and conversion calculations vary, these cannot be hard-coded into the driver and must be handled in user space. Each chip gets its own directory in the sysfs /sys/devices tree. To find all sensor chips, it is easier to follow the device symlinks from /sys/class/hwmon/hwmon*. This document briefly describes the standards that the drivers follow, so that an application program can scan for entries and access this data in a simple and consistent way. That said, such programs will have to implement conversion, labeling and hiding of inputs. Up to lm-sensors 3.0.0, libsensors looks for hardware monitoring attributes in the "physical" device directory. Since lm-sensors 3.0.1, attributes found in the hwmon "class" device directory are also supported. Complex drivers (e.g. drivers for multifunction chips) may want to use this possibility to avoid namespace pollution. The only drawback will be that older versions of libsensors won't support the driver in question.

All sysfs values are fixed point numbers. There is only one value per file, unlike the older /proc specification. The common scheme for naming the files is: <type><number>_<item>. Usual types for sensor chips are "in" (voltage), "temp" (temperature) and "fan" (fan). Usual items are "input" (measured value), "max" (high threshold) and "min" (low threshold). Numbering usually starts from 1, except for voltages which start from 0 (because most data sheets use this). A number is always used for elements that can be present more than once, even if there is a single element of the given type on the specific chip. Other files do not refer to a specific element, so they have a simple name and no number.

Alarms are direct indications read from the chips. The drivers do NOT make comparisons of readings to thresholds. This allows violations between readings to be caught and alarmed. The exact definition of an alarm (for example, whether a threshold must be met or must be exceeded to cause an alarm) is chip-dependent.

When setting values of hwmon sysfs attributes, the string representation of the desired value must be written. Note that strings which are not a number are interpreted as 0! For more on how written strings are interpreted see the "sysfs attribute writes interpretation" section at the end of this file.

[0-*]  denotes any positive number starting from 0
[1-*]  denotes any positive number starting from 1
RO     read only value
WO     write only value
RW     read/write value

Read/write values may be read-only for some chips, depending on the hardware implementation. All entries (except name) are optional, and should only be created in a given driver if the chip has the feature.
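A minimal shell sketch of the conventions above (it assumes nothing beyond the standard /sys/class/hwmon layout; not every attribute exists on every chip):

for hw in /sys/class/hwmon/hwmon*; do
    echo "$(cat "$hw/name"):"                      # 'name' is the only mandatory attribute
    for f in "$hw"/in*_input "$hw"/temp*_input "$hw"/fan*_input; do
        [ -e "$f" ] && printf '  %s = %s\n' "$(basename "$f")" "$(cat "$f")"
    done
done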

2.11.2.1.1 Global attributes

name The chip name. This should be a short, lowercase string, not containing whitespace, dashes, or the wildcard character '*'. This attribute represents the chip name. It is the only mandatory attribute. I2C devices get this attribute created automatically. RO

update_interval The interval at which the chip will update readings. Unit: millisecond RW Some devices have a variable update rate or interval. This attribute can be used to change it to the desired value.



2.11.2.1.2 Voltages

in[0-*]_min Voltage min value. Unit: millivolt RW

in[0-*]_lcrit Voltage critical min value. Unit: millivolt RW If voltage drops to or below this limit, the system may take drastic action such as power down or reset. At the very least, it should report a fault.

in[0-*]_max Voltage max value. Unit: millivolt RW

in[0-*]_crit Voltage critical max value. Unit: millivolt RW If voltage reaches or exceeds this limit, the system may take drastic action such as power down or reset. At the very least, it should report a fault.

in[0-*]_input Voltage input value. Unit: millivolt RO Voltage measured on the chip pin. Actual voltage depends on the scaling resistors on the motherboard, as recommended in the chip datasheet. This varies by chip and by motherboard. Because of this variation, values are generally NOT scaled by the chip driver, and must be done by the application. However, some drivers (notably lm87 and via686a) do scale, because of internal resistors built into a chip. These drivers will output the actual voltage. Rule of thumb: drivers should report the voltage values at the "pins" of the chip.

in[0-*]_average Average voltage Unit: millivolt RO

in[0-*]_lowest Historical minimum voltage Unit: millivolt RO

in[0-*]_highest Historical maximum voltage Unit: millivolt RO

in[0-*]_reset_history Reset inX_lowest and inX_highest WO

in_reset_history Reset inX_lowest and inX_highest for all sensors WO

in[0-*]_label Suggested voltage channel label. Text string Should only be created if the driver has hints about what this voltage channel is being used for, and user-space doesn’t. In all other cases, the label is provided by user-space. RO

cpu[0-*]_vid CPU core reference voltage. Unit: millivolt RO Not always correct.

vrm Voltage Regulator Module version number. RW (but changing it should no more be necessary) Originally the VRM standard version multiplied by 10, but now an arbitrary number, as not all standards have a version number. Affects the way the driver calculates the CPU core reference voltage from the vid pins.

Also see the Alarms section for status flags associated with voltages.



2.11.2.1.3 Fans

fan[1-*]_min Fan minimum value Unit: revolution/min (RPM) RW

fan[1-*]_max Fan maximum value Unit: revolution/min (RPM) Only rarely supported by the hardware. RW

fan[1-*]_input Fan input value. Unit: revolution/min (RPM) RO

fan[1-*]_div Fan divisor. Integer value in powers of two (1, 2, 4, 8, 16, 32, 64, 128). RW Some chips only support values 1, 2, 4 and 8. Note that this is actually an internal clock divisor, which affects the measurable speed range, not the read value.

fan[1-*]_pulses Number of tachometer pulses per fan revolution. Integer value, typically between 1 and 4. RW This value is a characteristic of the fan connected to the device's input, so it has to be set in accordance with the fan model. Should only be created if the chip has a register to configure the number of pulses. In the absence of such a register (and thus attribute) the value assumed by all devices is 2 pulses per fan revolution.

fan[1-*]_target Desired fan speed Unit: revolution/min (RPM) RW Only makes sense if the chip supports closed-loop fan speed control based on the measured fan speed.

fan[1-*]_label Suggested fan channel label. Text string Should only be created if the driver has hints about what this fan channel is being used for, and user-space doesn't. In all other cases, the label is provided by user-space. RO

Also see the Alarms section for status flags associated with fans.



2.11.2.1.4 PWM

pwm[1-*] Pulse width modulation fan control. Integer value in the range 0 to 255 RW 255 is max or 100%.

pwm[1-*]_enable Fan speed control method: 0: no fan speed control (i.e. fan at full speed) 1: manual fan speed control enabled (using pwm[1-*]) 2+: automatic fan speed control enabled Check individual chip documentation files for automatic mode details. RW

pwm[1-*]_mode 0: DC mode (direct current) 1: PWM mode (pulse-width modulation) RW

pwm[1-*]_freq Base PWM frequency in Hz. Only possibly available when pwmN_mode is PWM, but not always present even then. RW

pwm[1-*]_auto_channels_temp Select which temperature channels affect this PWM output in auto mode. Bitfield, 1 is temp1, 2 is temp2, 4 is temp3, etc. Which values are possible depends on the chip used. RW

pwm[1-*]_auto_point[1-*]_pwm
pwm[1-*]_auto_point[1-*]_temp
pwm[1-*]_auto_point[1-*]_temp_hyst
Define the PWM vs temperature curve. The number of trip points is chip-dependent. Use these for chips which associate trip points with PWM output channels. RW

temp[1-*]_auto_point[1-*]_pwm
temp[1-*]_auto_point[1-*]_temp
temp[1-*]_auto_point[1-*]_temp_hyst
Define the PWM vs temperature curve. The number of trip points is chip-dependent. Use these for chips which associate trip points with temperature channels. RW

There is a third case where trip points are associated to both PWM output channels and temperature channels: the PWM values are associated to PWM output channels while the temperature values are associated to temperature channels. In that case, the result is determined by the mapping between temperature inputs and PWM outputs. When several temperature inputs are mapped to a given PWM output, this leads to several candidate PWM values.The actual result is up to the chip, but in general the highest candidate value (fastest fan speed) wins.
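Putting the attributes above together, manual fan control typically looks like the following sketch (hwmon0 and pwm1 are placeholders, and whether manual mode is supported at all is chip-dependent):

HWMON=/sys/class/hwmon/hwmon0              # placeholder; pick the chip by its 'name' attribute
echo 1   | sudo tee "$HWMON/pwm1_enable"   # 1 = manual fan speed control
echo 128 | sudo tee "$HWMON/pwm1"          # ~50% duty cycle (range 0-255)
echo 2   | sudo tee "$HWMON/pwm1_enable"   # hand control back to the automatic controller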



2.11.2.1.5 Temperatures

temp[1-*]_type Sensor type selection. Integers 1 to 6 RW 1: CPU embedded diode 2: 3904 transistor 3: thermal diode 4: thermistor 5: AMD AMDSI 6: Intel PECI Not all types are supported by all chips

temp[1-*]_max Temperature max value. Unit: millidegree Celsius (or millivolt, see below) RW

temp[1-*]_min Temperature min value. Unit: millidegree Celsius RW

temp[1-*]_max_hyst Temperature hysteresis value for max limit. Unit: millidegree Celsius Must be reported as an absolute temperature, NOT a delta from the max value. RW

temp[1-*]_min_hyst Temperature hysteresis value for min limit. Unit: millidegree Celsius Must be reported as an absolute temperature, NOT a delta from the min value. RW

temp[1-*]_input Temperature input value. Unit: millidegree Celsius RO

temp[1-*]_crit Temperature critical max value, typically greater than corresponding temp_max values. Unit: millidegree Celsius RW

temp[1-*]_crit_hyst Temperature hysteresis value for critical limit. Unit: millidegree Celsius Must be reported as an absolute temperature, NOT a delta from the critical value. RW

temp[1-*]_emergency Temperature emergency max value, for chips supporting more than two upper temperature limits. Must be equal or greater than corresponding temp_crit values. Unit: millidegree Celsius RW

temp[1-*]_emergency_hyst Temperature hysteresis value for emergency limit. Unit: millidegree Celsius Must be reported as an absolute temperature, NOT a delta from the emergency value. RW

temp[1-*]_lcrit Temperature critical min value, typically lower than corresponding temp_min values. Unit: millidegree Celsius RW

temp[1-*]_lcrit_hyst Temperature hysteresis value for critical min limit. Unit: millidegree Celsius Must be reported as an absolute temperature, NOT a delta from the critical min value. RW

temp[1-*]_offset Temperature offset which is added to the temperature reading by the chip. Unit: millidegree Celsius Read/Write value.

temp[1-*]_label Suggested temperature channel label. Text string Should only be created if the driver has hints about what this temperature channel is being used for, and user-space doesn’t. In all other cases, the label is provided by user-space. RO

temp[1-*]_lowest Historical minimum temperature Unit: millidegree Celsius RO

temp[1-*]_highest Historical maximum temperature Unit: millidegree Celsius RO

temp[1-*]_reset_history Reset temp_lowest and temp_highest WO

temp_reset_history Reset temp_lowest and temp_highest for all sensors WO

Some chips measure temperature using external thermistors and an ADC, and report the temperature measurement as a voltage. Converting this voltage back to a temperature (or the other way around for limits) requires mathematical functions not available in the kernel, so the conversion must occur in user space. For these chips, all temp* files described above should contain values expressed in millivolt instead of millidegree Celsius. In other words, such temperature channels are handled as voltage channels by the driver. Also see the Alarms section for status flags associated with temperatures.



2.11.2.1.6 Currents

curr[1-*]_max Current max value Unit: milliampere RW

curr[1-*]_min Current min value. Unit: milliampere RW

curr[1-*]_lcrit Current critical low value Unit: milliampere RW

curr[1-*]_crit Current critical high value. Unit: milliampere RW

curr[1-*]_input Current input value Unit: milliampere RO

curr[1-*]_average Average current use Unit: milliampere RO

curr[1-*]_lowest Historical minimum current Unit: milliampere RO

curr[1-*]_highest Historical maximum current Unit: milliampere RO

curr[1-*]_reset_history Reset currX_lowest and currX_highest WO

curr_reset_history Reset currX_lowest and currX_highest for all sensors WO

Also see the Alarms section for status flags associated with currents.



2.11.2.1.7 Power

power[1-*]_average Average power use Unit: microWatt RO

power[1-*]_average_interval Power use averaging interval. A poll notification is sent to this file if the hardware changes the averaging interval. Unit: milliseconds RW

power[1-*]_average_interval_max Maximum power use averaging interval Unit: milliseconds RO

power[1-*]_average_interval_min Minimum power use averaging interval Unit: milliseconds RO

power[1-*]_average_highest Historical average maximum power use Unit: microWatt RO

power[1-*]_average_lowest Historical average minimum power use Unit: microWatt RO

power[1-*]_average_max A poll notification is sent to power[1-*]_average when power use rises above this value. Unit: microWatt RW

power[1-*]_average_min A poll notification is sent to power[1-*]_average when power use sinks below this value. Unit: microWatt RW

power[1-*]_input Instantaneous power use Unit: microWatt RO

power[1-*]_input_highest Historical maximum power use Unit: microWatt RO

power[1-*]_input_lowest Historical minimum power use Unit: microWatt RO

power[1-*]_reset_history Reset input_highest, input_lowest, average_highest and average_lowest. WO

power[1-*]_accuracy Accuracy of the power meter. Unit: Percent RO

power[1-*]_cap If power use rises above this limit, the system should take action to reduce power use.A poll notification is sent to this file if the cap is changed by the hardware.The *_cap files only appear if the cap is known to be enforced by hardware. Unit: microWatt RW
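For instance (a sketch only; hwmon0 is a placeholder and power1_cap is present only when the cap is hardware-enforced), capping the package at 150 W would be:

HWMON=/sys/class/hwmon/hwmon0
cat "$HWMON/power1_cap_max"                     # highest cap the hardware accepts, in microWatts
echo 150000000 | sudo tee "$HWMON/power1_cap"   # 150 W expressed in microWatts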

power[1-*]_cap_hyst Margin of hysteresis built around capping and notification. Unit: microWatt RW

power[1-*]_cap_max Maximum cap that can be set. Unit: microWatt RO

power[1-*]_cap_min Minimum cap that can be set. Unit: microWatt RO

power[1-*]_max Maximum power. Unit: microWatt RW

power[1-*]_crit Critical maximum power. If power rises to or above this limit, the system is expected to take drastic action to reduce power consumption, such as a system shutdown or a forced powerdown of some devices. Unit: microWatt RW

Also see the Alarms section for status flags associated with power readings.

2.11.2.1.8 Energy

energy[1-*]_input Cumulative energy use Unit: microJoule RO

2.11.2.1.9 Humidity

humidity[1-*]_input Humidity Unit: milli-percent (per cent mille, pcm) RO

2.11.2.1.10 Alarms

Each channel or limit may have an associated alarm file, containing a boolean value. 1 means that an alarm condition exists, 0 means no alarm. Usually a given chip will either use channel-related alarms or limit-related alarms, not both. The driver should just reflect the hardware implementation.

in[0-*]_alarm
curr[1-*]_alarm
power[1-*]_alarm
fan[1-*]_alarm
temp[1-*]_alarm
Channel alarm: 0 = no alarm, 1 = alarm. RO

OR


in[0-*]_min_alarm
in[0-*]_max_alarm
in[0-*]_lcrit_alarm
in[0-*]_crit_alarm
curr[1-*]_min_alarm
curr[1-*]_max_alarm
curr[1-*]_lcrit_alarm
curr[1-*]_crit_alarm
power[1-*]_cap_alarm
power[1-*]_max_alarm
power[1-*]_crit_alarm
fan[1-*]_min_alarm
fan[1-*]_max_alarm
temp[1-*]_min_alarm
temp[1-*]_max_alarm
temp[1-*]_lcrit_alarm
temp[1-*]_crit_alarm
temp[1-*]_emergency_alarm
Limit alarm: 0 = no alarm, 1 = alarm. RO

Each input channel may have an associated fault file. This can be used to report open diodes, unconnected fans, etc. where the hardware supports it. When this boolean has value 1, the measurement for that channel should not be trusted.

fan[1-*]_fault
temp[1-*]_fault
Input fault condition: 0 = no fault occurred, 1 = fault condition. RO
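A small sketch that sweeps all chips for asserted alarm or fault flags (the paths are the standard hwmon layout; which files exist depends on the driver):

for f in /sys/class/hwmon/hwmon*/*_alarm /sys/class/hwmon/hwmon*/*_fault; do
    [ -e "$f" ] && [ "$(cat "$f")" = "1" ] && echo "asserted: $f"
done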

Some chips also offer the possibility to get beeped when an alarm occurs:

beep_enable Master beep enable 0: no beeps 1: beeps RW

in[0-*]_beep
curr[1-*]_beep
fan[1-*]_beep
temp[1-*]_beep
Channel beep: 0 = disable, 1 = enable. RW


In theory, a chip could provide per-limit beep masking, but no such chip was seen so far. Old drivers provided a different, non-standard interface to alarms and beeps. These interface files are deprecated, but will be kept around for compatibility reasons:

alarms Alarm bitmask. RO Integer representation of one to four bytes. A ‘1’ bit means an alarm. Chips should be programmed for ‘comparator’ mode so that the alarm will ‘come back’ after you read the register if it is still valid. Generally a direct representation of a chip’s internal alarm registers; there is no standard for the position of individual bits. For this reason, the use of this interface file for new drivers is discouraged. Use individual *_alarm and *_fault files instead. Bits are defined in kernel/include/sensors.h.

beep_mask Bitmask for beep. Same format as ‘alarms’ with the same bit locations, use discouraged for the same reason. Use individual *_beep files instead. RW


2.11.2.1.11 Intrusion detection

intrusion[0-*]_alarm Chassis intrusion detection 0: OK 1: intrusion detected RW Contrary to regular alarm flags which clear themselves automatically when read, this one sticks until cleared by the user. This is done by writing 0 to the file. Writing other values is unsupported.
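For example (hwmon0 and channel 0 are placeholders), clearing a latched intrusion alarm is just a write of 0:

echo 0 | sudo tee /sys/class/hwmon/hwmon0/intrusion0_alarm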

intrusion[0-*]_beep Chassis intrusion beep 0: disable 1: enable RW

2.11.2.1.11.1 sysfs attribute writes interpretation

hwmon sysfs attributes always contain numbers, so the first thing to do is to convert the input to a number. There are two ways to do this, depending on whether the number can be negative or not:

unsigned long u = simple_strtoul(buf, NULL, 10);
long s = simple_strtol(buf, NULL, 10);

with buf being the buffer with the user input passed in by the kernel. Notice that we do not use the second argument of strto[u]l, and thus cannot tell when 0 is returned whether this was really 0 or was caused by invalid input. This is done deliberately, as checking this everywhere would add a lot of code to the kernel. Notice that it is important to always store the converted value in an unsigned long or long, so that no wraparound can happen before any further checking.

After the input string is converted to an (unsigned) long, the value should be checked to see if it is acceptable. Be careful with further conversions on the value before checking it for validity, as these conversions could still cause a wraparound before the check. For example, do not multiply the result, and only add/subtract if it has been divided before the add/subtract.

What to do if a value is found to be invalid depends on the type of the sysfs attribute that is being set. If it is a continuous setting, like a tempX_max or inX_max attribute, then the value should be clamped to its limits using clamp_val(value, min_limit, max_limit). If it is not continuous, like for example a tempX_type, then -EINVAL should be returned when an invalid value is written.

Example 1, temp1_max, register is a signed 8 bit value (-128 to 127 degrees):

long v = simple_strtol(buf, NULL, 10) / 1000;
v = clamp_val(v, -128, 127);
/* write v to register */

Example 2, fan divider setting, valid values 2, 4 and 8:


unsigned long v = simple_strtoul(buf, NULL, 10);

switch (v) {
case 2: v = 1; break;
case 4: v = 2; break;
case 8: v = 3; break;
default:
        return -EINVAL;
}
/* write v to register */

2.11.2.1.12 Performance

The pcie_bw sysfs file reports the usage of the PCIe bus over the last second as a string with 3 integers: "bytes-received bytes-sent mps". As there is no efficient way to calculate the size of each packet transmitted to and from the GPU in real time, the maximum payload size (mps), the largest size a PCIe packet can be, is included. The estimated bandwidth is then obtained by multiplying each of the two counts by mps and adding them: bytes-received*mps + bytes-sent*mps.
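A minimal sketch of that calculation from a shell (the card0 path is an assumption; the field layout is as described above):

read -r received sent mps < /sys/class/drm/card0/device/pcie_bw
echo "estimated PCIe bandwidth: $(( received * mps + sent * mps )) bytes/s"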

2.11.3 KFD Topology

Application software needs to understand the properties of the underlying hardware to leverage the performance capabilities of the platform for feature utilization and task scheduling. The sysfs topology exposes this information in a loosely hierarchical order. The information populated by the KFD driver is gathered from ACPI (CRAT) and the AMDGPU base driver.

The sysfs topology is arranged hierarchically as follows. The root directory of the topology is /sys/devices/virtual/kfd/kfd/topology/nodes/

Depending on the platform, inside this directory there will be sub-directories corresponding to each HSA agent. A system with N HSA agents will have N directories, as shown below.

/sys/devices/virtual/kfd/kfd/topology/nodes/0/ /sys/devices/virtual/kfd/kfd/topology/nodes/1/ . . /sys/devices/virtual/kfd/kfd/topology/nodes/N-1/

2.11.3.1 HSA Agent Information

The HSA agent directory and the sub-directories inside it contain all the information about that agent. The main information available is described below.


2.11.3.2 Node Information

This is available in the root directory of the HSA agent. It provides information about the compute capabilities of the agent, including the number of cores or compute units, the SIMD count, and the clock speed.
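For a quick look from a shell (node 0 is a placeholder, and the exact property names may vary between kernel versions):

NODE=/sys/devices/virtual/kfd/kfd/topology/nodes/0
cat "$NODE/name" 2>/dev/null                                  # marketing/ASIC name, if exposed
grep -E 'simd_count|cpu_cores_count|max_engine_clk' "$NODE/properties"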

2.11.3.3 Memory

The information for the memory banks attached to this agent is populated in the "mem_banks" subdirectory: /sys/devices/virtual/kfd/kfd/topology/nodes/N/mem_banks

2.11.3.4 Cache

The caches available to this agent are populated in the "cache" subdirectory: /sys/devices/virtual/kfd/kfd/topology/nodes/N/cache

2.11.3.5 IO-LINKS

The IO links provide HSA agent interconnect information, including the latency (cost) between agents. This is useful for peer-to-peer transfers.

2.11.3.6 How to use topology information

The information provided in sysfs should not be used directly by application software. Application software should always use the Thunk library API (libhsakmt) to access topology information. Please refer to the Thunk API for more information. The data are associated with a node ID, forming a per-node element list which references the elements contained at relative offsets within that list. A node associates with a kernel agent or agent. Node IDs should be 0-based, with the "0" ID representing the primary elements of the system (e.g., "boot cores", memory) if applicable. The enumeration order and, if applicable, values of the ID should match other information reported through mechanisms outside of the scope of the requirements; for example, the data and enumeration order contained in the ACPI SRAT table on some systems should match the memory order and properties reported through HSA. Further detail is out of the scope of the System Architecture and outlined in the Runtime API specification.


Each of these nodes is interconnected with other nodes in more advanced systems to the level necessary to adequately describe the topology.

Where applicable, the node grouping of physical memory follows NUMA principles to leverage memory locality in software when multiple physical memory blocks are available in the system and agents have a different “access cost” (e.g., bandwidth/latency) to that memory.


KFD Topology structure for AMDGPU:
sysfs-class-kfd
sysfs-class-kfd-topology
sysfs-class-kfd-topology-nodes-0
sysfs-class-kfd-topology-nodes-0-io_links-0-1
sysfs-class-kfd-topology-nodes-0-mem_banks-0
sysfs-class-kfd-topology-nodes-N-caches

2.12 ROCm Virtualization & Containers

2.12.1 PCIe Passthrough on KVM

The following KVM-based instructions assume a headless host with an input/output memory management unit (IOMMU) to pass peripheral devices such as a GPU to guest virtual machines. If you know your host supports IOMMU but the command below does not find "svm" or "vmx", you may need to enable IOMMU in your BIOS.

cat /proc/cpuinfo | grep -E "svm|vmx"

2.12.1.1 Ubuntu 16.04

Assume we use an Intel system that supports VT-d, with a fresh Ubuntu 16.04 installation.

a. Install the necessary packages and prepare the pass-through device

1. Install the virtualization packages:

sudo apt-get install qemu-kvm qemu-system bridge-utils virt-manager ubuntu-vm-builder libvirt-dev

2. Add the following modules to /etc/modules:

vfio
vfio_iommu_type1
vfio_pci
kvm
kvm_intel

Add intel_iommu=on in /etc/default/grub, then update GRUB:

GRUB_CMDLINE_LINUX_DEFAULT="quiet splash intel_iommu=on"
sudo update-grub

3. Blacklist amdgpu by adding the following line to /etc/modprobe.d/blacklist.conf:

blacklist amdgpu

b. Bind the pass-through device to vfio-pci

1. Create a script file (vfio-bind) under /usr/bin. The script file has the following content:

#!/bin/bash
modprobe vfio-pci

for dev in "$@"; do
        vendor=$(cat /sys/bus/pci/devices/$dev/vendor)
        device=$(cat /sys/bus/pci/devices/$dev/device)
        if [ -e /sys/bus/pci/devices/$dev/driver ]; then
                echo $dev > /sys/bus/pci/devices/$dev/driver/unbind
        fi
        echo $vendor $device > /sys/bus/pci/drivers/vfio-pci/new_id
done

2. Make it executable by entering the command:

chmod 755 vfio-bind

3. Bind the device to vfio by running the command for the three pass through devices

lspci -n -d 1002:
83:00.0 0300: 1002:7300 (rev ca)

vfio-bind 0000:83:00.0

4. sudo reboot

c. Pass the device through to the guest VM

1. Start the VMM by running "virt-manager" as root. Follow the on-screen instructions to create one virtual machine (VM); make sure the CPU copies the host CPU configuration and the network uses bridge mode.
2. Add Hardware -> Select PCI Host Device, then select the appropriate device to pass through, e.g. 0000:83:00.0.
3. sudo setpci -s 83:00.0 CAP_EXP+28.l=40
4. sudo reboot

After the reboot, start virt-manager and then start the VM. Inside the VM, lspci -d 1002: should show the passed-through device.

2.12.1.2 Fedora 27 or CentOS 7 (1708)

From a fresh install of Fedora 27 or CentOS 7 (1708):

a. Install the necessary packages and prepare the pass-through device

1. Identify the vendor and device id(s) for the PCIe device(s) you wish to pass through, e.g. 1002:6861 and 1002:aaf8 for an AMD Radeon Pro WX 9100 and its associated audio device:

lspci -nnk

2. Install the virtualization packages:

sudo dnf install @virtualization
sudo usermod -G libvirt -a $(whoami)
sudo usermod -G kvm -a $(whoami)

3. Enable IOMMU in the GRUB_CMDLINE_LINUX variable for your target kernel:

a. For an AMD CPU: sudo sed 's/quiet/quiet amd_iommu=on iommu=pt/' /etc/sysconfig/grub
b. For an Intel CPU: sudo sed 's/quiet/quiet intel_iommu=on iommu=pt/' /etc/sysconfig/grub

b. Bind the pass-through device to vfio-pci

4. Preempt the host claiming the device by loading a stub driver:

echo "options vfio-pci ids=1002:6861,1002:aaf8" | sudo tee -a /etc/modprobe.d/vfio.conf
echo "options vfio-pci disable_vga=1" | sudo tee -a /etc/modprobe.d/vfio.conf
sed 's/quiet/quiet rd.driver.pre=vfio-pci video=efifb:off/' /etc/sysconfig/grub

5. Update the kernel boot settings:

sudo grub2-mkconfig -o /etc/grub2-efi.cfg
echo 'add_drivers+="vfio vfio_iommu_type1 vfio_pci"' | sudo tee -a /etc/dracut.conf.d/vfio.conf
sudo dracut -f --kver $(uname -r)


6. Reboot and verify that the vfio-pci driver has been loaded:

lspci -nnk

c. Pass the device through to the guest VM

1. Within virt-manager the device should now appear in the list of available PCI devices.

Note: To pass through a device within a particular IOMMU group, all devices within that IOMMU group must also be passed through. You may wish to refer here for more details, such as the following script that lists all IOMMU groups and the devices within them.

#!/bin/bash shopt -s nullglob for d in /sys/kernel/iommu_groups/*/devices/*; do n=${d#*/iommu_groups/*}; n=${n%%/*} printf 'IOMMU Group %s ' "$n" lspci -nns "${d##*/}" done;

2.12.2 ROCm-Docker

Radeon Open Compute Platform for Docker. Please refer to ROCm-Docker. This repository contains a framework for building the software layers defined in the Radeon Open Compute Platform into portable docker images. The following are docker dependencies, which should be installed on the target machine:
• Docker on Ubuntu systems or Fedora systems
• Highly recommended: Docker-Compose to simplify container management

2.12.2.1 Docker Hub

Looking for an easy start with ROCm + Docker? The rocm/rocm-terminal image is hosted on Docker Hub. After the ROCm kernel is installed, pull the image from Docker Hub and create a new instance of a container:

sudo docker pull rocm/rocm-terminal
sudo docker run -it --device=/dev/kfd --device=/dev/dri --group-add video rocm/rocm-terminal

2.12.2.2 ROCm-docker set up guide

Installation instructions and asciicast demos are available to help users quickly get running with rocm-docker. Visit the set up guide to read more.

F.A.Q.

When working with the ROCm containers, the following are common and useful docker commands:
• A new docker container typically does not house apt repository metadata. Before trying to install new software using apt, make sure to run sudo apt update first.
• A message like the following typically means your user does not have permissions to execute docker; use sudo or add your user to the docker group:
  Cannot connect to the Docker daemon. Is the docker daemon running on this host?


• Open another terminal into a running container:
  sudo docker exec -it <container-name> bash -l
• Copy files from the host machine into a running docker container:
  sudo docker cp HOST_PATH <container-name>:/PATH
• Copy files from a running docker container onto the host machine:
  sudo docker cp <container-name>:/PATH/TO/FILE HOST_PATH
• If receiving messages about no space left on device when pulling images, check the storage driver in use by the docker engine. If it is 'devicemapper', the image size limits imposed by the 'devicemapper' storage driver are a problem. Follow the documentation in the quickstart for a solution to change the storage driver.

Saving work in a container

Docker containers are typically ephemeral and are discarded after closing the container with the '--rm' flag to docker run. However, there are times when it is desirable to close a container that has arbitrary work in it and serialize it back into a docker image. This may be to create a checkpoint in a long and complicated series of instructions, or to share the image with others through a docker registry, such as Docker Hub.

sudo docker ps -a                                      # Find the container of interest
sudo docker commit <container-name> <new-image-name>
sudo docker images                                     # Confirm the existence of the new image

2.12.2.3 Details

Docker does not virtualize or package the linux kernel inside of an image or container. This is a design decision of docker to provide lightweight and fast containerization. The implication of this for the ROCm compute stack is that in order for the docker framework to function, the ROCm kernel and corresponding modules must be installed on the host machine. Containers share the host kernel, so the ROCm KFD component ROCK-Kernel-Driver1 functions outside of docker.

Installing ROCK on the host machine

An apt-get repository is available to automate the installation of the required kernel and kernel modules.

2.12.2.4 Building images

There are two ways to install rocm components:

1. Install from the rocm apt/rpm repository (repo.radeon.com)
2. Build the components from source and run install scripts

The first method produces docker images with the smallest footprint and the best build speed. The footprint is smaller because no developer tools need to be installed in the image, and the image build speed is fastest because downloading binaries is typically much faster than downloading source and then invoking a build process. Of course, building components allows much greater flexibility on install location and the ability to step through the source with debug builds. ROCm-docker supports making images either way, depending on the flags passed to the setup script.

The setup script included in this repository provides some flexibility in how docker containers are constructed. Unfortunately, Dockerfiles do not have a preprocessor or template language, so typically build instructions are hardcoded. However, the setup script allows us to write a primitive 'template', and after running it instantiates baked dockerfiles with environment variables substituted in. For instance, if you wish to build release images and debug images, first run the setup script to generate release dockerfiles and build the images. Then, run the setup script again and specify debug dockerfiles and build new images. The docker images should generate unique image names and not conflict with each other.


setup.sh

Currently, the setup.sh script checks to make sure that it is running on an Ubuntu system, as it makes a few assumptions about the availability of tools and file locations. If running rocm on a Fedora machine, inspect the source of setup.sh and issue the appropriate commands manually. There are a few parameters to setup.sh of a generic nature that affect all images built after running. If no parameters are given, built images will be based off of Ubuntu 16.04 with rocm components installed from debians downloaded from packages.amd.com. Supported parameters can be queried with ./setup.sh --help.

setup.sh parameters:

parameter                    [default]    description
--ubuntu xx.yy               [16.04]      Ubuntu version for the inherited base image
--install-docker-compose                  helper to install the docker-compose tool

The following parameters are specific to building containers that compile rocm components from source.

setup.sh parameters:

parameter                    [default]    description
--tag string                 ['master']   string representing a git branch name
--branch string              ['master']   alias for --tag
--debug                                   build code with debug flags
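As a hedged example of how these flags combine (illustrative only; the script is referred to as both ./setup and setup.sh in this guide, so adjust the name to your checkout, and the branch shown is just the default):

./setup.sh --ubuntu 16.04 --install-docker-compose   # images built from prebuilt packages
./setup.sh --tag master --debug                      # source-build images with debug flags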

./setup generates finalized Dockerfiles from textual template files ending with the .template suffix. Each sub-directory of this repository corresponds to a docker ‘build context’ responsible for a software layer in the ROCm stack. After running the script, each directory contains generated dockerfiles for building images from debians and from source.

2.12.2.5 Docker compose

./setup prepares an environment to be controlled with Docker Compose. While docker-compose is not necessary for proper operation, it is highly recommended. setup.sh does provide a flag to simplify the installation of this tool. Docker-compose coordinates the relationships between the various ROCm software layers, and it remembers flags that should be passed to docker to expose devices and import volumes.

Example of using docker-compose

docker-compose.yml provides services that build and run containers. YAML is structured data, so it's easy to modify and extend. The setup.sh script generates a .env file that docker-compose reads to satisfy the definitions of the variables in the .yml file.
• docker-compose run --rm rocm - Run container using rocm packages
• docker-compose run --rm rocm-from-src - Run container with rocm built from source

Docker-compose description:

docker-compose   the docker compose executable
run              sub-command to bring up an interactive container
--rm             when shutting the container down, delete it
rocm             application service defined in docker-compose.yml

rocm-user has root privileges by default

The dockerfile that serves as a 'terminal' creates a non-root user called rocm-user. Because this container is meant to serve as a development environment (therefore apt-get is likely needed), the user has been added to the linux sudo group. Since it is somewhat difficult to set and change passwords in a container (often requiring a rebuild), the password prompt has been disabled for the sudo group. While this is convenient for development, to be able to sudo apt-get install packages, it does imply lower security in the container.


To increase container security:
1. Eliminate the sudo-nopasswd COPY statement in the dockerfile, and
2. Set your own password with RUN echo 'account:password' | chpasswd

The docker-ce release 18.02 has known defects working with the rocm-user account inside the docker image. Please upgrade the docker package to the 18.04 build.

Footnotes:

[1] It can be installed into a container; it just doesn't do anything because containers do not go through the traditional boot process. We actually do provide a container for ROCK-Kernel-Driver, but it is not used by the rest of the docker images. It does provide isolation and a reproducible environment for kernel development.

2.13 Remote Device Programming

2.13.1 ROCmRDMA

ROCmRDMA is the solution designed to allow third-party kernel drivers to utilize DMA access to GPU memory. It allows a direct path for data exchange (peer-to-peer) using the standard features of PCI Express. Currently ROCmRDMA provides the following benefits:
• Direct access to ROCm memory for third-party PCIe devices
• Support for the PeerDirect(c) interface to offload the CPU when dealing with ROCm memory for RDMA network stacks

2.13.1.1 Restrictions and limitations

To fully utilize ROCmRDMA, a number of limitations may apply, impacting either performance or functionality as a whole:
• It is recommended that devices utilizing ROCmRDMA share the same upstream PCI Express root complex. This limitation depends on the PCIe chipset manufacturer and is outside of GPU control;
• To provide peer-to-peer DMA access, all GPU local memory must be exposed via PCI memory BARs (the so-called large-BAR configuration);
• It is recommended to have IOMMU support disabled or configured in pass-through mode, due to a limitation in the Linux kernel in supporting local PCIe device memory for any form of translation other than 1:1 mapping.

2.13.1.2 ROCmRDMA interface specification

The implementation of the ROCmRDMA interface can be found in the [amd_rdma.h] file.

2.13.1.3 Data structures

/**
 * Structure describing the information needed for P2P access from another
 * device to a specific location of GPU memory
 */
struct amd_p2p_info {
        uint64_t         va;       /**< Specify user virt. address
                                     *  which this page table described
                                     */
        uint64_t         size;     /**< Specify total size of
                                     *  allocation
                                     */
        struct pid      *pid;      /**< Specify process pid to which
                                     *  virtual address belongs
                                     */
        struct sg_table *pages;    /**< Specify DMA/Bus addresses */
        void            *priv;     /**< Pointer set by AMD kernel
                                     *  driver
                                     */
};

/**
 * Structure providing function pointers to support rdma/p2p requirements
 * for specific locations of GPU memory
 */
struct amd_rdma_interface {
        int (*get_pages)(uint64_t address, uint64_t length, struct pid *pid,
                         struct amd_p2p_info **amd_p2p_data,
                         void (*free_callback)(void *client_priv),
                         void *client_priv);
        int (*put_pages)(struct amd_p2p_info **amd_p2p_data);
        int (*is_gpu_address)(uint64_t address, struct pid *pid);
        int (*get_page_size)(uint64_t address, uint64_t length, struct pid *pid,
                             unsigned long *page_size);
};

2.13.1.4 The function to query ROCmRDMA interface

/**
 * amdkfd_query_rdma_interface - Return the interface (function pointer table)
 * for the rdma interface
 *
 * \param interface - OUT: Pointer to interface
 * \return 0 if the operation was successful.
 */
int amdkfd_query_rdma_interface(const struct amd_rdma_interface **rdma);


2.13.1.6 ROCmRDMA interface functions description

/**
 * This function makes the pages underlying a range of GPU virtual memory
 * accessible for DMA operations from another PCIe device
 *
 * \param address       - The start address in the Unified Virtual Address
 *                        space in the specified process
 * \param length        - The length of the requested mapping
 * \param pid           - Pointer to the structure pid to which the address belongs.
 *                        Could be NULL for the current process address space.
 * \param p2p_data      - On return: Pointer to the structure describing
 *                        the underlying pages/locations
 * \param free_callback - Pointer to a callback which will be called when access
 *                        to such memory must be stopped immediately: memory
 *                        was freed, GECC events, etc.
 *                        The client should immediately stop any transfer
 *                        operations and return as soon as possible.
 *                        After return, all resources associated with the address
 *                        will be released and no access will be allowed.
 * \param client_priv   - Pointer to be passed as a parameter to
 *                        'free_callback'
 *
 * \return 0 if the operation was successful
 */
int get_pages(uint64_t address, uint64_t length, struct pid *pid,
              struct amd_p2p_info **amd_p2p_data,
              void (*free_callback)(void *client_priv),
              void *client_priv);

/**
 * This function releases resources previously allocated by a get_pages() call.
 * \param p_p2p_data - A pointer to a pointer to the amd_p2p_info entries
 *                     allocated by the get_pages() call.
 * \return 0 if the operation was successful
 */
int put_pages(struct amd_p2p_info **p_p2p_data);

/** * Check if given address belongs to GPU address space. * \param address- Address to check * \param pid- Process to which given address belongs. * Could be NULL if current one. * \return 0- This is not GPU address managed by AMD driver * 1- This is GPU address managed by AMD driver */ int is_gpu_address(uint64_t address, struct pid *pid);


2.13.2 UCX

2.13.2.1 Introduction

2.13.2.2 UCX Quick start

Compiling UCX

% ./autogen.sh % ./contrib/configure-release --prefix=$PWD/install % make -j8 install

2.13.2.3 UCX API usage examples

https://github.com/openucx/ucx/tree/master/test/examples

2.13.2.4 Running UCX

2.13.2.4.1 UCX internal performance tests

This infrastructure provides a function which runs a performance test (in the current thread) on UCX communication APIs. The purpose is to allow a developer to make optimizations to the code and immediately test their effects. The infrastructure provides both an API and a standalone tool which uses that API - ucx_perftest. The API is also used for unit tests. Location: src/tools/perf

Features of the library:
• uct_perf_test_run() is the function which runs the test (currently only the UCT API is supported).
• No need to do any resource allocation - just pass the testing parameters to the API.
• Requires running the function on 2 threads/processes/nodes - by passing RTE callbacks which are used to bootstrap the connections.
• Two testing modes - ping-pong and unidirectional stream (TBD bi-directional stream)
• Configurable message size and data layout (short/bcopy/zcopy)


• Supports: warmup cycles, unlimited iterations.
• UCT active-message stream is measured with simple flow control.
• The test driver is written in C++ (C linkage) to take advantage of templates.
• Results are reported to a callback function at the specified intervals, and also returned from the API call.
  – Including: latency, message rate, bandwidth - iteration average, and overall average.

Features of ucx_perftest:
• Has a pre-defined list of tests which are valid combinations of operation and testing mode.
• Can be run either as a client-server application, as an MPI application, or using libRTE.
• Supports: CSV output, numeric formatting.
• Supports "batch mode" - write the lists of tests to run to a text file (see example in contrib/perf) and run them one after another. Every line is the list of arguments that the tool would normally read as command-line options. They are "appended" to the other command-line arguments, if such were passed.

– “Cartesian” mode: if several batch files are specified, all possible combinations are executed!

$ ucx_perftest -h Usage: ucx_perftest [ server-hostname ] [ options ]

This test can also be launched as an MPI application. Common options:

Test options: -t Test to run. am_lat : active message latency. put_lat : put latency. add_lat : atomic add latency. get : get latency / bandwidth / message rate. fadd : atomic fetch-and-add latency / message rate. swap : atomic swap latency / message rate. cswap : atomic compare-and-swap latency / message rate. am_bw : active message bandwidth / message rate. put_bw : put bandwidth / message rate. add_mr : atomic add message rate.

-D Data layout.
        short : Use short messages API (cannot be used for get).
        bcopy : Use copy-out API (cannot be used for atomics).
        zcopy : Use zero-copy API (cannot be used for atomics).

-d Device to use for testing.
-x Transport to use for testing.
-c Set affinity to this CPU. (off)
-n Number of iterations to run. (1000000)
-s Message size. (8)
-H AM Header size. (8)
-w Number of warm-up iterations. (10000)
-W Flow control window size, for active messages. (128)
-O Maximal number of uncompleted outstanding sends. (1)
-N Use numeric formatting - thousands separator.
-f Print only final numbers.
-v Print CSV-formatted output.
-p TCP port to use for data exchange. (13337)
-b Batch mode. Read and execute tests from a file. Every line of the file is a test to run. The first word is the test name, and the rest are command-line arguments for the test.
-h Show this help message.

Server options: -l Accept clients in an infinite loop
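A minimal client/server sketch based on the usage text above (the hostname, transport, and device names are placeholders that must match your fabric):

# On the server node (no hostname argument means server mode):
ucx_perftest -c 0
# On the client node, pointing at the server:
ucx_perftest server-hostname -t put_lat -d mlx5_1:1 -x rc_mlx5 -c 0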

Example - using MPI as a launcher

When using MPI as the launcher to run ucx_perftest, please make sure that your ucx library was configured with MPI. Add the following to your configure line:

--with-mpi=/path/to/mpi/home

$ salloc -N2 --ntasks-per-node=1 mpirun --bind-to core --display-map ucx_perftest -d mlx5_1:1 \
        -x rc_mlx5 -t put_lat
salloc: Granted job allocation 6991
salloc: Waiting for resource configuration
salloc: Nodes clx-orion-[001-002] are ready for job
 Data for JOB [62403,1] offset 0

 ========================   JOB MAP   ========================

 Data for node: clx-orion-001  Num slots: 1  Max slots: 0  Num procs: 1
        Process OMPI jobid: [62403,1] App: 0 Process rank: 0

 Data for node: clx-orion-002  Num slots: 1  Max slots: 0  Num procs: 1
        Process OMPI jobid: [62403,1] App: 0 Process rank: 1

 ==============================================================

+--------------+-----------------------------+----------------------+-----------------------+
|              |        latency (usec)       |   bandwidth (MB/s)   |  message rate (msg/s) |
+--------------+---------+---------+---------+----------+-----------+-----------+-----------+
| # iterations | typical | average | overall |  average |   overall |   average |   overall |
+--------------+---------+---------+---------+----------+-----------+-----------+-----------+
        586527     0.845     0.852     0.852       4.47        4.47      586527      586527
       1000000     0.844     0.848     0.851       4.50        4.48      589339      587686

2.13.2.4.2 OpenMPI and OpenSHMEM with UCX

UCX installation

Requirements: Autoconf 2.63 and above.

1. Get the latest version of the UCX code:

$ git clone https://github.com/openucx/ucx.git ucx $ cd ucx

2. Run autogen:


$ ./autogen.sh

3. This step is only required for OpenPOWER (POWER8) platforms. On the Ubuntu platform the config.guess file is a bit outdated and does not have support for POWER. In order to resolve the issue, download an updated config.guess. From the root of the project:

$ wget https://github.com/shamisp/ucx/raw/topic/power8-config/config.guess

4. Configure:

$ mkdir build $ cd build $ ../configure --prefix=/your_install_path

Note: For best performance configuration, use ../contrib/configure-release. This will strip all debugging and profiling code.

5. Build and install:

$ make $ make install

6. Running unit tests (using Google Test). This only works if gtest was installed and detected on your platform, and --enable-gtest was passed to configure:

$ make -C test/gtest test

2.13.2.5 Interface to ROCm

• https://github.com/openucx/ucx/tree/master/src/uct/rocm

2.13.2.6 Documentation

• Slides • API documentation (v1.5)

2.13.2.6.1 High Level Design

UCX code consists of 3 parts:

Protocol Layer - UCP
Transport Layer - UCT
Services - UCS

Protocol Layer: Supports all functionality described in the API and does not require knowledge of particular hardware. It tries to provide the best "out-of-box" performance, combining different hardware mechanisms and transports. It may emulate features which are not directly supported in hardware, such as one-sided operations. In addition, it supports common software protocols which are not implemented in hardware, such as tag matching and generic active messages. More details: UCP-Design


Transport Layer: Provides direct access to hardware capabilities, without decision logic which would prefer one hardware mechanism over another. Some functionality may not be supported, due to hardware limitations. The capabilities are exposed in the interface. More details: UCT-Design

Services: Collection of generic services, data structures, debug aids, etc.

Responsibilities of each layer:

What                             Where        Why
Tag matching                     High level   Software protocol
RMA/AMO emulation                High level   Software protocol
Fragmentation                    High level   Software protocol
Pending queue                    High level   Stateful
Multi-transport/channel/rail     High level   OOB optimization
Select inline/bcopy/zcopy        High level   Optimization logic
Reliability (e.g. UD)            Low level    Transport specific
DMA buffer ownership             Low level    Transport specific
Memory registration cache        Low level    Transport dependent

See also: • sideprogresscompletion • DesignDiscuss

2.13.2.6.2 Infrastructure and Tools

Tools • PrintUCXinfo • findUCPendpoint • Malloc hooks • Performancemeasurement • Testing • UCXenv Infrastructure library (UCS) • Async • Configuration parsing • Memoryhooks • Data structures: – Double linked list – Single linked queue – Fragment list - reordering – Memory pool


– Index/Pointer array
– SGLIB
• Debugging:
– Resolving address to file name and line number
– Handling faults
– Attaching a debugger to self
– Logging
– Assertions (compile-time and run-time)
– Tracking memory used by different components
– Profiling
• Statistics
• Fast time measurement
– Read CPU timer
– Convert time to sec/msec/usec/nsec
– Timer queue
– Timer wheel
• Data types:
– Callback
– Class infrastructure
– Component infrastructure
– Spinlock
– Error codes
• System services:
– Atomic operations
– Fast bit operations (find first set bit, integer log2)
– Get hostname
– Generate UUID
– Get CPU affinity
– Read a whole file
– Get page / huge page size
– Allocate memory with SystemV
– Get memory region access flags (from /proc/$$/maps)
– Modify file flags with fcntl
– Get process command line
– Get CPU model, clock frequency
– Get thread ID


2.13.2.7 FAQ

What is UCX?
UCX is a framework (a collection of libraries and interfaces) that provides an efficient and relatively easy way to construct widely used HPC protocols: MPI tag matching, RMA operations, rendezvous protocols, streams, fragmentation, remote atomic operations, etc.

How do I get in touch with UCX developers?
Please join our mailing list: https://elist.ornl.gov/mailman/listinfo/ucx-group

What are UCP, UCT, and UCS?
• UCT is a transport layer that abstracts the differences across various hardware architectures and provides a low-level API that enables the implementation of communication protocols. The primary goal of the layer is to provide direct and efficient access to hardware network resources with minimal software overhead. For this purpose UCT relies on low-level drivers provided by vendors such as InfiniBand Verbs, Cray's uGNI, libfabrics, etc. In addition, the layer provides constructs for communication context management (thread-based and application level), and allocation and management of device-specific memories, including those found in accelerators. In terms of communication APIs, UCT defines interfaces for immediate (short), buffered copy-and-send (bcopy), and zero-copy (zcopy) communication operations. The short operations are optimized for small messages that can be posted and completed in place. The bcopy operations are optimized for medium-size messages that are typically sent through a so-called bouncing buffer. Finally, the zcopy operations expose zero-copy memory-to-memory communication semantics.
• UCP implements higher-level protocols that are typically used by message passing (MPI) and PGAS programming models by using lower-level capabilities exposed through the UCT layer. UCP is responsible for the following functionality: initialization of the library, selection of transports for communication, message fragmentation, and multi-rail communication. Currently, the API has the following classes of interfaces: Initialization, Remote Memory Access (RMA) communication, Atomic Memory Operations (AMO), Active Message, Tag-Matching, and Collectives.
• UCS is a service layer that provides the necessary functionality for implementing portable and efficient utilities.

What are the key features of UCX?
• Open source framework supported by vendors. The UCX framework is maintained and supported by hardware vendors in addition to the open source community. Every pull request is tested on multiple hardware platforms supported by the vendor community.
• Performance, performance, performance... The framework design, data structures, and components are designed to provide highly optimized access to the network hardware.
• High-level API for a broad range of HPC programming models. UCX provides a high-level API, implemented in software ('UCP'), to fill in the gaps across interconnects. This allows a single set of APIs in a library to support multiple interconnects, and reduces the complexity of implementing libraries such as Open MPI or OpenSHMEM. Because of this, UCX is performance-portable: a single implementation (in Open MPI or OpenSHMEM) will work efficiently on multiple interconnects (e.g. uGNI, Verbs, libfabrics, etc.).
• Support for interaction between multiple transports (or providers) to deliver messages. For example, UCX has the logic (in UCP) to make 'GPUDirect', IB and shared memory work together efficiently to deliver the data where it is needed without the user dealing with this.
• Cross-transport multi-rail capabilities.

What protocols are supported by UCX?
UCP implements RMA put/get, send/receive with tag matching, active messages, and atomic operations. In the near future we plan to add support for commonly used collective operations.

Is UCX a replacement for GASNET?


No. GASNET exposes a high-level API for PGAS programming that provides symmetric memory management capabilities and built-in runtime environments. These capabilities are out of scope of the UCX project. Instead, GASNET can leverage the UCX framework for a fast and efficient implementation of GASNET on the network technologies supported by UCX.

What is the relation between UCX and network drivers?
The UCX framework does not provide drivers; instead it relies on the drivers provided by vendors. Currently we use: OFA Verbs, Cray's uGNI, NVIDIA CUDA.

What is the relation between UCX and OFA Verbs or Libfabrics?
UCX is a middleware communication layer that relies on vendor-provided user-level drivers, including OFA Verbs or libfabrics (or any other drivers provided by other communities or vendors), to implement high-level protocols which can be used to close functionality gaps between various vendor drivers, including various libfabrics providers: coordination across various drivers, multi-rail capabilities, software-based RMA, AMOs, and tag matching for transports and drivers that do not support such capabilities natively.

Is UCX a user-level driver?
No. Typically, drivers aim to expose fine-grained access to network-architecture-specific features. UCX abstracts the differences across various drivers and fills in the gaps using software protocols for architectures that don't provide hardware-level support for all the operations.

Does UCX depend on an external runtime environment?
UCX does not depend on an external runtime environment. ucx_perftest (a UCX-based application/benchmark) can be linked with an external runtime environment that can be used for remote ucx_perftest launch, but this is an optional configuration which is only used for environments that do not provide direct access to compute nodes. By default this option is disabled.

How do I install UCX and OpenMPI?
See How to install UCX and OpenMPI.

How can I contribute?
1. Fork
2. Fix a bug or implement a new feature
3. Open a Pull Request

2.13.3 MPI

OpenMPI and OpenSHMEM installation

1. Get the latest-and-greatest OpenMPI version:

$ git clone https://github.com/open-mpi/ompi.git

2. Autogen:

$ cd ompi
$ ./autogen.pl

3. Configure with UCX

$ mkdir build
$ cd build
$ ../configure --prefix=/your_install_path/ --with-ucx=/path_to_ucx_installation


4. Build:

$ make
$ make install

Running Open MPI with UCX

Example command line (for InfiniBand RC + shared memory):

$ mpirun -np 2 -mca pml ucx -x UCX_NET_DEVICES=mlx5_0:1 -x UCX_TLS=rc,sm ./app

Open MPI runtime optimizations for UCX
• By default OpenMPI enables built-in transports (BTLs), which may result in additional software overheads in the OpenMPI progress function. To work around this issue, you may try to disable certain BTLs:

$ mpirun -np 2 -mca pml ucx --mca btl ^vader,tcp,openib -x UCX_NET_DEVICES=mlx5_0:1 -x UCX_TLS=rc,sm ./app

• The OpenMPI version from commit https://github.com/open-mpi/ompi/commit/066370202dcad8e302f2baf8921e9efd0f1f7dfc leverages a more efficient timer mechanism and therefore reduces software overheads in OpenMPI progress.

MPI and OpenSHMEM release versions tested with UCX master
1. UCX current tarball: https://github.com/openucx/ucx/archive/master.zip
2. The table below lists the MPI and OpenSHMEM distributions that are tested with the HEAD of UCX master:

MPI/OpenSHMEM project | Version
OpenMPI/OSHMEM        | 2.1.0
MPICH                 | Latest

2.13.4 IPC

2.13.4.1 Introduction

New datatypes

hsa_amd_ipc_memory_handle_t

/** IPC memory handle to be passed from one process to another */
typedef struct hsa_amd_ipc_memory_handle_s {
    uint64_t handle;
} hsa_amd_ipc_memory_handle_t;

hsa_amd_ipc_signal_handle_t

/** IPC signal handle to be passed from one process to another */
typedef struct hsa_amd_ipc_signal_handle_s {
    uint64_t handle;
} hsa_amd_ipc_signal_handle_t;

Memory sharing API

Allows sharing of HSA-allocated memory between different processes.

hsa_amd_ipc_get_memory_handle


The purpose of this API is to get / export an IPC handle for an existing allocation from pool.

hsa_status_t HSA_API
hsa_amd_ipc_get_memory_handle(void *ptr, hsa_amd_ipc_memory_handle_t *ipc_handle);

where:
IN: ptr - Pointer to memory previously allocated via a hsa_amd_memory_pool_allocate() call
OUT: ipc_handle - Unique IPC handle to be used in IPC. The application must pass this handle to another process.

hsa_amd_ipc_close_memory_handle

Close an IPC memory handle previously received via a hsa_amd_ipc_get_memory_handle() call.

hsa_status_t HSA_API
hsa_amd_ipc_close_memory_handle(hsa_amd_ipc_memory_handle_t ipc_handle);

where:
IN: ipc_handle - IPC handle to close

hsa_amd_ipc_open_memory_handle

Open / import an IPC memory handle exported from another process and return an address to be used in the current process.

hsa_status_t HSA_API
hsa_amd_ipc_open_memory_handle(hsa_amd_ipc_memory_handle_t ipc_handle, void **ptr);

where:
IN: ipc_handle - IPC handle
OUT: ptr - Address which can be used in the given process to access the memory

Client should call hsa_amd_memory_pool_free() when access to this resource is not needed any more.
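To illustrate the intended usage, here is a minimal sketch of the export/import flow described above. The send_to_peer()/recv_from_peer() helpers are hypothetical placeholders for whatever IPC channel (pipe, socket, file) the application uses to move the 64-bit handle between processes, and the header names are assumptions.

#include <stddef.h>
#include <hsa/hsa.h>
#include <hsa/hsa_ext_amd.h>   /* assumed header providing the AMD IPC extensions */

extern void send_to_peer(const void *buf, size_t len);   /* hypothetical IPC channel */
extern void recv_from_peer(void *buf, size_t len);       /* hypothetical IPC channel */

/* Exporting process: buffer was allocated with hsa_amd_memory_pool_allocate(). */
void export_buffer(void *buffer)
{
    hsa_amd_ipc_memory_handle_t handle;
    hsa_amd_ipc_get_memory_handle(buffer, &handle);   /* export the handle */
    send_to_peer(&handle, sizeof(handle));            /* pass it to the other process */
    /* later: hsa_amd_ipc_close_memory_handle(handle); once sharing is finished */
}

/* Importing process: map the peer's allocation into this address space. */
void import_buffer(void)
{
    hsa_amd_ipc_memory_handle_t handle;
    void *ptr = NULL;
    recv_from_peer(&handle, sizeof(handle));
    hsa_amd_ipc_open_memory_handle(handle, &ptr);     /* import the handle */
    /* ... access the shared memory through ptr ... */
    hsa_amd_memory_pool_free(ptr);                    /* release when no longer needed */
}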

Signal sharing API

Allows sharing of HSA signals between different processes.

hsa_amd_ipc_get_signal_handle

The purpose of this API is to get / export an IPC handle for an existing signal.

hsa_status_t HSA_API
hsa_amd_ipc_get_signal_handle(hsa_signal_t signal, hsa_amd_ipc_signal_handle_t *ipc_handle);

where:
IN: signal - Signal handle created as the result of a hsa_signal_create() call.
OUT: ipc_handle - Unique IPC handle to be used in IPC. The application must pass this handle to another process.

hsa_amd_ipc_close_signal_handle

Close an IPC signal handle previously received via a hsa_amd_ipc_get_signal_handle() call.

hsa_status_t HSA_API
hsa_amd_ipc_close_signal_handle(hsa_amd_ipc_signal_handle_t ipc_handle);

where:
IN: ipc_handle - IPC handle to close

hsa_amd_ipc_open_signal_handle

Open / import an IPC signal handle exported from another process and return a signal handle to be used in the current process.

hsa_status_t HSA_API
hsa_amd_ipc_open_signal_handle(hsa_amd_ipc_signal_handle_t ipc_handle, hsa_signal_t &signal);

where:
IN: ipc_handle - IPC handle
OUT: signal - Signal handle to be used in the current process

Client should call hsa_signal_destroy() when access to this resource is not needed any more.

Query API

Query memory information

Allows querying information about a memory resource based on its address. It partially overlaps with the Memory info interface requirement, so it may be possible to merge those two interfaces.

typedef enum hsa_amd_address_info_s {

    /** Return uint32_t / boolean indicating whether the address was allocated via the HSA stack */
    HSA_AMD_ADDRESS_HSA_ALLOCATED = 0x1,

    /** Return agent where such memory was allocated */
    HSA_AMD_ADDRESS_AGENT = 0x2,

    /** Return pool from which this address was allocated */
    HSA_AMD_ADDRESS_POOL = 0x3,



    /** Return size of allocation */
    HSA_AMD_ADDRESS_ALLOC_SIZE = 0x4

} hsa_amd_address_info_t;

hsa_status_t HSA_API
hsa_amd_get_address_info(void *ptr, hsa_amd_address_info_t attribute, void *value);

where:
ptr - Address about which to query information
attribute - Attribute to query
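As a sketch of how this proposed query interface would be used, the example below first checks whether an address was allocated via the HSA stack and then queries the allocation size. The uint64_t output type assumed for HSA_AMD_ADDRESS_ALLOC_SIZE is an assumption, since the interface above does not spell out the output type for that attribute, and the enum/prototype are assumed to come from the AMD HSA extension header.

#include <stdint.h>
#include <stdio.h>
/* Assumes the hsa_amd_address_info_t enum and the hsa_amd_get_address_info()
   prototype shown above are available from the AMD HSA extension header. */

void describe_address(void *ptr)
{
    uint32_t hsa_allocated = 0;   /* uint32_t/boolean result, as documented above */
    uint64_t alloc_size = 0;      /* output type assumed for ALLOC_SIZE */

    /* Was this address allocated via the HSA stack? */
    hsa_amd_get_address_info(ptr, HSA_AMD_ADDRESS_HSA_ALLOCATED, &hsa_allocated);

    if (hsa_allocated) {
        hsa_amd_get_address_info(ptr, HSA_AMD_ADDRESS_ALLOC_SIZE, &alloc_size);
        printf("HSA allocation of %llu bytes\n", (unsigned long long)alloc_size);
    } else {
        printf("Not an HSA allocation\n");
    }
}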

2.14 Deep Learning on ROCm

2.14.1 TensorFlow

2.14.1.1 ROCm Tensorflow v1.14 Release

We are excited to announce the release of ROCm-enabled TensorFlow v1.14 for AMD GPUs. In this release we have the following features enabled on top of upstream TF 1.14 enhancements:
• We integrated the ROCm RCCL library for multi-GPU communication; details are in the RCCL GitHub repo (https://github.com/ROCmSoftwarePlatform/rccl).
• The XLA backend is enabled for AMD GPUs; the functionality is complete, and performance optimization is in progress.

2.14.1.2 ROCm Tensorflow v2.0.0-beta1 Release

In addition to Tensorflow v1.14 release, we also enabled Tensorflow v2.0.0-beta1 for AMD GPUs. The TF-ROCm 2.0.0-beta1 release supports Tensorflow V2 API. Both whl packages and docker containers are available below.

2.14.1.3 Tensorflow Installation

First, you'll need to install the open-source ROCm 2.7 stack. Details can be found here.

Then, install these other relevant ROCm packages:

sudo apt update
sudo apt install rocm-libs miopen-hip cxlactivitylogger rccl

And finally, install TensorFlow itself (via the Python Package Index):

sudo apt install wget python3-pip
# Pip3 install the whl package from PyPI
pip3 install --user tensorflow-rocm

Tensorflow v2.0 is now installed!


2.14.1.4 Tensorflow More Resources

Tensorflow docker images are also publicly available; more details can be found here. The official GitHub repository is here.

2.14.2 MIOpen

2.14.2.1 ROCm MIOpen v2.0.1 Release

Announcing our new foundation for deep learning acceleration, MIOpen 2.0, which introduces support for Convolutional Neural Network (CNN) acceleration, built to run on top of the ROCm software stack!

This release includes the following:
• This release contains bug fixes and performance improvements.
• Additionally, the convolution algorithm Implicit GEMM is now enabled by default.
• Known issues:
– Backward propagation for batch normalization in fp16 mode may trigger NaN in some cases
– Softmax Log mode may produce an incorrect result in back propagation
• Source code
• Documentation
– MIOpen
– MIOpenGemm

Changes:
• Added Winograd multi-pass convolution kernel
• Fixed issue with hip compiler paths
• Fixed immediate mode behavior with auto-tuning environment variable
• Fixed issue with system find-db in-memory cache; the fix enables the cache by default
• Improved logging
• Improved how symbols are hidden in the library
• Updated default behavior to enable implicit GEMM

2.14.2.2 Porting from cuDNN to MIOpen

The porting guide highlights the key differences between the current cuDNN and MIOpen APIs.

2.14.2.3 ROCm 2.7 has prebuilt packages for MIOpen

Install the ROCm MIOpen implementation (assuming you already have the 'rocm' and 'rocm-opencl-dev' packages installed).

MIOpen can be installed on Ubuntu using apt-get.


For just OpenCL development:

sudo apt-get install miopengemm miopen-opencl

For HIP development:

sudo apt-get install miopengemm miopen-hip

Or you can build from source code.

Currently, the two backends cannot be installed on the same system simultaneously. If a backend other than the one currently on the system is desired, please uninstall the existing backend completely and then install the new backend.

2.14.3 PyTorch

2.14.3.1 Building PyTorch for ROCm

This is a quick guide to set up PyTorch with ROCm support inside a docker container. It assumes a .deb-based system. See ROCm install for supported operating systems and general information on the ROCm software stack. A ROCm install of version 2.7 is currently required.

1. Install or update rocm-dev on the host system:

sudo apt-get install rocm-dev

or

sudo apt-get update
sudo apt-get upgrade

2.14.3.2 Recommended: Install using published PyTorch ROCm docker image

2. Obtain docker image:

docker pull rocm/pytorch:rocm2.7_ubuntu16.04_py2.7

3. Clone PyTorch repository on the host:

cd ~
git clone https://github.com/pytorch/pytorch.git
cd pytorch
git submodule init
git submodule update

4. Start a docker container using the downloaded image:

sudo docker run -it -v $HOME:/data --privileged --rm --device=/dev/kfd --device=/dev/dri --group-add video rocm/pytorch:rocm2.7_ubuntu16.04_py2.7

Note: This will mount your host home directory on /data in the container.

5. Change to the previous PyTorch checkout from within the running docker:

cd /data/pytorch

6. Build PyTorch for ROCm:


Unless you are running a gfx900/Vega10-type GPU (MI25, Vega56, Vega64, ...), explicitly export the GPU architecture to build for, e.g.:

export HCC_AMDGPU_TARGET=gfx906

then

.jenkins/pytorch/build.sh

This will first hipify the PyTorch sources and then compile using 4 concurrent jobs, needing 16 GB of RAM to be available to the docker image. 7. Confirm working installation:

PYTORCH_TEST_WITH_ROCM=1 python test/run_test.py --verbose

No tests will fail if the compilation and installation is correct.

8. Install torchvision:

pip install torchvision

This step is optional but most PyTorch scripts will use torchvision to load models. E.g., running the pytorch examples requires torchvision.

9. Commit the container to preserve the pytorch install (from the host):

sudo docker commit -m 'pytorch installed'

2.14.3.3 Option 2: Install using PyTorch upstream docker file

2. Clone PyTorch repository on the host:

cd ~
git clone https://github.com/pytorch/pytorch.git
cd pytorch
git submodule init
git submodule update

3. Build PyTorch docker image:

cd pytorch/docker/caffe2/jenkins
./build.sh py2-clang7-rocmdeb-ubuntu16.04

This should complete with a message "Successfully built ". Note that other software versions may be chosen here; such setups are currently not tested though!

4. Start a docker container using the new image:

sudo docker run -it -v $HOME:/data --privileged --rm --device=/dev/kfd --device=/dev/dri --group-add video

Note: This will mount your host home directory on /data in the container.

5. Change to the previous PyTorch checkout from within the running docker:

cd /data/pytorch

6. Build PyTorch for ROCm:


Unless you are running a gfx900/Vega10-type GPU (MI25, Vega56, Vega64, ...), explicitly export the GPU architecture to build for, e.g.:

export HCC_AMDGPU_TARGET=gfx906

then

.jenkins/pytorch/build.sh

This will first hipify the PyTorch sources and then compile using 4 concurrent jobs, needing 16 GB of RAM to be available to the docker image. 7. Confirm working installation:

PYTORCH_TEST_WITH_ROCM=1 python test/run_test.py --verbose

No tests will fail if the compilation and installation is correct.

8. Install torchvision:

pip install torchvision

This step is optional but most PyTorch scripts will use torchvision to load models. E.g., running the pytorch examples requires torchvision.

9. Commit the container to preserve the pytorch install (from the host):

sudo docker commit -m 'pytorch installed'

2.14.3.4 Option 3: Install using minimal ROCm docker file

2. Download pytorch dockerfile: Dockerfile

3. Build docker image:

cd pytorch_docker
sudo docker build .

This should complete with a message "Successfully built "

4. Start a docker container using the new image:

sudo docker run -it -v $HOME:/data --privileged --rm --device=/dev/kfd --device=/dev/dri --group-add video

Note: This will mount your host home directory on /data in the container.

5. Clone pytorch master (on to the host):

cd ~
git clone https://github.com/pytorch/pytorch.git
or
git clone https://github.com/ROCmSoftwarePlatform/pytorch.git
cd pytorch
git submodule init
git submodule update

6. Run “hipify” to prepare source code (in the container):

2.14. Deep Learning on ROCm 351 ReadTheDocs-Breathe Documentation, Release 1.0.0

cd /data/pytorch/
python tools/amd_build/build_pytorch_amd.py
python tools/amd_build/build_caffe2_amd.py

7. Build and install pytorch:

Unless you are running a gfx900/Vega10-type GPU (MI25, Vega56, Vega64, ...), explicitly export the GPU architecture to build for, e.g.:

export HCC_AMDGPU_TARGET=gfx906

then

USE_ROCM=1 MAX_JOBS=4 python setup.py install --user

Use MAX_JOBS=n to limit peak memory usage. If building fails, try falling back to fewer jobs. 4 jobs assume available main memory of 16 GB or larger.

8. Confirm working installation:

PYTORCH_TEST_WITH_ROCM=1 python test/run_test.py --verbose

No tests will fail if the compilation and installation is correct.

9. Install torchvision:

pip install torchvision

This step is optional but most PyTorch scripts will use torchvision to load models. E.g., running the pytorch examples requires torchvision.

10. Commit the container to preserve the pytorch install (from the host):

sudo docker commit -m 'pytorch installed'

2.14.3.5 Try PyTorch examples

1. Clone the PyTorch examples repository: git clone https://github.com/pytorch/examples.git

2. Run individual example: MNIST

cd examples/mnist

Follow instructions in README.md, in this case:

pip install -r requirements.txt
python main.py

3. Run individual example: Try ImageNet training

cd ../imagenet

Follow instructions in README.md.


2.14.4 Caffe2

2.14.4.1 Building Caffe2 for ROCm

This is a quick guide to set up Caffe2 with ROCm support inside a docker container and run it on AMD GPUs. Caffe2 with ROCm support offers complete functionality on a single GPU, achieving great performance on AMD GPUs using both native ROCm libraries and custom hip kernels. This requires your host system to have ROCm 2.7 drivers installed. Please refer to ROCm install to install the ROCm software stack. If your host system doesn't have docker installed, please refer to docker install. It is recommended to add the user to the docker group to run docker as a non-root user; please refer here.

This guide provides two options to run Caffe2:
1. Launch the docker container using a docker image with Caffe2 installed.
2. Build Caffe2 from source inside a Caffe2 ROCm docker image.

2.14.4.2 Option 1: Docker image with Caffe2 installed:

This option provides a docker image which has Caffe2 installed. Users can launch the docker container and train/run deep learning models directly. This docker image will run on both gfx900 (Vega10-type GPUs: MI25, Vega56, Vega64, ...) and gfx906 (Vega20-type GPUs: MI50, MI60).

1. Launch the docker container:

docker run -it --network=host --device=/dev/kfd --device=/dev/dri --group-add video rocm/pytorch:rocm2.7_ubuntu16.04_py2.7_caffe2

This will automatically download the image if it does not exist on the host. You can also pass the -v argument to mount any data directories onto the container.

2.14.4.3 Option 2: Install using Caffe2 ROCm docker image:

1. Clone PyTorch repository on the host:

cd ~
git clone --recurse-submodules https://github.com/pytorch/pytorch.git
cd pytorch
git submodule update --init --recursive

2. Launch the docker container

docker pull rocm/pytorch:rocm2.7_ubuntu16.04_py2.7
docker run -it --network=host --device=/dev/kfd --device=/dev/dri --group-add video -v $PWD:/pytorch rocm/pytorch:rocm2.7_ubuntu16.04_py2.7

3. Build Caffe2 from source

cd /pytorch

If running on a gfx900/Vega10-type GPU (MI25, Vega56, Vega64, ...):

.jenkins/caffe2/build.sh

If running on a gfx906/Vega20-type GPU (MI50, MI60):


HCC_AMDGPU_TARGET=gfx906 .jenkins/caffe2/build.sh

2.14.4.4 Test the Caffe2 Installation

To validate the Caffe2 installation, run:

1. Test command:

cd ~ && python -c 'from caffe2.python import core' 2>/dev/null && echo "Success" || echo "Failure"

2. Running unit tests in Caffe2:

cd /pytorch
.jenkins/caffe2/test.sh

2.14.4.5 Run benchmarks

The Caffe2 benchmarking script supports the following networks: MLP, AlexNet, OverFeat, VGGA, Inception.

To run benchmarks for these networks, run the command below from the pytorch home directory, replacing <network> with one of the network names:

python caffe2/python/convnet_benchmarks.py --batch_size 64 --model <network> --engine MIOPEN

2.14.4.6 Running example scripts

Please refer to the example scripts in caffe2/python/examples. It currently has resnet50_trainer.py, which can run ResNet and ResNeXt models with various layer, group, and depth configurations, and char_rnn.py, which uses RNNs to do character-level prediction.

2.14.4.7 Building own docker images

After cloning the pytorch repository, you can build your own Caffe2 ROCm docker image. Navigate to the pytorch repo and run:

cd docker/caffe2/jenkins
./build.sh py2-clang7-rocmdeb-ubuntu16.04

This should complete with a message “Successfully built ” which can then be used to install Caffe2 as in Option 2 above.


2.14.5 Deep Learning Framework support for ROCm

Framework  | Status                       | MIOpen Enabled | Upstreamed      | Current Repository / Notes
Caffe      | Public                       | Yes            |                 | https://github.com/ROCmSoftwarePlatform/hipCaffe
Tensorflow | Development                  | Yes            | CLA in Progress | Notes: Working on NCCL and XLA enablement, Running
Caffe2     | Upstreaming                  | Yes            | CLA in Progress |
Torch      | HIP Upstreaming in Progress  | Development    |                 | https://github.com/ROCmSoftwarePlatform/cutorch_hip
PyTorch    | Development                  | Development    |                 |
MxNet      | Development                  | Development    |                 | https://github.com/ROCmSoftwarePlatform/mxnet
CNTK       | Development                  | Development    |                 |

2.14.6 Tutorials

hipCaffe
• caffe
MXNet
• mxnet

2.15 System Level Debug

2.15.1 ROCm Language & System Level Debug, Flags and Environment Variables

Kernel options to avoid the Ethernet port getting renamed every time you change graphics cards:

net.ifnames=0 biosdevname=0

2.15.1.1 ROCr Error Code

• 2 Invalid Dimension
• 4 Invalid Group Memory
• 8 Invalid (or Null) Code
• 32 Invalid Format
• 64 Group is too large
• 128 Out of VGPRs
• 0x80000000 Debug Trap


2.15.1.2 Command to dump firmware version and get Linux Kernel version

• sudo cat /sys/kernel/debug/dri/1/amdgpu_firmware_info
• uname -a

2.15.1.3 Debug Flags

Debug messages when developing/debugging the base ROCm driver. You can enable the printing from libhsakmt.so by setting the environment variable HSAKMT_DEBUG_LEVEL. Available debug levels are 3-7. The higher the level you set, the more messages will print.
• export HSAKMT_DEBUG_LEVEL=3 : only pr_err() will print.
• export HSAKMT_DEBUG_LEVEL=4 : pr_err() and pr_warn() will print.
• export HSAKMT_DEBUG_LEVEL=5 : we currently don't implement "notice"; setting to 5 is the same as setting to 4.
• export HSAKMT_DEBUG_LEVEL=6 : pr_err(), pr_warn(), and pr_info() will print.
• export HSAKMT_DEBUG_LEVEL=7 : everything including pr_debug() will print.

2.15.1.4 ROCr level env variable for debug

• HSA_ENABLE_SDMA=0
• HSA_ENABLE_INTERRUPT=0
• HSA_SVM_GUARD_PAGES=0
• HSA_DISABLE_CACHE=1

2.15.1.5 Turn Off Page Retry on GFX9/Vega devices

• sudo -s
• echo 1 > /sys/module/amdkfd/parameters/noretry


2.15.1.6 HCC Debug Environment Variables

HCC_PRINT_ENV=1 will print usage and current values for the HCC and HIP env variables.

HCC_PRINT_ENV = 1 : Print values of HCC environment variables
HCC_SERIALIZE_KERNEL = 0 : 0x1=pre-serialize before each kernel launch, 0x2=post-serialize after each kernel launch, 0x3=both
HCC_SERIALIZE_COPY = 0 : 0x1=pre-serialize before each data copy, 0x2=post-serialize after each data copy, 0x3=both
HCC_DB = 0 : Enable HCC trace debug
HCC_OPT_FLUSH = 1 : Perform system-scope acquire/release only at CPU sync boundaries (rather than after each kernel)
HCC_MAX_QUEUES = 20 : Set max number of HSA queues this process will use. accelerator_views will share the allotted queues and steal from each other as necessary
HCC_UNPINNED_COPY_MODE = 2 : Select algorithm for unpinned copies. 0=ChooseBest (see thresholds), 1=PinInPlace, 2=StagingBuffer, 3=Memcpy
HCC_CHECK_COPY = 0 : Check dst == src after each copy operation. Only works on large-bar systems.
HCC_H2D_STAGING_THRESHOLD = 64 : Min size (in KB) to use staging buffer algorithm for H2D copy if ChooseBest algorithm selected
HCC_H2D_PININPLACE_THRESHOLD = 4096 : Min size (in KB) to use pin-in-place algorithm for H2D copy if ChooseBest algorithm selected
HCC_D2H_PININPLACE_THRESHOLD = 1024 : Min size (in KB) to use pin-in-place for D2H copy if ChooseBest algorithm selected
HCC_PROFILE = 0 : Enable HCC kernel and data profiling. 1=summary, 2=trace
HCC_PROFILE_VERBOSE = 31 : Bitmask to control profile verbosity and format. 0x1=default, 0x2=show begin/end, 0x4=show barrier


2.15.1.7 HIP Environment Variables

HIP_PRINT_ENV = 1 : Print HIP environment variables.
HIP_LAUNCH_BLOCKING = 0 : Make HIP kernel launches 'host-synchronous', so they block until the kernel completes. Alias: CUDA_LAUNCH_BLOCKING
HIP_LAUNCH_BLOCKING_KERNELS = : Comma-separated list of kernel names to make host-synchronous, so they block until completed.
HIP_API_BLOCKING = 0 : Make HIP APIs 'host-synchronous', so they block until completed. Impacts hipMemcpyAsync, hipMemsetAsync
HIP_HIDDEN_FREE_MEM = 256 : Amount of memory to hide from the free memory reported by hipMemGetInfo, specified in MB. Impacts hipMemGetInfo
HIP_DB = 0 : Print debug info. Bitmask (HIP_DB=0xff) or flags separated by '+' (HIP_DB=api+sync+mem+copy)
HIP_TRACE_API = 0 : Trace each HIP API call. Print function name and return code to stderr as the program executes.
HIP_TRACE_API_COLOR = green : Color to use for HIP_API. None/Red/Green/Yellow/Blue/Magenta/Cyan/White
HIP_PROFILE_API = 0 : Add HIP API markers to the ATP file generated with CodeXL. 0x1=short API name, 0x2=full API name including args
HIP_DB_START_API = : Comma-separated list of tid.api_seq_num for when to start debug and profiling.
HIP_DB_STOP_API = : Comma-separated list of tid.api_seq_num for when to stop debug and profiling.
HIP_VISIBLE_DEVICES = 0 : Only devices whose index is present in the sequence are visible to HIP applications, and they are enumerated in the order of the sequence
HIP_WAIT_MODE = 0 : Force synchronization mode. 1=force yield, 2=force spin, 0=defaults specified in application
HIP_FORCE_P2P_HOST = 0 : Force use of host/staging copy for peer-to-peer copies. 1=always use copies, 2=always return false for hipDeviceCanAccessPeer
HIP_FORCE_SYNC_COPY = 0 : Force all copies (even hipMemcpyAsync) to use sync copies
HIP_FAIL_SOC = 0 : Fault on Sub-Optimal-Copy, rather than use a slower but functional implementation. Bit 0x1=fail on async copy with unpinned memory. Bit 0x2=fail peer copy rather than use staging buffer copy
HIP_SYNC_HOST_ALLOC = 1 : Sync before and after all host memory allocations. May help stability
HIP_SYNC_NULL_STREAM = 0 : Synchronize on host for null stream submissions
HIP_HOST_COHERENT = 1 : If set, all host memory will be allocated as fine-grained system memory. This allows threadfence_system to work but prevents host memory from being cached on the GPU, which may have a performance impact.
HCC_OPT_FLUSH = 1 : When set, use agent-scope fence operations rather than system-scope fence operations when possible. This flag controls both HIP and HCC behavior
HIP_EVENT_SYS_RELEASE = 0 : If set, events are created with hipEventReleaseToSystem by default. If 0, events are created with hipEventReleaseToDevice by default. The defaults can be overridden by specifying the hipEventReleaseToSystem or hipEventReleaseToDevice flag when creating the event.

2.15.1.8 OpenCL Debug Flags

• AMD_OCL_WAIT_COMMAND=1 (0 = OFF, 1 = On)

2.15.1.9 PCIe-Debug

Refer here for PCIe-Debug.

There's some more information here on how to debug and profile HIP applications:
• HIP-Debugging
• HIP-Profiling

2.16 Tutorial

• caffe: How to use Caffe on ROCm
• Vector-Add: example using the HIP Programming Language
• mini-nbody: This sample demonstrates the use of the HIP API for a mini n-body problem.
• GCN-asm-tutorial: Assembly sample "The Art of AMDGCN Assembly: How to Bend the Machine to Your Will." This tutorial demonstrates GCN assembly with ROCm application development.
• Optimizing-Dispatches: "ROCm With Rapid Harmony: Optimizing HSA Dispatch." This tutorial shows how to optimize HSA dispatch performance for ROCm application development.
• rocncloc: "ROCm With Harmony: Combining OpenCL Kernels, HCC and HSA in a Single Program." This tutorial demonstrates how to compile OpenCL kernels using the CL offline compiler (CLOC) and integrate them with HCC C++ compiled ROCm applications.
• The AMD GCN Architecture - A Crash Course, by Layla Mah
• AMD GCN Architecture White paper
• ROCm-MultiGPU

2.17 ROCm Glossary

ROCr (ROCm runtime): The HSA runtime is a thin, user-mode API that exposes the necessary interfaces to access and interact with graphics hardware driven by the AMDGPU driver set and the ROCK kernel driver. Together they enable programmers to directly harness the power of AMD discrete graphics devices by allowing host applications to launch compute kernels directly to the graphics hardware.

HCC (Heterogeneous Compute Compiler): HCC is an open source, optimizing C++ compiler for heterogeneous compute. It supports heterogeneous offload to AMD APUs and discrete GPUs via HSA-enabled runtimes and drivers. It is based on Clang, the LLVM Compiler Infrastructure, and the 'libc++' C++ standard library. The goal is to implement a compiler that takes a program that conforms to a parallel programming standard such as C++ AMP, HC, C++17 Parallel STL, or OpenMP, and transforms it into the AMD GCN ISA.

Accelerator modes supported:
• HC C++ API
• HIP
• C++AMP


• C++ Parallel STL
• OpenMP

HIP (Heterogeneous Interface for Portability): Heterogeneous Interface for Portability is a C++ runtime API and kernel language that allows developers to create portable applications that can run on AMD and other GPUs. It provides a C-style API and a C++ kernel language. The first big feature available in HIP is porting apps that use the CUDA Driver API.

OpenCL: Open Computing Language (OpenCL) is a framework for writing programs that execute across heterogeneous platforms consisting of central processing units (CPUs), graphics processing units (GPUs), digital signal processors (DSPs), field-programmable gate arrays (FPGAs) and other processors or hardware accelerators. OpenCL provides a standard interface for parallel computing using task- and data-based parallelism. The programming language that is used to write compute kernels is called OpenCL C and is based on C99, but adapted to fit the device model in OpenCL. OpenCL consists of a set of headers and a shared object that is loaded at runtime. As of 2016 OpenCL runs on graphics processing units, CPUs with SIMD instructions, FPGAs, Movidius Myriad 2, Adapteva Epiphany and DSPs.

PCIe Platform Atomics: PCI Express (PCIe) was developed as the next generation I/O system interconnect after PCI, designed to enable advanced performance and features in connected devices while remaining compatible with the PCI software environment. Today, atomic transactions are supported for synchronization without using an interrupt mechanism. In emerging applications where math co-processing, visualization and content processing are required, enhanced synchronization enables higher performance.

Queue: A Queue is a runtime-allocated resource that contains a packet buffer and is associated with a packet processor. The packet processor tracks which packets in the buffer have already been processed. When it has been informed by the application that a new packet has been enqueued, the packet processor is able to process it because the packet format is standard and the packet contents are self-contained: they include all the necessary information to run a command. A queue has an associated set of high-level operations defined in the "HSA Runtime Specification" (API functions in host code) and the "HSA Programmer Reference Manual Specification" (kernel code).

HSA (Heterogeneous System Architecture): HSA provides a unified view of fundamental computing elements. HSA allows a programmer to write applications that seamlessly integrate CPUs (called latency compute units) with GPUs (called throughput compute units), while benefiting from the best attributes of each. HSA creates an improved processor design that exposes the benefits and capabilities of mainstream programmable compute elements, working together seamlessly. HSA is all about delivering new, improved user experiences through advances in computing architectures that deliver improvements across all four key vectors: improved power efficiency, improved performance, improved programmability, and broad portability across computing devices. For more on HSA.

AQL (Architected Queuing Language): The Architected Queuing Language (AQL) is a standard binary interface used to describe commands such as a kernel dispatch. An AQL packet is a user-mode buffer with a specific format that encodes one command. AQL allows agents to build and enqueue their own command packets, enabling fast, low-power dispatch. AQL also provides support for kernel agent queue submissions: a kernel running on a kernel agent can write commands in AQL format.


362 Chapter 2. Solid Compilation Foundation and Language Support INDEX

Symbols rocalution::BaseAMG::SetDefaultSmootherFormat _ROCBLAS_AUXILIARY_H_ (C macro), 103 (C++ function), 259 _ROCBLAS_FUNCTIONS_H_ (C macro), 103 rocalution::BaseAMG::SetManualSmoothers _ROCBLAS_H_ (C macro), 121 (C++ function), 259 _ROCBLAS_TYPES_H_ (C macro), 118 rocalution::BaseAMG::SetManualSolver (C++ function), 259 G rocalution::BaseAMG::SetOperatorFormat get_rocblas_status_for_hip_status (C++ (C++ function), 260 rocalution::BaseMultiGrid function), 124 (C++ class), 258 rocalution::BaseMultiGrid::InitLevels M (C++ function), 258 rocalution::BaseMultiGrid::SetCycle MAT_BUFF_MAX_BYTES (C++ member), 123 (C++ function), 258 MATRIX_DIM_X (C++ member), 124 rocalution::BaseMultiGrid::SetHostLevels MATRIX_DIM_Y (C++ member), 124 (C++ function), 258 N rocalution::BaseMultiGrid::SetKcycleFull (C++ function), 258 NB_X (C++ member), 123 rocalution::BaseMultiGrid::SetOperatorHierarchy O (C++ function), 258 rocalution::BaseMultiGrid::SetProlongOperator open_log_stream (C++ function), 122 (C++ function), 258 R rocalution::BaseMultiGrid::SetRestrictOperator (C++ function), 258 rocalution::_rocalution_sync (C++ rocalution::BaseMultiGrid::SetScaling function), 215 (C++ function), 258 rocalution::AIChebyshev (C++ class), 263 rocalution::BaseMultiGrid::SetSmoother rocalution::AIChebyshev::Set (C++ (C++ function), 258 function), 263 rocalution::BaseMultiGrid::SetSmootherPostIter rocalution::allocate_host (C++ function), (C++ function), 258 212 rocalution::BaseMultiGrid::SetSmootherPreIter rocalution::AS (C++ class), 264 (C++ function), 258 rocalution::AS::Set (C++ function), 265 rocalution::BaseMultiGrid::SetSolver rocalution::BaseAMG (C++ class), 259 (C++ function), 258 rocalution::BaseAMG::BuildHierarchy rocalution::BaseRocalution (C++ class), 215 (C++ function), 259 rocalution::BaseRocalution::Clear (C++ rocalution::BaseAMG::BuildSmoothers function), 216 (C++ function), 259 rocalution::BaseRocalution::CloneBackend rocalution::BaseAMG::ClearLocal (C++ (C++ function), 216 function), 259 rocalution::BaseRocalution::Info (C++ rocalution::BaseAMG::GetNumLevels (C++ function), 216 function), 260 rocalution::BaseRocalution::MoveToAccelerator rocalution::BaseAMG::SetCoarsestLevel (C++ function), 216 (C++ function), 259

363 ReadTheDocs-Breathe Documentation, Release 1.0.0 rocalution::BaseRocalution::MoveToAcceleratorAsync (C++ function), 243 (C++ function), 216 rocalution::GlobalMatrix::AllocateCSR rocalution::BaseRocalution::MoveToHost (C++ function), 243 (C++ function), 216 rocalution::GlobalMatrix::Check (C++ rocalution::BaseRocalution::MoveToHostAsync function), 243 (C++ function), 216 rocalution::GlobalMatrix::CloneFrom rocalution::BaseRocalution::Sync (C++ (C++ function), 244 function), 216 rocalution::GlobalMatrix::CoarsenOperator rocalution::BiCGStab (C++ class), 255 (C++ function), 245 rocalution::BiCGStabl (C++ class), 255 rocalution::GlobalMatrix::ConvertTo rocalution::BiCGStabl::SetOrder (C++ (C++ function), 244 function), 255 rocalution::GlobalMatrix::ConvertToBCSR rocalution::BlockJacobi (C++ class), 265 (C++ function), 244 rocalution::BlockJacobi::Set (C++ rocalution::GlobalMatrix::ConvertToCOO function), 265 (C++ function), 244 rocalution::BlockPreconditioner (C++ rocalution::GlobalMatrix::ConvertToCSR class), 265 (C++ function), 244 rocalution::BlockPreconditioner::Set rocalution::GlobalMatrix::ConvertToDENSE (C++ function), 266 (C++ function), 244 rocalution::BlockPreconditioner::SetDiagonalSolverrocalution::GlobalMatrix::ConvertToDIA (C++ function), 266 (C++ function), 244 rocalution::BlockPreconditioner::SetExternalLastMatrixrocalution::GlobalMatrix::ConvertToELL (C++ function), 266 (C++ function), 244 rocalution::BlockPreconditioner::SetLSolverrocalution::GlobalMatrix::ConvertToHYB (C++ function), 266 (C++ function), 244 rocalution::BlockPreconditioner::SetPermutationrocalution::GlobalMatrix::ConvertToMCSR (C++ function), 266 (C++ function), 244 rocalution::CG (C++ class), 255 rocalution::GlobalMatrix::CopyFrom rocalution::Chebyshev (C++ class), 254 (C++ function), 244 rocalution::Chebyshev::Set (C++ function), rocalution::GlobalMatrix::ExtractInverseDiagonal 255 (C++ function), 245 rocalution::CR (C++ class), 256 rocalution::GlobalMatrix::FurtherPairwiseAggregation rocalution::DiagJacobiSaddlePointPrecond (C++ function), 245 (C++ class), 270 rocalution::GlobalMatrix::GlobalMatrix rocalution::DiagJacobiSaddlePointPrecond::Set (C++ function), 243 (C++ function), 271 rocalution::GlobalMatrix::InitialPairwiseAggregation rocalution::DirectLinearSolver (C++ (C++ function), 245 class), 261 rocalution::GlobalMatrix::LeaveDataPtrCOO rocalution::disable_accelerator_rocalution (C++ function), 244 (C++ function), 215 rocalution::GlobalMatrix::LeaveDataPtrCSR rocalution::FCG (C++ class), 256 (C++ function), 243 rocalution::FGMRES (C++ class), 257 rocalution::GlobalMatrix::LeaveGhostDataPtrCOO rocalution::FGMRES::SetBasisSize (C++ (C++ function), 244 function), 257 rocalution::GlobalMatrix::LeaveGhostDataPtrCSR rocalution::FixedPoint (C++ class), 254 (C++ function), 244 rocalution::FixedPoint::SetRelaxation rocalution::GlobalMatrix::LeaveLocalDataPtrCOO (C++ function), 254 (C++ function), 244 rocalution::free_host (C++ function), 213 rocalution::GlobalMatrix::LeaveLocalDataPtrCSR rocalution::FSAI (C++ class), 263 (C++ function), 244 rocalution::FSAI::Set (C++ function), 264 rocalution::GlobalMatrix::ReadFileCSR rocalution::FSAI::SetPrecondMatrixFormat (C++ function), 244 (C++ function), 264 rocalution::GlobalMatrix::ReadFileMTX rocalution::GlobalMatrix (C++ class), 243 (C++ function), 244 rocalution::GlobalMatrix::AllocateCOO rocalution::GlobalMatrix::Scale (C++

364 Index ReadTheDocs-Breathe Documentation, Release 1.0.0

function), 245 rocalution::IDR::SetShadowSpace (C++ rocalution::GlobalMatrix::SetDataPtrCOO function), 257 (C++ function), 243 rocalution::ILU (C++ class), 267 rocalution::GlobalMatrix::SetDataPtrCSR rocalution::ILU::Set (C++ function), 267 (C++ function), 243 rocalution::ILUT (C++ class), 267 rocalution::GlobalMatrix::SetGhostDataPtrCOOrocalution::ILUT::Set (C++ function), 268 (C++ function), 243 rocalution::info_rocalution (C++ function), rocalution::GlobalMatrix::SetGhostDataPtrCSR 215 (C++ function), 243 rocalution::init_rocalution (C++ function), rocalution::GlobalMatrix::SetLocalDataPtrCOO 213 (C++ function), 243 rocalution::Inversion (C++ class), 262 rocalution::GlobalMatrix::SetLocalDataPtrCSRrocalution::IterativeLinearSolver (C++ (C++ function), 243 class), 252 rocalution::GlobalMatrix::SetParallelManagerrocalution::IterativeLinearSolver::GetAmaxResidualIndex (C++ function), 243 (C++ function), 253 rocalution::GlobalMatrix::Sort (C++ rocalution::IterativeLinearSolver::GetCurrentResidual function), 245 (C++ function), 253 rocalution::GlobalMatrix::WriteFileCSR rocalution::IterativeLinearSolver::GetIterationCount (C++ function), 244 (C++ function), 253 rocalution::GlobalMatrix::WriteFileMTX rocalution::IterativeLinearSolver::GetSolverStatus (C++ function), 244 (C++ function), 253 rocalution::GlobalPairwiseAMG (C++ class), rocalution::IterativeLinearSolver::Init 261 (C++ function), 253 rocalution::GlobalPairwiseAMG::SetBeta rocalution::IterativeLinearSolver::InitMaxIter (C++ function), 261 (C++ function), 253 rocalution::GlobalPairwiseAMG::SetCoarseningFactorrocalution::IterativeLinearSolver::InitMinIter (C++ function), 261 (C++ function), 253 rocalution::GlobalPairwiseAMG::SetOrderingrocalution::IterativeLinearSolver::InitTol (C++ function), 261 (C++ function), 253 rocalution::GlobalVector (C++ class), 248 rocalution::IterativeLinearSolver::RecordHistory rocalution::GlobalVector::Allocate (C++ function), 253 (C++ function), 249 rocalution::IterativeLinearSolver::RecordResidualHistory rocalution::GlobalVector::GlobalVector (C++ function), 253 (C++ function), 249 rocalution::IterativeLinearSolver::SetPreconditioner rocalution::GlobalVector::LeaveDataPtr (C++ function), 253 (C++ function), 249 rocalution::IterativeLinearSolver::SetResidualNorm rocalution::GlobalVector::operator[] (C++ function), 253 (C++ function), 249 rocalution::IterativeLinearSolver::Solve rocalution::GlobalVector::Prolongation (C++ function), 253 (C++ function), 249 rocalution::IterativeLinearSolver::Verbose rocalution::GlobalVector::Restriction (C++ function), 253 (C++ function), 249 rocalution::Jacobi (C++ class), 266 rocalution::GlobalVector::SetDataPtr rocalution::LocalMatrix (C++ class), 224 (C++ function), 249 rocalution::LocalMatrix::AddScalar rocalution::GlobalVector::SetParallelManager (C++ function), 232 (C++ function), 249 rocalution::LocalMatrix::AddScalarDiagonal rocalution::GMRES (C++ class), 256 (C++ function), 232 rocalution::GMRES::SetBasisSize (C++ rocalution::LocalMatrix::AddScalarOffDiagonal function), 256 (C++ function), 232 rocalution::GS (C++ class), 266 rocalution::LocalMatrix::AllocateBCSR rocalution::IC (C++ class), 268 (C++ function), 224 rocalution::IDR (C++ class), 257 rocalution::LocalMatrix::AllocateCOO rocalution::IDR::SetRandomSeed (C++ (C++ function), 225 function), 257 rocalution::LocalMatrix::AllocateCSR

Index 365 ReadTheDocs-Breathe Documentation, Release 1.0.0

(C++ function), 224 (C++ function), 239 rocalution::LocalMatrix::AllocateDENSE rocalution::LocalMatrix::CopyFromCSR (C++ function), 226 (C++ function), 239 rocalution::LocalMatrix::AllocateDIA rocalution::LocalMatrix::CopyFromHostCSR (C++ function), 225 (C++ function), 239 rocalution::LocalMatrix::AllocateELL rocalution::LocalMatrix::CopyToCOO (C++ function), 226 (C++ function), 239 rocalution::LocalMatrix::AllocateHYB rocalution::LocalMatrix::CopyToCSR (C++ function), 226 (C++ function), 239 rocalution::LocalMatrix::AllocateMCSR rocalution::LocalMatrix::CreateFromMap (C++ function), 225 (C++ function), 239 rocalution::LocalMatrix::AMGAggregate rocalution::LocalMatrix::DiagonalMatrixMult (C++ function), 241 (C++ function), 240 rocalution::LocalMatrix::AMGAggregation rocalution::LocalMatrix::DiagonalMatrixMultL (C++ function), 241 (C++ function), 240 rocalution::LocalMatrix::AMGConnect rocalution::LocalMatrix::DiagonalMatrixMultR (C++ function), 241 (C++ function), 240 rocalution::LocalMatrix::AMGSmoothedAggregationrocalution::LocalMatrix::ExtractColumnVector (C++ function), 241 (C++ function), 241 rocalution::LocalMatrix::Check (C++ rocalution::LocalMatrix::ExtractDiagonal function), 224 (C++ function), 233 rocalution::LocalMatrix::CloneFrom rocalution::LocalMatrix::ExtractInverseDiagonal (C++ function), 238 (C++ function), 233 rocalution::LocalMatrix::CMK (C++ rocalution::LocalMatrix::ExtractL (C++ function), 233 function), 233 rocalution::LocalMatrix::CoarsenOperatorrocalution::LocalMatrix::ExtractRowVector (C++ function), 242 (C++ function), 241 rocalution::LocalMatrix::Compress (C++ rocalution::LocalMatrix::ExtractSubMatrices function), 240 (C++ function), 233 rocalution::LocalMatrix::ConnectivityOrderrocalution::LocalMatrix::ExtractSubMatrix (C++ function), 234 (C++ function), 232 rocalution::LocalMatrix::ConvertTo rocalution::LocalMatrix::ExtractU (C++ (C++ function), 240 function), 233 rocalution::LocalMatrix::ConvertToBCSR rocalution::LocalMatrix::FSAI (C++ (C++ function), 239 function), 241 rocalution::LocalMatrix::ConvertToCOO rocalution::LocalMatrix::FurtherPairwiseAggregation (C++ function), 239 (C++ function), 242 rocalution::LocalMatrix::ConvertToCSR rocalution::LocalMatrix::Gershgorin (C++ function), 239 (C++ function), 240 rocalution::LocalMatrix::ConvertToDENSE rocalution::LocalMatrix::GetFormat (C++ function), 240 (C++ function), 224 rocalution::LocalMatrix::ConvertToDIA rocalution::LocalMatrix::Householder (C++ function), 240 (C++ function), 236 rocalution::LocalMatrix::ConvertToELL rocalution::LocalMatrix::ICFactorize (C++ function), 239 (C++ function), 235 rocalution::LocalMatrix::ConvertToHYB rocalution::LocalMatrix::ILU0Factorize (C++ function), 240 (C++ function), 235 rocalution::LocalMatrix::ConvertToMCSR rocalution::LocalMatrix::ILUpFactorize (C++ function), 239 (C++ function), 235 rocalution::LocalMatrix::CopyFrom (C++ rocalution::LocalMatrix::ILUTFactorize function), 238 (C++ function), 235 rocalution::LocalMatrix::CopyFromAsync rocalution::LocalMatrix::InitialPairwiseAggregation (C++ function), 238 (C++ function), 241, 242 rocalution::LocalMatrix::CopyFromCOO rocalution::LocalMatrix::Invert (C++

366 Index ReadTheDocs-Breathe Documentation, Release 1.0.0

function), 236 (C++ function), 237 rocalution::LocalMatrix::Key (C++ rocalution::LocalMatrix::ReadFileMTX function), 240 (C++ function), 236 rocalution::LocalMatrix::LAnalyse (C++ rocalution::LocalMatrix::ReplaceColumnVector function), 235 (C++ function), 241 rocalution::LocalMatrix::LAnalyseClear rocalution::LocalMatrix::ReplaceRowVector (C++ function), 236 (C++ function), 241 rocalution::LocalMatrix::LeaveDataPtrCOO rocalution::LocalMatrix::RugeStueben (C++ function), 229 (C++ function), 241 rocalution::LocalMatrix::LeaveDataPtrCSR rocalution::LocalMatrix::Scale (C++ (C++ function), 230 function), 232 rocalution::LocalMatrix::LeaveDataPtrDENSErocalution::LocalMatrix::ScaleDiagonal (C++ function), 232 (C++ function), 232 rocalution::LocalMatrix::LeaveDataPtrDIArocalution::LocalMatrix::ScaleOffDiagonal (C++ function), 231 (C++ function), 232 rocalution::LocalMatrix::LeaveDataPtrELLrocalution::LocalMatrix::SetDataPtrCOO (C++ function), 231 (C++ function), 227 rocalution::LocalMatrix::LeaveDataPtrMCSRrocalution::LocalMatrix::SetDataPtrCSR (C++ function), 230 (C++ function), 227 rocalution::LocalMatrix::LLAnalyse rocalution::LocalMatrix::SetDataPtrDENSE (C++ function), 235 (C++ function), 229 rocalution::LocalMatrix::LLAnalyseClear rocalution::LocalMatrix::SetDataPtrDIA (C++ function), 235 (C++ function), 229 rocalution::LocalMatrix::LLSolve (C++ rocalution::LocalMatrix::SetDataPtrELL function), 235 (C++ function), 228 rocalution::LocalMatrix::LSolve (C++ rocalution::LocalMatrix::SetDataPtrMCSR function), 236 (C++ function), 228 rocalution::LocalMatrix::LUAnalyse rocalution::LocalMatrix::Sort (C++ (C++ function), 235 function), 240 rocalution::LocalMatrix::LUAnalyseClear rocalution::LocalMatrix::SPAI (C++ (C++ function), 235 function), 241 rocalution::LocalMatrix::LUFactorize rocalution::LocalMatrix::SymbolicPower (C++ function), 235 (C++ function), 240 rocalution::LocalMatrix::LUSolve (C++ rocalution::LocalMatrix::Transpose function), 235 (C++ function), 240 rocalution::LocalMatrix::MatrixAdd rocalution::LocalMatrix::UAnalyse (C++ (C++ function), 240 function), 236 rocalution::LocalMatrix::MatrixMult rocalution::LocalMatrix::UAnalyseClear (C++ function), 240 (C++ function), 236 rocalution::LocalMatrix::MaximalIndependentSetrocalution::LocalMatrix::UpdateValuesCSR (C++ function), 234 (C++ function), 238 rocalution::LocalMatrix::MultiColoring rocalution::LocalMatrix::USolve (C++ (C++ function), 234 function), 236 rocalution::LocalMatrix::Permute (C++ rocalution::LocalMatrix::WriteFileCSR function), 233 (C++ function), 237 rocalution::LocalMatrix::PermuteBackward rocalution::LocalMatrix::WriteFileMTX (C++ function), 233 (C++ function), 236 rocalution::LocalMatrix::QRDecompose rocalution::LocalMatrix::ZeroBlockPermutation (C++ function), 236 (C++ function), 235 rocalution::LocalMatrix::QRSolve (C++ rocalution::LocalMatrix::Zeros (C++ function), 236 function), 232 rocalution::LocalMatrix::RCMK (C++ rocalution::LocalStencil (C++ class), 242 function), 233 rocalution::LocalStencil::GetNDim (C++ rocalution::LocalMatrix::ReadFileCSR function), 242

Index 367 ReadTheDocs-Breathe Documentation, Release 1.0.0 rocalution::LocalStencil::LocalStencil rocalution::MultiColoredILU (C++ class), (C++ function), 242 269 rocalution::LocalStencil::SetGrid (C++ rocalution::MultiColoredILU::Set (C++ function), 242 function), 269, 270 rocalution::LocalVector (C++ class), 245 rocalution::MultiColoredSGS (C++ class), rocalution::LocalVector::Allocate (C++ 269 function), 245 rocalution::MultiColoredSGS::SetRelaxation rocalution::LocalVector::CopyFromData (C++ function), 269 (C++ function), 247 rocalution::MultiElimination (C++ class), rocalution::LocalVector::CopyFromPermute 270 (C++ function), 247 rocalution::MultiElimination::GetLevel rocalution::LocalVector::CopyFromPermuteBackward (C++ function), 270 (C++ function), 247 rocalution::MultiElimination::GetSizeDiagBlock rocalution::LocalVector::CopyToData (C++ function), 270 (C++ function), 248 rocalution::MultiElimination::Set (C++ rocalution::LocalVector::ExtractCoarseBoundary function), 270 (C++ function), 248 rocalution::MultiElimination::SetPrecondMatrixFormat rocalution::LocalVector::ExtractCoarseMapping (C++ function), 270 (C++ function), 248 rocalution::MultiGrid (C++ class), 258 rocalution::LocalVector::GetContinuousValuesrocalution::Operator (C++ class), 217 (C++ function), 248 rocalution::Operator::Apply (C++ function), rocalution::LocalVector::GetIndexValues 217 (C++ function), 248 rocalution::Operator::ApplyAdd (C++ rocalution::LocalVector::LeaveDataPtr function), 217 (C++ function), 246 rocalution::Operator::GetGhostM (C++ rocalution::LocalVector::operator[] function), 217 (C++ function), 246, 247 rocalution::Operator::GetGhostN (C++ rocalution::LocalVector::Permute (C++ function), 217 function), 248 rocalution::Operator::GetGhostNnz (C++ rocalution::LocalVector::PermuteBackward function), 217 (C++ function), 248 rocalution::Operator::GetLocalM (C++ rocalution::LocalVector::Prolongation function), 217 (C++ function), 248 rocalution::Operator::GetLocalN (C++ rocalution::LocalVector::Restriction function), 217 (C++ function), 248 rocalution::Operator::GetLocalNnz (C++ rocalution::LocalVector::SetContinuousValues function), 217 (C++ function), 248 rocalution::Operator::GetM (C++ function), rocalution::LocalVector::SetDataPtr 217 (C++ function), 246 rocalution::Operator::GetN (C++ function), rocalution::LocalVector::SetIndexArray 217 (C++ function), 248 rocalution::Operator::GetNnz (C++ rocalution::LocalVector::SetIndexValues function), 217 (C++ function), 248 rocalution::PairwiseAMG (C++ class), 261 rocalution::LU (C++ class), 262 rocalution::PairwiseAMG::SetBeta (C++ rocalution::MixedPrecisionDC (C++ class), function), 261 254 rocalution::PairwiseAMG::SetCoarseningFactor rocalution::MixedPrecisionDC::Set (C++ (C++ function), 261 function), 254 rocalution::PairwiseAMG::SetOrdering rocalution::MultiColored (C++ class), 268 (C++ function), 261 rocalution::MultiColored::SetDecompositionrocalution::ParallelManager (C++ class), (C++ function), 269 249 rocalution::MultiColored::SetPrecondMatrixFormatrocalution::ParallelManager::Clear (C++ function), 269 (C++ function), 249 rocalution::MultiColoredGS (C++ class), 269 rocalution::ParallelManager::GetGlobalSize

rocalution::ParallelManager::GetLocalSize (C++ function), 249
rocalution::ParallelManager::GetNumProcs (C++ function), 250
rocalution::ParallelManager::GetNumReceivers (C++ function), 250
rocalution::ParallelManager::GetNumSenders (C++ function), 250
rocalution::ParallelManager::GlobalToLocal (C++ function), 250
rocalution::ParallelManager::LocalToGlobal (C++ function), 250
rocalution::ParallelManager::ReadFileASCII (C++ function), 250
rocalution::ParallelManager::SetBoundaryIndex (C++ function), 250
rocalution::ParallelManager::SetGlobalSize (C++ function), 250
rocalution::ParallelManager::SetLocalSize (C++ function), 250
rocalution::ParallelManager::SetMPICommunicator (C++ function), 249
rocalution::ParallelManager::SetReceivers (C++ function), 250
rocalution::ParallelManager::SetSenders (C++ function), 250
rocalution::ParallelManager::Status (C++ function), 250
rocalution::ParallelManager::WriteFileASCII (C++ function), 250
rocalution::Preconditioner (C++ class), 263
rocalution::QMRCGStab (C++ class), 257
rocalution::QR (C++ class), 262
rocalution::RAS (C++ class), 265
rocalution::rocalution_time (C++ function), 213
rocalution::RugeStuebenAMG (C++ class), 260
rocalution::RugeStuebenAMG::SetCouplingStrength (C++ function), 261
rocalution::SAAMG (C++ class), 260
rocalution::SAAMG::SetCouplingStrength (C++ function), 260
rocalution::SAAMG::SetInterpRelax (C++ function), 260
rocalution::set_device_rocalution (C++ function), 214
rocalution::set_omp_affinity_rocalution (C++ function), 215
rocalution::set_omp_threads_rocalution (C++ function), 214
rocalution::set_omp_threshold_rocalution (C++ function), 215
rocalution::set_to_zero_host (C++ function), 213
rocalution::SGS (C++ class), 267
rocalution::Solver (C++ class), 250
rocalution::Solver::Build (C++ function), 251
rocalution::Solver::BuildMoveToAcceleratorAsync (C++ function), 251
rocalution::Solver::Clear (C++ function), 251
rocalution::Solver::MoveToAccelerator (C++ function), 251
rocalution::Solver::MoveToHost (C++ function), 251
rocalution::Solver::Print (C++ function), 251
rocalution::Solver::ReBuildNumeric (C++ function), 251
rocalution::Solver::ResetOperator (C++ function), 251
rocalution::Solver::SetOperator (C++ function), 251
rocalution::Solver::Solve (C++ function), 251
rocalution::Solver::SolveZeroSol (C++ function), 251
rocalution::Solver::Sync (C++ function), 251
rocalution::Solver::Verbose (C++ function), 251
rocalution::SPAI (C++ class), 264
rocalution::SPAI::SetPrecondMatrixFormat (C++ function), 264
rocalution::stop_rocalution (C++ function), 214
rocalution::TNS (C++ class), 264
rocalution::TNS::Set (C++ function), 264
rocalution::TNS::SetPrecondMatrixFormat (C++ function), 264
rocalution::UAAMG (C++ class), 260
rocalution::UAAMG::SetCouplingStrength (C++ function), 260
rocalution::UAAMG::SetOverInterp (C++ function), 260
rocalution::VariablePreconditioner (C++ class), 268
rocalution::VariablePreconditioner::SetPreconditioner (C++ function), 268
rocalution::Vector (C++ class), 218
rocalution::Vector::AddScale (C++ function), 222
rocalution::Vector::Amax (C++ function), 223
rocalution::Vector::Asum (C++ function), 223
rocalution::Vector::Check (C++ function), 218
rocalution::Vector::CloneFrom (C++ function), 221, 222

rocalution::Vector::CopyFrom (C++ function), 220, 221
rocalution::Vector::CopyFromAsync (C++ function), 221
rocalution::Vector::CopyFromDouble (C++ function), 221
rocalution::Vector::CopyFromFloat (C++ function), 221
rocalution::Vector::Dot (C++ function), 223
rocalution::Vector::DotNonConj (C++ function), 223
rocalution::Vector::GetGhostSize (C++ function), 218
rocalution::Vector::GetLocalSize (C++ function), 218
rocalution::Vector::GetSize (C++ function), 218
rocalution::Vector::Norm (C++ function), 223
rocalution::Vector::Ones (C++ function), 218
rocalution::Vector::PointWiseMult (C++ function), 223
rocalution::Vector::Power (C++ function), 223
rocalution::Vector::ReadFileASCII (C++ function), 218
rocalution::Vector::ReadFileBinary (C++ function), 219
rocalution::Vector::Reduce (C++ function), 223
rocalution::Vector::Scale (C++ function), 223
rocalution::Vector::ScaleAdd (C++ function), 222
rocalution::Vector::ScaleAdd2 (C++ function), 223
rocalution::Vector::ScaleAddScale (C++ function), 222
rocalution::Vector::SetRandomNormal (C++ function), 218
rocalution::Vector::SetRandomUniform (C++ function), 218
rocalution::Vector::SetValues (C++ function), 218
rocalution::Vector::WriteFileASCII (C++ function), 219
rocalution::Vector::WriteFileBinary (C++ function), 219
rocalution::Vector::Zeros (C++ function), 218
rocblas (C++ type), 103
rocblas::reinit_logs (C++ function), 103
rocblas_add_stream (C++ function), 102, 103
rocblas_create_handle (C++ function), 101, 103, 123
rocblas_dasum (C++ function), 88, 106
rocblas_datatype (C++ enum), 85, 120
rocblas_datatype_f16_c (C++ enumerator), 85, 120
rocblas_datatype_f16_r (C++ enumerator), 85, 120
rocblas_datatype_f32_c (C++ enumerator), 85, 120
rocblas_datatype_f32_r (C++ enumerator), 85, 120
rocblas_datatype_f64_c (C++ enumerator), 85, 120
rocblas_datatype_f64_r (C++ enumerator), 85, 120
rocblas_datatype_i32_c (C++ enumerator), 85, 120
rocblas_datatype_i32_r (C++ enumerator), 85, 120
rocblas_datatype_i8_c (C++ enumerator), 85, 120
rocblas_datatype_i8_r (C++ enumerator), 85, 120
rocblas_datatype_u32_c (C++ enumerator), 85, 120
rocblas_datatype_u32_r (C++ enumerator), 85, 120
rocblas_datatype_u8_c (C++ enumerator), 85, 120
rocblas_datatype_u8_r (C++ enumerator), 85, 120
rocblas_daxpy (C++ function), 88, 105
rocblas_dcopy (C++ function), 86, 104
rocblas_ddot (C++ function), 87, 105
rocblas_destroy_handle (C++ function), 102, 103, 123
rocblas_dgeam (C++ function), 97, 115
rocblas_dgemm (C++ function), 94, 113
rocblas_dgemm_kernel_name (C++ function), 97, 114
rocblas_dgemm_strided_batched (C++ function), 95, 114
rocblas_dgemv (C++ function), 90, 108
rocblas_dger (C++ function), 91, 109
rocblas_diagonal (C++ enum), 84, 119
rocblas_diagonal_non_unit (C++ enumerator), 84, 119
rocblas_diagonal_unit (C++ enumerator), 84, 119
rocblas_dnrm2 (C++ function), 89, 106
rocblas_double_complex (C++ type), 83, 118
rocblas_dscal (C++ function), 86, 104
rocblas_dswap (C++ function), 87, 105
rocblas_dsyr (C++ function), 92, 110

rocblas_dtrsm (C++ function), 93, 112
rocblas_dtrsv (C++ function), 91, 108
rocblas_dtrtri (C++ function), 110
rocblas_dtrtri_batched (C++ function), 93, 111
rocblas_fill (C++ enum), 83, 119
rocblas_fill_full (C++ enumerator), 83, 119
rocblas_fill_lower (C++ enumerator), 83, 119
rocblas_fill_upper (C++ enumerator), 83, 119
rocblas_float_complex (C++ type), 82, 118
rocblas_gemm_algo (C++ enum), 86, 121
rocblas_gemm_algo_standard (C++ enumerator), 86, 121
rocblas_gemm_ex (C++ function), 98, 115
rocblas_gemm_strided_batched_ex (C++ function), 99, 115
rocblas_get_matrix (C++ function), 102, 103, 123
rocblas_get_pointer_mode (C++ function), 102, 103, 123
rocblas_get_stream (C++ function), 102, 103, 123
rocblas_get_vector (C++ function), 102, 103, 123
rocblas_get_version_string (C++ function), 100, 117, 121
rocblas_half (C++ type), 83, 118
rocblas_half_complex (C++ type), 83, 118
rocblas_handle (C++ type), 83, 118
rocblas_haxpy (C++ function), 88, 105
rocblas_hgemm (C++ function), 94, 112
rocblas_hgemm_kernel_name (C++ function), 97, 114
rocblas_hgemm_strided_batched (C++ function), 96, 113
rocblas_idamax (C++ function), 89, 107
rocblas_idamin (C++ function), 89, 107
rocblas_int (C++ type), 82, 118
rocblas_isamax (C++ function), 89, 106
rocblas_isamin (C++ function), 89, 107
rocblas_layer_mode (C++ enum), 85, 120
rocblas_layer_mode_log_bench (C++ enumerator), 85, 120
rocblas_layer_mode_log_profile (C++ enumerator), 85, 121
rocblas_layer_mode_log_trace (C++ enumerator), 85, 120
rocblas_layer_mode_none (C++ enumerator), 85, 120
rocblas_long (C++ type), 82, 118
rocblas_operation (C++ enum), 83, 119
rocblas_operation_conjugate_transpose (C++ enumerator), 83, 119
rocblas_operation_none (C++ enumerator), 83, 119
rocblas_operation_transpose (C++ enumerator), 83, 119
rocblas_pointer_mode (C++ enum), 85, 120
rocblas_pointer_mode_device (C++ enumerator), 85, 120
rocblas_pointer_mode_host (C++ enumerator), 85, 120
rocblas_pointer_to_mode (C++ function), 101, 103, 123
rocblas_sasum (C++ function), 88, 106
rocblas_saxpy (C++ function), 88, 105
rocblas_scopy (C++ function), 86, 104
rocblas_sdot (C++ function), 87, 104
rocblas_set_matrix (C++ function), 102, 103, 123
rocblas_set_pointer_mode (C++ function), 102, 103, 123
rocblas_set_stream (C++ function), 102, 103, 123
rocblas_set_vector (C++ function), 102, 103, 123
rocblas_sgeam (C++ function), 97, 114
rocblas_sgemm (C++ function), 94, 113
rocblas_sgemm_kernel_name (C++ function), 97, 114
rocblas_sgemm_strided_batched (C++ function), 95, 114
rocblas_sgemv (C++ function), 90, 107
rocblas_sger (C++ function), 91, 108
rocblas_side (C++ enum), 84, 119
rocblas_side_both (C++ enumerator), 84, 119
rocblas_side_left (C++ enumerator), 84, 119
rocblas_side_right (C++ enumerator), 84, 119
rocblas_snrm2 (C++ function), 89, 106
rocblas_sscal (C++ function), 86, 104
rocblas_sswap (C++ function), 87, 105
rocblas_ssyr (C++ function), 92, 109
rocblas_status (C++ enum), 84, 119
rocblas_status_internal_error (C++ enumerator), 84, 120
rocblas_status_invalid_handle (C++ enumerator), 84, 119
rocblas_status_invalid_pointer (C++ enumerator), 84, 120
rocblas_status_invalid_size (C++ enumerator), 84, 120
rocblas_status_memory_error (C++ enumerator), 84, 120
rocblas_status_not_implemented (C++ enumerator), 84, 120
rocblas_status_success (C++ enumerator), 84, 119
rocblas_strsm (C++ function), 94, 111

rocblas_strsv (C++ function), 91, 108
rocblas_strtri (C++ function), 110
rocblas_strtri_batched (C++ function), 93, 110
rocfft_array_type (C++ enum), 138
rocfft_array_type_complex_interleaved (C++ enumerator), 138
rocfft_array_type_complex_planar (C++ enumerator), 138
rocfft_array_type_hermitian_interleaved (C++ enumerator), 138
rocfft_array_type_hermitian_planar (C++ enumerator), 138
rocfft_array_type_real (C++ enumerator), 138
rocfft_cleanup (C++ function), 133
rocfft_exec_mode_blocking (C++ enumerator), 138
rocfft_exec_mode_nonblocking (C++ enumerator), 138
rocfft_exec_mode_nonblocking_with_flush (C++ enumerator), 138
rocfft_execute (C++ function), 136
rocfft_execution_info (C++ type), 133
rocfft_execution_info_create (C++ function), 136
rocfft_execution_info_destroy (C++ function), 137
rocfft_execution_info_set_stream (C++ function), 137
rocfft_execution_info_set_work_buffer (C++ function), 137
rocfft_execution_mode (C++ enum), 138
rocfft_placement_inplace (C++ enumerator), 138
rocfft_placement_notinplace (C++ enumerator), 138
rocfft_plan (C++ type), 133
rocfft_plan_create (C++ function), 134
rocfft_plan_description (C++ type), 133
rocfft_plan_description_create (C++ function), 135
rocfft_plan_description_destroy (C++ function), 135
rocfft_plan_description_set_data_layout (C++ function), 135
rocfft_plan_destroy (C++ function), 134
rocfft_plan_get_print (C++ function), 134
rocfft_plan_get_work_buffer_size (C++ function), 134
rocfft_precision (C++ enum), 138
rocfft_precision_double (C++ enumerator), 138
rocfft_precision_single (C++ enumerator), 138
rocfft_result_placement (C++ enum), 138
rocfft_setup (C++ function), 133
rocfft_status (C++ enum), 137
rocfft_status_failure (C++ enumerator), 137
rocfft_status_invalid_arg_value (C++ enumerator), 137
rocfft_status_invalid_array_type (C++ enumerator), 137
rocfft_status_invalid_dimensions (C++ enumerator), 137
rocfft_status_invalid_distance (C++ enumerator), 138
rocfft_status_invalid_offset (C++ enumerator), 138
rocfft_status_invalid_strides (C++ enumerator), 138
rocfft_status_success (C++ enumerator), 137
rocfft_transform_type (C++ enum), 138
rocfft_transform_type_complex_forward (C++ enumerator), 138
rocfft_transform_type_complex_inverse (C++ enumerator), 138
rocfft_transform_type_real_forward (C++ enumerator), 138
rocfft_transform_type_real_inverse (C++ enumerator), 138
rocsparse_action (C++ enum), 145
rocsparse_action_numeric (C++ enumerator), 145
rocsparse_action_symbolic (C++ enumerator), 145
rocsparse_analysis_policy (C++ enum), 148
rocsparse_analysis_policy_force (C++ enumerator), 148
rocsparse_analysis_policy_reuse (C++ enumerator), 148
rocsparse_coo2csr (C++ function), 189
rocsparse_coosort_buffer_size (C++ function), 202
rocsparse_coosort_by_column (C++ function), 205
rocsparse_coosort_by_row (C++ function), 203
rocsparse_copy_mat_descr (C++ function), 154
rocsparse_create_handle (C++ function), 150
rocsparse_create_hyb_mat (C++ function), 156
rocsparse_create_identity_permutation (C++ function), 199
rocsparse_create_mat_descr (C++ function), 153
rocsparse_create_mat_info (C++ function), 157
rocsparse_csr2coo (C++ function), 188
rocsparse_csr2csc_buffer_size (C++ function), 191, 197

rocsparse_csr2ell_width (C++ function), 194
rocsparse_csrilu0_clear (C++ function), 187
rocsparse_csrilu0_zero_pivot (C++ function), 180
rocsparse_csrmv_clear (C++ function), 168
rocsparse_csrsort (C++ function), 201
rocsparse_csrsort_buffer_size (C++ function), 200
rocsparse_csrsv_clear (C++ function), 176
rocsparse_csrsv_zero_pivot (C++ function), 171
rocsparse_daxpyi (C++ function), 158
rocsparse_dcoomv (C++ function), 163
rocsparse_dcsr2csc (C++ function), 191
rocsparse_dcsr2ell (C++ function), 194
rocsparse_dcsr2hyb (C++ function), 198
rocsparse_dcsrilu0 (C++ function), 183
rocsparse_dcsrilu0_analysis (C++ function), 182
rocsparse_dcsrilu0_buffer_size (C++ function), 181
rocsparse_dcsrmm (C++ function), 177
rocsparse_dcsrmv (C++ function), 165
rocsparse_dcsrmv_analysis (C++ function), 164
rocsparse_dcsrsv_analysis (C++ function), 172
rocsparse_dcsrsv_buffer_size (C++ function), 171
rocsparse_dcsrsv_solve (C++ function), 174
rocsparse_ddoti (C++ function), 159
rocsparse_dellmv (C++ function), 168
rocsparse_destroy_handle (C++ function), 150
rocsparse_destroy_hyb_mat (C++ function), 157
rocsparse_destroy_mat_descr (C++ function), 153
rocsparse_destroy_mat_info (C++ function), 157
rocsparse_dgthr (C++ function), 159
rocsparse_dgthrz (C++ function), 160
rocsparse_dhybmv (C++ function), 170
rocsparse_diag_type (C++ enum), 147
rocsparse_diag_type_non_unit (C++ enumerator), 147
rocsparse_diag_type_unit (C++ enumerator), 147
rocsparse_droti (C++ function), 161
rocsparse_dsctr (C++ function), 162
rocsparse_ell2csr_nnz (C++ function), 196
rocsparse_fill_mode (C++ enum), 146
rocsparse_fill_mode_lower (C++ enumerator), 147
rocsparse_fill_mode_upper (C++ enumerator), 147
rocsparse_get_git_rev (C++ function), 153
rocsparse_get_mat_diag_type (C++ function), 156
rocsparse_get_mat_fill_mode (C++ function), 156
rocsparse_get_mat_index_base (C++ function), 154
rocsparse_get_mat_type (C++ function), 155
rocsparse_get_pointer_mode (C++ function), 152
rocsparse_get_stream (C++ function), 151
rocsparse_get_version (C++ function), 152
rocsparse_handle (C++ type), 144
rocsparse_hyb_mat (C++ type), 145
rocsparse_hyb_partition (C++ enum), 145
rocsparse_hyb_partition_auto (C++ enumerator), 145
rocsparse_hyb_partition_max (C++ enumerator), 146
rocsparse_hyb_partition_user (C++ enumerator), 146
rocsparse_index_base (C++ enum), 146
rocsparse_index_base_one (C++ enumerator), 146
rocsparse_index_base_zero (C++ enumerator), 146
rocsparse_layer_mode (C++ enum), 148
rocsparse_layer_mode_log_bench (C++ enumerator), 148
rocsparse_layer_mode_log_trace (C++ enumerator), 148
rocsparse_layer_mode_none (C++ enumerator), 148
rocsparse_mat_descr (C++ type), 145
rocsparse_mat_info (C++ type), 145
rocsparse_matrix_type (C++ enum), 146
rocsparse_matrix_type_general (C++ enumerator), 146
rocsparse_matrix_type_hermitian (C++ enumerator), 146
rocsparse_matrix_type_symmetric (C++ enumerator), 146
rocsparse_matrix_type_triangular (C++ enumerator), 146
rocsparse_operation (C++ enum), 147
rocsparse_operation_conjugate_transpose (C++ enumerator), 147
rocsparse_operation_none (C++ enumerator), 147
rocsparse_operation_transpose (C++ enumerator), 147
rocsparse_pointer_mode (C++ enum), 147

rocsparse_pointer_mode_device (C++ enumerator), 147
rocsparse_pointer_mode_host (C++ enumerator), 147
rocsparse_saxpyi (C++ function), 158
rocsparse_scoomv (C++ function), 163
rocsparse_scsr2csc (C++ function), 191
rocsparse_scsr2ell (C++ function), 194
rocsparse_scsr2hyb (C++ function), 198
rocsparse_scsrilu0 (C++ function), 183
rocsparse_scsrilu0_analysis (C++ function), 182
rocsparse_scsrilu0_buffer_size (C++ function), 181
rocsparse_scsrmm (C++ function), 177
rocsparse_scsrmv (C++ function), 165
rocsparse_scsrmv_analysis (C++ function), 164
rocsparse_scsrsv_analysis (C++ function), 172
rocsparse_scsrsv_buffer_size (C++ function), 171
rocsparse_scsrsv_solve (C++ function), 174
rocsparse_sdoti (C++ function), 159
rocsparse_sellmv (C++ function), 168
rocsparse_set_mat_diag_type (C++ function), 156
rocsparse_set_mat_fill_mode (C++ function), 155
rocsparse_set_mat_index_base (C++ function), 154
rocsparse_set_mat_type (C++ function), 155
rocsparse_set_pointer_mode (C++ function), 151
rocsparse_set_stream (C++ function), 150
rocsparse_sgthr (C++ function), 159
rocsparse_sgthrz (C++ function), 160
rocsparse_shybmv (C++ function), 170
rocsparse_solve_policy (C++ enum), 148
rocsparse_solve_policy_auto (C++ enumerator), 148
rocsparse_sroti (C++ function), 161
rocsparse_ssctr (C++ function), 162
rocsparse_status (C++ enum), 148
rocsparse_status_arch_mismatch (C++ enumerator), 149
rocsparse_status_internal_error (C++ enumerator), 149
rocsparse_status_invalid_handle (C++ enumerator), 149
rocsparse_status_invalid_pointer (C++ enumerator), 149
rocsparse_status_invalid_size (C++ enumerator), 149
rocsparse_status_invalid_value (C++ enumerator), 149
rocsparse_status_memory_error (C++ enumerator), 149
rocsparse_status_not_implemented (C++ enumerator), 149
rocsparse_status_success (C++ enumerator), 149
rocsparse_status_zero_pivot (C++ enumerator), 149

T

TO_STR (C macro), 121
TO_STR2 (C macro), 121

V

VEC_BUFF_MAX_BYTES (C++ member), 123
VERSION_STRING (C macro), 121
