Reference Guide

Total Page:16

File Type:pdf, Size:1020Kb

Reference Guide Reference Guide Graphics Core Next Architecture, Generation 3 August 2016 Revision 1.1 Copyright © 2016 Advanced Micro Devices, Inc. All rights reserved. DISCLAIMER The information contained herein is for informational purposes only, and is subject to change without notice. While every precaution has been taken in the preparation of this document, it may contain technical inaccuracies, omissions and typographical errors, and AMD is under no obligation to update or otherwise correct this information. Advanced Mi- cro Devices, Inc. makes no representations or warranties with respect to the accuracy or completeness of the contents of this document, and assumes no liability of any kind, in- cluding the implied warranties of noninfringement, merchantability or fitness for particular purposes, with respect to the operation or use of AMD hardware, software or other prod- ucts described herein. No license, including implied or arising by estoppel, to any intellec- tual property rights is granted by this document. Terms and limitations applicable to the purchase or use of AMD’s products are as set forth in a signed agreement between the parties or in AMD's Standard Terms and Conditions of Sale. © 2016 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo, AMD Accelerated Parallel Processing, the AMD Accelerated Parallel Processing logo, and combinations thereof are trademarks of Advanced Micro Devices, Inc. OpenCL and the OpenCL logo are trademarks of Apple Inc. used by permission by Khronos. Other names are for informational purposes only and may be trademarks of their respective owners. AMD’s products are not designed, intended, authorized or warranted for use as compo- nents in systems intended for surgical implant into the body, or in other applications intended to support or sustain life, or in any other application in which the failure of AMD’s product could create a situation where personal injury, death, or severe property or envi- ronmental damage may occur. AMD reserves the right to discontinue or make changes to its products at any time without notice. Advanced Micro Devices, Inc. One AMD Place P.O. Box 3453 Sunnyvale, CA 94088-3453 www.amd.com For AMD Accelerated Parallel Processing: URL: developer.amd.com/appsdk Developing: developer.amd.com/ Forum: developer.amd.com/openclforum ii AMD GRAPHICS CORE NEXT TECHNOLOGY Contents Contents Preface Chapter 1 Introduction Chapter 2 Program Organization 2.1 The Compute Shader Program Type ............................................................................................. 2-1 2.2 Instruction Terminology .................................................................................................................. 2-2 2.3 Data Sharing..................................................................................................................................... 2-3 2.3.1 Local Data Share (LDS)...................................................................................................2-4 2.3.2 Global Data Share (GDS).................................................................................................2-5 2.4 Device Memory................................................................................................................................. 2-5 Chapter 3Kernel State 3.1 State Overview ................................................................................................................................. 3-1 3.2 Program Counter (PC) .................................................................................................................... 3-2 3.3 EXECute Mask.................................................................................................................................. 3-2 3.4 Status Registers............................................................................................................................... 3-2 3.5 Mode Register .................................................................................................................................. 3-4 3.6 GPRs and LDS ................................................................................................................................. 3-5 3.6.1 Out-of-Range Behavior....................................................................................................3-5 3.6.2 SGPR Allocation and Storage ........................................................................................3-6 3.6.3 SGPR Alignment...............................................................................................................3-6 3.6.4 VGPR Allocation and Alignment ....................................................................................3-6 3.6.5 LDS Allocation and Clamping ........................................................................................3-6 3.7 M# Memory Descriptor .................................................................................................................... 3-7 3.8 SCC: Scalar Condition Code........................................................................................................... 3-7 3.9 Vector Compares: VCC and VCCZ.................................................................................................. 3-8 3.10 Trap and Exception Registers........................................................................................................ 3-8 3.11 Memory Violations ......................................................................................................................... 3-10 Chapter 4 Program Flow Control 4.1 Program Control............................................................................................................................... 4-1 4.2 Branching.......................................................................................................................................... 4-1 4.3 Work-Groups..................................................................................................................................... 4-2 iii Copyright © 2016 Advanced Micro Devices, Inc. All rights reserved. AMD GRAPHICS CORE NEXT TECHNOLOGY 4.4 Data Dependency Resolution ......................................................................................................... 4-2 4.5 Manually Inserted Wait States (NOPs) .......................................................................................... 4-4 4.6 Arbitrary Divergent Control Flow................................................................................................... 4-5 Chapter 5 Scalar ALU Operations 5.1 SALU Instruction Formats .............................................................................................................. 5-1 5.2 Scalar ALU Operands...................................................................................................................... 5-2 5.3 Scalar Condition Code (SCC)......................................................................................................... 5-2 5.4 Integer Arithmetic Instructions....................................................................................................... 5-5 5.5 Conditional Instructions.................................................................................................................. 5-5 5.6 Comparison Instructions................................................................................................................. 5-6 5.7 Bit-Wise Instructions ....................................................................................................................... 5-6 5.8 Special Instructions......................................................................................................................... 5-8 Chapter 6 Vector ALU Operations 6.1 Microcode Encodings...................................................................................................................... 6-1 6.2 Operands........................................................................................................................................... 6-2 6.2.1 Instruction Inputs.............................................................................................................6-2 6.2.2 Instruction Outputs..........................................................................................................6-3 6.2.3 Out-of-Range GPRs..........................................................................................................6-4 6.2.4 GPR Indexing ..................................................................................................................6-5 6.3 Instructions....................................................................................................................................... 6-5 6.4 Denormalized and Rounding Modes ............................................................................................. 6-7 6.5 ALU CLAMP Bit Usage.................................................................................................................... 6-7 6.6 VGPR Indexing ................................................................................................................................. 6-7 6.6.1 Indexing Instructions.......................................................................................................6-8 6.6.2 Special Cases ...................................................................................................................6-8 Chapter 7 Scalar Memory Operations 7.1 Microcode Encoding.......................................................................................................................
Recommended publications
  • Small Form Factor 3D Graphics for Your Pc
    VisionTek Part# 900701 PRODUCTIVITY SERIES: SMALL FORM FACTOR 3D GRAPHICS FOR YOUR PC The VisionTek Radeon R7 240SFF graphics card offers a perfect balance of performance, features, and affordability for the gamer seeking a complete solution. It offers support for the DIRECTX® 11.2 graphics standard and 4K Ultra HD for stunning 3D visual effects, realistic lighting, and lifelike imagery. Its Short Form Factor design enables it to fit into the latest Low Profile desktops and workstations, yet the R7 240SFF can be converted to a standard ATX design with the included tall bracket. With 2GB of DDR3 memory and award-winning Graphics Core Next (GCN) architecture, and DVI-D/HDMI outputs, the VisionTek Radeon R7 240SFF is big on features and light on your wallet. RADEON R7 240 SPECS • Graphics Engine: RADEON R7 240 • Video Memory: 2GB DDR3 • Memory Interface: 128bit • DirectX® Support: 11.2 • Bus Standard: PCI Express 3.0 • Core Speed: 780MHz • Memory Speed: 800MHz x2 • VGA Output: VGA* • DVI Output: SL DVI-D • HDMI Output: HDMI (Video/Audio) • UEFI Ready: Support SYSTEM REQUIREMENTS • PCI Express® based PC is required with one X16 lane graphics slot available on the motherboard. • 400W (or greater) power supply GCN Architecture: A new design for AMD’s unified graphics processing and compute cores that allows recommended. 500 Watt for AMD them to achieve higher utilization for improved performance and efficiency. CrossFire™ technology in dual mode. • Minimum 1GB of system memory. 4K Ultra HD Support: Experience what you’ve been missing even at 1080P! With support for 3840 x • Installation software requires CD-ROM 2160 output via the HDMI port, textures and other detail normally compressed for lower resolutions drive.
    [Show full text]
  • AMD Firepro™Professional Graphics for CAD & Engineering and Media & Entertainment
    AMD FirePro™Professional Graphics for CAD & Engineering and Media & Entertainment Performance at every price point. AMD FirePro professional graphics offer breakthrough capabilities that can help maximize productivity and help lower cost and complexity — giving you the edge you need in your business. Outstanding graphics performance, compute power and ultrahigh-resolution multidisplay capabilities allows broadcast, design and engineering professionals to work at a whole new level of detail, speed, responsiveness and creativity. AMD FireProTM W9100 AMD FireProTM W8100 With 16GB GDDR5 memory and the ability to support up to six 4K The new AMD FirePro W8100 workstation graphics card is based on displays via six Mini DisplayPort outputs,1 the AMD FirePro W9100 the AMD Graphics Core Next (GCN) GPU architecture and packs up graphics card is the ideal single-GPU solution for the next generation to 4.2 TFLOPS of compute power to accelerate your projects beyond of ultrahigh-resolution visualization environments. just graphics. AMD FireProTM W7100 AMD FireProTM W5100 The new AMD FirePro W7100 graphics card delivers 8GB The new AMD FirePro™ W5100 graphics card delivers optimized of memory, application performance and special features application and multidisplay performance for midrange users. that media and entertainment and design and engineering With 4GB of ultra-fast GDDR5 memory, users can tackle moderately professionals need to take their projects to the next level. complex models, assemblies, data sets or advanced visual effects with ease. AMD FireProTM W4100 AMD FireProTM W2100 In a class of its own, the AMD FirePro Professional graphics starts with AMD W4100 graphics card is the best choice FirePro W2100 graphics, delivering for entry-level users who need a boost in optimized and certified professional graphics performance to better address application performance that similarly- their evolving workflows.
    [Show full text]
  • Minimum System Requirements for Illustrator
    Your computer must meet the minimum technical specifications outlined below to run and use Illustrator. Applicable for: November 2019 release System requirements for earlier releases Illustrator system requirements | earlier releases Minimum system requirements for Illustrator Windows Minimum requirement Processor Multicore Intel processor (with 64-bit support) or AMD Athlon 64 processor Operating Microsoft Windows 7* (64-bit) with Service Pack 1 or Windows 10** (64-bit) system RAM 8 GB of RAM (16 GB recommended) Hard disk 2 GB of available hard-disk space for installation; additional free space required during installation; SSD recommended Monitor resolution 1024 x 768 display (1920 x 1080 recommended) To use Touch workspace in Illustrator, you must have a touch-screen-enabled tablet/monitor running Windows 10 (Microsoft Surface Pro 3 recommended). GPU OpenGL 4.x Optional: To use GPU Performance: Your Windows should have a minimum of 1GB of VRAM (4 GB recommended), and your computer must support OpenGL version 4.0 or greater. Internet connection Internet connection and registration are necessary for required software activation, validation of subscriptions, and access to online services.* * Cloud documents are not supported on Windows 7. * * Not supported on Windows 10 versions 1507, 1511, 1607, 1703, and 1709. Note: Scalable UI feature is supported on Windows 10 (minimum resolution supported is 1920 x 1080). Auto handle feature is supported on Windows 10. GPU in Outline mode is supported on monitor with display resolution at least 2000 pixels in any dimension. Supported video adapter The following video adapter series support the new Windows GPU Performance features in Illustrator: NVIDIA NVIDIA Quadro K Series NVIDIA Quadro 6xxx NVIDIA Quadro 5xxx NVIDIA Quadro 4xxx NVIDIA Quadro 2xxx NVIDIA Quadro 2xxxD NVIDIA Quadro 6xx NVIDIA GeForce GTX Series (4xx, 5xx, 6xx, 7xx, 9xx, Titan) NVIDIA Quadro M Series NVIDIA Quadro P Series NVIDIA Quadro RTX 4000 Important: Microsoft Windows may not detect the availability of the latest device drivers for NVIDIA GPU cards.
    [Show full text]
  • Monte Carlo Evaluation of Financial Options Using a GPU a Thesis
    Monte Carlo Evaluation of Financial Options using a GPU Claus Jespersen 20093084 A thesis presented for the degree of Master of Science Computer Science Department Aarhus University Denmark 02-02-2015 Supervisor: Gerth Brodal Abstract The financial sector has in the last decades introduced several new fi- nancial instruments. Among these instruments, are the financial options, which for some cases can be difficult if not impossible to evaluate analyti- cally. In those cases the Monte Carlo method can be used for pricing these instruments. The Monte Carlo method is a computationally expensive al- gorithm for pricing options, but is at the same time an embarrassingly parallel algorithm. Modern Graphical Processing Units (GPU) can be used for general purpose parallel-computing, and the Monte Carlo method is an ideal candidate for GPU acceleration. In this thesis, we will evaluate the classical vanilla European option, an arithmetic Asian option, and an Up-and-out barrier option using the Monte Carlo method accelerated on a GPU. We consider two scenarios; a single option evaluation, and a se- quence of a varying amount of option evaluations. We report performance speedups of up to 290x versus a single threaded CPU implementation and up to 53x versus a multi threaded CPU implementation. 1 Contents I Theoretical aspects of Computational Finance 5 1 Computational Finance 5 1.1 Options . .7 1.1.1 Types of options . .7 1.1.2 Exotic options . .9 1.2 Pricing of options . 11 1.2.1 The Black-Scholes Partial Differential Equation . 11 1.2.2 Solving the PDE and pricing vanilla European options .
    [Show full text]
  • PC-Grade Parallel Processing and Hardware Acceleration for Large-Scale Data Analysis
    University of Huddersfield Repository Yang, Su PC-Grade Parallel Processing and Hardware Acceleration for Large-Scale Data Analysis Original Citation Yang, Su (2009) PC-Grade Parallel Processing and Hardware Acceleration for Large-Scale Data Analysis. Doctoral thesis, University of Huddersfield. This version is available at http://eprints.hud.ac.uk/id/eprint/8754/ The University Repository is a digital collection of the research output of the University, available on Open Access. Copyright and Moral Rights for the items on this site are retained by the individual author and/or other copyright owners. Users may access full items free of charge; copies of full text items generally can be reproduced, displayed or performed and given to third parties in any format or medium for personal research or study, educational or not-for-profit purposes without prior permission or charge, provided: • The authors, title and full bibliographic details is credited in any copy; • A hyperlink and/or URL is included for the original metadata page; and • The content is not changed in any way. For more information, including our policy and submission procedure, please contact the Repository Team at: [email protected]. http://eprints.hud.ac.uk/ PC-Grade Parallel Processing and Hardware Acceleration for Large-Scale Data Analysis Yang Su A thesis submitted to the University of Huddersfield in partial fulfilment of the requirements for the degree of Doctor of Philosophy School of Computing and Engineering University of Huddersfield October 2009 Acknowledgments I would like to thank the School of Computing and Engineering at the University of Huddersfield for providing this great opportunity of study and facilitating me throughout this project.
    [Show full text]
  • Deep Dive: Asynchronous Compute
    Deep Dive: Asynchronous Compute Stephan Hodes Developer Technology Engineer, AMD Alex Dunn Developer Technology Engineer, NVIDIA Joint Session AMD NVIDIA ● Graphics Core Next (GCN) ● Maxwell, Pascal ● Compute Unit (CU) ● Streaming Multiprocessor (SM) ● Wavefronts ● Warps 2 Terminology Asynchronous: Not independent, async work shares HW Work Pairing: Items of GPU work that execute simultaneously Async. Tax: Overhead cost associated with asynchronous compute 3 Async Compute More Performance 4 Queue Fundamentals 3 Queue Types: 3D ● Copy/DMA Queue ● Compute Queue COMPUTE ● Graphics Queue COPY All run asynchronously! 5 General Advice ● Always profile! 3D ● Can make or break perf ● Maintain non-async paths COMPUTE ● Profile async on/off ● Some HW won’t support async ● ‘Member hyper-threading? COPY ● Similar rules apply ● Avoid throttling shared HW resources 6 Regime Pairing Good Pairing Poor Pairing Graphics Compute Graphics Compute Shadow Render Light culling G-Buffer SSAO (Geometry (ALU heavy) (Bandwidth (Bandwidth limited) limited) limited) (Technique pairing doesn’t have to be 1-to-1) 7 - Red Flags Problem/Solution Format Topics: ● Resource Contention - ● Descriptor heaps - ● Synchronization models ● Avoiding “async-compute tax” 8 Hardware Details - ● 4 SIMD per CU ● Up to 10 Wavefronts scheduled per SIMD ● Accomplish latency hiding ● Graphics and Compute can execute simultanesouly on same CU ● Graphics workloads usually have priority over Compute 9 Resource Contention – Problem: Per SIMD resources are shared between Wavefronts SIMD executes
    [Show full text]
  • Contributions of Hybrid Architectures to Depth Imaging: a CPU, APU and GPU Comparative Study
    Contributions of hybrid architectures to depth imaging : a CPU, APU and GPU comparative study Issam Said To cite this version: Issam Said. Contributions of hybrid architectures to depth imaging : a CPU, APU and GPU com- parative study. Hardware Architecture [cs.AR]. Université Pierre et Marie Curie - Paris VI, 2015. English. NNT : 2015PA066531. tel-01248522v2 HAL Id: tel-01248522 https://tel.archives-ouvertes.fr/tel-01248522v2 Submitted on 20 May 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THESE` DE DOCTORAT DE l’UNIVERSITE´ PIERRE ET MARIE CURIE sp´ecialit´e Informatique Ecole´ doctorale Informatique, T´el´ecommunications et Electronique´ (Paris) pr´esent´eeet soutenue publiquement par Issam SAID pour obtenir le grade de DOCTEUR en SCIENCES de l’UNIVERSITE´ PIERRE ET MARIE CURIE Apports des architectures hybrides `a l’imagerie profondeur : ´etude comparative entre CPU, APU et GPU Th`esedirig´eepar Jean-Luc Lamotte et Pierre Fortin soutenue le Lundi 21 D´ecembre 2015 apr`es avis des rapporteurs M. Fran¸cois Bodin Professeur, Universit´ede Rennes 1 M. Christophe Calvin Chef de projet, CEA devant le jury compos´ede M. Fran¸cois Bodin Professeur, Universit´ede Rennes 1 M.
    [Show full text]
  • AMD APP SDK V2.8.1 Developer Release Notes
    AMD APP SDK v2.8.1 Developer Release Notes 1 What’s New in AMD APP SDK v2.8.1 1.1 New features in AMD APP SDK v2.8.1 AMD APP SDK v2.8.1 includes the following new features: Bolt: With the launch of Bolt 1.0, several new samples have been added to demonstrate the use of the features of Bolt 1.0. These features showcase the use of valuable Bolt APIs such as scan, sort, reduce and transform. Other new samples highlight the ease of porting from STL and the performance benefits achieved over equivalent STL implementations. Other samples demonstrate the different fallback options in Bolt 1.0 when no GPU is available. These options include a fallback to multicore-CPU if TBB libraries are installed, or falling all the way back to serial-CPU if needed to ensure your code runs correctly on any platform. OpenCV: AMD has been working closely with the OpenCV open source community to add heterogeneous acceleration capability to the world’s most popular computer vision library. These changes are already integrated into OpenCV and are readily available for developers who want to improve the performance and efficiency of their computer vision applications. The new samples illustrate these improvements and highlight how simple it is to include them in your application. For information on the latest OpenCV enhancements, see Harris’ blog. GCN: AMD recently launched a new Graphics Core Next (GCN) Architecture on several AMD products. GCN is based on a scalar architecture versus the VLIW vector architecture of prior generations, so carefully hand-tuned vectorization to optimize hardware utilization is no longer needed.
    [Show full text]
  • AMD Releases ATI Catalyst 72
    AMD Releases ATI Catalyst 7.2 1 / 4 AMD Releases ATI Catalyst 7.2 2 / 4 3 / 4 Just moments ago AMD released this months update to the ATI Catalyst drivers, bringing the Radeon powering software up to version 7.2. Along with the usual .... AMD --. Version 7.2.0 of the xf86-video-ati DDX driver was released this morning. While the X.Org drivers aren't too exciting these days with .... Catalyst 7.2 is available now and boosts performance for all ATI Radeon X1000 series cards. ATI has released new drivers for Windows XP, .... File Description: ATI CATALYST 7.2(FEBRUARY 2007) OFFICAL RELEASE VGA/SOUTHBRIDGE DRIVER FEATURING NEW CATALYST .... Developer. AMD Inc. Latest Version. ATI Catalyst Drivers 14.6 (Beta). Supported Systems ... Version Name. Released Date. Size ... ATI Catalyst 7.2. 22 February .... AMD just released the new Catalyst 7.2 drivers for ATI graphics cards. Some of the ... Open GL performance improves for all ATI Radeon X1000 series products.. This release for all Radeon family products updates the AMD display driver to the latest version. This unified driver has been further enhanced .... ATI Catalyst 7.2 Released AMD/ATI have released their latest set of video card drivers for both Windows XP and Windows Vista. Catalyst™ 7.2 .... AMD/ATI has released its latest version of the Catalyst drivers, which are used by the Radeon 9500 series and higher. There are 32-bit and .... ATI Catalyst™ Software Suite Version 7.2. This release note provides information on the latest posting of AMD's industry leading software suite, Catalyst™.
    [Show full text]
  • Mellanox Corporate Deck
    UCX Community Meeting SC’19 November 2019 Open Meeting Forum and Publicly Available Work Product This is an open, public standards setting discussion and development meeting of UCF. The discussions that take place during this meeting are intended to be open to the general public and all work product derived from this meeting shall be made widely and freely available to the public. All information including exchange of technical information shall take place during open sessions of this meeting and UCF will not sponsor or support any closed or private working group, standards setting or development sessions that may take place during this meeting. Your participation in any non-public interactions or settings during this meeting are outside the scope of UCF's intended open-public meeting format. © 2019 UCF Consortium 2 UCF Consortium . Mission: • Collaboration between industry, laboratories, and academia to create production grade communication frameworks and open standards for data centric and high-performance applications . Projects https://www.ucfconsortium.org Join • UCX – Unified Communication X – www.openucx.org [email protected] • SparkUCX – www.sparkucx.org • Open RDMA . Board members • Jeff Kuehn, UCF Chairman (Los Alamos National Laboratory) • Gilad Shainer, UCF President (Mellanox Technologies) • Pavel Shamis, UCF treasurer (Arm) • Brad Benton, Board Member (AMD) • Duncan Poole, Board Member (Nvidia) • Pavan Balaji, Board Member (Argonne National Laboratory) • Sameh Sharkawi, Board Member (IBM) • Dhabaleswar K. (DK) Panda, Board Member (Ohio State University) • Steve Poole, Board Member (Open Source Software Solutions) © 2019 UCF Consortium 3 UCX History https://www.hpcwire.com/2018/09/17/ucf-ucx-and-a-car-ride-on-the-road-to-exascale/ © 2019 UCF Consortium 4 © 2019 UCF Consortium 5 UCX Portability .
    [Show full text]
  • AMD Codexl 1.0 GA Release Notes (Version 1.0.)
    AMD CodeXL 1.0 GA Release Notes (version 1.0.) CodeXL 1.0 is finally here! Thank you for using CodeXL. We appreciate any feedback you have! Please use our CodeXL Forum to provide your feedback. You can also check out the Getting Started guide on the CodeXL Web Page and Milind Kukanur’s CodeXL blog at AMD Developer Central - Blogs This version contains: CodeXL Visual Studio package and Standalone application, for 32-bit and 64-bit Windows platforms CodeXL for 64-bit Linux platforms Kernel Analyzer v2 for both Windows and Linux platforms System Requirements CodeXL contains a host of development features with varying system requirements: GPU Profiling and OpenCL Kernel Debugging o An AMD GPU (Radeon HD 5xxx or newer) or APU is required o The AMD Catalyst Driver must be installed, release 12.8 or later. Catalyst 12.12 is the recommended version (will become available later in December 2012). For GPU API-Level Debugging, a working OpenCL/OpenGL configuration is required (AMD or other). CPU Profiling o Time Based Profiling can be performed on any x86 or AMD64 (x86-64) CPU/APU. o The Event Based Profiling (EBP) and Instruction Based Sampling (IBS) session types require an AMD CPU or APU processor. Supported platforms: Windows platforms: Windows 7, 32-bit and 64-bit are supported o Visual Studio 2010 must be installed on the station before the CodeXL Visual Studio Package is installed. Linux platforms: Red Hat EL 6 u2 64-bit and Ubuntu 11.10 64-bit New in this version The following items were not part of the Beta release and are new to this version: A unified installer for Windows which installs both CodeXL and APP KernelAnalyzer2 on 32 and 64 bit platforms, packaged in a single executable file.
    [Show full text]
  • AMD Codexl 1.1 GA Release Notes
    AMD CodeXL 1.1 GA Release Notes Thank you for using CodeXL. We appreciate any feedback you have! Please use our CodeXL Forum to provide your feedback. You can also check out the Getting Started guide on the CodeXL Web Page and the latest CodeXL blog at AMD Developer Central - Blogs This version contains: CodeXL Visual Studio 2012 and 2010 packages and Standalone application, for 32-bit and 64-bit Windows platforms CodeXL for 64-bit Linux platforms Kernel Analyzer v2 for both Windows and Linux platforms Note about 32-bit Windows CodeXL 1.1 Upgrade Error On 32-bit Windows platforms, upgrading from previous version of CodeXL using the CodeXL 1.1 installer will remove the previous version and then display an error message without installing CodeXL 1.1. The recommended method is to uninstall previous CodeXL before installing CodeXL 1.1. If you ran the 1.1 installer to upgrade a previous installation and encountered the error mentioned above, ignore the error and run the installer again to install CodeXL 1.1. Note about installing CodeAnalyst after installing CodeXL for Windows CodeXL can be safely installed on a Windows station where AMD CodeAnalyst is already installed. However, do not install CodeAnalyst on a Windows station already installed with CodeXL. Uninstall CodeXL first, and then install CodeAnalyst. System Requirements CodeXL contains a host of development features with varying system requirements: GPU Profiling and OpenCL Kernel Debugging o An AMD GPU (Radeon HD 5xxx or newer) or APU is required o The AMD Catalyst Driver must be installed, release 12.8 or later.
    [Show full text]