Intel® Xeon Phi™ Coprocessor

Total Page:16

File Type:pdf, Size:1020Kb

Intel® Xeon Phi™ Coprocessor Intel® Xeon Phi™ Coprocessor http://tinyurl.com/inteljames twitter @jamesreinders James Reinders it’s all about parallel programming Source Compilers Libraries, Parallel Models Intel® MIC Multicore CPU Multicore CPU architecture coprocessor Game Changer Source Compilers Libraries, Parallel Models Intel® MIC Multicore CPU Multicore CPU architecture coprocessor “Unparalleled productivity… most of this software does not run on a GPU” - Robert Harrison, NICS, ORNL “R. Harrison, “Opportunities and Challenges Posed by Exascale Computing - ORNL's Plans and Perspectives”, National Institute of Computational Sciences, Nov 2011” Intel® C/C++ and Fortran Compilers w/OpenMP Intel® MPI Library Intel® MKL, Intel® Cilk Plus, Intel® TBB, and Intel® IPP Intel® Trace Analyzer and Collector Intel® Inspector XE, Intel® VTune™ Amplifier Intel® Parallel XE, Intel® Advisor Studio XE Intel® C/C++ and Fortran Compilers w/OpenMP Intel® MPI Library Intel® MKL, Intel® Cilk Plus, Intel® TBB, and Intel® IPP Intel® Trace Analyzer and Collector Intel® Inspector XE, Intel® VTune™ Amplifier Intel® Parallel XE, Intel® Advisor Studio XE Software Development Ecosystem for Intel Xeon Phi coprocessors Open Commercial Source gcc (kernel Intel® C++ Compiler, Compilers, build only, not Run environs for applications), Intel® Fortran Compiler, MYO, Python CAPS* HMPP* compiler, ScaleMP* Debugger gdb Intel Debugger, Rogue Wave* TotalView*, Allinea* DDT TBB1, NAG*, Intel® MKL, Intel® MPI, Libraries * MPICH2, OpenMP (in Intel compilers), Cilk™ Plus (in Intel compilers), FFTW, * NetCDF Rogue Wave IMSL, Intel® OpenCL* SDK Profiling & Intel® VTune™ Amplifier XE, Analysis Tools Intel® Trace Analyzer & Collector, Intel® Inspector XE, Rogue Wave ThreadSpotter* Workload Altair* PBS Professional, * Scheduler Adaptive Computing Moab 1 Commercial support of TBB available from Intel. *Other names and brands may be claimed as the property of others. Software Development Ecosystem for Intel Xeon Phi coprocessors Open Commercial Source gcc (kernel Intel® C++ Compiler, Compilers, build only, not Run environs for applications), Intel® Fortran Compiler, MYO, Python CAPS* HMPP* compiler, ScaleMP* Debugger gdb Intel Debugger, Rogue Wave* TotalView*, Allinea* DDT TBB1, NAG*, Intel®Intel® MKL, MPI Intel® Library MPI, Libraries * MPICH2, OpenMP (in Intel compilers), Cilk™ Plus (in Intel compilers), FFTW, * NetCDF Rogue Wave IMSL, Intel® OpenCLIntel® Trace* SDK Analyzer and Collector Profiling & Intel® VTune™ Amplifier XE, Analysis Tools Intel® Trace Analyzer & Collector, Intel® Inspector XE, Rogue Wave ThreadSpotter* Workload Altair* PBS Professional, * Scheduler Adaptive Computing Moab 1 Commercial support of TBB available from Intel. *Other names and brands may be claimed as the property of others. Knights Corner Coprocessor KNC Card KNC Card TCP/IP GDDR5 GDDR5 Channel … Channel Intel® Xeon® PC e x16 Channel GDDR5 Processor PCIe x16 … KN> 50 Cores Channel KN GDDR5 Linux OS System Memory GDDR5 GDDR5 Channel … Channel >= 8GB GDDR5 memory Knights Corner Micro-architecture Core Core Core Core PCIe Client L2 L2 L2 L2 Logic GDDR MC TD TD TD TD GDDR MC TD TD TD GDDR MC TD GDDR MC L2 L2 L2 L2 Core Core Core Core Knights Corner Core PPF PF D0 D1 D2 E WB T0 IP T1 IP L1 TLB Code Cache Miss T2 IP and 32KB T3 IP Code Cache TLB Miss 16B/Cycle (2 IPC) 4 Threads In-Order Decode uCode 512KB TLB Miss HWP L2 Cache Handler Pipe 0 Pipe 1 L2 Ctl L2 TLB VPU RF X87 RF Scalar RF X87 ALU 0 ALU 1 VPU To On-Die Interconnect 512b SIMD TLB Miss L1 TLB and 32KB Data Cache DCache Miss Core X86 specific logic < 2% of core + L2 area Vector Processing Unit PPF PF D0 D1 D2 E WB D2 E VC1 VC2 V1-V4 WB D2 E VC1 VC2 V1 V2 V3 V4 VPU LD DEC RF 3R, 1W Vector ALUs EMU 16 Wide x 32 bit ST 8 Wide x 64 bit Fused Multiply Add Mask Scatter RF Gather Interconnect BL - 64 Bytes Data Core Core Core Core L2 L2 L2 L2 AD Command and Address AK Coherence and Credits TD TD TD TD TD TD TD TD AK AD L2 L2 L2 L2 Core Core Core Core BL – 64 Bytes Interleaved Memory Access Core Core L2 L2 GDDR MC Core GDDR MC GDDR L2 TD TD TD Core L2 TD L2 TD Core L2 TD TD GDDR MC TD Core GDDR MC L2 L2 Core Core http://tinyurl.com/intelja mes twitter @jamesreinders A picture can be worth a thousand words. Picture worth many words Picture worth many words Picture worth many words SMALL NUMBER OF THREADS IS UNINTERESTING Picture worth many words AT LOW PERFORMANCE LEVELS, MORE THREADS NEEDED FOR SAME PERFORMANCE Picture worth many words THE PAYOFF IS HIGHER ACHIEVEABLE RESULTS ON CERTAIN WORKLOADS AND LOWER POWER USAGE Over 100 threads? !$OMP PARALLEL do PRIVATE(j,k) do i=1, M ! each thread will work its own part of the problem do j=1, N do k=1, X ! calculations end do end do end do Fortran do loop transformed to create many threads using an OpenMP directive Where does my program run? 1. On CPU and “offload” to coprocessor model popular with GPUs 1. All the cores (CPU or coprocessor) are just peers in a system (probably connect with MPI) Your choice. Whatever works best for you. On CPU and “offload” to coprocessor model popular with GPUs Supported by: 1. Automatic use by Intel® math Kernel Library (MKL) 2. Program controls by Compiler directives (C, C++, Fortran) 3. APIs available to build additional tools or low level programs Offload Directives and Standard Requirements NVidia’s Intel’s Desired Feature OpenACC LEO Standard Support for C and C++, Fortran ✔ ✔ ✔ Support single code base of hetero-machine ✔ ✔ ✔ Overlap communication and computation ✔ ✔ ✔ Interoperate with MPI ✔ ✔ ✔ Interoperate with OpenMP* ✔ ✔ Offload to GPU ✔ ✔ Offload to MIC Coprocessor ✔ ✔ Ability to support all accelerators ✔ Ability to support all GPUs ✔ Ability to support all co-processors ✔ Proof of performance portability ✔ Support for nested parallelism ✔ ✔ User-managed memory consistency ✔ ✔ ✔ Multiple vendor support ✔ ✔ Restrict clause support ✔ Support for dynamic dispatch ✔ ✔ Parallel on/off separate from offload ✔ ✔ PGI*, CAPS* compiler support 2012 ✔ Cray* compiler support soon ✔ Intel® compiler support 2010* ✔ Broad standards body approval ✔ OpenMP* 4.0 (early 2013) planned * public product in 2012 two pre-Standard approaches to directives to control “offload” nVidia OpenACC Intel Language Extensions for Offload Data Parallelism Only Broad range of Parallelism Optimized for SIMT GPU Multicore, Many-core CPU, GPU No General Purpose Threading General Purpose Threading Targets “GPU Computing” Supports Intel CPU, GPU & coprocessor closed spec standards body with broad participation OpenMP “omp target” Open, Standard, Supports Diverse Hardware Intel will support the OpenMP/TR in our C/C++ and Fortran compilers Intel LEO support diverse parallel programming models and is an ideal path to OpenMP 4.0 Other brands and names are the property of their respective owners. Where does your program RUN? Everywhere More flexible possibility: Consider the program to run on cores everywhere. This opens up many possibilities. Peers cores or groups of cores can be organized in many ways. Peers? Well, it is an SMP-on-a-chip running Linux. As peers, a distributed program runs on processors and coprocessors, communicating with each other. Many ways to think about this. Starts with MPI. Intel Xeon Phi coprocessors stand out here – because of how very flexible this model is. Limited only by imagination! HotChips presentation (architecture details) Where to Learn More http://intel.com/software/mic http://tinyurl.com/intelja mes twitter @jamesreinders This is a really great book… I've been dreaming for a while of a modern accessible book that I could recommend to my threading-deprived colleagues and assorted enquirers to get them up to speed with the core concepts of multithreading as well as something that covers all the major current interesting implementations. Finally I have that book. Martin Watt, Principal Engineer, Dreamworks Animation (c) 2012, publisher: Morgan Kaufmann http://tinyurl.com/inteljames twitter @jamesreinders Available in early 2013. (limited partial “proof” version available at SC12 for reviewers) Completely focused on Intel Xeon Phi coprocessors. Volume 1: essentials ~350 pages of explanation of programming. It all comes down to PARALLEL PROGRAMMING ! (applicable to processors and Intel® Xeon Phi™ coprocessor) (c) 2013 http://tinyurl.com/inteljames twitter @jamesreinders http://tinyurl.com/inteljames my blogs Summary Intel® Xeon Phi™ coprocessor provides: Performance and Performance/Watt for highly parallel HPC workloads with cores, threads, wide-SIMD, caches, memory BW while maintaining the advantages of Intel Architecture general purpose programming environment advanced power management technology delivers programmability and performance/watt for highly parallel HPC parallel programming http://tinyurl.com/inteljames twitter @jamesreinders Thank you. http://tinyurl.com/inteljames twitter @jamesreinders Legal Disclaimer & Optimization Notice INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference
Recommended publications
  • Intel® Inspector 2017 Release Notes for Linux* OS
    Intel® Inspector 2017 Release Notes for Linux* OS Installation Guide and Release Notes 4 September 2017 Contents: Introduction What's New System Requirements Installation Notes Issues and Limitations Attributions Disclaimer and Legal Information 1 Introduction Intel® Inspector 2017 helps developers identify and resolve memory and threading correctness issues in their C, C++ and Fortran programs. Intel Inspector is a static and dynamic error checking tool for developing multithreaded applications on Windows* or Linux* operating systems. Intel Inspector maximizes code quality and reliability by quickly detecting memory, threading, and source code security errors during the development cycle. You can also use the Intel Inspector to visualize and manage Static Analysis results created by Intel® compilers in various suite products. Intel Inspector is an easy, comprehensive solution that delivers rapid results in isolating memory and multithreading errors. Intel Inspector has a standalone graphical user interface (GUI) as well as a command line interface (CLI). This document provides system requirements, installation instructions, issues and limitations, and legal information. Use the Getting Started tutorial and reference documentation to learn more about the Intel Inspector. For documentation, open the get_started.htm file in the following directory: /opt/intel/inspector_2017/documentation/en/welcomepage. You can access the product help in a web browser by opening the index.htm in the documentation help directory. 1 Intel® Inspector 2017 Release Notes If you did not register this product during installation, do so at the Intel® Software Development Products Registration Center (https://registrationcenter.intel.com/). Registration entitles you to free technical support, product updates and upgrades for the duration of the support term.
    [Show full text]
  • Middleware Mobility Paper SSRN
    STANDARDIZED TOOLS AND THE GENERALIZABILITY OF HUMAN CAPITAL: THE IMPACT OF STANDARDIZED TECHNOLOGIES ON EMPLOYEE MOBILITY Milan Miric University of Southern California ([email protected]) Hakan Ozalp KIN Center for Digital Innovation, VU Amsterdam ([email protected]) Abstract The mobility of highly skilled knowledge and creative workers is an important determinant of innovation. Existing studies have not considered how the use and diffusion of standardized technologies and tools influence the mobility of individual knowledge workers. We theorize that the diffusion of standardized tools increases the generalizability of human capital and, in turn, increases the ability of individuals to move between companies. Using data on the use of middleware in the console games industry, we find that this diffusion of standardized middleware tools lead to an increase in labor mobility on average, but was associated with higher mobility for individuals with skills that complemented those tools, in comparison to those that were substituted by these tools. Worker experience with standardized tools amplified these effects, as individuals who were experienced in using these tools saw the largest shift in the likelihood of mobility. We do not find that this diffusion led to individuals leaving the industry, but we do find evidence that the diffusion of a common set of tools within an industry was associated with workers being less likely to leave that industry. These results highlight the potential unintended effects of technological standardization and the broad diffusion of standardized tools, which may enable workers to more easily shift between competitors. 1. INTRODUCTION Human capital is a key resource for organizations and can be a key determinant of firm innovativeness, competitiveness, and survival.
    [Show full text]
  • Intel® Parallel Studio XE 2015 Composer Edition for Fortran OS X* Installation Guide and Release Notes
    Intel® Parallel Studio XE 2015 Composer Edition for Fortran OS X* Installation Guide and Release Notes 23 October 2014 Table of Contents 1 Introduction ............................................................................................................................ 3 1.1 Change History ............................................................................................................... 3 1.1.1 Changes in Update 1 ............................................................................................... 3 1.1.2 Changes since Intel® Fortran Composer XE 2013 SP1 (New in Intel® Parallel Studio XE 2015 Composer Edition) ....................................................................................... 3 1.2 Product Contents ............................................................................................................ 4 1.3 Intel® Debugger (IDB) is removed from this release ...................................................... 4 1.4 System Requirements .................................................................................................... 4 1.5 Documentation ............................................................................................................... 4 1.6 Optimization Notice ........................................................................................................ 4 1.7 Technical Support ........................................................................................................... 5 2 Installation .............................................................................................................................
    [Show full text]
  • Intel® Debugger Command Reference
    Intel® Debugger Command Reference May 2009 Document Number: 319698-009US World Wide Web: http://www.intel.com Disclaimer and Legal Information INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. UNLESS OTHERWISE AGREED IN WRITING BY INTEL, THE INTEL PRODUCTS ARE NOT DESIGNED NOR INTENDED FOR ANY APPLICATION IN WHICH THE FAILURE OF THE INTEL PRODUCT COULD CREATE A SITUATION WHERE PERSONAL INJURY OR DEATH MAY OCCUR. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked reserved or undefined. Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information. The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.
    [Show full text]
  • Table of Contents Porting/Building Code
    Table of Contents Porting/Building Code............................................................................................1 Porting/Building Code: Overview.......................................................................................1 Compiling..............................................................................................................2 Endian and Related Environment Variables or Compiler Options......................................2 GNU Compiler Collection....................................................................................................5 OpenMP.............................................................................................................................6 PGI Compilers and Tools....................................................................................................9 Intel Compiler..................................................................................................................11 Recommended Compiler Options....................................................................................13 Porting to NAS Systems........................................................................................16 Porting with HPE MPT.......................................................................................................16 Porting with HPE's MPI and Intel OpenMP........................................................................21 Porting with Intel-MPI.......................................................................................................23
    [Show full text]
  • Intel(R) Fortran Compiler Options
    Intel(R) Fortran Compiler Options Document Number: 307780-005US Disclaimer and Legal Information INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL(R) PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. UNLESS OTHERWISE AGREED IN WRITING BY INTEL, THE INTEL PRODUCTS ARE NOT DESIGNED NOR INTENDED FOR ANY APPLICATION IN WHICH THE FAILURE OF THE INTEL PRODUCT COULD CREATE A SITUATION WHERE PERSONAL INJURY OR DEATH MAY OCCUR. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information. The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.
    [Show full text]
  • Intel(R) System Studio Release Notes and Installation Guide
    Intel® System Studio 2016 Update 3 Installation Guide and Release Notes Installation Guide and Release Notes for Windows* Host 22 April 2016 Contents 1 Introduction ......................................................................................................................... 4 2 What's New ......................................................................................................................... 5 2.1 Versions History ........................................................................................................... 8 3 Intel® Software Manager ...................................................................................................12 4 Product Contents ...............................................................................................................12 5 Getting Started ...................................................................................................................12 6 Technical Support and Documentation ..............................................................................13 6.1 Release / Installation Notes and User Guides Location ...............................................13 6.2 Articles and Whitepaper Locations ..............................................................................15 6.3 Support .......................................................................................................................15 6.4 Support for native code generation for Intel® Graphics Technology ............................15 7 System Requirements ........................................................................................................17
    [Show full text]
  • Intel(R) C++ Compiler 8.0 for Linux Release Notes
    Intel® C++ Compiler 8.0 for Linux Release Notes Contents Overview New Features System Requirements Installation Notes Known Limitations Technical Support Documentation Additional Information Copyright and Legal Information Overview The Intel® Compilers help make your software run at top speeds on all Intel® 32-bit processors and the Intel® Itanium® processors. Optimizations include support for Streaming SIMD Extensions 2 (SSE2) in the Intel® Pentium® 4 and Pentium® M processors, Streaming SIMD Extensions 3 (SSE3) in the Intel® Pentium® Processor 4 with Streaming SIMD Extensions 3 (SSE3) instruction support and software pipelining in the Intel Itanium 2 processor. Inter- procedural optimization (IPO) and profile-guided optimization (PGO) can provide greater application performance. Intel Compilers support multi-threaded code development through autoparallelism and OpenMP* support. The following components are included in the installation package: ● The Intel® C++ Compiler version 8.0 for IA-32 based applications: icc. ● The Intel® C++ Compiler version 8.0 for Itanium-based applications: icc. ● The Intel® Debugger version 7.3 for IA-32 and Itanium-based applications: idb. ● The Intel® Itanium Assembler version 7.0 to produce Itanium-based applications: ias. ● The Intel® Compiler code-coverage tool ● The Intel® Compiler test-prioritization tool ● Documentation and documentation index that can be found at <install-dir>/doc/ ccompindex.htm The paper, Optimizing Applications with the Intel C++ and Fortran Compilers for Windows* and Linux*, explains how to use the Intel compilers to optimize for the Pentium 4 and Itanium processors and is available at http://www.intel.com/software/products/compilers/ . Additional information on the Intel Software Development Products is available at http://www.intel.com/ software/products/ .
    [Show full text]
  • Intel® Fortran Composer XE 2011 for Linux* Installation Guide and Release Notes
    Intel® Fortran Composer XE 2011 for Linux* Installation Guide and Release Notes Document number: 321415-003US 14 January 2011 Table of Contents 1 Introduction ......................................................................................................................... 3 1.1 Change History ............................................................................................................ 3 1.2 Product Contents ......................................................................................................... 3 1.3 System Requirements .................................................................................................. 3 1.3.1 Red Hat Enterprise Linux* 4 Support Deprecated ................................................. 5 1.3.2 IA-64 Architecture (Intel® Itanium®) Development Not Supported ........................ 5 1.4 Documentation ............................................................................................................. 5 1.5 Japanese Language Support ....................................................................................... 6 1.6 Technical Support ........................................................................................................ 7 2 Installation ........................................................................................................................... 7 2.1 Activation of Purchase after Evaluation Using the Intel Activation Tool ........................ 8 2.2 Silent Install ................................................................................................................
    [Show full text]
  • Intel® Openvino™ with FPGA Support Through the Intel FPGA Deep Learning Acceleration Suite
    SOLUTION BRIEF Intel® OpenVINO™ with FPGA Support Through the Intel FPGA Deep Learning Acceleration Suite Intel® FPGA Deep Learning Acceleration Suite enables Intel FPGAs for accelerated AI optimized for performance, power, and cost. Introduction Artificial intelligence (AI) is driving the next big wave of computing, transforming both the way businesses operate and how people engage in every aspect of their lives. Intel® FPGAs offer a hardware solution that is capable of handling extremely challenging deep learning models at unprecedented levels of performance and flexibility. The Intel OpenVINO™ toolkit is created for the development of applications and solutions that emulate human vision, and it provides support for FPGAs through the Intel FPGA Deep Learning Acceleration Suite. The Intel FPGA Deep Learning Acceleration Suite is designed to simplify the adoption of Intel FPGAs for inference workloads by optimizing the widely used Caffe* and TensorFlow* frameworks to be applied for various applications, including image classification, computer vision, autonomous vehicles, military, and medical diagnostics. Intel FPGAs offer customizable performance, customizable power, deterministic low latency, and flexibility for today’s most widely adopted topologies as well as programmability to handle emerging topologies. Unique flexibility, for today and the future, stems the ability of Intel FPGAs to support emerging algorithms by enabling new numeric formats quickly. What makes FPGAs unique is its ability to achieve high performance through parallelism coupled with the flexibility of hardware customization – which is not available on CPU, GPU, or ASIC architectures. Turnkey Solution Offers Faster Time to Market The Intel Programmable Acceleration Card with Intel Arria® 10 GX FPGA (Intel PAC with Intel Arria 10 GX FPGA) combined with the Intel FPGA Deep Learning Acceleration Suite offer an acceleration solution for real-time AI inference with low latency as a key performance indicator.
    [Show full text]
  • Compilers and Debuggers
    Compilers and Debuggers Nikolaos Nikoloutsakos Greek Research & Technology Network Athens, 22 Nov. 2018 1/79 Prace PTC, GRNET, 20 Nov. - 22 Nov. 2018 www.prace-ri.eu Contents 1 Environment Modules Available compilers MPI compilers 2 Compilers Compilation flags Optimization flags 3 Debuggers GDB DDD Parallel Debugging Debugging OpenMP Applications Debugging MPI Applications Valgrind 2/79 Prace PTC, GRNET, 20 Nov. - 22 Nov. 2018 www.prace-ri.eu Environment Login Shell /bin/bash echo $SHELL Environment Variables env printenv GRNET_ROOT; echo $GRNET_ROOT Default variables I $GRNET_ROOT I $HOME I $WORKDIR 3/79 Prace PTC, GRNET, 20 Nov. - 22 Nov. 2018 www.prace-ri.eu Environment Modules environment modules to manage user environment Access applications or versions on ARIS Dynamically set up environments for different applications PATH, LD_LIBRARY_PATH etc. One module for each software version Detects software dependencies and informs which modules must be loaded 4/79 Prace PTC, GRNET, 20 Nov. - 22 Nov. 2018 www.prace-ri.eu What modules exist? module avail List available modules. 5/79 Prace PTC, GRNET, 20 Nov. - 22 Nov. 2018 www.prace-ri.eu What modules exist? module avail <module> List <module> available versions Default version marked as (default) 6/79 Prace PTC, GRNET, 20 Nov. - 22 Nov. 2018 www.prace-ri.eu Default module version I Almost all ARIS software packages have multiple versions marked as (default) In these case the commands: module load MODULENAME and module load MODULENAME/DEFAULTVERSION are equivalent. I eg. current defaults module load intel and module load intel/15.0.3 are the same 7/79 Prace PTC, GRNET, 20 Nov.
    [Show full text]
  • Intel Visual Adrenaline Magazine, Issue 3, 2009
    INTEL® VISUAL Adrenaline ISSUE 3, 2009 AUTODESK®: * TECHNOLOGY WOLFENSteIN ADVANCES AS REBORN: NEXT-GENERATION Better, Stronger, HARDWARE Faster—and True MELDS WITH to Its Roots CUSTOM-TUNED SOFTWARE A Peek into the Intel® Architecture Code-Named Larrabee New Intel® Graphics Performance Analyzers Empire: Total War * Takes to the Waves with Real-Time 3D Naval Battles ©2009 Mythic Entertainment** TABLE OF CONTENTS 2 Wolfenstein* is Reborn Seventeen years after the groundbreaking original, an all new Wolfenstein is true to its roots. 8 Take a Look at Larrabee Tom Forsyth shares his thoughts on how it feels to be on the leading edge of the gaming graphics universe. 13 Victory for 2008 Indie Game Developers * Three winners, three games—how 28 Empire: Total War the win from Intel has helped these Takes to the Waves game developers. Featuring real-time 3D naval battles, Empire: Total War targets the revolution and rebellion of eighteenth-century England. 18 Autodesk® Fuels Triple-A Game Creations A trio of applications from Autodesk animates and energizes Triple-A game development. 24 Launch of Intel® Graphics Performance Analyzers: Virtual Graphics Support 34 Cakewalk Software: Composing Audio for Your Game Studio for Video Games Realize full graphics performance potential The audio soundtrack of a video game can make with minimum effort. the experience an adrenaline-drenched adventure. FROM THE managing EDITOR’S DESK CHRYSTE SULLIVAN It’s been a doozy of a winter for those of us in I know some fret about excessive “screen time,” but I firmly Pacific Northwest.All I can tell you is that those of us in side with Steven Johnson, who sang the praises of our immersed Oregon may have an affinity for fleece and coffee, but we digital existence in his 2005 book Everything Bad is Good for are innocent in the ways of shoveling driveways, salting You.
    [Show full text]