GPU System Architecture

Total Page:16

File Type:pdf, Size:1020Kb

GPU System Architecture GPU System Architecture Alan Gray EPCC The University of Edinburgh Outline • Why do we want/need accelerators such as GPUs? • GPU-CPU comparison – Architectural reasons for GPU performance advantages • GPU accelerated systems – From desktop to supercomputer • Accelerator Technology – NVIDIA – AMD – Intel • Example GPU machines Alan Gray Outline • Why do we want/need accelerators such as GPUs? • GPU-CPU comparison – Architectural reasons for GPU performance advantages • GPU accelerated systems – From desktop to supercomputer • Accelerator Technology – NVIDIA – AMD – Intel • Example GPU machines Alan Gray • The power used by a CPU core is proportional to Clock Frequency x Voltage2 • In the past, computers got faster by increasing the frequency – Voltage was decreased to keep power reasonable. • Now, voltage cannot be decreased any further – 1s and 0s in a system are represented by different voltages – Reducing overall voltage further would reduce this difference to a point where 0s and 1s cannot be properly distinguished Alan Gray • Instead, performance increases can be achieved through exploiting parallelism • Need a chip which can perform many parallel operations every clock cycle – Many cores and/or many operations per core • Want to keep power/core as low as possible • Much of the power expended by CPU cores is on functionality not generally that useful for HPC – Branch prediction, out-of-order execution etc Alan Gray • So, for HPC, we want chips with simple, low power, number-crunching cores • But we need our machine to do other things as well as the number crunching – Run an operating system, perform I/O, set up calculation etc • Solution: “Hybrid” system containing both CPU and “accelerator” chips Alan Gray • It costs a huge amount of money to design and fabricate new chips – Not feasible for relatively small HPC market • Luckily, over the last few years, Graphics Processing Units (GPUs) have evolved for the highly lucrative gaming market – And largely possess the right characteristics for HPC – Many number-crunching cores • GPU vendors have tailored existing GPU architectures to the HPC market Alan Gray GPUs in HPC • November 2011 Top500 list: GPUs (next generation will have GPUs) GPUs GPUs • GPUs now firmly established in HPC industry Alan Gray Outline • Why do we want/need accelerators such as GPUs? • GPU-CPU comparison – Architectural reasons for GPU performance advantages • GPU accelerated systems – From desktop to supercomputer • Accelerator Technology – NVIDIA – AMD – Intel • Example GPU machines Alan Gray AMD 12-core CPU = compute unit (= core) • Not much space on CPU is dedicated to compute Alan Gray NVIDIA Fermi GPU = compute unit (= SM = 32 CUDA cores) • GPU dedicates much more space to compute – At expense of caches, controllers, sophistication etc Alan Gray Memory • For many applications, performance is very sensitive to memory bandwidth CPUs use DRAM GPUs use Graphics DRAM • Graphics memory: much higher bandwidth than standard CPU memory Alan Gray • GPU vs CPU: Theoretical Peak capabilities NVIDIA Fermi AMD Magny-Cours (6172) Cores 448 (1.15GHz) 12 (2.1GHz) Operations/cycle 1 4 DP Performance (peak) 515 GFlops 101 GFlops Memory Bandwidth (peak) 144 GB/s 27.5 GB/s • For these particular products, GPU theoretical advantage is ~5x for both compute and main memory bandwidth • Application performance very much depends on application • Typically a small fraction of peak • Depends how well application is suited to/tuned for architecture Alan Gray Outline • Why do we want/need accelerators such as GPUs? • GPU-CPU comparison – Architectural reasons for GPU performance advantages • GPU accelerated systems – From desktop to supercomputer • Accelerator Technology – NVIDIA – AMD – Intel • Example GPU machines Alan Gray • GPUs cannot be used instead of CPUs – They must be used together – GPUs act as accelerators – Responsible for the computationally expensive parts of the code DRAM GDRAM CPU GPU PCIe I/O I/O Alan Gray Scaling to larger systems PCIe I/O I/O CPU GPU + GDRAM Interconnect DRAM CPU GPU + Interconnect allows GDRAM multiple nodes to be connected PCIe I/O I/O • Can have multiple CPUs and GPUs within each “workstation” or “shared memory node” – E.g. 2 CPUs +2 GPUs (above) – CPUs share memory, but GPUs do not Alan Gray GPU Accelerated Supercomputer GPU/CPU GPU/CPU … GPU/CPU Node Node Node GPU/CPU GPU/CPU GPU/CPU Node Node … Node … … … GPU/CPU GPU/CPU GPU/CPU Node Node … Node Alan Gray • To utilise a GPU, programs must – Contain parts targeted at host CPU (most lines of source code) – Contain parts targeted at GPU (key computational kernels) – Manage data transfers between distinct CPU and GPU memory spaces – Traditional language (e.g C/Fortran) does not provide these facilities • To run on multiple GPUs in parallel – Normally use one host CPU core (thread) per GPU – Program manages communication between host CPUs in the same fashion as traditional parallel programs – e.g. MPI and/or OpenMP (latter shared memory node only) Alan Gray Outline • Why do we want/need accelerators such as GPUs? • GPU-CPU comparison – Architectural reasons for GPU performance advantages • GPU accelerated systems – From desktop to supercomputer • Accelerator Technology – NVIDIA – AMD – Intel • Example GPU machines Alan Gray Latest Technology • NVIDIA – Tesla HPC specific GPUs have evolved from GeForce series • AMD – FireStream HPC specific GPUs have evolved from (ATI) Radeon series • Intel – Knights Corner many-core x86 chip is like hybrid between a GPU and many-core CPU Alan Gray NVIDIA Tesla Series GPU • Chip partitioned into Streaming Multiprocessors (SMs) • Multiple cores per SM • Sophisticated thread engine • Shared L2 cache Alan Gray NVIDIA GPU SM • Less scheduling units than cores • Threads are scheduled in groups of 32, called a warp • Threads within a warp always execute the same instruction in lock-step (on different data elements) • Configurable L1 Cache/ Shared Memory Alan Gray NVIDIA Tesla Series “Tesla” “Fermi” “Fermi” “Fermi” “Kepler” 1060 2050 2070 2090 K20 (not yet released) CUDA cores 240 448 448 512 1536 SP Performance 933 GFlops 1030 1030 1330 ?? GFlops GFlops GFlops DP Performance 78 GFlops 515 GFlops 515 GFlops 665 GFlops ?? Memory 102 GB/s 144 GB/s 144 GB/s 178 GB/s ?? Bandwidth Memory 4 GB 3 GB 6 GB 6 GB ?? • Variety of packaging solutions Alan Gray NVIDIA Roadmap Alan Gray Programming NVIDIA Tesla • CUDA C: proprietary interface to the architecture. – Extensions to the C language which allow interfacing to the hardware – Now relatively mature, and supported by comprehensive documentation, user forums, etc – Fortran support provided by third party PGI Cuda Fortran compiler • OpenCL – Cross platform API – Similar in design to CUDA, but lower level and not so mature • Directives based approach – OpenMP style directives - abstract complexities away from programmer – Several products with varying syntax etc. – Still lack maturity – OpenMP committee currently considering adopting accelerators as part of OpenMP standard – With participation from all main vendors plus others such as EPCC Alan Gray AMD FirePro • AMD acquired ATI in 2006 • AMD FirePro series: derivative of Radeon chips with HPC enhancements • FirePro W9000 – Released Aug 2012 – Peak 1 TFLOP (double precision) – Peak 4 TFLOPS (single precision) – 6GB GDDR5 SDRAM – Now with full ECC support • Programming: OpenCL • Packaging: just cards for workstations at the moment. Server solutions may follow. Alan Gray Intel Xeon Phi • Intel Larrabee: “A Many-Core x86 Architecture for Visual Computing” – Release delayed such that the chip missed competitive window of opportunity. – Larrabee was not released as a competitive product, but instead a platform for research and development (Knight’s Ferry). • Knights Corner derivative chip – Many Integrated Cores (MIC) architecture. No longer aimed at graphics market – Instead “Accelerating Science and Discovery” – “more than 50 cores” – Release expected in 2012 – Now branded as “Intel Xeon Phi” • Hybrid between GPU and many-core CPU Alan Gray Intel Xeon Phi CPU-like GPU-like x86 cores: same instruction set as standard CPUs Simple cores, lack sophistication e.g. no out-of- Relatively small number cores/chip order execution Each core contains 512-bit vector processing Fully cache coherent unit (16 SP or 8 DP numbers) In principle could support an OS Supports multithreading Not expected to run OS, but used as accelerator (at least initially) • Programming: Possible to use standard models such as OpenMP but still need additional directives to manage data Alan Gray Outline • Why do we want/need accelerators such as GPUs? • GPU-CPU comparison – Architectural reasons for GPU performance advantages • GPU accelerated systems – From desktop to supercomputer • Accelerator Technology – NVIDIA – AMD – Intel • Example GPU machines Alan Gray DIY GPU Workstation • Just need to slot GPU card into PCI-e • Need to make sure there is enough space and power in workstation Alan Gray GPU Servers • Several vendors offer GPU Servers • Example Configuration: – 4 GPUs plus 2 (multi-core) CPUs • Multiple servers can be connected via interconnect Alan Gray EPCC HECToR GPU Machine • EPCC own a GPU System which is partly available to UK researchers – As part of the HECToR UK National Service • Comprises 3 GPU servers, each with 4 NVIDIA Fermi GPUs – Connected via infiniband interconnect – Plus another node with single AMD Firestream GPU – Plus a front end with a single NVIDIA Fermi – More details at http://www.hector.ac.uk/howcan/admin/apply/ HECToRGPU.php
Recommended publications
  • Tesla K80 Gpu Accelerator
    TESLA K80 GPU ACCELERATOR BD-07317-001_v05 | January 2015 Board Specification DOCUMENT CHANGE HISTORY BD-07317-001_v05 Version Date Authors Description of Change 01 June 23, 2014 GG, SM Preliminary Information (Information contained within this board specification is subject to change) 02 October 8, 2014 GG, SM • Updated product name • Minor change to Table 2 03 October 31, 2014 GG, SM • Added “8-Pin CPU Power Connector” section • Updated Figure 2 04 November 14, 2014 GG, SM • Removed preliminary and NDA • Updated boost clocks • Minor edits throughout document 05 January 30, 2015 GG, SM Updated Table 2 with MTBF data Tesla K80 GPU Accelerator BD-07317-001_v05 | ii TABLE OF CONTENTS Overview ............................................................................................. 1 Key Features ...................................................................................... 2 NVIDIA GPU Boost on Tesla K80 ................................................................ 3 Environmental Conditions ....................................................................... 4 Configuration ..................................................................................... 5 Mechanical Specifications ........................................................................ 6 PCI Express System ............................................................................... 6 Tesla K80 Bracket ................................................................................ 7 8-Pin CPU Power Connector ...................................................................
    [Show full text]
  • AMD Firestream™ 9350 GPU Compute Accelerator
    AMD FireStream™ 9350 GPU Compute Accelerator High Density Server GPU Acceleration The AMD FireStream™ 9350 GPU compute accelerator card offers industry- leading performance-per-watt in a highly dense, single-slot form factor. This AMD FireStream 9350 low-profile GPU card is designed for scalable high performance computing > Heterogeneous computing that (HPC) systems that require density and power efficiency. The AMD FireStream leverages AMD GPUs and x86 CPUs 9350 is ideal for a wide range of HPC applications across several industries > High performance per watt at 2.9 including Finance, Energy (Oil and Gas), Geosciences, Life Sciences, GFLOPS / Watt Manufacturing (CAE, CFD, etc.), Defense, and more. > Industry’s most dense server GPU card 1 Utilizing the multi-billion-transistor ASICs developed for AMD Radeon™ graphics > Low profile and passively cooled cards, AMD’s FireStream 9350 cards provide maximum performance-per-slot > Massively parallel, programmable GPU and are designed to meet demanding performance and reliability requirements architecture of HPC systems that can scale to thousands of nodes. AMD FireStream 9350 > Open standard OpenCL™ and cards include a single DisplayPort output. DirectCompute2 AMD FireStream 9350 cards can be used in scalable servers, blades, PCIe® > 2.0 TFLOPS, Single-Precision Peak chassis, and are available from many leading server OEMs and HPC solution > 400 GFLOPS, Double-Precision Peak providers. > 2GB GDDR5 memory Priced competitively, AMD FireStream GPUs offer unparalleled performance > Industry’s only Single-Slot PCIe® Accelerator and value for high performance computing. > PCI Express® 2.1 Compliant Note: For workstation and server > AMD Accelerated Parallel Processing installations that require active (APP) Technology SDK with OpenCL3 cooling, AMD FirePro™ Professional > 3-year planned availability; 3-year Graphics boards are software- limited warranty compatible and offer comparable features.
    [Show full text]
  • ATI Radeon™ HD 4870 Computation Highlights
    AMD Entering the Golden Age of Heterogeneous Computing Michael Mantor Senior GPU Compute Architect / Fellow AMD Graphics Product Group [email protected] 1 The 4 Pillars of massively parallel compute offload •Performance M’Moore’s Law Î 2x < 18 Month s Frequency\Power\Complexity Wall •Power Parallel Î Opportunity for growth •Price • Programming Models GPU is the first successful massively parallel COMMODITY architecture with a programming model that managgped to tame 1000’s of parallel threads in hardware to perform useful work efficiently 2 Quick recap of where we are – Perf, Power, Price ATI Radeon™ HD 4850 4x Performance/w and Performance/mm² in a year ATI Radeon™ X1800 XT ATI Radeon™ HD 3850 ATI Radeon™ HD 2900 XT ATI Radeon™ X1900 XTX ATI Radeon™ X1950 PRO 3 Source of GigaFLOPS per watt: maximum theoretical performance divided by maximum board power. Source of GigaFLOPS per $: maximum theoretical performance divided by price as reported on www.buy.com as of 9/24/08 ATI Radeon™HD 4850 Designed to Perform in Single Slot SP Compute Power 1.0 T-FLOPS DP Compute Power 200 G-FLOPS Core Clock Speed 625 Mhz Stream Processors 800 Memory Type GDDR3 Memory Capacity 512 MB Max Board Power 110W Memory Bandwidth 64 GB/Sec 4 ATI Radeon™HD 4870 First Graphics with GDDR5 SP Compute Power 1.2 T-FLOPS DP Compute Power 240 G-FLOPS Core Clock Speed 750 Mhz Stream Processors 800 Memory Type GDDR5 3.6Gbps Memory Capacity 512 MB Max Board Power 160 W Memory Bandwidth 115.2 GB/Sec 5 ATI Radeon™HD 4870 X2 Incredible Balance of Performance,,, Power, Price
    [Show full text]
  • Download Radeon Grpadhics Catalylist and Driver AMD Catalyst Graphics Driver 13.4 X64
    download radeon grpadhics catalylist and driver AMD Catalyst Graphics Driver 13.4 x64. Fixes : - With the Radeon HD7790 graphics card, corruption may be seen in the game Tomb Raider with TressFX enabled - Texture flickering may be observed in DirectX 9.0c enabled applications - Corruption observed in Tomb Raider with TressFX enabled for AMD Crossfire and single GPU system configurations - Corruption observed on objects and textures in the game, Call of Duty – Black Ops 2 - Corruption observed in the game, Alan Wake (DirectX 9 Mode) when attempting to change in-game settings with Stereoscopic 3D using Tridef - Corruption observed on textures in the game, Battle Field: Bad Company 2 with forced anti-aliasing enabled - Adobe Photoshop CS6 experiences screen flicker under Windows 8 based system - Horizontal line corruption may be observed with forced anti-aliasing enabled in the game, Dishonored - System hang may occur when scrolling through the game selection menu in the game, DiRT 3 - F1 2012 may crash to desktop on systems supporting AMD PowerXpress (Enduro) Technology - Support added for AMD Radeon HD7790 and AMD Radeon HD 7990 - Catalyst Driver optimizations to improve performance in Far Cry 3, Crysis 3, and 3DMark - Significantly improves latency performance in Skyrim, Boderlands 2, Guild Wars 2, Tomb Raider and Hitman Absolution. Performance gains seen with the entire AMD Radeon HD 7000 Series for the following: - Batman Arkham City (1920x1200): Performance improvements up to 5% - Borderlands 2 (2560x1600): Performance improvements
    [Show full text]
  • AMD Radeon E8860
    Components for AMD’s Embedded Radeon™ E8860 GPU INTRODUCTION The E8860 Embedded Radeon GPU available from CoreAVI is comprised of temperature screened GPUs, safety certi- fiable OpenGL®-based drivers, and safety certifiable GPU tools which have been pre-integrated and validated together to significantly de-risk the challenges typically faced when integrating hardware and software components. The plat- form is an off-the-shelf foundation upon which safety certifiable applications can be built with confidence. Figure 1: CoreAVI Support for E8860 GPU EXTENDED TEMPERATURE RANGE CoreAVI provides extended temperature versions of the E8860 GPU to facilitate its use in rugged embedded applications. CoreAVI functionally tests the E8860 over -40C Tj to +105 Tj, increasing the manufacturing yield for hardware suppliers while reducing supply delays to end customers. coreavi.com [email protected] Revision - 13Nov2020 1 E8860 GPU LONG TERM SUPPLY AND SUPPORT CoreAVI has provided consistent and dedicated support for the supply and use of the AMD embedded GPUs within the rugged Mil/Aero/Avionics market segment for over a decade. With the E8860, CoreAVI will continue that focused support to ensure that the software, hardware and long-life support are provided to meet the needs of customers’ system life cy- cles. CoreAVI has extensive environmentally controlled storage facilities which are used to store the GPUs supplied to the Mil/ Aero/Avionics marketplace, ensuring that a ready supply is available for the duration of any program. CoreAVI also provides the post Last Time Buy storage of GPUs and is often able to provide additional quantities of com- ponents when COTS hardware partners receive increased volume for existing products / systems requiring additional inventory.
    [Show full text]
  • AMD APP SDK Developer Release Notes
    AMD APP SDK v3.0 Beta Developer Release Notes 1 What’s New in AMD APP SDK v3.0 Beta 1.1 New features in AMD APP SDK v3.0 Beta AMD APP SDK v3.0 Beta includes the following new features: OpenCL 2.0: There are 20 samples that demonstrate various features of OpenCL 2.0 such as Shared Virtual Memory, Platform Atomics, Device-side Enqueue, Pipes, New workgroup built-in functions, Program Scope Variables, Generic Address Space, and OpenCL 2.0 image features. For the complete list of the samples, see the AMD APP SDK Samples Release Notes (AMD_APP_SDK_Release_Notes_Samples.pdf) document. Support for Bolt 1.3 library. 6 additional samples that demonstrate various APIs in the Bolt C++ AMP library. One new sample that demonstrates the consumption of SPIR 1.2 binary. Enhancements and bug fixes in several samples. A lightweight installer that supports the following features: Customized online installation Ability to download the full installer for install and distribution 1.2 New features for AMD CodeXL version 1.6 The following new features in AMD CodeXL version 1.6 provide the following improvements to the developer experience: GPU Profiler support for OpenCL 2.0 API-level debugging for OpenCL 2.0 Power Profiling For information about CodeXL and about how to use CodeXL to gather performance data about your OpenCL application, such as application traces and timeline views, see the CodeXL home page. Developer Release Notes 1 of 4 2 Important Notes OpenCL 2.0 runtime support is limited to 64-bit applications running on 64-bit Windows and Linux operating systems only.
    [Show full text]
  • HP Grafične Postaje: HP Z1, HP Z2, HP Z4, ……
    ARES RAČUNALNIŠTVO d.o.o. HP Z1 Tržaška cesta 330 HP Z2 1000 Ljubljana, Slovenia, EU HP Z4 tel: 01-256 21 50 Grafične kartice GSM: 041-662 508 e-mail: [email protected] www.ares-rac.si 29.09.2021 DDV = 22% 1,22 Grafične postaje HP Grafične postaje: HP Z1, HP Z2, HP Z4, …….. ZALOGA Nudimo vam tudi druge modele, ki še niso v ceniku preveri zalogo Na koncu cenika so tudi opcije: Grafične kartice. Ostale opcije po ponudbi. Objavljamo neto ceno in ceno z DDV (PPC in akcijsko). Dokončna cena in dobavni rok - po konkretni ponudbi Objavljamo neto ceno in ceno z DDV (PPC in akcijsko). Dokončna cena po ponudbi. Koda HP Z1 TWR Grafična postaja /Delovna postaja Zaloga Neto cena Cena z DDV (EUR) (EUR) HP Z1 G8 TWR Zaloga Neto cena Cena z DDV 29Y17AV HP Z1 G8 TWR, Core i5-11500, 16GB, SSD 512GB, nVidia GeForce RTX 3070 8GB, USB-C, Win10Pro #71595950 Koda: 29Y17AV#71595950 HP delovna postaja, HP grafična postaja, Procesor Intel Core i5 -11500 (2,7 - 4,6 GHz) 12MB 6 jedr/12 niti Nabor vezja Intel Q570 Pomnilnik 16 GB DDR4 3200 MHz (1x16GB), 3x prosta reža, do 128 GB 1.735,18 2.116,92 SSD pogon 512 GB M.2 2280 PCIe NVMe TLC Optična enota: brez, HDD pogon : brez Razširitvena mesta 2x 3,5'', 1x 2,5'', RAID podpira RAID AKCIJA 1.657,00 2.021,54 Grafična kartica: nVidia GeForce RTX 3070 8GB GDDR6, 256bit, 5888 Cuda jeder CENA Žične povezave: Intel I219-LM Gigabit Network Brezžične povezave: brez Razširitve: 1x M.2 2230; 1x PCIe 3.0 x16; 2x PCIe 3,0 x16 (ožičena kot x4); 2x M.2 2230/2280; 2x PCIe 3.0 x1 Čitalec kartic: brez Priključki spredaj: 1x USB-C, 2x
    [Show full text]
  • The Compute Shader
    AMD Entering the Golden Age of Heterogeneous Computing Michael Mantor Senior GPU Compute Architect / Fellow AMD Graphics Product Group [email protected] 1 The 4 Pillars of massively parallel compute offload •Performance M’Moore’s Law Î 2x < 18 Month s Frequency\Power\Complexity Wall •Power Parallel Î Opportunity for growth •Price • Programming Models GPU is the first successful massively parallel COMMODITY architecture with a programming model that managgped to tame 1000’s of parallel threads in hardware to perform useful work efficiently 2 Quick recap of where we are – Perf, Power, Price ATI Radeon™ HD 4850 4x Performance/w and Performance/mm² in a year ATI Radeon™ X1800 XT ATI Radeon™ HD 3850 ATI Radeon™ HD 2900 XT ATI Radeon™ X1900 XTX ATI Radeon™ X1950 PRO 3 Source of GigaFLOPS per watt: maximum theoretical performance divided by maximum board power. Source of GigaFLOPS per $: maximum theoretical performance divided by price as reported on www.buy.com as of 9/24/08 ATI Radeon™HD 4850 Designed to Perform in Single Slot SP Compute Power 1.0 T-FLOPS DP Compute Power 200 G-FLOPS Core Clock Speed 625 Mhz Stream Processors 800 Memory Type GDDR3 Memory Capacity 512 MB Max Board Power 110W Memory Bandwidth 64 GB/Sec 4 ATI Radeon™HD 4870 First Graphics with GDDR5 SP Compute Power 1.2 T-FLOPS DP Compute Power 240 G-FLOPS Core Clock Speed 750 Mhz Stream Processors 800 Memory Type GDDR5 3.6Gbps Memory Capacity 512 MB Max Board Power 160 W Memory Bandwidth 115.2 GB/Sec 5 ATI Radeon™HD 4870 X2 Incredible Balance of Performance,,, Power, Price
    [Show full text]
  • Workstation Fisse E Mobili Monitor Hub Usb-C Docking
    OCCUPATEVI DEL VOSTRO BUSINESS, NOI CI PRENDEREMO CURA DEI VOSTRI COMPUTERS E PROGETTI INFORMATICI 16-09-21 16 WORKSTATION FISSE E MOBILI MONITOR HUB USB-C DOCKING STATION Noleggiare computer, server, dispositivi informatici e di rete, Cremonaufficio è un’azienda informatica presente nel terri- può essere una buona alternativa all’acquisto, molti infatti torio cremonese dal 1986. sono i vantaggi derivanti da questa pratica e il funziona- Sin dai primi anni si è distinta per professionalità e capacità, mento è molto semplice. Vediamo insieme alcuni di questi registrando di anno in anno un trend costante di crescita. vantaggi: Oltre venticinque anni di attività di vendita ed assistenza di Vantaggi Fiscali. Grazie al noleggio è possibile ottenere di- prodotti informatici, fotocopiatrici digitali, impianti telefonici, versi benefici in termini di fiscalità, elemento sempre molto reti dati fonia, videosorveglianza, hanno consolidato ed affer- interessante per le aziende. Noleggiare computer, server e mato l’azienda Cremonaufficio come fornitrice di prodotti e dispositivi di rete permette di ottenere una riduzione sulla servizi ad alto profilo qualitativo. tassazione annuale e il costo del noleggio è interamente Il servizio di assistenza tecnica viene svolto da tecnici interni, deducibile. specializzati nei vari settori, automuniti. Locazione operativa, non locazione finanziaria. A differen- Cremonaufficio è un operatore MultiBrand specializzato in za del leasing, il noleggio non prevede l’iscrizione ad una Information Technology ed Office Automation e, come tale, si centrale rischi, di conseguenza migliora il rating creditizio, avvale di un sistema di gestione operativa certificato ISO 9001. facilita il rapporto con le banche e l’accesso ai finanzia- menti.
    [Show full text]
  • Catalyst™ Software Suite Version 9.5 Release Notes
    Catalyst™ Software Suite Version 9.5 Release Notes This release note provides information on the latest posting of AMD’s industry leading software suite, Catalyst™. This particular software suite updates both the AMD Display Driver, and the Catalyst™ Control Center. This unified driver has been further enhanced to provide the highest level of power, performance, and reliability. The AMD Catalyst™ software suite is the ultimate in performance and stability. For exclusive Catalyst™ updates follow Catalyst Maker on Twitter. This release note provides information on the following: z Web Content z AMD Product Support z Operating Systems Supported z New Features z Performance Improvements z Resolved Issues for the Windows Vista Operating System z Resolved Issues for the Windows XP Operating System z Resolved Issues for the Windows 7 Operating System z Known Issues Under the Windows Vista Operating System z Known Issues Under the Windows XP Operating System z Known Issues Under the Windows 7 Operating System z Installing the Catalyst™ Vista Software Driver z Catalyst™ Crew Driver Feedback ATI Catalyst™ Release Note Version 9.5 1 Web Content The Catalyst™ Software Suite 9.5 contains the following: z Radeon™ display driver 8.612 z HydraVision™ for both Windows XP and Vista z HydraVision™ Basic Edition (Windows XP only) z WDM Driver Install Bundle z Southbridge/IXP Driver z Catalyst™ Control Center Version 8.612 Caution: The Catalyst™ software driver and the Catalyst™ Control Center can be downloaded independently of each other. However, for maximum stability and performance AMD recommends that both components be updated from the same Catalyst™ release.
    [Show full text]
  • AMD APP SDK V2.9.1 Developer Release Notes
    AMD APP SDK v2.9.1 Developer Release Notes 1 What’s New in AMD APP SDK v2.9.1 1.1 New features in AMD APP SDK v2.9.1 AMD APP SDK v2.9.1 includes the following new features: The AMD APP SDK can now be installed on Linux with root as well as non-root permissions. The installation logs display additional details about the status of the installation. Several samples have been enhanced. All the samples are now compatible with the gcc version, 4.8.1. The APARAPI samples have been updated to work with the latest APARAPI (OpenCL trunk) library. The OpenCV-CL samples now work with the OpenCV 2.4.9 library. The AMD APP SDK Bolt samples have been updated to work with the Bolt 1.2 library. For information about the features released in AMD APP SDK v2.9, see the AMD APP SDK v2.9 Developer Release Notes document. 1.2 Key features supported in the AMD Catalyst 14.4 driver Support for Kaveri graphics AMD Radeon R5/R6/R7 Support for AMD Radeon R9 295X Full support for OpenGL 4.4 Support for the following operating systems: Windows 8.1 (32- and 64-bit versions) Windows 7 (32- and 64-bit versions with SP1 or higher) 1.3 New features for AMD CodeXL version 1.4 The following new features for AMD CodeXL version 1.4 provide the following improvements to the developer experience: Microsoft Visual Studio 2013 CodeXL Extension New Shader Analyzer Support for Volcanic Islands family GPUs: debugging, profiling and analyzing.
    [Show full text]
  • Lewis University Dr. James Girard Summer Undergraduate Research Program 2021 Faculty Mentor - Project Application
    Lewis University Dr. James Girard Summer Undergraduate Research Program 2021 Faculty Mentor - Project Application Exploring the Use of High-level Parallel Abstractions and Parallel Computing for Functional and Gate-Level Simulation Acceleration Dr. Lucien Ngalamou Department of Engineering, Computing and Mathematical Sciences Abstract System-on-Chip (SoC) complexity growth has multiplied non-stop, and time-to- market pressure has driven demand for innovation in simulation performance. Logic simulation is the primary method to verify the correctness of such systems. Logic simulation is used heavily to verify the functional correctness of a design for a broad range of abstraction levels. In mainstream industry verification methodologies, typical setups coordinate the validation e↵ort of a complex digital system by distributing logic simulation tasks among vast server farms for months at a time. Yet, the performance of logic simulation is not sufficient to satisfy the demand, leading to incomplete validation processes, escaped functional bugs, and continuous pressure on the EDA1 industry to develop faster simulation solutions. In this research, we will explore a solution that uses high-level parallel abstractions and parallel computing to boost the performance of logic simulation. 1Electronic Design Automation 1 1 Project Description 1.1 Introduction and Background SoC complexity is increasing rapidly, driven by demands in the mobile market, and in- creasingly by the fast-growth of assisted- and autonomous-driving applications. SoC teams utilize many verification technologies to address their complexity and time-to-market chal- lenges; however, logic simulation continues to be the foundation for all verification flows, and continues to account for more than 90% [10] of all verification workloads.
    [Show full text]