High-Performance Computing with Accelerators

Total Page:16

File Type:pdf, Size:1020Kb

High-Performance Computing with Accelerators High Performance Computing with Accelerators Volodymyr Kindratenko Innovative Systems Laboratory @ NCSA Institute for Advanced Computing Applications and Technologies (IACAT) National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Presentation Outline • Recent trends in application accelerators • FPGAs, Cell, GPUs, … • Accelerator clusters • Accelerator clusters at NCSA • Cluster management • Programmability Issues • Production codes running on Accelerator Clusters • Cosmology, molecular dynamics, electronic structure, quantum chromodynamics HPC on Special-purpose Processors • Field-Programmable Gate Arrays (FPGAs) – Digital signal processing, embedded computing • Graphics Processing Units (GPUs) – Desktop graphics accelerators • Physics Processing Units (PPUs) – Desktop games accelerators • Sony/Toshiba/IBM Cell Broadband Engine – Game console and digital content delivery systems • ClearSpeed accelerator – Floating-point accelerator board for compute- intensive applications GPU Performance Trends: FLOPS 1200 GTX 285 1000 GTX 280 800 /s 8800 Ultra 600 8800 GTX NVIDIA GPU GFlop Intel CPU 400 7900 GTX 7800 GTX 200 6800 Ultra Intel Xeon Quad-core 3 GHz 5800 5950 Ultra 0 9/22/02 2/4/04 6/18/05 10/31/06 3/14/08 Courtesy of NVIDIA GPU Performance Trends: Memory Bandwidth 180 GTX 285 160 GTX 280 140 120 8800 Ultra /s 100 8800 GTX 80 GByte 60 7800 GTX 7900 GTX 40 5950 Ultra 6800 Ultra 20 5800 0 9/22/02 2/4/04 6/18/05 10/31/06 3/14/08 Courtesy of NVIDIA NVIDIA Tesla T10 GPU Architecture PCIe interface Input assembler Thread execution manager • T10 architecture TPC 1 TPC 10 • 240 streaming Geometry controller Geometry controller processors arranged SMC SMC SM SM SM SM SM SM as 30 streaming I cache I cache I cache I cache I cache I cache MT issue MT issue MT issue MT issue MT issue MT issue multiprocessors C cache C cache C cache C cache C cache C cache SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP SP • At 1.3 GHz this SP SP SP SP SP SP SP SP SP SP SP SP provides SP SP SP SP SP SP SP SP SP SP SP SP SFU SFU SFU SFU SFU SFU SFU SFU SFU SFU SFU SFU • 1 TFLOPS SP Shared Shared Shared Shared Shared Shared memory memory memory memory memory memory • 86.4 TFLOPS DP Texture units Texture units Texture L1 Texture L1 • 512-bit interface to 512-bit memory interconnect off-chip GDDR3 L2 ROP ROP L2 memory DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM • 102 GB/s bandwidth NVIDIA Tesla S1070 GPU Computing Server 4 GB GDDR3 • 4 T10 GPUs 4 GB GDDR3 SDRAM SDRAM Tesla Tesla GPU supply Power GPU NVIDIA PCI x16 SWITCH Thermal Thermal NVIDIA x16 PCI management SWITCH Tesla GPU Tesla System System monitoring GPU 4 GB GDDR3 SDRAM 4 GB GDDR3 SDRAM GPU Clusters at NCSA • Lincoln • AC • Production system available the • Experimental system available standard NCSA/TeraGrid HPC for anybody who is interested allocation in exploring GPU computing Intel 64 Tesla Linux Cluster Lincoln • Dell PowerEdge 1955 server • Intel 64 (Harpertown) 2.33 GHz IB dual socket quad core SDR IB SDR IB • 16 GB DDR2 Dell PowerEdge Dell PowerEdge 1955 server 1955 server • InfiniBand SDR • S1070 1U GPU Computing Server PCIe x8 PCIe x8 • 1.3 GHz Tesla T10 processors PCIe interface PCIe interface • 4x4 GB GDDR3 SDRAM T10 T10 T10 T10 • Cluster • Servers: 192 DRAM DRAM DRAM DRAM • CPU cores: 1536 Tesla S1070 • Accelerator Units: 96 • GPUs: 384 AMD AMD Opteron Tesla Linux Cluster AC • HP xw9400 workstation • 2216 AMD Opteron 2.4 GHz IB dual socket dual core QDR IB • 8 GB DDR2 HP xw9400 workstation Nallatech PCI-X H101 • InfiniBand QDR FPGA card • S1070 1U GPU Computing Server PCIe x16 PCIe x16 • 1.3 GHz Tesla T10 processors PCIe interface PCIe interface • 4x4 GB GDDR3 SDRAM T10 T10 T10 T10 • Cluster • Servers: 32 DRAM DRAM DRAM DRAM • CPU cores: 128 Tesla S1070 • Accelerator Units: 32 • GPUs: 128 AC Cluster: 128 TFLOPS (SP) NCSA GPU Cluster Management Software • CUDA SDK Wrapper • Enables true GPU resources allocation by the scheduler • E.g., qsub -l nodes=1:ppn=1 will allocate 1 CPU core and 1 GPU, fencing other GPUs in the node from accessing • Virtualizes GPU devices • Sets thread affinity to CPU core “nearest” the the GPU device • GPU Memory Scrubber • Cleans memory between runs • GPU Memory Test utility • Tests memory for manufacturing defects • Tests memory for soft errors • Tests GPUs for entering an erroneous state GPU Node Pre/Post Allocation Sequence • Pre-Job (minimized for rapid device acquisition) • Assemble detected device file unless it exists • Sanity check results • Checkout requested GPU devices from that file • Initialize CUDA wrapper shared memory segment with unique key for user (allows user to ssh to node outside of job environment and have same gpu devices visible) • Post-Job • Use quick memtest run to verify healthy GPU state • If bad state detected, mark node offline if other jobs present on node • If no other jobs, reload kernel module to “heal” node (for CUDA 2.2 bug) • Run memscrubber utility to clear gpu device memory • Notify of any failure events with job details • Terminate wrapper shared memory segment • Check-in GPUs back to global file of detected devices GPU Programming • 3rd Party Programming tools • CUDA C 2.2 SDK • OpenCL 1.2 SDK • PGI x64+GPU Fortran & C99 Compilers • IACAT’s on-going work • CUDA-auto • CUDA-lite • GMAC • CUDA-tune • MCUDA/OpenMP Implicitly parallel programming with data structure and function property CUDA-auto annotations to enable auto parallelization Locality annotation programming to eliminate need for explicit management of CUDA-lite memory types and data transfers GMAC Parameterized CUDA programming using auto-tuning and optimization space CUDA-tune pruning st 1 generation CUDA programming MCUDA/ NVIDIA with explicit, hardwired thread OpenMP SDK organizations and explicit management of memory types and data transfers IA multi-core NVIDIA GPU & Larrabe Courtesy of Wen-mei Hwu, UIUC GMAC – Designed to reduce accelerator use barrier • Unified CPU / GPU Address Space: • Same CPU and GPU address • Customizable implicit data transfers: • Transfer everything (safe mode) • Transfer dirty data before kernel execution • Transfer data as being produced (default) • Multi-process / Multi-thread support • CUDA compatible Courtesy of Wen-mei Hwu, UIUC GPU Cluster Applications • TPACF • Cosmology code used to study how the matter is distributed in the Universe • NAMD • Molecular Dynamics code used to run large-scale MD simulations • DSCF • NSF CyberChem project; DSCF quantum chemistry code for energy calculations • WRF • Weather modeling code • MILC • Quantum chromodynamics code, work just started • … TPACF on AC QP GPU cluster scaling 1000.0 567.5 289.0 180.0× 147.5 154.1× 100.0 160.0× 79.9 46.3 29.3 140.0× (sec) time exeution 23.7 120.0× 10.0 1 10 100.0× 88.2× number of MPI threads 80.0× speedup 59.1× 60.0× 45.5× 40.0× 30.9 20.0× 6.8× 0.0× Quadro FX GeForce GTX Cell B./E. (SP) GeForce GTX PowerXCell 8i H101 (DP) 5600 280 (SP) 280 (DP) (DP) NAMD on Lincoln (8 cores and 2 GPUs per node, very early results) ~5.6 ~2.8 1.6 1.4 CPU (8ppn) 2 GPUs = 24 cores CPU (4ppn) 1.2 4 GPUs CPU (2ppn) 1 8 GPUs GPU (4:1) 0.8 GPU (2:1) 16 GPUs STMV s/stepSTMV 0.6 GPU (1:1) 0.4 8 GPUs = 0.2 96 CPU cores 0 4 8 16 32 64 128 CPU cores Courtesy of James Phillips, UIUC DSCF on Lincoln Bovine pancreatic trypsin inhibitor (BPTI) 3-21G, 875 atoms, 4893 MPI timings and scalability basis functions Courtesy of Ivan Ufimtsev, Stanford GPU Clusters Open Issues / Future Work • GPU cluster configuration • CPU/GPU ratio • Host/device memory ratio • Cluster interconnect requirements • Cluster management • Resources allocation • Monitoring • CUDA/OpenCL SDK for HPC • Programming models for accelerator clusters • CUDA/OpenCL+pthreads/MPI/OpenMP/Charm++/….
Recommended publications
  • 1Gb Gddr3 SDRAM E-Die
    K4W1G1646E 1Gb gDDR3 SDRAM 1Gb gDDR3 SDRAM E-die 96 FBGA with Lead-Free & Halogen-Free (RoHS Compliant) INFORMATION IN THIS DOCUMENT IS PROVIDED IN RELATION TO SAMSUNG PRODUCTS, AND IS SUBJECT TO CHANGE WITHOUT NOTICE. NOTHING IN THIS DOCUMENT SHALL BE CONSTRUED AS GRANTING ANY LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHER- WISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IN SAMSUNG PRODUCTS OR TECHNOL- OGY. ALL INFORMATION IN THIS DOCUMENT IS PROVIDED ON AS "AS IS" BASIS WITHOUT GUARANTEE OR WARRANTY OF ANY KIND. 1. For updates or additional information about Samsung products, contact your nearest Samsung office. 2. Samsung products are not intended for use in life support, critical care, medical, safety equipment, or similar applications where Product failure could result in loss of life or personal or physical harm, or any military or defense application, or any governmental procurement to which special terms or provisions may apply. * Samsung Electronics reserves the right to change products or specification without notice. 1 of 123 Rev. 1.2 July 2009 K4W1G1646E 1Gb gDDR3 SDRAM Revision History Revision Month Year History 0.0 July 2008 - First release - Correction ball configuration on page 4. 0.9 October 2008 - Changed ordering Infomation 0.9 November 2008 - Correction package size on page 5. - Added thermal characteristics table 0.91 December 2008 - Added speed bin 1066Mbps - Corrected tFAW value on page 44 0.92 February 2009 - Corrected Package Dimension on page - Corrected Package pinout on page 4 - Added thermal characteristics values (speed bin 1066/1333/1600Mbps ) on page 14. 1.0 February 2009 - Added IDD SPEC values (speed bin 1066/1333/1600Mbps) on page 36.
    [Show full text]
  • NVIDIA Quadro CX Overview
    QuickSpecs NVIDIA Quadro CX Overview Models NVIDIA Quadro CX – The Accelerator for Creative Suite 4 Introduction The NVIDIA® Quadro® CX is the accelerator for Adobe® Creative Suite® 4-giving creative professionals the performance, tools, and reliability they need to maximize their creativity. Quadro CX enables H.264 video encoding at lightning-fast speeds with NVIDIA CUDA™ technology and accelerates rendering time for advanced effects. A Faster Way to Work Faster video encoding: Encode H.264 video at lightning-fast speeds with the NVIDIA CUDA™-enabled plug-in for Adobe Premiere® Pro CS4. Accelerated effects: Accelerate rendering time for advanced effects such as transformations, color correction, depth of field blur, turbulent noise, and more. A Better Way to Work High-quality preview: Accurately see what your deliverable will look like with 30-bit color or uncompressed 10-bit/12-bit SDI before final output.* Natural canvas: Experience fluid interaction with the Adobe Photoshop® canvas for smoother zooming and image rotation. Advanced multi-display management tools: Easily work across multiple displays with NVIDIA nView® advanced management tools for a smoother production pipeline. A More Reliable Way to Work Designed for CS4: NVIDIA Quadro CX is designed and optimized for Creative Suite 4. Built by NVIDIA: Quadro CX is engineered and optimized by NVIDIA to ensure your system works when you need it. Extended product lifecycle: Extended (24-36 month) product life cycle and availability for a stable platform environment. NOTE: As of August 1, 2009 Quadro CX is being replaced with Elemental Accelerator Software for NVIDIA Quadro. The same functionality is offered with drop in the box software, across multiple NVIDIA Quadro graphics cards.
    [Show full text]
  • GRA110 3U VPX High Performance Graphics Board
    GE Fanuc Embedded Systems GRA110 3U VPX High Performance Graphics Board Features The GRA110 is the first graphics board to be With a rich set of I/O, the GRA110 is designed • NVIDIA G73 GPU announced in the 3U VPX form factor. Bringing to serve many of the most common video - As used on NVIDIA® GeForce® 7600GT desktop performance to the rugged market, applications. Dual, independent channels the GRA110 represents a step change in mean that it is capable of driving RGB analog • Leading OpenGL performance capability for the embedded systems integrator. component video, digital DVI 1.0, and RS170, • 256 MBytes DDR SDRAM With outstanding functionality, together with NTSC or PAL standards. In addition, the GRA110’s • Two independent output channels PCI ExpressTM interconnect, even the most video input capability allows integration of sensor demanding applications can now be deployed data using RS170, NTSC or PAL video formats. • VESA output resolutions to 1600x1200 with incredible fidelity. • RS-170, NTSC & PAL video output • DVI 1.0 digital video output The VPX form factor allows for high speed PCI Express connections to single board computers in • RS170, NTSC & PAL Video output the system. The GRA110 supports the 16-lane PCI • Air- and rugged conduction- variants Express implementation, providing the maximum • 3U VPX Form Factors available communication bandwidth to a CPU such as GE Fanuc Embedded System’s SBC340. The PCI Express link will automatically adapt to the active number of lanes available, and so will work with single board computers
    [Show full text]
  • NVIDIA Quadro FX 560 Graphics Controller Overview
    QuickSpecs NVIDIA Quadro FX 560 Graphics Controller Overview Models NVIDIA Quadro FX 560 PCI-Express graphics controller kit. ES354AA Includes: PCA with ATX bracket, DVI to VGA converters, HDTV dongle, CD and manual. Introduction NVIDIA Quadro FX 560 features uncompromised performance and programmability at an entry-level price for professional CAD, DCC, and visualization applications. NVIDIA Quadro FX 560 entrylevel graphics are designed to deliver cost-effective solutions to professional 3D environments without compromising on quality, precision, performance, and programmability. Featuring advanced features ranging from a 128-bit floating point graphics pipeline, 12-bit subpixel precision, fast GDDR3 memory, and HD output, the NVIDIA Quadro FX 560 is certified on a wide set of CAD, DCC, and visualization applications, offering the value and capabilities professional users expect. In addition, support for one dual-link DVI output and HD component out enables direct connection to both high resolution digital flat panels and broadcast monitors, making the NVIDIA Quadro FX 560 a must have for any video editing solution. Performance & Features Features include an array of parallel vertex engines, fully programmable pixel pipelines, a high-speed graphics memory bus, and next-generation crossbar memory architecture: 128MB G-DDR3 graphics memory Two DVI-I (one dual-link) + 9-pin HDTV Out (HD output cable included) OpenGL quad-buffered stereo Rotated-Grid FSAA High Precision Dynamic Range Technology Unlimited Vertex & Pixel programmability 128-bit IEEE floating-point precision graphics pipeline\ 32-bit floating point color precision per component Hardware overlays Hardware accelerated anti-aliased points and lines Two-sided lighting Occlusion culling Advanced full-scene anti-aliasing Optimized and certified for OpenGL 2.0 and DirectX® 9.0 applications Multi-display productivity PCIe x16 bus Compatibility The NVIDIA Quadro FX 560 is supported on HP xw6400, xw8400 Workstations.
    [Show full text]
  • NVIDIA Quadro FX 380 256MB Graphics Card Overview
    QuickSpecs NVIDIA Quadro FX 380 256MB Graphics Card Overview Models NVIDIA Quadro FX 380 256MB PCIe Graphics Card NB769AA Introduction NVIDIA® Quadro® FX 380: New level of performance at a breakthrough price Affordable professional-class graphics solution, the NVIDIA® Quadro® FX 380 solution provides energy efficiency while delivering 50% faster performance across industry-leading CAD and digital content applications. Performance and Features 128-Bit Precision Graphics Pipeline: Full IEEE 32-bit floating-point precision per color component (RGBA) delivers millions of color variations with the broadest dynamic range. Sets new standards for image clarity and quality in shading, filtering, texturing, and blending. Enables unprecedented rendered image quality for visual effects processing. Full-Scene Antialiasing (FSAA): Up to 16x FSAA dramatically reduces visual aliasing artifacts or "jaggies," resulting in highly realistic scenes. 256 MB GDDR3 GPU Memory with ultra fast memory bandwidth: Delivers high throughput for interactive visualization of large models and high-performance for real time processing of large textures and frames and enables the highest quality and resolution full-scene antialiasing (FSAA). Advanced Color Compression, Early Z-Cull: Improved pipeline color compression and early z-culling to increase effective bandwidth and improve rendering efficiency and performance. Full-Scene Antialiasing (FSAA): Up to 16x FSAA dramatically reduces visual aliasing artifacts or "jaggies," resulting in highly realistic scenes. Highest Workstation Application Performance: Next-generation architecture enables over 2x improvement in geometry and fill rates with the industry's highest performance for professional CAD, DCC, and scientific applications. High-Performance Display Outputs: 400MHz RAMDACs and up to two dual-link DVI digital connectors drive the highest resolution digital displays available on the market.
    [Show full text]
  • NVIDIA NVS 300 512MB Graphics Card for Workstations Overview
    QuickSpecs NVIDIA NVS 300 512MB Graphics Card for Workstations Overview Models NVIDIA NVS300 512MB PCIe Graphics Card XP612AA Introduction The NVIDIA® NVS™ 300 high resolution, multi-display business graphics solution, designed for small and standard form factor systems, delivers reliable hardware and software for a stable business environment. Tested on leading business applications, NVIDIA NVS 300 is capable of driving two VGA, single link DVI, DisplayPort or HMDI displays. With NVIDIA Mosaic and nView® technologies built in, you can span and efficiently manage your entire Windows desktop across multiple displays. DA - 13906 North America — Version 6 — April 1, 2012 Page 1 QuickSpecs NVIDIA NVS 300 512MB Graphics Card for Workstations Overview Performance and Features Designed, tested, and built by NVIDIA to meet ultra high standards of quality assurance. Fast 512 MB DDR3 graphics memory delivers high throughput to simultaneously play stunning HD video on each connected display. NVIDIA Power Management technology helps reduce the overall system energy cost, by intelligently adapting the total power utilization of the graphics subsystem based on the applications being run by the end user. 16 CUDA GPU parallel computing cores compatible with all CUDA accelerated applications. NVIDIA CUDA provides a C language environment and tool suite that unleashes new capabilities to solve various visualization challenges. Drive up to two displays: Support up to two 1920 x 1200 digital displays with DVI connection Support up to two 30" (2560 x 1600) digital displays through optional DMS-59 to DisplayPort adapter The nView® Advanced Desktop Software delivers maximum flexibility for single-large display or multi-display options, providing unprecedented end-user control of the desktop experience for increased productivity.
    [Show full text]
  • Tesla C1060 Computing Processor Board
    TESLA C1060 COMPUTING PROCESSOR BOARD BD-04111-001_v06 | January 2010 Board Specification DOCUMENT CHANGE HISTORY BD-04111-001_v06 Version Date Authors Description of Change 01 July 10, 2008 SG, SM Preliminary Release 02 July 15, 2008 SG, SM Minor text updates Updated Support Information section 03 September 22, 2008 SG, SM Initial Release Updated thermal information Minor text updates 04 December 8, 2008 SG, SM Updated fan flow and power information 05 January 22, 2009 SG, SM Updated board power 06 January 26, 2010 RK, SM Updated document look and feel Updated storage temperature (Table 10) Tesla C1060 Computing Processor Board BD-04111-001_v06 | ii TABLE OF CONTENTS Overview ........................................................................................... 1 Key Features ..................................................................................... 1 Computing Processor Description ............................................................. 2 Configuration .................................................................................... 3 Mechanical Specifications ....................................................................... 4 PCI Express System ............................................................................. 4 Standard I/O connector Placement .......................................................... 5 Internal Connectors and Headers ............................................................. 6 External PCI Express Power Connectors .................................................. 6 4-Pin
    [Show full text]
  • GPU Computing: More Exciting Times for HPC! Paul G
    GPU Computing: More exciting times for HPC! Paul G. Howard, Ph.D. Chief Scientist, Microway, Inc. Copyright 2009 by Microway, Inc. Introduction In the never-ending quest in the High Performance Computing world for more and more computing speed, there's a new player in the game: the general purpose graphics processing unit, or GPU. The GPU is the processor used on a graphics card, optimized to perform simple but intensive processing on many pixels simultaneously. Taking a step back from its use in graphics, the GPU can be thought of as a high- performance SPMD (single-process multiple-data) processor capable of extremely high computational throughput, at low cost and with a far higher computational throughput-to-power ratio than is possible with CPUs. Performance speedups between 10x and 1000x have been reported using GPUs on real-world applications. For most people, a performance improvement of even one order of magnitude does not simply increase productivity. It will allow previously impractical projects to be completed in a timely manner. With three orders of magnitude improvement, a one minute simulation may be viewed real-time at 15 frames per second. GPU computing will change the way you think about your research! In late 2006, NVIDIA and ATI (acquired by AMD in 2006) began marketing GPU boards as streaming coprocessor boards. Both companies released software to expose the processor to application programmers. Since then the hardware has grown more powerful and the available software has become more high-level and developer-friendly. Hardware specifications and comparisons The GPU cards currently on the market are available from NVIDIA and AMD.
    [Show full text]
  • NVIDIA Quadro FX 4800 1.5GB Pcie Graphics Card Overview
    QuickSpecs NVIDIA Quadro FX 4800 1.5GB PCIe Graphics Card Overview Models NVIDIA Quadro FX 4800 1.5GB PCIe Graphics Card FQ138AA Introduction Whether designing and bringing the next luxury vehicle to market, creating the next blockbuster film, or providing a diagnosis of a patient's condition, professionals push the boundaries of realism, performance, and quality everyday. In an increasingly competitive and high-pressure landscape, they need to deliver results better, faster, and more cost effectively than ever before and traditional processing paradigms just cannot keep up. To get ahead of this trend, a new visual computing model is emerging based on the latest generation NVIDIA® Quadro® FX high- end graphics cards, like the Quadro FX 4800. Professional applications take advantage of the card's advanced feature set, including 1.5 GB of graphics memory and high color fidelity, providing the right set of tools to deliver results that push the realms of visual computing. The Quadro FX 4800 is built on the second-generation NVIDIA GPU unified architecture, delivering up to 50% increased performance over the first generation through 192 processor cores. The entire Quadro family takes professional visualization applications to a new level of interactivity by enabling unprecedented programmability and precision. The industry's leading workstation applications leverage these capabilities to deliver hardware- accelerated features, performance, and quality not found in other professional graphics solutions. Performance and Features Massive 1.5GB frame buffer and memory bandwidth up to 76.8 GB/sec. delivers high throughput for interactive visualization of large models and high-performance for real time processing of large textures and frames and enables the highest quality and resolution full-scene antialiasing (FSAA).
    [Show full text]
  • 512Mbit GDDR3 SDRAM
    http://www.BDTIC.com/SAMSUNG K4J52324QE 512M GDDR3 SDRAM 512Mbit GDDR3 SDRAM Revision 1.3 Feb 2008 INFORMATION IN THIS DOCUMENT IS PROVIDED IN RELATION TO SAMSUNG PRODUCTS, AND IS SUBJECT TO CHANGE WITHOUT NOTICE. NOTHING IN THIS DOCUMENT SHALL BE CONSTRUED AS GRANTING ANY LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IN SAMSUNG PRODUCTS OR TECHNOLOGY. ALL INFORMATION IN THIS DOCUMENT IS PROVIDED ON AS "AS IS" BASIS WITHOUT GUARANTEE OR WARRANTY OF ANY KIND. 1. For updates or additional information about Samsung products, contact your nearest Samsung office. 2. Samsung products are not intended for use in life support, critical care, medical, safety equipment, or similar applications where Product failure could result in loss of life or personal or physical harm, or any military or defense application, or any governmental procurement to which special terms or provisions may apply. * Samsung Electronics reserves the right to change products or specification without notice. - 1/58 - Rev. 1.3 Feb 2008 http://www.BDTIC.com/SAMSUNG K4J52324QE 512M GDDR3 SDRAM Revision History Revision Month Year History 0.0 March 2006 - Target Spec - Finalized Spec • Deleted -BC20/16 binning • Changed Voltage SPEC from C-die : VDD/VDDQ : 2.0V + 0.1V of -BJ12 @ C-die => -BC12 : 1.8V + 0.1V @ E-die : VDD/VDDQ : 2.0V + 0.1V of - BJ11 @ C-die => -BJ11: 1.9V + 0.1V @ E-die 1.0 June 2006 • Added -BJ1A(GF2000) spec : VDD/VDDQ : 1.9V + 0.1V @ E-die • Added current data • Added IBIS data • Added comment on page 27.
    [Show full text]
  • NVIDIA Tesla C870 GPU Computing Processor Board
    Board Specification NVIDIA Tesla C870 GPU Computing Processor Board April 14, 2008 | BD-03399-001_v04 Document Change History Version Date Reason for Change 01 July 24, 2007 Preliminary Release 02 August 1, 2007 Clarified language support for Linux (Table 9) 03 January 29, 2008 Removed confidential statement Updated look and feel to meet current standards 04 April 14, 2008 Updated terminology to be the same as used on web site and other documents April 14, 2008 | BD-03399-001_v04 ii Table of Contents NVIDIA Tesla C870 Overview.........................................................................................1 Key Features....................................................................................................................1 GPU Description ............................................................................................................... 2 Configuration ...................................................................................................................3 Mechanical Specification ................................................................................................4 PCI Express System.......................................................................................................... 4 Placement of Standard I/O Connectors............................................................................... 5 Internal Connector and Headers..................................................................................... 6 PCI Express Power Connector .......................................................................................
    [Show full text]
  • Sample Cut Sheets with Rated Power
    http://h18004.www1.hp.com/products/quickspecs/12849_na/12849_na.html Overview Supported Components System Technical Specifications Technical Specifications - Processors Technical Specif Technical Specifications - Hard Drives Technical Specifications - Hard Drive Controllers Technical Specifications - Multimedia and Technical Specifications - Optical and Removable Storage Technical Specifications - Networking and Communications Technical Overview HP xw8600 Workstation HP recommends Windows Vista® Business 1. Monitor (sold separately) 7. 1 PCI slot, 1 PCI-X slot, 1 PCIe x1 or x8 (selectable), 2 PCIe x8 (x4 electrically) 2. Standard Keyboard (USB or PS/2) 8. 2 PCI Express x16 Gen2 Graphics Bus 3. Mouse (USB or PS/2) 9. Dual-Core or Quad-Core Intel® Xeon® Processors 4. Front IO: 2 USB 2.0, IEEE-1394a (standard), 10. 8 DIMM slots (16 with riser) for DDR2 FB-DIMM headphone and microphone memory 5. 5.25" external bay for optional diskette drive, optical 11. 5 USB 2.0, 1 standard serial port, 2 PS/2, 2 RJ-45, drive or other 5.25"/3.5" device audio line in, audio line out, and microphone in, microphone, 1 IEEE-1394a 6. 5 internal 3.5" bays, 3 external 5.25" bays 12. Choice of 800 or 1050 watt, 80 PLUS power supplies Form Factor Minitower Compatible Operating Genuine Windows Vista® 32-bit downgrade to Genuine Microsoft® Windows® XP Professional Systems 32-bit Genuine Windows Vista® 64-bit downgrade to Genuine Microsoft® Windows® XP Professional 64-bit Genuine Windows Vista® Business 32-bit Genuine Windows Vista® Business 64-bit HP Installer Kit
    [Show full text]