Corporate Backgrounder

Total Page:16

File Type:pdf, Size:1020Kb

Corporate Backgrounder AGEIA PhysX FAQs for System and Board Partners For use in partner marketing materials KEY QUESTIONS 1: What exactly is the AGEIA PhysX Processor? 2: What is the Gaming Power Triangle? 3: What is physics for gaming? 4: What is a physics processor? 5: What makes the AGEIA PhysX hardware special? 6: Why do I need physics in my game? 7: Isn’t physics just juiced up graphics? 8: Can I get PhysX enabled games today? 9: Why do I need dedicated hardware for physics? 10: How do I get the new AGEIA PhysX processor into my PC? 11: How much does AGEIA PhysX cost? 12: How do games get PhysX-optimized? 13: Do I have to buy new hardware every year just to get new physics features? AGEIA Confidential March 2006 . 1 What exactly is the AGEIA PhysX Processor? The AGEIA PhysX processor is the first dedicated hardware accelerator for PC games that enables dynamic motion and interaction for a totally new gaming experience. Exciting immersive physical gaming feature that you can expect to see in next generation cutting edge titles will include groundbreaking physics features such as: * Explosions that cause dust and collateral debris * Characters with complex, jointed geometries for more life-like motion and interaction * Spectacular new weapons with unpredictable effects * Cloth that drapes and tears the way you expect it to * Lush foliage that sways naturally when brushed against * Dense smoke & fog that billow around objects in motion What is the Gaming Power Triangle The gaming triangle represents the three critical PC processors that enable the best possible gaming experience. The AGEIA PhysX processor is the key hardware element required for optimized game physics and is the new third engine of the Gaming Power Triangle which will drive a new era in exciting and immersive gaming. This new triangle consists of the central processing unit (CPU), graphics processing unit (GPU) and Physics Processing Unit (PPU) working together to balance the extensive computing demands of game logic, graphics and physics. Within this balanced triangle the CPU “thinks and orchestrates” to drive game artificial intelligence and logic, the general purpose GPU “renders and displays” to deliver beautiful 3D imagery, and the third leg of the triangle, the new AGEIA PhysX Processor, “moves and interacts” to take gaming to the next level with pervasive dynamic motion and interaction. AGEIA Confidential March 2006 . 2 What is physics for gaming? It’s the next big thing in gaming. Physics is all about how objects in your game move and react. It’s not just how things look, but how they behave. In many of today’s games, objects just don’t seem to act the way you’d want or expect. Most of the action is limited to pre-scripted or canned animations. Even the most powerful weapons leave little more than a charred smudge on the thinnest of walls; and every opponent you take out falls in a strangely familiar way. Serious gamers are left with a fine game, but one with a missing sense of realism to make the experience truly immersive. Until now, only limited software physics has been integrated in games. That means physics computation has been handled by the general purpose CPU which is already burdened with the ever increasing demands of today’s advanced games including game logic and AI. Meanwhile, the highly specialized graphics processor is fully engaged with handling the rendering requirements of cinematic visuals at interactive framerates. As a result, physics in games has been limited to a few objects in a scene, one-off “effects” or visual trickery that just mimics real physics. Now for the first time, the AGEIA PhysX Processor delivers the computing horsepower necessary to enable true, advanced physics in games. What is a physics processor? A physic processor is one designed specifically for the unique requirements of physics algorithms, which represent an environment that’s much different general purpose or graphics processing. And the AGEIA PhysX processor is the first and only physics processor (PPU) available for games. The PPU calculates all of the movement and interaction in a game at incredible speeds, so that graphically- sophisticated games can have realistic “cause and effect” action. AGEIA PhysX calculates the equations of matter and force and their interaction and movement in real-time environments. What makes the AGEIA PhysX hardware special? The only way to get real physics with the scale, sophistication, fidelity and high level of interactivity is with the AGEIA PhysX processor, which was developed from day one to accelerate the highly specialized physically based simulations. Its massively parallel, interactive AGEIA PhysX cores are optimized specifically for dynamic large-scale, physics processing, which enable it to accelerate real-time physical motion and interaction at a scale and quality far beyond that of general purpose solutions. For more information, read “Advanced Gaming Physics – Defining the New Reality in PC Hardware AGEIA Confidential March 2006 . 3 Why do I need physics in my game? In today’s games, objects just don’t seem to act the way they would in real life. Most of the action is limited to pre-scripted or canned animations. For example, fired shots leave strangely similar looking wallpaper-like bullet hole “marks” on the wall; players’ arms and legs somehow manage to actually protrude through what appeared to be a brick wall; objects of different weight, size and mass respond to contact oddly in the same way. Serious gamers are left a great game, but with a missing level of reality. But when physics is added to the mix, games can finally act real and feel real. Physics enables games to replace the repetitive predetermined motion with real interactions. So just like life, you never have the same experience twice. Buildings and landscapes are now massively destructible with extreme explosions of thousands of shards of glass and shrapnel that cause collateral damage; lifelike characters run, jump, block, tackle, swim or fly in remarkably natural lifelike ways; realistic smoke, fog and oozing fluids now react with characters, objects and entire physical environments. The possibilities are really endless. Isn’t physics just juiced up graphics? No, graphics is all about how things look. And today’s general purpose GPUs do a great job of rendering and displaying beautiful 3D imagery. But 3D graphics is just that: Graphics. Physics is an entirely different beast. It’s not about how objects look, but about how objects in your game move and react. Can I get PhysX enabled games today? Yes. First of all, there are over 60 developers planning to deliver as many as 100 PhysX-accelerated game titles from now through 2006 and into 2007. I’m not sure I’ve ever seen a brand new technology category like this being supported to this extent by leading content providers. In May 2006, Tom Clancy’s Ghost Recon Advanced Warfighter from Ubisoft, one of this year’s most anticipated titles, will hit the shelves with a wide variety of new “knock your socks off” PhysX features. The Microsoft Xbox360 version is already getting 9.9-class reviews with a big chunk of physics being driven by AGEIA PhysX software. A patch to Bet on Soldier with incredible fluid weapons unlike any you’ve seen before is available now. And Artificial Reality’s Cell Factor, a mind-bending game in development which is lighting up everyone who sees it, will be downloadable within a matter of days. What’s more, downloadable updates of NCSoft’s City of Villains MMOG and the highly touted Rise of Nations: Rise of Legends by Microsoft are soon to be available. As many as 20 games, including Epic’s Unreal Tournament 2007, have been publicly announced as supporting PhysX and we expect at least 20 more to hit store shelves in 2006. That doesn’t even count what we’ll see in 2007. And with leading game engines like Epic’s Unreal Engine 3, Emergent’s Gamebryo 2.2 and Bioware’s Eclipse, the possibilities get even bigger. AGEIA Confidential March 2006 . 4 Why do I need dedicated hardware for physics? Delivering physics in games is no easy task. It’s an extremely compute-intensive environment based on a unique set of physics algorithms that require tremendous amounts of mathematical and logical calculations supported by massive memory bandwidth. Simply put, it requires the AGEIA PhysX processor: a specialized accelerator dedicated solely to delivering rich immersive physical gaming environments. The only way to get real physics with the scale, sophistication, fidelity and level of interactivity that dramatically alters your entertainment experience is with the AGEIA PhysX processor, which was developed from day one to accelerate the highly specialized physically based simulations. For more information, read “Advanced Gaming Physics – Defining the New Reality in PC Hardware” (PDF) How do I get the new AGEIA PhysX processor into my PC? The AGEIA PhysX processor is integrated into an add-in card known as a PhysX Accelerator which is [insert sales information from AGEIA partner here] How much does AGEIA PhysX cost? [insert sales information from AGEIA partner here] How do games get PhysX-optimized? AGEIA makes more than hardware. That’s only half of the story. AGEIA provides game developers world class PhysX software which enables next-generation physics features accelerated by the PhysX processor. Not only is the software unmatched in feature functionality, stability , performance and robustness, it’s also supported by all the major developer tools and enables straightforward cross-platform support for Microsoft Xbox360 and Sony Playstaton 3. AGEIA continues to add new features to new revisions of PhysX software, which game developers will integrate into the latest and greatest titles.
Recommended publications
  • CUDA Particle-Based Fluid Simulation
    CUDA Particle-based Fluid Simulation Simon Green Overview Fluid Simulation Techniques CUDA particle simulation Spatial subdivision techniques Rendering methods Future © NVIDIA Corporation 2008 Fluid Simulation Techniques Various approaches: Grid based (Eulerian) Stable fluids Particle level set Particle based (Lagrangian) SPH (smoothed particle hydrodynamics) MPS (Moving-Particle Semi-Implicit) Height field FFT (Tessendorf) Wave propagation – e.g. Kass and Miller © NVIDIA Corporation 2008 CUDA N-Body Demo Computes gravitational attraction between n bodies Computes all n2 interactions Uses shared memory to reduce memory bandwidth 16K bodies @ 44 FPS x 20 FLOPS / interaction x 16K2 interactions / frame = 240 GFLOP/s GeForce 8800 GTX © NVIDIA Corporation 2008 Particle-based Fluid Simulation Advantages Conservation of mass is trivial Easy to track free surface Only performs computation where necessary Not necessarily constrained to a finite grid Easy to parallelize Disadvantages Hard to extract smooth surface from particles Requires large number of particles for realistic results © NVIDIA Corporation 2008 Particle Fluid Simulation Papers Particle-Based Fluid Simulation for Interactive Applications, M. Müller, 2003 3000 particles, 5fps Particle-based Viscoelastic Fluid Simulation, Clavet et al, 2005 1000 particles, 10fps 20,000 particles, 2 secs / frame © NVIDIA Corporation 2008 CUDA SDK Particles Demo Particles with simple collisions Uses uniform grid based on sorting Uses fast CUDA radix sort Current performance: >100 fps for 65K interacting
    [Show full text]
  • HELLAS: a Specialized Architecture for Interactive Deformable Object Modeling
    HELLAS: A Specialized Architecture for Interactive Deformable Object Modeling Shrirang Yardi Benjamin Bishop Thomas Kelliher Department of Electrical and Department of Computing Department of Mathematics Computer Engineering Sciences and Computer Science Virginia Tech University of Scranton Goucher College Blacksburg, VA Scranton, PA Baltimore, MD [email protected] [email protected] [email protected] ABSTRACT more realistic models, it is likely that the computational Applications involving interactive modeling of deformable resources required by these simulations may continue to ex- objects require highly iterative, floating-point intensive nu- ceed those provided by the incremental advances in general- merical simulations. As the complexity of these models in- purpose systems [12]. This has created an interest in the creases, the computational power required for their simula- use of specialized hardware that provides high performance tion quickly grows beyond the capabilities of current gen- computational engines to accelerate such floating-point in- eral purpose systems. In this paper, we present the design tensive applications [6, 7]. of a low–cost, high–performance, specialized architecture to Our research focuses on the design of such specialized sys- accelerate these simulations. Our aim is to use such special- tems. However, our approach has several important differ- ized hardware to allow complex interactive physical model- ences as compared to previous work in this area: (i) we ing even on consumer-grade PCs. In this paper, we present specifically target algorithms that are numerically stable, details of the target algorithms, the HELLAS architecture, but are much more computationally intensive than those simulation results and lessons learned from our implemen- used by existing hardware implementations [3,6]; (ii) we aim tation.
    [Show full text]
  • NVIDIA Physx SDK EULA
    NVIDIA CORPORATION NVIDIA® PHYSX® SDK END USER LICENSE AGREEMENT Welcome to the new world of reality gaming brought to you by PhysX® acceleration from NVIDIA®. NVIDIA Corporation (“NVIDIA”) is willing to license the PHYSX SDK and the accompanying documentation to you only on the condition that you accept all the terms in this License Agreement (“Agreement”). IMPORTANT: READ THE FOLLOWING TERMS AND CONDITIONS BEFORE USING THE ACCOMPANYING NVIDIA PHYSX SDK. IF YOU DO NOT AGREE TO THE TERMS OF THIS AGREEMENT, NVIDIA IS NOT WILLING TO LICENSE THE PHYSX SDK TO YOU. IF YOU DO NOT AGREE TO THESE TERMS, YOU SHALL DESTROY THIS ENTIRE PRODUCT AND PROVIDE EMAIL VERIFICATION TO [email protected] OF DELETION OF ALL COPIES OF THE ENTIRE PRODUCT. NVIDIA MAY MODIFY THE TERMS OF THIS AGREEMENT FROM TIME TO TIME. ANY USE OF THE PHYSX SDK WILL BE SUBJECT TO SUCH UPDATED TERMS. A CURRENT VERSION OF THIS AGREEMENT IS POSTED ON NVIDIA’S DEVELOPER WEBSITE: www.developer.nvidia.com/object/physx_eula.html 1. Definitions. “Licensed Platforms” means the following: - Any PC or Apple Mac computer with a NVIDIA CUDA-enabled processor executing NVIDIA PhysX; - Any PC or Apple Mac computer running NVIDIA PhysX software executing on the primary central processing unit of the PC only; - Any PC utilizing an AGEIA PhysX processor executing NVIDIA PhysX code; - Microsoft XBOX 360; - Nintendo Wii; and/or - Sony Playstation 3 “Physics Application” means a software application designed for use and fully compatible with the PhysX SDK and/or NVIDIA Graphics processor products, including but not limited to, a video game, visual simulation, movie, or other product.
    [Show full text]
  • Multiprocessing Contents
    Multiprocessing Contents 1 Multiprocessing 1 1.1 Pre-history .............................................. 1 1.2 Key topics ............................................... 1 1.2.1 Processor symmetry ...................................... 1 1.2.2 Instruction and data streams ................................. 1 1.2.3 Processor coupling ...................................... 2 1.2.4 Multiprocessor Communication Architecture ......................... 2 1.3 Flynn’s taxonomy ........................................... 2 1.3.1 SISD multiprocessing ..................................... 2 1.3.2 SIMD multiprocessing .................................... 2 1.3.3 MISD multiprocessing .................................... 3 1.3.4 MIMD multiprocessing .................................... 3 1.4 See also ................................................ 3 1.5 References ............................................... 3 2 Computer multitasking 5 2.1 Multiprogramming .......................................... 5 2.2 Cooperative multitasking ....................................... 6 2.3 Preemptive multitasking ....................................... 6 2.4 Real time ............................................... 7 2.5 Multithreading ............................................ 7 2.6 Memory protection .......................................... 7 2.7 Memory swapping .......................................... 7 2.8 Programming ............................................. 7 2.9 See also ................................................ 8 2.10 References .............................................
    [Show full text]
  • Nvidia Corporation 2016 Annual Report
    2017 NVIDIA CORPORATION ANNUAL REVIEW NOTICE OF ANNUAL MEETING PROXY STATEMENT FORM 10-K THE AGE OF THE GPU IS UPON US THE NEXT PLATFORM A decade ago, we set out to transform the GPU into a powerful computing platform—a specialized tool for the da Vincis and Einsteins of our time. GPU computing has since opened a floodgate of innovation. From gaming and VR to AI and self-driving cars, we’re at the center of the most promising trends in our lifetime. The world has taken notice. Jensen Huang CEO and Founder, NVIDIA NVIDIA GEFORCE HAS MOVED FROM GRAPHICS CARD TO GAMING PLATFORM FORBES The PC drives the growth of computer gaming, the largest entertainment industry in the world. The mass popularity of eSports, the evolution of gaming into a social medium, and the advance of new technologies like 4K, HDR, and VR will fuel its further growth. Today, 200 million gamers play on GeForce, the world’s largest gaming platform. Our breakthrough NVIDIA Pascal architecture delighted gamers, and we built on its foundation with new capabilities, like NVIDIA Ansel, the most advanced in-game photography system ever built. To serve the 1 billion new or infrequent gamers whose PCs are not ready for today’s games, we brought the GeForce NOW game streaming service to Macs and PCs. Mass Effect: Andromeda. Courtesy of Electronic Arts. THE BEST ANDROID TV DEVICE JUST GOT BETTER ENGADGET NVIDIA SHIELD TV, controller, and remote. NVIDIA SHIELD is taking NVIDIA gaming and AI into living rooms around the world. The most advanced streamer now boasts 1,000 games, has the largest open catalog of media in 4K, and can serve as the brain of the AI home.
    [Show full text]
  • Graphics Hardware
    Department of Computer Engineering Graphics Hardware Ulf Assarsson Graphics hardware – why? l About 100x faster! l Another reason: about 100x faster! l Simple to pipeline and parallelize l Current hardware based on triangle rasterization with programmable shading (e.g., OpenGL acceleration) l Ray tracing: there are research architetures, and few commercial products – Renderdrive, RPU, (Gelato), NVIDIA OptiX 2 3 Perspective-correct texturing l How is texture coordinates interpolated over a triangle? l Linearly? Linear interpolation Perspective-correct interpolation l Perspective-correct interpolation gives foreshortening effect! l Hardware does this for you, but you need to understand this anyway! 4 5 Recall the following l Before projection, v, and after p (p=Mv) l After projection pw is not 1! l Homogenization: (px /pw , py /pw , pz /pw , 1) l Gives (px´, py ´ , pz´ , 1) 6 Texture coordinate interpolation l Linear interpolation does not work l Rational linear interpolation does: – u(x)=(ax+b) / (cx+d) (along a scanline where y=constant) – a,b,c,d are computed from triangle’s vertices (x,y,z,w,u,v) l Not really efficient l Smarter: – Compute (u/w,v/w,1/w) per vertex – These quantities can be linearly interpolated! – Then at each pixel, compute 1/(1/w)=w – And obtain: (w*u/w,w*v/w)=(u,v) – The (u,v) are perspectively-correct interpolated l Need to interpolate shading this way too – Though, not as annoying as textures l Since linear interpolation now is OK, compute, e.g., Δ(u/w)/ Δx, and use this to update u/w when stepping in the x- direction (similarly for other parameters) 7 Put differently: l Linear interpolation in screen space does not work for u,v l Solution: – We have applied a non-linear transform to each vertex (x/w, y/w, z/w).
    [Show full text]
  • AMD Pcie® ADD-IN BOARD
    NVIDIA GT610 PCIe® for passive ADD-IN BOARD Datasheet AEGX-N3A1-01FSS1 CONTENTS 1. Feature .................................................................................................................. 3 2. Functional Overview ............................................................................................. 4 2.1. GPU Block diagram .................................................................................... 4 2.2. GPU ............................................................................................................ 4 2.3. Memory Interface ..................................................................................... 5 2.4. Features and Technologies ....................................................................... 5 2.5. Display Support ......................................................................................... 5 2.6. Digital Audio .............................................................................................. 5 2.7. Video ......................................................................................................... 6 3. PIN Assignment and Description………………………………………………………………………..7 3.1. DVI-I Connector Pinout ............................................................................ .7 3.2. HDMI 1.4a Connector Pinout ................................................................... .8 3.3. VGA Connector Pinout ............................................................................. .9 3.4. VGA Header Pinout ................................................................................
    [Show full text]
  • Inside Chips
    InsideChips.VenturesTM Tracking Fabless, IP & Design-House Startups Volume 6, Number 5 May 2005 Business Microscope Semiconductor Forecast ... Each year, analyzing capital spending and fab capacity trends, InsideChips bestows a Semiconductor Forecast researching ASIC markets and technologies, and Award to the best analyst forecaster after actual following emerging markets for ICs such as cellular results are posted by the SIA. The purpose of the phones. Feldhan is the founder of SEMICO, one of program is to identify and recognize top analysts the several market research boutiques that spun out whose methodology and experience result in of In-Stat during the 1980s. SEMICO is a consistently accurate forecasting. semiconductor-centric market research firm whose Based on forecasts made in Q4 2003, the award services include consulting, reports, events and for the best semiconductor forecast made by a market newsletters. research firm for 2004 is shared by Bill McClean of The most conservative forecasts for 2004 were IC Insights and Jim Feldhan of SEMICO Research. Both analysts 17%, 18% and 19% made by iSupply, IDC, and SIA, respectively. forecasted +27% growth for 2004, which came in very close to VLSI Research was most optimistic at 32%. The semiconductor the actual +28% growth over 2003 as announced by the forecasts for 2004 average 25.6%. Semiconductor Industry Association. McClean also won This year, the industry has cooled from last year’s double- InsideChips’ forecast award for his semiconductor predictions digit pace. The average forecast compiled from all analysts is a for 2002. modest 1.4%. Prior to forming IC Insights, McClean worked at ICE Readers can view the results of last year’s forecasts on Corporation (acquired by Chipworks).
    [Show full text]
  • Introduction
    A D V A N C E D C O M P U T E R G R A P H I C S Introduction (Thanks to Professions Andries van Dam and John Hughes) CMSC 635 January 15, 2013 Introduction ‹#›/16 A D V A N C E D C O M P U T E R G R A P H I C S What is Computer Graphics? • Computer graphics generally means creation, storage and manipulation of models and images • Such models come from diverse and expanding set of fields including physical, mathematical, artistic, biological, and even conceptual (abstract) structures Frame from animation by William Latham, shown at SIGGRAPH 1992. Latham uses rules that govern patterns of natural forms to create his artwork. CMSC 635 January 15, 2013 Introduction ‹#›/16 A D V A N C E D C O M P U T E R G R A P H I C S Modeling vs. Rendering • Modeling – Create models – Apply materials to models – Place models around scene – Place lights in scene – Place the camera • Rendering – Take “picture” with camera • Both can be done by modern commercial software: – Autodesk MayaTM ,3D Studio MaxTM, BlenderTM, etc. Point Light Spot Light Directional Light Ambient Light CS128 lighting assignment by Patrick Doran, Spring 2009 CMSC 635 January 15, 2013 Introduction ‹#›/16 A D V A N C E D C O M P U T E R G R A P H I C S Solved problem: Rendering Avatar 2009 CMSC 635 January 15, 2013 Introduction ‹#›/16 A D V A N C E D C O M P U T E R G R A P H I C S What is Interactive Computer Graphics? (1/2) • User controls contents, structure, and appearance of objects and their displayed images via rapid visual feedback • Basic components of an interactive graphics system – input (e.g., mouse, tablet and stylus, multi-touch…) – processing (and storage) – display/output (e.g., screen, paper-based printer, video recorder…) • First truly interactive graphics system, Sketchpad, pioneered at MIT by Ivan Sutherland for his 1963 Ph.D.
    [Show full text]
  • Multicore: Why Is It Happening Now? Eller Hur Mår Moore’S Lag?
    Multicore: Why is it happening now? eller Hur Mår Moore’s Lag? Erik Hagersten Uppsala Universitet Are we hitting the wall now? Pop: Can the transistors be made even smaller and faster? Performance Possible path, [log] but requires a paradigm shift 1000 ? Business as 100 ” w usual… La s re o Well uhm … 10 o M “ The transistors can p: Po be made smaller and PDC 1 Summer faster, but there are School other problems / 2010 Institutionen för informationsteknologi | www.it.uu.seNowMC 2 ©Erik Hagersten| user.it.uu.se/~ehYear Darling, I shrunk the computer Mainframes Super Minis: Microprocessor: Mem Multicore: Many CPUs on a Mem PDC Summer chip! School 2010 Institutionen för informationsteknologi | www.it.uu.se MC 3 ©Erik Hagersten| user.it.uu.se/~eh Multi-core CPUs: Ageia PhysX, a multi-core physics processing unit. Ambric Am2045, a 336-core Massively Parallel Processor Array (MPPA) AMD Athlon 64, Athlon 64 FX and Athlon 64 X2 family, dual-core desktop processors. Opteron, dual- and quad-core server/workstation processors. Phenom, triple- and quad-core desktop processors. Sempron X2, dual-core entry level processors. Turion 64 X2, dual-core laptop processors. Radeon and FireStream multi-core GPU/GPGPU (10 cores, 16 5-issue wide superscalar stream processors per core) ARM MPCore is a fully synthesizable multicore container for ARM9 and ARM11 processor cores, intended for high-performance embedded and entertainment applications. Azul Systems Vega 2, a 48-core processor. Broadcom SiByte SB1250, SB1255 and SB1455. Cradle Technologies CT3400 and CT3600, both multi-core DSPs. Cavium Networks Octeon, a 16-core MIPS MPU.
    [Show full text]
  • Datasheet GFX-N3A3-V5LMS1
    NVIDIA GT630 D3 1024 VHDCI to 4 HDMI PCIe® ADD-IN BOARD Datasheet GFX-N3A3-V5LMS1 CONTENTS 1. Feature .................................................................................................................. 3 2. Functional Overview ............................................................................................. 4 2.1. GPU Block diagram .................................................................................... 4 2.2. Key Features .............................................................................................. 4 2.3. Memory ..................................................................................................... 5 2.4. Features and Technologies ....................................................................... 5 2.5. Display Support ......................................................................................... 5 2.6. Video ......................................................................................................... 5 2.7. Bus Support Features ................................................................................ 6 3. PIN Assignment and Description………………………………………………………………………..6 3.1. HDMI Connector Pinout ........................................................................... .6 4. Power Specifications ............................................................................................. 7 5. Thermal Specifications .......................................................................................... 7 6. Output configuration and
    [Show full text]
  • Gputerasort: High Performance Graphics Co-Processor Sorting For
    GPUTeraSort: High Performance Graphics Co­processor Sorting for Large Database Management Naga K. Govindaraju ∗ Jim Gray † Ritesh Kumar ∗ Dinesh Manocha ∗ naga,ritesh,dm @cs.unc.edu, [email protected] { } http://gamma.cs.unc.edu/GPUTERASORT ABSTRACT advanced memory units, and multi-processors to improve sorting performance. The current Indy PennySort record We present a new algorithm, GPUTeraSort, to sort billion- 1 record wide-key databases using a graphics processing unit benchmark , sorts a 40 GB database in 1541 seconds on a (GPU) Our algorithm uses the data and task parallelism $614 Linux/AMD system. on the GPU to perform memory-intensive and compute- However, current external memory sort performance is intensive tasks while the CPU is used to perform I/O and limited by the traditional Von Neumann style architecture resource management. We therefore exploit both the high- of the CPU. Computer architects use data caches to amelio- bandwidth GPU memory interface and the lower-bandwidth rate the CPU and the main memory bottleneck; but, CPU- CPU main memory interface and achieve higher memory based sorting algorithms incur significant cache misses on bandwidth than purely CPU-based algorithms. GPUTera- large datasets. Sort is a two-phase task pipeline: (1) read disk, build keys, This article shows how to use a commodity graphics pro- sort using the GPU, generate runs, write disk, and (2) read, cessing unit (GPU) as a co-processor to sort large datasets. merge, write. It also pipelines disk transfers and achieves GPUs are programmable parallel architectures designed for near-peak I/O performance. We have tested the perfor- real-time rasterization of geometric primitives - but they mance of GPUTeraSort on billion-record files using the stan- are also highly parallel vector co-processors.
    [Show full text]