Transformations a Quarterly Publication From

issue 03 // fall-winter 07 Transformations A quarterly publication from Q&A with Marv Burkett The Future of 3D Graphics FEATURES Solving the World's Hardest Problems Welcome to Bollywood NVIDIA will celebrate its 15th year anniversary in February. As I reflect on the accomplishments and A breakthroughs that we’ve made during this time, a characteristic of NVIDIA that stands out is our ability to WORD innovate at an absolutely relentless pace, quarter after quarter, year after year. This passion for pushing the boundaries of technology and imagination drives all of our businesses—from consumer products to FROM professional solutions—and positions us firmly as a leader in the visual computing revolution. Developers, scientists, engineers, and researchers around the world are finding new and exciting ways to leverage MIKE the GPU across a wide range of areas, from film making to medical diagnosis to oil & gas exploration. From our vantage point, we see consumers with an insatiable appetite for as much processing power as we can offer, making NVIDIA a true “friend of Moore’s Law.” As I contemplate the potential of the GPU to transform industries and solve the world’s most difficult problems, I am excited to think about what the next 15 years will hold not only for NVIDIA but for our entire worldwide ecosystem of partners, developers, and consumers. Michael W. Hara VICE PRESIDENT INVESTOR RELATIONS AND COMMUNICATIONS ISSUE THREE // FALL-WINTER 07 // NVIDIA CORPORATION // WWW.NVIDIA.COM TRANSFORMATIONS // Q&A with Marv Burkett GPUs are one of the few areas that can Q&A efficiently use the doubling of transistors. with Marv Burkett CFO, NVIDIA CORPORATion Talk about the relationship between Does NVIDIA view Moore’s Law as NVIDIA and the investment community. a friend or foe? Please explain. The second change is the disappearance of the IDMs Describe some of the changes MB: I believe investors have a much better understanding (Integrated Device Manufacturers). In the 70's and MB: Moore's Law has lived longer than most expected. the semiconductor industry has about NVIDIA than they did a few years ago. Previously 80's most semiconductor companies were IDMs, It's over 40 years old and is still alive and well. At this there was the perception that graphics suppliers traded undergone over the past decade. meaning they had their own fabs. In the 80's we point, each doubling of the # of possible transistors position every cycle. This wasn't true, but it was a saw the start of the fabless business model and the creates significant challenges and opportunities. We common perception with investors. Now they understand MB: I see two significant changes in the industry emergence of pure foundries. The prohibitive cost of are at the point where each successive generation of that with few exceptions, NVIDIA has been and continues that are evident today. The first is the entrance of fabs, coupled with the skyrocketing cost of process technology adds not thousands or even millions of to be the technology leader. private equity into the semiconductor industry. The development, has forced many IDMs to abandon the possible transistors, but now a new generation could add acquisition of On Semiconductor, Freescale, and Phillips fab strategy. The announcement by TI that they were a billion transistors or more. The industries or companies The other perception that has changed is the inherent Semiconductor, all by private equity groups, signals a no longer going to develop process technology was a that can use a billion more transistors are dwindling. gross margins in our business. For a while, when we change in the perception of the business. Private equity sea change in the industry. In the U.S. that leaves only Memories can always use more density, so Moore's were improving gross margins, there was the perception has been around for a long time, but until recently they Intel and IBM as leading-edge process developers Law would be their friend. CPUs may be able to use that soon gross margins would fall and return to the have been unwilling to invest in the semiconductor and will eventually leave only Intel as an IDM. The the additional transistors, but can they do it efficiently? old levels. It is only with our continued progress and business. The primary reason for that is that in the past, emergence of companies like TSMC with significant That is, is a dual core processor twice as powerful as a an understanding of inherent causes of the change, semiconductor businesses were consumers of cash, leading edge fab capacity has not only allowed this single core? Is a quad core four times as powerful and that they have changed their minds. Now they believe not cash generators. If private equity uses significant change, but forced it, with very cost effective capacity. useful as a single core? Probably not. Therefore I think we have fundamentally changed the business model debt to finance their purchases, then they are unwilling There is no longer an advantage to owning your fab. it can be argued that Moore's Law is not the friend of to invest in cash consumers because they need to be for revenue per wafer, which changes their view of CPU companies. GPUs are one of the few areas that can able to service the debt. That has changed in the last our inherent gross margins. Investors are interested in efficiently use the doubling of transistors. This means ten years. Now most semiconductor companies are earnings growth. What excites them about NVIDIA is that with a doubling of transistors, GPU designers can generating cash because they are outsourcing the fab that we have the potential for both a.) revenue growth, double the performance. How long this will go on, we and/or sharing R&D costs. This attracts private equity which can lead to earnings growth, and b.) expansion of don't know, but for the foreseeable future, GPUs have and impacts the valuations of semiconductor companies. gross margins, which will also lead to earnings growth. ways of doubling the performance with successive generations. So certainly Moore's Law is NVIDIA’s friend. TRANSFORMATIONS // NVIDIA CORPORATION // WWW.NVIDIA.COM TRANSFORMATIONS // NVIDIA CORPORATION // WWW.NVIDIA.COM PROFESSIONAL TRANSFORMATIONS // The Future of 3D Graphics SOLUTIONS GPUs have evolved far beyond simply implementing a fixed function graphics The Future of 3D Graphics pipeline to becoming flexible, program- Spotlight on GPU Computing and Ray Tracing THE FIRST IN A SERIES OF ARTicLES AboUT 3D GRAPHicS BY Nvidia'S CHIEF SciENTIST, david KIRK mable, massively parallel computers. In recent years we’ve seen mapping units. We never looked tremendous interest in back—the current GeForce 8800 chip has 128 processor cores. And, utilizing the immense parallel not only does it have 128 processor processing power of GPUs cores, but each core can run many for uses beyond classic 3D threads, or program copies, at a graphics processing. GPUs time. The GeForce 8800 processes have evolved far beyond over 12,000 threads at once—each thread processing pixels, vertices, simply implementing a fixed or triangles! Imagine achieving that function graphics pipeline kind of parallelism and throughput to becoming flexible, FIGURE 1 FIGURE 2 with dual-or quad-core CPUs. It’s programmable, massively GPUs Lead Evolution to desire to use the power of the at ever higher clock rates. However, model aided by graphics APIs and just not possible. But, that’s not Many-Core Processing GPU for broader applications it’s quite easy to add multiple CPU the C programming language is all. In addition to the 12,000+ pixel parallel computers. Similarly, The programmable and flexible than just graphics. More recently, cores to a single chip—but that’s straightforward and easy to use. or vertex threads, there are many 3D graphics has evolved modern GPU is one of the most this broader effort, which we call where the simplicity ends. It is Although applying GPUs to a variety thousands of other concurrent to encompass many powerful computing devices on GPU Computing, has been made difficult for programmers to grasp of parallel computing tasks is a operations being processed by the forms of visual computing the planet. Since the year 2000, easier by the introduction of how to program multi-core CPUs natural evolution, trying to process GPU. Texture map calculations, applications. GPUs are now the individual processing cores NVIDIA’s CUDA (Compute Unified effectively. Also, for the first time in demonstrably parallel graphics rasterization, Z-buffer hidden- within GPUs have processed data Device Architecture) programming several decades, programmers can workloads with multi-core CPUs is surface-removal, color blending considered “computational using IEEE floating-point precision, environment. CUDA allows GPUs to no longer simply wait 18-24 months inherently challenging—because for transparency, and anti-aliasing graphics” engines, as just like standard CPUs (aka “real be programmed using the C language for their single-threaded programs simply grouping together many CPUs (edge smoothing) are all happening many of the fixed function computers”). The raw floating-point for non-graphics applications. to double in speed as processor will not produce an integrated parallel simultaneously. Without the special- parts of the graphics processing power of a modern GPU clock speeds increase. An industry processor. A GPU consists of many purpose hardware included in every pipeline have become is much larger and growing faster All processors evolve and change wide effort to “refactor” algorithms parallel processor cores integrated to GPU to perform these operations, than even the latest multi-core with time, not just GPUs. We are to run on multi-core CPUs is taking work together from the ground up.

Transformations a Quarterly Publication From

Conservation Cores: Reducing the Energy of Mature Computations

Nvidia® Gelato™ 1.0 Hardware

FCM 61 Italiano

The Utilization Wall

3D Computer Graphics Compiled By: H

VMD User's Guide

C 2009 Aqeel A. Mahesri TRADEOFFS in DESIGNING MASSIVELY PARALLEL ACCELERATOR ARCHITECTURES

Enabling Compute-Communication Overlap in Distributed Deep

Spatial Data Structures, Sorting and GPU Parallelism for Situated-Agent Simulation and Visualisation

July/August 2021

Graphics Hardware

Gelato Pro 2.0 and Gelato 2.0 Gpu-Accelerated Final-Frame Renderer