AMD最新テクノロジーアップデート -HPCへの取り組み- 日本AMD株式会社 マーケティング&ビジネス開発本部 エンタープライズプロダクトマーケティング部 部長 山野 洋幸 AMD’s HPC Product Portfolio Energy efficient CPU and discrete GPU processors focused on addressing the most demanding HPC workloads

Multi-core Processors • Outstanding Performance • Superior Scalability • Enhanced Power Efficiency

Professional Graphics • 3D Accelerators For Visualization • See More and Do More with Your Data

ATI Stream Computing • GPU Optimized For Computation • Massive Data-parallel Processing • High Performance Per Watt

2 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 AMD’s HPC Product Portfolio Energy efficient CPU and discrete GPU processors focused on addressing the most demanding HPC workloads

Multi-core x86 Processors • Outstanding Performance • Superior Scalability • Enhanced Power Efficiency

Professional Graphics • 3D Accelerators For Visualization • See More and Do More with Your Data

ATI Stream Computing • GPU Optimized For Computation • Massive Data-parallel Processing • High Performance Per Watt

3 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Planned Server Platform Roadmap 2006 2007 2008 2009 2010 2011 “Maranello” with AMD SR56x0 and SP5100 Magny-Cours New Architecture Six-Core AMD ™ Processor with AMD

way - (1207) with AMD SR56x0 and SP5100

Shanghai/Istanbul

Platform 2/4 Enterprise Enterprise “Socket F (1207)” Socket F(1207) with Nvidia and Broadcom Rev F Barcelona Shanghai Istanbul “San Marino” +AMD SR56x0 and SP5100

Lisbon New Architecture way

- “Adelaide” Platform 1/2 Socket C32 EE+AMD SR5650 and SP5100

Lisbon New Architecture Power Efficient Efficient Power “Buenos Aires” Socket AM3 with AMD SR56x0 and SP5100

Suzuka

way -

1 “Socket AM2+” Platform Socket AM2+ with Nvidia and Broadcom chipsets 4 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009 Rev F Budapest Suzuka For more information be sure to visit AMD at SC’09 booth #1417 Planned Server Platform Roadmap 2006 2007 2008 2009 2010 2011 “Maranello” Socket G34 with AMD SR56x0 and SP5100 Magny-Cours New Architecture Six-Core AMD Opteron™ Processor with AMD

way Chipset - Socket F(1207) with AMD SR56x0 and SP5100

Shanghai/Istanbul

Platform 2/4 Enterprise Enterprise “Socket F (1207)” Socket F(1207) with Nvidia and Broadcom chipsets Rev F Barcelona Shanghai Istanbul “San Marino” Socket C32+AMD SR56x0 and SP5100

Lisbon New Architecture way

- “Adelaide” Platform 1/2 Socket C32 EE+AMD SR5650 and SP5100

Lisbon New Architecture Power Efficient Efficient Power “Buenos Aires” Socket AM3 with AMD SR56x0 and SP5100

Suzuka

way -

1 “Socket AM2+” Platform Socket AM2+ with Nvidia and Broadcom chipsets 5 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009 Rev F Budapest Suzuka For more information be sure to visit AMD at SC’09 booth #1417 Six-Core AMD Opteron™ Processor

• Six true cores • New HyperTransport™ technology HT Assist • Increased HyperTransport 3 technology (HT3) bandwidth • Higher performing Integrated Memory Controller • Same power/thermal ranges as Quad-Core AMD Opteron™ processor • Up to 50% higher performance than Quad-Core AMD Opteron™ processor-based servers at the same processor ACP1

Model Cores Freq NB Wattage L2 Cache L3 Cache Production 8435 2.6GHz 8431 2.4GHz 2435 6 2.6GHz 2.2GHz 75W 512K/core 6MB May 2431 2.4GHz 2427 2.2GHz 8425 HE 2.1 GHz 2425 HE 6 2.1 GHz 2.2 GHz 55W 512K/core 6MB July 3rd 2423 HE 2.0 GHz 8439 SE 2.8 GHz 6 2.2 GHz 105W 512K/core 6MB July 24th 2439 SE 2.8 GHz 2419EE 6 1.8GHz 2.0GHz 40W 512K/core 6MB September

6 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009 *Based on SPECint® 2006 results published as of 11/09/09. See backup slides or configuration and performance information. For more information be sure to visit AMD at SC’09 booth #1417 Six-Core AMD RDDR-2 RDDR-2 Opteron™ Processor Up to 8 DIMMs Up to 8 DIMMs

HT-1

DIMM DIMM DIMM DIMM

DIMM DIMM DIMM Performance “Istanbul” HT-3 “Istanbul” DIMM • Six-Core AMD Opteron™ Processor 6M Shared L3 Cache

North Bridge enhancements (PF + prefetch)

DIMM DIMM DIMM DIMM

DIMM DIMM DIMM 45nm Process Technology DIMM • DDR2-800 Memory HT-1 HT-1 • HyperTransport-3 @ 4.8 GT/sec HT-3 HT-3

HT-1

DIMM DIMM DIMM DIMM

DIMM DIMM DIMM Reliability/Availability “Istanbul” HT-3 “Istanbul” DIMM • L3 Cache Index Disable

• HyperTransport Retry (HT-3 Mode)

DIMM DIMM DIMM DIMM

DIMM DIMM DIMM • x8 ECC (Supports x4 Chipkill in unganged mode) DIMM HT-1 HT-1 Virtualization HT-3 HT-3 • AMD-V™ with Rapid Virtualization Indexing PCI-e PCI-e or I/O Bridge or Manageability PCI-X PCI-X • APML Management Link*

Scalability USB • 48-bit Physical Addressing (256TB) South SATA SE: 2.8GHz • HT Assist (Cache Probe Filter) PCI Bridge Std: 2.6GHz LPC PATA HE: 2.1GHz Continued Platform Compatibility EE: 1.8GHz • Nvidia/Broadcom-based F/1207 platforms 4 socket example block diagram

*APML-enabled platform support required. 7 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Six-Core AMD Opteron™ RDDR-2 RDDR-2 Processor Up to 8 DIMMs Up to 8 DIMMs

Performance HT-1

DIMM DIMM DIMM DIMM

DIMM DIMM DIMM “Istanbul” “Istanbul” DIMM • Six-Core AMD Opteron™ Processor HT-3 6M Shared L3 Cache

North Bridge enhancements (PF + prefetch)

DIMM DIMM DIMM DIMM

DIMM DIMM DIMM 45nm Process Technology DIMM • DDR2-800 Memory HT-1 HT-1 • HyperTransport-3 @ 4.8 GT/sec HT-3 HT-3

HT-1

DIMM DIMM DIMM DIMM

DIMM DIMM DIMM Reliability/Availability “Istanbul” HT-3 “Istanbul” DIMM • L3 Cache Index Disable

• HyperTransport Retry (HT-3 Mode)

DIMM DIMM DIMM DIMM

DIMM DIMM DIMM • x8 ECC (Supports x4 Chipkill in unganged mode) DIMM HT-1 HT-1 Virtualization HT-3 HT-3 • AMD-V™ with Rapid Virtualization Indexing 4 socket example block diagram

Manageability STREAM Bandwidth • APML Management Link* (GB/s)* Scalability Product, Freq, Dram 2S 4S 8S • 48-bit Physical Addressing (256TB) • HT Assist (Cache Probe Filter) “Barcelona,” 2.3/2.0, RDDR2-667 17.2 20.5 “Shanghai,” 2.7/2.2, RDDR2-800 21.4 24 Continued Platform Compatibility • Nvidia/Broadcom-based F/1207 platforms “Istanbul,” 2.4/2.2, RDDR2-800 22 42 81.5

*Based on measurements at AMD performance labs. See backup slide for configuration information. 8 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 HT Assist Improves Two-Socket to Four-Socket Server Performance Scaling

200% 200% STREAM (GB/s) STREAM (GB/s) +100% +93% +96% 180% SPECfp®_rate2006 180% SPECfp®_rate2006 +82% SPECint®_rate2006 +80% SPECint®_rate2006 160% 160% Up to 82% higher Up to 2x 140% 140% performance performance 120% 120% +19% 100% 100%

80% 80%

RelativePerformance RelativePerformance 60% 60%

40% 40%

20% 20%

0% 0% Quad-Core AMD Opteron™ processor Quad-Core AMD Opteron™ processor Six-Core AMD Opteron™ processor Six-Core AMD Opteron™ processor ("Shanghai") ("Shanghai") ("Istanbul") ("Istanbul") Model 2384 (2.7GHz) (2P) Model 8384 (2.7GHz) (4P) Model 2435 (2.6GHz) (2P) Model 8435 (2.6GHz) (4P)

SPEC, SPECint, and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. The SPECint_rate and SPECfp_rate results for Six-Core AMD OpteronTM processors are based upon data submitted to Standard Performance Evaluation Corporation as of May 12, 2009. The other SPECint_rate and SPECfp_rate results stated above reflect results published on http://www.spec.org/ as of May 12, 2009. The STREAM results are based on measurements at AMD performance labs as of May 12, 2009. The SPECint_rate and SPECfp_rate comparisons presented above are based on the best performing two-socket and four-socket servers using AMD Opteron™ processor Models 2384, 8384, 2435, and 8435. For the latest results, visit www.spec.org. Please see backup slides for configuration information.

9 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 2 Socket HPL Performance High-Performance Linpack - Two Socket Servers

Six-Core AMD Opteron™ 104 Processor ("Istanbul") (139%) Model 2435 (2.6GHz)

Intel Xeon Processor 75 ("Gainestown") Model (100%) X5570 (2.93GHz)

0 20 40 60 80 100 120 HPL Gflops*

*Based on AMD internal performance lab measurements as of June 15th 2009 – see backup slides for configuration

10 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Power Efficient Innovations

AMD PowerNow!™ Technology with Independent Dynamic Core Technology Allows processors and cores to dynamically operate at lower power AMD Smart Fetch and frequencies, depending on usage and workload to help reduce Technology TCO and to lower power consumption in the datacenter Can reduce power consumption by allowing idle cores to enter a “halt” state Enhanced Performance-per-watt 50% more compute cores vs. quad-core within the same power envelope*

AMD PowerCap AMD CoolCore™ Manager Technology Allows IT datacenter Can reduce processor managers to set a energy consumption by fixed limit on a dynamically turning off server's processor sections of the processor power consumption when inactive

Extends to the L3 Cache

Dual Dynamic Power Management™ Enables more granular power management capabilities to reduce processor energy consumption. Separate power planes for cores and memory controller

11 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009 *Compared to Quad-Core AMD Opteron processor codenamed “Shanghai.” For more information be sure to visit AMD at SC’09 booth #1417 Significantly Higher Performance in Same Power Envelope

Two-socket servers using Six-Core AMD Opteron™ processors significantly outperform two-socket servers using Dual-Core and Quad-Core AMD Opteron™ processors without consuming significantly more power

700% SPECfp®_rate2006 600% SPECint®_rate2006 SPECjbb®2005 500% Server Idle Power

400% 3x core count >3x performance 300% Same power envelope

200% Relative Performance Relative

100%

0% Dual-Core Quad-Core Six-Core AMD Opteron™ processor AMD Opteron™ processor ("Shanghai") AMD Opteron™ processor ("Istanbul") Model 2218 (2.6GHz) Model 2382 (2.6GHz) Model 2435 (2.6GHz)

SPEC, SPECfp, SPECint, and SPECjbb are registered trademarks of the Standard Performance Evaluation Corporation. The performance results for Six-Core AMD Opteron™ processor Model 2435 and the SPECjbb result for Quad-Core AMD Opteron™ processor Model 2382 are based upon data submitted to Standard Performance Evaluation Corporation as of May 21, 2009. The other performance results stated below reflect results published on http://www.spec.org/ as of May 21, 2009. The server idle power results are based on measurements of server active idle power for 60 seconds at AMD performance labs as of May 21, 2009. The comparison presented below is based on the best performing two-socket servers using AMD Opteron™ processor Models 2218, 2382, and 2435. For the latest results, visit http://www.spec.org/. Please see backup slides for configuration information.

12 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Six-Core AMD Opteron™ EE Processor

- Only Six-Core 40W ACP for 2P servers

- Targeted at energy-efficient servers and workloads such as cloud computing, Web 2.0 applications, and dense computing environments

- Low power without compromise; Built with AMD Virtualization™ (AMD-V™) technology and AMD-P

- Up to 31% higher performance-per-watt than Quad Core AMD Opteron™ 75W processor*

- Up to 33% higher performance over previous Quad-Core AMD Opteron™ EE processor**

*Compared to 75W ACP Quad-Core AMD Opteron™ processor. See backup slides for configuration information. **Compared to Quad-Core AMD Opteron™ EE processor. See backup slides for configuration information.

13 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Low Power without compromise

Six-Core AMD Opteron™ Six-Core AMD

HE Opteron™ EE

100%

75%

50%

™ ™ technology speed

25%

P P

V™ V™

- -

- -

AMD AMD

Maximum Memory Speed technology speed HyperTransport™

Maximum Memory Speed HyperTransport

AMD AMD

Cache Cache

Consistent Feature Set Inconsistent Feature Set

Intel specifications as of 5/15/09: http://www.intel.com/Assets/PDF/prodbrief/xeon-5500.pdf 14 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Launched: Six-Core AMD Opteron™ Processor with AMD Chipset System architecture

Unified server platforms with AMD on AMD – benefits of processor + chipset Enhanced platforms with chipset choices and key technologies . Ultra power efficient platform designed for cloud and dense deployments (using AMD Opteron™ EE processor and S5650 chipset) . First 4P x86 Platform with PCI Express® 2.0 for HPC SR56x0 SR56x0 I/O Hub I/O Hub applications AMD-V™ AMD-V™ . Three chipset solutions to optimize workloads Extends AMD Virtualization™ (AMD-V™) technology and SP5100 South AMD-P suite of features Bridge Wide availability from ODMs; OEMs are expected to introduce new chipsets as part of AMD’s 2010 processor lineup

15 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Benefits of Six-Core AMD Opteron™ processor with AMD Chipset

Choice of Consistent Key Chipsets Feature Set Technologies

® . AMD-P . PCI Express 2.0 SR5690 SR5670 SR5650 . AMD-V™ . HyperTransport™ 3.0 Technology

Benefits

Range and flexibility Exceptional of chipsets Virtualization Increased (from high bandwidth to capabilities performance and power efficient) throughput (up to 2x to meet the demands Only 4P x86 solution for PCI Express® with Tagged TLB, RVI of your workloads 2.0 vs. PCI and I/O Virtualization Express® 1.0) Power efficiency beyond the processor

16 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 HPC Scalability - Think Outside The Box Scale Up: More processors, more cores & improved scalability (HT Assist) Scale Out: Multiple servers networked together into a large powerful computer

AMD’s new SR56x0 chipset helps ensure platform level performance innovations can be realized at scale

HyperTransport™ 3.0 technology and PCI-E® 2.0 provide the low latency and high bandwidth necessary to take full advantage of high performance interconnects such as QDR InfiniBand

17 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 HPC Scalability - Think Outside The Box

AMD SR56x0 chipset with QDR IB delivers up to 22% reduced latency and 89% increased bandwidth over prior generation chipsets with DDR IB*

Unidirectional Bidirectional IB test Latency (usec) Bandwidth Bandwidth (MB/sec) (MB/sec) Affinity C0-c0 C4-c4 C0-c0 C4-c4 C0-c0 C4-c4 PCIe® Gen1 + 1.17 1.21 1549 1542 2918 2900 HT1 + IB DDR PCIe® Gen2 + 0.91 0.96 2773 2767 5486 5495 HT3 + IB QDR % diff -22% -20% 79% 79% 88% 89%

* Based on AMD internal performance lab measurements 18 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 HPC University Implementation with PRACE Challenge Solution

Collaborative effort between Supermicro UK office Customer: Swedish National Infrastructure for and AMD Computing (SNIC), Royal Institute of Technology AMD product: Six-Core AMD Opteron™ HE (KTH), Sweden, jointly with the Partnership for processor with new AMD SR5670 chipset using Advanced Computing in Europe (PRACE) HyperTransport™ 3 technology and HT assist

Need: A general purpose HPC cluster for energy Infrastructure: efficiency and compute density using standard . Supermicro Highly dense 7U enclosure components (with 10x 4P Blade Model) BHQIE . QDR Switch incorporated inside the Buying Criteria: blade enclosure . Superior performance/watt/density and ability . Management per node including to control energy consumption depending on KVM/IP workload and performance based on flop/s and . Redundant PSU’s capable of operating GB/s at ultra high power efficiency . x86 Platform . Latest technology chipset supporting PCI Quantity: 720 processors; 4320 cores Express® 2.0

19 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 AMD Developer Central – developer.amd.com // Code Faster, Faster Code

“Istanbul” Zone What’s in it for software developers? - Find out at developer.amd.com

Visit the developer site regularly for updated information and practical guidance on how to take advantage of all the new features in the Quad- Core AMD Opteron™ processor, and Six-Core AMD Opteron™ processor.

» Software Development Tools and Resources » Overview of Software Visible Features » Documentation » Technical Articles & Blogs » Benchmarks and Performance Evaluations

20 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 AMD support for GCC

http://developer.amd.com/cpu/gnu/Pages/default.aspx

21 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 New! - ACML 4.3

. L3 BLAS improvements – SGEMM for Six-Core AMD Opteron™ processor – New Intel DGEMM and SGEMM kernels . Supporting Woodcrest, Penryn, Nehalem – New DGEMM "fast memory allocation" scheme . allows improved performance of other routines (such as LAPACK) which make heavy use of DGEMM . Six-Core AMD Opteron™ processor tuning for Level 1 BLAS – xDOT, xCOPY, xAXPY, and xSCAL . 3DFFT performance improvements . AMD Family 10h tuning for real-complex FFTs – csfft, dzfft, scfft and zdfft have been re-tuned for AMD Family 10h processors, providing significant performance increases.

22 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 New - AMD’s High Performance Compiler

AMD has released a version of C/C++/Fortran compilers based on Open64 technology – x86 Open64 version 4.2.2.2 . Release is available for download on AMD Developer Central at http://developer.amd.com/cpu/open64 in source and binary forms . No license fee for compiler use . Provided with documentation and support

File Name Launch Date OS Bitness Description x86_open64-4.2.2.2- Source code for x86 08/31/2009 Linux® Host 32/64-bit 1.src.tar.bz2 (59.6MB) Open64 4.2.2.2-1 x86_open64-4.2.2.2- Host 64-bit x86 Open64 4.2.2.2-1 08/31/2009 RHEL 5.3 & SLES 10 SP2 1.x86_64.rpm (23.4MB) Target 32/64-bit Compilers for Linux x86_open64-4.2.2.2- Host 64-bit x86 Open64 4.2.2.2-1 08/31/2009 RHEL 5.3 & SLES 10 SP2 1.x86_64.tar.bz2 (23.4MB) Target 32/64-bit Compilers for Linux

23 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Software Optimization & Compiler Guidelines

http://developer.amd.com/documentation/guides/Pages/default.aspx

24 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 x86 64-bit Architecture Evolution

2003 2005 2007 2008 2009 2010*

AMD AMD “Magny- “Barcelona” “Shanghai” “Istanbul” Opteron™ Opteron™ Cours”

Mfg. 90nm SOI 90nm SOI 65nm SOI 45nm SOI 45nm SOI 45nm SOI Process K8 K8 “Greyhound” “Greyhound+ “Greyhound+” “Greyhound+ CPU Core ”

L2/L3 1MB/0 1MB/0 512kB/2MB 512kB/6MB 512kB/6MB 512kB/12MB

Hyper Transport™ 3x 1.6GT/.s 3x 1.6GT/.s 3x 2GT/s 3x 4.0GT/s 3x 4.8GT/s 4x 6.4GT/s Technology

Memory 2x DDR1 300 2x DDR1 400 2x DDR2 667 2x DDR2 800 2x DDR2 800 4x DDR3 1333

*Planned model or feature introduction dates.

25 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 AMD 2010-2011 Sweet Spot Server Strategy

2010 2011

4P/8P Platforms ~5% of Market* “Magny-Cours” “Interlagos” AMD Opteron™ 8 and 12 cores 12 and 16 cores 6000 Series Platform Performance-per- ”Maranello” • 2/4 socket; 4 memory channels watt and • Highly scalable without compromising value Expandability

2P Platforms ~75% of Market* Platform Consistency Bulldozer and Commonality Core

Highly Energy AMD Opteron™ “Lisbon” “Valencia” Efficient and 4000 Series Platform 4 and 6 cores 6 and 8 cores “San Marino” Cost Optimized • 1/2 socket; 2 memory channels 1P Platforms and “Adelaide” • New levels of value and power efficiency ~20% of Market*

26 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009 *AMD internal estimates of total server market as of Q309 For more information be sure to visit AMD at SC’09 booth #1417 Unmatched Feature Consistency Across Platforms

AMD Opteron™ 6000 Series Platform Same BIOS (G34 Socket)

Shared Platform Interface Same API

AMD Opteron™ 4000 Series Platform (C32 Socket)

Same Drivers

. OEM Benefits: enables easier design, validation and support . Customer Benefits: easy management and flexibility

27 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 The AMD Opteron™ 6100 Processor

. Target: Enterprise Class 2-way and 4P Socket G34 Server Platform 4-way Servers 12/8 core Processor Support – Twelve-core and Eight–core 12M L3 Cache – CoolCore™ Technology, Enhanced AMD PowerNow!™ Technology, Enhanced C1 state, AMD CoolSpeed™ Technology, APML – Quad-Channel LV & U/RDDR3, ECC, On-line spare – Up to 3 DIMMs/channel, 12 per CPU Coherent HT3 – Expected platforms 2P/2U, 2P Tower, 4P rack, 4P Blade . Single Series for performance DP and MP platforms Non-coherent HT3 – 2P economics for 4P servers – Compelling price/performance for volume SR56x0 SR56x0 market AMD-Vi™ AMD-Vi . G34 Socket Infrastructure x4 A-Link – Performance-optimized Power/thermals

– Quad 16-bit HT3 links, up to 6.4 GT/s per SP5100 link South Bridge – AMD SR56x0 chipset with AMD-Vi™ and PCIe Gen2

28 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 HyperTransport™ Technology HT Assist (Probe Filter)

Key enabling technology on Six-Core AMD “Old” broadcast protocol Opteron™ processor and AMD Opteron™ 6100 Home Series processor Node Probes

RdBlk HT Assist is a sparse directory cache Request Resps Req . Associated with the memory controller on Node the home node . Tracks all lines cached in the system from PF – clean data Home PF the home node Node Lookup

DRAM Eliminates most probe broadcasts (see diagram) Resp Req . Lowers latency Node – local accesses get local DRAM latency, no need to wait for probe responses PF – dirty data Home PF – less queuing delay due to lower HT Node Lookup traffic overhead Directed Probe . Increases system bandwidth by reducing Req Cache Owner probe traffic Node Resp

29 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 HT Assist and Memory Latency

With “old” broadcast coherence protocol, the latency of a memory access is the longer of 2 paths: . time it takes to return data from DRAM and . the time it takes to probe all caches

With HT Assist, local memory latency is significantly reduced as it is not necessary to probe caches on other nodes.

Several server workloads naturally have ~100% local accesses . SPECint®, SPECfp® . VMMARK typically run with 1 VM per core . SPECpower_ssj with 1 JVM per core . STREAM

Probe Filter amplifies benefit of any NUMA optimizations in OS/application which make memory accesses local

SPEC, SPECint, SPECfp, and SPECpower_ssj are trademarks or registered trademarks of the Standard Performance Evaluation Corporation. 30 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Enhanced Integrated Memory Architecture Benefits HPC, Virtualization and Database

Memory Bandwidth Memory Bandwidth (GB/s in STREAM benchmark) (GB/s in STREAM benchmark)

60 4P AMD Opteron™ 61XX 50 8P AMD Opteron 84XX 40 4P AMD Opteron 84XX 30 4/8 Socket 4/8 4P Intel Xeon 74XX 20

10 2P AMD Opteron 61XX 0 2P Intel Xeon 55XX High Power Standard Power Low Power

2P AMD Opteron 24XX 2P AMD Opteron 6100 2P Intel Xeon 55XX 2 Socket 2

0 50 100 …and consistency across Greater peak performance… power bands

1) Based on measurements in AMD Performance Labs as of October 6, 2009. Please see backup slides for configuration information.

31 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Customer Support for AMD Strategy

“We’ve been able to provide remarkable value to our customers, in part due to AMD’s platform consistency and scale up/scale out capability. Our HPC customers can quickly and productively utilize all the processor performance we can provide and they are looking forward to the next wave of innovation with the 8- and 12-core “Maranello” platform. For our part, we are very happy with what we’re seeing in the early silicon and plan to have systems available in 1H10.” Barry Bolding, Vice President, Scalable Systems

32 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Server Roadmap 65nm 45nm 32nm Yellow text denotes new feature

“Shanghai” “Istanbul” “Magny-Cours” “Interlagos” 4-Core 6-Core 8/12-Core 12/16-Core • 6M L3 • 6M L3 • 12M L3 New Core • 3x HT-3 (4.4GT) • 3x HT-3 (4.8GT) • 4x HT-3 (6.4GT) • AMD-V technology • HT Assist • U/RDDR3 & LV RDDR3 (Quad-Channel) • RDDR2 (Dual-Channel) • AMD-V technology • Cool Speed • RDDR2 (Dual-Channel) • C1E 4-way • AMD-V 2 and 4-way Performance • HT Assist Enterprise/ Platform Mainstream “Socket F (1207)” Platform “Maranello” “Six-Core AMD Opteron™ Processor w/AMD Chipset” • AMD SR56x0 ● AMD SP5100 “Maximum Scalability” ● APML Enabled (Istanbul Only) • AMD SR56x0 • AMD SP5100 • Advanced Platform Management “Shanghai” “Istanbul” 4-Core 6-Core

2-way Mainstream “Lisbon” “Valencia” 4/6-Core Platform 6/8-Core “Socket F (1207)” • 6M L3 New Core • 2x HT-3 (6.4GT) • U/RDDR3 & LV RDDR3 (Dual-Channel) “Six-Core AMD Opteron™ Processor with AMD Chipset” • Cool Speed • C1E • HT Assist 1 and 2-way • AMD-V “Suzuka” • 6M L3 Energy “Budapest” • DDR3 Efficient/ 4-Core • 1xHT3 4-Core • AMD-V technology Cost Optimized Platform • AMD SR56x0 “San Marino” (Std/HE/EE) 1-way “Socket AM2+” • AMD SP5100 Platform “Optimized Energy Efficiency” • Advanced Platform Management

“Buenos Aires” • AMD SR5650 “Adelaide” (EE Only) • AMD SP5100 • AMD SR56x0 • LV DDR3 • AMD SP5100 “Ultra Low Power” • HT1

Platform Segment 2009 2010* 2011*

33 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009 * Planned roadmaps For more information be sure to visit AMD at SC’09 booth #1417 Introducing the “Bulldozer” Architecture

. “Bulldozer” is an innovative Fetch new architecture that dramatically Decode reduces the thread- Int Int Scheduler interference Scheduler

. Only AMD is planning to FP Scheduler deliver the most scalable

industry-standard bit

processor architecture with bit

- -

Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline

true core functionality Pipeline

FAMC FAMC

128 128

. “Bulldozer” is a modular L1 DCache L1 DCache

architecture that creates the Shared L2 Cache building blocks of the next generation of processor Shared L3 Cache and NB designs

34 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Designed for Scalability and Performance

“Bulldozer” Flex FP module

Two cores in a single unit that enables two A flexible floating point simultaneous threads, the unit that can be dedicated building blocks of a OR shared between the “Bulldozer” die two cores per cycle

Parallel Dedicated Threads Scheduler

The ability to execute two Independent integer threads on two discrete, schedulers and an FP unshared cores without scheduler improve compromising or creating scalability by efficient bottlenecks execution

35 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Unprecedented Server Performance Gains

45 Floating Point Integer 40

35

30

25

20

15

10

5

0 2003 2004 2005 2006 2007 2008 2009 2010 2011

AMD Opteron™ AMD Opteron AMD Opteron AMD Opteron AMD Opteron AMD Opteron AMD Opteron “Magny-Cours”* “Interlagos”* 244 250 x75 285 x356 x384 2435 (dual-core) (dual-core)

2010 is projected to be the beginning of unprecedented leaps in server performance-per-watt for AMD

36 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009 * “Magny-Cours” and “Interlagos” data is based on AMD projections For more information be sure to visit AMD at SC’09 booth #1417 AMD’s HPC Product Portfolio Energy efficient CPU and discrete GPU processors focused on addressing the most demanding HPC workloads

Multi-core x86 Processors • Outstanding Performance • Superior Scalability • Enhanced Power Efficiency

Professional Graphics • 3D Accelerators For Visualization • See More and Do More with Your Data

ATI Stream Computing • GPU Optimized For Computation • Massive Data-parallel Processing • High Performance Per Watt

37 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Harnessing the Computational Power of GPUs

• GPU architecture increasingly emphasizes programmable shaders instead of fixed function logic • Enormous computational capability for data parallel workloads • Potential to balance system performance • New math for datacenters: high performance/watt and performance/$

Software Applications

Graphics Serial and Workloads Task Parallel Workloads Data Parallel Workloads

38 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 ATI Stream Technology is… Heterogeneous: Developers leverage AMD GPUs and CPUs for optimal application performance and user experience

Industry Standards: OpenCL™ and DirectCompute 11 enable cross-platform development

High performance: Massively parallel, programmable GPU architecture delivers advanced performance and power efficiency

Gaming Digital Content Productivity Creation

Sciences Government Engineering

39 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 GPGPU Processing Power Trend

3000 * Cypress GigaFLOPS ATI RADEON™ HD 5870 2500 RV770 ATI RADEON™ HD 4800 RV670 ATI FirePro™ 2000 ATI RADEON™ V8700 R600 HD 3800 AMD FireStream™ ATI RADEON™ ATI FireGL™ 9250 OpenCL™ 1.0 HD 2900 V7700 9270 R580(+) ATI FireGL™ AMD FireStream™ DirectX® 11 ATI RADEON™ 9170 1500 X19xx V7600 R520 ATI FireStream™ V8600 ATI RADEON™ V8650 2.5x ALU X1800 ATI FireGL™ increase 1000 V7200 V7300 Stream SDK V7350 CAL+IL/Brook 500 + Double-precision GPGPU Unified floating point via CTM Shaders 0 support

Jul-09 Sep-05 Mar-06 Oct-06 Apr-07 Nov-07 Jun-08 Dec-08

*Chart indicated peak theoretical single-precision performance.

40 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Focus On Efficient GPU Progression

14.47 GFLOPS/W

GFLOPS/W GFLOPS/mm2 7.50

4.50 7.90 GFLOPS/mm2

2.01 2.21 4.56 1.07 2.24 0.42 1.06 0.92

41 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Enhancing GPUs for Computation AMD FireStream™ 9270 Stream Processor • 800 stream processing units arranged in 10 SIMD cores • Up to 1.2 TFLOPS peak single precision floating point performance; Fast double-precision processing w/ up to 240 GFLOPS • Up to 115 GB/sec GDDR5 memory interface • Up to 480 GB/s L1 & up to 384 GB/s L2 cache bandwidth • Data sharing between threads • Improved scatter/gather operations for improved GPGPU memory performance* • Integer bit shift operations for all units – useful for crypto, compression, video processing • More aggressive clock gating for improved performance per watt*

*Compared to previous generation AMD FireStream™ stream processors. 42 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009 42 For more information be sure to visit AMD at SC’09 booth #1417 ATI Radeon™ HD 5870 Graphics Architecture

2.72 Teraflops Single Precision 544 Gigaflops Double Precision

• Full Hardware Implementation of DirectCompute 11 and OpenCL™ 1.0

• IEEE754-2008 Compliance Enhancements

• Additional Compute Features: • 32-bit Atomic Operations • Flexible 32kB Local Data Shares • 64kB Global Data Share • Global synchronization • Append/consume buffers

43 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Harnessing the Computational Power of GPUs

Unleash the full potential of STREAM computing with AMD’s new SR56x0 chipsets

• GPU architecture increasingly emphasizes programmable shaders instead of fixed function logic • Enormous computational capability for data parallel workloads • New math for datacenters: high performance/watt and performance/$ • Open systems approach to enable developers: active in Khronos OpenCL™ Working Group driving new cross-platform API

44 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Moving Past Proprietary Solutions for Ease of Cross-Platform Programming

Open and Custom Tools High Level Tools High Level Language Application Specific Compilers Libraries

Industry Standard Interfaces

OpenCL™ DirectX® OpenGL®

AMD AMD Other GPUs CPUs CPUs/GPUs

• Cross-platform development • Interoperability with OpenGL and DirectX® • CPU/GPU back-ends enable balanced platform approach

45 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 ATI Stream SDK v2.0 Beta: OpenCL™ For Multicore x86 CPUs and GPUs The Power of Fusion: Developers leverage heterogeneous architecture to deliver superior user experience • First complete OpenCL™ development platform • Certified OpenCL™ 1.0 compliant by the Khronos Group • Write code that can scale well on multi-core CPUs and GPUs • AMD delivers on the promise of OpenCL™, with both high- performance CPU and GPU technologies • Available for download now as part of ATI Stream SDK beta program – includes documentation, samples, and developer support Beta Program: http://developer.amd.com/streambeta

Software Applications

Graphics Workloads Serial and Data Parallel Task Parallel Workloads Workloads

OpenCL and the OpenCL logo are trademarks of Apple Inc. used by permission by Khronos. 46 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 First To Support DirectCompute 11

• First and only driver to support DirectCompute 11 with Shader Model 5.0. • Fully Microsoft® WHQL certified • Available with ATI Radeon™ HD 58xx Series

47 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 ACML-GPU 1.0 Released March 2009 Selected BLAS routines enabled for GPU . DGEMM, SGEMM will run on GPU if present . Small problems (N,M,K < 200) run on CPU . Supports Quad- and Six-Core AMD Opteron™ processors BLAS LAPACK FFT RNG LIBMV Expert L3 23 Shape Array Simple Generators L2 LAPACK Vector 3.1.1 3D L1 2D 5 Base Fast Scalar 1D Generators

48 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 AMD’s HPC Product Portfolio Energy efficient CPU and discrete GPU processors focused on addressing the most demanding HPC workloads

Multi-core x86 Processors • Outstanding Performance • Superior Scalability • Enhanced Power Efficiency

Professional Consumer Graphics • 3D Accelerators For Visualization • See More and Do More with Your Data

ATI Stream Computing • GPU Optimized For Computation • Massive Data-parallel Processing • High Performance Per Watt

49 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 ATI Radeon™ HD 5870 GPU

The World’s Most Powerful and Advanced GPU

. Ultimate immersion with DirectX® 11 and ATI Eyefinity Technology . Accelerating PCs with nearly 3 teraFLOPS of compute power . Extreme game play at high resolutions and maximum settings**

Compute Power Up to 2.72 TFLOPS

Core Clock Speed 850 MHz

Stream Processors 1600

Frame Buffer 1GB , 4.8Gbps

Max/Idle Board Power 188W/27W

Max Res. 3x 2560x1600**

*Claim of most powerful processor based on raw compute horsepower measured in FLOPS. Internal calculations show that the processor used in the ATI Radeon HD 5800 series can achieve 2.72 TeraFLOPS, more than any other known as of September 23, 2009. FLOPS is not necessarily an indicator of leading performance in every application as AMD GPUs are designed and built to excel specifically in applications involving massively parallel calculations. **Subject to monitor resolution

50 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 ATI Eyefinity Technology Think of it as surround-sight Multiple displays are treated as a single display by the OS.*

Supports up to six 30” panels with a combined resolution of up to 7680 x 3200**

Powering the displays was a SINGLE graphics card, with a SINGLE GPU† *To enable a third display requires one panel with a DisplayPort connector or an active DisplayPort adapter. **The ATI Catalyst™ driver 8663.1 is required to support 3x 2560x1600 resolution. † With the introduction of AMD’s next‐generation graphics product series supporting DirectX® 11, a single GPU now has the advanced capability of simultaneously supporting up to six independent display outputs.

51 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 ATI Eyefinity – YouTube Videos

11,520 x 4,800 - 55.3 megapixels, Four Cards, single PC

http://www.youtube.com/watch?v=mzGtxlaPQqY http://www.youtube.com/watch?v=N6Vf8R_gOec

For more information:

http://www.amd.com/us/products/technologies/eyefinity/Pages/eyefinity.aspx http://www.amd.com/us/Documents/ATI_Eyefinity_Technology_Brief.pdf http://blogs.amd.com/home/2009/09/10/ati-eyefinity%E2%80%99s-panoramic-future-keep-watch/

52 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 AMD at Supercomputing 2009

AMD will be in booth #1417 at Supercomputing 2009 in Portland, Oregon on November 16th through 19th.

Be one of the first to preview the upcoming 8 and 12 core AMD Opteron™ 6000 Series platforms featuring the AMD Opteron™ 6100 Series processors. This new platform provides an outstanding new option for the next generation of HPC innovation.

We will showcase solutions based on AMD FireStream stream processor, ATI FirePro™ workstation graphics accelerators, and the leading-edge AMD Opteron™ processors.

Visit our events page @ www.amd.com/supercomputing09 and learn more about our technology partners featuring AMD technology-based solutions at the SC09, and get information on exciting promotions.

53 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 THANK YOU!

54 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 BACKUP AND CONFIGURATION INFORMATION

55 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Where Do We Put the HT Assist Probe Filter?

Q: Where do we store probe filter entries without adding a large on-chip probe filter RAM which is not used in a 1P desktop system?

A: Steal 1MB of 6MB L3 cache per die in “Magny-Cours”

systems way 0 way 1 way 15 L3 cache Dir set

Implementation in fast SRAM (L3) minimizes – Access latency – Port occupancy of read-modify-write operations – Indirection latency for cache-to-cache transfers

56 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Die Block Diagram

“Magny-Cours” Die (Node)

Core Core Core Core Core Core 0 1 2 3 4 5

512kB 512kB 512kB 512kB 512kB 512kB L2 L2 L2 L2 L2 L2

System Request Interface (SRI)

L3 data array L3 tag (6MB)

XBAR

Memory Probe Controller Filter MCT/DCT

DRAM DRAM 4 HyperTransport™3 Ports DRAMDRAM DRAMDRAM

57 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Format of a Probe Filter Entry

. 16 probe filter entries per L3 cache line (64B), 4B per entry, 4-way set associative . 1MB of a 6MB L3 cache per die holds 256k probe filter entries and covers 16MB of cache

4 ways

Entry 0 Entry 1 Entry 2 Entry 3 L3 Cache Line (64B) 4 sets Entry 12 Entry 13 Entry 14 Entry 15

Probe Filter Entry (4B) Tag State Owner

EM, O, S, S1 states

58 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Probe Filter Transaction Scenarios

PF Hit PF Miss (*)

I O S S1 EM I O S S1 EM FETCH - D - - D - B B DI DI LOAD - D - - D - B B DI DI STORE - B B B DI - B B DI DI

Legend - Filtered “Effective” D Directed DI Directed Invalidate B Broadcast Invalidate “Ineffective” (*) PF miss implies line is Uncached (no broadcast necessary). State refers to the state of the line to be replaced upon allocation of new PF entry.

Traditional “Cache Hit Ratio” does not measure effectiveness of probe filter

59 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Probe Filter Coverage Ratio

Dir 0 5MB L3 + 3MB L2 5MB L3+ 3MB L2 Dir 2 Memory 0 Memory 3 256k lines 128k lines 128k lines 256k lines

P0 P2

Typical Worst case (Hotspotting) Uniformly distributed data Home node of each cached line is P0 Coverage ratio = 256k :: 128k = 2.0x Coverage ratio = 256k :: 128k * 4 = 0.5x

With sharing, a PF entry may P1 P3 track multiple cached copies and the coverage ratio increases

Dir 1 5MB L3 + 3MB L2 5MB L3 + 3MBL2 Dir 3 Memory 1 Memory 3 256k lines 128k lines 128k lines 256k lines

2 Socket “Magny-Cours”

60 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Cache Coherence Protocol

. Track lines in M, E, O or S state in probe filter

. PF is fully inclusive of all cached data in system - if a line is cached, then a PF entry must exist.

. Presence of probe filter entry says line in M, E, O or S state

. Absence of probe filter entry says line is uncached

. New messages – Directed probe on probe filter hit – Replacement notification E ->I (clean VicBlk)

61 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 AMD Opteron™ EE Processor Improves Performance by up to 33%

Integer Throughput Performance 180 160 140 158 Up to 33% higher 120 performance with 100 119 than servers using

80 Score 60 40W ACP Quad-Core Higher is Better is Higher 40 2377 EE 2419 EE AMD Opteron™ 20 4 cores 6 cores 40W ACP 40W ACP processors 0 SPECint®_rate2006 – Two-Socket Servers

Low-Power Quad-Core AMD Opteron™ Processor (“Shanghai”) Model 2377 EE (2.3GHz/40W ACP) Low-Power Six-Core AMD Opteron™ Processor (“Istanbul”) Model 2419 EE (1.8GHz/40W ACP)

SPEC and SPECint are registered trademarks of the Standard Performance Evaluation Corporation. The result for Six-Core AMD Opteron™ processor Model 2419 EE is based upon data submitted to Standard Performance Evaluation Corporation as of August 5, 2009. The other results stated above reflect results published on http://www.spec.org/ as of August 5, 2009. The comparison presented above is based on the best performing two-socket servers using AMD Opteron™ processor Models 2419 EE and 2377 EE. For the latest results, visit www.spec.org. Please see backup slides for configuration information.

62 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 AMD Opteron™ EE Processor Improves Performance-per-Watt by up to 31%

Relative Performance to Power Ratios Workload: Server-side java/SPECpower®ssj_2008 150% Up to 31% higher 125% 131% performance-per- 100% watt than servers 100% 75% using 2.7GHz 75W 2384 2419 EE 75W ACP 40W ACP

50% ACP Quad-Core Higher is Better is Higher 1166 (overall 1614 (overall AMD Opteron™ 25% ssj_ops/watt) ssj_ops/watt) 412840 ssj_ops & 406954 ssj_ops & Percentage Improvement Percentage 260W @ 100% load 178W @ 100% load processors 0% point point Quad-Core AMD Opteron™ Processor (“Shanghai”) Model 2384 (2.7GHz/75W ACP) Low-Power Six-Core AMD Opteron™ Processor (“Istanbul”) Model 2419 EE (1.8GHz/40W ACP)

SPEC and SPECpower are registered trademarks of the Standard Performance Evaluation Corporation. The result for Six-Core AMD Opteron™ processor Model 2419 EE is based upon data submitted to Standard Performance Evaluation Corporation as of August 12, 2009. The other results stated above reflect results published on http://www.spec.org/ as of August 12, 2009. The comparison presented above is based on the best performing two-socket servers using AMD Opteron™ processor Models 2419 EE and 2384. For the latest results, visit www.spec.org. Please see backup slide 15 for configuration information.

63 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Density Calculation Details (Single Rack Comparison)

Notes

Six-Core AMD Opteron™ Six-Core AMD Opteron™ Six-Core AMD Opteron™ processor 2.6GHz processor Model 2425 HE processor Model 2419 EE Processor (75W ACP) (55W ACP) (40W ACP)

42U Standard rack 7.5KW power budget

Estimated KW power Estimated server power required to operate 42 ~13KW or ~ 9.2KW or ~7.5KW or consumption for a fully populated servers (1 rack) @ 42U rack (42 servers) 100% load point with (12894 watts) (9156 watts) (7476 watts) including 5% margin Platform Power 307W 218W 178W Consumption measured (100% load point / (100% load point / (100% load point / at 100% load point 550755 ssj_ops) 462215 ssj_ops) 406954 ssj_ops) Estimated number of 1U server units supported in a standard 24 servers 34 servers 42 servers 42U rack in power budget $305.55 $249.4 (up to 29% less than AMD (up to 40% less than AMD Estimated electricity cost per year. Annual Energy Costs $430.29 Opteron™ processor Model Opteron™ processor Assumes $.10/kWh and 1.6 PUE DC estimates* 2435 based server) Model 2435 based server) cooling factor

*For configuration and power consumption calculations see slide 3. SPEC and the benchmark name SPECpower_ssj are trademarks of the Standard Performance Evaluation Corporation. For the latest SPECpower_ssj2008 benchmark results, visit http://www.spec.org/power_ssj2008.

64 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Configuration Information (energy calc details*)

SIX-CORE SERVER SIX-CORE SERVER SIX-CORE SERVER ZT Systems 1224Ra ZT Systems 1224Ra ZT Systems 1224Ra 2x AMD Opteron™ processor 2x AMD Opteron™ processor 2x AMD Opteron™ processor model 2465 model 2425 HE model 2419 EE 16GB memory (4x 4048 MB) 16GB memory (4x 2048 MB) 16GB memory (4x 2048 MB) 1 SDD 1 SDD 1 SDD

* For full disclosure report, contact AMD. SPEC and the benchmark name SPECpower_ssj are trademarks of the Standard Performance Evaluation Corporation. For the latest SPECpower_ssj2008 benchmark results, visit http://www.spec.org/power_ssj2008.

65 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Configuration Information (EE vs. 75W ACP)

SIX-CORE SERVER QUAD-CORE SERVER ZT Systems 1224Ra ZT Systems 1224Ra 2x AMD Opteron™ processor 2x AMD Opteron™ processor Model 2419 EE (55W ACP) Model 2384 75W ACP 16GB memory (4x 2048 MB) 16GB memory (4x 2048 MB) 1 SDD 1 SDD

* For full disclosure report, contact AMD. SPEC and the benchmark name SPECpower_ssj are trademarks of the Standard Performance Evaluation Corporation. For the latest SPECpower_ssj2008 benchmark results, visit http://www.spec.org/power_ssj2008.

66 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Footnotes

Two-Socket SPECint®_rate2006 158 using 2 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 2419 EE in Supermicro A+ Server 1021M-UR+B server, 32GB (8x4GB DDR2-800) memory, 250GB SATA disk drive, Red Hat Enterprise Linux® Server 5.2 64-bit 119 using 2 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 2377 EE in Supermicro A+ Server 1021M-UR+B server, 32GB (8x4GB DDR2-800) memory, 300GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP1 64-bit http://www.spec.org/cpu2006/results/res2009q2/cpu2006-20090406-06935.html

67 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 HT Assist • HT Assist helps reduces memory latency and increase overall system performance in multi-socket systems • HT Assist improves HyperTransport™ technology link efficiency and increases performance by: • Reducing probe traffic • Resolving probes more quickly • Probe “broadcasting” can be eliminated in 8 of 11 typical CPU-to-CPU transactions • HT Assist can provide a significant benefit in 4-way and greater systems • 4-Way Stream memory bandwidth performance improves by ~60% (41.5GB/s with HT Assist vs. 25.5GB/s without HT Assist) • HT Assist reserves 1MB portion of each CPU’s L3 cache to act as a directory. This directory tracks where that CPU’s cache lines are used elsewhere in the system • For 2-way systems this reduction in L3 cache size may eliminate any HT Assist benefit as probe traffic is already significantly less than in 4-way systems • Each CPU is considered the “host” of the cache information contained in its L3 directory • For many CPU-to-CPU transactions the host CPU knows exactly which CPU to probe for the information it needs, eliminating the need to “broadcast”. • Reducing broadcasting cuts down considerably the amount of system probe traffic and helps resolve probes more quickly, resulting in reduced memory latency and improved system performance

68 AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

6 For more information be sure to visit AMD at SC’09 booth #1417 8 Two-Socket SPECint®_rate2006 – Slide 8 & 11

205 using 2 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 2435 in Supermicro A+ Server 1021M-UR+B server, 32GB (8x4GB DDR2-800) memory, 250GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit

136 using 2 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 2384 in Supermicro A+ Server 1021M- UR+B server, 32GB (8x4GB DDR2-800) memory, 250GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081024-05683.html

217 using 2 x Intel Xeon processors (“Gainestown”) Model E5540 in Supermicro SuperServer 6026T-NTR+ server, 24GB (12x2GB DDR3-1066) memory, 150GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2009q1/cpu2006-20090316-06749.html

131 using 2 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 2382 in Dell PowerEdge R805 server, 32GB (8x4GB DDR2-800) memory, 73GB SAS disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081110-05955.html

51.5 using 2 x Dual-Core AMD Opteron™ processors Model 2218 in Fujitsu Siemens Computers PRIMERGY RX330 S1 server, 32GB (8x4GB DDR2-667) memory, 36GB SAS disk drive, SuSE Linux® Enterprise Server 10 64-bit http://www.spec.org/cpu2006/results/res2007q2/cpu2006-20070427-00947.html

139 using 2 x Quad-Core Intel Xeon processors (“Harpertown”) Model E5450 in Dell PowerEdge 2950 III server, 16GB (4x4GB DDR2-667 FB-DIMM) memory, 73GB SAS disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081027-05790.html

69 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Two-Socket SPECfp®_rate2006 - Slide 8 & 11

143 using 2 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 2435 in Supermicro A+ Server 1021M-UR+B server, 32GB (8x4GB DDR2-800) memory, 250GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit

118 using 2 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 2384 in Supermicro A+ Server 1021M- UR+B server, 32GB (8x4GB DDR2-800) memory, 250GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081024-05684.html

115 using 2 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 2382 in Dell PowerEdge R805 server, 32GB (8x4GB DDR2-800) memory, 73GB SAS disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081110-05954.html

48.5 using 2 x Dual-Core AMD Opteron™ processors Model 2218 in Fujitsu Siemens Computers PRIMERGY RX330 S1 server, 32GB (8x4GB DDR2-667) memory, 36GB SAS disk drive, SuSE Linux® Enterprise Server 10 64-bit http://www.spec.org/cpu2006/results/res2007q2/cpu2006-20070427-00945.html

169 using 2 x Intel Xeon processors (“Gainestown”) Model E5540 in Supermicro SuperServer 6026T-NTR+ server, 24GB (12x2GB DDR3-1066) memory, 150GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2009q1/cpu2006-20090316-06747.html

78.4 using 2 x Quad-Core Intel Xeon processors (“Harpertown”) Model E5450 in Dell PowerEdge 2950 III server, 16GB (4x4GB DDR2-667 FB-DIMM) memory, 73GB SAS disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081027-05795.html

70 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Two-Socket SPECjbb®2005 – Slide 8 & 11

462189 score using 2 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 2435 in Supermicro A+ Server 1021M-UR+B server, 32GB (8x4GB DDR2-800) memory, 300GB SATA disk drive, Microsoft® Windows Server® 2003 Enterprise x64 Edition SP2 352700 score using 2 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 2384 in Supermicro A+ Server 1021M-UR+B server, 32GB (8x4GB DDR2-800) memory, 300GB SATA disk drive, Microsoft® Windows Server® 2003 Enterprise x64 Edition SP2 http://www.spec.org/osg/jbb2005/results/res2008q4/jbb2005-20081024-00551.html 468132 score using 2 x Intel Xeon processors (“Gainestown”) Model E5540 in Supermicro X8DTN+ , 12GB (6x2GB DDR3-1066) memory, 150GB SATA disk drive, Microsoft® Windows Server® 2008 R2 Enterprise x64 Edition SP1 362239 using 2 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 2382 in Supermicro A+ Server 1021M-UR+B server, 32GB (8x4GB DDR2-800) memory, 300GB SATA disk drive, Microsoft® Windows Server® 2008 Enterprise x64 Edition 66420 using 2 x Dual-Core AMD Opteron™ processors Model 2218 in Tyan Thunder K9HM (S3992) server, 8GB (16x512MB DDR2-667) memory, 120GB IDE disk drive, Microsoft® Windows Server® 2003 Enterprise Edition SP1 64-bit http://www.spec.org/osg/jbb2005/results/res2006q3/jbb2005-20060718-00154.html

310028 using 2 x Quad-Core Intel Xeon processors (“Harpertown”) Model E5450 in IBM BladeCenter HS21XM server, 16GB (8x2GB DDR2-667) memory, 73GB SAS disk drive, Microsoft® Windows Server® 2003 R2 Enterprise x64 Edition SP1 http://www.spec.org/osg/jbb2005/results/res2008q2/jbb2005-20080527-00494.html

71 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Two-Socket STREAM – Slide 7 & 8

21GB/s using 2 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 2435 in Supermicro H8DMU+ motherboard, 16GB (8x2GB DDR2-800) memory, SuSE Linux® Enterprise Server 10 SP1 64-bit 21GB/s using 2 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 8384 in Supermicro H8DMU+ motherboard, 16GB (8x2GB DDR2-800) memory, SuSE Linux® Enterprise Server 10 SP1 64-bit

Two-Socket Server Idle Power – Slide 11 83.21W using 2 x Dual-Core AMD Opteron™ processors Model 2218 in ZT Systems 1224Ri Datacenter Server, 16GB (4x4GB DDR2-800) memory, 80GB SSD SATA disk drive, Sparkle Power International LTD SPI4001UG power supply, Microsoft® Windows Server® 2008 SP1 64-bit

92.84W using 2 x Quad-Core AMD Opteron™ (“Shanghai”) processors Model 2218 in ZT Systems 1224Ri Datacenter Server, 16GB (4x4GB DDR2-800) memory, 80GB SSD SATA disk drive, Sparkle Power International LTD SPI4001UG power supply, Microsoft® Windows Server® 2008 SP1 64-bit

91.32W using 2 x Six-Core AMD Opteron™ (“Istanbul”) processors Model 2218 in ZT Systems 1224Ri Datacenter Server, 16GB (4x4GB DDR2-800) memory, 80GB SSD SATA disk drive, Sparkle Power International LTD SPI4001UG power supply, Microsoft® Windows Server® 2008 SP1 64-bit

72 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Four-Socket and Eight-Socket SPECint®_rate2006 – Slide 6 & 8

402 using 4 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 8435 in Tyan Thunder n4250QE server (S4985-SI), 64GB (16x4GB DDR2-800) memory, 250GB SATA disk drive, Red Hat Enterprise Linux® Server release 5.3 64-bit http://www.spec.org/cpu2006/results/res2009q2/cpu2006-20090511-07355.html

249 using 4 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 8384 in HP ProLiant DL585 G5 server, 64GB (16x4GB DDR2-800) memory, 146GB SAS disk drive, Red Hat Enterprise Linux® Server release 5.2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081027-05740.html

386 using 8 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 8384 in Sun Fire X4600 M2 server, 128GB (64x2GB DDR2-667) memory, 2x 72GB SAS disk drive, OpenSolaris 2008.05 http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081208-06223.html

294 using 4 x Hex-Core Intel Xeon processors (“Dunnington”) Model X7460 in IBM System x3850 M2 server, 64GB (16x4GB DDR2-667 FB-DIMM) memory, 73GB SAS disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20080915-05319.html

253 using 4 x Hex-Core Intel Xeon processors (“Dunnington”) Model E7450 in Intel Server System S7000FC4UR motherboard, 32GB (16x2GB DDR2-667 FB-DIMM) memory, 73GB SAS disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20080915-05347.html

73 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Four-Socket SPECjbb®2005 – Slide 8

866261 using 4 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 8435 in Tyan Transport TX46 server, 64GB (16x4GB DDR2-800) memory, 250GB SATA disk drive, Microsoft® Windows Server® 2008 Enterprise x64 Edition

721843 using 4 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 8384 in IBM Bladecenter LS42 server, 64GB (16x4GB DDR2-800) memory, 36GB SAS disk drive, Microsoft® Windows Server® 2008 Enterprise x64 Edition http://www.spec.org/osg/jbb2005/results/res2008q4/jbb2005-20081112-00559.html

633897 using 4 x Hex-Core Intel Xeon processors (“Dunnington”) Model X7460 in Fujitsu PRIMERGY RX600 S4 server, 64GB (16x4GB DDR2-667 FB-DIMM) memory, 36GB SAS disk drive, Microsoft® Windows Server® 2003 Enterprise x64 Edition http://www.spec.org/osg/jbb2005/results/res2009q1/jbb2005-20090305-00664.html

74 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Four-Socket SPECfp®_rate2006 – 8

276 using 4 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 8435 in Tyan Transport TX46 server, 64GB (16x4GB DDR2-800) memory, 250GB SATA disk drive, Red Hat Enterprise Linux® Server release 5.2 64-bit

212 using 4 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 8384 in Dell PowerEdge M905 server, 64GB (16x4GB DDR2-800) memory, 36GB SAS + 73GB SAS disk drives, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20081110-05969.html 156 using 4 x Hex-Core Intel Xeon processors (“Dunnington”) Model X7460 in HP ProLiant DL580 G5 server, 64GB (16x4GB DDR2-667 FB-DIMM) memory, 146GB SAS disk drive, SuSE Linux® Enterprise Server 10 SP1 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20080911-05253.html

139 using 4 x Hex-Core Intel Xeon processors (“Dunnington”) Model E7450 in Fujitsu Siemens Computers PRIMERGY RX600 S4 server, 64GB (16x4GB DDR2-667 FB-DIMM) memory, 36GB SAS disk drive, SuSE Linux® Enterprise Server 10 SP2 64-bit http://www.spec.org/cpu2006/results/res2008q4/cpu2006-20080922-05389.html

75 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Four-Socket STREAM – Slide 7 & 8

42GB/s using 4 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 8435 in Tyan Thunder n4250QE (S4985-E) motherboard, 32GB (16x2GB DDR2-800) memory, SuSE Linux® Enterprise Server 10 SP1 64- bit (with HT Assist enabled) 25.5GB/s using 4 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 8435 in Tyan Thunder n4250QE (S4985-E) motherboard, 32GB (16x2GB DDR2-800) memory, SuSE Linux® Enterprise Server 10 SP1 64-bit (with HT Assist disabled) 24GB/s using 4 x Quad-Core AMD Opteron™ processors (“Shanghai”) Model 8384 in Tyan Thunder n4250QE (S4985-E) motherboard, 32GB (16x2GB DDR2-800) memory, SuSE Linux® Enterprise Server 10 SP1 64-bit 9GB/s using 4 x Hex-Core Intel Xeon processors (“Dunnington”) Model X7460 in Supermicro X7QC3+ motherboard, 32GB (16x2GB DDR2-667 FB-DIMM) memory, SuSE Linux® Enterprise Server 10 SP1 64-bit 9GB/s using 4 x Hex-Core Intel Xeon processors (“Dunnington”) Model E7450 in Supermicro X7QC3+ motherboard, 32GB (16x2GB DDR2-667 FB-DIMM) memory, SuSE Linux® Enterprise Server 10 SP1 64-bit

76 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 2 Socket HPL Performance – Slide 10

AMD system:

2 x Six-Core AMD Opteron™ processors ("Istanbul") Model 2435 (2.6GHz) in Supermicro A+ Server 1021M-UR+B server, 32GB (8x4GB DDR2-800) memory, 300GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP1

64-bit Intel system:

2 x Quad-Core Intel processor ("Nehalem") Model X5570 (2.93 GHz) 2in Intel board S5520UR, 24GB (6x4GB DDR3-1333) memory, 300GB SATA disk drive, SuSE Linux® Enterprise Server 10 SP1 64-bit

77 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 HPC Interconnect performance testing – slide 18

Configurations used to collect performance data:

• ausqc[01-08]: 2 x Quad-Core AMD Opteron™ processors Model 2384 in Supermicro H8DMU+ motherboard, 16GB (8x2GB DDR2-800 ), 150GB IDE disk drive, IB DDR [card,switch], SuSE Linux® Enterprise Server 10 SP1 64-bit • fiorano[01-08]: 2 x Quad-Core AMD Opteron™ processors Model 2387 in Toonie 2 , 16GB (8x2GB DDR2-800 ), 150GB IDE disk drive, IB QDR [card,switch], SuSE Linux® Enterprise Server 10 SP2 64-bit • Tooniehpc[1,2]: 2 x Quad-Core AMD Opteron™ processors Model 2387 in Toonie 2 , 16GB (8x2GB DDR2-800 ), 150GB IDE disk drive, IB [D,QDR] back to back [no IB switch], SuSE Linux® Enterprise Server 10 SP2 64-bit • IB [D,Q]DR cards and switches provided by Mellanox.

78 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 STREAM (Memory Bandwidth) – From slide 31

2 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 2435 in Supermicro A+ Server 1021M-UR+B server, 16GB (8x2GB DDR2-800) memory, SuSE Linux® Enterprise Server 10 SP2 64-bit 4 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 8435 in Tyan Transport TX46 server, 32GB (16x2GB DDR2-800) memory, SuSE Linux® Enterprise Server 10 SP2 64-bit 8 x Six-Core AMD Opteron™ processors (“Istanbul”) Model 8435, AMD 5690 Chipset reference design platform, 64GB (32x2GB DDR2-800) memory, SuSE Linux® Enterprise Server 10 SP2 64-bit 2 x AMD Opteron™ processors (“Magny-Cours ”) Model 61xx, AMD 5690 Chipset reference design platform, 32GB memory, SuSE Linux® Enterprise Server 10 SP2 64-bit. 4 x AMD Opteron™ processors (“Magny-Cours”) Model 61xx, AMD 5690 Chipset reference design platform, 64GB memory, SuSE Linux® Enterprise Server 10 SP2 64-bit. 2x Intel Xeon processors (“Gainestown”) Model X5570 in Supermicro SuperServer 6026T-NTR+ server, 24GB memory (6x4GB DDR3-1333) memory, SuSE Linux® Enterprise Server 10 SP2 64-bit 2x Intel Xeon processors (“Gainestown”) Model E5540 in Supermicro SuperServer 6026T-NTR+ server, 24GB memory (6x4GB DDR3-1066) memory, SuSE Linux® Enterprise Server 10 SP2 64-bit 2x Intel Xeon processors (“Gainestown”) Model L5520 in Supermicro SuperServer 6026T-NTR+ server, 24GB memory (6x4GB DDR3-1066) memory, SuSE Linux® Enterprise Server 10 SP2 64-bit 4x Intel Xeon processors (“Dunnington”) Model X7460 in Supermicro X7QC3+ motherboard, 32GB (16x2GB FBDIMM) memory, SuSE Linux® Enterprise Server 10 SP1 64-bit

79 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Disclaimer & Attribution

DISCLAIMER The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors.

The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes.

AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION.

AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.

ATTRIBUTION © 2009 , Inc. All rights reserved. AMD, the AMD Arrow logo, ATI, the ATI logo, AMD CoolCore, AMD PowerNow!, AMD Opteron, AMD Virtualization, AMD-V, Dual Dynamic Power Management, Catalyst, FireGL, FirePro, FireStream, Radeon, and combinations thereof are trademarks of Advanced Micro Devices, Inc. HyperTransport is a licensed trademark of the HyperTransport Technology consortium. Microsoft, Windows, and DirectX are registered trademarks of Microsoft Corporation in the United States and/or other jurisdictions. Other names are for informational purposes only and may be trademarks of their respective owners.

OpenCL is a trademark of Apple Inc. used under License to the Khronos Group Inc.

80 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417 Index Slide

•HPC Product Portfolio •Platform Roadmap •Istanbul •Istanbul Developer Update •CPU overview through 2010 •Magny-Cours •A peek at the future •STREAM Computing •EyeFinity Overview •Backup

81 | AMD HPC Product Portfolio Update @ SC’09 | November 30, 2009

For more information be sure to visit AMD at SC’09 booth #1417