Moving Forward: How emerging data center architectures are enabling virtualization adoption

Randy Allen Corporate VP, Server and Workstation Division Issues That Are Raising Data Center Costs

Underutilized resources Power, cooling, and space constraints Managing and securing the computing infrastructure Disruptions that decrease user productivity Solution: Transform Data Center Infrastructure Shared Infrastructure

Internet Virtual Resource Pool

• Higher rates of utilization • Significant unused capacity • Reconfigure dynamically • Inflexible • Unified resources “managed as • Many legacy technologies one” How Do We Define “Virtualization?”

Virtualization: The Use of Abstractions of Physical Entities in Place of Direct References to those Entities

If Done with Finesse, These Substitutions Are Invisible to Users and Developers and the Software They Use or Develop

Source: Picasso, 1912 (Table in a Café) Virtualization Has Been Around For a Very Long Time

We See Virtual Images In Mirrors Everywhere Often Extremely Realistic Sometimes Highly Distorted • Rear View Mirrors • Fun House Mirrors Interior Designers Use Mirrors to Deceive Our Senses and Make Rooms Seem Larger Software Designers Use Virtualization to Deceive Programs and Make Machines Seem Larger Source: Carroll/Tenniel (Illustrator), 1871 : Through the Looking-Glass Virtually All Aspects of System Hardware Can Be Virtualized

Virtualization can mask annoying hardware details from software

Virtual Virtualized devices can Storage Physical be highly fault tolerant Memory Physical Disks, LANs, and SANs Virtualization can provide

added flexibility to Virtual configuring and managing Machine hardware resources Physical Server Behind Every Virtual Machine Stands (a Fraction of) A Physical Machine

One Physical Machine Can Typically Host 10 to 30 VMs Multi-Core CPUs Will Further Improve This Ratio Reduces “Server Sprawl” Better Use of Data Center Real Estate Improves Electrical Power Utilization 10-20 Watts Per Idle VM (vs. 100-200W Per Idle PM) Simplifies Deployment of New Applications, Servers Facilitates Rapid Recovery From Hardware Failures Source: Picasso, 1968 (Homme a la Pipe) The Secret Sauce That Makes Virtualization Possible

It’s All Done With Mirrors!

With a Little Help from AMD Silicon and Advanced Software from Several AMD Ecosystem Partners

Source: M. C. Escher, 1935 (Hand with Reflecting Sphere) What Is A Virtual Machine?

Technology that partitions a computer into several independent machines that can Application Application Application

Guest OS Guest OS Guest OS support different OS’s and applications concurrently Virtual Virtual Virtual Machine Machine Machine is the underlying Hypervisor software which runs directly on the hardware and manages Real multiple operating systems Machine Virtual machine is self-contained operating environment that runs on top of the hypervisor and behaves as if it is a separate computer From Physical Servers to Virtual Machines

Breaks “one Virtualization converts Application application per physical boundaries into server” model logical constructs

Virtual You can still have your Machine special function servers – Virtual Virtual Machine they’re just sharing physical Machine resources with each other Virtual Machine Virtualized servers can be moved easily between Move Virtual Virtual Machines Machine physical machines, thus based on Virtual Virtual dynamically reconfiguring Machine Machine workload or maintenance your data center schedules Virtualization In the World

• In the x86 environment, the Application OS is “selfish” and expects direct control of the CPU 3 g in R 2 • x86 architecture supports g in R different classes of privileged 1 g in operations (often called R 0 Operating System “rings”) g Device Drivers in R • Privileged software runs in Ring 0 and can exercise control over software running Only privileged code can at higher rings define memory-mapping parameters or set up • Traditional x86 processors handlers can't simultaneously run both host and guest OS’s at Ring 0

Application AMD Is Driving x86-based Virtualization

AMD64 Direct Virtualization Nested I/O Technology Connect Extensions Paging Virtualization Road Map Architecture (Today) (2007*) (2008*)

Increase performance Simplify Reduce overhead of Virtualize devices with multi-core virtualization switching between to improve capabilities and fast software and virtual machines performance and memory access helps reduce security CPU overhead

AMD and VMware are working together to develop a robust virtualization ecosystem that provides users with innovative virtualization solutions

*Planned features for future AMD ™ processor-based systems Software-Enabled Virtualization

Application Application

3 g 3 in g R in 2 Modified R Unmodified g 2 in Guest OS g Guest OS R in 1 R 1 g g in Hypervisor n R i Hypervisor 0 Host OS R g 0 in g R in R

Para-virtualization Hypervisor handles Hypervisor is also Host OS Host/Guest OS interactions Intercepts and emulates Guest OS is recompiled to instructions and virtualizes run at Ring 1 devices Host OS handles devices Guest OS runs unmodified Hardware-Enabled Virtualization

ƒ Creates a Guest Mode that Application allows Guest OS to run

3 unmodified with Hypervisor g in R 2 g in ƒ Hypervisor controls access to R 1 g devices n i Guest R 0 g Mode in R ƒ Tagged TLB reduces the number Hypervisor of memory accesses needed to switch between host/guests

ƒ Instructions ease creation and management of virtual machines AMD Opteron™ Processor The Platform for Virtualization

64-bit and Dual-Core Performance- per-watt Enhances performance while Assists data offering the centers in flexibility to support controlling power both 32- and 64-bit consumption and applications heat output

Performance Virtualization

High performance in Enables separate, smaller form factor secure operating Grid Computing environments

Perfect balance of performance, power, and cost Introducing “Barcelona”… Native quad-core upgrade for 2007 Platform Native Quad- Compatibility Core Processor Socket and thermal To increase performance- compatible with “Socket F”. per-watt efficiencies using the same .

Direct Connect Advanced Process Architecture Technology • Integrated memory controller designed for reduced memory 65nm Silicon-on latency and increased Insulator Process performance • Memory directly connected

Fast transistors with low •Provides fast CPU-to-CPU power leakage to reduce communication power and heat. • CPUs directly connected • Glueless SMP up to 8 sockets A Closer Look at the Barcelona Processor

Comprehensive Upgrades Virtualization for SSE128 Performance

Expandable Advanced shared Power L3 cache Management

More delivered IPC-enhanced DRAM Bandwidth CPU cores Direct Connect Architecture – Virtualization Platform

CORE CORE CORE CORE CORE CORE CORE CORE

MCPMCP MCPMCP MCPMCP MCPMCP

8 GB/S PCI-EPCI-E USB Memory BriBridgedge USB Memory PCI-EPCI-E I/O Hub Controller I/OI/OI/O Hub HubHub Controller BriBridgedge Hub PCIe™PCIe™ PCIPCI Hub BriBridgedge

8 GB/S 8 GB/S

PCIe™PCIe™ PCIe™PCIe™ Bridge Bridge XMBXMB XMBXMB XMBXMB XMBXMB Bridge Bridge

8 GB/S

USBUSB I/OI/O HubHub PCIPCI

Legacy x86 Architecture AMD64 Technology with • 20-year old front-side bus architecture Direct Connect Architecture • CPUs, Memory, I/O all share a bus • Industry-standard AMD64 technology • Major bottleneck to performance • Eliminate bottlenecks of traditional front-side bus architecture • Faster CPUs or more cores ≠ performance • HyperTransport™ technology interconnect for high bandwidth and low latency AMD Virtualization™ Today Enabling More Efficient Workload Consolidation

• Instructions added to help reduce complexity of virtualization software

• Reducing overhead by selectively intercepting information destined for guest Application Application

Guest OS Guest OS

Virtual Virtual • Enables Guest OS’s to run unmodified for Machine Machine easier implementation and support Hypervisor Host OS • Increased isolation to improve security of Virtualization Instructions virtual machines

• Offers architectural enhancements to improve efficiency of switching between hypervisor and the guest OS’s Virtualization is Memory Intensive

VMVMVMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM VMVM CPUCPU CPUCPU CPUCPU CPUCPU CPUCPU CPUCPU

MemoMemoryry MemoMemorryy CoControntrolllelerr CoControntrolllelerr

MemoMemoryry ControlControllerler HubHub VMVM VMVM VMVM VMVM VMVM VMVM

CPUCPU CPUCPU Memory MemoMemorryy Memory Controller CoControntrolllelerr Controller

Shared memory can create bottlenecks Dedicated memory for scalability

• Shared front-side bus can decrease application • Direct Connect Architecture helps improve performance within a virtual machine application performance within a virtual machine • Untagged TLB means less efficient switching • Tagged TLB means more efficient switching between virtual machines between virtual machines • Software-based memory management and • Hardware-based memory management and security security (via external Memory Controller (Integrated memory controller with DEV) can Hub) can reduce overall virtualization improve overall virtualization performance and performance and efficiency efficiency Virtualization Performance Today – 4P Servers

Advantages of AMD Opteron™ processor-based system for x86 Virtualization 96% CPU Util. At 20 VMs: Over 2x better web serving

82% Over 50% better file & print CPU serving Util. 15% less power draw 14% more CPU headroom

96% At Same CPU Utilization: CPU Util. 6 additional VMs hosted 2.6x better web serving 2.1x better file & print serving 12% less power draw 883W 740W 902W 763W 796W

Full 4P study available at: http://www.veritest.com/NR/rdonlyres/F1F21288-8D6D-4769-94F3-252FCB02D51F/272/AMD_Virtualization_Project_Final_Report.pdf Virtualization Performance – 2P Servers

Workload Scores (Sums of Individual VMs) (DBH and WB in rqsts/sec, NB in Mb/sec) Advantages of AMD Opteron™ processor-based system 3000.00 126% 130.00 for x86 Virtualization Geomean

2500.00 125.00 At 12 VMs: DBHammer 10% Advantage in Light DBHammer AMD Workload 2000.00 WebBench Intel 120.00 110% 26% Advantage in Heavy Geomean WebBench AMD Workload NetBench Intel 1500.00 115.00 Approximately a 26% NetBench AMD performance/watt advantage

1000.00 110.00

At similar CPU utilization rates: 500.00 105.00 7-12% more SQL Requests per second 13-51% more Web requests per 0.00 100.00 second Light Workload Heavy Workload 2-19% better file & print serving

Full study available at: http://www.veritest.com/ Nested Paging Enabling More Efficient Switching

Guest Application Application App Guest App OS OS Page Page Guest OS Guest OS Page Table Table Table Virtual Virtual Machine Machine

Application Application Hypervisor Guest OS Guest OS Nested Page Virtual Virtual Machine Machine Tables

Shadow Host Page Table Hypervisor Page Guest OS Guest OS Tables Page Page Table Table

Guest Guest OS Guest OS App Guest App OS Page OS Page Page Page Page Table Page Table Table Table Table Table • Each guest physically has their • Provides the guest OS with the illusion own world to manage that it is managing memory • Requires less intervention • Page tables are actually kept up by the • Memory look ups done in hypervisor in software hardware which can be faster • Requires more software intervention than software management from the hypervisor AMD’s “Barcelona” Processor Improves Virtualization Performance

Nested Paging (NP) Guest and Host page tables both exist in memory • The processor walks both guest and host page tables Application Application Guest OS Guest OS

Nested walk can have up to 24 memory Virtual Virtual Nested accesses! Machine Machine Page Hypervisor Tables • Hardware caching accelerates the walk Host OS “Wire to wire” translations are cached in TLBs AMD-V Eliminates hypervisor cycles spent managing shadow pages Host Page Table Guest OS Guest OS • As much as 75% of hypervisor time Page Page Table Table

Guest OS Guest OS Barcelona also reduces world-switch time by 25% Page Page Table Table World-switch time : round-trip to the hypervisor and back Cygwin Compile with AMD Nested Paging VMware products will take advantage of hardware-based Guest Page Translation assist (such as AMD Nested Paging) in 2007 and beyond as hardware becomes available.

Cygwin Compile 389.61 AMD-V (no NP) 387.67

Nested 205.56 Paging BT 198.22 2VM Avg reduces 1VM compile time by 43% 117.00 Among Best Case Improvement AMD-V & NP for Nested Paging, which mainly 113.22 helps memory-management intensive workloads; not representative of all workloads. 0.00 50.00 100.00 150.00 200.00 250.00 300.00 350.00 400.00 450.00 Actual VM time in seconds Platform: Experimental AMD Processor with Nested Paging running experimental build of VMware Workstation. The First Wave: Using Virtualization to Lower Data Center Costs

Consolidate workloads to reduce hardware and space requirements

Run legacy software on reliable, power-efficient hardware

Reduce hardware requirements for development and test AMD: Consolidating Hardware

In Austin, approximately 135 servers consolidated to 7 active ESX 3.0 servers plus 2 swing servers In Sunnyvale, approximately 45 servers consolidated to 2 active ESX 3.0 servers plus 1 swing server Overall consolidation ratio between the two sites is 15:1 including swing servers Anticipate 79% reduction in power consumption (Austin Data Center) CDW: Wowing the Software Developer

One of the largest direct Virtualization Environment providers of technology • Virtualization Software: VMware ESX Server • Servers: 2P servers and 4P HP ProLiant DL 585s solutions for business, running AMD Opteron™ processors

government and education • Guest OS’s: Windows® and ®

• Applications: Windows domain controllers, IIS, FTP Develops most applications Endeca, ISA Server 2004, File & Print, SQL Server, in-house Exchange, Blackberry Enterprise Server, BizTalk Server, Cognos, PeopleSoft

9 Can now easily provision development and test environments without having to build physical servers 9 Reduced server deployment time from a day to a half hour 9 $25,000 a month savings from deploying virtual machines 9 CPU utilization increased from 5 – 15% to an average of 50%

http://www.vmware.com/pdf/cdw.pdf The Data Center of the Future In the 1970’s and 1980’s, Large Systems Prevailed The Larger, the Better (Grosch’s Law) Over the Last Decade, Scale-Out Deployments Became Feasible The Smaller, the Better (Moore’s Law) Today, the Pendulum Will Start to Swing Back Toward Larger Systems Reduce Server Sprawl and Power Requirements Simplify System Management Improve Resource Utilization A Few Virtualization Plays a Key Role in Many Big Servers Small Servers Enabling this Shift Closing Considerations

Virtualization Facilitates Resource Sharing But You’ve Got to Have Resources to Share Some workloads are not the best candidates for virtualization May Be of Little Value in Compute-Intensive HPTC environments Virtual Machines Require Real Software Licenses Paid for with Real Money

Source: Picasso, 1947 Ulysses and the Sirens) Summary

Virtual Technology Can Help IT Managers: Simplify Data Center Operations Improve Resource Utilization Reduce Server Sprawl Improve the Power Efficiency of their Data Centers Evolving Hardware and Software Offerings Will Enhance Virtualization and Security in Coming Years AMD Is Playing a Key Role in Driving this Evolution If You Have Yet to Explore this Technology, Watch Out! Your Competitors Are Moving in this Direction You Are at Risk of Falling Behind

Trademark Attribution

AMD, the AMD Arrow logo, AMD , AMD Opteron, AMD Turion, AMD . AMD Geode, and combinations thereof are trademarks of , Inc. in the United States and/or other jurisdictions. Other names used in this presentation are for identification purposes only and may be trademarks of their respective owners.

©2006 Advanced Micro Devices, Inc. All rights reserved. Presentation Download

Please remember to complete your session evaluation form and return it to the room monitors as you exit the session

The presentation for this session can be downloaded at http://www.vmware.com/vmtn/vmworld/sessions/

Enter the following to download (case-sensitive): Username: cbv_rep Password: cbvfor9v9r Some or all of the features in this document may be representative of feature areas under development. Feature commitments must not be included in contracts, purchase orders, or sales agreements of any kind. Technical feasibility and market demand will affect final delivery.