SUN SERVER, STORAGE AND VISUALIZATION SOLUTIONS FOR GRID COMPUTING

Constantin Gonzalez Ambassador Technical Systems GmbH

1 What Sun Does

Software Storage

Services Systems

SPARC 64 Network.com Microelectronics Sun Systems

Software Storage

Services SPARC 64

Network.com Systems Microelectronics How Applications Behave High Parallelism

Proxy Caching Data Warehousing Data Analysis Web Serving Client Server Streaming Media

OLTP Database Security File Server Directory SAP R3 J2EE Application Servers Storage Commercial Batch Network Centric Genomics Centric EAI Servers Structural Analysis Electronic Design Simulation Workgroup Compute Grid Application Development Financial Risk/Portfolio Analysis Monte Carlo Simulation Cheminformatics

No Parallelism The Bad News... High Parallelism

Storage Network Centric Centric

There is no egg-laying, wool and milk pork!

No Parallelism The Good News... High Parallelism

SPARC64 - VI UltraSPARC T1/T2

Storage Network Centric x86 / x64 Centric

There are good solutions for all situations! No Parallelism Sun’s Strategy x86 / x64 SPARC64 - VI

Single Thread With Chip Multi-Threading Mission Critical Processors 32 Threads / 8 Cores Today 128 Threads / 16 Cores in development Teams With Teams With The Innovation of

Solaris on All of Them Back-End Datacenter Servers High Parallelism

Proxy Caching Data Warehousing Data Analysis Web Serving Client Server Streaming Media

OLTP Database Security File Server Directory SAP R3 J2EE Application Servers Storage Commercial Batch Network Centric Genomics Centric EAI Servers Structural Analysis Electronic Design Simulation Workgroup Compute Grid Application Development Financial Risk/Portfolio Analysis Monte Carlo Simulation Cheminformatics

No Parallelism Sun SPARC Enterprise Servers M8000 & M9000 Highest availability Highest absolute performance Sophisticated control of resources Highest scalability

• Mainframe-class reliability M4000 & M5000 • New, high-speed, low Price/performance targeted latency interconnect Competitive RAS and S/W • Faster processors Dense rack mount package • Industry standard PCI-eEn I/Otry Systems: • High Tde1000gree &of Tmodul2000arity • Balanced SMP design

UltraSPARC T1 SPARC64 VI SPARC Enterprise M-Series Scalability and Performance • Proven Solaris Scalability • Systems Scale from 2–64 sockets (4–128 cores) > Dual-core, dual-thread SPARC64 VI processor > 2.15GHz to 2.4GHz Sun's • From 16GB to 2TB of DDRII memory Fastest Servers • From 5 to 288 PCI-X and PCI-E I/O slots Ever • M9000: 1.032 TFLOP Linpack

Solving the world’s most complex computing problems Web/Networking Servers High Parallelism

Proxy Caching Data Warehousing UltraSPARC T1/T2 Data Analysis Web Serving Client Server Streaming Media

OLTP Database Security File Server Directory SAP R3 J2EE Application Servers Storage Commercial Batch Network Centric Genomics Centric EAI Servers Structural Analysis Electronic Design Simulation Workgroup Compute Grid Application Development Financial Risk/Portfolio Analysis Monte Carlo Simulation Cheminformatics

No Parallelism Chip Multi-Threading • Memory latency is biggest issue in CPU design today • Sun's idea: While one thread waits for data, others can use the CPU pipeline!

Conventional Processor SingleUltraSP ThrARCeaded T 1 Processor Performance Utilization: 15–25% Utilization near 100%

Thread 4 C M C M C M Thread Thread 3 C M C M C M Thread 2 C M C M C M C M C M C M Thread 1 C M C M C M

Time Time Memory Effective Memory Effective Latency CPU Time Latency CPU Time UltraSPARC T1: Innovation Sun CMT CPU Technology: 8 Cores and 4 Threads per Core

Thread 4 Thread 3 Core 8 Thread 2 Thread 1 Thread 4 Thread 3 Core 7 Thread 2 Thread 1 Thread 4 Thread 3 Core 6 Thread 2 Thread 1 Thread 4 Thread 3 Core 5 Thread 2 Thread 1 Thread 4 Thread 3 Core 4 Thread 2 Thread 1 Thread 4 Thread 3 Core 3 Thread 2 Thread 1 Thread 4 Thread 3 Core 2 Thread 2 Thread 1 Thread 4 Thread 3 Core 1 Thread 2 Thread 1 Time Memory Latency CPU Time UltraSPARC T1 “Niagara” Processor

DDR-2 DDR-2 DDR-2 DDR-2 > SPARC V9 implementation SDRAM SDRAM SDRAM SDRAM > Up to eight 4-way multi- threaded cores for up to 32 simultaneous threads L2$ L2$ L2$ L2$ > Designed for multi-threaded Xbar FPU workloads > All cores connected through a C1 C2 C3 C4 C5 C6 C7 C8 134.4GB/s crossbar switch Sys I/F > 4 DDR2 channels (25.6GB/s) Buffer Switch Core > Typical Power : 72 Watts BUS Joyent Consolidates on CoolThreads Servers and Solaris 10 Reduced Operating Costs, Improved Agility and Scalability Legacy New Environment Environment 4 Racks Required Single Rack Solution Dell 2850 and Sun Fire T1000 Servers 850 Servers Solaris 10 & Nevada

● Data center ● Reduced operational com plexity complexity and cost ● ● Impeded scalabilty Improved scalability ● 4x space and 3x ● Increasing costs power savings ● Power and space constaints ● >7x number of logical ● Poor performance Servers with Solaris and utilization Containers ● Simplified installation and management

http://scalewithrails.com/downloads/Jason-Joyent-t1000subset.pdf http://scalewithrails.com/downloads/ScaleWithRails-April2006.pdf UltraSPARC T2: Server on a Chip 42 GB/s read, 21 GB/s write 2–8 DIMMs

Dual-channel Dual-channel Dual-channel Dual-channel • 8 SPARC V9 cores @ 1.2–1.4GHz FB-DIMM FB-DIMM FB-DIMM FB-DIMM > 8 threads per core > 2 execution pipelines per core x10 write > 1 instruction/cycle per pipeline x14 read > 1 FPU per core @ 4.0 GT/s Memory Memory Memory Memory > 1 SPU (crypto) per core controller controller controller controller > 4 MB, 16-way, 8-bank L2$ L2$ L2$ L2$ L2$ L2$ L2$ L2$ L2$ BaLn2k$ BanBankk BaLn2k$ BanBank BaLn2k$ BBaannk BaLn2k$ BankBank 4 MB L2$ • 4 FB-DIMM DRAM controllers CrCrossbarr • 2.5 GHz x 8 PCI-Express interface 16 KB I$ 16 KB I$ 16 KB I$ 16 KB I$ 16 KB I$ 16 KB I$ 16 KB I$ 16 KB I$ 8 KB D$ 8 KB D$ 8 KB D$ 8 KB D$ 8 KB D$ 8 KB D$ 8 KB D$ 8 KB D$ • 2 x 10 Gb on-chip Ethernet FPU FPU FPU FPU FPU FPU FPU FPU SPU SPU SPU SPU SPU SPU SPU SPU • Technology: TI 65nm C1 C2 C3 C4 C5 C6 C7 C8 New 8 threads per core 2 2 execution pipes • Die size: 342mm 1 op/cycle per pipe Sys I/F • Power: < 95 W (nominal) buffer switch NIU core PCIe

10 Gb Ethernet SSI, JTAG X8 @ 2.5 GHz Debug port 2 GB/s each direction Sun Processor Roadmap CMT Delivers Performance Gains, Outstanding Energy Efficiency

Web/Network SPARC Enterprise SPARC

Performance “Victoria Falls” Performance Increase 65X Increase (2 sockets) Taped- “” out 128 threads 16X April 2006 16 cores Taped- out 16 January cores “Niagara 2” Taped- 2007 out 35X October 2006

32 threads 8 cores “Niagara 1” “Olympus”/APL 14X 1.5X US IV+ US IIIi 64 threads 1X 1X 8 cores 1 FPU/core

2004 2005 2006 2007 2008 2004 2005 2006 2007 2008 HPC Systems High Parallelism

Proxy Caching Data Warehousing Data Analysis Web Serving Client Server Streaming Media

OLTP Database Security File Server Directory SAP R3 J2EE Application Servers Storage Commercial Batch Network Centric Genomics Centric EAI Servers Structural Analysis Electronic Design Simulation Workgroup Compute Grid Application Development Financial Risk/Portfolio Analysis Monte Carlo Simulation Cheminformatics

No Parallelism Sun’s x64 System Portfolio

Sun Customer Ready HPC Cluster

First True Modular Systems

Sun Blade 6000 8000 & 8000P Modular System Modular Systems First Tier 1 Sun Fire Sun Fire 8-way X4100 M2 Sun Fire Server X4200 M2 X4600 M2 Sun Fire Sun Fire X2100 M2 X2200 M2

Ultra 40 M2 Sun Fire Ultra 20 M2 V40Z Sun Fire X4200 M2 CPU/Memory Dual Socket Data Center Server ● 2 AMD Opteron 2000 Dual-Core CPUs ● 8x DDR2/667 Slots (up to 32 GB) Memory I/O ● 4x PCI-E x8 and 1 x PCI-X ● 4x Gb Ethernet Ports ● 4x SAS II 2.5” disks 10kRPM ● DVD Reliability ● Hot-plug disks and fans ● RAID 0/1 controller on-board The x64 Workhorse Management ● Complete KVMS service processor

OS ● Solaris, Linux or Windows Sun Fire X4600 M2 CPU/Memory Eight Socket Data Center Server ● 8 AMD Opteron 800/8000 Dual-Core CPUs ● 32x DDR/667 Slots (up to 128 GB) Memory I/O ● 6x PCI-E and 2x PCI-X ● 4x Gb Ethernet ports ● 4x SAS II 2.5” disks 10kRPM ● DVD 4 RU Reliability ● Hot-plug disks and fans ● RAID 0/1 controller on-board The Consolidation Management Machine ● Full KVMS service prozessor

OS ● Solaris, Linux oder Windows Sun Fire X4600 Server Internals Full Performance, AMD Opteron Processor Modules (8 total) Dual-Core, Quad-Core Ready CPU Module > > > 6 - PCI-Express Slots Redundant > Hot-plug - 4 – x8-lane Slots Fans > - 2 – x4-lane Slots > > >

2 PCI-X Slots (133MHz/64-bit)

24” depth Innovation: Systems+Storage+Solaris

• Breaking the $2/GB barrier • Integrated server and massive storage array • Up to 24 terabytes of storage in 4RU • Running Solaris with ZFS • 2 GB/s throughput (a.k.a. Thumper) • With general purpose “The World’s First Hybrid Data Server” components • 6 storage solutions in the works Tokyo Institute of Technology:

655 Sun Fire x64 More than 100 TeraFlops Compute Servers 10'480 CPU 21.4 TB Memory, ClearSpeed Advance .. Boards .

10 Gigabit-class Network Equipment InfiniBand Network Voltaire ISR 9288 ×8

200bps 24Gbps (unidirection) External (unidirection) Network Devices .. .x 42 FileServer FileServer C C

1 PB Sun Storage Storage Server B NEC iStorage S1800AT Physical Capacity 96TB RAID6 Petabyte-class Storage Server Sun Blade 6000 Modular System

• 10 RU — 10 blades per chassis NEW! • Up to 320 cores per rack • Up to 64 GB memory • Industry-standard PCI- Express midplane • Hot-pluggable and redundant components • Choice of architectures, operating systems Versatile Blade Portfolio

Sun Blade T6300 Server Module Sun Blade X6220 Server Module Sun Blade X6250 Server Module ● One UltraSPARC T1 ● Two dual-core AMD Opteron processors ● One or two Quad-Core Intel Xeon processor, six or eight cores ● 95 W or 120 W per CPU Processors ● 60 W per CPU ● 2.4, 2.6, 2.8, 3.0 GHz ● 50 W, 80 W, or 120 W per CPU ● 1.0, 1.2, and 1.4 GHz ● 16 DIMMs, 64 GB DDR2-677 MHz ● 1.6, 1.86, 2.33, 2.6 GHz Quad-Core ● Up to 32 GB DDR2 400 MHz ● Four hot-pluggable 2.5-inch SATA or SAS ● 3.0 GHz dual-core ● Four 2.5-inch SATA or SAS disks ● Up to 64 GB FBDIMM 677 MHz disks ● Four 2.5-inch SATA or SAS disks ● Hardware RAID: 0, 1, 5, 6 Independent Industry-standard I/O

• Based on open, Passive industry-standard midplane PCI-Express technology x8 PCIe • Forward-compatible x8 PCIe

with PCI G2 and IOV x8 PC x8 P CIe • Mix I/O types in chassis Ie • Hot-pluggable, easily

accessible Sun Blade Modular • High-bandwidth, System chassis accommodate multiple interconnect fabrics Sun Blades Run Any Application ...

Enterprise/ Business Applications • CRM • ERP • BIDW • Database HPC Internet • Mainstream Infrastructure • Finance Sun Blade • Web 2.0 • Manufacturing • Oil and gas Modular systems • Storage • Life sciences • Service • Government Providers Wait a Minute... High Parallelism

Storage Network Centric Centric

This is the egg-laying, wool and milk pork! (And it looks much prettier, too!)

No Parallelism Sun Customer Ready HPC Cluster Clusters Made Simple – Reduce Deployment Time and Risk with Sun's Expertise

Optimized Optimized Sample Sun Customer Factory-Built Customer Configuration Ready Program and Tested, Ready Configuration to Plug-In

Products, Function (Compute, Data, Access) On-Line Configurator

Workload—HPC ● Crash Simulation ● Structural Analysis ● Others Requirements ● Size Suggested Config. ● Interconnect ● ● S/M/L/XL Power ● Cooling ● Others

Build it Deploy it

Available: Architecture, Implementation, and Migration Services from Sun • Performance Sun Ultra 40 M2 > Up to 2 AMD Opteron 2200 Series High Performance Dual-Core Processors > 32 GB DDR2-667 memory > 8 HDD, SATA or SAS • Graphics & I/O > NVIDIA Quadro Series > 4 PCI Express x16 Slots (2 x16 electrical, 2 x8 electrical) > 1 32-bit/33MHz PCI Slot > 8 USB 2.0 Ports > 2 IEEE1394a Ports > S/PDIF 7.1 audio MCAD, MCAE, HPC- and • Software > Pre-Installed: Solaris 10, Sun Studio, Scientific Applications Studio Creator, Java Studio Enterprise, and NetBeans > Free Grid License Ultra 40 M2 vs. normal PCs

Standard-PC: Sun Ultra 40 M2 Workstation:

Simple PC chassis Sun designed chassis and motherboard: Standard PC cooling Optimized, silent cooling No cable management Energy efficient and eco-friendly Modular power supply unit Plug-in disks Visualization The Solution: Today: A New Model Big datasets overwhelm Networked users get Access to classical PCs scalable resources

Network congestion Application Graphics Graphics Big Data Application Graphics Graphics Application Slow Insecure Application

Application Noisy Hot

Big Data

It's Time for the Visual Grid! Suns Visualization Strategy: Scalable, Networked, a Reservable Resource

Software Graphics Systems Interconnect Storage Integration

Sun Shared Ethernet Visualization Software and Sun Scalable InfiniBand Visualization Software

Ethernet Sun Storage Open source, NVIDIA Quadro FX Sun Fire™ Solutions Sun CRS integratable and Servers InfiniBand NVIDIA Quadro Plex VCS and Sun Ultra™ Storage

Software Storage

Services Systems

SPARC 64 Network.com Microelectronics The Sun StorageTek Portfolio

Services Architectural and Consulting Services Implementation and Learning Services Managed Operations Services Support Services

Software Portfolio Backup and Data File Recovery Services Systems

Enterprise Storage Recovery and Management Disk-Based Virtualization Identity Solutions Archiving Systems Software Solutions Management Disk Portfolio for Fast Data Access From the Datacenter to the Desktop • Datacenter solutions Data Center for open systems and Price mainframe environments ST 9990

• Modular disk solutions ST 9985 that provide Modular

streamlined ST 6540 management, flexibility Specialized ST 6140

and the best TCO ST 2500

• Specialized disk ST 3510 solutions for NEBs compliance Performance, Capabilities, Availability, Scale and rugged military specs Sun StorageTekTM SL8500 Designed to Meet Enterprise Customers Business Requirements

Requirement SL8500 Value Proposition Consolidate • One library shared across multiple environments – including Storage mainframe, UNIX, Linux, and Windows Resources • Any Cartridge Any Slot technology for mixed media and drives • Superior density (slots/sq. foot and drives/slots)

High Availability: • Near zero scheduled and unscheduled downtime 24x7 Operation > Redundant power and robotics with Service on the “Fly” > Dynamic addition of slots, tape drives, and robotics with RealTime Growth capability

Unlimited Growth • Near unlimited scalability to 70,000 slots & 448 drives and Investment • Tape drives and media migration from existing tape libraries Protection • Capacity on demand Ranked #1 Overall in Enterprise Tape Libraries Environmental • 25-50% less Floor Space for libraries of similar slot-count (Source: Diogenes Lab, January 2007) Savings • Consolidate data center to decrease power and cooling Sun StorageTek 5800 Fixed Content Archive The Only Open Source Petabyte-Scale Object Store

• Organize, locate, retrieve hundreds of millions of e-Research data “objects” • OpenSolaris project drives innovation & collaboration > Source code availability protects against lock-in/obsolescence > Storage Beans: community extends, customizes & shares > Run applications on the storage (closer to data) > User-definable metadata & search parameters > Integrated with open repository S/W platforms (Flexible Extensible Digital Object Repository Architecture, Dspace, Eprints) • MTTDL: Exceeds 1 million years • Performance scales with capacity to handle increased query and retrieval workloads • Radically simplified storage administration reduces cost of ownership Putting It All Together...

Software Storage

Services Systems

SPARC 64 Network.com Microelectronics The Sun Constellation System Open Petascale Architecture • The Most Scalable Computing Cluster > 700ns latency (DDR); Up to 1.7 PetaFLOPs; Up to 10 PB > 20% Smaller Footprint than Competition • Open Industry Standards > Solaris, Linux, OpenMPI, Open InfiniBand interfaces and management > X64 Computing Architecture > InfiniBand DDR interconnect • The Highest Density Compute Cluster > Core switch supports 3456 nodes > Custom rack supports 48 server modules > Sun Fire X4500 storage cluster with 480TB per rack • The Easiest to Deploy and Manage > Provides a 6:1 reduction in physical ports and cables > Eliminates 100s of discrete switching elements Sun Constellation System Open Petascale Architecture Eco-Efficient Building Blocks Compute Networking Storage Software

Developer Tools Grid Engine Provisioning

Linux

Comprehensive Ultra-dense Blade Ultra-dense Ultra-dense Software Stack Platform Switch Solution Storage Solution Integrated Developer Fastest Processors: 3456 port InfiniBand Most economical and Tools SPARC, AMD Opteron, Switch scalable parallel file Intel Xeon Integrated Grid Engine Unrivaled cable system building block Infrastructure Highest Compute simplification Up to 48 TB in 4RU Density Provisioning, Monitoring, Most economical Direct Cabling to IB Patching Fastest Host Channel InfiniBand cost/port Switch Adaptor Simplified Inventory Management

Ultra-Dense Blade Platform Delivering the Most Efficient and Eco-Friendly Node Architecture • The first blade platform designed for extreme density and performance > 6 TFLOPS, 768 cores per chassis / 42U ► 50% more compute power than HP C-Class ► 71% more compute power than IBM BladeCenterH ► Unibody design accommodates more server modules than conventional rack/chassis combo while reducing 500 lbs in weight > 4 InfiniBand Leaf Switch Network Express Modules ► Lowest cost per port with Ultra-Dense Switch Solution • Pay as you grow platform ideal for fast growing businesses > Choose among SPARC, AMD Opteron and Intel Xeon CPU technologies • Runs General Purpose Software > Custom compiles and tuning are not required • Realize economies of scale savings in power and cooling Ultra-Dense Switch Solution Delivering the Most Integrated and Open Switching Architecture • Switch Performance > 3456 ports SDR or DDR > Bisection Bandwidth of 110 Tbps > 5 Stage internal full Clos network > 700ns latency (DDR) • Line and Fabric Cards > 24 Line Cards with 144 4X ports each > Physically realized with 48 12X connectors per line card > 18 Fabric Cards with no connectors • Dual-Wide Rack Chassis > Redundant Power and Cooling • Host based Solaris 10 Subnet Manager Sun Fire X4500 Ultra-Dense Storage Solution The Fastest, Densest and Most Scalable Storage Architecture

• The Fastest Object Storage Server > 2 AMD Opteron CPUs and 16 GB Memory > 2 times the memory capacity of HP DL320S > 2 PCI Express 4x slots, driving 1GB/s IO > Runs Solaris ZFS with Raid Z and Raid Z2 • The Densest Lustre Object Storage Server > Up to 48*TB of Storage in 4RU storage server > 1PB in 2 racks • Realize huge economies of scale savings in power and cooling > Integrated system to reduce power and cooling • Simplify the storage fabric with the server network into a single massive InfiniBand switch > 1 Ultra-Dense Switch for both Ultra-Dense Blade Platform and Ultra-Dense Storage Solution

* With 1TB disk drives Sun Constellation System Storage Solutions Compute Engine Long-Term Data Cache Retention & Archive Tier 2 Fixed Content Archive Tier 2 Near-Line Archive

Super Computer IB Network Load Archive SAN Automated Migration

Tier 1 Archive & Data Movers Home Directories Scalable Storage Cluster TACC: The First Sun Constellation System Implementation Sun Partners with the HPC Community to Deliver Super Computer

• The World's Largest Super Computer Under Construction – 500 TeraFLOPS > 82 Sun Ultra-Dense Blade Platforms > 2 Sun Ultra-Dense Switches > 72 Sun X4500 (“Thumper”) Servers • TACC selected for first NSF ‘Track2’ HPC system > Sun is the sole HW supplier > Opteron based > Expandable configuration > Solaris Fabric Management Project Blackbox: The Grid “To Go”

• Rapid, Easy Deployment: “Build once, deploy anywhere” • Very High-density Computing: Capacity for over 500 CPUs, 2000 cores, or 8000 compute threads! • Versatility and Flexibility: Computer “What you want, where you + Storage want it, when you want it”. + Network + Power • Breakthrough Economics and + Cooling Eco-responsibility: scalability, + Software standard components, = Project Blackbox efficiency, cooling innovation Thank You!

Constantin Gonzalez [email protected] blogs.sun.com/constantin Sun Shared Visualization Software Der sichere Zugang zu 3D Applikationen auf zentralen Ressourcen ● Transparenter und sicherer Zugriff von verschiedenen Clients ● Effizientere Nutzung vorhandener Grafik-Ressourcen bei hoher Performance ● Einfache Administration und Reservierung über Sun Grid Engine ● Open Source (www.virtualgl.org)

Multiple CPUs

Multiple applications

Large shared memory

Large, centralized graphics rendering capability

Many high- performance graphics cards

Sun Fire Server Sun Scalable Visualization Software Skalierbare Grafik-Infrastruktur aus einer Hand ● Mehr Performance, Qualität und Flexibilität ● Vor-integriert, individualisiert und getestet ● Basierend auf Open Source Komponenten (Chromium) ● Kombinierbar mit Sun Shared Vis. Software

Sun Fire Servers Sun Constellation System HPC Open Software Stack Open Petascale Computing from Sun

s

e Sun Studio 12 c i Developer Tools Free

v Developer Tools r

e Sun HPC ClusterTools S

l a n o i Management Sun Grid Engine Software s

s Workload Management Open, Free e f

o Cluster Management Sun Connection, ROCKS, Ganglia r P

, l a

r Open Storage Server u t

c ZFS, S-QFS Open, Free e t i

h Lustre, p-NFS c r A

, t r

o Sun Constellation System Open p 64 Bit p Ultra-Dense Blade Platform u S

, S R C Sun Constellation System IB Switch NEM for Blades n Open u Ultra-Dense Switch Solution S 3456 Port Non-Blocking Switch