Revolutionizing Data-Centric Transformation
April 2016
Sumit Gupta, Vice President, High Performance Computing and Analytics, IBM Power Systems
© 2016 OpenPOWER Foundation
OpenPOWER: Catalyst for Open Innovation
Market Shifts
• Processors not keeping up with workload performance needs
• Numerous IT consumption models
• Fast-growing open software ecosystem
OpenPOWER Strategy
• Vibrant ecosystem through open, collaborative development
• Accelerated innovation through collaboration of partners
• Accelerators for driving industry performance leadership

• Cloud Computing: Hyperscale & Large-scale Datacenters
• High Performance Computing & Analytics
• Domestic IT Agendas
OpenPOWER Development Community
200+ OpenPOWER Members
Fast Progress Leading to 200+ Members
• December 2013 (5 members): Incorporated OpenPOWER Foundation and elected officers
• April 2014 (26 members): Public launch in San Francisco and Beijing, with 6 demos and first hardware from Google and Tyan
• July 2014 (44 members): IBM, with Google, released 420,000 lines of firmware code
• October 2014 (70 members): Offerings from Canonical, IBM, NVIDIA, OVH, Redis Labs, and Tyan; China government endorsed OpenPOWER with formation of China Power Tech Alliance
• November 2014 (77 members): DoE chose OpenPOWER design for CORAL supercomputers; Nallatech CAPI developer kit released
• December 2014 (84 members): Rackspace declares Open Compute OpenPOWER strategy; 7 work groups chartered
• March 2015 (112 members): First OpenPOWER Summit, with 50+ presentations and solution demos, 15 hardware reveals, and 100+ projects; SoftLayer announces first OpenPOWER bare metal as a service offering
• May 2015 (130 members): Chartered nine work groups; member solutions unveiled at 6k attendee IT event
• June 2015 (136 members): £313m UK collaboration with STFC; 630+ attendees at China OpenPOWER Summit
• September 2015 (153 members): GENCI HPC investment; chartered 11 workgroups; 1600 ISVs developing Linux on POWER
• November 2015 (170+ members): 46 member solutions at SC15, with numerous accelerated computing announcements
New Chips & Components
• DMI connection between an Altera Stratix V FPGA accelerator and a POWER8 CPU
• Convey’s CAPI developer kit based on the company’s Xilinx-based co-processors
• First China “local” POWER derivative chip, CP1
Components & Systems
• First commercially available OpenPOWER third-party server
• New CAPI-based solution: the ConnectX-4 adapter card by Mellanox
• Nallatech’s OpenPOWER CAPI Developer Kit
New Systems & Platforms
• First Open server specification and motherboard combining OpenPOWER, Open Compute and OpenStack (mock-up)
• First GPU-accelerated OpenPOWER developer platform
• Prototype of a new high-performance server on the path to exascale
• ChuangHe China-branded OpenPOWER system with POWER8
• RedPower, the first China OpenPOWER 2-socket system coming in 2015
• First commercially available OpenPOWER server
• Inspur 2-socket POWER8 Server
Bringing It All Together
• Data Engine for NoSQL with 40TB CAPI-attached flash
• Clustering Open Source
• 192 Vcores, 40TB in 2U
• Redis + CAPI: 24:1 server consolidation for 3x lower cost per user
11 Workgroups Evolving Infrastructure Standards
11 Work Groups, 2014 to 2016 (chart of charters and deliverables per work group; status legend: Draft, Review, WG Spec, STD)
• Personalized Medicine: Charter; Pers Med Integrated Solutions
• Integrated Solutions: Charter
• System SW: Charter; 64b ABI; CAPI Linux SDK; Platform Ref
• 25g IO Compatibility: Charter; 25g IO Spec
• OpenPOWER I/O: Charter; Sys I/O Enablement Guide
• Accelerator: Charter; CAPI AFU Intf Spec V1; CAPI AFU Intf Spec V2; OpenCL SDK
• Memory: Charter; OPMB Intf. Spec V1
• FSI Specification: Charter; FSI Spec
• Developer Platform: Charter; P8 SP010 Data; P8 2U2S Reference; P8+ 1U1S Reference; P8+ 2U2S Reference
• HW Architecture: Charter; OpenPOWER ISA Profile V1; OpenPOWER ISA Profile V2; IO Device Architecture V2; IO Device Architecture V3; Coherent Accel Intf Arch
• Compliance: Charter; Compliance Specification; Comp
SP010 – Tyan OpenPOWER Customer Reference System
AFU – Accelerator Function Unit
OPMB – OpenPOWER Memory Bus
SDK – Software Developer Kit
CAPI – Coherent Accelerator Processor Interface
FSI – Field Replaceable Unit (FRU) Service Interface
ABI – Application Binary Interface
OpenPOWER Foundation reach
200+ members 50+ Innovations
24 countries 6 continents
OpenPOWER Open Software and University Cloud Environments
• OpenPOWER Platforms
• Open Stack Software
• University research
• Open Development & Ecosystem Support

Locations:
• Oregon State, North America: http://osuosl.org/services/powerdev
• UNICAMP, Brazil, SA: http://openpower.ic.unicamp.br/minicloud/index.html
• Brno University / RedHat, Czech Republic: https://fit-rhlab.rhcloud.com
• SuperVessel, USA and China: www.ptopenlab.com
• IIT Bombay, India
• HPC Center, University of Texas - TACC
OpenPOWER Developer Ecosystem Growing Fast
Engage in the OpenPOWER community

Technology and Software Innovators
• INTEREST: Discuss areas of collaboration and synergy in OpenPOWER.
• ACTION: Sign up for membership and join a work group. Build technical and business relationships.
• OUTCOME: Develop collaborative innovations with compelling value.

Innovators, Integrators, and Partners
• INTEREST: Deep understanding of workload demands and consumption preferences of end users.
• ACTION: Team with innovators and end users to drive requirements, engage in specific projects.
• OUTCOME: Deliver collaborative innovations with compelling value.

End Users
• INTEREST: Strategic imperatives and workload demands for performance and cost optimization.
• ACTION: Require open software and systems with choice and flexibility. Engage directly on system design options.
• OUTCOME: Deploy collaborative innovations with compelling value.
OpenPOWER For Data Analytics and High Performance Computing
POWER8: Processor Performance Leadership
• Faster Cores
• 8 Threads Per Core
• Larger Caches
• Direct Accelerator Interconnect
• 3x Higher Memory Bandwidth, 1 TB Memory per Socket

Diagram: POWER8 (12 Cores, 96 Threads, 4 Level Large Caches) connected through Memory Buffer to DRAM Chips (up to 1 TB per socket, up to 230 GB/s sustained)
2.5x Faster CPU-GPU Connection via NVLink
• PCIe: GPU (graphics memory) connected to CPU (system memory) at 16 GB/s; PCIe is the system bottleneck. GPUs limited by PCIe bandwidth from CPU-system memory.
• NVLink: GPU (graphics memory) connected to POWER8 (system memory) at 40 GB/s. NVLink enables fast unified memory access between CPU & GPU memories.
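The 2.5x figure follows directly from the two link bandwidths quoted on the slide (40 GB/s over NVLink versus 16 GB/s over PCIe). The short sketch below works through that arithmetic and an idealized transfer time. The 80 GB working-set size and the `transfer_seconds` helper are hypothetical, chosen only for illustration, and the model ignores latency and protocol overhead.

```python
# Link bandwidths as quoted on the slide, in GB/s.
PCIE_GBPS = 16.0
NVLINK_GBPS = 40.0


def transfer_seconds(size_gb: float, bandwidth_gbps: float) -> float:
    """Idealized time to move size_gb gigabytes over a link.

    Ignores latency and protocol overhead, so real transfers take longer.
    """
    return size_gb / bandwidth_gbps


# Ratio of the two quoted bandwidths.
speedup = NVLINK_GBPS / PCIE_GBPS

# Hypothetical working set, not a figure from the slides.
size_gb = 80.0
pcie_time = transfer_seconds(size_gb, PCIE_GBPS)
nvlink_time = transfer_seconds(size_gb, NVLINK_GBPS)

print(f"NVLink vs PCIe bandwidth ratio: {speedup}x")
print(f"{size_gb} GB over PCIe:   {pcie_time} s")
print(f"{size_gb} GB over NVLink: {nvlink_time} s")
```

Under these assumptions the ratio holds for any transfer size, since both times scale linearly with the amount of data moved.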
US & UK Research Establishments Select OpenPOWER-Based Supercomputers
IBM, Mellanox, and NVIDIA awarded $325M U.S. Department of Energy’s CORAL Supercomputers
CORAL: Leadership Class Supercomputers, 5X – 10X higher app perf than current systems
IBM & UK’s STFC Partner for Big Data & Cognitive Computing Research in £313M Partnership
3x Lower Cost for NoSQL Databases using CAPI-Attached Flash
Before: NoSQL In-Memory (x86)
• Load balancer in front of 500GB cache nodes and backup nodes
• 24U of server nodes
After: NoSQL POWER8 + CAPI Flash
• POWER8 server with CAPI device and flash array w/ up to 40TB
• 4U
• Flash acts as extension of system memory
Application Benchmarks
Compelling Solution Performance on Power
Data Engine for NoSQL
• POWER8 + CAPI FLASH as RAM
• 24:1 server consolidation with Redis Labs
• Up to 3x lower TCA
Turbo LAMP on Linux on Power
• Next generation integrated LAMP stack
• Accelerated Dev/Ops for Mobile Apps
• 53% less $$ per user / hour with Magento running on Turbo LAMP
MariaDB on Linux on Power
• Up to 2.2x better per core performance
• Up to 1.9x better per system performance
SAP HANA on Linux on Power
• Designed for mission critical 7X24 Enterprise customer operations
• Outstanding throughput per core
• Virtualization out of the box
GPUdb in-memory Accelerated Database
Ultrafast ingest and analysis of billions of objects
• In-memory distributed database using GPUs for processing

• Order of magnitude gains:
  – Performance over CPU solutions
  – Power reduction
  – Cost savings
• Anticipate order of magnitude gains with POWER based NVLink solutions in 2016
Over 2350 Linux ISVs on Power
100,000+ Open-Source Packages
Segments: Big Data & Machine Learning, HPC, Cloud, Mobile, Enterprise
Applications and benchmarks shown: CHARMM, miniDFT, GROMACS, CTH, NAMD, BLAST, AMBER, Bowtie, RTM, BWA, GAMESS, FASTA, WRF, HMMER, HYCOM, GATK, HOMME, SOAP3, LES, STAC-A2, MiniGhost, SHOC, AMG2013, Graph500, OpenFOAM, Ilog
Major Linux Distros
Superior Cloud Economics
SoftLayer Bare Metal POWER cloud
Rackspace declares Open Compute OpenPOWER system
Docker, OpenStack HEAT, and Chef Server for Power
Bluemix Cloud Integration and SQLDB Services with Power Systems for SOE/SOR integration
Power Cloud environments in Europe
IBM Watson Developer’s Cloud for cognitive computing www.ibm.com/smarterplanet/us/en/ibmwatson/developers.html
New IBM “LC” Power Systems for Linux
822LC HPC: Built for HPC and Deep Learning
• 2x POWER8 CPU, 10 cores each, 2.9-3.3 GHz
• Up to 1 TeraByte Memory
• 230 GB/s Memory Bandwidth
• 2x NVIDIA Tesla K80 GPUs
822LC: Optimized for Databases and Cloud Workloads
• 2x POWER8 CPU, 10 cores each, 2.9-3.3 GHz
• Up to 1 TeraByte Memory
• 230 GB/s Memory Bandwidth
• 2 Drives (2 TB, HDD, SSD)
812LC: Optimized for Hadoop, Spark, In-Memory Analytics
• 1x POWER8 CPU, 10 cores, 2.9-3.3 GHz
• Up to 1 TeraByte Memory
• 115 GB/s Memory Bandwidth
• 14 Drives (84TB, HDD, SSD)
Leading Technology roadmap
2015
• Mellanox Interconnect: Connect-IB, FDR Infiniband, PCIe Gen3
• NVIDIA GPUs: Kepler, PCIe Gen3
• IBM CPUs: POWER8, OpenPower CAPI Interface
2016
• Mellanox Interconnect: ConnectX-4, EDR Infiniband, CAPI over PCIe Gen3
• NVIDIA GPUs: Pascal, NVLink
• IBM CPUs: POWER8 with NVLink
2017
• Mellanox Interconnect: ConnectX-5, Next-Gen Infiniband, Enhanced CAPI over PCIe Gen4
• NVIDIA GPUs: Volta, Enhanced NVLink
• IBM CPUs: POWER9, Enhanced CAPI & NVLink
IBM Systems
Portfolio of HPC & HPDA Offerings
Processors & Systems
• High Performance Processors & Systems
• Accelerator, networking, storage integration via CAPI & NVLink
• Innovative solutions like CAPI Flash

Cluster & Development Software
• Platform LSF & Symphony workflow and resource management
• Compilers: gcc, IBM XLC, PGI Fortran/C/C++, Java, OpenACC, OpenMP
• Debuggers, Profilers, Math libraries, MPI & HPC apps

High Performance File System & Storage
• Highest Performance HPC Storage: Elastic Storage Server
• High Performance Spectrum Scale (GPFS) Parallel File System
• Flash Storage
Two tracks to challenge and win:
1. The Open Road Test
– Port and optimize for OpenPOWER
– Go faster with accelerators (optional)
2. The Spark Rally
– Train accelerated DNN and recognize objects with greater accuracy
– Show you can scale with Spark

Key Dates
Register at openpower.devpost.com
Sun May 1st: Submission period opens
Tue Aug 2nd: Submission period closes
Grand prizes include a trip to Supercomputing 2016
Other prizes include iPads, Apple Watches
Join the conversation at #OpenPOWERSummit
Thank you.