Dr Mandie Quartly, OpenPOWER Global Alliances, Europe, IBM (@mandieq)

[Figure: ∞ performance, 0 cost]

What would you do?

© 2016 OpenPOWER Foundation

The only constant is change...

[Figure: IT efficiency (vertical axis) against time (horizontal axis, marked 1970’s, ~2004, ~2015, >>2025). Eras: clock era, cloud era, WOS era, quantum era. Curve labels: single thread scaling; multi-core, multi-thread; workload optimised systems; quantum, DNA, neural or ??? computing.]

Demands and consumption of technology are changing too...

Introducing the OpenPOWER Foundation…

5 founding members in 2013

What's OpenPOWER all about?

• Vibrant ecosystem through open development
• Accelerated innovation through collaboration of partners
• Driving industry performance leadership

OpenPOWER is an open development community using the POWER Architecture

This is what a revolution looks like

The power of open…

Speed to innovation | Standards | Choice

Open allows you to create what you need

What's the best tool for the job?

What's so special about POWER anyway?

POWER8 Processor – the low down...

POWER8

[Diagram: POWER8 processor, memory buffer, DRAM chips]

• Up to 12 cores, up to 96 threads
• L1, L2, L3 + L4 caches
• Up to 1 TB per socket
• Up to 230 GB/s sustained memory bandwidth

POWER Processor Roadmap

• Focus on Enterprise (technology and performance driven): POWER6 Architecture, POWER7 Architecture
• Focus on Scale-Out and Enterprise (cost and acceleration driven): POWER8 Architecture, POWER9 Architecture
• Future: Partner Chip POWER8/9, POWER10

Year    Chip               Cores       Process
2007    POWER6             2 cores     65nm
2008    POWER6+            2 cores     65nm+
2010    POWER7             8 cores     45nm
2012    POWER7+            8 cores     32nm
2014    POWER8             12 cores    22nm
2016    POWER8 w/ NVLink   12 cores    22nm
2017    P9 SO              24 cores    14nm
TBD     P9 SU              TBD cores   14nm

Later timeline entries: 2018 - 20 and 2020+ (Partner Chip POWER8/9, POWER10), 10nm - 7nm.

[Roadmap annotations: each chip is marked as a new, enhanced or existing micro-architecture and as a new process or foundry technology. Other labels: with NVLink, direct attach memory, buffered memory.]
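As a quick check on the POWER8 figures quoted earlier (up to 12 cores, up to 96 threads), the thread count can be derived from cores multiplied by hardware threads per core. The sketch below assumes eight-way simultaneous multithreading (SMT8) per core; the function and constant names are illustrative only.

```python
def max_threads_per_socket(cores: int, smt_ways: int) -> int:
    """Return hardware threads per socket as cores x SMT ways per core."""
    if cores < 1 or smt_ways < 1:
        raise ValueError("cores and smt_ways must be positive")
    return cores * smt_ways


# Figures taken from the slide: up to 12 cores per POWER8 socket.
POWER8_MAX_CORES = 12
# Assumption for this sketch: the cores run in SMT8 mode.
POWER8_SMT_WAYS = 8

if __name__ == "__main__":
    threads = max_threads_per_socket(POWER8_MAX_CORES, POWER8_SMT_WAYS)
    print(f"POWER8: {POWER8_MAX_CORES} cores x SMT{POWER8_SMT_WAYS} = {threads} threads")
```

Running the cores in a lower SMT mode (for example `smt_ways=4`) reduces the thread count proportionally.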

NVLink Technology

[Figure: two GPU attachment diagrams.
Current GPU attach: POWER8 with system memory, connected over PCIe x16 at 16+16 GB/s to an NVIDIA Tesla GPU with graphics memory.
Future NVLink GPU attachment: POWER8 with NVLink and system memory, connected to NVIDIA Tesla GPUs with NVLink and graphics memory over links labelled 40+40 GB/s, 80 GB/s peak*.]
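To put the link figures on this slide in perspective, here is a rough sketch comparing ideal one-direction transfer times at 16 GB/s (PCIe x16) and 40 GB/s (NVLink). It assumes the peak numbers from the slide and ignores latency and protocol overhead; the payload size and all names are hypothetical.

```python
def transfer_seconds(size_gb: float, bandwidth_gb_per_s: float) -> float:
    """Ideal time to move size_gb over a link of the given bandwidth."""
    if bandwidth_gb_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    return size_gb / bandwidth_gb_per_s


# One-direction peak figures read from the slide (16+16 and 40+40 GB/s).
PCIE_X16_GB_PER_S = 16.0
NVLINK_GB_PER_S = 40.0

if __name__ == "__main__":
    payload_gb = 8.0  # hypothetical data set size, not from the slide
    pcie = transfer_seconds(payload_gb, PCIE_X16_GB_PER_S)
    nvlink = transfer_seconds(payload_gb, NVLINK_GB_PER_S)
    print(f"PCIe x16: {pcie:.2f} s, NVLink: {nvlink:.2f} s")
    print(f"Ideal speed-up: {NVLINK_GB_PER_S / PCIE_X16_GB_PER_S:.1f}x")
```

Real transfers will be slower than these ideal figures, but the ratio shows why the attachment bandwidth matters for GPU-heavy workloads.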

Coherent Accelerator Processor Interface (CAPI)

[Diagram: FPGA attached over PCIe to the CAPP on the POWER8 processor]

Typical I/O model flow:
DD call → copy or pin source data → MMIO notify accelerator → acceleration → poll / int completion → copy or unpin result data → ret. from DD completion

Flow with a coherent model:
Shared mem. notify accelerator → acceleration → shared memory completion
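The two flows on the CAPI slide can be written down as ordered step lists to make the difference in software overhead visible. This is only an illustrative model of the slide's labels, not CAPI code; the step names and the `describe` helper are hypothetical.

```python
# Steps of the typical I/O model, in the order read from the slide labels.
TYPICAL_IO_FLOW = [
    "device driver call",
    "copy or pin source data",
    "MMIO notify accelerator",
    "acceleration",
    "poll / interrupt completion",
    "copy or unpin result data",
    "return from device driver completion",
]

# Steps of the flow with a coherent model.
COHERENT_FLOW = [
    "shared memory notify accelerator",
    "acceleration",
    "shared memory completion",
]


def describe(name: str, steps: list[str]) -> str:
    """Format a flow as a single line with its step count."""
    return f"{name} ({len(steps)} steps): " + " -> ".join(steps)


if __name__ == "__main__":
    print(describe("Typical I/O model", TYPICAL_IO_FLOW))
    print(describe("Coherent model", COHERENT_FLOW))
    saved = len(TYPICAL_IO_FLOW) - len(COHERENT_FLOW)
    print(f"Steps removed by the coherent model: {saved}")
```

In this reading, the coherent model drops the driver call, the copy or pin steps and the polling, leaving only notification, acceleration and completion through shared memory.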

So what's new?

OpenPOWER Open Innovation

• Vision: Opening the door, a license to innovate (Incorporated December 2013)
• Execution: Game changing solutions (OpenPOWER Summit March 2015)
• Adoption: From proof of concept to industry revolution (OpenPOWER Summit April 2016)

“Bringing it all together”

Servers

Building blocks

Processors

Zaius

• 1.25 OU
• 2 POWER9 CPUs
• 32 DDR4 DIMM slots
• 2x G4 PCIe x16 FHFL slots
• 1x G4 x16 HHHL slot
• 1x G4 x16 OCP mez
• 1x M.2 SATA port
• 1x SATA port
• 15x 2.5” SAS/SATA/NVMe slots
• BMC w/ GbE LOM
• “Diskless” option

Cross community engagement drives mutual progress

In memory databases using CAPI attached flash

• 24:1 server consolidation
• 3x lower cost per user

[Figure: before and after deployment.
Before: NoSQL in memory (x86). WWW, 10Gb uplink, load balancer, 500GB cache nodes, backup nodes, 24U.
After: NoSQL on POWER8 + CAPI flash. WWW, 10Gb uplink, POWER8 server, flash array w/ up to 40TB, 4U.]

Less is More: 24:1 physical server consolidation = 6x less rack space
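The rack-space claim follows directly from the figure labels. The sketch below assumes the 24U and 4U labels are the rack heights of the before and after setups, and that 24:1 means 24 servers replaced by one; the function name is illustrative.

```python
def reduction_factor(before: float, after: float) -> float:
    """Return how many times smaller 'after' is than 'before'."""
    if after <= 0:
        raise ValueError("after must be positive")
    return before / after


if __name__ == "__main__":
    # Rack heights read from the figure: 24U before, 4U after.
    rack_space = reduction_factor(24, 4)
    # Server counts from the 24:1 consolidation claim.
    servers = reduction_factor(24, 1)
    print(f"Rack space: {rack_space:.0f}x less")
    print(f"Physical servers: {servers:.0f}:1 consolidation")
```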

http://developers.openpowerfoundation.org/

Two tracks to challenge and win:

1. The Open Road Test
– Port and optimize for OpenPOWER
– Go faster with accelerators (optional)

2. The Spark Rally
– Train an accelerated DNN and recognize objects with greater accuracy
– Show you can scale with Spark

Key Dates
Register today: openpower.devpost.com
Sun May 1st: Submission period opens
Tue Aug 2nd: Submission period closes

Grand prizes include a trip to Supercomputing 2016
Other prizes include iPads, Apple Watches

[Figure: ∞ performance, 0 cost]

Back to the original question...

“I couldn't care less about the platform”

“I want to change the specifics of […]”

A spectrum of “caring”...

In summary – so much going on...

• 200+ members
• 60+ technologies revealed
• 24 countries
• 6 continents
• 100s of innovations under way

Main website → openpowerfoundation.org
Overview → .biz/openpower_overview

Join the rebellion!

Reference material

Engage in the OpenPOWER community

[Table: interests, action and outcome for three groups: Technology and Software Innovators; Innovators, Integrators, and Partners; End Users]

Interests and actions listed in the table:
• Strategic imperatives and workload demands for performance and cost optimization.
• Require open software and systems with choice and flexibility.
• Discuss areas of collaboration and synergy in OpenPOWER.
• Sign up for membership and join a work group.
• Build technical and business relationships.
• Deep understanding of workload demands and consumption preferences of end users.
• Team with innovators and end users to drive requirements, engage in specific projects.
• Engage directly on system design options.

Outcome: develop, deliver and deploy collaborative innovations with compelling value.

Accelerates Infrastructure Standards

[Timeline, 2014 to 2016: charters and deliverables of the 11 Work Groups]

Work group rows on the timeline: Personalised Medicine, Integrated Solutions, System SW, 25g IO Compatibility, OpenPOWER I/O, Accelerator, Memory, FSI Specification, Developer Platform, HW Architecture, Compliance

Deliverables shown: CAPI SDK, 64b ABI, Platform Ref, 25g IO Spec, Sys I/O Enablement Guide, CAPI AFU Intf Spec V1 and V2, OpenCL SDK, OPMB Intf. Spec V1, FSI Spec, P8 SP010 Reference, P8 2U2S Reference, P8+ 1U1S and P8+ 2U2S Reference, OpenPOWER ISA Profile V1 and V2, IO Device Architecture V2 and V3, Coherent Accel Intf Arch, Compliance Specification (draft, review, WG spec, STD)

Abbreviations:
SP010 – Tyan OpenPOWER Customer Reference System
AFU – Accelerator Function Unit
OPMB – OpenPOWER Memory Bus
SDK – Software Developer Kit
CAPI – Coherent Accelerator Processor Interface
FSI – Field Replaceable Unit (FRU) Service Interface
ABI – Application Binary Interface

Membership Options

Anyone may participate in OpenPOWER. Membership levels are designed for those that are investing to grow and enhance the OpenPOWER community and its proliferation within the industry.

• The OpenPOWER Foundation is a not-for-profit entity with a Board of Directors and a Technical Steering Committee.
  – Membership levels provide either a default Board of Director position (Platinum) or an opportunity to be elected to the Board (Gold, Silver, and Assoc/Academic members). The Bylaws include additional governance detail.
  – Technical Steering Committee is formed from Work group Leads and Platinum members.
• Membership options include Platinum, Gold, Silver, and Associate / Academic memberships.
  – Annual fee and dedicated full-time equivalents (FTEs); verification of FTEs on honor system
  – Contributors, committers, Work group leads and project leads influence Technical Steering Committee
  – Associate / Academic level is not available to corporations

Membership levels (annual fee in $ USD):

Platinum
  Annual fee: $100k; FTEs: 10
  Technical Steering Committee: One seat per member not otherwise represented
  Board / voting position: Includes board position; includes TSC position

Gold
  Annual fee: $60k; FTEs: 3
  Technical Steering Committee: May be on TSC if Work group lead
  Board / voting position: Gold members may elect one board representative per three gold members

Silver
  Annual fee: $20k ($5k if <300 employees); FTEs: 0
  Technical Steering Committee: May be on TSC if Work group lead
  Board / voting position: Silver members may elect one board representative for all silver members

Associate & Academic
  Annual fee: $0; FTEs: 0
  Technical Steering Committee: May be on TSC if Work group lead
  Board / voting position: May be elected to one community observer, non-voting Board seat

Membership agreement, Bylaws, and IP Rights Policy available for review: www.openpowerfoundation.org

Introducing OpenPOWER Ready

● Products / solutions may display badge

● Participants request to use it

● Compatibility

● Not Certification or Compliance

● OpenPOWER Ready™ List

http://openpowerfoundation.org/technical/openpower-ready/

Joint roadmaps looking forward...

                        2015                 2016                     2017
Mellanox Interconnect   Connect-IB           ConnectX-4               ConnectX-5
                        FDR Infiniband       EDR Infiniband           Next-Gen Infiniband
                        PCIe Gen3            CAPI over PCIe Gen3      Enhanced CAPI over PCIe Gen4
NVIDIA GPUs             Kepler               Pascal                   Volta
                        PCIe Gen3            NVLink                   Enhanced NVLink
IBM CPUs                POWER8               POWER8 with NVLink       POWER9
                        CAPI Interface                                Enhanced CAPI & NVLink

[Marker on the Systems timeline: You are here]

POWER8 => built for performance

                      Sandy Bridge EP  Ivy Bridge EP    Ivy Bridge EX    Haswell EP       POWER 7+         POWER8
                      E5-x6xx          E5-26xx v2       E7-88xx v2       E5-26xx v3       Systems
Clock rates (GHz)     1.8–3.6          1.7-3.7          1.9-3.4          1.6-3.5          3.1–4.4          3.0-4.1
SMT options           1, 2*            1, 2*            1, 2*            1, 2*            1, 2, 4          1, 2, 4, 8
Max Threads / sock    16               24               30               36               32               96
Max L1 Cache          32KB             32KB*            32KB*            64KB             32KB             64KB
Max L2 Cache          256KB            256KB            256KB            256KB            256KB            512KB
Max L3 Cache          20 MB            30 MB            37.5 MB          45 MB            80 MB            96 MB
Max L4 Cache          0                0                0                0                0                128 MB
Memory Bandwidth      31.4-51.2 GB/s   42.6-59.7 GB/s   68-85** GB/s     51-68 GB/s       100-180 GB/s     115-410 GB/s

1 = The Micro-architecture features above contributed to premium performance for the POWER8 systems compared to Ivy Bridge systems
* Intel calls this Hyper-Threading Technology (No HT and with HT)
* 32KB running in “Non-RAS mode”; 16KB results in better RAS
** 85GB running in “Non-RAS mode” and dual-device error NOT supported

© 2016 IBM Corporation

Linux support for POWER

• Same source and distribution release schedules as x86
• Simplified x86 application migration with little endian distributions
• Enterprise support for all three from IBM or distributors

Accelerated software innovation

• Over 2,300 Linux ISVs developing on Power
• 50 IBM Innovation Centers
• Compelling PoCs
• Support for little endian applications

Workload areas: Big Data & Machine Learning, HPC, Cloud, Mobile, Enterprise

Applications shown: CHARMM, miniDFT, GROMACS, CTH, NAMD, BLAST, AMBER, Bowtie, RTM, BWA, GAMESS, FASTA, WRF, HMMER, HYCOM, GATK, HOMME, SOAP3, LES, STAC-A2, MiniGhost, SHOC, AMG2013, Graph500, OpenFOAM, Ilog

Major Linux Distros

Lots of resources & support for Linux ISVs and Developers

IBM PartnerWorld Technical Support
• IBM Innovation Centers
• Free access to Power Hardware
• Free porting assistance
• Free -based development environment
www.ibm.com/partnerworld/wps/servlet/ContentHandler/pw_com_pwp_partnerworld-program

IBM DeveloperWorks
Technical resources, community, blogs, toolkits, How to articles, beta code
www.ibm.com/developerworks/linux/

Regional Ecosystem Initiative – Recruiting Key Solutions
Greater China, North America, Europe
Middleware and Industry Solutions

IBM Migration Factory
Premier migration services for large applications
www.ibm.com/systems/power/migratetoibm/index.html

IBM Innovation Centers
• All 50+ centers worldwide now support Linux on Power
• One-stop for ISVs, developers
• HW access, technical support, demos, toolkits, Hands-on labs
www.ibm.com/systems/power/software/linux/centers

IBM Watson Developer’s Cloud
Access to IBM Watson for developing cognitive computing applications
www.ibm.com/smarterplanet/us/en/ibmwatson/developers.html

IBM Power Development Cloud
Free On-demand cloud-based development platform
www.ibm.com/partnerworld/wps/servlet/ContentHandler/stg_com_sys_power-development-platform

Site Ox
Provide free access to Power hardware to ISVs for Porting using Linux on POWER8 for 2 weeks
www.siteox.com

Coming soon: Barreleye

In partnership with Avago, IBM, Mellanox, PMC & Samsung

OpenPOWER systems coming to market

Motherboard | Chassis | Rack

Presentation from recent OpenPOWER Summit

Press release

Cross community collaboration is essential