Oracle SPARC Systems
Michael Coleman Chief Technologist, Systems, Asia Pacific Division
SKA Colloquium, Auckland New Zealand February 2017
Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | Safe Harbor Statement The following is intended to outline our general product direc on. It is intended for informa on purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or func onality, and should not be relied upon in making purchasing decisions. The development, release, and ming of any features or func onality described for Oracle’s products remains at the sole discre on of Oracle.
Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | Oracle Confiden al – Internal 2 Agenda
1 SPARC Roadmap and Overview
2 So ware in Silicon
3 Performance
4 Analy cs
5 SuperCluster
Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 3
Consistent Execu on • Silicon Secured Memory SPARC @ Oracle Including • Encryp on Accelera on So ware in Silicon } • Data Analy cs Accelerators 7 Processors in 6 Years • More….
2010 2011 2013 2013 2013 2015 2016
SPARC T3 SPARC T4 SPARC T5 SPARC M5 SPARC M6 SPARC M7 SPARC S7
16 x 2nd Gen cores 8 x 3rd Gen Cores 16 x 3rd Gen Cores 6 x 3rd Gen 12 x 3rd Gen Cores 32 x 4th Gen Cores 8 x 4th Gen Cores 6MB L2 Cache 4MB L3 Cache 8MB L3 Cache Cores 48MB L3 Cache 64MB L3 Cache 16MB L3 Cache 1.65 GHz 3.0 GHz 3.6 GHz 48MB L3 Cache 3.6 GHz 4.13 GHz 4.27 GHz 3.6 GHz
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | So ware in Silicon
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | So ware in Silicon: Next Step in Hardware Evolu on Security. Efficiency. Simplicity.
Mid-2000’s Mul -core, Mul -threaded Mid-1990’s Compu ng Larger Memory Support, Greater Accuracy
Today So ware in Silicon: Virtualiza on So ware Func ons on Chip Open Systems
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 7 SPARC M7: So ware In Silicon Features
Security in Silicon: Silicon Secured Memory Cryptography Accelera on
SQL in Silicon: Database In Memory Accelerator Engines
Capacity in Silicon: Decompression Engines
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 8 Silicon Secured Memory: Always-On Intrusion Protec on Breakthrough security and reliability for applica ons • Unique hardware-based memory protec on GO • Stops malicious programs from accessing other applica on memory. Ex: HeartBleed, Venom GO • Can be always on: hardware approach has negligible performance impact • Easily ac vated for exis ng applica ons • Extremely efficient for so ware Memory Memory development Pointers
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 9 Silicon Secured Memory: Always-On Intrusion Protec on Breakthrough security and reliability in hardware
• Silicon Secured Memory implements Pointer “B” fine grained memory protec on in GO hardware – Hidden “color” bits added to pointers (key), and content (lock) • Pointer color (key) must match content Pointer “R” color or program is aborted GO – Set on memory alloca on, changed on memory free • Helps prevent access off end of structure, stale pointer access, Pointer “Y” malicious a acks, etc. plus improves developer produc vity Applica ons Memory
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 10 SPARC M7: Designed for Security The most complete set of encryp on standards
• 15 crypto algorithms Clear Data In • 25 user level crypto instruc ons • 32 Crypto Accelerators per Processor • To accelerate: – Asymmetric (Public Key Encryp on) – Symmetric Key (Bulk Encryp on) – Message Digest (Hash Func ons) Encrypted Data Out
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 11 Oracle Solaris Stops Malware Before It Gets In
Immutable Systems and Virtual Machines – Stop a ackers from establishing a foothold – Prevent administrator mistakes – Update, even though it’s un-writable by users and applica ons Tamper-Evident So ware – From firmware all the way to applica ons – Install only known, trusted so ware – If not signed, won’t install – Verified Boot
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 12 SQL In Silicon: In-Memory Query Accelera on • Dedicated SQL accelerators built on chip – Independently process streams of compressed data placed in system memory SPARC M7 – Like adding 32 addi onal specialized cores to chip Core Core Core Core – Up to 170 Billion rows per second! • Frees processor cores to run other Cache applica ons, such as OLTP SQL SQL SQL SQL Accel Accel Accel Accel • Decompresses data simultaneously to processing SQL func ons – Like adding 64 addi onal specialized cores
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 13 M7 Query Accelerator Engine On-Chip On-Chip Network Network
• 32 In-silicon offload engines M7 Query Data Output Queues Data Output Queues Engine • Cores/Threads operate (1 of 32) Result Result Result Result Format/ Format/ Format/ Format/ synchronous or asynchronous Encode Encode Encode Encode Scan, Filter, Scan, Filter, Scan, Filter, Scan, Filter, to offload engines Join Join Join Join Local Unpack/ Unpack/ Unpack/ Unpack/ Alignment Alignment Alignment Alignment • User level synchroniza on SRAM through shared memory Decompress Decompress Decompress Decompress • High performance at low power Data Input Queues Data Input Queues • 3x more memory bandwidth than x86 On-Chip On-Chip Network Network
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 14 World’s Fastest Microprocessor
Balanced Design
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | SPARC M7: Se ng 20 World Records in Performance
#1 SPECint_rate2006: 1,200 peak
#1 SPECfp_rate2006: 832 peak
#1 SPECjEnterprise2010: 25,093.06 EjOPs
#1 SAP-SD 2 processor: 30,800 users
And more…
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 16
Real-Time Enterprise: Simultaneous OLTP & In-memory SPARC M7: Faster analy cs, faster OLTP, and be er response me Analy cs: SPARC M7 8.2x faster per chip OLTP: SPARC M7 2.9x faster per chip Two x86 E5 v3 One SPARC M7
OLTP Performance Analy cs Analy cs OLTP Performance Throughput Response Time Throughput Response Time
236k 12ms 65 VS 267 338k 11ms Transac ons Queries Queries Transac ons per sec per min per min per sec
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 17 Benchmark Disclosure Statement • Copyright 2015, Oracle &/or its affiliates. All rights reserved. Oracle & Java are registered trademarks of Oracle &/or its affiliates. Other names may be trademarks of their respec ve owners • SPEC and the benchmark name SPECjEnterprise are registered trademarks of the Standard Performance Evalua on Corpora on. Results from www.spec.org as of 10/25/2015. SPARC T7-1, 25,818.85 SPECjEnterprise2010 EjOPS (unsecure); SPARC T7-1, 25,093.06 SPECjEnterprise2010 EjOPS (secure); Oracle Server X5-2, 21,504.30 SPECjEnterprise2010 EjOPS (unsecure); IBM Power S824, 22,543.34 SPECjEnterprise2010 EjOPS (unsecure); IBM x3650 M5, 19,282.14 SPECjEnterprise2010 EjOPS (unsecure). • SPEC and the benchmark name SPECvirt_sc are registered trademarks of the Standard Performance Evalua on Corpora on. Results from www.spec.org as of 10/25/2015. SPARC T7-2, SPECvirt_sc2013 3026@168 VMs; HP DL580 Gen9, SPECvirt_sc2013 3020@168 VMs; Lenovo x3850 X6; SPECvirt_sc2013 2655@147 VMs; Huawei FusionServer RH2288H V3, SPECvirt_sc2013 1616@95 VMs; HP ProLiant DL360 Gen9, SPECvirt_sc2013 1614@95 VMs; IBM Power S824, SPECvirt_sc2013 1371@79 VMs. • SPEC and the benchmark names SPECfp and SPECint are registered trademarks of the Standard Performance Evalua on Corpora on. Results as of October 25, 2015 from www.spec.org and this report. 1 chip resultsSPARC T7-1: 1200 SPECint_rate2006, 1120 SPECint_rate_base2006, 832 SPECfp_rate2006, 801 SPECfp_rate_base2006; SPARC T5-1B: 489 SPECint_rate2006, 440 SPECint_rate_base2006, 369 SPECfp_rate2006, 350 SPECfp_rate_base2006; Fujitsu SPARC M10-4S: 546 SPECint_rate2006, 479 SPECint_rate_base2006, 462 SPECfp_rate2006, 418 SPECfp_rate_base2006. IBM Power 710 Express: 289 SPECint_rate2006, 255 SPECint_rate_base2006, 248 SPECfp_rate2006, 229 SPECfp_rate_base2006; Fujitsu CELSIUS C740: 715 SPECint_rate2006, 693 SPECint_rate_base2006; NEC Express5800/R120f-1M: 474 SPECfp_rate2006, 460 SPECfp_rate_base2006. • SPEC and the benchmark name SPECjbb are registered trademarks of Standard Performance Evalua on Corpora on (SPEC). Results from h p://www.spec.org as of 11/13/2015 and IBM announcement. SPARC T7-1 120,603 SPECjbb2015-Mul JVM max-jOPS, 60,280 SPECjbb2015-Mul JVM cri cal-jOPSIBM Power S812LC 44,883 SPECjbb2015-Mul JVM max-jOPS, 13,032 SPECjbb2015-Mul JVM cri cal-jOPS, www-03.ibm.com/systems/power/hardware/reports/system_perf.html, IBM Power Systems Performance Report, October 2015, page 39; Dell PowerEdge R730 94,903 SPECjbb2015-Mul JVM max-jOPS, 29,033 SPECjbb2015-Mul JVM cri cal-jOPS; Cisco UCS C220 M4 92,463 SPECjbb2015-Mul JVM max-jOPS, 31,654 SPECjbb2015-Mul JVM cri cal-jOPS; Lenovo Flex System x240 M5 80,889 SPECjbb2015-Mul JVM max-jOPS, 43,654 SPECjbb2015-Mul JVM cri cal-jOPS; SPARC T5-2 80,889 SPECjbb2015-Mul JVM max-jOPS, 37,422 SPECjbb2015-Mul JVM cri cal-jOPS; Oracle Server X5-2L 76,773 SPECjbb2015-Mul JVM max-jOPS, 26,458 SPECjbb2015- Mul JVM cri cal-jOPS; Sun Server X4-2 52,482 SPECjbb2015-Mul JVM max-jOPS, 19,614 SPECjbb2015-Mul JVM cri cal-jOPS; HP ProLiant DL120 Gen9 47,334 SPECjbb2015- Mul JVM max-jOPS, 9,876 SPECjbb2015-Mul JVM cri cal-jOPS. • SPEC and the benchmark name SPEC OMP are registered trademarks of the Standard Performance Evalua on Corpora on. Results as of October 25, 2015 from www.spec.org and this report. SPARC T7-4 (4 chips, 128 cores, 1024 threads): 27.9 SPECompG_peak2012, 26.4 SPECompG_base2012; HP ProLiant DL580 Gen9 (4 chips, 72 cores, 144 threads): 21.5 SPECompG_peak2012, 20.4 SPECompG_base2012; Cisco UCS C460 M7 (4 chips, 72 cores, 144 threads): 20.8 SPECompG_base2012. • Two- er SAP Sales and Distribu on (SD) standard applica on benchmarks, SAP Enhancement Package 5 for SAP ERP 6.0 as of 10/23/15: SPARC T7-2 (2 processors, 64 cores, 512 threads) 30,800 SAP SD users, 2 x 4.13 GHz SPARC M7, 1 TB memory, Oracle Database 12c, Oracle Solaris 11, Cert# 2015050. IBM Power System S824 (4 processors, 24 cores, 192 threads) 21,212 SAP SD users, 4 x 3.52 GHz POWER8, 512 GB memory, DB2 10.5, AIX 7, Cert#201401. Dell PowerEdge R730 (2 processors, 36 cores, 72 threads) 16,500 SAP SD users, 2 x 2.3 GHz Intel Xeon Processor E5-2699 v3 256 GB memory, SAP ASE 16, RHEL 7, Cert#2014033. HP ProLiant DL380 Gen9 (2 processors, 36 cores, 72 threads) 16,101 SAP SD users, 2 x 2.3 GHz Intel Xeon Processor E5-2699 v3 256 GB memory, SAP ASE 16, RHEL 6.5, Cert#2014032. SAP, R/3, reg TM of SAP AG in Germany and other countries. More info www.sap.com/benchmark • Addi onal Info: h p://blogs.oracle.com/bestperf
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 18
Analy cs
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Legacy Approach Cannot Sustain Modern Analy cs Complexity and Latency Latency and Delayed Insight
ETL Produc on/OLTP Data Warehouse/OLAP (Column Format)
Copy DB to Run BI Server Cost Management (Analy cs & Reports)
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 20 OLTP and Data Warehousing Consolida on With SPARC you can efficiently run analy cs and OLTP simultaneously
OLTP Analy cs
Analy cs Analy cs OLTP Analy cs SPARC Efficient Database Analy cs and Analy cs Deployment
Legacy Infrastructure
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 21 Solu on: Extremely Efficient Analy cs in Silicon Real Time Analy cs on up to the Moment Data Latency Eliminated
Memory Memory
SALES SALES • BOTH row and column Row Column formats for same table Format Format • Analy cs & repor ng use new in-memory column format • Specialized analy cs accelerators on-chip, with maximum bandwidth
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 22 Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 23 The Power of DAX for Analy cs and Machine Learning Community Driven Innova on
Applica on DAX Advantage Oracle Database Up to 10x faster Apache Spark SQL Up to 21x faster Java Streams 3x to 21x faster Ac ve Pivot 6x to 8x faster Machine Learning 4x to 12x faster
Open access: h p://SWiSdev.oracle.com/DAX
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Accelera ng Apache Spark In-Memory Columnar Open DAX with Apache Spark SQL Over 20x Faster Than Latest 2-chip x86 • Apache Spark runs on a Java Virtual Machine 20x • SPARC DAX offloads cri cal func ons Faste r – SQL: filtering, dic onary encoding, 1 & 2
predicate, join processing, addi onal rows/sec compression, etc.
– 1-chip SPARC M7 DAX: 12.9 Billion rows/sec 2- 12.9 Billion predicate scan rows/sec – 20X faster than 2 of the latest 20-core x86 2-chip x86 SPARC M7 DAX
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | SuperCluster
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | SuperCluster M7: Hardware Architecture § ZS3 Mixed-use Storage • 160 TB (raw) storage for Virtual Machine and system data § QDR InfiniBand Unified Ultra-fast Network • 40Gb/s QDR InfiniBand IO backplane § M7 Servers for Databases & Applica ons • 1 or 2 M7 Chassis per system (Elas c Configura ons) • 2 Physical Domains per M7 chassis, 1 - 4 processors ea.
• Up to 8TB RAM per rack § Exadata Storage Servers for Oracle Database § From 3 to 11 per configura on (Flex. Config.) § High Capacity (96TB raw disk and 12.8TB of raw flash ea.)
SuperCluster M7 § Extreme Flash (25.6TB raw flash ea.)
9/02/17 Copyright © 2017 Oracle and/or its affiliates. All rights reserved. Oracle Confiden al 27 Smart Analy cs: In-Memory Formats in Columnar Flash Cache
• In-Memory formats used in Smart Columnar Flash Cache • Enables vector processing on storage server during smart scans – Mul ple column values evaluated in single instruc on In-Memory Columnar scans • Faster decompression speed than Hybrid Columnar Compression • Enables dic onary lookup and avoids processing unnecessary rows • Smart Scan results sent back to database in In-Memory Columnar format In-Flash – Reduces Database node CPU u liza on Columnar scans • In-memory performance seamlessly extended from DB node DRAM memory to 10x larger capacity flash in storage – Even bigger differen a on against all-flash arrays and other in- memory databases • Supports Database 12.1.0.2 and Database 12.2.0.1
Copyright © 2017 Oracle and/or its affiliates. All rights reserved. 28
Exadata Storage Server In-Memory Fault Tolerance
§ Similar to storage mirroring
§ Duplicate in-memory columns on another node • Enabled per table/partition • Application transparent
§ Downtime eliminated by using duplicate after failure
9/02/17 Copyright © 2017 Oracle and/or its affiliates. All rights reserved. Oracle Confiden al – .