Jessica Popp Director of Engineering, High Performance Data Division Legal Disclaimers

© 2016 Intel Corporation.

Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

*Other names and brands may be claimed as the property of others

This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps.

FTC Optimization Notice: Optimization Notice Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

2 Intel® Solutions for * Roadmap

Q4 ’16 Q1 ‘17 Q2 ’17 Q3 ’17 Q4 ’17

Enterprise EE 3.0 Enterprise Edition 3.1 EE 4.0 HPC, commercial technical Lustre 2.7.x Lustre 2.7.x Lustre computing, and analytics Client m’data perf. Expanded OpenZFS support in IML DNE 2 OpenZFS perf. IML graphical UI enhancements Multi-rail LNET Snapshots Bulk IO Performance (16MB IO) IML asset mgmt. SELinux, Kerberos OpenZFS metadata performance ZED updates OPA support Subdirectory mounts RHEL 7 OpenZFS 0.6.5.3

Cloud Cloud Edition 1.3 Cloud Edition 1.4 CE 1.5 HPC, commercial technical Lustre 2.8.x Lustre 2.9.x Lustre computing and analytics delivered on cloud services RHEL 7 Server Support Updated Lustre and base OS builds SW updates Lustre release update New instance support New instances

Foundation FE 2.8 Foundation Edition 2.9 Foundation Edition 2.10 Community Lustre software Lustre 2.8.x Lustre 2.9.x Lustre 2.10.x coupled with Intel support DNE 2 UID/GID Mapping OpenZFS Snapshots Client m’data perf. Shared key crypto Multi-rail LNET LFSCK perf. Large Block IO Project quotas SELinux, Kerberos Subdirectory mounts Key: Shipping Development Planning

Product placement not representative of final launch date within the specified quarter * Some names and brands may be claimed as the property of others 3 Intel® Enterprise Edition for Lustre* software 3.1

Target GA Q4 2016 . Based on community 2.7 release . Latest RHEL 7.x server/client support . IML managed mode for OpenZFS* . GUI enhancements . Bulk IO Performance for ldiskfs (16MB IO) . Metadata performance improvements for Lustre on OpenZFS . Subdirectory Mounts

4 Intel® Cloud Edition for Lustre* software 1.3

Software defined storage cluster available via AWS Provides a performant, scalable filesystem on demand Powered by Lustre 2.8 Provides secure clients, IPSec and EBS encryption Available through AWS Cloud, AWS GovCloud,

*Features may vary by platform

5 Intel® Foundation Edition for Lustre* software 2.9

GA November 2016 Features . RHEL* 7.2 servers/clients; SLES12* SP1 client support . ZFS* Enhancements (Intel, LLNL) . UID/GID mapping (IU, OpenSFS*) . Shared Secret Key Encryption (IU, OpenSFS) . Subdirectory mounts (DDN*) . Server IO advice and hinting (DDN) . Large Block IO

6 Development Community Growth Other ANU Canonical Clogeny SGI Atos CEA DDN

Seagate Fujitsu Unique Developer and Organization Cray GSI ORNL LLNL Count IU 100 90 92 80 76 70 69 70 65 60 55 50 52 40 41 42 30 Intel 20 19 15 14 15 17 10 13 7 10 Lustre 2.8 Commits 0 1 Statistics courtesy of Lawrence Livermore National Laboratories (Chris 1.8.0 2.1.0 2.2.0 2.3.0 2.4.0 2.5.0 2.6.0 2.7.0 2.8.0 Morrone)

Developers Organizations Source: http://git.whamcloud.com/fs/lustre_release.git/shortlog/refs/heads/b2_8

7 Lustre 2.9 – Contributions So Far

GSI 1 SGI 8 ANU Atos4 6 CrayCEA 17 Purdue 43 ANU 545 Atos 449 CEA 210 Seagate 72 SGI 305 ORNL Other 2 89 DDN ClogenyCray 153617 138 Other 879 LLNL 17 52 Seagate DDN 1841 IU 2 5061 Fujitsu 1

ORNL 7873 LLNL 2786

IU 9164 Intel 37152 Intel 1759 Lines of Reviews Code

Statistics courtest of Dustin Leverman (ORNL) Source: http://git.whamcloud.com/fs/lustre-release.git Aggregated data by organization between 2.8.50 and 2.8.60

8 Advanced Lustre Research Intel Parallel Computing Centers Uni Hamburg + German Client Research Centre (DKRZ) • Client-side data compression • Adaptive optimized ZFS data compression GSI Helmholtz Centre for Heavy Ion Research • TSM* HSM copytool University of California Santa Cruz • Automated client-side load balancing Johannes Gutenberg University Mainz • Global adaptive IO scheduler Lawrence Berkeley National Laboratory

• Spark and Hadoop on 9 Lustre Feature Development

Multi-Rail LNet (SGI*, Intel) • HSM Data Mover • Improve performance by aggregating • Copy tool to interface between Lustre bandwidth and 3rd party storage • Improve resiliency by trying all interfaces • Early Access availability of agent with before declaring msg undeliverable POSIX and S3 data movers via GitHub • Additional data movers can be licensed as desired by developers

10 Lustre Feature Development

Data on MDT (Intel) File Level Replication • Optimize performance of small file • Robustness against data IO loss/corruption • Small files (as defined by • Provides higher availability for administrator) are stored on MDT server/network failures • Redundant layouts can be defined on Open, write, attributes per-file or directory basis Client MDS • Replicate/migrate between storage layout, lock, attributes, read classes OSSOSS Replica 0 Object j (primary, preferred) Small file IO directly to MDS OSS delayed Replica 1 Object k (stale) resync Overlapping (mirror) layout

11 ZFS Metadata Performance Improvements

MDS-Survey - File Creates/Sec over 8 Directories 120000

100000

80000 master+ ldiskfs

60000 master+ZFS-0.7.0 +dacct master +ZFS-0.7.0

File File Creates/sec 40000 2.8+ZFS-0.6.5

20000

0 8 16 32 64 128 *Based on test platform in slide 20 Threads

12 Data Security

Available in Intel® EE for Lustre* software 3.0 • Secure Clients – SELinux8 support to enforce access control policies, including Multi- Level Security (MLS) on Lustre clients • Secure Authentication and Encryption – Kerberos* functionality can be applied to Intel® EE for Lustre* software to establish trust between Lustre servers and clients and to support encrypted network communications

In Development (2.9+) • Shared Secret Key Crypto – data encryption for networks including RDMA • Node Maps – UID/GID mapping for WAN clients • Data isolation via filesystem containers – subdirectory mounts with client authentication

13 Upstream Linux Kernel Client Refocused efforts on in-kernel Lustre client • LNET virtually up-to-date with master • Bug fixes to 2.5.51 landed, 2.5.58 pending

Challenges • Features currently at 2.3.54 (now DNE, HSM) • Significant code cleanups still needed

Lustre community developers cited among most active for Linux 4.6 kernel • Oleg Droking (Intel): #5 by lines of code • James Simmons (ORNL): #16 by lines of code

14 Lustre Development for CORAL dRAID

• Optimizations for large streaming IO ZFS Declustered Parity • Distributes data, parity, and spare blocks evenly across all disks • Optimized and scalable RAID rebuild limits exposure to cascading disk failures and minimally impacts application IO Lustre Streaming • End-to-end large block support • Block allocation improvements

All work will be landed in appropriate upstream trees (Lustre and OpenZFS)

15 DAOS: The Future of Storage • Open source, scalable, software-defined userspace storage system providing object storage on Intel hardware New Existing Analytics Legacy App Applications Applications • Applications can realize immediate benefit w/ little to no change via middleware HDF5, NetCDF Hadoop, Spark POSIX-lite Full POSIX via modifications Modern HPC HDFS could Lustre data access DAOS Posix-compliant management Storage easily distributed file framework to support system for • Fine-grained IO and Versioning provide makes porting popular legacy DAOS API apps easy analytics tools application greater speed and control over data Allows applications access to directly access native Open Source • DAOS exploits 3D-Xpoint™ DIMM & NVMe Object Storage APIs and Intel® OmniPath Architecture • Intel technology improves performance by reducing latency & keeping data DAOS - Distributed Async Object Storage Framework Reliable, Fast, Flexible storage built on Intel Technology closer to computing resources

16 The Future is both Evolutionary & Revolutionary

Lustre evolving in response to: DAOS exploring new territory: • A Growing Customer Base • What may lay beyond POSIX • Changing use cases • Use new HW capabilities as storage • Emerging HW capabilities • Object storage model exposes new capability for scalable consistency

DAOS Tier Pool Container IML Object DKey Pool Akey[i]

17

Test Configuration

. 2 x Intel(R) Xeon(R) CPU E5-2660 v2 @ 2.20GHz – 20 cores . 64GB RAM . 3 x 500GB local SATA HDD 7200 RPM . CentOS 7.2.1511 . 3.10.0-327.28.2.el7 kernel (RHEL7) . No remote clients, just local MDS testing (mds-survey script)

20 HPE Scalable Storage with Intel Enterprise Edition for Lustre*

High Performance Storage Solution Meets Demanding I/O requirements Performance measured for an Apollo 4520 building block • Up to 17 GB/s Read/15 GB/s Writes with EDR1 • Up to 16 GB/s Reads and Writes with OPA1 • Up to 21GB/s Reads and 15GB/s Writes with all SSD’s² Designed for PB-Scale Data Sets

Density Optimized Design For Scale • Dense Storage Design Translates to Lower $/GB Built on Apollo 4520 • Limitless Scale Capability – Solution Grows Linearly in

HPC Clients “Appliance Like” Delivery Model Apollo 6000 FDR/EDR* Pre-Configured Flexible Solution InfiniBand • Deployment Services for Installation • Simplified Wizard Driven Management thru Intel Manager for HPE Scalable Lustre Storage with Intel EE for Lustre* Innovative Software Features DL360 DL360 + MSA 2040 (Intel Management Server) Leading Edge Yet Enterprise Ready Solution • ZFS software RAID provides Snapshot, Compression & Error Correction • ZFS reduces hardware costs with uncompromised performance • Rigorously Tested for Stability & Efficiency * Some names and brands may be claimed as the property of 1: Different Conditions & Workloads can affect the Performance 2: 24 x1.6TB MU SSD’s in A4520 no JBOD others.

21