Lustre* with ZFS* SC16 Presentation

Lustre* with ZFS* Keith Mannthey, Lustre Solutions Architect Intel High Performance Data Division Legal Information • All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps • Tests document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit http://www.intel.com/performance. • Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at http://www.intel.com/content/www/us/en/software/intel-solutions-for-lustre-software.html. • Intel technologies may require enabled hardware, specific software, or services activation. Check with your system manufacturer or retailer. • You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein. • No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. • The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. • Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non- infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. • This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps. • A "Mission Critical Application" is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD YOU PURCHASE OR USE INTEL'S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS' FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS. • Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined". Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information. • Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries. • * Other names and brands may be claimed as the property of others. 3 • © 2016 Intel Corporation Lustre with ZFS • Motivations for Lustre w/ZFS • Lustre w/ZFS – Unique Features • Industry Use Cases • Industry Momentum • Intel’s Commitment to Lustre w/ZFS 4 Motivation Usage Models Technical Needs LUSTRE with OpenZFS Machine Learning Performance Extreme Performance at Scale Genomics Rapid Scalability Integrated Security Video/Animation Security & Compliance SW Management Stack Manageability Simulation Data Integrity and Recovery Reliability / Availability EXAscaleComputing DEEP Integration Open Source and Extensible 5 ZFS – Unique Features • Incredible reliability – Data is always consistent on disk; silent data corruption is detected and corrected; smart rebuild strategy • Compression – Maximize usable capacity for increased ROI • Snapshot – support built into Lustre – Consistent snapshot across all the storage targets without stopping the file system. • Hybrid Storage Pool – Data is tiered automatically across DRAM, SSD/NVMe and HDD accelerating random & small file read performance • Manageability – Powerful storage pool management makes it easy to assemble and maintain Lustre storage targets from individual devices 6 Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. • Solution: Storage System based on Intel® Enterprise Edition for Lustre* Software w/ZFS. • Results: • 20X improvement in data r/w capacity, lower costs, faster speed with supplying data to high-performance computing clusters. • High efficiency from full use of computing resources • Full confidence in meeting the demand for continuous performance and capacity upgrades caused by increasingly complex genetic information research. • Summary: Gene Sequencing, Data Archiving and Storage Clusters based on Intel® Enterprise Edition for Lustre* software have improved data throughput performance and accelerated value mining and insights into genetic information. 7 ZFS Enhancements in the path of Exascale • Changes for using ZFS more efficiently • Improved file create performance • Snapshots of whole file system • Changes to core ZFS code • Inode quota accounting • Multi-mount protection for safety • System and fault monitoring improvements • Large dnodes for improved extended attribute performance • Reduce CPU usage with hardware-assisted checksums, compression • Declustered parity & distributed hot spaces to improve re-silvering • Metadata allocation class to store all metadata on SSD/NVRAM 8 Industry Adoption • Path to Exascale • CORAL and future follow-on architectures are scoped with ZFS. • LLNL Sequoia1 (55PB File System) • Cheaper, less complex, higher performance file system for Sequoia • With Intel, Lustre and ZFS continue to advance • Collaborate with OpenZFS community on new features. • Breakthrough metadata performance: LAD’16 Talk 1 http://computation.llnl.gov/projects/zfs-lustre 9 Intel’s Commitment to Lustre w/ZFS Performance Enhancements Native Encryption Built- ZFS improvements for increased in encryption for data at metadata performance. rest to provide enhanced storage security. Fault Management Persistent Read Cache OpenZFS Intel Enhanced fault monitoring and Update of existing L2ARC management architecture for ZFS. read cache to persist data across reboots. D-RAID – De-clustered RAIDZ provides IPCC-L massively improved rebuild performance after a drive failure. Parity acceleration – Using AVX instructions to accelerate parity calculation 10 Wrap up • Lustre w/ZFS provides • Data Integrity • Compression • Snapshots • Learn More • www.intel.com/Lustre 11 .

Lustre* with ZFS* SC16 Presentation

The Parallel File System Lustre

A Fog Storage Software Architecture for the Internet of Things Bastien Confais, Adrien Lebre, Benoît Parrein

Evaluation of Active Storage Strategies for the Lustre Parallel File System

On the Performance Variation in Modern Storage Stacks

Lustre* Software Release 2.X Operations Manual Lustre* Software Release 2.X: Operations Manual Copyright © 2010, 2011 Oracle And/Or Its Affiliates

Parallel File Systems for HPC Introduction to Lustre

Decentralising Big Data Processing Scott Ross Brisbane

Understanding Lustre Filesystem Internals

Openzfs on Linux Hepix 2014 October 16, 2014 Brian Behlendorf [email protected]

Comparative Analysis of Distributed and Parallel File Systems' Internal Techniques

Introduction to Distributed File Systems. Orangefs Experience 0.05 [Width=0.4]Lvee-Logo-Winter

Lustre / ZFS at Indiana University