Intelligent Storage for HPC: Sun Storagetek QFS and Sun Storagetek Storage Archive Manager

Intelligent Storage for HPC: Sun StorageTek QFS and Sun StorageTek Storage Archive Manager Harriet Coverston Distinguished Engineer Sun Microsystems, Inc. May, 2008 Shared QFS and SAM HPC – Why It Matters to Sun • What happens in HPC today often becomes mainstream beyond HPC in the years ahead > HPC drives leading-edge technologies • HPC applications all have similar requirements > Consolidated Storage > Performance & Scalability > Parallel processing • Expect this same phenomenon in data management > HPC is tackling HUGE data requirements/issues • HPC is a growth target at SUN (one of four) Page 2 Shared QFS and SAM Sun's Advanced HPC Data Management Products • Sun StorageTek QFS – SAN File System > High performance parallel file system > Transparent user interface > Production ready > http://www.sun.com/storagetek/management_software/data_management/qfs • Sun StorageTek Storage Archive Manager (SAM) > Policy based automatic data migration and protection > Full device streaming > Tiered storage > http://www.sun.com/storagetek/management_software/data_management/sam Page 3 Shared QFS and SAM Open Source • Sun opened the source for SAM and Shared QFS, metadata server and clients > http://opensolaris.org/os/project/samqfs • Plans are to post SAM and Shared QFS 5.0 development code periodically > We are developing in the open > We are expanding our community • Sun opened Libsam, the APIs which allow the user to manage data in a SAM-QFS file system from within an application program > http://developers.sun.com/solaris/articles/libsam.html Page 4 Shared QFS and SAM Shared QFS (SQFS) • Large, existing, and loyal customer base > Stable base, shipping since Aug 2002 • Targets large enterprises, grid, and HPC > Clients run on Solaris (SPARC x64 & x86) & Linux > Metadata server runs on Solaris (SPARC & x64) > HA option with SunCluster • Built in HSM with SAM with optional WORM functionality for business compliance • SQFS currently supports 256 nodes • Next release, SQFS will support thousands of nodes > Targets HPC clusters Page 5 Shared QFS and SAM Shared QFS with SAM Integrated Data Management Client Client Client Client n ... Data is accessed directly from LAN devices Network SAN Meta Data Server handles name services Block Block and space Meta Data Server Storage Storage management Block Block Storage Storage Block Block Meta data path Storage Storage Data path SL500 (Tape Library) Page 6 Shared QFS and SAM Shared QFS Customer Benefits • Data consolidation with SAN file sharing > HBO – 5000 hours of programming to manage > “Provided the scalability to store and manage large files created by program-length video with the performance necessary to meet HBO's demanding throughput goals” http://www.sun.com/customers/storage/hbo.xml • Performance and scalability > Tune file system to the application > Near raw I/O performance > File system I/O performance scales linearly with the hardware • Parallel processing w/ multi-node read/write access • SAM provides automatic & continuous data protection Page 7 Shared QFS and SAM Shared QFS Certified w/SunCluster • SunCluster HA failover support > Standalone QFS > HA-NFS over QFS > Shared QFS Metadata Server failover > Supports clients outside the cluster • Oracle RAC runs on Shared QFS with SunCluster for high availability > Oracle certified on 9i and 10g > Shared QFS license is free for this configuration • Shared QFS transactional performance matches raw Page 8 Shared QFS and SAM SAM Customer Benefits • Policy based automatic data migration > Media can be disk, tape, or optical > Local or remote copies > Classification is path, owner, group, size, wildcard & access • Media format is tar > Small files are put into a tar container so data is streamed at device speeds out to the tape • Keeps all data available, but not on high cost storage > Moves data across the tiers according to access patterns • On-demand, transparent file retrieval • Continuous backup – no waiting until midnight Page 9 Shared QFS and SAM Policy Driven Sam Processes ● Transparently Archive from disk cache to removable media without operator intervention based on policies – Time based archiving ● Manage disk space and Release archived files from disk cache based on policies ● Automatically Stage released files back to disk cache when accessed – Option to pre-stage and option to bypass disk cache ● Recycle removable media by repacking media Page 10 Shared QFS and SAM SAM Migration Facility • Move from foreign HSM to SAM > Import metadata into a SAM-QFS file system > Copy foreign HSM data to SAM in the background > Production up and active during the migration process • Migrated German Weather from Amass/DataMgr to SAM – Sun partnered with reseller HMK > Moved 10 million files into a SAM-QFS in 8 hours > Successfully migrated 700+ TB of Amass/DataMgr to SAM in 161 days > Production operational during migration • Migrations for DMF, UniTree, Amass, and Veritas Page 11 Shared QFS and SAM Support for Monitoring SAM and QFS • The monitoring console(shown here) lets admins quickly understand their SAM environment – Potential trouble spots are indicated by severity icons in the left hand panel. • e-mail notifications can be configured to alert admins of problems with file systems, archiving and archive media • System metrics provide archive media reports and file data distribution charts • Faults provide a record of adverse conditions that have occurred in the system (including tape alerts) Page 12 Shared QFS and SAM SAM's Archives are OPEN! • Media format is open, not proprietary – tar format > Files can be recovered with or without SAM – our media format is open, NOT proprietary • Metadata about the data is on the archives > If file system metadata is lost, the archives can be recovered with a procedure we call the “Ultimate Disaster Recovery” Page 13 Shared QFS and SAM Store Data Forever! Future-proof Data Storage for Data Preservation > Archive files are self-describing, standard > No lock-in, open TAR format > Move data to newer media overtime, transparently Page 14 QFS Scalability for HPC 15 Shared QFS and SAM Coping with HPC Storage Complexity • Increasing bandwidth requirements > 2GB/sec per TF = 2TB/sec peak for a petaflop machine > 1TB/sec sustained I/O bandwidth for a petaflop machine • Increasing demand for metadata ops > Finding any one file among trillions of files > Finding anything in the petabytes of data (data mining) • Extreme concurrency > HPC compute scaling means more processors which means more concurrent threads which means more concurrent I/O requests • Seek Efficiency > Disk drive latency is the about the same as 1990 drives and this is the bottleneck Page 16 Shared QFS and SAM Intelligent Storage for HPC • Intelligent, secure storage (T10/1729-D OSD-2) helps solve storage complexities > Move higher-level storage functions out to the devices > Execute these functions in parallel in order to scale > Support secure client access because credentials are checked on every access • Data aware intelligent storage can support > Object caching and pre-fetching > Object seach > Object repair • AND most important, it is a standard (ANSI T10)! Page 17 Shared QFS and SAM Object Storage • Distributes space allocation > Allows bandwidth to scale up with capacity increases • Knowledge of the data pushed to object storage device (OSD) > Better resource utilization in multi-host configurations > Key to multi-host quality-of-service policies > Guaranteed rate I/O, guaranteed latency, etc. • Security at the OSD • Standards based (ANSI T10/1355) Page 18 Shared QFS and SAM Scale SQFS with Intelligent Storage • T10/OSD provides standards-based intelligent object-based storage > Version 1 ratified 2004; version 2 expected soon • Intelligent storage increases horizontal scale > Space allocation moves to the storage nodes > Space allocation is done in parallel by the storage nodes > Bandwidth scales up as capacity increases • Current roadmap plans include releasing Shared QFS with object-based storage end of this year in an OpenSolaris HPC distro > Plans are to support 1000s of nodes Page 19 Shared QFS and SAM History of T10 OSD at Sun • May, 2004. Speaker at DTC, “Intelligent Storage in Commercial HPC” • May, 2006. Speaker at DTC, “Object Storage at Sun” > DARPA Phase 2 filesystem Projects included T10 OSD petascale distributed file system, Storage Pools, and Archive Metadata Database for semantic access. • May, 2007. Speaker at DTC, “OpenSolaris T10 OSD Reference Implementation” • May, 2008. Speaker at DTC, “Intelligent Storage for HPC” Page 20 Shared QFS and SAM RDMA for InfiniBand • Storage access bandwidth, overhead, and latency are all limited by iSCSI over TCP/IP • InfiniBand RDMA increases the bandwidth and reduces overhead • Current roadmap plans include releasing InfiniBand RDMA support for Shared QFS end of this year, 2008 > iSER initiator and target supported in Solaris Page 21 Shared QFS and SAM Scale Shared QFS Page 22 Shared QFS and SAM Scale Shared QFS Page 23 Shared QFS and SAM Intelligent Storage Components INITIATOR TARGET • Shared QFS enhanced • ISCSI target enhanced to support T10 OSD, for iSER RDMA new file system type • Common Multiprotocol “mb” SCSI Target • Solaris initiator device (COMSTAR) driver, sosd • T10 LUN provider • Sun Command SCSI • Object QFS file system, Architecture allocators only, no • iSCSI enhanced for namespace, new file iSER RMDA system type “mat” Page 24 Shared QFS and SAM OpenSolaris.org Projects • All of the object-based storage projects are being developed in the open: • SAM-QFS

Intelligent Storage for HPC: Sun Storagetek QFS and Sun Storagetek Storage Archive Manager

The Parallel File System Lustre

A Fog Storage Software Architecture for the Internet of Things Bastien Confais, Adrien Lebre, Benoît Parrein

Evaluation of Active Storage Strategies for the Lustre Parallel File System

On the Performance Variation in Modern Storage Stacks

Lustre* Software Release 2.X Operations Manual Lustre* Software Release 2.X: Operations Manual Copyright © 2010, 2011 Oracle And/Or Its Affiliates

Parallel File Systems for HPC Introduction to Lustre

Hierarchical Storage Management with SAM-FS

Lustre* with ZFS* SC16 Presentation

Decentralising Big Data Processing Scott Ross Brisbane

Petascale Data Management: Guided by Measurement

Understanding Lustre Filesystem Internals

Kratka Povijest Unixa Od Unicsa Do Freebsda I Linuxa