Storage Research at ORNL Sudharshan Vazhkudai R. Scott

Storage Research at ORNL Presentation to HEC-IWG Workshop Sudharshan Vazhkudai Network and Cluster Computing, CSMD R. Scott Studham National Center for Computational Sciences Contributors: Sarp Oral, Hong Ong, Jeff Vetter, John Cobb, Xiaosong Ma and Micah Beck OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 1 Application Needs and User Surveys—Initial Observations • GYRO, POP, TSI, SNS • Most users have limited IO capability because the libraries and runtime systems are inconsistent across platforms. • Limited use of Parallel NetCDF or HDF5 − POP moving to P-NetCDF − SNS uses HDF5 • Seldom use of MPI-IO • Widely varying file size distribution − 1MB, 10MB, 100MB, 1GB, 10GB OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 2 Current Storage Efforts for NLCF • Future procurements require support for center– wide file system • Minimize the need for users to move files around for post processing. • As most applications continue to do the majority of I/O from PE0 we are focused on the single client performance to the central pool. NLCF Center Wide Filesystem OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 3 Using Xen to test scalability of Lustre to O(100,000) processors. SSI Software OpenSSI 1.9.1 Filesystem Lustre 1.4.2 Basic OS Linux 2.6.10 Virtualization Xen 2.0.2 Single System Image with process migration OpenSSI OpenSSI OpenSSI OpenSSI XenLinux XenLinux XenLinux XenLinux Linux 2.6.10 Linux 2.6.10 Linux 2.6.10 Linux 2.6.10 Lustre Lustre Lustre Lustre Xen Virtual Machine Monitor Hardware (SMP, MMU, physical memory, Ethernet, SCSI/IDE) OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 4 Data Management Infrastructure for the Spallation Neutron Source and NLCF TB of storage by year • 50 PB by 2010 • Need to archive, annotate, share, move, replicate • Current efforts revolve around Lustre, SRB and HPSS • Connection between database-assisted data management and file- based raw data I/O OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 5 FreeLoader: Collaborative Caching for Large Scientific Data http://www.csm.ornl.gov/~vazhkuda/Morsels Problem Space: Enabling Trends: • Data Deluge: Increasing dataset sizes (NIH, SDSS, SNS, TSI) • Unused Storage: More than 50% desktop storage unused • Locality of interest: Collaborating scientists routinely analyze and visualize these datasets • Immutable Data: Data is usually write once read many, with remote source copies • Desktop, an integral part: End-user consumes data locally due to ever increasing processing power, convenience & control; But • Connectivity: Well connected, secure LAN settings limited by secondary storage capabilities FreeLoader Aggregate Storage Cache: • Scavenges O(GB) of contributions from desktops • Parallel I/O environment across loosely-connected workstations, aggregating I/O as well as network BW • NOT a file system, but a low-cost, local storage solution enabling client-side caching and locality Initial Results: • Striping across desktops delivers comparable aggregate BW to file systems • In SC’05 OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 6 Logistical Networking Used for Failsafe Wide Area Data Streaming Micah Beck, U of Tennessee & ORNL Scott Klasky, ORNL GPFS Viraj Bhat, Rutgers Univ. file sys metadata depots on simulation end Signal Re-fetch failed transfers from depots close by GPFS/depots Write to GPFS or remote Simulation Depot depots Supercomputer Network Failure nodes Buffer Overflow Data Flow OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 7 Future Directions for ORNL storage • Important to improve serviceability of NLCF high-end storage − Center wide file system • heterogeneous set of clients • Focusing on single client IO performance • Aggregate IO performance must scale with # of disks, clients − Failover to • storage caches or archival storage seamlessly • Availability − Potentially large # of disks in future storage systems − Replication based on access patterns of datasets • Profile NLCF applications’ I/O usage • Track and benchmark I/O subsystems on NLCF platforms OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 8 Acknowledgments • DOE Office of Science • ORNL LDRD—TCSS Initiative • DOE Basic Energy Sciences • NSF-TeraGrid OAK RIDGE NATIONAL LABORATORY U. S. DEPARTMENT OF ENERGY 9.

Storage Research at ORNL Sudharshan Vazhkudai R. Scott

View Article(3467)

Distributed-Operating-Systems.Pdf

A Multiserver User-Space Unikernel for a Distributed Virtualization System

Facilitating Both Adoption of Linux Undergraduate Operating Systems Laboratories and Students’ Immersion in Kernel Code

Operating Systems – the Code We Love to Hate

ELASTICITY THANKS to KERRIGHED SSI and XTREEMFS Elastic Computing and Storage Infrastructure Using Kerrighed SSI and Xtreemfs

SSI-OSCAR: a Distribution for High Performance Computing Using a Single System Image

Downloaded on 2018-08-23T19:11:32Z Single System Image: a Survey

Comparative Study of Single System Image Clusters

Performance, Management, and Monitoring of 68 Node Raspberry Pi 3 Education Cluster: Big Orange Bramble (Bob)

Single System Image in a Linux-Based Replicated Operating System Kernel

SPEED QUEEN the Openssi Framework Rearranges Processes for Easy and Transparent