The Evolution of File Systems
Total Page:16
File Type:pdf, Size:1020Kb
The Evolution of File Systems Thomas Rivera, Hitachi Data Systems Craig Harmer, April 2011 SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced without modification The SNIA must be acknowledged as source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA Education Committee. Neither the Author nor the Presenter is an attorney and nothing in this presentation is intended to be nor should be construed as legal advice or opinion. If you need legal advice or legal opinion please contact an attorney. The information presented herein represents the Author's personal opinion and current understanding of the issues involved. The Author, the Presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. The Evolution of File Systems 2 © 2012 Storage Networking Industry Association. All Rights Reserved. 2 Abstract The File Systems Evolution Over time additional file systems appeared focusing on specialized requirements such as: data sharing, remote file access, distributed file access, parallel files access, HPC, archiving, security, etc. Due to the dramatic growth of unstructured data, files as the basic units for data containers are morphing into file objects, providing more semantics and feature- rich capabilities for content processing This presentation will: Categorize and explain the basic principles of currently available file system architectures (e.g. Local, Shared, SAN, Clustered, Network, Distributed, Parallel, etc. Explain technologies like Scale-Out NAS, NAS Aggregation, NAS Virtualization, NAS Clustering, Global Namespace, Parallel NFS Review new file system architectures being developed The Evolution of File Systems 3 © 2012 Storage Networking Industry Association. All Rights Reserved. 3 Related Tutorials Check out SNIA Tutorial: Check out SNIA Tutorial: Using File Server Protocols for Understanding Enterprise NAS Block-based Storage Workloads Check out SNIA Tutorial: pNFS and NFS V4.2 The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 4 Why File Systems Have Evolved Scale Megabytes → Petabytes Requirements High availability Data sharing Remote access Performance Archiving others… Local Shared SAN Cluster Network Distributed Object Parallel File File File File File File File File ? System System System System System System System System ..... Time (Not a strict timeline—new capabilities are generally incremental) The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 5 Where File Systems Live User Application and Libraries (ls, mv, rm, cp, ...) System Calls (open(), close(), read(), write(), ioctl(), mmap(), ...) User space Kernel space VFS Process Management File System Data Cache* mmap() *can be Memory Segmap Cache Scheduler IPC bypassed by using Mgmt direct I/O Volume Manager Device Drivers DMA Buffers Machine dependent code Hardware The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 6 What File Systems Do (UNIX example) File locators: Data locators: Data: (“inodes”) (pointers) (blocks) Inode direct 0 Host Data Blocks direct 1 direct 2 data block direct 3 data block direct 4 data block direct 5 data block direct 6 data block direct 7 data block direct 8 data block direct 9 data block single indirect data block 0 1 2 3 4 double data block indirect 5 6 7 8 9 data block triple data block indirect 10 11 12 13 14 data block File Owner 15 16 17 18 19 File Type Permissions Last Access . Size File attributes: # of links The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 7 A File System Taxonomy File Systems Local Shared Network File System File System File System Distributed SAN Cluster Distributed Parallel File System File System File System File System The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 8 Local File System Local file system Application File System File system is co-located in the server with application The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 9 Local File System Separate “islands” of data Limitation: no data sharing Application Application Application Application File System File System File System File System The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 10 One Way to Share Data: Scale-Up Vertical scaling The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 11 Another Way to Share Data: Scale-Out Horizontal Scaling ... Storage Network Shared Device: A multi-LUN device shared among clients Shared Each client has exclusive access to a dedicated LUN Data ≠ Shared Data: A physical device shared among clients Clients access LUNs concurrently The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 12 Data Access with Shared/Global File System Separate logical and physical placement Metadata server File access is a three-step transaction... Metadata Metadata Metadata Server Client ServerMDS Client ServerMDS Client Step 1:Request Step 2: Metadata Step 3: Data access delivery access The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 13 Shared/Global File System Asymmetric (“SAN File System”) Client Network Application Server Application Server Application Server Application Server Application Server Application Application Application Application Application e.g. Web Server e.g. Web Server e.g. Web Server e.g. Web Server e.g. Web Server Metadata Server Metadata Server Data Server Data Server Data Server (active) (passive) Storage Network Shared Data One active metadata server Typically homogeneous (scaling limited by metadata server capacity) Inter-node distance limited by storage network capability The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 14 Shared/Global File System Symmetric (“Cluster File System”) Client Network Application Server Application Server Application Server Application Server Application Server Application Application Application Application Application (e.g. Web Server) e.g. Web Server e.g. Web Server e.g. Web Server e.g. Web Server Metadata Server Metadata Server Metadata Server Metadata Server Metadata Server (active) (active) (active) (active) (active) Data Server Data Server Data Server Data Server Data Server Storage Network Shared Data Metadata server in each node Typically homogeneous (scaling limited by internal communication, e.g., distributed locking) Inter-node distance limited by storage network capability The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 15 Network File Systems (aka Proxy File Systems) Local File System Network File System Application Application Application Application Application File System File System File System File System File System Client Client Client Client Network Protocol* File System Server * e.g. NFS, CIFS, AFP, WebDAV, FTP, HTTP, ... Enables sharing of files located on a file server among one or more client computers using a network protocol The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 16 Network File System “Stack” (Example: Sun’s NFS) Data SCSI Port SAN SCSI HBA SCSI Driver Volume Mgr Application File System NFS NFS Client Server RPC/XDR RPC/XDR TCP/IP TCP/IP Ethernet Ethernet NIC NIC LAN The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 17 Wide Area Network File Systems Consolidation eases Management Data Administration Cost Compliance SCSI Port Global file sharing and collaboration Location consolidation and optimization SAN SCSI HBA SCSI Driver Application Volume Mgr File System NFS Client NFS Server RPC/XDR RPC/XDR TCP/IP TCP/IP Ethernet NIC Ethernet NIC WAN But: WAN performance is low compared to LAN/SAN performance The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 18 Improving Wide Area File System Performance Application-specific optimizations: email, document management, SQL, ... Protocol-specific optimizations: HTTP, NFS, CIFS, WebDAV, FTP, TCP/IP, ... Data Transport acceleration: TCP accelerators Intelligent caching: read-ahead, deferred write, coherency, ... SCSI Port Data compression: algorithms, file-aware differencing, data aggregation, I/O clustering, chunk based de-duplication, cross-protocol data reduction, ... SAN SCSI HBA SCSI Driver Application Application Volume Mgr Application ApplicationNFS/CIFS File System ApplicationNFS/CIFS NFS/CIFSClient Client NFS NFS/CIFSClient NFS Server ClientRPC/XDR ClientRPC/XDR RPC/XDR RPC/XDRTCP/IP Compression Engine Compression Engine RPC/XDR RPC/XDRTCP/IP TCP/IP TCP/IPEthernet NIC TCP/IP TCP/IP TCP/IP TCP/IP TCP/IP TCP/IPEthernet NIC Ethernet Ethernet Ethernet Ethernet Ethernet NIC NIC NIC NIC NIC Ethernet NIC Ethernet NIC Ethernet NIC LAN WAN LAN The Evolution of File Systems © 2012 Storage Networking Industry Association. All Rights Reserved. 19 Distributed