12. File-System Implementation
Total Page:16
File Type:pdf, Size:1020Kb
12.12. FiFilele--SystemSystem ImpImpllemeemenntattatiionon SungyoungLee College of Engineering KyungHeeUniversity ContentsContents n File System Structure n File System Implementation n Directory Implementation n Allocation Methods n Free-Space Management n Efficiency and Performance n Recovery n Log-Structured File Systems n NFS Operating System 1 OverviewOverview n User’s view on file systems: ü How files are named? ü What operations are allowed on them? ü What the directory tree looks like? n Implementor’sview on file systems: ü How files and directories are stored? ü How disk space is managed? ü How to make everything work efficiently and reliably? Operating System 2 FileFile--SySysstemtem StrStruucturecture n File structure ü Logical storage unit ü Collection of related information n File system resides on secondary storage (disks) n File system organized into layers n File control block ü storage structure consisting of information about a file Operating System 3 LayeLayerreded FFililee SystemSystem Operating System 4 AA TypicalTypical FFileile ControlControl BlockBlock Operating System 5 FileFile--SySysstemtem ImImpplementationlementation n On-disk structure ü Boot control block § Boot block(UFS) or Boot sector(NTFS) ü Partition control block § Super block(UFS) or Master file table(NTFS) ü Directory structure ü File control block (FCB) § I-node(UFS) or In master file table(NTFS) n In-memory structure ü In-memory partition table ü In-memory directory structure ü System-wide open file table ü Per-process open file table Operating System 6 OnOn--DiskDisk StructStructuurere bootboot bblloockck : fsmetadata susupperer bblloockck (type, # blocks, etc.) bitmaps : data structures for boot code bitmaps free space mgmt. partition table Master Boot Record i-ni-noodesdes : file metadata Partition 1 FS- rootroot ddirir (active) dependent Partition 2 filesfiles && directdirectooriesries Partition 3 Operating System 7 InIn--MemoryMemory StructureStructure file table in-memory (system-wide partition table process A open-file table) countcount offsetoffset fifillee attrattriibubutteses directory cache per-process file descriptor table (per-process open-file table) buffer cache process B Operating System 8 InIn--MemoryMemory FileFile SystSysteemm StructuresStructures n The following figure illustrates the necessary file system structures provided by the operating systems n Figure 12-3(a) refers to opening a file n Figure 12-3(b) refers to reading a file Operating System 9 InIn--MemoryMemory FileFile SystSysteemm StructuresStructures Operating System 10 VirtualVirtual FFileile SystemsSystems n Virtual File Systems (VFS) provide an object-oriented way of implementing file systems n VFS allows the same system call interface (the API) to be used for different types of file systems n The API is to the VFS interface, rather than any specific type of file system Operating System 11 ScheSchemamatictic VViewiew ofof VVirtuirtuaall FilFilee SystemSystem Operating System 12 FileFile SystemSystem IInnternalsternals System call interface VirtualVirtual FileFile SystemSystem (VFS)(VFS) mmiinixnix nfsnfs ext2ext2 dosfsdosfs … mmfsmmfs procfsprocfs bufferbuffer cachecache File System devicedevice driverdriver Operating System 13 VFSVFS n Virtual File System ü Manages kernel-level file abstractions in one format for all file systems ü Receives system call requests from user-level (e.g., open, write, stat, etc.) ü Interacts with a specific file system based on mount point traversal ü Receives requests from other parts of the kernel, mostly from memory management ü Translates file descriptors to VFS data structures (such as vnode) n Linux: VFS common file model ü Thesuperblockobject § stores information concerning a mounted file system ü Theinodeobject § stores general information about a specific file ü The file object § stores information about the interaction between an open file and a process ü Thedentryobject § stores information about the linking of a directory entry with the corresponding file Operating System 14 DirectoDirectorryy ImImpplementationlementation n Linear list of file names with pointer to the data blocks ü simple to program ü time-consuming to execute n Hash Table ü linear list with hash data structure ü decreases directory search time ü collisions § situations where two file names hash to the same location ü fixed size Operating System 15 DirectoDirectorryy ImImpplementationlementation n The location of metadata ü In the directory entry ““foofoo”” owner,owner, size,size, ACL,ACL, accessaccess time,time, loclocaation,tion, …… ““barbar”” owner,owner, size,size, ACL,ACL, accessaccess time,time, loclocaation,tion, …… ü In the separate ““foofoo”” owownnerer ssiizeze data structure ““barbar”” owownnerer ACLACL ssiizeze accessaccess ttimeime (e.g., i-node) …… ACLACL llocatocatiion,on, …… accessaccess ttimeime llocatocatiion,on, …… ü A hybrid approach ““ffoooo”” llooccaattiionon owner,owner, ssiize,ze, …… ““barbar”” llooccaattiionon owner,owner, ssiize,ze, …… Operating System 16 AllocationAllocation MetMethhodsods n An allocation method refers to how disk blocks are allocated forfiles ü Contiguous allocation ü Linked allocation ü Indexed allocation Operating System 17 ContigContiguuousous AllAlloocationcation n Each file occupies a set of contiguous blocks on the disk n Simple ü only starting location (block #) and length (number of blocks) are required n Random access n Wasteful of space (dynamic storage-allocation problem) n Files cannot grow Operating System 18 ContigContiguuousous AllAlloocationcation ooff DiskDisk SSpacepace Operating System 19 ExtentExtent--BBaasedsed SysteSystemsms n Many newer file systems (I.e. Veritas File System) use a modified contiguous allocation scheme n Extent-based file systems allocate disk blocks in extents n An extent is a contiguous block of disks ü Extents are allocated for file allocation ü A file consists of one or more extents Operating System 20 ContigContiguuousous AllAlloocationcation n Advantages ü The number of disk seeks is minimal ü Directory entries can be simple: <file name, starting disk block, length, etc.> n Disadvantages ü Requires a dynamic storage allocation: First / best fit ü External fragmentation: may require a compaction ü The file size is hard to predict and varying over time n Feasible and widely used for CD-ROMS ü All the file sizes are known in advance ü Files will never change during subsequent use Operating System 21 LinkedLinked AllocationAllocation n Each file is a linked list of disk blocks ü Blocks may be scattered anywhere on the disk block = pointer Operating System 22 LinkedLinked AllocationAllocation (Co(Contnt’’d)d) n Simple ü need only starting address n Free-space management system ü no waste of space n No random access n File-allocation table (FAT) ü disk-space allocation used by MS-DOS and OS/2 Operating System 23 LinkedLinked AllocationAllocation Operating System 24 FileFile--AllAlloocationcation TTableable Operating System 25 LinkedLinked AllocationAllocation n Advantages ü Directory entries are simple: <file name, starting block, ending block, etc.> ü No external fragmentation § the disk blocks may be scattered anywhere on the disk ü A file can continue to grow as long as free blocks are available n Disadvantages ü It can be used only for sequentially accessed files ü Space overhead for maintaining pointers to the next disk block ü The amount of data storage in a block is no longer a power of two because the pointer takes up a few bytes ü Fragile: a pointer can be lost or damaged Operating System 26 LinkedLinked AllocationAllocation usiusinngg ClustersClusters n Collect blocks into multiples (clusters) and allocate the clusters to files ü e.g., 4 blocks / 1 cluster n Advantages ü The logical-to-physical block mapping remains simple ü Improves disk throughput (fewer disk seeks) ü Reduced space overhead for pointers n Disadvantages ü Internal fragmentation Operating System 27 IndexedIndexed AllocationAllocation n Brings all pointers together into the index block n Logical view index table n Need index table n Random access n Dynamic access without external fragmentation, but have overheadof index block Operating System 28 ExampleExample ofof IIndndexedexed AllocationAllocation Operating System 29 IndexedIndexed AllocationAllocation –– MappinMappingg ((CContont’’d)d) M outer-index index table file Operating System 30 CombinedCombined SchScheeme:me: UNIXUNIX (4K(4K bytesbytes perper block)block) Operating System 31 IndexedIndexed AllocationAllocation n Advantages ü Supports direct access, without suffering from external fragmentation ü I-node need only be in memory when the corresponding file is open n Disadvantages ü Space overhead for indexes: (1) Linked scheme: link several index blocks (2) Multilevel index blocks (3) Combined scheme: UNIX -12 direct blocks, single indirect block, double indirect block, triple indirect block Operating System 32 FreeFree--SpaceSpace ManagementManagement n Bit vector (n blocks) 012n-1 … 1 Þ block[i] free bit[i] = 0 Þ block[i] occupied Block number calculation (number of bits per word) * (number of 0-value words) + offset of first 1 bit Operating System 33 FreeFree--SpaceSpace ManagementManagement (Cont(Cont’’d)d) n Bit map requires extra space. Example: block size = 212 bytes disk size = 230 bytes (1 gigabyte) n = 230/212 = 218 bits (or 32K bytes) n Easy to get contiguous files n Linked list (free list) ü Cannot get contiguous space easily ü No waste of space n Grouping n Counting Operating System 34 LinkedLinked FreeFree