Slides for Lecture 8

CHAPTER 11 AND 12 - MASS- STORAGE STRUCTURE & I/O- SYSTEMS 1 OBJECTIVES Describe physical structure of secondary storage devices and its effects on the uses of the devices Explain the performance char. of mass-storage devices Evaluate I/O scheduling algorithms Discuss OS services provided for mass storage, incl. RAID Explore the structure of an OS’s I/O subsystem Discuss principles of I/O hardware and its complexity Provide details of the performance aspects of I/O hardware and software 2 OVERVIEW OF MASS-STORAGE STRUCTURE Hard disk drives (HDDs) Nonvolatile memory (NVM) devices 3 . 1 HARD DISK DRIVES 3 . 2 HARD DISK DRIVES Magnetic disks (still) provide bulk of secondary storage of modern computers Drives rotate at 60 to 250 times per second (RPM) Transfer rate is rate at which data ow between drive and computer Positioning time (random-access time) is both seek time: time to move disk arm to desiredcylinder rotational latency time for desired sector to rotate under the disk head 3 . 3 HARD DISK DRIVES Head ies just above disc (microns) Head crash results from disk head making contact with the disk surface That’s bad Some storage media can be removable (disk, CD, DVD) Drive attached to computer via I/O bus Busses vary, including EIDE, ATA, SATA, USB, Fibre Channel, SCSI, SAS, Firewire Host controller in computer uses bus to talk to disk controller built into drive or storage array 3 . 4 MAGNETIC DISKS Platters range from .85” to 14” (historically) Commonly 3.5”, 2.5”, and 1.8” Range from 30GB to 3TB per drive 3 . 5 MAGNETIC DISKS - PERFORMANCE Performance Transfer Rate – theoretical – 6 Gb/sec Effective Transfer Rate – real – 1Gb/sec Seek time from 3ms to 12ms Average seek time measured or calculated based on 1/3 of tracks Latency based on spindle speed 3 . 6 THE FIRST COMMERCIAL DISK DRIVE 3 . 7 NONVOLATILE MEMORY DEVICES Electrical instead of mechanical Growing in importance Controller and ash NAND die semiconductor chips DRAM with battery backup Flash-memory-based NVM used in a disk-drive-like container ⇒ solid-state disk (SSD) 3 . 8 SOLID-STATE DISKS Nonvolatile memory used like a hard drive Many technology variations Can be more reliable than HDDs More expensive per MB Uses less power Maybe have shorter life span? Less capacity → But much faster No moving parts, so no seek time or rotational latency 3 . 9 NAND SEMICONDUCTOR CHARACTERISTICS Can be read and written in a “page” increment (similar to a sector), Data cannot be overwritten — rather, the NAND cells have to be erased rst. Erasure, occurs in a “block” increment that is several pages in size Takes much more time than a read (the fastest operation) or a write (slower than read, but much faster than erase). 3 . 10 LIFESPAN NAND semiconductors also deteriorate with every erase cycle After approximately 100,000 program-erase cycles they stop retaining data NAND NVM lifespan is not measured in years but Drive Writes Per Day (DWPD) Measure how many times the drive capacity can be written per day before the drive fails. Example, a 1 TB NAND drive with a 5 DWPD rating is expected to have 5 TB per day written to it for the warranty period without failure. 3 . 11 NAND FLASH CONTROLLER ALGORITHMS 3 . 12 NAND FLASH CONTROLLER ALGORITHMS 3 . 13 NAND FLASH CONTROLLER ALGORITHMS Consider a full SSD with a pending write request - some individual pages could contain invalid data. Where would garbage collection store valid data? 3 . 14 NAND FLASH CONTROLLER ALGORITHMS To solve this problem and improve write performance, the NVM device uses over-provisioning. Device sets aside a number of pages (frequently 20 percent of the total) as an area always available to write to. Over-provisioning space can also help with wear leveling: Controller tries to avoid that by using various algorithms to place data on less- erased blocks 3 . 15 VOLATILE MEMORY DRAM is frequently used as a mass-storage device. RAM drives (which are known by many names, including RAM disks) act like secondary storage Created by device drivers that carve out a section of the system’s DRAM and present it to the rest of the system as it if were a storage device. 3 . 16 MAGNETIC TAPE Was early secondary-storage medium Evolved from open spools to cartridges Relatively permanent and holds large quantities of data Access time slow Random access ~1000 times slower than disk 3 . 17 MAGNETIC TAPE Mainly used for backup, storage of infrequently-used data, transfer medium between systems Kept in spool and wound or rewound past read-write head Once data under head, transfer rates comparable to disk 140MB/sec and greater 200GB to 1.5TB typical storage 3 . 18 ADDRESS MAPPING AND DISK STRUCTURE Disk drives are addressed as large 1-dimensional arrays of logical blocks, where the logical block is the smallest unit of transfer Low-level formatting creates logical blocks on physical media 3 . 19 ADDRESS MAPPING AND DISK STRUCTURE The 1-dimensional array of logical blocks is mapped into the sectors of the disk sequentially Sector 0 is the rst sector of the rst track on the outermost cylinder Mapping proceeds in order through that track, then the rest of the tracks in that cylinder, and then through the rest of the cylinders from outermost to innermost 3 . 20 DISK STRUCTURE A C B D 3 . 21 DISK STRUCTURE Logical to physical address should be easy Except for bad sectors Non-constant # of sectors per track via constant angular velocity 3 . 22 HDD SCHEDULING 4 . 1 DISK SCHEDULING The operating system is responsible for using hardware efciently — for the disk drives, this means having a fast access time and disk bandwidth Minimize seek time Seek time ≈ seek distance Disk bandwidth is the total number of bytes transferred, divided by the total time between the rst request for service and the completion of the last transfer We can improve access time and the bandwidth by managing the order in which storage I/O requests are serviced 4 . 2 DISK SCHEDULING I/O request includes input or output mode File handle Memory address Amount of data to transfer Absolute knowledge of head location and physical block/cylinder locations is generally not possible on modern drives 4 . 3 DISK SCHEDULING The current goals of disk scheduling include fairness, timeliness, and optimizations, such as bunching reads or writes that appear in sequence, as drives perform best with sequential I/O. 4 . 4 DISK SCHEDULING OS maintains queue of requests, per disk or device Idle disk can immediately work on I/O request, busy disk means work must queue Optimization algorithms only make sense when a queue exists Drive controllers have small buffers and can manage a queue of I/O requests (of varying “depth”) Several algorithms exist to schedule the servicing of disk I/O requests The analysis is true for one or many platters 4 . 5 DISK SCHEDULING We illustrate scheduling algorithms with a request queue (0- 199) 98, 183, 37, 122, 14, 124, 65, 67 Head pointer 53 4 . 6 FCFS Illustration shows total head movement of 640 cylinders 4 . 7 SCAN The disk arm starts at one end of the disk, and moves toward the other end, servicing requests until it gets to the other end of the disk, where the head movement is reversed and servicing continues. SCAN algorithm Sometimes called the elevator algorithm Illustration shows total head movement of 208 cylinders But note that if requests are uniformly dense, largest density at other end of disk and those wait the longest 4 . 8 SCAN 4 . 9 C-SCAN Provides a more uniform wait time than SCAN The head moves from one end of the disk to the other, servicing requests as it goes When it reaches the other end, however, it immediately returns to the beginning of the disk, without servicing any requests on the return trip Treats the cylinders as a circular list that wraps around from the last cylinder to the rst one Total number of cylinders? 4 . 10 C-SCAN 4 . 11 SELECTING AN ALGORITHM SCAN and C-SCAN perform better for systems that place a heavy load on the disk Less starvation Performance depends on the number and types of requests Requests for disk service can be inuenced by the le-allocation method And metadata layout 4 . 12 SELECTING AN ALGORITHM The disk-scheduling algorithm should be written as a separate module of the operating system, allowing it to be replaced with a different algorithm if necessary What about rotational latency? Difcult for OS to calculate How does disk-based queuing effect OS queue ordering efforts? 4 . 13 NVM SCHEDULING Disk-scheduling algorithms just discussed apply to mechanical platter-based storage like HDDs. Their focus: Minimizing disk head movement 5 . 1 NVM SCHEDULING NVM devices do not contain moving disk heads and commonly use a simple FCFS policy. Linux NOOP scheduler uses an FCFS policy but modies it to merge adjacent requests. Random access I/O is much faster on NVM. 5 . 2 IMPACT OF GARBAGE COLLECTION Consider an NVM device under random read and write load. 5 . 3 IMPACT OF GARBAGE COLLECTION one write request eventually causes a page write (the data), one or more page reads (by garbage collection), and one or more page writes (of good data from the garbage-collected blocks) ⇒ write amplication 5 . 4 ERROR DETECTION AND CORRECTION Frequent in memory, networking, and storage. 6 . 1 ERROR DETECTION Memory systems have long detected certain errors by using parity bits. For each byte store extra bit Single bit error detected - double might go undetected In networking: Cyclic Redundancy Check 6 .

Slides for Lecture 8

United States Patent (10) Patent N0.: US 7,290,102 B2 Lubbers Et A]

Cortex-A9 Single Core Microarchitecture

Memory Hierarchy Summary

Lecture 22: Mass Storage

Tuning IBM System X Servers for Performance

POWER7 and POWER7+ Optimization and Tuning Guide

Fasttrak Tx2200 / Tx2300 Tx4200 / Tx4300 User Manual

Introduction to Computer Systems 15-213/18-243, Spring 2009

Introduction to Computer Systems 15-213/18-243, Spring 2009

Spheras Storage Director Installation & User Guide

Algorithms and Data Structures for External Memory Algorithms and Data Structures 2:4 for External Memory Jeffrey Scott Vitter

CS429: Computer Organization and Architecture Storage Technologies