Using Model Checking to Find Serious File System Errors

Using Model Checking to Find Serious File System Errors Junfeng Yang, Paul Twohey, Dawson Engler ∗ Madanlal Musuvathi {junfeng, twohey, engler}@cs.stanford.edu [email protected] Computer Systems Laboratory Microsoft Research Stanford University One Microsoft Way Stanford, CA 94305, U.S.A. Redmond, WA 98052, U.S.A. Abstract Not only are errors in file systems dangerous, file This paper shows how to use model checking to find system code is simultaneously both difficult to reason serious errors in file systems. Model checking is a for- about and difficult to test. The file system must cor- mal verification technique tuned for finding corner-case rectly recover to an internally consistent state if the sys- errors by comprehensively exploring the state spaces de- tem crashes at any point, regardless of what data is being fined by a system. File systems have two dynamics that mutated, flushed or not flushed to disk, and what invari- make them attractive for such an approach. First, their ants have been violated. Anticipating all possible failures errors are some of the most serious, since they can de- and correctly recovering from them is known to be hard; stroy persistent data and lead to unrecoverable corrup- our results do not contradict this perception. tion. Second, traditional testing needs an impractical, The importance of file system errors has led to the de- exponential number of test cases to check that the sys- velopment of many file system stress test frameworks; tem will recover if it crashes at any point during execu- two good ones are [24,30]. However, these focus mostly tion. Model checking employs a variety of state-reducing on non-crash based errors such as checking that the file techniques that allow it to explore such vast state spaces system operations create, delete and link objects cor- efficiently. rectly. Testing that a file system correctly recovers from We built a system, FiSC, for model checking file sys- a crash requires doing reconstruction and then compar- tems. We applied it to three widely-used, heavily-tested ing the reconstructed state to a known legal state. The file systems: ext3 [13], JFS [21], and ReiserFS [27]. We cost of a single crash-reboot-reconstruct cycle (typically found serious bugs in all of them, 32 in total. Most have a minute or more) makes it impossible to test more than led to patches within a day of diagnosis. For each file a tiny fraction of the exponential number of crash pos- system, FiSC found demonstrable events leading to the sibilities. Consequently, just when implementors need unrecoverable destruction of metadata and entire direc- validation the most, testing is least effective. Thus, even tories, including the file system root directory “/”. heavily-tested systems have errors that only arise after they are deployed, making their errors all but impossible 1 Introduction to eliminate or even replicate. In this paper, we use model checking to systematically File system errors are some of the most destructive errors test and find errors in file systems. Model checking [5, possible. Since almost all deployed file systems reside 19,22] is a formal verification technique that systemat- in the operating system kernel, even a simple error can ically enumerates the possible states of a system by ex- crash the entire system, most likely in the midst of a mu- ploring the nondeterministic events in the system. Model tation to stable state. Bugs in file system code can range checkers employ various state reduction techniques to from those that cause “mere” reboots to those that lead efficiently explore the resulting exponential state space. to unrecoverable errors in stable on disk state. In such For instance, generated states can be stored in a hash ta- cases, mindlessly rebooting the machine will not correct ble to avoid redundantly exploring the same state. Also, or mask the errors and, in fact, can make the situation by inspecting the system state, model checkers can iden- worse. tify similar set of states and prioritize the search towards ∗ previously unexplored behaviors in the system. When This research was supported by NSF grant CCR-0326227 and DARPA grant F29601-03-2-0117. Dawson Engler is partially sup- applicable, such a systematic exploration can achieve the ported by Coverity and an NSF Career award. effect of impractically massive testing by avoiding the ¦ " # ¦ # ¦ ¨ ¦ " # ¦ # ¦ redundancy that would occur in conventional testing. ! § ¦ E § ¦ © ¦ ! ¤ ¡ ¢ £ The dominant cost of traditional model checking is the ¥ ¦ § ¨ ¦ © effort needed to write an abstract specification of the sys- ¤ tem (commonly referred to as the “model”). This up- $ $ $ ¤ > ? @ A B C@ D front cost has traditionally made model checking com- ¢ pletely impractical for large systems. A sufficiently de- I # ¦ § © # ¨ J ¦ § % ¦ § ! ' tailed model can be as large as the checked system. Em- ! ¦ ¥ F E ¨ § ¦ : ; & ' & ' ! ¨ pirically, implementors often refuse to write them; those ¨ " # ¦ # ¦ that are written have errors and, even if they do not, they ¦ © ¨ ¥ : ; < ¥ § : ; = § ¦ : ; “drift” as the implementation is modified but the model 9 9 9 F G < ¥ § : H § H ; is not [6]. E ¨ % § ¨ § Recent work has developed implementation-level © ¦ ¦ ¥ ¦ § & ' & ! ¡ ¢ £ ¦ § ¨ ¦ © model checkers that check implementation code directly ¥ without requiring an abstract specification [18,25,26]. ¤ We leverage this approach to create a model checking in- frastructure, the File System Checker (FiSC), which lets ) * + , , - . implementors model-check real, unmodified file systems ( / 0 1 ) 2 0 1 3 4 5 * 3 - - 0 4 3 0 - ) . with relatively little model checking knowledge. FiSC ) - ) 5 . is built on CMC, an explicit state space, implementation / / , - ) 5 2 4 6 ) 5 2 1 7 4 3 0 1 3 8 4 . 4 ( + , , - ) 5 . model checker we developed in previous work [25,26], which lets us run an entire operating system inside of the model checker. This allows us to check a file system in situ rather than attempting the difficult task of extracting it from the operating system kernel. Figure 1: State exploration and checking overview. We applied FiSC to three widely-used, heavily-tested FiSC’s main loop picks a state S from the state queue and file systems, JFS [21], ReiserFS [27], and ext3 [13]. We then iteratively generates its successor states by applying found serious bugs in all of them, 32 in total. Most have each possible operation to a restored copy of S. Thegen- 0 led to patches within a day of diagnosis. For each file erated state S is checked for validity and, if valid and not system, FiSC found demonstrable events leading to the explored before, inserted onto the state queue. unrecoverable destruction of metadata and entire directories, including the file system root directory “/”. writes to and truncates files; and mounts and unmounts The rest of the paper is as follows. We give an the file system. Figure 1 shows this process. overview of both FiSC (§2) and how to check a file sys- As each new state is generated, we intercept all disk tem with it (§3). We then describe: the checks FiSC writes done by the checked file system and forward them performs (§4), the optimizations it does (§5), and how to the permutation checker, which checks that the disk it checks file system recovery code (§6). We then discuss is in a state that fsck can repair to produce a valid results (§7) and our experiences using FiSC (§8), includ- file system after each subset of all possible disk writes. ing sources of false positives and false negatives. We This avoids storing a separate state for each permutation then conclude. and allows FiSC to choose which permutations to check. This checker is explained in Section 4.2. We run fsck 2 Checking Overview on the host system outside of the model checker and use Our system is comprised of four parts: (1) CMC, an ex- a small shared library to capture all the disk accesses plicit state model checker running the Linux kernel, (2) fsck makes while repairing the file system generated a file system test driver, (3) a permutation checker which by writing a permutation. We feed these fsck gener- verifies that a file system can recover no matter what or- ated writes into the crash recovery checker. This checker der buffer cache contents are written to disk, and (4) a allows FiSC to recursively check for failures in fsck fsck recovery checker. The model checker starts in an and is covered in Section 6. initial pristine state (an empty, formatted disk) and re- Figure 2 outlines the operation of the permutation and cursively generates and checks successive states by sys- fsck recovery checkers. Both checkers copy the disk tematically executing state transitions. Transitions are from the starting state of a transition and write onto the either test driver operations or FS-specific kernel threads copy to avoid perturbing the system. After the copied which flush blocks to disk. The test driver is conceptu- disk is modified the model checker traverses its file sys- ally similar to a program run during testing. It creates, tem, recording the properties it checks for consistency in removes, and renames files, directories, and hard links; a model of the file system. Currently these are the name, " ( atop the system call interface. The other layer provides " ¡ ¢ £ ¤ ¥ ¦ £ § ¨ © a “fake environment” that the checked system runs on. ¥ ) + * We need this environment model because the checked file system does not run on bare hardware. Instead, FiSC ¡ ¢ £ ¤ ¥ ¦ £ § ¨ © ( ¥ provides a virtual block device that models a disk as a collection of sectors that can be written atomically. The 0 $ ( # £ $ % & £ ¤ 1 block device driver layer is a natural place to cut as it is , £ ¤ ¦ - ¨ © ¨ ¥ % § . / £ $ £ ¤ . / £ $ £ ¤ the only relatively well-documented boundary between in-core and persistent data. 2 Modern Unix derivatives provide a Virtual File Sys- + # £ $ % & £ ¤ £ ' + 4 tem (VFS) interface [28].

Using Model Checking to Find Serious File System Errors

Study of File System Evolution

W4118: Linux File Systems

Certifying a File System

XFS: There and Back ...And There Again? Slide 1 of 38

BSD UNIX Toolbox 1000+ Commands for Freebsd, Openbsd

Filesystem Considerations for Embedded Devices ELC2015 03/25/15

Filesystems HOWTO Filesystems HOWTO Table of Contents Filesystems HOWTO

Acronis® Disk Director® 12 User's Guide

Journaling File Systems

State of the Art: Where We Are with the Ext3 Filesystem

Model-Based Failure Analysis of Journaling File Systems

Outline of Ext4 File System & Ext4 Online Defragmentation Foresight