Modeling the Linux page cache for accurate simulation of data-intensive applications

Hoang-Dung Do∗, Valérie Hayot-Sasson∗, Rafael Ferreira da Silva‡, Christopher Steele§, Henri Casanova†, Tristan Glatard∗

∗Department of Computer Science and Software Engineering, Concordia University, Montreal, Canada
†Department of Information and Computer Sciences, University of Hawai‘i at Mānoa, USA
‡Information Sciences Institute, University of Southern California, Marina Del Rey, CA, USA
§Department of Psychology, Concordia University, Montreal, Canada

Abstract—The emergence of Big Data in recent years has resulted in a growing need for efficient data processing solutions. While infrastructures with sufficient compute power are available, the I/O bottleneck remains. The Linux page cache is an efficient approach to reduce I/O overheads, but few experimental studies of its interactions with Big Data applications exist, partly due to limitations of real-world experiments. Simulation is a popular approach to address these issues, however, existing simulation frameworks do not simulate page caching fully, or even at all. As a result, simulation-based performance studies of data-intensive applications lead to inaccurate results.

In this paper, we propose an I/O simulation model that includes the key features of the Linux page cache. We have implemented this model as part of the WRENCH workflow simulation framework, which itself builds on the popular SimGrid distributed systems simulation framework. Our model and its implementation enable the simulation of both single-threaded and multithreaded applications, and of both writeback and writethrough caches for local or network-based filesystems. We evaluate the accuracy of our model in different conditions, including sequential and concurrent applications, as well as local and remote I/Os. We find that our page cache model reduces the simulation error by up to an order of magnitude when compared to state-of-the-art, cacheless simulations.

I. INTRODUCTION

The Linux page cache plays an important role in reducing filesystem data transfer times. With the page cache, previously read data can be re-read directly from memory, and written data can be written to memory before being asynchronously flushed to disk, resulting in improved I/O performance on slower storage devices. The performance improvements depend on many factors including the total amount of memory, the amount of data being written (i.e., dirty data), and the amount of memory available for written data. All these factors are important when determining the impact of I/O on application performance, particularly in data-intensive applications.

The number of data-intensive applications has been steadily rising as a result of open-data and data sharing initiatives. Due to the sheer size of the data being processed, these applications must be executed on large-scale infrastructures such as High Performance Computing (HPC) clusters or the cloud. It is thus crucial to quantify the performance of these applications on these platforms. The goals include determining which type of hardware/software stacks are best suited to different application classes, as well as understanding the limitations of current algorithms, designs and technologies. Unfortunately, performance studies relying on real-world experiments on compute platforms face several difficulties (high operational costs, labor-intensive experimental setups, shared platforms with dynamic loads that hinder reproducibility of results) and shortcomings (experiments are limited to the available platform/software configurations, which precludes the exploration of hypothetical scenarios). Simulations address these concerns by providing models and abstractions for the performance of computer hardware, such as CPU, network and storage. As a result, simulations provide a cost-effective, fast, easy and reproducible way to evaluate application performance on arbitrary platform configurations. It thus comes as no surprise that a large number of simulation frameworks have been developed and used for research and development [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13].

Page caching is an ubiquitous technique for mitigating the I/O bottleneck. As such, it is necessary to model it when simulating data-intensive applications. While existing simulation frameworks of parallel and distributed computing systems capture many relevant features of hardware/software stacks, they lack the ability to simulate page cache with enough details to capture key features such as dirty data and cache eviction policies [5], [6]. Some simulators, such as the one in [14], do capture such features, but are domain-specific.

In this work, we present WRENCH-cache, a page cache simulation model implemented in WRENCH [13], a workflow simulation framework based on the popular SimGrid distributed simulation toolkit [11]. Our contributions are:

• A page cache simulation model that supports both single-threaded and multithreaded applications, and both writeback and writethrough caches for local or network-based filesystems;
• An implementation of this model in WRENCH;
• An evaluation of the accuracy and scalability of our model, and of its implementation, for multiple applications, execution scenarios, and page cache configurations.

II. RELATED WORK

A. Page cache

Page cache offsets the cost of disk I/O by enabling I/O to occur directly from memory. When a file is first loaded into memory, the file is read from disk and loaded into the page cache as a series of pages. Subsequent reads to any of the file pages located in memory will result in a cache hit, meaning the I/O can occur directly from memory without disk involvement. Any accessed page not loaded in memory results in a cache miss, resulting in the page being read directly from disk.

Written pages can also contribute to future application cache hits. When use of page cache is enabled for a given filesystem, through the enabling of writeback or writethrough cache, all written pages are written first to page cache, prior to being written to disk. Accessing these written pages may result in cache hits, should the pages remain in memory.

The kernel may also provide improved write performance if writeback cache is enabled. Unlike writethrough, where the data is synchronously written from memory to disk, writeback enables asynchronous writes. With writeback, the application may proceed with execution once the data has been written to memory, even if it has not yet been materialized to disk. The writeback strategy is considered to outperform writethrough as well as direct I/O (page cache bypassed for I/O) as it delays disk writes to perform a bulk write at a later time [15].

Cache eviction and flushing strategies are integral to proper page cache functioning. Whenever space in memory becomes limited, either as a result of application memory or page cache use, page cache data may be evicted. Only data that has been persisted to storage (clean pages) can be flagged for eviction and removed from memory. Written data that has not yet been persisted to disk (dirty data) must first be copied (flushed) to storage prior to eviction. When a sufficient amount of memory is occupied, flushing is synchronous. However, even when there is sufficient available memory, written data will be flushed to disk at a predefined interval through a process known as periodical flushing. Periodical flushing only flushes expired dirty pages, which have remained dirty in the page cache longer than an expiration time configured in the kernel. Different cache eviction algorithms have also been proposed [16].

The Linux kernel uses a two-list strategy to flag pages for eviction. The two-list strategy is based on a least recently used (LRU) policy and uses an active and an inactive list in its implementation. If accessed pages are not in the page cache, they are added to the inactive list. Should pages located on the inactive list be accessed, they are moved from the inactive to the active list. The lists are also kept balanced by moving pages from the active list to the inactive list when the active list grows too large. Thus, the active list only contains pages which have been accessed more than once and are not evictable, while the inactive list includes pages accessed once only, or pages that have been accessed more than once but were moved from the active list. Both lists operate using LRU eviction policies, meaning that data that has not been accessed recently will be moved first.
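As a rough illustration of the two-list strategy described above, the following Python sketch (a simplification written for this discussion, not kernel code) promotes pages from the inactive to the active list on their second access and demotes the least recently used active pages when the active list grows too large. The balancing rule shown here mirrors the simulator's choice, described in Section III, of keeping the active list at most twice the size of the inactive list.

from collections import OrderedDict

class TwoListLRU:
    """Toy model of the kernel's active/inactive page lists; illustrative only."""

    def __init__(self):
        # OrderedDicts keep least recently used entries first.
        self.inactive = OrderedDict()
        self.active = OrderedDict()

    def access(self, page):
        if page in self.active:
            self.active.move_to_end(page)      # refresh LRU position
        elif page in self.inactive:
            del self.inactive[page]
            self.active[page] = None           # second access: promote to active list
            self._rebalance()
        else:
            self.inactive[page] = None         # first access: enter the inactive list

    def _rebalance(self):
        # Keep the active list at most twice as large as the inactive list,
        # mirroring the rule used by the simulator in Section III.
        while len(self.active) > 2 * len(self.inactive):
            page, _ = self.active.popitem(last=False)   # demote LRU active page
            self.inactive[page] = None

    def evict_one(self):
        """Evict the least recently used inactive page (only clean pages, in the kernel)."""
        return self.inactive.popitem(last=False)[0] if self.inactive else None

The real kernel behavior is considerably more involved (per-zone lists, reference bits, dirty-page constraints), which is one reason the model presented in Section III works at the coarser granularity of data blocks rather than pages.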
B. Simulation

Many simulation frameworks have been developed to enable the simulation of parallel and distributed applications [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13]. These frameworks implement simulation models and abstractions to aid the development of simulators for studying the functional and performance behaviors of application workloads executed on various hardware/software infrastructures.

The two main concerns for simulation are accuracy, the ability to faithfully reproduce real-world executions, and scalability, the ability to simulate large/long real-world executions quickly and with low RAM footprint. The above frameworks achieve different compromises between the two. At one extreme are discrete-event models that capture "microscopic" behaviors of hardware/software systems (e.g., packet-level network simulation, block-level disk simulation, cycle-accurate CPU simulation), which favor accuracy over speed. At the other extreme are analytical models that capture "macroscopic" behaviors via mathematical models. While these models lead to fast simulation, they must be developed carefully if high levels of accuracy are to be achieved [17].

In this work, we use the SimGrid and WRENCH simulation frameworks. The years of research and development invested in the popular SimGrid simulation framework [11] have culminated in a set of state-of-the-art macroscopic simulation models that yield high accuracy, as demonstrated by (in)validation studies and comparisons to competing frameworks [18], [17], [19], [20], [21], [22], [23], [24], [25], [26]. But one significant drawback of SimGrid is that its simulation abstractions are low-level, meaning that implementing a simulator of complex systems can be labor-intensive [27]. To remedy this problem, the WRENCH simulation framework [13] builds on top of SimGrid to provide higher-level simulation abstractions, so that simulators of complex applications and systems can be implemented with a few hundred lines of code.

Although the Linux page cache has a large impact on I/O performance, and thus on the execution of data-intensive applications, its simulation is rarely considered in the above frameworks. Most frameworks merely simulate I/O operations based on storage bandwidths and capacities. The SIMCAN framework does model page caching by storing data accessed on disk in a block cache [5]. Page cache is also modeled in iCanCloud through a component that manages memory accesses and cached data [6]. However, the scalability of the iCanCloud simulator is limited as it uses microscopic models. Besides, none of these simulators provide writeback cache simulation or cache eviction policies through LRU lists. Although cache replacement policies are applied in [14] to simulate in-memory caching, this simulator is specific to energy consumption in multi-tier heterogeneous networks.

In this study, we implement a page cache simulation model in the WRENCH framework. We targeted WRENCH because it is a recent, actively developed framework that provides convenient simulation abstractions, because it is extensible, and because it reuses SimGrid's scalable and accurate models.

Fig. 1: Overview of the page cache simulator. Applications send file read or write requests to the I/O Controller that orchestrates flushing, eviction, cache and disk accesses with the Memory Manager. Concurrent accesses to storage devices (memory and disk) are simulated using existing models. [Diagram omitted: Application (fileRead, fileWrite) → I/O Controller (readFromCache, readToCache, flush, writeToCache, evict) → Memory Manager (main thread and background thread for periodical flushing) → page cache LRU lists and storage devices (memory, disks).]

Fig. 2: Model of page cache LRU lists with data blocks. [Diagram omitted: example inactive and active lists of data blocks, each block recording file name, size, entry time, last access time, and dirty flag.]

III. METHODS

We separate our simulation model in two components, the I/O Controller and the Memory Manager, which together simulate file reads and writes (Figure 1). To read or write a file chunk, a simulated application sends a request to the I/O Controller. The I/O Controller interacts as needed with the Memory Manager to free memory through flushing or eviction, and to read or write cached data. The Memory Manager implements these operations, simulates periodical flushing and eviction, and reads or writes to disk when necessary. In case the writethrough strategy is used, the I/O Controller directly writes to disk, the cache is flushed if needed, and the written data is added to the page cache.

A. Memory Manager

The Memory Manager simulates two parallel threads: the main one implements flushing, eviction, and cached I/Os synchronously, whereas the second one, which operates in the background, periodically searches for expired dirty data in the LRU lists and flushes this data to disk. We use existing storage simulation models [21] to simulate disk and memory, characterized by their storage capacity, read and write bandwidths, and latency. These models account for bandwidth sharing between concurrent memory or disk accesses.

1) Page cache LRU lists: In the Linux kernel, page cache LRU lists contain file pages. However, due to the large number of file pages, simulating lists of pages induces substantial overhead. Therefore, we introduce the concept of a data block as a unit to represent data cached in memory. A data block is a subset of file pages stored in page cache that were accessed in the same I/O operation. A data block stores the file name, block size, last access time, a dirty flag that represents whether the data is clean (0) or dirty (1), and an entry (creation) time. Blocks can have different sizes and a given file can have multiple data blocks in page cache. In addition, a data block can be split into an arbitrary number of smaller blocks.

We model page cache LRU lists as two lists of data blocks, an active list and an inactive list, both ordered by last access time (earliest first, Figure 2). As in the kernel, our simulator limits the size of the active list to twice the size of the inactive list, by moving least recently used data blocks from the active list to the inactive list [28], [15].

At any given time, a file can be partially cached, completely cached, or not cached at all. A cached data block can only reside in one of the two LRU lists. The first time they are accessed, blocks are added to the inactive list. On subsequent accesses, blocks of the inactive list are moved to the top of the active list. Blocks written to cache are marked dirty until flushed.

2) Reads and writes: Our simulation model supports chunk-by-chunk file accesses with a user-defined chunk size. However, for simplicity, we assume that file pages are accessed in a round-robin fashion rather than fully randomly. Therefore, when a file is read, cached data is read only after all uncached data has been read, and data from the inactive list is read before data from the active list (data reads occur from left to right in Figure 3). When a chunk of uncached data is read, a new clean block is created and appended to the inactive list. When a chunk of cached data is read, one or more existing data blocks in the LRU lists are accessed. If these blocks are clean, we merge them together, update the access time and size of the resulting block, and append it to the active list. If the blocks are dirty, we move them independently to the active list, to preserve their entry time. Because the chunk and block sizes may be different, there are situations where a block is not entirely read. In this case, the block is split in two smaller blocks and one of them is re-accessed.

For file writes, we assume that all data to be written is uncached. Thus, each time a chunk is written, we create a block of dirty data and append it to the inactive list.
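To make this bookkeeping concrete, the sketch below shows one possible, heavily simplified Python encoding of data blocks and of the read/write rules just described. The class and method names (DataBlock, CacheLists, touch, ...) are illustrative only and do not correspond to the WRENCH or prototype APIs; the authoritative implementations are those referenced in Section III-C.

from dataclasses import dataclass

@dataclass(eq=False)  # identity comparison: two blocks with equal fields stay distinct
class DataBlock:
    """Unit of cached data: file pages accessed in the same I/O operation."""
    filename: str
    size: int          # bytes
    entry_time: float  # simulated time at which the block entered the cache
    last_access: float # simulated time of the most recent access
    dirty: bool        # True until the block has been flushed to disk

class CacheLists:
    """Simplified inactive/active lists of data blocks, ordered by last access."""

    def __init__(self):
        self.inactive = []   # least recently used blocks first
        self.active = []

    def add_uncached_read(self, filename, size, now):
        # A chunk read from disk creates a new clean block in the inactive list.
        self.inactive.append(DataBlock(filename, size, now, now, dirty=False))

    def add_write(self, filename, size, now):
        # A written chunk creates a new dirty block in the inactive list.
        self.inactive.append(DataBlock(filename, size, now, now, dirty=True))

    def touch(self, blocks, now):
        """Re-access cached blocks: move them to the active list (Section III-A2)."""
        for lst in (self.inactive, self.active):
            lst[:] = [b for b in lst if b not in blocks]
        clean = [b for b in blocks if not b.dirty]
        dirty = [b for b in blocks if b.dirty]
        if clean:
            # Clean blocks are merged into a single block with an updated access time.
            self.active.append(DataBlock(clean[0].filename,
                                         sum(b.size for b in clean), now, now, False))
        for b in dirty:
            # Dirty blocks move individually so that their entry time, used for
            # expiration-based flushing, is preserved.
            b.last_access = now
            self.active.append(b)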

Fig. 3: File data read order. Data is read from left to right: uncached data is read first, followed by data from the inactive list, and finally data from the active list. [Diagram omitted.]

3) Flushing and eviction: The main simulated thread in the Memory Manager can flush or evict data from the memory cache. The data flushing simulation function takes the amount of data to flush as a parameter. While this amount is not reached and dirty blocks remain in cache, this function traverses the sorted inactive list, then the sorted active list, and writes the least recently used dirty block to disk, having set its dirty flag to 0. In case the amount of data to flush requires that a block be partially flushed, the block is split in two blocks, one that is flushed and one that remains dirty. The time needed to flush data to disk is simulated by the storage model.

The cache eviction simulation also runs in the main thread. It frees up the page cache by traversing and deleting least recently used clean data blocks in the inactive list. The amount of data to evict is passed as a parameter and data blocks are deleted from the inactive list until the evicted data reaches the required amount, or until there is no clean block left in the list. If the last evicted block does not have to be entirely evicted, the block is split in two blocks, and only one of them is evicted. The overhead of the cache eviction algorithm is not part of the simulated time since cache eviction time is negligible in real systems.

Periodical flushing is simulated in the Memory Manager background thread. As in the Linux kernel, a dirty block in our model is considered expired if the duration since its entry time is longer than a predefined expiration time. Periodical flushing is simulated as an infinite loop in which the Memory Manager searches for dirty blocks and flushes them to disk (Algorithm 1). Because periodical flushing is simulated as a background thread, it can happen concurrently with disk I/O initiated by the main thread. This is taken into account by the storage model and reflected in simulated I/O time.

Algorithm 1 Periodical flush simulation in Memory Manager
1: Input
2:   in: page cache inactive list
3:   ac: page cache active list
4:   t: predefined flushing time interval
5:   exp: predefined expiration time
6:   sm: storage simulation model
7: while host is on do
8:   blocks = expired_blocks(exp, in) + expired_blocks(exp, ac)
9:   flushing_time = 0
10:  for blk in blocks do
11:    blk.dirty = 0
12:    flushing_time = flushing_time + sm.write(blk)
13:  end for
14:  if flushing_time < t then
15:    sleep(t - flushing_time)
16:  end if
17: end while

B. I/O Controller

Algorithm 2 File chunk read simulation in I/O Controller
1: Input
2:   cs: chunk size
3:   fn: file name
4:   fs: file size (assumed to fit in memory)
5:   mm: MemoryManager object
6:   sm: storage simulation model
7: disk_read = min(cs, fs - mm.cached(fn))                 ▷ To be read from disk
8: cache_read = cs - disk_read                             ▷ To be read from cache
9: required_mem = cs + disk_read
10: mm.flush(required_mem - mm.free_mem - mm.evictable, fn)
11: mm.evict(required_mem - mm.free_mem, fn)
12: if disk_read > 0 then                                  ▷ Read uncached data
13:   sm.read(disk_read)
14:   mm.add_to_cache(disk_read, fn)
15: end if
16: if cache_read > 0 then                                 ▷ Read cached data
17:   mm.cache_read(cache_read)
18: end if
19: mm.use_anonymous_mem(cs)

As mentioned previously, our model reads and writes file chunks in a round-robin fashion. To read a file chunk, simulated applications send chunk read requests to the I/O Controller, which processes them using Algorithm 2. First, we calculate the amount of uncached data that needs to be read from disk, and the remaining amount is read from cache (lines 7-8). The amount of memory required to read the chunk is then calculated, corresponding to a copy of the chunk in anonymous memory and a copy of the chunk in cache (line 9). If there is not enough available memory, the Memory Manager is called to flush dirty data (line 10). If necessary, flushing is complemented by eviction (line 11). Note that, when called with negative arguments, functions flush and evict simply return and do not do anything. Then, if the chunk requires uncached data, the Memory Manager is called to read data from disk and to add this data to cache (line 14). If cached data needs to be read, the Memory Manager is called to simulate a cache read and update the corresponding data blocks accordingly (line 17). Finally, the Memory Manager is called to deallocate the amount of anonymous memory used by the application (line 19).

Algorithm 3 File chunk write simulation in I/O Controller
1: Input
2:   cs: chunk size
3:   fn: file name
4:   mm: MemoryManager object
5: remain_dirty = dirty_ratio * mm.avail_mem - mm.dirty
6: if remain_dirty > 0 then                                ▷ Write to memory
7:   mm.evict(min(cs, remain_dirty) - mm.free_mem)
8:   mem_amt = min(cs, mm.free_mem)
9:   mm.write_to_cache(fn, mem_amt)
10: end if
11: remaining = cs - mem_amt
12: while remaining > 0 do                                 ▷ Flush to disk, then write to cache
13:   mm.flush(cs - mem_amt)
14:   mm.evict(cs - mem_amt - mm.free_mem)
15:   to_cache = min(remaining, mm.free_mem)
16:   mm.write_to_cache(fn, to_cache)
17:   remaining = remaining - to_cache
18: end while

Algorithm 3 describes our simulation of chunk writes in the I/O Controller. Our algorithm initially checks the amount of dirty data that can be written given the dirty ratio (line 5). If this amount is greater than 0, the Memory Manager is requested to evict data from cache if necessary (line 7). After eviction, the amount of data that can be written to page cache is calculated (line 8), and a cache write is simulated (line 9). If the dirty threshold is reached and there is still data to write, the remaining data is written to cache in a loop where we repeatedly flush and evict from the cache (lines 12-18).

The above model describes the page cache in writeback mode. Our model also includes a write function in writethrough mode, which simply simulates a disk write with the amount of data passed in, then evicts cache if needed and adds the written data to the cache.
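For intuition, the following Python fragment mirrors the sizing arithmetic of lines 7-11 of Algorithm 2 with made-up numbers. Function and variable names are illustrative only and are not WRENCH API calls.

def plan_chunk_read(cs, fs, cached, free_mem, evictable):
    """Mirror of Algorithm 2, lines 7-11: how much to read from disk vs. cache,
    and how much to flush/evict beforehand. All amounts are in bytes."""
    disk_read = min(cs, fs - cached)       # uncached part of the chunk
    cache_read = cs - disk_read            # part already in page cache
    required_mem = cs + disk_read          # anonymous copy + cache space for new data
    to_flush = required_mem - free_mem - evictable   # negative -> nothing to flush
    to_evict = required_mem - free_mem               # negative -> nothing to evict
    return disk_read, cache_read, max(to_flush, 0), max(to_evict, 0)

# Example: read a 1 GB chunk of a 3 GB file of which 2 GB are already cached,
# with 1.5 GB of free memory and 0.2 GB of evictable (clean) cache.
GB = 10**9
print(plan_chunk_read(cs=1*GB, fs=3*GB, cached=2*GB,
                      free_mem=int(1.5*GB), evictable=int(0.2*GB)))
# -> disk_read = 1 GB, cache_read = 0; required_mem = 2 GB,
#    so 0.3 GB must be flushed and 0.5 GB evicted before reading.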
C. Implementation

We first created a standalone prototype simulator to evaluate the accuracy and correctness of our model in a simple scenario before integrating it in the more complex WRENCH framework. The prototype uses the following basic storage model for both memory and disk:

t_r = D / b_r
t_w = D / b_w

where:
• t_r is the data read time
• t_w is the data write time
• D is the amount of data to read or write
• b_r is the read bandwidth of the device
• b_w is the write bandwidth of the device

This prototype does not simulate bandwidth sharing and thus does not support concurrency: it is limited to single-threaded applications running on systems with a single-core CPU. We used this prototype for a first validation of our simulation model against a real sequential application running on a real system. The Python 3.7 source code is available at https://github.com/big-data-lab-team/paper-io-simulation/tree/master/exp/pysim.

We also implemented our model as part of WRENCH, enhancing its internal implementation and APIs with a page cache abstraction, and allowing users to activate the feature via a command-line argument. We used SimGrid's locking mechanism to handle concurrent accesses to page cache LRU lists by the two Memory Manager threads. For the experiments, we used WRENCH 1.6 at commit 6718537433, which uses SimGrid 3.25, available at https://framagit.org/simgrid/simgrid. Our implementation is now part of WRENCH's master branch and will be available to users with the upcoming 1.8 release. WRENCH provides a full SimGrid-based simulation environment that supports, among other features, concurrent accesses to storage devices, applications distributed on multiple hosts, network transfers, and multi-threading.

D. Experiments

Our experiments compared real executions with our Python prototype, with the original WRENCH simulator, and with our WRENCH-cache extension. Executions included single-threaded and multi-threaded applications, accessing data on local and network file systems. We used two applications: a synthetic one, created to evaluate the simulation model, and a real one, representative of neuroimaging data processing.

Experiments were run on a dedicated cluster at Concordia University, with one login node, 9 compute nodes, and 4 storage nodes connected with a 25 Gbps network. Each compute node had 2 × 16-core Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz, 250 GiB of RAM, 6 × SSDs of 450 GiB each with the XFS file system, 378 GiB of tmpfs, 126 GiB of devtmpfs file system, CentOS 8.1 and NFS version 4. We used the atop and collectl tools to monitor and collect memory status and disk throughput. We cleared the page cache before each application run to ensure comparable conditions.

The synthetic application, implemented in C, consisted of three single-core, sequential tasks where each task read the file produced by the previous task, incremented every byte of this file to emulate real processing, and wrote the resulting data to disk. Files were numbered by ascending access times (File 1 was the file read by Task 1, etc.). The anonymous memory used by the application was released after each task, which we also simulated in the Python prototype and in WRENCH-cache. As our focus was on I/O rather than compute, we measured application task CPU times on a cluster node (Table I), and used these durations in our simulations. For the Python prototype, we injected CPU times directly in the simulation. For WRENCH and WRENCH-cache, we determined the corresponding number of flops on a 1 Gflops CPU and used these values in the simulation. The simulated platform and application are available at commit ec6b43561b.

TABLE I: Synthetic application parameters
Input size (GB) | CPU time (s)
3   | 4.4
20  | 28
50  | 75
75  | 110
100 | 155

We used the synthetic application in three experiments. In the first one (Exp 1), we ran a single instance of the application on a single cluster node, with different input file sizes (20 GB, 50 GB, 75 GB, 100 GB), and with all I/Os directed to the same local disk. In the second experiment (Exp 2), we ran concurrent instances of the application on a single node, all application instances operating on different files of size 3 GB stored in the same local disk. We varied the number of concurrent application instances from 1 to 32 since cluster nodes had 32 CPU cores. In the third experiment (Exp 3), we used the same configuration as the previous one, albeit reading and writing on a 50-GiB NFS-mounted partition of a 450-GiB remote disk of another compute node. As is commonly configured in HPC environments to avoid data loss, there was no client write cache and the server cache was configured as writethrough instead of writeback. NFS client and server read caches were enabled. Therefore, all the writes happened at disk bandwidth, but reads could benefit from cache hits.

The real application was a workflow of the Nighres toolbox [29], implementing cortical reconstruction from brain images in four steps: skull stripping, tissue classification, region extraction, and cortical reconstruction. Each step read files produced by the previous step, and wrote files that were or were not read by the subsequent step. More information on this application is available in the Nighres documentation at https://nighres.readthedocs.io/en/latest/auto_examples/example_02_cortical_depth_estimation.html. The application is implemented as a Python script that calls Java image-processing routines. We used Python 3.6, Java 8, and Nighres 1.3.0. We patched the application to remove lazy data loading and data compression, which made CPU time difficult to separate from I/O time, and to capture task CPU times to inject them in the simulation. The patched code is available at https://github.com/dohoangdzung/nighres.

We used the real application in the fourth experiment (Exp 4), run on a single cluster node using a single local disk. We processed data from participant 0027430 in the dataset of the Max Planck Institute for Human Cognitive and Brain Sciences available at http://dx.doi.org/10.15387/fcp_indi.corr.mpg1, leading to the parameters in Table II.

TABLE II: Nighres application parameters
Workflow step           | Input size (MB) | Output size (MB) | CPU time (s)
Skull stripping         | 295  | 393  | 137
Tissue classification   | 197  | 1376 | 614
Region extraction       | 1376 | 885  | 76
Cortical reconstruction | 393  | 786  | 272

To parameterize the simulators, we benchmarked the memory, local disk, remote disk (NFS), and network bandwidths (Table III). Since SimGrid, and thus WRENCH, currently only supports symmetrical bandwidths, we use the mean of the read and write bandwidth values in our experiments.

TABLE III: Bandwidth benchmarks (MBps) and simulator configurations. The bandwidths used in the simulations were the average of the measured read and write bandwidths. Network accesses were not simulated in the Python prototype.
Bandwidths        | Cluster (real) | Python prototype | WRENCH simulator
Memory read       | 6860 | 4812 | 4812
Memory write      | 2764 | 4812 | 4812
Local disk read   | 510  | 465  | 465
Local disk write  | 420  | 465  | 465
Remote disk read  | 515  | -    | 445
Remote disk write | 375  | -    | 445
Network           | 3000 | -    | 3000

IV. RESULTS

A. Single-threaded execution (Exp 1)

The page cache simulation model drastically reduced I/O simulation errors in each application task (Figure 4a). The first read was not impacted as it only involved uncached data. Errors were reduced from an average of 345% in the original WRENCH to 46% in the Python prototype and 39% in WRENCH-cache. Unsurprisingly, the original WRENCH simulator significantly overestimated read and write times, due to the lack of page cache simulation. Results with files of 50 GB and 75 GB showed similar behaviors and are not reported for brevity.

WRENCH simulation errors were substantially lower with 100 GB files than with 20 GB files, due to the fact that part of the 100 GB file needed to be read and written to disk, the only storage device in WRENCH, as it did not fit in cache. Conversely, simulation errors of the Python prototype and WRENCH-cache were higher with 100 GB files than with 20 GB files, due to idiosyncrasies in the kernel flushing and eviction strategies that could not be easily modeled.

Simulated memory profiles were highly consistent with the real ones (Figure 4b). With 20 GB files, memory profiles almost exactly matched the real ones, although dirty data seemed to be flushed faster in real life than in simulation, a behavior also observed with 100 GB files. With 100 GB files, used memory reached total memory during the first write, triggering dirty data flushing, and dropped back to cached memory when application tasks released anonymous memory. Simulated cached memory was highly consistent with real values, except toward the end of Read 3 where it slightly increased in simulation but not in reality. This occurred due to the fact that after Write 2, File 3 was only partially cached in simulation whereas it was entirely cached in the real system. In all cases, dirty data remained under the dirty ratio as expected. The Python prototype and WRENCH-cache exhibited nearly identical memory profiles, which reinforces the confidence in our implementations.

The content of the simulated memory cache was also highly consistent with reality (Figure 4c). With 20 GB files, the simulated cache content exactly matched reality, since all files fitted in page cache. With 100 GB files, a slight discrepancy was observed after Write 2, which explains the simulation error previously mentioned in Read 3. In the real execution indeed, File 3 was entirely cached after Write 2, whereas in the simulated execution, only a part of it was cached. This was due to the fact that the Linux kernel tends to not evict pages that belong to files being currently written (File 3 in this case), which we could not easily reproduce in our model.
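Throughout Section IV, errors are reported as absolute relative simulation errors; presumably these are computed, for each task and I/O phase, as the absolute difference between simulated and measured duration, relative to the measured duration. The one-liner below (a formulation written for this discussion, not code from the paper's artifacts) states that reading:

def abs_relative_error_pct(simulated_s, real_s):
    """Absolute relative error, in percent, of a simulated duration w.r.t. the real one."""
    return abs(simulated_s - real_s) / real_s * 100

# e.g. a write simulated at 80 s that took 60 s in reality -> ~33.3 % error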

Fig. 4: Single-threaded results (Exp 1). (a) Absolute relative simulation errors (%) per task, for the Python prototype, WRENCH, and WRENCH-cache, with 20 GB and 100 GB files. (b) Memory profiles over time (total memory, dirty ratio, used memory, cache, dirty data) for the real execution, the Python prototype, and WRENCH-cache. (c) Cache contents (files 1-4) after each application I/O operation, real execution vs. WRENCH-cache. [Plots omitted.]

Fig. 5: Concurrent results with 3 GB files (Exp 2). [Plots omitted: read time and write time (s) vs. number of concurrent applications, for the real execution (mean and min-max interval over 5 repetitions), WRENCH, and WRENCH-cache.]

B. Concurrent applications (Exp 2)

The page cache model notably reduced WRENCH's simulation error for concurrent applications executed with local I/Os (Figure 5). For reads, WRENCH-cache slightly overestimated runtime, due to the discrepancy between simulated and real read bandwidths mentioned before. For writes, WRENCH-cache retrieved a plateau similar to the one observed in the real execution, marking the limit beyond which the page cache was saturated with dirty data and needed flushing.

C. Remote storage (Exp 3)

Page cache simulation importantly reduced simulation error on NFS storage as well (Figure 7). This manifested only for reads, as the NFS server used writethrough rather than writeback cache. Both WRENCH and WRENCH-cache underestimated write times due to the discrepancy between simulated and real bandwidths mentioned previously. For reads, this discrepancy only impacted the results beyond 22 applications since before this threshold, most reads resulted in cache hits.

D. Real application (Exp 4)

Similar to the synthetic application, simulation errors were substantially reduced by the WRENCH-cache simulator compared to WRENCH (Figure 6). On average, errors were reduced from 337% in WRENCH to 47% in WRENCH-cache. The first read happened entirely from disk and was therefore very accurately simulated by both WRENCH and WRENCH-cache.

E. Simulation time

As is the case for WRENCH, simulation time with WRENCH-cache scales linearly with the number of concurrent applications (Figure 8, p < 10⁻²⁴). However, the page cache model substantially increases simulation time by application, as can be seen by comparing regression slopes in Figure 8. Interestingly, WRENCH-cache is faster with NFS I/Os than with local I/Os, most likely due to the use of writethrough cache in NFS, which bypasses flushing operations.

Fig. 6: Real application results (Exp 4). [Plot omitted: absolute relative simulation error (%) for Read 1 through Write 4, WRENCH vs. WRENCH-cache.]

V. CONCLUSION

We designed a model of the Linux page cache and implemented it in the SimGrid-based WRENCH simulation framework to simulate the execution of distributed applications. Evaluation results show that our model improves simulation accuracy substantially, reducing absolute relative simulation errors by up to 9× (see results of the single-threaded experiment). The availability of asymmetrical disk bandwidths in the forthcoming SimGrid release will further improve these results. Our page cache model is publicly available in the WRENCH GitHub repository.

Read time Write time

1400 1400

1200 1200

1000 1000

800 800 time (s) 600 600

400 400

200 200

0 0 0 5 10 15 20 25 30 0 5 10 15 20 25 30 Concurrent applications Concurrent applications

Fig. 7: NFS results with 3 GB files (Exp 3)

Page cache simulation can be instrumental in a number of studies. For instance, it is now common for HPC clusters to run applications in Linux control groups (cgroups), where resource consumption is limited, including memory and therefore page cache usage. Using our simulator, it would be possible to study the interaction between memory allocation and I/O performance, for instance to improve scheduling algorithms or avoid page cache starvation [30]. Our simulator could also be leveraged to evaluate solutions that reduce the impact of network file transfers on distributed applications, such as burst buffers [31], hierarchical file systems [32], active storage [33], or specific hardware architectures [34].

Not all I/O behaviors are captured by currently available simulation models, including the one developed in this work, which could substantially limit the accuracy of simulations. Relevant extensions to this work include more accurate descriptions of anonymous memory usage in applications, which strongly affects I/O times through writeback cache. File access patterns might also be worth including in the simulation models, as they directly affect page cache content.

Fig. 8: Simulation time comparison. WRENCH-cache scales linearly with the number of concurrent applications, albeit with a higher overhead than WRENCH. [Plot omitted: simulation time (seconds) vs. number of concurrent applications, with linear fits for WRENCH and WRENCH-cache, local and NFS I/Os.]

VI. ACKNOWLEDGMENTS

The computing platform used in the experiments was obtained with funding from the Canada Foundation for Innovation. This work was partially supported by NSF contracts #1923539 and #1923621.

REFERENCES

[1] W. H. Bell, D. G. Cameron, A. P. Millar, L. Capozza, K. Stockinger, and F. Zini, "OptorSim - A Grid Simulator for Studying Dynamic Data Replication Strategies," IJHPCA, vol. 17, no. 4, pp. 403–416, 2003.
[2] R. Buyya and M. Murshed, "GridSim: A Toolkit for the Modeling and Simulation of Distributed Resource Management and Scheduling for Grid Computing," Concurrency and Computation: Practice and Experience, vol. 14, no. 13-15, pp. 1175–1220, Dec. 2002.
[3] S. Ostermann, R. Prodan, and T. Fahringer, "Dynamic Cloud Provisioning for Scientific Grid Workflows," in Proc. of the 11th ACM/IEEE Intl. Conf. on Grid Computing (Grid), 2010, pp. 97–104.
[4] R. N. Calheiros, R. Ranjan, A. Beloglazov, C. A. F. De Rose, and R. Buyya, "CloudSim: A Toolkit for Modeling and Simulation of Cloud Computing Environments and Evaluation of Resource Provisioning Algorithms," Software: Practice and Experience, vol. 41, no. 1, pp. 23–50, Jan. 2011.
[5] A. Núñez, J. Fernández, R. Filgueira, F. García, and J. Carretero, "SIMCAN: A flexible, scalable and expandable simulation platform for modelling and simulating distributed architectures and applications," Simulation Modelling Practice and Theory, vol. 20, no. 1, pp. 12–32, 2012.
[6] A. Núñez, J. L. Vázquez-Poletti, A. C. Caminero, G. G. Castañé, J. Carretero, and I. M. Llorente, "iCanCloud: A flexible and scalable cloud infrastructure simulator," Journal of Grid Computing, vol. 10, no. 1, pp. 185–209, 2012.
[7] S. Lim, B. Sharma, G. Nam, E. Kim, and C. Das, "MDCSim: A multi-tier data center simulation platform," in Intl. Conference on Cluster Computing and Workshops (CLUSTER), 2009.
[8] G. Kecskemeti, "DISSECT-CF: A simulator to foster energy-aware scheduling in infrastructure clouds," Simulation Modelling Practice and Theory, vol. 58, no. 2, 2015.
[9] A. W. Malik, K. Bilal, K. Aziz, D. Kliazovich, N. Ghani, S. U. Khan, and R. Buyya, "CloudNetSim++: A toolkit for data center simulations in OMNET++," in 2014 11th Annual High Capacity Optical Networks and Emerging/Enabling Technologies (Photonics for Energy), 2014, pp. 104–108.
[10] T. Qayyum, A. W. Malik, M. A. Khan Khattak, O. Khalid, and S. U. Khan, "FogNetSim++: A Toolkit for Modeling and Simulation of Distributed Fog Environment," IEEE Access, vol. 6, pp. 63570–63583, 2018.
[11] H. Casanova, A. Giersch, A. Legrand, M. Quinson, and F. Suter, "Versatile, Scalable, and Accurate Simulation of Distributed Applications and Platforms," Journal of Parallel and Distributed Computing, vol. 74, no. 10, pp. 2899–2917, Jun. 2014.
[12] C. D. Carothers, D. Bauer, and S. Pearce, "ROSS: A High-Performance, Low Memory, Modular Time Warp System," in Proc. of the 14th ACM/IEEE/SCS Workshop of Parallel on Distributed Simulation, 2000, pp. 53–60.
[13] H. Casanova, R. Ferreira da Silva, R. Tanaka, S. Pandey, G. Jethwani, W. Koch, S. Albrecht, J. Oeth, and F. Suter, "Developing accurate and scalable simulators of production workflow management systems with WRENCH," Future Generation Computer Systems, vol. 112, pp. 162–175, 2020.
[14] J. Xu, K. Ota, and M. Dong, "Saving energy on the edge: In-memory caching for multi-tier heterogeneous networks," IEEE Communications Magazine, vol. 56, no. 5, pp. 102–107, 2018.
[15] R. Love, Linux Kernel Development, 3rd ed. Addison-Wesley Professional, 2010.
[16] H. M. Owda, M. A. Shah, A. I. Musa, and M. I. Tamimy, "A comparison of page replacement algorithms in Linux memory management," memory, vol. 1, p. 2, 2014.
[17] P. Velho, L. Mello Schnorr, H. Casanova, and A. Legrand, "On the Validity of Flow-level TCP Network Models for Grid and Cloud Simulations," ACM Transactions on Modeling and Computer Simulation, vol. 23, no. 4, 2013.
[18] P. Bedaride, A. Degomme, S. Genaud, A. Legrand, G. Markomanolis, M. Quinson, M. Stillwell, F. Suter, and B. Videau, "Toward Better Simulation of MPI Applications on Ethernet/TCP Networks," in Proc. of the 4th Intl. Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, 2013.
[19] P. Velho and A. Legrand, "Accuracy Study and Improvement of Network Simulation in the SimGrid Framework," in Proc. of the 2nd Intl. Conf. on Simulation Tools and Techniques, 2009.
[20] K. Fujiwara and H. Casanova, "Speed and Accuracy of Network Simulation in the SimGrid Framework," in Proc. of the 1st Intl. Workshop on Network Simulation Tools, 2007.
[21] A. Lebre, A. Legrand, F. Suter, and P. Veyre, "Adding storage simulation capacities to the SimGrid toolkit: Concepts, models, and API," in Proceedings of the 15th IEEE/ACM Symposium on Cluster, Cloud and Grid Computing (CCGrid 2015). Shenzhen, China: IEEE/ACM, May 2015, pp. 251–260.
[22] L. Pouilloux, T. Hirofuchi, and A. Lebre, "SimGrid VM: Virtual Machine Support for a Simulation Framework of Distributed Systems," IEEE Transactions on Cloud Computing, Sep. 2015.
[23] A. Degomme, A. Legrand, G. Markomanolis, M. Quinson, M. Stillwell, and F. Suter, "Simulating MPI applications: the SMPI approach," IEEE Transactions on Parallel and Distributed Systems, vol. 28, pp. 2387–2400, 2017.
[24] A. S. M. Rizvi, T. R. Toha, M. M. R. Lunar, M. A. Adnan, and A. B. M. A. A. Islam, "Cooling energy integration in SimGrid," in 2017 International Conference on Networking, Systems and Security (NSysS), 2017, pp. 132–137.
[25] F. C. Heinrich, T. Cornebize, A. Degomme, A. Legrand, A. Carpen-Amarie, S. Hunold, A. Orgerie, and M. Quinson, "Predicting the energy-consumption of MPI applications at scale using only a single node," in 2017 IEEE International Conference on Cluster Computing (CLUSTER), 2017, pp. 92–102.
[26] L. Stanisic, E. Agullo, A. Buttari, A. Guermouche, A. Legrand, F. Lopez, and B. Videau, "Fast and accurate simulation of multithreaded sparse linear algebra solvers," in 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS), 2015, pp. 481–490.
[27] G. Kecskemeti, S. Ostermann, and R. Prodan, "Fostering Energy-Awareness in Simulations Behind Scientific Workflow Management Systems," in Proc. of the 7th IEEE/ACM Intl. Conf. on Utility and Cloud Computing, 2014, pp. 29–38.
[28] M. Gorman, Understanding the Linux Virtual Memory Manager. Prentice Hall, Upper Saddle River, 2004.
[29] J. M. Huntenburg, C. J. Steele, and P.-L. Bazin, "Nighres: processing tools for high-resolution neuroimaging," GigaScience, vol. 7, no. 7, p. giy082, 2018.
[30] Z. Zhuang, C. Tran, J. Weng, H. Ramachandra, and B. Sridharan, "Taming memory related performance pitfalls in Linux cgroups," in 2017 International Conference on Computing, Networking and Communications (ICNC). IEEE, 2017, pp. 531–535.
[31] R. Ferreira da Silva, S. Callaghan, T. M. A. Do, G. Papadimitriou, and E. Deelman, "Measuring the impact of burst buffers on data-intensive scientific workflows," Future Generation Computer Systems, vol. 101, pp. 208–220, 2019.
[32] N. S. Islam, X. Lu, M. Wasi-ur Rahman, D. Shankar, and D. K. Panda, "Triple-H: A hybrid approach to accelerate HDFS on HPC clusters with heterogeneous storage architecture," in 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. IEEE, 2015, pp. 101–110.
[33] S. W. Son, S. Lang, P. Carns, R. Ross, R. Thakur, B. Ozisikyilmaz, P. Kumar, W. Liao, and A. Choudhary, "Enabling active storage on parallel I/O software stacks," in 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), 2010, pp. 1–12.
[34] V. Hayot-Sasson, S. T. Brown, and T. Glatard, "Performance benefits of Intel® Optane™ DC persistent memory for the parallel processing of large neuroimaging data," in 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID). IEEE, 2020, pp. 509–518.