On the Defectiveness of SCHED DEADLINE W.R.T. Tardiness and Afinities, and a Partial Fix Stephen Tang Luca Abeni James H

On the Defectiveness of SCHED_DEADLINE w.r.t. Tardiness and Afinities, and a Partial Fix Stephen Tang Luca Abeni James H. Anderson [email protected] {sytang,anderson}@cs.unc.edu Scuola Superiore Sant’Anna University of North Carolina at Chapel Hill Pisa, Italy Chapel Hill, USA ABSTRACT create new DL tasks until existing DL tasks reduce their bandwidth SCHED_DEADLINE (DL for short) is an Earliest-Deadline-First consumption. Note that preventing over-utilization is only a neces- sary condition for guaranteeing that tasks meet deadlines. (EDF) scheduler included in the Linux kernel. A question motivated Instead of guaranteeing that deadlines will be met, AC satisfes by DL is how EDF should be implemented in the presence of CPU afnities to maintain optimal bounded tardiness guarantees. Recent two goals [3]. The frst goal is to provide performance guarantees. works have shown that under arbitrary afnities, DL does not main- Specifcally, AC guarantees to each DL task that its long-run con- tain such guarantees. Such works have also shown that repairing sumption of CPU bandwidth remains within a bounded margin DL to maintain these guarantees would likely require an impractical of error from a requested rate. The second goal is to avoid the overhaul of the existing code. In this work, we show that for the starvation of lower-priority non-DL workloads. Specifcally, AC special case where afnities are semi-partitioned, DL can be modi- guarantees that some user-specifed amount of CPU bandwidth fed to maintain tardiness guarantees with minor changes. We also will remain for the execution of non-DL workloads. This goal of draw attention to the fact that admission control is already broken AC has been an important aspect of Linux real-time scheduling for in several respects in the existing DL implementation. years, and predates the merging of DL into the kernel. The theoretical basis of AC is that global EDF scheduling of CCS CONCEPTS sporadic real-time tasks on identical multiprocessors guarantees bounded tardiness to jobs if the system is not over-utilized [6]. The • Computer systems organization ! Real-time systems. DL documentation [2] cites [6]. Bounded tardiness ensures AC’s frst goal as it implies that any execution of a job occurs in a fnite KEYWORDS window around its release and deadline, meaning a task’s rate of real-time, afnities execution stays consistent with its bandwidth. Bounded tardiness ACM Reference Format: also prevents any set of DL tasks from having an unbounded number Stephen Tang, James H. Anderson, and Luca Abeni. 2021. On the Defec- of tardy jobs which, if allowed to execute uninterrupted, could tiveness of SCHED_DEADLINE w.r.t. Tardiness and Afnities, and a Partial starve non-DL workloads for an unbounded amount of time. Thus, Fix. In 29th International Conference on Real-Time Networks and Systems bounded tardiness also supports AC’s second goal. ACM, New York, NY, USA, (RTNS’2021), April 7–9, 2021, NANTES, France. Specifying tasks’ afnities, subsets of the CPUs on 11 pages. https://doi.org/10.1145/3453417.3453440 Afnities. which given tasks are permitted to execute, is useful for maintaining cache locality, as tasks whose execution times depend heavily on 1 INTRODUCTION whether they are cache hot or cold at a certain cache level may ben- SCHED_DEADLINE (DL for short) is an Earliest-Deadline-First (EDF) eft from having their afnities restricted to CPUs that share cache implementation included in the mainline Linux kernel since version at said level. Afnities can also be useful in reducing scheduling- 3.14. Its inclusion has been signifcant because it has lowered the decision overheads, as a CPU need only consider tasks with afnity barrier to entry for real-time EDF scheduling and it has inspired for that CPU when deciding what to execute. many publications [4, 8, 9, 11, 13, 15, 16]. Unfortunately, the tardiness result in [6] heavily relies on the Admission control. One feature of DL is its admission-control fact that scheduling is global, and does not hold when tasks have (AC) system. AC prevents the system from being over-utilized by afnities. Thus, with the exception of clustered scheduling (in which tasks managed by DL. If the total CPU bandwidth required by DL each cluster resembles its own global subsystem), the setting of tasks is near some threshold, then AC will reject any requests to afnities is forbidden in DL unless AC is explicitly disabled. It was shown in [15] that with arbitrary afnities, under-utilized Permission to make digital or hard copies of all or part of this work for personal or systems (whose defnition becomes more complex under arbitrary classroom use is granted without fee provided that copies are not made or distributed for proft or commercial advantage and that copies bear this notice and the full citation afnities) may have unbounded tardiness under DL. Though an EDF on the frst page. Copyrights for components of this work owned by others than ACM variant called SAPA-EDF presented in [5] was proven in [16] to must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specifc permission and/or a guarantee bounded tardiness with arbitrary afnities for identical fee. Request permissions from [email protected]. CPUs, this variant was not implemented by the DL maintainers RTNS’2021, April 7–9, 2021, NANTES, France due to the complexity of the required modifcations to both the © 2021 Association for Computing Machinery. scheduler and AC (as the meaning of over-utilization is changed). ACM ISBN 978-1-4503-9001-9/21/04...$15.00 https://doi.org/10.1145/3453417.3453440 RTNS’2021, April 7–9, 2021, NANTES, France Tang, Anderson, and Abeni This variant would likely have also required higher overheads and Organization. The rest of this paper is organized as follows. After increased task migrations in practice. covering needed background in Sec. 2, we present and categorize the While [15] showed that the current DL implementation cannot list of DL features not considered by [6] in Sec. 3. We demonstrate support arbitrary afnities, that work suggested that this may not the non-optimality of DL under SP afnities and present our patch be the case for special cases of afnities. In this paper, we consider in Sec. 4. We provide an overview of the proof of correctness of the special case of semi-partitioned (SP) systems, wherein each task this patch under an idealized abstraction of our patched DL in Sec. 5 has afnity for either one CPU (the task is partitioned) or for all (again, the formal proof is deferred to an appendix). We evaluate CPUs in its cluster (the task is migrating). While not as fexible as the overheads of our patch relative to the original implementation arbitrary afnities, SP afnities still reduce overheads by limiting in Sec. 6. We conclude in Sec. 7. the number of migrating tasks and can still be used to maintain cache locality for partitioned tasks. 2 BACKGROUND In practice. AC does not guarantee bounded tardiness under DL, We start by presenting our system model, mostly derived from [15] even under an idealized abstraction of the scheduler. This is be- with some modifcations to consider dynamic task systems. We cause afnities are far from the only way that DL difers from the then discuss how DL fts this system model. system model considered in [6], from which the soundness of AC originates. In particular, DL has grown to consider dynamic voltage 2.1 Task Model and frequency scaling (DVFS) and asymmetric CPU capacities, both We consider a system of N implicit-deadline sporadic tasks τ = of which break the assumption of identical CPUs in [6]. Besides fτ1;τ2;:::;τN g running on M unit-speed CPUs π = fπ1; π2;:::; πM g. these changes in the considered platform model, tasks can also be We assume basic familiarity with the sporadic task model. We de- much more dynamic under DL than in [6]. th note the j job released by task τi as τi;j , where j ≥ 1. Job τi;j Unlike afnities, AC need not be disabled when using such fea- must be completed before job τi;j+1 is allowed to execute. We let tures. This calls into question the role of AC, as there is no theo- Ci denote the worst-case execution time (WCET) of τi over all its retical basis for its goals without tardiness guarantees. Instead of jobs. We let Ti denote the period of task τi . The utilization ui of by analysis, these features have often been validated empirically task τi is given by Ci /Ti . We denote the release time, deadline, and by demonstrating that deadline-miss frequencies are acceptable. completion time of job τi;j by ri;j , di;j , and fi;j , respectively, where As we demonstrate herein, empirical validation is insufcient for di;j = ri;j + Ti (implicit deadlines). showing that AC is not broken. At time t, a job τi;j is either unreleased (t < ri;j ), pending (ri;j ≤ Contributions. We present a DL variant that supports AC for SP t < fi;j ), or complete (t ≥ fi;j º. If a task τi has pending jobs at t, systems. This contribution is divided into three parts. then its ready job at t is its earliest-released pending job at t. First, we list features supported by DL that were not considered For a job τi;j , its response time is given by fi;j −ri;j , and its tardi- in the analysis in [6]. For a subset of these features, we show that ness by maxf0; fi;j −di;j g.

On the Defectiveness of SCHED DEADLINE W.R.T. Tardiness and Afinities, and a Partial Fix Stephen Tang Luca Abeni James H

Linux Scheduler Documentation

Advances in Mobile Cloud Computing and Big Data in the 5G Era Studies in Big Data

RTEMS C User Documentation Release 4.11.3 ©Copyright 2016, RTEMS Project (Built 15Th February 2018)

Network Store: Exploring Slicing in Future 5G Networks

Embedded Operating Systems

Red Hat Enterprise Linux for Real Time 8 Reference Guide

SCHED DEADLINE: a Real-Time CPU Scheduler for Linux Luca Abeni [email protected]

Block Devices and Volume Management in Linux

Proceedings Der Linux Audio Conference 2018

Toward a Low-Latency and Cooperative Radio Access Network

SCHED DEADLINE: It’S Alive!

Allocation and Control of Computing Resources for Real-Time Virtual Network Functions