SCHED DEADLINE: It’S Alive!

Title 44pt sentence case SCHED_DEADLINE: It’s Alive! Juri Lelli Affiliations 24pt sentence case ARM Ltd. 20pt sentence case ELC North America 17, Portland (OR) 02/21/2017 © ARM 2017 Title 40pt sentence case Agenda Bullets 24pt sentence case . Deadline scheduling (SCHED_DEADLINE) bullets 20pt sentence case . Why is development now happening (out of the blue?) . Bandwidth reclaiming . Frequency/CPU scaling of reservation parameters . Coupling with frequency selection . Group scheduling . Future 2 © ARM 2017 Text 54pt sentence case CHAPER 1 What and Why 3 © ARM 2017 Title 40pt sentence case Agenda Bullets 24pt sentence case . Deadline scheduling (SCHED_DEADLINE) bullets 20pt sentence case . Why is development now happening (out of the blue?) . Bandwidth reclaiming . Frequency/CPU scaling of reservation parameters . Coupling with frequency selection . Group scheduling . Future 4 © ARM 2017 Title 40pt sentence case Deadline scheduling (previously on ...) Bullets 24pt sentence case . mainline since v3.14 SCHED_RR SCHED_ bullets 20pt sentence case 30 March 2014 (~3y ago) SCHED_ IDLE DEADLINE SCHED_FIFO . it’s not only about deadlines SCHED_ . RT scheduling policy BATCH . explicit per-task latency constraints SCHED_ NORMAL . avoids starvation . enriches scheduler’s knowledge about QoS requirements . EDF + CBS . resource reservation mechanism deadline.c rt.c fair.c . temporal isolation . ELC16 presentation Linux scheduler https://goo.gl/OVspuI 5 © ARM 2017 Title 40pt sentence case Deadline scheduling (previously on ...) Bullets 24pt sentence case . mainline since v3.14 SCHED_RR SCHED_ bullets 20pt sentence case 30 March 2014 (~3y ago) SCHED_ IDLE DEADLINE SCHED_FIFO . it’s not only about deadlines SCHED_ . RT scheduling policy BATCH . explicit per-task latency constraints SCHED_ NORMAL . avoids starvation . enriches scheduler’s knowledge about QoS requirements . EDF + CBS . resource reservation mechanism deadline.c rt.c fair.c . temporal isolation . ELC16 presentation Linux scheduler https://goo.gl/OVspuI 6 © ARM 2017 Title 40pt sentence case Agenda Bullets 24pt sentence case . Deadline scheduling (SCHED_DEADLINE) bullets 20pt sentence case . Why is development now happening (out of the blue?) . Bandwidth reclaiming . Frequency/CPU scaling of reservation parameters . Coupling with frequency selection . Group scheduling . Future 7 © ARM 2017 Title 40pt sentence case Why is development now happening Bullets 24pt sentence case . Energy Aware Scheduling (EAS) bullets 20pt sentence case . extends the Linux kernel scheduler and power management to make it fully power/performance aware (https://goo.gl/vQbUOu) . scheduler modifications pertain to SCHED_NORMAL (so far) . Android Common Kernel . EAS has been merged last year (https://goo.gl/FXCdAX) . performance usually means meeting latency requirements . considerable usage (and modifications) of SCHED_FIFO . SCHED_DEADLINE seems to be a better fit and mainline adoption of required changes should be less controversial . Joint collaboration between ARM and Scuola Superiore Sant’Anna of Pisa 8 © ARM 2017 Text 54pt sentence case CHAPTER 2 Let’s reclaim! 9 © ARM 2017 Title 40pt sentence case Agenda Bullets 24pt sentence case . Deadline scheduling (SCHED_DEADLINE) bullets 20pt sentence case . Why is development now happening (out of the blue?) . Bandwidth reclaiming . Frequency/CPU scaling of reservation parameters . Coupling with frequency selection . Group scheduling . Future 10 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming Bullets 24pt sentence case . PROBLEM bullets 20pt sentence case . tasks’ bandwidth is fixed (can only be changed with sched_setattr()) . what if tasks occasionally need more bandwidth? e.g., occasional workload fluctuations (network traffic, rendering of particularly heavy frame, etc) . SOLUTION (proposed*) . bandwidth reclaiming: allow tasks to consume more than allocated . up to a certain maximum fraction of CPU time . if this doesn’t break others’ guarantees * https://lkml.org/lkml/2016/12/30/107 11 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Greedy Reclamation of Unused Bandwidth (GRUB)1 bullets 20pt sentence case . 3 components2 . tracking of active utilization . modification of the accounting rule . multiprocessor support (original algorithm was designed for UP) 1 - Greedy reclamation of unused bandwidth in constant-bandwidth servers - Giuseppe Lipari, Sanjoy K. Baruah (https://goo.gl/xl4CUk) 2 - Greedy CPU reclaiming for SCHED DEADLINE - Luca Abeni, Juri Lelli, Claudio Scordino, Luigi Palopoli (https://goo.gl/e8EC8q) 12 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Tracking of active utilization bullets 20pt sentence case q i di 0lag di Ti Qi Q i q i Ti . Uact is increased by Qi/Ti when task wakes up q Q . 0 lag time comes from CBS wakeup check: i i di t Ti . Uact is decreased by the same amount at 0 lag time a timer is set to fire at this instant of time . One Uact per CPU (rq->dl.running_bw) 13 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Tracking of active utilization bullets 20pt sentence case q i di 0lag di Ti Qi Q i q i Ti . Uact is increased by Qi/Ti when task wakes up q Q . 0 lag time comes from CBS wakeup check: i i di t Ti . Uact is decreased by the same amount at 0 lag time a timer is set to fire at this instant of time . One Uact per CPU (rq->dl.running_bw) 14 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Modification of the accounting rule bullets 20pt sentence case task_tick_dl() dequeue_task_dl() -> update_curr_dl() -> update_curr_dl() . runtime -= delta_exec becomes runtime -= Uact * delta_exec . but this can eat up 100% of CPU time! (starving non-DL tasks) . e.g., a 5sec every 10sec task that can reclaim... so, in reality accounting will probably become runtime -= Uact/Umax * delta_exec 15 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Modification of the accounting rule bullets 20pt sentence case task_tick_dl() dequeue_task_dl() -> update_curr_dl() -> update_curr_dl() . runtime -= delta_exec becomes runtime -= Uact * delta_exec . but this can eat up 100% of CPU time! (starving non-DL tasks) . e.g., a 5sec every 10sec task that can reclaim... so, in reality accounting will probably become runtime -= Uact/Umax * delta_exec 16 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . e.g., a 5sec every 10sec task that can’t reclaim... bullets 20pt sentence case . VS, a 5sec every 10sec task that can reclaim (without Umax cap) Uact = 0.5 -> runtime -= delta*0.5 -> deplete in (1/0.5)*runtime = 10sec Umax = 0.9 -> runtime -= delta*(0.5/0.9) -> deplete in (0.9/0.5)*runtime = 9sec leaving 1sec for otherwise sad guys :-) 17 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Multiprocessor support bullets 20pt sentence case . ISSUE (one of a few) Qi Uactk 0lag Ti CPU k CPU j . task i wakes up and is accounted for . it then blocks and timer is set to fire at 0 lag time 18 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Multiprocessor support bullets 20pt sentence case . ISSUE (one of a few) 0lag CPU k CPU j . task i wakes up again, before 0 lag . but it is migrated on a different CPU . 0 lag timer cancelled, but no changes to both CPUs’ Uact 19 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Multiprocessor support bullets 20pt sentence case . ISSUE (one of a few) Uactk (t 1) Uactk (t) CPU k Q Uact i 0 j T i CPU j . task i blocks again (on CPUj) . no change on CPUk’s Uact and CPUj’s Uact becomes negative! 20 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (cont.) Bullets 24pt sentence case . Multiprocessor support bullets 20pt sentence case . SOLUTION – migrate task’s utilization together with him Qi Uactk 0lag Ti CPU k Q Uact i j T i CPU j . 0 lag timer cancelled, and... utilization is instantaneously migrated as well . so that when task i blocks again everything is fine 21 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (results) Bullets 24pt sentence case . Task1 (6ms, 20ms) bullets 20pt sentence case constant execution time of 5ms . Task2 (45ms, 260ms) experiences occasional variances (35ms-52ms) CDF Response time (ms) T2’s reservation period 22 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (results) Bullets 24pt sentence case . Task1 (6ms, 20ms) bullets 20pt sentence case constant execution time of 5ms . Task2 (45ms, 260ms) experiences occasional variances (35ms-52ms) CDF . Cumulative Distribution Function (CDF) probability that Response time will be less or equal to x ms Response time (ms) T2’s reservation period 23 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (results) Bullets 24pt sentence case . Task1 (6ms, 20ms) bullets 20pt sentence case constant execution time of 5ms . Task2 (45ms, 260ms) experiences occasional variances (35ms-52ms) CDF . Plain CBS T2’s response time bigger then reservation period (~25%) Response time (ms) T2’s reservation period 24 © ARM 2017 Title 40pt sentence case Bandwidth Reclaiming (results) Bullets 24pt sentence case . Task1 (6ms, 20ms) bullets 20pt sentence case constant execution time of 5ms . Task2 (45ms, 260ms) experiences occasional variances (35ms-52ms) CDF . GRUB T2 always completes before reservation period (using bandwidth left by T1) Response

SCHED DEADLINE: It’S Alive!

Linux Scheduler Documentation

Advances in Mobile Cloud Computing and Big Data in the 5G Era Studies in Big Data

On the Defectiveness of SCHED DEADLINE W.R.T. Tardiness and Afinities, and a Partial Fix Stephen Tang Luca Abeni James H

RTEMS C User Documentation Release 4.11.3 ©Copyright 2016, RTEMS Project (Built 15Th February 2018)

Network Store: Exploring Slicing in Future 5G Networks

Embedded Operating Systems

Red Hat Enterprise Linux for Real Time 8 Reference Guide

SCHED DEADLINE: a Real-Time CPU Scheduler for Linux Luca Abeni [email protected]

Block Devices and Volume Management in Linux

Proceedings Der Linux Audio Conference 2018

Toward a Low-Latency and Cooperative Radio Access Network

Allocation and Control of Computing Resources for Real-Time Virtual Network Functions