Piece of CAKE: A Comprehensive Queue Management Solution for Home Gateways

Toke Høiland-Jørgensen, Dept. of Computer Science, Karlstad University, Sweden ([email protected])
Dave Täht, Teklibre, Los Gatos, California ([email protected])
Jonathan Morton, Somero, Finland ([email protected])

arXiv:1804.07617v2 [cs.NI] 25 May 2018

Abstract—The last several years have seen a renewed interest in smart queue management to curb excessive network queueing delay, as people have realised the prevalence of bufferbloat in real networks. However, for an effective deployment at today's last mile connections, an improved queueing algorithm is not enough in itself, as often the bottleneck queue is situated in legacy systems that cannot be upgraded. In addition, features such as per-user fairness and the ability to de-prioritise background traffic are often desirable in a home gateway.

In this paper we present Common Applications Kept Enhanced (CAKE), a comprehensive network queue management system designed specifically for home Internet gateways. CAKE packs several compelling features into an integrated solution, thus easing deployment. These features include: bandwidth shaping with overhead compensation for various link layers; reasonable DiffServ handling; improved flow hashing with both per-flow and per-host queueing fairness; and filtering of TCP ACKs.

Our evaluation shows that these features offer compelling advantages, and that CAKE has the potential to significantly improve performance of last-mile internet connections.

I. INTRODUCTION

Eliminating bufferbloat has been recognised as an important component in ensuring acceptable performance of internet connections, especially as applications and users demand ever lower latencies. The last several years have established that Active Queue Management (AQM) and Fairness Queueing are effective solutions to the bufferbloat problem, and several algorithms have been proposed and evaluated (e.g., [1]–[3]).

However, while modern queueing algorithms can effectively control bufferbloat, effective deployment presents significant challenges. The most immediate challenge is that the home gateway device is often not directly in control of the bottleneck link, because queueing persists in drivers or firmware of devices that cannot be upgraded [1]. In addition, other desirable features in a home networking context (such as per-user fairness, or the ability to explicitly de-prioritise background applications) can be challenging to integrate with existing queueing solutions. To improve upon this situation, we have developed Common Applications Kept Enhanced (CAKE), which is a comprehensive network queue management system designed specifically for the home router use case.

A compelling benefit of CAKE is that it takes state of the art solutions and integrates them to provide:
• a high-precision rate-based bandwidth shaper that includes overhead and link layer compensation features for various link types.
• a state of the art fairness queueing scheme that simultaneously provides both host and flow isolation.
• a Differentiated Services (DiffServ) prioritisation scheme with rate limiting of high-priority flows and work-conserving bandwidth borrowing behaviour.
• TCP ACK filtering that increases achievable throughput on highly asymmetrical links.

CAKE is implemented as a queueing discipline (qdisc) for the Linux kernel. It has been deployed as part of the OpenWrt firmware for the last several years and is in the process of being submitted for inclusion in the mainline Linux kernel.¹

The rest of this paper describes the design and implementation of CAKE and is organised as follows: Section II outlines the desirable features of a comprehensive queue management system for a home router, and recounts related work in this space. Section III describes the design and implementation of CAKE in more detail, and Section IV evaluates the performance of the various features. Finally, Section V concludes.

¹ We include links to the source code, along with the full evaluation dataset, in an online appendix [4].

II. BACKGROUND AND RELATED WORK

As mentioned initially, CAKE is designed to run on a home network gateway. We have gathered significant experience with implementing such a system in the form of the Smart Queue Management (SQM) system shipped in the OpenWrt router firmware project, which has guided the design of CAKE.

In this section we provide an overview of the problems CAKE is designed to address. We are not aware of any previous work addressing the home gateway queue management challenges as a whole. However, several of the issues that CAKE addresses have been subject of previous work, and so the following subsections serve as both an introduction to the design space and an overview of related work. The four problems we seek to address are bandwidth shaping, queue management and fairness, DiffServ handling and TCP ACK filtering. These are each treated in turn in the following sections. As outlined below, each of the issues that CAKE is designed to handle has been addressed separately before.

A. Bandwidth Shaping

A queue management algorithm is only effective if it is in control of the bottleneck queue. Thus, queueing in lower layers needs to be eliminated, which has been achieved in Linux for Ethernet [5] and WiFi [6]. However, eliminating queueing at the link layer is not always possible, either because the driver source code is unavailable, or because the link layer is implemented in inaccessible hardware or firmware (either on the same device or a separate device, such as a DSL modem).

As an alternative, queueing in the lower layers can be avoided by deploying a bandwidth shaper as part of the queue management system. By limiting the traffic traversing the bottleneck link to a bandwidth that is slightly less than the physical capacity of the link itself, queueing at the physical bottleneck can be eliminated and bufferbloat avoided. Such bandwidth shaping can be performed by a token bucket-based shaper (as is well-known from ATM networks, e.g., [7]), or by a rate-based shaper (which is known from video streaming applications, e.g., [8]).

The use of a shaper to move the link bottleneck wastes the bandwidth that is the difference between the actual physical link capacity and the set-point of the shaper. To limit this waste, the shaper needs to be set as close to the actual link bandwidth as possible, while avoiding sending bursts of packets at a rate that is higher than the actual capacity. To achieve this, accurate timing information on a per-packet basis is needed. In addition, the shaper must account for link-layer framing and overhead. For instance, DSL links using ATM framing split up data packets into an integer number of fixed-size cells, which means that the framing overhead is a step function of packet size, rather than a fixed value.

B. Queue Management

Having control of the bottleneck queue makes it possible to implement effective queue management that can all but eliminate bufferbloat. Such a queue management scheme usually takes the form of an Active Queue Management (AQM) algorithm, combined with a form of fairness queueing (FQ). Several such schemes exist, and extensive evaluation is available in the literature (e.g., [1]–[3], [9]–[11]).

Among the state of the art algorithms in modern queue management is the FQ-CoDel algorithm [12]. FQ-CoDel implements a hybrid AQM/fairness queueing scheme which isolates flows using a hashing scheme and schedules them using a Deficit Round-Robin (DRR) [13] scheduler. In addition, FQ-CoDel contains an optimisation that provides implicit service differentiation for sparse (low-bandwidth) flows, similar to [14], [15]. Evaluations of FQ-CoDel have shown that it achieves low queueing latency and high utilisation under a variety of scenarios [1], [3].

However, while the FQ-CoDel scheduler provides flow isolation and fairness, the transport layer flow is not always the right level of fairness in the home gateway use case. Often, additional isolation between hosts on the network is desirable; and indeed this per-host isolation was the most requested feature of the SQM system. Host isolation is straightforward to implement in place of flow fairness in any fairness queueing based scheme (by simply changing the function that maps packets into different queues), but we are not aware of any practical schemes prior to CAKE that implement both host and flow fairness.

C. DiffServ Handling

Even though flow-based fairness queueing offers a large degree of separation between traffic flows, it can still be desirable to explicitly treat some traffic as higher priority, and to have the ability to mark other traffic as low priority. Since a home network generally does not feature any admission control, any prioritisation scheme needs to be robust against attempts at abuse (so, e.g., a strict priority queue does not work well). In addition, enabling prioritisation should not affect the total available bandwidth in the absence of marked traffic, as that is likely to cause users to turn the feature off.

Prioritisation of different traffic classes can be performed by reacting to DiffServ markings [16]. This is commonly used in WiFi networks, where DiffServ code points map traffic into four priority levels [17]. For the home gateway use case, various schemes have been proposed in the literature (e.g., [18]), but as far as we are aware, none have seen significant deployment.

D. TCP ACK Filtering

TCP ACK filtering is an optimisation that has seen some popularity in highly asymmetrical networks [19], and especially in cable modem deployments [20]. The technique involves filtering (or thinning) TCP acknowledgement (ACK) packets by inspecting queues and dropping ACKs if a TCP flow has several consecutive ACKs queued. This can improve performance on highly asymmetrical links, where the reverse path does not have sufficient capacity to transport the ACKs produced by the forward path TCP flow. However, ACK filtering can also have detrimental effects on performance, for instance due to cross-layer interactions [21].

III. THE DESIGN OF CAKE

The design of CAKE builds upon the basic fairness scheduler design of FQ-CoDel, but adds features to tackle the areas outlined in the previous section. The following sections outline how CAKE implements each of these features.

A. Bandwidth Shaping

CAKE implements a rate-based shaper, which works by scheduling packet transmission at precise intervals using a virtual transmission clock. The clock is initialised by the first packet to arrive at an empty queue, and thereafter is incremented by the calculated serialisation delay of each transmitted packet. Packets are delayed until the system time has caught up with the virtual clock. If the clock schedule is reached while the queue is empty, the clock is reset and the link goes idle.

This shaper handles bandwidth ranging over several orders of magnitude, from several Kbps to several Gbps.
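The virtual-clock mechanism described above can be sketched as follows. This is an illustrative Python model, not the kernel implementation (which uses integer nanosecond arithmetic and schedules a timer interrupt instead of returning None):

```python
class VirtualClockShaper:
    """Rate-based shaper sketch: a packet is released only once the
    virtual transmission clock (t_next) has been reached."""

    def __init__(self, rate_bps):
        self.time_per_byte = 8.0 / rate_bps  # seconds per byte on the wire
        self.queue = []
        self.t_next = 0.0  # virtual clock: next eligible transmission time

    def enqueue(self, pkt_len, now):
        # A packet arriving at an empty queue (re)initialises the clock.
        if not self.queue and self.t_next < now:
            self.t_next = now
        self.queue.append(pkt_len)

    def dequeue(self, now):
        # Delay packets until system time catches up with the clock.
        if not self.queue or now < self.t_next:
            return None  # the real qdisc schedules an interrupt at t_next
        pkt_len = self.queue.pop(0)
        # Advance the clock by this packet's serialisation delay.
        self.t_next += pkt_len * self.time_per_byte
        return pkt_len

# On a 10 Mbps link, a 1250-byte packet occupies the wire for 1 ms,
# so back-to-back packets become eligible 1 ms apart.
shaper = VirtualClockShaper(10_000_000)
shaper.enqueue(1250, now=1.0)
print(shaper.dequeue(now=1.0))  # → 1250
```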
In addition, the rate-based shaper does not require a burst parameter, which simplifies configuration as compared to a token-bucket shaper. It also eliminates the initial burst observed from token-bucket shapers after an idle period. This is important for controlling the bottleneck queue, as this initial burst would result in queueing at the real bottleneck link.

Algorithm 1 Shaping and overhead compensation algorithm. T_next is the time at which the next packet is eligible for transmission.
 1: function ENQUEUE(pkt)
 2:   net_len ← pkt.len − NETWORK_OFFSET(pkt)
 3:   adj_len ← net_len + overhead
 4:   if ATM framing is enabled then
 5:     adj_len ← CEILING(adj_len / 48) * 53
 6:   else if PTM framing is enabled then
 7:     adj_len ← CEILING(adj_len / 64) * 65
 8:   pkt.adj_len ← adj_len
 9:   if backlog is zero and T_next is after Now then
10:     T_next ← Now
11: function DEQUEUE
12:   if T_next is after Now then
13:     Schedule interrupt at T_next
14:     return Nil
15:   pkt ← Choose Packet
16:   T_next ← T_next + pkt.adj_len * time_per_byte
17:   return pkt

1) Overhead and Framing Compensation: As mentioned in Section II-A above, the shaper accounts for the actual size of a packet on the wire, including any encapsulation and overhead, which allows the rate to be set closer to the actual bottleneck bandwidth, thus eliminating waste. We believe it is safe to set a rate within 0.1% of the actual link rate when the overhead compensation is configured correctly, with a margin mainly required to accommodate slight variations in the actual bottleneck link bandwidth, caused by, e.g., clock drift in the hardware.

CAKE implements an overhead compensation algorithm which begins by determining the size of the network-layer packet, stripped of any MAC layer encapsulation. Having determined the network-layer packet size, the configured overhead can be added to yield the correct on-the-wire packet size, followed optionally by a specialised adjustment for ATM or PTM framing. This algorithm is shown in Algorithm 1.
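The per-packet size adjustment of Algorithm 1 can be sketched in Python as follows (a simplified model of the pseudocode, not the kernel code; 48/53 are the ATM cell payload and total sizes, and 64/65 reflects PTM's 64b/65b encoding):

```python
import math

def adjusted_len(pkt_len, network_offset=14, overhead=0,
                 atm=False, ptm=False):
    """Model of CAKE's overhead compensation (Algorithm 1).

    pkt_len: packet length as seen by the kernel, incl. MAC header
    network_offset: bytes of MAC-layer encapsulation to strip
    overhead: configured per-packet link-layer overhead to add back
    """
    net_len = pkt_len - network_offset   # network-layer packet size
    adj_len = net_len + overhead         # on-the-wire size
    if atm:
        # ATM: payload split into 48-byte cells, each 53 bytes on the wire,
        # so framing overhead is a step function of packet size.
        adj_len = math.ceil(adj_len / 48) * 53
    elif ptm:
        # PTM: 65 bytes carried on the wire per 64 bytes of payload.
        adj_len = math.ceil(adj_len / 64) * 65
    return adj_len

# A 1514-byte Ethernet frame with 10 bytes of configured overhead on an
# ATM link: 1500 + 10 = 1510 bytes -> 32 cells -> 1696 bytes on the wire.
print(adjusted_len(1514, overhead=10, atm=True))  # → 1696
```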
Using the network-layer packet size and adding a manually configured overhead value is required because the values reported by the kernel are often wrong due to idiosyncrasies of the CPE unit. While this does make configuration a bit more complex, we seek to alleviate this by providing keywords for commonly used configurations.

As part of the overhead compensation, CAKE also optionally splits "super packets" generated by hardware offload features. These super packets are essential for operating at high bandwidths, as they help amortise fixed network stack costs over several packets. However, at lower bandwidths they can hurt latency, in particular when a link with a high physical bandwidth is shaped to a lower rate. For this reason, we conditionally split super packets when shaping at rates lower than 1 Gbps. This allows CAKE to ensure low latency at lower rates, while still scaling to full line rate on a 40 Gbps link.

B. Flow Isolation and Hashing

CAKE replaces the direct hash function used in FQ-CoDel with an 8-way set-associative hash.
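A minimal sketch of such an 8-way set-associative lookup is shown below. This is illustrative Python, not the qdisc's data structure; the one-tag-per-queue bookkeeping is our simplification:

```python
EMPTY = None

class SetAssociativeHash:
    """Map flow keys to queue indices, with `ways` queues per set."""

    def __init__(self, n_queues=1024, ways=8):
        self.ways = ways
        self.sets = n_queues // ways
        self.tags = [EMPTY] * n_queues  # flow tag stored per queue

    def queue_for(self, flow_key):
        tag = hash(flow_key)
        base = (tag % self.sets) * self.ways  # first queue of the set
        # First pass: is a queue in this set already tagged with this flow?
        for i in range(base, base + self.ways):
            if self.tags[i] == tag:
                return i
        # Second pass: claim an empty queue (way) in the set.
        for i in range(base, base + self.ways):
            if self.tags[i] is EMPTY:
                self.tags[i] = tag
                return i
        # All ways occupied: a genuine collision; share the first queue.
        return base
```

With this structure, up to eight flows whose hashes land in the same set still get distinct queues; only the ninth concurrent flow actually collides.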
While set-associative hashing has been well-known for decades as a means to improve the performance of CPU caches [22], it has not seen much use in packet scheduling. Conceptually, a k-way set-associative hash with n total buckets can be thought of as a plain hash with n/k buckets that is only considered to have a collision if more than k items hash into the same bucket. As can be seen in Figure 1, this significantly reduces the hash collision probability up to the point where the number of flows is larger than the number of queues.²

[Fig. 1: Probability that a new flow will experience a hash collision, as a function of the number of active flows ("Regular" vs "8-way"). 1024 total queues.]

² See how we computed these probabilities in the online appendix.
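Curves of the shape shown in Fig. 1 can be reproduced under the assumption that flows hash uniformly and independently (we do not claim this is exactly the appendix's computation). With n total queues and k ways there are n/k sets, each chosen with probability k/n, and a new flow collides when its set already holds at least k flows:

```python
from math import comb

def collision_prob(active_flows, n_queues=1024, ways=1):
    """P(new flow collides) for a k-way set-associative hash.

    The number of existing flows in the new flow's set is
    Binomial(active_flows, ways / n_queues); a collision means all
    `ways` queues of that set are already occupied.
    """
    p = ways / n_queues
    m = active_flows
    # P(X >= ways) = 1 - P(X <= ways - 1)
    return 1.0 - sum(comb(m, i) * p**i * (1 - p)**(m - i)
                     for i in range(ways))

# Regular (1-way) hash vs 8-way set-associative, as in Fig. 1:
for m in (100, 500, 1000):
    print(m, collision_prob(m), collision_prob(m, ways=8))
```

For ways=1 this reduces to the familiar 1 - (1 - 1/n)^m, and the 8-way variant stays far below it until the number of flows approaches the number of queues.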
The mechanism takes 7: hosts[src_hash].refcnt_src++ 8: hosts[dst_hash].refcnt_dst++ advantage of the per-flow queueing by scanning the queue after 9: flow.active ← 1 every packet enqueue, to identify a pure ACK (i.e., an ACK 10: flow.src_id ← src_hash with no data) that was made redundant by the newly enqueued 11: flow.dst_id ← dst_hash packet. An ACK is only filtered if the newly enqueued packet 12: function GET_QUANTUM(flow) contains an acknowledgement of strictly more bytes than the 13: refcnt_src ← hosts[flow.src_id].refcnt_src one being filtered. In particular, this means that duplicate 14: refcnt_dst ← hosts[flow.dst_id].refcnt_dst 15: host_load ← MAX(refcnt_src, refcnt_dst, 1) ACKs are not filtered, so TCP’s fast retransmit mechanism 16: return flow.quantum/host_load is not affected. In addition, the filter parses TCP headers and only drops a packet if that will not result in loss of information at the sender; and packets with unknown headers are never dropped, to avoid breaking future TCP extensions. The filter hashes. These buckets do not contain a queue of packets, but has two modes of operation: a conservative mode that will instead a data structure that keeps two reference counts for always keep at least two redundant ACKs queued, and an each IP address, which track the number of active flows with aggressive mode, that only keeps the most recently enqueued the given address as source and destination, respectively. ACK. The per-IP reference counts are used to modify the quantum for each active flow. When a flow is scheduled, its "host load" is calculated as the maximum of the reference counts for its IV. PERFORMANCE EVALUATION source and destination IP addresses. The effective quantum of In this section, we present a performance evaluation of the flow is simply divided by this load value, which achieves CAKE. All tests are performed on a testbed that emulates the desired scaling. a pair of hosts communicating through a low-bandwidth link. 
IV. PERFORMANCE EVALUATION

In this section, we present a performance evaluation of CAKE. All tests are performed on a testbed that emulates a pair of hosts communicating through a low-bandwidth link. We use the Flent testing tool [23] to run the tests, and the data files are available on the companion web site.¹ Unless otherwise stated below, all tests are run on a symmetrical 10 Mbps link with 50 ms baseline latency. Our basic test is the Real-Time Response Under Load (RRUL) test, which consists of running four TCP flows in each traffic direction, along with three different latency measurement flows [24].

[Fig. 2: Baseline throughput and latency of CAKE and FQ-CoDel on a 10 Mbps link (mean TCP goodput and mean induced latency, download and upload).]

As can be seen in Figure 2, the baseline performance of CAKE is comparable to that of FQ-CoDel: both achieve low latency and high throughput in the baseline test. This is expected, since CAKE is derived from FQ-CoDel. For a more comprehensive comparison of FQ-CoDel with other queue management algorithms, we refer the reader to [1]. Instead, the remainder of this evaluation focuses on the features outlined in the previous sections.

A. Host Isolation

To evaluate the host isolation feature of CAKE, we run a varying number of TCP flows between two source hosts and four destination hosts. Source host A runs one flow to each of destination hosts A and B, and two flows to destination host C, while source host B runs one flow to each of destination hosts C and D. This makes it possible to demonstrate the various working modes of CAKE's host isolation feature.
[Fig. 3: Host isolation performance with TCP flows from two source hosts to four destination hosts. The columns show different algorithms (cake, cake_dst, cake_src, cake_triple, fq_codel); each bar shows the average flow goodput.]

The result of this test is shown in Figure 3. It shows four configurations of CAKE (no host isolation, source host isolation, destination host isolation and triple isolation) and a test with FQ-CoDel as the queue management algorithm. As can be seen in the figure, both FQ-CoDel and CAKE with no host isolation provide complete fairness between all six flows.

The figure also clearly shows the various modes of flow isolation supported by CAKE: In destination fairness mode (second column), the four destination hosts get the same total share, which results in each of the three flows to destination host C getting 1/3 of the bandwidth of the three other hosts (which only have one flow each). Similarly, in source fairness mode (third column), the two source hosts share the available capacity, which results in the two flows from source B getting twice the share each compared to the four flows from host A. In the triple isolation case, we see the flow bandwidths correspond to the quantum scaling outlined in Algorithm 2: The first four flows get their quantum scaled by 1/4 since there are four flows active from host A. The fifth flow gets its quantum scaled by 1/3 since there are three flows active to host C. And finally, the last flow gets its quantum scaled by 1/2 as there are two flows active from host B.

B. DiffServ Handling

To demonstrate the DiffServ prioritisation features of CAKE we perform two tests: An RRUL test with each flow marked with a different DiffServ priority, and another test where a high-priority fixed-rate flow competes with several TCP flows.

[Fig. 4: TCP flows on different DiffServ code points (BK, BE, CS5, EF), without DiffServ handling and in the DiffServ4 mode.]

The result of the former test is seen in Figure 4. This shows that when DiffServ mode is not enabled, all four flows get the same share of the available bandwidth, while in the DiffServ-enabled case, the Best Effort (BE) flow gets most of the bandwidth. This latter effect is important for two reasons: First, it shows that a flow marked as background (BK) is successfully de-prioritised and gets less bandwidth. Secondly, it shows that the high-priority flows (CS5 and EF) are limited so as to not use more than the share of the bandwidth allocated to the high-priority DiffServ classes.

To look at the latency performance of a high-priority flow, we turn to Figure 5. This shows the latency over time of a fixed-rate 2 Mbps flow, which marks its packets with the high-priority EF DiffServ marking. This is meant to represent a real-time video conversation. In the test, the flow competes with 32 bulk TCP flows. As can be seen in the figure, both FQ-CoDel and CAKE with DiffServ prioritisation disabled fail to ensure low latency for the high-priority flow. Instead, when the bulk flows start after five seconds, a large latency spike is seen, since the real-time flow has to wait for the initial packets of the 32 TCP flows. This causes the real-time flow to build a large queue for itself (since it does not respond to congestion signals), which then drains slowly back to a steady state around 200 ms (for CAKE) or oscillating between 50 and 500 ms (for FQ-CoDel). In contrast, the DiffServ-enabled CAKE keeps the real-time flow completely isolated from the bulk TCP flows, ensuring it sees no added latency over the duration of the test.
In contrast, the DiffServ-enabled CAKE keeps the share, which results in each of the three flows to destination real-time flow completely isolated from the bulk TCP flows, host C getting 1/3 of the bandwidth of the three other hosts ensuring it sees no added latency over the duration of the test. (which only have one flow each). Similarly, in source fairness 1250 mode (third column), the two source hosts share the available capacity, which results in the two flows from source B getting 1000 twice the share each compared to the four flows from host A. In the triple isolation case, we see the flow bandwidths 750 correspond to the quantum scaling outlined in Algorithm2: The first four flows get their quantum scaled by 1/4 since 500 there are four flows active from host A. The fifth flow gets its quantum scaled by 1/3 since there are three flows active to 250 host C. And finally, the last flow gets its quantum scaled by Induced one-way delay (ms) 1/2 as there are two flows active from host B. 0 0 20 40 60 80 100 120 140 B. DiffServ Handling Time (s) Cake Cake DiffServ FQ-CoDel To demonstrate the DiffServ prioritisation features of CAKE we perform two tests: An RRUL test with each flow marked Fig. 5: Latency over time of a 2 Mbps fixed-rate flow with 32 with a different DiffServ priority, and another test where a competing bulk flows on a 10 Mbps link. The Y-axis shows high-priority fixed-rate flow competes with several TCP flows. additional latency above the base latency of 50 ms. The bulk The result of the former test is seen in Figure4. This flows start after 5 seconds. shows that when DiffServ mode is not enabled, all four flows C. ACK Filtering ACKNOWLEDGEMENTS Figure6 shows the performance of ACK filtering on a The authors would like to thank the Bufferbloat and highly asymmetrical link with 30 Mbps download capacity OpenWrt communities for their work on the implementation and only 1 Mbps upload capacity. On this link, we run four and testing of CAKE. 
In particular, Kevin Darbyshire-Bryant simultaneous TCP uploads and four simultaneous TCP down- was instrumental in enabling NAT-awareness, Ryan Mounce loads. The results of this are shown in Figure6, which shows contributed the original ACK filtering code, Sebastian Moeller the aggregate throughput of all four flows in each direction, helped get the overhead compensation right and Anil Agarwal along with the added latency of a separate measurement flow. helped with the hash collision probability calculations. Values are normalised to the baseline without ACK filtering REFERENCES to be able to fit on a single graph. As the figure shows, we [1] T. Høiland-Jørgensen, P. Hurtig, and A. Brunstrom, “The Good, the see a goodput improvement of around 15% in the downstream Bad and the WiFi: Modern AQMs in a Residential Setting,” Computer direction caused by either type of ACK filtering, which shows Networks, vol. 89, pp. 90 – 106, 2015. [2] I. Järvinen and M. Kojo, “Evaluating CoDel, PIE, and HRED AQM that insufficient bandwidth for ACKs can impact transfers techniques with load transients.” IEEE, 2014. in the other direction. For upload, the conservative filtering [3] N. Khademi, D. Ros, and M. Welzl, “The new AQM kids on the block: increases goodput by about 10%, while the aggressive filtering Much ado about nothing?” Technical Report, Oslo University, 2013. [4] T. Høiland-Jørgensen, D. Täht, and J. Morton, “Piece of CAKE: A increases throughput by as much as 40%, simply by reducing Comprehensive Queue Management Solution for Home Gateways,” the bandwidth taken up by ACK packets. We attribute the Apr. 2018. [Online]. Available: https://doi.org/10.5281/zenodo.1226887 increase in latency to increased congestion in the downlink [5] J. Corbet, “Network transmit queue limits,” LWN Article, August 2011. [Online]. Available: https://lwn.net/Articles/454390/ direction, which is alleviated somewhat by fewer ACKs being [6] T. Høiland-Jørgensen, M. Kazior, D. Täht, P. 
Hurtig, and A. Brunstrom, queued in the upstream direction in the aggressive case. The “Ending the Anomaly: Achieving Low Latency and Airtime Fairness in absolute magnitude of the latency increase is only 5 ms. WiFi,” in 2017 USENIX Annual Technical Conference (USENIX ATC 17). Santa Clara, CA: USENIX Association, 2017, pp. 139–151. [7] G. Niestegge, “The ‘leaky bucket’policing method in the atm (asyn- Download Upload Induced latency chronous transfer mode) network,” International Journal of Communi- 3.0 cation Systems, vol. 3, no. 2, pp. 187–197, 1990. 1.4 [8] A. Eleftheriadis and D. Anastassiou, “Constrained and general dynamic rate shaping of compressed digital video,” in International Conference 1.3 2.5 on Image Processing, vol. 3. IEEE, 1995, pp. 396–399. [9] V. P. Rao, M. P. Tahiliani, and U. K. K. Shenoy, “Analysis of sfqCoDel 2.0 for active queue management,” in ICADIWT 2014. IEEE, 2014. 1.2 [10] R. Adams, “Active queue management: A survey,” IEEE Communica- tions Surveys & Tutorials, vol. 15, no. 3, pp. 1425–1476, 2013. 1.1 1.5 [11] N. Benameur, F. Guillemin, and L. Muscariello, “Latency reduction in home access gateways with shortest queue first,” in Proc. ISOC 1.0 1.0 Workshop on Reducing Internet Latency, 2013. Normalised mean TCP goodput [12] T. Hoeiland-Joergensen, P. McKenney, D. Taht, J. Gettys, and E. Du- mazet, “The Flow Queue CoDel Packet Scheduler and Active Queue Normalised mean induced latency Management Algorithm,” RFC 8290, RFC Editor, Jan. 2018. [13] M. Shreedhar and G. Varghese, “Efficient using deficit Aggressive Aggressive Aggressive No filtering No filtering No filtering round-robin,” IEEE/ACM Transactions on Networking, Jun. 1996. Conservative Conservative Conservative [14] A. Kortebi, S. Oueslati, and J. Roberts, “Implicit service differentiation Fig. 6: ACK filtering performance on a 30/1 Mbps link. The using deficit round robin,” ITC19, 2005. graph scales are normalised to the "No filtering" case. The [15] S. O. 
Abdesselem Kortebi, “Cross-Protect: Implicit Service Differentia- tion and Admission Control,” pp. 56 – 60, 2004. download and upload value ranges are 24.5-27.5 Mbps and [16] J. Babiarz, K. Chan, and F. Baker, “Configuration Guidelines for 0.45-0.7 Mbps, respectively. The latency range is 2.6-7.5 ms. DiffServ Service Classes,” RFC 4594 (Informational), RFC Editor, Aug. 2006, updated by RFC 5865. [17] T. Szigeti, J. Henry, and F. Baker, “Mapping Diffserv to IEEE 802.11,” RFC 8325 (Proposed Standard), RFC Editor, Feb. 2018. V. CONCLUSIONS [18] W.-S. Hwang and P.-C. Tseng, “A qos-aware residential gateway with bandwidth management,” IEEE Transactions on Consumer Electronics, vol. 51, no. 3, pp. 840–848, 2005. CAKE is a comprehensive queue management system for [19] H. Wu, J. Wu, S. Cheng, and J. Ma, “Ack filtering on bandwidth home gateways, that packs several compelling features into asymmetry networks,” in APCC/OECC’99, vol. 1. IEEE, 1999. an integrated solution, with reasonable defaults to ease con- [20] L. Storfer, “Enhancing cable modem tcp performance,” Texas Instru- ments Inc. white paper, 2003. figuration. These features include: bandwidth shaping with [21] H. Kim, H. Lee, S. Shin, and I. Kang, “On the cross-layer impact of tcp overhead compensation for various link layers; reasonable ack thinning on ieee 802.11 wireless mac dynamics,” in 64th Vehicular DiffServ handling; improved flow hashing with both per-flow Technology Conference. IEEE, 2006. [22] A. J. Smith, “A comparative study of set associative memory mapping and per-host queueing fairness; and filtering of TCP ACKs. algorithms and their use for cache and main memory,” IEEE Transac- Our evaluation shows that these features offer compelling tions on Software Engineering, no. 2, pp. 121–130, 1978. advantages, and we believe CAKE has the potential to signif- [23] T. Høiland-Jørgensen, C. A. Grazia, P. Hurtig, and A. Brunström, “Flent: The FLExible Network Tester,” in VALUETOOLS, 2017. 
icantly improve the performance of last-mile internet connec- [24] D. Taht, “Realtime response under load (rrul) test,” November tions. CAKE is open source and ready for deployment, and 2012. [Online]. Available: https://www.bufferbloat.net/projects/bloat/ already ships in the OpenWrt router firmware distribution. wiki/RRUL_Spec/