User Space Network Drivers

Paul Emmerich, Maximilian Pudelko (Technical University of Munich / Open Networking Foundation), Simon Bauer, Stefan Huber, Thomas Zwickl, Georg Carle
Chair of Network Architectures and Services, Department of Informatics, Technical University of Munich
[email protected]

ABSTRACT
The rise of packet processing frameworks like DPDK and netmap makes low-level code more accessible to developers and researchers. Previously, driver code was hidden in the kernel and rarely modified—or even looked at—by developers working at higher layers. These barriers are gone nowadays, yet developers still treat user space drivers as black-boxes magically accelerating applications. We want to change this: every researcher building high-speed network applications should understand the intricacies of the underlying drivers, especially if they impact performance. We present ixy, a user space network driver designed for simplicity and educational purposes to show that fast packet IO is not black magic but careful engineering. ixy focuses on the bare essentials of user space packet processing: a packet forwarder including the whole NIC driver uses less than 1,000 lines of C code.
This paper is partially written in tutorial style on the case study of our implementations of drivers for both the Intel 82599 family and for virtual VirtIO NICs. The former allows us to reason about driver and framework performance on a stripped-down implementation to assess individual optimizations in isolation. VirtIO support ensures that everyone can run it in a virtual machine.
Our code is available as free and open source under the BSD license at https://github.com/emmericp/ixy.

KEYWORDS
Tutorial, performance evaluation, DPDK, netmap

ANCS '19, 2019. © 2019 Association for Computing Machinery.

1 INTRODUCTION
Low-level packet processing on top of traditional sockets is too slow for modern requirements and was therefore often done in the kernel in the past. Two examples for packet forwarders utilizing kernel components are Open vSwitch [40] and the Click modular router [32]. Writing kernel code is not only a relatively cumbersome process with slow turn-around times, it also proved to be too slow for specialized applications. Open vSwitch was since extended to include DPDK [7] as an optional alternative backend to improve performance [36]. Click was ported to both netmap [41] and DPDK for the same reasons [2]. Other projects also moved kernel-based code to specialized user space code [24, 43].
Developers and researchers often treat DPDK as a black-box that magically increases speed. One reason for this is that DPDK, unlike netmap and others, does not come from an academic background. It was first developed by Intel and then moved to the Linux Foundation in 2017 [27]. This means that there is no academic paper describing its architecture or implementation. The netmap paper [41] is often used as surrogate to explain how user space packet IO frameworks work in general. However, DPDK is based on a completely different architecture than seemingly similar frameworks.
Abstractions hiding driver details from developers are an advantage: they remove a burden from the developer. However, all abstractions are leaky, especially when performance-critical code such as high-speed networking applications is involved.
We therefore believe it is crucial to have at least some insights into the inner workings of drivers when developing high-speed networking applications.
We present ixy, a user space packet framework that is architecturally similar to DPDK [7] and Snabb [15]. Both use full user space drivers, unlike netmap [41], PF_RING [34], PFQ [4], or similar frameworks that rely on a kernel driver. ixy is designed for educational use only, i.e., you are meant to use it to understand how user space packet frameworks and drivers work, not to use it in a production environment. Our whole architecture, described in Section 3, aims at simplicity and is trimmed down to the bare minimum. We currently support the Intel ixgbe family of NICs (cf. Section 4) and virtual VirtIO NICs (cf. Section 6). A packet forwarding application is less than 1,000 lines of C code including the whole poll-mode driver; the implementation is discussed in Section 4. It is possible to read and understand drivers found in other frameworks, but ixy's driver is at least an order of magnitude simpler than other implementations. For example, DPDK's implementation of the ixgbe driver needs 5,400 lines of code just for receiving and sending packets in a highly optimized way, offering multiple code paths using different vector SIMD instruction sets. ixy's receive and transmit path for the same driver is only 127 lines of code.
It is not our goal to support every conceivable scenario, hardware feature, or optimization. We aim to provide an educational platform for experimentation with driver-level features or optimizations. ixy is available under the BSD license for this purpose [38].



Figure 1: Architecture of user space packet processing frameworks using an in-kernel driver, e.g., netmap, PF_RING ZC, or PFQ.

Figure 2: Architecture of full user space network drivers, e.g., DPDK, Snabb, or ixy.

2 BACKGROUND AND RELATED WORK
A multitude of packet IO frameworks have been built over the past years, each focusing on different aspects. They can be broadly categorized into two categories: those relying on a driver running in the kernel (Figure 1) and those that re-implement the whole driver in user space (Figure 2).
Examples for the former category are netmap [41], PF_RING ZC [34], PFQ [4], and OpenOnload [44]. They all use the default driver (sometimes with small custom patches) and an additional kernel component that provides a fast interface based on memory mapping for the user space application. Packet IO is still handled by the kernel driver here, but the driver is attached to the application directly instead of the kernel datapath, see Figure 1. This has the advantage that integrating existing kernel components or forwarding packets to the default network stack is feasible with these frameworks. By default, these applications still provide an application with exclusive access to the NIC. Parts of the NIC can often still be controlled with standard tools like ethtool to configure packet filtering or queue sizes. However, hardware features are often poorly supported, e.g., netmap lacks support for most offloading features [11].
DPDK [7], Snabb [15], and ixy implement the driver completely in user space. DPDK still uses a small kernel module with some drivers, but it does not contain driver logic and is only used during initialization. Snabb and ixy require no kernel code at all, see Figure 2. A main advantage of the full user space approach is that the application has full control over the driver leading to a far better integration of the application with the driver and the hardware. DPDK features the largest selection of offloading and filtering features of all investigated frameworks [6]. The downside is the poor integration with the kernel: DPDK's KNI (kernel network interface) needs to copy packets to pass them to the kernel unlike XDP or netmap which can just pass a pointer. Other advantages of DPDK are its support in the industry, mature code base, and large community. DPDK supports virtually all NICs commonly found in servers [5], far more than any other framework we investigated here.
Note that none of these two advantages is superior to the other, they are simply different approaches for a similar problem. Each solution comes with unique advantages and disadvantages depending on the exact use case.
netmap [41] and XDP [23] are good examples of integrating kernel components with specialized applications. netmap (a standard component in FreeBSD and also available on Linux) offers interfaces to pass packets between the kernel network stack and a user space app, it can even make use of the kernel's TCP/IP stack with StackMap [46]. Further, netmap supports using a NIC with both netmap and the kernel simultaneously by using hardware filters to steer packets to receive queues either managed by netmap or the kernel [3]. XDP is technically not a user space framework: the code is compiled to eBPF which is run by a JIT in the kernel, this restricts the choice of programming language to those that can target eBPF bytecode (typically a restricted subset of C is used). It is a default part of the Linux kernel nowadays and hence very well integrated. It is well-suited to implement firewalls that need to pass on traffic to the network stack [14]. More complex applications can be built on top of it with AF_XDP sockets, resulting in an architecture similar to netmap applications. Despite being part of the kernel, XDP does not yet work with all drivers as it requires a new memory model for all supported drivers. At the time of writing, XDP in kernel 4.19 (current LTS) supports fewer drivers than DPDK [5, 22] and does not support forwarding to different NICs.
ixy is a full user space driver as we want to explore writing drivers and not interfacing with existing drivers. Our architecture is based on ideas from both DPDK and Snabb. The initialization and operation without loading a driver is inspired by Snabb, the API based on explicit memory management, batching, and driver abstraction is similar to DPDK.

3 DESIGN
Function names and line numbers referring to our implementation are hyperlinked to the source code on GitHub. The language of choice for the explanation here and initial implementation is C as the lowest common denominator of systems programming languages. Implementations in other languages are also available [8]. Our design goals are:
• Simplicity. A forwarding application including a driver should be less than 1,000 lines of C code.
• No dependencies. One self-contained project including the application and driver.
• Usability. Provide a simple-to-use interface for applications built on it.
• Speed. It should be reasonably fast without compromising simplicity, find the right trade-off.
It should be noted that the Snabb project [15] has similar design goals; ixy tries to be one order of magnitude simpler. For example, Snabb targets 10,000 lines of code [25], we target 1,000 lines of code and Snabb builds on Lua with LuaJIT instead of C limiting accessibility.

3.1 Architecture
ixy only features one abstraction level: it decouples the used driver from the user's application. Applications call into ixy to initialize a network device by its PCI address, ixy chooses the appropriate driver automatically and returns a struct containing function pointers for driver-specific implementations. We currently expose packet reception, transmission, and device statistics to the application. Packet APIs are based on explicit allocation of buffers from specialized memory pool data structures.
Applications include the driver directly, ensuring a quick turn-around time when modifying the driver. This means that the driver logic is only a single function call away from the application logic, allowing the user to read the code from a top-down level without jumping between complex abstraction interfaces or even system calls.
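To make this abstraction more concrete, the following is a minimal sketch of what such a device interface can look like in C. The names mirror those used in ixy (struct ixy_device, rx_batch, tx_batch, ixy_init), but the fields and signatures shown here are simplified assumptions rather than the authoritative definitions from the repository.

```c
#include <stdint.h>

struct pkt_buf;      // packet buffer allocated from a memory pool
struct device_stats; // counters for received/sent packets and bytes

// One instance per attached NIC; each driver fills in its own implementations.
struct ixy_device {
    const char* pci_addr;    // PCIe address, e.g., "0000:03:00.0"
    const char* driver_name; // "ixgbe" or "virtio"
    uint16_t num_rx_queues;
    uint16_t num_tx_queues;
    // Driver-specific functionality exposed as plain function pointers:
    uint32_t (*rx_batch)(struct ixy_device* dev, uint16_t queue_id,
                         struct pkt_buf* bufs[], uint32_t num_bufs);
    uint32_t (*tx_batch)(struct ixy_device* dev, uint16_t queue_id,
                         struct pkt_buf* bufs[], uint32_t num_bufs);
    void (*read_stats)(struct ixy_device* dev, struct device_stats* stats);
};

// Probes the PCI address, selects the matching driver (ixgbe or virtio),
// initializes the device, and returns the filled-in struct.
struct ixy_device* ixy_init(const char* pci_addr,
                            uint16_t rx_queues, uint16_t tx_queues);
```

An application built against this interface only ever calls through the function pointers, so the driver logic stays one call away from the application code.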
3.2 NIC Selection
ixy is based on custom user space re-implementations of the Intel ixgbe driver and the VirtIO virtio-net driver cut down to their bare essentials. We tested our ixgbe driver on Intel X550, X540, and 82599ES NICs, virtio-net on qemu with and without vhost, and on VirtualBox. All other frameworks except DPDK are also restricted to very few NIC models (typically 3 or fewer families) and ixgbe is (except for OpenOnload only supporting their own NICs) always supported.
We chose ixgbe for ixy because Intel releases extensive datasheets and the ixgbe NICs are commonly found in commodity servers. These NICs are also interesting because they expose a relatively low-level interface to the drivers. Other NICs like the newer Intel XL710 series or Mellanox ConnectX-4/5 follow a more firmware-driven design: a lot of functionality is hidden behind a black-box firmware running on the NIC and the driver merely communicates via a message interface with the firmware which does the hard work. This approach has obvious advantages such as abstracting hardware details of different NICs allowing for a simpler more generic driver. However, our goal with ixy is understanding the full stack—a black-box firmware is counterproductive here and we have no plans to add support for such NICs.
VirtIO was selected as second driver to ensure that everyone can run the code without hardware dependencies. A second interesting characteristic of VirtIO is that it is based on PCI instead of PCIe, requiring a different approach to implement the driver in user space.

3.3 User Space Drivers in Linux
There are two subsystems in Linux that enable user space drivers: uio and vfio, we support both.
uio exposes all necessary interfaces to full user space drivers via memory mapping files in the sysfs pseudo filesystem. These file-based APIs give us full access to the device without needing to write any kernel code. ixy unloads any kernel driver for the given PCI device to prevent conflicts, i.e., there is no driver loaded for the NIC while ixy is running.
vfio offers more features: IOMMU and interrupts are only supported with vfio. However, these features come at the cost of additional complexity: it requires binding the PCIe device to the generic vfio-pci driver and it then exposes an API via syscalls on special files.
One needs to understand how a driver communicates with a device to understand how a driver can be written in user space. A driver can communicate via two ways with a PCIe device: the driver can initiate an access to the device's Base Address Registers (BARs) or the device can initiate a direct memory access (DMA) to access arbitrary main memory locations. BARs are used by the device to expose configuration and control registers to the drivers. These registers are available either via memory mapped IO (MMIO) or via x86 IO ports depending on the device, the latter way of exposing them is deprecated in PCIe.

3.3.1 Accessing Device Registers. MMIO maps a memory area to device IO, i.e., reading from or writing to this memory area receives/sends data from/to the device. uio exposes all BARs in the sysfs pseudo filesystem, a privileged process can simply mmap them into its address space. vfio provides an ioctl that returns memory mapped to this area. Devices expose their configuration registers via this interface where normal reads and writes can be used to access registers. For example, ixgbe NICs expose all configuration, statistics, and debugging registers via the BAR0 address space. Our implementations of these mappings are in pci_map_resource() in pci.c and in vfio_map_region() in libixy-vfio.c.
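As an illustration of the uio path, the sketch below maps BAR0 through the device's resource0 file in sysfs and accesses a 32 bit register. It is a simplified assumption of what pci_map_resource() and ixy's register helpers do: error handling is omitted and real register offsets would come from the datasheet or ixgbe_type.h.

```c
#include <fcntl.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

// Map BAR0 of a PCI device (e.g., "0000:03:00.0") into our address space.
static volatile uint8_t* map_bar0(const char* pci_addr) {
    char path[128];
    snprintf(path, sizeof(path), "/sys/bus/pci/devices/%s/resource0", pci_addr);
    int fd = open(path, O_RDWR);
    struct stat st;
    fstat(fd, &st); // the file size corresponds to the size of the BAR
    volatile uint8_t* bar = mmap(NULL, st.st_size, PROT_READ | PROT_WRITE,
                                 MAP_SHARED, fd, 0);
    close(fd);
    return bar;
}

// All ixgbe registers are 32 bit wide; reads and writes go straight to the NIC.
static inline uint32_t get_reg32(volatile uint8_t* bar, size_t offset) {
    return *(volatile uint32_t*)(bar + offset);
}

static inline void set_reg32(volatile uint8_t* bar, size_t offset, uint32_t value) {
    *(volatile uint32_t*)(bar + offset) = value;
}
```

Once the mapping exists, configuring the NIC is nothing more than reading and writing these offsets from ordinary user space code.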
VirtIO (in the version we are implementing for compatibility with VirtualBox) is unfortunately based on PCI and not on PCIe and its BAR is an IO port resource that must be accessed with the archaic IN and OUT x86 instructions requiring IO privileges. Linux can grant processes the necessary privileges via ioperm(2) [16], DPDK uses this approach for their VirtIO driver. We found it too cumbersome to initialize and use as it requires either parsing the PCIe configuration space or text files in procfs and sysfs. Linux uio also exposes IO port BARs via sysfs as files that, unlike their MMIO counterparts, cannot be mmaped. These files can be opened and accessed via normal read and write calls that are then translated to the appropriate IO port commands by the kernel. We found this easier to use and understand but slower due to the required syscall. See pci_open_resource() in pci.c and read/write_ioX() in device.h for the implementation.
A potential pitfall is that the exact size of the reads and writes is important, e.g., accessing a single 32 bit register with two 16 bit reads will typically fail and trying to read multiple small registers with one read might not be supported. The exact semantics are up to the device: Intel's ixgbe NICs only expose 32 bit registers that support partial reads (except clear-on-read registers) but not partial writes. VirtIO uses different register sizes and specifies that any access width should work in the mode we are using [35], in practice only aligned and correctly sized accesses work reliably.

3.3.2 DMA in User Space. DMA is initiated by the PCIe device and allows it to read/write arbitrary physical addresses. This is used to access packet data and to transfer the DMA descriptors (pointers to packet data) between driver and NIC. DMA needs to be explicitly enabled for a device via the PCI configuration space, our implementation is in enable_dma() in pci.c for uio and in vfio_enable_dma() in libixy-vfio.c for vfio. DMA memory allocation differs significantly between uio and vfio.

uio DMA memory allocation. Memory used for DMA transfer must stay resident in physical memory. mlock(2) [26] can be used to disable swapping. However, this only guarantees that the page stays backed by memory, it does not guarantee that the physical address of the allocated memory stays the same. The Linux page migration mechanism can change the physical address of any page allocated by the user space at any time, e.g., to implement transparent huge pages and NUMA optimizations [28]. Linux does not implement page migration of explicitly allocated huge pages (2 MiB or 1 GiB pages on x86). ixy therefore uses huge pages which also simplify allocating physically contiguous chunks of memory. Huge pages allocated in user space are used by all investigated full user space drivers, but they are often passed off as a mere performance improvement [21, 42] despite being crucial for reliable allocation of DMA memory.
The user space driver hence also needs to be able to translate its virtual addresses to physical addresses, this is possible via the procfs file /proc/self/pagemap, the translation logic is implemented in virt_to_phys() in memory.c.
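The pagemap lookup can be sketched as follows. This is a simplified assumption of what virt_to_phys() does: pagemap holds one 64 bit entry per virtual page whose lower 55 bits contain the physical frame number; error handling and the root privileges required to read the frame numbers are glossed over.

```c
#include <fcntl.h>
#include <stdint.h>
#include <unistd.h>

// Translate a virtual address of the current process to a physical address.
// Only safe for memory that cannot be migrated, e.g., explicitly allocated
// huge pages that are additionally locked with mlock().
static uintptr_t virt_to_phys(void* virt) {
    long page_size = sysconf(_SC_PAGESIZE);
    int fd = open("/proc/self/pagemap", O_RDONLY);
    uint64_t entry = 0;
    // One 8 byte entry per virtual page, indexed by the virtual page number.
    pread(fd, &entry, sizeof(entry),
          (uintptr_t)virt / page_size * sizeof(entry));
    close(fd);
    uint64_t pfn = entry & ((1ULL << 55) - 1); // bits 0-54: physical frame number
    return pfn * page_size + (uintptr_t)virt % page_size;
}
```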
vfio DMA memory allocation. The previous DMA memory allocation scheme is specific to a quirk in Linux on x86 and not portable. vfio features a portable way to allocate memory that internally calls dma_alloc_coherent() in the kernel like an in-kernel driver would. This syscall abstracts all the messy details and is implemented in our driver in vfio_map_dma() in libixy-vfio.c. It requires an IOMMU and configures the necessary mapping to use virtual addresses for the device.

DMA and cache coherency. Both of our implementations require a CPU architecture with cache-coherent DMA access. Older CPUs might not support this and require explicit cache flushes to memory before DMA data can be read by the device. Modern CPUs do not have that problem. In fact, one of the main enabling technologies for high speed packet IO is that DMA accesses do not actually go to memory but to the CPU's cache on any recent CPU architecture.

3.3.3 Interrupts in User Space. vfio features full support for interrupts, vfio_setup_interrupt() in libixy-vfio.c enables a specific interrupt for vfio and associates it with an eventfd file descriptor. enable_msix_interrupt() in ixgbe.c configures interrupts for a queue on the device.
Interrupts are mapped to a file descriptor on which the usual syscalls like epoll are available to sleep until an interrupt occurs, see vfio_epoll_wait() in libixy-vfio.c.
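Waiting for such an interrupt then boils down to standard file descriptor polling. The sketch below shows the idea under the assumption that the eventfd has already been registered with vfio; the real vfio_epoll_wait() additionally drains multiple events and handles switching back to polling.

```c
#include <stdint.h>
#include <sys/epoll.h>
#include <unistd.h>

// Register the interrupt eventfd with an epoll instance (done once at setup).
static int setup_interrupt_epoll(int event_fd) {
    int epoll_fd = epoll_create1(0);
    struct epoll_event event = { .events = EPOLLIN, .data.fd = event_fd };
    epoll_ctl(epoll_fd, EPOLL_CTL_ADD, event_fd, &event);
    return epoll_fd;
}

// Sleep until the interrupt fires or the timeout expires.
// Returns 1 if an interrupt was delivered, 0 on timeout.
static int wait_for_interrupt(int epoll_fd, int timeout_ms) {
    struct epoll_event event;
    int n = epoll_wait(epoll_fd, &event, 1, timeout_ms);
    if (n > 0) {
        uint64_t counter; // reading the 8 byte counter re-arms the eventfd
        read(event.data.fd, &counter, sizeof(counter));
    }
    return n;
}
```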

3.4 Memory Management
ixy builds on an API with explicit memory allocation similar to DPDK which is a very different approach from netmap [41] that exposes a replica¹ of the NIC's ring buffer to the application. Memory allocation for packets was cited as one of the main reasons why netmap is faster than traditional in-kernel processing [41]. Hence, netmap lets the application handle memory allocation details. Many forwarding cases can then be implemented by simply swapping pointers in the rings. However, more complex scenarios where packets are not forwarded immediately to a NIC (e.g., because they are passed to a different core in a pipeline setting) do not map well to this API and require adding manual buffer management on top of this API. Further, a ring-based API is very cumbersome to use compared to one with memory allocation.
It is true that memory allocation for packets is a significant overhead in the Linux kernel, we have measured a per-packet overhead of 100 cycles² when forwarding packets with Open vSwitch on Linux for allocating and freeing packet memory (measured with perf). This overhead is almost completely due to (re-)initialization of the kernel sk_buff struct: a large data structure with a lot of metadata fields targeted at a general-purpose network stack. Memory allocation in ixy (with minimum metadata required) only adds an overhead of 30 cycles/packet, a price that we are willing to pay for the gained simplicity in the user-facing API.
ixy's API is the same as DPDK's API when it comes to sending and receiving packets and managing memory. It can best be explained by reading the example applications ixy-fwd.c and ixy-pktgen.c. The transmit-only example ixy-pktgen.c creates a memory pool, a fixed-size collection of fixed-size packet buffers, and pre-fills them with packet data. It then allocates a batch of packets from this pool, adds a sequence number to each packet, and passes them to the transmit function. The transmit function is asynchronous: it enqueues pointers to these packets, the NIC fetches and sends them later. Previously sent packets are freed asynchronously in the transmit function by checking the queue for sent packets and returning them to the pool. This means that a packet buffer cannot be re-used immediately, the ixy-pktgen example looks therefore quite different from a packet generator built on a classic socket API.
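A condensed sketch of such a transmit loop is shown below. The pool and batch functions are assumptions modeled after ixy's API; the real ixy-pktgen.c additionally writes sequence numbers and keeps statistics.

```c
#include <stdbool.h>
#include <stdint.h>

// Assumed API, cf. the sketch in Section 3.1; signatures are illustrative only.
struct mempool;
struct pkt_buf;
struct ixy_device;
struct mempool* memory_allocate_mempool(uint32_t num_entries, uint32_t entry_size);
void pkt_buf_alloc_batch(struct mempool* pool, struct pkt_buf* bufs[], uint32_t num);
struct ixy_device* ixy_init(const char* pci_addr, uint16_t rx, uint16_t tx);
void ixy_tx_batch_busy_wait(struct ixy_device* dev, uint16_t queue,
                            struct pkt_buf* bufs[], uint32_t num);

#define BATCH_SIZE 32

static void generate_packets(const char* pci_addr) {
    // 2048 buffers of 2 kiB each, pre-filled with a template packet.
    struct mempool* pool = memory_allocate_mempool(2048, 2048);
    struct ixy_device* dev = ixy_init(pci_addr, 1, 1);
    while (true) {
        struct pkt_buf* bufs[BATCH_SIZE];
        // Take a batch of free buffers out of the pool ...
        pkt_buf_alloc_batch(pool, bufs, BATCH_SIZE);
        // ... and enqueue them on TX queue 0. The call is asynchronous and
        // returns buffers of previously sent packets back into the pool.
        ixy_tx_batch_busy_wait(dev, 0, bufs, BATCH_SIZE);
    }
}
```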
The forward example ixy-fwd.c can avoid explicit handling of memory pools in the application: the driver allocates a memory pool for each receive ring and automatically allocates packets. Allocation is done by the packet reception function, freeing is either handled in the transmit function as before or by dropping the packet explicitly if the output link is full. Exposing the rings directly similar to netmap could significantly speed up this simple example application at the cost of usability.

3.5 Security Considerations
User space drivers effectively run with root privileges even if they drop privileges after initializing devices: they can use the device's DMA capabilities to access arbitrary memory locations, negating some of the security advantages of running in user space. This can be mitigated by using the IO memory management unit (IOMMU) to isolate the address space accessible to a device at the cost of an additional (hardware-accelerated) page table lookup for each memory access by the device.
IOMMUs are available on CPUs offering hardware virtualization features as they were designed to pass PCIe devices (or parts of them via SR-IOV) directly into VMs in a secure manner. Linux abstracts different IOMMU implementations via the vfio framework which is specifically designed for "safe non-privileged userspace drivers" [29] beside virtual machines. Our vfio backend allows running the driver and application as an unprivileged user. Of the investigated other frameworks only netmap supports this. DPDK also offers a vfio backend and has historically supported running with unprivileged users, but recent versions no longer support this with most drivers. Snabb's vfio backend was removed because of the high maintenance burden and low usage.

4 IXGBE IMPLEMENTATION
All page numbers and section numbers for the Intel datasheet refer to revision 3.3 (March 2016) of the 82599ES datasheet [20]. Function names and line numbers referring to our implementation are hyperlinked to the source code on GitHub.
ixgbe devices expose all configuration, statistics, and debugging registers via the BAR0 MMIO region. The datasheet lists all registers as offsets in this configuration space in Section 9 [20]. We use ixgbe_type.h from Intel's driver as machine-readable version of the datasheet³, it contains defines for all register names and offsets for bit fields.

4.1 NIC Ring API
NICs expose multiple circular buffers called queues or rings to transfer packets. The simplest setup uses only one receive and one transmit queue. Multiple transmit queues are merged on the NIC, incoming traffic is split according to filters or a hashing algorithm if multiple receive queues are configured.

¹ Not the actual ring buffers to prevent user-space applications from crashing the kernel with invalid pointers.
² Forwarding 10 Gbit/s with minimum-sized packets on a single 3.0 GHz CPU core leaves a budget of 200 cycles/packet.
³ This is technically a violation of both our goals about dependencies and lines of code, but we only effectively use less than 100 lines that are just defines and simple structs. There is nothing to be gained from manually copying offsets and names from the datasheet or this file.



Figure 3: DMA descriptors pointing into a memory pool, note that the packets in the memory are unordered as they can be free'd at different times.

Figure 4: Overview of a receive queue. The ring uses physical addresses and is shared with the NIC.

Both receive and transmit rings work in a similar way: the driver programs a physical base address and the size of the ring. It then fills the memory area with DMA descriptors, i.e., pointers to physical addresses where the packet data is stored with some metadata. Sending and receiving packets is done by passing ownership of the DMA descriptors between driver and hardware via a head and a tail pointer. The driver controls the tail, the hardware the head. Both pointers are stored in device registers accessible via MMIO.
The initialization code is in ixgbe.c starting from line 114 for receive queues and from line 173 for transmit queues. Further details are in the datasheet in Section 7.1.9 and in the datasheet sections mentioned in the code.

4.1.1 Receiving Packets. The driver fills up the ring buffer with physical pointers to packet buffers in start_rx_queue() on startup. Each time a packet is received, the corresponding buffer is returned to the application and we allocate a new packet buffer, store its physical address in the DMA descriptor, and reset the ready flag. We also need a way to translate the physical addresses in the DMA descriptor found in the ring back to its virtual counterpart on packet reception. This is done by keeping a second copy of the ring populated with virtual instead of physical addresses, this is then used as a lookup table for the translation.
Figure 3 illustrates the memory layout: the DMA descriptors in the ring to the left contain physical pointers to packet buffers stored in a separate location in a memory pool. The packet buffers in the memory pool contain their physical address in a metadata field. Figure 4 shows the RDH (head) and RDT (tail) registers controlling the ring buffer on the right side, and the local copy containing the virtual addresses to translate the physical addresses in the descriptors in the ring back for the application. ixgbe_rx_batch() in ixgbe.c implements the receive logic as described by Sections 1.8.2 and 7.1 of the datasheet. It operates on batches of packets to increase performance. A naïve way to check if packets have been received is reading the head register from the NIC, incurring a PCIe round trip. The hardware also sets a flag in the descriptor via DMA which is far cheaper to read as the DMA write is handled by the last-level cache on modern CPUs. This is effectively the difference between an LLC cache miss and hit for every received packet.

4.1.2 Transmitting Packets. Transmitting packets follows the same concept and API as receiving them, but the function is more complicated because the interface between NIC and driver is asynchronous. Placing a packet into the ring does not immediately transfer it and blocking to wait for the transfer is infeasible. Hence, the ixgbe_tx_batch() function in ixgbe.c consists of two parts: freeing packets from previous calls that were sent out by the NIC followed by placing the current packets into the ring. The first part is often called cleaning and works similar to receiving packets: the driver checks a flag that is set by the hardware after the packet associated with the descriptor is sent out. Sent packet buffers can then be free'd, making space in the ring. Afterwards, the pointers of the packets to be sent are stored in the DMA descriptors and the tail pointer is updated accordingly.
Checking for transmitted packets can be a bottleneck due to cache thrashing as both the device and driver access the same memory locations [20]. The 82599 hardware implements two methods to combat this. The first is marking transmitted packets in memory, which occurs automatically in configurable batches on the device side; this can also avoid unnecessary PCIe transfers. We tried different configurations (code in init_tx()) and found that the defaults from Intel's driver work best. The NIC can also write its current position in the transmit ring back to memory periodically (called head pointer write back) as explained in Section 7.2.3.5.2 of the datasheet. However, no other driver implements this feature despite the datasheet referring to the normal marking mechanism as "legacy". We implemented support for head pointer write back on a branch [31] but found no measurable performance improvements or effects on cache contention.
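To make the descriptor handling in Sections 4.1.1 and 4.1.2 more tangible, the following is a heavily simplified sketch of a receive batch. The descriptor layout and queue bookkeeping shown here are stand-ins for the real definitions in ixgbe_type.h and ixgbe.c, and buffer re-allocation as well as the tail pointer update are only indicated by comments.

```c
#include <stdint.h>

// Simplified structures for illustration; the real layouts live in
// ixgbe_type.h (descriptors) and ixgbe.c (queue bookkeeping).
struct rx_desc_wb {            // advanced RX descriptor, write-back format
    uint32_t reserved[2];
    uint32_t status_error;     // bit 0: descriptor done (DD), bit 1: end of packet
    uint16_t length;
    uint16_t vlan;
};
#define RX_STAT_DD (1u << 0)

struct pkt_buf { uint32_t size; /* metadata and packet data follow */ };

struct rx_queue {
    volatile struct rx_desc_wb* descriptors; // descriptor ring in DMA memory
    struct pkt_buf** virt_addrs;             // shadow copy with virtual addresses
    uint16_t num_entries;                    // ring size, a power of two
    uint16_t rx_index;                       // next slot to check
};

// Poll up to num_bufs received packets from one ring.
static uint32_t rx_batch_sketch(struct rx_queue* queue,
                                struct pkt_buf* bufs[], uint32_t num_bufs) {
    uint32_t received;
    for (received = 0; received < num_bufs; received++) {
        volatile struct rx_desc_wb* desc = &queue->descriptors[queue->rx_index];
        if (!(desc->status_error & RX_STAT_DD)) {
            break; // the NIC has not written this slot yet, the ring is empty
        }
        struct pkt_buf* buf = queue->virt_addrs[queue->rx_index];
        buf->size = desc->length;
        bufs[received] = buf;
        // Omitted: allocate a fresh buffer from the memory pool, store its
        // physical address in the descriptor, and clear the status field.
        queue->rx_index = (queue->rx_index + 1) & (queue->num_entries - 1);
    }
    // Omitted: advance the RDT tail register once for the whole batch.
    return received;
}
```

The important property is visible nonetheless: the hot loop never reads a device register, it only checks a flag that the NIC wrote into host memory via DMA.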




Figure 5: Bidirectional single-core forwarding performance with varying CPU speed, batch size 32.

Figure 6: Bidirectional single-core forwarding performance with varying batch size.

4.1.3 Batching. Each successful transmit or receive operation involves an update to the NIC's tail pointer register (RDT or TDT for receive/transmit), a slow operation. This is one of the reasons why batching is so important for performance. Both the receive and transmit function are batched in ixy, updating the register only once per batch.

4.1.4 Offloading Features. ixy currently only enables CRC checksum offloading. Unfortunately, packet IO frameworks (e.g., netmap) are often restricted to this bare minimum of offloading features. DPDK is the exception here as it supports almost all offloading features offered by the hardware. However, its receive and transmit functions pay the price for these features in the form of complexity.
We will try to find a balance and showcase selected simple offloading features in ixy in the future. These offloading features can be implemented in the receive and transmit functions, see comments in the code. This is simple for some features like VLAN tag offloading and more involved for more complex features requiring an additional descriptor containing metadata information.

5 PERFORMANCE EVALUATION
We run the ixy-fwd example under a full bidirectional load of 29.76 million packets per second (Mpps), line rate with minimum-sized packets at 2x 10 Gbit/s, and compare it to a custom DPDK forwarder implementing the same features. Both forwarders modify a byte in the packet to ensure that the packet data is fetched into the L1 cache to simulate a somewhat realistic workload.

5.1 Throughput
To quantify the baseline performance and identify bottlenecks, we run the forwarding example while increasing the CPU's clock frequency from 1.2 GHz to 2.4 GHz. Figure 5 compares the throughput of our forwarder on ixy and on DPDK when forwarding across the two ports of a dual-port NIC and when using two separate NICs. The better performance of both ixy and DPDK when using two separate NICs over one dual-port NIC indicates a hardware limit (likely at the PCIe level). We run this test on Intel X520 (82599-based) and Intel X540 NICs with identical results. ixy requires 96 CPU cycles to forward a packet, DPDK only 61. The high performance of DPDK can be attributed to its vector transmit path utilizing SIMD instructions to handle batches even better than ixy. This transmit path of DPDK is only used if no offloading features are enabled at device configuration time, i.e., it offers a similar feature set to ixy. Disabling the vector TX path in the DPDK configuration, or using an older version of DPDK, increases the CPU cycles per packet to 91, still slightly faster than ixy despite doing more (checking for more offloading flags). Overall, we consider ixy fast enough for our purposes. For comparison, older (2015) performance evaluations of DPDK, PF_RING, and netmap report ≈100 cycles/packet for DPDK and PF_RING and ≈120 cycles/packet for netmap [12].
5.2 Batching
Batching is one of the main drivers for performance. DPDK even requires a minimum batch size of 4 when using the SIMD transmit path. Receiving or sending a packet involves an access to the queue index registers, invoking a costly PCIe round-trip. Figure 6 shows how the performance increases as the batch size is increased in the bidirectional forwarding scenario with two NICs. Increasing batch sizes have diminishing returns: this is especially visible when the CPU is only clocked at 1.2 GHz. Reading the performance counters for all caches shows that the number of L1 cache misses per packet increases as the performance gains drop off. Too large batches thrash the L1 cache, possibly evicting lookup data structures in a real application. Therefore, batch sizes should not be chosen too large. Latency is also impacted by the batch size, but the effect is negligible compared to other buffers (e.g., NIC ring size of 512).

Table 1: Forwarding latency by interrupt/poll mode.

Intr./polling          Load      Median    99th perc.  99.99th perc.  Max
Polling                0.1 Mpps  3.8 µs    4.7 µs      5.8 µs         15.8 µs
Intr., no throttling   0.1 Mpps  7.7 µs    8.4 µs      11.0 µs        76.6 µs
Intr., ITR 10 µs       0.1 Mpps  11.3 µs   11.9 µs     15.3 µs        78.4 µs
Intr., ITR 200 µs      0.1 Mpps  107.4 µs  208.1 µs    240.0 µs       360.0 µs
Polling                0.4 Mpps  3.8 µs    4.6 µs      5.8 µs         16.4 µs
Intr., no throttling   0.4 Mpps  7.3 µs    8.2 µs      10.9 µs        53.9 µs
Intr., ITR 10 µs       0.4 Mpps  9.0 µs    14.0 µs     25.8 µs        86.1 µs
Intr., ITR 200 µs      0.4 Mpps  105.9 µs  204.6 µs    236.7 µs       316.2 µs
Polling                0.8 Mpps  3.8 µs    4.4 µs      5.6 µs         16.8 µs
Intr., no throttling   0.8 Mpps  5.6 µs    8.2 µs      10.8 µs        81.1 µs
Intr., ITR 10 µs       0.8 Mpps  9.2 µs    14.1 µs     31.0 µs        70.2 µs
Intr., ITR 200 µs      0.8 Mpps  102.8 µs  198.8 µs    226.1 µs       346.8 µs

Table 2: Processing time in cycles per packet.

App/Function       RX    TX    Forwarding  Memory Mgmt.
ixy-fwd            44.8  14.7  12.3        30.4
ixy-fwd-inline     57.0  28.3  12.5        ?∗
DPDK l2fwd         35.4  20.4  †6.1        ?∗
DPDK v1.6 l2fwd‡   41.7  53.7  †6.0        ?∗
∗ Memory operations inlined, separate profiling not possible.
† DPDK's driver explicitly prefetches packet data on RX, so this is faster despite performing the same action of changing one byte.
‡ Old version 1.6 (2014) of DPDK, far fewer SIMD optimizations, measured on a different system/kernel due to compatibility.

5.3 Interrupts
Interrupts are a common mechanism to reduce power consumption at low loads. However, interrupts are expensive: they require multiple context switches. This makes them unsuitable for high packet rates. NICs commonly feature interrupt throttling (ITR, configured in µs between interrupts on the NIC used here) to prevent overloading the system. Operating systems often disable interrupts periodically and switch to polling during periods of high loads (e.g., Linux NAPI). Our forwarder loses packets in interrupt mode at rates above around 1.5 Mpps even with aggressive throttling configured on the NIC. All other tests except this one are therefore conducted in pure polling mode.
A common misconception is that interrupts reduce latency, but they actually increase latency. The reason is that an interrupt first needs to wake the system from sleep (sleep states down to C6 are enabled on the test system), trigger a context switch into interrupt context, trigger another switch to the driver and then poll the packets from the NIC⁴.
Permanently polling for new packets in a busy-wait loop avoids this at the cost of power consumption.
Table 1 shows latencies at low rates (where interrupts are effective) with and without interrupt throttling (ITR) and polling. Especially tail latencies are affected by using interrupts instead of polling. All timestamps were acquired with a fiber-optic splitter and a dedicated timestamping device taking timestamps of every single packet.
These results show that interrupts with a low throttle rate are feasible at low packet rates. Interrupts are poorly supported in other user space drivers: Snabb offers no interrupts, DPDK has limited support for interrupts (only some drivers) without built-in automatic switching between different modes. Frameworks relying on a kernel driver can use the default driver's interrupt features, especially netmap offers good support for power-saving via interrupts.

5.4 Profiling
We run perf on ixy-fwd running under full bidirectional load at 1.2 GHz with two different NICs using the default batch size of 32 to ensure that the CPU is the only bottleneck. perf allows profiling with the minimum possible effect on the performance: throughput drops by only ≈5% while perf is running. Table 2 shows where CPU time is spent on average per forwarded packet and compares it to DPDK. Receiving is slower because the receive logic performs the initial fetch, the following functions operate on the L1 cache. ixy's receive function still leaves room for improvements, it is less optimized than the transmit function. There are several places in the receive function where DPDK avoids memory accesses by batching compared to ixy. However, these optimizations were not applied for simplicity in ixy: DPDK's receive function is quite complex and full of SIMD intrinsics leading to poor readability. We also compare an old version of DPDK in the table that did not yet contain as many optimizations; ixy outperforms the old DPDK version at low CPU speeds, but the old DPDK version is ≈10% faster than ixy at higher CPU speeds indicating better utilization of the CPU pipeline.
Overhead for memory management is significant (but still low compared to the 100 cycles/packet we measured in the Linux kernel). 59% of the time is spent in non-batched memory operations and none of the calls are inlined. Inlining these functions increases throughput by 6.5% but takes away our ability to account time spent in them. Overall, the overhead of memory management is larger than we initially expected, but we still think explicit memory management for the sake of a usable API is a good trade-off. This is especially true for ixy aiming at simplicity, but also for other frameworks targeting complex applications. Simple forwarding can easily be done on an exposed ring interface, but anything more complex that does not send out packets immediately (e.g., because they are processed further on a different core) requires memory management in the user's application with a similar performance impact.

⁴ The same is true for Linux kernel drivers, the actual packet reception is not done in the hardware interrupt context but in a software interrupt.



Figure 7: Throughput with varying descriptor ring sizes at 2.4 GHz.

Figure 8: Single-core forwarding performance with and without huge pages and their effect on the TLB.

5.5 Queue Sizes
Our driver supports descriptor ring sizes in power-of-two increments between 64 and 4096, the hardware supports more sizes but the restriction to powers of two simplifies wrap-around handling. Linux defaults to a ring size of 256 for this NIC, DPDK's example applications configure different sizes; the l2fwd forwarder sets 128/512 RX/TX descriptors. Larger ring sizes such as 8192 are sometimes recommended to increase performance [1] (the source refers to the size as kB when it is actually the number of packets). Figure 7 shows the throughput of ixy with various ring size combinations. There is no measurable impact on the maximum throughput for ring sizes larger than 64. Scenarios where a larger ring size can still be beneficial might exist: for example, an application producing a large burst of packets significantly faster than the NIC can handle for a very short time.
The second performance factor that is impacted by ring sizes is the overall latency caused by unnecessary buffering. Table 3 shows the latency (measured with MoonGen hardware timestamping [9]) of the ixy forwarder with different ring sizes. The results show a linear dependency between ring size and latency when the system is overloaded, but the effect under lower loads is negligible. Full or near full buffers are no exception on systems forwarding Internet traffic due to protocols like TCP that try to fill up buffers completely [13]. We conclude that tuning tips like setting ring sizes to 8192 [1] are detrimental for latency and likely do not help with throughput. ixy uses a default ring size of 512 at the moment as a trade-off between providing some buffer and avoiding high worst-case latencies.

Table 3: Forwarding latency by ring size and load.

Ring Size  Load      Median    99th perc.  99.9th perc.
64         15 Mpps   5.2 µs    6.4 µs      7.2 µs
512        15 Mpps   5.2 µs    6.5 µs      7.5 µs
4096       15 Mpps   5.4 µs    6.8 µs      8.7 µs
64         ∗29 Mpps  8.3 µs    9.1 µs      10.6 µs
512        ∗29 Mpps  50.9 µs   52.3 µs     54.3 µs
4096       ∗29 Mpps  424.7 µs  433.0 µs    442.1 µs
∗ Device under test overloaded, packets were lost.

5.6 Page Sizes without IOMMU
It is not possible to allocate DMA memory on small pages from user space in Linux in a reliable manner without using the IOMMU as described in Section 3.3.2. Despite this, we have implemented an allocator that performs a brute-force search for physically contiguous normal-sized pages from user space. We run this code on a system without NUMA and with transparent huge pages and page-merging disabled to avoid unexpected page migrations. The code for these benchmarks is hidden on a branch [30] due to its unsafe nature on some systems (we did lose a file system to rogue DMA writes on a misconfigured server). Benchmarks varying the page size are interesting despite these problems: kernel drivers and user space packet IO frameworks using kernel drivers only support normal-sized pages. Existing performance claims about huge pages in drivers are vague and unsubstantiated [21, 42].
Figure 8 shows that the impact on performance of huge pages in the driver is small. The performance difference is 5.5% with the maximum ring size, more realistic ring sizes only differ by 1-3%. This is not entirely unexpected: the largest queue size of 4096 entries is only 16 kiB large, storing pointers to up to 16 MiB of packet buffers.
Huge pages are designed for, and usually used with, large data structures, e.g., big lookup tables for forwarding. The effect measured here is likely larger when a real forwarding application puts additional pressure on the TLB (4096 entries on the CPU used here) due to its other internal data structures. One should still use huge pages for other data structures in a packet processing application, but a driver not supporting them is not as bad as one might expect when reading claims about their importance from authors of drivers supporting them.

Table 4: Unidirectional forwarding on a NUMA system, both CPUs at 1.2 GHz.

Ingress∗  Egress∗  CPU†    Memory‡  Throughput
Node 0    Node 0   Node 0  Node 0   10.8 Mpps
Node 0    Node 0   Node 0  Node 1   10.8 Mpps
Node 0    Node 0   Node 1  Node 0   7.6 Mpps
Node 0    Node 0   Node 1  Node 1   6.6 Mpps
Node 0    Node 1   Node 0  Node 0   7.9 Mpps
Node 0    Node 1   Node 0  Node 1   10.0 Mpps
Node 0    Node 1   Node 1  Node 0   8.6 Mpps
Node 0    Node 1   Node 1  Node 1   8.1 Mpps
∗ NUMA node connected to the NIC
† Forwarding process pinned to this NUMA node
‡ Memory pinned to this NUMA node

Figure 9: IOMMU impact on single-core forwarding at 2.4 GHz.

5.7 Page Sizes and IOMMU Overhead
Memory access overhead changes if the device has to go through the IOMMU for every access. Documentation for Intel's IOMMU is sparse: the TLB size is not documented and there are no dedicated performance counters. Neugebauer et al. experimentally determined a TLB size of 64 entries with their pcie-bench framework [33] (vs. 4096 entries in the normal TLB). They note a performance impact for small DMA transactions with large window sizes: 64 byte read throughput drops by up to 70%, write throughput by up to 55%. 256 byte reads are 30% slower, only 512 byte and larger transactions are unaffected [33]. Their results are consistent across four different Intel microarchitectures including the CPU we are using here. They explicitly disable huge pages for their IOMMU benchmark.
Our benchmark triggers a similar scenario when not used with huge pages: we ask the NIC to transfer a large number of small packets via DMA. Note that packets in a batch are not necessarily contiguous in memory: buffers are not necessarily allocated sequentially and each DMA buffer is 2 kiB large by default, of which only the first n bytes will be transferred. This means only two packets share a 4 kiB page, even if the packets are small. 2 kiB is a common default in other drivers as it allows handling normal sized frames without chaining several buffers (the NIC only supports DMA buffers that are a multiple of 1 kiB). The NIC therefore has to perform several small DMA transactions, i.e., the scenario is equivalent to a large transfer window in pcie-bench.
Figure 9 shows that the IOMMU does not affect the performance if used with 2 MiB pages. However, the default 4 KiB pages (that are safe and easy to use with vfio and the IOMMU) are affected by the small TLB in the IOMMU. The impact of the IOMMU on our real application is slightly smaller than in the synthetic pcie-bench tests: the IOMMU costs 62% performance for the commonly used batch size of 32 with small packets when not using huge pages. Running the test with 128 byte packets causes 33% performance loss, 256 byte packets yield identical performance.
However, enabling huge pages completely mitigates the impact of the small TLB in the IOMMU. Note that huge pages for the IOMMU are only supported since the Intel Ivy Bridge CPU generation.
5.8 NUMA Considerations
Non-uniform memory access (NUMA) architectures found on multi-CPU servers present additional challenges. Modern systems integrate cache, memory controller, and PCIe root complex in the CPU itself instead of using a separate IO hub. This means that a PCIe device is attached to only one CPU in a multi-CPU system, access from or to other CPUs needs to pass over the CPU interconnect (QPI on our system). At the same time, the tight integration of these components allows the PCIe controller to transparently write DMA data into the cache instead of main memory. This works even when DCA (direct cache access) is not used (DCA is only supported by the kernel driver, none of the full user space drivers implement it). Intel DDIO (Data Direct I/O) is another optimization to prevent memory accesses by DMA [18]. However, we found by reading performance counters that even CPUs not supporting DDIO do not perform memory accesses in a typical packet forwarding scenario. DDIO is poorly documented and exposes no performance counters, its exact effect on modern systems is unclear. All recent (since 2012) CPUs supporting multi-CPU systems also support DDIO. Our NUMA benchmarks were obtained on a different system than the previous results because we want to avoid potential problems with NUMA for the other setups.
Our test system has one dual-port NIC attached to NUMA node 0 and a second to NUMA node 1. Both the forwarding process and the memory used for the DMA descriptors and packet buffers can be explicitly pinned to a NUMA node. This gives us 8 possible scenarios for unidirectional packet forwarding by varying the packet path and pinning. Table 4 shows the throughput at 1.2 GHz.

Forwarding from and to a NIC at the same node shows one unexpected result: pinning memory, but not the process itself, to the wrong NUMA node does not reduce performance. The explanation for this is that the DMA transfer is still handled by the correct NUMA node to which the NIC is attached, the CPU then caches this data while informing the other node. However, the CPU at the other node never accesses this data and there is hence no performance penalty. Forwarding between two different nodes is fastest when the memory is pinned to the egress node and the CPU to the ingress node, and slowest when both are pinned to the ingress node. Real forwarding applications often cannot know the destination of packets at the time they are received, the best guess is therefore to pin the thread to the node local to the ingress NIC and distribute packet buffers across the nodes. Latency was also impacted by poor NUMA mapping, we measured an additional 1.7 µs when unnecessarily crossing the NUMA boundary.

Figure 10: Overview of a Virtqueue. The descriptor table contains physical addresses, the queues contain indices into the descriptor table.

6 VIRTIO IMPLEMENTATION
All section numbers for the specification refer to version 1.0 of the VirtIO specification [35]. Function names and line numbers referring to our implementation are hyperlinked to the source code on GitHub.
VirtIO defines different types of operational modes for emulated network cards: legacy, modern, and transitional devices. qemu implements all three modes, the default being transitional devices supporting both the legacy and modern interface after feature negotiation. Supporting devices operating only in modern mode would be the simplest implementation in ixy because they work with MMIO. Both legacy and transitional devices require support for PCI IO port resources making the device access different from the ixgbe driver. Modern-only devices are rare because they are relatively new (2016).
We chose to implement the legacy variant as VirtualBox only supports the legacy operation mode.
VirtualBox is an important target as it is the only hypervisor supporting VirtIO that is available on all common operating systems. Moreover, it is very well integrated with Vagrant [17] allowing us to offer a self-contained setup to run ixy on any platform [37].

6.1 Device Initialization and Virtqueues
virtio_legacy_init() resets and configures a VirtIO device. It negotiates the VirtIO version and features to use. See specification Sections 5.1.3 and 5.1.5 for the available feature flags and initialization steps.
VirtIO supports three different types of queues called Virtqueues: receive, transmit, and command queues. The queue sizes are controlled by the device and are fixed to 256 entries for legacy devices. Setup works the same as in the ixgbe driver: DMA memory for shared structures is allocated and passed to the device via a control register. Contrary to queues in ixgbe, a Virtqueue internally consists of a descriptor table and two rings: the available and used rings. While the table holds the complete descriptors with pointers to the physical addresses and length information of buffers, the rings only contain indices for this table as shown in Figure 10. To supply a device with new buffers, the driver first adds new descriptors into free slots in the descriptor table and then enqueues the slot indices into the available ring by advancing its head. Conversely, a device picks up new descriptor indices from this ring, takes ownership of them and then signals completion by enqueuing the indices into the used ring, where the driver finalizes the operation by clearing the descriptor from the table. The queue indices are maintained in DMA memory instead of in registers like in the ixgbe implementation. Therefore, the device needs to be informed about all modifications to queues, this is done by writing the queue ID into a control register in the IO port memory region. Our driver also implements batching here to avoid unnecessary updates. This process is the same for sending and receiving packets. Our implementations are in virtio_legacy_setup_tx/rx_queue().
The command queue is a transmit queue that is used to control most features of the device instead of via registers. For example, enabling or disabling promiscuous mode in virtio_legacy_set_promiscuous() is done by sending a command packet with the appropriate flags through this queue. See specification Section 5.1.6.5 for details on the command queue. This way of controlling devices is not unique to virtual devices. For example, the Intel XL710 40 Gbit/s NIC configures most features by sending messages to the firmware running on the device [19].

6.2 Packet Handling
Packet transmission in virtio_tx_batch() and reception in virtio_rx_batch() works similar to the ixgbe driver. The big difference to ixgbe is the passing of metadata and offloading information. Virtqueues are not only used for VirtIO network devices, but for other VirtIO devices as well. DMA descriptors do not contain information specific for network devices. Packets going through Virtqueues have this information prepended in an extra header in the DMA buffer. This means that the transmit function needs to prepend an additional header to each packet, and our goal to support device-agnostic applications means that the application cannot know about this requirement when allocating memory. ixy handles this by placing this extra header in front of the packet as VirtIO DMA requires no alignment on cache lines. Our packet buffers already contain metadata before the actual packet to track the physical address and the owning memory pool. Packet data starts at an offset of one cache line (64 byte) in the packet buffer, due to alignment requirements of other NICs. This metadata cache line has enough space to accommodate the additional VirtIO header, we have explicitly marked this available area as head room for drivers requiring this. Our receive function offsets the address in the DMA descriptor by the appropriate amount to receive the extra header in the head room. The user's ixy application treats the metadata header as opaque data.
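For orientation, the split-ring layout and the per-packet header described above look roughly like this in C. The structures follow the vring and virtio-net definitions in the VirtIO specification, but are shortened here and skip the alignment and padding rules the specification additionally imposes.

```c
#include <stdint.h>

// One entry of the descriptor table: a physical buffer address plus length.
struct virtq_desc {
    uint64_t addr;   // physical address of the buffer
    uint32_t len;    // length of the buffer
    uint16_t flags;  // e.g., device-writable (receive buffers), chained descriptor
    uint16_t next;   // index of the next descriptor if chained
};

// Driver -> device: indices of descriptors that are ready to be processed.
struct virtq_avail {
    uint16_t flags;
    uint16_t idx;    // head, advanced by the driver
    uint16_t ring[]; // queue_size entries
};

struct virtq_used_elem {
    uint32_t id;     // index of the completed descriptor chain
    uint32_t len;    // number of bytes written by the device
};

// Device -> driver: descriptors the device is done with.
struct virtq_used {
    uint16_t flags;
    uint16_t idx;    // head, advanced by the device
    struct virtq_used_elem ring[]; // queue_size entries
};

// Header prepended to every packet sent through a network Virtqueue
// (simplified; offloading fields are unused when no features are negotiated).
struct virtio_net_hdr {
    uint8_t  flags;
    uint8_t  gso_type;
    uint16_t hdr_len;
    uint16_t gso_size;
    uint16_t csum_start;
    uint16_t csum_offset;
};
```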
6.3 VirtIO Performance
Performance with VirtIO is dominated by the implementation of the virtual device, i.e., the hypervisor, and not by the driver in the virtual machine. It is also possible to implement the hypervisor part of VirtIO, i.e., the device, in a separate user space application via the vhost-user interface of qemu [45]. Implementations of this exist in both Snabb and DPDK. We only present baseline performance measurements running on kvm with Open vSwitch and in VirtualBox, because we are not interested in getting the fastest possible result, but reproducible results in a realistic environment. Optimizations on the hypervisor are out of scope for this paper.
Running ixy in qemu-kvm 2.7.1 on a Xeon E3-1230 V2 CPU clocked at 3.30 GHz yields a performance of only 0.94 Mpps for the ixy-pktgen application and 0.36 Mpps for ixy-fwd. DPDK is only marginally faster on the same setup: it manages to forward 0.4 Mpps; these slow speeds are not unexpected on unoptimized hypervisors [10]. Performance is limited by packet rate, not data rate: profiling with 1514 byte packets yields nearly identical results with a forwarding rate of 4.8 Gbit/s. VMs often send even larger packets using an offloading feature known as generic segmentation offloading, offered by VirtIO, to achieve higher rates. Profiling on the hypervisor shows that the interconnect is the bottleneck: it fully utilizes one core to forward packets with Open vSwitch 2.6 through the kernel to the second VM. Performance is even worse on VirtualBox 5.2 in our Vagrant setup [37]: it merely achieves 0.05 Mpps on Linux with a 3.3 GHz Xeon E3 CPU and 0.06 Mpps on macOS with a 2.3 GHz Core i7 CPU (606 Mbit/s with 1514 byte packets). DPDK achieves 0.08 Mpps on the macOS setup. Profiling within the VM shows that over 99% of the CPU time is spent on an x86 OUT IO instruction to communicate with the virtual device.

7 CONCLUSIONS
We discussed how to build a user space driver for NICs of the ixgbe family, which are commonly found in servers, and for virtual VirtIO NICs. Our performance evaluation offers some unprecedented looks into the performance of user space drivers. ixy allows us to assess the effects of individual optimizations, like DMA buffers allocated on huge pages, in isolation. Our driver allowed for a simple port to normal-sized pages without an IOMMU; this would be a significant change in other frameworks⁵. Not everyone has access to servers with 10 Gbit/s NICs to play around with driver development. However, everyone can build a VM setup to test ixy with our VirtIO driver. Our Vagrant setup is the simplest way to run ixy in a VM; instructions are in our repository [37].
⁵ DPDK offers --no-huge, but this setting is incompatible with most DMA drivers due to the aforementioned safety issues.

Drivers in High-Level Languages
The implementation presented here is written in C with the hope of being readable by everyone. But there is nothing tying user space drivers to traditional systems programming languages. We also implemented the same driver in Rust, Go, C#, Java, OCaml, Haskell, Swift, JavaScript, and Python to compare these languages for user space drivers [8].

Reproducible Research
Scripts used for the evaluation and our DPDK forwarding application used for comparison are available at [39]. We used commit df1cddbb of ixy for the evaluation of ixgbe and VirtIO, and commit a0f618d on a branch [30] for the normal-sized pages. Most results were obtained on an Intel Xeon E5-2620 v3 2.4 GHz CPU running Debian 9 (kernel 4.9) with a dual-port Intel 82599ES NIC. The NUMA results were obtained on a system with two Intel Xeon E5-2630 v4 2.2 GHz CPUs with Intel X550 NICs. Turbo Boost, Hyper-Threading, and power-saving features were disabled. VirtIO results were obtained on various systems and hypervisors as described in the evaluation section. All loads were generated with MoonGen [9] and its l2-load-latency.lua script.

Acknowledgments
This work was supported by the German-French Academy for the Industry of the Future. We would like to thank Simon Ellmann, Masanori Misono, and Boris-Chengbiao Zhou for valuable contributions to ixy and/or this paper.

REFERENCES
[1] Jamie Bainbridge and Jon Maxwell. 2015. Red Hat Enterprise Linux Network Performance Tuning Guide. Red Hat Documentation (March 2015). https://access.redhat.com/sites/default/files/attachments/20150325_network_performance_tuning.pdf.
[2] Tom Barbette, Cyril Soldani, and Laurent Mathy. 2015. Fast userspace packet processing. In ACM/IEEE ANCS.
[3] Gilberto Bertin. 2015. Single RX queue kernel bypass in Netmap for high packet rate networking. (Oct. 2015). https://blog.cloudflare.com/single-rx-queue-kernel-bypass-with-netmap/.
[4] N. Bonelli, S. Giordano, and G. Procissi. 2016. Network Traffic Processing With PFQ. IEEE Journal on Selected Areas in Communications 34, 6 (June 2016), 1819–1833. https://doi.org/10.1109/JSAC.2016.2558998
[5] DPDK Project. 2019. DPDK: Supported NICs. (2019). http://dpdk.org/doc/nics.
[6] DPDK Project. 2019. DPDK User Guide: Overview of Networking Drivers. (2019). http://dpdk.org/doc/guides/nics/overview.html.
[7] DPDK Project. 2019. DPDK Website. (2019). http://dpdk.org/.
[8] Paul Emmerich, Simon Ellmann, Fabian Bonk, Alex Egger, Esaú García Sánchez-Torija, Thomas Günzel, Sebastian Di Luzio, Alexandru Obada, Maximilian Stadlmeier, Sebastian Voit, and Georg Carle. 2019. The Case for Writing Network Drivers in High-Level Programming Languages. In ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS 2019).
[9] Paul Emmerich, Sebastian Gallenmüller, Daniel Raumer, Florian Wohlfart, and Georg Carle. 2015. MoonGen: A Scriptable High-Speed Packet Generator. In Internet Measurement Conference 2015 (IMC'15). Tokyo, Japan.
[10] Paul Emmerich, Daniel Raumer, Sebastian Gallenmüller, Florian Wohlfart, and Georg Carle. 2017. Throughput and Latency of Virtual Switching with Open vSwitch: A Quantitative Analysis. Journal of Network and Systems Management (July 2017). https://doi.org/10.1007/s10922-017-9417-0
[11] FreeBSD Project. 2017. NETMAP(4). In FreeBSD Kernel Interfaces Manual. FreeBSD 11.1-RELEASE.
[12] Sebastian Gallenmüller, Paul Emmerich, Florian Wohlfart, Daniel Raumer, and Georg Carle. 2015. Comparison of Frameworks for High-Performance Packet IO. In Architectures for Networking and Communications Systems (ANCS). ACM, Oakland, CA, 29–38.
[13] Jim Gettys and Kathleen Nichols. 2011. Bufferbloat: Dark buffers in the internet. Queue 9, 11 (2011), 40.
[14] Gilberto Bertin. 2017. XDP in practice: integrating XDP into our DDoS mitigation pipeline. In Netdev 2.1, The Technical Conference on Linux Networking.
[15] Luke Gorrie et al. 2019. Snabb: Simple and fast packet networking. (2019). https://github.com/snabbco/snabb.
[16] Michael Haardt. 1993. ioperm(2). In Linux Programmer's Manual.
[17] HashiCorp. 2019. Vagrant website. (2019). https://www.vagrantup.com/.
[18] Intel. 2012. Intel Data Direct I/O Technology (Intel DDIO): A Primer. (2012). https://www.intel.com/content/www/us/en/io/data-direct-i-o-technology-brief.html.
[19] Intel. 2014. Intel Ethernet Controller XL710 Datasheet Rev. 2.1.
[20] Intel. 2016. Intel 82599 10 GbE Controller Datasheet Rev. 3.3.
[21] Intel. 2019. DPDK Getting Started Guide for Linux. (2019). http://dpdk.org/doc/guides/linux_gsg/sys_reqs.html.
[22] IO Visor Project. 2019. BPF and XDP Features by Kernel Version. (2019). https://github.com/iovisor/bcc/blob/master/docs/kernel-versions.md#xdp.
[23] IO Visor Project. 2019. Introduction to XDP. (2019). https://www.iovisor.org/technology/xdp.
[24] Jim Thompson. 2017. DPDK, VPP & pfSense 3.0. In DPDK Summit Userspace.
[25] Jonathan Corbet. 2017. User-space networking with Snabb. In LWN.net.
[26] 2004. mlock(2). In Linux Programmer's Manual.
[27] Linux Foundation. 2017. Networking Industry Leaders Join Forces to Expand New Open Source Community to Drive Development of the DPDK Project. (April 2017). Press release.
[28] Linux Kernel Documentation. 2019. Page migration. (2019). https://www.kernel.org/doc/Documentation/vm/page_migration.
[29] Linux Kernel Documentation. 2019. VFIO - Virtual Function I/O. (2019). https://www.kernel.org/doc/Documentation/vfio.txt.
[30] Maximilian Pudelko. 2019. ixy - DMA allocator on normal-sized pages. (2019). https://github.com/pudelkoM/ixy/tree/contiguous-pages.
[31] Maximilian Pudelko. 2019. ixy - head pointer writeback implementation. (2019). https://github.com/pudelkoM/ixy/tree/head-pointer-writeback.
[32] Robert Morris, Eddie Kohler, John Jannotti, and M. Frans Kaashoek. 1999. The Click modular router. In Operating Systems Review - SIGOPS, Vol. 33. 217–231.
[33] Rolf Neugebauer, Gianni Antichi, José Fernando Zazo, Yury Audzevich, Sergio López-Buedo, and Andrew W. Moore. 2018. Understanding PCIe performance for end host networking. In SIGCOMM 2018. ACM, 327–341.
[34] ntop. 2014. PF_RING ZC (Zero Copy). (2014). http://www.ntop.org/products/packet-capture/pf_ring/pf_ring-zc-zero-copy/.
[35] OASIS VIRTIO TC. 2016. Virtual I/O Device (VIRTIO) Version 1.0. (March 2016). http://docs.oasis-open.org/virtio/virtio/v1.0/virtio-v1.0.pdf.
[36] Open vSwitch Project. 2019. Open vSwitch with DPDK. (2019). http://docs.openvswitch.org/en/latest/intro/install/dpdk/.
[37] Paul Emmerich. 2019. ixy Vagrant setup. (2019). https://github.com/emmericp/ixy/tree/master/vagrant.
[38] Paul Emmerich et al. 2019. ixy code. (2019). https://github.com/emmericp/ixy.
[39] Paul Emmerich and Simon Bauer. 2019. Scripts used for the performance evaluation. (2019). https://github.com/emmericp/ixy-perf-measurements.
[40] Ben Pfaff, Justin Pettit, Teemu Koponen, Ethan Jackson, Andy Zhou, Jarno Rajahalme, Jesse Gross, Alex Wang, Joe Stringer, Pravin Shelar, Keith Amidon, and Martin Casado. 2015. The Design and Implementation of Open vSwitch. In 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI 15). USENIX Association, Oakland, CA, 117–130.
[41] Luigi Rizzo. 2012. netmap: A Novel Framework for Fast Packet I/O. In USENIX Annual Technical Conference. 101–112.
[42] Snabb Project. 2018. Tuning the performance of the lwaftr. (2018). https://github.com/snabbco/snabb/blob/master/src/program/lwaftr/doc/performance.md.
[43] Snort Project. 2015. Snort 3 User Manual. (2015). https://www.snort.org/downloads/snortplus/snort_manual.pdf.
[44] Solarflare. 2019. OpenOnload Website. (2019). http://www.openonload.org/.
[45] Virtual Open Systems Sarl. 2014. Vhost-user Protocol. (2014). https://github.com/qemu/qemu/blob/stable-2.10/docs/interop/vhost-user.txt.
[46] Kenichi Yasukata, Michio Honda, Douglas Santry, and Lars Eggert. 2016. StackMap: Low-Latency Networking with the OS Stack and Dedicated NICs. In 2016 USENIX Annual Technical Conference (USENIX ATC 16). USENIX Association, Denver, CO, 43–56.
