Performance Evaluation of Container Runtimes

Lennart Espe, Anshul Jindal (https://orcid.org/0000-0002-7773-5342), Vladimir Podolskiy (https://orcid.org/0000-0002-2775-3630) and Michael Gerndt
Chair of Computer Architecture and Parallel Systems, TU Munich, Garching, Germany

Keywords: Container Runtime, OCI Runtime, Performance Evaluation, Benchmarking for Container Runtimes, Containers, Resource Management, Cloud Computing.

Abstract: The co-location of containers on the same host leads to significant performance concerns in multi-tenant environments such as the cloud. These concerns arise from the attempt to maximize host resource utilization by increasing the number of containers running on the same host. The selection of a container runtime becomes critical in the case of strict performance requirements. In the scope of this study, two commonly used container runtimes were evaluated, containerd (the industry standard) and CRI-O (the reference implementation of the CRI), on two different Open Container Initiative (OCI) runtimes: runc (the default for most container runtimes) and gVisor (a highly secure alternative to runc). Evaluation aspects of container runtimes include the performance of running containers, the performance of container runtime operations, and scalability. A tool called TouchStone was developed to address these evaluation aspects. The tool uses the CRI standard and is pluggable into any CRI-compatible container runtime. Performance results demonstrate the better performance of containerd in terms of CPU usage, memory latency and scalability aspects, whereas file system operations (in particular, write operations) are performed more efficiently by CRI-O.

1 INTRODUCTION

Cloud computing as an infrastructure model has become an industry standard. Container technology plays a leading role in this change, allowing host resources to be shared efficiently among multiple tenants. Hence, the demand for the performance evaluation of container workloads has risen. The widely used container orchestrator Kubernetes supports every container runtime that implements the standardized Container Runtime Interface (CRI) (Hong, 2019). Although the cluster administrator can choose any CRI implementation, a guiding evaluation of runtime performance is missing.

Resource management is the most important task in multi-tenant environments such as the cloud. It aims at resource efficiency in dense deployment models like FaaS (Functions-as-a-Service, a service that abstracts away any underlying infrastructure of an application deployment) and other serverless architectures. As new deployment models are introduced to the market, more requirements on specific facets of container runtime performance emerge. For example, Function-as-a-Service infrastructure demands low start-up times of the underlying container runtime. Past studies have shown that the performance overhead of wide-spread container runtimes is low and that they are more resource-efficient than traditional virtual machines (Kozhirbayev and Sinnott, 2017). Combined with the ability to co-locate containers, this resource efficiency makes container runtimes the main driver behind energy and cost reduction in container-based computing environments. Still, though virtual machines are more resource-intensive, they offer a higher degree of security and performance isolation (Bui, 2015).

In this study, two commonly used container runtimes, containerd and CRI-O, are evaluated on two different Open Container Initiative (OCI) runtimes: runc (the default for most container runtimes) and gVisor (a highly secure alternative to runc). The evaluation tool TouchStone was developed to evaluate various performance aspects of resource management in container runtimes in a reliable manner. TouchStone measures how well the runtime's performance scales with an increasing number of containers, how resource overuse is coped with, as well as general runtime performance.

Section 2 provides the basic background knowledge for the paper and details the container runtimes and OCI runtimes studied.


Espe, L., Jindal, A., Podolskiy, V. and Gerndt, M. Performance Evaluation of Container Runtimes. DOI: 10.5220/0009340402730281. In Proceedings of the 10th International Conference on Cloud Computing and Services Science (CLOSER 2020), pages 273-281. ISBN: 978-989-758-424-4. Copyright © 2020 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved.


Section 3 discusses related work. Section 4 examines the architecture and implementation of the TouchStone evaluation tool. Section 5 presents the evaluation results. Section 6 summarizes and discusses the results. Section 7 concludes the work.

2 BACKGROUND

2.1 Evaluated Container Runtimes

A container runtime is a software component that runs containers and manages container images on a deployment node. This subsection explores the two container runtimes under evaluation: containerd and CRI-O.

2.1.1 Containerd

Containerd is the default runtime used by the Docker engine (Crosby, 2017). It is often referred to as an industry standard because of its wide adoption. Underneath, this runtime uses runc, the reference implementation of the OCI runtime specification. containerd stores and manages images and snapshots; it starts and stops containers by delegating execution tasks to the OCI runtime (The containerd authors). To start a new containerized process, containerd has to undertake the following actions: 1) create a new Container from a given OCI image; 2) create a new Task in the Container context; 3) start the Task, at which point runc takes over and starts executing the OCI bundle supplied by containerd (a code sketch of this sequence is given at the end of Section 2.2.1).

Containerd provides a CRI-compatible gRPC endpoint that is enabled by default. When using this endpoint, abstractions specific to containerd are hidden from the client, so that the user can operate on CRI-level abstractions.

2.1.2 CRI-O

The Container Runtime Interface (CRI) is a standardized interface for Kubernetes plugins that execute and watch over containers. It was created as an attempt to stabilize the communication interface between the kubelet and the host container runtime. It is based on gRPC, a cross-language library for remote procedure calls using protocol buffers.

CRI-O is a container runtime built to provide a bridge between OCI runtimes and the high-level Kubernetes CRI. It is based on an older version of the Docker architecture that was built around graph drivers (Walli). It is mainly developed by Red Hat and serves as the default runtime for OpenShift, a popular Kubernetes distribution for enterprises (McCarty). The typical life cycle of the interaction with CRI-O is similar to that of containerd over the CRI endpoint. Major differences to containerd in internal container handling do not exist, since runc is the default OCI runtime when running CRI-O.

2.2 Evaluated OCI Runtimes

The Open Container Initiative is a Linux Foundation project to design open standards for operating-system-level virtualization, most importantly Linux containers. The OCI defines two standards – the image-spec for OCI images (github.com/opencontainers/image-spec) and the runtime-spec for OCI runtimes (github.com/opencontainers/runtime-spec). The typical job sequence is that a container runtime (e.g. containerd) downloads an OCI image, unpacks it and prepares an OCI bundle (a container specification including the root filesystem) on the local disk. After that, an OCI runtime like runc is able to create a running instance from this container specification. OCI images can be created using a number of tools, for example the docker build command. After a successful build, these images are usually pushed and published to a public or private container registry. This subsection explores the two OCI runtimes under evaluation: runc and gVisor (runsc).

2.2.1 Runc

Runc is the lowest runtime layer of containerd that explicitly handles containers. It was used as the reference implementation when drafting the OCI runtime specification (Open Container Initiative, 2019). Internally, runc uses libcontainer to interact with OS-level components.
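To make the Container/Task sequence from Section 2.1.1 and the hand-off to runc concrete, the sketch below drives containerd through its Go client in the style of the getting-started guide cited above (The containerd authors). The socket path, namespace, image reference and container ID are illustrative, and error handling is reduced to log.Fatal.

```go
package main

import (
	"context"
	"log"
	"syscall"

	"github.com/containerd/containerd"
	"github.com/containerd/containerd/cio"
	"github.com/containerd/containerd/namespaces"
	"github.com/containerd/containerd/oci"
)

func main() {
	// Connect to the containerd daemon over its gRPC socket.
	client, err := containerd.New("/run/containerd/containerd.sock")
	if err != nil {
		log.Fatal(err)
	}
	defer client.Close()

	// containerd scopes all resources to a namespace.
	ctx := namespaces.WithNamespace(context.Background(), "touchstone")

	// Pull and unpack an OCI image.
	image, err := client.Pull(ctx, "docker.io/library/alpine:latest", containerd.WithPullUnpack)
	if err != nil {
		log.Fatal(err)
	}

	// 1) Create a new Container from the OCI image.
	container, err := client.NewContainer(ctx, "touchstone-demo",
		containerd.WithNewSnapshot("touchstone-demo-snapshot", image),
		containerd.WithNewSpec(oci.WithImageConfig(image), oci.WithProcessArgs("sleep", "30")),
	)
	if err != nil {
		log.Fatal(err)
	}
	defer container.Delete(ctx, containerd.WithSnapshotCleanup)

	// 2) Create a new Task in the Container context.
	task, err := container.NewTask(ctx, cio.NewCreator(cio.WithStdio))
	if err != nil {
		log.Fatal(err)
	}
	defer task.Delete(ctx)

	exitCh, err := task.Wait(ctx)
	if err != nil {
		log.Fatal(err)
	}

	// 3) Start the Task; at this point the OCI runtime (runc by default)
	//    executes the OCI bundle prepared by containerd.
	if err := task.Start(ctx); err != nil {
		log.Fatal(err)
	}

	// Stop the container again so that the deferred cleanup succeeds.
	if err := task.Kill(ctx, syscall.SIGTERM); err != nil {
		log.Fatal(err)
	}
	<-exitCh
}
```

CRI-O exposes the equivalent lifecycle only through its CRI endpoint, which is also the level at which TouchStone drives both runtimes.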


2.2.2 gVisor (runsc)

Typical OCI runtimes like runc limit system access using different capability models integrated into the Linux kernel, such as AppArmor, SELinux and SECCOMP. If an adversary has access to a limited set of system calls and manages to exploit a vulnerability in the Linux kernel, they may be able to escape the sandbox and gain privileged access. This is a major risk when running untrusted code, as a provider of FaaS infrastructure does. When running a container in a VMM-based runtime like Kata Containers, a hypervisor is started and it virtualizes the underlying system hardware the container is running on (kat, 2019); this approach has a significant performance overhead.

gVisor's approach to container isolation lies between that of a typical container runtime and that of a fully virtualized runtime. The idea of a guest kernel in VMM-based runtimes served as an inspiration for the user-space kernel Sentry (which uses ptrace for syscall interception) and for the container file-system access manager Gofer. Both of them are instantiated once for every OCI container in a pod sandbox. This approach is a middle ground between the default control groups / namespaces approach and the VMM model. To reduce overhead, it does not virtualize system hardware and leaves scheduling and memory management to the host kernel. This split of responsibilities negatively impacts performance for applications loaded with system calls and leads to potential incompatibilities between some workloads and gVisor due to specific Linux kernel behaviour or unimplemented Linux system calls (LLC, 2019).

gVisor is implemented as an OCI-compatible runtime that can be easily plugged into well-known container tools like containerd, Docker and CRI-O. Using a containerd plugin, it is also possible to switch between the default runtime and gVisor using annotations (LLC, 2019).

3 RELATED WORK

The literature on container runtime evaluation is scarce, with most of the related research only partially devoted to the evaluation of container runtime overhead.

Avino et al. performed their research in the context of resource-critical mobile edge computing and discovered that the CPU usage of the Docker process is constant regardless of the CPU cycle consumption of the containerized process (Avino et al., 2017). Casalicchio et al. benchmarked Docker Engine for CPU- and I/O-intensive workloads, with the results reporting a reduction in the overhead of Docker Engine from 10% down to 5% when the amount of CPU cycles requested by the container is over 80% (Casalicchio and Perciballi, 2017). Kozhirbayev et al. compared Docker to the rivaling Flockport containers (based on LXC), showing that I/O and system calls were responsible for most performance overheads in both container implementations, with Docker memory performance being slightly better than that of Flockport (Kozhirbayev and Sinnott, 2017). Morabito et al. also identify disk performance as the major bottleneck both for container-based and hypervisor-based virtualization, although no significant overheads in other performance metrics were detected (Morabito et al., 2015).

The study by Xavier et al. went further by discovering that the near-native performance overhead of containers (LXC, Linux-VServer, and OpenVZ) is acceptable for HPC applications (Xavier et al., 2013). The same argument holds for Docker in data-intensive HPC applications (Chung et al., 2016). Lee et al. evaluated the performance of containers under the hood of production serverless computing environments by AWS, Microsoft, and IBM, concluding that Amazon Lambda performs better in terms of CPU, network bandwidth, and I/O throughput (Lee et al., 2018). K. Kushwaha's study on the performance of VM-based and OS-level container runtimes such as runc and kata-containers, as well as containerd and CRI-O, concluded that containerd performs better in comparison to CRI-O and Docker due to a different file system driver interface design, though CRI-O has very low container start-up latency in comparison to containerd (Kushwaha, 2017).

4 TouchStone

4.1 Overview of the Tool

This section describes the evaluation tool TouchStone (the name refers to the small stones that were historically used to determine the quality of gold), which was implemented to conduct the tests for this study. The tool is published under the MIT license on GitHub. TouchStone is implemented in Go, since the available set of open-source tools that use the CRI is implemented in Go.

The main motivation for the development of the tool is to improve container runtime evaluation by collecting meaningful evaluation data during multiple runs and exporting it. TouchStone allows evaluating CPU, memory and block I/O performance as well as container operations performance and the scalability of performance under varying load. TouchStone runs evaluations in a testing matrix that tests each configuration pair consisting of a container runtime (CRI runtime) and an OCI runtime once. Tests are identified by a hierarchical naming scheme context.component.facet, for instance, performance.cpu.total in a CPU performance benchmark. All of the configuration listed above is done using a YAML file. After all benchmarks have been run, an index of the test results is generated and injected together with the results into an easily readable HTML report.
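The following schematic sketch, which is not the actual TouchStone code, illustrates how such a CRI × OCI testing matrix and the hierarchical benchmark names fit together; all identifiers are invented for illustration, and only the benchmark names are taken from this paper.

```go
// Schematic illustration of the testing matrix described above; this is not
// the actual TouchStone code. Each benchmark carries a hierarchical
// context.component.facet name and is run once per CRI/OCI combination.
package main

import "fmt"

// result pairs a fully qualified benchmark name such as
// "performance.cpu.total" with the value measured for one run.
type result struct {
	Name     string
	CRI, OCI string
	Value    float64
}

// runMatrix invokes the supplied measurement function for every
// CRI runtime x OCI runtime x benchmark combination, runs times each.
func runMatrix(cris, ocis, benchmarks []string, runs int,
	measure func(cri, oci, name string) float64) []result {
	var results []result
	for _, cri := range cris {
		for _, oci := range ocis {
			for _, name := range benchmarks {
				for i := 0; i < runs; i++ {
					results = append(results, result{
						Name: name, CRI: cri, OCI: oci,
						Value: measure(cri, oci, name),
					})
				}
			}
		}
	}
	return results
}

func main() {
	// Placeholder measurement; TouchStone would run the actual benchmark here.
	fake := func(cri, oci, name string) float64 { return 0 }
	res := runMatrix(
		[]string{"containerd", "crio"},
		[]string{"runc", "runsc"},
		[]string{"performance.cpu.total", "operations.container.lifecycle"},
		3, fake)
	fmt.Printf("collected %d results\n", len(res))
}
```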


Figure 1: TouchStone Package Overview.

4.2 Architecture

TouchStone consists of multiple packages, as shown in Figure 1. The following subsections provide an overview of each of them.

4.2.1 pkg/cmd

This package implements the command line interface (CLI). TouchStone provides four commands to the user – version, benchmark, list and index. The implementation of the user interface is based on the Cobra framework used by, e.g., Docker and Kubernetes. The version command returns the version of the tool and of all CRI implementations it is able to talk to. The benchmark command runs the specified benchmarks. The list command lists all benchmarking suites, while the index command generates an index to preview the results produced by the benchmark command.

4.2.2 pkg/config

pkg/config implements the YAML-parsable configuration structure. Each configuration defines an output file to write the benchmark results to, a list of filters specifying which benchmarks to run, basic benchmark parameters such as runs per benchmark and benchmark scale, as well as which container (CRI) and OCI runtime to use. In case the user wants to include a new runtime in the benchmarks, the runtime has to expose its CRI endpoint either directly or via a symbolic link.

4.2.3 pkg/benchmark

This package provides the benchmarking framework and defines the types Matrix, Benchmark and Report as well as Index and some helpers (a sketch of these types is given at the end of this section). A Matrix specifies a list of targets to benchmark and has a slice of Benchmarks that are invoked for each combination of CRI runtime and OCI runtime. For each CRI runtime a new runtime.Client is instantiated. After a Benchmark finishes, a report is generated. All the reports are collected, together with the report aggregate and the benchmark name, in a MatrixResult. When all CRI/OCI combination benchmarks are finished, the array of results is written to the destination file specified in the YAML configuration and the Index of the Matrix is generated. The Index stores benchmark-report combinations and persists across different configuration runs.

4.2.4 pkg/benchmark/suites

Benchmarks implemented in this package can be grouped into four categories:
• Limits. This suite attempts to benchmark the performance of containers with capped resources. The behaviour of gVisor in this benchmark is especially interesting due to its user-space components being included in the control group accounting.
• Operations. This suite tests the performance of common container runtime operations, including the latencies during creation and destruction as well as the container launch time.
• Performance. This suite collects conventional performance metrics of containers running on a specific runtime, with measurements provided by sysbench (github.com/akopytov/sysbench).
• Scalability. This suite quantifies how the performance of a given runtime scales with the number of container instances.

4.2.5 pkg/runtime

This package contains a wrapper for the default CRI gRPC client. It includes helpers to simplify the creation of containers and sandboxes as well as pulling images.
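For reference, the sketch below restores the core pkg/benchmark abstractions of Figure 1 as Go declarations. Field and method names follow the diagram; the Handler type and other signature details are assumptions.

```go
// Core abstractions of pkg/benchmark as shown in Figure 1 (sketch only).
// Exact signatures -- in particular the handler type and any error
// returns -- are assumptions based on the diagram.
package benchmark

// Client stands in for the CRI client wrapper from pkg/runtime, which
// bundles a RuntimeServiceClient and an ImageServiceClient.
type Client interface{}

// Handler is an assumed callback invoked with intermediate reports.
type Handler func(Report)

// Report is the outcome of a single benchmark run; reports can be
// aggregated across runs and scaled.
type Report interface {
	Aggregate(other Report) Report
	Scale(factor float64) Report
}

// Benchmark is a single named test invoked once per CRI/OCI combination.
type Benchmark interface {
	Name() string
	Labels() []string
	Run(client Client, handler Handler) Report
}

// Matrix enumerates the targets to benchmark; the diagram also lists
// Run() []MatrixEntry and Index() Index as its methods.
type Matrix struct {
	CRIs  []string    // CRI runtimes, e.g. containerd and CRI-O endpoints
	OCIs  []string    // OCI runtimes, e.g. runc and runsc
	Items []Benchmark // benchmarks invoked for each combination
	Runs  int         // repetitions per benchmark
}

// MatrixEntry collects the results of one CRI/OCI combination.
type MatrixEntry struct {
	CRI, OCI string
	Results  []MatrixResult
}

// MatrixResult couples a benchmark name with the aggregated report and
// the individual per-run reports.
type MatrixResult struct {
	Name       string
	Aggregated Report
	Reports    []Report
}

// Index persists benchmark-report combinations across configuration runs.
type Index map[string]IndexEntry

// IndexEntry describes an indexed benchmark by its labels.
type IndexEntry struct {
	Labels []string
}
```

pkg/visual then renders the MatrixResult entries and the Index into the HTML report described in Section 4.1.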


4.2.6 pkg/visual

This package uses HTML templating from text/template together with Bootstrap and Chart.js to provide a visualization of the collected results.

5 EXPERIMENTAL RESULTS

5.1 Test Setting

To run all the benchmarks on the same machine, a collection of container tools is used:
• containerd (v1.2.0-621-g04e7747e);
• CRI-O (1.15.1-dev);
• runc (1.0.0-rc8+dev);
• runsc (release-20190529.1-194-gf10862696c85);
• CNI (v0.7.5);
• crictl (1.15.0-2-g4f465cf).

All of these tools were compiled from scratch on the testing machine to mitigate the incompatibility risk. They were run on Linux kernel 4.19.0-5-amd64 under Debian Buster. All benchmarks have been executed 5 to 20 times, depending on the benchmark workload. Due to the type of benchmarks run, and especially their vulnerability to strong outliers, only the median of the generated dataset is shown in the resulting graphs and tables. All tests have been executed in a virtual machine with 2 CPU cores (2.9 GHz Intel Core i5-6267U CPU) and 8 GiB of RAM. These conditions are similar to a typical cloud virtual machine, though server CPUs tend to have more cache, additional instructions and other features.

5.2 General Performance

5.2.1 CPU

The CPU benchmark measures the time needed to compute all prime numbers up to a certain maximum; thus it should not use a lot of system calls. Figure 3 shows that containerd has a lower computational overhead compared to CRI-O, and the same applies to runc in comparison to runsc.

5.2.2 Memory

When looking at the total time spent in the memory benchmark, containerd is the clear winner. A subtle difference between runc and runsc can also be seen, with runc delivering higher performance (see Figure 2a).

However, when looking at the minimum and average latency, it can be observed that runsc introduces an additional overhead by a factor between 1.48x and 1.72x when accessing memory. Additionally, containerd performs on average better when accessing memory, as seen in Figure 2b.

The maximum latency benchmark is in stark contrast to the average latency, as seen in Figure 2c, because containerd/runc has a latency spike in comparison to containerd/runsc, crio/runc and crio/runsc. The benchmark was repeated with a reordered test execution and the same effect was observed. Apparently, this effect is a containerd/runc runtime characteristic rather than an outlier.

5.2.3 Disk I/O

Sysbench's fileio read benchmarks reveal a large difference in performance between runc and runsc, as shown in Figure 4a. gVisor introduces massive delays in workloads loaded with file operations like this synthetic test. The observed performance drop is due to an inefficient implementation of the virtual file system provided by Gofer, which has been improved in a recent gVisor version (github.com/google/gvisor/pull/447). The difference between containerd and crio when using runc is negligible, even though containerd is slightly faster in almost all cases.

When writing to disk, the differences between runc and runsc are far less pronounced than in the read benchmarks, as shown in Figure 4b. Therefore, the underlying hardware and [...] are, up to a certain degree, the cause for the smaller variation in write speed. In contrast to the read benchmarks, CRI-O performs slightly better than containerd in all cases.

5.3 Container Operations

Figure 5 points at a slightly better performance of containerd when creating a container (in comparison to CRI-O). This head start is diminished by the higher run latency, such that in total containerd performs worse than CRI-O. It is also visible that runsc operates almost always faster than runc.

5.4 Scalability of Runtime Performance

Scalability benchmarks test the performance of starting, running and destroying an increasing number of long-running containers.
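The scalability suite exercises exactly these operations through the CRI wrapper in pkg/runtime. A minimal sketch of starting and tearing down a batch of pod sandboxes directly over the CRI gRPC API is shown below; it assumes the k8s.io/cri-api v1alpha2 client and the containerd socket path, omits creating containers inside the sandboxes, and, depending on the grpc-go version, may need a custom dialer for unix sockets.

```go
package main

import (
	"context"
	"fmt"
	"log"

	"google.golang.org/grpc"
	runtimeapi "k8s.io/cri-api/pkg/apis/runtime/v1alpha2"
)

func main() {
	// Connect to the CRI endpoint (containerd shown here; CRI-O listens
	// on /var/run/crio/crio.sock).
	conn, err := grpc.Dial("unix:///run/containerd/containerd.sock", grpc.WithInsecure())
	if err != nil {
		log.Fatal(err)
	}
	defer conn.Close()
	client := runtimeapi.NewRuntimeServiceClient(conn)
	ctx := context.Background()

	const n = 10 // number of long-running sandboxes to start
	var ids []string

	for i := 0; i < n; i++ {
		resp, err := client.RunPodSandbox(ctx, &runtimeapi.RunPodSandboxRequest{
			Config: &runtimeapi.PodSandboxConfig{
				Metadata: &runtimeapi.PodSandboxMetadata{
					Name:      fmt.Sprintf("scale-%d", i),
					Namespace: "touchstone",
					Uid:       fmt.Sprintf("uid-%d", i),
				},
			},
			// RuntimeHandler selects the OCI runtime, e.g. "runsc" for
			// gVisor, if the CRI runtime is configured with such a handler.
			RuntimeHandler: "",
		})
		if err != nil {
			log.Fatal(err)
		}
		ids = append(ids, resp.PodSandboxId)
	}

	// Tear everything down again.
	for _, id := range ids {
		if _, err := client.StopPodSandbox(ctx, &runtimeapi.StopPodSandboxRequest{PodSandboxId: id}); err != nil {
			log.Fatal(err)
		}
		if _, err := client.RemovePodSandbox(ctx, &runtimeapi.RemovePodSandboxRequest{PodSandboxId: id}); err != nil {
			log.Fatal(err)
		}
	}
}
```

Timing loops of this kind is, in essence, what the scalability.runtime benchmark measures for 5, 10 and 50 containers.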


Figure 2: performance.memory Benchmark for Different Aggregations with 10 Runs. (a) Total Time, in seconds. (b) Minimum and Average Latency, in milliseconds. (c) Maximum Latency, in milliseconds.

Table 1: Overview of the Evaluation Results (cd = containerd; lower is better for all metrics).

Metric               cd/runc   cd/runsc   crio/runc   crio/runsc
CPUTime (s)            23.86      26.29       25.35        26.82
HardLimit (s)         340.33          -      354.98       403.40
ScalingLimit (s)       28.38          -       28.22            -
MemTime (s)            10.12      10.60       13.23        14.51
MemMinLatency (ms)      0.10       0.14        0.10         0.15
MemAvgLatency (ms)      0.11       0.19        0.13         0.20
MemMaxLatency (ms)      3.55       1.98        1.14         0.95
FileRndRead (s)         0.03       0.29        0.04         0.31
FileSeqRead (s)         0.36       3.32        0.39         3.26
FileRndWrite (s)        6.39      12.74        6.03        12.03
FileSeqRewrite (s)      2.64       6.74        2.43         6.57
FileSeqWrite (s)        2.71       6.80        2.54         6.54
OpCreate (s)            0.50       0.42        0.51         0.48
OpRun (s)               0.18       0.10        0.03         0.05
OpCreateAndRun (s)      0.68       0.53        0.54         0.54
OpDestroy (s)           0.48       0.55        0.53         0.40
Scalability5 (s)        3.49       2.96        4.77         7.03
Scalability10 (s)       8.35       5.99        9.71        16.81
Scalability50 (s)      37.69      43.36       51.66       145.77

Figure 3: performance.cpu.time Benchmark, 20 Runs, Measurements in Seconds.

With 5 containers, there is not much of a difference to Section 5.3: containerd/runsc performs best overall while crio/runsc falls behind.

The observation made in the 5-container test continues and becomes more pronounced: the time required for starting and stopping 10 containers is in a linear relation to the container count.

The linear trend discontinues with 50 containers, and containerd/runc comes out ahead in this benchmark. crio/runsc demonstrates a significant slowdown at 50 containers, which might be due to its larger runtime overhead.

5.5 Resource Limits and Quotas

This benchmark focuses on testing containers with the CPU Hard and Scaling limits, using the same sysbench CPU test as in Section 5.2.1 but with capped CPU resources. The capping configuration was set to cfs_quota_us = 1000 and cfs_period_us = 10000.

The CPU Hard Limit row of Table 1 shows that the overhead of runsc hits performance much harder in this benchmark than in performance.cpu.time. This behaviour can be explained by the fact that gVisor adds both Sentry and Gofer to the control group. Since these components require CPU time too, and run in user space instead of kernel space, they count towards the control group CPU account.
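This quota corresponds to cfs_quota_us/cfs_period_us = 1000/10000, i.e. 10% of a single CPU. Over the CRI, such a cap is carried in the Linux resource fields of the container configuration; a minimal sketch, assuming the k8s.io/cri-api v1alpha2 types and an illustrative image reference, is shown below.

```go
// Sketch: expressing the CPU cap used in this benchmark via the CRI
// container configuration (k8s.io/cri-api v1alpha2 types; field set abridged).
package limits

import (
	runtimeapi "k8s.io/cri-api/pkg/apis/runtime/v1alpha2"
)

// limitedContainerConfig builds a CRI container config with the cap used
// in this benchmark: cfs_quota_us = 1000, cfs_period_us = 10000.
func limitedContainerConfig() *runtimeapi.ContainerConfig {
	return &runtimeapi.ContainerConfig{
		Metadata: &runtimeapi.ContainerMetadata{Name: "limits-cpu"},
		Image:    &runtimeapi.ImageSpec{Image: "docker.io/library/alpine:latest"},
		Linux: &runtimeapi.LinuxContainerConfig{
			Resources: &runtimeapi.LinuxContainerResources{
				CpuPeriod: 10000, // cfs_period_us in microseconds
				CpuQuota:  1000,  // cfs_quota_us; 1000/10000 = 10% of one CPU
			},
		},
	}
}
```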


Figure 4: performance.disk Benchmark for Read and Write with 10 Runs; all measurements in seconds. (a) Read Throughput (RndRead, SeqRead). (b) Write Throughput (RndWrite, SeqWrite, SeqRewrite).

Figure 5: operations.container.lifecycle Benchmark (Create, Run, CreateAndRun, Destroy), 20 Runs, Measurements in Seconds.

Due to a runsc bug, the containerd/runsc setup ignored the control group limits and hence was excluded from this benchmark.

The scaling benchmark sheds light on how workloads behave when they are scaled from half of the allocatable CPU time up to the full allowance over the course of 10 seconds. Since gVisor does not support updating resource limits during the execution of a container, runsc had to be excluded from these tests. As the Scaling Limit row of Table 1 shows, the performance difference is very small, most likely due to the low logical overhead between the CRI runtime and the OCI runtime.

6 DISCUSSION

Table 1 shows the summary results of all the benchmarks run in the scope of the study; for every metric the lowest value marks the best setup. To summarize, containerd performs better when running a container without limits due to its lower runtime overhead, although CRI-O is faster when running a container that has been created beforehand. The lead of containerd continues in memory performance. Disk performance gives a different picture: although containerd takes the upper hand in read performance, its superiority is broken by CRI-O in file write performance.

When comparing runc and runsc, alias gVisor, runc comes out ahead as the clear leader. This is not surprising, since the Gofer virtual file system is the bottleneck. It should be noted that runsc may perform on the same level as runc when running workloads that do not perform a lot of system calls, e.g. the performance.cpu.time benchmark. In every other benchmark runsc did worse than runc, except operations.container.lifecycle. It should also be noted that runsc performs worse than runc even in compute-bound workloads when running in a rigorously limited environment like limits.cpu.time. The containerd/runc setup performs best for I/O-heavy workloads like databases and web servers, while crio/runc is a solid starting choice in almost any case. The usage of gVisor should be limited to systems where security is vitally important and has to be achieved with a limited loss in raw computing power. Running very small containers with strong resource capping may be significantly slower in gVisor compared to runc.


Figure 6: scalability.runtime Benchmark, 10 Runs for Different Numbers of Containers; all measurements in seconds. (a) 5 Containers. (b) 10 Containers. (c) 50 Containers.

7 CONCLUSION

The developed evaluation tool TouchStone has been applied to wide-spread container and OCI runtimes, namely containerd, CRI-O, runc and gVisor, to determine the performance implications of various evaluated aspects, such as scaling the number of containers started in the runtime. The evaluation results highlighted the crio/runc setup as the best starting option for almost any use-case, whereas containerd/runc turned out to excel at I/O-heavy workloads. Runtime setups including runsc might provide better security, but their use must be carefully considered when performance is important. Prospective directions of future work include the automatic deployment of a Kubernetes cluster with the container runtime picked specifically for given types of workloads, and the support for cross-machine evaluations, i.e. running TouchStone on multiple machines and aggregating the results from such a distributed testing environment.

ACKNOWLEDGEMENTS

This work was supported by the funding of the German Federal Ministry of Education and Research (BMBF) in the scope of the Software Campus program.

REFERENCES

(2019). Kata containers. https://katacontainers.io. Accessed on 19.07.2019 14:05.

Avino, G., Malinverno, M., Malandrino, F., Casetti, C., and Chiasserini, C.-F. (2017). Characterizing docker overhead in mobile edge computing scenarios. In Proceedings of the Workshop on Hot Topics in Container Networking and Networked Systems, pages 30–35. ACM.

Bui, T. (2015). Analysis of docker security. arXiv preprint arXiv:1501.02967.

Casalicchio, E. and Perciballi, V. (2017). Measuring docker performance: What a mess!!! In Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering Companion, pages 11–16. ACM.

Chung, M. T., Quang-Hung, N., Nguyen, M., and Thoai, N. (2016). Using docker in high performance computing applications. In 2016 IEEE Sixth International Conference on Communications and Electronics (ICCE), pages 52–57.

Crosby, M. (2017). What is containerd? https://blog.docker.com/2017/08/what-is-containerd-runtime/. Accessed on 17.06.2019 18:57.

Hong, Y.-J. (2019). Introducing container runtime interface in kubernetes. https://kubernetes.io/blog/2016/12/container-runtime-interface-cri-in-kubernetes/. Accessed on 03.07.2019 17:59.

Kozhirbayev, Z. and Sinnott, R. O. (2017). A performance comparison of container-based technologies for the cloud. Future Generation Computer Systems, 68:175–182.

Kushwaha, K. (2017). How container runtimes matter in kubernetes? https://events.linuxfoundation.org/wp-content/uploads/2017/11/How-Container-Runtime-Matters-in-Kubernetes -OSS-Kunal-Kushwaha.pdf. Accessed on 6.06.2019.

Lee, H., Satyam, K., and Fox, G. (2018). Evaluation of production serverless computing environments. In 2018 IEEE 11th International Conference on Cloud Computing (CLOUD), pages 442–450.

LLC, G. (2019). gVisor architecture guide. https://gvisor.dev/docs/architecture_guide/. Accessed on 10.07.2019 20:07.

McCarty, S. Red hat openshift container platform 4 now defaults to cri-o as underlying container engine. https://www.redhat.com/en/blog/red-hat-openshift-container-platform-4-now-defaults-cri-o-underlying-container-engine. Accessed on 22.07.2019 16:14.

Morabito, R., Kjällman, J., and Komu, M. (2015). Hypervisors vs. lightweight virtualization: A performance comparison. In 2015 IEEE International Conference on Cloud Engineering, pages 386–393.

Open Container Initiative (2019). OCI runtime specification. https://github.com/opencontainers/runtime-spec/tree/74b670efb921f9008dcdfc96145133e5b66cca5c/. Accessed on 30.06.2019 12:54.

The containerd authors. containerd – getting started. https://containerd.io/docs/getting-started/. Accessed on 22.07.2019 19:00.

Walli, S. Demystifying the open container initiative (oci) specifications. https://blog.docker.com/2017/07/demystifying-open-container-initiative-oci-specifications/. Accessed on 30.06.2019 15:50.

Xavier, M. G., Neves, M. V., Rossi, F. D., Ferreto, T. C., Lange, T., and De Rose, C. A. F. (2013). Performance evaluation of container-based virtualization for high performance computing environments. In 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, pages 233–240.
