Distributed Workflow Execution: How Containers Saved the Day the Case of Openminted

Total Page:16

File Type:pdf, Size:1020Kb

Distributed Workflow Execution: How Containers Saved the Day the Case of Openminted Distributed workflow execution: how containers saved the day The case of OpenMinTeD Developers Team @ GRNET S.A. Who are we? GRNET: Greek Research and Technology Network, www.grnet.gr GRNET developers involved: • Stavros Sachtouris, [email protected] • Theodoros Sotiropoulos, [email protected] • Kostas Vogias, [email protected] • Nick Vathis, [email protected] About OpenMinTeD • Open Mining Infrastructure for Text & Data • www.openminted.eu • Funding: European Commission • Partners: many European academic/research institutes and enterprises • Goal: an open service to create, discover, share and re-use knowledge extracted from a wide range of scientific resources. OpenMinTeD GRNET • TDM tools: Text & Data mining • ~okeanos: large IaaS cloud software • thousands of VMs + storage • Corpora: large datasets • Synnefo: IaaS software • Workflows: chains of TDM tools developed by GRNET • Users: international academia and more... ~okeanos: a Synnefo-powered IaaS • okeanos.grnet.gr • Serves the Greek and European research and academic community since 2011 • Over 9000 VMs! • 3 Datacenters all over the country • One of the largest non- commercial clouds in EU ~okeanos: a Synnefo-powered IaaS Features: • Virtual Machines • OS Images (mostly Linux) • IPs (v4 and v6) • Virtual Private Networks • Movable volumes • Object storage • UI, API, CLI, libraries, Resizing, Quotas, Image creator, back-up tool, syncing client and more Does NOT feature: • End user services (e.g. wordpress) • Management panels (e.g. CPanel) • Service-level Monitoring • Service-level Accounting Multi-document text summarization: a text mining example Input: lots of texts, pdfs, etc. on the same subject ==> convert everything to text ==> segment into sentences ==> lemmatize ==> part-of-speech tagging (grammar tags on words) ==> find terms (term grammar, statistical tools) ==> find sentences that are summary worthy ==> remove redundancy Output: summary of multiple documents Text and Data Mining (TDM) Tools Output Input TDM tool (e.g., extracted (corpora) metadata) Can be small or big Metadata Huge bodies of text Diverse runtimes Some standards exist (e.g. annotations, (OS, PL, libs, etc.) frequencies, etc.) Various developers Corpora Cannot be trusted TDM Workflows Should be able to • edit • save • load • re-run Corp. Result Tool A Tool B Result A A B Result Corp. Tool C Result Tool D B C D OpenMinTeD Requirements Corpora Execution Environment Results * * Workflows Key Requirements • Scalable: how many workflows can run simultaneously? • Easy to migrate to other clouds • Replaceable service components • Custom task scheduler • Isolated tool execution Key Requirements Key Decisions • Scalable: how many workflows can • All TDM tools must be run simultaneously? dockerized • Easy to migrate to other clouds • No user management or accounting at cluster level • Replaceable service components • Custom task scheduler • Monitor the TDM tools (rather than the stack) • Isolated tool execution How to "chain" diverse tools • All tools must be docker images • Run tools as docker containers • Containers mount volumes from a shared space $ docker run tdm-tool-A \ -v /shared/input-for-A –v /shared/input-for-B $ docker run tdm-tool-B \ -v /shared/input-for-B –v /shared/input-for-C ... Key Requirements Key Decisions • Scalable: how many workflows can • All TDM tools must be run simultaneously? dockerized • Easy to migrate to other clouds • No user management or accounting at cluster level • Replaceable service components • Custom task scheduler • Monitor the TDM tools (rather than the stack) • Isolated tool execution Registry + UI AAI Corp Workflow Service Accounting REST API WF editor Cluster Tool Scheduler Monitor Manager Registry WF engine Worker Worker Worker Worker scale Shared FS scale Corp Out Workflows Accounting REST API Cluster Tool Scheduler Monitor Manager Registry WF engine Worker Worker Worker Worker scale Shared FS scale Key Requirements Key Decisions • Scalable: how many workflows can • Each component runs on run simultaneously? (probably) its own VM • Easy to migrate to other clouds • Open Source Components • Replaceable service components • Components with REST API • Custom task scheduler • Isolated tool execution Components Workflows Accounting Worker: Mesos slave Galaxy REST API Docker engine CAdvisor Mesos Docker Prometheus Chronos master Registry Grafana Galaxy Worker Worker Worker Worker scale NFS with LVM scale Galaxy Galaxy-chronos runner Pithos+ Object Store Mesos Docker Prometheus Chronos master Registry Grafana Galaxy Worker Worker Worker Worker Galaxy-pithos driver NFS with LVM Galaxy and NFS on the same host Key Requirements Key Decisions • Scalable: how many workflows can • Modular architecture run simultaneously? • Easy to migrate to other clouds • Choose something simple for now, we will refine it later • Replaceable service components • Custom task scheduler • Isolated tool execution Mesos Cluster • Modular • Short-lived tasks • Can use many different • Queue of tasks schedulers • There are issues... simultaneously • Scalability, high availability Mesos Chronos • Supports docker master Mesos Mesos Mesos Mesos slave slave slave slave NFS with LVM Mesos Cluster can expand Need more processing power? --> add Workers Mesos Chronos master Need more storage? --> add volumes Mesos Mesos Mesos Mesos scale slave slave slave slave NFS with LVM scale Why not Docker Swarm? No custom scheduler Scalability High availability Easy to manage Docker Swarm Swarm Swarm Swarm Swarm scale Node Node Node Node NFS with LVM scale Technologies we considered • Mesos/Marathon: good for persistent services, not for one-off tasks • Docker Swarm: meets all requirements, except custom scheduler • Kubernetes: meets all requirements, too complicated / an overkill • Hadoop-YARN/Slider: nice for custom scheduler, optimal for some TDM operations, optimizations not utilized with docker containers, cannot cooperate with many tools • Mesos/Chronos: meets all requirements, Chronos is a pain... • Mesos/Custom scheduler: future work... User's Storage local disk Pithos+ Mesos Docker Prometheus Object Chronos Store master Registry Grafana Galaxy Worker Worker Worker Worker scale NFS with LVM scale Storage Performance Q: Isn't NFS slow? A: Maybe, but it currently performs faster than other parts of the system! Q: What is the bottleneck, then? A: Upload/download from cloud and/or user's disk. Q: How can you optimize? A: Separate upload/download from execution, upload/download directly to shared FS. Any other ideas? Too many workflows to run? • Galaxy enforces resource usage limits • Galaxy and Chronos have job queues • Monitoring and alerts • Scale as soon as you get alerted! AAI Monitoring, Alerts Workflows Accounting Resource consumption Aggregate resource consumption on cluster level Mesos Docker Prometheus Chronos master Registry Grafana Galaxy Email alerts for overused resources Worker Worker Worker: Mesos slave Gather container resource consumption NFS Docker engine CAdvisor on VM level Are these alerts enough? Workflow-related alerts: • Cluster resources used simultaneously (CPU, RAM, Disk space) • Only interested on docker containers consumption System-wide alerts: • NFS is full • Host disks are full • Services break down AAI Accounting User ID Task ID * Workflows Accounting Relate: • Task ID • Task ID * Preserve task ID User ID on container metadata • Resource Consumption * Mesos Docker Prometheus Chronos master Registry Grafana Galaxy * Worker Worker CAdvisor gathers Worker: container resource consumption on VM Mesos slave level NFS Docker engine CAdvisor Scalability Need more containers? --> add Workers Mesos Chronos master Storage bandwidth ??? Mesos Mesos Mesos Mesos scale slave slave slave slave Need storage? scale NFS with LVM --> add volumes Scalability NFS is limited by the network bandwidth. Solution: • Tune NFS caching • Create a data reallocation service (defeats the purpose of mesos) • Make each tool request its data from a service (against design goals) • Use a more scalable object store (e.g. HDFS, Pithos+, S3) Docker images and containers clean up Workflows Cluster Tool Scheduler Monitor Manager Registry WF engine Worker Worker Worker Worker Shared FS Docker images and containers clean up • What if a host is stuffed with docker images and stopped containers? Solutions: • Manual deletion of images on alert... • Clean up everything periodically • Clean up when execution finishes (bad performance) • NFS storage instead of docker registry (does not seem possible) • Sophisticated garbage collection? • Combination of the above? Something else? Docker images and containers clean up • What if a new image has the same tag as the old one? Solutions: • Manual deletion of images on update • Clean up when execution finishes (bad performance) • Delete image from all nodes when updating docker registry Sophisticated Garbage Collection Why is it so hard? Because it has to take into account: • Size of each image • Usage patterns of each image (Least Recently Used) • Needs tight integration with docker registry to monitor image updates (abstract away docker registry with a RESTful service) • Statistics ideally should be cluster-wide and not node specific Setup and operations • Provision VMs with a kamaki (the de facto Synnefo API client) script • Ansible + manual ssh (but one day it's gonna be just ansible...) In the future • An extra Mesos/Marathon to keep components alive • Nagios alerts • Automatic scaling with triggers and bash/kamaki scripting • Support provisioning for all major clouds (OpenStack,
Recommended publications
  • Kubernetes Security Guide Contents
    Kubernetes Security Guide Contents Intro 4 CHAPTER 1 Securing your container images and CI/CD pipeline 6 Image scanning 6 What is image scanning 7 Docker image scanning open source tools 7 Open source Docker scanning tool: Anchore Engine 8 Securing your CI/CD pipeline 9 Image scanning in CI/CD 10 CHAPTER 2 Securing Kubernetes Control Plane 14 Kubelet security 14 Access to the kubelet API 15 Kubelet access to Kubernetes API 16 RBAC example, accessing the kubelet API with curl 16 Kubernetes API audit and security log 17 Audit log policies configuration 19 Extending the Kubernetes API using security admission controllers 20 Securing Kubernetes etcd 23 PKI-based authentication for etcd 23 etcd peer-to-peer TLS 23 Kubernetes API to etcd cluster TLS 24 Using a trusted Docker registry 24 Kubernetes trusted image collections: Banning non trusted registry 26 Kubernetes TLS certificates rotation and expiration 26 Kubernetes kubelet TLS certificate rotation 27 Kubernetes serviceAccount token rotation 28 Kubernetes user TLS certificate rotation 29 Securing Kubernetes hosts 29 Kubernetes 2 Security Guide Using a minimal host OS 30 Update system patches 30 Node recycling 30 Running CIS benchmark security tests 31 CHAPTER 3 Understanding Kubernetes RBAC 32 Kubernetes role-based access control (RBAC) 32 RBAC configuration: API server flags 34 How to create Kubernetes users and serviceAccounts 34 How to create a Kubernetes serviceAccount step by step 35 How to create a Kubernetes user step by step 37 Using an external user directory 40 CHAPTER 4 Security
    [Show full text]
  • Running Legacy VM's Along with Containers in Kubernetes!
    Running Legacy VM’s along with containers in Kubernetes Delusion or Reality? Kunal Kushwaha NTT Open Source Software Center Copyright©2019 NTT Corp. All Rights Reserved. About me • Work @ NTT Open Source Software Center • Collaborator (Core developer) for libpod (podman) • Contributor KubeVirt, buildkit and other related projects • Docker Community Leader @ Tokyo Chapter Copyright©2019 NTT Corp. All Rights Reserved. 2 Growth of Containers in Companies Adoption of containers in production has significantly increased Credits: CNCF website Copyright©2019 NTT Corp. All Rights Reserved. 3 Growth of Container Orchestration usage Adoption of container orchestrator like Kubernetes have also increased significantly on public as well private clouds. Credits: CNCF website Copyright©2019 NTT Corp. All Rights Reserved. 4 Infrastructure landscape app-2 app-2 app-M app-1 app-2 app-N app-1 app-1 app-N VM VM VM kernel VM Platform VM Platform Existing Products New Products • The application infrastructure is fragmented as most of old application still running on traditional infrastructure. • Fragmentation means more work & increase in cost Copyright©2019 NTT Corp. All Rights Reserved. 5 What keeps applications away from Containers • Lack of knowledge / Too complex to migrate in containers. • Dependency on custom kernel parameters. • Application designed for a custom kernel. • Application towards the end of life. Companies prefer to re-write application, rather than directly migrating them to containers. https://dzone.com/guides/containers-orchestration-and-beyond Copyright©2019 NTT Corp. All Rights Reserved. 6 Ideal World app-2 app-2 app-M app-1 app-2 app-N app-1 app-1 app-N VM VM VM kernel VM Platform • Applications in VM and containers can be managed with same control plane • Management/ Governance Policies like RBAC, Network etc.
    [Show full text]
  • Ovirt and Docker Integration
    oVirt and Docker Integration October 2014 Federico Simoncelli Principal Software Engineer – Red Hat oVirt and Docker Integration, Oct 2014 1 Agenda ● Deploying an Application (Old-Fashion and Docker) ● Ecosystem: Kubernetes and Project Atomic ● Current Status of Integration ● oVirt Docker User-Interface Plugin ● “Dockerized” oVirt Engine ● Docker on Virtualization ● Possible Future Integration ● Managing Containers as VMs ● Future Multi-Purpose Data Center oVirt and Docker Integration, Oct 2014 2 Deploying an Application (Old-Fashion) ● Deploying an instance of Etherpad # yum search etherpad Warning: No matches found for: etherpad No matches found $ unzip etherpad-lite-1.4.1.zip $ cd etherpad-lite-1.4.1 $ vim README.md ... ## GNU/Linux and other UNIX-like systems You'll need gzip, git, curl, libssl develop libraries, python and gcc. *For Debian/Ubuntu*: `apt-get install gzip git-core curl python libssl-dev pkg- config build-essential` *For Fedora/CentOS*: `yum install gzip git-core curl python openssl-devel && yum groupinstall "Development Tools"` *For FreeBSD*: `portinstall node, npm, git (optional)` Additionally, you'll need [node.js](http://nodejs.org) installed, Ideally the latest stable version, be careful of installing nodejs from apt. ... oVirt and Docker Integration, Oct 2014 3 Installing Dependencies (Old-Fashion) ● 134 new packages required $ yum install gzip git-core curl python openssl-devel Transaction Summary ================================================================================ Install 2 Packages (+14 Dependent
    [Show full text]
  • Container and Kernel-Based Virtual Machine (KVM) Virtualization for Network Function Virtualization (NFV)
    Container and Kernel-Based Virtual Machine (KVM) Virtualization for Network Function Virtualization (NFV) White Paper August 2015 Order Number: 332860-001US YouLegal Lines andmay Disclaimers not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps. The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Copies of documents which have an order number and are referenced in this document may be obtained by calling 1-800-548-4725 or by visiting: http://www.intel.com/ design/literature.htm. Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn more at http:// www.intel.com/ or from the OEM or retailer. Results have been estimated or simulated using internal Intel analysis or architecture simulation or modeling, and provided to you for informational purposes. Any differences in your system hardware, software or configuration may affect your actual performance. For more complete information about performance and benchmark results, visit www.intel.com/benchmarks. Tests document performance of components on a particular test, in specific systems.
    [Show full text]
  • Kubernetes As an Availability Manager for Microservice Based Applications Leila Abdollahi Vayghan
    Kubernetes as an Availability Manager for Microservice Based Applications Leila Abdollahi Vayghan A Thesis in the Department of Computer Science and Software Engineering Presented in Partial Fulfillment of the Requirements for the Degree of Master of Computer Science at Concordia University Montreal, Quebec, Canada August 2019 © Leila Abdollahi Vayghan, 2019 CONCORDIA UNIVERSITY SCHOOL OF GRADUATE STUDIES This is to certify that the thesis prepared By: Leila Abdollahi Vayghan Entitled: Kubernetes as an Availability Manager for Microservice Based Applications and submitted in partial fulfillment of the requirements for the degree of Master in Computer Science complies with the regulations of the University and meets the accepted standards with respect to originality and quality. Signed by the final examining committee: ________________________________________________ Chair Dr. P. Rigby ________________________________________________ Internal Examiner Dr. D. Goswami ________________________________________________ Internal Examiner Dr. J. Rilling ________________________________________________ Co-Supervisor Dr. F. Khendek ________________________________________________ Co-Supervisor Dr. M. Toeroe Approved by: ___________________________________ Dr. L. Narayanan, Chair Department of Computer Science and Software Engineering _______________ 2019___ __________________________________ Dr. Amir Asif, Dean, Faculty of Engineering and Computer Science ii ABSTRACT Kubernetes as an Availability Manager for Microservice Based Applications Leila
    [Show full text]
  • Immutable Infrastructure, Containers, & the Future of Microservices
    Immutable infrastructure, containers, & the future of microservices Adam Miller Senior Software Engineer, Red Hat 2015-07-25 What we'll cover in this session ● Define “microservices” ● Define “containers” in the context of Linux systems ● Container Implementations in Linux ● What Immutable Infrastructure is – Example of what Immutable Infrastructure deployment workflow looks like ● Red Hat Enterprise Linux Atomic Host – How RHEL Atomic enables and enhances these concepts ● Kubernetes – Orchestrating the Immutable Infrastructure ● OpenShift – Enabling the development and container building pipeline Microservices Microservices are not entirely new. ● The vocabulary term is “new-ish” (2012 – James Lewis and Martin Fowler) ● The idea is very old – Microkernels have existed since the 1980s – Could argue that system admins have been doing this with shell scripts and pipes for years ● Applying this concept to services higher in Monolithic Kernel Microkernel the stack is a newer trend based Operating System based Operating System – Application Heavily influenced by popular technologies System Call such as web microframeworks and containers. user mode VFS IPC, File System Application UNIX Device File IPC Server Driver Server Scheduler, Virtual Memory kernel mode Device Drivers, Dispatcher, ... Basic IPC, Virtual Memory, Scheduling Hardware Hardware What are Microservices? ● Services, “the UNIX Way” – Do one thing, do it well. – Decouple tightly coupled services, make the architecture more modular. ● Loosely coupled services using programming language agnostic APIs for communication – Example: REST APIs The mythical cloud The mythical cloud Micro services Containers What are containers? ● Operating-system-level Virtualization – We (the greater Linux community) like to call them “containers” ● OK, so what is Operating-system-level Virtualization? – The multitenant isolation of multiple user Traditional OS Containers space instances or namespaces.
    [Show full text]
  • Kubernetes As an Availability Manager for Microservice Applications
    Kubernetes as an Availability Manager for Microservice Applications Leila Abdollahi Vayghan Mohamed Aymen Saied Maria Toeroe Ferhat Khendek Engineering and Computer Engineering and Computer Ericsson Inc. Engineering and Computer Science Science Montreal, Canada Science Concordia University Concordia University [email protected] Concordia University Montreal, Canada Montreal, Canada Montreal, Canada [email protected] [email protected] [email protected] Abstract— The move towards the microservice based services that can be deployed and scaled independently by fully architecture is well underway. In this architectural style, small and automated deployment machinery, with minimum centralized loosely coupled modules are developed, deployed, and scaled management [2]. Microservices are built around separate independently to compose cloud-native applications. However, for business functionalities. Each microservice runs in its own carrier-grade service providers to migrate to the microservices process and communicates through lightweight mechanisms, architectural style, availability remains a concern. Kubernetes is often using APIs [3]. Microservices address the drawbacks of an open source platform that defines a set of building blocks which monolithic applications. They are small and can restart faster at collectively provide mechanisms for deploying, maintaining, the time of upgrade or failure recovery. Microservices are scaling, and healing containerized microservices. Thus, loosely coupled, and failure of one microservice will not affect Kubernetes hides the complexity of microservice orchestration while managing their availability. In a preliminary work we other microservices of the system. The fine granularity of this evaluated Kubernetes, using its default configuration, from the architectural style makes the scaling more flexible and more availability perspective in a private cloud settings.
    [Show full text]
  • Docker and Kubernetes: Changing the Opentext Documentum Deployment Model
    White paper Docker and Kubernetes: Changing the OpenText Documentum deployment model Containerization with Docker and Kubernetes’ cloud-first technology is not only a game changer for effectively managing on-premises ™ ™ OpenText Documentum solutions, it also paves the way for deploying EIM solutions in the cloud. Contents New deployment models 3 Customer case study—Part I 3 What is containerization? 4 What are Docker containers? 4 Docker container advantages 5 Available containers 6 What is Kubernetes? 6 Kubernetes advantages 6 Customer case study—Part II 7 EIM in the cloud 7 What is the cloud? 7 Cloud EIM 8 Customer case study—Part III 9 ™ OpenText Managed Services 9 Summary 9 Docker and Kubernetes: Changing the OpenText Documentum deployment model 2/10 New deployment models ™ ™ OpenText Documentum administrators can face two challenges: 1. Effectively managing complex Documentum deployments. Highly customized, mission-critical applications consume disproportionate administrative cycles, budgets and resources to install, upgrade, maintain and enhance. Upgrading these applications requires significant investments in change management. As a result, applications are often not upgraded in a timely fashion and do not leverage the latest technology. 2. Developing a cloud strategy for Enterprise Information Management (EIM) applications. Corporate IT is under intense pressure to produce an enterprise cloud strategy. Leveraging cloud technology for Enterprise Information Management (EIM) applications can be a big win, as long as it does not impact adoption, productivity and governance. Containerization enables new deployment models to help organizations meet these challenges, effectively managing on-premises solutions and paving the way for deploying EIM solutions in the cloud. Customer case study—Part I This real-world customer case study illustrates how containerization can benefit existing Documentum customers.
    [Show full text]
  • LXC, Docker, and the Future of Software Delivery
    LXC, Docker, and the future of software delivery Linuxcon – New Orleans, 2013 Jérôme Petazzoni, dotCloud Inc. Best practices in development and deployment, with Docker and Containers February 2014—Docker 0.8.1 Introduction to Docker - Patel Jitendra Cluster Management with Kubernetes Please open the gears tab below for the speaker notes Satnam Singh [email protected] Work of the Google Kubernetes team and many open source contributors University of Edinburgh, 5 June 2015 Kubernetes An Introduction The promise of cloud computing Cloud software deployment is soul destroying Typically a cloud cluster node is a VM running a specific version of Linux. User applications comprise components each of which may have different and conflicting requirements from libraries, runtimes and kernel features. Applications are coupled to the version of the host operating system: bad. Evolution of the application components is coupled to (and in tension with) the evolution of the host operating system: bad. Also need to deal with node failures, spinning up and turning down replicas to deal with varying load, updating components with disruption … You thought you were a programmer but you are now a sys-admin. Why Linux Containers? What are we trying to solve? The Matrix From Hell Many payloads ● backend services (API) ● databases ● distributed stores ● webapps Many payloads ● Go ● Java ● Node.js ● PHP ● Python ● Ruby ● … Many targets ● your local development environment ● your coworkers' developement environment ● your Q&A team's test environment ● some random
    [Show full text]
  • Full Scalable Media Cloud Solution with Kubernetes Orchestration
    Full Scalable Media Cloud Solution with Kubernetes Orchestration Zhenyu Wang, Xin(Owen)Zhang Agenda • Media in the Network and Cloud • Intel Media Server Reference Software Stack • Container with MSS enablement • Kubernetes with Container integration • Kubernetes with Container enabling on VCA2 • Kubernetes device plugin/Intel GPU plugin • Use Case(1080p): VCA transcoding & k8s scheduling on VCA nodes Media in the Network and Cloud Visual Understanding Video Delivery Graphics in the Cloud Object Recognition & Tracking Cloud and Comms: Remote Desktop Indexing / Search Ingest / Storage / Edge Remote Workstation Transcode / Trans-size / Trans-rate Smart Cities, Security and Cloud Gaming Surveillance Video Conferencing Rendering Intel® Media Server Reference Software Stack Provisioning OpenStack* Kubernetes Cloud Management Guest Video Video Video Streaming Media Applications FFmpeg-qsv Conference Surveillance Index/Search Intel® Media Server Studio Intel® SDK for OpenGL* Software Software Guest Media OpenCL™ Audio Video Intel® HD Graphics Driver for Linux* Codecs Codecs Software Stack Guest i915 Driver Linux 3.x/4.x Kernel * Container Docker Hardware Acceleration Path Guest OS (Linux) Host * * Host i915 Driver Xen KVM VMWare HyperV Host Kernel & Hypervisor Host OS (Linux) Container with MSS enablement • More Containers can be run than VMs • Almost same performance with Native • Package application and dependencies integrated • Share same kernel as the host • No need providing hardware based on the isolation I915 device node Kubernetes with
    [Show full text]
  • Koordinator: a Service Approach for Replicating Docker Containers in Kubernetes
    Koordinator: A Service Approach for Replicating Docker Containers in Kubernetes Hylson Vescovi Netto∗, Aldelir Fernando Luiz∗, Miguel Correiay, Luciana de Oliveira Rechz, Caio Pereira Oliveiraz ∗College of Blumenau, Federal Institute Catarinense - Brazil yINESC-ID, Instituto Superior Tecnico,´ Universidade de Lisboa - Portugal zDepartment of Informatics and Statistics, Federal University of Santa Catarina - Brazil fhylson.vescovi, [email protected], [email protected], [email protected], [email protected] Abstract—Container-based virtualization technologies such as is a backend data store, access concurrency must be managed Docker and Kubernetes are being adopted by cloud service to prevent corruption. providers due to their simpler deployment, better performance, This paper presents Koordinator, a new container replica and lower memory footprint in relation to hypervisor-based virtualization. Kubernetes supports basic replication for availa- coordination approach that provides availability and integrity bility, but does not provide strong consistency and may corrupt with strong consistency in Kubernetes. Koordinator is based application state in case there is a fault. This paper presents a on state machine replication (SMR) [8], an approach that keeps state machine replication scheme for Kubernetes that provides replicas consistent even if some of them fail, providing high high availability and integrity with strong consistency. Replica availability and integrity. Koordinator is provided as a service, coordination is provided as a service, with lightweight coupling to applications. Experimental results show the solution feasibility. i.e., on top of Kubernetes, with lightweight coupling with the application being replicated. I. INTRODUCTION The rest of this paper is organized as follows. Section II briefly introduces container-based virtualization.
    [Show full text]
  • Reliable Storage for HA, DR, Clouds and Containers Philipp Reisner, CEO LINBIT LINBIT - the Company Behind It
    Reliable Storage for HA, DR, Clouds and Containers Philipp Reisner, CEO LINBIT LINBIT - the company behind it COMPANY OVERVIEW TECHNOLOGY OVERVIEW • Developer of DRBD • 100% founder owned • Offices in Europe and US • Team of 30 highly experienced Linux experts • Partner in Japan REFERENCES 25 Linux Storage Gems LVM, RAID, SSD cache tiers, deduplication, targets & initiators Linux's LVM logical volume snapshot logical volume Volume Group physical volume physical volume physical volume 25 Linux's LVM • based on device mapper • original objects • PVs, VGs, LVs, snapshots • LVs can scatter over PVs in multiple segments • thinlv • thinpools = LVs • thin LVs live in thinpools • multiple snapshots became efficient! 25 Linux's LVM thin-LV thin-LV thin-sLV LV snapshot thinpool VG PV PV PV 25 Linux's RAID RAID1 • original MD code • mdadm command A1 A1 • Raid Levels: 0,1,4,5,6,10 A2 A2 • Now available in LVM as well A3 A3 A4 A4 • device mapper interface for MD code • do not call it ‘dmraid’; that is software for hardware fake-raid • lvcreate --type raid6 --size 100G VG_name 25 SSD cache for HDD • dm-cache • device mapper module • accessible via LVM tools • bcache • generic Linux block device • slightly ahead in the performance game 25 Linux’s DeDupe • Virtual Data Optimizer (VDO) since RHEL 7.5 • Red hat acquired Permabit and is GPLing VDO • Linux upstreaming is in preparation • in-line data deduplication • kernel part is a device mapper module • indexing service runs in user-space • async or synchronous writeback • Recommended to be used below LVM 25 Linux’s targets & initiators • Open-ISCSI initiator IO-requests • Ietd, STGT, SCST Initiator Target data/completion • mostly historical • LIO • iSCSI, iSER, SRP, FC, FCoE • SCSI pass through, block IO, file IO, user-specific-IO • NVMe-OF • target & initiator 25 ZFS on Linux • Ubuntu eco-system only • has its own • logic volume manager (zVols) • thin provisioning • RAID (RAIDz) • caching for SSDs (ZIL, SLOG) • and a file system! 25 Put in simplest form DRBD – think of it as ..
    [Show full text]