Performance Isolation in Multi-Tenant Applications

Total Page:16

File Type:pdf, Size:1020Kb

Performance Isolation in Multi-Tenant Applications Performance Isolation in Multi-Tenant Applications Zur Erlangung des akademischen Grades eines Doktors der Ingenieurwissenschaften von der Fakultät für Informatik des Karlsruher Instituts für Technologie (KIT) genehmigte Dissertation von Rouven Krebs aus Speyer, Deutschland Tag der mündlichen Prüfung: 26.6.2015 Erstgutachter: Prof. Dr. Samuel Kounev and Zweitgutachter: Prof. Dr. Ralf Reussner KIT – Universität des Landes Baden-Württemberg und nationales Forschungszentrum der Helmholtz-Gemeinschaft www.kit.edu DOI: 10.5445/IR/1000049479 Hiermit versichere ich wahrheitsgemäß, die Dissertation selbstständig angefertigt, alle benutzten Hilfsmittel vollständig und genau angegeben und alles kenntlich gemacht zu haben, was aus Ar- beiten Anderer und eigener Veröffentlichungen unverändert oder mit Änderungen entnommen wurde. Karlsruhe, den 28.04.2015 Rouven Krebs Abstract Multi-tenancy is an approach to share one application instance among multiple customers by provid- ing each of them a dedicated view. This approach reduces the costs for service provisioning. Tenants expect to be isolated in terms of the application performance they observe, and the providers’ in- ability to offer performance guarantees is a major obstacle for potential cloud customers. A direct relation between observed performance and customer satisfaction is well documented for interactive web applications. Performance can even become critical to economic success. Providing performance isolation when sharing at the application layer is a challenge. The layers responsible for controlling resource access (e.g., operating system) are normally unaware of entities defined at the application level. Thus, these layers cannot distinguish between different tenants. Furthermore, it is hard to predict how tenant requests propagate through the multiple layers of the execution environment, down to the physical resource layer. The intended decoupling of the application from the resource controlling layers does not allow moving the resource control to the application. Nowadays, no methods exist to efficiently enforce performance isolation between tenants sharing one multi-tenant application. Furthermore, the providers know neither which characteristics of the methods are important nor which method provides the best isolation. Thus, quantifying the degree of performance isolation a system achieves is essential. Additionally, there is no feasible documented approach of how potential performance isolation methods can be implemented in an application. This thesis aims at providing performance isolation mechanisms, a methodology to quantify per- formance isolation, as well as insights and methods that support multi-tenant application providers in realizing performance isolated multi-tenant applications. Performance isolation is enforced by means of request based admission control. The degree of performance isolation is quantified by using a benchmarking approach. Its metric captures the in- fluence of an increasing workload from one tenant upon the observed performance of other tenants. To support providers in applying the correct isolation method, the solutions are evaluated using the proposed benchmarking approach. Beyond this, additional relevant aspects of the methods, e.g., their informational requirements, are identified. To implement the isolation method, a framework is considered that allows to easily add the application-specific aspects of a concrete isolation method. Finally, the results of the thesis are validated in environments motivated by a real world PaaS runtime, using representative benchmarks. In an enterprise service, it was possible to reduce one tenant’s impact upon the response time of another from 141% to 11% by applying methods pre- sented in the thesis. The benchmarking approach was additionally validated in a case study that quantified the Xen hypervisor’s isolation capabilities. v Kurzfassung Cloud Computing gewinnt im Bereich der Geschäftsanwendungen immer mehr an Bedeutung. In infrastrukturbezogenen Angeboten wird die gleiche Hardware zwischen mehreren Kunden geteilt. Hierfür werden Virtualisierungstechniken verwendet, welche jedem Kunden eine virtuelle Maschine zur Verfügung stellen. Dadurch lassen sich Ressourcen teilen und einzelnen Kunden direkt zuwei- sen. Mehrmandantenfähigkeit teilt eine Applikationsinstanz zwischen mehreren Kunden, wobei jeder seine eigene und isolierte Sicht auf die Anwendung erhält. Dies erhöht den Anteil der geteilten Ressourcen, da nicht nur die Hardware, sondern auch darüberliegende Schichten geteilt werden. Somit können Kosten weiter gesenkt werden. Trotz dieses erhöhten Anteils geteilter Ressourcen erwarten die Mandanten, bezüglich der Performance von anderen Mandanten isoliert zu sein. Die Unfähigkeit der Anwendungsanbieter Performancegarantien sicherzustellen, ist ein Hemm- nis für potentielle Kunden. Speziell im Falle interaktiver Web-Anwendungen ist der Zusammenhang von Performance und Benutzerzufriedenheit bekannt, und schlechte Performance kann den ökono- mischen Erfolg einer Anwendung verhindern. Im Fall einer geteilten Infrastruktur sind sowohl die Ressourcen, als auch die Entitäten, die von- einander getrennt werden sollen, bekannt. Performanceisolation bei einer geteilten Anwendungs- schicht zu garantieren, ist im Gegensatz dazu eine Herausforderung mit speziellen Aspekten. Die ressourcenkontrollierenden Ausführungsumgebungen, wie z.B. das Betriebssystem, haben in der Regel keine Kenntnisse über Entitäten, welche auf Ebene der Applikationsinstanz definiert sind. In der Regel ist das Mandantenkonzept jedoch nur der Anwendung oder bestenfalls der Middleware bekannt. Somit können die ressourcenkontrollierenden Schichten nicht zwischen Mandanten un- terscheiden. Desweiteren ist es schwer vorherzusagen, wie eine Anfrage durch die verschiedenen Schichten des Systems bis hin zu den Ressourcen propagiert wird. Anwendungen sind absichtlich von der Ressourcenverwaltung getrennt, um eine effiziente Entwicklung zu ermöglichen. Deswe- gen erscheint es nicht schlüssig, die Ressourcenkontrolle in die Anwendung zu integrieren. Diese Schichtendiskrepanz verhinderte bislang die Performanceisolation von Mandanten, welche sich eine Applikationsinstanz teilen. Einem Anbieter von Mehrmandantenanwendungen fehlen die Methoden, um Performanceisola- tion sicherzustellen. Desweiteren wissen die Anbieter nicht, welche Charakteristiken der Methoden wichtig wären, noch welche Methoden die beste Isolation bieten würden. Deshalb ist es essentiell, im ersten Schritt Performanceisolation quantifizieren zu können. Erschwerend kommt hinzu, dass kein umfassender Ansatz dokumentiert ist, der beschreibt, wie die Methoden zur Sicherstellung der Performanceisolation implementiert werden können. vii Diese Dissertation hat zum Ziel, Methoden zur Performanceisolation zu entwickeln, Ansätze zur Quantifizierung von Performanceisolation zu erstellen, und Wissen zu erarbeiten, das die Anbieter von interaktiven Webanwendungen befähigt, isolierte Anwendungen zu erstellen. Ein Einfluss zwischen zwei Mandanten tritt auf, wenn diese gleichzeitig im Wettbewerb um ei- ne Ressource stehen. In einer interaktiven Anwendung beeinflussen sich Mandanten somit immer dann, wenn ihre Anfragen gleichzeitig vom System bearbeitet werden. Durch Einflussnahme wann und ob eine eingehende Anfrage bearbeitet wird, lässt sich somit kontrollieren, welcher Mandant auf eine Ressource zugreifen kann. In der vorliegenden Arbeit wird Performanceisolation durch ei- ne anfragebasierte Zugangskontrolle gewährleistet. Diese basiert auf zwei Bestandteilen. Zum einen stellt ein arbeitserhaltender Scheduler sicher, dass die Anzahl der zugelassenen Anfragen pro Man- dant einem konfigurierten Verhältnis entsprechen. Zum anderen stellt eine asynchron arbeitende Optimierung sicher, dass die Verhältnisse so angepasst werden, dass möglichst alle Mandanten eine faire Zuteilung der Ressourcen erhalten. Dies beinhaltet unter anderem auch eine Erhöhung des An- teils eines Mandanten, wenn andere Mandanten ihre Performancegarantien dadurch nicht verletzen, um nicht unnötig Ressourcen brachliegen zu lassen. Die Zweiteilung erlaubt es, in kurzfristigen Zeitabständen Isolation sicherzustellen, sowie in lang- fristigen Zyklen komplexere Optimierungen vorzunehmen, welche für jede Anfrage einen zu hohen Mehraufwand bedeuten würden. Zwei konkrete, neuartige Verfahren werden in der Arbeit vorgestellt. Das erste basiert darauf, den Ressourcenbedarf einer Anfrage abzuschätzen. Dieses Wissen wird genutzt, um im Scheduler sicherzustellen, dass ein Mandant nicht mehr als den ihm zugewiesenen Anteil einer Ressource nut- zen darf. Dies erlaubt es, Mandanten effektiv zu isolieren, falls der Ressourcenbedarf pro Anfrage sich zwischen den Mandanten stark unterscheidet. Ein zweites Verfahren beobachtet die Perfor- mancedaten der Anwendung, und leitet darauf basierend ein Blackbox-Modell der Anwendung ab. Dieses wird genutzt, um mit Hilfe einer analytischen Optimierung, eine Gewichtung für ein faires Queuing zu ermitteln. Das letztere Verfahren erlaubt es, Garantien bezogen auf extern sichtbare Performanceeigenschaften zu erhalten. Zur Ermittlung der Isolationsqualität wird ein benchmarkorientierter Ansatz vorgestellt. Dieser basiert auf der Separierung der Mandanten in zwei Gruppen. Die störende Gruppe, welche die ma- ximal erlaubte Last überschreitet, und der rechtschaffenden Gruppe von Mandanten, welche die erlaubte Last nicht überschreiten. Mit dieser Grundlage lassen sich zwei Typen von Metriken erstellen. Zum einen wird in der
Recommended publications
  • Metrics for Performance Advertisement
    Metrics for Performance Advertisement Vojtěch Horký Peter Libič Petr Tůma 2010 – 2021 This work is licensed under a “CC BY-NC-SA 3.0” license. Created to support the Charles University Performance Evaluation lecture. See http://d3s.mff.cuni.cz/teaching/performance-evaluation for details. Contents 1 Overview 1 2 Operation Frequency Related Metrics 2 3 Operation Duration Related Metrics 3 4 Benchmark Workloads 4 1 Overview Performance Advertisement Measuring for the purpose of publishing performance information. Requirements: – Well defined meaning. – Simple to understand. – Difficult to game. Pitfalls: – Publication makes results subject to pressure. – Often too simple to convey meaningful information. – Performance in contemporary computer systems is never simple. Speed Related Metrics Responsiveness Metrics – Time (Task Response Time) – How long does it take to finish the task ? Productivity Metrics – Rate (Task Throughput) – How many tasks can the system complete per time unit ? Utilization Metrics – Resource Use (Utilization) – How much is the system loaded when working on a task ? – Share of time a resource is busy or over given load level. – Helps identify bottlenecks (most utilized resources in the system). 1 Metric for Webmail Performance What metric would you choose to characterize performance of a web mail site ? – User oriented metric would be end-to-end operation time. – Server oriented metric would be request processing time. – How about metrics in between ? – And would you include mail delivery time ? How is throughput related to latency ? How is utilization defined for various resources ? 2 Operation Frequency Related Metrics Clock Rate Clock rate (frequency) of the component (CPU, bus, memory) in MHz. Most often we talk about CPU frequency.
    [Show full text]
  • The Nutanix Design Guide
    The FIRST EDITION Nutanix Design Guide Edited by AngeloLuciani,VCP By Nutanix, RenévandenBedem, incollaborationwith NPX,VCDX4,DECM-EA RoundTowerTechnologies Table of Contents 1 Foreword 3 2 Contributors 5 3 Acknowledgements 7 4 Using This Book 7 5 Introduction To Nutanix 9 6 Why Nutanix? 17 7 The Nutanix Eco-System 37 8 Certification & Training 43 9 Design Methodology 49 & The NPX Program 10 Channel Charter 53 11 Mission-Critical Applications 57 12 SAP On Nutanix 69 13 Hardware Platforms 81 14 Sizer & Collector 93 15 IBM Power Systems 97 16 Remote Office, Branch Office 103 17 Xi Frame & EUC 113 Table of Contents 18 Xi IoT 121 19 Xi Leap, Data Protection, 133 DR & Metro Availability 20 Cloud Management 145 & Automation: Calm, Xi Beam & Xi Epoch 21 Era 161 22 Karbon 165 23 Acropolis Security & Flow 169 24 Files 193 25 Volumes 199 26 Buckets 203 27 Prism 209 28 Life Cycle Manager 213 29 AHV 217 30 Move 225 31 X-Ray 229 32 Foundation 241 33 Data Center Facilities 245 34 People & Process 251 35 Risk Management 255 The Nutanix Design Guide 1 Foreword Iamhonoredtowritethisforewordfor‘TheNutanixDesignGuide’. Wehavealwaysbelievedthatcloudwillbemorethanjustarented modelforenterprises.Computingwithintheenterpriseisnuanced, asittriestobalancethefreedomandfriction-freeaccessofthe publiccloudwiththesecurityandcontroloftheprivatecloud.The privateclouditselfisspreadbetweencoredatacenters,remoteand branchoffices,andedgeoperations.Thetrifectaofthe3laws–(a) LawsoftheLand(dataandapplicationsovereignty),(b)Lawsof Physics(dataandmachinegravity),and(c)LawsofEconomics
    [Show full text]
  • “Freedom” Koan-Sin Tan [email protected] OSDC.Tw, Taipei Apr 11Th, 2014
    Understanding Android Benchmarks “freedom” koan-sin tan [email protected] OSDC.tw, Taipei Apr 11th, 2014 1 disclaimers • many of the materials used in this slide deck are from the Internet and textbooks, e.g., many of the following materials are from “Computer Architecture: A Quantitative Approach,” 1st ~ 5th ed • opinions expressed here are my personal one, don’t reflect my employer’s view 2 who am i • did some networking and security research before • working for a SoC company, recently on • big.LITTLE scheduling and related stuff • parallel construct evaluation • run benchmarking from time to time • for improving performance of our products, and • know what our colleagues' progress 3 • Focusing on CPU and memory parts of benchmarks • let’s ignore graphics (2d, 3d), storage I/O, etc. 4 Blackbox ! • google image search “benchmark”, you can find many of them are Android-related benchmarks • Similar to recently Cross-Strait Trade in Services Agreement (TiSA), most benchmarks on Android platform are kinda blackbox 5 Is Apple A7 good? • When Apple released the new iPhone 5s, you saw many technical blog showed some benchmarks for reviews they came up • commonly used ones: • GeekBench • JavaScript benchmarks • Some graphics benchmarks • Why? Are they right ones? etc. e.g., http://www.anandtech.com/show/7335/the-iphone-5s-review 6 open blackbox 7 Android Benchmarks 8 http:// www.anandtech.com /show/7384/state-of- cheating-in-android- benchmarks No, not improvement in this way 9 Assuming there is not cheating, what we we can do? Outline • Performance benchmark review • Some Android benchmarks • What we did and what still can be done • Future 11 To quote what Prof.
    [Show full text]
  • Red Hat Enterprise Virtualization Performance: Specvirt™ Benchmark
    RED HAT ENTERPRISE VIRTUALIZATION PERFORMANCE: SPECVIRT™ BENCHMARK DATASHEET AT A GLANCE OVERVIEW ® • The performance of Red Hat Red Hat Enterprise Virtualization (RHEV) is the strategic virtualization alternative for organizations Enterprise Virtualization can looking for better total cost of ownership, faster return on investment, accelerated break-even, and be compared to other freedom from vendor lock-in. virtualization platforms RHEV consists of both a hypervisor technology and an enterprise virtualization manager. The RHEV using the SPECvirt_sc2010 hypervisor, based on the Red Hat Enterprise Linux® kernel and the Kernel-based Virtual Machine benchmark. (KVM) hypervisor technology, offers the high performance and scalability that Red Hat is known for. • SPECvirt_sc2010 measures Hypervisor performance is a key factor in deciding which virtualization platform to implement, and hypervisor performance using Red Hat Enterprise Virtualization performance, as measured by the SPECvirt_sc2010® benchmark, realistic workloads. leads the industry both in terms of highest overall performance and highest number of performant As of March 1, 2013, RHEV virtual machines on a single server. leads VMware in performance on comparable WHAT IS SPECVIRT_SC2010®? servers defined by CPU count. SPECvirt_sc2010 is the first vendor-neutral benchmark designed to measure the performance of • The RHEV subscription model datacenter servers that are used for server virtualization. The benchmark was developed by the provides high performance non-profit Standard Performance Evaluation Corporation (SPEC) virtualization subcommittee, at a lower cost-per-SPECvirt_ whose members and contributors include AMD, Dell, Fujitsu, HP, IBM, Intel, Oracle, Red Hat, Unisys sc2010 than VMware. and VMware. SPECvirt_sc2010 uses realistic workloads and SPEC’s proven performance measurement method- ologies to enable vendors, users and researchers to compare system performance across multiple hardware, virtualization platforms, and applications.
    [Show full text]
  • On Benchmarking Intrusion Detection Systems in Virtualized Environments
    Technical Report: SPEC-RG-2013-002 Version: 1.0 On Benchmarking Intrusion Detection Systems in Virtualized Environments SPEC RG IDS Benchmarking Working Group Aleksandar Milenkoski Samuel Kounev Alberto Avritzer Institute for Program Structures and Data Institute for Program Structures and Data Siemens Corporate Research Organization Organization Princeton, NJ USA Karlsruhe Institute of Technology Karlsruhe Institute of Technology [email protected] Karlsruhe, Germany Karlsruhe, Germany [email protected] [email protected] Nuno Antunes Marco Vieira CISUC, Department of Informatics CISUC, Department of Informatics Engineering Engineering University of Coimbra University of Coimbra Coimbra, Portugal Coimbra, Portugal [email protected] [email protected] arXiv:1410.1160v1 [cs.CR] 5 Oct 2014 ® Research ℠ June 26, 2013 research.spec.org www.spec.org Contents 1 Introduction.......................................1 2 Intrusion Detection in Virtualized Environments . .2 2.1 VMM-Based Intrusion Detection Systems . .2 2.2 Intrusion Detection Techniques . .4 Misuse-based Intrusion Detection . .4 Anomaly-based Intrusion Detection . .5 3 Requirements and Challenges for Benchmarking VMM-based IDSes . .7 3.1 Workloads . .7 Benign Workloads . .7 Malicious Workloads . 10 3.2 Metrics ..................................... 14 4 Conclusion . 17 References . 18 i Executive Summary Modern intrusion detection systems (IDSes) for virtualized environments are deployed in the virtualization layer with components inside the virtual machine monitor (VMM) and the trusted host virtual machine (VM). Such IDSes can monitor at the same time the network and host activities of all guest VMs running on top of a VMM being isolated from malicious users of these VMs. We refer to IDSes for virtualized environments as VMM-based IDSes. In this work, we analyze state-of-the-art intrusion detection techniques applied in virtualized environments and architectures of VMM-based IDSes.
    [Show full text]
  • IADIS Conference Template
    BENCHMARKING OF BARE METAL VIRTUALIZATION PLATFORMS ON COMMODITY HARDWARE Duarte Pousa*, José Rufino*† *Polytechnic Institute of Bragança, 5300-253 Bragança, Portugal †Laboratory of Instrumentation and Experimental Particle Physics, University of Minho, 4710-057 Braga, Portugal ([email protected], [email protected]) ABSTRACT In recent years, System Virtualization became a fundamental IT tool, whether it is type-2/hosted virtualization, mostly exploited by end-users in their personal computers, or type-1/bare metal, well established in IT departments and thoroughly used in modern datacenters as the very foundation of cloud computing. Though bare metal virtualization is meant to be deployed on server-grade hardware (for performance, stability and reliability reasons), properly configured desktop-class systems are often used as virtualization “servers”, due to their attractive performance/cost ratio. This paper presents the results of a study conducted on such systems, about the performance of Windows 10 and Ubuntu Server 16.04 guests, when deployed in what we believe are the type-1 platforms most in use today: VMware ESXi, Citrix XenServer, Microsoft Hyper-V, and KVM-based (represented by oVirt and Proxmox). Performance is measured using three synthetic benchmarks: PassMark for Windows, UnixBench for Ubuntu Server, and the cross-platform Flexible I/O Tester. The benchmarks results may be used to choose the most adequate type-1 platform (performance-wise), depending on guest OS, its performance requisites (CPU-bound, IO-bound, or balanced) and its storage type (local/remote) used. KEYWORDS Bare Metal Virtualization, Synthetic Benchmarking, Performance Assessment, Commodity Hardware 1. INTRODUCTION System Virtualization allows a physical host to run (simultaneously) many guest operating systems in self- contained software-defined virtual machines, sharing the host hardware, under control of a hypervisor.
    [Show full text]
  • TPC-V: a Benchmark for Evaluating the Performance of Database Applications in Virtual Environments Priya Sethuraman and H
    TPC-V: A Benchmark for Evaluating the Performance of Database Applications in Virtual Environments Priya Sethuraman and H. Reza Taheri, VMWare, Inc. {psethuraman, rtaheri}@vmware.com TPCTC 2010 Singapore © 2010 VMware Inc. All rights reserved Agenda/Topics Introduction to virtualization Existing benchmarks Genesis of TPC-V But what is TPC-E??? TPC-V design considerations Set architectures, variability, and elasticity Benchmark development status Answers to some common questions 2 TPC TC 2010 What is a Virtual Machine? A (VM) is a software computer that, like a physical computer, runs an operating system and applications. An operating system installed on a virtual machine is called a guest operating system. Virtual machines run on host servers. The same server can run many virtual machines. Every VM runs in an isolated environment. Started out with IBM VM in the 60s Also on Sun Solaris, HP Itanium, IBM Power/AIX, others A new wave started in the late 90s on X86 • Initially, enthusiasts ran Windows and Linux VMs on their PCs Traditional Architecture Virtual Architecture 3 TPC TC 2010 Why virtualize a server ? Server consolidation • The vast majority of server are grossly underutilized • Reduces both CapEx and OpEx Migration of VMs (both storage and CPU/memory) • Enables live load balancing • Facilitates maintenance High availability • Allows a small number of generic servers to back up all servers Fault tolerance • Lock-step execution of two VMs Cloud computing! Utility computing was finally enabled by • Ability to consolidate
    [Show full text]
  • KVM Virtualization Roadmap and Technology Update
    KVM Virtualization Roadmap and Technology Update Karen Noel Bhavna Sarathy Senior Eng Manager Senior Product Manager Red Hat, Inc. Red Hat, Inc. June 13, 2013 Why we believe KVM is the best virtualization platform Performance Lower Cost Cross Platform KVM holds the Top 6/11 customers report up to Support and certification for virtual machine consolidation 70% savings by using leading x86_64 operating systems scores on SPECvirt (1) KVM (2) including RHEL and Microsoft Windows (4) Security Cloud & Virtualization Management Certification (3) EAL4+ Red Hat Open Stack for Cloud plus SE Linux enabling Virtualization and Red Hat Mandatory Access Control Enterprise Virtualization for data- between virtual machines center Virtualization (1) Source: SpecVirt_sc2010 results: http://www.spec.org/virt_sc2010/results/specvirt_sc2010_perf.html (2) Source: Case study on Canary Islands Government migration from VMware to RHEV: http://www.redhat.com/resourcelibrary/case-studies/canary-islands-government-migrates-telecommunications-platform-from- vmware-to-red-hat (3) Source: http://www.redhat.com/solutions/industry/government/certifications.html (4) Source: http://www.redhat.com/resourcelibrary/articles/enterprise-linux-virtualization-support KVM hypervisor in multiple Red Hat products KVM is the foundation Virtualization technology in multiple Red Hat products Red Hat Enterprise Virtualization – Hypervisor Derived from Red Hat Enterprise Linux SMALL FORM FACTOR, SCALABLE, ● RHEV Hypervisor HIGH PERFORMANCE ● Prebuilt binary (ISO) with 300+ packages derived
    [Show full text]
  • Report on Cloud Computing to the OSG Steering Committee
    SPEC OPEN SYSTEMS GROUP CLOUD COMPUTING WORKING GROUP Report on Cloud Computing to the OSG Steering Committee Table of Contents Introduction and Objectives ...................................................... 3 1 Cloud Computing ............................................................ 5 1.1 Definition of the Cloud .......................................................... 5 1.1.1 Characteristics .............................................................................. 5 1.1.2 Service Models ............................................................................. 6 1.1.3 Deployment Models ...................................................................... 6 1.2 Interested parties of a Cloud Benchmark ............................. 6 1.2.1 Participants ................................................................................... 7 1.2.2 Cloud Benchmark Audience Types .............................................. 7 1.3 Cloud Computing Use Cases ............................................... 8 1.3.1 Computing Use Cases .................................................................. 8 1.3.2 Use Case to Customer Map ......................................................... 9 1.3.3 Use Case to Processing Type Map ............................................ 10 1.4 Existing Cloud Frameworks ................................................ 11 2 Benchmark Considerations ........................................... 12 2.1 What is a Cloud SUT? ........................................................ 12 2.2 SUT Components
    [Show full text]
  • Review & Background
    Spring 2018 :: CSE 502 Review & Background Nima Honarmand Spring 2018 :: CSE 502 Measuring & Reporting Performance Spring 2018 :: CSE 502 Performance Metrics • Latency (execution/response time): time to finish one task • Throughput (bandwidth): number of tasks finished per unit of time – Throughput can exploit parallelism, latency can’t – Sometimes complimentary, often contradictory • Example: move people from A to B, 10 miles – Car: capacity = 5, speed = 60 miles/hour – Bus: capacity = 60, speed = 20 miles/hour – Latency: car = 10 min, bus = 30 min – Throughput: car = 15 PPH (w/ return trip), bus = 60 PPH Pick the right metric for your goals Spring 2018 :: CSE 502 Performance Comparison • “Processor A is X times faster than processor B” if – Latency(P, A) = Latency(P, B) / X – Throughput(P, A) = Throughput(P, B) * X • “Processor A is X% faster than processor B” if – Latency(P, A) = Latency(P, B) / (1+X/100) – Throughput(P, A) = Throughput(P, B) * (1+X/100) • Car/bus example – Latency? Car is 3 times (200%) faster than bus – Throughput? Bus is 4 times (300%) faster than car Spring 2018 :: CSE 502 Latency/throughput of What Program? • Very difficult question! • Best case: you always run the same set of programs – Just measure the execution time of those programs – Too idealistic • Use benchmarks – Representative programs chosen to measure performance – (Hopefully) predict performance of actual workload – Prone to Benchmarketing: “The misleading use of unrepresentative benchmark software results in marketing a computer system” -- wikitionary.com Spring 2018 :: CSE 502 Types of Benchmarks • Real programs – Example: CAD, text processing, business apps, scientific apps – Need to know program inputs and options (not just code) – May not know what programs users will run – Require a lot of effort to port • Kernels – Small key pieces (inner loops) of scientific programs where program spends most of its time – Example: Livermore loops, LINPACK • Toy Benchmarks – e.g.
    [Show full text]
  • Autonomic Performance-Aware Resource Management in Dynamic IT Service Infrastructures
    escartes Autonomic Performance-Aware Resource Management in Dynamic IT Service Infrastructures Zur Erlangung des akademischen Grades eines Doktors der Ingenieurwissenschaften/ Doktors der Naturwissenschaften der Fakultät für Informatik des Karlsruher Instituts für Technologie (KIT) genehmigte Dissertation von Nikolaus Matthias Huber aus Zell am Harmersbach Tag der mündlichen Prüfung: 16.07.2014 Erster Gutachter: Prof. Dr. Samuel Kounev Zweiter Gutachter: Prof. Dr. Ralf Reussner KIT – Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft www.kit.edu Ich versichere wahrheitsgem¨aß, die Dissertation bis auf die dort angegebenen Hilfen selb- st¨andig angefertigt, alle benutzten Hilfsmittel vollst¨andig und genau angegeben und alles kenntlich gemacht zu haben, was aus Arbeiten anderer und eigenen Ver¨offentlichungen unver¨andert oder mit Anderungen¨ entnommen wurde. Karlsruhe, 30. September 2014 Nikolaus Huber Contents Abstract xiii Zusammenfassung xv 1. Introduction 1 1.1. Motivation . .1 1.2. Problem Statement . .2 1.3. Shortcomings of Existing Approaches . .4 1.4. Contributions of this Thesis . .6 1.5. Outline . .8 I. Foundations and Related Work 11 2. Autonomic System Adaptation and Performance Modeling 13 2.1. Autonomic and Self-Adaptive Systems . 14 2.1.1. Self-* Properties . 14 2.1.2. Design Principles of Autonomic Systems . 16 2.1.3. Model-Driven Autonomic and Self-Adaptive Systems . 18 2.1.4. The Descartes Research Project and Self-Aware Computing Systems 20 2.2. Software Performance Engineering . 21 2.2.1. Performance Models: Classification and Terminology . 21 2.2.2. Model-Driven Software Performance Engineering . 22 2.2.3. The Palladio Component Model (PCM) . 22 2.2.4.
    [Show full text]
  • Chauffeur: a Framework for Measuring Energy Efficiency of Servers
    Chauffeur: A framework for measuring Energy Efficiency of Servers Masters Project Jeremy Arnold University of Minnesota [email protected] December 16, 2013 Abstract Energy efficiency is an important consideration for servers, bringing about a need for benchmarks to fairly measure energy efficiency. These benchmarks must be relevant, reproducible, fair, verifiable, and easy to use. This paper describes these characteristics in detail and provides an assessment of existing energy efficiency benchmarks based on these cri- teria. Next it describes Chauffeur, a new framework developed to help benchmarks achieve these goals with relatively little effort. SPEC's Server Efficiency Rating Tool (SERT) builds upon the Chauffeur framework to deliver more meaningful energy efficiency data than is provided by the cur- rent generation of benchmarks and workloads. Experimental SERT data is used to highlight some of the features of Chauffeur and characteristics of SERT. 1 Introduction Energy consumption of data centers has become an important issue for many companies, resulting in increased focus on the energy efficiency of servers [1]. 1 There are many challenges in evaluating the energy efficiency of computer servers. Some traditional performance-oriented benchmarks have been updated to include energy efficiency metrics; while these have some value, the results of these measurements are not indicative of energy consumption under real- world conditions. Early benchmarks like SPECpower ssj2008 that were designed specifically for measuring energy efficiency have their own limitations. I have made the following contributions in the context of energy efficiency measurements for servers: 1) a detailed set of criteria that can be used to assess the quality of computer benchmarks, and energy efficiency benchmarks in particular, 2) development of the Chauffeur framework for creating high-quality energy efficiency benchmarks.
    [Show full text]