High Volume Computing: Identifying and Characterizing Throughput

Total Page:16

File Type:pdf, Size:1020Kb

High Volume Computing: Identifying and Characterizing Throughput High Volume Computing: Identifying and Characterizing Throughput Oriented Workloads in Data Centers Jianfeng Zhan, Lixin Zhang, Ninghui Sun, Lei Wang, Zhen Jia, and Chunjie Luo State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Beijing, China Email: {zhanjianfeng, zhanglixin, snh}@ict.ac.cn, {wl, jiazhen, luochunjie}@ncic.ac.cn Abstract—For the first time, this paper systematically also referred to as big data applications. Third, lots identifies three categories of throughput oriented work- of users tend to use streaming media or VoIP for loads in data centers: services, data processing applica- fun or communications. Different from an ordinary tions, and interactive real-time applications, whose targets Web server, a VoIP application will maintain a are to increase the volume of throughput in terms of processed requests or data, or supported maximum number user session of a long period (e.g., more than five of simultaneous subscribers, respectively, and we coin a new minutes) while guaranteeing the real time quality term high volume computing (in short HVC) to describe of service, which we call an interactive real-time those workloads and data center computer systems de- application. signed for them. We characterize and compare HVC with The workloads mentioned above consist of a large other computing paradigms, e.g., high throughput comput- amount of loosely coupled workloads instead of a ing, warehouse-scale computing, and cloud computing, in terms of levels, workloads, metrics, coupling degree, data big tightly coupled job. The nature of this class of scales, and number of jobs or service instances. We also workloads is throughput-oriented, and the target of preliminarily report our ongoing work on the metrics and data center computer systems designed for them is benchmarks for HVC systems, which is the foundation to increase the volume of throughput in terms of of designing innovative data center computer systems for processed requests (for services), or processed data HVC workloads. (for data processing applications), or the maximum Index Terms—High volume computing; Throughput- number of simultaneous subscribers (for interactive oriented workloads; Data center computer systems; Met- rics; Benchmarks; real-time applications), performed or supported in arXiv:1202.6134v2 [cs.DC] 14 Jan 2013 data centers. So as to pay attention to this class of workloads and computer systems designed for I. INTRODUCTION them, in this paper, we coin a new term high volume In the past decade, there are three trends in computing (nature: throughput computing; target: computing domains. First, more and more services, high volume, in short HVC) to describe this class of involving a large amount of data, are deployed in workloads or data center computer systems designed data centers to serve the masses, e.g., Google search for them. engine and Google Map. Second, massive data are In this paper, we identify three categories of produced, stored, and analyzed in real time or off workloads in HVC: services, data processing ap- line. According to the annual survey of the global plications, and interactive real-time applications, digital output by IDC, the total amount of global all of which are throughput-oriented workloads. A data passes 1.2 zettabytes in 2010. In this paper, service is a group of applications that collaborate we call applications that produce, store, and analyze to receive user requests and return responses to massive data data processing applications, which is end users. More and more emerging services are data-intensive, e.g., Google search engine or Google marking data center computer systems in the context Map. Data processing applications produce, store, of our identified three categories of throughput- and analyze massive data, and we only focus on oriented workloads. We present our preliminary loosely coupled data processing applications, ex- work on the metrics and benchmarks for HVC cluding tightly coupled data-intensive computing, systems. e.g., those written in MPI. Typical examples are The remainder of the paper is organized as fol- MapReduce or Dryad based computing. We also lows. Section II characterize and compare different include data stream applications that process con- computing paradigms. Section III revisits previous tinuous unbounded streams of data in real time into benchmarks and report our preliminary work on the the second category of HVC workloads. Different HVC metrics and benchmarks. Section IV draws a from an ordinary Web server, an interactive real- conclusion. time application will maintain a user session of a long period while guaranteeing the real time quality II. CHARACTERIZING COMPUTING PARADIGMS of service. Typical interactive real-time applications In this section, we give out the definition of HVC, include streaming media, desktop clouds [33], and and identify its distinguished differences from other Voice over IP (VoIP) applications. The details of computing paradigms. three categories of workloads can be found at II-A. Despite several computing paradigms are not A. What is HVC? formally or clearly defined, e.g., warehouse- HVC is a data center based computing paradigm scale computing (WSC) [4], data-intensive scalable focusing on throughput-oriented workloads. The computing (DISC) [28], and cloud computing, we target of a data center computer system designed compare HVC with several computing paradigms for HVC workloads is to increase the volume in terms of six dimensions: levels, workloads, met- of throughput in terms of requests, or processed rics, coupling degree, data scales, and number of data, or the maximum number of simultaneous sub- jobs or service instances as shown in Table I. Our scribers, which are performed or supported in a data definition of HVC is towards data center computer center. systems, while many task computing [14] and high In Table I, we characterize HVC from six dimen- throughput computing [26] are defined towards run- sions: levels, workloads, metrics, coupling degree, time systems. In terms of workloads and respective data scales, and number of jobs or service instances. metrics, both high throughput computing and high The HVC system is defined on a data center performance computing are about scientific com- level. We identify three categories of workloads in puting centering around floating point operations, HVC: services, data processing applications, and while most of HVC applications have few floating interactive real-time applications. Services belong point operations as uncovered in our preliminary to the first category of HVC workloads. A service work [3]. Meanwhile, we also notice that many is a group of applications that collaborate to receive emerging workloads can be included into one or user requests and return responses to end users. two categories of HVC workloads, e.g., WSC (into We call a group of applications that independently the first and second categories), and DISC (into the process requests a service instance. For a large second category). As for (public) cloud computing, Internet service, a large amount of service instances we believe it is basically a business model of renting are deployed with requests distribution enabled by computing or storage resources, which heavily relies load balancers. Since each request is independent, a upon virtualization technologies, however HVC is service in itself is loosely coupled. For an ordinary defined in terms of workloads. We think many well- Apache Web server, the data scale is lower, while know workloads in cloud [5] can be included into for a search engine provided by Google, the data HVC, but HPC in cloud workloads [1] [38] are scale is large. More and more emerging services excluded, since they are tightly-coupled. are data-intensive. After widely investigating previous benchmarks, The second category of HVC workloads is data we found there is no systematic work on bench- processing applications. Please note that we only TABLE I: Characterizing different computing paradigms. Computing paradigm level Workloads Metrics Coupling Data ♯ jobs or ser- degree scale vice instances High performance Super computers Scientific comput- Float point opera- Tight n/a Low computing ing: heroic MPI ap- tions per second plications High performance Processors Traditional server Overall work per- loose n/a Low throughput computing workloads formed over a fixed [27] time period High throughput com- Distributed runtime Scientific comput- Float point opera- loose n/a Medium puting [26] systems ing tions per month Many task computing Runtime systems Scientific comput- Tasks per second Tight n/a Large [14] ing or data analy- or sis: workflow jobs loose Data-intensive scalable Runtime systems Data analysis: n/a Loose Large Large computing [28] or data MapReduce-like center computing [29] jobs Warehouse-scale com- Data centers for Inter- Very large Internet n/a Loose large Large puting [4] net services, belonging services to a single organization Cloud computing [34] Hosted data centers SaaS + utility com- n/a Loose n/a Large [15] puting Services Requests per min- Loose Medium Large utes and joule High volume Data centers Data processing ap- Data processed per Loose Large Large computing (HVC) plications minute and joule Interactive real- Maximum number Loose From Large time applications of simultaneous medium subscribers and to large subscribers per watt include loosely coupled data-intensive computing, Typical interactive real-time applications include e.g., MapReduce jobs,
Recommended publications
  • Metrics for Performance Advertisement
    Metrics for Performance Advertisement Vojtěch Horký Peter Libič Petr Tůma 2010 – 2021 This work is licensed under a “CC BY-NC-SA 3.0” license. Created to support the Charles University Performance Evaluation lecture. See http://d3s.mff.cuni.cz/teaching/performance-evaluation for details. Contents 1 Overview 1 2 Operation Frequency Related Metrics 2 3 Operation Duration Related Metrics 3 4 Benchmark Workloads 4 1 Overview Performance Advertisement Measuring for the purpose of publishing performance information. Requirements: – Well defined meaning. – Simple to understand. – Difficult to game. Pitfalls: – Publication makes results subject to pressure. – Often too simple to convey meaningful information. – Performance in contemporary computer systems is never simple. Speed Related Metrics Responsiveness Metrics – Time (Task Response Time) – How long does it take to finish the task ? Productivity Metrics – Rate (Task Throughput) – How many tasks can the system complete per time unit ? Utilization Metrics – Resource Use (Utilization) – How much is the system loaded when working on a task ? – Share of time a resource is busy or over given load level. – Helps identify bottlenecks (most utilized resources in the system). 1 Metric for Webmail Performance What metric would you choose to characterize performance of a web mail site ? – User oriented metric would be end-to-end operation time. – Server oriented metric would be request processing time. – How about metrics in between ? – And would you include mail delivery time ? How is throughput related to latency ? How is utilization defined for various resources ? 2 Operation Frequency Related Metrics Clock Rate Clock rate (frequency) of the component (CPU, bus, memory) in MHz. Most often we talk about CPU frequency.
    [Show full text]
  • The Nutanix Design Guide
    The FIRST EDITION Nutanix Design Guide Edited by AngeloLuciani,VCP By Nutanix, RenévandenBedem, incollaborationwith NPX,VCDX4,DECM-EA RoundTowerTechnologies Table of Contents 1 Foreword 3 2 Contributors 5 3 Acknowledgements 7 4 Using This Book 7 5 Introduction To Nutanix 9 6 Why Nutanix? 17 7 The Nutanix Eco-System 37 8 Certification & Training 43 9 Design Methodology 49 & The NPX Program 10 Channel Charter 53 11 Mission-Critical Applications 57 12 SAP On Nutanix 69 13 Hardware Platforms 81 14 Sizer & Collector 93 15 IBM Power Systems 97 16 Remote Office, Branch Office 103 17 Xi Frame & EUC 113 Table of Contents 18 Xi IoT 121 19 Xi Leap, Data Protection, 133 DR & Metro Availability 20 Cloud Management 145 & Automation: Calm, Xi Beam & Xi Epoch 21 Era 161 22 Karbon 165 23 Acropolis Security & Flow 169 24 Files 193 25 Volumes 199 26 Buckets 203 27 Prism 209 28 Life Cycle Manager 213 29 AHV 217 30 Move 225 31 X-Ray 229 32 Foundation 241 33 Data Center Facilities 245 34 People & Process 251 35 Risk Management 255 The Nutanix Design Guide 1 Foreword Iamhonoredtowritethisforewordfor‘TheNutanixDesignGuide’. Wehavealwaysbelievedthatcloudwillbemorethanjustarented modelforenterprises.Computingwithintheenterpriseisnuanced, asittriestobalancethefreedomandfriction-freeaccessofthe publiccloudwiththesecurityandcontroloftheprivatecloud.The privateclouditselfisspreadbetweencoredatacenters,remoteand branchoffices,andedgeoperations.Thetrifectaofthe3laws–(a) LawsoftheLand(dataandapplicationsovereignty),(b)Lawsof Physics(dataandmachinegravity),and(c)LawsofEconomics
    [Show full text]
  • “Freedom” Koan-Sin Tan [email protected] OSDC.Tw, Taipei Apr 11Th, 2014
    Understanding Android Benchmarks “freedom” koan-sin tan [email protected] OSDC.tw, Taipei Apr 11th, 2014 1 disclaimers • many of the materials used in this slide deck are from the Internet and textbooks, e.g., many of the following materials are from “Computer Architecture: A Quantitative Approach,” 1st ~ 5th ed • opinions expressed here are my personal one, don’t reflect my employer’s view 2 who am i • did some networking and security research before • working for a SoC company, recently on • big.LITTLE scheduling and related stuff • parallel construct evaluation • run benchmarking from time to time • for improving performance of our products, and • know what our colleagues' progress 3 • Focusing on CPU and memory parts of benchmarks • let’s ignore graphics (2d, 3d), storage I/O, etc. 4 Blackbox ! • google image search “benchmark”, you can find many of them are Android-related benchmarks • Similar to recently Cross-Strait Trade in Services Agreement (TiSA), most benchmarks on Android platform are kinda blackbox 5 Is Apple A7 good? • When Apple released the new iPhone 5s, you saw many technical blog showed some benchmarks for reviews they came up • commonly used ones: • GeekBench • JavaScript benchmarks • Some graphics benchmarks • Why? Are they right ones? etc. e.g., http://www.anandtech.com/show/7335/the-iphone-5s-review 6 open blackbox 7 Android Benchmarks 8 http:// www.anandtech.com /show/7384/state-of- cheating-in-android- benchmarks No, not improvement in this way 9 Assuming there is not cheating, what we we can do? Outline • Performance benchmark review • Some Android benchmarks • What we did and what still can be done • Future 11 To quote what Prof.
    [Show full text]
  • Red Hat Enterprise Virtualization Performance: Specvirt™ Benchmark
    RED HAT ENTERPRISE VIRTUALIZATION PERFORMANCE: SPECVIRT™ BENCHMARK DATASHEET AT A GLANCE OVERVIEW ® • The performance of Red Hat Red Hat Enterprise Virtualization (RHEV) is the strategic virtualization alternative for organizations Enterprise Virtualization can looking for better total cost of ownership, faster return on investment, accelerated break-even, and be compared to other freedom from vendor lock-in. virtualization platforms RHEV consists of both a hypervisor technology and an enterprise virtualization manager. The RHEV using the SPECvirt_sc2010 hypervisor, based on the Red Hat Enterprise Linux® kernel and the Kernel-based Virtual Machine benchmark. (KVM) hypervisor technology, offers the high performance and scalability that Red Hat is known for. • SPECvirt_sc2010 measures Hypervisor performance is a key factor in deciding which virtualization platform to implement, and hypervisor performance using Red Hat Enterprise Virtualization performance, as measured by the SPECvirt_sc2010® benchmark, realistic workloads. leads the industry both in terms of highest overall performance and highest number of performant As of March 1, 2013, RHEV virtual machines on a single server. leads VMware in performance on comparable WHAT IS SPECVIRT_SC2010®? servers defined by CPU count. SPECvirt_sc2010 is the first vendor-neutral benchmark designed to measure the performance of • The RHEV subscription model datacenter servers that are used for server virtualization. The benchmark was developed by the provides high performance non-profit Standard Performance Evaluation Corporation (SPEC) virtualization subcommittee, at a lower cost-per-SPECvirt_ whose members and contributors include AMD, Dell, Fujitsu, HP, IBM, Intel, Oracle, Red Hat, Unisys sc2010 than VMware. and VMware. SPECvirt_sc2010 uses realistic workloads and SPEC’s proven performance measurement method- ologies to enable vendors, users and researchers to compare system performance across multiple hardware, virtualization platforms, and applications.
    [Show full text]
  • On Benchmarking Intrusion Detection Systems in Virtualized Environments
    Technical Report: SPEC-RG-2013-002 Version: 1.0 On Benchmarking Intrusion Detection Systems in Virtualized Environments SPEC RG IDS Benchmarking Working Group Aleksandar Milenkoski Samuel Kounev Alberto Avritzer Institute for Program Structures and Data Institute for Program Structures and Data Siemens Corporate Research Organization Organization Princeton, NJ USA Karlsruhe Institute of Technology Karlsruhe Institute of Technology [email protected] Karlsruhe, Germany Karlsruhe, Germany [email protected] [email protected] Nuno Antunes Marco Vieira CISUC, Department of Informatics CISUC, Department of Informatics Engineering Engineering University of Coimbra University of Coimbra Coimbra, Portugal Coimbra, Portugal [email protected] [email protected] arXiv:1410.1160v1 [cs.CR] 5 Oct 2014 ® Research ℠ June 26, 2013 research.spec.org www.spec.org Contents 1 Introduction.......................................1 2 Intrusion Detection in Virtualized Environments . .2 2.1 VMM-Based Intrusion Detection Systems . .2 2.2 Intrusion Detection Techniques . .4 Misuse-based Intrusion Detection . .4 Anomaly-based Intrusion Detection . .5 3 Requirements and Challenges for Benchmarking VMM-based IDSes . .7 3.1 Workloads . .7 Benign Workloads . .7 Malicious Workloads . 10 3.2 Metrics ..................................... 14 4 Conclusion . 17 References . 18 i Executive Summary Modern intrusion detection systems (IDSes) for virtualized environments are deployed in the virtualization layer with components inside the virtual machine monitor (VMM) and the trusted host virtual machine (VM). Such IDSes can monitor at the same time the network and host activities of all guest VMs running on top of a VMM being isolated from malicious users of these VMs. We refer to IDSes for virtualized environments as VMM-based IDSes. In this work, we analyze state-of-the-art intrusion detection techniques applied in virtualized environments and architectures of VMM-based IDSes.
    [Show full text]
  • IADIS Conference Template
    BENCHMARKING OF BARE METAL VIRTUALIZATION PLATFORMS ON COMMODITY HARDWARE Duarte Pousa*, José Rufino*† *Polytechnic Institute of Bragança, 5300-253 Bragança, Portugal †Laboratory of Instrumentation and Experimental Particle Physics, University of Minho, 4710-057 Braga, Portugal ([email protected], [email protected]) ABSTRACT In recent years, System Virtualization became a fundamental IT tool, whether it is type-2/hosted virtualization, mostly exploited by end-users in their personal computers, or type-1/bare metal, well established in IT departments and thoroughly used in modern datacenters as the very foundation of cloud computing. Though bare metal virtualization is meant to be deployed on server-grade hardware (for performance, stability and reliability reasons), properly configured desktop-class systems are often used as virtualization “servers”, due to their attractive performance/cost ratio. This paper presents the results of a study conducted on such systems, about the performance of Windows 10 and Ubuntu Server 16.04 guests, when deployed in what we believe are the type-1 platforms most in use today: VMware ESXi, Citrix XenServer, Microsoft Hyper-V, and KVM-based (represented by oVirt and Proxmox). Performance is measured using three synthetic benchmarks: PassMark for Windows, UnixBench for Ubuntu Server, and the cross-platform Flexible I/O Tester. The benchmarks results may be used to choose the most adequate type-1 platform (performance-wise), depending on guest OS, its performance requisites (CPU-bound, IO-bound, or balanced) and its storage type (local/remote) used. KEYWORDS Bare Metal Virtualization, Synthetic Benchmarking, Performance Assessment, Commodity Hardware 1. INTRODUCTION System Virtualization allows a physical host to run (simultaneously) many guest operating systems in self- contained software-defined virtual machines, sharing the host hardware, under control of a hypervisor.
    [Show full text]
  • TPC-V: a Benchmark for Evaluating the Performance of Database Applications in Virtual Environments Priya Sethuraman and H
    TPC-V: A Benchmark for Evaluating the Performance of Database Applications in Virtual Environments Priya Sethuraman and H. Reza Taheri, VMWare, Inc. {psethuraman, rtaheri}@vmware.com TPCTC 2010 Singapore © 2010 VMware Inc. All rights reserved Agenda/Topics Introduction to virtualization Existing benchmarks Genesis of TPC-V But what is TPC-E??? TPC-V design considerations Set architectures, variability, and elasticity Benchmark development status Answers to some common questions 2 TPC TC 2010 What is a Virtual Machine? A (VM) is a software computer that, like a physical computer, runs an operating system and applications. An operating system installed on a virtual machine is called a guest operating system. Virtual machines run on host servers. The same server can run many virtual machines. Every VM runs in an isolated environment. Started out with IBM VM in the 60s Also on Sun Solaris, HP Itanium, IBM Power/AIX, others A new wave started in the late 90s on X86 • Initially, enthusiasts ran Windows and Linux VMs on their PCs Traditional Architecture Virtual Architecture 3 TPC TC 2010 Why virtualize a server ? Server consolidation • The vast majority of server are grossly underutilized • Reduces both CapEx and OpEx Migration of VMs (both storage and CPU/memory) • Enables live load balancing • Facilitates maintenance High availability • Allows a small number of generic servers to back up all servers Fault tolerance • Lock-step execution of two VMs Cloud computing! Utility computing was finally enabled by • Ability to consolidate
    [Show full text]
  • KVM Virtualization Roadmap and Technology Update
    KVM Virtualization Roadmap and Technology Update Karen Noel Bhavna Sarathy Senior Eng Manager Senior Product Manager Red Hat, Inc. Red Hat, Inc. June 13, 2013 Why we believe KVM is the best virtualization platform Performance Lower Cost Cross Platform KVM holds the Top 6/11 customers report up to Support and certification for virtual machine consolidation 70% savings by using leading x86_64 operating systems scores on SPECvirt (1) KVM (2) including RHEL and Microsoft Windows (4) Security Cloud & Virtualization Management Certification (3) EAL4+ Red Hat Open Stack for Cloud plus SE Linux enabling Virtualization and Red Hat Mandatory Access Control Enterprise Virtualization for data- between virtual machines center Virtualization (1) Source: SpecVirt_sc2010 results: http://www.spec.org/virt_sc2010/results/specvirt_sc2010_perf.html (2) Source: Case study on Canary Islands Government migration from VMware to RHEV: http://www.redhat.com/resourcelibrary/case-studies/canary-islands-government-migrates-telecommunications-platform-from- vmware-to-red-hat (3) Source: http://www.redhat.com/solutions/industry/government/certifications.html (4) Source: http://www.redhat.com/resourcelibrary/articles/enterprise-linux-virtualization-support KVM hypervisor in multiple Red Hat products KVM is the foundation Virtualization technology in multiple Red Hat products Red Hat Enterprise Virtualization – Hypervisor Derived from Red Hat Enterprise Linux SMALL FORM FACTOR, SCALABLE, ● RHEV Hypervisor HIGH PERFORMANCE ● Prebuilt binary (ISO) with 300+ packages derived
    [Show full text]
  • Report on Cloud Computing to the OSG Steering Committee
    SPEC OPEN SYSTEMS GROUP CLOUD COMPUTING WORKING GROUP Report on Cloud Computing to the OSG Steering Committee Table of Contents Introduction and Objectives ...................................................... 3 1 Cloud Computing ............................................................ 5 1.1 Definition of the Cloud .......................................................... 5 1.1.1 Characteristics .............................................................................. 5 1.1.2 Service Models ............................................................................. 6 1.1.3 Deployment Models ...................................................................... 6 1.2 Interested parties of a Cloud Benchmark ............................. 6 1.2.1 Participants ................................................................................... 7 1.2.2 Cloud Benchmark Audience Types .............................................. 7 1.3 Cloud Computing Use Cases ............................................... 8 1.3.1 Computing Use Cases .................................................................. 8 1.3.2 Use Case to Customer Map ......................................................... 9 1.3.3 Use Case to Processing Type Map ............................................ 10 1.4 Existing Cloud Frameworks ................................................ 11 2 Benchmark Considerations ........................................... 12 2.1 What is a Cloud SUT? ........................................................ 12 2.2 SUT Components
    [Show full text]
  • Review & Background
    Spring 2018 :: CSE 502 Review & Background Nima Honarmand Spring 2018 :: CSE 502 Measuring & Reporting Performance Spring 2018 :: CSE 502 Performance Metrics • Latency (execution/response time): time to finish one task • Throughput (bandwidth): number of tasks finished per unit of time – Throughput can exploit parallelism, latency can’t – Sometimes complimentary, often contradictory • Example: move people from A to B, 10 miles – Car: capacity = 5, speed = 60 miles/hour – Bus: capacity = 60, speed = 20 miles/hour – Latency: car = 10 min, bus = 30 min – Throughput: car = 15 PPH (w/ return trip), bus = 60 PPH Pick the right metric for your goals Spring 2018 :: CSE 502 Performance Comparison • “Processor A is X times faster than processor B” if – Latency(P, A) = Latency(P, B) / X – Throughput(P, A) = Throughput(P, B) * X • “Processor A is X% faster than processor B” if – Latency(P, A) = Latency(P, B) / (1+X/100) – Throughput(P, A) = Throughput(P, B) * (1+X/100) • Car/bus example – Latency? Car is 3 times (200%) faster than bus – Throughput? Bus is 4 times (300%) faster than car Spring 2018 :: CSE 502 Latency/throughput of What Program? • Very difficult question! • Best case: you always run the same set of programs – Just measure the execution time of those programs – Too idealistic • Use benchmarks – Representative programs chosen to measure performance – (Hopefully) predict performance of actual workload – Prone to Benchmarketing: “The misleading use of unrepresentative benchmark software results in marketing a computer system” -- wikitionary.com Spring 2018 :: CSE 502 Types of Benchmarks • Real programs – Example: CAD, text processing, business apps, scientific apps – Need to know program inputs and options (not just code) – May not know what programs users will run – Require a lot of effort to port • Kernels – Small key pieces (inner loops) of scientific programs where program spends most of its time – Example: Livermore loops, LINPACK • Toy Benchmarks – e.g.
    [Show full text]
  • Autonomic Performance-Aware Resource Management in Dynamic IT Service Infrastructures
    escartes Autonomic Performance-Aware Resource Management in Dynamic IT Service Infrastructures Zur Erlangung des akademischen Grades eines Doktors der Ingenieurwissenschaften/ Doktors der Naturwissenschaften der Fakultät für Informatik des Karlsruher Instituts für Technologie (KIT) genehmigte Dissertation von Nikolaus Matthias Huber aus Zell am Harmersbach Tag der mündlichen Prüfung: 16.07.2014 Erster Gutachter: Prof. Dr. Samuel Kounev Zweiter Gutachter: Prof. Dr. Ralf Reussner KIT – Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft www.kit.edu Ich versichere wahrheitsgem¨aß, die Dissertation bis auf die dort angegebenen Hilfen selb- st¨andig angefertigt, alle benutzten Hilfsmittel vollst¨andig und genau angegeben und alles kenntlich gemacht zu haben, was aus Arbeiten anderer und eigenen Ver¨offentlichungen unver¨andert oder mit Anderungen¨ entnommen wurde. Karlsruhe, 30. September 2014 Nikolaus Huber Contents Abstract xiii Zusammenfassung xv 1. Introduction 1 1.1. Motivation . .1 1.2. Problem Statement . .2 1.3. Shortcomings of Existing Approaches . .4 1.4. Contributions of this Thesis . .6 1.5. Outline . .8 I. Foundations and Related Work 11 2. Autonomic System Adaptation and Performance Modeling 13 2.1. Autonomic and Self-Adaptive Systems . 14 2.1.1. Self-* Properties . 14 2.1.2. Design Principles of Autonomic Systems . 16 2.1.3. Model-Driven Autonomic and Self-Adaptive Systems . 18 2.1.4. The Descartes Research Project and Self-Aware Computing Systems 20 2.2. Software Performance Engineering . 21 2.2.1. Performance Models: Classification and Terminology . 21 2.2.2. Model-Driven Software Performance Engineering . 22 2.2.3. The Palladio Component Model (PCM) . 22 2.2.4.
    [Show full text]
  • Chauffeur: a Framework for Measuring Energy Efficiency of Servers
    Chauffeur: A framework for measuring Energy Efficiency of Servers Masters Project Jeremy Arnold University of Minnesota [email protected] December 16, 2013 Abstract Energy efficiency is an important consideration for servers, bringing about a need for benchmarks to fairly measure energy efficiency. These benchmarks must be relevant, reproducible, fair, verifiable, and easy to use. This paper describes these characteristics in detail and provides an assessment of existing energy efficiency benchmarks based on these cri- teria. Next it describes Chauffeur, a new framework developed to help benchmarks achieve these goals with relatively little effort. SPEC's Server Efficiency Rating Tool (SERT) builds upon the Chauffeur framework to deliver more meaningful energy efficiency data than is provided by the cur- rent generation of benchmarks and workloads. Experimental SERT data is used to highlight some of the features of Chauffeur and characteristics of SERT. 1 Introduction Energy consumption of data centers has become an important issue for many companies, resulting in increased focus on the energy efficiency of servers [1]. 1 There are many challenges in evaluating the energy efficiency of computer servers. Some traditional performance-oriented benchmarks have been updated to include energy efficiency metrics; while these have some value, the results of these measurements are not indicative of energy consumption under real- world conditions. Early benchmarks like SPECpower ssj2008 that were designed specifically for measuring energy efficiency have their own limitations. I have made the following contributions in the context of energy efficiency measurements for servers: 1) a detailed set of criteria that can be used to assess the quality of computer benchmarks, and energy efficiency benchmarks in particular, 2) development of the Chauffeur framework for creating high-quality energy efficiency benchmarks.
    [Show full text]