Diagnosing, Predicting and Managing Application Performance in Virtualised Multi-Tenant Clouds
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
Queueing-Theoretic Solution Methods for Models of Parallel and Distributed Systems·
1 Queueing-Theoretic Solution Methods for Models of Parallel and Distributed Systems· Onno Boxmat, Ger Koolet & Zhen Liui t CW/, Amsterdam, The Netherlands t INRIA-Sophia Antipolis, France This paper aims to give an overview of solution methods for the performance analysis of parallel and distributed systems. After a brief review of some important general solution methods, we discuss key models of parallel and distributed systems, and optimization issues, from the viewpoint of solution methodology. 1 INTRODUCTION The purpose of this paper is to present a survey of queueing theoretic methods for the quantitative modeling and analysis of parallel and distributed systems. We discuss a number of queueing models that can be viewed as key models for the performance analysis and optimization of parallel and distributed systems. Most of these models are very simple, but display an essential feature of dis tributed processing. In their simplest form they allow an exact analysis. We explore the possibilities and limitations of existing solution methods for these key models, with the purpose of obtaining insight into the potential of these solution methods for more realistic complex quantitative models. As far as references is concerned, we have restricted ourselves in the text mainly to key references that make a methodological contribution, and to sur veys that give the reader further access to the literature; we apologize for any inadvertent omissions. The reader is referred to Gelenbe's book [65] for a general introduction to the area of multiprocessor performance modeling and analysis. Stochastic Petri nets provide another formalism for modeling and perfor mance analysis of discrete event systems. -
A Robust Queueing Network Analyzer Based on Indices of Dispersion
Submitted to INFORMS Journal on Computing manuscript (Please, provide the manuscript number!) Authors are encouraged to submit new papers to INFORMS journals by means of a style file template, which includes the journal title. However, use of a template does not certify that the paper has been accepted for publication in the named jour- nal. INFORMS journal templates are for the exclusive purpose of submitting to an INFORMS journal and should not be used to distribute the papers in print or online or to submit the papers to another publication. A Robust Queueing Network Analyzer Based on Indices of Dispersion Ward Whitt Department of Industrial Engineering and Operations Research, Columbia University, [email protected], Wei You Department of Industrial Engineering and Operations Research, Columbia University, [email protected], We develop a robust queueing network analyzer (RQNA) algorithm to approximate the steady-state per- formance of a single-class open queueing network of single-server queues with Markovian routing, allowing non-renewal external arrival processes and non-exponential service-time distributions. Each flow is partially characterized by its rate and index of dispersion for counts (IDC, i.e., scaled variance-time function). A robust queueing approximation is used to approximate the mean steady-state number of customers and workload (remaining service time) at each queue, given the rate and IDC of the arrival process and the first two moments of the service time. The RQNA algorithm includes subroutines to calculate or estimate the IDC for each external flow, subroutines to solve systems of linear equations to calculate the rates and approximate the IDC's of the internal flows and a feedback elimination procedure. -
This Item Is the Archived Peer-Reviewed Author-Version Of
This item is the archived peer-reviewed author-version of: Analyzing queues with customer differentiation using matrix analytic methods Reference: van Velthoven Jeroen.- Analyzing queues with customer differentiation using matrix analytic methods Antwerpen, Universiteit Antwerpen. Faculteit Wetenschappen. Departement Wiskunde & Informatica, 2008, 179 p. Handle: http://hdl.handle.net/10067/1109350151162165141 Institutional repository IRUA Faculteit Wetenschappen Departement Wiskunde & Informatica Analyzing Queues with Customer Differentiation using Matrix Analytic Methods Een analyse van wachtrijsystemen met differentiatie in klanten gebruikmakend van matrix analytische methodes Proefschrift voorgelegd tot het behalen van de graad van doctor in de Wetenschappen aan de Universiteit Antwerpen, te verdedigen door Jeroen VAN VELTHOVEN Promotor: Prof. dr. Benny Van Houdt Copromotor: Prof. dr. Chris Blondia Antwerpen, 2008 Contents Preface xi Acknowledgements xiii Notations and Acronyms xv Summary and Organization xvii I Introduction1 1 Queueing Theory and Markov Chains3 1.1 Markov Chains...................................4 1.2 Queueing Theory..................................6 1.2.1 Arrival Process...............................6 1.2.2 Scheduling Discipline............................7 1.2.3 Service Process...............................8 1.2.4 Some Basic Queueing Systems......................9 1.2.5 Customer Differentiation..........................9 2 Matrix Analytic Methods 11 2.1 Structured Processes................................ 12 2.1.1 M/G/1 and GI/M/1 Type Markov Chains................ 12 2.1.2 Quasi-Birth-Death Markov Chains.................... 12 2.1.3 Tree-Like Processes............................. 14 2.2 The D-MAP/PH/1 Queue............................. 16 2.2.1 The Discrete-Time Markovian Arrival Process.............. 16 2.2.2 Phase-Type Service Time......................... 17 2.2.3 Modeling the D-MAP/PH/1 Queue................... 18 II Steady State Analysis 19 3 Impatient Customers 21 3.1 Related Work................................... -
CPU Scheduling
Chapter 5: CPU Scheduling Operating System Concepts – 10th Edition Silberschatz, Galvin and Gagne ©2018 Chapter 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Thread Scheduling Multi-Processor Scheduling Real-Time CPU Scheduling Operating Systems Examples Algorithm Evaluation Operating System Concepts – 10th Edition 5.2 Silberschatz, Galvin and Gagne ©2018 Objectives Describe various CPU scheduling algorithms Assess CPU scheduling algorithms based on scheduling criteria Explain the issues related to multiprocessor and multicore scheduling Describe various real-time scheduling algorithms Describe the scheduling algorithms used in the Windows, Linux, and Solaris operating systems Apply modeling and simulations to evaluate CPU scheduling algorithms Operating System Concepts – 10th Edition 5.3 Silberschatz, Galvin and Gagne ©2018 Basic Concepts Maximum CPU utilization obtained with multiprogramming CPU–I/O Burst Cycle – Process execution consists of a cycle of CPU execution and I/O wait CPU burst followed by I/O burst CPU burst distribution is of main concern Operating System Concepts – 10th Edition 5.4 Silberschatz, Galvin and Gagne ©2018 Histogram of CPU-burst Times Large number of short bursts Small number of longer bursts Operating System Concepts – 10th Edition 5.5 Silberschatz, Galvin and Gagne ©2018 CPU Scheduler The CPU scheduler selects from among the processes in ready queue, and allocates the a CPU core to one of them Queue may be ordered in various ways CPU scheduling decisions may take place when a -
Download File
A Robust Queueing Network Analyzer Based on Indices of Dispersion Wei You Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2019 c 2019 Wei You All Rights Reserved ABSTRACT A Robust Queueing Network Analyzer Based on Indices of Dispersion Wei You In post-industrial economies, modern service systems are dramatically changing the daily lives of many people. Such systems are often complicated by uncertainty: service providers usually cannot predict when a customer will arrive and how long the service will be. For- tunately, useful guidance can often be provided by exploiting stochastic models such as queueing networks. In iterating the design of service systems, decision makers usually favor analytical analysis of the models over simulation methods, due to the prohibitive compu- tation time required to obtain optimal solutions for service operation problems involving multidimensional stochastic networks. However, queueing networks that can be solved an- alytically require strong assumptions that are rarely satisfied, whereas realistic models that exhibit complicated dependence structure are prohibitively hard to analyze exactly. In this thesis, we continue the effort to develop useful analytical performance approx- imations for the single-class open queueing network with Markovian routing, unlimited waiting space and the first-come first-served service discipline. We focus on open queueing networks where the external arrival processes are not Poisson and the service times are not exponential. We develop a new non-parametric robust queueing algorithm for the performance ap- proximation in single-server queues. With robust optimization techniques, the underlying stochastic processes are replaced by samples from suitably defined uncertainty sets and the worst-case scenario is analyzed. -
Inferring Queueing Network Models from High-Precision Location Tracking Data
University of London Imperial College of Science, Technology and Medicine Department of Computing Inferring Queueing Network Models from High-precision Location Tracking Data Tzu-Ching Horng Submitted in part fulfilment of the requirements for the degree of Doctor of Philosophy in Computing of the University of London and the Diploma of Imperial College, May 2013 Abstract Stochastic performance models are widely used to analyse the performance and reliability of systems that involve the flow and processing of customers. However, traditional methods of constructing a performance model are typically manual, time-consuming, intrusive and labour- intensive. The limited amount and low quality of manually-collected data often lead to an inaccurate picture of customer flows and poor estimates of model parameters. Driven by ad- vances in wireless sensor technologies, recent real-time location systems (RTLSs) enable the automatic, continuous and unintrusive collection of high-precision location tracking data, in both indoor and outdoor environment. This high-quality data provides an ideal basis for the construction of high-fidelity performance models. This thesis presents a four-stage data processing pipeline which takes as input high-precision location tracking data and automatically constructs a queueing network performance model approximating the underlying system. The first two stages transform raw location traces into high-level “event logs” recording when and for how long a customer entity requests service from a server entity. The third stage infers the customer flow structure and extracts samples of time delays involved in the system; including service time, customer interarrival time and customer travelling time. The fourth stage parameterises the service process and customer arrival process of the final output queueing network model. -
COS 318: Operating Systems CPU Scheduling
COS 318: Operating Systems CPU Scheduling Jaswinder Pal Singh Computer Science Department Princeton University (http://www.cs.princeton.edu/courses/cos318/) Today’s Topics u CPU scheduling basics u CPU scheduling algorithms 2 When to Schedule? u Process/thread creation u Process/thread exit u Process thread blocks (on I/O, synchronization) u Interrupt (I/O, clock) 3 Preemptive and Non-Preemptive Scheduling Terminate Exited (call scheduler) Scheduler dispatch Running Block for resource (call scheduler) Yield, Interrupt (call scheduler) Ready Blocked Create Resource free, I/O completion interrupt (move to ready queue) 4 Scheduling Criteria u Assumptions l One process per user and one thread per process l Processes are independent u Goals for batch and interactive systems l Provide fairness l Everyone makes some progress; no one starves l Maximize CPU utilization • Not including idle process l Maximize throughput • Operations/second (min overhead, max resource utilization) l Minimize turnaround time • Batch jobs: time to execute (from submission to completion) l Shorten response time • Interactive jobs: time response (e.g. typing on a keyboard) l Proportionality • Meets user’s expectations Scheduling Criteria u Questions: l What are the goals for PCs versus servers? l Average response time vs. throughput l Average response time vs. fairness Problem Cases u Completely blind about job types l Little overlap between CPU and I/O u Optimization involves favoring jobs of type “A” over “B” l Lots of A’s? B’s starve u Interactive process trapped -
CPU Scheduling
CPU Scheduling CSE 2431: Introduction to Operating Systems Instructor: Adam C. Champion, Ph.D. Reading: Chapter 5, [OSC] (except Sections 5.7.2, 5.8) 1 Contents • Why Scheduling? • Basic Concepts of Scheduling • Scheduling Criteria • A Basic Scheduling Algorithm (FCFS) • Scheduling Algorithms (SJF, RR, etc.) • Thread Scheduling 2 Why Scheduling? • Deciding which process/thread should occupy the resource (CPU, disk, etc.) (CPU (horsepower)) I want to ride it Whose turn is it? Process 1 Process 2 Process 3 3 Contents • Why Scheduling? • Basic Concepts of Scheduling • Scheduling Criteria • A Basic Scheduling Algorithm (FCFS) • Scheduling Algorithms (SJF, RR, etc.) • Thread Scheduling 4 When to Schedule? 5 Contents • Why Scheduling? • Basic Concepts of Scheduling • Scheduling Criteria • Basic Scheduling Algorithm (FCFS) • Scheduling Algorithms (SJF, RR, etc.) • Thread Scheduling 6 Scheduling Objectives • Fairness (nobody cries) • Priority (ladies first) • Efficiency (make best use of equipment) • Encourage good behavior (good boy/girl) • Support heavy loads (degrade gracefully) • Adapt to different environments (interactive, real-time, multimedia) 7 Performance Criteria • Fairness: No starvation • Efficiency: keep resources as busy as possible • Throughput: # of processes completed in unit time • Turnaround time (also called elapsed time): Amount of time to complete a certainprocess from its beginning • Waiting Time: Amount of time process has been waiting in ready queue • Response Time: Amount of time from when a request was first submitted until first response is produced. • Policy Enforcement: Enforcing that stated policy is carried out • Proportionality: Meet users' expectations 8 • Meeting Deadlines: Avoid losing data Different Systems, Different Foci • For all – Fairness, policy enforcement, resource balance • Batch Systems – Max throughput, min turnaround time, max CPU utilization • Interactive Systems – Min Response time, best proportionality • Real-Time Systems – predictability, meeting deadlines 9 Preemptive vs. -
Queueing-Theoretic Solution Methods for Models of Parallel and Distributed Systems
Queueing-Theoretic Solution Methods for Models of Parallel and Distributed Systems Onno Boxmay, Ger Kooley & Zhen Liuz yCWI, P.O. Box 94079, 1090 GB Amsterdam, The Netherlands zINRIA Sophia-Antipolis, B.P. 93, 06902 Sophia-Antipolis, France Abstract This paper aims to give an overview of solution methods for the performance analysis of parallel and distributed systems. After a brief review of some important general solution methods, we discuss key models of parallel and distributed systems, and optimization issues, from the viewpoint of solution methodology. AMS Subject Classification (1991): 60K20, 68M20 Keywords & Phrases: queueing, performance evaluation, parallel systems Note: Supported by the European Grant BRA-QMIPS of CEC DG XIII, the second author is also supported by NFI. This paper appeared also in Performance Evaluation of Parallel and Distributed Systems | Solution Methods, O.J. Boxma and G.M. Koole (eds.), CWI, Amsterdam, 1994 (CWI Tract 105 & 106). 1. Introduction The purpose of this paper is to present a survey of queueing theoretic methods for the quantitative modeling and analysis of parallel and distributed systems. We discuss a number of queueing models that can be viewed as key models for the performance analysis and optimization of parallel and distributed systems. Most of these models are very simple, but display an essential feature of distributed processing. In their simplest form they allow an exact analysis. We explore the possibilities and limitations of existing solution methods for these key models, with the purpose of obtaining insight into the potential of these solution methods for more realistic complex quantitative models. As far as references is concerned, we have restricted ourselves in the text mainly to key references that make a methodological contribution, and to surveys that give the reader further access to the literature; we apologize for any inadvertent omissions. -
Robust Transient Analysis of Multi-Server Queueing Systems and Feed-Forward Networks
Queueing Syst https://doi.org/10.1007/s11134-017-9566-6 Robust transient analysis of multi-server queueing systems and feed-forward networks Chaithanya Bandi1 · Dimitris Bertsimas2 · Nataly Youssef3 Received: 17 April 2016 / Revised: 1 October 2017 © Springer Science+Business Media, LLC, part of Springer Nature 2018 Abstract We propose an analytically tractable approach for studying the transient behavior of multi-server queueing systems and feed-forward networks. We model the queueing primitives via polyhedral uncertainty sets inspired by the limit laws of probability. These uncertainty sets are characterized by variability parameters that control the degree of conservatism of the model. Assuming the inter-arrival and service times belong to such uncertainty sets, we obtain closed-form expressions for the worst case transient system time in multi-server queues and feed-forward networks with deterministic routing. These analytic formulas offer rich qualitative insights on the dependence of the system times as a function of the variability parameters and the fundamental quantities in the queueing system. To approximate the average behavior, we treat the variability parameters as random variables and infer their density by using ideas from queues in heavy traffic under reflected Brownian motion. We then average the worst case values obtained with respect to the variability parameters. Our averaging approach yields approximations that match the diffusion approximations for a single queue with light-tailed primitives and allows us to extend -
Diffusion Approximations
D.E Heyman, M.J. Sobel, Eds., Handbooks on OR & MS, Vol. 2 (~ Elsevier Science Publishers B.V. (North-Holland) 1990 Chapter 4 Diffusion Approximations Peter W. Glynn Department of Operations Research, Stanford University, Stanford, CA 94305-4022, U.S.A. 1. Introduction In this chapter, we shall give an overview of some of the basic applications of the theory of diffusion approximations to operations research. A diffusion approximation is a technique in which a complicated and analytically intract- able stochastic process is replaced by an appropriate diffusion process. A diffusion process is a (strong) Markov process having continuous sample paths. Diffusion processes have a great deal of analytical structure and are therefore typically more mathematically tractable than the original process with which one starts. The approach underlying the application of diffusion approxima- tions is therefore comparable to that underlying normal approximation for sums of random variables. In the latter setting, the central limit theorem permits one to replace the analytically intractable sum of random variables by an appropriately chosen normal random variable. In this chapter, we shall describe some of the basic theory of weak convergence that underlies the method of diffusion approximation. We shall then survey various applications of this methodology to the approximation of complex queueing systems. Because we are interested in developing approximations for the distribution of a process (considered as a random function of time), it is necessary for us to describe the basic elements of the theory of weak convergence in a function space. Sections 2 and 3 are therefore devoted to this topic. -
Product Form Queueing Networks S.Balsamo Dept
Product Form Queueing Networks S.Balsamo Dept. of Math. And Computer Science University of Udine, Italy Abstract Queueing network models have been extensively applied to represent and analyze resource sharing systems such as communication and computer systems and they have proved to be a powerful and versatile tool for system performance evaluation and prediction. Product form queueing networks have a simple closed form expression of the stationary state distribution that allow to define efficient algorithms to evaluate average performance measures. We introduce product form queueing networks and some interesting properties including the arrival theorem, exact aggregation and insensitivity. Various special models of product form queueing networks allow to represent particular system features such as state-dependent routing, negative customers, batch arrivals and departures and finite capacity queues. 1 Introduction and Short History System performance evaluation is often based on the development and analysis of appropriate models. Queueing network models have been extensively applied to represent and analyze resource sharing systems, such as production, communication and computer systems. They have proved to be a powerful and versatile tool for system performance evaluation and prediction. A queueing network model is a collection of service centers representing the system resources that provide service to a collection of customers that represent the users. The customers' competition for the resource service corresponds to queueing into the service centers. The analysis of the queueing network models consists of evaluating a set of performance measures, such as resource utilization and throughput and customer response time. The popularity of queueing network models for system performance evaluation is due to a good balance between a relative high accuracy in the performance results and the efficiency in model analysis and evaluation.