The Analysis of Dynamical Queueing Systems (Background)

The Analysis of Dynamical Queueing Systems (Background) Technological innovations are creating new types of communication systems. During the 20 th century, we saw the evolution of electronic communication from the telegraph to the telephone to the internet. In turn, these new types communication systems have created new types of communication services such as call centers, electronic commerce and wireless communications. Communication services managers must make important business decisions to stay competitive and profitable. They have to maximize the communication resources that they are making available to the customer. The word “resource’’ can refer to telephone lines, telephone operators, optical fiber or bandwidth. At the same time, the manager must minimize their costs for providing these resources. This also allows them to maximize their profits. Unpredictable but growing demand for new services delivered by new communication systems renders managerial decision making difficult. The mathematical field of queueing theory has traditionally provided managers with a useful set of decision making formulas and algorithms for designing communication systems and services. An example of this is the classical Erlang blocking formula that was developed in 1917. For a group of telephone trunk lines that connect two central offices, this formula gives the probability that “all circuits are busy” (no trunk line is currently available) given the fixed number of trunk lines as provided by the service and given the average number of trunk lines requested by the customers. This formula can then be “inverted” so a manager can then decide what should be the least or optimal amount of trunk lines needed for a given customer demand that would keep a specific quality of service below some acceptable threshold level. The measure that we use here is the probability of all circuits being busy Historically, this was the firt problem that led to the creation of queueing theory. A. K. Erlang modeled the usage of a telephone trunk group as a time-homogeneous Markov chain. The randomness arises as the best way to model mathematically the aggregate behavior of people. The transient (short run) probabilities for such a stochastic process solve a set of ordinary differential equations. When we look at the asymptotic regime of large time (long run), these differential equations then reduce to a set of linear equations which are relatively easier to solve. This type of equilibrium or steady state analysis made classical queueing theory an important tool for modeling many different types of voice communications systems. The simplest, fundamental queueing model of this type that Erlang used has multiple servers (corresponding to telephone operators) but no buffers for customers to wait when all the servers are busy (when the customer hears a busy signal, they leave). We model the number of busy servers as a continuous time Markov process where we assume that the call arrival traffic is modeled by a random Poisson process of a constant mean rate and the time each customer spends on the phone is random with a negative exponential distribution. 1 The success of using queueing theory to model circuit-switched telephone networks in the first half of the 20 th century for voice communications led to another major triumph in the latter half of the 20 th century as queueing theory was applied by computer scientists such as Leonard Kleinrock [1] to model data communication systems. The simplest, fundamental queueing model for data communications has a single server where jobs arrive according to a Poisson process. When that single server is busy, there is an unlimited capacity waiting buffer for all the jobs there are not in service. The time that each job takes to be served is random with a negative exponential distribution. By using again the steady state theory for continuous time Markov chains, a queueing analysis was invented that contributed to the design of the first prototype for the Internet (ARPANET). Given the new types of communication systems and services available today, it is no longer possible to make many of the simplifying assumptions of classical queueing theory. One simplifying assumption that has become increasingly untenable is that all queues act in isolation. The reality is that most of them interact as a queueing network. Mathematically, this dramatically increases the complexity of the queueing models needed to capture this interactivity. Another major mathematical assumption has been to restrict our analysis of Markov processes to ones that are time homogeneous. This is akin to assuming that the mean arrival rate for telephone calling traffic is constant or independent of the time of day. Manager and engineers can no longer assume this fiction in their daily lives so it is important for the mathematician to respond. Unfortunately, the mathematician must now throw out the entire theory of steady state and equilibrium analysis for continuous time Markov chains when they try to address this issue. What are the mathematical tools needed to replace these techniques? These are the topics to be explored in the following series of research summaries. 1. A.K. Erlang, Solutions of some problems in the theory of probabilities of significance in automatic telephone exchanges , The Post Office Electrical Engineers’ Journal 10 (1918) 189–197. Translated from Elektroteknikeren 13 (1917) (in Danish). 2. L. Kleinrock. Queueing Systems, Volume II: Computer Applications, John Wiley & Sons., 1976. 2 I. Uniform Acceleration for Dynamical Queueing Analysis One innovation that I developed towards this end in my Ph.D. thesis was the notion of uniform acceleration . This was an asymptotic theory where the parameter of time for a specific queueing system is no longer scaled to infinity but its arrival and service rates are. When these rates are constant, this asymptotic analysis is equivalent to a steady state analysis. It is precisely when these rates are time-varying that these two types of analyses diverge and are not equivalent. I showed in my thesis that you could apply this analysis to the M/M/1 queue with time varying rates, which is a dynamic variant of the fundamental queueing model for data communications. One can use this system to model customers waiting to download files from a webserver. With my new time-dependent analysis it was now possible to obtain simple formulas that capture the dynamic behavior of this queueing model. In particular, my analysis can deal with the time dependent phenomena of backlog. For steady state analysis, we can say that such a queue is stable if its mean arrival rate is strictly less than its mean service rate. Before this type of analysis was employed, it was not mathematically understood that when rates are time varying, it is possible that the arrival rate can be less than the service rate at a given time but the system is still unstable at that time. This occurs when the arrival rates of the past have so exceeded the service rates of the past that a backlog in the queue develops. Consequently, it is not possible for the system to recover by the time the arrival rate drops below the service rate. This queueing problem has been a touchstone in my research career and I have returned to it once every ten years to say something new about it. The paper in 1985 [6] was a rigorous asymptotic analysis of the transition probabilities for the queueing model. The paper written jointly in 1995 with Avishai Mandelbaum of Technion [5] was the same asymptotic scaling but as applied directly to the random sample path behavior of the queueing system. This was achieved by using the latest probabilistic techniques of the theory of strong approximations, which give asymptotic expansions for the sample paths of a Poisson process. This gave a more detailed analysis of the queue when it is unstable. For the first time, the fluid approximation for this system as heuristically described by queueing theorists such as Kleinrock [1] was now shown to arise rigorously as a functional strong law of large numbers limit. Moreover, we showed that the deviation from this fluid limit yields its own functional central limit theorem. In a new paper, just completed [1] and co-authored by Mor Harchol-Balter of Carnegie Mellon University and my Ph.D. graduate student Robert C. Hampshire, we take this sample path analysis of the number of customers in the queueing system and apply it to the asymptotic analysis of the time spent by each customer in the systems with respect to the processor sharing service discipline. Inspired by call centers, a paper co-authored with Avishai Mandelbaum and Martin Reiman of Bell Labs [3] generalizes the notion of uniform acceleration for multiserver queues. Here the scaling of the arrival rates (demand) is matched by a scaling of the telephone operators (supply). This is a natural asymptotic regime for business managers. 3 We identify in this a large, new class of dynamical queueing network models (having time-varying rates) and show that they can be approximated by simpler sets of dynamical systems. 1. Fluid and Diffusion Limits for Transient Sojourn Times of Processor Sharing Queues with Time Varying Rates . R. C. Hampshire, M. Harchol-Balter and W. A. Massey (2006). To appear in the special issue of Queueing Systems and Their Applications on Fair Resource Sharing, Volume 53, Issue 1/2, June 2006. 2. The Analysis of Queues with Time-Varying Rates for Telecommunication Models. W. A. Massey, (2002), Telecommunications Systems 21, pp. 173–204. 3. Strong Approximations for Markovian Service Networks , A. Mandelbaum, W. A. Massey, and M. I. Reiman, Queueing Systems and Their Applications, 30 (1998) pp. 149-201. 4. Uniform Acceleration Expansions for Markov Chains with Time-Varying Rates , W. A. Massey and W.

The Analysis of Dynamical Queueing Systems (Background)

Fluid M/M/1 Catastrophic Queue in a Random Environment

Queueing Theory and Process Flow Performance

Lecture 5/6 Queueing

Steady-State Analysis of Reflected Brownian Motions: Characterization, Numerical Methods and Queueing Applications

Introduction to Queueing Theory Review on Poisson Process

Analysis of Packet Queueing in Telecommunication Networks

EUROPEAN CONFERENCE on QUEUEING THEORY 2016 Urtzi Ayesta, Marko Boon, Balakrishna Prabhu, Rhonda Righter, Maaike Verloop

Queueing Theory

Approximating Heavy Traffic with Brownian Motion

Queueing Theory and Modeling

Queueing Systems

Brief Introduction to Queueing Theory and Its Applications