On the relation between matrix-geometric and discrete phase-type distributions
Sietske Greeuw
Kongens Lyngby/Amsterdam 2009 Master thesis Mathematics University of Amsterdam Technical University of Denmark Informatics and Mathematical Modelling Building 321, DK-2800 Kongens Lyngby, Denmark Phone +45 45253351, Fax +45 45882673 [email protected] www.imm.dtu.dk
University of Amsterdam Faculty of Science Science Park 404, 1098 XH Amsterdam, The Netherlands Phone +32 20 5257678, Fax +31 20 5257675 [email protected] www.science.uva.nl Summary
A discrete phase-type distribution describes the time until absorption in a discrete-time Markov chain with a finite number of transient states and one absorbing state. The density f(n) of a discrete phase-type distribution can be expressed by the initial probability vector α, the transition probability matrix T of the transient states of the Markov chain and the vector t containing the probabilities of entering the absorbing state from the transient states:
n−1 f(n) = αT t, n ∈ N. If we take a probability density of the same form, but not necessarily require α,T and t to have the probabilistic Markov-chain interpretation, we obtain the density of a matrix-geometric distribution. Matrix-geometric distributions can equivalently be defined as distributions on the non-negative integers that have a rational probability generating function. In this thesis it is shown that the class of matrix-geometric distributions is strictly larger than the class of discrete phase-type distributions. We give an example of a set of matrix-geometric distributions that are not of discrete phase- type. We also show that there is a possible order reduction when representing a discrete phase-type distribution as a matrix-geometric distribution. The results parallel the continuous case, where the class of matrix-exponential distributions is strictly larger than the class of continuous phase-type distribu- tions, and where there is also a possible order reduction.
Keywords: discrete phase-type distributions, phase-type distributions, matrix- exponential distributions, matrix-geometric distributions. ii Resum´e
En diskret fasetype fordeling er fordelingen af tiden til absorption i en diskret- tids Markov kæde med et begrænset antal transiente tilstande og ´en absorbe- rende tilstand. Tætheden f(n) af en diskret fasetype fordeling kan udtrykkes ved vektoren med den initielle fordeling α, matricen T, der beskriver de mulige over- gange mellem transiente tilstande i Markov kæden, og vektoren t med sandsyn- ligheder for at springe til den absorberende tilstand fra de transiente tilstande:
n−1 f(n) = αT t, n ∈ N. Hvis vi tager en tæthed af samme form, men ikke nødvendigvis kræver, at α,T og t har probabilistisk Markov kæde fortolkning, f˚arvi tætheden af en matrix-geometrisk fordeling. Matrix-geometriske fordelinger kan tilsvarende de- fineres som fordelinger p˚aikke-negative heltal, der har en rationel sandsyn- lighedsgenererende funktion. I denne projekt er det vist, at mængden af matrix-geometriske fordelinger er strengt større end mængden af diskrete fasetype fordelinger. Der gives et ek- sempel p˚aen mængde af matrix-geometriske fordelinger, der ikke er af diskret fasetype. Vi viser ogs˚a,at der er en mulighed for reduktion af størrelse af repræsentationen, n˚arde repræsenterer en diskret fasetype fordeling som en matrix-geometrisk fordeling. Resultaterne svarer til det kontinuerte tilfælde, hvor klassen af matrix- eksponentielle fordelinger er strengt større end klassen af kontinuert fasetype fordelinger, og hvor reduktion af størrelse af repræsentation ogs˚aer muligt. iv Preface
This thesis was prepared in partial fulfillment of the requirements for acquiring the master of science degree in Mathematics. It has been prepared during a five month Erasmus-stay at the Danish Technical University, at the institute for Informatics and Mathematical Modelling.
I would like to thank my two supervisors, Bo Friis Nielsen and Michel Mandjes, for both being very supportive in my decision of doing my master project abroad. Furthermore I want to thank Bo for his friendly and informative guidance, and Michel for his help with the final parts of my thesis.
Finally I want to thank my family and my friends and all other people who have supported me during this process.
Amsterdam, March 2009
Sietske Greeuw vi Contents
Summary i
Resum´e iii
Preface v
1 Introduction 1
2 Phase-type distributions 5 2.1 Discrete phase-type distributions ...... 5 2.2 Continuous phase-type distributions ...... 19
3 Matrix-exponential distributions 23 3.1 Definition ...... 24 3.2 Examples of matrix-exponential distributions ...... 25
4 Matrix-geometric distributions 31 4.1 Definition ...... 32 4.2 Existence of genuine matrix-geometric distributions ...... 34 4.3 Order reduction for matrix-geometric distributions ...... 38 4.4 Properties of matrix-geometric distributions ...... 45 viii CONTENTS Chapter 1
Introduction
In this thesis the relation between matrix-geometric and discrete phase-type distributions will be explored. Matrix-geometric distributions (MG) are distri- butions on the non-negative integers that possess a density of the form
n−1 f(n) = αT t, n ∈ N (1.1) together with f(0) = ∆, the point mass in zero. The parameters (α,T, t) are a row vector, matrix and column vector respectively, that satisfy the necessary conditions for f(n) to be a density. Hence αT n−1t ≥ 0 for all n ∈ N and P∞ −1 n=0 f(n) = ∆ + α(I − T ) t = 1.
The class MG is a generalization of the class of discrete phase-type distributions (DPH). A discrete phase-type distribution describes the time until absorption in a finite-state discrete-time Markov chain. A discrete phase-type density has the same form as (1.1) but requires the parameters to have the interpretation as initial probability vector (α), sub-transition probability matrix (T ) and exit probability vector (t) of the underlying Markov chain. The analogous setup in continuous time is given by matrix-exponential distri- butions (ME) that are a generalization of continuous phase-type distributions (PH). The latter describe the time until absorption in a continuous-time finite- state Markov chain.
The name phase-type refers to the states of the Markov chain. Each visit to a 2 Introduction state can be seen as a phase with a geometric (in continuous time exponential) distribution, and the resulting Markovian structure gives rise to a discrete (con- tinuous) phase-type distribution. The method of phases has been introduced by A.K. Erlang in the beginning of the 20th century and has later been generalized by M.F. Neuts. A general probability distribution can be approximated arbi- trarily closely by a phase-type distribution. This makes phase-type distributions a powerful tool in the modelling of e.g. queueing systems. Another nice feature of phase-type distributions is that they often give rise to closed form solutions.
The main reference on discrete phase-type distributions is ‘Probability distribu- tions of phase type’ by M.F. Neuts, published in 1975 [12]. This article gives a thorough introduction to discrete phase-type distributions, their main proper- ties and their use in the theory of queues. In his 1981 book ‘Matrix-geometric Solutions in Stochastic Models’ [13], M.F. Neuts gives again an introduction to phase-type distributions, this time focussing on the continuous case. Another clear introduction of continuous phase-type distributions can be found in the book by G. Latouche and V. Ramaswami [10]. A good survey on the class ME is given in the entry on matrix-exponential distributions in the Encyclopedia of Statistical Sciences by Asmussen and O’Cinneide [4]. In this article the rela- tion of the class ME to the class PH is discussed, some properties of ME are given and the use of these distributions in applied probability is explained. The authors mention the class of matrix-geometric distributions as the analogous discrete counterpart of ME. However, no explicit examples are given.
The main question addressed in this thesis is what the relation is between the classes DPH and MG. From their definition it is clear that
DPH ⊆ MG.
It is however not immediately clear, whether there exists distributions that are in MG but do not have a discrete phase-type representation. The second question is on the order of the distribution. This order is defined as the minimal dimension of the matrix T in (1.1) needed to represent the distribution. It is of interest to know if there exist discrete phase-type distributions that have a lower order representation when they are represented as a matrix-geometric distribution. Such a reduced-order representation can be much more convenient to work with, for example in solving high-dimensional queueing systems.
In the continuous case it is known that
PH ( ME, that is, PH is a strict subset of ME [3, 4]. A standard example of a distribution that is in ME but not in PH is a distribution represented by a density which has a zero on (0, ∞). Such a distribution cannot be in PH, as a phase-type density 3 is strictly positive on (0, ∞) [13]. There is no equivalent result in discrete time, as discrete phase-type densities can be zero with a certain periodicity. A sec- ond way of showing that ME is larger than PH is by using the characterization of phase-type distributions stated by O’Cinneide in [14] which is based on the absolute value of the poles of the Laplace transform. In [15] O’Cinneide gives a lower bound on the order of a phase-type represen- tation. He shows that there is possible order reduction when using a matrix- exponential representation rather than a phase-type representation.
We will use the characterization of discrete phase-type distributions by O’Cinneide [14], to give a set of distributions that is in MG but not in DPH. We will also show with an example that order reduction by going from a DPH to an MG representation is possible. This result is based on discrete phase-type densities which have a certain periodicity.
The rest of the thesis is organized as follows: Section 2.1 gives an introduc- tion to and some important properties of discrete phase-type distributions. In Section 2.2 the class of continuous phase-type distributions PH is shortly ad- dressed. In Section 3.1 the class ME is introduced and in Section 3.2 some examples of matrix-exponential distributions are given, where the focus lies on their relation to the class PH. In Chapter 4 the answers to the questions stated in this introduction are given. In Section 4.1 we introduce matrix-geometric distributions and give an equivalent definition for these distributions based on the rationality of their probability generating function. In Section 4.2 we give an example that illustrates that MG is strictly larger than DPH and in Section 4.3 we give an example of order reduction between these classes. For the sake of completeness, Section 4.4 is devoted to some properties of matrix-geometric distributions. They equal the properties of discrete phase-type distributions that are stated in Chapter 2 but the proofs now need to be given analytically, as the probabilistic Markov-chain interpretation is no longer available. 4 Introduction Chapter 2
Phase-type distributions
This chapter is about phase-type distributions. In Section 2.1 discrete phase- type distributions are introduced. They are the distribution of the time until absorption in a discrete-time finite-state Markov chain with one absorbing state. Some properties of these distributions will be studied, the connection with the geometric distribution will be explained, and some closure properties will be examined. In Section 2.2, we have a look at continuous phase-type distributions. As their development is analogous to the discrete case, we will not explore them as extensively as the discrete case, but we will only state the most relevant results. The presentation of phase-type distributions in this chapter is based on [12],[6] and [13]. In the following, I and e will denote the identity matrix and a vector of ones respectively, both of appropriate size. Row vectors will be denoted by bold face Greek letters, whereas column vectors will be denoted by bold face Latin letters. We define N to be the set of positive integers, i.e. N = {1, 2, 3,...}.
2.1 Discrete phase-type distributions
Let {Xn}n∈N≥0 be a discrete-time Markov chain on the state space E = {1, 2, . . . , m, m + 1}. We let {1, 2, . . . , m} be the transient states of the 6 Phase-type distributions
Markov chain and m + 1 be the absorbing state. The transition probability matrix of this Markov chain is given by T t P = . 0 1 Here T is the m × m sub-transition probability matrix for the transient states, and t is the exit vector which gives the probability of absorption into state m+1 from any transient state. Since P is a transition probability matrix, each row of P must sum to 1, hence T e + t = e.
The probability of initiating the Markov chain in state i is denoted by αi = P (X0 = i) . The initial probability vector of the Markov chain is then given by Pm+1 (α, αm+1) = (α1, α2, . . . , αm, αm+1) and we have i=1 αi = 1.
Definition 2.1 (Discrete phase-type distribution) A random variable τ has a discrete phase-type distribution if τ is the time until absorption in a discrete-time Markov chain,
τ := min{n ∈ N≥0 : Xn = m + 1}.
The name phase-type distribution refers to the states (phases) of the underlying Markov chain. If T is an m×m matrix we say that the density is of order m. The order of the discrete-phase type distribution is the minimal size of the matrix T needed to represent the density.
2.1.1 Density and distribution function
In order to find the density of τ we look at the probability that the Markov chain is in one of the transient states i ∈ {1, 2, . . . , m} after n steps,
m (n) X n pi = P (Xn = i) = αk (T )ki . k=1
(n) (n) (n) (n) We can collect these probabilities in a vector and get ρ = (p1 , p2 , ··· , pm ). Note that ρ(0) = α.
Lemma 2.2 The density of a discrete phase-type random variable τ is given by
n−1 fτ (n) = αT t, n ∈ N (2.1) and fτ (0) = αm+1. 2.1 Discrete phase-type distributions 7
Proof. The probability of absorption of the Markov chain at time n is given by the sum over the probabilities of the Markov chain being in one of the states {1, 2, . . . , m} at time n − 1 multiplied by the probability that absorption takes place from that state. The state of the Markov chain at time n − 1 depends on the initial state of the Markov chain and the (n − 1)-step transition probability matrix T n−1. Hence we get
m X (n−1) (n−1) n−1 fτ (n) = P (τ = n) = pi ti = ρ t = αT t, n ∈ N. i=1
Note that fτ (0) is the probability of absorption of the Markov chain in zero steps, which is given by αm+1, the probability of initiating in the absorbing state.
The density of τ is completely defined by the initial probability vector α and the sub-transition probability matrix T , since t = (I − T ) e. We write
τ ∼ DPH (α,T ) to denote that τ is of discrete phase type with parameters α and T . The density of a discrete phase-type distribution is said to have an atom in zero of size αm+1 if fτ (0) = αm+1.
A representation (α,T ) for a phase-type distribution is called irreducible if every state of the Markov chain can be reached with positive probability if the initial distribution is given by α. We can always find such a representation by simply leaving out the states that cannot be reached.
From now on we will omit the subscript τ and simply write f(n) for the density of a discrete phase-type random variable. We have to check that f(n) = αT n−1t is a well-defined density on the non-negative integers. Since α,T and t only have non-negative entries (as their entries are probabilities) we know that f(n) is non-negative for all n. The infinite sum of f(n) is given by:
∞ ∞ X X f(n) = f(0) + αT n−1t n=0 n=1 ∞ ! X n = αm+1 + α T t n=0 −1 = αm+1 + α (I − T ) t −1 = αm+1 + α (I − T ) (I − T ) e
= αm+1 + αe = 1. 8 Phase-type distributions
As T is a sub-stochastic matrix, all its eigenvalues are less than one (see e.g. P∞ n [2] Proposition I.6.3). Therefore we have in the above that the series n=0 T converges to the matrix (I − T )−1 (see e.g. [16] Lemma B.1.).
We denote by F (n) = P (τ ≤ n) the distribution function of τ. The distribution function can be deduced by the following probabilistic argument.
Lemma 2.3 The distribution function of a discrete phase-type variable is given by F (n) = 1 − αT ne. (2.2)
Proof. We look at the probability that absorption has not yet taken place and hence the Markov chain is in one of the transient states. We get
1 − F (n) = P (τ > n) m X (n) = pi i=1 = p(n)e = αT ne.
2.1.2 Probability generating function and moments
For a discrete random variable X defined on the non-negative integers and density pn = P (X = n) the probability generating function (pgf) is given by
∞ 2 X k H(z) = p0 + zp1 + z p2 + ... = z pk, |z| ≤ 1. k=0