Diagonalization and Powers of Matrices

Diagonalization and powers of matrices Brian Krummel April 6, 2020 One important application of diagonalizable matrices is computing powers of square matrices. Let A be a diagonalizable n × n matrix expressed as A = P DP −1 for a n × n diagonal matrix D and n × n invertible matrix P . Suppose we want to compute Ak for some integer k. Then by multiplying P DP −1 k-times and cancelling P −1P = I: Ak = (P DP −1)k = (P DP −1)(P DP −1) ··· (P DP −1) = P DD ··· D P −1 = PDkP −1: | {z } | {z } k times k times Computing the Dk for a diagonal matrix is very easy: 2 3k 2 k 3 λ1 0 ··· 0 λ1 0 ··· 0 6 0 λ ··· 0 7 6 0 λk ··· 0 7 k 6 2 7 6 2 7 D = 6 . .. 7 = 6 . .. 7 : 4 . 5 4 . 5 k 0 0 ··· λn 0 0 ··· λn That is, Dk is the diagonal matrix obtained by computing the k-th power of the diagonal entries of D. Example 1. Given the 2 × 2 matrix −4 6 A = ; −3 5 find A5. Answer. Finding eigenvalues of A. Suppose λ is an eigenvalue of A. Then (A − λI)x = 0 has a nontrivial solution. Thus the matrix A − λI is singular and det(A − λI) = 0. We have −4 − λ 6 2 det(A − λI) = = (−4 − λ)(5 − λ) + 18 = λ − λ − 2 = (λ + 1)(λ − 2): −3 5 − λ Therefore the eigenvalues of A are −1; 2. 1 Finding eigenvector corresponding to −1. We solve (A + I) X = 0. −3 6 1 −2 A + I = −! : −3 6 0 0 x2 is a free variable and x1 is a basic variable with x1 = 2x2, so an eigenvector of A corresponding to −2 is 2 : 1 Finding eigenvector corresponding to 2. We solve (A − 2I) X = 0. −6 6 1 −1 A − 2I = −! : −3 3 0 0 x2 is a free variable and x1 is a basic variable with x1 = x2, so an eigenvector of A corresponding to −1 is 1 : 1 Diagonalize. We let D be the diagonal matrix whose diagonal entries are the eigenvalues −1; 1. We let P be the matrix whose columns are the corresponding eigenvectors: −1 0 2 1 D = P = 0 2 1 1 Thus 2 1 −1 0 2 1 −1 A = P DP −1 = (1) 1 1 0 2 1 1 Compute A5. 2 1 −1 0 5 2 1 −1 A5 = PD5P −1 = 1 1 0 2 1 1 2 1 −1 0 5 1 −1 = 1 1 0 2 −1 2 2 1 −1 0 1 −1 = 1 1 0 32 −1 2 2 1 −1 1 = 1 1 −32 64 −34 66 = : −33 65 More generally, this gives us a way to compute functions of matrices. Example 2. Let A be as in Example 1. Is B = A2 + 2A + 5I diagonalizable? 2 Answer. We have already shown that A is diagonalizable, so let A = P DP −1. Then, using A2 = PD2P −1 and I = PIP −1, B = A2 + 2A + 5I = (P DP −1)2 + 2P DP −1 + 5I = PD2P −1 + 2P DP −1 + 5PIP −1 = P (D2 + 2D + 5I) P −1: Recalling (1), (−1)2 + 2(−1) + 5 0 4 0 B = P P −1 = P P −1: 0 22 + 2(2) + 5 0 13 Therefore, B is diagonalizable. Notice that here we had a polynomial function f(x) = x2 + 2x + 5. We showed that if A is a diagonalizable n × n matrix written as 2 3 λ1 0 ··· 0 6 0 λ ··· 0 7 −1 6 2 7 −1 A = P DP = P 6 . .. 7 P 4 . 5 0 0 ··· λn where P is an invertible n × n matrix, then f(A) = A2 + 2A + 5I (with 5I in place of 5) is 2 3 f(λ1) 0 ··· 0 6 0 f(λ ) ··· 0 7 −1 6 2 7 −1 f(A) = P f(D) P = P 6 . .. 7 P : 4 . 5 0 0 ··· f(λn) This holds true for any polynomial function f(x). In fact, this holds true for any real analytic function f(x), i.e. any function which converges to its Taylor series. Example 3. For instance, consider the exponential function exp(x) = ex. This function has the Taylor series 1 X xk exp(x) = : k! k=0 We can define exp(A) for an n × n matrix by 1 X Ak exp(A) = ; k! k=0 where the infinite sum means that we compute the infinite sum for each entry. Of course, defining exp(A) by an infinite series is not particularly enlightening. Instead, suppose that A is a diagonalizable matrix with A = P DP −1 for an n × n diagonal matrix D and n × n invertible matrix 3 P . Then using Ak = PDkP −1: 2 k 3 λ1 0 ··· 0 1 k 1 k −1 1 k 1 0 λk ··· 0 X A X PD P X D −1 X 1 6 2 7 −1 exp(A) = = = P · · P = P · 6 . 7 P k! k! k! k! 6 . .. 7 k=0 k=0 k=0 k=0 4 . 5 k 0 0 ··· λn 2 1 k 3 X λ 1 0 ··· 0 6 k! 7 6 k=0 7 2 λ 3 6 1 k 7 e 1 0 ··· 0 6 X λ2 7 1 k 0 ··· 0 0 eλ2 ··· 0 X D 6 7 −1 6 7 −1 = P · = 6 k! 7 P = P 6 . 7 P k! 6 k=0 7 6 . .. 7 k=0 6 . .. 7 4 5 6 . 7 λn 6 1 7 0 0 ··· e 6 X λk 7 4 0 0 ··· n 5 k! k=0 For instance, when A is as in Example 1, 2 1 e−1 0 2 1 −1 exp(A) = 1 1 0 e2 1 1 2 1 e−1 0 1 −1 = 1 1 0 e2 −1 2 2 1 e−1 −e−1 = 1 1 −e2 2e2 2e−1 − e2 −2e−1 + 2e2 = : e−1 − e2 −e−1 + 2e2 This is important when studying differential equations. Recall that for each real number a, the solution to y0 = ay is y = ceat, where c 2 R is a constant. For each n × n matrix A, we can consider the differential system Y 0 = AY , where Y (t) is a function of t taking values in Rn. The solution to Y 0 = AY is Y = exp(tA) · C, where C 2 Rn is a constant. Example 4. Metropolis is served by two local newspapers, the Daily Planet and Metropolis Star. The Daily Planet seems to be in trouble. Currently has only a 34% market share. Every year, 10% of its readership switches to the Star, whereas only 6% of the Star's readership switches to the Planet. Assume that no one subscribes to both papers and that the total newspaper readership remains constant. What is the long-term outlook for the Planet? Answer. Next year, the figures for the Planet and Star will be, respectively 0:9 · 0:34 + 0:06 · 0:66 = 0:3456 0:1 · 0:34 + 0:94 · 0:66 = 0:6544 This can be expressed as the matrix product of the form 0:9 0:06 0:34 0:3456 = : 0:1 0:94 0:66 0:6544 4 In other words, X1 = PX0 where 0:9 0:06 0:34 0:3456 P = ;X = ;X = : 0:1 0:94 0 0:66 1 0:6544 We refer to the vectors X0 and X1 representing the readership for each year as the state vectors. For each positive integer k, we will let the state vector Xk represent the readership in the k-th year. Notice that the sum of the entries of each state vector Xk (for k = 0; 1) is 1. We call a column vector with non-negative entries and the sum of its entries equal to 1 a probability vector. We refer to the matrix P as the transition matrix, as it transitions the state vector Xk for the k-th year to the state vector Xk+1 = PXk for the next year via multiplication. The columns of P represent the probability that the readership will stay with the magazine or go to its rival. Thus the state vectors satisfy the inductive relationship Xk+1 = PXk (2) for each k. Notice that since the readership for each magazine stays with them or goes to their rival in the next year, the sum of the columns of P must equal 1. We call a matrix P with non- negative entries and the sum of its entries in each column equal to 1 a probability matrix. Since the transition matrix P is independent of the readership, we say that this is Markov process. If we compute the readership for the next few years, we obtain 0:9 0:06 0:3456 0:350304 X = PX = = ; 2 1 0:1 0:94 0:6544 0:649696 0:9 0:06 0:350304 0:35425536 X = PX = = ; 3 3 0:1 0:94 0:649696 0:64574464 0:9 0:06 0:35425536 0:3575745024 X = PX = = : 4 4 0:1 0:94 0:64574464 0:6424254976 The Planet is not in trouble. The readership of the Planet is in fact going up each year, whereas the readership of the Star is going down. This is because even though the Planet is currently less popular, there are not enough disgruntled Planet readers to keep the Star growing. To compute the readership for the k-th year, we multiplied P by the state vectors k times.

Diagonalization and Powers of Matrices

Diagonalizing a Matrix

3.3 Diagonalization

R'kj.Oti-1). (3) the Object of the Present Article Is to Make This Estimate Effective

Diagonalizable Matrix - Wikipedia, the Free Encyclopedia

EIGENVALUES and EIGENVECTORS 1. Diagonalizable Linear Transformations and Matrices Recall, a Matrix, D, Is Diagonal If It Is

§9.2 Orthogonal Matrices and Similarity Transformations

Contents 5 Eigenvalues and Diagonalization

Tropical Totally Positive Matrices 3

Matrix Multiplication. Diagonal Matrices. Inverse Matrix. Matrices

Matrices That Commute with a Permutation Matrix

An Inequality for Doubly Stochastic Matrices*

Semisimple and Unipotent Elements