
A useful basis for defective matrices: Jordan vectors and the Jordan form

S. G. Johnson, MIT 18.06
Created Spring 2009; updated May 8, 2017

Abstract

Many textbooks and lecture notes can be found online deriving the existence of something called a "Jordan form" of a matrix, based on "generalized eigenvectors" (or "Jordan vectors" and "Jordan chains"). In these notes, instead, I omit most of the formal derivations and focus on the consequences of the Jordan vectors for how we understand matrices. What happens to our traditional eigenvector-based pictures of things like A^n or e^{At} when diagonalization of A fails? The answer, for any matrix function f(A), turns out to involve the derivative of f!

1 Introduction

So far in the eigenproblem portion of 18.06, our strategy has been simple: find the eigenvalues λ_i and the corresponding eigenvectors x_i of a matrix A, expand any vector of interest u in the basis of these eigenvectors (u = c_1 x_1 + ... + c_n x_n), and then any operation with A can be turned into the corresponding operation with λ_i acting on each eigenvector. So, A^k becomes λ_i^k, e^{At} becomes e^{λ_i t}, and so on. But this relied on one key assumption: we require the n × n matrix A to have a basis of n independent eigenvectors. We call such a matrix A diagonalizable.

Many important cases are always diagonalizable: matrices with n distinct eigenvalues λ_i, real symmetric or orthogonal matrices, and complex Hermitian or unitary matrices. But there are rare cases where A does not have a complete basis of n eigenvectors: such matrices are called defective. For example, consider the matrix

$$A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}.$$

This has characteristic polynomial λ^2 − 2λ + 1, with a repeated root (a single eigenvalue) λ_1 = 1. (Equivalently, since A is upper triangular, we can read the determinant of A − λI, and hence the eigenvalues, off the diagonal.) However, it has only a single independent eigenvector, because

$$A - I = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$$

is obviously rank 1, and has a one-dimensional nullspace spanned by x_1 = (1, 0).

Defective matrices arise rarely in practice, and usually only when one arranges for them intentionally, so we have not worried about them up to now. But it is important to have some idea of what happens when you have a defective matrix: partially for computational purposes, but also to understand conceptually what is possible. For example, what will be the result of

$$A^k \begin{pmatrix} 1 \\ 2 \end{pmatrix} \quad \text{or} \quad e^{At} \begin{pmatrix} 1 \\ 2 \end{pmatrix}$$

for the defective matrix A above, since (1, 2) is not in the span of the (single) eigenvector of A? For diagonalizable matrices, this would grow as λ^k or e^{λt}, respectively, but what about defective matrices? Although matrices in real applications are rarely exactly defective, it sometimes happens (often by design!) that they are nearly defective, and we can think of exactly defective matrices as a limiting case. (The book Spectra and Pseudospectra by Trefethen & Embree is a much more detailed dive into the fascinating world of nearly defective matrices.)
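Before deriving anything, it is worth seeing the answer numerically. The following quick check (my addition, not part of the original notes; it assumes NumPy and SciPy are available) simply applies A^k and e^{At} to u_0 = (1, 2) and watches how fast the result grows:

```python
import numpy as np
from scipy.linalg import expm

A = np.array([[1.0, 1.0],
              [0.0, 1.0]])   # the defective matrix above
u0 = np.array([1.0, 2.0])

# A^k u0 grows even though the only eigenvalue is 1, but it grows
# linearly in k, not exponentially:
for k in [1, 10, 100]:
    print(k, np.linalg.matrix_power(A, k) @ u0)   # gives (1 + 2k, 2)

# e^{At} u0 grows like an exponential times an extra factor of t:
for t in [1.0, 2.0, 4.0]:
    print(t, expm(A * t) @ u0)                    # gives e^t (1 + 2t, 2)
```

The printed values match the closed forms (1 + 2k, 2) and e^t (1 + 2t, 2) derived in section 3.2 below.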

The textbook (Intro. to Linear Algebra, 5th ed. by Strang) covers the defective case only briefly, in section 8.3, with something called the Jordan form of the matrix, a generalization of diagonalization, but in these notes we will focus more on the "Jordan vectors" than on the Jordan factorization. For a diagonalizable matrix, the fundamental vectors are the eigenvectors, which are useful in their own right and give the diagonalization of the matrix as a side effect. For a defective matrix, to get a complete basis we need to supplement the eigenvectors with something called Jordan vectors or generalized eigenvectors. Jordan vectors are useful in their own right, just like eigenvectors, and also give the Jordan form. Here, we'll focus mainly on the consequences of the Jordan vectors for how matrix problems behave.

2 Defining Jordan vectors

In the example above, we had a 2 × 2 matrix A but only a single eigenvector x_1 = (1, 0). We need another vector to get a basis for R^2. Of course, we could pick another vector at random, as long as it is independent of x_1, but we'd like it to have something to do with A, in order to help us with computations just like eigenvectors. The key thing is to look at A − I above, and to notice that (A − I)^2 = 0. (A matrix is called nilpotent if some power of it is the zero matrix.) So, the nullspace of (A − I)^2 must give us an "extra" basis vector beyond the eigenvector. But this extra vector must still be related to the eigenvector! If y ∈ N[(A − I)^2], then (A − I)y must be in N(A − I), which means that (A − I)y is a multiple of x_1! We just need to find a new "Jordan vector" or "generalized eigenvector" j_1 satisfying

$$(A - I)j_1 = x_1, \qquad j_1 \perp x_1.$$

Notice that, since x_1 ∈ N(A − I), we can add any multiple of x_1 to j_1 and still have a solution, so we can use Gram-Schmidt to get a unique solution j_1 perpendicular to x_1. This particular 2 × 2 equation is easy enough for us to solve by inspection, obtaining j_1 = (0, 1). Now we have a nice orthonormal basis for R^2, and our basis has some simple relationship to A!

Before we talk about how to use these Jordan vectors, let's give a more general definition. Suppose that λ_i is an eigenvalue of A corresponding to a repeated root of det(A − λ_i I), but with only a single (ordinary) eigenvector x_i, satisfying, as usual:

$$(A - \lambda_i I)x_i = 0.$$

If λ_i is a double root, we will need a second vector to complete our basis. Remarkably,¹ it turns out to always be the case for a double root λ_i that N([A − λ_i I]^2) is two-dimensional, just as for the 2 × 2 example above. Hence, we can always find a unique second solution j_i satisfying:

$$(A - \lambda_i I)j_i = x_i, \qquad j_i \perp x_i.$$

Again, we can choose j_i to be perpendicular to x_i via Gram-Schmidt; this is not strictly necessary, but gives a convenient orthogonal basis. (That is, the complete solution is always of the form x_p + c x_i, a particular solution x_p plus any multiple of the nullspace basis x_i. If we choose c = −x_i^T x_p / x_i^T x_i we get the unique orthogonal solution j_i.) We call j_i a Jordan vector or generalized eigenvector of A. The relationship between j_1 and x_1 is also called a Jordan chain.

¹ This fact is proved in any number of advanced textbooks on linear algebra; I like Linear Algebra by P. D. Lax.

2.1 More than double roots

A more general notation is to use x_i^(1) instead of x_i and x_i^(2) instead of j_i. If λ_i is a triple root, we would find a third vector x_i^(3) perpendicular to x_i^(1) and x_i^(2) by requiring (A − λ_i I)x_i^(3) = x_i^(2), and so on. In general, if λ_i is an m-times repeated root, then N([A − λ_i I]^m) is m-dimensional and we will always be able to find an orthogonal sequence (a Jordan chain) of Jordan vectors x_i^(j) for j = 2...m satisfying (A − λ_i I)x_i^(j) = x_i^(j−1) and (A − λ_i I)x_i^(1) = 0. Even more generally, you might have cases with e.g. a triple root and two ordinary eigenvectors, where you need only one generalized eigenvector, or an m-times repeated root with ℓ > 1 eigenvectors and m − ℓ Jordan vectors. However, cases with more than a double root are extremely rare in practice. Defective matrices are rare enough to begin with, so here we'll stick with the most common defective matrix, one with a double root λ_i: hence, one ordinary eigenvector x_i and one Jordan vector j_i.
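Returning to the 2 × 2 example, the Jordan vector can also be computed numerically. Here is a minimal sketch (my addition, assuming NumPy). It relies on the fact that the minimum-norm solution of (A − λI)j = x, which np.linalg.lstsq returns, lies in the row space of A − λI, and is therefore automatically the unique solution perpendicular to the nullspace vector x_1, with no explicit Gram-Schmidt step needed:

```python
import numpy as np

A = np.array([[1.0, 1.0],
              [0.0, 1.0]])
lam = 1.0                       # the double eigenvalue
x1 = np.array([1.0, 0.0])       # the lone eigenvector of A

# Solve (A - lam*I) j = x1. lstsq returns the minimum-norm solution,
# which lies in the row space of (A - lam*I), hence is perpendicular
# to the nullspace spanned by x1.
M = A - lam * np.eye(2)
j1, *_ = np.linalg.lstsq(M, x1, rcond=None)

print(j1)            # [0. 1.], matching the by-inspection solution
print(M @ j1 - x1)   # ~[0. 0.]: verifies (A - I) j1 = x1
print(x1 @ j1)       # ~0.0:     verifies j1 is perpendicular to x1
```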

3 Using Jordan vectors

Using an eigenvector x_i is easy: multiplying by A is just like multiplying by λ_i. But how do we use a Jordan vector j_i? The key is in the definition above. It immediately tells us that

$$Aj_i = \lambda_i j_i + x_i.$$

It will turn out that this has a simple consequence for more complicated expressions like A^k or e^{At}, but that's probably not obvious yet. Let's try multiplying by A^2:

$$A^2 j_i = A(Aj_i) = A(\lambda_i j_i + x_i) = \lambda_i(\lambda_i j_i + x_i) + \lambda_i x_i = \lambda_i^2 j_i + 2\lambda_i x_i$$

and then try A^3:

$$A^3 j_i = A(A^2 j_i) = A(\lambda_i^2 j_i + 2\lambda_i x_i) = \lambda_i^2(\lambda_i j_i + x_i) + 2\lambda_i^2 x_i = \lambda_i^3 j_i + 3\lambda_i^2 x_i.$$

From this, it's not hard to see the general pattern (which can be formally proved by induction):

$$A^k j_i = \lambda_i^k j_i + k\lambda_i^{k-1} x_i.$$

Notice that the coefficient in the second term is exactly d(λ_i^k)/dλ_i. This is the clue we need to get the general formula to apply any function f(A) of the matrix A to the eigenvector and the Jordan vector:

$$f(A)x_i = f(\lambda_i)x_i,$$
$$f(A)j_i = f(\lambda_i)j_i + f'(\lambda_i)x_i.$$

Multiplying by a function of the matrix multiplies j_i by the same function of the eigenvalue, just as for an eigenvector, but also adds a term multiplying x_i by the derivative f'(λ_i). So, for example:

$$e^{At}j_i = e^{\lambda_i t}j_i + t e^{\lambda_i t}x_i.$$

We can show this explicitly by considering what happens when we apply our formula for A^k in the Taylor series for e^{At}:

$$e^{At}j_i = \sum_{k=0}^{\infty} \frac{A^k t^k}{k!} j_i = \sum_{k=0}^{\infty} \frac{t^k}{k!}\left(\lambda_i^k j_i + k\lambda_i^{k-1} x_i\right) = \sum_{k=0}^{\infty} \frac{(\lambda_i t)^k}{k!} j_i + t\sum_{k=1}^{\infty} \frac{(\lambda_i t)^{k-1}}{(k-1)!} x_i = e^{\lambda_i t}j_i + t e^{\lambda_i t}x_i.$$

In general, that's how we show the formula for f(A) above: we Taylor expand each term, and the A^k formula means that each term in the Taylor expansion has a corresponding term multiplying j_i and a derivative term multiplying x_i.

3.1 More than double roots

In the rare case of two Jordan vectors from a triple root, you will have a Jordan vector x_i^(3) and get

$$f(A)x_i^{(3)} = f(\lambda_i)x_i^{(3)} + f'(\lambda_i)j_i + \frac{f''(\lambda_i)}{2}x_i,$$

where the f'' term will give you k(k−1)λ_i^{k−2} and t^2 e^{λ_i t} for A^k and e^{At}, respectively. A quadruple root with one eigenvector and three Jordan vectors will give you f''' terms (that is, k^3 and t^3 terms), and so on. The theory is quite pretty, but doesn't arise often in practice so I will skip it; it is straightforward to work it out on your own if you are interested.

3.2 Example

Let's try this for our example 2 × 2 matrix

$$A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$$

from above, which has an eigenvector x_1 = (1, 0) and a Jordan vector j_1 = (0, 1) for the eigenvalue λ_1 = 1. Suppose we want to compute A^k u_0 and e^{At}u_0 for u_0 = (1, 2). As usual, our first step is to write u_0 in the basis of the eigenvectors... except that now we also include the generalized eigenvectors to get a complete basis:

$$u_0 = \begin{pmatrix} 1 \\ 2 \end{pmatrix} = x_1 + 2j_1.$$

Now, computing A^k u_0 is easy, from our formula above:

$$A^k u_0 = A^k x_1 + 2A^k j_1 = \lambda_1^k x_1 + 2\lambda_1^k j_1 + 2k\lambda_1^{k-1} x_1 = 1^k \begin{pmatrix} 1 \\ 2 \end{pmatrix} + 2k \cdot 1^{k-1} \begin{pmatrix} 1 \\ 0 \end{pmatrix} = \begin{pmatrix} 1+2k \\ 2 \end{pmatrix}.$$

For example, this is the solution to the recurrence u_{k+1} = Au_k. Even though the only eigenvalue of A satisfies |λ_1| = 1, i.e. is not greater than 1, the solution still blows up, but it blows up linearly with k instead of exponentially.

Consider instead e^{At}u_0, which is the solution to the system of ODEs du(t)/dt = Au(t) with the initial condition u(0) = u_0. In this case, we get:

$$e^{At}u_0 = e^{At}x_1 + 2e^{At}j_1 = e^{\lambda_1 t}x_1 + 2e^{\lambda_1 t}j_1 + 2te^{\lambda_1 t}x_1 = e^t \begin{pmatrix} 1 \\ 2 \end{pmatrix} + 2te^t \begin{pmatrix} 1 \\ 0 \end{pmatrix} = e^t \begin{pmatrix} 1+2t \\ 2 \end{pmatrix}.$$

In this case, the solution blows up exponentially since λ_1 = 1 > 0, but we have an additional term that blows up as an exponential multiplied by t.
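All of these formulas are easy to check numerically. Here is a short sketch (my addition, assuming NumPy/SciPy) that verifies both the general rule f(A)j_1 = f(λ_1)j_1 + f'(λ_1)x_1, for f(λ) = λ^k and f(λ) = e^{λt}, and the two closed-form answers above:

```python
import numpy as np
from scipy.linalg import expm

A = np.array([[1.0, 1.0],
              [0.0, 1.0]])
lam = 1.0
x1 = np.array([1.0, 0.0])   # eigenvector
j1 = np.array([0.0, 1.0])   # Jordan vector
u0 = x1 + 2 * j1            # = (1, 2)

k, t = 7, 0.5

# f(A) j1 = f(lam) j1 + f'(lam) x1 for f(lam) = lam^k and f(lam) = e^{lam t}:
Ak = np.linalg.matrix_power(A, k)
assert np.allclose(Ak @ j1, lam**k * j1 + k * lam**(k - 1) * x1)
assert np.allclose(expm(A * t) @ j1, np.exp(lam * t) * (j1 + t * x1))

# The worked example: A^k u0 = (1 + 2k, 2) and e^{At} u0 = e^t (1 + 2t, 2):
assert np.allclose(Ak @ u0, [1 + 2 * k, 2])
assert np.allclose(expm(A * t) @ u0, np.exp(t) * np.array([1 + 2 * t, 2]))
print("all Jordan-vector formulas check out")
```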

Those of you who have taken 18.03 are probably familiar with these terms multiplied by t in the case of a repeated root. In 18.03, it is presented simply as a guess for the solution that turns out to work, but here we see that it is part of a more general pattern of Jordan vectors for defective matrices.

4 The Jordan form

For a diagonalizable matrix A, we made a matrix S out of the eigenvectors, and saw that multiplying by A was equivalent to multiplying by SΛS^{-1}, where Λ = S^{-1}AS is the diagonal matrix of eigenvalues, the diagonalization of A. Equivalently, AS = SΛ: A multiplies each column of S by the corresponding eigenvalue. Now, we will do exactly the same steps for a defective matrix A, using the basis of eigenvectors and Jordan vectors, and obtain the Jordan form J instead of Λ.

Let's consider a simple 4 × 4 case first, in which there is only one repeated root λ_2 with an eigenvector x_2 and a Jordan vector j_2, and the other two eigenvalues λ_1 and λ_3 are distinct with independent eigenvectors x_1 and x_3. Form a matrix M = (x_1, x_2, j_2, x_3) from this basis of four vectors (3 eigenvectors and 1 Jordan vector). Now, consider what happens when we multiply A by M:

$$AM = (\lambda_1 x_1, \; \lambda_2 x_2, \; \lambda_2 j_2 + x_2, \; \lambda_3 x_3) = M \begin{pmatrix} \lambda_1 & & & \\ & \lambda_2 & 1 & \\ & & \lambda_2 & \\ & & & \lambda_3 \end{pmatrix} = MJ.$$

That is, A = MJM^{-1}, where J is almost diagonal: it has λ's along the diagonal, but it also has 1's above the diagonal for the columns corresponding to generalized eigenvectors. This is exactly the Jordan form of the matrix A. J, of course, has the same eigenvalues as A since A and J are similar, but J is much simpler than A. The 2 × 2 block

$$\begin{pmatrix} \lambda_2 & 1 \\ & \lambda_2 \end{pmatrix}$$

is called a 2 × 2 Jordan block.

The generalization of this, when you perhaps have more than one repeated root, and perhaps the multiplicity of the root is greater than 2, is fairly obvious, and leads immediately to the formula given without proof in section 8.3 of the textbook. What I want to emphasize here, however, is not so much the formal theorem that a Jordan form exists, but how to use it via the Jordan vectors: in particular, that generalized eigenvectors give us linearly growing terms kλ^{k-1} and te^{λt} when we multiply by A^k and e^{At}, respectively.

Computationally, the Jordan form is famously problematic, because with any slight random perturbation to A (e.g. roundoff errors) the matrix typically becomes diagonalizable, and the 2 × 2 Jordan block for λ_2 disappears. One then has a basis X of eigenvectors, but it is nearly singular ("ill-conditioned"): for a nearly defective matrix, the eigenvectors are almost linearly dependent. This makes eigenvectors a problematic way of looking at nearly defective matrices as well, because they are so sensitive to errors. Finding an approximate Jordan form of a nearly defective matrix is the famous Wilkinson problem in numerical linear algebra, and has a number of tricky solutions. Alternatively, there are approaches like Schur factorization or the SVD that lead to nice orthonormal bases for any matrix, but aren't nearly as simple to use as eigenvectors.
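Both halves of this story can be seen numerically. The sketch below (my addition; the eigenvalues 2, 3, 3, 5 and the random basis M are arbitrary choices for illustration, assuming NumPy) builds a defective 4 × 4 matrix A = MJM^{-1} from the Jordan form above, then perturbs A slightly and shows that the resulting eigenvector basis is nearly singular:

```python
import numpy as np

rng = np.random.default_rng(0)

# Jordan form with eigenvalues 2, 3, 3 (one 2x2 Jordan block), and 5:
lam1, lam2, lam3 = 2.0, 3.0, 5.0
J = np.array([[lam1, 0,    0,    0],
              [0,    lam2, 1,    0],
              [0,    0,    lam2, 0],
              [0,    0,    0,    lam3]])

# Any invertible M = (x1, x2, j2, x3) gives a defective A = M J M^{-1}:
M = rng.standard_normal((4, 4))
A = M @ J @ np.linalg.inv(M)

# Check AM = MJ: in particular A x2 = lam2 x2 and A j2 = lam2 j2 + x2.
print(np.allclose(A @ M, M @ J))   # True (up to roundoff)

# Perturb A slightly: it typically becomes diagonalizable, but the
# eigenvector matrix X is nearly singular (ill-conditioned), because
# the two eigenvectors from the split double root are nearly parallel:
eps = 1e-10
evals, X = np.linalg.eig(A + eps * rng.standard_normal((4, 4)))
print(evals)                       # the double root 3 splits slightly
print(np.linalg.cond(X))           # very large condition number
```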
