COMPUTATION OF THE JORDAN NORMAL FORM OF A MATRIX
USING VERSAL DEFORMATIONS
ALEXEI A. MAILYBAEV
Abstract. Numerical problem of nding multiple eigenvalues of real and complex nonsymmet-
ric matrices is considered. Using versal deformation theory, explicit formulae for approximations
of matrices having nonderogatory multiple eigenvalues in the vicinity of a given matrix A are de-
rived. These formulae provide lo cal information on the geometry of a set of matrices having a given
nonderogatory Jordan structure, distance to the nearest matrix with a multiple eigenvalue of given
multiplicity,values of multiple eigenvalues, and corresp onding Jordan chains (generalized eigenvec-
tors). The formulae use only eigenvalues and eigenvectors of the initial matrix A,whichtypically has
only simple eigenvalues. Both the case of matrix families (matrices smo othly dep endentonseveral
parameters) and the case of real or complex matrix spaces are studied. Several examples showing
simplicity and eciency of the suggested metho d are given.
Key words. Jordan normal form, multiple eigenvalue, generalized eigenvector, matrix family,
versal deformation, bifurcation diagram
AMS sub ject classi cations. 65F15, 15A18, 15A21
1. Intro duction. Computation of the Jordan normal form (JNF) of a square
nonsymmetric matrix A is needed in many branches of mathematics, mechanics, and
physics. Due to its great imp ortance this problem was intensively studied during
the last century. It is well known that a generic (typical) matrix has only simple
eigenvalues, which means that its JNF is a diagonal matrix [1,2]. The problem
of nding the JNF of a generic matrix has b een well studied b oth from theoretical
and practical (numerical) sides; see [24]. Nevertheless, manyinteresting and imp or-
tant phenomena, asso ciated with qualitativechanges in the dynamics of mechanical
systems [17,18, 22], stability optimization [3, 14, 20], and eigenvalue b ehavior under
matrix p erturbations [4, 21, 23], are connected with degenerate matrices, i.e., matrices
having multiple eigenvalues.
In the presence of multiple eigenvalues the numerical problem of calculation of the
JNF b ecomes very complicated due to its instability, since arbitrarily small p ertur-
bations (caused, for example, by round-o errors) destroy the nongeneric structure.
This means that we should consider a matrix given with some accuracy,thus, substi-
tuting the matrix A by its neighb or in the matrix space. Suchformulation leads to
several imp ortant problems. The rst problem left op en by Wilkinson [25, 26] is to
nd a distance from a given matrix A to the nearest nondiagonalizable (degenerate)
b
matrix A. This problem can b e reformulated in the extended form:
to nd a distance (with resp ect to a given norm) from a matrix A to the
b
nearest matrix A having a given Jordan structure;
b
to nd a matrix A with the least generic Jordan structure in a given neigh-
b orho o d of the matrix A (the least generic Jordan structure means that a set
of matrices with this Jordan structure has the highest co dimension).
Here we assume that two matrices have the same Jordan structure, if their JNFs di er
only byvalues of di erent eigenvalues. Co dimensions of sets of matrices having the
same Jordan structure were studied by Arnold [1, 2]. According to Arnold, matrices
b
A with the least generic Jordan structure have the most p owerful in uence on the
Institute of Mechanics, Moscow State Lomonosov University,Michurinsky pr. 1, 117192 Moscow,
Russia ([email protected]). 1
2 ALEXEI A. MAILYBAEV
eigenvalue b ehavior in the neighb orho o d of the matrix A. Therefore, the ab ove stated
problems represent the key p oint in the numerical analysis of multiple eigenvalues.
Several authors addressed the problem of nding b ounds on the distance to the
nearest degenerate matrix; see [5,6,9,15,25,26] and more references therein. A nu-
merical metho d for nding the JNF of a degenerate matrix given with some accuracy
was prop osed in [12,13]. This metho d is based on making a numb er of successive
transformations of the matrix and deleting small elements whenever appropriate. Nev-
ertheless, this pro cedure do esn't provide the least generic JNF, and it is restricted to
the use of the Frob enius matrix norm (for example, it can not b e used in the case,
when di erententries of a matrix are given with di erent accuracy). Another metho d
based on singular value decomp ositions was given in [19]. This metho d is simpler,
but it has the same prop erties. More references of JNF computation metho ds can b e
found in [8].
It is understo o d now that the problem of calculation of the nondiagonal JNF
needs the singularity theory approach based on the use of versal deformations (lo cally
generic families of matrices). Versal deformations were intro duced by Arnold [1, 2] in
order to stabilize the JNF transformation, but no numerical metho ds were suggested.
Fairgrieve[9] succeeded in using the idea of versality for construction of a stable
numerical pro cedure, which calculates matrices of a given Jordan structure. Though
this metho d do es not provide the nearest matrix of a given Jordan structure, it shows
the usefulness of the singularity theory for the numerical eigenvalue problem. In [7]
versal deformations were used for improvement of an existing numerical program in a
similar problem of nding the Kronecker normal form of a matrix p encil. Application
of the singularity theory to computation of a distance from a matrix A to the nearest
matrix with a double eigenvalue [15] showed that this numerical problem is very
complex even in the case of a double eigenvalue.
This pap er is intended to prove that the versal deformation theory can b e an
indep endentpowerful to ol for analysis of multiple eigenvalues. It is shown that us-
ing versal deformations wecanavoid a numb er of transformations and singular value
decomp ositions (used in most algorithms). Instead of this, explicit approximate for-
mulae describing the geometry of a set of matrices with given Jordan structure in
the vicinityofa given matrix A are derived. These formulae use only information
on eigenvalues and eigenvectors of the matrix A. Since the matrix A is typically
nondegenerate, i.e., has only simple eigenvalues, its eigenvalues and eigenvectors can
b e obtained by means of standart co des. As a result, we obtain approximations of a
distance from A to the nearest matrix having a given Jordan structure, its multiple
eigenvalues, and the Jordan chains (generalized eigenvectors). The metho d do es not
dep end on the norm used in the matrix space. Moreover, it allows analyzing multiple
eigenvalues of matrices dep endent on several parameters (matrix families), where we
can not vary matrix elements indep endently.Approximations derived in the pap er
2
have the accuracy O (" ), where " is the distance from A to a matrix we are lo ok-
ing for. All these prop erties make the suggested approach useful and attractive for
applications.
In this pap er the case of a nonderogatory JNF (every eigenvalue has one corre-
sp onding Jordan blo ck) is considered. Derogatory Jordan structures require further
investigation and new ideas. Diculties app earing in the derogatory case are asso ci-
ated with the lack of uniqueness of the transformation to a versal deformation and
will b e discussed in the conclusion.
The pap er is organized as follows. In section 2 some concepts of the singularity
COMPUTATION OF THE JORDAN NORMAL FORM 3
theory are intro duced and the general idea of the pap er is describ ed. Sections 3 and
4 give approximate formulae for matrices having a prescrib ed nonderogatory Jordan
structure in the neighb orho o d of a given generic (nondegenerate) complex or real
matrix A. In section 5 the same problem is studied in the case, when the initial matrix
A is degenerate. Conclusion discusses extension of the metho d to the derogatory case
and p ossibilities for its application to sp eci c typ es of matrices or matrix p encils.
In the pap er matrices are denoted by b old capital letters, vectors take the form
of b old small letters, and scalars are represented by small italic characters.
2. Bifurcation diagram. Let us consider an m m complex (real) nonsym-
metric matrix A holomorphically (smo othly) dep endentonavector of complex (real)
parameters p =(p ;:::;p ). The space of all m m matrices can b e considered as a
1 n
sp ecial case, where all entries of A are indep endent parameters.
Matrices A and B are said to have the same Jordan structure if there exists one-
to-one corresp ondence b etween di erent eigenvalues of A and B such that sizes of
Jordan blo cks in the JNFs of A and B are equal for corresp onding eigenvalues (JNFs
of the matrices A and B di er only byvalues of their eigenvalues). For example,
among the following matrices
1 1 0 1 0 0
2 1 0 3 0 0 5 1 0
A A @ A @ @
(2.1) ; C = 0 2 0 ; B = 0 2 1 A = 0 5 0
0 0 2 0 0 2 0 0 2
the matrices A and B have the same Jordan structure, while the matrix C has
a di erent Jordan structure. In the generic case a set of values of the vector p
corresp onding to the matrices A(p)having the same Jordan structure represents a
smo oth submanifold of the parameter space [1, 2]. This submanifold is called a stratum
(also called a colored orbit ofamatrix).Abifurcation diagram of the matrix family
A(p) is a partition (strati cation) of the parameter space according to the Jordan
structure of the matrix A(p)[1, 2]. Generic (typical) matrices A form a stratum of
zero co dimension consisting of nondegenerate matrices having only simple eigenvalues.
The bifurcation diagram has singularities at b oundary p oints of the strata (b oundary
of a stratum consists of strata having higher co dimensions). Due to singularities the
bifurcation diagram has rather complicated structure.
In this pap er we consider matrices having a nonderogatory multiple eigenvalue
(there is one eigenvector and, hence, one Jordan blo ck corresp onding to ). The strata
d
corresp onding to such matrices are denoted by , where d is the algebraic multiplicity
of the multiple eigenvalue . If the matrix A has several multiple nonderogatory
eigenvalues ;:::; with algebraic multiplicities d ;:::;d , then the corresp onding
1 s 1 s
d
d
1
s
stratum is denoted by . In the case of real matrix families we will di er
s
1
strata determined by real eigenvalues and complex conjugate pairs of eigenvalues by
d d
using notations and ( i! ) resp ectively.
Geometry of the strati cation can b e illustrated by the following example
2p p
1 2
(2.2) A(p)= ; p =(p ;p ;p ):
1 2 3
p 0
3
2 2
The bifurcation diagram consists of the following strata: (the cone surface p +
1
p p = 0 except the origin), (the origin), and the other part of the parameter space
2 3
corresp onding to matrices A(p) with distinct eigenvalues (interior and exterior of the
cone); see Figure 2.1. Here denotes the stratum corresp onding to a derogatory
4 ALEXEI A. MAILYBAEV
Fig. 2.1. Geometry of the bifurcation diagram.
2
double eigenvalue (with two corresp onding eigenvectors). The strata and are
asso ciated with a double eigenvalue, but di er byanumb er of the Jordan blo cks
2
(equal to the numb er of eigenvectors). Clearly, the strata and are smo oth
submanifolds of co dimension 1 and 3 resp ectively.
A matrix with multiple eigenvalues can b e made nondegenerate by an arbitrarily
small p erturbation. That is whyinnumerical analysis we usually deal with nonde-
generate matrices. A double nonderogatory eigenvalue represents the most generic
2 2
typ e of multiple eigenvalues. The corresp onding stratum ( in the real case) is a
smo oth hyp ersurface in the parameter space. Nonderogatory eigenvalues are the most
generic (typical) b etween all typ es of eigenvalues of given multiplicity. Nonderoga-
d
tory eigenvalues of multiplicity d determine the stratum of co dimension d 1 (the
d
d
1
s
has the co dimension d + + d s). In the real case the strata stratum
1 s
s
1
d d
and ( i! ) have the co dimension d 1and2(d 1) resp ectively [1,2,10].
Let us describ e an idea of the metho d used in this pap er. The versal deformation
theory allows studying di erent strata indep endently.From one side, this simpli es
the analysis, since a stratum is a smo oth manifold without singularities. From the
other side, we get additional information ab out the Jordan structure of a matrix.
d
b b
Let p be a point on the stratum ; see Figure 2.2. The neighborhood of p can b e
d
explored by means of a versal deformation. As a result, the stratum is describ ed
lo cally by the equation
0 0
(2.3) A (q (p)) = J ;
d
0 0
where A (q ) is a simple explicit family of d d matrices (a blo ckoftheversal
0
deformation); q (p)isa vector smo othly dep endentonp; J represents the Jordan
d
blo ck of dimension d. Let a p oint p of the parameter space b e given. Assuming that
0
b
the p oint p is close to p,we can write equation (2.3) in the form
0
n
0
X
@ A
0 0 0 2 0
A (q (p )) + A + O (kp p k )=J ; A = (2.4) (p p );
0 0 d i 0i
@p
i
i=1
1=2
where kpk =(p p + + p p ) is the norm in the parameter space; derivatives
1 n
1 n
0 0 0
are calculated at p . The matrices A (q (p )) and @ A =@ p can b e found from the
0 0 i
analysis of the versal deformation at p . The latter represents the most nontrivial
0
2
)in(2.4),we obtain the equation determining part of the study. Omitting O (kp p k
0
d
a plane ,which is the rst order approximation of the stratum ; see Figure 2.2. A
d
distance from p to can b e found approximately as a distance to .Further analysis 0
COMPUTATION OF THE JORDAN NORMAL FORM 5
d
Fig. 2.2. Approximation of the stratum from the point p .
0
of the versal deformation allows nding approximations of multiple eigenvalues and
the Jordan chains (generalized eigenvectors) on the stratum. Note that all these
approximations use only information at the p oint p ,typically corresp onding to a
0
nondegenerate matrix A(p ).
0
The describ ed approach is similar to Newton's metho d for nding a ro ot of a
smo oth function, where the approximation of the ro ot is determined using the linear
d
approximation of the function at the initial p oint. Hence, if " =dist(p ; )isthe
0
d d
distance from p to , then the prop osed metho d gives the nearest p ointof with
0
2 d
the accuracy O (" ). Making k iterations, one nds a p oint of the stratum with the
k
2
accuracy O (" ).
3. Nonderogatory eigenvalue of a complex matrix. In this section we
study complex matrices A(p) holomorphically dep endentonavector of complex
b
parameters p. Let us consider a p oint p in the parameter space, where the matrix
b b
b
A = A(p) has a nonderogatory eigenvalue with algebraic multiplicity d. The corre-
b b
sp onding Jordan chain u ;:::;u (the eigenvector and asso ciated vectors, also called
1 d
generalized eigenvectors) is determined by the equations
b b
b b
A u = u ;
1 1
b
b
b b b
A u = u + u ;
2 2 1
(3.1)
.
.
.
b
b
b b b
Au = u + u :
d d d 1
b
b b
These vectors form an m d matrix U =[u ;:::;u ] satisfying equation
1 d
1 0
b
1
C B
.
.
b
C B
.
C b b b b b B
(3.2) ; AU UJ =0; J =
C B
.
.
A @
.
1
b
b
b b
where J is the Jordan blo ck of dimension d. The left Jordan chain v ;:::;v corre-
1 d
b
sp onding to the eigenvalue (the eigenvector and asso ciated vectors of the transp osed
6 ALEXEI A. MAILYBAEV
T
b
matrix A ) is determined by the equations
T T
b b
b b
v A = v ;
1 1
T T T
b b
b b b
v A = v + v ;
2 2 1
(3.3)
.
.
.
T T T
b
b
b b b
v A = v + v ;
d d d 1
T T
b b b b
(3.4) v u =1; v u =0;i=2;:::;d:
d d
1 i
Equalities (3.4) represent normalization conditions uniquely determining the vectors
b b b b
v ;:::;v for given vectors u ;:::;u . System (3.3), (3.4) can b e written in the
1 d 1 d
equivalent matrix form
T T T
b b b b b b b
b b
(3.5) U = I; V =[v ;:::;v ]; A JV =0; V V
d 1
b
where I is the identity matrix. Note that the matrix V consists of the vectors
b b
v ;:::;v listed in the reversed order.
1 d
b
Let us consider a p oint p in the vicinityof p. In this section we consider the
0
generic case, when the matrix A = A(p ) has simple eigenvalues ;:::; with
0 0 1 d
b
b
the corresp onding eigenvectors u ;:::;u suchthat ;:::; ! as p ! p.The
1 d 1 d 0
left eigenvectors corresp onding to ;:::; are denoted by v ;:::;v . The m d
1 d 1 d
matrices U =[u ;:::;u ]andV =[v ;:::;v ] satisfy the equations
1 d 1 d
T T T
(3.6) A U UJ =0; V A J V =0; V U = I;
0 0 0 0
where the diagonal d d matrix J =diag( ;:::; ). The last equality of (3.6) is the
0 1 d
normalization condition, which uniquely de nes V for given U. Since the eigenvalue
b
b b
u as is nonderogatory (has one eigenvector u ), every eigenvector u tends to c
1 1 i i
b
p ! p for some constant c [23].
0 i
The goal of this pap er is to nd the rst order approximations (up to the terms
2 d
O (kp p k )) of the stratum in the parameter space, the nonderogatory multiple
0
b
eigenvalue on the stratum, and the corresp onding Jordan chain U.We are going to
nd these approximations using only information at the p oint p , where the matrix
0
A has simple eigenvalues. Toformulate the main result, let us intro duce the following
0
scalars and vectors of dimension n
!