MORE ON THE EIGENVECTORS-FROM-EIGENVALUES IDENTITY

MAŁGORZATA STAWISKA

Dedication: To the memory of Witold Kleiner (1929-2018), my linear algebra teacher.

Abstract. Using the notion of a higher adjugate of a matrix, we generalize the eigenvector-eigenvalue formula surveyed in [4] to arbitrary square matrices over $\mathbb{C}$ and their possibly multiple eigenvalues.

Date: December 13, 2019.
2010 Mathematics Subject Classification. 15A15, 15A24, 15A75.
Keywords: Matrices, characteristic polynomial, eigenvalues, eigenvectors, adjugates.
arXiv:1912.06967v1 [math.RA] 15 Dec 2019

1. Introduction

Very recently the following relation between eigenvectors and eigenvalues of a Hermitian matrix (called "the eigenvector-eigenvalue identity") gained attention:

Theorem 1.1. ([4]) If $A$ is an $n \times n$ Hermitian matrix with eigenvalues $\lambda_1(A), \dots, \lambda_n(A)$ and $i, j = 1, \dots, n$, then the $j$th component $v_{i,j}$ of a unit eigenvector $v_i$ associated to the eigenvalue $\lambda_i(A)$ is related to the eigenvalues $\lambda_1(M_j), \dots, \lambda_{n-1}(M_j)$ of the minor $M_j$ of $A$ formed by removing the $j$th row and column by the formula
$$|v_{i,j}|^2 \prod_{k=1;\, k \neq i}^{n} \bigl(\lambda_i(A) - \lambda_k(A)\bigr) = \prod_{k=1}^{n-1} \bigl(\lambda_i(A) - \lambda_k(M_j)\bigr).$$

The survey paper ([4]) presents several proofs and some generalizations of this identity, along with its many variants encountered in the modern literature, while also commenting on some sociology-of-science aspects of its jump in popularity. It should be pointed out that for symmetric matrices over $\mathbb{R}$ this formula was established already in 1834 by Carl Gustav Jacob Jacobi as formula 30 in his paper [9] (in the process of proving that every real quadratic form has an orthogonal diagonalization). In this note, dealing with one eigenvalue at a time, we prove an analogous formula for an arbitrary (not necessarily Hermitian or normal) square matrix over $\mathbb{C}$. Our approach works also for the case of an eigenvalue of multiplicity higher than one. The result is as follows:

Theorem 1.2. Let $A$ be an $n \times n$ matrix over $\mathbb{C}$, $n \geq 1$, and let $\lambda$ be an eigenvalue of $A$. Assume that the geometric multiplicity $k$ of $\lambda$ equals its algebraic multiplicity, $1 \leq k \leq n - 1$. Let $\mathbf{v}_1, \dots, \mathbf{v}_k$ be a basis of eigenvectors for $\ker (A - \lambda I)$ and let $\mathbf{w}_1, \dots, \mathbf{w}_k \in \ker (A - \lambda I)^{*}$ be such that $\mathbf{w}_i(\mathbf{v}_j) = \delta_{ij}$, $i, j = 1, \dots, k$. Then
$$\frac{(-1)^k P^{(k)}(\lambda)}{k!}\, \mathbf{v}\mathbf{w}^{T} = \operatorname{adj}_k (A - \lambda I),$$
where $\mathbf{v} = \mathbf{v}_1 \wedge \dots \wedge \mathbf{v}_k$, $\mathbf{w} = \mathbf{w}_1 \wedge \dots \wedge \mathbf{w}_k$, $P$ is the characteristic polynomial of $A$ and $\operatorname{adj}_k(A - \lambda I)$ is the $k$th adjugate matrix of $(A - \lambda I)$.

The analogy with the identity in [4] for the eigenvectors of a Hermitian matrix $A$ can be easily seen: if we fix an $i \in \{1, \dots, n\}$ and take $\lambda = \lambda_i$, then the product of differences of eigenvalues of $A$ on the left-hand side is the derivative of the characteristic polynomial of $A$ evaluated at $\lambda = \lambda_i$. Recall that for a Hermitian matrix the left eigenvector can be chosen to be the complex conjugate of the right eigenvector: $\mathbf{w} = \overline{\mathbf{v}}$. The expression on the right-hand side involving eigenvalues of the minor $M_j$ is the characteristic polynomial of the matrix $M_j$ evaluated at $\lambda_i$, that is, an entry of the adjugate matrix of $A - \lambda_i I$. We use the (general) relation between the derivative of a determinant and the trace of the corresponding adjugate matrix, which is also due to Jacobi ([10]). Our method is first to prove the identity for $k = 1$ and then use this case to prove the general one.
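As a sanity check, Theorem 1.1 can be verified in exact arithmetic on a small example. The sketch below is only an illustration: the symmetric matrix `A`, its hand-computed eigenpairs, and the helper names `det`, `delete_row_col` and `identity_sides` are assumptions introduced here, not taken from the paper. It uses the fact that $\prod_k (\lambda_i(A) - \lambda_k(M_j)) = \det(\lambda_i(A) I - M_j)$, so the eigenvalues of the minors never have to be computed.

```python
from fractions import Fraction

def det(m):
    """Determinant by Laplace expansion along the first row (fine for tiny matrices)."""
    if not m:
        return Fraction(1)
    total = Fraction(0)
    for c, entry in enumerate(m[0]):
        sub = [row[:c] + row[c + 1:] for row in m[1:]]
        total += (-1) ** c * entry * det(sub)
    return total

def delete_row_col(m, j):
    """Minor M_j: remove row j and column j (0-based index)."""
    return [row[:j] + row[j + 1:] for r, row in enumerate(m) if r != j]

# Illustrative real symmetric matrix (an assumption, not from the paper).
A = [[1, 1, 0],
     [1, 2, 1],
     [0, 1, 1]]
# Eigenpairs computed by hand: eigenvalue -> unnormalised eigenvector.
eigenpairs = {0: [1, -1, 1], 1: [1, 0, -1], 3: [1, 2, 1]}

def identity_sides(lam, vec, j):
    """Return (left-hand side, right-hand side) of Theorem 1.1 for component j."""
    norm_sq = sum(Fraction(x) ** 2 for x in vec)
    lhs = Fraction(vec[j]) ** 2 / norm_sq      # |v_{i,j}|^2 for the unit eigenvector
    for mu in eigenpairs:
        if mu != lam:
            lhs *= lam - mu                    # product over the other eigenvalues of A
    M = delete_row_col(A, j)
    size = len(M)
    shifted = [[(lam if r == c else 0) - M[r][c] for c in range(size)]
               for r in range(size)]
    rhs = det(shifted)                         # det(lam*I - M_j)
    return lhs, rhs

checks = [identity_sides(lam, vec, j)
          for lam, vec in eigenpairs.items() for j in range(3)]
all_equal = all(lhs == rhs for lhs, rhs in checks)
print(all_equal)
```

Working with `Fraction` keeps both sides exact, so the comparison is an equality test rather than a floating-point tolerance check.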
All notions and facts used in our proof will be recalled (in some cases, proved) below. In general, we refer the reader to [2], [3], [5], [6, Chapter 5], and [13].
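2. Preliminaries

2.1. Vectors, linear transformations and matrices. Throughout, $\mathbb{K}$ will denote the field of either real or complex numbers. For finite-dimensional vector spaces $\mathbf{V}, \mathbf{U}$ over $\mathbb{K}$ denote by $L(\mathbf{V}, \mathbf{U})$ the linear space of all linear transformations $T : \mathbf{V} \to \mathbf{U}$ over $\mathbb{K}$. For $\mathbf{V} = \mathbf{U}$ let $L(\mathbf{V}) := L(\mathbf{V}, \mathbf{V})$. The dual space of a vector space $\mathbf{V}$ over $\mathbb{K}$ of finite dimension $n = \dim \mathbf{V}$ is $\mathbf{V}^* := L(\mathbf{V}, \mathbb{K})$. Let $\mathbf{b}_1, \dots, \mathbf{b}_n \in \mathbf{V}$ be a basis in $\mathbf{V}$. Then the dual basis $\mathbf{b}_1^*, \dots, \mathbf{b}_n^* \in \mathbf{V}^*$ is defined by the conditions $\mathbf{b}_i^*(\mathbf{b}_j) = \delta_{ij}$ for $i, j \in \{1, \dots, n\}$. It is easily seen that $\mathbf{V}$ is isomorphic to $\mathbb{K}^n$, the vector space of column vectors $\mathbf{x} = (x_1, \dots, x_n)^\top$. That is, the vector $\mathbf{v} = \sum_{i=1}^{n} b_i \mathbf{b}_i$ corresponds to the vector $\mathbf{b} = (b_1, \dots, b_n)^\top \in \mathbb{K}^n$. The basis $\mathbf{b}_i$, $i \in \{1, \dots, n\}$ corresponds to the standard basis $\mathbf{e}_i = (\delta_{i1}, \dots, \delta_{in})^\top$, $i \in \{1, \dots, n\}$. $\mathbf{V}^*$ can also be identified with $\mathbb{K}^n$, where $\mathbf{b}_i^*$ corresponds to $\mathbf{e}_i$ for $i \in \{1, \dots, n\}$. Thus $\mathbf{u}^*(\mathbf{v})$ corresponds to $\mathbf{x}^\top \mathbf{y}$.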
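For $T \in L(\mathbf{V}, \mathbf{U})$ let $T^* : \mathbf{U}^* \to \mathbf{V}^*$ be its dual transformation, acting as $(T^*(\mathbf{u}^*))(\mathbf{v}) = \mathbf{u}^*(T(\mathbf{v}))$ for all $\mathbf{u}^* \in \mathbf{U}^*$, $\mathbf{v} \in \mathbf{V}$. Let $\operatorname{im} T := T(\mathbf{V}) \subset \mathbf{U}$ and $\ker T := \{\mathbf{v} \in \mathbf{V} : T(\mathbf{v}) = \mathbf{0}\}$. Consider $T \in L(\mathbf{V})$. The fundamental theorem of linear algebra says that $\ker(T) \simeq (\operatorname{im}(T^*))^\perp$ and $\ker(T^*) \simeq (\operatorname{im}(T))^\perp$. Here $X^\perp := \{\varphi \in \mathbf{V}^* : \varphi(X) = 0\}$ and $(X')^\perp := \{\mathbf{v} \in \mathbf{V} : \forall\, \varphi \in X' \ \varphi(\mathbf{v}) = 0\}$ for $X \subset \mathbf{V}$, $X' \subset \mathbf{V}^*$ (the second definition reduces to the first one if we canonically identify $\mathbf{V}$ with its double dual $\mathbf{V}^{**}$).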
We will use the following easy result:
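Proposition 2.1. (see [6], Problem 5.1.2(e)) Let $\mathbf{U} \subset \mathbf{V}$, $\mathbf{W} \subset \mathbf{V}^*$ be two $m$-dimensional subspaces. The following are equivalent:
(i) $\mathbf{U} \cap \mathbf{W}^\perp = \{\mathbf{0}\}$;
(ii) $\mathbf{U}^\perp \cap \mathbf{W} = \{\mathbf{0}\}$;
(iii) There exist bases $\{\mathbf{u}_1, \dots, \mathbf{u}_m\}$, $\{\mathbf{f}_1, \dots, \mathbf{f}_m\}$ in $\mathbf{U}$ and $\mathbf{W}$, respectively, such that $\mathbf{f}_j(\mathbf{u}_i) = \delta_{ij}$, $i, j = 1, \dots, m$.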
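Assume now that $\dim \mathbf{V} = n$, $\dim \mathbf{U} = m$. Fix bases $\{\mathbf{b}_1, \dots, \mathbf{b}_n\}$ and $\{\mathbf{a}_1, \dots, \mathbf{a}_m\}$ in $\mathbf{V}$ and $\mathbf{U}$ respectively. Then $L(\mathbf{V}, \mathbf{U})$ and $L(\mathbf{U}^*, \mathbf{V}^*)$ are isomorphic to $\mathbb{K}^{m \times n}$ and $\mathbb{K}^{n \times m}$ respectively. Further, $T \in L(\mathbf{V}, \mathbf{U})$ is represented by a matrix $A \in \mathbb{K}^{m \times n}$ with entries $a_{ij} \in \mathbb{K}$ as $T(\mathbf{x}) = A\mathbf{x}$, and $T^* \in L(\mathbf{U}^*, \mathbf{V}^*)$ is represented by $A^\top$. The vector space of $m \times n$ matrices $A = [a_{ij}]$ with entries in $\mathbb{K}$ will be denoted by $M_{m \times n}(\mathbb{K}) \simeq \mathbb{K}^{m \times n}$. By a $k \times k$ minor of a matrix $A \in M_{m \times n}(\mathbb{K})$ we mean the determinant of a $k \times k$ submatrix obtained from $A$ by deleting $m - k$ rows and $n - k$ columns. For an $A = [a_{ij}] \in \mathbb{K}^{n \times n}$ we define the trace of $A$ as $\operatorname{tr} A := \sum_{i=1}^{n} a_{ii}$.

Denote by $\operatorname{rank} T$ and $\operatorname{null} T$ the dimension of $\operatorname{im} T$ and $\ker T$ respectively. The rank-nullity theorem says that $\dim \mathbf{V} = \operatorname{rank} T + \operatorname{null} T$. Recall that for a matrix $A \in \mathbb{K}^{m \times n}$ the rank $r = \operatorname{rank} A$, viewed as the rank of the linear transformation induced by $A$, is equal to the size of the largest nonzero minor of $A$. Furthermore, there exist two sets of $r$ linearly independent vectors $\mathbf{v}_1, \dots, \mathbf{v}_r \in \mathbb{K}^n$ and $\mathbf{u}_1, \dots, \mathbf{u}_r \in \mathbb{K}^m$ such that $A = \sum_{i=1}^{r} \mathbf{u}_i \mathbf{v}_i^\top$. Clearly, if $A$ has the above rank decomposition then $A = \sum_{i=1}^{r} (t_i^{-1}\mathbf{u}_i)(t_i \mathbf{v}_i)^\top$ for $t_1, \dots, t_r \in \mathbb{K} \setminus \{0\}$. For $r > 1$ there are other rank $r$ decompositions of $A$.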
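The scaling freedom in a rank decomposition can be seen on a small example. The following is a minimal sketch in exact rational arithmetic; the vectors `us`, `vs`, the scalars `ts` and the helper names `outer` and `add` are illustrative assumptions, not part of the paper.

```python
from fractions import Fraction

def outer(u, v):
    """Outer product u v^T as a list of rows."""
    return [[ui * vj for vj in v] for ui in u]

def add(A, B):
    """Entrywise sum of two matrices of the same shape."""
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

us = [[1, 0, 2], [0, 1, 1]]        # u_1, u_2 in K^3 (m = 3), linearly independent
vs = [[1, 1], [1, -1]]             # v_1, v_2 in K^2 (n = 2), linearly independent
ts = [Fraction(2), Fraction(-3)]   # arbitrary nonzero scalars t_1, t_2

# A = u_1 v_1^T + u_2 v_2^T, a 3 x 2 matrix of rank 2.
A = add(outer(us[0], vs[0]), outer(us[1], vs[1]))

# Rescale each pair as (t_i^{-1} u_i, t_i v_i); the sum is unchanged.
terms = [outer([x / t for x in u], [t * y for y in v])
         for u, v, t in zip(us, vs, ts)]
rescaled = add(terms[0], terms[1])

same = (A == rescaled)
print(A, same)
```

Each summand $\mathbf{u}_i \mathbf{v}_i^\top$ is individually invariant under the rescaling, which is why the check passes for any nonzero choice of `ts`.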
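Assume that $A \in \mathbb{K}^{n \times n}$ has $\operatorname{rank} A = 1$. Then the decomposition $A = \mathbf{u}\mathbf{v}^\top \neq 0$ is unique up to scaling, $A = (t^{-1}\mathbf{u})(t\mathbf{v})^\top$, $t \in \mathbb{K} \setminus \{0\}$. Further, $\operatorname{tr} A = \mathbf{v}^\top \mathbf{u}$. If $\operatorname{tr} A \neq 0$, then the dual bases of $\operatorname{im} A$ and $\operatorname{im} A^\top$ can be chosen as $\mathbf{u}$ and $\frac{1}{\operatorname{tr} A}\mathbf{v}$.

2.2. Compound and adjugate matrices. Assume that $\mathbf{V}$ is an $n$-dimensional vector space over $\mathbb{K}$. For $k \in \{1, \dots, n\}$ denote by $\bigwedge^k \mathbf{V}$ the $k$-wedge product space of $\mathbf{V}$. This is a space spanned by the $k$-wedge products $\mathbf{x}_1 \wedge \cdots \wedge \mathbf{x}_k$, $\mathbf{x}_1, \dots, \mathbf{x}_k \in \mathbf{V}$. Choose a basis $\mathbf{b}_1, \dots, \mathbf{b}_n$ in $\mathbf{V}$. Then this basis induces the basis in $\bigwedge^k \mathbf{V}$: $\{\mathbf{b}_{i_1} \wedge \cdots \wedge \mathbf{b}_{i_k},\ 1 \leq i_1 < \cdots < i_k \leq n\}$. We arrange the elements of this basis in the lexicographical order. In $\mathbb{K}^n$ we choose the standard basis $\{\mathbf{e}_1, \dots, \mathbf{e}_n\}$.

Recall that $\dim \bigwedge^k \mathbf{V} = \binom{n}{k}$. Thus $\bigwedge^1 \mathbf{V} = \mathbf{V}$, and $\bigwedge^n \mathbf{V}$ is a one-dimensional space (isomorphic to $\mathbb{K}$). Assume that $\mathbf{U}$ is an $m$-dimensional vector space. Assume that $T \in L(\mathbf{V}, \mathbf{U})$ and $1 \leq k \leq \min(m, n)$. Then $T$ induces a transformation $\bigwedge^k T \in L(\bigwedge^k \mathbf{V}, \bigwedge^k \mathbf{U})$ which acts as follows: $\bigwedge^k T(\mathbf{x}_1 \wedge \cdots \wedge \mathbf{x}_k) = (T\mathbf{x}_1) \wedge \cdots \wedge (T\mathbf{x}_k)$. It is also straightforward to show that $\bigwedge^k T^* = (\bigwedge^k T)^*$.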
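For $T \in L(\mathbf{V})$ we have $(\bigwedge^n T)(\mathbf{v}_1 \wedge \cdots \wedge \mathbf{v}_n) =: \det(T) \cdot \mathbf{v}_1 \wedge \cdots \wedge \mathbf{v}_n$. If $T$ is represented by a matrix $A$, then the scalar $\det(T)$ is the same as the determinant of $A$ (independent of the choice of a basis).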
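For $n = 2$ this definition of the determinant can be expanded by hand. The computation below is a sketch assuming that $T$ is represented in a basis $\mathbf{e}_1, \mathbf{e}_2$ by a matrix with first row $(a, b)$ and second row $(c, d)$; the symbols $a, b, c, d$ are introduced here only for illustration. It uses bilinearity of the wedge product together with $\mathbf{e}_i \wedge \mathbf{e}_i = 0$ and $\mathbf{e}_2 \wedge \mathbf{e}_1 = -\mathbf{e}_1 \wedge \mathbf{e}_2$.

```latex
\begin{align*}
T\mathbf{e}_1 &= a\,\mathbf{e}_1 + c\,\mathbf{e}_2, \qquad
T\mathbf{e}_2 = b\,\mathbf{e}_1 + d\,\mathbf{e}_2, \\
(T\mathbf{e}_1) \wedge (T\mathbf{e}_2)
  &= ab\,\mathbf{e}_1 \wedge \mathbf{e}_1 + ad\,\mathbf{e}_1 \wedge \mathbf{e}_2
   + cb\,\mathbf{e}_2 \wedge \mathbf{e}_1 + cd\,\mathbf{e}_2 \wedge \mathbf{e}_2 \\
  &= (ad - bc)\,\mathbf{e}_1 \wedge \mathbf{e}_2 .
\end{align*}
```

The scalar $ad - bc$ is the familiar $2 \times 2$ determinant, in agreement with the statement above.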
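Suppose $\mathbf{W}$ is an $l$-dimensional vector space over $\mathbb{K}$ and let $S \in L(\mathbf{W}, \mathbf{V})$. Then $TS \in L(\mathbf{W}, \mathbf{U})$. Furthermore, for $1 \leq k \leq \min(l, m, n)$ we have the equality $\bigwedge^k (TS) = (\bigwedge^k T)(\bigwedge^k S)$.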
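In coordinates, with respect to the lexicographically ordered wedge bases, the matrix of $\bigwedge^k T$ consists of the $k \times k$ minors of the matrix of $T$, so the multiplicativity above becomes the Cauchy-Binet formula. The sketch below checks this on one small integer example; the matrices `A`, `B` and the helper names `det`, `compound` and `matmul` are illustrative assumptions.

```python
from itertools import combinations

def det(m):
    """Determinant of a square matrix by Laplace expansion along the first row."""
    if not m:
        return 1
    return sum((-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
               for c in range(len(m)))

def compound(M, k):
    """Matrix of all k x k minors of M, with row and column index sets in lexicographic order."""
    row_sets = list(combinations(range(len(M)), k))
    col_sets = list(combinations(range(len(M[0])), k))
    return [[det([[M[r][c] for c in J] for r in I]) for J in col_sets]
            for I in row_sets]

def matmul(A, B):
    """Plain matrix product of two lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

A = [[1, 2, 3],
     [4, 5, 6]]          # represents T, here 2 x 3
B = [[1, 0],
     [2, 1],
     [0, 3]]             # represents S, here 3 x 2

k = 2
left = compound(matmul(A, B), k)                # minors of the product
right = matmul(compound(A, k), compound(B, k))  # product of the minor matrices
print(left, right)
```

Here `compound(A, 2)` is a $1 \times 3$ row and `compound(B, 2)` a $3 \times 1$ column, so the right-hand side is the classical Cauchy-Binet sum over the three $2 \times 2$ minors.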
Let $\mathbf{x}_1, \dots, \mathbf{x}_k \in \mathbb{K}^n$. Then $\mathbf{x}_1 \wedge \cdots \wedge \mathbf{x}_k \in \mathbb{K}^{\binom{n}{k}}$ is a vector which is obtained as follows: Let $X = [\mathbf{x}_1 \cdots \mathbf{x}_k] \in \mathbb{K}^{n \times k}$ be the matrix whose columns are $\mathbf{x}_1, \dots, \mathbf{x}_k$. Denote by $c_k(X)$ the vector whose coordinates are (determinants of) all $k \times k$ minors of $X$, arranged in the lexicographical order. Then $\mathbf{x}_1 \wedge \cdots \wedge \mathbf{x}_k = c_k(X) \in \mathbb{K}^{\binom{n}{k}}$. The coordinates of $c_k(X)$ are the Plücker coordinates of $\mathbf{x}_1 \wedge \cdots \wedge \mathbf{x}_k$ (for more information, see e.g. [7]). Note that if $\mathbf{x}_1, \dots, \mathbf{x}_k$ are linearly independent, then $\mathbf{x}_1 \wedge \cdots \wedge \mathbf{x}_k$ is a basis in $\bigwedge^k \operatorname{span}(\mathbf{x}_1, \dots, \mathbf{x}_k)$.

It now follows that if $T \in L(\mathbf{V}, \mathbf{U})$ is represented by $A \in \mathbb{K}^{m \times n}$ then $\bigwedge^k T$ is represented by the $k$-th compound matrix $C_k(A) \in \mathbb{K}^{\binom{m}{k} \times \binom{n}{k}}$, whose entries are the $k \times k$ minors of $A$ arranged in the lexicographical order. Assume that $S \in L(\mathbf{W}, \mathbf{V})$ is represented by $B \in \mathbb{K}^{n \times l}$. Then the above arguments show that $C_k(AB)$ represents $\bigwedge^k (TS)$, and $C_k(AB) = C_k(A) C_k(B)$. Furthermore, $C_k(aA) = a^k C_k(A)$, and $C_k(I_n) = I_{\binom{n}{k}}$, where $I_n \in \mathbb{K}^{n \times n}$ is the identity matrix. It is convenient to adopt the definition $C_0(A) = 1 \in \mathbb{K}$ for an arbitrary $A \neq 0$.

Recall the definition of the adjugate of $A = [a_{ij}]$, denoted by $\operatorname{adj} A = [b_{ij}] \in \mathbb{K}^{n \times n}$. Here $b_{ij}$ is $(-1)^{i+j}$ times the $(n-1) \times (n-1)$ minor of $A$ obtained by deleting the $j$-th row and $i$-th column of $A$. Let $\Delta_n \in \mathbb{K}^{n \times n}$ be the matrix whose $(i, j)$ entry is $(-1)^i \delta_{i(n-j+1)}$. (Note that $\Delta_n$ is an orthogonal matrix.) It is straightforward to show that $\operatorname{adj} A = \Delta_n C_{n-1}(A) \Delta_n^\top$. Recall the fundamental identity $A \operatorname{adj} A = (\operatorname{adj} A) A = (\det A) I_n$. Hence if $\det A \neq 0$ it follows that $A^{-1} = (\det A)^{-1} \operatorname{adj} A$. Similarly we deduce $\operatorname{adj}(AB) = (\operatorname{adj} B)(\operatorname{adj} A)$.

More generally, for $1 \leq k \leq n - 1$ one can introduce the $k$-th adjugate of $A$, denoted by $\operatorname{adj}_k A \in \mathbb{K}^{\binom{n}{k} \times \binom{n}{k}}$, as follows (cf. [1], [12], [13]): The entry $(\{1 \leq i_1 < \cdots < i_k \leq n\}, \{1 \leq j_1 < \cdots < j_k \leq n\})$ is $(-1)^{\sum_{l=1}^{k} (i_l + j_l)}$ times the $(n-k) \times (n-k)$ minor of $A$ obtained from $A$ by deleting rows $i_1, \dots, i_k$ and columns $j_1, \dots, j_k$. It is straightforward to show that $\operatorname{adj}_k A = C_k(\Delta_n) C_{n-k}(A) C_k(\Delta_n)^\top$. Note that $\operatorname{adj} A = \operatorname{adj}_1 A$. We have the following identities:
Ck(A)adjk A = (adjk A)Ck(A) = (det A)I n . (k) Hence k Ck(A)adjk A = (adjk A)Ck(A) = (det A) I n , (k)
$$\operatorname{adj}_k (AB) = (\operatorname{adj}_k B)(\operatorname{adj}_k A).$$

Compounds and adjugates occur in the following important determinantal formula, which was proved in [12].

Proposition 2.2. (Theorem 4.1, [12]) Let $A, B \in \mathbb{K}^{n \times n}$ be arbitrary. Then
$$\det(A + B) = \sum_{k=0}^{n} \operatorname{tr}\bigl(\operatorname{adj}_k A \, C_k(B)\bigr).$$

The following lemma is straightforward, but we present its proof for completeness:

Lemma 2.3. Let $A \in \mathbb{K}^{n \times n}$ and assume that $\operatorname{rank} A = n - k$ for some $1 \leq k \leq n - 1$. Then for an integer $j \in \{1, \dots, n - 1\}$ the following conditions hold:
(1) If j