Linear and Non-Linear Supersymmetry for Non-Hermitian Matrix Hamiltonians

Total Page:16

File Type:pdf, Size:1020Kb

Linear and Non-Linear Supersymmetry for Non-Hermitian Matrix Hamiltonians Saint-Petersburg State University Physical Faculty Russia, Saint-Petertsburg, 198504, Petrodvorets, Ulianovskaya ul., 1 Andrei V. Sokolov Linear and Non-linear Supersymmetry for Non-Hermitian Matrix Hamiltonians Report at the Workshop \PHHQP XI: Non-Hermitian Operators in Quantum Physics" August 27-31, 2012, APC, Paris, France Main points of the report • Basic definitions and notation • Brief review of the literature and the purpose of this report • A method of constructing of any matrix differential intertwining operator of any order in terms of transformation vector-functions • Theorem on singular factorization of a matrix intertwining operator into a prod- uct of matrix intertwining operators of lower orders • The notion of minimizability of a matrix intertwining operator and criterion of minimizability • Sufficient condition of existence for an intertwining operator a \conjugate" inter- twining operator of the same order and of polynomial SUSY with these operators • Theorem on existence for any intertwining operator a \conjugate" intertwining operator and polynomial SUSY with these operators • Sufficient condition of reducibility of an intertwining operator and existence of absolutely irreducible intertwining operators 2 1 Basic definitions and notation 1.1 Intertwining relation Let us consider two defined on the entire axis matrix n × n Hamiltonians 2 2 H+ = −In@ + V+(x);H− = −In@ + V−(x);@ ≡ d=dx; where In is the identity matrix and V+(x) and V−(x) are square matrices, all ele- ments of which are sufficiently smooth and, in general, complex-valued functions. We suppose that these Hamiltonians are intertwined by a matrix linear differential − operator QN , so that N − − − X − j Q H+ = H−Q ;Q = X (x)@ ; (1) N N N j=0 j − where Xj (x), j = 0, . , N are as well square matrices of n-th order, elements of which are sufficiently smooth and, in general, complex-valued functions. It is not hard to check that intertwining (1) leads to the following consequences: − − − −1 − 0 − −1 XN (x) = Const;V−(x) = XN V+(x)(XN ) + 2XN−1(x)(XN ) ; − where and in the what follows we restrict ourselves by the case det XN 6= 0. 3 1.2 About \conjugate" intertwining operator + Existence of a \conjugate" matrix intertwining operator QM for a given matrix − intertwining operator QN such that M + + + X + j H+Q = Q H−;Q = X (x)@ ; (2) M M m j=0 j is not evident in general but is evident in the following cases: y y + + + − y PN j − y (1) H+ =H+;H− =H− ) H+QN =QN H−;QN =QN = j=0(−@) Xj (x); where y denotes Hermitian conjugation; t t + + + − t PN j − t (2) H+ =H+;H− =H− ) H+QN =QN H−;QN =QN = j=0(−@) Xj (x); where t denotes transposition; ∗ + + + − ∗ PN − ∗ j (3) H+ = H− ) H+QN = QN H−;QN = QN = j=0 Xj (x)@ ; where ∗ denotes complex conjugation. Existence of a \conjugate" matrix intertwining operator of the type (2) for any − matrix intertwining operator QN is guaranteed by one of theorems presented below. 4 1.3 Structure of intertwining operator kernel, transformation functions − By virtue of the intertwining the kernel of QN is an invariant subspace for H+: − − H+ ker QN ⊂ ker QN : − − − − Hence, for any basis Φ1 (x), . , Φd (x) in ker QN , d = dim ker QN = nN there is a + constant square matrix T ≡ kTijk of d-th order such that d − X + − H+Φ = T Φ ; i = 1; : : : ; d: i j=1 ij j − + A basis in the kernel of an intertwining operator QN in which the matrix T has a normal (Jordan) form is called a canonical basis. Elements of a canonical basis are called a transformation vector-functions. If a Jordan form of the matrix T + has block(s) of order higher than one, then the corresponding canonical basis contains not only formal vector-eigenfunctions of H+ but also formal associated vector-function(s) of H+ which are defined as follows. A finite or infinite set of vector-functions Φm;i(x), i = 0, 1, 2, . is called a chain of formal associated vector-functions of H+ for a spectral value λm if H+Φm;0 =λmΦm;0; Φm;0(x)6≡0; (H+−λmIn)Φm;i =Φm;i−1; i=1; 2; 3;:::: 5 2 Brief review of the literature [1] R.D. Amado, F. Cannata, J.-P. Dedonder, Coupled-channel supersymmetric quantum mechanics, Phys. Rev. A 38 (1988) 3797. + − +: the simplest formulae for H+, H−, Q1 and Q1 in the case where N = 1, n is arbitrary, all transformation vector-functions are formal vector-eigenfunctions of H+ and belong to the only its spectral value; −: very partial case. [2] A.A. Andrianov, F.Cannata, D.N.Nishnianidze, M.V.Ioffe, Matrix Hamiltonians: SUSY approach to hidden symmetries, JPA 30 (1997) 5037; quant-ph/9707004. + −† y y +: systematic studying of the system where QN = QN , H+ = H+ and H− = H− in the case N = 1 for n = 2 mostly and in the case N = 2 for n = 2; −: rather restricted case. 6 [3] V.M. Goncharenko, A.P. Veselov, Monodromy of the matrix Schr¨odingerequa- tions and Darboux transformations, JPA 31 (1998) 5315. − − +: there are formulae in [3] that allow us to construct any QN with XN = In and the corresponding H− for any H+ (n and N are arbitrary); −: the formulae of [3] are rather complex since they contain quasideterminants and they are in terms of a basis in an H+-invariant subspace; there is no any − condition that provides implementability of the described procedure of QN con- − structing and absence of pole(s) for QN coefficients and for the potential of H−. [4] B.F. Samsonov, A.A. Pecheritsin, Chains of Darboux transformations for the matrix Schr¨odingerequation, JPA 37 (2004) 239; quant-ph/0307145. − +: there are more simple than in [3] formulae for QN and H− in terms of usual determinants and of formal vector-eigenfunctions of H+; −: very restricted case since only vector-eigenfunctions of H+ are used; there is − no condition that provides implementability of the procedure of QN constructing − and absence of pole(s) for QN and H−. 7 [5] T. Tanaka, N -fold Supersymmetry in Quantum Mechanical Matrix Models, Mod. Phys. Lett. A 27 (2012) 1250051; arXiv:1108.0480. +: the author proposes to consider SUSY generated by two intertwining opera- − + + − − + tors of the same order QN and QN such that QN QN and QN QN are the identical polynomials with matrix coefficients of H+ and H− respectively; −: there is no any general method of construction of such SUSY and there is no proof of existence of such SUSY except the case n = N = 2. The purpose of this report is to generalize the results of [6] A.A. Andrianov, A.V. Sokolov, Nonlinear supersymmetry in quantum mechan- ics: algebraic properties and differential representation, Nucl. Phys. B 660 (2003) 25; hep-th/0301062. to the matrix case. 8 3 A method of constructing of any matrix differential intertwining operator of any order in terms of transformation vector-functions Let us consider a set of formal associated vector-functions − − − − t Φl (x) ≡ 'l1(x);'l2(x);:::;'ln(x) ; l = 1; : : : ; nN; n; N 2 N of a matrix n × n Hamiltonian H+ such that this set can be divided into a chains of formal associated vector-functions of H+ for different, in general, spectral values − of H+ and the Wronskian W (x) of all Φl (x), l = 1, . , nN does not vanish on − the real axis. Then a matrix n × n differential operator QN of N-th order with − N arbitrary nondegenerate matrix coefficient XN at @ that intertwines H+ with some Hamiltonian H− and such that the considered set form a basis in its kernel can be 9 uniquely found as follows, 1 Q− = X− N W (x) N '− :::'− '−0 :::'−0 ::: ('− )(N−1) ::: ('− )(N−1) (Φ−)(N) 11 1n 11 1n 11 1n 1 − − −0 −0 − (N−1) − (N−1) − (N) '21 :::'2n '21 :::'2n ::: ('21) ::: ('2n) (Φ2 ) . .. .. .. .. × . ; '− :::'− '−0 :::'−0 ::: ('− )(N−1) ::: ('− )(N−1) (Φ− )(N) nN;1 nN;n nN;1 nN;n nN;1 nN;n nN N−1 N−1 N P1 :::Pn P1@:::Pn@:::P1@ :::Pn@ In@ PlΦ = 'l; 8 Φ(x) ≡ '1(x);'2(x);:::;'n(x) ; l = 1; : : : ; n; where during calculation of the determinant in each of its terms the corresponding N−1 N−1 N of the operators P1,..., Pn, P1@,..., Pn@, P1@ ,..., Pn@ , In@ must be placed on the last position. The potential of the Hamiltonian H− intertwined with − H+ by QN can be calculated with the help of the formula from the section 1. We emphasize that the condition that the Wronskian W (x) does not vanish on the − real axis provides existence for QN and H− and smoothness (absence of pole(s)) for − − − the potential of H− and all matrix-valued coefficients X0 (x), . , XN−1(x) of QN . 10 4 Theorem on singular factorization of a matrix intertwining operator into a product of matrix intertwining operators of lower orders The following theorem takes an important part in studying of polynomial supersym- metry and minnimizability and reducibility of matrix intertwining operators. Theorem 1. Suppose that − − (1) Φl (x), l = 1,..., nN are elements of a canonical basis in ker QN renumbered − so that the set Φl (x); l = 1,..., nj for any j = 1,..., N − 1 can be divided into a chains of formal associated vector-functions of H+; − (2) Wj(x) is the Wronskian of Φl (x), l = 1,..., nj, where j = 1,..., N; (3) jm, m = 1,..., M is monotonically increasing sequence of natural numbers such that jM = N and Wjm (x) 6≡ 0, m = 1,..., M − 1; (4) Nm = jm − jm−1, m = 1,..., M, j0 ≡ 0.
Recommended publications
  • Math 4571 (Advanced Linear Algebra) Lecture #27
    Math 4571 (Advanced Linear Algebra) Lecture #27 Applications of Diagonalization and the Jordan Canonical Form (Part 1): Spectral Mapping and the Cayley-Hamilton Theorem Transition Matrices and Markov Chains The Spectral Theorem for Hermitian Operators This material represents x4.4.1 + x4.4.4 +x4.4.5 from the course notes. Overview In this lecture and the next, we discuss a variety of applications of diagonalization and the Jordan canonical form. This lecture will discuss three essentially unrelated topics: A proof of the Cayley-Hamilton theorem for general matrices Transition matrices and Markov chains, used for modeling iterated changes in systems over time The spectral theorem for Hermitian operators, in which we establish that Hermitian operators (i.e., operators with T ∗ = T ) are diagonalizable In the next lecture, we will discuss another fundamental application: solving systems of linear differential equations. Cayley-Hamilton, I First, we establish the Cayley-Hamilton theorem for arbitrary matrices: Theorem (Cayley-Hamilton) If p(x) is the characteristic polynomial of a matrix A, then p(A) is the zero matrix 0. The same result holds for the characteristic polynomial of a linear operator T : V ! V on a finite-dimensional vector space. Cayley-Hamilton, II Proof: Since the characteristic polynomial of a matrix does not depend on the underlying field of coefficients, we may assume that the characteristic polynomial factors completely over the field (i.e., that all of the eigenvalues of A lie in the field) by replacing the field with its algebraic closure. Then by our results, the Jordan canonical form of A exists.
    [Show full text]
  • MATH 2370, Practice Problems
    MATH 2370, Practice Problems Kiumars Kaveh Problem: Prove that an n × n complex matrix A is diagonalizable if and only if there is a basis consisting of eigenvectors of A. Problem: Let A : V ! W be a one-to-one linear map between two finite dimensional vector spaces V and W . Show that the dual map A0 : W 0 ! V 0 is surjective. Problem: Determine if the curve 2 2 2 f(x; y) 2 R j x + y + xy = 10g is an ellipse or hyperbola or union of two lines. Problem: Show that if a nilpotent matrix is diagonalizable then it is the zero matrix. Problem: Let P be a permutation matrix. Show that P is diagonalizable. Show that if λ is an eigenvalue of P then for some integer m > 0 we have λm = 1 (i.e. λ is an m-th root of unity). Hint: Note that P m = I for some integer m > 0. Problem: Show that if λ is an eigenvector of an orthogonal matrix A then jλj = 1. n Problem: Take a vector v 2 R and let H be the hyperplane orthogonal n n to v. Let R : R ! R be the reflection with respect to a hyperplane H. Prove that R is a diagonalizable linear map. Problem: Prove that if λ1; λ2 are distinct eigenvalues of a complex matrix A then the intersection of the generalized eigenspaces Eλ1 and Eλ2 is zero (this is part of the Spectral Theorem). 1 Problem: Let H = (hij) be a 2 × 2 Hermitian matrix. Use the Min- imax Principle to show that if λ1 ≤ λ2 are the eigenvalues of H then λ1 ≤ h11 ≤ λ2.
    [Show full text]
  • Math 223 Symmetric and Hermitian Matrices. Richard Anstee an N × N Matrix Q Is Orthogonal If QT = Q−1
    Math 223 Symmetric and Hermitian Matrices. Richard Anstee An n × n matrix Q is orthogonal if QT = Q−1. The columns of Q would form an orthonormal basis for Rn. The rows would also form an orthonormal basis for Rn. A matrix A is symmetric if AT = A. Theorem 0.1 Let A be a symmetric n × n matrix of real entries. Then there is an orthogonal matrix Q and a diagonal matrix D so that AQ = QD; i.e. QT AQ = D: Note that the entries of M and D are real. There are various consequences to this result: A symmetric matrix A is diagonalizable A symmetric matrix A has an othonormal basis of eigenvectors. A symmetric matrix A has real eigenvalues. Proof: The proof begins with an appeal to the fundamental theorem of algebra applied to det(A − λI) which asserts that the polynomial factors into linear factors and one of which yields an eigenvalue λ which may not be real. Our second step it to show λ is real. Let x be an eigenvector for λ so that Ax = λx. Again, if λ is not real we must allow for the possibility that x is not a real vector. Let xH = xT denote the conjugate transpose. It applies to matrices as AH = AT . Now xH x ≥ 0 with xH x = 0 if and only if x = 0. We compute xH Ax = xH (λx) = λxH x. Now taking complex conjugates and transpose (xH Ax)H = xH AH x using that (xH )H = x. Then (xH Ax)H = xH Ax = λxH x using AH = A.
    [Show full text]
  • Arxiv:1901.01378V2 [Math-Ph] 8 Apr 2020 Where A(P, Q) Is the Arithmetic Mean of the Vectors P and Q, G(P, Q) Is Their P Geometric Mean, and Tr X Stands for Xi
    MATRIX VERSIONS OF THE HELLINGER DISTANCE RAJENDRA BHATIA, STEPHANE GAUBERT, AND TANVI JAIN Abstract. On the space of positive definite matrices we consider dis- tance functions of the form d(A; B) = [trA(A; B) − trG(A; B)]1=2 ; where A(A; B) is the arithmetic mean and G(A; B) is one of the different versions of the geometric mean. When G(A; B) = A1=2B1=2 this distance is kA1=2− 1=2 1=2 1=2 1=2 B k2; and when G(A; B) = (A BA ) it is the Bures-Wasserstein metric. We study two other cases: G(A; B) = A1=2(A−1=2BA−1=2)1=2A1=2; log A+log B the Pusz-Woronowicz geometric mean, and G(A; B) = exp 2 ; the log Euclidean mean. With these choices d(A; B) is no longer a metric, but it turns out that d2(A; B) is a divergence. We establish some (strict) convexity properties of these divergences. We obtain characterisations of barycentres of m positive definite matrices with respect to these distance measures. 1. Introduction Let p and q be two discrete probability distributions; i.e. p = (p1; : : : ; pn) and q = (q1; : : : ; qn) are n -vectors with nonnegative coordinates such that P P pi = qi = 1: The Hellinger distance between p and q is the Euclidean norm of the difference between the square roots of p and q ; i.e. 1=2 1=2 p p hX p p 2i hX X p i d(p; q) = k p− qk2 = ( pi − qi) = (pi + qi) − 2 piqi : (1) This distance and its continuous version, are much used in statistics, where it is customary to take d (p; q) = p1 d(p; q) as the definition of the Hellinger H 2 distance.
    [Show full text]
  • MATRICES WHOSE HERMITIAN PART IS POSITIVE DEFINITE Thesis by Charles Royal Johnson in Partial Fulfillment of the Requirements Fo
    MATRICES WHOSE HERMITIAN PART IS POSITIVE DEFINITE Thesis by Charles Royal Johnson In Partial Fulfillment of the Requirements For the Degree of Doctor of Philosophy California Institute of Technology Pasadena, California 1972 (Submitted March 31, 1972) ii ACKNOWLEDGMENTS I am most thankful to my adviser Professor Olga Taus sky Todd for the inspiration she gave me during my graduate career as well as the painstaking time and effort she lent to this thesis. I am also particularly grateful to Professor Charles De Prima and Professor Ky Fan for the helpful discussions of my work which I had with them at various times. For their financial support of my graduate tenure I wish to thank the National Science Foundation and Ford Foundation as well as the California Institute of Technology. It has been important to me that Caltech has been a most pleasant place to work. I have enjoyed living with the men of Fleming House for two years, and in the Department of Mathematics the faculty members have always been generous with their time and the secretaries pleasant to work around. iii ABSTRACT We are concerned with the class Iln of n><n complex matrices A for which the Hermitian part H(A) = A2A * is positive definite. Various connections are established with other classes such as the stable, D-stable and dominant diagonal matrices. For instance it is proved that if there exist positive diagonal matrices D, E such that DAE is either row dominant or column dominant and has positive diag­ onal entries, then there is a positive diagonal F such that FA E Iln.
    [Show full text]
  • 216 Section 6.1 Chapter 6 Hermitian, Orthogonal, And
    216 SECTION 6.1 CHAPTER 6 HERMITIAN, ORTHOGONAL, AND UNITARY OPERATORS In Chapter 4, we saw advantages in using bases consisting of eigenvectors of linear opera- tors in a number of applications. Chapter 5 illustrated the benefit of orthonormal bases. Unfortunately, eigenvectors of linear operators are not usually orthogonal, and vectors in an orthonormal basis are not likely to be eigenvectors of any pertinent linear operator. There are operators, however, for which eigenvectors are orthogonal, and hence it is possible to have a basis that is simultaneously orthonormal and consists of eigenvectors. This chapter introduces some of these operators. 6.1 Hermitian Operators § When the basis for an n-dimensional real, inner product space is orthonormal, the inner product of two vectors u and v can be calculated with formula 5.48. If v not only represents a vector, but also denotes its representation as a column matrix, we can write the inner product as the product of two matrices, one a row matrix and the other a column matrix, (u, v) = uT v. If A is an n n real matrix, the inner product of u and the vector Av is × (u, Av) = uT (Av) = (uT A)v = (AT u)T v = (AT u, v). (6.1) This result, (u, Av) = (AT u, v), (6.2) allows us to move the matrix A from the second term to the first term in the inner product, but it must be replaced by its transpose AT . A similar result can be derived for complex, inner product spaces. When A is a complex matrix, we can use equation 5.50 to write T T T (u, Av) = uT (Av) = (uT A)v = (AT u)T v = A u v = (A u, v).
    [Show full text]
  • Unitary-And-Hermitian-Matrices.Pdf
    11-24-2014 Unitary Matrices and Hermitian Matrices Recall that the conjugate of a complex number a + bi is a bi. The conjugate of a + bi is denoted a + bi or (a + bi)∗. − In this section, I’ll use ( ) for complex conjugation of numbers of matrices. I want to use ( )∗ to denote an operation on matrices, the conjugate transpose. Thus, 3 + 4i = 3 4i, 5 6i =5+6i, 7i = 7i, 10 = 10. − − − Complex conjugation satisfies the following properties: (a) If z C, then z = z if and only if z is a real number. ∈ (b) If z1, z2 C, then ∈ z1 + z2 = z1 + z2. (c) If z1, z2 C, then ∈ z1 z2 = z1 z2. · · The proofs are easy; just write out the complex numbers (e.g. z1 = a+bi and z2 = c+di) and compute. The conjugate of a matrix A is the matrix A obtained by conjugating each element: That is, (A)ij = Aij. You can check that if A and B are matrices and k C, then ∈ kA + B = k A + B and AB = A B. · · You can prove these results by looking at individual elements of the matrices and using the properties of conjugation of numbers given above. Definition. If A is a complex matrix, A∗ is the conjugate transpose of A: ∗ A = AT . Note that the conjugation and transposition can be done in either order: That is, AT = (A)T . To see this, consider the (i, j)th element of the matrices: T T T [(A )]ij = (A )ij = Aji =(A)ji = [(A) ]ij. Example. If 1 2i 4 1 + 2i 2 i 3i ∗ − A = − , then A = 2 + i 2 7i .
    [Show full text]
  • RANDOM DOUBLY STOCHASTIC MATRICES: the CIRCULAR LAW 3 Easily Be Bounded by a Polynomial in N
    The Annals of Probability 2014, Vol. 42, No. 3, 1161–1196 DOI: 10.1214/13-AOP877 c Institute of Mathematical Statistics, 2014 RANDOM DOUBLY STOCHASTIC MATRICES: THE CIRCULAR LAW By Hoi H. Nguyen1 Ohio State University Let X be a matrix sampled uniformly from the set of doubly stochastic matrices of size n n. We show that the empirical spectral × distribution of the normalized matrix √n(X EX) converges almost − surely to the circular law. This confirms a conjecture of Chatterjee, Diaconis and Sly. 1. Introduction. Let M be a matrix of size n n, and let λ , . , λ be × 1 n the eigenvalues of M. The empirical spectral distribution (ESD) µM of M is defined as 1 µ := δ . M n λi i n X≤ We also define µcir as the uniform distribution over the unit disk 1 µ (s,t) := mes( z 1; (z) s, (z) t). cir π | |≤ ℜ ≤ ℑ ≤ Given a random n n matrix M, an important problem in random matrix theory is to study the× limiting distribution of the empirical spectral distri- bution as n tends to infinity. We consider one of the simplest random matrix ensembles, when the entries of M are i.i.d. copies of the random variable ξ. When ξ is a standard complex Gaussian random variable, M can be viewed as a random matrix drawn from the probability distribution P(dM)= 1 tr(MM ∗) 2 e− dM on the set of complex n n matrices. This is known as arXiv:1205.0843v3 [math.CO] 27 Mar 2014 πn × the complex Ginibre ensemble.
    [Show full text]
  • PHYSICS 116A Homework 8 Solutions 1. Boas, Problem 3.9–4
    PHYSICS 116A Homework 8 Solutions 1. Boas, problem 3.9–4. Given the matrix, 0 2i 1 − A = i 2 0 , − 3 0 0 find the transpose, the inverse, the complex conjugate and the transpose conjugate of A. Verify that AA−1 = A−1A = I, where I is the identity matrix. We shall evaluate A−1 by employing Eq. (6.13) in Ch. 3 of Boas. First we compute the determinant by expanding in cofactors about the third column: 0 2i 1 − i 2 det A i 2 0 =( 1) − =( 1)( 6) = 6 . ≡ − − 3 0 − − 3 0 0 Next, we evaluate the matrix of cofactors and take the transpose: 2 0 2i 1 2i 1 − − 0 0 − 0 0 2 0 0 0 2 i 0 0 1 0 1 adj A = − − − = 0 3 i . (1) − 3 0 3 0 − i 0 − 6 6i 2 i 2 0 2i 0 2i − − − 3 0 − 3 0 i 2 − According to Eq. (6.13) in Ch. 3 of Boas the inverse of A is given by Eq. (1) divided by det(A), so 1 0 0 3 −1 1 i A = 0 2 6 . (2) 1 i 1 − − 3 It is straightforward to check by matrix multiplication that 0 2i 1 0 0 1 0 0 1 0 2i 1 1 0 0 − 3 3 − i 2 0 0 1 i = 0 1 i i 2 0 = 0 1 0 . − 2 6 2 6 − 3 0 0 1 i 1 1 i 1 3 0 0 0 0 1 − − 3 − − 3 The transpose, complex conjugate and the transpose conjugate∗ can be written down by inspec- tion: 0 i 3 0 2i 1 0 i 3 − − − ∗ † AT = 2i 2 0 , A = i 2 0 , A = 2i 2 0 .
    [Show full text]
  • Supplement: Symmetric and Hermitian Matrices
    Supplement: Symmetric and Hermitian Matrices A Bunch of Definitions Definition: A real n × n matrix A is called symmetric if AT = A. Definition: A complex n × n matrix A is called Hermitian if A∗ = A, where A∗ = AT , the conjugate transpose. Definition: A complex n × n matrix A is called normal if A∗A = AA∗, i.e. commutes with its conjugate transpose. It is quite a surprising result that these three kinds of matrices are always diagonalizable; and moreover, one can construct an orthonormal basis (in standard inner product) for Rn/Cn, consisting of eigenvectors of A. Hence the matrix P that gives diagonalization A = P DP −1 will be orthogonal/unitary, namely: T −1 T Definition: An n × n real matrix P is called orthogonal if P P = In, i.e. P = P . ∗ −1 ∗ Definition: An n × n complex matrix P is called unitary if P P = In, i.e. P = P . Diagonalization using these special kinds of P will have special names: Definition: A matrix A is called orthogonally diagonalizable if A is similar to a diagonal matrix D with an orthogonal matrix P , i.e. A = P DP T . A matrix A is called unitarily diagonalizable if A is similar to a diagonal matrix D with a unitary matrix P , i.e. A = P DP ∗. Then we have the following big theorems: Theorem: Every real n × n symmetric matrix A is orthogonally diagonalizable Theorem: Every complex n × n Hermitian matrix A is unitarily diagonalizable. Theorem: Every complex n × n normal matrix A is unitarily diagonalizable. To prove the above results, it is convenient to introduce the concept of adjoint operator, which allows us to discuss effectively the \transpose" operation in a general inner product space.
    [Show full text]
  • Symmetric and Hermitian Matrices
    “notes2” i i 2013/4/9 page 73 i i Chapter 5 Symmetric and Hermitian Matrices In this chapter, we discuss the special classes of symmetric and Hermitian matrices. We will conclude the chapter with a few words about so-called Normal matrices. Before we begin, we mention one consequence of the last chapter that will be useful in a proof of the unitary diagonalization of Hermitian matrices. Let A be an m n matrix with m n, and assume (for the moment) that A has linearly independent× columns. Then≥ if the Gram-Schmidt process is applied to the columns of A, the result can be expressed in terms of a matrix factorization A = Q˜ R˜, where the orthogonal vectors are the columns in Q˜, and R˜ is unit upper triangular n n with the inner products as the entries in R˜. H × i 1 qk ai For example, consider that qi := ai k−=1 H qk. Rearranging the equa- − qk qk tion, P i 1 H n − qk ai ai = q + q + 0q i qH q k k kX=1 k k k=Xi+1 where the right hand side is a linear combination of the columns of the first i (only!) columns of the matrix Q˜. So this equation can be rewritten H q1 ai H q1 q1 H q2 ai H q2 q2 . H qi−1ai ˜ H ai = Q qi−1qi−1 . 1 0 . . 0 Putting such equations together for i = 1 ,...,n , we arrive at the desired result, A = Q˜ R˜. 73 “notes2” i i 2013/4/9 page 74 i i 74 Chapter5.
    [Show full text]
  • Matrix Nearness Problems and Applications∗
    Matrix Nearness Problems and Applications∗ Nicholas J. Higham† Abstract A matrix nearness problem consists of finding, for an arbitrary matrix A, a nearest member of some given class of matrices, where distance is measured in a matrix norm. A survey of nearness problems is given, with particular emphasis on the fundamental properties of symmetry, positive definiteness, orthogonality, normality, rank-deficiency and instability. Theoretical results and computational methods are described. Applications of nearness problems in areas including control theory, numerical analysis and statistics are outlined. Key words. matrix nearness problem, matrix approximation, symmetry, positive definiteness, orthogonality, normality, rank-deficiency, instability. AMS subject classifications. Primary 65F30, 15A57. 1 Introduction Consider the distance function (1.1) d(A) = min E : A + E S has property P ,A S, k k ∈ ∈ where S denotes Cm×n or Rm×n, is a matrix norm on S, and P is a matrix property k · k which defines a subspace or compact subset of S (so that d(A) is well-defined). Associated with (1.1) are the following tasks, which we describe collectively as a matrix nearness problem: Determine an explicit formula for d(A), or a useful characterisation. • ∗This is a reprint of the paper: N. J. Higham. Matrix nearness problems and applications. In M. J. C. Gover and S. Barnett, editors, Applications of Matrix Theory, pages 1–27. Oxford University Press, 1989. †Department of Mathematics, University of Manchester, Manchester, M13 9PL, England ([email protected]). 1 Determine X = A + E , where E is a matrix for which the minimum in (1.1) • min min is attained.
    [Show full text]