Math 4571 (Advanced Linear Algebra) Lecture #27
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
MATH 2370, Practice Problems
MATH 2370, Practice Problems Kiumars Kaveh Problem: Prove that an n × n complex matrix A is diagonalizable if and only if there is a basis consisting of eigenvectors of A. Problem: Let A : V ! W be a one-to-one linear map between two finite dimensional vector spaces V and W . Show that the dual map A0 : W 0 ! V 0 is surjective. Problem: Determine if the curve 2 2 2 f(x; y) 2 R j x + y + xy = 10g is an ellipse or hyperbola or union of two lines. Problem: Show that if a nilpotent matrix is diagonalizable then it is the zero matrix. Problem: Let P be a permutation matrix. Show that P is diagonalizable. Show that if λ is an eigenvalue of P then for some integer m > 0 we have λm = 1 (i.e. λ is an m-th root of unity). Hint: Note that P m = I for some integer m > 0. Problem: Show that if λ is an eigenvector of an orthogonal matrix A then jλj = 1. n Problem: Take a vector v 2 R and let H be the hyperplane orthogonal n n to v. Let R : R ! R be the reflection with respect to a hyperplane H. Prove that R is a diagonalizable linear map. Problem: Prove that if λ1; λ2 are distinct eigenvalues of a complex matrix A then the intersection of the generalized eigenspaces Eλ1 and Eλ2 is zero (this is part of the Spectral Theorem). 1 Problem: Let H = (hij) be a 2 × 2 Hermitian matrix. Use the Min- imax Principle to show that if λ1 ≤ λ2 are the eigenvalues of H then λ1 ≤ h11 ≤ λ2. -
Math 223 Symmetric and Hermitian Matrices. Richard Anstee an N × N Matrix Q Is Orthogonal If QT = Q−1
Math 223 Symmetric and Hermitian Matrices. Richard Anstee An n × n matrix Q is orthogonal if QT = Q−1. The columns of Q would form an orthonormal basis for Rn. The rows would also form an orthonormal basis for Rn. A matrix A is symmetric if AT = A. Theorem 0.1 Let A be a symmetric n × n matrix of real entries. Then there is an orthogonal matrix Q and a diagonal matrix D so that AQ = QD; i.e. QT AQ = D: Note that the entries of M and D are real. There are various consequences to this result: A symmetric matrix A is diagonalizable A symmetric matrix A has an othonormal basis of eigenvectors. A symmetric matrix A has real eigenvalues. Proof: The proof begins with an appeal to the fundamental theorem of algebra applied to det(A − λI) which asserts that the polynomial factors into linear factors and one of which yields an eigenvalue λ which may not be real. Our second step it to show λ is real. Let x be an eigenvector for λ so that Ax = λx. Again, if λ is not real we must allow for the possibility that x is not a real vector. Let xH = xT denote the conjugate transpose. It applies to matrices as AH = AT . Now xH x ≥ 0 with xH x = 0 if and only if x = 0. We compute xH Ax = xH (λx) = λxH x. Now taking complex conjugates and transpose (xH Ax)H = xH AH x using that (xH )H = x. Then (xH Ax)H = xH Ax = λxH x using AH = A. -
Arxiv:1901.01378V2 [Math-Ph] 8 Apr 2020 Where A(P, Q) Is the Arithmetic Mean of the Vectors P and Q, G(P, Q) Is Their P Geometric Mean, and Tr X Stands for Xi
MATRIX VERSIONS OF THE HELLINGER DISTANCE RAJENDRA BHATIA, STEPHANE GAUBERT, AND TANVI JAIN Abstract. On the space of positive definite matrices we consider dis- tance functions of the form d(A; B) = [trA(A; B) − trG(A; B)]1=2 ; where A(A; B) is the arithmetic mean and G(A; B) is one of the different versions of the geometric mean. When G(A; B) = A1=2B1=2 this distance is kA1=2− 1=2 1=2 1=2 1=2 B k2; and when G(A; B) = (A BA ) it is the Bures-Wasserstein metric. We study two other cases: G(A; B) = A1=2(A−1=2BA−1=2)1=2A1=2; log A+log B the Pusz-Woronowicz geometric mean, and G(A; B) = exp 2 ; the log Euclidean mean. With these choices d(A; B) is no longer a metric, but it turns out that d2(A; B) is a divergence. We establish some (strict) convexity properties of these divergences. We obtain characterisations of barycentres of m positive definite matrices with respect to these distance measures. 1. Introduction Let p and q be two discrete probability distributions; i.e. p = (p1; : : : ; pn) and q = (q1; : : : ; qn) are n -vectors with nonnegative coordinates such that P P pi = qi = 1: The Hellinger distance between p and q is the Euclidean norm of the difference between the square roots of p and q ; i.e. 1=2 1=2 p p hX p p 2i hX X p i d(p; q) = k p− qk2 = ( pi − qi) = (pi + qi) − 2 piqi : (1) This distance and its continuous version, are much used in statistics, where it is customary to take d (p; q) = p1 d(p; q) as the definition of the Hellinger H 2 distance. -
MATRICES WHOSE HERMITIAN PART IS POSITIVE DEFINITE Thesis by Charles Royal Johnson in Partial Fulfillment of the Requirements Fo
MATRICES WHOSE HERMITIAN PART IS POSITIVE DEFINITE Thesis by Charles Royal Johnson In Partial Fulfillment of the Requirements For the Degree of Doctor of Philosophy California Institute of Technology Pasadena, California 1972 (Submitted March 31, 1972) ii ACKNOWLEDGMENTS I am most thankful to my adviser Professor Olga Taus sky Todd for the inspiration she gave me during my graduate career as well as the painstaking time and effort she lent to this thesis. I am also particularly grateful to Professor Charles De Prima and Professor Ky Fan for the helpful discussions of my work which I had with them at various times. For their financial support of my graduate tenure I wish to thank the National Science Foundation and Ford Foundation as well as the California Institute of Technology. It has been important to me that Caltech has been a most pleasant place to work. I have enjoyed living with the men of Fleming House for two years, and in the Department of Mathematics the faculty members have always been generous with their time and the secretaries pleasant to work around. iii ABSTRACT We are concerned with the class Iln of n><n complex matrices A for which the Hermitian part H(A) = A2A * is positive definite. Various connections are established with other classes such as the stable, D-stable and dominant diagonal matrices. For instance it is proved that if there exist positive diagonal matrices D, E such that DAE is either row dominant or column dominant and has positive diag onal entries, then there is a positive diagonal F such that FA E Iln. -
216 Section 6.1 Chapter 6 Hermitian, Orthogonal, And
216 SECTION 6.1 CHAPTER 6 HERMITIAN, ORTHOGONAL, AND UNITARY OPERATORS In Chapter 4, we saw advantages in using bases consisting of eigenvectors of linear opera- tors in a number of applications. Chapter 5 illustrated the benefit of orthonormal bases. Unfortunately, eigenvectors of linear operators are not usually orthogonal, and vectors in an orthonormal basis are not likely to be eigenvectors of any pertinent linear operator. There are operators, however, for which eigenvectors are orthogonal, and hence it is possible to have a basis that is simultaneously orthonormal and consists of eigenvectors. This chapter introduces some of these operators. 6.1 Hermitian Operators § When the basis for an n-dimensional real, inner product space is orthonormal, the inner product of two vectors u and v can be calculated with formula 5.48. If v not only represents a vector, but also denotes its representation as a column matrix, we can write the inner product as the product of two matrices, one a row matrix and the other a column matrix, (u, v) = uT v. If A is an n n real matrix, the inner product of u and the vector Av is × (u, Av) = uT (Av) = (uT A)v = (AT u)T v = (AT u, v). (6.1) This result, (u, Av) = (AT u, v), (6.2) allows us to move the matrix A from the second term to the first term in the inner product, but it must be replaced by its transpose AT . A similar result can be derived for complex, inner product spaces. When A is a complex matrix, we can use equation 5.50 to write T T T (u, Av) = uT (Av) = (uT A)v = (AT u)T v = A u v = (A u, v). -
Unitary-And-Hermitian-Matrices.Pdf
11-24-2014 Unitary Matrices and Hermitian Matrices Recall that the conjugate of a complex number a + bi is a bi. The conjugate of a + bi is denoted a + bi or (a + bi)∗. − In this section, I’ll use ( ) for complex conjugation of numbers of matrices. I want to use ( )∗ to denote an operation on matrices, the conjugate transpose. Thus, 3 + 4i = 3 4i, 5 6i =5+6i, 7i = 7i, 10 = 10. − − − Complex conjugation satisfies the following properties: (a) If z C, then z = z if and only if z is a real number. ∈ (b) If z1, z2 C, then ∈ z1 + z2 = z1 + z2. (c) If z1, z2 C, then ∈ z1 z2 = z1 z2. · · The proofs are easy; just write out the complex numbers (e.g. z1 = a+bi and z2 = c+di) and compute. The conjugate of a matrix A is the matrix A obtained by conjugating each element: That is, (A)ij = Aij. You can check that if A and B are matrices and k C, then ∈ kA + B = k A + B and AB = A B. · · You can prove these results by looking at individual elements of the matrices and using the properties of conjugation of numbers given above. Definition. If A is a complex matrix, A∗ is the conjugate transpose of A: ∗ A = AT . Note that the conjugation and transposition can be done in either order: That is, AT = (A)T . To see this, consider the (i, j)th element of the matrices: T T T [(A )]ij = (A )ij = Aji =(A)ji = [(A) ]ij. Example. If 1 2i 4 1 + 2i 2 i 3i ∗ − A = − , then A = 2 + i 2 7i . -
RANDOM DOUBLY STOCHASTIC MATRICES: the CIRCULAR LAW 3 Easily Be Bounded by a Polynomial in N
The Annals of Probability 2014, Vol. 42, No. 3, 1161–1196 DOI: 10.1214/13-AOP877 c Institute of Mathematical Statistics, 2014 RANDOM DOUBLY STOCHASTIC MATRICES: THE CIRCULAR LAW By Hoi H. Nguyen1 Ohio State University Let X be a matrix sampled uniformly from the set of doubly stochastic matrices of size n n. We show that the empirical spectral × distribution of the normalized matrix √n(X EX) converges almost − surely to the circular law. This confirms a conjecture of Chatterjee, Diaconis and Sly. 1. Introduction. Let M be a matrix of size n n, and let λ , . , λ be × 1 n the eigenvalues of M. The empirical spectral distribution (ESD) µM of M is defined as 1 µ := δ . M n λi i n X≤ We also define µcir as the uniform distribution over the unit disk 1 µ (s,t) := mes( z 1; (z) s, (z) t). cir π | |≤ ℜ ≤ ℑ ≤ Given a random n n matrix M, an important problem in random matrix theory is to study the× limiting distribution of the empirical spectral distri- bution as n tends to infinity. We consider one of the simplest random matrix ensembles, when the entries of M are i.i.d. copies of the random variable ξ. When ξ is a standard complex Gaussian random variable, M can be viewed as a random matrix drawn from the probability distribution P(dM)= 1 tr(MM ∗) 2 e− dM on the set of complex n n matrices. This is known as arXiv:1205.0843v3 [math.CO] 27 Mar 2014 πn × the complex Ginibre ensemble. -
PHYSICS 116A Homework 8 Solutions 1. Boas, Problem 3.9–4
PHYSICS 116A Homework 8 Solutions 1. Boas, problem 3.9–4. Given the matrix, 0 2i 1 − A = i 2 0 , − 3 0 0 find the transpose, the inverse, the complex conjugate and the transpose conjugate of A. Verify that AA−1 = A−1A = I, where I is the identity matrix. We shall evaluate A−1 by employing Eq. (6.13) in Ch. 3 of Boas. First we compute the determinant by expanding in cofactors about the third column: 0 2i 1 − i 2 det A i 2 0 =( 1) − =( 1)( 6) = 6 . ≡ − − 3 0 − − 3 0 0 Next, we evaluate the matrix of cofactors and take the transpose: 2 0 2i 1 2i 1 − − 0 0 − 0 0 2 0 0 0 2 i 0 0 1 0 1 adj A = − − − = 0 3 i . (1) − 3 0 3 0 − i 0 − 6 6i 2 i 2 0 2i 0 2i − − − 3 0 − 3 0 i 2 − According to Eq. (6.13) in Ch. 3 of Boas the inverse of A is given by Eq. (1) divided by det(A), so 1 0 0 3 −1 1 i A = 0 2 6 . (2) 1 i 1 − − 3 It is straightforward to check by matrix multiplication that 0 2i 1 0 0 1 0 0 1 0 2i 1 1 0 0 − 3 3 − i 2 0 0 1 i = 0 1 i i 2 0 = 0 1 0 . − 2 6 2 6 − 3 0 0 1 i 1 1 i 1 3 0 0 0 0 1 − − 3 − − 3 The transpose, complex conjugate and the transpose conjugate∗ can be written down by inspec- tion: 0 i 3 0 2i 1 0 i 3 − − − ∗ † AT = 2i 2 0 , A = i 2 0 , A = 2i 2 0 . -
Supplement: Symmetric and Hermitian Matrices
Supplement: Symmetric and Hermitian Matrices A Bunch of Definitions Definition: A real n × n matrix A is called symmetric if AT = A. Definition: A complex n × n matrix A is called Hermitian if A∗ = A, where A∗ = AT , the conjugate transpose. Definition: A complex n × n matrix A is called normal if A∗A = AA∗, i.e. commutes with its conjugate transpose. It is quite a surprising result that these three kinds of matrices are always diagonalizable; and moreover, one can construct an orthonormal basis (in standard inner product) for Rn/Cn, consisting of eigenvectors of A. Hence the matrix P that gives diagonalization A = P DP −1 will be orthogonal/unitary, namely: T −1 T Definition: An n × n real matrix P is called orthogonal if P P = In, i.e. P = P . ∗ −1 ∗ Definition: An n × n complex matrix P is called unitary if P P = In, i.e. P = P . Diagonalization using these special kinds of P will have special names: Definition: A matrix A is called orthogonally diagonalizable if A is similar to a diagonal matrix D with an orthogonal matrix P , i.e. A = P DP T . A matrix A is called unitarily diagonalizable if A is similar to a diagonal matrix D with a unitary matrix P , i.e. A = P DP ∗. Then we have the following big theorems: Theorem: Every real n × n symmetric matrix A is orthogonally diagonalizable Theorem: Every complex n × n Hermitian matrix A is unitarily diagonalizable. Theorem: Every complex n × n normal matrix A is unitarily diagonalizable. To prove the above results, it is convenient to introduce the concept of adjoint operator, which allows us to discuss effectively the \transpose" operation in a general inner product space. -
Symmetric and Hermitian Matrices
“notes2” i i 2013/4/9 page 73 i i Chapter 5 Symmetric and Hermitian Matrices In this chapter, we discuss the special classes of symmetric and Hermitian matrices. We will conclude the chapter with a few words about so-called Normal matrices. Before we begin, we mention one consequence of the last chapter that will be useful in a proof of the unitary diagonalization of Hermitian matrices. Let A be an m n matrix with m n, and assume (for the moment) that A has linearly independent× columns. Then≥ if the Gram-Schmidt process is applied to the columns of A, the result can be expressed in terms of a matrix factorization A = Q˜ R˜, where the orthogonal vectors are the columns in Q˜, and R˜ is unit upper triangular n n with the inner products as the entries in R˜. H × i 1 qk ai For example, consider that qi := ai k−=1 H qk. Rearranging the equa- − qk qk tion, P i 1 H n − qk ai ai = q + q + 0q i qH q k k kX=1 k k k=Xi+1 where the right hand side is a linear combination of the columns of the first i (only!) columns of the matrix Q˜. So this equation can be rewritten H q1 ai H q1 q1 H q2 ai H q2 q2 . H qi−1ai ˜ H ai = Q qi−1qi−1 . 1 0 . . 0 Putting such equations together for i = 1 ,...,n , we arrive at the desired result, A = Q˜ R˜. 73 “notes2” i i 2013/4/9 page 74 i i 74 Chapter5. -
Matrix Nearness Problems and Applications∗
Matrix Nearness Problems and Applications∗ Nicholas J. Higham† Abstract A matrix nearness problem consists of finding, for an arbitrary matrix A, a nearest member of some given class of matrices, where distance is measured in a matrix norm. A survey of nearness problems is given, with particular emphasis on the fundamental properties of symmetry, positive definiteness, orthogonality, normality, rank-deficiency and instability. Theoretical results and computational methods are described. Applications of nearness problems in areas including control theory, numerical analysis and statistics are outlined. Key words. matrix nearness problem, matrix approximation, symmetry, positive definiteness, orthogonality, normality, rank-deficiency, instability. AMS subject classifications. Primary 65F30, 15A57. 1 Introduction Consider the distance function (1.1) d(A) = min E : A + E S has property P ,A S, k k ∈ ∈ where S denotes Cm×n or Rm×n, is a matrix norm on S, and P is a matrix property k · k which defines a subspace or compact subset of S (so that d(A) is well-defined). Associated with (1.1) are the following tasks, which we describe collectively as a matrix nearness problem: Determine an explicit formula for d(A), or a useful characterisation. • ∗This is a reprint of the paper: N. J. Higham. Matrix nearness problems and applications. In M. J. C. Gover and S. Barnett, editors, Applications of Matrix Theory, pages 1–27. Oxford University Press, 1989. †Department of Mathematics, University of Manchester, Manchester, M13 9PL, England ([email protected]). 1 Determine X = A + E , where E is a matrix for which the minimum in (1.1) • min min is attained. -
Symmetric and Hermitian Matrices : a Geometric Perspective on Spectral Coupling
Symmetric and hermitian matrices : a geometric perspective on spectral coupling Franc¸oise Chatelin and M. Monserrat Rincon-Camacho Technical Report TR/PA/15/56 Publications of the Parallel Algorithms Team http://www.cerfacs.fr/algor/publications/ SYMMETRIC AND HERMITIAN MATRICES : A GEOMETRIC PERSPECTIVE ON SPECTRAL COUPLING CERFACS REPORT TR/PA/15/56 FRANC¸OISE CHATELIN (1);(2) AND M. MONSERRAT RINCON-CAMACHO (1) Abstract. The report develops the presentation begun in [Chatelin and Rincon-Camacho, 2015], of the information processing that can be performed by a general hermitian matrix of order n when two of its distinct eigenvalues are coupled, such 0 λ+λ0 λ0−λ as λ < λ . Setting a = 2 and e = 2 > 0, the new spectral information that is provided by e 0 jaj 0 coupling is expressed in terms of the ratios jaj (if λλ > 0) or e (if λλ < 0) and of the product jaje. The information is delivered in geometric form, both metric and trigonometric, associ- ated with various right-angled triangles deriving from optimality conditions on n−dimensional vectors. The paper contains a generalisation to indefinite matrices over R or C of Gustafson's operator trigonometry which in the matrix case assumes definiteness (mostly over R). Keywords: Spectral coupling, spectral chaining, hermitian matrix, indefinite symmetric or hermitian, spectral plane, observation angle, perspective angle, catchvector, antieigenvector, middle vector, Euler equation, balance equation, torus in 3D. 1. Composite natural phenomena 1.1. Introduction. It is well-known that many natural phenomena are composite: they are produced by the interaction of two (or more) simpler phenomena, where the interaction is in general nonlinear.