Finite-Dimensional Spectral Theory Part I: from Cn to the Schur Decomposition

Finite-dimensional spectral theory part I: from Cn to the Schur decomposition Ed Bueler MATH 617 Functional Analysis Spring 2020 Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 1 / 26 linear algebra versus functional analysis these slides are about linear algebra, i.e. vector spaces of finite dimension, and linear operators on those spaces, i.e. matrices one definition of functional analysis might be: “rigorous extension of linear algebra to 1-dimensional topological vector spaces” ◦ it is important to understand the finite-dimensional case! the goal of these part I slides is to prove the Schur decomposition and the spectral theorem for matrices good references for these slides: ◦ L. Trefethen & D. Bau, Numerical Linear Algebra, SIAM Press 1997 ◦ G. Strang, Introduction to Linear Algebra, 5th ed., Wellesley-Cambridge Press, 2016 ◦ G. Golub & C. van Loan, Matrix Computations, 4th ed., Johns Hopkins University Press 2013 Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 2 / 26 the spectrum of a matrix the spectrum σ(A) of a square matrix A is its set of eigenvalues ◦ reminder later about the definition of eigenvalues ◦ σ(A) is a subset of the complex plane C ◦ the plural of spectrum is spectra; the adjectival is spectral graphing σ(A) gives the matrix a personality ◦ example below: random, nonsymmetric, real 20 × 20 matrix 6 4 2 >> A = randn(20,20); ) >> lam = eig(A); λ 0 >> plot(real(lam),imag(lam),’o’) Im( >> grid on >> xlabel(’Re(\lambda)’) -2 >> ylabel(’Im(\lambda)’) -4 -6 -6 -4 -2 0 2 4 6 Re(λ) Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 3 / 26 Cn is an inner product space we use complex numbers C from now on ◦ why? because eigenvalues can be complex even for a real matrix ◦ recall: if α = x + iy 2 C then α = x − iy is the conjugate let Cn be the space of (column) vectors with complex entries: 2v13 = 6 . 7 v 4 . 5 vn definition. an inner product on Cn is a function n n h·; ·i : C × C ! C; almost-bilinear (sesquilinear1), with symmetry and positivity properties namely, for all u; v; w 2 Cn and α 2 C, ◦h w; u + vi = hw; ui + hw; vi ◦h u; αvi = α hu; vi ◦h u; vi = hv; ui ◦h u; ui ≥ 0, and hu; ui = 0 if and only if u = 0 1 kind of a joke, as it means “1 1 linear” 2 Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 4 / 26 norm and adjoint vector the inner product is conjugate-linear in its first position: ◦h u + v; wi = hw; u + vi = hw; ui + hw; vi = hu; wi + hv; wi ◦h αu; vi = hv; αui = αhv; ui = α hu; vi definition. an inner product h·; ·i induces a norm k · k : Cn ! R: p kuk = hu; ui the hermitian transpose (adjoint) of a vector v 2 Cn is the row vector ∗ v = [v1;:::; vn] the usual inner product is just a matrix product on Cn: 2 3 v1 ∗ ··· 6 . 7 hu; vi = u v = u1 un 4 . 5 vn Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 5 / 26 linear-dependence, span, basis m n definition. a (finite) set of vectors fvi gi=1 ⊂ C is linearly-dependent if there exist scalars αi 2 C, not all zero, so that α1v1 + ··· + αmvm = 0 ◦ a set of vectors is linearly-independent if it is not linearly-dependent ◦ vectors in a linearly-independent set are nonzero m n n definition. a finite set of vectors fvi gi=1 span C if for any w 2 C there exist scalars αi so that w = α1v1 + ··· + αmvm lemma.2 m n ◦ if fvi gi=1 ⊂ C is linearly-independent then m ≤ n m n n ◦ if fvi gi=1 ⊂ C spans C then m ≥ n m n definition. a finite set of vectors fvi gi=1 ⊂ C is a basis if the set is linearly-independent and it spans Cn ◦ by the lemma, m = n ◦ the dimension n is well-defined as the number of elements in a basis 2https://en.wikipedia.org/wiki/Steinitz_exchange_lemma Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 6 / 26 linear operators and matrices definition. A : Cn ! Cn is linear if A(αu + βv) = αA(u) + βA(v) ◦ we call such a function a linear operator and we write Au = A(u) given a basis, one may represent a linear operator as a (square) matrix ◦ matrix multiplication is just function composition definition. A is invertible if there exists B so that AB = BA = I lemma. matrix A is invertible if and only if its columns form a basis Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 7 / 26 meanings of matrix multiplication the “purpose” of a matrix A 2 Cn×m is to multiply vectors u 2 Cm ◦ a matrix is merely the representation of a linear operator ◦ operations on entries, e.g. det() or row operations, are less fundamental when you see “Au” remember two views: 2 3 2 3 r1 r1u 6 r2 7 6r2u7 Au = 6 7 u = 6 7 6 . 7 6 . 7 4 . 5 4 . 5 rn rnu 2u 3 2 3 1 2 3 2 3 2 3 6u2 7 Au = a a ::: a 6 7 = u a + u a + ··· + u a 4 1 2 m 5 6 . 7 1 4 15 2 4 25 m 4 m5 4 . 5 um ◦ A acts on u acts from the left; each entry is an inner product ◦ u acts from the right on A; get a linear-combination of columns also: if A 2 Cn×n is invertible then A−1w computes the coefficients of w in the basis of columns of A Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 8 / 26 adjoints, orthonormal bases, and unitary matrices definition. the hermitian transpose or adjoint of A 2 Cn×m is A∗ 2 Cm×n: 2 3 2 3 a11 a12 ::: a1m a11 a21 ::: an1 6a21 a22 a2m7 6a12 a22 an2 7 A = 6 7 ! A∗ = 6 7 6 . .. 7 6 . .. 7 4 . 5 4 . 5 an1 an2 ::: anm a1m a2m ::: anm n n definition. a basis fvi gi=1 of C is orthonormal (ON) if ( ∗ 1; i = j hvi ; vj i = vi vj = δij = 0; i 6= j ◦ “ortho” means hvi ; vj i = 0 if i 6= j and “normal” means kvi k = 1 definition. a matrix U is unitary if U∗U = I, thus U−1 = U∗ lemma. a matrix U is unitary if and only if its columns form an ON basis proof. The entries of a matrix product are inner products between the rows of the left factor and the columns of the right factor. The entries of I are δij . Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 9 / 26 Gram-Schmidt process m n given a set of vectors fwi gi=1 ⊂ C we can generate new orthonormal vectors which span the same subspace n ◦ if the original set spans C then the result is an ON basis formulas: v~ = w1 ! v1 = v~=kv~k v~ = w2 − hv1; w2i v1 ! v2 = v~=kv~k v~ = w3 − hv1; w3i v1 − hv2; w3i v2 ! v3 = v~=kv~k v~ = w4 − hv1; w4i v1 − hv2; w4i v2 − hv3; w4i v3 ! v4 = v~=kv~k . ◦ exception: if v~ = 0 at any stage then we ignore that wi and skip to wi+1 ◦ notice the triangular structure lemma. ◦f vi g is ON ◦ spanfvi g = spanfwi g m n n ◦ if fwi gi=1 spans C , thus m ≥ n, then fvi gi=1 is an ON basis Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 10 / 26 Gram-Schmidt is QR the Gram-Schmidt formulas are of the form i−1 ∗ X αi vi = wi − βji vj j=1 where αi = kv~k is a normalization constant and βji = hvj ; wi i moving the vi to the left in ∗, and writing vectors as columns gives 2α β β : : : β 3 2 3 1 12 13 1m 2 3 6 0 α2 β23 β2m7 v v ::: v 6 7 = w w ::: w 4 1 2 m 5 6 . .. 7 4 1 2 m 5 4 . 5 0 0 : : : αm this is a “reduced” QR decomposition Q^ R^ = A ◦ A is n × m and contains the original vectors wi , ^ ◦ Q is the same size as A and contains the ON vectors vi , ◦ and R^ is (upper-) right-triangular and m × m, thus small if m n if m = n and columns of A span Cn (A has full rank) then Q is unitary methods: numerical QR uses Householder, not Gram-Schmidt, for stability reasons Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 11 / 26 Gram-Schmidt process: example 1 suppose we have m = 3 vectors in C3: 293 213 263 w1 = 435 ; w2 = 465 ; w3 = 475 4 9 3 applying the formulas on slide 10: 20:874163 2−0:484563 2−0:032473 v1 = 40:291395 ; v2 = 4 0:46984 5 ; v3 = 4 0:83327 5 0:38851 0:73787 −0:55191 compare this MATLAB calculation: >> A = [9 1 6; 3 6 7; 4 9 3]; >> [Q,R] = qr(A) Q = -0.87416 0.48456 0.032465 -0.29139 -0.46984 -0.83327 -0.38851 -0.73787 0.55191 R = -10.296 -6.1191 -8.4502 0 -8.9753 -2.5952 0 0 -3.9824 Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 12 / 26 Gram-Schmidt process: example 2 what is this MATLAB calculation doing?: >> x = (-1:.01:1)’; >> A = [x.^0 x.^1 x.^2 x.^3 x.^4]; >> size(A) ans = 201 5 >> [Q,R] = qr(A,0); >> size(Q) ans = 201 5 0.1 >> plot(x,Q), xlabel x >> axis tight, grid on 0 -0.1 -0.2 -1 -0.5 0 0.5 1 x the plot shows Legendre polynomials up to degree 4 Ed Bueler (MATH 617) Finite-dimensional spectral theory Spring 2020 13 / 26 technique: extend to an ON basis a key technique, for proofs related to spectral theory, is to extend m < n ON vectors to an ON basis n n let fei gi=1 be the standard basis of C , with (ei )j = δij method.

Finite-Dimensional Spectral Theory Part I: from Cn to the Schur Decomposition

The Generalized Triangular Decomposition

CSE 275 Matrix Computation

The Schur Decomposition Week 5 UCSB 2014

The Non–Symmetric Eigenvalue Problem

Parallel Eigenvalue Reordering in Real Schur Forms∗

The Unsymmetric Eigenvalue Problem

Schur Triangularization and the Spectral Decomposition(S)

Introduction to QR-Method SF2524 - Matrix Computations for Large-Scale Systems

10. Schur Decomposition

Section 6.1.2: the Schur Decomposition 6.1.3: Nonunitary Decompositions

Comparison of Numerical Methods and Open-Source Libraries for Eigenvalue Analysis of Large-Scale Power Systems

The Jordan Canonical Form Weeks 9-10 UCSB 2014