Change of Basis Consider a Linear Transform, PB , and Its Inverse, P B

Total Page:16

File Type:pdf, Size:1020Kb

Change of Basis Consider a Linear Transform, PB , and Its Inverse, P B Change of Basis Consider a linear transform, PB, and its −1 inverse, PB , which map a vector back and forth between its representation in the standard basis and its representation in the basis, B: PB −→ u ← [u]B . −1 PB Change of Basis (contd). Let B consist of N basis vectors, b1 ...bN. Since [u]B is the representation of u in B, it follows that ... u = ([u]B)1 b1 +([u]B)2 b2 + ([u]B)N bN But this is just the matrix vector prod- uct u = B[u]B where B = b1 b2 ... bN . −1 −1 We see that PB = B and PB = B. Similarity Transforms Now consider a linear transformation rep- resented in the standard basis by the ma- trix A. We seek [A]B, i.e., the represen- tation of the corresponding linear trans- formation in the basis B: u −→A Au ↑ B ↓ B−1 [A]B [u]B −→ [Au]B The matrix we seek maps [u]B into [Au]B. From the above diagram, we see that this matrix is the composition of B, A, and B−1: −1 [A]B = B AB. We say that A and [A]B are related by a similarity transform. Diag. of Symmetric Matrices Because of linearity, one might expect that a transformation will have an espe- cially simple representation in the ba- sis of its eigenvectors, X . Let A be its representation in the standard basis and let the columns of X be the eigenvec- tors of A. Then X and XT = X−1 take us back and forth between the standard basis and X : XT −→ u ←− [u]X . X Diag. of Symmetric Matrices (contd.) The matrix we seek maps [u]X into [Au]X : u −→A Au ↑ X ↓ XT [A]X [u]X −→ [Au]X From the above diagram, we see that this matrix is the composition of X, A, and XT: Λ = XTAX. We observe that Λ is diagonal with Λii = λi, the eigenvalue of A associated with eigenvector, xi. Spectral Thm. for Sym. Matrices Any symmetric N × N matrix, A, with N distinct eigenvalues, can be factored as follows: A = XΛXT where Λ is N × N and diagonal, X and XT are N ×N matrices, and the i-th col- umn of X (equal to the i-th row of XT) is an eigenvector of A: λixi = Axi with eigenvalue Λii = λi. Note that xi is orthogonal to x j when i =6 j: 1 if i = j XXT = δ = ij ij 0 otherwise. In other words, XXT = I. Consequently, XT = X−1. Spectral Thm. for Sym. Matrices (contd). Using the definition of matrix product and the fact that Λ is diagonal, we can write A = XΛXT as N Λ T . (A)ij = ∑ (X)ik kk X kj k=1 Since X = x1 x2 ... xN and Λkk = λk N λ (A)ij = ∑ (xk)i k (xk) j k=1 N λ T = ∑ kxkxk ij k=1 N λ T A = ∑ kxkxk k=1 where λkxk = Axk. Spectral Thm. for Sym. Matrices (contd). The spectral factorization of A is: λ T λ T λ T . A = 1x1x1 + 2x2x2 + ··· + NxNxN λ T Note that each nxnxn is a rank one ma- λ T trix. Let Ai = ixixi . Now, because T xi xi = 1: λ λ T ixi = ixixi xi = Aixi i.e., xi is the only eigenvector of Ai and its only eigenvalue is λi. Diag. of Non-symmetric Matrices The situation is more complex when the transformation is represented by a non- symmetric matrix, P. Let the columns of X be P’s right eigenvectors and the rows of YT be its left eigenvectors. Then X and YT = X−1 take us back and forth between the standard basis and X : YT −→ u ←− [u]X . X Diag. of Non-symmetric Matrices (contd.) The matrix we seek maps [u]X into [Pu]X : u −→P Pu ↑ X ↓ YT Λ [u]X −→ [Pu]X From the above diagram, we see that this matrix is the composition of X, P, and YT: Λ = YTPX We observe that Λ is diagonal with Λii = λi, the eigenvalue of P associated with right eigenvector, xi, and left eigenvec- tor, yi. Spectral Theorem Any N × N matrix, P, with N distinct eigenvalues, can be factored as follows: P = XΛYT where Λ is N × N and diagonal, X and YT are N ×N matrices, and the i-th col- umn of X is a right eigenvector of P: λixi = Pxi with eigenvalue Λii = λi and the i-th row of YT is a left eigenvector of P: λ T T iyi = yi P with the same eigenvalue. Spectral Theorem (contd.) Note that xi is orthogonal to y j when i =6 j: 1 if i = j XYT = δ = ij ij 0 otherwise. In other words, XYT = I. Consequently, YT = X−1. Spectral Theorem (contd). The spectral factorization of P is: λ T λ T λ T . P = 1x1y1 + 2x2y2 + ··· + NxNyN λ T Note that each nxnyn is a rank one ma- λ T trix. Let Pi = ixiyi . Now, because T yi xi = 1: λ λ T ixi = ixiyi xi = Pixi and λ T T λ T iyi = yi ixiyi T = yi Pi i.e., xi and yi are the sole right and left eigenvectors of Pi. The only eigenvalue is λi. Stochastic Matrices If P is stochastic, then 1 = ∑Pij. i Let yT = 1 1 ... 1 . It follows that T T . y j = ∑ y i Pij i Consequently, yT is a left eigenvector with unit eigenvalue of every stochastic matrix: yT = yTP. Stochastic Matrices (contd.) What is the representation of an arbi- trary distribution, z, in the basis of eigen- vectors of an arbitrary stochastic ma- trix, P? z = c1x1 + c2x1 + ··· + cNxN = Xc. Solving for c: c = X−1z = YTz. T ... Since y1 = 1 1 1 , it follows that: T c1 = ∑Y1 j z j = ∑z j = 1 j j which is independent of the specific z and P!.
Recommended publications
  • 21. Orthonormal Bases
    21. Orthonormal Bases The canonical/standard basis 011 001 001 B C B C B C B0C B1C B0C e1 = B.C ; e2 = B.C ; : : : ; en = B.C B.C B.C B.C @.A @.A @.A 0 0 1 has many useful properties. • Each of the standard basis vectors has unit length: q p T jjeijj = ei ei = ei ei = 1: • The standard basis vectors are orthogonal (in other words, at right angles or perpendicular). T ei ej = ei ej = 0 when i 6= j This is summarized by ( 1 i = j eT e = δ = ; i j ij 0 i 6= j where δij is the Kronecker delta. Notice that the Kronecker delta gives the entries of the identity matrix. Given column vectors v and w, we have seen that the dot product v w is the same as the matrix multiplication vT w. This is the inner product on n T R . We can also form the outer product vw , which gives a square matrix. 1 The outer product on the standard basis vectors is interesting. Set T Π1 = e1e1 011 B C B0C = B.C 1 0 ::: 0 B.C @.A 0 01 0 ::: 01 B C B0 0 ::: 0C = B. .C B. .C @. .A 0 0 ::: 0 . T Πn = enen 001 B C B0C = B.C 0 0 ::: 1 B.C @.A 1 00 0 ::: 01 B C B0 0 ::: 0C = B. .C B. .C @. .A 0 0 ::: 1 In short, Πi is the diagonal square matrix with a 1 in the ith diagonal position and zeros everywhere else.
    [Show full text]
  • Coordinatization
    MATH 355 Supplemental Notes Coordinatization Coordinatization In R3, we have the standard basis i, j and k. When we write a vector in coordinate form, say 3 v 2 , (1) “ »´ fi 5 — ffi – fl it is understood as v 3i 2j 5k. “ ´ ` The numbers 3, 2 and 5 are the coordinates of v relative to the standard basis ⇠ i, j, k . It has p´ q “p q always been understood that a coordinate representation such as that in (1) is with respect to the ordered basis ⇠. A little thought reveals that it need not be so. One could have chosen the same basis elements in a di↵erent order, as in the basis ⇠ i, k, j . We employ notation indicating the 1 “p q coordinates are with respect to the di↵erent basis ⇠1: 3 v 5 , to mean that v 3i 5k 2j, r s⇠1 “ » fi “ ` ´ 2 —´ ffi – fl reflecting the order in which the basis elements fall in ⇠1. Of course, one could employ similar notation even when the coordinates are expressed in terms of the standard basis, writing v for r s⇠ (1), but whenever we have coordinatization with respect to the standard basis of Rn in mind, we will consider the wrapper to be optional. r¨s⇠ Of course, there are many non-standard bases of Rn. In fact, any linearly independent collection of n vectors in Rn provides a basis. Say we take 1 1 1 4 ´ ´ » 0fi » 1fi » 1fi » 1fi u , u u u ´ . 1 “ 2 “ 3 “ 4 “ — 3ffi — 1ffi — 0ffi — 2ffi — ffi —´ ffi — ffi — ffi — 0ffi — 4ffi — 2ffi — 1ffi — ffi — ffi — ffi —´ ffi – fl – fl – fl – fl As per the discussion above, these vectors are being expressed relative to the standard basis of R4.
    [Show full text]
  • MATH 304 Linear Algebra Lecture 14: Basis and Coordinates. Change of Basis
    MATH 304 Linear Algebra Lecture 14: Basis and coordinates. Change of basis. Linear transformations. Basis and dimension Definition. Let V be a vector space. A linearly independent spanning set for V is called a basis. Theorem Any vector space V has a basis. If V has a finite basis, then all bases for V are finite and have the same number of elements (called the dimension of V ). Example. Vectors e1 = (1, 0, 0,..., 0, 0), e2 = (0, 1, 0,..., 0, 0),. , en = (0, 0, 0,..., 0, 1) form a basis for Rn (called standard) since (x1, x2,..., xn) = x1e1 + x2e2 + ··· + xnen. Basis and coordinates If {v1, v2,..., vn} is a basis for a vector space V , then any vector v ∈ V has a unique representation v = x1v1 + x2v2 + ··· + xnvn, where xi ∈ R. The coefficients x1, x2,..., xn are called the coordinates of v with respect to the ordered basis v1, v2,..., vn. The mapping vector v 7→ its coordinates (x1, x2,..., xn) is a one-to-one correspondence between V and Rn. This correspondence respects linear operations in V and in Rn. Examples. • Coordinates of a vector n v = (x1, x2,..., xn) ∈ R relative to the standard basis e1 = (1, 0,..., 0, 0), e2 = (0, 1,..., 0, 0),. , en = (0, 0,..., 0, 1) are (x1, x2,..., xn). a b • Coordinates of a matrix ∈ M2,2(R) c d 1 0 0 0 0 1 relative to the basis , , , 0 0 1 0 0 0 0 0 are (a, c, b, d). 0 1 • Coordinates of a polynomial n−1 p(x) = a0 + a1x + ··· + an−1x ∈Pn relative to 2 n−1 the basis 1, x, x ,..., x are (a0, a1,..., an−1).
    [Show full text]
  • Vectors, Change of Basis and Matrix Representation: Onto-Semiotic Approach in the Analysis of Creating Meaning
    International Journal of Mathematical Education in Science and Technology ISSN: 0020-739X (Print) 1464-5211 (Online) Journal homepage: http://www.tandfonline.com/loi/tmes20 Vectors, change of basis and matrix representation: onto-semiotic approach in the analysis of creating meaning Mariana Montiel , Miguel R. Wilhelmi , Draga Vidakovic & Iwan Elstak To cite this article: Mariana Montiel , Miguel R. Wilhelmi , Draga Vidakovic & Iwan Elstak (2012) Vectors, change of basis and matrix representation: onto-semiotic approach in the analysis of creating meaning, International Journal of Mathematical Education in Science and Technology, 43:1, 11-32, DOI: 10.1080/0020739X.2011.582173 To link to this article: http://dx.doi.org/10.1080/0020739X.2011.582173 Published online: 01 Aug 2011. Submit your article to this journal Article views: 174 View related articles Full Terms & Conditions of access and use can be found at http://www.tandfonline.com/action/journalInformation?journalCode=tmes20 Download by: [Universidad Pública de Navarra - Biblioteca] Date: 22 January 2017, At: 10:06 International Journal of Mathematical Education in Science and Technology, Vol. 43, No. 1, 15 January 2012, 11–32 Vectors, change of basis and matrix representation: onto-semiotic approach in the analysis of creating meaning Mariana Montiela*, Miguel R. Wilhelmib, Draga Vidakovica and Iwan Elstaka aDepartment of Mathematics and Statistics, Georgia State University, Atlanta, GA, USA; bDepartment of Mathematics, Public University of Navarra, Pamplona 31006, Spain (Received 26 July 2010) In a previous study, the onto-semiotic approach was employed to analyse the mathematical notion of different coordinate systems, as well as some situations and university students’ actions related to these coordinate systems in the context of multivariate calculus.
    [Show full text]
  • Orthonormal Bases
    Math 108B Professor: Padraic Bartlett Lecture 3: Orthonormal Bases Week 3 UCSB 2014 In our last class, we introduced the concept of \changing bases," and talked about writ- ing vectors and linear transformations in other bases. In the homework and in class, we saw that in several situations this idea of \changing basis" could make a linear transformation much easier to work with; in several cases, we saw that linear transformations under a certain basis would become diagonal, which made tasks like raising them to large powers far easier than these problems would be in the standard basis. But how do we find these \nice" bases? What does it mean for a basis to be\nice?" In this set of lectures, we will study one potential answer to this question: the concept of an orthonormal basis. 1 Orthogonality To start, we should define the notion of orthogonality. First, recall/remember the defini- tion of the dot product: n Definition. Take two vectors (x1; : : : xn); (y1; : : : yn) 2 R . Their dot product is simply the sum x1y1 + x2y2 + : : : xnyn: Many of you have seen an alternate, geometric definition of the dot product: n Definition. Take two vectors (x1; : : : xn); (y1; : : : yn) 2 R . Their dot product is the product jj~xjj · jj~yjj cos(θ); where θ is the angle between ~x and ~y, and jj~xjj denotes the length of the vector ~x, i.e. the distance from (x1; : : : xn) to (0;::: 0). These two definitions are equivalent: 3 Theorem. Let ~x = (x1; x2; x3); ~y = (y1; y2; y3) be a pair of vectors in R .
    [Show full text]
  • Notes on Change of Bases Northwestern University, Summer 2014
    Notes on Change of Bases Northwestern University, Summer 2014 Let V be a finite-dimensional vector space over a field F, and let T be a linear operator on V . Given a basis (v1; : : : ; vn) of V , we've seen how we can define a matrix which encodes all the information about T as follows. For each i, we can write T vi = a1iv1 + ··· + anivn 2 for a unique choice of scalars a1i; : : : ; ani 2 F. In total, we then have n scalars aij which we put into an n × n matrix called the matrix of T relative to (v1; : : : ; vn): 0 1 a11 ··· a1n B . .. C M(T )v := @ . A 2 Mn;n(F): an1 ··· ann In the notation M(T )v, the v showing up in the subscript emphasizes that we're taking this matrix relative to the specific bases consisting of v's. Given any vector u 2 V , we can also write u = b1v1 + ··· + bnvn for a unique choice of scalars b1; : : : ; bn 2 F, and we define the coordinate vector of u relative to (v1; : : : ; vn) as 0 1 b1 B . C n M(u)v := @ . A 2 F : bn In particular, the columns of M(T )v are the coordinates vectors of the T vi. Then the point of the matrix M(T )v is that the coordinate vector of T u is given by M(T u)v = M(T )vM(u)v; so that from the matrix of T and the coordinate vectors of elements of V , we can in fact reconstruct T itself.
    [Show full text]
  • Eigenvalues and Eigenvectors MAT 67L, Laboratory III
    Eigenvalues and Eigenvectors MAT 67L, Laboratory III Contents Instructions (1) Read this document. (2) The questions labeled \Experiments" are not graded, and should not be turned in. They are designed for you to get more practice with MATLAB before you start working on the programming problems, and they reinforce mathematical ideas. (3) A subset of the questions labeled \Problems" are graded. You need to turn in MATLAB M-files for each problem via Smartsite. You must read the \Getting started guide" to learn what file names you must use. Incorrect naming of your files will result in zero credit. Every problem should be placed is its own M-file. (4) Don't forget to have fun! Eigenvalues One of the best ways to study a linear transformation f : V −! V is to find its eigenvalues and eigenvectors or in other words solve the equation f(v) = λv ; v 6= 0 : In this MATLAB exercise we will lead you through some of the neat things you can to with eigenvalues and eigenvectors. First however you need to teach MATLAB to compute eigenvectors and eigenvalues. Lets briefly recall the steps you would have to perform by hand: As an example lets take the matrix of a linear transformation f : R3 ! R3 to be (in the canonical basis) 01 2 31 M := @2 4 5A : 3 5 6 The steps to compute eigenvalues and eigenvectors are (1) Calculate the characteristic polynomial P (λ) = det(M − λI) : (2) Compute the roots λi of P (λ). These are the eigenvalues. (3) For each of the eigenvalues λi calculate ker(M − λiI) : The vectors of any basis for for ker(M − λiI) are the eigenvectors corresponding to λi.
    [Show full text]
  • Reciprocal Frame Vectors
    Reciprocal Frame Vectors Peeter Joot March 29, 2008 1 Approach without Geometric Algebra. Without employing geometric algebra, one can use the projection operation expressed as a dot product and calculate the a vector orthogonal to a set of other vectors, in the direction of a reference vector. Such a calculation also yields RN results in terms of determinants, and as a side effect produces equations for parallelogram area, parallelopiped volume and higher dimensional analogues as a side effect (without having to employ change of basis diagonalization arguments that don’t work well for higher di- mensional subspaces). 1.1 Orthogonal to one vector The simplest case is the vector perpendicular to another. In anything but R2 there are a whole set of such vectors, so to express this as a non-set result a reference vector is required. Calculation of the coordinate vector for this case follows directly from the dot product. Borrowing the GA term, we subtract the projection to calculate the rejection. Rejuˆ (v) = v − v · uˆ uˆ 1 = (vu2 − v · uu) u2 1 = v e u u − v u u e u2 ∑ i i j j j j i i 1 vi vj = ∑ ujei u2 ui uj 1 ui uj = (u e − u e ) 2 ∑ i j j i v v u i<j i j Thus we can write the rejection of v from uˆ as: 1 1 ui uj ui uj Rej (v) = (1) uˆ 2 ∑ v v e e u i<j i j i j Or introducing some shorthand: uv ui uj Dij = vi vj ue ui uj Dij = ei ej equation 1 can be expressed in a form that will be slightly more convient for larger sets of vectors: 1 ( ) = uv ue Rejuˆ v 2 ∑ Dij Dij (2) u i<j Note that although the GA axiom u2 = u · u has been used in equations 1 and 2 above and the derivation, that was not necessary to prove them.
    [Show full text]
  • Inner Product Spaces
    Part VI Inner Product Spaces 475 Section 27 The Dot Product in Rn Focus Questions By the end of this section, you should be able to give precise and thorough answers to the questions listed below. You may want to keep these questions in mind to focus your thoughts as you complete the section. What is the dot product of two vectors? Under what conditions is the dot • product defined? How do we find the angle between two nonzero vectors in Rn? • How does the dot product tell us if two vectors are orthogonal? • How do we define the length of a vector in any dimension and how can the • dot product be used to calculate the length? How do we define the distance between two vectors? • What is the orthogonal projection of a vector u in the direction of the vector • v and how do we find it? What is the orthogonal complement of a subspace W of Rn? • Application: Hidden Figures in Computer Graphics In video games, the speed at which a computer can render changing graphics views is vitally im- portant. To increase a computer’s ability to render a scene, programs often try to identify those parts of the images a viewer could see and those parts the viewer could not see. For example, in a scene involving buildings, the viewer could not see any images blocked by a solid building. In the mathematical world, this can be visualized by graphing surfaces. In Figure 27.1 we see a crude image of a house made up of small polygons (this is how programs generally represent surfaces).
    [Show full text]
  • Introduction to Geometric Algebra Lecture IV
    Introduction to Geometric Algebra Lecture IV Leandro A. F. Fernandes Manuel M. Oliveira [email protected] [email protected] Visgraf - Summer School in Computer Graphics - 2010 CG UFRGS Lecture IV Checkpoint Visgraf - Summer School in Computer Graphics - 2010 2 Checkpoint, Lecture I Multivector space Non-metric products . The outer product . The regressive product c Visgraf - Summer School in Computer Graphics - 2010 3 Checkpoint, Lecture II Metric spaces . Bilinear form defines a metric on the vector space, e.g., Euclidean metric . Metric matrix Some inner products The scalar product is a particular . Inner product of vectors case of the left and right contractions . Scalar product . Left contraction These metric products are . Right contraction backward compatible for 1-blades Visgraf - Summer School in Computer Graphics - 2010 4 Checkpoint, Lecture II Dualization Undualization By taking the undual, the dual representation of a blade can be correctly mapped back to its direct representation Venn Diagrams Visgraf - Summer School in Computer Graphics - 2010 5 Checkpoint, Lecture III Duality relationships between products . Dual of the outer product . Dual of the left contraction Visgraf - Summer School in Computer Graphics - 2010 6 Checkpoint, Lecture III Some non-linear products . Meet of blades . Join of blades . Delta product of blades Visgraf - Summer School in ComputerVenn GraphicsDiagrams - 2010 7 Today Lecture IV – Mon, January 18 . Geometric product . Versors . Rotors Visgraf - Summer School in Computer Graphics
    [Show full text]
  • September 11, 2013 III. the DUAL SPACE
    September 11, 2013 III. THE DUAL SPACE RODICA D. COSTIN Contents 1. The dual of a vector space 1 1.1. Linear functionals 1 1.2. The dual space 2 1.3. Dual basis 3 1.4. Linear functionals as covectors; change of basis 4 1.5. Functionals and hyperplanes 4 1.6. Application: interpolation of sampled data 5 1.7. Orthogonality 7 1.8. The bidual 7 1.9. The transpose of a linear transformation 8 1. The dual of a vector space 1.1. Linear functionals. Let V be a vector space over the scalar field F = R or C. Recall that linear functionals are particular cases of linear transformation, namely those whose values are in the scalar field (which is a one dimensional vector space): Definition 1. A linear functional on V is a linear transformation with scalar values: φ : V ! F . We denote φ(x) ≡ (φ, x). Notation used in quantum mechanics: Dirac's bracket notation (φ, x) ≡ hφjxi In this notation, a functional φ is rather denoted hφj and called bra-, while a vector x 2 V is rather denoted jxi and called -ket; they combine to hφjxi, a bra-ket. Example in finite dimensions. If M is a row matrix, M = [a1; : : : ; an] n where aj 2 F , then φ : F ! F defined by matrix multiplication, φ(x) = Mx, is a linear functional on F n. These are, essentially, all the linear func- tionals on a finite dimensional vector space. 1 2 RODICA D. COSTIN Indeed, the matrix associated to a linear functional φ : V ! F in a basis BV = fv1;:::; vng of V , and the (standard) basis f1g of F is the row vector [φ1; : : : ; φn]; where φj = (φ, vj) P P P P If x = j xjvj 2 V then (φ, x) = (φ, j xjvj) = j xj(φ, vj) = j xjφj hence 2 3 x1 6 .
    [Show full text]
  • Change of Basis and Diagonalization (Note/Linalg
    EECS 16A Designing Information Devices and Systems I Spring 2016 Official Lecture Notes Note 21 Introduction In this lecture note, we will introduce the last topics of this semester, change of basis and diagonalization. We will introduce the mathematical foundations for these two topics. Although we will not go through its application this semester, this is an important concept that we will use to analyze signals and linear time-invariant systems in EE16B. By the end of these notes, you should be able to: 1. Change a vector from one basis to another. 2. Take a matrix representation for a linear transformation in one basis and express that linear transfor- mation in another basis. 3. Understand the importance of a diagonalizing basis and its properties. 4. Identify if a matrix is diagonalizable and if so, to diagonalize it. Change of Basis for Vectors Previously, we have seen that matrices can be interpreted as linear transformations between vector spaces. In particular, an m×n matrix A can be viewed as a function A : U ! V mapping a vector~u from vector space n m U 2 R to a vector A~u in vector space V 2 R . In this note, we explore a different interpretation of square, invertible matrices as a change of basis. " # " # u 4 Let’s first start with an example. Consider the vector ~u = 1 = . When we write a vector in this form, u2 3 " # " # 1 0 implicitly we are representing it in the standard basis for 2, ~e = and ~e = . This means that we R 1 0 2 1 2 can write~u = 4~e1 +3~e2.
    [Show full text]