APPENDIX a Matrix Algebra
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Stat 5102 Notes: Regression
Stat 5102 Notes: Regression Charles J. Geyer April 27, 2007 In these notes we do not use the “upper case letter means random, lower case letter means nonrandom” convention. Lower case normal weight letters (like x and β) indicate scalars (real variables). Lowercase bold weight letters (like x and β) indicate vectors. Upper case bold weight letters (like X) indicate matrices. 1 The Model The general linear model has the form p X yi = βjxij + ei (1.1) j=1 where i indexes individuals and j indexes different predictor variables. Ex- plicit use of (1.1) makes theory impossibly messy. We rewrite it as a vector equation y = Xβ + e, (1.2) where y is a vector whose components are yi, where X is a matrix whose components are xij, where β is a vector whose components are βj, and where e is a vector whose components are ei. Note that y and e have dimension n, but β has dimension p. The matrix X is called the design matrix or model matrix and has dimension n × p. As always in regression theory, we treat the predictor variables as non- random. So X is a nonrandom matrix, β is a nonrandom vector of unknown parameters. The only random quantities in (1.2) are e and y. As always in regression theory the errors ei are independent and identi- cally distributed mean zero normal. This is written as a vector equation e ∼ Normal(0, σ2I), where σ2 is another unknown parameter (the error variance) and I is the identity matrix. This implies y ∼ Normal(µ, σ2I), 1 where µ = Xβ. -
Unsupervised and Supervised Principal Component Analysis: Tutorial
Unsupervised and Supervised Principal Component Analysis: Tutorial Benyamin Ghojogh [email protected] Department of Electrical and Computer Engineering, Machine Learning Laboratory, University of Waterloo, Waterloo, ON, Canada Mark Crowley [email protected] Department of Electrical and Computer Engineering, Machine Learning Laboratory, University of Waterloo, Waterloo, ON, Canada Abstract space and manifold learning (Friedman et al., 2009). This This is a detailed tutorial paper which explains method, which is also used for feature extraction (Ghojogh the Principal Component Analysis (PCA), Su- et al., 2019b), was first proposed by (Pearson, 1901). In or- pervised PCA (SPCA), kernel PCA, and kernel der to learn a nonlinear sub-manifold, kernel PCA was pro- SPCA. We start with projection, PCA with eigen- posed, first by (Scholkopf¨ et al., 1997; 1998), which maps decomposition, PCA with one and multiple pro- the data to high dimensional feature space hoping to fall on jection directions, properties of the projection a linear manifold in that space. matrix, reconstruction error minimization, and The PCA and kernel PCA are unsupervised methods for we connect to auto-encoder. Then, PCA with sin- subspace learning. To use the class labels in PCA, super- gular value decomposition, dual PCA, and ker- vised PCA was proposed (Bair et al., 2006) which scores nel PCA are covered. SPCA using both scor- the features of the X and reduces the features before ap- ing and Hilbert-Schmidt independence criterion plying PCA. This type of SPCA were mostly used in bioin- are explained. Kernel SPCA using both direct formatics (Ma & Dai, 2011). and dual approaches are then introduced. -
DECOMPOSITION of SINGULAR MATRICES INTO IDEMPOTENTS 11 Us Show How to Construct Ai+1, Bi+1, Ci+1, Di+1
DECOMPOSITION OF SINGULAR MATRICES INTO IDEMPOTENTS ADEL ALAHMADI, S. K. JAIN, AND ANDRE LEROY Abstract. In this paper we provide concrete constructions of idempotents to represent typical singular matrices over a given ring as a product of idempo- tents and apply these factorizations for proving our main results. We generalize works due to Laffey ([12]) and Rao ([3]) to noncommutative setting and fill in the gaps in the original proof of Rao's main theorems (cf. [3], Theorems 5 and 7 and [4]). We also consider singular matrices over B´ezoutdomains as to when such a matrix is a product of idempotent matrices. 1. Introduction and definitions It was shown by Howie [10] that every mapping from a finite set X to itself with image of cardinality ≤ cardX − 1 is a product of idempotent mappings. Erd¨os[7] showed that every singular square matrix over a field can be expressed as a product of idempotent matrices and this was generalized by several authors to certain classes of rings, in particular, to division rings and euclidean domains [12]. Turning to singular elements let us mention two results: Rao [3] characterized, via continued fractions, singular matrices over a commutative PID that can be decomposed as a product of idempotent matrices and Hannah-O'Meara [9] showed, among other results, that for a right self-injective regular ring R, an element a is a product of idempotents if and only if Rr:ann(a) = l:ann(a)R= R(1 − a)R. The purpose of this paper is to provide concrete constructions of idempotents to represent typical singular matrices over a given ring as a product of idempotents and to apply these factorizations for proving our main results. -
Newton's Method for the Matrix Square Root*
MATHEMATICS OF COMPUTATION VOLUME 46, NUMBER 174 APRIL 1986, PAGES 537-549 Newton's Method for the Matrix Square Root* By Nicholas J. Higham Abstract. One approach to computing a square root of a matrix A is to apply Newton's method to the quadratic matrix equation F( X) = X2 - A =0. Two widely-quoted matrix square root iterations obtained by rewriting this Newton iteration are shown to have excellent mathematical convergence properties. However, by means of a perturbation analysis and supportive numerical examples, it is shown that these simplified iterations are numerically unstable. A further variant of Newton's method for the matrix square root, recently proposed in the literature, is shown to be, for practical purposes, numerically stable. 1. Introduction. A square root of an n X n matrix A with complex elements, A e C"x", is a solution X e C"*" of the quadratic matrix equation (1.1) F(X) = X2-A=0. A natural approach to computing a square root of A is to apply Newton's method to (1.1). For a general function G: CXn -* Cx", Newton's method for the solution of G(X) = 0 is specified by an initial approximation X0 and the recurrence (see [14, p. 140], for example) (1.2) Xk+l = Xk-G'{XkylG{Xk), fc = 0,1,2,..., where G' denotes the Fréchet derivative of G. Identifying F(X+ H) = X2 - A +(XH + HX) + H2 with the Taylor series for F we see that F'(X) is a linear operator, F'(X): Cx" ^ C"x", defined by F'(X)H= XH+ HX. -
Mathematics Study Guide
Mathematics Study Guide Matthew Chesnes The London School of Economics September 28, 2001 1 Arithmetic of N-Tuples • Vectors specificed by direction and length. The length of a vector is called its magnitude p 2 2 or “norm.” For example, x = (x1, x2). Thus, the norm of x is: ||x|| = x1 + x2. pPn 2 • Generally for a vector, ~x = (x1, x2, x3, ..., xn), ||x|| = i=1 xi . • Vector Order: consider two vectors, ~x, ~y. Then, ~x> ~y iff xi ≥ yi ∀ i and xi > yi for some i. ~x>> ~y iff xi > yi ∀ i. • Convex Sets: A set is convex if whenever is contains x0 and x00, it also contains the line segment, (1 − α)x0 + αx00. 2 2 Vector Space Formulations in Economics • We expect a consumer to have a complete (preference) order over all consumption bundles x in his consumption set. If he prefers x to x0, we write, x x0. If he’s indifferent between x and x0, we write, x ∼ x0. Finally, if he weakly prefers x to x0, we write x x0. • The set X, {x ∈ X : x xˆ ∀ xˆ ∈ X}, is a convex set. It is all bundles of goods that make the consumer at least as well off as with his current bundle. 3 3 Complex Numbers • Define complex numbers as ordered pairs such that the first element in the vector is the real part of the number and the second is complex. Thus, the real number -1 is denoted by (-1,0). A complex number, 2+3i, can be expressed (2,3). • Define multiplication on the complex numbers as, Z · Z0 = (a, b) · (c, d) = (ac − bd, ad + bc). -
Sensitivity and Stability Analysis of Nonlinear Kalman Filters with Application to Aircraft Attitude Estimation
Graduate Theses, Dissertations, and Problem Reports 2013 Sensitivity and stability analysis of nonlinear Kalman filters with application to aircraft attitude estimation Matthew Brandon Rhudy West Virginia University Follow this and additional works at: https://researchrepository.wvu.edu/etd Recommended Citation Rhudy, Matthew Brandon, "Sensitivity and stability analysis of nonlinear Kalman filters with application ot aircraft attitude estimation" (2013). Graduate Theses, Dissertations, and Problem Reports. 3659. https://researchrepository.wvu.edu/etd/3659 This Dissertation is protected by copyright and/or related rights. It has been brought to you by the The Research Repository @ WVU with permission from the rights-holder(s). You are free to use this Dissertation in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you must obtain permission from the rights-holder(s) directly, unless additional rights are indicated by a Creative Commons license in the record and/ or on the work itself. This Dissertation has been accepted for inclusion in WVU Graduate Theses, Dissertations, and Problem Reports collection by an authorized administrator of The Research Repository @ WVU. For more information, please contact [email protected]. SENSITIVITY AND STABILITY ANALYSIS OF NONLINEAR KALMAN FILTERS WITH APPLICATION TO AIRCRAFT ATTITUDE ESTIMATION by Matthew Brandon Rhudy Dissertation submitted to the Benjamin M. Statler College of Engineering and Mineral Resources at West Virginia University in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Aerospace Engineering Approved by Dr. Yu Gu, Committee Chairperson Dr. John Christian Dr. Gary Morris Dr. Marcello Napolitano Dr. Powsiri Klinkhachorn Department of Mechanical and Aerospace Engineering Morgantown, West Virginia 2013 Keywords: Attitude Estimation, Extended Kalman Filter, GPS/INS Sensor Fusion, Stochastic Stability Copyright 2013, Matthew B. -
Block Diagonalization
126 (2001) MATHEMATICA BOHEMICA No. 1, 237–246 BLOCK DIAGONALIZATION J. J. Koliha, Melbourne (Received June 15, 1999) Abstract. We study block diagonalization of matrices induced by resolutions of the unit matrix into the sum of idempotent matrices. We show that the block diagonal matrices have disjoint spectra if and only if each idempotent matrix in the inducing resolution dou- ble commutes with the given matrix. Applications include a new characterization of an eigenprojection and of the Drazin inverse of a given matrix. Keywords: eigenprojection, resolutions of the unit matrix, block diagonalization MSC 2000 : 15A21, 15A27, 15A18, 15A09 1. Introduction and preliminaries In this paper we are concerned with a block diagonalization of a given matrix A; by definition, A is block diagonalizable if it is similar to a matrix of the form A1 0 ... 0 0 A2 ... 0 (1.1) =diag(A1,...,Am). ... ... 00... Am Then the spectrum σ(A)ofA is the union of the spectra σ(A1),...,σ(Am), which in general need not be disjoint. The ultimate such diagonalization is the Jordan form of the matrix; however, it is often advantageous to utilize a coarser diagonalization, easier to construct, and customized to a particular distribution of the eigenvalues. The most useful diagonalizations are the ones for which the sets σ(Ai)arepairwise disjoint; it is the aim of this paper to give a full characterization of these diagonal- izations. ∈ n×n For any matrix A we denote its kernel and image by ker A and im A, respectively. A matrix E is idempotent (or a projection matrix )ifE2 = E,and 237 nilpotent if Ep = 0 for some positive integer p. -
Matlib: Matrix Functions for Teaching and Learning Linear Algebra and Multivariate Statistics
Package ‘matlib’ August 21, 2021 Type Package Title Matrix Functions for Teaching and Learning Linear Algebra and Multivariate Statistics Version 0.9.5 Date 2021-08-10 Maintainer Michael Friendly <[email protected]> Description A collection of matrix functions for teaching and learning matrix linear algebra as used in multivariate statistical methods. These functions are mainly for tutorial purposes in learning matrix algebra ideas using R. In some cases, functions are provided for concepts available elsewhere in R, but where the function call or name is not obvious. In other cases, functions are provided to show or demonstrate an algorithm. In addition, a collection of functions are provided for drawing vector diagrams in 2D and 3D. License GPL (>= 2) Language en-US URL https://github.com/friendly/matlib BugReports https://github.com/friendly/matlib/issues LazyData TRUE Suggests knitr, rglwidget, rmarkdown, carData, webshot2, markdown Additional_repositories https://dmurdoch.github.io/drat Imports xtable, MASS, rgl, car, methods VignetteBuilder knitr RoxygenNote 7.1.1 Encoding UTF-8 NeedsCompilation no Author Michael Friendly [aut, cre] (<https://orcid.org/0000-0002-3237-0941>), John Fox [aut], Phil Chalmers [aut], Georges Monette [ctb], Gaston Sanchez [ctb] 1 2 R topics documented: Repository CRAN Date/Publication 2021-08-21 15:40:02 UTC R topics documented: adjoint . .3 angle . .4 arc..............................................5 arrows3d . .6 buildTmat . .8 cholesky . .9 circle3d . 10 class . 11 cofactor . 11 cone3d . 12 corner . 13 Det.............................................. 14 echelon . 15 Eigen . 16 gaussianElimination . 17 Ginv............................................. 18 GramSchmidt . 20 gsorth . 21 Inverse............................................ 22 J............................................... 23 len.............................................. 23 LU.............................................. 24 matlib . 25 matrix2latex . 27 minor . 27 MoorePenrose . -
Chapter 2 a Short Review of Matrix Algebra
Chapter 2 A short review of matrix algebra 2.1 Vectors and vector spaces Definition 2.1.1. A vector a of dimension n is a collection of n elements typically written as ⎛ ⎞ ⎜ a1 ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ a2 ⎟ a = ⎜ ⎟ =(ai)n. ⎜ . ⎟ ⎝ . ⎠ an Vectors of length 2 (two-dimensional vectors) can be thought of points in 33 BIOS 2083 Linear Models Abdus S. Wahed the plane (See figures). Chapter 2 34 BIOS 2083 Linear Models Abdus S. Wahed Figure 2.1: Vectors in two and three dimensional spaces (-1.5,2) (1, 1) (1, -2) x1 (2.5, 1.5, 0.95) x2 (0, 1.5, 0.95) x3 Chapter 2 35 BIOS 2083 Linear Models Abdus S. Wahed • A vector with all elements equal to zero is known as a zero vector and is denoted by 0. • A vector whose elements are stacked vertically is known as column vector whereas a vector whose elements are stacked horizontally will be referred to as row vector. (Unless otherwise mentioned, all vectors will be referred to as column vectors). • A row vector representation of a column vector is known as its trans- T pose. We will use⎛ the⎞ notation ‘ ’or‘ ’ to indicate a transpose. For ⎜ a1 ⎟ ⎜ ⎟ ⎜ a2 ⎟ ⎜ ⎟ T instance, if a = ⎜ ⎟ and b =(a1 a2 ... an), then we write b = a ⎜ . ⎟ ⎝ . ⎠ an or a = bT . • Vectors of same dimension are conformable to algebraic operations such as additions and subtractions. Sum of two or more vectors of dimension n results in another n-dimensional vector with elements as the sum of the corresponding elements of summand vectors. -
Table of Symbols
Table of Symbols Symbol Meaning Reference ∅ Empty set Page 10 ∈ Member symbol Page 10 ⊆ Subset symbol Page 10 ⊂ Proper subset symbol Page 10 ∩ Intersection symbol Page 10 ∪ Union symbol Page 10 ⊗ Tensor symbol Page 136 ≈ Approximate equality sign Page 79 −−→ PQ Displacement vector Page 147 | z | Absolute value of complex z Page 13 | A | determinant of matrix A Page 115 || u || Norm of vector u Page 212 || u ||p p-norm of vector u Page 306 u · v Standard inner product Page 216 u, v Inner product Page 312 Acof Cofactor matrix of A Page 122 adj A Adjoint of matrix A Page 122 A∗ Conjugate (Hermitian) transpose of matrix A Page 91 AT Transpose of matrix A Page 91 C(A) Column space of matrix A Page 183 cond(A) Condition number of matrix A Page 344 C[a, b] Function space Page 151 C Complex numbers a + bi Page 12 Cn Standard complex vector space Page 149 compv u Component Page 224 z Complex conjugate of z Page 13 δij Kronecker delta Page 65 dim V Dimension of space V Page 194 det A Determinant of A Page 115 domain(T ) Domain of operator T Page 189 diag{λ1,λ2,...,λn} Diagonal matrix with λ1,λ2,...,λn along diagonal Page 103 Eij Elementary operation switch ith and jth rows Page 25 356 Table of Symbols Symbol Meaning Reference Ei(c) Elementary operation multiply ith row by c Page 25 Eij (d) Elementary operation add d times jth row to ith row Page 25 Eλ(A) Eigenspace Page 254 Hv Householder matrix Page 237 I,In Identity matrix, n × n identity Page 65 idV Identity function for V Page 156 (z) Imaginary part of z Page 12 ker(T ) Kernel of operator T Page 188 Mij (A) Minor of A Page 117 M(A) Matrix of minors of A Page 122 max{a1,a2,...,am} Maximum value Page 40 min{a1,a2,...,am} Minimum value Page 40 N (A) Null space of matrix A Page 184 N Natural numbers 1, 2,.. -
Econometrics Lecture 1: Review of Matrix Algebra
Econometrics Lecture 1: Review of Matrix Algebra R. G. Pierse 1 Introduction A matrix is a rectangular array of numbers. If the matrix has n rows and m columns it is said to be an n × m matrix. This is called the dimension of the matrix. A matrix with a single column ( n × 1) is called a column vector.A matrix with a single row ( 1 × m) is called a row vector. A matrix with only one row and one column ( a single number) is called a scalar. The standard convention for denoting a matrix is to use a capital letter in bold typeface as in A, B, C. A column vector is denoted with a lowercase letter in bold typeface as in a, b, c. A row vector is denoted with a lowercase letter in bold typeface, followed by a prime, as in a0, b0, c0. A scalar is generally denoted with a lowercase letter is normal typeface as in a, b, c. An n × m matrix A can be written out explicitly in terms of its elements as in: 2 3 a11 a12 ··· a1j a1m 6 a21 a22 ··· a2j a2m 7 6 7 6 . .. 7 6 . 7 A = 6 7 : 6 ai1 ai2 aij aim 7 6 7 4 5 an1 an2 anj anm Each element has two subscripts: the first is the row index and the second the column index so that aij refers to the element in the ith row and jth column of A. 2 Matrix Operations The standard operations of addition, subtraction and multiplication can be defined for two matrices as long as the dimensions of the matrices satisfy appropriate conditions to ensure that the operation makes sense. -
Notes on Linear Algebra and Matrix Analysis
Notes on Linear Algebra and Matrix Analysis Maxim Neumann May 2006, Version 0.1.1 1 Matrix Basics Literature to this topic: [1–4]. x†y ⇐⇒ < y,x >: standard inner product. x†x = 1 : x is normalized x†y = 0 : x,y are orthogonal x†y = 0, x†x = 1, y†y = 1 : x,y are orthonormal Ax = y is uniquely solvable if A is linear independent (nonsingular). Majorization: Arrange b and a in increasing order (bm,am) then: k k b majorizes a ⇐⇒ ∑ bmi ≥ ∑ ami ∀ k ∈ [1,...,n] (1) i=1 i=1 n n The collection of all vectors b ∈ R that majorize a given vector a ∈ R may be obtained by forming the convex hull of n! vectors, which are computed by permuting the n components of a. Direct sum of matrices A ∈ Mn1,B ∈ Mn2: A 0 A ⊕ B = ∈ M (2) 0 B n1+n2 [A,B] = traceAB†: matrix inner product. 1.1 Trace n traceA = ∑λi (3) i trace(A + B) = traceA + traceB (4) traceAB = traceBA (5) 1.2 Determinants The determinant det(A) expresses the volume of a matrix A. A is singular. Linear equation is not solvable. det(A) = 0 ⇐⇒ −1 (6) A does not exists vectors in A are linear dependent det(A) 6= 0 ⇐⇒ A is regular/nonsingular. Ai j ∈ R → det(A) ∈ R Ai j ∈ C → det(A) ∈ C If A is a square matrix(An×n) and has the eigenvalues λi, then det(A) = ∏λi detAT = detA (7) detA† = detA (8) detAB = detA detB (9) Elementary operations on matrix and determinant: Interchange of two rows : detA ∗ = −1 Multiplication of a row by a nonzero scalar c : detA ∗ = c Addition of a scalar multiple of one row to another row : detA = detA 1 2 EIGENVALUES, EIGENVECTORS, AND SIMILARITY 2 a b = ad − bc (10) c d 2 Eigenvalues, Eigenvectors, and Similarity σ(An×n) = {λ1,...,λn} is the set of eigenvalues of A, also called the spectrum of A.