Analyzing the Linear and Nonlinear Transformations of Alexnet to Gain Insight Into Its Performance

Total Page:16

File Type:pdf, Size:1020Kb

Analyzing the Linear and Nonlinear Transformations of Alexnet to Gain Insight Into Its Performance Analyzing the Linear and Nonlinear Transformations of AlexNet to Gain Insight into Its Performance Jyoti Nigam, Srishti Barahpuriya and Renu M. Rameshan Indian Institute of Technology, Mandi, Himachal Pradesh, India Keywords: Convolution, Correlation, Linear Transformation, Nonlinear Transformation. Abstract: AlexNet, one of the earliest and successful deep learning networks, has given great performance in image clas- sification task. There are some fundamental properties for good classification such as: the network preserves the important information of the input data; the network is able to see differently, points from different classes. In this work we experimentally verify that these core properties are followed by the AlexNet architecture. We analyze the effect of linear and nonlinear transformations on input data across the layers. The convolution fil- ters are modeled as linear transformations. The verified results motivate to draw conclusions on the desirable properties of transformation matrix that aid in better classification. 1 INTRODUCTION AND • We derive the structure of the linear transforma- RELATED WORK tion corresponding to the convolution filters and analyze its effect on the data using the bounds on the norm of the linear transformation. Convolutional neural networks (CNNs) have led to considerable improvements in performance for many • Using a specific data selection plan we show em- computer vision (LeCun et al., 1989; Krizhevsky pirically that the data from the same class shrinks et al., 2012) and natural language processing tasks and separation increases between two different (Young et al., 2018). In recent literature there are classes. many papers (Giryes et al., 2016; Sokolic´ et al., 2017; Sokolic´ et al., 2017; Oyallon, 2017) which provide an analysis on why deep neural networks (DNNs) are ef- ficient classifiers. (Kobayashi, 2018; Dosovitskiy and 2 ANALYSIS Brox, 2016) provide an analysis of CNNs by looking at the visualization of the neuron activations. Statis- AlexNet employs five convolution layers and three tical models (Xie et al., 2016) have also been used to max pooling layers for extracting features. Further- derive feature representation based on a simple statis- more, the three fully connected layers for classifying tical model. images as shown in Fig.1. Each layer makes use of We choose to analyze the network in a method dif- the rectified linear unit (ReLU) for nonlinear neuron ferent from all the above by modeling the filters as a activation. linear transformation. The effect of nonlinear opera- In CNNs, feature extraction from the input data tions is analyzed by using measures like Mahalanobis is done by convolution layers while fully connected distance and angular as well as Euclidean separation layers perform as classifiers. Each convolution layer between points of different classes. generates a feature map that is in 3D tensor format and AlexNet (Krizhevsky et al., 2012) is one of the fed into the subsequent layer. The feature map from oldest successful CNNs that recognizes images of the the last convolution layer is given to fully connected ImageNet dataset (Deng et al., 2009). We analyze ex- layers in the form of a flattened vector and a 1000 di- perimentally this network with an aim of understand- mensional vector is generated as output of fully con- ing the mathematical reasons for its success. We use nected layer. This is followed by normalization and data from two classes of ImageNet to study the per- then a softmax layer is applied. In the normalized out- formance of AlexNet. put vector, each dimension refers to the probability of The contributions of this analysis are as follows: the image being the element of each image class. 860 Nigam, J., Barahpuriya, S. and Rameshan, R. Analyzing the Linear and Nonlinear Transformations of AlexNet to Gain Insight into Its Performance. DOI: 10.5220/0007582408600865 In Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2019), pages 860-865 ISBN: 978-989-758-351-3 Copyright c 2019 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved Analyzing the Linear and Nonlinear Transformations of AlexNet to Gain Insight into Its Performance Figure 1: AlexNet Architecture (Kim et al., 2017). CNN has more than one linear transformations and two types of nonlinear transformations (ReLU and pooling) which are used in repetition. The the nonlinear transformation confines the data to the pos- itive orthant of higher dimension. 2.1 Analysis of Linear Transformation The main operation in CNNs is convolution. The fil- Figure 2: Convolution operation. The input is of size ters are correlated with the input to get feature maps N1N2N3 ×1 and there are M3 filters ( f1;:::; fM3 ) which gen- and this operation is termed as convolution in the liter- erate M3 feature maps (m1;:::;mM3 ). ature. Since correlation is nothing other than convolu- tion without flipping the kernel, correlation operation has full rank. The input x is convolved with each filter can also be represented as a matrix vector product. generating a feature map mk. Eq.(3) gives the descrip- We refer to this matrix as the linear transformation. tion of convolution operation. 1D and 2D correlation can be represented as shown in Eq. (1) and Eq. (2), respectively. y = Tx: (3) y(n) = ∑h(k)x(n + k); (1) 2.1.1 Analysis based on Nature of k Transformation Matrix y(m;n) = ∑∑h(l;k)x(m + l;n + k); (2) The desirable properties for transformation matrix (T) l k to aid classification are: where x is the input and y is the output and h is the 1. The null space of T should be such that the differ- kernel. Eq.1 leads to a Toeplitz matrix and Eq.2 leads ence of vectors from two different classes should to a block Toeplitz matrix. In CNNs notice that the or- not be in the null space of T. This in turn demands der of convolution is higher and correspondingly one that the difference should not lie in null space of gets a matrix which is Toeplitz with Toeplitz blocks. fk, 1 ≤ k ≤ M3, i.e. if xi 2 Ci and x j 2 Cj, where Typically each layer has multiple filters leading to Ci and Cj are different classes, multiple maps and the Toeplitz matrix corresponding to each filter is stacked vertically to get the overall x j − xi 2= N ( fk); 1 ≤ k ≤ N: (4) transformation from the input space to output space. N ×N ×N Proof. Let x1;x2 2 R 1 2 3 be two points and As an example let x 2 R N1×N2×N3 be an input vec- their difference x = x1 − x2, the norm of x is, tor and y 2 R M1×M2×M3 be the output vector, where N and M are number of channels in input and num- 3 3 kxk= kx − x k: (5) ber of filters, respectively. Then the transformation 2 1 matrix T is such that T 2 R M1M2M3×N1N2N3 . T is ob- x1;x2 are transformed to tained by stacking fk, where 1 ≤ k ≤ M3, as shown in Fig.2. Each fk is Toeplitz with Toeplitz blocks and y1 = Tx1; y2 = Tx2: (6) 861 ICPRAM 2019 - 8th International Conference on Pattern Recognition Applications and Methods Table 1: Analysis of norm values at each layer. Layers Total filters Filters: k T k2 < 1 Min Max 1 96 13 0.28 4.16 2 256 2 0.03 4.26 3 384 2 0.96 2.22 4 384 6 0.96 2.12 5 256 0 1.16 2.19 Norm of the difference of y1 and y2 is Angles between representative points of both classes 100 Near_Min Medium_Min 90 Far_Min ky2 − y1k= kT(x2 − x1)k; (7) Near_Max 80 Medium_Max That can be written as: Far_Max 70 M 60 3 1 2 2 50 kT(x2 − x1)k= (∑k fk(x2 − x1)k ) ; Angles k=1 40 30 20 x −x 2= (T) x −x 2= ( f ) 8k: 2 1 N only if 2 1 N k This is 10 important to maintain separation between classes 0 1 2 3 4 5 after the transformation. Layers 2. l (TT T ) > 1, l (TT T ) > 1, where l and Figure 3: Angles between representative points of both the min max min classes. The first entry in x−axis shows the input value fol- lmax are the minimum and maximum eigenvalues, T lowed by five layers of AlexNet. Near min and Near max respectively of TT . are minimum and maximum angle values, respectively from the near region. Similarly for medium and far regions the Proof. For proper classification, two vectors from minimum and maximum values are shown. different classes should be separated at least by kx2 − x1k. Since transformations on the input data. In this direction there is a work (Giryes et al., 2016), which provides y2 − y1 = T(x2 − x1); a study about how the distance and the angle changes ky − y k = kT(x − x )k; 2 1 2 1 after the transformation within the layers. The anal- = k(x2 − x1)kkTzk; wherekzk= 1: ysis in (Giryes et al., 2016) is based on the networks with random Gaussian weights and it exploits tools Let l and l are the minimum and max- min max used in the compressed sensing and dictionary learn- imum eigenvalues, respectively of TT T , then ing literature. They showed that if the angle between l ≤ kTzk≤ l , and hence min max the inputs is large then the Euclidean distance be- lmink(x2 − x1)k≤ k(y2 − y1)k≤ lmaxk(x2 − x1)k: tween them at the output layer will be large and vice- (8) versa. We analyze the effect of nonlinear transformations on the input data by measuring the following key- From Eq.(8) it is evident that lmin;lmax > 1 is points.
Recommended publications
  • MATH 2030: MATRICES Introduction to Linear Transformations We Have
    MATH 2030: MATRICES Introduction to Linear Transformations We have seen that we may describe matrices as symbol with simple algebraic properties like matrix multiplication, addition and scalar addition. In the particular case of matrix-vector multiplication, i.e., Ax = b where A is an m × n matrix and x; b are n×1 matrices (column vectors) we may represent this as a transformation on the space of column vectors, that is a function F (x) = b , where x is the independent variable and b the dependent variable. In this section we will give a more rigorous description of this idea and provide examples of such matrix transformations, which will lead to the idea of a linear transformation. To begin we look at a matrix-vector multiplication to give an idea of what sort of functions we are working with 21 0 3 1 A = 2 −1 ; v = : 4 5 −1 3 4 then matrix-vector multiplication yields 2 1 3 Av = 4 3 5 −1 We have taken a 2 × 1 matrix and produced a 3 × 1 matrix. More generally for any x we may describe this transformation as a matrix equation y 21 0 3 2 x 3 x 2 −1 = 2x − y : 4 5 y 4 5 3 4 3x + 4y From this product we have found a formula describing how A transforms an arbi- 2 3 trary vector in R into a new vector in R . Expressing this as a transformation TA we have 2 x 3 x T = 2x − y : A y 4 5 3x + 4y From this example we can define some helpful terminology.
    [Show full text]
  • Vectors, Matrices and Coordinate Transformations
    S. Widnall 16.07 Dynamics Fall 2009 Lecture notes based on J. Peraire Version 2.0 Lecture L3 - Vectors, Matrices and Coordinate Transformations By using vectors and defining appropriate operations between them, physical laws can often be written in a simple form. Since we will making extensive use of vectors in Dynamics, we will summarize some of their important properties. Vectors For our purposes we will think of a vector as a mathematical representation of a physical entity which has both magnitude and direction in a 3D space. Examples of physical vectors are forces, moments, and velocities. Geometrically, a vector can be represented as arrows. The length of the arrow represents its magnitude. Unless indicated otherwise, we shall assume that parallel translation does not change a vector, and we shall call the vectors satisfying this property, free vectors. Thus, two vectors are equal if and only if they are parallel, point in the same direction, and have equal length. Vectors are usually typed in boldface and scalar quantities appear in lightface italic type, e.g. the vector quantity A has magnitude, or modulus, A = |A|. In handwritten text, vectors are often expressed using the −→ arrow, or underbar notation, e.g. A , A. Vector Algebra Here, we introduce a few useful operations which are defined for free vectors. Multiplication by a scalar If we multiply a vector A by a scalar α, the result is a vector B = αA, which has magnitude B = |α|A. The vector B, is parallel to A and points in the same direction if α > 0.
    [Show full text]
  • Support Graph Preconditioners for Sparse Linear Systems
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Texas A&M University SUPPORT GRAPH PRECONDITIONERS FOR SPARSE LINEAR SYSTEMS A Thesis by RADHIKA GUPTA Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE December 2004 Major Subject: Computer Science SUPPORT GRAPH PRECONDITIONERS FOR SPARSE LINEAR SYSTEMS A Thesis by RADHIKA GUPTA Submitted to Texas A&M University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Approved as to style and content by: Vivek Sarin Paul Nelson (Chair of Committee) (Member) N. K. Anand Valerie E. Taylor (Member) (Head of Department) December 2004 Major Subject: Computer Science iii ABSTRACT Support Graph Preconditioners for Sparse Linear Systems. (December 2004) Radhika Gupta, B.E., Indian Institute of Technology, Bombay; M.S., Georgia Institute of Technology, Atlanta Chair of Advisory Committee: Dr. Vivek Sarin Elliptic partial differential equations that are used to model physical phenomena give rise to large sparse linear systems. Such systems can be symmetric positive definite and can be solved by the preconditioned conjugate gradients method. In this thesis, we develop support graph preconditioners for symmetric positive definite matrices that arise from the finite element discretization of elliptic partial differential equations. An object oriented code is developed for the construction, integration and application of these preconditioners. Experimental results show that the advantages of support graph preconditioners are retained in the proposed extension to the finite element matrices. iv To my parents v ACKNOWLEDGMENTS I would like to express sincere thanks to my advisor, Dr.
    [Show full text]
  • Wronskian Solutions to the Kdv Equation Via B\" Acklund
    Wronskian solutions to the KdV equation via B¨acklund transformation Qi-fei Xuan∗, Mei-ying Ou, Da-jun Zhang† Department of Mathematics, Shanghai University, Shanghai 200444, P.R. China October 27, 2018 Abstract In the paper we discuss the B¨acklund transformation of the KdV equation between solitons and solitons, between negatons and negatons, between positons and positons, between rational solution and rational solution, and between complexitons and complexitons. We investigate the conditions that Wronskian entries satisfy for the bilinear B¨acklund transformation of the KdV equation. By choosing suitable Wronskian entries and the parameter in the bilinear B¨acklund transformation, we obtain transformations between many kinds of solutions. Keywords: the KdV equation, Wronskian solution, bilinear form, B¨acklund transformation 1 Introduction The Wronskian can be considered as a bridge connecting with many classical methods in soliton theory. This is not only because soliton solutions in Wronskian form can be obtained from the Darboux transformation[1], Sato theory[2, 3] and Wronskian technique[4]-[10], but also because the exponential polynomial for N-solitons derived from Hirota method[11, 12] and the matrix form given by the Inverse Scattering Transform[13, 14] can be transformed into a Wronskian by extracting some exponential factors. The special structure of a Wronskian contributes simple arXiv:0706.3487v1 [nlin.SI] 24 Jun 2007 forms of its derivatives, and this admits solution verification by direct substituting Wronskians into a bilinear soliton equation or a bilinear B¨acklund transformation(BT). This approach is re- ferred to as Wronskian technique[4]. In the approach a bilinear soliton equation is some algebraic identity provided that Wronskian entry vector satisfies some differential equation set which we call Wronskian condition.
    [Show full text]
  • Rotation Matrix - Wikipedia, the Free Encyclopedia Page 1 of 22
    Rotation matrix - Wikipedia, the free encyclopedia Page 1 of 22 Rotation matrix From Wikipedia, the free encyclopedia In linear algebra, a rotation matrix is a matrix that is used to perform a rotation in Euclidean space. For example the matrix rotates points in the xy -Cartesian plane counterclockwise through an angle θ about the origin of the Cartesian coordinate system. To perform the rotation, the position of each point must be represented by a column vector v, containing the coordinates of the point. A rotated vector is obtained by using the matrix multiplication Rv (see below for details). In two and three dimensions, rotation matrices are among the simplest algebraic descriptions of rotations, and are used extensively for computations in geometry, physics, and computer graphics. Though most applications involve rotations in two or three dimensions, rotation matrices can be defined for n-dimensional space. Rotation matrices are always square, with real entries. Algebraically, a rotation matrix in n-dimensions is a n × n special orthogonal matrix, i.e. an orthogonal matrix whose determinant is 1: . The set of all rotation matrices forms a group, known as the rotation group or the special orthogonal group. It is a subset of the orthogonal group, which includes reflections and consists of all orthogonal matrices with determinant 1 or -1, and of the special linear group, which includes all volume-preserving transformations and consists of matrices with determinant 1. Contents 1 Rotations in two dimensions 1.1 Non-standard orientation
    [Show full text]
  • Linear Algebra with Exercises B
    Linear Algebra with Exercises B Fall 2017 Kyoto University Ivan Ip These notes summarize the definitions, theorems and some examples discussed in class. Please refer to the class notes and reference books for proofs and more in-depth discussions. Contents 1 Abstract Vector Spaces 1 1.1 Vector Spaces . .1 1.2 Subspaces . .3 1.3 Linearly Independent Sets . .4 1.4 Bases . .5 1.5 Dimensions . .7 1.6 Intersections, Sums and Direct Sums . .9 2 Linear Transformations and Matrices 11 2.1 Linear Transformations . 11 2.2 Injection, Surjection and Isomorphism . 13 2.3 Rank . 14 2.4 Change of Basis . 15 3 Euclidean Space 17 3.1 Inner Product . 17 3.2 Orthogonal Basis . 20 3.3 Orthogonal Projection . 21 i 3.4 Orthogonal Matrix . 24 3.5 Gram-Schmidt Process . 25 3.6 Least Square Approximation . 28 4 Eigenvectors and Eigenvalues 31 4.1 Eigenvectors . 31 4.2 Determinants . 33 4.3 Characteristic polynomial . 36 4.4 Similarity . 38 5 Diagonalization 41 5.1 Diagonalization . 41 5.2 Symmetric Matrices . 44 5.3 Minimal Polynomials . 46 5.4 Jordan Canonical Form . 48 5.5 Positive definite matrix (Optional) . 52 5.6 Singular Value Decomposition (Optional) . 54 A Complex Matrix 59 ii Introduction Real life problems are hard. Linear Algebra is easy (in the mathematical sense). We make linear approximations to real life problems, and reduce the problems to systems of linear equations where we can then use the techniques from Linear Algebra to solve for approximate solutions. Linear Algebra also gives new insights and tools to the original problems.
    [Show full text]
  • DRS: Diagonal Dominant Reduction for Lattice-Based Signature Version 2
    DRS: Diagonal dominant Reduction for lattice-based Signature Version 2 Thomas PLANTARD, Arnaud SIPASSEUTH, Cédric DUMONDELLE, Willy SUSILO Institute of Cybersecurity and Cryptology School of Computing and Information Technology University of Wollongong Australia 1 Background Denition 1. We call lattice a discrete subgroup of Rn. We say a lattice is an integer lattice when it is a subgroup of Zn. A basis of the lattice is a basis as a Z − module. In our work we only consider full-rank integer lattices (unless specied otherwise), i.e such that their basis can be represented by a n × n non-singular integer matrix. It is important to note that just like in classical linear algebra, a lattice has an innity of basis. In fact, if B is a basis of L, then so is UB for any unimodular matrix U (U can be seen as the set of linear operations over Zn on the rows of B that do not aect the determinant). Denition 2 (Minima). We note λi(L) the i−th minimum of a lattice L. It is the radius of the smallest zero-centered ball containing at least i linearly independant elements of L. λi+1(L) Denition 3 (Lattice gap). We note δi(L) the ratio and call that a lattice gap. When mentioned λi(L) without index and called "the" gap, the index is implied to be i = 1. Denition 4. We say a lattice is a diagonally dominant type lattice (of dimension n) if it admits a basis B of the form a diagonal dominant matrix as in [1], i.e Pn 8i 2 [1; n];Bi;i ≥ j=1;i6=j jBi;jj We can also see a diagonally dominant matrix B as a sum B = D + R where D is diagonal and Di;i > kRik1.
    [Show full text]
  • Chapter 6 Direct Methods for Solving Linear Systems
    Chapter 6 Direct Methods for Solving Linear Systems Per-Olof Persson [email protected] Department of Mathematics University of California, Berkeley Math 128A Numerical Analysis Direct Methods for Linear Systems Consider solving a linear system of the form: 퐸1 ∶ 푎11푥1 + 푎12푥2 + ⋯ + 푎1푛푥푛 = 푏1, 퐸2 ∶ 푎21푥1 + 푎22푥2 + ⋯ + 푎2푛푥푛 = 푏2, ⋮ 퐸푛 ∶ 푎푛1푥1 + 푎푛2푥2 + ⋯ + 푎푛푛푥푛 = 푏푛, for 푥1, … , 푥푛. Direct methods give an answer in a fixed number of steps, subject only to round-off errors. We use three row operations to simplify the linear system: 1. Multiply Eq. 퐸푖 by 휆 ≠ 0: (휆퐸푖) → (퐸푖) 2. Multiply Eq. 퐸푗 by 휆 and add to Eq. 퐸푖: (퐸푖 + 휆퐸푗) → (퐸푖) 3. Exchange Eq. 퐸푖 and Eq. 퐸푗: (퐸푖) ↔ (퐸푗) Gaussian Elimination Gaussian Elimination with Backward Substitution Reduce a linear system to triangular form by introducing zeros using the row operations (퐸푖 + 휆퐸푗) → (퐸푖) Solve the triangular form using backward-substitution Row Exchanges If a pivot element on the diagonal is zero, the reduction to triangular form fails Find a nonzero element below the diagonal and exchange the two rows Definition An 푛 × 푚 matrix is a rectangular array of elements with 푛 rows and 푚 columns in which both value and position of an element is important Operation Counts Count the number of arithmetic operations performed Use the formulas 푚 푚(푚 + 1) 푚 푚(푚 + 1)(2푚 + 1) ∑ 푗 = , ∑ 푗2 = 푗=1 2 푗=1 6 Reduction to Triangular Form Multiplications/divisions: 푛−1 2푛3 + 3푛2 − 5푛 ∑(푛 − 푖)(푛 − 푖 + 2) = ⋯ = 푖=1 6 Additions/subtractions: 푛−1 푛3 − 푛 ∑(푛 − 푖)(푛 − 푖 + 1) = ⋯ = 푖=1 3 Operation Counts
    [Show full text]
  • Computing Rational Forms of Integer Matrices
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector J. Symbolic Computation (2002) 34, 157–172 doi:10.1006/jsco.2002.0554 Available online at http://www.idealibrary.com on Computing Rational Forms of Integer Matrices MARK GIESBRECHT† AND ARNE STORJOHANN‡ School of Computer Science, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1 n×n A new algorithm is presented for finding the Frobenius rational form F ∈ Z of any n×n 4 3 A ∈ Z which requires an expected number of O(n (log n + log kAk) + n (log n + log kAk)2) word operations using standard integer and matrix arithmetic (where kAk = maxij |Aij |). This substantially improves on the fastest previously known algorithms. The algorithm is probabilistic of the Las Vegas type: it assumes a source of random bits but always produces the correct answer. Las Vegas algorithms are also presented for computing a transformation matrix to the Frobenius form, and for computing the rational Jordan form of an integer matrix. c 2002 Elsevier Science Ltd. All rights reserved. 1. Introduction In this paper we present new algorithms for exactly computing the Frobenius and rational Jordan normal forms of an integer matrix which are substantially faster than those previously known. We show that the Frobenius form F ∈ Zn×n of any A ∈ Zn×n can be computed with an expected number of O(n4(log n+log kAk)+n3(log n+log kAk)2) word operations using standard integer and matrix arithmetic. Here and throughout this paper, kAk = maxij |Aij|.
    [Show full text]
  • 2 Rotation Kinematics
    2 Rotation Kinematics Consider a rigid body with a fixed point. Rotation about the fixed point is the only possible motion of the body. We represent the rigid body by a body coordinate frame B, that rotates in another coordinate frame G, as is shown in Figure 2.1. We develop a rotation calculus based on trans- formation matrices to determine the orientation of B in G, and relate the coordinates of a body point P in both frames. Z ZP z B zP G P r yP y YP X x P P Y X x FIGURE 2.1. A rotated body frame B in a fixed global frame G,aboutafixed point at O. 2.1 Rotation About Global Cartesian Axes Consider a rigid body B with a local coordinate frame Oxyz that is origi- nally coincident with a global coordinate frame OXY Z.PointO of the body B is fixed to the ground G and is the origin of both coordinate frames. If the rigid body B rotates α degrees about the Z-axis of the global coordi- nate frame, then coordinates of any point P of the rigid body in the local and global coordinate frames are related by the following equation G B r = QZ,α r (2.1) R.N. Jazar, Theory of Applied Robotics, 2nd ed., DOI 10.1007/978-1-4419-1750-8_2, © Springer Science+Business Media, LLC 2010 34 2. Rotation Kinematics where, X x Gr = Y Br= y (2.2) ⎡ Z ⎤ ⎡ z ⎤ ⎣ ⎦ ⎣ ⎦ and cos α sin α 0 − QZ,α = sin α cos α 0 .
    [Show full text]
  • Matrix Notation for Graphical Transformations
    Appendix Matrix notation for A graphical transformations This appendix is not intended as an introduction to matrix notation but serves to supplement the principles given in Chapters 3 and 4 concerning transformations by using the shorthand of matrix notations. A.I TWO-DIMENSIONAL TRANSFORMATIONS A generalized point in 2-D can be represented by its x and Y co-ordinates as p = [xv YI], where [xv yd is a 1 x 2 matrix. Similarly a line in 2-D can be represented by its end points as a 2 x 2 matrix: L = [:~ ~:]. The general 2-D point (x, y) can be transferred into the point (xv YI) as Xl = ax + by (A. 1) YI = ex + dy, where the transform T can be expressed in matrix form as: Equation (A.l) can now be expressed in matrix form as (XII YI) = (X, y) [~ ~] I (A.2) i.e. Xl can be found by multiplying the row vector (x, y) by the first column [~ :] and YI by multiplying (x, y) by the second column [: ~]. Translation I ~ Similarly a pair of matrix transformations can be multiplied together to give one combined (or concatenated) form. Thus if and then the concatenated transform or T = [(ae + ef) (ag + eh)] (be + df) (bg + dh) T = [~ ~l where j = ae + ej, i.e. j is formed by taking the sum of the products of the first row of TI with the first column of T2; k = be + dj, i.e. k is formed by taking the sum of the products of the second row of TI with the first column of T2; 1 = ag + eh, i.e.
    [Show full text]
  • Linear Algebra and Geometric Transformations in 2D
    Linear algebra and geometric transformations in 2D Computer Graphics CSE 167 Lecture 2 CSE 167: Computer Graphics • Linear algebra – Vectors – Matrices • Points as vectors • Geometric transformations in 2D – Homogeneous coordinates CSE 167, Winter 2018 2 Vectors • Represent magnitude and direction in multiple dimensions • Examples – Translation of a point – Surface normal vectors (vectors orthogonal to surface) CSE 167, Winter 2018 3 Based on slides courtesy of Jurgen Schulze Vectors and arithmetic Examples using Vectors are 3‐vectors column vectors Vectors must be the same length CSE 167, Winter 2018 4 Magnitude of a vector • The magnitude of a vector is its norm Example using 3‐vector • A vector if magnitude 1 is called a unit vector • A vector can be unitized by dividing by its norm CSE 167, Winter 2018 5 Dot product of two vectors Angle between two vectors CSE 167, Winter 2018 6 Cross product of two 3‐vectors • The cross product of two 3‐vectors a and b results in another 3‐vector that is orthogonal (using right hand rule) to the two vectors CSE 167, Winter 2018 7 Cross product of two 3‐vectors CSE 167, Winter 2018 8 Matrices • 2D array of numbers A = CSE 167, Winter 2018 9 Matrix addition • Matrices must be the same size • Matrix subtraction is similar CSE 167, Winter 2018 10 Matrix‐scalar multiplication CSE 167, Winter 2018 11 Matrix‐matrix multiplication CSE 167, Winter 2018 12 Matrix‐vector multiplication • Same as matrix‐matrix multiplication – Example: 3x3 matrix multiplied with 3‐vector CSE 167, Winter 2018 13 Transpose • AT
    [Show full text]