Higher Order Automatic Differentiation with Dual Numbers

Total Page:16

File Type:pdf, Size:1020Kb

Higher Order Automatic Differentiation with Dual Numbers https://doi.org/10.3311/PPee.16341 Creative Commons Attribution b |1 Periodica Polytechnica Electrical Engineering and Computer Science, 65(1), pp. 1–10, 2021 Higher Order Automatic Differentiation with Dual Numbers László Szirmay-Kalos1* 1 Department of Control Engineering and Information Technology, Faculty of Electrical Engineering and Informatics, Budapest University of Technology and Economics, H-1521 Budapest, P. O. B. 91, Hungary * Corresponding author, e-mail: [email protected] Received: 28 April 2020, Accepted: 27 September 2020, Published online: 26 October 2020 Abstract In engineering applications, we often need the derivatives of functions defined by a program. The approach chosen for derivative computation must be algebraic to allow computer implementation. A particular solution to obtain first derivatives is the application of dual numbers. This paper proposes simple and compact generalizations of this idea to obtain derivatives of arbitrary order for single or multi-variate functions and the automatic handling of 0/0 ambiguities in the calculations. We also provide the C++ code that takes advantage of operator overloading and recursion. The method is demonstrated by path animation, Gaussian curvature computation, and curve fairing. Keywords dual numbers, higher order automatic differentiation 1 Introduction ∆∆ Many engineering tasks require the computation of ft + −−ft 22 ft′()≈ , derivatives of a function specified by a program seg- ∆ ment. Although, the most essential approaches need only ft()+ ∆∆− 2 ft()+−ft() the first derivative, there are many problems requiring ft′′ ()≈ , ∆2 second or higher order derivation as well. For example, 3∆∆ dynamics simulation uses the Newton’s laws stating that ft + −+3 ft 2 2 the force is proportional to the second derivative of the ft′′′ ()≈ ∆3 path. The Frenet frame is based not only on the first but ∆∆ 3 also on the second derivatives. Curvature calculation also 3 ft − −−ft 2 2 needs the derivatives up to order two. Computational aes- + , ∆3 thetics, curve or surface fairing [1] use the assumption that a fair curve or surface uniformly distributes the curva- ..., ture, which means that the third derivative of the paramet- ric function should also be evaluated. Newton-Raphson where ∆ is a small offset. This is very simple, but methods attacking inverse problems use the Hessian i.e. suffers from severe problems. If ∆ is too large, then the second derivative of the target function, and particu- the approximation is rather poor. However, when it lar applications include reverse engineering [2, 3], regres- is too small, the result is numerically unstable and is sion methods [4], deep learning [5, 6] or medical imag- valid for very few digits (Fig. 1). ing [7], etc. Second derivatives of parametric surfaces • Symbolic differentiation [10] mimics the manual play an important role in non-photorealistic rendering differentiation process and generates the mathemat- as well to identify the principal curvature directions [8]. ical definition of the derived function. Symbolic dif- There are several options to implement derivative compu- ferentiation handles the function as a whole, thus, tation in a computer program: it cannot solve conditionals and special cases. • Numerical differentiation [9] approximates the • Automatic differentiation [5, 11, 12] determines derivatives of f at t by the value and the derivative of a function defined by a Cite this article as: Szirmay-Kalos, L. "Higher Order Automatic Differentiation with Dual Numbers", Periodica Polytechnica Electrical Engineering and Computer Science, 65(1), pp. 1–10, 2021. https://doi.org/10.3311/PPee.16341 Szirmay-Kalos 2|Period. Polytech. Elec. Eng. Comp. Sci., 65(1), pp. 1–10, 2021 80 f(x) automatic first derivative to the derivation rules. Concerning higher order differen- automatic second derivative 60 numeric first derivative tiation, Fike and Alonso [13] proposed hyper-dual num- numeric second derivative 40 bers that have the following form: 20 Zx=+yz12++w12, 0 2 2 2 0 -20 where the imaginary units satisfy 1 ==2 ()12 = . -40 Hyper-dual numbers have three imaginary parts to mimic second order derivatives [14]. Following this construction, -60 order N derivatives would need 2N − 1 imaginary units, mak- -80 0 1 2 3 4 5 6 ing this approach prohibitively expensive for higher orders. (a) In this paper, we provide a simple and compact approach 80 f(x) automatic first derivative for higher order derivation. When dealing with deriva- automatic second derivative 60 numeric first derivative tives up to order N, we store the function and the deriv- numeric second derivative 40 atives in an N + 1 dimensional vector, i.e. we use only 20 the minimally required N imaginary units. To demonstrate 0 the simplicity, we also present the C++ classes implement- -20 ing the different solutions in this paper. -40 The structure of this paper is as follows. In Section 2 we first review the theory of dual numbers and their -60 application in derivative computation. Section 3 presents -80 0 1 2 3 4 5 6 our first result that generalizes dual numbers for multiple (b) imaginary units and establishes the arithmetic rules for the 80 f(x) automatic first derivative computation of higher order derivatives. We also address automatic second derivative 60 numeric first derivative the case of multi-variate functions and provide a solution numeric second derivative 40 for the computation of arbitrary derivatives with the appli- 20 cation of recursive functions in Section 4. 0 Summarizing, the main contributions of this paper are: -20 • Generalization of dual numbers for multiple but min- -40 imal number of imaginary units to compute higher order derivatives. -60 • Generalization of dual numbers to handle higher -80 0 1 2 3 4 5 6 order cross differentiation of multi-variate functions. (c) • Simple solution for the automatic higher order deri- Fig. 1 Numerical differentiation of a function with different Δ values. vation with recursive functions. The function is X(t) = sin(t)(sin(t) + 3)4 / (tan(cos(t)) + 2) (a) when • Automatic handling of 0/0 type ambiguities. ∆ = 0.1 the numerical derivative is not precise; (b) when ∆ = 0.001 the derivative becomes noisy; (c) when ∆ = 0.0001 the noise becomes 2 Dual numbers intolerable. We used single precision floating point calculation. Dual numbers are similar to ordinary complex numbers. Both of them are par ticular Clifford algebras or hy per-num- program exactly up to the precision allowed by the bers of form z = x + yi where x and y are real numbers finite digit representation of the computer. As this and i is the imaginary unit. We expect the existence of approach takes the function and the value where the addition, multiplication and division with the properties of function and the derivative should be determined, ordinary complex operations, like commutativity, distrib- it is able to handle conditionals and to check and utivity, associativity. This means that when we compute solve special cases. the product of two such numbers, the result should have the same form, which necessitates the definition of i2 also Forward mode first order automatic differentiation is in the form of x + yi. The particular definition of i2 distin- accomplished by replacing the algebra of real numbers guishes different hyper-number types. For differentiation, by the algebra of dual numbers with arithmetic similar we take dual numbers defined with thei 2 = 0 fundamental Szirmay-Kalos Period. Polytech. Elec. Eng. Comp. Sci., 65(1), pp. 1–10, 2021| 3 property, stating that i is an imaginary number, which Our goal is to define the arithmetic rules for such num- does not belong to the real numbers, but its square can be bers, and find producti jik in particular, to make these rules replaced by zero. The arithmetic rules in this algebra are similar to those of higher order derivation. as follows. The addition or subtraction is Derivatives of the sum or difference of two functions are just the sum or difference of the derivatives, respec- ZZ12±=()xy11+ ii±+()xy22=±()xx12+±()yy12i. (1) tively, so the generalized dual number rules of addition The multiplication is and subtraction are equivalent to the rules of derivation: ZZ12=+()xy11ii()xy22+ N ()jj() 2 ()ff12± =±()ff12i j = ()ff12± (). (4) (2) ∑ =+xx12 ()yx12+ xy12ii+ yy12 j=0 =+xx12 ()yx12+ x112y i. Let us consider multiplication. The product of two gen- To establish the rule of division, both the numerator eralized dual numbers is and the denominator are multiplied with the conjugate of N N N N =×()j ()k = ()j ()k . the denominator to get rid of the imaginary part in the ()ff12()∑∑ff1 iij 2 k ∑∑ ff12iijk (5) j=0 k =0 j=0 k =0 denominator: Z xy+ i ()xy11+ ii()xy22− On the other hand, the dual number representation of 1 = 11= product f f is Z2 xy22+ i ()xy22+ ii()xy22− 1 2 2 (3) N ()n xx12+ ()y1xxx21− yy21ii− y2 x1 yx12− xy12 = =+ . ()ff12 = ()ff12 in . 2 22 2 i ∑ n=0 ()xy2 − 2 i x2 x2 Examining Eqs. (1), (2), and (3), we can realize that Substituting the formula of the nth derivative of a product, () n n the real part undergoes the same elementary operation n ()mn()−m , ()ff12 = ∑ ff12 (6) as the dual number, while the imaginary part reflects m=0 m the arithmetic rules of derivation for addition, multiplica- we get N n n tion and division. This means that considering function f ()mn()−m . ()ff12 = ∑∑ ff12 in (7) and its derivative f' as the real and imaginary parts of a dual n=0 m=0 m number ()ff=+f ′i , an arbitrary sequence of the four The derivatives can be computed with generalized dual elementary operations results in the function value in the number algebra if the two rules lead to identical results: real part and the derivative in the imaginary part.
Recommended publications
  • Hyper-Dual Numbers for Exact Second-Derivative Calculations
    49th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition AIAA 2011-886 4 - 7 January 2011, Orlando, Florida The Development of Hyper-Dual Numbers for Exact Second-Derivative Calculations Jeffrey A. Fike∗ andJuanJ.Alonso† Department of Aeronautics and Astronautics Stanford University, Stanford, CA 94305 The complex-step approximation for the calculation of derivative information has two significant advantages: the formulation does not suffer from subtractive cancellation er- rors and it can provide exact derivatives without the need to search for an optimal step size. However, when used for the calculation of second derivatives that may be required for approximation and optimization methods, these advantages vanish. In this work, we develop a novel calculation method that can be used to obtain first (gradient) and sec- ond (Hessian) derivatives and that retains all the advantages of the complex-step method (accuracy and step-size independence). In order to accomplish this task, a new number system which we have named hyper-dual numbers and the corresponding arithmetic have been developed. The properties of this number system are derived and explored, and the formulation for derivative calculations is presented. Hyper-dual number arithmetic can be applied to arbitrarily complex software and allows the derivative calculations to be free from both truncation and subtractive cancellation errors. A numerical implementation on an unstructured, parallel, unsteady Reynolds-Averaged Navier Stokes (URANS) solver,
    [Show full text]
  • A Nestable Vectorized Templated Dual Number Library for C++11
    cppduals: a nestable vectorized templated dual number library for C++11 Michael Tesch1 1 Department of Chemistry, Technische Universität München, 85747 Garching, Germany DOI: 10.21105/joss.01487 Software • Review Summary • Repository • Archive Mathematical algorithms in the field of optimization often require the simultaneous com- Submitted: 13 May 2019 putation of a function and its derivative. The derivative of many functions can be found Published: 05 November 2019 automatically, a process referred to as automatic differentiation. Dual numbers, close rela- License tives of the complex numbers, are of particular use in automatic differentiation. This library Authors of papers retain provides an extremely fast implementation of dual numbers for C++, duals::dual<>, which, copyright and release the work when replacing scalar types, can be used to automatically calculate a derivative. under a Creative Commons A real function’s value can be made to carry the derivative of the function with respect to Attribution 4.0 International License (CC-BY). a real argument by replacing the real argument with a dual number having a unit dual part. This property is recursive: replacing the real part of a dual number with more dual numbers results in the dual part’s dual part holding the function’s second derivative. The dual<> type in this library allows this nesting (although we note here that it may not be the fastest solution for calculating higher order derivatives.) There are a large number of automatic differentiation libraries and classes for C++: adolc (Walther, 2012), FAD (Aubert & Di Césaré, 2002), autodiff (Leal, 2019), ceres (Agarwal, Mierle, & others, n.d.), AuDi (Izzo, Biscani, Sánchez, Müller, & Heddes, 2019), to name a few, with another 30-some listed at (autodiff.org Bücker & Hovland, 2019).
    [Show full text]
  • Application of Dual Quaternions on Selected Problems
    APPLICATION OF DUAL QUATERNIONS ON SELECTED PROBLEMS Jitka Prošková Dissertation thesis Supervisor: Doc. RNDr. Miroslav Láviˇcka, Ph.D. Plze ˇn, 2017 APLIKACE DUÁLNÍCH KVATERNIONU˚ NA VYBRANÉ PROBLÉMY Jitka Prošková Dizertaˇcní práce Školitel: Doc. RNDr. Miroslav Láviˇcka, Ph.D. Plze ˇn, 2017 Acknowledgement I would like to thank all the people who have supported me during my studies. Especially many thanks belong to my family for their moral and material support and my advisor doc. RNDr. Miroslav Láviˇcka, Ph.D. for his guidance. I hereby declare that this Ph.D. thesis is completely my own work and that I used only the cited sources. Plzeˇn, July 20, 2017, ........................... i Annotation In recent years, the study of quaternions has become an active research area of applied geometry, mainly due to an elegant and efficient possibility to represent using them ro- tations in three dimensional space. Thanks to their distinguished properties, quaternions are often used in computer graphics, inverse kinematics robotics or physics. Furthermore, dual quaternions are ordered pairs of quaternions. They are especially suitable for de- scribing rigid transformations, i.e., compositions of rotations and translations. It means that this structure can be considered as a very efficient tool for solving mathematical problems originated for instance in kinematics, bioinformatics or geodesy, i.e., whenever the motion of a rigid body defined as a continuous set of displacements is investigated. The main goal of this thesis is to provide a theoretical analysis and practical applications of the dual quaternions on the selected problems originated in geometric modelling and other sciences or various branches of technical practise.
    [Show full text]
  • Vector Bundles and Brill–Noether Theory
    MSRI Series Volume 28, 1995 Vector Bundles and Brill{Noether Theory SHIGERU MUKAI Abstract. After a quick review of the Picard variety and Brill–Noether theory, we generalize them to holomorphic rank-two vector bundles of canonical determinant over a compact Riemann surface. We propose sev- eral problems of Brill–Noether type for such bundles and announce some of our results concerning the Brill–Noether loci and Fano threefolds. For example, the locus of rank-two bundles of canonical determinant with five linearly independent global sections on a non-tetragonal curve of genus 7 is a smooth Fano threefold of genus 7. As a natural generalization of line bundles, vector bundles have two important roles in algebraic geometry. One is the moduli space. The moduli of vector bun- dles gives connections among different types of varieties, and sometimes yields new varieties that are difficult to describe by other means. The other is the linear system. In the same way as the classical construction of a map to a pro- jective space, a vector bundle gives rise to a rational map to a Grassmannian if it is generically generated by its global sections. In this article, we shall de- scribe some results for which vector bundles play such roles. They are obtained from an attempt to generalize Brill–Noether theory of special divisors, reviewed in Section 2, to vector bundles. Our main subject is rank-two vector bundles with canonical determinant on a curve C with as many global sections as pos- sible: especially their moduli and the Grassmannian embeddings of C by them (Section 4).
    [Show full text]
  • Exploring Mathematical Objects from Custom-Tailored Mathematical Universes
    Exploring mathematical objects from custom-tailored mathematical universes Ingo Blechschmidt Abstract Toposes can be pictured as mathematical universes. Besides the standard topos, in which most of mathematics unfolds, there is a colorful host of alternate toposes in which mathematics plays out slightly differently. For instance, there are toposes in which the axiom of choice and the intermediate value theorem from undergraduate calculus fail. The purpose of this contribution is to give a glimpse of the toposophic landscape, presenting several specific toposes and exploring their peculiar properties, and to explicate how toposes provide distinct lenses through which the usual mathematical objects of the standard topos can be viewed. Key words: topos theory, realism debate, well-adapted language, constructive mathematics Toposes can be pictured as mathematical universes in which we can do mathematics. Most mathematicians spend all their professional life in just a single topos, the so-called standard topos. However, besides the standard topos, there is a colorful host of alternate toposes which are just as worthy of mathematical study and in which mathematics plays out slightly differently (Figure 1). For instance, there are toposes in which the axiom of choice and the intermediate value theorem from undergraduate calculus fail, toposes in which any function R ! R is continuous and toposes in which infinitesimal numbers exist. The purpose of this contribution is twofold. 1. We give a glimpse of the toposophic landscape, presenting several specific toposes and exploring their peculiar properties. 2. We explicate how toposes provide distinct lenses through which the usual mathe- matical objects of the standard topos can be viewed.
    [Show full text]
  • Math Book from Wikipedia
    Math book From Wikipedia PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. PDF generated at: Mon, 25 Jul 2011 10:39:12 UTC Contents Articles 0.999... 1 1 (number) 20 Portal:Mathematics 24 Signed zero 29 Integer 32 Real number 36 References Article Sources and Contributors 44 Image Sources, Licenses and Contributors 46 Article Licenses License 48 0.999... 1 0.999... In mathematics, the repeating decimal 0.999... (which may also be written as 0.9, , 0.(9), or as 0. followed by any number of 9s in the repeating decimal) denotes a real number that can be shown to be the number one. In other words, the symbols 0.999... and 1 represent the same number. Proofs of this equality have been formulated with varying degrees of mathematical rigour, taking into account preferred development of the real numbers, background assumptions, historical context, and target audience. That certain real numbers can be represented by more than one digit string is not limited to the decimal system. The same phenomenon occurs in all integer bases, and mathematicians have also quantified the ways of writing 1 in non-integer bases. Nor is this phenomenon unique to 1: every nonzero, terminating decimal has a twin with trailing 9s, such as 8.32 and 8.31999... The terminating decimal is simpler and is almost always the preferred representation, contributing to a misconception that it is the only representation. The non-terminating form is more convenient for understanding the decimal expansions of certain fractions and, in base three, for the structure of the ternary Cantor set, a simple fractal.
    [Show full text]
  • Generalisations of Singular Value Decomposition to Dual-Numbered
    Generalisations of Singular Value Decomposition to dual-numbered matrices Ran Gutin Department of Computer Science Imperial College London June 10, 2021 Abstract We present two generalisations of Singular Value Decomposition from real-numbered matrices to dual-numbered matrices. We prove that every dual-numbered matrix has both types of SVD. Both of our generalisations are motivated by applications, either to geometry or to mechanics. Keywords: singular value decomposition; linear algebra; dual num- bers 1 Introduction In this paper, we consider two possible generalisations of Singular Value Decom- position (T -SVD and -SVD) to matrices over the ring of dual numbers. We prove that both generalisations∗ always exist. Both types of SVD are motivated by applications. A dual number is a number of the form a + bǫ where a,b R and ǫ2 = 0. The dual numbers form a commutative, assocative and unital∈ algebra over the real numbers. Let D denote the dual numbers, and Mn(D) denote the ring of n n dual-numbered matrices. × arXiv:2007.09693v7 [math.RA] 9 Jun 2021 Dual numbers have applications in automatic differentiation [4], mechanics (via screw theory, see [1]), computer graphics (via the dual quaternion algebra [5]), and geometry [10]. Our paper is motivated by the applications in geometry described in Section 1.1 and in mechanics given in Section 1.2. Our main results are the existence of the T -SVD and the existence of the -SVD. T -SVD is a generalisation of Singular Value Decomposition that resembles∗ that over real numbers, while -SVD is a generalisation of SVD that resembles that over complex numbers.
    [Show full text]
  • Dual Quaternion Sample Reduction for SE(2) Estimation
    Dual Quaternion Sample Reduction for SE(2) Estimation Kailai Li, Florian Pfaff, and Uwe D. Hanebeck Intelligent Sensor-Actuator-Systems Laboratory (ISAS) Institute for Anthropomatics and Robotics Karlsruhe Institute of Technology (KIT), Germany [email protected], fl[email protected], [email protected] Abstract—We present a novel sample reduction scheme for random variables belonging to the SE(2) group by means of Dirac mixture approximation. For this, dual quaternions are employed to represent uncertain planar transformations. The Cramer–von´ Mises distance is modified as a smooth metric to measure the statistical distance between Dirac mixtures on the manifold of planar dual quaternions. Samples of reduced size are then ob- tained by minimizing the probability divergence via Riemannian optimization while interpreting the correlation between rotation and translation. We further deploy the proposed scheme for non- parametric modeling of estimates for nonlinear SE(2) estimation. Simulations show superior tracking performance of the sample reduction-based filter compared with Monte Carlo-based as well as parametric model-based planar dual quaternion filters. Figure 1: Examples of the proposed sample reduction tech- I. INTRODUCTION nique for planar rigid motions. Here, 2000 random samples Estimation of planar motions is ubiquitous and play a (yellow) drawn from different underlying distributions are fundamental role in many application scenarios, such as optimally approximated by 20 samples (blue) on the manifold odometry and scene reconstruction, object tracking as well of planar dual quaternions. as remote sensing, etc [1]–[5]. Planar motions, incorporating both rotations and translations on a plane, are mathematically described by elements belonging to the two-dimensional spe- which can be easily violated under large noise levels (e.g., cial Euclidean group SE(2).
    [Show full text]
  • Design of a Customized Multi-Directional Layered Deposition System Based on Part Geometry
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by UT Digital Repository Design of a customized multi-directional layered deposition system based on part geometry. Prabhjot Singh, Yong-Mo Moon, Debasish Dutta, Sridhar Kota Abstract Multi-Direction Layered Deposition (MDLD) reduces the need for supports by depositing on a part along multiple directions. This requires the design of a new mechanism to re- orient the part, such that the deposition head can approach from different orientations. We present a customized compliant parallel kinematic machine design configured to deposit a set of part geometries. Relationships between the process planning for the MDLD of a part geometry and considerations in the design of the customized machine mechanism are illustrated. MDLD process planning is based on progressive part decomposition and kinematic machine design uses dual number algebra and screw theory. 1. Introduction The past decade has seen the development of numerous Layered Manufacturing (LM) techniques. The main advantage afforded by LM is its ability to produce geometrically complex parts without specialized tooling in a relatively short period of time. LM processes are characterized by the need for sacrificial structures to support overhanging regions of the part. This necessitates time consuming post- processing and degrades part quality. Multi-Direction Layered Deposition (MDLD) Systems such as [3][4][5][7] build parts without supports by depositing a part along multiple directions. Depending on the process, either the deposition nozzle or the base table has multi-axis kinematics (refer Figure 1). (b) (a) Figure 1: (a) Deposition Table with rotational and translational actuation (b) Deposition nozzle mounted on a multi-axis robotic arm.
    [Show full text]
  • Dual Quaternions in Spatial Kinematics in an Algebraic Sense
    Turk J Math 32 (2008) , 373 – 391. c TUB¨ ITAK˙ Dual Quaternions in Spatial Kinematics in an Algebraic Sense Bedia Akyar Abstract This paper presents the finite spatial displacements and spatial screw motions by using dual quaternions and Hamilton operators. The representations are considered as 4 × 4 matrices and the relative motion for three dual spheres is considered in terms of Hamilton operators for a dual quaternion. The relation between Hamilton operators and the transformation matrix has been given in a different way. By considering operations on screw motions, representation of spatial displacements is also given. Key Words: Dual quaternions, Hamilton operators, Lie algebras. 1. Introduction The matrix representation of spatial displacements of rigid bodies has an important role in kinematics and the mathematical description of displacements. Veldkamp and Yang-Freudenstein investigated the use of dual numbers, dual numbers matrix and dual quaternions in instantaneous spatial kinematics in [9] and [10], respectively. In [10], an application of dual quaternion algebra to the analysis of spatial mechanisms was given. A comparison of representations of general spatial motion was given by Rooney in [8]. Hiller- Woernle worked on a unified representation of spatial displacements. In their paper [7], the representation is based on the screw displacement pair, i.e., the dual number extension of the rotational displacement pair, and consists of the dual unit vector of the screw axis AMS Mathematics Subject Classification: 53A17, 53A25, 70B10. 373 AKYAR and the associate dual angle of the amplitude. Chevallier gave a unified algebraic approach to mathematical methods in kinematics in [4]. This approach requires screw theory, dual numbers and Lie groups.
    [Show full text]
  • LIE ALGEBRAS ARE INFINITESIMAL GROUPS Throughout K Is a Field
    LIE ALGEBRAS ARE INFINITESIMAL GROUPS RUDOLF TANGE Throughout k is a field and δ is a “dual number” satisfying δ2 = 0. Furthermore g is a Lie algebra over k which is the Lie algebra of a group G. We denote the identity of G by I. Whenever X ∈ g = TI (G), the tangent space of G at I, we consider the corresponding group element I + δX of which you should think as “I with an infinitesimal bit added in the direction X”. You should think of δ as being a very small number of k = R and calculate upto the first order. 2 More formally, δ is the image of δ0 in R = R(δ) := k[δ0]/(δ0), the quotient 2 of the polynomial ring k[δ0] by the ideal (δ0), and I + δX is an element of G(R), the points of G over R.1 The commutator Now we embed G into some GLn. Our computations below will take place in Matn(R). The inverse of I + δX is I − δX: (I + δX)(I − δX) = I − δX + δX − δ2X2 = I. Note that the elements I + δX form a commutative subgroup of G(R). To capture something of the noncommutativity we need to look at second order terms. To do this we add another dual number η to our ring R, that 2 2 is, we work with the ring R = R(δ, η) := k[δ0, η0]/(δ0, η0), where we denote the images of δ0 and η0 by δ and η. Let X, Y ∈ g. We compute the group commutator of I + δX and I + δY : (I + δX)(I + ηY )(I − δX)(I − ηY ) = (I + ηY + δX + δηXY )(I − ηY − δX + δηXY ) = I − ηY − δX + δηXY + ηY − δηY X + δX − δηXY + δηXY = I + δη(XY − YX) = I + δη[X, Y ].
    [Show full text]
  • Symmetry, Geometry, and Quantization with Hypercomplex
    SYMMETRY, GEOMETRY, AND QUANTIZATION WITH HYPERCOMPLEX NUMBERS VLADIMIR V. KISIL Abstract. These notes describe some links between the group SL2(R), the Heisenberg group and hypercomplex numbers – complex, dual and double num- bers. Relations between quantum and classical mechanics are clarified in this framework. In particular, classical mechanics can be obtained as a theory with noncommutative observables and a non-zero Planck constant if we replace complex numbers in quantum mechanics by dual numbers. Our considera- tion is based on induced representations which are build from complex-/dual- /double-valued characters. Dynamic equations, rules of additions of proba- bilities, ladder operators and uncertainty relations are discussed. Finally, we prove a Calder´on–Vaillancourt-type norm estimation for relative convolutions. Contents Introduction 2 1. Preview: Quantum and Classical Mechanics 3 1.1. Axioms of Mechanics 3 1.2. “Algebra” of Observables 3 1.3. Non-Essential Noncommutativity 4 1.4. Quantum Mechanics from the Heisenberg Group 5 1.5. Classical Noncommutativity 6 1.6. Summary 8 2. Groups, Homogeneous Spaces and Hypercomplex Numbers 9 2.1. The Group SL2(R) and Its Subgroups 9 2.2. Action of SL2(R) as a Source of Hypercomplex Numbers 9 2.3. Orbits of the Subgroup Actions 11 2.4. The Heisenberg Group and Symplectomorphisms 13 3. Linear Representations and Hypercomplex Numbers 15 3.1. Hypercomplex Characters 15 3.2. Induced Representations 16 3.3. Similarity and Correspondence: Ladder Operators 18 3.4. Ladder Operators 19 arXiv:1611.05650v1 [math-ph] 17 Nov 2016 3.5. Induced Representations of the Heisenberg Group 21 4.
    [Show full text]