Arxiv:1812.04590V2 [Cs.SC] 8 Sep 2019

Computing Nearby Non-trivial Smith Forms Mark Giesbrecht, Joseph Haraldson, George Labahn Cheriton School of Computer Science, University of Waterloo, Waterloo, Canada Abstract We consider the problem of computing the nearest matrix polynomial with a non-trivial Smith Normal Form. We show that computing the Smith form of a matrix polynomial is amenable to numeric computation as an optimization problem. Furthermore, we describe an effective optimization technique to find a nearby matrix polynomial with a non-trivial Smith form. The results are then generalized to include the computation of a matrix polynomial having a maximum specified number of ones in the Smith Form (i.e., with a maximum specified McCoy rank). We discuss the geometry and existence of solutions and how our results can be used for an error analysis. We develop an optimization-based approach and demonstrate an iterative numerical method for computing a nearby matrix polynomial with the desired spectral properties. We also describe an implementation of our algorithms and demonstrate the robustness with examples in Maple. 1. Introduction × For a given square matrix polynomial A 2 R[t]n n, one can find unimodular matrices × U; V 2 R[t]n n such that UAV is a diagonal matrix S. Unimodular means that there is a polynomial inverse matrix, or equivalently, that the determinant is a nonzero constant from R. The unimodular matrices U; V encapsulate the polynomial row and column operations, respectively, needed for such a diagonalization. The best known diagonalization is the Smith Normal Form (SNF, or simply Smith form) of a matrix polynomial. Here 0 1 s1 B C B s2 C B C n×n S = B : C 2 R[t] ; B :: C @B AC sn arXiv:1812.04590v2 [cs.SC] 8 Sep 2019 where s1;:::; sn 2 R[t] are monic and si j si+1 for 1 ≤ i < n. The Smith form always exists and is unique though the associated unimodular matrices U, V are not unique (see, e.g., (Kailath, 1980; Gohberg et al., 2009)). The diagonal entries s1;:::; sn are referred to as the invariant factors of A. Email addresses: [email protected] (Mark Giesbrecht), [email protected] (Mark Giesbrecht), [email protected] (Joseph Haraldson), [email protected] (George Labahn) URL: https://cs.uwaterloo.ca/~mwg (Mark Giesbrecht), https://cs.uwaterloo.ca/~jharalds (Joseph Haraldson), https://cs.uwaterloo.ca/~glabahn (George Labahn) Matrix polynomials and their Smith forms are used in many areas of computational algebra, control systems theory, differential equations and mechanics. The Smith form is important as it effectively reveals the structure of the polynomial lattice of rows and columns, as well as the effects of localizing at individual eigenvalues. That is, it characterizes how the rank decreases as the variable t is set to different values (especially eigenvalues, where the rank drops). The Smith form is closely related to the more general Smith-McMillan form for matrices of rational functions, a form that reveals the structure of the eigenvalue at infinity, or the infinite spectral structure. The eigenvalue at infinity is non-trivial if the leading coefficient matrix is rank deficient or equivalently, the determinant does not achieve the generic degree. The algebra of matrix polynomials is typically described assuming that the coefficients are fixed and come from an exact arithmetic domain, usually the field of real or complex numbers. In this exact setting, computing the Smith form has been well studied, and very efficient procedures are available (see (Kaltofen and Storjohann, 2015) and the references therein). However, in some applications, particularly control theory and mechanics, the coefficients can come from measured data or contain some amount of uncertainty. Compounding this, for efficiency reasons such computations are usually performed using floating point to approximate computations in R, introducing roundoff error. As such, algorithms must accommodate numerical inaccuracies and are prone to numerical instability. Numerical methods to compute the Smith form of a matrix polynomial typically rely on linearization and orthogonal transformations (Van Dooren and Dewilde, 1983; Beelen and Van Dooren, 1988; Demmel and Kågstrom¨ , 1993a,b; Demmel and Edelman, 1995) to infer the Smith form of a nearby matrix polynomial via the Jordan blocks in the Kronecker canonical form (see (Kailath, 1980)). These linearization techniques are numerically backwards stable, and for many problems this is sufficient to ensure that the computed solutions are computationally useful when a problem is continuous. Unfortunately, the eigenvalues of a matrix polynomial are not necessarily continuous functions of the coefficients of the matrix polynomial, and backwards stability is not always sufficient to ensure computed solutions are useful in the presence of discontinuities. Previous methods are also unstructured in the sense that the computed non-trivial Smith form may not be the Smith form of a matrix polynomial with a prescribed coefficient structure, for example, maintaining the degree of entries or not introducing additional non-zero entries or coefficients. In extreme instances, the unstructured backwards error can be arbitrarily small, while the structured distance to an interesting Smith form is relatively large. Finally, existing numerical methods can also fail to compute meaningful results on some problems due to uncertainties. Examples of such problems include nearly rank deficient matrix polynomials, repeated eigenvalues or eigenvalues that are close together and other ill-posed instances. In this paper we consider the problem of computing a nearby matrix polynomial with a prescribed spectral structure, broadly speaking, the degrees and multiplicities of the invariant factors in the Smith form, or equivalently the structure and multiplicity of the eigenvalues of the matrix polynomial. The results presented in this paper extend those in the conference paper (Gies- brecht, Haraldson, and Labahn, 2018). This work is not so much about computing the Smith normal form of a matrix polynomial using floating point arithmetic, but rather our focus is on the computation of a nearby matrix polynomial with “an interesting” or non-generic Smith normal form. The emphasis in this work is on the finite spectral structure of a matrix polynomial, since the techniques described are easily generalized to handle the instance of the infinite spectral structure as a special case. The Smith form of a matrix polynomial is not continuous with respect to the usual topology 2 of the coefficients and the resulting limitations of backward stability is not the only issue that needs to be addressed when finding nearest objects in an approximate arithmetic environment. A second issue can be illustrated by recalling the well-known representation of the invariant n×n factors s1;:::; sn of a matrix A 2 R[t] as ratios si = δi/δi−1 of the determinantal divisors δ0; δ1; : : : ; δn 2 R[t], where δ0 = 1; δi = GCD all i × i minors of A 2 R[t]: In the case of 2 × 2 matrix polynomials, computing the nearest non-trivial Smith form is thus equivalent to finding the nearest matrix polynomial whose polynomial entries have a non-trivial GCD. Recall that approximate GCD problems can have infima that are unattainable. That is, there are co-prime polynomials with nearby polynomials with a non-trivial GCD at distances arbitrarily approaching an infimum, while at the infimum itself the GCD is trivial (see, e.g., (Giesbrecht, Haraldson, and Kaltofen, 2019a)). The issue of unattainable infima extends to the Smith normal form. As an example, consider ! t2 − 2t + 1 A = = diag( f; g) 2 [t]2×2: t2 + 2t + 2 R If we look for nearby polynomials ef ;eg 2 R[t] of degree at most 2 such that gcd( ef ;eg) = γt + 1 2 2 (i.e. a nontrivial gcd) at minimal distance k f − ef k2 + kg − egk2 for some γ 2 R, then it is shown in (Haraldson, 2015, Example 3.3.6) that this distance is (5γ4 − 4γ3 + 14γ2 + 2)=(γ4 + γ2 + 1). This distance has an infimum of 2 at γ = 0. However at γ = 0 we have gcd( ef ;eg) = 1 even though deg(gcd( ef ;eg)) > 0 for all γ , 0. For A to have a non-trivial Smith form we must perturb f; g such that they have a non-trivial GCD, and thus any such perturbation must be at a distance of at least 2. However, the perturbation of distance precisely 2 has a trivial Smith form. There is no merit to perturbing the off-diagonal entries of A. Our work indirectly involves measuring the sensitivity to the eigenvalues of A and the determinant of A. Thus we differ from most sensitivity and perturbation analysis (e.g., (Stewart, 1994; Ahmad and Alam, 2009)), since we also study how perturbations affect the invariant factors, in- stead of the roots of the determinant. Additionally our theory is able to support the instance of A being rank deficient and having degree exceeding one. One may also approach the problem geometrically in the context of manifolds (Edelman et al., 1997, 1999). We do not consider the manifold approach directly since it does not yield numerical algorithms. Determining what it means for a matrix polynomial to have a non-trivial Smith form numerically and finding the distance from one matrix polynomial to another matrix polynomial having an interesting or non-trivial Smith form are formulated as finding solutions to continuous optimization problems. The main contributions of this paper are deciding when A has an interesting Smith form, providing bounds on a “radius of triviality” around A and a structured stability analysis on iterative methods to compute a structured matrix polynomial with desired spectral properties. The remainder of the paper is organized as follows. In Section2 we give the notation and terminology along with some needed background used in our work.

Load more