Arxiv:1805.04488V5 [Math.NA]
Total Page:16
File Type:pdf, Size:1020Kb
GENERALIZED STANDARD TRIPLES FOR ALGEBRAIC LINEARIZATIONS OF MATRIX POLYNOMIALS∗ EUNICE Y. S. CHAN†, ROBERT M. CORLESS‡, AND LEILI RAFIEE SEVYERI§ Abstract. We define generalized standard triples X, Y , and L(z) = zC1 − C0, where L(z) is a linearization of a regular n×n −1 −1 matrix polynomial P (z) ∈ C [z], in order to use the representation X(zC1 − C0) Y = P (z) which holds except when z is an eigenvalue of P . This representation can be used in constructing so-called algebraic linearizations for matrix polynomials of the form H(z) = zA(z)B(z)+ C ∈ Cn×n[z] from generalized standard triples of A(z) and B(z). This can be done even if A(z) and B(z) are expressed in differing polynomial bases. Our main theorem is that X can be expressed using ℓ the coefficients of the expression 1 = Pk=0 ekφk(z) in terms of the relevant polynomial basis. For convenience, we tabulate generalized standard triples for orthogonal polynomial bases, the monomial basis, and Newton interpolational bases; for the Bernstein basis; for Lagrange interpolational bases; and for Hermite interpolational bases. We account for the possibility of common similarity transformations. Key words. Standard triple, regular matrix polynomial, polynomial bases, companion matrix, colleague matrix, comrade matrix, algebraic linearization, linearization of matrix polynomials. AMS subject classifications. 65F15, 15A22, 65D05 1. Introduction. A matrix polynomial P (z) ∈ Fm×n[z] is a polynomial in the variable z with coef- ficients that are m by n matrices with entries from the field F. We will use F = C, the field of complex k numbers, in this paper. Typically an expression in the monomial basis φk(z) = z is given for P (z), and often only regular matrix polynomials are considered, that is, with m = n (we will use n for the dimension) and where det P (z) is not identically zero. Matrix polynomials have many applications and their study is of both classic and ongoing interest. See the classic work [20] and the surveys [21] and [29] for theory and applications. In this paper, as is done in [2], we consider the case when any polynomial basis φk(z) is used. We require that the set {φk(z)} for 0 ≤ k ≤ ℓ forms a basis for polynomials of grade ℓ. The word “grade” is short for “degree at most.” Thus, we write our regular matrix polynomial as ℓ (1.1) P (z)= Pkφk(z) , k X=0 n×n 1 where the matrices Pk ∈ F are square, and the degree of P (z) is at most ℓ (P (z) has grade ℓ). arXiv:1805.04488v5 [math.NA] 25 Mar 2021 ∗Received by the editors on Month/Day/Year. Accepted for publication on Month/Day/Year. Handling Editor: Name of Handling Editor. Corresponding Author: Name of Corresponding Author †Centre for Medical Evidence, Decision Integrity and Clinical Impact (MEDICI Centre), Department of Anesthesia and Perioperative Medicine, Schulich School of Medicine and Dentistry, Western University ([email protected]) ‡Ontario Research Centre for Computer Algebra, School of Mathematical and Statistical Sciences, Western University ([email protected]) §Ontario Research Centre for Computer Algebra, School of Mathematical and Statistical Sciences, Department of Applied Mathematics, Western University (lrafi[email protected]) 1The degree of a matrix polynomial P (z) is defined as follows. If P (z) is identically zero, the degree is −∞. Otherwise, if P ℓ A k A P (z)= Pk=0 kz expressed in the monomial basis and ℓ is not the zero matrix, then the degree of (z) is ℓ. 1 The notion of “grade” is useful even for the monomial basis, but it is especially useful if the basis is n n k n−k an interpolational basis or the Bernstein basis φk(z) = Bk (z) = k z (1 − z) , when the degree of the polynomial may not be clear from the data. For more information about matrix polynomials, consult [27]. See also [29], [21], and consult the seminal book [20]. Linearizations using different polynomial bases were first systematically studied in [2]; for a more up-to-date treatment see [5]. Some other recent papers of interest include [4], [28], [12], [17], and [32]; this is a very active area. See also [1]. In that paper, standard triples for structured matrices are studied. In [9] several proofs are given of strict equivalence of various linearizations to the standard second linearization for the monomial basis, which we will use without much further comment in this paper. 1.1. Organization of the Paper. In Section 1.2, we establish notation, give the definitions of algebraic linearization and of generalized standard triples. We define this last in Definition 1.1 with reference to the representation in Equation (1.7). In Section 1.3, we show how to use a generalized standard triple in the construction of algebraic linearizations. We also give a proof, by construction of the necessary equivalence matrices E(z) and F (z), that algebraic linearizations are local linearizations (a very useful notion from [13]) in the same sense that its components are. In particular, if the components A and B are linearizations in the sets ΣA and ΣB respectively, then the “algebraic linearization” is a linearization in the intersection ΣA ∩ ΣB. In Section 2, we prove our main result, giving a universal expression for generalized standard triples for the linearizations using the polynomial bases that appear in Sections 3.1, 3.2, and 3.3. In Section 3, we introduce all the polynomial bases by scalar examples. We sum up in the final section. 1.2. Notation and Definitions. Two matrix polynomials P1(z) and P2(z) are called unimodularly equivalent if there exist unimodular matrix polynomials (that is, matrix polynomials with constant nonzero determinant) E(z) and F (z) with P1(z) = E(z)P2(z)F (z). A matrix pencil L(z) := zC1 − C0 is called a linearization of the matrix polynomial P (z) if both C1 and C0 are of dimension N ≥ n and L(z) is unimodularly equivalent to the block diagonal matrix diag(P (z), IN−n). Two linearizations Lm(z) and Lφ(z) are called strictly equivalent if the corresponding matrices are equivalent in the following stronger sense: C1,m = EC1,φF and C0,m = E(z)C0,φF (z), with the same invertible (constant) matrices E and F . In the paper [13] we find the powerful notion of a local linearization on a subset Σ ⊂ F. In the notation of their Definition 2.1, two rational matrices G1(z) and G2(z) are said to be equivalent in a nonempty set Σ if there exist matrices R1(z) and R2(z) each nonsingular in Σ with R1(z)G1(z)R2(z) = G2(z). This notion, intended for use with matrix rational functions, is also useful for matrix polynomials, as we will see. There is a related idea to linearization, that of “companion pencil” (A, B), where the only requirement is that det P (z) = K det (zB − A) for some nonzero constant K. A companion pencil, therefore, has the same eigenvalues as the matrix polynomial. Companion pencils that are not linearizations do not necessarily preserve eigenvectors or elementary divisors, and are less useful than linearizations. The usual reversal2 of a matrix polynomial of grade ℓ is the polynomial rev P (z) = zℓP (z−1). A 2This definition, which is standard, is particularly appropriate for the monomial basis. The coefficients of the reversed matrix polynomial in the monomial basis are simply the same matrices in reverse order. The notion of a reversal, however, is independent of the basis used, and indeed reversals can be done differently. In [10] for instance we find a slightly different definition of reversal, appropriate for computation in a Lagrange or Hermite interpolational basis, which maps an arbitrary finite point to infinity; this difference allows for greater numerical stability. 2 linearization L(z) = zC1 − C0 of P is called a strong linearization if rev L(z) = C1 − zC0 is also a linearization of rev P (z). N×N If L(z) = zC1 − C0 ∈ C [z] is a linearization of P (z) in z ∈ Σ, then as a necessary consequence det(P (z)) = q(z)det(L(z)) = det(zC1 − C0) for some rational q(z) whose zeros and poles lie outside Σ. The eigenvalues of P in Σ are thus computable from the generalized eigenvalues of L. For this linearized problem several standard methods are available. A standard pair (X, T ) for a regular matrix polynomial P (z) expressed in the monomial basis with coefficients Pk is defined in [20] or in [27] as having the following properties: X has dimension n × nℓ, T has dimension nℓ × nℓ, ℓ k (1.2) PkXT =0 , k X=0 and that the nℓ by nℓ matrix X XT (1.3) Q = . . XT ℓ−1 is nonsingular. We can then define a third matrix 0n . (1.4) Y = Q−1 . 0 n I n and say that the triple (X, T , Y ) is a standard triple for a monic P (z). It is pointed out in [27] that monicity of P (z) is not required for many of the formulæ to do with standard pairs (but is required for some). Theorem 12.1.4 of [19] states that if there are matrices X, T , and Y of dimension n × nℓ, nℓ × nℓ, and nℓ × n for which −1 −1 (1.5) P (z)= X(zIn − T ) Y then (X, T , Y ) is a standard triple for P (z). This is also reported in Lemma 2 in [1]. There are two other representations of a matrix polynomial given a standard triple: the right canonical form, and the left canonical form.