Spec. Matrices 2022; 10:1–8 Research Article Open Access Christian Grussler and Anders Rantzer* On the similarity to nonnegative and Metzler Hessenberg forms https://doi.org/10.1515/spma-2020-0140 Received March 24, 2021; accepted May 24, 2021 Abstract: We address the issue of establishing standard forms for nonnegative and Metzler matrices by consid- ering their similarity to nonnegative and Metzler Hessenberg matrices. It is shown that for dimensions n ≥ 3, there always exists a subset of nonnegative matrices that are not similar to a nonnegative Hessenberg form, which in case of n = 3 also provides a complete characterization of all such matrices. For Metzler matrices, we further establish that they are similar to Metzler Hessenberg matrices if n ≤ 4. In particular, this provides the rst standard form for controllable third order continuous-time positive systems via a positive controller- Hessenberg form. Finally, we present an example which illustrates why this result is not easily transferred to discrete-time positive systems. While many of our supplementary results are proven in general, it remains an open question if Metzler matrices of dimensions n ≥ 5 remain similar to Metzler Hessenberg matrices. Keywords: Nonnegative matrices, Hessenberg form, controller-Hessenberg form, positive systems, nonnega- tive matrix factorization MSC 2020: 15A21, 15B48, 93C05, 93C28 1 Introduction Similarity transformation of matrices into structured standard forms is among the most fundamental tools in linear algebra. One of the most common real-valued forms is the upper Hessenberg form, i.e., matrices which are zero except for possible non-zero elements above and on the rst lower diagonal [4]. Non-uniqueness of this form, however, raises the question whether it is possible to incorporate and preserve additional proper- ties. In this work, we consider the following problem: When is a nonnegative matrix similar to a nonnegative Hessenberg matrix? We also ask this question when nonnegativity is replaced by Metzler, i.e., matrices that are nonnegative apart R5×5 from their main diagonals. Besides a necessary condition for trace-zero nonnegative matrices in ≥0 [10], little appears to be known about this problem. Rn×n Similar to standard numerical approaches that transform a nonnegative matrix A 2 ≥0 into upper n−1×n−1 Hessenberg form [4], in this work, we seek a nonnegative similarity transformation T2 2 R such that ! ≥0 1 0 the block-diagonal matrix T = yields a nonnegative/Metzler 0 T2 ! ! −1 −1 a11 c2 a11 c2T2 T AT = T T = −1 −1 b2 A2 T2 b T2 A2T2 Christian Grussler: University of California Berkeley, Department of Electrical Engineering and Computer Sciences, Berkeley, CA, E-mail: [email protected] *Corresponding Author: Anders Rantzer: Lund University, Department of Automatic Control, Lund, Sweden, E-mail: [email protected] Open Access. © 2021 Christian Grussler and Anders Rantzer, published by De Gruyter. This work is licensed under the Creative Commons Attribution alone 4.0 License. 2 Ë Christian Grussler and Anders Rantzer −1 Rn−1 −1 with T2 b2 = e1 being the rst canonical unit vector in and T2 A2T2 an upper Hessenberg matrix. As T2 can be found recursively by replacing A with A2, our investigations will start with the case of n = 2 to characterize the case n = 3, which then leads to n = 4. Our main complete characterizations are: R3×3 I. A 2 ≥0 is similar to a nonnegative Hessenberg matrix if and only if A does not have a negative eigen- value of geometric multiplicity 2. II. Every Metzler A 2 Rn×n with n ≤ 4 is similar to a Metzler Hessenberg matrix. Rn×n Further, we note that all A 2 ≥0 with a negative eigenvalue of geometric multiplicity n −1 cannot be similar to a nonnegative Hessenberg matrix. In the language of system and control theory, our approach is equivalent to nding a state-space trans- n−1×n−1 formation T2 2 R to a so-called discrete-time (DT) positive system [6] (A2, b2, c2): a state-space system x(t + 1) = A2x(t) + b2u(t), y(t) = c2x(t), −1 −1 with nonnegative system matrices A2, b2, c2; such that (T2 A2T2, T2 b2, c2T2) is a positive system in −1 −1 controller-Hessenberg form [4], i.e., T2 A2T2 is a nonnegative upper Hessenberg matrix, T2 b2 = e1 and c2T2 are nonnegative vectors. In other words, our characterization also attempts to answer the question when a positive systems can be realized in positive controller standard form. So far, this question has only been im- plicitly considered for minimal second-order systems [6]. For continuous-time (CT) positive systems, which are dened by replacing A2 with a Metzler matrix and x(t + 1) with x˙(t), our results rely on a new characteri- zation of all CT positive systems with state-space dimension 3 that can be transformed into a positive system with controller-Hessenberg form. As this also remains true for DT positive systems, where A2 is suciently diagonally dominant, we can further conclude that Item II also applies for suciently diagonally dominant A 2 R4×4. Unfortunately, as demonstrated by an illustrative example, a general characterization in DT is less trivial. While many of our techniques and tools can be extend to structured cases of larger dimensions, it remains an open question whether a Metzler Hessenberg form also exists for n ≥ 5. The main reason for this analysis limitation is our usage of the fact that polyhedral cones, which are embedded in a 2-dimensional subspace, can be generated by at most two extreme rays. As this is not necessarily true for cones that are embedded in a higher dimensional subspace (see nonnegative rank [2, 7]), our analysis does not simply extend. Finally, note that our results can also be seen towards standard forms for the nonnegative/Metzler inverse eigenvalue problem (see, e.g., [3, 5, 9]), a problem which shares the issue that complete solutions for n > 4 are missing. 2 Preliminaries n×m n For real valued matrices X = (xij) 2 R , including vectors x = (xi) 2 R , we say that X is nonnega- Rn×m R tive, X ≥ 0 or X 2 ≥0 , if all elements xij 2 ≥0; corresponding notation are used for positive matri- ces. Further, X 2 Rn×n is called Metzler, if all its o-diagonal entries are nonnegative. If X 2 Rn×n, then σ(X) = fλ1(X), ... , λn(X)g denotes its spectrum, where the eigenvalues are ordered by descending absolute value, i.e., λ1(X) is the eigenvalue with the largest magnitude, counting multiplicity. In case that the magni- tude of two eigenvalues coincides, we sub-sort them by decreasing real part. A matrix X is called reducible, if there exists a permutation matrix π 2 Rn×n such that ! X * πT Xπ = 1 , 0 X2 where X1 and X2 are square matrices. Otherwise, X is irreducible. A spectral characterization of nonnegative matrices is given by the celebrated Perron-Frobenius Theorem [1]. Nonnegative and Metzler Hessenberg forms Ë 3 Rn×n Proposition 1 (Perron-Frobenius). Let A 2 ≥0 . Then, i. λ1(A) ≥ 0. ii. If λ1(A) has algebraic multiplicity m0, then A has m0 linearly independent nonnegative eigenvectors related to λ1(A). iii. If A is irreducible, then m0 = 1, λ1(A) > 0 has a positive eigenvectors which is the only nonnegative eigen- vector to A. k iv. A is primitive, i.e., there exists a k 2 N such that A > 0, if and only if A is irreducible with λ1(A) > jλ2(A)j. n×n X 2 R is an upper Hessenberg matrix if xij = 0 for all i, j with i > j + 1. The identity matrix of any applicable n size will be denoted by I and its columns, the canonical basic unit vectors in R , are written as e1, ... , en. n×m Using the abbreviation (k1 : k2) = fk1, ... , k2g ⊂ N, we write XI,J for sub-matrices of X 2 R , where I ⊂ (1 : n) and J ⊂ (1 : j). The element-wise absolute value is written as jXj = (jxijj) and we dene ! X1 0 blkdiag(X1, X2) := 0 X2 n1×n1 n2×n2 n for X1 2 R and X2 2 R . For a subset S ⊂ R , we write int(S), ∂(S), conv(S) and cone(S) for its interior, boundary, convex hull and convex conic hull. We also use cone(X), X 2 Rn×m, to denote the convex polyhedral cone that is generated by the columns of X. The set of complex numbers with positive real part is denoted by C>0. 3 Nonnegative and Metzler Hessenberg forms In this section, we present our main results addressing the questions that were raised in the introduction. Rn×n R Trivially, all A 2 ≥0 with σ(A) ⊂ ≥0 are similar to a nonnegative Hessenberg matrix, simply by computing its Jordan normal form. Therefore, our investigations will start with σ(A) ⊄ R≥0. We begin by noticing that not every nonnegative matrix is similar to a nonnegative Hessenberg matrix. Rn×n Proposition 2. For A 2 ≥0 , the following are equivalent: i. λ2(A) < 0 has geometric multiplicity n − 1. T Rn ii. A = c(uv − sI), where u, v 2 >0, 0 ≤ s ≤ minifui vig and c > 0. Rn×n Proof. i. ) ii.: Let A 2 ≥0 have λ2(A) < 0 with geometric multiplicity n − 1. Then, A − λ2(A)I = T (λ1(A) − λ2(A))uv , where v and u are left- and right-eigenvectors of A. Since λ1(A) > 0 is the only nonnega- Rn tive eigenvalue of A, A cannot be reducible, which by Proposition 1 implies that u, v 2 >0.
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages8 Page
-
File Size-