Bounds on the Singular Values of Matrices with Displacement Structure and in Doing So Justify Many of the Low Rank Techniques

SIAM REVIEW \bigcirc c 2019 Society for Industrial and Applied Mathematics Vol. 61, No. 2, pp. 319–344 Bounds on the Singular Values of Matrices \ast with Displacement Structure Bernhard Beckermanny Alex Townsendz Abstract. Matrices with displacement structure, such as Pick, Vandermonde, and Hankel matrices, appear in a diverse range of applications. In this paper, we use an extremal problem in- volving rational functions to derive explicit bounds on the singular values of such matrices. For example, we show that the kth singular value of a real n \times n positive definite Hankel - k= log n matrix, Hn, is bounded by C\rho kHnk2 with explicitly given constants C > 0 and \rho > 1, where kHnk2 is the spectral norm. This means that a real n \times n positive definite Hankel matrix can be approximated, up to an accuracy of \epsilon kHnk2 with 0 < \epsilon < 1, by a rank O(log n log(1/\epsilon )) matrix. Analogous results are obtained for Pick, Cauchy, real Vandermonde, L\"owner,and certain Krylov matrices. Key words. singular values, displacement structure, Zolotarev, rational AMS subject classiﬁcations. 15A18, 26C15 DOI. 10.1137/19M1244433 1. Introduction. Matrices with rapidly decaying singular values frequently appear in computational mathematics. Such matrices are numerically of low rank, and this is exploited in applications such as particle simulations [33], model reduction [2], boundary element methods [35], and matrix completion [20]. However, it can be theoretically challenging to fully explain why low rank techniques are so effective in practice. In this paper, we derive explicit bounds on the singular values of matrices with displacement structure and in doing so justify many of the low rank techniques that are being employed on such matrices. m\times n m\times m n\times n Let X 2 C with m \geq n, A 2 C , and B 2 C . We say that X has an (A; B)-displacement rank of \nu if X satisfies the Sylvester matrix equation given by (1.1) AX - XB = MN \ast for some matrices M 2 Cm\times \nu and N 2 Cn\times \nu . Matrices with displacement structure include Toeplitz (\nu = 2), Hankel (\nu = 2), Cauchy (\nu = 1), Krylov (\nu = 1), and Van- dermonde (\nu = 1) matrices, as well as Pick (\nu = 2), Sylvester (\nu = 2), and L\"owner (\nu = 2) matrices. Fast algorithms for computing matrix-vector products and for \ast Published electronically May 8, 2019. This paper originally appeared in SIAM Journal on Matrix Analysis and Applications, Volume 38, Number 4, 2017, pages 1227{1248, under the title \On the Singular Values of Matrices with Displacement Structure." http://www.siam.org/journals/sirev/61-2/M124443.html Funding: The work of the first author was supported in part by the Labex CEMPI (ANR-11- LABX-0007-01). The work of the second author was supported by National Science Foundation grant 1818757. y Downloaded 07/04/19 to 91.110.173.243. Redistribution subject SIAM license or copyright; see http://www.siam.org/journals/ojsa.php Laboratoire Paul Painlev\'eUMR 8524 CNRS, Equipe ANO-EDP, UFR Math\'ematiques,UST Lille, F-59655 Villeneuve d'Ascq CEDEX, France ([email protected]). zDepartment of Mathematics, Cornell University, Ithaca, NY 14853 ([email protected]). 319 Copyright © by SIAM. Unauthorized reproduction of this article is prohibited. 320 BERNHARD BECKERMANN AND ALEX TOWNSEND Table 1.1 Summary of the bounds proved on the singular values of matrices with displacement structure. For the singular value bounds to be valid for Cm;n and Ln, mild \separation conditions" must hold (see section 4). The numbers \rhoj and Cj for j = 1;:::; 6 are given explicitly in their corresponding sections. Matrix class Notation Singular value bound Ref. - k Pick Pn \sigma 1+2k(Pn) \leq C 1\rho1 kPnk2 sec. 4.1 - k Cauchy Cm;n \sigma1+ k(Cm;n) \leq C2\rho kCm;nk2 sec. 4.2 2 - k L\"owner Ln \sigma1+2 k(Ln) \leq C 3\rho3 kLnk2 sec. 4.3 Krylov, Herm. arg. K \sigma (K ) \leq C \rho - k= log nkK k sec. 5.1 m;n 1+2k m;n 4 4 m;n 2 - k= log n real Vandermonde Vm;n \sigma1+2 k(Vm;n) \leq C5\rho5 kVm;nk2 sec. 5.1 - k= log n pos. semidef. Hankel Hn \sigma1+2 k(Hn) \leq C6\rho 6 kHnk2 sec. 5.2 solving systems of linear equations can be derived for many of these matrices by exploiting (1.1) [36, 42]. In this paper, we use the displacement structure to derive explicit bounds on the singular values of matrices that satisfy (1.1) by using an extremal problem for rational functions from complex approximation theory. In particular, we prove that the following inequality holds (see Theorem 2.1): (1.2) \sigmaj +\nu k(X) \leq Zk(E; F )\sigmaj (X); 1 \leq j + \nu k \leq n; where \sigma1 (X); : : : ; \sigman (X) denote the singular values of X and Zk(E; F ) is the Zolotarev number for complex sets E and F that depend on A and B (see (1.4)). Researchers have previously exploited the connection between the Sylvester matrix equation and Zolotarev numbers for selecting algorithmic parameters in the alternating direction implicit (ADI) method [8, 15, 38], and others have demonstrated that the singular values of matrices satisfying certain Sylvester matrix equations have rapidly decaying singular values [2, 4, 48]. Here, we derive explicit bounds on all the singular values of structured matrices. Table 1.1 summarizes our main singular value bounds. Not every matrix with displacement structure is numerically of low rank. For example, the identity matrix is a full rank Toeplitz matrix and the exchange matrix1 is a full rank Hankel matrix. Moreover, we show in Example 5.1 that the inequality in (1.2) is trivial for circulant as well as Toeplitz matrices. The properties of A and B in (1.1) are crucial. If A and B are normal matrices, then one expects X to be numerically of low rank only if the eigenvalues of A and B are well separated (see Theorem 2.1). If A and B are both not normal, then descriptive bounds on the numerical rank of X are more subtle (see Corollary 2.2, [4], and [48, Lemma 1]). By the Eckart{Young theorem [30, Theorem 2.4.8], singular values measure the distance in the spectral norm from X to the set of matrices of a given rank, i.e., \sigma (X) = min \bigl\{k X - Y k : Y 2 m\times n; rank( Y ) = j - 1\bigr\} : j 2 C For an 0 < \epsilon < 1, we say that the \epsilon -rank of a matrix X is k if k is the smallest integer such that \sigmak +1(X) \leq \epsilon kXk2. That is, (1.3) rank\epsilon (X) = min fk : \sigmak +1(X) \leq \epsilon kXk2g : k\geq 0 Downloaded 07/04/19 to 91.110.173.243. Redistribution subject SIAM license or copyright; see http://www.siam.org/journals/ojsa.php 1The n\times n exchange matrix X is obtained by reversing the order of the rows of the n\times n identity matrix, i.e., Xn - j+1;j = 1 for 1 \leq j \leq n. Copyright © by SIAM. Unauthorized reproduction of this article is prohibited. SINGULAR VALUES OF MATRICES WITH DISPLACEMENT 321 Table 1.2 Summary of the upper bounds proved on the \epsilon -rank of matrices with displacement structure. For the bounds above to be valid for Cm;n and Ln, mild \separation conditions" must hold (see section 4). The number \gamma is the absolute value of the cross-ratio of a, b, c, and d; see (3.7). The first three rows show an \epsilon -rank of at most O(log \gamma log(1/\epsilon )), and the last three rows show an \epsilon -rank of at most O(log n log(1/\epsilon )). Matrix class Notation Upper bound on rank\epsilon (X) Ref. 2 Pick Pn 2dlog(4b=a) log(4 /\epsilon )/\pi e sec. 4.1 2 Cauchy Cm;n dlog(16\gamma ) log(4/\epsilon )/\pi e sec. 4.2 2 L\"owner Ln 2dlog(16\gamma ) log(4 /\epsilon) /\pi e sec. 4.3 2 Krylov, Herm. arg. Km;n 2d4 log(8bn=2c/\pi ) log(4 /\epsilon )/\pi e + 2 sec. 5.1 real Vandermonde V 2d4 log(8bn=2c/\pi ) log(4/\epsilon )/\pi 2e + 2 sec. 5.1 m;n 2 pos. semidef. Hankel Hn 2d2 log(8bn=2c/\pi ) log(16 /\epsilon )/\pi e + 2 sec. 5.2 Thus, we may approximate X to a precision of \epsilon kXk2 by a rank k = rank\epsilon (X) matrix. An immediate consequence of explicit bounds on the singular values of certain matrices is a bound on the \epsilon -rank. Table 1.2 summarizes our main upper bounds on the \epsilon -rank of matrices with displacement structure. The form of the inequalities in (1.2) also allows one to use Zolotarev numbers to bound the \epsilon -rank of matrices when measured in the Frobenius norm [51, Lemma 5.1], which is a key observation to extending the bounds in this paper to tensors [51]. Zolotarev numbers have already proved useful for deriving tight bounds on the condition number of matrices with displacement structure [5, 6], where the condition number of a rectangular m \times n matrix X is given by \kappa2 (X) = \sigma 1(X)/\sigmamin( m;n)(X). For example, the first author proved that a real n \times n positive definite Hankel matrix, H , with n \geq 3, is exponentially ill-conditioned [6].

Bounds on the Singular Values of Matrices with Displacement Structure and in Doing So Justify Many of the Low Rank Techniques

Invertibility Condition of the Fisher Information Matrix of a VARMAX Process and the Tensor Sylvester Matrix

Beam Modes of Lasers with Misaligned Complex Optical Elements

Generalized Sylvester Theorems for Periodic Applications in Matrix Optics

A Note on the Inversion of Sylvester Matrices in Control Systems

(Integer Or Rational Number) Arithmetic. -Computing T

Evaluation of Spectrum of 2-Periodic Tridiagonal-Sylvester Matrix

Uniqueness of Solution of a Generalized ⋆-Sylvester Matrix

Matrix Algebraic Properties of the Fisher Information Matrix of Stationary Processes

Polynomial Bezoutian Matrix with Respect to a General Basisୋ Z.H

On the Classic Solution of Fuzzy Linear Matrix Equations

On the Sylvester-Like Matrix Equation $ AX+ F (X) B= C$

The Sylvester and Bézout Resultant Matrices for Blind Image Deconvolution