
Electronic Transactions on Numerical Analysis, Volume 46, pp. 460–473, 2017. Copyright 2017, Kent State University. ISSN 1068–9613. ETNA, Kent State University and Johann Radon Institute (RICAM).

THE BLOCK HESSENBERG PROCESS FOR MATRIX EQUATIONS∗

M. ADDAM†, M. HEYOUNI†, AND H. SADOK‡

Abstract. In the present paper, we first introduce a block variant of the Hessenberg process and discuss its properties. Then, we show how to apply the block Hessenberg process in order to solve linear systems with multiple right-hand sides. More precisely, we define the block CMRH method for solving linear systems that share the same coefficient matrix. We also show how to apply this process for solving discrete Sylvester matrix equations. Finally, numerical comparisons are provided in order to compare the proposed new algorithms with other existing methods.

Key words. Block Krylov subspace methods, Hessenberg process, Arnoldi process, CMRH, GMRES, low-rank matrix equations.

AMS subject classifications. 65F10, 65F30

1. Introduction. In this work, we are first interested in solving s systems of linear equations with the same coefficient matrix and different right-hand sides of the form

(1.1) $A\,x^{(i)} = y^{(i)}, \quad 1 \le i \le s,$

where A is a large and sparse n × n real matrix, $y^{(i)}$ is a real column vector of length n, and $s \ll n$. Such linear systems arise in numerous applications in computational science and engineering such as wave propagation phenomena, quantum chromodynamics, and dynamics of structures [5, 9, 36, 39]. When n is small, it is well known that the solution of (1.1) can be computed by a direct method such as LU or Cholesky factorization. Note that the factorization needs to be carried out only once and the resulting upper and lower triangular systems are solved at low cost. Let $Y = [y^{(1)}, \dots, y^{(s)}] \in \mathbb{R}^{n\times s}$, $X = [x^{(1)}, \dots, x^{(s)}] \in \mathbb{R}^{n\times s}$, and assume that all s vectors $y^{(i)}$ are available simultaneously. Then the above systems can be written as

(1.2) AX = Y.
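When A is small enough for a direct method, this amounts to computing one factorization and reusing it for all columns of Y; for instance, in Matlab (a minimal sketch, not one of the algorithms developed in this paper):

    % Factor A once, then reuse the triangular factors for all right-hand sides.
    [L, U, P] = lu(A);          % one LU factorization with partial pivoting
    X = U \ (L \ (P * Y));      % one forward/backward substitution per column of Y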

In the last two decades, block Krylov subspace methods for block linear systems of the form (1.2) have been developed. These iterative methods are suitable when n is large and when the matrix A is not explicitly available. For symmetric and positive definite matrices A, O’Leary presented in [31] a block conjugate gradient (BCG) method. Other variants of the BCG algorithm and generalizations to nonsymmetric matrices were presented in [30, 31]. Generalizations of classical and robust Krylov methods for solving a linear system such as GMRES, QMR, and BiCG-Stab to the block case are respectively considered in [28, 38, 40, 16], [19], and [18]. Parallel implementations of block Krylov solvers are discussed in [3, 6, 7, 10] and in the references therein. We also consider, in this paper, the solution of the low-rank Sylvester matrix equation

(1.3) $A\,X\,B - X = C\,F^T,$

where $X \in \mathbb{R}^{n\times p}$, $A \in \mathbb{R}^{n\times n}$, $B \in \mathbb{R}^{p\times p}$, $C \in \mathbb{R}^{n\times r}$, and $F \in \mathbb{R}^{p\times r}$ with $r \ll \min\{n, p\}$.

∗Received December 29, 2015. Accepted September 2, 2017. Published online on November 20, 2017. Recommended by L. Reichel.
†ENSA d'Al-Hoceima, Université Mohammed Premier, Oujda, Maroc ({[email protected], [email protected]}).
‡L.M.P.A, Université du Littoral, 50 rue F. Buisson BP699, F-62228 Calais Cedex, France ([email protected]).


Matrix Sylvester equations have numerous applications in filtering and image restoration [8]. They are also encountered in control and communication theory, model reduction problems, feedback stabilization, and pole-placement problems [4, 13, 12]. In order to ensure the existence of a unique solution, we assume that the matrices A and B of every Sylvester matrix equation satisfy $\mu_i(A)\,\mu_j(B) \neq 1$ for all $i = 1, \dots, n$, $j = 1, \dots, p$, where $\mu_k(Z)$ denotes the k-th eigenvalue of the matrix Z.

2. The block Hessenberg process. If computations are carried out in exact arithmetic, then, similar to the classical Hessenberg process with pivoting strategy [41], the block Hessenberg process generates a lower trapezoidal basis $\mathcal{V}_m = [V_1, \dots, V_m] \in \mathbb{R}^{n\times ms}$ of the block Krylov subspace

$\mathcal{K}_m(A, R) = \Big\{ X \in \mathbb{C}^{n\times s} \;\Big|\; X = \sum_{i=0}^{m-1} A^i R\,\Omega_i,\ \Omega_i \in \mathbb{R}^{s\times s} \text{ for } i = 0, \dots, m-1 \Big\} \subset \mathbb{C}^{n\times s},$

where R is a given n × s column block vector. The first block vector $V_1$ is obtained by performing an LU decomposition with partial pivoting of the given block vector R. This means that if $P_1\,R = L_1\,\Gamma$ is the PLU decomposition of R, where $P_1 \in \mathbb{R}^{n\times n}$ is a permutation matrix, $L_1 \in \mathbb{R}^{n\times s}$ is a unit lower trapezoidal matrix, and $\Gamma \in \mathbb{R}^{s\times s}$ is an upper triangular matrix, then

$V_1 = P_1^T\,L_1 = R\,\Gamma^{-1}.$

We note that Γ and $V_1$ can be computed using the lu Matlab function: $[V_1, \Gamma] = \mathrm{lu}(R)$. Let $i_j$ ($j = 1, \dots, s$) be the index of the row of $V_1$ corresponding to the j-th row of $L_1$, and let $e_i = [0, \dots, 0, 1, 0, \dots, 0]^T$ be the i-th vector of the canonical basis of $\mathbb{R}^n$. Then we define $p_1 = (i_1, \dots, i_s)$ and the n × s matrix $\widetilde{E}_1 = [e_{i_1}, \dots, e_{i_s}]$, which corresponds to the s first columns of $P_1^T$. The vector $p_1$ can be obtained using the max Matlab function: $[\sim, p_1] = \mathrm{max}(V_1)$. Now, suppose that the block vectors $V_1, \dots, V_k$ have been computed and the permutation vectors $p_2, \dots, p_k$ updated. Then we can generate the block vectors $U^{(k)}_{k+1}$ via

$U^{(0)}_{k+1} = A\,V_k, \quad \text{and} \quad U^{(i)}_{k+1} = A\,V_k - \sum_{j=1}^{i} V_j\,H_{j,k}, \quad \text{for } i = 1, \dots, k,$

where the square matrices $H_{j,k} \in \mathbb{R}^{s\times s}$, $j = 1, \dots, k$, are such that

(2.1) $U^{(i)}_{k+1} \perp \widetilde{E}_1, \dots, \widetilde{E}_i, \quad \text{for } i = 1, \dots, k.$

Thanks to the previous orthogonality condition, we have

$H_{j,k} = \big(V_j(p_j,:)\big)^{-1}\,U^{(j-1)}_{k+1}(p_j,:), \quad \text{for } j = 1, \dots, k.$

Again, letting $P_{k+1}\,U^{(k)}_{k+1} = L_{k+1}\,H_{k+1,k}$ be the PLU decomposition of $U^{(k)}_{k+1}$, we obtain

$V_{k+1} = P_{k+1}^T\,L_{k+1} = U^{(k)}_{k+1}\,H_{k+1,k}^{-1},$

and using the lu Matlab function, the (k+1)-st block vector $V_{k+1}$ and the square upper triangular matrix $H_{k+1,k}$ are given by

$[V_{k+1}, H_{k+1,k}] = \mathrm{lu}(U^{(k)}_{k+1}).$


We end the derivation of the block Hessenberg process by letting $i_j$ be the row index of $V_{k+1}$ which corresponds to the j-th row of $L_{k+1}$ ($j = ks+1, \dots, (k+1)s$), setting $p_{k+1} = (i_{ks+1}, \dots, i_{(k+1)s})$, and defining $\widetilde{E}_{k+1} = [e_{i_{ks+1}}, \dots, e_{i_{(k+1)s}}]$. We also observe that the max Matlab function allows us to update $p_{k+1}$ by

[∼, pk+1] = max(Vk+1).

Finally, a complete statement of the resulting block Hessenberg algorithm reads as follows.

ALGORITHM 1: The block Hessenberg algorithm (with partial pivoting)
• Inputs: A an n × n matrix, R an n × s matrix, and m an integer.
• Step 0. $[V_1, \Gamma] = \mathrm{lu}(R)$; $[\sim, p_1] = \mathrm{max}(V_1)$;
• Step 1. For k = 1, ..., m
    $U^{(0)}_{k+1} = A\,V_k$;
    for j = 1, ..., k
      $H_{j,k} = \big(V_j(p_j,:)\big)^{-1}\,U^{(j-1)}_{k+1}(p_j,:)$;
      $U^{(j)}_{k+1} = U^{(j-1)}_{k+1} - V_j\,H_{j,k}$;
    end(for)
    $[V_{k+1}, H_{k+1,k}] = \mathrm{lu}(U^{(k)}_{k+1})$; $[\sim, p_{k+1}] = \mathrm{max}(V_{k+1})$;
  end(For).

In exact arithmetic and after m steps, the above block Hessenberg procedure leads to the following relation, for k = 1, ..., m,

$$A\,\underbrace{[V_1, \dots, V_k]}_{=\mathcal{V}_k} = \underbrace{[V_1, \dots, V_k, V_{k+1}]}_{=\mathcal{V}_{k+1} = [\mathcal{V}_k,\,V_{k+1}]}
\begin{pmatrix}
H_{1,1} & H_{1,2} & \dots & H_{1,k} \\
H_{2,1} & H_{2,2} & \dots & H_{2,k} \\
0_s & H_{3,2} & \dots & \vdots \\
\vdots & \ddots & \ddots & \vdots \\
0_s & \dots & 0_s & H_{k+1,k}
\end{pmatrix}.$$
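A possible Matlab translation of Algorithm 1 is sketched below. The helper name block_hessenberg and the way the blocks are packed into V and H are our own choices, not notation from the paper.

    function [V, H, Gamma, p] = block_hessenberg(A, R, m)
    % Sketch of Algorithm 1: block Hessenberg process with partial pivoting.
    % V = [V_1,...,V_{m+1}] (n x (m+1)s), H is the (m+1)s x ms block Hessenberg
    % matrix of the H_{j,k}, and p(:,k) stores the pivot rows p_k.
      [n, s] = size(R);
      V = zeros(n, (m+1)*s);  H = zeros((m+1)*s, m*s);  p = zeros(s, m+1);
      [V1, Gamma] = lu(R);                      % Step 0: V_1 and Gamma
      V(:, 1:s) = V1;
      [~, idx] = max(V1);  p(:, 1) = idx(:);    % pivot rows p_1
      for k = 1:m
        ck = (k-1)*s+1 : k*s;
        U  = A * V(:, ck);                      % U^{(0)}_{k+1} = A V_k
        for j = 1:k
          cj  = (j-1)*s+1 : j*s;
          Hjk = V(p(:, j), cj) \ U(p(:, j), :); % H_{j,k} = (V_j(p_j,:))^{-1} U^{(j-1)}(p_j,:)
          U   = U - V(:, cj) * Hjk;             % U^{(j)}_{k+1}
          H(cj, ck) = Hjk;
        end
        [Vk1, Hk1] = lu(U);                     % V_{k+1} and H_{k+1,k}
        V(:, k*s+1:(k+1)*s) = Vk1;
        H(k*s+1:(k+1)*s, ck) = Hk1;
        [~, idx] = max(Vk1);  p(:, k+1) = idx(:);
      end
    end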

For k = m, the above relation can be rewritten as

(2.2) $A\,\mathcal{V}_m = \mathcal{V}_{m+1}\,\widetilde{H}_m$
(2.3) $\quad\;\;\;\;\;\; = \mathcal{V}_m\,H_m + V_{m+1}\,H_{m+1,m}\,E_m^T,$

where $\widetilde{H}_m$ and $H_m$ are, respectively, the (m+1)s × ms and ms × ms block upper Hessenberg matrices whose nonzero block entries are the $H_{j,k}$ generated by Algorithm 1, and $E_m = [0_s, \dots, 0_s, I_s]^T$ is the ms × s rectangular matrix whose m-th block element is $I_s$, the identity matrix of size s.

Letting $P_m = (\widetilde{E}_1, \dots, \widetilde{E}_m) \in \mathbb{R}^{n\times ms}$, which is a permutation matrix, and using (2.1), we also have

(2.4) $P_m^T\,\mathcal{V}_m = L_m, \quad \text{with } L_m \in \mathbb{R}^{ms\times ms},$

where $L_m$ is a unit lower triangular matrix. Now, introducing $\mathcal{V}^L_m := L_m^{-1}\,P_m^T \in \mathbb{R}^{ms\times n}$, we see that $\mathcal{V}^L_m$ is a left inverse of $\mathcal{V}_m$ since, according to (2.4), we have $\mathcal{V}^L_m\,\mathcal{V}_m = I_{ms}$. Pre-multiplying (2.2) and (2.3), respectively, by $\mathcal{V}^L_{m+1}$ and $\mathcal{V}^L_m$, we get

(2.5) $\mathcal{V}^L_{m+1}\,A\,\mathcal{V}_m = \widetilde{H}_m \quad \text{and} \quad \mathcal{V}^L_m\,A\,\mathcal{V}_m = H_m.$
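These relations can be checked numerically with the block_hessenberg sketch given after Algorithm 1 (a hypothetical helper, not a routine from the paper):

    % Numerical check of (2.2) and (2.5) on a random example.
    n = 300; s = 3; m = 5;
    A = randn(n) + n*eye(n);                 % some nonsingular test matrix
    R = randn(n, s);
    [V, H, Gamma, p] = block_hessenberg(A, R, m);
    Vm = V(:, 1:m*s);                        % the basis V_m
    Hm = H(1:m*s, :);                        % the square block Hessenberg matrix H_m
    disp(norm(A*Vm - V*H, 'fro'))            % (2.2): should be near machine precision
    rows = p(:, 1:m);  rows = rows(:);       % rows selected by P_m = (E~_1,...,E~_m)
    W = A * Vm;
    disp(norm(V(rows, 1:m*s) \ W(rows, :) - Hm, 'fro'))   % (2.5): V_m^L A V_m = H_m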


3. The block Hessenberg and CMRH methods. To solve a single linear system, the author in [34] proposed the CMRH method. The CMRH method can be interpreted as a GMRES-like method, but based on the Hessenberg reduction process with pivoting strategy instead of the Arnoldi process [34, 35, 21, 33, 32]. A variant for dense linear systems and a parallel implementation of the CMRH method are described in [14, 15, 24]. The analysis of the performance of the CMRH algorithm combined with the boundary element method when applied to acoustic problems is considered in [1]. In this section, we investigate the block CMRH method for solving multiple linear systems. This new block method uses the block Hessenberg process described in the last section. Block Krylov subspace methods for solving (1.2) are iterative methods that generate approximate solutions $X_k \in \mathbb{R}^{n\times s}$ such that

$X_k - X_0 \in \mathcal{K}_k(A, R_0)$, where $R_0 := Y - A\,X_0$ is the residual block vector associated with an initial guess $X_0$. Let $\mathcal{V}_k = [V_1, \dots, V_k]$ be the permuted trapezoidal matrix and $\widetilde{H}_k$ be the upper block Hessenberg matrix produced after k iterations of the block Hessenberg process applied to the pair $(A, R_0)$. Then we can write

$X_k = X_0 + \mathcal{V}_k\,D_k, \quad \text{where } D_k \in \mathbb{R}^{ks\times s}.$

Since $R_0 = V_1\,\Gamma$, we can use (2.3) and the first relation in (2.5) to get an expression for the residual $R_k$ associated with $X_k$,

$R_k = R_0 - A\,\mathcal{V}_k\,D_k = V_1\,\Gamma - \mathcal{V}_{k+1}\,\widetilde{H}_k\,D_k = \mathcal{V}_{k+1}\big(E_1\,\Gamma - \widetilde{H}_k\,D_k\big),$

where $E_1 \in \mathbb{R}^{(k+1)s\times s}$ corresponds to the first s columns of the identity matrix $I_{(k+1)s}$. Now, using the last equality and requiring the minimal norm residual condition

(3.1) $X_k = \underset{X \in X_0 + \mathcal{K}_k(A, R_0)}{\arg\min} \|Y - A\,X\|_F,$

we see that $D_k$ is the solution of the full n × (k+1)s least-squares problem

(3.2) $\min_{D\in\mathbb{R}^{ks\times s}} \big\|\mathcal{V}_{k+1}\big(E_1\,\Gamma - \widetilde{H}_k\,D\big)\big\|.$

Solving this problem requires computing the QR decomposition of $\mathcal{V}_{k+1}\,\widetilde{H}_k$, which would require $O(n(ks)^2)$ work and $O(nks)$ storage. We would then obtain a method that is mathematically equivalent to block GMRES but useless in practice. So, instead of solving (3.2), we solve a smaller problem, namely minimizing just the norm of the coefficient block vector in (3.2). Hence, we obtain $D_k$ from the minimization problem

$\min_{D\in\mathbb{R}^{ks\times s}} \big\|E_1\,\Gamma - \widetilde{H}_k\,D\big\|.$

If the matrix $\widetilde{H}_k$ is of full rank, then $\widetilde{H}_k^{+} = (\widetilde{H}_k^T\,\widetilde{H}_k)^{-1}\,\widetilde{H}_k^T$, the pseudo-inverse of $\widetilde{H}_k$, is well defined, and so $D_k = \widetilde{H}_k^{+}\,E_1\,\Gamma$. Finally, the k-th iterate of the block CMRH method is given by

$X_k = X_0 + \mathcal{V}_k\,\widetilde{H}_k^{+}\,E_1\,\Gamma.$

It is clear that as k increases, the number of block vectors that must be stored grows, so the computational and storage requirements increase with each iteration. To remedy

this difficulty, we can use the algorithm iteratively, i.e., we can restart the algorithm every m steps, where m is some fixed integer parameter. This restarted version of block CMRH is denoted by BCMRH(m) and is described below.

ALGORITHM 2: BCMRH(m), the restarted block CMRH method
• Inputs: A an n × n matrix, Y an n × s block vector, m an integer, $X_0$ an initial guess, ε a desired tolerance.
• Step 0. Compute $R_0 = Y - A\,X_0$;
• Step 1. Apply Algorithm 1 to the pair $(A, R_0)$ to get Γ, $\mathcal{V}_m$, and $\widetilde{H}_m$;
• Step 2. Determine $D_m$ as the solution of $\min_{D\in\mathbb{R}^{ms\times s}} \|E_1\,\Gamma - \widetilde{H}_m\,D\|$;
• Step 3. Compute the approximate solution $X_m = X_0 + \mathcal{V}_m\,D_m$; compute $R_m = Y - A\,X_m$;
• Step 4. If $\|R_m\| \le \varepsilon$, stop; else set $X_0 = X_m$, $R_0 = R_m$, and go to Step 1.

Before ending this section, we observe that, instead of imposing the minimal norm condition (3.1), we can use the following orthogonality condition

$\widetilde{R}_k \perp \widetilde{E}_1, \dots, \widetilde{E}_k,$

which defines the block Hessenberg method. In this case (just as in the case of the block FOM method [32, 40]), the block vector $D_k$ is given as the solution of the ks × ks block linear system

$H_k\,D_k = E_1\,\Gamma,$

where $E_1 \in \mathbb{R}^{ks\times s}$ corresponds to the first s columns of the identity matrix $I_{ks}$, meaning that the k-th approximate solution of the block Hessenberg method is

$X_k = X_0 + \mathcal{V}_k\,H_k^{-1}\,E_1\,\Gamma.$

Finally, we observe that we obtain the algorithm of the block Hessenberg method by replacing the instruction given in Step 2 of Algorithm 2 by the following one:

• Step 2. Determine $D_m$ as the solution of $H_m\,D = E_1\,\Gamma$;
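To make these steps concrete, a restarted cycle of BCMRH(m), and of the block Hessenberg variant, can be sketched in Matlab as follows, reusing the hypothetical block_hessenberg helper sketched in Section 2. Here A, Y, X0, m, epsilon, and maxRestarts are assumed given, and the backslash operator plays the role of the pseudo-inverse (for a full-rank $\widetilde{H}_m$) and of $H_m^{-1}$:

    % Restarted BCMRH(m): a compact sketch of Algorithm 2.
    X = X0;  R = Y - A*X;  s = size(Y, 2);
    for restart = 1:maxRestarts
      [V, H, Gamma, ~] = block_hessenberg(A, R, m);     % Step 1
      Dm = H \ [Gamma; zeros(m*s, s)];                  % Step 2: min || E_1 Gamma - H~_m D ||
      % Block Hessenberg (FOM-like) variant of Step 2:
      % Dm = H(1:m*s, :) \ [Gamma; zeros((m-1)*s, s)];
      X  = X + V(:, 1:m*s) * Dm;                        % Step 3
      R  = Y - A*X;
      if norm(R, 'fro') <= epsilon * norm(Y, 'fro')     % Step 4 (relative test, as in the experiments)
        break
      end
    end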

4. The low-rank Sylvester block Hessenberg method. In this section, we are concerned with the solution of the low-rank Sylvester matrix equation (1.3). We show how to apply the block Hessenberg process in order to obtain low-rank approximate solutions to (1.3). Let $\mathcal{V}_m = (V_1, \dots, V_m)$ and $\mathcal{W}_m = (W_1, \dots, W_m)$ be the permuted trapezoidal matrices generated by applying m steps of Algorithm 1 simultaneously to the pairs (A, C) and $(B^T, F)$, respectively. We recall that these matrices satisfy the following relations

(4.1) $A\,\mathcal{V}_m = \mathcal{V}_{m+1}\,\widetilde{H}^A_m = \mathcal{V}_m\,H^A_m + V_{m+1}\,H^A_{m+1,m}\,E_m^T$ and

(4.2) $B^T\,\mathcal{W}_m = \mathcal{W}_{m+1}\,\widetilde{H}^B_m = \mathcal{W}_m\,H^B_m + W_{m+1}\,H^B_{m+1,m}\,E_m^T,$

where $H^A_m = \mathcal{V}^L_m\,A\,\mathcal{V}_m$, $H^B_m = \mathcal{W}^L_m\,B^T\,\mathcal{W}_m$, and $E_m \in \mathbb{R}^{mr\times r}$ is the m-th block column of the identity matrix $I_{mr}$. We observe furthermore that there exist $\Gamma^A, \Gamma^B \in \mathbb{R}^{r\times r}$ such that $V_1$ and $W_1$, the first blocks of $\mathcal{V}_m$ and $\mathcal{W}_m$, are given by the LU decomposition of C and F, i.e.,

$C = V_1\,\Gamma^A \quad \text{and} \quad F = W_1\,\Gamma^B.$


We also have

$\mathcal{V}^L_m\,C = E_1\,\Gamma^A \quad \text{and} \quad \mathcal{W}^L_m\,F = E_1\,\Gamma^B,$

where $E_1$ is the mr × r matrix corresponding to the first r columns of the identity matrix $I_{mr}$. Following the ideas developed in [25, 26], we seek approximate solutions to (1.3) that have the form

(4.3) $X_m = \mathcal{V}_m\,Y_m\,\mathcal{W}_m^T$

and satisfy the Galerkin-type condition

(4.4) $\mathcal{V}^L_m\,R_m\,(\mathcal{W}^L_m)^T = 0,$

where $R_m = A\,X_m\,B - X_m - C\,F^T$ is the residual associated with $X_m$. By multiplying $R_m$ on the left by $\mathcal{V}^L_m$ and on the right by $(\mathcal{W}^L_m)^T$, and using $X_m$ given by (4.3), the Galerkin condition (4.4) can be rewritten as

$0 = \mathcal{V}^L_m\,\big(A\,\mathcal{V}_m\,Y_m\,\mathcal{W}_m^T\,B - \mathcal{V}_m\,Y_m\,\mathcal{W}_m^T - C\,F^T\big)\,(\mathcal{W}^L_m)^T$
$\;\;\; = (\mathcal{V}^L_m\,A\,\mathcal{V}_m)\,Y_m\,(\mathcal{W}^L_m\,B^T\,\mathcal{W}_m)^T - Y_m - (\mathcal{V}^L_m\,C)\,(\mathcal{W}^L_m\,F)^T$
(4.5) $\;\;\; = H^A_m\,Y_m\,(H^B_m)^T - Y_m - (E_1\,\Gamma^A)\,(E_1\,\Gamma^B)^T.$

If $\mu_i(H^A_m)\,\mu_j(H^B_m) \neq 1$ for all $i, j = 1, \dots, mr$, then the unique solution of the reduced discrete Sylvester equation (4.5) can be obtained using direct methods [2, 20]. We observe that the computation of the approximate solution $X_m$ requires products of the three matrices $\mathcal{V}_m$, $Y_m$, $\mathcal{W}_m^T$, and this becomes very expensive when m increases. Moreover, in order to monitor the convergence of $X_m$, we need to compute the residual $R_m$, which requires matrix products with the large matrices A and B. Hence, to avoid these expensive matrix products, we derive an upper bound for the residual norm $\|R_m\|_F$, which allows us to stop the iterations in the low-rank Sylvester block Hessenberg algorithm.

PROPOSITION 4.1. Let $Y_m$ be the exact solution of (4.5) and $X_m = \mathcal{V}_m\,Y_m\,\mathcal{W}_m^T$ the approximate solution to the discrete Sylvester equation (1.3) obtained after m iterations of the block Hessenberg process applied to the pairs (A, C) and $(B^T, F)$. Then, the residual $R_m$ associated with $X_m$ satisfies

(4.6) $\|R_m\|_F \le \sqrt{n\,p}\;(m+1)r\,\sqrt{\alpha^2 + \beta^2 + \gamma^2},$

where $\alpha = \|H^A_m\,Y_m\,E_m\,(H^B_{m+1,m})^T\|_F$, $\beta = \|H^A_{m+1,m}\,E_m^T\,Y_m\,H^B_m\|_F$, and $\gamma = \|H^A_{m+1,m}\,E_m^T\,Y_m\,E_m\,(H^B_{m+1,m})^T\|_F$.

Proof. Since $\mathcal{V}_{m+1}$ and $\mathcal{W}_{m+1}$ can be partitioned respectively as $\mathcal{V}_{m+1} = [\mathcal{V}_m, V_{m+1}]$ and $\mathcal{W}_{m+1} = [\mathcal{W}_m, W_{m+1}]$, using (4.1) and (4.2) we can show that $R_m = \mathcal{V}_{m+1}\,\Omega\,\mathcal{W}_{m+1}^T$, where

$$\Omega = \begin{pmatrix} O_{mr} & H^A_m\,Y_m\,E_m\,(H^B_{m+1,m})^T \\ H^A_{m+1,m}\,E_m^T\,Y_m\,H^B_m & H^A_{m+1,m}\,E_m^T\,Y_m\,E_m\,(H^B_{m+1,m})^T \end{pmatrix}.$$

Therefore, $\|R_m\|_F \le \|\mathcal{V}_{m+1}\|_F\,\|\Omega\|_F\,\|\mathcal{W}_{m+1}\|_F$. Observe that

$\|\Omega\|_F^2 = \|H^A_m\,Y_m\,E_m\,(H^B_{m+1,m})^T\|_F^2 + \|H^A_{m+1,m}\,E_m^T\,Y_m\,H^B_m\|_F^2 + \|H^A_{m+1,m}\,E_m^T\,Y_m\,E_m\,(H^B_{m+1,m})^T\|_F^2.$


Recall that $\mathcal{V}_{m+1} \in \mathbb{R}^{n\times(m+1)r}$ and $\mathcal{W}_{m+1} \in \mathbb{R}^{p\times(m+1)r}$ are obtained by applying permutation matrices and that these matrices are lower trapezoidal with entries $\mathcal{V}_{i,j}$, $\mathcal{W}_{i,j}$, respectively, such that $|\mathcal{V}_{i,j}|, |\mathcal{W}_{i,j}| \le 1$. From this, we deduce that

$\|\mathcal{V}_{m+1}\|_F \le \sqrt{n}\,\sqrt{(m+1)r} \quad \text{and} \quad \|\mathcal{W}_{m+1}\|_F \le \sqrt{p}\,\sqrt{(m+1)r},$

and this yields (4.6).

To reduce the computational cost, we observe that, as $E_m^T = (O_{r\times(m-1)r},\, I_r)$, the quantities α, β, and γ introduced in the previous proposition are given by

$\alpha = \|H^A_m\,(Y_m)_{:,\bar m_r}\,(H^B_{m+1,m})^T\|_F$, $\quad \beta = \|H^A_{m+1,m}\,(Y_m)_{\bar m_r,:}\,H^B_m\|_F$, $\quad \gamma = \|H^A_{m+1,m}\,(Y_m)_{\bar m_r,\bar m_r}\,(H^B_{m+1,m})^T\|_F$, where $\bar m_r = (m-1)r+1 : mr$.

Our experiments suggest that the upper bound (4.6) is a pessimistic one and that it is more appropriate to use the following estimate, which we derived heuristically:

(4.7) $\|R_m\|_F \le \sqrt{\max(n,p)}\;m\,\sqrt{\alpha^2 + \beta^2 + \gamma^2} =: r^h_m.$
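In Matlab-like notation, with HA and HB standing for $\widetilde{H}^A_m$ and $\widetilde{H}^B_m$ and Ym for the solution of (4.5) (all variable names here are ours, not the paper's), these quantities and the estimate (4.7) could be evaluated as follows:

    % Cheap evaluation of alpha, beta, gamma and of the heuristic estimate (4.7);
    % HA, HB are assumed to be of size (m+1)r x mr, Ym of size mr x mr.
    idx   = (m-1)*r+1 : m*r;                               % the index range \bar{m}_r
    HAm   = HA(1:m*r, :);   HA21 = HA(m*r+1:end, idx);     % H^A_m and H^A_{m+1,m}
    HBm   = HB(1:m*r, :);   HB21 = HB(m*r+1:end, idx);     % H^B_m and H^B_{m+1,m}
    alpha = norm(HAm  * Ym(:,  idx) * HB21', 'fro');
    beta  = norm(HA21 * Ym(idx, :  ) * HBm , 'fro');
    gamma = norm(HA21 * Ym(idx, idx) * HB21', 'fro');
    rmh   = sqrt(max(n, p)) * m * sqrt(alpha^2 + beta^2 + gamma^2);   % r_m^h of (4.7)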

Finally, it is possible to obtain $X_m$ as a product of two low-rank matrices. We proceed as suggested in [37, 23] by computing the singular value decomposition of $Y_m$, i.e., $Y_m = \widetilde{V}\,\Sigma\,\widetilde{W}^T$, where $\Sigma = \mathrm{diag}[\sigma_1, \sigma_2, \dots, \sigma_{mr}]$ is the diagonal matrix of the singular values of $Y_m$ sorted in decreasing order. Now, let $\widetilde{V}_l$ and $\widetilde{W}_l$ be the mr × l matrices of the first l columns of $\widetilde{V}$ and $\widetilde{W}$ corresponding respectively to the l singular values of magnitude greater than some tolerance τ. More precisely, let l be the integer such that $\sigma_{l+1} \le \tau < \sigma_l$; then $Y_m \approx \widetilde{V}_l\,\Sigma_l\,\widetilde{W}_l^T$, where $\Sigma_l = \mathrm{diag}[\sigma_1, \dots, \sigma_l]$. Finally, setting $Z^A_m = \mathcal{V}_m\,\widetilde{V}_l\,\Sigma_l^{1/2}$ and $Z^B_m = \mathcal{W}_m\,\widetilde{W}_l\,\Sigma_l^{1/2}$, it follows that

$X_m \approx Z^A_m\,(Z^B_m)^T.$

The algorithm of the low-rank Sylvester block Hessenberg method is summarized below.

ALGORITHM 4: The Low-Rank Sylvester Block Hessenberg method (LRS-BH)
• Inputs: A an n × n matrix, B a p × p matrix, C an n × r matrix, and F a p × r matrix.
• Step 0. Choose a tolerance ε > 0, a maximum number of iterations $m_{\max}$, a step-size k, and the tolerance τ for the truncated SVD.
• Step 1. For m = 1, 2, ..., $m_{\max}$
   • Step 1.1. Update $\mathcal{V}_m$, $\mathcal{W}_m$, $\widetilde{H}^A_m$, and $\widetilde{H}^B_m$ by applying the m-th step of Algorithm 1 to the pairs (A, C) and $(B^T, F)$, respectively.
   • Step 1.2. If m is a multiple of k, solve (4.5) and compute $r^h_m$ given by (4.7);
   • Step 1.3. If $r^h_m < \varepsilon$, go to Step 2; end(If).
   end(If). end(For).
• Step 2. Compute the SVD of $Y_m$, i.e., $Y_m = \widetilde{V}\,\Sigma\,\widetilde{W}^T$, where $\Sigma = \mathrm{diag}[\sigma_1, \dots, \sigma_{mr}]$ and $\sigma_1 \ge \dots \ge \sigma_{mr}$;


  Find l such that $\sigma_{l+1} \le \tau < \sigma_l$ and let $\Sigma_l = \mathrm{diag}[\sigma_1, \dots, \sigma_l]$;
  Form $Z^A_m = \mathcal{V}_m\,\widetilde{V}_l\,\Sigma_l^{1/2}$ and $Z^B_m = \mathcal{W}_m\,\widetilde{W}_l\,\Sigma_l^{1/2}$;
• Step 3. The approximate solution $X_m$ is given by $X_m \approx Z^A_m\,(Z^B_m)^T$.
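A possible Matlab realization of Steps 1.2 and 2, for modest values of mr, is sketched below. The Kronecker formulation of (4.5) is only one option (a dedicated small-scale solver, e.g. dlyap where available, could be substituted), and all variable names (HAm, HBm, E1GA, E1GB, Vm, Wm, tau) are ours: HAm and HBm are the mr × mr projected matrices, E1GA = $E_1\Gamma^A$ and E1GB = $E_1\Gamma^B$ (mr × r), and Vm, Wm are the bases $\mathcal{V}_m$, $\mathcal{W}_m$.

    % Step 1.2: solve the reduced equation (4.5), HAm*Ym*HBm' - Ym = E1GA*E1GB',
    % via vec():  (kron(HBm, HAm) - I) vec(Ym) = vec(E1GA*E1GB').
    q   = size(HAm, 1);                            % q = m*r
    RHS = E1GA * E1GB';
    y   = (kron(HBm, HAm) - eye(q^2)) \ RHS(:);
    Ym  = reshape(y, q, q);

    % Step 2: truncated SVD of Ym and low-rank factors of X_m.
    [Vt, S, Wt] = svd(Ym);
    l  = sum(diag(S) > tau);                       % keep singular values above tau
    Sl = S(1:l, 1:l);
    ZA = Vm * Vt(:, 1:l) * sqrt(Sl);               % Z^A_m
    ZB = Wm * Wt(:, 1:l) * sqrt(Sl);               % Z^B_m, so that X_m ~ ZA*ZB'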

5. Numerical experiments. We present the following numerical experiments to illustrate the behavior and performance of the proposed methods. The different algorithms have been implemented in Matlab 7.9 and executed on a computer with an Intel Pentium-4 3.4 GHz processor and 2028 MBytes of RAM. The machine precision was $2.22 \times 10^{-16}$.

Example 1. In this example, we provide some experimental results of using the restarted block CMRH(m) method applied to (1.2) and compare its performance to that of restarted block GMRES(m). In all examples, the starting guess was taken to be $X_0 = 0$, and a maximum number of 301 restarts was allowed for each algorithm. In all experiments, the right-hand sides of the different systems are such that $Y = A\,X^*$, where $X^*$ is generated randomly with coefficients uniformly distributed in [0, 1]. This choice enables us to compare the error norm given by $e = \|X^* - X_m\|_F$. In all examples in this set of experiments, we took $\varepsilon = 10^{-10}$, and the tests were stopped as soon as the residual norm satisfies $\|R_m\| = \|Y - A\,X_m\|_2 \le \varepsilon\,\|Y\|_2$.

Before describing the examples used to show the efficiency of the proposed methods, we list below some properties of the matrices that are used in Examples 1.1, 1.3, and 2.2. These matrices come from the Matrix Market web collection, which is a visual repository of test data for use in comparative studies of algorithms for numerical linear algebra [29]. All these matrices are real and nonsymmetric.
• rdb3200l. This matrix comes from a computational fluid dynamics problem. It is of size n = 3200 with a symmetric nonzero pattern and nnz = 18880 nonzero entries. The average number of nonzeros per row and column is 5.9. The estimated condition number is $2.71 \times 10^3$.
• add32. This matrix comes from a circuit simulation problem. It is real of size n = 4960 with a symmetric nonzero pattern and nnz = 19848 nonzero entries. The average number of nonzeros per row and column is 4. The estimated condition number is $2.14 \times 10^2$.
• appu. This matrix is a random matrix used in a set of benchmark examples from the NASA Ames research center. It comes from a directed weighted random graph and is of size n = 14000 with a symmetric nonzero pattern and nnz = 1853104 nonzero entries. The estimated condition number is $1.71 \times 10^2$.
• memplus. This matrix comes from a memory circuit problem. It is of size n = 17758 with a symmetric nonzero pattern and nnz = 126150 nonzero entries. The average number of nonzeros per row and column is 5.6. The estimated condition number is $1.29 \times 10^5$.
• psmigr_3. This matrix comes from a population migration problem. It is of size n = 3140 and has nnz = 543160 nonzero entries. The average number of nonzeros per row and column is $1.7 \times 10^2$. The estimated condition number is $1.00 \times 10^2$.
• pde2961. This matrix comes from a partial differential equation problem. It is of size n = 2961 with a symmetric nonzero pattern and nnz = 14585 nonzero entries. The average number of nonzeros per row and column is 4.9. The estimated condition number is $9.49 \times 10^2$.

Example 1.1. In this first set of experiments, we report the results obtained with four matrices: rdb3200l, add32, appu, and memplus. The results are summarized in Table 5.1. For the matrix memplus, we also compared the behavior of the preconditioned block CMRH


(BC) and block GMRES (BG) algorithms. The results obtained with an ILU0 preconditioner [32] are listed in Table 5.2.

TABLE 5.1
Example 1.1. Results obtained with the Matrix-Market matrices rdb3200l, add32, appu, and memplus.

A | m, s | Method | # restarts | err. norm | res. norm | CPU time
rdb3200l (n = 3200, nnz = 18880) | m = 10, s = 5 | BC | 301 | 4.24 × 10^-6 | 1.37 × 10^-8 | 29.93
rdb3200l | m = 10, s = 5 | BG | 137 | 3.71 × 10^-7 | 9.37 × 10^-8 | 20.35
rdb3200l | m = 10, s = 10 | BC | 301 | 2.57 × 10^-5 | 7.5 × 10^-6 | 48.29
rdb3200l | m = 10, s = 10 | BG | 145 | 5.06 × 10^-7 | 1.15 × 10^-7 | 51.76
rdb3200l | m = 20, s = 5 | BC | 59 | 2.92 × 10^-7 | 7.77 × 10^-8 | 4.28
rdb3200l | m = 20, s = 5 | BG | 67 | 3.96 × 10^-7 | 8.72 × 10^-8 | 6.26
rdb3200l | m = 20, s = 10 | BC | 50 | 3.25 × 10^-7 | 8.04 × 10^-8 | 12.67
rdb3200l | m = 20, s = 10 | BG | 47 | 6.32 × 10^-7 | 1.05 × 10^-7 | 19.20
add32 (n = 4960, nnz = 23884) | m = 10, s = 2 | BC | 15 | 1.24 × 10^-8 | 1.48 × 10^-11 | 0.31
add32 | m = 10, s = 2 | BG | 16 | 4.28 × 10^-8 | 2.39 × 10^-11 | 0.34
add32 | m = 10, s = 5 | BC | 18 | 6.13 × 10^-8 | 6.97 × 10^-11 | 1.06
add32 | m = 10, s = 5 | BG | 16 | 5.70 × 10^-8 | 3.14 × 10^-11 | 1.07
add32 | m = 10, s = 20 | BC | 23 | 6.67 × 10^-8 | 9.18 × 10^-11 | 12.84
add32 | m = 10, s = 20 | BG | 16 | 8.88 × 10^-8 | 4.91 × 10^-11 | 14.31
add32 | m = 10, s = 40 | BC | 25 | 7.67 × 10^-8 | 1.33 × 10^-10 | 35.09
add32 | m = 10, s = 40 | BG | 16 | 1.22 × 10^-7 | 6.89 × 10^-11 | 45.98
appu (n = 14000, nnz = 1853104) | m = 5, s = 5 | BC | 38 | 1.22 × 10^-6 | 1.53 × 10^-7 | 11.62
appu | m = 5, s = 5 | BG | 45 | 1.47 × 10^-6 | 1.70 × 10^-7 | 15.31
appu | m = 5, s = 10 | BC | 48 | 1.50 × 10^-6 | 1.76 × 10^-7 | 26.46
appu | m = 5, s = 10 | BG | 44 | 1.97 × 10^-6 | 2.20 × 10^-7 | 29.64
appu | m = 10, s = 5 | BC | 14 | 8.26 × 10^-7 | 9.76 × 10^-8 | 10.39
appu | m = 10, s = 5 | BG | 14 | 4.40 × 10^-7 | 6.25 × 10^-8 | 12.17
appu | m = 10, s = 10 | BC | 15 | 6.90 × 10^-7 | 1.07 × 10^-7 | 26.87
appu | m = 10, s = 10 | BG | 13 | 1.57 × 10^-6 | 2.29 × 10^-7 | 32.53
memplus (n = 17758, nnz = 126150) | m = 20, s = 5 | BC | 290 | 1.38 × 10^-2 | 3.26 × 10^-7 | 736.48
memplus | m = 20, s = 5 | BG | 145 | 2.02 × 10^-2 | 3.27 × 10^-7 | 516.03
memplus | m = 20, s = 10 | BC | 301 | 8.19 × 10^-2 | 3.17 × 10^-6 | 1680.91
memplus | m = 20, s = 10 | BG | 301 | 2.47 × 10^-4 | 2.96 × 10^-9 | 2758.09
memplus | m = 40, s = 5 | BC | 118 | 6.81 × 10^-6 | 3.24 × 10^-10 | 1076.89
memplus | m = 40, s = 5 | BG | 87 | 2.32 × 10^-5 | 2.97 × 10^-10 | 1123.91
memplus | m = 40, s = 10 | BC | 130 | 5.20 × 10^-6 | 3.22 × 10^-10 | 2504.11
memplus | m = 40, s = 10 | BG | 82 | 3.31 × 10^-5 | 4.11 × 10^-10 | 2515.09

As indicated in the tables of results, and except for a few cases, both methods are able to obtain approximate solutions that satisfy the stopping criterion. In some instances, we also

note that even if the block CMRH method needs more restarts than the block GMRES method, the CPU time is in favor of the block Hessenberg-based method. Indeed, the proposed block Hessenberg process is cheaper than the block Arnoldi process.

TABLE 5.2
Example 1.1. Results obtained for the matrix memplus with ILU0 preconditioning.

A | m, s | Method | # restarts | err. norm | res. norm | CPU time
memplus (n = 17758, nnz = 126150) | m = 20, s = 5 | BC | 29 | 4.44 × 10^-3 | 2.45 × 10^-7 | 87.57
memplus | m = 20, s = 5 | BG | 18 | 1.88 × 10^-2 | 3.20 × 10^-7 | 71.37
memplus | m = 20, s = 10 | BC | 51 | 4.16 × 10^-6 | 3.15 × 10^-10 | 326.22
memplus | m = 20, s = 10 | BG | 35 | 1.81 × 10^-5 | 3.07 × 10^-10 | 325.58
memplus | m = 40, s = 5 | BC | 14 | 1.37 × 10^-6 | 6.22 × 10^-11 | 132.06
memplus | m = 40, s = 5 | BG | 11 | 3.01 × 10^-6 | 7.72 × 10^-11 | 147.02
memplus | m = 40, s = 10 | BC | 14 | 1.79 × 10^-6 | 7.80 × 10^-11 | 287.80
memplus | m = 40, s = 10 | BG | 10 | 5.13 × 10^-6 | 9.97 × 10^-11 | 316.16

Example 1.2. In this second set of experiments, the matrix A is obtained from the centered finite difference discretization of the operator

$L_A(u) = \Delta u - (x + y^2)\,\frac{\partial u}{\partial x} - (y - x^2)\,\frac{\partial u}{\partial y} - \sqrt{x^2 + y^2}\,u$

on the unit square [0, 1] × [0, 1] with homogeneous Dirichlet boundary conditions. The number of inner grid points in each direction is $n_0$; therefore, the dimension of the matrix A is $n = n_0^2$. The results reported in Table 5.3 are those obtained for $n_0 = 150$ (i.e., n = 22500) and for different values of m and s.
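For reference, one way to assemble this discretization in Matlab (our own sketch; the paper does not specify its assembly, and we assume a lexicographic ordering of the grid with x varying fastest) is via Kronecker products of one-dimensional difference matrices:

    % Sketch: centered finite-difference discretization of L_A on [0,1]^2 with
    % homogeneous Dirichlet conditions and n0 interior points per direction.
    n0 = 150;  h = 1/(n0+1);
    xg = (1:n0)'*h;                                          % interior grid points
    e  = ones(n0, 1);
    D2 = spdiags([e -2*e e], -1:1, n0, n0)/h^2;              % 1D second derivative
    D1 = spdiags([-e, zeros(n0,1), e], -1:1, n0, n0)/(2*h);  % 1D centered first derivative
    I  = speye(n0);
    [XX, YY] = ndgrid(xg, xg);                               % XX(:) varies fastest in x
    N  = n0^2;
    A  = kron(I, D2) + kron(D2, I) ...                               % Laplacian
       - spdiags(XX(:)+YY(:).^2,            0, N, N) * kron(I, D1) ...  % -(x+y^2) u_x
       - spdiags(YY(:)-XX(:).^2,            0, N, N) * kron(D1, I) ...  % -(y-x^2) u_y
       - spdiags(sqrt(XX(:).^2+YY(:).^2),   0, N, N);                   % -sqrt(x^2+y^2) u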

TABLE 5.3
Example 1.2. Results obtained with the matrix A generated by discretizing the operator $L_A$.

m, s | Method | # restarts | err. norm | res. norm | CPU time
m = 40, s = 5 | BC | 65 | 1.74 × 10^-5 | 3.98 × 10^-4 | 509.16
m = 40, s = 5 | BG | 49 | 1.51 × 10^-5 | 3.32 × 10^-4 | 756.16
m = 80, s = 5 | BC | 19 | 1.38 × 10^-5 | 4.17 × 10^-4 | 378.06
m = 80, s = 5 | BG | 13 | 1.65 × 10^-5 | 3.64 × 10^-4 | 380.72
m = 40, s = 10 | BC | 87 | 1.77 × 10^-5 | 4.29 × 10^-4 | 2539.91
m = 40, s = 10 | BG | 49 | 1.63 × 10^-5 | 3.51 × 10^-4 | 2292.31
m = 80, s = 10 | BC | 19 | 7.89 × 10^-6 | 2.48 × 10^-4 | 7353.39
m = 80, s = 10 | BG | 10 | 1.61 × 10^-6 | 9.94 × 10^-5 | 6277.01

The results show that the block CMRH method gives better results than block GMRES when the number of right-hand sides s is small. However, for relatively large values of s, the block Arnoldi-based method is less time-consuming than the block Hessenberg-based method.

Example 1.3. In Table 5.4, we compare the proposed method with matrix Krylov subspace methods [22, 27], and we report the results of applying the global CMRH(m) (GC) and global GMRES(m) (GG) methods to multiple linear systems with the matrices appu and psmigr_3.


TABLE 5.4
Example 1.3. Results obtained when comparing with matrix Krylov subspace methods.

A | m, s | Method | # restarts | err. norm | res. norm | CPU time
appu (n = 14000, nnz = 1853104) | m = 5, s = 5 | BC | 41 | 1.18 × 10^-6 | 1.32 × 10^-7 | 15.49
appu | m = 5, s = 5 | BG | 45 | 1.41 × 10^-6 | 1.63 × 10^-7 | 21.84
appu | m = 5, s = 5 | GC | 40 | 6.47 × 10^-7 | 1.18 × 10^-7 | 21.54
appu | m = 5, s = 5 | GG | 48 | 9.53 × 10^-7 | 1.09 × 10^-7 | 29.91
psmigr_3 (n = 3140, nnz = 543162) | m = 10, s = 10 | BC | 5 | 1.02 × 10^-8 | 2.41 × 10^-9 | 1.53
psmigr_3 | m = 10, s = 10 | BG | 4 | 4.84 × 10^-8 | 3.68 × 10^-9 | 1.39
psmigr_3 | m = 10, s = 10 | GC | 14 | 4.15 × 10^-8 | 4.94 × 10^-9 | 3.51
psmigr_3 | m = 10, s = 10 | GG | 15 | 6.96 × 10^-8 | 3.19 × 10^-9 | 4.17

Example 2. We report experimental results obtained by comparing the performance of the Low-Rank Sylvester Block Hessenberg method (LRSBH) given by Algorithm 4, the Low-Rank Sylvester Block Arnoldi method (LRSBA), and the Low-Rank Sylvester Global Arnoldi method (LRSGA) described in [17, 26]. In all the examples, the starting guess was taken to be $X_0 = 0$, the maximum size of the constructed Krylov subspaces was 150, and the matrices C and F of the Sylvester matrix equations were generated randomly with coefficients uniformly distributed in [0, 1]. All iterations were stopped as soon as the heuristic residual norm $r^h_m$, given by (4.7) for the LRSBH algorithm, the residual norm $r^{ba}_m$ (for the LRSBA algorithm), or the upper bound $r^{ga}_m$ (for the LRSGA algorithm) was less than $\varepsilon\,\|C\,F^T\|_2$ with $\varepsilon = 10^{-9}$.

Example 2.1. We compare the sharpness of the upper bounds (4.6) and (4.7) by plotting the exact residual norm $\|R_m\|_F = \|A\,X_m\,B - X_m - C\,F^T\|_F$. The four plots correspond to the results obtained when solving the discrete Sylvester equation (1.3) with matrices A and B obtained from the centered finite difference discretization of the operators

$L_A(u) = \Delta u - x\sqrt{1 + y^2}\,\frac{\partial u}{\partial x} - (1 + y^2)\,\frac{\partial u}{\partial y} - (1 + x\,y)\,u,$
$L_B(u) = \Delta u - (y - x)\,\frac{\partial u}{\partial x} - \sqrt{x^2 + y^2}\,\frac{\partial u}{\partial y} - (x^2 - y^2)\,u,$

on the unit square [0, 1] × [0, 1] with homogeneous Dirichlet boundary conditions. The number of inner grid points in each direction was $n_0$ for the operator $L_A$ and $p_0$ for the operator $L_B$. Hence, the dimensions of the matrices A and B are $n = n_0^2$ and $p = p_0^2$. The size of the matrices and the rank of the matrices C, F are indicated in the title of each plot. As seen from Figure 5.1, and as indicated in the previous section, the theoretical upper bound (4.6) is pessimistic, and the heuristic upper bound (4.7) seems to be more accurate than the first one.

Example 2.2. In this experiment, we report in Table 5.5 the results obtained for different pairs of matrices $(\tilde{A}, \tilde{B})$ coming from the Matrix Market repository [29], and we compare the performance of the LRSBH, LRSBA, and LRSGA methods. To ensure existence and uniqueness of the solution of the tested discrete Sylvester equations, we take

$A = \tilde{A}/\|\tilde{A}\|_1 \quad \text{and} \quad B = \tilde{B}/\|\tilde{B}\|_1.$

We also note that for these numerical tests, the low-order Sylvester equations are solved every k iterations, and we took $\tau = 10^{-12}$ in Step 2 of Algorithm 4.

TABLE 5.5
Results for Example 2.2 with matrices from the Matrix-Market collection.

Test problem | Method | Iter. | $r_m/\|C F^T\|_2$ | CPU-Time
Ã = appu (n = 14000), B̃ = −pde2961 (p = 2961), r = 3, k = 1 | LRSBH | 9 | 1.46 × 10^-6 | 1.01
  | LRSBA | 8 | 3.55 × 10^-6 | 1.44
  | LRSGA | 9 | 5.54 × 10^-7 | 1.30
Ã = appu (n = 14000), B̃ = −pde2961 (p = 2961), r = 10, k = 5 | LRSBH | 10 | 2.23 × 10^-7 | 4.92
  | LRSBA | 10 | 2.66 × 10^-8 | 6.52
  | LRSGA | 10 | 1.61 × 10^-7 | 5.70
Ã = psmigr_3 (n = 3140), B̃ = −add32 (p = 4960), r = 5, k = 2 | LRSBH | 12 | 7.33 × 10^-8 | 0.96
  | LRSBA | 10 | 2.12 × 10^-6 | 1.11
  | LRSGA | 10 | 3.83 × 10^-6 | 0.95
Ã = psmigr_3 (n = 3140), B̃ = −add32 (p = 4960), r = 5, k = 5 | LRSBH | 10 | 4.49 × 10^-6 | 0.73
  | LRSBA | 10 | 2.05 × 10^-6 | 0.98
  | LRSGA | 10 | 3.75 × 10^-6 | 0.79
Ã = psmigr_3 (n = 3140), B̃ = −appu (p = 14000), r = 5, k = 2 | LRSBH | 10 | 9.61 × 10^-8 | 0.89
  | LRSBA | 8 | 3.74 × 10^-6 | 0.92
  | LRSGA | 12 | 3.52 × 10^-7 | 1.13
Ã = psmigr_3 (n = 3140), B̃ = −add32 (p = 14000), r = 10, k = 2 | LRSBH | 8 | 1.29 × 10^-5 | 1.69
  | LRSBA | 8 | 2.68 × 10^-6 | 2.45
  | LRSGA | 10 | 7.29 × 10^-7 | 2.66


[Figure 5.1 here: four panels, titled n=900, p=900, r=2, k=1; n=900, p=900, r=5, k=1; n=2500, p=400, r=3, k=1; and n=900, p=3600, r=7, k=1, each plotting the log10 of the residual norm against the iteration number for the true residual, the first upper bound (4.6), and the second upper bound (4.7).]

FIG. 5.1. Comparison of the upper bounds (4.6) and (4.7) with the true residual norm.

Acknowledgments. The authors thank the anonymous reviewers for their comments. The authors would like to express their gratitude to Laura Dykes for her careful reading of the paper and for her helpful comments and suggestions which helped improve the presentation of this paper.

REFERENCES

[1] A. ALIA, H. SADOK, AND M. SOULI, CMRH method as iterative solver for boundary element acoustic systems, Eng. Anal. Bound. Elem., 36 (2012), pp. 346–350.
[2] R. H. BARTELS AND G. W. STEWART, Algorithm 432: Solution of the matrix equation AX + XB = C, Comm. ACM, 15 (1972), pp. 820–826.
[3] A. BASERMANN, B. REICHEL, AND C. SCHELTHOFF, Preconditioned CG methods for sparse matrices on massively parallel machines, Parallel Computing, 23 (1997), pp. 381–393.
[4] P. BENNER, Factorized solution of Sylvester equations with applications in control, in Proceedings of the 16th International Symposium on Mathematical Theory of Networks and Systems MTNS 2004, V. Blondel, P. Van Dooren, J. Willems, B. Motmans, and B. De Moor, eds., University of Leuven, Leuven, Belgium, 2004 (10 pages).
[5] J. C. R. BLOCH AND S. HEYBROCK, A nested Krylov subspace method to compute the sign function of large complex matrices, Comput. Phys. Commun., 182 (2011), pp. 878–889.



[6] J. BOLZ, I. FARMER, E. GRINSPUN, AND P. SCHRÖDER, Sparse matrix solvers on the GPU: conjugate gradients and multigrid, ACM Trans. Graphics, 22 (2003), pp. 917–924.
[7] H. CALANDRA, S. GRATTON, J. LANGOU, X. PINEL, AND X. VASSEUR, Flexible variants of block restarted GMRES methods with application to geophysics, SIAM J. Sci. Comput., 34 (2012), pp. A714–A736.
[8] D. CALVETTI AND L. REICHEL, Application of ADI iterative methods to the restoration of noisy images, SIAM J. Matrix Anal. Appl., 17 (1996), pp. 165–186.
[9] R. W. CLOUGH AND J. PENZIEN, Dynamics of Structures, McGraw-Hill, 1975.
[10] R. D. DA CUNHA AND D. BECKER, Dynamic block GMRES: an iterative method for block linear systems, Adv. Comput. Math., 27 (2007), pp. 423–448.
[11] H. DAI, Two algorithms for symmetric linear systems with multiple right-hand sides, Numer. Math. J. Chin. Univ. (English Ser.), 9 (2000), pp. 91–110.
[12] B. N. DATTA, Numerical Methods for Linear Control Systems. Design and Analysis, Elsevier, San Diego, 2004.
[13] B. N. DATTA AND K. DATTA, Theoretical and computational aspects of some linear algebra problems in control theory, in Computational and Combinatorial Methods in Systems Theory (Stockholm, 1985), C. I. Byrnes and A. Lindquist, eds., North-Holland, Amsterdam, 1986, pp. 201–212.
[14] S. DUMINIL, A parallel implementation of the CMRH method for dense linear systems, Numer. Algorithms, 63 (2013), pp. 127–142.
[15] S. DUMINIL, M. HEYOUNI, P. MARION, AND H. SADOK, Algorithms for the CMRH method for dense linear systems, Numer. Algorithms, 71 (2016), pp. 383–394.
[16] L. ELBOUYAHYAOUI, A. MESSAOUDI, AND H. SADOK, Algebraic properties of the block GMRES and block Arnoldi methods, Electron. Trans. Numer. Anal., 33 (2008/09), pp. 207–220. http://etna.ricam.oeaw.ac.at/vol.33.2008-2009/pp207-220.dir/pp207-220.pdf
[17] A. EL GUENNOUNI, K. JBILOU, AND A. J. RIQUET, Block Krylov subspace methods for solving large Sylvester equations, Numer. Algorithms, 29 (2002), pp. 75–96.
[18] A. EL GUENNOUNI, K. JBILOU, AND H. SADOK, A block version of BICGSTAB for linear systems with multiple right-hand sides, Electron. Trans. Numer. Anal., 16 (2003), pp. 129–142. http://etna.ricam.oeaw.ac.at/vol.16.2003/pp129-142.dir/pp129-142.pdf


[19] R. FREUND AND M. MALHOTRA, A block-QMR algorithm for non-Hermitian linear systems with multiple right-hand sides, Linear Algebra Appl., 254 (1997), pp. 119–157.
[20] G. H. GOLUB, S. NASH, AND C. VAN LOAN, A Hessenberg-Schur method for the problem AX + XB = C, IEEE Trans. Automat. Control, 24 (1979), pp. 909–913.
[21] M. HEYOUNI, Méthode de Hessenberg Généralisée et Applications, Ph.D. Thesis, Université des Sciences et Technologies de Lille, Lille, 1996.
[22] M. HEYOUNI, The global Hessenberg and global CMRH methods for linear systems with multiple right-hand sides, Numer. Algorithms, 26 (2001), pp. 317–332.
[23] M. HEYOUNI, Extended Arnoldi methods for large low-rank Sylvester matrix equations, Appl. Numer. Math., 60 (2010), pp. 1171–1182.
[24] M. HEYOUNI AND H. SADOK, A new implementation of the CMRH method for solving dense linear systems, J. Comput. Appl. Math., 213 (2008), pp. 387–399.
[25] I. M. JAIMOUKHA AND E. M. KASENALLY, Krylov subspace methods for solving large Lyapunov equations, SIAM J. Numer. Anal., 31 (1994), pp. 227–251.
[26] K. JBILOU, Low rank approximate solutions to large Sylvester matrix equations, Appl. Math. Comput., 177 (2006), pp. 365–376.
[27] K. JBILOU, A. MESSAOUDI, AND H. SADOK, Global FOM and GMRES algorithms for matrix equations, Appl. Numer. Math., 31 (1999), pp. 49–63.
[28] H. L. LIU AND B. J. ZHONG, Simpler block GMRES for nonsymmetric systems with multiple right-hand sides, Electron. Trans. Numer. Anal., 30 (2008), pp. 1–9. http://etna.ricam.oeaw.ac.at/vol.30.2008/pp1-9.dir/pp1-9.pdf
[29] MATRIX MARKET WEB COLLECTION, http://math.nist.gov/MatrixMarket/.
[30] A. NIKISHIN AND A. YEREMIN, Variable block CG algorithms for solving large sparse symmetric positive definite linear systems on parallel computers. I: General iterative scheme, SIAM J. Matrix Anal. Appl., 16 (1995), pp. 1135–1153.
[31] D. P. O'LEARY, The block conjugate gradient algorithm and related methods, Linear Algebra Appl., 29 (1980), pp. 293–322.
[32] Y. SAAD, Iterative Methods for Sparse Linear Systems, PWS Publishing, New York, 1995.
[33] Y. SAAD AND M. H. SCHULTZ, GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems, SIAM J. Sci. Statist. Comput., 7 (1986), pp. 856–869.
[34] H. SADOK, CMRH: a new method for solving nonsymmetric linear systems based on the Hessenberg reduction algorithm, Numer. Algorithms, 20 (1999), pp. 303–321.
[35] H. SADOK AND D. B. SZYLD, A new look at CMRH and its relation to GMRES, BIT, 52 (2012), pp. 485–501.
[36] T. SAKURAI, H. TADANO, AND Y. KURAMASHI, Application of block Krylov subspace algorithms to the Wilson-Dirac equation with multiple right-hand sides in lattice QCD, Comput. Phys. Comm., 181 (2010), pp. 113–117.
[37] V. SIMONCINI, A new iterative method for solving large-scale Lyapunov matrix equations, SIAM J. Sci. Comput., 29 (2007), pp. 1268–1288.
[38] V. SIMONCINI AND E. GALLOPOULOS, Convergence properties of block GMRES and matrix polynomials, Linear Algebra Appl., 247 (1996), pp. 97–119.
[39] J. VIRIEUX AND S. OPERTO, An overview of full waveform inversion in exploration geophysics, Geophysics, 74 (2009), pp. WCC127–WCC152.
[40] B. VITAL, Etude de quelques méthodes de résolution de problèmes linéaires de grande taille sur multiprocesseur, Ph.D. Thesis, Université de Rennes, Rennes, 1990.
[41] J. H. WILKINSON, The Algebraic Eigenvalue Problem, Clarendon Press, Oxford, 1988.