SIAM J. MATRIX ANAL. APPL. c 2016 Society for Industrial and Applied Mathematics Vol. 37, No. 1, pp. 195–214
STABILITY ANALYSIS OF THE TWO-LEVEL ORTHOGONAL ARNOLDI PROCEDURE∗
DING LU†, YANGFENG SU†, AND ZHAOJUN BAI‡
Abstract. The second-order Arnoldi (SOAR) procedure is an algorithm for computing an orthonormal basis of the second-order Krylov subspace. It has found applications in solving quadratic eigenvalue problems and model order reduction of second-order dynamical systems among others. Unfortunately, the SOAR procedure can be numerically unstable. The two-level orthogonal Arnoldi (TOAR) procedure has been proposed as an alternative to SOAR to cure the numerical instability. In this paper, we provide a rigorous stability analysis of the TOAR procedure. We prove that under mild assumptions, the TOAR procedure is backward stable in computing an orthonormal basis of the associated linear Krylov subspace. The benefit of the backward stability of TOAR is demonstrated by its high accuracy in structure-preserving model order reduction of second-order dynamical systems.
Key words. second-order Krylov subspace, second-order Arnoldi procedure, backward stability, model order reduction, dynamical systems
AMS subject classifications. 65F15, 65F30, 65P99
DOI. 10.1137/151005142
1. Introduction. The second-order Krylov subspace induced by a pair of ma- trices and one or two vectors is a generalization of the well-known (linear) Krylov subspace based on a matrix and a vector. An orthonormal basis matrix Qk of the second-order Krylov subspace can be generated by a second-order Arnoldi (SOAR) procedure [3]. The SOAR procedure has found applications in solving quadratic eigenvalue problems [33, 38] and model order reduction of second-order dynamical systems [2, 6] and structural acoustic analysis [25, 26]. It is implemented in Omega3P for electromagnetic modeling of particle accelerators [18, 19], and in MOR4ANSYS for the model order reduction of ANSYS engineering models [28]. It has been known that SOAR is prone to numerical instability due to the fact that it involves solving potentially ill-conditioned triangular linear systems and im- plicitly generates a nonorthonormal basis matrix Vk of an associated (linear) Krylov subspace (see examples in section 5). The instability issue has drawn the attention of researchers since the SOAR procedure was proposed. For quadratic eigenvalue problems, Zhu [40] exploited the relations between the second-order Krylov subspace and the associated linear Krylov subspace described in [3] and proposed to represent the basis matrix Vk of the linear Krylov subspace by the product of two orthonormal matrices Qk and Uk,whereQk is an orthonormal basis of the second-order Krylov subspace and Uk is computed to maintain the orthonormality of Vk.Thesameidea
∗Received by the editors January 23, 2015; accepted for publication (in revised form) by T. Stykel December 11, 2015; published electronically February 16, 2016. http://www.siam.org/journals/simax/37-1/100514.html †School of Mathematical Sciences, Fudan University, Shanghai 200433, China ([email protected], [email protected]). Part of this work was done while the first author was visiting the University of California, Davis, supported by China Scholarship Council. The research of the second author was supported in part by the Innovation Program of Shanghai Mu- nicipal Education Commission 13zz007, E-Institutes of Shanghai Municipal Education Commission N.E303004, and NSFC key project 91330201. ‡Department of Computer Science and Department of Mathematics, University of California, Davis, CA 95616, USA ([email protected]). The research of the this author was supported in part by the NSF grants DMS-1522697 and CCF-1527091. 195 196 D. LU, Y. SU, AND Z. BAI
was also independently studied about the same time and presented in [34] for the purposes of curing the numerical instability and finding a memory-efficient represen- tation of the basis matrix Vk in the context of using ARPACK [20] to solve high-order polynomial eigenvalue problems. The term “two-level orthogonal Arnoldi procedure”, TOAR in short, was coined in [34]. Recently, the notion of memory-efficient represen- tations of the orthonormal basis matrices of the second-order and higher-order Krylov subspaces was generalized to the model order reduction of time-delay systems [39], solving the linearized eigenvalue problem of matrix polynomials in the Chebyshev ba- sis [17] and implementing a rational Krylov method for solving nonlinear eigenvalue problems [35]. Unlike SOAR, TOAR maintains an orthonormal basis matrix Vk of the associated linear Krylov subspace. Therefore, it is generally believed to be numerically stable. This belief has been adopted in the literature [39, 17, 35]. The motivation of this paper is to provide a rigorous stability analysis of TOAR. We prove that under mild assumptions, TOAR with partial reorthogonalization is backward stable in computing the basis matrix Vk of the associated linear Krylov subspace. In addition, in this paper, we provide a different derivation of the TOAR procedure comparing to the ones in [40, 34] and remove unnecessary normalization steps. A comprehensive TOAR procedure, including the partial reorthogonalization and the treatments of deflation and breakdown, is presented. The advantages of the TOAR procedure are illustrated by an application to the structure-preserving model order reduction of second-order dynamical systems. The rest of this paper is organized as follows. In section 2, we review the definition and essential properties of the second-order Krylov subspace. In section 3, we derive the TOAR procedure. The backward stability of the TOAR procedure is proven in section 4. Numerical examples for the application of the TOAR procedure in the model order reduction of second-order dynamical systems are presented in section 5. Concluding remarks are in section 6. Notations. Throughout the paper, we use the upper case letter for matrices, lower case letter for vectors, particularly, I for the identity matrix with ei being the ith column, and the dimensions of these matrices and vectors conform with the dimensions used in the context. Following the convention of matrix analysis, we use ·T for transpose, and ·† for the pseudo-inverse, |·| for elementwise absolute value, · 2 and · F for 2-norm and Frobenius norm, respectively, span{U, v} for the subspace spanned by the columns of matrices U and vector v.WealsouseMATLAB conventions v(i : j)fortheith to jth entries of vector v,andA(i : j, k : )forthe submatrix of matrix A by the intersection of row i to j and column k to .Other notations will be explained as used.
2. Second-order Krylov subspace. Let A and B be n×n matrices and r−1 and r0 be length-n vectors such that [r−1,r0] = 0. Then the sequence r−1,r0,r1,r2,... with
(2.1) rj = Arj−1 + Brj−2 for j ≥ 1
is called a second-order Krylov sequence based on A, B, r−1,andr0. The subspace
(2.2) Gk(A, B; r−1,r0) ≡ span{r−1,r0,r1,...,rk−1}
is called a kth second-order Krylov subspace. If the vector rj lies in the subspace spanned by the vectors r−1,...rj−1, i.e., Gj+1(A, B; r−1,r0)=Gj (A, B; r−1,r0), then STABILITY ANALYSIS OF TOAR 197 a deflation occurs. We should stress that the deflation does not necessarily imply that rj+1 and all the following vectors still lie in Gj (A, B; r−1,r0). The latter case is referred to as a breakdown of the second-order Krylov sequence. The second-order Krylov subspace Gk(A, B; r−1,r0) can be embedded in the linear Krylov subspace 2 k−1 Kk(L, v0) ≡ span{v0,Lv0,L v0,...,L v0} r0 r1 r2 rk−1 (2.3) =span , , ,..., , r−1 r0 r1 rk−2 where AB 2n×2n r0 2n L = ∈ R and v0 = ∈ R , I 0 r−1 and I is an identity matrix of size n. Specifically, let Qk and Vk be basis matrices of the subspaces Gk(A, B; r−1,r0)andKk(L, v0), respectively. Then by (2.3), we know that
(2.4) span{Vk(1 : n, :)} =span{r0,r1,...,rk−1}, (2.5) span{Vk(n+1 : 2n, :)} =span{r−1,r0,...,rk−2}, and
(2.6) span{Qk} =span{Vk(1 : n, :),Vk(n+1 : 2n, :)}.
By (2.4)–(2.6), the basis matrix Vk of the linear Krylov subspace Kk(L, v0)can be written in terms of the basis matrix Qk of the second-order Krylov subspace Gk(A, B; r−1,r0): Vk(1 : n, :) QkUk,1 Qk Uk,1 (2.7) Vk = = = ≡ Q[k]Uk. Vk(n+1 : 2n, :) QkUk,2 Qk Uk,2
Without loss of generality, we assume the linear Krylov subspace Kk(L, v0)isnot reduced, i.e., Vk is of dimension 2n × k.Otherwise,Kk(L, v0)=Kj (L, v0)forsome j
3. TOAR Procedure. To cure the potential instability of the SOAR procedure, in this section we study a procedure that computes an orthonormal basis matrix Qk of the second-order Krylov subspace Gk(A, B; r−1,r0), and meanwhile, maintains the basis matrix Vk of the associated linear Krylov subspace Kk(L, v0) as orthonormal. The procedure is called as TOAR. Recall that the Arnoldi procedure for computing an orthonormal basis of the kth Kryov subspace Kk(L, v0) is governed by the following Arnoldi decomposition of order k − 1:
(3.1) LVk−1 = VkHk,
where Vk−1 consists of the first k − 1 columns of Vk,andHk is a k × (k − 1) unre- duced upper Hessenberg matrix; see, for example, [31, Chap. 5]. By the compact representation (2.7), we have the compact Arnoldi decomposition of order k − 1, AB Qk−1Uk−1,1 QkUk,1 (3.2) = Hk. I 0 Qk−1Uk−1,2 QkUk,2
We now show how to compute Qk, Uk,1,andUk,2 without explicitly generating T T T Vk. We note that the orthonormality of Qk and Vk implies that Uk =[Uk,1,Uk,2] is also orthonormal. Let us begin with the following lemma describing the relations between Qk−1 and Qk, Uk−1,i and Uk,i for i =1, 2. Lemma 3.1. For the compact Arnoldi decomposition (3.2),
(3.3) span{Qk} =span{Qk−1,r},
where r = AQk−1Uk−1,1(:,k − 1) + BQk−1Uk−1,2(:,k − 1).Furthermore,(a) if r ∈ span{Qk−1}, then there exist vectors xk and yk such that (3.4) Qk = Qk−1,Uk,1 = Uk−1,1 xk , and Uk,2 = Uk−1,2 yk ;
(b) otherwise, there exist vectors xk and yk andascalarβk =0 such that (3.5) Uk−1,1 xk Uk−1,2 yk Qk = Qk−1 qk ,Uk,1 = , and Uk,2 = , 0 βk 00
T where qk =(I − Qk−1Qk−1)r/α and α is a normalization factor such that qk 2 =1. Proof. Note that the (k − 1)st column of (3.1) is
Lvk−1 = Vk−1Hk(1 : k − 1,k− 1) + Hk(k, k − 1) · vk,
where vk−1 and vk are the last columns of Vk−1 and Vk, respectively. Therefore, given Vk−1 we have
span{Vk−1,Lvk−1} =span{Vk−1,vk} =span{Vk}≡Kk(L, v0).
Since Qk−1Uk−1,1 r Vk−1 Lvk−1 = , Qk−1Uk−1,2 Qk−1Uk−1,1(:,k− 1) by applying the subspace relation (2.6), we have
span{Qk} =span{Qk−1Uk−1,1,r,Qk−1Uk−1,2} =span{Qk−1,r}. STABILITY ANALYSIS OF TOAR 199
Hence the relation (3.3) is proven. To prove (a) and (b), we first note that by (2.4), the top n elements of the kth column vk of Vk satisfy
(3.6a) vk(1 : n) ∈ span{r0,r1,...,rk−1}⊂Gk(A, B; r−1,r0)=span{Qk},
and by (2.5), the bottom n elements of vk satisfy
(3.6b) vk(n+1 : 2n) ∈ span{r−1,r0,...,rk−2}⊂Gk−1(A, B; r−1,r0)=span{Qk−1}.
If r ∈ span{Qk−1}, then (3.3) implies span{Qk} =span{Qk−1}.Furthermore,by (3.6), the vector vk(1 : n) Qk−1xk vk = = vk(n +1:2n) Qk−1yk
for some vectors xk and yk. Therefore, Qk−1Uk−1,1 Qk−1xk Qk−1[Uk−1,1 xk] Vk = Vk−1 vk = = . Qk−1Uk−1,2 Qk−1yk Qk−1[Uk−1,2 yk]
Thus the result (a) is proven. If r ∈ span{Qk−1}, then by orthogonalizing r against Qk−1,wehave
r = Qk−1s + αqk, − T where qk =(I Qk−1Qk−1)r/α is a unitary vector orthogonal to the columns of Qk−1.Consequently,Qk = Qk−1 qk . Furthermore, according to (3.6), the Arnoldi vector vk(1 : n) Qk−1xk + βkqk vk = = vk(n +1:2n) Qk−1yk
for some vectors xk, yk, and scalar βk. Therefore, the result (b) is immediately proven by the following equations: Qk−1Uk−1,1 Qk−1xk + βkqk Vk = Vk−1 vk = . Qk−1Uk−1,2 Qk−1yk
Now we derive the TOAR procedure to compute the compact Arnoldi decompo- r0 sition (3.2). With the initial vectors r−1 and r0 such that v0 = = 0, we apply r−1 the rank revealing QR decomposition of the n×2matrix r−1 r0 = Q1X,
where Q1 is an n×η1 orthonormal matrix, and X is an η1×2 matrix. Note that η1 =2 if the starting vectors r−1 and r0 are linearly independent, otherwise η1 =1.Thenit follows 1 r0 1 Q1X(:, 2) Q1 U1,1 V1 = = = ≡ Q[1]U1, γ r−1 γ Q1X(:, 1) Q1 U1,2
where γ = v0 2, U1,1 = X(:, 2)/γ and U1,2 = X(:, 1)/γ. After the initialization, we have Q1, U1, and an empty 1 × 0 Hessenberg matrix H1 =[]. 200 D. LU, Y. SU, AND Z. BAI
Let us assume that the compact Arnoldi decomposition (3.2) has been computed for k = j, j ≥ 1. Next, the TOAR procedure consists of two steps to compute the decomposition (3.2) for k = j +1,namely, AB Qj Uj,1 Qj+1Uj+1,1 (3.7) = Hj+1, I 0 Qj Uj,2 Qj+1Uj+1,2 where Qj, Uj,1,andUj,2 and the first j − 1 columns of Hj+1 in (3.7) have been computed. At the first step, we compute the orthonormal matrix Qj+1. According to Lemma 3.1, we have span{Qj+1} =span{Qj,r},wherer = AQj Uj,1(:,j) + BQj Uj,2(:,j). By orthogonalizing r against Qj , it yields T (3.8) qj+1 =(r − Qj s)/α with s = Qj r, α = r − Qj s 2, where it is assumed that α = 0. Subsequently, Qj+1 = Qj qj+1 and ηj+1 = ηj +1. If α =0,thenr ∈ span{Qj} and Qj+1 = Qj and ηj+1 = ηj . A deflation occurs and Gj+1(A, B; r−1,r0)=Gj (A, B; r−1,r0). Uj+1,1 At the second step, we compute Uj+1,1 and Uj+1,2 such that Uj+1 = Uj+1,2 T is orthonormal. By left multiplying Q[j+1] of (3.7) and taking the jth column, we have T Qj+1r Uj+1,1 (3.9) T = Hj+1(:,j). Qj+1QjUj,1(:,j) Uj+1,2
When there is no deflation in the first step, by Lemma 3.1, Uj+1 is of the form ⎡ ⎤ Uj,1 xj+1 ⎢ ⎥ Uj+1,1 ⎢ 0 βj+1 ⎥ Uj+1 = = ⎣ ⎦ Uj+1,2 Uj,2 yj+1 00 for some vectors xj+1 and yj+1, and scalar βj+1 = 0. Denote (see (3.8))
⎡ 1 ⎤ ηj s ⎢ ⎥ T 1 ⎢ α ⎥ ηj+1 Qj+1r ⎣ ⎦ ≡ T , ηj u ηj+1 Qj+1QjUj,1(:,j) 1 0 where u ≡ Uj,1(:,j). The equation (3.9) can be written as ⎡ ⎤ ⎡ ⎤ ⎡ ⎤ s Uj,1 xj+1 ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ α ⎥ ⎢ 0 ⎥ ⎢ βj+1 ⎥ ⎣ ⎦ = ⎣ ⎦ hj + hj+1,j ⎣ ⎦ , u Uj,2 yj+1 0 0 0 where hj = Hj+1(1 : j, j)andhj+1,j = Hj+1(j +1,j). Since Uj+1 is also imposed to be orthonormal, we have ⎡ ⎤ ⎡ ⎤ ⎡ ⎤ ⎡ ⎤ T Uj,1 s s Uj,1 ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ 0 ⎥ ⎢ α ⎥ ⎢ α ⎥ ⎢ 0 ⎥ (3.10) hj = ⎣ ⎦ ⎣ ⎦ and hj+1,j = ⎣ ⎦ − ⎣ ⎦ hj . Uj,2 u u Uj,2 0 0 0 0 2 STABILITY ANALYSIS OF TOAR 201
Assume hj+1,j = 0, the vectors xj+1 and yj+1 and scalar βj+1,namely,the(j +1)st column of Uj+1,arethengivenby ⎡ ⎤ ⎡ ⎤ ⎡ ⎤ ⎡ ⎤ ⎡ ⎤ xj+1 s s s Uj,1 ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ βj+1 ⎥ 1 ⎢ α ⎥ ⎢ α ⎥ ⎢ α ⎥ ⎢ 0 ⎥ (3.11) ⎣ ⎦ = ⎣ ⎦ with ⎣ ⎦ := ⎣ ⎦ − ⎣ ⎦ hj . yj+1 hj+1,j u u u Uj,2 0 0 0 0 0
If there is a deflation in the first step, then Qj+1 = Qj. By Lemma 3.1, Uj+1 is of the form Uj,1 xj+1 Uj+1 = Uj,2 yj+1 for some vectors xj+1 and yj+1.Denote