
Nonlinear Eigenvalue Problems: A Challenge for Modern Eigenvalue Methods

Volker Mehrmann∗1 and Heinrich Voss∗∗2
1 Institut für Mathematik, MA 4-5, Technische Universität Berlin, D-10623 Berlin, Fed. Rep. Germany. Supported by Deutsche Forschungsgemeinschaft through the DFG Research Center MATHEON ‘Mathematics for key technologies’ in Berlin
2 Arbeitsbereich Mathematik, Technische Universität Hamburg-Harburg, D-21071 Hamburg, Fed. Rep. Germany

Key words polynomial, projection method, Krylov-subspace method, Arnoldi method, rational-Krylov method, linearization, structure preservation.
MSC (2000) 65F15, 15A18, 35P30

We discuss the state of the art in numerical solution methods for large scale polynomial or rational eigenvalue problems. We present the currently available solution methods, such as the Jacobi-Davidson, Arnoldi or rational Krylov method, and analyze their properties. We briefly introduce a new linearization technique and demonstrate how it can be used to improve structure preservation and, with this, the accuracy and efficiency of linearization based methods. We present several recent applications where structured and unstructured nonlinear eigenvalue problems arise, and some numerical results.


1 Introduction

We discuss numerical methods for the solution of large scale nonlinear eigenvalue problems

F(λ)x = F(λ; M_0, . . . , M_k, p)x = 0,   (1)

where for K = C or K = R

F : D → K^{m,n}

is a family of matrices depending on a variable λ ∈ D, where D ⊂ K is an open set. As in the linear case, λ ∈ D is called an eigenvalue of problem (1) if equation (1) has a nontrivial solution x ≠ 0. Then x is called an eigenvector corresponding to λ. The family F typically depends on coefficient matrices M_0, . . . , M_k ∈ K^{m,n} and often also on a vector of parameters p ∈ C^r, e.g. material parameters or excitation frequencies. In many applications the purpose of the solution of the eigenvalue problem is to optimize certain properties of the eigenvalues, eigenvectors or the underlying dynamical system with respect to these parameters.

∗ e-mail: [email protected], Phone: +49 30 314 25736, Fax: +49 30 314 79706
∗∗ e-mail: [email protected]


Nonlinear eigenvalue problems arise in a variety of applications. The most widely studied class in applications is the quadratic eigenvalue problem with

F(λ) := λ^2 M + λC + K   (2)

that arises in the dynamic analysis of structures, see [27, 48, 74, 90] and the references therein. Here, typically the stiffness matrix K and the mass matrix M are real symmetric and positive (semi-)definite, and the damping matrix C is general. Another source for such problems are vibrations of spinning structures yielding conservative gyroscopic systems [21, 47, 39], where K = K^T and M = M^T are real positive (semi-)definite, and C = −C^T is real skew-symmetric. In most applications one is interested in the eigenvalues of smallest real part.

A quadratic problem of slightly different structure arises in the study of corner singularities in anisotropic elastic materials [3, 4, 5, 44, 52, 63, 89], where the problem has the form

(λ^2 M(p) + λG(p) + K(p)) x = 0,   (3)

with large and sparse coefficient matrices M(p) = M(p)^T, G(p) = −G(p)^T, K(p) = K(p)^T resulting from a finite element discretization. Here M(p) and −K(p) are positive definite, and the coefficient matrices depend on a set of material and geometry parameters p which are varied. The desired part of the spectrum consists of the eigenvalues nearest to the imaginary axis, and these are also the eigenvalues for which the error estimates of the finite element discretization are most favorable. Polynomial eigenvalue problems of the form (3) are called even, see [55], since replacing λ by −λ and transposing gives the same problem. The spectrum of these problems has the Hamiltonian eigensymmetry, i.e. it is symmetric with respect to the real and the imaginary axis.

Another recently studied structured eigenvalue problem arises in the optimization of the acoustic emissions of high speed trains [34, 35, 55]. The model for the vibrations of the rails leads to a rational eigenvalue problem of the form

(λ M_1(ω) + M_0(ω) + (1/λ) M_1^T(ω)) x = 0,   (4)

where the coefficients M_0, M_1 are large and sparse complex matrices depending on the excitation frequency ω. Here M_1(ω) is highly rank deficient and M_0(ω) is complex symmetric. The eigenvalues occur in pairs λ, 1/λ, and most of the eigenvalues are at 0 and ∞. What is needed in the industrial application are the finite nonzero eigenvalues and the corresponding eigenvectors. None of the classical methods worked for this problem, and only special methods that were able to deal with the specific structure delivered sufficiently accurate eigenvalue approximations.

Eigenvalue problems of the form (4) (or rather their polynomial representation, which is obtained by multiplying (4) by λ) are called palindromic in [55], since replacing λ by 1/λ and transposing yields the same problem. The spectrum has the symplectic eigensymmetry, i.e. it is symmetric with respect to the unit circle.

There are many other applications leading to structured or unstructured quadratic eigenvalue problems. A detailed survey has recently been given in [100]. Quadratic eigenvalue problems are special cases of polynomial eigenvalue problems

P(λ)x = ( ∑_{j=0}^{k} λ^j M_j ) x = 0,   (5)

with coefficients M_j ∈ K^{n,n}. An important application of polynomial eigenvalue problems is the solution of the optimal control problem to minimize the cost functional

∫_{t_0}^{t_1} ∑_{i=0}^{k} [ (q^{(i)})^T Q_i q^{(i)} + u^T R u ] dt

with Q_i = Q_i^T positive semidefinite and R = R^T positive definite, subject to the k-th order control system

∑_{i=0}^{k} M_i q^{(i)} = B u(t),

with control input u(t) and initial conditions

q^{(i)}(t_0) = q_{i,0},   i = 0, 1, . . . , k − 1.   (6)

Application of the linear version of the Pontryagin maximum principle, e.g. [61], leads to the boundary value problem of Euler-Lagrange equations

∑_{j=1}^{k−1} [ (−1)^{j−1} Q_j, M_{2j}^T; M_{2j}, 0 ] [ q^{(2j)}; μ^{(2j)} ] + ∑_{j=1}^{k−1} [ 0, −M_{2j+1}^T; M_{2j+1}, 0 ] [ q^{(2j+1)}; μ^{(2j+1)} ] + [ −Q_0, M_0^T; M_0, −B R^{−1} B^T ] [ q; μ ] = 0,

where [A, B; C, D] denotes a 2 × 2 block matrix with rows [A, B] and [C, D],

with initial conditions (6) and μ^{(i)}(t_1) = 0 for i = 0, . . . , k − 1, where we have introduced the new coefficients M_{k+1} = M_{k+2} = . . . = M_{2k} = 0. Here, all coefficients of derivatives higher than k are singular, and one obtains a boundary value problem with coefficient matrices that alternate between real symmetric and skew-symmetric matrices. The solution of this boundary value problem can then be obtained by decoupling the forward and backward integration, i.e. by computing the deflating subspace associated with the eigenvalues in the left (or right) half plane, see e.g. [61]. The associated matrix polynomial is even or odd depending on the degree and on whether the leading coefficient is symmetric or skew-symmetric, and the spectrum has the Hamiltonian eigensymmetry.

For this problem, even though it is large and sparse, the solution of the boundary value problem requires the computation of a deflating subspace associated with half of the eigenvalues. This can only be done for medium size problems, where it is possible to store the full matrix. If the system size is bigger, then alternative techniques based on low rank approximations to Riccati equations have to be applied, see e.g. [14, 54, 71].

Other polynomial eigenvalue problems of higher degree than two arise when discretizing linear eigenproblems by dynamic elements [74, 103, 104] or by least squares elements [78, 79], i.e. if one uses ansatz functions in a Rayleigh–Ritz approach which depend polynomially on the eigenparameter.


Rational eigenproblems

R(λ)x = −Kx + λMx + ∑_{j=1}^{k} (λ/(σ_j − λ)) C_j x = 0,   (7)

where K = K^T and M = M^T are positive definite and C_j = C_j^T are matrices of small rank, occur in the study of the free vibration of plates with elastically attached masses [59, 96, 106] or vibrations of fluid solid structures [17, 73, 108]. A similar problem

R(λ)x = −Kx + λMx + λ^2 ∑_{j=1}^{k} (1/(ω_j − λ)) C_j x = 0   (8)

arises when a generalized linear eigenproblem is condensed exactly [72, 102]. Both these problems have real eigenvalues which can be characterized as min-max values of a Rayleigh functional [106], and in both cases one is interested in a small number of eigenvalues at the lower end of the spectrum or close to an excitation frequency. Another type of rational eigenproblem is obtained for the free vibrations of a structure if one uses a viscoelastic constitutive relation to describe the behavior of the material [31, 32]. A finite element model takes the form

R(λ)x := ( λ^2 M + K − ∑_{j=1}^{k} (1/(1 + b_j λ)) ΔK_j ) x = 0,   (9)

where the stiffness and mass matrices K and M are positive definite, k denotes the number of regions with different relaxation parameters b_j, and ΔK_j is an assemblage of element stiffness matrices over the region with the distinct relaxation constants. Note that the rational problems (4), (7), (8), and (9) can be turned into polynomial eigenvalue problems by multiplying with an appropriate scalar polynomial in λ.

A genuinely nonlinear dependence on the eigenparameter appears in dynamic element methods when using non-polynomial ansatz functions [74] or in the stability analysis of vibrating systems under state delay feedback control [37, 38].

Almost all these examples are finite dimensional approximations (typically finite element models) of operator eigenvalue problems and hence are large and sparse. Usually only a small number of eigenvalues in a specific region of the complex plane and the associated eigenvectors are of interest. Furthermore, in most of these applications the eigenvalue problem has extra structure, e.g. it is odd, even, or palindromic, or a perturbation of a symmetric pencil.

It is common wisdom in numerical analysis that any kind of extra structure (arising typically from the properties of the underlying physical problem) should be reflected as much as possible in the numerical method. In this way it is guaranteed that the approximate numerical solution properly reflects the physical properties of the system, and structure preservation typically also leads to a gain in efficiency and accuracy. As an example, consider problems with Hamiltonian eigensymmetry. It has been shown in [24, 75, 76] that such a problem may be well-conditioned under structured perturbations, but ill-posed under unstructured perturbations.


Fig. 1 Millennium bridge

In summary, the mathematical modelling of many engineering applications leads to large scale polynomial, rational, or more general nonlinear eigenvalue problems with coefficient matrices that are sparse and often have extra structure. Usually only a few eigenvalues in a specific region of the complex plane are required. The task of numerical linear algebra is then to design numerical methods that are accurate and efficient for the given problem. The methods should exploit to a maximal extent the sparsity and structure of the coefficient matrices. Furthermore, they should be as accurate as the approximation of the underlying operator problem permits, and they should include error and condition estimates.

Many of the currently used mathematical models and numerical methods do not satisfy these requirements, and several failures due to bad modelling or failure of numerical methods have occurred in recent years, see e.g. http://www.ima.umn.edu/~arnold/disasters/disasters.html. An interesting example where a failure due to mathematical modelling happened in the context of nonlinear eigenvalue problems is the Millennium bridge over the river Thames in London, which had to be closed right on opening day due to large oscillations, see http://www.arup.com/MillenniumBridge/. The cost to build in new damping was several million pounds. This example demonstrates that millions are invested in new industrial and architectural designs, but the mathematics that is used to model and simulate the problem is often very old and not adequate for the current applications. More research into mathematical modelling, numerical methods and appropriate software for nonlinear eigenvalue problems is urgently needed.

For linear eigenvalue problems Ax = λx or Ax = λEx, well established methods are available that include error and condition estimates. These methods are able to deal with most of the small or large scale problems in practice [1, 6, 16, 53, 58], and specific methods have been developed to handle extra structures [9, 11, 61, 63, 64]. For nonlinear eigenvalue problems, there are essentially no analogous packages that reach the standard of those for linear problems.


There are several reasons for this lack of adequate software. First of all, the essential tools in most numerically stable eigenvalue methods, the Schur form and the generalized Schur form [28], are in general not available for nonlinear eigenvalue problems. Not even an analogue of the Jordan or Kronecker/Weierstrass form is available [27]. These missing tools make numerical techniques such as deflation, purging and locking of eigenvalues, or implicit restarts very difficult. Also the sensitivity and round-off error analysis for nonlinear eigenvalue problems is still in its infancy, see e.g. [91, 99].

The theoretical analysis and the numerical methods for polynomial eigenvalue problems usually proceed via linearization, i.e. via the embedding of the nonlinear eigenvalue problem into a larger linear eigenvalue problem [27]. This is not ideal, since it makes the problem much bigger and may also significantly worsen the conditioning of the problem, i.e. the resulting linear eigenvalue problem may be much more sensitive to perturbations than the original problem, see [99]. Moreover, the classical linearization approaches usually destroy the inherent symmetry structure of the problem. Some progress in the construction of better linearizations comes from the recent work [56], where a systematic new linearization approach has been developed that also allows and analyzes structure preserving linearizations [33, 55, 64]. We will briefly review this new linearization concept in Section 2.

Most of the literature on polynomial eigenvalue problems discusses only regular matrix polynomials, i.e. square matrix polynomials where det(P(λ)) does not vanish identically. Singular problems, however, arise frequently in practice, when automatic modelling leads to overdetermined systems with redundant equations [22, 65, 98]. Often the case of singular matrix polynomials can be reduced to the case of regular matrix polynomials using appropriate reduction procedures that have recently been derived in [62, 91]. But these procedures are not yet applicable to large scale problems and are rather expensive even for small scale problems.

Finally, it is often assumed that the leading coefficient matrix M_k is nonsingular or even the identity matrix. In many applications, such as constrained multi-body systems [22, 87], circuit simulation [26], optical waveguide design [88], or problem (4), however, the leading coefficient is singular, i.e. the matrix polynomial has eigenvalues at infinity. In such a case not all linearizations properly reflect the multiplicities of the eigenvalue infinity, see [62, 91]. In [49] it has recently been suggested to use strong linearizations, which properly reflect these multiplicities. The new general linearization approach of [56] yields strong linearizations, and the reduction procedure of [62, 91] allows one to deflate this part of the matrix polynomial. In problems with extra structure, this can be done in a structured way, see e.g. [34, 35, 62].

More research is needed on the development of methods and the appropriate perturbation and error analysis, using the original data of the problem and not the matrix pencil arising from the linearization.

Despite the urgent need for more research and the many described difficulties, there exist many classical but also several new and promising numerical methods for nonlinear eigenvalue problems. We will review the state of the art in Section 3 and present some numerical examples in Section 4.


2 Linearization

The classical approach to solve k-th degree polynomial eigenvalue problems of the form (5) or rational eigenvalue problems of the forms (7)–(9) is to first perform a linearization [27], i.e. to transform the problem to an equivalent linear eigenvalue problem (λE − A)x = 0 with the same eigenvalues. The transformation between the polynomial and the linear pencil is performed with the help of unimodular matrix polynomials, i.e., matrix polynomials Q(λ) such that det Q(λ) is a nonzero constant, independent of λ.

Definition 2.1 (Linearization [27]) Let P(λ) be a matrix polynomial of degree k with coefficients in K^{n,n}. A pencil L(λ) = λE − A with E, A ∈ K^{kn,kn} is called a linearization of P(λ) if there exist unimodular matrix polynomials Q_1(λ), Q_2(λ) such that

Q_1(λ) L(λ) Q_2(λ) = [ P(λ), 0; 0, I_{(k−1)n} ].

The most commonly used linearizations are the first companion form

( λ [ M_k, 0, ⋯, 0; 0, I_n, ⋱, ⋮; ⋮, ⋱, ⋱, 0; 0, ⋯, 0, I_n ] + [ M_{k−1}, M_{k−2}, ⋯, M_0; −I_n, 0, ⋯, 0; ⋮, ⋱, ⋱, ⋮; 0, ⋯, −I_n, 0 ] ) [ x_k; x_{k−1}; ⋮; x_1 ] = 0,

which is obtained by introducing the new vectors x_i = λ^{i−1} x, i = 1, . . . , k, and the analogous second companion form with the pencil

( λ [ M_k, 0, ⋯, 0; 0, I_n, ⋱, ⋮; ⋮, ⋱, ⋱, 0; 0, ⋯, 0, I_n ] + [ M_{k−1}, −I_n, ⋯, 0; M_{k−2}, 0, ⋱, ⋮; ⋮, ⋮, ⋱, −I_n; M_0, 0, ⋯, 0 ] ) [ x_1; x_2; ⋮; x_k ] = 0.

But the companion forms are not the only possible linearizations. In a recent paper [56] a systematic approach has been presented to generate large classes of linearizations by constructing kn × kn matrix pencils L(λ) = λX + Y with the properties

L(λ) [ λ^{k−1} I_n; ⋮; λ I_n; I_n ] = [ v_1 P(λ); v_2 P(λ); ⋮; v_k P(λ) ]   or   [ λ^{k−1} I_n, ⋯, λ I_n, I_n ] L(λ) = [ v_1 P^T(λ), v_2 P^T(λ), ⋯, v_k P^T(λ) ]   (10)

for some v = [v_1, ⋯, v_k]^T ∈ K^k. It has been shown in [56] that in order to properly reflect the infinite eigenvalues (which occur if the leading coefficient M_k is singular) one needs to consider matrix pencils L(λ) that satisfy both conditions in (10). Then the following theorem classifies when such pencils are linearizations.


Theorem 2.2 [56] Suppose that P(λ) is a regular matrix polynomial, i.e. P(λ) is square and its determinant does not vanish identically, and suppose that L(λ) satisfies both conditions in (10). Define the scalar polynomial q(ξ) = ∑_{i=1}^{k} v_i ξ^{k−i} = v_1 ξ^{k−1} + v_2 ξ^{k−2} + ⋯ + v_{k−1} ξ + v_k. Then L(λ) is a linearization for P(λ) if and only if the sets {roots of q(ξ)} and {eigenvalues of P(λ)} are disjoint. Here it is to be understood that ∞ may also be an element of either set.

Using this theorem, many different linearizations can be constructed, in particular linearizations that have the same structure as the original matrix polynomial. Furthermore, the freedom in the choice of the coefficients may be used to make the resulting linear eigenvalue problem as well conditioned as possible, although at the writing of this paper it is an open problem how to do this.

Example 2.3 [3, 63] Considering an even quadratic eigenvalue problem of the form (3) and the even polynomial q(ξ) = 1, we obtain the linearization

L(λ) = λ [ 0, M; −M, G ] + [ −M, 0; 0, −K ].

Since q(ξ) has only the root ∞, we have by Theorem 2.2 that L(λ) is a linearization if and only if the mass matrix M is nonsingular.

Example 2.4 [34, 35, 55] Consider a palindromic quadratic eigenvalue problem of the form (4) and the palindromic polynomial q(ξ) = ξ + 1. We obtain the palindromic pencil λZ + Z^T with

Z = [ M_1, M_0 − M_1^T; M_1, M_1 ],

which by Theorem 2.2 is a linearization if and only if −1 is not an eigenvalue of (4). (This condition is always satisfied in the industrial application, since in this example −1 corresponds to an undamped vibration of the rail.)

Example 2.5 Consider a symmetric eigenvalue problem of the form (2), i.e. M, C, K are symmetric, and the general polynomial q(ξ) = v_1 ξ + v_2. A simple calculation yields pencils of the form

λ [ v_1 M, v_2 M; v_2 M, v_2 C − v_1 K ] + [ v_1 C − v_2 M, v_1 K; v_1 K, v_2 K ].

By Theorem 2.2 this is a linearization if and only if no eigenvalue of (2) is a root of q(ξ).
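To make the linearization approach concrete, the following minimal Python sketch (not from the paper; the coefficient matrices are random placeholders) assembles the first companion form of a quadratic pencil λ^2 M + λC + K and checks that each computed eigenvalue makes P(λ) numerically singular.

import numpy as np
from scipy.linalg import eig

# Hypothetical quadratic pencil P(lam) = lam^2 M + lam C + K with random data.
n = 4
rng = np.random.default_rng(0)
M, C, K = (rng.standard_normal((n, n)) for _ in range(3))

# First companion form: lam*E + A with E = [M, 0; 0, I], A = [C, K; -I, 0],
# acting on [x2; x1] = [lam*x; x].
I = np.eye(n)
Z = np.zeros((n, n))
E = np.block([[M, Z], [Z, I]])
A = np.block([[C, K], [-I, Z]])

# (lam*E + A) z = 0 is the generalized eigenproblem (-A) z = lam E z.
evals, evecs = eig(-A, E)

# Each finite eigenvalue should make P(lam) singular:
for lam in evals[np.isfinite(evals)][:3]:
    P = lam**2 * M + lam * C + K
    print(lam, np.linalg.svd(P, compute_uv=False)[-1])  # smallest singular value ~ 0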

3 Numerical methods

There is a vast literature on numerical methods for nonlinear eigenvalue problems. In general, one has to distinguish between dense and large sparse problems. For dense problems, the size of the problems that can be treated with numerical methods is limited to a few thousand, depending on the available storage capacities. Methods for small dense problems, however, are needed in most of the iterative projection methods for large sparse problems. These iterative projection methods make use of the sparse matrix structure and typically require only matrix-vector multiplications with the coefficient matrices plus possibly sparse factorizations of matrices, when shift-and-invert is used to get eigenvalues in

the interior of the spectrum. Again, the available storage sets the limit for the system sizes that can be dealt with. Using the sparsity and the symmetry structure of the coefficient matrices, nowadays problems of size on the order of n = 10^7 can be treated.

In the polynomial or rational case the simplest approach is to use linearization and then to apply standard methods for linear eigenvalue problems, as they are available in [1, 13, 16, 53, 58]. Despite the fact that this increases the dimension, it is usually the method of choice if no special properties of the problem are known. In the case of structured problems the linearization should reflect the structure, and then a structure preserving method for the linearized problem should be used. It is currently under discussion to improve the features of [1, 16, 58] to make better use of the resulting structures in the linearized problems.

3.1 Newton type methods and inverse iteration

For general nonlinear eigenvalue problems, the classical approach is to formulate the eigenvalue problem as a system of nonlinear equations and to use variations of Newton's method or the inverse iteration method. For the characteristic equation

det F(λ) = 0,   (11)

it was suggested in [45, 46] to use a QR-decomposition with column pivoting F(λ)P(λ) = Q(λ)R(λ), where P(λ) is a permutation matrix chosen such that the diagonal elements r_jj(λ) of R(λ) are decreasing in magnitude, i.e. |r_11(λ)| ≥ |r_22(λ)| ≥ ⋯ ≥ |r_nn(λ)|. Then λ is an eigenvalue if and only if r_nn(λ) = 0. Applying Newton's method to this equation, one obtains the iteration

λ_{k+1} = λ_k − 1 / ( e_n^H Q(λ_k)^H F′(λ_k) P(λ_k) R(λ_k)^{−1} e_n )   (12)

for approximations to an eigenvalue of problem (11). (Here e_n denotes the n-th unit vector.) Approximations to left and right eigenvectors can be obtained from

y_k = Q(λ_k) e_n   and   x_k = P(λ_k) R(λ_k)^{−1} e_n.

An improved version of this method was suggested in [40, 41], and quadratic convergence was shown. A similar approach was presented in [116], via a representation of Newton's method using the LU-factorization of F(λ). Other variations of this method can be found in [117, 118]. However, this relatively simple idea is not efficient, since it computes eigenvalues one at a time and needs several O(n^3) factorizations per eigenvalue. It is, however, useful in the context of iterative refinement of computed eigenvalues and eigenvectors.

Another method that also serves the purpose of iterative refinement is the nonlinear version of inverse iteration. For linear eigenproblems Ax = λx it is well known that inverse iteration is equivalent to Newton's method applied to the nonlinear system

[ Ax − λx; v^H x − 1 ] = 0,

where v ∈ C^n is suitably chosen. Correspondingly, for the nonlinear problem (1) one considers the system

[ F(λ)x; v^H x − 1 ] = 0,

for which one step of Newton's method gives

[ F(λ_k), F′(λ_k) x_k; v^H, 0 ] [ x_{k+1} − x_k; λ_{k+1} − λ_k ] = − [ F(λ_k) x_k; v^H x_k − 1 ].   (13)

The first component yields

x_{k+1} = −(λ_{k+1} − λ_k) F(λ_k)^{−1} F′(λ_k) x_k,   (14)

i.e. the direction of the new approximation to an eigenvector is u_{k+1} := F(λ_k)^{−1} F′(λ_k) x_k. Assuming that x_k is already normalized by v^H x_k = 1, the second component of (13) reads v^H x_{k+1} = v^H x_k = 1, and multiplying equation (14) by v^H yields

λ_{k+1} = λ_k − (v^H x_k) / (v^H u_{k+1}).

Hence, for nonlinear eigenproblems inverse iteration takes the form given in Algorithm 1.

Algorithm 1 Inverse iteration
1: Start with λ_0, x_0 such that v^H x_0 = 1
2: for k = 0, 1, 2, . . . until convergence do
3: solve F(λ_k) u_{k+1} = F′(λ_k) x_k for u_{k+1}
4: λ_{k+1} = λ_k − (v^H x_k)/(v^H u_{k+1})
5: normalize x_{k+1} = u_{k+1}/(v^H u_{k+1})
6: end for
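As an illustration, here is a minimal Python sketch of Algorithm 1; F and Fprime are assumed to be callables returning F(λ) and F′(λ) as arrays, and the quadratic test pencil at the end is a hypothetical placeholder, not one of the applications above.

import numpy as np

def inverse_iteration(F, Fprime, lam, x, v, tol=1e-10, maxit=50):
    # Algorithm 1: nonlinear inverse iteration with normalization v^H x = 1.
    x = x / (v.conj() @ x)
    for _ in range(maxit):
        u = np.linalg.solve(F(lam), Fprime(lam) @ x)   # step 3: one factorization per step
        lam = lam - (v.conj() @ x) / (v.conj() @ u)    # step 4
        x = u / (v.conj() @ u)                         # step 5
        if np.linalg.norm(F(lam) @ x) < tol:
            break
    return lam, x

# Hypothetical test problem: F(lam) = lam^2 M + lam C + K.
n = 5
rng = np.random.default_rng(1)
M, C, K = (rng.standard_normal((n, n)) for _ in range(3))
F = lambda lam: lam**2 * M + lam * C + K
Fprime = lambda lam: 2 * lam * M + C
lam, x = inverse_iteration(F, Fprime, 0.5 + 0.1j,
                           rng.standard_normal(n) + 0j, rng.standard_normal(n) + 0j)
print(lam, np.linalg.norm(F(lam) @ x))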

Algorithm 1 (being a variant of Newton's method) converges locally and quadratically to (x, λ) [2, 68]. The normalization condition can be modified in each step of inverse iteration. It was suggested in [80] to use v_k = F(λ_k)^H y_k for the normalization, where y_k is an approximation to a left eigenvector. Then the update for λ becomes

λ_{k+1} = λ_k − (y_k^H F(λ_k) x_k) / (y_k^H F′(λ_k) x_k),

which is the Rayleigh functional for general nonlinear eigenproblems proposed in [48], and which can be interpreted as one Newton step for solving the equation f_k(λ) := y_k^H F(λ) x_k = 0. For linear Hermitian eigenproblems this gives cubic convergence if λ_k is updated by the Rayleigh quotient [19, 70]. The same is true [78] for symmetric nonlinear eigenproblems having a Rayleigh functional if we replace statement 4 in Algorithm 1 by λ_{k+1} = p(u_{k+1}), where p(u_{k+1}) denotes the real root of u_{k+1}^H F(λ) u_{k+1} = 0 closest to λ_k.

In [67] Newton's method is considered for the complex function β(λ) defined by

F(λ) u = β(λ) x,   v^H u = κ,

where κ is a given constant, and x and v are given vectors. This approach generalizes the method (12), inverse iteration, and a method proposed in [69]. It was proved that the rate of convergence is quadratic, and that cubic convergence can be obtained if not only λ, but also x and/or v are updated appropriately, thus unifying the results in [2, 45, 46, 48, 68, 69].

The disadvantage of inverse iteration with respect to efficiency is the large number of factorizations that are needed for each of the eigenvalues. The obvious idea then is to use a version of a simplified Newton method, where the shift σ is kept fixed during the iteration, i.e.


to use x_{k+1} = (A − σI)^{−1} x_k, i.e. to keep F(σ) fixed in step 3 of Algorithm 1. However, in general this method does not converge in the nonlinear case: the iteration converges to an eigenpair of the linear problem F(σ)x = γF′(λ̃)x, from which one cannot recover an eigenpair of the nonlinear problem (1). A remedy for this problem was proposed in [66]. Assuming that F(λ) is twice continuously differentiable, Algorithm 1 gives

x_k − x_{k+1} = x_k + (λ_{k+1} − λ_k) F(λ_k)^{−1} F′(λ_k) x_k
            = F(λ_k)^{−1} ( F(λ_k) + (λ_{k+1} − λ_k) F′(λ_k) ) x_k
            = F(λ_k)^{−1} F(λ_{k+1}) x_k + O(|λ_{k+1} − λ_k|^2).

Neglecting the second order term one gets

x_{k+1} = x_k − F(λ_k)^{−1} F(λ_{k+1}) x_k.

The advantage of this approach is that replacing λ_k by a fixed shift σ does not lead to misconvergence. The method can be implemented as in Algorithm 2, see [66].

Algorithm 2 Residual inverse iteration

1: Let v be a normalization vector and start with approximations σ and x_1 to an eigenvalue and corresponding eigenvector of (1) such that v^H x_1 = 1
2: for k = 1, 2, . . . until convergence do
3: solve v^H F(σ)^{−1} F(λ_{k+1}) x_k = 0 for λ_{k+1} (or x_k^H F(λ_{k+1}) x_k = 0 if F(λ) is Hermitian and λ_{k+1} is real)
4: compute the residual r_k = F(λ_{k+1}) x_k
5: solve F(σ) d_k = r_k for d_k
6: set z_{k+1} = x_k − d_k
7: normalize x_{k+1} = z_{k+1}/(v^H z_{k+1})
8: end for
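A minimal Python sketch of Algorithm 2 follows, under the assumption of a real eigenvalue so that the scalar equation in step 3 can be handled by a secant iteration; the key point of the method is that F(σ) is factored only once and the factorization is reused in every step.

import numpy as np
from scipy.linalg import lu_factor, lu_solve
from scipy.optimize import newton

def residual_inverse_iteration(F, sigma, x, v, lam, tol=1e-10, maxit=50):
    LU = lu_factor(F(sigma))                  # factor F(sigma) once
    x = x / (v @ x)                           # normalize v^H x = 1 (real data assumed)
    for _ in range(maxit):
        # step 3: solve v^H F(sigma)^{-1} F(lam) x = 0 for lam (secant iteration)
        g = lambda mu: v @ lu_solve(LU, F(mu) @ x)
        lam = newton(g, lam)
        r = F(lam) @ x                        # step 4: residual
        d = lu_solve(LU, r)                   # step 5: solve F(sigma) d = r
        z = x - d                             # step 6
        x = z / (v @ z)                       # step 7: normalize
        if np.linalg.norm(r) < tol:
            break
    return lam, x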

If F(λ) is twice continuously differentiable, λ̂ is a simple zero of det F(λ) = 0, and x̂ is an eigenvector normalized by v^H x̂ = 1, then the residual inverse iteration converges for all σ sufficiently close to λ̂, and one has the estimate

‖x_{k+1} − x̂‖ / ‖x_k − x̂‖ = O(|σ − λ̂|)   and   |λ_{k+1} − λ̂| = O(‖x_k − x̂‖^q),

where q = 2 if F(λ) is Hermitian, λ̂ is real, and λ_{k+1} solves x_k^H F(λ_{k+1}) x_k = 0 in step 3, and q = 1 otherwise, see [66].

A variant of this approach is the method of successive linear approximations of [80]. If F is twice continuously differentiable, and λ̂ is an eigenvalue of problem (1) such that F′(λ̂) is nonsingular and 0 is an algebraically simple eigenvalue of F′(λ̂)^{−1} F(λ̂), then the method in Algorithm 3 converges quadratically to λ̂, see [80].

The discussed versions of inverse iteration apply to general nonlinear eigenproblems, although for Hermitian problems and real eigenvalues they converge faster if the eigenvalue approximations are updated using the Rayleigh functional. For Hermitian problems that allow a variational characterization of their eigenvalues [20, 29, 30, 77, 107, 112], an alternative


Algorithm 3 Method of successive linear problems

1: Start with an approximation λ_1 to an eigenvalue of (1)
2: for k = 1, 2, . . . until convergence do
3: solve the linear eigenproblem F(λ_k) u = θ F′(λ_k) u
4: choose an eigenvalue θ smallest in modulus
5: λ_{k+1} = λ_k − θ
6: end for

is to use the safeguarded iteration. The method was introduced in [115] for overdamped problems, and in [113] for the non-overdamped case.

Let J ⊂ R be an open interval which may be unbounded, and assume that F(λ) ∈ C^{n,n} is a family of Hermitian matrices whose elements are differentiable in λ. If one assumes that for every x ∈ C^n \ {0} the real equation

f(λ, x) := x^H F(λ) x = 0   (15)

has at most one solution λ ∈ J, then (15) defines a functional ρ on some subset D ⊂ C^n which generalizes the Rayleigh quotient for linear pencils F(λ) = λE − A, and which is called the Rayleigh functional of the nonlinear eigenvalue problem (1). If one assumes further that x^H F′(ρ(x)) x > 0 for every x ∈ D (generalizing the definiteness requirement for linear pencils), then by the implicit function theorem D is an open set, and differentiating the identity x^H F(ρ(x)) x = 0 one obtains that the eigenvectors of (1) are stationary points of ρ.

Under these conditions, a minimax principle for the nonlinear eigenproblem (1) was proved in [112], provided the eigenvalues are enumerated appropriately. A value λ ∈ J is an eigenvalue of (1) if and only if μ = 0 is an eigenvalue of the matrix F(λ), and by Poincaré's max-min principle there exists m ∈ N such that

0 = max_{dim V = m} min_{x ∈ V, x ≠ 0} (x^H F(λ) x) / ‖x‖^2.

One assigns this m to λ as its number and calls λ an m-th eigenvalue of problem (1). Under the above assumptions, it was shown in [112] that for every m ∈ {1, . . . , n} problem (1) has at most one m-th eigenvalue in J, which can be characterized by

λ_m = min_{dim V = m, D ∩ V ≠ ∅} sup_{v ∈ D ∩ V} ρ(v).   (16)

Conversely, if

λ_m := inf_{dim V = m, D ∩ V ≠ ∅} sup_{v ∈ D ∩ V} ρ(v) ∈ J,

then λ_m is an m-th eigenvalue of (1), and the characterization (16) holds. The minimum is attained by the invariant subspace of F(λ_m) corresponding to its m largest eigenvalues, and the supremum is attained by any eigenvector of F(λ_m) corresponding to μ = 0.

The enumeration of eigenvalues and the fact that the eigenvectors of (1) are the stationary vectors of the Rayleigh functional suggest the following Algorithm 4. It was shown in [111, 113] that the safeguarded iteration has the following convergence properties.


Algorithm 4 Safeguarded iteration

1: Start with an approximation σ_1 to the m-th eigenvalue of (1)
2: for k = 1, 2, . . . until convergence do
3: determine an eigenvector x_k corresponding to the m-th largest eigenvalue of F(σ_k)
4: solve x_k^H F(σ_{k+1}) x_k = 0 for σ_{k+1}
5: end for

(i) If λ_1 := inf_{x ∈ D} ρ(x) ∈ J and x_1 ∈ D, then the iteration converges globally to λ_1.

(ii) If λ_m ∈ J is an m-th eigenvalue of (1) which is simple, then the iteration converges locally and quadratically to λ_m.

(iii) Let F(λ) be twice continuously differentiable, and assume that F′(λ) is positive definite for λ ∈ J. If, in step 3 of Algorithm 4, x_k is chosen to be an eigenvector corresponding to the m-th largest eigenvalue of the generalized eigenproblem F(σ_k) x = μ F′(σ_k) x, then the convergence is even cubic.
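A minimal Python sketch of the safeguarded iteration (Algorithm 4), under the assumptions above (Hermitian F(λ) on a real interval J); the bracket for the scalar root solve in step 4, on which x^H F(λ)x must change sign, is an additional assumption of this illustration.

import numpy as np
from scipy.linalg import eigh
from scipy.optimize import brentq

def safeguarded_iteration(F, m, sigma, bracket, tol=1e-10, maxit=50):
    # Algorithm 4: iterate toward the m-th eigenvalue of the Hermitian family F.
    for _ in range(maxit):
        w, V = eigh(F(sigma))                # eigenvalues of F(sigma), ascending
        x = V[:, -m]                         # eigenvector of the m-th largest eigenvalue
        f = lambda lam: np.real(x.conj() @ (F(lam) @ x))
        sigma_new = brentq(f, *bracket)      # step 4: root of x^H F(lam) x = 0 in J
        if abs(sigma_new - sigma) < tol:
            return sigma_new, x
        sigma = sigma_new
    return sigma, x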

The methods that have been discussed in this subsection are appropriate for problems for which matrix factorizations of F(σ) can be carried out and the storage for the factors is available. It is an obvious idea to use iterative methods for the solution of the occurring linear systems, but the convergence properties of the methods have to be carefully analyzed in this case. This topic is also discussed in the following section, where methods that can be applied to large sparse problems are discussed.

3.2 Iterative projection methods for linear problems

For sparse linear eigenvalue problems Ax = λx, iterative projection methods like the Lanczos, Arnoldi, rational Krylov or Jacobi–Davidson method are well established. The basic idea of all these methods is the construction of a search space (typically a Krylov subspace), followed by projection of the problem into this subspace. This leads to a small dense problem that is then handled by a dense solver, and the eigenvalues of the projected problem are used as approximations to the eigenvalues of the large sparse problem. The main feature of all these methods is that matrix factorizations are avoided as much as possible (except in the context of preconditioning), and the generation of the search space is usually done via an iterative procedure based on matrix-vector products that can be cheaply obtained.

Two basic types of iterative projection methods are in use: The first type consists of methods which expand the subspaces independently of the eigenpair of the projected problem and which use Krylov subspaces of A or (A − σI)^{−1} for some shift σ. These methods include the Arnoldi, Lanczos or rational Krylov method. The second type of methods aim at a particular eigenpair and choose the expansion q such that it has a high approximation potential for a desired eigenvalue/eigenvector or invariant subspace. An example for this approach is the Jacobi–Davidson method.

For the Arnoldi method, and similarly for other Krylov subspace methods, the search space is a Krylov space

K_k(A, v_1) = span{v_1, A v_1, A^2 v_1, . . . , A^{k−1} v_1},


where v_1 is an appropriately chosen initial vector. The Arnoldi method produces an orthogonal basis V_k of K_k(A, v_1) such that the projected matrix H_k (associated with K_k) is upper Hessenberg and satisfies

A V_k = V_k H_k + f_k e_k^T,

where e_k ∈ R^k is the k-th unit vector and f_k is orthogonal to the columns of V_k, i.e. V_k^H f_k = 0. The orthogonality of V_k implies that V_k^H A V_k = H_k is the orthogonal projection of A to K_k(A, v_1).

If (y, θ) is an eigenpair of the projected problem, and x = V_k y is the corresponding approximation to an eigenvector of Ax = λx (which is called a Ritz vector corresponding to the Ritz value θ), then the residual satisfies

r := Ax − θx = A V_k y − θ V_k y = V_k H_k y − θ V_k y + f_k e_k^T y = (e_k^T y) f_k.

Hence, one obtains an error indicator ‖r‖ = |e_k^T y| · ‖f_k‖ for the eigenpair approximation (x, θ) without actually computing the Ritz vector x. If A is Hermitian, then this is even an error bound.

The Arnoldi method, together with its variants, is today a standard solver for sparse linear eigenproblems. It is implemented in the package ARPACK [53] and the MATLAB command eigs, see also [6]. The method typically converges to the extreme eigenvalues first. If one is interested in eigenvalues in the interior of the spectrum, or in eigenvalues close to a given focal point τ, then one can apply the method in a shift-and-invert fashion, i.e. to the matrix (A − τI)^{−1} or an approximation of it. In this case one has to determine a factorization of A − τI, which, however, may be prohibitive for very large problems.

A way out of this dilemma is the Jacobi–Davidson method of [94]. Let (x, θ) be an approximation to an eigenpair obtained by a projection method with subspace V. We assume that ‖x‖ = 1, θ = x^H A x, and r := Ax − θx ⊥ x. Then the most desirable orthogonal correction z solves the equation

A(x + z) = λ(x + z), z ⊥ x. (17)

As z ⊥ x, the operator A can be restricted to the subspace orthogonal to x, yielding Ã := (I − xx^H) A (I − xx^H), and from θ = x^H A x it follows that

A = Ã + A xx^H + xx^H A − θ xx^H.

Hence, from (17) and Ãx = 0 we obtain that

(Ã − λI) z = −r + (λ − θ − x^H A z) x.

Since both the left hand side and r are orthogonal to x, the factor λ − θ − x^H A z must vanish, and therefore the correction z has to satisfy (Ã − λI) z = −r. Since λ is unknown, it is replaced by the Ritz approximation θ, and one ends up with the correction equation

(I − xx^H)(A − θI)(I − xx^H) z = −r.

It can be shown that the expanded space [V, z] for the Jacobi–Davidson method contains the direction u = (A − θI)^{−1} x, which is obtained by one step of inverse iteration [93]. One can therefore expect similar approximation properties, i.e. quadratic or even cubic convergence if the problem is Hermitian. Obviously, the Jacobi–Davidson method aims at a particular eigenvalue (close to the shift θ). If one is interested in more than one eigenvalue, one typically uses a deflation based on a partial Schur decomposition of the matrix A, see [23].
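For the linear case, the correction equation can be solved approximately in a matrix-free fashion. The following hypothetical Python sketch wraps the projected operator in a LinearOperator and runs a few GMRES steps; as discussed below, an inexact solve is usually sufficient.

import numpy as np
from scipy.sparse.linalg import LinearOperator, gmres

def jd_correction(A, x, theta, maxiter=20):
    # Approximately solve (I - xx^H)(A - theta I)(I - xx^H) z = -r, r = Ax - theta x.
    n = x.shape[0]
    r = A @ x - theta * x
    def op(z):
        z = z - x * (x.conj() @ z)           # project onto the complement of x
        w = A @ z - theta * z                # apply A - theta*I
        return w - x * (x.conj() @ w)        # project the result as well
    P = LinearOperator((n, n), matvec=op, dtype=complex)
    z, _ = gmres(P, -r, maxiter=maxiter)     # a few inexact steps suffice in practice
    return z - x * (x.conj() @ z)            # enforce z orthogonal to x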


Both the shift-and-invert Arnoldi method and the Jacobi-Davidson method have to solve a large linear system. However, while in the Arnoldi method this system in general needs to be solved very accurately to get fast convergence, numerical experiments demonstrate that in the Jacobi–Davidson method it suffices to solve the system approximately to maintain fast convergence. Typically only a small number of steps of a preconditioned Krylov subspace method are sufficient to obtain a good expansion z for the search space V. Implementation details of the Jacobi–Davidson method for various types of linear eigenvalue problems can be found in [6]. Implementations in FORTRAN and MATLAB can be downloaded from http://www.math.ruu.nl/people/sleijpen.

Many, but not all, of the ideas in these projection methods can be generalized to nonlinear eigenproblems.

3.3 Structure preserving iterative projection methods for linearized problems

For polynomial and rational problems, the easiest approach is to use linearization and to apply the projection method to the linearization. As we have seen in Section 2, one should use structure preserving linearizations, make use of the symmetries in the generation of the search space, and also make sure that the projection leads to a small problem of the same structure.

Recently several structure-preserving Krylov subspace methods have been developed [3, 10, 12, 63, 64, 114]. Each of these requires that a structured generalized eigenvalue problem, such as the linearizations obtained in Examples 2.3 or 2.4, is reduced further to a Hamiltonian or symplectic matrix. (A 2n × 2n matrix H is called Hamiltonian if (HJ)^T = HJ and symplectic if HJH^T = J, where J = [0, I_n; −I_n, 0].) To see how this can be done very efficiently, consider the even pencil of Example 2.3. Since

[ 0, M; −M, G ] = [ M, 0; 0, I ] [ I, 0; G/2, I ] [ 0, I; −I, 0 ] [ I, −G/2; 0, I ] [ M, 0; 0, I ],

the pencil of Example 2.3 is equivalent to the Hamiltonian matrix

H = J [ I, 0; −G/2, I ] [ M^{−1}, 0; 0, K ] [ I, G/2; 0, I ],

as is shown in [63]. There is no need to assemble the matrix H, nor is there any need to compute M^{−1} explicitly. One just needs to compute and use the Cholesky decomposition of M, and one should also note that

H^{−1} = [ I, −G/2; 0, I ] [ M, 0; 0, K^{−1} ] [ I, 0; G/2, I ] J^T

is no less accessible than H itself. This is important, since if one wants the eigenvalues of H that are closest to the origin, one works with H^{−1}. If one wants the eigenvalues near a focal point τ, one might prefer to work with shift-and-invert, i.e. with (H − τI)^{−1}. However, the shift destroys the Hamiltonian structure, so one needs ways to effect shifts while preserving the structure. One simple remedy is to work with the matrix

(H − τI)^{−1} (H + τI)^{−1},

which is not Hamiltonian but skew-Hamiltonian (i.e. it satisfies (HJ)^T = −HJ). If the problem is real and τ is neither real nor purely imaginary, one works with the skew-Hamiltonian matrix

(H − τI)^{−1} (H − τ̄I)^{−1} (H + τI)^{−1} (H + τ̄I)^{−1}

to stay within the real number system. Another possibility is to work with the Cayley transform (H − τI)^{−1}(H + τI), which is symplectic.

A Krylov subspace method will preserve Hamiltonian, skew-Hamiltonian, or symplectic structure if it generates vectors that span isotropic subspaces. The skew-Hamiltonian form is easiest to preserve, since Krylov subspaces generated by skew-Hamiltonian operators are automatically isotropic [63]. Consequently, the standard Arnoldi method preserves the structure automatically in theory. In practice, however, the isotropy is steadily eroded by roundoff errors, so it must be enforced by an additional orthogonalization step. In the context of the Arnoldi process, this means that the vector q_{j+1} generated in step j must be made orthogonal to Jq_1, . . . , Jq_j as well as to q_1, . . . , q_j. Furthermore, as in all practical Krylov subspace methods, in order to avoid that the storage capacity is exceeded, repeated implicit restarts in the spirit of Sorensen's implicitly restarted Arnoldi (IRA) process [53, 97] are needed. A method that includes all these features is the skew-Hamiltonian implicitly restarted Arnoldi algorithm (SHIRA) of [63]. We will present some numerical results obtained with this method in Section 4.
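To illustrate the remark that neither H nor M^{−1} needs to be formed, the following hypothetical Python sketch applies H = J [I, 0; −G/2, I] [M^{−1}, 0; 0, K] [I, G/2; 0, I] to a vector using only a Cholesky factorization of M; a matrix-free operator of this kind is what a Krylov method such as SHIRA actually requires. It is an illustration of the idea, not the implementation of [63].

import numpy as np
from scipy.linalg import cho_factor, cho_solve

def make_H_matvec(M, G, K):
    # Returns z -> H z for H = J [I,0; -G/2,I] [M^{-1},0; 0,K] [I,G/2; 0,I],
    # using a Cholesky factorization of the positive definite mass matrix M.
    n = M.shape[0]
    cM = cho_factor(M)
    def H_matvec(z):
        z1, z2 = z[:n], z[n:]
        y1, y2 = z1 + 0.5 * (G @ z2), z2          # [I, G/2; 0, I] z
        w1, w2 = cho_solve(cM, y1), K @ y2        # [M^{-1}, 0; 0, K] y
        u1, u2 = w1, w2 - 0.5 * (G @ w1)          # [I, 0; -G/2, I] w
        return np.concatenate([u2, -u1])          # J u with J = [0, I; -I, 0]
    return H_matvec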

3.4 Iterative projection methods for nonlinear problems

In this section we discuss projection methods that work directly for general nonlinear eigenproblems. We have already pointed out that in this case the search spaces have to be expanded by directions that have a high approximation potential for the next desired eigenvector. Assume that V is an orthonormal basis of the current search space. Let (θ, y) be a solution of the projected problem V^H F(λ) V y = 0, and let x = V y be the corresponding Ritz vector. Then there are two candidates for expanding V suggested by the methods in Section 3.1: v̂ = x − F(σ)^{−1} F(θ) x, motivated by residual inverse iteration, and ṽ = F(θ)^{−1} F′(θ) x, corresponding to inverse iteration.

The following two subsections take advantage of these directions. Expanding a given search space V by v̂ results in Arnoldi type methods, treated in Subsection 3.4.1. Expanding it by the direction of inverse iteration ṽ requires the solution of a large linear system in every iteration step. As in the case of linear eigenproblems this can be avoided by a Jacobi–Davidson approach, considered in Subsection 3.4.2.

3.4.1 Arnoldi type methods

We consider the expansion of V by v̂ = x − F(σ)^{−1} F(θ) x, where σ is a fixed shift (not too far away from the targeted eigenvalue). In Arnoldi-like methods the new search direction is orthonormalized against the previous ansatz vectors. Since the Ritz vector x is contained in the span of V, one may choose the new direction v = F(σ)^{−1} F(θ) x as well. For the linear problem F(λ) = A − λB this is exactly the Cayley transformation with pole σ and zero θ, and since

(A − σB)^{−1} (A − θB) = I + (θ − σ) (A − σB)^{−1} B

and Krylov spaces are shift-invariant, the resulting projection method expanding V by v is nothing else but the shift-and-invert Arnoldi method. If it is too expensive to solve the linear system F(σ) v = F(θ) x for v, one may choose as new direction v = M F(θ) x with M ≈ F(σ)^{−1}, which corresponds in the linear case to an inexact Cayley transform or a preconditioned Arnoldi method. These variants have been introduced in [60] for quadratic eigenvalue problems and in [105, 109] for general nonlinear eigenproblems; they are often called the nonlinear Arnoldi method, despite the fact that, differently from the linear case, no Krylov space is determined in the course of the algorithm.

Since the speed of convergence depends crucially on |σ − λ|, it is advisable to change the shift, or more generally the preconditioner M, in the course of the algorithm if the convergence to the current eigenvalue becomes too slow. If one does this, then the resulting method generalizes the rational Krylov method that was developed for linear problems in [83]. Thus the name nonlinear rational Krylov method would be appropriate as well, but this notation was already introduced in [84, 85] for a different method, which we discuss below. A template for the nonlinear Arnoldi approach is given by the following algorithm.

Algorithm 5 Nonlinear Arnoldi method
1: start with an initial shift σ and an initial basis V, V^H V = I
2: determine a preconditioner M ≈ F(σ)^{−1}
3: for m = 1, 2, . . . , number of wanted eigenvalues do
4: compute an appropriate eigenvalue μ and corresponding eigenvector y of the projected problem F_V(μ) y := V^H F(μ) V y = 0
5: determine the Ritz vector u = V y and the residual r = F(μ) u
6: if ‖r‖/‖u‖ < ε (for a given tolerance ε) then
7: accept the approximate eigenpair λ_m = μ, x_m = u
8: if m == number of desired eigenvalues then STOP end if
9: choose a new shift σ and determine a preconditioner M ≈ F(σ)^{−1} if indicated
10: restart if necessary
11: choose approximations μ and u to the next eigenvalue and eigenvector
12: determine the residual r = F(μ) u
13: end if
14: v = M r
15: v = v − V V^H v, ṽ = v/‖v‖, V = [V, ṽ]
16: reorthogonalize if necessary
17: end for

Remark 3.1 There are many details that have to be considered when implementing a nonlinear Arnoldi method according to Algorithm 5. A detailed discussion is given in [106, 111], but the following general comments should be observed.

(i) In Step 1 of Algorithm 5 any pre-information, such as known approximate eigenvectors of problem (1) or eigenvectors of contiguous problems, can and should be used. If no information on eigenvectors is at hand, and one is interested in eigenvalues close to a focal point τ ∈ D, then one can choose an initial vector at random, execute a few Arnoldi steps for the linear eigenproblem F(τ)u = θu or F(τ)u = θF′(τ)u, and choose V by orthogonalizing eigenvectors corresponding to eigenvalues that are small in modulus. Starting with a random vector without this preprocessing usually does not lead to convergence.

Copyright line will be provided by the publisher 20 V. Mehrmann and H. Voss: Nonlinear Eigenvalue Problems

(ii) The preconditioner in Step 2 should be chosen on the basis of the underlying problem. If none is available, then full or incomplete sparse LU decompositions of F(σ), see [86], present an alternative.

(iii) A crucial point in iterative methods for general nonlinear eigenvalue problems, when approximating more than one eigenvalue, is to inhibit the method from converging to the same eigenvalue repeatedly. For linear eigenvalue problems this is easy to do by using Schur forms or generalized Schur forms for the projected problem and then locking or purging certain eigenvalues. For nonlinear problems, however, such Schur forms do not exist, and this presents one of the most difficult tasks in achieving good convergence. See [15, 23, 57, 60, 105] for different approaches in this direction.

(iv) Since the residual inverse iteration with a fixed shift σ converges linearly with convergence rate O(|σ − λ|), it is reasonable to update the preconditioner if the convergence, measured by the quotient of the last two residual norms before convergence, has become too slow.

For several other recent variations and generalizations of the Arnoldi method for quadratic or general polynomial eigenvalue problems, see [7, 25, 36, 60, 100].
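To fix the ideas, here is a heavily simplified Python sketch of Algorithm 5 for a single eigenvalue. The dense solver for the projected problem (solve_projected, e.g. one of the methods of Section 3.1) is user-supplied, the preconditioner is a fixed LU factorization of F(σ), and restarts, deflation and shift updates are omitted.

import numpy as np
from scipy.linalg import lu_factor, lu_solve

def nonlinear_arnoldi(F, solve_projected, sigma, v0, tol=1e-8, maxit=50):
    LU = lu_factor(F(sigma))                        # preconditioner M ~ F(sigma)^{-1}
    V = (v0 / np.linalg.norm(v0)).reshape(-1, 1)
    for _ in range(maxit):
        # solve the projected problem V^H F(mu) V y = 0 (user-supplied dense solver)
        mu, y = solve_projected(lambda lam: V.conj().T @ (F(lam) @ V))
        u = V @ y                                   # Ritz vector
        r = F(mu) @ u                               # residual
        if np.linalg.norm(r) < tol * np.linalg.norm(u):
            return mu, u
        v = lu_solve(LU, r)                         # expansion direction v = M r
        v = v - V @ (V.conj().T @ v)                # orthogonalize against V
        V = np.hstack([V, (v / np.linalg.norm(v)).reshape(-1, 1)])
    return mu, u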

3.4.2 Jacobi–Davidson type methods

Arnoldi type methods are quite efficient for solving sparse nonlinear eigenproblems if an accurate preconditioner M ≈ F(σ)^{−1} is at hand. If this is not the case, then the convergence deteriorates considerably. In this situation Jacobi–Davidson type methods offer an appealing alternative.

A natural generalization of the Jacobi–Davidson method for linear eigenproblems, which was already suggested in [92, 95] for polynomial eigenvalue problems and which was studied in [15, 110] for general nonlinear eigenproblems, is the following one: Suppose that the columns of V form an orthonormal basis of the current search space, and let (x, θ) be a Ritz pair of (1) with respect to V, i.e. V^H F(θ) V y = 0, x = V y. Then we consider the correction equation

(I − (p x^H)/(x^H p)) F(θ) (I − (x x^H)/(x^H x)) z = −r,   z ⊥ x,   (18)

where p := F′(θ) x and r := F(θ) x. Equation (18) can be rewritten as F(θ) z − α p = −r, where α has to be chosen such that z ⊥ x. Solving for z we obtain

z = −x + α F(θ)^{−1} p = −x + α F(θ)^{−1} F′(θ) x,

and x = V y yields that z̃ := F(θ)^{−1} F′(θ) x ∈ span[V, z]. Hence, as in the linear case, the new search space span[V, z] contains the vector obtained by one step of inverse iteration with shift θ and initial vector x, and again we may expect quadratic or even cubic convergence of the resulting iterative projection method if the correction equation (18) is solved exactly.

As in the linear case, the correction equation does not have to be solved exactly to maintain fast convergence, but usually a few steps of a Krylov solver with an appropriate preconditioner

suffice to obtain a good expansion direction of the search space. The approximate solution of the correction equation has to be orthogonal to x. If (18) is solved by a Krylov solver and the initial approximation is orthogonal to x, then all iterates are orthogonal to x as well.

In the correction equation (18), the operator F(θ) is restricted to a mapping from the subspace x^⊥ to (F′(θ)x)^⊥. Hence, if K ≈ F(θ) is a preconditioner of F(θ), then a preconditioner for an iterative solver of (18) should be modified correspondingly to

K̃ := (I − (p x^H)/(x^H p)) K (I − (x x^H)/(x^H x)).

With left-preconditioning, equation (18) becomes

K̃^{−1} (I − (p x^H)/(x^H p)) F(θ) (I − (x x^H)/(x^H x)) z = −K̃^{−1} r,   z ⊥ x.   (19)

It was already demonstrated in [94] for linear problems that taking the projectors in the preconditioner into account, i.e. using K̃ instead of K in a preconditioned Krylov solver, raises the cost only slightly: in every step one has to solve one linear system Kw = y, and initializing the solver requires only one additional solve. The transfer to (19) is straightforward [15].

A template for the Jacobi–Davidson method for the nonlinear eigenvalue problem (1) is very similar to the one in Algorithm 5. We only have to replace Step 14 by:

Find an approximate solution of the correction equation

(I − (F′(μ) u u^H)/(u^H F′(μ) u)) F(μ) (I − (u u^H)/(u^H u)) t = −r   (20)

by a preconditioned Krylov solver.
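The cost statement after (19) — one solve with K per step plus a single additional solve — can be made concrete. For w ⊥ x, solving K̃ w = y amounts to K w = y + βp for a scalar β, i.e. w = K^{−1}y − α K^{−1}p with α = (x^H K^{−1}y)/(x^H K^{−1}p) chosen so that x^H w = 0. A hypothetical Python sketch:

import numpy as np
from scipy.linalg import lu_factor, lu_solve

def make_projected_precond(K, p, x):
    # Returns y -> K~^{-1} y for K~ = (I - p x^H / x^H p) K (I - x x^H / x^H x).
    LU = lu_factor(K)
    a = lu_solve(LU, p)                    # the one additional solve, done once
    def apply(y):
        b = lu_solve(LU, y)                # one solve with K per application
        alpha = (x.conj() @ b) / (x.conj() @ a)
        return b - alpha * a               # satisfies x^H (result) = 0
    return apply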

Note that the update rule for the preconditioner used in the Arnoldi type method does not make sense for the Jacobi–Davidson algorithm because of the unpredictable convergence behavior of Krylov solvers. One option is to update the shift if the Krylov solver has not converged within a given number of iterations.

In [101] the efficiency of the Jacobi–Davidson method is demonstrated for large quadratic eigenproblems modeling the propagation of sound waves in a room with sound-absorbing walls. In [39] a gyroscopic eigenproblem (λ^2 M + λ(G + εD) + K)x = 0 is considered, where K = K^T, M = M^T, G = −G^T, and εD represents the damping of the system. Since the damping is assumed to be small, it is suggested to determine eigenpairs (λ_j, x_j) of the conservative gyroscopic problem (λ^2 M + λG + K)x = 0 in the wanted region by the structure preserving method SHIRA from Section 3.3, and to improve these approximations by the Jacobi–Davidson method for the original quadratic problem.

3.4.3 Rational Krylov method

A different approach was developed in [81, 82, 84, 85], which generalizes the rational Krylov method for linear eigenproblems [83] to sparse nonlinear eigenvalue problems by nesting the linearization of problem (1) (by regula falsi) and the solution of the resulting linear eigenproblem (by Arnoldi's method). Hence, in this approach the two numerical subtasks, i.e. reducing

the large dimension to a much smaller one and solving a nonlinear eigenproblem, are attacked simultaneously. This method was applied in [31, 32] to the rational eigenvalue problem (9) governing damped vibrations of a structure. Linearizing the nonlinear family F(λ) by Lagrange interpolation between two points μ and σ one gets

F(λ) = ((λ − μ)/(σ − μ)) F(σ) + ((λ − σ)/(μ − σ)) F(μ) + higher order terms.

Keeping σ fixed for several steps, iterating on μ, neglecting the remainder in the Lagrange interpolation, and multiplying by F(σ)^{−1} from the left, one obtains

F(σ)^{−1} F(λ_{j−1}) w = θ w   with   θ = (λ_j − λ_{j−1})/(λ_j − σ),   (21)

predicting a singularity at

λ_j = λ_{j−1} + (θ/(1 − θ)) (λ_{j−1} − σ).

For large and sparse matrices the linearization (21) is combined with a linear Arnoldi process. After j steps, approximations λ_1, . . . , λ_j to an eigenvalue, orthonormal vectors v_1, . . . , v_j, and an upper Hessenberg matrix H_{j,j−1} ∈ C^{j,j−1} are generated, such that the Arnoldi recursion

F(σ)^{−1} F(λ_{j−1}) V_{j−1} = V_j H_{j,j−1}

is fulfilled (at least approximately), where V_j = [v_1, . . . , v_j]. Updating the matrix H_{j,j−1} according to the linear theory yields

H̃_{j+1,j} = [ H_{j,j−1}, k_j; 0, ‖r_⊥‖ ],

where k_j = V_j^H r_j, r_j = F(σ)^{−1} F(λ_j) v_j, and r_⊥ = r_j − V_j V_j^H r_j, which due to the nonlinearity of F(·) violates the next Arnoldi relation

F(σ)^{−1} F(λ_j) V_j = V_{j+1} H̃_{j+1,j},   v_{j+1} = r_⊥/‖r_⊥‖.

To satisfy it at least approximately, one takes advantage of the Lagrange interpolation

G(λ_j) ≈ ((λ_j − σ)/(λ_{j−1} − σ)) G(λ_{j−1}) − ((λ_j − λ_{j−1})/(λ_{j−1} − σ)) I = (1/(1 − θ)) G(λ_{j−1}) − (θ/(1 − θ)) I,

where G(λ) := F(σ)^{−1} F(λ), and updates H according to

H_{j+1,j} = [ (1/(1 − θ)) H_{j,j−1} − (θ/(1 − θ)) I_{j,j−1}, k_j; 0, ‖r_⊥‖ ].

This gives a first version of the rational Krylov method, which unfortunately is not very efficient. In [84] it was thus suggested to modify λ and H in an inner iteration until the residual


Algorithm 6 Rational Krylov method

1: start with an initial vector V = [v_1] with ‖v_1‖ = 1, initial λ and σ; set j = 1
2: set h_j = 0_j; s = e_j; x = v_j
3: compute r = F(σ)^{−1} F(λ) x and k_j = V_j^H r
4: while ‖k_j‖ > ResTol do
5: orthogonalize r = r − V_j k_j
6: set h_j = h_j + k_j s_j^{−1}
7: θ = min eig H_{j,j} with corresponding eigenvector s
8: x = V_j s
9: update λ = λ + (θ/(1 − θ))(λ − σ)
10: update H_{j,j} = (1/(1 − θ)) H_{j,j} − (θ/(1 − θ)) I
11: compute r = F(σ)^{−1} F(λ) x and k_j = V_j^H r
12: end while
13: compute h_{j+1,j} = ‖r‖
14: if |h_{j+1,j} s_j| > EigTol then
15: v_{j+1} = r/h_{j+1,j}; j = j + 1; GOTO 2
16: end if
17: accept eigenvalue λ_i = λ and eigenvector x_i = x
18: if more eigenvalues are wanted, choose next θ and s, and GOTO 8

r = F(σ)^{−1} F(λ) V_j s is enforced to be orthogonal to V_j, and to expand the search space only after the inner iteration has converged, which yields Algorithm 6. The inner iteration is nothing else but a solver of the projected problem

V_j^H F(σ)^{−1} F(λ) V_j s = 0.   (22)

Hence, although originally motivated in a different way, the rational Krylov method is an iterative projection method, where the nonlinear eigenproblem F(σ)^{−1} F(λ) x = 0 is projected to a search space V, and V is expanded by the orthogonal complement (with respect to V) of the residual r = F(σ)^{−1} F(λ) V s of the Ritz pair. One ends up with Algorithm 7.

Algorithm 7 Rational Krylov method, an iterative projection method

1: start with initial vector V = [v1] with ‖v1‖ = 1, initial λ and σ
2: for j = 1, 2, . . . until convergence do
3:    solve projected eigenproblem V^H F(σ)^{−1}F(λ)V s = 0 for (λ, s) by inner iteration
4:    compute Ritz vector x = V s and residual r = F(σ)^{−1}F(λ)x
5:    orthogonalize r = r − V V^H r
6:    expand search space V = [V, r/‖r‖]
7: end for
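A compact realization of Algorithm 7 might look as follows; this is a sketch under our own naming, where F(µ) is assumed to return a sparse matrix and inner_solver stands for any solver of the projected problem (22). Note that F(σ) is factored only once, and F(λ) enters solely through matrix–vector products and sparse triangular solves.

import numpy as np
from scipy.sparse.linalg import splu

def rational_krylov_projection(F, v1, lam0, sigma, inner_solver,
                               maxit=50, tol=1e-8):
    # Algorithm 7: iterative projection for G(mu) = F(sigma)^{-1} F(mu);
    # inner_solver(G, lam) returns (lam, s) solving the projected problem (22).
    lu = splu(F(sigma).tocsc())                 # factor F(sigma) once
    V = (v1 / np.linalg.norm(v1)).reshape(-1, 1)
    lam = lam0
    for _ in range(maxit):
        G = lambda mu: V.conj().T @ lu.solve(F(mu) @ V)   # projected family
        lam, s = inner_solver(G, lam)
        x = V @ s                               # Ritz vector
        r = lu.solve(F(lam) @ x)                # residual F(sigma)^{-1} F(lam) x
        if np.linalg.norm(r) < tol:
            return lam, x
        r = r - V @ (V.conj().T @ r)            # orthogonalize against V
        V = np.hstack([V, (r / np.linalg.norm(r)).reshape(-1, 1)])
    return lam, x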

The inner iteration in Step 3 of Algorithm 7 can be replaced by any dense solver of Section 3.1, and numerical examples in [43] demonstrate that the method can be accelerated considerably in this way. A disadvantage of the rational Krylov method is that symmetry properties which the original problem may have are destroyed, since the projected problem (22) is considered instead of V_j^H F(λ)V_j s = 0 as in the Arnoldi method or the Jacobi–Davidson algorithm.


On the other hand, the solvers in Section 3.1 need the explicit form of the projected problem, whereas the inner iteration in Algorithm 6 only needs a procedure that returns the vector F(σ)^{−1}F(λ)x for a given x.

Iterative projection methods for nonlinear eigenvalue problems have been studied extensively in recent years, but they have not yet reached the maturity of the methods for linear problems. More research is needed to improve the current methods, to analyze their stability and convergence behavior, in particular in finite precision arithmetic, and to produce implementations that can be used in a simple and robust way by non-experts.

4 Numerical examples

To demonstrate the iterative projection methods from the previous sections, we give several numerical examples that have already been mentioned in the introduction.

4.1 Vibrations of a fluid–solid structure

Free vibrations of a tube bundle immersed in a slightly compressible fluid are governed, under certain conditions, by a boundary value problem with nonlocal boundary conditions depending rationally on the eigenparameter, see [17, 18, 73]. Discretizing by finite elements one obtains the rational matrix eigenvalue problem

\[ F(\lambda)x := -Kx + \lambda Mx + \sum_{j=1}^{k} \frac{\lambda}{\sigma_j - \lambda}\, C_j x = 0, \]

where K, M, and C_j are symmetric matrices, K and C_j are positive semidefinite, M is positive definite, and 0 =: σ0 < σ1 < · · · < σk < σk+1 := ∞, where σ1, . . . , σk are given positive numbers (the poles). Then, in each of the intervals (σj, σj+1), j = 0, . . . , k, this problem satisfies the conditions of the min-max characterization (16).

We consider a problem of dimension n = 22654 with one pole σ1 = 1 which has 11 eigenvalues λ1 ≤ · · · ≤ λ11 in the interval J1 = (0, 1), see [59], and a large number of eigenvalues greater than 1, 10 of which are contained in the interval (1, 4). We determined approximations to the eigenvalues in [0, 1) by the Arnoldi method and by the Jacobi–Davidson method, where in both cases the projected nonlinear eigenproblems were solved by safeguarded iteration, and by the rational Krylov method, where the projected rational eigenproblems were solved by linearizing the equivalent quadratic eigenproblem (1 − λ)V^T F(σ)^{−1}F(λ)V y = 0. All three methods were able to find all 11 eigenvalues. The original rational Krylov method (Algorithm 6) as implemented in [42] turned out to depend very sensitively on the initial pole σ and the initial approximation to an eigenvalue, and was able to find at most 8 eigenvalues in the interval [0, 1).
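To make the linearization of the projected problem concrete, here is a minimal sketch for the one-pole case σ1 = 1; as a simplification of our own we work with the projected matrices Kp = V^T KV, Mp = V^T MV, Cp = V^T C1 V instead of the F(σ)^{−1}-preconditioned variant quoted above. Multiplying the projected rational problem by (1 − λ) yields the quadratic pencil −λ²Mp + λ(Kp + Mp + Cp) − Kp, which is linearized by a companion form of double size:

import numpy as np
from scipy.linalg import eig

def solve_projected_rational(Kp, Mp, Cp):
    # Multiply (-Kp + lam*Mp + lam/(1-lam)*Cp) y = 0 by (1 - lam) to obtain
    # the quadratic pencil lam^2*A2 + lam*A1 + A0 with the coefficients below.
    k = Kp.shape[0]
    A2, A1, A0 = -Mp, Kp + Mp + Cp, -Kp
    # first companion linearization: A z = lam * B z with z = [y; lam*y]
    A = np.block([[np.zeros((k, k)), np.eye(k)],
                  [-A0, -A1]])
    B = np.block([[np.eye(k), np.zeros((k, k))],
                  [np.zeros((k, k)), A2]])
    lam, Z = eig(A, B)
    keep = np.abs(lam - 1.0) > 1e-8   # lam = 1 (the pole) is spurious here
    return lam[keep], Z[:k, keep]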

Enumerating the eigenvalues according to Section 3.1, the smallest eigenvalue in the interval (1, 4) turns out to be a 6th eigenvalue. Starting the Arnoldi method and the Jacobi–Davidson method with the invariant subspace of the linear eigenproblem, see [106],

\[ \Big( K + \frac{\lambda}{\lambda - 1}\, C_1 \Big) x = \mu M x, \qquad \lambda = 1 + \varepsilon, \quad \varepsilon > 0, \]

corresponding to the 6 largest eigenvalues, both methods were able to find all eigenvalues in the interval (1, 4). The rational Krylov method destroys the symmetry of the problem, and the enumeration of Section 3.1 does not apply; none of the implementations of the rational Krylov method was able to find more than one or two eigenvalues in the interval (1, 4).

The numerical experiments were run under MATLAB 6.5 on an Intel Centrino M processor with 1.7 GHz and 1 GB RAM. Table 4.1 summarizes the properties of the iterative projection methods under consideration, where Iter. is the total number of iteration steps, LU fact. the number of LU factorizations, CPU the CPU time in seconds, and nlin.sol. the CPU time required for solving the projected eigenvalue problems. In each case only a very small share of the total CPU time was needed for solving the projected problems.

Method             Iter.   LU fact.   CPU [s]   nlin.sol. [s]
Arnoldi              34       2         14.93      0.13
Jacobi–Davidson      37       3        112.84      0.15
rational Krylov      40       2         70.80      0.22
interval (1, 4):
Arnoldi              36       2         17.35      0.17
Jacobi–Davidson      37       5        125.87      0.23

Table 4.1 Iterations and CPU times for the fluid–solid structure problem

4.2 Damped vibrations of a structure

As a second example we consider the free vibrations of a finite element model as in (9) of dimension 10704 for a solid, using a viscoelastic constitutive relation to describe the behavior of the material. For symmetry reasons we determined only eigenvalues with negative imaginary part, and we computed 30 of them, one after another, with decreasing imaginary part. The nonlinear projected eigenproblems (after multiplication by 1 + bω) were solved by linearization, and the iteration was terminated when the norm of the residual was less than 10^{−6}.

The Arnoldi method without restarts needed 144 iteration steps and a CPU time of 707.0 seconds to find all 30 eigenvalues with maximal negative imaginary part (i.e. the average number of iteration steps to determine an eigenvalue is less than 5). No update of the preconditioner was necessary. The dominant share of the CPU time, namely 469.9 seconds, was consumed by the solver of the projected nonlinear eigenproblems. Figure 2 on the left displays the development of the time consumption of the entire iteration and the share of the nonlinear eigensolver; x-marks indicate found eigenvalues. It demonstrates the necessity of restarts, since the superlinear time consumption is mainly caused by the eigensolver.

We restarted the Arnoldi process with an orthonormal basis of the space spanned by the already determined eigenvectors whenever the dimension of the search space exceeded 50. The method needed 3 restarts, and again all 30 eigenvalues with maximal negative imaginary part were found, now requiring 139 iterations and 4 updates of the preconditioner. The total CPU time was 199.6 seconds; solving the projected eigenproblems and updating the preconditioners required 25.0 and 30.8 seconds, respectively. The right picture in Figure 2 displays the time consumption for this experiment.
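The restart itself is straightforward; a sketch (with our own naming) of the rule just described:

import numpy as np

def restart_basis(V, X, max_dim=50):
    # Restart rule of this experiment: once the search space V grows beyond
    # max_dim columns, replace it by an orthonormal basis (via QR) of the
    # span of the eigenvectors X computed so far.
    if V.shape[1] <= max_dim:
        return V
    Q, _ = np.linalg.qr(X)
    return Q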



Fig. 2 Arnoldi method without restarts (left) and with restarts (right): total CPU time and time spent in the nonlinear eigensolver (and, on the right, in the LU updates) against the iteration number

Immediately after a restart the speed of convergence is slowed down. On the other hand, this delay yields an update of the preconditioner accelerating the convergence, so that the total number of iteration steps is reduced from 144 to 139.

The Jacobi–Davidson method and the rational Krylov method according to Algorithm 7 are much slower than the Arnoldi method, but they show similar behavior: without restarts a substantial share of the total CPU time is consumed by the solver of the projected nonlinear eigenproblems, and both methods can be accelerated by restarts. Details about these methods are contained in Table 4.2. The rational Krylov method with inner iteration (Algorithm 6) also converges for this problem, although it is very slow: to determine the 30 wanted eigenvalues, 366 iterations were necessary, requiring 2248.5 seconds.

Method                        Iter.   LU fact.   CPU [s]   nlin.sol. [s]
Arnoldi                        144      2          707.0     469.9
Arnoldi, restarted             139      5          199.6      25.0
Jacobi–Davidson                111      9         1050.5     161.2
Jacobi–Davidson, restarted     109     12          914.4      18.9
rational Krylov                147      3         1107.1     465.3
rational Krylov, restarted     147      4          647.8      28.5

Table 4.2 Iterations and CPU times for damped vibrations of a structure

4.3 Singularity exponents

As a numerical example demonstrating the structure preserving methods, we consider the following problem, which originates from [8, 51]; for details see [4]. Consider a specimen in the form of a brick with a crack as illustrated in Figure 3. The (homogeneous) material has Poisson ratio ν = 0.32; the Young modulus does not influence the result and can be set arbitrarily. The stress concentration near the point O where the crack intersects the surface



Fig. 3 Illustration of the crack example

of the brick can be investigated within the framework of linear elasticity, as long as the material is brittle, see e.g. [50, 51, 52]. The stress field is derived from the displacement field, which can be represented by a regular part and several singular terms of the form k r^α u(ϕ, θ), where (r, ϕ, θ) are spherical coordinates centered at the point of interest, k is called the stress intensity factor, and α is the characteristic (singular) exponent with associated mode u. These terms are singular when Re α < 1 and Re α ≠ 0.

Mathematically, the pair (α, u) is an eigenpair of a quadratic operator pencil [44, 51]. A finite element approximation [5] leads to a finite-dimensional quadratic eigenvalue problem, which after the substitution λ = α + 0.5 takes the more convenient form (3). Figure 4 displays the real part of the (approximated) eigenvalues from the strip 0 < Re α < 1 for the whole possible range of the angle ξ between the crack and the surface; the angle that actually appears depends on the applied load. In [4] several structure preserving linearizations and corresponding iterative projection methods (in a FORTRAN 77 implementation) were compared by computing 9 eigenvalues in order to obtain approximations to the 2–3 eigenvalues nearest to the imaginary axis. A comparison of the Hamiltonian implicitly restarted Lanczos method HIRL of [114] (with focal point τ = 0.0) and the skew-Hamiltonian implicitly restarted Arnoldi algorithm (SHIRA) of [63] with focal points τ = 1.0 and τ = 0.0, as well as a simple non-structured method, is given in Figure 5 for the angle ξ = 120°, where the three eigenvalues of interest are well separated. The total CPU time is presented for various discretization parameters N, where N is the size of the coefficient matrices K, G, and M.

The comparison shows that, despite a higher complexity per iteration step, the structure preserving method SHIRA is a competitive algorithm for computing eigenvalues of even or odd quadratic pencils, in particular if the focal point is not close to an eigenvalue. For more details see [4].



Fig. 4 Real part of the interesting eigenvalues against the angle ξ.

Fig. 5 CPU times for the crack example with ξ = 120° and various N: SHIRA with τ = 1.0 and τ = 0.0, HIRL with τ = 0.0, and a simple non-structured method with τ = 0.0 and τ = 1.0

5 Conclusion

We have discussed the numerical solution of large scale nonlinear eigenvalue problems. The development of efficient and reliable numerical methods for these classes of problems remains a major challenge. More research is needed, in particular in the direction of error and condition estimation, as well as in the implementation of appropriate software packages such as those that are available for linear eigenvalue problems.


References

[1] E. Anderson, Z. Bai, C. Bischof, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, S. Ostrouchov, and D. Sorensen. LAPACK Users' Guide. SIAM Publications, Philadelphia, second edition, 1995.
[2] P.M. Anselone and L.B. Rall. The solution of characteristic value-vector problems by Newton's method. Numer. Math., 11:38–45, 1968.
[3] T. Apel, V. Mehrmann, and D. Watkins. Structured eigenvalue methods for the computation of corner singularities in 3D anisotropic elastic structures. Comp. Meth. Appl. Mech. Eng., 191:4459–4473, 2002.
[4] T. Apel, V. Mehrmann, and D. Watkins. Numerical solution of large scale structured polynomial eigenvalue problems. In F. Cucker, R. DeVore, P. Olver, and E. Süli, editors, Foundations of Computational Mathematics, London Mathematical Society Lecture Note Series 312, pages 137–157. Cambridge University Press, Cambridge, 2004.
[5] T. Apel, A.-M. Sändig, and S.I. Solov'ev. Computation of 3D vertex singularities for linear elasticity: Error estimates for a finite element method on graded meshes. Math. Modeling Numer. Anal., 36:1043–1070, 2002.
[6] Z. Bai, J. Demmel, J. Dongarra, A. Ruhe, and H. van der Vorst. Templates for the Solution of Algebraic Eigenvalue Problems. SIAM Publications, Philadelphia, 2000.
[7] Z. Bai and Y. Su. A second-order Arnoldi method for the solution of the quadratic eigenvalue problem. SIAM J. Matrix Anal. Appl., to appear, 2005.
[8] Z. P. Bažant and L.F. Estenssoro. Surface singularity and crack propagation. Internat. J. Solids Structures, 15:405–426, 1979.
[9] P. Benner, R. Byers, V. Mehrmann, and H. Xu. Numerical computation of deflating subspaces of skew-Hamiltonian/Hamiltonian pencils. SIAM J. Matrix Anal. Appl., 24:165–190, 2002.
[10] P. Benner and H. Faßbender. An implicitly restarted symplectic Lanczos method for the Hamiltonian eigenvalue problem. Linear Algebra Appl., 263:75–111, 1997.
[11] P. Benner and H. Faßbender. The symplectic eigenvalue problem, the butterfly form, the SR algorithm, and the Lanczos method. Linear Algebra Appl., 275/276:19–47, 1998.
[12] P. Benner and H. Faßbender. An implicitly restarted symplectic Lanczos method for the symplectic eigenvalue problem. SIAM J. Matrix Anal. Appl., 22:682–713, 2000.
[13] P. Benner, V. Mehrmann, V. Sima, S. Van Huffel, and A. Varga. SLICOT – a subroutine library in systems and control theory. Appl. Comput. Contr. Sign. Circ., 1:499–532, 1999.
[14] P. Benner, E.S. Quintana-Ortí, and G. Quintana-Ortí. Parallel model reduction of large-scale linear descriptor systems via balanced truncation. In High Performance Computing for Computational Science, Proceedings of the 6th International Meeting VECPAR'04, pages 65–78. Valencia, Spain, June 28–30, 2004.
[15] T. Betcke and H. Voss. A Jacobi–Davidson-type projection method for nonlinear eigenvalue problems. Future Generation Computer Systems, 20(3):363–372, 2004.
[16] L.S. Blackford, J. Choi, A. Cleary, E.D. Azevedo, J. Demmel, I. Dhillon, J. Dongarra, G. Henry, A. Petitet, K. Stanley, D. Walker, and R. Whaley. ScaLAPACK Users' Guide. SIAM Publications, Philadelphia, second edition, 1997.
[17] C. Conca, J. Planchard, and M. Vanninathan. Existence and location of eigenvalues for fluid-solid structures. Comput. Meth. Appl. Mech. Eng., 77:253–291, 1989.
[18] C. Conca, J. Planchard, and M. Vanninathan. Fluid and Periodic Structures, volume 24 of Research in Applied Mathematics. Masson, Paris, 1995.
[19] S.H. Crandall. Iterative procedures related to relaxation methods for eigenvalue problems. Proc. Royal Soc. London, 207:416–423, 1951.
[20] R.J. Duffin. A minmax theory for overdamped networks. J. Rat. Mech. Anal., 4:221–233, 1955.
[21] R.J. Duffin. The Rayleigh-Ritz method for dissipative and gyroscopic systems. Quart. Appl. Math., 18:215–221, 1960.


[22] E. Eich-Soellner and C. Führer. Numerical Methods in Multibody Systems. B.G. Teubner Verlag, Stuttgart, 1998.
[23] D.R. Fokkema, G.L.G. Sleijpen, and H.A. van der Vorst. Jacobi-Davidson style QR and QZ algorithms for the partial reduction of matrix pencils. SIAM J. Sci. Comput., 20:94–125, 1998.
[24] G. Freiling, V. Mehrmann, and H. Xu. Existence, uniqueness and parametrization of Lagrangian invariant subspaces. SIAM J. Matrix Anal. Appl., 23:1045–1069, 2002.
[25] R. Freund. Padé-type model reduction of second-order and higher-order linear dynamical systems. Technical report, Dept. of Mathematics, UC Davis, Davis, CA, 2004. http://arxiv.org/abs/math/0410195.
[26] R. W. Freund. Krylov-subspace methods for reduced-order modeling in circuit simulation. J. Comput. Appl. Math., 123:395–421, 2000.
[27] I. Gohberg, P. Lancaster, and L. Rodman. Matrix Polynomials. Academic Press, New York, 1982.
[28] G.H. Golub and C.F. Van Loan. Matrix Computations. Johns Hopkins University Press, Baltimore, third edition, 1996.
[29] K. P. Hadeler. Variationsprinzipien bei nichtlinearen Eigenwertaufgaben. Arch. Rat. Mech. Anal., 30:297–307, 1968.
[30] K. P. Hadeler. Nonlinear eigenvalue problems. In R. Ansorge, L. Collatz, G. Hämmerlin, and W. Törnig, editors, Numerische Behandlung von Differentialgleichungen, ISNM 27, pages 111–129. Birkhäuser, Stuttgart, 1975.
[31] P. Hager. Eigenfrequency Analysis. FE-Adaptivity and a Nonlinear Eigenvalue Problem. PhD thesis, Chalmers University of Technology, Göteborg, 2001.
[32] P. Hager and N.E. Wiberg. The rational Krylov algorithm for nonlinear eigenvalue problems. In B.H.V. Topping, editor, Computational Mechanics for the Twenty-First Century, pages 379–402. Saxe–Coburg Publications, Edinburgh, 2000.
[33] N. Higham, D.S. Mackey, N. Mackey, and F. Tisseur. Symmetric linearizations for symmetric polynomials. Technical report, Univ. of Manchester, 2005. In preparation.
[34] A. Hilliges. Numerische Lösung von quadratischen Eigenwertproblemen mit Anwendungen in der Schienendynamik. Master's thesis, TU Berlin, Berlin, FRG, 2004. Diplomarbeit, in German.
[35] A. Hilliges, C. Mehl, and V. Mehrmann. On the solution of palindromic eigenvalue problems. In Proceedings of the 4th European Congress on Computational Methods in Applied Sciences and Engineering (ECCOMAS), Jyväskylä, Finland, 2004. CD-ROM.
[36] U.B. Holz. Subspace Approximation Methods for Perturbed Quadratic Eigenvalue Problems. PhD thesis, Department of Mathematics, Stanford University, 2002.
[37] H.Y. Hu, E.H. Dowell, and L.N. Virgin. Stability estimation of high dimensional vibrating systems under state delay feedback control. J. Sound and Vibration, 214:497–511, 1998.
[38] H.Y. Hu and Z.H. Wang. Dynamics of Controlled Mechanical Systems with Delayed Feedback. Springer Verlag, Berlin, 2002.
[39] T.-M. Hwang, W.-W. Lin, and V. Mehrmann. Numerical solution of quadratic eigenvalue problems with structure-preserving methods. SIAM J. Sci. Comput., 24:1283–1302, 2003.
[40] N. K. Jain and K. Singhal. On Kublanovskaya's approach to the solution of the generalized latent value problem for functional λ-matrices. SIAM J. Numer. Anal., 20:1062–1070, 1983.
[41] N. K. Jain, K. Singhal, and K. Huseyin. On roots of functional lambda matrices. Comput. Meth. Appl. Mech. Eng., 40:277–292, 1983.
[42] E. Jarlebring. Krylov methods for nonlinear eigenvalue problems. Master's thesis, Royal Institute of Technology, Dept. Numer. Anal. Comput. Sci., Stockholm, 2003.
[43] E. Jarlebring and H. Voss. Rational Krylov for nonlinear eigenproblems, an iterative projection method. Appl. Math., to appear, 2005.
[44] V.A. Kozlov, V.G. Maz'ya, and J. Roßmann. Spectral properties of operator pencils generated by elliptic boundary value problems for the Lamé system. Rostocker Math. Kolloq., 51:5–24, 1997.
[45] V.N. Kublanovskaya. On an application of Newton's method to the determination of eigenvalues of λ-matrices. Dokl. Akad. Nauk. SSR, 188:1240–1241, 1969.


[46] V.N. Kublanovskaya. On an approach to the solution of the generalized latent value problem for λ-matrices. SIAM J. Numer. Anal., 7:532–537, 1970.
[47] P. Lancaster. Strongly stable gyroscopic systems. Electr. J. Linear Algebra, 5:53–66, 1999.
[48] P. Lancaster. Lambda-matrices and Vibrating Systems. Dover Publications, Mineola, N.Y., 2002.
[49] P. Lancaster and P. Psarrakos. A note on weak and strong linearizations of regular matrix polynomials. Unpublished 5-page note, 2004.
[50] B. Lawn. Fracture of Brittle Solids. Cambridge University Press, 1993.
[51] D. Leguillon. Computation of 3D singularities in elasticity. In M. Costabel et al., editors, Boundary value problems and integral equations in nonsmooth domains, volume 167 of Lect. Notes Pure Appl. Math., pages 161–170. Marcel Dekker, New York, 1995. Proceedings of the conference held at the CIRM, Luminy, France, May 3–7, 1993.
[52] D. Leguillon and E. Sanchez-Palencia. On 3D cracks intersecting a free surface in laminated composites. Int. J. Fracture, 99:25–40, 1999.
[53] R. B. Lehoucq, D. C. Sorensen, and C. Yang. ARPACK Users' Guide: Solution of Large-Scale Eigenvalue Problems with Implicitly Restarted Arnoldi Methods. SIAM Publications, Philadelphia, 1998.
[54] J.-R. Li. Model Reduction of Large Linear Systems via Low Rank System Gramians. PhD thesis, Massachusetts Institute of Technology, Department of Mathematics, 2000.
[55] D.S. Mackey, N. Mackey, C. Mehl, and V. Mehrmann. Palindromic polynomial eigenvalue problems: Good vibrations from good linearizations. Technical report, TU Berlin, Inst. f. Mathematik, Berlin, Germany, 2005. In preparation.
[56] D.S. Mackey, N. Mackey, C. Mehl, and V. Mehrmann. Vector spaces of linearizations for matrix polynomials. Technical report, TU Berlin, Inst. f. Mathematik, Berlin, Germany, 2005. In preparation.
[57] M. Markiewicz and H. Voss. A local restart procedure for iterative projection methods for nonlinear symmetric eigenproblems. Technical Report 82, Section of Mathematics, Hamburg University of Technology, 2004. To appear in Proc. of Algoritmy 2005.
[58] MATLAB, Version 7. The MathWorks, Inc., 24 Prime Park Way, Natick, MA 01760-1500, USA, 1996.
[59] L. Mazurenko and H. Voss. On the number of eigenvalues of a rational eigenproblem. Technical Report 61, Section of Mathematics, Hamburg University of Technology, 2003.
[60] K. Meerbergen. Locking and restarting quadratic eigenvalue solvers. SIAM J. Sci. Comput., 22:1814–1839, 2001.
[61] V. Mehrmann. The Autonomous Linear Quadratic Control Problem, Theory and Numerical Solution. Number 163 in Lecture Notes in Control and Information Sciences. Springer Verlag, Heidelberg, July 1991.
[62] V. Mehrmann and C. Shi. Analysis of higher order linear differential-algebraic systems. Preprint 2004/17, Institut für Mathematik, TU Berlin, D-10623 Berlin, FRG, 2004. URL: http://www.math.tu-berlin.de/preprints/.
[63] V. Mehrmann and D. Watkins. Structure-preserving methods for computing eigenpairs of large sparse skew-Hamiltonian/Hamiltonian pencils. SIAM J. Sci. Comput., 22:1905–1925, 2001.
[64] V. Mehrmann and D. Watkins. Polynomial eigenvalue problems with Hamiltonian structure. Electr. Trans. Num. Anal., 13:106–113, 2002.
[65] MODELICA – A unified object-oriented language for physical systems modeling. Model specification, Version 2.1. Modelica Association, 2004. Available from http://www.modelica.org/documents/ModelicaSpec21.pdf.
[66] A. Neumaier. Residual inverse iteration for the nonlinear eigenvalue problem. SIAM J. Numer. Anal., 22:914–923, 1985.
[67] M. R. Osborne. Inverse iteration, Newton's method and nonlinear eigenvalue problems. In The Contribution of Dr. J.H. Wilkinson to Numerical Analysis, Symp. Proc. Series 19, pages 21–53. Inst. Math. Appl., London, 1978.


[68] M.R. Osborne. A new method for the solution of eigenvalue problems. Comput. J., 7:228–232, 1964.
[69] M.R. Osborne and S. Michelson. The numerical solution of eigenvalue problems in which the eigenvalue parameter appears nonlinearly, with an application to differential equations. Comput. J., 7:66–71, 1964.
[70] A.M. Ostrowski. On the convergence of the Rayleigh quotient iteration for the computation of the characteristic roots and vectors I. Arch. Rat. Mech. Anal., 1:233–241, 1958.
[71] T. Penzl. A cyclic low rank Smith method for large, sparse Lyapunov equations with applications in model reduction and optimal control. SIAM J. Sci. Comput., 21:1401–1418, 2000.
[72] N. Petersmann. Substrukturtechnik und Kondensation bei der Schwingungsanalyse, volume 76 of Fortschrittberichte VDI, Reihe 11: Schwingungstechnik. VDI Verlag, Düsseldorf, 1986.
[73] J. Planchard. Eigenfrequencies of a tube bundle placed in a confined fluid. Comput. Meth. Appl. Mech. Eng., 30:75–93, 1982.
[74] J.S. Przemieniecki. Theory of Matrix Structural Analysis. McGraw-Hill, New York, 1968.
[75] A.C.M. Ran and L. Rodman. Stability of invariant Lagrangian subspaces I. In I. Gohberg, editor, Operator Theory: Advances and Applications, volume 32, pages 181–218. Birkhäuser, Basel, 1988.
[76] A.C.M. Ran and L. Rodman. Stability of invariant Lagrangian subspaces II. In H. Dym, S. Goldberg, M.A. Kaashoek, and P. Lancaster, editors, Operator Theory: Advances and Applications, volume 40, pages 391–425. Birkhäuser, Basel, 1989.
[77] E.H. Rogers. A minmax theory for overdamped systems. Arch. Rat. Mech. Anal., 16:89–96, 1964.
[78] K. Rothe. Lösungsverfahren für nichtlineare Matrixeigenwertaufgaben mit Anwendungen auf die Ausgleichelementmethode. Verlag an der Lotterbek, Ammersbek, 1989.
[79] K. Rothe. Least squares element method for boundary eigenvalue problems. Internat. J. Numer. Methods Engrg., 33:2129–2143, 1992.
[80] A. Ruhe. Algorithms for the nonlinear eigenvalue problem. SIAM J. Numer. Anal., 10:674–689, 1973.
[81] A. Ruhe. Computing nonlinear eigenvalues with spectral transformation Arnoldi. ZAMM, 76:S2:17–20, 1996.
[82] A. Ruhe. Rational Krylov for nonlinear eigenvalues, 1996. Talk presented at the Householder XIII Conference at Pontresina, Switzerland.
[83] A. Ruhe. Rational Krylov, a practical algorithm for large sparse nonsymmetric matrix pencils. SIAM J. Sci. Comput., 19(5):1535–1551, 1998.
[84] A. Ruhe. A rational Krylov algorithm for nonlinear matrix eigenvalue problems. Zapiski Nauchnyh Seminarov POMI, 268:176–180, 2000.
[85] A. Ruhe. Rational Krylov for large nonlinear eigenproblems. Technical report, Royal Institute of Technology, Stockholm, Sweden, 2004. To appear in Proceedings of PARA'04, Lyngby, Denmark, 2004.
[86] Y. Saad. Iterative Methods for Sparse Linear Systems. SIAM Publications, Philadelphia, second edition, 2003.
[87] W. Schiehlen. Advanced Multibody System Dynamics. Kluwer Academic Publishers, Stuttgart, 1993.
[88] F. Schmidt, T. Friese, L. Zschiedrich, and P. Deuflhard. Adaptive multigrid methods for the vectorial Maxwell eigenvalue problem for optical waveguide design. In W. Jäger and H.-J. Krebs, editors, Mathematics. Key Technology for the Future, pages 279–292. Springer Verlag, 2003.
[89] H. Schmitz, K. Volk, and W. L. Wendland. On three-dimensional singularities of elastic fields near vertices. Numer. Meth. Part. Diff. Eq., 9:323–337, 1993.
[90] H.R. Schwarz. Methode der finiten Elemente. Teubner Verlag, Stuttgart, 1984.
[91] C. Shi. Linear Differential-Algebraic Equations of Higher-Order and the Regularity or Singularity of Matrix Polynomials. PhD thesis, TU Berlin, Inst. f. Mathematik, 2004.


[92] G.L. Sleijpen, G.L. Booten, D.R. Fokkema, and H.A. van der Vorst. Jacobi-Davidson type methods for generalized eigenproblems and polynomial eigenproblems. BIT, 36:595–633, 1996.
[93] G.L. Sleijpen and H.A. van der Vorst. The Jacobi–Davidson method for eigenproblems and its relation with accelerated inexact Newton schemes. In S.D. Margenov and P.S. Vassilevski, editors, Iterative Methods in Linear Algebra, volume 3 of IMACS Series in Computational and Applied Mathematics, pages 377–389, New Brunswick, 1996. IMACS.
[94] G.L. Sleijpen and H.A. van der Vorst. A Jacobi-Davidson iteration method for linear eigenvalue problems. SIAM J. Matr. Anal. Appl., 17:401–425, 1996.
[95] G.L. Sleijpen, H.A. van der Vorst, and M. van Gijzen. Quadratic eigenproblems are no problem. SIAM News, 8:9–10, September 1996.
[96] S.I. Solov'ev. Eigenvibrations of a plate with elastically attached loads. Preprint SFB393/03-06, Sonderforschungsbereich 393 an der Technischen Universität Chemnitz, Technische Universität, D-09107 Chemnitz, Germany, 2003. Available at http://www.tu-chemnitz.de/sfb393/Files/PS/sfb03-06.ps.gz.
[97] D.C. Sorensen. Implicit application of polynomial filters in a k-step Arnoldi method. SIAM J. Matrix Anal. Appl., 13:357–385, 1992.
[98] M.M. Tiller. Introduction to Physical Modeling with Modelica. Kluwer Academic Publishers, Boston, 2nd edition, 2004.
[99] F. Tisseur. Backward error analysis of polynomial eigenvalue problems. Linear Algebra Appl., 309:339–361, 2000.
[100] F. Tisseur and K. Meerbergen. The quadratic eigenvalue problem. SIAM Rev., 43:234–286, 2001.
[101] M.B. van Gijzen. The parallel computation of the smallest eigenpair of an acoustic problem with damping. Int. J. Numer. Meth. Eng., 45:765–777, 1999.
[102] H. Voss. An error bound for eigenvalue analysis by nodal condensation. In J. Albrecht, L. Collatz, and W. Velte, editors, Numerical Treatment of Eigenvalue Problems, Vol. 3, volume 69 of International Series on Numerical Mathematics, pages 205–214. Birkhäuser, Basel, 1984.
[103] H. Voss. A new justification of finite dynamic element methods. In J. Albrecht, L. Collatz, W. Velte, and W. Wunderlich, editors, Numerical Treatment of Eigenvalue Problems, Vol. 4, ISNM 83, pages 232–242. Birkhäuser, Basel, 1987.
[104] H. Voss. Free vibration analysis by finite dynamic element methods. In I. Marek, editor, Proceedings of the Second International Symposium on Numerical Analysis, Teubner Texte zur Mathematik 107, pages 295–298. Teubner Verlag, Leipzig, 1988.
[105] H. Voss. An Arnoldi method for nonlinear symmetric eigenvalue problems. In Online Proceedings of the SIAM Conference on Applied Linear Algebra, Williamsburg, http://www.siam.org/meetings/laa03/, 2003.
[106] H. Voss. Initializing iterative projection methods for rational symmetric eigenproblems. In Online Proceedings of the Dagstuhl Seminar Theoretical and Computational Aspects of Matrix Algorithms, Schloss Dagstuhl 2003, ftp://ftp.dagstuhl.de/pub/Proceedings/03/03421/03421.VoszHeinrich.Other.pdf, 2003.
[107] H. Voss. A maxmin principle for nonlinear eigenvalue problems with application to a rational spectral problem in fluid–solid vibration. Appl. of Math., 48:607–622, 2003.
[108] H. Voss. A rational spectral problem in fluid–solid vibration. Electr. Trans. Num. Anal., 16:94–106, 2003.
[109] H. Voss. An Arnoldi method for nonlinear eigenvalue problems. BIT, 44:387–401, 2004.
[110] H. Voss. A Jacobi–Davidson method for nonlinear eigenproblems. In M. Buback, G.D. van Albada, P.M.A. Sloot, and J.J. Dongarra, editors, Computational Science – ICCS 2004, 4th International Conference, Kraków, Poland, June 6–9, 2004, Proceedings, Part II, volume 3037 of Lecture Notes in Computer Science, pages 34–41. Springer Verlag, Berlin, 2004.
[111] H. Voss. Numerical methods for sparse nonlinear eigenproblems. In I. Marek, editor, Proceedings of the XV-th Summer School on Software and Algorithms of Numerical Mathematics, Hejnice, 2003, pages 133–160. University of West Bohemia, Pilsen, Czech Republic, 2004.


[112] H. Voss and B. Werner. A minimax principle for nonlinear eigenvalue problems with applications to nonoverdamped systems. Math. Meth. Appl. Sci., 4:415–424, 1982.
[113] H. Voss and B. Werner. Solving sparse nonlinear eigenvalue problems. Technical Report 82/4, Inst. f. Angew. Mathematik, Universität Hamburg, 1982.
[114] D.S. Watkins. On Hamiltonian and symplectic Lanczos processes. Linear Algebra Appl., 385:23–45, 2004.
[115] B. Werner. Das Spektrum von Operatorenscharen mit verallgemeinerten Rayleighquotienten. PhD thesis, Fachbereich Mathematik, Universität Hamburg, 1970.
[116] W.H. Yang. A method for eigenvalues of sparse λ-matrices. Internat. J. Numer. Meth. Eng., 19:943–948, 1983.
[117] R. Zurmühl and S. Falk. Matrizen und ihre Anwendungen, Bd. I. Springer Verlag, Berlin, fifth edition, 1984.
[118] R. Zurmühl and S. Falk. Matrizen und ihre Anwendungen, Bd. II. Springer Verlag, Berlin, fifth edition, 1986.
