The Deflated Relaxed Incomplete Cholesky CG Method for Use in A

Procedia Computer Science Procedia Computer Computer Science Science 00 1 (2010) (2010) 1–9 249–257 www.elsevier.com/locate/procedia International Conference on Computational Science, ICCS 2010 The Deflated Relaxed Incomplete Cholesky CG method for use in a real-time ship simulator E. van ’t Wouta, M.B. van Gijzena, A. Ditzelb, A. van der Ploegb,C.Vuika aDelft University of Technology, Delft Institute of Applied Mathematics, Delft, The Netherlands bMARIN, Maritime Research Institute Netherlands, Wageningen, The Netherlands Abstract Ship simulators are used for training purposes and therefore have to calculate realistic wave patterns around the moving ship in real time. We consider a wave model that is based on the variational Boussinesq formulation, which results in a set of partial differential equations. Discretization of these equations gives a large system of linear equations, that has to be solved each time-step. The requirement of real-time simulations necessitates a fast linear solver. In this paper we study the combination of the Relaxed Incomplete Cholesky preconditioner and subdomain deflation to accelerate the Conjugate Gradient method. We show that the success of this approach depends on the relaxation parameter. For low values of the relaxation parameter, e.g. the standard IC preconditioner, the deflation method is quite successfull. This is not the case for large values of the relaxation parameter, such as the Modified IC preconditioner. We give a theoretical explanation for this difference by considering the spectrum of the preconditioned and deflated matrices. Computational results for the wave model illustrate the expected convergence behavior of the Deflated Relaxed Incomplete Cholesky CG method. We also present promising results for the combination of the deflation method and the inherently parallel block-RIC preconditioner. c 2010 Published by Elsevier Ltd. Keywords: Conjugate Gradient method, Relaxed Incomplete Cholesky preconditioner, Subdomain deflation, real-time wave model 1. Introduction The Maritime Research Institute Netherlands serves the maritime industry with innovative products. One of the products MARIN supplies is a ship manoeuvring simulator. These real-time navigation simulators are used for research, consultancy and training purposes. The current wave model of the ship simulator is based on a predefined wave spectrum and only partially interacts with objects. A new wave model is under development, which depends on the motions of a ship and the bathymetry, resulting in realistic wave patterns. To fullfill the requirement of real-time calculations, the computational methods have to be very efficient. A considerable amount of computation time is used by the linear solver in the wave model. Three linear solvers were implemented in the model: the Conjugate Gradient method combined with diagonal scaling, the Repeated Email address: [email protected] (E. van ’t Wout) 1877-0509 c 2010 Published by Elsevier Ltd. doi:10.1016/j.procs.2010.04.028 250 E.E. van’t van ’tWout Wout et et al. al. // ProcediaProcedia Computer Science Science 00 1 (2010) (2010) 1–9 249–257 2 Red-Black preconditioner, and the Modified Incomplete Cholesky preconditioner [1]. The RRB preconditioner uses a recursive red-black ordering of nodes and by permitting some extra fill-in in the last block, the spectral condition number has a smaller order than for the MIC preconditioner [2, 3]. In this paper we study possible enhancements of the MIC preconditioner. We focus on the combination of the Relaxed Incomplete Cholesky preconditioner and subdomain deflation. The deflation method reduces the spectral condition number and can be parallelized efficiently [4]. 2. Problem formulation The description of the water waves, to be used in the ship simulator, is based on a variational Boussinesq formulation. This model has been developed by Klopman [5, 6] and is described in detail in [1]. The fluid motion is assumed to be inviscid and incompressible. Starting point of the wave model is therefore the Euler equations. By integrating these equations over the whole domain, one obtains an expression for the total pressure inside the fluid in terms of the water level and the velocity potential. Minimizing the pressure functional with respect to these two variables gives a variational description of the model. The resulting nonlinear model has been simplified by writing the velocity potential as a series expansion in terms of vertical shape functions of the fluid flow. This Boussinesq approach reduces the three spatial dimensions to two. A linearization process has been carried out, paying carefull attention to the properties of the wave model. For given bathymetry and ship movements, the wave model calculates the water height. For example, the model is used to compute the wave height on a part of the Dutch river IJssel, resulting in the realistic wave pattern shown in Figure 1. Figure 1: Wave pattern around a ship sailing through the river IJssel. 2.1. Model equations The Variational Boussinesq model is governed by a set of three coupled equations: ∂ζ + ζ U + h ϕ h ψ = 0, (1) ∂t ∇· ∇ − D0 ∇ ∂ϕ + U ϕ + gζ = P , (2) ∂t ·∇ s ψ + h ϕ ψ = 0, (3) M0 ∇· D0 ∇ −N0 ∇ for the three basic variables water level ζ, surface velocity potential ϕ and structured vertical velocity potential ψ, ∂ and denoting the gradient operator x . The other symbols denote the known parameters gravitation g, water ∇ ∂y depth h, average current U, imposed pressure Ps, and the positive parameters 0, 0, 0 that depend on the water depth. Observe that the first two equations (1) and (2) form a hyperbolic set of equations,D M N similar to the shallow-water equations. The third equation (3) is elliptic and will result in the linear set of equations that has to be solved with a linear solver. E. van’tE. van Wout ’t Wout et al. et / Procedia al. / Procedia Computer Computer Science Science 1 (2010)00 (2010) 249–257 1–9 2513 2.2. Discretization methods The model equations contain both spatial and time derivatives. The Finite Volume method is used for the spatial discretization. To this end, the computational domain is partitioned into rectangular control cells. The normal derivatives on the edges of the cells are calculated using central finite difference approximations. To avoid computationally intensive solutions of linear systems, use is made of an explicit time integration scheme. The solution of the discretized model equations (1), (2) at a new timestep can therefore be calculated from the previous timestep with a matrix-vector operation. Discretization of model equation (3) yields a linear system Ax = b, (4) n n n with A R × , and x R denoting the variable ψ at the computational nodes, that are ordered lexicographically. ∈ ∈ 2.3. Matrix properties Because the properties of the linear system (4) will determine the choice of the solver, we will list some properties of A. The five-point stencil used for the discretization yields a pentadiagonal matrix, i.e., A has nonzero elements on five diagonals only. The linear system is the discretized version of model equation (3), which is an elliptic partial differential equation. The matrix A is therefore strictly diagonally dominant and is symmetric positive definite (SPD). 3. Linear solver At each time step, the wave model uses the solution of the linear system (4). Since the wave model is for use in a real-time ship simulator, the solution has to be calculated within the timestep of the model. A characteristic value of the timestep is 0.05 s. The size of A is given by the number of computational nodes. Due to the requirement of real-time calculation and the limitations of current technology, typical values are 200 200 nodes, so a matrix of size 40 000 40 000. Future developments aim to increase these dimensions. Because of× the properties of the Poisson-like equation× (4), an iterative solver is most suitable. 3.1. Preconditioned Conjugate Gradient method Since A is SPD, the Conjugate Gradient method is a natural choice as iterative solution method [7, 8]. The convergence of the CG method can be estimated with the well-known upper bound on the residual: n λmax 1 λmin − xn x A 2 x0 x A, (5) || − || ≤ ⎛ λ ⎞ || − || ⎜ max + 1⎟ ⎜ λmin ⎟ ⎜ ⎟ ⎜ ⎟ with λmin and λmax the minimal and maximal eigenvalue⎝⎜ of A respectively,⎠⎟ and A denoting the A-norm. This upper bound shows that the convergence of the CG method depends on the spectral condition|| · || number λmax . λmin By applying a preconditioner, the spectrum of the preconditioned matrix can be changed into a more favorable one, in particular one with a smaller spectral condition number. The symmetrically preconditioned system is given by 1 T P− AP− x˜ = b˜, (6) T 1 T with x˜ = P x, b˜ = P− b, and M = PP denoting the preconditioner. A suitable preconditioner can improve the convergence of the CG method considerably. The preconditioner should be chosen such that the system Px = y is relatively easy to solve. 252 E.E. van’t van ’tWout Wout et et al. al. // ProcediaProcedia Computer Science Science 00 1 (2010) (2010) 1–9 249–257 4 3.2. Incomplete Cholesky preconditioner Incomplete Cholesky preconditioners are among the best known preconditioners [9]. The symmetric matrix A can be approximated by a decomposition LLT where L is a lower triangular matrix with a prescribed sparsity pattern. The sparsity pattern is chosen to be the same as of A,soL + LT can also be represented by a five-point stencil. For this choice the storage requirements of L and A are the same. After scaling of the matrix, the outer diagonals of L equal the ones of A. Therefore, only the main diagonal of L has to be calculated, according to the incomplete Cholesky decomposition. An important difference between different versions of the incomplete decomposition is the handling of the fill-in elements.

The Deflated Relaxed Incomplete Cholesky CG Method for Use in A

Conjugate Gradient Method (Part 4)  Pre-Conditioning  Nonlinear Conjugate Gradient Method

Hybrid Algorithms for Efficient Cholesky Decomposition And

Matrix Inversion Using Cholesky Decomposition

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 3: Positive-Deﬁnite Systems; Cholesky Factorization

1. Positive Definite Matrices a Matrix a Is Positive Deﬁnite If X>Ax > 0 for All Nonzero X

Cholesky Decomposition 1 Cholesky Decomposition

The CMA Evolution Strategy: a Tutorial

Reducing Dimensionality in Text Mining Using Conjugate Gradients and Hybrid Cholesky Decomposition

Stable Computations of Generalized Inverses of Positive Semidefinite

Hierarchical Sparse Cholesky Decomposition with Applications to High-Dimensional Spatio-Temporal ﬁltering

LAPACK Working Note

A Scalable High Performant Cholesky Factorization for Multicore with GPU Accelerators LAPACK Working Note #223