
MGRIT preconditioned Krylov subspace method

Ryo Yoda, Akihiro Fujii, Teruo Tanaka
Kogakuin University, Shinjuku-ku, Tokyo
[email protected], [email protected], [email protected]

ABSTRACT
MGRIT re-discretizes the problem with a larger time-step width at the coarse levels, which often causes instability. We propose a Krylov subspace method with MGRIT preconditioning as a more stable solver. For unstable problems, the MGRIT preconditioned Krylov subspace method performed better than MGRIT in terms of the number of iterations. The contributions of this paper are as follows. We show the matrix form of the MGRIT operations and the resulting improvement of the eigenvalue and singular-value distributions, and we demonstrate that the MGRIT preconditioned Krylov subspace method reaches convergence faster than MGRIT.

CCS CONCEPTS
• Mathematics of computing → Solvers;

KEYWORDS
parallel-in-time, linear solver, MGRIT, Krylov subspace method, preconditioning

ACM Reference Format:
Ryo Yoda, Akihiro Fujii, and Teruo Tanaka. 2018. MGRIT preconditioned Krylov subspace method. In Proceedings of Super Computing 2018 (SC'18). ACM, New York, NY, USA, 3 pages.

1 INTRODUCTION
In massively parallel environments, Multigrid Reduction in Time (MGRIT) [1][2][5] has gained attention as a scalable solver for time evolution problems. Because this method constructs a multilevel structure by enlarging the time-step width, the coarser-level problems become unstable. Here, we propose a Krylov subspace method with MGRIT as a preconditioner to enhance the stability of the solver. The proposed method inherits the scalability of MGRIT, can be performed with the same computational complexity as MGRIT, and needs additional memory only for the restart vectors. Because the time evolution matrix is nonsymmetric, we use the Generalized Minimal RESidual method (GMRES) [8][10][11][12] as the Krylov subspace method in this study.

The paper is organized as follows. Section 2 gives an overview of the MGRIT algorithm. Section 3 describes the MGRIT preconditioning matrix and its application. Section 4 presents experiments that examine the eigenvalue and singular-value distributions and the number of iterations.

2 MGRIT
When the differential equation is discretized by the implicit method, the relation between the unknowns of successive time-steps is described as in Eq. (1). The matrix A constructed for a simulation length T from Eq. (1) is called a time evolution matrix.

    g_0 = u_0,    \Phi u_{i+1} = u_i + g_{i+1}    (i = 0, 1, ..., T - 1)    (1)

    A u = \begin{pmatrix} I & & & \\ -I & \Phi & & \\ & \ddots & \ddots & \\ & & -I & \Phi \end{pmatrix}
          \begin{pmatrix} u_0 \\ u_1 \\ \vdots \\ u_T \end{pmatrix}
        = \begin{pmatrix} g_0 \\ g_1 \\ \vdots \\ g_T \end{pmatrix} = g    (2)

Let m be the coarsening ratio. For the above problem A u = g, MGRIT [2] constructs a coarse-level matrix A_\Delta with a wider time-step width equal to m × \Delta t. It recursively constructs coarser levels and solves the problem using the coarse-level solutions.

3 MGRIT PRECONDITIONED GMRES
3.1 Preconditioning matrix
When MGRIT is employed as a preconditioner, it is necessary to derive its stationary iteration matrix, so we describe all MGRIT operations in matrix form. MGRIT divides the time-steps into C-points and F-points. We derive the matrix form of the C-relaxation, which updates the C-points, and of the F-relaxation, which updates the F-points. Each relaxation can be written by defining operators I_c and I_f that extract only the C-points and the F-points, respectively. The examples below use six time-steps with C-points at i = 0 and i = 3.

• C-relaxation: update the C-points from the F-points of the previous step.

    M_c u = I_f u + I_c g,    i.e.    u = M_c^{-1} I_f u + M_c^{-1} I_c g

    M_c = \begin{pmatrix} I & & & & & \\ & I & & & & \\ & & I & & & \\ & & -I & \Phi & & \\ & & & & I & \\ & & & & & I \end{pmatrix},
    I_f = \mathrm{diag}(0, I, I, 0, I, I),
    I_c = \mathrm{diag}(I, 0, 0, I, 0, 0)

• F-relaxation: update the F-points sequentially from the C-points.

    M_f u = I_c u + I_f g,    i.e.    u = M_f^{-1} I_c u + M_f^{-1} I_f g

    M_f = \begin{pmatrix} I & & & & & \\ -I & \Phi & & & & \\ & -I & \Phi & & & \\ & & & I & & \\ & & & -I & \Phi & \\ & & & & -I & \Phi \end{pmatrix},

with the same I_c and I_f as above.
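As a concrete illustration of Eq. (2) and the two relaxations, the sketch below builds A for a scalar model problem (so \Phi is a plain number rather than a matrix) and applies one C-relaxation followed by one F-relaxation. The values of T, m, phi, and g are illustrative, not taken from the paper's experiments.

```python
import numpy as np

T = 5                      # time-steps: unknowns u_0 .. u_T (illustrative)
m = 3                      # coarsening ratio: C-points at i = 0, m, 2m, ...
phi = 1.25                 # scalar stand-in for the implicit stepper Phi
n = T + 1

# Time evolution matrix of Eq. (2): diag(1, phi, ..., phi), -1 sub-diagonal.
A = np.diag([1.0] + [phi] * T) + np.diag([-1.0] * T, k=-1)

g = np.ones(n)             # right-hand side (g_0 plays the role of u_0)
is_c = np.arange(n) % m == 0

def c_relax(u, g):
    """C-relaxation: update each C-point from the preceding F-point."""
    u = u.copy()
    u[0] = g[0]
    for i in range(1, n):
        if is_c[i]:
            u[i] = (u[i - 1] + g[i]) / phi   # solve phi*u_i = u_{i-1} + g_i
    return u

def f_relax(u, g):
    """F-relaxation: update the F-points sequentially between C-points."""
    u = u.copy()
    for i in range(1, n):
        if not is_c[i]:
            u[i] = (u[i - 1] + g[i]) / phi
    return u

u = f_relax(c_relax(np.zeros(n), g), g)
r = g - A @ u              # residual after one CF-sweep
```

After the CF-sweep the residual vanishes at row 0 and at every F-row; only the rows of C-points that follow a freshly updated F-point keep a residual, and that remainder is what the coarse level of MGRIT corrects.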

The restriction matrix R sets I at the C-points and reduces the dimension to the coarse level. The interpolation matrix P is R^T, which expands the dimension to the fine level. When the pre-smoothing and the post-smoothing are expressed as u = S_1 u + T_1 g and u = S_2 u + T_2 g respectively, the two-level preconditioning matrix M is given by

    M^{-1} = S_2 T_1 + T_2 + S_2 P A_\Delta^{-1} R (I - A T_1).

3.2 Preconditioned GMRES
GMRES allows the application of right preconditioning [11][12]. The pseudocode is shown below; the preconditioning step is the line z_j = M^{-1} v_j. MGRIT preconditioned GMRES (MGRIT-GMRES) applies the MGRIT cycle M^{-1} as the preconditioner in the pseudocode.

    v_0 = r_0 / ||r_0||_2
    for j = 0, k                        # k is the restart length
        z_j = M^{-1} v_j                # preconditioning
        v_{j+1} = A z_j
        for i = 0, j
            h[i][j] = (v_{j+1}, v_i)
            v_{j+1} = v_{j+1} - h[i][j] * v_i
        end
        h[j+1][j] = ||v_{j+1}||_2
        v_{j+1} = v_{j+1} / h[j+1][j]
    end
    Compute x_k = x_0 + Z_k y_k         # Z_k = [z_0, z_1, ..., z_k]

Since it keeps the orthogonal basis vectors for k iterations, MGRIT-GMRES requires more memory space than MGRIT. As for the calculation cost, the most cost-intensive operation in GMRES is the sparse matrix-vector product, which has the same computational complexity as Au, the operation required for the MGRIT residual calculation.

4 NUMERICAL EXPERIMENTS
We used the Oakforest-PACS supercomputer (JCAHPC) [17]. As an example of a time evolution problem, we considered the two-dimensional linear diffusion problem with a 5-point finite difference. The problem discretized by the implicit method was regarded as a stable problem, while the one discretized by the explicit method with a large \Delta t was regarded as an unstable problem.

The iteration count of a Krylov subspace method depends on the eigenvalue distribution of A. Therefore, we first checked whether MGRIT preconditioning improves the eigenvalue distribution of the time evolution matrix A. In the case of the explicit method, all the diagonal elements of A are 1, so all the eigenvalues are 1; for that case we instead examine the improvement of the singular value distribution.

As shown in Fig. 1, the eigenvalue distribution improved considerably upon MGRIT preconditioning. Similarly, as shown in Fig. 2, the singular value distribution improved considerably upon MGRIT preconditioning.

We implemented GMRES, MGRIT, and MGRIT-GMRES (the proposed method) with both time-direction and space-direction parallelization using flat MPI. The data distribution is shown in Fig. 3 and Fig. 4: there are 16 processes (4 in the time direction × 4 in the space direction), and the 4 colors in Fig. 3 and Fig. 4 correspond to 4 MPI communicators in the space direction.

[Fig. 3: The relation between A and each process. Fig. 4: The status in the space and time directions.]

We measured the number of iterations for GMRES, MGRIT, and MGRIT-GMRES when the problem was discretized by the implicit or the explicit method.

[Fig. 5: Implicit: residual history vs. number of iterations (space 4x4, 256 time-steps). Fig. 6: Explicit: residual history vs. number of iterations (space 4x4, 256 time-steps).]

As shown in Fig. 5, the residual histories of MGRIT and MGRIT-GMRES agreed approximately, because the coarse-level correction with the implicit method was accurate enough for this problem. As shown in Fig. 6, the MGRIT residual norm increased for a few initial iterations, indicating that the coarse-grid correction degrades because of the larger time-step width. In this case, MGRIT-GMRES decreased the residual monotonically and reduced the number of iterations.
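The remark about the explicit scheme can be checked directly: the explicit time evolution matrix is lower triangular with unit diagonal, so its eigenvalues are all 1 and carry no information, while its singular values still spread. A small NumPy check, with an illustrative amplification factor not taken from the paper:

```python
import numpy as np

n = 16
phi = 1.1   # illustrative explicit-stepper amplification factor
# Explicit scheme u_{i+1} = phi*u_i + g_{i+1}: unit diagonal, -phi sub-diagonal.
A = np.eye(n) + np.diag([-phi] * (n - 1), k=-1)

eigs = np.linalg.eigvals(A)                 # all 1: triangular, unit diagonal
svals = np.linalg.svd(A, compute_uv=False)  # these spread above and below 1
```

Since det(A) = 1, the singular values multiply to 1, so they straddle 1 whenever they are not all equal; this spread is what Fig. 2 shows being compressed by MGRIT preconditioning.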
[Fig. 1: Implicit: eigenvalue distribution, original vs. MGRIT-preconditioned (space 4x4, 32 time-steps). Fig. 2: Explicit: singular value distribution, original vs. MGRIT-preconditioned (space 4x4, 32 time-steps).]

5 CONCLUSION
In massively parallel environments, MGRIT attracts attention as a scalable solver for time evolution problems. When constructing a multilevel structure in the time direction, the coarsening of the time-step width poses a problem. We proposed the MGRIT preconditioned Krylov subspace method and evaluated it through the eigenvalue distribution and the number of iterations for convergence. MGRIT preconditioned GMRES reduces the number of iterations with almost the same computational complexity as MGRIT and additional memory only for the restart vectors. The Krylov subspace method enhances the stability of the MGRIT method. Moreover, MGRIT preconditioning does not depend on a particular Krylov subspace method; we will combine it with other Krylov subspace methods and evaluate larger time-step widths and more unstable problems discretized from other governing equations.

ACKNOWLEDGMENTS
This work was partially supported by JSPS KAKENHI Grant Number JP15K15998 and by the "Joint Usage/Research Center for Interdisciplinary Large-scale Information Infrastructures" and "High Performance Computing Infrastructure" projects in Japan (Project ID: jh180022-NAHI).

REFERENCES
[1] R. D. Falgout, S. Friedhoff, Tz. V. Kolev, S. P. MacLachlan, and J. B. Schroder, Parallel Time Integration with Multigrid, SIAM J. Sci. Comput., 36, pp. C635-C661, (2014)
[2] R. D. Falgout, A. Katz, Tz. V. Kolev, J. B. Schroder, A. M. Wissink, and U. M. Yang, Parallel Time Integration with Multigrid Reduction for a Compressible Fluid Dynamics Application, Journal of Computational Physics, (2014)
[3] H. Gahvari, V. A. Dobrev, R. D. Falgout, Tz. V. Kolev, J. B. Schroder, M. Schulz, and U. M. Yang, A Performance Model for Allocating the Parallelism in a Multigrid-in-Time Solver, The 7th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS16), Supercomputing 16, (2016)
[4] R. D. Falgout, S. Friedhoff, Tz. V. Kolev, S. P. MacLachlan, J. B. Schroder, and S. Vandewalle, Multigrid Methods with Space-Time Concurrency, Computing and Visualization in Science, Springer, (2017)
[5] V. Dobrev, Tz. Kolev, N. A. Petersson, and J. B. Schroder, Two-level Convergence Theory for Multigrid Reduction in Time (MGRIT), SIAM J. Sci. Comput., 39, pp. 501-527, (2017)
[6] M. J. Gander and S. Vandewalle, Analysis of the Parareal Time-Parallel Time-Integration Method, SIAM J. Sci. Comput., 29, pp. 556-578, (2007)
[7] M. J. Gander, 50 Years of Time Parallel Time Integration, in Multiple Shooting and Time Domain Decomposition Methods, Springer, Cham, pp. 69-113, (2015)
[8] Y. Saad and M. H. Schultz, GMRES: A Generalized Minimal Residual Algorithm for Solving Nonsymmetric Linear Systems, SIAM J. Sci. Stat. Comput., 7, pp. 856-869, (1986)
[9] Y. Saad, Krylov Subspace Methods on Supercomputers, SIAM J. Sci. Stat. Comput., 10, pp. 1200-1232, (1989)
[10] H. A. van der Vorst and C. Vuik, The Superlinear Convergence Behaviour of GMRES, J. Comput. Appl. Math., 48, pp. 327-341, (1993)
[11] J. Baglama, D. Calvetti, G. H. Golub, and L. Reichel, Adaptively Preconditioned GMRES Algorithms, SIAM J. Sci. Comput., 20, pp. 243-279, (1998)
[12] Y. Saad, Iterative Methods for Sparse Linear Systems, SIAM, second edition, (2003)
[13] W. L. Briggs, V. E. Henson, and S. F. McCormick, A Multigrid Tutorial, Second Edition, SIAM, (2000)
[14] I. Yavneh, Why Multigrid Methods Are So Efficient, Computing in Science and Engineering, 8, pp. 12-22, (2006)
[15] O. Tatebe, The Multigrid Preconditioned Conjugate Gradient Method, 6th Copper Mountain Conference on Multigrid Methods, pp. 621-634, (1993)
[16] K. Nakajima, Parallel Iterative Linear Solvers with Preconditioning for Large Scale, (2002)
[17] Oakforest-PACS, https://www.cc.u-tokyo.ac.jp/index-e.html (accessed on July 31, 2018)