Feti-Dp: a Dual–Primal Unified Feti Method—Part I 1525
Total Page:16
File Type:pdf, Size:1020Kb
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING Int. J. Numer. Meth. Engng 2001; 50:1523–1544 FETI-DP: a dual–primal uniÿed FETI method—part I: A faster alternative to the two-level FETI method Charbel Farhat,1;∗;† Michel Lesoinne,1 Patrick LeTallec,2 Kendall Pierson1 and Daniel Rixen3 1Department of Aerospace Engineering Sciences and Center for Aerospace Structures; University of Colorado at Boulder; Boulder; CO 80309-0429; U.S.A. 2Universiteà de Paris Dauphine and Ecole Polytechnique; 91 128 Palaiseau Cedex; France 3LTAS; UniversitedeLià ege ; Rue Ernest Solvay; 21; B-4000 Liege ; Belgium SUMMARY The FETI method and its two-level extension (FETI-2) are two numerically scalable domain decomposition methods with Lagrange multipliers for the iterative solution of second-order solid mechanics and fourth-order beam, plate and shell structural problems, respectively.The FETI-2 method distinguishes itself from the basic or one-level FETI method by a second set of Lagrange multipliers that are introduced at the subdomain cross-points to enforce at each iteration the exact continuity of a subset of the displacement ÿeld at these speciÿc locations. In this paper, we present a dual–primal formulation of the FETI-2 concept that eliminates the need for that second set of Lagrange multipliers, and uniÿes all previously developed one-level and two-level FETI algorithms into a single dual–primal FETI-DP method. We show that this new FETI-DP method is numerically scalable for both second-order and fourth-order problems. We also show that it is more robust and more computationally ecient than existing FETI solvers, particularly when the number of subdomains and=or processors is very large. Copyright ? 2001 John Wiley & Sons, Ltd. KEY WORDS: domain decomposition; numerical scalability; iterative methods 1. BACKGROUND The ÿnite element tearing and interconnecting (FETI) methods are a family of domain decompo- sition (DD) algorithms with Lagrange multipliers that have been developed during the last decade for the fast sequential and parallel iterative solution of large-scale systems of equations arising from the ÿnite element discretization of partial di erential equations. From a mechanical viewpoint, a FETI method can be viewed as an iterative substructuring method where Lagrange multipliers are introduced at the substructure interfaces to enforce the continuity of the displacement ÿeld. By ∗Correspondence to: Charbel Farhat, Department of Aerospace Engineering and Center for Aerospace Structures, University of Colorado at Boulder, Boulder, CO 80309-0429, U.S.A. †E-mail: [email protected] Contract=grant sponsor: Sandia National Laboratories; contract=grant number: BD-2435 Contract=grant sponsor: Department of Energy; contract=grant number: B347880=W-740-ENG-48 Contract=grant sponsor: INRIA Received 16 August 1999 Copyright ? 2001 John Wiley & Sons, Ltd. Revised 10 May 2000 1524 C. FARHAT ET AL. construction, equilibrium is satisÿed at each iteration in each substructure. However, the continuity condition is met only at convergence. From a mathematical viewpoint, each FETI method can be viewed as a two-step preconditioned conjugate gradient (PCG) algorithm where subdomain problems with Dirichlet boundary conditions are solved in the preconditioning step, and related subdomain problems with Neumann boundary conditions are solved in a second step to compute residuals. The basic FETI method, also known as the one-level FETI method, or simply the FETI method, was developed in References [1–5] together with its ‘lumped’ and ‘Dirichlet’ preconditioners, and extended in Reference [4] to substructure problems with non-matching interfaces. Its optimal con- vergence properties for second-order elliptic problems (i.e. thermal, structural, and solid mechanics problems discretized by plane stress=strain and=or brick elements) were ÿrst exposed numerically in Reference [5], then mathematically established in Reference [6]. More speciÿcally, it was proved in Reference [6] that when the basic FETI method is equipped with the Dirichlet preconditioner [5] and applied to second-order elliptic problems, the condition number Ä of its interface problem grows asymptotically as Ä = O(1 + logm(H=h));m63 (1) where h and H denote, respectively, the mesh and subdomain sizes. The conditioning result (1), which was later proved to hold also with m =2[7; 8], demonstrates that the FETI method is numerically scalable with respect to both the problem size and number of subdomains. Indeed, from Equation (1) it can be concluded that 2 P1. If for a ÿxed number of subdomains Ns (Ns = O(1=H ) in two dimensions and Ns = O(1=H 3) in three dimensions) the problem size is increased (O(1=h2) in two dimensions and O(1=h3) in three dimensions), the condition number of the FETI method grows asymp- totically only as log2 1=h. In other words, one can expect the FETI method to solve large- scale problems in a similar number of iterations as small-scale problems. P2. If for a ÿxed problem size the number of subdomains is increased—for example, for parallel processing purposes or in order to reduce the operation count of each iteration—the iteration count of the FETI method can be expected to decrease. P3. If the size of the subdomain problem, which can be characterized by H=h, is kept constant and the total size of the problem is increased by increasing the number of subdomains— as is often done when the number of processors of a given parallel computing platform is increased—the condition number of the FETI method remains constant. This property makes it possible for a well-implemented FETI method to solve an n-times larger problem on an n-times larger machine in a constant amount of CPU time. The three numerical scalability properties highlighted above are also veriÿed in practice for complex problems (for example, see References [5; 9; 10] and the references cited therein). For a suciently large problem, the parallel scalability of the FETI method—that is, its ability to deliver a speed-up that grows reasonably well with the number of processors (almost linearly)—depends on the parallel implementation of some key computational kernels of this DD method, particularly the solution of the associated ‘coarse’ problems [10], and some key characteristics of the target parallel hardware such as memory bandwidth and cache management. The parallel scalability of the FETI method, and its ability to solve an n-times larger problem on an n-times larger machine in a constant amount of CPU time have been recently demonstrated on a 1000-processor conÿguration of the ASCI Option Red massively parallel machine installed at the Sandia National Laboratories. Copyright ? 2001 John Wiley & Sons, Ltd. Int. J. Numer. Meth. Engng 2001; 50:1523–1544 FETI-DP: A DUAL–PRIMAL UNIFIED FETI METHOD—PART I 1525 The FETI method has been successfully extended to problems with multiple and=or repeated right-hand sides [11; 12], component mode synthesis [13], transient response analysis [14], non- linear solution strategies [15–17], free-vibration analysis [18], problems with multipoint constraints [19], problems with severe heterogeneities such as large discontinuities in material properties [20], and most recently acoustic scattering problems [21; 22]. It has also inspired many variants, and other extensions and applications, among which we note those described in [23–28]. All these vari- ants share the same lumped and Dirichlet preconditioners and the same coarse problem constructed with the rigid body modes of the oating subdomains, all of which were originally developed for the FETI method. A distinctive feature of the FETI method is its rigid-body-based auxiliary problem that is nat- urally derived from the subdomain equations of equilibrium [1–3]. This auxiliary problem must be solved twice at each PCG iteration. Because its size is in general as small as 3Ns in two dimensions and 6Ns in three dimensions, this problem is referred to as a ‘coarse’ problem. In Ref- erences [5; 6], it was shown that for second-order partial di erential equations, this coarse problem is the main reason why the FETI method is numerically scalable with respect to the number of subdomains. The inexpensive solution at each PCG iteration of this coarse problem propagates the error globally and accelerates convergence. For transient dynamics applications, a oating subdomain has a singular static sti ness matrix but a non-singular dynamic sti ness matrix. Hence, the subdomain equations of dynamic equilibrium do not naturally lead to any coarse problem. For this reason, a new coarsening procedure was designed in Reference [14] for addressing the solution of second-order transient dynamic problems by a numerically scalable FETI method. This procedure had led to a new FETI coarse problem for time-dependent applications that is also based on the null spaces of the subdomain static sti ness matrices, but which has a di erent expression than the FETI coarse problem for static applications and therefore requires a di erent computer implementation. For fourth-order plate and shell problems, the basic FETI method is not numerically scalable, and therefore is not as performing as for second-order problems. However, it was shown in References [29–31] that if at each PCG iteration the exact continuity of the (transverse) displacement ÿeld is enforced at the subdomain crosspoints, the FETI method becomes numerically scalable also for fourth-order (plate) and shell problems. Such a condition can be satisÿed by introducing a set of additional Lagrange multipliers at the subdomain crosspoints, and determining their values at each PCG iteration by solving another coarse problem that is based on both the subdomain rigid body and ‘corner’ modes. Because this new coarse problem can be expressed as the projection on a subspace of the interface problem associated with the basic FETI method, the resulting DD method was called in References [30; 31] the two-level FETI method (FETI-2).