Open Archive TOULOUSE Archive Ouverte ( OATAO ) OATAO is an open access repository that collects the work of Toulouse researchers and makes it freely available over the web where possible.
This is an author-deposited version published in: http://oatao.univ-toulouse.fr/ Eprints ID : 14634
To link to this article : DOI : 10.1007/s11831-014-9132-x URL : http://dx.doi.org/10.1007/s11831-014-9132-x
To cite this version : Duval, Mickaël and Passieux, Jean-Charles and Salaün, Michel and Guinard, Stéphane Non-intrusive Coupling: Recent Advances and Scalable Nonlinear Domain Decomposition . (2014) Archives of Computational Methods in Engineering. ISSN 1134- 3060
Any corres pondance concerning this service should be sent to the repository administrator: [email protected] ORIGINAL PAPER
Non-intrusive Coupling: Recent Advances and Scalable Nonlinear Domain Decomposition
Mickaël Duval · Jean-Charles Passieux · Michel Salaün · Stéphane Guinard
Abstract This paper provides a detailed review of the 1 Introduction global/local non-intrusive coupling algorithm. Such method allows to alter a global finite element model, without actu- Simulation in solid mechanics suffers from an intrinsic issue: ally modifying its corresponding numerical operator. We physical phenomena are complex and heterogeneous, which also look into improvements of the initial algorithm (Quasi- makes the use of accurate numerical models uneasy. Indeed, Newton and dynamic relaxation), and provide comparisons simulations are closely bound to computing resources (both based on several relevant test cases. Innovative examples hardware and software), which prevents systematic use of and advanced applications of the non-intrusive coupling complex, accurate numerical models. algorithm are provided, granting a handy framework for Hopefully, most of time simplest models (i.e. computa- both researchers and engineers willing to make use of such tionally cheap) are good enough at a global scale, and one process. Finally, a novel nonlinear domain decomposition can rely on specific models (i.e. complex and computation- method is derived from the global/local non-intrusive cou- ally expensive) only on small areas, at a local scale. pling strategy, without the need to use a parallel code or soft- Such assumption allowed the emergence of a wide variety ware. Such method being intended to large scale analysis, of numerical methods dedicated to multi-scale and/or multi- we show its scalability. Jointly, an efficient high level Mes- model computing. For simplicity of the presentation, we will sage Passing Interface coupling framework is also proposed, divide them into two main classes of numerical method: finite granting an universal and flexible way for easy software cou- element model enrichment and finite element model cou- pling. A sample code is also given. pling. First, one can cite enrichment methods based on the Par- tition of Unity Method [66]: the Generalised Finite Element Method [27,53,81] and the eXtended Finite Element Method [67] being the most famous ones. Their principle is to enrich the finite element functional space with specific functions, which can result from asymptotic expansion [16]orpre- B · · M. Duval ( ) J.-C. Passieux M. Salaün computed local finite element problem solution [18,19]for Institut Clément Ader (ICA), INSA de Toulouse, UPS, Mines Albi, ISAE, Université de Toulouse, 3 rue Caroline Aigle, instance. 31400 Toulouse, France Then, enrichment methods are based on micro-macro e-mail: [email protected] models. Their objective is to compute a solution u as a combi- J.-C. Passieux nation of a macro scale solution u M and a micro scale correc- e-mail: [email protected] tion um, so that u = u M + um. Then the micro scale solution M. Salaün acts as a correction of the macro scale solution, while ensur- e-mail: [email protected] ing unknowns (displacements, forces, stress, strain) equal- ity at the interface between the macro and the micro scale S. Guinard Airbus Group Innovations, BP 90112, 31703 Blagnac, France [40,59]. There exists a wide range of micro-macro meth- e-mail: [email protected] ods. The micro model can either be solved analytically as done in the Variational MultiScale method [47], or using the ing finite element model, without actually altering its corre- finite element method as well as done by the Strong Coupling sponding numerical operator [34,82]. Method [48]. When dealing with highly heterogeneous mod- Thus, a main consequence of non-intrusiveness is the pos- els, the whole macro domain can even be entirely mapped sibility to easily merge commercial software and research with micro models, as done is the Hierarchical Dirichlet Pro- codes, as no modification of the software will be required. jection Method [70,87]. In some cases, micro-macro prin- In addition, such algorithm will easily fit the standard ciple is applied to the finite element solver itself, providing input/output specifications of most industrial software. efficient multi-grid numerical methods [36,73,78]. Actually, the non-intrusive global/local coupling strategy Finally, one can also cite structural zooming [25] and is currently under investigation through several applications. finite element patches [55,76,77]. The principle of structural One can cite crack propagation [41,74]. In that case, a zooming is to use a computed global solution as boundary two-level non-intrusive coupling is proposed, either within condition for a local refined problem; this method is wide- a multi-grid or a GFEM framework: a first global model is spread in structural engineering. Besides, the finite element used in order to represent the global structure behaviour, a patches method relies on an iterative process in order to take second local model takes into account the crack. into account the effect of the local model on the global solu- Then, non-intrusive coupling is also investigated within tion. The main interest of such methods is their ability to com- a stochastic framework [21]. The objective is here to take pute local corrections in a flexible way, i.e. make the local into account local uncertainties into a global problem with patch definition independent from the global model charac- deterministic operator, using a non-intrusive strategy. The teristic. main property of such coupling is its ability to represent the Following the same idea, structural reanalysis [3,46] stochastic effect of the local uncertainties at the global scale intends to compute a posteriori a local correction from a without altering the initial global deterministic operator. given global solution. In that case, not only the solution is to One can finally cite 2D/3D coupling [39]. The strategy be corrected, but the global model itself. developed here aims at coupling a global plate model with Despite all these efforts, it may appear that model enrich- local 3D models on localised zones where plate modelling is ment cannot be practically used. Indeed, in an industrial con- inadequate. text, most of time one has to rely on existing commercial All in all, such flexible method can be applied to a very software which may have been developed and certified for wide variety of static mechanical analysis, including quasi- a specific purpose. Though, it is not always easy or even static crack propagation, plasticity, contact, composite failure possible to use a given software in order to achieve multi- [24],..., and even transient dynamics problems [14,20]. scale, heterogeneous computation. Moreover, supercomput- In this paper, we propose to analyse the effectiveness of ers recent developments allow to run very ambitious sim- this algorithm through various applications (crack, plastic- ulations thanks to parallel computations. Thus, instead of ity, contact), and a comparative analysis of some existing embedding all the needing specificities into a unique finite acceleration methods in the literature. element model, the present-day trend is to rely on model It is also proposed to extend the method to the case of coupling. multiple patches. We also show that this type of method can Most common coupling methods are based upon iterative be used to locally redefine the geometry and boundary con- sub-structuring algorithms [15,64], possibly combined with ditions. static condensation [35,85,86], and Schwarz algorithms [32, In fact, model coupling is also widely used for domain 37,60]. decomposition. Among them one can cite the well-known As for the models and/or domains connexions, a wide Finite Element Tearing and Interconnecting method [28], range of numerical methods are available in the litera- the Balanced Domain Decomposition method [62] and the ture. Among them, there is the Mortar method [8,12,13] LArge Time INcremental method [58] which are the most which is based upon weak equality enforcing at the interface used in structural engineering. through Lagrange multipliers or the Nitsche method [7,31, In that context, nonlinear localisation algorithms have also 42,43,65,68]. Besides, energy averaging method, namely the recently been proposed [5,6,23,75]. The objective of nonlin- Arlequin method [9–11] brought a flexible tool for coupling ear localisation algorithm is to bring an efficient way to apply models. domain decomposition method to nonlinear problem through Nevertheless all the above cited methods require quite the general Newton-Krylov-Schur solvers class. deep adaptation or modification of finite element solvers We then propose a new algorithm for nonlinear domain and software, which is not always doable in an industrial decomposition based on the concept of non-intrusive cou- context. pling. More recently, a new class of method is emerging: the All the examples illustrated in this paper have been com- non-intrusive coupling. It allows to locally modify an exist- puted using Code_Aster, an open source software package Non-intrusive Coupling: Recent Advances and Scalable Nonlinear Domain Decomposition for numerical simulation in structural mechanics developed by the French company EDF [88] for industrial applications. The complete code used to run one of the given examples is also provided as “Appendix”.
2 Non-intrusive Coupling: A State of the Art
2.1 Mechanical Problem
Let us consider an elastic body represented by a geometrical domain Ω. Displacement u D is prescribed on the Dirichlet Fig. 1 Situation overview: global/local mechanical problem boundary ΓD, surface force fN is applied on the Neumann boundary ΓN and body force fΩ is applied on Ω. Then we seek to solve the following mechanical problem: Then a possible way to deal with heterogeneous models is to use domain decomposition based model coupling, each (P) : min J(u) (1) domain being represented with its own ad hoc model. Without u∈U loss of generality, we will take here the example of a cracked For the sake of simplicity, we consider here a linear elastic domain (see Fig. 1). The domain is then divided into a global Ω Ω model (C being the corresponding Hooke tensor and ε being part G and a local part L , thus those two non-overlapping Γ the infinitesimal strain tensor). We then give the definition of sub-domains share a common interface . The Dirichlet and the affine space U (we will also make use of its correspond- Neumann boundaries are also partitioned as following: ing vector space U 0) and the potential function J. • Γ , = Γ ∩ ∂Ω and Γ , = Γ ∩ ∂Ω 1 D G D G D L D L U ={u ∈ H (Ω), u|Γ = u } (2) D D • ΓN,G = ΓN ∩ ∂ΩG and ΓN,L = ΓN ∩ ∂ΩL 0 1 U ={u ∈ H (Ω), u|Γ = 0} (3) D 1 J(u) = Cε(u(x)) : ε(u(x))dx We then need to define new functional spaces in order to give 2 Ω an adapted formulation to the global/local problem, UG and U U 0 U 0 L (and their corresponding vector spaces G and L ). − fN (x) · u(x)dx − fΩ (x) · u(x)dx (4) Γ Ω N U ={ ∈ 1(Ω ), | = } G u H G u ΓD,G u D (8) In the context of the finite element method, we will make use 1 U ={u ∈ H (Ω ), u|Γ = u } (9) of the equivalent variational formulation L L D,L D
We also consider Uψ ⊂ L2(Γ ) the Lagrange multipliers ∈ U , ∀v ∈ U 0, ( ,v)= (v) u a u l (5) functional space. Displacement and efforts continuity will be imposed in a with the following definition of the bilinear and linear forms weak sense via a mortar method at the interface [8,12,13,33]. a and l: We then get the following dual formulation for the domain decomposition problem: a(u,v)= Cε(u(x)) : ε(v(x))dx (6) Ω uG ∈ UG , uL ∈ UL ,ψ∈ Uψ (v) = ( ) · v( ) + ( ) · v( ) 0 l fN x x dx fΩ x x dx (7) ∀vG ∈ U , aG (uG ,vG ) + b(ψ, vG ) = lG (vG ) Γ Ω G N ∀v ∈ U 0, ( ,v ) − (ψ, v ) = (v ) L L aL uL L b L lL L When using the finite element method to solve such a prob- ∀ϑ ∈ Uψ , b(ϑ, u − u ) = 0(10) lem, a mesh will be set up, a stiffness matrix and a right G L hand side vector will be assembled, and finally a linear sys- where the definitions of the bilinear forms aG , aL and b are tem will be solved. Now, let us consider that a local detail given below. is missing from the modelling (crack, hole,...). One cannot easily use the initial homogeneous model defined above, as aG (u,v)= Cε(u(x)) : ε(v(x))dx (11) Ω it would require to adapt it. When the detail location is not G known a priori, such model adaptation can be very intrusive a (u,v)= Cε(u(x)) : ε(v(x))dx (12) and computationally expensive. L ΩL M. Duval et al. (v) = ( ) · v( ) + ( ) · v( ) ( ) = ϕi · ϕ j lG fN x x dx fΩ x x dx CG ij ψ Gdx (20) Γ Ω Γ N,G G (13) ( ) = ϕi · ϕ j CL ij ψ L dx (21) Γ lL (v) = fN (x) · v(x)dx + fΩ (x) · v(x)dx ΓN,L ΩL (14) If one had to use a monolithic solver when computing the solution of Eq. (10), the resulting finite element linear system b(ϑ, u) = ϑ(x) · u(x)dx (15) would be the following: Γ ⎡ ⎤ ⎡ ⎤ ⎡ ⎤ According to our example (the locally cracked body), one KG 0 C UG FG ⎢ G ⎥ ⎢ ⎥ ⎢ ⎥ can use a standard Finite Element Method (FEM) on the ⎢ ⎥ ⎣ ⎦ = ⎣ ⎦ ⎣ 0 K L −C ⎦ UL FL (22) global part, and an eXtended Finite Element Method (XFEM, L C −C 0 Ψ 0 [67]) on the local cracked part (another solution would be to G L rely on an analytical model for the local part, [80]), as it Remark In order to simplify notations, we decided to not is done in [85,86]. Let us define ϕ , ϕ and ϕψ the basis G L explicitly use restriction and prolongation operators. Instead, functions of the finite element spaces corresponding to the we will denote the restriction operation with ·|, and the pro- discretization of U , U and Uψ . We will give more details G L longation operation with · each time it is needed. Such on the discretization of the Lagrangian dual space further, restriction and prolongation operators are merely boolean- still it may be noted that the mortar coupling used here allows matrix based operators which cast a shape-given object into non-conforming meshes at the interface Γ . an other one by gathering selected values or by supplement- Let us also consider the triangulations T and T of Ω G L G ing it with zeros. and ΩL respectively, and TΓ,G and TΓ,L their restriction on Γ . We will denote nG the number of degrees of freedom of For instance, we define here C G the prolongation of CG from T ×T T ×T TG and nΓ,G the number of degrees of freedom of TΓ,G . Γ,ψ Γ,G to Γ,ψ G . Then C G stands for a matrix of × Then nL and nΓ,L will follow the same definition on the shape nΓ,ψ nG , all the remaining coefficients being filled local domain. Finally, TΓ,ψ will stands for the Lagrangian with zeros. The same procedure defines C L , the prolongation multipliers mesh on the interface Γ , with nΓ,ψ its number of of CL from TΓ,ψ × TΓ,L to TΓ,ψ × TL . degrees of freedom. Remark It is not an easy task to set up a “good” basis of Then we can define the finite element matrices (stiffness Uψ when discretizing the Lagrange multipliers [83,84]. If matrices, coupling matrices and right-hand side vector): the basis is bad-chosen (i.e. the inf-sup conditions are not fulfilled), the mortar operator can lead to undesirable energy- – the stiffness matrices KG (nG ×nG matrix) and K L (nL × free oscillations of the displacement fields. For the sake of nL matrix) ease, we chose in this paper to use the local finite element basis on the interface for the Lagrange multiplier as well (i.e. TΓ,ψ = TΓ, ϕψ = ϕ Γ i j L and L on ). The main consequence of (KG )ij = Cε(ϕ (x)) : ε(ϕ (x))dx (16) G G such a choice is that matrix CL is a square invertible matrix: ΩG we will not have to rely on least squares methods (i.e. gener- ( ) = ε(ϕi ( )) : ε(ϕ j ( )) K L ij C L x L x dx (17) alised inverse matrix) when performing the interface projec- Ω L tions. We never encountered instabilities in all the test cases we set up using that Lagrange multipliers basis. – the right-hand side load vectors FG (vector of size nG ) and Of course, the idea behind such a domain decomposition is to FL (vector of size nL ) dissociate ΩG and ΩL when solving the problem. A solution is to set up an asymmetric global/local algorithm, i.e. solving ( ) = ( ) · ϕ j ( ) + ( ) · ϕ j ( ) FG j fN x G x dx fΩ x G x dx alternately Dirichlet and Neumann problems on the local and Γ Ω N,G G global models until convergence. (18) To that end, let us define interface projection like operator j j (FL ) j = fN (x) · ϕ (x)dx+ fΩ (x) · ϕ (x)dx P and P from the global to the local model and from the L L − Γ , Ω = 1 N L L local to the global model respectively, so that P CL CG (19) = − and P CGCL ). We will also denote ΛG = (KGUG − FG )|Γ and ΛL = – the coupling matrices CG (nΓ,ψ × nΓ,G matrix) and CL (K LUL − FL )|Γ the global and local reaction forces at the (nΓ,ψ × nΓ,L matrix) interface Γ . Non-intrusive Coupling: Recent Advances and Scalable Nonlinear Domain Decomposition
T T T = T ∪ T Algorithm 1: Global/local domain decomposition – As G is a part of (i.e. G G ), we naturally T = T Fixed point solver have Γ,G Γ,G . Ω Λ0 Then the objective is to replace the global model on G by Data: , L k=0 the local one on ΩL without actually modifying the global while η> do finite element operator K on Ω. From a practical point of Global problem computation (Neumann problem) view, we define U the fictitious prolongation of UG to the k+1 = − Λk KGUG FG P L Ω | = | = full domain , so that U Ω UG and U Ω UG (i.e. Local problem computation (Dirichlet problem) G G U is the prolonged part of the global solution U). − k+1 FL G K L C L UL = k+1 k+1 In our example, such a prolonged model will stand to a − Ψ −CL PU C L 0 G Γ Convergence test standard FEM model, the crack being absent from the pro- k+1 k+1 2 2 longation. η =Λ + P Λ / FG +FL G L = + Ω k=k+1 We define F F G F G the load vector defined on . end Then, applying the Chasles relation on Eq. (11) in the discrete Result: U k , U k G L form gives us the following equality which will be used to adapt Algorithm. 1.
= + KU K GU K G U (23)
Using this equality at the global computation step gives us the expression of the equation standing for the global model Λ = ( − ) at each iteration k, with G KG UG FG Γ .
+ KUk 1 = F − P Λk + Λk (24) L G
The global/local coupling algorithm can then be given in its non-intrusive form.
Algorithm 2: Global/local domain decomposition – Fig. 2 Situation overview: non-intrusive global/local problem Non-intrusive fixed point solver Data: , Λ0 , Λ0 L G k=0 The convergence test used here relies on the reaction equi- while η> do Global problem computation librium between the two domains. KUk+1 = F − P Λk + Λk Global/local model coupling is a powerful tool to han- L G Local problem computation dle heterogeneity at a local scale when performing structural − k+1 F K L C L UL = L analysis. Nevertheless, it implies to set up the two models − Ψ k+1 − k+1 C L 0 CL PU Γ each time one wants to run a computation. For instance, in Convergence test η =Λk+1 + Λk+1/ 2 + 2 our example, if the crack grows, the mesh partitioning will G P L FG FL k=k+1 not stand right for long; then one needs to adapt both global end and local models, which can be very time consuming (in k k Result: U , UL terms of human and computer resources). Still, there exists a way to keep a global model unchanged when performing a global/local computation: the non-intrusive coupling [82]. It must be noted that, thanks to the prolongation of the global model (i.e. the non-intrusiveness of the coupling), the 2.2 Global/Local Non-intrusive Coupling Method stiffness matrix K will be assembled and factorised only once. In our example, even if the crack grows, the global The principle of non-intrusive coupling is to rely on an exist- model will stand unmodified. The coupling will only involve Ω = Ω ∪ Ω Γ ing global model on G G (see Fig. 2), its triangu- displacements and forces exchange at the interface (see lation T , and its corresponding stiffness matrix K (a n × n Fig. 3). matrix). It may also be noted that the fictitious prolongation of the T Ω Ω From now on, we denote G the triangulation of G and global solution U on G has no physical meaning, and that × Λ0 the corresponding stiffness matrix KG (a nG nG matrix). its value depends on the initialisation (i.e. the values of L M. Duval et al.
First of all, let us remark the following equilibrium equa- tion of the global model at iteration k:
KUk = F + Λk + Λk (25) G G
It is possible to reformulate Algorithm 2 into a Newton-like algorithm (i.e. in an incremental formulation) by adding a −KUk term both on the left and right side of the global domain equation at iteration k. + K U k 1 − U k = F − P Λk + Λk − KUk (26) L G
Indeed, making use of Eq. (25) into Eq. (26), one can give the following formulation:
+ − Fig. 3 Situation overview: non-intrusive coupling U k 1 = U k − K 1 f (U k) (27) and Λ0 ). Nevertheless, as such fictitious solution has to be where f is the finite element operator computing the forces G replaced by the one obtained from the local model, that is of equilibrium residual between ΩG and ΩL obtained from the no consequence. global displacement U k at iteration k. There are several advantages arising from non-intrusive ( k) = Λk + Λk coupling: f U G P L (28)
One can remark that Eq. (27) looks very much like a modified Ω – the global mesh on is always left unchanged (so is the Newton method prescribed on f = 0 (we look for the solu- global stiffness matrix K ), which is convenient when the tion which verify the interface forces equilibrium). In fact, objective is to investigate local details on large scale struc- let us show now that K ≈∇f . tures (i.e. involving a large number of degrees of freedom), Let us define SL and SG the primal Schur complements – when dealing with local nonlinear details (we will present (Dirichlet problem with prescribed displacement on Γ ) cor- applications involving nonlinear local behaviour further), responding to the local model K L and to the fictitious global one can use a linear (i.e. fast) solver for the global model model KG respectively [38]. Then we get the following con- and use a nonlinear solver only for the local part, densed equilibrium equations on the interface Γ : – the local model acts as a correction applied to the global Λ = | model on the right-hand side, so that different research L SL PUΓ (29) Λ = | codes or commercial software can be easily merged in a G SG U Γ (30) non-intrusive way. Λ We then introduce G in Eq. (28) so that it can be rewritten in the following way: 2.3 Incremental Formulation: Additive Global Correction f (U k) = Λk + P Λk + Λk − Λk (31) G L G G The main drawback of Algorithm 2 is its low convergence speed. In fact, the convergence speed depends of the stiff- Λk +Λk = k − Still, from Eq. (25)wehave G KU F. Wefinally Ω Ω G ness gap between L and G : the more the gap is important, get the exact formulation of the interface residual function the more the convergence is slow. Such phenomenon is not f . shown to be significant for local plasticity problem [34,61]. In our example, we shall see that it can be a severe disadvan- ( k) = k − + ( − ) k f U KU F P SL P SG U (32) tage when the crack grows: the stiffness gap increases as the crack spreads since the global model does not include any We can then give the expression for ∇ f : representation of the crack. Then, acceleration techniques ∇ = + ( − ) will be used in order to improve the convergence speed of f K P SL P SG (33) the algorithm. We propose in that section an incremental formulation of It can be seen that K is a good approximation of ∇ f as soon the non-intrusive global/local algorithm as a prerequisite for as the condensed stiffness between the local and the global the acceleration technique setting up. models at the interface are close. Non-intrusive Coupling: Recent Advances and Scalable Nonlinear Domain Decomposition
∇ − = − δ f K P SL P SG (34) Let us define the predicted displacement increment such ¯ k+1 k as δk+1 = (U − U )|Γ . Then the relaxation parameter ω In practice (e.g. local cracked domain, see Fig. 3), the pre- is dynamically updated following the recursive formula vious hypothesis does not stand for true any longer: in that below. case, matrix K is a bad approximation of the true gradient δ (δ − δ ) ∇ k+1 k+2 k+1 f . Thus, the modified Newton scheme (27), when used as ω + =−ω k 1 k 2 (40) such, would lead to tremendous iterations number when the δk+2 − δk+1 crack grows. 0 ¯ 0 In fact, as soon as Ω is stiffer than Ω , the algorithm No relaxation is applied to the first two iterations (U = U L G ¯ becomes divergent [21]. and U 1 = U 1) and the relaxation parameter initial value is set to ω0 = 1. The Aitken’s Delta Squared method involves very few 2.4 Convergence Properties: Relaxation computing overhead, as it only involve displacement value at the interface obtained from the two previous iterations. In our case Eq. (27) is fully equivalent to the fixed point + Moreover, such method will be shown to improve the con- equation prescribed on U k 1 = g(U k) with the following vergence speed of the initial algorithm. definition of g.
− 2.6 Quasi-Newton Acceleration g(U) = U − K 1 f (U) (35) An other possible way to speed up the convergence is to rely Then, using relaxation (with a well-chosen constant parame- on Quasi-Newton methods to update matrix K . The Sym- ter ω) enforces stability of the numerical scheme, ensuring metric Rank One (SR1) update is an easy-to-implement and Ω Ω convergence of the algorithm even if L is stiffer than G . efficient way to build a sequence of matrices Kk convergent In the present situation, relaxation will consist in a two-step toward ∇ f , assuming K0 = K when initialising the algo- ¯ k+1 computation at iteration k. First a predicted value U is rithm [22,34,50–52,69]. k k+1 k k+1 computed from the previous solution U , then this value is Let us define dk = U − U and yk = f (U ) − ω k corrected using a relaxation parameter . f (U ). At iteration k, we seek to update Kk into Kk+1 with + the SR1 formula, i.e. with a rank-one symmetric update, U¯ k 1 = g(U k) (36) while verifying the secant equation for each iteration k > 1: k+1 = ω ¯ k+1 + ( − ω) k U U 1 U (37) Kk+1 = Kk + ρvv (41) The optimal relaxation parameter ω can be computed upon Kk+1dk = yk (42) the knowledge of the eigenvalues of the iteration opera- tor [21], or using a power-type method during the first In Eq. (41), v is a vector with the same shape than U and global/local iterations in order to get an cheaper approxi- ρ =±1. Then, making use of Eq. (41) into Eq. (42) gives us mation. the following relation: Still, computing a good relaxation parameter remains computationally very expensive. yk − Kkdk = ρvv dk (43)
2.5 Dynamic Relaxation: Aitken’s Delta Squared As it can be seen in Eq. (43), yk − Kkdk and v are collinear Acceleration (ρv dk being a scalar), thus there exists a real α such as v get the following expression: Then a possibility is to rely on dynamic relaxation, i.e. com- ω puting a new parameter k for each iteration, assuming that v = α(yk − Kkdk) (44) we can provide an easy and cheap way for the computation of ω . ( − ) = k Then, again from Eq. (43), we have dk yk Kkdk + ρd vv d which leaves us with U¯ k 1 = g(U k) (38) k k k+1 ¯ k+1 k U = ω U + (1 − ω )U (39) k k ρ = { ( − )} sgn dk yk Kkdk (45) In this paper, we investigate dynamic relaxtion based upon the Aitken’s Delta Squared formula [49,54,61], also used in as vv is a positive-semidefinite matrix. Finally, the value of Fluid-Structure Interaction (FSI). α arises using Eq. (44) into Eq. (43). M. Duval et al.
α2 = 1 2.7 Non-intrusive Coupling Illustration: Crack Growth (46) ( − ) Simulation dk yk Kkdk
All in all, one can retrieve the well-known SR1 update for- Actually, crack propagation is the most obvious example of mula: local detail whose effects on the global structure are the most visible (structure failure in the worst case). ( − )( − ) In this section, we give a simple example to illustrate the = + yk Kkdk yk Kkdk Kk+1 Kk (47) non-intrusive global/local coupling algorithm and its prop- d (y − K d ) k k k k erties. Note that in the context of the SR1 update, Eq.(27) rewrites We consider here a rectangular two-dimensional domain (200 × 80)mm. The material law is linear elastic in plane Kkdk =−fk, so that Eq. (47) can be given in a simplified k strain conditions (E = 200GPa and ν = 0.3) and the global form (where fk = f (U )): loading is represented by a uniform pressure of magnitude 10 MPa as depicted in Fig. 2. The initial crack is defined by f + f = + k 1 k+1 = Kk+1 Kk (48) a vertical notch extending from the crack tip position (xct d f + k k 1 −50 mm and yct = 35 mm from the center of the plate). We investigated quasi-static crack growth simulation Finally, thanks to the SR1 formula, we get a simple expres- using the non-intrusive coupling method presented in the sion of the tangent matrix update for each iteration k.Never- previous sections. We computed fourteen quasi-static prop- theless, one has to keep in mind the non-intrusiveness con- agation steps, using a constant growth increment Δ = 3.75 straint of the coupling algorithm, i.e. do not modify the global mm (this value is linked to the patch mesh size at crack tip, stiffness matrix K . This can be achieved using the Sherman- hct , such as Δ = 8hct ). For each step, the crack bifurcation Morrison formula on Eq. (48), leaving us with the following −1 angle is determined using the maximal hoop stress criterion. relation which can be used in order to compute Kk f in a We give in Fig. 4 an illustration of the displacement field −1 iterative manner based upon the knowledge of K0 f . (modulus) from the last propagation step. As one may notice, the initial mesh on Ω (see Fig. 4a) is unaffected by the −1 −1 −1 −1 fk+1 Kk crack spreading, while the global solution is. Global and local K = K − K f + (49) k+1 k k k 1 ( + −1 ) meshes are not bound to be coinciding at the interface, allow- fk+1 dk Kk fk+1 ing for a flexible local mesh refinement. We give here the algorithmic version of such iterative relation We applied both Quasi-Newton SR1 and Aitken’s Delta [34]. Squared acceleration techniques to the non-intrusive cou- pling algorithm with relative tolerance ε = 10−10 for each crack propagation step. Algorithm 3: Non-intrusive global correction The main properties of the coupling algorithm and the Data: fk acceleration techniques can be pulled out from Fig. 5.The i = 0 −1 first graph gives the link between the crack length and the Compute K0 fk while i < k do number of iterations needed to reach the equilibrium, and the −1 −1 −1 −1 f + K fk second one represents the residual evolution over the algo- K f = K f − K f + i 1 i i+1 k i k i i 1 ( + −1 ) fi+1 di Ki fi+1 rithm iterations (for the last crack propagation step). = + i i 1 Two main observations can be underlined. First, it can end =− −1 be observed in Fig. 5a that, using no acceleration, the num- dk Kk fk Result: dk ber of iterations required to reach the chosen tolerance is strongly dependant of the crack length (i.e. the stiffness gap between the local XFEM model and the global FEM model, At iteration k, we suppose that {( fi )i Fig. 4 Non-intrusive crack growth simulation. a Global FEM solution. b Local XFEM solution. c Composite solution (a) (b) )mmmmm(U x10 13.272 9.9541 6.6361 3.3180 0 (c) the other results. In the context of crack propagation, the as said previously, the convergence rate depends on the Aitken’s Delta Squared method proves to be less efficient stiffness gap between the local and the global model. Let than the Quasi-Newton acceleration. In the present case, the us consider the bending plate case with a static crack last propagation step took 3006 iterations without any accel- defined by a vertical ray extending from the crack tip eration to converge, 389 with the Aitken’s accelerator and 18 position (xct = 0 mm and yct = 10 mm from the using the Quasi-Newton update. center of the plate). We applied the non-intrusive cou- Note that as soon as the crack nearly divides the plate pling algorithm considering several patch thicknesses (one into two parts, one can no longer reasonably speak of to ten global stitch layers), as depicted on Fig. 6.In “global/local” situation. Nevertheless, nothing prevents the the present example, the global model stands for a non- coupling scheme to be applied to such critical situation, cracked plate whereas the local model stands for the cracked which allows us to distinctly analyse the differences between one. According to the Saint-Venant principle, the crack the two acceleration techniques. influence will decrease when getting far from the pertur- bation. Thus, the more the patch extends far from the 2.8 Patch Geometry: Influence on the Convergence Speed crack, the less the stiffness gap between the global and the local model will be important, allowing for faster con- In the previous example, the patch is defined on the region of vergence (this is numerically illustrated on Fig 7). More- interest which needs to get worked out with a specific local over such phenomenon can be observed both with standard model (the crack). However, we never gave details about how and accelerated algorithms. Nevertheless the Quasi-Newton is defined the patch, and what is the influence of its spatial update is the only one guaranteeing a reasonable number extend until now. of iterations which is almost independent from the patch In fact, there is no constrain nor generic rule about the way thickness. to define the patch extend. In the crack propagation case, Still one should keep in mind that the more the patch we simply select a given number of stitch layers from the extends, the more it will be computationally expensive to global mesh around the crack location. Then those stitches work out as it will involve a larger number of degrees of are duplicated and saved as local mesh (see Figs. 4a and freedom, particularly in the nonlinear case. Meanwhile, the b). Any refinement of the freshly generated local mesh is local patch should be large enough in order to fully take into possible, particularly at crack tip. account the local behaviour (XFEM enrichment for instance). Nevertheless, the patch extend is not without conse- Then engineer’s skills must prevail in order to determine the quence on the algorithm convergence properties. Indeed, best choice of parameters in such situations. M. Duval et al. • Γ =∪6 Γ Γ = ∂Ω = The patches boundary i=1 i with i G ,i ∂ΩL,i • The global domain internal boundary ΓG = ∂ΩG \∂Ω = ∂Ω Γ ⊂ Γ G so that G Note that as soon as all the local models do not share any common interface (e.g. if we added a gap between the local domains), then ΓG = Γ . Remark In the previous application (crack propagation), Ω = Ω we considered L G (i.e. as the crack was repre- sented through the XFEM method, the geometrical domain remained unaffected). In the present situation, we allow the (a) local patches to redefine the geometry of the pre-existing Ω = Ω global model, so that L G because of holes [21]. The only required condition is that the interfaces remain coin- cident (from a geometrical point of view, non-conforming meshes are still handled using the mortar method). In the non-intrusive coupling context, we consider the Ω = Ω ∪ Ω global model on G G as a linear elastic mate- Ω rial, the holes being absent from the global model on G . The holes will be represented only through the local models Ω , on L i i∈{1..6} and elastic-plastic constitutive law will be applied. From now on, we need to extend the projection operator Γ = −1 P to each interface i , so that Pi CL,i CG,i .Wewillalso keep the definition of ΛG = (KGUG − FG )|Γ and Λ , = G G i (b) (K , U , − F , ) . Nevertheless, as we consider elastic- G i G i G i Γi plastic (i.e. nonlinear) behaviour on the local models, the Fig. 5 Crack growth simulation: acceleration techniques comparison. Λ = ζ( , , ) ζ a Crack spreading: dependence between crack length and convergence local reaction reads L,i UL,i FL,i Xi where is a speed. b Final crack propagation step: residual evolution nonlinear function computing the reaction forces from the displacement UL,i , the right-hand side loading FL,i and the plastic internal variables Xi. 3 New Advances Based on the Non-intrusive Coupling Thus, the multi-patch non-intrusive coupling algorithm can be established, KL being the nonlinear local solver. 3.1 Parallel Processing and Multi-Patch Approach Algorithm 4: Multi-patch coupling – Non-intrusive In this part, we seek to extend the non-intrusive cou- fixed point solver pling method to the multi-patch situation. We give here 0 0 Data: , Λ , , Λ detailed explanations about the way multi-patching is han- L i i∈{1..6} G,i i∈{1..6} dled in specific situations (e.g. two patches share a common k=0 η> interface), and we provide a MPI based parallel process- while do Global problem computation ing method to increase the computational efficiency of the k+1 6 k k KU = F + = −P Λ + Λ algorithm. i 1 i L,i G ,i Local problems computations Let us consider a multi-perforated plate subjected to a k+1 k+1 uniform tension (see Fig. 8). U , = KL,i Pi U , FL,i , Xi ∀i ∈{1..6} L i Γi In order to set up the coupling algorithm, one has to define Convergence test the following items: η =Λk+1 + 6 Λk+1 / 2 + 6 2 G i=1 Pi L,i FG i=1 FL,i k=k+1 end • The united local domain Ω =∪6 Ω , Result: U k , U k L i=1 L i L,i ∈{ .. } • Ω =∪6 Ω i 1 6 The united global fictitious domain G i=1 G ,i Non-intrusive Coupling: Recent Advances and Scalable Nonlinear Domain Decomposition Fig. 6 Patch definition and extend: global stitches selection. a Global mesh (One stitch thick layer). b Local refined mesh. c Global mesh (Five stitches thick layer). d Local refined mesh. e Global mesh (Ten stitches thick layer). f Local refined mesh (a) (b) (c) (d) (e) (f) As it has been done in the previous section (mono-patch case), the problem can be rewritten in an incremental version as U k+1 = U k − K −1 f (U k). Thus the difference here is that f is a nonlinear operator as soon as the local domains involve elastic-plastic behaviour.