Chapter 5

Finite Element Methods (FEM)

5.1 Generalities

Remark 5.1 Finite element methods. Finite element methods were one of the main topics of Numerical 3. The knowledge of the lecture notes of Numerical Mathematics 3 is assumed. Only a few issues, which are important for this course on finite element methods for convection-dominated problems, will be reminded here. Let h be a family of regular triangulations consisting of mesh cells K . The triangulations{T } are assumed to be quasi-uniform. The diameter of a mesh{cell} K is denoted by hK and it is h = maxK hK . Parametric finite element spaces will be considered with affine maps between{ a reference} cell Kˆ and all physical cells K. 2

s Theorem 5.2 Local interpolation error estimate. Let IK : C (K) P (K) be an interpolation operator as defined in Numerical Mathematics 3, where→P (K) is a polynomial space defined on K. Let p [1, ] and (m + 1 s)p > 1. Then there is a constant c, which is independent of ∈v W∞m+1,p(K), such− that ∈ (k) m+1−k (m+1) (v IK v) chK v , 0 k m + 1, (5.1) − Lp(K) ≤ Lp(K) ≤ ≤ for all v W m+1,p( K). ∈ Proof: See lecture notes of Numerical Mathematics 3.

Theorem 5.3 Inverse estimate. Let 0 k l be natural numbers and let ≤ ≤ p, q [1, ]. Then there is a constant Cinv, which depends only on k, l, p, q, Kˆ , Pˆ(Kˆ ) such∈that∞ − − −1− −1 Dlvh C h(k l) d(p q ) Dkvh vh P (K). (5.2) Lq (K) ≤ inv K Lp(K) ∀ ∈

Pro of: See lecture notes of Numerical Mathema tics 3.

5.2 The Galerkin Method

Remark 5.4 The Galerkin method. The standard finite element method, which just replaces in the variational formulation (4.2) the space V by V h V , is called Galerkin method: Find uh V h, such that for all vh V h ⊂ ∈ ∈ ε uh, vh + b uh + cuh, vh = f, vh . (5.3) ∇ ∇ · ∇    48 2

Figure 5.1: Boris Grigorievich Galerkin (1871 – 1945).

Theorem 5.5 Lemma of Cea. Let V h V and assume the conditions of the Theorem of Lax–Milgram, Theorem 4.10. ⊂Then there is a unique solution of the problem to find uh V h such that ∈ a uh, vh = f vh vh V h (5.4) ∀ ∈ and it holds the error estimate   M u uh inf u vh , (5.5) − V ≤ m vh∈V h − V where u is the unique solution of the continuous problem (4.5) and the constants are defined in Definition 4.8.

Proof: The existence of a unique solution of the discrete problem follows directly from the Theorem of Lax–Milgram, since the subspace of a is also a Hilbert space and the properties of the carry over from V to V h. Computing the difference of the continuous equation (4.5) and the discrete equation (5.4) yields a u uh, vh = 0 vh V h. − ∀ ∈ With   m v 2 a(v, v) and a(u, v) M u v k kV ≤ | | ≤ k kV k kV if follows for all vh V h that ∈ 2 1 1 u uh a u uh, u uh = a u uh, u vh − V ≤ m − − m − −     M h h u u u v . ≤ m − V − V

This inequality is equivalent to the statemen t of the theorem.

Remark 5.6 Application to the Galerkin finite element method. The properties of the bilinear form from problems (4.2) and (5.3) were studied in Example 4.9. It was shown that with appropriate regularity assumptions and under condition (4.4), the bilinear form is bounded with a constant M of order max b , c and {k k∞ k k∞}

49 it is coercive with m = ε. In this case, one can apply the Lemma of Cea and one obtains the error estimate max b , c u uh C {k k∞ k k∞} inf u vh , C R. − V ≤ ε vh∈V h − V ∈

In the singularly p erturbed case ε b ∞, the first factor of this estimate becomes very large.  k k Thus, from this error estimate one cannot expect that the Galerkin finite element solution is accurate unless the second factor, which is the best approximation error, is very small. On uniformly refined grids, the best approximation error becomes very small only if the dimension of V h becomes very large. 2

Example 5.7 Galerkin method for P1 finite elements in one dimension. Let Ω = (0, 1) and consider the case that the coefficients of the problem are constant, i.e., b(x) = b, c(x) = c and f(x) = f. For the P1 finite element on the reference cell Kˆ = [ 1, 1], the functions and their derivatives are given by − 1 1 1 1 φˆ (xˆ) = ( xˆ + 1), φˆ0 (xˆ) = , φˆ (xˆ) = (xˆ + 1), φˆ0 (xˆ) = . 1 2 − 1 −2 2 2 2 2

Consider the matrix entry ai,i+1, which is computed with the test function φi(x), which is transformed to φˆ1(xˆ), and the ansatz function φi+1(x), which is transformed to φˆ2(xˆ). The common support is the mesh cell [xi, xi+1]. Let h denote the length of this cell, then one obtains 2ε 1 1 1 1 1 1 a = dxˆ + b ( xˆ + 1) dxˆ i,i+1 h 2 · −2 2 · 2 − Z−1   Z−1 ch 1 1 1 + (xˆ + 1) ( xˆ + 1) dxˆ 2 2 2 − Z−1 ε b ch = + + . −h 2 6 For the i-th component of the right-hand side, one gets 1 fi = f φi(x) dx = hf. Z0 The other matrix entries can be calculated in a similar way, exercise. If one applies the trapezoidal rule for the approximation of the integrals, one obtains ch 1 1 1 ch 0 + 0 ( xˆ + 1) (xˆ + 1) dxˆ = 2 = 0. 2 2 − 2 2 2 Z−1 Then, it follows that ε b a = + . i,i+1 −h 2 For c = 0 (or with trapezoidal rule) these are, apart of a factor h, the same matrix entries as for the central difference scheme, see Remark 3.10. One gets for c = 0 hεD+D−u + hb D0u = hf εD+D−u + b D0u = f . − i i i i ⇐⇒ − i i i i Hence, one obtains in this special case the same results with the Galerkin finite element method and the central finite difference scheme. For singularly perturbed problems, these results are very bad. The equivalence of the central finite difference method and the Galerkin finite element method generally does not hold if the coefficients are not constant. In higher dimensions, there are differences even for constant coefficients. But still, these two methods give very similar results. At any rate, the Galerkin finite element method is not useful in the singularly perturbed case also in more general situations. 2

50 5.3 Stabilized Finite Element Methods

Remark 5.8 On the H1(Ω) norm for the of singularly perturbed problems. Consider the problem: Find u V = H1(Ω) such that ∈ 0 a(u, v) = f(v) v V (5.6) ∀ ∈ with

a(u, v) := ε u(x) v(x) + b(x) u(x)v(x) + c(x)u(x)v(x) dx, Ω ∇ · ∇ · ∇ Z   f(v) := f(x)v(x) dx. ZΩ Let the condition 1 b(x) + c(x) µ > 0 almost everywhere in Ω −2∇ · ≥ 0 be satisfied, which is stronger than condition (4.4). Then, an analogous calcula- tion as in Example 4.9 shows that a( , ) is uniformly coercive with respect to the following norm, which depends on ε, · ·

2 2 2 2 2 v := ε v + µ v = ε v 2 + µ v 2 , k kε | |1,2 0 k k0 k∇ kL (Ω) 0 k kL (Ω) i.e., there is a constant m which does not depend on ε such that

a(v, v) m v 2 v V. ≥ k kε ∀ ∈ Applying integration by parts, exercise, shows that there is a constant M, which is also independent on ε, such that

a(v, w) M v w 1 (v, w) V V. | | ≤ k kε k kH (Ω) ∀ ∈ × However, there is no constant M˜ , which is independent of ε, with

a(v, w) M˜ v w (v, w) V V. | | ≤ k kε k kε ∀ ∈ × Applying the estimates with constants that are independent of ε, one obtains in a similar way as in the proof of the Lemma of Cea that

h h u u C inf u v 1 , − ε ≤ vh∈V h − H (Ω) with C independent of ε. If V h is a standard finite elemen t space (piecewise poly- nomial), then one can show that in layers it is

h inf u v 1 for ε 0, vh∈V h − H (Ω) → ∞ → for fixed h. Consequently, there is no uniform convergence u uh 0 for − ε → h 0. The norm 1 is not suited for the investigation of numerical methods → k·kH (Ω) for convection-dominated problems. It turns out that the use of appropriate norms is important for the numerical analysis of finite element methods for convection- dominated problems. 2

51 5.3.1 Petrov–Galerkin Methods and Upwind Methods Remark 5.9 Petrov–Galerkin method. A finite element method, whose ansatz and test space are different, is called Petrov–Galerkin method. Let Sh be the ansatz space and T h be the test space with dim(Sh) = dim(T h), then the Petrov–Galerkin method reads as follows: Find uh Sh such that ∈ a uh, vh = f vh vh T h. ∀ ∈   2

Example 5.10 Petrov–Galerkin method and upwind method. Consider

εu00(x) + bu0(x) = 0 − with b R 0 and homogeneous Dirichlet boundary conditions. Use as functions in the ansatz∈ \{space} continuous piecewise linear functions

(x x − )/h for x [x − , x ], − i 1 ∈ i 1 i φ (x) = (x x)/h for x [x , x ], , i = 1,..., N 1. i  i+1 − ∈ i i+1 −  0 else, Define the bubblefunctions 2 4(x xi−1)(xi x)/h for x [xi−1, xi], σ − (x) = i 1/2 0 − − else. ∈  The test functions will be defined as piecewise quadratic functions 3 ψ (x) = φ (x) + κ σ − (x) σ (x) , i = 1,..., N 1, i i 2 i 1/2 − i+1/2 −  where κ is an upwind parameter which has to be chosen. A direct calculation shows, exercise, that one obtains the following method

1 1 εD+D−u + b κ D+u + + κ D−u = 0. − i 2 − i 2 i      For the choice κ = sgn(b)/2, one obtains the simple upwind finite difference scheme, see Definition 3.33. A test function ψi(x), defined in the nodes 0, 0.5, 1 , for κ = 1/2 is presented in Figure 5.2. { } 2

Remark 5.11 Fitted upwind schemes. It is also possible to define fitted upwind methods with Petrov–Galerkin methods, even for non-constant coefficients. How- ever, one does not obtain better results as for the Iljin–Allen–Southwell method, Theorem 3.57, i.e., one obtains not more than linear convergence. 2

5.3.2 The Streamline-Upwind Petrov–Galerkin (SUPG) Method Remark 5.12 Goal. The goal consists in the construction of a method that is more stable than the Galerkin finite element method and which can be used with finite elements of arbitrary order. The convergence of this method, in an appropriate norm, should be of higher order. Consider the problem (4.1) and assume at the moment that condition (4.4) is satisfied. Later, an even stronger assumption will be made. 2

52 Figure 5.2: Piecewise quadratic test function for κ = 1/2.

Remark 5.13 The basic idea. The basic idea consists in a penalization of large values of the so-called strong residual. Such methods are called residual-based stabilizations. Given a linear partial differential equation in strong form

A u = f, f L2(Ω), str str ∈ and its Galerkin finite element discretization: Find uh V h such that ∈ ah uh, vh = f, vh vh V h. (5.7) ∀ ∈

For residual-based stabilizations, a mo dification of Astr is needed which is well- defined for finite element functions. This modification should be also a linear oper- h h 2 ator and it is denoted by Astr : V L (Ω). The (strong) residual is now defined by → rh uh = Ah uh f L2(Ω). str − ∈ In general, it holds rh uh =0, but a good numerical approximation of the solution of the continuous problem should6 have in some sense a small residual. Now, instead  of finding the solution of (5.7), the minimizer of the residual is searched, i.e, the following optimization problem is considered

h h 2 h h h h arg min r u L2(Ω) = arg min r u , r u . (5.8) uh∈V h uh∈V h    The necessary condition for taking the minimum is the vanishing of the Gˆateaux h derivative. This derivative is computed by using the linearity of Astr and the bilin- earity of the inner product in L2(Ω)

rh uh + εvh , rh uh + εvh rh uh , rh uh 0 = lim − → ε 0  ε    rh uh + εAh vh, rh uh + εAh vh rh uh , rh uh = lim str str − → ε 0   ε    = 2 rh uh , Ah vh vh V h. str ∀ ∈ It follows that the necessary condition for the solution of (5.8) is

h h h h r u , Astrv = 0.   53 A generalization consists in considering the minimization problem

2 arg min δ1/2rh uh = arg min δrh uh , rh uh . (5.9) 2 uh∈V h L (Ω) uh∈V h

   with the positive weigh ting functi on δ(x). Analogously to the derivation for the special case, one obtains as necessary condition for the minimum

h h h h δr u , Astrv = 0. (5.10) The solutions of (5.8) or (5.9) will not be identical to the solution of the Galerkin discretization (5.7). It turns out that the reason for the Galerkin discretization to fail is that the solution possesses structures (scales) that are important but which are not resolved by the used finite element space (grid). For convection-diffusion problems, such structures are layers, particularly at boundaries. The numerical methods should also compute sharp layers. However the sharpness of layers in numerical solutions is restricted by the resolution, which is generally much coarser than the layer width. Hence, even for a numerical solution with sharp layers, the residual in the layer regions are very large. In particular, a numerical solution with sharp layers (with respect to the resolution of the finite element space) will not be the minimizer of (5.8) or (5.9), see Figure 5.3. The minimizers of (5.8) or (5.9) tend to possess strongly smeared layers and these solutions are useless in applications. For this reason, one considers in residual-based stabilizations a combination of the Galerkin discretization (5.7) and the minimization of the residual ah uh, vh + δrh uh , Ah vh = f, vh vh V h. (5.11) str ∀ ∈ The goal of numerical analysis consists in determining  the weighting function δ optimally in an asymptotic sense. 2

Figure 5.3: Function with sharp layer (solid line) and optimal piecewise linear ap- proximation in a mesh cell K (dashed line). The equation which is fulfilled by the function in K is far from being satisfied by the piecewise linear approxima- tion. Hence, despite the approximation is of the type considered to be optimal, the residual will be large.

Definition 5.14 Streamline-Upwind Petrov–Galerkin FEM, SUPG meth- od, Streamline-Diffusion FEM, SDFEM. The Streamline-Upwind Petrov– Galerkin (SUPG) FEM or Streamline-Diffusion FEM (SDFEM) has the form: Find uh V h, such that ∈ ah uh, vh = f h vh vh V h (5.12) ∀ ∈   54 with

ah(v, w) := a(v, w) (5.13)

+ δ ε∆v(x) + b(x) v(x) + c(x)v(x) b(x) w(x) dx, K − · ∇ · ∇ ∈T h K KX Z    f h(w) := (f, w) + δ f(x) b(x) w(x) dx. K · ∇ ∈T h K KX Z  

Here, δK are user-chosen weights, which are called stabilization parameters or SUPG{parameters.} 2

Remark 5.15 Concerning the SUPG method. The method was developed in Hughes and Brooks (1979); Brooks and Hughes • (1982). The name “SUPG” comes from the fact that the method can be considered as • a Petrov–Galerkin method with the test space

span w(x) + δ b(x) w(x) .  K · ∇  ∈T h  KX  The SUPG method introduces artificial diffusion only in streamline direction • b(x) w(x). From this property, the name “Streamline Diffusion FEM” origi- nates.· ∇ h The operator Astr is given in the second part of the bilinear form (5.13). The • second derivative for finite element functions is defined only piecewise. h In the stabilization term of the SUPG method, not the strong operator Astr • applied to the test function is used, as in (5.11), but only the first order term contained in this expression. However, for singularly perturbed problems, the first order term is the dominating term of the strong operator applied to the test function. It is also possible to define a method with the strong operator applied to the test function, the so-called Galerkin least squared (GLS) method. In general, one gets similar results with the SUPG and the GLS method, but the SUPG method is easier to implement. Generally, the SUPG parameter is a general function. However, in practice it • is often chosen as a piecewise constant function. The goal of the finite element error analysis consists in proposing a good choice of this parameter. 2

Example 5.16 SUPG in one dimension for P1 finite elements. Consider Ω = (0, 1) h and V = P1 on an equidistant grid with hi = h, i = 1,..., N. If all coefficients are constant, c = 0, and if one chooses the SUPG parameter also as a constant, then the left-hand side of the SUPG method reduces to

N 0 0 0 xi 0 0 ε( uh , vh ) + (b uh , vh) + δ ε 0 + b uh (x) b vh (x) dx − · i=1 Zxi−1   0 0 X 0  0 0   = ε( uh , vh ) + b( uh , vh) + δb2( uh , vh ).

This expression is of the form of a Galerkin  finite elemen t metho d for an equation with left-hand side ε + δb2 u00(x) + bu0(x). − 

55 It is known from Example 5.7 that the Galerkin finite element method is equivalent to a central finite difference scheme. The right-hand side of the SUPG method is

N N xi 0 xi 0 (f, vh) + δ fb vh (x) dx = (f, vh) + δfb vh (x) dx i=1 Zxi−1 i=1 Zxi−1 X  X  =0 h = (f, v ) = hfi.| {z } 0 The sum vanishes, since each test function vh (x) can be written as a linear combination of the basis functions φ (x) of P and the integral of the derivative i 1  of each basis function vanishes. Alternativ{ } ely, one can apply integration by parts to check this fact. Altogether, the SUPG method with the conditions stated above is equivalent to the fitted finite difference scheme (3.10)

b2 ε 1 + δ D+D−u + bD0u , = f , − ε i i i   i.e., σ(q) = 1 + δb2/ε, q = bh/(2ε). Choosing the SUPG parameter by

h 1 δ(q) = coth(q) , 2b − q   then it is hb2 1 1 σ(q) = 1 + coth(q) = 1 + q coth(q) = q coth(q). 2bε − q − q     One obtains the Iljin–Allen–Southwell scheme. With δ = h/(2b), one gets the simple . These simple connections do not hold in higher dimensions. 2

Definition 5.17 Consistent finite element method. Let u(x) be a sufficiently smooth solution of: Find u V such that ∈ a(u, v) = f(v) v V, ∀ ∈ where a( , ) is an appropriate bilinear form and f( ) an appropriate functional. A finite elemen· · t method related to this problem: Find· uh V h such that ∈ ah uh, vh = f h vh vh V h ∀ ∈ is called consistent, if  

ah u, vh = f h vh vh V h. (5.14) ∀ ∈   2

Remark 5.18 Consistency. Note that consistency of a finite element method is not the same as consistency of a finite difference method, see Definition 3.6. For finite element methods, consistency means that a sufficiently smooth solution satisfies also the discrete equation. 2

56 Lemma 5.19 Galerkin . A consistent finite element method has the property of the Galerkin orthogonality ah u uh, vh = 0 vh V h. (5.15) − ∀ ∈ The error is “orthogonal” tothe finite element space.

Proof: The statement of the lemma follows immediately by subtracting (5.12) and (5.14).

Lemma 5.20 Consistency of the SUPG method. The SUPG method (5.12) – (5.13) is consistent. Proof: A sufficiently smooth solution u(x) of (4.2) satisfies the strong form of the equation even pointwise. Hence, the residual is pointwise zero. Inserting this solution into the SUPG formulation (5.12) – (5.13) results in a vanishing of the stabilization term. It remains a u, vh = f vh vh V h, ∀ ∈ which is satisfied by any weaksolution since Vh V . That means, the smooth solution ⊂ satisfies also the discrete equation.

Definition 5.21 SUGP norm. Let 1 b(x) + c(x) ω > 0. (5.16) − 2∇ · ≥ In V h, the SUPG norm is defined by

1/2 2 h h 2 h 2 1/2 h v := ε v + ω v + δK b v , SUPG  1 0 · ∇ 0,K  K∈T h X    where denotes the norm in L2(K). 2 k·k0,K Theorem 5.22 Coercivity of the SUPG bilinear form. Assume b W 1,∞(Ω), c L ∞(Ω), (5.16), and let ∈ ∈ 2 1 hK ω 0 < δK min 2 , 2 , (5.17) ≤ 2 (εCinv c ∞ ) k kL (K) where Cinv is the constant in the inverse estimate (5.2) Then, the SUPG bilinear form is coercive with respect to the SUPG norm, i.e., it is

1 2 ah vh, vh vh vh V h. ≥ 2 SUPG ∀ ∈  Proof: Integration by parts gives, see Example 4.9, b b vh + cvh, vh = ∇ · + c vh, vh vh V h. · ∇ − 2 ∀ ∈      With the definition of ω, one obtains

2 b x 2 2 h h h h x ( ) h x x 1/2 b h a v , v = ε v + c( ) ∇ · v ( ) d + δK v 1 Ω − 2 · ∇ 0,K   Z     K∈T h   X ≥ω>0 | {zh } h h + δK ε∆v (x) + c(x)v (x) b(x) v (x) dx h K − · ∇ KX∈T Z     2 h h h h v δK ε∆v (x) + c(x)v (x) b(x) v (x) dx . ≥ SUPG − K − · ∇ K∈T h Z X    

57 Now, the last term will be estimated from above. Then, one obtains altogether an estimate from below if the estimate of the last term is subtracted from the first term. In the following estimate, one uses the conditions (5.17) on the SUPG parameter. It is for each K h ∈ T

h h h δK ε∆v (x) + c(x)v (x) b v (x) dx − · ∇ ZK     1/2 h x 1/2 b h x x δK ε ∆v ( ) δK v ( ) d ≤ K · ∇ Z    1/2 1/2 + δ c(x) v h(x) δ b v h(x) dx K | | K · ∇ ZK CS    1/2 h 1/2 h 1/2 b h δK ε ∆v + δK c L∞(K ) v δK v ≤ 0,K k k 0,K · ∇ 0,K     (5.2) εC 1/2 inv h 1/2 h 1/2 b h δK v + δK c L∞(K) v δK v ≤ hK ∇ 0,K k k 0,K · ∇ 0,K     (5.17) hK εC inv h √ω h 1/2 b h v + c L∞(K) v δK v ≤ √2εCinv hK ∇ 0,K √2 c ∞ k k 0,K ! · ∇ 0,K k kL (K)  

ε h ω h 1/2 b h = v + v δK v 2 ∇ 0,K 2 0,K · ∇ 0,K r r  Young 2 2  2 2 ε h 1 1/2 b h ω h 1 1/2 b h v + δK v + v + δK v ≤ 2 ∇ 0,K 4 · ∇ 0,K 2 0,K 4 · ∇ 0,K     1 h = v . 2 SUPG,K

Now, the pro of is finished by summing over all mesh cells and inserting this estimate in the first estimate of the proof.

Corollary 5.23 Coercivity of the SUPG bilinear form for P1 finite ele- ments. Let the assumptions of Theorem 5.22 with respect to the coefficients of the problem be valid. For piecewise linear finite elements, the SUPG bilinear form (5.13) is coercive with respect to the SUPG norm if ω 0 < δK 2 . (5.18) ≤ c ∞ k kL (K) Proof: The proof is the same as for Theorem 5.22, where one uses that for piecewise h h linear finite elements ∆v (x) K = 0 for all K . Thus, the corresponding terms do | ∈ T not appear in the proof.

Corollary 5.24 Existenz and uniqueness of a solution of the SUPG method. Let the assumptions of Theorem 5.22 and Corollary 5.23, respectively, be valid. Then, the SUPG finite element method (5.12) – (5.13) has a unique solution.

Proof: The statement is obtained by the application of the Theorem of Lax–Milgram, Theorem 4.10. The coercivity of the bilinear form was proved in Theorem 5.22 and Corol- lary 5.23, respectively. For the boundedness, one uses similar estimates as in the proof of Theorem 5.22 and in Example 4.9, exercise.

Remark 5.25 On the coercivity of the SUPG bilinear form. The proof of Theorem 5.22 is typical for the numerical analysis of stabilized finite • element methods. One tries to get rid of the troubling terms by estimating them with the used norm. This approach works only if one uses an appropriate norm. In particular, the stabilization has to appear in the norm. Theorem 5.22 provides an upper bound for the SUPG parameter. This bound • is generally not critical in applications.

58 From Theorem 5.22 one obtains the stability of the SUPG method with respect • to the SUPG norm. Stability means that an appropriate norm of the solution can be estimated with the data of the problem. It is

h 2 u SUPG 2ah uh, uh = 2f h uh ≤ h h = 2 f,u + 2 δK f(x) b(x) u (x) dx K · ∇ K∈T h Z  X   CS 2 h 1/2 1/2 h f 0 √ω u + 2 δK f δK b u ≤ √ω k k 0 0,K · ∇ 0,K K∈T h X 

Young 2 2 1 h 2 1/2 h C f 0 + ω u + δK b u . ≤ k k 2  0 · ∇ 0,K  K∈T h X    Now, the last terms on the right-hand side can be absorbed in the left-hand side and one has stability. The stability constant depends on ω and on the upper bound of δK . All vh V h satisfy • ∈ vh min 1, ω vh . SUPG ≥ { } ε

Hence, the SUPG metho d is also stable with resp ect to the norm ε. With respect to this norm, also the Galerkin finite element method is stable,k·khowever this method is not stable with respect to the SUPG norm. That means, the stability of the SUPG method is stronger than the stability of the Galerkin finite element method. 2

Theorem 5.26 Convergence of the SUPG method. Let the solution of (4.2) satisfy u Hk+1(Ω), k 1, let b W 1,∞(Ω), c L∞(Ω), and consider the SUPG ∈ ≥ ∈ ∈ method for Pk finite elements. Let the SUPG parameter be given as follows

2 hK C0 for hK < ε, δK = ε (5.19)  C h for ε h ,  0 K ≤ K where the constant C0 > 0 is sufficiently small such that (5.17) is satisfied for k 2 h ≥ or (5.18) for k = 1, respectively. Then, the solution u Pk of the SUPG method (5.12) satisfies the following error estimate ∈

u uh C ε1/2hk + hk+1/2 u , − SUPG ≤ | |k+1   where the constant C does not depend on ε and h.

Proof: Let uh V h be the Lagrange interpolant of u(x). One obtains with the I ∈ triangle inequality

h h h h u u u uI + uI u . − SUPG ≤ − SUPG − SUPG

The first term on the righ t-hand side is the in terpolation error. Using the interpolation error estimate (5.1), which is applied for each term of the SUPG norm individually, one

59 Figure 5.4: Joseph–Louis Lagrange (1736 – 1813).

gets

h u uI − SUPG

1/2 2k 2 2(k+1) 2 b 2 2k 2 Cεh u k+1 + Cωh u k+1 + C δK ∞,K hK u k+1,K ≤  | | | | h k k | |  KX∈T  1/2  C εh2k + h2(k+1) + h2k+1 u ≤ | |k+1   C ε1/2hk + hk+1/2 u . ≤ | |k+1   Here, it was used that for both regimes it is δK C0hK Ch. ≤ ≤ Consider now the second term on the right-hand side. The coercivity, Theorem 5.22, and the Galerkin orthogonality yield

2 1 h h h h h h h h h h h uI u a uI u , uI u = a uI u, uI u . 2 − SUPG ≤ − − − −     h h h h Now, the triangle inequalit y is applied to a u u, u u and every term is estimated I − I − individually. In these estimates, the interpolation estimate (5.1) plays an important role.  Let wh = uh uh. One obtains for the diffusion term I − ε uh u , wh ∇ I − ∇ CS    h h 1/2 h 1/2 h ε uI u w = ε uI u ε w ≤ ∇ − 0 ∇ 0 ∇ − 0 ∇ 0 (5.1)     1/2 k 1/ 2 h 1/2 k h C ε h u k+1 ε w Cε h u k+1 w . ≤ | | ∇ 0 ≤ | | SUPG

For the reactive term, one obtains in a similar way

CS h h h h −1/2 h 1/2 h c(uI u), w c ∞ uI u w = ω c ∞ uI u ω w − ≤ k k − 0 0 k k − 0 0   (5.1) k+1 h Ch u k+1 w . ≤ | | SUPG

Next, the terms are considered which come from the SUPG stabilization. Since for both

60 2 regimes it is εδK C0h , one gets ≤ K

h b h ε∆ uI u , δK w − − · ∇ K K∈T h X     CS 1/2 h 1/2 1/2 1/2 b h ε ∆ uI u ε δK δK w ≤ − 0,K · ∇ 0,K K∈T h     X 1/2 1/2 h 1/2 b h C0 hK ε ∆ uI u δK w ≤ − 0,K · ∇ 0,K K∈T h     X 1/2 1/2 CS 2 2 1/2 1/2 h 1/2 b h C0 ε h ∆ uI u δK w ≤  − 0,K   · ∇ 0,K  K∈T h   K∈T h   X X  1/2  1/2 (5.1) 2 1/2 2(k−1) 2 1/2 b h Cε h hK u k+1,K δK w ≤  | |   · ∇ 0,K  K∈T h K∈T h   X X 1/2 k h    Cε h u k+1 w . ≤ | | SUPG

For the other terms, one obtains with the relation δK C0hK , which holds for both ≤ regimes,

h h h b u u + c u u , δK b w · ∇ I − I − · ∇ K∈T h X        CS b h 1/2 1/2 b h ∞ uI u δK δK w ≤ k k ∇ − 0,K · ∇ 0,K K∈T h     X h 1/2 1/ 2 b h + c ∞ uI u δK δK w k k − 0,K · ∇ 0,K K∈T h   X

1/2 h 1/2 b h C hK uI u δK w ≤  ∇ − 0,K · ∇ 0,K K∈T h     X  1/2 h 1/2 b h + hK uI u δK w − 0,K · ∇ 0,K  K∈T h   X 1/2  1/2 CS 2 2 1/2 h h Ch uI u + uI u ≤  ∇ − 0,K   − 0,K   K∈T h   K∈T h  X X   1/2    2 1/2 b h δK w ×  · ∇ 0,K  K∈T h   X (5.1)   k+1/2 k+3/2 h C h + h u k+1 w . ≤ | | SUPG  

To obtain an optimal estimate for the conv ec tiv e term, one has to apply first integration by parts

b uh u , wh = uh u , bwh = uh u, bwh · ∇ I − ∇ I − − I − ∇ ·            = uh u, ( b) wh uh u, b wh . − I − ∇ · − I − · ∇     Both terms on the right-hand side are estimated separately. Using the same tools as for

61 the other estimates, ones obtains

1/2 2 h b h −1/2 b h 1/2 h uI u, ( ) w ω ∞ uI u ω w − ∇ · ≤ k∇ · k  − 0,K  0   K∈T h X k+1 h  Ch u k+1 w . ≤ | | SUPG

In the estimate of the other term, one has to distinguish if in the mesh cell K it is ε hK ≤ or ε > hK . One gets

uh u, b wh I − · ∇  CS  −1/2 h 1/2 b h δK uI u δK w ≤ − 0,K · ∇ 0,K ε≤hK   X b h h + ∞ uI u w k k − 0,K ∇ 0,K ε>hK X (5.1) −1/2 k+1 1/2 b h C δK hK u k+1,K δK w ≤  | | · ∇ 0,K ε≤hK   X  k+1 h + hK u k+1,K w | | ∇ 0,K  ε>hK X  C0h ≤δ ,ε>h K K K −1/2 −1/2 k+1 1/2 b h C C0 hK hK u k+1,K δK w ≤  | | · ∇ 0,K ε≤hK   X  k+1/2 1/2 h + hK u k+1,K ε w | | ∇ 0,K  ε>hK X  1/2 CS 2 k+1/2 1/2 b h h Ch u k+1 δK w + ε w  0,K 1 ≤ | |  h · ∇  KX∈T      k+1/2  h  Ch u k+1 w . ≤ | | SUPG

Summarizing all estimates, the statemen t of the theorem is proved.

Remark 5.27 Concerning the error estimate. In the convection-dominated regime ε h, the order of convergence in the • SUPG norm is k + 1/2 and in the diffusion-dominated case it is k. In the latter case, the SUPG norm is essentially the H1(Ω) semi norm such that order k is optimal. It is essential for obtaining an estimate with a constant C which is independent • of ε that the term 1/2 2 1/2 h δK b w  · ∇ 0,K  K∈T h X  is part of the norm, whic h is used for estimating the error. Such an estimate does not hold for the norm . k·kε For the interpretation of the results one has to take into account that different • stabilization parameters by choosing different values of C0 lead also to different norms on the left-hand side of the estimate. On the other hand, the value of a constant which is independent of ε is ques- • tionable since in general u depends on ε. | |k+1

62 In numerical simulations, often one can observe even convergence of order hk+1 • for the error in L2(Ω), in particular on structured grids. However, in Zhou (1997) examples were constructed which show that the estimate of Theorem 5.26 is sharp also for the error in L2(Ω). 2

Example 5.28 SUPG in one dimension, the user-chosen constant. The standard example εu00 + u0 = 1 on (0, 1), u(0) = u(1) = 0, − 0 b (x) does not fit into the theory of the SUPG method since c(x) 2 = 0. Nevertheless, one can apply the SUPG method also for this example. −An error estimate in the norm N 1/2 0 2 h 2 1/2 h ε v 1 + δK b v 0,K ! i=1 X  can be proved. One looses the control on the error in the L2(0, 1) norm. A fundamental problem in the application of the SUPG method is the free constant C0 in the definition of the parameter (5.19). At the present example, for ε = 10−6, one can observe very well that one obtains for different constants rather different numerical results, see Figure 5.5. If C0 is too large, then the layer is smeared, for an appropriate value of C0 one obtains a solution which is almost exact in the nodes, and if C0 is too small, then one can observe spurious (unphysical) oscillations in the layer.

Figure 5.5: Results obtained with the SUPG method for the standard one- dimensional example, C0 = 1, C0 = 0.5, C0 = 0.25 from left to right, h = 1/32, P1 finite elements.

For general problems, it is difficult to choose C0 appropriately. In higher di- mensions, it is in general not possible to find C0 such that the solution is (almost) exact in the nodes. Often, the solution computed with the SUPG method in higher dimensions exhibits spurious oscillations at the layers, e.g., see Example 5.30. 2

Remark 5.29 Different choices of the SUPG parameter. In practice, one takes instead of (5.19) also the parameter

hK 1 b L∞(K) hK δK = coth(PeK ) , PeK = k k , (5.20) 2 b ∞ − Pe 2ε k kL (K)  K  where PeK is the local P´eclet number, since in one dimensions one recovers under certain conditions the Iljin–Allen–Southwell scheme, see Example 5.16. There is no user-chosen constant in this parameter. Asymptotically, both parameters (5.19) and (5.20) have the same behavior. 2

63 Example 5.30 SUPG in two dimensions. A standard test problem in two dimen- sions has the form ε∆u + (1, 0)T u = 1 in Ω = (0, 1)2, − · ∇u = 0 on ∂Ω.

Besides the layer at the outflow boundary x = 1, there are also two layers at the boundaries y = 0 and y = 1. The layer at the outflow boundary is often called exponential layer and the layers parallel to the flow direction parabolic layers. −8 A numerical solution obtained for ε = 10 with the Q1 finite element method on a rather coarse grid and the SUPG parameter (5.20) is shown in Figure 5.6. One can see very well large spurious oscillations, in particular at the parabolic layers. These oscillations are a typical feature of solutions obtained with the SUPG method. They might become smaller with higher order elements or on finer grids. But they will generally vanish only if the layer is resolved.

Figure 5.6: Result obtained with the SUPG method, Q1 finite elements, and the SUPG parameter (5.20).

2

64