Lecture Notes on Distributions

Hasse Carlsson
2011

Preface

Two important methods in analysis are differentiation and Fourier transformation. Unfortunately, not all functions are differentiable or have a Fourier transform. The theory of distributions tries to remedy this by imbedding classical functions in a larger class of objects, the so-called distributions (or generalized functions). The basic idea is not to think of functions as pointwise defined but rather as a "mean value". A locally integrable function $f$ is identified with the map
$$\varphi \mapsto \int f\varphi,$$
where $\varphi$ belongs to a space of "nice" test functions, for instance $C_0^\infty$. As an extension of this we let a distribution be a linear functional on the space of test functions. When extending operations such as differentiation and Fourier transformation, we do this by transferring the operations to the test functions, where they are well defined.

Let us for instance see how to define the derivative of a locally integrable function $f$ on $\mathbb{R}$. If $f$ is continuously differentiable, an integration by parts implies that
$$\int f'\varphi = -\int f\varphi'.$$
Now we use this formula to define the derivative of $f$ when $f$ is not classically differentiable: $f'$ is the map
$$\varphi \mapsto -\int f\varphi'.$$

In these lectures we will study how differential calculus and Fourier analysis can be extended to distributions, and study some applications, mainly in the theory of partial differential equations. The presentation is rather short, and for a deeper study I recommend the following books:

Laurent Schwartz. Théorie des Distributions I, II. Hermann, Paris, 1950-51.
Lars Hörmander. The Analysis of Linear Partial Differential Operators I, 2nd ed. Springer, Berlin, 1990.

Contents

1. A primer on $C_0^\infty$-functions
2. Definition of distributions
3. Operations on distributions
4. Finite parts
5. Fundamental solutions of the Laplace and heat equations
6. Distributions with compact support
7. Convergence of distributions
8. Convolution of distributions
9. Fundamental solutions
10. The Fourier transform
11. The Fourier transform on $L^2$
12. The Fourier transform and convolutions
13. The Paley-Wiener theorem
14. Existence of fundamental solutions
15. Fundamental solutions of elliptic differential operators
16. Fourier series
17. Some applications
    17.1 The central limit theorem
    17.2 The mean value property for harmonic functions
    17.3 The Heisenberg uncertainty principle
    17.4 A primer on Sobolev inequalities
    17.5 Minkowski's theorem

Chapter 1. A primer on $C_0^\infty$-functions

When we shall extend differential calculus to distributions, it is suitable to use infinitely differentiable functions with compact support as test functions. In this chapter we will show that there are "a lot of" $C_0^\infty$-functions.

Notation

Let $\Omega$ be a domain in $\mathbb{R}^n$. $C^k(\Omega)$ denotes the $k$ times continuously differentiable functions on $\Omega$ ($k$ may be $+\infty$). $C_0^k(\Omega)$ are those functions in $C^k(\Omega)$ with compact support. We denote points in $\mathbb{R}^n$ by $x = (x_1,\dots,x_n)$, and $dx = dx_1 \cdots dx_n$ denotes the Lebesgue measure. For a vector $\alpha = (\alpha_1,\dots,\alpha_n) \in \mathbb{N}^n$ we let
$$|\alpha| = \alpha_1 + \dots + \alpha_n, \qquad \alpha! = \alpha_1! \cdots \alpha_n!, \qquad x^\alpha = x_1^{\alpha_1} \cdots x_n^{\alpha_n}$$
and
$$\partial^\alpha f = \frac{\partial^\alpha f}{\partial x^\alpha} = \frac{\partial^{\alpha_1}}{\partial x_1^{\alpha_1}} \cdots \frac{\partial^{\alpha_n}}{\partial x_n^{\alpha_n}} f.$$

Example 1.1. With these notations the Taylor polynomial of $f$ of degree $N$ can be written as
$$\sum_{|\alpha| \le N} \frac{\partial^\alpha f(a)}{\alpha!}\, x^\alpha.$$

As described in the preface, to a function $f \in L^1_{loc}$ we will associate the map $\Lambda_f$, given by
$$\varphi \mapsto \int_{\mathbb{R}^n} f\varphi \, dx, \qquad \varphi \in C_0^\infty.$$

Problem. Does the map $\Lambda_f$ determine $f$? More precisely, if $f, g \in L^1_{loc}$ and
$$\int_{\mathbb{R}^n} f\varphi \, dx = \int_{\mathbb{R}^n} g\varphi \, dx, \qquad \varphi \in C_0^\infty,$$
does this imply that $f = g$ a.e.?
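(The following numerical sketch is not part of the original notes; it is a hypothetical illustration of the pairing $\Lambda_f(\varphi) = \int f\varphi\,dx$, using the bump function constructed in Example 1.4 below as test function. The grid, the Riemann-sum quadrature, and the choice $f(x) = |x|$ are all assumptions made for this example.)

```python
import numpy as np

# Hypothetical illustration (not from the notes): the pairing
# Lambda_f(phi) = \int f(x) phi(x) dx, approximated by a Riemann sum.

def phi(x):
    """A C_0^infty test function: exp(-1/(1 - x^2)) on (-1, 1), zero outside
    (the bump of Example 1.4 in one dimension, up to normalization)."""
    out = np.zeros_like(x)
    inside = np.abs(x) < 1
    out[inside] = np.exp(-1.0 / (1.0 - x[inside] ** 2))
    return out

def Lambda(f, x):
    """Approximate Lambda_f(phi) = \int f phi dx on the grid x."""
    dx = x[1] - x[0]
    return np.sum(f(x) * phi(x)) * dx

# f(x) = |x| is locally integrable but not classically differentiable at 0;
# it is nevertheless perfectly visible to test functions through its
# "mean values" against phi.
x = np.linspace(-2.0, 2.0, 4001)
print(Lambda(np.abs, x))
```

The point of the sketch: $f(x) = |x|$ has no classical derivative at the origin, yet $\Lambda_f$ is a perfectly well-defined linear functional; this is the sense in which distributions view a function only through its mean values.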
To be able to solve this problem we need to construct functions $\varphi \in C_0^\infty$. We start with

Example 1.2. There are functions $f \in C^\infty(\mathbb{R})$ with $f(x) = 0$ when $x \le 0$ and $f(x) > 0$ when $x > 0$.

Remark 1.3. There is no such real analytic function.

Proof. Such a function must satisfy $f^{(n)}(0) = 0$ for all $n$. Thus $f(x) = O(x^n)$, $x \to 0$, for all $n$, and a real analytic function with this property vanishes identically near $0$.

Guided by this, we put
$$f(x) = \begin{cases} e^{-1/x}, & x > 0 \\ 0, & x \le 0. \end{cases}$$
We have to prove that $f \in C^\infty$. By induction we have
$$f^{(n)}(x) = \begin{cases} P_n\!\left(\tfrac{1}{x}\right) e^{-1/x}, & x > 0 \\ 0, & x \le 0 \end{cases}$$
for some polynomials $P_n$. This is clear when $x \ne 0$. But at the origin we have, if $h > 0$,
$$\frac{f^{(n)}(h) - f^{(n)}(0)}{h} = \frac{1}{h}\, P_n\!\left(\frac{1}{h}\right) e^{-1/h} \to 0, \qquad h \to 0.$$

Example 1.4. There are non-trivial functions in $C_0^\infty(\mathbb{R}^n)$.

Proof. Let $f$ be the function in Example 1.2 and put $\varphi(x) = f(1 - |x|^2)$.

Approximate identities

Pick a function $\varphi \in C_0^\infty(\mathbb{R}^n)$ with $\int \varphi = 1$ and $\varphi \ge 0$. For $\delta > 0$ we let $\varphi_\delta(x) = \delta^{-n} \varphi(x/\delta)$. Then $\varphi_\delta \in C_0^\infty(\mathbb{R}^n)$ and $\int \varphi_\delta = 1$. The family $\{\varphi_\delta;\ \delta > 0\}$ is called an approximate identity ("$\varphi_\delta \to \delta$").

Regularization by convolution

The convolution of two functions $f$ and $\varphi$ is defined by
$$f * \varphi(x) = \int_{\mathbb{R}^n} f(x-y)\,\varphi(y)\,dy.$$
The convolution is defined, for instance, if $f \in L^1_{loc}$ and $\varphi \in C_0^\infty$. Then $f * \varphi = \varphi * f$, $f * \varphi \in C^\infty$ and $\partial^\alpha(f * \varphi) = f * \partial^\alpha \varphi$.

Exercise 1.1. Verify this.

Theorem 1.5.
a) If $f \in C_0$, then $f * \varphi_\delta \to f$, $\delta \to 0$, uniformly.
b) If $f$ is continuous at $x$, then $f * \varphi_\delta(x) \to f(x)$, $\delta \to 0$.
c) If $f \in L^p$, $1 \le p < +\infty$, then $f * \varphi_\delta \to f$ in $L^p$ (and a.e.).

Remark 1.6. a) implies that $C_0^\infty(\Omega)$ is dense in $C_0(\Omega)$ (in the supremum norm).

Exercise 1.2. Verify this.

Proof. a) Take $R$ so that $\operatorname{supp} \varphi \subset \{x;\ |x| \le R\}$. We have
$$|f * \varphi_\delta(x) - f(x)| \le \int_{|y| \le \delta R} |f(x-y) - f(x)|\,\varphi_\delta(y)\,dy \le [\text{uniform continuity}] \le \varepsilon \int_{\mathbb{R}^n} \varphi_\delta(y)\,dy = \varepsilon,$$
if $\delta$ is small enough.

b) Exercise 1.3.

c) Jensen's inequality implies
$$|f * \varphi_\delta(x) - f(x)|^p \le \left( \int_{\mathbb{R}^n} |f(x-y) - f(x)|\,\varphi_\delta(y)\,dy \right)^p \le \int_{\mathbb{R}^n} |f(x-y) - f(x)|^p\, \varphi_\delta(y)\,dy = \int_{\mathbb{R}^n} |f(x - \delta t) - f(x)|^p\, \varphi(t)\,dt.$$
Using Fubini's theorem and the notation $f^{\delta t}(x) = f(x - \delta t)$, we get
$$\|f * \varphi_\delta - f\|_p^p \le \int_{\mathbb{R}^n} \varphi(t)\,dt \int_{\mathbb{R}^n} |f(x - \delta t) - f(x)|^p\,dx = \int_{\mathbb{R}^n} \|f^{\delta t} - f\|_p^p\, \varphi(t)\,dt \to 0.$$
That the limit is zero follows by dominated convergence and the fact that translation is continuous on $L^p$. This in turn follows since $C_0$ is dense in $L^p$, $1 \le p < +\infty$: If $g \in C_0$, then
$$\|g^\delta - g\|_p^p = \int_K |g(x - \delta) - g(x)|^p\,dx \to 0, \qquad \delta \to 0,$$
by dominated convergence. Now approximate $f \in L^p$ with $g \in C_0$, $\|f - g\|_p < \varepsilon$. Minkowski's inequality (the triangle inequality) implies
$$\|f^\delta - f\|_p \le \|f^\delta - g^\delta\|_p + \|g^\delta - g\|_p + \|g - f\|_p \le 2\varepsilon + \|g^\delta - g\|_p \le 3\varepsilon,$$
if $\delta$ is small enough.

Exercise 1.4. a) Let $B_r = \{x;\ |x| < r\}$. Construct a function $\psi_\delta \in C_0^\infty(\mathbb{R}^n)$ such that $0 \le \psi_\delta \le 1$, $\psi_\delta = 1$ on $B_r$ and $\operatorname{supp} \psi_\delta \subset B_{r+\delta}$. How big must $\|\partial^\alpha \psi_\delta\|_\infty$ be?

b) Let $K \subset \Omega$ where $K$ is compact and $\Omega$ is open in $\mathbb{R}^n$. Construct $\psi \in C_0^\infty(\Omega)$ with $\psi = 1$ on a neighborhood of $K$ and $0 \le \psi \le 1$. How big must $\|\partial^\alpha \psi\|_\infty$ be?

Now we are able to answer yes to the problem stated above.

Theorem 1.7. A locally integrable function that is zero as a distribution is zero a.e.

Proof. We assume that $\int f\varphi = 0$ for all $\varphi \in C_0^\infty$. According to Theorem 1.5 a), we then have $\int f\Phi = 0$ for all $\Phi \in C_0$, and thus $f = 0$ a.e. (for instance by the Riesz representation theorem).

Alternatively we can argue as follows: Take $\psi_n \in C_0^\infty$ with $\psi_n(x) = 1$ when $|x| \le n$. Then $f\psi_n \in L^1$ and
$$f\psi_n * \varphi_\delta(x) = \int_{\mathbb{R}^n} f(y)\,\psi_n(y)\,\varphi_\delta(x-y)\,dy = 0,$$
since $y \mapsto \psi_n(y)\varphi_\delta(x-y)$ is $C_0^\infty$. But $f\psi_n * \varphi_\delta \to f\psi_n$ in $L^1$ according to Theorem 1.5 c). Hence $f\psi_n = 0$ a.e., and thus $f = 0$ a.e.
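(A hypothetical numerical sketch, not part of the original notes, of the regularization in Theorem 1.5 a): mollifying $f(x) = |x|$ with the approximate identity built from the bump of Example 1.4. The grid resolution and the compact window $[-2, 2]$ are assumptions of this sketch; $|x|$ is not in $C_0$, so we only check uniform convergence on a compact set.)

```python
import numpy as np

# Hypothetical numerical sketch (not from the notes) of Theorem 1.5 a):
# mollify f(x) = |x| with phi_delta(x) = delta^{-1} phi(x/delta).

def bump(x):
    """C_0^infty bump supported in [-1, 1] (unnormalized)."""
    out = np.zeros_like(x)
    inside = np.abs(x) < 1
    out[inside] = np.exp(-1.0 / (1.0 - x[inside] ** 2))
    return out

x = np.linspace(-3.0, 3.0, 6001)
dx = x[1] - x[0]
f = np.abs(x)  # continuous, but not differentiable at 0

for delta in [0.5, 0.1, 0.02]:
    phi_delta = bump(x / delta)                  # supported in [-delta, delta]
    phi_delta /= np.sum(phi_delta) * dx          # normalize: integral = 1
    smooth = np.convolve(f, phi_delta, mode="same") * dx   # f * phi_delta
    err = np.max(np.abs(smooth - f)[np.abs(x) < 2])        # sup-norm on [-2, 2]
    print(f"delta = {delta:5.2f}   sup|f*phi_delta - f| ~ {err:.4f}")
```

The printed sup-norm error shrinks with $\delta$ (roughly linearly here, since $|x|$ is Lipschitz), in line with the uniform convergence asserted by the theorem.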
Chapter 2. Definition of distributions

Definition 2.1. Let $\Omega$ be an open domain in $\mathbb{R}^n$. A distribution $u$ in $\Omega$ is a linear functional on $C_0^\infty(\Omega)$ such that for every compact set $K \subset \Omega$ there are constants $C$ and $k$ such that
$$|u(\varphi)| \le C \sum_{|\alpha| \le k} \|\partial^\alpha \varphi\|_\infty, \qquad (2.1)$$
for all $\varphi \in C_0^\infty$ with $\operatorname{supp} \varphi \subset K$.

We denote the distributions on $\Omega$ by $\mathcal{D}'(\Omega)$. If the same $k$ can be used for all $K$, we say that $u$ has order $\le k$. These distributions are denoted $\mathcal{D}'_k(\Omega)$. The smallest $k$ that can be used is called the order of the distribution. $\mathcal{D}'_F = \bigcup_k \mathcal{D}'_k$ are the distributions of finite order.

Example 2.2. (a) A function $f \in L^1_{loc}$ is a distribution of order 0.
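(The following one-line estimate is written out here for convenience; it is not spelled out in the notes, but it is the standard verification that (2.1) holds for $\Lambda_f$ with $k = 0$.)

```latex
% For \varphi \in C_0^\infty(\Omega) with \operatorname{supp}\varphi \subset K:
\[
  |\Lambda_f(\varphi)|
    = \Bigl|\int_K f\varphi \, dx\Bigr|
    \le \int_K |f|\,|\varphi| \, dx
    \le \Bigl(\int_K |f| \, dx\Bigr)\,\|\varphi\|_\infty ,
\]
% so (2.1) holds with C = \int_K |f|\,dx and k = 0; this constant is
% finite for every compact K precisely because f \in L^1_{loc}.
```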