Math 524 – Linear Analysis on Manifolds Spring 2012
Pierre Albin Contents
Contents i
1 Differential operators on manifolds 3 1.1 Smooth manifolds ...... 3 1.2 Partitions of unity ...... 5 1.3 Tangent bundle ...... 5 1.4 Vector bundles ...... 8 1.5 Pull-back bundles ...... 9 1.6 The cotangent bundle ...... 11 1.7 Operations on vector bundles ...... 12 1.8 Sections of bundles ...... 14 1.9 Differential forms ...... 16 1.10 Orientations and integration ...... 18 1.11 Exterior derivative ...... 19 1.12 Differential operators ...... 21 1.13 Exercises ...... 22 1.14 Bibliography ...... 24
2 A brief introduction to Riemannian geometry 25 2.1 Riemannian metrics ...... 25 2.2 Parallel translation ...... 28 2.3 Levi-Civita connection ...... 32 2.4 Associated bundles ...... 33
i ii CONTENTS
2.5 E-valued forms ...... 34 2.6 Curvature ...... 36 2.7 Geodesics ...... 39 2.8 Complete manifolds of bounded geometry ...... 42 2.9 Riemannian volume and formal adjoint ...... 42 2.10 Hodge star and Hodge Laplacian ...... 44 2.11 Exercises ...... 47 2.12 Bibliography ...... 50
3 Laplace-type and Dirac-type operators 53 3.1 Principal symbol of a differential operator ...... 53 3.2 Laplace-type operators ...... 56 3.3 Dirac-type operators ...... 57 3.4 Clifford algebras ...... 59 3.5 Hermitian vector bundles ...... 63 3.6 Examples ...... 68 3.6.1 The Gauss-Bonnet operator ...... 68 3.6.2 The signature operator ...... 70 3.6.3 The Dirac operator ...... 71 3.6.4 Twisted Clifford modules ...... 73 3.6.5 The ∂ operator ...... 73 3.7 Exercises ...... 76 3.8 Bibliography ...... 78
4 Pseudodifferential operators on Rm 79 4.1 Distributions ...... 79 4.2 Examples ...... 84 4.2.1 Fundamental solutions ...... 84 4.2.2 Principal values and finite parts ...... 85 4.3 Tempered distributions ...... 86 CONTENTS 1
4.3.1 Convolution ...... 86 4.3.2 Fourier transform ...... 88 4.4 Pseudodifferential operators on Rm ...... 92 4.4.1 Symbols and oscillatory integrals ...... 92 4.4.2 Amplitudes and mapping properties ...... 98 4.5 Exercises ...... 106 4.6 Bibliography ...... 107
5 Pseudodifferential operators on manifolds 109 5.1 Symbol calculus and parametrices ...... 109 5.2 Hodge decomposition ...... 114 5.3 The index of an elliptic operator ...... 118 5.4 Pseudodifferential operators acting on L2 ...... 122 5.5 Sobolev spaces ...... 127 5.6 Exercises ...... 131 5.7 Bibliography ...... 134
6 Spectrum of the Laplacian 135 6.1 Spectrum and resolvent ...... 135 6.2 The heat kernel ...... 139 6.3 Weyl’s law ...... 146 6.4 Long-time behavior of the heat kernel ...... 148 6.5 Determinant of the Laplacian ...... 149 6.6 The Atiyah-Singer index theorem ...... 151 6.6.1 Chern-Weil theory ...... 151 6.6.2 The index theorem ...... 155 6.7 Examples ...... 157 6.8 Exercises ...... 159 6.9 Bibliography ...... 159 2 CONTENTS Lecture 1
Differential operators on manifolds
1.1 Smooth manifolds
The m-dimensional Euclidean space Rm, the m-dimensional sphere Sm and the m- dimensional torus Tm = S1 × ... × S1 are all examples of smooth manifolds, the context where our studies will take place. A general manifold is a space constructed from open sets in Rm by patching them together smoothly. Formally, M is a manifold if it is a paracompact1 Hausdorff topological space with an ‘atlas’, i.e., a covering by open sets {Vα} together with homeomorphisms
φα : Uα −→ Vα
m for some open subsets {Uα} of R . The integer m is known as the dimension of M and the maps φα : Uα −→ Vα (or sometimes just the sets {Vα}) are known as coordinate charts. An atlas is smooth if whenever Vαβ := Vα ∩ Vβ 6= ∅ the map
−1 −1 −1 (1.1) ψαβ := φβ ◦ φα : φα (Vαβ) −→ φβ (Vαβ)
−1 −1 m is a smooth diffeomorphism between the open sets φα (Vαβ), φβ (Vαβ) ⊆ R . (For ∞ k us smooth will always mean C .) If the maps ψαβ are all C -smooth, we say that the atlas is Ck.
If φα : Uα −→ Vα is a coordinate chart, it is typical to write
−1 (y1, . . . , ym) = φα : Vα −→ Uα and refer to the yi as local coordinates of M on Vα. 1 A topological space is paracompact if every open cover has a subcover in which each point is covered finitely many times.
3 4 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS
Two smooth atlases are compatible (or equivalent) if their union is a smooth atlas. A smooth structure on M is an equivalence class of smooth atlases or, equivalently, a maximal smooth atlas on M. A manifold M together with a smooth structure is a smooth manifold. To indicate that a manifold M is m-dimensional we will sometimes write M m or, more often, we will refer to M as an m-manifold. A more general notion than manifold is that of a manifold with boundary. The former is defined in the same way as a manifold, but the local model is
m m R− = {(x1, . . . , xn) ∈ R : x1 ≤ 0}. If M is a smooth manifold with boundary, its boundary ∂M consists of those points m in M that are in the image of ∂R− for some (and hence all) local coordinate charts. It is easy to check that ∂M is itself a smooth manifold (without boundary). The product of two manifolds with boundary is an example of a still more general notion, a manifold with corners. A map between two smooth manifolds (with or without boundary)
F : M −→ M 0 is smooth if it is smooth in local coordinates. That is, whenever
0 0 0 0 φα : Uα −→ Vα ⊆ M, φγ : Uγ −→ Vγ ⊆ M , are coordinate charts, the map
0 −1 0 (1.2) (φγ) ◦ F ◦ φα : Uα −→ Uγ is smooth (whenever it is defined). Notice that this definition is equivalent to making coordinate charts diffeomorphisms. The space of smooth maps between M and M 0 will be denoted C∞(M,M 0) and when M 0 = R, simply by C∞(M). Notice that a map F ∈ C∞(M,M 0) naturally induces a pull-back map
F ∗ : C∞(M 0) −→ C∞(M),F ∗(h)(p) = h(F (p)) for all h ∈ C∞(M 0), p ∈ M.
(So we can think of C∞ as a contravariant functor on smooth manifolds.)
For us a manifold will mean a connected smooth manifold without boundary, unless otherwise specified. Similarly, unless we specify otherwise, a map between manifolds will be smooth. 1.2. PARTITIONS OF UNITY 5
A subset X ⊆ M is a submanifold of M of dimension k if among the m coordinates on M at each point of X there are k that restrict to give coordinates of X. That is, at each point p ∈ X there is a coordinate chart of M, φ : U −→ V with q ∈ V, whose restriction to U ∩ Rk × {0} maps into X, φ : U ∩ k × {0} −→ V ∩ X. U∩Rk×{0} R These restricted charts then define a smooth k-manifold structure on X.
1.2 Partitions of unity
A manifold is defined by patching together many copies of Rm. It is often useful to take a construction on Rm and transfer it to a manifold by carrying it out on each patch and then adding the pieces together.
For these constructions, it is very convenient that every open covering Vα of a smooth manifold carries a partition of unity. That is, a collection ωi of smooth functions ωi : M −→ R such that: i) At each p ∈ M only finitely many ωi(p) are non-zero ii) Each ωi is supported inside some Vα iii) The sum of the ωi is the constant function equal to one at each point p ∈ M. One remarkable fact that follows easily from the existence of partitions of unity is that any compact smooth manifold can be identified with a smooth submanifold of RN for some N. This is easy to show using a finite atlas with a corresponding partition of unity. Whitney’s embedding theorem shows that any smooth manifold M embeds into R2 dim M+1 and a compact smooth manifold into R2 dim M .
1.3 Tangent bundle
We know what a smooth function
f : M −→ R is, what is its derivative? If γ is a smooth curve on M, that is, a smooth map
γ : R −→ M, then f ◦ γ is a map from R to itself so its derivative is unambiguously defined. We can take advantage of this to define the derivative of f at a point p ∈ M in terms of the curves through that point. 6 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS
Indeed, let us say that two curves γ1 and γ2 are equivalent at p if γ1(0) = γ (0) = p and 2 ∂ −1 ∂ −1 (φ ◦ γ1) = (φ ◦ γ2) ∂t t=0 ∂t t=0 for some (and hence any) coordinate chart φ : U −→ V ⊆ M containing p. It is easy to see that this is an equivalence relation and we refer to an equivalence class as a tangent vector at p ∈ M. The tangent space to M at p is the set of all tangent vectors at p and is denoted TpM. For the case of Euclidean space, the tangent space to Rm at any point is another copy of Rm, m m TpR = R . The tangent space of the m-sphere is also easy to describe. By viewing Sm as the unit vectors in Rm+1, curves on Sm are then also curves in Rm+1 so we have
m m+1 m+1 TpS ⊆ TpR = R , and a tangent vector at p will correspond to a curve on Sm precisely when it is orthogonal to p as a vector in Rm+1, thus
m m+1 TpS = {v ∈ R : v · p = 0}.
Clearly if γ1 is equivalent to γ2 at p then
∂ ∂ (f ◦ γ1) = (f ◦ γ2) ∂t t=0 ∂t t=0 for all f : M −→ R, so we can think of the derivative of f at p as a map from equivalence classes of curves through p to equivalence classes of curves through f(p), thus Dpf : TpM −→ Tf(p)R.
More generally, any smooth map F : M1 −→ M2 defines a map, known as its differential, DpF : TpM1 −→ TF (p)M2 by sending the equivalence class of a curve γ on M1 to the equivalence class of the curve F ◦γ on M2. When the point p is understood, we will omit it from the notation. If the differential of a map is injective we say that the map is an immersion, if it is surjective we say that it is a submersion. If F : M1 −→ M2 is an injective immersion which is a homeomorphism between M1 and F (M1) then we say that F is an embedding. A connected subset X ⊆ M is a submanifold of M precisely when the inclusion map X,→ M is an embedding. 1.3. TANGENT BUNDLE 7
If F is both an immersion and a submersion, i.e., if DF is always a linear isomorphism, we say that it is a local diffeomorphism. By the inverse function theorem, if F is a local diffeomorphism then every point p ∈ M1 has a neighborhood
V ⊆ M1 such that F V is a diffeomorphism onto its image. A diffeomorphism is a bijective local diffeomorphism. The chain rule is the fact that commutative diagrams
F1 M1 / M2
F2 F ◦F 2 1 ! M3 of smooth maps between manifolds induce, for each p ∈ M1, commutative diagrams between the corresponding tangent spaces,
DF1 TpM1 / TF1(p)M2
DF2 D(F ◦F ) 2 1 & TF2(F1(p))M3 i.e., that D(F2 ◦ F1) = DF2 ◦ DF1. Taking differentials of coordinate charts of M, we end up with coordinate charts on [ TM = TpM p∈M showing that TM, the tangent bundle of M, is a smooth manifold of twice the dimension of M. For Rm the tangent bundle is simply the product
m m m T R = R × R where the first factor is thought of as a point in Rm and the second as a vector ‘based’ at that point; the tangent bundle of an open set in Rm has a very similar description, we merely restrict the first factor to that open set. (We can think of M 7→ TM together with F 7→ DF as a covariant functor on smooth manifolds.)
If φ : U −→ V is a coordinate chart on M and p ∈ V ⊆ M, then Dpφ identifies m TpM with R . A different choice of coordinate chart gives a different identification m of TpM with R , but these identifications differ by an invertible linear map. This means that each TpM is naturally a vector space of dimension m, which however does not have a natural basis. Now that we know the smooth structure of TM, note that the differential of a smooth map between manifolds is a smooth map between their tangent bundles. 8 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS
There is also a natural smooth projection map
π : TM −→ M
m that sends elements of TpM to p and when M = R is simply the projection onto the first factor. It is typical, for any subset Z ⊆ M, to use the notations
−1 π (Z) =: TM Z =: TZ M.
1.4 Vector bundles
A coordinate chart φ : U −→ V on a smooth m-manifold M induces a coordinate chart on TM and an identification
−1 −1 Dφ m φ×id m (1.3) π (V) −−−−→ T U = U × R −−−−→V × R which restricts to each fiber π−1(p) to a linear isomorphism. We say that a coordinate chart of M locally trivializes TM as a ‘vector bundle’. A vector bundle over a manifold M is a locally trivial collection of vector spaces, one for each point in M. Formally a rank k real vector bundle consists of a smooth manifold E together with a smooth surjective map π : E −→ M such that the fibers π−1(p) form vector spaces over R and satisfying a local triviality condition: for any point p ∈ M and small enough neighborhood V of p there is a diffeomorphism k −1 Φ: V × R −→ π (V) whose restriction to any q ∈ V is a linear isomorphism
k −1 Φ q∈V : {q} × R −→ π (q).
We refer to the pair (V, Φ) as a local trivialization of E. We will often refer to a rank k real vector bundle as a Rk-bundle. The discus- sion below of Rk-vector bundles applies equally well to complex vector bundles, or Ck-vector bundles, which are the defined by systematically replacing Rk with Ck above. If X is a submanifold of M, and E −−→π M is a vector bundle, then
π|X E X −−−→ X is a vector bundle. 1.5. PULL-BACK BUNDLES 9
A subset of a vector bundle is a subbundle if it is itself a bundle in a consistent fashion. That is, E1 is a subbundle of E2 if E1 and E2 are both bundles over M, E1 ⊆ E2, and the projection map E1 −→ M is the restriction of the projection map E2 −→ M. A bundle morphism between two vector bundles
π1 π2 E1 −−→ M1,E2 −−→ M2 is a map F : E1 −→ E2 that sends fibers of π1 to fibers of π2 and acts linearly between fibers. Thus for any bundle morphism F there is a map F : M1 −→ M2 such that the diagram
F (1.4) E1 / E2
π1 π2 F M1 / M2 commutes; we say that F covers the map F. An invertible bundle morphism is called a bundle isomorphism. The trivial Rk-bundle over a manifold M is the manifold Rk × M with projec- tion the map onto the second factor. We will use Rk to denote the trivial Rk-bundle. A bundle is trivial if it is isomorphic to Rk for some k. A manifold is said to be par- allelizable if its tangent bundle is a trivial bundle, e.g., Rm is parallelizable. We will show later that Sm is not parallelizable when m is even. The circle S1 is parallelizable and hence (by exercise 1) so are all tori Tm. Some interesting facts which we will not show is that any three dimensional manifold is parallelizable and that the only parallelizable spheres are S1, S3 and S7. The fact that these sphere are parallelizable is easy to see by viewing them as the unit vectors in the complex numbers, the quaternions, and the octionians, respectively; the fact that these are the only parallelizable ones is not trivial.
1.5 Pull-back bundles
Any bundle morphism covers a map between manifolds (1.4). There is a sort of k converse to this: given a map between manifolds f : M1 −→ M2, and a R -bundle π E −−→ M2 the set
∗ f E = {(p, v) ⊆ M1 × E : f(p) = π(v)} 10 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS
fits into the commutative diagram
fb f ∗E / E
π πb f M1 / M2 where, for any (p, v) ∈ f ∗E,
π fb p o b (p, v) / v.
Moreover, if (U, Φ) is a local trivialization of E and V is a coordinate chart in M1 such that f(V) ⊆ U then
∗ −1 −1 k f E ∩ (V × π (U)) = {(p, (Φ f(p)) (x)) : p ∈ V, x ∈ R } and we can use this to introduce a C∞ manifold structure on f ∗E such that f ∗E ∩ −1 k ∗ πb k (V × π (U)) is diffeomorphic to V × R making f E −−→ M1 a R -bundle. The bundle f ∗E is called the pull-back of E along f. The simplest example of a pull-back bundle is the restriction of a bundle. If
π π X X is a submanifold of M, and E −−→ M is a vector bundle, then E X −−−−→ X is (isomorphic to) the pull-back of E along the inclusion map X,→ M. We see directly from the definition that the pull-back bundle of a trivial bundle is again trivial, but note that the converse is not true.
Given a smooth map f : M1 −→ M2, pull-back along f defines a functor between bundles on Y and bundles on X. Indeed, if E1 −→ Y and E2 −→ Y are bundles over Y and F : E1 −→ E2 is a bundle map between them then
∗ ∗ ∗ ∗ ∗ f F : f E1 −→ f E2, f F (p, v) = (p, F (p)v) for all (p, v) ∈ f E1
∗ ∗ is a bundle map between f E1 and f E2. Another useful relation between pull-backs and bundle maps is the following: π1 π2 If E1 −−→ M1 and E2 −−→ M2 are vector bundles and F : E1 −→ E2 is a bundle map so that F E1 / E2
π1 π2 F M1 / M2 ∗ then F is equivalent to a bundle map between E1 −→ M1 and F E2 −→ M1, namely ∗ E1 3 v 7→ (π1(v),F (π1(v))v) ∈ F E2. 1.6. THE COTANGENT BUNDLE 11
We will show later (Remark 3) that if f : M1 −→ M2 and g : M1 −→ M2 are homotopic smooth maps, i.e., if there is a smooth map
H : [0, 1] × M1 −→ M2 s.t. H(0, ·) = f(·) and H(1, ·) = g(·)
∗ ∼ ∗ then f E = g E for any vector bundle E −→ M2. In particular, this implies that all vector bundles over a contractible space are trivial.
1.6 The cotangent bundle
Every real vector space V has a dual vector space
∗ V = Hom(V, R) consisting of all the linear maps from V to R. Given a linear map between vector spaces T : V −→ W there is a dual map between the dual spaces
T ∗ : W ∗ −→ V ∗,T ∗(ω)(v) = ω(T (v)) for all ω ∈ W ∗, v ∈ V.
By taking the dual maps of the local trivializations of a vector bundle E −−→π M, the manifold E∗ = {(p, ω): p ∈ M, ω ∈ (π−1(p))∗} with the obvious projection E∗ −→ M is seen to be a vector bundle, the dual bundle to E. Every vector v ∈ Rk defines a dual vector v∗ ∈ (Rk)∗,
∗ k ∗ v : R −→ R, v (w) = v · w, and it is easy to see that the dual vectors of a basis of Rk form a basis of (Rk)∗ so that in fact v 7→ v∗ is a vector space isomorphism. In particular the dual bundle of an Rk-bundle is again a Rk-bundle.
Given vector bundles E1 −→ M1 and E2 −→ M2 and F : E1 −→ E2 a bundle map, we get a dual bundle map
∗ ∗ ∗ F : E2 −→ E1 .
Thus taking duals defines a contravariant functor on the category of vector bundles. 12 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS
The dual of a bundle map F : E1 −→ E2 between two bundles Ei −→ M is a map between their duals ∗ ∗ ∗ F : E2 −→ E1 . Thus taking duals defines a contravariant functor on the category of vector bundles on M. It is easy to see that this functor commutes with pull-back along a smooth map. The dual bundle to the tangent bundle is the cotangent bundle
T ∗M −→ M.
Remark 1. The assignation M 7→ T ∗M is natural, but is not a functor in the same way that M 7→ TM is. Rather, a map F : M1 −→ M2 between manifolds induces a map DF : TM1 −→ TM2 between their tangent bundles which, as explained above, ∗ we can interpret as a map DF : TM1 −→ F (TM2) between bundles over M1. Now ∗ ∗ ∗ ∗ we can take duals and we get a bundle map (DF ) : F (TM2 ) −→ TM1 . Thus we can view M 7→ T ∗M as a functor but its target category should consist of vector bundles where a morphism between E1 −→ M1 and E2 −→ M2 consists of ∗ a map F : M1 −→ M2 and a bundle map Φ : F E2 −→ E1 and composition is
∗ (F1, Φ1) ◦ (F2, Φ2) = (F1 ◦ F2, Φ1 ◦ (F1 Φ2)).
This is called the category of star bundles in [Kolar-Michor-Slovak].
1.7 Operations on vector bundles
We constructed the vector bundle E∗ −→ M from the vector bundle E −→ M by applying the vector space operation V 7→ V ∗ fiber-by-fiber. This works more generally whenever we have a smooth functor of vector spaces. Let Vect denote the category of finite dimensional vector spaces with linear isomorphisms, and let T be a (covariant) functor on Vect. We say that T is smooth if for all V and W in Vect the map
Hom(V,W ) −→ Hom(T (V ), T (W )) is smooth. Let E −−→π M be a vector bundle over M, define
−1 G Tp(E) = T (π (p)), T (E) = Tp(E) p∈M 1.7. OPERATIONS ON VECTOR BUNDLES 13
and let T (π): T (E) −→ M be the map that sends Tp(E) to p. We claim that T (π) T (E) −−−−→ M is a smooth vector bundle. Applying T fibrewise to a local trivialization of E,
k −1 Φ: V × R −→ π (V), we obtain a map k −1 T (Φ) : V × T (R ) −→ T (π) (V), and we can use these maps as coordinate charts on T (E) to obtain the structure of a smooth manifold. Indeed, we only need to see that if two putative charts overlap then the map
−1 k T (Φ1) T (Φ2) k (V1 ∩ V2) × T (R ) −−−−−−−−−−→ (V1 ∩ V2) × T (R ) is smooth, and this follows from the smoothness of T . Thus T (E) is a smooth manifold, T (π) is a smooth function, and
T (π) T (E) −−−−→ M satisfies the local triviality condition, so we have a new vector bundle over M. The same procedure works for contravariant functors (meaning that Hom(V,W ) is sent to Hom(T (W ), T (V ))) as well as functors with several variables. Therefore, for instance, we can define for vector bundles E −→ M and F −→ M the vector bundles:
i) Hom(E,F ) −→ M with fiber over p, Hom(Ep,Fp).
ii) E ⊗ F −→ M with fiber over p, Ep ⊗ Fp. iii) S2E −→ M whose fiber over p is the space of all symmetric bilinear transfor- mations from Ep × Ep to R. iv) SkE −→ M whose fiber over p is the space of all symmetric k-linear transfor- k mations from Ep to R. v) ΛkE −→ M whose fiber over p is the space of all antisymmetric k-linear k transformations from Ep to R.
We also obtain natural isomorphisms of vector bundles from isomorphisms of vector spaces. Thus, for instance, for vector bundles E,F, and G over M, we have2
2Recall that R is the trivial R-bundle, R × M −→ M. 14 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS
i) E ⊕ F ∼= F ⊕ E ∼ ii) E ⊗ R = E iii) E ⊗ F ∼= F ⊗ E iv) E ⊗ (F ⊕ G) ∼= (E ⊗ F ) ⊕ (E ⊗ G)
∗ ∗ ∼ v) Hom(E,F ) = E ⊗ F (so E = Hom(E, R)) M vi) Λk(E ⊕ F ) ∼= (ΛiE ⊗ ΛjF ) i+j=k vii) (ΛkE)∗ = Λk(E∗)
1.8 Sections of bundles
A section of a vector bundle E −−→π M is a smooth map s : M −→ E such that E > s π
id M / M commutes. We will denote the space of smooth sections of E by C∞(M; E). This forms a vector space over R and, as such, it is infinite-dimensional (unless we are in a ‘trivial’ situation where (dim M)(rankE) = 0). However it is also a module over C∞(M) (with respect to pointwise multiplication) and here it is finitely generated. For instance, the sections of a trivial bundle form a free C∞(M) module,
∞ n ∞ n C (M; R ) = (C (M)) . We also introduce notation for functions and sections with compact support,
∞ ∞ Cc (M), Cc (M; E) and point out that the latter is a module over the former. One consequence of the module structure is that we can use a partition of unity on M to decompose a section into a sum of sections supported on trivializations of E. A section of the tangent bundle is known as a vector field. Thus a vector field on M is a smooth function that assigns to each point p ∈ M a tangent vector ∞ in TpM. A vector field V ∈ C (M; TM) naturally acts on a smooth function by differentiation
∞ ∞ ∞ V : C (M) −→ C (M), (V f)(p) = Dpf(V (p)) for all f ∈ C (M), p ∈ M. 1.8. SECTIONS OF BUNDLES 15
If Vα is a coordinate chart on M with local coordinates y1, . . . , ym then we can write
m X ∂ V = ai(y1, . . . , ym) . Vα ∂y i=1 i Indeed, this is just the representation in the induced coordinates of TM with the identification (1.3). This action of V on C∞(M) is linear and satisfies
V (fh) = V (f)h + fV (h), for all f, h ∈ C∞(M), i.e., it is a derivation of C∞(M). In fact, it is possible to identify vector fields on M with the derivations of C∞(M). (see e.g., [Lee]) As maps on C∞(M), we can compose vector fields
(VW )(f) = V (W (f)) but the result is no longer a vector field. However, if we form the commutator of two vector fields, [V,W ]f = V (W (f)) − W (V (f)) then the result is again a vector field! In fact it is easy to check that this is a derivation on C∞(M). The commutator of vector fields satisfies the ‘Jacobi identity’
[[U, V ],W ] + [[V,W ],U] + [[W, U],V ] = 0 and hence the commutator, combined with the R-vector space structure, gives C∞(M,TM) the structure of a Lie algebra.
Remark 2. Whereas a smooth map F : M1 −→ M2 induces a map DF : TM1 −→ TM2, it is not always possible to ‘push-forward’ a vector field. We say that two vector ∞ ∞ fields V ∈ C (M1; TM1), and W ∈ C (M2; TM2) are F -related if the diagram
DF TM1 / TM2 O O V W F M1 / M2 commutes, i.e., if DF ◦ V = W ◦ F. If F is a diffeomorphism, then we can push-forward a vector field: the vector ∞ −1 ∞ fields V ∈ C (M1; TM1) and F∗V = DF ◦ V ◦ F ∈ C (M2; TM2) are always ∞ F -related. Moreover, F∗[V1,V2] = [F∗V1,F∗V2] for all V1,V2 ∈ C (M1; TM1), so M 7→ C∞(M; TM) is a functor from smooth manifolds with diffeomorphisms to Lie algebras and isomorphisms. 16 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS 1.9 Differential forms
Sections of the cotangent bundle of M,T ∗M, are known as differential 1-forms. More generally, sections of the k-th exterior power of the cotangent bundle, ΛkT ∗M, are known as differential k-forms. The typical notation for this space is
dim M M Ωk(M) = C∞(M;ΛkT ∗M), Ω∗(M) = Ωk(M) k=0 where by convention Ω0(M) = C∞(M). A differential form in Ωk(M) is said to have degree k. A differential k-form ω ∈ Ωk(M) can be evaluated on k-tuples of vector fields,
∞ k ω : C (M; TM) −→ R, it is linear in each variable and has the property that
ω(Vσ(1),...,Vσ(k)) = sign(σ)ω(V1,...,Vk) for any permutation σ of {1, . . . , k}. Differential forms of all degrees Ω∗(M) form a graded algebra with respect to the wedge product
Ωk(M) × Ω`(M) 3 (ω, η) 7→ ω ∧ η ∈ Ωk+`(M) 1 X ω ∧ η(V ,...,V ) = sign(σ)ω(V ,...,V )η(V ,...,V ). 1 k+` k!`! σ(1) σ(k) σ(k+1) σ(k+`) σ
This product is graded commutative (so sometimes commutative and sometimes anti-commutative)
ω ∈ Ωk(M), η ∈ Ω`(M) =⇒ ω ∧ η = (−1)k`η ∧ ω.
In particular if ω ∈ Ωk(M) with k odd, then ω ∧ ω = 0. Given a differential k-form, ω, we will write the operator ‘wedge product with ω, by ∗ ∗+k ∗ eω :Ω (M) −→ Ω (M), eω(η) = ω ∧ η, for all η ∈ Ω (M).
We say that a k-form is elementary if it is the wedge product of k 1-forms. Elementary k-forms are easy to evaluate
(ω1 ∧ · · · ∧ ωk)(V1,...,Vk) = det(ωi(Vj)). 1.9. DIFFERENTIAL FORMS 17
Let p be a point in a local coordinate chart V on M with coordinates y1, . . . , ym. A ∗ basis of Tp M dual to the basis {∂y1 , . . . , ∂ym } of TpM is given by 1 m i i {dy , . . . , dy } where dy (∂yj ) = δj. ∗ A basis for the space of k-forms in Tp M is given by
i1 ik {dy ∧ · · · ∧ dy : i1 < ··· < ik}. Any k-form supported in V can be written as a sum of elementary k-forms. Using a partition of unity we can write any k-form on M as a sum of elementary k-forms. Notice that anticommutativity of the wedge product implies that all elementary k- forms, and hence all k-forms, of degree greater than the dimension of the manifold vanish. There is also an interior product of a vector field and a differential form, which is simply the contraction of the differential form by the vector field. For every W ∈ C∞(M; TM), the interior product is a map ∗ ∗−1 iW :Ω (M) −→ Ω (M) k ∞ (iW ω)(V1,...,Vk−1) = ω(W, V1,...,Vk−1), for all ω ∈ Ω , {Vi} ⊆ C (M; TM). We will also use the notation iW ω = W y ω. Notice that antisymmetry of differential forms induces anticommutativity of the interior product in that iV iW = −iW iV ∞ 2 for any V,W ∈ C (M; TM). In particular, (iV ) = 0.
If F : M1 −→ M2 is a smooth map between manifolds, we can pull-back differential forms from M2 to M1, ∗ k k F :Ω (M2) −→ Ω (M1), ∗ k ∞ F ω(V1,...,Vk) = ω(DF (V1),...,DF (Vk)) for all ω ∈ Ω (M2),Vi ∈ C (M; TM1). Pull-back respects the wedge product, (1.5) F ∗(ω ∧ η) = F ∗ω ∧ F ∗η, and the interior product,
∗ ∗ F (DF (W ) y ω) = W y F (ω).
In the particular case of Rm, the pull-back of an m-form by a linear map T is (1.6) T ∗(dx1 ∧ · · · ∧ dxm) = (det T )(dx1 ∧ · · · ∧ dxm), directly from the definition of the determinant. 18 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS 1.10 Orientations and integration
If W is an m-dimensional vector space, and {u1, . . . , um}, {v1, . . . , vm} are bases of W, we will say that they represent the same orientation if the linear map that sends m ui to vi has positive determinant. Equivalently, since Λ W is a one-dimensional vector space, we know that u1 ∧ · · · ∧ um and v1 ∧ · · · vm differ by multiplication by a non-zero real number and we say that they represent the same orientation if this number is positive. A representative ordered basis or non-zero m-form is said to be an orientation of W. The canonical orientation of Euclidean space Rm corresponds to the canonical ordered basis, {e1, . . . , em}. An Rk-vector bundle E −→ M is orientable if we can find a smooth orienta- tion of its fibers, i.e., if there is a nowhere vanishing section of ΛkE −→ M. Notice that the latter is an R-bundle over M, so the existence of a nowhere vanishing sec- tion is equivalent to its being trivial. Thus E −→ M is orientable if and only if ΛkE −→ M is a trivial bundle. An orientation is a choice of trivializing section, and a manifold together with an orientation is said to be oriented. Note that, since Λk(E∗) = (ΛkE)∗,E is orientable if and only if E∗ is orientable. An m-manifold M is orientable if and only if its tangent bundle is orientable. Equivalently, a manifold is orientable when its cotangent bundle is orientable, so when there is a nowhere vanishing m-form. A choice of nowhere vanishing m-form is called an orientation of M or also a volume form on M. A coordinate chart φ : U −→ V is oriented if it pulls-back the orientation of M to the canonical orientation of Rm. A manifold is orientable precisely when it has an atlas of oriented coordinate charts. If M is oriented, there is a unique linear map the integral Z ∞ m : Cc (M;Λ M) −→ R which is invariant under diffeomorphims and coincides in local coordinates with the Lebesgue integral. That is, if ω ∈ ΩmM is supported in a oriented coordinate chart φ : U −→ V then φ∗ω ∈ Ωm(Rm) and so we can take its Lebesgue integral. The assertion is that the result is independent of the choice of oriented coordinate chart, and the proof follows from (1.6) and the usual change of variable formula for the Lebesgue integral. By using a partition of unity, integration of any m-form reduces to integration of m-forms supported in oriented coordinate charts. It is convenient to formally extend the integral to a functional on forms of all degrees, by setting it equal to zero on all forms of less than maximal degree. What happens when a manifold is not orientable? At each point p ∈ M, the set of equivalence classes of orientations of TM, Op, has two elements, say o1(p) and 1.11. EXTERIOR DERIVATIVE 19
o2(p). We define the orientable double cover of M to be the set
Or(M) = {(p, oi(p)) : p ∈ M, oi(p) ∈ Op}.
The manifold structure on M easily extends to show that Mf is a smooth manifold. Indeed, given a coordinate chart φ : U −→ V of M we set, for each q ∈ U,
−1 φe(q) = (φ(q), [φ (e1 ∧ · · · ∧ em)]) ∈ Or(M) and we get a coordinate chart on M.f The mapping π : Mf −→ M that sends (p, oi(p)) to p is smooth and surjective. Moreover if V is a coordinate neighborhood in M −1 then π (V) = V1 ∪ V2 is the union of two disjoint open sets in Or(M) and π restricted to each Vi is a diffeomorphism onto V, hence the term ‘double cover’. It is easy to see that Or(M) is always orientable (at each point (p, oi(p)) choose the orientation corresponding to oi(p)). If M is connected and not orientable then Or(M) is connected and orientable, and we can often pass to Or(M) if we need an orientation. If M is orientable, then the same construction of Or(M) yields a disconnected manifold consisting of two copies of M. Orientations are one place where the theory of complex vector bundles dif- fers from that of general real vector bundles. Complex vector bundles are always orientable. Indeed, first note that a C-vector space W of complex dimension k has a canonical orientation as a R-vector space of real dimension 2k. We just take any basis of W over C, say b = {v1, . . . , vk}, and then take b = {v1, iv1, v2, iv2, . . . , vk, ivk} as 0 a basis for W over R. The claim is that if b = {w1, . . . , wk} is any other basis of W 0 over C, then b = {w1, iw1, . . . , wk, iwk} gives us the same orientation for W as a R- vector space, i.e., the change of basis matrix Bb from b to b0 has positive determinant. The proof is just to note that, if B is the complex change of basis matrix from b to b0 then det Bb = (det B)(det B) > 0. As the orientation is canonical, every Ck-vector bundle is orientable as a R2k-vector bundle.
1.11 Exterior derivative
If f is a smooth function on M, its exterior derivative is a 1-form df ∈ C∞(M; T ∗M) defined by df(V ) = V (f), for all V ∈ C∞(M; TM). 20 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS
If ω ∈ C∞(M; T ∗M), its exterior derivative is a 2-form dω ∈ C∞(M;Λ2T ∗M) defined by dω(V,W ) = V (ω(W )) − W (ω(V )) − ω([V,W ]), for all V,W ∈ C∞(M; TM). In the same way, if ω ∈ Ωk(M), we define its exterior derivative to be the (k+1)-form dω ∈ Ωk+1(M) defined by
p X i dω(V0,...,Vk) = (−1) Vi(ω(V0,..., Vbi,...,Vp)) i=0 X i+j + (−1) ω([Vi,Vj],V0,..., Vbi,..., Vbj,...,Vp) i The exterior derivative thus defines a graded map d :Ω∗(M) −→ Ω∗(M). It has the property that d2 = 0. It also distributes over the wedge product respecting the grading, in that (1.7) ω ∈ Ωk(M), η ∈ Ω`(M) =⇒ d(ω ∧ η) = (dω) ∧ η + (−1)kω ∧ dη. The exterior derivative also commutes with pull-back along a smooth map ∞ F : M1 −→ M2. Indeed, note that for a function on M2, f ∈ C (M2), this follows from the chain rule d(F ∗f)(V ) = d(f ◦F )(V ) = D(f ◦F )(V ) = Df(DF (V )) = df(DF (V )) = F ∗(df)(V ) and, using this, it follows for forms of arbitrary degree from (1.5) and (1.7). A k-form ω is called closed if dω = 0 and exact if there is a (k − 1)-form η such that dη = ω. The space of k-forms is an infinite dimensional vector space over R, as are the spaces of closed and exact forms. However we will show that, for a compact manifold M and any k, the quotient vector space closed k-forms ker d :Ωk(M) −→ Ωk+1(M) Hk (M) = = dR exact k-forms im (d :Ωk−1(M) −→ Ωk(M)) is finite dimensional. This R-vector space is known as the k-th de Rham coho- mology group of M. These groups are one of the main objects of study in this course. We will come back to them after we build up more tools. If M is an oriented manifold with boundary, Stokes’ theorem is the fact that Z Z m−1 dω = ω, for all ω ∈ Ωc (M). M ∂M 1.12. DIFFERENTIAL OPERATORS 21 This compact statement generalizes the fundamental theorem of calculus, Green’s theorem, and Gauss’ theorem. If M does not have boundary, then Stokes’ theorem says that Z m−1 dω = 0, for all ω ∈ Ωc (M). M 1.12 Differential operators It is clear what we mean by a differential operator of order n on Rm, namely a polynomial of degree n in the variables ∂y1 , ∂y2 , . . . , ∂ym , X j1 j2 jm (1.8) aj1,...,jm (y1, . . . , ym)∂y1 ∂y2 ··· ∂ym . j1,j2,...,jm j1+...+jm=n (The degree of this operator is the largest value of j1 + j2 + ... + jm for which aj1,...,jm is not zero, which we are assuming is n.) This operator is smooth when the coefficient functions aj1,...,jm (y1, . . . , ym) are all smooth. The set of all differential n operators of order at most n, is denoted Diff (Rm). This is a good moment to introduce multi-index notation. Let us agree, for all vectors α = (j1, . . . , jm), on the equivalence of the symbols α j1 j2 jm ∂y = ∂y1 ∂y2 ··· ∂ym , and let us use |α| to denote j1 + ... + jm. Then we can write the expression above more succinctly as n m X α P ∈ Diff (R ) ⇐⇒ P = aα(y)∂y . |α|≤n On a manifold, a differential operator of order at most n is a linear map P : C∞(M) −→ C∞(M) that in any choice of local coordinates has the form (1.8). For instance, a vector field V induces a linear map (a derivation) on smooth functions V : C∞(M) −→ C∞(M) and this is exactly what we mean by a first order differential operator on M. In fact, a better description of differential operators of order at most n on M is to take the span of powers of C∞(M; TM), Diffn(M) = span(C∞(M; TM))j 0≤j≤n 22 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS where we agree that C∞(M; TM)0 = C∞(M). Notice that the fact that the degree of a differential operator is well-defined comes from the fact that the commutator of two vector fields is again a vector field. The exterior derivative is also a first order differential operator, but now be- tween sections of vector bundles over M, e.g., for each k, d : C∞(M;ΛkT ∗M) −→ C∞(M;Λk+1T ∗M). In general, a differential operator of order at most n acting between sections of two vector bundles E and F is a linear map C∞(M; E) −→ C∞(M; F ) that in any local coordinates that trivialize both E and F has the form (1.8) where we now require the functions aα(y) to be sections of Hom(E,F ). There is a very nice characterization (or definition if you prefer) of differential operators due to Peetre3. First, we say that a linear operator T : C∞(M; E) −→ C∞(M; F ) is a local operator if whenever s ∈ C∞(M; E) vanishes in an open set U ⊆ M, T s also vanishes in that open set, s U = 0 =⇒ (T s) U = 0. Peetre showed that all linear local operators are differential operators. 1.13 Exercises Exercise 1. Show that whenever M1 and M2 are smooth manifolds, M1 × M2 is too. Prove that T (M1 × M2) = TM1 × TM2. Exercise 2. Instead of defining the cotangent bundle in terms of the tangent bundle, we can do things the other way around. At each point p of a manifold M, let ∞ Ip = {u ∈ C (M): u(p) = 0} 3 Peetre, Jaak Une caract´erisationabstraite des op´erateurs diff´erentiels. Math. Scand. 7 1959 211-218. 1.13. EXERCISES 23 2 be the ideal of functions vanishing at p. Let Ip be the linear span of products of pairs of elements in Ip, and define ∗ 2 ∗ [ ∗ Tp M = Ip/Ip , T M = Tp M. p∈M Show that, together with the obvious projection map, this defines a vector bundle isomorphic to the cotangent bundle. Exercise 3. k π Show that a R -bundle E −−→ M is trivial if and only if there are k sections s1, . . . , sk such that, at each p ∈ M, {s1(p), . . . , sk(p)} is linearly independent. Exercise 4. If E −−−→πE M is a subbundle of F −−−→πF M show that there is a bundle F/E −→ M −1 −1 whose fiber over p ∈ M is πF (p)/πE (p) Exercise 5. Given vector bundles E −→ M and F −→ M, assume that f ∈ C∞(M; Hom(E,F )) is such that dim ker f(p) is independent of p ∈ M. Prove that the kernels of f form a vector bundle over M (a subbundle of E) and the cokernels of f also form a vector bundle over M (a quotient of F.) Exercise 6. (Poincar´e’sLemma for 1-forms.) Let ω = a(x, y, z) dx + b(x, y, z) dy + c(x, y, z) dz be a smooth 1-form in R3 such that dω = 0. Define f : R3 −→ R by Z 1 f(x, y, z) = (a(tx, ty, tz)x + b(tx, ty, tz)y + c(tx, ty, tz)z) dt. 0 Show that df = ω. (Thus on R3 (also Rn) a 1-form ω satisfies dω = 0 ⇐⇒ ω = df for some f. It turns out that this is a reflection of the ‘contractibility’ of Rn.) Hint: First use dω = 0 to show that ∂b ∂a ∂c ∂a ∂b ∂c = , = , = ∂x ∂y ∂x ∂z ∂z ∂y and then use the identity Z 1 ∂ a(x, y, z) = (a(tx, ty, tz)t) dt 0 ∂t Z 1 Z 1 ∂a ∂a ∂a = a(tx, ty, tz) dt + t x + y + z dt. 0 0 ∂x ∂y ∂z 24 LECTURE 1. DIFFERENTIAL OPERATORS ON MANIFOLDS Exercise 7. Show that any smooth map F : M1 −→ M2 induces, for any k ∈ N0, a map in cohomology ∗ k k F :HdR(M2) −→ HdR(M1) Exercise 8. Show that the wedge product and the integral descend to cohomology. That is, show that the value of R ω and the cohomology class of ω ∧ η only depends on the cohomology classes of ω and η. Exercise 9. Show that a linear map T : C∞(M; E) −→ C∞(M; F ) corresponds to a section of C∞(M; Hom(E,F )) if and only if T is linear over C∞(M). 1.14 Bibliography There are many wonderful sources for differential geometry. The first volume of Spivak, A comprehensive introduction to differential geometry is written in a con- versational style that makes it a pleasure to read. The other four volumes are also very good. I also recommend Lee, Introduction to smooth manifolds. For vector bundles, the books Milnor and Stasheff, Characteristic classes and Atiyah, K-theory are written by fantastic expositors, as is the book Bott and Tu, Differential forms in algebraic topology. Lecture 2 A brief introduction to Riemannian geometry 2.1 Riemannian metrics A metric on a vector bundle E −→ M is a smooth family of inner products, one on each fiber of E. We make sense of smoothness by using the bundle S2(E) −→ M of symmetric bilinear forms on the fibers of E. Formally, a metric gE on E is a section of the vector bundle S2(E) −→ M that is non-degenerate and positive definite. In particular we have an C∞(M)-valued inner product on the sections of E, so a smooth map ∞ ∞ ∞ gE : C (M; E) × C (M; E) −→ C (M) that is linear in each entry, symmetric, and satisfies gE(s, s) ≥ 0 and gE(s, s) = 0 ⇐⇒ s = 0. A metric g on the tangent bundle of M is called a Riemannian metric, and the pair (M, g) is called a Riemannian manifold. Any vector bundle admits a metric. Indeed, on a local trivialization we can just put the Euclidean inner product. Covering the manifold with local trivializations, we can use a partition of unity to put these inner products together into a bundle metric. Given a metric gE on E we can use the inner product gE(p) to identify a vector ∗ v ∈ Ep with a vector in E by (2.1) (geE(p))(v)(w) = gE(p)(v, w) for all w ∈ Ep. 25 26 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY This extends to a bundle isomorphism ∗ geE : E −→ E . Conversely, any bundle isomorphism between E and E∗ defines by (2.1) an inner product on E (and on E∗), though not necessarily positive definite. In particular a Riemannian metric g on M induces a bundle isomorphism between TM and T ∗M, known as the ‘musical isomorphism’. A vector field V is associated a differential 1-form via g, denoted V [, and a 1-form ω is associated a vector field via g, denoted ω]. Incidentally, this is how the gradient is defined on a Riemannian manifold. Given a function f ∈ C∞(M), we get a one form df ∈ C∞(M; TM), and we use the metric to turn this into a vector field ∇f = (df)] ∈ C∞(M; TM). Thus, the gradient satisfies (and is defined by) g(∇f, V ) = df(V ) = V (f), for all V ∈ C∞(M; TM). Let us look at the objects in local coordinates. Let V be a local coordinate chart of M trivializing E, so that we have coordinates (2.2) {y1, . . . , ym, s1, . . . , sk} where y1, . . . , ym are local coordinates on M, and s1, . . . , sk are linearly independent sections of E depending on y. A metric is a smooth family of k × k symmetric positive definite matrices y 7→ gE(y) = ((gE)ab(y)) , where (gE)ab = gE(sa, sb). Any section of E, σ : V −→ E, can be written k X a σ(y) = σ (y)sa a=1 for some smooth functions σa. It is common to write this more succinctly using the ‘Einstein summation convention’ as a σ(y) = σ (y)sa 2.1. RIEMANNIAN METRICS 27 with the understanding that a repeated index that occurs as both a subindex and a a a superindex should be summed over. If σ = σ sa and τ = τ sa are two sections of E, their inner product with respect to gE is a b gE(σ, τ) = σ τ (gE)ab. It is very convenient, with regards to the Einstein convention, to denote the ab entries of the inverse matrix of (gE)ab by (gE) , thus for instance we have ab a (gE) (gE)bc = δc . ∗ The metric gE determines a metric gE∗ on E by the relation (2.3) gE∗ (geE(σ), geE(τ)) = gE(σ, τ), and in a local trivialization this corresponds to taking the inverse matrix of (gE)ab. Indeed, in these same coordinates (2.2) we have a natural local trivialization of E∗, namely 1 k j ∞ ∗ i i {y1, . . . , ym, s , . . . , s }, where s ∈ C (V; E ) satisfy s sj = δj. Then the map geE satisfies b (geE)(sa)(sb) = gE(sa, sb) = gab, so (geE)(sa) = (gE)abs b ab and hence s = (gE) geE(sa). Thus we have a b ac bn (gE∗ )ab = gE∗ (s , s ) = gE∗ ((gE) geE(sc), (gE) geE(sn)) ac bn ac b ab = (gE) (gE) (gE)cn = (gE) δc = (gE) as required. Given a smooth map F : M1 −→ M2 we can pull-back a metric g on M2 to 2 define a section of S (TM1) −→ M1, ∗ (F g)(V1,V2) = g(DF (V1),DF (V2)). ∗ If F is an immersion, then F g is a metric on M1. In particular, submanifolds of a Riemannian manifold inherit metrics. The simplest example of a Riemannian m-manifold is Rm with the usual Eu- clidean inner product on the fibers of m m m m T R = R × R −→ R . 28 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY From the previous paragraph it follows that all submanifolds of Rm inherit a metric from the one on Rm. A difficult theorem of Nash, the Nash embedding theorem, shows that all Riemannian manifolds can be embedded into RN for some N in such a way that the metric coincides with the one inherited from RN . So we can think of an arbitrary Riemannian manifold as a submanifold of a Euclidean space. If (M1, g1) and (M2, g2) are Riemannian manifolds, a map F : M1 −→ M2 is a ∗ local isometry if it is a local diffeomorphism and F g2 = g1. An isometry between two Riemannian manifolds is a bijective local isometry. Riemannian geometry is the study of invariants of isometry classes of manifolds. A metric on E −→ M allows us to define the length of a vector v ∈ Ep by p |v|gE = gE(p)(v, v) and the angle θ between two vectors v, w ∈ Ep satisfies gE(p)(v, w) = |v|gE |w|gE cos θ. Given a curve γ :[a, b] −→ M on M, and a Riemannian metric g, the length of γ is defined to be Z b 0 `(γ) = |γ (t)|g dt. a This is independent of the chosen parametrization of γ. We can use the length of curves to define the distance between two points, p, q ∈ M, to be (2.4) d(p, q) = inf{`(γ): γ ∈ C∞([0, 1],M), γ(0) = p, γ(1) = q} it is easy to see that d(p, q) ≥ 0, d(p, q) = d(q, p), d(p, q) ≤ d(p, r) + d(r, q) and we will show below that d(p, q) = 0 =⇒ p = q, so that (M, d) is a metric space. 2.2 Parallel translation On Rm, and more generally on a parallelizable manifold, it is clear what we mean by a constant vector field. Given a tangent vector at a point in one of these spaces, we can ‘parallel translate’ it to the whole space just taking the constant vector field. 2.2. PARALLEL TRANSLATION 29 To try to make sense of this on a manifold, suppose we have a smooth curve γ : [0, 1] −→ M and a vector field V ∈ C∞(M; TM), we would like to say that V is constant (or parallel) along γ if the map V ◦ γ : [0, 1] −→ TM is constant. The problem with making sense of this is that TM is not a vector space but a collection of vector spaces, so V (γ(t)) ∈ Tγ(t)M belongs to a different vector space for each t. The same problem occurs with trying to make sense of a constant section of a vector bundle E −→ M. The solution is to use a connection. A connection on a vector bundle E −→ M is a map ∇E : C∞(M; TM) × C∞(M; E) −→ C∞(M; E) E E which is denoted ∇ (V, s) = ∇V s, is R-linear in both variables, i.e., for all ai ∈ R, ∞ ∞ Vi ∈ C (M; TM), si ∈ C (M; E), E E E E E E ∇a1V1+a2V2 s = a1∇V1 s + a2∇V2 s, ∇V (a1s1 + a2s2) = a1∇V s1 + a2∇V s2 and satisfies, for f ∈ C∞(M), E E E E (2.5) ∇fV s = f∇V s, ∇V (fs) = V (f)s + f∇V s. E It follows that the value of ∇V s at a point p ∈ M depends only on the value of V E at p and the values of s in a neighborhood of p. We refer to ∇V s as the covariant derivative of s in the direction of V . On a trivial bundle Rk × M −→ M, a vector field is a map M −→ Rk and the usual derivative on Rk defines a covariant derivative on Rk. In a local trivialization of E, {y1, . . . , ym, s1, . . . , sk} the connection is determined by mk2 functions, b Γia, where 1 ≤ i ≤ m, 1 ≤ a, b ≤ k known as the Christoffel symbols, defined by E b ∇ sa = Γ sb ∂yi ia where we are using the Einstein summation convention as above. Once we know the b functions Γia we can find ∇V s at any point in this local trivialization by using (2.5). 30 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY We can answer our original question by saying that, for a given curve γ : [a, b] −→ M a section of E, s : M −→ E, is parallel along γ (with respect to ∇E) if E ∇γ0(t)s = 0 at all points γ(t). Let us see write this equation in local coordinates as above (2.2). We can write a 0 i 0 s(t) := s(γ(t)) = σ (t)sa, γ (t) = (γ ) (t)∂yi and then E a i 0 a b ∇γ0(t)s = (∂tσ )sa + ((γ ) (t))σ Γiasb grouping together the coefficients of sa, we see that this will vanish precisely when a i 0 c a (2.6) ∂tσ + ((γ ) (t))σ Γic = 0 for all a. This local expression shows that it is always possible to extend a vector in Ep to a covariantly constant section of E along any curve through p. Indeed, given any such curve γ :[a, b] −→ M, with p = γ(t0) and any vector s0 ∈ Eγ(t0), we can find s by solving (2.6) for its coefficients σ1, . . . , σk. As this is a linear system of ODE’s, there is a unique solution defined on all of [a, b]. We call this solution the parallel transport of s0 along γ. Parallel transport along γ defines a linear isomorphism between fibers of E −→ M, Tγ : Ea −→ Eb. This is particularly interesting when γ is a loop at a point p ∈ M. The possible linear automorphisms of Ep resulting from parallel transport along loops at p form a group, the holonomy group of ∇. Different points in M yield isomorphic groups so, as long as M is connected, this group is independent of the base point p. Remark 3. We can use parallel transport to show that pull-back bundles under homotopic maps are isomorphic. Indeed, assume that f : M1 −→ M2 and g : M1 −→ M2 are homotopic smooth maps, so that there is a smooth map H : [0, 1] × M1 −→ M2 s.t. H(0, ·) = f(·) and H(1, ·) = g(·) ∗ ∼ ∗ and let E −→ M2 be a vector bundle. We want to show that f E = g E, and by ∗ considering H E we see that we can reduce to the case M2 = [0, 1] × M1, with f = i0 : M1 −→ [0, 1] × M1, i0(p) = (0, p) g = i1 : M1 −→ [0, 1] × M1, i1(p) = (1, p). 2.2. PARALLEL TRANSLATION 31 So let F −→ [0, 1] × M1 be a vector bundle. Choose a connection on F and for each p ∈ M1 let Tp be parallel transport along the curve t 7→ (t, p), then the map ∗ ∗ Φ: i0F −→ i1F, Φ(p, v) = (p, Tp(v)) is a vector bundle isomorphism. E Next assume that we have both a metric gE and a connection ∇ on E. We will say that these objects are compatible if, for any sections σ and τ of E, and any vector field V ∈ C∞(M; TM), E E (2.7) V (gE(σ, τ)) = gE(∇V σ, τ) + gE(σ, ∇V τ). We also say in this case that ∇E is a metric connection. Parallel transport with respect to a metric connection is an isometry. That is, if γ :[a, b] −→ M is a curve on M and σ and τ are parallel along γ, then ∂tgE(σ(t), τ(t)) = 0. In fact this follows immediately from the metric condition since the derivative with respect to t is precisely the derivative with respect to the vector field γ0(t). In local coordinates, the connection is metric if and only if the Christoffel symbols satisfy c c ∂yi (gE)ab = Γia(gE)cb + Γib(gE)ac. b Given the matrix-valued function gE, we can always find functions Γia that satisfy these relations. In other words given a metric on a trivial bundle, there is always a metric connection. Using a partition of unity it follows that for any metric on a bundle E there is a metric connection. A connection on E is, for each vector field V ∈ C∞(M; TM), a first-order differential operator on sections of E, E ∞ ∞ ∇V : C (M; E) −→ C (M; E). If ∇E and ∇e E are two connections on E, then their difference is linear over C∞(M), E E ∞ ∞ ∇V − ∇e V : C (M; E) −→ C (M; E) satisfies E E E E ∞ ∞ (∇V − ∇e V )(fs) = f(∇V − ∇e V )(s) for all f ∈ C (M), s ∈ C (M; E) and hence a zero-th order differential operator or equivalently, a section of the ho- momorphism bundle of E, E E ∞ ∇V − ∇e V ∈ C (M; Hom(E)). 32 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY We can also incorporate the dependence on the vector field V and write ∇E − ∇e E ∈ C∞(M; T ∗M ⊗ Hom(E)). Conversely, it is easy to see that whenever ∇E is a connection on E and Φ ∈ C∞(M; T ∗M ⊗ Hom(E)) is a section of T ∗M ⊗ Hom(E), the sum ∇E + Φ is again a connection on E. We say that the space of connections on E is an affine space modeled on C∞(M; T ∗M ⊗ Hom(E)). In particular, any bundle has infinitely many connections on it. 2.3 Levi-Civita connection Among the infinitely many connections on the tangent bundle of a Riemannian man- ifold, there is a unique connection that is compatible with the metric and satisfies an additional symmetry property. Theorem 1 (Fundamental theorem of Riemannian geometry). Let (M, g) be a Rie- mannian manifold, there is a unique metric connection ∇ on TM satisfying ∞ (2.8) ∇V W − ∇W V − [V,W ] = 0 for all V,W ∈ C (M; TM). We say that a connection (necessarily on TM) that satisfies on (2.8) is sym- metric or ‘torsion-free’. The unique metric torsion-free connection of a metric is called the Levi-Civita connection. Proof. Given any three vector fields U, V, W ∈ C∞(M; TM), we can apply the metric property (2.7) to Ug(V,W ) + V g(W, U) − W g(U, V ) and then use the torsion-free property to cancel out all but one covariant derivative to arrive at the Koszul formula: 1 g(∇ V,W ) = Ug(V,W ) + V g(W, U) − W g(U, V ) U 2 − g(U, [V,W ]) + g(V, [W, U]) + g(W, [U, V ]) , which proves uniqueness. It is easy to check that this formula defines a symmetric metric connection, establishing existence. In local coordinates, {y1, . . . , ym, ∂y1 , . . . , ∂yk }, a connection on TM is symmet- ric precisely when its Christoffel symbols satisfy k k Γij = Γji. 2.4. ASSOCIATED BUNDLES 33 Note that the coordinate vector fields ∂yi satisfy Schwarz’ theorem, [∂yi , ∂yj ] = 0, so from Koszul’s formula the Christoffel symbols of the Levi-Civita connection are given by 1 Γk = gk` (∂ g + ∂ g − ∂ g ) . ij 2 i `j j i` ` ij 2.4 Associated bundles We saw above (2.3) that a metric on E determines a dual metric on E∗ by requiring that the bundle isomorphism ∗ geE : E −→ E be an isometry. A connection on E also naturally induces a connection on E∗. Given sections s of E and η of E∗, we can ‘contract’ or pair them to get a function η(s) on M, and we demand that the putative connection on E∗ satisfy the corresponding Leibnitz rule, E∗ E V (η(s)) = (∇V η)(s) + η(∇V s). E∗ Thus we define ∇V by E∗ E (∇V η)(s) = V (η(s)) − η(∇V s). If E −→ M and F −→ M are vector bundles over M endowed with metrics gE and gF then we have induced metrics gE⊕F on E ⊕ F −→ M, gE⊗F on E ⊗ F −→ M determined by gE⊕F (σ ⊕ α, τ ⊕ β) = gE(σ, τ) + gF (α, β), gE⊗F (σ ⊗ α, τ ⊗ β) = gE(σ, τ)gF (α, β), for all σ, τ ∈ C∞(M; E), α, β ∈ C∞(M; F ). Similarly, given connections ∇E and ∇F , we have induced connections ∇E⊕F on E ⊕ F −→ M, ∇E⊗F on E ⊗ F −→ M. determined by the corresponding Leibnitz rules, E⊕F E F ∇V (σ ⊕ α) = ∇V σ ⊕ ∇V α E⊗F E F ∇V (σ ⊗ α) = ∇V σ ⊗ α + σ ⊗ ∇V α, for all V ∈ C∞(M; TM), σ ∈ C∞(M; E), and α ∈ C∞(M; F ). 34 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY In particular, connections on E and F induce a connection on Hom(E,F ) = E∗ ⊗ F −→ M, satisfying Hom(E,F ) F E (∇V Φ)(σ) = ∇V (Φ(σ)) − Φ(∇V σ) for all V ∈ C∞(M; TM), Φ ∈ C∞(M; Hom(E,F )), and σ ∈ C∞(M; E). Using these induced connections, it makes sense to take the covariant derivative of a metric on E as a section of E∗ ⊗ E∗, E∗⊗E∗ E E (∇V gE)(σ, τ) = V (gE(σ, τ)) − gE(∇V σ, τ) − gE(σ, ∇V τ). Thus we find that a connection is compatible with a metric if and only if the metric is covariantly constant with respect to the induced connection. We also have induced metrics and connections on pull-back bundles. Let f : M1 −→ M2 be a smooth map and E −→ M2 a vector bundle with metric gE and E connection ∇ . Any section of E, σ : M2 −→ E, can be pulled-back to a section of f ∗E, ∗ ∗ (f σ)(p) = (p, σ(f(p))) ∈ (f E)p, for all p ∈ M1. ∞ ∗ ∞ Moreover these pulled-back sections generate C (M1; f E) as a C (M1)-module in that ∞ ∗ ∞ ∗ ∞ C (M1; f E) = C (M1) · f C (M2; E). Thus, to specify a metric and connection on f ∗E, it suffices to describe their actions on products of functions on M1 and pulled-back sections. Thus for any h1, h2 ∈ ∞ ∞ C (M1), and σ1, σ2 ∈ C (M2; E) we define ∗ ∗ gf ∗E(h1f σ1, h2f σ2)p = h1(p)h2(p)gE(σ1, σ2)f(p) f ∗E ∗ ∗ ∗ E ∇V h1f σ1 = V (h1)f σ1 + f (∇V σ1). Finally, if M is a submanifold of a Riemannian manifold (X, g) then we point out that M inherits a metric from g by restricting it on M to vectors tangent to M. 2.5 E-valued forms We defined a connection as a map ∇E : C∞(M; TM) × C∞(M; E) −→ C∞(M; E) but we can equivalently think of it as a map ∇E : C∞(M; E) −→ C∞(M; T ∗M) × C∞(M; E) = C∞(M; T ∗M ⊗ E). 2.5. E-VALUED FORMS 35 The space C∞(M; T ∗M⊗E) is the 1-form part of the space of E-valued differential k-forms, Ωk(M; E) = C∞(M;ΛkT ∗M ⊗ E). This space is linearly spanned by ‘elementary tensors’, i.e., section of the form ω ⊗ s where ω ∈ Ωk(M) and s ∈ C∞(M; E). There is a wedge product between k-forms and E-valued `-forms given on elementary tensors by (ω ⊗ s) ∧ η = (ω ∧ η) ⊗ s and then extended by linearity. There is also a natural wedge product between a form in Ω∗(M; End E) and a form in Ω∗(M; E) resulting in another form in Ω∗(M; E). For elementary tensors ω ⊗ Φ ∈ Ω∗(M; End E), η ⊗ s ∈ Ω∗(M; E) the wedge product is given by (ω ⊗ Φ) ∧ (η ⊗ s) = (ω ∧ η) ⊗ Φ(s). Suppose M is Riemannian manifold, so that we have the Levi-Civita connec- tion on TM, and that we have a connection on E, then as explained above there is an induced connection on Ωk(M; E), ∇Ωk(M;E) :Ωk(M; E) −→ C∞(M; T ∗M ⊗ ΛkT ∗M ⊗ E). We can take the final section, ‘antisymmetrize’ all of the cotangent variables, and end up with a section of Ωk+1(M; E). We think of the resulting map as an E-valued exterior derivative and denote it dE :Ωk(M; E) −→ Ωk+1(M; E). One elementary tensors ω ⊗ s, dE is given by dE(ω ⊗ s) = dω ⊗ s + (−1)deg(ω)ω ∧ ∇Es and in particular on Ω0(M; E) = C∞(M; E) coincides with ∇E. Unlike the case E = R, it is generally not true that (dE)2 = 0. Proposition 2. Let (M, g) be a Riemannian manifold, E −→ M a vector bundle, and let dE be the E-valued exterior derivative induced by a connection ∇E on E and the Levi-Civita connection on M. The map (dE)2 :Ωk(M; E) −→ Ωk+2(M; E) 36 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY is given by wedge product with a differential form RE ∈ Ω2(M; End E). Moreover, if V,W ∈ C∞(M; TM), we have E E E E E E R (V,W ) = ∇V ∇W − ∇W ∇V − ∇[V,W ]. This 2-form with values in End E is is known as the curvature of the con- nection ∇E. We say that the connection is flat if RE = 0. Proof. First consider the effect of applying ∇E and then ∇T ∗M⊗E to a section of E, E T ∗M⊗E C∞(M; E) −−−→C∇ ∞(M; T ∗M ⊗ E) −−−−−−−→C∇ ∞(M; T ∗M ⊗ T ∗M ⊗ E). By the Leibnitz rule defining the connection on T ∗M ⊗ E, we have, for all V,W ∈ C∞(M; TM), T ∗M⊗E E T ∗M⊗E E (∇ ∇ σ)(V,W ) = (∇V (∇ σ))(W ) = ∇E (∇E σ) − (∇Eσ)(∇ W ) = ∇E (∇E σ) − ∇E σ. V W V V W ∇V W We obtain dE by anti-symmetrizing, so we have (dE)2(σ)(V,W ) = ∇E (∇E σ) − ∇E σ − ∇E (∇E σ) + ∇E σ V W ∇V W W V ∇W V E E E E E = ∇V ∇W − ∇W ∇V − ∇[V,W ] σ. This is what we wanted to prove for k = 0, and the same computation replacing E with ΛkT ∗M ⊗ E proves the result for general k. The curvature of the Levi-Civita connection is referred to as the Riemannian curvature of (M, g). 2.6 Curvature As explained in the previous section, the curvature of a Riemannian manifold (M, g) is R ∈ Ω2(M; End(TM)), defined in terms of the Levi-Civita connection by R(V,W ) = ∇V ∇W − ∇W ∇V − ∇[V,W ]. Together with the obvious (anti)symmetry R(V,W ) = −R(W, V ), we have the fol- lowing: 2.6. CURVATURE 37 Proposition 3. Let (M, g) be a Riemannian manifold, and R the curvature of its Levi-Civita connection ∇, then for any V, W, X, Y ∈ C∞(M; TM) we have a) (First Bianchi Identity) R(V,W )X + R(W, X)V + R(X,V )W = 0 b) g(R(V,W )X,Y ) = −g(R(V,W )Y,X) c) g(R(V,W )X,Y ) = g(R(X,Y )V,W ) Proof. a) reduces by a straightforward computation to the Jacobi identity for the commutator of vector fields. b) is equivalent to proving that g(R(V,W )X,X) = 0, which we prove by noting that g(∇V ∇W X − ∇V ∇W X,X) = (V g(∇W X,X) − g(∇W X, ∇V X)) − (W g(∇V X,X) − g(∇V X, ∇W X)) 1 = [V,W ]g(X,X) = g(∇ X,X). 2 [V,W ] c) follows from the other symmetries. Starting with following consequences of the Bianchi identity: g(R(V,W )X + R(W, X)V + R(X,V )W, Y ) = 0 g(R(W, X)Y + R(X,Y )W + R(Y,W )X,V ) = 0 g(R(X,Y )V + R(Y,V )X + R(V,X)Y,W ) = 0 g(R(Y,V )W + R(V,W )Y + R(W, Y )V,X) = 0 we add all of these equations up, and use the other symmetries to see that we get 2g(R(X,V )W, Y ) + 2g(R(Y,W )X,V ) = 0 and hence g(R(X,V )W, Y ) = g(R(W, Y )X,V ). In local coordinates {y1, . . . , ym, ∂y1 , . . . , ∂ym } on the tangent bundle, the cur- ` vature is determined by the functions Rijk defined by ` R(∂i, ∂j)∂k = Rijk∂`. We can compute these functions explicitly in terms of the Christoffel symbols of the Levi-Civita connection, ` ` s ` s ` ` Rijk = ΓisΓjk − ΓjsΓik + ∂i(Γjk) − ∂j(Γik). The Riemannian curvature vanishes if and only if (M, g) is locally isometric to Euclidean space [Spivak,vol 2]. 38 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY There is a lot of complexity in the curvature tensor, so it is standard to focus on one of its contractions. Let p ∈ M and v, w ∈ TpM, the Ricci curvature at v, w, Ric(v, w), is the trace of the endomorphism of TpM z 7→ R(v, z)w. This defines a symmetric tensor Ric ∈ C∞(M; T ∗M ⊗ T ∗M). The metric is also a symmetric section of T ∗M ⊗ T ∗M, so we can compare these two tensors. We say that a metric is Einstein if there is a constant λ such that Ric = λg. The Ricci curvature corresponds to a linear self-adjoint map Ricf : C∞(M; TM) −→ C∞(M; TM) determined by g(Ric(f V ),W ) = Ric(V,W ). The scalar curvature of a Riemannian manifold scal ∈ C∞(M) is the trace of Ricf . Finally, the sectional curvature of a Riemannian manifold (M, g) is a func- tion that assigns to each 2-dimensional subspace of TpM a real number. Let Σ ⊆ TpM be a two dimensional space spanned by v, w ∈ TpM then g(R(v, w)v, w) K(Σ) = − g(v, v)g(w, w) − g(v, w)2 is independent of the choice of v, w. Notice that the denominator is the area of the parallelogram spanned by v and w. The sectional curvature determines the full curvature tensor. A manifold is said to have constant curvature if the sectional curvature is constant. If the sectional curvature at a point p ∈ M is constant and equal to K, then the Riemannian curvature at that point is given by g(R(v, w)x, z)p = K(g(w, x)pg(v, z)p − g(v, x)pg(w, z)p), the Ricci curvature at p is (m − 1)Kgp and the scalar curvature is m(m − 1)K. It turns out that if this is true at each point of M, with a priori a different K at each point, then in fact all of the K’s are equal and the manifold has constant curvature. The simply connected manifolds of constant curvature are, up to scale, just Euclidean space, the sphere, and hyperbolic space (see exercises 4 and 5). In two dimensions, the curvature is determined by the scalar curvature which, up to a factor of 2, coincides with the classical Gaussian curvature. In three dimen- sions, the curvature is determined by the Ricci curvature. 2.7. GEODESICS 39 In local coordinates {y1, . . . , ym, ∂y1 , . . . , ∂ym } on the tangent bundle, the Ricci curvature is determined by the functions Ricij s Ricij = Ric(∂i, ∂j) = Risj. The map Ricf is given in local coordinates by j j js Ric(f ∂i) = (Ric)f i ∂j, with (Ric)f i = g Ricis and hence the scalar curvature is given by k jk jk s scal = (Ric)f k = g Ricjk = g Rjsk. For the sake of intuition, recall what the sign of the Gaussian curvature tells us about a surface embedded in R3. Positive Gaussian curvature at a point means that the surface lies on a single side of its tangent plane at that point. Thus we can think of the sphere or a paraboloid. Negative Gaussian curvature at a point means that the surface locally looks like a saddle. Vanishing of the Gaussian curvature is true for a plane, but it is also true for a cylinder, since these are locally isometric. We should think of this as saying that there is at least one direction that looks Euclidean. For higher dimensional manifolds, these mental pictures are good descriptions of the behavior of sectional curvature. In fact, in the language of the next section, the sectional curvature of the space Σ ⊆ TpM is the Gaussian curvature of the surface in M carved out by geodesics through p in the directions in Σ, i.e., the image of Σ under the exponential map. 2.7 Geodesics A curve γ :[a, b] −→ M on a Riemannian manifold (M, g) is said to be a geodesic if, for all t, 0 ∇γ0(t)γ (t) = 0. 1 m In local coordinates, {y1, . . . , ym, ∂y1 , . . . , ∂ym }, in which γ(t) = (γ (t), . . . , γ (t)), this means that 2 k k i j (2.9) ∂t γ (t) + Γij∂tγ (t)∂tγ (t) = 0. This is a semi-linear second order system of ODEs for the coefficients γi(t), so we have the following existence result1 1 For more detail, see Lemma 10.2 of John Milnor, Morse theory. Annals of Mathematics Studies, No. 51 Princeton University Press, Princeton, N.J. 1963 40 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY Lemma 4. For every point p0 on a Riemannian manifold (M, g) there is a neigh- borhood U of p0 and numbers ε1, ε2 > 0 such that: For all p ∈ U and tangent vector v ∈ TpM with |v|g < ε there is a unique geodesic γ(p,v) :(−2ε2, 2ε2) −→ M, satisfying the initial conditions 0 (2.10) γ(p,v)(0) = p, γ(p,v)(0) = v. Furthermore, the geodesics depend smoothly on the initial conditions. The solution to the system (2.9) is unique up to its domain of definition. Since (2.9) is non-linear the geodesic may not be defined on all of R, but rather there is a maximal interval around zero, say (a, b), in which this solution is defined. With this domain, the solution is unique, so we say that for each (p, v) ∈ M there is a unique maximally extends geodesic satisfying (2.10). We will use γ(p,v) to refer to this maximally extended geodesic. Remark 4. If M is a submanifold of RN with the induced metric then a curve in M is a geodesic if and only if its acceleration as a curve in RN is normal to M. In particular, if the intersection of M with a subspace of RN that is everywhere orthogonal to M is a curve, then any constant speed parametrization of that curve must be a geodesic. The same is true replacing RN with a Riemannian manifold, X. If M ⊆ X is a Riemannian submanifold and all of the geodesics of M are also geodesics of X, we say that M is a totally geodesic submanifold. By uniqueness we have the following homogeneity property γ(p,λv)(t) = γ(p,v)(λt) for all λ ∈ R for which one of the two sides is defined. Thus by making ε1 smaller in the lemma, we can take, e.g., ε2 = 1. At each point p ∈ M we can define the exponential map by expp : U ⊆ TpM −→ M, expp(v) = γ(p,v)(1) where U is the subset of TpM of vectors v for which γ(p,v)(1) is defined. We know that U contains a neighborhood of the origin and that expp is smooth on U. Let {y1, . . . , ym, ∂y1 , . . . , ∂ym } be local coordinates for TM over a coordinate chart V of p. Consider the smooth function F : U −→ M × M, defined by F (p, v) = (p, expp(v)). 2.7. GEODESICS 41 Notice that, if {y1, . . . , ym, z1, . . . , zm} are the corresponding coordinates on V ×V ⊆ M × M, then DF : T U −→ TM × TM satisfies DF (∂ ) = ∂ + ∂ ,DF (∂ ) = ∂ yi yi zi (∂yi ) zi Id Id and hence has the form . In particular, we can apply the implicit function 0 Id theorem to F to conclude as follows: Lemma 5. For each p ∈ M there is a neighborhood U of M and a number ε > 0 such that: a) Any two points p, q ∈ U are joined by a unique geodesic in M of length less than ε. b) This geodesic depends smoothly on the two points. c) For each q ∈ U the map expq maps the open ε-ball in TqM diffeomorphically onto an open set Uq containing U. Since expp is a diffeomorphism from Bε(0) ⊆ TpM into M, it defines a coordi- nate chart on M. These coordinates are known as normal coordinates. At any point p ∈ M, one can show that if v ∈ TpM is small enough, the distance (2.4) between p and exp(v) is |v|g. In particular d(p, q) = 0 =⇒ p = q and hence (M, d) is a metric space. A very nice property of a normal coordinate system is that near the origin the coefficients of the metric agree to second order with the identity matrix up to second order. More precisely2: Proposition 6. Let (M, g) be a Riemannian manifold, p ∈ M, and let {y1, . . . , ym} be normal coordinates centered at p. Then we have k gij(0) = δij, Γij(0) = 0, 2 2 2 2 1 ∂ gj` ∂ gjk ∂ gik ∂ gi` Rijk`(0) = − + − 2 ∂yi∂yk ∂yi∂y` ∂yj∂y` ∂yj∂yk and the Taylor expansion of the functions gij at the origin has the form 1 X g (y) ∼ δ − R y y + O(|y|3). ij ij 3 ikj` k ` k,` 2See, e.g., Proposition 1.28 of Nicole Berline, Ezra Getzler, Michele` Vergne Heat kernels and Dirac operators. Corrected reprint of the 1992 original. Grundlehren Text Edi- tions. Springer-Verlag, Berlin, 2004. x+363 pp. ISBN: 3-540-20062-2 42 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY 2.8 Complete manifolds of bounded geometry At each point p in a Riemannian radius, the injectivity radius at p, injM (p), is the largest radius R such that expp : BR(0) ⊆ TpM −→ M is a diffeomorphism. The injectivity radius of M is defined by injM = inf{injM (p): p ∈ M}. By Lemma 5, a compact manifold has a positive injectivity radius. We say that a manifold is complete if there is a point p ∈ M for which expp is defined on all of TpM (though it need not be injective on all of TpM). The nomenclature comes from the Hopf-Rinow theorem, which states that the following assertions are equivalent: a) The metric space (M, d), with d defined by (2.4), is complete. b) Any closed and bounded subset of M is compact. c) There is a point p ∈ M for which expp is defined on all of TpM. d) At any point p ∈ M, expp is defined on all of TpM. Moreover in this case one can show that any two points p, q ∈ M can be joined by a geodesic γ with d(p, q) = `(γ). A Riemannian manifold (M, g) is called a manifold with bounded geometry if: 1) M is complete and connected 2) injM > 0 3) For every k ∈ N0, there is a constant Ck such that k |∇ R|g ≤ Ck. Every compact Riemannian manifold has bounded geometry. Many non- compact manifolds, such as Rm and Hm, have bounded geometry. It is often the case that constructions that one can do on Rm can be carried out also on manifolds of bounded geometry but that one runs into trouble trying to carry them out on manifolds that do not have bounded geometry. 2.9 Riemannian volume and formal adjoint On Rm with the Euclidean metric and the canonical orientation, there is a natural choice of volume form dvolRm = dy1 ∧ · · · ∧ dym. 2.9. RIEMANNIAN VOLUME AND FORMAL ADJOINT 43 This assigns unit volume to the unit square. The volume of a parallelotope generated by m-vectors z1, . . . , zm is equal to the determinant of the change of basis matrix from y to z, which we can express as the ‘Gram determinant’ q det(zi · xj) . It turns out that a Riemannian metric g on an orientable manifold M also determines a natural volume form, dvolg, determined by the property that, at any p ∈ M and any local coordinates in which gij(p) = δij we have dvolg(p) = dvolRm . In arbitrary oriented local coordinates, x1, . . . , xm, dvol(p) should be the volume of the parallelotope generated by ∂xi and hence q dvolg(p) = det(gij(p)) dx1 ∧ · · · ∧ dxm. The volume form is naturally associated to the metric in that, e.g., ∗ dvolf ∗g = f dvolg and one can check that the volume form is covariantly constant with respect to the Levi-Civita connection. Given a metric gE on a bundle E −→ M and a Riemannian metric on M, we obtain an inner product on sections of E by setting: ∞ ∞ h·, ·iE : C (M; E) × C (M; E) −→ R, Z ∞ hσ, τiE = gE(σ, τ) dvolg, for all σ, τ ∈ C (M; E). M ∞ This is a non-degenerate, positive definite inner product on C (M; E) because gE is a non-degenerate positive definite inner product on each Ep. Using this inner product, we can define the Lp-norms in the usual way Z 1/p ∞ p k·k p : C (M; E) −→ , ksk p = |s| dvol L (M;E) R L (M;E) gE g M p p and then define the L space of sections of E,L (M; E), by taking the k·kLp(M;E)- closure of the smooth functions with finite Lp(M; E)-norm. These are Banach spaces, and we will mostly work with L2(M; E), which is also a Hilbert space. If we have a Riemannian manifold (M, g) and two vector bundles E −→ M, F −→ M with bundle metrics, we say that two linear operators T : C∞(M; E) −→ C∞(M; F ),T ∗ : C∞(M; F ) −→ C∞(M; E) 44 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY are formal adjoints if they satisfy ∗ ∞ ∞ hT σ, τiF = hσ, T τiE, for all σ ∈ Cc (M; E), τ ∈ Cc (M; F ). (We use the word formal because we are not working with a complete vector space like L2 or Lp but just with smooth functions, this avoids some technicalities.) As a simple example, the formal adjoint of a scalar differential operator on R is easy to compute ∗ X k X k k ∞ ak(x)∂x u(x) = (−1) ∂x(ak(x)u(x)), for all u ∈ C (R). The formal adjoint of any differential operator is similarly computed using integra- tion by parts in local coordinates (the compact supports allow us not to worry about picking up ‘boundary terms’). We say that a linear operator is formally self-adjoint if it coincides with its formal adjoint. For instance, if T and T ∗ are formal adjoints then TT ∗ and T ∗T are formally self-adjoint. Moreover, these operators are positive semidefinite since on compactly supported sections hT ∗T s, si = hT s, T si = |T s|2. In the next section we will compute the formal adjoints of d and ∇E, which will lead to natural Laplace-type operators. 2.10 Hodge star and Hodge Laplacian We have already looked at the space of differential forms on M and seen that the smooth structure on M induces a lot of structure on Ω∗(M), such as the exterior product, interior product and exterior derivative. Now we suppose that (M, g) is a Riemannian manifold and we will see that there is still more structure on Ω∗(M). First, we have already noted above that a metric on M induces metrics on associated vector bundles, such as Λ∗T ∗M. It is easy to describe this metric in terms of the metric on T ∗M when dealing with elementary k-forms, namely 1 k 1 k i j i j ∗ gΛkT ∗M (ω ∧ · · · ∧ ω , θ ∧ · · · θ )p = det gT ∗M (ω , θ )p , for all {ω , θ } ⊆ Tp M. On any oriented Riemannian m-manifold, we define the Hodge star operator ∗ :Ωk(M) −→ Ωm−k(M) 2.10. HODGE STAR AND HODGE LAPLACIAN 45 by demanding, for all ω, η ∈ Ωk(M), ω ∧ ∗η = gΛkT ∗M (ω, η) dvolg . Thus, for example, in R4 with Euclidean metric and canonical orientation, with y1, . . . , y4 an orthonormal basis, ∗1 = dy1 ∧ dy2 ∧ dy3 ∧ dy4 ∗dy1 = dy2 ∧ dy3 ∧ dy4, ∗dy2 = −dy1 ∧ dy3 ∧ dy4, ∗dy1 ∧ dy2 = dy3 ∧ dy4, ∗dy2 ∧ dy4 = −dy1 ∧ dy3 ∗dy1 ∧ dy2 ∧ dy3 = dy4, ∗dy1 ∧ dy2 ∧ dy3 ∧ dy4 = 1 and so on. Note that ∗ is symmetric in that ω ∧ ∗η = gΛkT ∗M (ω, η) dvolg = gΛkT ∗M (η, ω) dvolg = η ∧ ∗ω. The Hodge star is almost an involution, ω ∈ Ωk(M) =⇒ ∗(∗ω) = (−1)k(m−k)ω. The Hodge star acts on differential forms and we have Z hω, ηiΛ∗T ∗M = ω ∧ ∗η. M This lets us get a formula for the formal adjoint of the exterior derivative, ∗ k−1 k d = δ. Namely, whenever ω ∈ Ωc (M), and η ∈ Ωc (M), Z Z k−1 hdω, ηiΛkT ∗M = (dω) ∧ ∗η = d(ω ∧ ∗η) − (−1) ω ∧ d ∗ η M M Z Z = (−1)k ω ∧ d ∗ η = (−1)k ω ∧ (−1)(m−k+1)(k−1) ∗2 d ∗ η M M m(k+1)+1 = hω, (−1) ∗ d ∗ ηiΛk−1T ∗M and so, when acting on Ωk(M), δ = d∗ = (−1)m(k+1)+1 ∗ d∗ = (−1)k ∗−1 d ∗ . A form is called coclosed if it is in the null space of δ. (Functions are thus auto- matically coclosed.) In particular on one-forms we have ω ∈ Ω1(M) =⇒ δω = − ∗ d ∗ ω, 46 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY 1 and, if ω ∈ Ωc (M), then Stokes’ theorem (on manifolds with boundary) says that Z Z Z Z δ(ω) dvolg = − ∗d ∗ ω dvolg = − d(∗ω) = − ∗ω. M M M ∂M Hence on manifolds without boundary, Z δ(ω) dvolg = 0. M The Hodge Laplacian on k-forms is the differential operator k k ∆k :Ω (M) −→ Ω (M), ∆k = dδ + δd. Note that this operator is formally self-adjoint. Elements of the null space of the Hodge Laplacian are called harmonic differential forms, and we will use the notation k k H (M) = {ω ∈ Ω (M) : ∆kω = 0} for the vector space of harmonic k-forms. A form that is closed and coclosed is automatically harmonic. Conversely, if k ω ∈ Ωc (M) is compactly supported and harmonic, then 0 = h∆ω, ωiΛkT ∗M = hdω, dωiΛk+1T ∗M + hδω, δωiΛk−1T ∗M and so ω is both closed and coclosed. In particular, if M is compact, we have a natural map k k H (M) −→ HdR(M) since each harmonic form is closed. We will show later that this map is an isomor- phism by showing that every de Rham cohomology class on a compact manifold contains a unique harmonic form. Since d2 = 0 and δ2 = 0, we can also write the Hodge Laplacian as 2 ∆k = (d + δ) . Also, notice that ∗∆k = ∆m−k∗, d∆k = dδd = ∆k+1d, δ∆k = δdδ = ∆k−1δ and hence ω is harmonic implies dω and δω harmonic, and is equivalent to ∗ω harmonic. In particular, Hk(M) = Hm−k(M), 2.11. EXERCISES 47 which we call Poincar´eduality for harmonic forms. Let us describe the Laplacian on functions ∆ = ∆0 in a little more detail. Since δ vanishes on functions, we have ∆ = δd = − ∗ d ∗ d. A closed function is necessarily a constant, so on compact manifolds all harmonic functions are constant. In local coordinates, {y1, . . . , ym}, the Laplacian is given by 1 ∂ p ij ∂f ij ij k ∆f = −p det gij g = − g ∂i∂j − g Γij∂k f. det gij ∂yj ∂yi On Euclidean space our Laplacian is m X ∂2f ∆ = − ∂y2 i=1 i which is, up to a sign, the usual Laplacian on Rm. With our sign convention, the Laplacian is sometimes called the ‘geometer’s Laplacian’ while the opposite sign convention yields the ‘analyst’s Laplacian’. The Laplacian on functions on a Rie- mannian manifold is sometimes called the Laplace-Beltrami operator to emphasize that there is a Riemannian metric involved. 2.11 Exercises Exercise 1. Show that if (M1.g1) and (M2, g2) are Riemannian manifolds, then so is (M1 × M2, g1 + g2). Exercise 2. It is tempting to think that the interior product and the exterior derivative will define a covariant derivative on T ∗M without having to use a metric, say by ∞ ∗ ∞ ∇e V ω = V y dω, for all ω ∈ C (M; T M),V ∈ C (M; TM). Show that this is not a covariant derivative on T ∗M. Show that the exterior derivative does define a connection on the trivial bundle R × M −→ M. Exercise 3. Show that parallel transport determines the connection. 48 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY Exercise 4. Consider the 2-dimensional sphere of radius r centered at the origin of R3, 2 3 Sr = {(y1, y2, y3) ∈ R : |y| = r}. 3 2 The Euclidean metric on R induces the ‘round’ metric on Sr. Compute the sectional curvatures of this metric. Hint: Use the spherical coordinates (θ, ψ) that parametrize the sphere via φ(θ, φ) = (r cos θ cos ψ, r cos θ sin φ, r sin θ). Notice that any curve on S2 can be parametrized γ(t) = (θ(t), ψ(t)) and the length of γ0(t) is rp(θ0)2 + (ψ0)2 cos2 θ . 2 2 2 2 2 2 Hence the round metric in these coordinates is gSr = r dθ + r cos θdψ . Exercise 5. m One model of hyperbolic space of dimension m, (H , gHm ) is the space m m R+ = {(x, y1, . . . , ym−1) ∈ R : x > 0} with the metric dx2 + |dy|2 g m = . H x2 Compute the Christoffel symbols and the sectional curvatures of Hm. Exercise 6. (Riemann’s formula) Let K be a constant. Show that a neighborhood of the origin m in R with coordinates {y1, . . . , ym} endowed with the metric |dy|2 K 22 1 + 4 |y| has constant curvature K. Exercise 7. Use stereographic projection to write the metric on the sphere as a particular case of Riemann’s formula. Exercise 8. Consider the flat torus Tm = Rm/Zm with the metric it inherits from Rm. Compute the curvature and geodesics of Tm. Exercise 9. If γ(t) is a geodesic on a Riemannian manifold show that γ0(t) has constant length. 2.11. EXERCISES 49 Exercise 10. a) Determine the geodesics on the sphere. (Hint: Remark 4.) b) All of the points of the sphere have the same injectivity radius, what is it? c) Consider the geodesic triangle on the sphere S2 with a vertex at the ‘north pole’ and other vertices on the ‘equator’, suppose that at the north pole the edge form an angle θ. Describe the parallel translation map along this triangle. Hint: Because the sides are geodesics, any vector parallel along the sides forms a constant angle with the tangent vector of the edge. Exercise 11. Give an example of two curves on a Riemannian manifold with the same endpoints and with different parallel transport maps. Exercise 12. a) Show that any differential k-form α on M × [0, 1] has a ‘standard decomposition’ α = αt(s) + ds ∧ αn(s) where ds is the unique positively oriented unit 1-form on [0, 1] and, for each s ∈ [0, 1], αt(s) is a differential k-form on M and αn(s) is a differential (k − 1)-form on M. What is the standard decomposition of dα in terms of αt and αn? b) For each s ∈ [0, 1] let is : M −→ M × [0, 1], is(p) = (p, s). We can think of the pull-back of a differential form by is as the ‘restriction of a form on M × [0, 1] to M × {s}’. Consider the map π∗ that takes k-forms on M × [0, 1] to (k − 1)-forms on M given by Z 1 π∗(α) = αn(s) ds. 0 Show that for any differential k-form α on M × [0, 1] ∗ ∗ i1α − i0α = dπ∗α + π∗(dα). c) Show that whenever [0, 1] 3 t 7→ ωt is a smooth family of closed k-forms on a manifold M, the de Rham cohomology class of ωt is independent of t. Exercise 13. (Poincar´e’sLemma) A manifold M is (smoothly) contractible if there is a point p0 ∈ M, and a smooth map H : M × [0, 1] −→ M such that H(p, 1) = p, H(p, 0) = p0 for all p ∈ M. 50 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY H is known as a ‘contraction’ of M. a) Show that Rn is contractible and any manifold is locally contractible (i.e, every point has a contractible neighborhood). b) Let M be a contractible smooth n-dimensional manifold, and let ω be a closed differential k-form on M (i.e., dω = 0). Prove that there is a (k − 1)-form η on M such that dη = ω. ∗ ∗ ∗ (Hint: Let H be a contraction of M, and let α = H ω. Then ω = (H ◦ i1) ω = i1α ∗ and i0α = 0. Check that η = π∗α works.) Exercise 14. Let M be an n-dimensional manifold with boundary ∂M. A (smooth) retraction of M onto its boundary is a smooth map φ : M −→ ∂M such that φ(ζ) = ζ, whenever ζ ∈ ∂M. Prove that if M is compact and oriented, then there do not exist any retractions of M onto its boundary. R (Hint: Let ω be an (n − 1)-form on ∂M with ∂M ω > 0 (how do we know there is such an ω?), notice that dω = 0 (why?). Assume that φ is a retraction of M onto its boundary, and apply Stokes’ theorem to φ∗ω to get a contradiction.) If φ : M −→ ∂M is a continuous retraction, one can approximate it by a smooth retraction, so there are in fact no continuous retractions from M to ∂M. Exercise 15. (Brouwer fixed point theorem) Let B be the closed unit ball in Rn, show that any continuous map F : B −→ B has a fixed point (i.e., there is a point ζ ∈ B such that F (ζ) = ζ). (Hint: If F : B −→ B is a smooth map without any fixed points, let φ(ζ) be the point on ∂B where the ray starting at F (ζ) and passing through ζ hits the boundary. Show that φ is continuous and then apply the continuous version of the previous exercise.) Exercise 16. i Let Vi be a local orthonormal frame for TM with dual coframe θ . Show that X X i d = eθ ∇Vi , δ = − iVi ∇Vi . 2.12 Bibliography My favorite book for Riemannian geometry is the eponymous book by Gallot, Hulin, and Lafontaine. Some other very good books are Riemannian geometry and geometric analysis by Jost, and Riemannian geometry by Petersen. 2.12. BIBLIOGRAPHY 51 Perhaps the best place to read about connections is the second volume of Spi- vak, A comprehensive introduction to differential geometry which includes transla- tions of articles by Gauss and Riemann. Spivak does a great job of presenting the various approaches to connections and curvature. In each approach, he explains how to view the curvature as the obstruction to flatness. An excellent concise introduction to Riemannian geometry, which we follow when discussing geodesics, can be found in Milnor, Morse theory. Another excel- lent treatment is in volume one of Taylor, Partial differential equations. 52 LECTURE 2. A BRIEF INTRODUCTION TO RIEMANNIAN GEOMETRY Lecture 3 Laplace-type and Dirac-type operators 3.1 Principal symbol of a differential operator Let M be a smooth manifold, E −→ M,F −→ M real vector bundles over M, and D a differential operator of order ` acting between sections of E and sections of F, D ∈ Diff`(M; E,F ). In any coordinate chart V of M that trivializes both E and F, D is given by X α D = aα(y)∂y |α|≤` ∞ where {y1, . . . , ym} are the coordinates on V and aα ∈ C (V; Hom(E,F ) V ). This expression for D changes quite a bit if we choose a different set of coor- dinates. However it turns out we can make invariant sense of the ‘top order part’ of D, X α aα(y)∂y , |α|=` as a constant coefficient differential operator on each fiber of the tangent space of M. It is more common to take the dual approach and interpret this as a function on 1 m the cotangent bundle of M as follows: Let {y1, . . . , ym, dy , . . . , dy } be the induced coordinates on the cotangent bundle of M, π : T ∗M −→ M, over V and define the j principal symbol of D at the point ξ = ξjdy of the cotangent bundle, to be ` X α ∞ ∗ ∗ σ`(D)(y, ξ) = i aα(y)ξ ∈ C (T M; π Hom(E,F )). |α|=` That is, we just ignore all the terms in the expression for the differential operator 53 54 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS of order less than ` and replace differentiation with a polynomial in the cotangent variables. Why the factor of i? A key fact that we will exploit later is that the Fourier n transform on R interchanges differentiation by yj with multiplication by iξj, Z Z −iy·ξ −iy·ξ e ∂yj f(y) dy = e (iξj)f(y) dy. Rn Rn Of course here one should be working with complex valued functions, and soon we will be. For the moment, just think of the power of i as a formal factor. Another way of defining the principal symbol, which has the advantage of ∗ being obviously independent of coordinates, is: At each ξ ∈ Tp M, choose a function f such that df(p) = ξ and then set −` −itf itf (3.1) σ`(D)(p, ξ) = lim t e ◦ D ◦ e (p) ∈ Hom(Ep,Fp). t→∞ By the Leibnitz rule, e−itf ◦ D ◦ e−itf is a polynomial in t of order ` and the top order term at p is of the form ` X α1 αm ` X α1 αm (it) aα(p)(∂y1 f)(p) ··· (∂ym f)(p) = (it) aα(p)ξ1 ··· ξm |α|=` |α|=` which shows that (3.1) is independent of the choice of f and hence well-defined. Notice that the principal symbol of a differential operator is not just a smooth function on T ∗M, but in fact a homogeneous polynomial of order ` on each fiber of T ∗M, we will denote these functions by ∗ ∗ ∞ ∗ ∗ P`(T M; π Hom(E,F )) ⊆ C (T M; π Hom(E,F )). For an operator of order one or two, it is easy to carry out the differentiations in (3.1) and see that 1 1 σ (D)(p, ξ) = [D, f] , σ (D)(p, ξ) = − [[D, f] , f] 1 i 2 2 for any f such that df(p) = ξ. A similar formula holds for higher order operators. The principal symbol of order ` vanishes if and only if the operator is actually of order ` − 1, so we have a short exact sequence `−1 ` σ` ∗ ∗ 0 −→ Diff (M; E,F ) −→ Diff (M; E,F ) −−→ P`(T M; π Hom(E,F )) −→ 0. One very important property of the principal symbol map is that it is a homomor- phism. That is, if ∞ ∞ ∞ ∞ D1 : C (M; E1) −→ C (M; E2),D2 : C (M; E2) −→ C (M; E3) 3.1. PRINCIPAL SYMBOL OF A DIFFERENTIAL OPERATOR 55 are differential operators of orders `1 and `2 respectively, then ∞ ∞ D2 ◦ D1 : C (M; E1) −→ C (M; E3) is a differential operator with symbol σ`1+`2 (D2 ◦ D1)(p, ξ) = σ`2 (D2)(p, ξ) ◦ σ`1 (D1)(p, ξ), as follows directly from (3.1). This is especially useful for scalar operators Ei = R since then multiplication ∗ in P`1+`2 (T M) is commutative, so we always have σ`1 (D1)(p, ξ) ◦ σ`2 (D2)(p, ξ) = σ`2 (D2)(p, ξ) ◦ σ`1 (D1)(p, ξ), even though D1 ◦ D2 is generally different from D2 ◦ D1. It is also true that the symbol map respects adjoints in that, if D and D∗ are formal adjoints, then ∗ ∗ σ`(D ) = σ`(D) where on the right we take the the adjoint in Hom, so in local coordinates just the matrix transpose. We can use (3.1) to find the symbols of d and δ by noting that [d, f] ω = d(fω) − fdω = edf ω, [δ, f] ω = δ(fω) − fδω = −idf ] ω hence k ∗ k+1 ∗ σ1(d)(p, ξ) = −ieξ :Λ Tp M −→ Λ Tp M k ∗ k−1 ∗ σ1(δ)(p, ξ) = iiξ] :Λ Tp M −→ Λ Tp M. Then the symbol of the Hodge Laplacian is 2 (3.2) σ2(∆k)(p, ξ) = (σ1(d)σ1(δ) + σ1(δ)σ1(d)) (p, ξ) = eξiξ] + iξ] eξ = |ξ|g k ∗ 2 i.e., it is given by multiplying vectors in Λ Tp M by the scalar |ξ|g. A differential operator is called elliptic if σ`(D)(p, ξ) is invertible at all ξ 6= 0. Hence ∆k is an elliptic differential operator, d and δ are not elliptic, and d ± δ are elliptic. The main aim of this course is to understand elliptic operators. One particularly interesting property of an elliptic operator D on a compact manifold is that its ‘index’ ind D = dim ker D − dim ker D∗ is well-defined (in that both terms in the difference are finite) and remarkably stable, as we will show below. Before doing that, however, we will start by constructing examples of elliptic operators. 56 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS 3.2 Laplace-type operators A second-order differential operator on a Rk-bundle E −→ M over a Riemannian ∗ manifold (M, g) is of Laplace-type if its principal symbol at ξ ∈ Tp M is scalar 2 multiplication by |ξ|g. Thus by (3.2) the Hodge Laplacian ∆k is, for every k, a Laplace-type operator. A general Laplace-type operator H can be written in any local coordinates as ij k k ∞ H = −g ∂yi ∂yj + a ∂yk + b, with a , b ∈ C (V; Hom(E,E)). If E −→ M is a vector bundle with connection ∇E, then we can form the composition E T ∗M⊗E C∞(M; E) −−−→C∇ ∞(M; T ∗M ⊗ E) −−−−−−−→C∇ ∞(M; T ∗M ⊗ T ∗M ⊗ E) and then take the trace over the two factors of T ∗M to define the connection Laplacian, ∆∇ = − Tr(∇T ∗M⊗E∇E): C∞(M; E) −→ C∞(M; E). (This is also known as the Bochner Laplacian.) Explicitly, we have (∇T ∗M⊗E∇Es)(V,W ) = ∇E ∇E − ∇E s V W ∇V W and so in local coordinates we have ∆∇s = − gij∇E ∇E − gijΓk ∇E s. ∂yi ∂yj ij ∂yk Thus this is a Laplace-type operator. Directly from the formula in local coordinates we see that the Laplacian on functions is the connection Laplacian for the Levi-Civita connection ∆∇ = ∆. It turns out that any Laplace-type operator on a bundle E −→ M is equal to a connection Laplacian plus a zero-th order differential operator1. Proposition 7. Let (M, g) be a Riemannian manifold, E −→ M a vector bundle, and H a Laplace-type operator on sections of E. There exists a unique connection ∇E on E and a unique section Q ∈ C∞(M; Hom(E)) such that H = ∆∇ + Q. 1For the proof of this proposition see, e.g., Proposition 2.5 of Nicole Berline, Ezra Get- zler, Michele` Vergne Heat kernels and Dirac operators. Corrected reprint of the 1992 original. Grundlehren Text Editions. Springer-Verlag, Berlin, 2004. x+363 pp. ISBN: 3-540-20062-2 3.3. DIRAC-TYPE OPERATORS 57 3.3 Dirac-type operators The physicist P.A.M. Dirac constructed first-order differential operators whose squares were Laplace-type operators (albeit in ‘Lorentzian signature’) for the purpose of ex- tending quantum mechanics to the relativistic setting. After these operators were rediscovered by Atiyah and Singer, they were christened Dirac operators and later generalized to Dirac-type operators. Let E −→ M and F −→ M be Rk-vector bundles over a Riemannian mani- fold (M, g). A first order differential operator D ∈ Diff1(M; E,F ) is a Dirac-type operator if D∗D and DD∗ are Laplace-type operators. If E = F and D = D∗, we say that D is a symmetric Dirac-type operator. If D ∈ Diff1(M; E,F ) is a Dirac-type operator, then 0 D∗ D = ∈ Diff1(M; E ⊕ F ) e D 0 is a symmetric Dirac-type operator. We have seen one example of a symmetric Dirac-type operator so far, the de Rham operator: m M d + δ ∈ Diff1(M;Ω∗(M)), where Ω∗(M) = Ωk(M) = C∞(M;Λ∗T ∗M) k=0 Indeed, we know that (d + δ)∗ = d + δ and (d + δ)2 = ∆, the Hodge Laplacian on forms of all degrees. We can use the symbol of an arbitrary symmetric Dirac operator D to uncover some algebraic structure on E. For every p ∈ M, and ξ ∈ TpM, let θ(p, ξ) = σ1(D)(p, ξ): Ep −→ Ep, and, when the point p is fixed, let θ(ξ) = θ(p, ξ). Since D is symmetric, we have θ(ξ) = θ(ξ)∗ and since it is a Dirac-type operator, we have θ(ξ)2 = g(ξ, ξ) Id . ∗ We can ‘polarize’ this identity by applying it to ξ + η ∈ TpM , θ(ξ + η)2 = θ(ξ)2 + θ(ξ)θ(η) + θ(η)θ(ξ) + θ(η)2 = g(ξ + η, ξ + η) Id = (g(ξ, ξ) + 2g(ξ, η) + g(η, η)) Id 58 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS and hence θ(ξ)θ(η) + θ(η)θ(ξ) = 2g(ξ, η) Id . More generally, suppose we have two vector spaces V and W, a quadratic form q on V, and a linear map cl : V −→ End(W ) such that2 (3.3) cl (v) cl (v) = q(v) Id . We call cl Clifford multiplication by elements of V. 1 As above, we can polarize (3.3). Define q(v, w) = 2 (q(v + w) − q(v) − q(w)) and apply (3.3) to v + w to obtain (3.4) cl (v) cl (w) + cl (w) cl (v) = 2q(v, w) Id . The map cl canonically extends to the Clifford algebra of (V, q), which is defined as follows. First let O V = R ⊕ V ⊕ (V ⊗ V ) ⊕ (V ⊗ V ⊗ V ) ⊕ ... and note that the map cl extends to a map clb : N V −→ End(W ). This extension vanishes on elements of the form v ⊗ w + w ⊗ v − 2q(v, w) Id and hence on the ideal generated by these elements n O o Icl (V ) = span κ ⊗ (v ⊗ w + w ⊗ v − 2q(v, w) Id) ⊗ χ : κ, χ ∈ V . The quotient is the Clifford algebra of (V, q), O Cl(V, q) = V/Icl (V ), it has an algebra structured induced by the tensor product in N V which satisfies v · w + w · v = 2q(v, w) whenever v, w ∈ V. By construction, Cl(V, q) has the universal property that any linear map cl : V −→ End(W ) satisfying (3.3) extends to a unique algebra homo- morphism cl : Cl(V, q) −→ End(W ), with cl (1) = Id . 2Many authors use a different convention, namely cl (v) cl (v) = −q(v) Id . 3.4. CLIFFORD ALGEBRAS 59 This universal property makes the Clifford algebra construction functorial. Indeed, if (V, q) and (V 0, q0) are vector spaces with quadratic forms and T : V −→ V 0 is a linear map such that T ∗q0 = q then there is an induced algebra homomorphism Te : Cl(V, q) −→ Cl(V 0, q0). Uniqueness of Te shows that T]◦ S = Te ◦ Se for any quadratic-form preserving linear map S. One advantage of functoriality is that we can extend this construction to vector bundles: Given a vector bundle F −→ M and a bundle metric gF there is a vector bundle Cl(F, gF ) −→ M whose fiber over p ∈ M is Cl(Fp, gF (p)). In particular, coming back to our first order Dirac-type operator D ∈ C∞(M; E), we can conclude that its principal symbol σ1(D) induces an algebra bundle morphism (3.5) cl : Cl(T ∗M, g) −→ End(E) with cl (1) = Id . A bundle E −→ M together with a map (3.5) is called a Clifford module. (It should really be called a bundle of Clifford modules, but the abbreviation is common.) In the case of the de Rham operator we have ∗ ∗ ∗ ∗ σ1(d + δ)(p, ξ) = −i(eξ − iξ] ):Λ Tp M −→ Λ Tp M, and an induced algebra homomorphism ∗ ∗ ∗ (3.6) cl : Cl(T M, g) −→ End(Λ Tp M). 3.4 Clifford algebras Let us consider the structure of the algebra Cl(V, q) associated to a quadratic form q on V. If e1, . . . , ek is a basis for V then any element of Cl(V, q) can be written as a polynomial in the ei. In fact, using (3.4) we can choose the polynomial so that no exponent larger than one is used, and so that the ei occur in increasing order; thus any ω ∈ Cl(V, q) can be written in the form X α1 α2 αm (3.7) ω = aαe1 · e2 ··· em αj ∈{0,1} 60 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS with real coefficients aα. This induces a filtration on Cl(V, q), Cl(k)(V, q) = {ω as in (3.7), with |α| ≤ k}, which is consistent with the algebraic structure, Cl(k)(V, q) · Cl(`)(V, q) ⊆ Cl(k+`)(V, q). The map (3.6), which there was induced by the symbol of the de Rham oper- ator, always exists, M : Cl(V, q) −→ End(Λ∗V ). There is a distinguished element 1 ∈ Λ∗V, and evaluation at one induces Mf : Cl(V, q) −→ Λ∗V, Mf(ω) = M(ω)(1). Clearly if we think of v ∈ V as an element of Cl(V, q), we have Mf(v) = v. Moreover, comparing the anti-commutation rules in Cl(V, q) and Λ∗V, we can show that Mf is an isomorphism of vector spaces. In fact, Λ∗V is the graded algebra associated to the filtered algebra Cl(V, q), i.e., Cl(k)(V, q)/ Cl(k−1)(V, q) ∼= ΛkV. If we set q = 0, then we recognize Cl(V, 0) = Λ∗V as algebras. In general the algebras Cl(V, q) and Λ∗V only coincide as vector spaces. Let us identify a few of the low-dimensional Clifford algebras. From the vector space isomorphism Cl(V, q) ∼= Λ∗V we know that dim Cl(V, q) is a power of two. Let us agree that R itself, with its usual algebra structure, is the unique one-dimensional Clifford algebra. In dimension two, let us identify Cl(R, gR) and Cl(R, −gR). Let e1 be a unit vector in R, so that both of these Clifford algebras are generated by 1, e1. In the second case, where the quadratic form is minus the usual Euclidean norm, we have 2 e1 = −1, and it is easy to see that ∼ Cl(R, −gR) = C as R-algebras. In the first case, where the quadratic form is the usual Euclidean 1 1 norm, the elements v = 2 (1 + e1) and w = 2 (1 − e1) satisfy v · w = 0, v · v = v, w · w = w and we can identify ∼ Cl(R, gR) = R ⊕ R 3.4. CLIFFORD ALGEBRAS 61 as R-algebras, where the summands consist of multiples of v and w, respectively. 2 2 Next consider Cl(R , ±gR2 ). Let e1, e2 be an orthonormal basis of (R , gR2 ). 2 Using the universal property, we can define an isomorphism f between Cl(R , ±gR2 ) and a four dimensional algebra A by specifying the images of e1 and e2 and verifying that f(ei) · f(ei) = ±1, i ∈ {1, 2}. 2 For Cl(R , gR2 ) the map 0 1 1 0 e 7→ , e 7→ 1 1 0 2 0 −1 extends linearly to an algebra isomorphism 2 ∼ 2 Cl(R , gR2 ) = End(R ). 2 Similarly, we can identify Cl(R , −gR2 ) with the quaternions by extending the map e1 7→ i, e2 7→ j (so, e.g., e1 · e2 maps to k). Thus (real) Clifford algebras generalize both the complex numbers and the quaternions. It is possible to compute all of the Clifford algebras of finite dimensional vector spaces3 and an incredible fact is that they recur (up to ‘suspension’) with periodicity eight! It is useful to consider the complexified Clifford algebra Cl(V, q) = C ⊗ Cl(V, q) as these complex algebras have a simpler structure than Cl(V, q). In particular, we next show that they repeat with periodicity two. m Let us restrict attention to V = R and q = gRm , and abbreviate m m Cl(R , gRm ) = Cl(m), Cl(R , gRm ) = Cl(m). Proposition 8. There are isomorphisms of complex algebras ∼ 2 ∼ 2 Cl(1) = C , Cl(2) = End(C ) ∼ and Cl(m + 2) = Cl(m) ⊗ Cl(2) and so ∼ 2k ∼ 2k 2k Cl(2k) = End(C ), Cl(2k + 1) = End(C ) ⊕ End(C ). 3See for instance, Lawson, H. Blaine, Jr. and Michelsohn, Marie-Louise Spin geometry. Princeton Mathematical Series, 38. Princeton University Press, Princeton, NJ, 1989. 62 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS 2 Proof. The first two identities are just the complexifications of Cl(R, gR) and Cl(R , gR2 ) computed above. To prove periodicity, embed Rm+2 into Cl(m) ⊗ Cl(2) by picking an orthonor- mal basis {e1, . . . , em+2} and taking ej 7→ icl (ej) ⊗ cl (em+1)cl (em+2) for 1 ≤ j ≤ m, ej 7→ 1 ⊗ cl (ej) for j > m. Then the universal property of Cl(m + 2) leads to the isomorphism above. Given a vector space with quadratic form (V, q), let e1, . . . , em be an oriented orthonormal basis and define the chirality operator m d e Γ = i 2 e1 ··· em ∈ Cl(V, q). This is an involution (i.e., Γ2 = Id), and is independent of the choice of oriented orthonormal basis used in its definition. Moreover, Γ · v = −v · Γ if dim V even, Γ · v = v · Γ if dim V odd. From the explicit description of Cl(m), we can draw some immediate conse- quences about the representations of this algebra4. Indeed if m is even, say m = 2k, k then Cl(2k) = End(C2 ) and up to isomorphism there is a unique irreducible Cl(2k)- module, ∆/ , and it has dimension 2k, Cl(2k) = End(∆)/ . k k If m is odd, say m = 2k + 1, then Cl(2k + 1) = End(C2 ) ⊕ End(C2 ) and up to isomorphism there are two irreducible Cl(2k + 1) modules ∆/ ±, Cl(2k + 1) = End(∆/ +) ⊗ End(∆/ −) each of dimension 2k distinguished by Γ = ±1. ∆/ ± Any finite dimensional complex representation of Cl(2k) must be a direct sum of copies of ∆/ , or equivalently of the form W = ∆/ ⊗ W 0 4For more details see, e.g., Roe, John Elliptic operators, topology and asymptotic methods. Second edition. Pitman Research Notes in Mathematics Series, 395. Longman, Harlow, 1998. 3.5. HERMITIAN VECTOR BUNDLES 63 for some auxiliary ‘twisting’ vector space W 0. We can recover W 0 from W via 0 W = HomCl(2k)(∆/ ,W ) and in the same fashion we have 0 End(W ) = Cl(2k) ⊗ End(W ) = Cl(2k) ⊗ EndCl(2k)(W ). One can similarly decompose representations of Cl(2k + 1) in terms ∆/ ±. Although Cl(2k) = End(∆)/ follows directly from the proposition, it can be use- ful to have a more concrete representative of ∆/ . So let us describe how to construct ∆/ m from (R , gRm ) or indeed any m-dimensional vector space V with a positive-definite quadratic form, q. Let e1, . . . , em be an oriented orthonormal basis of V and consider the subspaces of V ⊗ C, 1 K = span{ηj = √ (e2j−1 − ie2j): j ∈ {1, . . . , m}} 2 1 K = span{η = √ (e2j−1 + ie2j): j ∈ {1, . . . , m}}. j 2 We can use the fact that V ⊗ C = K ⊕ K to define a natural homomorphism ∗ cl : V ⊗ C −→ End(Λ K) √ √ cl (k) = 2 (ek), cl k = 2 (ik) for k ∈ K. This satisfies the Clifford relations and so extends to a homomorphism of Cl(V, q) into End(Λ∗K). Comparing dimensions, we see this is in fact an isomorphism so we can take ∆/ = Λ∗K. We refer to this representation as the spin representation. Below, we will use the consequence that the metric q on V induces a Hermitian metric on ∆/ compatible with Clifford multiplication. 3.5 Hermitian vector bundles Recall that on a complex vector space V a Hermitian inner product is a map h·, ·i : V × V −→ C that is linear in the first entry and satisfies hv, wi = hw, vi, for all v, w ∈ V. It is positive definite if hv, vi ≥ 0, and moreover hv, vi = 0 implies v = 0. 64 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS A Hermitian metric hE on a complex vector bundle E −→ M is a smooth Hermitian inner product on the fibers of E. Compatible connections are defined in the usual way, namely, a connection ∇E on E is compatible with the Hermitian metric hE if E E ∞ dhE(σ, τ) = hE(∇ σ, τ) + hE(σ, ∇ τ), for all σ, τ ∈ C (M; E). One natural source of complex bundles is the complexification of real bundles. If F −→ M is a real bundle of rank k then F ⊗ C is a complex bundle of rank k. We can think of F ⊗ C as F ⊕ F together with a bundle map J : F ⊕ F / F ⊕ F (u, v) / (−v, u) satisfying J 2 = − Id . (Such a map is called an almost complex structure.) A bundle metric gF on F −→ M induces a Hermitian metric hF on F ⊗ C by hF ((u1, v1), (u2, v2)) = gF (u1, u2) + gF (v1, v2) + i (gf (u2, v1) − gF (u1, v2)) F thus the ‘real part’ of hF is gF . Similarly a connection on F, ∇ , induces a connection F on F ⊗ C, namely the induced connection on F ⊕ F. If ∇ is compatible with gF , then the induced connection on F ⊗ C is compatible with hF . A Riemannian metric on a manifold induces a natural Hermitian inner product on C∞(M; C) or more generally on sections of a complex vector bundle E endowed with a Hermitian bundle metric, hE. In the former case, Z ∞ hf, hi = f(p)h(p) dvolg for all f, h ∈ C (M; C) M and in the latter, Z ∞ hσ, τiE = hE(σ, τ) dvolg for all σ, τ ∈ C (M; E) M (though the integrals need not converge if M is not compact). A complex vector bundle E over a Riemannian manifold (M, g) is a complex Clifford module if there is a homomorphism, known as Clifford multiplication, ∗ cl : Cl(T M, g) −→ End(E). Clifford multiplication is compatible with the Hermitian metric hE if ∞ ∗ ∞ hE(cl (θ) σ, τ) = −hE(σ, cl (θ) τ), for all θ ∈ C (M; T M), σ, τ ∈ C (M; E). 3.5. HERMITIAN VECTOR BUNDLES 65 Clifford multiplication is compatible with a connection ∇E if [∇E, cl (θ)] = cl (∇θ) , where on the right hand side we are using the Levi-Civita connection. A connection compatible with Clifford multiplication is known as a Clifford connection. Let us define a Dirac bundle to be a complex Clifford module, E −→ M, with E a Hermitian metric hE and a metric connection ∇ , both of which are compatible with Clifford multiplication. Any Dirac bundle has a generalized Dirac operator 1 ðE ∈ Diff (M; E) given by E ∞ ∇ ∞ ∗ icl ∞ ðE : C (M; E) −−−→C (M; T M ⊗ E) −−−→C (M; E). Note that this is indeed a Dirac-type operator, since E [ðE, f] = [icl ◦ ∇ , f] = icl (df) shows that 2 2 σ1(ðE)(p, ξ) = cl (ξ) , σ2(ðE)(p, ξ) = |ξ|g. It is also a self-adjoint operator thanks to the compatibility of the metric with the connection and Clifford multiplication. 2 We know that ðE is a Dirac-type operator and hence can be written as a connection Laplacian up to a zero-th order term. There is an explicit formula for this term which is often very useful. First given two vector fields V,W ∈ C∞(M; TM), the curvature R(V,W ) can be thought of as a two-form Re(V,W ) ∈ Ω2(M), Re(V,W )(X,Y ) = g(R(V,W )Y,X) for all X,Y ∈ C∞(M; TM) and so we have an action of R(V,W ) on E by cl Re(V,W ) . Next note that for any complex Clifford module, E, (for simplicity over an even dimensional manifold) and a contractible open set U ⊆ M we have ∼ / ∗ / E U = ∆ ⊗ HomCl(T M)(∆,E U ) where ∆/ denotes the trivial bundle ∆/ × U −→ U. Correspondingly we have ∗ End(E) = Cl(T M) ⊗ EndCl(T ∗M)(E) locally. However both sides of this equality define bundles over M and the natural map from the right hand side to the left hand side is locally an isomorphism so these are in fact globally isomorphic bundles. 66 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS Proposition 9. The curvature RE ∈ Ω2(M; End(E)) of a Clifford connection ∇E on ∗ E −→ M decomposes under the isomorphism End(E) = Cl(T M) ⊗ EndCl(T ∗M)(E) into a sum E 1 E R (V,W ) = 4 cl Re(V,W ) + Rb (V,W ), ∗ E cl Re(V,W ) ∈ Cl(T M), Rb (V,W ) ∈ EndCl(T ∗M)(E) for any V,W ∈ C∞(M; TM). E 2 We call Rb ∈ Ω (M; EndCl(T ∗M)(E)) the twisting curvature of the Clifford connection ∇E. Proof. From the compatibility of the connection with Clifford multiplication, and the definition of the curvature in terms of the connection, it is easy to see that for any V,W ∈ C∞(M; TM), ω ∈ C∞(M; T ∗M), and s ∈ C∞(M; E), RE(V,W )cl (ω) s = cl (ω) RE(V,W )s + cl (R(V,W )ω) s where R(V,W )ω denotes the action of R(V,W ) as a map from one-forms to one- forms. We can write the last term as a commutator 1 1 cl (R(V,W )ω) s = 4 cl Re(V,W ) cl (ω) s − 4 cl (ω) cl Re(V,W ) s i ∗ since we have, in any local coordinates θ for T M with dual coordinates ∂i, cl Re(V,W ) cl (θp) − cl (θp) cl Re(V,W ) i j p p i j = g(R(V,W )∂i, ∂j)cl θ · θ · θ − g(R(V,W )∂i, ∂j)cl θ · θ · θ j i = −2g(R(V,W )∂p, ∂j)cl θ + 2g(R(V,W )∂i, ∂p)cl θ i = 4g(R(V,W )∂i, ∂p)cl θ . So we conclude that E 1 R − 4 cl (R) commutes with Clifford multiplication by one-forms, and hence it commutes with all Clifford multiplication. 2 ∇ With this preliminary out of the way we can now compare ðE and ∆ . The proof is through a direct computation in local coordinates5. 5See Theorem 3.52 of Nicole Berline, Ezra Getzler, Michele` Vergne Heat kernels and Dirac operators. Corrected reprint of the 1992 original. Grundlehren Text Edi- tions. Springer-Verlag, Berlin, 2004. x+363 pp. ISBN: 3-540-20062-2 3.5. HERMITIAN VECTOR BUNDLES 67 Theorem 10 (Lichnerowicz formula). Let E −→ M be a Dirac bundle, ∆∇ the E E0 2 Bochner Laplacian of the Clifford connection ∇ ,R ∈ Ω (M; EndCl(T ∗M)(E)) the E twisting curvature of ∇ and scalM the scalar curvature of M, then 2 ∇ E 1 ðE = ∆ + cl Rb + 4 scalM i where we define (in local coordinates ∂i for TM and dual coordinates θ ) E0 X E i j cl Rb = Rb (∂i, ∂j)cl θ · θ . i A Z2-graded Dirac bundle is a Dirac bundle E with a decomposition E = E+ ⊕ E− that is orthogonal with respect to the metric, parallel with respect to the connection, and odd with respect to the Clifford multiplication – by which we mean that cl (v) interchanges E+ and E−, for all v ∈ T ∗M. We can equivalently express this in terms of an involution on E :A Z2 grading on a Dirac bundle is γ ∈ C∞(M; End(E)) satisfying γ2 = Id, γcl (v) = −cl (v) γ, ∇Eγ = 0, γ∗ = γ. We make E into a Z2-graded Dirac-bundle by defining E± = {v ∈ E : γ(v) = ±v}. If M is even dimensional, then any complex Clifford module over M has a Z2-grading given by Clifford multiplication by the chirality operator γ = cl (Γ) but there are often other Z2-gradings as well. (Note that Γ is a well-defined section of Cl(T ∗M) by naturality.) In the presence of a Z2 grading, the generalized Dirac operator is a graded (or odd) operator, − 0 ðE ± 1 ± ∓ + ∗ − ðE = + , with ðE ∈ Diff (M; E ,E ), (ðE) = ðE. ðE 0 68 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS + The index of the Dirac-type operator ðE is the difference + + − ind ðE = dim ker ðE − dim ker ðE. and often this is an interesting geometric, or even topological, quantity. Generalized Dirac operators show up regularly in geometry, we will describe several natural examples in the next section. A simple consequence of the Lichnerowicz formula is that E 1 + cl Rb + 4 scalM > 0 =⇒ ker ðE = {0} =⇒ ind ðE = 0. + E 1 So the vanishing of the index of ðE is an obstruction to having cl Rb + 4 scalM > 0. This is most interesting when the former is a topological invariant. 3.6 Examples 3.6.1 The Gauss-Bonnet operator Our usual example is the de Rham operator ∗ ∗ ðdR = d + δ :Ω (M) −→ Ω (M), although this is a real generalized Dirac operator, so our conventions will have to be slightly different from those above. As we mentioned above, in this case the Clifford module is the bundle of forms of all degrees, Λ∗T ∗M −→ M. Clifford multiplication by a 1-form ω ∈ C∞(M; T ∗M) is given by cl (ω) = (eω − iω] ), this is compatible with both the Riemannian metric on Λ∗T ∗M and the Levi-Civita connection, and we have cl ◦ ∇ = d + δ. Thus Λ∗T ∗M is a (real) Dirac bundle with generalized Dirac operator the de Rham operator. This Dirac bundle is Z2-graded. Let us define M M ΛevenT ∗M = ΛkT ∗M, ΛoddT ∗M = ΛkT ∗M k even kodd 3.6. EXAMPLES 69 and correspondingly Ωeven(M) and Ωodd(M). Notice that Clifford multiplication ex- changes these subbundles and that the covariant derivative with respect to a vector field preserves these subbundles, so even even odd odd odd even ðdR :Ω (M) −→ Ω (M), ðdR :Ω (M) −→ Ω (M). even The operator ðdR is the Gauss-Bonnet operator. A form in the null space of d + δ is necessarily in the null space of the Hodge Laplacian ∆ = (d + δ)2 and so is a harmonic form. Conversely any compactly supported harmonic form is in the null space of d + δ. So on compact manifolds, ∗ even even odd ker(d + δ) = H (M), ind ðdR = dim H (M) − dim H (M). The latter is the Euler characteristic of the Hodge cohomology of M, χHodge(M), an important topological invariant of M. Notice that on odd-dimensional compact manifolds, the Hodge star is an isomorphism between Heven(M) and Hodd(M), so the Euler characteristic of the Hodge cohomology vanishes. The Lichnerowicz formula in this case is known as the Weitzenb¨ock formula, and simplifies to ∗ X k i j ∆k = ∇ ∇ − g(R(ei, ej)ek, e`)eθ ie` eθ iθ ijk` k where ek is a local orthonormal frame with dual coframe θ . In particular on functions we get ∗ ∆0 = ∇ ∇ and on one-forms we get Bochner’s formula ∗ ∆1 = ∇ ∇ + Ricf where Ricf is the endomorphism of T ∗M associated to the Ricci curvature via the metric. In particular, if the Ricci curvature is bounded below by a positive constant then ∆1 does not have null space, i.e., Ric > 0 =⇒ H1(M) = 0. 70 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS 3.6.2 The signature operator Now consider the complexification of the differential forms on M, ∗ ∗ ∗ ∗ ΛCT M = Λ T M ⊗ C and the corresponding space of sections, Ω∗ (M). Clifford multiplication is as before, C but now we can allow complex coefficients, 1 cl (ω) = i (eω − iω] ), this is compatible with both the Riemannian metric on Λ∗ T ∗M (extended to a C Hermitian metric) and the Levi-Civita connection, and we have icl ◦ ∇ = d + δ. Since Λ∗ T ∗M is a complex Clifford module, we have an involution given by C Clifford multiplication by the chirality operator Γ. One can check that, up to a factor of i, Γ coincides with the Hodge star, 1 k(k−1)+ dim M (Γ) : Ωk (M) −→ Ωm−k(M), (Γ) = i 2 ∗ . cl C C cl If M is even dimensional, this induces a grading of Λ∗ T ∗M as a Dirac bundle, Z2 C ∗ ∗ ∗ ∗ ∗ ∗ ΛCT M = Λ+T M ⊕ Λ−T M. The de Rham operator splits as − 0 ðsign d + δ = + ðsign 0 + and we refer to ðsign as the Hirzebruch signature operator. The index of this operator + + − ind ðsign = dim ker ðsign − dim ker ðsign is again an important topological invariant of M. It is always zero if dim M is not a multiple of four, and otherwise it is equal to the signature of the quadratic form 1 dim M H 2 (M) / C R u / M u ∧ u It is called the signature of M, sign(M). 3.6. EXAMPLES 71 3.6.3 The Dirac operator If E −→ M is a complex Clifford module over manifold M of even dimension, say m = 2k, then Clifford multiplication ∗ cl : Cl(T M, g) −→ End(E) is, for each p ∈ M, a finite dimensional complex representation of Cl(2k) on Ep. We ∗ can decompose Ep in terms of the unique irreducible representation of Cl(Tp M), ∆/ , Ep = ∆/ ⊗ HomCl(2k)(∆/ ,Ep). If it happens that Ep = ∆/ as Clifford modules for all p ∈ M we say that E is a spinC bundle over M. If a spinC bundle satisfies the symmetry condition HomCl(T ∗M)(E,E) is trivial we say that E is a spin bundle over M. We will generally denote a spin bundle over M by S/ −→ M. Remark 5. In the interest of time we have avoided talking about principal bundles and structure groups and hence also spin structures and spinC structures. In brief, k the structure group of a R -vector bundle is GLk(R) because the transition maps between trivializations are valued in GLk(R). If we choose a metric on the bundle then we can restrict attention to trivializations where the transition maps are valued in O(n), and if we choose an orientation we can use oriented frames and reduce the structure group to SO(n). It turns out that a double cover of SO(n) is a Lie group called the spin group Spin(n). (We can identify Spin(n) with the multiplicative (even) n group of invertible elements in Cl (R , gRn ).) If the structure group of the tangent bundle of a manifold M can be ‘lifted’ to Spin(n) we say that M admits a spin structure, and we can combine a spin structure with the spin representation to C k obtain a spin bundle. The group Spin (n) is the subgroup of Cl(R , gRk ) generated by Spin(n) together with the unit complex numbers. If the structure group of the tangent bundle of a manifold M can be ‘lifted’ to SpinC(n) we say that M admits a spinC structure, and we can combine a spinC structure with the spin representation to obtain a spinC bundle. For details we refer to any of the books in the bibliography of this lecture. There are topological obstructions to the existence of a spin bundle over a manifold M and the number of spin bundles when they do exist is also determined topologically6. For example, a genus g Riemann surface admits 22g distinct spin 6For a thorough discussion see Lawson, H. Blaine, Jr. and Michelsohn, Marie-Louise Spin geometry. Princeton Mathematical Series, 38. Princeton University Press, Princeton, NJ, 1989. 72 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS bundles. All spheres in dimension greater than two admit spin bundles. All compact orientable manifolds of dimension three or less admit spin bundles. The complex projective space CPk admits a spin bundle if and only if k is odd. Additionally, any (almost) complex manifold has a spinC bundle. The Riemannian metric and Levi-Civita connection induce the structure of a Dirac bundle on any spin bundle. Indeed, we saw above that the metric on a 2k-dimensional vector space V with positive quadratic form induces a metric on the corresponding ∆/ so it suffices to describe the induced compatible connection. In a local coordinate chart U that trivializes T ∗M, if we choose an orthonormal frame 1 m θ , . . . , θ for TM U , the Levi-Civita connection is determined by an m × m matrix of one-forms ω defined through i j ∇θ = ωijθ . (The fact that the connection respects the metric is reflected in ωij = −ωji.) We can / use ω to define a one-form valued in End(S) U , namely 1 X i j ω = ωijcl θ · θ e 2 i ∞ ∞ ð : C (M; S/) −→ C (M; S/). As before Clifford multiplication by the chirality operator Γ defines an involution exhibiting a Z2-grading on S,/ S/ = S/+ ⊕ S/−, and a corresponding splitting of the Dirac operator − 0 ð ð = + . ð 0 The Lichnerowicz formula for the Dirac operator is particularly interesting since there is no twisting curvature, so we get 1 2 = ∆∇ + scal . ð 4 M 3.6. EXAMPLES 73 Thus if the metric has positive scalar curvature then ð2 is strictly positive and in particular does not have null space. We will show that the index of ð+ is a topological invariant. Thus the vanishing of ind ð+ is a topological necessary condition for the existence of a metric of positive scalar curvature7 on M. 3.6.4 Twisted Clifford modules If E −→ M is a Dirac bundle and F −→ M is any Hermitian bundle with a unitary connection, then E ⊗F −→ M is again a Dirac bundle if we take the tensor product metric and connection and extend Clifford multiplication to E ⊗ F by acting on the first factor. We say that this is the Dirac bundle E twisted by F. If M admits a spin bundle S/ −→ M, then any Dirac bundle E −→ M is a twist of S./ Indeed, we have E = S/ ⊗ HomCl(T ∗M)(S,E/ ). In the case of a twisted spin bundle S/ ⊗ F −→ M, the ‘twisting curvature’ in the Lichnerowicz formula is precisely the curvature of the connection on F. The index of a twist of the signature operator is known as a twisted signa- ture. Twisted Dirac and twisted signature operators play an important rˆolein the heat equation proof of the index theorem. For example if M admits a spin bundle S/ −→ M then the Clifford module Λ∗T ∗M ⊗ C must be a twist of S,/ and indeed ∗ ∗ ∼ ∗ ∼ / /∗ / ΛCT M = Cl(T M) = End(S) = S ⊗ S. The generalized Dirac operator d + δ on Λ∗ T ∗M is the twisted Dirac operator ∗ C obtained by twisting S/ with S/ . The two different Z2 gradings correspond to whether we treat S/∗ as a -graded bundle (which yields the splitting into ΛevenT ∗M and Z2 C ΛoddT ∗M) or as an ungraded bundle (which yields the splitting into Λ∗ T ∗M and C + ∗ ∗ Λ−T M). 3.6.5 The ∂ operator A real manifold M of dimension m = 2k is a complex manifold if it has a collection of local charts parametrized by open sets in Ck for which the transition charts are 7In contrast, any manifold of dimension at least three admits a metric of constant negative scalar curvature, see Aubin, Thierry M´etriquesriemanniennes et courbure. J. Differential Geometry 4, 1970, 383-424. The same is true for manifolds with boundary and for non-compact manifolds. For compact surfaces, the sign of the integral of the scalar curvature is topologically determined by the Gauss-Bonnet theorem. 74 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS holomorphic. This yields a decomposition of the complexified tangent bundle 1,0 0,1 (1,0) 0,1 TM ⊗ C = T M ⊕ T M, v ∈ Tp M ⇐⇒ v ∈ Tp M. By duality there is a similar decomposition of the cotangent bundle ∗ 1,0 01, T M ⊗ C = Λ M ⊕ Λ M. In local complex coordinates, z1, . . . , zm, which we can decompose into real and imaginary parts zj = xj + iyj. We can define dzj = dxj + idyj, dzj = dxj − idyj 1,0 0,1 and then Λ M is spanned by {dzj} and Λ M is spanned by {dzj}. More generally we set Λp,qM = Λp(Λ1,0M) ∧ Λq(Λ0,1M) and we have a natural isomorphism k ∗ M p,q Λ T M ⊗ C = Λ M. p+q=k We denote the space of sections of Λp,qM by Ωp,qM, and speak of forms of bidegree (p, q). The exterior derivative naturally maps d :Ωp,qM −→ Ωp+1,qM ⊕ Ωp,q+1M and we can decompose it into the Dolbeault operators d = ∂ + ∂ ∂ :Ωp,qM −→ Ωp+1,qM, ∂ :Ωp,qM −→ Ωp,q+1M. It follows from d2 = 0 that 2 ∂2 = ∂ = ∂∂ + ∂∂ = 0 and from the Leibnitz rule for d that ∂(ω ∧ η) = ∂ω ∧ η + (−1)|ω|ω ∧ ∂η ∂(ω ∧ η) = ∂ω ∧ η + (−1)|ω|ω ∧ ∂η. 3.6. EXAMPLES 75 In local coordinates, for ω ∈ Ωp,qM X J K ω = αJK dz ∧ dz =⇒ |J|=p,|K|=q X αJK X αJK ∂ω = dz` ∧ dzJ ∧ dzK , and ∂ω = dz` ∧ dzJ ∧ dzK . ∂z` ∂z` |J|=p,|K|=q,` |J|=p,|K|=q,` The Dolbeault complex is given by (3.8) 0 −→ Ω0,0M −−→∂ Ω0,1M −−→∂ ... −−→∂ Ω0,mM −→ 0 and its cohomology is called the Dolbeault cohomology of M. We define the Hodge- Dolbeault cohomology to be ∗ ∗ H0,q(M) = {ω ∈ Ω0,pM :(∂∂ + ∂ ∂)ω = 0}. As in the real case, we will show below that the Hodge-Dolbeault cohomology coin- cides with the Dolbeault cohomology. A complex Hermitian manifold M is a K¨ahler manifold if the splitting 1,0 0,1 TM ⊗ C = T M ⊕ T M is preserved by the Levi-Civita connection. On a K¨ahlermanifold, the bundle M Λ0,∗M = Λ0,kM k is a Dirac bundle with respect to the Hermitian metric on M, the Levi-Civita con- nection, and the Clifford multiplication √ cl (ω) = 2 (eω0,1 − i(ω1,0)] ). The corresponding generalized Dirac operator is √ ∗ 2 (∂ + ∂ ). 0,∗ There is a natural Z2 grading on Λ M given by Λ0,∗M = Λ0,evenM ⊕ Λ0,oddM. The index of the corresponding non-self-adjoint operator is the Euler characteristic of the Hodge-Dolbeault cohomology. 76 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS 3.7 Exercises Exercise 1. a) Show that the scalar linear differential operators on Rm form a (non-commutative) algebra. Verify that the commutator PQ − QP of a differential operator of order ` and a differential operator of order `0 has at most order ` + `0 − 1. b) Show that the linear differential operators with constant coefficients form a com- mutative subalgebra which is isomorphic to the polynomial algebra R[ξ1, . . . , ξm]. Exercise 2. Verify that for P ∈ Diffk(M; E,F ) and Q ∈ Diff`(M; F,G), the operator PQ is in k+` Diff (M; E,G) and its principal symbol satisfies σk+`(P ◦ Q) = σk(P ) ◦ σ`(Q). Exercise 3. Let E −→ M and F −→ M be bundles endowed with metrics, lying over the Riemannian manifold M. Verify that the formal adjoint of P ∈ Diffk(M; E,F ),P ∗, k ∗ ∗ is an element of Diff (M; F,E) and its principal symbol satisfies σk(P ) = σk(P ) . Exercise 4. Let M be a Riemannian manifold with boundary, and E −→ M, and F −→ M bundles endowed with bundle metrics. The formal adjoint of an operator P : C∞(M; E) −→ C∞(M; F ) is defined by requiring ∗ ∞ ◦ ∞ ◦ hP σ, τiF = hσ, P τiE, for all σ ∈ Cc (M ; E), τ ∈ Cc (M ; F ). Let ν be the inward-pointing unit normal vector field at the boundary. Define 1 dvol∂M by iν dvolM ∂M . Show that if P ∈ Diff (M; E,F ) is a first-order differential operator then Z ∗ [ hP σ, τiF − hσ, P τiE = gF (iσ1(P )(p, ν )σ, τ) dvol∂M , ∂M ∞ ∞ for all σ ∈ Cc (M; E), τ ∈ Cc (M; F ). Hint: Use a partition of unity to reduce to a coordinate chart, then compute in local coordinates. Exercise 5. Show that End(Cm) has no proper two-sided ideals. Hint: Suppose M0 belongs to such an ideal I and v0 6= 0 belongs to the range of m M0. Show that every v ∈ C belongs to the range of some M ∈ I, and hence that every one-dimensional projection belongs to I. Exercise 6. a) Suppose a compact oriented manifold M has non-negative Ricci curvature – i.e., that g(Ric(f ω), ω) ≥ 0 for all ω ∈ Ω1(M). Show that every harmonic one-form η sat- isfies ∇η = 0. As this is a first order differential equation for η, a harmonic one-form 3.7. EXERCISES 77 is determined by its ‘initial condition’ at a point in M. Conclude that, if M is also connected, dim H1(M) ≤ dim M. b) Show that on any connected compact oriented Riemannian manifold dim H0(M) = 1 and so by Poincar´eduality dim Hm(M) = 1. c) From (b), we see that on a compact oriented Riemannian surface (i.e., two dimen- 1 sional manifold) χHodge(M) = 2−dim H (M). Show that if M is a compact oriented surface admitting a metric of non-negative Gaussian curvature then χHodge(M) ≥ 0. (It turns out that only the sphere and the torus have non-negative Euler character- istic.) Exercise 7. Let M and M 0 be compact oriented Riemannian manifolds. i)Show that k X dim Hk(M × M 0) = (dim Hj(M))(dim Hk−j(M 0)) j=0 when M×M 0 is endowed with the product metric (we will soon show that dim Hk(M) is independent of the choice of metric). ii) Conclude that 0 0 χHodge(M × M ) = χHodge(M)χHodge(M ). k ∗ 0 Lk j ∗ k−j ∗ 0 Hint: Use the fact that Λ T (M × M ) = j=0(Λ T M) ∧ (Λ T M ), show that with respect to this decomposition the Hodge Laplacian for the product metric satisfies k X ∆k,M×M 0 = (∆j,M ⊗ Id + Id ⊗∆k−j,M 0 ), j=0 and conclude that if j j j j j j 0 {α , . . . , α } is a basis of H (M) and {β , . . . , β 0 } is a basis of H (M ) 1 bj (M) 1 bj (M ) j k−j k 0 then {αs ⊗ βt : 0 ≤ j ≤ k} is a basis of H (M × M ). Exercise 8. Let M and M 0 be compact oriented Riemannian manifolds. Show that signature is multiplicative, sign(M × M 0) = sign(M)sign(M 0), where the signature on M × M 0 is defined using the product metric (we will soon show that the signature is independent of the choice of metric). Hint: Express dM×M 0 and δM×M 0 in terms of dM , δM , dM 0 , and δM 0 . Choose bases of ∗ ∗ 0 ∗ 0 Λ±T M and Λ±T M and use them to construct bases of Λ±T (M × M ). 78 LECTURE 3. LAPLACE-TYPE AND DIRAC-TYPE OPERATORS Exercise 9. We can interpret any vector field V on M as a first order differential operator. What is its principal symbol? Exercise 10. Let E −→ M and F −→ M be two vector bundles over M. Show that, for any k k ∈ N, the assignment U 7→ Diff (M; E U ,F U ) is a sheaf on M. 3.8 Bibliography The books Spin geometry by Lawson and Michelson, and Heat kernels and Dirac operators by Berline, Getzler, and Vergne treat Dirac operators in detail. Other good sources are Elliptic operators, topology and asymptotic methods by Roe which gives a very nice short treatment of many of the topics of this course, and volume two of Partial differential equations by Taylor. There is also a very nice treatment of these topics in Melrose, The Atiyah- Patodi-Singer index theorem, where they are systematically extended to non-compact manifolds with asymptotically cylindrical ends. Lecture 4 m Pseudodifferential operators on R 4.1 Distributions It is an unfortunate fact that most functions are not differentiable. Happily, there ∞ 1 ∞ is a way around this through duality. For instance, if f ∈ C (R) and φ ∈ Cc (R) then Z Z f 0(t)φ(t) dt = − f(t)φ0(t) dt R R and the right hand side makes sense even if f is not differentiable. So for a possibly non-differentiable f we will say that a function h is a weak derivative of f if Z Z 0 ∞ f(t)φ (t) dt = − h(t)φ(t) dt, for all φ ∈ Cc (R). R R Obviously if f is differentiable then f 0 is its weak derivative. Generally the taking of weak derivatives will force us to expand our notion of ‘function’. ∞ For example, for f(t) = |t| we have, for any φ ∈ Cc (R), Z Z 0 Z ∞ f(t)φ0(t) dt = − tφ0(t) dt + tφ0(t) dt R −∞ 0 Z 0 Z ∞ Z 0 ∞ = −[tφ(t)]−∞ + φ(t) + [tφ(t)]0 − φ(t) dt = − sign(t)φ(t) dt −∞ 0 R where sign(t) is the function equal to −1 for negative t and equal to 1 for positive t. Notice that f is continuous but its weak derivative is not. If we want to take a 1As in the previous chapter, we will work with complex valued functions and complex vector bundles. Thus for instance the symbol C∞(M) will now mean C∞(M; C). 79 80 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM second weak derivative we compute Z Z ∞ Z ∞ 0 0 0 0 ∞ sign(t)φ (t) dt = − φ (t) + φ (t) dt = −[φ(t)]∞ + [φ(t)]0 = −2φ(0). R 0 0 This shows that the weak derivative of of sign(t) is a multiple of the Dirac delta ‘function’, δ0, defined by Z ∞ φ(t)δ0(t) dt = φ(0), for all φ ∈ Cc (R). R Of course there is no function satisfying the definition of the delta function. Instead we should think of δ0 as a linear functional ∞ δ0 : Cc (R) −→ C, δ0(φ) = φ(0). This is the motivating example of a distribution. A distribution on Rm is a continuous linear functional on smooth functions of compact support, ∞ m T : Cc (R ) −→ C. Continuity here means that for every compact set K ⊆ Rm there are constants C and ` satisfying X α ∞ |T (φ)| ≤ C sup |∂ φ| for all φ ∈ Cc (K). K |α|≤` Similarly, let (M, g) be an oriented Riemannian2 manifold. A distribution on M is a continuous linear functional on smooth functions of compact support, ∞ T : Cc (M) −→ C, such that for every compact set K ⊆ M there are constants C and ` satisfying X α ∞ |T (φ)| ≤ C sup |∇ φ|g for all φ ∈ Cc (K). K |α|≤` We will denote3 the set of distributions by C−∞(M). If the same integer ` can be used for all compact sets K, we say that T is of order `. (A distribution of order ` 2We do not need to have a Riemannian metric or an orientation to define the space of distribu- tions. It does make things simpler, though, since we can use the Riemannian volume form to carry out integrations (really to trivialize the ‘density bundle’) and the gradient to measure regularity. 3In the literature one often finds the set of distributions denoted D0(M). This is because Laurent Schwartz, who first defined distributions, denoted the smooth functions of compact support by D(M). 4.1. DISTRIBUTIONS 81 ` has a unique extension to a continuous linear functional on Cc(M), but we shall not use this.) We can restate the continuity condition in terms of sequences: A linear func- ∞ ∞ tional T on C (M) is a distribution if and only if whenever φj ∈ Cc (M) is a sequence supported in a fixed compact subset of M that is converging uniformly to zero together with all of its derivatives, we have T (φj) −→ 0. The Hermitian pairing that the metric induces on C∞(M; C) yields an inclusion C∞(M) / C−∞(M) h / Th Z ∞ Th(φ) = hφ, hi = φ(p)h(p) dvol for all φ ∈ Cc (M). M indeed, any smooth function h ∈ C∞(M) naturally induces a distribution (of order zero) T h Z ∞ Th(φ) = hφ, hi = φ(p)h(p) dvol for all φ ∈ Cc (M). M More generally, we get a distribution (of order zero) Th, from any function h for which the pairing hφ, hi can be guaranteed to be finite, 1 ∞ Lloc(M) = {h measurable : hφ, hi < ∞ for all φ ∈ Cc (M)}. Similarly, any measure on M, µ, defines a distribution Tµ of order zero Z ∞ Tµ(φ) = φ dµ for all φ ∈ Cc (M). M The converse is true as well4, any distribution of order zero is a measure on M. An example of a distribution that is not a measure is 0 ∞ T (φ) = φ (0) for all φ ∈ Cc (M). As a very convenient abuse of notation, the pairing between a smooth function of compact support f and a distribution T is denoted ∞ −∞ h·, ·i : Cc (M) × C (M) −→ C hφ, T i = T (φ) and then in another abuse we identify Th and h. 4See Theorem 2.1.6 of Lars Hormander,¨ The analysis of linear partial differential operators. I. Distribution theory and Fourier analysis. Classics in Mathematics. Springer-Verlag, Berlin, 2003. 82 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM Now we can proceed to extend many concepts and operations of C∞(M) to C−∞(M) by duality using the pairing h·, ·i. For instance, we can multiply a distribu- tion T by a smooth function f ∈ C∞(M) by defining ∞ hφ, fT i = hφf, T i, for all φ ∈ Cc (M). We say that a distribution T vanishes on an open set U if ∞ hφ, T i = 0, for all φ ∈ Cc (U). The support of a distribution supp(T ) is the set of points in M having no neighborhood on which T vanishes. Equivalently, a point ζ is not in the support of T if there is a neighborhood of ζ on which T vanishes. The distributions of 5 −∞ compact support will be denoted Cc (M). A distribution of compact support can be evaluated not just on functions of compact support but on all of C∞(M). In fact, distributions of compact support form the continuous dual of the space C∞(M). Although it makes no sense to talk about the value of a distribution at a point, we can restrict a distribution to an open set U by restricting its domain to functions with compact support in U. We say that two distributions S and T coincide on an open set U if ∞ hφ, Si = hφ, T i for all φ ∈ Cc (U). If on each set of an open cover of M we have a distribution, and these distributions coincide on overlaps, then we can use a partition of unity to construct a distribution that restricts to the given ones. It is also true that two distributions coincide if and only if they coincide on open sets covering M. A closely related concept to the support of a distribution is that of the singu- lar support of a distribution sing supp(T ), defined by specifying its complement: a point ζ ∈ M is not in the singular support of T if there is a smooth function f that coincides with T in a neighborhood of ζ. We can apply a linear operator F to T by using its formal adjoint F ∗, ∗ ∞ hφ, F (T )i = hF (φ),T i, for all φ ∈ Cc (M). In particular, we can apply a differential operator to a distribution, e.g., ∞ hφ, ∆T i = h∆φ, T i, for all φ ∈ Cc (M). The weak derivatives we discussed above are computed in exactly this way. Thus a general function in, say, L1(M) is not differentiable as a function, but is infinitely differentiable as a distribution! 5Schwartz’ notation was E0(M), and one often finds this in the literature. 4.1. DISTRIBUTIONS 83 If E −→ M is a vector bundle over M we define distributional sections of E ∞ ∗ to be continuous linear functionals on Cc (M; E ). As above, there is a natural way of thinking of a smooth section τ of E as a distributional section Tτ of E, namely Z ∞ ∗ Tτ (ω) = ω(τ) dvolM , for all ω ∈ Cc (M; E ). M ∗ If we assume that E is endowed with a metric, gE, we can identify E with E and ∞ identify distributional sections of E with continuous linear functionals on Cc (M; E). We can then extend our abuse of notation and denote the evaluation of a distribution T on a compactly supported section σ by hσ, T iE = T (σ) and also denote Tτ merely by τ. All of the operations on distributions mentioned above extend to distributional sections. In particular we can talk about the support of a distributional section and −∞ then the distributional sections of compact support, Cc (M; E). These distributions have a simple description −∞ −∞ ∞ Cc (M; E) = Cc (M) ⊗ C (M; E). C∞(M) Indeed, every simple tensor −∞ ∞ T ⊗ σ ∈ Cc (M) ⊗ C (M; E) C∞(M) defines an element in the dual to C∞(M; E∗) by (T ⊗ σ)(ω) = T (ω(σ)), for all ω ∈ C∞(M; E∗), −∞ ∞ −∞ and so we have an inclusion Cc (M) ⊗C∞(M) C (M; E) −→ Cc (M; E). To see −∞ that this map is surjective, start with S ∈ Cc (M; E) and note that S defines, given any section ω ∈ C∞(M; E∗), a compactly supported distribution on M via ∞ C (M) 3 φ 7→ S(φω) ∈ C. Since the support of S is compact, we can cover it with finitely many sets {Ui} that ∗ ∗ trivialize E , choose {si1, . . . , sik} trivializing sections of E supported in Ui, and then write X X S(ω) = S( ωijsij) = S(sij)(ω(sij)) i,j i,j ! X ∞ ∗ = S(sij) ⊗ sij (ω), for all ω ∈ C (M; E ). i,j 84 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM 4.2 Examples 4.2.1 Fundamental solutions k Let P be a constant coefficient differential operator on Rm,P ∈ Diff (Rm), X α P = aα(x)D . |α|≤k We say that a distribution Q ∈ C−∞(Rm) is a fundamental solution of the differential operator P if PQ = δ0. Two fundamental solutions will differ by an element of the null space of P. 2 We have already seen that, in R, ∂x|x| = −2δ0, hence 1 2 − 2 |x| is a fundamental solution of ∆ = −∂x. On Rm, the fundamental solution of the Laplacian is given by ( (2π)−1 log |x| if m = 2 Q = m − 1 |x|2−m if m > 2 (m−2) Vol(Sm−1) ∞ m Indeed, note that on |x| > 0, ∆Qm = 0, so we can write, for any φ ∈ Cc (R ), Z hφ, ∆Qmi = h∆φ, Qmi = lim Qm∆φ dx ε→0 |x|>ε Z Z = lim Qm∆φ − φ∆Qm dx = − lim Qm∂rφ − φ∂rQm dS ε→0 |x|>ε ε→0 |x|=ε Z −1 log ε∂rφ − φε − lim dS if m = 2 ε→0 2π = |x|=ε Z ε2−m∂ φ − φ(2 − m)ε1−m − lim r dS if m > 2 m−1 ε→0 |x|=ε (2 − m) Vol(S ) = φ(0) where in the fourth equality we have used Green’s formula. If we identify R2 with C by identifying (x, y) with z = x+iy then the operator 1 ∂ = 2 (∂x + i∂y) on complex valued functions has as null space the space of holomorphic functions. 1 A similar application of Green’s theorem allows us to identify πz as a fundamental solution for ∂. 4.2. EXAMPLES 85 4.2.2 Principal values and finite parts Let us see a few more examples of distributions. The function x 7→ xα is locally integrable on R if α > −1. For α = −1 we define the (Cauchy) principal value of 1 1 −∞ x , PV( x ) ∈ C (R) by Z Z ∞ 1 φ(x) φ(x) − φ(−x) hφ, PV( x )i = lim dx = dx ε→0 |x|>ε x 0 x Z ∞ Z 1 0 ∞ = φ (tx) dt dx, for all φ ∈ Cc (R). 0 −1 This is indeed an element of C−∞(R), since whenever supp φ ⊆ [−C,C] we have Z ∞ Z 1 0 0 φ (tx) dt dx ≤ 2C sup |φ |, 0 −1 1 so PV( x ) is a distribution of order at most one (in fact of order one). Notice that 1 ∞ x PV( x ) = 1 since, for any φ ∈ Cc (R), Z Z 1 1 φ(x)x hφ, x PV( x )i = hxφ, PV( x )i = lim dx = φ(x) dx = hφ, 1i. ε→0 |x|>ε x It is then easy to see that −∞ 1 T ∈ C (R), xT = 1 =⇒ T = PV( x ) + Cδ0,C ∈ C. 1 1 so in this sense PV( x ) is a good replacement for x as a distribution. What about higher powers of x−1? Simply taking the principal value will not work for x−2 since we have Z φ(x) Z φ(x) + φ(−x) 2φ(0) 2 dx = 2 dx = + ( bounded ) |x|>ε x x>ε x ε −2 1 so instead we define the (Hadamard) finite part of x , FP( x2 ) by Z Z ∞ 1 φ(x) − φ(0) φ(x) − 2φ(0) + φ(−x) hφ, FP( x2 )i = lim 2 dx = 2 dx ε→0 |x|>ε x 0 x Z ∞ Z 1 Z 1 00 ∞ = s φ (stx) dt ds dx, for all φ ∈ Cc (R). 0 0 −1 As before, this is clearly a distribution of order at most two and we have −∞ 2 1 T ∈ C (R), x T = 1 =⇒ T = FP( x2 ) + C0δ0 + C1∂xδ0,C0,C1 ∈ C. 86 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM 1 We can continue this and define FP( xn ) for any n ∈ N, n ≥ 2, by Z Pn−2 1 (k) k 1 φ − k=0 k! φ (0)x hφ, FP( xn )i = lim n dx. ε→0 |x|>ε x This defines a distribution of order at most n and moreover n−1 −∞ n 1 X (k) T ∈ C (R), x T = 1 =⇒ T = FP( xn ) + Ci∂x δ0, {Ci} ⊆ C. k=0 One can similarly define finite parts of non-integer powers of x and also define finite parts in higher dimensions. 4.3 Tempered distributions On Rm, the class of Schwartz functions S(Rm) sits in-between the smooth functions and the smooth functions with compact support ∞ m m ∞ m Cc (R ) ⊆ S(R ) ⊆ C (R ) m n ∞ m α β o S(R ) = f ∈ C (R ): pα,β(f) = sup |x ∂ f| < ∞, for all α, β Rm Correspondingly the dual spaces satisfy −∞ m 0 m −∞ m Cc (R ) ⊆ S (R ) ⊆ C (R ) 0 m −∞ m S (R ) = T ∈ C (R ): pα,β(f) → 0 for all α, β =⇒ T (fj) → 0 . The dual space of S, S0(Rm), is known as the space of tempered distributions. Notice that these inclusions are strict. Any Lp function will be in S0(Rm) but −∞ m will only be in Cc (R ) if it has compact support. On the other hand, the function 2 e|x| defines a distribution in C−∞(Rm) but not a tempered distribution. 4.3.1 Convolution The convolution of two functions in S(Rm) is Z Z (f ∗ h)(x) = f(x − y)h(y) dy = h(x − y)f(y) dy = (h ∗ f) Rm Rm and defines a bilinear map S(Rm) × S(Rm) −→ S(Rm). The commutativity is par- ticularly clear from the expression ZZ hφ, f ∗ hi = f(x)h(y)φ(x + y) dx dy, 4.3. TEMPERED DISTRIBUTIONS 87 which we can also write as Z Z Z Z hφ, f ∗ hi = h(y) f(x)φ(x + y) dxdy = h(y) f(z − y)φ(z) dzdy = hfˇ∗ φ, hi, where fˇ(x) = f(−x). This allows us to use duality to define convolution of a tempered distribution with a Schwartz function m 0 m 0 m ∗ : S(R ) × S (R ) −→ S (R ) m hφ, f ∗ Si = hfˇ∗ φ, Si for all f, φ ∈ S(R ) and similarly ∞ m −∞ m −∞ m ∗ : Cc (R ) × C (R ) −→ C (R ) ˇ ∞ m hφ, f ∗ Si = hf ∗ φ, Si for all f, φ ∈ Cc (R ). For instance, notice that we have Z ˇ hφ, f ∗ δ0i = hf ∗ φ, δ0i = f(x)φ(x) dx = hφ, fi Rm m and so f ∗ δ0 = f for all f ∈ S(R ). If P is any constant coefficient differential operator, and f, h ∈ S(Rm) then P (f ∗ h) = P (f) ∗ h = f ∗ P (h) and so by duality we also have m 0 m P (f ∗ S) = P (f) ∗ S = f ∗ P (S), for all f ∈ S(R ),S ∈ S (R ). Putting these last two observations together, suppose that S ∈ S0(Rm) is a fundamental solution of the constant coefficient differential operator P, so that P (S) = δ0, then we have m P (f ∗ S) = f ∗ P (S) = f ∗ δ0 = f, for all f ∈ S(R ). Thus a fundamental solution for P is tantamount to a right inverse for P, even though P is generally not invertible. In particular, with Qm as above and m > 2 for simplicity, 0 m m ∞ m u ∈ S (R ), f ∈ S(R ), and ∆u = f =⇒ u = f ∗ Qm ∈ C (R ) Z 1 Z f(y) (4.1) u(x) = Qm(x − y)f(y) dy = m−1 m−2 dy Rm (2 − m) Vol(S ) Rm |x − y| 88 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM Remark 6. We can also use convolution to prove that test functions are dense −∞ 0 −∞ in distributions, be they Cc , S , or C . Indeed, first say that a sequence φn ∈ ∞ m Cc (R ) is an approximate identity if Z 1 φn ≥ 0, φn(x) dx = 1, φn(x) = 0 if |x| > n , Rd ∞ m and note that, as distributions, φn → δ0. Let χn ∈ Cc (R ) approximate the constant function one, and notice that, for any distribution S, the sequence ∞ m Sn ∈ Cc (R ),Sn(x) = χn(x)(φn ∗ S)(x) converges as distributions to S. 4.3.2 Fourier transform Recall that Fourier transform of a function f ∈ L1(Rm) is defined by 1 m ∞ m F : L (R ) −→ L (R ) Z 1 −ix·ξ F(f)(ξ) = fb(ξ) = n/2 f(x)e dx (2π) Rm and satisfies for f ∈ S(Rm), F(∂xj f)(ξ) = iξjF(f)(ξ), ∂ξj F(f)(ξ) = −iF(xjf)(ξ). 1 To simplify powers of i, let us define Ds = i ∂s so that these identities become F(Dxj f)(ξ) = ξjF(f)(ξ),Dξj F(f)(ξ) = −F(xjf)(ξ), and indeed for any multi-indices α, β, α β |β| α β (4.2) ξ Dξ F(f)(ξ) = (−1) F(Dx x f)(ξ). In particular we have the very nice result that m m F : S(Rx ) −→ S(Rξ ). The formal adjoint of F, F ∗, is defined by ∗ ∞ m hF(f), hi m = hf, F (h)i m , for all f, h ∈ C ( ) Rξ Rx c R and since we have Z Z 1 −ix·ξ hF(f), hi m = F(f)(ξ)h(ξ) dξ = e f(x)h(ξ) dξdx Rξ m/2 Rm (2π) Rm×Rm 4.3. TEMPERED DISTRIBUTIONS 89 we conclude that Z ∗ 1 ix·ξ F (h)(x) = m/2 h(ξ)e dξ. (2π) Rm ∗ Using the fact Dj = Dj we find α β ∗ |α| ∗ α β (4.3) x Dx F (φ)(x) = (−1) F (Dξ ξ φ)(x) and hence ∗ m m F : S(Rξ ) −→ S(Rx ). Proposition 11 (Fourier inversion formula). As a map on S(Rm), the Fourier transform F is invertible with inverse F ∗. Proof. We want to show that, for any f ∈ S(Rm) we have F ∗(F(f)) = f, i.e., Z Z eix·ξ e−iy·ξf(y) dy dξ = (2π)mf(x) but note that the double integral does not converge absolutely and hence we can not change the order of integration. We get around this by introducing a function ψ ∈ S with ψ(0) = 1, letting ψε(x) = ψ(εx) and then using Lebesgue’s dominated convergence theorem to see that ∗ ∗ F (h) = lim F (ψεh). ε→0 Hence the integral we need to compute is given by Z Z ZZ lim eix·ξψ(εξ) e−iy·ξf(y) dy dξ = lim ei(x−y)·ξψ(εξ)f(y) dξ dy ε→0 ε→0 Z Z m/2 y−x dy m/2 = (2π) lim f(y)ψb( ) m = (2π) lim f(x + εη)ψb(η) dη ε→0 ε ε ε→0 Z = (2π)m/2f(x) ψb(η) dη. −|x|2/2 R m/2 If we take ψ(x) = e , then ψb(η) = ψ(η) and ψb(η) dη = (2π) so the result follows. One can prove FF ∗ = Id similarly. We can extend the Fourier transform and its inverse to tempered distributions in the usual way ∗ 0 m 0 m F, F : S (R ) −→ S (R ) ∗ ∗ m hφ, F(T )i = hF (φ),T i, hφ, F (T )i = hF(φ),T i for all φ ∈ S(R ) 90 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM and these are again mutually inverse and satisfy (4.2), (4.3). In particular, we can compute that Z ∗ ∗ 1 −m/2 hφ, F(δ0)i = hF (φ), δ0i = F (φ)(0) = m/2 φ(x) dx = hφ, (2π) i (2π) Rm −m/2 so that F(δ0) = (2π) . Then, using (4.2) and (4.3), we can deduce that α |α| −m/2 α ∗ m/2 ∗ α |α| m/2 α F(D δ0) = (−1) (2π) x , F (1) = (2π) δ0, F (x ) = (−1) (2π) D δ0. Another consequence of Proposition 11 is that, for any φ ∈ S(Rm), 2 ∗ 2 kφbkL2 = hF(φ), F(φ)i = hF F(φ), φi = hφ, φi = kφkL2 thus F has a unique extension to an invertible map 2 m 2 m F : L (R ) −→ L (R ) and this map is an isometry – a fact known as Plancherel’s theorem. The Fourier transform is a very powerful tool for understanding differential k operators. Note that if P ∈ Diff (Rm), X α P = aα(x)D , |α|≤k and the coefficients aα(x) are bounded with bounded derivatives, then for any φ ∈ S(Rm) (or even S0(Rm)) we have P φ = F ∗(p(x, ξ)Fφ) X α with p(x, ξ) = aα(x)ξ . |α|≤k The polynomial p(x, ξ) is called the total symbol of P and will be denoted σtot(P ). One consequence which can seem odd at first is that a differential operator acts by integration! Indeed, we have 1 ZZ (4.4) (P φ)(x) = F ∗(σ (P )Fφ)(x) = ei(x−y)·ξp(x, ξ)φ(y) dy dξ tot (2π)m although note that this integral is not absolutely convergent. Notice the commonality between (4.4) and (4.1). In each case we have a map of the form Z (4.5) A(f)(x) = KA(x, y)f(y) dy Rm where f and KA, the integral kernel of A, may have low regularity. In fact, this describes a very general class of maps. 4.3. TEMPERED DISTRIBUTIONS 91 Theorem 12 (Schwartz kernel theorem). Let M and M 0 be manifolds, E −→ M and E0 −→ M 0 vector bundles, and let HOM(E,E0) −→ M × M 0 0 0 0 be the bundle whose fiber at (ζ, ζ ) ∈ M × M is Hom(Eζ ,Eζ0 ). If ∞ −∞ 0 0 A : Cc (M; E) −→ C (M ; E ) is a bounded linear mapping, there exists −∞ 0 0 KA ∈ C (M × M ; Hom(E,E )) known as the Schwartz kernel of A or the integral kernel of A such that hψ, A(φ)i = hψ φ, KAi where ∞ 0 0 0 ψ φ ∈ Cc (M × M ), (ψ φ)(ζ, ζ ) = ψ(ζ)φ(ζ ). We will not prove this theorem6 and we will not actually use it directly. Rather we view as an inspiration to study operators by describing their Schwartz kernels. In fact, it is common to purposely confuse an operator and its Schwartz kernel. Note 1 0 that if the integral kernel is given by a function, say KA ∈ Lloc(M × M ), then A is given by (4.5). A simple example is the kernel of the identity map −∞ KId ∈ C (M × M), KId = δdiagM and we abuse notation by writing Z 0 0 f(ζ) = δdiagM f(ζ ) dvol(ζ ). M Note that on Rm we can also write its integral kernel in terms of the Fourier transform Z ∗ 1 i(x−y)·ξ (4.6) Id = F F =⇒ KId = m e dξ. (2π) Rm Similarly we can write the kernel of a differential operator P ∈ Diff∗(M) as −∞ ∗ KP ∈ C (M × M), KP = P δdiagM or, on Rm, 1 Z (4.7) K = ei(x−y)·ξp(x, ξ) dξ. P (2π)m 6For a proof, see ... 92 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM 4.4 Pseudodifferential operators on Rm 4.4.1 Symbols and oscillatory integrals In both (4.6) and (4.7) we have integral kernels of the form 1 Z ei(x−y)·ξp(x, ξ) dξ (2π)m where p(x, ξ) is a polynomial in ξ of degree k and the integral makes sense as a distribution. The way we define pseudodifferential operators is to take a similar integral, but replace p(x, ξ) with a more general function. Let U ⊆ Rm be a (regular7) open set and r ∈ R a real number. A function a ∈ C∞(U × Rm) is a symbol of order r on U, a ∈ Sr(U) if for all multi-indices α, β there is a constant Cα,β such that α β p 2 r−|β| sup |Dx Dξ a(x, ξ)| ≤ Cα,β( 1 + |ξ| ) . x∈U Notice that the symbol condition is really only about the growth of a and its derivatives as |ξ| → ∞. So we always have r r0 m 0 S (U) ⊆ S (U; R ) if r < r and, if a is smooth in (x, ξ) and compactly supported (or Schwartz) in ξ, then \ r −∞ m a ∈ S (U) = S (U; R ). r∈R For example, if r ∈ N then a polynomial of degree r is an example of a symbol of order r. Also, for arbitrary r ∈ R, we can take a function a(x, θ) defined for |θ| = 1, extend it to (x, ξ) with |ξ| > 1 by r ξ a(r, ξ) = |ξ| a(x, |ξ| ) for |ξ| > 1 and choose any smooth extension of a for |ξ| < 1 and we obtain a symbol of order r. It is easy to see that pointwise multiplication defines a map r m r m r +r m S 1 (U; R ) × S 2 (U; R ) −→ S 1 2 (U; R ) 7Recall that an open set is regular if it is equal to the interior of its closure. 4.4. PSEUDODIFFERENTIAL OPERATORS ON RM 93 and that differentiation defines a map α β r r−|β| m Dx Dξ : S (U) −→ S (U; R ). Given a symbol a ∈ Sr(U), we will make sense of the oscillatory integral Z (4.8) ei(x−y)·ξa(x, ξ) dξ as a distribution. If r < −m then the integral converges to a continuous function ∗ −∞ (equal to Fξ (a)(x, x − y)). Similarly if a ∈ S (U) then the distribution is actually ∞ a smooth function. In this case, the action of this distribution on φ ∈ Cc (U) is 1 Z φ 7→ ei(x−y)·ξa(x, ξ)φ(x) dξ dx. (2π)m Notice that i(x−y)·ξ 2 i(x−y)·ξ (1 + ξ · Dx)e = (1 + |ξ| )e so for a ∈ Sr with r < −m and for any ` ∈ N we have Z Z i(x−y)·ξ 2 −` ` i(x−y)·ξ e a(x, ξ)φ(x) dξ dx = (1 + |ξ| ) (1 + ξ · Dx) e a(x, ξ)φ(x) dξ dx Z i(x−y)·ξ ` 2 −` = e (1 − ξ · Dx) ((1 + |ξ| ) a(x, ξ)φ(x)) dξ dx Moreover we have ` 2 −` X α (1 − ξ · Dx) ((1 + |ξ| ) a(x, ξ)φ(x)) = aα(x, ξ)Dx φ(x) |α|≤` r−` r with each aα ∈ S (U). So for arbitrary r ∈ R and a ∈ S (U), we define the ∞ distribution (4.8) on Cc (U) by 1 Z φ 7→ ei(x−y)·ξ(1 − ξ · D )`((1 + |ξ|2)−`a(x, ξ)φ(x)) dξ dx (2π)m x where ` is any non-negative integer larger than r + m. We will continue to denote this distribution by (4.8). Alternately, we can interpret (4.8) as 1 Z φ 7→ lim ei(x−y)·ξψ(εξ)a(x, ξ)φ(x) dξ dx ε→0 (2π)m 94 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM ∞ m where ψ ∈ Cc (R ) is equal to one in a neighborhood of the origin. Then we can carry out the integration by parts from the previous paragraph before taking the limit. For any open set U ⊆ Rm, and s ∈ R we define pseudodifferential operators on U of order r in terms of their integral kernels 1 Z Ψr(U) = K ∈ C−∞(U × U): K = ei(x−y)·ξa(x, ξ) dξ, for some a ∈ Sr(U) . (2π)m The symbol a is known as the total symbol of the pseudodifferential operator. Given a ∈ Sr(U), there are several notations for the corresponding pseudodifferential operator, e.g., Op(a) and a(x, D). Let us determine the singular support of K ∈ Ψr(U). The useful observation that α i(x−y)·ξ α i(x−y)·ξ (4.9) (x − y) e = Dξ e shows that 1 Z (x − y)αK = (x − y)αei(x−y)·ξa(x, ξ) dξ (2π)m 1 Z (−1)|α| Z = (Dαei(x−y)·ξ)a(x, ξ) dξ = ei(x−y)·ξ(Dαa(x, ξ)) dξ (2π)m ξ (2π)m ξ is an absolutely convergent integral whenever a ∈ Sr(U) for r < |α|−m and moreover taking further derivatives shows that (x − y)αK ∈ Cj(U × U) if r − |α| < −m − j so we can conclude that sing supp K ⊆ diagU×U . As with symbols, pseudodifferential operators are filtered by degree r m r0 m 0 Ψ (R ) ⊆ Ψ (R ) if r < r and a special rˆoleis played by the operators of order −∞, −∞ m \ r m Ψ (R ) = Ψ (R ). r∈R Each a(x, D) ∈ Ψs(U) defines a continuous linear operator ∞ −∞ a(x, D): Cc (U) −→ C (U) 1 ZZ (a(x, D)φ)(x) = ei(x−y)·ξa(x, ξ)φ(y) dy dξ = F ∗(a(x, ξ)F(φ)). (2π)m 4.4. PSEUDODIFFERENTIAL OPERATORS ON RM 95 Proposition 13. For a ∈ Sr(U), the operator a(x, D) is a continuous map ∞ ∞ (4.10) a(x, D): Cc (U) −→ C (U). For a ∈ Sr(Rm), the operator a(x, D) also extends to a linear operator m m a(x, D): S(R ) −→ S(R ) r ∞ Proof. If a ∈ S (U) and φ ∈ Cc (U) then, for every x, a(x, ξ)φb(ξ) is Schwartz function of ξ, hence the integral Z a(x, D)(φ)(x) = eix·ξa(x, ξ)φb(ξ) dξ is absolutely convergent, and one can differentiate under the integral sign and the derivatives will also be given by absolutely convergent integrals. Similarly, when a ∈ Sr(Rm), Z Z α α α ix·ξ α ix·ξ α x a(x, D)(φ) = (−1) (Dξ e )a(x, ξ)φb(ξ) dx = (−1) e (Dξ a(x, ξ)φb(ξ)) dx is absolutely convergent for every α. The total symbol of a differential operator is a polynomial so it has an expan- sion k k X X β X p(x, ξ) = pβ(x)ξ = pj(x, ξ) j=0 |β|=j j=0 where each pj(x, ξ) is homogeneous j pj(x, λξ) = λ pj(x, ξ) and a symbol of order j. It turns out we can ‘asymptotically sum’ any sequence of symbols of decreasing order. m Proposition 14 (Asymptotic completeness). Let U be an open subset of R , {rj} rj an unbounded decreasing sequence of real numbers and aj ∈ S (U). There exists a ∈ Sr0 (U) such that, for all N ∈ N, N−1 X rN (4.11) a − aj ∈ S (U) j=0 which we denote X a ∼ aj. 96 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM This is analogous to Borel’s lemma to the effect that any sequence of real numbers can occur as the coefficients of the Taylor expansion of a smooth function at the origin. Proof. Let us write U as the union of an increasing sequence of compact sets Kj ⊆ U, [ Ki = U, and Kj ⊆ Kj+1 for all i ∈ N. j∈N ∞ m Pick χ ∈ C (R , [0, 1]) vanishing on B1(0) and equal to one outside B2(0), pick a + sequence converging to zero, (εj) ⊆ R , and set ∞ X a(x, ξ) = χ(εjξ)aj(x, ξ). j=0 Note that on any ball {|ξ| < R} only finitely many terms of the sequence are non- zero, so the sum is a well-defined smooth function. We will see that by choosing εj decaying sufficiently quickly, we can arrange (5.2). To see that a(x, ξ) ∈ Sr0 (U) we need to show that, for each α and β there is a constant Cα,β such that α β p 2 r0−|β| |Dx Dξ χ(εjξ)a(x, ξ)| ≤ Cα,β( 1 + |ξ| ) . Let us estimate the individual terms in the sum α β X α γ |β|−|µ| β−µ |Dx Dξ χ(εjξ)aj(x, ξ)| ≤ |Dx Dξ aj(x, ξ)|εj |(D χ)(εjξ)|. γ+µ=β −1 The term with γ = β is supported in |ξ| ≤ εj and the other terms are supported −1 −1 in εj ≤ |ξ| ≤ 2εj . We can bound the former term by q p 2 rj −|β| p 2 r0−|β| −2 rj −r0 Cα,β(aj)( 1 + |ξ| ) ≤ Cα,β(aj)( 1 + |ξ| ) ( 1 + εj ) r0−rj p 2 r0−|β| ≤ Cα,β(aj)εj ( 1 + |ξ| ) and by similar reasoning we can bound the latter terms by β−µ r0−rj p 2 r0−|β| Cα,γ(aj)kD χkL∞ εj ( 1 + |ξ| ) . So altogether we have α β r0−rj p 2 r0−|β| |Dx Dξ χ(εjξ)aj(x, ξ)| ≤ Cα,β(χaj)εj ( 1 + |ξ| ) . 4.4. PSEUDODIFFERENTIAL OPERATORS ON RM 97 for some constant Cα,β(χaj) independent of εj. Given L ∈ N we can choose a se- quence εj(L) so that r0−rj −2 |α| + |β| ≤ L =⇒ Cα,β(χaj)εj (L) ≤ j and hence α β −2 p 2 r0−|β| |α| + |β| ≤ L =⇒ |Dx Dξ χ(εj(L)ξ)aj(x, ξ)| ≤ j ( 1 + |ξ| ) α β and |Dx Dξ a(x, ξ)| satisfies the bounds we need for |α|+|β| ≤ L. By diagonalization (taking εj = εj(j)) we can arrange the convergence of p 2 −(r0−|β|) α β ( 1 + |ξ| ) |Dx Dξ a(x, ξ)| for all α, β. The same argument applies to show that X rN χ(εjξ)aj(x, ξ) ∈ S (U) j≥N P and hence that a ∼ aj. It is easy to see that if X a ∼ aj −∞ P then {aj} determine a up to an element of S (U). Indeed, if b ∼ aj then N−1 N−1 X X rN a − b = (a − aj) − (b − aj) ∈ S (U) j=0 j=0 for all N, hence a − b ∈ S−∞(U). There is a further subset of symbols that imitates polynomials a little closer. r r We say that a symbol a ∈ S (U) is a classical symbol, a ∈ Scl(U), if for each r−j j ∈ N there is a symbol ar−j ∈ S (U) homogeneous in ξ for |ξ| ≥ 1, i.e., r−j ar−j(x, λξ) = λ ar−j(x, ξ) whenever |ξ|, |λξ| ≥ 1, and we have ∞ X a ∼ ar−j. j=1 We refer to ar as the principal symbol of a(x, D). 98 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM 4.4.2 Amplitudes and mapping properties In order to prove that pseudodifferential operators are closed under composition and adjoints, it is convenient to allow the symbol to depend on both x and y. Remarkably, this does not lead to a larger class of operators. Let U ⊆ Rm be an open set, we say that a function b ∈ C∞(U × U × Rm) is an amplitude of order r ∈ R, r m b ∈ S (U × U; R ) if for any multi-indices α, β, γ, there is a constant Cα,β,γ such that α β γ r−|γ| (4.12) sup |Dx Dy Dξ b(x, y, ξ)| ≤ Cα,β,γ(1 + |ξ|) . x,y∈U Thus an amplitude is a symbol where there are twice as many ‘space’ variables (x, y) as there are ‘cotangent’ variables ξ. An amplitude defines a distribution 1 Z φ 7→ ei(x−y)·ξb(x, y, ξ)φ(y) dξ dy. (2π)m For fixed x this is a well-defined oscillatory integral in (y, ξ) by the discussion above. Since the estimates are uniform in x, this defines a continuous function in x. We will denote the set of distributions with amplitudes of order r by Ψe r(U). We clearly have Ψr(U) ⊆ Ψe r(U), and we will soon show that Ψe r(U) = Ψr(U). We start by showing this is true for operators in \ Ψe −∞(U) = Ψe r(U). r∈R Proposition 15. The following are equivalent: i) K ∈ Ψ−∞(U), 1 R i(x−y)·ξ −∞ ii) K = (2π)m e a(x, ξ) dξ for some a ∈ S (U), iii) K ∈ C∞(U × U) and for every N ∈ N and multi indices α, β, N α β (4.13) sup |(1 + |x − y|) Dx Dy K(x, y)| < ∞. x,y∈U iv) K ∈ Ψe −∞(U). 4.4. PSEUDODIFFERENTIAL OPERATORS ON RM 99 Proof. Clearly, (ii) implies (i) and (i) implies (iv). Let us check that (iv) implies (iii). By definition, K ∈ Ψe −∞(U) means that, r m for each r ∈ R, there is an amplitude br ∈ S (U × U; R ) such that 1 Z K(x, y) = ei(x−y)·ξb (x, y, ξ) dξ. (2π)m r Let us assume that r −m so that the integral converges absolutely and we can use (4.9) and integrate by parts to find 1 Z (x − y)αDβDγK(x, y) = (−1)|α| ei(x−y)·ξ(D + iξ)β(D − iξ)γb (x, y, ξ) dξ x y (2π)m x y r which converges absolutely and uniformly in x, y provided |β| + |γ| + r < |α| − m. α β γ Thus (x − y) Dx Dy K(x, y) is uniformly bounded. Finally we show that (iii) implies (ii). If K satisfies (iii) then the function L(x, z) = K(x, x − z) satisfies N α β sup |(1 + |z|) Dx Dz L(x, z)| < ∞ for all N, α, β. Thus L(x, z) is a Schwartz function in z with bounded derivatives in x. We can write 1 Z K(x, y) = L(x, x − y) = ei(x−y)·ξ(F L)(x, ξ) dξ (2π)m/2 z −∞ and FzL ∈ S (U) so we have (ii). At the moment, the mapping property (4.10) is not good enough to allow us to compose operators. We will need an extra condition on the support of the Schwartz ∞ kernels of pseudodifferential operators to guarantee that they preserve Cc (U). Let us say that a subset R ⊆ Rm × Rm is a proper support if the left and right projections m m m R × R −→ R , restricted to R, are proper functions8. An example of a proper support is a compact set, a better example is a set R satisfying R ⊆ {(x, y, ξ): d(x, y) ≤ ε} for any fixed ε > 0. We will say that a distribution K ∈ C−∞(Rm × Rm) is properly supported if supp K is a proper support. We will say that an amplitude b(x, y, ξ) ∈ C∞(Rm × Rm; Rm) is properly supported if suppx,y b = closure of {(x, y): b(x, y, ξ) 6= 0 for some ξ} is a proper support. 8Recall that any continuous function maps compact sets to compact sets, while a function is called proper if the inverse image of a compact set is always a compact set. 100 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM Theorem 16. Let b(x, y, ξ) ∈ C∞(U × U; Rm) be a properly supported amplitude of order r. The operator ∞ −∞ B : Cc (U) −→ C (U) Z Z (4.14) 1 i(x−y)·ξ Bφ(x) = m e b(x, y, ξ)φ(y) dy dξ (2π) Rm U in fact maps ∞ ∞ B : Cc (U) −→ Cc (U) and induces maps B : C∞(U) −→ C∞(U),B : C−∞(U) −→ C−∞(U), −∞ −∞ and B : Cc (U) −→ Cc (U). The formal adjoint of B is of the same form ∗ ∞ −∞ B : Cc (U) −→ C (U) Z Z ∗ 1 i(x−y)·ξ B φ(x) = m e b(y, x, ξ)φ(y) dy dξ. (2π) Rm U Moreover, B ∈ Ψr(U) and its total symbol has an asymptotic expansion |α| X i α α (4.15) a(x, ξ) ∼ (D D b(x, y, ξ)) α! ξ y x=y α where for a multi-index α = (α1, . . . , αm), α! denotes α1! ··· αm!. Proof. First note that the description of the adjoint follows immediately from Z Z Z 1 i(x−y)·ξ hBφ, fi = m e b(x, y, ξ)φ(y)f(x) dy dξ dx (2π) Rm U U Z Z Z 1 i(x−y)·ξ = m e b(x, y, ξ)f(x)φ(y) dx dξ dy (2π) U Rm U ∞ for all φ, f ∈ Cc (U). ∞ ∞ We see that B maps Cc (U) into C (U) by the same argument as in Proposition ∞ 13. However now if φ ∈ Cc (U) and supp φ = K then we know that −1 0 πy (K) ∩ suppx,y b = K is a compact set and it is immediate from (4.14) that supp B(φ) ⊆ K0. Thus B : ∞ ∞ Cc (U) −→ Cc (U) and by the same argument ∗ ∞ ∞ B : Cc (U) −→ Cc (U). 4.4. PSEUDODIFFERENTIAL OPERATORS ON RM 101 By duality this means that B and B∗ also induce maps C−∞(U) −→ C−∞(U). Let φ ∈ C∞(U), we know that Bφ is defined, but a priori this is a distribution. Let us check that Bφ is smooth, by checking that it is smooth on any compact set K ⊆ U. We know that −1 0 πy (K) ∩ suppx,y b = K ∞ 0 is also compact, so we can choose χ ∈ Cc (U) such that χ(p) = 1 for all p ∈ K . Then, on K, we have Bφ = Bχφ so Bφ is smooth in the interior of K. This shows that B : C∞(U m) −→ C∞(U m) and since B∗ is of the same form, B∗ : C∞(U) −→ C∞(U). Then by duality both B ∗ −∞ −∞ and B induce maps Cc (U) −→ Cc (U). Let us define |α| X i α α aj(x, ξ) = (D D b(x, y, ξ)) . α! ξ y x=y |α|=j r−j m Note that aj ∈ S (R ) so it makes sense to define a(x, ξ) to be a symbol obtained P by asymptotically summing aj. We will be done once we show that B −a(x, D) ∈ Ψ−∞(U). Recall from Taylor’s theorem that, for each ` ∈ N, X 1 X b(x, y, ξ) = ∂αb(x, x, ξ)(y − x)α + c (x, y, ξ)(y − x)α α! y α |α|≤`−1 |α|=` Z 1 `−1 α with cα(x, y, ξ) = (1 − t) ∂y b(x, (1 − t)x + ty, ξ) dt 0 α and note the important fact that ∂y b(x, y, ξ), and cα(x, y, ξ) all satisfy (4.12). Using (4.9) and integrating by parts we find that Z Z 1 i(x−y)·ξ 1 i(x−y)·ξ X 1 α α m e b(x, y, ξ) dξ = m e Dξ ∂y b(x, x, ξ) dξ (2π) m (2π) m α! R R |α|≤`−1 Z 1 i(x−y)·ξ X 1 α α + m e Dξ ∂y cα(x, y, ξ) dξ (2π) m α! R |α|=` `−1 Z Z X 1 i(x−y)·ξ 1 i(x−y)·ξ = m e aj(x, ξ) dξ + m e R`(x, y, ξ) dξ (2π) m (2π) m j=1 R R where R`(x, y, ξ) satisfies (4.12) with r replaced by r − `. 102 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM r−` By definition of a(x, ξ), we know that there is an element Q` ∈ S (U) such that `−1 X a(x, D) = aj(x, D) + Q`(x, D) j=1 and hence we know that Z 1 i(x−y)·ξ (4.16) B − a(x, D) = K(x, y) = m e (R`(x, y, ξ) − Q`(x, ξ)) dξ (2π) Rm and R`(x, y, ξ) − Q`(x, ξ) satisfies (4.12) with r replaced by r − `. It is clear from the proof of Proposition 15 that the inequalities (4.13) for a given N, α, β hold for K(x, y) if ` is large enough. Since we are able to choose ` as large as we like, this means that the inequalities (4.13) hold for all N, α, β, but this means that B − a(x, D) ∈ Ψ−∞(U) as required. This theorem lets us quickly deduce that we get the same class of pseudodif- ferential operators using amplitudes of symbols. And it lets us show that pseudod- ifferential operators with properly supported amplitudes are closed under adjoints and under composition. Corollary 17. For all r ∈ R, the sets of distributions Ψr(U) and Ψe r(U) coincide. Proof. Let K ∈ Ψe r(U) have amplitude b(x, y, ξ). Choose χ ∈ C∞(U × U) such that χ(x, y) = 0 if d(x, y) > 1 r then we can write K as the sum of K1 ∈ Ψe (U) with amplitude χ(x, y)b(x, y, ξ) and −∞ K2 ∈ Ψe (U) with amplitude (1 − χ(x, y))b(x, y, ξ). The theorem guarantees that r −∞ K1 ∈ Ψ (U) because it is properly supported, and we already know that Ψe (U) = Ψ−∞(U). We will say that a pseudodifferential operator is properly supported if its Schwartz kernel is properly supported. It is easy to see that this occurs if and only if we can represent it using a properly supported amplitude. Corollary 18. If A ∈ Ψr(U) is properly supported then A∗ ∈ Ψr(U) is properly supported with symbol X i|α| (4.17) σ (A∗)(x, ξ) ∼ DαDασ (A)(x, ξ). tot α! ξ x tot |α|≥0 ∗ If σtot(A) is a classical symbol then so is σtot(A ) and ∗ (4.18) σr(A ) = σr(A). 4.4. PSEUDODIFFERENTIAL OPERATORS ON RM 103 Proof. For any properly supported function χ ∈ C∞(U × U) identically equal to one in a neighborhood of the diagonal containing supp KA, the amplitude χ(x, y)σtot(A)(x, ξ) is a properly supported amplitude for A. It follows from the theorem that A∗ ∈ r r Ψe (U) = Ψ (U), with properly supported amplitude χ(y, x)σtot(A)(y, ξ), and symbol |α| ∗ X i α α σtot(A ) ∼ (D D χ(y, x)σtot(A)(y, ξ)) α! ξ y x=y |α|≥0 which, since χ(x, y) is equal to one in a neighborhood of the diagonal, implies (4.17). This formula in turn implies (4.18). Corollary 19. If A ∈ Ψr(U),B ∈ Ψs(U) are properly supported, the composition ∞ ∞ A ◦ B : Cc (U) −→ Cc (U) is a properly supported pseudodifferential operator A ◦ B ∈ Ψr+s(U) with symbol X i|α| (4.19) σ (A ◦ B) ∼ Dασ (A)(x, ξ)Dασ (B)(x, ξ). tot α! ξ tot x tot |α|≥0 If σtot(A) and σtot(B) are classical symbols then so is σtot(A ◦ B) and (4.20) σr+s(A ◦ B) = σr(A)σs(B). Proof. Let a(x, ξ) = σtot(A)(x, ξ) and b(x, ξ) = σtot(B)(x, ξ). We know that 1 Z B∗u(x) = eix·ξb(x, ξ)u(ξ) dξ (2π)m b ∞ integrating against a test function φ ∈ Cc (U) gives 1 Z hBφ, ui = hφ, Bui = e−ix·ξφ(x)b(x, ξ)u(ξ) dξ dx (2π)m b and so Z Bφc (ξ) = e−iy·ξb(y, ξ)φ(y) dy. This gives us a representation of B with an amplitude depending only on y. We can then write Z Z (A ◦ B)(φ) = A(Bφ) = eix·ξa(x, ξ)Bφc (ξ) dξ = ei(x−y)·ξa(x, ξ)b(y, ξ)φ(y) dy dξ. 104 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM Thus A ◦ B ∈ Ψe r+s(U) with amplitude a(x, ξ)b(y, ξ). Next let us check that it is properly supported. If K ⊆ Rm is a compact set, then 0 −1 m K = πL(πR (K) ∩ supp KB) ⊆ R is compact and hence −1 −1 0 πR (K) ∩ supp KA◦B = πR (K ) ∩ supp A is compact. Similarly, −1 −1 −1 πL (K) ∩ supp KA◦B = πL (πR(πL (K) ∩ supp KA)) ∩ supp B is compact, and so A ◦ B is properly supported. Thus, as before, we can multiply its amplitude by a cut-off function to make it a properly supported amplitude without affecting formula (4.15), which yields (4.19). If A and B are classical, then formula (4.19) shows that A ◦ B is classical, and that the principal symbols satisfy (4.20). To define pseudodifferential operators on manifolds, we need to know that an operator that looks like a pseudodifferential operator in one set of coordinates will look like a pseudodifferential operator in any coordinates. Theorem 20. Let U and W be open subsets of Rm and let F : U −→ W be a diffeomorphism. Let P ∈ Ψr(U) and let ∞ ∞ F∗P : Cc (W) −→ C (W) −1 (F∗P )(φ) = P (φ ◦ F ) ◦ F r then F∗P ∈ Ψ (W). For each multi-index α there is a function φα(x, ξ) which is a polynomial in ξ of degree at most |α|/2, with φ0 = 1, such that the symbol of (F∗P ) satisfies X φα(x, ξ) (4.21) σ (F P )(F (x), ξ) ∼ (Dασ (P ))(x, (DF )t(x)ξ). tot ∗ α! ξ tot |α|≥0 If σtot(P ) is classical, so is σtot(F∗P ) and its principal symbol satisfies t (4.22) σr(F∗P )(F (x), ξ) = σr(P )(x, DF (x)ξ). Notice that, as with a symbol of a differential operator, this shows that the principal symbol of a pseudodifferential operator on U is well-defined as a section of the cotangent bundle T ∗U. 4.4. PSEUDODIFFERENTIAL OPERATORS ON RM 105 Proof. Let us denote F −1 by H. Note that Z 1 H(x) − H(y) = ∂tH(tx + (1 − t)y) dt = Φ(x, y) · (x − y) 0 where Φ is a smooth matrix-valued function. Since Φ(x, x) = limy→x Φ(x, y) = DH(x) and H is a diffeomorphism, the matrix Φ(x, y) is invertible in a neighborhood ∞ Z of the diagonal. Let χ ∈ Cc (Z) be identically equal to one in a neighborhood of the diagonal. Let p = σtot(P ). We have (F∗P )(φ)(x) = P (u ◦ F )(H(x)) Z Z 1 i(H(x)−y)·ξ = m e p(H(x), ξ)u(F (y)) dy dξ (2π) Rm U Z Z 1 i(H(x)−H(z))·ξ = m e p(H(x), ξ)u(z) det DH(z) dz dξ (2π) Rm W Z Z 1 i(x−z)·Φ(x,z)tξ = m e p(H(x), ξ)u(z) det DH(z) dz dξ. (2π) Rm W Let us briefly denote the last integrand by γ(x, z, ξ), and decompose it as χγ + (1 − χ)γ. We know that (1−χ)γ is a pseudodifferential operator whose amplitude vanishes in a neighborhood of the diagonal, and hence is an element of Ψ−∞(W). On the other hand, when integrating χγ we can make the change of variables η = Φ(x, z)tξ and find Z Z 1 i(x−z)·η t −1 det DH(z) m e χ(x, z)p(H(x), (Φ(x, z) ) η)u(z) dz dη (2π) Rm W det Φ(x, z) which is an element of Ψe r(W) with amplitude χ(x, z) p(x, z, η) = p(H(x), (Φ(x, z)t)−1η) e det DF (H(z)) det Φ(x, z) so we can apply (4.15) and establish (4.21), which in turn proves (4.22). Finally, let us consider pseudodifferential operators acting between sections of bundles. In the present context this is very simple since any vector bundle over Rm is trivial. Thus if E −→ Rm and F −→ Rm are vector bundles, a pseudodifferential operator of order r acting between sections of E and sections of F, r m A ∈ Ψ (R ; E,F ), is simply a (rank F ) × (rank E) matrix with entries in Ψr(Rm). The extension of the discussion above of composition, adjoint, and mapping properties is straightforward. 106 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM 4.5 Exercises Exercise 1. Show that a distribution T on Rm supported at the origin is a differential operator applied to δ0. Hint: First use continuity to find a number K such that hφ, T i = 0 for all φ that vanish to order K at the origin. Next, use this to view T as a linear functional on the finite dimensional vector space consisting of the coefficients of the Taylor expansion of order K of smooth functions at the origin. Exercise 2. Show that, if P is a differential operator and T is a distribution, supp(P (T )) ⊆ supp(T ) and sing supp(P (T )) ⊆ sing supp T. Exercise 3. ∞ Let f ∈ C (R) and let δ0 be the Dirac delta distribution at the origin. Express the 3 2 distribution f∂xδ0 as a C-linear combination of the distributions δ0, ∂xδ0, ∂xδ0 and 3 ∂xδ0. Exercise 4. −∞ a) Suppose that T ∈ C (R) satisfies ∂xT = 0 prove that T is equal to a constant C ∈ C. b) Let S ∈ C−∞(R), show that there is a distribution R ∈ C−∞(R) such that ∂xR = S. ∞ R ∞ Hint: Choose ψ ∈ Cc (R) such that ψ dx = 1. Given φ ∈ Cc (M) show that ∞ R there exists φe ∈ Cc (R) such that φ = φ dx ψ + ∂xφ.e For (a), show that R hφ, T i = φ dx hψ, T i, so C = hψ, T i. For (b), show that hφ, Ri = −hφ,e Si defines a distribution that satisfies ∂xR = S. Exercise 5. Suppose that T ∈ S0(Rm) satisfies ∆T = 0 on Rm. Show that T is a polynomial. Hint: First show that F(T ) is a distribution supported at the origin. Exercise 6. Let U ⊆ Rm be an open set and K ∈ C−∞(U ×U). Recall that K defines an operator ∞ −∞ Cc (U) −→ C (U). Suppose that K satisfies ∞ ∞ −∞ −∞ K : Cc (U) −→ C (U) and K : Cc (U) −→ C (U) and that K is smooth except possibly on diagU , i.e., sing supp K ⊆ diagU . Show that sing supp Ku ⊆ sing supp u −∞ for all u ∈ Cc (U). 4.6. BIBLIOGRAPHY 107 Exercise 7. −∞ ∞ Let KA ∈ C (U × U) and suppose that the associated operator A : Cc (U) −→ −∞ ∞ ∞ C (U) maps Cc (U) −→ C (U). Show that: ∞ −∞ ∞ i)If KA ∈ Cc (U × U), then A extends to a map C (U) −→ Cc (U). −∞ ∞ ∗ −∞ ∞ ii)Conversely, if A : C (U) −→ Cc (U) and A : C (U) −→ Cc (U), show that ∞ KA ∈ Cc (U × U). Exercise 8. Give an example of a pseudodifferential operator A ∈ Ψ∗(Rm) (e.g., of order −∞) ∞ m and a function φ ∈ Cc (R ) such that supp A(φ) is not contained in supp φ. Exercise 9. For properly supported A ∈ Ψr(Rm),B ∈ Ψs(Rm) with classical symbols, show that the commutator [A, B] ∈ Ψr+s−1(Rm) and find its principal symbol. Exercise 10. r m Assume that A ∈ Ψ (R ) is properly supported with classical symbol. If σr(A) is m ∗ ∗ real-valued (i.e., σr(A)(x, ξ) ∈ R for all x, ξ ∈ R ). Show that [A, A ] = AA − A∗A ∈ Ψ2r−2(Rm). Exercise 11. a) Given a function ∞ m m−1 a(x, ξ) ∈ C (R × S ) uniformly bounded together with its derivatives, and a number r ∈ R show that there exists a pseudodifferential operator A ∈ Ψr(Rm) with classical symbol such that σr(A)(x, ξ) |ξ|=1 = a(x, ξ). r m r−1 m b) If A ∈ Ψ (R ) has classical symbol and σr(A) = 0 show that A ∈ Ψ (R ). 4.6 Bibliography Of the books we have been using so far, both Spin geometry by Lawson and Michelson and the second volume of Partial differential equations by Taylor, treat pseudodifferential operators. For distributions, one can look at Bony, Cours d’analyse, volume one of Hormander¨ , The analysis of linear partial differential operators, and Introduction to the theory of distributions by Friedlander and Joshi. Richard Melrose has several excellent sets of lecture notes involving pseu- dodifferential operators at http://math.mit.edu/~rbm/Lecture_notes.html. Other good sources are the books Pseudodifferential operators by Michael Taylor and 108 LECTURE 4. PSEUDODIFFERENTIAL OPERATORS ON RM Pseudodifferential operators and spectral theory by Mikhail Shubin, and the lec- ture notes Introduction to pseudo-differential operators by Mark Joshi (available online on the arXiv). For an in-depth treatment, one can look at the four volumes of The analysis of linear partial differential operators by Hormander¨ . Lecture 5 Pseudodifferential operators on manifolds 5.1 Symbol calculus and parametrices Let M be an orientable compact Riemannian manifold, let E and F be complex vector bundles over M endowed with Hermitian metrics1 and let us denote by HOM(E,F ) −→ M × M the vector bundle whose fiber over a point (ζ, ζ0) ∈ M × M is the vector space Hom(Eζ ,Fζ0 ). A distribution K ∈ C−∞(M × M; HOM(E,F )) is a pseudodifferential operator on M of order r, acting between sections of E and sections of F, K ∈ Ψr(M; E,F ) if K is smooth away from the diagonal, i.e., sing supp K ⊆ diagM , ∞ and, for any coordinate chart φ : U −→ V of M and any smooth function χ ∈ Cc (V), the distribution χ(x)K(x, y)χ(y) coincides (via φ) with a pseudodifferential operator on U, r rank E rank F Kφ,χ ∈ Ψ (U; R , R ) 1We assume orientability, compactness, and a choice of metrics for simplicity, but we could more invariantly work with operators on half-densities. 109 110 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS acting between the trivial bundles φ∗E and φ∗F and with classical symbol of order r. Although this definition requires us to check that Kφ,χ is a pseudodifferential operator for every coordinate chart, Theorem 20 shows that it is enough to check this for a single atlas of M. This same theorem guarantees that the principal symbols of the operators Kφ,χ together define ∞ ∗ ∗ ∗ σr(K) ∈ C (T M; Hom(π E, π F )) with π : T ∗M −→ M the canonical projection. The principal symbol is homogeneous away form the zero section of T ∗M, so we do not lose any interesting information if we restrict it to the cosphere bundle ∗ ∗ S M = {v ∈ T M : g(v, v) = 1}. From now on, we will think of the symbol as a section ∞ ∗ ∗ ∗ σr(K) ∈ C (S M; Hom(π E, π F )). Let us summarize some facts about pseudodifferential operators on manifolds that follow from our study of pseudodifferential operators on Rm. Theorem 21. Let M be an orientable compact Riemannian manifold and let E,F, and G be complex vector bundles over M endowed with Hermitian metrics. For each r ∈ R we have a class Ψr(M; E,F ) of distributions satisfying r < r0 =⇒ Ψr(M; E,F ) ⊆ Ψr0 (M; E,F ) with a principal symbol map participating in a short exact sequence (5.1) r−1 r σr ∞ ∗ ∗ ∗ 0 −→ Ψ (M; E,F ) −→ Ψ (M; E,F ) −−→C (S M; Hom(π E, π F )) −→ 0. Pseudodifferential operators are asymptotically complete: rj For any unbounded decreasing sequence of real numbers {rj} and Aj ∈ Ψ (M; E,F ) there is A ∈ Ψr1 (M; E,F ) such that, for all N ∈ N, N−1 X rN (5.2) A − Aj ∈ Ψ (M; E,F ) j=1 which we denote X A ∼ Aj. 5.1. SYMBOL CALCULUS AND PARAMETRICES 111 Pseudodifferential operators act on functions and on distributions A ∈ Ψr(M; E,F ) =⇒ A : C∞(M; E) −→ C∞(M; F ) and A : C−∞(M; E) −→ C−∞(M; F ), they are closed under composition which is compatible with both the degree and the symbol A ∈ Ψr(M; E,F ),B ∈ Ψs(M; F,G) =⇒ r+s A ◦ B ∈ Ψ (M; E,G) and σr+s(A ◦ B) = σr(A) ◦ σs(B). The formal adjoint of a pseudodifferential operators is again a pseudodifferential operator of the same order and their principal symbols are mutually adjoint. Pseudodifferential operators contain differential operators Diffk(M; E,F ) ⊆ Ψk(M; E,F ) and the symbol of a differential operators coincides with its symbol as a pseudodif- ferential operator. Pseudodifferential operators also contain smoothing operators \ A ∈ Ψ−∞(M; E,F ) = Ψr(M; E,F ) ⇐⇒ A ∈ C∞(M × M; HOM(E,F )) r∈R ⇐⇒ A : C−∞(M; E) −→ C∞(M; F ) and A∗ : C−∞(M; F ) −→ C∞(M; E). Notice that, since M is compact, we do not need to distinguish between smooth functions and those with compact support (and similarly for distributions) and that all distributions are properly supported. We say that a pseudodifferential operator A ∈ Ψr(M; E,F ) is elliptic if its principal symbol is invertible, i.e., ∞ ∗ ∗ σr(A) ∈ C (M; Iso(π E, π F )). Thus an elliptic differential operator, such as a Laplace-type or Dirac-type operator, is also an elliptic pseudodifferential operator. Although invertibility of the symbol can not guarantee invertibility of the operator, e.g., the Laplacian on functions has all constant functions in its null space, it turns out that invertibility of the symbol is equivalent to invertibility of the operator up to smoothing operators. Theorem 22. A pseudodifferential operator A ∈ Ψr(M; E,F ) is elliptic if and only if there exists a pseudodifferential operator −r −∞ −∞ (5.3) B ∈ Ψ (M; F,E) s.t. AB − IdF ∈ Ψ (M; F ),BA − IdE ∈ Ψ (M; E) 112 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS If B satisfies (5.3) we say that B is a parametrix for A. It is easy to see that A has a unique parametrix up to smoothing operators (see exercise 2). Proof. Since the symbol of A is invertible and the sequence (5.1) is exact, we know that there is an operator −r −1 ∞ ∗ ∗ ∗ B1 ∈ Ψ (M; F,E), with σ−r(B) = σr(A) ∈ C (S M; Iso(π F, π E)). The composition A ◦ B1 is a pseudodifferential operator of order zero and σ0(A ◦ B1) = σr(A) ◦ σ−r(B) = Idπ∗E 0 is the same symbol as the identity operator IdE ∈ Ψ (M; E). Again appealing to the exactness of (5.1) we see that −1 R1 = IdE −AB1 ∈ Ψ (M; E). Let us assume inductively that we have found B1,...,Bk satisfying −r−(i−1) Bi ∈ Ψ (M; F,E) (5.4) −k and Rk = IdE −A(B1 + ... + Bk) ∈ Ψ (M; E) −r−k and try to find Bk+1 ∈ Ψ (M; F,E) such that −k−1 Rk+1 = IdE −A(B1 + ... + Bk+1) ∈ Ψ (M; E). Note that we can write this last equation as Rk+1 = Rk − ABk+1 −k and since both Rk and ABk+1 are elements of Ψ (M; E) exactness of (5.1) tells us that −k−1 Rk+1 ∈ Ψ (M; E) ⇐⇒ σk(Rk − ABk+1) = 0. −r−k So it suffices to let Bk+1 be any element of Ψ (M; F,E) with principal symbol −1 σ−r−k(Bk+1) = σr(A) ◦ σ−k(Rk) which we know exists by another application of exactness of (5.1). Thus inductively we can solve (5.4) for all k ∈ N. Next use asymptotic completeness to find B ∈ Ψ−r(M; F,E) such that X B ∼ Bi 5.1. SYMBOL CALCULUS AND PARAMETRICES 113 and notice that, for any N ∈ N, −N IdE −AB = IdE −A(B1 + ... + BN ) − A(B − B1 − ... − BN ) ∈ Ψ (M; E) −∞ hence IdE −AB ∈ Ψ (M; E) as required. The same construction can be used to construct B0 ∈ Ψ−r(M; F,E) such that 0 −∞ 0 −∞ IdF −B A ∈ Ψ (M; F ) but then B − B ∈ Ψ (M; F,E) by exercise 2 and so we −∞ also have IdF −BA ∈ Ψ (M; F ). An immediate corollary is elliptic regularity. Corollary 23. Let A ∈ Ψr(M; E,F ) be an elliptic pseudodifferential operator. If u ∈ C−∞(M; E) then Au ∈ C∞(M; F ) =⇒ u ∈ C∞(M; E). More generally, sing supp Au = sing supp u for all u ∈ C−∞(M; E). Proof. It is true for any pseudodifferential operator A that sing supp Au ⊆ sing supp u, (see exercise 6 of the previous lecture) the surprising part is the opposite inclusion. −r −∞ Let B ∈ Ψ (M; F,E) be a parametrix for A and R = IdE −BA ∈ Ψ (M; E) then any u ∈ C−∞(M; E) satisfies u = BAu + Ru and since Ru ∈ C∞(M; E) we have sing supp u = sing supp BAu ⊆ sing supp Au as required. In the next few sections we will continue to develop consequences of the exis- tence of a parametrix. 114 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS 5.2 Hodge decomposition An elliptic operator in Ψ∗(M; E,F ) induces a decomposition of the sections of E into its null space, which is finite dimensional, and its orthogonal complement and of the sections of F into its image, which has finite codimension, and its orthogonal complement. This is a version of the Hodge decomposition for an arbitrary elliptic operator. We will establish this, starting with the case of smoothing perturbations of the identity, and then show that it implies the Hodge decomposition of differential forms. Theorem 24. Let M be a compact Riemannian manifold, and E a Hermitian vector bundle over M. Any R ∈ Ψ−∞(M; E) satisfies: i) The null space of IdE +R acting on distributions is a finite dimensional space of smooth sections of E, −∞ ∞ {u ∈ C (M; E) : (IdE +R)u = 0} ⊆ C (M; E) is finite dimensional. ii) We have a decomposition −∞ ∗ C (M,E) = Im(IdE +R) ⊕ ker(IdE +R ) by which we mean that, whenever f ∈ C−∞(M; E), ∗ f ∈ Im(IdE +R) ⇐⇒ hf, uiE = 0 for all u ∈ ker(IdE +R ). −∞ Proof. If u ∈ C (M; E) and (IdE +R)u = 0 then u = −Ru ∈ C∞(M; E). Since M is compact C∞(M; E) ⊆ L2(M; E), so ker A ⊆ L2(M; E). In particular, if L ⊆ ker A is a bounded subset of L2(M; E) then L is also a bounded subset of C∞(M; E) and so by the Arzela-Ascoli theorem every sequence has a convergent subsequence in C∞(M; E) and hence in L2(M; E). This means that any orthonormal set in ker A must be finite, and so ker A is a finite dimensional space. ∗ ⊥ It is easy to see that Im(IdE +R) = (ker(IdE +R )) so let us show that −∞ Im(IdE +R) is closed. Assume that w ∈ C (M; E) and there exists a sequence −∞ (un) ⊆ C (M; E) such that (IdE +R)un → w as distributions. Because smooth functions are dense in distributions (cf. Remark 6) we can assume without loss of generality that ∞ ⊥ (un) ⊆ C (M; E) ∩ (ker(IdE +R)) . 5.2. HODGE DECOMPOSITION 115 2 Let us first consider the case where (un) is a bounded subset of L (M; E). Here, as above, we know that (Run) has a convergent subsequence, but then un = (IdE +R)un− Run must have a convergent subsequence, say un → u, and by continuity w = (IdE +R)u as required. 2 On the other hand, if un is not bounded in L , then we have a subsequence with kunkL2 → ∞. Restricting to this subsequence and letting un vn = kunkL2 2 we now have a sequence that is bounded in L (M; E) and moreover (IdE +R)vn → 0. By the previous argument, vn must have a convergent subsequence, say vn → v and ⊥ (IdE +R)v = 0. Since vn ∈ (ker(IdE +R)) this requires that v = 0, but then this contradicts the fact that kvkL2 = lim kvnkL2 = 1. n→∞ Corollary 25. Let M be a compact Riemannian manifold, E and F Hermitian vector bundles over M. For any r ∈ R and any elliptic operator of order r, A ∈ Ψr(M; E,F ), we have: i) The null space of A is finite dimensional. ii) The range of A is the orthogonal complement of the null space of A∗, i.e., if f ∈ C∞(M; F ) then ∗ f ∈ Im(A) ⇐⇒ hf, uiF = 0 for all u ∈ ker(A ) An operator that satisfies parts (i) and (ii) of the corollary is called Fredholm. Proof. Let B ∈ Ψ−r(M; F,E) be a parametrix for A and −∞ −∞ R = BA − IdE ∈ Ψ (M; E),S = AB − IdF ∈ Ψ (M; F ). The null space of A as an operator on C−∞(M; E) is contained in the null space of Id +R, hence by Theorem 24 is finite dimensional and made up of smooth functions. It is easy to see that Im A ⊆ (ker A∗)⊥, so assume that f ∈ (ker A∗)⊥ and let us show that f ∈ Im(A). Using the theorem, we can write ∗ ⊥ ∗ ⊥ ∗ ∗ ⊥ ∗ ⊥ (5.5) (ker A ) = (ker A ) ∩ ker(IdF +S ) ⊕ (ker A ) ∩ (ker(IdF +S )) so we only need to show that these two summands are in Im(A). For the first sum- ∗ ∗ ∗ mand, notice that ker(IdF +S ) = ker(B A ), and we have a short exact sequence ∗ (5.6) 0 −→ ker A∗ −→ ker B∗A∗ −−−→A Im(A∗) ∩ ker B∗ −→ 0. 116 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS Since all of these spaces are finite dimensional, we can identify ∗ ⊥ ∗ (ker A ) ∩ ker(IdF +S ) = Im A Im(A∗)∩ker B∗ ⊆ Im A. ∗ ∗ ∗ ∗ ⊥ For the second summand in (5.5), IdF +S = B A implies (ker(IdF +S )) ⊆ (ker A∗)⊥, so we can it as ∗ ⊥ ∗ ⊥ ∗ ⊥ (ker A ) ∩ (ker(IdF +S )) = (ker(IdF +S )) = Im(IdF +S) = Im AB ⊆ Im A. We can interpret this corollary, the fact that an elliptic operator A ∈ Ψr(M; E,F ) is Fredholm as an operator A : C∞(M; E) −→ C∞(M; F ) or A : C−∞(M; E) −→ C−∞(M; F ), as telling us about solutions to the equation Au = f. Namely, we can solve this equation precisely when f ∈ C−∞(M; F ) satisfies the (finitely many) conditions ∗ hf, u1i = 0, hf, u2i = 0,..., hf, uki = 0 where {u1, . . . , uk} is a basis of ker A and, in that case, there is a finite dimensional space of solutions, u0 + ker A ⊆ −∞ C (M; E) with u0 a particular solution. By elliptic regularity these solutions are smooth if and only if f is smooth. We also know that the ‘cokernel’ of A as an operator on smooth sections coker(A) = C∞(M; F )/A(C∞(M; E)) or as an operator on distributional sections coker(A) = C−∞(M; F )/A(C−∞(M; E)) is naturally isomorphic to ker A∗ ⊆ C∞(M; F ). The index of A as a Fredholm oper- ator ind(A) = dim ker A − dim coker A = dim ker A − dim ker A∗ is finite and is the same when A acts on functions and when A acts on distributions. Another immediate consequence of this corollary is the Hodge decomposition. 5.2. HODGE DECOMPOSITION 117 Corollary 26. Let M be a compact Riemannian manifold, for any k ∈ N we have decompositions C∞(M;ΛkT ∗M) = d C∞(M;Λk−1T ∗M) ⊕ δ C∞(M;Λk+1T ∗M) ⊕ Hk(M) C−∞(M;ΛkT ∗M) = d C−∞(M;Λk−1T ∗M) ⊕ δ C−∞(M;Λk+1T ∗M) ⊕ Hk(M). where the latter is a topological direct sum and the former an orthogonal direct sum. It follows that the de Rham cohomology groups coincide with the Hodge co- homology groups, and this remains true if the de Rham cohomology is computed distributionally. Proof. Applying Corollary 25 to the Hodge Laplacian we know that ∞ k ∗ k C (M;Λ T M) = Im(∆k) ⊕ H (M) ⊆ d C∞(M;Λk−1T ∗M) + δ C∞(M;Λk+1T ∗M) ⊕ Hk(M). However it is easy to see that d C∞(M;Λk−1T ∗M) ⊥ δ C∞(M;Λk+1T ∗M) since 2 hdω, δηiΛkT ∗M = hd ω, ηiΛ∗T ∗M = 0, which establishes the decomposition for smooth forms. The decomposition for smooth forms shows that any distribution that vanishes on d C∞(M;Λk−1T ∗M) , δ C∞(M;Λk+1T ∗M) , and Hk(M) must be identically zero. Since this is true for distributions in d C−∞(M;Λk−1T ∗M) ∩ δ C−∞(M;Λk+1T ∗M) , we also have the decomposition for distributional forms. The decomposition clearly shows that ker d :Ωk(M) −→ Ωk+1(M) ∼= Hk(M) Im (d :Ωk−1(M) −→ Ωk(M)) and similarly for distributions. The Hodge decomposition also holds on the Dolbeault complex (3.8), and more generally for an elliptic complex defined as follows: Let E1,...,E` be Hermitian vector bundles over a compact Riemannian manifold M and suppose that we have 1 differential operators Di ∈ Diff (M; Ei,Ei+1), ∞ D1 ∞ D2 D`−1 ∞ 0 −→ C (M; E1) −−−→C (M; E2) −−−→ ... −−−−→C (M; E`) −→ 0 118 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS ∗ such that Di ◦ Di+1 = 0. This complex is an elliptic complex if, for all ξ ∈ S M, the sequence of leading symbols ∗ σ1(D1)(ξ) ∗ σ1(D2)(ξ) σ1(D`−1)(ξ) ∗ 0 −→ π E1 −−−−−−−→ π E2 −−−−−−−→ ... −−−−−−−−→ π E` −→ 0 is exact. In this setting the same proof as above shows that we have a natural isomorphism of finite dimensional vector spaces ker Di ∼ ∗ ∗ = ker(DiDi−1 + Di+1Di), Im Di−1 i.e., the cohomology spaces of the complex are finite dimensional and coincide with the Hodge cohomology spaces. 5.3 The index of an elliptic operator An elliptic operator acting on a closed manifold has a distinguished parametrix. Given A ∈ Ψr(M; E,F ) elliptic, let us write C∞(M; E) = ker A ⊕ Im(A∗), C∞(M; F ) = ker A∗ ⊕ Im(A) and define G : C∞(M; F ) −→ C∞(M; E) by ( Gs = 0 if s ∈ ker A∗ Gs = u if s ∈ Im(A) and u is the unique element of Im(A∗) s.t. Au = s The operator G is known as the (L2) generalized inverse of A. It satisfies GA = IdE −Pker A, AG = IdF −Pker A∗ ∗ where Pker A and Pker A∗ are the orthogonal projections onto ker A and ker A , re- spectively. Notice that these projections are smoothing operators, since we can write X ∗ KPker A (x, y) = φj(x)φj (y), (5.7) j ∗ with {φj} a basis of ker A and {φj } the dual basis and similarly for Pker A∗ . It turns out that G is itself a pseudodifferential operator, and we can express the index of A using the trace of a smoothing operator −∞ Tr : Ψ (M; E) −→ C Z Tr(P ) = trE KP diag dvolg M M 5.3. THE INDEX OF AN ELLIPTIC OPERATOR 119 Theorem 27. Let A ∈ Ψr(M; E,F ) be an elliptic operator and G : C∞(M; F ) −→ C∞(M; E) its generalized inverse, then G ∈ Ψ−r(M; F,E) and ind(A) = Tr(Pker A) − Tr(Pker A∗ ) =: Tr([A, G]). Proof. Let B ∈ Ψ−r(M; F,E) be a parametrix for A and −∞ −∞ R = BA − IdE ∈ Ψ (M; E),S = AB − IdF ∈ Ψ (M; F ). Making use of associativity of multiplication, we have BAG = G + RG = B − BPker A∗ , GAB = G + GS = B − Pker A hence G = B − BPker A∗ − RG and G = B − Pker AB − GS, and substituting the latter into the former, G = B − BPker A∗ − RB + RPker AB + RGS. This final term satisfies RGS : C−∞(M; F ) −→ C∞(M; E),S∗G∗R∗ : C−∞(M; E) −→ C∞(M : F ) which shows that RGS ∈ Ψ−∞(M; F,E), and so G − B ∈ Ψ−∞(M; F,E) and G ∈ Ψ−r(M; F,E). Now note that, from (5.7), X 2 Tr(Pker A) = kφjkL2(M;E) = dim ker A and similarly for Pker A∗ so the formula ind(A) = Tr(Pker A) − Tr(Pker A∗ ) is immediate. Let us establish a couple of properties of the trace of smoothing operators that will be useful for understanding the index. Proposition 28. Let M be a compact Riemannian manifold and E,F Hermitian vector bundles. i) The operators of finite rank {K ∈ Ψ−∞(M; E,F ) : dim Im(K) < ∞} are dense in Ψ−∞(M; E,F ). ii) If P ∈ Ψ−∞(M; F,E) and A ∈ Ψr(M; E,F ) then Tr(AP ) = Tr(PA). 120 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS −∞ ∞ Proof. i) Given P ∈ Ψ (M; E,F ) so that KP ∈ C (M × M; HOM(E,F )) let us ∞ show that we can approximate KP in the C -topology by finite rank operators. Us- ing partitions of unity we can assume that KP is supported in an open set of M ×M over which E and F are trivialized, so KP is just a smooth matrix valued function on an open set of Rm × Rm. Identifying this open set with an open set in Tm × Tm with Tm the m-dimensional torus, we can expand each of the entries in a Fourier series in each variable. The truncated series then are finite rank operators that converge uniformly with all of their derivatives to KP , and restricting their support we get finite rank operators on M that approximate P. ii) By continuity we can assume that P has finite rank and by linearity that P has rank one, say P φ = v(φ)s, for some v ∈ C∞(M; F ∗), s ∈ C∞(M; E). Then we have AP (φ) = v(φ)As, P A(φ) = v(Aφ)s Z so Tr(AP ) = v(As) dvolg = Tr(PA). Now we can derive some of the most important properties of the index of an elliptic operator. Theorem 29. Let M be a compact Riemannian manifold, E,F, and G Hermitian vector bundles over M, and let A ∈ Ψr(M; E,F ) and B ∈ Ψs(M; F,G) be elliptic. i) B ◦ A ∈ Ψr+s(M; E,G) is elliptic and ind(B ◦ A) = ind(B) + ind(A). ii) If Q ∈ Ψ−r(M; F,E) is any parametrix of A then ind(A) = Tr(IdE −QA) − Tr(IdF −AQ) =: Tr([A, Q]). iii) If P ∈ Ψ−∞(M; E,F ) then ind(A) = ind(A + P ). iv) The index is a smooth homotopy invariant of elliptic operators. v) The index of A only depends on σr(A). Proof. i) Recall that we have a short exact sequence (5.6) ∗ 0 −→ ker A∗ −→ ker B∗A∗ −−−→A Im(A∗) ∩ ker B∗ −→ 0 5.3. THE INDEX OF AN ELLIPTIC OPERATOR 121 as these are all finite dimensional, we can conclude that dim ker(A∗B∗) = dim ker(B∗) + dim((ker A∗) ∩ (Im B∗)). By the same reasoning dim ker(BA) = dim ker(A) + dim((ker B) ∩ (Im A)) so that ind(BA) is dim ker A + dim((ker B) ∩ (Im A)) − dim ker(B∗) − dim((ker A∗) ∩ (Im B∗)) = dim ker A + dim((ker B) ∩ (ker A∗)⊥) − dim ker(B∗) − dim((ker A∗) ∩ (ker B)⊥) = dim ker A + dim(ker B) − dim ker(B∗) − dim(ker A∗) = ind(A) + ind(B) as required. ii) From Theorem 27, we know that if G is the generalized inverse of A then Tr([A, G]) = ind(A) and from the proof of Theorem 27 we know that G − Q ∈ Ψ−∞(M; F,E). It follows that Tr([A, Q]) − Tr([A, G]) = Tr(IdE −QA) − Tr(IdF −AQ) − Tr(IdE −GA) + Tr(IdF −AG) = Tr((G − Q)A) − Tr(A(G − Q)) which vanishes by Proposition 28. iii) follows immediately from (ii) since any parametrix of A is a parametrix of A + P. r iv) If t 7→ At ∈ Ψ (M; E,F ) is a smooth family of elliptic pseudodifferen- −r tial operators then we can construct a smooth family t 7→ Qt ∈ Ψ (M; F,E) of parametrices and hence ind(At) = Tr([At,Qt]) is a smooth function of t. Since this is valued in the integers, it must be constant and independent of t. v) It suffices to note that if two operators A and A0 have the same elliptic symbol then tA + (1 − t)A0 is a smooth family of elliptic operators connecting A and A0, so by (iv) they have the same index. 122 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS 5.4 Pseudodifferential operators acting on L2 So far, we have been studying pseudodifferential operators as maps C∞(M; E) −→ C∞(M; F ) or C−∞(M; E) −→ C−∞(M; F ), now we want to look at more refined mapping properties by looking at distributions with some intermediate regularity, e.g., C∞(M; E) ⊆ L2(M; E) ⊆ C−∞(M; E). Let us recall some definitions for linear operators acting between Hilbert spaces2, H1 and H2. It is convenient, particularly when differential operators are involved, to allow for linear operators to be defined on a subspace of H1, P : D(P ) ⊆ H1 −→ H2. We think of a linear operator between Hilbert spaces as the pair (P, D(P )) and only omit D(P ) when it is clear from context. We will always assume that D(P ) is dense in H1. Thus for instance any pseudodifferential operator A ∈ Ψr(M; E,F ) defines a linear operator between L2 spaces with domain C∞(M; E), A : C∞(M; E) ⊆ L2(M; E) −→ L2(M; F ). Of course, with this domain A actually maps into C∞(M; F ), so we will be interested in extending A to a larger domain. An extension of a linear operator (P, D(P )) is an operator (P,e D(Pe)), Pe : D(Pe) ⊆ H1 −→ H2 such that D(P ) ⊆ D(Pe) and Pe u = P u for all u ∈ D(P ). An operator (P, D(P )) is bounded if it sends bounded sets to bounded sets. In particular, there is a constant C such that kP ukH2 ≤ CkukH1 for all u ∈ D(P ) and the smallest constant is called the operator norm of P and denoted kP kH1−→H2 . 2For simplicity, we will focus on Hilbert spaces based on L2(M; E), but pseudodifferential operators also have good mapping properties on Banach spaces such as Lp(M; E), p 6= 2. 5.4. PSEUDODIFFERENTIAL OPERATORS ACTING ON L2 123 Recall that an operator (P, D(P )) is continuous at a point if and only if it is contin- uous at every point in D(P ) if and only if it is bounded. A bounded operator has a unique extension to an operator defined on all of H1, moreover this extension is bounded with the same operator norm. If an operator is bounded, we will assume that its domain is H1 and generally omit it from the notation. The set of bounded operators between H1 and H2 is denoted B(H1, H2). A compact operator is an operator that sends bounded sets to pre-compact sets, and is in particular necessarily bounded. We denote the space of compact operators between H1 and H2 by K(H1, H2). An example of a smoothing operator is a smoothing operator. Indeed, for smoothing operators between sections of E and sections of F, the image of a bounded subset of L2(M; E) is bounded in C∞(M; F ) and hence by Arzela-Ascoli compact in L2(M; F ). Lemma 30. Let M be a Riemannian manifold and E,F Hermitian vector bun- dles. If R ∈ Ψ−∞(M; E,F ), then (R, C∞(M; E)) is a compact operator between L2(M; E) −→ L2(M; F ). Its unique extension to L2(M; E) is the restriction of R as an operator on C−∞(M; E). Since the identity map is trivially bounded, we can use this lemma to see that −∞ any operator of the form IdE +R with R ∈ Ψ (M; E) defines a bounded operator on L2(M; E). There is a clever way of using this to show that any pseudodifferential operator of order zero defines a bounded operator on L2. Proposition 31. If r ≤ 0 and P ∈ Ψr(M; E,F ) then (P, C∞(M; E)) defines a bounded linear operator between L2(M,E) and L2(M,F ). Proof. We follow H¨ormanderand reduce to the smoothing case as follows. Assume we can find Q ∈ Ψ0(M,E), C > 0, and R ∈ Ψ−∞(M,E) such that P ∗P + Q∗Q = C Id +R. Then we have 2 ∗ kP ukL2(M,F ) = hP u, P uiF = hP P u, uiE ∗ ∗ 2 ≤ hP P u, ui+hQ Qu, ui = Chu, ui+hRu, ui ≤ (C+kRkL2(M;E)→L2(M;E))kukL2(M;E), so the proposition is proved once we construct Q, C, and R. 124 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS We proceed iteratively. Let σ0(P ) denote the symbol of degree zero of P, e.g., ∗ σ0(P ) = 0 if r < 0. Choose C > 0 large enough so that C − σ0(P ) σ0(P ) is, at each ∗ 0 point of S M, a positive element of hom(E), then choose Q0 so that its symbol is a ∗ 1 0 0 ∗ positive square root of C − σ0(P )σ0(P ), and let Q0 = 2 (Q0 + (Q0) ). Note that Q0 0 is a formally self-adjoint elliptic operator with the same symbol as Q0. Suppose we have found formally self-adjoint operators Q0,...,QN satisfying −i Qi ∈ Ψ (M; E) and N !2 ∗ X −1−N P P + Qi = C Id −RN ,RN ∈ Ψ (M; E). i=0 0 −N−1 Then, if QN+1 ∈ Ψ (M,E), we have N !2 ∗ X 0 P P + Qi + QN+1 − C Id i=0 N ! N ! 0 X X 0 = −RN + QN+1 Qi + Qi QN+1, i=0 i=0 −2−N 0 so in order for this to be in Ψ (M; E), QN+1 must satisfy 0 0 σ(QN+1Q0 + Q0QN+1) = σ(RN ). We can always solve this equation because σ(Q0) is self-adjoint and positive as shown in the following lemma. Lemma 32. Let A be a positive self-adjoint k × k matrix, and let ΦA(B) = AB + BA, then ΦA is a bijection from the space of self-adjoint matrices to itself. Proof. We can find a matrix S such that S∗ = S−1 and S∗AS = D is diagonal. ∗ −1 Define Ψ(B) = S BS and note that Ψ ΦAΨ = ΦΨ(A) so we may replace A with D = diag(λ1, . . . , λk). Let Est denote the matrix with entries ( 1 if s = i, t = j (Est)ij = 0 otherwise and note that ΦD(Est + Ets) = (λs + λt)(Est + Ets). Hence the matrices {Est + Ets} with s ≤ t form an eigenbasis of ΦD on the space of self-adjoint k × k matrices. Since, by assumption, each λs > 0 it follows that ΦD is bijective. 5.4. PSEUDODIFFERENTIAL OPERATORS ACTING ON L2 125 0 So using the lemma we can find σ(QN+1) among self-adjoint matrices and then 1 0 0 ∗ define QN+1 = 2 (QN+1 + (QN+1) ), finishing the iterative step. Finally, we define Q by asymptotically summing the Qi and the result follows. Corollary 33. Let A ∈ Ψr(M; E,F ), r ≤ 0, and let kσ0(A)k∞ = sup |σ0(A)(ζ, ξ)|, S∗M −∞ then for any ε > 0 there is a non-negative self-adjoint operator Rε ∈ Ψ (M,E) such that, for any u ∈ L2(M,E) 2 2 2 (5.8) kAukL2(M,F ) ≤ (kσ0(A)k∞ + ε)kukL2(M,E) + hRεu, uiE. In particular, if r < 0 then A ∈ Ψr(M; E,F ) defines a compact operator (A, C∞(M; E)) between L2(M; E) and L2(M; F ). Proof. The existence of Rε satisfying the inequality (5.8) follows directly from the proof of the theorem. If kσ0(A)k∞ = 0, we show that A is a compact operator by proving that the image of a bounded sequence is Cauchy. Let (uj) be a bounded sequence in 2 L (M; E), say kujk ≤ C for all j ∈ N. For each n ∈ N we know that (R 1 uj) has a n convergent subsequence in L2(M; E) and so by a standard diagonalization argument 0 0 we can find a subsequence (uj) of (uj) such that (R 1 uj) converges for all n. n 0 2 It follows that (Auj) is a Cauchy sequence in L (M; F ). Indeed, given η > 0 choose N ∈ N such that j, k > N implies 0 0 kRη(uj − uk)kL2(M;E) ≤ η. It follows that for such j, k 0 0 2 0 0 0 0 0 0 kA(uj−uk)kL2(M;F ) ≤ ηkuj−ukkL2(M;E)+hRη(uj−uk), (uj−uk)iL2(M;E) ≤ η(2C)+ηC. 0 Hence (Auj) is Cauchy and A is compact. The graph of a linear operator (P, D(P )) is the subspace of H1 × H2 given by {(u, P u) ∈ H1 × H2 : u ∈ D(P )}. We say that a linear operator (P, D(P )) is closed if and only if its graph is a closed subspace of H1 × H2. Any bounded operator has a closed graph, and the closed graph theorem says that an closed operator (P, D(P )) with D(P ) = H1 is bounded. 126 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS Let us point out that a closed operator need not have closed domain or closed range, however the null space of a closed operator does form a closed set. It is very convenient to deal with closed operators, especially when studying the spectrum of an operator or dealing with adjoints. For instance, the spectrum of a linear operator that is not closed consists of the entire complex plane! If A ∈ Ψr(M; E,F ) then (A, C∞(M; E)) has at least two closed extensions with domains: the maximal domain comes from restricting the action of A as a continuous operator on distributions, 2 2 Dmax(A) = {u ∈ L (M; E): Au ∈ L (M; F )}, the minimal domain comes from extending the action of A as a continuous oper- ator on smooth functions, 2 Dmin(A) = {u ∈ L (M; E): ∞ there exists (un) ⊆ C (M; E) s.t. lim un = u and (Aun) converges}. In the latter case, we define Au = lim Aun and this is independent of the choice of se- quence un as long as lim un = u and Aun converges. Since the graph of (A, Dmin(A)) is the closure of the graph of (A, C∞(M; E)), it is clearly the minimal closed exten- sion of A and (A, Dmax(A)) is also clearly the maximal closed extension of A. There are operators for which Dmin(A) 6= Dmax(A) (see exercise 8). However these must have positive order, since we have shown that operators of order zero are 2 bounded (which implies that Dmin(A) = Dmax(A) = L (M; E)). Next we show that elliptic operators also have a unique closed extension. Proposition 34. Let M be a compact Riemannian manifold and E,F Hermitian vector bundles over M, and r > 0. If A ∈ Ψr(M; E,F ) is an elliptic operator then (5.9) Dmin(A) = Dmax(A). In fact the following stronger statement is true: For any Q ∈ Ψ−r(M; F,E), 2 2 (5.10) Im(Q : L (M; F ) −→ L (M; E)) ⊆ Dmin(A). Proof. Let us start by proving (5.10). Let u ∈ L2(M; F ) and v = Qu, we need to ∞ show that v ∈ Dmin(A). Choose (un) ⊆ C (M; F ), a sequence converging to u, and note that, since Q and AQ are bounded operators on L2(M; F ), we have vn = Qun → v, Avn = AQun → Av. This implies v ∈ Dmin(A) as required. 5.5. SOBOLEV SPACES 127 Next, to prove (5.9), let B ∈ Ψ−r(M; F,E) be a parametrix for A and w ∈ Dmax(A). Then w = BAw + Rw, R ∈ Ψ−∞(M; E), shows that w is a sum of an section in the image of B and a smooth section, and by (5.10) both of these are in Dmin(A). 5.5 Sobolev spaces The second part of Proposition 34 strongly suggests that the L2-domain of an elliptic pseudodifferential operator of order r is independent of the actual operator involved. This is true, and the common domain is known as the Sobolev space of order r. We define these spaces, for any s ∈ R, by Hs(M; E) = {u ∈ C−∞(M; E): Au ∈ L2(M; E) for all A ∈ Ψs(M; E)}. We can think of Hs as the sections ‘whose first s weak derivatives are in L2’, and if s ∈ N this is an accurate description. Notice that s > 0 =⇒ Hs(M; E) ⊆ L2(M; E) ⊆ H−s(M; E). It will be useful to have at our disposal a family of operators s R 3 s 7→ Λs ∈ Ψ (M; E), (5.11) ∞ ∗ ∗ σs(Λs) = idE ∈ C (S M; hom(π E)) that are elliptic, (formally) self-adjoint, strictly positive, and moreover satisfy −1 ∞ Λ−s = Λs on C (M; E). These are easy to construct. First define Λ0 = IdE . Next for s > 0, start with any s/2 operator A ∈ Ψ (M; E) with σs/2(A) = idE and then define ∗ Λs = A A + IdE . Finally, for s < 0, take Λ−s to be the generalized inverse from section 5.3 and note −1 that since Λs is (formally) self-adjoint and injective, Λ−s = Λs . (When there is E more than one bundle involved, we will denote Λs by Λs .) Using these operators, let us define an inner product on Hs(M; E) by (5.12) Hs(M; E) × Hs(M; E) / C (u, v) / hu, viHs(M;E) = hΛsu, ΛsviL2(M;E) 128 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS and the corresponding norm 2 kukHs(M;E) = hu, uiHs(M;E). It is easy to see that this inner product gives Hs(M; E) the structure of a Hilbert space. Theorem 35. Let M be a compact Riemannian manifold and E,F Hermitian vector bundles over M. A pseudodifferential operator A ∈ Ψr(M; E,F ) defines bounded operators s s−r A : H (M; E) −→ H (M; F ) for any s ∈ R. For a distribution u ∈ C−∞(M; E), the following are equivalent: i) Au ∈ L2(M; E) for all A ∈ Ψr(M; E), i.e., u ∈ Hr(M; E) ii) There exists P ∈ Ψr(M; E,F ) elliptic such that P u ∈ L2(M; F ). ∞ iii) u is in the closure of C (M; E) with respect to the norm k·kHr(M;E). Moreover, every elliptic operator P ∈ Ψr(M; E,F ) defines a norm on Hs(M; E), kukP,s = kP ukHs−r(M;F ) + kPker P ukHs(M;E) which is equivalent to the norm k·kHs(M;E). In particular, the Hilbert space structure on Hs(M; E) is independent of the choice of family Λs as another choice of family yields equivalent norms. Proof. Showing that the map A : C∞(M; E) ⊆ Hs(M; E) −→ Hs−r(M; F ) is bounded is equivalent to showing that the map F E ∞ 2 2 Λs−r ◦ A ◦ Λ−s : C (M; E) ⊆ L (M; E) −→ L (M; F ) is bounded. But this is an operator of order zero, and so is bounded by Proposition 31. It follows that, for any A ∈ Ψr(M; E,F ) and any s > 0 there is a constant C > 0 such that kAukHs−r(M;F ) ≤ CkukHs(M;E). If A = P is elliptic, and G is the generalized inverse of P, then since it has order −r we have kGvkHs−r(M;E) ≤ kvkHr(M;F ). In particular, we have kP ukHs−r(M;F ) ≥ CkGP ukHs(M;E) = ku − Pker P ukHs(M;E) 5.5. SOBOLEV SPACES 129 hence kukHs(M;E) ≤ C(kP ukHs−r(M;F ) + kPker P ukHs−r(M;E)), which proves that kukP,s is equivalent to kukHs(M;E), since ker P is a finite dimen- sional space and all norms on finite dimensional spaces are equivalent. This also establishes the equivalence of (i) and (ii). For any s ∈ R, choose s > r and note that Hs(M; E) is the maximal domain of ∞ r r Λs−r : C (M; E) ⊆ H (M; E) −→ H (M; F ) the same argument as in Proposition 34 shows that the minimal domain equals the maximal domain and this shows that C∞(M; E) is dense in Hs(M; E). In this way we can refine our understanding of regularity of sections from C∞(M; E) ⊆ L2(M; E) ⊆ C−∞(M; E) to, for any s > 0, C∞(M; E) ⊆ Hs(M; E) ⊆ L2(M; E) ⊆ H−s(M; E) ⊆ C−∞(M; E). In fact, the natural L2-pairing which exhibits C−∞(M; E) as the dual of C∞(M; E) and L2(M; E) as its own dual puts Hs(M; E) in duality with H−s(M; E). Formally, for every s ∈ R there is a non-degenerate pairing Hs(M; E) × H−s(M; E) / C (u, v) / hu, viHs×H−s = hΛ−su, ΛsviL2(M;E) and so we can identify (Hs(M; E))∗ = H−s(M; E). (Notice that this is different from the pairing (5.12) with respect to which Hs is its own dual.) Also notice that there are natural inclusions s < t =⇒ Ht(M : E) ⊆ Hs(M; E) and indeed we can isometrically embed Ht(M; E) into Hs(M; E) by the opera- tor Λs−t. Notice that since these are pseudodifferential operators of negative order, this inclusion is compact! Put another way, if we have a sequence of distributions in Hs(M; E) that is bounded in Ht(M; E), then it has a convergent sequence in Hs(M; E). This is an L2-version of the usual Arzela-Ascoli theorem on a compact set K: a sequence of functions in C0(K) that is bounded in C1(K) has a convergent sequence in C0(K). It is easy to see that for any ` ∈ N0 C`(M; E) ⊆ Hs(M; E) whenever s > ` 130 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS interestingly there is a sort of converse embedding, known as the Sobolev embedding theorem, (see exercises 9 and 10) s ` 1 H (M; E) ⊆ C (M; E) if ` ∈ N0, ` > s − 2 dim M. This lets us see that \ [ C∞(M; E) = Hs(M; E), and C−∞(M; E) = Hs(M; E) s∈R s∈R s so that the scale of Sobolev spaces {H (M; E)}s∈R is a complete refinement of the inclusion of smooth sections into distributional sections. With the machinery we have built so far, it is a simple matter to refine Corol- lary 25 to Sobolev spaces. Theorem 36. Let M be a compact Riemannian manifold, E and F Hermitian vector bundles over M. Let P ∈ Ψr(M; E,F ) be an elliptic operator of order r ∈ R. We have a refined elliptic regularity statement: −∞ t t+r u ∈ C (M; E), P u ∈ H (M; F ) =⇒ u ∈ H (M; E) for any t ∈ R. Also, for any s ∈ R,P defines a Fredholm operator either as a closed unbounded operator P : Hs+r(M; E) ⊆ Hs(M; E) −→ Hs(M; F ) or as a bounded operator P : Hs+r(M; E) −→ Hs(M; F ) and its index is independent of s. Explicitly this means that: i) The null spaces of P and P ∗ in either case are finite dimensional (in fact these spaces are themselves independent of s.) ii) We have an orthogonal decomposition Hs(M; F ) = P (Hs+r(M; E)) ⊕ ker P ∗. A consequence of this theorem is a refined version of the Hodge decomposition. Namely, for any s ∈ R, Hs(M;ΛkT ∗M) = d Hs+1(M;Λk−1T ∗M) ⊕ δ Hs+1(M;Λk+1T ∗M) ⊕ Hk(M). 5.6. EXERCISES 131 5.6 Exercises Exercise 1. Make sure you understand how to prove all parts of Theorem 21. Exercise 2. Let A ∈ Ψr(M; E,F ) be an elliptic pseudodifferential operator. Show that if −r −∞ BL,BR ∈ Ψ (M; F,E) satisfy BLA − IdE ∈ Ψ (M; E) and ABR − IdF ∈ −∞ −∞ Ψ (M; F ) then BL − BR ∈ Ψ (M; F,E). Exercise 3. Show that any linear operator A : Rm −→ Rn is Fredholm and compute its index. Exercise 4. (Schur’s Lemma) Suppose (X, µ) is a measure space and K is a measurable complex- valued function on X × X satisfying Z Z |K(x, y)| dµ(x) ≤ C1, |K(x, y)| dµ(y) ≤ C2, X X for all y and for all x respectively. Show that the prescription Z Ku(x) = K(x, y)u(y) dµ(y) X defines a bounded operator on Lp(X, µ) for each p ∈ (1, ∞) with operator norm 1/p 1/q 1 1 p p kKkL →L ≤ C1 C2 , where q satisfies p + q = 1. Hint:It suffices to bound |hKu, vi| for u and v with kukLp ≤ 1, kvkLq ≤ 1. Start with the expression Z Z |hKu, vi| = K(x, y)u(y)v(x) dµ(y)dµ(x) X X and apply the inequality ap bq ab ≤ p + q with a = tu and b = t−1v where t is a positive real number to be determined. Minimize over t to get the desired bound. Exercise 5. Show that the linear operator T : C∞([0, 1]) ⊆ L2([0, 1]) −→ L2([0, 1]) given by T u = ∂xu is not bounded and not closed. 132 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS Exercise 6. 2 Let H1 = L (R), H2 = R and consider the operator Z Au = u(x) dx R ∞ from H1 to H2 with domain D(A) = Cc (R). Prove that the closure of the graph of A is not the graph of a linear operator. Hint: For each n ∈ N, let ( 1 n if |x| ≤ n un(x) = 0 if |x| > n 2 Show that un → 0 in L (R) and Aun = 2 for all n. Why does this solve the exercise? Exercise 7. Let A be a linear operator A : D(A) ⊆ H −→ H. Given λ ∈ C, show that A − λ Id is closed if and only if A is closed. Exercise 8. Show that the operator A = ∂x sin(x)∂x on the circle is symmetric, and that Dmin(A) 6= Dmax(A). Exercise 9. One can also define Sobolev spaces on Rm where there is a natural choice for the operators Λs in (5.11), namely −1 2 s/2 Λsf(x) = F (1 + |ξ| ) F(f) s m ∞ m and then as before H (R ) is the closure of Cc (R ) with respect to the norm kfks = kΛsfkL2(Rm). 1 Let s ∈ R and k ∈ N0 such that k < s − 2 m show that there is a constant C such that α sup sup |D f| ≤ Ckfks. |α|≤k ζ∈Rm m Hint: First note that kφkL∞(Rm) ≤ kF(φ)kL1(Rm) for any φ ∈ S(R ). Next note that α α 2 |α|/2 kF(D f)kL1(Rm) = kξ F(f)kL1(Rm) ≤ k(1 + |ξ| ) F(f)kL1(Rm) Finally, write (1 + |ξ|2)k/2 = (1 + |ξ|2)s/2(1 + |ξ|2)(k−s)/2 and use Cauchy-Schwarz. Exercise 10. Prove the Sobolev embedding theorem for integer degree Sobolev spaces: If s ∈ N0 s ` there is a continuous embedding of H (M; E) into C (M; E) for any ` ∈ N0 satisfy- 1 ing ` < s − 2 dim M. 5.6. EXERCISES 133 Hint: Let f ∈ Hs(M; E), use the fact that multiplication by a smooth function is a pseudodifferential operator of order zero to reduce to the case where f is supported in a coordinate chart φ : U ⊆ Rm −→ M that trivializes E. Since s is a natural s number the Sobolev norm on H (M; E) is equivalent to a norm k·kD with D a dif- ferential operator, use this to show that f ◦ φ ∈ Hs(U; Crank E). Use the previous exercise to conclude that f has ` continuous derivatives. One can use a technique called ‘interpolation’ to deduce the general Sobolev embed- ding theorem from this integer degree version. Exercise 11. (The adjoint of a linear operator) Let H1 and H2 be C-Hilbert spaces and A : D(A) ⊆ H1 −→ H2 a linear operator. The (Hilbert space) adjoint of A, is a linear operator from H2 to H1 defined on ∗ D(A) = {v ∈ H2 : ∃w ∈ H1 s.t. hAu, viH2 = hu, wiH1 for all u ∈ D(A)} by the prescription hAu, vi = hu, A∗vi for all u ∈ D(A), v ∈ (D(A))∗. (Note that we always assume our linear operators have dense domains, so A∗v is determined by this relation.) If H1 = H2, a linear operator (A, D(A)) is self-adjoint if D(A)∗ = D(A) and hAu, viH1 = hu, AviH1 for all u, v ∈ D(A). ∗ ∗ i) Show that the graph of (A , D(A) ) is the orthogonal complement in H2 × H1 of the set {(−Au, u) ∈ H2 × H1 : u ∈ D(A)} so in particular (A∗, D(A)∗) is always a closed operator (even if A is not a closed operator). ∗ ii) If (A, D(A)) is a closed operator, show that (D(A)) is a dense subspace of H2 (e.g., by showing that z ⊥ D(A)∗ =⇒ z = 0) so we can define the adjoint of (A∗, D(A)∗). Prove that ((A∗)∗, (D(A)∗)∗) = (A, D(A)). 2 2 ii) Suppose H1 = L (M; E), H2 = L (M; F ) where M is a Riemannian manifold and E,F are Hermitian vector bundles, and A ∈ Ψr(M; E,F ) for some r ∈ R. Let A† denote the formal adjoint of A, show that the Hilbert space adjoint of the linear operator A : C∞(M; E) ⊆ L2(M; E) −→ L2(M; F ) is the linear operator † † 2 2 A : Dmax(A ) ⊆ L (M; F ) −→ L (M; E). 134 LECTURE 5. PSEUDODIFFERENTIAL OPERATORS ON MANIFOLDS Hint: Recall the definition of the action of A† on distributions. 2 r iii) Let H1 = H2 = L (M; E). Show that if A ∈ Ψ (M; E) is elliptic and formally self-adjoint, then the unique closed extension of A as an operator on L2(M; E) is self-adjoint. Exercise 12. Let H1 and H2 be C-Hilbert spaces and A : D(A) ⊆ H1 −→ H2 a linear operator. Recall that A is Fredholm if: i) ker A is a finite dimensional subspace of H1 ∗ ii) ker A is a finite dimensional subspace of H2 ∗ iii) H2 has an orthogonal decomposition H2 = Im(A)⊕ker A (i.e., Im(A) is closed). Prove that an operator is Fredholm if and only if it has an inverse modulo compact operators. That is, there exists B : D(B) ⊆ H2 −→ H1 such that AB − IdH2 and BA − IdH1 are compact operators. Hint: One can follow the arguments above very closely, replacing smoothing opera- tors by compact operators. 5.7 Bibliography Same as Lecture 4. Lecture 6 Spectrum of the Laplacian 6.1 Spectrum and resolvent Let H be a complex Hilbert space and A : D(A) ⊆ H −→ H a linear operator. We divide the complex numbers into the resolvent set of A ρ(A) = {λ ∈ C : A − λ IdH has a bounded inverse} and its complement, the spectrum of A, Spec(A) = C \ ρ(A). The resolvent of A is the family of bounded linear operators ρ(A) / B(H) −1 λ / R(λ) = (A − λ IdH) This is, in a natural sense, a holomorphic family of bounded operators1. Notice that the graph of the resolvent satisfies {(u, R(λ)u): u ∈ H} = {((A − λ IdH)v, v): v ∈ H} and hence λ ∈ ρ(A) implies that the operator A − λ IdH is closed, and so by exercise 7 of the previous lecture, that A is closed. In other words if A is not closed then Spec(A) = C. 1See for instance, Rudin Functional Analysis, for the meaning of a holomorphic vector-valued function. 135 136 LECTURE 6. SPECTRUM OF THE LAPLACIAN Since a complex number λ is in the spectrum of A if A − λ IdH fails to have a bounded inverse, there is a natural division of Spec(A) by taking into account why there is no bounded inverse: the point spectrum or eigenvalues of A consists of Specpt(A) = {λ ∈ C : A − λ IdH is not injective }, the continuous spectrum of A consists of Specc(A) = {λ ∈ C : A−λ IdH is injective and has dense range, but is not surjective}, everything else is called the residual spectrum, Specres(A) = {λ ∈ C : A − λ IdH is injective but its range is not dense}. If λ ∈ Specpt(A) then ker(A − λ IdH) is known as the λ-eigenspace and will be denoted Eλ(A), its elements are called eigenvectors. Recall that the adjoint of a bounded operator A ∈ B(H) is the unique operator A∗ ∈ B(H) satisfying ∗ hAu, viH = hu, A viH for all u, v ∈ H. Just as on finite dimensional spaces, for any operator eigenvectors of different eigen- values are always linearly independent and if the operator is self-adjoint then they are also orthogonal (and the eigenvalues are real numbers). Let us recall some other elementary functional analytic facts. Proposition 37. Let H be a complex Hilbert space and T ∈ B(H). If T = T ∗ then kT kH→H = sup |hT u, uiH| = max |λ|. kuk=1 λ∈Spec(T ) Proof. Let us abbreviate |T | = supkuk=1 |hT u, uiH|. Cauchy-Schwarz shows that |T | ≤ kT kH→H. To prove the opposite inequality, let u, v ∈ H have length one and assume that hT u, viH ∈ R (otherwise we just multiply v by a unit complex number to make this real). We have 4hT u, viH = hT (u + v), (u + v)iH − hT (u − v), (u − v)iH 2 2 2 2 ≤ |T |(ku + vkH + ku − vkH) = 2|T |(kukH + kvkH) = 4|T | and hence kT kH→H = sup kT ukH = sup sup hT u, viH ≤ |T |. kukH=1 kukH=1 kvkH=1 6.1. SPECTRUM AND RESOLVENT 137 Next, replacing T by −T if necessary, we can assume that sup hT u, ui > kukH=1 H 0. Choose a sequence of unit vectors (un) such that limhT un, uni = |T | and note that 2 2 2 2 k(T − |T | Id)unkH = kT unkH + |T | − 2|T |hT un, uni ≤ 2|T | − 2|T |hT un, uni → 0 implies that (T − |T | Id) is not invertible, so |T | ∈ Spec(T ). Finally, we need to show that if λ > kT kH→H then λ ∈ ρ(T ), i.e., that (T − 1 λ IdH) is invertible. It suffices to note that A = λ T has operator norm less than one and hence the series X (−1)kAk k≥0 converges in B(H) to an inverse of IdH −A. With those preliminaries recalled, we can now establish the spectral theorem for compact operators. Theorem 38. Let H be a complex Hilbert space and K ∈ K(H) a compact operator. The spectrum of K satisfies Spec(K) ⊆ Specpt(K) ∪ {0} with equality if H is infinite dimensional, in which case zero may be an eigenvalue or an element of the continuous spectrum. Each non-zero eigenspace of K is fi- nite dimensional and the point spectrum is either finite or consists of a sequence converging to 0 ∈ C. If K is also self-adjoint there is a complete orthonormal eigenbasis of H and we can expand K in the convergent series X K = λPλ λ∈Spec(K) where Pλ is the orthogonal projection onto Eλ(K). Proof. First notice that if a compact operator on a space V is invertible, then the unit ball of V is compact and hence V is finite dimensional. Thus if H is infinite dimensional, then zero must be in the spectrum of K. 1 1 Let λ 6= 0 then K − λ IdH = −λ(IdH − λ K) and we know that IdH − λ K is a Fredholm operator of index zero (since we can connect it to IdH along Fredholm 138 LECTURE 6. SPECTRUM OF THE LAPLACIAN operators). Thus its null space is finite dimensional and if it is injective it is surjective so by the closed graph theorem has a bounded inverse. This shows that Spec(K)\{0} consists of eigenvalues with finite dimensional eigenspaces. Notice that, for any ε > 0 the operator K restricted to [ span Eλ(K) λ∈Spec(K) |λ|>ε is both compact and invertible, hence this space must be finite dimensional. Thus if the spectrum is not finite, the eigenvalues form a sequence converging to zero. Finally, if K is self-adjoint, let H0 be the orthogonal complement of [ span Eλ(K) . λ∈Specpt(K) 0 0 Notice that K preserves H , since if v ∈ Eλ(K) and w ∈ H we have hv, KwiH = hKv, wiH = λhv, wi = 0. Therefore K H0 is a compact self-adjoint operator with Spec(K H0 ) = {0} and so must be the zero operator. But this means that H0 is both in the null space of K and orthogonal to the null space of K, so H0 = {0}. Similarly, notice that X K − λPλ λ∈Spec(K) |λ|>ε is a self-adjoint operator with spectrum contained in [−ε, ε] ⊆ R. It follows that its operator norm is at most ε which establishes the convergence of the series to K. This immediately yields the spectral theorem for elliptic operators. Theorem 39. Let M be a closed manifold, E a Hermitian vector bundle over M, and P ∈ Ψr(M; E) an elliptic operator of positive order r ∈ R. If ρ(P ) 6= ∅ then Spec(P ) consists of a sequence of eigenvalues diverging to infinity, with each eigenspace a finite dimensional space of smooth sections of E. If P is self-adjoint then ρ(P ) 6= ∅, the eigenvalues are real, the eigenspaces are orthogonal, and there is an orthonormal eigenbasis of L2(M; E). 6.2. THE HEAT KERNEL 139 −1 −r Proof. If ζ ∈ ρ(P ) then (P − ζ IdE) ∈ Ψ (M; E) is a compact operator on L2(M; E) and we can apply the spectral theorem for compact operators. So it suffices to note that −1 −1 P v = λv ⇐⇒ (P − ζ IdE) v = (λ − ζ) v. If P is self-adjoint then Spec(P ) ⊆ R, so ρ(P ) 6= ∅. This is the case in particular for the Laplacian of a Riemannian metric. A classical problem is to understand how much of the geometry can be recovered from the spectrum of its Laplacian. This was popularized by an article of Mark Kac entitled ‘Can you hear the shape of a drum?’. 6.2 The heat kernel The fundamental solution of the heat equation ( (∂t + ∆)u = 0 u t=0 = u0 is a surprisingly useful tool in geometric analysis. In particular, it will help us to show that the spectrum of the Laplacian contains lots of geometric information. We will discuss an approach to the heat equation known as the Hadamard parametrix construction, following Richard Melrose. For simplicity we will work with the scalar Laplacian, but the construction extends to any Laplace-type operator with only notational changes. On any compact Riemannian manifold, the heat operator −t∆ ∞ ∞ + e : C (M) −→ C (R × M) is the unique linear map satisfying ( −t∆ (∂t + ∆)e u0 = 0 −t∆ e u0 t=0 = u0 Its integral kernel H(t, x, y) is known as the heat kernel and it is a priori a dis- tribution on R+ × M × M, but we will soon show that it is a smooth function for t > 0 and understand its regularity as t → 0. Once we know H(t, x, y) then we can 140 LECTURE 6. SPECTRUM OF THE LAPLACIAN solve the heat equation, even with an inhomogeneous term, since ( (∂t + ∆)v = f(t, x) =⇒ v t=0 = v0(x) Z Z t Z v(t, x) = H(t, x, y)v0(y) dvolg(y) + H(t − s, x, y)f(s, y) dvolg(y)ds. M 0 M It is straightforward to write e−t∆ as an operator on L2(M) by using the 2 spectral data of ∆. Indeed, if {φj} is an orthonormal basis of L (M), with ∆φj = λjφj, then we have X −tλj H(t, x, y) = e φj(x)φj(y) for all t > 0. j However it is hard to extract geometric information from this presentation. On Euclidean space, the heat kernel is well-known (see exercise 7) 1 |x − y|2 H m (t, x, y) = exp − . R (4πt)m/2 4t Notice that HRm is smooth in t, x, y as long as t > 0. As t → 0,HRm vanishes exponentially fast away from diagRm = {x = y} and blows-up as we approach {0} × diagRm . m/2 Notice that t HRm is still singular as we approach this submanifold since the argument of the exponential as we approach a point (0, x, x) depends not just on x, but also on the way we approach. For instance, for any z ∈ Rm, the limit of tm/2H(t, x, y) as t → 0 along the curve (t, x + t1/2z, x) is |z| exp − . 4 To understand the singularity at {0} × diagRm we need to take into account the different ways of approaching this submanifold. We do this by introducing ‘parabolic’ polar coordinates (parabolic because the t-direction should scale differ- ently from the x − y direction). Define + m 2 SP = {(θ, ω) ∈ R × R : θ + |ω| = 1} and notice that any point (t, x, y) ∈ R+ × Rm × Rm can be written t = r2θ, x = x, y = x − rω, with r = pt + |x − y|2 . 6.2. THE HEAT KERNEL 141 Indeed, (r, θ, ω, x) are the parabolic polar coordinates for R+ × Rm × Rm. In these m/2 coordinates, we can write t HRm as |ω|2 exp − 4θ and, since θ and ω never vanish simultaneously, this is now a smooth function! We say that these coordinates have resolved the singularity of the heat kernel. Taking a closer look at the parabolic polar coordinates, notice that these are coordinates only when r 6= 0. Each point in the diagonal of Rm at t = 0 corresponds to r = 0 and to a whole SP . A better way of thinking about this is that we have constructed a new manifold with corners (the heat space of Rm) m + m HR = R × SP × R with a natural map β HRm / R+ × Rm × Rm (r, θ, ω, x) / (r2θ, x, y − rω) which restricts to a diffeomorphism m + m m HR \{r = 0} −→ (R × R × R ) \ ({0} × diagM ). We refer to HRm (together with the map β) as the parabolic blow-up of R+×Rm×Rm along {0} × diagM . Now let (M, g) be a Riemannian manifold. As a first approximation to the heat kernel of ∆g consider 1 d(x, y)2 G(t, x, y) = exp − . (4πt)m/2 4t Analogously to the discussion above, this is a smooth function on R+ ×M 2 except at {0} × diagM , and we will resolve this singularity by means of a ‘parabolic blow-up’. What should replace the diagonal at time zero? Consider the space R+ × TM with its natural map down to M, + π : R × TM −→ TM −→ M. At each point p ∈ M define + 2 SP (p) = {(θ, ω) ∈ R × TpM : t + |ω|g = 1} and then [ SP (M) = SP (p). p∈M 142 LECTURE 6. SPECTRUM OF THE LAPLACIAN The map π restricts to a map π : SP (M) −→ M which is a locally trivial fibration with fibers SP . Let us define the heat space of M as the disjoint union + 2 G HM = (R × M \{0} × diagM ) SP M, 2 endowed with a smooth structure as a manifold with corners where we attach SP M + 2 to R ×M \{0}×diagM by the exponential map of g. There is a natural ‘blow-down map’ + 2 β : HM −→ R × M + 2 which is the identity on R × M \{0} × diagM and is equal to π on SP M. This means that given a normal coordinate system for R+ × M 2 at (0, p, p), + + 2 U ⊆ R × TpM × TpM −→ R × M we get a coordinate system around β−1(0, p, p) by introducing parabolic polar coor- dinates as before, but now on the fibers of TpM. Thus, starting with a coordinate system induced by expp : TpM −→ M, say ζ = (ζ1, ζ2, . . . , ζm), we get a coordinate system on HM, + m r, (θ, ω), ζ ∈ R × SP × R with the identification of r = 0 with SP (expp(ζ)) and the identification of points with r 6= 0 with 2 + 2 (r θ, expp(rω + ζ), expp(ζ)) ∈ (R × M \{0} × diagM ). 2 Recall that the distance between expp(rω + ζ) and expp(ζ) is given by |rω|g(p) + 4 O(|rω|g(p)) so that our approximation to the heat kernel in these coordinates is ! 1 |rω|2 + O(|rω|4 ) exp − g(p) g(p) (4πr2θ)m/2 4r2θ !! 1 |ω|2 r2|ω|4 = exp − g(p) + O g(p) (4πr2θ)m/2 4θ θ and this is r−m times a smooth function, so we have again resolved the singularity. This computation shows that G approximates the Euclidean heat kernel to ∞ leading order as t → 0. That is enough to see that, for any u0 ∈ C (M), Z G(t, x, y)χ(dg(x, y))u0(y) dvolg(y) → u0(x) M 2For a careful proof, see Proposition 7.7 in Richard B. Melrose, The Atiyah-Patodi-Singer index theorem. Research Notes in Mathematics, 4. A K Peters, Ltd., Wellesley, MA, 1993. 6.2. THE HEAT KERNEL 143 where χ(dg(x, y)) is a cut-off function near the diagonal . In fact, we will proceed much as in the construction of an elliptic parametrix and find a sequence of functions Aj such that the operator ∗ X j/2 He(r, (θ, ω), x) ∼ β (χ(dg(x, y))G(t, x, y)) t Aj(r, (θ, ω), x), j≥0 ∗ ∞ satisfies β (∂t +∆)He(r, (θ, ω), x) = O(r ) as r → 0. Here we are using the fact that, since β is the identity away from the blown-up face, it makes sense to pull-back the differential operator ∂t + ∆; in a moment, we will compute this in local coordinates. ∗ Let us denote β (χ(dg(x, y))G(t, x, y)) by Ge(r, (θ, ω), x). We will assume in- ductively that we have found A0,A1,...,AN such that N ! ∗ X j −n+N+1 β (∂t + ∆) Ge(r, (θ, ω), x) r Aj(r, (θ, ω), x) = r EN (r, (θ, ω), x) j≥0 with EN a smooth function on the blown-up space, vanishing to infinite order at t = 0 away from the diagonal, and then we need to find AN+1(r, (θ, ω), x) such that −n+N+1 ∗ (N+1) r EN (r, (θ, ω), x) + β (∂t + ∆) r GAe N+1 vanishes to order −n + N + 2 at the blown-up boundary face {r = 0}, i.e., h n−N−1 ∗ (N+1) i r β (∂t + ∆) r GAe N+1 = −EN (0, (θ, ω), x). r=0 To carry out this construction, it is convenient to use a different set of coordi- nates, ‘projective coordinates’ √ ζ τ = t ,Y = √ , x t in which we can compute ∗ 1 1 X ∗ β ((t∂t)f(t, x, y)) = 2 τ∂τ − 2 Yj∂Yj β (f(t, x, y)) ∗ 1 ∗ β (∂zj f(t, x, y)) = τ (∂Yj + τVj)β (f(t, x, y)) for some vector fields Vj on HM. In these coordinates, 1 |Y | β∗G(t, x, y) = exp − + O(τ 2|Y |2) (4π)m/2τ m 4 144 LECTURE 6. SPECTRUM OF THE LAPLACIAN so altogether we have to solve N + 1 − m 2 ∆ − 1 Y · ∂ + e−|Y | /4A = −E (0, Y, x). Y 2 Y 2 N+1 N and taking the Fourier transform in Y, this becomes 2 1 ρ=|η| 2 (|ξ| + 2 ξ · ∂ξ + C)(f(ξ)) = h(ξ) −−−−→ (ρ∂ρ + 2ρ + C)(f(ξ)) = h(ξ) and one can check that this has a unique solution which decays rapidly as ρ → ∞ ξ and depends smoothly on ξb = |ξ| . Taking the inverse Fourier transform we get the required AN+1. Having inductively constructed all of the Aj, now let He(r, (θ, ω), x) be a func- tion on HM such that rmHe(r, (θ, ω), x) is smooth on HM, vanishes to infinite order at t = 0 away from the diagonal, and at the blown-up boundary face has Taylor P j expansion G(r, (θ, ω), x) r Aj. (The existence of such a function follows from the classical Borel’s theorem, much like asymptotic completeness of symbols above.) This function He is our parametrix for the heat kernel. It solves the heat equation to infinite order as r → 0, in that ∗ \ n ∞ \ n ∞ + 2 S = β (∂t + ∆)He ∈ r C (HM) = t C (R × M ). n∈N n∈N Next we use it to find the actual solution to the heat equation. Notice that we can think of a function F ∈ C∞(R+ × M 2) as defining an operator F C∞(M) / C∞(R+ × M) R u / M F (t, x, y)u(y) dy but we can also think of it as defining an operator Op(F ) C∞(R+ × M) / C∞(R+ × M) R t R u / 0 M F (s, x, y)u(t − s, y) dyds and the advantage is that then it makes sense to compose operators. The composi- tion of the operators corresponding to the functions F1 and F2 is easily seen to be an operator of the same form, but now associated to the function Z t Z F1 ∗ F2(t, x, y) = F1(s, x, z)F2(t − s, z, y) dzds. 0 M 6.2. THE HEAT KERNEL 145 With respect to this product the identity is associated to the distribution δ(t)δ(x−y) and so we can recognize the heat equation for the heat kernel ( (∂t + ∆)H(t, x, y) = 0 if t > 0 H(0, x, y) = δ(x − y) as the equation Op((∂t + ∆)H(t, x, y)) = Id . Our parametrix satisfies ∗ Op(β (∂t + ∆)He) = Id + Op(S) so we need to invert Id + Op(S). This can be done explicitly by a series ! X (Id + Op(S))−1 = Op (−1)jS ∗ S ∗ ... ∗ S . j≥0 One can check that this series converges to Id + Op(Q) for some \ n ∞ + 2 Q ∈ t C (R × M ), n∈N and hence β∗H = He + (He ∗ β∗Q). In particular we see that \ β∗H − He ∈ rnC∞(HM). n∈N Theorem 40. Let (M, g) be a compact Riemannian manifold, HM its heat space with blow-down map + 2 β : HM −→ R × M The pull-back of H(t, x, y), the heat kernel of ∆g, satisfies β∗H ∈ r−mC∞(HM). The trace of the heat kernel has an asymptotic expansion as t → 0, Z −t∆ −m/2 X k/2 Tr e = H(t, x, x) dvolg ∼ t t ak M k≥0 −m/2 with a0 = (4π) Vol(M) and each of the ak are integrals over M of universal polynomials in the curvature of g and its covariant derivatives, Z ak = Uk(x) dvolg . M 146 LECTURE 6. SPECTRUM OF THE LAPLACIAN The coefficients ak are known as the local heat invariants of (M, g). One can use a symmetry argument to show that all of the ak with k odd are identically zero, but the other ak contain important geometric information. Proof. The only thing left to prove is the statement about the coefficients ak. Its clear from the construction above that the coefficients Aj are built up in a universal fashion from the expansion of the symbol of ∆. But as this symbol is precisely the Riemannian metric, and we know that its Taylor expansion at a point in normal coordinates has coefficients given by the curvature and its covariant derivatives, the result follows with Uk(x) = Ak(0, (1, 0), x). 6.3 Weyl’s law The following lemma is an example of a ‘Tauberian theorem’. Lemma 41 (Karamata). If µ is a positive measure on [0, ∞), α ∈ [0, ∞), then Z ∞ e−tλ dµ(λ) ∼ at−α as t → 0 0 implies Z ` a dµ(λ) ∼ `α as ` → ∞. 0 Γ(α + 1) Proof. We start by pointing out that Z Z α −tλ −λ t e dµ(λ) = e dµt(λ) α −1 with µt(λ)(A) = t µ(t A), so we can rewrite the hypothesis as Z Z ∞ −λ a α−1 −λ lim e dµt(λ) = a = λ e dλ t→0 Γ(α) 0 and the conclusion as Z Z ∞ a α−1 lim χ(λ) dµt(λ) = λ χ(λ) dλ t→0 Γ(α) 0 with χ(λ) equal to the indicator function for [0, `]. But this last equality is true for sλ + ∞ + χ(λ) = e for any s ∈ R , hence by density for any χ ∈ C0 (R ) and hence for Borel measurable functions, in particular for the indicator function of [0, `]. 6.3. WEYL’S LAW 147 P If we apply this to the measure dµ(λ) = δ(λ − λi), we can conclude. Theorem 42 (Weyl’s law). Let (M, g) be a compact Riemannian manifold and let N(Λ) be the ‘counting function of the spectrum’ of the Laplacian, X N(Λ) = dim Eλ(∆g). λ∈Spec(∆) λ<Λ As Λ → ∞ we have Vol(M) m/2 N(Λ) ∼ m/2 m Λ . (4π) Γ( 2 + 1) Hence the rate of growth of the eigenvalues of the Laplacian has geometric information: the volume of M and the dimension of M. We also get geometric information from the local heat invariants. In fact it was pointed out by Melrose (for planar domains) and eventually shown in general by Gilkey, that they control the Sobolev norms of the curvature. Theorem 43 (Gilkey3). For k ≥ 3, Z Z k k−2 2 k−2 a2k = (−1) Ak|∇ R| + Bk|∇ scalg| dvolg + Ek dvolg M M where Ak,Bk are positive constants and Ek is a universal polynomial in the curvature of g and its first k − 3 covariant derivatives. The explicit formulæ for the heat invariants quickly get out of hand. The first few are given by Z m/2 m/2 1 (4π) a0 = Vol(M), (4π) a2 = scal dvolg 6 M Z m/2 1 2 2 2 (4π) a4 = 5scal − 2| Ric | + 2|R| dvolg 360 M Z m/2 1 2 2 2 2 2 (4π) a6 = (−142|∇scal| − 26|∇ Ric | − 7|∇R| + 35scal − 42scal| Ric | 45360 M 2 +42scal|R| − 26 Ricij Ricjk Ricki −20 Ricij Rick` Rikj` − 8 Ricij Rik`nRjkln −24Rijk`RijnpRk`np) dvolg 3For a proof see Gengqiang Zhou, Compactness of isospectral compact manifolds with bounded curvatures. Pacific J. Math. 181 (1997), no. 1, 187200. 148 LECTURE 6. SPECTRUM OF THE LAPLACIAN There is a remarkable formula of Polterovich for the kth heat invariant4, 3n m −m/2 n X 3n + 2 1 j+n 2j a2n(x) = (4π) (−1) ∆ d(x, y) j + m 4jj!(j + n)! y=x j=0 2 however it is difficult to extract from here an explicit polynomial in the curvature. 6.4 Long-time behavior of the heat kernel We know that the heat operator e−t∆ is, for every t > 0, a bounded self-adjoint operator on L2(M). Hence its operator norm is equal to the modulus of its largest eigenvalue. In the same way N −t∆ X −tλj −tλN+1 2 2 ke − e Pλj kL (M)→L (M) = e . j=0 Let us denote N −t∆N −t∆ X −tλj e = e − e Pλj j=0 and point out that this is the heat operator of the Laplacian restricted to the or- thogonal complement of span{φ0, . . . , φN }. In particular, the semigroup property applied to its integral kernel HN yields Z −(s+t)∆N −s∆N −t∆N e = e e ⇐⇒ HN (s + t, x, y) = HN (s, x, z)HN (t, z, y) dz. M Thus Z HN (t + 2, x, y) = HN (1, x, z1)HN (t, z1, z2)HN (1, z2, y) dz1dz2, M 2 and we have α β |Dx Dy HN (t + 2, x, y)| α −t∆N β −tλN+1 ≤ kDx HN (1, x, z1)kL2(M)ke kL2→L2 kDy HN (1, z2, y)kL2(M) ≤ Cα,βe . 4See Gregor Weingart, A characterization of the heat kernel coefficients. arXiv.org/abs/ math/0105144 6.5. DETERMINANT OF THE LAPLACIAN 149 Proposition 44. Let (M, g) be a compact Riemannian manifold and {φj} an or- thonormal eigenbasis of the Laplacian of g such that λj ≤ λj+1. The series N X e−tλj φ(x)φ(y) j=0 converges uniformly to the heat kernel H(t, x, y) in C∞(M). In particular, for any t > 0 X Tr(e−t∆) = e−tλj and for large t, −t∆ −tλ1 | Tr(e − Pker ∆)| ≤ e . Remark 7. One can show much more generally that the trace we defined on smooth- ing operators coincides with the functional analytic trace given by summing the eigenvalues. 6.5 Determinant of the Laplacian The asymptotic expansion of the trace of the heat kernel allows us to define the determinant of the Laplacian. This is an important invariant with applications in physics, topology, and various parts of geometric analysis. First we imitate the classical Riemann zeta function and define X ζ(s) = λ−s λ∈Spec(∆) λ6=0 where eigenvalues are repeated in the sum according to their multiplicity. Here s is a complex variable, and Weyl’s law shows that this sum converges absolutely (and 1 hence defines a holomorphic function) if Re(s) > 2 dim M. Notice that on the circle the eigenfunctions are eiθn with eigenvalue n2, so the non-zero eigenvalues are the squares of the integers, each with multiplicity two. It follows that the ζ function of the circle is X −2s ζ(s) = 2 n = 2ζRiemann(2s) n∈N essentially equal to the Riemann zeta function! Returning to general manifolds, we can use the identity 1 Z ∞ dt µ−s = tse−tµ Γ(s) 0 t 150 LECTURE 6. SPECTRUM OF THE LAPLACIAN valid whenever Re µ > 0 to write Z ∞ 1 s −t∆ dt ζ(s) = t Tr(e − Pker ∆) Γ(s) 0 t 1 whenever Re(s) > 2 dim M. Indeed, Proposition 44 shows that the integral Z ∞ s −t∆ dt t Tr(e − Pker ∆) 1 t converges as t → ∞, for any s ∈ C, and Theorem 40 shows that the integral Z 1 s −t∆ dt t Tr(e − Pker ∆) 0 t converges as t → 0, if Re s > m/2. Moreover, it is straightforward to show that ζ(s) is a holomorphic function of s on the half-plane Re s > m/2 where the integral converges. Another use of Theorem 40 shows that the integral N ! Z 1 X dt ts Tr(e−t∆) − t−m/2 tk/2a k t 0 k=0 m−N−1 converges, and defines a holomorphic function, on the half plane Re s > 2 . To incorporate the projection onto the null space, it suffices to replace ak with ( ak + dim ker ∆ if k = m eak = ak otherwise On the other hand, for Re s > m/2, we can explicitly integrate Z 1 N N X dt X ak ts t−m/2 tk/2a = e ek t s + (k − m)/2 0 k=0 k=0 Thus, for any N ∈ N, we have a function Z ∞ 1 −t∆ dt ζN (s) = Tr(e − Pker ∆) Γ(s) 1 t Z 1 N ! N ! X dt X ak + ts Tr(e−t∆ − P ) − t−m/2 tk/2a + e ker ∆ ek t s + (k − m)/2 0 k=0 k=0 m−N−1 which is meromorphic on Re s > 2 and coincides with ζ(s) for Re s > m/2. Since N was arbitrary we can conclude: 6.6. THE ATIYAH-SINGER INDEX THEOREM 151 Theorem 45. Let (M, g) be a compact Riemannian manifold of dimension m. The ζ function admits a meromorphic continuation to C, with at worst simple poles located at (m − k)/2, k ∈ N and with residue eak m−k Γ( 2 ) (k−m)/2 −t∆ where eak is the coefficient of t in the short-time asymptotics of Tr(e − Pker ∆). It follows easily that the meromorphic continuation of ζ(s) is always regular at s = 0. Notice that, if the Laplacian had only finitely many positive eigenvalues {µ1, . . . , µ`} then hX −si X Y ∂s µj = − log µj = − log µj“ = − log det ∆”. s=0 Inspired by this computation, Ray and Singer defined −∂sζ(s) det ∆ = e s=0 and, as mentioned above, this has found applications in topology, physics, and geometric analysis. 6.6 The Atiyah-Singer index theorem In this section we will sketch how one can use the constructions of this lecture to prove the Atiyah-Singer index theorem. Recall that the objective is to find a topological formula for ind P where P ∈ Ψk(M; E,F ) is an elliptic operator acting between sections of Hermitian vector bundles E and F. A deep result known as Bott periodicity reduces the general problem to finding a formula for the index of a generalized Dirac operator as treated in Lecture 3, in fact it would be enough to handle twisted signature operators. 6.6.1 Chern-Weil theory The connection with topology is through the theory of characteristic classes, which takes a vector bundle with some structure (e.g., an orientation or a complex struc- ture) and defines cohomology classes. 152 LECTURE 6. SPECTRUM OF THE LAPLACIAN Given a rank n R-vector bundle E −→ M and a connection ∇E, recall (from section 2.5) that the curvature of ∇E is a two-form valued in End(E), RE ∈ Ω2(M; End(E)). In a local trivialization of E, say by a local frame {sj}, the connection is described by a matrix of one-forms E b ∇ sa = ωasb, b b i or in terms of the Christoffel symbols, ωa = Γiadx . In this frame we have E b b b c (R )a = dωa − ωc ∧ ωa, which we can express in matrix notation as RE = dω − ω ∧ ω. Taking the exterior derivative of both sides, we find the ‘Bianchi identity’ dRE = −dω ∧ ω + ω ∧ dω = −RE ∧ ω + ω ∧ RE = [ω, RE] Now let P : Mn(R) −→ R be an invariant polynomial, i.e., a polynomial in the entries of the matrix such that −1 P (A) = P (T AT ) for all T ∈ GLn(R). Because of this invariance, it makes sense to evaluate P on sections of End(E) and in particular we get P (RE) ∈ Ωeven(M). Since differential forms of degree greater than dim M are automatically zero, it makes equally good sense to evaluate any invariant power series on RE. Lemma 46. If P is an invariant power series and RE is the curvature of the con- nection ∇E, then P (RE) is a closed differential form and its cohomology class is independent of the connection. Proof. By linearity, it suffices to prove this for P homogeneous, say of order q. For any q matrices, A1,...,Aq, define p(A1,...,Aq) to be the coefficient of t1t2 ··· tq in P (t1A1 + ... + tqAq). Thus p(A1,...,Aq) is a symmetric multilinear function satisfying p(A, . . . , A) = q!P (A) and moreover the invariance of P implies that −1 −1 p(T A1T,...,T AqT ) = p(A1,...,Aq) for all T ∈ GLn(R). For any one-parameter family of matrices T (s) in GLn(R), with T (0) = Id, we have q −1 −1 X 0 0 = ∂s s=0p(T A1T,...,T AqT ) = p(A1,..., [Ai,T (0)],...,Aq) = 0 i=1 6.6. THE ATIYAH-SINGER INDEX THEOREM 153 0 and note that T (0) is an arbitrary element of Mn(R). We can apply this identity to see that P (RE) is closed, dP (RE) = q!dp(RE,...,RE) = q(q!)p(dRE,RE,...,RE) = q(q!)p([ω, RE],RE,...,RE) = 0. Next assume that ∇e E is another connection on E. We know that their differ- ence is an End(E) valued one-form, ∇e E − ∇E = α ∈ Ω1(M; End(E)). E E Define ∇t = ∇ + tα, and note that this is a connection on E with curvature given locally by E E 1 2 Rt = d(ω + tα) − (ω + tα) ∧ (ω + tα) = R + t(dα − [ω, α]) − 2 t [α, α]. A computation similar to that above yields E E E ∂tP (Rt ) = q(q!)dp(α, Rt ,...,Rt ) and so Z 1 Z 1 E E E E E E P (R1 )−P (R0 ) = q(q!) dp(α, Rt ,...,Rt ) dt = d q(q!) p(α, Rt ,...,Rt ) dt . 0 0 E (Another way of thinking about this is that the connections ∇t can be amalgamated into a connection ∇ on the pull-back of E to [0, 1] × M, then the fact that P (∇) is E closed implies that ∂tP (Rt ) is exact.) We will denote by [P (E)] the cohomology class of P (RE). Recall that the symmetric functions on matrices of size n are functions of the elementary symmetric polynomials τj defined by n n−1 det(t Id +A) = t + τ1(A)t + ... + τn−1(A)t + τn(A), so, e.g., τ1(A) = Tr(A), and τn(A) = Det(A). Hence the cohomology class [P (E)] is a a polynomial in the cohomology classes 2i [τi(E)] ∈ H (M). Moreover, notice that if we choose a metric on E and a compatible connection E E b ∇ , then in a local orthonormal frame {sa} for E the matrix (R )a will be skew- E symmetric and hence τi(R ) = 0 if i is odd. The non-vanishing classes 4i [τ2i(E)] ∈ H (M) 154 LECTURE 6. SPECTRUM OF THE LAPLACIAN are called the Pontryagin classes of E and they generate all of the characteristic classes of E. If we restrict the class of matrices we consider then there are sometimes more examples of invariant polynomials. For instance, if we assume that E is an oriented bundle then we can assume that RE is valued in skew-symmetric matrices and, if P is a polynomial on skew-symmetric matrices, we can define P (RE) as long as −1 P (T AT ) = P (A) for all T ∈ SOn(R). This yields one new invariant, the Pfaffian characterized (up to sign) by the equation Pff(A)2 = Det(A). Given a skew-symmetric matrix A = (aij), let X i j 2 n α = aijdx ∧ dx ∈ Ω (R ) i Now assume that the bundle E is a Hermitian vector bundle of complex rank E E n and for each Hermitian connection ∇ define ci(R ) by the equation E n E n−1 E det(t + iR ) = t + c1(R )t + ... + cn(R ). 2i th The class of [ci(E)] ∈ H (M) is known as the i Chern class of E. If F is a real vector bundle and E = F ⊗ C, then 4i [τ2i(F )] = [c2i(E)] ∈ H (M) and, in this case, [c2i+1(E)] = 0, but not in general. We can specify an invariant polynomial by specifying its value on a diagonal or block-diagonal matrix. Thus for instance, if A is skew-adjoint, it can be diagonalized 6.6. THE ATIYAH-SINGER INDEX THEOREM 155 P µj over C with eigenvalues µj. We define the Chern polynomial Ch(A) = e and the Todd polynomial Td(A) = Q µj and then define, for any complex Hermitian (1−e−µj ) vector bundle E, Chern character of E = Ch(RE), Todd genus of E = Td(RE). On the other hand, if B is skew-symmetric real valued matrix then, over R, we can write it in block-diagonal form where each block is either of the form 0 −µj µj 0 or is the 1-by-1 zero matrix. For these matrices we define the Hirzebruch L- Q µj Q µj polynomial by L(B) = and the A-roof genus by Ab(B) = µj tanh(µj ) 2 sinh( ) 2 (note that, as these are even functions, the sign ambiguity in the definition of µj is unimportant) and then, for any real vector bundle, L-polynomial of E = L(RE),A-roof genus of E = Ab(RE). 6.6.2 The index theorem Let E −→ M be a Z2-graded Dirac bundle, let γ be its involution, and ðE ∈ Diff1(M; E) the associated generalized Dirac operator. The construction of the heat 2 kernel above extends easily to Laplace-type operators such as ðE, −t − + ! 2 e ðE ðE 0 −tðE e = −t + − 0 e ðE ð and establishes short-time asymptotic expansions − + X + − X −tðE ðE −m/2 + k/2 −tðE ðE −m/2 − k/2 Tr(e ) ∼ t ak,Et , Tr(e ) ∼ t ak,Et with each coefficient given by the integral of a polynomial in the curvature of g, the curvature of ∇E, and the covariant derivatives of these curvatures, Z ± ± ak,E = Uk,E(x) dvolg . M It is easy to see (exercise 5) that the non-zero spectrum of the composition of two operators AB, coincides with the non-zero spectrum of BA, and that the 156 LECTURE 6. SPECTRUM OF THE LAPLACIAN corresponding eigenspaces have the same dimension. One remarkable consequence pointed out by McKean and Singer is that − + + − −tðE ðE −tðE ðE + Tr(e ) − Tr(e ) = ind(ðE) for all t > 0. (One can also prove this by showing that the derivative of the left hand side with respect to t vanishes, and then taking the limit of the left hand side as t → ∞.) They noted that this gives a very non-explicit formula for the index, Z + + − (6.1) ind(ðE) = Um,E(x) − Um,E(x) dvolg, M and conjectured that after a ‘fantastic cancellation’, the universal polynomial + − Um,E(x) − Um,E(x) depends only on the curvatures and on no covariant derivatives. Note that the formula (6.1) already gives interesting information about the index of a Dirac-type operator. For instance, it vanishes in odd dimensions, it is additive for connected sums, and multiplicative for finite covers. The fantastic cancellation, now known as the local index theorem was subse- quently shown in a couple of different ways. First Patodi was able to show that the cancellation takes place for a couple of specific operators by a direct computation. Then Gilkey gave a proof, simplified by Atiyah-Bott-Patodi, that first characterizes the form-valued invariants of a Riemannian structure and Hermitian bundle that are ‘natural’ in that they are invariant under pull-back, as precisely the elements of the ring generated by the Pontrjagin forms of TM and the Chern forms of E. The + − invariance properties of Um,E(x) − Um,E(x) are enough to show that it is natural, and then evaluation of some simple examples determines the polynomial explicitly. + − The other approach allows a direct computation Um,E(x) − Um,E(x) using the ‘supersymmetry’ of Clifford algebras. This was discovered by Getzler who showed that after taking the supersymmetry into account, one could use the classical formula of Mehler for the heat kernel of the harmonic oscillator. In one dimension this says that 2 2 P1 = −∂x + x =⇒ 1 1 e−tP1 (x, y) = √ exp − (x2 cosh 2t − 2xy + y2 cosh 2t) , 2π sinh 2t 2 sinh 2t but this can be generalized to higher dimensional Laplacians acting on vector bun- dles. 6.7. EXAMPLES 157 Applied to the Dirac operator on a spin manifold, this yields R/4πi U + (x) − U − (x) = Ev det1/2 m,S/ m,S/ m sinh(R/4πi) and hence Z + ind(ð ) = Ab(TM). M For a generalized Dirac operator associated to a Z2-graded Dirac bundle over an even dimensional manifold, recall that we have ∗ E 1 E End(E) = Cl(T M) ⊗ EndCl(T ∗M)(E),R = 4 cl Re + Rb E with Rb , the twisting curvature, a section of EndCl(T ∗M)(E). Similarly the grading γ of E decomposes into cl (Γ) ⊗ γb with γb ∈ EndCl(T ∗M)(E). The local index theorem in this case is the fact that ! ! + − 1/2 R/e 4πi E Um,E(x) − Um,E(x) = Evm det tr γb exp(iRb /2π) . sinh(R/e 4πi) E The differential form tr γb exp(iRb /2π) is known as the relative Chern form of E, it is closed and its cohomology class is independent of the choice of Clifford connection on E and will be denoted Ch0(E). The index theorem in this case is Z + 0 ind(ðE) = Ab(TM) Ch (E). M 6.7 Examples For the Gauss-Bonnet operator, recall that we have even even odd ðdR :Ω (M) −→ Ω (M) even and that the index of ðdR is equal to the Euler characteristic of M. The index theorem in this case is the Chern-Gauss-Bonnet theorem Z even χ(M) = ind(ðdR ) = Pff(TM). M For the signature operator, recall that we use the Hodge star to split ∗ ∗ ∗ ∗ ∗ ∗ ΛCT M = Λ+T M ⊕ Λ−T M 158 LECTURE 6. SPECTRUM OF THE LAPLACIAN ∗ ∗ and then the restriction of the de Rham operator to Λ+T M is the signature operator + ∗ ∗ ∗ ∗ ðsign :Λ+T M −→ Λ−T M. + The index of ðsign is the signature of the manifold M. The index theorem in this case is the Hirzebruch signature theorem Z + sign(M) = ind(ðsign) = L(M). M For the ∂ operator on a K¨ahler manifold, recall that we have a splitting k ∗ M p,q Λ T M ⊗ C = Λ M p+q=k with respect to which d splits into d = ∂ + ∂ :Ωp,qM −→ Ωp+1,qM ⊕ Ωp,q+1(M) and then operator √ + ∗ 0,even 0,odd ðRR = 2 (∂ + ∂ ):Ω M −→ Ω (M) is a generalized Dirac operator whose index is the Euler characteristic of the Dol- beault complex. The index theorem in this case is the Grothendieck-Riemann-Roch theorem Z + ind(ðRR) = Td(TM ⊗ C). M For a general elliptic pseudodifferential operator P ∈ Ψ∗(M; E,F ), choose a E F connection ∇ with curvature −2πiωE and a connection ∇ with curvature −2πiωF . Define −1 1 −1 2 ω(t) = (1 − t)ωE + tσ(P ) ωF σ(P ) + 2πi t(1 − t)(σ (P )∇σ(P )) and then set Z 1 1 −1 ω(t) odd ∗ Ch(f σ(P )) = − Tr(σ (P )∇σ(P )e ) dt ∈ Ω (S M) 2πi 0 This is the ‘odd Chern form’ defined by the symbol. The Atiyah-Singer index theorem is the formula Z ind(P ) = Ch(f σ(P )) Td(TM ⊗ C). S∗M 6.8. EXERCISES 159 6.8 Exercises Exercise 1. Show that zero is an eigenvalue of infinite multiplicity for d :Ω∗(M) −→ Ω∗(M), that d has no other eigenvalue, and that there is an infinite dimensional space of forms that are not in this eigenspace. Exercise 2. Verify directly that the spectrum of the Laplacian on R is [0, ∞) and that there are no eigenvalues. Exercise 3. Compute the spectrum of the Laplacian on the flat torus R2/Z2. Compute the spectrum of the Laplacian on 1-forms on the flat torus. Exercise 4. Compute the spectrum of the sphere S2 with the round metric. Exercise 5. Show that AB and BA have the same spectrum, including multiplicity, except possibly for zero. Exercise 6. Show that Eλ(∆k) form an acyclic complex. Exercise 7. Use the Fourier transform to show that the integral kernel of the heat operator on Rm is given by 0 2 0 1 − |ζ−ζ | K −t∆ (t, ζ, ζ ) = e 4t . e (4πt)m/2 Exercise 8. Use the heat kernel to give an alternate proof that a de Rham class contains a harmonic form. 6.9 Bibliography A great place to read about spectral theory is the book Perturbation theory for linear operators by Kato, I also like Functional analysis by Rudin. Our construction of the heat kernel is due to Richard Melrose and can be found in Chapter 7 of The Atiyah-Patodi-Singer index theorem. One should also look at Chapter 8 to see how to carry out Getzler rescaling geometrically.