This is page 121 Printer: Opaque this 4 Manifolds, Vector Fields, and Differential Forms
In preparation for later chapters, it will be necessary for the reader to learn a little bit about manifold theory. We recall a few basic facts here, beginning with the finite-dimensional case. (See Abraham, Marsden, and Ratiu [1988] for a full account.) The reader need not master all of this material now, but it suffices to read through it for general sense and come back to it repeatedly as our development of mechanics proceeds.
4.1 Manifolds
Our first goal is to define the notion of a manifold. Manifolds are, roughly speaking, abstract surfaces that locally look like linear spaces. We shall assume at first that the linear spaces are Rn for a fixed integer n, which will be the dimension of the manifold. Coordinate Charts. Given a set M,achart on M is a subset U of M together with a bijective map ϕ : U → ϕ(U) ⊂ Rn. Usually, we denote ϕ(m) by (x1,...,xn) and call the xi the coordinates of the point m ∈ U ⊂ M. Two charts (U, ϕ) and (U ,ϕ) such that U ∩ U = ∅ are called compat- ible if ϕ(U ∩ U ) and ϕ(U ∩ U) are open subsets of Rn and the maps
ϕ ◦ ϕ−1|ϕ(U ∩ U ):ϕ(U ∩ U ) −→ ϕ(U ∩ U ) and
ϕ ◦ (ϕ)−1|ϕ(U ∩ U ):ϕ(U ∩ U ) −→ ϕ(U ∩ U ) 122 4. Manifolds, Vector Fields, and Differential Forms are C∞. Here, ϕ ◦ϕ−1|ϕ(U ∩U ) denotes the restriction of the map ϕ ◦ϕ−1 to the set ϕ(U ∩ U ). See Figure 4.1.1.
xn ϕ(U) M
ϕ
x1 U xn
ϕ (U ) U V ϕ
x1
Figure 4.1.1. Overlapping charts on a manifold.
We call M a differentiable n-manifold if the following hold: M1. The set M is covered by a collection of charts, that is, every point is represented in at least one chart. M2. M has an atlas; that is, M canbewritten as a union of compatible charts. If a chart is compatible with a given atlas, then it can be included into the atlas itself to produce a new, larger, atlas. One wants to allow such charts, thereby enlarging a given atlas, and so one really wants to define a differentiable structure as a maximal atlas.Wewill assume that this is done and resist the temptation to make this process overly formal. A simple example will make what we have in mind clear. Suppose one considers Euclidean three-space R3 as a manifold with simply one (iden- tity) chart. Certainly, we want to allow other charts such as those defined by spherical coordinates. Allowing all possible charts whose changes of co- ordinates with the standard Euclidean coordinates are smooth then gives us a maximal atlas. A neighborhood of a point m in a manifold M is defined to be the inverse image of a Euclidean space neighborhood of the point ϕ(m) under achart map ϕ : U → Rn. Neighborhoods define open sets, and one checks that the open sets in M define a topology. Usually, we assume without explicit mention that the topology is Hausdorff:Two different points m, m in M have nonintersecting neighborhoods. 4.1 Manifolds 123
Tangent Vectors. Two curves t → c1(t) and t → c2(t)inann-manifold M are called equivalent at the point m if c1(0) = c2(0) = m and (ϕ ◦ c1) (0) = (ϕ ◦ c2) (0) in some chart ϕ. Here the prime denotes the differentiation of curves in Euclidean space. It is easy to check that this definition is chart indepen- dent and that it defines an equivalence relation. A tangent vector v to a manifold M at a point m ∈ M is an equivalence class of curves at m. It is a theorem that the set of tangent vectors to M at m forms a vector space. It is denoted by TmM and is called the tangent space to M at m ∈ M. Given a curve c(t), we denote by c(s) the tangent vector at c(s) defined by the equivalence class of t → c(s + t)att =0.Wehave set things up so that tangent vectors to manifolds are thought of intuitively as tangent vectors to curves in M. Let ϕ : U ⊂ M → Rn be achart for the manifold M,sothat we get as- sociated coordinates (x1,... ,xn) for points in U. Let v be a tangent vector to M at m; i.e., v ∈ TmM, and let c beacurve that is a representative of the equivalence class v. The components of v are the numbers v1,... ,vn defined by taking the derivatives of the components, in Euclidean space, of the curve ϕ ◦ c: ' ' d ' vi = (ϕ ◦ c)i' , dt t=0 where i =1,... ,n.From the definition, the components are independent of the representative curve chosen, but they do, of course, depend on the chart chosen. Tangent Bundles. The tangent bundle of M, denoted by TM,is the set that is the disjoint union of the tangent spaces to M at the points m ∈ M, that is, ( TM = TmM. m∈M Thus, a point of TM is a vector v that is tangent to M at some point m ∈ M. If M is an n-manifold, then TM isa2n-manifold. To define the dif- ferentiable structure on TM,weneed to specify how to construct local coordinates on TM.Todothis, let x1,... ,xn be local coordinates on M and let v1,... ,vn be components of a tangent vector in this coordinate system. Then the 2n numbers x1,... ,xn,v1,... ,vn give a local coordi- nate system on TM. This is the basic idea one uses to prove that indeed TM is a 2n-manifold. The natural projection is the map τM : TM → M that takes a tangent vector v to the point m ∈ M at which the vector v is attached (that is, 124 4. Manifolds, Vector Fields, and Differential Forms
∈ −1 ∈ v TmM). The inverse image τM (m)ofapointm M under the natural projection τM is the tangent space TmM. This space is called the fiber of the tangent bundle over the point m ∈ M.
Differentiable Maps and the Chain Rule. Let f : M → N beamap of a manifold M to a manifold N.Wecall f differentiable (resp. Ck)ifin local coordinates on M and N, the map f is represented by differentiable (resp. Ck) functions. Here, by “represented” we simply mean that coor- dinate charts are chosen on both M and N so that in these coordinates f, suitably restricted, becomes a map between Euclidean spaces. One of course has to check that this notion of smoothness is independent of the charts chosen—this follows from the chain rule. The derivative of a differentiable map f : M → N at a point m ∈ M is defined to be the linear map
Tmf : TmM → Tf(m)N constructed in the following way. For v ∈ TmM,choose a curve c :]−, [ → M with c(0) = m, and associated velocity vector dc/dt |t=0 = v . Then Tmf · v is the velocity vector at t =0of the curve f ◦ c : R → N, that is, ' ' d ' Tmf · v = f(c(t))' . dt t=0
The vector Tmf · v does not depend on the curve c but only on the vector v,asisseen using the chain rule. If f : M → N is of class Ck, then Tf : TM → TN is a mapping of class Ck−1. Note that ' ' dc' ' = T0c · 1. dt t=0
If f : M → N and g : N → P are differentiable maps (or maps of class Ck), then g ◦ f : M → P is differentiable (or of class Ck), and the chain rule holds: T (g ◦ f)=Tg◦ Tf.
Diffeomorphisms. A differentiable (or of class Ck) map f : M → N is called a diffeomorphism if it is bijective and its inverse is also differen- tiable (or of class Ck). If Tmf : TmM → Tf(m)N is an isomorphism, the inverse function theorem states that f is a local diffeomorphism around m ∈ M, that is, there are open neighborhoods U of m in M and V of f(m)inN such that f|U : U → V is a diffeomorphism. The set of all diffeomorphisms f : M → M forms a group under composition, and the chain rule shows that T (f −1)=(Tf)−1. 4.1 Manifolds 125
Submanifolds and Submersions. A submanifold of M is a subset S ⊂ M with the property that for each s ∈ S there is a chart (U, ϕ)inM with the submanifold property, namely, SM. ϕ : U → Rk × Rn−k and ϕ(U ∩ S)=ϕ(U) ∩ (Rk ×{0}). The number k is called the dimension of the submanifold S. This latter notion is in agreement with the definition of dimension for a general manifold, since S is a manifold in its own right all of whose charts are of the form (U ∩ S, ϕ|(U ∩ S)) for all charts (U, ϕ)ofM having the submanifold property. Note that any open subset of M is a submanifold and that a submanifold is necessarily locally closed, that is, every point s ∈ S admits an open neighborhood U of s in M such that U ∩ S is closed in U. There are convenient ways to construct submanifolds using smooth map- pings. If f : M → N is a smooth map, a point m ∈ M is a regular point if Tmf is surjective; otherwise, m is a critical point of f.IfC ⊂ M is the set of critical points of f, then f(C) ⊂ N is the set of critical values of f and N\f(C)istheset of regular values of f.1 The submersion theorem states that if f : M → N is a smooth map and n is a regular value of f, then f −1(n)isasmooth submanifold of M of dimension dim M − dim N and # $ −1 Tm f (n) =kerTmf.
The local onto theorem states that Tmf : TmM → Tf(m)N is surjective if and only if there are charts ϕ : U ⊂ M → U at m in M and ψ : V ⊂ N → V at f(m)inN such that ϕ maps into the product space Rdim M−dim N × Rdim N ; the image of U correspondingly has the form of a product U = U ×V ; the point m gets mapped to the origin ϕ(m)=(0, 0), as does f(m), namely, ψ(f(m)) = 0; and the local representative of f is a projection:
(ψ ◦ f ◦ ϕ−1)(x, y)=x.
In particular, f|U : U → V is onto. If Tmf is onto for every m ∈ M, then f is called a submersion.Itfollows that submersions are open mappings (the images of open sets are open). Immersions and Embeddings. A Ck map f : M → N is called an im- mersion if Tmf is injective for every m ∈ M. The local 1-to-1 theorem states that Tmf is injective if and only if there are charts ϕ : U ⊂ M → U at m in M and ψ : V ⊂ N → V at f(m)inN such that V is a product
1Sard’s theorem states that if f : M → N is a Ck-map, k ≥ 1, and if M has the property that every open covering has a countable subcovering, then if k> max(0, dim M − dim N), the set of regular values of f is residual and hence dense in N. 126 4. Manifolds, Vector Fields, and Differential Forms
V = U ×V ⊂ Rdim M ×Rdim N−dim M ;both m and f(m) get sent to zero, i.e., ϕ(m)=0 and ψ(f(m)) = (0, 0); and the local representative of f is the inclusion (ψ ◦ f ◦ ϕ−1)(x)=(x, 0). In particular, f|U : U → V is injective. The immersion theorem states that Tmf is injective if and only if there is a neighborhood U of m in M such that f(U)isasubmanifold of N and f|U : U → f(U)isadiffeomorphism. It should be noted that this theorem does not say that f(M)isasub- manifold of N.For example, f may not be injective and f(M)maythus have self-intersections. Even if f is an injective immersion, the image f(M) may not be a submanifold of N.Anexample is indicated in Figure 4.1.2.
y f R2
x π/4 π 7π/4 r = cos 2θ
Figure 4.1.2. An injective immersion.
The map indicated in the figure (explicitly given by f :]π/4, 7π/4[→ R2; θ → (sin θ cos 2θ, cos θ cos 2θ)) is an injective immersion, but the topol- ogy induced from R2 onto its image does not coincide with the usual topol- ogy of the open interval: Any neighborhood of the origin in the relative topology consists, in the domain interval, of the union of an open interval about π with two open segments ]π/4,π/4+[, ]7π/4 − , 7π/4[ . Thus, the image of f is not a submanifold of R2, but an injectively immersed submanifold. An immersion f : M → N that is a homeomorphism onto f(M) with the relative topology induced from N is called an embedding.Inthis case f(M)isasubmanifold of N and f : M → f(M)isadiffeomorphism. For example, if f : M → N is an injective immersion and if M is compact, then f is an embedding. Thus, the example given in the preceding figure is an example of an injective immersion that is not an embedding (and of course, M is not compact). Another example of an injective immersion that is not an embedding is the linear flow on the torus T2 = R2/Z2 with irrational slope: f(t)= (t, αt) (mod Z2). However, there is a difference between this injective im- mersion and the “figure eight” example above: In some sense, the second 4.1 Manifolds 127 example is better behaved; it has some “uniformity” about its lack of being an embedding. An injective immersion f : M → N is called regular if the following property holds: If g : L → M is any map of the manifold L into M, then g is Ck if and only if f ◦ g : L → N is Ck for any k ≥ 1. It is easy to see that all embeddings satisfy this property but that the previous example also satisfies it, without being an embedding, and that the “figure eight” example (see Figure 4.1.2) does not satisfy it. Varadarajan [1974] calls such maps quasi-regular embeddings; they appear below in the Frobenius theorem and in the study of Lie subgroups. Vector Fields and Flows. A vector field X on a manifold M is a map X : M → TM that assigns a vector X(m)atthepoint m ∈ M; that is, τM ◦ X = identity. The real vector space of vector fields on M is denoted by X(M). An integral curve of X with initial condition m0 at t =0 is a (differentiable) map c :]a, b[ → M such that ]a, b[isanopeninterval containing 0, c(0) = m0, and c(t)=X(c(t)) for all t ∈ ]a, b[. In formal presentations we usually suppress the domain of definition, even though this is technically important. The flow of X is the collection of maps ϕt : M → M such that t → ϕt(m)isthe integral curve of X with initial condition m. Existence and uniqueness theorems from ordinary differential equations guarantee that ϕ is smooth in m and t (where defined) if X is. From uniqueness, we get the flow property
ϕt+s = ϕt ◦ ϕs along with the initial condition ϕ0 = identity. The flow property generalizes the situation where M = V is a linear space, X(m)=Am for a (bounded) linear operator A, and where
tA ϕt(m)=e m to the nonlinear case. A time-dependent vector field is a map X : M × R → TM such that X(m, t) ∈ TmM for each m ∈ M and t ∈ R.Anintegral curve of X is a curve c(t)inM such that c(t)=X(c(t),t). In this case, the flow is the collection of maps ϕt,s : M → M such that t → ϕt,s(m)isthe integral curve c(t) with initial condition c(s)=m at t = s. Again, the existence and uniqueness theorem from ODE theory applies, and in particular, uniqueness gives the time-dependent flow property ϕt,s ◦ ϕs,r = ϕt,r. 128 4. Manifolds, Vector Fields, and Differential Forms
If X happens to be time independent, the two notions of flows are related by ϕt,s = ϕt−s. Differentials and Covectors. If f : M → R is a smooth function, we can differentiate it at any point m ∈ M to obtain a map Tmf : TmM → Tf(m)R. Identifying the tangent space of R at any point with itself (a process we usually do in any vector space), we get a linear map → R ∈ ∗ df(m):TmM . That is, df(m) TmM, the dual of the vector space TmM.Wecall df the differential of f.Forv ∈ TmM,wecall df(m) · v the directional derivative of f in the direction v.Inacoordinate chart or in linear spaces, this notion coincides with the usual notion of a directional derivative learned in vector calculus. Explicitly, in coordinates, the directional derivative is given by