A PEDAGOGICAL INVESTIGATION OF THE DEVELOPMENT OF GENERAL RELATIVITY USING DIFFERENTIAL FORMS
by
BENJAMIN DAVID SABREE
Submitted in partial fulfillment of the requirements for the degree of
Master of Science
Thesis Advisors: Prof. David A. Singer (CWRU Mathematics)
and Prof. Kenneth L. Kowalski (CWRU Physics)
Department of Mathematics
CASE WESTERN RESERVE UNIVERSITY
August 2008 CASE WESTERN RESERVE UNIVERSITY
SCHOOL OF GRADUATE STUDIES
We hereby approve the thesis/dissertation of
______
candidate for the ______degree *.
(signed)______(chair of the committee)
______
______
______
______
______
(date) ______
*We also certify that written approval has been obtained for any proprietary material contained therein. Contents
Acknowledgments iv
Abstract v
1 Introduction 1
2 Some General Advantages of Differential Forms 5
3 Differential Forms in General Relativity 12
4 Conclusions 20
References 23
iii Acknowledgments
There are many people without whom this thesis would not have been possible. I would like to extend my sincere gratitude to my department chair, Professor James
Alexander, my primary advisor, Professor David Singer, and my secondary advisor,
Professor Kenneth Kowalksi, for serving on my thesis committee. I especially appre- ciate the assistance I received from Professors Kowalski and Singer while researching and writing this thesis. I would like to thank Professor Thomas Ivey for his gener- ous input on the advantages of differential forms. Thanks is also due to Professor
Kowalski for helping greatly with my search for a thesis topic.
I owe a debt of gratitude to my parents, Anna and Steve Sabree, for their constant support and encouragement. Without it, I would not be where I am today.
Most of all, I would like to extend a heartfelt thanks to my greatest source of unwavering support, my wife Emily.
iv A Pedagogical Investigation of the Development of General
Relativity Using Differential Forms
Abstract
by
Benjamin David Sabree
General relativity is widely applicable to many areas of current physics research.
Some ‘math first’ treatments of the subject employ differential forms while others do not. This paper advocates those approaches that utilize differential forms by
first outlining some of the general mathematical advantages of forms and then by comparing developments of selected topics from the different treatments.
v 1 Introduction
General relativity, the modern theory of gravitation, was developed by Albert Ein- stein near the end of 1915. The theory attributes gravity to the effects of a curved four-dimensional spacetime, and it does so in the language of pseudo-Riemannian differential geometry. Exceptionally well-supported by numerous experiments and observations of the last century, general relativity is now understood to have conse- quences across a range of very diverse fields of physics. Some of the most obvious and immediate implications of the theory led to the beginning of modern cosmology, while in the last half of the twentieth century relativity was tied to the study of par- ticle physics. Despite having gone through an unfashionable period for a number of decades after its formulation, general relativity is now at the forefront of a great deal of current scientific research and experimentation.
The teaching of general relativity has a somewhat troubled history. For many years after it was first published, several factors contributed to the consideration of relativity as being on the fringes of mainstream physics study. Initially, relativity was perceived as being a mathematically difficult theory and its brilliant creator was still active in the field. Additionally, the theory was almost immediately experimentally reinforced as far as the limits of contemporary observational precision would allow, most notably by its agreement with the documented advance of the perihelion of
1 Mercury and its successful prediction of the aberration of starlight by the sun [1].
Due to the limited accuracy provided by instruments, extreme circumstances were required for relativistic effects to be detectable. As a consequence, the theory was seen as having very few applications. Being outside of the normal body of physics research, there was little need to teach relativity.
During the latter half of the twentieth century, technological advances brought general relativity back into the fold of mainstream physics. The increased accuracy of astronomical observations along with advances in experimental precision began to reveal a wealth of phenomena which exhibited general relativistic effects [2, 3, 4, 5, 6].
It was also during the late twentieth century that general relativity was first tied into the study of other fields, most notably particle physics [7].
This newfound prominence of general relativity in theoretical and experimental physics demands its adoption into the standard physics curriculum. However, due to the fact that the theory has only been actively taught for a relatively short period of time, the pedagogy is quite underdeveloped and no method has yet to achieve a
‘tried and true’ status.
When developing the subject of general relativity for introduction to advanced undergraduate and beginning graduate students, there are two standard approaches.
The deductive approach involves first developing the requisite mathematical machin-
2 ery, then motivating and deriving the field equations, and finally exploring solutions
to the field equations and considering modern topics. This approach is quite natu-
rally referred to as ‘math first’. There are many texts that follow this approach, not
the least of which is the exceptional and well-respected tome by Misner, Thorne, and
Wheeler, Gravitation [8]. A more appropriate and bite-sized treatment can be found
in Schutz’s A First Course in General Relativity, which is the primary text I used for this paper [9].
The second approach, aptly called ‘physics first’, involves beginning with solutions to the field equations and exploring their consequences and interesting phenomena, with the actual derivation of the field equations and the associated mathematics rel- egated to near-appendix status. Champions of the ‘physics first’ approach claim that time constraints and other limitations, such as student mathematical background, more often than not preclude the use of a ‘math first’ development [10]. The primary text leading this charge is Hartle’s Gravity, which quite nicely accomplishes its goal of motivating the study of relativity by immediately examining fascinating phenomena while evading much of its mathematical foundation [11].
Despite the fact that in some cases, time constraints or student interests may necessitate a ‘physics first’ approach, every teaching situation must be assessed and dealt with on an individual basis. Therefore, it is best to avoid discounting a ‘math
3 first’ approach unless the only offerings of general relativity are to be survey courses
where students possess no interest in, nor necessity of, pursuing the subject beyond
an introductory overview.
Among those developments which subscribe to the ‘math first’ approach, differ-
ent techniques can be observed. Classically, the subject is developed from the idea
of vectors and vector fields directly into the land of tensors, in which the story of
relativity is set [12, 13, 14, 7]. In this case, the difficulty of the mathematics is exac-
erbated by a failure to use more modern mathematical techniques. Using the more
contemporary concept of differential forms allows for discussions not necessarily tied
to specific coordinates, which in turn simplifies many of the arguments along the road
to deriving the field equations [8, 9, 15]. Because the complexity of the mathematics
required is one of the major factors contributing to adoption of the ‘physics first’ ap-
proach, perhaps the benefits gained by initially investing time in the understanding
of differential forms will result in the deductive, more thorough treatment of general
relativity being seen as a more palatable approach.
Differential forms were first developed by Elie´ Cartan near the turn of the 20th century. Cartan’s innovation was to combine the one-forms of Pfaff with the exterior algebra of Grassman in an attempt to solve systems of differential equations arising from the study of thermodynamics [16]. While the utility of differential forms initially
4 went unrecognized for many years, they have since found their way into the study of subjects ranging from classical mechanics and geometric optics to quantum mechanics and electromagnetism [17]. The importance of differential forms to general relativity is perhaps best reflected by their role in Gravitation [8].
2 Some General Advantages of Differential Forms
The most obvious, and perhaps most trivial, advantage of using differential forms is economy of notation. The gain of differential forms over vector notation is similar to, but not quite so pronounced as, that achieved by vector notation over sets of scalar equations. For instance, using the vector notation developed by Gibbs, Heaviside was able to express the twenty scalar equations of Maxwell as the four vector equations commonly known today [18, 19, 20]. One example of the economy of differential forms is seen in Stokes’ theorem. Classically, Stokes’ theorem is expressed in vector notation as: Z ZZ F · dr = curl F · dS , (1) C S where S must be oriented, piecewise-smooth, and bounded by C, a simple, piecewise- smooth, closed curve with positive orientation, and F is a vector field whose compo- nents have continuous partial derivatives on an open region containing S.
5 Alternatively, expressed in the language of differential forms, Stokes’ theorem is given as: Z I dω˜ = ω , (2) M ∂M where M is an oriented, piecewise-smooth manifold of dimension n, ∂M is its bound- ary, and ω is a compactly supported continuously differentiable n − 1 form on M.
Just as economy of notation is not the only advantage of a vector formulation, neither is it the only advantage of a differential forms formulation. This expression of Stokes’ theorem in differential forms notation is a more generalized theorem which usurps the divergence theorem as well.
In addition to providing an economy of notation, differential forms are often more natural to work with than other objects. For example, consider the following:
Let σ be a curve, M and N be manifolds, TM and TN be their tangent spaces
(at some points p and f(p), respectively), and f be a map from M to N:
R σ / M f N
TpM Tf(p)N
0 The derivative σ = vp is a vector which lives in TpM. If we compose the map f with
6 0 σ and consider the derivative (f ◦ σ) , we have a vector living in Tf(p)N. We can
0 then define the map f∗ by f∗(vp) ≡ (f ◦ σ) , which takes vectors in TpM to vectors in
Tf(p)N:
R JJ JJ f◦σ σ JJ JJ JJ J% / M f N
f∗ TpM / Tf(p)N
The map f∗ can push any vector forward, but it will not take vector fields in TM to vector fields in TN unless f is a diffeomorphism.
On the other hand, if ω is a one-form defined everywhere in the cotangent bundle, we have:
R JJ JJ f◦σ σ JJ JJ JJ J% / M f N
f∗ TpM / Tf(p)N
ω R
7 ∗ Here we can define f ω ≡ ω ◦ f∗ to be the pullback of ω:
R JJ JJ f◦σ σ JJ JJ JJ J% / M f N
f∗ TpM / Tf(p)N I II II ω ∗ II f ω II I$ R
So we can always pull a differential form back, but we must have a diffeomorphism to push vector fields forward.
Another situation for which using differential forms is more natural arises when considering the gradient. Although the gradient is introduced to multi-variable cal- culus students as a vector, it is actually a one-form.
Take a scalar field φ(t, x, y, z) defined at every spacetime point along the world line of some observer. Parameterizing the world line by the value of proper time τ at each event on the path gives us the curve
φ(τ) = φ [t(τ), x(τ), y(τ), z(τ)] . (3)
The four-velocity of our observer along the world line is given by
dt dx dy dz U~ = ~e + ~e + ~e + ~e . (4) dτ t dτ x dτ y dτ z 8 Examining the rate of change of φ along our curve, we find
dφ ∂φ dt ∂φ dx ∂φ dy ∂φ dz = + + + dτ ∂t dτ ∂x dτ ∂y dτ ∂z dτ (5) ∂φ ∂φ ∂φ ∂φ = U t + U x + U y + U z . ∂t ∂x ∂y ∂z
dφ ~ Notice that dτ , a number, is a linear function of U, so we have defined a one-form,
the gradient:
dφe : TM −→ R . (6)
The usual definition of a gradient vector is the vector which points in the direction of greatest change per unit length. However, without a metric on our manifold, comparing lengths of vectors pointing in different directions is impossible. Because a metric is not required to describe the gradient one-form, it is a more natural object than the gradient vector [21, 9].
Similarly, a normal one-form is also a more natural object than a normal vector.
A normal vector is a vector orthogonal to some surface at a point, meaning it is orthogonal to all vectors tangent to the surface at that point. To define this we require a scalar product, which in turn requires a metric. On the other hand, defining a normal one-form to the surface does not require a metric. A normal one-form to a surface is simply a one-form whose value when contracted with every vector tangent to the surface at that point is zero [9].
9 In particular, for a surface defined as the solution to
φ(t, x, y, z) = constant , (7)
we see that the gradient, dφ˜ , is a normal one-form. We can therefore picture dφ˜ as a
set of surfaces of constant φ, which can be quite useful intuitively [9].
Finding transformation rules for objects is yet another situation in which differ-
ential forms can be quite valuable. Classically, vectors are defined as objects that
transform under an arbitrary coordinate transformation in the same way as displace- −→ ments transform. That is, a vector ∆r can be represented as a displacement: in