GEM: the User Manual Understanding Spacetime Splittings and Their Relationships1
Robert T. Jantzen Paolo Carini and Donato Bini
June 15, 2021
1This manuscript is preliminary and incomplete and is made available with the understanding that it will not be cited or reproduced without the permission of the authors. Abstract
A comprehensive review of the various approaches to splitting spacetime into space-plus-time, establishing a common framework based on test observer families which is best referred to as gravitoelectromagnetism. Contents
Preface vi
1 Introduction 1 1.1 Motivation: Local Special Relativity plus Rotating Coordinates ...... 1 1.2 Why bother? ...... 4 1.3 Starting vocabulary ...... 6 1.4 Historical background ...... 8 1.5 Orthogonalization in the Lorentzian plane ...... 11 1.6 Notation and conventions ...... 14
2 The congruence point of view and the measurement process 17 2.1 Algebra ...... 17 2.1.1 Observer orthogonal decomposition ...... 18 2.1.2 Observer-adapted frames ...... 21 2.1.3 Relative kinematics: algebra ...... 23 2.1.4 Splitting along parametrized spacetime curves and test particle worldlines ...... 26 2.1.5 Addition of velocities and the aberration map ...... 28 2.2 Derivatives ...... 29 2.2.1 Natural derivatives ...... 29 2.2.2 Covariant derivatives ...... 30 2.2.3 Kinematical quantities ...... 31 2.2.4 Splitting the exterior derivative ...... 33 2.2.5 Splitting the differential form divergence operator ...... 35 2.2.6 Spatial vector analysis ...... 35 2.2.7 Ordinary and Co-rotating Fermi-Walker derivatives ...... 37 2.2.8 Relation between Lie and Fermi-Walker temporal derivatives ...... 39 2.2.9 Total spatial covariant derivatives ...... 43 2.2.10 Splitting the total covariant derivative ...... 46 2.3 Observer-adapted frame derivatives ...... 47 2.3.1 Natural frame derivatives ...... 47 2.3.2 Splitting the connection coefficients ...... 48 2.3.3 Observer-adapted connection components ...... 49 2.3.4 Splitting covariant derivatives ...... 50 2.3.5 Observer-adapted components of total spatial covariant derivatives ...... 52 2.4 Relative kinematics: applications ...... 55 2.4.1 Splitting the acceleration equation ...... 55 2.4.2 Analogy with electromagnetism: gravitoelectromagnetism ...... 57 2.4.3 Maxwell-like equations ...... 58 2.4.4 Splitting the spin transport equation ...... 60 2.4.5 Relative Fermi-Walker transport and gyro precession ...... 62 2.4.6 The Schiff Precession Formula ...... 65 2.4.7 The relative angular velocity as a boost derivative ...... 67
i 2.4.8 Relative kinematics: transformation of spatial gravitational fields ...... 68 2.5 Spatial curvature and torsion ...... 70 2.5.1 Definitions ...... 70 2.5.2 Algebraic symmetries ...... 70 2.5.3 Symmetry-obeying spatial curvature ...... 72 2.5.4 Spatial Ricci tensors and scalar curvatures ...... 73 2.5.5 Pair interchange symmetry ...... 73 2.5.6 Spatial covariant exterior derivative ...... 74 2.6 The symmetrized curl operator for symmetric spatial 2-tensors ...... 76 2.7 Splitting spacetime curvature ...... 77 2.7.1 Splitting definitions ...... 77 2.7.2 Spacetime duality and curvature ...... 78 2.7.3 Evaluation of splitting fields ...... 79 2.7.4 Maxwell-like equations ...... 81 2.8 Mixed commutation formulas ...... 82 2.8.1 Splitting the Ricci identities ...... 82 2.8.2 Commuting £(u)u and ∇(u) ...... 83 2.9 Splitting the Bianchi identities of the second kind ...... 84 2.9.1 Spacetime identities ...... 84 2.9.2 Spatial identities ...... 87 2.10 “Time without space defines space without time” and vice versa ...... 88
3 The slicing and threading points of view 89 3.1 Introduction ...... 89 3.2 Algebra ...... 89 3.2.1 The nonlinear reference frame ...... 89 3.2.2 Measurement and the lapse function ...... 90 3.2.3 The Shift ...... 92 3.2.4 Computational frames and the reference decomposition ...... 93 3.2.5 Decomposing the metric ...... 94 3.2.6 Relationship between the reference and observer decompositions ...... 96 3.2.7 The slicing, threading and reference representations ...... 97 3.2.8 Transformation between slicing and threading points of view ...... 98 3.2.9 So far: ...... 99 3.3 Derivatives ...... 101 3.3.1 Evolution ...... 101 3.3.2 Natural time derivatives ...... 101 3.3.3 Natural spatial derivatives ...... 102 3.3.4 Gauge transformations of the nonlinear reference frame ...... 103 3.3.5 Observer-adapted frame structure functions and kinematical quantities ...... 105 3.3.6 Spatial covariant derivative ...... 106 3.3.7 Spatial vector analysis ...... 107 3.3.8 Partially-observer-adapted frames: connection components ...... 108 3.3.9 Total spatial covariant derivatives ...... 109 3.3.10 Spatial gravitational forces ...... 111 3.3.11 Second-order acceleration equation ...... 112 3.3.12 The spin transport equation ...... 113 3.3.13 Transformation of spatial gravitational fields ...... 114 3.4 Spatial curvature ...... 115 3.5 Initial value problem? ...... 115 3.5.1 Hypersurface and slicing points of view ...... 115 3.5.2 Thin sandwich problem ...... 116 3.5.3 Congruence and threading points of view ...... 116
ii 3.5.4 Perfect fluids ...... 117
4 Maxwell’s equations 118 4.1 Introduction ...... 118 4.2 Splitting the electromagnetic field ...... 118 4.2.1 Congruence point of view ...... 118 4.2.2 Slicing and threading points of view ...... 119 4.2.3 Observer Boost ...... 120 4.2.4 Reference representation (Landau-Lifshitz-Hanni) ...... 120 4.3 Splitting the 4-current ...... 122 4.4 Splitting Maxwell’s equations ...... 123 4.4.1 Congruence point of view ...... 123 4.4.2 Slicing and threading points of view ...... 124 4.5 Vector potential ...... 126 4.6 Wave equations ...... 127 4.7 Computational 3-space representations ...... 128 4.8 Lines of force ...... 128
5 Stationary spacetimes 129 5.1 Stationary nonlinear reference frame ...... 129 5.2 Synchronization gap and Sagnac effect ...... 130 5.3 Rotating spatial Cartesian coordinates in flat spacetime ...... 133 5.4 Stationary axially-symmetric case: rotating Minkowski, G¨odeland Kerr spacetimes ...... 135
6 Perturbation problems 139 6.1 Linearization about an orthogonal nonlinear reference frame ...... 139 6.2 Post-Newtonian approximation ...... 139 6.3 The Newtonian limit ...... 139 6.4 Friedmann-Robertson-Walker Perturbations ...... 139
A Formulas from differential geometry 1 A.1 Manifold ...... 1 A.2 Frame and dual frame ...... 1 A.3 Linear transformations ...... 2 A.4 Change of frame ...... 3 A.5 Metric ...... 3 A.6 Unit volume n-form ...... 4 A.7 Connection ...... 4 A.8 Metric connection ...... 6 A.9 Curvature ...... 6 A.10 Total covariant derivative ...... 7 A.11 Parallel transport and geodesics ...... 8 A.12 Generalized Kronecker deltas ...... 8 A.13 Symmetrization/antisymmetrization ...... 8 A.14 Exterior product ...... 9 A.15 Hodge star duality operation ...... 9 A.16 Complex Duality Operation ...... 10 A.17 Exterior derivative ...... 10 A.18 Differential form divergence operator ...... 11 A.19 De Rham Laplacian ...... 12 A.20 Covariant exterior derivative ...... 12 A.21 Ricci identities ...... 13 A.22 Bianchi identities of the first and second kind ...... 13 A.23 Ricci Tensor, Scalar Curvature and Einstein tensor ...... 14
iii A.24 Contracted Bianchi Identities of the Second Kind and the Weyl Tensor ...... 14 A.25 Conformal Transformations in 3 Dimensions ...... 15 A.26 n = 3 Structure Functions and Orthonormal Frame Connection Components . . . . . 16 A.27 Lie derivative ...... 17
iv List of Figures
1.1 Spacetime and Space plus Time. Although spacetime is the arena where calculations are simpler, we always interpret them through our space plus time worldview, which depends on the choice of inertial observer...... 2 1.2 General linear coordinates on E2 and their two orthogonalizations. The dashed lines indicate the unit coordinate rectangle and the dotted lines the projections of a typical vector along the coordinate axes...... 12 1.3 General linear coordinates on M 2 and their two orthogonalizations. The dashed lines indicate the unit coordinate rectangle and the dotted lines the projections of a typical vector along the coordinate axes...... 14
2.1 The relative observer plane, the relative velocities and the associated relative observer maps. 24
3.1 Computational 3-spaces ...... 90 3.2 Reference decomposition of tangent and cotangent spaces ...... 90 3.3 Shifting of points of the computational 3-space...... 92 3.4 Projected computational frame ...... 94 3.5 Decompositions of a vector field ...... 96 3.6 Lorentz boost 2-plane ...... 99 3.7 A suggestive representation of the various total Lie spatial covariant derivatives for the hyper- surface, slicing and threading points of view, normalized to correspond to the proper time of the appropriate observers. Each covariant derivative is the sum of a temporal Lie derivative and a spatial covariant derivative along the temporal and spatial parts of the relevant multi- ple of the 4-velocity U represented in the triangle, each vector decomposition given under the symbol for the corresponding covariant derivative...... 111
5.1 Lifts from the quotient space ...... 131 5.2 Sagnac tube geometry ...... 131
v Preface
I was a sophomore in John Wheeler’s modern physics course at Princeton University in 1972 when the proofs for some chapters of Gravitation began showing up. This book and its authors (Misner, Thorne and Wheeler) have helped to shape generations of relativists including myself and have done much to establish the use of more modern mathematical notation and style as commonplace in the field and in neighboring areas. In particular it is the one textbook which presents the 3 + 1 approach to general relativity to a wide audience. However, an alternative “1 + 3” approach to the splitting of spacetime into space plus time also existed and was most readily found in the text The Classical Theory of Fields by Landau and Lifshitz, but few versed in the 3 + 1 school took the time to understand this alternative in terms of the same beautiful language that made the 3 + 1 approach such a powerful tool in gravitational physics. The culprit is of course quantum gravity which was always lurking in the background as one of the prime motivations for using the 3 + 1 approach. In astrophysical applications the superiority of either approach is not obvious and it seems clear that both approaches together can reveal complementary features of a physical problem, shedding more insight than either one alone. Indeed the 1 + 3 approach underlies much of what is done in the post-Newtonian approximation of general relativity, although it is not generally recognized, and provides the variable decomposition for the stationary exact solution industry, while in the cosmological context it appears that gauge invariant perturbation theory for Friedmann-Robertson-Walker cosmological models may even be simpler in this approach. Unfortunately the 1 + 3 approach, which apparently flourished in the fifties before the 3 + 1 school took over, suffered a near fatal public relations blackout in the intervening decades preceding the nineties. This has left many of us who have entered the field since the seventies almost “completely unaware” of much of this formalism beyond recognition of it from limited exposure to it in The Classical Theory of Fields or in the closely related congruence approach systematized for applications in cosmology by Ehlers and more widely known through the work of Hawking and Ellis. When Paolo Carini as a student of Remo Ruffini in Rome in 1989 got me interested in trying to explain what was really underlying the peculiar coordinate decomposition of the electromagnetic field found in an exercise in the Landau-Lifshitz text and its relationship to the 3 + 1 and 1 + 3 approaches, we were drawn into an examination of the very foundations of the splitting formalisms independent of electromagnetism. We were also “completely unaware” of the work in the fifties by Møller, Zel’manov, and Cattaneo on the 1 + 3 point of view. Driven by our ignorance of these matters and the lack of a clear discussion of their mathematical structure in the literature, we embarked on a study in which we learned a great deal not only about the larger framework in which these questions fit together, but also about the work that had been done and effectively buried in the past. Later joined by Donato Bini, we went on to examine some new aspects of the application of splitting formalisms to questions in general relativity. It seems not only valuable to share with the larger community what we have understood in a notation that easily allows comparison of the different approaches but also appropriate that some of the historical roots of the subject be recalled and not forgotten. Meanwhile during the decade of the nineties the 3 + 1 approach using the more modern point of view of congruences has enjoyed increasing visibility through many articles by Ellis and collaborators, and the need for understanding the relationships among various approaches has only increased. In this spirit the following monograph has grown to its present form. Certainly it is not meant to be exhaustive, especially where the 3+1 approach is concerned given the enormous amount of existing literature which deals with that case alone. Rather it is meant to reveal more clearly the relationships between the
vi various approaches in an effort to break down the artificial barriers which divide them. If this attempt is only partially successful, our efforts will have been well spent. The International Center for Relativistic Astrophysics at the University of Rome and its director Remo Ruffini are especially thanked for suggesting and supporting this project.
bob jantzen
vii Chapter 1
Introduction
1.1 Motivation: Local Special Relativity plus Rotating Coordi- nates
Most of us know special relativity pretty well and are quite happy switching back and forth between the space- time picture of 4-vector algebra and the space plus time picture of events occurring in space as time elapses, using 3-vector algebra. We have no difficulty using Lorentz transformations to transform 3-dimensional quantities from those measured by one inertial observer to another. We also have no problem extending our splitting algebra to spacetime derivative operators in inertial coordinates yielding the space plus time equivalent of time derivatives and spatial derivative operators. After all, these are the derivatives we began with before learning special relativity. Figure 1.1 suggestively compares these two pictures. Since old habits die hard, there is a strong incentive to push this habit into general relativity. This helps us interpret spacetime information in a curved spacetime using the same intuition that we have about classical 3-dimensional physics. The catch is that one no longer has a privileged class of “global inertial frames” as in special relativity which effectively allows a single inertial observer in flat spacetime to set up a preferred class of global inertial coordinate systems that may be used to interpret observations at all other points in spacetime. Essentially, one inertial observer in flat spacetime uniquely determines a family of inertial observers filling the spacetime with no relative velocities, and the “observations” of the original observer of an event not on his own worldline are understood to be those of the companion observer who is present. The only option allowing us to continue to make a spacetime splitting without relying on flatness is to give up the global splitting associated with a single preferred observer and settle for the local splitting of each member of a family of observers filling the spacetime and in arbitrary motion in the absence of any preference. Such a splitting takes place in the tangent space to each event in spacetime, describing the locally Minkowskian neighborhood of each observer in the family. If we agree to split each such tangent space based on the 4-velocity of the observer at a given event in the same way that we split flat spacetime globally based on the worldline of a single inertial observer, then all of our familiarity with special relativity can be transferred to general relativity in a rather straightforward way. The only difference is that what we did globally before, with the splitting at every spacetime point determined by the splitting at a single spacetime point, namely any point on the worldline of a chosen inertial observer, must be abandoned in favor of doing the same thing independently at each spacetime point, modulo continuity/differentiability conditions. There is one catch. Being creatures of habit, we like global splittings, so even though in general there is no preferred way of doing a global splitting, we can just do it arbitrarily, though clearly matters may be simplified if such a splitting can be adapted to any special structure that may exist in a particular spacetime. One then has to reconcile the local observer splittings with such a global splitting. This too is not anything particularly deep, and it involves using the linear algebra of nonorthogonal bases independently at each spacetime point (since such a global splitting will in general be nonorthogonal) to represent the orthogonal splitting of the local observers on spacetime. The new complication in general relativity is that in general one must deal with a family of so called
1 t ...... t ...... ~v ...... O ...... O ...... x ...... x
z z
......
...... ~v
...... y ...... y ...... x ...... x O O Figure 1.1: Spacetime and Space plus Time. Although spacetime is the arena where calculations are simpler, we always interpret them through our space plus time worldview, which depends on the choice of inertial observer.
2 “test observers” in arbitrary motion, and this introduces the well known effects that accompany noninertial (i.e., accelerated) observers even in nonrelativistic physics. However, in the classical example of a “rigidly rotating” family of noninertial observers in nonrelativistic physics, one still has a global correlation between the different members of the family of observers since one has a global (though not physical) Cartesian coordinate system which rotates with the passage of time. In the extension to general relativity one must consider such effects locally at each spacetime point. The effects of rigid rotation are well known and familiar. There is the centrifugal force that a body feels in the rotating frame even if it is not moving with respect to that frame due to the frame’s acceleration and the Coriolis force that a body feels if it is in motion with respect to that frame due to the frame’s rotation. Quantitatively in a system rotating with constant angular velocity Ω,~ there is a force on an otherwise free body ~x¨ = F~ /m = ~g + ~v × H,~ (1.1) where the “gravitoelectric” force (per unit mass)
~g = −A~ = −Ω~ × V~ = −Ω~ × (Ω~ × ~x) (1.2) is the negative of the acceleration field A~ of the rotating observers, whose velocity field is V = Ω~ ×~x. Similarly the “gravitomagnetic” force is the cross product of the body’s velocity ~v = ~x˙ in the rotating system with the “gravitomagnetic” vector field H~ = 2Ω~ = curl V~ , which is twice the angular velocity vector of the rotating frame and equals the local vorticity of the velocity field. The “gravitoelectromagnetic” terminology due to Thorne is in direct analogy with the Lorentz force of electromagnetism in a nonrotating system
(m/q)~x¨ = F~ /q = E~ + ~v × B.~ (1.3)
Note that the “gravitomagnetic” vector field H~ = curl V~ admits a vector potential V~ in the same way that the magnetic field B~ = curl A~ locally admits a vector potential. Similarly the “gravitoelectric” vector field
1 ~ ~ ~g = − grad[− 2 V · V ] (1.4)
admits a scalar potential in this time-independent case just like the conservative electric field E~ in electro- statics. The velocity field of the rotating observers and minus half its length squared (once multiplied by the mass m) serve as vector and scalar potentials for the noninertial forces we often call “fictitious” forces in classical mechanics, and they are directly interpretable in terms of kinematical properties of the velocity field of the family of noninertial observers used to describe the motion of the body being studied. These forces vanish as soon as we require the members of the family of observers to be inertial. In a curved spacetime such global frames are not immediately available, so one must analyse the situa- tion in the local rest space of each observer in the family of test observers used to describe the physics in 3-dimensional form. The kinematical properties of the 4-velocity field of these observers in spacetime, with some extra complication, directly generalize the above problem, leading to the introduction of “gravitoelec- tromagnetic” forces which enter into the force equation when expressed in terms of a family of noninertial observers. Rather than doing a global comparison of their motion with respect to a global inertial frame which does not exist in curved spacetime, it must be a local comparison at each point of spacetime with a suitable inertial observer having the same 4-velocity and a set of nonrotating spatial axes. Of course there are new features in curved spacetime which have no analogy in electromagnetism and this has to do with the spatial metric which describes the relative distances between nearby test observers as well as the different proper times that different observers use at the same event in spacetime, both of which depend on the gravitational field. These features, as well as the potentials for the gravitoelectromagnetic vector forces, are contained in the spacetime metric. Taking into account the local proper time complicates slightly the above analogy which relies on a global proper time function on flat spacetime. Thus we need relativistic definitions of acceleration, gradient, curl, time derivative, etc. This together with the details of the way in which an observer in spacetime measures quantities at a given event will enable us to push our present knowledge about special relativity and noninertial motion to the case of
3 general relativity. It helps explain the so called ADM or three-plus-one approach to general relativity as well as the slightly different Landau-Lifshitz approach and shows their similarities and differences, both of which involve the structure of a family of test observers together with a global time function on spacetime. It also shows how both relate to the Ehlers-Hawking-Ellis splitting approach, which is based only on a family of test observers with no global time function assumed to be available.
1.2 Why bother?
You might say, why invest a lot of time into understanding the details of this way of looking at all the different splitting approaches used in general relativity? After all, relativity physics liberated us from the prison of 3- dimensional language into the arena of spacetime where the true nature of kinematics and dynamics became much simpler to understand. Why try to climb back into the cage of 3-dimensional physics? Wasn’t centuries of solitary confinement enough? Well, many spacetimes and idealized problems we use in understanding gravitational theory practically beg us to do this. A rotating black hole spacetime, for example, has two very different privileged families of test observers, one of which is suited to the ADM picture and the other to the Landau-Lifshitz picture. The one we use depends on the question we want to study. Both turn out to be useful. The Ehlers-Hawking- Ellis picture provides the means to relate these two different pictures to each other, which is important if they both turn out to be needed, as they in fact do. The failure to investigate this “relativity of spacetime splitting formalisms” has kept most of us from having better intuition not only about black holes, or even the relativistic picture of the nonrelativistic problem of rotating coordinate systems (which continues to confuse people even now), but other interesting rotating spacetimes like the G¨odeluniverse. Many of us are somewhat familiar with (or at least aware of) the initial value problem for the ADM splitting of spacetime based on a family of spacelike hypersurfaces, but the corresponding problem for the splitting of Landau and Lifshitz based on a timelike congruence is almost unknown. In fact the exact solution industry for stationary spacetimes studies precisely this problem without the difficulties that breaking the stationarity symmetry introduces, since only the initial value problem remains of the Einstein equations in that case. The choice of metric variables made in studying this problem is adapted to the Landau-Lifshitz splitting, though few stop to think about its geometric significance or realize it is a decomposition parallel to the ADM splitting. Even if none of these special spacetimes interests you, if you are interested in any post-Newtonian calculations of more realistic, say isolated self-gravitating systems, then understanding the splitting game helps to make a little more sense out of what is universally done in that field. In recent years many references to “the gravitomagnetic field” have sprung up, but since no one took the time to unambiguously define just what this field was, controversy has blossomed between different schools of thought about just what a “real gravitomagnetic field” is. People can agree on a naive definition of the field in this stationary weak-field limit but for strong fields many different definitions are possible depending on the choices one makes for the way in which observers make measurements. All of these different choices can be fit into a general framework and related to each other. Of course the big difficulty in general relativity has always been the limited analytical computations that can be done exactly. An entire industry has grown up around the approximation schemes for general relativity which in practice involves rather complicated details. Certainly the naive things one can do for relatively simple exact solutions in terms of interpreting them in 3-dimensional form cannot be extended easily to the more realistic calculations done in post-Newtonian approximations to general relativity. However, one can interpret features of the approximate analysis in terms of the exact spacetimes which it is meant to approximate. Unfortunately many aspects of this exact geometry being approximated seem to be lost in the details of the approximation scheme itself. The language of the present discussion can help put these details into some perspective. The splitting of the gravitational field is closely related to the splitting of the electromagnetic field, though historically reversed in direction, since the individual electric and magnetic fields of classical Newtonian physics were unified into a single spacetime field by special relativity, while the spacetime metric of general relativity only later gave birth to the electric- and magnetic-like gravitational fields accompanied by the spatial metric. The four-dimensional form of Maxwell’s equations in a curved spacetime is very elegant
4 and powerful, perhaps because it is independent of any particular observers or local coordinates. However, in many practical applications, spacetime is endowed at least locally with either a preferred congruence of integral curves of a timelike vector field or a preferred slicing by a family of spacelike hypersurfaces or both, and it is convenient to decompose the electromagnetic field in some way using this additional structure. Of course the electric and magnetic fields measured by an observer in spacetime are well defined and easily expressed in an orthonormal frame adapted to the observer’s local rest space, but often coordinate systems and nonorthonormal frames prove more convenient for studying the field equations. In this context one must reinterpret the computational quantities which are naturally introduced in terms of some family of test observers. When the present work began, a clear discussion of the various approaches to formulating Maxwell’s equations in terms of three-dimensional quantities and their relationship to each other did not exist. More- over, the analogous discussion for the gravitational field itself was even more conspicuously absent in the literature. It therefore seemed useful to carefully develop the mathematics of the splitting formalisms in general relativity which provide the foundation for the subsequent splittings of the electromagnetic field as well as other matter fields on spacetime. In so doing one obtains a precise description of spatial gravitational force fields in the different points of view and of their interrelationships, as well as a clear exposition of the similarities between those nonlinear fields and the linear electric and magnetic fields. In view of the evolution of terminology which has taken place, it seems natural to refer to this analogy by the name of gravitoelectromagnetism. The analogy between the linear theory of electromagnetism and the linearized theory of general relativity was noted by Einstein even before arriving at his final formulation of the Einstein equations [Verbin and Nielsen, 2004], who compared the geodesic equations with the Lorentz force law. This analogy for the final theory was then soon spelled out explicitly by Thirring [1918]. Although the analogy in practice is usually considered for the linearized gravitational field, the implications and limitations of this analogy are best seen in the fully nonlinear context of general relativity. Of course as previously noted, an obvious question to ask is “Why bother to split spacetime at all?” Certainly the idea of a four-dimensional spacetime and its local Lorentzian geometry has been an important advance of this century. However, our intuition and experience are decidedly three-dimensional in charac- ter, and splittings of spacetime into space plus time allow us to interface better with the four-dimensional information, even when a splitting does not occur naturally. When it does, it can considerably simplify the presentation and interpretation of both the gravitational field and whatever matter fields are present. This is not to say that space-plus-time splittings are always useful. Sometimes 2 + 2-splittings are impor- tant, examples of which occur in spherical symmetry or in the null initial value problem. In many other instances splittings are actually unproductive, obscuring the spacetime structure of a problem. However, when splittings are useful, they are worth doing carefully. Many different points of view may be taken in splitting spacetime into space plus time but their inter- relationships are rarely considered. This is a useful thing to do since our intuition is based on the standard splitting of flat spacetime, but the three-dimensional quantities which serve as a foundation for this intu- ition often find themselves associated with distinct points of view in the context of a general splitting of an arbitrary spacetime. The result is that no single point of view captures all aspects of our three-dimensional intuition, and the particular application really should determine the choice which is most appropriate. How- ever, no common mathematical framework and no common notation yet exist to enable one to easily switch point of view or compare different points of view. Most relativists are consequently prisoners of the language of one particular choice. The goal of the present work is to establish such a common mathematical framework to help break down the barriers which exist between different schools of relativists who have settled upon a single choice of point of view. Unfortunately no text on general relativity can spare the space to do justice to this idea of a relativity of splitting formalisms, so we learn general relativity from one school or another but rarely appreciate more than one approach in our working lives even if we are relativists. The present text attempts to provide both a universal language and the detailed formulas which describe the relatively straightforward but systematic analysis of these various approaches.
5 1.3 Starting vocabulary
An overriding necessity in this enterprise is a careful definition of the vocabulary to be employed since the standard labels which occur in these discussions do not have universally accepted interpretations. All of the various splitting points of view can be nicely classified and will be assigned labels according to a neutral scheme independent of any particular surnames, thus sidestepping the issue of who “owns” which ideas. Each splitting point of view is based on two fundamental concepts: measurement and evolution, the realization of which differ for each of the possible choices. Before sketching the history of this topic, it is useful to establish some of the basic terminology to be used in what follows. Of course in order to encompass all of the approaches one finds in the literature in a simple scheme, one must be a bit loose about exactly what mathematical details characterize each of the basic categories into which these approaches will be divided. The notions of time and space are complementary since a “time line” represents “time elapsing at a point fixed in space” while a “time hypersurface” represents “space at a moment of time”. These two different notions of time, the first which focuses on measuring time at a single point of space and the second which is associated with some kind of synchronization of times at different points of space, will be assigned the labels “time” and “space” respectively. These divide the splitting points of view into two categories, those in which a local time direction is fundamental, and those in which a nonlocal correlation of local times, i.e., space is fundamental. In the time category, the “time lines” must be timelike in order to represent a local time direction at each event in spacetime, while in the space category, the “time hypersurfaces” or “spaces” must be spacelike in order to be associated with a moment of time in the usual sense of a Riemannian space (alternatively, in order that orthogonality define a local time direction). Given this division, one may consider a partial splitting or a full splitting depending on whether any additional structure is assumed. Table 1.1 establishes this general classification of points of view and the terminology that will be used to describe it. Given no additional structure, one has only a partial splitting of spacetime, splitting off either the time or the space alone. In the first case, to be called the “congruence point of view,” one has only a timelike congruence at one’s disposal, with a unit timelike tangent vector field u. Spacetime will be assumed to be time-oriented as well as oriented, so one may assume that u is future-pointing. It may then be interpreted as the 4-velocity of a family of test observers whose worldlines are the curves of the congruence, and it determines the local time direction at each point of spacetime. The orthogonal complement of this local time direction in the tangent space is the local rest space LRSu of the test observer at that event. It is exactly this structure that one needs for the measurement process which will be the same for the full and partial splittings in a given category in Table 1. The orthogonal decomposition of the tensor algebra induced by this decomposition of the tangent space at each event will define the measurement process, modulo a final step in which projection along the local time direction is replaced by contraction with u, yielding a collection of “spatial tensor fields” of different rank for each spacetime tensor field that is split. In general the splitting of the tangent spaces does not extend to the spacetime manifold. Only in the special case that the rotation of u vanishes does such an extension exist and one has a family of orthogonal spacelike hypersurfaces which slice the spacetime, leading to a full splitting in this category to be discussed below. The second case in the category of partial splittings of spacetime, that of a spacelike slicing of space- time with no additional structure, is essentially equivalent to the special case of a nonrotating congruence since every spacelike slicing admits a family of timelike orthogonal trajectories. These are the integral curves of the (rotation-free) unit normal vector field n to the slicing, which may be assumed to be future-pointing. The accompanying point of view, for the sake of completeness, might be called the “hypersurface point of view”. Its measurement process is associated with the normal congruence, taking u = n as the 4-velocity of the family of test observers who do the measuring. The local rest spaces of this family are integrable and coincide with the subspaces of the tangent space which are tangent to the slicing. A full splitting of spacetime at the manifold level requires both a slicing of the spacetime and a congruence, to be referred to as a “threading” of the spacetime, together with a compatibility condition that the two families be everywhere transversal. Such a structure will be called a “nonlinear reference frame” in order to distinguish it first from the terms “reference frame,” “frame of reference,” “reference system” and “system of reference” that one finds in the literature, second from the connotation of “frame” in the context of a linear frame of vector fields, and third from related terminology which occurs in the discussion of globally
6 time: 1 space: 3
(“single-observer” time) (“moment of time”)
time: 1 space: 3 ...... PARTIAL SPLITTING ...... u or LRSu ...... GE, GM fields ...... (3) congruence p.o.v. (4) hypersurface p.o.v.
time + space: 1 + 3 space + time: 3 + 1
...... FULL SPLITTING ...... parametrized nonlinear ...... a ...... reference frame: {t, x } ...... GE, GM fields ...... and potentials ...... (2) threading p.o.v. (1) slicing p.o.v.
spacelike local rest timelike observers spaces (timelike normal TIME gauge ↔ threading observers) ↔ slicing
arbitrary synchronization arbitrary identification SPACE gauge of observer times of “points of space” ↔ slicing ↔ threading
Table 1.1: A characterization of the different points of view (p.o.v.) that may be adopted in splitting spacetime. Solid lines in diagrams imply the use of the appropriate causality condition while dashed lines indicate that no causality condition is assumed. The hypersurface p.o.v. is essentially equivalent to the vorticity-free congruence p.o.v. The reference p.o.v. corresponds to a full splitting in which no causality assumptions are made.
7 constant frames in flat spacetime. A “parametrized nonlinear reference frame” will consist of a nonlinear reference frame together with a choice of parametrization of the family of slices. Such a parametrization defines a specific “time function” t on the spacetime which in turn provides an obvious parametrization for each curve in the threading congruence. In the category of full splittings, the distinguishing criterion is the causality condition imposed on the nonlinear reference frame. In the “slicing point of view” the slicing is assumed to be spacelike, but no assumption is made about the causality properties of the threading, which serves only as a way of identifying the points on different slices. In the “threading point of view,” the threading is assumed to be timelike, but no assumption is made about the causality properties of the slicing, which serves only to synchronize in some arbitrary fashion points on different curves in the congruence. If both causality conditions hold, then both points of view hold and one can transform between them. On the other hand it can also be useful in the case that at least one of the two causality conditions holds to not take advantage of that condition and exploit only the structure of the nonlinear reference frame that does not depend on it. This leads to the “reference point of view,” whose measurement process is associated with the nonorthogonal decomposition of the tangent space into the direct sum of a 1-dimensional subspace tangent to the threading and a three- dimensional subspace tangent to the slicing. One can always relate either the slicing or threading points of view to this acausal approach, which is the way in which they are usually represented in a local coordinate system adapted to the nonlinear reference frame. The partial splittings may be related to the full splittings in different ways. In the threading point of view one may define a (future-pointing) unit timelike tangent vector field m along the threading congruence, while in the slicing point of view one has the (future-pointing) timelike unit normal vector field n. By making the respective choices o = m and o = n (“o” for “observer”) of the 4-velocity of a privileged family of test observers in these two points of view, the identification u = o relates each of them to a corresponding congruence point of view described above, defining for each a measurement process. When both the slicing and threading points of view hold, then a unique boost in each tangent space relates the two timelike unit vectors m and n and this may be extended to a transformation of the measurement process. In the special case of an orthogonal nonlinear reference frame (one for which both the causality conditions hold and the slicing and threading are everywhere orthogonal), then m = n and the two points of view coincide. Evolution is defined first by a choice of a 1-parameter group of diffeomorphisms of the spacetime into itself which in some sense advances into the future (either its orbits are timelike or it pushes certain spacelike hypersurfaces into their future), and second by a choice of transport along its orbits for the spatial fields of the given point of view. For a partial splitting only one congruence is available and it is timelike. In the absense of additional structure one can take u or n respectively in the congruence or hypersurface points of view as the generator of such a group, and choose either spatially-projected (“spatial”) Lie transport (“noncovariant” but integrable) or spatially-projected parallel transport (“covariant” but in general nonintegrable) along this congruence. The latter transport of spatial fields coincides with Fermi-Walker transport which defines locally nonrotating axes along a worldline. Each of these choices may be extended to the full splitting in its category but it is the spatial Lie transport along the threading congruence which defines the evolution relative to the nonlinear reference frame, since fields which are “rigidly” attached to this frame do not evolve with this choice. However, unlike Fermi-Walker transport, spatial Lie transport is in general incompatible with orthonormal frames. A compromise between the two kinds of transport leads to co-rotating Fermi-Walker transport, which is the closest one can get to attaching an orthonormal frame to the nonlinear reference frame.
1.4 Historical background
Armed with this initial vocabulary, the historical background may be sketched in a way that puts the different formalisms into some perspective. The slicing and threading points of view today are introduced to most of us through two leading textbooks, respectively Gravitation by Misner, Thorne, and Wheeler [1973] and The Classical Theory of Fields by Landau and Lifshitz [1975], each of which carefully avoids mention of the “competing” point of view. Both points of view can be traced back to the early forties when the first edition of the Landau-Lifshitz text [1941] introduced the threading point of view splitting of the spacetime metric and, in the stationary case, of the spacetime connection to yield spatial gravitational
8 forces, as still described in their last edition. Soon after, Lichnerowicz [1944] introduced the beginnings of the slicing point of view with an article discussing the initial value problem in an orthogonal nonlinear reference frame. His later book on general relativity and electromagnetism [Lichnerowicz 1955] curiously enough makes use of the threading split, but in actual applications uses an orthogonal nonlinear reference frame in which the two points of view agree. The threading point of view apparently dominated during the fifties when much interest was focused on the equations of motion for test particles. Møller discussed a parametrization-dependent definition of spatial gravitational forces for a general spacetime in the first edition of his text The Theory of Relativity [Møller 1952] at the beginning of the decade. This was then refined to a parametrization-independent splitting later in the decade by Zel’manov [1956, 1959] in the Soviet Union and then independently by Cattaneo [1958,1959a,b,c] in Italy. Unfortunately most of the few references to these works that do appear in the literature cite papers written in Russian, Italian or French, so one must dig to find English versions. The most accessible discussion of much of this material is the second edition of Møller’s text [1972] which describes it in detail and also contrasts it with his original splitting. However, for some reason this text itself is not very prominent among the relativity texts one usually encounters for those of us who have entered the field since the sixties, perhaps because of its old-fashioned viewpoint. Of these authors, only Zel’manov [1956] discussed the splitting of Einstein’s equations in the general case. Meanwhile the slicing point of view was further developed during the fifties by Choquet-Bruhat [1956] and Dirac [1959]. Choquet-Bruhat extended Lichnerowicz’s initial value discussion to the general case, identifying but not naming the lapse and shift variables of the slicing point of view using orthonormal frame techniques. Dirac recognized the significance of the slicing point of view metric decomposition for the Hamiltonian dynamics of general relativity and its relation to his theory of constrained Hamiltonian systems. This was then refined in a series of papers at the turn of the decade by Arnowit, Deser and Misner who used the Hamiltonian formulation permitted by the slicing point of view to study the true degrees of freedom of the gravitational field, culminating in an often cited review article [Arnowit, Deser and Misner 1962]. This ushered in the new era of domination of the splitting scene by the slicing point of view, pushed by the problem of quantum gravity where Hamiltonian techniques have played a rather important role in an endless quest that has not yet met success. The notation of Arnowit, Deser and Misner, soon labeled by Wheeler’s lapse and shift terminology [Wheeler 1964] and later effectively propagated by the text of Misner, Thorne and Wheeler [1973], has found widespread acceptance. The slicing point of view is also commonly referred to as the “3+1” or ADM formalism. A number of useful reviews of various aspects of this formalism exist, among them being articles by York [1979], Isenberg and Nester [1980], Fischer and Marsden [1978, 1979] and Gotay et al [1991]. The term “1 + 3” formalism with its obvious change in emphasis has been suggested as an alternative label for the threading point of view. Although Cattaneo stopped his analysis of the threading point of view at the connection level, Cattaneo- Gasperini [1961, 1963] and Ferrarese [1963, 1965] continued it to the curvature level, studying the splitting of the spacetime curvature and of Einstein’s equations, and the various definitions of spatial curvature that are possible. This previously unexplored area of differential geometry dealing with a degenerate but nonintegrable connection, namely the spatial covariant derivative or transverse covariant derivative which occurs in the threading and congruence points of view, has never been fully understood in the context of the initial value problem for those points of view. This problem, which has been discussed by Stachel [1980] and Ferrarese [1987,1988, 1989], is of a completely different character than in the slicing point of view where it is rather well understood, and its resolution is still an open problem. The approach of Cattaneo and Ferrarese to the threading point of view was reformulated by Massa [1974a,b,c] and used to discuss gyroscope precession [Massa and Zordan, 1975]. This latter problem, more than any other, has focused attention on the effects of spatial gravitational fields. More recently Perjes [1988] and Abramowicz [1988, 1990] have considered variations of Møller’s parametrization-dependent threading approach. In the slicing point of view the natural extension of the splitting of the metric to the splitting of the connection and the discussion of spatial gravitational forces has only recently been considered, perhaps forgotten in the emphasis on the initial value problem and Hamiltonian dynamics. The roots of this discussion can be traced back to the original threading point of view work, although this link is not made apparent in citations. At the crossover point between the popularity of the threading and slicing points of view, Forward [1961] described the analogy between electromagnetism and linearized general relativity using the reference point of view, in a slight variation of Møller’s formalism. In the late seventies this article then
9 inspired a reference point of view discussion of the PPN formalism by Braginsky, Caves and Thorne [1977] who introduced an “electric-type” gravitational field and a “magnetic-type” gravitational field, the latter of which became the “gravitomagnetic” field of the eighties in a discussion of linearized general relativity by Braginsky, Polnarev and Thorne [1984]. In linearized gravity, in the usual weak field slow motion discussions, the corresponding spatial gravitational fields defined in the threading, slicing and reference points of view are very closely related and agree to the lowest order, although not to full post-Newtonian order. Curiously enough, it is always the threading fields which are used in post-Newtonian discussions even when the slicing point of view is advocated before linearization. The introduction of the terminology “gravitoelectric” field and the first discussion of slicing spatial gravitational forces finally appeared in the text Black Holes: The Membrane Paradigm by Thorne et al [1986], but only in the case of a stationary gravitational field in their treatment of black hole spacetimes. This may be extended in an obvious way to general spacetimes in a notation which shows the close relationship to the threading point of view as will be described below. The congruence point of view is briefly introduced in an article by Hawking [1966] and at great length in a pair of articles by Ellis [1971, 1973] at the beginning of the seventies (later updated by Ellis and van Elst [1998]), all based on earlier unifying work of Ehlers [1961] and of Kundt and Tr¨umper [1962] unavailable in English until the Ehlers article alone finally appeared in tranlation over three decades later [1994]. Completing the congruence vector field u to an orthonormal frame leads to the explicit orthonormal frame approach of Estabrook and Wahlquist [1964], who in a note added in proof in their article thank Pirani for calling their attention to Cattaneo’s work and cite articles in French; they were not the first or the last to have been as they say “completely unaware of this work.” (Cattaneo himself appears to have been completely unaware of Zel’manov’s work, while the present authors were completely unaware of both when this project was begun.) The hypersurface point of view is described in the article by Zel’manov [1973] in the early seventies in terms of a nonlinear reference frame, but some work is required to decipher his notation. Ehlers [1961] and Ellis [1971] also treat the hypersurface point of view as a special case of the congruence point of view. These various splittings of spacetime are particularly interesting in the case of electromagnetism, where all of our intuition is tied to individual electric and magnetic fields, and astrophysical applications can be aided by allowing this intuition to find expression in the context of a splitting. Early work by Ruffini and collaborators [Hanni and Ruffini 1973, Hanni and Ruffini 1975, Ruffini and Wilson 1975, Damour and Ruffini 1975, Hanni 1977, Damour et al 1978, Ruffini 1978] followed up in more detail by Damour [1978, 1982] revealed the utility of introducing the concept of electric and magnetic fields in studying black hole systems. This was discussed later in great detail from the slicing point of view by Thorne and Macdonald [1982], who summarize the history of the different splittings in general and as applied to Maxwell’s equations, and by Thorne et al [1986] for application to black hole systems. Maxwell’s equations may be expressed in the congruence point of view as done by Ellis [1973], in the threading point of view as done by Benvenuti [1960] in an application of Cattaneo’s formalism unfortunately appearing only in Italian, and in the slicing point of view as described by Misner, Thorne and Wheeler [1973] (for the correct vector potential splitting, see Isenberg and Nester [1980], for example). The reference point of view splitting is much older, dating back to the beginnings of general relativity in an article by Tamm [1924], as noted by Skrotsky [1957] and Plebanski [1960]. This appears in a peculiar mix of the reference and threading points of view in an exercise in the Landau and Lifshitz text [1975]. The mystery of this latter approach has caused its share of confusion about how one should define electric and magnetic fields in applications in general relativity. Hanni [1977] has given the complementary version which mixes the reference and slicing points of view. It is rather interesting to compare each of these numerous splittings of Maxwell’s equations with the others. Maxwell’s equations from the slicing point of view, first considered by Misner and Wheeler [1957] using the language of differential forms, are discussed in great detail by Thorne and Macdonald [1982] and by Thorne et al [1986]. An enormous language barrier exists at present between the slicing and threading points of view, prevent- ing those versed in the formalism and notation of one from easily penetrating the other or understanding how the two are related. This is exactly the problem the present exposition hopes to address, namely the lack of a common mathematical framework to discuss both approaches on an equal footing. Both of these splitting formalisms can be developed in a completely parallel way as complementary aspects of a single geometrical structure imposed on spacetime (the nonlinear reference frame), aspects which in a close way are related by
10 the same duality that links contravariant and covariant fields on the spacetime manifold. Furthermore, each of these approaches has important ties with the congruence point of view which invariantly describes the geometry of the observer congruence and with the reference point of view which links these discussions to adapted coordinate systems in practice. The style of the threading approach as usually presented is some- what more cumbersome than the slicing one, so it will be recast in the slicing style, generalizing from adapted coordinate systems to adapted local frames. In the special case of an orthogonal slicing and threading of a spacetime, the two descriptions will then coincide. This standardization of ideas and notation can also prove useful in approximation techniques, including the recent axiomatization of the idea of a Newtonian limit of general relativity based on a family of spacetime splittings as discussed by Ehlers [1989], Lottermoser [1989], and Schmidt [Ehlers, Schmidt and Lottermoser 1990]. The relatively unknown geometry of the spatial connection of the congruence and threading points of view is also currently of interest in the reformulation of gauge-invariant perturbation theory for Friedmann- Robertson-Walker spacetimes by Ellis and coworkers [Ellis and Bruni 1989, Ellis, Hwang, and Bruni 1989]. Each of these limits concern special cases of the problem of perturbation theory for a general spacetime from the somewhat unfamiliar alternative points of view to be discussed in this monograph. Chapter 2 describes the congruence point of view, which is later used to define the measurement process for the slicing and threading points of view. Chapter 3 studies the splitting geometry associated with a nonlinear reference frame and discusses both the slicing and threading points of view and their relationship to the congruence point of view and the reference point of view. Chapter 4 discusses electromagnetism in detail from each of the points of view while Chapter 5 considers the very important case of stationary spacetimes and the Sagnac effect and synchronization questions. The special case of flat spacetime in rotating coordinates clarifies the relationship of the spatial gravitational forces to the centrifugal and Coriolis forces. Chapter 6 discusses the weak field, slow motion limit and almost Friedmann-Robertson-Walker spacetimes. Although this text contains many formulas, only a few simple ideas applied in a methodical way are be- hind much of the detail. Orthogonal decomposition, by itself or represented in the context of a nonorthogonal decomposition, is at the heart of the algebra of respectively partial and full splittings of spacetime. Orthog- onal projection of differential operators in the same spirit then provides the differential tools necessary to extend the splitting algebra to include derivatives.
1.5 Orthogonalization in the Lorentzian plane
The heart of linking spacetime splittings to coordinate systems is a simple idea, but one which receives little attention in our academic preparation: the use of nonorthogonal coordinate systems. It is very useful to look at the Euclidean and Lorentzian 2-planes E2 and M 2 to recall first a case for which our geometric intuition holds and then see how it differs in the spacetime arena.
The Euclidean example: general linear coordinates Given any two nonzero linearly independent vectors (X,Y ) in R2 with its usual Euclidean structure, then expressing the position vector r = xX + yY defines its coordinates (x, y), which may be thought of as functions on R2. Then the self-dot product of the position vector defines the quadratic distance formula
~r · ~r = Ax2 + Bxy + Cy2 , (1.5) where A = X · X,B = X · Y,C = Y · Y. (1.6) Assume this is positive-definite: B2 − 4AC < 0, A > 0, C > 0. Figure 1.5 illustrates the geometry. There are two choices for completing the square on the quadratic form:
B B2 B2 B r · r = A(x + y)2 + (C − )y2 = (A − )x2 + C(y + x)2 , 2A 4A 2C 2C (1.7) B2 B2 = A(x)2 + (C − )(y)2 = (A − )(x=)2 + C(y=)2 , 4A 4C
11 ...... y ...... Y ...... ~r ...... x ... X ...... = .... y ...... B y ...... − X .. . 2A ...... = ...... Y . ... Y . Y = Y ...... ~r . . . ~r ...... x ...... X .. . X = X ...... B ...... − Y ...... 2C = ...... X ...... x=
Figure 1.2: General linear coordinates on E2 and their two orthogonalizations. The dashed lines indicate the unit coordinate rectangle and the dotted lines the projections of a typical vector along the coordinate axes. where new adapted coordinates are defined by B B x = x + y , x = x − y , x= = x , x = x= , 2A 2A (1.8) B B y = y , y = y , y= = y + x , y = y= − x= , 2C 2C The new coordinate systems correspond to the new bases
= = r = xX + yY = xX + yY = x=X + y=Y (1.9) which transform in a complimentary way
= B = B = X = X,X = X, X = X − Y,X = X + Y, 2C 2C (1.10) B B = = Y = Y − X,Y = Y + X, Y = Y,Y = Y, 2A 2A In each case one basis vector is retained and the additional new basis vector is obtained by orthogonal projection of the other, a process complimentary to completing the square on the corresponding coordinates. Figure 1.5 illustrates these projections and the new unit coordinate rectangles.
The Lorentzian case: general linear coordinates The Lorentzian case of two-dimensional Minkowski space M 2 is similar but our geometric intuition no longer holds.
12 Given a pair of vectors (X,T ) in M 2, one spacelike, the other timelike, then expressing the position vector r = xX + tT defines its coordinates (x, t), which may be thought of as functions on M 2. Then the self-dot product of the position vector defines the quadratic distance formula
~r · ~r = Ax2 + Bxt + Ct2 , (1.11) where A = X · X,B = X · T,C = T · T. (1.12) Assume this is a Lorentzian inner product: B2 −4AC > 0, A > 0, C < 0. Figure 1.5 illustrates the geometry. Again there are two choices for completing the square on the quadratic form:
B B2 B2 B r · r = A(x + t)2 + (C − )t2 = (A − )x2 + C(t + x)2 , 2A 4A 2C 2C (1.13) B2 B2 = = A(x)2 + (C − )(t)2 = (A − )(x=)2 + C(t )2 , 4A 4C where new adapted coordinates are defined by B B x = x + t , x = x − t , x= = x , x = x= , 2A 2A (1.14) = B = B t = t , t = t , t = t + x , t = t − x= , 2C 2C The new coordinate systems correspond to the new bases
= == r = xX + tT = xX + tT = x=X + t T (1.15) which transform in a complimentary way
= B = B = X = X,X = X, X = X − T,X = X + T, 2C 2C (1.16) B B = = T = T − X,T = T + X, T = T,T = T, 2A 2A In each case one basis vector is retained and the additional new basis vector is obtained by orthogonal projection of the other, a process complimentary to completing the square on the corresponding coordinates. Figure 1.5 illustrates these projections and the new unit coordinate rectangles. The first orthogonalization remains the same, but the second orthogonalization changes due to the change in sign of C. This whole discussion may be transferred to a tangent space where the starting basis vectors are two frame vectors and the coordinate functions correspond exactly to the dual 1-forms. If one pictures a 1-form σ geometrically in the tangent space by associating with it the two subspaces σ(~r) = 0, σ(~r) = 1, then the parallel sides of the unit coordinate rectangle serve this purpose for the two dual 1-forms. Note that the dual 1-form parallel sides for one basis vector are parallel to the other basis vector so that the natural evaluation gives 0. If we assume that the original basis of the tangent space is a coordinate frame X = ∂x,T = ∂t, then the dual 1-forms are dx and dt. The figure shows that the first orthogonalization does not change dt, while the second one does not change dx. The quadratic form is then the line element
2 2 2 ds = gxx dx + 2gtx dt dx + gtt dt , (1.17) and the space orthogonalization is
2 2 gtx 2 gtx 2 ds = gxx (dx + dt) + (gtt − ) dt gxx gxx x 2 2 2 = gxx (dx + N dt) − N dt , (1.18)
13 ...... t ...... ~r ...... T ...... x ...... X ...... = ...... t ...... t ...... B ...... − X ...... 2A ...... = ...... x ...... ~r . . ... ~r ...... = ...... T . . T .. . . T = T ...... = . B ...... − T ...... 2C ...... X ...... x .. X . X = X ...... Figure 1.3: General linear coordinates on M 2 and their two orthogonalizations. The dashed lines indicate the unit coordinate rectangle and the dotted lines the projections of a typical vector along the coordinate axes. while the time orthogonalization is
2 2 2 gtx 2 gtx 2 ds = (gxx − ) dx + gtt (dt + dx) gtt gtt 2 2 2 = γxx dx − M (dt + Mx dx) . (1.19)
These two distinct completions of the square lead immediately to a suggestive lapse function and shift vector notation for each case which describes the geometry of the associated projections in the context of the original coordinate frame X = ∂x , X · X = gxx , x 2 T = ∂t + N ∂x , T · T = −N , (1.20) = = = X = ∂x + Mx∂t , X · X = γxx , = = = 2 T = ∂t , T · T = −M ,
1.6 Notation and conventions
The conventions of Misner, Thorne and Wheeler [1973] will be followed unless stated otherwise or unless an ambiguity arises. Lower case Greek indices will take the values 0,1,2,3 and lower case Latin indices the values 1,2,3. The signature of the spacetime metric will be (-+++). A convenient mix of index and index- free notation will be used in our discussion in order to bridge the gap between those who feel comfortable without indices and those who do not. This leads to a problem when in the index notation the kernel symbol of an object is used to denote its trace, determinant, norm or other scalar property or if the same kernel symbol is used for two different objects. In these cases obvious distinguishing marks must be added
14 to the index-free symbols which in our applications will turn out not to be too cumbersome. For example, “index shifting” with the spacetime metric associates a single kernel symbol with many different objects. The “index lowering” and “index raising” maps associated with the spacetime metric will be denoted by [ and ] respectively, and the symbols S[ and S] will refer to the fully covariant and fully contravariant forms respectively of a given tensor field S. This gives a unique symbol for one-index objects if we identify the symbol alone with the index position which best characterizes the properties of the object or for two-index objects if we agree to use the kernel symbol alone for the mixed object. (4) (4) The spacetime metric tensor itself gαβ will be denoted by g in an index-free notation (and the inverse (4) αβ (4) −1 (4) ] (4) (4) or contravariant metric tensor g by g = g ) so that the symbol g ≡ | det( gαβ)| can denote (the absolute value of) its determinant as is customary in the index notation. The prefix “ (4) ” will distinguish certain four-dimensional objects from related three-dimensional objects for which it is convenient to use the same kernel symbol. Similarly the lapse function N and the shift vector field N α of the slicing formalism lead to the need for using N~ to indicate the shift in an index-free notation. The same problem will exist with the lapse function M and shift 1-form Mα to be introduced below for the threading point of view, so the 1-form = will be distinguished by the index-free symbol M. The two parallel lines over the kernel symbol recall the geometrical interpretation of a covector in terms of a family of parallel planes in its related vector space in the same way that the arrow oversymbol recalls the geometrical interpretation of a vector. These symbols = = interact with those for index-shifting in an obvious way, with N ≡ N~ [ and M~ ≡ M ]. One also needs an index-free notation for contraction of tensors. A generalized vector contraction notation will prove useful. Let the left contraction S T denote the tensor product of the two tensors S and T with a contraction between the rightmost contravariant index of S with the leftmost covariant index of T (i.e., ...α ... S ... T α...), and the right contraction S T the tensor product with a contraction between the leftmost ... α... contravariant index of T with the rightmost covariant index of S (i.e., S ...αT ... ), assuming in each case that such indices exist. Each of these contractions may themselves be generalized to an ordered contraction of sets of p adjacent indices, indicated by p and p respectively, reducing to the previous contractions when p = 1. Finally the spacetime or portion of spacetime under discussion will always be assumed to be orientable and time-orientable and local coordinates and frames will be assumed to be compatible with these orientations when appropriate. The unit oriented volume 4-form (4)η is defined by the component expression in an (4) 1/2 oriented frame ηαβδγ = g αβδγ with 0123 = 1, so that in an (oriented) orthonormal frame one has 0123 η0123 = 1 = −η . The dual of a p-form will then be represented in index form as a left contraction with (4)η as in Misner, Thorne and Wheeler. For a 2-form F the order does not matter and the dual is
∗ 1 (4) γδ Fαβ = 2 Fγδ η αβ . (1.21)
One must be careful when comparing formulas involving (4)η with the convention which uses the ordered indices 1,2,3,4 as in Hawking and Ellis [1973] rather than 0,1,2,3. This problem (among others) affects formulas for the vorticity of a timelike congruence. The one exception to the conventions of Misner, Thorne and Wheeler made in the present text is a reversal of the order of the covariant indices on the symbol for the components of the connection in a spacetime frame {eα} following Hawking and Ellis [1977]
(4) (4) γ ∇eα eβ = Γ αβeγ = eβ;α . (1.22) This convention might be called the “del” convention in contrast with the “semicolon” convention in which the order of the covariant indices instead follows the semicolon notation. Both the comma and subscripted partial notation will be used to indicate the frame derivatives of functions
eαf = ∂αf = f,α . (1.23)
One notational conflict between the old fashioned component notation and modern frame notation is the convention which doesn’t distinguish between the derivatives of components and the components of α α derivatives. For example, £XY = [X,Y ] conventionally denotes the components of the Lie derivative of
15 the vector field Y and not the Lie derivative of its components, which are instead represented by XY α = α β Y ,βX . On the other hand in the frame notation it is natural to write for the product rule
α α α α £X(Y eα) = (£XY )eα + Y (£Xeα) = [£XY ] eα , (1.24)
α α and here the symbol £XY = XY instead really does stand for the Lie derivative of the components. The context should always make clear which meaning is intended, and if it doesn’t, an explicit comment will. The notation introduced here may be found to be somewhat cumbersome in a particular application by someone already familiar with that particular application and accustomed to a more streamlined choice of symbols. However, the point is not to develop a notation that is the simplest to use in a specific application, but one which is capable of describing unambiguously all possible applications and their relationships. One may always later adopt an abbreviated notation for specific calculations by elimination of some of the qualifying marks on the kernel symbols. Given this introduction to the notational philosophy to be employed in the text, one can find in Appendix A a list of actual formulas from differential geometry that will be required. All discussion will be local in character. The region of four-dimensional spacetime where the discussion is valid will be designated by (4)M, and this region will be referred to simply as spacetime. A good example to keep in mind is the exterior of the event horizon in a black hole spacetime, for example, where the Boyer-Lindquist coordinates are valid. The threading point of view is valid outside the ergosphere (the time coordinate lines are are timelike), while the slicing point of view is valid outside the event horizon (the time coordinate hypersurfaces are spacelike).
16 Chapter 2
The congruence point of view and the measurement process
Although special relativity may be studied with all of the tools of general relativity, the existence of a special class of “inertial observers” in Minkowski spacetime enables one to simplify considerably the analysis of its spacetime geometry. With each such observer in Minkowski spacetime one can associate an entire family of such observers of zero relative velocity filling the spacetime and one may synchronize the proper times of all of them so that they describe a global time function for the family. This family constitutes the global reference frame of the given observer. The 4-velocity field of this family of observers is a covariant constant timelike unit vector field on Minkowski space. Two inertial observers in relative motion lead to two distinct such global reference systems. Although the worldlines of two particular inertial observers in relative motion may be transformed into each other by an active Poincar´etransformation of Minkowski space, their associated reference frames may be transformed into each other with a pure Lorentz tranformation or “boost” of Minkowski spacetime into itself about some arbitrary point. This in turn may be identified with the boost which maps the 4-velocity associated with the first reference frame onto that of the second in the tangent space at each point of Minkowski spacetime. It is this globally constant boost (the mixed tensor field which represents it is covariant constant) which allows one transform the measurements of physical quantities at some spacetime point from one family of observers to the other. However, in a general spacetime in general relativity, no such preferred families of inertial observers exist, and one must rely on arbitrary families of observers to establish a reference frame with which to measure physical quantities. The global boost of special relativity relating two families of inertial observers gives way to a point by point boost between the 4-velocities of the observers in the tangent space at each spacetime point associated with two families of observers covering the spacetime. Thus all of the algebra associated with a “single” boost of Minkowski space between two inertial reference frames must be repeated at each point independently in general relativity in order to relate the two reference frames associated with two families of observers. Furthermore, the differential operators associated with a single family of observers necessarily reflect the state of motion of those observers, leading to the description of noninertial effects due to their acceleration and relative rotation. The analysis of these questions is the natural generalization of special relativity kinematics to general relativity.
2.1 Algebra
Suppose u is a future-pointing timelike unit vector field on our spacetime (4)M
(4) α β (4) αβ α −1 = gαβu u = g uαuβ = uαu , (2.1) −1 = (4)g(u, u) = (4)g−1(u[, u[) = u[(u) .
17 It may be interpreted as the 4-velocity field of a family of test observers whose worldlines are the integral curves of u, each of which may be parametrized by the proper time τu which is defined up to an additive constant on each curve. This defines an observer congruence on the spacetime which may be used to induce a pointwise orthogonal decomposition of the spacetime tensor algebra. (Sachs and Wu [1977] call u itself a “reference frame” and its proper time parametrized integral curves the “observers”.)
2.1.1 Observer orthogonal decomposition The orthogonal decomposition of the spacetime tensor algebra begins with the orthogonal direct sum of each tangent space into the “local time axis” along u and the “local rest space” LRSu orthogonal to u. This may be accomplished using the orthogonal projection operators T (u) for the temporal projection and P (u) for 1 the spatial projection. Each of them can be identified with a 1 -tensor field, defined by α α [ T (u) β = −u uβ ,T (u) = −u ⊗ u , (2.2) α α α (4) [ P (u) β = δ β + u uβ ,P (u) = id + u ⊗ u , which are related to the identity tensor, the metric tensor and its inverse tensor in the following way
α α α (4) δ β = T (u) β + P (u) β , id = T (u) + P (u) , (4) (4) [ [ gαβ = T (u)αβ + P (u)αβ , g = T (u) + P (u) , (2.3) (4)gαβ = T (u)αβ + P (u)αβ , (4)g−1 = T (u)] + P (u)] . Here T suggests “time” or “tangential” (to the congruence), while P suggests “perpendicular” or “projec- tion”. The latter makes sense since often attention is focused on the spatial projection tensor P (u) which is conventionally denoted by the kernel symbol h, while the temporal projection T (u) is usually suppressed in favor of contraction with the timelike unit vector field or 1-form. 1 1 -tensor fields may be interpreted as linear transformations of each tangent space into itself α α β X → A βX ,X → A X (2.4) and the right contraction corresponds to matrix multiplication of the component matrices in the index- notation, both in this action and in their successive action by composition
α γ α A γ B β = [A B] β . (2.5) It is convenient to extend the “square” notation for the self-composition of a linear map with the corre- sponding self-right-contraction of the tensor field
2 α α γ 2 [A ] β = A γ A β ,A = A A. (2.6) With these notational conventions, the orthogonal projection tensor fields then satisfy the usual relations for an orthogonal decomposition α γ α 2 P (u) γ P (u) β = P (u) β ,P (u) = P (u) , α γ α 2 T (u) γ T (u) β = T (u) β ,T (u) = T (u) , (2.7) α γ α γ P (u) γ T (u) β = T (u) γ P (u) β = 0 ,P (u) T (u) = T (u) P (u) = 0 , which follow from the unit character of u and its orthogonality to the spatial projection tensor
α β α P (u) βu = 0 = uαP (u) β , (2.8) P (u) u = 0 = u[ P (u) . It is convenient to extend the projection operators to act on a tensor S by understanding P (u)S to mean the projection of S on each index by the projection P (u), i.e., contraction of S on each index by the appropriate index of P (u) α... α δ γ... [P (u)S] = P (u) γ ··· P (u) β ··· S , β... δ... (2.9) α... α δ γ... [T (u)S] β... = T (u) γ ··· T (u) β ··· S δ... .
18 These give the purely spatial and purely temporal parts of S. The orthogonal decomposition of S is accomplished by extending the orthogonal decomposition to each of its indices, and expanding out the result
α... α δ γ... S β... = δ γ ··· δ β ··· S δ... α α δ δ γ... = [T (u) γ + P (u) γ ] ··· [T (u) β + P (u) β] ··· S δ... (2.10) α... α... = [T (u)S] β... + (mixed terms) + [P (u)S] β... . The coefficent tensors of those factors of u with a free index in this formula define a family of tensors of all ranks between zero and the rank of S. These are the tensors which result from replacing the projection δ along u with contraction by −u or −uγ as appropriate. They represent the spatial projections of all possible contractions of S with any number of factors of −u or −u[ as appropriate, and can be easily used to reconstruct S by factoring in the appropriate missing factors of u[ and u respectively. The reduction of a tensor S to such a family of tensors will be called the measurement of S by the observer congruence. A tensor which has been spatially projected on all of its free indices is called a spatial tensor and gives zero upon contraction of any index with u. All of the members of the family of tensors which result from the 1 measurement of a single spacetime tensor are spatial tensors. For example, the measurement of a 1 -tensor S leads to the following result in the index notation
α S β ↔ γ δ α γ δ γ δ α γ δ {S δu uγ , −P (u) γ S δu , −uγ S δP (u) β,P (u) γ S δP (u) β} . (2.11) | {z } | {z } | {z } | {z } scalar vector 1−form 1 (1)−tensor
P (u) and its index-shifted forms are all spatial tensors which result from the purely spatial part of the measurement of the identity tensor and the metric and its inverse. The remaining parts of these measurements are trivial (4)id ↔ {−1, 0, 0,P (u)} , (4)g ↔ {−1, 0,P (u)[} , (2.12) (4)g−1 ↔ {−1, 0,P (u)]} . For the symmetric metric tensor and its inverse the two rank-one mixed parts are equal and vanish, while for the mixed identity tensor, the two distinct rank one parts are the zero vector field and the zero 1-form. α In each case the rank zero or spatial scalar part is the constant −1 = uαu . When tensors have symmetries among their contravariant indices or among their covariant indices, the measurement process may be collapsed to list only the distinct spatial fields which result from the measure- ment, as done above for the metric and its inverse. The simplest case occurs for p-forms, which give only two distinct fields since at most one contraction with u is possible before spatially projecting the result. No contraction yields the purely spatial part or “magnetic part” p-form and one contraction yields the single mixed part or “electric part” (p − 1)-form. This splitting may be expressed in the form
(E) α (E) [S (u)]α ...α = −u Sαα ...α ,S (u) = −u S, 1 p−1 1 p−1 (2.13) (M) (M) [S (u)]α1...αp = [P (u)S]α1...αp ,S (u) = P (u)S, while the orthogonal decomposition of S has the form
(E) (M) Sα ...α = p!u[α S (u)α ...α ] + S (u)α ...α , 1 p 1 2 p 1 p (2.14) S = u[ ∧ S(E)(u) + S(M)(u) .
This is equivalent to the following decomposition of the identity operator ALT (the antisymmetrizer) on differential forms ALT = u[ ∧ (−u ) + ALT ◦P (u) , (2.15)
19 or in index notation, of the generalized Kronecker delta tensor δ(p)
α ...α δ 1 p = p!δα1 ··· δαp β1...βp [β1 βp] α ...α (2.16) = −p!u[α1 u δα2 ··· δαp] + δ(u) 1 p , [β1 β2 βp] β1...βp where the spatial generalized Kronecker delta tensor δ(u)(p) = P (u)δ(p) is defined by
α ...α δ(u) 1 p = p!P (u)[α1 ··· P (u)αp] . (2.17) β1...βp [β1 βp]
Notice that the contraction u S is automatically spatial due to the antisymmetry of S (since u (u S) = 0 α β or equivalently u u Sβαγ3...γp = 0) and so requires no subsequent spatial projection. The (unit oriented) volume 4-form (4)η may be split in this way but only its electric part survives since it is easily shown that the magnetic part P (u)(4)η vanishes. The electric part reversed in sign
(4) δ (4) η(u) = u η , η(u)αβγ = u ηδαβγ (2.18)
defines the (unit oriented) volume 3-form for the local rest space LRSu. In an (oriented, time-oriented) 0123 123 orthonormal frame adapted to u and LRSu, then η0123 = 1 = −η and η(u)123 = 1 = η(u) . This spatial volume 3-form defines the spatial duality operation ∗(u) for antisymmetric spatial tensors and the usual cross product of two vectors X and Y in the local rest space
∗(u) α α β γ X ×u Y = [X ∧ Y ] , [X ×u Y ] = η(u) βγ X Y . (2.19)
One can also introduce the spatial “dot product” between two spatial vectors in order to mimic Euclidean vector analysis [ α β X ·u Y = P (u) (X,Y ) = P (u)αβX Y , (2.20) This just gives the spacetime inner product of the two spatial vectors. Note that one can replace the spacetime metric tensor in any contraction with spatial tensors by the appropriate valence spatial projection which acts as the metric on the local rest space and its tensor algebra. The spatial volume 3-form η(u) is used to define the spatial duality operation “ ∗(u) ” for spatial differential forms 1 ∗(u)S = S η(u)β1···βp , (2.21) α1···α3−p p! β1···βp α1···α3−p It satisfies ∗(u) ∗(u)S = S. Using this spatial operation, one can split the spacetime duality operation “ ∗ ” on differential forms. Assuming the right dual convention of Misner, Thorne and Wheeler [1973] this is given by 1 ∗S = S ηβ1···βp , (2.22) α1···α4−p p! β1···βp α1···α4−p This operation instead satisfies ∗ ∗S = (−1)p−1S for a p-form S. Using the definition of the electric and magnetic parts of a differential form, one finds for a p-form S
[ ∗S](E)(u) = (−1)p−1 ∗(u)S(M)(u) , (2.23) [ ∗S](M)(u) = ∗(u)S(E)(u) .
α In order to have unique index-free symbols for the three valence forms of a 2-form, let F (F β) be the [ ] αβ mixed valence form of a 2-form F (Fαβ) and let F (F ) be the 2-vector or contravariant form of the field. Then the above relationship with S = F [ is
[ ∗F [](E)(u) = −∗(u)F [(M)(u) , [ ∗F [](M)(u) = ∗(u)F [(E)(u) . (2.24)
20 2.1.2 Observer-adapted frames Components with respect to a frame adapted to the observer orthogonal decomposition can be quite useful α in the splitting game. An observer-adapted frame {eα} with dual frame {ω } will be any frame for which e0 is along u and the “spatial frame” {ea} spans the local rest space LRSu
−1 [ u = L e0 ≡ e> , u (ea) = 0 , (2.25) u[ = −Lω0 ≡ −ω> , ωa(u) = 0 .
The orientability and time-orientability assumptions imply L > 0 and η(u)123 > 0. Thus for a time-oriented oriented orthonormal observer-adapted frame, one has L = 1 or e0 = u. A frame which only has L = 1 will be called partially normalized. The splitting of a tensor field S amounts to a partitioning of the components in an observer-adapted frame according to whether or not individual indices are zero or not. The purely spatial part corresponds to those components which have only “spatial indices” 1,2,3, while the purely temporal part corresponds to the single component with every index equal to the “temporal index” 0. The remaining components parametrize the mixed parts of the tensor. Spatial tensors have only the spatially-indexed components nonzero. Thus any tensor equation in index-form involving only spatial tensor fields, when expressed in an observer-adapted frame, reduces to an equation whose only nonzero components correspond to replacing Greek indices with Latin indices. 1 For a 1 -tensor S one has 0 a 0 a S ↔ {S 0,S 0,S a,S b} (2.26) > a > a ↔ {S >,S >,S a,S b} , where the index “>” (pronounced “tan”) suggests “tangential” to the congruence (or even “time” in this context) and corresponds to the orthonormal temporal component obtained by scaling the zero-indexed frame component by the normalization factor L (evaluation on contravariant arguments on e> and covariant arguments on ω>). This family of spatial fields differs by a sign from the contraction/projection algorithm [ > [ described above which picks off the coefficients of u and u rather than of e> = u and ω = −u in the expansion of a tensor. One can represent this decomposition in index-free notation by using an index “⊥” (pronounced “perp”) standing for “perpendicular” to the congruence in place of the spatial index
> ⊥ > ⊥ S ↔ {S >,S >,S ⊥,S ⊥} . (2.27)
The symbol > was selected for being the reflection of the symbol ⊥ to a dual position. Similarly a p-form S splits in component form in the following way
{Sα1...αp } ↔ {S0a2...ap ,Sa1...ap }
↔ {S>a2...ap ,Sa1...ap } (2.28) (E) (M) ↔ {S>⊥...⊥,S⊥...⊥} = {−S (u),S (u)} .
The components of the spacetime metric and its inverse in such a frame may be represented in the conventional form (4) 2 (4) (4) g00 = −L , g0a = 0 , gab = P (u)ab = hab , (2.29) (4)g00 = −L−2 , (4)g0a = 0 , (4)gab = P (u)ab = hab , or equivalently (4) 2 0 0 a b g = −L ω ⊗ ω + habω ⊗ ω > > a b = −ω ⊗ ω + habω ⊗ ω , (2.30) (4) −1 −2 ab g = −L e0 ⊗ e0 + h ea ⊗ eb ab = −e> ⊗ e> + h ea ⊗ eb . The kernel symbol P (u) is here being replaced by h as often used in specific formalisms where the frame is understood to be adapted to a given congruence. The spacetime metric determinant factor has the expression
21 (4) 1/2 1/2 (4) 1/2 g = Lh , while the oriented spatial volume 3-form has components η(u)abc = η>abc = h abc. The three valence forms of the spatial projection are
[ a b ] ab a b a P (u) = habω ⊗ ω ,P (u) = h ea ⊗ eb ,P (u) = h bea ⊗ ω = ea ⊗ ω . (2.31)
Note that the cross product and dot product have the explicit expressions
a a b c 1/2 ad b c [X ×u Y ] = η(u) bcX Y = h h dbcX Y , (2.32) a b X ·u Y = habX Y .
It is also quite useful to consider frames which are only adapted to one of the two subspaces of the or- thogonal decomposition of the tangent spaces associated with a given family of test observers. An “observer- partially-adapted frame” {eα} will be a frame for which either 1) e0 is along u or 2) {ea} spans LRSu (equivalently ω0 is along u[), i.e., that spacetime frame contains either a subframe for the distribution of local time directions or for the distribution of local rest spaces. Both conditions together characterize the observer-adapted frames. Those observer-partially-adapted frames which are not observer-adapted will be considered below in the context of a nonlinear reference frame. For an electromagnetic 2-form F [, the electric and magnetic parts defined above correspond directly to α the electric field and the spatial dual of the magnetic field. It is convenient to let F (in index notation F β) be the mixed form of the 2-form and E(u) = F u and B(u) be the vector fields in the index-free notation, so one has E(u)[ = F [(E)(u) = −u F [ = F [ u , (2.33) B(u)[ = ∗(u)F [(M)(u) ,F [(M)(u) = ∗(u)B(u)[ , or equivalently β E(u)α = Fαβu , (2.34) α 1 αβγ 1 αβγδ B(u) = 2 η(u) Fβγ = − 2 η Fβγ uδ . In an observer-adapted frame, these definitions take the index-form
E(u)a = −F>a = Fa> , (2.35) 1 bc c B(u)a = 2 η(u)a Fbc ,Fab = η(u)ab B(u)c . One may re-express the 2-form and its spacetime dual in terms of the electric and magnetic 1-forms in the following way
[ [ [ ∗(u) [ γδ F = u ∧ E(u) + B(u) ,Fαβ = 2u[αE(u)β] + ηαβ uγ B(u)δ , (2.36) ∗ [ [ [ ∗(u) [ ∗ γδ F = −u ∧ B(u) + E(u) , Fαβ = −2u[αB(u)β] + ηαβ uγ E(u)δ .
22 2.1.3 Relative kinematics: algebra Suppose U is another unit timelike vector field representing a different family of test observers. One may then examine how the measurements of the two families of observers are related to each other. This involves as well the measurement of the various projection and boost maps between their local rest spaces which are necessary to interpret the “transformation” from the spatial quantities and operators of one set to those of the other. The relative velocities of one set of observers with respect to the other are defined by measuring the other’s 4-velocity U = γ(U, u)[u + ν(U, u)] , (2.37) u = γ(u, U)[U + ν(u, U)] . The relative velocity ν(U, u) of U with respect to u is spatial with respect to u and vice versa. Both have α 1/2 the same magnitude ||ν(U, u)|| = [ν(U, u)αν(U, u) ] , while the common gamma factor is related to that magnitude by −(4)g(U, u) = γ(U, u) = γ(u, U) = [1 − ||ν(U, u)||2]−1/2 . (2.38) Letν ˆ(U, u) = ||ν(U, u)||−1ν(U, u) be the unit vector giving the direction of the relative velocity ν(U, u), when nonzero. Introduce also the energy and spatial momentum per unit mass relative to u
E˜(U, u) = γ(U, u) , p˜(U, u) = γ(U, u)ν(U, u) . (2.39)
Multiplication of these quantities by the nonzero rest mass m of a test particle whose worldline coincides with one of the curves of the congruence of U yields its energy E(U, u) and spatial momentum p(U, u) as seen by the observers with 4-velocity u. These in turn are the result of the measurement by u of the 4-momentum P = mU of the test particle. In addition to the natural parametrization of the worldlines of U by the proper time τU , one may introduce a new parametrization τ(U,u) by dτ(U,u)/dτU = γ(U, u) . (2.40) This corresponds to the sequence of proper times of the family of observers from the u congruence which cross paths with a given worldline of the U congruence. It is convenient to abbreviate γ(U, u) by γ when its meaning is clear from the context in order to simplify the appearance of equations which involve this factor. When the relative velocity is nonzero, U and u define a 2-dimensional subspace of each tangent space called the relative observer plane. Its orthogonal complement is the subspace LRSu ∩LRSU common to both local rest spaces representing in each the directions orthogonal to the direction of relative motion. Equations (2.37) describe a unique active “relative observer boost” B(U, u) which acts as the identity on LRSu ∩LRSU such that in the relative observer plane
B(U, u)u = U,B(U, u)ν(U, u) = −ν(u, U) . (2.41)
The inverse boost B(u, U) “brings U to rest” relative to u. It will be convenient to use the same symbol for 1 a linear map of the tangent space into itself and the corresponding 1 -tensor acting by contraction. The right contraction between two such maps will represent their composition. When the contraction symbol is suppressed, the linear map will be implied. The boost B(U, u) restricts to an invertible map B(lrs)(U, u) ≡ P (U) ◦ B(U, u) ◦ P (u): LRSu → LRSU between the local rest spaces which acts as the identity on their common subspace. Similarly the projection P (U) restricts to an invertible relative projection map P (U, u) = P (U) ◦ P (u): LRSu → LRSU with −1 inverse P (U, u) : LRSU → LRSu and vice versa, and these maps also act as the identity on the common subspace of the local rest spaces. The boosts and projections between the local rest spaces differ only by a gamma factor along the direction of motion. It is exactly the inverse projection map which describes Lorentz contraction of lengths along the direction of motion. Figure 2.1.3 illustrates these maps and the relative velocities on the relative observer plane. α If Y ∈ LRSu, then the orthogonality condition 0 = uαY implies that Y has the form
Y = [ν(u, U) ·U P (U, u)Y ]U + P (U, u)Y. (2.42)
23 ...... ν(u, U) ...... ν(U, u) ...... X u... U ...... −νˆ(u, U) ...... νˆ(U, u) ...... −1 −1 . . . P (U, u) X = γ B(u, U)X ...... −1 .. . . B(u, U)X = γ P (u, U)X ...... P (u, U)X ......
Figure 2.1: The relative observer plane, the relative velocities and the associated relative observer maps.
−1 If X = P (U, u)Y ∈ LRSU is the field seen by U, then Y = P (U, u) X and
−1 P (U, u) X = [ν(u, U) ·U X]U + X (2.43) = [P (U) + U ⊗ ν(u, U)[] X, which gives a useful expression for the inverse projection in terms of the tensor which represents it. Any map between the local rest spaces may be “measured” by one of the observers, i.e., expressed entirely in terms of quantities which are spatial with respect to that observer. For example, the mixed tensor representing the relative observer projection P (U, u) = P (U) P (u) (2.44) = P (U) P (U, u) = P (U, u) P (u)
(which expands to P (u)+γU ⊗ν(U, u)), corresponding to the linear map P (U, u): LRSu → LRSU , is spatial with respect to u in its covariant index and with respect to U in its contravariant index, i.e., is a “connecting tensor” in the terminology of Schouten [1954]. It has associated with it two tensors
P (U) = P (U, u) P (U, u)−1 , (2.45) P (u) = P (U, u)−1 P (U, u) , which are spatial with respect to U and u respectively and correspond to identity transformations of each local rest space into itself. In the same way any linear map M(U, u): LRSu → LRSU is represented by such a connecting tensor and has associated with it two tensors MU (U, u) and Mu(U, u) which are spatial with respect to U and u respectively and act as linear transformations of the respective local rest spaces into themselves M(U, u) = MU (U, u) P (U, u) = P (U, u) Mu(U, u) , −1 MU (U, u) = M(U, u) P (U, u) , (2.46) −1 Mu(U, u) = P (U, u) M(U, u) . These latter tensors enable one to express the map in terms of the spatial projections of just one of the observers.
24 The relative boosts and projections all involve the further decomposition of each local rest space into the subspaces parallel and perpendicular to the direction of relative motion. Introducing the notation (||) (⊥) LRSu = LRSu ⊕ LRSu (2.47) (⊥) (⊥) for these subspaces, then LRSu = LRSU = LRSu ∩ LRSU is the common subspace of the two local rest spaces and and each of the above maps decompose into maps M (||)(U, u): LRS(||) → LRS(||) , u u (2.48) (⊥) (⊥) (⊥) M (U, u): LRSu → LRSu , where the latter map is the identity map for both the boosts and projections. The individual projections parallel and perpendicular to the direction of relative motion between the local rest spaces and within each local rest space have the representations P (||)(U, u) = −γνˆ(u, U) ⊗ νˆ(U, u)[ ,P (⊥)(U, u) = P (U, u) − P (||)(U, u) , (||) [ (⊥) (||) PU (U, u) =ν ˆ(u, U) ⊗ νˆ(u, U) ,PU (U, u) = P (U) − PU (U, u) , (2.49) (||) [ (⊥) (||) Pu (U, u) =ν ˆ(U, u) ⊗ νˆ(U, u) ,Pu (U, u) = P (u) − Pu (U, u) , where P (U, u)ˆν(u, U) = −γνˆ(U, u) explains the γ factor in the first relation. The vector identity
A ×u [B ×u C] = (A ·u C)B − (A ·u B)C (2.50) may be used to express this decomposition of a spatial vector X in the following way (||) (⊥) Pu (U, u)X =ν ˆ(U, u)[ˆν(U, u) ·u X] ,Pu (U, u)X =ν ˆ(U, u) ×u [ˆν(U, u) ×u X] . (2.51) These parallel and perpendicular projections in turn may be used to similarly decompose the boost −1 B(lrs)(U, u) and the inverse projection P (u, U) , for which one has the obvious relations (see Figure 2.1.3)
(||) −1 −1 (||) −2 (||) P (u, U) = γ B(lrs)(U, u) = γ P (U, u) , (2.52) (⊥) −1 (⊥) (⊥) P (u, U) = B(lrs)(U, u) = P (U, u) which may be used to reconstruct the spatial tensors associated with the boost and inverse projection. For example, for the inverse boost B(lrs)(u, U) one has −1 [ B(lrs)u(u, U) = P (u) − γ(γ + 1) ν(U, u) ⊗ ν(U, u) , (2.53) −1 [ B(lrs)U (u, U) = P (U) − γ(γ + 1) ν(u, U) ⊗ ν(u, U) , which follows from the expansion of
(||) (⊥) B(lrs)u(u, U) = B(lrs) u(u, U) + B (u, U)u (lrs) (2.54) (||) −1 (⊥) = Pu (u, U) + γ Pu (u, U) .
Thus if S ∈ LRSU , then its inverse boost is −1 [ B(lrs)(u, U)S = [P (u) − γ(γ + 1) ν(U, u) ⊗ ν(U, u) ] P (u, U)S. (2.55) The map P (u, U, u) = P (u, U)P (U, u) = P (u)P (U)P (u) (2.56) is an isomorphism of LRSu into itself which turns up in manipulations with these maps. It and its inverse (in the sense P (u, U, u)−1P (u, U, u) = P (u)) can be expressed in the following equivalent ways
(⊥) 2 (||) P (u, U, u) = Pu (u) + γ Pu (u, U) , = P (u) + γ2ν(U, u) ⊗ ν(U, u)[ , −1 −1 −1 −1 P (u, U, u) = P (U, u) P (u, U) = Pu(U, u) (2.57) (⊥) −2 (||) = Pu (U, u) + γ Pu (U, u) = P (u) − ν(U, u) ⊗ ν(U, u)[ .
25 The inverse map represents a “double Lorentz contraction” along the direction of relative motion. The map P (U, u)−1 appears in the transformation law for the electric and magnetic fields. Using the fact that (4) [P (U) F ] ν(u, U) = ν(u, U) ×U B(U) , (2.58) one finds P (U, u)E(u) = γP (U){(4)F [U + ν(u, U)]} (2.59) = γ[E(U) + ν(u, U) ×U B(U)] , and similarly P (U, u)B(u) = γ[B(U) − ν(u, U) ×U E(U)] . (2.60) Equivalently one may write
−1 E(u) = γP (U, u) [E(U) + ν(u, U) ×U B(U)] , −1 (2.61) B(u) = γP (U, u) [B(U) − ν(u, U) ×U E(U)] .
The transformation of the electric and magnetic fields takes a more familiar form if one re-expresses it in terms of the parallel/perpendicular decomposition of the boost using equation (2.52)
(||) (||) (||) E (u) = B(lrs)(u, U)E (U) , (2.62) (⊥) (⊥) (⊥) (⊥) E (u) = γB(lrs)(u, U)[E (U) − ν(u, U) ×U B (U)], with analogous expressions for the magnetic field. When expressed in a pair of orthonormal frames adapted to the two local rest spaces and related by the boost, these reduce to the familiar component expressions in a direct way. The map γP (u, U)−1 appears rather than the projection P (U, u) in these transformations due to the spatial duality operation. Suppose F (u) is a spatial 2-vector with respect to u. Then F (U) = P (U, u)F (u) is the purely spatial part of this 2-vector as seen by U. Its spatial dual with respect to U is
∗ ∗ F~ (U) = (U )F (U) = [ (U )P (U, u)∗(u)]F~ (u) (2.63) = γP (u, U)−1F~ (u) .
This is easily verified by an index calculation
α δα βγ σ F (U) = Uδη βγ u η σF (u) δα σ = −Uδu δ σF (u) δ α α σ (2.64) = −Uδu F (u) + u UσF (u) α α σ −1 α = γ[F (u) + u ν(U, u)σF (u) ] = [γP (u, U) F~ (u)] .
2.1.4 Splitting along parametrized spacetime curves and test particle worldlines The gravitational field is felt through its action on test particles, whose paths in spacetime (world lines) are either timelike (nonzero rest mass) or null (zero rest mass) curves. However, other gravitational effects manifest themselves along spacelike curves—synchonization of clocks, for example. It is therefore necessary to consider curves of all three causality types when viewing them in terms of a given observer family. Given any parametrized curve c in spacetime, with parameter λ, its tangent vector V (λ) = c0(λ) is defined in local coordinates by α α 0 α V (τU ) = dx (c (λ)) = d[x ◦ c(λ)]/dλ . (2.65) This tangent vector can be decomposed into its parts parallel and perpendicular to the observer 4-velocity
V (λ)α = V (λ)(||u) uα + [P (u)V (λ)]α , (2.66)
26 where its temporal and spatial parts are respectively
(||u) γ α α γ V (λ) = −uγ V (λ) , [P (u)V (λ)] = P (u) γ V (λ) . (2.67)
Dividing the spatial part by the temporal part defines the relative velocity of the path in spacetime repesented by the curve as seen by the observer family, regardless of its causality type. This relative velocity and (when nonzero) the unit vector defining its direction are defined by
ν(V (λ), u)α = [P (u)V (λ)]α/V (λ)(||u) , (2.68)
and νˆ(V (λ), u)α = sgn V (λ)(||u)[P (u)V (λ)]α/||P (u)V (λ)|| , (2.69) while the relative speed is the magnitude of the relative velocity
||ν(V (λ), u)|| = ||P (u)V (λ)|| . (2.70)
For a null curve, the relative velocity is itself a unit vector and the relative speed is 1. For a curve whose tangent vector is spatial with respect to the observer family, this speed goes to infinity. The relative velocity, direction and speed are invariant under a change of parametrization which scales the tangent vector by the rate of change of one parameter with respect to the other by the chain rule d d dλ d λ → λ(λ0) , → = . (2.71) dλ dλ0 dλ0 dλ
Suppose one has a timelike worldline c parametrized by its proper time λ = τU with unit timelike tangent vector U = c0. In local coordinates one has
α α 0 α U (τU ) = dx (c (τU )) = d[x ◦ c(τU )]/dτU . (2.72)
This worldine may represent the path of a test particle of mass m and charge q in spacetime. Measurements made by a test observer following the same worldline can be related to the sequence of measurements made by the family of different test observers of the observer congruence whose worldlines intersect the given worldline. All of the algebra developed above for two families of test observers may be restricted to the given worldline to relate the test particle quantities to the single observer congruence. In particular multiplying the energy and spatial momentum per unit mass by the rest mass yields the the energy and spatial momentum of the test particle E(P, u) = mE˜(U, u) = mγ(U, u) , (2.73) p(P, u) = mp˜(U, u) = mγ(U, u)ν(U, u) , which themselves result from the measurement of the 4-momentum P = mU
P = E(P, u)u + p(P, u) = E(P, u)[u + ν(P, u)] . (2.74)
One may also extend to a null worldline those quantities which do not rely on the timelike nature of P in order to study a test particle with zero rest mass m = 0. In this case one must work directly with the 4-momentum P and the energy and spatial momentum.
2 α 2 2 m = −PαP = E(P, u) − ||p(P, u)|| (2.75)
In the null case m = 0, the relative velocity is a unit vector ν(P, u) =ν ˆ(P, u), corresponding to the unit speed of light. To handle simultaneously the cases of zero and nonzero rest mass, introduce the “rest mass per unit mass” parameterm ˜ which has the value 0 for the first case and 1 for the second.
27 2.1.5 Addition of velocities and the aberration map Another map of physical interest between the local rest spaces of two observers is the nonlinear aberration map between their unit spheres which describes the relationship between the apparent directions of a light ray as seen by the two observers. This in turn is a special case of the addition of velocity formula which relates the relative velocities of a given test particle with 4-momentum P as seen by two different observers with 4-velocities u and U
P = E(P, u)[u + ν(P, u)] = E(P,U)[U + ν(P,U)] . (2.76)
One easily finds
E(P,U) = γ(U, u)[1 − ν(U, u) ·u ν(P, u)]E(P, u) , −1 −1 (2.77) ν(P,U) = γ(U, u) [1 − ν(U, u) ·u ν(P, u)] P (U, u)[ν(P, u) − ν(U, u)] .
Re-expressing the relative projection in terms of the boost leads to the formulas
(⊥) −1 −1 (⊥) ν(P,U) = γ(U, u) [1 − ν(U, u) ·u ν(P, u)] ν(P, u) , (2.78) (||) −1 (||) ν(P,U) = [1 − ν(U, u) ·u ν(P, u)] B(lrs)(u, U)[ν(P, u) − ν(U, u)] .
The null case describes the 4-momentum of a photon or the tangent vector to the path of a light ray, depending on the choice of language. The relative velocity is then a unit vector giving the direction of the photon or light ray. Dividing the energy and spatial momentum by Planck’s constant yields the frequency ω(P, u) = E(P, u)/¯h and wave vector k(P, u) = p(P, u)/¯h = ω(P, u)ν(P, u) of the photon. The relation- ship between the observed frequencies for the two different observer congruences determines the relativistic Doppler effect ω(P, u) = γ[1 − ν(U, u) ·u ν(P, u)]ω(P, u) . (2.79) The above relationship between the unit relative velocites is the aberration map. For the special case
ν(P, u)(||) = 0 , ||ν(P, u)(⊥)|| = 1 (2.80) of a light ray orthogonal to the relative motion as seen by u, one has the familiar result
||ν(P,U)(||)|| tan θ = = γ||ν(U, u)|| (2.81) U ||ν(P,U)(⊥)|| for the angle away from the perpendicular direction as seen by U, using the fact that boosts preserve lengths.
28 2.2 Derivatives
The orthogonal decomposition associated with a choice of observer congruence has been used to split each spacetime tensor field into a family of spatial fields which represent its measurement by the observers. The action of a spacetime differential operator on a spacetime tensor field leads to a new tensor field which may be measured. This leads to a representation of the differential operator itself in terms of two independent derivative operators which act on the collection of spatial fields which represent the spacetime tensor field being differentiated. The spatial projection of the derivative along u (when appropriate) leads to a temporal differential operator, while the spatial projection of the differential operator itself yields a spatial version of the differential operator. This splitting process may be applied to the natural derivatives, namely Lie and exterior derivatives, and to the covariant derivative and variations of it like the Fermi-Walker derivative. While the operators so defined preserve the spatial character of tensor fields they differentiate, they may also be applied to nonspatial tensor fields and this is actually useful in certain cases. However, because of the spatial projection, any terms containing undifferented factors of the observer 4-velocity will vanish, breaking the appropriate product rule for tensor (or wedge) products of nonspatial fields, while differentiated factors lead to the appearance of kinematical factors. When a second observer congruence is present, one must also introduce relative derivative operators which preserve the spatial character of fields which are spatial with respect to the original observer congruence. This is necessary in order to measure derivatives along the second observer congruence of the quantities which result from the measurement by the original observer congruence of fields associated with the second congruence. These operators may be restricted to the case of a single timelike worldline of a test particle of nonzero rest mass and suitably extended to the case of zero rest mass in order to study test particles of either type. This leads to the subject of apparent spatial gravitational forces acting on test particles due to the motion of the family of test observers, as well as their consequences for the behavior of test gyroscopes or spinning test particles.
2.2.1 Natural derivatives Two natural derivatives are important for various considerations, the Lie derivative, which for vector fields becomes the Lie bracket, and the exterior derivative. Each of these may be split as described above. The spatial Lie bracket is defined by for arbitrary vector fields X and Y by
[X,Y ](u) = P (u)[X,Y ] = £(u)XY, (2.82) and in general the spatial Lie derivative of an arbitrary tensor S is defined in the obvious way
£(u)X S = P (u)[£X S] . (2.83)
Similarly the spatial exterior derivative of an arbitrary differential form S is defined by
d(u)S = P (u)dS . (2.84)
Each of these spatial derivative operators can be applied to nonspatial fields S but are perhaps most useful when applied to spatial fields. However, when applied to nonspatial fields both operators violate the various product rules that usually hold since the spatial projection eliminates terms with temporal factors. For example, when S is spatial and the vector field X = fu is a temporal field, the spatial Lie derivative along X has the special property of being linear in the temporal component X> = f due to the orthogonality properties £(u)fuS = f£(u)uS,S spatial . (2.85) This property also holds for the spatial Lie derivative of the metric itself
(4) (4) £(u)fu g = f£(u)u g (2.86) since it differs from the spatial field P (u)[ only by the term u[ ⊗ u[ which has zero spatial Lie derivative along u.
29 For the same reason, the spatial exterior derivative of a temporal 1-form X[ = fu[ is linear in the temporal component X> = −f
d(u)[fu[] = P (u)[df ∧ u[ + fdu[] = fd(u)u[ . (2.87)
The derivative £(u)u provides a natural time derivative which will be referred to both as the temporal Lie derivative and as the Lie temporal derivative, while the derivative £(u)X along a spatial vector field X provides a natural spatial derivative. These two operators are necessary to describe the splitting of the Lie derivative. On the other hand, the temporal Lie derivative and the spatial exterior derivative together are necessary to describe the splitting of the exterior derivative. It is convenient to introduce the alternate notation ∇(lie)(u) = £(u)u for the Lie temporal derivative associated with u. This will be needed for efficient handling of three different preferred temporal derivatives.
2.2.2 Covariant derivatives Each spacetime tensor field S can be split into its representative family of spatial fields by the measurement process. Its covariant derivative (4)∇S can be split in exactly the same way and the result can be represented in terms of two projected differential operators acting either on members of the family of spatial fields representing S or on u itself, generating the kinematical quantities associated with the observer congruence. The two projected differential operators are the spatial covariant derivative, for differentiating along spatial directions, and the spatial Fermi derivative, or spatial Fermi-Walker derivative, for differentiating along the local time direction of the observer congruence. They are defined for an arbitrary tensor field S by ∇(u)S = P (u)[(4)∇S] , (2.88) (4) ∇(fw)(u)S = P (u)[ ∇uS] . In index notation it is convenient to introduce the double vertical bar symbol in place of the semicolon for the alternative notation for the spatial covariant derivative
(4) α... (4) α... α... [ ∇S] β...γ = ∇γ S β... = S β...;γ , α... α... α... (2.89) [∇(u)S] β...γ = ∇(u)γ S β... = S β...||γ . For a spatial tensor S, the spatial Fermi-Walker derivative coincides with the Fermi-Walker derivative [Walker 1932] which in turn reduces to the Fermi derivative [Fermi 1922] defined only for spatial tensors. For a nonspatial tensor, the spatial Fermi-Walker derivative and the Fermi-Walker derivative yield distinct results. One might also refer to the spatial Fermi-Walker derivative as the Fermi-Walker temporal derivative in analogy with the Lie temporal derivative, a term which is more suggestive of the direction of differentiation. Similarly the spatial covariant derivative may be applied to a nonspatial tensor, although it is most useful when applied to spatial tensors. One can speak of its spatial part ∇(u) ◦ P (u) and its temporal part ∇(u) ◦ T (u). In the literature the spatial covariant derivative is consistently defined to act only on spatial tensor fields, so only the geometry of the spatial part of spatial covariant derivative ∇(u) as defined here has been explored. This restricted derivative was mentioned by Møller [1952] and studied by Cattaneo [1958,1959] who called it the “transverse covariant derivative”, while its associated curvature was mentioned by Zel’manov [1956] and more deeply explored by Cattaneo-Gasperini [1961] and Ferrarese [1963,1965]. Ellis has recently used the terminology “orthogonal covariant derivative” [Ellis, Hwang and Bruni, 1990]. The spacetime metric and the volume 4-form are obviously covariant constant with respect to the spatial covariant derivative. It is straightforward to show that the spatial projection and each of its index-shifted forms are all covariant constant with respect to the spatial covariant derivative, as is the spatial volume 3-form ∇(u)(4)g = ∇(u)P (u) = ∇(u)P (u)[ = 0 , etc. (2.90) Thus index shifting of spatial fields and the spatial duality operation commute with the spatial covariant derivative. Fermi-Walker transport of a spatial tensor along the observer congruence is defined by requiring its spatial Fermi-Walker derivative to vanish. Similarly spatial parallel transport of a spatial tensor along a
30 curve everywhere orthogonal to the observer congruence is defined by requiring that its spatial covariant derivative along the curve vanish. Under both transports spatial tensors remain spatial by undergoing a unique 1-parameter family of boosts relative to parallel transport in order to preserve the orthogonality with u. Since parallel transport itself preserves inner products, so do these boosted transports, which means that the spatial projections and volume 3-form are also covariant constant with respect to the spatial Fermi-Walker derivative [ ∇(fw)(u)P (u) = ∇(fw)(u)P (u) = ∇(fw)(u)η(u) = 0 , etc. (2.91) This also easily follows from the spatial projection of the vanishing of the covariant derivatives of the spacetime metric and 4-volume form. Thus index-shifting and the spatial duality operations commute with the spatial Fermi-Walker derivative, exactly as for the spatial covariant derivative. Fermi-Walker transport of a spatial vector along the observer congruence has the physical interpretation of describing the behavior of the spin vector of a test torque-free gyroscope (gyro) carried by an observer.
2.2.3 Kinematical quantities (4) α The first nontrivial covariant derivative to split is ∇u, or u ;β in index-notation, namely the covariant derivative of the observer congruence 4-velocity vector field. This mixed tensor may be identified with a linear transformation of the tangent space, which is connected with its physical interpretation. The time projection on the contravariant index vanishes due to the unit nature of u
α 1 α uαu ;β = 2 (uαu );β = 0 , (2.92) so only two of the four terms in the splitting survive. In index-free form one has
(4)∇u = −a(u) ⊗ u[ + ∇(u)u , (4) a(u) = ∇uu = ∇(fw)(u)u , (2.93) ∇(u)u = −k(u) = θ(u) − ω(u) ,
while the index-lowered form is usually given when expressed in the index form
(4) [ (4) [ ∇u ]αβ = ∇βuα = uα;β = θ(u)αβ − ω(u)αβ − a(u)αuβ , γ δ k(u)αβ = −P (u) αP (u) βuγ;δ = −θ(u)αβ + ω(u)αβ , γ δ θ(u)αβ = P (u) αP (u) βu(γ;δ) , (2.94) γ δ ω(u)αβ = −P (u) αP (u) βu[γ;δ] , β a(u)α = uα;βu ,
The expansion tensor θ(u)[ and the rotation or vorticity tensor ω(u)[ are just the symmetric and sign-reversed antisymmetric parts of the spatial covariant derivative of u[. The expansion tensor is then decomposed into its pure-trace and tracefree parts to define the expansion scalar
α Θ(u) = Tr θ(u) = θ(u) α (2.95)
and the shear tensor
1 1 γ σ(u) = θ(u) − 3 Θ(u)P (u) , σ(u)αβ = θ(u)αβ − 3 θ(u) γ P (u)αβ . (2.96) The expansion scalar, shear tensor and rotation tensor, together with the acceleration vector, are referred to as the kinematical quantities of the vector field u or of its associated congruence of integral curves. The acceleration vector field is spatial due to the unit character of u
α α β 1 α β uαa(u) = uαu ;βu = 2 [uαu ];βu = 0 . (2.97) If it is identically zero the observer worldlines are geodesics and the observers are inertial observers. However, in contrast with the flat spacetime case, families of inertial observers are often less convenient to use than
31 noninertial observers unless they happen to admit additional special properties shared by the usual transla- tion invariant families of inertial observers in the flat case. This is due to the attractive nature of gravitation which causes the inertial observers to collapse together unless other conditions oppose this behavior. The remaining kinematical properties reside in the spatial tensor k(u). This mixed tensor may be thought of as a linear transformation of the local rest space into itself, and the decomposition of this linear transformation into its irreducible parts has a very physical interpretation. This interpretation involves a comparison with Lie dragging along u, for which the operator £(u)u is relevant. One may express the kinematical quantities of u entirely in terms of natural derivatives acting on u[ or its associated covariant or contravariant spatial projections du[ = −u[ ∧ a(u)[ + 2ω(u)[ , a(u)[ = −[du[](E) = u du[ = £uu[ = £(u)uu[ , [ 1 [ (M) 1 [ ω(u) = 2 [du ] = 2 d(u)u , (2.98) £uP (u)[ = £(u)uP (u)[ = £(u)u(4)g = 2θ(u)[ , £uP (u)] = £(u)uP (u)] = £(u)u(4)g−1 = −2θ(u)] , £uP (u) = u ⊗ a(u)[ , a(u)[ = −u [£uP (u)][ , £(u)uP (u) = 0 . The negative sign in the antisymmetrized covariant derivative used to define the rotation tensor arises because of the convention of adding the additional exterior derivative index to the left while the additional covariant derivative index adds to the right. For a 1-form, the two differ by a sign. The factor of two arises since the antisymmetrized covariant derivative (for a symmetric connection) is half the exterior derivative, modulo the sign difference. The definition of the rotation tensor by Ehlers [1961, 1993] and Ellis [1971] has the opposite sign, while Misner, Thorne and Wheeler [1973] use different signs in the two sections in which kinematical quantities are discussed. Since the spatial Lie derivative by u of the spatial metric yields (twice) the expansion tensor, this derivative operator does not commute with index-shifting on spatial fields when the expansion tensor is nonzero. Similarly the result
£uη(u) = £(u)uη(u) = u £u(4)η = Θ(u)η(u) (2.99) shows that it does not commute with the spatial duality operation either as long as the expansion scalar is nonzero. For example, for a spatial differential form S one has the relation
[∗(u)£(u)u∗(u)S]] = [£(u)u + Θ(u)]S] . (2.100)
However, the spatial projection tensor P (u) does have zero spatial Lie derivative along u, so the spatial projection operator and the spatial Lie derivative along u do commute
[£(u)u,P (u)] = 0 , (2.101) which means £(u)uP (u)S = P (u)£uS = £(u)uS. (2.102) Thus if a tensor is Lie dragged along u, its Lie derivative by u is zero which implies that the spatial Lie derivative of its spatial projection is also zero. It is natural to define spatial Lie transport of a spatial field along u by requiring that its spatial Lie derivative along u be zero. Then this result means that the spatial projection of Lie transport along u of a tensor yields the spatial Lie transport along u of the spatial projection of that tensor. The rotation tensor characterizes the failure of the Lie bracket of two spatial vector fields to again be spatial. The following short calculation for a pair of spatial vector fields X and Y using the invariant definition of the exterior derivative of a 1-form 2ω[(u)(X,Y ) = du[(X,Y ) = Xu[(Y ) − Y u[(X) − u[([X,Y ]) = −u[([X,Y ]) , (2.103) T (u)[X,Y ] = −u[([X,Y ])u = 2ω(u)[(X,Y )u ,
32 shows that the nonspatial part of this Lie bracket vanishes only if the rotation vanishes. The spatial part defines the spatial Lie bracket of the two spatial vector fields. When the rotation vanishes this corresponds to the Lie bracket defined within the hypersurfaces orthogonal to the congruence. The covariant rotation tensor is just half the spatial exterior derivative of u[, thus agreeing with its classical 3-dimensional definition in the Euclidean case [Synge and Schild 1949]. Its index-raised dual defines the spatial rotation or vorticity vector
~ω(u) = ∗(u)ω(u) = 1 [ ∗(u[ ∧ du[)]] , 2 (2.104) α 1 (4) βαγδ(4) 1 (4) αβγδ ω(u) = 2 uβ η ∇γ uδ = 2 η uβuγ;δ . The sign of this formula is affected both by the choice of index range (0,1,2,3 or 1,2,3,4) and the signature, each of which varies in the literature. For the index range 1,2,3,4, the spatial volume 3-form is defined by the right contraction (4)η u rather than the left contraction, while for the opposite signature (+ − −− or − − −+) the covariant spatial components of a vector field change sign relative to the contravariant spatial components, forcing another sign change to correspond to the classical spatial definition. For example, Ellis (4) [1973] in one of his review articles uses the convention η0123 = −1 in an orthonormal frame with signature − + ++ and defines the covariant rotation tensor as minus the spatial exterior derivative of u[ for a total of two sign changes, leading to the same formula as given here. A stationary spacetime is characterized by the existence of a timelike Killing vector field ξ which Lie derives the metric (4) £ξ gαβ = 2ξ(α;β) = 0 . (2.105) The 4-velocity vector field of a congruence of “Killing observers” can be defined by normalizing the Killing vector field u = `−1ξ , ` = |(4)g(ξ, ξ)|1/2 . (2.106) The normalization factor is also stationary, i.e., invariant under this symmetry, since
(4) α β (4) α β £ξ[ gαβξ ξ ] = [£ξ gαβ]ξ ξ = 0 (2.107) implies £ξ` = 0. For such an observer congruence, a “stationary observer congruence”, the expansion tensor vanishes
1 (4) 1 −1 (4) 1 −1 (4) θ(u) = 2 £(u)u g = 2 ` £(u)ξ g = 2 ` P (u)£ξ g = 0 . (2.108) Similarly the acceleration admits a spatial potential which is the natural logarithm of the normalization factor ` [ [ −1 [ a(u) = £(u)uu = P (u)£ −1 [` ξ ] = ... ` ξ (2.109) = d(u) ln ` = d ln ` , where the last equality follows from the stationarity of `. Only in the event that u itself is a Killing vector field must the normalization factor be constant, leading to vanishing acceleration.
2.2.4 Splitting the exterior derivative The rotation and acceleration appear in the splitting of the spacetime exterior derivative, which may be expressed in terms of the spatial exterior derivative and the temporal Lie derivative in the same way that the spacetime covariant derivative can be expressed in terms of the spatial covariant derivative and the spatial Fermi-Walker derivative, together with the kinematical quantities. To carry out the splitting of the exterior derivative, one needs several identities. Wedging the splitting identity (2.14) by u[ yields the useful preliminary identity u[ ∧ S = u[ ∧ P (u)S. (2.110) The Lie derivative of a differential form is related to the exterior derivative by the well known identity
£uS = d(u S) + u dS . (2.111)
33 For a spatial differential form S, the contraction of S by u vanishes (and u dS is automatically spatial) so
u dS = £uS = £(u)uS , u S = 0 , (2.112) and hence applying the splitting identity (2.14) one obtains
dS = −u[ ∧ £(u)uS + d(u)S , u S = 0 . (2.113)
Using these identities it is straightforward to evaluate the electric and magnetic parts of the exterior derivative of an arbitrary differential form
[dS](E) = −[d(u) + a(u)[∧]S(E) − £(u)uS(M) , (2.114) [dS](M) = d(u)S(M) + 2ω(u)[ ∧ S(E) .
The measurement of the identity 0 = d2u[ leads to identities for the spatial exterior derivatives of the acceleration and rotation forms. Just substituting S = du[ = −u[ ∧ a(u)[ + 2ω(u)[ into the splitting identity for the exterior derivative leads to
0 = (d2u[)(E) = [d(u) + a(u)[∧]a(u)[ − 2£(u)uω(u)[ , (2.115) 0 = (d2u[)(M) = 2[d(u)ω(u)[ − ω(u)[ ∧ a(u)[] ,
leading to the relations d(u)a(u)[ = 2£(u)uω(u)[ , (2.116) d(u)ω(u)[ = ω(u)[ ∧ a(u)[ . For an arbitrary differential form S, the splitting of d2S is obtained by iterating the splitting of the exterior derivative, with the result
0 = (d2S)(E) = {d(u)2 − 2ω(u)[ ∧ £(u)u}S(E) + {d(u)a(u)[ − 2£(u)uω(u)[} ∧ S(E) + {−[£(u)u, d(u)] + a(u)[ ∧ £(u)u}S(M) , (2.117) 0 = (d2S)(M) = {d(u)2 − 2ω(u)[ ∧ £(u)u}S(M) + 2{d(u)ω(u)[ − ω(u)[ ∧ a(u)[} ∧ S(E) .
Since S(E) and S(M) are independent, the terms involving each must separately vanish. Two of these terms vanish identically due to the identity for u[ itself (S(E) = 1 and S(M) = 0), while a third gives an identity for the commutator of the two natural derivatives acting on a spatial differential form
[£(u)u, d(u)] = a(u)[ ∧ £(u)uS , u S = 0 . (2.118)
This leaves the result that the measurement of d2 acting on a differential form leads to the operator d(u)2 − 2ω(u)[ ∧ £(u)u acting on each of the two splitting fields which represent that differential form. The vanishing of this operator then gives an identity for d(u)2 acting on any spatial differential form
d(u)2S = 2ω(u)[ ∧ £(u)uS , u S = 0 . (2.119)
Using this result, one can evaluate d(u)2 acting on an arbitrary differential form, leading to the result
d(u)2S = 2ω(u)[ ∧ [£(u)uS − d(u)(u S)] . (2.120)
34 2.2.5 Splitting the differential form divergence operator One may merge the splitting of the exterior derivative d and of the duality operation ∗ to obtain the splitting of the divergence operator δ, defined for a differential form S on spacetime by
δS = ∗d ∗S. (2.121)
This takes a more familiar form in index notation using the covariant derivative (assuming zero torsion)
(4) β [δS]α1...αp−1 = − ∇βS α1...αp−1 . (2.122)
The spatial divergence operator has an additional sign; for a spatial p-form S one has
δ(u)S = (−1)p∗(u)d(u)∗(u)S, β (2.123) [δ(u)S]α1...αp−1 = −∇(u)βS α1...αp−1 .
By manipulating the definitions and identities one finds the result
[δS](E) = −δ(u)S(E) − 2ω(u)] 2 S(M) , (2.124) [δS](M) = [δ(u)−??a(u) ]S(M) − ∗(u)£(u)u∗(u)S(E) .
In applying the last formula, the identity (2.100) is useful in re-expressing the term ∗(u)£(u)u∗(u)S, especially for a spatial 1-form S.
2.2.6 Spatial vector analysis For functions and spatial vector fields, one can introduce the covariant analogs of the gradient, curl and divergence operations in order to make contact with traditional vector analysis which takes place in Euclidean space. These spatial operations are defined in an obvious way in terms of the spatial covariant derivative. Let ∇~ (u) denote the covariant derivative operator with its differentiating index raised, i.e., [∇~ (u)f]α = ∇(u)αf. These operators are related to the spatial exterior derivative in the usual manner
~ ] gradu f = ∇(u)f = [d(u)f] ,
∗(u) [ ] curlu X = ∇~ (u) ×u X = [ (d(u)X )] , (2.125)
∗(u) ∗(u) [ divu X = ∇~ (u) ·u X = [d(u) X ] ,
where X is assumed to be a spatial vector field. However, nothing prevents their application to nonspatial fields. The spatial curl of an arbitrary vector field can be written in index form as
α αβγ (4) [curlu X] = η(u) ∇βXγ . (2.126)
Since many second rank tensors also enter into vector analysis in the form of linear transformations of vectors, one can follow the leftright arrow notation of Thorne et al [1986] for contravariant second rank spatial tensors ↔ α αβ γ [ S ·u X~ ] = S P (u)βγ X . (2.127) In the old-fashioned language such tensors are called “dyads”. Thus the spatial vorticity vector of u is just the spatial dual of half the contravariant form of the spatial exterior derivative of u, which defines the vorticity dyad ↔ω (u) = ω(u)] , (2.128) ∗(u)↔ 1 ~ω(u) = ω (u) = 2 curlu u . Allowing the spatial curl operator to act on the nonspatial vector u, one sees that the vorticity vector is half its curl. Expressing this in an orthonormal frame in terms of which u has a nonrelativistic relative spatial
35 velocity leads to the classical expression of nonrelativistic mechanics [Synge and Schild 1949]. With these conventions the linear transformation corresponding to the rotation tensor becomes
α β α β γ ω(u) X = −~ω(u) ×u X , ω(u) βX = −η βγ ω(u) X (2.129) when expressed in terms of the rotation vector. The spatial exterior derivative of the rotation 2-form itself as given above may also be rewritten in the 3-vector notation α α divu ~ω(u) = a(u) ·u ~ω(u) , ∇(u)αω(u) = a(u)αω(u) . (2.130) The identity for the exterior derivative of the acceleration 1-form becomes the following when rewritten
1 ∗(u) [ 2 curlu a(u) = [£(u)uω(u) ] = £(u)u~ω(u) + Θ(u)~ω(u) = ∇(fw)(u)~ω(u) − [θ(u) − P (u)Θ(u)] ~ω(u) , (2.131) 1 α 2 α α β 2 [curlu a(u)] = [∇(fw)(u) + 3 Θ(u)]~ω(u) − σ(u) βω(u) , where the expansion scalar appears from the commutation of the spatial Lie derivative and the spatial duality operation.
36 2.2.7 Ordinary and Co-rotating Fermi-Walker derivatives Suppose one refers to a tensor as being of “of type σ”, where σ is the gl(4,R)-representation corresponding to the GL(4,R)-representation under which the tensor transforms under a change of frame. Then the representation function σ is defined by
α... α γ... γ α... [σ(A)S] β... = A γ S β... + · · · − A βS γ... − · · · . (2.132) With this notation, the Fermi-Walker derivative along the observer congruence may be defined by
(4) (4) [ ∇(fw)(u) = ∇u − σ(u ∧ a(u) ) , (2.133) where the argument of σ is the antisymmetric tensor (mixed form)
[ α α α [u ∧ a(u) ] β = u a(u)β − a(u) uβ . (2.134)
Such an operator satisfies the obvious product laws with respect to tensor products and contractions thereof. It is easily verified the (4)g, (4)η, u, P (u), T (u), and all index-shifted variations of these tensor fields have an identically vanishing Fermi-Walker derivative. This means that the Fermi-Walker derivative commutes with the measurement process: the measurement of the Fermi-Walker derivative of a spacetime tensor yields the Fermi-Walker derivative acting on each member of the family of spatial tensors which represent that tensor. A tensor field is said to undergo Fermi-Walker transport along u if its Fermi-Walker derivative is zero, in analogy with parallel transport along u, to which it reduces for a geodesic congruence with vanishing acceleration. Thus (4)g, (4)η, u, P (u), and T (u) all undergo such Fermi-Walker transport, while only (4)g and (4)η are parallel transported along u. The effect of the difference term in the two derivatives is to generate an active boost in the velocity-acceleration plane which maps the parallel transport of u along u onto u itself. This same boost maps the parallel transport of any tensor field along u onto the Fermi-Walker transported tensor field along u. Both transports preserve inner products and commute with index-shifting and duality operations, but the Fermi-Walker derivative preserves the orthogonal projection and measurement operations associated with u as well. The Fermi-Walker derivative of a spatial tensor field is also spatial with respect to u. 1 In fact, the addition to the covariant derivative of a term −σ(A), where A is a 1 -tensor field, will generate a continuous Lorentz transformation relative to parallel transport along the curve or congruence as long as A is antisymmetric with respect to the metric, i.e., is the mixed form of a 2-form. Such a derivative will automatically commute with index shifting and its corresponding transport will preserve inner products, as does the Fermi-Walker derivative. If A is also spatial with respect to u then it will generate a continuous rotation of the local rest space of u while leaving u itself uneffected, and both u and u[ will still have vanishing derivative with respect to the new derivative (as will the metric and the spatial and temporal projection tensors). For a congruence of timelike curves, one may define a co-rotating Fermi-Walker derivative which further adapts the covariant derivative to the congruence by adding a term to the Fermi-Walker derivative which leads to a co-rotation of the local rest spaces relative to the congruence under the corresponding transport
(4) (4) ∇(cfw)(u) = ∇(fw)(u) + σ(ω(u)) (2.135) = (4)∇u − σ(u ∧ a(u)[ − ω(u)) .
A vector which is transported by the co-rotating Fermi-Walker transport has the same angular velocity as the relative velocity vector to be defined below, but does not exhibit the effects of the expansion and shear that the latter does. The spatial Fermi-Walker derivative has been introduced by spatially projecting the covariant derivative (4) (4) ∇u and not the Fermi-Walker derivative ∇(fw)(u), in order to have a temporal derivative one can apply (4) to u itself (since ∇(fw)(u)u = 0 while ∇(fw)(u)u = a(u))
(4) [ P (u) ∇(fw)(u) = ∇(fw)(u) − P (u)σ(u ∧ a(u) ) . (2.136)
37 The difference term only contributes to the derivative of nonspatial fields. In the same spirit one can define the spatial co-rotating Fermi-Walker derivative along u or equivalently the co-rotating Fermi-Walker temporal derivative along u ∇(cfw)(u) = ∇(fw)(u) + σ(ω(u)) . (2.137) This spatial derivative is related to its corresponding spacetime derivative in the same way as the ordinary Fermi-Walker derivatives are related
(4) [ P (u) ∇(cfw)(u) = ∇(cfw)(u) − P (u)σ(u ∧ a(u) ) . (2.138)
On spatial fields the spacetime and spatial ordinary and co-rotating Fermi-Walker derivatives agree, and so the measurement of the spacetime derivative leads to the action of the corresponding spatial derivative on each member of the family of spatial fields which represent a given spacetime field. With the present definitions, the derivatives of a multiple of u itself are
∇(cfw)(u)[fu] = ∇(fw)(u)[fu] = fa(u) , (2.139) which determines their action on the nonspatial parts of a tensor field. The alternative would be to define all the spatial operators by projection of the corresponding spacetime operators and introduce a new temporal derivative to handle nonspatial tensor fields (identical to the present definition of ∇(fw)(u)).
38 2.2.8 Relation between Lie and Fermi-Walker temporal derivatives The interpretation of the kinematical quanitites depends on the relationship between the Lie temporal derivative and the Fermi-Walker temporal derivatives. This in turn depends on the more general relationship between the Lie and covariant derivatives themselves. The formulas for the components of the covariant and Lie derivatives in a coordinate frame {eα} charac- α α terized by C βγ = ω ([eβ, eγ ]) = 0 are
(4) α... α... γ (4) α... [ ∇uS] β... = S β...,γ u + [σ( Γ(u))S] β... , α... α... γ α... [£uS] β... = S β...,γ u − [σ(∂u)S] β... , (2.140) α... γ (4) α... = S β...;γ u − [σ( ∇u)S] β... ,
(4) α α γ α α where the matrix arguments of σ are the suggestive abbreviations Γ(u) β = Γ γβu ,(∂u) β = u ,β, and (4) α α ( ∇u) β = u ;β. The last expression for the Lie derivative is true for an arbitrary frame and is the result of the “comma to semicolon rule” valid for any symmetric connection in a coordinate frame
£uS = (4)∇uS − σ((4)∇u)S. (2.141) Taking the spatial projection of this equation yields
(4) £(u)uS = ∇(fw)(u)S − P (u)σ( ∇u)S (2.142) for an arbitrary tensor but £(u)uS = ∇(fw)(u)S + σ(k(u))S (2.143) for a spatial tensor. The kinematical tensor k(u) thus acts as the linear transformation of the local rest space which describes the difference between spatial Lie transport and Fermi-Walker transport of spatial tensors along u. These two transports and their corresponding differential operators embody two possible choices for describing “evolution”, which might be called “Lie” and “Fermi-Walker” evolution respectively from their origins in derivative operators which correspond to those labels. In the absence of further structure, evolution can only be defined independently along each observer worldline and then only relative to a transport of the local rest space along each such worldline which defines “no evolution”. Note that for k(u) itself the difference between the spatial Fermi-Walker derivative and the spatial Lie derivative along u vanishes so
£(u)uk(u) = ∇(fw)(u)k(u) . (2.144) The Lie derivative along u has the following representation in terms of the ordinary and co-rotating Fermi-Walker derivatives
£u = (4)∇u − σ((4)∇u) = (4)∇u + σ(ω(u) − θ(u) + a(u) ⊗ u[) (4) [ = ∇(fw)(u) + σ(ω(u) − θ(u) + u ⊗ a(u) ) (2.145) (4) [ = ∇(cfw)(u) + σ(−θ(u) + u ⊗ a(u) ) . In the difference term relative to the co-rotating Fermi-Walker derivative, the expansion term in the Lie derivative generates a deformation of the local rest space of u while the acceleration term tilts the local rest space relative to u along the acceleration direction, breaking the orthogonality. Only in the case that u is a Killing vector field do both of these terms vanish
£u(4)g = 0 → θ(u) = 0 = a(u) , (2.146) in which case the Lie and co-rotating Fermi-Walker derivatives coincide. However, a unit timelike Killing vector field is perhaps only interesting in flat spacetime. Spatially projecting the representation of the Lie derivative along u in terms ∇(cfw)(u) leads to the following relation between that operator and the Lie temporal derivative
[ ∇(cfw)(u) = £(u)u + σ(θ(u)) − P (u)σ(a(u) ⊗ u ) . (2.147)
39 The last term only contributes for nonspatial fields. Thus when acting on spatial fields the three temporal derivatives are related by ∇ (u) = ∇ (u) + σ(ω(u)) (cfw) (fw) (2.148) = £(u)u + σ(θ(u)) . The co-rotating Fermi-Walker temporal derivative is therefore an obvious compromise between the Fermi- Walker temporal derivative (retaining the latter’s compatibility with spatial orthonormality) and the Lie temporal derivative (retaining the latter’s co-rotation feature). If one wants to use an orthonormal spatial frame, the best one can do to adapt it to the congruence is to transport it along the congruence by co-rotating Fermi-Walker transport, since under Lie transport, the frame will not remain orthonormal or spatial. Only when u is a Killing vector field is Lie transport compatible with orthonormality, since in this case it coincides with the co-rotating Fermi-Walker transport. An immediate consequence of the “comma to semicolon” formula for the Lie derivative is the following “product rule” £fuS = f£uS − σ(u ⊗ df)S, (2.149) with spatial projection £(u)fuS = f£(u)uS − P (u)σ(u ⊗ df)S. (2.150) For a stationary spacetime with a timelike Killing vector field ξ and its corresponding normalized 4-velocity u = `−1ξ, invariance under the symmetry group is associated with the Lie derivative by ξ and its spatial projection, not the corresponding operators for u. The previous identities together with d ln ` = a(u)[ then imply −1 £uS = ` £ξS + σ(u ⊗ a(u))S, −1 (2.151) £(u)uS = ` £(u)ξS + P (u)σ(u ⊗ a(u))S. These lead to the following relationships between the co-rotating Fermi-Walker derivative and co-rotating Fermi-Walker temporal derivative along u and the rescalings of the respective Killing vector field operators
(4) −1 ∇(cfw)(u) = ` £ξ , [ ∇(cfw)(u) = £(u)u − P (u)σ(a(u) ⊗ u ) (2.152) −1 [ = ` £(u)ξ − P (u)σ(a(u) ∧ u ) . Without stationarity, the co-rotating Fermi-Walker operators are the best that one can do to extend the corresponding Lie operators in the stationary case. Both the ordinary and co-rotating Fermi-Walker derivatives along u of a spatial tensor field reduce to the respective spatial derivatives of that tensor field. Thus the spacetime transport of a spatial tensor field along u for either of these two types of derivatives reduces to the corresponding spatial transport. The spatial Lie transport of spatial fields is rather simple. Lie transport a spatial tensor along the observer congruence using the 1-parameter group of diffeomorphisms generated by u and then spatially project it. If eα is any frame with e0 along u and {ea} Lie dragged along u, then {e0,P (u)ea} is an observer-adapted frame and {P (u)ea} is a spatial frame which undergoes spatial Lie transport. Any spacetime tensor which does not evolve in the Lie sense is represented by a family of spatial tensors which have components in such a spatial frame which are constant along the observer worldlines. Such tensors are “anchored” in the observer congruence. The spatial Lie derivative along u measures the rate at which a field is evolving in this sense. The Fermi-Walker evolution instead describes how a field changes with respect to what the observers see using an orthonormal spatial frame which is locally nonrotating, as determined by a set of torque-free gyros carried by the observer, rather than by comparing the field to the nearby observers seen by that observer. Fermi-Walker transport of the spatial frame along the observer congruence defines mathematically what locally nonrotating means. A spatial field which has constant components with respect to such a frame exhibits no evolution in the covariant sense. A spacetime tensor field which is Fermi-Walker transported along the observer congruence is represented by a family of spatial tensors which undergo spatial Fermi-Walker transport.
40 The interpretation of the kinematical quantities is associated with the comparison of the two kinds of evolution. These quantities describe the (limiting) relative motion of the nearby observers as seen by each observer moving through spacetime and as compared to a locally nonrotating orthonormal spatial frame. Physically an observer can assign a relative position vector in his local rest space to each neighboring observer at each moment of his proper time by using light signals, obtaining the relative distance from half the light travel time and the direction as an average direction. This relative position vector in the flat local rest space will undergo a time-dependent linear transformation compared to the locally nonrotating orthonormal spatial frame. The rate of change of this transformation is described by the kinematical tensor −k(u). Suppose X is a tangent vector which is Lie dragged along a given observer worldline. If X is sufficiently small, it has the interpretation of being a “connecting vector” whose tip lies on the worldline of some other fixed nearby observer worldline, identifying nearby points in the spacetime manifold with points in the tangent space. If it is initially spatial, it will not remain spatial since its tip moves at unit speed relative to the nearby observer’s proper time, while its initial point moves at unit speed relative to the given observer. Its spatial projection P (u)X after an interval ∆τu of the given observer’s proper time represents the observed position vector (“relative position vector”) of the nearby observer at the same value of the elapsed proper α time measured by that nearby observer, not by the given observer. The component along u, namely −uαX , represents the additional lapse of the given observer’s proper time necessary to reach the event at which P (u)X is the relative position vector. Neglecting this additional synchronization question helps define the spacetime neighborhood in which this discussion [Ehlers 1961, 1993, Ellis 1971] makes sense. Let Y = P (u)X be this relative position vector, which undergoes spatial Lie transport. Its spatial Fermi-Walker derivative (“relative velocity vector”) is therefore
∇(fw)(u)Y = −k(u) Y = [θ(u) − ω(u)] Y = ~ω(u) ×u Y + θ(u) Y. (2.153)
This shows that nearby observers appear to be rotating with angular velocity ~ω(u) and shearing and ex- panding by σ(u) and Θ(u) relative to locally nonrotating orthonormal axes. The co-rotating Fermi-Walker temporal derivative describes the deviation from pure rotation
∇(cfw)(u)Y = θ(u) Y. (2.154)
Thus the relative position vector fails to undergo spatial co-rotating Fermi-Walker transport along u only because of the effects of the expansion of the congruence. One can further decompose the relative position vector into a relative distance ||Y || and a relative direction vector Yˆ Y = ||Y ||Y.ˆ (2.155) This leads to a corresponding decomposition of the spatial Fermi-Walker derivative ˆ ˆ ˆ ˆ ˆ ∇(fw)(u)Y = ~ω(u) ×u Y + [σ(u) − σ(Y, Y )P (u)] Y, 1 (2.156) ∇ (u) ln ||Y || = Θ(u) + σ(Y,ˆ Yˆ ) . (fw) 3
The corresponding “relative acceleration” is obtained by taking one more spatial Fermi-Walker derivative
∇ (u)2Y = −[∇ (u)k(u)] Y − k(u) ∇ (u)Y (fw) (fw) (fw) (2.157) = −[∇(fw)(u)k(u) − k(u) k(u)] Y,
but since k(u) = −∇(u)u and ∇(fw)(u)u = a(u) this becomes
2 ∇(fw)(u) Y = {[∇(fw)(u), ∇(u)]u + k(u) k(u) + ∇(u)a(u)} Y. (2.158)
To finish this evaluation, one needs to consider the commutator of the spatial covariant derivative and the spatial Fermi-Walker derivative, which involves the curvature tensor through the Ricci identity. This will be postponed until the splitting of the curvature tensor is discussed.
41 On the other hand, a spatial vector Z which is not evolving relative to the locally nonrotating orthonormal spatial frame does evolve with respect to the observer congruence at a rate described by the kinematical tensor itself. If ∇(fw)(u)Z = 0 then
£(u)uZ = k(u) Z = −~ω(u) ×u Z − θ(u) Z. (2.159)
This would describe the evolution of the spin vector of a gyro carried by the observers relative to the observer congruence itself. Equivalently, the rotation tensor alone describes the evolution with respect to a co-rotating Fermi-Walker spatial frame ∇(cfw)(u)uZ = ω(u) Z = −~ω(u) ×u Z. (2.160)
42 2.2.9 Total spatial covariant derivatives Suppose one has a parametrized timelike or null curve c with parameter λ and tangent vector c0 in spacetime, thought of as the worldline of a test particle of nonzero rest mass m 6= 0 (m ˜ = 1) or of zero rest mass m = 0 (m ˜ = 0) respectively. It is then of interest to split the absolute or intrinsic derivative, here called the total (4) (4) covariant derivative operator D/dλ = ∇c0(λ) along this curve. This operator is discussed in section A.10 of the appendix. Before splitting this operator, it is useful to explore first its spatial projection and several related operators which are useful in representing the splitting. To identify this operator more explicitly, introduce the expanded notation (4)D(c0(λ))/dλ. In the case that c is a proper-time-parametrized worldline belonging to another observer congruence 0 (4) with 4-velocity U, with c (λ) = U ◦ c(λ), then this is just the splitting of the operator ∇U with respect −1 0 to u. However, if one uses the parameter λ = m τU , then the tangent vector c = P = mU equals the 4-momentum of the test particle, and one can handle the case of zero rest mass in a parallel fashion. Recall the notation of section (2.1.4). In both of these cases a new parametrization of the curve c can be defined which corresponds to the continuous sequence of proper times of the observers whose worldlines intersect this curve
dτ(P,u)/dλ = E(P, u) . (2.161)
Here the subscript notation on τ indicates the proper time with respect to u along P . For the casem ˜ = 1 of a timelike worldline, substituting m = 1 replaces P everywhere by the 4-velocity U, E(P, u) by the gamma factor γ(U, u), and the affine parameter λ by the proper time τU . This particular relation then connects the proper times dτ(U,u)/dτU = γ(U, u) , (2.162) corresponding to the Lorentz dilation of the test particle’s proper time in relative motion with respect to the observer congruence. It is convenient to follow the custom of not distinguishing the operator (4)D(c0(λ))/dλ which acts on (4) tensors defined only along the curve c from the operator ∇c0(λ) which acts on tensor fields defined in a neighborhood of c. When necessary they will be understood to apply to any smooth extension of a tensor along c to a tensor field, since only the values along the curve enter into these derivatives. From the total covariant derivative one may extract a derivative operator along the curve (equivalently, along P ) which preserves the spatial character of a tensor and can therefore be used in the representation of the splitting of the total covariant derivative of a spacetime tensor in terms of its representative spatial fields. The obvious way to do this is simply by spatial projection of the total covariant derivative
(4) (4) D(fw)(P, u)/dλ = P (u) D(P )/dλ = P (u) ∇P (2.163) = E(P, u)[∇(fw)(u) + ∇ν(P, u)] ,
and hence changing the parametrization to observer proper time
−1 D(fw)(P, u)/dτ(P,u) = E(P, u) D(fw)(P, u)/dλ (2.164) = ∇(fw)(u) + ∇ν(P, u) .
The qualifying notation (P, u) indicates the derivative along P as seen by u. However, two other useful possibilities exist. P itself can be split into spatial and temporal parts. Clearly one can use the spatial covariant derivative along the spatial part of P but one has the choice of the ordinary or co-rotating spatial Fermi-Walker derivative or the spatial Lie derivative along the temporal part. This leads to three different possibilities. One then has the option to rescale the derivative to correspond to the rate of change with respect to the observer proper time. Suppose S is a spatial tensor defined only along the worldline with tangent U. In order to discuss derivative operators along this worldline in terms of the three temporal derivatives {∇(tem)(u)}tem = fw,cfw,lie and the spatial covariant derivative ∇(u) already introduced for tensor fields, assume that S may be smoothly extended to a spatial field on (4)M. The three possible total spatial covariant derivatives that one can define
43 for such spatial tensors along the worldline can then be represented in terms of the following operators on the corresponding extended spatial tensor fields
D(tem)(P, u)/dλ = E(P, u)[∇(tem)(u) + ∇(u)ν(P, u)] (2.165) = E(P, u)∇(tem)(u) + ∇(u)p(P, u) , tem = fw,cfw,lie , which may then be rescaled to correspond to a derivative with respect to the observer proper time
−1 D(tem)(P, u)/dτ(P,u) = E(P, u) D(tem)(P, u)/dλ (2.166) = ∇(tem)(u) + ∇(u)ν(P, u) tem = fw,cfw,lie . These three operators will be called respectively the Lie, Fermi-Walker, and co-rotating Fermi-Walker total spatial covariant derivatives along P with respect to u. All of them, like the Lie and Fermi-Walker temporal derivatives and the spatial covariant derivative, may be applied to nonspatial tensor fields but are most useful for spatial tensor fields. When applied to functions, all of these operators reduce to the ordinary parameter derivative along the worldline if the function is only defined on the worldline or the ordinary derivative by P or E(P, u)−1P if the function is defined in a neighborhood of the worldline
D(tem)(P, u)f/dτ(P,u) ≡ df/dτ(P,u) = P f . (2.167) Since the spatial metric is covariant constant with respect to the spatial covariant derivative and has zero ordinary and co-rotating spatial Fermi-Walker derivative along u, the ordinary and co-rotating Fermi-Walker total spatial covariant derivatives of the spatial metric vanish, so that each of these derivatives commute with index-shifting on spatial fields. However, the Lie total spatial covariant derivative of the spatial metric, which coincides with its spatial Lie derivative, is not zero as long as the expansion tensor is nonvanishing
[ [ [ D(lie)(P, u)P (u) /dτ(P,u) = £(u)uP (u) = 2θ(u) . (2.168) Thus the Lie total spatial covariant derivative does not commute with index-shifting on spatial tensor fields and extra expansion terms must be taken into account when shifting indices. The co-rotating Fermi-Walker total spatial covariant derivative is a compromise between the Fermi- Walker and Lie total spatial covariant derivatives. It has the Fermi-Walker property of commuting with the shifting of indices but its associated transport has the same rotational properties as the Lie total spatial covariant derivative. One simply absorbs the expansion tensor into the latter derivative or equivalently the rotation tensor into the former to obtain this result
D(fw)(P, u)/dτ(P,u) + σ(ω(u)) = D(cfw)(P, u)/dτ(P,u) = D(lie)(P, u)/dτ(P,u) + σ(θ(u)) . (2.169) This new total covariant derivative was introduced by Massa [1974b, 1990]. The difference between the rotation and expansion tensors parametrizes the difference between the Lie and Fermi-Walker total spatial covariant derivatives
D(lie)(P, u)/dτ(P,u) − D(fw)(P, u)/dτ(P,u) = σ(k(u)) . (2.170) The Lie total spatial covariant derivative was used by Zel’manov [1956] and Cattaneo [1958]. Every timelike worldline with 4-velocity U has one spatial field defined along it, namely its relative velocity vector ν(U, u) as seen by u. The temporal derivative of this relative velocity defines the relative acceleration of U. Introduce therefore the ordinary and co-rotating and the Lie relative accelerations of U with respect to u by a (U, u) = D (U, u)ν(U, u)/dτ , (tem) (tem) (U,u) (2.171) tem = fw,cfw,lie . These are related to each other in the same way as the corresponding derivative operators
a(cfw)(U, u) = a(lie)(U, u) + θ(u) ν(U, u)
= a(fw)(U, u) + ω(u) ν(U, u) (2.172)
= a(fw)(U, u) − ~ω(u) ×u ν(U, u) .
44 The relative acceleration may in turn be related to the temporal derivative of the spatial momentum (per unit mass)p ˜(U, u) = γν(U, u) using the product rule and explicitly differentiating the expression for the gamma factor
γ = [1 − ||ν(U, u)||2]−1/2 , 2 1 [ (2.173) d ln γ/dτ(U,u) = γ ν(U, u) ·u [a(tem)(U, u) + 2 D(tem)(U, u)P (u) /dτ(U,u) ν(U, u)] , tem = fw,cfw,lie to obtain
D(tem)(U, u)˜p(U, u)/dτ(U,u) = γ[D(tem)(U, u)˜ν(U, u)/dτ(U,u) + d ln γ/dτ(U,u) ν(U, u)]
= γ[P (u, U) P (U, u) a(tem)(U, u) (2.174) 1 2 [ + 2 γ D(tem)(U, u)P (u) /dτ(U,u)(ν(U, u), ν(U, u)) ν(U, u)] .
Only in the Lie case is the derivative of the spatial metric nonzero, producing the expansion tensor θ(u)[ = 1 [ 2 D(lie)(U, u)P (u) /dτ(U,u). In all cases this relationship may be inverted to give the relative acceleration as a function of the temporal derivative of the spatial momentum either from the previous equation using equation (2.57) or by directly differentiating γ−1p˜(U, u). The result is
−1 −1 a(tem)(U, u) = γ Pu(U, u) D(tem)(U, u)˜p(U, u)/dτ(U,u) (2.175) 1 2 [ − 2 γ D(tem)(U, u)P (u) /dτ(U,u)(ν(U, u), ν(U, u)) ν(U, u) .
45 2.2.10 Splitting the total covariant derivative Consider the splitting of the total covariant derivative along an arbitrary timelike or null parametrized curve in spacetime representing the worldline of a test particle of nonzero or zero rest mass m respectively with 4-momentum P and associated parameter λ. Although one could derive formulas for the splitting of this operator acting on arbitrary tensors defined along the curve, the main application will be for a vector defined along the curve. Let Y = Y >(Y, u) + Y~ (Y, u) , Y~ (Y, u) ≡ P (u)Y (2.176) be such a vector. Recalling the splitting of P P = E(P, u)[u + ν(P, u)] , (2.177) and the Fermi-Walker total spatial covariant derivative of u
D(fw)(P, u)u/dτ(P,u) = [∇(fw)(u) + ∇(u)ν(P, u)]u = a(u) − k(u) ν(P, u) , (2.178) one finds
(4) −1 (4) P (u) D(P )Y/dτ(P,u) = E(P, u) P (u) D(P )Y/dτ(P,U) = D(fw)(P, u)Y/dτ(P,U) (2.179) > ~ = Y (Y,U)D(fw)(P, u)u/dτ(P,U) + D(fw)(P, u)Y (Y, u)/dτ(P,U) for the spatial projection. For the temporal projection recall that
(4) α α β α β α β ∇P u = u ;βP = −a(u) uβP − k(u) βP α α β = E(P, u)[a(u) − k(u) βν(P, u) ] (2.180)
= E(P, u)D(fw)(P, u)u/dτ(P,u)
and (4) > α(4) (4) α (4) α [ ∇P Y ] = −u ∇P Yα = ∇P [−u Yα] + Yα ∇P u > ~ [ = PY (Y, u) + E(P, u)Y (Y, u) D(fw)(P, u)u/dτ(P,u) (2.181) > ~ [ = E(P, u)[dY (Y,U)/dτ(P,u) + Y (Y, u) D(fw)(P, u)u/dτ(P,u)] so (4) > −1 (4) > [ D(P )Y/dτ(P,u)] = E(P, u) [ D(P )Y/dλ] (2.182) > ~ = dY (Y,U)/dτ(P,u) + Y (Y, u) ·u D(fw)(P, u)u/dτ(P,u) . In each case the splitting of the total covariant derivative leads to the Fermi-Walker total spatial covariant derivative of the splitting fields plus a term involving the Fermi-Walker total spatial covariant derivative of the observer 4-velocity, which introduces kinematical terms. The same result applies to the covector Y [ since index shifting commutes with these operators. This allows one to generate a formula for the splitting of the total covariant derivative of any tensor.
46 2.3 Observer-adapted frame derivatives
The previous section does not rely on the introduction of an explicit spatial frame to perform calculations. However, spatial frames can be quite useful in making calculations much more concrete and automatic. This is especially true in splitting spacetime fields involving derivatives, where the use of an observer-adapted spacetime frame makes calculations almost routine.
2.3.1 Natural frame derivatives It is convenient to have several notations for the frame derivatives of functions (including component func- tions) eαf = ∂αf = f,α . (2.183) α The commutators of these derivatives define the structure functions C βγ of the frame {eα}
γ α 1 α β γ [eα, eβ] = C αβeγ , dω = − 2 C βγ ω ∧ ω , (2.184) or γ β β γ £eα eβ = C αβeγ , £eα ω = −C αγ ω . (2.185) These functions satisfy the Jacobi identity
0 = [eα, [eβ, eγ ]] + [eβ, [eγ , eα]] + [eγ , [eα, eβ]] δ δ (2.186) = 2(∂[αC βγ] − C [αC βγ])eδ , equivalent to d2ωα = 0. The structure functions for an observer-adapted frame
c 0 a 1 a b c a 0 b [ea, eb] = C abec + C abe0 , dω = − C bcω ∧ ω − C 0bω ∧ ω , 2 (2.187) b 0 0 1 0 a b 0 0 a [e0, ea] = C 0aeb + C 0ae0 , dω = − 2 C abω ∧ ω − C 0aω ∧ ω , may be interpreted in terms of the geometry of the orthogonal decomposition, using only noncovariant deriva- tives. The purely spatial structure functions govern the spatial Lie bracket and spatial exterior derivative for purely spatial fields c [ea, eb](u) = £(u)e eb = C abec , a (2.188) c 1 c a b d(u)ω = − 2 C abω ∧ ω , leading to the expressions
b a b a c a b [X,Y ](u) = (X ∂bY − Y ∂bX + C abX Y )ea , (2.189) 1 c a b d(u)σ = 2 (∂aσb − ∂bσa − σcC ab)ω ∧ ω , for a spatial vector fields and 1-forms. 0 The structure functions C ab are proportional to the components of the rotation 2-form
[ 1 [ 1 0 1 0 a b ω(u) = d(u)u = − Ld(u)ω = LC abω ∧ ω , 2 2 4 (2.190) 1 0 ω(u)ab = 2 LC ab . These structure functions, or equivalently the rotation, directly measure the failure of the spatial frame vector fields to be hypersurface-forming since in the latter case their Lie brackets would again lie in the distribution determining the hypersurfaces, i.e., would again be spatial vector fields
[X,Y ] = [X,Y ](u) + 2ω(u)[(X,Y )u (2.191)
Similarly −1 −1 b £(u)uea = L £(u)e ea = L C 0aeb , 0 (2.192) a a −1 a b £(u)uω = u dω = −L C 0bω ,
47 b show that the structure functions C 0a directly describe the spatial Lie derivatives along u of the spatial frame and spatial dual frame and which appear in the component expression for the expansion tensor
1 −1 θ(u)ab = [£uP (u)]ab = (2L) [hab,0 − 2C(a|0|b)] , 2 (2.193) c 1/2 −1 c Θ(u) = θ(u) c = − Tr k(u) = £(u)u ln h − L C 0c
as follows from its Lie derivative definition. For an orthonormal spatial frame, as used by Durrer and Straumann [1988a,b, 1989] in the slicing point of view, for example, then hab = δab and hab,0 = 0, so that the symmetric combination of these structure functions alone determine the expansion tensor. 0 The structure functions C 0a enter into the expression for the acceleration of the congruence
[ [ 0 a a(u) = £(u)uu = d(u) ln L + C 0aω , (2.194)
and directly give the spatial acceleration components in a partially-normalized observer-adapted frame (for which L = 1). For a given observer congruence, one may impose conditions on the structure functions which do not directly represent kinematical quantities, provided that they are compatible with the Jacobi identities. The a structure functions C 0b determine how the spatial frame is transported along the observer congruence. Introducing the quantities b ∇(tem)(u)ea = C(tem)(u) aeb , (2.195) one finds b −1 b C(lie)(u) a = L C 0a , b −1 b b C(cfw)(u) a = L C 0a + θ(u) a , (2.196) b −1 b b b C(fw)(u) a = L C 0a + θ(u) a − ω(u) a . a Setting the matrix C(tem)(u) b to zero for each of the three choices in turn respectively defines the spatial frame’s spatial Lie transport, its co-rotating Fermi-Walker transport, and its Fermi-Walker transport along u. An observer-adapted frame for which one of these three conditions hold will be referred to respectively as a spatially-comoving observer-adapted frame, a spatially co-rotating observer-adapted frame or a locally nonrotating observer-adapted frame. For the spatially-comoving observer-adapted frame, letting (α, β, γ, δ) = (0, b, c, d) in 2.186 reduces it to d a C bc,0 = 0 under the condition C 0b = 0, showing that the spatial structure functions are constant along the congruence for this class of frame. That such a frame always exists is clear since one can always take any partially-observer-adapted frame for which e0 is along u and the spatial frame is Lie dragged along e0 (a comoving partially-observer-adapted frame) and simply spatially project that spatial frame. Similarly one can always choose a spatially-holonomic observer-adapted frame by taking comoving coordinates for the congruence and spatially projecting the coordinate frame vector fields for the three coordinates which a parametrize the congruence. Such a frame is then also spatially comoving and has C bc = 0, two conditions which make it the simplest choice of frame in which to perform certain calculations. In a spatially-comoving observer-adapted frame, the spatial Lie derivative of a spatial tensor with respect to either e0 or u just reduces to the ordinary derivative by these vector fields of the components. Introduce the spatial tensor C(u) a a C(u) bc = C bc (2.197) whose spatial components in the observer-adapted frame equal the corresponding structure functions. One may then freely shift the indices on C(u) unambiguously with the spatial metric. This tensor represents the freedom left in the choice of the spatial distribution of the spatial frame, once the evolution along the observer congruence has been fixed.
2.3.2 Splitting the connection coefficients So far, a temporal and a spatial covariant derivative operator have been extracted from the spacetime covariant derivative, together with the kinematical quantities of the observer congruence. Each of the derivative operators, namely the spatial Fermi-Walker derivative and the spatial covariant derivative, may
48 act on any spacetime tensor fields, but preserve the spatial property of spatial fields. These operators, together with the spatial kinematical fields, are enough to express the measurement of the spacetime covariant derivative of a spacetime tensor in terms of spatial operators acting on the spatial fields which represent that tensor. The simplest way to obtain such measurement formulas is to use a spatially-comoving spatially-holonomic observer-adapted frame and simply evaluate the standard formulas (see Appendix A) for the components of the connection and of the covariant derivative of those spacetime tensors which are of interest. Such a frame is used as a fundamental tool in the approaches of both Zel’manov [1956] and Cattaneo [1958,1960] to the threading point of view. The components of the spacetime metric connection with respect to a frame {eα} will here be defined with the ordering of the covariant components following the “del convention” of Hawking and Ellis [1977]
(4) α α (4) Γ βγ = ω ( ∇eβ eγ ) (2.198)
(4) α α δ rather than the “semi-colon convention” Γ βγ = ω (eβ;δeγ ) of Misner, Thorne and Wheeler [1973]. Using the notation
A{δβγ}− = Aδβγ − Aβγδ + Aγδβ . (2.199) and shifting indices on the structure functions with the frame component matrix of the spacetime metric, the connection components can be written
(4) α 1 (4) αδ (4) Γ βγ = 2 g ( g{δβ,γ}− + C{δβγ}− ) . (2.200) 1 Consider the 2 -tensor (4) α β γ Γ βγ eα ⊗ ω ⊗ ω (2.201) which represents the connection coefficients in a particular choice of frame and of course changes to a different tensor under a change of frame, according to the inhomogeneous transformation law for the connection coefficients. This tensor can be split as any other, but the results will be frame dependent like the tensor itself. However, if the frame is at least a partially-observer-adapted frame, then some of the splitting information is frame independent and one can interpret some of the fields which result from the splitting in a geometric way, similar to the interpretation of the structure functions of an observer-adapted frame. The more general partially-observer-adapted frames will be considered below in the discussion of a nonlinear reference frame. Here attention will be restricted to the class of observer-adapted frames.
2.3.3 Observer-adapted connection components Measurement of the connection coefficient “tensor” in the observer-adapted frame leads to the splitting
(4) α (4) 0 (4) 0 (4) 0 (4) a (4) 0 (4) a (4) a (4) a Γ βγ ↔ Γ 00; Γ a0, Γ 0a, Γ 00; Γ ab, Γ 0b, Γ b0; Γ bc (2.202) (4) > (4) > (4) > (4) a (4) > (4) a (4) a (4) a ↔ Γ >>; Γ a>, Γ >a, Γ >>; Γ ab, Γ >b, Γ b>; Γ bc
> where the 0 indices have been rescaled by L to correspond to components along e> and ω . These fields are also connected closely with the geometry of the congruence
(4) > Γ >> = £(u)u(ln L) , (4) a a Γ >> = a(u) , (4) > Γ a> = [d(u)(ln L)]a , (4) > Γ >a = a(u)a , (2.203) (4) > Γ ab = −k(u)ba , (4) a a a Γ >b = −k(u) b + C(lie)(u) b , (4) a a Γ b> = −k(u) b , (4) a a Γ bc = Γ(u) bc .
49 The spatial components of the spacetime connection equal the components of the spatial part of the spatial connection ∇(u) ◦ P (u)
c ∇(u)e eb = Γ(u) ab , a (2.204) a 1 ad Γ(u) bc = 2 h (h{db,c}− + C(u){dbc}− ) , For the hypersurface point of view, one makes the replacements u → n, L → N, > → ⊥ for “perpendicular” to n, ω(u) → 0, and k(u) → K(n), the extrinsic curvature tensor. The spatial part of the spatial connection is then equivalent to the connection of the induced metric on each slice. Only when L = 1 do the above formulas give the components of the connection in the partially-normalized (4) > (4) > frame. This distinction only matters for the explicitly L-dependent fields Γ >> and Γ a>, which are (4) associated with the threading parametrization. These two fields are not components of ∇e> as conventional −1(4) notation would imply, but rather of L ∇e0, unless L = 1 when e0 = e>, in which case they vanish. Otherwise they serve the purpose of converting temporal and spatial derivatives of 0-indexed components to derivatives of the corresponding >-indexed components. The purely spatial components determine the spatial part of the spatial connection while the remaining connection components dependent only on the kinematical a quantities, which determine the temporal part of the spatial connection. The contribution C(lie)(u) b to the (4) a field Γ >b comes from the spatial Lie derivatives of the spatial frame vectors and depends on their choice. Using a partially-normalized spatially-comoving spatially-holonomic observer-adapted frame
a a L = 1 ,C 0b = 0 = C bc , (2.205) eliminates the “gauge terms” which depend on the choice of e0 and the spatial frame, making it the most convenient frame to use in splitting the covariant derivative of a tensor field, leading to a family of spatial fields which arise from the original family representing the split of the tensor itself by a linear operation involving the spatial covariant derivative, the spatial Lie derivative along u and the kinematical quantities of the congruence. This was already done for the geodesic operator without the aid of such a frame. Furthermore, the spatial Lie derivative may be replaced by the spatial Fermi-Walker derivative plus a kinematical term to yield alternative expressions for the measured fields. The resulting formulas, in either case, are valid in any observer-adapted frame, the above “gauge terms” entering into the expressions for the observer-adapted components of the spatial Lie and covariant derivatives.
2.3.4 Splitting covariant derivatives 1 It is quite useful to explicitly consider splitting the covariant derivatives of vector fields, 1-forms and 1 - α > a α tensor fields. A vector field X splits into the family {X ,X } and its covariant derivative X ;β splits into the family > > d X ;> = £(u)uX + a(u)dX , a a a d a > X ;> = £(u)uX − k(u) dX + a(u) X , (2.206) > > d X ;a = ∇(u)aX − k(u)daX , a a a > X ;b = ∇(u)bX − k(u) bX , with trace α (4) > c X ;α = div X = {£(u)u + Θ(u)}X + {∇(u)c + a(u)c}X . (2.207) In these expressions and in similar ones below, the symbols £(u)uX> and £(u)uXa indicate respectively the spatial Lie derivative of a scalar X> and the spatial components of the spatial Lie derivative of the spatial vector Xa. The same applies to the spatial covariant derivative. In other words scalar projections along e> are intended before the action of spatial derivative operators. If X and Y are spatial vector fields, then these formulas imply the result
(4) ∇X Y − ∇(u)X Y = −k(u)(Y,X) e> = [θ(u)(Y,X) − ω(u)(Y,X)]e> . (2.208) Thus k(u) describes the difference between spatial and spacetime differentiation in the local rest space of the observer. When the vorticity ω(u) is zero, then k(u) reduces to the extrinsic curvature tensor −θ(u)
50 of the hypersurfaces orthogonal to the observer family, namely the sign-reversed second fundamental form of those hypersurfaces. As one parallel transports Y along X in the local rest space using the spatial connection, −k(u)(Y,X) describes the spacetime rate of change of the component of the transported field along the direction of the observer 4-velocity necessary to keep Y spatial compared to the spacetime parallel transported field. α... It is worth noting that for any tensor S β... one has
−1/2 1/2 −1 c {£(u)u + Θ(u)}S = h £(u)u(h S) − L C 0cS, (2.209) which gives an alternative expression for the temporal part of the spacetime divergence, the second term of which vanishes in a spatially-comoving frame. The spatial part on the other hand is the spacetime divergence of the spatial part of the vector field α c [P (u)X] ;α = {∇(u)c + a(u)c}X , (2.210) leading to the addition of the acceleration term to the spatial divergence. Such formulas can also be obtained without using an adapted frame by relying only on the properties of the spatial projection operator, as described by Ehlers [1961, 1993] and Ellis [1971]. For example
(4) α (4) α β α (4) β β(4) α ∇αX = ∇α(P (u) βX ) = P (u) β ∇αX + X ∇αP (u) β (2.211) β γ β (4) α β = ∇(u)βX + X P (u) γ ∇α(u uβ) = [∇(u)β + a(u)β]X illustrates this technique for a spatial vector field X. For a 1-form Xα represented by the spatial fields {X>,Xc}, the covariant derivative Xα;β has the following splitting d X>;> = £(u)uX> − Xda(u) , d Xa;> = £(u)uXa + Xdk(u) a − X>a(u)a , (2.212) d X>;a = ∇(u)aX> + Xdk(u) a ,
Xa;b = ∇(u)bXa + X>k(u)ba . Twice the symmetrized covariant derivative of the 1-form equals the Lie derivative of the spacetime metric with respect to its corresponding vector field
(4) d [£X g]>> = 2£(u)uX> − 2Xda(u) , (4) b [£X g]>a = {∇(u)a − a(u)a}X> + £(u)uXa + 2Xbk(u) a , b b (2.213) = {∇(u)a − a(u)a}X> + hab£(u)uX + 2Xbω(u) a , (4) [£X g]ab = £(u)P (u)XP (u)ab − 2X>θ(u)ab = £(u)XP (u)ab , where £(u)P (u)XP (u)ab = 2∇(u)(aXb) , (2.214) and b b £(u)uXa − 2Xbθ(u) a = hab£(u)uX . (2.215) 1 α Finally consider splitting the covariant derivative of the 1 -tensor S β > > d > d S >;> = £(u)uS > + a(u)dS > − S da(u) , a a a d a d S >;> = £(u)uS > − k(u) dS > − S da(u) , > > > d > d S a;> = £(u)uS a + S dk(u) a − a(u)aS > + a(u)dS a , > > d > d S >;a = ∇(u)aS > − k(u)adS > + S dk(u) a , (2.216) a a a d a d a > a S b;> = £(u)uS b − k(u) dS b + S dk(u) b + a(u) S b − S >a(u)b , a a a d > a S >;b = ∇(u)bS > + S dk(u) b − S >k(u) b , > > > d S a;b = ∇(u)bS a + S >k(u)ba − k(u)bdS a , a a a > a S b;c = ∇(u)cS b − k(u) cS b + S >k(u)bc ,
51 with trace α > c c d c > S >;α = [£(u)u + Θ(u)]S > + [∇(u)c + a(u)c]S > + S dk(u) c − a(u) S c , α > > c c > S a;α = [£(u)u + Θ(u)]S a + S ck(u) a + S >k(u)ca − a(u)aS > (2.217) c + [∇(u)c + a(u)c]S a . Notice that the temporal derivative which occurs in the divergence is the Θ(u)-augmented Lie derivative, while the spatial divergence occurs in each case accompanied by an acceleration term to form the spacetime a divergence of the spatial field S > and the spatial projection of the spacetime divergence of the spatial field a S b. These formulas have interesting special cases for symmetric and antisymmetric tensors. For a symmetric tensor one has α > c c d S >;α = [£(u)u + Θ(u)]S > + [∇(u)c + 2a(u)c]S > − S dθ(u) c , (2.218) α > > c S a;α = [£(u)u + Θ(u)]S a − a(u)aS > + [∇(u)c + a(u)c]S a . while for an antisymmetric tensor one has
α c cab (E)c ∗(u) (M) ] S >;α = ∇(u)cS > − η(u) ω(u)cSab = ∇(u)cS − 2~ω(u) ·u [ S ] , α (E)c (M)c S a;α = hac[£(u)u + Θ(u)]S + [∇(u)c + a(u)c]S a (2.219)
(E)] b ~ ∗(u) (M) ] = hab[£(u)u + Θ(u)]S + {[∇(u) + a(u)] ×u [ S ] }a .
Note that analogous to the relation for a spatial vector field, the spatial projection of the spacetime 1 divergence and the spatial divergence of a 1 -tensor differ by an acceleration term γ c γ X ;γ = [∇(u)c + a(u)c]X = [∇(u)γ + a(u)γ ]X , γ c δ γ γ (2.220) Xa ;γ = [∇(u)c + a(u)c]Xa , or P (u) αXδ ;γ = [∇(u)γ + a(u)γ ]Xα , while for a spatial 2-form one has
γ ∗(u) ] Xa ;γ = {[∇~ (u) + a(u)] ×u X }a , (2.221) leading to the acceleration-augmented curl of its spatial dual which represents the spatial projection of the spacetime divergence. In the case of a stationary spacetime, where one may define u by normalizing a timelike Killing vector −1 (4) field u = ` ξ, then the Killing equation £ξ g = 0 may be expressed in terms of u using the splitting of (4) > a £X g given above with X = ξ, X = ` 6= 0 and X = 0. This immediately yields three conditions on u (the spatial scalar, spatial vector and spatial tensor equations which represent the splitting of the Killing equation) £(u)u` = 0 , a(u)a = ∇(u)a ln ` , θ(u)ab = 0 . (2.222) The normalization factor must be a stationary function, the logarithm of which serves as an acceleration potential (modulo sign conventions), and the expansion tensor must vanish.
2.3.5 Observer-adapted components of total spatial covariant derivatives Given the definitions (2.195) and (2.204), the total spatial covariant derivative of the spatial frame along the worldline of a test particle with 4-momentum P is given by
D(tem)(U, u)ea/dτ(P,u) = [∇(tem)(u) + ∇(u)ν]ea b b c = [C (u) a + Γ(u) caν(P, u) ]eb (tem) (2.223) b b = [C(tem)(u) a + ωωω(u)(ν(P, u)) a]eb , tem = fw,cfw,lie .
b where (ωωω(u)(ν(P, u)) a) is the matrix of spatial connection 1-forms
b b c ωωω(u) a = Γ(u) caω (2.224)
52 evaluated on the relative velocity vector. The spatial connection 1-form is discussed below in section (2.5.6). From the appropriate product rule, one then has for a spatial vector field the expression
a a a b a b c D(tem)(P, u)X /dτ(P,u) = dX /dτ(P,u) + C(tem)(u) bX + Γ(u) bcν(P, u) X , (2.225) or in general for a spatial tensor field
a... D(tem)(P, u)S b.../dτ(P,u) a... a... (2.226) = dS b.../dτ(P,u) + [σ([C(tem)(u) + ωωω(u)(ν(P, u)] X)S] b... . When the spatial frame undergoes the corresponding transport along the observer congruence, the term a involving C(tem)(u) b = 0 vanishes and the total spatial covariant derivative of that type reduces to a simpler expression. One may find the Lie total spatial covariant derivative in this form mentioned by Møller [1952] and in subsequent work by Zel’manov [1956] and Cattaneo [1958], all of whom use spatially-comoving spatially-holonomic spatial frames. In general only the ordinary and co-rotating Fermi-Walker transport conditions for the spatial frame along u are compatible with the condition of orthonormality. An orthonormal spatial frame of this latter type has an arbitrary distribution of orientation on a given hypersurface transversal to the observer congruence, but its subsequent behavior away from that hypersurface along the observer congruence is completely fixed by the transport condition. For a given choice of such a frame, one may examine the relative rotation of a spatial frame which is transported along an arbitrary timelike worldline by the corresponding transport, which accomplished by requiring that the corresponding total covariant derivative of the frame along the worldline vanish. Suppose {ea} is any spatial frame defined on some fixed hypersurface Σ transversal to the observer congruence. One can extend it to a spatial frame on the spacetime by transporting it along the congruence by one of the three spatial transport conditions
a ∇(tem)(u)ea = 0 ↔ C(tem)(u) b = 0 . (2.227)
One can also spatially transport it along an arbitrary timelike or null worldline from its value at the inter- section of that worldline with the hypersurface to define a spatial frame along that single worldline
D(tem)(P, u)E(P, u, e)a = 0 . (2.228)
The two such frames along the worldline are linearly related
b E(tem)(P, u, e)a = R(tem)(P, u, e) aeb . (2.229)
If {ea} is an orthonormal spatial frame on Σ, then in general only in the ordinary and co-rotating Fermi- Walker cases will it remain orthonormal as it is extended along the observer congruence to a spatial frame on spacetime, as is the case for its corresponding spatial transport along the single given worldline. In these cases the matrix R(fw)(P, u, e) or R(cfw)(P, u, e) respectively will be an orthogonal matrix describing the relative rotation of the orthonormal spatial frame transported along the worldline with respect to the given orthonormal spatial frame on spacetime. a Suppose X = X ea undergoes the corresponding transport along the given worldline
a a a b 0 = D(tem)(P, u)X /dτ(P,u) = dX /dτ(P,u) + ωωω(ν(P, u)) bX (2.230)
or a a b dX /dτ(P,u) = −ωωω(u)(ν(P, u)) bX . (2.231)
Since {ea} is an orthonormal spatial frame, its connection 1-form is antisymmetric in its tensor-valued indices (see section (A.20) of the appendix), so its spatial dual defines a vector-valued 1-form whose value on the relative velocity defines the relative angular velocity of the two orthonormal frames {ea} and {E(P, u)a} along the worldline ∗(u) ] ζ(sc)(P, u, e) = ωωω(u)(ν(P, u)) , (2.232)
53 or a 1 abc ζ(sc)(P, u, e) = 2 η(u) ωωω(u)(ν(P, u))bc , a a c ωωω(u)(ν(P, u)) b = η(u) bcζ(sc)(P, u, e) , (2.233) tem = fw,cfw . The transport condition for X then takes the more familiar form
a a b c a dX /dτ(P,u) = η(u) bcζ(sc)(P, u, e) X = [ζ(P, u, e) ×u X] . (2.234)
Letting X = E(tem)(P, u, e)a leads to the relationship between the relative rotation and the relative angular velocity a −1 c a c dR(tem)(P, u, e) c/dτ(P,u) R(tem)(P, u, e) b = η(u) cbζ(sc)(P, u, e) . (2.235)
This determines the finite rotation of the frame {E(tem)(P, u, e)a} defined along the worldline with 4- momentum P with respect to the frame {ea} restricted to this worldline. Note that relative rotation along the worldline of the co-rotating Fermi-Walker transported spatial frame with respect to the Fermi-Walker transported spatial frame is related to the vorticity vector ~ω(u) in the same way that R(tem)(P, u) is related to the relative angular velocity ζ(tem)(P, u) since
a a a b D(cfw)(P, u)X /dτ(P,u) = D(fw)(P, u)X /dτ(P,u) + ω(u) bX (2.236) and ~ω(u) is the spatial dual of ω(u)]. The interpretation of the rotation R(tem)(P, u) is clear. The initial spatial distribution of the orientation of ea on the hypersurface Σ is completely arbitrary and this arbitrariness is propagated along the observer congruence by the choice of either ordinary or co-rotating spatial Fermi-Walker transport. This rotation removes the arbitrariness along the given worldline. Of course different paths between the same pair of points in spacetime will in general lead to different results. This is a manifestation of spatial curvature for the given observer congruence.
54 2.4 Relative kinematics: applications
There are two interesting applications of the splitting of the total covariant derivative along an arbitrary timelike or null parametrized curve in spacetime representing the worldline of a test particle of nonzero or zero rest mass m respectively with 4-momentum P and associated parameter λ. The first
(4) α D(P )P/dλ = mf(P ) , f(P )αP = 0 , (2.237) is the “acceleration equals force per unit mass” equation for a timelike worldline with 4-velocity U = m−1P
(4) (4) ˜ D(U)P/dτU = f(U) ↔ D(U)U/dτU = f(U) , (2.238) or the affinely parametrized geodesic equation for a null worldline (m = 0)
(4)D(P )P/dλ = 0 . (2.239)
The former reduces to the geodesic equation in the absense of an applied force, which when nonzero must be orthogonal to U in order to maintain the unit character of the 4-velocity
(4) α α(4) α ˜ 0 = D(U)[U Uα]/dτU = 2U DUα/τU = 2U f(U)α . (2.240)
These two variations of original equation will both be referred to as the acceleration equation. The second is the equation for the Fermi-Walker transport of a vector along an arbitrary timelike worldline to which it is orthogonal, which describes the behavior of the spin vector of a torque-free test gyro carried along that worldline or the spin of a spinning test particle following that worldline
α ∇(fw)(U)S = 0 ,UαS = 0 . (2.241)
This will be referred to simply as the spin transport equation.
2.4.1 Splitting the acceleration equation Letting Y = P in subsection (2.2.10) leads to
(4) P (u) D(P )P/dτ(P,u) = E(P, u)D(fw)(P, u)u/dτ(P,u) + D(fw)(P, u)p(Y, u)/dτ(P,u) , (2.242) (4) > [ D(P )P/dτ(P,u)] = dE(P, u)/dτ(P,u) + p(Y, u) ·u D(fw)(P, u)u/dτ(P,u) .
Consider first the timelike case, where one can use the unit 4-velocity U instead of the 4-momentum P and (4) D(U)U/dτU ≡ a(U) (2.243) defines the 4-acceleration. The acceleration vector and hence the force vector must be spatial with respect to U, so their temporal components are determined by the orthogonality condition
a(U) = γ[A(U, u)u + A(U, u)] , A(U, u) = A(U, u) ·u ν(U, u) , (2.244) f(U) = γ[℘(U, u)u + F (U, u)] , ℘(U, u) = F (U, u) ·u ν(U, u) .
Here ℘(U, u) is the power and F (U, u) the spatial force relative to the observer congruence as defined in special relativity [Anderson 1967]. These are defined respectively so that the appropriate time derivatives of the spatial momentum and energy as seen by the observer equal respectively the spatial force and power. Dividing out the mass parameter in the above equations (2.242) describes the splitting of the acceleration vector A(U, u) = γD(fw)(U, u)u/dτ(U,u) + D(fw)(U, u)˜p(U, u)/dτ(U,u) , (2.245) ˜ A(U, u) = dE(U, u)/dτ(U,u) +p ˜(U, u) ·u D(fw)(U, u)u/dτ(U,u) .
55 Having split both the acceleration and the force, equating their spatial and temporal parts leads to the splitting of the acceleration equation. With this in mind, the following preliminary definition proves useful ˜(G) F (U, u) = −γD(fw)(U, u)u/dτ(U,u) (fw) (2.246) = γ[−a(u) + k(u) ν(P, u)] , so that ˜(G) A(U, u) = D(fw)(U, u)˜p(U, u)/dτ(U,u) − F (fw)(U, u) , ˜(G) (2.247) ≡ D(tem)(U, u)˜p(U, u)/dτ(U,u) − F (tem)(U, u) , tem = fw,cfw,lie . Equating A(U, u) to the spatial force F˜(U, u) per unit mass then leads to the spatial projection of the acceleration equation ˜(G) ˜ D(tem)(U, u)˜p(U, u)/dτ(U,u) = F (U, u) + F (U, u) , (tem) (2.248) tem = fw,cfw,lie . This defines the ordinary and co-rotating and the Lie spatial gravitational forces per unit mass, related to each other in the same way as the corresponding total spatial covariant derivatives ˜ ˜ F (cfw)(U, u) = F (fw)(U, u) + ω(u) p˜(U, u) (2.249) ˜ = F (lie)(U, u) + θ(u) p˜(U, u) . Since the Lie total spatial covariant derivative does not commute with index lowering, it is convenient to introduce also the Lie covariant spatial gravitational force
[ ] ˜(G) ˜ [D(tem)(U, u)˜p(U, u) /dτ(U,u)] = F(lie[)(U, u) + F (U, u) . (2.250) The four distinct spatial gravitational forces have the explicit form ˜(G) F(fw)(U, u) = γ[−a(u) + ν(U, u) ×u ~ω(u) − θ(u) ν(U, u)] , ˜(G) F(cfw)(U, u) = γ[−a(u) + 2ν(U, u) ×u ~ω(u) − θ(u) ν(U, u)] , (2.251) ˜(G) F(lie)(U, u) = γ[−a(u) + 2ν(U, u) ×u ~ω(u) − 2θ(u) ν(U, u)] , ˜(G) F(lie[)(U, u) = γ[−a(u) + 2ν(U, u) ×u ~ω(u)] . Similarly the temporal projection of the acceleration equation becomes the equation specifying the rate of change of energy ˜ ˜(G) ˜ dE(U, u)/dτ(U,u) = ν(U, u) ·u [F (U, u) + F (U, u)] (fw) (2.252) [ = −γ[a(u) ·u ν(U, u) − θ(u) (ν(U, u), ν(U, u))] + ℘(U, u) . This can be written in terms of all the variously defined spatial gravitational forces as ˜ ˜(G) ˜ dE(U, u)/dτ(U,u) = ν(U, u) ·u [F (tem)(U, u) + F (U, u)] [ (2.253) + (tem)γθ(u) (ν(U, u), ν(U, u)) ,
(tem) ≡ (0, 0, 1, −1) , tem = fw,cfw,lie,lie[ , The relative acceleration defined in section 2.2.9 can be expressed in terms of the spatial projection of the acceleration using equations (2.175) and (2.247)
−1 −1 ˜(G) a(tem)(U, u) = γ Pu(U, u) [F (U, u) + A(U, u)] (tem) (2.254) 1 2 [ − 2 γ D(tem)(U, u)P (u) /dτ(U,u)(ν(U, u), ν(U, u)) ν(U, u) .
56 Using the acceleration equation this becomes
a(tem)(U, u) ≡ D(tem)(U, u)ν(U, u)/dτ(U,u) −1 −1 ˜(G) = γ Pu(U, u) [F (tem)(U, u) + F (U, u)] (2.255) 1 2 [ − 2 γ D(tem)(U, u)P (u) /dτ(U,u)(ν(U, u), ν(U, u)) ν(U, u) . The derivative of the spatial metric is nonzero only in the Lie case where it reduces to an expansion tensor term. However, this term in the relative acceleration is along the relative velocity and so does not contribute to the part orthogonal to the relative motion
−1 ˜(G) ˜ a(tem)(U, u) ×u ν(U, u) = γ [F (tem)(U, u) + F (U, u)] ×u ν(U, u) . (2.256)
2.4.2 Analogy with electromagnetism: gravitoelectromagnetism Comparison of the spatial projection of the acceleration equation with the Lorentz force per unit mass and charge exerted by an electromagnetic field expressed in terms of the electric and magnetic fields leads to analogous electric and magnetic spatial gravitational forces. To appreciate the analogy, for which the term “gravitoelectromagnetism” seems an appropriate description, one must split the Lorentz force associated with the electromagnetic 2-form (4)F [. The prefix (4) will subsequently be used on the electromagnetic field to avoid confusion with the kernel letter for spatial forces. The Lorentz 4-force on a charged particle of charge q, mass m, and 4-velocity U is
(EM) α (4) α β (4) α β (4) α β [f (U)] = q F βU = qγ[ F βu + F βν(U, u) ] , (2.257) f (EM)(U) = q (4)F U = q γ[(4)F u + (4)F ν(U, u)] .
This splits into the spatial force and power
(EM) F (U, u) = q[E(u) + ν(U, u) ×u B(u)] , (2.258) (EM) ℘ (U, u) = qE(u) ·u ν(U, u) , with similar equations for the per unit mass quantities. These expressions may be compared with the spatial gravitational forces which appear in the splitting of the acceleration in terms of either the ordinary or co-rotating Fermi-Walker or Lie total spatial covariant derivative of the covariant spatial momentum. In each case the spatial gravitational force and power have the following form (G) F (U, u) = mγ[~g(u) + H(tem)(u) ν(U, u)] , (tem) (2.259) (G) ℘ (U, u) = mγ[~g(u) ·u ν(U, u) − θ(u)(ν(U, u), ν(U, u))] . Apart from an additional overall gamma factor not present in the electromagnetic case, one has both electric- like and magnetic-like spatial gravitational forces which Thorne [Thorne et al 1986] has called the gravito- electric and gravitomagnetic forces respectively in slightly different contexts. The same terminology will be extended to each of the various points of view and choices of temporal derivatives. Independent of the choice of time derivative one has a unique gravitoelectric vector field
~g(u) = −a(u) (2.260)
due to the acceleration of the observer congruence. The gravitomagnetic tensor field depends on the choice of time derivative and in the Lie case the contravariant or covariant form of the momentum equation
H(fw)(u) = ω(u) − θ(u) = k(u) , H (u) = 2ω(u) − θ(u) , (cfw) (2.261) H(lie])(u) = 2ω(u) − 2θ(u) = 2k(u) ,
H(lie[)(u) = 2ω(u) .
57 The antisymmetric part of the gravitomagnetic tensor, apart from a factor of two, defines a unique gravito- magnetic vector field H~ (u) = 2~ω(u) (2.262) which is just twice the vorticity vector of the observer congruence. Using these definitions the spatial gravitational forces per unit mass may be written
˜(G) 1 ~ F(fw)(U, u) = γ[~g(u) + 2 ν(U, u) ×u H(u) − θ(u) ν ] , ˜(G) ~ F(cfw)(U, u) = γ[~g(u) + ν(U, u) ×u H(u) − θ(u) ν(U, u)] , (2.263) ˜(G) ~ F(lie)(U, u) = γ[~g(u) + ν(U, u) ×u H(u) − 2θ(u) ν(U, u)] , ˜(G) ~ F(lie[)(U, u) = γ[~g(u) + ν(U, u) ×u H(u)] .
The gravitomagnetic symmetric tensor field SYM H(tem)(u), which has no analog in electromagnetism, arises from the temporal derivative of the spatial metric, which is the new ingredient in gravitoelectromag- netism that makes it fundamentally different from electromagnetism. The spatial derivatives of the spatial metric also enter the total spatial covariant derivative of the spatial momentum as a “space curvature” force term when expressed in terms of an observer-adapted frame
a D(tem)(U, u)˜p(U, u) /dτ(U,u) a a b = dp˜(U, u) /dτ(U,u) + C(tem)(u) bp˜(U, u) (2.264) a c b + Γ(u) bcp˜(U, u) ν(U, u) .
This “space curvature” force term is quadratic in the spatial velocity. One can thus think of the spatial metric as a potential for these two different spatial forces, both associated with the spatial geometry, namely a the relative distances and directions of nearby observers in the observer congruence. The matrix C(tem)(u) b given by equation (2.196) depends on how the spatial frame is transported along the congruence and may be conveniently be chosen to vanish for one of the three temporal derivatives. To handle the case of a lightlike test particle with zero rest mass, one must work with the null 4-momentum P instead of a timelike 4-velocity U. The spatial gravitational force and power then take the form
(G) F (P, u) = E(P, u)[~g(u) + H(tem)(u) ν(P, u)] , (tem) (2.265) (G) [ ℘ (P, u) = E(P, u)[~g(u) ·u ν(P, u) + SYM H(tem)(u) (ν(U, u), ν(U, u)) ,
and measurement of the null geodesic equation yields
(G) D(tem)(P, u)p(P, u)/dτ(P,u) = F (tem)(P, u) , (G) (2.266) dE(P, u)/dτ(P,u) = ℘ (P, u) , tem = fw,cfw,lie,lie[ .
The gravitoelectric force roughly corresponds to the centrifugal force experienced in uniformly rotating coordinates in flat spacetime, while the gravitomagnetic force roughly corresponds to the Coriolis force, but the details of the correspondence will be discussed below in the context of a nonlinear reference frame.
2.4.3 Maxwell-like equations The analogy between the gravitoelectromagnetic vector fields and the electromagnetic ones shows that the exterior derivative of the observer velocity 1-form corresponds to the electromagnetic 2-form, which is itself locally the exterior derivative of a 4-potential 1-form
d(4)A = u[ ∧ E(u)[ + ∗(u)B(u)[ = (4)F [ , (2.267) du[ = u[ ∧ ~g(u)[ + ∗(u)H~ (u)[ .
58 The observer 4-velocity thus acts as the 4-potential for the gravitoelectromagnetic vector fields. Similarly in direct analogy with the Lorentz force, the right contraction of the 4-velocity U of a test particle with the mixed form of the 2-form du[ also produces the part of the spatial gravitational force per unit mass ˜(G) F(tem)(U, u) which involves only the gravitoelectromagnetic vector fields for tem=cfw,lie,lie[, although not for 1 the Fermi-Walker case which involves an additional factor of 2 . The splitting of the identity d(4)F [ = d2(4)A[ = 0 leads to half of Maxwell’s equations
∗(u) (4) [ (M) [d F ] = divu B(u) + H~ (u) ·u E(u) = 0 ,
∗(u) (4) [ (E)] [d F ] = − curlu E(u) + ~g(u) ×u E(u) (2.268) − [£(u)u + Θ(u)]B(u) = 0 ,
which follow from equation (2.114) with S = (4)F [ and making use of the identity (2.100) with S = B(u). Replacing (4)A by (4)u reduces these to the corresponding gravitoelectromagnetic equations
[divu +~g(u)·u]H~ (u) = 0 , (2.269) ~ curlu ~g(u) + [£(u)u + Θ(u)]H(u) = 0 , which are just equations (2.130) and (2.131) rewritten in terms of the gravitoelectromagnetic vector fields. Splitting the remaining half of Maxwell’s equations
∗d ∗(4)F = 4π(4)J (2.270) using equation (2.114) with S = ∗(4)F [ and the identities (2.24), which become
[ ∗(4)F [](E)(u) = −B(u)[ , [ ∗(4)F [](M)(u) = ∗(u)E(u)[ (2.271) when rewritten in terms of the electric and magnetic fields, leads to
∗(u) ∗(4) [ (M)] [d F ] = divu E(u) − H~ (u) ·u B(u) = 4πρ(u) ,
∗(u) ∗(4) [ (E)] [d F ] = curlu B(u) − ~g(u) ×u B(u) (2.272) − [£(u)u + Θ(u)]E~ (u) = 4πJ(u) , where (4)J = ρ(u)u + J(u) is the splitting of the 4-current. The remaining Maxwell-like equations for the gravitoelectromagnetic vector fields arise from the Einstein equations. In order to state them one must first introduce appropriate spatial curvature tensors associated with the spatial part of the spatial connection of the observer congruence u, and then split the spacetime curvature. This will be done below. Forward [1961] systematically developed the analogy between general relativity and electromagnetism in the post-Newtonian limit.
59 2.4.4 Splitting the spin transport equation The kinematical tensor k(u) has already been seen to relate spatial Fermi-Walker transport along the observer congruence to the spatial Lie transport. When applied to a spatial vector S this may be interpreted as the equation describing the evolution of the spin of a torque-free gyro carried by an observer. The following are equivalent ways of stating this transport
∇(fw)(u)S = 0 ,
∇(cfw)(u)S = ω(u) S
= −~ω(u) ×u S, (2.273)
∇(lie)(u)S = k(u) S
= −~ω(u) ×u S − θ(u) S.
The spin vector must rotate with the angular velocity of gravitomagnetic precession
1 ~ ζ(gm)(u) = −~ω(u) = − 2 H(u) (2.274) equal to minus the vorticity relative to a spatially Lie dragged spatial frame or a co-rotating Fermi-Walker propagated spatial frame in order to undo the local rotation of such a frame. It must also change in a way that undoes the expansion and shearing of the spatially Lie dragged frame vectors relative to a Fermi-Walker or co-rotating Fermi-Walker propagated spatial frame. One can also split the equation describing the transport of the spin vector of a gyro along an arbitrary worldline with 4-velocity U. Since the spin vector is spatial with respect to U, it has the form
S = [ν(U, u) ·u S~]u + S,~ (2.275) where is the spatial projection S~ = P (u, U)S of the spin. Since Fermi-Walker transport does not change the magnitude ||S|| of the 4-vector, the magnitude of the spatial projection of the spin will change as the inner product ν(U, u) ·u S~ changes 2 2 2 ||S~|| = ||S|| + [ν(U, u) ·u S~] , (2.276) leading to a change in the observed frequency of rotation associated with the spin vector. Since the temporal component of the spin is determined by its spatial projection, one only needs an evolution equation for the spatial projection, which is provided by the spatial projection of the Fermi-Walker transport equation. Because of the spatial condition on S relative to U, the Fermi-Walker transport condition reduces to (4) (4) β 0 = ∇(fw)(U)S = ∇(fw)(U)S = ∇U S − a(U)βS U (2.277) and its spatial projection is
−1 0 = γ P (u)∇(fw)(U)S −1 (4) β = γ P (u)[ ∇U S − a(U)βS U] (2.278) ~ ~ β = D(fw)(U, u)[(ν(U, u) ·u S)u + S]/dτ(U,u) − a(U)βS ν(U, u) ~ ~ β = (ν(U, u) ·u S)D(fw)(U, u)u/dτ(U,u) + D(fw)(U, u)S/dτ(U,u) − a(U)βS ν(U, u) .
Solving this for the Fermi-Walker total spatial covariant derivative and observing that
β a(u)βS = −γ[ν(U, u) ·u A(U, u)][ν(U, u) ·u S~] + γA(U, u) ·u S~ (2.279) [ −1 = γA(U, u) Pu(U, u) S~
using equation (2.57), one gets the result
~ −1 ~ ˜(G) [ −1 ~ D(fw)(U, u)S/dτ(U,u) = γ [ν(U, u) ·u S]F (fw)(U, u) − γ[A(U, u) Pu(U, u) S]ν(U, u) , (2.280)
60 which is easily re-espressed in terms of the co-rotating Fermi-Walker derivative
~ ~ −1 ~ ˜(G) D(cfw)(U, u)S/dτ(U,u) = −~ω(u) ×u S + γ [ν(U, u) ·u S]F (U, u) (fw) (2.281) [ −1 − γ[A(U, u) Pu(U, u) S~]ν(U, u) .
These equations may be decomposed into separate evolution equations for the length ||S~|| and direction Sˆ = ||S~||−1S~ of the spatial projection of the spin vector. The results are
~ −1 ˆ ˜(G) ˆ D(U, u) ln ||S||/dτ(U,u) = γ [ν(U, u) ·u S][F (U, u) ·u S] (fw) (2.282) [ −1 − γ[A(U, u) Pu(U, u) S~][ν(U, u) ·u Sˆ] ,
and ˆ ˆ ˆ D(tem)(U, u)S/dτ(U,u) = Ω(tem)(S, U, u) ×u S, (2.283) where ˆ −1 ˆ ˜(G) ˆ Ω(fw)(S, U, u) = −γ [ν(U, u) ·u S] F (fw)(U, u) ×u S [ −1 + γ[A(U, u) Pu(U, u) Sˆ] ν(U, u) ×u S,ˆ (2.284) ˆ ˆ Ω(cfw)(S, U, u) = Ω(fw)(S, U, u) − ~ω(u) . These formulas describe the precession of the spin vector as seen by the family of different observers of the observer congruence along the gyro’s worldline. Note that the angular velocity of the direction of the spatial projection of the spin vector depends on the direction of that spin vector. These formulas describe how the spin vector S changes as seen by the 1-parameter family of observers belonging to the observer congruence which intersect the worldline of the gyro. The observed spin vector S~ = P (u)S does not undergo a rotation with respect to the Fermi-Walker propagated axes of the observer but a more complicated motion required to keep S orthogonal to U. This is superimposed upon the gravitomagnetic precession when compared to co-rotating Fermi-Walker propagated axes, and complicated by the expansion and shear effects when compared to spatial Lie dragged axes.
61 2.4.5 Relative Fermi-Walker transport and gyro precession Another question one can consider is how a single observer with 4-velocity U following the gyro worldline would see the orientation of the gyro change with respect to a spatially co-rotating observer-adapted frame anchored in the given observer congruence u. The observer following the gyro will see these axes to be in relative motion and not even appear orthonormal. However, a basic assumption of special relativity implicit when one discusses pure Lorentz transformations in Minkowski space is that the orientation of a set of orthonormal axes in relative motion is defined to be the orientation they would have if they were not in relative motion. In other words one “brings them to rest” by the inverse relative observer boost in order to define their orientation with respect to a given orthonormal spatial frame in the local rest space of an observer. Suppose {ea} is a spatially co-rotating observer-adapted spatial frame along the observer congruence u. Then B(lrs)(U, u)ea are the axes “momentarily at rest” with respect to the gyro, and the orientation of the spin vector S with respect to them is well-defined and represents the orientation of S with respect to the moving axes ea. However, since the boost is an isometry between the two local rest spaces, the orientation of S with respect to B(U, u)ea is the same as the orientation with respect to ea of the boosted spin vector
S ≡ B(u, U)S (2.285)
in the local rest space of the observer congruence. Again since the boost is an isometry, the lengths of S and S are the same, and since the latter is constant under Fermi-Walker transport, so is the former. In other words S will rotate with respect to the orthonormal spatial frame ea, and its angular velocity with respect to the sequence of observers from the observer congruence along the gyro worldline will be the same as the angular velocity of S with respect to the boosted spatial frame B(U, u)ea, apart from a proper time renormalization. In order to calculate these angular velocities, one must consider the evolution of S along the gyro worldline. This boosted spin vector has the explicit expression
−1 −1 S = S~ − γ (γ + 1) [˜p(U, u) ·u S~]˜p(U, u) , (2.286) which is obtained by rewriting equation (2.55) in terms of the spatial momentum per unit mass. Since the Fermi-Walker total spatial covariant derivatives of S~,p ˜(U, u), and γ = E˜(U, u) are given by equations (2.280), (2.248), and (2.252), it is straightforward to evaluate the corresponding derivative of S. After some algebra one finds a result which may be expressed in the following equivalent forms
D (U, u)S/dτ = ζ (U, u) ×u S , (tem) (U,u) (tem) (2.287) tem=fw,cfw , where the Fermi-Walker and co-rotating Fermi-Walker “relative angular velocities” are related to each other by by the gravitomagnetic precession
ζ(cfw)(U, u) = −~ω(u) + ζ(fw)(U, u) 1 ~ (2.288) = − 2 H(u) + ζ(fw)(U, u) = ζ(gm)(u) + ζ(fw)(U, u) and the former is defined by
−1 ˜ −1 ˜(G) ζ(fw)(U, u) = −γ(γ + 1) ν(U, u) ×u F (U, u) + (γ + 1) ν(U, u) ×u F (U, u) (fw) (2.289) = ζ(thom)(U, u) + ζ(geo)(U, u) .
This may also be expressed in terms of the relative acceleration
2 −1 ζ(fw)(U, u) = −γ (γ + 1) ν(U, u) ×u a(cfw)(U, u) (2.290) ˜(G) −1 + ν(U, u) ×u [F(cfw)(U, u) − γ(γ + 1) ν(U, u) ×u ~ω(u)] ,
62 a formula first obtained by Massa and Zordan [1975]. These angular velocities, in contrast with the result ˆ for Ω(cfw)(S, U, u), depend only on the relative boost between the local rest spaces. For a gyro at rest relative to the observer congruence, only the gravitomagnetic precession term is nonzero. This is also known as the frame-dragging precession or the Lense-Thirring precession, since the effect was first studied by Lense and Thirring [1918] in the context of weak fields and slow motion. The Thomas precession term
−1 ˜ ζ(thom)(U, u) = −γ(γ + 1) ν(U, u) ×u F (U, u) (2.291)
in the relative angular velocity is due to the applied force acting on the gyro. In Minkowski spacetime choosing u to be a unit timelike rotation-free Killing vector field corresponding to a time translation, only this term is nonzero and has the equivalent expression
2 −1 ζ(thom)(U, u) = −γ (γ + 1) ν(U, u) ×u a(cfw)(U, u) −2 (2.292) = −||ν(U, u)|| (γ − 1)ν(U, u) ×u a(cfw)(U, u)
in terms of the relative acceleration
−1 −1 ˜ a(cfw)(U, u) = a(fw)(U, u) = a(lie)(U, u) = γ Pu(U, u) F (U, u) (2.293)
which in this case represents the usual spatial acceleration of special relativity. The acceleration of the gyro in Minkowski space causes it to precess as discussed by Thomas [1927]. In the limit ||ν(U, u)|| 1 and γ → 1 of nonrelativistic relative motion, the Thomas precession under these conditions reduces to
1 1 ˜ ζ(thom)(U, u) → − 2 ν(U, u) ×u a(cfw)(U, u) = − 2 ν(U, u) ×u F (U, u) . (2.294)
For circular motion with angular velocity Ω,~ the precession angular velocity is [γ − 1]Ω,~ as described in exercise 6.9 of Misner, Thorne and Wheeler [1973]. The first term in their equation (6.28) is exactly the boosted spin vector S. The second term in the Fermi-Walker angular velocity
−1 ˜(G) ζ(geo)(U, u) = (γ + 1) ν(U, u) ×u F (U, u) (fw) (2.295) −1 1 ~ = γ(γ + 1) ν(U, u) ×u [~g(u) + 2 ν(U, u) ×u H(u) − θ(u) ν(U, u)] is due to the spatial gravitational force alone, which is present even in the absence of an applied force, i.e., for geodesic motion of the gyro. For this reason it is often called the geodetic precession, in addition to the commonly used alternatives de Sitter precession or Fokker precession or even Fokker-de Sitter precession, named after the original investigators of this phenomenon [de Sitter 1916, Fokker 1920, Pirani 1956]. In the nonrelativistic limit γ → 1, neglecting terms of second order in the velocity, this term has the limiting expression 1 ζ(so)(U, u) = 2 ν(U, u) ×u ~g(u) . (2.296) Thorne [1989] describes this nonrelativistic term as an “induced gravitomagnetic precession” or “spin-orbit” precession since it corresponds to the gravitomagnetic precession due to an additional “induced” gravito- ~ magnetic field H(u)(ind) = −ν(U, u) ×u ~g(u) induced by the motion of the gyro in the gravitoelectric field in analogy with the induced magnetic field due to motion in an electric field. These relative angular velocities must still be interpreted in terms of the original discussion of the angular velocity of the relative orientation of the gyro spin and the spatially co-rotating observer-adapted spatial frame. These angular velocities do not depend on the choice of spatial frame but only involve the relative observer boost between the gyro and the observer congruence. They describe the angular velocities of the boosted spin vector S with respect to a spatial frame which is transported along the gyro worldline by the corresponding spatial transport. This transport seeks to eliminate the arbitrariness in the spatial distribution of the orientation of any given spatially co-rotating observer-adapted spatial frame, at least along the given gyro worldline.
63 However, if the gyro returns to a given observer worldline belonging to the observer congruence, the spatial frame transported along its worldline will in general differ from the value of the spatially co-rotating observer-adapted spatial frame there. This is a manifestation of “space curvature” which occurs as long as the spatial metric is not flat. If one is really interested in measuring the relative orientation of S with respect to a particular spatially co-rotating observer-adapted orthonormal spatial frame, one must remove the additional angular velocity of the co-rotating Fermi-Walker transported orthonormal spatial frame with respect to it, leading to a frame-dependent result for the ordinary derivative of the components of S with respect to the former frame
a b c dS /dτ(U,u) = abc[ζ(cfw)(U, u) + ζ(sc)(U, u, e)] S . (2.297)
Thorne [Thorne et al 1986] has referred to the relative angular velocity ζ(sc)(U, u) in the precession formula as the space curvature term. Of course now that ordinary derivatives are being used one has to choose the observer-adapted frame carefully in order that this make some physical sense. Finally, normalizing the proper time to correspond to that of the gyro instead of that of the observer congruence, the angular velocity of the given spatially co-rotating observer-adapted orthonormal spatial frame {ea} as seen by the gyro has components
a a a ζ(gyro)(U, u, e) = γ[ζ(cfw)(U, u) + ζ(sc)(U, u, e) ] (2.298)
a with respect to the boosted frame B(lrs)(U, u)ea. In other words, if S are the components of the spin vector with respect to this boosted frame, then
a b c dS /dτU = abcζ(gyro)(U, u, e) S . (2.299)
The precession formula will have a physical meaning in a given spacetime first only if the observer congru- ence itself has some physical meaning and second if the spatially co-rotating observer-adapted orthonormal spatial frame has some physical meaning. The obvious candidate satisfying the first condition is an observer congruence associated with a timelike Killing vector field in a stationary spacetime. Since the shear of such an observer congruence vanishes, the co-rotating Fermi-Walker transport along u reduces to spatial Lie transport, and so the spatially co-rotating observer-adapted orthonormal spatial frame is also a stationary frame. This spatial frame should also be as “Cartesian-like” as possible, i.e., the distribution of its orientation should be as aligned as possible, in order that the angular velocity have some meaning at each point along the gyro worldline. For spacetimes which are asymptotically flat at spacelike infinity, Nester [1991a, 1991b] has shown that a preferred spatial orthonormal frame exists on a given asymptotically flat spacelike hypersurface which asymptotically approaches a given inertial frame at spacelike infinity and in some sense has “Cartesian- like properties.” Given a preferred slicing by spacelike hypersurfaces, one could then boost these spatial frames into the local rest space of a preferred observer congruence to define a preferred class of observer- adapted orthonormal frames which might be used in interpreting the angular velocity. Stationary axially symmetric spacetimes have a preferred slicing orthogonal to the zero-angular-momentum observers, and so must possess such preferred frames. Whether or not they are compatible with the spatially-comoving condition is not clear. Alternatively, one can consider worldlines which return to a given observer worldline and attempt to calculate the total rotation of the spin between the initial and final intersection points (corresponding to a closed loop in the space of observers) to avoid this problem. This is the case for circular orbits often considered in gyro precession calculations, where the net rotation per revolution is of interest.
64 2.4.6 The Schiff Precession Formula The classic spin precession formula of Schiff [1960] describes how the spin vector precesses relative to the “distant stars” as seen by an observer carrying the gyro in the gravitational field of an isolated body [Weinberg 1972, Misner, Thorne and Wheeler 1973]. This is evaluated locally by using a spatial frame which is somehow anchored in the asymptotically flat part of the spacetime around such an isolated body. The discussion is cleaner for a stationary spacetime of this type with a stationary frame adapted to the asymptotically nonrotating stationary observers, the so-called static observers. Since spatial Lie transport along u coincides with co-rotating Fermi-Walker transport, the spatial projection of any frame which is comoving with respect to u will yield a spatial frame which is spatially comoving and which undergoes co-rotating Fermi-Walker transport along u. Call this a static spatial frame. Let {u, ea} be a spatially- comoving observer-adapted frame, whose spatial frame is therefore static. One can choose this frame to be orthonormal. For a black hole spacetime in Boyer-Lindquist coordinates, for example, the spatial coordinate differentials are orthogonal to each other and to the asymptotically-nonrotating timelike Killing vector field, and so one may normalize them to obtain such a frame. Light rays from “distant stars” will arrive at a given static observer from fixed directions relative to the static spatial frame. In this way the stationary symmetry anchors the static observer frame to the asymptotically flat part of the spacetime. It can then be used for a local comparison of the spin vector of a gyroscope with the “distant stars.” How can the observer carrying the gyro make this comparison? The static observer’s spatial frame is moving with respect to the gyro. The boost Ea = B(U, u)ea of the static observer’s spatial frame to the rest frame of the gyro is the spatial frame the gyro would see if the static observer were not in relative motion. The angular velocity ζ(gyro)(U, u, e) evaluated above describes the rotation of the spin vector with respect this boosted frame. This angular velocity depends crucially on the spatial distribution of the orientation of the static spatial frame {ea}, i.e., on how the orientation depends on the location of the observer in the observer congruence. If one has a stationary axially symmetric spacetime with an axially symmetric static spatial frame like the one described above for a black hole arising from the normalized Boyer-Linquist spatial coordinate differentials, then the static frame will itself rotate through an angle of 2π for each loop around a circular orbit of the axial symmetry subgroup. Clearly one does not want to measure the gyro precession with respect to such a frame, but rather one with Cartesianlike properties, if such a frame exists. The actual Schiff formula corresponds to the value of the angular velocity ζ(gyro)(U, u, e) in the slow motion weak field limit of general relativity for the linearized gravitational field of an isolated distribution of matter. It has been generalized to the PPN theory as discussed by Misner, Thorne and Wheeler [1973] or Thorne [1989]. The key difference between it and the limiting expression for ζ(cfw)(U, u)
1 ~ 1 ˜ 1 ζ(cfw)(U, u) → − 2 H(u) − 2 ν(U, u) ×u F (U, u) + 2 ν(U, u) ×u ~g(u) (2.300) is the fact that in this limit within general relativity, the space curvature precession has twice the value of 1 3 the spin orbit precession ζ(so)(U, u) → 2 ν(U, u) ×u ~g(u) leading to a total coefficient of 2 . The sum of these two terms is conventionally referred to as the geodetic precession. This will be discussed below in chapter ?? on the post-Newtonian limit. This same result generalizes in a simple way to the parametrized post-Newtonian (PPN) context following the same notation as Misner, Thorne and Wheeler [1973]. The PPN spatial coordinates are orthogonal to lowest order and may be normalized to define an orthonormal spatial frame which may be completed to an orthonormal frame by the addition of the normalization of the time coordinate derivative vector field which is orthogonal to the spatial coordinate differentials to lowest order. This is exactly analogous to the black hole case described above. However, all of the linearized discussions of spin precession are somewhat clouded by the mixing of the linearization process itself with the features of the spin precession. Without the assumption of stationarity, the entire discussion no longer has a very solid foundation. In the PPN context, the PPN coordinate grid is anchored to the asymptotically flat part of the spacetime but one has an entire class of gauge transformations which allow the coordinate grid to change. The measurement of the precession of a torque-free gyro in a free-fall earth orbit is the goal of the long awaited Stanford Gravity-Probe B experiment [Everitt 1979]. Of course to get explicit formulas for the the
65 actual cumulative precession measured, one must consider the actual geometry of the earth-satellite system, as well as the aberration effects involved in the star-tracking telescope. It is this experiment, the proposed LAGEOS experiment [Ciufolini 1990], and others with similar goals [Nordvedt 1988, Mashhoon, Paik, and Will 1989] which have provided much of the motivation for talking about “gravitomagnetism.”
66 2.4.7 The relative angular velocity as a boost derivative
The formula for the co-rotating relative angular velocity ζ(cfw)(U, u) has been obtained in a two-step process, first evaluating the evolution of the spatial projection of the spin vector and then of the transformation which converts it to the boosted spin vector. This quantity may be obtained instead directly from the evolution of the relative boost between the local rest spaces along the worldline. Starting from the Fermi-Walker transport equation for the spin vector S
(4) D(U)S/dτU = σ(U ∧ a(U))S, (2.301) one can calculate the spatial projection of the total covariant derivative of the boosted spin vector S = B(lrs)(u, U)S (4) D(fw)(U, u)S/dτ(U,u) = P (u) D(U)S/dτu (4) = P (u) D(U)B(lrs)(u, U)/dτ(U,u) B(lrs)(U, u)S (2.302) (4) + P (u)B(lrs)(u, U) D(U)S/dτ(U,u) , suppressing the contraction notation of the linear transformations involved. The second term vanishes
P (u)B (u, U)(4)DS/dτ = γ(U, u)P (u)B (u, U)σ(U ∧ a(U))B (U, u)S (lrs) (U,u) (lrs) (lrs) (2.303) = γ(U, u)P (u)σ(u ∧ B(lrs)(u, U)a(U))S = 0 since the factor of u is killed by the spatial projection or by contraction with the spatial field S. The first term defines the boost angular velocity tensor
(4) ∗(u) W (U, u) = P (u)[ D(U)B(lrs)(u, U)/dτ(U,u) B(lrs)(U, u)] = − ζ(fw)(U, u) , (2.304) which is an antisymmetric spatial tensor. Its sign-reversed spatial dual therefore defines the relative angular velocity vector ζ(fw)(U, u). The antisymmetry of W (U, u) follows since this represents the spatial part of the covariant Lie algebra derivative of the 1-parameter family of Lorentz transformations. This leads to the result D(fw)(U, u)S/dτ(U,u) = ζ(fw)(U, u) ×u S (2.305) describing the precession of the boosted spin vector. The relative angular velocity differs from the co-rotating angular velocity by the vorticity of the observer congruence
ζ(cfw)(U, u) = ζ(fw)(U, u) − ~ω(u) , (2.306) i.e., by the gravitomagnetic precession.
67 2.4.8 Relative kinematics: transformation of spatial gravitational fields The spatial gravitational force fields are simply related to the kinematical quantities associated with the observer congruence. If one has two distinct observer congruences with unit tangents u and U, one can describe the transformation law between the spatial gravitational fields observed by each. One need only express the quantities and operators in the expression for the spatial gravitational fields of one in terms of those of the other to obtain such laws, as in the above derivation of the transformation law for the electric and magnetic fields. The acceleration and kinematical field transform as follows a(U) = γ2P (u, U)−1[a(u) − k(u) ν(U, u)] 2 + γ P (U, u)a(fw)(U, u) , (2.307) k(U) = γP (U, u)[k(u) − a(u) ⊗ ν(U, u)[] − γ∇(U)ν(U, u) . The expansion tensor and the rotation tensor and vector then transform as [ 2 [ 1 θ(U) = γ P (U, u)[θ(u) + 2 {a(u) ⊗ ν(U, u) + ν(U, u) ⊗ a(u)] , [ 2 [ 1 ω(U) = γ P (U, u)[ω(u) − 2 a(u) ∧ ν(U, u)] 1 [ (2.308) + 2 γd(U)ν(U, u) , 2 −1 1 ~ω(U) = γ P (u, U) [~ω(u) + 2 ν(U, u) ×u a(u)] 1 + 2 γ curlU ν(U, u) , Converting to the gravitoelectromagnetic symbols leads to 2 −1 H~ (U) = γ P (u, U) [H~ (u) − ν(U, u) ×u ~g(u)]
+ γ curlU ν(U, u) , 2 −1 1 ~ ~g(U) = γ P (u, U) [~g(u) + 2 ν(U, u) ×u H(u) 2 (2.309) − θ(u) ν(U, u)] − γ P (U, u)a(fw)(U, u) 2 −1 = γ P (u, U) [~g(u) + ν(U, u) ×u H~ (u) 2 − θ(u) ν(U, u)] − γ P (U, u)a(cfw)(U, u) , −1 ˜(G) where the expressions in square brackets in the gravitoelectric field transformation laws are just γ F (fw)(U, u) −1 ˜(G) and γ F (cfw)(U, u) respectively, analogous to the Lorentz force and its magnetic analog which appear in the transformation law for the electric and magnetic fields. The terms explicitly involving the gravitoelectro- magnetic vector fields in the transformation law for the gravitomagnetic vector field and in the second form of the one for the gravitoelectric vector field are exactly analogous to the corresponding transformation laws for the magnetic and electric fields, apart from the extra gamma factor also present in the force law itself. Apart from the expansion term, the remaining part of the transformation law which breaks this corre- spondence, namely the relative acceleration and the relative velocity curl, can be further expanded. The relative acceleration, for example, can be re-expressed using equation (2.255). For the relative curl, one can apply the following useful formula −1 curlU X = γP (u, U) {curlu X (2.310) [ ] + ν(U, u) ×u [£(u)uX ] } , valid when X is spatial with respect to u. For an observer-adapted co-rotating Fermi-Walker orthonormal frame, the gravitoelectric and gravito- magnetic fields are related to the 2-form which results from evaluating the tensor-valued connection 1-form on u in a way similar to the way the electric and magnetic fields are related to the electromagnetic 2-form [jancar91]. The homogeneous part of the transformation law for the connection then leads to the terms in the transformation law for the gravitoelectric and gravitomagnetic vector fields which are analogous to those for the electric and magnetic fields.
68 SectionThe hypersurface point of view The zero vorticity case ω(u) = 0 of the congruence point of view essentially defines the hypersurface point of view, described in this way by Zel’manov [1973], Ehlers [1961, 1993] and Ellis [1971]. In this special case one has an integrable family of local rest spaces LRSu which define a spacelike slicing of spacetime with u equal to the unit normal n for this slicing. The spatial metric P (n)[ when restricted to a slice equals the induced Riemannian metric of the slice, while the the kinematical tensor k(n)[ = −θ(n)[ restricts to the extrinsic curvature tensor K[ of the slice. There is a natural isomorphism between spatial tensor fields on a given slice and the tensor algebra of the slice as a manifold itself, under which the spatial covariant derivative operator for spatial tensor fields corresponds to the covariant derivative of the induced metric connection. Because the vorticity vanishes, the covariant normal admits a family of integrating factors, so the accel- eration or equivalently the gravitoelectric field admits a family of potentials
[ n = −Ndt , a(n) = gradn ln N = −~g(n) . (2.311)
Thus − ln N serves as an “acceleration potential” [Ehlers 1961, 1993] while ln N serves as the gravitoelectric potential. The integrating factor N is only defined to within a factor whose spatial derivative vanishes, corresponding to a reparametrization of the slicing time function t. Since the vorticity vanishes, the gravit- omagnetic vector field is zero. Of course given explicitly the congruence on spacetime in terms of some local coordinates, one would have to integrate partial differential equations (for a time function) to obtain the slices, and conversely given a slicing in terms of local coordinates one would have to integrate ordinary differential equations (for the orthogonal trajectories) in order to obtain the normal congruence. This means that in practice one might have to distinguish the hypersurface and corresponding congruence points of view, but as far as the spacetime geometry of the splitting is concerned, they are the same. In the hypersurface point of view, it is natural to interchange the symbols > and ⊥. Since the slicing is considered primary rather than the implicit normal congruence, it is logical to let ⊥ stand for perpendicular to the slicing and > for tangential to the slicing, compared to the symbols > and ⊥ for tangential and perpendicular to the congruence in the corresponding congruence point of view adapted to the normal 1 congruence. Thus the decomposition of a spacetime 1 -tensor field S corresponding to the ordered family of spatial fields listed above for the equivalent normal congruence point of view becomes instead
0 a 0 a S ↔ {S 0,S 0,S a,S b} ⊥ a ⊥ a ↔ {S ⊥,S ⊥,S a,S b} , (2.312) ⊥ > ⊥ > ↔ {S ⊥,S ⊥,S >,S >} in the dual hypersurface point of view. The commonly used single vertical bar notation for the spatial covariant derivative index notation will also be adopted in place of the double vertical bar
α... α... α... [∇(n)S] β... = ∇(n)γ S β... = S β...|γ . (2.313)
69 2.5 Spatial curvature and torsion 2.5.1 Definitions Having explored the spatial and spacetime connections, the next step is to study the curvatures associated with both. The spatial versions of the usual invariant formulas defining the torsion and Riemann curvature tensors of a connection may be used to define the spatial torsion and curvature tensors associated with the spatial part of the spatial connection. Expressing them in terms of an observer-adapted frame then gives the component definitions. The spatial torsion definition gives no surprises, vanishing under the assumption that the spacetime torsion is zero
∇(u)XY − ∇(u)Y X − [X,Y ](u) = –T (u)(X,Y ) = 0 , a a a (2.314) –T (u) bc = 2Γ(u) [bc] − C bc = 0 ,
where X and Y are spatial vector fields. However, the rotation of the congruence leads to an additional temporal derivative term in the spatial curvature formula which arises from the temporal part of the commutator of two spatial vector fields. Three different curvature tensors then result from the choice of the Lie, Fermi-Walker or co-rotating Fermi-Walker temporal derivative to express the additional term
[ [∇(u)X, ∇(u)Y ] − ∇(u)[X,Y ] Z = R(tem)(u)(X,Y )Z + 2ω(u) (X,Y )∇(tem)(u)Z, (2.315) tem = lie,fw,cfw ,
where X, Y , and Z are spatial vector fields. These three tensors, the Lie spatial curvature tensor and the Fermi-Walker spatial curvature tensor and the co-rotating Fermi-Walker spatial curvature tensor differ by the same kinematical terms as the temporal derivatives themselves but reversed in sign
[ R(fw)(u)(X,Y )Z = R(lie)(u)(X,Y )Z + 2ω(u) (X,Y )k(u) Z, (2.316) [ R(cfw)(u)(X,Y )Z = R(lie)(u)(X,Y )Z − 2ω(u) (X,Y )θ(u) Z,
Expressing the defining identity in an observer-adapted frame leads to the formula
a a a a f R(tem)(u) bcd = Γ(u) db,c − Γ(u) cb,d − Γ(u) fbC cd (2.317) a f a f a + Γ(u) cf Γ(u) db − Γ(u) df Γ(u) cb − 2C(tem)(u) bω(u)cd .
The three curvature tensors are related to each other by
a a a R(fw)(u) bcd = R(lie)(u) bcd + 2k(u) bω(u)cd , a a a (2.318) R(cfw)(u) bcd = R(lie)(u) bcd − 2θ(u) bω(u)cd .
These are clearly antisymmetric in their final pair of indices, but the remaining symmetries characterizing an ordinary curvature tensor must be checked. The formula for the components of these spatial curvature tensors in terms of the spatial connection components and the spatial structure functions differs from the usual one only in the last term. By choosing an observer-adapted frame whose spatial frame undergoes the corresponding transport, this term is set to zero. For the Lie spatial curvature tensor this term is absent in a spatially-comoving spatial frame, and hence in the discussions of Zel’manov and Ferrarese who rely on such frames.
2.5.2 Algebraic symmetries An alternative way to obtain these definitions which reveals the symmetry properties of the first pair of indices relies on the spatial Ricci identity for a spatial vector field or 1-form, which for zero torsion have the
70 form α α [∇(u)γ , ∇(u)δ]Z = 2Z ||[δγ] α β α = R(tem)(u) βγδZ + 2ω(u)γδ∇(tem)(u)Z ,
[∇(u)γ , ∇(u)δ]Zα = 2Zα||[δγ] (2.319) β = −ZβR(tem)(u) αγδ + 2ω(u)γδ∇(tem)(u)Zα , tem = lie,fw,cfw . Since the spatial metric has zero ordinary or co-rotating Fermi-Walker temporal derivative, the two Ricci identities for a vector field and a 1-form are clearly related in a consistent way by index shifting as long as the corresponding curvature tensor has the usual antisymmetry property in the first pair of indices
(αβ) (αβ) R(fw)(u) γδ = R(cfw)(u) γδ = 0 . (2.320)
This implies that the symmetric part of the Lie spatial curvature tensor is not zero
(αβ) αβ R(lie)(u) γδ = 2θ(u) ω(u)γδ , (2.321) which is exactly the condition that shifting indices in these Ricci identities be consistent when expressed in terms of that curvature and the Lie derivative term. The commutator of the Lie derivative with index shifting generates the expansion tensor term
β β β β £(u)u[P (u)αβZ ] − P (u)αβ£(u)uZ = [£(u)uP (u)αβ]Z = 2θ(u)αβZ . (2.322)
These equivalent conditions on the symmetric part of the curvature tensors are in turn equivalent to the spatial Ricci identity for the spatial metric itself, modified by the rotation term from the noncommutivity of the spatial derivatives
0 = [∇(u)γ , ∇(u)δ]P (u)αβ = −2R(lie)(u)(αβ)γδ + 2ω(u)γδ£(u)uP (u)αβ (2.323) = −2R(fw)(u)(αβ)γδ = −2R(cfw)(u)(αβ)γδ .
The relation between the curvature tensors is therefore equivalent to
R (u)αβγδ = R (u) + 2ω(u)αβω(u)γδ , (fw) (lie) [αβ]γδ (2.324) R(cfw)(u)αβγδ = R(lie)(u)[αβ]γδ .
The component formula for the Lie spatial curvature tensor is identical with the usual frame component formula for an actual Riemannian 3-manifold, at least in a spatially-comoving frame. Thus any algebraic identity which follows directly from the latter formula with no index shifting (which can lead to additional expansion tensor terms when derivatives are involved) continues to hold for the Lie spatial curvature tensor. The usual cyclic symmetry of the torsionfree Bianchi identity of the first kind therefore holds
α α α α 3R(lie)(u) [βγδ] = R(lie)(u) βγδ + R(lie)(u) γδβ + R(lie)(u) δβγ = 0 , (2.325) as one may directly verify by expressing it in terms of the component formula and noting that all terms cancel in pairs. This implies that the other spatial curvature tensors do not satisfy the usual form of the first Bianchi identity
α 1 α α α R(fw)(u) [βγδ] = 3 [R(fw)(u) βγδ + R(fw)(u) γδβ + R(fw)(u) δβγ ] α = 2k(u) [βω(u)γδ] , (2.326) α α R(cfw)(u) [βγδ] = −2θ(u) [βω(u)γδ] .
71 2.5.3 Symmetry-obeying spatial curvature One can isolate the kinematical terms in the spatial curvature which arise from the commutator of spatial derivatives by making the second derivative terms in the curvature formula explicit. In this observer-adapted frame component derivation one needs the following relations
0 = ∇(u)chab = ∂chab − Γ(u)acb − Γ(u)bca , ab ab a b b a (2.327) 0 = ∇(u)ch = ∂ch + Γ(u) c + Γ(u) c , reflecting the covariant constancy of the spatial metric and its inverse. Then one has e e R(lie)(u)abcd = hae[∂cΓ(u) db − ∂dΓ(u) cb] e e e + Γ(u)aceΓ(u) db − Γ(u)adeΓ(u) cb − C cdΓ(u)aeb − 2Ca>bω(u)cd ef ef e = [∂cΓ(u)adb − ∂dΓ(u)acb] + hae[∂ch Γ(u)fdb − ∂dh Γ(u) cb] + ...
= [∂cΓ(u)adb − ∂dΓ(u)acb] (2.328) e e − [Γ(u)ace + Γ(u)eca]Γ(u) db + [Γ(u)ade + Γ(u)eda]Γ(u) cb + ···
= [∂cΓ(u)adb − ∂dΓ(u)acb] e e e − Γ(u)ecaΓ(u) db + Γ(u)edaΓ(u) cb − C cdΓ(u)aeb − 2Ca>bω(u)cd . Now one can substitute for the covariant gammas using the derivative formulas, but the calculation is much a a easier in a spatially-holonomic (C bc = 0) spatially-comoving (C >b = 0) spatial frame 1 R(lie)(u)abcd = 2 [hab,dc − hbd,ac + hda,bc − hab,cd + hbc,ad − hca,bd] e e − Γ(u) caΓ(u)edb + Γ(u) daΓ(u)ecb 1 1 (2.329) = 2 [∂c, ∂d]hab + 2 [had,bc − hac,bd + hbc,ad − hbd,ac] + ··· 1 = 2θ(u)abω(u)cd + 2 [had,bc − hac,bd + hbc,ad − hbd,ac] + ··· . Now one sees explicitly the expansion rotation factor responsible for the symmetric part of the curvature, but the remainder of the expression (chosen by Cattaneo-Gasperini as the spatial curvature tensor [1961]) still has hidden kinematical terms in the antisymmetric part of the second derivative terms. Making these explicit leads to the expression
R(lie)(u)abcd = 2θ(u)abω(u)cd
+ [−θ(u)adω(u)bc + θ(u)acω(u)bd − θ(u)bcω(u)ad + θ(u)bdω(u)ac] (2.330) 1 + 2 [had,(bc) − hac,(bd) + hbc,(ad) − hbd,(ac)] + ··· . As noted by Ferrarese [1965], the final part of the formula following the explicit kinematical terms has all the algebraic symmetries of the corresponding identical coordinate formula on a Riemannian manifold and therefore has all of the algebraic symmetries of the usual covariant Riemann tensor. This symmetry-obeying spatial curvature tensor has the expression 1 R(sym)(u)abcd = 2 [had,(bc) − hac,(bd) + hbc,(ad) − hbd,(ac)] e e (2.331) − Γ(u) caΓ(u)edb + Γ(u) daΓ(u)ecb in a spatially-holonomic spatially-comoving frame, and is related to the Lie spatial curvature tensor and other curvature tensors (independent of the frame) by ab ab ab [a b] R(sym)(u) cd = R(lie)(u) cd − 2θ(u) ω(u)cd − 4θ(u) [cω(u) d] [ab] [a b] = R(lie)(u) cd − 4θ(u) [cω(u) d] (2.332) ab [a b] = R(cfw)(u) cd − 4θ(u) [cω(u) d] ab [a b] ab = R(fw)(u) cd − 4θ(u) [cω(u) d] − 2ω(u) ω(u)cd .
Since both R(lie)(u)abcd and R(sym)(u)abcd satisfy the usual cyclic symmetry of the torsionfree Bianchi identity of the first kind, the two kinematical difference terms together must also, as can be easily verified. Absorbing the first of these into the Lie spatial curvature to yield its antisymmetric part (in the first pair of indices) then explains why that antisymmetric part alone does not satisfy this cyclic symmetry.
72 2.5.4 Spatial Ricci tensors and scalar curvatures Neither one of the two kinematical difference terms between the Lie and symmetry-obeying spatial curvature tensors contribute to the curvature scalar
ab ab ab ab ab R(sym)(u) ab = R(lie)(u) ab = R(cfw)(u) ab = R(fw)(u) ab − 2ω(u) ω(u)ab (2.333) but they do make a difference in the spatial Ricci and Einstein tensors. It does not make much sense to ab define the Lie spatial Ricci tensor by taking the trace of R(lie)(u) cd by contracting either one or the other of the indices of the first index pair against one from the second antisymmetric pair since two different tensors result which differ by a kinematical term associated with the symmetric part of the first index pair. The trace using instead the antisymmetric part of the first index pair does result in a unique Lie spatial Ricci tensor, as does the trace of the Fermi-Walker spatial curvature as well as the trace of the “symmetry-obeying” spatial curvature tensor, but only the last Ricci tensor is symmetric
a [ca] R(lie)(u) b = R(lie)(u) cb , a a ca R(fw)(u) b = R(lie)(u) b + 2ω(u) ω(u)cb , (2.334) a ca R(sym)(u) b = R(sym)(u) cb .
The relation between the Lie and the symmetry-obeying Ricci tensors is
a a [c a] R(sym)(u) b = R(lie)(u) b − 4θ(u) [cω(u) b] a c a c a ca (2.335) = R(lie)(u) b − θ(u) cω(u) b + θ(u) bω(u) c − θ(u) ω(u)bc , so c c 0 = R(sym)(u)[ab] = R(lie)(u)[ab] − θ(u) cω(u)ab − 2θ(u) [aω(u)b]c , (2.336) hence c c R (u) = θ(u) cω(u)ab + 2θ(u) ω(u) , (tem) [ab] [a b]c (2.337) tem = lie,fw,cfw . This antisymmetric part of the spatial Ricci tensor comes from the lack of pair interchange symmetry in the corresponding spatial curvature tensor. In the case of a nonrotating observer congruence, all four spatial curvature tensors coincide and are equivalent to the Riemann curvature tensor of the Riemannian 3-manifolds which result from the induced metric on the hypersurfaces orthogonal to the congruence. In the general rotating case, one may use any one of the four distinct tensors, although one may be better suited than the others in a particular context in terms of absorbing extra kinematical terms. In the decomposition of the Einstein equations, it is clearly preferable to have a symmetric spatial Ricci or Einstein tensor, so the symmetry-obeying choice of spatial curvature tensor is perhaps the cleanest separation of spatial curvature from kinematical terms which occur there. In the special case of an observer congruence with vanishing expansion tensor, as occurs in stationary spacetimes if the congruence is taken along the Killing trajectories, the Lie, co-rotating Fermi-Walker and symmetry-obeying choices agree but differ from the Fermi-Walker choice by a term quadratic in the rotation.
2.5.5 Pair interchange symmetry The pair interchange symmetry of an ordinary curvature tensor follows from the antisymmetry of the first index pair together with the cyclic symmetry of the Bianchi identity of the first kind. The symmetry-obeying spatial curvature tensor has all of these symmetries
R(sym)(u)αβγδ = R(sym)(u)γδαβ . (2.338)
The relationship between the symmetry-obeying and Lie spatial curvatures then implies the following
73 identity for the latter curvature
[αβ] αβ αβ αβ R(lie)(u) γδ − R(lie)(u)[γδ] = R(sym)(u) γδ − R(sym)(u)γδ [α β] [α β] + 4θ(u) [γ ω(u) δ] − 4θ(u)[γ ω(u)δ] [α β] = 8θ(u) [γ ω(u) δ] (2.339) αβ αβ = R(fw)(u) γδ − R(fw)(u)γδ αβ αβ = R(cfw)(u) γδ − R(cfw)(u)γδ or equivalently αβ αβ R(lie)(u) γδ − R(lie)(u)γδ (2.340) [α β] αβ αβ = 8θ(u) [γ ω(u) δ] + 2θ(u) ω(u)γδ − 2θ(u)γδω(u) . The trace of the first of these relations provides another way to obtain the antisymmetric part of the Lie spatial Ricci tensor.
2.5.6 Spatial covariant exterior derivative The differential form approach to curvature may also be developed in the spatial context. Just as in the spacetime context, the spatial covariant exterior derivative defined for spatial tensor-valued differential forms is very useful in concisely presenting many identities. This operator interpolates between the spatial exterior derivative for scalar-valued (i.e., ordinary) differential forms and the spatial covariant derivative for tensor- valued zero-forms, and is entirely analogous to the corresponding covariant exterior derivative on spacetime, as described for a general manifold in appendix A. Suppose Sα... is a spatial tensor which is antisymmetric in a group of p covariant indices. One β...γ1...γp may think of it as a tensor-valued p-form, i.e., when its p covariant arguments are evaluated, one is left with a tensor with the remaining index structure. One has in invariant notation 1 α... α... γ1 γp S β... = S β...γ ...γ ω ∧ · · · ∧ ω , p! 1 p (2.341) β α... S = eα ⊗ · · · ⊗ ω ⊗ · · · ⊗ S β... , and the spatial covariant exterior derivative is then defined by
β α... D(u)S = eα ⊗ · · · ⊗ ω ⊗ · · · ⊗ D(u)S β... , β α... = eα ⊗ · · · ⊗ ω ⊗ · · · ⊗ d(u)S β... (2.342) β α... + ∇(u) ∧ [eα ⊗ · · · ⊗ ω ⊗ · · ·] ⊗ S β... , where the notation ∇∧ is meant to indicate the wedge product of the additional covariant derivative index with the p-form. The first line of the above definition emphasizes the sloppy classical notation of not distinguishing the components of the covariant derivative from the derivative of the components. The former interpretation is assumed to hold for the occurrence in this formula. In an observer-adapted frame, the component formula is directly analogous to the general one. If S is a spatial tensor-valued differential form of type σ (describing the tensor-valued index structure) then its spatial covariant exterior derivative is
b a... S = ea ⊗ · · · ⊗ ω ⊗ · · · ⊗ S b... , a... a... a... D(u)S b... = d(u)S b... + [σ(ωωω(u)) ∧ S] b... , (2.343) b a... D(u)S = ea ⊗ · · · ⊗ ω ⊗ · · · ⊗ D(u)S b... , where a b a a c ωωω(u) = ea ⊗ ωωω(u) b ⊗ ω , ωωω(u) b = Γ(u) cbω (2.344) 1 is the matrix of spatial connection 1-forms thought of as a 1 -“tensor”-valued 1-form and the wedge product is meant between the connection 1-form and the p-form indices.
74 By introducing the spatial identity tensor as a vector-valued 1-form
a a a ϑϑϑ(u) = P (u) = ea ⊗ ω , ϑϑϑ(u) = ω , (2.345)
one obtains the vector-valued spatial torsion 2-form as its spatial covariant exterior derivative
ΘΘΘ(u) = D(u)ϑϑϑ(u) = d(u)ϑϑϑ(u) + ωωω(u) ∧ϑϑϑ(u) (2.346) where the unusual notation ∧ necessary for the index-free notation indicates a right contraction between the adjacent tensor-valued indices and an exterior product between the differential form indices. In the index-notation one has explicitly
a a a a b ΘΘΘ(u) = D(u)ϑϑϑ(u) = d(u)ϑϑϑ(u) + ωωω(u) b ∧ ϑϑϑ(u) (2.347) 1 a b c = 2 –T (u) bcω ∧ ω = 0 .