Classical Physics: Spacetime and Fields
Nikodem Poplawski
Department of Mathematics and Physics, University of New Haven, CT, USA
Preface
We present a self-contained introduction to the classical theory of spacetime and fields. This expo- sition is based on the most general principles: the principle of general covariance (relativity) and the principle of least action. The order of the exposition is: 1. Spacetime (principle of general covariance and tensors, affine connection, curvature, metric, tetrad and spin connection, Lorentz group, spinors); 2. Fields (principle of least action, action for gravitational field, matter, symmetries and conservation laws, gravitational field equations, spinor fields, electromagnetic field, action for particles). In this order, a particle is a special case of a field existing in spacetime, and classical mechanics can be derived from field theory.
I dedicate this book to my Parents: Bo˙zennaPop lawska and Janusz Poplawski. I am also grateful to Chris Cox for inspiring this book.
The Laws of Physics are simple, beautiful, and universal. arXiv:0911.0334v2 [gr-qc] 4 Jul 2020
1 Contents
1 Spacetime 5 1.1 Principle of general covariance and tensors ...... 5 1.1.1 Vectors ...... 5 1.1.2 Tensors ...... 6 1.1.3 Densities ...... 7 1.1.4 Contraction ...... 7 1.1.5 Kronecker and Levi-Civita symbols ...... 8 1.1.6 Dual densities ...... 8 1.1.7 Covariant integrals ...... 9 1.1.8 Antisymmetric derivatives ...... 9 1.2 Affine connection ...... 10 1.2.1 Covariant differentiation of tensors ...... 10 1.2.2 Parallel transport ...... 11 1.2.3 Torsion tensor ...... 11 1.2.4 Covariant differentiation of densities ...... 12 1.2.5 Antisymmetric covariant derivatives ...... 13 1.2.6 Partial integration ...... 14 1.2.7 Geodesic frame of reference ...... 14 1.2.8 Affine geodesics and four-velocity ...... 15 1.2.9 Infinitesimal coordinate transformations ...... 16 1.2.10 Killing vectors ...... 17 1.3 Curvature ...... 18 1.3.1 Curvature tensor ...... 18 1.3.2 Integrability of connection ...... 19 1.3.3 Parallel transport along closed curve ...... 20 1.3.4 Bianchi identities ...... 20 1.3.5 Ricci tensor ...... 21 1.3.6 Geodesic deviation ...... 21 1.4 Metric ...... 22 1.4.1 Metric tensor ...... 22 1.4.2 Christoffel symbols ...... 24 1.4.3 Riemann tensor ...... 27 1.4.4 Properties of Riemann tensor ...... 28 1.4.5 Metric geodesics ...... 29 1.4.6 Galilean frame of reference and Minkowski tensor ...... 30 1.4.7 Riemann normal coordinates ...... 31 1.4.8 Intervals, proper time, and distances ...... 32 1.4.9 Spatial vectors ...... 35 1.4.10 Embedded hypersurfaces ...... 37 1.4.11 Event horizon ...... 43 1.5 Tetrad and spin connection ...... 43 1.5.1 Tetrad ...... 43 1.5.2 Lorentz transformation ...... 44 1.5.3 Tetrad transport ...... 44 1.5.4 Spin connection ...... 45 1.5.5 Tetrad representation of curvature tensor ...... 46 1.6 Lorentz group ...... 47 1.6.1 Subgroups of Lorentz group and Einstein principle of relativity ...... 47 1.6.2 Infinitesimal Lorentz transformations ...... 48 1.6.3 Generators and Lie algebra of Lorentz group ...... 48 1.6.4 Rotations and boosts ...... 49 1.6.5 Poincar´egroup ...... 51
2 1.6.6 Invariants of Lorentz and Poincar´egroup ...... 53 1.6.7 Relativistic kinematics ...... 54 1.6.8 Four-acceleration ...... 57 1.7 Spinors ...... 59 1.7.1 Spinor representation of Lorentz group ...... 59 1.7.2 Spinor connection ...... 60 1.7.3 Curvature spinor ...... 62
2 Fields 63 2.1 Principle of least action ...... 63 2.2 Action for gravitational field ...... 64 2.3 Matter ...... 65 2.3.1 Metric energy-momentum tensor ...... 65 2.3.2 Tetrad energy-momentum tensor ...... 66 2.3.3 Canonical energy-momentum density ...... 66 2.3.4 Spin tensor ...... 67 2.3.5 Belinfante-Rosenfeld relation ...... 68 2.4 Symmetries and conservation laws ...... 69 2.4.1 Noether theorem ...... 69 2.4.2 Conservation of spin ...... 69 2.4.3 Conservation of metric energy-momentum ...... 70 2.4.4 Conservation of tetrad energy-momentum ...... 72 2.4.5 Conservation laws for Lorentz group ...... 73 2.4.6 Momentum four-vector ...... 74 2.4.7 Mass ...... 76 2.4.8 Angular momentum four-tensor ...... 76 2.4.9 Energy-momentum tensor for particles ...... 79 2.4.10 Spin tensor for particles ...... 83 2.4.11 Relativistic ideal fluids ...... 85 2.4.12 Multipole expansion of spin tensor ...... 88 2.4.13 Multipole expansion of energy-momentum tensor ...... 88 2.4.14 Mathisson-Papapetrou-Dixon equations ...... 90 2.5 Gravitational field equations ...... 94 2.5.1 Einstein-Cartan action and equations ...... 94 2.5.2 Sciama-Kibble action ...... 96 2.5.3 Einstein-Hilbert action and Einstein equations ...... 97 2.5.4 Utiyama action ...... 99 2.5.5 Einstein pseudotensor and principle of equivalence ...... 99 2.5.6 Møller pseudotensor ...... 102 2.5.7 Landau-Lifshitz energy-momentum pseudotensor ...... 104 2.5.8 Palatini variation ...... 106 2.5.9 Gravitational potential ...... 107 2.5.10 Raychaudhuri equation ...... 108 2.5.11 Relativistic spin fluids ...... 109 2.6 Spinor fields ...... 111 2.6.1 Dirac matrices ...... 111 2.6.2 Lagrangian density and spin tensor for spinor field ...... 114 2.6.3 Dirac equation ...... 115 2.6.4 Energy-momentum tensor for spinor field ...... 117 2.6.5 Discrete symmetries of spinors ...... 118 2.7 Electromagnetic field ...... 119 2.7.1 Gauge invariance and electromagnetic potential ...... 119 2.7.2 Electromagnetic field tensor ...... 121 2.7.3 Lagrangian density for electromagnetic field ...... 123
3 2.7.4 Electromagnetic current and electric charge ...... 124 2.7.5 Maxwell equations ...... 126 2.7.6 Energy-momentum tensor for electromagnetic field ...... 128 2.7.7 Lorentz force ...... 129 2.8 Action for particles ...... 131
4 1 Spacetime 1.1 Principle of general covariance and tensors Physical processes are described in coordinate systems in four-dimensional spacetime, called systems of reference or frames of reference. The principle of general covariance or Einstein’s general principle of relativity states that physical laws do not change their form (are covariant) under arbitrary, differentiable (and thereby continuous) coordinate transformations. Equivalently, physical laws have the same form in all admissible frames of reference.
1.1.1 Vectors Let us consider a coordinate transformation from old (unprimed) to new (primed) coordinates in a four-dimensional manifold: xi → x0j(xi), (1.1.1) where x0j are differentiable and nondegenerate functions of xi and the index i (and the other Latin ∂x0j indices) can be 0,1,2, or 3. The corresponding transformation matrix ∂xi , which is four-dimensional ∂x0j i and square (4 × 4), has a nonzero determinant | ∂xi | 6= 0, thereby that x are differentiable and 0j ∂xi ∂x0j nondegenerate functions of x . The matrix ∂x0j is inverse to ∂xi : X ∂x0i ∂xk = δk, (1.1.2) ∂xj ∂x0i j i where 1 i = k δi = . (1.1.3) k 0 i 6= k A scalar (invariant) is defined as a quantity that does not change under coordinate transformations:
φ0 = φ. (1.1.4)
Accordingly, the differential of a scalar is also a scalar:
dφ0 = dφ. (1.1.5)
If φ(xi) is a scalar function of the coordinates xi, then its differential can be expressed as X ∂φ dφ(xi) = dxi. (1.1.6) ∂xi i
i ∂ Coordinate differentials dx and partial derivatives ∂i = ∂xi transform according to X ∂x0j dx0j = dxi, (1.1.7) ∂xi i ∂ X ∂xi ∂ X ∂xi = , ∂0 = ∂ . (1.1.8) ∂x0j ∂x0j ∂xi j ∂x0j i i i A contravariant vector is defined as a set of quantities that transform like coordinate differentials:
X ∂x0j A0j = Ai. (1.1.9) ∂xi i These quantities are referred to as the components of the contravariant vector. A covariant vector is defined as a set of quantities that transform like partial derivatives of a scalar:
X ∂xi B0 = B . (1.1.10) j ∂x0j i i
5 These quantities are referred to as the components of the covariant vector. Therefore, coordinate differentials form a contravariant vector and partial derivatives of a scalar form a covariant vector. The coordinates xi do not form a vector. A linear combination of two scalars is a scalar. A linear combination aC + bD of two contravariant vectors C and D, where a and b are scalars, is a contravariant vector E whose components are Ei = aCi + bDi. A linear combination aC + bD of two covariant vectors C and D is a covariant vector E whose components are Ei = aCi + bDi. An upper index (in a contravariant vector) is called contravariant, and a lower index is called covariant. The derivative with respect to a quantity with a contravariant index i is a quantity with a covariant index i. Conversely, the derivative with respect to a quantity with a covariant index i is a quantity with a contravariant index i. Henceforth, we adopt the following Einstein’s summation convention. If the same coordinate index i appears in a given expression twice, as a contravariant index and a covariant index, and we apply the summation P in this expression, then we do not need P i to write the summation sign i. Accordingly, we can omit the summation signs in the formulae of this section.
1.1.2 Tensors A product of several vectors transforms under differentiable coordinate transformations such that each coordinate index transforms separately:
∂x0i ∂x0j ∂xp ∂xq A0iB0j ...C0 D0 ··· = AmBn ...C D .... (1.1.11) k l ∂xm ∂xn ∂x0k ∂x0l p q A tensor is defined as a set of quantities that transform like products of the components of vectors:
∂x0i ∂x0j ∂xp ∂xq T 0ij... = T mn... . (1.1.12) kl... ∂xm ∂xn ∂x0k ∂x0l pq... These quantities are referred to as the components of the tensor. A tensor is of rank (k, l) if it has k contravariant and l covariant indices. A scalar is a tensor of rank (0,0), a contravariant vector is a tensor of rank (1,0), and a covariant vector is a tensor of rank (0,1). A linear combination of two tensors of rank (k, l) is a tensor of rank (k, l) such that its components are the same linear combinations of the corresponding components of the tensors. The product of two tensors of ranks (k1, l1) and (k2, l2) is a tensor of rank (k1 + k2, l1 + l2). Tensor indices (all contravariant or all covariant) can be symmetrized:
1 X T = T , (1.1.13) (ij...k) n! {ij...k} permutations or antisymmetrized: 1 X T = T (−1)N , (1.1.14) [ij...k] n! {ij...k} permutations where n is the number of symmetrized or antisymmetrized indices and N is the number of per- 1 mutations that bring Tij...k to T{ij...k}. For example, for two indices: T(ik) = 2 (Tik + Tki) and 1 1 T[ik] = 2 (Tik − Tki), and for three indices: T[ijk] = 3 (Tijk + Tjki + Tkij). If n > 4 then T[ij...k] = 0. Symmetrized and antisymmetrized tensors or rank (k, l) are tensors of rank (k, l). Symmetrization of an antisymmetric tensor or antisymmetrization of a symmetric tensor bring these tensors to zero. Any tensor of rank (0,2) is the sum of its symmetric and antisymmetric part,
T(ik) + T[ik] = Tik. (1.1.15)
The number 0 can be regarded as a tensor of arbitrary rank. Therefore, all covariant equations of ij... classical physics must be represented in the tensor form: T kl... = 0.
6 1.1.3 Densities The element of volume in four-dimensional spacetime transforms according to 0i 4 0 ∂x 4 d x = d x. (1.1.16) ∂xk A scalar density is defined as a quantity that transforms such that its product with the element of volume is a scalar, s0d4x0 = sd4x: i 0 ∂x s = s. (1.1.17) ∂x0k A tensor density, which includes a contravariant and covariant vector density, is defined as a set of quantities that transform like products of the components of a tensor and a scalar density: i 0i 0j p q 0ij... ∂x ∂x ∂x ∂x ∂x mn... T = T . (1.1.18) kl... ∂x0k ∂xm ∂xn ∂x0k ∂x0l pq... These quantities are referred to as the components of the tensor density. A tensor density is of rank (k, l) if it has k contravariant and l covariant indices. For example, the square root of the determinant of a tensor of rank (0, 2) is a scalar density of weight 1: s s q l m j 2 j 0 ∂x ∂x ∂x ∂x p |T | = Tlm = |Tik| = |Tik|. (1.1.19) ik ∂x0i ∂x0k ∂x0n ∂x0n The above densities are said to be of weight 1. One can generalize this definition of densities by introducing densitites of weight w, which transform according to i w 0 ∂x s = s. (1.1.20) ∂x0k For example, d4x is a scalar density of weight -1. A linear combination of two densities of rank (k, l) and weight w is a density of rank (k, l) and weight w such that its components are the same linear combinations of the corresponding components of the densities. The product of two densities of weights w1 and w2 is a density of weight w1 + w2. Symmetrized and antisymmetrized densities of weight w are densities of weight w. Densities of weight 1 are simply referred to as densities. Tensors are densities of weight 0.
1.1.4 Contraction Einstein’s summation convention also applies within the same tensor or tensor density, if a given coordinate index i appears twice (as a contravariant and covariant index). Such a tensor or density is said to be contracted over index i. A contracted tensor of rank (k, l) transforms like a tensor of rank (k − 1, l − 1): ∂x0i ∂x0j ∂xp ∂xq ∂x0j ∂xq ∂x0j ∂xq T 0ij... = T mn... = δp T mn... = T mn... . (1.1.21) il... ∂xm ∂xn ∂x0i ∂x0l pq... ∂xn ∂x0l m pq... ∂xn ∂x0l mq... i For example, the contraction of a contravariant and covariant vector A Bi is a scalar (scalar product). A contracted tensor density of rank (k, l) and weight w transforms like a tensor density of rank (k − 1, l − 1) and weight w: i w 0i 0j p q i w 0j q 0ij... ∂x ∂x ∂x ∂x ∂x mn... ∂x ∂x ∂x p mn... T = T = δ T il... ∂x0k ∂xm ∂xn ∂x0i ∂x0l pq... ∂x0k ∂xn ∂x0l m pq... i w 0j q ∂x ∂x ∂x mn... = T . (1.1.22) ∂x0k ∂xn ∂x0l mq... Contraction of a symmetric tensor with an antisymmetric tensor (over indices with respect to which these tensors are symmetric or antisymmetric) gives zero. If contraction of two tensors gives zero, these tensors are said to be orthogonal. Two orthogonal vectors (one contravariant and one covariant) are said to be perpendicular.
7 1.1.5 Kronecker and Levi-Civita symbols
i The Kronecker symbol δk (1.1.3) is a tensor with constant components: ∂x0i ∂xl ∂x0i ∂xj δ0i = δj = = δi . (1.1.23) k ∂xj ∂x0k l ∂xj ∂x0k k A completely antisymmetric tensor of rank (4, 0), T ijkl = T [ijkl] has 1 independent component T : T ijkl = T ijkl, where ijkl is the completely antisymmetric, contravariant Levi-Civita permutation symbol: 0123 = 1, ijkl = [ijkl] = (−1)N , (1.1.24) and N is the number of permutations that bring ijkl to 0123. The determinant of a square matrix i i i Sk, det(Sk) = |Sk|, is defined through the permutation symbol: r ijkl i j k l mnpq |Ss | = SmSnSp Sq . (1.1.25)
i ∂x0i Taking Sk = ∂xk gives r 0i 0j 0k 0l ijkl ∂x ∂x ∂x ∂x ∂x mnpq = . (1.1.26) ∂x0s ∂xm ∂xn ∂xp ∂xq This equation looks like a transformation law for a tensor density (of weight 1) with constant components: 0ijkl = ijkl. Accordingly, T is a scalar density of weight -1. We also introduce the covariant Levi-Civita symbol εijkl through: i i i i δm δn δp δq j j j j ijkl δm δn δp δq εmnpq = − k k k k . (1.1.27) δm δn δp δq l l l l δm δn δp δq Therefore, the covariant Levi-Civita symbol is a tensor density of weight -1 and its product with a scalar density is a tensor. The covariant Levi-Civita symbol is given by N ε0123 = −1, εijkl = ε[ijkl] = (−1) , (1.1.28) where N is the number of permutations that bring εijkl to ε0123, and satisfies r m n p q |Ss |εijkl = Si Sj Sk Sl εmnpq. (1.1.29) Contracting (1.1.27) gives the following relations: i i i δm δn δp ijkl j j j εmnpl = − δm δn δp , k k k δm δn δp ijkl i j i j εmnkl = −2(δmδn − δnδm), ijkl i εmjkl = −6δm, ijkl εijkl = −24. (1.1.30)
1.1.6 Dual densities A contracted product of a covariant tensor and the contravariant Levi-Civita symbol gives a dual contravariant tensor density of weight 1: iklm ikl iklm ik iklm i Am = A , Blm = B , Cklm = C . (1.1.31) A contracted product of a contravariant tensor and the covariant Levi-Civita symbol gives a dual covariant tensor density of weight -1: m lm klm εiklmA = Aikl, εiklmB = Bik, εiklmC = Ci. (1.1.32) Therefore, there exists an algebraic correspondence between covariant tensors and contravariant densities of weight 1, and between contravariant tensors and covariant densities of weight -1.
8 1.1.7 Covariant integrals
i R j... i A covariant line integral is an integral of a tensor contracted with the line differential dx : T i...dx . A covariant surface integral is an integral of a tensor contracted with the surface differential df ik = dxidx0k −dxkdx0i (which can be geometrically represented as a parallelogram spanned by the vectors i 0i R j... ik dx and dx ): T ik...df . A covariant hypersurface (volume) integral is an integral of a tensor
dxi dx0i dx“i
contracted with the volume differential dSikl = dxk dx0k dx“k (which can be geometrically
dxl dx0l dx“l i 0i “i R j... ikl represented as a parallelepiped spanned by the vectors dx , dx ) and dx : T ikl...dS . A covariant four-volume integral is an integral of a tensor contracted with the four-volume differential dSijkl, defined analogously to dSikl. The dual density corresponding to the surface element is given by 1 df ? = ε df lm, (1.1.33) ik 2 iklm which gives 1 df lm = − lmikdf ? , df ikdf ? = 0. (1.1.34) 2 ik ik The dual density corresponding to the hypersurface element is given by 1 dS = − ε dSklm, dSklm = −klmidS . (1.1.35) i 6 iklm i The dual density corresponding to the four-volume element is given by 1 dΩ = ε dSiklm = dx0dx1dx2dx3. (1.1.36) 24 iklm Covariant integrands that include the above dual densities of weight -1 must be multiplied by a scalar density, for example, by the square root of the determinant of a tensor of rank (0, 2). According to Gauß’ and Stokes’ theorems, there exists relations between integrals over different elements: ∂ dxi ↔ df ki , (1.1.37) ∂xk ∂ ∂ df ? ↔ dS − dS , (1.1.38) ik i ∂xk k ∂xi ∂ dS ↔ dΩ . (1.1.39) i ∂xi
1.1.8 Antisymmetric derivatives A derivative of a covariant vector does not transform like a tensor: ∂A0 ∂ ∂xm ∂xm ∂A ∂2xm ∂xl ∂xm ∂A ∂2xm k = A = m + A = m + A , (1.1.40) ∂x0i ∂x0i ∂x0k m ∂x0k ∂x0i ∂x0i∂x0k m ∂x0i ∂x0k ∂xl ∂x0i∂x0k m
i because of the second term which is linear and homogeneous in Ai, unless x are linear functions of 0j ∂Ak x . This term is symmetric in the indices i, k, thereby the antisymmetric part of ∂xi with respect to these indices is a tensor: ∂xl ∂xm ∂xl ∂xm ∂0 A0 = ∂ A = ∂ A . (1.1.41) [i k] ∂x0[i ∂x0k] l m ∂x0i ∂x0k [l m]
The curl of a covariant vector Ai is defined as twice the antisymmetric part of ∂iAk: ∂iAk − ∂kAi, ∂ i and is a tensor. We will also use ,i = ∂xi to denote a partial derivative with respect to x . Similarly, completely antisymmetrized derivatives of tensors of rank (0, 2) and (0, 3), ∂[iBkl] and ∂[iCklm], are tensors. If Bkl = A[k,l] then ∂[iBkl] = 0, or conversely, if ∂[iBkl] = 0 then there exists a vector Ai such that Bkl = A[k,l]. The divergence of a tensor (or density) is a contracted derivative of this tensor
9 il... (density): ∂iT jk.... Because of the correspondence between tensors and dual densities, divergences of (completely antisymmetric if more than 1 index) contravariant densities are densities, dual to completely antisymmetrized derivatives of tensors:
i iklm ik iklm ikl iklm ∂iC = ∂[iCklm], ∂kB = ∂[kBlm], ∂lA = ∂[lAm]. (1.1.42)
ik k For example, the equations F[ik,l] = 0 and F ,i = j , that describe Maxwell’s electrodynamics (confer (2.7.32) and (2.7.82)), are tensorial. References: [1, 2].
1.2 Affine connection 1.2.1 Covariant differentiation of tensors
An ordinary derivative of a covariant vector Ai is not a tensor, because its coordinate transformation law (1.1.40) contains an additional noncovariant term, linear and homogeneous in Ai. Such a term vanishes only if xi are linear functions of x0j, that is, if the coordinate transformation from old to new coordinates (1.1.1) is linear. Let us consider the expression
l Ai;k = Ai,k − Γi kAl, (1.2.1)
l where the quantity Γi k (in the second term which is linear and homogeneous in Ai) transforms such that Ai;k is a tensor:
∂xl ∂xm ∂xl ∂xm A0 = A = (A − Γ n A ). (1.2.2) i;k ∂x0i ∂x0k l;m ∂x0i ∂x0k l,m l m n Also, (1.1.40) gives
∂xm ∂xl ∂2xn ∂xn A0 = A0 − Γ0 l A0 = A + A − Γ0 l A , (1.2.3) i;k i,k i k l ∂x0k ∂x0i l,m ∂x0k∂x0i n ∂x0l i k n so we obtain ∂xn ∂xl ∂xm ∂2xn Γ0 l = Γ n + . (1.2.4) ∂x0l i k ∂x0i ∂x0k l m ∂x0k∂x0i ∂x0j l Multiplying this equation by ∂xn gives the transformation law for Γi k: ∂x0j ∂xl ∂xm ∂x0j ∂2xn Γ0 j = Γ n + . (1.2.5) i k ∂xn ∂x0i ∂x0k l m ∂xn ∂x0k∂x0i l The algebraic object Γi k, which equips spacetime in order to covariantize a derivative of a vector, is referred to as the affine connection, affinity or simply connection. The connection has generally 64 independent components. The tensor Ai;k is the covariant derivative of a vector Ai with respect i to x . We will also use ∇i = ;i to denote the covariant derivative. The contracted affine connection transforms according to ∂xm ∂x0i ∂2xn Γ0 i = Γ l + . (1.2.6) i k ∂x0k l m ∂xn ∂x0k∂x0i The affine connection is not a tensor because of the second term on the right-hand side of (1.2.5). A derivative of a scalar is a covariant vector. Therefore, the covariant derivative of a scalar is equal to an ordinary derivative: φ;i = φ,i. (1.2.7) If we also assume that the covariant derivative of the product of two tensors obeys the same chain rule as an ordinary derivative: (TU);i = T;iU + TU;i, (1.2.8) then
k k k k k k k k l k Ak,iB + AkB ,i = (AkB ),i = (AkB );i = Ak;iB + AkB ;i = Ak,iB − Γl iAkB + AkB ;i. (1.2.9)
10 Therefore, we obtain the covariant derivative of a contravariant vector:
k k k l B ;i = B ,i + Γl iB . (1.2.10)
The chain rule (1.2.8) also infers that the covariant derivative of a tensor is equal to the sum of the corresponding ordinary derivative of this tensor and terms with the affine connection that covariantize each index:
ij... ij... i nj... j in... n ij... n ij... T kl...;m = T kl...,m + Γn mT kl... + Γn mT kl... + · · · − Γk mT nl... − Γl mT kn... − .... (1.2.11) the covariant derivative of the Kronecker symbol vanishes:
k k j j k δl;i = Γj iδl − Γl iδj = 0. (1.2.12) The second term on the right of (1.2.5) does not depend on the affine connection, but only on the coordinate transformation. Therefore, the difference between two different connections transforms j like a tensor of rank (1,2). Consequently, the variation δΓi k, which is an infinitesimal difference between two connections, is a tensor of rank (1,2).
1.2.2 Parallel transport Let us consider two infinitesimally separated points in spacetime, P (xi) and Q(xi + dxi), and a k k k k k i vector field A which takes the value A at P and A + dA at Q. Because dA = A ,idx and k k k k k A ,i is not a tensor, the difference dA between the vectors A + dA and A is not a vector. The differential dAk is not a vector because it arises from subtracting two vectors which are located at two points with different coordinate transformation laws. The transformation law for dAk follows from (1.1.40):
∂xm ∂xm ∂xm ∂xm ∂2xm dA0 = d A = dA + d A = dA + A dx0i. (1.2.13) k ∂x0k m ∂x0k m ∂x0k m ∂x0k m ∂x0i∂x0k m
In order to calculate the covariant difference between two vectors at two different points, we must bring these vectors to the same point. Instead of subtracting from the vector Ak + dAk at Q the vector Ak at P , we must subtract a vector Ak + δAk at Q that corresponds to Ak at P , thereby that the resulting difference (covariant differential) DAk = dAk − δAk is a vector. The vector Ak + δAk is the parallel-transported or parallel-translated Ak from P to Q. A parallel-transported linear combination of vectors must be equal to the same linear combination of parallel-transported vectors. Therefore, δAk is a linear and homogeneous function of Ak. It is also on the order of a differential, thus a linear and homogeneous function of dxi. The most general form of δAk is
k k l i δA = −Γl iA dx , (1.2.14) so k k k l i k i DA = dA + Γl iA dx = A ;idx . (1.2.15) k k k k Because δA is not a vector, Γl i is not a tensor. Because DA is a vector, A ;i is a tensor. The expressions for covariant derivatives of a covariant vector and tensors result from
δφ = 0, δ(TU) = δT U + T δU. (1.2.16)
1.2.3 Torsion tensor The second term on the right-hand side of (1.2.5) is symmetric in the indices i, k. Antisymmetrizing (1.2.5) with respect to these indices gives
∂x0j ∂xl ∂xm S0j = Sn , (1.2.17) ik ∂xn ∂x0i ∂x0k lm
11 where j j S ik = Γ[i k] (1.2.18) is the antisymmetric (in the covariant indices) part of the affine connection. Equation (1.2.17) is a transformation formula for a tensor, thereby (1.2.18) is a tensor, called the Cartan torsion tensor. The torsion tensor has generally 24 independent components. The contracted torsion tensor,
k S ik = Si, (1.2.19) is called the torsion trace or torsion vector.
1.2.4 Covariant differentiation of densities The differential of the determinant S of a square matrix S is given by
k i dS = s idS k, (1.2.20)
k i where s i is the minor corresponding to the component S k of the matrix. The components of the matrix S−1 inverse to S, i −1 j −1 i j i S j(S ) k = (S ) jS k = δk, (1.2.21) are related to the minors of S by sk (S−1)k = i . (1.2.22) i S The differential dS is therefore equal to
−1 k i k −1 i dS = S(S ) idS k = −SS id(S ) k, (1.2.23) which is equivalent to −1 k i k −1 i ∂lS = S(S ) i∂lS k = −SS i∂l(S ) k. (1.2.24) i ∂xi Taking S k = ∂x0k gives r r 0n m ∂x ∂x ∂x ∂ ∂x ∂l = . (1.2.25) ∂x0s ∂x0s ∂xm ∂xl ∂x0n A derivative of a scalar density s of weight w does not transform like a covariant vector density:
l j w l j w l j w−1 r 0 0 ∂x ∂x ∂x ∂x ∂x ∂x ∂x ∂ s = ∂l s = ∂ls + w ∂l s i ∂x0i ∂x0k ∂x0i ∂x0k ∂x0i ∂x0k ∂x0s l j w l j w−1 r 0n m ∂x ∂x ∂x ∂x ∂x ∂x ∂ ∂x = ∂ls + w s ∂x0i ∂x0k ∂x0i ∂x0k ∂x0s ∂xm ∂xl ∂x0n l j w j w 0n 2 m ∂x ∂x ∂x ∂x ∂ x = ∂ls + w s. (1.2.26) ∂x0i ∂x0k ∂x0k ∂xm ∂x0n∂x0i Let us consider the expression s;i = s,i − wΓis, (1.2.27) where the quantity Γi transforms such that s;i is a vector density of weight w:
l j w l j w 0 ∂x ∂x ∂x ∂x s = s;l = (s,l − wΓls). (1.2.28) ;i ∂x0i ∂x0k ∂x0i ∂x0k
Also, (1.2.26) gives
l j w j w 0n 2 m j w 0 0 0 0 ∂x ∂x ∂x ∂x ∂ x ∂x 0 s = s − wΓ s = ∂ls + w s − w Γ s, (1.2.29) ;i ,i i ∂x0i ∂x0k ∂x0k ∂xm ∂x0n∂x0i ∂x0k i
12 so we obtain the transformation law for Γi:
∂xl ∂x0n ∂2xm Γ0 = Γ + , (1.2.30) i ∂x0i l ∂xm ∂x0n∂x0i
k k which is the same as the transformation law for Γk i (1.2.6). Therefore, the difference Γi − Γk i is some covariant vector Vi. If we assume that parallel transport of the product of a scalar density of any weight and a tensor obeys the chain rule: δ(sT ) = δsT + sδT, (1.2.31) so the covariant derivative of such product behaves like an ordinary derivative:
(sT );i = s;iT + sT;i, (1.2.32)
then the covariant derivative of a tensor density of weight w is equal to the sum of the corresponding ordinary derivative of this tensor, terms with the affine connection that covariantize each index, and the term with Γi:
ij... ij... i nj... j in... T kl...;m = T kl...,m + Γn mT kl... + Γn mT kl... + ... n ij... n ij... ij... −Γk mT nl... − Γl mT kn... − · · · − wΓmT kl.... (1.2.33) the covariant derivative of the contravariant Levi-Civita density is
ijkl i njkl j inkl k ijnl l ijkn ijkl ;m = Γn m + Γn m + Γn m + Γn m − Γm . (1.2.34) In the summations over n only one term does not vanish for each term on the right-hand side of (1.2.34), thereby
ijkl i n=i|jkl j i|n=j|kl k ij|n=k|l l ijk|n=l ijkl ;m = Γn=i|m + Γn=j|m + Γn=k|m + Γn=l|m − Γm n ijkl ijkl = (Γn m − Γm) = −Vm . (1.2.35)
The Levi-Civita symbol is a tensor density with constant components, thereby it does not change under a parallel transport, δ = 0. Therefore, we have
ijkl ;m = 0. (1.2.36) By means of (1.1.27), we also have εijkl;m = 0. (1.2.37)
Consequently, we obtain Vi = 0 and k Γi = Γk i. (1.2.38)
1.2.5 Antisymmetric covariant derivatives
Completely antisymmetrized ordinary derivatives of tensors, A[i,k], B[ik,l] and C[ikl,m], are tensors because of their antisymmetry. Completely antisymmetrized covariant derivatives of tensors are tensors because ∇i is a covariant operation, and are given by direct calculation using the definition of the covariant derivative:
l m A[i;k] = A[i,k] − S ikAl,B[ik;l] = B[ik,l] − 2S [ikBl]m. (1.2.39)
i ik Divergences of (completely antisymmetric if more than 1 index) contravariant densities, C ,i, B ,i ikl and A ,i, are densities because of the correspondence between tensors and dual densities. Covariant divergences of contravariant densities are densities, and are given by direct calculation:
i i i ik ik k il ik C ;i = C ,i + 2SiC , B ;i = B ,i − S ilB + 2SiB . (1.2.40)
13 1.2.6 Partial integration If the product of two quantities (tensors or densities) TU is a contravariant density Ck then Z Z Z Z Z Z TU;kdΩ = (TU);kdΩ − T;kUdΩ = (TU),kdΩ + 2 SkT UdΩ − T;kUdΩ. (1.2.41)
R The first term on the right-hand side can be transformed into a hypersurface integral T UdSk. If the region of integration extends to infinity and Ck corresponds to some physical quantity then the R boundary integral T UdSk vanishes, giving Z Z Z TU;kdΩ = 2 SkT UdΩ − T;kUdΩ. (1.2.42)
If T = δk, then U = Ci and i Z Z i i C ;idΩ = 2 SiC dΩ. (1.2.43)
Equation (1.2.43) can be written as Z ∗ i ∇i C dΩ = 0, (1.2.44) where ∗ ∇i = ∇i − 2Si (1.2.45) is the modified covariant derivative.
1.2.7 Geodesic frame of reference Let us consider a coordinate transformation 1 xk = x0k + ak x0lx0m, (1.2.46) 2 lm
k where a lm is symmetric in the indices l, m. Substituting this transformation to (1.2.5) and calcu- lating it at xk = x0k = 0 gives ∂xi = δi (1.2.47) ∂x0k k and 0 j j j Γi k = Γi k + a ik. (1.2.48) Putting j j a ik = −Γ(i k)|xl=0 (1.2.49) gives 0 j Γ(i k) = 0. (1.2.50) Therefore, there always exists a coordinate frame of reference in which the symmetric part of the connection vanishes locally (at one point). If the affine connection is symmetric in the covariant j j indices, Γi k = Γk i (the torsion tensor vanishes), then (1.2.50) gives
0 j Γi k = 0. (1.2.51) The coordinate frame of reference in which the torsionless part of the connection vanishes (locally) is referred to as geodesic.
14 1.2.8 Affine geodesics and four-velocity Let us consider a point in spacetime P (xk) and a vector dxk at this point. We construct a point P 0(xk + dxk) and find the vector d0xk which is the parallel-transported dxk from P to P 0. Then construct a point P 00(xk +dxk +d0xk) and find the vector d00xk which is the parallel-transported d0xk from P 0 to P 00. The next point is P 000(xk + dxk + d0xk + d00xk) etc. Repeating this step constructs k dxk a polygonal line which in the limit dx → 0 becomes a curve such that the vector dλ (where λ is a parameter along the curve) tangent to it at any point, when parallely translated to another point on this curve, coincides with the tangent vector there. Such curve is referred to as an autoparallel curve or affine geodesic. Affine geodesics can be attributed with the concept of length, which, for the polygonal curve, is proportional to the number of parallel-transport steps described above. The condition that parallel transport of a tangent vector be a tangent vector is dxi dxi dxi dxk dxi d2xi + δ = − Γ i dxl = M + dλ , (1.2.52) dλ dλ dλ k l dλ dλ dλ2 where the proportionality factor M is some function of λ, or d2xi dxk dxl 1 − M dxi M + Γ i = , (1.2.53) dλ2 k l dλ dλ dλ dλ from which it follows that M must differ from 1 by the order of dλ. In the first term on the left-hand side of (1.2.53) we can therefore put M = 1, and we denote 1 − M by φ(λ)dλ, thereby d2xi dxk dxl dxi + Γ i = φ(λ) . (1.2.54) dλ2 k l dλ dλ dλ If we replace λ by a new variable s(λ) then (1.2.54) becomes d2xi dxk dxl φs0 − s00 dxi + Γ i = , (1.2.55) ds2 k l ds ds s02 ds where the prime denotes differentiation with respect to λ. Requiring φs0 − s00 = 0, which has a general solution s = R λ dλ exp[− R λ φ(x)dx], brings (1.2.55) to
d2xi dxk dxl + Γ i = 0, (1.2.56) ds2 k l ds ds where the scalar variable s is called the affine parameter. The autoparallel equation (1.2.56) is invariant under linear transformations s → as + b since the two lower limits of integration in the expression for s(λ) are arbitrary. We define the four-velocity vector: dxi ui = . (1.2.57) ds This definition brings (1.2.15) to DAk dAk = Ak ui, = Ak ui, (1.2.58) ds ;i ds ,i thereby Dui dui = + Γ i ukul = ui uj = 0. (1.2.59) ds ds k l ;j The relations (1.2.58) can be generalized to any tensor density T : DT dT = T ui, = T ui, (1.2.60) ds ;i ds ,i
dxi dxi The vector ds |Q is a parallel translation of ds |P . Because ds is a scalar, it is invariant under parallel i i transport, ds|Q = ds|P . Therefore, the vector dx |Q is a parallel translation of dx |P , thereby ds measures the length of an infinitesimal section of an affine geodesic.
15 i Only the symmetric part Γ(k l) of the connection enters the autoparallel equation (1.2.56) because dxk dxl of the symmetry of ds ds with respect to the indices k, l; affine geodesics do not depend on torsion. At any point, a coordinate transformation to the geodesic frame (1.2.46) brings all the components i dui Γ(k l) to zero, thereby the autoparallel equation becomes ds = 0. The autoparallel equation is also invariant under a projective transformation
i i i Γk l → Γk l + δkAl, (1.2.61) where Ai is an arbitrary vector. Substituting this transformation to (1.2.59) gives
dui + Γ i ukul = −uiukA . (1.2.62) ds k l k If we replace s by a new variables ˜(s) then (1.2.62) becomes
dU i ukA s˜0 +s ˜00 dxi + Γ i U kU l = − k , (1.2.63) ds˜ k l s˜02 ds˜ where dxi U i = (1.2.64) ds˜ k 0 00 and the prime denotes differentiation with respect to s. Requiring u Aks˜ +s ˜ = 0, which has a R s R s k general solutions ˜ = − ds exp[ Aku (x)dx], brings (1.2.63) to
dU i + Γ i U kU l = 0. (1.2.65) ds˜ k l
1.2.9 Infinitesimal coordinate transformations Let us consider a coordinate transformation
xi → x0i = xi + ξi, (1.2.66)
where ξi = δxi is an infinitesimal vector (a variation of xi). For a tensor or density T define
δT = T 0(x0i) − T (xi), (1.2.67) ¯ 0 i i k δT = T (x ) − T (x ) = δT − ξ T,k. (1.2.68)
For a scalar we find ¯ k δφ = 0, δφ = −ξ φ,k. (1.2.69) For a covariant vector
k ∂x k δAi = Ak − Ai ≈ −ξ Ak, (1.2.70) ∂x0i ,i δA¯ ≈ −ξk A − ξkA . (1.2.71) i ,i k i,k The variation (1.2.70) is not a tensor, but (1.2.71) is:
δA¯ = −ξk A − ξkA − 2Sj ξkA . (1.2.72) i ;i k i;k ik j
¯ i We refer to −δT as the Lie derivative of T along the vector ξ , LξT . For a contravariant vector
∂x0i δBi = Bk − Bi ≈ ξi Bk, (1.2.73) ∂xk ,k δB¯ i ≈ ξi Bk − ξkBi = ξi Bk − ξkBi + 2Si ξkBj. (1.2.74) ,k ,k ;k ;k jk
16 For a scalar density
i ∂x i δs = − 1 s ≈ −ξ s, (1.2.75) ∂x0i ,i ¯ i k i k i δs ≈ −ξ ,is − ξ s,k = −ξ ;is − ξ s;k + 2Siξ s. (1.2.76)
The chain rule for δ infers that, for a tensor density of weight w (which includes tensors as densities of weight 0), we have
ij... i mj... j im... m ij... m ij... δT kl... ≈ ξ ,mT kl... + ξ ,mT kl... + · · · − ξ ,kT ml... − ξ ,lT km... − ... m ij... −wξ ,mT kl..., (1.2.77) ¯ ij... i mj... j im... m ij... m ij... δT kl... ≈ ξ ;mT kl... + ξ ;mT kl... + · · · − ξ ;kT ml... − ξ ;lT km... − ... m ij... m ij... i m nj... j m in... −wξ ;mT kl... − ξ T kl...;m + 2S nmξ T kl... + 2S nmξ T kl... + ... n m ij... n m ij... m ij... −2S kmξ T nl... − 2S lmξ T kn... − · · · + 2wSmξ T kl.... (1.2.78) A Lie derivative of a tensor density of rank (k, l) and weight w is a tensor density of rank (k, l) and weight w. The formula for the covariant derivative of T can be written as
j ˆi T;k = T,k + Γi kCjT, (1.2.79)
where Cˆ is an operator acting on tensor densities:
ˆi ˆi i ˆi k k i ˆi i Cjφ = 0, CjAk = −δkAj, CjB = δj B , Cjs = −δjs, (1.2.80)
or generally
ˆm ij... i mj... j im... m ij... m ij... m ij... Cn T kl... = δnT kl... + δnT kl... + · · · − δk T nl... − δl T kn... − · · · − wδn T kl.... (1.2.81) Such defined operator also enters the formula for δT :
ˆk i δT = Ci T ξ ,k. (1.2.82)
1.2.10 Killing vectors
A covariant vector ζi that satisfies ζ(i;k) = 0 (1.2.83) is referred to as a Killing vector. Along an affine geodesic, D (uiζ ) = uk(uiζ ) = uiukζ + ζ ukui = 0. (1.2.84) ds i i ;k i;k i ;k
The first term in the sum in (1.2.84) vanishes because of the definition of ζi and the second term van- ishes because of the affine geodesic equation. Therefore, to each Killing vector ζi there corresponds i a quantity u ζi which does not change along the affine geodesic:
i u ζi = const. (1.2.85)
References: [1, 2, 3].
17 1.3 Curvature 1.3.1 Curvature tensor We define the commutator [A, B] of two operators A and B as
[A, B] = AB − BA = −[B,A]. (1.3.1)
The commutator of covariant derivatives is thus
[∇i, ∇k] = 2∇[i∇k]. (1.3.2)
The commutator of covariant derivatives of a contravariant vector is a tensor:
i i i l i i l [∇j, ∇k]B = 2∇[j∇k]B = 2∂[j∇k]B − 2Γ[k j]∇lB + 2Γl [j∇k]B i m l i i l i l m = 2∂[j(Γ|m| k]B ) + 2S jk∇lB + 2Γl [j∂k]B + 2Γl [jΓ|m| k]B i i l m l i i m l i = 2(∂[jΓ|m| k] + Γl [jΓ|m| k])B + 2S jk∇lB = R mjkB + 2S jk∇lB , (1.3.3) where || embraces indices which are excluded from symmetrization or antisymmetrization. Therefore, i R mjk, defined as i i i l i l i R mjk = ∂jΓm k − ∂kΓm j + Γm kΓl j − Γm jΓl k, (1.3.4) i is a tensor, referred to as the curvature tensor. The curvature tensor R mjk is antisymmetric in the indices j, k and has generally 96 independent components. The commutator of covariant derivatives of a covariant vector is m l [∇j, ∇k]Ai = −R ijkAm + 2S jk∇lAi, (1.3.5) and the commutator of covariant derivatives of a tensor is
in... i mn... n im... m in... m in... [∇j, ∇k]T lp... = R mjkT lp... + R mjkT lp... + · · · − R ljkT mp... − R pjkT lm... l in... − · · · + 2S jk∇lT lp.... (1.3.6)
A change in the connection, ˜ i i i Γj k = Γj k + T jk, (1.3.7) i where T jk is a tensor, results in the following change of the curvature tensor:
˜i ˜ i ˜ i ˜ j ˜ i ˜ j ˜ i i i j i j i R klm = Γk m,l − Γk l,m + Γk mΓj l − Γk lΓj m = Γk m,l − Γk l,m + Γk mΓj l − Γk lΓj m i i j i j i i j i j j i j i +T km,l − T kl,m + Γk mT jl − Γk lT jm + Γj lT km − Γj mT kl + T kmT jl − T klT jm i i i j i j i = R klm + T km;l − T kl;m + T kmT jl − T klT jm. (1.3.8)
i i For a projective transformation (1.2.61), T jk = δjAk, the curvature tensor changes according to
˜i i i R klm = R klm + δk(Am;l − Al;m). (1.3.9)
The variation of the curvature tensor is
i i i i j i j i j i j δR klm = (δΓk m),l − (δΓk l),m + δΓj lΓk m + Γj lδΓk m − δΓj mΓk l − Γj mδΓk l i i j j i j i i i j j i = (δΓk m);l − Γj lδΓk m + Γk lδΓj m + Γm lδΓk j − (δΓk l);m + Γj mδΓk l − Γk mδΓj l j i i j i j i j i j −Γl mδΓk j + δΓj lΓk m + Γj lδΓk m − δΓj mΓk l − Γj mδΓk l i i n i = (δΓk m);l − (δΓk l);m − 2S lmδΓk n. (1.3.10)
18 1.3.2 Integrability of connection The affine connection is integrable if parallel transport of a vector from point P to point Q is inde- pendent of a path along which this vector is parallelly translated, or equivalently, parallel transport of a vector around a closed curve does not change this vector. For an integrable connection, we can uniquely translate parallelly a given vector hi at point P to all points in spacetime:
δhi = dhi, (1.3.11)
or i i j h ,k = −Γj kh . (1.3.12) Therefore, we have
i j i j i j i j m i j i j m i j (Γj kh ),l − (Γj lh ),k = Γj k,lh − Γj kΓm lh − Γj l,kh + Γj lΓm kh = R jlkh = 0, (1.3.13) so, because hi is arbitrary, i R klm = 0. (1.3.14) i Spacetime with a vanishing curvature tensor R klm = 0 is flat. Let us consider 4 linearly independent i i vectors ha, where a is 1,2,3,4, and vectors inverse to ha:
X i i hahka = δk. (1.3.15) a
If the affine connection is integrable then (1.3.12) becomes
i i l ha,k = −Γl kha. (1.3.16)
Multiplying (1.3.16) by hja gives
i i i Γj k = −hjaha,k = hja,kha. (1.3.17)
An integrable connection has thus 16 independent components. If the connection is also symmetric, i S jk = 0, then hja,k − hka,j = 0, (1.3.18) which is the condition for the independence of the coordinates
Z Q i ya = hiadx (1.3.19) P
of the path of integration PQ. Adopting ya as the new coordinates (with point P = (0, 0, 0, 0) in the center) gives i ∂ya ∂x i i = hia, = ha, (1.3.20) ∂x ∂ya so (1.3.17) becomes i 2 i i ∂x ∂ ya Γj k(x ) = k j . (1.3.21) ∂ya ∂x ∂x 0j The transformation law for the connection (1.2.5) gives (with ya corresponding to x )
i Γj k(ya) = 0. (1.3.22)
A torsionless integrable connection can be thus transformed to zero; one can always find a system of coordinates which is geodesic everywhere. If a connection is symmetric but nonintegrable then a geodesic frame of reference can be constructed only at a given point (or along a given world line).
19 1.3.3 Parallel transport along closed curve Let us consider parallel transport of a covariant vector around an infinitesimal closed curve. Such a transport changes this vector, according to Stokes’ theorem (1.1.37) by
I I 1 Z ∂(Γ i A ) ∂(Γ i A ) ∆A = δA = Γ i A dxl = k m i − k l i df lm. (1.3.23) k k k l i 2 ∂xl ∂xm
i l i Along the curve, we have dAk = Γk lAidx , which gives Ak,l = Γk lAi. The last relation is approxi- mately valid, to terms of first order in ∆f lm = R df lm, inside this curve:
1 Z ∂Γ i ∂Γ i 1 ∆A ≈ k m − k l A + (Γ i Γ n − Γ i Γ n )A df lm ≈ Ri A ∆f lm. (1.3.24) k 2 ∂xl ∂xm i k m i l k l i m n 2 klm i
The change of a contravariant vector in parallel transport around an infinitesimal closed curve results k from ∆(AkB ) = 0: 1 ∆Bk ≈ − Rk Bi∆f lm, (1.3.25) 2 ilm and the corresponding change of a tensor results from the chain rule for parallel transport: 1 ∆T ik... ≈ − (Ri T jk... +Rk T ij... +· · ·−Rj T ik... −Rj T ik... −... )∆f lm. (1.3.26) np... 2 jlm np... jlm np... nlm jp... plm nj...
1.3.4 Bianchi identities Let us consider 1 ∇ ∇ ∇ Bi = ∇ (Ri Bm) + ∇ (Sm ∇ Bi) (1.3.27) j [k l] 2 j mkl j kl m and 1 1 1 ∇ ∇ ∇ Bi = − Rm ∇ Bi + Ri ∇ Bm + Sm ∇ ∇ Bi = − Rm ∇ Bi [j k] l 2 ljk m 2 mjk l jk m l 2 ljk m 1 + Ri ∇ Bm + Sm ∇ ∇ Bi + Sm Ri Bn + 2Sm Sn ∇ Bi. (1.3.28) 2 mjk l jk l m jk nml jk ml n Total antisymmetrization of the indices j, k, l in (1.3.27) and (1.3.28) gives 1 1 ∇ ∇ ∇ Bi = ∇ Ri Bm + Ri ∇ Bm + ∇ Sm ∇ Bi + Sm ∇ ∇ Bi (1.3.29) [j k l] 2 [j |m|kl] 2 m[kl] j] [j kl] m kl j] m and 1 1 ∇ ∇ ∇ Bi = − Rm ∇ Bi + Ri ∇ Bm + Sm ∇ ∇ Bi [j k l] 2 [ljk] m 2 m[jk l] [jk l] m m i n m n i +S [jkR |nm|l]B + 2S [jkS |m|l]∇nB , (1.3.30) so 1 1 ∇ Ri Bm + ∇ Sm ∇ Bi = − Rm ∇ Bi + Sm Ri Bn 2 [j |m|kl] [j kl] m 2 [ljk] m [jk |nm|l] m n i +2S [jkS |m|l]∇nB . (1.3.31)
Comparing terms in (1.3.31) with Bi gives the second Bianchi identity or the Bianchi identity:
i i m R n[jk;l] = 2R nm[jS kl], (1.3.32)
i while comparing terms with ∇kB gives the first Bianchi identity or the Ricci cyclic identity:
m m m n R [jkl] = −2S [jk;l] + 4S n[jS kl]. (1.3.33)
20 Contracting (1.3.32) and (1.3.33) with respect to one contravariant and one covariant index gives
i i m R n[ik;l] = 2R nm[iS kl], (1.3.34) k k k n R [jkl] = −2S [jk;l] + 4S n[jS kl]. (1.3.35)
i For a symmetric connection, S jk = 0, the Bianchi identity and the cyclic identity reduce to
i R n[jk;l] = 0, (1.3.36) m R [jkl] = 0. (1.3.37) The cyclic identity (1.3.37) imposes 16 constraints on the curvature tensor, thereby the curvature tensor with a vanishing torsion has 80 independent components.
1.3.5 Ricci tensor Contraction of the curvature tensor with respect to the contravariant index and the second covariant index gives the Ricci tensor:
j j j l j l j Rik = R ijk = Γi k,j − Γi j,k + Γi kΓl j − Γi jΓl k. (1.3.38) Contraction of the curvature tensor with respect to the contravariant index and the third covariant index gives the Ricci tensor with the opposite sign because of the antisymmetry of the curvature tensor with respect to its last indices. Contraction of the curvature tensor with respect to the contravariant index and the first covariant index gives the homothetic or segmental curvature tensor:
j j j Qik = R jik = Γj k,i − Γj i,k, (1.3.39) which is a curl. A change in the connection (1.3.7) results in the following changes of the Ricci tensor and segmental curvature tensor:
l l j l j l Rik → Rik + T ik;l − T il;k + T ikT jl − T ilT jk, (1.3.40) j j Qik → Qik + T jk,i − T ji,k. (1.3.41) For a projective transformation (1.2.61)
Rik → Rik + Ak;i − Ai;k, (1.3.42)
Qik → Qik + 4(Ak,i − Ai,k). (1.3.43) Therefore, the symmetric part of the Ricci tensor is invariant under projective transformations. The variation of the Ricci tensor is
l l j l δRik = (δΓi k);l − (δΓi l);k − 2S lkδΓi j, (1.3.44) while the variation of the segmental curvature tensor is
j j δQik = (δΓj k),i − (δΓj i),k. (1.3.45)
1.3.6 Geodesic deviation Let us consider a family of affine geodesics characterized by the affine parameter s, measured along each curve from its point of intersection with a given hypersurface, and distinguished by a scalar parameter t: xi = xi(s, t). We define ∂xi vi = , (1.3.46) ∂t which gives dui dvi vi uk − ui vk = vi uk − ui vk − 2Si ukvl = − − 2Si ukvl = −2Si ukvl, (1.3.47) ;k ;k ,k ,k kl dt ds kl kl
21 i ∂xi where u = ∂s is the four-velocity along each curve. We therefore have D2vi = (vi uj) uk = (ui vj) uk − 2(Si ukvl) uj ds2 ;j ;k ;j ;k kl ;j i j k i j k i k l j = u ;jkv u + u ;jv ;ku − 2(S klu v );ju i j k i l j k l i j k i j k i k l j = u ;kjv u − R ljku v u − 2S jku ;lv u + u ;jv ;ku − 2(S klu v );ju i j k i l j k l i j k i j k j k l = u ;kjv u − R ljku v u − 2S jku ;lv u + u ;j(u ;kv − 2S klu v ) i k l j i k j i j k l i k l j −2(S klu v );ju = (u ;ku );jv + R jklu u v − 2(S klu v );ju D = Ri ujukvl − 2 (Si ukvl), (1.3.48) jkl ds kl which can be written as D Dvi + 2Si ukvl = Ri ujukvl. (1.3.49) ds ds kl jkl This is the equation of geodesic deviation. If we replace affine geodesics by arbitrary curves then i k u ;ku 6= 0 and (1.3.49) becomes D Dvi + 2Si ukvl = Ri ujukvl + (ui uk) vj. (1.3.50) ds ds kl jkl ;k ;j The separation vector ξi = vidt (1.3.51) connects points on two infinitely close affine geodesics with t and t + dt for the same s. Multiplying (1.3.48) by dt gives another form of the equation of geodesic deviation,
D2ξi D = Ri ujukξl − 2 (Si ukξl). (1.3.52) ds2 jkl ds kl
References: [1, 2, 3, 4].
1.4 Metric 1.4.1 Metric tensor The affine parameter s is a measure of the length only along an affine geodesic. In order to extend the concept of length to all points in spacetime, we equip spacetime with an algebraic object gik, referred to as the covariant metric tensor and defined as
2 i k ds = gikdx dx . (1.4.1) The quantity ds in (1.4.1) is called the line element. The metric tensor is a symmetric tensor of rank (0,2): gik = gki. (1.4.2) The affine parameter s, whose differential is given by (1.4.1), is referred to as the interval. Because ds does not change under parallel transport along an affine geodesic from point P (xi) to point i i i i j Q(x + dx ), ds|Q = ds|P , and dx |Q is a parallel translation of dx |P , gik|Q = gik|P + gik,jdx is a parallel translation of gik|P : gik|Q = gik|P + δgik, (1.4.3) so j j Dgik = gik;jdx = dgik − δgik = gik,jdx − δgik = 0. (1.4.4) Therefore, the covariant derivative of the covariant metric tensor vanishes:
gik;j = 0. (1.4.5)
22 This relation is equivalent to l l gik,j − Γi jglk − Γk jgil = 0. (1.4.6) ik ki The symmetric contravariant metric tensor g = g is defined as the tensor inverse to gik:
ik k gijg = δj . (1.4.7)
Since the contravariant metric tensor is a function of the covariant metric tensor only, its covariant derivative also vanishes: ik g ;j = 0. (1.4.8) The metric tensor allows to associate covariant and contravariant vectors:
i ik A = g Ak, (1.4.9) k Bi = gikB , (1.4.10)
because such association works for the covariant differentials of these vectors which are vectors:
i ik ik k k DA = D(g Ak) = g DAi,DBi = D(gikB ) = gikDB (1.4.11)
(raising and lowering of coordinate indices commutes with covariant differentiation with respect to ρ Γµ ν ). For covariant and contravariant indices of tensors and densities this association is
ij... j... gimT kl... = Tm kl..., (1.4.12) km ij... ijm... g T kl... = T l.... (1.4.13) The contravariant and covariant components of a two-dimensional vector are shown in Figure 1. The four-velocity vector (1.2.57) is normalized because of (1.4.1):
Figure 1: Contravariant and covariant components of a vector.
g dxidxk uiu = g uiuk = ik = 1. (1.4.14) i ik ds2 This vector thus has 3 independent components. Let us consider the determinant of the matrix composed from the components of the covariant metric tensor gik, g = |gik|. (1.4.15) The square root of the absolute value of this determinant, p|g|, is a scalar density of weight 1. We can use it to construct from the Levi-Civita symbols a quantity which behaves like a tensor with respect to continuous coordinate transformations: p eiklm = |g|εiklm, (1.4.16)
iklm 1 iklm in kp lq mr e = = g g g g enpqr. (1.4.17) p|g|
23 If we change the sign of 1 or 3 of the coordinates, then the components of eiklm do not change iklm because and εiklm have the same components in all coordinate systems, whereas some of the components of a tensor change sign. The components (1.4.16) and (1.4.17) are thus referred to as those of the completely antisymmetric unit pseudotensor. The relations (1.1.30) are also valid if we replace and ε by e. The differential and derivatives of the determinant of the metric tensor are given, following (1.2.23) and (1.2.24), by
ik ik dg = gg dgik = −ggikdg , (1.4.18) ik ik g,l = gg gik,l = −ggikg ,l. (1.4.19) The variation of the determinant of the metric tensor is thus
ik ik δg = gg δgik = −ggikδg . (1.4.20)
The covariant derivative of the determinant of the metric tensor vanishes:
g;j = 0. (1.4.21)
The relations (1.2.36) and (1.2.37) give thus
ijkl e ;m = 0, eijkl;m = 0. (1.4.22) A Lie derivative of the metric tensor is
ik (i;k) (ik) l Lξg = −2ξ − 4S lξ , (1.4.23)
;i ik where =;k g . The covariant derivative of the covariant metric tensor defines the nonmetricity tensor:
Njik = −gik;j. (1.4.24)
The commutator of covariant derivatives (1.3.6) of the metric tensor gives
(ij) ij m ij ij R kl = −N[k ;l] − S klNm = −N[k ,l], (1.4.25)
so the segmental curvature tensor (1.3.39) is
ij Qkl = −N[k ,l]gij. (1.4.26)
The nonmetricity tensor vanishes because of (1.4.5). Consequently, the curvature tensor is antisym- metric in its first two indices: Rijkl = −Rjikl. (1.4.27) Therefore, the segmental curvature tensor also vanishes, and
jl Rijklg = Rik. (1.4.28)
Consequently, there is only one independent way to contract the curvature tensor, which gives the Ricci tensor up to a sign.
1.4.2 Christoffel symbols The condition (1.4.5) is referred to as metricity or metric compatibility of the affine connection, and imposes 40 constraints on the connection:
l l l l l gik;j + gkj;i − gji;k = gik,j − Γi jglk − Γk jgil + gkj,i − Γk iglj − Γj igkl − gji,k + Γj kgli l l l l +Γi kgjl = gik,j + gkj,i − gji,k − 2Γ(i j)gkl − 2S kjgil − 2S kigjl = 0. (1.4.29)
24 Multiplying (1.4.29) by gkm gives
m m m Γ(i j) = {i j } + 2S(ij) , (1.4.30) where 1 { m} = gmk(g + g − g ) (1.4.31) i j 2 kj,i ki,j ij,k are the Christoffel symbols. Using (1.4.7), they can be written as 1 { m} = − (g gmk + g gmk − gmkg g gln ). (1.4.32) i j 2 kj ,i ki ,j il jn ,k The Christoffel symbols are symmetric in their covariant indices:
k k {i j} = {j i}. (1.4.33)
k k k Because Γi j = Γ(i j) + S ij, the metric-compatible affine connection equals
k k k Γi j = {i j} + C ij, (1.4.34) where k k k C ij = 2S(ij) + S ij (1.4.35) is the contortion tensor, antisymmetric in its first two indices:
Cijk = −Cjik. (1.4.36) The inverse relation between the torsion and contortion tensor is
i i S jk = C [jk]. (1.4.37) The Christoffel symbols are the torsionless part of the connection. The difference between two affine connections is a tensor, thereby the sum of a connection and a tensor of rank (1,2) is a connection. Therefore, the Christoffel symbols form a connection, referred to as the Levi-Civita connection. We define the covariant derivative with respect to the Levi-Civita k k {} connection analogously to (1.2.11), with Γi j replaced by {i j}, and denote it :i instead of ;i, or ∇i instead of ∇i. The covariant derivative with respect to the Levi-Civita connection of the metric tensor vanishes, as for that with respect to any connection:
l l gik:j = gik,j − {i j}glk − {k j}gil = 0. (1.4.38) This equation agrees with (1.4.31) and gives the relation between ordinary derivatives of the metric tensor and the Christoffel symbols:
l l gik,j = {i j}glk + {k j}gil. (1.4.39) Similarly, we have ik ik i lk k il g :j = g ,j + {l j}g + {l j}g = 0. (1.4.40) The variation of the Levi-Civita connection is, as for any connection, a tensor: 1 1 δ{ k } = gkl (δg ) + (δg ) − (δg ) + δgkl(g + g − g ) i j 2 lj ,i li ,j ij ,l 2 lj,i li,j ij,l 1 1 = gkl (δg ) + (δg ) − (δg ) + gkl({ m}δg + { m}δg + { m}δg + { m}δg 2 lj :i li :j ij :l 2 l i mj j i lm l j mi i j lm 1 −{ m}δg − { m}δg ) + δgkl{ m}g = gkl (δg ) + (δg ) − (δg ) i l mj j l im i j lm 2 lj :i li :j ij :l 1 +gkl{ m}δg + δgkl{ m}g = gkl (δg ) + (δg ) − (δg ) + { m}δδk i j lm i j lm 2 lj :i li :j ij :l i j m 1 = gkl (δg ) + (δg ) − (δg ) , (1.4.41) 2 lj :i li :j ij :l
25 where we used (1.4.7). The covariant derivative over s of a tensor density with respect to the Levi-Civita connection is, analogously to (1.2.60),
D{}T = T ui. (1.4.42) ds :i The following formulae are satisfied: 1 1 1 g, i { k } = gjkg = − g gjk = = (lnp|g|) , (1.4.43) k i 2 jk,i 2 jk ,i 2 g ,i
k ij 1 p ik { }g = − ( |g|g ),i, (1.4.44) i j p|g|
i 1 p i B = ( |g|B ),i, (1.4.45) :i p|g|
ik 1 p ik F = ( |g|F ),i, (1.4.46) :i p|g|
Ai:k − Ak:i = Ai,k − Ak,i, (1.4.47) I Z ip i p B |g|dSi = B :i |g|dΩ, (1.4.48)
ik ki k where F = −F . The Christoffel symbols satisfy all formulae that are satisfied by Γi j in which i S jk = 0. Because the Levi-Civita connection is a symmetric connection, it can be brought to zero by transforming the coordinates to a geodesic frame of reference. In a geodesic frame, the covariant {} derivative with respect to the Levi-Civita connection, ∇i , coincides with the ordinary derivative ∂i. Since the covariant derivatives of the Levi-Civita symbols are equal to zero, according to (1.2.36) and (1.2.37), their covariant derivatives with respect to the Levi-Civita connection vanish:
ijkl :m = 0, εijkl:m = 0. (1.4.49) The following covariant derivatives with respect to the Levi-Civita connection also vanish:
ijkl g:j = 0, e :m = 0, eijkl:m = 0. (1.4.50)
The Lie derivative of the metric tensor (1.4.23) along a vector ξi can be written as
ik (i:k) Lξg = −2ξ , Lξgik = 2ξ(i:k), (1.4.51)
:i ik where =:k g . A Killing vector (1.2.83) for the Levi-Civita connection satisfies
ζ(i:k) = 0. (1.4.52)
It thus becomes a generator of a transformation
x0i = xi + ζi, (1.4.53) where is an infinitesimal scalar, which coincides with (1.2.66) for
ξi = ζi. (1.4.54)
Such transformations are isometries: they do not change the metric tensor. If the nonmetricity tensor does not vanish, the general formula for the affine connection (1.4.34) is 1 Γ k = { k } + Ck − N k + N k . (1.4.55) i j i j ij 2 ij (i j)
26 1.4.3 Riemann tensor The commutator of covariant derivatives with respect to the Levi-Civita connection of a covariant vector is {} {} m [∇j , ∇k ]Ai = −P ijkAm, (1.4.56) analogously to (1.3.5) and without the torsion tensor of this connection that vanishes. The curvature tensor constructed from the Levi-Civita connection is referred to as the Riemannian curvature tensor or the Riemann tensor:
i i i i l i l P mjk = ∂j{m k} − ∂k{m j} + {l j}{m k} − {l k}{m j}. (1.4.57) Similarly, the commutators of covariant derivatives of a contravariant vector and of a tensor are i i i respectively given by (1.3.3) and (1.3.6), in which R jkl is replaced with P jkl and S jk = 0. The commutator of covariant derivatives of the metric tensor vanishes:
{} {} m m [∇j , ∇k ]glp = −P ljkgmp − P pjkglm = 0, (1.4.58) so the covariant Riemann tensor Pimjk is also antisymmetric in the indices i, m. Substituting (1.4.31) in (1.4.57) gives 1 P = (g + g − g − g ) + g ({ j }{ n } − { j }{ n }), (1.4.59) iklm 2 im,kl kl,im il,km km,il jn i m k l i l k m which explicitly shows the following symmetry and antisymmetry properties:
Piklm = −Pikml, (1.4.60)
Piklm = −Pkilm, (1.4.61)
Piklm = Plmik. (1.4.62)
Accordingly, the Riemannian Ricci tensor is symmetric:
j Pik = P ijk = Pki. (1.4.63) Substituting (1.4.34) in (1.3.7) and (1.3.8) gives the relation between the curvature and Riemann tensors: i i i i j i j i R klm = P klm + C km:l − C kl:m + C kmC jl − C klC jm. (1.4.64) Contracting (1.4.64) with respect to in the indices i, l gives
i i j i j i Rkm = Pkm + C km:i − C ki:m + C kmC ji − C kiC jm. (1.4.65) Consequently, the Ricci scalar or the curvature scalar,
ik R = Rikg , (1.4.66)
is given by ik l j l l m R = P − g (2C il:k + C ijC kl − C imC kl), (1.4.67) where P is the Riemannian curvature scalar or the Riemann scalar:
ik P = Pikg . (1.4.68)
The variation of the Riemann tensor is, analogously to (1.3.10),
i i i δP klm = (δ{k m}):l − (δ{k l}):m, (1.4.69) and the variation of the Riemannian Ricci tensor is
l l δPik = (δ{i k}):l − (δ{i l}):k. (1.4.70)
27 Contracting (1.3.34) and (1.3.35) with the metric tensor gives
i m i m i m Rnk;l − Rnl;k + R nkl;i = −2RnmS kl − 2R nmkS il + 2R nmlS ik (1.4.71) and the contracted cyclic identity:
k n Rjl − Rlj = −2Sj;l + 2Sl;j − 2S lj;k + 4SnS lj. (1.4.72) Further contraction of (1.4.71) with the metric tensor gives the contracted Bianchi identity: 1 Ri − R = 2R Smk − Rik Sm . (1.4.73) l;i 2 ;l km l ml ik The Bianchi identity (1.3.36) and the cyclic identity (1.3.37) for the Riemann tensor are
i P n[jk:l] = 0, (1.4.74) m P [jkl] = 0. (1.4.75) Contracting these equations with the metric tensor gives
i Pnk:l + P nkl:i − Pnl:k = 0, (1.4.76)
Pjl − Plj = 0, (1.4.77) in agreement with (1.4.63). Further contraction of (1.4.76) with the metric tensor gives the con- tracted Bianchi identity: i G k:i = 0, (1.4.78) for the symmetric Einstein tensor, defined as 1 G = P − P g = G . (1.4.79) ik ik 2 ik ki This identity is a covariant conservation of the Einstein tensor.
1.4.4 Properties of Riemann tensor
In two dimensions there is only 1 independent component of the Riemann tensor, P1212. The Riemann scalar is 2P P = 1212 , (1.4.80) s
where s is the determinant of the two-dimensional metric tensor γik:
2 s = |γik| = γ11γ22 − γ12. (1.4.81) A surface near point x = 0, y = 0 is given by
x2 y2 z = + , (1.4.82) 2ρ1 2ρ2
where ρ1 and ρ2 are the radii of curvature. Substituting (1.4.82) to
2 2 2 2 i k dl = dx + dy + dz = γikdx dx (1.4.83)
gives γik(x, y), which then gives
P 1 = K = , (1.4.84) 2 x=y=0 ρ1ρ2 where K is the Gauß curvature. In three dimensions there are 3 independent pairs, 12, 23, and 31, thereby the Riemann tensor 3·2 has 6 independent components: 3 with identical pairs and 2 = 3 with different pairs (the cyclic
28 identity does not reduce the number of independent components). The Ricci tensor has also 6 components, which are related to the components of the Riemann tensor by P P = P γ − P γ + P γ − P γ + (γ γ − γ γ ). (1.4.85) αβγδ αγ βδ αδ βγ βδ αγ βγ αδ 2 αδ βγ αγ βδ Choosing the Cartesian coordinates at a given point, defined by the condition
gαβ = diag(1, 1, 1), (1.4.86)
and diagonalizing Pαβ, which is equivalent to 3 rotations, brings Pαβ to the canonical form with 6 − 3 = 3 independent components. Consequently, the Riemann tensor in three dimensions has 3 physically independent components. The Gauß curvature of a surface perpendicular to the x3 axis is given by P1212 K = 2 . (1.4.87) γ11γ22 − γ12 In four dimensions there are 6 independent pairs, 01, 02, 03, 12, 23, and 31, thereby there are 6·5 6 components with identical pairs and 2 = 15 with different pairs. The cyclic identity reduces the number of independent components by 1, thereby the Riemann tensor in four dimensions has generally 20 independent components. Choosing the Cartesian coordinates at a given point and applying 6 rotations brings Pijkl to the canonical form with 20 − 6 = 14 physically independent components. The Weyl tensor is defined as 1 1 W = P − (P g + P g − P g − P g ) + P (g g − g g ). (1.4.88) iklm iklm 2 il km km il im kl kl im 6 il km im kl This tensor has all the symmetry and antisymmetry properties of the Riemann tensor, and is also traceless (any contraction of the Weyl tensor vanishes).
1.4.5 Metric geodesics Let us consider two points in spacetime, P and Q. Among curves that connect these points, one curve has the minimal value of the interval s = R ds, and is referred to as a metric geodesic. The equation of a metric geodesic is given by the condition that R ds be an extremum with the endpoints of the curve fixed: Z Z Z δdxig dxj 1 Z δg dxidxj Z δ ds = δ (g dxidxk)1/2 = ij + ij = g ujδdxi ik ds 2 ds ij 1 Z Z Z 1 Z + g δxkuiujds = d(u δxi) − du δxi + g δxkuiujds 2 ij,k i i 2 ij,k Z du 1 Z = − i δxids + g δxiujukds = 0, (1.4.89) ds 2 jk,i
R i i i where we omit the total differential term d(uiδx ) because δx = 0 at the endpoints. Since δx is arbitrary, we obtain
d 1 Z duj 1 Z (g uj) − g ujukds = g + ukg uj − g ujukds ds ij 2 jk,i ij ds ij,k 2 jk,i duj = g + { m}g ujuk = 0 (1.4.90) ij ds j k im or, after multiplying (1.4.90) by gil:
D{}ul dul = + { l }ujuk = uiul = 0. (1.4.91) ds ds j k :i
29 The metric geodesic equation (1.4.91) can be written as
d2xi dxk dxl + { i } = 0. (1.4.92) ds2 k l ds ds Using (1.4.34) and (1.4.35), the affine geodesic equation (1.2.56) can be written as
d2xi dxk dxl dxk dxl + { i } + 2S i = 0. (1.4.93) ds2 k l ds ds kl ds ds If the torsion tensor is completely antisymmetric then the last term in (1.4.93) vanishes and the affine geodesic equation coincides with the metric geodesic equation. The equation of geodesic deviation with respect to the Levi-Civita connection is, analogously to (1.3.49),
D{}2vi = P i ujukvl. (1.4.94) ds2 jkl
If ζi is a Killing vector of the Levi-Civita connection then along a metric geodesic,
D{} (uiζ ) = uk(uiζ ) = uiukζ + ζ ukui = 0. (1.4.95) ds i i :k i:k i :k The first term in the sum in (1.4.95) vanishes because of (1.4.52) and the second term vanishes because of the metric geodesic equation. Therefore, to each Killing vector of the Levi-Civita connec- i tion there corresponds a quantity u ζi which does not change along the metric geodesic, analogously to (1.2.85): i i k u ζi = giku ζ = const. (1.4.96)
1.4.6 Galilean frame of reference and Minkowski tensor At a given point, the nondegenerate (g 6= 0) metric tensor can be brought to a diagonal (canonical) form gik = diag(±1, ±1, ±1, ±1). Physical systems are described by the metric tensor with g < 0. Without loss of generality, we assume that the canonical form of the metric tensor is
ik ik gik = ηik = diag(1, −1, −1, −1), g = η = diag(1, −1, −1, −1). (1.4.97)
A frame of reference in which gik has the canonical form is referred to as Galilean. The transformation (1.2.46) with (1.2.49) brings a symmetric affine connection, thus the Christoffel symbols, to zero at a given point without changing the components of the metric tensor because of (1.2.47). Therefore, a frame of reference can be locally both geodesic and Galilean. Such a frame is called inertial. In this frame, the first derivatives of the metric tensor vanish because of (1.4.39). The corresponding metric tensor (1.4.97) is referred to as the Minkowski tensor. The square of the line element for this metric is ds2 = c2dt2 − dx2 − dy2 − dz2. (1.4.98) In a locally inertial frame the coordinates xi, not only the differentials dxi, are components of a contravariant vector. i In the absence of torsion, spacetime with a vanishing Riemann tensor P klm = 0 is flat. In the new coordinates ya (1.3.19), (1.3.20) gives
i k ab ∂x ∂x ia kb ab g (y) = gik(x) = gik(x)h h = η . (1.4.99) ∂ya ∂yb Therefore, in a flat spacetime without torsion one can always find a system of coordinates which is Galilean everywhere.
30 1.4.7 Riemann normal coordinates If the frame of reference is locally geodesic and Galilean at a given point, taken as the origin of the coordinates, then the metric tensor at a point near the origin depends on the derivatives of the metric at the origin. In this frame, the Christoffel symbols at the origin vanish. We expand the metric tensor up to quadratic terms: 1 1 g (xk) = g (0) + g (0)xk + g (0)xkxl = η + g (0)xkxl, (1.4.100) ij ij ij,k 2 ij,kl ij 2 ij,kl where the metric tensor at the origin is equal to the Minkowski tensor and the first derivatives of the metric tensor at the origin vanish because of (1.4.39). We choose the coordinates such that
xi = ais (1.4.101)
for every metric geodesic curve passing through the origin and parametrized with the interval s, where ai is a constant four-vector and s = 0 at the origin. Such coordinates are referred to as the Riemann normal coordinates. Accordingly, the derivatives of xi with respect to s are
dxi d2xi d3xi (0) = ai, (0) = (0) = 0. (1.4.102) ds ds2 ds3 Consequently, the metric geodesic equation (1.4.92) gives
i j k {j k}(0)a a = 0, (1.4.103)
therefore the condition for the geodesic frame of reference (1.2.50) is satisfied:
i {j k}(0) = 0. (1.4.104)
Differentiating (1.4.92) with respect to s gives
d3xi d{ i } dxj dxk d2xj dxk + j k + 2{ i } = 0. (1.4.105) ds3 ds ds ds j k ds2 ds At the origin, the relations (1.4.102) reduce this equation to
d{ i } dxj dxk dxl dxj dxk j k = { i } = { i } (0)alajak = 0. (1.4.106) ds ds ds j k ,l ds ds ds j k ,l Therefore, the Christoffel symbols satisfy
i {(j k},l)(0) = 0. (1.4.107) In the geodesic frame of reference, the Riemann tensor (1.4.57) reduces to
i i i P jkl = {j l},k − {j k},l. (1.4.108)
Consequently, using (1.4.107) gives
i i i i i i i P jkl + P kjl = {j l},k − {j k},l + {k l},j − {k j},l = −3{j k},l, (1.4.109) which gives 1 { i } (0) = − (P i + P i )(0). (1.4.110) j k ,l 3 jkl kjl Differentiating (1.4.39) with respect to the coordinates and using vanishing of the first derivatives of the metric tensor at the origin gives
m m gij,kl = {k j },lgmi + {k i },lgmj. (1.4.111)
31 Substituting (1.4.110) into this equation gives 1 1 1 g (0) = − (P + P + P + P ) = − (P − P + P ) = − (P − P ) ij,kl 3 ikjl ijkl jkil jikl 3 ikjl kijl jikl 3 ikjl kjil 1 = − (P + P ). (1.4.112) 3 ikjl iljk Consequently, the covariant metric tensor (1.4.100) in the Riemann normal coordinates at a point near the origin, in quadratic approximation, is given by 1 1 g (xk) = η − (P + P )(0)xkxl = η − P (0)xkxl. (1.4.113) ij ij 6 ikjl iljk ij 3 ikjl The deviation of the metric tensor from the Minkowski tensor is proportional to the curvature. The corresponding contravariant metric tensor is given by 1 gij(xk) = ηij + P i j (0)xkxl. (1.4.114) 3 k l Similar calculations lead to the expansion of the covariant metric tensor in quartic approximation: 1 1 1 2 g (xk) = η − P (0)xkxl − P (0)xkxlxm − P − P pP (0)xkxlxmxn. ij ij 3 ikjl 6 ikjl:m 20 ikjl:mn 45 ikl jmnp (1.4.115)
1.4.8 Intervals, proper time, and distances The form of the Minkowski tensor distinguishes the coordinate x0 from the rest of the coordinates xα, where the index α can be 1,2,3. The temporal coordinate x0 can be written as x0 = ct, where t is referred to as time and c is called the speed of propagation of interaction. The coordinates xα are spatial and span space. The set of 4 coordinates xi describe an event and span spacetime. The curve xi(λ), where λ is a parameter, is referred to as a world line of a given point. The quantitites dxα vα = (1.4.116) dt are the components of a three-dimensional vector, the velocity of this point. An infinitesimal interval ds is timelike if ds2 > 0, spacelike if ds2 < 0, and null if ds2 = 0. In a Galilean frame of reference, the spatial coordinates are Cartesian (1.4.86). In this frame, the square of the line element (interval) between two infinitesimally separated points (events) is
2 i k 2 2 X α α ds = ηikdx dx = c dt − dx dx , (1.4.117) α where dxi are infinitesimal coordinate differences between the two points. The square of the interval between two finitely separated points is
2 i k 2 2 X α α ∆s = ηik∆x ∆x = c ∆t − ∆x ∆x , (1.4.118) α where ∆xi are finite coordinate differences between the two points. If ∆s is timelike, one can always find a frame of reference in which the two events occur at the same place, ∆xα = 0. A frame of reference in which dxα = 0, thereby vα = 0, describes a point at rest and is referred to as the rest frame or the comoving frame. In this frame t = τ,
ds2 = c2dτ 2, (1.4.119) where τ is the proper time. If dxα 6= 0, thereby vα 6= 0, along a world line then the point moves or is in motion. The proper time for a moving point is equal to the time measured by a clock moving with this point. If ∆s is spacelike, one can always find a frame of reference in which the two events
32 occur at the same time (are synchronous), ∆x0 = 0. If ds = 0 along a world line, this world line P α α 1/2 describes the propagation of a signal (interaction), with ( α v v ) = c. Equations (1.4.117) and (1.4.119) give 1 X dτ 2 = dt2 − dxαdxα, (1.4.120) c2 α so the proper time τ goes more slowly than the coordinate time t. If ∆s is timelike, the two events occur at different times: t1 6= t2. If t2 > t1 then t2 is in the future with respect to t1 and t1 is in the past with respect to t2. The time of a measurement t0 is called the present time. All events for which t < t0 form the absolute past relative to the event O at the present (events in this region occur before O in all systems of reference). All events for which t > t0 form the absolute future relative to the event O at the present (events in this region occur after O in all systems of reference). Such a division into the absolute past and the absolute future with respect to O is possible only for events for which their intervals with respect to O are timelike, as shown in Figure 2. For O = (0, 0, 0, 0), these events (ct, x, y, z) lie within a cone (ct)2 − x2 − y2 − z2 = 0 which is called the null cone or light cone. All events for which their intervals with respect to O are spacelike are absolutely remote relative to O. The principle of causality states that any event O can be affected only by events in the absolute past relative to O.
Figure 2: Light cone.
In the rest frame dxα = 0 gives uα = 0. At each point in space, the condition dxα = 0 gives the relation between the proper time and the coordinate time: 1√ dτ = g dx0, (1.4.121) c 00 which requires g00 ≥ 0. (1.4.122) The relation (1.4.14) gives 0 −1/2 u = (g00) . (1.4.123) The distance between two infinitesimally separated points cannot be obtained by imposing dx0 because x0 transforms differently at these points. Instead, we consider a signal that leaves point α α 0 0 α 0 0 0 B(x + dx ) at x + dx−, reaching point A(x ) at x and coming back to point B at x + dx+, as shown in Figure 3. Accordingly, we have
2 0 2 0 α α β ds = g00(dx ) + 2g0αdx dx + gαβdx dx = 0 (1.4.124)
gives q 0 1 α α β dx± = (−g0αdx ± (g0αg0β − g00gαβ)dx dx ). (1.4.125) g00
33 The difference in the time coordinate between emitting and receiving the signal at point B is equal 0 0 √ to the difference between dx+ and dx− times g00/c, and the distance dl between points A and B is equal to this difference times c/2: 2 α β dl = γαβdx dx , (1.4.126) where g0αg0β γαβ = −gαβ + (1.4.127) g00 is the symmetric spatial metric tensor of spacetime, that is, the metric tensor of space. The event at point A at x0 is synchronized with the event at point B at the arithmetic mean of the time coordinates of emitting and receiving the signal: 1 x0 + (dx0 + dx0 ) = x0 + g dxα, (1.4.128) 2 − + α where g0α gα = − . (1.4.129) g00 Therefore, we have 0 α δx = gαδx , (1.4.130) 0 which is equivalent to δx0 = 0, is the difference in x between two synchronized infinitesimally separated points.
Figure 3: Distance.
In terms of (1.4.126) and (1.4.129), the square of the line element is equal to 2 0 α 2 2 ds = g00(dx − gαdx ) − dl , (1.4.131) The three-dimensional velocity (1.4.116), dxα vα = , (1.4.132) dτ is defined in terms of the synchronized proper time (corresponding to the difference in x0 between two synchronized infinitesimally separated points (1.4.130)): 1√ 1√ dτ = g (dx0 − δx0) = g (dx0 − g dxα). (1.4.133) c 00 c 00 α Therefore, the metric (1.4.131) becomes v2 ds2 = g (dx0 − g dxα)2 1 − , (1.4.134) 00 α c2 where v is the speed, α β 1/2 v = (γαβv v ) . (1.4.135) Using (1.4.131) in the definition of the four-velocity (1.2.57) gives α α v 0 1 α u = , u = + gαu , (1.4.136) p 2 2 √ p 2 2 c 1 − v /c g00 1 − v /c from which we also find √ 0 α g00 u0 = g00u + g0αu = . (1.4.137) p1 − v2/c2
34 1.4.9 Spatial vectors The spatial components of a contravariant four-vector Ai form a three-dimensional, spatial vector A: Ai = (A0,Aα) = (A0, A). (1.4.138) The contravariant four-vector index α is also the contravariant spatial-vector index. The covariant components of a spatial vector are related to the contravariant components by the spatial metric tensor (1.4.127) which raises and lowers indices of spatial vectors analogously to the metric tensor acting on four-vectors:
β Aα = γαβA , (1.4.139) α αβ B = γ Bβ, (1.4.140)
αβ where γ is the inverse of γαβ: αδ α γ γβδ = δβ . (1.4.141) A linear combination aA + bB of two spatial vectors A and B, where a and b are scalars, is a spatial vector C whose components are
α α α C = aA + bB ,Cα = aAα + bBα. (1.4.142)
The following formulae are satisfied:
γαβ = −gαβ, (1.4.143)
g = −g00s, (1.4.144) gα = −g0α, (1.4.145)
00 1 α g = − gαg , (1.4.146) g00 where s = detγαβ. (1.4.147) For example, contracting (1.4.127) with (1.4.143) gives
αδ αδ αδ g0β αi α0 αi α0 g0β γ γβδ = g gβδ − g g0δ = g gβi − g gβ0 − (g g0i − g g00) g00 g00 α α g0β α = δβ − δ0 = δβ , (1.4.148) g00 in accordance with (1.4.141). The components gα form a spatial vector g. The dot product or scalar product of two spatial vectors is
α β A · B = γαβA B . (1.4.149)
The square of a spatial vector A is A2 = A · A (1.4.150) and its norm, length, or magnitude is √ A = |A| = A2. (1.4.151)
The angle between two spatial vectors θ is defined through
A · B = AB cosθ. (1.4.152)
In three-dimensional space, the completely antisymmetric permutation symbols are defined as
123 0123 αβγ [αβγ] = = 1, = , ε123 = −ε0123 = 1, εαβγ = ε[αβγ]. (1.4.153)
35 The spatial analogues of (1.4.16) and (1.4.17) are √ 1 e = sε , eαβγ = √ αβγ . (1.4.154) αβγ αβγ s The cross product or vector product of two spatial vectors A and B, C = A × B is defined as the spatial vector density, dual to the antisymmetric tensor
Cαβ = AαBβ − AβBα, (1.4.155) thereby giving 1 1 Cα = eαβγ C = eαβγ A B ,C = e Cβγ = e AβBγ , (1.4.156) 2 βγ β γ α 2 αβγ αβγ γ αβ αβγ Cαβ = eαβγ C ,C = e Cγ . (1.4.157) The permutation symbols satisfy
εαβγ εαδζ = δβδδγζ − δβζ δγδ, (1.4.158)
εαβγ εαβδ = 2δγδ, (1.4.159)
εαβγ εαβγ = 6, (1.4.160)
where δαβ is the Cartesian metric tensor,
αβ δαβ = δ = diag(1, 1, 1). (1.4.161)
The spatial covariant derivative ∇α acts on spatial vectors analogously to the metric covariant derivative acting on four-vectors:
β β β γ ∇αA = ∂αA + {γ α}γ A , (1.4.162) γ ∇αAβ = ∂αAβ − {β α}γ Aγ , (1.4.163)
δ where {α β}γ are the three-dimensional, spatial Christoffel symbols: 1 { δ } = γδγ (γ + γ − γ ). (1.4.164) α β γ 2 γα,β γβ,α αβ,γ The gradient operator is given by
α α αβ (grad) = (∇) = γ ∇β. (1.4.165)
The spatial components of a covariant-vector operator ∂i acting on a scalar φ form the gradient of φ: ∂φ ∂φ ∂φ ∂ φ = , = , ∇φ . (1.4.166) i c∂t ∂xα c∂t The divergence of a spatial vector A is, analogously to (1.4.45), 1 √ divA = ∇ · A = √ ∂ ( sAα). (1.4.167) s α The curl of a spatial vector A is defined as the spatial vector density, dual to the antisymmetric tensor ∂αAβ − ∂βAα: 1 (curlA)α = (∇ × A)α = eαβγ (∂ A − ∂ A ) = eαβγ ∂ A . (1.4.168) 2 β γ γ β β γ The Laplace-Beltrami operator or Laplacian is the divergence of the gradient, 1 √ 4 = ∇2 = ∇ · ∇ = √ ∂ ( sγαβ∂ ). (1.4.169) s α β
36 The d’Alembert operator or d’Alembertian is defined as 1 ∂2 = − 4. (1.4.170) c2 ∂t2 The time component of the dual hypersurface element (1.1.35) is equal to the spatial volume element dV : dS0 = dV. (1.4.171) The spatial analogue of the Gauß-Stokes theorem (1.1.39) is Gauß’ theorem: ∂ df ↔ dV , (1.4.172) α ∂xα where ? dfα = df0α (1.4.173) is the spatial component of the dual surface element (1.1.33), perpendicular to the xα axis. In a locally Galilean frame of reference, the covariant and contravariant components of a spatial vector are identical because γαβ = δαβ. (1.4.174) In this frame, we refer to the Cartesian coordinates x1, x2, x3 as x, y, z. These coordinates form the three-dimensional radius vector x. The following formulae are satisfied:
A × B = −B × A, (1.4.175) A · (B × C) = B · (C × A) = C · (A × B), (1.4.176) A × (B × C) = B(A · C) − C(A · B), (1.4.177) (A · B)2 + (A × B)2 = A2B2, (1.4.178) curl gradφ = 0, (1.4.179) div curl A = 0, (1.4.180) grad(φψ) = gradφ ψ + φ gradψ, (1.4.181) grad(A · B) = (A · ∇)B + (B · ∇)A + A × curl B +B × curl A, (1.4.182) div(φA) = gradφ · A + φ div A, (1.4.183) curl(φA) = gradφ × A + φ curl A, (1.4.184) div(A × B) = B · curl A − A · curl B, (1.4.185) curl(A × B) = (B · ∇)A − (A · ∇)B + A div B − B div A, (1.4.186) curl curl A = grad div A − 4A, (1.4.187) where α (A · ∇)B = A ∂αB. (1.4.188) The vector product of two spatial vectors A and B satisfies
A × B = AB sinθ n, (1.4.189) where n is a unit vector perpendicular to both A and B, in the direction given by the right-handed corkscrew rule.
1.4.10 Embedded hypersurfaces A surface embedded in a three-dimensional space consists of points whose radius vectors are vector functions of two parameters ξα, where the index α can be 1 or 2: x = x(ξ1, ξ2). A vector ∂x ∂ x = (1.4.190) α ∂ξα
37 is tangent to the surface. We define the induced or intrinsic metric tensor on the surface as
γαβ = ∂αx · ∂βx. (1.4.191)
The length element dl on the surface is given by the first fundamental form: ∂x ∂x dl2 = dx · dx = · dξαdξβ = γ dξαdξβ, (1.4.192) ∂ξα ∂ξβ αβ and the area element is given by p 1 2 dS = detγαβdξ dξ . (1.4.193) The inverse intrinsic metric tensor γαβ is defined according to
αδ α γ γβδ = δβ . (1.4.194) We define the unit normal vector to a surface as ∂ x × ∂ x n = 1 2 , n · n = 1. (1.4.195) |∂1x × ∂2x| This vector is perpendicular to a tangent vector:
∂αx · n = 0. (1.4.196)
If the surface is curved, then the normal vectors at two close points on the surface are not parallel. The change of the normal vector is given by the extrinsic curvature tensor:
Kαβ = ∂α∂βx · n. (1.4.197)
The extrinsic curvature is symmetric, Kαβ = Kβα. (1.4.198) Differentiating the relation (1.4.196) with respect to ξβ and using (1.4.197) gives
Kαβ = −∂αx · ∂βn. (1.4.199)
α β α The quantity Kαβdξ dξ is the second fundamental form. The intrinsic Christoffel symbols {β γ }, symmetric in the lower indices, are constructed from the intrinsic metric tensor analogously to the spatial Christoffel symbols (1.4.164) constructed from the spatial metric tensor. They are used to construct the covariant derivative ∇i acting on the vectors tangent to the surface, analogously to (1.4.162) and (1.4.163). The covariant derivatives acting on x and n are equal to the partial derivatives:
∇αx = ∂αx, ∇αn = ∂αn. (1.4.200)
The second derivatives of x, which are the first derivatives of the tangent vectors, satisfy the Gauß equation: γ ∂α∂βx = {α β}∂γ x + Kαβn, (1.4.201) which can be written in a covariant form:
∇α∇βx = Kαβn. (1.4.202)
Multiplying this equation by n gives (1.4.197). The first derivatives of the normal vector satisfy the Weingarten equation: β βγ ∂αn = −Kα ∂βx = −Kαγ γ ∂βx, (1.4.203) which can be written in a covariant form:
β ∇αn = −Kα ∇βx. (1.4.204)
38 Multiplying this equation by ∂γ x and using (1.4.191) gives (1.4.199). The intrinsic metric tensor and the extrinsic curvature can also be written in a covariant form:
γαβ = ∇αx · ∇βx, (1.4.205)
Kαβ = ∇α∇βx · n. (1.4.206) Using the Gauß equation, the relation
∂α∂β∂γ x = ∂β∂α∂γ x (1.4.207) can be written as δ δ ∂α({β γ }∂δx + Kβγ n) = ∂β({α γ }∂δx + Kαγ n). (1.4.208) Effecting the differentiation and using again the Gauß equation gives
δ δ δ ∂δx∂α{β γ } + {β γ }{α δ}∂x + {β γ }Kαδn + ∂αKβγ n + Kβγ ∂αn δ δ δ = ∂δx∂β{α γ } + {α γ }{β δ}∂x + {α γ }Kβδn + ∂βKαγ n + Kαγ ∂jn. (1.4.209)
Multiplying this equation by ∂ζ x and using (1.4.191), (1.4.196), and (1.4.199) gives
δ δ δ δ γδζ ∂α{β γ } + γζ {β γ }{α δ} − Kαζ Kβγ − γδζ ∂β{α γ } − γζ {α γ }{β δ} + Kβζ Kαγ = 0, (1.4.210) which is equivalent to the Gauß equation:
r γαβ = K αKγβ − K βKγα, (1.4.211)
where r γαβ is the intrinsic curvature tensor constructed from the intrinsic Christoffel symbols analogously to the Riemann tensor (1.4.57) constructed from the Levi-Civita connection. Multiplying (1.4.209) by n and using (1.4.196) and ∂αn · n = 0 gives the Codazzi-Mainardi-Peterson equation:
δ δ {β γ }Kαδ + ∂αKβγ = {α γ }Kβδ + ∂βKαγ , (1.4.212) which can be written in a covariant form:
∇αKβγ = ∇βKαγ . (1.4.213) The Gauß curvature is defined as detK K K − K K K = αβ = 11 22 12 21 . (1.4.214) detγαβ γ11γ22 − γ12γ21 Using the Gauß equation, it leads to the Gauß theorem: r K = 1212 , (1.4.215) detγαβ which is consistent with (1.4.80) and (1.4.84). A curve on a surface consists of points whose radius vectors depend on a parameter t: x = x(ξ1(t), ξ2(t)). Such a curve is geodesic if it satisfies the metric geodesic equation analogous to (1.4.92): d2ξα dξβ dξγ + { α } = 0. (1.4.216) dt2 β γ dt dt A geodesic curve also satisfies d2x d dx dx ∼ n, · = 0. (1.4.217) dt2 dt dt dt A hypersurface embedded in a four-dimensional spacetime consists of points whose coordinates are functions of three parameters ξα, where the index α can be 1, 2, or 3: xi = xi(ξ1, ξ2, ξ3). Equivalently, these coordinates satisfy an equation of constraint:
f(xi) = 0, (1.4.218)
39 where f is a function of the coordinates. The normal vector to this hypersurface is given by ∂f n = . (1.4.219) i ∂xi All infinitesimal displacements dxi along such a hypersurface satisfy, according to (1.4.219),
i df = nidx = 0. (1.4.220)
The normal vector (1.4.219) is orthogonal to the hypersurface:
ijkl ninj:k = 0, (1.4.221) where ijkl is the completely antisymmetric permutation symbol. This condition is equivalent to 1 n n = (n n + n n + n n − n n − n n − n n ) = 0. (1.4.222) [i j:k] 6 i j:k j k:i k i:j k j:i i k:j j i:k If the normal vector to a hypersurface is timelike, then the hypersurface is spacelike. Such a normal vector can be normalized: i n ni = 1, (1.4.223) which gives i n ni:k = 0. (1.4.224) In this case, the four-velocity of a point in spacetime can be taken as the normal vector:
ni = ui. (1.4.225)
If the four-velocity has only the time component, then the hypersurface is a hypersurface of constant time and represents a volume in space, in which the point exists at this time. A division of spacetime into such hypersurfaces is referred to as a foliation of spacetime. We consider a spacelike hypersurface. We define the projection tensor onto the hypersurface:
i i i h j = δj − n nj, (1.4.226) which is orthogonal to ni: i j h jn = 0. (1.4.227) The projection tensor satisfies i j i h jh k = h k. (1.4.228) The indices in the projection tensor can be raised or lowered by the metric tensor:
j ik i jk ik i k ki hik = h kgij = gik − nink = hki, h = h jg = g − n n = h . (1.4.229)
ik The tensors hik and h are symmetric and not inverse to one another. The projection ⊥ of a tensor T onto a hypersurface is defined as the contraction of the tensor T with the projection tensor through all indices. For example, the projections of vectors are
i i k k ⊥ V = h kV , ⊥ Vi = h iVk. (1.4.230) These projections are tangent vectors to the hypersurface. The projection of the metric tensor gives
k l ij i j kl ij ⊥ gij = h ih jgkl = hij, ⊥ g = h kh lg = h . (1.4.231) Consequently, the relation i j i j hij ⊥ V ⊥ V = gij ⊥ V ⊥ V (1.4.232) shows that the tensor hij is the intrinsic metric tensor γij on the hypersurface, analogously to (1.4.191): γij = hij. (1.4.233)
40 The inverse intrinsic metric tensor γij is defined as in (1.4.194). The projection of the normal vector vanishes: ⊥ ni = 0. (1.4.234) If the normal vector to the hypersurface is timelike, then the projections of tensors have only spatial i i components. Using the tensor n nj instead of h j projects a tensor onto the direction of the normal vector. The projection of the covariant derivative (with respect to a torsionless affine connection) of a vector defines the intrinsic covariant derivative of a vector on the hypersurface:
l l i l j DkV =⊥ ∇kV = h kh j∇iV . (1.4.235)
The intrinsic covariant derivative of the intrinsic metric tensor vanishes:
Dkγij =⊥ ∇kγij =⊥ ∇k(gij − ninj) = − ⊥ (ni∇knj + nj∇kni) = 0, (1.4.236)
which is a consequence of the metric compatibility of the affine connection (1.4.5). Accordingly, the intrinsic covariant derivative is constructed from the intrinsic Christoffel symbols, which are constructed from the intrinsic metric tensor. If ∇k is related to the Levi-Civita connection of the metric gij, then Dk is related to the Levi-Civita connection of the intrinsic metric γij. If the parallel transport of the normal vector to a hypersurface along a vector W i =⊥ V i on the hypersurface does not vanish, i j W ∇in 6= 0, (1.4.237) then the hypersurface is curved. Such a hypersurface has a nonzero extrinsic curvature tensor, defined as k l Kij = − ⊥ ∇inj = −h ih j∇knl. (1.4.238) Using (1.4.224) and (1.4.226), the extrinsic curvature is equal to
k k l l k Kij = −(δi − n ni)(δj − n nj)nl:k = −nj:i + nin nj:k. (1.4.239)
The extrinsic curvature is a tensor with only spatial components:
j Kijn = 0. (1.4.240)
Antisymmetrizing the indices in the extrinsic curvature and using (1.4.239) gives
k k Kij − Kji = ni:j − nj:i + n ninj:k − n njni:k. (1.4.241)
The term on the right-hand side is equal to the term in (1.4.222) contracted with nk, which vanishes. Consequently, the extrinsic curvature is symmetric, as in (1.4.198). This symmetry also results from (1.4.219): Kij = − ⊥ ∇i∂jf = − ⊥ ∇j∂if = Kji. (1.4.242) For the Levi-Civita connection, (1.4.51) gives 1 K = − ⊥ n = − ⊥ n = − ⊥ L g , (1.4.243) ij j:i (i:j) 2 n ij
i where Ln is the Lie derivative of the metric tensor along the vector n . If the normal vector is the four-velocity of a point in spacetime, then (1.4.239) gives
Du K = −u + u j . (1.4.244) ij j;i i ds The contraction of the extrinsic curvature tensor gives the extrinsic curvature scalar:
ij K = Kijγ . (1.4.245)
41 For a spacelike hypersurface, the spatial coordinates on this hypersurface can be taken as the parameters ξα. Differentiating the equation of constraint for a hypersurface f(xi(ξα)) = 0 with respect to ξα gives ∂f ∂f ∂xi ∂xi = = n = 0. (1.4.246) ∂ξα ∂xi ∂ξα i ∂ξα Differentiating covariantly this equation with respect to ξβ gives
∇n ∂xi ∇2xi ∇ ∂f ∂xi ∇2f ∇n i +n = +n ∇ ∇ xi = +n ∇ ∇ xi = β +n ∇ ∇ xi = 0, ∂ξβ ∂ξα i ∂ξβ∂ξα ∂xi ∂ξβ ∂ξα i β α ∂ξα∂ξβ i β α ∂ξα i β α (1.4.247) where the covariant derivatives ∇α are constructed from the metric tensor γαβ and the corresponding Levi-Civita connection. Accordingly, using (1.4.238), we obtain
i Kαβ = −∇αnβ = ni∇β∇αx , (1.4.248) which is consistent with the extrinsic curvature tensor for a surface (1.4.206). The intrinsic covariant derivative of a vector W i =⊥ V i on a hypersurface is
m m n n DjWk =⊥ ∇jWk = (δj − n nj)(δk − n nk)∇mWn m n m n = ∇jWk − n nj∇mWk − n nk∇jWn + n njn nk∇mWn m n m n = ∇jWk − n nj∇mWk + nkW ∇jnn + n njn nk∇mWn, (1.4.249)
i i i where we used niW = 0, which gives n ∇jWi = −W ∇jni. Consequently, the second derivative is
DiDjWk =⊥ (∇iDjWk) =⊥ (∇i ⊥ ∇jWk) m n m n =⊥ ∇i∇jWk+ ⊥ ∇i(−n nj∇mWk + nkW ∇jnn + n njn nk∇mWn) n n =⊥ ∇i∇jWk+ ⊥ ∇inkW ∇jnn =⊥ ∇i∇jWk + KikKjnW , (1.4.250) where we used (1.4.234) and (1.4.238). The commutator of intrinsic covariant derivatives gives the intrinsic curvature tensor: l [Di,Dj]Wk = −r kijWl, (1.4.251) whereas the commutator of covariant derivatives gives the Riemann tensor, according to (1.4.56). Therefore, antisymmetrizing the indices i, j in (1.4.250) gives
l l l l r kijWl =⊥ P kijWl − KikKjlW + KjkKilW , (1.4.252) which leads to ⊥ Plkij = rlkij + KikKjl − KjkKil. (1.4.253)
This equation is consistent with (1.4.211) for Plkij = 0, satisfied for a curved surface in a flat space. i The projection of n Pijkl is given by
i i i ⊥ (n Pijkl) =⊥ (∇l∇knj − ∇k∇lnj) =⊥ (∇l(Kkj − nkn nj:i) − ∇k(Klj − nln nj:i)) i =⊥ (∇lKkj − ∇kKlj + (∇knl − ∇lnk)n nj:i) = DlKkj − DkKlj, (1.4.254) where we used (1.4.219) and (1.4.234). This equation is consistent with (1.4.213) for Plkij = 0, satisfied for a curved surface in a flat space. Equations (1.4.253) and (1.4.254) are referred to as the Gauß-Codazzi equations. If the normal vector to a hypersurface is spacelike, then the hypersurface is timelike. An example of such a hypersurface is a hypersurface on which a given spatial coordinate is constant. The normal vector can be normalized: i n ni = −1. (1.4.255) The projection tensor onto a timelike hypersurface differs from (1.4.226) by a sign:
i i i h j = δj + n nj, (1.4.256)
42 whereas all other definitions are the same as for spacelike hypersurfaces. If a hypersurface is spacelike or timelike, and forms a boundary between two submanifolds in spacetime, then the intrinsic metric tensor γij and the extrinsic curvature tensor Kij are continuous across the hypersurface. Consequently, the first and second fundamental forms are continuous across the hypersurface. These two covariant conditions are referred to as the Darmois junction conditions. Equivalently, the metric tensor gij and its derivatives gij,k are continuous across the hypersurface. These two conditions are referred to as the Lichnerowicz junction conditions.
1.4.11 Event horizon If the normal vector (1.4.219) to a hypersurface is a null vector,
i nin = 0, (1.4.257) then this hypersurface is a null hypersurface. Equations (1.4.220) and (1.4.257) indicate that ni lies itself on the null hypersurface to which it is normal,
dxi ∝ ni, (1.4.258) which also gives 2 i i ds = dxidx ∝ nin = 0. (1.4.259) Therefore, all world lines on a null hypersurface are null. The light cones at the points of such a hypersurface are tangent to this hypersurface. Since all physical world lines must lie within the local light cones, the forward-time motion through a null hypersurface can occur in only one direction. To avoid any discontinuities, this direction is the same for all points on such a hypersurface. A null hypersurface is therefore an event horizon: a boundary in spacetime beyond which events cannot affect events on the other side. All laws of classical physics are known to be time-symmetric, that is, symmetric under the transformation t → −t. However, the existence of event horizons, which are solutions to these laws and provide boundary conditions for spacetime, violates this symmetry. The unidirectional character of the motion through an event horizon can be used to define the past and future: the arrow of time. References: [1, 2, 3, 4, 5].
1.5 Tetrad and spin connection 1.5.1 Tetrad In addition to the coordinate systems, at each point in spacetime we can set up four linearly inde- i pendent vectors ea such that i eaeib = ηab, (1.5.1) where a, b = 0, 1, 2, 3 are Lorentz indices and ηab = diag(1, −1, −1, −1) is the coordinate-invariant Minkowski metric tensor in a locally geodesic frame of reference at this point. This set of four vectors is referred to as a tetrad. The inverse tetrad eai satisfies
i b b eaei = δa, (1.5.2) i a i eaek = δk. (1.5.3)
ik The coordinate metric tensors gik and g are related to the Minkowski metric tensor through the tetrad:
a b gik = ei ekηab, (1.5.4) ik i k ab g = eaeb η , (1.5.5) where ηab satisfies bc b ηacη = δa. (1.5.6)
43 Any vector V can be specified by its components V i with respect to the coordinate system or by the coordinate-invariant projections V a of the vector onto the tetrad field:
a a i i V = ei V ,Va = eaVi, (1.5.7) i i a a V = eaV ,Vi = ei Va, (1.5.8)
ab and similarly for tensors and densities with more indices. We can use ηab and its inverse η to lower ik and raise Lorentz indices, as we use gik and its inverse g to lower and raise coordinate indices. a Let us consider the determinant of the matrix composed from the components of the tetrad ei ,
a e = |ei |. (1.5.9)
This determinant is related to the determinant g of the metric tensor gik, using (1.5.4), by
e = p|g|. (1.5.10)
The differential and derivatives of the determinant (1.5.9) are given, analogously to (1.4.18) and (1.4.19), by
i a a i de = eeadei = −eei dea, (1.5.11) i a a i e,k = eeaei,k = −eei ea,k. (1.5.12) The variation of (1.5.9) is thus, analogously to (1.4.20), equal to
i a a i δe = eeaδei = −eei δea. (1.5.13) Similarly to (1.4.21), the covariant derivative of (1.5.9) vanishes:
e;j = 0. (1.5.14)
1.5.2 Lorentz transformation The relation (1.5.4) imposes 10 constraints on the 16 components of the tetrad, leaving 6 components i i arbitrary. If we change from one tetrad ea to another,e ˜b, then the vectors of the new tetrad are linear combinations of the vectors of the old tetrad:
i b i e˜a = Λ aeb. (1.5.15)
i The relation (1.5.4) applied to the tetrad fielde ˜b, a b gik =e ˜i e˜kηab, (1.5.16)
b imposes on the matrix Λ a the orthogonality condition: c d Λ aΛ bηcd = ηab. (1.5.17)
b We refer to Λ a as a Lorentz matrix, and to a transformation of form (1.5.15) as the Lorentz trans- formation.
1.5.3 Tetrad transport A natural choice for the zeroth component of a tetrad at a given point is
i i e0 = u . (1.5.18) Along a world line this tetrad should be transported such that the zeroth component always coincides with the four-velocity. The Fermi-Walker transport of a tetrad is defined as
∇ei Du Dui a = −uiej j + ej u . (1.5.19) ds a ds ds a j
44 Putting a = 0 in (1.5.19) gives ∇ui Dui = , (1.5.20) ds ds so the Fermi-Walker transport of the four-velocity is equivalent to its covariant change and thus (1.5.18) is valid at all points. This transport does not change the orthogonality relation for tetrads (1.5.1) because (1.5.19) gives ∇ (ei e ) = 0. (1.5.21) ds a ib
1.5.4 Spin connection We define i i i i j ω ak = ea;k = ea,k + Γj kea. (1.5.22) The quantities a a k a k k j ω bi = ekω bi = ek(eb,i + Γj ieb) (1.5.23) transform like vectors under coordinate transformations. We can extend the notion of covariant dif- ab ferentiation to quantities with Lorentz coordinate-invariant indices by regarding ω i as a connection, referred to as Lorentz or spin connection. For a contravariant Lorentz vector
a a a b V |i = V ,i + ω biV , (1.5.24)
i where |i is the covariant derivative of such a quantity with respect to x . The covariant derivative a of a scalar V Wa coincides with its ordinary derivative:
a a (V Wa)|i = (V Wa),i, (1.5.25) which gives the covariant derivative of a covariant Lorentz vector:
b Wa|i = Wa,i − ω aiWb. (1.5.26)
The chain rule infers that the covariant derivative of a Lorentz tensor is equal to the sum of the corresponding ordinary derivative of this tensor and terms with spin connection corresponding to each Lorentz index:
ab... ab... a eb... b ae... e ab... e ab... T cd...|i = T cd...,i + ω eiT cd... + ω eiT cd... + · · · − ω ciT ed... − ω diT ce... − .... (1.5.27)
We assume that the covariant derivative |i is total, that is, also recognizes coordinate indices, acting on them like ;i. For a tensor with both coordinate and Lorentz indices
aj... aj... a ej... j al... e aj... l aj... T bk...|i = T bk...,i + ω eiT bk... + Γl iT bk... + · · · − ω biT ek... − Γk iT bl... − .... (1.5.28)
A total covariant derivative of a tetrad is
i i i j b i ea|k = ea,k + Γj kea − ω akeb = 0, (1.5.29) because of (1.5.22). Therefore, total covariant differentiation commutes with converting between a coordinate and Lorentz indices. Equation (1.5.29) determines the spin connection ω bi in terms of the affine connection, tetrad and its ordinary derivatives, in accordance with (1.5.23). Conversely, the affine connection is determined by the spin connection, tetrad and its derivatives:
j j a j Γi k = ω ik + ei,kea. (1.5.30) The torsion tensor is then related to these quantities by
j j a j S ik = ω [ik] + e[i,k]ea, (1.5.31)
45 and the torsion vector is k a k Si = ω [ik] + e[i,k]ea. (1.5.32) Metric compatibility of the affine connection leads to
a b a b c c gik;j = gik|j = ei ekηab|j = −ei ek(ω ajηcb + ω bjηac) = −(ωkij + ωikj) = 0, (1.5.33) so the spin connection is antisymmetric in its first two indices:
a a ω bi = −ωb i. (1.5.34) Accordingly, the spin connection has 24 independent components. The contortion tensor is related to the spin connection by Cijk = ωijk + ∆ijk, (1.5.35) where a a a ∆ijk = eiae[j,k] − ejae[i,k] − ekae[i,j] (1.5.36) are the Ricci rotation coefficients. The first term on the right-hand side in (1.5.35) is expected because both the contortion tensor and spin connection are antisymmetric in their first two indices. The quantities i i i i j $ ak = ea:k = ea,k + {j k}ea (1.5.37) form the Levi-Civita spin connection and are related to the Ricci rotation coefficients by (1.5.35) with Cijk = 0, $ijk = −∆ijk, (1.5.38) so Cijk = ωijk − $ijk. (1.5.39)
1.5.5 Tetrad representation of curvature tensor The commutator of the covariant derivatives of a tetrad with respect to the affine connection is
k k σ l k 2ea;[ji] = R lijea + 2S ijea;l. (1.5.40) This commutator can also be expressed in terms of the spin connection:
k k k b kb b k ea;[ji] = ω a[j;i] = (eb ω a[j);i] = ωba[jω i] + ω a[j;i]eb kb b k l k = ωba[jω i] + ω a[j,i]eb + S ijω al. (1.5.41) Consequently, the curvature tensor with two Lorentz and two coordinate indices depends only on the spin connection and its ordinary derivatives:
a a a a c a c R bij = ω bj,i − ω bi,j + ω ciω bj − ω cjω bi. (1.5.42) Because the spin connection is antisymmetric in its first two indices, the tensor (1.5.42) is antisym- metric in its first two (Lorentz) indices, like the Riemann tensor. The contraction of the curvature tensor (1.5.42) with a tetrad gives the Ricci tensor with one Lorentz and one coordinate index:
a i Rbj = R bijea. (1.5.43)
a The contraction of the tensor R i with a tetrad gives the Ricci scalar,
a i ab i j R = R iea = R ijeaeb. (1.5.44) The Riemann tensor with two Lorentz and two coordinate indices depends on the Levi-Civita connection (1.5.37) the same way the curvature tensor depends on the affine connection:
a a a a c a c P bij = $ bj,i − $ bi,j + $ ci$ bj − $ cj$ bi. (1.5.45)
46 The contraction of (1.5.45) with a tetrad gives the Riemannian Ricci tensor with one Lorentz and one coordinate index: a i Pbj = P bijea. (1.5.46) a The contraction of the tensor P i with a tetrad gives the Riemann scalar,
a i ab i j P = P iea = P ijeaeb. (1.5.47)
References: [3, 4, 6, 7, 8].
1.6 Lorentz group Lorentz transformations relate different tetrads at a given point in spacetime, where the metric tensor ijkl ijkl can be brought to the Galilean form: gik = ηik. Accordingly, we have = e , εijkl = eijkl, αβγ αβγ = e , and εαβγ = eαβγ .
1.6.1 Subgroups of Lorentz group and Einstein principle of relativity
A composition of two Lorentz transformations Λ1 and Λ2,
a a c Λ b = Λ(1)cΛ(2)b, (1.6.1)
a satisfies (1.5.17), thereby it is a Lorentz transformation. The Kronecker symbol δb also satisfies (1.5.17), thereby it can be regarded as the identity Lorentz transformation. Therefore, Lorentz transformations form a group, referred to as the Lorentz group. Taking the determinant of the relation (1.5.17) gives a |Λ b| = ±1. (1.6.2) a a A Lorentz transformation with |Λ b| = 1 is proper and with |Λ b| = −1 is improper. Proper Lorentz transformations form a group because the determinant of the product of two proper Lorentz transformations is 1. Improper Lorentz transformations include the parity transformation P
a Λ b(P ) = diag(1, −1, −1, −1), t → t, x → −x, (1.6.3) and the time reversal T a Λ b(T ) = diag(−1, 1, 1, 1), t → −t, x → x. (1.6.4) 0 0 0 0 The relation (1.5.17) gives Λ 0Λ 0 − Λ αΛ α = 1, thereby
0 |Λ 0| ≥ 1. (1.6.5)
0 i Lorentz transformations with Λ 0 ≥ 1 are orthochronous and form a group. If x is a timelike vector, i 00 0 0 0 α x xi > 0, then for an orthochronous transformation x = Λ 0x + Λ αx , q q 0 α 0 0 β β 0 2 0 2 0 0 |Λ αx | ≤ Λ αΛ αx x < (Λ 0) (x ) = |Λ 0x |. (1.6.6)
Therefore, the time component of a timelike vector does not change the sign under orthochronous transformations. Einstein’s special principle of relativity states that physical laws do not change their form under transformations within the orthochronous proper subgroup of the Lorentz group. Equivalently, physical laws have the same form in all admissible inertial frames of reference. The special principle of relativity is a special case of the general principle of relativity, in which arbitrary differentiable coordinate transformations are restricted to linear transformations (orthochronous proper Lorentz transformations) between inertial frames of reference. Under the parity transformation, the spatial components of contravariant and covariant vectors, which form spatial vectors, change the sign. The permutation symbols do not change under this transformation. Accordingly, the spatial components of dual vector densities, such as the components of a vector product (1.4.157) or a curl (1.4.168), do not change the sign. Such quantities, that
47 transform under proper Lorentz transformations like vectors and do not change the sign in their spatial components under the parity transformation, are referred to as axial vectors or pseudovectors. Similarly, the scalar contraction of the Levi-Civita symbol and a tensor changes the sign, while a scalar does not. Quantities that transform under proper Lorentz transformations like scalars and change the sign under the parity transformation are referred to as pseudoscalars.
1.6.2 Infinitesimal Lorentz transformations Let us consider an infinitesimal Lorentz transformation
µ µ µ Λ ν = δν + ν , (1.6.7)
µ where ν are infinitesimal quantities. The relation (1.5.17) gives
µν = −νµ, (1.6.8) where the indices are raised and lowered using the Minkowski metric tensor. Therefore, Lorentz transformations are given by 6 independent antisymmetric parameters µν . The corresponding transformation of a contravariant vector Aµ is 1 1 A0µ = Aµ + µ Aν = Aµ + ρσ(δµη − δµη )Aν = Aµ + ρσJ µ Aν , (1.6.9) ν 2 ρ σν σ ρν 2 νρσ where µ µ µ Jνρσ = δρ ησν − δσ ηρν . (1.6.10)
We define matrices Jρσ such that µ µ (Jρσ)ν = Jνρσ. (1.6.11) Therefore, in the matrix notation (with Aµ treated as a column), 1 A0 = 1 + ρσJ A. (1.6.12) 2 ρσ
The 6 matrices Jρσ are the infinitesimal generators of the vector representation of the Lorentz group. The explicit form of the generators of the Lorentz group in the vector representation is
0 −1 0 0 0 0 −1 0 −1 0 0 0 0 0 0 0 J01 = ,J02 = , 0 0 0 0 −1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 −1 0 0 0 0 0 0 0 0 0 0 −1 0 J03 = ,J12 = , 0 0 0 0 0 1 0 0 −1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 J23 = ,J31 = . (1.6.13) 0 0 0 −1 0 0 0 0 0 0 1 0 0 −1 0 0
1.6.3 Generators and Lie algebra of Lorentz group The commutator of the generators of the Lorentz group in the vector representation is given, using (1.6.10) and (1.6.11), by
µ µ λ µ λ µ [Jκτ ,Jρσ]ν = (Jκτ )λ(Jρσ)ν − (Jρσ)λ(Jκτ )ν = (−Jκρητσ − Jτσηκρ + Jκσητρ + Jτρηκσ)ν , (1.6.14) so [Jκτ ,Jρσ] = −Jκρητσ − Jτσηκρ + Jκσητρ + Jτρηκσ. (1.6.15)
48 The relation (1.6.15) constitutes the Lie algebra of the Lorentz group. If a set of qantitites φ transforms under a Lorentz transformation Λ with a matrix D(Λ)
φ → D(λ)φ, (1.6.16) then D is a representation of the Lorentz group if
D(I) = I,D(Λ1Λ2) = D(Λ1)D(Λ2), (1.6.17)
where I denotes the identity transformation, and Λ1 and Λ2 are two Lorentz transformations. There- fore, we have D(Λ−1) = D−1(Λ), (1.6.18) where Λ−1 is the Lorentz transformation to Λ: ΛΛ−1 = I. For an infinitesimal Lorentz transforma- tion in any representation, 1 D(Λ) = I + ρσJ , (1.6.19) 2 ρσ according to (1.6.12). The relation
−1 −1 D(Λ1Λ2Λ1 ) = D(Λ1)D(Λ2)D (Λ1) (1.6.20) gives (1.6.15), valid for any representation of the Lorentz group. −1 If Λ1 and Λ2 are two group transformations then Λ3 = Λ1Λ2Λ1 is a group transformation. If −1 Λ2 = I + 2G2 is an infinitesimal group transformation with generator G2 then Λ3 = I + 2Λ1G2Λ1 −1 is an infinitesimal group transformation with generator G3 = Λ1G2Λ1 . If Λ1 = I + 1G1 is an infinitesimal group transformation with generator G1 then, neglecting terms in 1 of higher order, G3 = G2 +1[G1,G2], thereby [G1,G2] is a generator. For a finite number N of linearly independent N generators, a general infinitesimal group transformation is Λ = I + Σa=1aGa. Because [Ga,Gb] is a N generator, it is a linear combination of the N generators: [Ga,Gb] = Σc=1fabcGc, where fabc are the structure constants of the Lie algebra of the given group. For the Lorentz group, aGa = D(Λ) − I, where D(Λ) is given by (1.6.19).
1.6.4 Rotations and boosts Rotations are proper orthochronous Lorentz transformations with
0 α 0 Λ α = Λ 0 = 0, Λ 0 = 1. (1.6.21) Rotations act only on the spatial coordinates xα and form a group, referred to as the rotation group. Boosts are proper orthochronous Lorentz transformations with
α Λ β = 0. (1.6.22) We define 1 J = e J βγ , (1.6.23) α 2 αβγ Kα = J0α, (1.6.24) and 1 ϑ = e βγ , (1.6.25) α 2 αβγ ηα = 0α. (1.6.26)
The explicit form of the generators of the rotation group Jα in the vector representation is 0 0 0 0 0 1 0 −1 0 J1 = 0 0 −1 ,J2 = 0 0 0 ,J3 = 1 0 0 . (1.6.27) 0 1 0 −1 0 0 0 0 0
49 For an infinitesimal Lorentz transformation (1.6.19)
D = I + ϑ · J + η · K. (1.6.28)
A finite Lorentz transformation can be regarded as a composition of successive identical infinites- imal Lorentz transformations:
n θ·J+η·K D = limn→∞(I + θ · J/n + η · K/n) = e . (1.6.29)
The finite parameters θ, η are the canonical parameters for a given Lorentz transformations. For a finite Lorentz transformation, (1.6.19) gives
1 ρσ J D(Λ) = e 2 ρσ , (1.6.30) so ∂D(Λ) Jµν = . (1.6.31) ∂µν Λ=I The explicit form of a finite Lorentz transformation in the vector representation is
1 0 0 0 1 0 0 0
θJ1 0 1 0 0 θJ2 0 cosθ 0 sinθ R1 = e = ,R2 = e = , 0 0 cosθ −sinθ 0 0 1 0 0 0 sinθ cosθ 0 −sinθ 0 cosθ 1 0 0 0 coshη sinhη 0 0
θJ3 0 cosθ −sinθ 0 ηK1 sinhη coshη 0 0 R3 = e = ,B1 = e = , 0 sinθ cosθ 0 0 0 1 0 0 0 0 1 0 0 0 1 coshη 0 sinhη 0 coshη 0 0 sinhη
ηK2 0 1 0 0 ηK3 0 1 0 0 B2 = e = ,B3 = e = , sinhη 0 coshη 0 0 0 1 0 0 0 0 1 sinhη 0 0 coshη (1.6.32)
α where Rα denotes a rotation about the x axis and Bα denotes a boost along this axis. The canonical parameters θ and η are respectively referred to as the angle of rotation and rapidity. The parameters ϑ and η in (1.6.25) and (1.6.26) are thus respectively infinitesimal values of the angle of rotation and rapidity. A rotation about any axis, say z, by an angle θ turns the two other axes, x and y, into new axes, x0 and y0, such that the angle between x and x0 (or y and y0) (1.4.152) is θ. The rotation group is compact: θα ∈ [0, 2π] and θ = 2π ⇔ θ = 0. The explicit form of a finite rotation in the three-dimensional vector representation is
1 0 0 cosθ 0 sinθ R1(θ) = 0 cosθ −sinθ ,R2(θ) = 0 1 0 , 0 sinθ cosθ −sinθ 0 cosθ cosθ −sinθ 0 R3(θ) = sinθ cosθ 0 . (1.6.33) 0 0 1
For instance, 0 Vx Vx Vx Vxcosθ − Vysinθ 0 Vy → Vy = R3 Vy = Vxsinθ + Vycosθ . (1.6.34) 0 Vz Vz Vz Vz The relation (1.6.31) gives ∂Rα(θ) Jα = . (1.6.35) ∂θ θ=0
50 The orthogonality relation (1.5.17) applied to any of the rotation matrices (1.6.33) shows that a rotation matrix R is orthogonal, that is, its transpose RT is equal to its inverse R−1:
T −1 T T Rα = Rα ,RαRα = Rα Rα = I, (1.6.36) where I is the identity matrix. The commutation relation (1.6.15) gives
[Jα,Jβ] = eαβγ Jγ , (1.6.37)
[Jα,Kβ] = eαβγ Kγ , (1.6.38)
[Kα,Kβ] = −eαβγ Jγ . (1.6.39)
Therefore, rotations do not commute and form a nonabelian group, rotations and boosts do not commute, and boosts do not commute. Changing the order of two nonparallel boosts is equivalent to applying a rotation, referred to as the Thomas-Wigner rotation. The structure constants of the Lie algebra of the rotation group are fabc = eabc. Moreover, the square of the generators of rotation,
2 J = JαJα, (1.6.40) commutes with Jα:
2 [J ,Jβ] = [Jα,Jβ]Jα + Jα[Jα,Jβ] = eαβγ (Jγ Jα + JαJγ ) = 0. (1.6.41)
Definining 1 L = (J + iK), (1.6.42) 2 1 Q = (J − iK), (1.6.43) 2 gives
[Lα,Lβ] = eαβγ Lγ , (1.6.44)
[Qα,Qβ] = eαβγ Qγ , (1.6.45)
[Lα,Qβ] = 0, (1.6.46)
so the Lorentz group is isomorphic with the product of two complex rotation groups. Accordingly, the Lorentz group can be regarded as the group of four-dimensional rotations in the Minkowski space, or the group of tetrad rotations.
1.6.5 Poincar´egroup Under an infinitesimal coordinate transformation (1.2.66) in a locally flat spacetime, (1.4.51) gives
ηik → ηik − ξi,k − ξk,i. (1.6.47)
i Therefore, the tensor ηik is invariant under (1.2.66) (isometric) if ξ is a Killing vector,
ξ(i,k) = 0, (1.6.48) which has the solution i ik i ξ = xk + , (1.6.49) where ik and i are constant. The first term on the right-hand side of (1.6.49) corresponds to a Lorentz rotation described by 6 parameters ik satisfying (1.6.8). The second term on the right-hand side of (1.6.49) corresponds to a translation. A combination of two translations does not change if their order is reversed, thereby translations commute:
[Tµ,Tν ] = 0, (1.6.50)
51 α α where Tµ is the generator of translation. The relations (1.6.37) and (1.6.38) mean that J and K are spatial vectors under rotations. Spatial translations are spatial vectors under rotations, while a time translation is a scalar:
[Jα,Tβ] = eαβγ Tγ , (1.6.51)
[Jα,T0] = 0. (1.6.52) The last relation indicates that the generators of rotations, like the generators of spatial translations, correspond to conserved quantities, which are quantities that do not change in time. The covariant generalization of (1.6.51) and (1.6.52) is
[Jµν ,Tρ] = Tµηνρ − Tν ηµρ. (1.6.53) The relations (1.6.15), (1.6.50) and (1.6.53) constitute the Lie algebra of the inhomogeneous Lorentz or Poincar´egroup. In particular,
[Kα,Tβ] = −T0δαβ, (1.6.54)
[Kα,T0] = −Tα. (1.6.55) The last relation indicates that the generators of boosts do not correspond to conserved quantities. For an infinitesimal rotation about the z axis,
(I + ϑJz)f(ct, x) = D(Rz(ϑ))f(ct, x) = f(ct, Rz(ϑ)x) ≈ f(ct, x − ϑy, ϑx + y, z) ∂f ∂f = f(ct, x) − ϑy + ϑx , (1.6.56) ∂x ∂y or ∂ ∂ J = x − y , (1.6.57) z ∂y ∂x which gives the differential representation of rotations:
Jα = eαβγ xβ∂γ . (1.6.58) For an infinitesimal boost along the z axis,
(I + ηKz)f(ct, x) = D(Bz(η))f(ct, x) = f(Bz(η)(ct, x)) ≈ f(ct + ηz, y, z + ηct) ∂f ∂f = f(ct, x) + ηz + ηct , (1.6.59) c∂t ∂z or ∂ ∂ K = z + ct , (1.6.60) z c∂t ∂z which gives the differential representation of boosts: ∂ ∂ K = x + ct . (1.6.61) α α c∂t ∂xα The relation for an infinitesimal translation, analogous to (1.6.19), is µ D(t) = I + Tµ, (1.6.62) so a finite translation is given by µ D(t) = e Tµ . (1.6.63) Translation in (1.6.49) can also be written as ν ν ν tµ()x = x + δµ. (1.6.64) The relation analogous to (1.6.35) is ∂tµ() Tµ = . (1.6.65) ∂ =0 The differential representation of a translation is thus ∂ T = . (1.6.66) µ ∂xµ
52 1.6.6 Invariants of Lorentz and Poincar´egroup Analogously to (1.6.41),
2 [L ,Lβ] = 0, (1.6.67) 2 [Q ,Qβ] = 0, (1.6.68)
so L2 and Q2 commute with all 6 generators of the Lorentz group. Consequently, J 2 + K2 and J · K commute with all generators of the Lorentz group, that is, are the invariants or Casimir operators of the Lorentz group. The Casimir operators of Lorentz group do not commute with the generators of translation Tµ, thereby they are not the invariants of the Poincar´egroup. Instead, the mass operator
2 µ m = −T Tµ (1.6.69) and 2 µ W = W Wµ, (1.6.70) where W µ is the Pauli-Luba´nskipseudovector 1 W µ = eµνρσJ T , (1.6.71) 2 ρσ ν commute with all generators of the Poincar´egroup, thereby they are the Casimir operators of the Poincar´egroup. The Pauli-Luba´nskipseudovector obeys the commutation relations
[Tµ,Wν ] = 0, (1.6.72)
[Jµν ,Wρ] = Wµηνρ − Wν ηµρ, (1.6.73) µ ν µνρσ [W ,W ] = e WρTσ. (1.6.74)
The relation (1.6.73) is analogous to (1.6.53) because W µ behaves like a vector under proper Lorentz transformations. We define the four-momentum operator
Pµ = iTµ, (1.6.75)
whose time component is the energy operator P0 = iT0 and spatial components form the momentum operator Pα = iTα. We define the angular four-momentum operator
Mµν = iJµν , (1.6.76)
whose spatial components form the angular momentum operator
Mα = iJα. (1.6.77)
Therefore, the following relations are satisfied:
[Mµν ,Mρσ] = −i(Mµρηνσ + Mνσηµρ − Mµσηνρ − Mνρηµσ), (1.6.78)
[Pµ,Pν ] = 0, (1.6.79)
[Mµν ,Pρ] = i(Pµηνρ − Pν ηµρ), (1.6.80) 2 µ m = P Pµ, (1.6.81) 1 W µ = − eµνρσM P , (1.6.82) 2 ρσ ν [Pµ,Wν ] = 0, (1.6.83)
[Mµν ,Wρ] = i(Wµηνρ − Wν ηµρ), (1.6.84) µ ν µνρσ [W ,W ] = −ie WρPσ, (1.6.85)
[Mα,Mβ] = ieαβγ Mγ . (1.6.86)
53 1.6.7 Relativistic kinematics The quantitites vα (1.4.116) form the three-dimensional vector of velocity:
dx v = , (1.6.87) dt where x is the radius vector. The magnitude of the velocity is equal to the speed (1.4.135):
|v| = v. (1.6.88)
Let us consider a boost in the direction of the z axis
x0i = e−ηK3 xi, (1.6.89)
where xi and x0i have a form of a column (4×1 matrix), and eηK3 is given by (1.6.32). Therefore, the coordinates in an inertial K-system (unprimed) are related to the coordinates in an inertial K0-system (primed) by
ct = ct0coshη + z0sinhη, x = x0, y = y0, z = z0coshη + ct0sinhη. (1.6.90)
Let us consider the origin of the K0-system, x0 = y0 = z0 = 0, in the K-system. Therefore, we have
ct = ct0coshη, z = ct0sinhη, (1.6.91)
dz 0 which gives the relation between the rapidity η and speed V = dt of K relative to K: tanhη = β, (1.6.92) where V V β = , β = . (1.6.93) c c Accordingly, coshη = γ and sinhη = βγ, where
V 2 −1/2 γ = 1 − . (1.6.94) c2 The relations (1.6.90) become
V t = γ t0 + z0 , c2 x = x0, y = y0, z = γ(z0 + V t0), (1.6.95)
and are referred to as a special Lorentz transformation in the z-direction. The reverse transformation is V t0 = γ t − z , c2 x0 = x, y0 = y, z0 = γ(z − V t). (1.6.96)
For a boost along an arbitrary direction, the spatial vector x = (x, y, z) transforms such that its 0 2 component parallel to the velocity V = cβ of K relative to K, xk = (x · V)V/V (similarly for
54 primed), behaves like z in (1.6.95) and its component perpendicular to V, x⊥ = x − xk, behaves like x in (1.6.95):
V · x0 t = γ t0 + , c2 0 x⊥ = x⊥, 0 0 xk = γ(xk + Vt ), (1.6.97) so (γ − 1)(V · x0)V x = γ(x0 + Vt0) + x0 = γVt0 + x0 + . (1.6.98) k ⊥ V 2 Therefore, the transformation law for the coordinates in two inertial frames of reference is ! ct γ γβ ct0 = (γ−1)β 0 , (1.6.99) x γβ 1 + β2 β x
or equivalently ! ct0 γ −γβ ct 0 = (γ−1)β . (1.6.100) x −γβ 1 + β2 β x The matrix in (1.6.100) is called a boost matrix. In the local Minkowski spacetime, contravariant vectors transform like xi, according to (1.6.97) and (1.6.99), ! W 0 γ γβ W 00 = (γ−1)β 0 , (1.6.101) W γβ 1 + β2 β W
covariant vectors transform such that they remain related to contravariant vectors by the Minkowski metric tensor, and tensors transform like products of vectors. For example, if V = cβzˆ is parallel to the z axis, a tensor of rank (0,2) transforms according to
2 2 T00 = γ(T000 + βT030 ) = γ (T0000 + βT3000 + βT0030 + β T3030 ),
T0⊥ = γ(T00⊥0 + βT30⊥0 ), 2 2 T03 = γ(T030 + βT000 ) = γ (T0030 + βT3030 + βT0000 + β T3000 ),
T⊥⊥ = T⊥0⊥0 ,
T3⊥ = γ(T30⊥0 + βT00⊥0 ), 2 2 T33 = γ(T330 + βT300 ) = γ (T3030 + βT0030 + βT3000 + β T0000 ), (1.6.102)
T where the index ⊥ denotes either 1 or 2, and the transposed components Tik = Tki transform like the transpositions of the right-hand sides in (1.6.102). If Tik is antisymmetric then T03 = T0030 . The relations (1.6.95) can be written as V dt = γ dt0 + dz0 , c2 dx = dx0, dy = dy0, dz = γ(dz0 + V dt0), (1.6.103)
which gives
0 vx vx = 0 2 , γ(1 + V vz/c ) 0 vy vy = 0 2 , γ(1 + V vz/c ) 0 vz + V vz = 0 2 , (1.6.104) 1 + V vz/c
55 where dx dx0 v = , v0 = . (1.6.105) dt dt0 Two special Lorentz transformations in the same direction commute because of (1.6.39). If a Lorentz 0 00 transformation from K to K has parameters β1 and γ1, and a Lorentz transformation from K to 0 00 K has parameters β2 and γ2, then a Lorentz transformation from K to K has parameters β3 and γ3 such that β1 + β2 β3 = , γ3 = γ1γ2(1 + β1β2). (1.6.106) 1 + β1β2 For a boost along an arbitrary direction, (1.6.99) gives the Lorentz transformation of velocities:
v0 + γV + (γ − 1)(v0 · V)V/V 2 v = . (1.6.107) γ(1 + v0 · V/c2)
If v0 = |V0| = c then v = |V| = c, in agreement with the constancy of the speed of propagation of interaction. Let us consider two points at rest in an inertial frame of reference K with positions z1 and z2, 0 thereby the distance between them is ∆z = z2 − z1. In the inertial frame K , moving relative to K 0 0 0 0 0 0 in the z-direction with speed V , z1 = γ(z1 +V t1) and z2 = γ(z2 +V t2), thereby if t1 = t2 is the time 0 0 0 at which we measure (simultaneously) the positions of the two points then ∆z = γ(z2 − z1) = γ∆z . Therefore, the length of an object in K0, whose length in the rest frame K is l (proper length), is
l l0 = < l, (1.6.108) γ
which is referred to as the Lorentz-FitzGerald contraction. The volume of an object in K0, whose volume in the rest frame K is V (proper volume), is
V V 0 = . (1.6.109) γ Let us suppose that there are two rods of equal lengths, moving parallel relative to each other. From the point of view of an observer moving with the first rod, the second one is shorter, and from the point of view of an observer moving with the second rod, the first one is shorter. There is no contradiction in this statement because the positions of both ends of a rod must be measured simultaneously and the simultaneity is not invariant: from the transformation law (1.6.95) it follows that if δt = 0 then δt0 6= 0 and if δt0 = 0 then δt 6= 0. Let us consider a clock (any mechanism with a periodic or evolutionary behavior) at rest in K0 0 0 0 with position z ; the time difference between two events with t1 and t2, as measured by this clock, 0 0 0 0 0 2 0 0 2 is ∆t = t2 − t1. In the frame K, t1 = γ(t1 + V z /c ) and t2 = γ(t2 + V z /c ), thereby
0 0 ∆t = t2 − t1 = γ∆t > ∆t . (1.6.110)
Therefore, the rate of time is slower for moving clocks than those at rest (time dilation), in agreement with (1.4.120) and (1.4.126), from which c2dτ 2 = c2dt2 − dl2 and 1 dτ = dt. (1.6.111) γ
Let us suppose that there are two clocks linked to the inertial frames K and K0, and that when the clock in K passes by the clock in K0 the readings of the two clocks coincide. From the point of view of an observer in K clocks in K0 go more slowly, and from the point of view of an observer in K0 clocks in K go more slowly. There is no contradiction in this statement because to compare the rates of the two clocks in K and K0 we must compare the readings of the same moving clock in K0 with different clocks in K; we require several clocks in one frame and one in the other, thus the measurement process is not symmetric with respect to the two frames of reference. The clock that
56 goes more slowly is the one which is being compared with different clocks in the other frame. The time interval measured by a clock is equal to the integral 1 Z ∆t = ds (1.6.112) c along its world line. Since the world line is a straight line for a clock at rest and a curved line for a clock moving such that it returns to the starting point, the integral R ds taken between two world points has its maximum value if it is taken along the straight line connecting these two points. For a Lorentz transformation with speed V = |V|, (1.6.107) gives
v0sinθ0 tanθ = , (1.6.113) γ(v0cosθ0 + V )
where θ is the angle between v and V, and θ0 is the angle between v0 and V. If v = v0 = c then
cosθ0 + V cosθ = c , (1.6.114) V 0 1 + c cosθ which is referred to as the aberration of a signal. Let us suppose that an observer in frame K 1 c measures a periodic signal with period T , frequency ν = T and wavelength λ = ν , propagating in the −z direction; the number of pulses in time dt is n = νdt. A second observer in frame K0, moving in the z direction with speed V relative to the first one, travels a distance V dt and measures V dt 0 V 0 0 dt λ more pulses: n = ν(1 + c )dt. Because the time interval dt with respect to K is dt = γ , the 0 0 V frequency of the signal in K is ν = γν(1 + c ) or ν0 = eην. (1.6.115)
This dependence of the frequency of a signal on a frame of reference is referred to as the Doppler effect. When c → ∞ (at which γ → 1) the above formulae, referring to relativistic kinematics, reduce to their nonrelativistic limit. The Lorentz transformation (1.6.99) reduces to the Galilei transformation,
t = t0, x = x0 + Vt0, (1.6.116)
so the time is an absolute (invariant) quantity in nonrelativistic (Newtonian) physics. Any two Galilei transformations commute. The transformation law for velocities (1.6.107) reduces to the simple addition of vectors, v = v0 + V. (1.6.117)
1.6.8 Four-acceleration In the Galilean frame, the line element in (1.4.117) is equal to
P vαvα 1/2 v2 1/2 cdt ds = cdt 1 − α = cdt 1 − = . (1.6.118) c2 c2 γ
This line element is a special case of that in (1.4.134) for g00 = 1 and g0α = 0. The corresponding differential of the proper time is equal to (1.6.111). In a locally inertial frame of reference, the components of the four-velocity in the Cartesian coordinates are
dx0 cdt dxα dxα γ u0 = = = γ, uα = = = vα, (1.6.119) ds ds ds cdt/γ c
which can be written as γ γ ui = γ, v , u = γ, − v , (1.6.120) c i c
57 where v is the velocity (1.6.87) and γ = √ 1 (confer (1.6.94)). The spatial components uα in 1−v2/c2 (1.6.119) coincide with those in (1.4.136), whereas u0 in (1.6.119) is a special case of that in (1.4.136) for g00 = 1 and g0α = 0. We define the four-acceleration:
Dui D2xi wi = = = ukui . (1.6.121) ds ds2 ;k This vector is orthogonal to ui because of (1.4.14): 1 D wiu = (uiu ) = 0, (1.6.122) i 2 ds i thus having 3 independent components. In a locally inertial frame of reference, the four-acceleration is given by dui d2xi wi = = = ukui . (1.6.123) ds ds2 ,k Its components in the Cartesian coordinates are
du0 dγ dx0 γ dγ 1 d 1 d 1 v2 −2 v dv w0 = = = = (γ2) = = 1 − · ds cdt ds c dt 2c dt 2c dt 1 − v2/c2 c2 c3 dt γ4 = v · a, c3 duα duα dx0 γ d γ2 vα dγ γ2 γ4 wα = = = (γvα) = aα + γ = aα + (v · a)vα, (1.6.124) ds cdt ds c2 dt c2 c2 dt c2 c4 which can be written as γ4 γ2 γ4 γ4 γ2 γ4 wi = v · a, a + (v · a)v , w = v · a, − a − (v · a)v , (1.6.125) c3 c2 c4 i c3 c2 c4 where a is the three-dimensional acceleration vector: dvα d2xα dv d2x aα = = , a = = . (1.6.126) dt dt2 dt dt2 The invariant square of the four-acceleration is thus
γ8 γ2 γ4 2 γ4 γ2 wiw = (v · a)2 − a + (v · a)v = − a2 + (v · a)2 . (1.6.127) i c6 c2 c4 c4 c2 If v = 0 at a given instant of time, the corresponding frame of reference is referred to as the instantaneous rest frame. In this frame
a2 wiw = − , (1.6.128) i c4 so 2p i a0 = c −w wi (1.6.129) is the magnitude of the acceleration in the instantaneous rest frame, called the proper acceleration. Along an affine geodesic, the four-acceleration with respect to the affine connection (1.6.121) vanishes because of (1.2.59). Along a metric geodesic, the four-acceleration with respect to the Levi-Civita connection (defined by (1.6.121) with colon instead of semicolon) vanishes because of (1.4.91). The equation of geodesic deviation (1.3.52) determines the relative four-acceleration of two bodies moving along two infinitely close affine geodesics. Let us suppose that a noninertial frame K0 moves with velocity v relative to an inertial frame of reference K. If the velocity of K0 changes by dv0 relative to the initial frame K0, then it changes by dv relative to K. In the nonrelativistic limit, the two changes are equal, dv = dv0, and K0 does
58 not rotate with respect to K. In relativistic kinematics, these changes are different because of the Thomas-Wigner rotation. The velocity of K0 relative to K after the change, v + dv, is equal to dv0 boosted by v. Using the Lorentz transformation (1.6.107), in which v0 is replaced with dv0 and V is replaced with v, we obtain
dv0 + γv + (γ − 1)(dv0 · v)v/v2 v + dv = , (1.6.130) γ(1 + dv0 · v/c2)
where γ = (1 − v2/c2)−1/2. Keeping only terms linear in dv0, we have
dv0 γ − 1 v v + dv = + v(1 − dv0 · v/c2) + (dv0 · v) , (1.6.131) γ γ v2 which gives v × dv0 v × dv = . (1.6.132) γ The angle of infinitesimal rotation from the nonrelativistic sum v + dv0 to the relativistic v + dv determines the relativistic rotation of K0 with respect to K. Using (1.4.189) with sin(dθ) ≈ dθ and dθ = n dθ, where n is a unit vector parallel to the axis of rotation, gives
(v + dv0) × (v + dv) v × (dv − dv0) 1 − γ dθ = ≈ = (v × dv). (1.6.133) v2 v2 v2 Consequently, we obtain the angular velocity of the Thomas precession:
dθ γ − 1 γ2 a × v Ω = = (a × v) = , (1.6.134) dt v2 γ + 1 c2 where a = dv/dt is the acceleration of K0 relative to K. When v c, a × v Ω ≈ . (1.6.135) 2c2
References: [2, 3].
1.7 Spinors 1.7.1 Spinor representation of Lorentz group Let γa be the coordinate-invariant 4×4 Dirac matrices defined as
a b b a ab γ γ + γ γ = 2η I4, (1.7.1) where I4 is the unit 4×4 matrix (4 is the lowest dimension for which (1.7.1) has solutions). Accord- i i a ingly, the spacetime-dependent Dirac matrices, γ = eaγ , satisfy
i j j i ij γ γ + γ γ = 2g I4. (1.7.2)
Under a tetrad rotation, (1.5.15) gives a a b γ˜ = Λ bγ . (1.7.3) Let L be a 4×4 matrix such that
a a b −1 a −1 γ = Λ bLγ L = Lγ˜ L , (1.7.4)
−1 −1 −1 where L is the matrix inverse to L: LL = L L = I4. The condition (1.7.4) represents the constancy of the Dirac matrices γa under the combined tetrad rotation and transformation γ → LγL−1. We refer to L as the spinor representation of the Lorentz group. The relation (1.7.4) gives
59 a the matrix L as a function of the Lorentz matrix Λ b. For an infinitesimal Lorentz transformation (1.6.7), the solution for L is 1 1 L = I + Gab,L−1 = I − Gab, (1.7.5) 4 2 ab 4 2 ab where Gab are the generators of the spinor representation of the Lorentz group: 1 Gab = (γaγb − γbγa). (1.7.6) 4 A spinor ψ is defined as a quantity that, under tetrad rotations, transforms according to
ψ˜ = Lψ. (1.7.7)
An adjoint spinor ψ¯ is defined as a quantity that transforms according to
ψ¯˜ = ψL¯ −1, (1.7.8)
so the product ψψ¯ is a scalar: ψ¯˜ψ˜ = ψψ.¯ (1.7.9) The indices of the γa and L that are implicit in the 4×4 matrix multiplication in (1.7.1), (1.7.2) and (1.7.4) are spinor indices. The relation (1.7.4) shows that the Dirac matrices γa can be regarded as quantities that have, in addition to the invariant index a, one spinor index and one adjoint-spinor index. The product ψψ¯ transforms like the Dirac matrices:
ψ˜ψ¯˜ = LψψL¯ −1. (1.7.10)
The spinors ψ and ψ¯ can be used to construct tensors. For example, ψγ¯ aψ transforms like a contravariant Lorentz vector:
¯ a ¯ −1 a b −1 a ¯ b ψγ ψ → ψL Λ bLγ L Lψ = Λ bψγ ψ. (1.7.11)
1.7.2 Spinor connection The derivative of a spinor does not transform like a spinor: ˜ ψ,i = Lψ,i + L,iψ. (1.7.12)
If we introduce the spinor connection Γi that transforms according to
−1 −1 Γ˜i = LΓiL + L,iL , (1.7.13)
then a covariant derivative of a spinor,
ψ;i = ψ,i − Γiψ, (1.7.14)
is a spinor: ˜ ˜ ˜ −1 −1 ψ;i = ψ,i − Γ˜iψ = Lψ,i + L,iψ − (LΓiL + L,iL )Lψ = Lψ;i. (1.7.15) Because ψψ¯ is a scalar, ¯ ¯ (ψψ);i = (ψψ),i, (1.7.16) the chain rule for covariant differentiation gives the covariant derivative of an adjoint spinor, ¯ ¯ ¯ ψ;i = ψ,i + ψΓi. (1.7.17)
We also have ¯ ¯ ψ|i = ψ;i, ψ|i = ψ;i. (1.7.18)
60 The Dirac matrices γa transform like ψψ¯, whose covariant derivative is ¯ ¯ ¯ ¯ ¯ ¯ ¯ ¯ (ψψ);i = ψ;iψ + ψψ;i = (ψψ),i − Γiψψ + ψψΓi = (ψψ),i − [Γi, ψψ]. (1.7.19) Therefore, the covariant derivative of a Dirac matrix is a a a a γ ;i = γ ,i − [Γi, γ ] = −[Γi, γ ], (1.7.20) which gives j j j j k j γ ;i = γ |i = γ ,i + Γk iγ − [Γi, γ ]. (1.7.21) Accordingly, we obtain a a b a γ |i = ω biγ − [Γi, γ ]. (1.7.22) ¯ i The quantity ψγ ψ|i transforms under Lorentz rotations like a scalar: ¯ i ¯ −1 i −1 ¯ i ψγ ψ|i → ψL Lγ L Lψ|i = ψγ ψ|i. (1.7.23)
The relation ηab|i = 0 infers that a γ |i = 0, (1.7.24) a because the Dirac matrices γ only depend on ηab. Multiplying both sides of (1.7.22) by γa from the left gives a b a ωabiγ γ − γaΓiγ + 4Γi = 0. (1.7.25) We seek the solution of (1.7.25) in the form 1 Γ = − ω γaγb − A , (1.7.26) i 4 abi i
where Ai is a spinor-tensor quantity with one vector index. Substituting (1.7.26) to (1.7.25), together a b c ab with the identity γcγ γ γ = 4η , gives a − γaAiγ + 4Ai = 0, (1.7.27)
so Ai is an arbitrary vector multiple of I4. Therefore, the spinor connection Γi is given, up to the addition of an arbitrary vector multiple of I4, by the Fock-Ivanenko coefficients: 1 1 Γ = − ω γaγb = − ω Gab. (1.7.28) i 4 abi 2 abi Using the definition (1.5.22), we can also write (1.7.28) as 1 1 Γ = − ej [γ , γc] = [γj , γ ]. (1.7.29) i 8 c;i j 8 ;i j For the Levi-Civita connection, the covariant derivative of a spinor (1.7.14) becomes
{} ψ:i = ψ,i − Γi ψ, (1.7.30) and the covariant derivative of an adjoint spinor (1.7.17) becomes ¯ ¯ ¯ {} ψ:i = ψ,i + ψΓi , (1.7.31) {} where the Levi-Civita spinor connection Γi is given, similarly to (1.7.28), by 1 1 Γ{} = − $ γaγb = − $ Gab. (1.7.32) i 4 abi 2 abi Substituting (1.5.39) into (1.7.32) gives 1 1 Γ = Γ{} − C γjγk = Γ{} − C γ[jγk]. (1.7.33) i i 4 jki i 4 jki Accordingly, (1.7.30) and (1.7.31) yield 1 ψ = ψ + C γiγjψ, (1.7.34) ;k :k 4 ijk 1 ψ¯ = ψ¯ − C ψγ¯ iγj. (1.7.35) ;k :k 4 ijk
61 1.7.3 Curvature spinor The commutator of total covariant derivatives of a spinor is
k k ψ|ji − ψ|ij = (ψ|j),i − Γiψ|j − Γj iψ|k − (ψ|i),j + Γjψ|i + Γi jψ|k k k = −Γj,iψ + ΓiΓjψ + Γi,jψ − ΓjΓiψ + 2S ijψ|k = Kijψ + 2S ijψ|k, (1.7.36)
where Kij = −Kji is defined as Kij = Γi,j − Γj,i + [Γi, Γj]. (1.7.37) Substituting (1.7.13) to (1.7.37) gives
−1 −1 K˜ij = Γ˜i,j − Γ˜j,i + [Γ˜i, Γ˜j] = L(Γi,j − Γj,i + [Γi, Γj])L = LKijL , (1.7.38)
a so Kij transforms under tetrad rotations like the Dirac matrices γ , that is, Kij is a spinor with one spinor index and one adjoint-spinor index. We refer to Kij as the curvature spinor. The relation (1.7.24) leads to k γ |i = 0. (1.7.39) Therefore, the commutator of covariant derivatives of the spacetime-dependent Dirac matrices van- ishes: k k l l k k k l k 2γ |[ji] = R lijγ + 2S ijγ |l + [Kij, γ ] = R lijγ + [Kij, γ ] = 0. (1.7.40)
Multiplying both sides of (1.7.40) by γk from the left gives
k l k Rklijγ γ + γkKijγ − 4Kij = 0. (1.7.41)
We seek the solution of (1.7.41) in the form 1 K = R γkγl + B , (1.7.42) ij 4 klij ij
where Bij is a spinor-tensor quantity with two vector indices. Substituting (1.7.42) to (1.7.41) gives
k γkBijγ − 4Bij = 0, (1.7.43)
so Bij is an antisymmetric-tensor multiple of I4. The tensor Bij is related to the vector Ai in (1.7.26) by Bij = Aj,i − Ai,j + [Ai,Aj]. (1.7.44)
Because ψ has no indices other than spinor indices, Ai is a vector and [Ai,Aj] = 0. The invariance of (1.7.41) under the addition of an antisymmetric-tensor multiple Bij of the unit matrix to the curvature spinor is related to the invariance of (1.7.25) under the addition of a vector multiple Ai of the unit matrix to the spinor connection. Setting Ai = 0, which corresponds to the Fock-Ivanenko spinor connection, gives Bij = 0. Therefore, the curvature spinor Kij is given, up to the addition of an arbitrary antisymmetric-tensor multiple of I4, by 1 1 K = R γkγl = R Gkl. (1.7.45) ij 4 klij 2 klij
References: [3, 4].
Spacetime is a fabric in which various fields representing matter exist. These fields can be de- scribed by vectors, tensors, and spinors. They satisfy the equations derived from two fundamental principles: the principle of relativity and the principle of least action. The physics of fields is referred to as field theory and constitutes Chapter 2 (Fields).
62 2 Fields 2.1 Principle of least action The most general formulation of the law that governs the dynamics of classical systems is Hamilton’s principle of least action, according to which every classical system is characterized by a definite scalar-density function L, and the dynamics of the system is such that a certain condition is satisfied. i Let φA(x ) be a set of physical fields (indexed by A), being differentiable functions of the coordinates, and let L be a Lorentz covariant quantity constructed from the fields φA and their derivatives. Let us consider a scalar quantity 1 Z S = LdΩ, (2.1.1) c
where the integration is over some region in locally Minkowski spacetime. Let δφA be arbitrary and independent, small changes of φA (regarded as a dynamical variable) over the region of integration, which vanish on the boundary. Then the change in S can be written as X δS = δAS, (2.1.2) A where 1 Z δ S = F δφ dΩ. (2.1.3) A c A A The principle of least action states that the dynamics of a physical system is given by the condition the scalar S be a local minimum. Therefore, any infinitesimal change in the dynamics of the system does not alter the value of S: δS = 0 (2.1.4) (S is a local extremum). The condition (2.1.4) is referred to as the principle of stationary action, which is the necessary part of the principle of least action. If L is covariant and φA transform covariantly under the Lorentz group, the variational condition (2.1.4) gives the Lorentz covariant equations FA = 0. (2.1.5) These equations are also invariant for any other transformations (internal symmetries) for which L is invariant. L is referred to as the Lagrangian density, S is the action functional, δS = 0 is the principle of least action, and (2.1.5) are the field equations. The field equations of a physical system are the result of the action being a local extremum. The condition that the action be a local minimum imposes additional restrictions on possible choices for S. The number of independent field equations for a given system is referred to as the number of the degrees of freedom representing this system. In most physical cases L contains only φA and their first derivatives. A Lagrangian density containing higher derivatives can always be written in terms of first derivatives by increasing the number of the components φA. Let us consider a physical system in the Galilean frame of reference. If L depends only on φ and ∂iφ, L = L(φ, φ,i), then 1 Z ∂L ∂L 1 Z ∂L ∂L δS = δφ + δ(φ,i) dΩ = δφ + (δφ),i dΩ c ∂φ ∂(φ,i) c ∂φ ∂(φ,i) 1 Z ∂L ∂L ∂L = δφ − ∂i δφ + ∂i δφ dΩ. (2.1.6) c ∂φ ∂(φ,i) ∂(φ,i) The last term in the integrand in the second line of (2.1.6) is a divergence. Its four-volume integral can be transformed, using the Gauß-Stokes theorem (1.1.39), into a hypersurface integral over the boundary of the integration region. Since δφ = 0 on the boundary, this term does not contribute to the variation of the action: 1 Z ∂L ∂L Z ∂L 1 Z ∂L ∂L δS = − ∂i δφdΩ + δφdSi = − ∂i δφdΩ. (2.1.7) c ∂φ ∂(φ,i) ∂(φ,i) c ∂φ ∂(φ,i)
63 If δS = 0 for arbitrary variations δφ that vanish on the boundary, then
∂L ∂L − ∂i = 0. (2.1.8) ∂φ ∂(φ,i) Defining the variational derivative of L with respect to φ,
δL ∂L ∂L = − ∂i , (2.1.9) δφ ∂φ ∂(φ,i)
we can write (2.1.8) as δL = 0. (2.1.10) δφ
The set of equations (2.1.8), for each field φA, is referred to as the Lagrange equations. There is some arbitrariness in the choice of L; adding to it the divergence of an arbitrary vector density or multiplying it by a constant produces the same field equations. If a system consists of two noninteracting parts A and B, with corresponding Lagrangian densitites LA(φA, ∂φA) and LB(φB, ∂φB), then the Lagrangian density for this system is the sum LA + LB. This additivity of the Lagrangian density means that the field equations for either of the two parts do not involve quantities pertaining to the other part. If LA also depends on φB and/or ∂φB, and/or LB depends on φA and/or ∂φA, then the subsystems A and B interact. References: [1, 2, 3].
2.2 Action for gravitational field Let us consider a Lagrangian density L that depends on the affine (or spin) connection and its first derivatives. Such Lagrangian density can be decomposed into the covariant part Lg that contains derivatives of the affine/spin connection, which is referred to as the Lagrangian density for the gravitational field, and the covariant part Lm that does not contain these derivatives, which is referred to as the Lagrangian density for matter:
L = Lg + Lm. (2.2.1)
The simplest covariant scalar that can be constructed from the affine/spin connection and its first derivatives is the Ricci scalar R. The corresponding Lagrangian√ density for the gravitational field is proportional to the product of R and the scalar density −g:
1 √ 1 √ L = − −gR = − −g P − gik(2Cl + Cj Cl − Cl Cm ) , (2.2.2) g 2κ 2κ il:k ij kl im kl where κ is Einstein’s gravitational constant and we used (1.4.67). The action for the gravitational field is thus 1 Z 1 Z √ S = L dΩ = − R −gdΩ. (2.2.3) g c g 2κc The metric tensor and the affine connection are two fundamental quantities describing a gravi- tational field. Since the affine connection is metric-compatible, given by (1.4.34), it is a function of the metric tensor, its derivatives and the torsion tensor. Accordingly, the metric and torsion tensors are dynamical variables in varying the action. Equivalently, the tetrad and spin connection can be taken as dynamical variables. Let us consider the Riemannian part of the Lagrangian density for the gravitational field (2.2.2), which is proportional to the Riemann scalar P : 1 √ L{} = − −gP. (2.2.4) g 2κ
64 √ i The scalar density −gP is linear in first derivatives of the Christoffel symbols {k l}: √ √ ik l l m l m l −gP = −gg ({i k},l − {i l},k + {i k }{m l} − {i l }{m k}) √ ik l l √ ik √ ik l l √ ik = ( −gg {i k}),l − {i k}( −gg ),l − ( −gg {i l}),k + {i l}( −gg ),k √ ik m l m l + −gg ({i k }{m l} − {i l }{m k}). (2.2.5) √ We can therefore subtract from −gP total derivatives without altering the field equations, replacing it by a noncovariant quantity G that does not contain first derivatives of the Christoffel symbols:
√ √ ik l √ ik l G = −gP − ( −gg {i k}),l + ( −gg {i l}),k l √ ik l √ ik √ ik m l m l = {i l}( −gg ),k − {i k}( −gg ),l + −gg ({i k }{m l} − {i l }{m k}) l √ ik j √ ik √ i jk √ k ij = {i l} ( −gg ):k + {j k} −gg − −g{j k}g − −g{j k}g l √ ik j √ ik √ i jk √ k ij −{i k} ( −gg ):l + {j l} −gg − −g{j l}g − −g{j l}g √ ik m l m l l j √ ik √ i jk + −gg ({i k }{m l} − {i l }{m k}) = {i l} {j k} −gg − −g{j k}g √ k ij l j √ ik √ i jk √ k ij − −g{j k}g − {i k} {j l} −gg − −g{j l}g − −g{j l}g √ ik m l m l √ ik m l m l + −gg ({i k }{m l} − {i l }{m k}) = −gg ({i l }{m k} − {i k }{m l}). (2.2.6) We also define G G = √ = gik({ m}{ l } − { m}{ l }). (2.2.7) −g i l m k i k m l The Riemannian part (2.2.4) of the Lagrangian density for the gravitational field reduces accordingly to 1 1 √ L{} = − G = − −gG. (2.2.8) g 2κ 2κ Any coordinate transformation results in variations of gik, thereby 1 Z 1 Z √ S{} = L{}dΩ = − P −gdΩ (2.2.9) g c g 2κc is not necessarily a minimum with respect to these variations (only an extremum) because not all δgik correspond to actual variations of the gravitational field. In order to exclude the variations δgik resulting from changing the coordinates, we must impose on the metric tensor 4 arbitrary constraints. If we choose g0α = 0, |gαβ| = const, (2.2.10) then G becomes 1 G = − g00gαβgγδg g . (2.2.11) 4 αγ,0 βδ,0 In the locally Galilean frame of reference, gαβ = −δαβ, thereby 1 G = − g00(g )2. (2.2.12) 4 αβ,0
00 {} For physical systems, g > 0. Therefore, in order for Sg to have a minimum, κ must be positive, {} otherwise an arbitrarily rapid change of gαβ in time would result in an arbitrarily low value of Sg and there would be no minimum of S. References: [2, 3].
2.3 Matter 2.3.1 Metric energy-momentum tensor The variation of the action for matter, 1 Z S = L dΩ, (2.3.1) m c m
65 with respect to the metric tensor: 1 Z 1 Z δS = T δgijdΩ = − T ijδg dΩ, (2.3.2) m 2c ij 2c ij
defines the metric energy-momentum density Tij. This tensor density is symmetric:
Tij = Tji. (2.3.3) Equivalently, we have δL ∂L ∂L T = 2 m = 2 m − 2∂ m . (2.3.4) ij δgij ∂gij k ij ∂(g ,k) The metric energy-momentum tensor is defined as T T = √ ij . (2.3.5) ij −g
2.3.2 Tetrad energy-momentum tensor The variation of the matter action (2.3.1) with respect to the tetrad: 1 Z δS = T aδei dΩ, (2.3.6) m c i a a defines the tetrad energy-momentum density Ti . Equivalently a i δLm = Ti δea (2.3.7) or a δLm Ti = i . (2.3.8) δea The corresponding tensor density with two coordinate indices is a Tij = eajTi . (2.3.9) The tetrad energy-momentum tensor is defined as T t = ij . (2.3.10) ij e This tensor is generally not symmetric.
2.3.3 Canonical energy-momentum density
A matter Lagrangian density Lm can be written as Lm = eL, where L is a scalar. If L depends on matter fields φ and their covariant derivatives φ|i, then such fields are said to be minimally coupled to the affine connection. If these fields are written in terms of Lorentz indices instead of vector i indices, then the tetrad appears in L only through a covariant combination eaφ|i. Varying L with respect to the tetrad gives, using (1.5.13), a i ∂L i a i ∂Lm a i δLm = eδL − eei Lδea = e φ|iδea − Lmei δea = φ|i − ei Lm δea. (2.3.11) ∂φ|a ∂φ|a The last term in (2.3.11), a ∂Lm a Θi = φ|i − ei Lm, (2.3.12) ∂φ|a is referred to as the canonical energy-momentum density. The corresponding tensor density with two coordinate indices is
i ∂Lm i ∂Lm i Θj = φ|j − δjLm = φ|j − δjLm. (2.3.13) ∂φ|i ∂φ,i Comparing (2.3.11) with (2.3.7) shows that the canonical energy-momentum density is identical with the tetrad energy-momentum density: a a Θi = Ti . (2.3.14)
66 2.3.4 Spin tensor The variation of the matter action (2.3.1) with respect to the spin connection, 1 Z δS = S iδωab dΩ, (2.3.15) m 2c ab i
i defines the spin density Sab : i δLm ∂Lm Sab = 2 ab = 2 ab , (2.3.16) δω i ∂ω i which is antisymmetric in the Lorentz indices:
i i Sab = −Sba . (2.3.17)
The second equality in (2.3.16) is satisfied because a matter Lagrangian density Lm may depend on ab the spin connection but not on its derivatives; a scalar density depending on derivatives of ω i is a ab i Lagrangian density for the gravitational field. The variations δω i are independent of δea, thereby the spin density is independent of the energy-momentum density. The relation (1.5.35) indicates that the spin density with three coordinate indices, which is antisymmetric in the first two indices, is generated by the contortion tensor:
k k δLm Sij = −Sji = 2 ij . (2.3.18) δC k
Accordingly, the variation of Lm with respect to the torsion tensor,
jk δLm τi = 2 i , (2.3.19) δS jk is a homogeneous linear function of the spin connection because of (1.4.35):
δL δL ∂Clmn τ = 2 m = 2 m = S (δlδmδn + δmδn δl + δnδmδl ) ijk δSijk δClmn ∂Sijk lmn i [j k] i [j k] i [j k] = Sijk − Sjki + Skij, (2.3.20)
Sijk = τ[ij]k, (2.3.21) antisymmetric in the last two indices: τijk = −τikj. (2.3.22)
The variation of Lm with respect to the metric-compatible affine connection in the metric-affine variational formulation of gravity is equivalent to the variation with respect to the torsion (or contortion) tensor. ab The spin connection ω i appears in Lm only through covariant derivatives of φ, in a combination ∂L − Γiφ, where ∂φ,i 1 Γ = − ω Gab (2.3.23) i 2 abi is the connection in the covariant derivative of φ:
φ|i = φ,i − Γiφ. (2.3.24)
i Consequently, the spin density Sab is identical with
i i ∂Lm Σab = −Σba = Gabφ, (2.3.25) ∂φ,i referred to as the canonical spin density. The spin tensor is defined as S s = ijk . (2.3.26) ijk e
67 2.3.5 Belinfante-Rosenfeld relation The total variation of the matter action with respect to geometrical variables is either 1 Z 1 Z δS = dΩT aδei + dΩS iδωab (2.3.27) m c i a 2c ab i or 1 Z 1 Z δS = dΩT δgik + dΩτ ikδSj . (2.3.28) m 2c ik 2c j ik The relation (1.5.5) gives 1 Z 1 Z Z dΩT δgik = dΩT (δei ek + ei δek)ηab) = dΩT ekaδei , (2.3.29) 2 ik 2 ik a b a b ik a and (1.5.31) gives 1 Z 1 Z dΩτ ikδSj = dΩτ ik δ(ej e ωab ) + δea ej + ea δej 2 j ik 2 j a ib k i,k a i,k a 1 Z = dΩ τ liδ(ej e )ωab + τ iδωab + (τ ikej δea) − (τ ikej ) δea + τ ikea δej 2 j a lb i ab i j a i ,k j a ,k i j i,k a 1 Z = dΩ τ lkωcb e δej + τ liωab ej δe + τ iδωab − (τ ikej ) δea + τ lmeb δej 2 j k lb c j i a lb ab i j a ,k i j l,m b 1 Z 1 Z + dS τ ikej δea = dΩ −τ lkωcb e ei ej δea + τ ilωb ejδea + τ iδωab 2 k j a i 2 j k lb c a i j al b i ab i ik i jk ij b ik a lm b i j a −(τa |k − S jkτa − 2Sjτa + ω akτb )δei − τj el,mebeaδei 1 Z = dΩ τ iδωab − τ ik ej δea + 2S τ ijδea . (2.3.30) 2 ab i j ;k a i j a i Comparing (2.3.27) with (2.3.28) leads to Z 1 Z 1 Z 1 Z dΩT aδei + dΩS iδωab = dΩT δgik + dΩ τ iδωab − τ ik ej δea i a 2 ab i 2 ik 2 ab i j ;k a i Z 1 Z 1 Z +2S τ ijδea = dΩT ekaδei + dΩτ iδωab + dΩτ jk eaδei j a i ik a 2 ab i 2 i ;k j a Z kj a b i − dΩSjτb ekei δea. (2.3.31)
ab i The terms in (2.3.31) with δω i give (2.3.21), while the terms with δea give 1 T a = T eka + τ jk ea − S τ aj (2.3.32) i ik 2 i ;k j j i or 1 T = T − ∇ (S j − S j + Sj ) + S (S j − S j + Sj ). (2.3.33) ik ik 2 j ik k i ik j ik k i ik Equation (2.3.33) is referred to as the Belinfante-Rosenfeld relation between the metric and tetrad energy-momentum densites. The Belinfante-Rosenfeld relation can be written, after dividing by e, as a tensor equation: 1 T = t − ∇∗(s j − s j + sj ), (2.3.34) ik ik 2 j ik k i ik ∗ where ∇j is given by (1.2.45). In the absence of spin, (2.3.33) and (2.3.34) reduce to
Tik = Tik, tik = Tik. (2.3.35)
References: [2, 3, 4, 7, 9]
68 2.4 Symmetries and conservation laws 2.4.1 Noether theorem Let us consider a physical system, described by a Lagrangian density L that depends on matter i fields φ, their first derivatives φ,i, and the coordinates x . The change of the Lagrangian density δL under an infinitesimal coordinate transformation (1.2.66) is thus ¯ ∂L ∂L ∂L i δL = δφ + δ(φ,i) + i ξ , (2.4.1) ∂φ ∂φ,i ∂x ¯ where the changes δφ and δ(φ,i) are brought about by the transformation (1.2.66) and ∂ denotes i partial differentiation with respect to x at constant φ and φ,i. The variation δL brought about by this transformation is also given by (1.2.75):
i δL = −ξ ,iL. (2.4.2) Using the Lagrange equations (2.1.8) and the identities
∂¯L ∂L ∂L L,i = i + φ,i + φ,ji, (2.4.3) ∂x ∂φ ∂φ,j j δ(φ,i) = (δφ),i − ξ ,iφ,j, (2.4.4) we bring (2.4.1) to i ∂L j δL = ξ L,i + (δφ − ξ φ,j) . (2.4.5) ∂φ,i ,i Combining (2.4.2) and (2.4.5) gives the conservation law,
i J ,i = 0, (2.4.6) for the Noether current:
i i ∂L j i ∂L ¯ J = ξ L + (δφ − ξ φ,j) = ξ L + δφ. (2.4.7) ∂φ,i ∂φ,i Equation (2.4.6) represents the Noether theorem, which states that to each continuous symmetry of a Lagrangian density there corresponds a conservation law.
2.4.2 Conservation of spin The Lorentz group is the group of tetrad rotations. Since a physical matter Lagrangian density Lm(φ, φ,i) is invariant under local, proper Lorentz transformations, it is invariant under tetrad rotations: ∂Lm ∂Lm a i 1 i ab δLm = δφ + δ(φ,i) + Ti δea + Sab δω i = 0, (2.4.8) ∂φ ∂φ,i 2 where the changes δ are brought about by a tetrad rotation. Under integration of (2.4.8) over spacetime, the first two terms vanish because of the Lagrange equations for φ (2.1.8): Z 1 T aδei + S iδωab dΩ = 0. (2.4.9) i a 2 ab i
a For an infinitesimal Lorentz transformation (1.6.7), the tetrad ei changes by
a a a a b a a δei =e ˜i − ei = Λ bei − ei = i, (2.4.10)
i a j and the tetrad ea, because of the identity δ(ei ea) = 0, according to
i i δea = − a. (2.4.11)
69 The spin connection changes by
ab a jb a jb a jb a cb a jb a bc ab δω i = δ(ej ω i) = jω i − ej ;i = cω i − ej |i + cω i = − |i. (2.4.12)
Substituting (2.4.11) and (2.4.12) to (2.4.9), together with partial integration (1.2.43), gives
Z 1 Z 1 − T ai + S iab dΩ = − T ij + S kij dΩ i a 2 ab |i ij 2 ij |k Z 1 = −T − S S k + S k ijdΩ = 0. (2.4.13) [ij] k ij 2 ij ;k
Since the infinitesimal Lorentz rotation ij is arbitrary, we obtain the covariant conservation law for the spin density (6 equations):
k k Sij ;k = Tij − Tji + 2SkSij . (2.4.14) Dividing this law by e gives the conservation law for the spin tensor:
∗ k ∇ksij = tij − tji. (2.4.15) The conservation law (2.4.15) also results from antisymmetrizing the Belinfante-Rosenfeld relation k (2.3.34) with respect to the indices i, k. If we use the metric-compatible affine connection Γi j, which ab is invariant under tetrad rotations, instead of the spin connection ω i as a variable in Lm, then we ab i must replace the term with δω i in (2.4.8) by a term with δ(ea,j).
2.4.3 Conservation of metric energy-momentum The metric and torsion tensors can be taken, instead of the tetrad and spin connection, as the dynamical variables describing spacetime. Under an infinitesimal coordinate transformation (1.2.66), the matter Lagrangian density Lm(φ, φ,i) changes according to
∂Lm ∂Lm ∂Lm ik ∂Lm ik δLm = δφ + δ(φ,i) + ik δg + ik δ(g ,l) ∂φ ∂φ,i ∂g ∂g ,l
∂Lm j ∂Lm j + j δS ik + j δ(S ik,l). (2.4.16) ∂S ik ∂S ik,l
1 R The matter action Sm = c Lm(φ, φ,i)dΩ is a scalar, thereby it does not change under this trans- formation: Z 1 ∂Lm ∂Lm ∂Lm ik ∂Lm ik δSm = δφ + δ(φ,i) + ik δg + ik δ(g ,l) c ∂φ ∂φ,i ∂g ∂g ,l ∂Lm j ∂Lm j + j δS ik + j δ(S ik,l) dΩ = 0. (2.4.17) ∂S ik ∂S ik,l
The first two terms in (2.4.17) vanish because of the Lagrange equations for φ (2.1.8). If the ik j variations δg and S ik vanish on the boundary of the region of integration, then
1 Z ∂L ∂L 1 Z ∂L ∂L δS = m − ∂ m δgikdΩ + m − ∂ m δSj dΩ m c ∂gik l ∂gik c j l j ik ,l ∂S ik ∂S ik,l 1 Z δL 1 Z δL 1 Z 1 Z = m δgikdΩ + m δSj dΩ = T δgikdΩ + τ ikδSj dΩ c δgik c j ik 2c ik 2c j ik δS ik 1 Z 1 Z = − T ikδg dΩ + τ ikδSj dΩ = 0. (2.4.18) 2c ik 2c j ik
70 The components of the metric tensor change because of an infinitesimal coordinate transformation (1.2.66), thereby the corresponding variation of the metric tensor is given by (1.4.51): ¯ δgik = δgik = −Lξgik = −2ξ(i:k), (2.4.19)
and the variation of the torsion tensor is given by a Lie derivative,
j ¯ j j j l l j l j l j δS ik = δS ik = −LξS ik = ξ ,lS ik − ξ ,iS lk − ξ ,kS il − ξ S ik,l. (2.4.20) The variation of the matter action under (1.2.66) is therefore equal to 1 Z 1 Z δS = δS¯ = − T ikδg¯ dΩ + τ ikδS¯ j dΩ = 0. (2.4.21) m m 2c ik 2c j ik The first term on the right of (2.4.21) is 1 Z 1 Z 1 Z 1 Z − T ikδg¯ dΩ = T ikξ dΩ = (T ikξ ) dΩ − T ik ξ dΩ 2c ik c i:k c i :k c :k i 1 Z 1 Z 1 Z 1 Z = (T ikξ ) dΩ − T ik ξ dΩ = T ikξ dS − T k ξldΩ. (2.4.22) c i ,k c :k i c i k c l :k The second term on the right of (2.4.21) is 1 Z 1 Z τ ikδS¯ j dΩ = (τ ikξjSl ) − (τ ikξlSj ) − (τ ikξlSj ) dΩ 2c j ik 2c j ik ,l j lk ,i j il ,k 1 Z + −(τ ikSl ) ξj + (τ ikSj ) ξl + (τ ikSj ) ξl − τ ikSj ξldΩ 2c j ik ,l j lk ,i j il ,k j ik,l 1 Z 1 Z 1 Z = τ ikξjSl dS − τ ikξlSj dS − τ ikξlSj dS 2c j ik l 2c j lk i 2c j il k 1 Z + −(2τ ikSj ) − (τ ijSk ) − τ ikSj ξldΩ. (2.4.23) 2c j li ,k l ij ,k j ik,l
If the variations ξi of the coordinates vanish on the boundary of the region of integration, then (2.4.21) becomes 1 Z δS = − 2T k + (2τ ikSj + τ ijSk ) + τ ikSj ξldΩ = 0. (2.4.24) m 2c l :k j li l ij ,k j ik,l
Since the variations ξi are arbitrary, (2.4.24) gives the covariant conservation law for the metric energy-momentum density (4 equations):
k ik j 1 ij k 1 ik j Tl :k + τj S li + τl S ij + τj S ik,l = 0. (2.4.25) 2 ,k 2 √ Dividing the conservation law (2.4.25) by −g gives √ −g k ik j 1 ij k ik j 1 ij k ,k 1 ik j Tl :k + tj S li + tl S ij + tj S li + tl S ij √ + tj S ik,l 2 ,k 2 −g 2
k ik j 1 ij k im j k ik j m 1 ij k m = Tl :k + tj S li + tl S ij − tj S li{m k} + tj S mi{l k } + tm S ij{l k } 2 :k 2 1 1 1 1 − t ijSm { k } + t ikSj + t ijSk { m } + t ikSj − t ikSm { j } 2 l i j m k j li 2 l ij m k 2 j ik:l 2 j ik m l 1 1 + t ikSj { m} + t ikSj { m} = 0, (2.4.26) 2 j mk i l 2 j im k l where τ t = √ijk . (2.4.27) ijk −g
71 We therefore obtain the conservation law for the metric energy-momentum tensor:
k ik j 1 ij k 1 ik j Tl + tj S li + tl S ij + tj S ik:l = 0. (2.4.28) 2 :k 2
If the matter Lagrangian density does not depend on the torsion tensor, then tijk = 0 and (2.4.25) reduces to ik T :k = 0. (2.4.29) Equivalently, (2.4.28) reduces to ik T :k = 0. (2.4.30) R ij ¯ ij ¯ In this case, vanishing of T δgijdΩ in (2.4.21) does not imply T = 0, because 10 variations δgij are not all independent; they are functions of 4 independent variations ξi.
2.4.4 Conservation of tetrad energy-momentum
The matter action Sm is invariant under infinitesimal translations of the coordinate system (1.2.66). The corresponding changes of the tetrad and spin connection are given by Lie derivatives, ¯ i i i j j i δea = −Lξea = ξ ,jea − ξ ea,j, (2.4.31) ¯ ab ab j ab j ab δω i = −Lξω i = −ξ ,iω j − ξ ω i,j. (2.4.32) Equation (2.4.9) becomes Z 1 T aδe¯ i + S iδω¯ ab dΩ = 0. (2.4.33) i a 2 ab i If the variations ξi of the coordinates vanish on the boundary of the region of integration, then substituting (2.4.31) and (2.4.32) into (2.4.33) gives Z 1 1 T aξi ej − T aξjei − S iξj ωab − S iξjωab dΩ i ,j a i a,j 2 ab ,i j 2 ab i,j Z 1 1 = −T j − T aej + (S jωab ) − S jωab ξidΩ = 0. (2.4.34) i ,j j a,i 2 ab i ,j 2 ab j,i This equation is satisfied for an arbitrary vector ξi, thereby we obtain j ab j ab ab j a j Sab ,jω i + Sab (ω i,j − ω j,i) − 2Ti ,j − 2Tj ea,i j k j c j c ab j a j = (Sab |j − 2SkSab + Scb ω aj + Sac ω bj)ω i − 2Ti ,j − 2Tj ea,i j ab a cb a cb +Sab (−R ij + ω ciω j − ω cjω i) = 0, (2.4.35) which reduces to j k ab ab j j j jk jk (Sab |j − 2SkSab )ω i − R ijSab − 2Ti ;j + 4SjTi − 2Tjkω i + 4S iTjk k k jl kl j j j jk jk = (Sjl ;k − 2SkSjl )ω i − R ijSkl − 2Ti ;j + 4SjTi − 2Tjkω i + 4S iTjk = 0. (2.4.36) The conservation law for the spin density (2.4.14) brings (2.4.36) to the covariant conservation law for the tetrad energy-momentum density: 1 T j = 2S T j + 2Sj T k + S jRkl , (2.4.37) i ;j j i ki j 2 kl ji which is equivalent to 1 Tij = C iTjk + S Rklji. (2.4.38) :j jk 2 klj This law can be written as the conservation law for the tetrad energy-momentum tensor: 1 tij = C itjk + s Rklji. (2.4.39) :j jk 2 klj Equations (2.4.37) and (2.4.39) are equivalent to (2.4.25) and (2.4.28).
72 2.4.5 Conservation laws for Lorentz group Let us consider a physical system in which the gravitational field (torsion and curvature) can be neglected, described by a matter Lagrangian density Lm. The Lagrangian density Lm therefore depends on the coordinates only through matter fields φ and their first derivatives φ,i. Differentiating Lm gives, using the Lagrange equations (2.1.8), ∂Lm ∂Lm ∂Lm ∂Lm ∂Lm ∂iLm = φ,i + φ,ji = ∂j φ,i + φ,ji = ∂j φ,i . (2.4.40) ∂φ ∂φ,j ∂φ,j ∂φ,j ∂φ,j This equation can be written as a conservation law:
j θi ,j = 0, (2.4.41) for a quantity j ∂Lm j θi = φ,i − δi Lm. (2.4.42) ∂φ,j The conservation law (2.4.41) is a special case of (2.4.37) in the absence of torsion and curvature, expressed in the Galilean and geodesic frame. The quantity (2.4.42) is a special case of the canonical energy-momentum density (2.3.13) in the absence of torsion, expressed in the Galilean and geodesic frame. The Noether current (2.4.7) can be written as
i ∂Lm i j J = δφ − θj ξ . (2.4.43) ∂φ,i
If xi are Cartesian coordinates then for translations, ξi = i = const and δφ = 0, the current (2.4.7) is i i ∂Lm j J = Lm − φ,j. (2.4.44) ∂φ,i The conservation law (2.4.6) for this current is
j i θj ,i = 0, (2.4.45)
i i i j 1 ij which gives (2.4.41) because are arbitrary. For Lorentz rotations, ξ = jx and δφ = 2 ijG φ, where Gij are the generators of the Lorentz group, the Noether current (2.4.7) is i ij ∂Lm 1 kl jk kl ∂Lm i 1 ∂Lm J = xjLm + Gklφ − xkφ,j = xk φ,l − xkδl Lm + Gklφ . (2.4.46) ∂φ,i 2 ∂φ,i 2 ∂φ,i
The conservation law (2.4.6) for this current is kl ∂Lm i 1 ∂Lm φ,[lxk] − δ[lxk]Lm + Gklφ = 0. (2.4.47) ∂φ,i 2 ∂φ,i ,i
Because kl are arbitrary, this equation gives the conservation law,
i Mkl ,i = 0, (2.4.48)
for the angular momentum density:
i i i ∂Lm i i i Mkl = xkθl − xlθk + Gklφ = xkθl − xlθk + Σkl . (2.4.49) ∂φ,i The angular momentum density is antisymmetric in the first two indices:
Mijk = −Mjik. (2.4.50)
73 This quantity is the sum, k k k Mij = Λij + Σij , (2.4.51) of two tensor densities: the orbital angular momentum density,
i i i Λkl = xkθl − xlθk , (2.4.52) and the intrinsic angular momentum density (canonical spin density) (2.3.25). The conservation law (2.4.48) gives
kli k li k li l ki l ki kli M ,i = δi θ + x θ ,i − δiθ − x θ ,i + Σ ,i = 0, (2.4.53) which reduces, by means of (2.4.41), to
i θkl − θlk − Σkl ,i = 0. (2.4.54) This equation is a special case of the conservation law for the spin density (2.4.14) in the absence of torsion, expressed in the Galilean and geodesic frame. The canonical energy-momentum density θik is not symmetric. However, the quantity j τik = θik + ∂jψik , (2.4.55) where 1 ψ j = − (Σ j − Σ j + Σj ), (2.4.56) ik 2 ik k i ik is symmetric:
j j j j j τik − τki = θik − θki + ∂j(ψik − ψki ) = Σik ,j + ∂j(ψik − ψki ) = 0. (2.4.57) Since (2.4.56) is antisymmetric in the last two indices,
ψikj = −ψijk, (2.4.58) the quantity (2.4.55) is also conserved:
ik ik ikj ik τ ,k = θ ,k + ψ ,jk = θ ,k = 0. (2.4.59)
The symmetric energy-momentum density τik is equal to the metric energy-momentum density (2.3.4), expressed in the Galilean and geodesic frame. Equation (2.4.55) is a special case of the Belinfante-Rosenfeld relation (2.3.34) in the absence of torsion, expressed in the Galilean and geodesic frame.
2.4.6 Momentum four-vector Integrating the conservation law (2.4.41) for the canonical energy-momentum density (2.4.42), which is satisfied if we neglect torsion and curvature, over a four-volume and using the Gauß-Stokes theorem (1.1.39) gives Z I ik ik θ ,kdΩ = θ dSk = 0, (2.4.60) where the integral on the right is taken over the closed hypersurface surrounding the four-volume. 0 If the hypersurface represented by the element dSk is taken as a hyperplane perpendicular to the x 0 axis (volume hypersurface), dSk = δkdV , then the closed hypersurface surrounds the four-volume between two hyperplanes at times t1 and t2:
I Z t Z t ik ik 2 i0 2 θ dSk = θ dSk = θ dV = 0. (2.4.61) t1 t1 We define the momentum four-vector or four-momentum of matter in the four-volume as 1 Z 1 Z 1 Z 1 Z P i = TikdS = Ti0dV = ΘikdS = Θi0dV, (2.4.62) c k c c k c
74 where we used (2.3.14). In the locally Galilean and geodesic frame of reference, the four-momentum (2.4.62) reduces to 1 Z 1 Z P i = θikdS = θi0dV, (2.4.63) c k c which is locally conserved because of (2.4.61):
i i i P |t1 = P |t2 ,P = const. (2.4.64)
1 i0 00 The components c θ form the four-momentum density. The component θ is referred to as the energy density, 00 ˙ ∂Lm W = θ = φ − Lm. (2.4.65) ∂φ˙ Integrating it over the volume gives the time component P 0 of the four-momentum, Z ∂L cP 0 = θ00dV = φ˙ − L, (2.4.66) ∂φ˙ where Z L = LmdV (2.4.67)
is the Lagrange function or Lagrangian. The covariant time component of cP i is referred to as the energy, E = cP0. (2.4.68) ˙ dφ Hereinafter, a dot above any quantity φ denotes the derivative of φ with respect to time, φ = dt , ¨ d2φ and two dots above φ denote second derivative of φ with respect to time, φ = dt2 . Consequently, the action of a physical system is equal to the time integral of the Lagrangian, Z S = Ldt. (2.4.69)
The components 1 Πα = θα0 (2.4.70) c form the momentum density Π. Integrating them over the volume gives the spatial components P α of the four-momentum, 1 Z P α = θα0dV, (2.4.71) c which form the momentum vector P:
P i = (P 0,P α) = (P 0, P). (2.4.72)
The conservation law (2.4.41) can be written as
1 ∂θ00 ∂θ0α + = 0, (2.4.73) c ∂t ∂xα 1 ∂θα0 ∂θαβ + = 0. (2.4.74) c ∂t ∂xβ Integrating these equations over a volume and using the Gauß’ theorem (1.4.172) gives
∂ Z I θ00dV = −c θ0αdf , (2.4.75) ∂t α ∂ Z 1 I θα0dV = − θαβdf . (2.4.76) ∂t c β
75 The integral of a three-dimensional vector V α over a two-dimensional surface represented by the H α element dfα, V dfα, is referred to as the flux of this vector. Integrating the components of the energy current S, Sα = cθ0α, (2.4.77)
over dfα gives the energy flux or power: I dE P = Sαdf = − . (2.4.78) α dt
The last equality results from (2.4.66), (2.4.68) and (2.4.75). Integrating the components θαβ, which represent the momentum current, over dfα gives the momentum flux. The stress tensor is defined as
σαβ = −θαβ. (2.4.79)
Its integral taken over dfα gives the vector opposite to the momentum flux, called the surface force F : s I α αβ Fs = σ dfβ. (2.4.80)
The relations (2.4.71), (2.4.76), (2.4.79) and (2.4.80) equal the time derivative of the momentum P α α to the surface force Fs : ˙ α α P = Fs . (2.4.81) The components of the energy-momentum tensor form the following matrix:
W S θik = c . (2.4.82) cΠ −σαβ
ikj ik Adding ψ ,j, where (2.4.58) is satisfied, to θ preserves the conservation law (2.4.41) and brings θik to a symmetric form, τ ik. Using the Gauß-Stokes theorem (1.1.38) gives Z Z Z 1 Z 1 I τ ikdS = θikdS + ψikj dS = cP i + (ψikj dS − ψikj dS ) = cP i + ψikjdf ∗ , k k ,j k 2 ,j k ,k j 2 kj (2.4.83) where the last integral is taken over the surface which bounds the hypersurface. If this surface is located in a region, where matter is absent, then the surface integral vanishes. Consequently, replacing θik by τ ik does not change the four-momentum (2.4.63): 1 Z 1 Z P i = τ ikdS = τ i0dV. (2.4.84) c k c
2.4.7 Mass We define the mass of matter in a four-volume as (P iP )1/2 m = i , (2.4.85) c where P i is the four-momentum of the matter (2.4.62). This definition is analogous to the mass operator (1.6.81). If the matter satisfies the dominant energy condition (confer (2.5.10)), then i i i P Pi ≥ 0 and the mass is a real quantity. If P Pi > 0, then m > 0. If P Pi = 0, then m = 0.
2.4.8 Angular momentum four-tensor Integrating the conservation law (2.4.48) for the angular momentum density (2.4.49), which is sat- isfied if we neglect torsion and curvature, over a four-volume and using the Gauß-Stokes theorem (1.1.39) gives Z I ikj ikj M ,jdΩ = M dSj = 0, (2.4.86)
76 where the integral on the right is taken over the closed hypersurface surrounding the four-volume. If the hypersurface represented by the element dSj is taken as a volume hyperplane, then the closed hypersurface surrounds the four-volume between two hyperplanes at times t1 and t2:
I Z t Z t ikj ikj 2 ik0 2 M dSj = M dSj = M dV = 0. (2.4.87) t1 t1 The angular momentum four-tensor of matter in the four-volume, 1 Z 1 Z M ik = MikjdS = Mik0dV, (2.4.88) c j c is therefore locally conserved:
ik ik ik M |t1 = M |t2 ,M = const. (2.4.89) The angular momentum four-tensor, following (2.4.50), is antisymmetric:
M ik = −M ki. (2.4.90)
1 αβ0 The components c M form the spatial angular momentum density. Integrating them over the volume gives the components of the spatial angular momentum tensor, 1 Z M αβ = Mαβ0dV. (2.4.91) c Since the angular momentum tensor is antisymmetric,
M αβ = −M βα, (2.4.92) we can define the angular momentum pseudovector M: 1 M α = eαβγ M ,M = e M γ . (2.4.93) 2 βγ αβ αβγ Integrating (2.4.51) over a hypersurface gives
M ik = Lik + Sik, (2.4.94)
where 1 Z 1 Z 1 Z 1 Z Lik = ΛikjdS = (xiθkj − xkθij)dS = Λik0dV = (xiθk0 − xkθi0)dV (2.4.95) c j c j c c is the orbital angular momentum four-tensor and 1 Z 1 Z Sik = ΣikjdS = Σik0dV (2.4.96) c j c
is the intrinsic angular momentum four-tensor. Unlike M ik, these tensors are not separately con- served. These tensors are also antisymmetric:
Lik = −Lki,Sik = −Ski. (2.4.97)
Similarly to (2.4.93), we can define the orbital angular momentum pseudovector L and the intrinsic angular momentum pseudovector S: 1 Lα = eαβγ L ,L = e Lγ , (2.4.98) 2 βγ αβ αβγ 1 Sα = eαβγ S ,S = e Sγ , (2.4.99) 2 βγ αβ αβγ M α = Lα + Sα, M = L + S. (2.4.100)
77 The symmetry of τ ik can be written, using (2.4.59), as
ki ik i kl k il τ − τ = ∂l(x τ − x τ ) = 0. (2.4.101)
Integrating this equation over a four-volume and using the Gauß-Stokes theorem (1.1.39) gives
I Z t Z t i kl k il i kl k il 2 i k0 k i0 2 (x τ − x τ )dSl = (x τ − x τ )dSl = (x τ − x τ )dV = 0, (2.4.102) t1 t1 which shows the conservation of the quantity 1 Z 1 Z L˜ik = (xiτ kl − xkτ il)dS = (xiτ k0 − xkτ i0)dV = const. (2.4.103) c l c The symmetry of a second-rank tensor whose ordinary divergence is zero is therefore related to the local conservation of the orbital angular momentum four-tensor constructed from that tensor. Using (2.3.25), (2.4.55), and the Gauß-Stokes theorem (1.1.38) leads to Z Z Z ˜ik ikl i klj k ilj ik i klj k ilj cL = Λ dSl + (x ψ ,j − x τ ,j)dSl = cL + (x ψ ),j − (x ψ ),j dSl Z 1 − (ψkli − ψilk)dS = cLik + (xiψklj) dS − (xiψklj) dS − (xkψilj) dS + (xkψilj) dS l 2 ,j l ,l j ,j l ,l j Z 1 I + ΣikjdS = cM ik + (xiψklj − xkψilj)df ∗ . (2.4.104) j 2 lj If the integration surface is located in a region, where matter is absent, then the surface integral vanishes. Consequently, the quantity (2.4.103) is equal to the angular momentum four-tensor (2.4.88) if we neglect torsion and curvature. This equality shows that replacing θik in (2.4.52) by τ ik changes the values of (2.4.95) and thereby (2.4.88). The angular momentum four-tensor can also be written, using (2.4.63), in terms of P i: Z M ik = (xidP k − xkdP i) + Sik. (2.4.105)
In the absence of the intrinsic angular momentum, (2.4.105) reduces to Z M ik = (xidP k − xkdP i). (2.4.106)
The conservation of M 0α,
1Z Z 1 Z M α0 = xαθ00dV − x0 θα0dV + Sα0 = xαθ00dV − ctP α + Sα0 = const, (2.4.107) c c divided by the conserved P 0 gives
Sα0 Xα = V αt + + const, (2.4.108) P 0 where cP α V α = (2.4.109) P 0 and R xαθ00dV Xα = . (2.4.110) R θ00dV If the intrinsic angular momentum is constant, then the relation (2.4.108) describes a uniform motion of the center of inertia, whose coordinates are Xα, with velocity V α. The coordinates of the center of inertia (2.4.110) are not the spatial components of a four-dimensional vector.
78 2.4.9 Energy-momentum tensor for particles Let us consider matter which is distributed over a small region in space and consists of points with the coordinates xi, forming an extended body whose motion is represented by a world tube in spacetime. The motion of the body as a whole is represented by an arbitrary timelike world line γ inside the world tube, which consists of points with the coordinates Xi(τ), where τ is the proper time on γ. We define
δxi = xi − Xi, (2.4.111) dXi ui = , (2.4.112) ds
2 i j where ds = gijdX dX . The conservation law for the tetrad energy-momentum density (2.4.39) is 1 Tji + { j }Tik − C jTik − R jSikl = 0. (2.4.113) ,i i k ik 2 ikl Integrating (2.4.113) over the volume of the body at a constant time X0 and using Gauß’ theorem to eliminate surface integrals gives Z Z Z 1 Z Tj0 dV + { j }TikdV − C jTikdV − R jSikldV = 0, (2.4.114) ,0 i k ik 2 ikl where dV is a volume element. If a body is not spatially extended then it is referred to as a particle. In this case, the quantity (2.4.111) satisfies δxi = 0. (2.4.115) i i i The affine connection Γj k, and consequently {j k}, C jk, and the curvature tensor in the integrands in (2.4.114), are therefore equal to their respective values at the point Xi. Consequently, we obtain
Z Z Z 1 Z Tj0 dV + { j } TikdV − C j TikdV − R j SikldV = 0. (2.4.116) ,0 i k ik 2 ikl We define the following integrals: Z M ik = u0 TikdV, (2.4.117) Z N ijk = u0 SijkdV. (2.4.118)
Since the integration domain is not spatially extended, these quantities are tensors, and can be represented as covariant hypersurface integrals: Z ik l ik M = u T dSl, (2.4.119) Z ijk l ijk N = u S dSl. (2.4.120)
Using these integrals and Z Z Z j0 j0 1 d j0 T ,0dV = T dV = 0 T dV (2.4.121) ,0 u ds
turns (2.4.116) into
d M j0 1 + { j }M (ik) − C jM [ik] − R jN ikl = 0. (2.4.122) ds u0 i k ik 2 ikl
79 The conservation law (2.4.113) gives 1 (xlTji) = Tjl − xl{ j }Tik + xlC jTik + xlR jSikm, (2.4.123) ,i i k ik 2 ikm l m ji m jl l jm l m j ik l m j ik (x x T ),i = x T + x T − x x {i k}T + x x Cik T 1 + xlxmR jSikn. (2.4.124) 2 ikn Integrating (2.4.123) over the volume of the body and using Gauß’ theorem to eliminate surface integrals gives Z Z Z Z 1 Z (xlTj0) dV = TjldV − xl{ j }TikdV + xlC jTikdV + xlR jSikmdV. (2.4.125) ,0 i k ik 2 ikm
In this relation, we use xi = Xi, which follows from (2.4.111) and (2.4.115). Substituting (2.4.112) l l 0 into (2.4.125) and using X ,0 = u /u gives
ul Z Z Z Z Z Tj0dV + Xl Tj0 dV = TjldV − Xl { j }TikdV + Xl C jTikdV u0 ,0 i k ik 1 Z + Xl R jSikmdV. (2.4.126) 2 ikm
This equation reduces, by means of (2.4.114), to
ul Z Z Tj0dV = TjldV. (2.4.127) u0
Using the definition (2.4.117) brings (2.4.127) to
ul M j0 = M jl. (2.4.128) u0 Putting l = 0 in (2.4.128) gives the identity. Integrating (2.4.124) over the volume of the body does not introduce new relations. The expressions analogous to (2.4.123) and (2.4.124) with higher multiples of xi do not introduce new relations as well. If the spin density vanishes, Sijk = 0, the particle is spinless. The conservation law for the spin density (2.4.14) gives in this case the symmetry of the energy-momentum density, Tik = Tki. The tensor density Tij is also equal to the metric energy-momentum density T ij according to (2.3.35). The quantity (2.4.117) is then symmetric:
M ik = M ki. (2.4.129)
Putting j = 0 in (2.4.128) gives ul M 0l = M 00. (2.4.130) u0 The relation (2.4.128) leads then to
ul ujul M jl = M 0j = M 00. (2.4.131) u0 (u0)2
The quantity (2.4.117) for a spinless particle is proportional to the product of the components of the four-velocity. Equations (2.4.62) and (2.4.117) give the four-momentum of a spinless particle:
1 Z M i0 M 0i ui P i = Ti0dV = = = M 00. (2.4.132) c cu0 cu0 c(u0)2
80 The mass (2.4.85) of the particle is therefore given by
P iP uiu M 00 2 m2 = i = i (M 00)2 = , (2.4.133) c2 (cu0)4 (cu0)2 leading to M 00 m = . (2.4.134) (cu0)2 Consequently, the four-momentum is given by
P i = mcui, (2.4.135)
and the mass satisfies P iu m = i . (2.4.136) c The four-momentum of a spinless particle is proportional to its four-velocity. The quantity (2.4.131) simplifies to M ik = mc2uiuk. (2.4.137) This relation gives Z uiuk TikdV = mc2 (2.4.138) u0 or uiuk Tik(x) = mc2δ(x − x ) , (2.4.139) 0 u0
where δ(x − x0) is the spatial Dirac delta representing a point mass located at x0. Contracting (2.4.137) with ui gives the relations for the four-momentum and mass:
M iku P i = k , (2.4.140) c M iku u m = i k . (2.4.141) c2 Since M ik for a particle is a tensor, P i is a four-vector and m is a scalar. In a locally inertial frame of reference, (1.6.120) and (2.4.135) give E P i = (mcγ, mγv) = , p , (2.4.142) c so the energy and momentum of the particle are
E = mc2γ, (2.4.143) P = mγv. (2.4.144)
Accordingly, (2.4.133) gives E2 = (Pc)2 + (mc2)2. (2.4.145) In the rest frame of the particle, P = 0, (2.4.145) reduces to Einstein’s formula for the rest energy,
E = mc2. (2.4.146)
The formulae (2.4.143) and (2.4.144) give
Pc2 v = . (2.4.147) E Taking the differential of (2.4.145) gives EdE = c2P · dP, from which we obtain, using (2.4.147),
dE = v · dP. (2.4.148)
81 If a particle is massless, m = 0, then (2.4.145) and (2.4.147) give
E = P c, v = c. (2.4.149)
We define the mass density µ such that √ µ sdV = dm, (2.4.150)
where s is given by (1.4.147). The mass density for a particle located at xa is m µ(x) = √ δ(x − x ), (2.4.151) s a so (2.4.139) turns into √ uiuk Tik = µc2 s . (2.4.152) u0 Therefore, the energy-momentum tensor for a spinless particle is given by
uiuk µ(x)c dxi dxk uiuk ik 2 2 √ T (x) = µ(x)c √ 0 = √ = mc δ(x − xa) 0 g00u g00 ds dt −gu Z uiuk = mc2 √ δ(x − x (τ))dτ, (2.4.153) −g a
where xa(τ) is the particle’s wordline as a function of its proper time τ. For a system of particles, we have X uiuk T ik(x) = m c2δ(x − x )√ . (2.4.154) a a −gu0 a In the absence of torsion and in the locally Galilean frame of reference, the conservation law for the energy-momentum tensor is given by (2.4.41), thereby
i Tα ,i = 0. (2.4.155) Let us consider a closed system of particles which carry out a finite motion, in which all quantities vary over finite ranges. We define the average over a certain time interval τ of a function f of these ¯ 1 R τ ¯˙ 1 quantities as f = τ 0 fdt. The average of the derivative of a bounded quantity f = τ f(τ) − f(0) → 0 as τ → ∞. Therefore, averaging (2.4.155) over the time gives
¯ β Tα ,β = 0. (2.4.156) Multiplying (2.4.156) by xα and integrating over the volume gives, omitting surface integrals, Z Z α ¯ β ¯ α x Tα ,βdV = − Tα dV = 0. (2.4.157)
The average energy of the system (2.4.66) is thus Z Z ¯ ¯ 0 ¯ i E = T0 dV = Ti dV. (2.4.158)
Substituting (1.6.120) into (2.4.154) gives
X v2 1/2 T i(x) = m c2δ(x − x ) 1 − , (2.4.159) i a a c2 a
i so Ti ≥ 0. Putting (2.4.159) into (2.4.158) gives
X v2 1/2 E¯ = m c2 1 − , (2.4.160) a c2 a
82 which is referred to as the virial theorem. The equation of motion for a spinless particle follows from (2.4.122), which reduces to
d M j0 + { j }M ik = 0. (2.4.161) ds u0 i k Substituting (2.4.137) into (2.4.161) gives d (muj) + { j }muiuk = 0. (2.4.162) ds i k
Contracting (2.4.162) with uj yields dm duj + m u + { j }uiuku = 0. (2.4.163) ds ds j i k j l i Differentiating u u gli = 1 with respect to s gives dul dg dul 2 uig + ului li = 2 u + uluig uk = 0. (2.4.164) ds li ds ds l li,k Using 1 1 { j }uiuku = (g + g − g )uiukul = g uiukul (2.4.165) i k j 2 li,k lk,i ik,l 2 li,k turns (2.4.164) into duj u + { j }uiuku = 0. (2.4.166) ds j i k j Consequently, (2.4.163) reduces to dm = 0, (2.4.167) ds showing that the mass of a particle is constant. Taking this constancy into account, (2.4.162) reduces to the metric geodesic equation (1.4.91). A spinless particle moves in a gravitational field along a metric geodesic, regardless of its mass. This phenomenon is referred to as the universality of free fall or weak equivalence principle.
2.4.10 Spin tensor for particles If the spin density does not vanish, we can use the conservation law for this quantity to determine the spin tensor and the energy-momentum tensor for a system of particles. The conservation law for the spin density (2.4.14) is
ijk i jlk j ilk [ij] S ,k − Γl kS + Γl kS − 2T = 0. (2.4.168) Integrating (2.4.168) over the volume of the body at a constant time X0 (analogously to the calcu- lations in the preceding section) and using Gauß’ theorem to eliminate surface integrals gives Z Z Z Z ij0 i jlk j ilk [ij] S ,0dV − Γl kS dV + Γl kS dV − 2 T dV = 0. (2.4.169)
i For a particle, the affine connection Γj k in the integrands in (2.4.169) is equal to its value at the point Xi. Consequently, we obtain Z Z Z Z ij0 i jlk j ilk [ij] S ,0dV − Γl k S dV + Γl k S dV − 2 T dV = 0. (2.4.170)
Using the integrals (2.4.117) and (2.4.118) turns (2.4.170) into
d N ij0 − Γ i N jlk + Γ j N ilk − 2M [ij] = 0. (2.4.171) ds u0 l k l k
83 The conservation law (2.4.168) gives
l ijk ijl l i jlk l j ilk l [ij] (x S ),k = S + x Γl kS − x Γl kS + 2x T . (2.4.172) Integrating (2.4.172) over the volume of the body and using Gauß’ theorem to eliminate surface integrals gives Z Z Z Z Z l ij0 ijl l i jmk l j imk l [ij] (x S ),0dV = S dV + x Γm kS dV − x Γm kS dV + 2 x T dV. (2.4.173)
Substituting (2.4.112) into (2.4.173) and using xi = Xi gives
ul Z Z Z Z Z Sij0dV + Xl Sij0 dV = SijldV + Xl Γ i SjmkdV − Γ j SimkdV u0 ,0 m k m k Z +2 T[ij]dV , (2.4.174) which reduces, by means of (2.4.169), to
ul Z Z Sij0dV = SijldV. (2.4.175) u0 Using the definition (2.4.118) brings (2.4.175) to
ul N ij0 = N ijl. (2.4.176) u0 Putting l = 0 in (2.4.176) gives the identity. The expressions analogous to (2.4.172) with higher multiples of xi does not introduce new relations. The relation (2.4.176) infers that the spin tensor for a system of particles satisfies
sijl = sijul, (2.4.177)
where ij ijl s = s ul. (2.4.178) If this tensor is orthogonal to ui, ij s uj = 0, (2.4.179) then it has 3 independent components. A system satisfying (2.4.177) and (2.4.179) is referred to as a spin fluid. The spin tensor (2.4.177) is traceless, because of (2.4.179). In a locally Galilean, rest frame of reference, (2.4.179) becomes s0α = 0. (2.4.180)
In this frame, the 3 components of sij are spatial, sαβ, and are equivalent to 3 components of a spatial pseudovector: 1 sα = eαβγ s . (2.4.181) 2 βγ The relation (2.4.128) gives ul ul M jl = M 0j + 2 M [j0]. (2.4.182) u0 u0 Putting j = 0 in (2.4.182) gives (2.4.130). The relations (2.4.134) and (2.4.182) lead then to
ujul ul ul M jl = M 00 + 2 M [j0] = mc2ujul + 2 M [j0]. (2.4.183) (u0)2 u0 u0 The relation (2.4.171) gives
d N i00 2M [i0] = − Γ i N 0lk + Γ 0 N ilk. (2.4.184) ds u0 l k l k
84 Substituting this equation into (2.4.183) yields
ul d N j00 M jl = mc2ujul + − Γ j N 0ik + Γ 0 N jik . (2.4.185) u0 ds u0 i k i k
Using (2.4.176) brings (2.4.185) to
ul d N j00 uk uk M jl = mc2ujul + + Γ j N i00 + Γ 0 N ji0 . (2.4.186) u0 ds u0 u0 i k u0 i k
Consequently, the four-momentum (2.4.132) is
M i0 1 d N i00 uk uk P i = = mcui + + Γ i N l00 + Γ 0 N il0 . (2.4.187) cu0 cu0 ds u0 u0 l k u0 l k
The four-momentum of a particle with spin is not proportional to its four-velocity. il0 ul i00 If the spin density is completely antisymmetric, then (2.4.176) gives N = − u0 N and thus
N ijk = 0. (2.4.188)
Therefore, such a field cannot be represented as a point or a system of points.
2.4.11 Relativistic ideal fluids
In an arbitrary frame of reference, the metric energy-momentum tensor Tik describing isotropic spinless matter (without a preferred direction in its rest frame) can be decomposed into several parts. One part is proportional to uiuk, analogously to (2.4.152). Another part is proportional to the projection tensor (1.4.229): hik = gik − uiuk, (2.4.189) which is orthogonal to ui: k hiku = 0. (2.4.190) i The tensor Tik can also contain terms with covariant derivatives of u . Let us assume that Tik does not depend on derivatives of ui. Therefore, we have
Tik = uiuk − phik = ( + p)uiuk − pgik, (2.4.191) where a scalar is equal to the energy density W in the locally Galilean rest frame and a scalar p is ik the pressure. In this frame T = diag(, p, p, p) and the stress tensor σαβ = −pδαβ, thereby (2.4.80) gives I I F α = − p df α = − p nαdf. (2.4.192)
This equation, referred to as Pascal’s law, states that the force per unit surface df acting on a surface is parallel, with the opposite sign, to the outward normal vector of this surface nα, dF α/df = −pnα. The scalars and p can also be written as
i k = Tiku u , (2.4.193) 1 p = − T hik. (2.4.194) 3 ik Matter described by the tensor (2.4.191) represents an ideal fluid. The relation (2.4.191) can be written as Tik = (cπi + pui)uk − pgik, (2.4.195) where 1 π = T uk = u (2.4.196) i c ik c i
85 is equal to the four-momentum density in the locally Galilean rest frame: 1 π = T . (2.4.197) i c i0 We also have π ui = . (2.4.198) i c The relation between and p is referred to as the equation of state. In the Galilean frame of reference, combining (1.6.120), (2.4.82), and (2.4.191) gives
+ pv2/c2 W = , (2.4.199) 1 − v2/c2 ( + p)v S = , (2.4.200) 1 − v2/c2 ( + p)v v σ = − α β − pδ . (2.4.201) αβ c2 − v2 αβ The relation (2.4.191) gives i T = T i = − 3p. (2.4.202) 2 2 0 α The component T00 = u0 + p(u0 − g00) is, by means of u0 = (g00dx + g0αdx )/ds, (1.4.126) and (1.4.127), equal to dl 2 T = u2 + pg , (2.4.203) 00 0 00 ds
so it is positive under physical conditions > 0, p > 0, and g00 > 0. If Tik depends also on derivatives of ui then matter described by the tensor (2.4.191) with the corresponding additional terms represents a viscous fluid. Comparing (2.4.202) with (2.4.159) gives
X v2 1/2 − 3p = m c2 1 − , (2.4.204) a c2 a where the summation extends over all particles in unit volume, thereby p ≤ /3. In the nonrelativistic limit p ≈ 0, while in the ultrarelativistic limit (v → c) p → /3. Let us consider a system of noninteracting identical particles of mass m, which we call an ideal gas, with the number of particles in unit volume (number density or concentration) n, thereby
µ = nm. (2.4.205)
Comparing (2.4.191) in the locally Galilean rest frame with (2.4.153) gives the kinetic formulae for ideal gases:
= nmc2γ,¯ (2.4.206) nm p = γv2. (2.4.207) 3 The covariant conservation (2.4.30) of the metric energy-momentum tensor (2.4.191) gives