Quantum field theory: Lecture Notes
Rodolfo Alexander Diaz Sanchez Universidad Nacional de Colombia Departamento de F´ısica Bogot´a, Colombia
August 23, 2015 Contents
1 Relativistic quantum mechanics 7 1.1 Surveyonquantummechanics ...... 7 1.1.1 Vector subspaces generated by eigenvalues ...... 8 1.2 Symmetriesinquantummechanics ...... 9 1.3 Irreducible inequivalent representations of groups ...... 12 1.4 ConnectedLiegroups ...... 14 1.5 Lorentztransformations ...... 17 1.6 The inhomogeneous Lorentz Group (or Poincar´egroup) ...... 19 1.6.1 Four-vectorsandtensors...... 20 1.7 SomesubgroupsofthePoincar´egroup ...... 22 1.7.1 Proper orthochronous Lorentz group ...... 23 1.7.2 Discrete transformations in the Lorentz group ...... 24 1.7.3 Infinitesimal transformations within the proper orthochronus Lorentz group ...... 25 1.8 QuantumLorentzTransformations ...... 25 1.8.1 Four-vector and tensor operators ...... 26 1.8.2 Infinitesimal quantum Lorentz transformations ...... 27 1.8.3 Lorentz transformations of the generators ...... 28 1.8.4 Lie algebra of the Poincar´egenerators ...... 29 1.8.5 Physical interpretation of Poincare’s generators ...... 30 1.9 One-particlestates ...... 32 1.9.1 One-particle states under pure translations ...... 32 1.9.2 One-particle states under homogeneous Lorentz transformations ...... 32 1.9.3 Physicallittlegroups...... 35 1.9.4 Normalization of one-particle states ...... 36 1.10 One-particle states with non-null mass ...... 38 1.10.1 Wigner rotation and standard boost ...... 40 1.11 One-particle states with null mass ...... 44 1.11.1 Determination of the little group ...... 44 1.11.2 Lie algebra of the little group ISO (2) ...... 48 1.11.3 Massless states in terms of eigenvalues of the generators of ISO (2)...... 50 1.11.4 Lorentz transformations of massless states ...... 51 1.12 Space inversion and time-reversal ...... 53 1.13 Parity and time-reversal for one-particle states with M > 0...... 56 1.13.1 Parity for M > 0...... 56 1.13.2 Time reversal for M > 0...... 58 1.13.3 Parity for null mass particles ...... 59 1.13.4 Time-reversal for null mass particles ...... 62 1.14 Action of T 2 andKramer’sdegeneracy...... 63
2 CONTENTS 3
2 Scattering theory 65 2.1 Construction of “in” and “out” states ...... 65 2.2 The S matrix ...... 70 − 2.3 Symmetries of the S matrix ...... 75 − 2.3.1 Lorentzinvariance ...... 75 2.3.2 Internalsymmetries ...... 81 2.3.3 Parity ...... 83 2.3.4 Time-reversal...... 86 2.3.5 PTsymmetry...... 89 2.3.6 Charge-conjugationC,CPandCPT ...... 90 2.4 Ratesandcross-sections ...... 91 2.4.1 One-particle initial states ...... 94 2.4.2 Two-particles initial states ...... 95 2.4.3 Multi-particle initial states ...... 95 2.4.4 Lorentz transformations of rates and cross-sections ...... 95 2.5 Physical interpretation of the Dirac’s phase space factor δ4 (p p ) dβ ...... 98 β − α 2.5.1 The case of Nβ =2...... 98 2.5.2 The case with Nβ =3andDalitzplots...... 101 2.6 Perturbationtheory ...... 102 2.6.1 Distorted-wave Born approximation ...... 107 2.7 Implicationsofunitarity ...... 108 2.7.1 Generalized optical theorem and CPT invariance ...... 111 2.7.2 Unitarity condition and Boltzmann H-theorem ...... 112
3 The cluster decomposition principle 115 3.1 Physicalstates ...... 115 3.1.1 Interchange of identical particles ...... 116 3.1.2 Interchange of non-identical particles ...... 117 3.1.3 Normalization of multi-particle states ...... 117 3.2 Creation and annihilation operators ...... 118 3.2.1 Commutation and anti-commutation relations of a (q) and a† (q) ...... 119 3.3 Arbitrary operators in terms of creation and annihilationoperators ...... 120 3.4 Transformation properties of the creation and annihilation operators ...... 121 3.5 Cluster decomposition principle and connected amplitudes...... 122 3.5.1 Someexamplesofpartitions...... 124 3.6 Structureoftheinteraction ...... 127 3.6.1 Asimpleexample ...... 129 3.6.2 Connected and disconnected parts of the interaction ...... 131 3.6.3 Some examples of the diagrammatic properties ...... 133 3.6.4 Implications of the theorem ...... 136
4 Relativistic quantum field theory 137 4.1 Freefields...... 137 4.2 Lorentz transformations for massive fields ...... 139 4.2.1 Translations...... 142 4.2.2 Boosts...... 144 4.2.3 Rotations ...... 144 4.3 Implementation of the cluster decomposition principle ...... 145 4.4 Lorentz invariance of the S matrix...... 146 − 4 CONTENTS
4.5 Internal symmetries and antiparticles ...... 147 4.6 Lorentz irreducible fields and Klein-Gordon equation ...... 149
5 Causal scalar fields for massive particles 151 5.1 Scalar fields without internal symmetries ...... 151 5.2 Scalar fields with internal symmetries ...... 155 5.3 Scalar fields and discrete symmetries ...... 158
6 Causal vector fields for massive particles 162 6.1 Vector fields without internal symmetries ...... 162 6.2 Spinzerovectorfields ...... 165 6.3 Spinonevectorfields...... 166 6.4 Spin one vector fields with internal symmetries ...... 174 6.4.1 Field equations for spin one particles ...... 175 6.5 Inversion symmetries for spin-one fields ...... 176
7 Causal Dirac fields for massive particles 178 7.1 Spinor representations of the Lorentz group ...... 178 7.2 Some additional properties of the Dirac matrices ...... 182 7.3 The chiral representation for the Dirac matrices ...... 185 7.4 CausalDiracfields ...... 190 7.5 Dirac coefficients and parity conservation ...... 193 7.6 Charge-conjugation properties of Dirac fields ...... 201 7.7 Time-reversal properties of Dirac fields ...... 204 7.8 Majoranafermionsandfields ...... 207 7.9 Scalar interaction densities from Dirac fields ...... 207 7.10TheCPTtheorem ...... 209
8 Massless particle fields 211
9 The Feynman rules 223 9.1 Generalframework ...... 223 9.2 Rules for the calculation of the S matrix ...... 226 − 9.3 Diagrammatic rules for the S matrix ...... 227 − 9.4 Calculation of the S matrixfromthefactorsanddiagrams ...... 229 − 9.5 Afermion-bosontheory ...... 234 9.5.1 Fermion-boson scattering ...... 234 9.5.2 Fermion fermion scattering ...... 238 9.5.3 Boson-bosonscattering ...... 238 9.6 Aboson-bosontheory ...... 238 9.7 Calculation of the propagator ...... 240 9.7.1 Other definitions of the propagator ...... 248 9.8 Feynman rules as integrations over momenta ...... 249 9.9 Examples of application for the Feynman rules with integration over four-momenta variables . . . 252 9.9.1 Fermion-boson scattering ...... 252 9.9.2 Fermion-fermion scattering ...... 255 9.9.3 Boson-bosonscattering ...... 255 9.10 Examples of Feynman rules as integrations over momenta ...... 258 9.10.1 Fermion-boson scattering ...... 259 9.10.2 Fermion-fermion and Boson-boson scattering ...... 260 CONTENTS 5
9.11 Topological structure of the lines ...... 260 9.12 Off-shell and on-shell four-momenta ...... 262 9.12.1 The r thderivativetheorem ...... 264 − 10 Canonical quantization 268 10.1 Canonicalvariables...... 268 10.1.1 Canonical variables for scalar fields ...... 269 10.1.2 Canonical variables for vector fields ...... 271 10.1.3 Canonical variables for Dirac fields ...... 274 10.2 Functional derivatives for canonical variables ...... 275 10.3 FreeHamiltonians ...... 277 10.3.1 Free Hamiltonian for scalar fields ...... 277 10.4 Interacting Hamiltonians ...... 280 10.5 TheLagrangianformalism...... 281 10.6 From Lagrangian to Hamiltonian formalism ...... 283 10.6.1 Setting the Hamiltonian for the use of perturbation theory ...... 286 10.7 Gauges of the Lagrangian formalism ...... 289 10.8 Globalsymmetries ...... 290 10.9 Conserved quantities in quantum field theories ...... 293 10.9.1 Space-time translation symmetry ...... 294 10.9.2 Conserved currents and Lagrangian densities for space-time symmetries ...... 295 10.9.3 Additional symmetry principles ...... 296 10.9.4 Conserved current for a two scalars Lagrangian ...... 298 10.10Lorentzinvariance ...... 300 10.10.1 Currents and time-independent operators ...... 300 10.10.2 Generators and Lie algebra between the homogeneous and inhomogeneous generators . . . 303 10.10.3 Invariance of the S matrix ...... 305 − 10.10.4 Lie algebra within the homogeneous generators ...... 305 10.11The transition to the interaction picture ...... 306 10.11.1 Scalar field with derivative coupling ...... 306 10.11.2 Spin one massive vector field ...... 307 10.11.3 Dirac Fields of spin 1/2 ...... 314 10.12Constraints and Dirac Brackets ...... 316
11 Quantum electrodynamics 323 11.1 Gaugeinvariance ...... 323 11.1.1 Currents and their coupling with Aµ ...... 325 11.1.2 Action for the photons (radiation) ...... 326 11.1.3 General overview of gauge invariance ...... 327 11.2 Constraints and gauge conditions ...... 328 11.3 Quantization in Coulomb Gauge ...... 332 11.3.1 Canonical quantization of the constrained variables...... 332 11.3.2 Quantization with the solenoidal part of Π~ ...... 335 11.3.3 Constructing the Hamiltonian ...... 337 11.4 Formulation of QED in the interaction picture ...... 339 11.5 Thepropagatorofthephoton ...... 342 11.6 Feynman rules in spinor electrodynamics ...... 343 11.6.1 DrawingtheFeynmandiagrams ...... 344 11.6.2 Factors associated with vertices ...... 344 6 CONTENTS
11.6.3 Factors associated with external lines ...... 344 11.6.4 Factors associated with internal lines ...... 345 11.6.5 Construction of the S matrixprocess ...... 346 − 11.7 General features of the Feynman rules for spinor QED ...... 346 11.7.1 Photon polarization ...... 347 11.7.2 Electron and positron polarization ...... 349 11.8 Example of application: Feynman diagrams for electron-photon (Compton) scattering ...... 350 11.9 Calculation of the cross-section for Compton scattering...... 352 11.9.1 Feynman amplitude for Compton scattering ...... 353 11.9.2 Feynman amplitude for the case of linear polarization ...... 356 11.9.3 Differential cross-section for Compton scattering ...... 358 11.9.4 Differential cross-section in the laboratory frame ...... 360 11.10Traces of Dirac gamma matrices ...... 361 11.11Some properties of “slash” momenta ...... 365
12 Path integral approach for bosons in quantum field theory 366 12.1 The general path-integral formula for bosonic operators ...... 367 12.1.1 Probability amplitude for infinitesimal time-intervals...... 369 12.1.2 Probability amplitude for finite time intervals ...... 369 12.1.3 Calculation of matrix elements of operators through the path-integral formalism ...... 371 12.2 Path formalism for the S matrix...... 373 − Chapter 1
Relativistic quantum mechanics
1.1 Survey on quantum mechanics
A Hilbert space , is a complete vector space with inner product. Given two vectors ψ , and φ in such a space, E | i | i there is a complex number φ ψ that satisfies the following axioms h | i φ ψ = ψ φ ∗ h | i h | i φ αψ + βψ = α φ ψ + β φ ψ h | 1 2i h | 1i h | 2i αφ + βφ ψ = α∗ φ ψ + β∗ φ ψ h 1 2 | i h 1 | i h 2 | i ψ ψ 0 ; and ψ ψ = 0 ψ = 0 (1.1) h | i ≥ h | i ⇔ | i where we define the norm of a vector ψ as | i ψ 2 ψ ψ ; ψ 2 = 0 ψ = 0 k| ik ≡ h | i k| ik ⇔ | i which is positive definite i.e. it is positive for any non-zero vector and zero for the null vector. A vector is normalized if its norm is equal to unity. Physical states in quantum mechanics are described by normalized vectors (or kets) on the Hilbert space . Physical observables in quantum mechanics are eigenvalues of hermitian E operators with a complete spectrum (that is, the eigenvectors of each one of these operators form a basis on the Hilbert space). We recall that the adjoint A† of a linear operator A, is another linear operator on a Hilbert space that satisfies the condition Aφ ψ = φ A†ψ ; ψ , φ h | i h ∀ | i | i∈E E further, it also happens that two linearly dependent (normalized) vectors ψ describe the same physical state. | i This fact induces the following definition
Definition 1.1 Given a normalized state ψ , A ray induced by a ket ψ is the set of all normalized vectors | i∈E | i that are linearly dependent with ψ | i iθ ψ e ψ : θ [0, 2π) R| i ≡ | i ∈ n o Two vectors belonging to the same ray describe the same physical states. On the other hand, two vectors belonging to different rays are linearly independent so that they represent different physical states. Therefore, if we think on each ray as a single object, we could say that a given physical state is represented by a single ray and that a given ray represents a unique physical state. This one-to-one relation between rays and physical states justify the introduction of such a concept. For a given observable A, a given ray posseses a unique eigenvalue α if the vectors of are eigenvectors of R R A A ψ = α ψ ψ | i | i ∀ | i ∈ R 7 8 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS the eigenvalues of an hermitian operator are real (this fact is necessary to interpret such eigenvalues as physical observables), and eigenvectors associated with different eigenvalues are orthogonal. When the spectrum of a given observable is degenerate, a given eigenvalue αn could be associated with several linearly independent eigenvectors as follows
A ψm = α ψm ; m = 1,...,g | n i n | n i n where g is the degree of degeneracy. Therefore, there will be g rays m associated with a given eigenvalue α . n n Rn n If a system is in a state described by the ray , and we measure an observable A, the probability of finding R the eigenvalue αk of A is given by
gk P (α )= ψm ψ 2 ; ψ and ψm m k |h k | i| | i ∈ R | k i ∈ Rk m=1 X since observables (hermitian complete linear operators on ), have eigenvectors that form a basis of the sum of E E these probabilities is equal to the unity
gn gn gn P (α ) = ψm ψ 2 = ψ ψm ψm ψ = ψ ψm ψm ψ = ψ I ψ k |h k | i| h | k i h k | i h | | k i h k | | i h | | i m=1 m=1 " m=1 # Xk Xk X Xk X Xk X P (αk) = 1 Xk thus, the feature for the observables of having a complete spectrum, is essential to keep the conservation of probability.
1.1.1 Vector subspaces generated by eigenvalues Let , ,..., be a set of subspaces of a given vector space . We say that is the direct sum of such a {E1 E2 Eq} E E set of subspaces and denote it as = . . . E E1 ⊕E2 ⊕ ⊕Eq if any given arbitrary vector x is expresible in a unique way in the form ∈E x = x + x + . . . + x such that x 1 2 q k ∈Ek in words, given an arbitrary x , it can be expressed by a sum of q vectors x , where each x belongs to a ∈E − { k} k subspace in the set. In addition, there is one and only one vector x belonging to , that can be part of this Ek k Ek decomposition. We say that x is the projection of x into the subspace . k Ek Let A be a linear operator of a vector space into itself. We say that a vector subspace is invariant under E Ek the action of A if for any x , we have that Ax = x . In words, a subspace is invariant under A if by ∈Ek ′ ∈Ek Ek restricting the domain of A to , the resultant range is also contained in . Ek Ek Let A be an observable (hermitian complete linear operator on ). For simplicity, we shall assume that its E spectrum is discrete. Its eigenvalues and eigenvectors are given by
A ψm = a ψm ; m = 1, ..., g | n i n | n i n with gn the degeneracy of the eigenvalue an. By taking all linearly independent vectors of the form
ψ1 , ψ2 , ψgn n n | n i and forming all possible linear combinations (including the null linear combination) we obtain the set of all eigenvectors of A, with eigenvalue a (plus the null vector). This set forms a subspace of , called the vector n E 1.2. SYMMETRIES IN QUANTUM MECHANICS 9 subspace induced by the eigenvalue a of A, and is denoted by . The dimensionality of such a subspace is n Ean clearly the degree gn of degeneracy of an. On the other hand, by virtue of the completeness of A, the set of all its linearly independent eigenvectors forms a basis of . Consequently, given the complete set ψm an arbitrary E {| n i} vector x , can be written in a unique way, as a linear combination of the form ∈E gn ∞ x = β ψm (1.2) n,m | n i Xn=1 mX=1 g1 g2 gn x = β ψm + β ψm + . . . + β ψm + . . . 1,m | 1 i 2,m | 2 i n,m | n i m=1 m=1 m=1 X X X gk x = x + x + . . . + x + . . . ; x β ψm (1.3) 1 2 n k ≡ k,m | k i mX=1 several observations are in order at this step: (1) a given vector x as defined in (1.3) belongs to . (2) Since k Eak the complete set of scalars that define the linear combination in (1.2) is unique (for a given order of the basis), each vector x defined in (1.3) is also unique for a given x. In conclusion, the Hilbert space , can be decomposed k E in a direct sum of subspaces generated by the eigenvalues a of a given observable A { n} = ...... (1.4) E Ea1 ⊕Ea2 ⊕ ⊕Ean ⊕ where the dimension of each is the degree of degeneracy of the eigenvalue a associated. In particular, if the Ean n eigenvalue is non-degenerate, the associated subspace is one-dimensional. Further, it is quite obvious that each subspace is invariant under the observable A. Ean We should keep in mind that the decomposition (1.4) depends on the observable chosen. By choicing another observable B, we should take its eigenvalues b and construct the associated subspaces , in order to construct m Ebm the decomposition of induced by B. E 1.2 Symmetries in quantum mechanics
A (passive) symmetry transformation is a change in the point of view that does not change the results of possible experiments. If an observer O sees the system in a state described for the ray , an equivalent observer O could R ′ see the same physical system in another state described by the ray . Nevertheless, both observers must find the R′ same physics, for example they should find the same probabilities
P ( , α )= P ′, α R k R k indeed this is only a necessary condition. Additional condi tions are necessary for the transformation to be a symmetry transformation1. We shall establish without proof a theorem owe to Wigner concerning with the characterization of possible structures for symmetry operators
Theorem 1.1 The symmetry representation theorem: Any symmetry transformation ′can be characterized R → R by an operator U on the Hilbert space, such that if ψ then U ψ with U being either linear unitary | i ∈ R | i ∈ R′ U (αφ + βψ) = αU (φ)+ βU (ψ) ; φ , ψ , α, β C ∀ | i | i∈E ∀ ∈ Uφ Uψ = φ ψ ; φ , ψ h | i h | i ∀ | i | i∈E or antilinear antiunitary
U (αφ + βψ) = α∗U (φ)+ β∗U (ψ) ; φ , ψ , α, β C ∀ | i | i∈E ∀ ∈ Uφ Uψ = φ ψ ∗ ; φ , ψ h | i h | i ∀ | i | i∈E 1In this context, the rays R and R′ are associated with the same physical state because both are seen from different observers. If they are seen by the same observer and R= 6 R′, they must be associated with different physical states. 10 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS
The adjoint of a linear operator is defined as
Uφ ψ = φ U †ψ ; φ , ψ (1.5) h | i h ∀ | i | i∈E E
We shall see that the relation (1.5) is not consistent for antilinear operators. To prove it, let us consider an arbitrary complex linear combination of the form
φ = α1φ1 + α2φ2 (1.6) substituting (1.6) on the LHS of Eq. (1.5), using the antilinearity of U and the axioms (1.1) we have
U (α φ + α φ ) ψ = α∗Uφ + α∗Uφ ψ h 1 1 2 2 | i h 1 1 2 2 | i U (α φ + α φ ) ψ = α Uφ ψ + α Uφ ψ (1.7) h 1 1 2 2 | i 1 h 1 | i 2 h 2 | i on the other hand, using Eq. (1.5), the antilinearity of U and the axioms (1.1), we can write the same expression as
U (α φ + α φ ) ψ = α φ + α φ U †ψ = α∗ φ U †ψ + α∗ φ U †ψ h 1 1 2 2 | i h 1 1 2 2 1 h 1 2 h 2 E E E U (α1φ1 + α2φ2) ψ = α1∗ Uφ1 ψ + α2∗ Uφ2 ψ (1.8) h | i h | i h | i equating equations (1.7, 1.8) we obtain that αi∗ = αi which cannot be hold by arbitrary complex values of α1, α2. In other words2, the condition (1.5) cannot be satisfied by an antilinear operator because Eq. (1.7) says that the left-hand-side (LHS) of Eq. (1.5) is linear in φ, while Eq. (1.8) says that the same expression is antilinear in φ. Therefore, we shall define the adjoint of an antilinear operator as
Uφ ψ ∗ φ U †ψ = ψ Uφ ; φ , ψ (1.9) h | i ≡ h h | i ∀ | i | i∈E E with this definition, the conditions for both unitarity or antiunitarity take the form
1 U † = U − (1.10) the identity is a trivial symmetry transformation which is linear and unitary. Many symmetries in Physics are continuous in the sense that we can connect the associated operator U with the identity by means of a continuous change in some parameters. This is the case in rotations, translations and Lorentz transformations. In that case, the requirement of continuity demands for the symmetry to be represented by a unitary linear transformation. To see it, we observe that we cannot pass continuously from a linear unitary operator (the identity) to an antilinear antiunitary operator, such a transition requires at least one discrete transformation. Symmetries represented by antilinear antiunitary operators involve a reversal in the direction of time’s flow. If a symmetry transformation is infinitesimally closed to the identity, it can be represented by a linear unitary operator written as U = I + iεT with ε a real infinitesimal quantity. For U to be unitary and linear T must be linear hermitian. Most of observables in quantum mechanics such as the angular momentum, momentum, Hamiltonian etc, arise from symmetry transformations in this way. The set of symmetry transformations satisfy the axioms of a group. If T is a transformation that takes i Rn into , we see that (a) the identity is a symmetry transformation, (b) composition of symmetry transformations Rn′ yields another symmetry transformation, (c) transformations are associative, (d) each symmetry transformation has an inverse that is also a symmetry transformation
2We should take into account that the Hilbert space of quantum mechanics is a complex (rather than real) vector space. 1.2. SYMMETRIES IN QUANTUM MECHANICS 11
The unitary or antiunitary operators U (T ) associated with these symmetry transformations carry the group { i } properties of the set T . However, the operators U (T ) act on vectors of the Hilbert space instead of rays. If { i} T takes into , then when U (T ) acts on a vector ψ , it yields a vector U (T ) ψ . Further 1 Rn Rn′ 1 | ni ∈ Rn 1 | ni ∈ Rn′ if T takes into then U (T ) acting on U (T ) ψ must yield a vector in the ray . On the other hand 2 Rn′ Rn′′ 2 1 | ni Rn′′ T T also takes into . Consequently U (T T ) ψ is also in the ray , so these vectors can only differ by 2 1 Rn Rn′′ 2 1 | ni Rn′′ a phase φn (T2, T1) U (T ) U (T ) ψ = eiφn(T2,T1)U (T T ) ψ (1.11) 2 1 | ni 2 1 | ni we shall see that with one important exception, the linearity or antilinearity of U (T ) tells us that these phases are independent of the state ψ . Consider two linearly independent states ψ and ψ , applying Eq. (1.11) | ni | Ai | Bi to the states ψ and ψ as well as to the state ψ ψ + ψ , we have | Ai | Bi | ABi ≡ | Ai | Bi U (T ) U (T ) ψ = eiφAB (T2,T1)U (T T ) ψ 2 1 | ABi 2 1 | ABi eiφAB (T2,T1)U (T T ) [ ψ + ψ ] = U (T ) U (T ) [ ψ + ψ ] (1.12) 2 1 | Ai | Bi 2 1 | Ai | Bi eiφAB (T2,T1)U (T T ) ψ + eiφAB (T2,T1)U (T T ) ψ = U (T ) U (T ) ψ + U (T ) U (T ) ψ 2 1 | Ai 2 1 | Bi 2 1 | Ai 2 1 | Bi eiφAB (T2,T1)U (T T ) ψ + eiφAB (T2,T1)U (T T ) ψ = eiφA U (T T ) ψ + eiφB U (T T ) ψ (1.13) 2 1 | Ai 2 1 | Bi 2 1 | Ai 2 1 | Bi any linear unitary or antilinear antiunitary operator has an inverse (its adjoint) which is also linear unitary or 1 antilinear antiunitary respectively. Multiplying (1.13) by U − (T2T1) we find
1 iφAB (T2,T1) iφAB (T2,T1) 1 iφA U − (T T ) e U (T T ) ψ + e U (T T ) ψ = U − (T T ) e U (T T ) ψ 2 1 2 1 | Ai 2 1 | Bi 2 1 2 1 | Ai n o n +eiφB U (T T ) ψ 2 1 | Bi o 1 when the operator U − (T2T1) jumps over the complex numbers, the latter become their complex conjugates when the operator is antilinear thus
iφAB (T2,T1) iφAB (T2,T1) iφA iφB e± ψ + e± ψ = e± ψ + e± ψ (1.14) | Ai | Bi | Ai | Bi where the minus sign in the phases occurs when the operator is antilinear. Since ψ and ψ are linearly | Ai | Bi independent, we equate coefficients in (1.14) to obtain
eiφAB = eiφA = eiφB hence the phases are independent of the states and Eq. (1.11) can be written as an equation of operators
iφ(T2,T1) U (T2) U (T1)= e U (T2T1) (1.15) in the case φ = 0, U (T ) provides a representation of the group of symmetry transformations. For non-null phases we obtain a projective representation or a representation up to a phase. The structure of the Lie group cannot tell us whether physical state vectors furnish an ordinary or a projective representation, but it can tell us whether the group has any intrinsically projective representation. The exception for the preceding argument has to do with the fact that it may not be possible to prepare the system in a state represented by ψ + ψ . For instance, it is widely believed that we cannot prepare a system | Ai | Bi in a superposition of two states whose total angular momentum are integer and half-integer respectively. In that case there is a superselection rule between different classes of states and the phases φ (T2, T1) could depend on which of these classes of states are acting the operators U (T2) U (T1) and U (T2T1). It could be shown that any symmetry group with projective representations can be enlarged in such a way that its representations can all be defined as ordinary i.e. with φ = 0, without changing the physical contents. 12 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS
1.3 Irreducible inequivalent representations of groups
For the sake of simplicity, we shall restrict our discussion to finite-dimensional vector spaces but most of our results are applied to infinite-dimensional vector spaces. Let G = T be a group. Each element of the group T can be mapped into a linear operator U (T ) of a { i} n n vector space V onto itself, in such a way that
U (TkTm)= U (Tk) U (Tm) (1.16) relation (1.16) guarantees that the group structure of G is preserved in the mapping. The set of linear operators U (T ) is called a representation of the group G, in the vector space V . If the mapping T U (T ) is one-to-one, { i } i → i we say that the representation is faithful and that there is an isomorphism between G and U (T ) . In that case, { i } both sets are totally identical as groups. However, in some cases, the mapping T U (T ) is not one-to-one, in i → i that case the representation is degenerate and we say that the mapping is a homomorphism. In that case, some information about G is not carried by U (T ) . { i } Let U (T ) be a representation of a group G in a vector space V . If we take an arbitrary non-singular { i } operator Q of V onto itself, it is obvious that the set of operators W (T ) QU (T ) Q 1 also forms a { i } ≡ i − representation. But equally obvious is the fact that this representation does not contain any new information. Conversely, suppose that we have two representations U (Ti) and W (Ti) of the same group in the same { } { 1 } vector space V , if it exists an operator Q such that W (Ti) = QU (Ti) Q− , for all W (Ti) and for all U (Ti) of each representation, then we say that U (T ) and W (T ) are equivalent representations, and we take them { i } { i } essentially as a single representation. However, it happens in some cases that there is not a non-singular operator Q that connects the two repre- sentations U (T ) and W (T ) (defined on the same vector space) in the way described above. In that case we { i } { i } say that we have two (or more) inequivalent representations of G in the vector space V . Let U (T ) be a representation of a group G in a vector space V . Suppose that exists a proper subspace V of { i } k V that is invariant under all linear operators in the set U (T ) . It means that we can restrict to V the domain of { i } k each U (Ti), because the range of each U (Ti) under such a restriction will be contained in Vk. As a consequence, we can form a representation U (T ) of G in the vector space V V . We say that the representation U (T ) { i } k ⊂ { i } of G in V , is reducible because such a representation can be restricted to a proper subspace of V . Even more, suppose that V can be decomposed in non-null vector subspaces V such that { p} V = V V . . . V (1.17) 1 ⊕ 2 ⊕ ⊕ m and that each subspace Vp is invariant under all operators U (Ti). In that case we say that the representation defined in V , is fully reducible into representations defined on each proper subspace Vk. Of course, it could happen that a given subspace Vp could be further reduced in smaller non-null vector subspaces invariant under all U (Ti), so that the representation on Vp is in turn reducible. The idea is to find a decomposition like (1.17) such that none of the subspaces Vp can be decomposed into smaller non-null subspaces in which we can form representations of U (T ) . In that case, we say that our representation is irreducible and that each V is a { i } p minimal invariant subspace under U (T ) . In addition, for most of the cases of interest, the subspaces of the { i } decomposition (1.17) are orthogonal each other, i.e. for any given vector x V and any given vector x V we i ∈ i k ∈ k have that x x =0 if k = i. We say that (1.17) is an orthogonal decomposition and we denote it as V V . h k | ii 6 i ⊥ k We shall assume from now on that we are dealing with orthogonal decompositions unless otherwise indicated. When we have a representation U (T ) of G in V , we can form the matrix representations of each U (T ) by { i } i taking any orthonormal basis V v (1.18) → {| ai} of V (we shall call it the “original” basis). Now, suppose that U (T ) is reducible in V , and that such a { i } reduction can be carried out as in Eq. (1.17). In that case, it is more convenient to choose the basis in the following way: we take an orthonormal basis of the subspace V of dimension d , that is a set of vectors w 1 1 {| 1,r1 i} 1.3. IRREDUCIBLE INEQUIVALENT REPRESENTATIONS OF GROUPS 13
where r1 runs over d1 linearly independent vectors within V1. We proceed in the same way with V2 and so on, then we form a basis for the whole space V as follows
w ; i = 1, 2,...,m ; r = 1, 2,...,d (1.19) {| i,ri i i i} it is easy to see that in this basis ordered as
w , w ,..., w , w , w ,..., w ,..., w , w ,..., w (1.20) {| 1,1i | 1,2i | 1,d1 i | 2,1i | 2,2i | 2,d2 i | m,1i | m,2i | m,dm i} the matrix representatives of each U (Tp) in V are all block-diagonal. To see it, we observe that each Vi is invariant under all U (T ). Hence, w V implies that U (T ) w V and taking into account that w V and p | i,ri i∈ i p | i,ri i∈ i | k,rk i∈ k that V V for i = k, we have i ⊥ k 6 w U (T ) w =0 If i = k h k,rk | p | i,ri i 6 Therefore, the matrix representation of each U (Tp) in the ordered basis (1.20) does not connect two vectors associated with different subspaces Vi and Vk. Thus, we form submatrices associated with each Vk, with zeros in the other entries. Since the basis given by (1.20) simplifies considerably the texture of the matrix representation of the operators U (T ), we call it the canonical basis associated with the representation U (T ) in V . p { p } Let us illustrate these facts with an example. Assume that U (T ) is a representation of a group G on a { i } seven dimensional vector space V , and that V can be decomposed in three minimal orthogonal invariant subspaces under U (T ) , as { i } V = V V V 1 ⊕ 2 ⊕ 3 where V1 is 2-dimensional, V2 is 3-dimensional and V3 is 2-dimensional, let us take an orthonormal basis on each subspace as follows
V w , w ; V w , w , w ; V w , w 1 → {| 1,1i | 1,2i} 2 → {| 2,1i | 2,2i | 2,3i} 3 → {| 3,1i | 3,2i} so that we shall use the following orthonormal ordered basis in V
w , w , w , w , w , w , w {| 1,1i | 1,2i | 2,1i | 2,2i | 2,3i | 3,1i | 3,2i} under this ordered basis, the matrix representative of each U (Ti) will have the following texture
00000 × × 00000 × × 0 0 0 0 A2 2 (U (Ti)) 0 0 × × × × D (U (Ti)) 0 0 0 0 = 0 B3 3 (U (Ti)) 0 (1.21) → × × × × 0 0 0 0 0 0C2 2 (U (Ti)) × × × × 00000 × × 00000 × × D (U (Ti)) = A2 2 (U (Ti)) B3 3 (U (Ti)) C2 2 (U (Ti)) (1.22) × ⊕ × ⊕ × where the “ ” symbol denotes elements that could be non-null. The matrices A2 2, B3 3, C2 2 are the matrix × × × × representatives of each U (Ti) in the subspaces V1,V2,V3 respectively. So the block-diagonal texture of Eq. (1.21), shows that the representation in V can be expressed as a direct sum of representations in V1,V2,V3. Conversely, if we have two (or more) representations in spaces V1,V2, it is clear that we can form a new representation by taking the direct sum of them which is a representation in V V . However, it is equally clear that the new representation 1 ⊕ 2 formed that way does not contain any new information with respect to the component representations. Notwithstanding, we should keep in mind that even if U (T ) in V is reducible, by choicing an arbitrary basis { i } such as (1.18), the matrices will not have the texture (1.21). To exhibit such a texture, an apropriate ordered basis such as the canonical basis given by (1.20) must be chosen. Therefore, changing from our “original” basis to a canonical basis in which the reduction is apparent, is one of the main challenges of group representation theory. 14 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS
The previous discussion, shows us that in characterizing representations of a given group, we should take over two types of redundancies: (a) Given two representations in the same vector space, we consider them different only if they are inequivalent. Equivalent representations are consider as a single one. (b) Given a representation on a vector space V , we should reduce it (if possible) in order to find the irreducible representations. The direct sum of these irreducible representations has no more information with respect to the irreducible ones. Thus, only irreducible representations are considered. Therefore, in characterizing the representations of a given group, we intend to classify all (or as many as possible) irreducible inequivalent representations. The technics and criteria for this classification are out of the scope of the present treatment. By now and for future purposes, we only mention a couple of Lemmas that are crucial in the theory of irreducible representations of groups.
Lemma 1 (Schur’s lemma 1) Let U (G) and U ′ (G) be two irreducible representations of a group G in V and V ′ respectively. Let A be a linear transformation from V to V which satisfies AU (g) = U (g) A for all g G. It ′ ′ ∈ follows that either (i) A = 0, or (ii) A is an isomorphism from V ′ onto V (i.e. V and V ′ are isomorphic) and U (G) is equivalent to U ′ (G).
Lemma 2 (Schur’s lemma 2) Let U (G) be an irreducible representation of a group G on the finite-dimensional vector space V . Let A be an arbitrary operator in V . If A commutes with all the operators in the representation, that is if AU (g)= U (g) A, g G, then A must be a multiple of the identity operator. ∀ ∈ By now we shall only discuss an important consequence of Schur’s lemma 2
Theorem 1.2 All irreducible representations of any abelian group must be 1-dimensional.
Proof: Let U (G) be an irreducible representation of an abelian group G. Let p be a fixed element of the group. Now, U (p) U (g)= U (g) U (p) g G, because of the abelianity of the group. Hence U (p) is an operator ∀ ∈ that commutes with all the U (g)′ s, we conclude from Schur’s lemma 2 that U (p) = λpE. Since p is arbitrary, the representation U (g) is equivalent to the set of operators λ E . But this representation is reducible in { } { p } contradiction with our hypothesis, unless E is the identity in one dimension. Therefore, U (G) is equivalent to the representation p λ C for all p G. QED. → p ∈ ∈ 1.4 Connected Lie groups
These are groups of transformations T (θ) that are described by a finite set of continuous parameters
θ θ1,θ2, ..., θr { }≡ in such a way that each element of the group is continuosly con nected with the identity by a path within the group. The group multiplication rule takes the form
1 2 r 1 2 r T θ¯ T (θ)= T f θ,θ¯ ; f θ,θ¯ f θ,θ¯ , f θ,θ¯ ,...,f θ,θ¯ = θ′ ,θ′ ,...,θ′ (1.23) ≡ where f θ,θ¯ is a set of r functions such that for a given function f a θ,θ¯ an a given couple of sets θ,θ¯ we have f a θ,θ¯ = θ a with θ another set of r parameters. According with (1.23), the set of r functions ′ ′ − f θ,θ¯ provides the law of combination for the two sets of parameters θ and θ¯ which in turn provides the law of combination of the group. By convention, it is customary to choose θa = 0 as the coordinates of the identity, in that case we have
T (θ) = T (0) T (θ)= T (f (0,θ)) ; T (θ)= T (θ) T (0) = T (f (θ, 0)) ⇒ T (θ) = T (f (0,θ)) = T (f (θ, 0)) 1.4. CONNECTED LIE GROUPS 15 consequently f a (θ, 0) = f a (0,θ)= θa (1.24) since these transformations are continuously connected with the identity, they must be represented on the Physical Hilbert space by unitary (rather than antiunitary) operators U (T (θ)). Operators U (T (θ)) can be represented by a power series, at least in a finite neighborhood of the identity
1 U (T (θ)) = I + iθaT + θbθcT + θ3 (1.25) a 2 bc O where Ta and Tbc = Tcb are operators independent of the θs, with Ta hermitian. Suppose that U (T (θ)) provides an ordinary (non-projective) representation of this group of transformations, thus
U T θ¯ U (T (θ)) = U T θ¯ T (θ) and using (1.23) we find U T θ¯ U (T (θ)) = U T f θ,θ¯ (1.26) by expanding condition (1.26) in powers of θ and θ¯, we shall obtain a condition. The expansion of f a θ,θ¯ around the identity (i.e. around θ¯ = θ = 0) up to second order gives ∂f a θ,θ¯ ∂f a θ,θ¯ 1 ∂2f a θ,θ¯ f a θ,θ¯ = f a (0, 0) + θb + θ¯b + θ¯bθc ∂θb ∂θ¯b 2 ∂θ¯b∂θc θ=θ¯=0 θ=θ¯=0 θ=θ¯=0 1 ∂2f a θ,θ¯ 1 ∂2f a θ,θ¯ + θbθc + θ¯bθ¯c + (3) b c ¯b ¯c 2 ∂θ ∂θ 2 ∂θ ∂θ O θ=θ¯=0 θ=θ¯=0
∂f a (0,θ) ∂f a θ,¯ 0 1 ∂2f a θ,θ¯ f a θ,θ¯ = f a (0, 0) + θb + θ¯b + θ¯bθc b ¯b ¯b c ∂θ θ=0 ∂θ 2 ∂θ ∂θ θ¯=0 θ=θ¯=0 1 ∂2f a θ,θ¯ 1 ∂2f a θ,θ¯ + θbθc + θ¯bθ¯c + (3) b c ¯b ¯c 2 ∂θ ∂θ 2 ∂θ ∂θ O θ=θ¯=0 θ=θ¯=0
where (3) denotes terms of third order i.e. proportional to θ¯3,θ3 ,θθ¯2,θ2θ¯. From Eq. (1.24), such an expansion O becomes
f a θ,θ¯ = 0+ δa θb + δa θ¯b + f a θ¯bθc + ga θbθc + ha θ¯bθ¯c + (3) b b bc bc bc O f a θ,θ¯ = θa + θ¯a + f a θ¯bθc + ga θbθc + ha θ¯bθ¯c + (3) (1.27) bc bc bc O 2 a ¯ 2 a ¯ 2 a ¯ a 1 ∂ f θ,θ a 1 ∂ f θ,θ a 1 ∂ f θ,θ f bc ; g bc ; h bc ≡ 2 ∂θ¯b∂θc ≡ 2 ∂θb∂θc ≡ 2 ∂θ¯b∂θ¯c θ=θ¯=0 θ=θ¯=0 θ=θ¯=0
setting θ¯ = 0 in (1.27) we obtain f a (0,θ)= θa + ga θbθc + (3) bc O a thus, in order to be consistent with the condition (1.24) for arbitrary values of θ, we require that g bc = 0. a Similarly, setting θ = 0 we observe that we require that h bc = 0. In other words, the second order terms of the type θbθc or θ¯bθ¯c would violate the condition (1.24), but second order terms of the type θ¯bθc are in principle allowed. Therefore, Eq. (1.27) becomes
1 ∂2f a θ,θ¯ f a θ,θ¯ = θa + θ¯a + f a θ¯bθc + (3) ; f a (1.28) bc bc ¯b c O ≡ 2 ∂θ ∂θ θ=θ¯=0
16 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS
From Eqs. (1.25, 1.28) the RHS of Eq. (1.26) becomes 1 U T f θ,θ¯ = I + if a θ,θ¯ T + f b θ,θ¯ f c θ,θ¯ T + . . . a 2 bc 1 U T f θ,θ¯ = I + i θa + θ¯a + f a θ¯bθ c + . . . T + θb + θ¯b + . . . θc + θ¯c + . . . T + . . . bc a 2 bc h i 1 U T f θ,θ¯ = I + i θa + θ¯a + f a θ¯bθc T + θb + θ¯b θc + θ¯c T + (3) (1.29) bc a 2 bc O h i and substituting (1.25) on the LHS of Eq. (1.26) we have
1 1 U T θ¯ U (T (θ)) = I + iθ¯aT + θ¯bθ¯cT I + iθdT + θeθf T + (3) (1.30) a 2 bc d 2 ef O Substituting (1.29) and (1.30) in Eq. (1.26) yields
1 1 I + iθ¯aT + θ¯bθ¯cT I + iθdT + θeθf T + (3) = I + i θa + θ¯a + f a θ¯bθc T a 2 bc d 2 ef O bc a 1 h i + θb + θ¯b θc + θ¯c T + (3) (1.31) 2 bc O 1 1 I + iθdT + θeθf T + iθ¯aT θ¯aθdT T + θ¯bθ¯cT + (3) = I + iθaT + iθ¯aT + if a θ¯bθcT d 2 ef a − a d 2 bc O a a bc a 1 1 1 + θbθcT + θbθ¯cT + θ¯bθcT 2 bc 2 bc 2 bc 1 + θ¯bθ¯cT + (3) 2 bc O
1 1 I + i θa + θ¯a T + θbθc + θ¯bθ¯c T θ¯bθcT T = I + i θa + θ¯a T + θbθc + θ¯bθ¯c T a 2 bc − b c a 2 bc h i + [if a T + T ] θ¯bθc +h (3)i (1.32) bc a bc O 2 where we have used the fact that indices of sum are dummy and that Tbc = Tcb. The terms of order 1, θ, θ,θ¯ and θ¯2 match automatically in Eq. (1.32). However, by matching coefficients of θθ¯ in such an equation we find a non-trivial condition
T T = [if a T + T ] − b c bc a bc T = T T if a T (1.33) bc − b c − bc a a a therefore given the structure of the group (1.26), i.e. the functions f θ,θ¯ , we have its quadratic coefficient f bc as can be seen in (1.28). From them, the second order terms in U (T (θ)) [Eq. (1.25)], can be calculated from the generators Ta that appear in the first-order terms. Moreover there is a consistency condition: the operator Tbc must be symmetric in b and c since it is the second derivative of U (T (θ)) with respect to θb and θc, as can be seen from Eq. (1.25). Consequently, Eq. (1.33) demands that
T = T = T T if a T (1.34) bc cb − c b − cb a substracting (1.34) from (1.33) we obtain
0 = T T if a T + T T + if a T − b c − bc a c b cb a T T T T = i [ f a + f a ] T b c − c b − bc cb a 1.5. LORENTZ TRANSFORMATIONS 17 and we obtain finally [T , T ]= iCa T ; Ca f a + f a (1.35) b c bc a bc ≡− bc cb the set of commutation relations in (1.35) defines a Lie algebra. It can also be proved that condition (1.35) is the only condition required to ensure that the process can be continued. In other words, the whole power series in (1.25) for U (T (θ)) can be calculated from an infinite sequence of relations like (1.33), as long as we know the first order terms, the generators Ta. It does not necessarily mean that the operators U (T (θ)) are uniquely determined for all θa if we know the generators T a, but it does mean that the operators U (T (θ)) are uniquely determined in at least a finite neighborhood of the coordinates θa = 0 associated with the identity, such that Eq. (1.26) is satisfied if θ, θ¯ and f θ, θ¯ are in this neighborhood. In some cases, it happens that the function f θ,θ¯ satisfies the condition (at least for some subset of the coordinates θa) f a θ,θ¯ = f a θ, θ¯ = θ¯a + θa (1.36) as it is the case in space-time translations or for rotations about a given fixed axis. In that case the coefficients a f bc in Eq. (1.28) vanish and so do the structure constants in (1.35). Hence, all generators commute
[Tb, Tc] = 0 from (1.36) the laws of combination (1.23, 1.26) for the group representation yield
U T θ¯ U (T (θ)) = U T f θ,θ¯ = U T θ¯ + θ = U T θ + θ¯ ¯ ¯ = U T f θ, θ = U (T ( θ)) U T θ U T θ¯ U (T (θ)) = U (T (θ)) U T θ¯ (1.37) ⇒ hence the elements of the group commute each other. Consequently, when the function f (θ) is given by (1.36), the connected Lie group becomes abelian. In that case, we can calculate U (T (θ)) for all θa and not only a neighborhood of the identity. From Eqs. (1.26, 1.36), we find
U (T (θ2)) U (T (θ1)) = U (T (θ1 + θ2))
U (T (θN )) ...U (T (θ2)) U (T (θ1)) = U (T (θ1 + θ2 + . . . + θN )) by defining θ θ/N, we see that for any positive integer N we have i ≡ θ N U (T (θ)) = U T (1.38) N setting N , the angle θ/N becomes infinitesimal. Thus, we can use expansion (1.25) for U (T (θ/N)) keeping →∞ only the terms at first-order in θ. In this way we get
N i a U (T (θ)) = lim 1+ θ Ta N N →∞ a U (T (θ)) = exp [iTaθ ] (1.39)
1.5 Lorentz transformations
Special relativity establishes the existence of certain special reference frames called inertial frames which are in constant relative motion among them. Special relativity is based on two basic postulates: (a) The laws of nature are the same in all inertial reference frames. (b) The speed of light in vacuum, measured in any inertial reference frame is the same regardless of the motion of the light source relative to that reference frame. The second postulate represents a significant deviation with respect to Galilean and Newtonian mechanics. It leads in 18 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS turn to different transformations connecting coordinate systems in different inertial frames. We denote as xµ the coordinates in one inertial frame S, with x1,x2,x3 cartesian space coordinates while x0 = t is a time coordinate, the speed of light will be settled equal to unity. We shall use latin indices such as i, j, k when running over the three space components, and greek indices such as µ, ν, ρ when running over the four space-time indices. Quantities for which we obtain the same value in any inertial frame are called Lorentz invariants. A well-known invariant in special relativity is the quantity
2 2 2 2 2 2 2 (dτ)2 dx1 + dx2 + dx3 (dt)2 = dx1 + dx2 + dx3 dx0 (1.40) ≡ − − where the invariance of this “proper time” is related with the invariance ofc, the speed of light in vacuum. This Lorentz invariant can be written as 100 0 dx1 2 2 1 2 3 0 010 0 dx µ ν (dτ) dx dx dx dx = dx gµν dx ≡ 001 0 dx3 0 0 0 1 dx0 − 100 0 010 0 gµν = (1.41) 001 0 0 0 0 1 − where gµν defined in (1.41) is called the metric tensor. We use from now on a convention of sum over repeated µ ν upper and lower indices. By using this invariant, we can relate the coordinates x of S, with the coordinates x′ in any other inertial frame S′, in the following way
µ ν µ ν gµν dx′ dx′ = gµν dx dx (1.42) this equation leads to ∂x µ ∂x ν ∂xµ ∂xν g ′ ′ = g = g δµ δν µν ∂xρ ∂xσ µν ∂xρ ∂xσ µν ρ σ ∂x µ ∂x ν g ′ ′ = g (1.43) µν ∂xρ ∂xσ ρσ it is also easy to arrive to (1.42) from (1.43). Hence Eqs. (1.43) and (1.42) are equivalent. A light wave travelling at unit speed satisfies dx = 1 g dxµdxν = (dx)2 (dt)2 = 0 dt ⇒ µν −
µ µ and the same holds for S′. Any coordinate transformation x x′ that satisfies Eq. (1.43) is linear → µ µ ν µ x′ = Λ νx + a (1.44)
µ µ 3 with a arbitrary constants, and Λ ν a constant matrix . Restricting for a while to a homogeneous transformation i.e. with aµ = 0, the Jacobian of such a transformation gives µ µ µ ∂x′ ν µ ∂x′ x′ = x ; Λ ∂xν ν ≡ ∂xν µ further, from (1.43) we see that Λ ν satisfies the condition
µ ν gµν Λ ρΛ σ = gρσ (1.45)
3 µ ′ The matrix Λ ν depends on the velocity of S with respect to S. However, since both frames are inertial, such a velocity is constant µ and so Λ ν is. 1.6. THE INHOMOGENEOUS LORENTZ GROUP (OR POINCARE´ GROUP) 19 it is convenient for some purposes to write the Lorentz transformation condition (1.45) in a different way. It is µν easy to check that the matrix gµν in (1.41) coincides with its inverse (that we denote as g ). Multiplying Eq. στ κ (1.45) by g Λ τ we find µ ν στ κ στ κ µ ν κ στ στ κ τ κ gµν Λ ρΛ σ (g Λ τ ) = gρσ (g Λ τ ) gµν Λ ρ (Λ σΛ τ g ) = (gρσg ) Λ τ = δρ Λ τ µ ν κ στ κ κ µ ⇒ νκ µ (gµν Λ ρ) (Λ σΛ τ g ) = Λ ρ = δµ Λ ρ = gµν g Λ ρ µ ν κ στ µ νκ (gµν Λ ρ) (Λ σΛ τ g ) = gµν Λ ρg the relativity principle demands for the Lorentz transformations to have an inverse. Defining M g Λµ , and ρν ≡ µν ρ multiplying with the inverse of this matrix we find4
ν κ στ νκ 1 αρ ν κ στ 1 αρ νκ Mρν (Λ σΛ τ g ) = Mρν g M − Mρν (Λ σΛ τ g )= M − Mρν g α ν κ στ α νκ ⇒ α κ στ ακ δ ν (Λ σΛ τ g ) = δ νg Λ σΛ τ g = g ν κ στ νκ ⇒ Λ σΛ τ g = g (1.46) the condition (1.45) or equivalently (1.46) is usually called the generalized orthogonality condition in the Minkowski metric space.
1.6 The inhomogeneous Lorentz Group (or Poincar´egroup)
The set of all Lorentz (inhomogeneous) transformations (Λ, a) form a group. If we perform a Lorentz transfor- { } mation (1.44) and then a second Lorentz transformation x µ x µ, then the resultant transformation xµ x µ ′ → ′′ → ′′ is described by
µ µ ρ µ µ ρ ν ρ µ x′′ = Λ¯ ρx′ +a ¯ = Λ¯ ρ (Λ νx + a ) +a ¯ µ µ ρ ν µ ρ µ x′′ = Λ¯ ρΛ ν x + Λ¯ ρa +a ¯ (1.47) The bar in Λ¯ is used to distinguish one Lorentz transformation from the other, and same fora ¯, with respect to a. We should show that the effect is the same as a Lorentz transformation xµ x µ. In other words, we have to → ′′ verify that (1.47) defines a Lorentz transformation. To do this, let us define µ Λ¯ µ Λρ Z ν ≡ ρ ν we should check that µ obeys the relation (1.45). For this, we take into account that both Λ and Λ¯ must obey Z ν such a relation g µ ν = g Λ¯ µ Λβ Λ¯ ν Λγ = g Λ¯ µ Λ¯ ν Λβ Λγ µν Z ρZ σ µν β ρ γ σ µν β γ ρ σ = g Λβ Λγ =g βγ ρ σ ρσ ⇒ g µ ν = g µν Z ρZ σ ρσ Hence µ defines a Lorentz transformation. Equation (1.47) shows us how is the composition rule for the Z ν transformations T (Λ, a) induced on physical states T Λ¯, a¯ T (Λ, a)= T ΛΛ¯ , Λ¯a +a ¯ (1.48) each Λ admits an inverse. To find it, we start from the definitio n of inverse and use again the condition (1.45) to write
1 ρ ν ρ ρσ ρσ µ ν Λ− νΛ β = δ β = g gσβ = g gµν Λ σΛ β 1 ρ ν ρσ µ ν Λ− νΛ β = (gνµg Λ σ) Λ β
4 µ µ The matrix Aνρ ≡ gµν Λ ρ = gνµ Λ ρ = (gΛ)νρ is non-singular since both g and Λ are non-singular. Therefore its transpose M ≡ A is also invertible. e 20 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS
Thus, condition (1.45) says that the inverse of Λ takes the form
1 ρ ρ ρσ µ Λ− Λ = g g Λ (1.49) ν ≡ ν νµ σ The reader can also check that (det Λ)2 = 1 (1.50) from the composition law (1.48) it is easy to see what is the identity for the transformations T (Λ, a)
T (1, 0) T (Λ, a) = T (1Λ, 1a + 0)= T (Λ, a) ; T (Λ, a) T (1, 0)= T (Λ1, Λ0 + a)= T (Λ, a) I = T (1, 0) (1.51) ⇒ where 1 is the identity matrix 4 4, and 0 the null 4-components vector. The inverse of a given T (Λ, a) can also × be obtained from
1 1 1 1 1 T Λ− , Λ− a T (Λ, a) = T Λ− Λ, Λ− a Λ− a = T (1, 0) − 1 1 1 − 1 T (Λ, a) T Λ− , Λ− a = T ΛΛ− , Λ Λ− a +a = T (1, 0) −1 1 −1 T − (Λ, a) = T Λ− , Λ− a (1.52) − 1.6.1 Four-vectors and tensors The Lorentz invariant (1.40) suggests to define a Lorentz invariant norm for the 4-components vectors xµ as follows
2 0 2 1 2 2 2 3 2 x (x,x)= x + x + x + x = x′,x′ (1.53) k k ≡ − 0 2 i strictly speaking this is a pseudonorm because it is not positive-definite since in some cases x >x xi. Taking into account (1.41) this relation can be written as µ ν (x,x)= x gµν x (1.54) it is easy to see the invariance of this norm under a homogeneous Lorentz transformation, by using condition (1.45) we find
µ ν µ α ν β µ ν α β α β x′,x′ = x′ gµν x′ = (Λ αx ) gµν Λ βx = (gµν Λ αΛ β) x x = gαβx x µ ν α β x′,x′ = x′ gµν x′ = x gαβx = (x,x) (1.55) a more convenient way to write this norm is the following
(x,x) = xµg xν xµx ; x g xν (1.56) µν ≡ µ µ ≡ µν xµ = x1,x2,x3,x0 ; x = (x ,x ,x ,x )= x1,x2,x3, x0 (1.57) µ 1 2 3 0 − we define a four-vector as any arrangement of four components that under a homogeneous Lorentz transforma- tion, changes under the same prescription of xµ. That is V µ is a four-vector if under a homogeneous Lorentz transformation we have µ µ α V ′ = Λ αV (1.58) µ for any “contravariant” four-vector V ′ we can define a “covariant” four-vector as in Eq. (1.56) V g V ν µ ≡ µν multiplying by the inverse of gµν we obtain the inverse relation
αµ αµ ν α ν α g Vµ = g gµν V = δ νV = V α αµ ⇒ V = g Vµ 1.6. THE INHOMOGENEOUS LORENTZ GROUP (OR POINCARE´ GROUP) 21 we can define the inner product between two (contravariant) four vectors V µ,W µ as
(V,W ) V µg W ν = V µW ≡ µν µ with the same procedure used to prove the Lorentz invariance of (x,x), we see that (V,W ) is also Lorentz invariant. The summation of upper and lowered indices is called a contraction. The Lorentz transformation of a covariant four-vector gives
ν ν α ν αβ βα ν Vµ′ = gµν V ′ = gµν Λ αV = gµν Λ αg Vβ = gµν g Λ α Vβ where we have used the symmetrical nature of gαβ. Applying Eq. (1.49) we obtain
β 1 β Vµ′ = Λµ Vβ = Λ− µVβ (1.59) so covariant four-vectors transform with the inverse transformation with respect to contravariant four-vectors. It justifies the names covariant and contravariant. Two adjacent four-vectors transform as
µ ν µ α ν β µ ν α β V ′ W ′ = (Λ αV ) Λ βW = Λ αΛ βV W (1.60) an arrangement of numbers characterized by two indices of the form T µν is called a second-rank Lorentz tensor if under a homogeneous Lorentz transformation, it changes in a way similar to the two adjacent four-vectors in Eq. (1.60), that is µν µ ν αβ T ′ = Λ αΛ βT (1.61) taking into account the expression (1.49) for the inverse of Λ, we can write this transformation as
µν µ αβ ν µ αβ 1 ν T ′ = Λ αT Λ β = Λ αT Λ− β (1.62) in matrix from it yields 1 Tcont′ = ΛTcontΛ− which is a similarity transformation of Tcont under Λ. We can define in an analogous way second-rank covariant tensors Tµν based on covariant four-vectors. By using Eq. (1.59) we have
1 α 1 β 1 α 1 β Vµ′Wν′ = Λ− µVα Λ− νVβ = Λ− µ Λ− νVαVβ 1 αh 1 β i T ′ = Λ− Λ− T (1.63) ⇒ µν µ ν αβ which can also be written as
1 α 1 β α 1 β Tµν′ = Λ− µTαβ Λ− ν = Λµ Tαβ Λ− ν (1.64) 1 Tcov′ = ΛTcovΛ− (1.65) from Eqs. (1.62, 1.64), it is easy to show that the contraction of two second rank tensors (one contravariant and other covariant) is a Lorentz invariant (also known as a Lorentz scalar or a zero-rank Lorentz tensor)
µν µ αβ 1 ν γ 1 δ µ γ 1 ν 1 δ αβ T ′ Hµν′ = Λ αT Λ− β Λµ Hγδ Λ− ν = Λ αΛµ Λ− β Λ− ν T Hγδ h 1 µ γ ih1 ν δ αβ i = Λ− α Λµ Λ− β Λν T Hγδ γ δ αβ h αβ i µν = δ α δβ T Hγδ = T Hαβ = T Hµν 22 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS thus we can denote this contraction with a quantity without Lorentz indices, and it is equal in any inertial reference frame µν µν T Hµν = T ′ Hµν′ = C
µα in a similar way, we can show that the contraction of a second-rank tensor T with a four-vector Vα, gives a four-vector W µ µα µ T Vα = W (1.66) the previous developments justify the convention that indices may be lowered or raised by contraction with gµν or gµν . For instance T g T µ ; T µ gµσT (1.67) σρ ≡ σµ ρ ρ ≡ σρ a very important four-vector in special relativity is the four-momentum
pµ = p1,p2,p3,p0 = p,p0 (p, E) , p = (p, E) (1.68) ≡ µ − hence p is the three momentum and the energy is the zeroth component. Taking into account the fundamental relation p2 + m2 = E2 (1.69) the pseudo-norm of the four momentum gives
2 p2 = pµg pν = pµp = p0 + p2 = E2 + p2 µν µ − − p2 = m2 (1.70) ⇒ − note that four-momenta with positive pseudonorm leads to m2 > 0, i.e. to a non-physical mass. Thus, physical − states are related with four-momenta with non-positive pseudonorm. It is important to keep in mind that if the metric tensor is chosen as η = (1, 1, 1, 1), we obtain p2 = m2 and the positive values of p2 are the physical µν − − − ones. Hence, it is extremely important to know the conventions used in each case.
1.7 Some subgroups of the Poincar´egroup
The whole group of transformations T (Λ, a) is called the inhomogeneous Lorentz group, or the Poincar´e group. It has several important subgroups. First the set of transformations with a = 0, T (Λ, 0) clearly forms a { } subgroup. To see it we observe first that the identity T (1, 0) of T (Λ, a) is also contained in T (Λ, 0) . Further { } { } Eq. (1.48) gives us the composition law, on this subset
T Λ¯, 0 T (Λ, 0) = T ΛΛ¯ , Λ¯0 + 0 = T ΛΛ¯ , 0 and the composition law is closed within the subset T (Λ, 0) . Finally, the inverse of any given element in { } T (Λ, 0) , also belongs to such a subset as can be seen from (1.52) { } 1 1 1 1 T − (Λ, 0) = T Λ− , Λ− 0 = T Λ− , 0 − this subgroup of the inhomogeneous Lorentz group is called the homogeneous Lorentz group. When we work on the homogeneous Lorentz group, we usually simplify the notation and write
T (Λ, 0) T (Λ) ; I T (1) ≡ ≡ 1 1 T Λ¯ T (Λ) = T ΛΛ¯ ; T − (Λ, 0) = T Λ− 1.7. SOME SUBGROUPS OF THE POINCARE´ GROUP 23 further, we note that Eq. (1.50) gives two possibilities (a) detΛ = +1 (b) detΛ = 1. Those transformations − with det Λ = +1 obviously form a subgroup of either the homogeneous or inhomogeneous Lorentz groups5. On the other hand, by taking the 00 components of Eq. (1.45) we have
µ ν gµν Λ 0Λ 0 = g00 (1.71) expanding the LHS explicitly and taking into account that gµν is diagonal, we obtain
µ ν µ µ 0 0 i i gµν Λ 0Λ 0 = gµµΛ 0Λ 0 = g00Λ 0Λ 0 + giiΛ 0Λ 0 2 g Λµ Λν = Λi Λi Λ0 (1.72) µν 0 0 0 0 − 0 where we are using sum over repeated (upper) indices i. Substituting (1.72) into (1.71) and using g = 1, we 00 − find 2 2 Λi Λi Λ0 = 1 Λ0 =1+Λi Λi 0 0 − 0 − ⇒ 0 0 0 0 2 0 0 similarly from Eq. (1.46) we can find that Λ 0 =1+Λ iΛ i. We obtain finally
0 2 i i 0 0 Λ 0 =1+Λ 0Λ 0 =1+Λ iΛ i (1.73) since the matrices Λµ are real, we have Λ0 Λ 0 0. Hence, we see from (1.73) that Λ0 1. Consequently, we ν i i ≥ 0 ≥ have that either Λ0 1 or Λ0 1. 0 ≥ 0 ≤−
1.7.1 Proper orthochronous Lorentz group 0 µ Those transformations with Λ 0 1 form a subgroup. To show it, we assume by hypothesis that both Λ ν and µ ≥ Λ¯ ν satisfy the conditions Λ0 1, and Λ¯ 0 1 (1.74) 0 ≥ 0 ≥ we have 0 0 µ 0 0 0 i ΛΛ¯ 0 = Λ¯ µΛ 0 = Λ¯ 0Λ 0 + Λ¯ iΛ 0 (1.75) if Λ¯ 0 Λi 0, Eqs. (1.74, 1.75) yield immediately that ΛΛ¯ 0 1. Now we should examine the case in which i 0 ≥ 0 ≥ Λ¯ 0 Λi < 0. First we prove the inequality i 0 a + b + ab a (a + 2) b (b + 2) 0 ; if a 0 and b 0 (1.76) − ≥ ≥ ≥ we prove it as follows p p
(a b)2 0 (a + b + ab)2 a (a + 2) b (b + 2) 0 − ≥ ⇒ − ≥ ⇒ (a + b + ab)2 a (a + 2) b (b + 2) ≥ if a, b are non-negative, we can take positive square roots on both sides and preserve the order relation in the inequality, thus a + b + ab a (a + 2) b (b + 2) ; if a 0 and b 0 ≥ ≥ ≥ then we obtain Eq. (1.76). p p Defining the three vectors
v Λ1 , Λ2 , Λ3 ; ¯v Λ¯ 0 , Λ¯0 , Λ¯ 0 (1.77) ≡ 0 0 0 ≡ 1 2 3 i i 0 0 v = √v v = Λ 0Λ 0 ; ¯v = √¯v ¯v = Λ kΛ k (1.78) k k · k k · 5It is also clear that the set of Lorentz transformationsp with detΛ = −1, does not formp a subgroup of the Lorentz group. For instance, it does not contain the identity. 24 CHAPTER 1. RELATIVISTIC QUANTUM MECHANICS
Eq. (1.73) shows that the lengths of these two three-vectors are given by
v = (Λ0 )2 1 ; ¯v = Λ¯ 0 2 1 (1.79) k k 0 − k k 0 − q q and using the inequality ¯v v ¯v v (1.80) | · | ≤ k k k k substituting (1.77) and (1.79) in (1.80) and taking into account that Λ¯ 0 Λi 0, we have i 0 ≤ Λ¯ 0 , Λ¯ 0 , Λ¯ 0 Λ1 , Λ2 , Λ3 (Λ0 )2 1 Λ¯ 0 2 1 1 2 3 · 0 0 0 ≤ 0 − 0 − q q Λ¯ 0 Λi (Λ0 )2 1 Λ¯ 0 2 1 i 0 ≤ 0 − 0 − q q Λ¯ 0 Λi (Λ0 )2 1 Λ¯ 0 2 1 − i 0 ≥ − 0 − 0 − q q Λ¯ 0 Λi (Λ0 )2 1 Λ¯ 0 2 1 (1.81) i 0 ≥ − 0 − 0 − q q substituting (1.81) in (1.75), we find
0 0 0 0 i ΛΛ¯ 0 = Λ¯ 0Λ 0 + Λ¯ iΛ 0