A Physicist’s Primer
Gordon Walter Semenoff Copyright c 2019 Gordon Walter Semenoff
PUBLISHEDBY PUBLISHER
BOOK-WEBSITE.COM
Licensed under the Creative Commons Attribution-NonCommercial 3.0 Unported License (the “License”). You may not use this file except in compliance with the License. You may obtain a copy of the License at http://creativecommons.org/licenses/by-nc/3.0. Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “ASIS” BASIS, WITHOUTWARRANTIESORCONDITIONSOFANYKIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
First printing, May 2019 Contents
I Many Particle Physics as a Quantum Field Theory
1 Many particle physics ...... 9 1.1 A preview of this chapter9 1.2 Introduction9 1.3 Non-relativistic particles 10 1.3.1 Identical particles...... 13 1.3.2 Spin...... 16 1.4 Second Quantization 17 1.5 The Heisenberg picture 20 1.6 Summary of this chapter 23
2 Degenerate Fermi and Bose Gases ...... 25 2.1 A preview of this chapter 25 2.2 The limit of weak interactions 25 2.3 Degenerate Fermi gas and the Fermi surface 28 2.3.1 The ground state |O > ...... 28 2.3.2 Particle and holes...... 29 2.3.3 The grand canonical ensemble...... 31 2.4 Bosons 33 2.5 Summary of this chapter 38 3 Classical field theory and the action principle ...... 41 3.1 The Action Principle 41 3.1.1 The Action...... 42 3.1.2 The action principle and the Euler-Lagrange equations...... 43 3.1.3 Canonical momenta, Poisson brackets and Commutation relations...... 47 3.2 Noether’s theorem 48 3.2.1 Examples of symmetries...... 49 3.2.2 Proof of Noether’s Theorem...... 50 3.3 Phase symmetry and the conservation of particle number 51 3.4 Translation invariance 53 3.5 Galilean symmetry 54 3.6 Scale invariance 57 3.6.1 Improving the energy-momentum tensor...... 58 3.6.2 The consequences of scale invariance...... 59 3.7 Special Schrödinger symmetry 60 3.8 The Schrödinger algebra 61 3.9 Summary of this chapter 63
II Relativistic Symmetry and Quantum Field Theory
4 Space-time symmetry and relativistic field theory ...... 69 4.1 Quantum mechanics and special relativity 69 4.2 Coordinates 73 4.3 Scalars, vectors, tensors 75 4.4 The metric 76 4.5 Symmetry of space-time 77 4.6 The symmetries of Minkowski space 77
5 The Dirac Equation ...... 79 5.1 Solving the Dirac equation 81 5.2 Lorentz Invariance of the Dirac equation 84 5.3 Phase symmetry and the conservation of electric current 86 5.4 The Energy-Momentum Tensor of the Dirac Field 87 5.5 Summary of this chapter 90
6 Photons ...... 93 6.1 Relativistic Classical Electrodynamics 93 6.2 Covariant quantization of the photon 94 6.2.1 Field equations and commutation relations...... 94 6.2.2 Massive photon (Optional reading)...... 102 6.3 Space-time symmetries of the photon 103 6.4 Quantum Electrodynamics 104 6.5 Summary of this chapter 106
III Functional methods and quantum electrodynamics
7 Functional Methods and Correlation Functions ...... 111 7.1 Functional derivative 111 7.2 Functional integral 113 7.3 Photon Correlation functions 114 7.3.1 Generating functional for correlation functions of free photons...... 117 7.3.2 Photon Generating functional as a functional integral...... 119 7.4 Functional differentiation and integration for Fermions 121 7.5 Generating functionals for non-relativistic Fermions 125 7.5.1 Interacting non-relativistic Fermions...... 127 7.6 The Dirac field 128 7.6.1 Two-point function for the Dirac field...... 128 7.6.2 Generating functional for the Dirac field...... 130 7.6.3 Functional integral for the Dirac field...... 131 7.7 Summary of this chapter 131
8 Quantum Electrodynamics ...... 135 8.1 Quantum Electrodynamics 135 8.2 The generating functional in perturbation theory 139 8.3 Wick’s Theorem 140 8.4 Feynman diagrams 141 8.5 Connected Correlations and Goldstone’s theorem 145 8.5.1 Connected correlation functions...... 145 8.5.2 Goldstone’s Theorem...... 147 8.6 Fourier transform 147 8.7 Furry’s theorem 150 8.8 One-particle irreducible correlation functions 151 8.9 Some calculations 152 8.9.1 The photon two-point function...... 152 8.9.2 The Dirac field two-point function...... 155 8.9.3 Traces of gamma matrices...... 158 8.9.4 Feynman Parameter Formula...... 158 8.9.5 Dimensional regularization integral...... 159 8.10 Quantum corrections of the Coulomb potential 160 8.11 Renormalization 162 8.11.1 The Ward-Takahashi identities...... 163 8.12 Summary of this Chapter 164 9 Formal developments ...... 167 9.1 In-fields, the Haag expansion and the S-matrix 167 9.2 Spectral Representation 168 9.2.1 Gauge invariant scalar operators...... 169 9.2.2 The Dirac field...... 170 9.3 S-matrix and Reduction formula 173 9.3.1 Some intuition about asymptotic behaviour...... 173 9.3.2 In and out-fields in QED...... 174 9.3.3 Proof of the LSZ identities...... 178 9.3.4 An Example: Electron-electron scattering...... 180 9.4 More generating functionals 182 9.4.1 Connected correlation functions...... 182 9.4.2 One-particle irreducible correlation functions...... 184 9.4.3 Charge conjugation symmetry ...... 186 Many Particle Physics as a I Quantum Field Theory
1 Many particle physics ...... 9 1.1 A preview of this chapter 1.2 Introduction 1.3 Non-relativistic particles 1.4 Second Quantization 1.5 The Heisenberg picture 1.6 Summary of this chapter
2 Degenerate Fermi and Bose Gases .. 25 2.1 A preview of this chapter 2.2 The limit of weak interactions 2.3 Degenerate Fermi gas and the Fermi surface 2.4 Bosons 2.5 Summary of this chapter
3 Classical field theory and the action prin- ciple ...... 41 3.1 The Action Principle 3.2 Noether’s theorem 3.3 Phase symmetry and the conservation of particle number 3.4 Translation invariance 3.5 Galilean symmetry 3.6 Scale invariance 3.7 Special Schrödinger symmetry 3.8 The Schrödinger algebra 3.9 Summary of this chapter
1. Many particle physics
1.1 A preview of this chapter In this chapter, we will formulate our first example of a quantum field theory. We begin with the study of a quantum mechanical system with non-relativistic, identical, interacting particles. We will define the problem at hand as that of needing to find a solution of the Schrödinger equation which describes that system, subject to the appropriate boundary conditions. We will discuss the two cases of particle exchange statistics, Fermions and Bosons. We shall then find a way to rewrite the quantum many-particle problem as a quantum field theory. For this, we introduce field operators and the problem is posed as a field equation and commutation relations that the field operators of the quantum field theory should satisfy.
1.2 Introduction In this chapter we will attempt to develop intuition for the answer to the question “what is a quantum field theory”. We will do this by studying a system with many particles. For now, we will assume that the particles are non-relativistic. The generalization to relativistic particles and relativistic quantum field theory will be discussed in later chapters. We will assume that the problem in front of us is quantum mechanical, that is, that we want to find a solution of the Schrödinger equation for the system as a whole and then use that solution, the wave function, to answer questions about the physical state of the system. In order to describe the quantum mechanical problem for a large number of particles in an elegant way, we will develop a procedure which is called “second quantization”. In non-relativistic quantum mechanics, when the total number of particles is finite, second quantization gives an alternative, but at the same time completely equivalent formulation of the problem of solving the Schrödinger equation. This formulation is convenient for some applications, such as perturbation theory which is widely used to study many-particle systems and it can be relevant to many interesting physical scenarios. Metals, superconductors, superfluids, trapped cold atoms and nuclear matter are important examples. The formalism is particularly useful in that it allows us to take the “thermodynamic limit” which is an idealization of such a system that considers 10 Chapter 1. Many particle physics
the limit as both the volume of the system and the total number of particles in the system go to infinity, with the density – the number of particles per unit of volume – kept finite. The system can simplify somewhat in that limit. Moreover, it can be a good approximation to real systems, where the number of particles in a macroscopic system is typically very large, of order Avogadro’s number,6.02 × 1023 and the size of the system is macroscopic, many orders of magnitude greater than the natural sizes of the components of the system, like the Compton wave-lengths of the particles for example. Later, in subsequent chapters, we will generalize the second quantized system that we find in order to make it relativistic, that is, so that it can describe particles with velocities approaching the velocity of light. In this generalization, the analog of second quantization is essential. Relativistic quantum mechanics is necessarily a many-particle theory and the number of particles is always infinite, so there is no convenient description of it using a many-particle Schrödinger equation. In both the relativistic and non-relativistic cases, the second quantized theory is a quantum field theory, that is, a quantum mechanical theory where fields are the dynamical variables. In classical field theory, a field is simply a function of space and time coordinates whose value at a given time and point in space has a physical interpretation. A familiar example of a classical field theory is classical electrodynamics where the electric field and magnetic field are the classical fields. We can think of classical electrodynamics as a mechanical theory where the dynamical variables are these classical fields and the mechanical problem is to determine the time evolution of the dynamical variables, in this case, to determine the electric and magnetic fields as functions of the space and time coordinates. This is done by solving Maxwell’s equations. In a quantum field theory, instead of being ordinary functions, like the electric and magnetic fields which are studied in classical electrodynamics, the fields in a quantum field theory are space and time-dependent operators which act on vectors in a Hilbert space, the space of possible quantum states of the quantum field theory. In such a theory, the physical entities, those attributes which can be measured by doing experiments, for example, are the expectation values and correlations of various operators. We will eventually get a much more precise picture of how this works.
1.3 Non-relativistic particles We will begin by studying the non-relativistic quantum mechanics of a system of identical particles. Particles are identical if all of their physical properties, such as their mass, electric charge, spin, et cetera, are identical. Generally we are interested in describing the behaviour of a large number of such particles. This can have many applications in physics, to any system where many identical degrees of freedom are involved, from the study of the collective properties of the electrons in a metal to the molecules of a gas or a liquid, to the behaviour of a superfluid or a superconductor. Our central goal here is not a comprehensive overview of such applications, which is in itself a fascinating subject, but, rather, our aim is to gain intuition about quantum field theory. We will begin by assuming that we can study a many-particle system by studying its Schrödigner equation. The Schrödinger equation contains the Hamiltonian, which is generally the energy of the system as a function of the dynamical variables. We will assume that the dynamical variables are the momenta and the positions of each of the particles. The kinetic energy of an assembly of particles is given by the sum over their individual kinetic energies
N ~p2 total kinetic energy = ∑ i i=1 2m
where ~pi is the momentum of the i’th particle and each particle has mass m. Particles also have a potential energy by virtue of their mutual interactions. We will assume that the potential energy is a function of the positions of the particles, V(~x1,...,~xN). If the particles 1.3 Non-relativistic particles 11 are identical, this potential energy should be a symmetric function of the positions, in that, if we interchange any two of the positions, the value of the potential is left unchanged. We will also generally assume that the total potential energy is due to two-body interactions, that is, that it can be written as a sum N total potential energy = V(~x1,...,~xN) = ∑ V(~xi,~x j) i< j=1 where V(~xi,~x j) is the energy that is stored in the interaction between particle i and partical j. If the particles are identical
V(~xi,~x j) = V(~x j,~xi) for each pair (i j). We will assume that this is always the case. For the most part, we shall assume that they are functions of relative positions of the particles so that
V(~xi,~x j) = V(~xi −~x j) The Hamiltonian for such a system is given by the sum of the kinetic energy and the interaction energy. It has the form
N 2 N ~pi H(~p1,...,~pN,~x1,...,~xN) = ∑ + ∑ V(~xi −~x j) (1.1) i=1 2m i< j=1
Here,~xi is the position and ~pi is momentum of the i’th particle and the index i runs over the labels of the particles, i = 1,2,...,N. In the quantum mechanics of non-relativistic particles, the positions and momenta are operators. We will temporarily denote operators with a hat, so that they are {~ˆx1,...,~xˆN ~pˆ1,...,~pˆN}. The precise property that defines them as operators are the commutation relations
h a bi a a xˆi ,xˆj = 0 , xˆi , pˆ jb = ih¯δi jδ b , pˆia, pˆ jb = 0 (1.2)
There the labels i, j take values in the set {1,2,...,N} and they label the distinct particles. The indices a,b take the values {1,2,3} and they label the three Cartesian components of the position or momentum vector of each particle. The right-hand-side of the non-zero commutation relations contains Planck’s constant, h¯. It is necessary to find a workable “representation” of the commutation relations (1.2) between a the position and momentum operators. A common way to do this is to think of the operators xˆi and a a pˆi as operating on functions of all of the coordinates, ϕ(~x1,...,~xN), with the operation of xˆi being a simply the multiplication of the function by the variable xi a a xˆi ϕ(~x1,...,~xN) = xi ϕ(~x1,...,~xN) a a and the operation ofp ˆi as proportional to the partial derivative by xi , ∂ pˆiaϕ(~x1,...,~xN) = −ih¯ a ϕ(~x1,...,~xN) ∂xi It is easy to see that this definition reproduces the commutation relation for position and momentum, a a a xˆi , pˆ jb ϕ(~x1,...,~xN) = xˆi pˆ jb ϕ(x1,...,xN) − pˆ jbxˆi ϕ(x1,...,xN)
a h¯ ∂ h¯ ∂ a = xi b ϕ(x1,...,xN) − b [xi ϕ(x1,...,xN)] i ∂x j i ∂x j a = [ih¯δi jδ b]ϕ(x1,...,xN) 12 Chapter 1. Many particle physics
The Hamiltonian in equation (1.1) is a function of positions and momenta. If positions and momenta become operators, the Hamiltonian also becomes an operator,1
2 N ~p N N h2 N ˆ ~ ~ ~ ~ ˆi ~ ~ ¯ ~ 2 ~ ~ H ≡ H(pˆ1,..., pˆN,xˆ1,...,xˆN) = ∑ + ∑ V(xˆi −xˆj) = − ∑ ∇i + ∑ V(xˆi −xˆj) (1.3) i=1 2m i< j=1 i=1 2m i< j=1
The Schrödinger equation determines how the quantum state evolves from an initial time to a later time ∂ ih¯ ψ(~x ,...,~x ,t) = Hˆ ψ(~x ,...,~x ,t) ∂t 1 N 1 N " N 2 N # h¯ ~ 2 ~ ~ = − ∑ ∇i + ∑ V(xˆi − xˆj) ψ(~x1,...,~xN,t) (1.4) i=1 2m i< j=1
Here, ψ(~x1,...,~xN,t) is the wave-function which should be interpreted as the probability amplitude that that particles occupy positions ~x1,...,~xN at time t. It should be normalized so that the total probability is unity, Z 3 3 2 d x1 ...d xN |ψ(~x1,...,~xN,t)| = 1
We can present the Schrödinger equation (1.4) as a time-independent equation by making the ansatz
−iEt/h¯ ψ(~x1,...,~xN,t) = e ψE (~x1,...,~xN)
Then (1.4) implies that
" N 2 # −h¯ ~ 2 EψE (~x1,...,~xN) = ∑ ∇i + ∑V(~xi −~x j) ψE (~x1,...,~xN) (1.5) i=1 2m i< j
The solution of this equation with boundary conditions should give us the wave-functions and the energies, E of stationary states. Here, ψE (~x1,...,~xN) is called an “eigenstate” or “eigenvector” of the Hamiltonian and E is the “eigenvalue” which is associated with it. Wave-functions with different energy eigenvalues are orthogonal, Z 3 3 † d x1 ...d xNψE (~x1,...,~xN)ψE0 (~x1,...,~xN) = δEE0
1 ~ We shall often use the notation where ∇ = (∇1,∇2,∇2) or ∇a with a = 1,2,3 is the gradient operator with components
∂ ∂ ∂ ~∇ ≡ , , ∂x1 ∂x2 ∂x3 a Given that xi , a = 1,2,3, i = 1,2,...,N is the a’th component of the Cartesian coordinates of i’th particle, ∂ ∇ia ≡ a , p ja = −ih¯∇ ja ∂xi We also denote Laplace’s operator as !2 !2 !2 ~ 2 ∂ ∂ ∂ 2 2~ 2 ∇i ≡ 1 + 2 + 3 , ~pi = −h¯ ∇i ∂xi ∂xi ∂xi 1.3 Non-relativistic particles 13
Generally, equation (1.5) is difficult to solve when the interaction potential is non-trivial. In fact, there are very few examples of interaction potentials where one can solve for the wave-functions or the energies exactly. One of them is the case of free particles, when the potential is zero. In that case, the explicit wave-function can be found, it is simply constructed from plane-waves,
1 N ~ i∑i=1 ki·~xi ψE (~x1,...,~xN) = 3 e (2π) 2 and the energy eigenvalue is
N h¯ 2~k2 E = ∑ i i=1 2m
If the initial state, say at time t = 0 is given by a function ψ0(~x1,...,~xN), the wave-function at any time is given by N 3 3 Z d k jd y j 2~ 2 ~ −ih¯ k j t/2mh¯+ik·(~x j−~y j) ψ(~x1,...,~xN,t) = ∏ 3 e ψ0(~y1,...,~yN) j=1 (2π)
In fact, in this simple case, the integrations over~k j can be done to get
N " 3 # Z d y j 2 ψ(~x ,...,~x ,t) = eim|~x j−~y j| /2ht¯ ψ (~y ,...,~y ) 1 N ∏ 2 3 0 1 N j=1 (2πh¯ t/im) 2 This formal expression is a solution of the initial value problem for the quantum state of N free particles.
1.3.1 Identical particles There is one important aspect of the problem which we have ignored until now and which must be discussed here. We have constructed the Hamiltonian so that the particles are identical. They have the same masses and the interaction between any pair of particles is governed by the same two-body potential as the interaction between any other pair of particles. The Hamiltonian is unchanged if we trade the labels on the indices of the particles. That is, if we make the substitution
{~x1,~x2,...,~xN;~p1,~p1,...,~pN} → {~xP(1),~xP(2),...,~xP(N);~pP(1),~pP(2),...,~pP(N)} where the permutation {1,2,...,N} → {P(1),P(2),...,P(N)} is a re-ordering of the integers {1,2,...,N}. There are N! different possible permutations, including the identity. We require that the permutation act on the indices of both the particles and the momenta. This guarantees that the commutation relations (1.2) as well as the Hamiltonian (1.3) are left unchanged by the transformation. This permutation symmetry of the Hamiltonian has an important consequence. Consider the Schrödinger equation (1.5) and let us assume that we manage to solve the equation to find an allowed value of the energy, E, and the wave-function which corresponds to it, ψE (~x1,...,~xN). The permutation symmetry then tells us that ψE (~xP(1),...,~xP(N)) also obeys the same equation, (1.5), for any of the N! distinct permutations. What is more, the normalization of the wave-functions are identical Z 3 3 † d x1 ...d xNψE (~xP(1),...,~xP(N))ψE (~xP(1),...,~xP(N)) Z 3 3 † = d xP−1(1) ...d xP−1(N)ψE (~x1,...,~xN)ψE (~x1,...,~xN) Z 3 3 † = d x1 ...d xNψE (~x1,...,~xN)ψE (~x1,...,~xN) 14 Chapter 1. Many particle physics where, P−1(i) is the integer that P maps onto the integer i and we have used the fact that 3 3 3 3 d xP−1(1) ...d xP−1(N) is an inconsequential re-ordering of d x1 ...d xN. Then, there are two possibilities. The first possibility is that, using permutations, we have found some new quantum states which are not equivalent to the one that we began with. That is, for some permutation, P, the wave function ψE (~x1,...,~xN) and the wave function ψE (~xP(1),...,~xP(N)) are truly distinct wave functions representing distinct quantum states. In order to describe distinct quantum states, the state vectors must be linearly independent. The test for linear independence is to ask whether the equation whether the equation
c1ψE (~x1,...,~xN)) + c2ψE (~xP(1),...,~xP(N)) = 0 (1.6) has a solution where c1 and c2 are not zero. If such a solution exists, they are linearly dependent. If both c1 and c2 must be zero, they are linearly independent. If the states are truly distinct quantum states, the only solution of equation (1.6) has both c1 and c2 equal to zero. Then ψE (~x1,...,~xN) and ψE (~xP(1),...,~xP(N)) are two different quantum states with the same energy eigenvalue E. Let us examine this possibility. If we consider all permutations and find the linearly independent states which are generated, we find a degenerate set of state vectors which are transformed into each other by permutations. The degeneracy would be a prediction of our quantum mechanical model. It is up to us to compare what we find with the real physical system which we are describing in order to see if the degeneracies which would result are indeed there. When the degeneracy is two-fold or greater, the particles which are being described are said to obey “parastatistics”. In parastatistical systems, the degeneracies can depend on the total number of particles. Nature does not seem to make use of parastatistics.2 For any three-dimensional many-particle system, and for any permutation P, the wave function ψE (~x1,...,~xN) and the wave function ψE (~xP(1),...,~xP(N)) are linearly dependent and represent the same quantum state. Such particles are said to be “indistinguishable”. This indistinguishability is extremely important to us. It is responsible for the stability of atoms, for example, via the Pauli exclusion principle applied to identical electrons. Nature would be very different if electrons were distinguishable particles. When particles are indistinguishable, the equation
c1ψE (~x1,...,~xN) + c2ψE (~xP(1),...,~xP(N)) = 0 has a solution where both c1 and c2 are non-zero for any permutation P. Then, the wave-functions must be proportional to each other,
ψE (~xP(1),...,~xP(N)) = c[P] ψE (~x1,...,~xN) where c[P] = − c2[P] . If the wave function is normalized, then |c[P]| = 1 and, considering a c1[P] permutation which, for example, exchanges the positions of just two particles, where doing the permutation twice returns the wave-function to its original form. Then c2[P] = 1 and we would conclude that c[P] = 1 or c[P] = −1. Then, considering the fact that any permutation can be built up out of successive interchanges of pairs of particles, we can see that, for any permutation, there are two possibilities, the first is where the wave-function is a completely symmetric function of its arguments, 3
c[P] = 1 , ∀P
2There are some examples of unusual statistics when the effective dimension of a quantum system is one or two, where permutations have a topological interpretation and the wave-function can have a richer structure. Particles which obey such statistics are called “anyons”. 3We shall use the mathematical symbol ∀ as shorthand for “for all”. 1.3 Non-relativistic particles 15 and
ψE (~x1,...,~xN)) = ψE (~xP(1),...,~xP(N)) for any permutation, P. The particles are called Bosons, or are said to obey “Bose-Einstein statistics”. The second possibility is where the wave-function is a totally anti-symmetric function of the positions arguments,
c[P] = (−1)deg[P] and
deg[P] ψE (~x1,...,~xN) = (−1) ψE (~xP(1),...,~xP(N)) for a permutation P and where the degree deg[P] is the number of interchanges of pairs that are needed to implement the permutation. Particles which obey statistics of this sort are called Fermions, or are said to obey “Fermi-Dirac statistics”. In the quantum many-body systems that are found in nature, particles that have identical properties are identical particles and they are either Fermions of Bosons. Given a solution of the Schrödinger equation we can construct a wave-function for Bosons by symmetrizing over the positions of the particles, so that
ψb(~x1,...,~xN,t) = cb ∑ψ(~xP(1),...,~xP(N),t) P
On the other hand, if the particles that the wave-function is intended to describe are Fermions, then we should anti-symmetrize over the positions of the particles,
deg(P) ψ f (~x1,...,~xN,t) = c f ∑(−1) ψ(~xP(1),...,~xP(N),t) P
Here, the summations are over all N! possible permutations, including the trivial one. In each of these expressions, the constants cb and c f should be adjusted to correctly normalize the resulting wave-function. When the wave-function is either completely symmetric or anti-symmetric, the probability density
† ψ (~x1,...,~xN,t)ψ(~x1,...,~xN,t) is a completely symmetric function of its arguments, (~x1,...,~xN). Since the particles are iden- † 3 3 3 tical, the quantity ψ (~x1,...,~xN,t)ψ(~x1,...,~xN,t)d x1,d x2,...d xN should be interpreted as the probability at time t for finding the system with particles occupying the infinitesimal volumes 3 3 3 d x1,d x2,...d xN which are each centered on the points~x1,...,~xN, respectively, with no reference to which particles occupy which volumes. It should be normalized so that
Z 3 3 † d x1 ...d xNψ (~x1,...,~xN,t)ψ(~x1,...,~xN,t) = 1
This has the interpretation that the total probability for finding the N particles somewhere is equal to one. 16 Chapter 1. Many particle physics
1.3.2 Spin There is one elaboration which we should discuss before proceeding to develop our current discussion further. That is the issue of spin. If we want to describe realistic many-particle systems of atoms or electrons, the particles in question generally have spin and their wave-functions must carry an index to label their spin state. To describe these, we add an index to the total wave-function for each particle, so that the wave-function is
ψσ1σ2...,σN (~x1,~x2,...,~xN,t)
For spin J, the indices σi each run over 2J + 1 values σi = −J,...,J which correspond to the spin states of a single particle. The wave-function of a system of identical particles must then be either symmetric or anti-symmetric under simultaneous permutations of the spin and position variables of the particles. Generally, Bosons have integer spins and Fermions have half-odd integer spin. In summary, for Bosons, J is an integer and
ψσ1...,σN (~x1,...,~xN,t) = ψσP(1)...,σP(N) (~xP(1),...,~xP(N),t) , ∀P For Fermions, J is a half-odd-integer and
deg[P] ψσ1...,σN (~x1,...,~xN,t) = (−1) ψσP(1)...,σP(N) (~xP(1),...,~xP(N),t) , ∀P where, when we implement the permutation, we permute both the spin and the position labels. The Hamiltonian can also have spin-dependent interactions. In that case, the potential energy is generally a hermitian matrix which operates on spin indices. For two-body interactions, the σiσ j two-body gets spin indices as Vρiρ j (~xi −~x j) and its operation on the wave-function is the mapping
J ρiρ j ψσ1...σi...σ j...σN (~x1,...,~xN,t) → ∑ Vσiσ j (~xi −~x j)ψσ1...ρi...ρ j...σN (~x1,...,~xN,t) ρiρ j=−J
We will see shortly that this sort of interaction is very easy to implement in second quantization. Before we continue let us consider a two examples. First, there is a spin-orbit interaction. For such an interaction, we need to understand how to measure the “spin” that is contained in a many-particle 1 wave-function. For this, we assume that the particles have spin J = 2 and we introduce the Pauli matrices, ~σ, as
0 1 0 −i 1 0 σ 1 = , σ 2 = , σ 3 = , (1.7) 1 0 i 0 0 −1
Then, the expectation value of the spin is simple the expectation value of the spin matrix, defined as 1 2~σ for each particle D E ~Σ = Z N 1 dx ...dx σ1...,ρi...σN †(~x ,...,~x ,t)~ τi (~x ,...,~x ,t) (1.8) 1 N ∑ ψ 1 N σρi ψσ1...τi...σN 1 N i=1 2 Here and in the following, we are using the Einstein summation condition for repeated up and down indices. In each term on the right-hand-side, each of the indices σ1,σ2,...,σi−1,σi+1,...,σN and τi and ρi are all summed from −J to J. We have omitted the summation symbols
J J J J J J ∑ ... ∑ ∑ ... ∑ ∑ ∑ σ1=−J σi−1=−J σi+1=−J σN =−J τi=−J ρi=−J 1.4 Second Quantization 17
A typical spin-dependent interaction is the spin-spin interaction which we could add to the spin-independent interaction to get
σ σ σ 1 σ V i j (~x −~x ) = δ σi δ j v (~x −~y) + ~σ σi ·~σ j v (~x −~y) (1.9) ρiρ j i j ρi ρ j 0 4 ρi ρ j ss We leave writing down a spin-orbit interaction as an exercise.
1.4 Second Quantization Second quantization is a technique which summarizes the many-particle quantum mechanical problem contained in (1.4), together with either Bose or Fermi statistics in an elegant way. To implement second quantization, we begin by constructing an abstract basis for the states of the N-particle system. We define the Schrödinger field operator, ψ(~x) which depends on one position variable,~x. In spite of the use of the symbol ψ, this operator should not be confused with a wave-function, it is an operator whose important property is that it obeys the commutation relations which will be listed in equations (1.10) or (1.11) below. There is one such operator for each different kind of identical particle, for example in a gas of electrons where the electron can exist in two spin 1 1 states, the field operator would have the spin index, ψσ (~x) with σ = − 2 , 2 labelling the spin. We shall also need the Hermitian conjugate of the field operator, ψ†σ (~x). This should be regarded as the hermitian conjugate of the operator ψσ (~x) in the sense that
† †σ † † †σ (ψσ (~x)|state >) =< state|ψ (~x) , (ψσ (~x)[operator]) = [operator] ψ (~x) In the case of particles with Bose-Einstein statistics, these operators satisfy the commutation relations
†ρ ρ †σ †ρ ψσ (~x),ψ (~y) = δσ δ(~x −~y), ψσ (~x),ψρ (~y) = 0, ψ (~x),ψ (~y) = 0 (1.10) where, as usual, the square bracket denotes a commutator ([A,B] = AB − BA). In the case of particles with Fermi-Dirac statistics, the commutators should be replaced by anti-commutators so that the operators satisfy the anti-commutation relations
†ρ ρ †σ †ρ ψσ (~x),ψ (~y) = δσ δ(~x −~y), ψσ (~x),ψρ (~y) = 0, ψ (~x),ψ (~y) = 0 (1.11) We use the curly brackets to denote an anti-commutator, ({A,B} = AB + BA). †σ The operators ψσ (~x) and ψ (~x) can be thought of as annihilation and creation operators for a particle at point~x and in spin state σ. To see this, consider the following construction. We begin with a specific quantum state which we shall call the “empty vacuum” |0 >. It is the state where there are no particles at all. Its mathematical definition is that it is the state which is annihilated by the operators ψσ (~x) for all values of the position~x and spin label σ,
ψσ (~x)|0 >= 0 ∀~x,σ (1.12) The adjoint of the above statement is that the Hermitian conjugate and the dual state to the vacuum also have the property
< 0|ψ†σ (~x) = 0 ∀~x,σ (1.13)
Then, we create particles which occupy the distinct points ~x1,...,~xn and are in spin states †σ σ1,...,σN by repeatedly operating ψ i (~xi) on the vacuum state 1 σ1...σN †σ1 †σN |~x1,...,~xN,> = √ ψ (~x1)...ψ (~xN)|0 > (1.14) N! 18 Chapter 1. Many particle physics
σ † σ ...σ Since the operators ψ i (~xi) either commute or anti-commute with each other, |~x1,...,~xN > 1 N is automatically either totally symmetric or anti-symmetric under permutations of the position coordinates and spins and it is therefore appropriate for either Bosons or Fermions, respectively. Similarly,
1 <~x1,...,~xN|σ ...σ = √ < 0|ψσ (~x) ...ψσ (~x1) (1.15) 1 N N! N 1 The inner product is
ρ1...ρN <~x1,...,~xN|σ1...σN |~y1,...,~yN,> 1 ρ ρ = (− )D(P) (~x −~y ) P(1) ... (~x −~y ) P(N) ∑ 1 δ 1 P(1) δσ1 δ N P(N) δσN N! P where D(P) = 0 for Bosons and D(P) = deg[P] for Fermions. σ ...σ In second quantization, the vectors |~x1,...,~xN,> 1 N are used to construct a state of the quantum system in the following way. A candidate for the wave-function of the system is a function of the N positions and the time, ψσ1...σN (~x1,...,~xN,t). We consider a state vector in the Hilbert space of the N-particle system, ψσ1...σN (~x1,...,~xN,t) and we form the quantity Z 3 3 σ1...σN |ψ(t) >= d x1 ...d xNψσ1...σN (~x1,...,~xN,t)|~x1,...,~xN,> (1.16)
There is a one-to-one correspondence between the state vectors |ψ(t) > and the functions ψσ1...σN (~x1,...,~xN,t).
If we have a function, ψσ1...σN (~x1,...,~xN,t), we simply form the corresponding |ψ(t) > by forming the integrals in equation (1.16). If, on the other hand, we are given |ψ(t) >, we can find the function which corresponds to it by taking the inner product,
<~x1,...,~xN|σ1...σN |ψ(t) >= ψσ1...σN (~x1,...,~xN,t) (1.17)
This gives us two languages in which we can discuss the same quantity. Now, let us assume that ψσ1...σN (~x1,...,~xN,t) is the wave-function. That is, it satisfies the Schödinger equation (1.4). Second quantization will then give us the wave-function described as the state Z 3 3 σ1...σN |ψ(t) >= d x1 ...d xNψσ1...σN (~x1,...,~xN,t)|~x1,...,~xN,> (1.18)
Unit normalization of the wave-function ψσ1...σN (~x1,...,~xN,t) results in unit normalization of the state |Ψ(t) >, Z 3 †σ1...σN < Ψ(t)|Ψ(t) >= d x1 ...d~xNψ (~x1,...,~xN,t)ψσ1...σN (~x1,...,~xN,t)
= 1
We can ask the question as to what is the equation which |Ψ(t) > must satisfy that is equivalent to the fact that ψσ1...σN (~x1,...,~xN,t) satisfies the Schrödinger equation. To answer this question, we consider the operator
Z 2 Z 3 h¯ ~ †σ ~ 1 3 3 †σ †ρ σ˜ ρ˜ H = d x ∇ψ (~x) · ∇ψ (~x) + d xd yψ (~x)ψ (~y)V (~x −~y)ψ ˜ (~y)ψ ˜ (~x) 2m σ 2 σρ ρ σ (1.19) 1.4 Second Quantization 19
This operator is the Hamiltonian in the second quantized language. It is easy to see that ψσ1...σN (~x1,...,~xN,t) obeys the Schrödinger equation (1.4) when |Ψ(t) > satisfies the equation ∂ ih¯ |Ψ(t) >= H |Ψ(t) > (1.20) ∂t Furthermore, one can construct an initial state |Ψ(0) > using the initial many-particle wave-function
ψσ1...σN (~x1,...,~xN,t = 0). The state at later times is then uniquely determined by (1.20) which has the formal solution
|Ψ(t) >= e−iHt/h¯ |Ψ(0) > and the wave-function at any time that can be extracted from it by taking the inner product,
ψσ1...σN (~x1,...,~xN,t) =<~x1,...,~xN|σ1...σN |Ψ(t) > and it must coincide with the solution of the many-body Schrödinger equation (1.4). Thus, the mathematical problem of solving the second-quantized operator equation (1.20) is identical in all respects to the mathematical problem of solving the many-particle Schrödinger equation (1.4), they are solved when we find the wave-function ψσ1...σN (~x1,...,~xN,t) or equivalently the state |Ψ(t) >. We thus have two equivalent formulations of the same theory. σ ...σ The state |~x1 ...~xN > 1 N that we have constructed should be thought of as the quantum mechanical state where the N particles can be found occupying the positions~x1,...,~xN and the spin σ ...σ states σ1 ...σN. To see this, we note that |~x1 ...~xN > 1 N is an eigenstate of the density operator, which we form from the product of a creation and annihilation operator,
σ† ρ(~x) = ψ (~x)ψσ (~x)
σ ...σ (where we are using the summation convention for the spin index σ). Operating on |~x1,...,~xN > 1 N , we discover that these states are eigenstates of the density with eigenvalues given by a sum of delta-functions
N ! σ1...σN σ1...σN ρ(~x) |~x1 ...~xn > = ∑ δ(~x −~xi) |~x1 ...~xn > (1.21) i=1 which is just what we would expect for a group of N particles localized at positions~x1,...,~xN. This formula holds for both Bosons and Fermions. The second quantized Schrödinger equation (1.20) does not contain the explicit information that there are N particles. The number of particles can be measured by the number operator which is an integral over space of the density operator, Z 3 †σ N = d xψ (~x)ψσ (~x)
The states, |Ψ(t) >, which we have constructed are eigenstates of the particle number operator,
N |Ψ(t) > = N |Ψ(t) >
Furthermore, the Hamiltonian commutes with the number operator,
[N ,H ] = 0 (1.22)
This can easily be checked explicitly using the algebra for the operators ψ(~x) and ψ†(~x). The result is that the number operator N and the Hamiltonian H can have simultaneous eigenvalues and that the total number of particles will be preserved by the time evolution of the system. 20 Chapter 1. Many particle physics
1.5 The Heisenberg picture Let us pause to review what we have done so far. We have found two different descriptions of the many-particle quantum system. The first one is conventional quantum mechanics where we should solve the partial differential equation to find the wave-function of the many-particle system. We have summarized the relevant equations in the inset below.
Many-particle quantum mechanics The wave-function must obey the Schrödinger equation:
N 2 ∂ −h¯ ~ 2 ih¯ ψσ1...σN (~x1,...,~xN,t) = ∑ ∇i ψσ1...σN (~x1,...,~xN,t) ∂t i=1 2m ρiρ j + ∑Vσiσ j (~xi −~x j)ψσ1...... ρi...ρ j...σN (~x1,...,~xN,t) i< j
The wave-function should be normalized, Z 3 3 †σ1...σN d x1 ...d xNψ (~x1,...,~xN,t)ψσ1...σN (~x1,...,~xN,t) = 1
Bosons have symmetric wave-functions
ψσ1...σN (~x1,...,~xN,t) = ψσP(1)...σP(N) (~xP(1),...,~xP(N),t)
Fermions have anti-symmetric wave-functions
deg(P) ψσ1...σN (~x1,...,~xN,t) = (−1) ψσP(1)...σP(N) (~xP(1),...,~xP(N),t)
for any permutation P.
The other is the second quantized picture, where we are given Hamiltonian and number operators containing fields and the operator nature of the fields is defined by their commutation relations. These and the equation of motion for the quantum state are summarized in the inset below. We have seen in the above development how the mathematical problem which is defined in the inset above and the inset below are equivalent.
Second quantization in the Schrödinger Picture The Schrödinger equation is
∂ ih¯ |Ψ(t) >= H |Ψ(t) > , N |Ψ(t) >= N |Ψ(t) > ∂t The Number and Hamiltonian operators are
Z J 3 †σ N = d x ∑ ψ (~x)ψσ (~x) σ=−J Z 2 Z 3 h¯ ~ †σ ~ 1 3 3 †σ 0 †σ ρρ0 H = d x ∇ψ (~x) · ∇ψ (~x) + d xd yψ (~x)ψ (~y)V 0 (~x −~y)ψ (~y)ψ 0 (~x) 2m σ 2 σσ ρ ρ 1.5 The Heisenberg picture 21
and [N ,H] = 0. For Bosons, the commutation relations are
†ρ ρ ψσ (~x),ψ (~y) = δσ δ(~x −~y) †σ †ρ ψσ (~x),ψρ (~y) = 0 , ψ (~x),ψ (~y) = 0
For Fermions, the anti-commutation relations are
†ρ ρ ψσ (~x),ψ (~y) = δσ δ(~x −~y) †σ †ρ ψσ (~x),ψρ (~y) = 0 , ψ (~x),ψ (~y) = 0
The latter set of equations, those in the inset above, are essentially the definition of a quantum †σ field theory. The quantum fields are the field operators ψσ (~x) and ψ (~x). Note that they do not depend on time. Instead, the state |Ψ(t) > is time dependent. The reason for this is that, like our many-particle problem (1.4), we have formulated the problem in the Schrödinger picture of quantum mechanics where operators are time independent and the states carry the time dependence. The Heisenberg picture is an alternative and equivalent formulation of quantum mechanics. It is related to the Schrödingier picture that we have developed so far by a time dependent unitary transformation of the operators and state vectors. The unitary transformation begins with the observation that, if we know the state of the system at an initial time, say at t = 0, we can find a formal solution of the equation of motion for the state vector,
|Ψ(t)i = e−iHt/h¯ |Ψ(0)i which uses a unitary operator that is obtained by exponentiating the Hamiltonian, exp(−iHt/h¯). We can thus set the state vector to its initial condition (assuming t = 0 is where we must impose an initial condition) by a unitary transformation. Going to the Heisenberg picture simply does this unitary transformation to all of the operators to get an equivalent description of the theory where the operators are time-dependent and the states are independent of time. The unitary transformation of the operators is
0 iHt/h¯ −iHt/h¯ †σ 0 iHt/h¯ †σ −iHt/h¯ ψσ (~x,t) = e ψσ (~x)e , ψ (~x,t) = e ψ (~x)e (1.23)
In the Heisenberg picture, the states are time independent. For a given physical situation the quantum state is simply given by the initial state of the system. The operators, on the other hand, become time dependent and it is their time dependence which carries the information of the time evolution of the quantum system. In quantum field theory, particularly the relativistic quantum field theory which we shall study later on, the equations of motion are more commonly presented in the Heisenberg picture. †σ Unlike the time-independent operators ψσ (~x) and ψ (~x) which we introduced in order to 0 †σ 0 construct second quantization, the Heisenberg picture operators ψσ (~x,t) and ψ (~x,t) now depend on time and their time dependence contains dynamical information, which is determined by equations (1.23). This information can also be given as a differential equation, the Heisenberg equation of motion, which can be obtained by taking a time derivative of equations (1.23).
∂ ∂ ih¯ ψ(~x,t) = [ψ(~x,t),H] , ih¯ ψ†(~x,t) = ψ†(~x,t),H (1.24) ∂t ∂t These are the usual algebraic operator equations which are meant to be solved to find the time dependence of the operators in the Heisenberg picture. In (1.24), and elsewhere when it is clear from 22 Chapter 1. Many particle physics the context, we will drop the prime on the Heisenberg picture fields. They are still distinguished from the Schrödinger picture fields in that they are time dependent and the Schrödinger picture fields are not. The Heisenberg picture field operators have an equal-time commutator algebra, which can be obtained from (1.10) or (1.11) by multiplying from the left and right by e−iHt/h¯ and eiHt/h¯ , respectively. This leads to the canonical equal-time commutation relations for Bosons,
†ρ ρ ψσ (~x,t),ψ (~y,t) = δσ δ(~x −~y) (1.25) †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0 (1.26) or the canonical equal-time anti-commutation relations for Fermions,
†ρ ρ ψσ (~x,t),ψ (~y,t) = δσ δ(~x −~y), (1.27) †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0 (1.28) At this point the reader should take careful note of the fact that this algebra holds only when the times in both of the operators are the same. The time-derivative of the time-dependent field ψσ (~x,t) can be computed from the Heisenberg equation of motion (1.24) using the equal time commutation relations. It is given by an equation which looks like a non-linear generalization of the Schrödinger equation 2 Z ∂ h¯ ~ 2 3 ρρ0 †σ 0 ih¯ + ∇ ψ (~x,t) = d yV 0 (|~x −~y|)ψ (~y,t)ψ 0 (~y,t)ψ (~x,t) (1.29) ∂t 2m σ σσ ρ ρ Again, the non-relativistic quantum mechanics problem is presented as a quantum field theory. The †σ operators ψσ (~x,t) and ψ (~x,t) are the quantized fields. They satisfy the equal time commutation relations in (1.25) and (1.26) for Bosons or the anti-commutation relations (1.27) and (1.28) for Fermions. These define their algebraic properties as quantum mechanical operators. Their time evolution is determined by solving the non-linear field equation (1.29). That field equation has been presented in a standard form, with the “Schrödinger wave operator”
∂ h¯ 2 ih¯ + ~∇ 2 ∂t 2m operating on the field on the right-hand-side and with an additional non-linear interaction term. We should note the similarity of the field equation with the non-linear Schrödinger equation for a single particle. However, as we have said before, ψσ (~x,t) is not a wave-function of a single particle, it is an operator which obeys the equal time (anti-)commutation relations. We thus have our third presentation of the many-particle problem, the field equation of a quantum field theory plus the equal-time commutation or anti-commutation relations which define the quantum fields as operators. We would fix the total number of particles by requiring that states are eigenvectors of the number operator N with eigenvalue N. This is compatible with the field equation when d dt N = 0, which is the case for the example that we are considering. The Heisenberg formulation of the many-particle problem is summarized in the inset below.
Second quantization in the Heisenberg picture The field equation is
2 Z ∂ h¯ ~ 2 3 ρρ0 †σ 0 ih¯ + ∇ ψ (~x,t) = d yV 0 (~x −~y)ψ (~y,t)ψ 0 (~y,t)ψ (~x,t) ∂t 2m σ σσ ρ ρ 1.6 Summary of this chapter 23
Equal-time commutation relations for Bosons are
†ρ ρ ψσ (~x,t),ψ (~y,t) = δσ δ(~x −~y), †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0
Equal-time anti-commutation relations for Fermions are
†ρ ρ ψσ (~x,t),ψ (~y,t) = δσ δ(~x −~y), †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0
d Since dt N = 0 and we could fix the total particle number by requiring that states are eigen- states of N with eigenvalue N.
1.6 Summary of this chapter In a quantum mechanical system of N identical non-relativistic particles with spin, the wave-
function, ψσ1...σN (~x1,...,~xN,t), is a function of time, t, the positions,~x1,...,~xN, and it depends on the spin states σ1 ...σN of the particles. If a particle has spin J, its index σ runs over 2J + 1 values. It must solve the Schrödinger equation,
N 2 ∂ −h¯ ~ 2 ih¯ ψσ1...σN (~x1,...,~xN,t) = ∑ ∇i ψσ1...σN (~x1,...,~xN,t) ∂t i=1 2m ρiρ j + ∑Vσiσ j (~xi −~x j)ψσ1...ρi...ρ j...σN (~x1,...,~xN,t) i< j with appropriate boundary conditions. Here, we have assumed a two-body interaction (that is, the interaction of any two particles does not depend on the positions and spins of the other particles). If the identical particles are Bosons, the wave-function is completely symmetric under the simul- taneous permutations of the labels of spins and positions, σ1~x1 ...σN~xN → σP(1)~xP(1) ...σP(N)~xP(N). If the identical particles are Fermions, it is completely anti-symmetric. Wave-functions should be normalized, Z 3 3 †σ1...σN d x1 ...d xN ψ (~x1,...,~xN,t)ψσ1...σN (~x1,...,~xN,t) = 1
The equivalent second quantized theory in the Schrödinger picture has the state vector |Ψ(t) > obeying the Schrödinger equation and being an eigenstate of the number operator with eigenvalue N, ∂ ih¯ |Ψ(t) >= H |Ψ(t) > , N |Ψ(t) >= N |Ψ(t) > ∂t where the number and Hamiltonian operators are Z 3 †σ N = d xψ (~x)ψσ (~x) Z 2 Z 3 h¯ ~ †σ ~ 1 3 3 †σ †σ 0 ρρ0 H = d x ∇ψ (~x) · ∇ψ (~x) + d xd yψ (~x)ψ (~y)V 0 (~x −~y)ψ 0 (~y)ψ (~x) 2m σ 2 σσ ρ ρ The quantized fields obey the commutation relations for Bosons:
†ρ ρ ψσ (~x),ψ (~y) = δσ δ(~x −~y) †σ †ρ ψσ (~x),ψρ (~y) = 0 , ψ (~x),ψ (~y) = 0 24 Chapter 1. Many particle physics or the the anti-commutation relations for Fermions:
†ρ ρ ψσ (~x),ψ (~y) = δσ δ(~x −~y) †σ †ρ ψσ (~x),ψρ (~y) = 0 , ψ (~x),ψ (~y) = 0
We use the notation [A,B] ≡ AB − BA for a communator and {A,B} ≡ AB + BA are for an anti- commutator.
The equivalent second quantized theory in the Heisenberg picture is defined by the field equation:
2 Z ∂ h¯ ~ 2 3 ρρ0 †σ 0 ih¯ + ∇ ψ (~x,t) = d yV 0 (~x −~y)ψ (~y,t)ψ 0 (~y,t)ψ (~x,t) ∂t 2m σ σσ ρ ρ and, if the particles are Bosons, the equal-time commutation relations for the time-dependent fields,
†ρ ρ ψσ (~x,t),ψ (~y,t) = δσ δ(~x −~y), †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0 or, if the particles are Fermions, the equal-time anti-commutation relations for the time-dependent fields,
†ρ ρ ψσ (~x,t),ψ (~y,t) = δσ δ(~x −~y), †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0
The wave equation and equal time commutation or anti-commutation relations above are a definition of the many-particle problem which is closest in spirit to a quantum field theory. In this formalism, it will turn out that the number operator, constructed from the time-dependent fields, Z 3 †σ N = d xψ (~x,t)ψσ (~x,t) (1.30) is independent of the time. If an initial state is an eigenstate of N with eigenvalue N, it will remain so at later times. Moreover, we can set the time that is inside the integral in (1.30) to whatever value we choose. We can thus show that it obeys the algebra
ρ† ρ† [N ,ψσ (~x,t)] = −ψσ (~x,t) , N ,ψ (~x,t) = ψ (~x,t) which tells us that if |ϕ > is an eigenstate of N with eigenvalue N,
N |ϕ >= N|ϕ >
ρ† then ψσ (~x,t) and ψ (~x,t) are a lowering and raising operators for particle number,
ρ† ρ† N ψ (~x,t)|ϕ >= (N + 1)ψ (~x,t)|ϕ > N ψσ (~x,t)|ϕ >= (N − 1)ψσ (~x,t)|ϕ >
N is a positive semi-definite operator and that its lowest eigenvalue is the empty vacuum, which obeys ψ(~x,t)|0 >= 0 for all values of ~x,t. Then N |0 >= 0 and all of the eigenvalues of N are non-negative integers. 2. Degenerate Fermi and Bose Gases
2.1 A preview of this chapter In this chapter, we will study the Heisenberg representation quantum field theories of non-relativistic many-particle systems what we developed in the previous chapter in the limit where the volume is very large, the density is finite and the inter-particle interactions is weak. This will introduce the idea of Fermi energy and Fermi surface for Fermions, the concept of particles and holes, and it will allow us to study some of the properties of a weakly interacting Fermi gas. We will also introduce the concept of a Bose condensate for a many-boson system and study the low energy excitations of a wealky interacting system of Bosons.
2.2 The limit of weak interactions As an example of our use of a quantum field theory to describe a quantum mechanical system with many identical particles, let us the special case where the particles interact with each other so weakly that, to a first approximation, we can ignore the interactions. In the Hamiltonians which we discussed in the previous chapter, this happens when we can ignore the terms containing the interaction potential V(~x −~y). We will assume that the particles are either Bosons or Fermions. The beginning of our development applies equally well to both cases. However, as we shall see later, there are dramatic differences in the low energy states of a system of Fermions or a system of Bosons. We will work in the Heisenberg picture of quantum mechanics. In that picture, the quantum field is a time- (as well as space-) dependent operator which obeys the wave equation
∂ h¯ 2 ih¯ + ~∇2 ψ (~x,t) = 0 (2.1) ∂t 2m σ It also obeys equal time commutation relations (for Bosons)
†ρ ρ 3 ψσ (~x,t),ψ (~y,t) = δσ δ (~x −~y) (2.2) †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0 (2.3) 26 Chapter 2. Degenerate Fermi and Bose Gases or the equal tme anti-commutation relations (for Fermions)
†ρ ρ 3 ψσ (~x,t),ψ (~y,t) = δσ δ (~x −~y) (2.4) †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) } = 0 (2.5)
We will take equations (2.1), (6.42) and (6.43) or equations (2.1), (2.4) and (2.5) as the definition of the quantum field theory and, in the following, we will proceed to find a solution of it. Here, 1 we have retained the spin index, σ,ρ,... to denote the spin state. If the spin is 2 , as it is for an 1 1 electron, this index runs over the two values, − 2 , 2 denoting the two spin states. For a spin J atom, this index will run over 2J + 1 values which label the 2J + 1 different spin states. To simplify the notation, we will generally consider spinless Bosons, so in formulae which are specific to Bosons, we will drop this index from the field operators. It is straightforward to generalize to Bosons with spin if it is needed. The field equation (2.1) is a linear partial differential equation which we can easily solve using a Fourier transform. Here, we are assuming that the three dimensional space is open infinite Euclidean space, called R3, and that the fields have boundary conditions such that their Fourier transform exists. A general solution of the wave equation is
3 2~ 2 Z d k i~k·~x−i h¯ k t/h 2m ¯ ~ ψσ (x,t) = 3 e ασ (k) (2.6) (2π) 2 If this were a wave equation for a classical field, and if (2.6) were a classical solution of that classical field equation, (2.6) is a complete solution in the sense that the function of wave-numbers, ~ ασ (k), can be completely determined by initial data. To do this, we take the Fourier transform of the field at an initial time. For example, if we know that at an initial time, say t = 0, the field ~ is given by the function ψinσ (~x), then we would determine ασ (k) by taking an inverse Fourier transform of equation (2.6) with respect to the space variables, and with the time set to t = 0 to get
Z 3 ~ d x −i~k·~x ασ (k) = 3 e ψσ (~x,0) (2π) 2 ~ We would use this formula to determine the function of wave-vectors, ασ (k). Plugging the result into equation (2.6) then determines the solution of the classical differential equation. However, here, the differential equation is one which must be obeyed by the field operators, so ~ we have a slightly different sense as to how equation (2.6) is a solution of the problem. Now, ασ (k) is a wave-number-dependent operator. As well as solving the wave equation, which equation (2.6) ~ accomplishes, we must determine the properties of the operators, ασ (k), so that the field ψσ (~x,t) in equation (2.6) satisfies the commutation relations (6.42) and (6.43) or the anti-commutation relations (2.4) and (2.5) . ~ †σ ~ It will indeed satisfy those relations if ασ (k) and α (k) satisfy
h ~ †ρ i ρ 3 ~ ασ (k),α (~p) = δσ δ (k −~p) (2.7) h ~ i h †σ ~ †ρ i ασ (k),αρ (~p) = 0 , α (k),α (~p) = 0 (2.8) for Bosons or
n ~ †ρ o ρ 3 ~ ασ (k),α (~p) = δσ δ (k −~p) (2.9) n ~ o n †σ ~ †ρ o ασ (k),αρ (~p) = 0 , α (k),α (~p) = 0 (2.10) 2.2 The limit of weak interactions 27 for Fermions. To see this explicitly for the case of Bosons, consider
3 3 Z d k Z d p ~ h¯~k2 h¯~p2 h i †ρ ik·~x−i 2m t −i~p·~y+i 2m t ~ †ρ ψσ (~x,t),ψ (~y,t) = 3 3 e e ασ (k),α (~p) (2.11) (2π) 2 (2π) 2
Z 3 Z 3 ~ 2 2 d k d p ~ h¯k h¯~p ρ ik·~x−i 2m t −i~p·~y+i 2m t 3 ~ = 3 3 e e δσ δ (k −~p) (2.12) (2π) 2 (2π) 2 3 Z d k ~ = δ ρ eik·(~x−~y) = δ ρ δ 3(~x −~y) (2.13) σ (2π)3 σ where we have used the fact that the Fourier transform of the Dirac delta function is given by the formula 3 Z d k ~ eik·(~x−~y) = δ 3(~x −~y) (2π)3 Then, in order to complete the solution of the problem, we must find the Hilbert space on ~ †σ ~ which ασ (k) and α (k) operate. This is straightforward, and similar to what we have done in the previous section for the Schrödinger picture field operators (in fact these are the Fourier transform of the field operators at time t = 0 where they should be identical to those operates). We begin with the state where there are no particles at all, the empty vacuum |0 >. Here, we will define it as the ~ ~ state which is annihilated by all of the annihilation operators ασ (k) for all values k, σ, ~ ~ ασ (k)|0 >= 0 for all k,σ Similarly
< 0|α†σ (~k) = 0 for all~k,σ
Then, we construct the multi-particle states by repeatedly operating on the vacuum state with the creation operator α†σ (~k), 1 σ1σ2...σN †σ1 †σ2 †σN |~k1,~k2,...,~kN > = √ α (~k1)α (~k2)...α (~kN)|0i (2.14) N! Because the creation operators either commute with each other for Bosons or they anti-commute with each other for Fermions, the states in equation (2.14) are either totally symmetric or totally an- tisymmetric, respectively, under permuting indices of the momenta and the spins. For a permutation P of the numbers {1,2,...,N}, and for Bosons,
~ ~ ~ σP(1)σP(2)...σP(N) ~ ~ ~ σ1σ2...σN |kP(1),kP(2),...,kP(N) > = |k1,k2,...,kN > and for Fermions
~ ~ ~ σP(1)σP(2)...σP(N) deg[P] ~ ~ ~ σ1σ2...σN |kP(1),kP(2),...,kP(N) > = (−1) |k1,k2,...,kN > where deg[P] is the degree of the permutation, the number of interchanges of neighbouring integers which must be made in order to implement the permutation. The dual states are
~ ~ ~ 1 ~ ~ ~ < k1,k2,...,kN|σ σ ...σ = √ < 0|ασ (kN)...ασ (k2)ασ (k1) 1 2 N N! N 2 1 and
ρ ρ ...ρ 0 ~ ~ ~ 0 1 2 N < k1,k2,...,kN|σ1σ2...σN |~p1,~p2,...,~pN > = 28 Chapter 2. Degenerate Fermi and Bose Gases
δ 0 ρ ρ = NN (− )σ[P] (~k −~p )... (~k −~p ) P(1) ... P(N) ∑ 1 δ 1 P(1) δ N P(N) δσ1 δσN N! P
where σ[P] = 0 for Bosons and σ[P] = deg[P]. When we plug the solution (2.6) into the Hamilto- nian, we get
Z 2 2~ 2 3 h¯ k †σ ~ ~ H0 = d k ∑ α (k)ασ (k) (2.15) σ=1 2m
Here we have used the subscript on H0 to denote the free field Hamiltonian. Also, the number operator is
Z 2 3 †σ ~ ~ N = d k ∑ α (k)ασ (k) (2.16) σ=1 The quantum states that we have constructed are eigenstates of both the Hamiltonian and the number operator,
N h¯ 2~k2 ~ ~ ~ σ1σ2...σN i ~ ~ ~ σ1σ2...σN H0|k1,k2,...,kN > = ∑ |k1,k2,...,kN > i=1 2m
σ1σ2...σN σ1σ2...σN N |~k1,~k2,...,~kN > = N |~k1,~k2,...,~kN >
Note that the energy of a basis state is given by the sum of the energies of the particles in the state. This is a result of the fact that the particles are not interacting, so their total energy is just the sum of their individual energies. What is more, there individual energies are entirely due to their kinetic energy, which for a non relativistic particle is ~p2/2m where, where the momentum is given in terms of the wavenumber by ~p = h¯~k. The general solution of the time-dependent Schrödinger equation for the quantum state of the system is
2~ 2 N h¯ ki Z −i ∑i=1 t/h¯ 3 3 2m ~ ~ ~ ~ σ1...σN |Ψ(t)i = d k1 ...d kN e φσ1...σN (k1,...,kN)|k1,...,kN > (2.17)
~ ~ The function φσ1...σN (k1,...,kN) is totally symmetric for Bosons and totally antisymmetric for Fermions. It is the initial value of the many-body wave-function of the system. That is, the function ~ ~ φσ1...σN (k1,...,kN) must be determined by initial data. The appropriate initial data would be the ~ ~ quantum state at an initial time, say t = 0, from which one could obtain φσ1...σN (k1,...,kN) by doing ~ ~ a Fourier transform of the initial wave-function ψσ1...σN (k1,...,kN,t = 0). The time-dependent state vector in equation (2.17) is a complete solution of the problem in the Schrödinger picture. For illustrative purposes, we can also consider the equivalent in the Heisenberg picture. There, it is the time dependent operator in equation (2.6).
2.3 Degenerate Fermi gas and the Fermi surface 2.3.1 The ground state |O > There is a profound difference between the ground state of a system of Bosons and a system of Fermions. When the particles are Bosons, the creation operators commute with each other and, h i2 for example, α†(~k) simply creates two particles in the state with wave-number~k. The lowest energy state simply has all of the Bosons in the single-particle state which has the lowest energy, N |O >= α†(0) |0 >. 2.3 Degenerate Fermi gas and the Fermi surface 29
On the other hand, when the particles are Fermions, their many-particle wave-function must be anti-symmetric. This is reflected in second quantization by the fact that the operators which create the Fermions anti-commute with each other. Then the algebra of creation operators in imply 0 0 h i2 α†σ (~k)α†σ (~k0) = −α†σ (~k0)α†σ (~k) and α†σ (~k) |0 >= 0. We cannot create a state where two Fermions have the same spin σ and the same wave-vector~k. This is the manifestation of the Pauli principle in this second quantized framework: two Fermions cannot occupy the same quantum state. This means that, in the ground state, the Fermions must have distinct wave-vectors and spins. Then the lowest energy state of a system of free Fermionic particles must be gotten by populating 2~ 2 N h¯ ki the spin states and all of the the wave-vector states that are closest to zero, so that E = ∑i=1 2m is minimal. These wave-vectors occur in the interior of a sphere with radius kF , called the Fermi wave-vector,
J 1 † |O >= ∏ ∏ α σ (~k)|0 > (2.18) c =−J |~k|≤kF σ
where the constant c should be chosen so that the state is normalized.
There is one particle in each state with wavenumber of magnitude less than or equal to kF . Here hk¯ F is called the Fermi momentum and kF is the Fermi wavenumber. It is the upper bound of wave-numbers that Fermions can have in the ground state. The highest energy states that are populated have the energy
h¯ 2k2 ε = F (2.19) F 2m
is called the Fermi energy. The boundary of the set of occupied states in wave-vector space, those states with |~k| = kF and which have the Fermi energy is called the Fermi surface.
2.3.2 Particle and holes
In all of the discussion so far, we have been assuming that the particles which we are studying occupy infinite three dimensional space. Moreover, their wave-vectors also occupy infinite three dimensional space and their spectrum is continuous. This makes the expression (2.18) that we used to define the Fermion ground state problematic in that it is a product over a continuously infinite number of operators. Of course, we could always make sense of it by the common trick of assuming that the space were not quite infinite, but a large but finite box where the fields has, for example, periodic boundary conditions. Then, the wave-vectors would be discrete and equation (2.18) would be an ordinary product over a very large number of discrete factors. Instead of this, we will find a simpler, albeit more formal way to deal with the definition of the ground state which remains in infinite volume and avoids infinite products altogether. In a construction such as this, the normalization constant c in equation (2.18) would be equal to one. We shall re-define what we mean by annihilation and creation operator for a particle as follows: as
~ ~ †σ ~ †σ ~ ~ ασ (k) = aσ (k) , α (k) = a (k) , |k| > kF (2.20) σ ~ †σ ~ † ~ ~ ~ β (k) = a (k) , βσ (k) = aσ (k) , |k| ≤ kF (2.21) 30 Chapter 2. Degenerate Fermi and Bose Gases
We now have two sets of creation and annihilation operators with the anti-commutator algebra
†ρ ρ ~ ασ (k),α (q) = δσ δ(k −~q) (2.22) †σ †ρ ασ (k),αρ (q) = 0 , α (k),α (q) = 0 (2.23) n σ † o σ ~ β (k),βρ (q) = δ ρ δ(k −~q) (2.24)
σ ρ n † † o {β (k),β (q)} = 0 , βσ (k),αρ (q) = 0 (2.25) ρ ρ† ρ {ασ (k),β (q)} = 0 , α (k),β (q) = 0 (2.26) n † o n ρ† † o ασ (k),βρ (q) = 0 , α (k),βρ (q) = 0 (2.27)
The reader should note well that we have not introduced any new concept here. We have simply re-labeled some of the same creation and annihilation operators that we had previously defined. The reason for this re-labeling was so that, using the definition in equation (2.18), we can see that the Fermion ground state is annihilated by the new annihilation operators,
~ ρ ~ ~ ασ (k)|O >= 0 , β (k)|O >= 0 , for all k,σ,ρ (2.28)
Then, there are apparently two types of excitations of this ground state. One is obtained by creating a particle in a wave-vector state that is outside of the Fermi surface. This is done with the creation †σ operator α (~k) (which has |~k| > kF by definition (2.21)). Such an excitation is called a “particle”. The other excitation is gotten by annihilating a particle which is already contained in |O > and † ~ whose wave-vector is inside the Fermi surface. This is done by operating with βρ (k) (which has |~k| < kF by definition (2.21)). Such an excitation is called a “hole”. We will also assume that the ground state is normalized so that
< O|O >= 1 (2.29)
The field operator is given
3 2~ 2 Z d k i~k·~x−i h¯ k t/h 2m ¯ ~ ψσ (x,t) = e ασ (k) + ~ 3 |k|>kF (2π) 2 3 2~ 2 Z d k −i~k·~x−i h¯ k t/h¯ + e 2m β †(~k) (2.30) ~ 3 σ |k| Z 3 3 †σ d k (2J + 1)kF ρ =< O|ψ (~x,t)ψσ (~x,t)|O >= (2J + 1) 3 · 1 = 2 k We can solve this equation to determine Fermi wave-number kF and the Fermi energy εF in terms of the density, 1 2 6π2ρ 3 h¯ 2 6π2ρ 3 k = , ε = F 2J + 1 F 2m 2J + 1 2.3 Degenerate Fermi gas and the Fermi surface 31 The ground state energy U = uV is the expectation value of the Hamiltonian, and the internal energy density u is 5 h¯ 2 (2J + 1)h¯ 2 6π2ρ 3 u = < O|~∇ψ†σ (~x,t) ·~∇ψ (~x,t)|O >= (2.31) 2m σ 20π2m 2J + 1 In fact, we can plug the expression for the field operator in equation (2.30) into the particle number and the Hamiltonian to find the expressions Z Z 3 †σ ~ ~ 3 † ~ σ ~ N = ρV + d kα (k)ασ (k) − d kβσ (k)β (k) k>kF k Z 2~ 2 Z 2~ 2 3 h¯ k †σ ~ ~ 3 h¯ k † ~ σ ~ H = uV + d k α (k)ασ (k) − d k βσ (k)β (k) k>kF 2m k 2.3.3 The grand canonical ensemble In many practical circumstances in the quantum field theory of Fermions, rather than fixing the total number of particles, as we have been doing so far, it is useful to consider an open system where particles can enter and leave the system. In this case, it is advantageous to study the system with a modified Hamiltonian H0 = H − µN (2.32) so that the expectation value of H0 is the appropriate “free energy” that is needed in order to study an open system. In that case, the parameter µ is the chemical potential. In principle, it can be adjusted in order that the system has a given density. If we use H0 to generate the time evolution of the fields, the field equation becomes ∂ h¯ 2 ih¯ + ~∇ 2 + µ ψ (~x,t) = 0 (2.33) ∂t 2m σ The chemical potential has the a thermodynamic definition, ∂u µ = ∂ρ V where u is the internal energy which we found in equation (2.35) in the previous section. Using equation (2.35), we find that, for our example of non-interacting Fermions at zero temperature, h¯ 2k2 µ = F = ε 2m F 32 Chapter 2. Degenerate Fermi and Bose Gases that is, the chemical potential is equal to the Fermi energy. The chemical potential has the statistical mechanics interpretation as the energy that is gained by adding a single particle to the system. In this example, this is clearly equal to the Fermi energy, since if we add one particle, we must add it with an energy greater than or equal to the Fermi energy. The solution of the field equation (2.33) is 3 2~ 2 3 2~ 2 Z d k i~k·~x−i h¯ k −ε t/h Z d k −i~k·~x−i h¯ k −ε t/h 2m F ¯ ~ 2m F ¯ † ~ ψσ (x,t) = e ασ (k) + e β (k) ~ 3 ~ 3 σ |k|>kF (2π) 2 |k| The grand canonical potential Φ = φV is given by the expectation value of the Hamiltonian H0 in equation (2.32), which we can find as 5 (2J + 1)h¯ 2 6π2ρ 3 φ = u − µρ = − (2.35) 30π2m 2J + 1 Notice that, if we use the thermodynamic definition of the pressure of the Fermi gas is given by 5 ∂ (2J + 1)h¯ 2 6π2ρ 3 P = − (uV) = (2.36) ∂V N 30π2m 2J + 1 we see that, the grand canonical potential for a fee Fermi gas is given by Φ = −PV and the grand canonical potential density is given by the negative of the pressure, φ = −P. As we observed at the end of the last section, when we plug the solution (2.34) into the Hamiltonian, we get Z 2 2~ 2 ! 0 3 h¯ k †σ H = d k − εF α (k)ασ (k) ~ ∑ |k|≥kF σ=1 2m Z 2 2~ 2 3 h¯ k † σ + d k − εF β (~k)β (~k) − PV (2.37) ~ ∑ σ |k|≤kF σ=1 2m 2 2 h¯ ~k − The absolute value 2m εF is a positive number, so both particles and holes have positive energies, when the energy is defined in this way. The states of the quantum theory are found by beginning with the ground state, |O >, and † ~ †σ ~ operating with the creation operators ασ (k) and β (k). A basis for the Fock space is | >, †σ (k)| >, †(k)| >, †σ1 (k ) †σ2 (k )| >, † (k ) † (k )| >, O α O βσ O α 1 α 2 O βσ1 1 βσ2 2 O †σ1 (k ) † (k )| >, †σ1 (k ) †σ2 (k ) †σ3 (k )| >,... α 1 βσ2 2 O α 1 α 2 α 3 O The quantum state for a particle is † ασ (k)|O > 2.4 Bosons 33 It has has energy and particle number which we can obtain by operating the Hamiltonian and particle number operators (with ground state energy and particle number subtracted) 2~ 2 ! 0 † h¯ k † (H + PV)α (k)|O >= − εF α (k)|O > (2.38) σ 2m σ † † ~ (N −Vρ)ασ (k)|O >= (+1) ασ (k)|O > , |k| > kF (2.39) where we remember that −PV is the ground state energy and Vρ is the ground state particle number. 2 2 ~ h¯ ~k ~ We see that a particle in momentum state k has positive energy, 2m − εF > 0 when |k| > kF , and its particle number is one. The quantum state for a hole is β †σ (k)|O > It has energy and particle number h2~k2 0 †σ ~ ¯ †σ ~ (H + PV)β (k)|O >= − εF β (k)|O > (2.40) 2m †σ †σ (N −Vρ)β (~k)|O >= (−1) β (~k)|O > , |~k| ≤ kF (2.41) 2 2 h¯ ~k − Its energy is also positive, given by 2m εF and its particle number is negative one. 2.4 Bosons Now, if instead of Fermions, we examine the states of Bosons, we find that the many-particle state at first sight has a much simpler structure. Arbitrarily many particles can occupy the lowest energy state, so the ground state of a system of free Bosons is given by 1 N |O >= √ α†(~k = 0) |0 > (2.42) N! This state is an eigenstate of the Hamiltonian, H0 in (2.15), with eigenvalue equal to zero and it is an eigenstate of the particle number operator, N in (2.16), with eigenvalue equal to N. This state, where the Bosons have a macroscopic occupation of a single eigenstate of the Hamiltonian, usually the ground state, is called a “Bose-Einstein condensate”. The macroscopic occupation of a single quantum state gives the Bose-Einstein condensate profound properties. It is a superfluid, which is a fluid with vanishing viscosity. It can flow past barriers without friction or dissipation. There is a beautiful argument due to Landau which relates superfluidity to the spectrum of small oscillations of the fluid, the so-called quasi-particles. Let us briefly review Landau’s argument which is based on Galilean relativity. First of all, let us review some facts about Galilean relativity. (There will be a more detailed presentation of Galilean relativity in the next chapter.). In Newtonian mechanics, the momentum and the energy of a mass M, moving with velocity~v are given by 1 ~P = M~v , E = Mv2 , (2.43) 2 respectively. According to Galilean relativity, if we view the same particle from a different reference frame, one which is moving with velocity ~V with respect to the first frame, the momentum and energy will be ~P0 = ~P − M~V (2.44) 1 1 E0 = M(~v −~V)2 = E −~P ·~V + MV 2 (2.45) 2 2 34 Chapter 2. Degenerate Fermi and Bose Gases Equations (2.44 and (2.45) tell us how to transforms the momentum and the energy when we view the system in a reference frame which is moving with velocity~v. Now, let us consider a fluid flowing through a capillary with uniform velocity~v with respect to the walls of the capillary. We begin by viewing the fluid in its own rest frame. In its rest frame, it has vanishing velocity and momentum and it has energy E0, the ground state energy of the static fluid. A superfluid will flow through a capillary without dissipation. Let us assume that our fluid is not a superfluid, that is, that the motion of the fluid is dissipative. Let us also assume that the process by which it dissipates is the production of ripples in the fluid, called quasi-particles. Let us assume that, in a small enough interval of time, only one quasi-particle is produced. The quasi-particle has momentum ~p and energy ω(p), which is a function of its momentum. After the quasi-particle is produced, in the rest frame of the fluid, the total momentum is that of the quasi-particle, ~p, and the total energy is that of the fluid at rest plus the quasiparticle energy, E0 + ω(p). By the transformation of Galilean relativity in equations (2.44) and (2.45), in the rest frame of the capillary whose velocity is ~V = −~v, the total momentum and energy are ~P0 = ~p − M~v (2.46) 1 E0 = E + ω(p) −~p ·~v + Mv2 (2.47) 0 2 We should compare this with the same motion where no quasi-particle is produced, and the momentum and energy would be ~P˜0 = −M~v (2.48) 1 E˜ 0 = E + Mv2 (2.49) 0 2 This process of producing a quasi-particle will proceed if it is energetically favourable. In the rest frame of the capillary. This is so if the energy of the state where the quasi-particle was produced is less than the energy of the state where it was not produced, E0 ≤ E˜ 0, that is if 1 1 E + ω(p) −~p ·~v + Mv2 ≤ E + Mv2 (2.50) 0 2 0 2 or if ω(p) ≤ ~p ·~v, at least for some values of ~p. This can happen when ω(p) v ≥ v ≡ minimum of (2.51) c p The last inequality tells us that dissipation is allowed only when the fluid velocity exceeds a 2 minimum critical velocity, vc. This critical velocity could vanish, for example, if ω(p) ∼ p , as it does for a normal fluid. On the other hand, if ω(p) ∼ p for small p = |~p|, the critical velocity could be non-zero and dissipation is not allowed for fluid flows with velocities smaller than the critical one. This is Landau’s criterion for a superfluid, that vc > 0. Let us now study the Bose-Einstein condensate in more detail. The ground state that we have written down in equation (2.42) has a fixed number of particles, N. What is more, all of the particles have vanishing kinetic energy and we have considered non-interacting particles so that they have no potential energy, so they are eigenstates of the free field theory Hamiltonian with vanishing energy. What is more, the energy does not depend on the total number of particles. In an open system, particles can wander in and out of the system without changing the energy. This means that the chemical potential is zero. We would thus expect an open system to be a superposition of states with different numbers of particles, rather than (2.42) we would have ∞ cN N |O >= ∑ √ α†(~k = 0) |0 > (2.52) N=0 N! 2.4 Bosons 35 In such a state, the field operator has an expectation value ∞ N ∗ p h ~ † ~ i < O|ψ(~x,t)|O >= ∑ NcNcN+1 (N + 1) < 0|α(k = 0)α (k = 0)|0 > (2.53) N=0 where, also ∞ N 2 h ~ † ~ i ∑ |cN| < 0|α(k = 0)α (k = 0)|0 > = 1 N=0 needs to be defined using a regularization. . There is no way to determine the coefficients cN in the context of free field theory, they are simply arbitrary and the ground state of an open system of free Bosons at zero temperature is not unique. To fix this ambiguity and to make our considerations more realistic, but still solvable, we shall consider the system with a small, positive chemical potential and a weak, repulsive interaction, so that the Hamiltonian is Z h¯ 2 λ H0 = d3x ~∇ψ†(x) ·~∇ψ(x) − µψ†(x)ψ(x) + ψ†(x)ψ†(x)ψ(x)ψ(x) (2.54) 2m 2 We are considering the case where both µ and λ are positive parameters. We have approximated the interactions by a delta-function two-body potential V(~x −~y) = λδ(~x −~y) and we will take the limit of this theory where the interaction is weak, that is, where λ is sufficiently small that the interaction can be treated as a perturbation. Of course, λ is a constant with dimensions, so to say that λ is small means that it is smaller than other quantities with the same dimensions that we could make out of the other parameters of the theory .1 As we have stated above, once the system is open, the ground state is no longer an eigenstate of particle number but it can be a superposition of states with different particle numbers as in equation (2.52). We have also emphasized that the ground state cannot be determined by free field theory alone and interactions are needed. However, once interactions are present and they play an important role, equation (2.52) is no longer a good characterization of the ground state. On the other hand, our description of the many-particle theory of weakly interacting Bosons using quantum field theory is a useful starting point. To begin, we observe that a characteristic of a state which is a superposition of states with different particle numbers is the fact that the field operator has an expectation value, < O|ψ(~x,t)|O >= η(~x,t) (2.55) When this is the case, we can separate the field operator into a classical and quantum part, ψ(~x,t) = η(~x,t) + ψ˜ (~x,t) where < O|ψ˜ (~x,t)|O >= 0 1 When written in terms of the s-wave scattering length, a, λ = 4πma , the criterion for weak coupling is that h¯ 2 1 aρ 3 << 1. 36 Chapter 2. Degenerate Fermi and Bose Gases In the weak coupling limit, the classical part of the field operator satisfies the classical field equation, ∂ h¯ 2 ih¯ η(~x,t) = − ~∇2 − µ η(~x,t) + λ|η(~x,t)|2η(~x,t) ∂t 2m The solutions of the above equation are r µ η = 0 , η = eiθ λ where the phase θ is not fixed by the equation. When both λ and µ are positive, the solution with q µ lower grand canonical potential is the second one, with η = λ . Here, we have made a choice of the phase. Then, at very weak coupling, the particle density and ground state energy density of the system are gotten by plugging this classical value of the field into the number density and the energy density to get µ ρ = λ λ λ P = −φ = µρ − ρ2 = ρ2 2 2 Let us assume that the field operator at time zero is given by 3 Z d k ~ ψ˜ (~x,0) = eik·~xα(~k) (2π)3 with the commutation relations h i h i α(~k),α(~`) = 0 = α†(~k),α†(~`) h i α(~k),α†(~`) = δ 3(~k −~`) When we plug r µ ψ(~x,0) = + ψ˜ (~x,0) λ √ into the Hamiltonian (2.54) and expand in powers of λ, we obtain µ2 H = − V + . 2λ Z 2 √ 3 h¯ † † µ † † † d x ~∇ψ˜ ·~∇ψ˜ − µψ˜ ψ˜ + ψ˜ ψ˜ + ψ˜ ψ˜ + 4ψ˜ ψ˜ + O( λ) (2.56) 2m 2 ! λ Z h¯ 2~k2 = − ρ2V + d3k + µ α†(~k)α(~k)+ 2 2m Z µ h i √ + d3k α†(~k)α†(−~k) + α(−~k)α(~k) + O( λ) (2.57) 2 2.4 Bosons 37 The Hamiltonian no longer has the form of an energy times the number operator α†α. We need to do a change of variables in order to get it in this form. Consider the transformation a(~k) coshϕ sinhϕ α(k) = (2.58) a†(−k) sinhϕ coshϕ α†(−k) α(~k) coshϕ −sinhϕ a(k) = (2.59) α†(−k) −sinhϕ coshϕ a†(−k) where ϕ is a function of |~k|. This is called a Bogoliubov transformation. Its specific form is designed to preserve the commutation relations, so that the new variables also obey h i h i a(~k),a(~`) = 0 = a†(~k),a†(~`) h i a(~k),a†(~`) = δ 3(~k −~`) for which we need the property cosh2 ϕ − sinh2 ϕ = 1. We assume that ϕ is a real function of |~k|. a(~k) coshϕ sinhϕ α(k) = (2.60) a†(−k) sinhϕ coshϕ α†(−k) α(~k) coshϕ −sinhϕ a(k) = (2.61) α†(−k) −sinhϕ coshϕ a†(−k) When we substitute into the Hamiltonian, we obtain ! µ2 Z h¯ 2~k2 H = − V + d3k + µ cosh2 ϕa†(~k)a(~k) + sinh2 ϕa(−~k)a†(−~k) + 2λ 2m Z µ h i + d3k cosh2 ϕa†(~k)a†(−~k) + cosh2 ϕa(−~k)a(~k) 2 Z µ h i + d3k sinh2 ϕa(−~k)a(~k) + sinh2 ϕa†(~k)a†(−~k) + 2 ! Z h¯ 2~k2 + d3k + µ coshϕ sinhϕ a†(~k)a†(−~k) + a(−~k)a(~k) + 2m Z µ h i + d3k coshϕ sinhϕ a†(−~k)a(−~k) + a(~k)a†(~k) + a†(~k)a(~k) + a(−~k)a†(−~k) (2.62) 2 The off-diagonal terms are proportional to ! µ h¯ 2~k2 cosh2 ϕ + sinhϕ2 + + µ coshϕ sinhϕ 2 2m ( ! ) 1 h¯ 2~k2 = .µ cosh2ϕ + + µ sinh2ϕ 2 2m and, we adjust ϕ so that this quantity vanishes, µ tanh2ϕ = − h¯ 2~k2 2m + µ 38 Chapter 2. Degenerate Fermi and Bose Gases Then, when we plug this solution into the Hamiltonian, we find that the Hamiltonian is !! µ2 Z d3k h¯ 2~k2 H = − V +V E(k) − + µ 2λ 2(2π)3 2m Z √ + d3kE(k)a†(~k)a(~k) + O( λ) (2.63) where the new energies are v u !2 u h¯ 2~k2 E(k) = t + µ − µ2 (2.64) 2m Here, we have been careful to keep track of terms which are produced by changing the order of a†(~k) and a(~k). The new excitation which is called a “quasi-particle” has a dispersion relation, for small |~k|, like a sound wave, E(k) ∼ vS|~k|. By Landau’s criterion, s E(k) λρh¯ 2 v = minimum of = = v c k m S the critical velocity is just equal to the quasi-particle velocity. The weakly interacting Bose gas is a superfluid with critical velocity given by the expression above. 2.5 Summary of this chapter In the absence of interactions, the field equation of a gas of non-relativistic particles is ∂ h¯ 2 ih¯ + ~∇2 + µ ψ (~x,t) = 0 ∂t 2m σ For Fermions, µ = εF , the Fermi energy and the field equation has the solution 3 2~ 2 Z d k i~k·~x−i h¯ k −ε t/h 2m F ¯ ~ ψσ (x,t) = e ασ (k)+ ~ 3 |k|>kF (2π) 2 3 2~ 2 Z d k −i~k·~x−i h¯ k −ε t/h¯ + e 2m F β †(~k) ~ 3 σ |k| where the creation and annihilation operators for particles and holes satisfy the algebra n ~ †ρ o ρ 3 ~ ασ (k),α (~p) = δσ δ (k −~p) n ~ o n †σ ~ †ρ o ασ (k),αρ (~p) = 0 , α (k),α (~p) = 0 n σ ~ † o σ 3 ~ β (k),βρ (~p) = δρ δ (k −~p) n σ ~ ρ o n † ~ † o β (k),β (~p) = 0 , βσ (k),βρ (~p) = 0 (2.65) n ~ ρ o n ~ † o ασ (k),β (~p) = 0 , ασ (k),βρ (~p) = 0 n †σ ~ ρ o n †σ ~ † o α (k),β (~p) = 0 , α (k),βρ (~p) = 0 2.5 Summary of this chapter 39 The ground state obeys ~ σ ~ ~ ασ (k)|O >= 0 , β (k)|O >= 0 , for all k , σ The Hamiltonian and number operator are diagonal Z 2 2~ 2 ! 0 3 h¯ k †σ H = d k − εF α (k)ασ (k) ~ ∑ |k|≥kF σ=1 2m Z 2 2~ 2 3 h¯ k † σ + d k − εF β (~k)β (~k) − PV ~ ∑ σ |k|≤kF σ=1 2m Z Z N = d3k α†(k)α(k) − d3k β †(~k)β(~k) + ρV |~k|>kF |~k|≤kF 2 2 1 h¯ 2 3 2 3 where ρ is the density, εF = 2m 3π ρ , kF = 3π ρ and the equation of state of a cold Fermi gas is 2 2 2 (3π ) 3 h¯ 5 P = ρ 3 5 m 1 Here we have assumed that the spin J = 2 . The ground state of a Bose gas with a weak repulsive interaction is a Bose-Einstein condensate where the field operator has a classical part, ψ(~x,t) = η(~x,t) + ψ˜ (~x,t) , ψ†(~x,t) = η∗(~x,t) + ψ˜ †(~x,t) with η(~x,t) =< O|ψ(~x,t)|O > , < O|ψ˜ (~x,t)|O >= 0 At sufficiently weak coupling, η(~x,t) satisfies the classical equation of motion, ∂ h¯ 2 ih¯ + ~∇2 + µ η(~x,t) = λη†(~x,t)η(~x,t)η(~x,t) (2.66) ∂t 2m which is called the Gross-Pitaevskii equation and, to the leading order in λ, the operator Ψ(~x,t) = ψ˜ (~x,t) satisfies the Bogoliubov-de Gennes equation ψ˜ †(~x,t) h¯ 2 ~ 2 ∗ 2 ! ∂ − 2m ∇ − µ + 2η η η ih¯ Ψ(~x,t) = 2 Ψ(~x,t) + ... (2.67) ∂t ∗2 h¯ ~ 2 ∗ −η 2m ∇ + µ − 2η η where corrections are small when λ is small and one must find a solution which obeys Ψ(~x,t) = 0 1 Ψ†(~x,t). A translation invariant solution of (2.66) is η = pµ/λ. The leading orders in 1 0 the density, the energy density and the pressure are obtained by plugging the classical solution ψ(~x,t) ∼ η into the number operator and Hamiltonian, 2 µ 4πh¯ 2 1 ρ = → µ = ρ 3 aρ 3 λ m 40 Chapter 2. Degenerate Fermi and Bose Gases 2 λ 2 2πh¯ 5 1 u = ρ = ρ 3 aρ 3 2 m 2 2πh¯ 5 1 P = ρ 3 aρ 3 m 4πh¯ 2a where we have used the expression λ = m with a the s-wave scattering length. These are the 1 first terms in an expansion in the dimensionless number aρ 3 and corrections to these formula are suppressed by higher powers of this constant. The corrections to the internal energy are known to the next order, 2 2 3 2πh¯ 5 1 128 1 2 u = ρ 3 aρ 3 1 + √ aρ 3 + ... m 15 π Moreover, the solution of the Bogoliubov-de Gennes equation yields the quasi-particle spectrum v 2 u 2 ! 2 u h¯ ~k2 h¯ p E(~k) = t + µ − µ2 ∼ 2πaρ |~k| + ... 2m 2m which, in agreement with Landau’s argument for the existence of a superfluid state, is linear in the wave-number for small wave-numbers. 2T. D. Lee and C. N. Yang, Phys. Rev. 105, 1119 (1957). ;T. D. Lee, K. Huang, and C. N. Yang, Phys. Rev. 106, 1135 (1957). 3. Classical field theory and the action principle 3.1 The Action Principle In the previous chapters, we have formulated the quantum mechanical many-particle system as a quantum field theory. The quantum field theory consisted of a field equation, which was a non-linear partial differential equation, in our example ∂ h¯ 2 ih¯ + ~∇ 2 + µ ψ (~x,t) = λψ†ρ (~x,t)ψ (~x,t)ψ (~x,t) (3.1) ∂t 2m σ ρ σ which the field operators must satisfy together with some boundary conditions. 1 In addition to the field equation, the field operators were required to satisfy equal time commutation or anti-commutation relations which defined the nature of the operators themselves. In this chapter, we shall examine an alternative way of encoding the information that is contained in the field equation and commutation relations. We will begin with a classical field theory which is specified by writing down an action functional and then we will derive the field equation using the action principle. The action principle stipulates that the action functional is stationary when it is evaluated on those fields which are solutions of the equations of motion of the classical field theory, with appropriate boundary conditions. The classical action also contains information about the fields when they are viewed as generalized dynamical variables. From it, we can identify the generalized coordinates and their canonical momenta and as well as their Poisson brackets. This will give us a classical field theory which we could then quantize by the standard procedure of replacing the classical fields by quantum mechanical operators and Poisson brackets by commutators or anti-commutators. In this approach, the essential information that we need to define the quantum field theory, as it is defined in equations (??)-(??), is encoded in the classical action. One great benefit of being able to derive the field equations of the quantum field theory from an action principle is that the symmetries of the theory are symmetries of the action, as well as being symmetries of the equation of motion and commutation relations. Symmetries of the action 1 Here, and in the following, we will consider the special case of a contact interaction. Everything that we say can easily be generalized to more complicated interactions. 42 Chapter 3. Classical field theory and the action principle are often easier to identify than symmetries of the field equations. In addition, the existence of the action and the action principle gives us a bridge between symmetries and conservation laws in the form of Noether’s theorem. This theorem states that, if a mechanical system has a continuous symmetry, and if its equation of motion is derived from an action, then the theory has a conserved charge that associated with that symmetry. In the following, after introducing the action and the action principle, we shall give two alternative proofs of Noether’s theorem. Finally, as we shall see in later chapters, the action is an important ingredient of the functional integral formulation of quantum field theory. 3.1.1 The Action Consider the classical field theory, that is a dynamical theory of classical fields which we shall †σ denote by ψσ (~x,t) and ψ (~x,t). In spite of the notation, where take the widely used convention that uses the same symbols, these classical fields are not operators as they were in the quantum field theory that we have formulated in the preceding chapters. Here, they are simply functions, smooth differentiable mappings of space and time (t,~x) onto the complex numbers where ψσ (~x,t) and ψ†σ (~x,t) take their values. Generally, these functions that must obey some appropriate boundary conditions (for example, that they fall of fast enough at infinity that their Fourier transform exists) but they are not necessarily solutions of the equations of motion, for the moment they are just †σ arbitrary functions. The dagger symbol, which distinguishes ψ (~x,t) from ψσ (~x,t), and which we also take from the quantum field theory, in this case simply means complex conjugation of the complex-valued classical fields. Strictly speaking, what we have said in the paragraph above applies only to Bosons. If our theory, once it is eventually quantized, will describe Fermions rather than Bosons, they fields †σ ψσ (~x,t) and ψ (~x,t) are slightly more complicated objects, in that they anti-commute with each other.2 They are still not operators in the sense that the Fermion quantum fields are operators. Rather than the anti-commutators of Fermion field operators which we have studied in previous sections, they simply have anti-commutators where the right-hand-sides always vanish, which we can summarize as †ρ 0 †ρ 0 ψσ (~x,t)ψ (~y,t ) + ψ (~y,t )ψσ (~x,t) = 0 0 0 ψσ (~x,t)ψρ (~y,t ) + ψρ (~y,t )ψσ (~x,t) = 0 ψ†σ (~x,t)ψ†ρ (~y,t0) + ψ†ρ (~y,t0)ψ†σ (~x,t) = 0 Clearly, the objects which obey the above rules are not ordinary functions. This means that, in our view, Fermions are never really described by classical fields. The closest that we can come are these anti-commuting functions. The detailed study of what these object are is an interesting one. However, we shall not require much of the details, other than a few simple rules which will allow us to use them for a few specific things. We will develop those rules as we need them. The anti-commutators given above are the first such rules. The second such rule concerns complex conjugation, where, when we conjugate a product of anti-commuting fields, we also reverse the order so that, for example, †σ 0 † †ρ 0 ψ (~y,t )ψρ (~x,t) = ψ (~x,t)ψσ (~y,t ) In spite of the fact that they are not really classical in the case of Fermions, we will often use the term “classical fields” when referring to either Bosons and Fermions. 2 At this point, we do not expect that it is obvious to the reader why this is needed. For now, we can say that it will simplify our work with Fermions later on. 3.1 The Action Principle 43 The action is given by the integral over the space and time coordinates of a Lagrangian density, Z Z †σ 3 ∂ S[ψσ ,ψ ] = dt d x L (ψ, ∂t ψ,∇ψ) (3.2) The Lagrangian density is a function of the classical fields and their derivatives. In our case, ih¯ ∂ ih¯ ∂ L (ψ, ∂ ψ,∇ψ) = ψ†σ (~x,t) ψ (~x,t) − ψ†σ (~x,t)ψ (~x,t) ∂t 2 ∂t σ 2 ∂t σ h¯ 2 − ~∇ψ†σ (~x,t) ·~∇ψ (~x,t) + µψ†σ (~x,t)ψ (~x,t) 2m σ σ λ 2 − ψ†σ (~x,t)ψ (~x,t) (3.3) 2 σ We will argue that, beginning with this classical field theory and applying rules for finding the equation of motion and then the rules of quantization in a straightforward way, we arrive at the non-relativistic quantum field theory that we have been discussing. †σ The action, S[ψσ ,ψ ] in equation (3.2) is the integral over time and space coordinates of the Lagrangian density, L , given in equation (3.3). The action is a functional. A functional is a mathematical object which maps functions onto numbers. Here, we imagine that we have classical †σ functions of the space and time coordinates, ψσ (~x,t) and ψ (~x,t) which we insert, together with their derivatives, into the expression (3.3) to form the Lagrangian density. We then insert the Lagrangian density into the integrand in equation (3.2) and perform the integral. The result is a number, the action has mapped the classical functions onto a number, in this case a real number.3 3.1.2 The action principle and the Euler-Lagrange equations The action principal states that The Action Principle: The action functional is stationary when it is evaluated on the field configurations which obey the classical equations of motion and the appropriate boundary conditions. Let us use this statement to find the equations of motion which correspond to a given action. For this, we need a way to decide when the action functional is stationary. Consider two classical †σ field configurations which differ by an infinitesimal function, the classical fields ψσ (~x,t),ψ (~x,t) †σ †σ †σ and the classical fields ψσ (~x,t)+δψσ (~x,t),ψ (~x,t)+δψ (~x,t) where δψσ (~x,t),δψ (~x,t) are functions of infinitesimal magnitude and arbitrary profile. The fields ψ†σ (~x,t) and δψ†σ (~x,t) are the complex conjugates of ψσ (~x,t) and δψσ (~x,t). It is still useful to treat them as independent †σ fields. The action evaluated on the first configuration is S[ψσ ,ψ ] and the action evaluated on †σ †σ the second configuration is S[ψσ + δψσ ,ψ + δψ ]. It is clear that these must differ by an infinitesimal amount. The action is stationary if the difference †σ †σ †σ δS ≡ S[ψσ + δψσ ,ψ + δψ ] − S[ψσ ,ψ ] 3 For Fermions, the situation is a little more complicated as the functional of anti-commuting functions is not just a number but is itself an algebraic entity. Again, we will not take the time to define it here, as only some operational aspects of dealing with anti-commuting functions will be needed in the following. 44 Chapter 3. Classical field theory and the action principle †σ vanishes to linear order in the infinitesimal functions δψσ and δψ . To linear order, Z Z δS = dt d3x· " ∂L ∂ ∂L ∂L δψ (~x,t) + δ ψ (~x,t) + δ(∇ ψ (~x,t)) σ σ ∂ a σ ∂ψ (~x,t) ∂t ∂(∇aψ (~x,t)) σ ∂( ∂t ψσ (~x,t)) σ # ∂L ∂ ∂L ∂L +δψ†σ (~x,t) + δ ψ†σ (~x,t) + δ(∇ ψ†σ (~x,t)) †σ ∂ a †σ ∂ψ (~x,t) ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) (3.4) In the equation above, we have assumed that the Lagrangian density L (x) depends on the fields and their first derivatives, that is, on the variables ∂ ∂ ψ (~x,t), ψ (~x,t),∇ ψ (~x,t),ψ†ρ (~x,t), ψ†ρ (~x,t),∇ ψ†ρ (~x,t) σ ∂t σ a σ ∂t a but otherwise it is quite general. The idea here is that, if we fix the space and time coordinates ∂ to a specific value, we must treat each of ψσ (~x,t), ∂t ψσ (~x,t) and ∇aψσ (~x,t) and their complex †σ ∂ †σ †σ conjugates ψ (~x,t), ∂t ψ (~x,t) and ∇aψ (~x,t) as independent variables. For fixed ~x and t, ∂ †σ and each value of σ, the partial derivatives by each of ψσ (~x,t), ∂t ψσ (~x,t), ∇aψσ (~x,t), ψ (~x,t), ∂ †σ †σ 4 ∂t ψ (~x,t) and ∇aψ (~x,t) are taken while holding all of the other variables fixed. The variation of the derivatives of the functions are defined as the derivatives of the variations, so that ∂ ∂ ∂ ∂ δ ψ (~x,t) ≡ δψ (~x,t) , δ ψ†ρ (~x,t) ≡ δψ†ρ (~x,t) ∂t σ ∂t σ ∂t ∂t †ρ †ρ δ(∇aψσ (~x,t)) ≡ ∇a(δψσ (~x,t)) , δ(∇aψ (~x,t)) ≡ ∇a(δψ (~x,t)) Then, using the product rule ! ∂ ∂L δψ (~x,t) ∂t σ ∂ ∂( ∂t ψσ (~x,t)) ! ∂ ∂L ∂ ∂L = δψ (~x,t) + δψ (~x,t) ∂t σ ∂ σ ∂t ∂ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ(~x,t)) 4 ∂ †σ ∂ †σ In the case of Fermions, for fixed ~x and t, we must treat ψσ (~x,t), ∂t ψσ (~x,t), ∇aψσ (~x,t), ψ (~x,t), ∂t ψ (~x,t) †σ and ∇aψ (~x,t) as independent anti-commuting numbers. In addition, derivatives by anti-commuting numbers must also be anti-commuting entities. For example, ∂ ∂ ∂ ∂ ∂ ∂ 0 0 = 0 , = − , etc. ∂ψσ (~x,t) ∂ψσ (~x ,t ) ∂ψσ (~x,t) ∂ ∂ ∂ψσ (~x,t) ∂( ∂t ψρ (~x,t)) ∂( ∂t ψρ (~x,t)) Moreover, variables and derivatives by the variables also anti-commute with each other. For example, ∂ † σ † ∂ † ψρ (~x,t) f (ψ,ψ ) = δρ f (ψ,ψ ) − ψρ (~x,t) f (ψ,ψ ) ∂ψσ (~x,t) ∂ψσ (~x,t) ∂ ∂ ψ†ρ (~x,t) f (ψ,ψ†) = −ψ†ρ (~x,t) f (ψ,ψ†) ∂ψσ (~x,t) ∂ψσ (~x,t) These rules should be sufficient for defining the variation of the action in the case of Fermions. 3.1 The Action Principle 45 we rewrite the expression for the variation of the action in equation (3.4) as Z Z δS = dt d3x· ( " # ∂L ∂ ∂L ∂L δψ (~x,t) − − ∇ σ ∂ a ∂ψ (~x,t) ∂t ∂(∇aψ (~x,t)) σ ∂( ∂t ψσ (~x,t)) σ ! ∂ ∂L ∂L + δψ (~x,t) + ∇ δψ (~x,t) σ ∂ a σ ∂t ∂(∇aψ (~x,t)) ∂( ∂t ψσ (~x,t)) σ " # ∂L ∂ ∂L ∂L +δψ†σ (~x,t) − − ∇ †σ ∂ a †σ ∂ψ (~x,t) ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) ! ) ∂ ∂L ∂L + δψ†σ (~x,t) + ∇ δψ†σ (~x,t) (3.5) ∂ a †σ ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) We shall call the right-hand-side of equation (3.5) the variation of the action. Gauss’ theorem, applied in four-dimensional space-time, can be used to rewrite the last two terms in each line of (3.5) as surface integrals. Each of these terms are the four-dimensional volume integral of a total divergence, for example, from the first line of (3.5), ! ∂ ∂L ∂L δψ (~x,t) + ∇ δψ (~x,t) σ ∂ a σ ∂t ∂(∇aψ (~x,t)) ∂( ∂t ψσ (~x,t) σ is such a four-divergence. Gauss’ theorem allows us to rewrite its space-time volume integral as a surface integral at the boundaries of space and time. We shall assume that the boundary †σ †σ conditions for the functions ψσ (~x,t), ψ (~x,t)r, δψσ (~x,t) and δψ (~x,t) are such that the surface terms that are generated in this way all vanish. These are normally taken either as Dirichlet boundary conditions where the value of the field ψσ (~x,t) is fixed at large |~x| so that δψσ (~x,t) must vanish there or the Neumann boundary condition where δψσ (~x,t) is allowed to be nonzero but the xa ∂L component of the derivatives of the fields normal to the boundary, †σ , must go to zero at |~x| ∂∇aψ (~x,t) the boundary. There is also a boundary condition associated with the boundaries of the time integral, †σ †σ δψ (~x,t)ψσ (~x,t) = 0 , ψ (~x,t)δψσ (~x,t) = 0 t=ti,t f t=ti,t f where ti and t f are initial and final times (which we will usually take to be minus and plus infinity, respectively). Then, assuming that these boundary conditions are obeyed, we can drop the total divergence terms from the variation of the action to get Z Z δS = dt d3x· ( " # ∂L ∂ ∂L ∂L δψ (~x,t) − − ∇ σ ∂ a ∂ψ (~x,t) ∂t ∂(∇aψ (~x,t)) σ ∂( ∂t ψσ (~x,t)) σ " #) ∂L ∂ ∂L ∂L +δψ†σ (~x,t) − − ∇ (3.6) †σ ∂ a †σ ∂ψ (~x,t) ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) †σ We are interested in specific elements of the set of all possible functions ψσ (~x,t) and ψ (~x,t) †σ where the action S[ψσ ,ψ ] is stationary. The action is stationary when the terms linear in the variations, which we have found in equation (3.6), vanish. This must be so for any profile of the †σ functions δψσ (~x,t) and δψ (~x,t). This requires that the coefficients of these functions under the 46 Chapter 3. Classical field theory and the action principle integrations in equation (3.6) must vanish. This gives us a set of differential equations which the classical field must obey, that is, the classical field equations (3.7) and (3.8) below. They are called the Euler-Lagrange equations Euler-Lagrange Equations ∂L ∂ ∂L ∂L − − ∇ = 0 (3.7) ∂ a ∂ψ (~x,t) ∂t ∂(∇aψ (~x,t)) σ ∂( ∂t ψσ (~x,t)) σ ∂L ∂ ∂L ∂L − − ∇ = 0 (3.8) †σ ∂ a †σ ∂ψ (~x,t) ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) Application of the Euler-Lagrange equations (3.7) and (3.8) to the action (3.2) yields equations for the classical fields which is identical to the one for the quantum fields in equation (3.1) plus an equation which is its complex conjugate. This gives the classical field equations. Modulo the ordering of operators in the interaction term, which is arbitrary in the classical equation (remembering minus signs for the case of Fermions where the classical functions anti-commute) but of course is important in the equation for quantum mechanical operators, the classical field equation is identical to the field equation of the quantum field theory. We will find ways to deal with the operator ordering ambiguity later, when we discuss specific computations. As well as the field equation, there are boundary conditions, which must be compatible with the boundary conditions which were used to eliminate boundary terms that were encountered when finding the linear variation of the action. Euler-Lagrange equations Beginning with the action functional Z Z S = dt d3xL where L is a function of the variables ∂ ∂ ψ (~x,t), ψ (~x,t),∇ ψ (~x,t),ψ†ρ (~x,t), ψ†ρ (~x,t),∇ ψ†ρ (~x,t) σ ∂t σ a σ ∂t a the equations of motion resulting from the action principle are the Euler-Lagrange equations: ∂L ∂ ∂L ∂L − − ∇ = 0 ∂ a ∂ψ (~x,t) ∂t ∂(∇aψ (~x,t)) σ ∂( ∂t ψσ (~x,t)) σ ∂L ∂ ∂L ∂L − − ∇ = 0 †σ ∂ a †σ ∂ψ (~x,t) ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) with the appropriate boundary conditions 3.1 The Action Principle 47 3.1.3 Canonical momenta, Poisson brackets and Commutation relations The other data that we need in order to define the quantum field theory are the canonical commuta- tion relations. The form that these must take are also encoded in the action. In this non-relativistic field theory, the Lagrangian density is linear in the time derivative of the field. For this reason, it is easiest to think of the Lagrangian density as being a function on the phase space of the mechanical system, that is, it is a function of the generalized coordinates and momenta, rather than generalized coordinates and velocities. The analog in classical mechanics, where qi are the generalized coordi- nates and pi are the canonical momenta, and the set of all values of the generalized coordinates and momenta together comprise phase space, is the classical action on phase space Z d S = dtL(q(t), p(t)) , L = p (t) q (t) − H(q(t), p(t)) i dt i L is the Lagrangian and the phase space function H(q, p) is the Hamiltonian. The momenta and coordinates have the Poisson bracket qi, p j = δi j , qi,q j = 0 , pi, p j = 0 ∂ which can be read from the first, linear in time derivatives term in the Lagrangian, L = pi(t) δi j ∂t q j(t)+ .... The classical field theory Lagrangian density (3.3) has a form analogous to this plus a total derivative, ∂ ∂ ih¯ L = ih¯ψ†σ (~x,t) ψ (~x,t) − H (ψ,ψ†) − ψ†σ (~x,t)ψ (~x,t) ∂t σ ∂t 2 σ and the total time derivative can be removed by a canonical transformation.5 Then, we would identify the generalized coordinate as the field ψσ (~x,t) and the canonical momenta as being equal to its coefficient in the Lagrangian density, ih¯ψ†σ (~x,t). The Poisson brackets for the classical field theory are then ψ (~x,t),ihψ†ρ (~y,t) = δ ρ δ(~x −~y) , σ ¯ PB σ ψ (~x,t),ihψ (~y,t) = 0 , ψ†σ (~x,t),ihψ†ρ (~y,t) = 0 σ ¯ ρ PB ¯ PB and, when we quantize, we identify the commutator bracket with ih¯ times the Poisson bracket. This tells us that the commutator in the case of Bosons, or anti-commutator in the case of Fermions, in the field theory should be †ρ ρ 3 ψσ (~x,t),ψ (~y,t) = δσ δ (~x −~y) †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0 , ψ (~x,t),ψ (~y,t) = 0 This indeed matches the commutation or anti-commutation relation given in the field theory (??). In addition to the field equation and commutation relations, we learn that the Hamiltonian is given by Z h¯ 2 H = d3x H (ψ,ψ†) ~∇ψ†σ (~x,t) ·~∇ψ (~x,t) − µψ†σ (~x,t)ψ (~x,t) 2m σ σ λ 2 . + ψ†σ (~x,t)ψ (~x,t) (3.9) 2 σ which, modulo operator ordering, agrees with the expression for Hamiltonian which we derived earlier. 5Even with the total derivative, an equivalent result can be found by analyzing the Lagrangian system as a constrained ρ† system where, when the constraints are properly resolved, we would obtain the same bracket for ψσ (~x,t) and ψ (~x,t). One easy way to see this is to remember that a total time derivative term in a classical action can be removed by a canonical transformation and that the Poisson bracket is left unchanged by canonical transformations. 48 Chapter 3. Classical field theory and the action principle 3.2 Noether’s theorem In the last section, we showed how the essential information which appears in the field equations and the commutation relations, if we take those as defining the quantum field theory, is also encoded in the classical action and the action principle. In this section we will show how, if the field equations can be derived from an action via the action principle, symmetries of the theory lead to conservation laws. By a conservation law, we mean an equation of continuity for a charge density and a current density ∂ R(~x,t) +~∇ · J~ (~x,t) = 0 ∂t where R(~x,t) is the charge density and J~ (~x,t) is the current density. The integral over space of the charge density, Z Q = d3xR(~x,t) defines a charge. The time derivative of the total charge by the time is given by d Z ∂ Z I Q = d3x R(~x,t) = − d3x~∇ · J~ (~x,t) = − d~a ·~∇ · J~ (~x,t) = 0 (3.10) dt ∂t where the latter integral is a surface integral of the normal component of the current density over the boundaries of the space. This formula is a statement of charge conservation. It says that the time rate of change of the total charge is equal to the total flux of the current through the boundaries of the system. We will normally use boundary conditions such that the current densities that we will consider go to zero sufficiently rapidly at spatial infinity that the surface integral vanishes and the total charge is therefore time independent. The symmetries which we shall study are those for which there exists the notion of an infinitesi- mal transformation. An example is translation invariance. A transformation is a change in the space coordinate,~x →~x˜ =~x +~c where~c is a vector whose components are constants. An infinitesimal transformation has a “parameter”,~c, a vector of infinitesimal magnitude. We define symmetry as follows. For our purposes, a symmetry is a particular transformation of the dynamical variables. A transformation of the dynamical variables is a replacement of the †ρ †ρ variables ψσ (~x,t) and ψ (~x,t) by new variables ψ˜σ (~x,t) and ψ˜ (~x,t) wherever they appear on the field equations or, equivalently, wherever they appear in the action. Generally, the new variables †ρ †ρ ψ˜σ (~x,t) and ψ˜ (~x,t) are functions of the old variables, ψσ (~x,t) and ψ (~x,t) as well as their derivatives, space-time coordinates and other parameters. For an infinitesimal symmetry, we will consider an infinitesimal change of variables ψσ (~x,t) → ψ˜σ (~x,t) = ψσ (~x,t) + δψσ (~x,t) ψ†ρ (~x,t) → ψ˜ †ρ (~x,t) = ψ†ρ (~x,t) + δψ†ρ (~x,t) , †ρ where δψσ (~x,t) and δψ (~x,t) have infinitesimal magnitude. For our present purposes, a symmetry is a particular infinitesimal transformation of the dynamical variables such that, without use of the equations of motion, we can show that the linear variation of the Lagrangian density is by terms which can be assembled into partial derivatives by the space and time coordinates: ∂ δL = R(~x,t) +~∇ · J~(~x,t) (3.11) ∂t That is, we have identified a symmetry if, by examining the linear variation of the Lagrangian density, we can show that it can be written in the form (3.11) for some R and J~ which depend on the fields, their derivatives and perhaps the space and time coordinates. 3.2 Noether’s theorem 49 3.2.1 Examples of symmetries Consider the Lagrangian density in equation (3.3), which we copy here for the reader’s convenience, ∂ ih¯ ∂ ih¯ ∂ h¯ 2 L (ψ, ψ,∇ψ) = ψ†σ (~x,t) ψ (~x,t) − ψ†σ (~x,t)ψ (~x,t) − ~∇ψ†σ (~x,t) ·~∇ψ (~x,t) ∂t 2 ∂t σ 2 ∂t σ 2m σ λ 2 +µψ†σ (~x,t)ψ (~x,t) − ψ†σ (~x,t)ψ (~x,t) σ 2 σ Phase symmetry We can see by inspection that the Lagrangian density written above is unchanged if we make the substitution −iθ ψσ (~x,t) → ψ˜σ (~x,t) = e ψσ (~x,t) ψ†σ (~x,t) → ψ˜ †σ (~x,t) = eiθ ψ†σ (~x,t) The infinitesimal transformation is †ρ †ρ δψσ (~x,t) = −iψσ (~x,t) , δψ (~x,t) = iψ (~x,t) where we have dropped the factor of θ on the right-hand-sides. Under this transformation, δL = 0 so the above transformation is a symmetry. In this simple case, R = 0 and J~ = 0. Space and time-translation invariance A second example is the case of a time translation and a space translation where ψσ (~x,t) → ψ˜σ (~x,t) = ψσ (~x +~ε,t + ε) ψ†σ (~x,t) → ψ˜ †σ (~x,t) = ψ†σ (~x +~ε,t + ε) The infinitesimal transformations are obtained by Taylor expansion ∂ ψ˜ (~x,t) = ψ (~x +~ε,t + ε) = ψ˜ (~x,t) +~ε ·~∇ψ (~x,t) + ε ψ (~x,t) + ... σ σ σ σ ∂t σ ∂ ψ˜ †σ (~x,t) = ψ†σ (~x,t + ε) = ψ˜ †σ (~x,t) +~ε ·~∇ψ†σ (~x,t) + ε ψ†σ (~x,t) + ... ∂t so that ∂ ∂ δψ (~x,t) = ~ε ·~∇ + ε ψ (~x,t) , δψ†ρ (~x,t) = ~ε ·~∇ + ε ψ†ρ (~x,t) σ ∂t σ ∂t Inspection of the change in the Lagrangian density yields ∂ δL = ~ε ·~∇ + ε L ∂t and the time translation is a symmetry. In this case, R = εL and Ja = εaL . 50 Chapter 3. Classical field theory and the action principle 3.2.2 Proof of Noether’s Theorem We have defined a symmetry as a transformation of the fields under which the transformation of the Lagrangian density can be written in the form given in equation (3.11). This was assumed to be possible with use of algebra, but without the benefit of the Euler-Lagrange equations of motion. Now we shall assume that, in addition to this, the Euler-Lagrange equations are satisfied by the classical fields. We begin with the variation of the Lagrangian density which we found in equation (??) as6 ∂L ∂ ∂L ∂L δ ≡ δψ (~x,t) + δ ψ (~x,t) + δ(∇ ψ (~x,t)) L σ σ ∂ a σ ∂ψ (~x,t) ∂t ∂(∇aψ (~x,t)) σ ∂( ∂t ψσ (~x,t)) σ ∂L ∂ ∂L ∂L +δψ†σ (~x,t) + δ ψ†σ (~x,t) + δ(∇ ψ†σ (~x,t)) †σ ∂ a †σ ∂ψ (~x,t) ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) We can reorganize this expression to put it into the form where we can use the Euler-Lagrange equations. We get ( ) ∂L ∂ ∂L ∂L δ = δψ (~x,t) − − ∇ L σ ∂ a ∂ψ (~x,t) ∂t ∂(∇aψ (~x,t)) σ ∂( ∂t ψσ (~x,t)) σ ( ) ∂L ∂ ∂L ∂L + δψ†σ (~x,t) − − ∇ †σ ∂ a †σ ∂ψ (~x,t) ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) " !# ∂ ∂L †σ ∂L + δψσ (~x,t) + δψ (~x,t) ∂t ∂ ∂ †σ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂L ∂L + (~x,t) + †σ (~x,t) ∇a δψσ δψ †σ ∂(∇aψσ (~x,t)) ∂(∇aψ (~x,t)) Now, we use the Euler-Lagrange equations to set the first two lines to zero. We obtain " !# ∂ ∂L †σ ∂L δL = δψσ (~x,t) + δψ (~x,t) ∂t ∂ ∂ †σ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂L ∂L + (~x,t) + †σ (~x,t) ∇a δψσ δψ †σ (3.12) ∂(∇aψσ (~x,t)) ∂(∇aψ (~x,t)) We have shown that, when the equations of motions are used after the variation of the Lagrangian density, any variation of the Lagrangian density is given by total derivatives. Moreover, in equation (3.12) we know the total derivatives are for a given Lagrangian density. Then, we can equate the two different expressions that we have found for the variation of the Lagrangian density which we have found, the one in equation (3.12) and the one in equation (3.11), " # ∂ ∂ ∂L R +~∇ · J~ = δψ (~x,t) ∂t ∂t σ ∂ ∂( ∂t ψσ (~x,t)) ∂L ∂L + (~x,t) + †σ (~x,t) ∇a δψσ δψ †σ (3.13) ∂(∇aψσ (~x,t)) ∂(∇aψ (~x,t)) By combining the terms, we obtain the equation of continuity ∂ R(~x,t) +~∇ · J~ (~x,t) = 0 (3.14) ∂t 6We remind the reader that, in all cases, the variation of the derivative of a function is equal to the derivative of the ∂ ∂ variation, for example, δ ∂t ψ = ∂t (δψ), δ(∇aψ) = ∇a (δψ). 3.3 Phase symmetry and the conservation of particle number 51 where the charge and current densities are given by the expressions ∂L †ρ ∂L R(~x,t) = δψσ (~x,t) + δψ (~x,t) − R(~x,t) ∂ ∂ †ρ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂L ∂L a(~x,t) = (~x,t) + †ρ (~x,t) − Ja(~x,t) J δψσ δψ †ρ ∂(∇aψσ (~x,t)) ∂(∇aψ (~x,t)) This is Noether’s theorem. In summary, Noether’s theorem tells us that, given a Lagrangian density L which is a function †σ of the variables ψσ (~x,t) and ψ (~x,t) and their first derivatives by time and space coordinates, Symmetry and Noether0s Theorem. †ρ †ρ †ρ Under ψσ (~x,t) → ψσ (~x,t) + δψσ (~x,t), ψ (~x,t) → ψ (~x,t) + δψ (~x,t), ∂ whenever, without equations of motion, δL = R(~x,t) +~∇ · J~(~x,t) (3.15a) ∂t ∂ R(~x,t) +~∇ · J~ (~x,t) = 0 where (3.15b) ∂t ∂L †ρ ∂L R(~x,t) = δψσ (~x,t) + δψ (~x,t) − R(~x,t) (3.15c) ∂ ∂ †ρ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂L ∂L a(~x,t) = (~x,t) + †ρ (~x,t) − Ja(~x,t) J δψσ δψ †ρ (3.15d) ∂(∇aψσ (~x,t)) ∂(∇aψ (~x,t)) The charge density R(~x,t) defined in equation (3.15c) and the current density Ja(~x,t) defined in equation (3.15d) are the conserved Noether current (R(~x,t),Ja(~x,t)). In either classical or quantum field theory, a charge density and current density which obey a continuity equation such as the one above is called a conserved current. Here, we have assumed that the fields are classical. However, like the field equations, which we assume to still hold when the classical fields are replaced by quantum field theory operators, we will generally assume that the same conservation law holds at both the classical and quantum levels. Of course, the conservation law is a consequence of the field equations, so if the quantum fields obey the fields equations, the conservations laws that are constructed from them should also hold. However, of course, there are issues in the quantum field theory, such as operator ordering and the singularities which we shall encounter when we consider products of operators evaluated at the same space-time point which can conspire to ruin a conservation law. The conservation of currents in the quantum field theory should therefore always be checked with some care as there are known cases where it fails. The existence of the conserved current R(~x,t),J~ (~x,t) as a consequence of symmetry is the content of Noether’s theorem. We have derived it in the context of our non-relativistic quantum field theory. However, it, or straightforward generalizations of it, are valid for any field theory where the field equations can be obtained from an action by a variational principle. 3.3 Phase symmetry and the conservation of particle number Now, let us consider the Lagrangian density (3.3)and the infinitesimal phase transformation †ρ †ρ δψ = iθψσ (~x,t) , δψ (~x,t) = −iθψ (~x,t) (3.16) which we have already identified as a symmetry, in this case, we found that δL = 0. The quantities R and J~ which we would use to construct the Noether current are both zero in this case, R = 0, J~ = 0. Then Noether’s theorem tells us that the charge density is ∂L †σ ∂L ρ(~x,t) = iθψσ (~x,t) − iθψ (~x,t) ∂ ∂ †σ ∂( ∂t ψ(~x,t)) ∂( ∂t ψ (~x,t) 52 Chapter 3. Classical field theory and the action principle †σ = −h¯θψ (~x,t)ψσ (~x,t) and the current density is ∂L ∂L ~ (~x,t) = i (~x,t) − i †σ (~x,t) J θψσ θψ †σ ∂(∇aψσ (~x,t)) ∂(∇aψ (~x,t)) ih¯ ←− −→ = −h¯θ ψ†σ (~x,t)( ∇ − ∇ )ψ (~x,t) 2m σ It is convenient to remove the factor of −h¯θ, it is a constant, and if a charge and current density obey the continuity equation, so do that charge and current density each multiplied by the same common constant. When we do this, find the conserved current (for which we use the same notation) i ∂L i †σ ∂L ρ(~x,t) = − ψσ (~x,t) + ψ (~x,t) (3.17a) h¯ ∂ h¯ ∂ †σ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) i ∂L i ∂L (~x,t) = − (~x,t) + †σ (~x,t) Ja ψσ a ψ a †σ (3.17b) h¯ ∂(∇ ψσ (~x,t)) h¯ ∂(∇ ψ (~x,t)) These are quite general, in that the above equation applies to any Lagrangian density which has a phase symmetry (and where the Lagrangian density depends on the fields and their first derivatives only). By Noether’s theorem, the charge and current densities are guaranteed to obey the conservation law ∂ ρ(~x,t) +~∇ ·~J(~x,t) = 0 (3.18a) ∂t For the Lagrangian density (3.3) which we have been discussing the charge and current densities are †σ ρ(~x,t) = ψ (~x,t)ψσ (~x,t) (3.19a) ih¯ ih¯ ~J(~x,t) = − ψ†σ (~x,t)∇ψ (~x,t) + ∇ψ†σ (~x,t)ψ (~x,t) (3.19b) 2m σ 2m σ The Noether charge is given by Z Z 3 3 †σ d xρ(~x,t) = d xψ (~x,t)ψσ (~x,t) ≡ N which is just the particle number which we have been using in previous chapters. Its derivative by time is Z Z I d 3 ∂ †σ 3 N = d x ψ (~x,t)ψ (~x,t) = − d x~∇ ·~J(~x,t) = − d~s ·~J(~x,t) dt ∂t σ where, in the last expression, we have used Gauss’ theorem to write the integral of the divergence of a vector field as the integral of the normal component of the vector on the boundary of the system. The last integral is interpreted as the negative of the total flux of particles leaving the system through the sphere at the infinite boundaries of space. It is equal to the time rate of change of the total particle number, as it should be for a conserved current. If that final surface integral vanishes the particle number is time-independent. The boundary condition that guarantees that this integral vanishes is the same boundary condition that would make the laplacian −~∇2 operating on the wave- function a hermitian differential operator. These are the boundary conditions which are normally imposed in a quantum mechanical system. Thus, we can expect that the boundary conditions result in conservation of the particle number. We have thus found that the time-independence of the particle number is a consequence of the symmetry of the theory under changes of phase of the field operator. 3.4 Translation invariance 53 3.4 Translation invariance The quantum field theory that we have been discussion has a symmetry under constant translations of the space and time coordinates. Under a space and time translation, †σ †σ †σ ψσ (~x,t) → ψ˜σ (~x,t) = ψσ (~x +~ε,t + ε) , ψ (~x,t) → ψ˜ (~x,t) = ψ (~x +~ε,t + ε) The infinitesimal transformations are gotten by taking the leading order in a Taylor expansion in the parameters~ε and ε, ∂ ∂ δψ (~x,~t) = ~ε · ∇ + ε ψ (~x,t) , δψ†ρ (~x,~t) = ~ε · ∇ + ε ψ†ρ (~x,t) σ ∂t σ ∂t and we can use some simple algebra to show that the variation of the Lagrangian density is the combination of derivatives ∂ δL = ~∇ · (~εL ) + (εL ) ∂t This qualifies the transformation as a symmetry where we identify R = εL and J~ =~εL . The Noether current then has charge density ih¯ †σ ~ ∂ ih¯ ~ ∂ †σ Tt = ψ (~x,t) ~ε · ∇ + ε ψ (~x,t − ~ε · ∇ + ε ψ (~x,t)ψ (~x,t)) − εL 2 ∂t σ 2 ∂t σ ih¯ ih¯ =~ε · ψ†σ (~x,t)~∇ψ (~x,t) − ~∇ψ†σ (~x,t)ψ (~x,t) 2 σ 2 σ 2 h¯ λ 2 + ε ~∇ψ†σ (~x,t) ·~∇ψ (~x,t) − µψ†σ (~x,t)ψ (~x,t) + ψ†σ (~x,t)ψ (~x,t) 2m σ σ 2 σ (3.20) and the current density is h¯ 2 ∂ ∂ T = − ∇ ψ†σ (~x,t) ~ε ·~∇ + ε ψ (~x,t) + ~ε ·~∇ + ε ψ†σ (~x,t)~∇ ψ (~x,t) a 2m a ∂t σ ∂t a σ − εaL h¯ 2 ∂ ∂ =ε − ∇ ψ†σ (~x,t) ψ (~x,t) + ψ†σ (~x,t)∇ ψ(~x,t) 2m a ∂t σ ∂t a h¯ 2 + ε − ∇ ψ†σ (~x,t)∇ ψ (~x,t) − ∇ ψ†σ (~x,t)∇ ψ (~x,t) b 2m a b σ b a σ −δabL } (3.21) From this Noether charge density (3.20) and current density (3.21), we identify the two-index object, called the energy-momentum tensor, which has components ∂ ∂L ∂ †σ ∂L Ttt = ψσ (~x,t) + ψ (~x,t) − L (3.22a) ∂t ∂ ∂t ∂ †σ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂L †σ ∂L Ttb = ∇bψσ (~x,t) + ∇bψ (~x,t) (3.22b) ∂ ∂ †σ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂ ∂L ∂ ∂L = (~x,t) + †σ (~x,t) Tat ψσ a ψ a †σ (3.22c) ∂t ∂(∇ ψσ (~x,t)) ∂t ∇ ψ (~x,t)) ∂L ∂L = (~x,t) + †σ (~x,t) − Tab ∇bψσ a ∇bψ a †σ δabL (3.22d) ∂(∇ ψσ (~x,t)) ∂(∇ ψ (~x,t)) 54 Chapter 3. Classical field theory and the action principle We emphasize that the above equations are general and the continuity equations for components of the energy-momentum tensor ∂ T + ∇aT = 0 (3.23a) ∂t tt at ∂ T + ∇aT = 0. (3.23b) ∂t tb ab hold, whatever the Lagrangian density is, as long as it is space and time translation invariant. Explicitly, for the quantum field theory with the Lagrangian density (3.3), the energy-momentum tensor is given by h¯ 2 T = ~∇ψ†σ (~x,t) ·~∇ψ (~x,t) − µψ†σ (~x,t)ψ (~x,t). tt 2m σ σ λ 2 + ψ†σ (~x,t)ψ (~x,t) (3.24a) 2 σ ih¯ ih¯ T = − ∇ ψ†σ (~x,t)ψ (~x,t) + ψ†σ (~x,t)∇ ψ (~x,t) (3.24b) tb 2 b σ 2 b σ h¯ 2 ∂ ∂ .T = − ∇ ψ†σ (~x,t) ψ (~x,t) + ψ†σ (~x,t)∇ ψ(~x,t) (3.24c) at 2m a ∂t σ ∂t a h¯ 2 T = − ∇ ψ†σ (~x,t)∇ ψ (~x,t) + ∇ ψ†σ (~x,t)∇ ψ (~x,t) ab 2m a b σ b a σ 2 h¯ λ 2 + δ ~∇2 ψσ†(~x,t)ψ (~x,t) − ψσ†(~x,t)ψ (~x,t) (3.24d) ab 4m σ 2 σ where, we we have used the field equation to eliminate the time derivative terms in the Lagrangian density in order to obtain equation (3.24d) for Tab. When we eliminate these time derivatives using the equation of motion, the Lagrangian density is 2 h¯ 2 σ† λ σ† 2 L = − ~∇ ψ (~x,t)ψ (~x,t) + ψ (~x,t)ψ (~x,t) 4m σ 2 σ which is reflected in the last term in equation (3.24d). However, for economy of notation, we have not eliminated the time derivatives of the fields in Tat , as we shall not use its explicit form in what follows. If that form is needed, we would also have to use the field equation in the expression for Tat . 3.5 Galilean symmetry In classical mechanics, Newton’s second law of motion for a free particle, d2 m ~x(t) = 0 (3.25) dt2 is invariant under replacing ~x by ~x +~vt where ~v is a constant (~x and t independent) vector. The symmetry of the equation of motion under this replacement is a form of non-relativistic relativity called Galilean symmetry. It tells us that the laws of physics hold equally well in either of two different reference frames, the original one, with space coordinates~x and time coordinate t˜ and a new one with space coordinates~x˜ and time coordinate t where the reference frames are related by the coordinate transformations ~x˜ =~x +~vt, t˜ = t 3.5 Galilean symmetry 55 Now, consider a system of particles which interact with each other in such a way that their equations of motion are 2 d ~ m 2~xi(t) = −∑∇iV(~xi −~x j) dt j where~xi with i = 1,..,N labels the positions of the particles. Here, we see that the N equations of motion are also invariant under Galilean transformation of the particle positions, the replacement of (~x1(t),~x2(t),...,~xN(t)) by (~x˜1(t),~x˜2(t),...,~x˜N(t)) = (~x1(t) +~vt,~x2(t) +~vt,...,~xN(t) +~vt) The classical mechanics of a free particle or an assembly of free particles interacting by a two-body potential are thus invariant under Galilean transformations. We expect that the quantum mechanics of systems such as these are also invariant. It is interesting to first ask how this symmetry can be seen in Schrödinger’s equation for a single free particle, the quantum mechanical version of the system described by the classical equation (3.25). We expect that the quantum mechanical system is Galilean invariant if the classical one is. The single free particle Schrödinger equation is ∂ h¯ 2 ih¯ + ~∇2 ψ(~x,t) = 0 ∂t 2m We could try to boost this system by substituting~x →~x +~vt into the wave-function. This does not quite work. The wave equation which the “boosted” wavefunction ψ(~x +~vt,t) satisfies is a little different from the original Schrödinger equation, ∂ h¯ 2 ih¯ − ih¯~v ·~∇ + ~∇2 ψ(~x +~vt,t) = 0 ∂t 2m It has an extra term “−ih¯~v ·~∇”. We next observe this extra term with an ~x-dependent change of phase of the wave-function. First of all, ∂ h¯ 2 m ih¯ + ~∇2 + ~v2 e−im~v·~x/h¯ ψ(~x +~vt,t) = 0 ∂t 2m 2 m 2 Now, there is a simpler extra term, “ 2~v ” which we can remove using another simple change of phase of the wave-function. We get 2 ∂ h¯ 2 −im~v·~x/h¯+i m~v2t/h¯ ih¯ + ~∇ e 2 ψ(~x +~vt,t) = 0 ∂t 2m −im~v·~x/h¯+i m~v2t/h¯ which is the original Schrödinger equation. Our conclusion is that ψ(~x,t) and e 2 ψ(~x+ ~vt,t) satisfy the same Schrödinger equation. In as much as the Schrödinger equation describes the physics of the system, the upshot is that all physical processes will follow the same dynamical rules in the original reference frame and in the Galilean boosted reference frame if we map the wave-function from one frame to another by this transformation. This statement, as we have formulated it above, applies to single particle quantum mechanics. We can easily generalize it to the transformation of a many-particle wave-function m 2 −im~v·(~x1+...+~xN )/h¯+i ~v t/h¯ ψ(~x1,...,~xN.t) → e 2 ψ(~x1 +~vt,...,~xN +~vt.t) We can easily use this formula to understand how Galilean symmetry works in the quantum field theory formulation of the many-particle problem. However, there is a shortcut to doing this, 56 Chapter 3. Classical field theory and the action principle beginning with free particles. Here, we are actually interested in the classical field theory which yields that quantum field theory when one applies the rules of quantization, as we have discussed them earlier in this chapter. We know that the wave equation which the classical field satisfies, in the case where there are no interactions, is identical in form to the Schrödinger equation, ∂ h¯ 2 ih¯ + ~∇2 + µ ψ (~x,t) = 0 ∂t 2m σ This tells us how to do the Galilean transformation of the classical field: −im~v·~x/h¯+i m~v2t/h¯ ψσ (~x,t) → ψ˜σ (~x,t) = e 2 ψσ (~x +~vt,t) (3.26) We know by our discussion above that this must be a symmetry of the non-interacting theory. What remains to check is that it is also a symmetry of a theory with interactions. To check that this is indeed a symmetry of the classical field theory with interactions, for example the theory with Lagrangian density given in equation (3.3), we consider the infinitesimal transformation ~ δψσ (~x,t) = −im~v ·~x/h¯ +t~v · ∇ ψσ (~x,t) δψ†σ (~x,t) = im~v ·~x/h¯ +t~v ·~∇ ψ†σ (~x,t) By plugging this transformation into the Lagrangian density above, we see that the variation of the Lagrangian density (without use of the equations of motion) is equal to δL = ~∇ · (~vtL (~x,t)) The transformation in equations (??) and (??) is therefore a symmetry. The Noether charge and current densities corresponding to this symmetry are easy to find in terms of components of the energy-momentum tensor and the number density and current, Bb(~x,t) = tTtb(~x,t) + mxb ρ(~x,t) , Bba(~x,t) = tTab(~x,t) + mxb ja(~x,t) Here, (ρ,~j) are the Noether current associated with phase symmetry that we found in equation (3.19a). It is straightforward to confirm that the Galilean current in (??) and (??) is conserved. Noether’s theorem implies that in a Galilean invariant system ∂ b Ba(~x,t) + ∇ B (~x,t) = 0 ∂t ab Alternatively, if the system has space- and time-translation invariance and therefore a conserved energy-momentum tensor, we could form the Galilean Noether charge and current densities and, using the conservation law for the energy-momentum tensor (??) and (??) alone, we can write ∂ [ tT (~x,t) + x ρ(~x,t)] + ∇ [ tT (~x,t) + mx j (~x,t)] ∂t tb b a ab b a = Ttb(~x,t) + m jb(~x,t) We obtain a condition that the energy-momentum tensor and particle current must obey in order to have Galilean invariance in the system, Ttb(~x,t) = −m jb(~x,t) 3.6 Scale invariance 57 The conclusion is that the momentum density is equal to minus the particle mass times the particle current density in any translation and Galilean invariant system. In summary, the consequences of Galilean invariance are Galilean invariance. im δaψ (~x,t) = t∇a − xa ψ (~x,t) (3.27a) σ h¯ σ †ρ im †ρ δaψ (~x,t) = t∇a + xa ψ (~x,t) (3.27b) h¯ b δaL = ∇ (tδbaL ) (3.27c) Bb(~x,t) = tTtb(~x,t) + mxb ρ(~x,t) (3.27d) Bba(~x,t) = tTab(~x,t) + mxb ja(~x,t) (3.27e) ∂ ∂ If T (~x,t) + ∇aT (~x,t) = 0, T (~x,t) + ∇aT (~x,t) = 0, (3.27f) ∂t tt at ∂t tb ab ∂ R (~x,t) +~∇ · J~ (~x,t) = 0 requires (3.27g) ∂t b b Tat (~x,t) + m ja(~x,t) = 0 (3.27h) 3.6 Scale invariance In some circumstances, the non-relativistic quantum field theory that we have been discussing can have a symmetry under scaling of the space and time variables. If we examine the field equation, ∂ h¯ 2 ih¯ ψ (~x,t) = − ~∇ 2 − µ ψ (~x,t) + λψ†ρ (~x,t)ψ (~x,t)ψ (~x,t) (3.28) ∂t σ 2m σ ρ σ †σ and, if we assume that ψσ (~x,t) and ψ (~x,t) satisfy the field equation, and, wherever ψσ (~x,t) and ψ†σ (~x,t) appear, we substitute7 †σ d †σ 2 ψ˜ (~x,t)) ≡ Λ 2 ψ (Λ~x,Λ t) (3.29) d 2 ψ˜σ (~x,t)) ≡ Λ 2 ψσ (Λ~x,Λ t)) (3.30) †σ with Λ is a positive real number, we see that ψ˜σ (~x,t) and ψ˜ (~x,t) also satisfy the field equation with some re-scaled parameters, ∂ h¯ 2 µ ih¯ ψ˜ (~x,t) = − ~∇ 2 − ψ˜ (~x,t) + λΛd−2ψ˜ †ρ (~x,t)ψ˜ (~x,t)ψ˜ (~x,t) (3.31) ∂t σ 2m Λ2 σ ρ σ where we have given the result for d dimensions. What is more, the factors in front of the fields †σ in equations (3.29) and (3.30) are determined by requiring that ψ˜σ (~x,t)) and ψ˜ (~x,t) satisfy the equal-time commutation or anti-commutation relations, †ρ h d 2 d †σ 2 i d ρ d ρ d ψ˜σ (~x,t)),ψ˜ (~x,t) = Λ 2 ψσ (Λ~x,Λ t)),Λ 2 ψ (Λ~x,Λ t) = Λ δσ δ (Λ~x−Λ~y) = δσ δ (~x−~y) where we have used the property of the Dirac delta function δ(Λx) = δ(x)/|Λ|. If we set µ → 0 and if d = 2 or λ → 0, we obtain a scale invariant quantum field theory. The infinitesimal symmetry 7Note that the space and time coordinates to not scale in the same way, in fact ~x → Λ~x and t → Λ2t. In general ~x → Λ~x and t → Λzt where z is called the dynamical critical exponent. For our free non-relativistic field theory, z = 2 whereas, for relativistic field theory that we will study in subsequent chapters, z = 1. 58 Chapter 3. Classical field theory and the action principle transformations are ∂ d δψ (~x,t)) = 2t +~x ·~∇ + ψ (~x,t) (3.32) σ ∂t 2 σ ∂ d δψ†σ (~x,t)) = 2t +~x ·~∇ + ψ†σ (~x,t) (3.33) ∂t 2 The Noether current that is associated with this transformation is constructed from the energy- momentum tensor and the particle number current as R(~x,t) = 2tTtt (~x,t) + xbTtb(~x,t) (3.34) d h¯ 2 Ja(~x,t) = 2tTat (~x,t) + x T (~x,t) − ∇aρ(~x,t) (3.35) b ab 2 2m When the system is scale invariant, this charge and current density obeys the equation of continuity, ∂ a R(~x,t) + ∇aJ (~x,t) = 0 ∂t This equation involves an identity between the momentum charge density and the particle number current, d h¯ 2 2T (~x,t) + T (~x,t) − ~∇2ρ(~x,t) = 0 (3.36) tt aa 2 2m Any time and space-translation invariant theory with a conserved energy-momentum tensor and a number charge and current density where the energy-momentum tensor and the number current also obeys equation (3.36), then theory also has scale symmetry. 3.6.1 Improving the energy-momentum tensor It is sometimes convenient to consider an “improved” the energy-mometum tensor. Improvement is a procedure which adds a conserved, symmetric tensor to the energy momentum tensor in order to get a tensor with more favourable properties. In the present case, consider d h¯ 2 T (~x,t) = T˜ (~x,t) + δ ~∇2 − ∇ ∇ ρ(~x,t) ab ab d − 1 4m ab a b σ† where d is the dimension of space and, as usual, ρ(~x,t) = ψ (~x,t)ψσ (~x,t). Since a h ~ 2 i ∇ δab∇ − ∇a∇b anything = 0 and therefore a a ∇ T˜ ab(~x,t) = ∇ Tab(~x,t) the added term does not affect continuity equations. Also, the quantity that has been added, is a total divergence of derivatives of the density d h¯ 2 d h¯ 2 δ ~∇2 − ∇ ∇ ρ(~x,t) = ∇c (δ δ − δ δ )∇dρ(~x,t) d − 1 4m ab a b d − 1 4m ab cd ac bd The derivatives ∇dρ(~x,t) will falls off rapidly at spatial infinity, particularly when the density approaches a constant there. Then, using Gauss’ theorem, we see that Z Z 3 3 d x Tab(~x,t) = d x T˜ ab(~x,t) 3.6 Scale invariance 59 Finally, if the spatial part of the energy-momentum tensor is symmetric, Tab(~x,t) = Tab(~x,t) then so is T˜ ab(~x,t) = T˜ ab(~x,t). Finally, h¯ 2 T a = T˜ a + d ~∇2ρ(~x,t) a a 2m where we have written the expression for d space dimensions. Thus, by adjusting the constant c, we can adjust the trace of T˜ ab. The condition for scale invariance which we found in equation (3.36) was h¯ 2 2T + T a − d ~∇2ρ = 0 tt a 4m In terms of T˜ ab the condition becomes 2Ttt + T˜ aa = 0 3.6.2 The consequences of scale invariance In summary, the consequences of scale invariance are Scale invariance. ∂ d δψ (~x,t) = 2t +~x ·~∇ + ψ (~x,t) (3.37a) σ ∂t 2 σ ∂ d δψ†ρ (~x,t) = 2t +~x ·~∇ + ψ†ρ (~x,t) (3.37b) ∂t 2 ∂ a δL = (2tL ) + ∇a (x L ) (3.37c) ∂t a S (~x,t) = 2tTtt (~x,t) + x Tta(~x,t) (3.37d) a Kb(~x,t) = 2tTtb(~x,t) + x T˜ ba(~x,t) d h¯ 2 − δ ~∇2 − ∇ ∇ xaρ(~x,t) (3.37e) ba b a d − 1 4m (3.37f) ∂ ∂ If T (~x,t) + ∇aT (~x,t) = 0, T (~x,t) + ∇aT (~x,t) = 0, (3.37g) ∂t tt at ∂t tb ab ∂ S (~x,t) +~∇ · K~ (~x,t) = 0 requires : (3.37h) ∂t ˜ a 2Ttt (~x,t) + Ta (~x,t) = 0 (3.37i) ˜ a The operator equation 2Ttt (~x,t) + Ta (~x,t) = 0 has interesting consequences. This identity must hold in any scale invariant field theory. Its expectation value must also hold in any state of a scale invariant theory, even when the state itself is not scale invariant. In particular, the ground state |O > which we have discussed for a weakly interacting Fermi or Bose gas cannot be scale invariant since it contains a finite density of particles. However, if the theory happened to be scale invariant, we would have ˜ a 2 < O|Ttt (~x,t)|O > + < O|Ta (~x,t)|O >= 0 Generally, the expectation value of Ttt (~x,t) is the energy density. Moreover, the average of the expectation values of the diagonal components of T˜ ab(~x,t) is equal to the pressure. This tells us that, in any state of a scale invariant theory, 2u = dP (3.38) 60 Chapter 3. Classical field theory and the action principle Generally, the system that we are discussing is not scale invariant. In fact, since for the Lagrangian density (3.3), the improved energy-momentum tensor is h¯ 2 T˜ = − ∇ ψ†σ (~x,t)∇ ψ (~x,t) + ∇ ψ†σ (~x,t)∇ ψ (~x,t) ab 2m a b σ b a σ d h¯ 2 1 + δ ~∇2 − ∇ ∇ ψσ†(~x,t)ψ (~x,t) d − 1 2m d ab a b σ λ 2 − δ ψσ†(~x,t)ψ (~x,t) (3.39) ab 2 σ where we have used the equation of motion to eliminate the time derivative terms. The trace condition for scale invariance is λ 2 2T + T˜ = −2µψσ†(~x,t)ψ (~x,t) + (2 − d) ψσ†(~x,t)ψ (~x,t) (3.40) tt aa σ 2 σ which is non-zero. The system can only have scale invariance if the right-hand-side vanishes, as an operator. This can only happen if the chemical potential vanishes. This is not surprising, as the chemical potential has the dimensions of an energy and it should not be scale invariant. Also, outside of two dimensions, the interaction is not scale invariant. It turns out that the apparent scale invariance when µ = 0 and d = 2 is violated by a scale anomaly, so even in two dimensions, there is no scale invariance once the particles interact with each other with generic values of the coupling constant. There can be some special values of the coupling, “fixed points” at which the theory is conjectured to be scale invariant. The Feschbach resonance, or unitary point of a cold atom gas is thought to be such a point. 3.7 Special Schrödinger symmetry In a translation, Galilean and scale invariant theory, there is always another symmetry, called the special Schrödinger symmetry. The special Schrödinger transformation of the fields is ∂ im~x2 d δψ (~x,t) = t2 +t~x ·~∇ − + t ψ (~x,t) σ ∂t h¯ 2 2 σ ∂ im~x2 d δψ†σ (~x,t) = t2 +t~x ·~∇ + + t ψ†σ (~x,t) ∂t h¯ 2 2 With this transformation, it can be shown that the Lagrangian density transforms by the total derivative terms, ∂ 2 δL = t L +~∇ · (t~xL ) ∂t The Noether charge and current densities are constructed in the standard manner. They are related to the energy-momentum tensor components and the particle and number current densities as 2 2 x R(~x,t) = t Ttt (~x,t) +tx T (~x,t) + m ρ(~x,t) b tb 2 2 2 2 x h¯ Ja(~x,t) = t Tat (~x,t) +tx T (~x,t) + m Ja(~x,t) − dt ∇aρ(~x,t) b ab 2 4m If the energy-momentum tensor is conserved, the above current is also conserved if the following expression vanishes h¯ 2 2tT (~x,t) + xbT (~x,t) +tT (~x,t) + mxbJ (~x,t) −td ~∇2ρ(~x,t)) = 0 tt tb aa b 4m 3.8 The Schrödinger algebra 61 Scale and Galilean symmetry are enough to guarantee the above. We conclude that a translation invariant quantum field theory which has a conserved energy-momentum tensor associated with the time and space-translation invariance, and which also also has a Galilean symmetry, so that Ttb(~x,t) + mJb(~x,t) = 0 and a scale symmetry so that h¯ 2 2T (~x,t) + T a(~x,t) − d ~∇2ρ(~x,t) = 0 tt a 4m must also have a conserved current corresponding to the special Schrödinger symmetry. In summary, the transformations, Noether currents and the conditions for their conservation are Special Schroedinger invariance. ∂ im~x2 d δψ (~x,t) = t2 +t~x ·~∇ − + t ψ (~x,t) (3.41a) σ ∂t h¯ 2 2 σ ∂ im~x2 d δψ†σ (~x,t) = t2 +t~x ·~∇ + + t ψ†σ (~x,t) (3.41b) ∂t h¯ 2 2 ∂ 2 a δL = t L + ∇a (tx L ) (3.41c) ∂t 2 2 a x S˜(~x,t) = t Ttt (~x,t) +tx Tta(~x,t) + m ρ(~x,t) (3.41d) 2 x2 K˜ (~x,t) = t2T (~x,t) +txaT˜ (~x,t) + m J (~x,t) b bt ab 2 b d h¯ 2 − δ ~∇2 − ∇ ∇ xaρ(~x,t) (3.41e) ba b a d − 1 4m ∂ ∂ If T (~x,t) + ∇aT (~x,t) = 0, T (~x,t) + ∇aT (~x,t) = 0, (3.41f) ∂t tt at ∂t tb ab ∂ ~ S˜(~x,t) +~∇ · K˜(~x,t) = 0 requires (3.41g) ∂t ˜ a Tat (~x,t) + m ja(~x,t) = 0 and 2Ttt (~x,t) + Ta (~x,t) = 0 (3.41h) 3.8 The Schrödinger algebra A translation, rotation and Galilean invariant quantum field theory has conserved Noether charges the Hamiltonian, H, the linear momentum Pa, the angular momentum, Mab and a Noether charge Ba corresponding to Galilean boosts. In addition, Galilean symmetry makes use of the conserved number operator N . These charges are time independent and the expressions for them are 62 Chapter 3. Classical field theory and the action principle somewhat simpler if we evaluate them at t = 0, where they are Z Z 3 3 H = d Ttt (~x,0) = d x H (~x,0) Z Z 3 3 Pa = − d x Tta(~x,0) = m d x ja(~x,0) Z Z 3 3 Mab = − d x Mab(~x,0) = − d x (xa Ttb(~x,0) − xb Tta(~x,0)) Z 3 = m d x (xa jb(~x,0) − xb ja(~x,0)) Z im Z Ba = d3x Ba(~x,0) = d3x xaρ(~x,0) h¯ Z N = d3x ρ(~x,t) The densities in the above integrals are 2 h¯ †σ λ †σ 2 H (~x,t) = ~∇ψ (~x,t) ·~∇ψ (~x,t) + ψ (~x,t)ψ (~x,t) 2m σ 2 σ †σ ρ(~x,t) = ψ (~x,t)ψσ (~x,t) ih¯ ih¯ j (~x,t) = − ψ†σ (~x,t)∇ ψ (~x,t) + ∇ ψ†σ (~x,t)ψ (~x,t) a 2m a σ 2m a σ 2 a h¯ ∂ H (x,t),ψ (~y,t) = −∇ δ(~x −~y)∇aψ (~x,t) − ih¯ ψ (~x,t)δ(~x −~y) ρ x 2m ρ ∂t ρ 2 †ρ a h¯ †ρ ∂ †ρ H (x,t),ψ (~y,t) = −∇ δ(~x −~y)∇aψ (~x,t) − ih¯ ψ (~x,t)δ(~x −~y) x 2m ∂t †σ †σ [ρ(~x,t),ψσ (~y,t)] = −δ(~x −~y)ψσ (~x,t), ρ(~x,t),ψ (~y,t) = δ(~x −~y)ψ (~x,t) ih¯ ih¯ [j (~x,t),ψ (~y,t)] = δ(~x −~y)∇ ψ (~x,t) − ∇ (δ(~x −~y)ψ (~x,t)) a σ m a σ 2m a σ ih¯ ih¯ j (~x,t),ψ†σ (~y,t) = δ(~x −~y)∇ ψ†σ (~x,t) − ∇ δ(~x −~y)ψ†σ (~x,t) a m a 2m a †σ We can use the equal-time commutation relations for the fields ψσ (~x,t) and ψ (~x,t) to get the commutators for the number density and current, [ρ(~x,t),ρ(~y,t)] = 0 ih¯ [ρ(~x,t),j (~y,t)] = − ρ(~y,t)∇ δ(~x −~y) a 2m a ih¯ [j (~x,t),j (~y,t)] = − (j (~y,t)∇ + j (~x,t)∇ )δ(~x −~y) a b 2m a b b b These can be used to form the commutators of the Noether charges which are summarized as [N ,H] = 0 [N ,Pa] = 0 [N ,Mab] = 0 [N ,Ba] = 0 (3.42) [Pa,Pb] = 0 [Mab,H] = 0 [Pa,H] = 0. (3.43) [Mab,Mcd] = δadMbc − δacMdb + δbcMda − δbdMac (3.44) [Mab,Pc] = δacPb − δbcPa [Mab,Bc] = δacBb − δbcBa (3.45) N [B ,B ] = 0 [B ,H] = P [B ,P ] = −δ m (3.46) a b a a a b ab 2 3.9 Summary of this chapter 63 Note that the commutators of elements of the set {H,Pa,Mab,Ba,N } result in linear combinations of the elements themselves. Note that this statement requires that the number operator N is an element of the set. This property gives the linear vector space that is formed from all linear combinations of elements of the set {H,Pa,Mab,Ba,N } the structure of a Lie algebra. It is called the Galilean algebra. It contains a sub-algebra, the angular momenta Jab and their commutators with each other which is just the Lie algebra of rotations. In d-dimensional space, there are d(d − 1)/2 distinct Jab (three in d = 3). If we are interested in a theory that also has scale invariance, we can add two more charges, the dilatation, and the speial Schrödinger operator Z Z 3 m 3 a ∆ = d x D(~x,0) = d x x ja(~x,0) ih¯ Z Z ~x2 S = d3x D(~x,0) = m d3x ρ(~x,0) 2 These have commutators with the Galilean algebra elements [∆,H] = 2H, [∆,Pa] = Pa, [∆,Ba] = −Ba, [∆,Mab] = 0 (3.47) [S,H] = ∆, [S,Pa] = −Ba, [S,Ba] = 0, [S,Mab] = 0 (3.48) [S,∆] = 2S (3.49) The Lie algebra which includes these charges has the basis {H,Pa,Mab,Ba,N ,∆,S} and it is called the Schrödinger algebra. We can use the equal-time commutation relation to find 3 3 [∆,ψ (0,0)] = i ψ (0,0) ∆,ψ†σ (0,0) = i ψ†σ (0,0) σ 2 σ 2 An operator that has this property, that its commutator with ∆ is equal to itself times a constant is called a scaling operator and the constant is its scaling dimension. In this case, the scaling 3 dimension is equal to 2 . We can obtain other operators with higher scaling dimensions in two ways. One is by taking products of operators such as †ρ1 †ρ2 ρ` ψσ1 (0,0)ψσ2 (0,0)....ψσk (0,0)ψ (0,0)ψ (0,0)....ψ (0,0) 3 is also a scaling operator with dimension (k + `) · 2 . 3.9 Summary of this chapter The action is the integral of the Lagrangian density over space and time Z Z ∂ ∂ S[ψ ,ψ†σ ] = dt d3x L (ψ,ψ†, ψ, ψ†,∇ψ,∇ψ†) σ ∂t ∂t The Lagrangian density is a function of the classical fields and their derivatives. For Bosons, †σ ψσ (~x,t) and ψ (~x,t) are ordinary functions. For Fermions, they are anti-commuting functions. An example of a Lagrangian density for a non-relativistic many-particle system with spin-independent contact interactions is ih¯ ∂ ih¯ ∂ h¯ 2 L = ψ†σ (~x,t) ψ (~x,t) − ψ†σ (~x,t)ψ (~x,t) − ~∇ψ†σ (~x,t) ·~∇ψ (~x,t) 2 ∂t σ 2 ∂t σ 2m σ λ 2 +µψ†σ (~x,t)ψ (~x,t) − ψ†σ (~x,t)ψ (~x,t) σ 2 σ 64 Chapter 3. Classical field theory and the action principle The field equations are the Euler-Lagrange equations ∂ ∂ ∂L ∂L − − ∇ = 0 L ∂ a ∂ψ (~x,t) ∂t ∂(∇aψ (~x,t)) σ ∂( ∂t ψσ (~x,t)) σ ∂ ∂ ∂L ∂L − − ∇ = 0 †σ L ∂ a †σ ∂ψ (~x,t) ∂t †σ ∂(∇aψ (~x,t)) ∂( ∂t ψ (~x,t)) with the appropriate boundary conditions. The field equation is ∂ h¯ 2 ih¯ ψ (~x,t) = − ~∇ 2 − µ ψ (~x,t) + λψ†ρ (~x,t)ψ (~x,t)ψ (~x,t) ∂t σ 2m σ ρ σ The Lagrangian density, when written in the form ∂ L = ih¯ψ†σ (~x,t) ψ (~x,t) − H (ψ,ψ†) ∂t σ †σ indicates that the canonical momentum conjugate to ψσ (~x,t) is ih¯ψ (~x,t), the Poisson bracket is ψ (~x,t),ihψ†σ (~y,t) = δ ρ δ(~x −~y) , σ ¯ PB σ from which we identify the commutation relations of the quantized fields, ψ (~x,t),ihψ†σ (~y,t) = δ ρ δ(~x −~y) → ψ (~x,t),ihψ†ρ (~y,t) = ihδ ρ δ(~x −~y) σ ¯ PB σ σ ¯ ¯ σ The fields must therefore obey the equal time commutation relations †ρ ρ ψσ (~x,t),ψ (~y,t) = δσ δ(~x −~y), †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0 for Bosons or anti-commutation relations †ρ ρ ψσ (~x,t),ψ (~y,t) = δσ δ(~x −~y), †σ †ρ ψσ (~x,t),ψρ (~y,t) = 0, ψ (~x,t),ψ (~y,t) = 0 for Fermions. The classical field theory has a continuous symmetry if there exists an infinitesimal change of variables ψσ (~x,t) → ψσ (~x,t) + δψσ (~x,t) ψ†ρ (~x,t) → ψ†ρ (~x,t) + δψ†ρ (~x,t) such that, without use of the equations of motion, the linear variation of the Lagrangian density can be assembled into partial derivatives, ∂ δL = R(~x,t) +~∇ · J~(~x,t) ∂t Then, Noether’s theorem states that the Noether current and charge densities ∂L †ρ ∂L R(~x,t) = δψσ (~x,t) + δψ (~x,t) − R(~x,t) ∂ ∂ †ρ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂L ∂L (~x,t) = (~x,t) + †ρ (~x,t) − J (~x,t) Ja δψσ δψ †ρ a ∂(∇aψσ (~x,t)) ∂(∇aψ (~x,t)) 3.9 Summary of this chapter 65 obey the continuity equation ∂ R(~x,t) +~∇ · J~ (~x,t) = 0 ∂t which associates a conserved charge Z 3 QR = d xR(~x,t) with the symmetry in question. The energy-momentum tensor is constructed from the Noether currents corresponding to space and time translation symmetries. The components of the energy-momentum tensor are Improved energy-momentum tensor ∂ ∂L ∂ †σ ∂L Ttt (~x,t) = ψσ (~x,t) + ψ (~x,t) − L ∂t ∂ ∂t ∂ †σ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂L †σ ∂L Ttb(~x,t) = ∇bψσ (~x,t) + ∇ψ (~x,t) ∂ ∂ †σ ∂( ∂t ψσ (~x,t)) ∂( ∂t ψ (~x,t)) ∂ ∂L ∂ ∂L (~x,t) = (~x,t) + †σ (~x,t) Tat ψσ a ψ a †σ ∂t ∂(∇ ψσ (~x,t)) ∂t ∂(∇ ψ (~x,t)) ∂L ∂L ˜ (~x,t) = (~x,t) + †σ (~x,t) − Tab ∇bψσ a ∇bψ a †σ δabL ∂(∇ ψσ (~x,t)) ∂(∇ ψ (~x,t)) d h¯ 2 + δ ~∇2 − ∇ ∇ ρ(~x,t) d − 1 4m ab a b Explicit energy-momentum tensor h¯ 2 T = ~∇ψ†σ (~x,t) ·~∇ψ (~x,t) − µψ†σ (~x,t)ψ (~x,t) tt 2m σ σ λ 2 + ψ†σ (~x,t)ψ (~x,t) 2 σ ih¯ T = − ∇ ψ†σ (~x,t)ψ (~x,t) − ψ†σ (~x,t)∇ ψ (~x,t) tb 2 b σ b σ h¯ 2 ∂ ∂ T = − ∇ ψ†σ (~x,t) ψ (~x,t) + ψ†σ (~x,t)∇ ψ(~x,t) at 2m a ∂t σ ∂t a h¯ 2 T˜ = − ∇ ψ†σ (~x,t)∇ ψ (~x,t) + ∇ ψ†σ (~x,t)∇ ψ (~x,t) ab 2m a b σ b a σ d h¯ 2 − ∇ ∇ ψσ†(~x,t)ψ (~x,t) d − 1 2m a b σ 2 1 h¯ λ 2 − δ ~∇2 ψσ†(~x,t)ψ (~x,t) + ψσ†(~x,t)ψ (~x,t) ab d − 1 4m σ 2 σ Particle density and current σ† ρ(~x,t) = ψ (~x,t)ψσ (~x,t) ih¯ ih¯ j (~x,t) = − ψσ†(~x,t)∇ ψ (~x,t) + ∇ ψσ†(~x,t)ψ (~x,t) a 2m a σ 2m a σ 66 Chapter 3. Classical field theory and the action principle Continuity equations ∂ T + ∇aT = 0 ∂t tt at ∂ T + ∇aT˜ = 0 ∂t tb ab ∂ ρ(~x,t) +~∇ ·~j(~x,t) = 0 ∂t Relations required by symmetry Rotation symmetry: T˜ ab(~x,t) = T˜ ba(~x,t) Galilean invariance: Tta(~x,t) = −mja(~x,t) ˜ a Scale invariance: 2Ttt (~x,t) + Ta (~x,t) = 0 Charge and current densities for space-time symmetries (charge density, current density) symmetry (Ttt ,Tat ) time translation d h¯ 2 −T ,−T˜ − (δ − ∇ ∇ )ρ space translation tb ab d − 1 4m ab a b xbTtc − xcTtb,xbT˜ ac − xcT˜ ab rotation d h¯ 2 d h¯ 2 x T − x T ,x T˜ − x T˜ + x δ ~∇2 − ∇ ∇ ρ − x δ ~∇2 − ∇ ∇ ρ rotation b tc c tb b ac c ab b d − 1 4m ac a c c d − 1 4m ab a b −tTtb − mxbρ,−tT˜ ab − mxbja Galilean boost d h¯ 2 2tT + xbT , 2tT + xbT˜ + δ ~∇2 − ∇ ∇ (xbρ) dilatation tt tb at ab d − 1 4m ab a b x2 x2 d h¯ 2 t2T +txbT + m ρ , t2T +txbT˜ + m j +t δ ~∇2 − ∇ ∇ (xbρ) special Schrodinger tt tb 2 at ab 2 a d − 1 4m ab a b Relativistic Symmetry and II Quantum Field Theory 4 Space-time symmetry and relativistic field theory ...... 69 4.1 Quantum mechanics and special relativity 4.2 Coordinates 4.3 Scalars, vectors, tensors 4.4 The metric 4.5 Symmetry of space-time 4.6 The symmetries of Minkowski space 5 The Dirac Equation ...... 79 5.1 Solving the Dirac equation 5.2 Lorentz Invariance of the Dirac equation 5.3 Phase symmetry and the conservation of electric current 5.4 The Energy-Momentum Tensor of the Dirac Field 5.5 Summary of this chapter 6 Photons ...... 93 6.1 Relativistic Classical Electrodynamics 6.2 Covariant quantization of the photon 6.3 Space-time symmetries of the photon 6.4 Quantum Electrodynamics 6.5 Summary of this chapter 4. Space-time symmetry and relativistic field theory 4.1 Quantum mechanics and special relativity It is often said that quantum field theory is the natural marriage of Einstein’s special theory of relativity and the quantum theory. The point of this section will be to motivate this statement. We will begin with a single free quantum mechanical particle and ask what is wrong with simply assuming that its energy spectrum is 2 p 2 4 2 2 ~p given by the relativistic expression E(~p) = m c + c ~p , rather than the non-relativistic E(~p) = 2m . After 2 ~p2 all, for small momenta, the first expression is E(~p) ≈ mc + 2m + ... is very similar to the second expression with an additional constant in the energy. Let us assume that the particle travels on open, infinite three dimensional space. It is described by its position~x and momentum ~p which, for the quantum mechanical particle, are operators with the commutation relation a a [xˆ , pˆb] = ih¯δ b Momentum and energy are conserved and the energy and momentum are related by p E(~p) = m2c4 +~p2c2 (4.1) where m is the rest mass of the particle and c is the speed of light. In the quantum mechanics of a single particle, we could consider a quantum state of the particle which is an eigenstate of its linear momentum, pˆa |pi = pa |pi , i = 1,2,3 (4.2) These eigenstates of momentum have a continuum normalization, so that p|p0 = δ 3(~p −~p 0) . Because the energy of the particle, given in equation (4.1) above, is a function only of the momentum momentum of the particle, an eigenstate of the momentum is also an eigenstate of the energy, that is p H |pi = m2c4 +~p2c2 |pi (4.3) 70 Chapter 4. Space-time symmetry and relativistic field theory where H is the Hamiltonian. The Schrödinger equation which must be satisfied by the time-dependent state vector, |Ψ(t)i, is ∂ ih¯ |Ψ(t)i = H |Ψ(t)i (4.4) ∂t The solution of this equation, assuming that at t = 0 the particle is in a the superposition of eigenstates of momentum R d3 p f (~p)|~p > is √ Z 2 4 2 2 |Ψ(t) >= d3 pe−i m c +~p c t/h¯ f (~p) |p > (4.5) This simple development would seem to be a complete solution of the quantum theory of a single relativistic particle. We can use it to answer questions about it. For example, let us consider the scenario where, at some initial time, say t = 0 the particle is localized at position~0. This state could be created by a measurement of the position of the particle. We construct an eigenstate of position, that is, one which obeys xˆa|~x >= xa|x > (4.6) by superposing the complete set of momentum states as Z |~x >= d3 p|~p > < ~p |~x > (4.7) where the overlap matrix is a plane wave ei~p·~x/h¯ < ~p|~x >= 3 (4.8) (2πh¯) 2 Then, the wave function that evolves from an eigenstate of position with eigenvalue~0 at t = 0, at a time t later becomes √ Z −i m2c4+~p2c2t/h¯ 3 e |Ψ(t) >= d p|~p > 3 (4.9) (2πh¯) 2 We can now ask the question as to the amplitude for observing the particle at position~x after a time t has elapsed. The answer is simply the overlap of the position eigenstate |~xi with the above wave function evaluated at t. The result is √ Z e−i m2c4+~p2c2t/h¯−i~p·~x/h¯ <~x|Ψ(t) >=<~x|eiHt/h¯ |~0 >= d3 p (4.10) (2πh¯)3 Now, we find the difficulty.1 One of the postulates of the special theory of relativity states that the speed of light is a maximum speed. However, from equation (4.10), the probability amplitude is nonzero in the causally forbidden region, where |~x| > ct. There seems to be a nonzero amplitude for motion at speeds greater than that of light. A formal way to see that (4.10) is indeed nonzero in the forbidden region, is to consider t where it occurs in that equation as a complex variable. Then, (4.10) is analytic in the lower half of the complex t-plane. When t is real, the expression is a distribution which should be defined by its limit as complex t approaches the real axis from the lower half plane. Given that it is analytic in this domain, it cannot be zero in any region of the lower half plane plus the real axis except for discrete points, otherwise it would have to be zero everywhere. It is definitely not zero for all times, in fact when t = 0 it is a Dirac delta function. Thus, it cannot be zero in the entire region ct < |~x|. 1 This is in addition to the already obvious difficulty that the expression (4.10) is not Lorentz invariant. In fact, it transforms like the time derivative of a Lorentz invariant function. Let us overlook this issue for the time being. 4.1 Quantum mechanics and special relativity 71 To see this more explicitly, we can do the integral for the special case where m = 0. It becomes Z ∞ h i iHt/h¯ ~ 1 −ip[ct−|~x|]/h¯ −ip[ct+|~x|]/h¯ <~x|e |0 > = 2 pdp e − e 4π2h¯ i|~x| 0 ∂ 1 1 1 = lim − ε→0+ ∂(ct) 4π2i|~x| ct − |~x| − iε ct + |~x| − iε ∂ 1 P P = − − iπδ(ct − |~x|) + iπδ(ct + |~x|) ∂(ct) 4π2i|~x| ct − |~x| ct + |~x| ∂ 1 P = −iπδ((ct)2 −~x2)sign(t) + (4.11) ∂(ct) 2π2i (ct)2 − |~x|2 In the first line above, we have integrated the angles in spherical polar coordinates. In the second line above, −ip[ct−|~x|]/h¯ ∂ −ip[ct−|~x|]/h¯ we have used pe = i ∂(ct) e and we have defined the integral over the semi-infinite domain by introducing the positive infinitesimal parameter ε. In the third line, we have used the identity 1 P lim = + iπδ(x) ε→0+ x − iε x where P/x is the principal value distribution. Also, for the Dirac delta function 1 δ(t2 − a2) = (δ(t − a) + δ(t + a)) 2|a| 1 δ(t2 − a2)sign(t) = (δ(t − |a|) − δ(t + |a|)) 2|a| In equation (4.11), we see that the wave-function of a massless particle spreads in two ways. The first is a wave which travels at the speed of light and is therefore confined to the light cone - where |~x| = ct. The second is a principle value distribution which is non-zero everywhere, including in the forbidden region where |~x| > ct. This latter spreading of the wave packet violates causality. It tells us that, in our quantum mechanical system, the result of a measurement of the position of the particle at position~x after time t would indeed be possible. The particle could be observed as travelling faster than light. This would certainly seem to be incompatible with the principles of the special theory of relativity where objects are restricted to having sub-luminal speeds. Figure 4.1: The wave packet is initially localized at~0 and as time evolves it spreads in such a way that there is a nonzero amplitude for detecting it in the vicinity of point~x. If it is detected at~x, since |ct| < |~x|, its classical velocity would be greater than that of light. 72 Chapter 4. Space-time symmetry and relativistic field theory Now that we have found a difficulty with causality, we need to find a way to resolve it. We will resolve it by going beyond single-particle quantum mechanics to an extended theory where there is another process which competes with the one that we have described. The total amplitude will then be the sum of the amplitudes for the two processes and we will rely on destructive interference of the amplitudes to solve our problem, that is, to make the probability of detecting the particle identically zero in the entire forbidden region |~x| > ct. Figure 4.2: We should add to the amplitude for the particle to travel from~0 to~x as in figure 4.1 the amplitude that a particle-anti-particle pair is created at ~x, the particle continues forward in time as it did in the first process, the anti-particle propagates backward in time and annihilates the particle which was prepared in the state localized at~0. To include the second process, we will begin by framing the first process, the one we have discussed so far, as the following thought experiment. One observer, whom we shall all Alice, is located at position~0 and prepares the particle in the state which is localized at~0. Alice could do this by measuring the position of the particle and we assume that the result of the measurement is that the particle is at position~0. We assume that Alice can do this measurement with arbitrarily good precision. Immediately after the measurement, the particle is allowed to evolve by its natural time evolution, the one which we have described above, so that after time t, its quantum state is given by equation (4.9) and its wave-function by equation (4.10). Then, at time t, another observer, Bob, who is located at point ~x does an experiment to detect the particle. Of course, in a given experiment, Bob might or might not find the particle at~x. But, given that the particle is has non-zero amplitude to propagate there, if Alice and Bob repeat this experiment sufficiently many times, Bob will eventually detect the particle at~x. The result of the experiment is to collapse the particle’s wave function to one which is localized at~x. The amplitude for the particle to propagate to~x is given by (4.10). If this were all there is to it, the result of the experiment violates causality. The second process that we will superpose with the one that we have described will require other states to be introduced. It then clearly involves an extension of single particle quantum mechanics. In the second process, the attempt by Bob, the observer who is located at~x, to observe a paarticle’s position creates a pair consisting of a particle and an anti-particle. The position measure collapses the wave function of the particle into the position eigenstate localized at ~x, the position which was the final state of the particle in the first experiment. The anti-particle is interpreted as a particle which moves backward in time, from time t to time 0. After time −t it has an amplitude to arrive at position~0 where it annihilates the particle that Alice, the observer at~0, has prepared in the localized state. The result of this second process is the same as that of the first process, a particle begins in a state localized at 0 and after a time t it is detected in a state localized at~x. The amplitude for the second process is similar but not identical to that of the first process, due to the fact 4.2 Coordinates 73 that the positron propagates backward in time. It is √ Z ei m2c4+~p2c2t/h¯−i~p·~x/h¯ <~0|eiHt/h¯ |~x > = d3 p (4.12) antiparticle (2πh¯)3 The total amplitude is the sum of amplitudes of the two processes, −iHt/h¯ iHt/h¯ A =<~x|e |~0 >particle + <~0|e |~x >antiparticle (4.13) i~p·~x/h¯ √ √ Z e h 2 4 2 2 2 4 2 2 i = d3 p ei m c +~p c t/h¯ + e−i m c +~p c t/h¯ (4.14) (2πh¯)3 Now, the total expression can have destructive interference. We will not demonstrate it in the general case, but in the case where the mass of the particle and antiparticle is zero. There, we can perform integral in (4.12) explicitly, ∂ 1 P <~0|eiHt/h¯ |~x > = −iπδ((ct)2 −~x2)sign(t) − (4.15) antiparticle ∂(ct) 2π2i (ct)2 − |~x|2 We see that, like the amplitude for the particle, the amplitude for the antiparticle also spreads outside of its light cone. However, when we add the amplitudes of the two processes together, their sum is ∂ 1 <~x|e−iHt/h¯ |~0 > + <~0|eiHt/h¯ |~x > = −iπδ((ct)2 − |~x|2)sign(t) (4.16) particle antiparticle ∂(ct) π2 We see that the principal value part of the expression, which was nonzero outside of the light cone, has canceled. What remains describes the wave function of the initial particle spreading along its light cone, as we might expect for a massless particle, which travels at the speed of light. The upshot of the above development is that a correct treatment of a quantum mechanical particle which also obeys the laws of special relativity requires more than just single particle quantum mechanics. The resolution of the difficulty that we have suggested needs an anti-particle. Quantum field theory will supply us with an anti-particle. Another lesson is that the properties of the anti-particle must be finely tuned to be very similar to that of the particle. Otherwise the exact cancellation of the amplitude outside of the light cone would not happen. We will eventually see that this fine-tuning is generally a property of the relativistic wave equations which replace the Schrödingier equation. They have both positive and negative energy solutions which we shall interpret as belonging to the particle and the anti-particle that the wave equation simultaneously describes. We will put off further discussion of this fact until we study relativistic fields and their wave equations. 4.2 Coordinates The non-relativistic classical and quantum fields ψσ (~x,t) which we have dealt with so far are functions of both the time, t and the space coordinates, labeled by the vector~x. So far,~x label points in three dimensional Euclidean space and time is parameterized by the variable t. In the following, when we proceed to study relativistic field theories, we will find it convenient to think of space and time coordinates from a unified point of view and include time to form a four-vector (ct,~x). The time is defined with a factor of the speed of light, so that if t is measured in time units, the x0 = ct is measured in distance units. Points in the four-dimensional space-time are called events. In the remainder of this chapter, we will introduce some of the notation which we will use to describe the relevant properties of spacetime when we are discussing relativistic field theories. We will also introduce scalar, vector and tensor fields. Then, we will discuss the symmetries of space-time and our four-dimensional Minkowski space in particular. For the most part we will be interested in infinite flat four-dimensional Minkowski space. However, at the outset, it is useful to briefly consider more general space-time. In any spacetime geometry, our basic need is a coordinate system which labels the events of the space-time. A coordinate system assigns a unique sequence of real numbers x0,x1,x2,...,xD−1 to each event in space-time. The number of entries in the sequence, D, is the dimension. We will usually deal with the physical case of four dimensions. The four real numbers x0,x1,x2,x3 contain the four bits of data that are necessary for locating an event. The component 74 Chapter 4. Space-time symmetry and relativistic field theory x0 = ct is associated with time, the other three components x1,x2,x3 are said to be the spatial coordinates. For short, we denote the array x0,x1,x2,x3 by an indexed object, xµ , where the index µ runs over the values µ = 0,1,2,3. Each distinct event in space-time should be associated with a distinct set of four numbers. Conversely, each distinct set of four numbers should label a unique event. Note that the position of the index of xµ is up. In the following, this will be important. An up index will be different from, and must be distinguished from, a down index. Also, we will use the convention that an index which is a Greek letter, such as µ,ν,λ,σ,ρ,α,β,..., typically runs over the range 0,1,2,3 and is used to denote a four-component object, whereas an index which is a letter a.b,c,... runs over the range 1,2,3 and is used to denote the three spatial dimensions. We will sometimes denote the spatial part of xµ by xa or~x and the time component as x0 or ct. We will sometimes consider dimensions D other than four. In that case, there is always one time dimension and D − 1 space dimensions. A useful idea is that of changing between different coordinate systems. To some extent the labelling of events in space-time is arbitrary. As well as the coordinate system with labels xµ , we could use an alternative, say one with some different labels, x˜µ . To be precise, if the four numbers x0,x1,x2,x3 label a specific event in space-time in the old coordinate system, the same event has the label x˜0,x˜1,x˜2,x˜3 in the new coordinate system. We could build up a dictionary for translating between the old and new coordinate systems. This dictionary is encoded in transformation functions x˜µ (x). These are four functions, x˜0(x),x˜1(x),x˜2(x),x˜3(x), each one a function of four variables, xµ = x0,x1,x2,x3. If we plug the old coordinates of a space-time event, xµ , into these functions, they give us the new coordinates, x˜µ , of the same event. We assume that such a coordinate transformation is invertible. This means that, if we know the functions x˜µ (x), we could at least in principle find the inverse transformation xµ (x˜). We also assume that we can take derivatives of the transformation functions so that ∂x˜µ ∂xµ ≡ ∂ x˜µ , ≡ ∂˜ xν (4.17) ∂xν ν ∂x˜ν ν are both non-singular 4 × 4 matrices, at least in the ranges of coordinates of interest. ∂ Note that we have defined derivatives, ∂µ ≡ ∂xµ , with a down index. The difference between down indices and up indices occurs in the way in which the objects carrying the indices transform under a general coordinate transformation, for example, the four-gradient, ∂µ , which has a down index transforms as ∂ ∂xν ∂ ∂xν ∂˜ = = = ∂ µ ∂x˜µ ∂x˜µ ∂xν ∂x˜µ ν where we have used the chain rule for differentiation. We also remind the reader that we are using the Einstein summation convention by which, unless it is explicitly stated otherwise, repeated up and down indices are assumed to be summed over their range. Thus, ∂xν ∂ D−1 ∂xν ∂ ≡ µ ν ∑ µ ν ∂x˜ ∂x ν=0 ∂x˜ ∂x An infinitesimal increment of the coordinates, dxµ , has an up index and transforms as ∂x˜µ dx˜µ = dxν ∂xν We will often see expressions where an index is set equal to a down index and then summed over all values of the index. This creates an object with a simpler transformation law, for example ∂x˜µ ∂xσ dx˜µ ∂˜ = dxρ ∂ = dxσ ∂ µ ∂xρ ∂x˜µ σ σ where we have used the chain rule of differential calculus, ∂x˜µ ∂xσ ∂xσ = = δ σ ∂xρ ∂x˜µ ∂xρ ρ σ Here, δ ρ is the Kronecker delta symbol, which is equal to one when the up and down indices are equal, ρ = σ, and zero otherwise. 4.3 Scalars, vectors, tensors 75 It will often be useful to consider infinitesimal coordinate transformations. An infinitesimal transforma- tion is one where the new coordinates differ from the old coordinates by an infinitesimal amount, which can be encoded in four functions f µ (x) of infinitesimal magnitude and arbitrary profile, so that x˜µ = xµ + f µ (x) (4.18) To linear order in infinitesimals, it is easy to find the inverse of this transformation xµ = x˜µ − f µ (x˜) (4.19) For these infinitesimal transformations, ∂x˜µ ∂xµ = δ µ + ∂ f µ (x) , = δ µ − ∂ f µ (x) (4.20) ∂xν ν ν ∂x˜ν ν ν where we have written the right-hand-sides only to the linear order in the infinitesimal transformation. 4.3 Scalars, vectors, tensors So far, the only structure which we have given space-time is the existence of a coordinate system and the possibility of transforming between different coordinate systems. This is already sufficient structure to define some fields. A relativistic field is a function of the space-time coordinates that transforms in a certain way. The simplest example of a field is a scalar field. A scalar field is a function of the space-time coordinates whose values at particular space-time events specify the values of some physical quantity. The field should have the same value at the same event of space-time when that event is described in any coordinate system. If xµ and x˜µ are coordinates which label the same event in two different coordinate systems, and the scalar field has functional form φ(x) in the xµ coordinates and φ˜(x˜) in the x˜µ coordinates, then the statement that the scalar field has the same value at the same event of space-time in the two coordinate systems gives the scalar field transformation law, φ˜(x˜) = φ(x) (4.21) In terms of infinitesimal transformations (4.18), equation (4.21) is µ φ˜(x˜) = φ(x) + δφ(x) + f (x)∂µ φ(x) + ... = φ(x) In the above equation, we see that φ˜(x˜) differs from φ(x) in two ways. First of all, its functional form µ changes. This is δφ(x). Secondly, the coordinate which it depends on change, this is the term f (x)∂µ φ(x). Canceling the untransformed φ(x) from each side of the above equation, we obtain, to linear order, the transformation law for the scalar field, λ δφ(x) = − f (x)∂λ φ(x) (4.22) Like an increment of the coordinates, dxµ , a vector field V µ (x) has a direction at a given space-time point. The components of the vector field, V µ (x) transform in a similar way, ∂x˜µ V˜ µ (x˜) = V ν (x) (4.23) ∂xν and, for an infinitesimal transformation, µ λ µ µ λ δV (x) = − f (x)∂λV (x) + ∂λ f (x)V (x) (4.24) We could also consider a vector field with a lower index, Aµ (x) which transforms like the gradient ˜ ∂xν operator ∂µ = ∂x˜ν ∂ν , ∂xµ A˜ (x˜) = A (x) (4.25) µ ∂x˜ν ν and the infinitesimal transformation λ λ δAµ (x) = − f (x)∂λ Aµ (x) − ∂ν f (x)Aλ (x) (4.26) 76 Chapter 4. Space-time symmetry and relativistic field theory µ1...µk By similar reasoning, a tensor field with any number of up and down indices, T ν1...ν` (x), has the transformation law, µ1 µk σ1 σ` µ1...µk ∂x˜ ∂x˜ ∂x ∂x ρ1...ρk T˜ ν ...ν (x˜) = ...... T σ ...σ (x) (4.27) 1 ` ∂xρ1 ∂xρk ∂x˜ν1 ∂x˜ν` 1 ` and µ1...µk λ µ1...µk µ1 λ...µk δT ν1...ν` (x) = − f (x)∂λ T ν1...ν` (x˜) + ∂λ f (x)T ν1...ν` (x˜) + ... µk µ1...λ λ µ1...µk λ µ1...µk + ∂ f (x)T ... (x˜) − ∂ν f (x)T (x˜) + ... + ∂ν f (x)T (x˜) (4.28) λ ν1 ν` 1 λ...ν` ` ν1...λ Any physical scalar, vector or tensor field should transform in the way which we have outlined if they are to have physical meaning. When we set the indices in a pair equal, where one is an upper index and one is a lower index, and then we sum over all values of the index, the transformation law acting on those indices cancel. For example, the composite of two vector fields µ V (x)Aµ (x) transforms like a scalar field µ λ µ δ V (x)Aµ (x) = − f (x)∂λ V (x)Aµ (x) 4.4 The metric Now that we have introduced coordinates of space-time, we must discuss how some fundamental quantities, like time and distance, for example, are to be computed. The geometry of space-time is encoded in a symmetric two-index tensor field called the metric, gµν (x). It contains all of the information that we need in order to understand the geometry of a space-time. The metric transforms like a tensor field with lower indices ∂x˜ρ ∂x˜σ g˜ (x˜) = g (x) (4.29) µν ∂xµ ∂xν ρσ The metric is usually assumed to be non-singular, so that it can be inverted and its inverse is denoted by the same symbol, but with up-indices, gµν (x), so that µν µ νλ λ g (x)gνλ (x) = δ λ , gµν (x)g (x) = δµ The inverse of the metric transforms like a tensor with two up-indices, ∂xµ ∂xν g˜µν (x˜) = gρσ (x) (4.30) ∂x˜ρ ∂x˜σ The infinitesimal transformations are λ λ λ δgµν (x) = − f (x)∂λ gµν (x) − ∂µ f (x)gλν (x) − ∂ν f (x)gµλ (x) (4.31) µν λ µν µ λν ν µλ δg (x) = − f (x)∂λ g (x) + ∂λ f (x)g (x) + ∂λ f (x)g (x) (4.32) For the most part, we will not be interested in general space-times, but will focus on Minkowski space. Minkowski space is defined as that space-time where one can find a coordinate system so that the metric has the special form −1 0 0 0 −1 0 0 0 0 1 0 0 µν 0 1 0 0 µν µ νλ λ ηµν = , η = , η η = δ , ηµν η = δ (4.33) 0 0 1 0 0 0 1 0 νλ λ µ 0 0 0 1 0 0 0 1 µν where we denote this special metric by the symbol ηµν and its inverse by η . Let us return to the case of a generic metric. Given an infinitesimal increment of the coordinates, dxµ , the proper time is defined by 2 µ ν −dτ = gµν (x)dx dx 4.5 Symmetry of space-time 77 (The minus sign on the right-hand-side of this equation is a matter of convention.) This proper time is the time which elapses on a clock which moves with an object along a trajectory. Here, the trajectory is given by the parametric equation xµ (s) = xµ + sdxµ , 0 ≤ s ≤ 1 The proper time is a well-defined physical quantity and it should not depend on the coordinate system which is used. This is guaranteed by the coordinate transformation of the increment dxµ and the metric (??) which combine to µ ν µ ν g˜µν (x˜)dx˜ dx˜ = gµν (x)dx dx Finally, we observe that the metric can be used to raise and lower indices. If we take a vector field, V µ (x), we can create a vector with a lower index by contracting it with the metric tensor ν Vµ (x) = gµν (x)V (x) and raise the index with the inverse of the metric, µ µν V (x) = g (x)Vν (x) We can use the coordinate transformation laws for the metric and for vector fields to see that, indeed, V µ (x) and Vµ (x) transform as vector fields with an upper and a lower index, respectively, when the metric and its inverse transform like tensors of the appropriate type. 4.5 Symmetry of space-time Now that we have introduced the concept of metric, we can discuss the idea of a symmetry of a space-time. We define a symmetry transformation of a space-time as a general coordinate transformation under which the metric remains unchanged. An infinitesimal transformation, which is implemented with a vector field fˆµ (x) is a symmetry of space-time if δgµν (x) = 0 where δgµν (x) is given in equation (??). We will use a hat on a vector field which corresponds to a symmetry, fˆµ (x), in order to distinguish it from a general coordinate transformation which we shall still denote by f µ (x). The condition that the coordinate transformation does not change the metric gives us a partial differential equation which the four functions fˆµ (x) must obey, ˆλ ˆλ ˆλ ∂µ f (x)gλν (x) + ∂ν f (x)gµλ (x) + f (x)∂λ gµν (x) = 0 (4.34) This equation is called the Killing equation. The solutions, fˆµ (x), of the Killing equation are called Killing vectors. Each linearly independent Killing vector generates a symmetry of space-time. Different space-times can have different symmetries, varying both in the number and the nature of the symmetry transformations. As one can imagine, a generic space-time might have no symmetry at all. There turns out to be a maximum number of symmetries that a space-time can have. The space-time that we will be the most interested in, Minkowski space which we introduce in the next section, is one of the maximally symmetric four dimensional spaces. 4.6 The symmetries of Minkowski space I Minkowski space is a maximally symmetric space-time. The largest number of Killing vectors that a four dimensional space-time can have is ten. The Killing equation on Minkowski space is ∂µ fˆν (x) + ∂ν fˆµ (x) = 0 (4.35) where the index on fˆµ is lowered by the Minkowski metric, ν fˆµ (x) ≡ ηµν fˆ (x) (4.36) The ten solutions of this equation are: 1. four constants fˆµ = cµ corresponding to constant translations of the space-time coordinates ˆµ µ ν ρ 2. f = ω ν x with constants ωµν = −ωνµ , where ωµν = gµρ ω ν . There are six independent compo- nents of this 4 × 4 antisymmetric tensor which correspond to three infinitesimal spatial rotations and three infinitesimal Lorentz boosts. 78 Chapter 4. Space-time symmetry and relativistic field theory Given that we have found the infinitesimal transformations, it is easy to find the finite transformation, it is the linear transformation µ µ ν µ x˜ = Λ ν x + c where cµ are constants and the constant matrix Λ obeys the equation ρ σ Λ µ Λ ν ηρσ = ηµν µ µ µ We can see from this equation that, to linear order, it is indeed solved by the expression Λ ν = δ ν + ω ν where ωµν is anti-symmetric. These are the matrices which implement Lorentz transformations. Here, we use the term Lorentz transformation to refer to both the change between reference frames moving at different constant velocities and rotations of the spatial coordinates. When they are combined with the constant translations of the coordinates cµ they are called Poincare transformations. 5. The Dirac Equation So far, we have formulated an approach to the quantum mechanics of a many-particle system which led us to the non-relativistic field equation, in the absence of interactions, 2 ∂ h¯ ~ 2 ih¯ ψ(~x,t) = − ∇ + εF ψ (~x,t) = 0 (5.1) ∂t 2m σ In this section we shall discuss the equation which replaces this one in a relativistic theory. From our point of view, the main difference between the two is symmetry. The field equation above has Galilean symmetry. We want to trade it for an equation that has Lorentz symmetry. We will continue to discuss a many-Fermion system and to give the discussion a physical context, we will sometime call the Fermions “electrons” with the idea that they will eventually become the electrons of quantum electrodynamics. We have ignored the interaction terms in the above equation. We will continue to do this, to assume that the Fermions are noninteracting. Later on, once we have a relativistic field equation, we will let the fields interact. To seek the appropriate relativistic wave equation, we could recall our discussion of the previous chapters. If we simply postulate a Hamiltonian with a relativistic dispersion relation, so that the wave equation (5.1) is replaced by ∂ q ih¯ ψ (~x,t) = m2c4 − c2h¯ 2~∇2ψ (~x,t) (5.2) ∂t σ σ the resulting theory has difficulties with causality. There is a finite probability of the particle propagating faster than the speed of light. The difficulty lies in the fact that the “Hamiltonian” operator on the right- hand-side of this equation is not a polynomial in derivatives. Having all orders in derivatives, it is not a local differential operator. Dirac found a way to replace this equation by one where the Hamiltonian has the same spectrum, but the operator is a polynomial in derivatives. We also have this goal. In our discussion of single particle relativistic quantum mechanics, the problems with causality had a potential solution if, besides the particle, ther theory contained an anti-particle. Let us address the problems with causality by postulating the existence of an anti-particle which would satisfy and equation similar to (5.4) but with negative energy, q ∂ 2 4 2 2 2 ih¯ ψ˜ ˜ (~x,t) = − m c − c h¯ ~∇ ψ˜ ˜ (~x,t) (5.3) ∂t σ σ Assuming that both the particle and the anti-particle occur in the same theory, we could combine the two into 80 Chapter 5. The Dirac Equation the same multi-component field to find " " p 2 4 2 2~ 2 ∂ ψσ (~x,t) m c − c h¯ ∇ 0 ψσ (~x,t) ih¯ = p (5.4) ∂t ψ˜σ˜ (~x,t) 0 − m2c4 − c2h¯ 2~∇2 ψ˜σ˜ (~x,t) p One might wonder whether there is a matrix Hamiltonian which has eigenvalues ± m2c2 + c2h¯ 2~k2 and which is polynomial in derivatives. There is no such 2 × 2 matrix. We thus need to involve the spin indices ψ (~x,t) and consider σ as a four-component object. It is Dirac’s great insight that our problem can be solved ψ˜σ˜ (~x,t) with a 4 × 4 matrix. (We caution the reader that equation (5.4) is still not quite correct.) Consider the four Hermitian 4 × 4 matrices β, α1, α2, α3 which have the algebraic properties ββ = 1, βαi + αiβ = 0, αiα j + α jαi = 2δ i j1 where 1 is the 4 × 4 unit matrix. (Alternatively, if we consider matrices with the above properties, there is a way to show that the minimal size of such matrices is 4 × 4. Then, we consider the wave equation ψ1(~x,t) ψ1(~x,t) ∂ ψ2(~x,t) h 2 iψ2(~x,t) ih¯ = βmc + ihc¯ ~α ·~∇ ∂t ψ3(~x,t) ψ3(~x,t) ψ4(~x,t) ψ4(~x,t) The “Dirac Hamiltonian” h 2 ~ i hD = βmc + ihc¯ ~α · ∇ (5.5) is a hermitian operator. It must have real eigenvalues. What is more 2 2 4 2 2~ 2 hD = m c − h¯ c ∇ (5.6) ~ 2 2 p 2 4 2 2 2 so, since the eigenvalues of ∇ are −~k , hD has eigenvalues ± m c + h¯ c ~k , which is the desired property. Moreover, hD is a polynomial, in fact it is at most linear in derivatives, and it is therefore a local operator. To make the Dirac equation look more covariant, we can define the matrices β = iγ0 , α = γ0~γ (5.7) γ0 = −γ0† , γi = γi† (5.8) {γ µ ,γν } = 2η µν (5.9) γ µ are called the Dirac gamma-matrices. Using them, the Dirac equation is the matrix differential equation 4 h µ mc i ∑ γab∂µ + δab ψb(~x,t) = 0 (5.10) b=1 h¯ or, with implicit summations over indices, ∂/ + mψ(x) = 0 (5.11a) where we shall use the slash notation for the product to the Dirac matrices with any other four-vector µ A/ ≡ γ Aµ (5.12a) 5.1 Solving the Dirac equation 81 We will hereafter assume that we are using a system of units where h¯ = 1 and where c = 1. We shall find it useful to define ψ¯ (x) ≡ ψ†(x)γ0, ψ†(x) ≡ −ψ¯ (x)γ0 (5.13a) Using this definition and taking a hermitian conjugate of the equation in (5.14a), we obtain h ←− i ψ¯ (x) − ∂/ + m = 0 (5.14a) where the left oriented arrow above the derivative indicates that it operators on whatever is to the left of it. The Dirac equation has a structure similar to the Schrödinger wave equation with the difference that, what we would call the single-particle Hamiltonian, hD, is a matrix and it is linear, rather than quadratic, in derivatives. We might expect that the Hamiltonian of the quantum field theory is given by Z Z 3 † 3 H0 = d xψ (x)hDψ(x) = d xH (x) (5.15) h i H (x) = iψ¯ (x) ~γ ·~∇ + m ψ(x) (5.16) and that the time derivative of ψ(~x,t) is generated by this Hamiltonian by taking a commutator, ∂ i ψ(x) = [ψ(x),H ] ∂t 0 In fact this will indeed be the case if the field operator obeys the equal-time anti-commutation relations n † o 3 ~ ψa(~x,t),ψb (~y,t) = δabδ (k −~y) n † † o {ψa(~x,t),ψb(~y,t)} = 0 , ψa (~x,t),ψb (~y,t) = 0 (5.17) Our task in the following will be to assume that ψ(~x,t) indeed obeys the Dirac equation and this anti- commutation relation and to find a solution of them. We have not discussed why the Dirac equation is relativistic. It is straightforward to see this, but we will put off the details until later. Here, we observe that, given the anti-commutation algebra of the Dirac matrices, 1 (γ µ ∂ )2 = γ µ γν ∂ ∂ = {γ µ ,γν }∂ ∂ = ∂ 2 µ µ ν 2 µ ν ν Using this identity, we can operate the matrix valued differential operator (−γ ∂ν +m) on the Dirac equation from the left to obtain ν µ 2 2 (−γ ∂ν + m)(γ ∂µ + m)ψ = 0 → (−∂ + m )ψ = 0 We see that, if ψ(x) obeys the Dirac equation, it also obeys the relativistic wave equation (−∂ 2 +m2)ψ(x) = 0. This implies that the solutions of the Dirac equation also obey this relativistic wave equation, and must therefore propagate like relativistic matter waves. 5.1 Solving the Dirac equation To see how the Dirac equation is solved, it is useful to choose a specific form for the Dirac matrices 0 σ i 0 1 γi = , γ0 = (5.18) σ i 0 −1 0 82 Chapter 5. The Dirac Equation where we use 1 to denote the 2 × 2 unit matrix which appears in the upper and lower triangle of γ0. Also, σ i are the 2 × 2 Pauli matrices, which we remind the reader are given by 0 1 0 −i 1 0 σ 1 = , σ 2 = , σ 3 = (5.19) 1 0 i 0 0 −1 The Pauli matrices have the properties σ iσ j + σ jσ i = 2δ i j1 , σ iσ j − σ jσ i = 2iεi jkσ k where εi jk is the totally antisymmetric tensor with ε123 = 1. It is easy to confirm that the explicit form (5.18) indeed have the correct anti-commutation algebra for Dirac matrices. With the matrices in (5.18), the Dirac equation is " ~ # m ∂0 +~σ · ∇ u(x) ~ = 0 (5.20) −∂0 +~σ · ∇ m v(x) where we have split the four-component Dirac spinor into u(x) an v(x) which are two 2-component objects. To solve the differential equation, we use the ansatz u(x) µ ν u ~ u = eip ηµν x = e−iωt+ik·~x (5.21) v(x) v v mu − i[ω −~σ ·~k]v = 0 (5.22) mv + i[ω +~σ ·~k]u = 0 (5.23) We have now reduced the Dirac equation to two matrix equations. Equation (5.23) determines the 2- component object v in terms of u, that is, if u were known, we could determine v as i v = − [ω +~σ ·~k]u (5.24) m Plugging this into (5.22) yields the condition ω2 =~k2 + m2 (5.25) p p which has two solutions for ω, ω = ~k2 + m2 and ω = − ~k2 + m2, the positive and negative energy solutions, respectively. In the following, we will use the notation where ω is the frequency which can be p either positive or negative, and E(~k) = ~k2 + m2 is positive, and sometimes abbreviated by E. Next, we note that ~σ ·~k is a Hermitian matrix which can be diagonalized. Once diagonal, it has real eigenvalues. The eigenstates of this matrix are said to be “eigenstates of helicity”. It is left as an exercise to the reader to show that there exist two eigenvectors, ~ ~ ~ ~ † † † † ~σ · ku+ = |k|u+ , ~σ · ku− = −|k|u− u+u+ = 1 = u−u− , u+u− = 0 = u−u+ We shall find the following identities very useful ~ ~ ~ ~ † |k| +~σ · k † |k| −~σ · k u+u+ = , u−u− = 2|~k| 2|~k| Then, putting it all together, we have four linearly independent solutions which we can superimpose to form the Dirac field, q q Z i~k·~x−iEt i 1 − |~k|/Eu i 1 + |~k|/Eu 3 e + ~ − ~ ψ(x) = d k √ 3 q a+(k) + q a−(k) 2(2π) 2 1 + |~k|/Eu+ 1 − |~k|/Eu− q q Z −i~k·~x+iEt i 1 − |~k|/Eu i 1 + |~k|/Eu 3 e + † ~ − † ~ + d k √ 3 q b+(k) + q b−(k) (5.26) 2(2π) 2 − 1 + |~k|/Eu+ − 1 − |~k|/Eu− 5.1 Solving the Dirac equation 83 p where E(~k) = ~k2 + m2. This solution will obey the anti-commutation relation for the Dirac field (5.17) if the Fourier coefficients satisfy the non-vanishing anti-commutation relations are n ~ † ~ 0 o ~ ~ 0 n ~ † ~ 0 o ~ ~ 0 a+(k),a+(k ) = δ(k − k ) , a−(k),a−(k ) = δ(k − k ) (5.27) n ~ † ~ 0 o ~ ~ 0 n ~ † ~ 0 o ~ ~ 0 b+(k),b+(k ) = δ(k − k ) , b+(k),b+(k ) = δ(k − k ) (5.28) All other combinations have vanishing anti-commutators. We can easily check that, with these anti- commutation relations for creation and annihilation operators, the solution in equation (5.26) obeys equations (5.17). With this solution, the Hamiltonian (5.16) is Z q 3 ~ 2 2 † ~ ~ † ~ ~ † ~ ~ † ~ ~ H0 = d k k + m a+(k)a+(k) + a−(k)a−(k) + b+(k)b+(k) + b−(k)b−(k) (5.29) and the number operator is Z N = d3xψ†(x,t)ψ(~x,t) Z 3 † ~ ~ † ~ ~ † ~ ~ † ~ ~ = d k a+(k)a+(k) + a−(k)a−(k) − b+(k)b+(k) − b−(k)b−(k) (5.30) In both of the above expressions, we have dropped infinite constants. Unlike in the non relativistic theory, the vacuum energy density and the vacuum charge density both contain infinite constants which we have to simply drop in order to have a sensible Hamiltonian and number operator. We see from the expression (5.30), that, in direct analogy to the non relativistic system that we have ~ † ~ studied, electrons, which are associated with a±(k) and a±(k) contribute positively to the particle number ~ † ~ whereas holes, or positrons, which are associated with b±(k) and b±(k), have negative particle number. What differs from the non-relativistic theory is the fact that electrons and positrons have the same energy p spectrum. Since the energy of a single electron, ~k2 + m2, can be arbitrarily large, for large values of |~k, it is also so for holes (or positrons). This means that, as we shall seen, unlike the Fermi sea, the relativistic analog, which we can call the “Dirac sea”, is infinitely deep. This is what leads to the infinite values of the energy and number densities (which we have dropped). Another difference with the nonrelativistic theory, where the electron had a state of well-defined spin is that in the relativistic theory, it is the helicity (the states labeled by subscripts (+) and (-)), which are important. The helicity can be thought of as the projection of the spin in the direction of motion of the Fermion. We construct the basis of the Fock space beginning with the vacuum |O > which we assume is normalized, < O|O >= 1 and has the property that it is annihilated by all of the annihilation operators, a+(~k)|O >= 0 , a−(~k)|O >= 0 , b+(~k)|O >= 0 , b−(~k)|O >= 0 for all values of~k. Then, multi particle and anti-particle states are created by operating creation operators † ~ † ~ † ~ 0 † ~ 0 † ~ † ~ † ~0 † ~0 a+(k1)...a+(km)a−(k1)...a−(km0 )b+(`1)...b+(`n)b−(`1)...b−(`n0 )|O > These states are eigenstates of particle number, N , with eigenvalue N = m + m0 − n − n0 and they are eigenstates of the Hamiltonian, H0, with eigenvalue the total energy, m q m0 q n q n0 q ~ 2 2 ~ 0 2 2 ~2 2 ~0 2 2 E = ∑ ki + m + ∑ (ki) + m + ∑ `i + m + ∑ (`i) + m 1 1 1 1 84 Chapter 5. The Dirac Equation 5.2 Lorentz Invariance of the Dirac equation The Dirac equation is clearly invariant under translations of the space-time coordinates. If ψ(~x,t) is a solution of the Dirac equation, then ψ(~x +~a,t + τ), with constants ~a and τ, is also a solution. What about Lorentz transformations? Let us begin by recalling how a Lorentz transformation of a scalar field is implemented. Recall that a Lorentz transformation is the linear transformation on the coordinates µ µ µ ν x → x˜ = Λ ν x µ ν where the matrices Λ ν x satisfy the equation µ ρ Λ ν Λ σ ηµρ = ηνσ We are often interested in infinitesimal transformations. For a Lorentz transformation µ µ µ Λ ν = δ ν + ω ν + ... −1 µ µ µ (Λ ) ν = δ ν − ω ν + ... In our discussion of coordinate transformations, we have derived the transformation property of the scalar field, φ˜(x˜) = φ(x) which, for the Lorentz transformation, we can rewrite as φ˜(x) = φ(Λ−1x) The infinitesimal transformation is then ν µ δφ(x) = −ωµν x ∂ φ(x) We would expect this transformation to be a symmetry of a relativistic wave equation that a scalar field would obey. We expect that, under a Lorentz transformation, the argument of the Dirac field also changes. However, the Dirac field has four components and the Lorentz transformation could also mix the components. Thus, we could make the ansatz that the Lorentz transformation of the Dirac field involves multiplication of it by a matrix, −1 µ ν ψ˜ (x) = S(Λ)ψ(Λ x) = ψ(x) − ω ν x ∂µ ψ(x) + sψ(x) + ... Here, S(Λ) + 1 + s + ... is a 4 × 4 matrix which depends on the Lorentz transformation. Since a Lorentz transformation should be invertible, we expect that S is an invertible matrix, that is, that detS 6= 0. This transformation is a symmetry of the Dirac equation if the transformed field also satisfies the equation, that is, if 0 = [∂/ + m]ψ˜ (x) µ ν = [∂/ + m]ψ(x) + [∂/ + m] −ω ν x ∂µ + s ψ(x) µ ν µ ν µ = −ω ν γ ∂µ + ∂/,s ψ(x) = −ω ν γ + [γ ,s] ∂µ ψ(x) = 0 µ ν µ if − ω ν γ + [γ ,s] = 0 µ We need to find s as a function of ω ν such that µ µ ν [γ ,s] = ω ν γ We can easily see that this equation is solved by 1 s = [γρ ,γσ ]ω 8 ρσ 5.2 Lorentz Invariance of the Dirac equation 85 To see this, consider 1 1 [γ µ ,s] = [γ µ ,[γρ ,γσ ]]ω = [γ µ ,γρ γσ ]ω 8 ρσ 4 ρσ 1 = ({γ µ ,γρ }γσ − γρ {γ µ ,γσ })ω 4 ρσ 1 = (2η µρ γσ − γρ 2η µσ )ω = ω µ γσ 4 ρσ σ Thus, we have found the infinitesimal Lorentz transformation of the Dirac field, 1 δψ(x) = ω xµ ∂ ν + [γ µ ,γν ] ψ(x) (5.31a) µν 8 ←−ν 1 δψ¯ (x) = ψ¯ (x) xµ ∂ − [γ µ ,γν ] ω (5.31b) 8 µν Although we shall not need it, we could also consider finite, rather than infinitesimal Lorentz transformations. They are a symmetry of the Dirac equation if the matrix S satistifes −1 µ −1ν ν S γ SΛ µ = γ 1 µ ν and this should be solved by S = 1 + s + ... = 1 + 8 [γ ,γ ]ωµν + .... For an infinitesimal spatial rotation in the 1-2 plane (or, about the 3-axis), the only non-zero components of ωµν are ω12 and ω21 = −ω12 and 1 δψ(x) = ω x1∂ 2 − x2∂ 1 + γ1γ2 ψ(x) (5.32) 12 2 1 = ω (~x ×~∇)3 + γ1γ2 ψ(x) (5.33) 12 2 1 σ 3 0 = iω (~x × (−i~∇))3 + ψ(x) (5.34) 12 2 0 σ 3 ←− 1 σ 3 0 δψ¯ (x) = −iω ψ¯ (x) (~x × (−i ∇ ))3 + (5.35) 12 2 0 σ 3 Indeed, for a rotation by an axis in the direction of the infinitesimal vector ~θ by angle |~θ|, ~ 1~ 1 θ · ~x × i ∇ + 2~σ 0 δψ(x) = i ψ(x) ~ 1~ 1 0 θ · ~x × i ∇ + 2~σ ←− ~ 1 1 θ · ~x × i ∇ + 2~σ 0 δψ¯ (x) = −iψ¯ (x) ←− ~ 1 1 0 θ · ~x × i ∇ + 2~σ that is, a rotation is a combination of a rotation of the spatial coordinates, implemented by the angular ~ 1~ momentum operator L =~x ×~p =~x × i ∇ and a rotation of the Dirac field spin, implemented by the Pauli 1 matrices 2~σ. Then, in particular, under infinitesimal rotations, the electron density and the mass operator transform like a scalar fields, δ ψ†(x)ψ(x) = ~θ ·~x ×~∇ ψ†(x)ψ(x) δ (ψ¯ (x)ψ(x)) = ~θ ·~x ×~∇ (ψ¯ (x)ψ(x)) 86 Chapter 5. The Dirac Equation For an Lorentz transformation with infinitesimal velocity~v, the only non-zero components of ωµν are ω0a = va and ωa0 = −vv and 1 δψ(x) = x0~v ·~∇ −~v ·~x∂ 0 + γ0~v ·~γ ψ(x) (5.36) 2 ←− ←−0 1 δψ¯ (x) = ψ¯ (x) x0~v · ∇ −~v ·~x ∂ − γ0~v ·~γ (5.37) 2 ←− ←−0 1 δψ†(x) = ψ†(x) x0~v · ∇ −~v ·~x ∂ + γ0~v ·~γ (5.38) 2 The mass operator transforms like a scalar field, δ (ψ¯ (x)ψ(x)) = (x0~v ·~∇ −~v ·~x∂ 0)(ψ¯ (x)ψ(x)) However, the density is not a scalar, but has the transformation law δ ψ†(x)ψ(x) = (x0~v ·~∇ −~v ·~x∂ 0) ψ†(x)ψ(x) +~v · ψ¯ (x)~γψ(x) (5.39) Of course, it transforms like the time-component of a vector field, consistent with the fact that it is the time-component of jµ (x) = ψ†(x)ψ(x),−ψ¯ (x)~γψ(x) (5.40) Indeed, we could examine the full Lorentz transformation law and see that jµ (x) transforms like a vector field. 5.3 Phase symmetry and the conservation of electric current In the above, we have examined the transformation law for the Dirac field density and we found that it transforms like the time-component of a vector field (5.40). We will see shortly that this vector field obeys µ the continuity equation, ∂µ j (x) = 0, which we would expect for a “conserved current” and, when our Dirac Fermions are coupled to photons, it, scaled by a unit of electric charge e, will be identified with the electric charge and current densities. In non-relativistic terminology, ej0(x) is the “charge density” and e~j(x) is the “current density”. In relativistic physics, we simply call jµ (x) a “current” or a “conserved current”. We consider the Dirac equation and its conjugate † h †µ ←− i ∂/ + m ψ = 0 , ψ γ ∂ µ + m = 0 Then, we note that, ㆵ γ0 = −γ0γ µ , so that, if we multiply the second equation above from the right by γ0 we obtain ←− † 0 h i ∂/µ + m ψ = 0 , ψ γ − ∂/ + m = 0 We shall shorten the notation by defining ψ¯ (x) ≡ ψ†(x)γ0 (5.41) Then, we can write the current as jµ (x) = −ψ¯ (x)γ µ ψ(x) (5.42) Now, we are ready to use derive the continuity equation. We form the current and we assume that the Dirac spinor satisfies the Dirac equation. Then, ←− µ µ h i −∂µ j (x) = ∂µ (ψ¯ (x)γ ψ(x)) = ψ¯ ∂/ + ∂/ ψ(x) = ψ¯ (x)[−m + m]ψ(x) = 0 This continuity equation for the current implies the conservation of charge, which we have so far called the number operator, d d Z Z ZZ N = d3xψ†(x)ψ(x) = d3x~∇ · ψ¯ (x)~γψ(x) = d2σnˆ · ψ¯ (x)~γψ(x) dt dt where we have used Gauss’ theorem to write the last term as a surface integral at the infinite boundary of three dimensional space. If our boundary conditions are such that the quantum expectation value of the current density goes to zero sufficiently rapidly there, the number operator is conserved. 5.4 The Energy-Momentum Tensor of the Dirac Field 87 5.4 The Energy-Momentum Tensor of the Dirac Field The Dirac equation can be derived from an action Z S = d4xL (x) where the Lagrangian density is 1−→ 1←− L (x) = −iψ¯ (x) ∂/ − ∂/ + m ψ(x) 2 2 Here we have defined the action as the integral of the Lagrangian density. It should be kept in mind, of course, that the Dirac field describes Fermions and therefore the Lagrangian density depends on anti-commuting classical fields ψ(x) and ψ¯ (x). The “−i” in front and the symmetrization of the derivative operators are present to make the Lagrangian density real. Moreover, the leading terms, up to total derivatives are (x) = i †(x) ∂ (x) + ... which is compatible with the equal-time anti-commutation relations that we L ψ ∂x0 ψ have used for the Dirac field. The Dirac equation is easily recovered from this Lagrangian density using the Euler-Lagrange equations. Now, consider a space-time translation xµ → xµ + ε µ , where ε µ is an infinitesimal constant four-vector. The Dirac field transforms as µ µ δψ(x) = −ε ∂µ ψ(x) , δψ¯ (x) = −ε ∂µ ψ¯ (x) By inspection, we see that, when the Dirac field transforms this way, the Lagrangian density transforms as µ δL (x) = ∂µ [−ε L (x)] The fact that the Lagrangian density varies by a total derivative term means that the infinitesimal translation is a symmetry of the theory. Of course, we already knew that this should be the case, since we have already seen that the equation of motion has this symmetry. To find the Noether current, we use Noether’s theorem which tells us that µ ν ∂L (x) ν ∂L (x) µ Jε (x) = −ε ∂ν ψ(x) − ε ∂ν ψ¯ (x) + ε L (x) ∂(∂µ ψ(x) ∂(∂µ ψ¯ (x) µν = εν T0 (x) where i ←−µ Tµν (x) = ψ¯ (x) γν ∂ µ − γν ∂ ψ(x) (5.43) 0 2 is the canonical energy-momentum tensor. Using the Dirac equation (∂/ + m)ψ(x) = 0, which implies that ←−2 0 = (−∂/ + m)(∂/ + m)ψ(x) = (−∂ 2 + m2)ψ(x) and also ψ¯ (x)(− ∂ + m2) = 0, It is easy to check that µν T0 (x) obeys the continuity equation i ←−2 ∂ Tµν (x) = ψ¯ (x) γν ∂ 2 − γν ∂ ψ(x) = 0 µ 0 2 . Another, alternative way to derive the Noether current is to begin with an infinitesimal translation, as we have just done, but to assume that the transformation parameter now depends on the coordinates, ε µ (x). The variation of the Lagrangian now has the form i ←−µ δL (x) = ∂ [ε µ (x)L (x)] − ∂ ε (x) ψ¯ (x) γν ∂ µ − γν ∂ ψ(x) µ µ ν 2 This equation makes no assumptions about the nature of ε µ (x), other than that it is infinitesimal. If we allow it to go to a constant four-vector, we recover the fact that the Lagrangian density transforms as a total derivative. Now, instead, we assume that ε µ (x) vanishes sufficiently rapidly at the boundaries of the integral so that boundary terms which are generated upon integration by parts can be ignored. Moreover, we assume 88 Chapter 5. The Dirac Equation that the Euler-Lagrange equations (i.e. the Dirac equation for ψ(x) and ψ¯ (x)) are obeyed. We recall that, if the Euler-Lagrange equations are obeyed, any such variation of the action must vanish. Then, we must have i ←−µ ∂ ψ¯ (x) γν ∂ µ − γν ∂ ψ(x) = 0 µ 2 This is just the conservation law for the Noether current that we obtained and confirmed above in our first derivation. This is a second route to Noether’s theorem. The space integral of the time-component of this energy-momentum tensor is the generator of space-time translations, Z i Z ←−0 Z P0 = d3xT00(x) = d3xψ†(x) ∂ 0 − ∂ ψ(x) = d3xψ†(x)h ψ(x), h = −γ0~γ ·~∇ − iγ0m 0 2 D D Z i Z ←−0 Pa = d3xT0a(x) = d3xψ†(x) γ0γa∂ 0 − γ0γa ∂ ψ(x) 0 2 1 Z = d3xψ†(x) γ0γah + h γ0γaψ(x) 2 D D Z = d3xψ†(x)(−i∇a)ψ(x) In both of the equation above, we have eliminated the time derivatives from the integrand by using the Dirac 0 equation. This means replacing i∂ ψ by hDψ where hD is the single-particle Dirac Hamiltonian defined in equation (5.5). The result is that P0 is the Dirac field Hamiltonian and we recognize Pa as the Dirac field linear momentum. We note that the four-divergence of the energy-momentum tensor on its second index also vanishes, i ←−µ i ←− ←−←−µ ∂ Tµν (x) = ψ¯ (x) ∂∂/ µ − ∂/ ∂ ψ(x) + ψ¯ (x) ∂/ ∂ µ − ∂/ ∂ ψ(x) = 0 ν 0 2 2 ←− where we have used the Dirac equation ∂ψ/ (x) = −mψ(x) and ψ¯ (x) ∂/ = ψ¯ (x)m. However, clearly, the energy-momentum tensor that we have found is not symmetric. It therefore cannot immediately be used to construct the Noether current for Lorentz transformations. The fact that its four-divergence vanishes for either of its indices means that it can be decomposed into a symmetric and anti-symmetric part 1 Tµν (x) = Tµν (x) + Tνµ (x) (5.44) 2 0 0 1 Tµν (x) = Tµν (x) − Tνµ (x) (5.45) A 2 0 0 and that both of these tensors are conserved, µν µν ∂µ T (x) = 0, ∂µ TA (x) = 0 This fact will be important to us shortly when we discuss improving this energy-momentum tensor. µ µ µ ν Now, consider an infinitesimal Lorentz transformation, x → x + ω ν x where the transformation of the Dirac field is given in equations (5.31a) and (5.31b), which we copy here 1 δψ(x) = ω xµ ∂ ν + [γ µ ,γν ] ψ(x) µν 8 ←−ν 1 δψ¯ (x) = ψ¯ (x) xµ ∂ − [γ µ ,γν ] ω 8 µν Under this transformation, the Lagrangian density varies as µ ν ν µ δL (x) = ωµν x ∂ L (x) = ∂ ωµν x L (x) The fact that the Lagrangian density varies by a total derivative term means that the Lorentz transformation is a symmetry of the theory. Of course, we already knew that this should be the case, since we have already demonstrated that the equation of motion has this symmetry. 5.4 The Energy-Momentum Tensor of the Dirac Field 89 To find the Noether current, let us assume that the transformation parameter depends on the coordinates, ωµν (x). The variation of the Lagrangian now has the form ν µ δL (x) = ∂ ωµν (x)x L (x) i 1 ←−σ 1 − ∂ ω ψ¯ (x) γλ xρ ∂ σ + [γρ ,γσ ] − ∂ xρ − [γρ ,γσ ] γλ ψ(x) λ ρσ 2 8 8 Now, if ωµν (x) vanishes sufficiently rapidly at the boundaries of the integral so that boundary terms can be ignored, and if we assume that the equations of motion are obeyed, the action must vanish for any variation, in particular, with any profile of ωρσ (x). Then, we must have λρσ ∂λ M (x) = 0 (5.46) where the Noether current is given by i n o Mλρσ (x) = Tλσ (x)xρ − Tλρ (x)xσ + ψ¯ (x) γλ ,[γρ ,γσ ] ψ(x) (5.47) 0 0 16 µν where T0 (x) is the energy-momentum tensor which was associated with translations. We could also find this current by using the more conventional Noether theorem. The conservation law (5.46) is a result of µν Noether’s theorem. Doing the derivatives explicitly, and remembering that ∂µ T0 (x) = 0, we get the identity i n o Tσρ (x) − Tρσ (x) = ∂ ψ¯ (x) γλ ,[γρ ,γσ ] ψ(x) (5.48) 0 0 λ 16 µν This is an equation for the anti-symmetric part of T0 (x), which states that it is given by the four-divergence i λ ρ σ of 16 ψ¯ (x) γ ,[γ ,γ ] ψ(x). The latter quantity is clearly anti-symmetric in the indices ρ and σ. We can use the algebraic properties of the gamma-matrices to see that the combination γλ [γρ ,γσ ] is equal to zero unless the three indices λ,ρ,σ are all different. Therefore γλ [γρ ,γσ ] = −4iελρσν γ5γν where ελρσν is the totally anti-symmetric tensor with ε0123 = 1. µν µν Then, recalling that we can write T0 (x) as its symmetric part T (x) plus its anti-symmetric part which we have found above, 1 Tρσ (x) = Tρσ (x) + ∂ ελρσν ψ¯ (x)γ5γν ψ(x) (5.49) 0 λ 4 Moreover 1 Mλρσ (x) = Mλρσ (x) + ∂ εγλσν xρ − εγλρν xσ ψ¯ (x)γ5γν ψ(x) . (5.50) 0 0 γ 4 Mλρσ (x) = Tλσ (x)xρ − Tλρ (x)xσ labelm (5.51) Now, we see that, for the improved energy momentum tensor, we could simply use its symmetric part i ←− ←− Tµν (x) = ψ¯ (x)[γ µ ∂ ν + γν ∂ µ − γ µ ∂ ν − γν ∂ µ ]ψ(x) , ∂ Tνµ (x) = 0 (5.52) 4 λ It obeys νµ ∂λ T (x) = and the spatial integral of the energy-momentum charge density Z Z Z 3 0µ 3 0µ 3 a0µc 5 c d xT0 (x) = d xT (x) + d x∇aε ψ¯ (x)γ γ ψ(x) R 3 a0µc 5 c where Gauss’ theorem could be used to write d x∇aε ψ¯ (x)γ γ ψ(x) as a surface integral on the sphere at infinity, where we could assume that the integrand vanishes sufficiently rapidly that the surface integral is zero. Then we conclude that Z Z µ 3 0µ 3 0µ P = d xT0 (x) = d xT (x) 90 Chapter 5. The Dirac Equation In addition Z Mρσ = d3xM0ρσ (x) and, we can us the improved energy-momentum tensor Tµν (x) as the energy-momentum tensor whose inte- grals give the current and charge densities and as well, the current associated with the Lorentz transformation can be constructed from this improved energy-momentum tensor as is shown in equation (??). There are good reasons why it is convenient to have a symmetric energy-momentum tensor. By modifying Tρσ (x) to make it symmetric, we will be able to unify the generator of translations and Lorentz transformations. If we recall that a space-time symmetry is a coordinate transformation which is generated by a Killing vector fˆµ (x), we might make a candidate for a conserved current by contracting the energy- momentum tensor with the vector field which generates the co-ordinate transformation, µ µ ν T f (x) ≡ T ν (x) f (x) Then, to have a conservation law, we need µ ∂µ T f (x) = 0 With a vector field f µ (x), we will have such a conservation law if: 1. Tµν (x) is conserved, i.e. ∂ Tµν (x) = 0. Then ∂ Tµ (x) = 0 if fˆµ = aµ , a constant vector. A µ µ fˆ translation invariant field theory should have a conserved energy-momentum tensor. A field theory can be translation invariant without being Lorentz invariant. It would still have a conserved energy- momentum tensor. We know an example from our study of non-relativistic many particle theory. However, if the theory is not Lorentz invariant, it should not be expected to have a symmetric energy-momentum tensor. 2. Tµν (x) is conserved and Tµν (x) = Tνµ (x) is symmetric. Then Tµ (x) is conserved when fˆµ (x) obeys fˆ the Killing equation ∂µ fˆν (x) + ∂ν fˆµ (x) = 0 A conserved, symmetric energy-momentum tensor can thus be used to generate all of the symmetries of Minkowski space. A translation and Lorentz invariant field theory should have a conserved and symmetric energy-momentum tensor. µν µ µ 3. Finally, T (x) is conserved, symmetric and has vanishing trace, T µ (x) = 0. Then T f (x) is conserved when f µ (x) satisfies the conformal Killing equation η ∂ f (x) + ∂ f (x) − µν ∂ f λ (x) = 0 µ ν ν µ 2 λ and it generates a conformal transformation. Notice that all solutions of the Killing equation are also solutions of the conformal Killing equation. However, the conformal Killing equation is less restrictive. It has more solutions than the Killing equation. The extra solutions correspond to conformal transformations. A conformal field theory should have a conserved, symmetric and traceless energy- momentum tensor. Our example of the non-interacting Dirac field is a conformal field theory when m = 0. 5.5 Summary of this chapter The Dirac theory of spinor fields ψ(x) and ψ¯ (x) is described by an action Z S[ψ,ψ¯ ] = dxL (x) which is a space-time volume integral of the Lagrangian density 1−→ 1←− L (x) = −iψ¯ (x) ∂/ − ∂/ + m ψ(x) 2 2 5.5 Summary of this chapter 91 Applying the Euler-Lagrange equation to the Lagrangian density yields the Dirac equation h ←− i ∂/ + mψ(x) = 0, ψ¯ (x) − ∂/ + m = 0 µ µ ν µν The Dirac matrices γ are 4×4 and they obey {γ ,γ } = 2η with ηµν the metric of Minkowski space-time. The equal time anti-commutation relations are n †b o 0 0 b 4 ψa(x),ψ (y) δ(x − y ) = δa δ (x − y) 0 0 n †a †b o 0 0 {ψa(x),ψb(y)}δ(x − y ) = 0 , ψ (x),ψ (y) δ(x − y ) = 0 The Dirac theory has a phase symmetry which results in the conserved Noether current corresponding to particle number, µ µ jV (x) = −ψ¯ (x)γ ψ(x) The Dirac theory is Poincare invariant. The Noether currents associated with this symmetry, Tµ (x) = µν −T (x) fν (x), can be formed from the appropriate Killing vectors fµ (x) and the symmetric, conserved energy-momentum tensor i −→ −→ ←− ←− Tµν (x) = ψ¯ (x)[γ µ ∂ ν + γν ∂ µ − γ µ ∂ ν − γν ∂ µ ]ψ(x) 4 With the explicit representation 0 σ i 0 1 γi = , γ0 = (5.53) σ i 0 −1 0 and the ortho-normal helicity eigenvectors, ~σ ·~ku+ = |~k|u+ ~σ ·~ku− = −|~k|u− the Dirac field is q q Z i~k·~x−iEt i 1 − |~k|/Eu i 1 + |~k|/Eu 3 e + ~ − ~ ψ(x) = d k √ 3 q a+(k) + q a−(k) 2(2π) 2 1 + |~k|/Eu+ 1 − |~k|/Eu− q q Z −i~k·~x+iEt i 1 − |~k|/Eu i 1 + |~k|/Eu 3 e + † ~ − † ~ + d k √ 3 q b+(k) + q b−(k) 2(2π) 2 − 1 + |~k|/Eu+ − 1 − |~k|/Eu− p where E(~k) = ~k2 + m2 and the non-vanishing anti-commutation relations are n ~ † ~ 0 o ~ ~ 0 n ~ † ~ 0 o ~ ~ 0 a+(k),a+(k ) = δ(k − k ) a−(k),a−(k ) = δ(k − k ) n ~ † ~ 0 o ~ ~ 0 n ~ † ~ 0 o ~ ~ 0 b+(k),b+(k ) = δ(k − k ) b+(k),b+(k ) = δ(k − k ) The vacuum is defined by a±(k)|O >= 0 , b±(k)|O >= 0 ∀k,± † † and particle and anti-particle states are created by a±(k) and b±(k), respectively. The Noether charges for the phase and space-time translation symmetries are given by Z Z 3 0 3 † † N = d xjV (x) = d k ∑ as (k)as(k) − bs (k)bs(k) s=± Z Z q 3 00 3 ~ 2 2 † † H = − d xT (x) = d k ∑ k + m as (k)as(k) + bs (k)bs(k) s=± Z Z a 3 0i 3 a † † P = − d xT (x) = d k ∑ k as (k)as(k) + bs (k)bs(k) s=± 6. Photons We must now turn from our treatment of non-relativistic and relativistic Fermions to a Bosonic degree of freedom, the electromagnetic field, whose physical manifestation is familiar to us as the electric field ~E(x) and the magnetic field ~B(x). These are both spatial three-vector fields. They appear in abundance in the physical world. We wish to study how these arise from a quantum theory. To do so, we begin by remembering that the low energy states of weakly interacting quantum Fermi and Bose gases are very different. Fermions have a Fermi surface, particles and holes and we have made use of these concepts to construct quantum field theories of non-relativistic Fermions and their relativistic analog, the Dirac theory. Bose gases, on the other hand, exhibit a Bose-Einstein condensate. The condensate was described by a classical part of the quantum field, in terms of our non-relativistic Bose field, we had ψ(~x,t) = η(~x,t) + ψ˜ (~x,t) where ψ(~x,t) and ψ˜ (~x,t) were quantized fields obeying equal time commutation relations and η(~x,t) was a classical field. Moreover, when quantum fluctuations are small, the classical part of the Bose field obeys the same field equation as the quantum field. For quantum electrodynamics, we can invert this logic. We know the field equations that classical electric and magnetic fields must obey, they are Maxwell’s equations. If these classical fields are the classical parts of quantum fields, and the quantum field theory is weakly coupled, in that the effects of quantum corrections are a small, we might expect that the quantum field theory is simply a theory where the quantized electric and magnetic fields have Maxwell’s equations as their field equations. This expectation will turn out to be correct. In fact, Maxwell’s equations as the field equations for the quantum theory turns out to be the only mathematically consistent formulation of quantum electrodynamics in four space-time dimensions. This is a result of symmetry and the requirement of renormalizability, which we shall learn about later in this chapter, and in later chapters, respectively. 6.1 Relativistic Classical Electrodynamics Classical electrodynamics is governed by Maxwell’s equations which are partial differential equations for the electric and magnetic fields, ~E and ~B, respectively, ~∇ ·~E(~x,t) = ρ(~x,t) (6.1) −~E˙(~x,t) +~∇ ×~B(~x,t) =~j(~x,t) (6.2) ~∇ ·~B(~x,t) = 0 (6.3) ~B˙(~x,t) +~∇ ×~E(~x,t) = 0 (6.4) 94 Chapter 6. Photons We are working in a system of units where the constants ε0 and µ0 that sometimes appear in these equations are set equal to one. (This requires that the speed of light be set equal to one.) We are quoting Maxwell’s equations with sources, a charge density rho(~x,t) and a current density~j(~x,t) which we will leave unspecified for now. Maxwell’s equations are internally consistent only when the charge and current densities satisfy the continuity equation ∂ ρ(~x,t) +~∇ ·~j(~x,t) = 0 (6.5) ∂t To put Maxwell’s equations into relativistic notation, we identify the electric and magnetic fields with the components of an anti-symmetric two-index tensor field, F µν (x) as 0i Ei(~x,t) ≡ F (6.6) i jk i j ε Bk(~x,t) ≡ F (x) (6.7) and the charge and current densities as a four-current ρ(~x,t),~j(~x,t) = j0(~x,t),~j(~x,t) ≡ jµ (x) (6.8) and the continuity equation is µ ∂µ j (x) = 0 (6.9) With this notation, Maxwell’s equations become µν µ ∂ν F (x) = j (x) (6.10a) ∂µ Fνλ (x) + ∂ν Fλ µ (x) + ∂λ Fµν (x) = 0 (6.10b) These are the relativistic form of Maxwell’s equations. They are the field equation of classical electrodynam- ics. 6.2 Covariant quantization of the photon In this section, we will outline a scheme for identifying the correct quantum field theory of the photon and then, in maintaining explicit Lorentz covariance in solving the quantum field theory. 6.2.1 Field equations and commutation relations Equations (6.10a) and (6.10b) are the field equations of classical electrodynamics and they are also the equations that we expect that the quantized electromagnetic fields must obey. However, we still do not have a guide to determining the operator nature of the fields. The operator nature of the fields is defined by the commutation relations which we have yet to find. Our strategy for finding commutation relations will be to construct a Lagrangian density from which the field equations can be derived by using a variational principle. Then we will deduce the commutation relations by examining the time derivative terms in the Lagrangian density. Finding a Lagrangian density requires an important preliminary step which involves identifying the appropriate dynamical variable. This is the variable which should be varied when we use the variational principle to find the field equation. This variable turns out to be the vector potential field Aµ (x). It is introduced to solve equation (6.10b). Consider the ansatz Fµν (x) = ∂µ Aν (x) − ∂ν Aµ (x) (6.11) By plugging it into equation (6.10b) we can see that equation is satisfied identically for any Aµ (x). Then, equation (6.10a) becomes an equation for Aµ (x), it is