Detemining low-energy constants in χPT from decays of decuplet baryons
Måns Holmberg
Master Thesis in Physics Department of Physics and Astronomy Uppsala University
Supervisor: Stefan Leupold Abstract
Since quantum chromodynamics is non-perturbative at low energies, we cannot use conventional perturbative methods. Instead, we turn to chiral perturbation theory to restore predictability at low energies. At next-to-leading order for baryons, the low-energy constants cM and cE, related to the magnetic and electric decuplet- to-octet transitions moments, are unknown. We determine the value of cM by fitting the calculated decay widths of radiative two-body decays of decuplet baryons to data, enabling us to predict the branching ratios of Σ∗0 → Σ0γ ∗0 0 − 0 − and Ξ → Ξ γ. The value of cE is similarly determined using the Ω → Ξ e ν¯e decay, with less precision, however, due to experimental uncertainty. Using our values of cM and cE, we predict the branching ratios of 20 radiative three-body decays of decuplet baryons. In the future, experimental facilities such as the Facility for Anti-proton and Ion Research in Germany will allow us to confirm our predictions and to pin down more accurately the low-energy constants for baryons at next-to-leading order, thus, helping us to further probe the non-perturbative regime of quantum chromodynamics at low energies. Populärvetenskaplig sammanfattning på svenska
Elementarpartiklar, så som elektroner och kvarkar, beskrivs av den så kallade standardmodellen – en kvantfältteori som kombinerar de tre starkaste fundamentala krafterna: elektromagnetism, den svaga växelverkan och den starka växelverkan. För att beskriva verkligheten med standardmodellen används störningsteori, ett kraftfullt matematiskt verktyg som bygger på antagandet att partiklar interagerar svagt med varandra. Men, vid låga energier bryter den starka växelverkan mot detta antagande, vilket gör att störningsteori blir oanvändbart. Problemet uppstår då kvarkar och gluoner binder samman för att bilda större partiklar, så kallade hadroner. Detta gör att vi inte kan använda kvantkromodynamik, den delen ar standardmodellen som hanterar den starka växelverkan, för att beskriva hadroner. Istället kan vi använda kiral störningsteori, en effektiv kvantfältteori som approximerar kvantkromodynamik vid låga energier på ett sätt som återställer vår förmåga att använda störningsteori. Kiral störningsteori konstrueras utefter kvantkromodymanikens symmetrier när de tre lättaste kvarkarnas massor går mott noll, där man skriver ner alla relevanta termer som uppfyller dessa symmetrier upp till en viss ordnings noggrannhet. Denna ordning beskriver hur viktiga alla termer är och gör att man i praktiken kan arbeta med ett ändligt antal termer. I detta arbete använder vi oss av kiral störningsteori för baryoner, vilket är partiklar som består av tre kvarkar, till andra ordningens noggrannhet. Termerna i kiral störningsteori är proportionella mot obestämda konstanter, så kallade lågenergikonstanter. För att beskriva fysiska fenomen måste vi därför först bestämma dessa lågenergikonstanter på något sätt. Vi gör detta genom att jämföra experimentell data med våra egna beräkningar av olika sönderfallsprocesser. Vid andra ordningens noggrannhet för baryoner finns två lågenergikonstanter som ännu inte är bestämda. Vårt mål är därför att göra just detta, med hjälp av att studera sönderfall av exciterade hyperoner. Hyperoner är en typ av baryoner som är uppbyggda av uppkvarkar, nerkvarkar och särkvarkar, med andra ord: de tre lättaste kvarkarna. Genom att jämföra två- och tre-kroppssönderfall av exciterade hyperoner med befintlig data kan vi bestämma de två tidigare okända lågenergikonstanterna, något som möjliggör oss att förutspå sönderfallshastigheterna av hela 22 okända sönderfallsprocesser. I framtiden kommer experiment, så som Facility for Anti-proton and Ion Research i Tyskland att kunna bekräfta våra förutspådda sönderfallsprocesser och därmed hjälpa oss att med bättre noggrannhet bestämma andra ordningens lågenergikonstanter för baryoner. Anledningen varför vi studerar hadronfysik är för att utöka vår förståelse av det problematiska lågenergiområdet inom kvantkromodynamik, vilket vi effektivt kan göra med hjälp av kiral störningsteori. Contents
1 Introduction 3 1.1 Outline ...... 5
2 Background 6 2.1 Symmetries – Lie groups and Lie algebras ...... 6 2.1.1 Lie groups ...... 6 2.1.2 Lie algebras ...... 8 2.1.3 Relating Lie groups and Lie algebras ...... 11 2.1.4 Making real Lie algebras complex ...... 12 2.1.5 Group actions and applications ...... 13 2.1.6 Representations ...... 16 2.2 The Lorentz group ...... 19 2.3 Representations of the Lorentz group ...... 22 2.4 Gamma matrices ...... 24 2.5 Spinors and Parity ...... 26 2.6 The special unitary group (under construction) ...... 26 2.7 Representations of SU(3) (under construction) ...... 28 2.8 Introduction to Quantum Chromodynamics ...... 28 2.8.1 Noether’s theorem ...... 32 2.8.2 Exact symmetries of QCD ...... 35 2.8.3 Approximate symmetries of QCD ...... 36 2.8.4 Chiral symmetry (under construction) ...... 38 2.9 Spontaneous symmetry breaking (under construction) ...... 39 2.10 Chiral perturbation theory (under construction) ...... 41
3 Methods and calculations 42 3.1 Decay processes and kinematics ...... 43
1 Måns Holmberg Master Thesis
3.2 NLO Chiral Lagrangian for baryons (under construction) ...... 47 3.3 Matrix elements ...... 50 3.3.1 Flavor factors ...... 51 3.3.2 B∗(J = 3/2) → Bπ ...... 53 3.3.3 B∗(J = 3/2) → Bγ ...... 55 3.3.4 B∗(J = 3/2) → Bπγ ...... 57 ∗ − 3.3.5 B (J = 3/2) → Be ν¯e ...... 62
4 Results and discussion 68 ∗ 4.1 Determining hA using B (J = 3/2) → Bπ at LO ...... 68 ∗ 4.2 Determining cM using B (J = 3/2) → Bγ at NLO ...... 69 ∗ − 4.3 Determining cE using B (J = 3/2) → Be ν¯e at NLO ...... 70 4.4 NLO predictions of BR(B∗(J = 3/2) → Bπγ)...... 72
4.5 The importance of cE at NLO ...... 76
5 Conclusion and future directions 78
Appendix A Properties of spin-3/2 fermions (under construction) 80
Appendix B Vertices (under construction) 81 B.1 TBΦ ...... 82 B.2 TBΦγ ...... 82 B.3 BBΦ ...... 83 B.4 TT Φ ...... 84 B.5 ΦΦγ ...... 84 B.6 BBγ ...... 85 B.7 T T γ ...... 87 B.8 T Bγ ...... 88
Appendix C Calculation of the matrix element of B∗(J = 3/2) → Bγ 89
Appendix D Electromagnetic current conservation of B∗(J = 3/2) → Bπγ 93
6 Bibliography 98
2 Chapter 1
Introduction
One of the main research challenges of modern particle physics [1] is to understand the confinement regime of quantum chromodynamics (QCD), where perturbation theory, the primary mathematical tool at hand, breaks down. This confinement regime occurs at low energies, where the expansion parameter, the QCD coupling constant, is no longer small. Confinement means that quarks do not exist as free entities but, instead, that they are confined to bound states, called hadrons. Studying hadrons is therefore crucial to understand this energy regime. Hadrons of specific interest, due to increasing experimental activities, are the hyperons – being baryons containing strange, up and down quarks. Studying the decays of (excited) hyperons will allow us to learn more about the intrinsic structure of hadrons and thus probe parts of the non-perturbative confinement regime. At very low energies, chiral perturbation theory (χPT) [2, 3, 4, 5, 6] provides a systematic improvable effective field theory (EFT) to describe and predict QCD phenomena. Since the three lightest quark masses are light (< 100 MeV) on a hadronic scale (∼ 1 GeV), the limit of vanishing (up, down,
strange) quark masses is a good approximation to QCD. In this limit, SU(3)L × SU(3)R×U(1)V chiral symmetry arises, in which the left- and right-handed parts of the Dirac fermion fields (describing quarks) transform independently. This chiral symmetry appears to be spontaneously broken, giving rise to the pseudoscalar meson octet (being pions, kaons, and the η meson), as Goldstone bosons1. The perturbative expansion is carried out in powers of (three-)momenta and masses of the Goldstone bosons, denoted p. The expansion breaks down if the Goldstone momenta become too large, which corresponds to an energy scale were not considered degrees of freedom become active, i.e., the effects of individual quarks become important. In χPT, being an EFT, one writes down all possible terms consistent with the symmetries of QCD in the limit of massless quarks. However, there are an infinite amount of terms that fulfills this
1Since quarks are only approximately massless, chiral symmetry (or more specifically, flavor symmetry) is not exact. This gives rise to Pseudo-Goldstone bosons, since the pseudoscalar meson octet states acquire a small mass proportional to the square root of the quark masses [5].
3 Måns Holmberg Master Thesis
requirement. We herefore construct a power-counting scheme to organize the large number of terms according to their importance. Terms which give rise to lower powers of p are more important, since we expand around small p. This power-counting scheme allows us to omit higher order corrections, giving us a finite amount of terms that we can deal with. Each interaction term of this perturbative expansion (in p) has an unknown coupling constant (from a pure χPT standpoint), that in principle could be determined from the full theory of QCD; but here we run out of luck, since this would require knowledge about the non-perturbative confinement regime of QCD which we wished to probe in the first place. These coupling constant encode information about resonances that come into play at higher energies; therefore, we call them low-energy constants (LEC:s). In practice, there are two approaches we can use to determine LEC:s. We either can determine them by fitting calculated decay widths (from χPT) to experimental data, or we can do it by direct numerical calculations; obtained by putting QCD on a discretized spacetime-lattice (so-called lattice QCD) [7]. we will adopt the first approach. In this work, we are going use the complete and minimal relativistic Lagrangian at next-to-leading order (NLO) for three-flavor χPT in the presence of baryon octet states and baryon decuplet states, constructed in [8]. Different representations of the flavor symmetry group (SU(3)) group give rise to different hadronic multiplets. In particular, the lowest-lying spin-1/2 and spin-3/2 hyperons are contained in the octet and decuplet representations, respectively. At leading order (LO) most LEC:s
are already well known, apart from the LEC called HA, that gives rise to the coupling of a Goldstone boson to the decuplet states, since we do not have simple observables to pin it down. Instead, one
can estimate the value of HA in the limit of a large number of colors (Nc) in QCD [9, 10, 11, 12]. Moreover, χPT deals with explicit symmetry breaking by the quark masses; however, at LO and NLO flavor symmetry is still exact. Since the strange quark is much heavier than that of the up and down quarks, we expect a more significant flavor symmetry breaking with increasing strangeness. Therefore, LEC:s at LO and NLO are expected to differ in value for a given process2, depending on the amount of strangeness involved. In order to quantify this flavor symmetry breaking at LO, we will determine ∗ 3 ∗ ∗ the LEC hA in the case of Ξ → Ξπ decays (one already know hA in the case of ∆ and Σ decays).
The hA interaction term gives rise to the coupling of a decuplet baryon to a pair of Goldstone boson and baryon octet state. At NLO, we know less about the various LEC:s, some of which are unknown. Because of this, we will determine the values of two unknown LEC:s; cM and cE, related to a magnetic transition moment and an axial vector electric transition moment of the decuplet-to-octet transitions, respectively. We ∗+ + ∗0 will determine cM by comparing the calculated decay widths of ∆ → Nγ, Σ → Σ γ, Σ → Λγ
2Flavor symmetry breaking occurs first at next-to-next-to-leading order (NNLO). Also, all LEC:s do have a precise value. The apparent splitting of LEC:s for different strangeness is caused by fitting to data, since nature does not stop at NLO as we do. 3Decuplet baryons, i.e., exited baryons, are denoted by ∗.
4 Måns Holmberg Master Thesis
with experimental results [13]. As in the case of hA, we expect to see a splitting of cM for different
strangeness (Deltas has S = 0 and Sigmas has S = 1), due to flavor breaking. To pin down cE, we will 0 − compare the calculated decay width of Ω → Ξ e ν¯e with data. Unfortunately, the measured branching 0 − ratio of Ω → Ξ e ν¯e have a large uncertainty, hence, limiting our knowledge of cE. Furthermore,
the authors of [14] do not include the cE interaction term in what they claim is a complete NLO Lagrangian; hence, they imply that this term should be included only at higher orders. In [8] we argue 0 − against this and in this work, we will show, by the example of Ω → Ξ e ν¯e, that the cE interaction term is important already at NLO. After determining the values of these LEC:s, we will be able to predict the decay widths of Σ∗0 → Σ0γ and Ξ∗0 → Ξ0γ, as well as of 20 radiative three-body decays. These three-body decays will
also allow us to determine cE more accurately, once experimental data is available. The future experiments at FAIR (Facility for Anti-proton and Ion Research) in Germany will help us study the two and three-body decays considered in this work; thus, promising a better understanding of hyperons, their interactions, and the corresponding LEC:s. The two experimental collaborations HADES (High Acceptance DiElectron Spectrometer) [15] and P¯ANDA (Antiproton ANnihilation at DArmstadt) [16] will both provide excellent opportunities to measure decays of excited hyperons. All of this, to further probe the confinement regime of QCD.
1.1 Outline
This thesis is structured as follows: chapter 2 contains the theoretical background and is divided into three main parts. The first part covers the mathematics of symmetries, the second part is an introduction to QCD, and the third part is about chiral perturbation theory for mesons and baryons. In chapter 3, we present the kinematics of decay processes and performs the calculations of the decay widths. The results and discussions are covered in chapter 4. In the end, chapter 5 summarizes the content of this thesis and provides suggestions for future directions. There are also four appendices which mostly cover explicit calculations too long for the main text. Appendix A contains the main properties of Rarita-Schwinger fields, i.e., spin-3/2 fermions. In appendix C we give the full calculation of the decay width of B∗(J = 3/2) → Bγ, i.e., the radiative two-body decay in section 3.3.3. Appendix B contains the derivations of most vertices used to calculate the decay widths. Finally, in appendix D we check the electromagnetic current conservation of B∗(J = 3/2) → Bπγ from section 3.3.4.
5 Chapter 2
Background
Here we cover a broad range of the theoretical background – from the mathematics of symmetries to chiral perturbation theory. In this chapter, we will introduce the most necessary information to understand topics such as continuous symmetries, spontaneous symmetry breaking, spinors, hadronic multiplets, the fundamentals of QCD and chiral perturbation theory for mesons and baryons. Section 2.1 about symmetries is a thorough introduction to Lie groups and Lie algebras and, as a consequence, is more mathematically oriented than the rest of the thesis.
2.1 Symmetries – Lie groups and Lie algebras
In general, the importance of symmetries in physics cannot be overstated. Symmetries are crucial in many fields of physics, especially when it comes to particle physics; where we mostly deal with continuous symmetries. The gauge symmetry of QCD and the approximate chiral symmetry are two examples of critical continuous symmetries. In general, continuous symmetries are described by Lie groups. Therefore, let us give an introduction to Lie groups and Lie algebras (used to understand Lie groups). We do this to finally understand chiral perturbation theory, which relies on the properties of the Lorentz group O(1, 3) and the SU(3) flavor group, as well as spontaneous symmetry breaking. The material covered in this section is not new, instead, it comes from several sources: The information about Lie groups, Lie algebras and their representations was mainly gathered from [17, 18], the information about group actions is from [18, 19, 20], and finally, the information about the mathematics of spontaneous symmetry breaking is from [21, 22].
2.1.1 Lie groups
A real Lie group G is a group and simultaneously a real smooth manifold, where the group product G × G → G and the inverse map G → G are both smooth. If G as a manifold is instead complex, and
6 Måns Holmberg Master Thesis
the group product and its inverse are complex analytic, then G is said to be a complex Lie group. We will restrict our attention to matrix Lie groups only, which are closed subgroups of the real or complex group of all n×n invertible matrices where the group product is given by matrix multiplication, i.e. the real or complex general linear group GL(n, R) or GL(n, C). Because of this, we are not going to write matrix Lie group explicitly. To begin with, let us define a few concepts of Lie groups that will prove to be useful: A Lie group G is said to be abelian if every two elements of the group commute, that is: [g, g0] = gg0 − g0g = 0 for all g, g0 ∈ G. Otherwise, the group is called non-abelian. A Lie group homomorphism between the Lie groups G and H is a continuous map Φ: G → H such that Φ(gg0) = Φ(g)Φ(g0) (2.1)
0 for all g, g ∈ G. Note that this implies that Φ(eG) = eH , where eG and eH are the identity elements of G and H (i.e., identity matrices), respectively. If Φ is a bijection it is called a Lie group isomorphism. If there exists a Lie group isomorphism between two Lie groups G and H, we call them isomorphic and write G =∼ H, which informally can be read as G is essentially the same as H. Isomorphic groups do not need to be distinguished as their group properties are the same. Two elements g, g0 of a Lie group G are said to be connected if there exists a continuous curve γ(t):[a, b] → G such that γ(a) = g and γ(b) = g0. A Lie group G itself is said to be connected (or path connected) if for every two elements g, g0 ∈ G there exists such a continuous curve. Moreover, a group is said to be simply connected if it is connected and if every closed path (loop) can be continuously shrunk down to a point (i.e. the manifold contains no holes). Let G and H be two Lie groups, then the Cartesian product G × H = {(g, h): g ∈ G and h ∈ H} together with the group multiplication (g, h) · (g0, h0) ≡ (gg0, hh0), for all g, g0 ∈ G and h, h0 ∈ H, is called the direct product of the two groups. The direct product of two Lie groups is a Lie group. Since G and H are two matrix Lie groups, i.e., being closed subgroups of GL(n, C) and GL(m, C), respectively; the Cartesian product G × H is a closes subgroup of GL(n + m, C). Dealing with global geometrical objects, such as Lie groups, can be hard, and it is often easier to deal with local (linearized) objects instead. This idea is key in the theory of Lie groups – rather than studying the group itself, one studies the linearized version of it, which is called the Lie algebra. In a sense, geometry is hard, and algebra is easy. Linearization of the Lie group about the identity produces a new set of elements that form the Lie algebra.
7 Måns Holmberg Master Thesis
2.1.2 Lie algebras
Consider a one-dimensional subgroup of a Lie group G, defined as a Lie group homomorphism γ : R → G (where R is a group under addition). From the requirement (2.1), which in this case translates to
γ(s + t) = γ(s)γ(t), it follows that γ(0) = eG = I. Think of a one-dimensional subgroup as an abstract curve in G where every two elements along the curve commute. Intuitively (and informally), this means that the curve is in some sense straight. Consider for instance, the group of rotations in 3 dimensions (i.e., SO(3)) and let R(θ) be the rotation of an angle θ around the vector nˆ, then, clearly R(θ + φ) = R(θ)R(φ); because rotating around nˆ with an angle θ and then by another angle φ is the same as doing the combined rotation at once (this is just saying that rotations in 2 dimensions commute). Therefore, R is the one-parameter subgroup of the rotations around nˆ. The key point here is to realize that all information about R is completely specified by nˆ; it sounds like R satisfies some differential equation with nˆ as the initial condition. Going back to the general case, we can construct a differential equation precisely like that. Let us solve γ(s + t) = γ(s)γ(t) by taking the derivative with respect to s on both sides and setting s = 0 afterwards. Doing this yields the following differential equation:
d d γ(s + t)| = γ(s)γ(t)| ⇔ γ0(t) = γ0(0)γ(t) . (2.2) ds s=0 ds s=0 with the unique solution (with γ(0) = I as initial condition) given by
γ(t) = exp γ0(0)t , (2.3) where exp is the regular matrix exponential. Analogously to the previous example, γ is now fully determined by γ0(0) – the initial condition. We could say that γ0(0) generates all elements of γ(t), because for each initial condition γ0(0), there exists a unique solution in some interval of t (if γ and γ0 are continuous). Furthermore, notice that γ0(0) is not an element of the group1, but rather it is part of 0 0 the tangent space of G at e, given by all possible g (0), denoted TeG. We say that γ (0)t is an element
of the Lie algebra g which coincides with TeG as a vector space, hence, informally
Lie group = exp (Lie algebra) . (2.4)
a Every element x ∈ TeG = g can be written as x = α Xa, where a = 1, ..., N, and where {Xa} =
{X1, ..., XN } is some basis of g. The sum over repeated indices is implicit. An arbitrary element of a
1E.g., a tangent line to a circle at some point is not part of the circle itself.
8 Måns Holmberg Master Thesis
2 a Lie group G that is connected to the identity can thus be expressed as g(~α) = exp(α Xa) ∈ G, where the components of the basis are given by
∂ Xa = a g(α) . (2.5) ∂α α=0
These are called the (infinitesimal) generators of the group, because they are all the possible initial directions (in an abstract sense) at identity, whose one-parameter subgroups reach all elements that are connected to identity. In physics, one usually defines the generators with an extra i, such that an a element of the group is g(~α) = exp(−iα Ya), i.e., Ya = iXa. For now, we will not adopt this convention. So far so good, however, the linear structure of g alone is quite uninformative, except for encoding information about the dimension of the manifold (dim(G) = N). Moreover, to justify calling the Lie algebra an algebra, we need a bilinear form that defines the product of two elements. We want this product to give us information about the multiplication of group elements. Let g, g0 ∈ G be connected to the identity, hence, we can write g = eX and g0 = eY where X,Y ∈ g. Then, in general, gg0 = eZ 6= eX+Y ; instead, the argument Z is given by the Baker-Campbell-Hausdorff formula (described in section 5.6 in [17])
1 1 Z = X + Y + [X,Y ] + ([X, [X,Y ]] + [Y, [X,Y ]]) + ··· . (2.6) 2 12
The corresponding element of the Lie algebra of the product of two group elements is thus expressed in terms of nested commutators. All one need to express Z to an arbitrary precision is therefore knowledge about the commutation relation. Consequently, the commutator encodes structure about the group multiplication and is precisely the bilinear form we seek. Note that elements of the Lie algebra only commute if and only if elements of the Lie group commute. A Lie algebra is also called abelian if its elements commute with each other. Now for the more abstract definition of a Lie algebra: A Lie algebra g is a real or complex vector space equipped with an antisymmetric bilinear form [ · , · ]: g × g → g that satisfies the Jacobi identity (2.7). The map [ · , · ] is called the Lie bracket (or Lie product). A bilinear form on some vector space V is a map V × V → V that is linear in each argument separately. Moreover, the antisymmetric property ([X,Y ] = −[Y,X] ∀ X,Y ∈ g) of the Lie bracket directly follows from the definition of the commutator. Note that in general, the Lie bracket does not have to be defined as the commutator (because multiplication might not be defined); however, when restricting our attention to matrix Lie groups only, the Lie bracket is always given by the commutator
2If not, there is no way to reach this group element from the Lie algebra; hence, (2.4) does not always come with an equal sign.
9 Måns Holmberg Master Thesis
(Hall). The Jacobi identity is
[X, [Y,Z]] + [Z, [X,Y ]] + [Y, [Z,X]] = 0 , for all X,Y,Z ∈ g , (2.7) which holds even for Lie brackets of non-matrix Lie groups. Because the Lie bracket is closed (a
fun exercise) we are able to write the Lie bracket of two basis components Xa and Xb as a linear combination: c [Xa ,Xb] = fab Xc , (2.8)
c where fab are real or complex numbers called structure constants, that obviously are antisymmetric in a, b. Upper and lower indices, corresponding to the components of vectors and dual vectors (covectors), respectively, do not have to be distinguished if the metric tensor (of the corresponding manifold) is positive definite; e.g., as in the case of a Euclidian metric. A counterexample is the Lorentzian metric, which is not positive definite. Note that if we simultaneously multiply the structure constants and the generators by a constant the algebra itself does not change, hence there is a freedom of normalization3. Let us now give two examples of some important Lie algebras. Consider the group GL(n, C) = {g ∈ M(n, C) : det g 6= 0}, where M(n, C) is the set of all n × n complex invertible matrices. Every group X X tr(X) element can be written as g = e ∈ G, where X ∈ gl(n, C), with the condition 0 6= det(e ) = e . However, because the exponential function is always nonzero, this condition does not restrict X in any way, meaning that gl(n, C) = M(n, C) is the associated Lie algebra, with the commutator as bracket. Moreover, consider the special linear group SL(n, C) = {g ∈ M(n, C) : det g = 1}, then, using the same argument, it easy to see that sl(n, C) = {g ∈ M(n, C): tr (g) = 0}. Furthermore, here are some useful definitions regarding Lie algebras: Similar to Lie groups, we can have a continuous map between Lie algebras. A Lie algebra homomorphism between Lie algebras is a continuous map φ : g → h such that
φ ([X,Y ]g) = [φ(X) , φ(Y )]h (2.9)
for all X,Y ∈ g. Again, if φ is bijective with a continuous inverse, we call it a Lie algebra isomorphism, denoted g =∼ h. Let h and g be two Lie algebras, then h is called a Lie subalgebra if it is a vector subspace of g that is closed under the Lie bracket, i.e [X,Y ] ∈ h for all X,Y ∈ h. For example, all real traceless matrices 4 form a Lie subalgebra of GL(n, R) . Let U and V be two real or complex vector spaces, then the direct sum of U and V is defined as
3The normalization can include complex numbers without making the Lie algebra complex. The dimension has to change for it to be truly complex. 4Because tr ([X,Y ]) = tr (XY ) − tr (YX) = 0.
10 Måns Holmberg Master Thesis
the set (given by the Cartesian product)
U ⊕ V = {(X,Y ): X ∈ U,Y ∈ V } , (2.10)
equipped with component-wise addition and scalar multiplication. Note that dim(U ⊕ V ) = dim(U) +
dim(V ). The same can be done with Lie algebras as vector spaces, i.e. g1 ⊕ g2 is defined as (2.10), but with the additional Lie bracket
0 0 0 0 [(X,Y ) , (X ,Y )]g1⊕g2 ≡ ([X,X ]g1 , [Y,Y ]g2 ) (2.11)
0 0 for all X,X ∈ g1 and all Y,Y ∈ g2. The direct sum of two Lie algebras is again a Lie algebra. Elements
of g1 and g2 (i.e., (X, 0) and (0,Y )) have intersection (0, 0) and commute with each other according to (2.11). Conversely, if we have two subalgebras a and b of some Lie algebra g that span g as a vector space, have intersection 0 and where [X,Y ] = 0, for all X ∈ a and Y ∈ b; then (X,Y ) → X + Y is an isomorphism of a ⊕ b to g.
2.1.3 Relating Lie groups and Lie algebras
For completeness, we ask ourselves if every (finite-dimensional and real) Lie algebra is the corresponding Lie algebra of some matrix Lie group. Lie’s third theorem (theorem 5.25 in [17]) says that this is the case! We will not use this fact in this thesis, but it is nice to know that whenever you find yourself with a closed commutation relation; there is a corresponding symmetry described by a matrix Lie group. Furthermore, how do Lie group homomorphisms relate to Lie algebra homomorphisms? That is, if we have a Lie group homomorphism Φ: G → H, is there a corresponding Lie algebra homomorphism between g and h (the Lie algebras of G and H)? The answer is yes (theorem 3.28 in [17]). There exists a unique linear map φ : g → h such that Φ(eX ) = eφ(X), where φ satisfies (2.9). Naturally, φ is the
derivative of Φ at identity, since φ is the map TIG G → TIH H = TΦ(IG)H, that is
d tX φ(X) = dΦIG (X) = Φ(e ) , (2.12) dt t=0
for all X ∈ g. On the contrary, the reverse (harder) question is more interesting, as we would like to study Lie algebras instead of Lie groups. Do all Lie algebra homomorphism give rise to a Lie group homomorphism? Unfortunately, not in general, but under the additional requirement that G is simply connected, the converse is true and is also given by Φ(eX ) = eφ(X) (theorem 5.6 in [17]). An essential corollary of this theorem is that if G and H are two simply connected Lie groups with Lie algebras g and h, respectively; then if g =∼ h, we necessarily have G =∼ H (corollary 5.7 in [17]). Also, let G ⊂ GL(n, C) and H ⊂ GL(m, C) be two matrix Lie groups with Lie algebras g and h,
11 Måns Holmberg Master Thesis
respectively. Then, it is not hard to show that the Lie algebra of G × H ⊂ GL(n + m, C) is isomorphic to g ⊕ h (exercise 3.9 in section 5 in [17]).
2.1.4 Making real Lie algebras complex
Let us consider changing the field of a Lie algebra from R to C. This is motivated by the fact that it is often easier to classify the representations of some real Lie algebra by first making it complex. In particular, we will see this first hand when dealing with representations of the Lorentz group. From a mathematical point of view, this is further motivated by the fact that C is algebraically closed, whereas R is not. The complexification or complex extension of a real vector space V is
VC = {X + iY : X,Y ∈ V } , with i(X + iY ) = −Y + iX , (2.13) which is isomorphic to V ⊕ iV (but the notation (X,Y ) is a bit clumsy compared to X + iY ). We would like to stress that this is nothing more complicated than when one constructs the complex plane
from a pair of real numbers, that is, when V = R, we obtain VC = C. Given a Lie algebra g, the
complexification gC defined as above (as a vector space), has the unique Lie bracket extension, given by
[X + iX ,Y + iY ] = ([X ,Y ] − [X ,Y ]) + i ([X ,Y ] + [X ,Y ]) . (2.14) 1 2 1 2 C 1 1 2 2 1 2 2 1
This Lie bracket is again bilinear, antisymmetric and satisfies the Jacobi identity (Proposition 3.37 ∼ ∼ Hall). For example, one can easily check that gl(n, R)C = gl(n, C) and sl(n, R)C = sl(n, C). Also, consider the unitary group U(n), consisting of all unitary n × n complex matrices; whose Lie algebra is † u(n) = {X ∈ M(n, C): X = −X}, since
† gg† = eX eX = I, (2.15)
for all g ∈ U(n) connected to the identity (i.e., with positive determinant) and for all X ∈ u(n). To see that this Lie algebra (and group) is real, we note that (iX)† = iX is hermitian (and not antihermitian)
and thus not part of the algebra. Next, let us investigate the complexification u(n)C, which consists of all elements of the form X + iY , for all X,Y ∈ u(n). We saw above that iY is hermitian, which means that X + iY is a sum of hermitian and antihermitian matrices. This is a well-known way of ∼ decomposing any complex matrix; hence, we conclude that u(n)C = gl(n, C). Likewise, we consider the special unitary group SU(n), which consists of all unitary n×n complex matrices with unit determinant. We saw earlier that group elements with unit determinant translate to traceless elements of the Lie ∼ algebra, thus, since the trace is linear, we conclude that su(n)C = sl(n, C), similar to above.
12 Måns Holmberg Master Thesis
Let us now address why this is useful. It trivially turns out that there is a one-to-one correspondence between the representations of a real Lie algebra and its complexification; which come in handy in situations where, compared to the real case, it is easier to study the complexification of a real Lie algebra. We will later see that a Lie algebra representation is a special case of a Lie algebra homomorphism; therefore, the above statement follows from (proposition 3.37 in [17]):
Let g be a real Lie algebra with complexification gC and let h be some complex Lie algebra. Then, every (real) Lie algebra homomorphism φ : g → h can be uniquely extended to a (complex) Lie algebra
homomorphism of gC to h, given by
φ(X + iY )C = φ(X) + iφ(Y ) (2.16)
for all X,Y ∈ g.
2.1.5 Group actions and applications
3 Given an element R of the group of rotations in 3 dimensions, i.e., SO(3), we rotate a vector ~x ∈ R 3 using matrix multiplication, to obtain R~x. We say that this is a group action of an element of R . In more general terms, a (left) group action from a group H to a set M is a mapping ψ : H × M → M such that ψ(I, m) = m ∀ m ∈ M,I identity of H, (2.17) ψ(h , ψ(h0, m)) = ψ(hh0, m) ∀ h , h0 ∈ H , m ∈ M.
The group action is called transitive if and only if for any two elements m, n ∈ M, there exists an element g ∈ G such that n = ψ(g, m); hence, giving a sense of connectedness. 0 0 0 −1 Consider the example of a group action of G on itself, defined by (g , g) 7→ Cg(g ) = gg g , for 0 g, g ∈ G. This is called the conjugate action. When G = GL(n, C), the conjugate action is what we know from linear algebra to be a similarity transformation. The group action defines an equivalence relation on M; we say that
m ∼ n if and only if n = ψ(h, m) for some h ∈ H. (2.18)
We call the equivalence class of m ∈ M the orbit of m:
ψ(H, m) = {ψ(h, m): h ∈ H} . (2.19)
2 ∼ ∼ To see why this is called an orbit imagine M = R = C and H = SO(2) = U(1), with matrix multiplication as group action. We find that all vectors of equal length belong to the same orbit, thus,
13 Måns Holmberg Master Thesis
justifying the name. Furthermore, if the group action is transitive, only one orbit exists – M itself. More on this later. In the case of the conjugate action, the orbit of g ∈ G is called the conjugacy class of g. The set of all orbits is called the quotient of the action ψ, denoted
M/H = {ψ(H, m): m ∈ M} . (2.20)
2 ∼ + Hence, M/H is obtained from M by identifying every orbit with a single element, e.g., R /SO(2) = R is the set of all possible radii (which cannot be negative). Let H ⊆ G be a subgroup and M = G, with the natural group action ψ(h, g) = gh. Given this, the orbit of g is called the left coset and denoted gH = {gh : h ∈ H}. Similarly, the set of all left cosets is the quotient G/H, which is not necessarily a group!. Given a group action ψ : G × M → M of a group G and a set M, we define the stability group (or as we physicists like to call it – the little group) at m ∈ M by
Gm = {g ∈ G : ψ(g, m) = m} . (2.21)
This is a subgroup of G (Gallier, Warner). Given this definition, symmetry should immediately come to mind, since the stability group is defined by the set of group elements that, given some action, leaves some point m ∈ M invariant. This is precisely where we are going. In fact, the stability group is a key component behind the mathematics of spontaneous symmetry breaking – which in turn is a key concept of chiral perturbation theory.
Let us now show that the choice of m ∈ M for the stability group Gm is arbitrary for a transitive group action, since −1 Gψ(g,m) = gGmg , (2.22) which is easy to show (excuse the slight abuse of notation):
0 0 Gψ(g,m) = {g ∈ G : ψ(g , ψ(g, m)) = ψ(g, m)}
= {g0 ∈ G : ψ(g0g, m) = ψ(g, m)} (2.23) = {g0 ∈ G : ψ(g−1g0g, m) = m}
−1 = gGmg .
Hence, the stability group of m and ψ(g, m) belong to the same conjugacy class, which means that the stability groups of any two elements are necessarily isomorphic, since subgroups of the same conjugacy class are isomorphic. Let us now come back to the statement that for a transitive group action there is only one orbit (or
14 Måns Holmberg Master Thesis
left coset), which is the whole set itself. It might seem intuitive, but we prove it regardless (Gallier, Scherer): Let G be a group and ψ : G × M → M be a transitive group action on the set M. Then, we define the new map ψm : G → M, for a fixed m ∈ M, defined by ψm(g) = ψ(g, m), for all g ∈ G.
Consider the set of ψm acting on the left coset gGm:
0 0 ψm(gGm) = {ψ(gg , m): g ∈ Gm}
0 0 = {ψ(g, ψ(g , m)) : g ∈ Gm} (2.24)
= {ψm(g)}
Consequentially, ψm induces a map ψ˜m : G/Gm → M, defined by ψ˜m(gGm) = ψm(g). We conclude 0 that ψ˜m is injective, since for all g, g ∈ G,
0 0 −1 0 −1 0 0 ψm(g ) = ψm(g) ⇔ ψ(g , m) = ψ(g, m) ⇔ ψ(g g , m) = m ⇔ g g ∈ Gm ⇔ gGm = g Gm . (2.25)
The map ψ˜m is also surjective, since the group action is transitive on M, which make it such that for all ∼ g ∈ G there is an ψ˜m(gGm) = ψ(g, m) = n ∈ M. As a result, ψ˜m is a bijection, such that G/Gm = M. You might wonder what the nature of such quotient sets is. So far, we have not used the fact that we are dealing with Lie groups and, furthermore, we have not yet said anything about the sets which the groups are acting on. The reason why we have not covered this is that it is a bit technical in general; therefore, see [21, 19, 20] if you are curious about the details of quotient spaces. In the special case when we have a Lie group G and a closed Lie subgroup H ⊆ G; the quotient set G/H of a smooth group action forms what is called a homogeneous manifold (or space). A homogeneous manifold is a smooth manifold endowed with a smooth Lie group action. Moreover, if H is normal5, G/H is actually another Lie group (theorem 3.64 in [19]). Now, with all of this machinery at hand, we can describe spontaneous symmetry breaking of some physical system in terms of group theory. Let different physical states of the system be given by m ∈ M, where M is a smooth manifold; and let the system as a whole, described by some scalar function S(m) (e.g., an action), be invariant under the transformations of the Lie group G, given some smooth group action ψ : G × M → M. Then, the minimum of this scalar function, corresponding to a
ground state m0 of the system, might not be invariant under the group action of some elements g ∈ G,
i.e., ψ(g, m0) 6= m0 with the minimum S(ψ(g, m0)) = S(m0). That is, the ground state is not unique! On the other hand, there might be a proper6 closed Lie subgroup of G whose elements do leave the
ground state itself invariant. This subgroup is the stability group Gm0 . But, since the ground state is
5A subgroup H of a group G is normal in G if it is invariant under conjugation by any element of G, i.e. if and only if ghg−1 ∈ H for all g ∈ G and h ∈ H. 6If the subgroup is not proper, it could be the trivial subgroup containing only the identity element, which does not lead to any symmetry breaking. Note that a subgroup H of a group G is proper if and only if H ⊂ G, that is, H 6= G.
15 Måns Holmberg Master Thesis
degenerate, the stability group is not unique either; in fact, all different ground states which can be reached by the group action7 give rise to different stability groups, that are conjugate to each other (as 0 shown in (2.22)). For a specific ground state m0, two group elements g, g ∈ G are equivalent if they 0 0 0 are related by g = g h for some h ∈ Gm0 , following ψ(g, m0) = ψ(g h, m0) = ψ(g , m0). This forms an
equivalence class, given by the left coset gGm0 . Finally, the quotient G/Gm0 is the set of all such cosets,
i.e., representing all group elements which transform m0 into different ground states. It follows that
G/Gm0 is isomorphic to the set of all ground states, which is a homogeneous space that we denote by
M0. The dimension of the space of all ground states is dim(M0) = dim(G/Gm) = dim(G) − dim(Gm) (corollary 3.7.26 in [21]). In quantum field theory, it is in this space where the so-called Goldstone bosons live. More on this and its application to QCD is described in section 2.9.
2.1.6 Representations
A finite-dimensional representation of a Lie group G is a Lie group homomorphism
Π: G → GL(V ) , (2.26) where V is a real or complex vector space, and where GL(V ) denotes all invertible linear transformations of V . Similarly, a finite-dimensional representation of a Lie algebra g is a Lie algebra homomorphism
π : g → gl(V ) , (2.27) where gl(V ) = End(V ) is the set of all linear transformations of V (also called an endomorphism of V ). By choosing a basis of GL(V ) and gl(V ), we can identify them with GL(n, C) and M(n, C), respectively. If two representations are isomorphic we do not distinguish between them, for example if two representations only differ by a choice of basis (thus are related by a similarity transformation). Representations are invertible linear transformations on some vector space V , which means that we can construct two actions Π: G × V → V and π : g × V → V , defined as (g, v) → Π(g)v and (X, v) → π(X)v, respectively, for g ∈ G, X ∈ g and v ∈ V . There are a few ways one can go about combining representations to generate new ones. Here we cover direct sums and tensor products of group representations. We start with direct sums:
Let Π1 , Π2 ,..., Πn be n representations of the matrix Lie group G acting on the vector spaces
V1 ,V2 ,...,Vn. The direct sum of these representations is a new representation Π1 ⊕ Π2 ⊕ · · · ⊕ Πn of
G that acts on V1 ⊕ V2 ⊕ · · · ⊕ Vn, defined by (similar to (2.10))
(Π1 ⊕ Π2 ⊕ · · · ⊕ Πn(g)) (v1 , v2 , . . . , vn) = (Π1(g)v1 , Π2(g)v2 ,..., Πn(g)vn) (2.28)
7 I.e., the group action is transitive when acting on the subset of all ground states (M0).
16 Måns Holmberg Master Thesis
for all g ∈ G. The definition is the same for Lie algebras. In the language of matrices, we can write the representation of direct sums as a block diagonal matrix, where the elements of the representations
make up the blocks. In this representation of representations, the vectors v1 , v2 , . . . , vn on which the representations are acting becomes a single combined column vector. Direct sums of Lie algebra representations work in an analogous way. See (section 4.3.1 Hall) for more information. Next up is tensor product representations. We first need to know the tensor product acting on vector spaces. Given two vector spaces, we can build a new vector space by introducing the tensor product. Let U and V be two finite-dimensional real or complex vector spaces, then, the tensor product of U and V , denoted U ⊗ V , is a vector space W , endowed with a bilinear map φ : U × V → W . One can show that a tensor product of U and V always exist and that it is unique, since any two tensor
products (W1, φ1) and (W2, φ2) of U and V are isomorphic. Therefore, we do not need to specify φ as long as U and V are two finite-dimensional real or complex vector spaces; thus, we denote φ(u, v) as
u ⊗ v for all u ∈ U and v ∈ V . Moreover, suppose that e1, e2, ··· , en is a basis of U and f1, f2, ··· , fm is a basis of V , then
{ei ⊗ fj : 1 ≤ i ≤ n , 1 ≤ j ≤ n} , (2.29)
is a basis for W . From this we see that dim(U ⊗ V ) = dim(U) dim(V ). We can think of the tensor product of vector spaces as a generalization of the outer product, since the outer product is the special case (u ⊗ v = uT v). The uniqueness of the bilinear map φ and (2.29) is described in more detail in (theorem 4.14 in [17]). To obtain tensor product representations, we need to define the tensor product of linear maps. Again, let U and V be two finite-dimensional real or complex vector spaces and let A : U → U and B : V → V be two linear maps. Then, the tensor product A ⊗ B is the unique linear map
(A ⊗ B): U ⊗ V → U ⊗ V, (2.30)
defined by (A ⊗ B)(u ⊗ v) = (Au) ⊗ (Bv) , (2.31)
for all u ∈ U and v ∈ V . The tensor product is given by the Kronecker product when representing the two linear maps as matrices.
Finally, let Π1 be a representation of the Lie group G and let Π2 be a representation of the Lie group H, acting on the vector spaces U and V , respectively. Then, according to (2.31), the tensor
product representation Π1 ⊗ Π2 acting on U ⊗ V , is a representation of G × H, given by
(Π1 ⊗ Π2)(g, h)(u ⊗ v) = (Π1(g)u) ⊗ (Π2(h)v) , (2.32)
17 Måns Holmberg Master Thesis
for all g ∈ G,h ∈ H and u ∈ U, v ∈ V . It is important to note that this notation is, unfortunately,
ambiguous in the special case when H = G, since Π1 and Π2 would be representations of the same
group G, meaning that we can either regard Π1 ⊗ Π2 as a representation of G or G × G. Hence, we
need to be careful to specify in what way we use Π1 ⊗ Π2. Now for the case of Lie algebras. Let G and H be two matrix Lie groups with algebras g and h.
Again, let Π1 and Π2 be representations of the Lie groups G and H, acting on the spaces U and V
respectively. Then, consider the tensor product representation Π1 ⊗ Π2 of G × H. The associated Lie
algebra representation, denoted π1 ⊗ π2, is then (remember (2.12))
d tX tY (π1 ⊗ π2)(X,Y ) = Π1(e ) ⊗ Π2(e ) dt t=0 d tX d tY (2.33) = Π1(e ) ⊗ IV + IU ⊗ Π2(e ) dt t=0 dt t=0
= π1(X) ⊗ IV + IU ⊗ π2(Y ) ,
tX tY with g = e ∈ G, h = e ∈ H and X ∈ g, Y ∈ h. Here, π1 and π2 are the corresponding Lie algebra representations of g and h. Since the associated Lie algebra of G × H is g ⊕ h, in the case of matrix Lie
groups, we have that π1 ⊗ π2 is a representation of g ⊕ h, that is acting on U ⊗ V . Explicitly, π1 ⊗ π2 acts on U ⊗ V as
(π1 ⊗ π2)(X,Y )(u ⊗ v) = (π1(X)u) ⊗ v + u ⊗ (π2(Y )v) , (2.34) where u ∈ U and v ∈ V . Moreover, let Π be a representation of a Lie group G acting on a vector space U. A vector subspace V ⊂ U is invariant if Π(g)v ∈ V for all v ∈ V and g ∈ G. A representation is called irreducible (or simple) if it has no invariant subspaces except the trivial ones: {0} and V itself. In addition, a representation is completely reducible (or semisimple) if V is isomorphic to a direct sum of irreducible representations. The tensor product of two irreducible representations of a group G will, in general, not be irreducible when viewed as a representation of G8. The process of decomposing the tensor product of two representations (irreducible or not) as a direct sum of irreducible representations is called Clebsch-Gordan decomposition. Let us now give an example of one of the most important representations – the adjoint representation. For a matrix Lie group G, with Lie algebra g, the adjoint representations is the map Ad: G → GL(g), −1 9 given by g 7→ Adg(X) = gXg for a fixed X ∈ g . This has its origin as the derivative of the conjugation action at identity. The corresponding representation of the Lie algebra is obtained from
8The same tensor product representation is, however, irreducible when instead viewed as a representation of G × G. 9 gXg−1 X −1 −1 n You can check that Adg(X) is indeed part of g by showing that e = ge g ∈ G, using the hint: (gXg ) = gXng−1.
18 Måns Holmberg Master Thesis
(2.12):
d tX −tX adX (Y ) = d(AdeX )e(Y ) = e Y e = XY − YX = [X,Y ] , (2.35) dt t=0 for all X,Y ∈ g. In physics, we often consider the explicit matrix elements of the adjoint representation (of the Lie algebra), which turns out to be given by the structure constants. To see this, we choose a
basis {Xa} of g and consider what these become in the adjoint representation. Doing this, we get
c adXa (Xb) = [Xa ,Yb] = fab Tc , (2.36)
c c which written in components are (adXa (Xb)) = fab . Instead of this cumbersome notation, we usually c c write [Xa]b = fab . This is in accordance with [23].
2.2 The Lorentz group
The Lorentz group O(1, 3) is a Lie group that is given by the set of all transformations that leave the T µ ν µ 4 symmetric bilinear form hx, gyi = x gy = gµνx y = x yµ invariant, where x, y ∈ R are four-vectors and g = diag(1, −1, −1, −1) is the (flat) metric of spacetime. That is, the Lorentz group consists of all T 10 Λ ∈ M(4, R) such that hΛx, gΛyi = hx, gyi or equivalently g = Λ gΛ, i.e. ,
T O(1 , 3) = Λ ∈ M(4, R): g = Λ gΛ . (2.37)
We will not discuss the origin of the above bilinear form, and why the metric has this specific signature, however, one should know that all of this follows as a direct consequence of the postulates of special relativity. The Lorentz group is not connected, in fact, it consists of four disjoint components, corresponding 0 0 11 to det Λ = ±1 and Λ 0 ≥ 1 or Λ 0 ≤ −1 . None of these components are simply connected. The Lie algebra of a Lie group only deals with group elements which are connected to the identity; hence, the Lie algebra can only reach the elements of the Lorentz group that are in the component that includes 0 the identity – the det Λ = 1, Λ 0 > 1 component. We call this restriction (which is the only component that is also a subgroup) of the Lorentz group the proper orthochronous Lorentz group12, which we will denote by SO(1, 3)+. We find the Lie algebra of SO(1, 3)+ by
T T −1 g = ΛT gΛ ⇐⇒ I = ΛT gΛg−1 ⇐⇒ I = etX getX g−1 ⇐⇒ I = etX etgXg , (2.38)
10This group is in general called the indefinite orthogonal group. 11We find the first condition by taking the determinant of g = ΛT gΛ and the second condition by looking at the µν σρ µ ν µ = ν = 0 component of the same relation in index notation, i.e. g = g Λ σΛ ρ. 12Proper refers to det (Λ) = 1, which means that spatial orientation is preserved, i.e., improper rotations are not 0 allowed. Furthermore, orthochronous refers to Λ 0 ≥ 1, which means that the direction of time is preserved.
19 Måns Holmberg Master Thesis
and by then differentiating both sides and setting t = 0,
d tXT tgXg−1 T −1 T 0 = e e ⇐⇒ 0 = X + gXg ⇐⇒ X g = −gX . (2.39) dt t=0
As a result, the Lie algebra of SO(1, 3)+ is given by
T so(1, 3) = {X ∈ M(4, R): X g = −gX} . (2.40)
We can now count the degrees of freedom to find the number of generators, which we can do directly by plugging in a generic 4 × 4 matrix into (2.39), or by
T µ µ ρµ µ ρ µν νµ (X g) ν = −(gX) ν ⇐⇒ X gρν = −g ρX ν ⇐⇒ X = −X . (2.41)
In the last step we contracted the ρ index and raised the lower index by the metric on both sides. Thus, we have found that elements of the Lie algebra are anti-symmetric 4 × 4 matrices (with both indices up) and these we know have 6 degrees of freedom, meaning that there are 6 generators. Next, let us choose a set of generators that have a nice physical meaning and use these to figure out the commutation relations; these are13: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 i 0 0 −i 0 J1 = ,J2 = ,J3 = , (2.42) 0 0 0 −i 0 0 0 0 0 i 0 0 0 0 i 0 0 −i 0 0 0 0 0 0
and 0 i 0 0 0 0 i 0 0 0 0 i i 0 0 0 0 0 0 0 0 0 0 0 K1 = ,K2 = ,K3 = . (2.43) 0 0 0 0 i 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 i 0 0 0
The set Ji should be familiar as the generators of rotations in 3 dimensions and the set Ki are the generators of boosts. Using these generators, a general Lorentz transformation of SO(1, 3)+ can now
be written as Λ = exp(iθiJi + iφiKi), where the extra i in the exponent exists to cancel the i in the generators, something that is purely a convention. The corresponding commutation relations of the Lie algebra in this basis are
[Ji ,Jj] = iijkJk , [Ji ,Kj] = iijkKk , [Ki ,Kj] = −iijkJk . (2.44)
C By going to the complexification of the Lie algebra, i.e., so(1, 3) −→ so(1, 3)C, we can create a new
13 µ µν These are not anti-symmetric since they are of the form (Ji) ν and not (Ji) .
20 Måns Holmberg Master Thesis
basis 1 1 N = (J + iK ) ,N † = (J − iK ) (2.45) i 2 i i i 2 i i which are closed under the commutator and commute with each other. We find
† † † † [Ni ,Nj] = iijkNk , [Ni ,Nj ] = iijkNk , [Ni ,Nj ] = 0 . (2.46)
Therefore there are two sets of commuting Lie algebras which have the same form of angular momentum operators as su(2), but over the field C. Hence, in the basis (2.45), we form two Lie subalgebras of
su(2)C. Furthermore, these subalgebras span so(1, 3)C, only intersect at the origin and commute with
each other. This means that so(1, 3)C is isomorphic to su(2)C ⊕ su(2)C. This is good, since we know how su(2) representations work.
A finite-dimensional irreducible representation of su(2) (or equivalently su(2)C, since one can always restrict the complex representation to the real part only) are specified by a positive integer or half-integer n = 0, 1/2, 1, 3/2,... , representing spin. Therefore, we label these su(2) representations by
πn, which, can be shown to be 2n + 1 dimensional [23].
From section 2.1.6 we know that a representation of the direct sum su(2)C ⊕ su(2)C is given 0 by πn ⊗ πn0 , where n, n specifies the two su(2) representations; let us use the following shorthand 0 notation π(n,n0) = πn ⊗ πn0 . We can now conclude that a (n , n ) representation (of the Lie algebra) is (2n + 1)(2n0 + 1) dimensional. There is a subtlety here, coming from the fact that the Lorentz group is not simply connected, thus, a representation of the Lie algebra of the Lorentz group might not be a representation of the group, see section 2.1.3. It turns out that a representation of the Lie algebra corresponds to a representation of the group only if n + n0 is an integer (see section 5.6 in [24]). If instead n + n0 is a half-integer, this is not true; however, in this case, we get a so-called projective representation of the Lorentz group. It is these projective representations that contain half-integer angular momenta, meaning that they are of great interest in physics. One usually extends the group to also include projective representations, to what is called its universal cover (or universal covering group). We can quickly figure out that the universal cover of SO(1, 3) is SL(2, C), since
C ∼ so(1, 3) −→ so(1, 3)C = su(2)C ⊕ su(2)C ∼ = sl(2, C) ⊕ sl(2, C) (2.47) ∼ = sl(2, C) ⊕ i sl(2, C) ∼ C = sl(2, C)C ←− sl(2, C) ,
21 Måns Holmberg Master Thesis
shows that the complexification of both SO(1, 3) and SL(2, C) have the same Lie algebra (up to 14 isomorphism) . Because SL(2, C) is simply connected, it follows that SL(2, C) is the universal cover of SO(1, 3) (which can be shown to be unique; see section 5.8 in [17]). You can read more about projective representations and the topology of the Lorentz group in section 2.7 in [24]. From now on, we consider the only universal cover.
We can find the spin content of a representation π(n,n0) by decomposing it as a direct sum of irreducible representations of su(2), i.e., Clebsch-Gordan decomposition. This is described in more detail in Appendix C in [17]. Here is the result15:
π(n,n0) = πn ⊗ πn0 = πn+n0 ⊕ πn+n0−1 ⊕ · · · ⊕ πn−n0+1 ⊕ πn−n0 . (2.48)
With this in mind, let us briefly discuss the representations of most interest in this thesis: The scalar representation, the left- and right-handed Weyl spinor representations, the vector representation; and finally, the Dirac spinor representation to describe spin-1/2 particles (with both left- and right-handed components) and the Rarita-Schwinger representation to describe spin-3/2 particles.
2.3 Representations of the Lorentz group
We start with the irreducible representation (0, 0), a 1 dimensional representation with angular momenta † j = 0. We see that the only 1 × 1 dimensional ”matrices” π(0,0)(Ni) and π(0,0)(Ni ) that satisfy the † commutation relations (2.46) are π(0,0)(Ni) = π(0,0)(Ni ) = 0. Meaning that this representation concerns objects which do not change under Lorentz transformations, i.e., these are (Lorentz) scalars. Things get a bit more interesting when considering the two 2-dimensional irreducible representations
(1/2, 0) and (0, 1/2) with angular momenta j = 1/2, where we can choose π(1/2,0)(Ni) = σi/2 and † † π(1/2,0)(Ni ) = 0 and π(0,1/2)(Ni) = 0 and π(0,1/2)(Ni ) = σi/2, respectively. Here, σi are the three Pauli matrices 0 1 0 −i 1 0 σ1 = , σ2 = , σ3 = . (2.49) 1 0 i 0 0 −1
Writing the transformations of the two representations explicitly (at the group level), we have16