Neutrinos, Symmetries and the Origin of Matter
João Tiago Neves Penedo
Thesis to obtain the Master of Science Degree in Engineering Physics
Examination Committee Chairperson: Prof.a Doutora Maria Teresa Haderer de la Peña Stadler Supervisor: Prof. Doutor Filipe Rafael Joaquim Members of the Comittee: Prof. Doutor Gustavo da Fonseca Castelo Branco Prof. Doutor Ricardo Jorge Gonzalez Felipe
November 2013
“Each piece, or part, of the whole of nature is always merely an approximation to the complete truth, or the complete truth so far as we know it.
In fact, everything we know is only some kind of approximation, because we know that we do not know all the laws as yet.
Therefore, things must be learned only to be unlearned again or, more likely, to be corrected.”
– Richard P. Feynman (1918-1988) The Feynman Lectures on Physics, Vol. I
i ii Acknowledgements
This thesis is not a work of mine alone: behind the stage curtain, a larger cast hides. To them I thank for not only helping me construct a symmetric work, in the Vitruvian sense, but also for keeping me sane in the process (well, as sane as possible at least). I would like to start by thanking my supervisor, Professor Filipe Joaquim, for his guidance, patience, and incessant encouragement. Being one of the few great professors I had, he has been responsible for introducing me to the world of particle physics research, the do’s and don’ts of the field, and for providing all the assistance needed in solving problems and answering questions, big or small, without exception. I am additionally indebted to Funda¸c˜aopara a Ciˆenciae Tecnologia (FCT) and Centro de F´ısica Te´oricade Part´ıculas(CFTP), thanking, in particular, the kindness of Cl´audiaRom˜ao,who has lent a helping hand whenever needed, and the support of Professor Jorge Rom˜ao,who worked towards my participation in the ICTP Summer School on Particle Physics, earlier this year. There, I was given the chance to learn from the leading experts in the field, and would like to express my gratitude towards Doctor Alejandro Ibarra for a most helpful lightning-discussion as well as a saving reference. I would also like to thank my family for their support and encouragement, and for enabling my crazy endeavors, without understanding, most of the time, what the heck I am doing. I thank my friends for the chaotic yet enjoyable sequence of events one calls a physics course. A special mention must be made to Pedro Boavida and Ant´onioCoutinho, to whom I thank for our spontaneous (and crucial) discussions about physics, Copernicus and chameleons. I end by expressing my thanks to Jo˜aoLoureiro, who was there even if two thousand kilometers away, and to S´ılvia Conde, for a superposition of all possible reasons.
iii iv Este trabalho foi financiado pela Funda¸c˜aopara a Ciˆenciae Tecnologia, sob o projecto PTDC/FIS/102120/2008.
This work was supported by Funda¸c˜aopara a Ciˆenciae Tecnologia, under the grant PTDC/FIS/102120/2008.
v vi Resumo
As simetrias como leis de invariˆanciadesempenham um papel fundamental na constru¸c˜aode teo- rias f´ısicas. Em particular, as simetrias de gauge est˜aona base do presente conhecimento do mundo subat´omico,que assenta no Modelo Padr˜aoda f´ısicade part´ıculas. Apesar de repetido sucesso, este modelo tem que ser necessariamente expandido `aluz da existˆenciade massas e mistura de neutrinos. Na presente disserta¸c˜aos˜aoexploradas extens˜oesdo Modelo Padr˜aobaseadas no mecanismo seesaw onde a supress˜aoda massa dos neutrinos ´enaturalmente explicada. Massas de neutrinos n˜aonulas conduzem a mistura lept´onica,cuja estrutura se aproxima a um padr˜ao tribimaximal, apontando para a poss´ıvel presen¸cade simetrias discretas na teoria a altas energias – como a invariˆanciasob transforma¸c˜oes do grupo A4, considerado neste trabalho. O Modelo Padr˜aorevela-se igualmente insuficiente na explica¸c˜aoda assimetria bari´onicado Universo. Nos modelos seesaw ´eposs´ıvel gerar dinamicamente essa assimetria atrav´esdos decaimentos dos novos estados pesados (fora de equil´ıbriot´ermico)mediante o mecanismo de leptog´enese,cuja eficiˆencia´ede- terminada numericamente resolvendo o sistema de equa¸c˜oesde Boltzmann adequado. Nesta disserta¸c˜ao, apresenta-se a an´alisede um modelo particular para viola¸c˜aoespontˆaneada simetria CP onde se explicam as massas e mistura de neutrinos impondo uma simetria discreta A4. A implementa¸c˜aodo mecanismo de leptog´eneseneste contexto ´ediscutida em detalhe.
Palavras-chave: Assimetria bari´onicado Universo; Leptog´enese;Massa e mistura de neutrinos; Mecanismo seesaw; Simetrias; Viola¸c˜aode CP
vii viii Abstract
Symmetries, understood as laws of invariance, play a fundamental role in the development of physics. In particular, gauge symmetries are at heart of our current understanding of the subatomic world, which relies on the Standard Model of particle physics. Despite its repeated successes, this model must neces- sarily be extended to accommodate the experimental observation of nonzero neutrino masses and mixing. In this thesis, we explore seesaw extensions of the Standard Model, where heavy states mediate neutrino mass generation and the smallness of these masses is naturally accounted for. Nonvanishing neutrino masses allow for leptonic mixing, whose structure strongly differs from that of quark mixing. The closeness of the lepton mixing matrix to the tribimaximal pattern points to the presence of discrete symmetries in the underlying high-energy theory – such as invariance under transformations of the A4 group, considered in this work. The Standard Model also fails to provide a satisfactory mechanism for the generation of the baryon asymmetry of the Universe. A remarkable feature of the seesaw extensions is the possibility that the out-of-equilibrium decays of the new heavy states are responsible for the dynamical generation of this asymmetry. This corresponds to the leptogenesis mechanism, whose efficiency is here determined by numerically solving a system of Boltzmann equations. Additionally, a particular model for spontaneous leptonic CP violation is analysed where neutrino masses and mixing are explained imposing an A4 discrete symmetry. The implementation of the leptogenesis mechanism in this context is discussed in detail.
Keywords: Baryon asymmetry of the Universe; CP violation; Leptogenesis; Neu- trino masses and mixing; Seesaw mechanism; Symmetries
ix x Contents
Acknowledgements iii
Resumo vii
Abstract ix
List of Figures xiv
List of Tables xv
List of Abbreviations xvii
1 Symmetries and Asymmetries in Nature 1 1.1 Evolution of the Concept of Symmetry ...... 3 1.2 Groups and Symmetry ...... 5 1.3 Symmetry in Physics ...... 8 1.3.1 From Classical to Quantum Mechanics ...... 8 1.3.2 New Kinds of Symmetry ...... 9 1.3.3 The Discrete Symmetries C, P and T ...... 10 1.3.4 Symmetry Breaking ...... 12 1.4 A Philosophical Interlude ...... 14 1.5 The Asymmetry of Existence ...... 15 1.5.1 Experimental Evidence ...... 15 1.5.2 The Tuning of Initial Conditions ...... 15 1.5.3 The Possibility of a B-Symmetric Universe ...... 16
2 The Standard Model of Particle Physics and (slightly) Beyond 17 2.1 Recap of the Electroweak Sector of the SM ...... 17 2.1.1 Neutral and Charged Electroweak Currents ...... 19 2.1.2 The Higgs Mechanism ...... 20 2.1.3 Fermion Masses and Mixing ...... 22 2.2 Neutrinos Beyond the SM ...... 25 2.2.1 The Neutrino Mass Term ...... 25 2.2.2 The Seesaw Mechanism ...... 27
xi 3 Lepton Mixing and Discrete Family Symmetries 35 3.1 Lepton Mixing ...... 35 3.2 Discrete Family Symmetries ...... 39 3.2.1 Symmetries of the Mass Matrices ...... 39 3.2.2 Direct vs. Indirect Models ...... 40
3.3 An A4 Model with Spontaneous CP Violation ...... 41 3.3.1 Spontaneous CP Violation ...... 44 3.3.2 Neutrino Masses and Mixing ...... 45 3.3.3 Nonzero Reactor Neutrino Mixing Angle ...... 47
4 Baryogenesis through Leptogenesis 49 4.1 Topics of Cosmology and Thermodynamics ...... 49 4.1.1 Cosmological Inflation ...... 49 4.1.2 Equilibrium Thermodynamics ...... 50 4.1.3 Expansion, Entropy and Degrees of Freedom ...... 52 4.1.4 Brief Thermal History of the Universe ...... 53 4.2 The Sakharov Conditions ...... 54 4.3 Is the SM Enough? ...... 56 4.4 Thermal Leptogenesis ...... 59 4.4.1 CPT, Unitarity and CP Asymmetries ...... 60 4.5 Boltzmann Equation(s) ...... 62 4.5.1 Two-body Decays and Inverse Decays ...... 63 4.5.2 2 2 Scatterings ...... 65 ↔ 5 Type II Seesaw Leptogenesis 69 5.1 Flavoured CP Asymmetries from Triplet Decays ...... 69 5.2 Boltzmann Equations for Type II Seesaw ...... 71 5.3 Scattering Reaction Densities ...... 74
5.4 Leptogenesis in an A4 Model ...... 75
Conclusions 79
Bibliography 88
A Computing Diagrams with Majorana Fermions 89
B Clebsch-Gordan Coefficients for A4 91 B.1 General Description of the Group ...... 91 B.2 Choice of an Explicit 3D Representation ...... 92 B.3 The Tensor Product Representation ...... 93 B.4 Computing the CGCs ...... 94
xii List of Figures
1.1 Selected symmetry drawings of M. C. Escher (1941) ...... 1 1.2 Results of subjecting an artificial “quasi-lattice” based on a Penrose tiling to optical diffraction (left), obtained by A. Mackay in 1982, and its physical analogue (right): elec- tron diffraction patterns of an aluminium-based icosahedral quasicrystal, published by D. Shechtman et al. in 1984 ...... 2
2.1 Mass hierarchy of the elementary fermions observed in Nature. Mass values and uncer- tainties are obtained from J. Beringer et al. (Particle Data Group) 2012 and references therein (light quarks present the highest relative mass uncertainties) ...... 25 2.2 Exchange interactions which in the effective theory give rise to the Weinberg operator of
(2.45). Seesaw types I and III correspond to the exchange of fermion fields NR and ΣR, respectively (left diagram), while the type II seesaw mechanism is implemented through the exchange of scalar fields ∆ (right diagram) ...... 28 2.3 Vertex contributions from the interaction Lagrangian (2.51) for type I seesaw ...... 29 2.4 Vertex contributions for type II interactions relevant for effective neutrino mass generation 32
3.1 Depiction of lepton mixing for both a normally ordered and an inverted neutrino mass spectrum, where the global fit data of Table 3.1 has been considered (left), to be compared with the tribimaximal ansatz (right) ...... 37 3.2 Predicted values for neutrino masses as a function of the CP-violating angle β ...... 46
3.3 Scatter plot of the experimentally allowed regions in the ε1, ε2 plane (left), where exact
TBM is seen to be excluded, and corresponding regions of the JCP, β plane (right) . . . 48
3.4 Values for the neutrinoless double beta decay parameter mee , in the exact TBM and perturbed cases, as a function of β ...... 48
4.1 Brief thermal history of the Universe ...... 54 4.2 Schematic representation of the vacuum structure of the electroweak theory (left) and effective diagram for the transition between vacua (right) ...... 57 4.3 Diagram for the expansion of a ‘true vacuum’ bubble ...... 58 4.4 Effect of electroweak sphalerons on the quantum numbers B and L ...... 60 4.5 Tree-level and one-loop diagrams for the process X ` ` whose interference generates a → CP asymmetry when compared to the conjugate process ...... 61
xiii 5.1 Tree-level diagrams for the decays of type II seesaw scalar triplets and one-loop diagrams contributing to the decay process ∆ ` ` ...... 70 i → α β 5.2 Scalar triplet interactions relevant to the BE out-of-equilibrium analysis, where one con- siders the diagrams presented for decays, inverse decays and s- and t-channel scatterings and their charge conjugates, as well as gauge scattering reactions ...... 72 5.3 Contours of the (magnitude of the) maximum CP asymmetries in the decays of hierarchical
scalar triplets ∆1,2 ...... 76 5.4 Scatter plot of the baryon asymmetry generated in randomly-chosen perturbed versions of 12 12 the model, for M1 = 10 GeV (black) and M1 = 5 10 GeV (cyan) ...... 77 × 5.5 Reaction densities normalized to the product H(T ) nγ (T ) (left) and evolution of the various densities considered in the BE network (right) ...... 78
A.1 Rules for writing propagator, vertex and spinor contributions obtained from the article of Denner et al...... 90
xiv List of Tables
1.1 Effect of some S4 permutations on an array of different circles ...... 6
2.1 Number of parameters contained in complex matrices depending on their properties . . . 23
3.1 Global fit results for the three-neutrino oscillation parameters (mass differences, mixing angles and Dirac phase) and for both ordering possibilities (see text) ...... 36
3.2 Representation assignments of the various fields under the action of the groups A4, Z4, and gauge SU(2) U(1) ...... 42 L× Y
B.1 Character table for the group A4 ...... 92
B.2 Explicit decomposition of the elements of A4 in terms of s and t ...... 92
xv xvi List of Abbreviations
BAU Baryon Asymmetry of the Universe BBN Big-Bang Nucleosynthesis BE Boltzmann (transport) Equation CC Charged Current CGC Clebsch-Gordan Coefficient CP Charge Conjugation and Parity EWPT Electroweak Phase Transition EWSB Electroweak Symmetry Breaking FRW Friedmann-Robertson-Walker (metric) GUT Grand Unified Theory NC Neutral Current QCD Quantum Chromodynamics QED Quantum Electrodynamics QFT Quantum Field Theory RIS Real Intermediate State SM Standard Model SMC Standard Model of Cosmology SSB Spontaneous Symmetry Breaking SUSY Supersymmetry TBM Tribimaximal VEV Vacuum Expectation Value
xvii xviii Symmetries and Asymmetries in Nature 1
It was from his second visit to the Moorish palace of Alhambra (Granada, Spain), in 1936, that Maurits C. Escher (1898-1972) drew inspiration to produce sketchbook after sketchbook of patterned drawings in a style which he called “regular division of the plane” [1]. Unbound by Islamic artistic tradition, which forbids the depiction of human and animal figures, this Dutch graphic artist, known for his graphical paradoxes and representation of impossible worlds, was able to produce spectacular and intricate plane filling motifs as the ones shown in Fig. 1.1.
(a) Symmetry drawing no. 45 (b) Symmetry drawing no. 67 (c) Symmetry drawing no. 88
Figure 1.1: Drawings of M. C. Escher (1941) which exhibit different types of geometrical symmetry. Aside from translational symmetries, these patterns present (if one ignores colour): a) reflection and four-fold rotational symmetries, b) glide reflection symmetry, and c) two-fold rotational symmetry.
All M.C. Escher works c 2013 The M.C. Escher Company - the Netherlands. All rights reserved. Used with permission. www.mcescher.com
Escher shared his early attempts at plane division with his brother, a geologist at the University of Leiden, who referred him to the existing work of crystallographers and mathematicians [2]. These were people who systematically studied the regularities of the flat worlds Escher was trying to recreate. The artist’s “regular division of the plane” is associated to the mathematical concept of tessellation – which corresponds to the tiling of a two-dimensional surface – and refers, in particular, to periodic tilings with translational symmetry in two independent directions. He became especially interested in a 1924 article by George P´olya [3], in which the author proves that any of these tilings can be taken to belong to one of 17 mathematical classes, corresponding to the so-called wallpaper groups.
1 Figure 1.2: Results of subjecting an artificial “quasi-lattice” based on a Penrose tiling to optical diffraction (left), obtained by A. Mackay in 1982 [4], and its physical analogue (right): electron diffraction patterns of an aluminium-based icosahedral quasicrystal, published by D. Shechtman et al. in 1984 [5]. Shechtman was awarded the 2011 Chemistry Nobel prize for the discovery of quasicrystals.
A wallpaper group is defined by the possible operations one can apply to the aforementioned tilings while leaving them invariant, i.e. indistinguishable from themselves prior to performing the operation. This classification of tilings is thus based on the symmetries they possess. The operations are selected from the set of Euclidean plane isometries, which comprises (aside from translations) rotations, reflections and glide reflections. If the restriction of demanding translational symmetry in two independent directions is removed, one might construct tilings which present no periodicity at all, such as Penrose tilings. Yet may still present rotational and reflection symmetries.
In nature, one finds that crystals can be classified according to a three-dimensional generalization of the wallpaper groups – the space or Federov groups – of which 230 exist. This classification, of great use to crystallographers, is possible since perfect crystals present the property of translational invariance in three independent directions in space. These systems are thus the par excellence example of how symmetries are present in the physical world: the properties of crystals signal the presence of regularity at the most fundamental level of matter.
There is, once more, the possibility of constructing nonperiodic physical structures, known as qua- sicrystals, for which the demand of translational invariance has been lifted. These objects can possess rotational symmetries not allowed1 in periodic crystals, such as eightfold [6], tenfold (see Fig. 1.2), and twelvefold [7] rotational symmetries. Examples of symmetry in biological systems proliferate: consider, for instance, the structure of honeycombs or the fivefold rotational symmetry than can be seen in a horizontally cut apple. Humans also tend to link symmetry with beauty, be it associated with works of architecture or the bilateral symmetry of a person’s face [8].
So far we have only considered a geometrical notion of symmetry, associated with regularity. This corresponds to the layman’s understanding of the word and one might wonder how one got to it.
1In the 2D wallpaper groups and the 3D Federov groups, the only possible rotational symmetries are twofold, threefold, fourfold, and sixfold rotational symmetries.
2 1.1 Evolution of the Concept of Symmetry
The term ‘symmetry’ has its origins in the Ancient Greek word συµµετρια´ , which results from the fusion of συν´ (with) and µετρ´ oν (measure) [9]. As suggested by its etymology, this term was used to represent a notion of commensurability, the possibility of measuring using a common standard, which translates into the presence of integer-based proportion relations.
In addition to the mathematical meaning of commensurability, where no subjectivity is implied, the word συµµετρια´ became generally associated with beauty, harmony and unity. This is the meaning which Plato ascribes to the word while referring to the human body in his dialogue Timaeus (c. 360 BC). It is also in this dialogue that the Greek philosopher theorizes about the nature of the classical elements – fire, water, air, and earth – which are, based on their properties, associated to four out of five possible convex regular polyhedra. These are the so-called Platonic solids and Plato’s association is based on the beauty he sees in them. Jumping forward to the sixteenth century, we see a revival of the attempt of using platonic solids to describe nature in Johannes Kepler’s Mysterium Cosmographicum (1596). The solids are now used to describe the geometry of planetary orbits, furnishing Kepler with a reasonable approximation of the ratios between the radii of orbits, which were first taken to be circular. Despite Kepler’s appeal to harmony, there is, however, no occurrence of ‘symmetry’ in his work.
Tracing the evolution of the meaning of ‘symmetry’ has proven to be a difficult endeavour to historians since one is tempted to reinterpret scientific documents in light of a modern view of the concept. G. Hon and B. Goldstein discriminate between two parallel paths in the evolution of the term, namely a mathematical path vs. an aesthetic one [10]. In fact, unlike the Greek case in which the two meanings of συµµετρια´ were present in a unique word, in Latin these were separated into commensurabiles and symmetria. The former term was used exclusively in a scientific context, allowing us to follow the mathematical path, while the latter pertains to a general, aesthetic notion of being well-proportioned, corresponding to an altogether different evolution. In the remainder of this section, one follows Hon and Goldstein’s historiographical work.
Concerning the mathematical path, one recognizes the usage of συµµετρια´ (and derived forms) as commensurability in ancient works such as Plato’s Theaetetus (c. 369 BC), Aristotle’s Nicomachean Ethics (c. 350 BC), and Euclid’s Elements (c. 300 BC). Indeed, in the Latin editions of these works, the term συµµετρια´ would be translated to commensurabiles. Such is the case with Isaac Barrow’s (1630-1677) edition of Euclid’s Elements, published in 1655. Barrow sets apart the two meanings of συµµετρια´ , showing that they were understood at the time.
Looking into the aesthetic path, where judgement is key, one finds that συµµετρια´ was used to describe ‘proper’ proportion, be it in Plato’s Timaeus, where ‘proper’ refers to beauty, in Aristotle’s Nicomachean Ethics, where certain occurrences of συµµετρια´ are associated with moderation, or in Ptolemy’s Almagest (c. 150), where it is used to convey the idea of suitability. It was Vitruvius who coined the Latin term symmetria to characterize an entity, such as a building or a machine, whose parts are joined gracefully by imposing well-chosen proportion relations between the parts themselves and between the parts and the whole. This is how Vitruvius’ transliteration of the Greek word is understood
3 in the context of his theory of architecture, presented in his De architectura (c. 15 BC): a property of a beautiful, well-coordinated unity. Claude Perrault (1613-1688), who translated the De architectura to French in 1673, separated Vitruvius’ use of symmetry, which he takes to simply pertain to proportion, from the common meaning of the word in seventeenth-century France. This common usage was associated to a kind of geometrical correspondence, such as that which relates the disposition of windows between the left and right sides of a building’s fa¸cade,and it represents an underdeveloped version of what we today understand as mirror symmetry.
Further uses of symmetry in scientific literature are found to be technical extensions of the concept of Vitruvius, such as Carl Linnaeus’ (1707-1778) use of the word connected to the idea of functionality in his classification of plant species or Ha¨uy’sformulation of a ‘law of symmetry’ in the context of crystallography. The aesthetic path turns into a scientific one. Ren´e-JustHa¨uy(1743-1822), a French mineralogist responsible for important steps in the mathematization of crystallography, initiated the systematic use of the term ‘symmetry’ in that field. He gives, however, no definition of the term, which is taken from context to refer to the geometry of the crystal, specifically to the relative disposition of its facets. What he calls his ‘law of symmetry’ corresponds to the fact that certain rotations of a crystal yield identical views. This hints towards the modern, geometrical notion of symmetry presented in the beginning of the current chapter.
Other rare occurrences of the word symmetry in eighteenth-century documents have been recorded in works of physics such as those by Henri-Louis Duhamel du Monceau (1700-1782) and Gaspard Monge (1746-1818) regarding ship design and construction. Duhamel du Monceau and Monge appeal, respec- tively, to equilibrium and to a bilateral equivalence between both sides of a vessel, but give no definition of symmetry.
Hon and Goldstein argue that it was not until the work of Adrien-Marie Legendre (1752-1833) that a turning point occurred. His studies of solid geometry in 1794 led him to explicitly define symmetry as a relation – not a property of a whole – between two solids which are, in our modern terminology, each other’s mirror image. Solids related in this way cannot be rotated in three-dimensional space such that they are made to coincide. Even though Legendre gave no reason for his choice of the word symmetry, one might conceive a link between his definition and Perrault’s geometrical correspondence.
It was after Legendre’s introduction of symmetry in solid geometry that adoption of term gradually flourished in distinct domains of science. Andr´e-MarieAmp`ere(1775-1836) imported Legendre’s defini- tion into chemistry, while other scientists, who were responsible for the introduction of the concept in their respective fields, gave distinct, parallel meanings to symmetry. Pierre-Simon Laplace (1749-1827) introduced it in probability to characterize a sequence of well-ordered events (symmetry as regularity), while Sylvestre Lacroix (1765-1843) took symmetry in algebra to signal invariability of a function re- garding permutation of its roots (symmetry as invariance). There is a close relation between Lacroix’s definition and the way ‘symmetrical function’ is understood in current technical usage. This plurality of meanings was often taken in stride. Such is the case with Augustin-Louis Cauchy (1789-1857), who used the term in both algebra and geometry without commenting on the different meanings it was given.
4 A unification of different meanings was accomplished through the group-theoretical definition of sym- metry. Group theory owes its development to the work of researchers such as Evariste´ Galois (1811-1832), Marius Sophus Lie (1842-1899), and Christian Felix Klein (1849-1925). It is concerned with the study of groups, an algebraic structure to be explored in the following section. The concept of group allows one to tie togheter the various meanings of symmetry through the idea of invariance under specified operations. The symmetries of a geometrical figure are thus identified with the possibility of invariance of said figure under, say, reflections or rotations. The operations need not be geometrical in nature, and thus symmetry is generalized, in modern science, beyond geometry.
1.2 Groups and Symmetry
In the context of abstract algebra, a group (G, ) is defined by both an operation ‘ ’, designated by · · group multiplication or product, and a set2 G of elements g. The product, not necessarily commutative, of two group elements g G and g0 G can thus be denoted by g g0 or simply by g g0. If the set G is ∈ ∈ · finite, the group itself is said to be finite or discrete. For an uncountable number of elements the group is said to be continuous. If, additionally, the group product is commutative, the group is said to be Abelian. To complete this definition, the elements of G must satisfy the four following axioms:
Closure: If g, g0 G and g00 = g g0, then g00 G. • ∈ ∈ Identity: There exists an element e G such that g e = e g = g for every g G. • ∈ ∈ Inverse: For every g G, there exists an element g−1 G, such that g g−1 = g−1g = e. • ∈ ∈ Associativity: For every g, g0, g00 G, the relationship (g g0)g00 = g(g0g00) holds. • ∈ To relate this abstract definition with geometrical symmetry it suffices to say that the elements of the group are made to correspond to geometrical operations – such as rotations, reflections or translations – and the group multiplication denotes composition of operations (associativity is guaranteed). There should be no confusion between the operations which belong to G and the operation of group multipli- cation, necessary for defining a group. The group property of closure implies that the composition of two geometrical operations is itself a geometrical operation. The identity element e corresponds to per- forming no operation whatsoever, while the existence of an inverse element tells us that for a geometrical operation, there exists another which cancels its effect. By looking at one of Escher’s symmetry drawings of Fig. 1.1, one can identify the operations which leave it invariant (see caption) and thus determine to which wallpaper group it belongs3. The observed properties of invariance of the system (the drawing), namely the geometrical operations which leave it indistinguishable from its previous self, define the group. The converse way of thinking is to specify the group and look for how the system must ‘respond’ to fulfil the conditions of invariance imposed by each group element. Take, for example, the group of permutations of n elements, denoted Sn (non-Abelian for n > 2). Each of the group elements is a permutation operation on some abstract space of n objects, such as a queue of n people, or a list of the n
2Often, this set-operation pair description is omitted and the group is identified with the set that defines it. 3In the language of the wallpaper groups, Escher’s drawings correspond (if one again ignores colour) to the groups known as: a) p4g, b) pg, and c) p2.
5 e (12) (13) (24) (234) (432) (132) (1234) (1324) (12)(34) (13)(24) (14)(23)
Table 1.1: Effect of some elements of the permutation group S4 on an array of different circles. Permu- tations are written in the cycle notation. For example, (123) reads “1 goes to 2, 2 goes to 3, and 3 goes to 1”, and (13)(24) corresponds to “1 goes to 3, 3 goes to 1, 2 goes to 4, and 4 goes to 2”.
2 2 2 arguments of a function. If one is given the function f(x, y, z) = a1 x +a2 y +a3 z +b xy +c xyz, where x, y, z are real variables and ai, b, c C are taken to be constant, imposing invariance under permutation ∈ of the arguments of f, i.e. imposing invariance under S3, translates into demanding a1 = a2 = a3 and b = 0 (no restriction arises for c). No exchange of arguments can change the value of the function. To proceed, it is useful to define what is known as a representation of the group – a mapping between each element of G and ‘something’ that carries out the operation. For a drawing of Escher, this ‘something’ could be the command “Take the picture and carefully rotate it by 90 degrees!”. In a mathematical language, group elements g are mapped into matrices Ug which act as linear transformations on some vector space. In order to define a representation, the matrices must obey the relation Ug Ug0 = Ug g0 . The vectors which belong to the vector space correspond to parts of the system under study.
To understand this, take the example of the group S4. Like all permutation groups, S4 is a discrete group: it has a finite number of elements, #S4 = 4! = 24. Each element of S4 can be denoted, in what is called the cycle notation, by a collection of number sequences separated by parenthesis. Each sequence of numbers determines which objects to cycle. This notation can be clarified by looking at Table 1.1, where some S4 permutations are applied to an array of four different circles. Since the notation relies on cycles, one has, for example, the equivalence (432) = (324) = (243). The full set can be written as:
S4 = e, (12), (13), (14), (23), (24), (34), (12)(34), (13)(24), (14)(23), (234), (243), (134), { (1.1) (143), (124), (142), (123), (132), (1234), (1243), (1324), (1342), (1423), (1432) . } A natural way of representing the group is to map each element g S4 into a 4 4 matrix U in the ∈ × g way suggested by the following examples (as one might guess, Ue = 14×4):
0 1 0 0 0 0 0 1 1 0 0 0 1 0 0 0 1 0 0 0 0 0 1 0 U(12)(34) = , U(1234) = , U(243) = . (1.2) 0 0 0 1 0 1 0 0 0 0 0 1 0 0 1 0 0 0 1 0 0 1 0 0 4 These matrices form a four-dimensional representation of S4 and act on the space (C ) spanned by the following vectors:
1 0 0 0 0 1 0 0 1 , 2 , 3 , 4 . (1.3) | i ≡ 0 | i ≡ 0 | i ≡ 1 | i ≡ 0 0 0 0 1
6 The choice of representation matrices of Eq. (1.2) becomes clear if one considers their action on the vectors of (1.3). In looking for an object which is invariant under the group S4, one finds that multiples of the vector 1 + 2 + 3 + 4 are clearly left unchanged after the action of any group element4, which | i | i | i | i is to say after i U i , g S4. Other invariant expressions can be constructed, a simple example | i → g | i ∀ ∈ ∗ 4 being the scalar y x = y xi, where x , y C , i.e. x = xi i , y = yi i and xi, yi C. This is h | i i | i | i ∈ | i | i | i | i ∈ † T −1 † so since U = U = (U ) = U −1 , and so y x y U U x = y U −1 x = y U x = y x . g g g g h | i → h | g g| i h | (g g)| i h | e| i h | i The operations act on parts of the system and, for a symmetric one, they cancel out. In fact, the above example of a S3-symmetric function can be recast in this new language of representations. One rewrites the function as f(x, y, z) = Aij rirj + Cijk rirjrk, with ~r = (x, y, z), the only nonzero entries of Aij and 5 Cijk being Aii = a for each i and C123 = c. Suppose now that ~r transforms according to the natural 3D 0 representation of S3, meaning that under the action of a certain g S3, ~r transforms as r (U ) r . ∈ i → g ij j 0 Here, the matrices Ug form a representation of S3 analogous to the one given for S4 in (1.2). One notices that, given the structure of Aij and Cijk, the function remains unchanged under the action of any group element, as expected. A remark should be made regarding group construction. It is possible to take a group G and reduce the set that defines it in such a way that the remaining structure H is still a group – H is said to be a subgroup of G. S3 is a subgroup of S4 obtained by restricting ourselves to three objects instead of four, removing all permutations which would affect that fourth object. Another subgroup of interest to us is A4, which is obtained from S4 by keeping only elements, denoted even permutations, which can be decomposed into a product of an even number of ‘pair switches’ – permutations which switch only two objects. Any element of S can be decomposed into a product of ‘pair switches’: e.g. (1234) = (14) (13) (12), where we stress n · · that the present convention implies combining permutations from right to left (first we apply (12), then
(13), and finally (14)). The full set of even permutations of four elements is (#A = #S /2 #A4 = 12): n n ⇒
A4 = e, (234), (243), (134), (143), (124), (142), (123), (132), (12)(34), (13)(24), (14)(23) . (1.4) { }
To give an example concerning continuous groups, consider SO(n), the group of rotations in n dimen- sions, of which SO(m) with m < n is trivially a subgroup (rotations in a plane can be seen as a subset of rotations in 3D space). These groups are naturally represented by orthogonal rotation matrices.
One can also build groups by joining rather than reducing them. Starting from two groups G1 and
G2, one can construct a third group by considering their direct product, G3 = G1 G2. Here, the set G3 × is indeed constructed through the Cartesian product of the others: each g3 can be identified by a pair 0 0 0 (g1, g2) and one has g3 g = (g1 g , g2 g ). Demanding invariance of a system under G3 corresponds to · 3 · 1 · 2 independently demanding its invariance under G1 and G2.
4 4 This means that there is a subspace of C which is invariant under the action of these matrices, and thus there exists a basis in which all matrices Ug take the form: 1 " # Ug = . Ue g
The matrices Ue g correspond to a 3D representation in their own right, and one has thus decomposed (through a change of basis) our natural 4D representation into two of smaller dimension. When further reduction is impossible, one is left with so-called irreducible representations (or irreps.) along the diagonal. 5It is common, yet misleading, to phrase this as ~r ‘is in’ or ‘belongs to’ the representation under discussion.
7 To give an example of how the direct product works, consider the Abelian group Zn, which corresponds to the set a, a2, . . . , an−1, an = e , where the relations between group elements are made explicit. The { } direct product Z2 Z2 produces a group with elements (e1, e2) e, (a, e2) a, (e1, b) b, (a, b) ab , × { ≡ ≡ ≡ ≡ } where one has denoted the elements of the first Z2 by e1, a and the elements of the second Z2 by { } 6 2 2 2 e2, b . Although this group presents four elements it is not isomorphic to Z4, since a = b = ab = e, { } corresponding to the only other possible group with four elements, the Klein group Z2 Z2. K ≡ ×
1.3 Symmetry in Physics
1.3.1 From Classical to Quantum Mechanics
Symmetry, taken henceforth to signify invariance under specified operations, has been instrumental to the development of modern physics. This group-theoretical notion was first imported from crys- tallography into physics thanks to Pierre Curie who, citing the work of Auguste Bravais and Evgraf Federov, investigated the connection between symmetries and physical phenomena in his 1894 paper Sur la Sym´etrie dans les Ph´enom`enesPhysiques [11]. In this work, Curie formulated what is now known as Curie’s Principle, which recognizes that the symmetries of a physical medium restrict which phenomena can occur. Hence, to allow certain phenomena, the presence of certain asymmetries is required. By looking at what one considers to be the laws of nature, one can identify symmetries of nature (or, as Eugene Wigner called them, laws of invariance). According to Wigner [12], it was Einstein’s work on special relativity in 1905 that brought the “reversal of a trend”: instead of reading off laws of invariance from laws of nature, it became natural to establish laws of invariance and from there go on to derive the laws of nature. Indeed, special relativity admits invariance under the Poincar´egroup (also called the inhomogeneous Lorentz group) which comprises Lorentz boosts, translations in space, translations in time, and spatial rotations7. One can regard these invariances as a fundamental part of the structure of the theory, the last three corresponding respectively to homogeneity, uniformity in time, and isotropy of space (Wigner’s “older principles of invariance”). Laws of invariance in theories described by the Lagrangian formalism can be associated with the conservation of a physical quantity through Emmy Noether’s first theorem (proved in 1915 and published in 1918 [13]). Invariance under translations in space is thus linked to the conservation of linear momen- tum, invariance under translations in time to the conservation of energy, and invariance under spatial rotations to the conservation of angular momentum. Additionally, an interesting connection between symmetries and non-observable quantities is given by T.D. Lee [14]. Invariance under space translations, time translations, or rotations implies that no absolute, preferred position, time, or spatial direction can be observed, respectively.
6Two groups (G, ·) and (G0,?) are said to be isomorphic, (G, ·) =∼ (G0,?) if there exists a map ϕ : G → G0 such that, for every g1, g2 ∈ G, both ϕ(g1 · g2) = ϕ(g1) ? ϕ(g2) and ϕ(g1) = ϕ(g2) ⇒ g1 = g2 hold. An isomorphism denotes a strong equivalence which translates into saying that the two groups are essentially the same. 7Technically, we are restricting ourselves to transformations Λ which can be connected continuously to the group identity, ignoring time and space inversions for which det Λ = −1. Although these are isometries of Minkowski space, they might not be symmetries of a quantum field theory.
8 The advent of (non-relativistic) quantum mechanics brought with it the study of symmetries in a novel context. Wigner pioneered this study, introducing his eponymous theorem (1931) [15] which states that symmetry transformations Uˆ must be either unitary or antiunitary. This can be seen by taking quantum states φ , which transform as φ φ0 = Uφˆ , and noticing that by demanding φ0 φ0 = | ii | ii → | ii | ii |h i| ji| φ φ i, j one obtains that either Uφˆ Uφˆ = φ φ (Uˆ is a unitary operator) or Uφˆ Uφˆ = |h i| ji| ∀ h i| ji h i| ji h i| ji φ φ (Uˆ is an antiunitary, nonlinear operator). For infinitesimal transformations, parametrized as h j| ii Uˆ(ε) = 1 + i ε Gˆ (Gˆ is denoted the transformation generator), demanding unitarity – the majority of operators of interest are unitary – implies that the generator is Hermitian, Gˆ† = Gˆ, allowing it n to be an observable. In the limit of a finite transformation, Uˆ(α) = limn→+∞ Uˆ(α/n) , one obtains Uˆ(α) = exp(i α Gˆ). In either case, the transformation operator Uˆ must commute with the system’s Hamiltonian, [U,ˆ Hˆ ] = 0, since a proper symmetry should not be spoilt by dynamics. This means that Gˆ also commutes with Hˆ , which can be seen in Heisenberg’s representation to directly imply that Gˆ is a conserved quantity, in line with Noether’s theorem.
1.3.2 New Kinds of Symmetry
A shift from spacetime to other kinds of symmetries eventually occurred. Such new symmetries included permutation symmetry, which was introduced by Heisenberg and pertains to the indistinguisha- bility of particles, the charge conjugation symmetry C (such discrete symmetries will be addressed in the following section), and the so-called internal symmetries, such as isospin symmetry. Also due to Heisenberg, isospin symmetry refers to the invariance of strong interactions when one transforms the doublet (proton, neutron)T through an element of SU(2). U(n) is the (continuous) group of n n unitary × matrices and SU(n) is the subgroup of U(n) obtained by keeping only matrices with unit determinant. These transformations represent rotations in an abstract, ‘internal’ space. Disregarding electromagnetic interactions and small mass differences, the validity of isospin symmetry allows one to consider protons and neutrons as different states of the same particle – the nucleon. SU(2) rotations mix these two states while leaving strong interactions unchanged. It is worth noting that isospin transformations are inherently global: the same SU(2) rotation is applied in all points of space at the same time [16].
A generalization can be made from global to local symmetries, where the above abstract rotations can be chosen to differ from point to point. Field theories which are based on invariance under internal, local transformations are called gauge theories. In this context, the transformations themselves are denoted gauge transformations. One of the simplest examples of a gauge theory is quantum electrodynamics (QED), the theory which describes the interactions between light and electrically charged matter. The corresponding gauge symmetry is U(1) or local phase invariance, which corresponds to admitting invari- ance under transformations of the charged matter fields of the form ψ(x) ei q α(x) ψ(x), where q is the → electric charge associated with the field and α(x) is an arbitrary function of space and time. The matter fields are said to belong to the defining representation of U(1). The introduction of a so-called gauge boson with certain transformation properties, in this case the photon field, is crucial to ensure gauge invariance of the QED Lagrangian. The idea of gauge invariance was first put forward by Hermann Weyl
9 in 1918 [17], as an (ultimately unfruitful) extension of Einstein’s work on general relativity8. It was only in 1929 that Weyl established a connection between electromagnetism and local phase invariance [18]. Field theories based on local invariance under the non-Abelian gauge group SU(n) are known as Yang-Mills theories and are at the heart of our current understanding of the subatomic world. They owe their name to C. N. Yang and Robert Mills who, in 1954, explored the consequences of turning isospin symmetry into a local symmetry [19]. In such a transition, several incompatibilities with experimental evidence arise, namely the appearance of several gauge bosons which are massless, leading to forces with an infinite range, and self-interacting, due to the non-Abelian character of the underlying theory [9, 16]. In 1961, S. Glashow put forward a proposal for the unification of the electromagnetic and weak interactions based on the enlarged gauge group SU(2) U(1) [20]. In this electroweak theory, whose × renormalizability (which translates into its calculability) was an issue, mass terms for the gauge bosons had to be put by hand, explicitly invalidating the symmetry. Glashow’s theory was later independently extended by S. Weinberg [21] and A. Salam [22] to include the Higgs mechanism (see Section 1.3.4). Proof that this new theory was renormalizable was given by Gerard ’t Hooft in 1971 [23], while working under Martinus Veltman. For their scientific contributions, Glashow, Weinberg and Salam were awarded the 1979 Nobel Prize in Physics, while Veltman and ’t Hooft were similarly honoured in 1999. The idea of invariance under a group of gauge transformations is a fruitful one. Quantum chromo- dynamics, the standard theory of the strong interaction, is a gauge theory based on the group SU(3). Electroweak and strong interactions are unified in the Standard Model of particle physics, which will be presented in the following chapter. Additional examples of the use of symmetries in modern physics in- clude grand unified theories (GUTs), based on groups which include the Standard Model as a subgroup, supersymmetry (SUSY), where a symmetry between bosons and fermions exists, and discrete flavour symmetries, to be explored in Chapter 3.
1.3.3 The Discrete Symmetries C, P and T
In quantum field theory (QFT), a special status is given to the discrete symmetries under the oper- ations of parity (P), time reversal (T) and charge conjugation (C). The first two operations correspond to spacetime symmetries already present in classical mechanics which were imported by Wigner into the quantum context. The last operation, charge conjugation, presents no classical analogue. In light of Wigner’s theorem, P and C correspond to unitary operators, while T can only be implemented as an antiunitary operator [24]. A parity transformation corresponds to an inversion of all three spatial coordinates, (x, y, z) → ( x, y, z). This inversion can also be obtained by changing the sign of one coordinate and per- − − − forming a 180o rotation, hence the association of parity to reflections and mirror symmetry. A time reversal transformation corresponds to a change in the sign of the time coordinate, t t. Both parity → − and time reversal correspond to improper Lorentz transformations (see Footnote 7).
8Weyl considered the possibility of arbitrary and local rescalings of the spacetime metric, which correspond to the change of a local unit length or gauge, hence the current usage of the latter term [9].
10 Concerning parity, one might ask what goes on the other side of the mirror. As Lewis Carroll’s Alice puts it, “things go the other way”: a right hand is converted into a left hand and vice-versa. One is thus confronted with the concept of chirality, which corresponds to Legendre’s use of the word symmetry, presented in Section 1.1. A object which differs from its mirror image is said to be chiral. The question is whether or not nature cares about chirality at all. In fact, while neither humans nor cars are bilaterally symmetric, there seems to be no reason to expect that their mirror images9, with inverted organs and engines, represent an impossible biology or a faulty vehicle. The chirality of people and cars are considered to be accidents of evolution and design [26] and parity is still expected to be a symmetry of microphysics. However, reality defies intuition: parity is not a symmetry of weak interactions. The idea that this might be so was put forward by T. D. Lee and C. N. Yang in 1956 [27], who proposed an experimental test to parity by considering the beta decay of cobalt-60. This test, performed in 1957 by C. S. Wu et al. [28], confirmed their hypothesis: there is a preferred direction for the emission of β radiation. To clarify what is meant by saying that parity is not a symmetry of nature or, equivalently, that parity symmetry is violated, consider the following dialogue:
Alice: Imagine I have a ball rotating ‘clockwise’ and invert the axes. I then get a ball rotating ‘counterclockwise’. Is this what parity violation means? Bob: No, not at all. That’s just a matter of description. Alice: How so? Bob: Well, the ball is rotating in a certain way. Whether it is clockwise or counterclockwise depends on your perspective, on how you define these words. Alice: I see. Then what does parity violation mean? Bob: Simple – in its extreme version it means that it is not possible to have the ball rotate in the other direction. Nature just doesn’t allow it. Alice: So in a frame where I say it rotates clockwise, no counterclockwise balls can be seen? Bob: And vice-versa. Exactly.
Regarding time reversal, few observed macroscopic phenomena, like a performance of Bach’s crab canon, exhibit a symmetry under such operation. However, the irreversibility of processes like burning a piece of firewood answers to a thermodynamical arrow of time, which is dependent on macroscopically probable configurations of a system and does not imply microscopical irreversability. In fact, the laws of classical mechanics are T-symmetric. Nevertheless, nature is not classical, but quantum, and one begs the question of whether time reversal is a symmetry of the laws of physics. Indeed, compelling experimental evidence that time reversal symmetry is violated has been found in B-meson decay chains by the BaBar collaboration [29]. As we shall briefly see, there is a fundamental link between T, P and C. Charge conjugation is defined in the context of relativistic quantum mechanics and refers to the exchange of particles and antiparticles. The latter correspond to the physically meaningful states resulting from reinterpreting the negative energy solutions of the Dirac equation as positive energy solutions with opposite U(1) charges. Like parity, charge conjugation is a symmetry of electromagnetic and strong
9To consider the mirror image of the world, one must go as far as the molecular level, since certain molecules, like sugars or gasoline, are chiral. The interesting phenomenon of tunnelling between molecular chiral states was investigated by Friedrich Hund in 1927 [25].
11 interactions, but not of weak interactions. In general, both C-symmetry and P-symmetry are violated by the same processes while the product CP of these operations seems to be a valid symmetry. This suggests that it is CP and not C or P individually that constitutes a symmetry of the physical world. Alas, this is not strictly true: in the neutral kaon system – the study of which owes much to the work of Murray Gell-Mann and Abraham Pais [30] – a small but nonzero departure from CP-symmetry was found in 1964 [31] and earned James Cronin and Val Fitch the 1980 Nobel Prize in Physics. There is still hope for salvaging the fundamental role of the above discrete symmetries in physics if one considers the CPT theorem, whose discovery and proof in the 1950s is credited to J. Schwinger [32], W. Pauli [33], J. Bell [34], and G. L¨uders[35]. This theorem states that a combination of the three presented discrete symmetries should be a symmetry of any reasonable quantum field theory, i.e. a theory which has Lorentz invariance, positive energy and local causality [36]. Thus, since CPT is conserved10, CP violation and T violation are fundamentally connected: one demands the other. So far, no deviation from CPT-symmetry has been observed [29, 37].
1.3.4 Symmetry Breaking
Symmetries can be defined as either exact, meaning unconditional validity, approximate, meaning valid only under certain conditions, or broken [9]. Symmetry breaking refers to the destruction of symmetry, the transition (even if abstract) between a situation where the invariance exists to one where it does not. This ‘break’ can be said to occur explicitly or spontaneously. Let us first consider the former case. Explicit symmetry breaking refers to a destruction of symmetry that can be traced to the physical law, in particular, to the presence of Lagrangian terms which spoil the invariance. These terms can arise either by construction, anomalously, or through higher-order effects. An example of symmetry-breaking terms which arise by construction corresponds to the parity violating structure of weak interactions. The transition from a classical description to a quantum one can be responsible for the appearance of so-called anomalous terms in the Lagrangian which can arise, for example, from the regularization procedure. Finally, symmetry breaking terms may arise due to non-renormalizable effects. To understand this, consider a field theory which provides an effective description of reality, meaning it approximates a broader theory at low energies: symmetries present in the effective theory might not be symmetries of the high-energy theory. Symmetries which, although not postulated, are found to be present in the theory and can be broken by quantum corrections or non-renormalizable effects are dubbed accidental.
Accidental symmetries of the SM include the global baryon number (B) and lepton flavour number (Li) symmetries. These are anomalously broken (see Section 4.3), while B L (L = L ) is not. However, in − i i seesaw extensions of the SM (see Section 2.2), the accidental B L is broken byP the non-renormalizable − Majorana mass terms for neutrinos, which arise from the interactions of heavy states in the high-energy theory. Spontaneous symmetry breaking (SSB), in turn, corresponds not to an actual destruction of the sym- metries of the laws but to their hiding, since it is the lowest energy state of the system (or vacuum state,
10This is a blatant abuse of language. To say, for example, that CP is (not) conserved means simply that nature is (not) CP-symmetric. There is no reference to a conserved quantity, but to a symmetry which is maintained.
12 0 ) which presents an asymmetry11. Following M. Guidry [38], one can classify broken symmetry sys- | i tems according to whether Uˆ 0 = 0 (Wigner mode), Uˆ 0 = 0 and the symmetry is global (Goldstone | i | i | i 6 | i mode), or Uˆ 0 = 0 and the symmetry is local (Higgs mode), Uˆ being the operator which realizes the | i 6 | i symmetry operation. The first case may correspond to the above explicit breaking examples while the last two correspond to spontaneous symmetry breaking scenarios. Classically, one can conceive of situations in which the choice of the lower energy state spoils existing symmetries: the fall of a vertically held pole spontaneously breaks rotational symmetry as a particular direction is chosen out of the existing infinite possibilities. One can see that the symmetry is hidden and not destroyed because it is still present in the full set of solutions. Likewise, if the pole is constrained to move in a plane, there are only two possible ground states and the choice of any of them spontaneously breaks the symmetry. In quantum systems, however, spontaneous symmetry breaking will not occur if the number of degenerate ground states is finite, since it is possible to construct a state from their superposition [39]. The concept is only applicable to idealized infinite systems, such as a ferromagnet.
If a ferromagnetic material is heated above the Curie (critical) temperature, Tc, no preferred ori- entation for the magnetic dipoles exists, resulting in zero net magnetization. However, as soon as the temperature drops below Tc, the system transitions to a ground state where a net magnetization develops in one of the infinite possible directions. A link to Curie’s principle is here readily found: it is only the asymmetry of the situation that allows the phenomenon, i.e. the appearance of magnetization in the absence of an applied magnetic field. An important result which arises in the context of SSB is Goldstone’s Theorem [40], which refers to the appearance of massless bosons – termed (Nambu-)Goldstone bosons – when a continuous symmetry is spontaneously broken. The number of such bosons matches the number of generators12 of the broken continuous group. Goldstone himself remarks that although “a method of losing symmetry is [...] highly desirable in elementary particle theory”, there seems to be no way to avoid in this context the introduction of “non-existent massless bosons”. A solution to this unphysical problem is found in the Higgs mechanism, which owes its origins to the work of P. Higgs [43, 44], R. Brout and F. Englert [45], and G. Guralnik, C. Hagen, and T. Kibble [46]. This mechanism consists in the appearance of gauge boson mass terms when the gauge symmetry of the theory is spontaneously broken, thanks to the Goldstone bosons which one expects to arise from the break. These unphysical massless bosons are ‘absorbed’ as mass degrees of freedom by the previously massless gauge bosons. The Higgs mechanism is a crucial feature of the Standard Model of particle physics, since it allows the generation of mass terms for fermions and gauge bosons without explicitly compromising the underlying symmetry group. The recent discovery by the ATLAS and CMS collaborations [47, 48] of a boson with a mass of 125.9 0.4 GeV [37], compatible ± with the Standard Model Higgs particle (which corresponds to an unabsorbed degree of freedom), will certainly allow for a deeper look into the question of the origin of particle mass.
11Thus, unlike what one might suppose, the elliptical motion of planets as opposed to the isotropic character of Newton’s law of universal gravitation does not constitute an example of spontaneous symmetry breaking. 12 2 n−1 One might regard the group Zn = {e, a, a , . . . , a } as being generated by powers of one single element (to wit, a). Similarly, the elements of a continuous (Lie) group which lie infinitesimally close to the identity element form a vector space – tangent to the manifold which the group defines – spanned by basis vectors which are called the group generators [41]. Exponentiation of these infinitesimal elements yields the remainder of the group (recall Section 1.3.1). In special cases, which will be of no concern to us, the number of Goldstone bosons might be less than the number of group generators [42].
13 1.4 A Philosophical Interlude
According to E. Castellani [49], symmetries in physical theories play four separate roles. A classifica- tory role is easily identified in the crystallographic enterprise as well as in the classification of elementary particles, an example being the work of Wigner on the representation theory of the Poincar´egroup [50]. Symmetries also possess a normative role, in the sense that they regulate the form of the theory, as well as a unifying role, present in the construction of theories, such as GUTs, which seek to join the funda- mental interactions under a simple symmetry group. Lastly, symmetries can be attributed an explanatory role, as they are taken to be fundamental principles which dictate how nature must behave. A notion of simplification is transversal to the four roles. According to Curie’s principle, it is asymmetry which allows a diversity of phenomena, which is to say, complexity.
The normative and explanatory roles lead to a methodological aspect: model building in modern particle physics often relies on postulating the presence of certain symmetry properties, as well as the breaking of these invariances, determining which Lagrangian terms are allowed. The increasingly central role which has been given to symmetries is not without justification: both the prediction of the Ω− particle, based on an incomplete classification of baryons and mesons (the Eightfold Way of Gell-Mann and Ne’eman [51]), and its subsequent discovery, along with the prediction and discovery of the W and Z bosons, mark extraordinary successes of the use of symmetry in physics. This predictive power seems to imply that symmetries as a basis for the description of the physical world are ‘here to stay’13.
It is important to emphasize the epistemological character of symmetries: as Wigner points out [12], the artificial (but fruitful) division of reality into initial conditions and laws of nature would not be possible in the absence of invariace under spacetime displacements. Physics would then differ from place to place and time to time, compromising the way scientific knowledge is obtained. There is also a close connection between symmetries and objectivity. Lorentz invariance, for instance, establishes a physical equivalence between different observers, with different perspectives. Equivalence renders conventions irrelevant and only that which is invariant under the symmetry group is considered physical.
The effectiveness of symmetry in physics is analysed by P. Kosso [9], who makes a case against any association of symmetry with design, stating that such a link is incompatible with the objective nature of symmetries: these are introduced, in part, to remove dependences on the decisions of a conscient observer from the physical laws. To end this interlude, one once again turns to Wigner, who ponders about the unreasonable effectiveness of mathematics in the study of natural sciences [52]. In his words, “the miracle of the appropriateness of [...] mathematics for the formulation of the laws of physics is a wonderful gift which we neither understand nor deserve”.
13An interesting connection is established by Wigner [12] concerning the relationship between physical laws and events and that of symmetries and laws. Were we to know all future events, the correlations between them (the laws) would become unnecessary. In the same way, the full knowledge of the laws of nature (if such a thing is possible) would make their symmetry properties a mere curiosity.
14 1.5 The Asymmetry of Existence
We live in an apparently biased Universe in which there is, as far as one can see [53], a clear preference for the presence of matter over antimatter. One of the greatest challenges of modern physics is to explain the observed asymmetry (in the state of the system, not in the laws which govern it) in light of our present knowledge of cosmology and particle physics. How could this imbalance come to be? One assumes that it is either a consequence of the initial conditions of the Universe, i.e. an accident, or it is dynamically generated as the Universe cools down. The latter hypothesis calls for a mechanism – baryogenesis – that produces an excess of baryons over antibaryons at some point in the past and, therefore, allows for our own existence.
1.5.1 Experimental Evidence
Although antimatter has been produced and studied, no primordial antimatter has been detected in the observable Universe as of today. Local evidence for the baryon asymmetry of the Universe (BAU) includes the scarcity of antimatter in the Earth, the absence of γ-ray production due to solar wind, and the successful landing of planetary probes in the planets of our solar system. Cosmic rays offer an additional probe into far away regions of the Universe: nonvanishing ratios between the number of cosmic antiprotons and protons have been measured, the magnitude of which is consistent with secondary proton production in collisions of cosmic rays with matter in the interstellar medium, in processes such as p + p 3p + p [54]. Additionally, no antinuclei such as antihelium (He) → have been found in cosmic radiation [55]. In short, the cosmic ray data is consistent with the absence of large structures of antimatter in the Universe. An even more compelling evidence for the absence of such structures comes from the missing γ-ray background one would expect from the collision of galaxies and anti-galaxies, at the hypothetical boundaries of matter-antimatter domains.
1.5.2 The Tuning of Initial Conditions
One turns now to the problem of whether the observed BAU was generated dynamically or accidentally. The reason why the accidental asymmetry scenario, based on proper initial conditions, is usually rejected is twofold. On the one hand, generating the observed asymmetry would require one extra quark for every 107 antiquarks in the early Universe [56]. This presents itself as a problem of naturalness, and one ∼ usually avoids this much fine-tuning. On the other hand, a constraint arises if the effects of inflation are considered. Inflation corresponds to a rapid expansion of the scale factor in the early Universe (see Section 4.1.1) and is predicted to exponentially dilute any initial asymmetry. In the dynamical case, the symmetry violating processes should be weak enough in order to produce the observed values of the BAU, which can be quantified in the present epoch through the parameter η, defined as the ratio between the number density of baryonic number, nB, and that of photons, nγ :
n n n n η B b − b b . (1.5) ≡ nγ ≡ nγ ' nγ
15 The quantity nB is itself defined as the difference between the number densities of baryons (nb) and antibaryons (nb). Apart from the baryon-antibaryon imbalance, the value of η also signals the imbalance between matter and radiation in the present-day Universe. The abundances of light elements and the anisotropies of the cosmic microwave background (CMB) constrain the value of η independently. One verifies that both conditions give compatible values for the BAU parameter, which when combined give [37]:
η = (6.19 0.15) 10−10. (1.6) ± ×
An equivalent description is given by the quantity YB, corresponding to the ratio between nB and the current entropy density of the Universe. The approximate relationship YB η/7.04 holds. '
1.5.3 The Possibility of a B-Symmetric Universe
The observation of matter dominance does not seem to exclude the existence of large quantities of antimatter in far away regions of the Universe that would effectively make it baryon symmetric. Let us assume a B-symmetric Universe, with any possible initial asymmetry being washed out by inflation. One can then compute baryon and antibaryon densities at freeze-out – the time when the expansion rate of the Universe, measured by the Hubble parameter H(t), surpasses the rate Γ of nucleon-antinucleon annihilation (for details, see Section 4.1). These densities will coincide due to B-symmetry of the laws. The temperature at freeze-out (H(T ) Γ) can be estimated as T 20 MeV [57], well below the nucleon ∼ ∼ mass, m 1 GeV, for which the following relation holds (cf. Eq. (4.17)): N ∼ 3/2 nb nb mN = exp( mN /T ). (1.7) nγ nγ ∼ T − An unreasonable value of n /n 10−19 is thus obtained at freeze-out. One concludes that some b γ ∼ mechanism must have segregated matter and antimatter when n /n 10−10, which corresponds to b γ ∼ T 40 MeV, or earlier. However, such mechanism would need to operate at the scale of the observable ∼ Universe and, at this time, the horizon (causally connected region) only contained a negligible amount −7 of matter ( 10 M ) [54]. Therefore one cannot avoid the conclusion that the Universe must have ∼ possessed a baryon asymmetry already at early times. To understand the creation of this asymmetry, one has to go beyond the realms of pure cosmology and consider the role that particle physics may have played in its production. The following chapter is thus dedicated to the Standard Model of particle physics and to extensions which can account for neutrino masses, whose smallness will be a concern. Their presence allows for lepton mixing, which will be discussed in Chapter 3. A compelling case was presented for a dynamically generated baryon asymmetry, which implies the presence of an asymmetry within the physical laws (recall Curie’s principle). As previously hinted, the construction of the SM relies heavily on the concepts of symmetry and symmetry breaking, and one should investigate whether the asymmetry conditions for generating the BAU – the Sakharov conditions – are satisfied in its context. This will be done in Chapter 4, where a popular baryogenesis scenario accommodating naturally small neutrino masses – leptogenesis – is presented. In Chapter 5, the viability of leptogenesis in a model which includes a discrete flavour symmetry and spontaneous breaking of the CP symmetry is analysed.
16 The Standard Model of Particle Physics and (slightly) Beyond 2
Particle physics relies on the principle that the interactions between the constituents of the subatomic world can be described, to a remarkable approximation, by the Standard Model. In spite of its successes and repeated experimental verification, the SM cannot provide a satisfatory answer to some questions which remain open. Among these is the fact that neutrino masses cannot be accounted for in the model, as required by neutrino oscillations. In this chapter we review relevant aspects of the SM and consider extensions in which small neutrino masses arise in a natural way.
2.1 Recap of the Electroweak Sector of the SM
The Standard Model is a relativistic quantum Yang-Mills theory built by postulating both an un- derlying SU(3) SU(2) U(1) gauge symmetry group – the subscripts are labels which correspond, c× L× Y respectively, to colour, left-handedness, and hypercharge – as well as the following field content:
+ φ ναL uαL φ 0 2, 1/2 , `αL 2, 1/2 , qαL 2, 1/6 , ≡ φ ! ∼ ≡ lαL ! ∼ − ≡ dαL! ∼ (2.1) l 1, 1 , u 1, 2/3 , d 1, 1/3 . αR ∼ − αR ∼ αR ∼ − In the above, φ, `α and qα represent the Higgs, lepton, and quark doublets, respectively, while the fields uα and dα correspond to up- and down-type quarks, and να and lα to neutrinos and charged leptons. The index α runs over three generations (or families) of fermions. Absent from the postulated field content are the twelve gauge bosons which appear in the theory following the demand of local gauge invariance. One will henceforth focus on the electroweak sector, which corresponds to the SU(2) U(1) gauge group, ignoring the (unbroken) strong SU(3) symmetry. L× Y c The quantities given between parentheses in (2.1) refer to representation assigments under this electroweak i subgroup. The first one specifies how the field transforms under SU(2)L, to which three generators I are associated. A value of ‘1’ signals that the field transforms as a singlet, Ii = 0, whereas ‘2’ indicates that it transforms as a doublet, Ii = τ i/2 (τ i are the Pauli matrices). The second quantity refers to the value taken by the hypercharge operator Y , the single generator of U(1)Y , for each (non-gauge) field. The SM is also a chiral theory, meaning that its basic fermionic ingredients are eigenfunctions of the chirality matrix γ5. In fact, it is possible to decompose a spinor field ψ into these right-handed and
17 left-handed chiral eigenfunctions (with eigenvalues +1 and 1 respectively): −
ψ = ψR + ψL. (2.2)
One thus defines the operators PR,L which obey:
1 + γ5 1 γ5 ψ = ψ P ψ, ψ = − ψ P ψ. (2.3) R 2 ≡ R L 2 ≡ L
2 These operators are said to be chiral projectors since (PR,L) = PL,R, PR +PL = 1, and PRPL = PLPR =
0. The chiral fields ψR,L are two-component spinors and belong to the simplest nontrivial representations of the Lorentz group [58], which grants them a fundamental role in the construction of the SM. The dynamics of the fields are encoded in the Lagrangian (density) of the theory which reads1:
µ † 1 i iµν 1 µν SM = (D φ) (D φ) V (φ) A A B B L µ − − 4 µν − 4 µν (2.4) + i `αL D/ `αL + i qαL D/ qαL + i lαR D/ lαR + i uαR D/ uαR + i dαR D/ dαR
Yl ` φ l + H.c. Yu q φ u + H.c. Yd q φ d + H.c. . − αβ αL βR − αβ αL βR − αβ αL βR One recognizes, in the above expression, the Klein-Gordon,e Dirac and Proca kinetic Lagrangian terms for scalar (spin 0), spinor (spin 1/2) and vector (spin 1) fields, respectively. However, the requirement of gauge invariance under SU(2) U(1) implies a change to the usual kinetic terms of the fermions and L× Y scalar Higgs doublet fields, namely the substitution of the ordinary derivative by a covariant one [41]:
i i ∂ D ∂ i g2 A I i g B Y, (2.5) µ → µ ≡ µ − µ − Y µ where g2 and gY are coupling constants associated with the gauge groups SU(2)L and U(1)Y , respectively. i This leads to the introduction of four real boson fields Aµ and Bµ in the theory, whose transformation properties are such that the kinetic terms are kept invariant under the action of the gauge group.
A modification is also in order for the Proca Lagrangian. Whereas the term for the Bµ field is a typical one, with the usual definition B ∂ B ∂ B , the term for the Ai gauge fields relies on the µν ≡ µ ν − ν µ µ i following definition of Aµν , to ensure invariance of the kinetic Proca term under the group SU(2)L:
i i i ijk j k A ∂ A ∂ A + g2 ε A A , (2.6) µν ≡ µ ν − ν µ µ ν where εijk is the totally antisymmetric (Levi-Civita) tensor of rank 3. Further inspection of Eq. (2.4) reveals the absence of mass terms for all SM particles. Charged fermion masses cannot be written down without explicitly breaking the symmetry group, since – as one can see from Eq. (2.9) – there is no invariant way of combining an SU(2)L doublet with an SU(2)L singlet, as required by the form of a Dirac mass term PR,Lψ = ψPL,R : m ψ ψ = m ψ ψ + ψ ψ + ψ ψ + ψ ψ = m ψ ψ + ψ ψ . (2.7) − − R R R L L R L L − R L L R 1 Aside from ignoring the strong sector, one omits gauge fixing and Faddeev-Popov terms in LSM. The Feynman µ µ µ 0 µ† 0 slash notation is also employed: A/ ≡ γ Aµ, where γ are the Dirac matrices (µ = 0, 1, 2, 3) obeying γ = γ γ γ and µ ν µν 5 0 1 2 3 5† 5 2 µ {γ , γ } = 2g . The chirality matrix is given by γ ≡ γ5 ≡ iγ γ γ γ = γ = (γ ) and anticommutes with every γ (the number ‘5’ is a mere label, not a Dirac index). Additionally, one defines ψ ≡ ψ†γ0 and φe ≡ iτ 2φ∗.
18 This clash with reality is resolved though the Higgs mechanism: after electroweak symmetry breaking (EWSB), the Yukawa terms in the Lagrangian will give rise to the desired fermionic mass terms. The former make up the last line of (2.4), where Yl, Yu and Yd are general complex matrices and ‘H.c.’ denotes Hermitian conjugation. Mass terms for the SM bosons will also arise after EWSB. In particular, one might suppose that, due to the form of the scalar potential in SM, L 2 V (φ) µ2φ†φ + λ φ†φ , (2.8) ≡ there is a mass term for the scalar doublet components. However, this is not so straightforward since one requires µ2 < 0 for EWSB to occur. Thus, this term is not a Klein-Gordon mass term. Before proceeding, we present, for the sake of completeness, the explicit way in which the fields trans- form under and element g of the gauge group, parametrized by the four local parameters θi(x), η(x) : ψ U ψ = exp i θi(x) Ii exp i η(x) Y ψ. (2.9) → g