SidneyFest 2005
QFT & QCD: PAST, PRESENT, and FUTURE (March 18-19, 2005) Home | Department of Physics Home | Harvard University Home
S Poster | Guestbook | Program & Speakers | Videos | Letters | Slide Shows | The Blog | Harvard Gazette Article | Sidney's Student | Sidney's Genealogy | Sidney's Homepage I D VIDEOS N MOV1: Opening remarks by John Huth and Arthur Jaffe MOV2: Norman Ramsey's Introduction of David Gross followed by David Gross's talk, "The Future of Physics"
E MOV3: Lisa Randall's Introduction of Frank Wilczek followed by Frank Wilczek's talk, "Asymptotic Freedom: From Paradox to Paradigm" Y MOV4: Welcome by President Lawrence Summers F MOV5: Greg Moore's Introduction of Paul Steinhardt followed by Paul Steinhardt's talk, "Cosmology in a False Vacuum" MOV6: Leon Cooper's Introduction of Murray Gell-Mann followed by Murray Gellman's talk, "Recollections of Sidney"
E MOV7: Howard Georgi's Introduction of Sheldon Glashow followed by Sheldon Glashow's talk, "Small Matrices, Sidney, and Me" S MOV8:Loeb House Dinner Program, part I T MOV9:Loeb House Dinner Program, part II MOV10: Alan Guth's Introduction of Erick Weinberg followed by Erick Weinberg's talk, "Vacuum Tunneling in de Sitter Space--QFT in the Past and in the Future"
MOV11: Kenneth Wilson's Introduction of Steven Weinberg followed by Steven Weinberg's talk, "Cosmological Correlations"
2 MOV12: Nathan Seiberg's Introduction of Gerard 't Hooft followed by Gerard 't Hooft's talk, "Symmetry and Sidney" 0 MOV13: Arthur Jaffe's Introduction of Edward Witten followed by Edward Witten's talk, "Emergent Phenomena in Condensed Matter and Particle Physics" 0 The videos are in MOV (Apple's QuickTime) format. If you don't already have QuickTime on your computer, you may download it from: QuickTime 5 QuickTime alternative (for Windows' standard players)
For questions and comments, please contact Barbara Drauschke or Arthur M. Jaffe
http://media.physics.harvard.edu/QFT/video.htm[2015/1/20 16:19:15] Physics 253: Quantum Field Theory Lectures by Sidney R. Coleman | Harvard University Department of Physics
CONTACT VISIT APPLY MAKE A GIFT
HOME ABOUT ACADEMICS RESEARCH PEOPLE RESOURCES ADMINISTRATION NEWS & EVENTS Physics 253: Quantum Field Theory Lectures by Sidney R. Coleman
Recorded in 1975-1976. The videos shown here were transferred to DVD in 2007 straight from surviving VHS tapes, which were, in turn, copied from the original source tapes. As such, the quality of the image has suffered and is as good as currently feasible without resorting to an expensive remastering process. Made in 1975-1976, the videos were shot using a black and white tube video camera. The black halo or vignette effect you see around the image is caused by the camera's Image Orthicon pickup tube.
Harvard affiliates have the option of borrowing DVDs of these lecture from Physics Research Library. Please note, however, that the resolution on the DVDs is comparable - or only marginally better - to that of the online video archive files.
Professor Coleman's wit and teaching style is legendary and, despite all that may have changed in the 35 years since these lectures were recorded, many students today are excited at the prospect of being able to view them and experience Sidney's particular genius second-hand.
Notes by Brian Hill, typeset and edited by Yuan-Sen Ting and Bryan Chen Gin: http://arxiv.org/abs/1110.5013
Choose a lecture to watch:
lecture 1 lecture 2 lecture 3 lecture 4 lecture 5 lecture 6 lecture 7 lecture 8 lecture 9 lecture 10 lecture 11 lecture 12 lecture 13 lecture 14 lecture 15 lecture 16 lecture 17 lecture 18 lecture 19 lecture 20 lecture 21 lecture 22 lecture 23 lecture 24 lecture 25 lecture 26 lecture 27 lecture 28 lecture 29 lecture 30 lecture 31 lecture 32 lecture 33 lecture 34 lecture 35 lecture 36 lecture 37 lecture 38 lecture 39 lecture 40 lecture 41 lecture 42 lecture 43 lecture 44 lecture 45 lecture 46 lecture 47 lecture 48 lecture 49 lecture 50 lecture 51 lecture 52 lecture 53 lecture 54
back to the Video Archive
News and Events: Recent News Science Research Public Lectures Physics Event Calendar Career Event Calendar Faculty Awards Loeb and Lee Lectures Archive Monday Colloquium Archive Room Booking Calendar Video Archive
HARVARD UNIVERSITY HARVARD DIRECTORY | HARVie | FINANCIAL FORMS Department of Physics Login | Logout 17 Oxford Street Cambridge, MA 02138 (617) 495-2872 phone Connect with us: (617) 495-0416 fax
Harvard University | Faculty of Arts And Sciences | Division of Science Copyright © 2015 The Presidents and Fellows of Harvard College
https://www.physics.harvard.edu/events/videos/Phys253[2015/1/20 16:11:00] Sidney Coleman
For the football player, see Sidney Coleman (American touched on subnuclear theory…These lectures football). span fourteen years, from 1966 to 1979. This was a great time to be a high-energy theorist, Sidney Richard Coleman (7 March 1937 – 18 Novem- the period of the famous triumph of quantum field theory. And what a triumph it was, in ber 2007) was an American theoretical physicist who studied under Murray Gell-Mann. He is notable for his the old sense of the word: a glorious victory parade, full of wonderful things brought back research in high-energy theoretical physics. from far places to make the spectator gasp with awe and laugh with joy. I hope some of that 1 Life and work awe and joy has been captured here.
Sidney Coleman grew up on the Far North Side of Coleman’s lectures at Harvard were legendary. Students Chicago. In 1957, he received his undergraduate degree in one quantum field theory course created T-shirts bear- from the Illinois Institute of Technology physics depart- ing his image and a collection of his more noted quota- ment. tions, among them: “Not only God knows, I know, and by the end of the semester, you will know.” Despite this He received his PhD from Caltech in 1962, and moved acclaim, he did not generally enjoy teaching or mentoring to Harvard University that year, where he spent his entire graduate students: career, meeting his wife Diana there in the late 1970s. They were married in 1982. “He was a giant in a peculiar sense, because he’s not I hate [teaching]. You do it as part of the known to the general populace,” Nobel laureate Sheldon job. Well, that’s of course false ... or maybe Glashow told the Boston Globe. “He’s not a Stephen more true than false when I say I hate it. ... Hawking; he has virtually no visibility outside. But within But I certainly would be just as happy if I had the community of theoretical physicists, he’s kind of a no graduate students. ... Occasionally there is major god. He is the physicist’s physicist.”[1] a graduate student who is a joy to collaborate with. Both David Politzer and Erick Weinberg In 1966, Antonino Zichichi recruited Coleman as a lec- were of this kind, but they were essentially al- turer at the then-new summer school at International most mature physicists. They were very bright School for Subnuclear Physics in Erice, Sicily. A leg- by the time they came to me. In general, work- endary figure at the school throughout the 1970s and early ing with a graduate student is like teaching a 1980s, Coleman was awarded the title “Best Lecturer” on course. It’s tedious, unpleasant work. A pain the occasion of the school’s fifteenth anniversary (1979). in the neck. You do it because you're paid to His explanation of spontaneous symmetry breaking in do it. If I weren't paid to do it I certainly would terms of a little man living inside a ferromagnet has often never do it.[5] been cited by later popularizers.[2][3] The classic particle physics text Aspects of Symmetry (1985) is a collection of Coleman’s lectures at Erice. A quote from his introduc- In 1989, Coleman was awarded the NAS Award for Sci- tion to the book is worth sharing here:[4] entific Reviewing from the National Academy of Sci- ences. That award praised his “lucid, insightful, and in- I first came to Erice in 1966, to lecture fluential reviews on partially conserved currents, gauge theories, instantons, and magnetic monopoles--subjects at the fourth of the annual schools on subnu- [6] clear physics organized by Nino Zichichi. I was fundamental to theoretical physics.” In 2005, Harvard charmed by the beauty of Erice, fascinated by University’s physics department held the “SidneyFest”, a the thick layers of Sicilian culture and history, conference on quantum field theory and quantum chro- and terrified by the iron rule with which Nino modynamics, organized in his honor. kept the students and faculty in line. In a word, Aside from his academic work, Coleman was a prominent I was won over, and I returned to Erice every science fiction enthusiast. He was one of the founders year or two thereafter, to talk of what was past, of Advent: Publishers[7] and occasionally reviewed genre or passing, or to come, at least insofar as it books for The Magazine of Fantasy and Science Fiction.[8]
1 2 4 EXTERNAL LINKS
2 Contributions to physics 4 External links
Some of his best known works are • Sidneyfest - physicists’ celebration of Sidney Cole- man’s life [9] • Coleman–Mandula theorem • Chicago Tribune obituary, November 20, 2007.
• Tadpoles • Harvard Gazette obituary, November 29, 2007.
• Coleman theorem[10] • Boston Globe obituary, January 20, 2008.
• Equivalence of the Thirring model and the quantum • Physics Today obituary, May 2008, written by sine-Gordon equation[11] Sheldon Glashow. • • Semiclassical analysis of the fate of a false vacuum “Quantum Mechanics In Your Face”, A lecture by Prof. Coleman at the New England sectional meet- • Coleman–Weinberg potential ing of the American Physical Society April 9, 1994.
• Q-balls in the thin-wall limit • Physics 253: Quantum Field Theory. Video of lec- tures by Sidney Coleman at Harvard in 1975-1976. • Lectures at Erice, some of which are preserved in his • book Aspects of Symmetry[4] (review and teaching) Sidney Coleman at the Mathematics Genealogy Project
• National Academy of Sciences Biographical Mem- 3 Notes oir
[1] Sidney Coleman; Harvard icon taught physics classes with wit
[2] L. Ryder, “Symmetry breaking”, J. Phys. A: Math. Gen. 38, 9729 (2005)
[3] Brading, Katherine and Castellani, Elena, “Symmetry and Symmetry Breaking”, The Stanford Encyclopedia of Phi- losophy (Fall 2008 Edition), Edward N. Zalta (ed.),
[4] Sidney Coleman (1988). Aspects of Symmetry. Cam- bridge University Press. ISBN 0-521-31827-0.
[5] Sopka, Katherine. “Oral History Transcript - Interview with Dr. Sidney Coleman” American Institute of Physics, Niels Bohr Library & Archives (January 19, 1977)
[6] “NAS Award for Scientific Reviewing”. National Academy of Sciences. Retrieved 27 February 2011.
[7] “Sidney Coleman Dies at 70”
[8] ISFDB bibliography
[9] Sidney Coleman and Jeffrey Mandula (1967). “All Pos- sible Symmetries of the S Matrix”. Physical Review 159 (5): 1251–1256. Bibcode:1967PhRv..159.1251C. doi:10.1103/PhysRev.159.1251.
[10] Sidney Coleman (1973). “There are no Goldstone bosons in two dimensions”. Communications in Mathematical Physics 31 (4): 259. Bibcode:1973CMaPh..31..259C. doi:10.1007/BF01646487.
[11] Coleman, Sidney (1975). “Quantum sine-Gordon equa- tion as the massive Thirring model”. Physical Review D 11 (8): 2088–2097. Bibcode:1975PhRvD..11.2088C. doi:10.1103/PhysRevD.11.2088. national academy of sciences
sidney coleman 1 9 3 7 – 2 0 0 7
A Biographical Memoir by howard georgi
Any opinions expressed in this memoir are those of the author and do not necessarily reflect the views of the National Academy of Sciences.
Biographical Memoir
Copyright 2011 national academy of sciences washington, d.c.
SIDNEY COLEMAN March 7, 1937-November 18, 2007
BY h o w a r d g e o r g i
idney coleman began his Ph.D. thesis (1962) with a quota- Stion from Justine by the Marquis de Sade: “What we do here is nothing to what we dream of doing.” Coleman’s bril- liant career as one of the leading quantum field theorists in the world probably surpassed even his graduate student dreams. He was an indispensible player in the resurgence of quantum field theory in the 1960s and 1970s, and indeed he taught particle physicists, established and aspiring, a new way of thinking about quantum field theory. There is a telling and audacious short paragraph early in his thesis.
To step out from behind the first person plural for a moment, I like to think of this [first] part of the work as a calculation—not strikingly different in spirit from a calculation of a radiative correction. It is true that the first half of this thesis does not look like the usual physics calculation, but that is only because the computational tools are algebraic rather than analytic. Symmetry had been an important tool in physics from the beginning. But already as a graduate student Coleman understood that he was bringing to the forefront a set of algebraic techniques in group representation theory that had usually played a secondary role to analysis in theoretical particle physics. The important objects were not the symmetry transformations themselves but their “generators”—associated
B i o g r a P h i c a l m e m o i r s with infinitesimal transformations. The mathematical tools were well known to physicists for the simplest compact Lie algebras because of the connection with angular momentum operators, the generators of rotations. For more complicated symmetries they were not part of the standard theoretical
physics toolbox.1 Coleman loved symmetry and quantum field theory. He understood the amazing power of these algebraic tools in working with quantum field theory. He knew that he had to explain them carefully to his elders (and eventu- ally to his students). And he knew that he had the requisite skills to do it. Sidney Coleman was born in Chicago on March 7, 1937. His father, a businessman, died when he was nine, and the family lived through hard times in the far North Side of Chicago. His brother, Robert L. Coleman, says2 that in the 1940s Sidney became interested in the building of the atomic bomb and declared his ambition to become a physicist. In high school he and a friend built a primitive computer and won the Chicago Science Fair. He graduated from the Illinois Institute of Technology in 1957 and went on to graduate school at Caltech, where he studied with Murray Gell-Mann and collaborated with Sheldon Glashow. In 1961 Coleman came to Harvard as the Corning Lecturer and Fellow. For over 30 years he was the magiste- rial leader of the Harvard group in particle theory. He was a legendary character as well as a leading theorist. I learned group theory from Coleman when I was an undergraduate at Harvard in the mid 1960s, and it was he who phoned me in 1971 after my four years away in graduate school to offer me a postdoctoral position at Harvard, a call I will never forget. In this memoir, whenever possible, I have let him speak for himself, because he did it so well in his writing. A remarkably clear thinker, a polymath, and a natural expositor and pedagogue, Coleman was usually not the first sidney coleman to come up with a new approach to a phenomenological puzzle; but he was always among the first to understand new theoretical ideas, often much more clearly than the inventors themselves. He was often the first to put new theoretical ideas on a firm footing and to understand their connection with deep issues in the foundations of physics. And he frequently took the lead in explaining them clearly to the community. As his Nobel Prize-winning colleague Steven Weinberg said, “Sidney was most interested in under- standing the foundations of theory rather than the special cases relevant to describing nature, and he revealed many of the deepest aspects of this grand theoretical structure through his work.” Many of Coleman’s best-known contribu- tions take the shape of theorems, or equivalence relations: the Coleman-Norton theorem (Coleman and Norton, 1965); the Coleman-Mandula theorem (Coleman and Mandula, 1967); “The Invariance of the vacuum is the invariance of the world” (Coleman, 1966); “There are no Goldstone bosons in two-dimensions” (Coleman, 1973); “Quantum sine-Gordon equation as the massive thirring model” (Coleman, 1975), and many more. Coleman’s approach to particle theory is well illustrated in the sequel to the story of higher approximate symmetries discussed in his thesis. Coleman’s many clear papers and lectures (some explicitly pedagogical with names like “Fun with SU(3)”) played a crucial role in making Gell-Mann’s discovery of the eight-fold way (Gell-Mann, 1961; Ne’eman, 1961) accessible to the quantum field theory community.F or many quantum field theorists the 1960s became the decade of higher symmetries. Many important contributions came out of this, perhaps most notably the structure of what we now know as the standard model. As often happens when theorists are given new toys to play with, many got carried away and took the ideas too far in the wrong directions: B i o g r a P h i c a l m e m o i r s approximate symmetries combining Gell-Mann’s SU(3) with symmetries of space and time. Coleman dealt with this by proving a theorem. The Coleman-Mandula theorem “on the impossibility of combining space-time and internal symme- tries in any but a trivial way” put much needed brakes on a runaway idea. The Coleman-Mandula theorem is also an illustration of an issue that a number of Coleman’s colleagues expe- rienced. Sometimes Coleman knew too much. There is an important loophole in the Coleman-Mandula theorem. It did not apply to supersymmetry because they did not consider the possibility of anticommuting parameters. Thinking in terms of theorems may lead you astray if your assumptions are not appropriate. Coleman knew so much that while it could be very valuable to consult with him about a nascent idea, it could also be discouraging. Sometimes, in order to make progress, one has to ignore (or be ignorant of) the known reasons why something doesn’t work. Coleman knew them all, and his colleagues had to learn to use his immense knowledge without being too intimidated or discouraged from trying crazy ideas. When Coleman wrote his thesis, it was not obvious what would be the most important application of higher symmetry in quantum field theory to particle physics. Gell-Mann’s approximate SU(3), the subject of Coleman’s thesis, we now think of as an accidental by-product of the fundamental theory of the strong interaction, arising because the values of some parameters (the u, d, and s quark masses) are small (for reasons that we still do not understand). Much more important is that the dynamics of the strong interactions and of the weak and electromagnetic interactions as well all involve symmetry in an essential way. A crucial step had come long before with Yang and Mills (1954) and Utiyama (1956) who wrote down theories with gauge symmetry in which the sidney coleman symmetry was tied to the fundamental interactions. These were extensions of quantum electrodynamics to incorporate non-Abelian symmetry. This was a time bomb. These theo- ries were mathematically fascinating and seemed somehow important physically, but nobody quite knew what to make of them when they were first written down, in part because they seemed to require massless particles that were not seen in the world. Even after Peter Higgs and others showed how to hide the symmetries and eliminate the massless particles by spontaneously breaking the gauge symmetry, and Weinberg, combining this with the symmetry structures suggested by Glashow and Salam and Ward had written down what would turn out to be the right theory of the weak interactions, the fundamental problem remained. No one quite knew what these theories meant. When Gerard ’t Hooft (1971a,b); ‘t Hooft and Martinus Veltman (1972); and others finally figured this out in the early 1970s, the floodgates opened because quantum field theorists had a huge new world of theories that they suddenly had the tools to explore. At the same time, experimental particle physicists were pushing their machines beyond the 1 GeV energy scale and begin- ning to see evidence of new and surprising physics at (what we then thought of as) high energy. The next few years were a remarkable confluence of theoretical and experimental progress in particle physics. While Coleman’s contributions to the enormous progress made in particle theory in the 1970s were huge, he was not directly involved in interpreting the exciting experimental results. Indeed, it was characteristic of Coleman that many of his deepest and most important contributions are hidden in long papers that might seem to the casual observer to be purely technical, working out of some minor mathematical detail. Two wonderful examples of this from the 1970s are the papers “Radiative Corrections as the Origin of Sponta- B i o g r a P h i c a l m e m o i r s neous Symmetry Breaking” (Coleman and Weinberg, 1973) and “Quantum sine-Gordon Equation as the Massive Thirring Model” (Coleman, 1975). In the first of theseC oleman and his student Erick Weinberg solve a puzzle. They begin thus,
Massless scalar electrodynamics, the theory of the electromagnetic interac- tions of a mass-zero charged scalar field, has had a bad name for a long time now; the attempt to interpret this theory consistently has led to endless paradoxes. In this paper we describe how nature avoids these paradoxes: Massless scalar electrodynamics does not remain massless, nor does it remain electrodynamics. In fact, this paper was much more than a consistent account of a pathological theory. It was enormously influential as a handbook for dealing with scale violation in quantum field theory. Coleman had been thinking hard about scale invariance since the late 1960s. In this paper, written soon after the revolution of spontaneously broken non-Abelian gauge theories, Coleman pulled together all the most useful techniques and described them with his characteristic clarity. In the process he discovered an important and very general phenomenon.
The surprising thing is that we have traded a dimensionless parameter, α, on which physical quantities can depend in a complicated way, for a dimensional one…on which physical quantities must depend in a trivial way, governed by dimensional analysis. We call this phenomenon dimensional transmutation.” We now know that dimensional transmutation is respon- sible for many of the surprising features of the strong inter- actions at high energies that were appearing in experiments when this paper was written. Coleman’s student, David Politzer, collaborated with Coleman and Erick Weinberg on some parts of the Radiative Corrections paper, and it was in this process that Politzer became interested in calculating the scaling properties of a sidney coleman non-Abelian gauge theory with no scalars. This led Politzer to the discovery of asymptotic freedom, also found at Princ- eton by David Gross and Frank Wilczek, for which the trio were awarded the 2004 Nobel Prize.3 Asymptotic freedom and dimensional transmutation, along with quark confine- ment, are the three dynamical pillars of QCD—our theory of the strong interactions based on the non-Abelian gauge theory SU(3). This theory incorporates and explains Gell- Mann’s approximate SU(3) symmetry that was the subject of Coleman’s thesis (Fritzsch et al., 1973; Weinberg, 1973), and as developed by Politzer and others, it led to the QCD parton model that now allows us to interpret the results of high-energy experiments with protons in terms of the funda- mental physics of the quarks and gluons inside. In “Quantum sine-Gordon equation as the massive thirring model” Coleman (1975) studied a pair of quantum field theories in one space and one time dimension. Neither of these theories is particularly important in itself (and certainly not very relevant to our world with three space dimensions). But in a masterful (and as usual exquisitely documented) analysis Coleman identified a precise equivalence between the two. He says,
Thus, I am led to conjecture a form of duality…for this two-dimensional theory. A single theory has two equally valid descriptions in terms of Lagrangian field theory: the massive Thirring model and the quantum sine-Gordon equation. The particles which are fundamental in one description are composite in the other:…Speculation on extending these ideas to four dimensions is left as an exercise for the reader. This concept of duality—that seemingly totally different classical theories can nonetheless describe exactly the same physics at the quantum level—became a central theme in the superstring revolution of the mid 1990s and continues to be central in field theory and string theory to this day. 10 B i o g r a P h i c a l m e m o i r s In the late 1970s and 1980s the results and the techniques in Coleman’s work on vacuum decay were crucial for the beginnings of the quantitative description of the beginning of the universe. He also addressed one of the deepest puzzles in physics in “Why there is nothing rather than something: A theory of the cosmological constant” (Coleman, 1988a). Here he argued that quantum gravity makes the value of the cosmological constant a quantum variable rather than a parameter, so that in his words, “The predictive power of the theory of everything has gone down the wormhole.” He further argued that a zero constant was much more likely than any other value, though he acknowledged that the Euclidean formulation of gravity is not a subject with firm foundations and clear rules of procedure; indeed, it is more like a trackless swamp. I think I have threaded my way through it safely, but it is always possible that unknown to myself I am up to my neck in quicksand and sinking fast. In this case he may well have been in quicksand, but his approach set the stage (for better or worse) for probabilistic approaches to the puzzle. For much of his career Coleman was the preeminent teacher of quantum field theory in the world and his approach to the subject, relying heavily on beautiful symmetry argu- ments, had enormous influence. He had 40 Ph.D. students, many of whom became leaders in high-energy theory and other areas of physics. Many hundreds of students from all over the Boston area attended his superbly organized and witty lectures on quantum field theory, and his notes formed the basis of courses and eventually textbooks used worldwide. Students and colleagues pored over his classic papers and summer school lectures. These were masterpieces. Coleman labored over them until no word was out of place and no explanatory or pedagogical opportunity was missed. Andrew Cohen tells the following personal story: sidney coleman 11
[It] happened when I was a beginning grad student while we were working on the Evaporation of Q-balls [Cohen et al., 1986]. Aneesh [Manohar] (who was my roommate at the time) was worried that Sidney’s exacting writing standards would mean that the paper would take forever to write. I suggested we go talk to Sidney and try to get him started early on the writing. When we went into Sidney’s office Aneesh blurts out “Andy has volunteered to write a draft of the introduction.” After Sidney gave me the evil eye for a moment he (seemingly reluctantly) agreed. I was terrified. I eventually went to Sidney’s previous paper where he introduced the notion of Q-balls, and through cutting and pasting managed to produce most of a coherent intro- duction, using essentially Sidney’s own words. The next morning I slipped it under his door and waited for him to come in. Sometime in the middle of the afternoon Sidney comes to find me and says “I was worried about having you work on the introduction, but this writing is fantastic!” Many of Coleman’s lectures were collected in his book Aspects of Symmetry (Coleman, 1988b). In 1989 he received the Award for Scientific Reviewing from the National Academy of Sciences for his “lucid, insightful, and influential reviews.” While his first love was the teaching of graduate-level quantum field theory, Coleman also gave brilliant under- graduate lectures. This was a personal sacrifice, because Coleman was renowned for doing his best work in the wee hours of the morning, and it was never clear whether he was better off getting a few hours of sleep before a late morning undergraduate class, or simply staying up for it. Fortunately, some of his lectures survive and are collected on the Harvard Physics Department Web page.4 Perhaps the most famous is “Quantum mechanics in your face” given at the New England sectional meeting of the American Physical Society (Apr. 9, 1994) in which Coleman pokes very educa- tional fun at the concept of the reduction of the wave packet. The talk contains a great selection of Coleman jokes. For example, explaining that the talk is pedagogical and that nothing in it is original, he says, “I claim some responsibility but no credit—the reverse of the usual scholarly procedure.” 12 B i o g r a P h i c a l m e m o i r s But he goes on to explain clearly with just first-year quantum mechanics that there is no problem with the interpretation of quantum mechanics. “The problem is the interpretation of classical mechanics.” Not a cloistered academic, Coleman was a public intel- lectual in the best sense. He served behind the scenes as a science adviser to a number of movies and NOVA programs. He had a deep and lifelong interest in science fiction and wrote and published science fiction criticism himself. As a teenage college student he was one of the cofounders of Advent Publishers, which is devoted to science fiction criticism. Coleman’s friend and cofounder of Advent, Earl Kemp, collected many Coleman memories for his online efanzine eI, in an issue5 devoted to Sidney Coleman and Kurt Vonnegut. Many of these are Coleman stories that Coleman loved to tell himself. Robert Lichtman writes,
I also wish I could locate a copy of Sidney’s account of the first time he was a Visiting Lecturer at Cambridge (or possibly Oxford). After dinner in the Commons, a silver snuff-box was, as per Tradition, passed from hand to hand. Being Sidney, he opened it and took a pinch. Anyone who knows him can visualize his slow smile when one of the Dons informed him that he was the first person to do that in more than a hundred years. If memory serves, he noted that it was fresh snuff, and mentioned that “of course one expects an old British University to pay attention to such details.” Robert Silverberg writes,
While traveling—alone—in France in the early 1970s, Sidney unexpectedly contracted a case of what turned out to be crabs. “Unexpectedly” because this is customarily a venereal disease, and he had been a model of chastity throughout his trip. The offending organisms must have been concealed in the bedding of his hotel room, he decided, and so he had suffered a case of punishment without the crime. But during the trip he had not, however, remained true to the dietary restrictions imposed by the religious doctrines of his forefathers; and, he said, after visiting a French doctor and having his ailment diagnosed for what it was, he was granted a vision of sidney coleman 13
his Orthodox grandfather rising up in wrath before him and thundering, “Thou hast eaten crustaceans, child, and now thou shalt be devoured by crustaceans thyself!” Though Coleman’s involvement with Advent wound down and he eventually retired in 2001, it is remarkable that he was still writing serious science fiction criticism while he was transforming particle theory in the 1970s. Here is a snippet from a review in 1974, reprinted in a moving website put together as a memorial by his science fiction colleagues.6
I do not know why Zelazny began this process of reverse alchemy five years ago, why he put away his magician’s tricks and turned his gold into lead. Maybe he simply ran out of steam; it happens often enough in literary careers; being a genius is a profession for the young. Or it might have been the pressures of the market. Zelazny began free-lancing full time about five years ago, and the economics of sf writing are not such as to allow time for tinkering with the elaborate and delicate machineries of wit. I don’t know why; all I know is that we once had something unique and wonderful, and it is gone. Coleman was an accomplished amateur magician in his teenage years. Coleman received many awards: the Boris Pregel Award from the New York Academy of Sciences; the Award for Lectures in Physics, Centro Ettore Majorana (International School of Physics, Erice); the Dirac Medal from the Inter- national Centre for Theoretical Physics; and the Dannie Heineman Prize from the American Physical Society. He was a fellow of the American Physical Society, the American Academy of Arts and Sciences, and was elected to member- ship in the National Academy of Sciences in 1980. Coleman’s wit could be as biting as it was clever, and his friends bore the brunt of this and loved it. They could count on him to keep their head sizes under control. “Courtesy,” Coleman argued, “is for strangers. Kindness is for friends.” Health problems bedeviled the end of Coleman’s life and 14 B i o g r a P h i c a l m e m o i r s deprived the world of what would surely have been an affec- tionately irreverent elder statesmanship. In the words of Sheldon Glashow, one of Coleman’s best friends throughout his adult life, and in a scientific association of almost 40 years Coleman’s first and last collaborator on theoretical particle physics (Coleman and Glashow, 1961, 1962, 1964,1997, 1998, 1999; Coleman et al., 1964, 1966): “Sidney was both an incomparable teacher and the most learned sage and sharpest critic in the world of theoretical physics: He was Pauli’s tongue in Einstein’s image. We have been deprived all too soon of one of our generation’s most profound and imaginative minds.” Sidney Coleman is survived by his wife of 25 years, Diana Coleman of Cambridge, Massachusetts; and his brother, Robert Coleman of Albany, California; and many friends and admirers around the world. We once had something unique and wonderful, and it is gone. Much of this memoir was written while I was at the Aspen Center for Physics. For many years Coleman would spend every other summer at the center. He was very proud of having learned to ride a bicycle there 20 years ago. His wife Diana taught him to ride on the tennis courts near the center, and cycling became an important part of his life. He cared deeply about the Aspen Center for Physics and at various times was an advisory board member and trustee. It was inspiring to remember Coleman’s contributions in a place that along with Erice, in Italy, he loved as much as any place in the world. I hope that this memoir will serve as a lasting reminder of the contributions of the Aspen Center for Physics to the community.7
i am grateful to Nima Arkani-Hamed, Andrew Cohen, Diana Coleman, Edward Farhi, Sheldon Glashow, Roberta Gordon, Howard Haber, Arthur Jaffe, Earl Kemp, Ken Lane, Aneesh Manohar, Stephen Parke, and Michael Turner for help and suggestions. sidney coleman 15
NOTES 1. though they were certainly understood and used by some math- ematically sophisticated theorists; see, for example, R. Utiyama’s paper (1956) generalizing Yang-Mills theory (Yang and Mills, 1954) to an arbitrary Lie group. 2. as reported in the delightful memorial by Roberta Gordon in the Harvard Gazette, Nov. 29, 2007, news.harvard.edu/gazette/story/2007/11/ sidney-coleman-dies-at-70/. For additional Coleman stories see the blog at betsydevine.com/blog/2007/11/20/our-friend-sidney-coleman-has-left-the- planet/. 3. the full story is told in Politzer’s Nobel Prize lecture in which Politzer (2005) refers to Coleman, for good reason, as “my beloved teacher.” 4. physics.harvard.edu/about/video.html. 5. efanzines.com/EK/eI10/index.htm. 6. efanzines.com/EK/eI36/index.htm#sid. 7. this material is based upon work supported by the National Science Foundation under grants no. 0804450 and 1066293. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation.
REFERENCES Cohen, A. G., S. R. Coleman, H. Georgi, and A. Manohar. 1986. The evaporation of Q balls. Nucl. Phys. B 272:301. Coleman, S. R. 1962. The Structure of Strong Interaction Symme- tries. Ph.D. thesis. Department of Physics. California Institute of Technology. thesis.library.caltech.edu/2386/. Coleman, S. R. 1966. The invariance of the vacuum is the invariance of the world. J. Math. Phys. 7:787. Coleman, S. R. 1973. There are no Goldstone bosons in two-dimen- sions. Commun. Math. Phys. 31:259-264. Coleman, S. R. 1975. Quantum sine-Gordon equation as the massive Thirring model. Phys. Rev. D 11:2088. Coleman, S. R. 1988a. Why there is nothing rather than something: A theory of the cosmological constant. Nucl. Phys. B 310:643. Coleman, S. R. 1988b. Aspects of Symmetry: Selected Erice Lectures. Cambridge, U.K.: Cambridge University Press,. 16 B i o g r a P h i c a l m e m o i r s
Coleman, S. R., and S. L. Glashow. 1961. Electrodynamic properties of baryons in the unitary symmetry scheme. Phys. Rev. Lett. 6:423. Coleman, S. R., and S. L. Glashow. 1962. Chiral symmetries. Ann. Phys.17:41-60. Coleman, S. R., and S. L. Glashow. 1964. Departures from the eight- fold way: Theory of strong interaction symmetry breakdown. Phys. Rev. B 134:671-681. Coleman, S. R., and S. L. Glashow. 1997. Cosmic ray and neutrino tests of special relativity. Phys. Lett. B 405:249-252. Coleman, S. R., and S. L. Glashow. 1998. Evading the GZK cosmic-ray cutoff. http://arxiv.org/abs/hep-ph/9808446v1. Coleman, S. R., and S. L. Glashow. 1999. High-energy tests of Lorentz invariance. Phys. Rev. D 59:116008. Coleman, S. R., and J. Mandula. 1967. All possible symmetries of the S matrix. Phys. Rev. 159:1251-1256. Coleman, S, R. and R. E. Norton. 1965. Singularities in the physical region. Nuovo Cimento 38:438-442. Coleman, S. R., and E. J. Weinberg. 1973. Radiative corrections as the origin of spontaneous symmetry breaking. Phys. Rev. D 7:1888- 1910. Coleman, S., S. L. Glashow, and D. J. Kleitman. 1964. Mass formulas and mass inequalities for reducible unitary multiplets. Phys. Rev. B 135:779-782. Coleman, S. R., S. Glashow, H. Schnitzer, and R. Socolow. 1966. Elec- tromagnetic mass differences of strongly interacting particles. In Proceedings of the XII International Conference on High Energy Physics, Dubna, 1964, vol. 1, pp. 785-787. Moscow: Atomizdat. Fritzsch, H., M. Gell-Mann, and H. Leutwyler. 1973. Advantages of the color octet gluon picture. Phys. Lett. B 47:365-368. Gell-Mann, M. 1961. The Eightfold Way: A Theory of Strong Inter- action Symmetry. California Institute of Technology Synchrotron Laboratory Report CTSL-20. Unpublished. ‘t Hooft, G. 1971a. Renormalizable Lagrangians for massive Yang- Mills fields. Nucl. Phys. B 35:167-188. ‘t Hooft, G. 1971b. Renormalization of massless Yang-Mills fields. Nucl. Phys. B 33:173-199. ‘t Hooft, G. and M. J. G. Veltman. 1972. Regularization and renor- malization of gauge fields. Nucl. Phys. B 44:189-213. sidney coleman 17
Ne’eman, Y. 1961. Derivation of strong interactions from a gauge invariance. Nucl. Phys. 26:222-229. Politzer, H. D. 2005. The dilemma of attribution. Proc. Natl. Acad. Sci. U. S. A. 102:7789-7793. Utiyama, R. 1956. Invariant theoretical interpretation of interaction. Phys. Rev. 101:1597-1607. Weinberg, S. 1973. Current algebra and gauge theories. 2. Non- Abelian gluons. Phys. Rev. D 8:4482-4498. Yang, C.-N., and R. L. Mills. 1954. Conservation of isotopic spin and isotopic gauge invariance. Phys. Rev. 96:191-195. 18 B i o g r a P h i c a l m e m o i r s
s e l e c t e d B IB l i o g r a P h y
1964 With S. L. Glashow. Departures from the eightfold way: Theory of strong interaction symmetry breakdown. Phys. Rev. B 134:671-681. 1967 With J. Mandula. All possible symmetries of the S matrix. Phys. Rev. 159:1251-1256. 1969 With J. Wess and B. Zumino. Structure of phenomenological Lagrang- ians. 1. Phys. Rev. 177:2239-2247. With C. G. Callan Jr., J. Wess, and B. Zumino. Structure of phenom- enological Lagrangians. 2. Phys. Rev. 177:2247-2250. 1970 With C. G. Callan Jr. and R. Jackiw. A new improved energy-momentum tensor. Ann. Phys. 59: 42-73. 1971 With R. Jackiw. Why dilatation generators do not generate dilatations. Ann. Phys.67:552-598. 1973 With E. J. Weinberg. Radiative corrections as the origin of sponta- neous symmetry breaking. Phys. Rev. D 7:1888-1910. 1974 With R. Jackiw and H. D. Politzer. Spontaneous symmetry breaking in the (N) model for large N*. Phys. Rev. D 10:2491. 1975 Quantum sine-Gordon equation as the massive Thirring model. Phys. Rev. D 11:2088. With R. Jackiw and L. Susskind. Charge shielding and quark confine- ment in the massive Schwinger model. Ann. Phys. 93:267. sidney coleman 19
1976 More about the massive Schwinger model. Ann. Phys. 101:239. 1977 With S. J. Parke, A. Neveu, and C. M. Sommerfield. Can one dent a dyon? Phys. Rev. D 15:544. The fate of the false vacuum. 1. Semiclassical theory. Phys. Rev. D 15:2929-2936. With C. G. Callan Jr. The fate of the false vacuum. 2. First quantum corrections. Phys. Rev. D 16:1762-1768. 1980 With E. Witten. Chiral symmetry breakdown in large N chromody- namics. Phys. Rev. Lett. 45:100. 1982 With B. Grossman. ’t Hooft’s consistency condition as a consequence of analyticity and unitarity. Nucl. Phys. B 203:205. 1985 With B. R. Hill. No more corrections to the topological mass term in QED in three dimensions. Phys. Lett. B 159:184. Q Balls. Nucl. Phys. B 262:263. 1986 With A. G. Cohen, H. Georgi, and A. Manohar. The evaporation of Q balls. Nucl. Phys. B 272:301. 1988 Black holes as red herrings: Topological fluctuations and the loss of quantum coherence. Nucl. Phys. B 307:867. Why there is nothing rather than something: A theory of the cosmo- logical constant. Nucl. Phys. B 310:643. Aspects of Symmetry: Selected Erice Lectures. Cambridge, U.K.: Cambridge University Press. 1997 With S. L. Glashow. Cosmic ray and neutrino tests of special relativity. Phys. Lett. B 405:249-252. 20 B i o g r a P h i c a l m e m o i r s
1999 With S. L. Glashow. High-energy tests of Lorentz invariance. Phys. Rev. D 59:116008. arXiv:1110.5013v5 [physics.ed-ph] 21 Feb 2013 oe fBinHl.Nmru diinleisb ihr on Please Sohn. Richard by comments. edits or questions (bryanging additional any have Numerous Chen you If Gin-ge ([email protected]). Bryan Hill. Brian ([email protected]), of notes ∗ † L dtdadtpstb unSnTn n ra i-eCe rmsc from Chen Gin-ge Bryan and Ting Yuan-Sen by typeset and Edited A T E eso fFbur 1 2013 21, February of version X oe rmSde oea’ hsc 253a Physics Coleman’s Sidney from Notes avr,Fl 1986 Fall Harvard, inyColeman Sidney 1 ∗ † [email protected])adRcadSohn Richard and [email protected]) n ftehandwritten the of ans otc unSnTing Yuan-Sen contact 0. Preface Notes from Sidney Coleman’s Physics 253a 2
0 Preface
It’s unexpected and heart-warming to be asked by Bryan Chen and Yuan-Sen Ting to write something about these notes, 25 years after taking them. I was the teaching assistant for Sidney’s quantum field theory course for three years. In the first year, I sat in, because frankly, I hadn’t learned quantum field theory well enough the first time that I took it. When I have the good fortune to hear a really good lecturer, I often re-copy my notes, preferably the evening on the day that I took them. Once in a while, students would miss a class, and then ask me if they could look at my notes. At some point, the requests started happening enough that it was suggested that a copy be put on reserve in the Harvard physics library. From there, copies of the notes just kept spreading. Sidney once expressed disappointment about the spread of the notes. For one thing, I even wrote down some of his anecdotes and jokes, and that made it less fun for him to re-tell them. For another, he wrote Aspects of Symmetry which shared a lot of material with what he taught in Physics 253b. He may have had in mind that he would write a field theory book as a companion volume. Of course, he never did write a field theory book, or you’d be reading that, and he never tried to rein the copies in. Now that he is gone, we are lucky that his clarity lives on. Thanks to Bryan Chen and Yuan-Sen Ting for creating this lovely LATEX version. Some- times transcription can seem tedious, but I hope it was as valuable for them as the first re-copying was for me, and that for you – fellow student of quantum field theory – the existence of these notes is similarly valuable. –Brian Hill, www.lingerhere.org, March 10, 2011
Editors’ notes The great field theorist Sidney Coleman for many years taught the course Physics 253 at Harvard on Quantum Field Theory. The notes you are reading were typeset from a scanned version of handwritten lecture notes by Brian Hill from the Fall 1986 of the first half of the course: Physics 253a. The Harvard Physics Department has made films of the lectures from the 1975-1976 version of the course available on their website as well. The typesetting for lectures 1-11 was done by Bryan Gin-ge Chen and for lectures 12-28 by Yuan-Sen Ting, who also recreated most of the figures. We have attempted to stay as faithful as possible to the scanned notes, aside from correcting some obvious errors in the notes and changing some of the in-text references. We thank Richard Sohn, Leonard Gamberg, Avraham Gal for pointing out typographical errors in the previous versions. The list of typographical errors can be found in Yuan-Sen Ting’s homepage www.cfa.harvard.edu/ yuanting. ∼ 0. Preface Notes from Sidney Coleman’s Physics 253a 3
Contents
0 Preface 2
1 September 23 7 Units; Translational invariance, Rotational invariance and Lorentz invariance for a singlefreeparticle...... 7
2 September 25 14 Position operator; Violation of Causality; Pair Production; Fock Space; Occupation number representation; SHO; Oscillator-like formalism for Fock space..... 14
3 September 30 21 Causality, observables and quantum fields; Constructing the quantum field from Fock space operators axiomatically; Translational invariance; Lorentz invari- ance and Relativistic normalization; Field constructed satisfies Klein-Gordon equation...... 21
4 October 2 28 Constructing Fock space from the quantum field axiomatically; The Method of the Missing Box, Classical Particle Mechanics, Quantum Particle Mechanics, Classical Field Theory, Quantum Field Theory; Quantum field from free scalar theory...... 28
5 October 7 38 Hamiltonian recovered in free scalar theory up to infinite constant; Normal ordering; Symmetries and conservation laws, Noether’s theorem; Noether’s theorem in field theory, conserved currents; ambiguity in currents; Energy-momentum tensor...... 38
6 October 9 49 Lorentz transformations; Angular momentum conservation; Internal symmetries; SO(2) internal symmetry; Charged field; SO(n) internal symmetry...... 49
7 October 14 58 Lorentz transformation properties of conserved quantities; Discrete symmetries; φ φ; Charge conjugation; Parity; Ambiguity of choice of parity; Time reversal;→ − Unitary and anti-unitary operators, angular momentum; Dilatations. 58
8 October 16 70 0. Preface Notes from Sidney Coleman’s Physics 253a 4
Scattering theory overview; Low budget scattering theory; Turning on and off func- tion; Schr¨odinger picture, Heisenberg picture and interaction picture; Evolu- tion operator; Time ordered product; Three models; Wick’s theorem. .... 70
9 October 21 83 Diagrammatic perturbation theory in Model 3; Vertex in model 1; Connected di- agrams; Thm: all Wick diagrams = : e connected Wick diagrams :; Model 1 P solved;Model2begun...... 83 P 10 October 23 93 Model 2 finished; Vacuum energy c.t.; S matrix is 1; Ground state energy; Yukawa potential; Ground state wave function; Model 3 and Mass renormalization; Renormalizedperturbationtheory...... 93
11 October 28 103 Feynman diagrams in Model 3; Feynman rules in model 3; A catalog of all Feynman diagrams in model 3 to (g2); Scattering amplitude at (g2); Direct and O O exchangeYukawapotentials...... 103
12 October 30 112 “Nucleon”-anti“nucleon” scattering at (g2); Energy eigenstate pole; Meson-“nucleon” scattering; “Nucleon”-anti“nucleon”O annihilation; Assembling the amplitudes for various processes into one amplitude; Mandelstam variables; Mandelstam- Kibble plot; crossing symmetry; CPT; Phase space and the S matrix; Differential tran. prob. unit time ...... 112 13 November 4 131 Differential tran. prob. Applications of unit time ; Decay; Cross sections, flux; Final state phase dσ space simplified for two bodies; dΩ ; Optical theorem; Final state phase space for three bodies; Feynman diagrams with external lines off the mass shell; theycouldbeaninternalpartofalargerdiagram...... 131
14 November 6 144 Fourier transform of the new blob; A second interpretation of Feynman diagrams n with lines off the mass shell; they are the coefficients of ρ in 0 S 0 ρ; A third interpretation of the blob; they are the Fourier transform ohf| the| i VEV of a string of Heisenberg fields; Reformulation of scattering theory; S matrix elements without the turning on and off function; LSZ formula stated.. . . . 144
15 November 13 163 0. Preface Notes from Sidney Coleman’s Physics 253a 5
LSZ formula proved; A second look at Model 3 and its renormalization; Renormal- izationconditions...... 163
16 November 18 171 Perturbative determination of a c.t.; Problems with derivative couplings; Rephras- ing renormalization conditions in terms of Green’s functions; Lehmann-Kallen spectral representation for the propagator; Rephrasing renormalization con- ditionsintermsof1PIfunctions...... 171
17 November 20 182 Perturbative determination of c.t.; Corrections to external lines in the computation of S matrix elements; One loop correction to meson self energy; Feynman’s trick for combining 2 denominators; Shift to make denominator SO(3, 1) in- variant; Wick notation to make denominator O(4) invariant; Integral tables for convergent combinations; Self-energy at one loop studied; Combining lots of denominators; The shift in the general case to reduce any multi-loop inte- graltoan integralover Feynman parameters...... 182
18 November 25 196 Rephrasing coupling constant renormalization in terms of a 1PI function; Exper- imental significance of the definition; Renormalization versus the infinities; Renormalizable Lagrangians; Unstable particles, Decay products...... 196
19 December 2 207 Unstable particles, lifetime, method of stationary phase; Where it begins again; Lorentz transformation laws of fields; Equivalent representations; Reducible reps; The finite dimensional inequivalent irreducible representations of SO(3); Unitarity; Complex conjugation; Direct product; Projection operators and reducibility...... 207
20 December 4 226 Parametrizing the Lorentz group; Commutation relations for the generators; de- composition into two sets obeying SO(3) commutation relations; The cata- log; Complex conjugation properties; Tensor product properties; Restriction to SO(3);Thevector;Rank2tensors;Spinors...... 226
21 December 9 240 Lagrangian made of two component spinors; Solution of Weyl equations of motion; Weyl particles; Dirac Lagrangian; Four-component spinors; Weyl basis, Dirac basis; Plane wave solutions of the Dirac equation...... 240 0. Preface Notes from Sidney Coleman’s Physics 253a 6
22 December 11 249 Plane wave solutions of the Dirac equation; Pauli’s theorem; Dirac adjoint; Pauli- Feynman notation; Parity; Bilinears; Orthogonality; Completeness; Summary. 249
23 December 16 267 Canonical quantization of the Dirac Lagrangian...... 267
24 December 18 273 Perturbation theory for spinors; Time ordered product; Wick’s theorem; Calcu- lation of the contraction (propagator); Wick diagrams; Feynman diagrams; Matrix multiplication; Spin averages and spin sums...... 273
25 January 6 289 Parity for spinors; Fermion and antifermion have opposite intrinsic parity; Charge conjugation; Majorana basis; Charge conjugation properties of fermion bilin- ears; Decay of ortho and para positronium; U U = U U ( 1)NF ;PT. . . . 289 P C C P − 26 January 8 301 Effect of PT on states; Proof of CPT theorem in perturbation theory; Renormal- izationofspinortheories;Propagator...... 301
27 January 13 312 2 1PI part of propagator; Spectral representation for propagator; ′( p) to (g ) in meson-nucleon theory; Coupling constant renormalization; Is ren6 ormalizationO sufficient to eliminate ’s...... P 312 ∞ 28 January 15 322 Regularization; Regulator fields; Dim’l regularization; Minimal subtraction; BPHZ renormalizability; Renormalization and symmetry; Renormalization of com- positeoperators...... 322
Class: Why not use Feynman’s lecture notes? Gell-Mann: Because Feynman uses a different method than we do. Class: What is Feynman’s method? Gell-Mann: You write down the problem. Then you look at it and you think. Then you write down the answer. 1. September 23 Notes from Sidney Coleman’s Physics 253a 7
1 September 23
In NRQM, rotational invariance simplifies scattering problems. Why does the addition of relativity, the addition of L.I., complicate quantum mechanics? The addition of relativity is necessary at energies E mc2. At these energies ≥ p + p p + p + π0 → is possible. At slightly higher energies p + p p + p + p + p → can occur. The exact solution of a high energy scattering problem necessarily involves many particle processes. You might think that for a given E, only a finite number, even a small number, of processes actually contribute, but you already know from NRQM that that isn’t true. 0 δV n 2 H H + δV δE = 0 δV 0 + |h | | i| + → 0 h | | i E E ··· n 0 n X − Intermediate states of all energies contribute, suppressed by energy denominators. For calculations of high accuracy effects at low energy, relativistic effects of order (v/c)2 can be included. Intermediate states with extra particles will contribute corrections of order Typical energies in problem 2 2 E ← mv v 2 2 = . As a general conclusion: the corrections of relativistic mc Typical mc c ← energy ∼ denominator kinematics and the corrections from multiparticle intermediate states are comparable; the addition of relativity forces you to consider many-body problems. We can’t even solve the zero-body problem. (It is a phenomenal fluke that relativistic kinematic corrections for the Hydrogen atom work. If the Dirac equation is used, without considering multi-particle intermediate states, corrections of v can be obtained. This is a fluke caused by some O c unusually low electrodynamic matrix elements.) We will see that you cannot have a consistent relativistic picture without pair production.
Units
1 1 ~ = c = 1 [m] = [E] = [T − ] = [L− ] Sometimes 1 = 1=2π Because we’re doing − and 1 = 1=“one-bar” relativistic (c) 2π quantum mechanics (~) 1 (1 fermi)− 197 MeV ≈ We say things like the inverse Compton wavelength of the proton is “1 GeV”. 1. September 23 Notes from Sidney Coleman’s Physics 253a 8
Lorentz Invariance Every Lorentz transformation is the product of an element of the connected Lorentz group, SO(3, 1), and 1, P (reflects all three space components), T (time reversal), or P T . By Lorentz invariance we mean SO(3, 1). Metric convention: + . − −− Theory of a single free spinless particle of mass µ
The components of momentum form a complete set of commuting variables.
Momentum operator P~ ~k = ~k ~k → | i | i State of a spinless particle is completely specified by its momentum. |{z}(3) Normalization ~k ~k ′ = δ (~k ~k ′) h | i − The statement that this is a complete set of states, that there are no others, is
1= d3k ~k ~k | ih | Z ψ = d3kψ(~k) ~k ψ(~k) ~k ψ | i | i ≡h | i Z (If we were doing NRQM, we’d finish describing the theory by giving the Hamiltonian, ~k 2 and thus the time evolution: H ~k = | | ~k .) | i 2µ | i We take H ~k = ~k 2 + µ2 ~k ω ~k | i | | | i ≡ ~k| i That’s it, the theoryq of a single free spinless particle, made relativistic. How do we know this theory is L.I.? Just because it contains one relativistic formula, it is not necessarily relativistic. The theory is not manifestly L.I.. The theory is manifestly rotationally and translationally invariant. Let’s be more precise about this.
Translational Invariance Given a four-vector, a, specifying a translation (active), there should be a linear operator, U(a), satisfying:
U(a)U(a)† =1, to preserve probability amplitudes (1.1) U(0) = 1 (1.2) U(a)U(b)= U(a + b) (1.3) 1. September 23 Notes from Sidney Coleman’s Physics 253a 9
iP a The U satisfying these is U(a)= e · where P =(H, P~ ). (This lecture is in pedagogical, not logical order. The logical order would be to state: 1. That we want to set up a translationally invariant theory of a spinless particle. The theory would contain unitary translation operators U(~a).
i ∂U 2. Define P = i , (by (1.3) [P ,P ] = 0, by (1.1) P~ = P~ †). − ∂ai ~a=0 i j i 3. Declare P to be a complete set and classify the states by momentum.
4. Define H = P~ 2 + µ2, thus giving the time evolution.) q More translational invariance
iP a U(a) 0 = a U(a)= e · | i | i where 0 here means state centered at zero and a means state centered at a. | i | i
O(x + a)= U(a)O(x)U(a)† a O(x + a) a = 0 O(x) 0 h | | i h | | i Non-relativistic reduction
iP~ ~a U(~a)= e− · iP~ ~a e− · ~q = ~q + ~a | i | i Something hard to digest, but correct:
iP~ ~a ~qe− · ~q =(~q + ~a) ~q + ~a | i | i iP~ ~a iP~ ~a e · ~qe− · ~q =(~q + ~a) ~q b | i | i iP~ ~a iP~ ~a e · ~qe− · = ~q + ~a ⇒ b Looks like the opposite of iP~ ~a b iP~ ~a b e− · O(~x)e · = O(~x + ~a)
The ~q operator is not an operator localized at ~q. No reason for these (last two equations) to look alike. b 1. September 23 Notes from Sidney Coleman’s Physics 253a 10
Rotational Invariance Given an R SO(3), there should be a U(R) satisfying ∈ U(R)U(R)† = 1 (1.4) U(1) = 1 (1.5)
U(R1)U(R2)= U(R1R2) (1.6)
Furthermore denote ψ′ = U(R) ψ | i | i ψ′ P~ ψ′ = R ψ P~ ψ h | | i h | | i
for any ψ i.e. U(R)†P~ U(R)= RP~ (1.7) | i and U(R)†HU(R)= H (1.8) A U(R) satisfying all these properties is given by U(R) ~k = R~k | i | i That (1.5) and (1.6) are satisfied is trivial. Proof that (1.4) is satisfied
3 U(R)U(R)† = U(R) d k ~k ~k U(R)† | ih | Z 3 3 3 = d k R~k R~k k′ = Rk,d k′ = d k | ih | Z 3 ~ ~ = d k′ k′ k′ =1 | ih | Z Proof that (1.7) is satisfied
1 1 U(R)†P~ U(R) = U(R)− P~ (U(R)− )† by (1.5) and (1.6) by (1.4) ↓ 1 1 = U(R− )P~ U(R− )†
1 3 1 = U(R− )P~ d k ~k ~k U(R− )† | ih | Z 1 3 1 = U(R− ) d k~k ~k ~k U(R− )† | ih | Z 3 1 1 3 3 = d k~k R− ~k R− ~k k = Rk′,d k = d k′ | ih | Z 3 = d k′R~k ′ ~k ′ ~k ′ | ih | Z = RP 1. September 23 Notes from Sidney Coleman’s Physics 253a 11
You supply proof of (1.8). This is the template for studying L.I. Suppose a silly physicist took
~k = 1+ k2 ~k | is z | i ~ ~ 2 (3) ~ ~ s k k ′ s =(1+p k )δ (k k ′) h | i z − 1 1= d3k ~k ~k 1+ k2 | is sh | Z z If he took U (R) ~k = R~r his proofs of (1.4), (1.7), (1.8) would break down because s | is | is 3 3 3 d k d k′ d k 2 = 2 i.e. 2 is not a rotationally invariant measure! 1+ kz 6 1+ kz′ 1+ kz
Lorentz Invariance
(3) ~k ~k ′ = δ (~k ~k ′) is a silly normalization for Lorentz invariance. h | d3ik is not a− Lorentz invariant measure. We want a Lorentz invariant measure on the hyperboloid k2 = µ2,k0 > 0. d4k is a Lorentz invariant measure. Restrict it to the hyperboloid by multiplying it by a Lorentz invariant d4kδ(k2 µ2)Θ(k0). − 1 d3k 2 2 This yields the measure on the hyperboloid , ω~ = ~k + µ , k =(ω~ ,~k). 2ω~k k k 3 ~ q So we take k = (2π) 2ω~k k . relativistically| i | i normalized So factors of 2π pcome outp right in the Feynman rules a few | months{z } from now ko
k
(Using d3xδ(~x2 R2) can get R sin θdθdφ) − 2 0 1 0 0 δ(k −Ki) Think of the δ function as a function of k and use the general formula δ(f(k )) = zeroes ′ . of f, Ki |f (Ki)| P 1. September 23 Notes from Sidney Coleman’s Physics 253a 12
Looks like factor multiplying d3k ought to get larger as ~k gets large. This is an illusion, caused by graphing on Euclidean paper. This is the same| illusion| as in the twin paradox. The moving twin’s path looks longer, but in fact, its proper time is shorter. Now the demonstration of Lorentz invariance: Given any Lorentz transformation Λ define U(Λ) k = Λk | i | i U(Λ) satisfies
U(Λ)U(Λ)† = 1 (1.9) U(1)=1 (1.10)
U(Λ1)U(Λ2)= U(Λ1Λ2) (1.11)
U †(Λ)P U(Λ) = ΛP (1.12) The proofs of these are exactly like the proofs of rotational invariance, using 3 3 3 d k d k d k′ 1= 3 k k 3 = 3 (2π) 2ω | ih | (2π) 2ω (2π) 2ω ′ Z ~k ~k ~k We have a fairly complete theory except we still don’t know where anything is. We need a position operator, satisfying
X~ = X~ † (1.13)
RX~ = U(R)†XU~ (R) (1.14) iP~ ~a iP~ ~a e · Xe~ − · = X~ + ~a (1.15) Take ∂ of (1.15) to get i[P ,X ]= δ . ∂ai i j ij
Determination of X~ in position space
ψ(~k) ~k ψ ~k P~ ψ = ~kψ(~k) ≡h | i h | | i ∂ ~k X~ ψ = i ψ(~k) + ~kF ( ~k 2) ψ(~k) h | | i ∂~k | | an arbitrary vector to satisfy commuting with the P~ ’s, inhomogeneous part of a complete| {z set,} can be commutation|{z} relation written this way The extra arbitrary vector can be eliminated by redefining the phases of the states. In the momentum state basis let
iG( ~k 2) (This is a unitary transformation, call it U. ~k ~k N = e | | ~k | i→| i | i In effect, U †XU~ is our new position operator.) Here ~ G( ~k 2)= ~kF ( ~k 2) ∇ | | | | 1. September 23 Notes from Sidney Coleman’s Physics 253a 13
The only formula this affects in all that we have done so far is the expression for ~k X~ ψ . With the redefined states, it is h | | i
3 ~ ~k X~ ψ = d k′ ~k ~k ′ k′ X~ ψ N h | | in N h | i h | | iN eZ−iG(|~k|2)δ(3)(~k ~k ′) i ∂ +~k ′F ( ~k ′ 2) eiG(|~k ′|2)ψ(~k ′) − [ ∂~k ′ | | ] ✘ 2 ✘✘ ✘✘ 3 | iG{z( ~k }) (3)| {z } ∂ ✘✘ 2 ✘✘ 2 iG iG ∂ = d k′e− | | δ (~k ~k ′) ✘✘G( ~k ′ )+ ✘~k ′✘F ( ~k ′ ) e ψ + e ψ − ✘− ~ | | | | ~ Z ∂k ′ ∂k ′ ∂ = i ψ(~k) ∂~k
Up to an unimportant phase definition, we have shown that the obvious definition for X~ is the unique definition, and we have done it without using L.I. or the explicit form of H. 2. September 25 Notes from Sidney Coleman’s Physics 253a 14
2 September 25
It is possible to measure a particle’s position in our theory, ~x is an observable, and this leads to a conflict with causality. Introduce position eigenstates
1 i~k ~x ~k ~x = e− · h | i (2π)3/2
iHt We will evaluate ~x e− ~x =0 and see that our particle has a nonzero amplitude to be found outside its forwardh | light| conei 2.
iHt 3 iHt ~x e− ~x =0 = d k ~x ~k ~k e− ~x =0 H ~k = ω ~k h | | i h | ih | | i | i ~k| i Z 1 ~ 3 ik ~x iω~ t 2 2 = d k e · e− k ω = ~k + µ (2π)3 ~k Z 2 π 2π q ∞ k dk ikr cos θ iω t = sin θdθ dφ e e− k r = ~x ,k = ~k (2π)3 | | | | Z0 Z0 Z0 1 2π −1 d(cos θ) R 1 1 ∞ | {z ikr} | {zikr} iωkt 2 2 − − = 2 kdk(e e )e ωk = k + µ (2π) ir 0 − Z p i ∞ ikr iω t = − kdke e− k (2π)2r Z−∞ These steps can be applied to the F.T. of any function of ~k . | | k branch cut for Im <0 iu Im >0
-iu
2By translational invariance and superposition we could easily get the evolution of any initial configuration from this calculation. 2. September 25 Notes from Sidney Coleman’s Physics 253a 15
i√k2+µ2t e− is a growing exponential as you go up the right side of the upper branch cut, ikr iω t and a decreasing exponential on the left side. Given r > 0 and r > t the product e e− k decreases exponentially as you go up the branch cut. Given r> 0 and r > t deform the contour to:
half circle at infinity does not contribute
The integral becomes k=iz µr i ∞ zr √z2 µ2t √z2 µ2t e− ∞ (z µ)r 2 2 − − − − − − − 2 ( z) d(iz)e [e e ]= 2 zdze sinh( z µ t) (2π) r µ − − −2π r µ − Z z}|{ Z p The integrand is positive definite, the integral is nonzero. We can measure a particle’s position in this theory. We can trap it in a box of arbitrarily small size, and we can release it and detect it outside of its forward light cone. The particle can travel faster than light and thus it can move backwards in time, with all the associated paradoxes. Admittedly, the chance that the particle is found outside the forward light cone falls off exponentially as you get further from the light cone, and that makes it extremely unlikely that I could go back and convince my mother to have an abortion, but if it is at all possible, it is still an unacceptable contradiction. In practice, how does this affect atomic physics? Not at all, because we never tried to localize particles to spaces of order their Compton wavelength when doing atomic physics. We say that the electron is in the TV picture tube and there is not much chance that it is actually out in the room with you. In principle, the ability to localize a single particle is a disaster, how does nature get out of it?
Particle trapped in container with reflecting walls 2. September 25 Notes from Sidney Coleman’s Physics 253a 16
If the particle is localized to a space with dimensions on the order of L the uncertainty in 1 the particle’s momentum is L . In the relativistic regime this tells us that the uncertainty in the particle’s energy is ∼1 . As L gets less than 1 states with more than one particle ∼ L µ are energetically accessible. If the box contained a photon and the walls were mirrors the photon would pick up energy as it reflected off the descending mirror, it could turn into two photons as it reflected.
If we try to localize a particle in a box with dimensions smaller or on the order of a Compton wavelength it is unknown whether what we have in the box is 1 particle, 3 particles, 27 particles or 0 particles. Relativistic causality is inconsistent with a single particle theory. The real world evades the conflict through pair production. This strongly suggests that the next thing we should do is develop a multi-particle theory.
Any number of one type of free spinless mesons
The space we construct is called Fock space. This formalism is also used in thermodynamics with the grand canonical ensemble. Particle number instead of being fixed, fluctuates around a value determined by the chemical potential. Basis for single particle states ~k | i (3) ~k ′ ~k = δ (~k ~k ′) H ~k = ω ~k P~ ~k = ~k ~k h | i − | i ~k| i | i | i This is the same as last time, except now this is just part of the basis. Two particle states ~k ,~k = ~k ,~k | 1 2i | 2 1i indistinguishability, Bose statistics |{z} (3) (3) (3) (3) ~k ,~k ~k′ ,~k′ = δ (~k ~k′ )δ (~k ~k′ )+ δ (~k ~k′ )δ (~k ~k′ ) h 1 2| 1 2i 1 − 1 2 − 2 1 − 2 2 − 1 H ~k1,~k2 =(ω~ + ω~ ) ~k1,~k2 | i k1 k2 | i P~ ~k ,~k =(~k + ~k ) ~k ,~k | 1 2i 1 2 | 1 2i etc. 2. September 25 Notes from Sidney Coleman’s Physics 253a 17
Also need a no particle state 0 | i 0 0 =1 H 0 =0 P~ 0 =0 h | i | i | i not part of a continuum |{z} |{z} The vacuum is unique, it must satisfy U(Λ) 0 = 0 . All observers agree that the state with no particles is the state with no particles. | i | i Completeness relation 1 1= 0 0 + d3k ~k ~k + d3k d3k k k k ,k | ih | | ih | 2! 1 2| 1 2ih 1 2| Z Z to avoid double counting. Alternatively, just check that this works on ~k,~k ′ |{z} | i Now we could proceed by setting up equations for wave functions. To specify a state, a wave function contains a number, a function of three variables, a function of six variables, etc. Interactions involving a change in particle number will connect a function of six variables to a function of nine variables. This would be a mess. We need a better description. As a pedagogical device, we will work in a periodic cubical box of side L for a while. Since a translation by L does nothing, the momenta must be restricted to allowed values ~ k (0, 0, L)=2nzπ 2πnx 2πny 2πnz · ~k = , , satisfying ~k (0, L, 0)=2n π L L L · y ~k (L, 0, 0)=2n π · x Dirac deltas become Kronecker deltas and integrals become sums
(3) ~ ~ 3 δ (k k ′) δ ′ d k − → ~k~k → Z X~k ~ ~ ~ ~ ~ ~ k k ′ = δ~k~k ′ k1, k2 k1′ , k2′ = δ~k ~k′ δ~k ~k′ + δ~k ~k′ δ~k ~k′ h | i h | i 1 1 2 2 1 2 2 1 Occupation number representation Each basis state corresponds to a single function
~k1 n(~k)= δ~~ | i↔ kk1 ~ ~ ~ k1, k2 n(k)= δ~~ + δ~~ | i↔ kk1 kk2 0 n(~k)=0 | i↔ 2. September 25 Notes from Sidney Coleman’s Physics 253a 18
Given a function n(~k) in the occupation number description, we write the state ~ n( ) Inner product n( ) n′( ) = n(k)!δ ′ | · i h · | · i n(~k)n (~k) Y~k no argument, to emphasize that the state depends on the |{z}whole function n, not just its value for one specific ~k. Define an occupation number operator
N(~k) n( ) = n(~k) n( ) H = ω N(~k) P~ = ~kN(~k) | · i | · i ~k X~k X~k This is a better formalism, but it could still use improvement. It would be nice to have an operator formalism that did not have any wave functions at all. Note that H for our system has the form it would have if the system we were dealing with was actually a bunch of harmonic oscillators. The two systems are completely different. In ours the N(~k) tells how many particles are present with a given momentum. In a system of oscillators, N(~k) gives the excitation level of the oscillator labelled by ~k.
Review of the simple harmonic oscillator
No physics course is complete without a lecture on the simple harmonic oscillator. We will review the oscillator using the operator formalism. We will then exploit the formal similarity to Fock space to get an operator formulation of our multi-particle theory. 1 H = ω[p2 + q2 1] [p, q]= i 2 − − Identity If [p, A] = [q, A] = 0 then A = λ I . c-number This is all we need to get the spectrum.z}|{ Define raising and lowering operators|{z} q + ip q ip a a† = − H = ωa†a ≡ √2 √2 [H, a†]= ωa† [H, a]= ωa [a, a†]=1 −
Ha† E = a†H E + ωa† E | i | i | i =(E + ω)a† E a† is the raising operator | i Ha E =(E ω)a E | i − | i 2. September 25 Notes from Sidney Coleman’s Physics 253a 19
E+2 Ladder at E+ of at a E states E-
2 Because, ψ H ψ = ω ψ a†a ψ = ω a ψ 0 the ladder of states must stop going h | | i h | | i || | i|| ≥ down or else E becomes negative. The only way this can happen is if a E = 0. Then | 0i H E0 = 0. | Thei lowest state of the ladder, having E = 0 is denoted 0 . The higher states are made 0 | i by n (a†) 0 n H n = nω n | i∝| i | i | i Get normalizations right a† n = c n +1 | i n| i 2 c = n aa† n = n +1 c = √n +1 | n| h | | i ⇒ n a n = d n 1 d 2 = n d = √n | i n| − i | n| n Now we use [p, A] = [q, A]=0 A = λI to show that this ladder built from a state with E = 0 is in fact the whole space.⇒ We do this by considering the projector, , onto the 0 P states in the ladder. Since a and a† keep you within the ladder
[a, ] = [a†, ]=0 [p, ] = [q, ]=0 P P ⇒ P P = λI ⇒ P The projector onto the ladder is proportional to the identity. There is nothing besides the states we have found. Now we will apply this to Fock space. Define creation and annihilation operators for each momentum, a , a† , satisfying ~k ~k ′
[a~ , a† ]= δ~~ ′ [a~ , a~ ′ ] = 0 [a† , a† ]=0 k ~k ′ kk k k ~k ~k ′ Hilbert space built by acting on 0 with strings of creation operators. | i ~k = a† 0 a~ 0 =0 | i ~k| i k| i ~ ~ ~ a~† a~† a~† 0 = k1, k2, k3 k1 k2 k3 | i | i H = ω~ a† a~ P~ = ~ka† a~ k ~k k ~k k X~k X~k
If ~k 0 = [a~ , A] = [a† , A]= A = λI ∀ k ~k ⇒ This tells us there are no other degrees of freedom. | {z } 2. September 25 Notes from Sidney Coleman’s Physics 253a 20
We’ve laid out a compact formalism for Fock space. Let’s drop the box normalization and see if it is working.
(3) ~ ~ [a~ , a† ]= δ (k k ′)[a~ , a~ ′ ] = 0 = [a† , a† ] k ~k ′ − k k ~k ~k ′ 3 3 H = d kω a† a P~ = d k~ka† a ~k ~k ~k ~k ~k Z Z Check energy and normalization of one particle states. ~ ~ k ′ k = 0 a ′ a† 0 = 0 [a ′ , a† ] 0 h | i h | ~k ~k| i h | ~k ~k | i (3) (3) = δ (~k ~k ′) 0 0 = δ (~k ~k ′) − h | i − 3 [H, a† ]= d k′ω~′ [a† a~ ′ , a† ]= ω~ a† ~k k ~k ′ k ~k k ~k ⇒ Z H ~k = Ha† 0 = [H, a† ] 0 = ω~ ~k | i ~k| i ~k | i k| i [P,~ a† ]= ~ka† P~ ~k = ~k ~k ~k ~k ⇒ | i | i ~ ~ Check normalization of two-particle states k1, k2 = a~† a~† 0 . Using commutation rela- | i k1 k2 | i tions check
(3) (3) (3) (3) ~k′ ,~k′ ~k ,~k = δ (~k ~k′ )δ (~k ~k′ )+ δ (~k ~k′ )δ (~k ~k′ ) h 1 2| 1 2i 1 − 1 2 − 2 1 − 2 2 − 1 Mathematical Footnote:
We’ve been calling the a and a† operators. An operator takes any normalizable vector in ~k ~k Hilbert space to another normalizable vector: A ψ is normalizable whenever ψ is normal- | i | i izable. Even x in 1-d QM is not an operator. dx f(x) 2 < ; dx f(x) 2x < . x is an unbounded operator. An unbounded operator has| A| ψ normalizable∞ | for| a dense∞ set3 R | i R of ψ . a and a† are even more awful than unbounded operators. An extra meson in a | i ~k ~k plane wave state added to any state is enough to make it nonnormalizable. a~k is an operator valued distribution. You only get something a mathematician would be happy with after 3 ~ integration. d kf(k)a~k is acceptable with a “sufficiently smooth” function. R
3Any ψ is the limit of a sequence in the dense set. | i 3. September 30 Notes from Sidney Coleman’s Physics 253a 21
3 September 30
In ordinary QM, any hermitian operator is observable. This can’t be true in relativistic quantum mechanics. Imagine two experiments that are at space-like separation. If x R 1 ∈ 1 and x R then (x x )2 < 0. 2 ∈ 2 1 − 2
R1 R2 space-time region space-time region accessible to observer 1 accessible to observer 2
Suppose observer 2 has an electron in his lab and she measures σk. If observer 1 can measure σy of that electron it will foul up observer 2’s experiment. Half the time when she remeasures σx it will have been flipped. This tells her that observer 1 has made a measurement. This is faster than light communication, an impossibility. It is a little hard to mathematically state the obvious experimental fact that I can’t measure the spin of an electron in the Andromeda galaxy. We don’t have any way of localizing particles, any position operator, yet. We can make a mathematical statement in terms of observables: If O1 is an observable that can be measured in R1 and O2 is an observable that can be measured in R2 and R1 and R2 are space-like separated then they commute:
[O1,O2]=0
Observables are attached to space time points. A given observer cannot measure all observables, only the ones associated with his or her region of space-time. It is not possible, even in principle, for everyone to measure everything. Out of the hordes of observables, only a restricted set can be measured at a space-time region. Localization of measurements is going to substitute for localization of particles. The attachment of observables to space-time points has no analog in NRQM, nor in the classical theory of a single particle, relativistic or nonrelativistic, but it does have an analog in classical field theory. In electromagnetism, there are six observables at each point: Ex(x) = Ex(~x, t), Ey(x), Ez(x), Bx(x), By(x), Bz(x). We can’t design an apparatus here that measures the Ex field now in the Andromeda galaxy. In classical field theory, these observables are numbers. In quantum mechanics, observ- ables are given by operators. The fields will become quantum fields, an operator for each spacetime point. We can see in another way that the electric field is going to have to become a quantum field: How would you measure an electric field? You mount a charged ball, pith 3. September 30 Notes from Sidney Coleman’s Physics 253a 22 ball, to some springs and see how much the springs stretch. The location of the ball is given by an equation like:
4 qx¨ = E (x) might be d xf(x)Ex(x) where f(x) x ←− gives some suitableR average over the pith ball. The amount the ball moves is related to the E~ field, and if the world is quantum me- chanical, x must be an operator, and so Ex must be an operator. We don’t have a proof, but what is strongly suggested is that QM and relativistic causality force us to introduce quantum fields. In fact, relativistic QM is practically synonymous with quantum field theory. We will try to build our observables from a complete commuting set of quantum fields.
φa(x) operator valued functions of space-time. a=1,...,N
Observables in a region R will be built out of φa(x) with x R. Observables in space-like separated regions will be guaranteed to commute if ∈
1. [φa(x),φb(y)] = 0 whenever (x y)2 < 0. − We are going to construct our fields out of the creation and annihilation ops. These five conditions will determine them:
a a 2. φ (x)= φ †(x) hermitian, observable
and that they have proper translation and Lorentz transformation properties
iP a a iP a a 3. e− · φ (x)e · = φ (x a) − a a 1 4. U(Λ)†φ (x)U(Λ) = φ (Λ− x) If this were not a scalar field, there would be extra factors here reflecting a change|{z} of basis; as well as a change of argument
and finally, a simplifying assumption, that the fields are a linear combination of a~k and a† (if that doesn’t work, we’ll try quadratic functions.) ~k
a 3 a a 5. φ (x)= d k[F (x)a~ + G (x)a† ] k k k ~k We can thinkR of our unitary transformations in two ways; as transformations on the states ψ U ψ , or as transformations on the operators A U †AU. NOT BOTH! What’s| i→ embodied| i in assumption (3): → 3. September 30 Notes from Sidney Coleman’s Physics 253a 23
iP~ ~a Given U(~a) the unitary operator of space translation by ~a (U(~a)= e− · ) the translation of a state ψ is a state ψ′ = U(~a) ψ . Suppose the value of some observable, like charge density is | i | i | i f(~x)= ψ ρ(~x) ψ h | | i f( x)
x f( x- a)
a x
then it should be that ψ′ ρ(~x) ψ′ = f(~x ~a). h | | i − Rewrite the first equation with ~x ~x ~a → − f(~x ~a)= ψ ρ(~x ~a) ψ − h | − | i
Equate ψ ρ(~x ~a) ψ = ψ′ ρ(~x) ψ′ h | − | i h | | i iP~ ~a iP~ ~a = ψ e · ρ(~x)e− · ψ h | | i A hermitian operator is determined by its expectation values
iP~ ~a iP~ ~a e · ρ(~x)e− · = ρ(~x ~a) − (3) is just the full relativistic form of this equation. The equation with a =(t,~0) is just the time evolution for Heisenberg fields. 1 For the exact same reason as x a appears in the RHS of (3), Λ− x appears in the RHS − of (4). (4) gives the Lorentz transformation properties of a ✿✿✿✿✿✿scalar field. This is not much of an assumption. We can get fields transforming as vectors or tensors by taking derivatives of the φa. Out of vector or tensor fields, we could make scalars. In order to apply condition (4), it is nice to have the discussion phrased in terms of relativistically normalized creation and annihilation operators. Recall the relativistically normalized one particle states
k = (2π)3/2 2ω ~k | i ~k| i p 3. September 30 Notes from Sidney Coleman’s Physics 253a 24
3/2 Introduce α†(k)=(2π) 2ω a† , α†(k) 0 = k . ~k ~k Multiparticle states are made by | i | i p
α†(k ) α†(k ) 0 = k ,...,k 1 ··· n | i | 1 ni The Lorentz transformation properties of the states are
U(Λ) 0 =0 | i U(Λ) k ,...,k = Λk ,..., Λk | 1 ni | 1 ni iP a and U(a)= e · is found from
P µ 0 =0 | i P µ k ,...,k =(k + + k )µ k ,...,k | 1 ni 1 ··· n | 1 ni In Eqs. (1.9)-(1.12) and Eqs. (1.1)-(1.3) of the Sept. 23 lecture we set up the criteria that U(Λ) and U(a) must satisfy. At the time our Hilbert space consisted only of the one particle part of the whole Fock space we have now. You should check that the criteria are satisfied in Fock space. We can determine Lorentz transformation and translation properties of the α†(k). Con- sider,
1 1 U(Λ)α†(k)U(Λ)† k1,...,kn = U(Λ)α†(k) Λ− k1,..., Λ− kn | i |1 1 i = U(Λ) k, Λ− k ,..., Λ− k | 1 ni = Λk,k ...,k | 1 ni
That is U(Λ)α†(k)U(Λ)† k ,...,k = α†(Λk) k ,...,k | 1 ni | 1 ni k1,...,kn is an arbitrary state in our complete basis so we have determined its action completely| i U(Λ)α†(k)U(Λ)† = α†(Λk) Similarly, or by taking the adjoint of this equation
U(Λ)α(k)U(Λ)† = α(Λk)
An analogous derivation shows that
iP x iP x ik x e · α†(k)e− · = e · α†(k) iP x iP x ik x e · α(k)e− · = e− · α(k) 3. September 30 Notes from Sidney Coleman’s Physics 253a 25
Now to construct the field φ (if there is more than one we’ll label them when we’ve found them) satisfying all 5 conditions. First we’ll satisfy condition (5) except we’ll write the linear combination of a and a† in terms of our new α(~k) and α†(~k) ~k ~k
It would be stupid not to use the L.I. measure d3k φ(x)= [f (x)α(k)+ g (x)α†(k)] (2π)32ω k k Z z }| ~k{ iP x iP x By (3), φ(x)= e · φ(0)e− · , that is,
3 d k iP x iP x iP x iP x · − · · † − · φ(x)= 3 [fk(0)e α(k)e + gk(0)e α (k)e ] (2π) 2ω~k Z 3 d k ik x ik x = [f (0)e− · α(k)+ g (0)e · α†(k)] (2π)32ω k k Z ~k
We have found the x dependence of fk(x) and gk(x) now we will use (4) to get their k dependence. A special case of (4) is
φ(0) = U(Λ)φ(0)U(Λ)† d3k [f (0)α(k)+ g (0)α†(k)] = (2π)32ω k k Z ~k d3k 3 [fk(0) U(Λ)α(k)U(Λ)† +gk(0) U(Λ)α†(k)U(Λ)†] (2π) 2ω~k Z α(Λk) α†(Λk) change variables 1 k Λ− k measure| is unchanged{z →} | {z } d3k = [f −1 (0)α(k)+ g −1 (0)α†(k)] (2π)32ω Λ k Λ k Z ~k
The coefficients of α(k) and α†(k) must be unchanged f (0) = f −1 (0) and g (0) = ⇒ k Λ k k gΛ−1k(0). k ranges all over the mass hyperboloid (k0 > 0 sheet), but a Lorentz transformation can turn any of these k’s into any other. So fk(0) and gk(0) are constants, independent of k.
3 d k ik x ik x φ(x)= [fe− · α(k)+ ge · α†(k)] (2π)32ω Z ~k We have two linearly independent solutions of conditions (3), (4) and (5), the coefficients of the complex constants f and g. We’ll name them. (Switching back to our old creation 3. September 30 Notes from Sidney Coleman’s Physics 253a 26
and annihilation ops.)
3 3 + d k ik x d k ik x φ (x)= a~ e− · φ−(x)= a† e · (2π)3/2 2ω k (2π)3/2 2ω ~k Z ~k Z ~k + convention is bananas, but it Note φ−(x)= φ (x)† p was est’d± by Heisenbergp and Pauli 50 years ago. Now we’ll apply hermiticity. Two independent combinations satisfying (2) are
+ 1 + φ(x)= φ (x)+ φ−(x) and φ(x)= [φ (x) φ−(x)] i − These are two independent cases of the most general choice satisfying (2):
iθ + iθ φ(x)= e φ (x)+ e− φ−(x)
Now to satisfy (1). There are three possible outcomes of trying to satisfy (1).
Possibility A: Both of the above combinations are OK. We have two fields φ1 and φ2 commuting with themselves and each other at spacelike separation. In this + possibility φ (x) and φ−(x) commute with each other at spacelike separation.
iθ + iθ Possibility B: Only one combination is acceptable. It is of the form φ = e φ +e− φ−. While θ may be arbitrary, only one θ is acceptable.
Possibility C: The program crashes, and we could weaken (5) or think harder.
Let’s first calculate some commutators. Using their expansions in terms of the a~ and a† k ~k and the commutation relations for a~ and a† we find k ~k
+ + [φ (x),φ (y)] = 0 = [φ−(x),φ−(y)] 3 + d k ik (x y) 2 − − · − and [φ (x),φ (y)] = 3 e ∆+(x y,µ ) (2π) 2ω~k ≡ − Z or just ∆+(x y) − + 2 also [φ−(x),φ (y)] = ∆ (x y,µ ) − + − | {z } ∆+ is manifestly Lorentz invariant ∆+(Λx) = ∆+(x).
2 Possibility A runs only if ∆+(x y)=0for(x y) < 0. We have encountered a similar integral when we were looking at the− evolution of− one particle position eigenstates. In fact,
3 1 d k ik (x y) i∂ ∆ (x y)= e− · − 0 + − 2 (2π)3 Z is the very integral we studied, and we found that it did not vanish when (x y)2 < 0. ✿✿✿ − 3. September 30 Notes from Sidney Coleman’s Physics 253a 27
Possibility A is DEAD. iθ + iθ On to possibility B. Take φ(x)= e φ (x)+ e− φ−(x) and calculate
[φ(x),φ(y)]=∆ (x y) ∆ (y x) θ dependence + − − + − drops out Does this vanish when (x y)2 < 0? Yes, and we can see this without any calcula- tions. A space-like vector has− the property that it can be turned into minus itself by a (connected) Lorentz transformation. This and the fact that ∆+ is Lorentz invariant, tells us that [φ(x),φ(y)] = 0 when x y is space-like. We can choose θ arbitrarily, but we can’t − choose more than one θ. We’ll choose θ = 0. Any phase could be absorbed into the a~k and a† . ~k Possibility B is ALIVE, and we don’t have to go on to C. We have our free scalar field of mass µ
3 + d k ik x ik x φ(x)= φ (x)+ φ−(x)= [a~ e− · + a† e · ] (2π)3/2 2ω k ~k Z ~k Our field satisfies an equation (show using k2 =pµ2)
2 µ ( + µ )φ(x)=0 = ∂ ∂µ
This is the Heisenberg equation of motion for the field. It is called the Klein-Gordon equation. If we had quantized the electromagnetic field it would have satisfied Maxwell’s equations. Actually, Schr¨odinger first wrote down the Klein-Gordon equation. He got it at the same time as he got the Schr¨odinger equation: 1 i∂ ψ = 2ψ 0 −2µ∇
~p 2 ~ This equation is obtained by starting with E = 2m noting that E = ω (when = 1) and i i i 1 p = k and for plane waves ω = i∂0 and k = i ∂i. Schr¨odinger was no dummy, he knew about relativity, so he also obtained (+µ2)φ(x)=0 from p2 = µ2. He immediately saw that something was wrong with the equation though. The equation has both positive and negative energy solutions. For a free particle the energies of its possible states are unbounded below! This is a disgusting relativistic generalization of a single particle wave equation, but with 50 years hindsight we see that this is no problem for a field that can create and destroy particles. 4. October 2 Notes from Sidney Coleman’s Physics 253a 28
4 October 2
We have constructed the quantum field. It is the object observables are built from. More than that though, we can reconstruct the entire theory from the quantum field. The structure we built in the last three lectures is rigid. We can make the top story the foundation. Suppose we started with a quantum field satisfying (as our φ does),
1. φ(x)= φ†(x), hermiticity
2. ( + µ2)φ(x) = 0, K.-G. equation
d3k ik (x y) ik (x y) 3. [φ(x),φ(y)]=∆+(x y) ∆+(y x)= (2π)32ω [e− · − e · − ] − − − ~k − U(Λ) φ(x)U(Λ) = φ(Λ 1x) R 4. † − φ is a scalar field U(a)†φ(x)U(a)= φ(x a) − 5. φ(x) is a complete set of operators, i.e. if x [A, φ(x)]=0 A = λI. ∀ ⇒ then from these properties, we could reconstruct the creation and annihilation operators and the whole theory. Conversely, all these properties follow from the expression for φ(x) in terms of the creation and annihilation operators. The two beginning points are logically equivalent.
Defining a and a† and recovering their properties ~k ~k Property (2) of φ, that φ is a solution of the Klein-Gordon equation, tells us that
3 ik x ik x 0 ~ 2 2 φ(x)= d k[α~ke− · + β~ke · ] (k = ω~k = k + µ ) Z q This is because any solution of the Klein-Gordon equation can be expanded in a complete set of solutions of the K.-G. eqn., and the plane wave solutions are a complete set. Because
φ is an operator, the coefficients in the expansion, α~k and β~k, are operators. Because of property (1), α = β†. Just to please my little heart, let’s define ~k ~k
These funny factors will make the † commutation relations for the a~ and a k ~k come out nice a† a~k ~k α~ = β~ = α† = k 3/2 k ~k 3/2 (2π) 2ω~k (2π) 2ω~k p p 4. October 2 Notes from Sidney Coleman’s Physics 253a 29
Then the expression is
3 d k ik x ik x φ(x)= [a~ e− · + a† e · ] (2π)3/2 2ω k ~k Z ~k p This defines implicitly the a~ and a† . To find their commutation relations we’ll first have k ~k to solve for them. Note that
3 d k i~k ~x i~k ~x φ(~x, 0) = [a~ e · + a† e− · ] (2π)3/2 2ω k ~k Z ~k While p 3 d k ik x ik x φ˙(x)= [a~ ( iω~ )e− · + a† (iω~ )e · ] (2π)3/2 2ω k − k ~k k Z ~k And p3 d k ω~k i~k ~x i~k ~x φ˙(~x, 0) = [ ia e · + ia† e− · ] (2π)3/2 2 − ~k ~k Z r 1 i~k ~x 1 The coefficient of e · in the expansion for φ(~x, 0) is (a~ + a† ) therefore (2π)3/2 2ω k ~k √ ~k − 1 d3x i~k ~x (a~ + a† )= φ(~x, 0)e− · 2ω k ~k (2π)3/2 √ ~k − 1 i~k ~x ω~k The coefficientR of e · in the expansion for φ˙(~x, 0) is ( ia~ + ia† ) therefore (2π)3/2 2 k ~k − − ω 3 ~k d x i~k ~x q ( ia~ + ia† )= φ˙(~x, 0)e− · 2 − k ~k (2π)3/2 That’s just− the inverse Fourier transformation applied. q R Now I can use these two expressions to solve for a~k. Take 2ω~k times the expression for
1 2 ω~k (a~ + a† ) and add i times the expression for ( ia~ + ia† ) and divide by 2 2ω k ~k ω~ 2 p k ~k √ ~k − k − − to get q q 3 1 d x i i~k ~x a = 2ω φ(~x, 0) + φ˙(~x, 0) e− · ~k 2 ~k (2π)3/2 ω Z ~k p Take the hermitian conjugate to get an expression for a† . ~k Having solved for the a and a† , we can now find their commutation relations. Write out ~k ~k the double integral for [a~ , a† ] and use property (3’) below (which is weaker than property k ~k ′ (3) (3)). You should get δ (~k ~k ′). − 4 From property (4), you can derive, what U(a)†a~kU(a) and U(Λ)†a~kU(Λ) are. You can also derive and state the analog of property (5).
4It is easier to find the action of Lorentz transformations on a(k) (2π)3/2 2ω a . ≡ ~k ~k p 4. October 2 Notes from Sidney Coleman’s Physics 253a 30
The properties of the field φ(x) are actually a little overcomplete. We can weaken one of them, (3), without losing anything. Since φ obeys the K.-G. equation, (2),
2 (x + µ )[φ(x),φ(y)]=0
We see that the commutator obeys the K.-G. equation for any given y. For a given y, we only need to give the commutator and the time derivative on one initial time surface, and the K.-G. equation determines its evolution off the surface. So we will weaken (3) to
[φ(~x, t),φ(~y, t)]=0 3’. [∂ φ(~x, t),φ(~y, t)] = iδ(3)(~x ~y) 0 − − We can easily check that this is the right specialization of property (3) by doing the integrals, which are easy with x0 = y0. (For a given y = (~y, t) we have chosen our initial surface to be x0 = t. The equations in (3’) are called equal time commutation relations.)
3 d k i~k (~x ~y) i~k (~x ~y) · − − · − [φ(~x, t),φ(~y, t)] = 3 [e e ] (2π) 2ω~k Change variables− ~k ~k Z in second term→− =0 3 d k i~k (~x ~y) i~k (~x ~y) [∂ φ(~x, t),φ(~y, t)] = ( iω )[e · − + e− · − ] 0 (2π)32ω − ~k ~k Again ~k ~k in second term Z →− 3 d k i~k (~x ~y) = ( iω✚)2✁e · − (2π)32ω✚ − ✚~k Z ✚ ~k Reminds us a little (3) of a bad dream in = iδ (~x ~y) b b which [pa,q ]= iδ − − − a In the remainder of this lecture we are going to develop a completely different approach to quantum field theory. We will obtain a field satisfying properties (1), (2), (3’), (4) and (5), and then we’ll stop. We have just shown that given a field satisfying these properties we can recover everything we did in the first three lectures. The new approach is 4. October 2 Notes from Sidney Coleman’s Physics 253a 31
The method of the missing box
continuum limit
classical classical particle field theory canonical mechanics quantization
quantum quantum particle field mechanics theory
Anyone looking at this diagram can see there is a missing box
Taking the continuum limit essentially is just letting the number of coordinates, degrees of freedom go to infinity. This is usually done in a cavalier way. It doesn’t matter a whole lot if you have a discrete infinity or a continuous infinity. With a continuous infinity you can Fourier transform the coordinates to obtain a discrete set of coordinates. Canonical quantization is a turn the crank way of getting quantum mechanics, beginning with a classical Hamiltonian. We’ll combine these two standard operations to get the missing box, but first, a lightning review of the essential principles of the three boxes we have.
Classical Particle Mechanics We start with a Lagrangian, a function of the generalized coordinates, and their time deriva- tives
L(q1,...,qN , q˙1,..., q˙N , t) qa(t) , a 1,...,N ∈ real number functions of time
e.g. L = 1 mq˙2 V (q) |{z} 2 − We define the action S = t2 dtL and apply Hamilton’s principle to get the equations of t1 motion. That is, we vary S by arbitrarily varying δqa(t) except at the endpoints R a a δq (t1)= δq (t2)=0 4. October 2 Notes from Sidney Coleman’s Physics 253a 32
and demand 0= δS
t2 ∂L ∂L δS = dt δqa + δq˙a ∂qa ∂q˙a t1 " a # Z X parts integration
t2 ∂L d a ✘a ✘t = dt | {zp δq + } ✘p ✘δq 2 ∂qa − dt a a |t1 t1 a Z ∂L vanishes because of X pa a ≡ ∂q˙ endpoint restriction Since the variation δqa(t) is arbitrary | {z } ∂L dV =p ˙ , for all a e.g. p = mq,˙ p˙ = ∂qa a − dq These are the Euler-Lagrange equations. Around 1920(!) the Hamiltonian formulation of classical particle mechanics was discov- ered. Define H(p ,...,p , q ,...,q , t)= p q˙a L 1 N 1 N a − a X H must be written in terms of the p’s and q’s only, not theq ˙’s. (This is not always possible. The new variables must also be independent, so it is possible to vary them independently. Examples where the p’s are not complete and independent are electromagnetism and a particle on a sphere in R3. The Lagrangian for the latter system may be taken as 1 L = m~r˙ 2 + λ(~r 2 a2) V (~r) 2 − − The equation of motion for the variable λ enforces the constraint. In the passage to the 5 Hamiltonian formulation pλ = 0; pλ is not an independent function. In this system, one way to get to the Hamiltonian description is to first eliminate λ and one more coordinate, taking perhaps θ, φ, polar coordinates for the system, rewriting the Lagrangian, and then trying again.) Vary the coordinates and momenta ✚ a ✘✘a ∂L a ∂L ✚ a dH = dpaq˙ + ✘padq˙ dq ✚dq˙ − ∂qa − ✚∂q˙a a Fortunately we didn’t Xhave to expand out dq˙a p˙a by the E-L equations Read off Hamilton’s equations |{z} ∂L dH ∂H ∂H Notice: ∂t =0 dt =0 a in that case H is called⇒ the energy (which =q ˙ a = p˙a ∂pa ∂q − is the name reserved for the conserved quantity resulting from time translation invariance) 5How can I vary that?!? 4. October 2 Notes from Sidney Coleman’s Physics 253a 33
Quantum Particle Mechanics
a We replace the classical variables pa, q by operator valued functions of time satisfying
a b [q (t), q (t)] = 0 = [pa(t),pb(t)] and [p (t), qb(t)] = iδb ~ =1 a − a The p’s and q’s are hermitian observables. They are assumed to be complete. They determine the Hilbert space. For example 1-d particle mechanics, the space of square inte- grable functions. A basis set can be the eigenstates of q. p or rather eipx tells you how to relate the phases of the various eigenstates and indeed even that the range of q is R (not [-1,1] or anything else). The quantum Hamiltonian, which determines the dynamics, is just the classical Hamil- tonian except it is now a function of the operator p’s and q’s. For any A
dA ∂A if there is explicit dependence on = i[H, A]+ time other than that implicit in the dt ∂t ←− time dependence of the p’s and q’s H is the generator of infinitesimal time translations just as in classical mechanics. H suffers from ordering ambiguities. Because p and q don’t commute, it is not clear, and it may matter, whether you write p2q, qp2 or pqp. Sometimes this ambiguity can be cleared up using other criteria. For example, in central force problems, we quantize in Cartesian coordinates and then transform to central coordinates.
Heisenberg equations of motion
dqa = i[H, qa] dt This step depends only on the commutation ∂H relations for the p’s and q’s. Fairly easy to see = i i ⇓ for a polynomial in the p’s and q’s. − ∂p a ∂H = ∂pa dp ∂H Similarly, a = dt −∂qa This is the motivation for the commutation relations. It is a way of putting the correspon- dence principle into the theory. The quantum equations resemble the classical equations, at least up to the ordering ambiguities, and any variation due to ordering ambiguities is down by a factor of ~. We wouldn’t expect any general procedure for turning classical theories into quantum theories, motivated only by the correspondence principle, to be able to fix those ambiguities. We can be a little more precise about how the classical equations are 4. October 2 Notes from Sidney Coleman’s Physics 253a 34
actually recovered. In general, the Heisenberg equations of motion for an arbitrary operator A relate one polynomial in p, q,p ˙ andq ˙ to another. We can take the expectation value of this equation to obtain (a quantum mechanical average of) an equation between observables. In the classical limit, when fluctuations are small, expectations of products can be replaced by products of expectations, pn p n, pq p q , and this turns our equation among h i→h i h i→h ih i polynomials of quantum operators into an equation among classical variables. The other two boxes are actually going to go quite quickly, because if you are just a little cavalier, the continuum limit is little more than some new notation.
Classical Field Theory We have an infinite set of generalized coordinates, real number functions of time, φa(~x, t) labelled by a discrete index, a, and a continuous index, ~x.
φa(~x, t) qa(t) ↔ t t ↔ a a, ~x ↔ It is sometimes a handy mnemonic to think of t, ~x, as a generalization of t, but that is not the right way to think about it. For example we are used to giving initial value data at fixed t in CPM. In CFT we don’t give initial value data at fixed t and ~x, that is obviously incomplete. We give initial value data for fixed t and all ~x. With cowboy boldness, everywhere in CPM we see a sum on a, we’ll just replace it with a a a (3) sum on a and an integral over ~x, and everywhere we see δb we’ll replace it with δb δ (~x ~y). The Dirac delta function has the exact same properties in integrals that the Kronecker delta− has in sums. The next thing to do is write down Lagrangians. Because a CPM Lagrangian can contain products of qa with different a, our Lagrangian could contain products of φ(~x, t) with different ~x d3xd3yd3zf (~x, ~y, ~z)φa(~x, t)φb(~y, t)φc(~z, t) f qa(t)qb(t)qc(t) abc ↔ abc Z Xa,b,c But note that a CPM Lagrangian does not contain products at different times. With an eye to Lorentz invariance, and noticing that Lagrangians are local in time, we will specialize to Lagrangians that are local in space. Also, you know that when taking continuum limits, differences of neighboring variables become spatial derivatives and a combination like 1 [ρ((n 1)a, t)+ ρ((n + 1)a, t) 2ρ(na, t)] a2 − −
∂2ρ would become ∂x2 in the continuum. But again with an eye to Lorentz invariance, because only first derivatives with respect to time appear in the Lagrangian, we will only consider 4. October 2 Notes from Sidney Coleman’s Physics 253a 35
first derivatives with respect to the xi. So L has the form
L(t)= d3x (φa(x), ∂ φa(x), x) L µ Z t2 And the action S = dtL(t)= d4x L Zt1 Z The Euler-Lagrange equations which come from varying S will be Lorentz covariant if is a Lorentz scalar. At three points we have used Lorentz invariance to cut down on the L possible forms for L. These have been specializations, not generalizations. Now we’ll apply Hamilton’s principle
under arbitrary variations δφ a a 0= δS satisfying δφ (~x, t1)= δφ (~x, t2) = 0 4 ∂ a ∂ a = d x a L δφ (~x, t)+ L a δ∂µφ ∂φ (~x, t) ∂∂µφ a a Z ∂µδφ Do parts integration. As usual, the X µ a πa restrictions on δφ make the surface ≡ ⇓ terms at t1 and t2 drop out ∂ | {z } = d4x L ∂ πµ δφa | {z } ∂φa − µ a a X Z ∂ µ Euler-Lagrange L = ∂µπa (for all ~x and a) equations ⇒ ∂φa
µ πa should not be thought of as a four-vector generalization of pa. The correspondence is
π0(~x, t) p (t) a ↔ a 0 In fact πa is often just written πa. We should say something about the surface terms at spatial infinity which we ignored when we did our parts integration. One can say “we are only considering field configura- tions which fall off sufficiently rapidly at spatial infinity that we can ignore surface terms.” Alternatively we could work in a box with periodic boundary conditions. Anyway, we’ll just be slothful.
A simple example of a possible L Most general satisfying (3) conditions L 1. Build out of one real scalar field, φ L φ = φ∗ (not φ = φ†, we’re not doing QFT yet)
2. is a Lorentz scalar L 4. October 2 Notes from Sidney Coleman’s Physics 253a 36
3. is quadratic in φ and ∂ φ L µ (1) and (3) are for simplicity. A motivation for (3) is that a quadratic action yields linear equations of motion, the easiest ones to solve. Most general is L 1 = a[∂ φ∂µφ + bφ2] L 2 µ One of these constants is superfluous; we are always free to rescale φ, φ φ . √ a → | | L becomes 1 = [∂ φ∂µφ + bφ2] L ±2 µ The Euler-Lagrange equations for our example are ∂ πµ L = ∂µφ and ≡ ∂∂µφ ± ∂ ∂ πµ L =0 or (∂µ∂ φ bφ)=0 µ − ∂φ ± µ −
In general the Hamiltonian which was p q˙a L in CPM is a a − P H = d3x(π0∂ φa )= d3x a 0 −L H a X Z Z = π0∂ φa is the Hamiltonian density H a 0 −L a X In our example 1 1 H = d3x[ (π0)2 + (~ φ)2 bφ2] ± 2 2 ∇ − Z Since each of these terms separately can be made arbitrarily large, if the energy is to be bounded below, they each better have a positive coefficient. better be + and b better be ± b = µ2 definition of µ 0 − ≥ µ 2 The E-L equation is now ∂µ∂ φ + µ φ =0 Gosh, this is looking familiar now. We are ready to fill in the last box. We are ready to canonically quantize classical field theory. Since we are not worrying about the passage to an infinite number of variables, this will be little more than a notational change in the canonical quantization of CPM. It would not even have required a notational change if Newton hadn’t chosen two ways of writing S for sum, and . R P 4. October 2 Notes from Sidney Coleman’s Physics 253a 37
Quantum Field Theory
a 0 We replace the classical variables φ , πa by quantum operators satisfying
a b 0 0 [φ (~x, t),φ (~y, t)] = 0 = [πa(~x, t), πb (~y, t)] and [π0(~x, t),φb(~y, t)] = iδb δ(3)(~x ~y) a − a − The Dirac delta for the continuous index ~x is just the continuum generalization of the Kronecker delta. H = d3x (π0,φa, x) determines the dynamics as usual. The commu- H a tation relations are set up to reproduce the Heisenberg equations of motion. Since this is R just a change of notation there is no need to redo any proofs, but let’s check that things are working in our example anyway. 1 H = d3x [(π0)2 +(~ φ)2 + µ2φ2] 2 ∇ Z ∂ φ(~y, t)= i[H,φ(~y, t)] = i( i) d3xπ0(~x, t)δ(3)(~x ~y) 0 − − Z = π0(~y, t)
0 2 2 0 Similarly, ∂0π (~y, t)= ~ φ µ φ. You need the commutator [π (~y, t), ~ φ(~x, t)] which is obtained by taking the gradient∇ − of the equal time commutation relation (ETCR).∇ We have reproduced our quantum field satisfying (1), (2), (3’), (4) and (5). This was accomplished in one lecture rather than three because this is a mechanical method without physical insight. The physical interpretation comes at the end instead of at the beginning. In our first method, the constructive method, we shook each object we introduced to make sure it made sense, and we finally obtained a local observable, the quantum field. We can’t put interactions in this method though, because in the very first steps we had to know the whole spectrum of the theory, and about the only theory we know the exact spectrum for is the free theory. In our magical canonical quantization method it’s easy to put in interactions: just let λφ4. At the first order in perturbation theory the part of λφ4 that has two L→L− creation and two annihilation operators produces two-into-two scattering! At second order we’ll get two-into-four and two-into-six6 scattering. Looks easy. . . if there weren’t any booby traps...but boy are there going to be booby traps.
6pair production! 5. October 7 Notes from Sidney Coleman’s Physics 253a 38
5 October 7
The simplest example obtained from applying canonical quantization to a free scalar field led to the same theory we got by constructing a local observable painstakingly in the theory we had before. We’ll double check that we get the same Hamiltonian we had in that theory. Evaluate 1 H = d3x[π2 +( φ)2 + µ2φ2] π = φ˙ 2 ∇ Z Write φ in terms of its Fourier expansion
3 d k ik x ik x φ(x)= [a~ e− · + a† e · ] (2π)3/2 2ω k ~k Z ~k Substitution would lead to a triple integral,p but the x integration is easily done yielding a δ function, which does one of the k integrals7.
3 ✭✭✭ 1 d k 2iω t 2 ✭✭2✭ 2 2 2 2 − ~k ✭✭✭ ~ † ~ H = a~k a ~k e ✭( ω~k + k + µ ) +a~ka~ (ω~k + k + µ ) 2 2ω − − | | k | | Z ~k ~k ′= ~k see That what multiplies e±2iω~kt is 0 is good. footnote− 1 H should not be time-dependent. |{z} | {z } ✭ ✭✭✭✭ 2 ~ 2 2 2iω~kt 2 ✭✭~ 2 2 + a† a~ (ω + k + µ )+ a† a† e ✭( ✭ω✭+ k + µ ) ~k k ~k | | ~k ~k − ~k | | − z }| { H had four types of terms: ones creating particles with momentum ~k and ~k, ones destroying particles with momentum ~k and ~k, ones creating a particle with momentum− ~k, − then destroying one with momentum ~k, and ones destroying a particle with momentum ~k, then creating one with momentum ~k. These all conserve momentum.
1 3 H = d k ω (a a† + a a† ) 2 ~k ~k ~k ~k ~k Z Almost, but not quite what we had before. (3) 3 1 (3) (3) Using [a~ , a† ]= δ (~k ~k ′) gives H = d kω~ a† a~ + δ (0) . δ (0)! Can get some k ~k ′ − k ~k k 2 idea of the meaning of this by putting the systemR in a box. H becomes 1 1 ω (a a† + a† a )= ω a† a + 2 ~k ~k ~k ~k ~k ~k ~k ~k 2 X~k X~k In a box we see that this is just a zero point energy. (It is still infinite though.) Just 1 2 2 2 1 as the spectrum of the Hamiltonian H = 2 (p + ω q ) starts at 2 ω, not zero, the spectrum
′ 7 d3xei~k·~xei~k ·~x = (2π)3δ(3)(~k + ~k ′) ~k ′ = ~k ⇒ − R 5. October 7 Notes from Sidney Coleman’s Physics 253a 39
1 of our Hamiltonian starts at 2 ~k ω~k, not zero. That this constant is infinite even in a box is because even in a box the field has an infinite number of degrees of freedom. There can be excitations of modes withP arbitrarily short wavelengths. The zero point energy is ultraviolet divergent. In general, infinite systems, like an infinite crystal, also have “infrared divergences.” If the energy density of an infinite crystal is changed by an infinite amount, the total energy is changed by an infinite amount. Our Hamiltonian for the system in infinite space, would still have an infinity (δ(3)(0)) even if the momentum integral were cut off, say by restricting it to ~k < K. That part of the infinity is eliminated by putting the system in a box, eliminating| long| wavelengths. Our system in infinite space has a zero-point energy that is also infrared divergent. This is no big deal for two reasons. (A) You can’t measure absolute energies, only energy differences, so it’s stupid to ask what the zero point energy is. This even occurs in introductory physics. We usually put interaction energies to be zero when particles are infinitely separated, but for some potentials you can’t do that, and you have to choose your zero another place. (B) This is just an ordering ambiguity. Just as the quantum Hamiltonian for the harmonic oscillator could be chosen as 1 H = (p + iωq)(p iωq) 2 − we could reorder our Hamiltonian (see below). In general relativity the absolute value of the energy density does matter. Einstein’s equations 1 R g R = 8πGT µν − 2 µν − µν couple directly to the energy density T00. Indeed, introducing a change in the vacuum energy density, in a covariant way, T T λg µν → µν − µν is just a way of changing the cosmological constant, a term introduced by Einstein and repudiated by him 10 years later. No astronomer has ever observed a nonzero cosmological constant.8 Our theory is eventually going to be applied to the strong interactions, maybe even to some grand unified theory. Strong interactions have energies typically of 1 GeV and 13 39 3 a characteristic length of a fermi, 10− cm. With a cosmic energy density of 10 GeV/cm , the universe would be about 1 km long according to Einstein’s equations. You couldn’t even get to MIT without coming back where you started. We won’t talk about why the cosmological constant is zero in this course. They don’t explain it in any course given at Harvard because nobody knows why it is zero. 8[BGC note: As of 1998, measurements on supernovae have indicated that Λ > 0.] 5. October 7 Notes from Sidney Coleman’s Physics 253a 40
Reordering the Hamiltonian:
Given a set of free fields (possibly with different masses) φ1(x1), ... , φn(xn), define the normal ordered product : φ (x ) φ (x ): 1 1 ··· n n as the usual product except rearranged so that all the annihilation operators are on the right and a fortiori all creation operators are on the left. No further specification is needed since all annihilation operators commute with one another as do all creation operators. This operation is only defined for free fields satisfying the K.-G. equation, which tells us we can write the field in terms of time independent creation and annihilation operators. Redefine H to be : H :. This gets rid of the normal ordering constant.9 This is the first infinity encountered in this course. We’ll encounter more ferocious ones. We ran into it because we asked a dumb question, a physically uninteresting question, about an unobservable quantity. Later on we’ll have to think harder about what we’ve done wrong to get rid of troublesome infinities. If we wanted to get as quickly as possible to applications of QFT, we’d develop scattering theory and perturbation theory next. But first we are going to get some more exact results from field theory.
Symmetries and Conservation Laws
We will study the relationship between symmetries (or invariances) and conservation laws in the classical Lagrangian framework. Our derivations will only use classical equations of motion. Hopefully everything will go through all right when Poisson brackets become commutators, since the commutation relations are set up to reproduce the classical equations of motion. In any given case you can check to see if ordering ambiguities or extra terms in commutators screw up the calculations. Given some general Lagrangian L(qa, q˙a, t) and a transformation of the generalized coor- dinates
qa(t) qa(t, λ) qa(t, 0) = qa(t) → 9 What’s wrong?
a†a = aa† 1 Normal order both sides − : a†a : =: a†a : 1 − 0= 1 − Answer. We don’t “normal order” equations. Normal ordering is not derived from the ordinary product any more than the cross product is derived from the scalar product. Normal ordering cannot accurately be used as a verb. 5. October 7 Notes from Sidney Coleman’s Physics 253a 41
we can define ∂qa Dqa ≡ ∂λ λ=0
This is useful because the little transformations will be important. Some examples: 1. Space translation of point particles described by n vectors ~r a, a = 1,...,n. The Lagrangian we’ll use is
n m a ~r˙ a 2 V ( ~r a ~r b ) 2 | | − ab | − | a=1 X Xab The transformation is ~r a ~r a + ~eλ, all particles in the system moved by ~eλ → D~r a = ~e
2. Time translations for a general system. Given some evolution qa(t) of the system, the transformed system is a time λ ahead qa(t) qa(t + λ) → ∂qa Dqa = ∂t
3. Rotations in the system of example (1)
Rotation matrix ~r a R ( λ ~e )~r a D~r a = ~e ~r a → × z}|{ angle, axis |{z} |{z} (If ~e = z, Dxa = ya, Dya = xa, Dza = 0.) − Most transformations are not symmetries (invariances). b dF a a Definition: A transformation is a symmetry iff DL = dt for some F (q , q˙ , t). This equality must hold for arbitrary qa(t), not necessarily satisfying the equations of motion. Why is this a good definition? Hamilton’s principle is
2 2 t t dF 0= δS = dt DL = dt = F (t2) F (t1) 1 1 dt − Zt Zt Thus a symmetry transformation won’t affect the equations of motion (we only consider transformations that vanish at the endpoints). Back to our examples: 5. October 7 Notes from Sidney Coleman’s Physics 253a 42
1. DL = 0, F =0
2. If L has no explicit time dependence, all of its time dependence comes from its depen- a a ∂L dL dence on q andq ˙ , ∂t = 0, then DL = dt , F = L 3. F =0
Theorem (E. Noether say ‘Nota’): For every symmetry there is a conserved quantity. The proof comes from considering two expressions for DL.
∂L a a ∂L DL = Dq + p Dq˙ used pa a ∂qa a ≡ ∂q˙ a X a a = (p ˙aDq + paDq˙ ) used E-L equations a X d a by equality of mixed partials = paDq Dq˙a= d Dqa dt dt a X dF By assumption DL = dt . Subtracting these two expressions for DL we see that the quantity dQ Q = p Dqa F satisfies =0 a − dt a X (There is no guarantee that Q = 0, or that for each independent symmetry we’ll get another independent Q, in fact the construction6 fails to produce a Q for gauge symmetries.) We can write down the conserved quantities in our examples
˙ a a 1. (pa = ma~r ), D~r = ~e, F = 0.
Q = m ~e ~r˙ a = ~e m ~r˙ a a · · a a a X X For each of three independent ~e’s we get a conservation law. The momentum
dP~ P~ = m ~r˙ a is conserved. =0 . a dt a ! X Whenever we get conserved quantities from spatial translation invariance, whether or not the system looks anything like a collection of point particles, we’ll call the conserved quantities the momentum. 5. October 7 Notes from Sidney Coleman’s Physics 253a 43
a ∂qa 2. Dq = ∂t , F = L. Note: Q is identical to H. ∂L Q = p q˙a L is conserved when =0 a − ∂t a X Whenever we get a conserved quantity from time translation invariance, we’ll call the conserved quantity the energy. 3. D~r a = ~e ~r a, F = 0. × Q = ~p (~e ~r a)= ~e (~r a ~p ) a · × · × a a a X X = ~e ~r a ~p three laws · × a a X dJ~ J~ = ~r a ~p is conserved =0 × a dt a ! X Whenever we get a conserved quantities from rotational invariance, we’ll call them the angular momentum. There is nothing here that was not already in the Euler-Lagrange equations. What this theorem provides us with is a turn the crank method for obtaining conservation laws from a variety of theories. Before this theorem, the existence of conserved quantities, like the energy, had to be noticed from the equations of motion in each new theory. This theorem organizes conservation laws. It explains, for example, why a variety of theories, including ones with velocity dependent potentials all have a conserved Hamiltonian, or energy (example (2)). From the conserved quantity, we can usually reconstruct the symmetry. This can be done in classical mechanics using Poisson brackets. We’ll do it in quantum mechanics using commutators.10 Assume Dqb and F depend only on the qa, not theq ˙a so that their expression in the a Hamiltonian formulation only depends on the q , not the pa. Then
a b a a b [Q, q ]= pbDq F, q = [pb, q ] Dq " − # b b iδa X X − b = iDqa − | {z } This assumption is not at all necessary. Usually the result holds. For example the energy generates time translations even though the assumption doesn’t hold.
10 i j k i Show: [Q ,Q ] = iǫijkQ when the Q ’s are generators of an SO(3) internal symmetry. For a general internal symmetry group, the quantum Q’s recreate the algebra of the generators. 5. October 7 Notes from Sidney Coleman’s Physics 253a 44
Symmetries and Conservation Laws in (Classical) Field Theory
Field theory is a specialization of particle mechanics. There will be more that is true in field theory. What is this more? Electromagnetism possesses a conserved quantity Q, the electric charge. The charge is the integral of the charge density ρ, Q = d3xρ(~x, t). There is also a current ~ and there is a much stronger statement of charge conservation than dQ = 0. Local charge conservation R dt says ∂ρ + ~ ~ =0 ∂t ∇· Integrate this equation over any volume V with boundary S to get
dQV using Gauss’s theorem = dA n ~ Q= d3xρ(~x,t) dt − · ( V ) ZS R This equation says you can see the chargeb change in any volume by watching the current flowing out of the volume. You can’t have: t
x
simultaneously wink out of existence with nothing happening anywhere else.
This picture satisfies global charge conservation, but violates local charge conservation. You have to be able to account for the change in charge in any volume, and there would have to be a flow of current in between the two charges. Even if there were not a current and a local conservation law, we could invoke special relativity to show this picture is inconsistent. In another frame the charges don’t disappear simultaneously, and for a moment global charge conservation is violated. Field theory which embodies the idea of local measurements, should have local conser- vation laws. Given some Lagrangian density (φa, ∂ φa, x) and a transformation of the fields L µ φa(x) φa(x, λ) φa(x, 0) = φa(x) → a ∂φa we define Dφ = ∂λ λ=0.
5. October 7 Notes from Sidney Coleman’s Physics 253a 45
µ µ a a Definition: A transformation is a symmetry iff D = ∂µF for some F (φ , ∂µφ , x). This equality must hold for arbitrary φa(x) not necessarilyL satisfying the equations of mo- tion. We are not using special relativity. F µ is not necessarily a 4-vector (just some set of 4 objects). Note that the previous definition can be obtained. To wit, d DL = D d3x = d3x∂ F µ = d3x∂ F 0 = F L µ 0 dt Z Z Z where F = d3xF 0. Why is this a good definition? Hamilton’s principle is R 0= δS = d4xD = d4x∂ F µ L µ Z Z Z 3 0 0 blithe as usual about = d x[F (~x, t2) F (~x, t1)] contributions from − spatial infinity Z Thus a symmetry transformation does not affect the equations of motion (we only consider variations that vanish at the endpoints when deriving the equations of motion.). Theorem (maybe this is Noether’s theorem?): For every symmetry there is a conserved current. The proof comes from considering two expressions for D . L
∂L a µ a µ ∂L D = Dφ + π D∂ φ usedπa ∂∂ φa L ∂φa a µ ≡ µ a X µ a µ a = (∂µπa Dφ + πa D∂µφ ) used E-L equations a X µ a by equality of mixed partials a a = ∂µ πa Dφ D∂µφ =∂µDφ a X µ By assumption D = ∂µF . Subtracting these two expressions for D we see that the four quantities L L J µ = πµDφa F µ satisfy ∂ J µ =0 a − µ a X dQV ~ which implies dt = S dA n J , S =boundary of V . The total amount of stuff, Q is This equation− justifies calling· J0 the densityR of stuff and Ji the current ofb stuff. independent of time. There is an ambiguity in the definition of F µ and J µ since F µ can be changed by any µ µ χ satisfying ∂µχ = 0. In the particle mechanics case, F was ambiguous, but only by a 5. October 7 Notes from Sidney Coleman’s Physics 253a 46
µ µ µν time independent quantity. For arbitrary antisymmetric A we can let F F + ∂νA (∂ ∂ Aµν = 0). As a result of this change J µ J µ ∂ Aµν . → µ ν → − ν
Q = d3xJ 0 Q d3x∂ Ai = Q → − i Z Z i.e. Q is unchanged, ignoring contributions from spatial infinity as usual. This is a lot of freedom in the definition of J µ. For example in a theory with three fields we can let J µ J µ (φ3)14∂ (∂µφ1∂ν φ2 ∂ν φ1∂µφ2) → − ν − This J µ is as good as any other. There are 500 papers arguing about which energy- momentum tensor is the right one to use inside a dielectric medium. 490 of them are idiotic. It’s like if someone passes you a plate of cookies and you start arguing about which copy is #1 and which is #2. They’re all edible! Sometimes one J µ is more useful in a given calculation than another, for some reason. Instead of arguing that the most convenient J µ is the “right” one, you should just be happy that you had some freedom of choice. From cranking Hamilton’s principle, we can give another derivation of the relation be- tween symmetries and conserved quantities.
a t2 δS = δ dtL = paDq |t1 a Z X for an arbitrary variation about a solution of the equations of motion. By assumption of a symmetry dF t2 δS = dtDL = dt dt Z Z t1
Subtracting we see p Dqa F is time independent. a a − P [Aside: Noether’s Theorem derived at the level11 of the action
If without using the equations of motion, for an arbitrary function of space time α(x) parametrizing a transformation, to first order in α(x)
δ = α(x)∂ kµ + ∂ α(x)jµ L µ µ (Equivalent to D = ∂ kµ when α is a constant). L µ 11i.e. using Hamilton’s principle rather than mucking around with the equations of motion 5. October 7 Notes from Sidney Coleman’s Physics 253a 47
Then by Hamilton’s principle, for fields satisfying the equations of motion,
0= d4xδ = d4x(α(x)∂ kµ + ∂ α(x)jµ) L µ µ Z Z 4 µ µ Let α(x) vanish at = d x(α(x)∂ k α(x)∂ j ) ∞ µ − µ to do parts integration. Z So it must be that J µ = kµ jµ is conserved. − Example: Space-time translation invariance
δφ = ǫ ∂ν φ δ∂ φ = ∂ (ǫ ∂ν φ) − ν τ − τ ν ∂ ∂ δ = Lδφ + L δ∂τ φ L ∂φ ∂∂τ φ
∂ ν ∂ ν ∂ ν ν is a parameter for the = Lǫν ∂ φ L ǫν ∂ ∂τ φ L ∂τ ǫν ∂ φ type of transformation. − ∂φ − ∂∂τ φ − ∂∂τ φ ν µ ν = ǫ ′ π ∂ φ∂ ǫ µ plays the same role − ν L − µ ν I read off kµν = gµν , jµν = πµ∂ν φ so T µν = πµ∂ν φ gµν is conserved.] − L − − L Noether’s Theorem: when the variation is a hermitian traceless matrix. (useful in theories with fields most easily written as matrices.)
matrix 4 µ Suppose (without using the equations of motion) δS = d x Tr δω ∂µ F . hermitian, traceless R but otherwise arbitraryz}|{ Then Hamilton’s principle says |{z}
4 µ 0= d xTr δω∂µF Z Then F µ is a matrix of conserved currents, except there is no conservation law associated with the trace. If you like, write this statement as 1 ∂ F µ Id Tr F µ =0 µ − N dimension of F µ |{z} 5. October 7 Notes from Sidney Coleman’s Physics 253a 48
Space-time translations in a general field theory
We won’t have to restrict ourselves to scalar fields because under space-time translations, an arbitrary field, vector, tensor, etc., transforms the same way. Just let the index a also denote vector or tensor indices. Translations are symmetries as long as does not have any explicit dependence on x. It L a a ∂ depends on x only through its dependence on φ and ∂µφ , ∂xL = 0.
a a φ (x) φ (x + λe) e some fixed four vector → Dφa = eν∂ φa D = eµ∂ = ∂ (eµ ) ν L µL µ L F µ = eµ J µ = πµeν∂ φa eµ = e T µν L a ν − L ν a X T µν = πµ∂ν φa gµν a − L a X There are four conserved currents, four local conservation laws, one for each of the four µν µ independent directions we can point e, i.e. ∂µT = 0 since ∂µJ = 0 for arbitrary e. T µ0 is the current that is conserved as a result of time translation invariance, indeed,
= T 00 = π φ˙a , H = d3x H a −L H a X Z T 00 is the energy density, T i0 is the current of energy. T iµ are the three currents from spatial translation invariance.
P i = d3xT 0i Z T 0i = the density of the ith component of momentum T ji = the jth component of the current of the ith component of momentum
µ µ a For a scalar field theory with no derivative interactions, πa = ∂ φ so
T µν = ∂µφa∂ν φa gµν − L a X Note that T µν is symmetric, so you don’t have to remember which index is which. T µν can be nonsymmetric, which can lead to problems, for example gravity and other theories of higher spin. Can try to symmetrize it. 6. October 9 Notes from Sidney Coleman’s Physics 253a 49
6 October 9 Lorentz transformations
Under a Lorentz transformation all vectors transform as
aµ Λµaν → ν µ µ where Λν specifies the Lorentz transformation. Λν must preserve the Minkowski space inner product, that is if bµ Λµbν then a bµ a bµ → ν µ → µ µ ν This must be true for arbitrary a and b. (The equation this condition gives is gµνΛαΛβ = gαβ.) We’ll be interested in one parameter subgroups of the group of Lorentz transformations parametrized by λ. This could be rotations about some specified axis by an angle λ or a boost in some specified direction by a rapidity λ. In any case, the Lorentz transformation is given by a family aµ aµ(λ)=Λ(λ)µaν → ν Under this (active) transformation (we are not thinking of this as a passive change of coordinates) the fields φa transform as (φa is a scalar),
a a a 1 φ (x) φ (x, λ)= φ (Λ(λ)− x) → We are restricting ourselves to scalar fields. Even though we only used scalar fields in our examples of T µν, the derivation of the conservation of T µν from space-time translation invariance applies to tensor, or vector, fields. With Lorentz transformations, we only consider scalars, because there are extra factors in the transformation law when the fields are tensorial. µ µ ν 1 For example a vector field A (x) Λν A (Λ− x). ∂φ → We need to get Dφ = ∂λ λ=0. We’ll define
µ µ µ DΛ ǫ defines some matrix ǫν ν ≡ ν µ µ From the invariance of a bν , we’ll derive a condition on ǫν .
µ µ µ 0= D(a bµ)=(Da )bµ + a (Dbµ) µ ν µ ν = ǫ νa bµ + a ǫµ bν relabel dummy indices in the second term, ν µ ν µ µ ν,ν µ. = ǫµν a b + ǫνµa b ⇓ → → =(ǫ + ǫ )aν bµ ǫ = ǫ µν νµ ⇒ µν − νµ 4 (4 1) · − since this has to hold for arbitrary a and b. µ and ν range from 0 to 3, so there are 2 =6 independent ǫ, which is good since we have to generate 3 rotations (about each axis) and 3 boosts (in each direction). 6. October 9 Notes from Sidney Coleman’s Physics 253a 50
As a second confidence-boosting check we’ll do two examples. Take ǫ = ǫ = 1, all other components zero. 12 − 21 Da1 = ǫ1a2 = ǫ a2 = a2 2 − 12 − Da2 = ǫ2a1 = ǫ a1 =+a1 1 − 21 x2
a
x1 infinitesimally shifted a
This says a1 gets a little negative component proportional to a2 and a2 gets a little component proportional to a1. This is a rotation, in the standard sense about the positive z axis. Take ǫ = ǫ = +1, all other components zero. 01 − 10 0 0 1 1 1 Da = ǫ1a = ǫ01a = a Da1 = ǫ1a0 = ǫ a0 = a0 0 − 10 This says x1, which could be the first component of the position of a particle, gets a little contribution proportional to x0, the time, which is definitely what a boost in the x1 direction does. In fact, Da0 = a1, Da1 = a0 is just the infinitesimal version of
a0 cosh λa0 + sinh λa1 → a1 sinh λa0 + cosh λa1 → Without even thinking, the great index raising and lowering machine has given us all the right signs. Now assuming is a scalar, we are all set to get the 6 conserved currents. L 6. October 9 Notes from Sidney Coleman’s Physics 253a 51
1 1 1µ µ From Λ− (λ)Λ(λ)=1,0= D(Λ− Λ) and D Λ− = ǫ ν |Λ=1 − ν There are extra a ∂ a 1 µ ν a Dφ (x)= φ (Λ− (λ) x ) terms in Dφ if ∂λ ν φa are not scalars λ=0 a 1 σ τ = ∂σφ (x)D(Λ− (λ)τ x )
= ∂ φa(x)( ǫσ)xτ = ǫ xτ ∂σφa(x) σ − τ − στ
Using the assumption that is a scalar depending only on x through its dependence on a a L φ and ∂µφ we have D = ǫ xλ∂σ L λσ L = ∂ [ǫ xλgµσ ] µ λσ L The conserved current J µ is J µ = πµǫ xλ∂σφa ǫ xλgµσ a λσ − λσ L a X = ǫ πµxλ∂σφa xλgµσ λσ a − L a ! X This current must be conserved for all six independent antisymmetric matrices ǫλσ, so the part of the quantity in parentheses that is antisymmetric in λ and σ must be conserved µλσ i.e. ∂µM = 0 where
M µλσ = πµxλ∂σφa xλgµσ (λ σ) a − L − ↔ a ! X = xλ πµ∂σφa gµσ (λ σ) a − L − ↔ a ! X = xλT µσ xσT µλ − If the φa were not scalars, we would have additional terms, feeding in from the extra terms in Dφa. The 6 conserved charges are
J λσ = d3xM 0λσ = d3x(xλT 0σ xσT 0λ) − Z Z For example, J 12, the conserved quantity coming from invariance under rotations about 3 i 1 jk the 3 axis, often called, J is (J = 2 ǫijkJ ),
J 3 = J 12 = d3x(x1T 02 x2T 01) − Z 6. October 9 Notes from Sidney Coleman’s Physics 253a 52
If we had point particles with
T 0i(~x, t)= pi δ(3)(~x ~ra(t)) a − a X J 3 would be (xa1p2 xa2p1)= (~ra ~p ) a − a × a 3 a a X X We have found the field theory analog of the angular momentum. The particles them- selves could have some intrinsic angular momentum. Those contributions to the angular momentum are not in the J i. We only have the orbital contribution. Particles that have intrinsic angular momentum, spin, will be described by fields of tensorial character, and that will be reflected in extra terms in the J ij. So far we have just found the continuum field theory generalization of three conserved quantities we learn about in freshman physics. But we have three other conserved quantities, the J 0i. What are they? Consider
J 0i = d3x[x0T 0i xiT 00] − Z This has an explicit reference to x0, the time, something we’ve not seen in a conservation law before, but there is nothing a priori wrong with that. We can pull the x0 out of the dJ0i integral over space. The conservation law is dt = 0 so we have d d 0= J 0i = t d3xT 0i d3xxiT 00 dt dt − Z Z d ✚✚ d = t d3✚xT 0i + d3xT 0i d3xxiT 00 ✚ dt ✚ −dt ✚Z Z Z ✚ pi pi Dividing through by p0 gives| {z } | {z }
i d 3 i 00 d i p dt d xx T dt (“center of energy” ) constant = 0 = 0 = p R p total energy This says that the “center of energy” moves steadily. T 00 is the relativistic generalization of mass. This is the relativistic generalization of the statement that the center of mass moves steadily. You aren’t used to calling this a conservation law, but it is, and in fact it is the Lorentz partner of the angular momentum conservation law. 6. October 9 Notes from Sidney Coleman’s Physics 253a 53
Internal Symmetries
There are other conservation laws, like conservation of electric charge, conservation of baryon number, and conservation of lepton number that we have not found yet. We have already found all the conservation laws that are in a general Lorentz invariant theory. These addi- tional conservation laws will only occur in specific theories whose Lagrange densities have special properties. Conservation laws are the best guide for looking for theories that actu- ally describe the world, because the existence of a conservation law is a qualitative fact that greatly restricts the form of the Lagrange density. All these additional charges are scalars, and we expect the symmetries they come from to commute with Lorentz transformations. The transformations will turn fields at the same spacetime point into one another. They will not relate fields at different spacetime points. Internal symmetries are non-geometrical sym- metries. Historically, the name comes from the idea that what an internal symmetry did was transform internal characteristics of a particle. For us internal just means non-geometrical. We’ll study internal symmetries with two examples. The first example is
2 1 2 = ∂ φa∂µφa µ2φaφa g (φa)2 L 2 µ − − a=1 a ! X X This is a special case of a theory of two scalar fields. Both fields have the same mass, and the potential only depends on the combination (φ1)2 +(φ2)2. This Lagrangian12 is invariant (D = 0) under the transformation L φ1 φ1 cos λ + φ2 sin λ → SO(2) symmetry φ2 φ1 sin λ + φ2 cos λ → − The same transformation at every space time point. This is a rotation13 in the φ1, φ2 plane. The Lagrangian is invariant because it only depends on (φ1)2 +(φ2)2 and that combination is unchanged by rotations.
Dφ1 = φ2, Dφ2 = φ1 D =0 F µ =0 − L J µ = πµDφ1 + πµDφ2 =(∂µφ1)φ2 (∂µφ2)φ1 1 2 − Q = d3xj0 = d3x(∂ φ1φ2 ∂ φ2φ1) 0 − 0 Z Z 12We have left particle mechanics behind and I’ll often use Lagrangian to mean Lagrange density. 13Clockwise rotation makes the signs come out conventionally if b and c (which we see a little bit later) are defined like a and b in Itzykson and Zuber p. 121. 6. October 9 Notes from Sidney Coleman’s Physics 253a 54
We can get some insight into this quantity by going to the case g = 0 in which case φ1 and φ2 are both free fields and can be expanded in terms of creation and annihilation ops.
3 a d k a ik x a ik x φ (x)= (a e− · + a †e · ) (2π)3/2 2ω ~k ~k Z ~k Now we’ll compute Q. Let’s have faithp in our formalism and assume that the terms with two creation ops or two annihilation ops go away. If they didn’t Q wouldn’t be time 2iω t independent since they are multiplied by e± ~k .
3 d k 1 2 2 1 Q = [a a †( iω~ iω~ )+2iω~ a a †] 2ω ~k ~k − k − k k ~k ~k Z ~k 1 2 We don’t have to worry about the order because [a , a † ]=0 ~k ~k ′
3 1 2 2 1 Q = i d k[a †a a †a ] ~k ~k − ~k ~k Z We are within reach of something intuitive. Define
1 2 1 2 a + ia a † ia † ~k ~k ~k ~k b~ = b† = − k √2 ~k √2 That’s not the end of the definitions; we need the other combination to reconstruct a1, ~k 2 1 2 a , a †, a †. So define, ~k ~k ~k
1 2 1 2 a ia a † + ia † ~k ~k ~k ~k c~ = − c† = k √2 ~k √2 These linear combinations of operators are allowable14. They are operators that create (or destroy) a superposition of states with particle 1 and particle 2. If one state is degenerate with another, and for a given ~k, ~k, 1 is degenerate with ~k, 2 , it is often convenient to work | i | i with linear combinations of these states as basis states. Because b† and c† create orthogonal ~k ~k states, b~ and c† commute with each other, as is easily checked. The useful thing about these k ~k linear combinations is that Q has a simple expression.
b† + c† b† c† 1 2 2 1 ~k ~k b~k c~k ~k ~k b~k + c~k i(a †a a †a )= i − + i − ~k ~k − ~k ~k √2 · √2i √2i · √2 1 = [2b† b~ +0b† c~ +0c† b~ 2c† c~ ]= b† b~ c† c~ 2 ~k k ~k k ~k k − ~k k ~k k − ~k k 14 Exchanging the role of b~k and c~k is the way of making the signs come out conventionally with counter- clockwise rotation. 6. October 9 Notes from Sidney Coleman’s Physics 253a 55
That is, 3 Q = d k(b† b~ c† c~ )= N N ~k k − ~k k b − c Z The b’s carry Q charge +1, the c’s carry Q charge -1. It’s like particles and antiparticles; the b’s and c’s have the same mass and opposite charge. Q = Nb Nc is true as an operator equation. b and c type mesons are eigenstates of Q. 1 and 2− type mesons are not. An eigenstate of Na and Nb is an eigenstate of Q, an eigenstate of N1 and N2 is not. By the way H and P~ have familiar forms in terms of the b’s and c’s.
3 3 H = d kω (b† b + c† c ) P~ = d k ~k(b† b + c† c ) ~k ~k ~k ~k ~k ~k ~k ~k ~k Z Z Taking all these linear combinations suggests that we could have changed bases earlier in the calculation and simplified things.
φ1 + iφ2 ψ = √2 3 d k ik x ik x = [b~ e− · + c† e · ] (2π)3/2 2ω k ~k Z ~k ψ always diminishes Q by 1 either byp annihilating a b type particle or by creating a c type particle. [Q, ψ]= ψ a Q “eigenfield” − There is also the hermitian conjugate of ψ, ψ†,
φ1 iφ2 ψ† = − √2 3 d k ik x ik x = [c~ e− · + b† e · ] (2π)3/2 2ω k ~k Z ~k p (ψ† will be denoted ψ∗ in the classical limit. A quantum field ψ∗ will be understood to be ψ†.) ψ† always increases Q by 1. [Q, ψ†]=+ψ†. 1 2 ψ and ψ† have neat commutation relations with Q. φ and φ have messy commutation relations with Q.
1 1 1 [Q, φ ]= [Q, ψ + ψ†]= ( ψ + ψ†) √2 √2 − 2 = iφ In agreement with a formula which 2 − 1 is usually true [Q,φa]= iDφa [Q, φ ]= iφ − 6. October 9 Notes from Sidney Coleman’s Physics 253a 56
iλ iλ Under our transformation ψ e− ψ, ψ∗ e ψ∗. This is called a U(1) or phase transformation. It is equivalent to→SO(2). → Digression; we’ll get back to symmetries. 1 2 As quantum fields, ψ and ψ† are as nice as φ and φ . They obey
2 2 ( + µ )ψ =0 ( + µ )ψ† =0
They obey simple equal time commutation relations:
[ψ(~x, t), ψ(~y, t)] = 0 = [ψ†(~x, t), ψ†(~y, t)] = [ψ(~x, t), ψ†(~y, t)]
[ψ(~x, t), ψ˙(~y, t)] = 0 = [ψ†(~x, t), ψ˙ †(~y, t)] (3) [ψ(~x, t), ψ˙ †(~y, t)] = iδ (~x ~y) = [ψ†(~x, t), ψ˙(~y, t)] − These equations can be obtained by doing something completely idiotic. Back to the classical Lagrangian which is
µ 2 1 = ∂ ψ∗∂ ψ µ ψ∗ψ (no ) L µ − 2 Imagine a person who once knew a lot of quantum field theory, but has suffered brain damage, is going to canonically quantize this theory. He has forgotten that stands for ∗ complex conjugate, and he is going to treat ψ and ψ∗ as if they were independent fields. Here he goes
µ ∂ µ µ µ πψ = L = ∂ ψ∗ πψ∗ = ∂ ψ ∂∂µψ The Euler-Lagrange equations he gets are
µ ∂ 2 ∂ π = L i.e. ψ∗ = µ ψ∗ µ ψ ∂ψ − The idiot got it right. He also gets ψ = µ2ψ. How about that!? Now he says “I’m − going to deduce the canonical commutation relations”:
(3) 0 iδ (~x ~y) = [ψ(~x, t), π (~y, t)] = [ψ(~x, t), ∂ ψ∗(~y, t)] − ψ 0 By God, he gets that right, too. How can you work with complex fields, which can’t be varied independently, treat them as if they can be and nevertheless get the right answers? We’ll demonstrate that this works for the equations of motion. It works very generally. You can demonstrate that it works for the equal time commutation relations. Suppose I have some action S(ψ, ψ∗). The equations of motion come from
4 0= δS = d x(Aδψ + A∗δψ∗) Z 6. October 9 Notes from Sidney Coleman’s Physics 253a 57
Naive Approach: Treating the variations in δψ and δψ∗ as independent we get
A = 0 and A∗ =0
Sharp Approach: However δψ is not independent of ψ∗. We can use the fact that ψ and ψ∗ are allowed to be complex. We can make purely real variations in ψ so that δψ = δψ∗ and we can make purely imaginary variations in ψ so that δψ = δψ∗. From the − real variation we deduce A + A∗ =0 and from the imaginary variation we deduce
A A∗ =0 − which implies A = A∗ = 0. ψr+iψ ψr iψ For the ETCR write ψ = i , ψ = − i . Nothing tricky or slick. √2 † √2 Back to internal symmetries. For our second example, take 2 1 n n = (∂ φa∂µφa µ2φaφa) g (φa)2 L 2 µ − − a=1 a=1 ! X X This is the same as our first example except we have n fields instead of 2. Just as in the first example the Lagrangian was invariant under rotations mixing up φ1 and φ2, this Lagrangian is invariant under rotations mixing up φ1, ... , φn, because it only depends on (φ1)2 + +(φn)2. The notations are, ··· φa Ra φb → b b n Xn rotation matrix × n(n 1) − |{z} There are 2 independent planes in n dimensions and we can rotate in each of them, n(n 1) − so there are 2 conserved currents and associated charges. This example is quite differ- ent from the first one because the various rotations don’t in general commute. (They all commuted in the first one by virtue of the fact that there was only one.) We don’t expect the various charges to commute. If they did they would generate symmetries that commute. Anyway, for any single rotation axis, the symmetry is just like the one we had in the first example, so we can read off the current, J [a,b] = ∂ φaφb ∂ φbφa µ µ − µ You can’t find combinations of the fields that have simple commutation relations with all the Q[a,b]’s. For n = 3 you can choose the fields to have a simple commutation relation [1,2] with one charge, say Q . This is just like isospin, which is a symmetry generated by I1, + 0 I2 and I3. You can only choose the particle states π , π and π− to be eigenstates of one of them, I3. 7. October 14 Notes from Sidney Coleman’s Physics 253a 58
7 October 14 Lorentz transformation properties of conserved quanti- ties
We’ve worked with three currents, J µ, the current of meson number, T µν, the current of the νth component of momentum, and M µνλ, the current of the [νλ] component of angular momentum. We integrated the zeroth component of each of these currents to obtain Q, P ν and J νλ respectively. These look like tensors with one less index, but do they have the right Lorentz transformation properties? We’ll prove that P ν = d3xT 0ν is indeed a Lorentz vector given that T µν is a two index tensor and that T µν is conserved, ∂ T µν = 0. The generalization to currents with R µ more indices or 1 index will be clear. We could do this proof in the classical theory or the quantum theory. We’ll choose the latter because we need the practice. The assumption that T µν is an operator in quantum field theory that transforms as a two-index tensor is phrased mathematically as follows: Given U(Λ) the unitary operator that effects Lorentz transformations in the theory µν µ ν στ 1 U(Λ)†T (x)U(Λ) = ΛσΛτ T (Λ− x) Now we’ll try to show that P ν = d3x T 0ν(~x, 0) Z is a Lorentz vector. Introduce n = (1, 0, 0, 0), a unit vector pointing in the time direction. Then we can write P ν in a way that makes its Lorentz transformation properties clearer.
P ν = d4x n T µνδ(n x) µ · Z Perform a Lorentz transformation on P ν
ν 4 µν U(Λ)†P U(Λ) = d x n U(Λ)†T (x)U(Λ)δ(n x) µ · Z 4 µ ν στ 1 = d x n Λ Λ T (Λ− x)δ(n x) µ σ τ · Z 1 1 Change integration variables x′ =Λ− x and define n′ =Λ− n
ν 4 ρ µ ν στ U(Λ)†P U(Λ) = d x′ Λ n′ Λ Λ T (x′)δ(n′ x′) µρ σ τ · Z 4 ν ρτ µ = d x′ n′ Λ T (x′)δ(n′ x′) Using ΛµρΛσ = gρσ ρ τ · Z ν 4 ρτ =Λ d x n′ T (x)δ(n′ x) τ ρ · Z 7. October 14 Notes from Sidney Coleman’s Physics 253a 59
In the last expression we’ve dropped the prime on the variable of integration. The only difference between what we have and what we would like to get
ν ν τ ν 4 ρτ U(Λ)†P U(Λ) = Λ P =Λ d x n T δ(n x) τ τ ρ · Z is that n has been redefined. The surface of integration is now t′ = 0, and we take the µν component nµ′ T in the t′ direction. Our active transformation has had the exact same 1 effect as if we had made a passive transformation changing coordinates to x′ = Λ− x. It’s the same old story: alias (another name) versus alibi (another place). passive active |{z} t' |{z} t x' n' n
x
You can think of this as Lorentz transforming the field or Lorentz transforming the surface. This doesn’t produce a vector P ν for any tensor T µν . For an arbitrary tensor, P ν is not even independent of what time you compute at, let alone changing the tilt of the surface. We need to use that T µν is conserved. More or less, that the current that flows through the surface t′ = 0 is the same as the current that flows through the surface t = 0. ν ν Note that n δ(n x)= ∂ θ(n x), so that if I call the Lorentz transform of P , P ′ , what µ · µ · we are trying to show is
ν 1ν σ 0= P Λ− P ′ − σ 4 µν = d x[∂ θ(n x) ∂ θ(n′ x)]T (x) µ · − µ · Z 4 µν = d x∂ [θ(n x) θ(n′ x)]T (x) µ · − · Z This integral over all spacetime is a total divergence. In the far future or the far past θ(n x) = θ(n′ x) so there are no surface terms there, and as usual we won’t worry about surface· terms at· spatial infinity. 7. October 14 Notes from Sidney Coleman’s Physics 253a 60
Discrete Symmetries
a a A discrete symmetry is a transformation q (t) q ′(t) that leaves the Lagrangian, L, un- changed L L. This could be parity, this could→ be rotation by π about the z axis. (It does → not include time reversal, wait15.) Since all the properties of the theory are derived from the Lagrangian, (the canonical commutation relations, the inner product, the Hamiltonian all come from L) and since the Lagrangian is unchanged, we expect that there is a unitary operator effecting the transfor- mation
a a U †q (t)U = q ′(t) U †HU = H
The discrete symmetry could be an element of a continuous symmetry group. It could be rotation by 20◦. There is no conserved quantity associated with a discrete symmetry however. What is special about the continuous symmetry is that there is a parameter, and that you have a unitary operator for each value of the parameter, θ, satisfying
U(θ)†HU(θ)= H
You can differentiate this with respect to the parameter to find that dU [I,H]=0 I i iDU ≡ − dθ ≡ − θ=0
There is nothing analogous for discrete symmetries.
Examples of internal symmetries (there is not a general theory, but we’ll do prototypical examples)
Example (1). The transformation is φ(x) φ(x) at every space-time point. → − This is a symmetry for any Lagrangian with only even powers of φ, in particular, 1 1 = (∂ φ)2 µ2φ2 λφ4 L 2 µ − 2 − →L There should be a unitary operator effecting the transformation
φ U †φU = φ → − For λ = 0, we will actually be able to construct U, that is give its action on the creation and annihilation operators, and thus its action on the basis states.
15See Dirac Principles of QM, pp. 103ff 7. October 14 Notes from Sidney Coleman’s Physics 253a 61
Since φ is a linear function of a and a† it must be that ~k ~k
U †a U = a ~k − ~k and the h.c. equation U †a† U = a† ~k − ~k We’ll also make a choice in the phase of U by specifying U 0 = 0 | i | i We can determine the action of U on the basis states
U k1,...,kn = Ua~† a~† a~† 0 | i k1 k2 ··· kn | i
= Ua~† U †Ua~† U † Ua~† U †U 0 k1 k2 ··· kn | i n n =( 1) a~† a~† a~† 0 =( 1) k1,...,kn − k1 k2 ··· kn | i − | i As an operator statement we see that
N iπN (N=meson number U =( 1) (= e if you prefer) 3 † = d ka a~ ) − ~k k R Suppose we could construct this operator when λ = 0. The existence of this unitary 6 operator tells you that you’ll never see 2 mesons scatter into 43 mesons or any odd number of mesons. Mesons are always produced in pairs. More formally, it must be that 0 = n S m h | | i where n is a state with n mesons, m is a state with m mesons and S is the scattering matrix| madei from the Hamiltonian, whenever| i n + m is odd. For if n + m is odd
n S m = n U †USU †U m = n U † (USU †) U m h | | i h | | i h | | i S because UHU †=H =( 1)n+m n S m = n S m − h | | i −h | | i{z } Our second example of a discrete internal symmetry will turn out to be Charge Conjugation. Recall that 2 1 1 = (∂ φa)2 µ2(φa)2 λ[(φ1)2 +(φ2)2]2 L 2 µ − 2 − a=1 X had an SO(2) symmetry. In fact it has an O(2) symmetry, rotations and rotations with reflections in the φ1φ2 plane. It is invariant under proper and improper rotations. We’ll take one standard improper rotation φ1 φ1 φ2 φ2 → → − 7. October 14 Notes from Sidney Coleman’s Physics 253a 62
Any other one can be obtained by composing this one with an element of the internal symmetry group SO(2). It is easy to write down the unitary operator in the case λ = 0. Then we have two independent free scalar field theories
U =( 1)N2 C − As mathematicians are fond of saying, we have reduced it to the previous case. The action of U is especially nice if we put it in terms of the fields ψ and ψ† already introduced φ1 + iφ2 φ1 iφ2 ψ = U † ψUC = − = ψ† √2 → C √2 ψ† ψ →
For this reason, this is sometimes called a conjugation symmetry. Because U † QU = Q C C − it is also called charge16 conjugation or particle-anti-particle conjugation. From the action on ψ and ψ† we see
UC† b~kUC = c~k and UC† c~kUC = b~k
and the equations obtained from these by hermitian conjugation. UC is unitary and hermi- tian: 2 U =1= U U † U = U † C C C ⇒ C C We could continue the discussion of discrete internal symmetries, but it is boring. You could write down a Lagrangian with four fields that is invariant under rotation in four- dimensional space and under permutations of any of the four fields. You could write down a theory with the icosahedral group.
Parity Transformations Any transformation that takes
φa(~x, t) M aφb( ~x, t) → b − Xb we’ll call a parity transformation. (We have used the fact that we live in an odd number of dimensions (3) and thus that ~x ~x is an improper rotation, in our definition. If → − we lived in two space dimensions we could do a similar thing with only x2 x2.) A parity transformation transforms each fundamental observable at the point →~x into − some linear combination of fundamental observables at the point ~x. − 16the charge from the SO(2) symmetry 7. October 14 Notes from Sidney Coleman’s Physics 253a 63
The usual confusing notation Usually parity takes (~x, t) ( ~x, t), when it is a symmetry, but all we really demand L →L − is that parity takes z }| { L(t) L(t) as usual. → Example (1) Parity is a symmetry of
1 µ2 P : φ(~x, t) φ( ~x, t) = (∂ φ)2 φ2 → − (M = 1) L 2 µ − 2 L L → From
3 d k ~ ~ ik ~x iω~ t ik ~x iω~ t φ(~x, t)= [a~ e · e− k + a† e− · e k ] (2π)3/2 2ω k ~k Z ~k and U † φ(~x, t)U = φ( ~x, t) P P − p we can see that the action on the creation and annihilation operators must be
a~k a ~k U † Up = − P a† a† ~k ( ~k) − and on the basis states
U ~k ,...,~k = ~k ,..., ~k P | 1 ni | − 1 − ni But there is a second possibility for parity
P ′ : φ(~x, t) φ( ~x, t) (M = 1) → − − − L L → Whenever there is an internal symmetry in a theory I can multiply one definition of parity by an element of that symmetry group (discrete or continuous) and get another definition of parity. In the case at hand the unitary operator UP ′ is given by N n U ′ =( 1) U or U ′ ~k ,...,~k =( 1) ~k ,..., ~k P − P P | 1 ni − | − 1 − ni Sometimes people distinguish between a theory with invariance under P : φ(~x, t) φ( ~x, t) and a theory with invariance under P ′ : φ(~x, t) φ( ~x, t), by → − → − − calling the first the theory of a scalar meson and the second a theory of a pseudo scalar meson. Our theory is invariant under both; it’s the old plate of cookies 7. October 14 Notes from Sidney Coleman’s Physics 253a 64
problem again. The theory has a set of invariances. As long as you are in agreement about the total set of invariances of the theory, you shouldn’t waste time arguing about what you’ll call each one. This is why the conventions on the parity of some particles is arbitrary (relative parity). If λφ4 is added to 3 − L both P and P ′ are still invariances of L. With a φ interaction P is a symmetry, P ′ isn’t. All the physicists in the world would agree that this is a scalar meson. One of the cookies has been poisoned. Example (2) This awful Lagrangian has been cooked up to illustrate a point. After this lecture you won’t see anything this bad again.
4 1 µ2 = (∂ φa)2 a (φa)2 gǫµνλσ∂ φ1∂ φ2∂ φ3∂ φ4 L 2 µ − 2 − µ ν λ σ a=1 X All four meson masses are different. The new interaction involving the totally antisymmetric tensor in four indices is invariant under L.T. for the same reason that ~a (~b ~c) is invariant under proper rotations. (~a (~b ~c) is multiplied by det R under· a rotation).× Because ǫµνλσ is nonzero only if· one× of its indices is timelike, the other three spacelike, three of the derivatives are on space coordinates. We get three minus signs under parity. An odd number of mesons are going to have to be pseudoscalar to get a net even number of minus signs. Because any φa φa → − is an internal symmetry of the free Lagrangian, it doesn’t matter which one you choose or which three you choose to be pseudoscalar. Example (3) Take the perverse Lagrangian of the last example by and make it worse by adding 4 (φa)3 a=1 X Now there is no definition of parity that gives a symmetry. This theory violates parity. Example (4) Sometimes people say that because the product of two reflections is 1, the square of parity is 1. This example is cooked up to show that a theory with parity can have indeed U 2 = 1. Indeed U cannot be chosen to satisfy U 2 = 1. P 6 P P 4 2 1 a 2 µ a 2 µ 2 = (∂ φ ) (φ ) + ∂ ψ∗∂ ψ m ψ∗ψ L 2 µ − 2 µ − a=1 X 4 a 3 µ 1 ν 2 λ 3 σ 4 2 2 h (φ ) gǫ ∂ φ ∂ φ ∂ φ ∂ φ [ψ + ψ∗ ] − − µνλσ a=1 X 7. October 14 Notes from Sidney Coleman’s Physics 253a 65
The transformation of the φa’s must be φa(~x, t) +φa( ~x, t). The only way to make the last term parity invariant is for ψ to transform→ − as
ψ iψ ψ∗ iψ∗ → ± → ∓
2 In either case UP = 1, UP† UP† ψUP UP = ψ. Fortunately, nothing like this occurs in nature (as far as6 we know). If it did,− and if parity were a symmetry (or an ap- proximate symmetry) of the world we would have a name for fields transforming like ψ, a “semi-pseudo-scalar”.
Time Reversal First a famous example from classical particle mechanics, a particle moving in a potential 1 L = mq˙2 V (q) T : q(t) q( t) 2 − → − T is not a discrete symmetry the way we have defined it
in the usual confusing notation where L(t) refers T : L(t) L( t) to the time dependence → − through the coordinates Nevertheless T does take one solution of the equations of motion into another. You might still hope there is a unitary operator that does the job in the quantum theory.
U † q(t)U = q( t) T T − There are two paradoxes I’ll give to show this can’t happen.
1st Paradox Differentiate U † q(t)U = q( t) with respect to t. Because p(t) q˙(t) T T − ∝ U † p(t)U = p(t) T T −
Consider U † [p, q]U = i. From the two relations we have just obtained we also have T T − Particularly poignant U † [p(t), q(t)]U = [p( t), q( t)] = i T T − − − − at t = 0 Looks like we would have to give up the canonical commutation relations to implement time reversal. If that isn’t enough to make you abandon the idea of a unitary time reversal operator I’ll continue to the 7. October 14 Notes from Sidney Coleman’s Physics 253a 66
iHt iHt 2nd Paradox Roughly, UT should reverse time evolution, i.e. UT† e− UT = e . I can prove this. For any operator (t) O iHt iHt (t)= e e− O O
Apply UT† the unitary transformation to both sides to obtain +iHt iHt ( t)= U † e U U † e− U = (0) O − T T O T T O O iHt iHt † −iHt but ( t)= e− U e let V U e UT O − T O ≡ T iHt iHt iHt 1 I’d like to show V = e . What we have is e− e = V − V which implies iHt iHt iHt O OiHt V e− = V e− . V e− commutes with any operator . V e− = 1. Now that I’ve provedO O O iHt iHt UT† e− UT = e
d Take dt t=0 of this relation, ✁ ✁ UT† ( iH)UT = iH, − canceling the i’s, we see that H is unitarily related to H. The spectrum of H cannot be bounded below, because (the spectrum of unitarily− related operators is the same and) the spectrum of H is not bounded above. AUGGH! A unitary time reversal operator is an object that makes no sense whatsoever. The answer is that time reversal is implemented by an antiunitary operator. Antiunitary operators are antilinear. Dirac notation is designed to automate the handling of linear operators, so for a while we’ll use some more cumbersome notation that does not automate the handling of linear operators. Let a, b denote states, α, β denote complex numbers and A, B denote operators. (a, b) is the inner product of two states. A unitary operator is an invertible operator, U, satisfying
(Ua, Ub)=(a, b) for all a, b unitarity This is enough of an assumption to show U is linear (see related proof below). The simplest unitary operator is 1.
U(αa + βb)= αUa + βUb linearity
The adjoint of a linear operator A is denoted A† and is the operator defined by (this definition is not consistent if A is not linear)
(a, A†b)=(Aa, b) for all a, b 7. October 14 Notes from Sidney Coleman’s Physics 253a 67
1 I’ll show that U † = U − (which is sometimes given as the definition of unitarity).
unitarity of U 1 1 (a, U − b) = (Ua, UU − b)=(Ua, b)
A transformation of the states, az}|{Ua, can also be thought of as a transformation of → the operators in the theory
(a, Ab) (Ua, AUb)=(a, U †AUb) can alternatively be thought of as A U †AU. → → An antiunitary operator is an invertible operator, Ω, (this is a notational gem, an upside down U) satisfying
(Ωa, Ωb)=(b, a) for all a, b antiunitarity
We can immediately make a little table showing the result of taking products of unitary and antiunitary operators. (The product of a unitary operator with an antiunitary operator U Ω is an antiunitary operator, etc.) U U Ω Ω Ω U We can prove (this is the related proof referred to above) that any operator (not neces- sarily invertible) satisfying the antiunitarity condition is antilinear.
(Ωa, Ωb)=(b, a) Ω(αa + βb)= α∗Ωa + β∗Ωb antilinearity ⇒
Consider (Ω(αa + βb) α∗Ωa β∗Ωb, Ω(αa + βb) α∗Ωa β∗Ωb). (You ask why?!) This − − − − is the inner product of Ω(αa + βb) α∗Ωa β∗Ωb with itself. If this is zero, the fact that − − the inner product is positive definite implies that Ω(αa + βb) α∗Ωa β∗Ωb = 0. The result − − we want! Indeed, it is simply a matter of expanding this inner product out into its 9 terms, applying the antiunitarity condition to each term, and then expand the 5 terms containing αa + βb some more to show this is zero. (The analogous proof for operators satisfying the unitarity condition also only uses properties of the inner product and is even easier.) The simplest antiunitary operator is complex conjugation, K. For the elements of some basis, bi, Kbi = bi and on any linear combination
K( αibi)= αi∗bi i i X X For consistency (bi, bj) must be real. This is the familiar complex conjugation of nonrel- ativistic quantum mechanics of position space wave functions. The basis is a complete set of real wave functions. A useful fact (especially conceptually) is that any antiunitary operator, Ω, is equal to UK for some unitary U. Proof by construction: take U = ΩK. 7. October 14 Notes from Sidney Coleman’s Physics 253a 68
In a more limited sense, the transformation of the states by an antiunitary operator Ω, a Ωa, can also be thought of as a transformation of the operators in the theory. Consider the→ expectation value of a Hermitian operator (observable) in the state a. It transforms as
(a, Aa) (Ωa, AΩa)=(AΩa, Ωa) (hermiticity) → 1 = (ΩΩ− AΩa, Ωa) (invertibility) 1 =(a, Ω− AΩa) (antiunitarity)
This transformation can alternatively be thought of as
1 A Ω− AΩ →
We don’t write Ω†AΩ because adjoint is not even defined for antilinear ops. 50 years ago [1931], Eugene Wigner proved a beautiful theorem telling us why unitary and antiunitary operators are important in QM. He showed that (up to phases) they are the only operators that preserve probabilities. It is not necessary to preserve inner products; they aren’t measurable. It is the probabilities that are measurable. Look in the appendix of his book on group theory. Given that F (a) (F : ) is continuous and for any a and b H → H (F (a), F (b)) 2 = (a, b) 2 | | | | a unitary op then F (a)= eiφ(a) or an (φ : R) × antiunitary op H→ n Time reversal is not a unitary operator. Time reversal is antiunitary. It is easy to see now how the two paradoxes are avoided.
1 Ω− iΩ = i T T − You can’t cancel the i’s as we did in paradox 2.
1 1 Ω− ( iH)Ω = iH Ω− HΩ = H T − T ⇒ T T We can explicitly construct the time reversal operator in free field theory. The simpler thing to look at in a relativistic theory is actually P T . Let’s find
1 Ω such that Ω− φ(x)Ω = φ( x) P T P T P T − The simplest candidate is just complex conjugation, in the momentum state basis. That is Ω does nothing, absolutely nothing to a~ and a† P T k ~k
1 1 Ω− a~ Ω = a~ and Ω− a† Ω = a† P T k P T k P T ~k P T ~k 7. October 14 Notes from Sidney Coleman’s Physics 253a 69
Furthermore we’ll take Ω 0 = 0 and it follows that P T | i | i Ω ~k ,...,~k = ~k ,...,~k P T | 1 ni | 1 ni again nothing, they just lie there. What does this operator do to φ(x)?
3 d k ik x ik x φ(x)= [a~ e− · + a† e · ] (2π)3/2 2ω k ~k Z ~k p 1 Apply ΩP T to φ(x). It does nothing to 3/2 , it does nothing to a~k, and nothing to (2π) √2ω~k a† . ~k But what about that i up in the exponential. It turns that into i! − 3 1 d k ik x ik x Ω− φ(x)ΩP T = [a~ e · + a† e− · ] P T (2π)3/2 2ω k ~k Z ~k = φ( x) ! − p P T does nothing to momentum states. That is expected. Parity turns ~k ~k and time → − reversal changes it back again. 8. October 16 Notes from Sidney Coleman’s Physics 253a 70
8 October 16 Scattering Theory
For a wide class of quantum mechanical systems, the description of any state is simple if you go far enough into the past or far enough into the future. We’ll illustrate this by a sequence of three increasingly complicated systems.
1. NRQM, two particles interacting through a repulsive force, which dies off at large separation.
2 2 pA pB H = + + V ( ~rA ~rB ) V 0 V ( )=0 2mA 2mB | − | ≥ ∞
Any state of the system, a real normalizable state, not a plane wave, looks in the far past / future like two particles far apart (actually a superposition of such states). The potential always pushes the particles far apart for times in the far future, and because the potential dies off they then look like noninteracting particles
A B t
B A t
There is no trace of the interaction in the far future or the far past. The states act like states with dynamics governed by a free Hamiltonian, H0, which is simple,
2 2 pA pB H0 = + 2mA 2mB
The aim of scattering theory is to tell what (superposition of) simple state(s) in the far future a simple state in the far past evolves into. 8. October 16 Notes from Sidney Coleman’s Physics 253a 71
How far in the “far past” you have to go depends on the initial condi- tions and the interaction. If the interaction is the nuclear force and we collide two neutrons at low energy which elastically scatter near t =0, then t = 7 years is far enough in the far past so that the system looks like two− non interacting nucleons. If however the initial con- ditions are set up so that the elastic scattering occurs around t = 1 billion years then you might have to go to t = (1 billion and seven−) years to make the system look simple. − It need not be that the simple Hamiltonian in the far future is the same as the simple Hamiltonian in the far past. 2. NRQM, Three particles A, B, C which interact through attractive interactions which are strong enough to make an AB bound state. Start in the far past with C and the AB bound state. Everything still looks like free particles governed by a free Hamiltonian but the free Hamiltonian in the far past is 2 2 pAB pC H0 = + 2mAB 2mC Now let scattering occur. In the far future, we can get states that look like three particles, A, B, C, non-interacting, governed by the free Hamiltonian 2 2 2 pA pB pC H0 = + + 2mA 2mB 2mC A B c t
A B c t
If you had a sufficiently advanced QM course, you may have studied such a system: + + e + H p + e + e− → There is no way to truncate this system’s full Hamiltonian into a free part and an interacting part, for which the free part describes the evolution of the system in the far future and the far past. If you use the far future H0, you don’t have an AB bound state. 8. October 16 Notes from Sidney Coleman’s Physics 253a 72
3. (A plausible picture of) the real world. In the real world we have loads of (stable) bound states. If the real world has a laboratory bench as a stable bound state, then I can do chalk-bench scattering, and I’ll need a description of a freely flying piece of chalk and a freely flying laboratory bench. The description of states in the far past requires states with free electrons, hydrogen atoms, protons, Iron atoms, Iron nuclei, laboratory benches, chalk, and the associated free Hamiltonians. Let’s get some formalism up. (The first part of this lecture, with lots of words and few equations, is the part of a lecture that makes some people nervous and some people bored.) Let H be the actual Hamiltonian of the world and be the actual Hilbert space of the world. If you go sufficiently far in the past, every stateH in the actual Hilbert space looks simple. Let 0 be the Hilbert space of simple states and let ψ 0. Somewhere in the real worldH Hilbert space there is a state that looks like ψ| iin ∈ the H far past. We’ll label that state ψ in, ψ in . Given another state φ | i, there is another state | i | i ∈ H | i ∈ H0 in the real world Hilbert space that looks like φ in the far future. We’ll label that state φ out, φ out . States in the complicated| i space are labelled by what they look | i | i ∈ H like in the far past or the far future. What we are after in scattering theory is the probability, and hence the amplitude, that a given state looking like ψ in the far past, looks like φ in the far future. We | i | i are after out φ ψ in h | i in in The correspondence between ψ and ψ (for every state ψ 0, there is a ψ that looks like ψ in the far| i past) and| i between φ in and| iφ ∈allows H us to define| i an∈ Hoperator in the simple| i Hilbert space : S, the scattering| i matrix,| i which is defined by H0 φ S ψ out φ ψ in h | | i ≡ h | i An ideal scattering theory would have two parts 1. A turn the crank method of obtaining the “descriptor” states ψ , φ , that is, gener- | i | i ating , from the real world Hamiltonian. We also need H which gives the evolution H0 0 of the descriptor states. H0 evolves the descriptor states without scattering. 2. A turn the crank method of obtaining S. 90% of the rest of this course will be devoted to calculating the matrix elements of S perturbatively. That’s an ideal scattering theory. We want to get calculating so we’ll start with a bargain basement, K-Mart scattering theory. 8. October 16 Notes from Sidney Coleman’s Physics 253a 73
Low Budget Scattering Theory
Imagine that H can be written as H = H0 + f(t)H′, f(t) = 0 for large t , and H0 a free Hamiltonian that evolves states simply, without scattering. Unless the interaction| | is with some externally specified apparatus, interesting Hamiltonians and the real world Hamiltonian are not of this form. We want a simple description of states in the far past / future. Because the interaction is off in the far past / future, the simple descriptor states are simply the states in the full theory far enough in the past / future that f(t) = 0, = . Furthermore, H H0 the Hamiltonian that gives the evolution of the simple states, H0, is just the full Hamiltonian H, far enough in the past / future that f(t) = 0. Most Hamiltonians don’t have an f(t) in them that goes to zero as t . However, | |→∞ many Hamiltonians are of the form H = H0 + H′ where H0 is a Hamiltonian we know the solution of. Maybe we could put an f(t) into the Hamiltonian without changing scattering processes much. We know we can’t do this in system (2). No matter how far you go into the far past / future, it is the interaction that holds the stable AB bound state together, and you can’t shut the interaction off without the bound state falling apart, totally changing scattering processes, no matter how long you wait to shut it off. The real world is like system (2) and we can only get a little ways studying the real world if we hack it up like this. We might get a ways studying system (1) like this. In the far past / future the particles in system (1) are far apart and noninteracting. If f(t) 0 in the far past / future, it should → not affect their evolution since they are not interacting then anyway. Suppose we wanted to insert an f(t) to study a theory of electrons interacting through a repulsive Coulomb force. We can see a flow developing. As a single electron goes off to ∞ away from all others, it still has a Coulomb field (cloud of photons) around it. If you weigh an E2 3 electron, you get a contribution 8π d x to the mass-energy in addition to the contribution to the mass-energy at the heart of the electron. This is one and the same electric field that R causes the scattering to take place, and you can’t turn off scattering without turning off this cloud. Maybe, if we turn the interaction off sufficiently slowly the simple states in the real theory will turn into the states in the free theory with probability 1. We want f(t) to look like: f(t) 1 t
T
f(t) turns on and off adiabatically. A more precise way of stating the condition under which we hope inserting f(t) into the theory won’t change scattering processes much is there must be a 1-1 correspondence between the asymptotic (simple) states of the full Hamiltonian 8. October 16 Notes from Sidney Coleman’s Physics 253a 74
and the states of the free Hamiltonian. That means no bound states, no confinement. We hope scattering processes won’t be changed at all under this assumption in the limit ∆ , T , ∆ 0. The last limit is needed so that edge effects are negligible. We→∞ want → ∞ T → adiabatic turn on and off, but we also want the interaction to be on much longer than the amount of time we spend turning it on and off. Similar requirements must be imposed if you put a system in a spatial box, depending on what kind of quantities you want to know about. In slightly racy language, the electron without its cloud of photons is called a “bare” electron, and with its cloud of photons a “dressed” electron. The scattering process goes like this: In the far far past a bare electron moves freely along. A billion years before it is to interact it leisurely dresses itself. Then it moves along for a long time as a dressed electron, briefly interacts with another (dressed) electron and moves for a long time again, dressed. Then it leisurely undresses. We need to develop Time dependent Perturbation Theory for Hamiltonians of the form
H = H0 + H′(t)
H′(t) may depend on time because of externally varying interactions or because of the insertion of f(t). We’ll do the formalism in the interaction picture developed by Dirac. This is the best formalism for doing time-dependent perturbation theory. If you have laser light shining on an atom, and you know this formalism, it is the most efficient way of calculating what happens to the atom, although it is higher powered than the minimum formalism you need for that problem.
Schr¨odinger picture We have states evolving in time according to d i ψ(t) = H(p , q , t) ψ(t) dt| iS S S | iS for Schr¨odinger since we’ll be working in several pictures |{z} The fundamental operators, pS and qS are time independent
qS = qS(t)= qS(0) pS = pS(t)= pS(0) The only operators that are not time independent are operators that explicitly depend on time. The time evolution operator U(t, t′) is given by
ψ(t) = U(t, t′) ψ(t′) | iS | iS U is completely determined by the 1st order differential equation in t d i U(t, t′)= H(p , q , t)U(t, t′) dt S S 8. October 16 Notes from Sidney Coleman’s Physics 253a 75
and the initial condition U(t, t′) t=t′ = 1. Think of t′ as a parameter. U is unitary (from the | 1 Hermiticity of H), i.e. U(t, t′)† = U(t, t′)− , which expresses the conservation of probability. U also obeys the composition law
U(t, t′)U(t′, t′′)= U(t, t′′)
which implies 1 U(t, t′)= U(t′, t)−
Heisenberg Picture The states do not change with time
ψ(t) = ψ(0) = ψ(0) | iH | iH | iS possible explicit time dependence
If AS (t) is an operator in the Schr¨odinger picture and AH (t) its counterpart in the Heisenbergz}|{ picture, demand that φ(t) A (t) ψ(t) = φ(t) A (t) ψ(t) Sh | S | iS H h | H | iH = φ(0) A (t) ψ(0) S h | H | iS = φ(t) U(0, t)†A (t)U(0, t) ψ(t) S h | H | iS ∴ AH (t)= U(0, t)AS(t)U(0, t)† = U(t, 0)†AS(t)U(t, 0)
Suppose we have some function of operators and the time in the Schr¨odinger picture, itself an operator. For example H itself is a function of pS, qS and t. To get the operator in the Heisenberg picture, all you have to do is replace pS and qS by pH and qH .
Expand H as a power HH (t)= H(pH (t), qH (t), t) series and insert U(t,0)U(t,0)† all over.
Interaction Picture
Assume H can be written as H(p,q,t) = H0(p, q)+ H′(p,q,t). This defines the relation between the functions H, H0 and H′, the arguments of these functions will change. The interaction picture is intermediate between the Schr¨odinger picture and the Heisen- berg picture. You make the transformation you would make to get the Schr¨odinger picture to the Heisenberg picture, but you do it using just the free part of the Hamiltonian only.
If there were no interaction H′, there would iH0(pS,qS)t ψ(t) I = e ψ(t) S be no time evolution of the states. That is | i | i what makes the interaction picture so useful. 8. October 16 Notes from Sidney Coleman’s Physics 253a 76
If H0 depended explicitly on time we would have to define a U0(t, 0) which would take iH0t the place of e− , but we will have no occasion to be that general. If AS(t) is an operator in the Schr¨odinger picture, and AI (t) its counterpart in the interaction picture, we get
iH0(pS,qS)t iH0(pS ,qS)t AI (t)= e AS(t)e− (8.1) by demanding φ(t) A (t) ψ(t) = φ(t) A (t) ψ(t) Sh | S | iS I h | I | iI We can find a differential equation for ψ(t) | iI d d i ψ(t) = i eiH0(pS ,qS)t ψ(t) dt| iI dt | iS = eiH0 (pS,qS)t[ H (p , q )+ H(p , q , t)] ψ(t) − 0 S S S S | iS iH0(pS,qS)t iH0(pS,qS)t = e [H′(pS, qS, t)]e− ψ(t) I ′ | i Expand H (pS,qS, t) in a = H′(pI , qI, t) ψ(t) I power series and insert | i e−iH0teiH0t all over. H (t) ψ(t) As promised, if H′ is zero, no time evolution. ≡ I | iI
In field theory, HI (t) will contain the free fields
iH0t iH0t φ(~x, t)= e φS(~x)e−
That is why all our results about free fields are still going to be useful. We can define U (t, t′) by ψ(t) = U (t, t′) ψ(t′) I | iI I | iI 1 UI†(t, t′)= UI (t, t′)− UI (t, t′)UI (t′, t′′)= UI (t, t′′) 1 iH0(pS,qS)t UI (t, t′)= UI (t′, t)− UI (t, 0) = e U(t, 0)
In a field theory, φI (x) obey free eq of motion + commutation relations. UI (t, t′) can be determined from the first order differential equation in t it satisfies d i U (t, t′)= H (t)U (t, t′) and the initial conditions U (t, t′) ′ =1 dt I I I I |t=t Now we’ll apply interaction picture perturbation theory to scattering theory. In the interaction picture the scattering process looks like this: In the far past the interaction is not felt, both because of f(t) and the fact that these particles are far apart. The states just lie there, although the p’s and q’s are changing. The time of scattering approaches and the state starts changing. After scattering, they stop scattering, like a game of musical chairs, everything freezes again. You want to connect the simple description in the far past to the simple description in the far future. Because of f(t) the simple description in the far past / future is the actual 8. October 16 Notes from Sidney Coleman’s Physics 253a 77
description in the far past / future. The states in the far past and future are their own descriptors (used in arrowed step).
φ S ψ out φ ψ in = φ(0) ψ(0) h | | i ≡ h | i I h | iI = φ( ) U ( , ) ψ( ) I h ∞ | I ∞ −∞ | −∞ iI = φ U ( , ) ψ ⇓ h | I ∞ −∞ | i ∴ S = U ( , ) I ∞ −∞
Our number one priority then is to evaluate UI ( , ). That will cause us to develop Dyson’s formula and Wick’s theorem. Then we’ll apply∞ −∞ this formalism to three models.
Proof that S = U ( , ) in the Schr¨odinger picture. I ∞ −∞ d i ψ in = H ψ in ψ( ) = ψ( ) in dt| i | i | −∞ i | −∞ i d i ψ = H ψ dt| i 0| i out φ ψ in φ S ψ h | i ≡h | | i out φ ψ in = out φ(t) ψ(t) in h | i hany time| willi do = out φ( ) U( , 0)U(0, ) ψ( ) in h ∞ | ∞ −∞ | −∞ i = φ( ) U( , ) ψ( ) h ∞ | ∞ −∞ | −∞ i 1 1 = φ(0) U ( , 0)†− U( , )U (0, )− ψ(0) h | 0 ∞ ∞ −∞ 0 −∞ | i = φ U ( , ) ψ h | I ∞ −∞ | i
[S,H0] = 0 because S turns free states of a given energy into other free states of the same energy.
Dyson’s Formula
We would like to find the solution of the equation d i UI (t, t′)= HI (t)UI (t, t′) UI (t, t′) =1 dt t=t′
Imagine that [HI (t),HI(t′)] = 0, which is not true. Then the solution would be ✭ ✭✭✭✭ ✭✭i✭t dt′′H (t′′) U (t,✭ t✭′)=✭ e− t′ I ✭I✭ R 8. October 16 Notes from Sidney Coleman’s Physics 253a 78
Let’s define a new exponential so this equation is right. Given a string of operators define the time-ordered product T [A (t ) A (t )] 1 1 ··· n n to be the string rearranged so that later operators are to the left of earlier operators, with the operator with the latest time one the leftest. The ambiguity of what to do at equal times does not bother us when the operators commute at equal times. This is certainly the case when all the operators are the same, HI (t), evaluated at various t. T , the symbol for the time ordering operation, is not an operator in Hilbert space. Time ordering is a notation. Now we’ll show that the differential equation for UI (t, t′) is satisfied by
i t dt′′H (t′′) T e− t′ I t > t′ R Under the time ordering symbol everything commutes, so we can naively take a time derivative to get d i t dt′′H (t′′) i t dt′′H (t′′) i T e− t′ I = T H (t)e− t′ I dt R I R Now t is a special time. It is the latest time, so the time ordering puts HI(t) on the leftest, and we can pull it out on the left to get
d i t dt′′H (t′′) i t dt′′H (t′′) i T e− t′ I = H (t)T e− t′ I dt R I R This solution of the differential equation also obeys the boundary condition (any old ordering does that). The solution of a first order differential equation with given initial value is unique. There- fore
i t dt′′H (t′′) U (t, t′)= T e− t′ I t > t′ Dyson’s Formula I R To illustrate what Dyson’s formula means, we’ll look at the second order term in the power series expansion for the exponential.
( i)2 t t − dt1 dt2T (HI (t1)HI (t2)) 2! ′ ′ Zt Zt 8. October 16 Notes from Sidney Coleman’s Physics 253a 79
t 2 t =t t 1
t' t t1
Think of this as an integration over the square. If the time ordering symbol were not there, you would be integrating HI (t1)HI(t2) over the whole square. Because of the time ordering symbol, you get instead twice the integral of HI (t1)HI (t2) over the lower half of the square, the shaded triangle. Our formula is only valid for t > t′. You can easily get the formula for t < t′ by taking the adjoint. We are going to apply Dyson’s formula to three model theories in order of increasing complexity.
1 µ µ2 2 Model 1 = ∂µφ∂ φ φ gρ(x)φ(x) L 2 − 2 − ρ(x) 0 as x in space or time. ρ(x) is a prescribed c-number function of space-time,→ a source→∞ which will create mesons. The equation of motion is
( + µ2)φ(x)= gρ(x) − Electromagnetism with an external source which generates the EM field looks like Aµ = ejµ. Except that our field is massive, has no vector index, and − is a quantum field, these two theories look similar. We’ll call model 1 quantum meso-dynamics. We’ll be able to solve it exactly, which is not a big surprise; in momentum space it is just a bunch of independent forced harmonic oscillators.
1 µ µ2 2 Model 2 = ∂µφ∂ φ φ gρ(~x)φ(x) L 2 − 2 − ρ(~x) 0 as ~x . This is the same as model 1 except the source is static. You might→ think| |→∞ this time independent problem would be easier than model 1, but it isn’t because the source does not turn off as t . We will have to use | |→∞ (and thus gain experience with) our adiabatic turning on and off function, f(t). This is the quantum scalar analog of electrostatics, so we’ll call it “mesostatics.” 8. October 16 Notes from Sidney Coleman’s Physics 253a 80
1 µ µ2 2 µ 2 Model 3 = ∂µφ∂ φ φ + ∂µψ∗∂ φ m ψ∗ψ gψ∗ψφ L 2 − 2 − − The equation of motion for the φ field is
2 ( + µ )φ = gψ∗ψ − This is beginning to look like the real thing. In real electrodynamics, the current jµ is not prescribed, it is the current of charged particles. Here we have the charged field ψ as a source for φ (ψ∗ψ is like a current). The φ field in turn appears in the equation of motion for the ψ field.
( + m2)ψ = gψφ − This theory also looks a lot like Yukawa’s theory of the interaction between mesons and nucleons, except our charged particles are spinless and we only have one meson. We’ll call this “meson-nucleon” theory. Actually, we had better not push this theory too far (we’ll be doing low orders in P.T. only). The classical Hamiltonian contains gψ∗ψ and that is not bounded below for either sign of g.
Wick’s Theorem
When doing perturbative calculations in g in any of these three models, we are going to have to evaluate time ordered products of strings of Hamiltonians between states. In model 1, I = gρ(x)φ(x). At fourth order in g for a meson scattered by the source we would have to evaluateH these are free fields
~k ′ T (φ(x )φ(x )φ(x )φ(x )) ~k h | 1 2 3 4 | i The time ordered product containsz 16 arrangements}| { of creation and annihilation oper- ators, from a~ a~ a~ a~ and a~ a~ a~ a† to a† a† a† a† . If we could rearrange these into k1 k2 k3 k4 k1 k2 k3 ~k4 ~k1 ~k2 ~k3 ~k4 normally ordered products, the only normally ordered product that could contribute would be the one with one creation operator on the left and one annihilation operator on the right, a great simplification. In model 3 we wil have to evaluate time ordered products like
T (φ(x1)ψ∗(x1)ψ(x1)φ(x2)ψ∗(x2)ψ(x2)) If we had an algorithm for normal ordering the time ordered product, we would again have great simplifications when we sandwiched this between states of mesons and nucleons. Wick’s theorem turns time ordered products of free fields into normal ordered products of free fields. To state Wick’s theorem we’ll define the contraction.
A(x)B(y) T (A(x)B(y)) : A(x)B(y): ≡ − 8. October 16 Notes from Sidney Coleman’s Physics 253a 81
Suppose, without loss in generality, really, that x0 >y0. Then
(+) ( ) (+) ( ) (+) ( ) T (A(x)B(y)) = A(x)B(y)=(A + A − )(B + B − ) =: AB : +[A , B − ]
(+) ( ) 0 0 and A(x)B(y) = [A , B − ] which is a c-number. This is also a c# when x < y . So whether x0
Using the defn. ✭ ✭✭✭✭ A(x)B(y)= 0 A(x)B(y) 0 = 0 T (A(x)B(y)) 0 0✭:✭A✭(x✭)B(y): 0 h | | i h | | i − ✭h | | i That’s why the calculation of 0 Tz}|{(φ(x)φ(y)) 0 in the first problem set is going to be h | | i useful. 4 d k ik (x y) i φ(x)φ(y)= 0 T (φ(x)φ(y)) 0 = e± · − h | | i (2π)4 k2 µ2 + iǫ Z − limǫ 0+ is understood. Convince yourself the doesn’t matter. You can also see that → ± 4 d k ik (x y) i ψ(x)ψ∗(y)= ψ∗(x)ψ(y)= e · − (2π)4 k2 m2 + iǫ Z − A little more obvious notation: : A(x) B(y)C(z)D(w): : A(x)C(z): B(y)D(w) and let ≡ φ φa1 (x ), φ φa2 (x ), etc., just for this proof. 1 ≡ 1 2 ≡ 2 Theorem (Gian-Carlo Wick)
term with no contractions all the other n(n−1) 1 possible T (φ1 ...φn)= : φ1 φn : + : φ1φ2 φn :+ 2 − + : φ1φ2φ3φ4 φn : terms with one ··· ··· contraction ··· z }| { . . all the other all possible 1 n(n−1) (n−2)(n−3) 1 terms with : φ1φ2φ3φ4 φn 1φn : if n is even 2 2 2 − + − + < n−1 + ··· possible terms with 2 two contractions contractions : φ1φ2φ3φ4 φn 2φn 1φn : if n is odd . ··· − − . You draw all possible terms with all possible contractions. That you get all that is no surprise. The remarkable and graceful thing about this theorem is that each term occurs with coefficient +1. Proof (By induction) Define the RHS of the expression to be W (φ φ ). We want to 1 ··· n show W = T . Trivial for n =1, 2. Choose without loss of generality x10 x20 xn0. Then ≥ ≥···≥
induction step ( ) (+) (+) T (φ φ )= φ T (φ φ ) = φ W (φ φ )= φ − W + Wφ + [φ , W ] 1 ··· n 1 2 ··· n 1 2 ··· n 1 1 1 z}|{ 8. October 16 Notes from Sidney Coleman’s Physics 253a 82
This expression is normal ordered. The first two terms contain all possible contractions that do not include φ1. The third term contains all possible contractions that do include φ1. Together they contain all possible contractions. Either a contraction includes φ1 or it doesn’t. The right hand side is thus W (φ φ ). 1 ··· n 9. October 21 Notes from Sidney Coleman’s Physics 253a 83
9 October 21 Diagrammatic Perturbation Theory
Dyson’s formula applied to S = U ( , ) is I ∞ −∞ i dtH (t) U ( , )= T e− I I ∞ −∞ R
(Without the use of the time ordering notation, this formula for UI (t, t′) was written down by Dirac 15 years before Dyson wrote it this way, and Dyson says he should not have credit for little more than a change in notation.) From this and Wick’s theorem, which for those of you who really love combinatorics can be written
1 n ∂ ∂ φiφj T (φ φ ) =: e 2 i,j=1 ∂φi ∂φj φ φ : 1 ··· n P 1 ··· n We have enough work done to write down diagrammatic perturbation theory for S = U ( , ). The easiest way to see this is to look at a specific model and a contribution to I ∞ −∞ UI ( , ) at a specific order in g. ∞ −∞ 3 In model 3, = gf(t)ψ∗ψφ, H = d x , HI I HI 4 4 ∗ R i d x I i d xgf(t)ψ ψφ U ( , )= T e− H = T e− I ∞ −∞ R R The contribution at second order in g is
2 ( ig) 4 4 − d x d x f(t )f(t )T (ψ∗ψφ(x )ψ∗ψφ(x )) 2! 1 2 1 2 1 2 Z One of the terms in the expansion of the time ordered product into normal ordered products by Wick’s theorem is
2 ( ig) 4 4 − d x d x f(t )f(t ): ψ∗ψ φ(x )ψ∗ψφ(x ): 2! 1 2 1 2 1 2 Z This term can contribute to a variety of physical processes. The ψ field contains opera- tors that annihilate a “nucleon” and operators that create an anti-“nucleon”. The ψ∗ field contains operators that annihilate an anti-nucleon and create a nucleon. The operator
: ψ∗ψ φ(x1)ψ∗ψφ(x2) :=: ψ∗ψ(x1)ψ∗ψ(x2): φ(x1)φ(x2)
can contribute to N + N N + N. That is to say → final 2 initial 2 : ψ∗ψ(x )ψ∗ψ(x ): hnucleon state| 1 2 |nucleon statei 9. October 21 Notes from Sidney Coleman’s Physics 253a 84
is nonzero because there are terms in the two ψ fields that can annihilate the two nucleons in the initial state and terms in the 2 ψ∗ fields that can then create two nucleons, to give a nonzero matrix element. It can also contribute to N +N N +N and N +N N +N. You → → can see that there is no combination of creation and annihilation operators in this operator that can contribute to N + N N + N. The ψ fields would have to annihilate the nucleons → and the ψ∗ fields cannot create antinucleons. This is good because this process does not conserve the U(1) symmetry charge. However it looks like our operator can contribute to vacuum N + N + N + N, which would be a disaster. The coefficient of that term after → integrating over x1 and x2 had better turn out to be zero. Another term in the expansion of the time ordered product into normal ordered products is 2 ( ig) 4 4 − d x d x f(t )f(t ): ψ∗ ψφ(x )ψ∗ ψφ(x ): 2! 1 2 1 2 1 2 Z This term can contribute to the following 2 2 scattering processes: N + φ N + φ, → → N + φ N + φ, N + N 2φ, 2φ N + N. A single→ term is capable→ of contributing→ to a variety processes because a single field is capable of creating or destroying a particle. The terms in the Wick expansion can be written down in a diagrammatic shorthand according to the following rules. At Nth order in perturbation theory, you start by writing down N interaction vertices and numbering them 1 to N. For model 3 at second order in perturbation theory you write down
This outgoing line is for the ψ∗ in ∗ ψ ψφ(x2). It can be thought of as creating This line is for the φ in a nucleon or annihilating an antinucleon
∗ 1 2
f E f ψ ψφ(x1). It creates E or destroys a meson ∗
This incoming line is for the ψ in ψ ψφ(x2). ¤ ¤ It can be thought of as annihilating a nucleon or creating an antinucleon
The vertex represents the factor of fψ∗ψφ. From the fact that there are two in this ( ig)2 4 4 − diagram you know to include 2! d x1d x2 Contractions are represented by connecting the lines. Any time there is a contraction, join R the lines of the contracted fields. The arrows will always line up, because the contractions for which they don’t are zero. An unarrowed line will never be connected to an arrowed line because that contraction is also zero. Our first term in the expansion of the time ordered product corresponds to the diagram
1 2
¤ f f E
E ¤ 9. October 21 Notes from Sidney Coleman’s Physics 253a 85
The second term corresponds to
1 2
¦ d ¦ ¦ ¦e
The term in the Wick expansion
2 ( ig) 4 4 − d x d x f(t )f(t ): ψ∗ψφ(x )ψ∗ ψφ(x ): 2! 1 2 1 2 1 2 Z
is zero because ψ∗ψ∗ = 0, so we never write down
1 2
F d F ¦ ¦e
Because the arrows always line up, we can shorten
1 2 to 1 2
¦ d ¦ ¦ ¦e ¦ d ¦ e¦
These diagrams are in one-to-one correspondence with terms in the Wick expansion of Dyson’s formula. We’ll call them Wick diagrams. They stand for operators and the vertices are numbered. We are most of the way to Feynman diagrams which stand for matrix elements, but these aren’t them yet. The vertices are numbered in Wick diagrams and
2 1 is distinct from 1 2
¦ d ¦ e¦ ¦ d ¦ e¦
(there are two distinct terms in the Wick expansion) even though after integrating over x1 and x2 these are identical operators. In Feynman diagrams the lines will be labelled by momenta.
On the other hand
¬Æf f ¬Æf 1f 2 is identical to 2 1 9. October 21 Notes from Sidney Coleman’s Physics 253a 86
(there is only one way of contracting all three fields at one vertex with all three at the other). Although these two have been written down to look different they aren’t. Rotate
the right one by 180◦ and you see they are the same.
1 2
f ®f
The contraction this diagram corresponds to is
2 ( ig) 4 4 − d x d x f(t )f(t ): ψ∗ψφ(x )ψ∗ψφ(x ): 2! 1 2 1 2 1 2 Z In model 1 = gρ(x)φ(x) (we don’t have to insert a turning on and off function because HI the interaction goes to zero as x in any direction and in particular in the time direction →∞ in the far past / future). ρ(x) is a prescribed c-number source, so strongly made we don’t have to worry about the back reaction of the field φ on the source. The vertex in this model is
•f
4 That represents ρφ(x). At O(g) in UI we have ( ig) d x1ρφ(x1) which is represented 1 − by . R
• 2 1 2 1 2