
Lectures on Astronomy, Astrophysics, and Cosmology

Luis A. Anchordoqui
Department of Physics and Astronomy, Lehman College, City University of New York, NY 10468, USA
Department of Physics, Graduate Center, City University of New York, 365 Fifth Avenue, NY 10016, USA
Department of Astrophysics, American Museum of Natural History, Central Park West 79 St., NY 10024, USA
(Dated: Spring 2016)

This is a written version of a series of lectures aimed at undergraduate students in astrophysics/particle theory/particle experiment. We summarize the important progress made in recent years towards understanding high energy astrophysical processes and we survey the state of the art regarding the concordance model of cosmology.

I. ACROSS THE UNIVERSE

A look at the night sky provides a strong impression of a changeless universe. We know that clouds drift across the Moon, the sky rotates around the polar star, and on longer times, the Moon itself grows and shrinks and the Moon and planets move against the background of stars. Of course we know that these are merely local phenomena caused by motions within our solar system. Far beyond the planets, the stars appear motionless. Herein we are going to see that this impression of changelessness is illusory.

A. Nani gigantum humeris insidentes

According to the ancient cosmological belief, the stars, except for a few that appeared to move (the planets), were fixed on a sphere beyond the last planet; see Fig. 1. The universe was self-contained and we, here on Earth, were at its center. Our view of the universe dramatically changed after Galileo's first telescopic observations: we no longer place ourselves at the center and we view the universe as vastly larger [1–3].

In the early 1600s, Kepler proposed three laws that described the motion of planets in a Sun-centered solar system [4]. The laws are:

1. Planets orbit the Sun in ellipses, with the Sun in one of the two foci.

2. The line connecting the Sun and a planet sweeps out equal areas in equal times.

3. The harmonic law states that the squared orbital period $T$ of a planet, measured in years, equals the third power of the semi-major axis $a$ of its orbit, measured in astronomical units: $(T/\mathrm{yr})^2 = (a/\mathrm{AU})^3$.

Newton later used the harmonic law to derive the $1/r^2$ dependence of the gravitational force [5]. We will follow the opposite way and discuss how Kepler's laws follow from Newton's law of gravitation. We begin by recalling how a two-body problem can be reduced to a one-body problem in the case of a central force. Denoting the masses and the positions of the two objects by $m_i$ and $\vec r_i$, with $i = 1, 2$, the equations of motion are found to be

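$$ m_1 \ddot{\vec r}_1 = - f(|\vec r_1 - \vec r_2|)\,(\vec r_1 - \vec r_2) , \tag{1} $$

and

$$ m_2 \ddot{\vec r}_2 = + f(|\vec r_1 - \vec r_2|)\,(\vec r_1 - \vec r_2) . \tag{2} $$

Adding (1) and (2), the two forces cancel and the total momentum is conserved. In other words, the center-of-mass (c.m.) of the system,

$$ \vec R = \frac{m_1 \vec r_1 + m_2 \vec r_2}{m_1 + m_2} , \tag{3} $$

moves freely. Now, multiplying (1) by $m_2$ and (2) by $m_1$ and subtracting the two equations we obtain

$$ \mu \ddot{\vec r} = - f(r)\, \vec r , \tag{4} $$

where $\vec r = \vec r_1 - \vec r_2$, $r = |\vec r|$, and

$$ \mu = \frac{m_1 m_2}{m_1 + m_2} . \tag{5} $$

FIG. 1: Celestial spheres of ancient cosmology.

arXiv:0706.1988v3 [physics.ed-ph] 15 Jun 2016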

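We can then solve a one-body problem for the reduced mass $\mu$ moving at the distance $r = |\vec r_1 - \vec r_2|$ in the gravitational field of the mass $M = m_1 + m_2$.

We can now derive the second law (a.k.a. the area law). Consider the movement of a body under the influence of a central force (4). Since $\vec r \times \vec r = 0$, taking the cross product of $\vec r$ with (4) leads to

$$ \mu\, \vec r \times \ddot{\vec r} = 0 , \tag{6} $$

which already looks similar to a conservation law. Since

$$ \frac{d}{dt} (\vec r \times \dot{\vec r}) = \dot{\vec r} \times \dot{\vec r} + \vec r \times \ddot{\vec r} , \tag{7} $$

the first term on the right-hand side is zero and we obtain the conservation of angular momentum $\vec L = \mu\, \vec r \times \dot{\vec r}$ for the motion in a central potential

$$ \mu\, \vec r \times \ddot{\vec r} = \frac{d}{dt} (\mu\, \vec r \times \dot{\vec r}) = \frac{d \vec L}{dt} = 0 . \tag{8} $$

There are two immediate consequences: First, the motion is always in the plane perpendicular to $\vec L$. Second, the area swept out by the vector $\vec r$ in the time $dt$ is

$$ dA = \frac{1}{2} |\vec r \times \vec v\, dt| = \frac{1}{2\mu} |\vec L|\, dt , \tag{9} $$

so the rate $dA/dt$ is also constant.

We now turn to demonstrate the first law. We introduce the unit vector $\hat r = \vec r/r$ and rewrite the definition of the angular momentum $\vec L$ as

$$ \vec L = \mu\, \vec r \times \dot{\vec r} = \mu r \hat r \times \frac{d}{dt}(r \hat r) = \mu r \hat r \times \left( \dot r \hat r + r \frac{d\hat r}{dt} \right) = \mu r^2\, \hat r \times \frac{d \hat r}{dt} . \tag{10} $$

The first term in the parenthesis does not contribute, because $\hat r \times \hat r = 0$. Next we take the cross product of the gravitational acceleration,

$$ \vec a = - \frac{GM}{r^2}\, \hat r , \tag{11} $$

with the angular momentum

$$ \vec a \times \vec L = - \frac{GM}{r^2}\, \hat r \times \left( \mu r^2\, \hat r \times \frac{d \hat r}{dt} \right) = - GM\mu\; \hat r \times \left( \hat r \times \frac{d \hat r}{dt} \right) , \tag{12} $$

where $G = 6.674 \times 10^{-11}\ \mathrm{N\, m^2\, kg^{-2}}$ [6]. The identity from vector analysis, $\vec A \times (\vec B \times \vec C) = (\vec A \cdot \vec C) \vec B - (\vec A \cdot \vec B) \vec C$, leads to

$$ \vec a \times \vec L = - GM\mu \left[ \left( \hat r \cdot \frac{d \hat r}{dt} \right) \hat r - (\hat r \cdot \hat r)\, \frac{d \hat r}{dt} \right] . \tag{13} $$

Since $\hat r$ is a unit vector, we have $\hat r \cdot \hat r = 1$ and $d(\hat r \cdot \hat r)/dt = 0$, so that $\hat r \cdot d\hat r/dt = 0$ and hence

$$ \vec a \times \vec L = GM\mu\, \frac{d \hat r}{dt} . \tag{14} $$

Since $\vec L$ and $GM\mu$ are constant, we can write this as

$$ \frac{d}{dt} (\vec v \times \vec L) = \frac{d}{dt} (GM\mu\, \hat r) . \tag{15} $$

Integration of (15) leads to

$$ \vec v \times \vec L = GM\mu\, \hat r + \vec C , \tag{16} $$

where the integration constant $\vec C$ is a constant vector. Taking now the dot product with $\vec r$, we have

$$ \vec r \cdot (\vec v \times \vec L) = GM\mu\, r\, \hat r \cdot \hat r + \vec r \cdot \vec C . \tag{17} $$

Applying next the identity $\vec A \cdot (\vec B \times \vec C) = (\vec A \times \vec B) \cdot \vec C$, it follows that

$$ (\vec r \times \vec v) \cdot \vec L = GM\mu\, r + r C \cos\vartheta = GM\mu\, r \left( 1 + \frac{C \cos\vartheta}{GM\mu} \right) , \tag{18} $$

where $\vartheta$ is the angle between $\vec r$ and $\vec C$. Expressing $\vec r \times \vec v$ as $\vec L/\mu$, defining $e = C/(GM\mu)$ and solving for $r$, we finally obtain the equation for a conic section, which is Kepler's first law:

$$ r = \frac{L^2/\mu^2}{GM (1 + e \cos\vartheta)} . \tag{19} $$

Using (A3) we obtain the angular momentum

$$ L = \mu \sqrt{GMa(1 - e^2)} . \tag{20} $$

To obtain the harmonic law we integrate the second law in the form of (9) over one orbital period $T$,

$$ A = \pi a b = \frac{L\, T}{2\mu} . \tag{21} $$

Squaring and solving for $T$, it follows that

$$ T^2 = 4\pi^2\, \frac{(a b \mu)^2}{L^2} . \tag{22} $$

Using (A1) and (20) for the angular momentum $L$, we obtain Kepler's harmonic law,

$$ T^2 = \frac{4\pi^2}{G(m_1 + m_2)}\, a^3 . \tag{23} $$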
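As a quick numerical check of the harmonic law (23), the following minimal Python sketch evaluates the orbital period for a body at 1 AU from the Sun and compares it with the solar-system shortcut $(T/\mathrm{yr})^2 = (a/\mathrm{AU})^3$. The function names and the values adopted for the solar mass and the astronomical unit are illustrative assumptions, not taken from the lectures.

```python
import math

G = 6.674e-11      # gravitational constant in N m^2 kg^-2, as quoted after Eq. (12)
M_SUN = 1.989e30   # solar mass in kg (assumed value)
AU = 1.496e11      # astronomical unit in m (assumed value)
YEAR = 3.156e7     # seconds per year

def orbital_period(a_m, m1, m2=0.0):
    """Orbital period in seconds from Eq. (23): T^2 = 4 pi^2 a^3 / (G (m1 + m2))."""
    return 2.0 * math.pi * math.sqrt(a_m**3 / (G * (m1 + m2)))

def period_years_from_au(a_au):
    """Shortcut (T/yr)^2 = (a/AU)^3, valid when the orbiting mass is negligible."""
    return a_au ** 1.5

# Earth-like orbit: both routes should give a period close to one year.
t_earth = orbital_period(1.0 * AU, M_SUN) / YEAR
print(f"Eq. (23): {t_earth:.3f} yr, shortcut: {period_years_from_au(1.0):.3f} yr")
```

The same two functions can be reused for any semi-major axis; the shortcut only holds for bodies orbiting the Sun whose mass is negligible compared with the solar mass.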
EXERCISE 1.1 The planet Neptune, the most distant gas giant from the Sun, orbits with a semimajor axis a = 30.066 AU and an eccentricity e = 0.01. Pluto, the next large world out from the Sun (though much smaller than Neptune), orbits with a = 39.48 AU and e = 0.250. (i) To the correct number of significant figures, given the precision of the data in this exercise, how many years does it take Neptune to orbit the Sun? (ii) How many years does it take Pluto to orbit the Sun? (iii) Take the ratio of the two orbital periods you calculated in parts (i) and (ii). You will see that it is very close to the ratio of two small integers; which integers are these? Thus the two planets regularly come close to one another, in the same part of their orbits, which allows them to have a maximum gravitational influence on each other's orbits. This is an example of an orbital resonance (other examples in the solar system can be found among the moons of Jupiter, and between the moons and various features of the rings of Saturn). (iv) What is the aphelion distance of Neptune's orbit? Express your answer in AU. (v) What are the perihelion and aphelion distances of Pluto's orbit? Is Pluto always farther from the Sun than Neptune?

EXERCISE 1.2 A satellite in geosynchronous orbit (GEO) orbits the Earth once every day. A satellite in geostationary orbit (GSO) is a satellite in a circular GEO in the Earth's equatorial plane. Therefore, from the point of view of an observer on Earth's surface, a satellite in GSO seems always to hover at the same point in the sky. For example, the satellites used for satellite TV are in GSO so that satellite dishes can be stationary and need not track their motion through the sky. Take a look; you will notice all satellite dishes on people's houses point towards the Equator, that is South. How far above Earth's equator (i.e., above the Earth's surface) is a satellite in GSO? Express your answer in kilometers, and in Earth radii.

EXERCISE 1.3 The space station Mir traveled 3.6 billion kilometers during its life. Its circular orbit was 200 km above the surface of the Earth. (i) How many years was it in orbit? (ii) How many times did Mir circle the Earth per day (i.e., 24 hours)? (iii) Can you put a satellite into such an orbit that it circles the Earth 20 times per day?

The astronomical distances are so large that we specify them in terms of the time it takes light to travel a given distance. For example, one light second = $3 \times 10^8$ m = 300,000 km, one light minute = $1.8 \times 10^7$ km, and one light year is

$$ 1\ \mathrm{ly} = 9.46 \times 10^{15}\ \mathrm{m} \approx 10^{13}\ \mathrm{km} . \tag{24} $$

For specifying distances to the Sun and the Moon, we usually use meters or kilometers, but we could specify them in terms of light. The Earth-Moon distance is 384,000 km, which is 1.28 ls. The Earth-Sun distance is 150,000,000 km; this is equal to 8.3 lm. Far out in the solar system, Pluto is about $6 \times 10^9$ km from the Sun, or $6 \times 10^{-4}$ ly. The nearest star to us, Proxima Centauri, is about 4.2 ly away. Therefore, the nearest star is about 10,000 times farther from us than the outer reach of the solar system.

B. Stars and galaxies

On clear moonless nights, thousands of stars with varying degrees of brightness can be seen, as well as the long cloudy strip known as the Milky Way. Galileo first observed with his telescope that the Milky Way is composed of countless individual stars. A half century later Wright suggested that the Milky Way was a flat disc of stars extending to great distances in a plane, which we call the Galaxy [7].

Our Galaxy has a diameter of 100,000 ly and a thickness of roughly 2,000 ly. It has a bulging central nucleus and spiral arms. Our Sun, which seems to be just another star, is located about half way from the Galactic center to the edge, some 26,000 ly from the center. The Sun orbits the Galactic center approximately once every 250 million years, so its speed is

$$ v = \frac{2\pi \times 26{,}000 \times 10^{13}\ \mathrm{km}}{2.5 \times 10^{8}\ \mathrm{yr} \times 3.156 \times 10^{7}\ \mathrm{s/yr}} \approx 200\ \mathrm{km/s} . \tag{25} $$

The total mass of all the stars in the Galaxy can be estimated using the orbital data of the Sun about the center of the Galaxy. To do so, assume that most of the mass is concentrated near the center of the Galaxy and that the Sun and the solar system (of total mass $m$) move in a circular orbit around the center of the Galaxy (of total mass $M$),

$$ \frac{GMm}{r^2} = m\, \frac{v^2}{r} , \tag{26} $$

where $a = v^2/r$ is the centripetal acceleration. All in all,

$$ M = \frac{r\, v^2}{G} \approx 2 \times 10^{41}\ \mathrm{kg} . \tag{27} $$

Assuming all the stars in the Galaxy are similar to our Sun ($M_\odot \approx 2 \times 10^{30}$ kg), we conclude that there are roughly $10^{11}$ stars in the Galaxy.

In addition to stars both within and outside the Milky Way, we can see with a telescope many faint cloudy patches in the sky which were once all referred to as nebulae (Latin for clouds). A few of these, such as those in the constellations of Andromeda and Orion, can actually be discerned with the naked eye on a clear night. In the XVII and XVIII centuries, astronomers found that these objects were getting in the way of the search for comets. In 1781, in order to provide a convenient list of objects not to look at while hunting for comets, Messier published a celebrated catalogue [8]. Nowadays astronomers still refer to the 103 objects in this catalog by their Messier numbers, e.g., the Andromeda galaxy is M31.

Even in Messier's time it was clear that these extended objects are not all the same. Some are star clusters, groups of stars which are so numerous that they appear to be a cloud. Others are glowing clouds of gas or dust and it is for these that we now mainly reserve the word nebula. Most fascinating are those that belong to a third category: they often have fairly regular elliptical shapes and seem to be a great distance beyond the Galaxy. Kant seems to have been the first to suggest that these latter might be circular discs, but appear elliptical because we see them at an angle, and are faint because they are so distant [9].
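The estimates (25) and (27) for the Sun's orbital speed and the mass of the Galaxy can be reproduced in a few lines. The sketch below is only an order-of-magnitude check using the rounded inputs quoted in the text (26,000 ly, 250 million years, 1 ly ≈ 10^13 km); the variable names are illustrative assumptions.

```python
import math

G = 6.674e-11     # gravitational constant, N m^2 kg^-2
LY_KM = 1.0e13    # kilometers per light year, rounded as in Eq. (24)
YEAR_S = 3.156e7  # seconds per year, as in Eq. (25)
M_SUN = 2.0e30    # solar mass in kg, rounded as in the text

# Inputs quoted in the text for the Sun's orbit around the Galactic center.
r_km = 26_000 * LY_KM       # orbital radius in km
period_s = 2.5e8 * YEAR_S   # orbital period in s

# Eq. (25): circumference divided by the period.
v_km_s = 2.0 * math.pi * r_km / period_s

# Eq. (27): M = r v^2 / G, with r and v converted to SI units.
mass_kg = (r_km * 1e3) * (v_km_s * 1e3) ** 2 / G

# Number of Sun-like stars that would add up to this mass.
n_stars = mass_kg / M_SUN

print(f"v ~ {v_km_s:.0f} km/s, M ~ {mass_kg:.1e} kg, N ~ {n_stars:.1e} stars")
```

With these rounded inputs the speed comes out slightly above 200 km/s and the mass slightly below $2 \times 10^{41}$ kg, consistent with the single significant figure quoted in (25) and (27).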
At first it was not universally accepted that these objects were extragalactic (i.e. outside our Galaxy). The very large telescopes constructed in the XX century revealed that individual stars could be resolved within these extragalactic objects and that many contain spiral arms. Hubble did much of this observational work in the 1920's using the 2.5 m telescope on Mt. Wilson near Los Angeles, California. Hubble demonstrated that these objects were indeed extragalactic because of their great distances [10]. The distance to our nearest large galaxy, Andromeda, is over 2 million ly, a distance 20 times greater than the diameter of our Galaxy. It seemed logical that these nebulae must be galaxies similar to ours. Today it is thought that there are roughly $4 \times 10^{10}$ galaxies in the observable universe; that is, as many galaxies as there are stars in the Galaxy.

II. DISTANCE MEASUREMENTS

We have been talking about the vast distance of the objects in the universe. We now turn to discuss different methods to estimate these distances.

A. Stellar parallax

One basic method to measure distances to nearby stars employs simple geometry and stellar parallax. Parallax is the apparent displacement of an object because of a change in the observer's point of view. One way to see how this effect works is to hold your hand out in front of you and look at it with your left eye closed, then your right eye closed. Your hand will appear to move against the background. By stellar parallax we mean the apparent motion of a star against the background of more distant stars, due to Earth's motion around the Sun; see Fig. 2. The sighting angle of a star relative to the plane of Earth's orbit (usually indicated by $\theta$) can be determined at two different times of the year separated by six months. Since we know the distance $d$ from the Earth to the Sun, we can determine the distance $D$ to the star. For example, if the angle $\theta$ of a given star is measured to be $89.99994^\circ$, the parallax angle is $p \equiv \phi = 0.00006^\circ \approx 1 \times 10^{-6}$ rad. From trigonometry, $\tan\phi = d/D$, and since the distance to the Sun is $d = 1.5 \times 10^8$ km the distance to the star is

$$ D = \frac{d}{\tan\phi} \approx \frac{d}{\phi} = \frac{1.5 \times 10^{8}\ \mathrm{km}}{1 \times 10^{-6}} = 1.5 \times 10^{14}\ \mathrm{km} , \tag{28} $$

or about 15 ly.

FIG. 2: The parallax method of measuring a star's distance.

Distances to stars are often specified in terms of parallax angles given in seconds of arc: 1 second (1'') is 1/60 of a minute (1') of arc, which is 1/60 of a degree, so 1'' = 1/3600 of a degree. The distance is then specified in parsecs (meaning parallax angle in seconds of arc), where the parsec is defined as $1/\phi$ with $\phi$ in seconds. For example, if $\phi = 6 \times 10^{-5\,\circ}$, we would say the star is at a distance $D = 4.5$ pc.

The angular resolution of the Hubble Space Telescope (HST) is about 1/20 arcsec. With HST one can measure parallaxes of about 2 milli arc seconds (e.g., 1223 Sgr). This corresponds to a distance of about 500 pc. Besides, there are stars with radio emission for which observations from the Very Long Baseline Array (VLBA) allow accurate parallax measurements beyond 500 pc. For example, parallax measurements of Sco X-1 are $0.36 \pm 0.04$ milli arc seconds, which puts it at a distance of 2.8 kpc. Parallax can be used to determine the distance to stars as far away as about 3 kpc from Earth. Beyond that distance, parallax angles are too small to measure and more subtle techniques must be employed.

EXERCISE 2.1 One of the first people to make a very accurate measurement of the circumference of the Earth was Eratosthenes, a Greek philosopher who lived in Alexandria around 250 B.C. He was told that on a certain day during the summer (June 21) in a town called Syene, which was 4900 stadia (1 stadium = 0.16 kilometers) to the south of Alexandria, the sunlight shone directly down the well shafts so that you could see all the way to the bottom. Eratosthenes knew that the sun was never quite high enough in the sky to see the bottom of wells in Alexandria and he was able to calculate that in fact it was about 7 degrees too low. Knowing that the sun was 7 degrees lower at its highpoint in Alexandria than in Syene and assuming that the sun's rays were parallel when they hit the Earth, Eratosthenes was able to calculate the circumference of the Earth using a simple proportion: C/4900 stadia = 360 degrees / 7 degrees. This gives an answer of 252,000 stadia or 40,320 km, which is very close to today's measurements of 40,030 km. Assume the Earth is flat and determine the parallax angle that can explain this phenomenon. Are the results consistent with the hypothesis that the Earth is flat?

B. Stellar luminosity

In 1900, Planck found empirically the distribution

$$ B_\nu\, d\nu = \frac{2 h \nu^3}{c^2} \left[ \exp\!\left( \frac{h\nu}{kT} \right) - 1 \right]^{-1} d\nu , \tag{29} $$

describing the amount of energy emitted into the frequency interval $[\nu, \nu + d\nu]$ and the solid angle $d\Omega$ per unit time and area by a body in thermal equilibrium [11]. The intrinsic (or surface) brightness $B_\nu$ depends only on the temperature $T$ of the blackbody (apart from the natural constants $k$, $c$ and $h$). The dimension of $B_\nu$ in the cgs system of units is

$$ [B_\nu] = \frac{\mathrm{erg}}{\mathrm{Hz\ cm^2\ s\ sr}} . \tag{30} $$

In general the amount of energy per frequency interval $[\nu, \nu + d\nu]$ and solid angle $d\Omega$ crossing the perpendicular area $A_\perp$ per time is called the specific (or differential) intensity [12]

$$ I_\nu = \frac{dE}{d\nu\, d\Omega\, dA_\perp\, dt} ; \tag{31} $$

see Fig. 3. For the special case of the blackbody radiation, the specific intensity at the emission surface is given by the Planck distribution, $I_\nu = B_\nu$. Stars are fairly good approximations of blackbodies.

FIG. 3: Left: A detector with surface element $dA$ on Earth measuring radiation coming from a direction with zenith angle $\vartheta$. Right: An imaginary detector of area $dA$ on the surface of a star measuring radiation emitted in the direction $\vartheta$ [16].

Integrating (29) over all frequencies and possible solid angles gives the emitted flux $F$ per surface area $A$. The angular integral consists of the solid angle $d\Omega = \sin\vartheta\, d\vartheta\, d\varphi$ and the factor $\cos\vartheta$ taking into account that only the perpendicular area $A_\perp = A \cos\vartheta$ is visible [13]. The flux emitted by a star is found to be

$$ F = \pi \int_0^\infty d\nu\, B_\nu = \frac{2\pi}{c^2 h^3}\, (kT)^4 \int_0^\infty \frac{x^3\, dx}{e^x - 1} = \sigma T^4 , \tag{32} $$

where $x = h\nu/(kT)$,

$$ \sigma = \frac{2 \pi^5 k^4}{15\, c^2 h^3} = 5.670 \times 10^{-5}\ \frac{\mathrm{erg}}{\mathrm{cm^2\ K^4\ s}} \tag{33} $$

is the Stefan-Boltzmann constant [14, 15], and where we used $\int_0^\infty x^3 [e^x - 1]^{-1} dx = \pi^4/15$.

A useful parameter for a star or galaxy is its luminosity. The total luminosity $L$ of a star is given by the product of its surface area and the radiation emitted per area

$$ L = 4\pi R^2 \sigma T^4 . \tag{34} $$

Careful analyses of nearby stars have shown that the absolute luminosity for most of the stars depends on the mass: the more massive the star, the greater the luminosity.

Consider an optically thick spherical source of radius $R$, with constant intensity along the surface, say a star. An observer at a distance $r$ sees the spherical source as a disk of angular radius $\vartheta_c$, with $\sin\vartheta_c = R/r$. Note that since the source is optically thick the observer only sees the surface of the sphere. Because the intensity is constant over the surface there is a symmetry along the $\varphi$ direction such that the solid angle is given by $d\Omega = 2\pi \sin\vartheta\, d\vartheta$. By looking at Fig. 3 it is straightforward to see that the flux observed at $r$ is given by

$$ F(r) = \int I \cos\vartheta\, d\Omega = 2\pi I \int_0^{\vartheta_c} \sin\vartheta \cos\vartheta\, d\vartheta = \pi I \cos^2\vartheta\, \Big|_{\vartheta_c}^{0} = \pi I \sin^2\vartheta_c = \pi I (R/r)^2 . \tag{35} $$

At the surface of the star $R = r$ and we recover (32). Very far away, $r \gg R$, and (35) yields $F = \pi \vartheta_c^2 I = I\, \Omega_{\rm source}$; see Appendix B. The validity of the inverse-square law $F \propto 1/r^2$ at a distance $r > R$ outside of the star relies on the assumptions that no radiation is absorbed and that relativistic effects can be neglected. The latter condition requires in particular that the relative velocity of observer and source is small compared to the speed of light. All in all, the total (integrated) flux at the surface of the Earth from a given astronomical object with total luminosity $L$ is found to be

$$ F_{\rm observed\, @\, Earth} \equiv \mathcal{F} = \frac{L}{4\pi d_L^2} , \tag{36} $$

where $d_L$ is the distance to the object.

Another important parameter of a star is its surface temperature, which can be determined from the spectrum of electromagnetic frequencies it emits. The wavelength at the peak of the spectrum, $\lambda_{\rm max}$, is related to the temperature by Wien's displacement law [17]

$$ \lambda_{\rm max} T = 2.9 \times 10^{-3}\ \mathrm{m\ K} . \tag{37} $$

We can now use Wien's law and the Stefan-Boltzmann equation (power output or luminosity $\propto A T^4$) to determine the temperature and the relative size of a star. Suppose that the distance from Earth to two nearby stars can be reasonably estimated, and that their apparent luminosities suggest the two stars have about the same absolute luminosity, $L$. The spectrum of one of the stars peaks at about 700 nm (so it is reddish). The spectrum of the other peaks at about 350 nm (bluish). Using Wien's law, the temperature of the reddish star is $T_r \simeq 4140$ K. The temperature of the bluish star will be double because its peak wavelength is half, $T_b \simeq 8280$ K. The power radiated per unit of area from a star is proportional to the fourth power of the Kelvin temperature (34). Now the temperature of the bluish star is double that of the reddish star, so the bluish star must radiate 16 times as much energy per unit area. But we are given that they have the same luminosity, so the surface area of the blue star must be 1/16 that of the red one. Since the surface area is $4\pi R^2$, we conclude that the radius of the reddish star is 4 times larger than the radius of the bluish star (and its volume 64 times larger) [18].

An important astronomical discovery, made around 1900, was that for most of the stars, the color is related to the absolute luminosity and therefore to the mass. A useful way to present this relationship is by the so-called Hertzsprung-Russell (HR) diagram [19]. On the HR diagram, the horizontal axis shows the temperature $T$ and the vertical axis the luminosity $L$; each star is represented by a point on the diagram shown in Fig. 4. Most of the stars fall along the diagonal band termed the main sequence. Starting at the lowest right, we find the coolest stars, reddish in color; they are the least luminous and therefore low in mass. Further up towards the left we find hotter and more luminous stars that are whitish like our Sun. Still farther up we find more massive and more luminous stars, bluish in color. There are also stars that fall outside the main sequence. Above and to the right we find extremely large stars, with high luminosity but with low (reddish) color temperature: these are called red giants. At the lower left, there are a few stars of low luminosity but with high temperature: these are white dwarfs.

Suppose that a detailed study of a certain star suggests that it most likely fits on the main sequence of the HR diagram. The observed flux is $\mathcal{F} = 1 \times 10^{-12}\ \mathrm{W\, m^{-2}}$, and the peak wavelength of its spectrum is $\lambda_{\rm max} \approx 600$ nm. We can first find the temperature using Wien's law and then estimate the absolute luminosity using the HR diagram; namely, $T \approx 4800$ K. A star on the main sequence of the HR diagram at this temperature has absolute luminosity of about $L \approx 10^{26}$ W. Then, using (36) we can estimate its distance from us, $d_L = 3 \times 10^{18}$ m or equivalently 300 ly.

EXERCISE 2.2 About 1350 J of energy strikes the atmosphere of the Earth from the Sun per second per square meter of area at right angle to the Sun's rays. What is (i) the observed flux $\mathcal{F}_\odot$ from the Sun and (ii) its absolute luminosity $L_\odot$? (iii) What is the average Solar flux density measured at Mars? (iv) If the approximate efficiency of the solar panels (with area of 1.3 m$^2$) on the Martian rover Spirit is 20%, then how many Watts could the fully illuminated panels generate?

EXERCISE 2.3 Suppose the MESSENGER spacecraft, while orbiting Mercury, decided to communicate with the Cassini probe, now exploring Saturn and its moons. When Mercury is closest to Saturn in their orbits, it takes 76.3 minutes for the radio signals from Mercury to reach Saturn. A little more than half a mercurian year later, when the 2 planets are furthest apart in their orbits, it takes 82.7 minutes. (i) What is the distance between Mercury and the Sun? Give answers in both light-minutes and astronomical units. Assume that the planets have circular orbits. (ii) What is the distance between Saturn and the Sun?

EXERCISE 2.4 The photometric method to search for extrasolar planets is based on the detection of stellar brightness variations, which result from the transit of a planet across a star's disk. If a planet passes in front of a star, the star will be partially eclipsed and its light will be dimmed. Determine the reduction in the apparent brightness $I$ when Jupiter passes in front of the Sun.

EXERCISE 2.5 The angular resolution of a telescope (or other optical system) is a measure of the smallest details which can be seen. Because of the distorting effects of earth's atmosphere, the best angular resolution which can be achieved by optical telescopes from earth's surface is normally about 1 arcsec. This is why much clearer images can be obtained from space. The angular resolution of the HST is about 0.05 arcsec, and the smallest angle that can be measured accurately with HST is actually a fraction of one resolution element. (i) Cepheid variable stars are very important distance indicators because they have large and well-known luminosities. What is the distance of a Cepheid variable star whose parallax angle is measured to be $0.005 \pm 0.001$ arcsec? (ii) The faintest stars that can be detected with the HST have apparent brightnesses which are $4 \times 10^{21}$ times fainter than the Sun. How far away could a star like the Sun be, and still be detected with the HST? Express your answer in light years. (iii) How far away could a Cepheid variable with 20,000 times the luminosity of the Sun be, and still be detected with the HST? Express your answer in light years.

EXERCISE 2.6 The discovery of the dwarf planet Eris in 2005 threw the astronomical community into a tizzy and made international headlines; it is slightly larger than Pluto and brought up interesting questions about what the definition of a planet is. Eventually, this resulted in the controversial demotion of Pluto


FIG. 4: HR diagram. The vertical axis depicts the inherent brightness of a star, and the horizontal axis the surface temperature increasing from right to left [20].

from the 9th planet of the Solar System to just one of a number of dwarf planets. Throughout, assume that Eris is spherical and is observed at opposition (i.e., the Earth lies on the straight line connecting the Sun and Eris). (i) In five hours, Eris is observed to move 7.5 arcseconds relative to the background stars as seen from Earth. Because Eris is much further from the Sun than is the Earth, it is moving quite a bit slower around the Sun than the Earth, so this apparent motion on the sky is essentially entirely parallax due to the Earth's motion. Calculate the speed with which the Earth goes around the Sun, in kilometers/second. Use this information and the small-angle formula to calculate the distance from the Earth to Eris. Express your result in AU. Compare with the semi-major axis of Pluto's orbit (which you will need to look up). (ii) Eris shines in two ways: from its reflected light from the Sun (which will be mostly visible light), and from its blackbody radiation from absorbed sunlight (which will mostly come out as infrared light). The albedo of Eris (i.e., the fraction of the sunlight incident on Eris that is reflected) is very high, about 85%. This suggests that Eris is covered by a layer of shiny ice; spectroscopy tells us that the ice is composed of frozen methane, CH$_4$. Derive an expression for the brightness of Eris which depends on its distance from the Sun $d$, and its radius $r$. First, calculate the amount of sunlight reflected by Eris per unit time (i.e., its luminosity in reflected light); express your answer in terms of $d$, $r$, the albedo $a$, and the luminosity of the Sun $L_\odot$. (iii) We detect only a tiny fraction of this light reflected by Eris. Calculate the brightness, via the inverse square law, of Eris as perceived here on Earth. (iv) The measured brightness of Eris is $2.4 \times 10^{-16}$ Joules meters$^{-2}$ second$^{-1}$. Use this information to determine the radius $r$ of Eris. This is what led to the controversy of what a planet is: if Pluto is considered a planet, then certainly Eris should be as well. We further elaborate about this controversy in the solution of the exercise. (v) Calculate the angular size of Eris (i.e., the angle the diameter of Eris makes on the sky). Compare this to the resolution of the HST; will you be able to resolve Eris (i.e., will it look like a point of light or a finite-size object in a telescope)? (vi) You go ahead and observe Eris with Hubble, and find that it has a moon orbiting it. Observations with Hubble show that this moon (called Dysnomia) makes an almost circular orbit around Eris with a period of 15.8 Earth days. The semi-major axis of the orbit subtends an angle of 0.53'' as seen from Earth. Calculate the semi-major axis in kilometers, and calculate the mass of Eris in kilograms. Compare with the mass of Pluto ($1.3 \times 10^{22}$ kg). Is Eris more massive?
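The two worked examples of the stellar luminosity discussion (the comparison of a reddish and a bluish star of equal luminosity, and the distance estimate for a main-sequence star) can be reproduced with Wien's law (37), the scaling $L \propto R^2 T^4$ of Eq. (34), and the inverse-square law (36). The Python sketch below is a minimal illustration; the function names are hypothetical, and the luminosity $L \approx 10^{26}$ W is the value read off the HR diagram in the text, not something the code derives.

```python
import math

WIEN_B = 2.9e-3   # Wien displacement constant in m K, Eq. (37)
LY_M = 9.46e15    # meters per light year, Eq. (24)

def wien_temperature(lambda_max_m):
    """Surface temperature in K from the peak wavelength, Eq. (37)."""
    return WIEN_B / lambda_max_m

def radius_ratio_equal_luminosity(t_a, t_b):
    """R_a / R_b for two stars of equal luminosity; Eq. (34) gives R proportional to T^-2."""
    return (t_b / t_a) ** 2

def luminosity_distance_m(luminosity_w, flux_w_m2):
    """Distance d_L in m from Eq. (36): F = L / (4 pi d_L^2)."""
    return math.sqrt(luminosity_w / (4.0 * math.pi * flux_w_m2))

# Reddish (700 nm) versus bluish (350 nm) star of equal luminosity.
t_red = wien_temperature(700e-9)
t_blue = wien_temperature(350e-9)
r_ratio = radius_ratio_equal_luminosity(t_red, t_blue)
print(f"T_r ~ {t_red:.0f} K, T_b ~ {t_blue:.0f} K, R_r/R_b = {r_ratio:.1f}")

# Main-sequence star with peak wavelength 600 nm and observed flux 1e-12 W/m^2.
t_star = wien_temperature(600e-9)
d_star = luminosity_distance_m(1e26, 1e-12)   # L taken from the HR diagram
print(f"T ~ {t_star:.0f} K, d_L ~ {d_star:.1e} m ~ {d_star / LY_M:.0f} ly")
```

The temperatures differ from the rounded values in the text only in the last digit, and the distance agrees with the quoted $3 \times 10^{18}$ m (about 300 ly) to one significant figure.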

EXERCISE 2.7 A perfect blackbody at temperature $T$ has the shape of an oblate ellipsoid, its surface being given by the equation
$$\frac{x^2}{a^2} + \frac{y^2}{a^2} + \frac{z^2}{b^2} = 1 , \tag{38}$$
with $a > b$. (i) Is the luminosity of the blackbody isotropic? Why? (ii) Consider an observer at a distance $d_L$ from the blackbody, with $d_L \gg a$. What is the direction of the observer for which the maximum amount of flux will be observed (keeping the distance $d_L$ fixed)? Calculate what this maximum flux is. (iii) Repeat the same exercise for the direction for which the minimum flux will be observed, for fixed $d_L$. (iv) If the two observers who see the maximum and minimum flux from distance $d_L$ can resolve the blackbody, what is the apparent brightness, $I$, that each one will measure? (v) Write down an expression for the total luminosity emitted by the blackbody as a function of $a$, $b$ and $T$. (vi) Now, consider a galaxy with a perfectly oblate shape, which contains only a large number $N$ of stars, and no gas or dust. To make it simple, assume that all stars have radius $R$ and surface temperature $T$. Answer again the questions (i-v) for the galaxy, assuming $NR^2 \ll ab$. Are there any differences from the case of a blackbody? Explain why. (vii) Imagine that there were a very compact galaxy that did not obey the condition $NR^2 \ll ab$. Would the answer to the previous question be modified? Do you think such a galaxy could be stable?

EXERCISE 2.8 The HR diagram is usually plotted in logarithmic coordinates ($\log L$ vs. $\log T$, with the temperature increasing to the left). Derive the slope of a line of constant radius in the logarithmic HR diagram.

III.

There is observational evidence that stars move at speeds ranging up to a few hundred kilometers per second, so in a year a fast moving star might travel $\sim 10^{10}$ km. This is $10^{3}$ times less than the distance to the closest star, so their apparent position in the sky changes very slowly. For example, the relatively fast moving star known as Barnard's star is at a distance of about $56 \times 10^{12}$ km; it moves across the line of sight at about 89 km/s, and in consequence its apparent position shifts (so-called “proper motion”) in one year by an angle of 0.0029 degrees. The HST has measured proper motions as low as about 1 milli arc second per year. In the radio (VLBA), relative motions can be measured to an accuracy of about 0.2 milli arc second per year. The apparent position in the sky of the more distant stars changes so slowly that their proper motion cannot be detected with even the most patient observation. However, the rate of approach or recession of a luminous body in the line of sight can be measured much more accurately than its motion at right angles to the line of sight. The technique makes use of a familiar property of any sort of wave motion, known as the Doppler effect [21].

When we observe a sound or light wave from a source at rest, the time between the arrival of wave crests at our instruments is the same as the time between crests as they leave the source. However, if the source is moving away from us, the time between arrivals of successive wave crests is increased over the time between their departures from the source, because each crest has a little farther to go on its journey to us than the crest before. The time between crests is just the wavelength divided by the speed of the wave, so a wave sent out by a source moving away from us will appear to have a longer wavelength than if the source were at rest. Likewise, if the source is moving toward us, the time between arrivals of the wave crests is decreased because each successive crest has a shorter distance to go, and the waves appear to have a shorter wavelength. A nice analogy was put forward by Weinberg [22]. He compared the situation with a travelling man who has to send a letter home regularly once a week during his travels: while he is travelling away from home, each successive letter will have a little farther to go than the one before, so his letters will arrive a little more than a week apart; on the homeward leg of his journey, each successive letter will have a shorter distance to travel, so they will arrive more frequently than once a week.

The Doppler effect began to be of enormous importance to astronomy in 1868, when it was applied to the study of individual spectral lines. In 1815, Fraunhofer first realized that when light from the Sun is allowed to pass through a slit and then through a glass prism, the resulting spectrum of colors is crossed with hundreds of dark lines, each one an image of the slit [23]. The dark lines were always found at the same colors, each corresponding to a definite wavelength of light. The same dark spectral lines were also found in the same position in the spectrum of the Moon and brighter stars. It was soon realized that these dark lines are produced by the selective absorption of light of certain definite wavelengths, as light passes from the hot surface of a star through its cooler outer atmosphere. Each line is due to absorption of light by a specific chemical element, so it became possible to determine that the elements on the Sun, such as sodium, iron, magnesium, calcium, and chromium, are the same as those found on Earth.

In 1868, Sir William Huggins was able to show that the dark lines in the spectra of some of the brighter stars are shifted slightly to the red or the blue from their normal position in the spectrum of the Sun [24]. He correctly interpreted this as a Doppler shift, due to the motion of the star away from or toward the Earth. For example, the wavelength of every dark line in the spectrum of the star Capella is longer than the wavelength of the corresponding dark line in the spectrum of the Sun by 0.01%; this shift to the red indicates that Capella is receding from us at 0.01% of $c$ (i.e., the radial velocity of Capella is about 30 km/s).
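As a quick numerical check of the Capella example, the following minimal sketch converts a fractional wavelength shift into a line-of-sight velocity using the non-relativistic relation $v \approx c\,\Delta\lambda/\lambda$. The function name and the sign convention (positive for recession) are illustrative choices, not part of the original notes.

```python
C_KM_S = 299_792.458  # speed of light in km/s


def radial_velocity_km_s(fractional_shift):
    """Non-relativistic Doppler estimate v ~ c * (delta_lambda / lambda).

    A positive fractional shift (redshift) gives a positive, i.e. receding,
    velocity; a negative shift (blueshift) gives an approaching one.
    Only valid for |v| << c.
    """
    return C_KM_S * fractional_shift


# Capella: every dark line is longer by 0.01%, i.e. a fractional shift of 1e-4
v_capella = radial_velocity_km_s(1.0e-4)
print(f"Capella recedes at about {v_capella:.0f} km/s")
```

For shifts that are not small compared with unity the relativistic formula should be used instead of this linear estimate.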

FIG. 5: A source of light waves moving to the right, relative to observers in the $S$ frame, with velocity $v$. The frequency is higher for observers on the right, and lower for observers on the left [25].

Consider two inertial frames, $S$ and $S'$, moving with relative velocity $v$ as shown in Fig. 5. Assume a light source (e.g. a star) at rest in $S'$ emits light of frequency $\nu'$ at an angle $\theta'$ with respect to the observer $O'$. Let
$$p^\mu = \left(\frac{h\nu}{c},\, -\frac{h\nu}{c}\cos\theta,\, -\frac{h\nu}{c}\sin\theta,\, 0\right) \tag{39}$$
be the momentum 4-vector for the photon as seen in $S$ and
$$p'^{\mu} = \left(\frac{h\nu'}{c},\, -\frac{h\nu'}{c}\cos\theta',\, -\frac{h\nu'}{c}\sin\theta',\, 0\right) \tag{40}$$
in $S'$. To get the 4-momentum relation from $S' \to S$, apply the inverse Lorentz transformation [26]
$$\frac{h\nu}{c} = \gamma\left[\frac{h\nu'}{c} - \beta\,\frac{h\nu'}{c}\cos\theta'\right], \qquad -\frac{h\nu}{c}\cos\theta = \gamma\left[-\frac{h\nu'}{c}\cos\theta' + \beta\,\frac{h\nu'}{c}\right], \qquad -\frac{h\nu}{c}\sin\theta = -\frac{h\nu'}{c}\sin\theta' . \tag{41}$$
The first expression gives
$$\nu = \nu'\gamma\,(1 - \beta\cos\theta') , \tag{42}$$
which is the relativistic Doppler formula.

For observational astronomy (42) is not useful because both $\nu'$ and $\theta'$ refer to the star's frame, not that of the observer. Apply instead the direct Lorentz transformation $S \to S'$ to the photon energy to obtain
$$\nu' = \gamma\nu\,(1 + \beta\cos\theta) . \tag{43}$$
This equation gives $\nu'$ in terms of quantities measured by the observer. It is sometimes written in terms of wavelengths: $\lambda = \lambda'\gamma(1 + \beta\cos\theta)$. (For details see e.g. [27].)

EXERCISE 3.1 Consider the inertial frames $S$ and $S'$ shown in Fig. 5. Use the inverse Lorentz transformation to show that the relation between angles is given by
$$\cos\theta = \frac{\beta - \cos\theta'}{\beta\cos\theta' - 1} . \tag{44}$$

There are three special cases: (i) $\theta' = 0$, which gives
$$\nu = \nu'\sqrt{(1-\beta)/(1+\beta)} . \tag{45}$$
In the non-relativistic limit we have $\nu = \nu'(1-\beta)$. This corresponds to a source moving away from the observer. Note that $\theta = 0$. (ii) $\theta' = \pi$, which gives
$$\nu = \nu'\sqrt{(1+\beta)/(1-\beta)} . \tag{46}$$
Here the source is moving towards the observer. Note that $\theta = \pi$. (iii) $\theta' = \pi/2$, which gives
$$\nu = \nu'\gamma . \tag{47}$$
This last is the transverse Doppler effect, a second order relativistic effect. It can be thought of as arising from the dilation of time in the moving frame.

EXERCISE 3.2 Suppose light is emitted isotropically in a star's rest frame $S'$, i.e. $dN/d\Omega' = \kappa$, where $dN$ is the number of photons in the solid angle $d\Omega'$ and $\kappa$ is a constant. What is the angular distribution in the inertial frame $S$?

EXERCISE 3.3 Show that for $v \ll c$, the Doppler shift in wavelength is
$$\frac{\lambda' - \lambda}{\lambda} \approx \frac{v}{c} . \tag{48}$$
To avoid confusion, it should be kept in mind that $\lambda$ denotes the wavelength of the light if observed near the place and time of emission, and thus presumably takes the values measured when the same atomic transition occurs in terrestrial laboratories, while $\lambda'$ is the wavelength of the light observed after its long journey to us. If $\lambda' - \lambda > 0$ then $\lambda' > \lambda$ and we speak of a redshift; if $\lambda' - \lambda < 0$ then $\lambda' < \lambda$, and we speak of a blueshift.

EXERCISE 3.4 Through some coincidence, the Balmer lines from singly ionized helium in a distant star happen to overlap with the Balmer lines from hydrogen in the Sun. How fast is that star receding from us? [Hint: the wavelengths from single-electron energy level transitions are inversely proportional to the square of the atomic number of the nucleus.]

EXERCISE 3.5 Stellar aberration is the apparent motion of a star due to rotation of the Earth about the Sun. Consider an incoming photon from a star with 4-momentum $p^\mu$. Let $S$ be the Sun's frame and $S'$ the Earth frame moving with velocity $v$ as shown in Fig. 6. Define the angle of aberration $\alpha$ by $\theta' = \theta - \alpha$ and show that $\alpha \approx \beta\sin\theta$.

EXERCISE 3.6 HD 209458 is a star in the constellation Pegasus very similar to our Sun ($M = 1.1 M_\odot$ and $R = 1.1 R_\odot$), located at a distance of about 150 ly. In 1999, two teams working independently discovered an extrasolar

planet orbiting the star using the so-called radial velocity planet search method [28, 29]. Note that a star with a planet must move in its own small orbit in response to the planet's gravity. This leads to variations in the speed with which the star moves toward or away from Earth, i.e. the variations are in the radial velocity of the star with respect to Earth. The radial velocity can be deduced from the displacement in the parent star's spectral lines due to the Doppler shift. If a planet orbits the star, one should have a periodic change in that rate, except for the extreme case in which the plane of the orbit is perpendicular to our line of sight. Herein we assume that the motions of the Earth relative to the Sun have already been taken into account, as well as any long-term steady change of distance between the star and the Sun, which appears as a median line for the periodic variation in radial velocity due to the star's wobble caused by the orbiting planet. The observed Doppler shift velocity of HD 209458 is found to be $K = V \sin i = 82.7 \pm 1.3$ m/s, where $i = 87.1^\circ \pm 0.2^\circ$ is the inclination of the planet's orbit to the line perpendicular to the line-of-sight [30]. Soon after the discovery, separate teams were able to detect a transit of the planet across the surface of the star, making it the first known transiting extrasolar planet [31, 32]. The planet received the designation HD 209458b. Because the planet transits the star, the star is dimmed by about 2% every $3.52447 \pm 0.00029$ days. Tests allowing for a non-circular Keplerian orbit for HD 209458 resulted in an eccentricity indistinguishable from zero: $e = 0.016 \pm 0.018$. Consider the simplest case of a nearly circular orbit and find: (i) the distance from the planet to the star; (ii) the mass $m$ of the planet; (iii) the radius $r$ of the planet.

FIG. 6: Schematic representation of stellar aberration [25].

IV.

The stars appear unchanging. Night after night the heavens reveal no significant variations. Indeed, on human time scales, the vast majority of stars change very little. Consequently, we cannot follow any but the tiniest part of the life cycle of any given star since they live for ages vastly greater than ours. Nonetheless, herein we will follow the process of stellar evolution from the birth to the death of a star, as we have theoretically reconstructed it.

A. Stellar nucleosynthesis

There is a general consensus that stars are born when gaseous clouds (mostly hydrogen) contract due to the pull of gravity. A huge gas cloud might fragment into numerous contracting masses, each mass centered in an area where the density is only slightly greater than at nearby points. Once such globules formed, gravity would cause each to contract in towards its center-of-mass. As the particles of such a protostar accelerate inward, their kinetic energy increases. When the kinetic energy is sufficiently high, the repulsion between the positive charges is not strong enough to keep hydrogen nuclei apart, and nuclear fusion can take place. In a star like our Sun, the “burning” of hydrogen occurs when four protons fuse to form a helium nucleus, with the release of $\gamma$ rays, positrons and neutrinos.¹

¹ The word “burn” is put in quotation marks because these high-temperature fusion reactions occur via a nuclear process, and must not be confused with ordinary burning in air, which is a chemical reaction, occurring at the atomic level (and at a much lower temperature).

The energy output of our Sun is believed to be due principally to the following sequence of fusion reactions:
$${}^{1}_{1}\mathrm{H} + {}^{1}_{1}\mathrm{H} \to {}^{2}_{1}\mathrm{H} + e^{+} + \nu_e \quad (0.42\ \mathrm{MeV}) , \tag{49}$$
$${}^{1}_{1}\mathrm{H} + {}^{2}_{1}\mathrm{H} \to {}^{3}_{2}\mathrm{He} + \gamma \quad (5.49\ \mathrm{MeV}) , \tag{50}$$
and
$${}^{3}_{2}\mathrm{He} + {}^{3}_{2}\mathrm{He} \to {}^{4}_{2}\mathrm{He} + {}^{1}_{1}\mathrm{H} + {}^{1}_{1}\mathrm{H} \quad (12.86\ \mathrm{MeV}) , \tag{51}$$
where the energy released for each reaction (given in parentheses) equals the difference in mass (times $c^2$) between the initial and final states. This released energy is carried off by the outgoing particles. The net effect of this sequence, which is called the pp-cycle, is for four protons to combine to form one ${}^{4}_{2}\mathrm{He}$ nucleus, plus two positrons, two neutrinos, and two gamma rays:
$$4\,{}^{1}_{1}\mathrm{H} \to {}^{4}_{2}\mathrm{He} + 2e^{+} + 2\nu_e + 2\gamma . \tag{52}$$
Note that it takes two of each of the first two reactions to produce the two ${}^{3}_{2}\mathrm{He}$ for the third reaction. So the total energy released for the net reaction is 24.7 MeV. However, each of the two $e^{+}$ quickly annihilates with an electron to produce $2m_e c^2 = 1.02$ MeV; so the total energy released is 26.7 MeV. The first reaction, the formation of deuterium from two protons, has very low probability, and the infrequency of that reaction serves to limit the rate at which the Sun produces energy. These reactions require a temperature of about $10^{7}$ K, corresponding to an average kinetic energy ($kT$) of 1 keV.

EXERCISE 4.1 Approximately $10^{38}$ neutrinos are produced by the pp chain in the Sun every second. Calculate the number of neutrinos from the Sun that are passing through your brain every second.

In more massive stars, it is more likely that the energy output comes principally from the carbon (or CNO) cycle, which comprises the following sequence of reactions:
$${}^{12}_{6}\mathrm{C} + {}^{1}_{1}\mathrm{H} \to {}^{13}_{7}\mathrm{N} + \gamma , \tag{53}$$
$${}^{13}_{7}\mathrm{N} \to {}^{13}_{6}\mathrm{C} + e^{+} + \nu , \tag{54}$$
$${}^{13}_{6}\mathrm{C} + {}^{1}_{1}\mathrm{H} \to {}^{14}_{7}\mathrm{N} + \gamma , \tag{55}$$
$${}^{14}_{7}\mathrm{N} + {}^{1}_{1}\mathrm{H} \to {}^{15}_{8}\mathrm{O} + \gamma , \tag{56}$$
$${}^{15}_{8}\mathrm{O} \to {}^{15}_{7}\mathrm{N} + e^{+} + \nu , \tag{57}$$
$${}^{15}_{7}\mathrm{N} + {}^{1}_{1}\mathrm{H} \to {}^{12}_{6}\mathrm{C} + {}^{4}_{2}\mathrm{He} . \tag{58}$$
It is easily seen that no carbon is consumed in this cycle (see first and last equations) and that the net effect is the same as the pp cycle. The theory of the pp cycle and the carbon cycle as the source of energy for the Sun and the stars was first worked out by Bethe in 1939 [33].

The fusion reactions take place primarily in the core of the star, where $T$ is sufficiently high. (The surface temperature is of course much lower, on the order of a few thousand K.) The tremendous release of energy in these fusion reactions produces an outward pressure sufficient to halt the inward gravitational contraction; and our protostar, now really a young star, stabilizes in the main sequence.

To a good approximation the stellar structure on the main sequence can be described by a spherically symmetric system in hydrostatic equilibrium. This requires that rotation, convection, magnetic fields, and other effects that break rotational symmetry have only a minor influence on the star. This assumption is in most cases very well justified.

We denote by $M(r)$ the mass enclosed inside a sphere with radius $r$ and density $\rho(r)$,
$$M(r) = 4\pi \int_0^r dr'\, r'^2 \rho(r') \tag{59}$$
or
$$\frac{dM(r)}{dr} = 4\pi r^2 \rho(r) . \tag{60}$$
An important application of (60) is to express physical quantities not as functions of the radius $r$ but of the enclosed mass $M(r)$. This facilitates the computation of the stellar properties as functions of time, because the mass of a star remains nearly constant during its evolution, while the stellar radius can change considerably.

According to Gauss's law, a radially symmetric mass distribution $M(r)$ produces the same gravitational acceleration as if it were concentrated at the center $r = 0$. Therefore the gravitational acceleration produced by $M(r)$ is
$$g(r) = -\frac{GM(r)}{r^2} . \tag{61}$$
If the star is in equilibrium, this acceleration is balanced by a pressure gradient from the center of the star to its surface. Since pressure is defined as force per area, $P = F/A$, a pressure change along the distance $dr$ corresponds to an increment
$$dF = dA\,P - (P + dP)\,dA = \underbrace{-\,dA\,dP}_{\text{force}} = \underbrace{\rho(r)\,dA\,dr}_{\text{mass}}\ \underbrace{a(r)}_{\text{acceleration}} \tag{62}$$
of the force $F$ produced by the pressure gradient $dP$, where $a(r)$ is the acceleration this force imparts to the layer. For increasing $r$, the gradient $dP < 0$ and the resulting force $dF$ is positive and therefore directed outward. Hydrostatic equilibrium, $g(r) = -a(r)$, then requires
$$\frac{dP}{dr} = \rho(r)\,g(r) = -\frac{GM(r)\,\rho(r)}{r^2} . \tag{63}$$
If the pressure gradient and gravity do not balance each other, the layer at position $r$ is accelerated inward at the rate
$$-\ddot{r} = \frac{GM(r)}{r^2} + \frac{1}{\rho(r)}\frac{dP}{dr} . \tag{64}$$
In general, we need an equation of state, $P = P(\rho, T, Y_i)$, that connects the pressure $P$ with the density $\rho$, the (not yet) known temperature $T$ and the chemical composition $Y_i$ of the star. For an estimate of the central pressure $P_c = P(0)$ of a star in hydrostatic equilibrium, we integrate (63) and obtain with $P(R) \approx 0$,
$$P_c = -\int_0^R \frac{dP}{dr}\,dr = G\int_0^M dM\,\frac{M}{4\pi r^4} , \tag{65}$$
where we used the continuity equation (60) to substitute $dr$ by $dM/(4\pi r^2\rho)$. If we furthermore replace $r$ by the stellar radius $R \geq r$, we obtain a lower limit for the central pressure,
$$P_c = G\int_0^M dM\,\frac{M}{4\pi r^4} > G\int_0^M dM\,\frac{M}{4\pi R^4} = \frac{GM^2}{8\pi R^4} . \tag{66}$$

Inserting values for the Sun, it follows
$$P_c > \frac{GM^2}{8\pi R^4} = 4 \times 10^{8}\ \mathrm{bar}\,\left(\frac{M}{M_\odot}\right)^{2}\left(\frac{R_\odot}{R}\right)^{4} . \tag{67}$$
The value obtained integrating the hydrostatic equation using the “solar standard model” is $P_c = 2.48 \times 10^{11}$ bar, i.e. a factor 500 larger.

EXERCISE 4.2 Calculate the central pressure $P_c$ of a star in hydrostatic equilibrium as a function of its mass $M$ and radius $R$ for (i) a constant mass density, $\rho(r) = \rho_0$ and (ii) a linearly decreasing mass density, $\rho(r) = \rho_c[1 - (r/R)]$.

Exactly where the star falls along the main sequence depends on its mass. The more massive the star, the further up (and to the left) it falls in the HR diagram. To reach the main sequence requires perhaps 30 million years and the star is expected to remain there 10 billion years ($10^{10}$ yr). Although most stars are billions of years old, there is evidence that stars are actually being born at this moment in the .

As hydrogen fuses to form helium, the helium that is formed is denser and tends to accumulate in the central core where it was formed. As the core of helium grows, hydrogen continues to fuse in a shell around it. When much of the hydrogen within the core has been consumed, the production of energy decreases at the center and is no longer sufficient to prevent the huge gravitational forces from once again causing the core to contract and heat up. The hydrogen in the shell around the core then fuses even more fiercely because of the rise in temperature, causing the outer envelope of the star to expand and to cool. The surface temperature, thus reduced, produces a spectrum of light that peaks at longer wavelength (reddish). By this time the star has left the main sequence. It has become redder, and as it has grown in size, it has become more luminous. Therefore, it will have moved to the right and upward on the HR diagram. As it moves upward, it enters the red giant stage. This model then explains the origin of red giants as a natural step in stellar evolution. Our Sun, for example, has been on the main sequence for about four and a half billion years. It will probably remain there another 4 or 5 billion years. When our Sun leaves the main sequence, it is expected to grow in size (as it becomes a red giant) until it occupies all the volume out to roughly the present orbit of the planet Mercury.

If the star is like our Sun, or larger, further fusion can occur. As the star's outer envelope expands, its core is shrinking and heating up. When the temperature reaches about $10^{8}$ K, even helium nuclei, in spite of their greater charge and hence greater electrical repulsion, can then reach each other and undergo fusion:
$${}^{4}_{2}\mathrm{He} + {}^{4}_{2}\mathrm{He} \to {}^{8}_{4}\mathrm{Be} + \gamma \quad (-91.8\ \mathrm{keV}) . \tag{68}$$
Once beryllium-8 is produced a little faster than it decays (half-life is $6.7 \times 10^{-17}$ s), the number of beryllium-8 nuclei in the stellar core increases to a large number. Then in its core there will be many beryllium-8 nuclei that can fuse with another helium nucleus to form carbon-12, which is stable:
$${}^{4}_{2}\mathrm{He} + {}^{8}_{4}\mathrm{Be} \to {}^{12}_{6}\mathrm{C} + \gamma \quad (7.367\ \mathrm{MeV}) . \tag{69}$$
The net energy release of the triple-$\alpha$ process is 7.273 MeV. Further fusion reactions are possible, with ${}^{4}_{2}\mathrm{He}$ fusing with ${}^{12}_{6}\mathrm{C}$ to form ${}^{16}_{8}\mathrm{O}$. Stars spend approximately a few thousand to 1 billion years as a red giant. Eventually, the helium in the core runs out and fusion stops. Stars with $0.4 M_\odot < M < 4 M_\odot$ are fated to end up as spheres of carbon and oxygen. Only stars with $M > 4 M_\odot$ become hot enough for fusion of carbon and oxygen to occur, so that higher $Z$ elements like ${}^{20}_{10}\mathrm{Ne}$ or ${}^{24}_{12}\mathrm{Mg}$ can be made.

As massive ($M > 8 M_\odot$) red supergiants age, they produce “onion layers” of heavier and heavier elements in their interiors. A star of this mass can contract under gravity and heat up even further ($T = 5 \times 10^{9}$ K), producing nuclei as heavy as ${}^{56}_{26}\mathrm{Fe}$ and ${}^{56}_{28}\mathrm{Ni}$. However, the average binding energy per nucleon begins to decrease beyond the iron group of isotopes. Thus, the formation of heavy nuclei from lighter ones by fusion ends at the iron group. Further fusion would require energy, rather than release it. As a consequence, a core of iron builds up in the centers of massive supergiants.

A star's lifetime as a giant or supergiant is shorter than its main sequence lifetime (about 1/10 as long). As the star's core becomes hotter, and the fusion reactions powering it become less efficient, each new fusion fuel is used up in a shorter time. For example, the stages in the life of a $25 M_\odot$ star are as follows: hydrogen fusion lasts 7 million years, helium fusion lasts 500,000 years, carbon fusion lasts 600 years, neon fusion lasts 1 year, oxygen fusion lasts 6 months, and silicon fusion lasts 1 day. The star's core is now pure iron. The process of creating heavier nuclei from lighter ones, or by absorption of neutrons at higher $Z$ (more on this below), is called nucleosynthesis.

B. White dwarfs and Chandrasekhar limit

At a distance of 2.6 pc Sirius is the fifth closest stellar system to the Sun. It is the brightest star in the Earth's night sky. Analyzing the motions of Sirius from 1833 to 1844, Bessel concluded that it had an unseen companion, with an orbital period $T \sim 50$ yr [53]. In 1862, Clark discovered this companion, Sirius B, at the time of maximal separation of the two components of the binary system (i.e. at apastron) [54]. Complementary follow-up observations showed that the mass of Sirius B equals approximately that of the Sun, $M \approx M_\odot$. Sirius B's peculiar properties were not established until the next apastron by Adams [55]. He noted that its high temperature ($T \simeq 25{,}000$ K) together with its small luminosity ($L = 3.84 \times 10^{26}$ W) require an extremely small radius and thus a large density. From the Stefan-Boltzmann law we have
$$\frac{R}{R_\odot} = \left(\frac{L}{L_\odot}\right)^{1/2}\left(\frac{T_\odot}{T}\right)^{2} \approx 10^{-2} . \tag{70}$$
Hence, the mean density of Sirius B is a factor $10^{6}$ higher than that of the Sun; more precisely, $\rho = 2 \times 10^{6}$ g/cm$^3$. A lower limit for the central pressure of Sirius B follows from (67)
$$P_c > \frac{GM^2}{8\pi R^4} = 4 \times 10^{16}\ \mathrm{bar} . \tag{71}$$
Assuming the pressure is dominated by an ideal gas the central temperature is found to be
$$T_c = \frac{P_c}{nk} \sim 10^{2}\, T_{c,\odot} \approx 10^{9}\ \mathrm{K} . \tag{72}$$
For such a high $T_c$, the temperature gradient $dT/dr$ in Sirius B would be a factor $10^{4}$ larger than in the Sun. This would in turn require a larger luminosity and a larger energy production rate than that of main sequence stars.

Stars like Sirius B are called white dwarfs. They have very long cooling times, because of their small surface luminosity. This type of star is rather numerous. The mass density of main-sequence stars in the solar neighborhood is $0.04 M_\odot/\mathrm{pc}^3$ compared to $0.015 M_\odot/\mathrm{pc}^3$ in white dwarfs. The typical mass of white dwarfs lies in the range $0.4 \lesssim M/M_\odot \lesssim 1$, peaking at $0.6 M_\odot$. No further fusion energy can be obtained inside a white dwarf. The star loses internal energy by radiation, decreasing in temperature and becoming dimmer until its light goes out. For a classical gas, $P = nkT$, and thus in the limit of zero temperature, the pressure inside a star also goes to zero. How can a star be stabilized after the fusion processes and thus energy production have stopped? The solution to this puzzle is that the main source of pressure in such compact stars has a different origin.

According to Pauli's exclusion principle no two fermions can occupy the same quantum state [56]. In statistical mechanics, Heisenberg's uncertainty principle $\Delta x \Delta p \geq \hbar$ [57] together with Pauli's principle imply that each phase-space volume, $\hbar^{-1}\, dx\, dp$, can only be occupied by one fermionic state.

A (relativistic or non-relativistic) particle in a box of volume $L^3$ collides per time interval $\Delta t = L/v_x$ once with the $yz$-side of the box, if the $x$ component of its velocity is $v_x$. Thereby it exerts the force $F_x = \Delta p_x/\Delta t = p_x v_x/L$. The pressure produced by $N$ particles is then $P = F/A = N p_x v_x/(LA) = n p_x v_x$. For an isotropic distribution, with $\langle v^2 \rangle = \langle v_x^2 \rangle + \langle v_y^2 \rangle + \langle v_z^2 \rangle = 3\langle v_x^2 \rangle$, we have
$$P = \tfrac{1}{3}\, n v p . \tag{73}$$
Now, if we take $\Delta x = n^{-1/3}$ and $\Delta p \approx \hbar/\Delta x \approx \hbar n^{1/3}$, combined with the non-relativistic expression $v = p/m$, the pressure of a degenerate fermion gas is found to be
$$P \approx n v p \approx \frac{\hbar^2 n^{5/3}}{m} . \tag{74}$$
(74) implies $P \propto \rho^{5/3}$, where $\rho$ is the density. For relativistic particles, we can obtain an estimate for the pressure inserting $v = c$,
$$P \approx n c p \approx c\hbar\, n^{4/3} , \tag{75}$$
which implies $P \propto \rho^{4/3}$. It may be worth noting at this juncture that (i) both the non-relativistic and the relativistic pressure laws are polytropic equations of state, $P = K\rho^{\gamma}$; (ii) a non-relativistic degenerate Fermi gas has the same adiabatic index ($\gamma = 5/3$) as an ideal gas, whereas a relativistic degenerate Fermi gas has the same adiabatic index ($\gamma = 4/3$) as radiation; (iii) in the non-relativistic limit the pressure is inversely proportional to the fermion mass, $P \propto 1/m$, and so for non-relativistic systems the degeneracy will first become important for electrons.

EXERCISE 4.3 Estimate the average energy of electrons in Sirius B from the equation of state for a non-relativistic degenerate fermion gas,
$$P = \frac{(3\pi^2)^{2/3}}{5}\,\frac{\hbar^2}{m}\, n^{5/3} , \tag{76}$$
and calculate the Lorentz factor of the electrons. Give a short qualitative statement about the validity of the non-relativistic equation of state for white dwarfs with the density of Sirius B and beyond.

Next, we compute the pressure of a degenerate non-relativistic electron gas inside Sirius B and check if it is consistent with the lower limit for the central pressure derived in (71). The only bit of information needed is the value of $n_e$, which can be written in terms of the density of the star, the atomic mass of the ions making up the star, and the number of protons in the ions (assuming the star is neutral):
$$n_e = \frac{\rho}{\mu_e m_p} \tag{77}$$
where $\mu_e \equiv A/Z$ is the average number of nucleons per free electron. For metal-poor stars $\mu_e = 2$, and so from (74) we obtain
$$P \approx \frac{\hbar^2 n_e^{5/3}}{m_e} \approx \frac{(1.05 \times 10^{-27}\ \mathrm{erg\,s})^2}{9.11 \times 10^{-28}\ \mathrm{g}} \left(\frac{10^{6}\ \mathrm{g/cm^3}}{2 \times 1.67 \times 10^{-24}\ \mathrm{g}}\right)^{5/3} \approx 10^{23}\ \mathrm{dyn/cm^2} . \tag{78}$$
Since $10^{6}\ \mathrm{dyn/cm^2} = 1$ bar, we have $P = 10^{17}$ bar, which is consistent with the lower limit derived in (71).

We can now relate the mass of the star to its radius by combining the lower limit on the central pressure $P_c \sim GM^2/R^4$ and the polytropic equation of state $P = K\rho^{5/3} \sim K(M/R^3)^{5/3} = KM^{5/3}/R^5$. It follows that
$$\frac{GM^2}{R^4} = \frac{KM^{5/3}}{R^5} , \tag{79}$$
or equivalently
$$R = \frac{K}{G}\, M^{(10-12)/6} = \frac{K}{G M^{1/3}} . \tag{80}$$
If the small differences in chemical composition can be neglected, then there is a unique relation between the mass and the radius of white dwarfs. Since the star's radius decreases with increasing mass, there must be a maximal mass allowed.

To derive this maximal mass we first assume the pressure can be described by a non-relativistic degenerate Fermi gas. The total kinetic energy of the star is $U_{\rm kin} = N p^2/(2m_e)$, where $n \sim N/R^3$ and $p \sim \hbar n^{1/3}$. Thus
$$U_{\rm kin} \sim N\,\frac{\hbar^2 n^{2/3}}{2m_e} \sim \frac{\hbar^2 N^{(3+2)/3}}{2 m_e R^2} = \frac{\hbar^2 N^{5/3}}{2 m_e R^2} . \tag{81}$$
For the gravitational potential energy, we use the approximation $U_{\rm pot} = -\alpha GM^2/R$, with $\alpha = 1$. Hence
$$U(R) = U_{\rm kin} + U_{\rm pot} \sim \frac{\hbar^2 N^{5/3}}{2 m_e R^2} - \frac{GM^2}{R} . \tag{82}$$
For small $R$, the positive term dominates and so there exists a stable minimum $R_{\rm min}$ for each $M$.

However, if the Fermi gas inside the star becomes relativistic, then $U_{\rm kin} = Ncp$, or
$$U_{\rm kin} \sim N c \hbar\, n^{1/3} \sim \frac{c\hbar N^{4/3}}{R} \tag{83}$$
and
$$U(R) = U_{\rm kin} + U_{\rm pot} \sim \frac{c\hbar N^{4/3}}{R} - \frac{GM^2}{R} . \tag{84}$$
Now both terms scale like $1/R$. For a fixed chemical composition, the ratio $N/M$ remains constant. Therefore, if $M$ is increased the negative term increases faster than the first one. This implies there exists a critical $M$ so that $U$ becomes negative, and can be made arbitrarily small by decreasing the radius of the star: the star collapses. This critical mass is called the Chandrasekhar mass $M_{\rm Ch}$. It can be obtained by solving (84) for $U = 0$. Using $M = N_N m_N$ we have $c\hbar N_{\rm max}^{4/3} = G N_{\rm max}^2 m_N^2$, or, with $m_N \simeq m_p$,
$$N_{\rm max} \sim \left(\frac{c\hbar}{G m_p^2}\right)^{3/2} \sim \left(\frac{M_{\rm Pl}}{m_p}\right)^{3} \sim 2 \times 10^{57} . \tag{85}$$
This leads to
$$M_{\rm Ch} = N_{\rm max}\, m_p \sim 1.5 M_\odot . \tag{86}$$
The Chandrasekhar mass derived “professionally” is found to be $M_{\rm Ch} \simeq 1.46 M_\odot$ [35].

EXERCISE 4.4 Derive approximate Chandrasekhar mass limits in units of solar mass by setting the central pressures of exercise 4.2 equal to the relativistic degenerate electron pressure,
$$P = \frac{(3\pi^2)^{1/3}\,\hbar c}{4}\, n^{4/3} . \tag{87}$$
Compare the estimates with the exact limit.

The critical size can be determined by imposing two conditions: that the gas becomes relativistic, $U_{\rm kin} \lesssim N m_e c^2$, and $N = N_{\rm max}$,
$$N_{\rm max}\, m_e c^2 \gtrsim \frac{c\hbar N_{\rm max}^{4/3}}{R} . \tag{88}$$
This leads to
$$m_e c^2 \gtrsim \frac{c\hbar}{R}\left(\frac{c\hbar}{G m_N^2}\right)^{1/2} , \tag{89}$$
or equivalently
$$R \gtrsim \frac{\hbar}{m_e c}\left(\frac{c\hbar}{G m_N^2}\right)^{1/2} \sim 5 \times 10^{8}\ \mathrm{cm} , \tag{90}$$
which is in agreement with the radii found for white dwarf stars.

C. Supernovae

Supernovae are massive explosions that take place at the end of a star's life cycle. They can be triggered by one of two basic mechanisms: (I) the sudden re-ignition of nuclear fusion in a degenerate star, or (II) the sudden gravitational collapse of the massive star's core.

In a type I supernova, a degenerate white dwarf accumulates sufficient material from a binary companion, either through accretion or via a merger. This material raises its core temperature, triggering runaway nuclear fusion that completely disrupts the star. Since white dwarf stars explode on crossing the Chandrasekhar limit, $M > M_{\rm Ch}$, the total energy released should not vary much. Thus one may wonder if they are possible standard candles.

EXERCISE 4.5 Type Ia supernovae have been observed in some distant galaxies. They have well-known luminosities and at their peak $L_{\rm Ia} \approx 10^{10} L_\odot$. Hence, we can use them as standard candles to measure the distances to very remote galaxies. How far away could a type Ia supernova be, and still be detected with HST?

In type II supernovae the core of a $M \gtrsim 8 M_\odot$ star undergoes sudden gravitational collapse. These stars have an onion-like structure with a degenerate iron core. When the core is completely fused to iron, no further processes releasing energy are possible. Instead, high
EXERCISE 4.4 Derive approximate Chandrasekhar When the core is completely fused to iron, no further mass limits in units of solar mass by setting the central processes releasing energy are possible. Instead, high 10 Point explosion

10 Point explosion The sudden release of a large amount of energy E into a background fluid of density ⇢1 creates a strong explosion, characterized by a strongwhere shockP is wave the postshock (a ‘blast pressure. wave’) To find this pressure, we need to recall the jump emanating from the point where the energy was released.conditions Such explosionsacross a shock. occur If the for shock moves to the right with velocity v1 = v(t), then in the rest-frame of the shock the background gas streams with velocity v to example in astrophysics in the form of supernova explosions. 15 1 the left, and comes out of the shock with a higher density ⇢2, higher pressure P2, and with a lower velocity v2.

u2 u1

The Rankine-Hugonoit relations for the shock tell us

FIG. 7: Left. The sudden release of a large amount of energy into a background⇢ fluid ofv density ρ11creates a strong2 spherical shock 1 = 2 = + (10.3) But how fast will thewave, shock emanating wave from travel the point and where what the energy is left was behind? released. Right. TheJump problem conditions⇢ v of across +1 normal( shock+1) waves.2 If the shock moves to the right with velocity u , then in the rest-frame of the shock the background2 1 gas streams with velocity u = u to sh M 1 − sh the point explosion is alsothe left, known and comes as outSedov-Taylor of the shock with explosion, a higherwhere density afterρ2, higher the pressure two scientistsP2, and with a lower velocity u2. Conservation of momentum requires P + ρ u2 = P + ρ u2, see Appendix C. For the case at hand, P P and so Pv1 ρ u2. that first solved it by analytic (and1 in1 part1 2 numerical)2 2 means in the context1  of2 = 2 ∼ 1 1 (10.4) M c1 atomic bomb explosions. Today, the problem can provideis the a Mach useful number test of to the validate shock. For a strong explosion, the sound-speed of the a hydrodynamical numericalenergy collisions scheme, break because apart iron aninto helium analyticbackground and even- solution mediumThe for released is negligibly it can energy small, be goes so mainly that the into Mach neutrinos number (99%), will tend to infinity tually into protons and neutrons, in this limit. Forkinetic the pressure, energy (1%); the Rankine-Hugonoitonly 0.01% into photons. relation is computed which can then be compared to numerical results. Also,Much the of problem the modeling of supernova explosions and 56 4 2 Fe 13 He + 4 n (91) their remnants derivesP2 2 from the nuclear1 bomb research serves as a good example to demonstrate26 → the2 power of dimensional analysis and= M (10.5) program. WheneverP1 a supernova +1 goes+1 off a large amount scale-free solutions. 
and of energy E is injected into the “ambient medium” of 2 As the backgrounduniform pressure density is Pρ1 .= In⇢ the1c1/ initial, we phase then ofobtain the expansion in the limit of a strong 4 p n 1 2He 2 + 2 . shock: (92) → the impact of the external medium2 will be small, because the mass of the ambient medium2⇢1v1 that is overrun and This removes the thermal energy necessary to provide P2 (10.6) 10.1 A rough estimatepressure support and the star collapses. When the star taken along is still small' compared +1 with the ejecta mass. begings to contract the density increasesWith and the this free postshockThe supernovapressure, we remnant can now is estimate said to expand the thermal adiabatically. energy in the shocked electrons are forced together with protons tobubble: form neu- After some time a strong spherical shock front (a “blast Let’s begin by deriving an order of magnitude estimate for the radiuswave”)R expands(t)ofthe into the ambient medium, and5 the mass trons via inverse beta decay, 3 2 3 R swept upE bytherm the outwardlyP2R ⇢ moving1v R shock⇢1 significantly (10.7) shock as a function of time. The mass of the swept up material is of order M(t⇠) ⇠ 1 ⇠ t2 3 e− + p n + νe ; (93) exceeds the mass of⇠ the initial ejecta, see Fig. 7. The ⇢1R (t). The fluid velocity behind the shock→ will be of orderThis suggests the mean that radial the thermal velocity energy2 is of the same order as the kinetic energy, ram pressure, P2 ρ1ush, of the matter that enters the of the shock, v(t) R(teven)/t. though We further neutrinos expectdo not interact easilyand with scales matter, in theshock same wave fashion is much with∼ time. larger Hence than the also ambient for the pressure total energy E, which ⇠ at these extremely high densities, they exertis a a conserved tremen- quantity,P1 of the we upstream expect medium, and any radiated energy is dous outward pressure. The outer layers fall inward 2 5 much smaller than the explosion energyR5E. 
This regime, when the iron1 core collapses, formingR anR enormously during whichE the= energyE + E remains constant⇢ is known as (10.8) 2 3 kin therm ⇠ 1 t2 denseEkin neutronMv star [36]. If⇢M1R. MCh2 ,= then⇢1 the2 core stops the Sedov–Taylor(10.1) phase [37–39]. The mass of the swept collapsing⇠ because2 the⇠ neutrons startt gettingSolvingt packed for too the radiusup materialR(t), weis of get order the expectedM(t) ρ dependencer3(t), where r is the ∼ 1 tightly. Note that MCh as derived in (86) is valid for both radius of the shock. The fluid velocity1 behind the shock 2 What about the thermalneutrons energy and electrons, in the since bubble the stellar created mass is in by both the explosion?will be of order This the mean radialEt velocity5 of the shock, cases given by the sum of the nucleon masses, only the R(t) (10.9) should be of order ush(t) r(t)/t and so the/ kinetic⇢1 energy is main source of pressure (electrons3 or neutrons) differs. ∼ ✓ ◆ m 1 r2 r5 The critical sizeE followstherm from (90)PV by substituting e with E(10.2)Mu2 r3 kin = sh ρ1 2 = ρ1 2 . (95) mN, ⇠ 2 2 ∼ t t !1/2 What about the thermal energy in the bubble created by } c} 2 R & 3 105 cm . (94) the explosion? This should be of order m c 2 N GmN ∼ × 3 r5 E = P1V P r3 ρ u2 r3 ρ . (96) Since already Sirius B was difficult to detect, the ques- therm 2 2 ∼ 2 ∼ 1 sh ∼ 1 t2 tion arises if and how these extremely small stars can be observed. When core density reaches nuclear density, This suggests that the thermal energy is of the same order the equation of state stiffens suddenly and the infalling as the kinetic energy, and scales in the same fashion with material is “reflected.” Both the outburst and time. Therefore the outer layers that crash into the core and rebound r5 E = E + E ρ , (97) cause the entire star outside the core to be blown apart. kin therm ∼ 1 t2 16 yielding leave it. Because no light escapes after the star reaches this infinite density, it is called a . !1/5 Et2 r(t) . 
(98) ∼ ρ 1 V. WARPING The expanding shock wave slows as it expands A hunter is tracking a bear. Starting at his camp, he !1/5 !1/2 walks one mile due south. Then the bear changes direc- 2 E 2 E 3/2 ush = = r− . (99) tion and the hunter follows it due east. After one mile, 5 ρ t3 5 ρ 1 1 the hunter loses the bear’s track. He turns north and This means that the blask wave decelerates and dis- walks for another mile, at which point he arrives back at sapears after some time. The expanding supernova his camp. What was the color of the bear? remnant then passes from its Taylor-Sedov phase to its An odd question. Not only is the color of the bear “snowplow” phase. During the snowplow phase, the unrelated to the rest of the question, but how can the matter of the ambient is swept up hunter walk south, east and north, and then arrive back by the expanding dense shell, just as snow is swept up at his camp? This certainly does not work everywhere by a coasting snowplow. on Earth, but it does if you start at the North pole. There- fore the color of the bear has to be white. A surprising EXERCISE 4.6 Estimate the energy of the first observation is that the described by the hunter’s detonation of a nuclear weapon (code name Trinity) path has two right angles in the two bottom corners, and from the time dependence of the radius of its shock so the sum of all three angles is greater than 180◦. This wave. Photographs of the early stage of the explosion implies the metric space is curved. are shown in Fig. 8. The device was placed on the top What is meant by a curved space? Before answering of a tower, h = 30 m and the explosion took place at this question, we recall that our normal method of view- about 1100 m above sea level. (i) Explain the origin ing the world is via Euclidean plane geometry, where of the thin layer above the bright “fireball” that can the line element of the n-dimensional space is given by be seen in the last three pictures (t 0.053 s). 
Is the ≥ Xn shock front behind or ahead of this layer? Read the ds2 = dx2 . (100) radius of the shock front from the figures and plot it i i=1 as a function of time after the explosion. The time and length scale are indicated in the lables of the figures. Non-Euclidean geometries which involve curved spaces (ii) Fit (by eye or numerical regression) a line to the have been independently imagined by Gauss [50], radius vs. time dependence of the shock front in a Bolyai´ [51], and Lobachevsky [52]. To understand the log-log representation, ln(r) = a + b ln(t). Verifiy that b is idea of a metric space herein we will greatly simplify compatible with a Sedov-Taylor expansion. Then fix b the discussion by considering only 2-dimensional sur- to the theoretical expectation, re-evaluate a and estimate faces. For 2-dimensional metric spaces, the so-called first the energy of the bomb in tons of TNT equivalent. [Hint: and second fundamental forms of differential geometry ignore the initial (short) phase of free expansion.] uniquely determine how to measure lengths, areas and angles on a surface, and how to describe the shape of a If the final mass of a is less than MCh its parameterized surface. subsequent evolution is thought to be similar to that of a white dwarf. In 1967, an unusual object emitting a radio signal with period T = 1.377 s was detected at the A. 2-dimensional metric spaces Mullard Radio Astronomy Observatory. By its very na- ture the object was called “pulsar.” Only one year later, The parameterization of a surface maps points (u, v) Gold argued that pulsars are rotating neutron stars [41]. in the domain to points ~σ(u, v) in space: He predicted an increase on the pulsar period because   of electromagnetic energy losses. The slow-down of the x(u, v)   Crab pulsar was indeed discovered in 1969 [42]. ~ u v  y(u, v)  σ( , ) =   . 
(101) If the mass of the neutron star is greater than MCh,  z(u, v)  then the star collapses under gravity, overcoming even the neutron exclusion principle [43]. The star eventually Differential geometry is the local analysis of how small collapses to the point of zero volume and infinite density, changes in position (u, v) in the domain affect the position creating what is known as a “singularity” [44–49]. As the on the surface ~σ(u, v), the first derivatives ~σu(u, v) and density increases, the paths of light rays emitted from ~σv(u, v), and the surface normal nˆ(u, v). the star are bent and eventually wrapped irrevocably The first derivatives, ~σu(u, v) and ~σv(u, v), are vectors around the star. Any emitted photon is trapped into that span the tangent plane to the surface at point ~σ(u, v). an orbit by the intense gravitational field; it will never The surface normal at point ~σ is defined as the unit vector 17

FIG. 8: Trinity test of July 16, 1945. Figures also available at http://cosmo.nyu.edu/~mu495/HEA15/trinity/

normal to the tangent plane at point $\vec{\sigma}$ and is computed using the cross product of the partial derivatives of the surface parameterization,

$$ \hat{n}(\vec{\sigma}) = \frac{\vec{\sigma}_u \times \vec{\sigma}_v}{\| \vec{\sigma}_u \times \vec{\sigma}_v \|} \,. \tag{102} $$

The tangent vectors and the surface normal define a local coordinate frame at point $\vec{\sigma}(u, v)$ on the surface, with $\hat{n}$ orthogonal to both tangent vectors; this frame is the framework for describing the local shape of the surface.

Geometrically, $d\vec{\sigma}$ is a differential vector quantity that is tangent to the surface in the direction defined by $du$ and $dv$. The first fundamental form, I, which measures the distance of neighboring points on the surface with parameters $(u, v)$ and $(u + du, v + dv)$, is given by the inner product of $d\vec{\sigma}$ with itself

$$ \begin{aligned} {\rm I} \equiv ds^2 = d\vec{\sigma} \cdot d\vec{\sigma} &= (\vec{\sigma}_u\, du + \vec{\sigma}_v\, dv) \cdot (\vec{\sigma}_u\, du + \vec{\sigma}_v\, dv) \\ &= (\vec{\sigma}_u \cdot \vec{\sigma}_u)\, du^2 + 2 (\vec{\sigma}_u \cdot \vec{\sigma}_v)\, du\, dv + (\vec{\sigma}_v \cdot \vec{\sigma}_v)\, dv^2 \\ &= E\, du^2 + 2F\, du\, dv + G\, dv^2 \,, \end{aligned} \tag{103} $$

where $E$, $F$ and $G$ are the first fundamental coefficients. The coefficients have some remarkable properties. For example, they can be used to calculate the surface area. Namely, the area bounded by four vertices $\vec{\sigma}(u, v)$, $\vec{\sigma}(u + \delta u, v)$, $\vec{\sigma}(u, v + \delta v)$, $\vec{\sigma}(u + \delta u, v + \delta v)$ can be expressed in terms of the first fundamental form with the assistance of the Lagrange identity

$$ \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} (a_i b_j - a_j b_i)^2 = \left( \sum_{k=1}^{n} a_k^2 \right) \left( \sum_{k=1}^{n} b_k^2 \right) - \left( \sum_{k=1}^{n} a_k b_k \right)^2 , \tag{104} $$

which applies to any two sets $\{a_1, a_2, \cdots, a_n\}$ and $\{b_1, b_2, \cdots, b_n\}$ of real numbers. The classical area element is found to be

$$ \delta A = | \vec{\sigma}_u\, \delta u \times \vec{\sigma}_v\, \delta v | = \sqrt{EG - F^2}\; \delta u\, \delta v \,, \tag{105} $$

or in differential form

$$ dA = \sqrt{EG - F^2}\; du\, dv \,. \tag{106} $$

Note that the expression under the square root in (106) is precisely $| \vec{\sigma}_u \times \vec{\sigma}_v |^2$ and so it is strictly positive at the regular points.

The key to the second fundamental form, II, is the unit normal vector. The second fundamental form coefficients at a given point in the parametric $uv$-plane are given by the projections of the second partial derivatives of $\vec{\sigma}$ at that point onto the normal vector and can be computed with the aid of the dot product as follows: $e = \vec{\sigma}_{uu} \cdot \hat{n}$, $f = \vec{\sigma}_{uv} \cdot \hat{n}$, and $g = \vec{\sigma}_{vv} \cdot \hat{n}$. The second fundamental form,

$$ {\rm II} = e\, du^2 + 2 f\, du\, dv + g\, dv^2 \,, \tag{107} $$

can be used to characterize the local shape of the folded surface.

The concept of curvature, while intuitive for a plane curve (the reciprocal of the radius of curvature), requires a more comprehensive definition for a surface. Through a point on a surface any number of curves may be drawn, each having a different curvature at the point. We have seen that at any point on a surface we can find $\hat{n}$, which is at right angles to the surface; planes containing the normal vector are called normal planes. The intersection of a normal plane and the surface will form a curve called a normal section, and the curvature of this curve is the normal curvature $\kappa$. For most points on most surfaces, different sections will have different curvatures; the minimum and maximum values of these are called the principal curvatures, denoted by $\kappa_1$ and $\kappa_2$. The Gaussian curvature is defined by the product of the two principal curvatures, $K = \kappa_1 \kappa_2$. It may be calculated using the first and second fundamental coefficients. At each grid point where these values are known two matrices are defined. The matrix of the first fundamental form,

$$ {\rm I} = \begin{pmatrix} E & F \\ F & G \end{pmatrix} , \tag{108} $$

and the matrix of the second fundamental form,

$$ {\rm II} = \begin{pmatrix} e & f \\ f & g \end{pmatrix} . \tag{109} $$

The Gaussian curvature is given by

$$ K = \frac{\det {\rm II}}{\det {\rm I}} \,. \tag{110} $$

As an illustration, consider a half-cylinder of radius $R$ oriented along the $x$ axis. At a particular point on the surface, the normal curvature can have different values depending on direction. In the direction of the half-cylinder's axis (parallel to the $x$ axis), the surface has zero normal curvature, $\kappa = 0$. This is the smallest curvature value at any point on the surface, and therefore $\kappa_1$ is in this direction. For a curve on the half-cylinder's surface parallel to the $(y, z)$ plane, the cylinder has uniform normal curvature. In fact this curvature is the greatest possible on the surface, so that $\kappa_2 = 1/R$ is in this direction. For a curve on the surface not in one of these directions, the normal curvature is greater than $\kappa_1$ and less than $\kappa_2$. The Gaussian curvature is $K = \kappa_1 \kappa_2 = 0$.

2-dimensional metric spaces can be classified according to the Gaussian curvature into elliptic ($K > 0$), flat ($K = 0$), and hyperbolic ($K < 0$). Triangles which lie on the surface of an elliptic geometry will have a sum of angles which is greater than $180^\circ$. Triangles which lie on the surface of a hyperbolic geometry will have a sum of angles which is less than $180^\circ$.
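The half-cylinder illustration can be checked numerically. The following is a minimal sketch, not part of the original lectures: it assumes the parameterization $\vec{\sigma}(u, v) = (u, R\cos v, R\sin v)$ and an arbitrary radius $R = 2$, estimates the derivatives of $\vec{\sigma}$ by central finite differences (the step `h` and all function names are illustrative choices), and then evaluates the coefficients of (103) and (107) and the ratio (110). The principal curvatures are obtained from the standard relation $\kappa_{1,2} = H \mp \sqrt{H^2 - K}$, where $H = (eG - 2fF + gE)/[2(EG - F^2)]$ is the mean curvature, a quantity not introduced in the text.

```python
import math

R = 2.0  # cylinder radius; an arbitrary illustrative value, not from the text


def sigma(u, v):
    """Assumed parameterization of a half-cylinder of radius R along the x axis."""
    return (u, R * math.cos(v), R * math.sin(v))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def combine(terms):
    """Linear combination sum_k c_k * p_k of 3-vectors p_k with weights c_k."""
    return tuple(sum(c * p[i] for c, p in terms) for i in range(3))


def fundamental_forms(surf, u, v, h=1e-4):
    """Return (E, F, G) and (e, f, g) using central finite differences."""
    s = lambda du, dv: surf(u + du * h, v + dv * h)
    w1, w2 = 0.5 / h, 1.0 / h**2
    s_u = combine([(w1, s(1, 0)), (-w1, s(-1, 0))])
    s_v = combine([(w1, s(0, 1)), (-w1, s(0, -1))])
    s_uu = combine([(w2, s(1, 0)), (-2 * w2, s(0, 0)), (w2, s(-1, 0))])
    s_vv = combine([(w2, s(0, 1)), (-2 * w2, s(0, 0)), (w2, s(0, -1))])
    s_uv = combine([(w2 / 4, s(1, 1)), (-w2 / 4, s(1, -1)),
                    (-w2 / 4, s(-1, 1)), (w2 / 4, s(-1, -1))])
    n = cross(s_u, s_v)                      # unnormalized normal, Eq. (102)
    norm = math.sqrt(dot(n, n))
    n = tuple(x / norm for x in n)
    first = (dot(s_u, s_u), dot(s_u, s_v), dot(s_v, s_v))   # E, F, G
    second = (dot(s_uu, n), dot(s_uv, n), dot(s_vv, n))     # e, f, g
    return first, second


def curvatures(surf, u, v):
    """Gaussian curvature K, Eq. (110), and the two principal curvatures."""
    (E, F, G), (e, f, g) = fundamental_forms(surf, u, v)
    det_I = E * G - F * F
    K = (e * g - f * f) / det_I
    H = (e * G - 2 * f * F + g * E) / (2 * det_I)   # mean curvature
    root = math.sqrt(max(H * H - K, 0.0))
    return K, H - root, H + root


K, k1, k2 = curvatures(sigma, 0.3, 1.0)
print("K =", K, " kappa_1 =", k1, " kappa_2 =", k2, " 1/R =", 1 / R)
```

Up to finite-difference error, the printed values should reproduce $K = 0$, $\kappa_1 = 0$ and $\kappa_2 = 1/R$. The sign of the principal curvatures depends on the orientation of $\hat{n}$; with the parameterization assumed here the normal points toward the cylinder axis, so $\kappa_2$ comes out positive. Replacing `sigma` by another parameterization gives a quick sanity check on hand calculations.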

EXERCISE 5.1 The unit sphere can be parametrized as

$$ \vec{\sigma}(u, v) = \begin{pmatrix} \cos u \sin v \\ \sin u \sin v \\ \cos v \end{pmatrix} , \tag{111} $$

where $(u, v) \in [0, 2\pi) \times [0, \pi]$. (i) Find the distance of neighboring points on the surface with parameters $(u, v)$ and $(u + du, v + dv)$, a.k.a. the line element $ds^2$. (ii) Find the surface area. (iii) Find the Gaussian curvature.

EXERCISE 5.2 The tractrix is a curve with the following nice interpretation: Suppose a dog-owner takes his pet along as he goes for a walk "down" the $y$-axis. He starts from the origin, with his dog initially standing on the $x$-axis at a distance $r$ away from the owner. Then the tractrix is the path followed by the dog if he "follows his owner unwillingly", i.e., if he constantly pulls against the leash, keeping it tight. This means mathematically that the leash is always tangent to the path of the dog, so that the length of the tangent segment from the tractrix to the $y$-axis has constant length $r$. The tractrix has a well-known surface of revolution called the pseudosphere which, for $r = 1$, can be parametrized as

$$ \vec{\sigma}(u, v) = \begin{pmatrix} {\rm sech}\, u \cos v \\ {\rm sech}\, u \sin v \\ u - \tanh u \end{pmatrix} , \tag{112} $$

with $u \in (-\infty, \infty)$ and $v \in [0, 2\pi)$. (i) Find the line element. (ii) Find the surface area. (iii) Find the Gaussian curvature.

A curve $\gamma$ with parameter $t$ on a surface $\vec{\sigma}(u, v)$ is called a geodesic if at every point $\gamma(t)$ the acceleration vector $\ddot{\vec{\gamma}}(t)$ is either zero or parallel to the unit normal $\hat{n}$ of the surface.

EXERCISE 5.3 Show that a geodesic $\gamma(t)$ on a surface $\vec{\sigma}$ has constant speed.

EXERCISE 5.4 A curve $\gamma$ on a surface $\vec{\sigma}$ is a geodesic if and only if for any part $\gamma(t) = \vec{\sigma}(u(t), v(t))$ contained in a surface patch $\vec{\sigma}$, the following two equations are satisfied:

$$ \frac{d}{dt} (E \dot{u} + F \dot{v}) = \frac{1}{2} (E_u \dot{u}^2 + 2 F_u \dot{u} \dot{v} + G_u \dot{v}^2) \,, \tag{113} $$

$$ \frac{d}{dt} (F \dot{u} + G \dot{v}) = \frac{1}{2} (E_v \dot{u}^2 + 2 F_v \dot{u} \dot{v} + G_v \dot{v}^2) \,, \tag{114} $$

where $E\, du^2 + 2F\, du\, dv + G\, dv^2$ is the first fundamental form of $\vec{\sigma}$. Equations (113) and (114) are called the geodesic equations. They are nonlinear and solvable analytically on rare occasions only.

EXERCISE 5.5 Show that if $\gamma$ is a geodesic on the unit sphere $S^2$, then $\gamma$ is part of a great circle. Consider the patch under the parametrization

$$ \vec{\sigma}(\theta, \phi) = \begin{pmatrix} \cos\theta \cos\phi \\ \cos\theta \sin\phi \\ \sin\theta \end{pmatrix} . \tag{115} $$

[Hint: A great circle (a.k.a. orthodrome) of a sphere is the intersection of the sphere and a plane which passes through the center point of the sphere.]

The scalar curvature (or Ricci scalar) is the simplest curvature invariant of an $n$-dimensional hypersurface. To each point on the hypersurface, it assigns a single real number determined by the intrinsic geometry of the hypersurface near that point. It provides one way of measuring the degree to which the geometry determined by a given metric might differ from that of ordinary Euclidean $n$-space. In two dimensions, the scalar curvature is twice the Gaussian curvature, $R = 2K$, and completely characterizes the curvature of a surface. In more than two dimensions, however, the curvature of hypersurfaces involves more than one functionally independent quantity.

B. Schwarzschild metric

Consider a freely falling spacecraft in the gravitational field of a radially symmetric mass distribution with total mass $M$. Because the spacecraft is freely falling, no effects of gravity are felt inside. Then, the spacetime coordinates from $r \to \infty$ should be valid inside the spacecraft. Let us call these coordinates $\vec{\Sigma}_\infty = (t_\infty, x_\infty, y_\infty, z_\infty)$, with $x_\infty$ parallel and $y_\infty$, $z_\infty$ transversal to the movement. The spacecraft has velocity $v$ at the distance $r$ from the mass $M$, measured in the coordinate system $\vec{\Sigma} = (t, r, \theta, \phi)$ in which the mass $M$ is at rest at $r = 0$. As long as the gravitational field is weak, we can assume to first order approximation that the laws of special relativity hold [58], and we can use a Lorentz transformation [26] to relate $\vec{\Sigma}$ at rest and $\vec{\Sigma}_\infty$ moving with $v = \beta c$. We will define shortly what "weak" means in this context. For the moment, we presume that effects of gravity are small if the velocity of the spacecraft, which was at rest at $r \to \infty$, is still small, $v \ll c$. Should this be the case, we have

$$ \begin{aligned} dt_\infty &= dt\, \sqrt{1 - \beta^2} \\ dx_\infty &= \frac{dr}{\sqrt{1 - \beta^2}} \\ dy_\infty &= r\, d\theta \\ dz_\infty &= r \sin\theta\, d\phi \,. \end{aligned} \tag{116} $$

The infinitesimal distance between two spacetime events is given by the Minkowskian line element [59]

$$ ds^2 = g_{\mu\nu}\, dx^\mu dx^\nu = c^2 dt_\infty^2 - dx_\infty^2 - dy_\infty^2 - dz_\infty^2 \,, \tag{117} $$

which, for the case at hand, becomes

$$ ds^2 = (1 - \beta^2)\, c^2 dt^2 - \frac{dr^2}{1 - \beta^2} - r^2 (d\theta^2 + \sin^2\theta\, d\phi^2) \,. \tag{118} $$

Herein we follow the notation of [27]: Greek indices $(\mu, \nu, \cdots)$ run from 0 to 3 and Latin indices $(i, j, \cdots)$ from 1 to 3.

We now turn to determine $\beta$ from measurable quantities of the system: $M$ and $r$. Consider the energy of the spacecraft with rest mass $m$,

$$ (\gamma - 1)\, m c^2 - \frac{G \gamma m M}{r} = 0 \,, \tag{119} $$

where the first term is the kinetic energy and the second the Newtonian expression for the potential energy. Note that here we have made the crucial assumption that gravity couples not only to the mass of the spacecraft but also to its total energy. Dividing by $\gamma m c^2$ gives

$$ \left( 1 - \frac{1}{\gamma} \right) - \frac{GM}{r c^2} = 0 \,. \tag{120} $$

Introducing $\alpha = GM/c^2$ and using $\gamma = (1 - \beta^2)^{-1/2}$ we can re-write (120) as

$$ \sqrt{1 - \beta^2} = 1 - \frac{\alpha}{r} \,. \tag{121} $$

Squaring (121) leads to

$$ 1 - \beta^2 = 1 - \frac{2\alpha}{r} + \frac{\alpha^2}{r^2} \approx 1 - \frac{2\alpha}{r} \,; \tag{122} $$

in the last step, we neglected the term $(\alpha/r)^2$, since we attempt only an approximation for large distances, where gravity is still weak. Inserting this expression into (118), we obtain the metric describing the gravitational field produced by a radially symmetric mass distribution,

$$ ds^2 = \left( 1 - \frac{2\alpha}{r} \right) c^2 dt^2 - \left( 1 - \frac{2\alpha}{r} \right)^{-1} dr^2 - r^2 d\Omega^2 \,, \tag{123} $$

where $d\Omega^2 = d\theta^2 + \sin^2\theta\, d\phi^2$. Wickedly, this agrees with the exact result found by Schwarzschild [60] by solving Einstein's vacuum field equations of general relativity [61].

As in special relativity, the line element $ds^2$ determines the time and spatial distance between two spacetime events. The time measured by an observer in the instantaneous rest frame, known as the proper time $d\tau$, is given by $d\tau = ds/c$ [27]. In particular, the time difference between two events at the same point is obtained by setting $dx^i = 0$. If we choose two static observers at the positions $r$ and $r'$, then we find with $dr = d\phi = d\theta = 0$,

$$ \frac{d\tau(r)}{d\tau(r')} = \frac{\sqrt{g_{00}(r)}\; dt}{\sqrt{g_{00}(r')}\; dt} = \sqrt{\frac{g_{00}(r)}{g_{00}(r')}} \,. \tag{124} $$

The time intervals $d\tau(r')$ and $d\tau(r)$ are different and thus the time measured by clocks at different distances $r$ from the mass $M$ will differ too. In particular, the time $\tau_\infty$ measured by an observer at infinity will pass faster than the time experienced in a gravitational field,

$$ \tau_\infty = \frac{\tau(r)}{\sqrt{1 - 2\alpha/r}} > \tau(r) \,. \tag{125} $$

Since frequencies are inversely proportional to time, the frequency or energy of a photon traveling from $r$ to $r'$ will be affected by the gravitational field as

$$ \frac{\nu(r')}{\nu(r)} = \sqrt{\frac{1 - 2\alpha/r}{1 - 2\alpha/r'}} \,. \tag{126} $$

Therefore, an observer at $r' \to \infty$ will receive photons, which were emitted with frequency $\nu$ by a source at position $r$, redshifted to frequency $\nu_\infty$,

$$ \nu_\infty = \sqrt{1 - \frac{2GM}{r c^2}}\; \nu(r) \,. \tag{127} $$

Note that the photon frequency is redshifted by the gravitational field. The size of this effect is of order $\Phi/c^2$, where $\Phi = -GM/r$ is the Newtonian gravitational potential. We are now in position to specify more precisely what a weak gravitational field means. As long as $|\Phi|/c^2 \ll 1$, the deviation of

$$ g_{00} = 1 - \frac{2\alpha}{r} = 1 - \frac{2GM}{r c^2} = 1 + \frac{2\Phi(r)}{c^2} \tag{128} $$

from the Minkowski value $g_{00} = 1$ is small, and Newtonian gravity is a sufficient approximation.

What is the meaning of $r = 2\alpha$? At

$$ R_{\rm Sch} = \frac{2GM}{c^2} = 3~{\rm km}\; \frac{M}{M_\odot} \,, \tag{129} $$

the Schwarzschild coordinate system (123) becomes ill-defined. However, this does not necessarily mean that at $r = R_{\rm Sch}$ physical quantities like tidal forces become infinite. As a matter of fact, all scalar invariants are finite, e.g. $R = 0$ and $K = 12 R_{\rm Sch}^2 / r^6$. Here $R$ is the Ricci scalar and $K$ the Kretschmann scalar [62], a quadratic scalar invariant used to find the true singularities of a spacetime. The scalar invariants of the Schwarzschild metric can only be found by a long and troublesome calculation that is beyond the scope of this course; for a comprehensive discussion see e.g. [63, 64]. Before proceeding we emphasize again that whether or not the singularity is moved to the origin only depends on the coordinate frame used, and has no physical significance whatsoever; see Appendix D for an example.

If the gravitating mass is concentrated inside a radius smaller than $R_{\rm Sch}$ then we cannot obtain any information about what is going on inside $R_{\rm Sch}$, and we say $r = R_{\rm Sch}$ defines an event horizon. An object smaller than its

Schwarzschild radius is called a black hole. In Newtonian gravity, only the enclosed mass $M(r)$ of a spherically symmetric system contributes to the gravitational potential outside $r$. Therefore, we conclude the Sun is not a black hole, because for all values of $r$ the enclosed mass is $M(r) < r c^2/(2G)$. The Schwarzschild black hole is fully characterized by its mass $M$. To understand this better, we consider next what happens to a photon crossing the event horizon as seen from an observer at $r \to \infty$.

Light rays are characterized by $ds^2 = 0$. Consider a light ray traveling in the radial direction, that is to say $d\phi = d\theta = 0$. With $ds^2 = 0$, the Schwarzschild metric (123) gives

$$ \frac{dr}{dt} = \left( 1 - \frac{2\alpha}{r} \right) c \,. \tag{130} $$

As seen from far away a light ray approaching a massive star will travel slower and slower as it comes closer to the Schwarzschild radius. In fact, for an observer at infinity the signal will reach $r = R_{\rm Sch}$ only asymptotically, for $t \to \infty$. Similarly, the communication with a freely falling spacecraft becomes impossible as it reaches $r = R_{\rm Sch}$. A more detailed analysis shows that indeed, as seen from infinity, no signal can cross the surface at $r = R_{\rm Sch}$. The factors $(1 - 2\alpha/r)$ in (123) control the bending of light, a phenomenon known as gravitational lensing. The first observation of light deflection was performed by noting the change in position of stars as they passed near the Sun on the celestial sphere. The observations were performed in May 1919 during a total solar eclipse, so that the stars near the Sun (at that time in the constellation Taurus) could be observed [65].

EXERCISE 5.6 In addition to the time dilation due to an object moving at a finite speed that we have learned about in special relativity, we have seen that there is an effect in general relativity, termed "gravitational redshift," caused by gravity itself. To understand this latter effect, consider a photon escaping from the Earth's surface to infinity. It loses energy as it climbs out of the Earth's gravitational well. As its energy $E$ is related to its frequency $\nu$ by Planck's formula $E = h\nu$, its frequency must therefore also be reduced, so observers at a great distance $r \to \infty$ must see clocks on the surface ticking at a lower frequency as well. Therefore an astronaut orbiting the Earth ages differently from an astronomer sitting still far from the Earth for two reasons: the effect of gravity, and the time dilation due to motion. In this problem, you will calculate both these effects, and determine their relative importance. (i) The escape speed from an object of mass $M$ if you are a distance $r$ from it is given by

$$ v_{\rm escape} = \sqrt{\frac{2GM}{r}} \,. \tag{131} $$

That is, if you are moving this fast, you will not fall back to the object, but will escape its gravitational field entirely. Schwarzschild's solution to Einstein's field equations of general relativity shows that a stationary, non-moving clock at a radius $r \geq R_\oplus$ from the Earth will tick at a rate that is

$$ \sqrt{1 - \frac{1}{c^2} \frac{2 G M_\oplus}{r}} = \sqrt{1 - \frac{v_{\rm escape}^2}{c^2}} \tag{132} $$

times as fast as one located far away from the Earth (i.e. at $r \to \infty$). Note how much this expression looks like the equivalent expression from special relativity for time dilation. Here $R_\oplus$ is the radius of the Earth, and $M_\oplus$ is its mass. Using (132) calculate the rate at which a stationary clock at a radius $r$ (for $r > R_\oplus$) will tick relative to one at the surface of the Earth. Is your rate greater or less than 1? If greater than 1, this means the high altitude clock at $r > R_\oplus$ ticks faster than one on the surface; if less than 1, this means the high altitude clock ticks slower than one on the surface. (ii) Now consider an astronaut orbiting at $r > R_\oplus$. What is her orbital velocity as a function of $r$? Because she is moving with respect to a stationary observer at radius $r$, special relativity says that her clock is ticking slower. Calculate the ratio of the rate her clock ticks to that of a stationary observer at radius $r$. (Note that for circular motion, the acceleration in the spaceship travelling in a circle is not zero, so the spaceship is not in a single frame of inertia.) (iii) Determine an expression for the ratio of the rate at which the orbiting astronaut's clock ticks to a stationary clock on the surface of the Earth, as a function of the radius $r$ at which she orbits. You may ignore the small velocity of the clock on the surface of the Earth due to the Earth's rotation. (iv) Using $\sqrt{1 - x} \approx 1 - x/2 + \cdots$, $(1 - x)^{-1} \approx 1 + x + \cdots$, and $(1 - x)(1 - y) = 1 - x - y + xy \approx 1 - (x + y)$, all valid for $x \ll 1$ and $y \ll 1$, derive an expression of the form $1 - \delta$ for the relative rate of a ticking clock on the surface of the Earth and the orbiting astronaut. Demonstrate that $\delta \ll 1$. (v) Calculate the radius $r$ at which the clock of the orbiting astronaut ticks at the same rate as a stationary one on the surface of the Earth; express your result in Earth radii and kilometers. Will an astronaut orbiting at a smaller radius age more or less than one who stayed home? Thus, do astronauts on the Space Shuttle (orbiting 300 km above the Earth's surface) age more or less than one staying home?

EXERCISE 5.7 "A full set of rules [of Brockian Ultra Cricket, as played in the higher dimensions] is so massively complicated that the only time they were all bound together in a single volume they underwent gravitational collapse and became a black hole" [66]. A quote like this is crying out for a calculation. In this problem, we will answer Adams' challenge, and determine just how complicated these rules actually are. An object will collapse into a black hole when its radius is equal to the radius of a black hole of the same mass; under these conditions, the escape speed at its surface is the speed of light (which is in fact the defining characteristic of a black hole). We can rephrase the above to say that an object will collapse into a black hole when its density is equal to the density of a black hole of the same mass. (i) Derive an expression for the density of a black hole of mass $M$. Treat the volume of the black hole as the volume of a sphere of radius given by the Schwarzschild radius. As the mass of a black hole gets larger, does the density grow or shrink? (ii) Determine the density of the paper making up the Cricket rule book, in units of kilograms per cubic meter. Standard paper has a surface density of 75 g per square meter, and a thickness of 0.1 mm. (iii) Calculate the mass (in solar masses), and radius (in AU) of the black hole with density equal to that of paper. (iv) How many pages long is the Brockian Ultra Cricket rule book? Assume the pages are standard size ($8.5'' \times 11''$). For calculational simplicity, treat the book as spherical (a common approximation in this kind of problem). What if the rule book were even longer than you have just calculated? Would it still collapse into a black hole?

EXERCISE 5.8 Black holes provide the ultimate laboratory for studying strong-field gravitational physics. The tides near black holes can be so extreme that a process informally called "spaghettification" occurs in which a body falling towards a black hole is strongly stretched due to the difference in gravitational force at different locations along the body (this is called a tidal effect). In the following, imagine that you are falling into a $3 M_\odot$ black hole. (i) What is the Schwarzschild radius of this black hole (in km)? (ii) You are 1.5 m tall and 70 kg in mass and are falling feet first. At what distance from the black hole would the gravitational force on your feet exceed the gravitational force on your head by 10 kN? Express this distance in km and in Schwarzschild radii of the black hole. (iii) To appreciate if this amount of force is enough to "spaghettify" and kill you, imagine that you are suspended from a ceiling of your room (on Earth) with a steel plate tied to your feet. Calculate the mass of the plate (in kg) that will give you a nice tug of 10 kN (you can ignore the weight of your body here). Do you think this pull will kill you? (iv) Now consider a trip toward the supermassive black hole at the center of our Galaxy, which has an estimated mass of 4 million $M_\odot$. How does this change the distance at which you will be "spaghettified" by the differential gravity force of 10 kN? Express your answer in km and in Schwarzschild radii of the black hole. (v) Find the smallest mass of the black hole for which you would not die by "spaghettification" before falling within its event horizon.

EXERCISE 5.9 In the Schwarzschild metric $r$ is a comoving coordinate, not a real physical distance. Rather, integrals over $ds$ constitute physical distances. In the following we will take slices of spacetime at a constant time ($dt = 0$). (i) Compute the physical circumference, $C$, at a given coordinate distance $R$ from the center of a black hole of mass $M$ at $\theta = \pi/2$. (ii) Compute the physical distance $R_{\rm phys}$ from the center of the black hole out to the coordinate distance $R$ (assume $R > R_{\rm Sch}$ and take the absolute value of $g_{rr}$). [Hint: The following facts may be helpful:

$$ \int_0^1 \sqrt{\frac{\xi}{1 - \xi}}\; d\xi = \frac{\pi}{2} \,, \tag{133} $$

and

$$ \int_1^\alpha \sqrt{\frac{\xi}{\xi - 1}}\; d\xi = \ln\left( \sqrt{\alpha - 1} + \sqrt{\alpha} \right) + \sqrt{\alpha - 1}\, \sqrt{\alpha} \,, \tag{134} $$

where $\alpha > 1$ is constant.] (iii) Now use your answers to part (i) and part (ii) to compute $\Pi$, where $C = 2 \Pi R_{\rm phys}$. (iv) Plot $\Pi$ as a function of $\xi \equiv R/R_{\rm Sch}$ for $\xi \in [1, 10^3]$ (use log axes for the $x$ axis). What happens with $\Pi$ as $\xi \to \infty$?

C. Eddington luminosity and black hole growth

Binary X-ray sources are places to find strong black hole candidates [67, 68]. A companion star is a perfect source of infalling material for a black hole. As the matter falls or is pulled towards the black hole, it gains kinetic energy, heats up and is squeezed by tidal forces. The heating ionizes the atoms, and when the gas reaches a temperature of a few million kelvin, it emits X-rays. The X-rays are sent off into space before the matter crosses the event horizon, and so we can detect this X-ray emission. Another sign of the presence of a black hole is random variation of emitted X-rays. The infalling matter that emits X-rays does not fall into the black hole at a steady rate, but rather more sporadically, which causes an observable variation in X-ray intensity. Additionally, if the X-ray source is in a binary system, the X-rays will be periodically cut off as the source is eclipsed by the companion star.

Cygnus X-1 is one of the strongest X-ray sources we can detect from Earth [69] and the first widely thought to be a black hole, after the detection of its rapid X-ray variability [70] and the identification of its optical counterpart with the blue supergiant HDE 226868 [71, 72]. The X-ray emission is powered mainly by accretion from the strong stellar wind from HDE 226868 [73]. While the disk of accreting matter is incredibly bright on its own, Cygnus X-1 has another source of light: a pair of jets perpendicular to the disk erupt from the black hole carrying part of the infalling material away into interstellar space [74].

Consider a steady spherically symmetrical accretion. We assume the accreting material to be mainly hydrogen and to be fully ionized. Under these circumstances, the radiation exerts a force mainly on the free electrons through Thomson scattering, since the scattering cross section for protons is a factor $(m_e/m_p)^2$ smaller, where $m_e/m_p = 5 \times 10^{-4}$ is the ratio of the electron and proton masses [75]. If $F$ is the radiant energy flux (erg s$^{-1}$ cm$^{-2}$) and $\sigma_T = 6.7 \times 10^{-25}$ cm$^2$ is the Thomson cross section, then the outward radial force on each electron equals the rate at which it absorbs momentum,

$$ F_{\rm out} = \frac{\sigma_T F}{c} \,. \tag{135} $$

The attractive electrostatic Coulomb force between the electrons and protons means that as they move out the electrons drag the protons with them. In effect, the radiation pushes out electron-proton pairs against the total gravitational force

$$ F_{\rm in} = \frac{GM}{r^2} (m_p + m_e) \tag{136} $$

acting on each pair at a radial distance $r$ from the center. If the luminosity of the accreting source is $L$ (erg s$^{-1}$), we have

$$ F = \frac{L}{4\pi r^2} \tag{137} $$

by spherical symmetry, so the net inward force on an electron-proton pair (neglecting $m_e$ compared with $m_p$) is

$$ F_{\rm net} = \left( G M m_p - \frac{L \sigma_T}{4\pi c} \right) \frac{1}{r^2} \,. \tag{138} $$

There is a limiting luminosity for which this expression vanishes, called the Eddington limit [76]

$$ L_{\rm Edd} = \frac{4\pi G M m_p c}{\sigma_T} \simeq 1.3 \times 10^{38} \left( \frac{M}{M_\odot} \right) {\rm erg~s^{-1}} \,. \tag{139} $$

At greater luminosities the outward pressure of radiation would exceed the inward gravitational attraction and accretion would be halted.

Active galactic nuclei (AGNs) are galaxies that harbor compact masses at the center exhibiting intense non-thermal emission that is often variable, which indicates small sizes (light months to light years). The luminosity of an accreting black hole is proportional to the rate at which it is gaining mass. Under favorable conditions, the accretion leads to the formation of a highly relativistic collimated jet. The formation of the jet is not well constrained, but it is thought to change from magnetic-field-dominated near the central engine to particle (electron and positron, or ions and electrons) dominated beyond pc distances. The AGN taxonomy, controlled by the dichotomy between radio-quiet and radio-loud classes, is represented in Fig. 9. The appearance of an AGN depends crucially on the orientation of the observer with respect to the symmetry axis of the accretion disk [78].

EXERCISE 5.10 The pictures in Fig. 10 show a time sequence of radio observations of the quasar 0827+243. The core of the quasar is the bright object at a distance of 0 ly and a fainter blob of plasma is moving away from it. (i) What is the apparent velocity of the motion of the plasma blob? (ii) Derive the apparent transverse velocity of an object ejected from a source at velocity $v$ at an angle $\theta$ with respect to the line of sight between the source and the observer. (iii) Which angle maximizes the apparent transverse velocity? What is accordingly the minimal Lorentz-factor of the plasma blob observed in 0827+243?

VI. EXPANSION OF THE UNIVERSE

The observations that we will discuss in this section reveal that the universe is in a state of violent explosion, in which the galaxies are rushing apart at speeds approaching the speed of light. Moreover, we can extrapolate this explosion backwards in time and conclude that all the galaxies must have been much closer at the same time in the past: so close, in fact, that neither galaxies nor stars nor even atoms or atomic nuclei could have had a separate existence.

A. Hubble's law

The XVI century finally saw what came to be a watershed in the development of Cosmology. In 1543 Copernicus published his treatise "De Revolutionibus Orbium Celestium" (The Revolution of Celestial Spheres) where a new view of the world is presented: the heliocentric model [3].

It is hard to overestimate the importance of this work: it challenged the age-long views of the way the universe worked and the preponderance of the Earth and, by extension, of human beings. The realization that we, our planet, and indeed our solar system (and even our Galaxy) are quite common in the heavens and reproduced by myriads of planetary systems, provided a sobering (though unsettling) view of the universe. All the reassurances of the cosmology of the Middle Ages were gone, and a new view of the world, less secure and comfortable, came into being. Despite these "problems" and the many critics the model attracted, the system was soon accepted by the best minds of the time such as Galileo.

The simplest and most ancient of all astronomical ob-
In this scheme, the difference servations is that the sky grows dark when the Sun goes between radio-loud and radio-quiet AGN depends on down. This fact was first noted by Kepler, who, in the the presence or absence of radio-emitting jets powered XVII century, used it as evidence for a finite universe. In by the central nucleus, which in turn may be speculated the XIX century, when the idea of an unending, unchang- to depend on: (i) black hole rotation; (ii) low power or ing space filled with stars like the Sun was widspread high power, as determined by the mass-accretion rate in consequence of the Copernican revolution, the ques- 2 Mc˙ /LEdd [79]. tion of the dark night sky became a problem. To clearly ascertain this problem, we recall that if absorption is 1.1. , AGN AND 27


neglected, the apparent luminosity of a star of absolute luminosity $L$ at a distance $r$ will be $b = L/4\pi r^2$. If the number density of such stars is a constant $n$, then the number of stars at distances between $r$ and $r + dr$ is $dN = 4\pi n r^2 dr$, so the total radiant energy density due to all stars is

$$ \rho_s = \int b \, dN = \int_0^\infty \left( \frac{L}{4\pi r^2} \right) 4\pi n \, r^2 \, dr = L n \int_0^\infty dr . \tag{140} $$

The integral diverges, leading to an infinite energy density of starlight!

FIG. 9: Unification scheme of AGN. The acronyms for the different sub-classes of AGN are as follows: Fanaroff-Riley radio galaxies (FR I/II), narrow line radio galaxy (NLRG), broad line radio galaxy (BLRG), radio-loud quasar (RLQ), radio quiet quasar (RQQ), flat spectrum radio quasar (FSRQ), and Seyfert galaxies (Sy 1/2) [77].

In order to avoid this paradox, both de Chéseaux (1744) [81] and Olbers (1826) [82] postulated the existence of an interstellar medium that absorbs the light from the very distant stars responsible for the divergence of the integral in (140). However, this resolution of the paradox is unsatisfactory, because in an eternal universe the temperature of the interstellar medium would have to rise until the medium was in thermal equilibrium with the starlight, in which case it would be emitting as much energy as it absorbs, and hence could not reduce the average radiant energy density. The stars themselves are of course opaque, and totally block out the light from sufficiently distant sources, but if this is the resolution of the so-called “Olbers paradox” then every line of sight must terminate


In 1929, Hubble discovered that the spectral lines of galaxies were shifted towards the red by an amount proportional to their distances [83]. If the redshift is due to the Doppler effect, this means that the galaxies move away from each other with velocities proportional to their separations. The importance of this observation is that it is just what we should predict according to the simplest possible picture of the flow of matter in an expanding universe. The redshift parameter is defined as the fractional shift in wavelength of a photon emitted by a distant galaxy at time $t_{\rm em}$ and observed on Earth today,

$$ z = \frac{\lambda_{\rm obs}}{\lambda_{\rm em}} - 1 = \frac{\nu_{\rm em}}{\nu_{\rm obs}} - 1 . \tag{141} $$

Although measuring a galaxy’s redshift is relatively easy, and can be done with high precision, measuring its distance is difficult. Hubble knew $z$ for nearly 50 galaxies, but had estimated distances for only 20 of them. Nevertheless, from a plot of redshift versus distance (reproduced in Fig. 11) he found the famous linear relation now known as Hubble’s law:

$$ z = \frac{H_0}{c} \, r , \tag{142} $$

where $H_0$ is a constant (now called the Hubble constant). Since in the study of Hubble all the redshifts were small, $z < 0.04$, he was able to use the classical non-relativistic relation for small velocities ($v \ll c$). From (48) the Doppler redshift is $z \approx v/c$ and Hubble’s law takes the form

v = H0 r . (143)
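As a rough numerical illustration of (142) and (143), the following minimal Python sketch converts a small redshift into a recession velocity and a distance. The function names are hypothetical, and the value H0 = 70 km/s/Mpc is an assumed, illustrative choice; the relation v = cz only holds for z much smaller than 1.

```python
# Minimal sketch of the low-redshift Hubble law, v = H0 * r with z ~ v/c.
# Assumptions: H0 = 70 km/s/Mpc is an illustrative value, and the
# non-relativistic relation v = c z is only valid for z << 1.

C_KM_S = 299_792.458   # speed of light in km/s
H0_KM_S_MPC = 70.0     # assumed Hubble constant in km/s/Mpc


def recession_velocity(z: float) -> float:
    """Recession velocity in km/s from the non-relativistic relation v = c z."""
    return C_KM_S * z


def distance_from_redshift_mpc(z: float, h0: float = H0_KM_S_MPC) -> float:
    """Distance in Mpc from Hubble's law, r = v / H0 = c z / H0."""
    return recession_velocity(z) / h0


if __name__ == "__main__":
    z = 0.04  # upper end of the redshifts in Hubble's sample
    print(f"v = {recession_velocity(z):.0f} km/s")
    print(f"r = {distance_from_redshift_mpc(z):.0f} Mpc")
```

With these assumed inputs, z = 0.04 corresponds to a recession velocity of roughly 12,000 km/s and a distance of about 171 Mpc.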

Since the Hubble constant $H_0$ can be found by dividing velocity by distance, it is customarily written in the rather baroque units of km s$^{-1}$ Mpc$^{-1}$. From Fig. 11 it follows that $H_0 = 500$ km s$^{-1}$ Mpc$^{-1}$. However, it turned out that Hubble was severely underestimating the distances to galaxies. In Fig. 12 we show a more recent determination of the Hubble constant from nearby galaxies, using HST data [84]. By combining results of different research groups, the present day Hubble expansion rate is $H_0 = 70^{+5}_{-3}$ km s$^{-1}$ Mpc$^{-1}$.

FIG. 10: Mosaic of images of 0827+243 at 22 GHz [80].

at the surface of a star, so the whole sky should have a temperature equal to that at the surface of a typical star.

EXERCISE 6.1 (i) In a forest there are $n$ trees per hectare, evenly spaced. The thickness of each trunk is $D$. What is the mean distance that you have an unobstructed view into the woods, i.e. the mean free path? (ii) How is this related to the Olbers paradox?

EXERCISE 6.2 The Sloan Digital Sky Survey (SDSS) is a survey that mapped positions and distances of a million galaxies using a dedicated 2.5 m telescope in New Mexico [85]. In this exercise, you will use data from this survey to calculate $H_0$. In Fig. 13 we show the spectrum of a star in our galaxy and spectra of four distant galaxies, as measured by the SDSS. For each of the galaxies, we indicate the measured brightness in units of Joules per square meter per second. Assume that each of them has the same luminosity as that of the Milky Way ($L_{\rm MW} = 10^{11} L_\odot$, or $L_{\rm MW} = 4 \times 10^{37}$ J/s). (i) Determine the distance to each of the four galaxies, using the inverse-square law relation between brightness and


luminosity. Express your answers both in meters and in megaparsecs, and give two significant figures. (ii) The spectrum of each of these objects shows a pair of strong absorption lines of calcium, which have rest wavelength $\lambda_0 = 3935$ Å and 3970 Å, respectively. The wavelengths of these lines in the galaxies have been shifted to longer wavelengths (i.e., redshifted), by the expansion of the universe. As a guide, the spectrum of a star like the Sun is shown in the upper panel; the calcium lines are at zero redshift. Measure the redshift of each galaxy. That is, calculate the fractional change in wavelength of the calcium lines. [Hint: The tricky part here is to make sure you are identifying the right lines as calcium. In each case, they are a close pair; for Galaxy #2, they are the prominent absorption dips between 4100 Å and 4200 Å. Measure the redshift for both of the calcium lines in each galaxy (in each case the two lines should give the same redshift, of course!). Give your final redshift to two significant figures. Do the intermediate steps of the calculation without rounding; rounding too early can result in errors.] (iii) Given the redshifts, calculate the velocity of recession for each galaxy, and in each case use the distances to estimate the Hubble constant, in units of kilometers per second per Megaparsec. You will not get identical results from each of the galaxies, due to measurement uncertainties (but they should all be in the same ballpark), so average the results of the four galaxies to get your final answer.

FIG. 11: Hubble’s original plot of the relation between redshift (vertical axis) and distance (horizontal axis). Note that in the vertical axis he actually plots $cz$ rather than $z$, and that the units are accidentally written as km rather than km/s [83].

FIG. 12: A more modern version of Hubble’s plot, showing $cz$ versus distance. In this case, the galaxy distances have been determined using Cepheid variable stars as standard candles [84].

FIG. 13: Spectra measured by the SDSS [86].

Now a point worth noting at this juncture is that galaxies do not follow Hubble’s law exactly. In addition to the expansion of the universe, galaxy motions are affected by the gravity of specific, nearby structures, such as the pull of the Milky Way and Andromeda galaxies on each other. Each galaxy therefore has a peculiar velocity, where peculiar is used in the sense of “individual,” or “specific to itself.” Thus, the recession velocity of a galaxy is really

$$ v = H_0 d + v_{\rm pec} , \tag{144} $$

where $v_{\rm pec}$ is the peculiar velocity of the galaxy along the line of sight. If peculiar velocities could have any value, then this would make Hubble’s law useless. However, peculiar velocities are typically only about 300 km/s, and they very rarely exceed 1000 km/s. Hubble’s law therefore becomes accurate for galaxies that are far away, when $H_0 d$ is much larger than 1000 km/s. Furthermore, we can often estimate what a galaxy’s peculiar velocity will be by looking at the nearby structures that will be pulling on it.

EXERCISE 6.3 Suppose we observe two galaxies, one at a distance of 35 Mly with a radial velocity of 580 km/s, and another at a distance of 1,100 Mly with a radial velocity of 25,400 km/s. (i) Calculate the Hubble constant for each of these two observations. (ii) Which of the two calculations would you consider to be more trustworthy? Why? (iii) Estimate the peculiar velocity of the closer galaxy. (iv) If the more distant galaxy had this same peculiar velocity, how would that change your calculated value of the Hubble constant?

We would expect intuitively that at any given time the universe ought to look the same to observers in all typical galaxies, and in whatever direction they look. (Hereafter we will use the label “typical” to indicate galaxies that do not have any large peculiar motion of their own, but are simply carried along with the general cosmic flow of galaxies.) This hypothesis is so natural (at least since Copernicus) that it has been called the cosmological principle by Milne [87].

As applied to the galaxies themselves, the cosmological principle requires that an observer in a typical galaxy should see all the other galaxies moving with the same pattern of velocities, whatever typical galaxy the observer happens to be riding in. It is a direct mathematical consequence of this principle that the relative speed of any two galaxies must be proportional to the distance between them, just as found by Hubble. To see this consider three typical galaxies at positions $\vec r_1$, $\vec r_2$, and $\vec r_3$. They define the triangle shown in Fig. 14, with sides of length

$$ r_{12} \equiv |\vec r_1 - \vec r_2| , \qquad r_{23} \equiv |\vec r_2 - \vec r_3| , \qquad r_{31} \equiv |\vec r_3 - \vec r_1| . \tag{145} $$

In a homogeneous and uniformly expanding universe the shape of the triangle is preserved as the galaxies move away from each other. Maintaining the correct relative lengths for the sides of the triangle requires an expansion law of the form

$$ r_{12}(t) = a(t) \, r_{12}(t_0) , \qquad r_{23}(t) = a(t) \, r_{23}(t_0) , \qquad r_{31}(t) = a(t) \, r_{31}(t_0) , \tag{146} $$

where $a(t)$ is a scale factor, which is totally independent of location or direction. The scale factor $a(t)$ tells us how the expansion (or possibly contraction) of the universe depends on time. At any time $t$, an observer in galaxy #1 will see the other galaxies receding with a speed

$$ v_{12}(t) = \frac{dr_{12}}{dt} = \dot a \, r_{12}(t_0) = \frac{\dot a}{a} \, r_{12}(t) , \qquad v_{31}(t) = \frac{dr_{31}}{dt} = \dot a \, r_{31}(t_0) = \frac{\dot a}{a} \, r_{31}(t) . \tag{147} $$

You can easily demonstrate that an observer in galaxy #2 or galaxy #3 will find the same linear relation between observed recession speed and distance, with $\dot a/a$ playing the role of the Hubble constant. Since this argument can be applied to any trio of galaxies, it implies that in any universe where the distribution of galaxies is undergoing homogeneous, isotropic expansion, the velocity-distance relation takes the linear form $v = Hr$, with $H = \dot a/a$.

FIG. 14: A triangle defined by three galaxies in a uniformly expanding universe [88].

If galaxies are currently moving away from each other, this implies they were closer together in the past. Consider a pair of galaxies currently separated by a distance $r$, with a velocity $v = H_0 r$ relative to each other. If there are no forces acting to accelerate or decelerate their relative motion, then their velocity is constant, and the time that has elapsed since they were in contact is

$$ t_H = \frac{r}{v} = H_0^{-1} , \tag{148} $$

independent of the current separation $r$ between galaxies. The time $H_0^{-1}$ is generally referred to as the Hubble time. For $H_0 \approx 70$ km s$^{-1}$ Mpc$^{-1}$, the Hubble time is $H_0^{-1} \approx 14$ Gyr. If the relative velocities of galaxies have been constant in the past, then one Hubble time ago, all the galaxies in the universe were crammed together into a small volume.

The observation of galaxy redshifts points naturally to a big bang description for the evolution of the universe. A big bang model could be broadly defined as a model in which the universe expands from an initially highly dense state to its current low-density state. The Hubble time of $\sim 14$ Gyr is comparable to the ages computed for the oldest known stars in the universe. This rough equivalence is reassuring. However, the age of the universe (i.e. the time elapsed since its original highly dense state) is not necessarily exactly equal to $t_H$. On the one hand, if gravity working on matter is the only force at work on large scales, then the attractive force of gravity will act to slow down the expansion. If this were the case, the universe was expanding more rapidly in the past than it is now, and the universe is younger than $H_0^{-1}$. On the other hand, if the energy density of the universe is dominated by a cosmological constant $\Lambda$ (more on this later), then the dominant gravitational force is repulsive, and the universe may be older than $H_0^{-1}$.

The horizon distance is defined as the greatest distance a photon can travel during the age of the universe. The Hubble distance, $R_H = c/H_0 \approx 4.3$ Gpc, provides a natural distance scale. However, just as the age of the universe is roughly equal to $H_0^{-1}$ in most big bang models, with the exact value depending on the expansion

history of the universe, one horizon is roughly equal to $c/H_0$, with the exact value, again, depending on the expansion history.

Before proceeding any further, two qualifications have to be attached to the cosmological principle. First, it is obviously not true on small scales: we are in a Galaxy which belongs to a small group of other galaxies, which in turn lies near the enormous cluster of galaxies in Virgo. In fact, of the 33 galaxies in Messier’s catalogue, almost half are in one small part of the sky, the constellation of Virgo. The cosmological principle, if at all valid, comes into play only when we view the universe on a scale at least as large as the distance between clusters of galaxies, or about 100 million light years. Second, in using the cosmological principle to derive the relation of proportionality between galactic velocities and distances, we suppose the usual rule for adding velocities $v \ll c$. This, of course, was not a problem for Hubble in 1929, as none of the galaxies he studied then had a speed anywhere near the speed of light. Nevertheless, it is important to stress that when one thinks about really large distances characteristic of the universe, as a whole, one must work in a theoretical framework capable of dealing with velocities approaching the speed of light.

Note how Hubble’s law ties in with Olbers’ paradox. If the universe is of finite age, $t_H \sim H_0^{-1}$, then the night sky can be dark, even if the universe is infinitely large, because light from distant galaxies has not yet had time to reach us. Galaxy surveys tell us that the luminosity density of galaxies in the local universe is

$$ n L \approx 2 \times 10^{8} \, L_\odot \, {\rm Mpc}^{-3} . \tag{149} $$

By terrestrial standards, the universe is not a well-lit place; this luminosity density is equivalent to a single 40 watt light bulb within a sphere 1 AU in radius. If the horizon distance is $R_H \approx c/H_0$, then the total flux of light we receive from all the stars from all the galaxies within the horizon will be

$$ F_{\rm gal} \approx \int_0^{R_H} n L \, dr \sim n L \, \frac{c}{H_0} \sim 9 \times 10^{11} \, L_\odot \, {\rm Mpc}^{-2} \sim 2 \times 10^{-11} \, L_\odot \, {\rm AU}^{-2} . \tag{150} $$

By the cosmological principle, this is the total flux of starlight you would expect at any randomly located spot in the universe. Comparing this to the flux we receive from the Sun,

$$ F_\odot = \frac{L_\odot}{4\pi \, {\rm AU}^2} \approx 0.08 \, L_\odot \, {\rm AU}^{-2} , \tag{151} $$

we find that $F_{\rm gal}/F_\odot \sim 3 \times 10^{-10}$. Thus, the total flux of starlight at a randomly selected location in the universe is less than a billionth the flux of light we receive from the Sun here on Earth. For the entire universe to be as well-lit as the Earth, it would have to be over a billion times older than it is; and you would have to keep the stars shining during all that time.

FIG. 15: Spherical region of galaxies with a larger radius than the distance between clusters of galaxies, but smaller radius than any distance characterizing the universe as a whole.

B. Friedmann-Robertson-Walker cosmologies

In 1917 Einstein presented a model of the universe based on his theory of general relativity [89]. It describes a geometrically symmetric (spherical) space with finite volume but no boundary. In accordance with the cosmological principle, the model is homogeneous and isotropic. It is also static: the volume of the space does not change. In order to obtain a static model, Einstein introduced a new repulsive force in his equations. The size of this cosmological term is given by the cosmological constant $\Lambda$. Einstein presented his model before the redshifts of the galaxies were known, and taking the universe to be static was then a reasonable assumption. When the expansion of the universe was discovered, this argument in favor of a cosmological constant vanished. Einstein himself later called it the biggest blunder of his life. Nevertheless, the most recent observations seem to indicate that a non-zero cosmological constant has to be present.

In 1922, Friedmann [90, 91] studied the cosmological solutions of Einstein equations. If $\Lambda = 0$, only evolving, expanding or contracting models of the universe are possible. The general relativistic derivation of the law of expansion for the Friedmann models will not be given here. It is interesting that the existence of three types of models and their law of expansion can be derived from purely Newtonian considerations, with results in complete agreement with the relativistic treatment. Moreover, the essential character of the motion can be obtained from a simple energy argument, which we discuss next.

Consider a spherical region of galaxies of radius $R$. (For the purposes of this calculation we must take $R$ to be larger than the distance between clusters of galaxies, but smaller than any distance characterizing the universe as a whole, as shown in Fig. 15. We also assume $\Lambda = 0$.) The mass of this sphere is its volume times the cosmic mass density,

$$ M = \frac{4 \pi R^3}{3} \, \rho_m . \tag{152} $$

We can now consider the motion of a galaxy of mass $m$ at the edge of the spherical region. According to Hubble’s law, the velocity of the galaxy is $v = HR$, and its corresponding kinetic energy

$$ K = \frac{1}{2} m v^2 = \frac{1}{2} m H^2 R^2 . \tag{153} $$

In a spherical distribution of matter, the gravitational force on a given spherical shell depends only on the mass inside the shell. The potential energy at the edge of the sphere is

$$ U = - \frac{GMm}{R} = - \frac{4 \pi m R^2 \rho_m G}{3} . \tag{154} $$

Hence, the total energy is

$$ E = K + U = \frac{1}{2} m H^2 R^2 - G m \frac{4\pi}{3} R^2 \rho_m , \tag{155} $$

which has to remain constant as the universe expands. Likewise,

$$ \frac{2E}{mR^2} = H^2 - \frac{8\pi}{3} G \rho_m . \tag{156} $$

Since we assume that the universe is homogeneous, $H$ and $\rho_m$ cannot be functions of $R$. Thus, the left-hand-side of (156) cannot depend on the chosen distance $R$ to the coordinate center. However, the value of $2E/(mR^2)$ is time-dependent, because the distance between us and the galaxy will change as the universe expands. Since the mass $m$ of our test galaxy is arbitrary, we can choose it such that $|2E/(mc^2)| = 1$ holds at an arbitrary moment as long as $E \neq 0$. For different times, the left-hand-side scales as $R^{-2}$ and thus we can rewrite (156) as

$$ \left( \frac{\dot a}{a} \right)^2 = \frac{8\pi}{3} G \rho_m - \frac{k c^2}{a^2 R_0^2} . \tag{157} $$

Note that because $E$ is constant, $k$ is constant too. Actually, $k = 0, \pm 1$ is generally known as the curvature constant. Throughout the subscripted “0”s indicate that quantities (which in general evolve with time) are to be evaluated at the present epoch. Finally, we account for the equivalence of mass and energy by including not only the mass but also the energy density, $\rho = \rho_m c^2 + \cdots$, and so (157) becomes

$$ H^2 \equiv \left( \frac{\dot a}{a} \right)^2 = \frac{8\pi}{3} G \frac{\rho}{c^2} - \frac{k c^2}{a^2 R_0^2} , \tag{158} $$

which is Friedmann equation (without cosmological constant) in the Newtonian limit. (158) agrees exactly with the equation derived from general relativity [63]. For $k = 0$, the value of $H$ fixes the so-called critical density as

$$ \rho(k = 0) \equiv \rho_c = \frac{3 H^2 c^2}{8 \pi G} . \tag{159} $$

Since we know the current value of the Hubble parameter to within 10%, we can compute the current value of the critical density to within 20%. We usually hide this uncertainty by introducing $h$,

$$ H_0 = 100 \, h \ {\rm km \ s^{-1} \ Mpc^{-1}} , \tag{160} $$

such that

$$ \begin{aligned} \rho_{c,0} &= 2.77 \times 10^{11} \, h^2 \, M_\odot / {\rm Mpc}^3 \\ &= 1.88 \times 10^{-29} \, h^2 \ {\rm g/cm^3} \\ &= 1.05 \times 10^{-5} \, h^2 \ {\rm GeV/cm^3} . \end{aligned} \tag{161} $$

Note that since $h \approx 0.70^{+0.05}_{-0.03}$ a flat universe requires an energy density of $\sim 10$ protons per cubic meter.

The expansion of the universe can be compared to the motion of a mass launched vertically from the surface of a celestial body. The form of the orbit depends on the initial energy. In order to compute the complete orbit, the mass of the main body and the initial velocity have to be known. In cosmology, the corresponding parameters are the mean density and the Hubble constant. On the one hand, if the density exceeds the critical density, the expansion of any spherical region will turn to a contraction and it will collapse to a point. This corresponds to the closed Friedmann model. On the other hand, if $\rho_m < \rho_c$, the ever-expanding hyperbolic model is obtained. The limiting case, in which the density equals the critical density, corresponds to the flat model. These three models of the universe are called the standard models. They are the simplest relativistic cosmological models for $\Lambda = 0$. Models with $\Lambda \neq 0$ are mathematically more complicated, but show the same behaviour. The simple Newtonian treatment of the expansion problem is possible because Newtonian mechanics is approximately valid in small regions of the universe. However, although the resulting equations are formally similar, the interpretation of the quantities involved is not the same as in the relativistic context. The global geometry of Friedmann models can only be understood within the general theory of relativity [63].

Next, we define the abundance $\Omega_i$ of the different players in cosmology as their energy density relative to $\rho_c$. For example, the dimensionless mass density parameter is found to be

$$ \Omega_m = \frac{\rho_m c^2}{\rho_c} = \frac{8 \pi G}{3 H^2} \, \rho_m . \tag{162} $$

For simplicity, for the moment we will keep considering scenarios with $\Lambda = 0$, but we note in advance that

$$ \Omega_\Lambda = \frac{\Lambda c^2}{3 H^2} . \tag{163} $$

Now, what about our universe? On a large scale what is the overall curvature of the universe? Does it have positive curvature, negative curvature, or is it flat? By solving Einstein equations, Robertson [92, 93] and Walker [94] showed that the three hypersurfaces of constant curvature (the hyper-sphere, the hyper-plane, and the hyper-pseudosphere) are indeed possible geometries for a homogeneous and isotropic universe undergoing expansion. The metric they derived, independently of each other, is called the Friedmann-Robertson-Walker (FRW) metric. The line element is most generally written in the form

$$ ds^2 = c^2 dt^2 - a^2(t) \left[ \frac{d\varrho^2}{1 - k \varrho^2/R^2} + \varrho^2 d\Omega^2 \right] , \tag{164} $$

where $d\Omega^2 = d\theta^2 + \sin^2\theta \, d\phi^2$. It is easily seen that the spatial component of the FRW metric consists of the spatial metric for a uniformly curved space of radius $R$, scaled by the square of the scale factor $a(t)$. If the universe had a positive curvature $k = 1$, then the universe would be closed, or finite in volume. This would not mean that the stars and galaxies extended out to a certain boundary, beyond which there is empty space. There is no boundary or edge in such a universe. If a particle were to move in a straight line in a particular direction, it would eventually return to the starting point, perhaps eons of time later. On the other hand, if the curvature of the space was zero $k = 0$ or negative $k = -1$, the universe would be open. It could just go on forever. Using the substitution

$$ \varrho = S_k(r) = \begin{cases} R \sin(r/R) & {\rm for} \ k = +1 \\ r & {\rm for} \ k = 0 \\ R \sinh(r/R) & {\rm for} \ k = -1 \end{cases} ; \tag{165} $$

the FRW line element can be rewritten as

$$ ds^2 = c^2 dt^2 - a^2(t) \left[ dr^2 + S_k^2(r) \, d\Omega^2 \right] ; \tag{166} $$

see Appendix E for details.

The time variable $t$ in the FRW metric is the cosmological proper time, called the cosmic time for short, and is the time measured by an observer who sees the universe expanding uniformly around him.

There is a caveat to the statement that the expansion of a homogeneous universe is adiabatic: when particles annihilate, such as electrons and positrons, this adds heat and makes the expansion temporarily non-adiabatic. This matters at some specific epochs in the very early universe.

For a sphere of comoving radius $R_0$,

$$ V = \frac{4}{3} \pi R_0^3 \, a^3(t) , \tag{169} $$

and so

$$ \dot V = 4 \pi R_0^3 \, a^2 \dot a = 3 \frac{\dot a}{a} V . \tag{170} $$

Since $U = \rho V$,

$$ \dot U = \dot\rho V + \rho \dot V = V \left( \dot\rho + 3 \frac{\dot a}{a} \rho \right) . \tag{171} $$

Substituting (170) and (171) into (168) we have

$$ V \left( \dot\rho + 3 \frac{\dot a}{a} \rho + 3 \frac{\dot a}{a} P \right) = 0 \tag{172} $$

and thus

$$ \dot\rho = -3 \frac{\dot a}{a} \left( \rho + P \right) . \tag{173} $$

This fluid equation describes the evolution of energy density in an expanding universe. It tells us that the expansion decreases the energy density both by dilution and by the work required to expand a gas with pressure $P \geq 0$.

To solve this equation, we need an additional equation of state relating $P$ and $\rho$. Suppose we write this in the form

$$ P = w \rho . \tag{174} $$

In principle, $w$ could change with time, but we will assume that any time derivatives of $w$ are negligible compared to time derivatives of $\rho$. This is reasonable if the equation of state is determined by “microphysics” that
(170) 0 a where dΩ2 = dθ2+sin2 θdφ2. It is easily seen that the spa- Since U = ρV, tial component of the FRW metric consists of the spatial metric for a uniformly curved space of radius R, scaled  a˙  a t U˙ = ρ˙V + ρV˙ = V ρ˙ + 3 ρ . (171) by the square of the scale factor ( ). If the universe had a a positive curvature k = 1, then the universe would be closed, or finite in volume. This would not mean that the Substituting (170) and (171) into (168) we have stars and galaxies extended out to a certain boundary, be-  a˙ a˙  yond which there is empty space. There is no boundary V ρ˙ + 3 ρ + 3 P = 0 (172) or edge in such a universe. If a particle were to move in a a a straight line in a particular direction, it would eventually and thus return to the starting point – perhaps eons of time later. On the other hand, if the curvature of the space was zero  a˙ ρ˙ = 3 ρ + P . (173) k = 0 or negative k = 1, the universe would be open. It − a could just go on forever.− Using the substitution This fluid equation describes the evolution of energy den- sity in an expanding universe. It tells us that the ex-  R sin(r/R) for k = +1 pansion decreases the energy density both by dilution  % = S (r) =  r for k = 0 ; (165) and by the work required to expand a gas with pressure k   R sinh(r/R) for k = 1 P 0. − ≥To solve this equation, we need an additional equation the FRW line element can be rewritten as of state relating P and ρ. Suppose we write this in the form h i ds2 = c2dt2 a2(t) dr2 + S2(r) dΩ2 ; (166) − k P = wρ . (174) see Appendix E for details. In principle, w could change with time, but we will as- The time variable t in the FRW metric is the cosmolog- sume that any time derivatives of w are negligible com- ical proper time, called the cosmic time for short, and is pared to time derivatives of ρ. This is reasonable if the the time measured by an observer who sees the universe equation of state is determined by “microphysics” that expanding uniformly around him. 
The spatial variables is not directly tied to the expansion of the universe. The (%, θ, φ) or (r, θ, φ) are called the comoving coordinates fluid equation then implies of a point in space. If the expansion of the universe is perfectly homogeneous and isotropic, the comoving ρ˙ a˙ = 3(1 + w) , (175) coordinates of any point remain constant with time. ρ − a Todescribe the time evolution of the scale factor a(t) we need an additional equation describing how the energy with solution content of the universe ρ is affected by expansion. The ρ  a  3(1+w) first law of thermodynamics, = − . (176) ρ0 a0 dU = TdS PdV, (167) − The pressure in a gas is determined by the thermal mo- with dQ = 0 (no heat exchange to the outside, since no tion of its constituents. For non-relativistic matter (a.k.a. outside exists) becomes cosmological dust), P mv2 v dU dV w dU = PdV + P = 0 . (168) = 2 2 1 , (177) − ⇒ dt dt ρ ∼ mc ∼ c  31 where v is the thermal velocity of particles with mass and substitutte from the fluid equation m. To a near-perfect approximation w = 0, implying 3 a ρm a− . Light, or more generally any highly relativistic ρ˙ = 3(ρ + P) (183) particle,∝ has an associated pressure (radiation pressure). a˙ − Pressure is defined as the momentum transfer onto a to obtain the acceleration equation perfectly reflecting wall per unit time and per unit area. Consider an isotropic distribution of photons (or another a¨ 4πG kind of particle) moving with the speed of light. The mo- = (ρ + 3P) . (184) a − 3c2 mentum of a photon is given in terms of its energy as p = E/c = hν/c. Consider now an area element dA of We see that if ρ and P are positive, the expansion of the the wall; the momentum transferred to it per unit time universe decelerates. Higher P produces stronger decel- is given by the momentum transfer per photon, times eration for given ρ, e.g., a radiation-dominated universe the number of photons hitting the area dA per unit time. 
decelerates faster than a matter-dominated universe. We will assume for the moment that all photons have In the remainder of this section, we consider a flat uni- the same frequency. If θ denotes the direction of a pho- verse, i.e., k = 0. It is easily seen that for non-relativistic ton relative to the normal of the wall, the momentum matter, the solution to Friedmann equation (158) is given component perpendicular to the wall before scattering by is p = p cos θ, and after scattering p = p cos θ; the ⊥ ⊥ −  2/3 2 two other momentum components are unchanged by t ρ0 ρ0t a t t 0 the reflection. Thus, the momentum transfer per pho- ( ) = and ρ( ) = 3 = 2 , (185) t0 a t ton scattering is ∆p = 2p cos θ. The number of photons scattering per unit time within the area dA is given by with the number density of photons, n times the area element 2 1 dA, times the thickness of the layer from which photons t0 = , (186) arrive at the wall per unit time. The latter is given by 3 H0 c cos θ, since only the perpendicular velocity component where we have used (159). Following the same steps for a brings them closer to the wall. Putting these terms to- bizarre universe, which is dominated today by radiation gether, we find for the momentum transfer to the wall pressure, yields the solution per unit time per unit area the expression  1/2 2 2 t ρ0 ρ0t P(θ) = 2hν n cos θ . (178) a t t 0 ( ) = and ρ( ) = 4 = 2 . (187) t0 a t Averaging this expression over a half-sphere (only pho- tons moving towards the wall can hit it) then yields From this simple exercise we can picture the the time evolution of the universe as follows. In the early 1 1 universe all matter is relativistic and radiation pressure P = hνn = ρ . (179) dominates: a(t) t1/2, ρ t 2, and ρ a 3 t 3/2. 3 3 rad − m − − The density of radiation∝ then∝ falls more∝ quickly∝ than Then for radition, w = 1/3, implying ρ a 4. This be- that of dust. 
On the other hand, when dust dominates: rad − 2/3 2 4 8/3 ∝ a(t) t , ρm t− , and ρ a− t , hence dust havior also follows from a simple argument: the number ∝ ∝ rad ∝ ∝ 3 domination increases. density of photons falls as n a− , and the energy per 1 ∝ photon falls as hν a− because of cosmological redshift (more on this below).∝ EXERCISE 6.4 Using the Hubble flow v = H0r show Next, we obtain an expression for the acceleration of that the expansion of the universe changes the particle number density according to n˙ = 3H n. the universe. If we multiply our standard version of the − 0 Friedmann equation by a2, we get In closing, we discuss how to measure distances in the 8πG kc2 FRW spacetime. Consider a galaxy which is far away a˙2 = ρa2 . (180) from us, sufficiently far away that we may ignore the 3c2 − R2 0 small scale perturbations of spacetime and adopt the FRW line element. In an expanding universe, the dis- Take the time derivative of (180) tance between two objects is increasing with time. Thus, 8πG   if we want to assign a spatial distance between two ob- 2a˙a¨ = ρ˙a2 + 2ρaa˙ . (181) jects, we must specify the time t at which the distance 3c2 is the correct one. Suppose that you are at the origin, divide by 2aa˙ and that the galaxy which you are observing is at a co- moving coordinate position (r, θ, φ). We define a proper a¨ 4πG  a  distance, as the distance between two events A and B in = ρ˙ + 2ρ , (182) a 3c2 a˙ a reference frame for which they occur simultaneously 32

($t_A = t_B$). In other words, the proper distance $d_p(t)$ between two points in spacetime is equal to the length of the spatial geodesic between them when the scale factor is fixed at the value $a(t)$. The proper distance between the observer and galaxy can be found using the FRW metric at a fixed time $t$,

$$ds^2 = a^2(t) \left[ dr^2 + S_k(r)^2\, d\Omega^2 \right] . \tag{188}$$

Along the spatial geodesic between the observer and galaxy, the angle $(\theta, \phi)$ is constant, and thus

$$ds = a(t)\, dr \,. \tag{189}$$

Likewise, using spatial variables $(\varrho, \theta, \phi)$ we have

$$ds = a(t) \left[ 1 - k (\varrho/R)^2 \right]^{-1/2} d\varrho \,. \tag{190}$$

The proper distance $d_p$ is found by integrating over the radial comoving coordinate $r$,

$$d_p = a(t) \int_0^r dr = a(t)\, r \,, \tag{191}$$

or using (165)

$$d_p = a(t) \begin{cases} R \sin^{-1}(\varrho/R) & {\rm for}\ k = +1 \\ \varrho & {\rm for}\ k = 0 \\ R \sinh^{-1}(\varrho/R) & {\rm for}\ k = -1 \end{cases} . \tag{192}$$

In a flat universe, the proper distance to an object is just its coordinate distance, $d_p(t) = a(t) \varrho$. Because $\sin^{-1}(x) > x$ and $\sinh^{-1}(x) < x$, in a closed universe ($k > 0$) the proper distance to an object is greater than its coordinate distance, while in an open universe ($k < 0$) the proper distance to an object is less than its coordinate distance.

EXERCISE 6.5 A civilization that wants to conquer the universe, which is homogeneous and isotropic, and hence is described by the FRW metric, is getting ready to send out soldiers in all directions to invade all the universe out to a proper distance $d_p$. Every soldier leaves the galaxy where the civilization was born, and travels through the universe with its spaceship along a geodesic, out to a distance $d_p$ from the original galaxy. At the end of the invasion, which occurs at a fixed time $t$, all the soldiers stand on a spherical surface at a proper distance $d_p$ from their original galaxy. The total volume that has been invaded is the volume inside this spherical surface. What is the total volume invaded? Answer this question for the following three cases: (i) A flat metric ($k = 0$). (ii) A closed metric ($k = +1$) with radius of curvature $R$ at the cosmic time $t$ when the invaded volume and the proper distance $d_p$ are measured. (iii) An open metric ($k = -1$) with radius of curvature $R$ at the cosmic time $t$ when the invaded volume and the proper distance $d_p$ are measured.

EXERCISE 6.6 Consider a positively curved universe ($k = 1$), in which the sole contribution to the energy density comes from non-relativistic matter. In this case the energy density has the dependence $\rho_m = \rho_{m,0}/a^3$. (i) Write down the Friedmann equation for this universe and show that the parametric solution,

$$a(\theta) = \frac{4\pi G \rho_{m,0} R_0^2}{3 c^4}\, (1 - \cos\theta) \,, \qquad t(\theta) = \frac{4\pi G \rho_{m,0} R_0^3}{3 c^5}\, (\theta - \sin\theta) \,, \tag{193}$$

satisfies the Friedmann equation. Here $\theta$ is a dimensionless parameter that runs from 0 to $2\pi$, and $R_0$ is the present radius of curvature if we have normalized the scale factor at present to $a(t_0) = 1$. (ii) What is $a_{\rm max}$, the maximum possible scale factor for this universe? (iii) What is the maximum value that the physical radius of curvature ($a R_0$) reaches? (iv) What is the age of the universe when this maximum radius is reached? (v) What is $t_{\rm crunch}$, the time at which the universe undergoes a big crunch (that is, a recollapse to $a = 0$)? [Hint: Recall that $\dot a = da/dt = (da/d\theta)(d\theta/dt)$.]

EXERCISE 6.7 Consider a negatively curved universe ($k = -1$), in which the sole contribution to the energy density comes from non-relativistic matter, and so the energy density has the dependence $\rho_m = \rho_{m,0}/a^3$. (i) Write down the Friedmann equation for this universe and show that the parametric solution,

$$a(\theta) = \frac{4\pi G \rho_{m,0} R_0^2}{3 c^4}\, (\cosh\theta - 1) \,, \qquad t(\theta) = \frac{4\pi G \rho_{m,0} R_0^3}{3 c^5}\, (\sinh\theta - \theta) \,, \tag{194}$$

satisfies the Friedmann equation. (ii) Compare the time dependence of the scale factor for open, closed and critical matter-dominated cosmological models in a log-log plot.

C. Age and size of the Universe

In special (and general) relativity the propagation of light is along a null geodesic ($ds = 0$). If we place the observer at the origin ($\varrho = 0$), and we choose a radial null geodesic ($d\theta = d\phi = 0$), we have

$$\frac{c\, dt}{a(t)} = \pm \frac{d\varrho}{[1 - k (\varrho/R)^2]^{1/2}} \,, \tag{195}$$

where $+$ is for the emitted light ray and $-$ is for a received one. Imagine now that one crest of the light wave was emitted at time $t_{\rm em}$ at distance $\varrho_{\rm em}$, and received at the origin $\varrho_0 = 0$ at $t_0$, and that the next wave crest was

emitted at $t_{\rm em} + \Delta t_{\rm em}$ and received at $t_0 + \Delta t_0$; see Fig. 16. The two waves satisfy the relations:

$$\int_{t_{\rm em}}^{t_0} \frac{dt}{a(t)} = -\frac{1}{c} \int_{\varrho_{\rm em}}^{\varrho_0} \frac{d\varrho}{\sqrt{1 - k (\varrho/R)^2}} \tag{196}$$

and

$$\int_{t_{\rm em} + \Delta t_{\rm em}}^{t_0 + \Delta t_0} \frac{dt}{a(t)} = -\frac{1}{c} \int_{\varrho_{\rm em}}^{\varrho_0} \frac{d\varrho}{\sqrt{1 - k (\varrho/R)^2}} \,. \tag{197}$$

Now, subtract (196) from (197),

$$\int_{t_{\rm em} + \Delta t_{\rm em}}^{t_0 + \Delta t_0} \frac{dt}{a(t)} - \int_{t_{\rm em}}^{t_0} \frac{dt}{a(t)} = 0 \,, \tag{198}$$

and expand

$$\int_{t_{\rm em} + \Delta t_{\rm em}}^{t_0 + \Delta t_0} \frac{dt}{a(t)} = \int_{t_{\rm em}}^{t_0} \frac{dt}{a(t)} + \int_{t_0}^{t_0 + \Delta t_0} \frac{dt}{a(t)} - \int_{t_{\rm em}}^{t_{\rm em} + \Delta t_{\rm em}} \frac{dt}{a(t)} \tag{199}$$

to obtain

$$\int_{t_0}^{t_0 + \Delta t_0} \frac{dt}{a(t)} = \int_{t_{\rm em}}^{t_{\rm em} + \Delta t_{\rm em}} \frac{dt}{a(t)} \,. \tag{200}$$

Any change in $a(t)$ during the time intervals between successive wave crests can be safely neglected, so that $a(t)$ is a constant with respect to the time integration. Consequently,

$$\frac{\Delta t_{\rm em}}{a(t_{\rm em})} = \frac{\Delta t_0}{a(t_0)} \,, \tag{201}$$

or equivalently

$$\frac{\Delta t_{\rm em}}{\Delta t_0} = \frac{a(t_{\rm em})}{a(t_0)} \,. \tag{202}$$

The time interval between successive wave crests is the inverse of the frequency of the light wave, related to its wavelength by the relation $c = \lambda \nu$. Hence, from (141) the redshift is

$$z = \frac{\lambda_0}{\lambda_{\rm em}} - 1 = \frac{a_0}{a(t_{\rm em})} - 1 \,; \tag{203}$$

i.e., the redshift of a galaxy expresses how much the scale factor has changed since the light was emitted.

FIG. 16: Cosmological redshift.

The light detected today was emitted at some time $t_{\rm em}$ and, according to (203), there is a one-to-one correspondence between $z$ and $t_{\rm em}$. Therefore, the redshift $z$ can be used instead of time $t$ to parametrize the history of the universe. A given $z$ corresponds to a time when our universe was $1 + z$ times smaller than now.

Generally, the expressions for $a(t)$ are rather complicated and one cannot directly invert (203) to express the cosmic time $t \equiv t_{\rm em}$ in terms of the redshift parameter $z$. It is useful, therefore, to derive a general integral expression for $t(z)$. Differentiating (203) we obtain

$$dz = -\frac{a_0}{a^2(t)}\, \dot a(t)\, dt = -(1 + z)\, H(t)\, dt \,, \tag{204}$$

from which it follows that

$$t = \int_z^\infty \frac{dz}{H(z)\, (1 + z)} \,. \tag{205}$$

A constant of integration has been chosen here so that $z \to \infty$ corresponds to the initial moment $t = 0$.

To obtain the expression for the Hubble parameter $H$ in terms of $z$ and the present values of $H_0$ and $\Omega_{m,0}$, it is convenient to write the Friedmann equation (158) in the form

$$H^2(z) + \frac{k c^2}{a_0^2 R_0^2}\, (1 + z)^2 = \Omega_{m,0}\, H_0^2\, \frac{\rho_m(z)}{\rho_{m,0}} \,, \tag{206}$$

where the definitions in (162) and (203) have been used. At $z = 0$, this equation reduces to

$$\frac{k c^2}{a_0^2 R_0^2} = (\Omega_{m,0} - 1)\, H_0^2 \,, \tag{207}$$

allowing us to express the current value of $a_0 R_0$ in a spatially curved universe ($k \neq 0$) in terms of $H_0$ and $\Omega_{m,0}$. Taking this into account, we obtain

$$H(z) = H_0 \sqrt{(1 - \Omega_{m,0})(1 + z)^2 + \Omega_{m,0}\, \rho_m(z)/\rho_{m,0}} = H_0 \sqrt{(1 - \Omega_{m,0})(1 + z)^2 + \Omega_{m,0} (1 + z)^3} \,. \tag{208}$$

We can now complete our program by finding an expression for the comoving radial distance coordinate $r$ as

a function of the redshift $z$. Since photons travel on null geodesics of zero proper time, we see directly from the metric (166) that

$$r = -\int \frac{c\, dt}{a(t)} = -c \int \frac{dt}{dz}\, (1 + z)\, dz = c \int \frac{dz}{H(z)} \,, \tag{209}$$

with $H(z)$ given by (208).

As the universe expands and ages, an observer at any point is able to see increasingly distant objects as the light from them has time to arrive; see Fig. 17. This means that, as time progresses, increasingly larger regions of the universe come into causal contact with the observer. The proper distance to the furthest observable point (the particle horizon) at time $t$ is the "horizon distance", $d_h(t)$.

FIG. 17: Cosmological horizon.

Again we return to the FRW metric, placing an observer at the origin ($\varrho = 0$) and letting the particle horizon for this observer at time $t$ be located at radial coordinate distance $\varrho_h$. This means that a photon emitted at $t = 0$ at $\varrho_h$ will reach the observer at the origin at time $t$. Recalling that photons move along null geodesics ($ds = 0$) and considering only radially traveling photons ($d\theta = d\phi = 0$), we find

$$\int_0^t \frac{dt'}{a(t')} = \frac{1}{c} \int_0^{\varrho_h} \frac{d\varrho}{[1 - k (\varrho/R)^2]^{1/2}} \,, \tag{210}$$

yielding

$$\varrho_h = \begin{cases} R \sin\left[ \frac{c}{R} \int_0^t dt'/a(t') \right] & {\rm for}\ k = +1 \\ c \int_0^t dt'/a(t') & {\rm for}\ k = 0 \\ R \sinh\left[ \frac{c}{R} \int_0^t dt'/a(t') \right] & {\rm for}\ k = -1 \end{cases} . \tag{211}$$

If the scale factor evolves with time as $a(t) = t^\alpha$, with $\alpha \geq 1$, we can see that the time integral in (211) diverges as we approach $t = 0$. This would imply that the whole universe is in causal contact. However, $\alpha = 1/2$ and $2/3$ in the radiation and matter-dominated eras, so there is a horizon.

The proper distance from the origin to $\varrho_h$ is given by

$$d_h(t) = a(t) \int_0^{\varrho_h} \frac{d\varrho}{[1 - k (\varrho/R)^2]^{1/2}} = a(t) \int_0^t \frac{c\, dt'}{a(t')} \,. \tag{212}$$

For $k = 0$, using (185) and (187) we obtain $d_h(t) = 2ct$ in the radiation-dominated era, and $d_h(t) = 3ct$ in the matter-dominated era. Now, substituting (186) into (185) we have

$$a(t) = \left( \frac{3}{2} H_0 t \right)^{2/3} \tag{213}$$

and so from (203) it follows that

$$t = \frac{2}{3}\, \frac{1}{H_0 (1 + z)^{3/2}} \,. \tag{214}$$

For the matter-dominated era, the proper horizon distance is

$$d_h = \frac{2c}{H_0 (1 + z)^{3/2}} \,. \tag{215}$$

For a flat universe with $\Omega_{m,0} = 1$, we find that at present time,

$$d_{h,0} = 2c/H_0 = 1.85 \times 10^{28}\, h^{-1}\ {\rm cm} = 6\, h^{-1}\ {\rm Gpc} \,. \tag{216}$$

Note that because $a_0 = 1$, we have $\varrho_{h,0} = d_{h,0}$.

EXERCISE 6.8 Consider a flat model containing only matter, with $\Omega_{m,0} = 1$, and present Hubble constant $H_0$. (i) What is the comoving distance to the horizon ($z = \infty$)? (ii) What is the redshift at which the comoving distance is half that to the horizon? (iii) What is the ratio of the age of the universe at that redshift to its present age? (iv) At which redshift did the universe have half its present age?

In closing, we show that Hubble's law is indeed an approximation for small redshift by using a Taylor expansion of $a(t)$,

$$a(t) = a(t_0) + (t - t_0)\, \dot a(t_0) + \frac{1}{2} (t - t_0)^2\, \ddot a(t_0) + \cdots = a(t_0) \left[ 1 + (t - t_0) H_0 - \frac{1}{2} (t - t_0)^2 q_0 H_0^2 + \cdots \right] ,$$

where $q_0 \equiv -\ddot a(t_0)\, a(t_0)/\dot a^2(t_0)$ is the deceleration parameter (it is named "deceleration" because historically, an accelerating universe was considered unlikely). If the expansion is slowing down, $\ddot a < 0$ and $q_0 > 0$. For not too large time differences, we can use the Taylor expansion of $a(t)$ and write

$$1 - z \approx \frac{1}{1 + z} = \frac{a(t)}{a(t_0)} \approx 1 + (t - t_0) H_0 \,. \tag{217}$$

Hence Hubble's law, $z = (t_0 - t) H_0 = (d/c) H_0$, is valid as long as $z \approx H_0 (t_0 - t) \ll 1$. Deviations from its linear form arise for $z \gtrsim 1$ and can be used to determine $q_0$.
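As a numerical cross-check of the relations above, the following minimal sketch evaluates Eq. (208) for $H(z)$, integrates Eq. (205) for the age $t(z)$ and Eq. (209) for the comoving distance, and compares the flat, matter-only case with the closed forms (214) and (215). The function names, the midpoint-rule integrator, the step count, and the choice of units ($H_0 = c = 1$) are illustrative assumptions, not part of the lectures.

```python
import math


def hubble(z, H0=1.0, omega_m=1.0):
    """H(z) from Eq. (208): matter plus curvature, with Lambda = 0."""
    return H0 * math.sqrt((1.0 - omega_m) * (1.0 + z) ** 2
                          + omega_m * (1.0 + z) ** 3)


def age(z, H0=1.0, omega_m=1.0, steps=20000):
    """Cosmic time t(z) from Eq. (205).

    The substitution u = 1/(1+z) maps the integral over [z, infinity)
    onto u in (0, 1/(1+z)], where dz/(1+z) = -du/u. A midpoint rule
    avoids evaluating the integrand at u = 0.
    """
    a = 1.0 / (1.0 + z)
    du = a / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * du
        total += du / (u * hubble(1.0 / u - 1.0, H0, omega_m))
    return total


def comoving_distance(z, H0=1.0, omega_m=1.0, c=1.0, steps=20000):
    """Comoving radial distance from Eq. (209), midpoint rule in z."""
    dz = z / steps
    return c * sum(dz / hubble((i + 0.5) * dz, H0, omega_m)
                   for i in range(steps))


def horizon_distance_matter(z, H0=1.0, c=1.0):
    """Closed-form proper horizon distance of Eq. (215), flat dust universe."""
    return 2.0 * c / (H0 * (1.0 + z) ** 1.5)


if __name__ == "__main__":
    for z in (0.0, 1.0, 3.0):
        exact_age = 2.0 / (3.0 * (1.0 + z) ** 1.5)  # Eq. (214) with H0 = 1
        print(z, age(z), exact_age, horizon_distance_matter(z))
```

For $\Omega_{m,0} = 1$ the numerical age should reproduce Eq. (214), and three times the age (times $c$) should reproduce Eq. (215); passing a different `omega_m` gives the curved, $\Lambda = 0$ models of Eq. (208).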

FIG. 18: Extended object of given transverse size $\ell$ at comoving distance $\varrho_1$ from the observer [95].

D. Angular diameter and luminosity distances

The angular diameter distance to an object is defined in terms of the object's actual physical size, $\ell$, and the angular size, $\Delta\theta$, of the object as viewed from Earth. Consider a light source of size $\ell$ at $\varrho = \varrho_1$ and $t = t_1$ subtending an angle $\Delta\theta$ at the origin ($\varrho = 0$, $t = t_0$) as shown in Fig. 18. The proper distance $\ell$ between the two ends of the object is related to $\Delta\theta$ by

$$\Delta\theta = \frac{\ell}{a(t_1)\, \varrho_1} \,. \tag{218}$$

We now define the angular diameter distance

$$d_A = \frac{\ell}{\Delta\theta} \tag{219}$$

so that

$$d_A = a(t_1)\, \varrho_1 = \frac{\varrho_1}{1 + z} \,. \tag{220}$$

In analogy with (210) we write

$$\int_{t_1}^{t_0} \frac{dt}{a(t)} = \frac{1}{c} \int_0^{\varrho_1} \frac{d\varrho}{[1 - k (\varrho/R)^2]^{1/2}} \,. \tag{221}$$

From an examination point of view, only proficiency in the $k = 0$ case will be expected. Hence,

$$\varrho_1 = c \int_{t_1}^{t_0} \frac{dt}{a(t)} = c \int_0^z \frac{dz}{H(z)} \,, \tag{222}$$

where in the last equality we used (209). Then, for a flat universe filled with dust, the angular diameter as a function of $z$ is

$$\Delta\theta(z) = \frac{\ell H_0}{2c}\, \frac{(1 + z)^{3/2}}{(1 + z)^{1/2} - 1} \,. \tag{223}$$

At low redshifts ($z \ll 1$), the angular diameter decreases in inverse proportion to $z$, reaches a minimum at $z = 5/4$, and then scales as $z$ for $z \gg 1$; see Fig. 19.

FIG. 19: For a flat universe filled with dust $d_A(z)$ has a maximum at $z = 5/4$, corresponding to the redshift at which objects of a given proper size $\ell$ will subtend the minimum angle $\Delta\theta$ on the sky. At redshifts $z > 5/4$ objects of a given proper size $\ell$ will appear bigger on the sky with increasing $z$ [95].

Perhaps the most important relation for observational cosmology is that between the monochromatic flux density and luminosity. Start by assuming isotropic emission, so that the photons emitted by the source pass with a uniform flux density through any sphere surrounding the source. We can now make a shift of the origin, and consider the FRW metric as being centred on the source. However, because of homogeneity, the comoving distance between the source and the observer, $\varrho_1$, is the same as we would calculate when we place the origin at our location. The photons from the source are therefore passing through a sphere, on which we sit, of proper surface area $4\pi a_0^2 \varrho_1^2$. However, the redshift still affects the flux density in four further ways: (i) photon energies are redshifted, reducing the flux density by a factor $1 + z$; (ii) photon arrival rates are time dilated, reducing the flux density by a further factor $1 + z$; (iii) opposing this, the bandwidth $d\nu$ is reduced by a factor $1 + z$, which increases the energy flux per unit bandwidth by one power of $1 + z$; (iv) finally, the observed photons at frequency $\nu_0$ were emitted at frequency $(1 + z)\nu_0$. Overall, the flux density is the luminosity at frequency $(1 + z)\nu_0$, divided by the total area, divided by $(1 + z)$:

$$\mathcal{F}_\nu(\nu_0) = \frac{L_\nu([1 + z]\nu_0)}{4\pi a_0^2 \varrho_1^2\, (1 + z)} = \frac{L_\nu(\nu_0)}{4\pi a_0^2 \varrho_1^2\, (1 + z)^{1 + \alpha}} \,, \tag{224}$$

where the second expression assumes a power-law spectrum $L \propto \nu^{-\alpha}$. We can integrate over $\nu_0$ to obtain the corresponding total or bolometric formula

$$\mathcal{F} = \frac{L}{4\pi a_0^2 \varrho_1^2\, (1 + z)^2} \,. \tag{225}$$

The luminosity distance $d_L$ is defined to satisfy the relation (36). Thus,

$$d_L = (1 + z)\, \varrho_1 = (1 + z)^2\, d_A \,, \tag{226}$$

where we have taken $a_0 = 1$.
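The turnover of the angular diameter distance at $z = 5/4$ can be checked numerically. The sketch below assumes the flat, dust-only model ($\Omega_{m,0} = 1$, $a_0 = 1$), for which Eq. (222) integrates to $\varrho_1 = (2c/H_0)[1 - (1+z)^{-1/2}]$, and then applies Eqs. (220) and (226). The function names, the grid search, and the units $H_0 = c = 1$ are illustrative choices rather than anything prescribed in the text.

```python
import math


def comoving_distance_dust(z, H0=1.0, c=1.0):
    """rho_1 from Eq. (222) for a flat dust universe, H(z) = H0 (1+z)^(3/2)."""
    return (2.0 * c / H0) * (1.0 - 1.0 / math.sqrt(1.0 + z))


def angular_diameter_distance(z, H0=1.0, c=1.0):
    """d_A = rho_1 / (1 + z), Eq. (220)."""
    return comoving_distance_dust(z, H0, c) / (1.0 + z)


def luminosity_distance(z, H0=1.0, c=1.0):
    """d_L = (1 + z) rho_1 = (1 + z)^2 d_A, Eq. (226)."""
    return (1.0 + z) * comoving_distance_dust(z, H0, c)


def angular_size(ell, z, H0=1.0, c=1.0):
    """Delta theta = ell / d_A, Eq. (219); equivalent to Eq. (223)."""
    return ell / angular_diameter_distance(z, H0, c)


def redshift_of_max_dA(z_lo=0.0, z_hi=10.0, n=100000):
    """Brute-force grid search for the redshift that maximizes d_A."""
    grid = [z_lo + (z_hi - z_lo) * i / n for i in range(n + 1)]
    return max(grid, key=angular_diameter_distance)


if __name__ == "__main__":
    print("d_A peaks near z =", redshift_of_max_dA())
    for z in (0.1, 1.25, 5.0):
        print(z, angular_diameter_distance(z), luminosity_distance(z))
```

Because $d_A$ peaks while $d_L$ keeps growing as $(1+z)^2 d_A$, distant objects of fixed size stop shrinking on the sky even though they continue to dim.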

26 VII. THE FORCE AWAKENS Figure 3. Observed magnitude 0.0001 Supernova Cosmology versus redshift is plotted for Project 12,13 well-measuresd distant and 24 High-Z Supernova (in the inset) nearby7 type Ia su- 0.001 Search pernovae. For clarity, measure- 22 pty Hamuy et al. Independent cosmological observations have un- Em ments at the same redshift are 0.01 0 combined. At redshifts beyond 20 masked the presence of some unknown formz of= 0.1 energy (distances greater than 9 0.2 0.4 0.6 1 rc about 10 light-years), the cos- 0.1 18 density, related to otherwise empty space, whichmological predictions ap- (indi-

with vacuum energy Mass density

cated by the curves) begin to BRIGHTNESS RELATIVE 16 pears to dominate the recent gravitational dynamicsdiverge, depending of on the as- 1 sumed cosmic densities of 14 the universe and yields a stage of cosmic acceleration.mass and vacuum energy. The 0.01 0.02 0.04 0.1 without vacuum energy red curves represent models We still have no solid clues as to the ofwith such zero vacuum and VED MAGNITUDE 22 Accelerating energy (or perhaps more accurately dark pressure).mass densities The ranging from the universe critical density rc down to zero (an empty ). The best fit OBSER 21 Decelerating cosmological constant is the simplest possible(blue form line) assumes of a mass universe

density of about rc /3 plus a dark energy because it is constant in both spacevacuum energy and density twice 20 that large—implying an accel- 0.2 0.4 0.6 1.0 time, and provides a good fit to the experimentalerating data cosmic as expansion. REDSHIFT z of today. In this section we will discuss the many obser- 0.8 0.7 0.6 0.5 vations that probes the dark energy and we will describe LINEAR SCALE OF THE UNIVERSE RELATIVE TO TODAY the generalities of the concordance model of cosmology lowed up. This approach also made it possible to use the By the end of the year, the error bars began to tighten, with Λ , 0. Hubble Space Telescope forFIG. follow-up 20: light-curve Observed observa- magnitudeas both groups (and now submitted relative papers brightness) with a few more versus su- tions, because we could specifyredshift in advance is the plotted one-square- for well-measuredpernovae, showing evidence distant for much [97, less 98] than and the(in ex- the degree patch of sky in which our wide-field imager would pected slowing of the cosmic expansion.9–11 This was be- find its catch of supernovae.inset) Such specificity nearby is a [99, require- 100]ginning SNe Ia. to be For a problem clarity, for measurements the simplest inflationary at the ment for advance scheduling of the HST. By now, the models with a universe dominated by its mass content. A. Supernova Cosmology Berkeley team, had grown sameto include redshift some dozen arecollabo- combined.Finally, Atat the redshifts beginning of beyond1998, the twoz groups= 0. 1pre- (dis- rators around the world, and was called Supernova Cos- sented the results shown in figure 3.12,13 mology Project (SCP). tances greaterthe than about 109 ly), the cosmological predictions (indicated by the curves)What’s begin wrong with to diverge, faint supernovae? 
depending on the assumed cosmic densities of mass and vacuum energy. The red curves represent models with zero vacuum energy and mass densities ranging from $\rho_c$ down to zero (an empty cosmos). The best fit (blue line) assumes a mass density of about $\rho_c/3$ plus a vacuum energy density twice that large, implying an accelerating cosmic expansion [101, 102].

The expansion history of the cosmos can be determined using as a "standard candle" any distinguishable class of astronomical objects of known intrinsic brightness that can be identified over a wide distance range. As the light from such beacons travels to Earth through an expanding universe, the cosmic expansion stretches not only the distances between galaxy clusters, but also the very wavelengths of the photons en route. The recorded redshift and brightness of each of these candles thus provide a measurement of the total integrated expansion of the universe since the time the light was emitted. A collection of such measurements, over a sufficient range of distances, would yield an entire historical record of the universe's expansion.

Type Ia supernovae (SNe Ia) are the best cosmological yardsticks on the market. They are precise distance indicators because they have a uniform intrinsic brightness due to the similarity of the triggering white dwarf mass (i.e., $M_{\rm Ch} \approx 1.4\, M_\odot$) and consequently the amount of nuclear fuel available to burn. This makes SNe Ia the best (or at least most practical) example of "standardizable candles" in the distant universe.

Before proceeding, we pause to present some notation. The apparent magnitude ($m$) of a celestial object is a number that is a measure of its apparent brightness as seen by an observer on Earth. The smaller the number, the brighter a star appears. The scale used to indicate magnitude originates in the Hellenistic practice of dividing stars visible to the naked eye into six magnitudes. The brightest stars in the night sky were said to be of first magnitude ($m = 1$), whereas the faintest were of sixth magnitude ($m = 6$), which is the limit of human visual perception (without the aid of a telescope). In 1856, Pogson formalized the system by defining a first magnitude star as a star that is 100 times as bright as a sixth-magnitude star, thereby establishing a logarithmic scale still in use today [96]. This implies that a star of magnitude $m$ is $100^{1/5} \simeq 2.512$ times as bright as a star of magnitude $m + 1$. The apparent magnitude, $m$, in the band, $x$, is defined as

$$ m_x - m_{x,0} = -2.5 \log_{10} \left( \mathcal{F}_x / \mathcal{F}_{x,0} \right) , \tag{227} $$

where $\mathcal{F}_x$ is the observed flux in the band $x$, whereas $m_{x,0}$ and $\mathcal{F}_{x,0}$ are a reference magnitude and a reference flux in the same band $x$, respectively. A difference in magnitudes, $\Delta m = m_1 - m_2$, can then be converted to a relative brightness as $I_2/I_1 \approx 2.512^{\Delta m}$.

In Fig. 20 we show the observed magnitude (and relative brightness) versus redshift for well-measured distant and (in the inset) nearby SNe Ia. The faintness (or distance) of the high-redshift supernovae in Fig. 20 comes as a dramatic surprise. In the (simplest) standard cosmological models described in Sec. VI B, the expansion history of the cosmos is determined entirely by its mass density. The greater the density, the more the expansion is slowed by gravity. Thus, in the past, a high-mass-density universe would have been expanding much faster than it does today. So one should not have to look far back in time to especially distant (faint) supernovae to find a given integrated expansion (redshift). Conversely, in a low-mass-density universe one would have to look farther back. But there is a limit to how low the mean mass density could be. After all, we are here, and the stars and galaxies are here. All that mass surely puts a lower limit on how far (that is, to what level of faintness) we must look to find a given redshift. However, the high-redshift supernovae in Fig. 20 are fainter than would be expected even for an empty cosmos.
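As a quick numerical illustration of the magnitude scale and of Eq. (227), the following minimal sketch converts a magnitude difference into a brightness ratio and a flux ratio into a magnitude difference. The function names and the sample values are illustrative choices, not part of the notes.

```python
import math

# Brightness ratio for a one-magnitude step (Pogson's definition), about 2.512.
POGSON_RATIO = 100 ** (1 / 5)


def brightness_ratio(delta_m: float) -> float:
    """Return I2/I1 for a magnitude difference delta_m = m1 - m2."""
    return POGSON_RATIO ** delta_m


def magnitude_difference(flux: float, flux_ref: float) -> float:
    """Return m - m_ref following Eq. (227): -2.5 log10(F / F_ref)."""
    return -2.5 * math.log10(flux / flux_ref)


if __name__ == "__main__":
    # A first-magnitude star versus a sixth-magnitude star (delta_m = 5).
    print(brightness_ratio(5.0))
    # A source with 1/100 of the reference flux.
    print(magnitude_difference(1.0, 100.0))
```

A five-magnitude difference recovers the factor of 100 in brightness that defines the scale, and a source with one hundredth of the reference flux is five magnitudes fainter.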

FIG. 21: The history of cosmic expansion, as measured by the high-redshift supernovae (the black data points), assuming flat cosmic geometry. The scale factor $a$ of the universe is taken to be 1 at present, so it equals $1/(1+z)$. The curves in the blue shaded region represent cosmological models in which the accelerating effect of vacuum energy eventually overcomes the decelerating effect of the mass density. These curves assume vacuum energy densities ranging from $0.95\, \rho_c$ (top curve) down to $0.4\, \rho_c$. In the yellow shaded region, the curves represent models in which the cosmic expansion is always decelerating due to high mass density. They assume mass densities ranging (left to right) from $0.8\, \rho_c$ up to $1.4\, \rho_c$. In fact, for the last two curves, the expansion eventually halts and reverses into a cosmic collapse [101].

If these data are correct, the obvious implication is that the three simplest models of cosmology introduced in Sec. VI B must be too simple. The next to simplest model includes an expansionary term in the equation of motion driven by the cosmological constant $\Lambda$, which competes against gravitational collapse. The best fit to the 1998 supernova data shown in Figs. 20 and 21 implies that, in the present epoch, the vacuum energy density $\rho_\Lambda$ is larger than the energy density attributable to mass $\rho_m$. Therefore, the cosmic expansion is now accelerating.

To accommodate SNe Ia data we must add an additional term into the Friedmann equation (158),

$$ H^2 = \frac{8\pi G}{3} \frac{\rho}{c^2} - \frac{k c^2}{a^2 R_0^2} + \frac{\Lambda c^2}{3} \, . \tag{228} $$

The $\Lambda$ term also modifies the acceleration equation (184), which becomes

$$ \frac{\ddot a}{a} = \frac{\Lambda c^2}{3} - \frac{4 \pi G}{3 c^2} (\rho + 3P) \, , \tag{229} $$

and $H(z)$ in (208) is now given by

$$ H(z) = H_0 \left[ \Omega_{m,0} (1+z)^3 + \Omega_{{\rm rad},0} (1+z)^4 + \Omega_\Lambda + (1 - \Omega_0)(1+z)^2 \right]^{1/2} , \tag{230} $$

where

$$ \Omega = \Omega_m + \Omega_{\rm rad} + \Omega_\Lambda \, , \tag{231} $$

and $\Omega_{\rm rad}$ is the density fraction of relativistic matter (radiation). We might note in passing that the quantity $k c^2/(a^2 R_0^2 H_0^2)$ is sometimes referred to as $\Omega_k$. This usage is unfortunate, because it encourages us to think of curvature as a contribution to the energy density of the universe, which is incorrect.

EXERCISE 7.1 Imagine a class of astronomical objects that are both standard candles and standard yardsticks. In other words, we know both their luminosities $L$ and their physical sizes $\ell$. Recall that the apparent brightness $I$ of an object is its flux on Earth divided by its angular area, or solid angle on the sky, i.e. $I = \mathcal{F}/\theta^2$, where $\theta$ is the angular size. How does the apparent brightness depend on redshift for a general cosmological model, for these objects with fixed $L$ and $\ell$?

B. Cosmic Microwave Background

The cosmic microwave background (CMB) radiation was discovered in 1964 by Penzias and Wilson, using an antenna built for satellite communication [103]. The radiation was acting as a source of excess noise (or "static") in the radio receiver. Eventually, it became obvious that the source of noise was actually a signal that was coming from outside the Galaxy. Precise measurements were made at wavelength $\lambda = 7.35$ cm. The intensity of this radiation was found not to vary by day or night or time of the year, nor to depend on the direction to a precision of better than 1%. Almost immediately after its detection it was concluded that this radiation comes from the universe as a whole: a blackbody emission of hot, dense gas (temperature $T \sim 3000$ K, peak wavelength $\lambda_{\rm max} \sim 1000$ nm) redshifted by a factor of 1000 to $\lambda_{\rm max} \sim 1$ mm and $T \sim 3$ K [104]. A compilation of experimental measurements in the range $0.03~{\rm cm} \lesssim \lambda \lesssim 75~{\rm cm}$ revealed an accurate blackbody spectrum, see Fig. 22. Actually, according to the FIRAS (Far InfraRed Absolute Spectrometer) instrument aboard the COBE (Cosmic Background Explorer) satellite, which measured a temperature of $T_0 = 2.726 \pm 0.010$ K, the CMB is the most perfect blackbody ever seen [106].

The CMB photons we see today interacted with matter for the last time some 380 kyr after the Big Bang. Photon decoupling occurs when the temperature has dropped to a point where there are no longer enough high energy photons to keep hydrogen ionized: $\gamma + {\rm H} \not\to e^- + p$. This era is known as recombination, even though the atomic constituents had never been combined prior. The ionization potential of hydrogen is 13.6 eV (i.e., $T \sim 10^5$ K), but recombination occurs at $T_{\rm rec} \sim 3000$ K. This is because the low baryon to photon ratio, $\eta \approx 5 \times 10^{-10}$, allows the high energy tail of the Planck distribution to keep the comparatively small number of hydrogen atoms ionized until this much lower temperature.

EXERCISE 7.2 (i) For blackbody radiation, the energy

density per unit frequency is given by

$$ u_\nu \, d\nu = \frac{8 \pi h \nu^3 \, d\nu}{c^3 \left[ \exp(h\nu/kT) - 1 \right]} \, . \tag{232} $$

Since the energy of one photon is $h\nu$, the number density of photons is given by the same expression above divided by $h\nu$. Calculate the present density of photons in the universe, knowing that the CMB temperature is $T_0 \simeq 2.726$ K. [Hint: you will find it useful to know that $\int_0^\infty x^2 \, dx/(e^x - 1) \simeq 2.404$.] (ii) If deuterium measurements require a baryon to photon ratio of $\eta = 5.5 \times 10^{-10}$, what must the current density of baryons be? (iii) Assuming that the Hubble constant is $H_0 = 70~{\rm km\, s^{-1}\, Mpc^{-1}}$, calculate what $\Omega_b$ is.

FIG. 22: The CMB blackbody spectrum as confirmed by measurements over a broad range of wavelengths [105].

FIG. 23: The CMB over the entire sky, color-coded to represent differences in temperature from the average 2.726 K: the color scale ranges from $+300~\mu$K (red) to $-300~\mu$K (dark blue), representing slightly hotter and colder spots (and also variations in density). Results are from the WMAP satellite [107] and the Planck mission [108].

Before the recombination epoch the universe was an opaque "fog" of free electrons and became transparent to photons afterwards. Therefore, when we look at the sky in any direction, we can expect to see photons that originated in the "last-scattering surface." This hypothesis has been tested very precisely by the observed distribution of the CMB; see Fig. 23. The large photon-to-nucleon ratio implies that it is very unlikely for the CMB to be produced in astrophysical processes such as the absorption and re-emission of starlight by cold dust, or the absorption or emission by plasmas. Before the recombination epoch, Compton scattering tightly coupled photons to electrons, which in turn coupled to protons via electromagnetic interactions. As a consequence, photons and nucleons in the early universe behaved as a single "photon-nucleon fluid" in a gravitational potential well created by primeval variations in the density of matter. Outward pressure from photons, acting against the inward force of gravity, set up acoustic oscillations that propagated through the photon-nucleon fluid, exactly like sound waves in air. The frequencies of these oscillations are now seen imprinted on the CMB temperature fluctuations. Gravity caused the primordial density perturbations across the universe to grow with time. The temperature anisotropies in the CMB are interpreted as a snapshot of the early stages of this growth, which eventually resulted in the formation of galaxies [109, 110].

The full sky CMB temperature anisotropy map, as measured by the Wilkinson Microwave Anisotropy Probe (WMAP) [107] and the Planck mission [108], is shown in Fig. 23. It is convenient to expand the difference $\Delta T(\hat n)$ between the CMB temperature observed in a direction given by the unit vector $\hat n = (\theta, \phi)$ and the present mean value $T_0$ of the temperature in spherical harmonics

$$ \Delta T(\hat n) \equiv T(\hat n) - T_0 = \sum_{l=0}^{\infty} \sum_{|m| \leq l} a_{lm} Y_{lm}(\hat n) \, , \tag{233} $$

where

$$ T_0 = \frac{1}{4\pi} \int d^2 \hat n \ T(\hat n) \, , \tag{234} $$

$$ a_{lm} = \int \Delta T(\hat n) \, Y_{lm}(\hat n) \, d\Omega \, , \tag{235} $$

and where $\Omega$ denotes the solid angle parametrized by the pair $(\theta, \phi)$. The set $\{Y_{lm}\}$ is complete and orthonormal, obeying

$$ \int d\Omega \ Y_{l_1 m_1}(\Omega) \, Y_{l_2 m_2}(\Omega) = \delta_{l_1 l_2} \, \delta_{m_1 m_2} \, . \tag{236} $$

Since $\Delta T(\hat n)$ is real, we are interested in the real-valued, orthonormal $Y_{lm}$'s, defined by

$$ Y_{lm}(\theta, \phi) = N(l,m) \begin{cases} P^l_m(x) \, \sqrt{2} \cos(m\phi) & m > 0 \\ P_l(x) & m = 0 \\ P^l_m(x) \, \sqrt{2} \sin(m\phi) & m < 0 \end{cases} \, , \tag{237} $$

where

$$ N(l,m) = \sqrt{\frac{(2l+1)\,(l-m)!}{4\pi \, (l+m)!}} \tag{238} $$

is a normalization factor,

$$ P^l_m(x) = \frac{(1-x^2)^{m/2}}{2^l \, l!} \frac{d^{m+l}}{dx^{m+l}} (x^2 - 1)^l \tag{239} $$

is the associated Legendre polynomial, $P_l = P^l_{m=0}$ is the Legendre polynomial, and $x \equiv \cos\theta$; for further details see e.g. [111].

The lowest multipole is the $l = 0$ monopole, equal to the average full-sky flux and fixed by the normalization (234). The higher multipoles ($l \geq 1$) and their amplitudes $a_{lm}$ correspond to anisotropies. A nonzero $m$ corresponds to $2|m|$ longitudinal "slices" ($|m|$ nodal meridians). There are $l + 1 - |m|$ latitudinal "zones" ($l - |m|$ nodal latitudes). In Fig. 24 we show the partitioning described by some low multipole moments.

EXERCISE 7.3 At every point in the sky, one observes a blackbody spectrum, with temperature $T(\theta)$. The largest anisotropy is in the $l = 1$ (dipole) first spherical harmonic, with amplitude $3.355 \pm 0.008$ mK [113]. The dipole is interpreted to be the result of the Doppler shift caused by the solar system motion relative to the nearly isotropic blackbody field, as broadly confirmed by measurements of the radial velocities of local galaxies. Show that the motion of an observer with velocity $\beta = v/c$ relative to an isotropic Planckian radiation field of temperature $T_0$ produces a Doppler-shifted temperature pattern

$$ T(\theta) \approx T_0 \left[ 1 + \beta \cos\theta + \frac{\beta^2}{2} \cos(2\theta) + \mathcal{O}(\beta^3) \right] . \tag{240} $$

It is easily seen that the $a_{lm}$ coefficients are frame-dependent. Note that a simple rotation in the $\phi$ coordinate will change the $\sin\phi$, $\cos\phi$ part of the spherical harmonic for $m \neq 0$ and a rotation in the $\theta$ coordinate will change the associated Legendre polynomial part for $l \neq 0$. So only the $l = m = 0$ monopole coefficient is coordinate independent. To combat this problem, we use the power spectrum defined by

$$ C_l \equiv \frac{1}{2l+1} \sum_{m=-l}^{l} a_{lm}^2 \, . \tag{241} $$

A brief $C_l$ initiation is provided in Fig. 26.

EXERCISE 7.4 Show that the power spectrum $C_l$ is invariant under rotations.

To get a rough understanding of the power spectrum we can divide up the multipole representation into super-horizon and sub-horizon regions as shown in Fig. 26. The angular scale corresponding to the particle horizon size is the boundary between super- and sub-horizon scales. The size of a causally connected region on the surface of last scattering is important because it determines the size over which astrophysical processes can occur. Normal physical processes can act coherently only over sizes smaller than the particle horizon. The relative size of peaks and locations of the power spectrum gives information about cosmological parameters [114]. In Fig. 25 we show the influence of several cosmological parameters on the power spectrum. For historical reasons, the quantity usually used in the multipole representation is

$$ \Delta T \equiv \left[ \frac{l(l+1)}{2\pi} \, C_l \right]^{1/2} . \tag{242} $$

As an illustration, we sketch how to use the power spectrum to determine the curvature of space. At recombination the universe is already matter-dominated, so we can substitute $z_{ls} \simeq 1100$ into (215) to give an estimate of the horizon distance at the CMB epoch

$$ d_{h,ls} = \frac{2c}{H_0 \, (1+z_{ls})^{3/2}} \approx 0.23~{\rm Mpc} \, . \tag{243} $$

This is the linear diameter of the largest causally connected region observed for the CMB, $\ell_{ls}$. Therefore, substituting (243) into (223) we find today's angular diameter of this region in the sky

$$ \theta = \frac{1}{(1+z_{ls})^{1/2} - 1} \approx 0.03 \approx 1.8^\circ \, . \tag{244} $$

The reason for this "causality problem" is that the universe expands slower than light travels. Namely, as we have seen, when the age of the universe increases the part observable to us increases linearly, $\propto ct$, while the scale factor increases only with $t^{2/3}$ (or $t^{1/2}$). Thus we see more and more regions that were never in causal contact for a radiation or matter-dominated universe. We note that the sound horizon has approximately the same angular size, because $v_s \sim c/\sqrt{3}$. The sound horizon serves as a ruler at fixed redshift $z_{ls}$ to measure the geometry of spacetime. Moreover, the fluid of photons and nucleons performs acoustic oscillations with its fundamental frequency connected to the sound horizon plus higher harmonics. The relative size of peaks and locations then gives information about cosmological parameters. The first panel of Fig. 25 shows that, for a flat universe ($\Omega_{\rm tot} \approx 1$), the first peak sits at $\theta \approx 1^\circ$ as we have found in our simple estimate (244). In Fig. 27 we display a compilation of measurements of the CMB angular power spectrum. The data agree with high significance with models when they input dark energy as providing $\approx 70\%$ of the energy in the universe, and when the total energy density $\rho$ equals the critical density. The data also indicate that the amount of normal baryonic matter in the universe $\Omega_b$ is only 4% of the critical density. What is the other 96%?
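The horizon estimates (243) and (244) are easy to check numerically. The sketch below is only an illustration: it assumes $H_0 = 70~{\rm km\, s^{-1}\, Mpc^{-1}}$ and $z_{ls} = 1100$ (the benchmark values used elsewhere in these notes), and the function names are hypothetical helpers introduced here.

```python
import math

C_KM_S = 299_792.458  # speed of light in km/s
H0 = 70.0             # assumed Hubble constant in km/s/Mpc
Z_LS = 1100.0         # assumed redshift of last scattering


def horizon_distance_mpc(z: float, h0: float = H0) -> float:
    """Eq. (243): d_h = 2c / (H0 (1+z)^{3/2}), matter-dominated universe."""
    return 2.0 * C_KM_S / (h0 * (1.0 + z) ** 1.5)


def horizon_angle_rad(z: float) -> float:
    """Eq. (244): theta = 1 / ((1+z)^{1/2} - 1), in radians."""
    return 1.0 / (math.sqrt(1.0 + z) - 1.0)


if __name__ == "__main__":
    d_h = horizon_distance_mpc(Z_LS)
    theta = horizon_angle_rad(Z_LS)
    print(f"d_h   = {d_h:.3f} Mpc")
    print(f"theta = {theta:.4f} rad = {math.degrees(theta):.2f} deg")
```

With these inputs the horizon distance comes out close to 0.23 Mpc and the angle close to 0.03 rad, i.e. a little under $2^\circ$, in line with the rounded values quoted in (243) and (244).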

FIG. 24: Nodal lines separating excess and deficit regions of sky for various $(l, m)$ pairs. The top row shows the (0, 0) monopole, and the partition of the sky into two dipoles, (1, 0) and (1, 1). The middle row shows the quadrupoles (2, 0), (2, 1), and (2, 2). The bottom row shows the $l = 3$ partitions, (3, 0), (3, 1), (3, 2), and (3, 3) [112].

FIG. 25: Influence of several cosmological parameters on the angular power spectrum of the CMB [16]. Panels show $\Delta T$ ($\mu$K) versus multipole $l$ for variations in (a) curvature ($\Omega_{\rm tot}$), (b) dark energy ($\Omega_\Lambda$), (c) baryons ($\Omega_b h^2$), and (d) matter ($\Omega_m h^2$).

FIG. 26: Left panel. Illustrative sky maps and their angular power spectra. If a full-sky CMB map has only a dipole (top), its power spectrum is a delta function at $l = 1$. If a map has only temperature fluctuations on an angular scale of $\sim 7^\circ$ (middle) then all of the power is at $l \sim 10$. If all the hot and cold spots are even smaller (bottom) then the power is at high $l$. Right panel. Simplified CMB power spectrum. The CMB power spectrum can be crudely divided into three regions. The Sachs-Wolfe plateau is caused by the scale independence of gravitational potential fluctuations, which dominate the spectrum at large super-horizon scales. The horizon is the angular scale corresponding to $c\, t_{ls}$. The Doppler peaks on scales slightly smaller than the horizon are due to resonant acoustic oscillations. At smaller scales there is nothing because the finite thickness of the surface of last scattering averages small scale fluctuations along the line of sight. Diffusion damping (photons diffusing out of small scale fluctuations) also suppresses power on these scales [114].

There is strong astrophysical evidence for a significant amount of nonluminous matter in the universe referred to as cold dark matter (CDM). For example, observations of the rotation of galaxies suggest that they rotate as if they had considerably more mass than we can see [115–117]. Similarly, observations of the motions of galaxies within clusters also suggest they have considerably more mass than can be seen [118]. The most compelling evidence for CDM is that observed at the Bullet Cluster [119]. In Fig. 28 we show a composite image of the Bullet Cluster (1E 0657-558) that shows the X-ray light detected by Chandra in purple, (an image from Magellan and the Hubble space telescope of) the optical light in white and orange, and the CDM map (drawn up using data on gravitational lensing from Magellan and European Southern Observatory telescopes at Paranal) in blue. Galaxy clusters contain not only the galaxies ($\sim 2\%$ of the mass), but also intergalactic plasma ($\sim 10\%$ of the mass), and (assuming the null hypothesis) CDM ($\sim 88\%$ of the mass). Over time, the gravitational attraction of all these parts naturally pushes them to be spatially coincident. If two galaxy clusters were to collide or merge, we would observe each part of the cluster behaving differently. Galaxies will behave as collisionless particles but the plasma will experience ram pressure. Throughout the collision of two clusters, the galaxies will then become separated from the plasma. This is seen clearly in the Bullet Cluster, which is undergoing a high-velocity (around 4500 km/s) merger, evident from the spatial distribution of the hot, X-ray emitting gas. The galaxies of both concentrations are spatially separated from the (purple) X-ray emitting plasma. The CDM clump (blue), revealed by the weak-lensing map, is coincident with the collisionless galaxies, but lies ahead of the collisional gas. As the two clusters cross, the intergalactic plasma in each cluster interacts with the plasma in the other cluster and slows down. However, the CDM in each cluster does not interact at all, passing right through without disruption. This difference in interaction causes the CDM to sail ahead of the hot plasma, separating each cluster into two components: CDM (and collisionless galaxies) in the lead and the hot interstellar plasma lagging behind.

What might this nonluminous matter in the universe be? We do not know yet. It cannot be made of ordinary (baryonic) matter, so it must consist of some other sort of elementary particle [120].

EXERCISE 7.5 We will examine galaxy rotation curves and show that they imply the existence of dark matter. (i) Recall that the orbital period $T$ is given by $T^2 = 4\pi^2 a^3/GM$. Write down an expression that relates

the orbital period and the orbital velocity for a circular orbit, and then write down an expression that relates the orbital velocity with the mass enclosed within $R$. (ii) The Sun is 8 kpc from the center of the Milky Way, and its orbital velocity is 220 km/s. Use your expression from (i) to determine roughly how much mass is contained in a sphere around the center of the Milky Way with a radius equal to 8 kpc. (iii) Assume that the Milky Way is made up of only luminous matter (stars) and that the Sun is at the edge of the galaxy (not quite true, but close). What would you predict the orbital velocity to be for a star 30 kpc from the center? And for 100 kpc? (iv) Observations show that galaxy rotation curves are flat: stars move at the same orbital velocity no matter how far they are from the center. How much mass is actually contained within a sphere of radius 30 kpc? 100 kpc? Take the orbital velocity at these radii to be the same as the orbital velocity of the Sun. (v) What do you conclude from all of this about the contents of our galaxy?

FIG. 27: A compilation of measurements of the CMB angular power spectrum, $\Delta T$ ($\mu$K) versus multipole index $l$ (with the corresponding angular scale in degrees), spanning the region $2 \lesssim l \lesssim 1500$. The best fit of the ΛCDM model is also shown.

C. ΛCDM

The concordance model of cosmology predicts the evolution of a spatially flat expanding Universe filled with dark energy, dark matter, baryons, photons, and three flavors of left-handed (that is, one helicity state $\nu_L$) neutrinos (along with their right-handed antineutrinos $\bar\nu_R$). The best fit to the most recent data from the Planck satellite yields the following parameters: $\Omega_{m,0} = 0.308 \pm 0.013$, $\Omega_{b,0} h^2 = 0.02234 \pm 0.00023$, $\Omega_{{\rm CDM},0} h^2 = 0.1189 \pm 0.0022$, $h = 0.678 \pm 0.009$, and $1 - \Omega_0 < 0.005$ [121]. Note, however, that the data only

measure accurately the acoustic scale, and the relation to underlying expansion parameters (e.g., via the angular-diameter distance) depends on the assumed cosmology, including the shape of the primordial fluctuation spectrum. Even small changes in model assumptions can change $h$ noticeably. Unexpectedly, the $H_0$ inference from Planck data deviates by more than $2\sigma$ from the previous result from the maser-cepheid-supernovae distance ladder, $h = 0.738 \pm 0.024$ [122]. In what follows we will take as benchmark: $\Omega_\Lambda \simeq 0.7$, $\Omega_{m,0} \simeq 0.3$, $1 - \Omega_0 \simeq 0$, and $h \simeq 0.7$. As shown in Fig. 29, this set of parameters is in good agreement with cosmological and astrophysical observations.

FIG. 28: The Bullet Cluster.

EXERCISE 7.6 The Sun is moving around the center of the Milky Way galaxy along a roughly circular orbit at radius $R = 8$ kpc, with a velocity $v = 220~{\rm km\,s^{-1}}$. Let us approximate the mass distribution and gravitational potential of the galaxy as spherically symmetric. (i) What is the total mass inside $R$? (ii) If the mass density varies with radius as $\rho \propto r^{-2}$, then what is the density at radius $R$? Express it in units of proton masses per cm$^3$ (the proton mass is $m_p = 1.672 \times 10^{-24}$ g). (iii) For the benchmark cosmological model, with $\Omega_\Lambda \simeq 0.7$ and $H_0 \simeq 70~{\rm km\,s^{-1}\,Mpc^{-1}}$, what is the density of the cosmological constant (or dark energy accounting for it)? Express it in the same units as in the previous question. (iv) The dark energy is spread out uniformly in the universe and it causes a gravitational repulsion which accelerates the expansion of the universe. Do you think the cosmological constant may be strongly affecting the dynamics of stars in our galaxy?

EXERCISE 7.7 Submillimeter Galaxies (SMGs) are extremely dusty starburst galaxies that were discovered at high redshifts ($z \sim 1$ to 3). Assume the dust emission from SMGs is well characterized by blackbodies at a single dust temperature. If the observed spectrum of an SMG peaks at 180 µm, what would be its dust temperature if it is at a redshift of $z = 2$?

We now consider the benchmark model containing as its only two components pressure-less matter and a cosmological constant, $\Omega_{m,0} + \Omega_\Lambda = 1$. Hence, the curvature term in the Friedmann equation and the pressure term in the acceleration equation play no role. Multiplying the acceleration equation (229) by 2 and adding it to the Friedmann equation (228), we eliminate $\rho_m$,

$$2\,\frac{\ddot a}{a} + \left(\frac{\dot a}{a}\right)^2 = \Lambda c^2 . \tag{245}$$

Next, we rewrite first the left-hand-side and then the

right-hand-side as total time derivatives. Using

$$\frac{d}{dt}\left(a \dot a^2\right) = \dot a^3 + 2 a \dot a \ddot a = a^2 \dot a \left[\left(\frac{\dot a}{a}\right)^2 + 2\,\frac{\ddot a}{a}\right] , \tag{246}$$

it follows that

$$\frac{d}{dt}\left(a \dot a^2\right) = a^2 \dot a\, \Lambda c^2 = \frac{\Lambda c^2}{3}\, \frac{d}{dt}\left(a^3\right) . \tag{247}$$

Integration is now trivial,

$$a \dot a^2 = \frac{\Lambda c^2}{3}\, a^3 + C . \tag{248}$$

The integration constant, $C = 8\pi G \rho_{m,0}/3$, can be determined most easily by setting $a(t_0) = 1$ and comparing (248) to the Friedmann equation (228), with $t = t_0$. Now, we introduce the new variable $x = a^{3/2}$ such that

$$\frac{da}{dt} = \frac{dx}{dt}\,\frac{da}{dx} = \frac{dx}{dt}\,\frac{2 x^{-1/3}}{3} , \tag{249}$$

and (248) becomes

$$\dot x^2 - \frac{3}{4}\,\Lambda c^2 x^2 - \frac{9}{4}\, C = 0 . \tag{250}$$

Using an educated guess,

$$x(t) = A \sinh\left(\sqrt{3\Lambda}\, c\, t/2\right) , \tag{251}$$

we fix $A = \sqrt{3C/\Lambda}/c$. The scale factor is then

$$a(t) = A^{2/3} \sinh^{2/3}\left(\sqrt{3\Lambda c^2}\, t/2\right) . \tag{252}$$

The time-scale of expansion is driven by $t_\Lambda = 2/\sqrt{3\Lambda c^2}$. The present age of the universe $t_0$ follows from the normalization condition $a(t_0) = 1$ and is given by

$$t_0 = t_\Lambda \tanh^{-1}\left(\sqrt{\Omega_\Lambda}\right) . \tag{253}$$

The deceleration parameter,

$$q = -\frac{\ddot a}{a H^2} , \tag{254}$$

is a key parameter for observational tests of the ΛCDM model. We calculate first the Hubble parameter

$$H(t) = \frac{\dot a}{a} = \frac{2}{3 t_\Lambda} \coth(t/t_\Lambda) , \tag{255}$$

and after that

$$q(t) = \frac{1}{2}\left[1 - 3 \tanh^2(t/t_\Lambda)\right] . \tag{256}$$

Note that, as expected, for $t \to 0$ we have $q = 1/2$, and for $t \to \infty$ we have $q = -1$. Perhaps more interesting is the transition region from a decelerating to an accelerating universe. As shown in Fig. 30 for $\Omega_\Lambda = 0.7$, this transition takes place at $t_* \approx 0.55\, t_0$. This can be easily converted to a redshift: $z_* = a(t_0)/a(t_*) - 1 \approx 0.7$. Interestingly, $z_*$ can be directly probed by SNe Ia observations.

FIG. 29: Shown are three independent measurements of the cosmological parameters $(\Omega_\Lambda, \Omega_m)$. The high-redshift supernovae [123], cluster abundance [124] and the CMB [125, 126] converge nicely near $\Omega_\Lambda = 0.7$ and $\Omega_m = 0.3$, as shown by the 68.3%, 95.4%, and 99.7% confidence regions. The upper-left shaded region, labeled “no big bang,” indicates bouncing cosmologies for which the universe has a turning point in its past [127]. The lower right shaded region corresponds to a universe which is younger than long-lived radioactive isotopes [128], for any value of $H_0 \geq 50~{\rm km\,s^{-1}\,Mpc^{-1}}$. Also shown is the expected confidence region allowed by the future SuperNova / Acceleration Probe (SNAP) mission [129]. [Axes: mass density versus vacuum energy density (cosmological constant).]

EXERCISE 7.8 Consider the benchmark model, with $\Omega_{m,0} \simeq 0.3$, $\Omega_\Lambda \simeq 0.7$, with flat space geometry. What was the redshift at which the universe had half its present age?

VIII. HOT THERMAL UNIVERSE

Though we can see only as far as the surface of last scattering, in recent decades a convincing theory of the origin and evolution of the early universe has been developed. Most of this theory is based on recent theoretical and experimental advances in elementary particle physics.
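Before moving on, the closed-form results (253) and (256) of the benchmark model can be cross-checked numerically. The Python sketch below is only an illustration and not part of the original derivation: the function names are ours, we use $\Lambda c^2 = 3\Omega_\Lambda H_0^2$ (which follows from (248) at $t = t_0$ for a flat universe, so that $t_\Lambda = 2/(3H_0\sqrt{\Omega_\Lambda})$), and we assume the conversion $1/H_0 \simeq 9.778\,h^{-1}$ Gyr.

```python
import math


def tau_today(omega_lambda):
    """Present age in units of t_Lambda, i.e. t0/t_Lambda from Eq. (253)."""
    return math.atanh(math.sqrt(omega_lambda))


def deceleration(tau):
    """Deceleration parameter q of Eq. (256) as a function of tau = t/t_Lambda."""
    return 0.5 * (1.0 - 3.0 * math.tanh(tau) ** 2)


def transition(omega_lambda):
    """Return (t*/t0, z*) for the epoch at which q changes sign."""
    tau0 = tau_today(omega_lambda)
    # q = 0 in Eq. (256) is equivalent to tanh^2(t*/t_Lambda) = 1/3.
    tau_star = math.atanh(1.0 / math.sqrt(3.0))
    # Eq. (252): a(t) is proportional to sinh^(2/3)(t/t_Lambda).
    z_star = (math.sinh(tau0) / math.sinh(tau_star)) ** (2.0 / 3.0) - 1.0
    return tau_star / tau0, z_star


def age_gyr(omega_lambda, h):
    """Present age t0 in Gyr, assuming 1/H0 = 9.778/h Gyr (our conversion)."""
    hubble_time = 9.778 / h
    t_lambda = 2.0 * hubble_time / (3.0 * math.sqrt(omega_lambda))
    return t_lambda * tau_today(omega_lambda)


if __name__ == "__main__":
    ratio, z_star = transition(0.7)
    print(f"t*/t0 = {ratio:.3f}, z* = {z_star:.3f}, t0 = {age_gyr(0.7, 0.7):.2f} Gyr")
```

For $\Omega_\Lambda = 0.7$ and $h = 0.7$ this gives $t_*/t_0 \approx 0.54$, $z_* \approx 0.67$ and $t_0 \approx 13.5$ Gyr, in line with the rounded values quoted in the text.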

FIG. 30: The deceleration parameter $q$ as a function of $t/t_0$ for the ΛCDM model for various values of $\Omega_\Lambda = 0.1, 0.3, 0.5, 0.7, 0.9$ from top to bottom [16]. [Axes: $q$ versus $t/t_0$.]

Hence, before continuing our look back through time, we make a detour to overview the generalities of the standard model of particle physics.²

² You can find a more extensive but still qualitative discussion in [13]. For a more rigorous treatment see e.g. [130–133].

A. $SU(3)_C \otimes SU(2)_L \otimes U(1)_Y$

The standard model (SM) is our most modern attempt to answer two simple questions that have been perplexing (wo)mankind throughout the epochs: What is the Universe made of? Why is our world the way it is?

The elementary-particle model accepted today views quarks and leptons as the basic (pointlike) constituents of ordinary matter. By pointlike, we understand that quarks and leptons show no evidence of internal structure at the current limit of our resolution. Presently, the world’s largest microscope is the Large Hadron Collider (or LHC), a machine that collides beams of protons at a center-of-mass energy $\sqrt{s} = 13$ TeV. Remarkably, 70% of the energy carried into the collision by the protons emerges perpendicular to the incident beams. At a given transverse energy $E_\perp$, we may roughly estimate the LHC resolution as

$$\ell_{\rm LHC} \approx \hbar c/E_\perp \approx 2 \times 10^{-19}~{\rm TeV\,m}/E_\perp \approx 2 \times 10^{-20}~{\rm m} . \tag{257}$$

There are six quarks and six leptons, together with their antiparticles. These twelve elementary particles are all spin-$\frac{1}{2}$ and fall naturally into three families or generations. Each generation consists of two leptons with electric charges $Q = 0$ and $Q = -1$ and two quarks with $Q = +2/3$ and $Q = -1/3$. The masses of the particles increase significantly with each generation, with the possible exception of the neutrinos [134]. The properties of quarks and leptons are summarized in Table I.

Now, an understanding of how the world is put together requires a theory of how quarks and leptons interact with one another. Equivalently, it requires a theory of the fundamental forces of nature. Four such forces have been identified. They can be characterized on the basis of the following four criteria: the types of particles that experience the force, the relative strength of the force, the range over which the force is effective, and the nature of the particles that mediate the force. Two of the forces, gravitation and electromagnetism, have an unlimited range; largely for this reason they are familiar to everyone. The remaining forces, which are called simply the weak force and the strong force, cannot be perceived directly because their influence extends only over a short range, no larger than the radius of an atomic nucleus. The electromagnetic force is carried by the photon, the strong force is mediated by gluons, the W and Z bosons transmit the weak force, and the quantum of the gravitational force is called the graviton. The main properties of the force carriers are summarized in Table II. A comparison of the (approximate) relative force strengths for two protons inside a nucleus is given in Table III. Though gravity is the most obvious force in daily life, on a nuclear scale it is the weakest of the four forces and its effect at the particle level can nearly always be ignored.

In the SM quarks and leptons are allotted several additive quantum numbers: electric charge $Q$, lepton number $L = L_e + L_\mu + L_\tau$, baryon number $B$, strangeness $s$, charmness $c$, bottomness $b$, and topness $t$. For each particle additive quantum number $N$, the corresponding antiparticle has the additive quantum number $-N$.

The additive quantum numbers $Q$ and $B$ are assumed to be conserved in strong, electromagnetic, and weak interactions. The lepton numbers are not involved in strong interactions, but are strictly conserved in both electromagnetic and weak interactions. The remainder, $s$, $c$, $b$ and $t$, are strictly conserved only in strong and electromagnetic interactions, but can undergo a change of one unit in weak interactions.

The quarks have an additional charge which enables them to interact strongly with one another. This charge is a three-fold degree of freedom which has come to be known as color [135], and so the gauge theory describing the strong interaction has taken on the name of quantum chromodynamics (QCD). Each quark flavor can have three colors usually designated red, green, and blue. The antiquarks are colored antired, antigreen, and antiblue. Each quark or antiquark carries a single unit of color or anticolor charge, respectively. The quanta of the color fields are called gluons (as they glue the quarks together). There are eight independent kinds of gluons in $SU(3)_C$, each of which carries a combination of a color charge and an anticolor charge (e.g. red-antigreen). The strong interactions between color charges are such that

in nature the quarks (antiquarks) are grouped into composites collectively called hadrons [136–138]:

$$\begin{cases} q\bar q \ \ (\text{quark + antiquark}) \to \text{mesons} & \text{integral spin} \to \text{Bose-Einstein statistics [139, 140]} \\ qqq \ \ (\text{three quarks}) \to \text{baryons} & \text{half-integral spin} \to \text{Fermi-Dirac statistics [141, 142]} \end{cases} . \tag{258}$$

TABLE I: The three generations of quarks and leptons in the Standard Model.

          Fermion            Short-hand   Generation   Charge   Mass                        Spin
Quarks    up                 u            I            +2/3     $2.3^{+0.7}_{-0.5}$ MeV     1/2
          charm              c            II           +2/3     $1.275 \pm 0.025$ GeV       1/2
          top                t            III          +2/3     $173.21 \pm 0.51$ GeV       1/2
          down               d            I            −1/3     $4.8^{+0.5}_{-0.3}$ MeV     1/2
          strange            s            II           −1/3     $95 \pm 5$ MeV              1/2
          bottom             b            III          −1/3     $4.18 \pm 0.03$ GeV         1/2
Leptons   electron neutrino  $\nu_e$      I            0        < 2 eV (95% CL)             1/2
          muon neutrino      $\nu_\mu$    II           0        < 0.19 MeV (90% CL)         1/2
          tau neutrino       $\nu_\tau$   III          0        < 18.2 MeV (95% CL)         1/2
          electron           e            I            −1       0.511 MeV                   1/2
          muon               µ            II           −1       105.7 MeV                   1/2
          tau                τ            III          −1       1.777 GeV                   1/2

TABLE II: The four force carriers.

Force             Boson      Short-hand   Charge   Mass                        Spin
Electromagnetic   photon     γ            0        0                           1
Weak              W          $W^\pm$      ±1       $80.385 \pm 0.015$ GeV      1
Weak              Z          $Z^0$        0        $91.1876 \pm 0.0021$ GeV    1
Strong            gluon      g            0        0                           1
Gravitation       graviton   G            0        0                           2

In QCD each baryon, antibaryon, or meson is colorless. However, these colorless particles may interact strongly via residual strong interactions arising from their composition of colored quarks and/or antiquarks. On the other hand the colorless leptons are assumed to be structureless in the SM and consequently do not participate in strong interactions.

One may wonder what would happen if we try to see a single quark with color by reaching deep inside a hadron. Quarks are so tightly bound to other quarks that extracting one would require a tremendous amount of energy, so much that it would be sufficient to create more quarks. Indeed, such experiments are done at modern particle colliders and all we get is not an isolated quark, but more hadrons (quark-antiquark pairs or triplets). This property of quarks, that they are always bound in groups that are colorless, is called confinement. Moreover, the color force has the interesting property that, as two quarks approach each other very closely (or equivalently have high energy), the force between them becomes small. This aspect is referred to as asymptotic freedom [143, 144].

TABLE III: Relative force strength for protons in a nucleus.

Force             Relative Strength
Strong            1
Electromagnetic   $10^{-2}$
Weak              $10^{-6}$
Gravitational     $10^{-38}$

Before proceeding we note that aside from binding together quarks inside the hadrons, the strong force indirectly also binds protons and neutrons into atomic nuclei. Such a nuclear force is mediated by pions: spin-0 mesons with masses $m_{\pi^0} = 135.0$ MeV and $m_{\pi^\pm} = 139.6$ MeV.

Electromagnetic processes between electrically charged particles are mediated by massless neutral spin-1 photons. The interaction can be described by a local $U(1)_{\rm EM}$ gauge theory called quantum electrodynamics (QED). The symmetry properties of QED are unquestionably appealing [145–151]. Moreover, QED has yielded results that are in agreement with experiment to an accuracy of about one part in a billion [152], which makes the theory the most accurate physical theory ever devised. It is the model for theories of the other fundamental forces and the standard by which such theories are judged.

Every quark and lepton of the SM interacts weakly. The weak interactions, mediated by the massive $W^+$, $W^-$ and $Z^0$ vector bosons, fall into two classes: (i) charged-current (CC) weak interactions involving the $W^+$ and $W^-$ bosons and (ii) neutral current (NC) weak interactions involving the $Z^0$ boson. The CC interactions, acting exclusively on left-handed particles and right-handed antiparticles, are described by a chiral $SU(2)_L$ local gauge theory, where the subscript $L$ refers to left-handed particles only.³ On the other hand, the NC interactions act on both left-handed and right-handed particles, similar to the electromagnetic interactions. In fact the SM assumes that both the $Z^0$ and the photon arise from a mixing of two bosons, $W^0$ and $B^0$, via the electroweak mixing angle $\theta_W$:

$$\begin{aligned} \gamma &= B^0 \cos\theta_W + W^0 \sin\theta_W , \\ Z^0 &= -B^0 \sin\theta_W + W^0 \cos\theta_W . \end{aligned} \tag{259}$$

³ A phenomenon is said to be chiral if it is not identical to its mirror image. The spin of a particle may be used to define a handedness for that particle. The chirality of a particle is right-handed if the direction of its spin is the same as the direction of its motion. It is left-handed if the directions of spin and motion are opposite.

The electroweak interaction is described by a local gauge theory: $SU(2)_L \otimes U(1)_Y$, where the hypercharge $U(1)_Y$ symmetry involves both left-handed and right-handed particles [153–155]. Experiment requires the masses of the weak gauge bosons $W$ and $Z$ to be heavy so that weak interactions are very short-ranged. The $W$ and $Z$ gauge bosons acquire masses through spontaneous symmetry breaking $SU(2)_L \times U(1)_Y \to U(1)_{\rm EM}$. The breaking of the symmetry triggers the Higgs mechanism [156, 157], which gives the relative masses of the $W$ and $Z$ bosons in terms of the electroweak mixing angle,

$$M_W = M_Z \cos\theta_W , \tag{260}$$

while the photon remains massless. In addition, by coupling originally massless fermions to the scalar Higgs field, it is possible to produce the observed physical fermion masses without violating the gauge invariance.

The conspicuously well-known accomplishments of the $SU(3)_C \otimes SU(2)_L \otimes U(1)_Y$ SM of strong and electroweak forces can be considered as the apotheosis of the gauge symmetry principle to describe particle interactions. Most spectacularly, the recent discovery [158, 159] of a new boson with scalar quantum numbers and couplings compatible with those of a SM Higgs has possibly plugged the final remaining experimental hole in the SM, cementing the theory further.

In summary, the fundamental particles can be classified into spin-1/2 fermions (6 leptons and 6 quarks), and spin-1 gauge bosons ($\gamma$, $W^\pm$, $Z^0$, and $g$). The leptons have 18 degrees of freedom: each of the 3 charged leptons has 2 possible chiralities and its associated anti-particle, whereas the 3 neutrinos and antineutrinos have only one chirality (neutrinos are left-handed and antineutrinos are right-handed). The quarks have 72 degrees of freedom: each of the 6 quarks has the associated antiparticle, three different color states, and 2 chiralities. The gauge bosons have 27 degrees of freedom: a photon has two possible polarization states, each massive gauge boson has 3, and each of the eight independent types of gluon in QCD has 2. The scalar spin-0 Higgs boson, with a mass $m_H \simeq 126$ GeV, has 1 degree of freedom.⁴

⁴ Recently, ATLAS [160] and CMS [161] announced the observation of a peak in the diphoton mass distribution around 750 GeV, using (respectively) 3.2 fb$^{-1}$ and 2.6 fb$^{-1}$ of data recorded at a c.m. energy $\sqrt{s} = 13$ TeV. The diphoton excesses could be interpreted as the decay products of a new massive particle z, with spin 0, 2, or higher [162]. Assuming a narrow width approximation ATLAS gives a local significance of $3.6\sigma$, or else a global significance of $2.0\sigma$ when the look-elsewhere-effect in the mass range $M_z/{\rm GeV} \in [200, 2000]$ is accounted for. Signal-plus-background fits were also implemented for a broad signal component with a large decay width. The largest deviation from the background-only hypothesis corresponds to $M_z \sim 750$ GeV with a total width $\Gamma_{\rm total} \sim 45$ GeV. The local and global significances evaluated for the broad resonance fit are roughly 0.3 higher than that for the fit using the narrow width approximation, corresponding to $3.9\sigma$ and $2.3\sigma$, respectively. The CMS data gives a local significance of $2.6\sigma$ and a global significance smaller than $1.2\sigma$. More recently, ATLAS and CMS updated their diphoton resonance searches [163–165]. ATLAS reanalyzed the 3.2 fb$^{-1}$ of data, targeting separately spin-0 and spin-2 resonances. For spin-0, the most significant deviation from the background-only hypothesis corresponds to $M_z \sim 750$ GeV and $\Gamma_{\rm total} \sim 45$ GeV. The local significance is now increased to $3.9\sigma$ but the global significance remains at the $2\sigma$ level. For the spin-2 resonance, both the local and global significances are reduced down to $3.6\sigma$ and $1.8\sigma$, respectively. The new CMS analysis includes additional data (recorded in 2015 while the magnet was not operated) for a total of 3.3 fb$^{-1}$. The largest excess is observed for $M_z = 760$ GeV and $\Gamma_{\rm total} \approx 11$ GeV, and has a local significance of $2.8\sigma$ for the spin-0 and $2.9\sigma$ for the spin-2 hypothesis. After taking into account the effect of searching for several signal hypotheses, the significance of the excess is reduced to $< 1\sigma$. CMS also communicated a combined search with data recorded at $\sqrt{s} = 13$ TeV and $\sqrt{s} = 8$ TeV. For the combined analysis, the largest excess is observed at $M_z = 750$ GeV and $\Gamma_{\rm total} = 0.1$ GeV. The local and global significances are $\approx 3.4\sigma$ and $1.6\sigma$, respectively. This could be the first observation of physics beyond the SM at the LHC.

B. Equilibrium thermodynamics

The Universe we observe had its beginning in the big bang, the cosmic firewall. Because the early universe was to a good approximation in thermal equilibrium, particle reactions can be modeled using the tools of thermodynamics and statistical mechanics. It will be helpful then to take a second detour and revise some concepts of statistical thermodynamics.

Consider a cubic box of volume $V$, and expand the fields inside into periodic waves with harmonic boundary conditions. The density of states in $k$-space is

$$dN = g\, \frac{V}{(2\pi)^3}\, d^3k , \tag{261}$$

where $g$ is a degeneracy factor and $\vec k$ is the Fourier transform wavenumber. The equilibrium phase space distribution (or occupancy) function for a quantum state of energy $E$ is given by the familiar Fermi-Dirac or Bose-Einstein distributions,

$$f = \frac{1}{e^{(E-\mu)/(kT)} \pm 1} , \tag{262}$$

where $T$ is the equilibrium temperature, $k$ is the Boltzmann constant, $\mu$ is the chemical potential (if present), and $\pm$ corresponds to either Fermi or Bose statistics. Throughout we will consider the case $|\mu| \ll T$ and neglect all chemical potentials when computing total thermodynamic quantities. All evidence indicates that this is a good approximation to describe particle interactions in the super-hot primeval plasma [166].

The number density of a dilute weakly-interacting gas of particles in thermal equilibrium with $g$ internal degrees of freedom is then

$$\begin{aligned} n &= \frac{1}{V} \int f\, dN \\ &= \frac{g}{(2\pi\hbar)^3} \int_0^\infty \frac{4\pi p^2\, dp}{e^{E/(kT)} \pm 1} \\ &= \frac{g}{2\pi^2 \hbar^3 c^3} \int_{mc^2}^\infty \frac{(E^2 - m^2c^4)^{1/2}}{e^{E/(kT)} \pm 1}\, E\, dE , \end{aligned} \tag{263}$$

where in the second line we have changed to momentum space, $\vec p = \hbar \vec k$, and in the third line we used the relativistic relation $E^2 = m^2c^4 + p^2c^2$. The analogous expression for the energy density is easily obtained since it is only necessary to multiply the integrand in (263) by a factor of $E$ for the energy of each mode,

$$\rho = \frac{g}{2\pi^2 \hbar^3 c^3} \int_{mc^2}^\infty \frac{(E^2 - m^2c^4)^{1/2}}{e^{E/(kT)} \pm 1}\, E^2\, dE . \tag{264}$$

Recalling that the pressure is the average value of the momentum transfer $\langle |\vec p|^2 c^2/E \rangle$ in a given direction, we have

$$P = \frac{g}{6\pi^2 \hbar^3 c^3} \int_{mc^2}^\infty \frac{(E^2 - m^2c^4)^{3/2}}{e^{E/(kT)} \pm 1}\, dE , \tag{265}$$

with the factor of 1/3 associated with the assumed isotropy of the momentum distribution.

Let us now compute the above expressions in two asymptotic limits: relativistic and non-relativistic particles, which will be sufficient for our discussion of how the different particle species evolve in the primeval plasma. For $kT \gg mc^2$, the particles behave as if they were massless and the Bose-Einstein and Fermi-Dirac distributions reduce to

$$f(y) = \frac{1}{e^y \pm 1} , \tag{266}$$

where we have defined $y = |\vec p|\, c/(kT)$. Using

$$\int_0^\infty \frac{z^{n-1}}{e^z - 1}\, dz = \Gamma(n)\, \zeta(n) \tag{267}$$

and

$$\int_0^\infty \frac{z^{n-1}}{e^z + 1}\, dz = \frac{1}{2^n}\,(2^n - 2)\, \Gamma(n)\, \zeta(n) , \tag{268}$$

we obtain

$$\begin{aligned} n &= \left(\frac{kT}{c}\right)^3 \frac{4\pi g}{(2\pi\hbar)^3} \int_0^\infty \frac{y^2\, dy}{e^y \pm 1} = \mathcal{A}_\pm\, \frac{\zeta(3)}{\pi^2}\, g \left(\frac{kT}{\hbar c}\right)^3 , \\ \rho &= \mathcal{B}_\pm\, \frac{\pi^2}{30}\, g\, \frac{(kT)^4}{(\hbar c)^3} , \\ P &= \frac{1}{3}\, \rho , \end{aligned} \tag{269}$$

where $\zeta(3) \approx 1.2$ and $\mathcal{A}_- = 1$ for bosons, and $\mathcal{A}_+ = 3/4$ for fermions, $\mathcal{B}_- = 1$ for bosons and $\mathcal{B}_+ = 7/8$ for fermions.⁵

⁵ The Gamma function is an extension of the factorial function for non-integer and complex numbers. If $s$ is a positive integer, then $\Gamma(s) = (s-1)!$. The Riemann zeta function of a real variable $s$, defined by the infinite series $\zeta(s) = \sum_{n=1}^\infty 1/n^s$, converges $\forall\, s > 1$. Note that (268) follows from (267) using the relation $\frac{1}{e^x + 1} = \frac{1}{e^x - 1} - \frac{2}{e^{2x} - 1}$.

For $kT \ll mc^2$, the exponential factor dominates the denominator in both the Bose-Einstein and Fermi-Dirac distributions in (262), so that the bosonic or fermionic nature of the particles becomes indistinguishable. Furthermore, we have

$$E = (p^2c^2 + m^2c^4)^{1/2} = mc^2 \left(1 + \frac{p^2}{m^2c^2}\right)^{1/2} \simeq mc^2 + \frac{p^2}{2m} . \tag{270}$$

Defining $x = |\vec p|/\sqrt{2mkT}$, for the number density we obtain the Boltzmann distribution

$$n = e^{-mc^2/(kT)}\, (2mkT)^{3/2}\, \frac{4\pi g}{(2\pi\hbar)^3} \int_0^\infty e^{-x^2} x^2\, dx = \frac{g}{\hbar^3} \left(\frac{mkT}{2\pi}\right)^{3/2} e^{-mc^2/(kT)} , \tag{271}$$

where we have used

$$\int_0^\infty x^n e^{-x^2}\, dx = \frac{1}{2}\, \Gamma\!\left(\frac{1+n}{2}\right) , \tag{272}$$

with $n = 2$ and $\Gamma(3/2) = \sqrt{\pi}/2$. From (270) it is easily seen that to leading order $\rho = mc^2 n$ in this case. To obtain the associated pressure, note that to leading order $\langle |\vec p|^2 c^2/E \rangle \simeq |\vec p|^2/m$, so that

$$\begin{aligned} P &\simeq e^{-mc^2/(kT)}\, (2mkT)^{5/2}\, \frac{4\pi g}{(2\pi\hbar)^3}\, \frac{1}{3m} \int_0^\infty x^4 e^{-x^2}\, dx \\ &= e^{-mc^2/(kT)}\, (2mkT)^{5/2}\, \frac{4\pi g}{(2\pi\hbar)^3}\, \frac{1}{3m}\, \frac{3\sqrt{\pi}}{8} \\ &= \frac{g}{\hbar^3} \left(\frac{mkT}{2\pi}\right)^{3/2} e^{-mc^2/(kT)}\, kT \\ &= nkT , \end{aligned} \tag{273}$$

where we have used $\Gamma(5/2) = 3\sqrt{\pi}/4$. Note that (273) is just the familiar result for a non-relativistic perfect gas, $P = nkT$. Since $kT \ll mc^2$, we have $P \ll \rho$ and the pressure may be neglected for a gas of non-relativistic particles, as we had anticipated.

For a gas of non-degenerate, relativistic species, the average energy per particle is

$$\langle E \rangle = \frac{\rho}{n} = \begin{cases} \dfrac{\pi^4}{30\,\zeta(3)}\, kT \simeq 2.701\, kT & \text{for bosons} \\[2mm] \dfrac{7\pi^4}{180\,\zeta(3)}\, kT \simeq 3.151\, kT & \text{for fermions} \end{cases} , \tag{274}$$

whereas for a non-relativistic species

$$\langle E \rangle = mc^2 + \frac{3}{2}\, kT . \tag{275}$$

The internal energy $U$ can be considered to be a function of two thermodynamic variables among $P$, $V$, and $T$. (These variables are related by the equation of state.) Let us choose $V$ and $T$ to be the fundamental variables. The internal energy can then be written as $U(V, T)$. Let us differentiate this function:

$$dU = \left(\frac{\partial U}{\partial V}\right)_T dV + \left(\frac{\partial U}{\partial T}\right)_V dT . \tag{276}$$

This equation can be combined with the first law (167) to give

$$T\, dS = \left[\left(\frac{\partial U}{\partial V}\right)_T + P\right] dV + \left(\frac{\partial U}{\partial T}\right)_V dT . \tag{277}$$

Now, since the internal energy is a function of $T$ and $V$ we may therefore choose to view $S$ as a function of $T$ and $V$, and this gives rise to the differential relation

$$dS = \left(\frac{\partial S}{\partial T}\right)_V dT + \left(\frac{\partial S}{\partial V}\right)_T dV . \tag{278}$$

Substituting (278) into (277) and equating the $dV$ and $dT$ parts gives the familiar

$$\frac{\partial U}{\partial T} = T\, \frac{\partial S}{\partial T} \tag{279}$$

and

$$S = \frac{U + PV}{T} , \tag{280}$$

where we have used the relation for extensive quantities⁶ ($\partial S/\partial V = S/V$ and $\partial U/\partial V = U/V$). It is useful to define the entropy density $s = S/V$, which is thus given by

$$s = \frac{\rho + P}{T} . \tag{281}$$

⁶ Recall that an extensive property is any property that depends on the size (or extent) of the system under consideration. Take two identical samples with all properties identical and combine them into a single sample. Properties that double (e.g., energy, volume, entropy) are extensive. Properties that remain the same (e.g., temperature and pressure) are intensive.

For photons, we can compute all of the thermodynamic quantities rather easily

$$\begin{aligned} n_\gamma &= \frac{2\zeta(3)}{\pi^2} \left(\frac{kT_\gamma}{\hbar c}\right)^3 = 60.42 \left(\frac{kT}{hc}\right)^3 = 20.28 \left(\frac{T}{\rm K}\right)^3~\text{photons cm}^{-3} , \\ \rho_\gamma &= \frac{\pi^2}{15}\, \frac{(kT_\gamma)^4}{(\hbar c)^3} = 0.66\, \frac{(kT_\gamma)^4}{(\hbar c)^3} , \\ \langle E_\gamma \rangle &= \frac{\rho_\gamma}{n_\gamma} = 3.73 \times 10^{-16} \left(\frac{T}{\rm K}\right)~{\rm erg} , \\ P_\gamma &= \frac{1}{3}\, \rho_\gamma , \\ s_\gamma &= \frac{4}{3}\, \frac{\rho_\gamma}{T_\gamma} . \end{aligned} \tag{282}$$

In the limit $kT \gg m_i c^2$, the total energy density can be conveniently expressed by

$$\rho_{\rm rad} = \left(\sum_B g_B + \frac{7}{8} \sum_F g_F\right) \frac{1}{(\hbar c)^3}\, \frac{\pi^2}{30}\, (kT)^4 = \frac{1}{(\hbar c)^3}\, \frac{\pi^2}{30}\, g_\rho(T)\, (kT)^4 , \tag{283}$$

where $g_{B(F)}$ is the total number of boson (fermion) degrees of freedom and the sum runs over all boson (fermion) states with $m_i c^2 \ll kT$. The factor of 7/8 is due to the difference between the Fermi and Bose integrals. Equation (283) defines the effective number of degrees of freedom, $g_\rho(T)$, by taking into account new particle degrees of freedom as the temperature is raised. The change in $g_\rho(T)$ (ignoring mass effects) is given in Table IV [167].
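The numerical coefficients in (282) and the counting behind (283) are easy to verify. The Python sketch below is a check under stated assumptions rather than part of the original notes: the function names are ours, the physical constants are the CODATA 2018 SI values, and $\zeta(3)$ is hard-coded.

```python
import math

# CODATA 2018 constants in SI units (assumed inputs for this check).
K_B = 1.380649e-23          # Boltzmann constant, J/K
HBAR = 1.054571817e-34      # reduced Planck constant, J s
C = 2.99792458e8            # speed of light, m/s
ZETA3 = 1.2020569031595943  # Riemann zeta(3)


def photon_number_density_cm3(temp_k):
    """n_gamma of Eq. (282) in photons per cm^3."""
    per_m3 = 2.0 * ZETA3 / math.pi**2 * (K_B * temp_k / (HBAR * C)) ** 3
    return per_m3 * 1.0e-6  # m^-3 -> cm^-3


def photon_energy_density_erg_cm3(temp_k):
    """rho_gamma of Eq. (282) in erg per cm^3."""
    j_per_m3 = math.pi**2 / 15.0 * (K_B * temp_k) ** 4 / (HBAR * C) ** 3
    return j_per_m3 * 10.0  # 1 J/m^3 = 10 erg/cm^3


def mean_photon_energy_erg(temp_k):
    """<E_gamma> = rho_gamma / n_gamma, in erg."""
    return photon_energy_density_erg_cm3(temp_k) / photon_number_density_cm3(temp_k)


def g_rho(g_bosons, g_fermions):
    """Effective degrees of freedom of Eq. (283): bosons plus 7/8 of fermions."""
    return g_bosons + 7.0 / 8.0 * g_fermions


if __name__ == "__main__":
    print(photon_number_density_cm3(1.0), mean_photon_energy_erg(1.0))
    # Photons (2 states) plus three neutrino species (6 states): first row of Table IV.
    print(4.0 * g_rho(2, 6))
```

At $T = 1$ K the first two functions reproduce the coefficients 20.28 and $3.73 \times 10^{-16}$ of (282) to within rounding, and `4 * g_rho(2, 6)` and `4 * g_rho(2, 10)` return the first two entries (29 and 43) of the $4g_\rho(T)$ column in Table IV.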

TABLE IV: Effective numbers of degrees of freedom in SM.

Temperature                       New particles                                              $4g_\rho(T)$
$T < m_e$                         $\gamma$’s + $\nu$’s                                       29
$m_e < T < m_\mu$                 $e^\pm$                                                    43
$m_\mu < T < m_\pi$               $\mu^\pm$                                                  57
$m_\pi < T < T_c$*                $\pi$’s                                                    69
$T_c < T < m_{\rm charm}$         $-\pi$’s + $u, \bar u, d, \bar d, s, \bar s$ + gluons      247
$m_c < T < m_\tau$                $c, \bar c$                                                289
$m_\tau < T < m_{\rm bottom}$     $\tau^\pm$                                                 303
$m_b < T < m_{W,Z}$               $b, \bar b$                                                345
$m_{W,Z} < T < m_{\rm Higgs}$     $W^\pm, Z$                                                 381
$m_H < T < m_{\rm top}$           $H^0$                                                      385
$m_t < T$                         $t, \bar t$                                                427

*$T_c$ corresponds to the confinement-deconfinement transition between quarks and hadrons.

At higher temperatures, $g_\rho(T)$ will be model dependent.

EXERCISE 8.1 If in the next $10^{10}$ yr the volume of the universe increases by a factor of two, what then will be the temperature of the blackbody radiation?

C. The first millisecond

The history of the universe from $10^{-10}$ seconds to today is based on observational facts: the fundamental laws of high energy physics are well-established up to the energies reached by the LHC. Before $10^{-10}$ seconds, the energy of the universe exceeds 13 TeV and we lose the comfort of direct experimental guidance. The physics of that era is therefore as speculative as it is fascinating. Herein we will go back to the earliest of times, as close as possible to the big bang, and follow the evolution of the Universe.

It is clear that as $a \to 0$ the temperature increases without limit, $T \to \infty$, but there comes a point at which the extrapolation of classical physics breaks down. This is the realm of quantum black holes, where the thermal energy of typical particles of mass $m$ is such that their de Broglie wavelength is smaller than their Schwarzschild radius. Equating $h/mc$ to $2Gm/c^2$ yields a characteristic mass for quantum gravity known as the Planck mass $M_{\rm Pl}$.⁷ This mass scale, together with the corresponding length $\hbar/(M_{\rm Pl} c)$ and time $\hbar/(M_{\rm Pl} c^2)$, defines the system of Planck units:

$$\begin{aligned} M_{\rm Pl} &\equiv \sqrt{\frac{\hbar c}{G}} \simeq 10^{19}~{\rm GeV} , \\ \ell_{\rm Pl} &\equiv \sqrt{\frac{\hbar G}{c^3}} \simeq 10^{-35}~{\rm m} , \\ t_{\rm Pl} &\equiv \sqrt{\frac{\hbar G}{c^5}} \simeq 10^{-43}~{\rm s} . \end{aligned} \tag{284}$$

⁷ Strictly speaking this is not quite the Planck mass. It is a factor of $\sqrt{\pi}$ larger. However, this heuristic derivation gives the right order of magnitude.

The Planck time therefore sets the origin of time for the classical big bang era. It is inaccurate to extend the classical solution of the Friedmann equation to $a = 0$ and conclude that the universe began in a singularity of infinite density.

At $t \sim 10^{-43}$ s, a kind of phase transition is thought to have occurred during which the gravitational force condensed out as a separate force. The symmetry of the four forces was broken, but the strong, weak, and electromagnetic forces were still unified, and there were no distinctions between quarks and leptons. This is an unimaginably short time, and predictions can be only speculative. The temperature would have been about $10^{32}$ K, corresponding to particles moving about every which way with an average kinetic energy of

$$kT \approx \frac{1.4 \times 10^{-23}~{\rm J/K} \times 10^{32}~{\rm K}}{1.6 \times 10^{-10}~{\rm J/GeV}} \approx 10^{19}~{\rm GeV} , \tag{285}$$

where we have ignored the factor 2/3 in our order of magnitude calculation. Very shortly thereafter, as the temperature had dropped to about $10^{28}$ K, there was another phase transition and the strong force condensed out at about $10^{-35}$ s after the bang. Now the universe was filled with a soup of quarks and leptons. About this time, the universe underwent an incredible exponential expansion, increasing in size by a factor of $\gtrsim 10^{26}$ in a tiny fraction of a second, perhaps $\sim 10^{-34}$ s.

As a matter of fact, the favored ΛCDM model implicitly includes the hypothesis of a very early period in which the scale factor of the universe expands exponentially: $a(t) \propto e^{Ht}$. If the interval of exponential expansion satisfies $\Delta t \gtrsim 60/H$, a small causally connected region can grow sufficiently to accommodate the observed homogeneity and isotropy [168]. To properly understand why this is so, we express the comoving horizon (210) as an integral of the comoving Hubble radius,

$$\varrho_h \equiv c \int_0^t \frac{dt'}{a(t')} = c \int_0^a \frac{da}{H a^2} = c \int_0^a d\ln a\, \frac{1}{aH} . \tag{286}$$

At this stage it is important to emphasize a subtle distinction between the comoving horizon $\varrho_h$ and the comoving Hubble radius $c/(aH)$. If particles are separated by distances greater than $\varrho_h$, they never could have communicated with one another; if they are separated by distances greater than $c/(aH)$, they cannot talk to each other now. This distinction is crucial for the solution to the horizon problem which relies on the following: It is


possible that $\varrho_h$ is much larger than $c/(aH)$ now, so that particles cannot communicate today but were in causal contact early on. From (286) we see that this might happen if the comoving Hubble radius in the early universe was much larger than it is now so that $\varrho_h$ got most of its contribution from early times. Hence, we require a phase of decreasing Hubble radius, as illustrated in Fig. 31. The shrinking Hubble sphere is defined by $d(aH)^{-1}/dt < 0$. From the relation $d(aH)^{-1}/dt = -\ddot a/(aH)^2$ we see immediately that a shrinking comoving Hubble radius implies accelerated expansion $\ddot a > 0$. This explains why inflation is often defined as a period of accelerated expansion. The second time derivative of the scale factor may of course be related to the first time derivative of the Hubble parameter according to

$$
\frac{\ddot a}{a} = H^2 (1 - \epsilon) \, , \tag{287}
$$

where $\epsilon \equiv -\dot H/H^2$. Acceleration therefore corresponds to $\epsilon < 1$. All in all, $H$ is approximately constant during inflation whereas $a$ grows exponentially, and so this implies that the comoving Hubble radius decreases just as advertised. Now, consulting (184) we infer that $\ddot a > 0$ requires a negative pressure: $P < -\rho/3$. To see how this can be realized in various physics models see e.g. [169, 170].

FIG. 31: Evolution of the comoving Hubble radius, $c/(aH)$, in the inflationary universe. The comoving Hubble sphere shrinks during inflation and expands after inflation. Inflation is therefore a mechanism to zoom-in on a smooth sub-horizon patch [169].

EXERCISE 8.2 (228) can be rearranged to give

$$
\frac{8 \pi G \rho}{3 c^2 H^2} - \frac{k c^2}{H^2 a^2 R_0^2} + \frac{\Lambda c^2}{3 H^2} = 1 \tag{288}
$$

and so using (231) we rewrite (288) as

$$
\Omega - 1 = \frac{k c^2}{a^2 R_0^2 H^2} \, . \tag{289}
$$

Now let us make a tremendous approximation and assume that the Friedmann equation is valid until the Planck era. From (289) we read that if the universe is perfectly flat, then $\Omega = 1$ at all times. However, if there is even a small curvature term, the time dependence of $\Omega - 1$ is quite different. In particular, for the radiation dominated era we have $H^2 \propto \rho_{\rm rad} \propto a^{-4}$ and $\Omega - 1 \propto a^2$, whereas during matter domination, $\rho_m \propto a^{-3}$ and $\Omega - 1 \propto a$. In both cases $\Omega - 1$ decreases going backwards in time. Since we know that $\Omega_0 - 1$ is of order unity at present, we can deduce its value at $t_{\rm Pl}$,

$$
\frac{|\Omega - 1|_{T = T_{\rm Pl}}}{|\Omega - 1|_{T = T_0}} \approx \frac{a_{\rm Pl}^2}{a_0^2} \approx \frac{T_0^2}{T_{\rm Pl}^2} \approx \mathcal{O}(10^{-64}) \, . \tag{290}
$$

This means that to get the correct value of $\Omega_0 - 1 \sim 1$ today, the value of $\Omega - 1$ at early times has to be fine-tuned to values amazingly close to zero, but without being exactly zero. This has been dubbed the flatness problem.$^{8}$ Show that the inflationary hypothesis elegantly solves the flatness fine-tuning problem.

$^{8}$ A didactic explanation of the flatness fine-tuning problem is given in [171].

After the very brief inflationary period, the universe would have settled back into its more regular expansion. For $10^{-34}~{\rm s} \lesssim t \lesssim 10^{5}~{\rm yr}$, the universe is thought to have been dominated by radiation. This corresponds to $10^{3}~{\rm K} \lesssim T \lesssim 10^{27}~{\rm K}$. We have seen that the equation of state can be given by $w = 1/3$. If we neglect the contributions to $H$ from $\Lambda$ (this is always a good approximation for small enough $a$) then we find that $a \sim t^{1/2}$ and $\rho_{\rm rad} \sim a^{-4}$. Substituting (283) into (158) we can rewrite the expansion rate as a function of the temperature in the plasma

$$
H = \left( \frac{8 \pi G \rho_{\rm rad}}{3} \right)^{1/2} = \left( \frac{8 \pi^3}{90} \, g_\rho(T) \right)^{1/2} \frac{T^2}{M_{\rm Pl}} \sim 1.66 \sqrt{g_\rho(T)} \, \frac{T^2}{M_{\rm Pl}} \, , \tag{291}
$$

where we have adopted natural units ($\hbar = c = k = 1$). Neglecting the $T$-dependence of $g_\rho$ (i.e. away from mass thresholds and phase transitions), integration of (291) yields (187) and the useful commonly used approximation

$$
t \simeq \left( \frac{3 M_{\rm Pl}^2}{32 \pi \rho_{\rm rad}} \right)^{1/2} \simeq 2.42 \, \frac{1}{\sqrt{g_\rho}} \left( \frac{T}{\rm MeV} \right)^{-2}~{\rm s} \, . \tag{292}
$$

At about $10^{-10}$ s the Higgs field spontaneously acquires a vacuum expectation value, which breaks the electroweak gauge symmetry. As a consequence, the weak force and electromagnetic force manifest with different ranges. In addition, quarks and charged leptons interacting with the Higgs field become massive.

The fundamental interactions have by then taken their present forms.

By the time the universe was about a microsecond old, quarks began to condense into mesons and baryons. To see why, let us focus on the most familiar hadrons: nucleons and their antiparticles. When the average kinetic energy of particles was somewhat higher than 1 GeV, protons, neutrons, and their antiparticles were continually being created out of the energies of collisions involving photons and other particles. But just as quickly, particles and antiparticles would annihilate. Hence the process of creation and annihilation of nucleons was in equilibrium. The numbers of nucleons and antinucleons were high: roughly as many as there were electrons, positrons, or photons. But as the universe expanded and cooled, and the average kinetic energy of particles dropped below about 1 GeV, which is the minimum energy needed in a typical collision to create nucleons and antinucleons (940 MeV each), the process of nucleon creation could not continue. However, the process of annihilation could continue with antinucleons annihilating nucleons, until there were almost no nucleons left; but not quite zero!

Manned and unmanned exploration of the solar system tell us that it is made up of the same stuff as the Earth: baryons. Observational evidence from radio-astronomy and cosmic ray detection indicates that the Milky Way, as well as interstellar space, and distant galaxies are also made of baryons. Therefore, we can cautiously conclude that the baryon number of the observable universe is $B > 0$. This requires that the early $q \bar q$ plasma contained a tiny surplus of quarks. After all anti-matter annihilated with matter, only the small surplus of matter remained

$$
\eta = \frac{n_B - n_{\bar B}}{n_\gamma} = 5 \times 10^{-10} \ \frac{\text{excess baryons}}{\text{photons}} \, . \tag{293}
$$

The tiny surplus can be explained by interactions in the early universe that were not completely symmetric with respect to an exchange of matter-antimatter, the so-called “baryogenesis” [172].

At this stage, it is worthwhile to point out that if some relativistic particles have decoupled from the photons, it is necessary to distinguish between two kinds of relativistic degrees of freedom (r.d.o.f.): those associated with the total energy density $g_\rho$, and those associated with the total entropy density $g_s$. At energies above the deconfinement transition towards the quark gluon plasma, quarks and gluons are the relevant fields for the QCD sector, such that the effective number of interacting (thermally coupled) r.d.o.f. is $g_s(T) = 61.75$. As the universe cools down below the confinement scale $\Lambda_{\rm QCD} \sim 200$ MeV, the SM plasma transitions to a regime where mesons and baryons are the pertinent degrees of freedom. Precisely, the relevant hadrons present in this energy regime are pions and charged kaons, such that $g_s(T) = 19.25$ [173]. This significant reduction in the degrees of freedom results from the rapid annihilation or decay of more massive hadrons which may have formed during the transition. The quark-hadron crossover transition therefore corresponds to a large redistribution of entropy into the remaining degrees of freedom. To connect the temperature to an effective number of r.d.o.f. we make use of some high statistics lattice simulations of a QCD plasma in the hot phase, especially the behavior of the entropy during the changeover [174]. Concretely, the effective number of interacting r.d.o.f. in the plasma at temperature $T$ is given by

$$
g_s(T) \simeq r(T) \left( g_B + \frac{7}{8} \, g_F \right) , \tag{294}
$$

where the coefficient $r(T)$ is unity for leptons, two for photon contributions, and is the ratio $s(T)/s_{\rm SB}$ for the quark-gluon plasma [175]. Here, $s(T)$ and $s_{\rm SB}$ are the entropy density and the ideal Stefan-Boltzmann limit shown in Fig. 32. The entropy rise during the confinement-deconfinement changeover can be parametrized, for $150~{\rm MeV} < T < 500~{\rm MeV}$, by

$$
\frac{s}{T^3} \simeq \frac{42.82}{\sqrt{392 \pi}} \, e^{-C_1} + 18.62 \, \frac{C_2^2 \, e^{C_2}}{\left( e^{C_2} - 1 \right)^2} \, , \tag{295}
$$

where $C_1 = (T_{\rm MeV} - 151)^2/392$ and $C_2 = 195.1/(T_{\rm MeV} - 134)$. For the same energy range, we obtain

$$
g_s(T) \simeq 47.5 \, r(T) + 19.25 \, . \tag{296}
$$

In Fig. 32 we show $g_s(T)$ as given by (296). The parametrization is in very good agreement with phenomenological estimates [176, 177].

The entropy density is dominated by the contribution of relativistic particles, so to a very good approximation

$$
s = \frac{2 \pi^2}{45} \, g_s(T) \, T^3 \, . \tag{297}
$$

Conservation of $S = sV$ leads to

$$
\frac{d}{dt} \left( s a^3 \right) = 0 \tag{298}
$$

and therefore that $g_s(T) T^3 a^3$ remains constant as the universe expands. As one would expect, a non-evolving system would stay at constant number or entropy density in comoving coordinates even though the number or entropy density is in fact decreasing due to the expansion of the universe. Since the quark-gluon energy density in the plasma has a similar $T$ dependence to that of the entropy (see e.g. Fig. 7 in [174]), hereafter we simplify the discussion by taking $g = g_\rho = g_s$.

After the first millisecond has elapsed, when the majority of hadrons and anti-hadrons annihilated each other, we entered the lepton era.

D. Neutrino decoupling and BBN

After the first tenth of a second, when the temperature was about $3 \times 10^{10}$ K, the universe was filled

[Figure 32. Left panel: $s/T^3$ versus $T$ (MeV) for $150 \leq T \leq 500$ MeV, with the Stefan-Boltzmann limit $s_{\rm SB}/T^3$ marked. Right panel: $g_s$ versus $T$ (MeV) over the same range.]

FIG. 32: Left. The parametrization of the entropy density given in Eq. (295) (dashed line) superposed on the result from high statistics lattice simulations [174] (solid line). Right. Comparison of $g_s(T)$ obtained using (296) (dashed line) and the phenomenological estimate of [176, 177] (solid line) [178].

with a plasma of protons, neutrons, electrons, positrons, photons, neutrinos, and antineutrinos ($p$, $n$, $\gamma$, $e^-$, $e^+$, $\nu$, and $\bar\nu$). The baryons are of course nonrelativistic while all the other particles are relativistic. These particles are kept in thermal equilibrium by various electromagnetic and weak processes of the sort $\nu \bar\nu \rightleftharpoons e^+ e^-$, $\nu e^- \rightleftharpoons \nu e^-$, $n \nu_e \rightleftharpoons p e^-$, $\gamma\gamma \rightleftharpoons e^+ e^-$, $\gamma p \rightleftharpoons \gamma p$, etc. In complying with the precision demanded of our phenomenological approach it would be sufficient to consider that the cross section of reactions involving left-handed neutrinos, right-handed antineutrinos, and electrons is $\sigma_{\rm weak} \sim G_F^2 E^2$, where $G_F = 1.16 \times 10^{-5}~{\rm GeV}^{-2}$ is the Fermi constant. If we approximate the energy $E$ of all particle species by their temperature $T$, their velocity by $c$, and their density by $n \sim T^3$, then the interaction rate of neutrinos is [166]

$$
\Gamma_{{\rm int},\nu}(T) \approx \langle v \sigma \rangle \, n_\nu \approx G_F^2 \, T^5 \, . \tag{299}
$$

Comparing (299) with the expansion rate (291), calculated for $g(T) = 10.75$, we see that when the temperature drops below some characteristic temperature $T^{\rm dec}_{\nu_L}$ neutrinos decouple, i.e. they lose thermal contact with electrons [179–181]. The condition

$$
\Gamma_{{\rm int},\nu}(T^{\rm dec}_{\nu_L}) = H(T^{\rm dec}_{\nu_L}) \tag{300}
$$

sets the decoupling temperature for left-handed neutrinos: $T^{\rm dec}_{\nu_L} \sim 1$ MeV.

The much stronger electromagnetic interaction continues to keep the protons, neutrons, electrons, positrons, and photons in equilibrium. The reaction rate per nucleon, $\Gamma_{{\rm int},N} \sim T^3 \alpha^2/m_N^2$, is larger than the expansion rate as long as

$$
T > \frac{m_N^2}{\alpha^2 M_{\rm Pl}} \sim \text{a very low temperature} \, , \tag{301}
$$

where the non-relativistic form of the electromagnetic cross section, $\sigma \sim \alpha^2/m_N^2$, has been obtained by dimensional analysis, with $\alpha$ the fine structure constant. The nucleons are thus maintained in kinetic equilibrium. The average kinetic energy per nucleon is $3T/2$. One must be careful to distinguish between kinetic equilibrium and chemical equilibrium. Reactions like $\gamma\gamma \to p \bar p$ have long been suppressed, as there are essentially no anti-nucleons around.

For $T > m_e \sim 0.5~{\rm MeV} \sim 5 \times 10^{9}$ K, the numbers of electrons, positrons, and photons are comparable, $n_{e^-} \sim n_{e^+} \sim n_\gamma$. The exact ratios are of course easily supplied by inserting the appropriate “g-factors.” Because the universe is electrically neutral, $n_{e^-} - n_{e^+} = n_p$ and so there is a slight excess of electrons over positrons. When $T$ drops below $m_e$, the process $\gamma\gamma \to e^+ e^-$ is severely suppressed by the Boltzmann factor $e^{-m_e/T}$, as only very energetic photons in the “tail-end” of the Bose distribution can participate. Thus positrons and electrons annihilate rapidly via $e^+ e^- \to \gamma\gamma$ and are not replenished (leaving a small number of electrons $n_{e^-} \sim n_p \sim 5 \times 10^{-10} \, n_\gamma$). As long as thermal equilibrium was preserved, the total entropy remained fixed. We have seen that $s a^3 \propto g(T) T^3 a^3 = {\rm constant}$. For $T \gtrsim m_e$, the particles in thermal equilibrium with the photons include the photon ($g_\gamma = 2$) and $e^\pm$ pairs ($g_{e^\pm} = 4$). The effective total number of particle species before annihilation is $g_{\rm before} = 11/2$. On the other hand, after the annihilation of electrons and positrons, the only remaining abundant particles in equilibrium are photons. Hence the effective number of particle species is $g_{\rm after} = 2$. It follows from the conservation of entropy that

$$
\frac{11}{2} \, (T_\gamma a)^3 \Big|_{\rm before} = 2 \, (T_\gamma a)^3 \Big|_{\rm after} \, . \tag{302}
$$

That is, the heat produced by the annihilation of electrons and positrons increases the quantity $T_\gamma a$ by a factor of

$$
\frac{(T_\gamma a)|_{\rm after}}{(T_\gamma a)|_{\rm before}} = \left( \frac{11}{4} \right)^{1/3} \simeq 1.4 \, . \tag{303}
$$

Before the annihilation of electrons and positrons, the neutrino temperature $T_\nu$ is the same as the photon temperature $T_\gamma$. But from then on, $T_\nu$ simply dropped like $a^{-1}$, so for all subsequent times, $T_\nu a$ equals the value before annihilation,

$$
(T_\nu a)|_{\rm after} = (T_\nu a)|_{\rm before} = (T_\gamma a)|_{\rm before} \, . \tag{304}
$$

We conclude therefore that after the annihilation process is over, the photon temperature is higher than the neutrino temperature by a factor of

$$
\left( \frac{T_\gamma}{T_\nu} \right)_{\rm after} = \frac{(T_\gamma a)|_{\rm after}}{(T_\nu a)|_{\rm after}} \simeq 1.4 \, . \tag{305}
$$

Therefore, even though out of thermal equilibrium, the neutrinos and antineutrinos make an important contribution to the energy density.

EXERCISE 8.3 By assuming that neutrinos saturate the dark matter density derive an upper bound on the neutrino mass [182].
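The entropy bookkeeping behind (302), (303), and (305) can be checked numerically. The following is a minimal sketch: the variable names are illustrative, and the factor 7/8 weighting the fermionic degrees of freedom is the one that appears in (294).

```python
# Entropy bookkeeping across e+ e- annihilation, following (302)-(305).
g_photon = 2.0               # photon polarization states
g_electron_pairs = 4.0       # e- and e+, two spin states each
fermion_weight = 7.0 / 8.0   # fermion weight relative to bosons, as in (294)

g_before = g_photon + fermion_weight * g_electron_pairs  # 11/2
g_after = g_photon                                       # photons only

# g (T a)^3 is conserved, so T_gamma * a grows by (g_before / g_after)^(1/3).
heating_factor = (g_before / g_after) ** (1.0 / 3.0)

# Neutrinos have already decoupled, so T_nu * a is unchanged and the same
# factor gives T_gamma / T_nu after annihilation.
print(f"g_before = {g_before}, T_gamma/T_nu = {heating_factor:.3f}")
```

The printed ratio, $(11/4)^{1/3} \approx 1.401$, is the value rounded to 1.4 in (303) and (305).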

The energy density stored in relativistic species is customarily given in terms of the so-called effective number of neutrino species, $N_{\rm eff}$, through the relation

$$
\rho_{\rm rad} = \left[ 1 + \frac{7}{8} \left( \frac{4}{11} \right)^{4/3} N_{\rm eff} \right] \rho_\gamma \, , \tag{306}
$$

and so

$$
N_{\rm eff} \equiv \frac{\rho_{\rm rad} - \rho_\gamma}{\rho_\nu} \simeq \frac{8}{7} \sum_B{}^{\prime} \, \frac{g_B}{2} \left( \frac{T_B}{T_\nu} \right)^4 + \sum_F{}^{\prime} \, \frac{g_F}{2} \left( \frac{T_F}{T_\nu} \right)^4 \, , \tag{307}
$$

where $\rho_\nu$ denotes the energy density of a single species of massless neutrinos, $T_{B(F)}$ is the effective temperature of boson (fermion) species, and the primes indicate that electrons and photons are excluded from the sums [183, 184]. The normalization of $N_{\rm eff}$ is such that it gives $N_{\rm eff} = 3$ for three families of massless left-handed standard model neutrinos. For most practical purposes, it is accurate enough to consider that neutrinos freeze-out completely at about 1 MeV. However, as the temperature dropped below this value, neutrinos were still interacting with the electromagnetic plasma and hence received a tiny portion of the entropy from pair annihilations. The non-instantaneous neutrino decoupling gives a correction to the normalization $N_{\rm eff} = 3.046$ [185–188]. Near 1 MeV, the CC weak interactions,

$$
n \, \nu_e \rightleftharpoons p \, e^- , \qquad n \, e^+ \rightleftharpoons p \, \bar\nu_e , \qquad n \rightleftharpoons p \, e^- \, \bar\nu_e \tag{308}
$$

guarantee neutron-proton chemical equilibrium. Defining $\lambda_{np}$ as the summed rate of the reactions which convert neutrons to protons,

$$
\lambda_{np} = \lambda(n \, \nu_e \to p \, e^-) + \lambda(n \, e^+ \to p \, \bar\nu_e) + \lambda(n \to p \, e^- \, \bar\nu_e) \, , \tag{309}
$$

the rate $\lambda_{pn}$ for the reverse reactions which convert protons to neutrons is given by detailed balance:

$$
\lambda_{pn} = \lambda_{np} \, e^{-\Delta m/T(t)} \, , \tag{310}
$$

where $\Delta m \equiv m_n - m_p = 1.293$ MeV. The evolution of the fractional neutron abundance $X_{n/N} \equiv n_n/n_N$ is described by the balance equation

$$
\frac{dX_{n/N}(t)}{dt} = \lambda_{pn}(t) \left[ 1 - X_{n/N}(t) \right] - \lambda_{np}(t) \, X_{n/N}(t) \, , \tag{311}
$$

where $n_N$ is the total nucleon density at this time, $n_N = n_n + n_p$. The equilibrium solution is obtained by setting $dX_{n/N}(t)/dt = 0$:

$$
X^{\rm eq}_{n/N}(t) = \frac{\lambda_{pn}(t)}{\lambda_{pn}(t) + \lambda_{np}(t)} = \left[ 1 + e^{\Delta m/T(t)} \right]^{-1} . \tag{312}
$$

The neutron abundance tracks its value in equilibrium until the inelastic neutron-proton scattering rate decreases sufficiently so as to become comparable to the Hubble expansion rate. At this point the neutrons freeze-out, that is they go out of chemical equilibrium. The neutron abundance at the freeze-out temperature $T^{\rm FO}_{n/N} = 0.75$ MeV can be approximated by its equilibrium value (312),

$$
X_{n/N}(T^{\rm FO}_{n/N}) \simeq X^{\rm eq}_{n/N}(T^{\rm FO}_{n/N}) = \left[ 1 + e^{\Delta m/T^{\rm FO}_{n/N}} \right]^{-1} . \tag{313}
$$
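To get a feeling for the numbers in (312) and (313), the equilibrium neutron fraction can be evaluated directly. This is a minimal sketch that uses only the values quoted in the text ($\Delta m = 1.293$ MeV and $T^{\rm FO}_{n/N} = 0.75$ MeV); the function and variable names are illustrative.

```python
import math

DELTA_M_MEV = 1.293  # neutron-proton mass difference quoted below (310)

def neutron_fraction_eq(temperature_mev):
    """Equilibrium neutron fraction of (312): 1 / (1 + exp(dm / T))."""
    return 1.0 / (1.0 + math.exp(DELTA_M_MEV / temperature_mev))

T_FREEZE_OUT_MEV = 0.75  # freeze-out temperature quoted above (313)
x_freeze_out = neutron_fraction_eq(T_FREEZE_OUT_MEV)

# At high temperature neutrons and protons are equally abundant; the
# neutron fraction falls as T approaches the mass difference.
for t_mev in (100.0, 10.0, 3.0, 1.0, T_FREEZE_OUT_MEV):
    print(f"T = {t_mev:6.2f} MeV  ->  X_n/N = {neutron_fraction_eq(t_mev):.3f}")
```

At freeze-out this gives $X_{n/N} \approx 0.15$, i.e. roughly one neutron for every six protons.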

Since the ratio $\Delta m/T^{\rm FO}_{n/N}$ is of $\mathcal{O}(1)$, a substantial fraction of neutrons survive when chemical equilibrium between neutrons and protons is broken. At this time, the photon temperature is already below the deuterium binding energy $\Delta_D \simeq 2.2$ MeV, thus one would expect sizable amounts of D to be formed via the process $n \, p \to D \, \gamma$. However, the large photon-nucleon density ratio $\eta^{-1}$ delays deuterium synthesis until the photo-dissociation process becomes ineffective (deuterium bottleneck). Defining the onset of nucleosynthesis by the criterion

$$
e^{\Delta_D/T_{\rm BBN}} \, \eta \sim 1 \, , \tag{314}
$$

we obtain $T_{\rm BBN} \approx 89$ keV. Note that (314) ensures that below $T_{\rm BBN}$ the high energy tail in the photon distribution, with energy larger than $\Delta_D$, has been sufficiently diluted by the expansion. At this epoch, $N(T) = 3.36$, hence the time-temperature relationship (292) dictates that big bang nucleosynthesis (BBN) begins at

$$
t_{\rm BBN} \simeq 167~{\rm s} \approx 180~{\rm s} \, , \tag{315}
$$

as widely popularized by Weinberg [22]. Once D starts forming, a whole nuclear process network sets in [189, 190]. When the temperature dropped below $\sim 80$ keV, the universe has cooled sufficiently that the cosmic nuclear reactor can begin in earnest, building the lightest nuclides through the following sequence of two-body reactions

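$$
\begin{array}{lll}
p \, n \to \gamma \, D, & & \\
p \, D \to {}^3{\rm He} \, \gamma, & D \, D \to {}^3{\rm He} \, n, & D \, D \to p \, T, \\
T \, D \to {}^4{\rm He} \, n, & {}^4{\rm He} \, T \to {}^7{\rm Li} \, \gamma, & \\
{}^3{\rm He} \, n \to p \, T, & {}^3{\rm He} \, D \to {}^4{\rm He} \, p, & {}^3{\rm He} \, {}^4{\rm He} \to {}^7{\rm Be} \, \gamma, \\
{}^7{\rm Li} \, p \to {}^4{\rm He} \, {}^4{\rm He}, & {}^7{\rm Be} \, n \to {}^7{\rm Li} \, p, & \\
\vdots & &
\end{array} \tag{316}
$$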
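The onset time quoted in (315) follows from the time-temperature relation (292), and the numerical prefactors in (291) and (292) can be recovered in a few lines. This is a sketch under stated assumptions: the Planck mass value $M_{\rm Pl} \approx 1.22 \times 10^{19}$ GeV is not given in this section and is assumed here, $t = 1/(2H)$ is used as appropriate for $a \sim t^{1/2}$, and the function names are illustrative.

```python
import math

HBAR_MEV_S = 6.582e-22  # hbar in MeV s (6.58e-25 GeV s)
M_PL_MEV = 1.22e22      # ASSUMED Planck mass, 1.22e19 GeV expressed in MeV

def hubble_prefactor():
    """Numerical prefactor of (291): sqrt(8 pi^3 / 90)."""
    return math.sqrt(8.0 * math.pi ** 3 / 90.0)

def age_seconds(temperature_mev, g_eff):
    """Age t = 1/(2H) of a radiation-dominated universe, as in (292)."""
    hubble_mev = (hubble_prefactor() * math.sqrt(g_eff)
                  * temperature_mev ** 2 / M_PL_MEV)
    return HBAR_MEV_S / (2.0 * hubble_mev)  # convert 1/MeV to seconds

prefactor_291 = hubble_prefactor()    # compare with 1.66 in (291)
prefactor_292 = age_seconds(1.0, 1.0) # compare with 2.42 s in (292)
t_bbn = age_seconds(0.089, 3.36)      # T_BBN = 89 keV with 3.36 d.o.f.

print(f"(291) prefactor: {prefactor_291:.2f}")
print(f"(292) prefactor: {prefactor_292:.2f} s")
print(f"t_BBN: {t_bbn:.0f} s")
```

With these inputs the sketch gives prefactors close to 1.66 and 2.42 s, and an onset time of about 167 s, in line with (315).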

By this time the neutron abundance surviving at freeze-out has been depleted by $\beta$-decay to

$$
X_{n/N}(T_{\rm BBN}) \simeq X_{n/N}(T^{\rm FO}_{n/N}) \, e^{-t_{\rm BBN}/\tau_n} \, , \tag{317}
$$

where $\tau_n \simeq 887$ s is the neutron lifetime. Nearly all of these surviving neutrons are captured in ${}^4$He because of its large binding energy ($\Delta_{{}^4{\rm He}} = 28.3$ MeV) via the reactions listed in (316). Heavier nuclei do not form in any significant quantity both because of the absence of stable nuclei with $A = 5$ or 8, which impedes nucleosynthesis via $n \, {}^4$He, $p \, {}^4$He or ${}^4$He ${}^4$He reactions, and because of the large Coulomb barrier for reactions such as ${}^4{\rm He} \, T \to {}^7{\rm Li} \, \gamma$ and ${}^3{\rm He} \, {}^4{\rm He} \to {}^7{\rm Be} \, \gamma$. By the time the temperature has dropped below $\sim 30$ keV, a time comparable to the neutron lifetime, the average thermal energy of the nuclides and nucleons is too small to overcome the Coulomb barriers; any remaining free neutrons decay, and BBN ceases. The resulting mass fraction of helium, conventionally referred to as $Y_p$, is simply given by

$$
Y_p \simeq 2 X_{n/N}(t_{\rm BBN}) = 0.251 \, , \tag{318}
$$

where the subscript $p$ denotes primordial. The above calculation demonstrates how the synthesized helium abundance depends on the physical parameters. After a bit of algebra, (318) can be rewritten as [189]

$$
Y_p \simeq 0.251 + 0.014 \, \Delta N^{\rm eff}_\nu + 0.0002 \, \Delta\tau_n + 0.009 \, \ln \left( \frac{\eta}{5 \times 10^{-10}} \right) . \tag{319}
$$
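The chain (313), (317), (318) is short enough to evaluate by hand, but a small script makes the dependence on the inputs explicit. The sketch below uses only numbers quoted in the text; the function `helium_fit` simply transcribes (319), and since the text does not define $\Delta\tau_n$, reading it as a shift in seconds relative to 887 s is an assumption.

```python
import math

DELTA_M_MEV = 1.293  # neutron-proton mass difference, below (310)
T_FO_MEV = 0.75      # freeze-out temperature, above (313)
T_BBN_S = 167.0      # onset of nucleosynthesis, (315)
TAU_N_S = 887.0      # neutron lifetime, below (317)

x_freeze_out = 1.0 / (1.0 + math.exp(DELTA_M_MEV / T_FO_MEV))  # (313)
x_bbn = x_freeze_out * math.exp(-T_BBN_S / TAU_N_S)            # (317)
helium_mass_fraction = 2.0 * x_bbn                             # (318)

def helium_fit(delta_n_eff=0.0, delta_tau_n=0.0, eta=5e-10):
    """Transcription of (319); delta_tau_n is ASSUMED to be in seconds."""
    return (0.251 + 0.014 * delta_n_eff + 0.0002 * delta_tau_n
            + 0.009 * math.log(eta / 5e-10))

print(f"X_n/N at freeze-out: {x_freeze_out:.4f}")
print(f"X_n/N at BBN onset:  {x_bbn:.4f}")
print(f"Y_p from (318):      {helium_mass_fraction:.3f}")
print(f"Y_p with one extra neutrino species, (319): {helium_fit(1.0):.3f}")
```

The first three lines reproduce $Y_p \approx 0.251$. According to (319), one additional neutrino species would raise this to about 0.265.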

In summary, primordial nucleosynthesis has a single adjustable parameter: the baryon density. Observations that led to the determination of primordial abundance of D, ${}^3$He and ${}^7$Li can determine $\eta$. The internal consistency of BBN can then be checked by comparing the abundances of the other nuclides, predicted using this same value of $\eta$, with observed abundances. Interestingly, in contrast to the other light nuclides, the BBN-predicted primordial abundance of ${}^4$He is very insensitive to the baryon density parameter. Rather, the ${}^4$He mass fraction depends on the neutron-to-proton ratio at BBN because virtually all neutrons available at that time are incorporated into ${}^4$He. Therefore, while D, ${}^3$He, and ${}^7$Li are potential baryometers, ${}^4$He provides a potential chronometer.

EXERCISE 8.4 Suppose that the difference in rest energy of the neutron and proton were 0.1293 MeV, instead of 1.293 MeV, with all other physical parameters unchanged. Estimate the maximum possible mass fraction in ${}^4$He, assuming that all available neutrons are incorporated into ${}^4$He nuclei.

EXERCISE 8.5 A fascinating bit of cosmological history is that of Gamow’s prediction of the CMB in the late 1940s [191–193]. Unfortunately, his prediction was premature; by the time the CMB was actually discovered, his prediction had fallen into obscurity. This problem reproduces Gamow’s line of argument. Gamow knew that nucleosynthesis must have taken place at a temperature $T_{\rm BBN} \approx 10^{9}$ K. He also knew that the universe must currently be $t_0 \sim 10^{10}$ years old. He then assumed that the universe was flat and radiation dominated, even at the present time. (i) With these assumptions, what was the energy density of the universe at the time of nucleosynthesis? (ii) What was the Hubble parameter at the time of nucleosynthesis? (iii) What was the age of the universe at BBN? (iv) Given the present age, what should the present temperature of the CMB be? (v) If we then assume that the universe changed from being radiation dominated to matter dominated at a redshift $z_{\rm eq} > 0$, will this increase or decrease the CMB temperature, for fixed values of $T_{\rm BBN}$ and $t_0$?

The observationally-inferred primordial fractions of baryonic mass in ${}^4$He ($Y_p = 0.2472 \pm 0.0012$, $Y_p = 0.2516 \pm 0.0011$, $Y_p = 0.2477 \pm 0.0029$, and $Y_p = 0.240 \pm 0.006$) [194–196] have been constantly favoring $N^{\rm eff}_\nu \lesssim 3$ [197]. Unexpectedly, two recent independent studies yield $Y_p$ values somewhat higher than previous estimates: $Y_p = 0.2565 \pm 0.001({\rm stat}) \pm 0.005({\rm syst})$ and $Y_p = 0.2561 \pm 0.011$ [198–200]. For $\tau_n = 885.4 \pm 0.9$ s and $\tau_n = 878.5 \pm 0.8$ s, the updated effective number of light neutrino species is reported as $N_{\rm eff} = 3.68^{+0.80}_{-0.70}$ ($2\sigma$) and $N_{\rm eff} = 3.80^{+0.80}_{-0.70}$ ($2\sigma$), respectively. The most recent estimate of $Y_p$ yields $N_{\rm eff} = 3.58 \pm 0.25$ (68% CL), $\pm 0.40$ (95.4% CL), $\pm$ (99% CL). This entails that a non-standard value of $N_{\rm eff}$ is preferred at the 99% CL, implying the possible existence of additional types of neutrino species [201].

EXERCISE 8.6 We have seen that the best multi-parameter fit of Planck data yields a Hubble constant which deviates by more than $2\sigma$ from the value obtained with the HST. The impact of the Planck $h$ estimate is particularly important in the determination of $N_{\rm eff}$. Combining observations of CMB data the Planck Collaboration reported $N_{\rm eff} = 3.15 \pm 0.23$ [121]. However, if the value of $h$ is not allowed to float in the fit, but instead is frozen to the value determined from the maser-cepheid-supernovae distance ladder, the Planck CMB data then gives $N_{\rm eff} = 3.62 \pm 0.25$, which suggests new neutrino-like physics (at around the $2.3\sigma$ level) [202]. The hints of extra relativistic degrees of freedom at BBN and CMB epochs can be explained, e.g., by means of the right-handed partners of the three, left-handed, SM neutrinos. In particular, milli-weak interactions of these Dirac states may allow the $\nu_R$’s to decouple much earlier, at a higher temperature, than their left-handed counterparts [203]. Determine the minimum decoupling temperature of the right-handed neutrinos which is consistent with Planck data at the $1\sigma$ level.

E. Quantum black holes

As we have seen, black holes are the evolutionary endpoints of massive stars that undergo a supernova explosion leaving behind a fairly massive burned out stellar remnant. With no outward forces to oppose gravitational forces, the remnant will collapse in on itself.

The density to which the matter must be squeezed scales as the inverse square of the mass. For example, the Sun would have to be compressed to a radius of 3 km (about four millionths its present size) to become a black hole. For the Earth to meet the same fate, one would need to squeeze it into a radius of 9 mm, about a billionth its present size. Actually, the density of a solar mass black hole ($\sim 10^{19}~{\rm kg/m^3}$) is about the highest that can be created through gravitational collapse. A body lighter than the Sun resists collapse because it becomes stabilized by repulsive quantum forces between subatomic particles. However, stellar collapse is not the only way to form black holes. The known laws of physics allow matter densities up to the so-called Planck value $10^{97}~{\rm kg/m^3}$, the density at which the force of gravity becomes so strong that quantum mechanical fluctuations can break down the fabric of spacetime, creating a black hole with a radius $\sim 10^{-35}$ m and a mass of $10^{-8}$ kg. This is the lightest black hole that can be produced according to the conventional description of gravity. It is more massive but much smaller in size than a proton.

The high densities of the early universe were a prerequisite for the formation of primordial black holes but did not guarantee it. For a region to stop expanding and collapse to a black hole, it must have been denser than average, so the density fluctuations were also necessary. As we have seen, such fluctuations existed, at least on large scales, or else structures such as galaxies and clusters of galaxies would never have coalesced. For primordial black holes to form, these fluctuations must have been stronger on smaller scales than on large ones, which is possible though not inevitable. Even in the absence of fluctuations, holes might have formed spontaneously at various cosmological phase transitions, for example when the universe ended its early period of accelerated expansion, known as inflation, or at the nuclear density epoch, when particles such as protons condensed out of the soup of their constituent quarks.

The realization that black holes could be so small prompted Stephen Hawking to consider quantum effects, and in 1974 his studies led to the famous conclusion that black holes not only swallow particles but also spit them out [204, 205]. The strong gravitational fields around the black hole induce spontaneous creation of pairs near the event horizon. While the particle with positive energy can escape to infinity, the one with negative energy has to tunnel through the horizon into the black hole where there are particle states with negative energy with respect to infinity.$^{9}$ As the black holes radiate, they lose mass and so will eventually evaporate completely and disappear. The evaporation is generally regarded as being thermal in character,$^{10}$ with a temperature inversely proportional to its mass $M_{\rm BH}$,

$$
T_{\rm BH} = \frac{1}{8 \pi G M_{\rm BH}} = \frac{1}{4 \pi r_s} \, , \tag{320}
$$

and an entropy $S = 2 \pi M_{\rm BH} r_s$, where $r_s$ is the Schwarzschild radius and we have set $c = 1$. Note that for a solar mass black hole, the temperature is around $10^{-6}$ K, which is completely negligible in today’s universe. But for black holes of $10^{12}$ kg the temperature is about $10^{12}$ K, hot enough to emit both massless particles, such as $\gamma$-rays, and massive ones, such as electrons and positrons.

$^{9}$ One can alternatively think of the emitted particles as coming from the singularity inside the black hole, tunneling out through the event horizon to infinity [206].

$^{10}$ Indeed both the average number [204, 205] and the probability distribution of the number [207–209] of outgoing particles in each mode obey a thermal spectrum.

The black hole, however, produces an effective potential barrier in the neighborhood of the horizon that backscatters part of the outgoing radiation, modifying the blackbody spectrum. The black hole absorption cross section, $\sigma_s$ (a.k.a. the greybody factor), depends upon the spin of the emitted particles $s$, their energy $Q$, and the mass of the black hole [210]. At high frequencies ($Q r_s \gg 1$) the greybody factor for each kind of particle must approach the geometrical optics limit. The integrated power emission is reasonably well approximated taking such a high energy limit. Thus, for illustrative simplicity, in what follows we adopt the geometric optics approximation, where the black hole acts as a perfect absorber of a slightly larger radius, with emitting area given by [210]

$$
A_4 = 27 \pi r_s^2 \, . \tag{321}
$$

Within this framework, we can conveniently write the greybody factor as a dimensionless constant normalized to the black hole surface area seen by the SM fields, $\Gamma_s = \sigma_s/A_4$, such that $\Gamma_{s=0} = 1$, $\Gamma_{s=1/2} \approx 2/3$, and $\Gamma_{s=1} \approx 1/4$.

All in all, a black hole emits particles with initial total energy between $(Q, Q + dQ)$ at a rate

$$
\frac{d\dot N_i}{dQ} = \frac{\sigma_s}{8 \pi^2} \, Q^2 \left[ \exp \left( \frac{Q}{T_{\rm BH}} \right) - (-1)^{2s} \right]^{-1} \tag{322}
$$

per degree of particle freedom $i$. The change of variables $u = Q/T_{\rm BH}$ brings Eq. (322) into a more familiar form,

$$
\dot N_i = \frac{27 \, \Gamma_s \, T_{\rm BH}}{128 \pi^3} \int \frac{u^2}{e^u - (-1)^{2s}} \, du \, . \tag{323}
$$

This expression can be easily integrated using (267) and (268), and yields

$$
\dot N_i = A_\pm \, \frac{27 \, \Gamma_s}{128 \pi^3} \, \Gamma(3) \, \zeta(3) \, T_{\rm BH} \, . \tag{324}
$$

Therefore, the black hole emission rate is found to be

$$
\dot N_i \approx 7.8 \times 10^{20} \left( \frac{T_{\rm BH}}{\rm GeV} \right) {\rm s}^{-1} \, , \tag{325}
$$

$$
\dot N_i \approx 3.8 \times 10^{20} \left( \frac{T_{\rm BH}}{\rm GeV} \right) {\rm s}^{-1} \, , \tag{326}
$$

$$
\dot N_i \approx 1.9 \times 10^{20} \left( \frac{T_{\rm BH}}{\rm GeV} \right) {\rm s}^{-1} \, , \tag{327}
$$

for particles with $s = 0, 1/2, 1$, respectively.

At any given time, the rate of decrease in the black hole mass is just the total power radiated

$$
\frac{d\dot M_{\rm BH}}{dQ} = - \sum_i g_i \, \frac{\sigma_s}{8 \pi^2} \, \frac{Q^3}{e^{Q/T_{\rm BH}} - (-1)^{2s}} \, , \tag{328}
$$

where $g_i$ is the number of internal degrees of freedom of particle species $i$. A straightforward calculation gives

$$
\dot M_{\rm BH} = - \sum_i g_i \, B_\pm \, \frac{27 \, \Gamma_s}{128 \pi^3} \, \Gamma(4) \, \zeta(4) \, T_{\rm BH}^2 \, . \tag{329}
$$

Assuming that the effective high energy theory contains approximately the same number of modes as the SM (i.e., $g_{s=1/2} = 90$, and $g_{s=1} = 27$), we find

$$
\frac{dM_{\rm BH}}{dt} = - 8.3 \times 10^{73}~{\rm GeV}^4 \, \frac{1}{M_{\rm BH}^2} \, . \tag{330}
$$

Ignoring thresholds, i.e., assuming that the mass of the black hole evolves according to (330) during the entire process of evaporation, we can obtain an estimate for the lifetime of the black hole,

$$
\tau_{\rm BH} = 1.2 \times 10^{-74}~{\rm GeV}^{-4} \int M_{\rm BH}^2 \, dM_{\rm BH} \, . \tag{331}
$$

Using $\hbar = 6.58 \times 10^{-25}$ GeV s, (331) can then be re-written as

$$
\tau_{\rm BH} \simeq 2.6 \times 10^{-99} \, (M_{\rm BH}/{\rm GeV})^3~{\rm s} \simeq 1.6 \times 10^{-26} \, (M_{\rm BH}/{\rm kg})^3~{\rm yr} \, . \tag{332}
$$

This implies that for a solar mass black hole, the lifetime is unobservably long, $10^{64}$ yr, but for a $10^{12}$ kg one, it is

$\sim 1.5 \times 10^{10}$ yr, about the present age of the universe. Therefore, any primordial black hole of this mass would be completing its evaporation and exploding right now.

The questions raised by primordial black holes motivate an empirical search for them. Most of the mass of these black holes would go into gamma rays (quarks and gluons would hadronize mostly into pions which in turn would decay to $\gamma$-rays and neutrinos), with an energy spectrum that peaks around 100 MeV. In 1976, Hawking and Don Page realized that $\gamma$-ray background observations place strong upper limits on the number of such black holes [211]. Specifically, by looking at the observed $\gamma$-ray spectrum, they set an upper limit of $10^{4}/{\rm pc}^3$ on the density of these black holes with masses near $5 \times 10^{11}$ kg. Even if primordial black holes never actually formed, thinking about them has led to remarkable physical insights because they linked three previously disparate areas of physics: general relativity, quantum theory, and thermodynamics [212].

EXERCISE 8.7 Very recently, it has become evident that a promising route towards reconciling the apparent mismatch of the fundamental scales of particle physics and gravity is to modify the short distance behavior of gravity at scales much larger than the Planck length. Such modification can be most simply achieved by introducing extra dimensions (generally thought to be curled-up) in the sub-millimeter range [213]. In the canonical example, spacetime is a direct product of ordinary 4-dimensional spacetime and a (flat) spatial $n$-torus with circumferences of length $2 \pi r_i$ ($i = 1, ..., n$), generally of common linear size $r_i = r_c$. The SM fields cannot propagate freely in the extra dimensions without conflict with observations. This is avoided by trapping the fields to a 3-dimensional brane-world. Applying Gauss’ law at $r \ll r_c$ and $r \gg r_c$, it is easily seen that the effective Planck scale is related to the fundamental scale of gravity $M_*$ simply by a volume factor,

$$
r_c = \frac{1}{M_*} \left( \frac{M_{\rm Pl}}{M_*} \right)^{2/n} = 2.0 \times 10^{-17} \left( \frac{\rm TeV}{M_*} \right) \left( \frac{M_{\rm Pl}}{M_*} \right)^{2/n} {\rm cm} \, , \tag{333}
$$

so that $M_*$ can range from $\sim$ TeV to $10^{19}$ GeV, for $r_c \leq 1$ mm and $n \geq 2$. If nature gracefully picked a sufficiently low-scale gravity, the first evidence for it would likely be the observation of microscopic black holes produced in particle collisions [214]. Although the black hole production cross section, $\mathcal{O}(M_W^{-1})$, is about 5 orders of magnitude smaller than QCD cross sections, $\mathcal{O}(\Lambda_{\rm QCD}^{-1})$, it was proposed that such black holes could be produced copiously at the LHC [215, 216] and in cosmic ray collisions [217, 218], and that these spectacular events could be easily filtered out of the QCD background. To a first approximation it is reasonable to assume that the evaporation process is dominated by the large number of SM brane modes [219, 220]. Therefore, the emission rate per degree of particle freedom $i$ of particles of spin $s$ with initial total energy between $(Q, Q + dQ)$ can be approximated by (322). The characteristic temperature of a $(4+n)$-dimensional black hole is [221]

$$
T_{\rm BH} = \frac{n + 1}{4 \pi r_s} \, , \tag{334}
$$

where

$$
r_s = \frac{1}{M_*} \left[ \frac{M_{\rm BH}}{M_*} \, \frac{2^n \, \pi^{(n-3)/2} \, \Gamma \left( \frac{n+3}{2} \right)}{n + 2} \right]^{1/(1+n)} \tag{335}
$$

is the Schwarzschild radius [222]. As in the conventional 4-dimensional case, we can conveniently rewrite the greybody factor as a dimensionless constant, $\Gamma_s = \sigma_s/A_{4 \subset 4+n}$, normalized to the black hole surface area

$$
A_{4 \subset 4+n} = 4 \pi \left( \frac{n + 3}{2} \right)^{2/(n+1)} \frac{n + 3}{n + 1} \, r_s^2 \tag{336}
$$

seen by the SM fields [219]. The upper limit on the accretion rate for a $(4+n)$-dimensional black hole is

$$
\left. \frac{dM}{dt} \right|_{\rm accr} \approx \pi \left( \frac{n + 3}{2} \right)^{2/(n+1)} \frac{n + 3}{n + 1} \, r_s^2 \, \epsilon \, , \tag{337}
$$

where $\epsilon$ is the nearby quark-gluon (or parton) energy density [223]. The highest earthly value of energy density of partonic matter is the one created at the LHC, $\epsilon_{\rm LHC} < 500~{\rm GeV/fm^3}$. Consider the case with $n = 6$, which is well motivated by string theory [224]. (i) Show that the black holes that could be produced at the LHC (or in any foreseeable accelerator built on Earth) would evaporate much too quickly to swallow the partons nearby. (ii) Determine the black hole lifetime. [Hint: For $n = 6$, you can evaluate the numerical results of [225] at $\langle Q \rangle$ and normalize the cross section results to the capture area $A_{4 \subset 4+n}$ to obtain $\Gamma_{s=1/2} \approx 0.33$ and $\Gamma_{s=1} \approx 0.34$.]

IX. MULTI-MESSENGER ASTRONOMY

For biological reasons our perception of the Universe is based on the observation of photons, most trivially by staring at the night-sky with our bare eyes. Conventional astronomy covers many orders of magnitude in photon wavelengths, from $10^{4}$ cm radio-waves to $10^{-14}$ cm gamma rays of GeV energy. This 60 octave span in photon frequency allows for a dramatic expansion of our observational capacity beyond the approximately one octave perceivable by the human eye.

The $\gamma$-ray sky has been monitored since 1968. The pioneering observations by the third Orbiting Solar Observatory (OSO-3) provided the first $\gamma$-ray sky map, with 621 events detected above 50 MeV [226]. In addition, these observations revealed the existence of an isotropic emission. The presence of an isotropic diffuse $\gamma$-ray background (IGRB) has been confirmed by the

[Figure 33 (plot): $E^2\, dN/dE$ versus energy, showing the Fermi LAT total EGB for foreground models A, B and C (50 months of data), the Galactic foreground modeling uncertainty, the resolved sources at |b| > 20°, and the IceCube cosmic neutrino data with the fit and its uncertainty bands.]

[Figure 34 (plot): $\log_{10}[E/{\rm eV}]$ versus redshift $z$; curve labels 3K, IR, VIS, UV; process labels γγ → e+e−, γp → e+e−p, γe → γe; markers for the Galactic Center, Mrk501, the maximum of star formation, and the formation of the first objects.]

FIG. 33: The open symbols represent the total extragalactic γ-ray background for different foreground (FG) models as reported by the Fermi Collaboration [230]. For details on the modeling of the diffuse Galactic foreground emission in the benchmark FG models A, B and C, see [230]. The cumulative intensity from resolved Fermi-LAT sources at latitudes $|b| > 20^\circ$ is indicated by a (grey) band. The solid symbols indicate the neutrino flux reported by the IceCube Collaboration [231]. The best fit to the data (extrapolated down to lower energies), $\Phi(E_\nu) = 2.06^{+0.4}_{-0.3} \times 10^{-18}\, (E_\nu/10^5~{\rm GeV})^{-2.46 \pm 0.12}~{\rm GeV^{-1}\, cm^{-2}\, s^{-1}\, sr^{-1}}$, is also shown for comparison [232].

FIG. 34: Mean interaction length for photons on the ultraviolet (UV), visible (VIS), infrared (IR), and microwave (3K) backgrounds. The electroweak scale is indicated by a dashed line. The redshifts of the star formation epoch and the famous γ-ray source Markarian 501 are also indicated [234].

Small Astronomy Satellite 2 (SAS-2) [227] and the Energetic Gamma Ray Experiment Telescope (EGRET) on board the Compton Gamma Ray Observatory (CGRO) [228, 229]. Very recently, the Fermi-LAT has released a new measurement of the IGRB spectrum from 100 MeV to 820 GeV at Galactic latitude $|b| > 20^\circ$ [230]; see Fig. 33. The LAT has also measured the extragalactic γ-ray background (EGB), which is the sum of the IGRB and the flux from detected sources. For the first time a deviation from a power-law shape in the high-energy part of the EGB and IGRB has been observed as an exponential cutoff with a break energy of about $E_\gamma = 280$ GeV. The origin of the IGRB is not yet fully understood. This leaves intriguing puzzles for the next generation of GeV γ-ray instruments to uncover [233]. What happens at higher energies?

Above a few 100 GeV the universe becomes opaque to the propagation of γ rays, because of $e^+e^-$ production on the radiation fields permeating the universe; see Fig. 34. The pairs synchrotron radiate on the extragalactic magnetic field before annihilation and so the photon flux is significantly depleted. Moreover, the charged particles also suffer deflections on the $\vec{B}$-field, camouflaging the exact location of the sources. In other words, the injection photon spectrum is significantly modified en route to Earth. This modification becomes dramatic at around $10^{6}$ GeV, where interaction with the CMB dominates and the photon mean free path is smaller than the Galactic radius.

Therefore, to study the high energy behavior of distant sources we need new messengers. Nowadays the best candidates to probe the high energy universe are cosmic rays, neutrinos, and gravitational waves. Of course, in doing multi-messenger astronomy one has to face new challenges. It is to these that we now turn.

A. Cosmic rays

In 1912 Hess carried out a series of pioneering balloon flights during which he measured the levels of ionizing radiation as high as 5 km above the Earth's surface [235]. His discovery of increased radiation at high altitude revealed that we are bombarded by ionizing particles from above. These cosmic ray particles are now known to consist primarily of protons, helium, carbon, nitrogen and other heavy ions up to iron.

Below $10^{5}$ GeV the flux of particles is sufficiently large that individual nuclei can be studied by detectors carried aloft in balloons or satellites. From such direct experiments we know the relative abundances and the energy spectra of a variety of atomic nuclei, protons, electrons and positrons as well as the intensity, energy and spatial distribution of X-rays and γ-rays. Measurements of energy and isotropy showed conclusively that one obvious source, the Sun, is not the main source. Only below 100 MeV kinetic energy or so, where the solar wind shields protons coming from outside the solar system, does the Sun dominate the observed proton flux. Spacecraft missions far out into the solar system, well away from the confusing effects of the Earth's atmosphere and magnetosphere, confirm that the abundances around 1 GeV are strikingly similar to those found in the ordinary material of the solar system. Exceptions are the overabundance of elements like lithium, beryllium, and boron, originating from the spallation of heavier nuclei in the interstellar medium.

EXERCISE 9.1 Consider a simple model of cosmic rays in the Galaxy (height $H$, radius $R$) in which the net diffusion of cosmic rays is mainly perpendicular to the Galactic disk. In this case the density of cosmic rays depends only on the vertical coordinate $z$ and follows the diffusion equation

$$\frac{\partial n}{\partial t} = D\, \frac{\partial^2 n}{\partial z^2} + Q(z, t)\,, \tag{338}$$

where $D = \beta c \lambda/3$ is the diffusion coefficient, $\lambda$ is the mean free path, and the source term is given by $Q(z, t)$. Use the approximation $Q(z, t) = Q_0\, \delta(z)$ to describe a time-independent concentration of stars close to $z = 0$, where $\delta(z)$ is the Dirac delta function (see Appendix F). (i) Find the steady-state solution to the diffusion equation given a vanishing cosmic ray density at the edges of the Galaxy, $n(z = +H) = n(z = -H) = 0$. (ii) Calculate the cosmic-ray column density

$$N = \int_{-H}^{+H} n(z)\, dz \tag{339}$$

and determine the average residence time $\tau_{\rm res}$ from $N = Q_0 \tau_{\rm res}$. What is the mean free path for $H = 500$ pc and $\tau_{\rm res} = 10^{7}$ yr?
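A minimal numerical sketch of Exercise 9.1 follows. It assumes the steady-state profile $n(z) = Q_0 (H - |z|)/(2D)$, which satisfies $D\, n'' = -Q_0\, \delta(z)$ with $n(\pm H) = 0$, so that $N = Q_0 H^2/(2D)$ and $\tau_{\rm res} = H^2/(2D)$. The function names, the unit conversion constants, and the choice $\beta = 1$ are illustrative assumptions and are not part of the lectures.

```python
# Sketch for Exercise 9.1 under the assumptions stated in the lead-in
# (all names here are hypothetical helpers, not from the lectures).
PC_IN_M = 3.0857e16   # one parsec in metres
YR_IN_S = 3.156e7     # one year in seconds
C_LIGHT = 2.998e8     # speed of light in m/s


def steady_state_density(z, q0, d, h):
    """n(z) = Q0 (H - |z|) / (2 D) inside the slab, zero outside."""
    return q0 * (h - abs(z)) / (2.0 * d) if abs(z) <= h else 0.0


def column_density(q0, d, h, n_steps=2000):
    """Trapezoidal estimate of N = integral of n(z) dz from -H to +H."""
    dz = 2.0 * h / n_steps
    total = 0.0
    for i in range(n_steps):
        z_lo = -h + i * dz
        z_hi = z_lo + dz
        total += 0.5 * dz * (steady_state_density(z_lo, q0, d, h)
                             + steady_state_density(z_hi, q0, d, h))
    return total


def mean_free_path(h, tau_res, beta=1.0):
    """Invert tau_res = H^2 / (2 D) with D = beta c lambda / 3."""
    d = h ** 2 / (2.0 * tau_res)
    return 3.0 * d / (beta * C_LIGHT)


H = 500.0 * PC_IN_M          # slab half-height in metres
TAU_RES = 1.0e7 * YR_IN_S    # residence time in seconds
D = H ** 2 / (2.0 * TAU_RES) # diffusion coefficient in m^2/s
LAMBDA_PC = mean_free_path(H, TAU_RES) / PC_IN_M

print(f"D = {D:.2e} m^2/s, mean free path = {LAMBDA_PC:.2f} pc")
```

With $Q_0 = 1$ the numerically integrated column density equals the residence time, which gives a quick consistency check of the analytic profile; the resulting mean free path is of order a tenth of a parsec for relativistic particles.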

FIG. 35: Compilation of measurements of the differential energy spectrum of cosmic rays. The dotted line shows an $E^{-3}$ power-law for comparison. Approximate integral fluxes (per steradian) are also shown.

Above $10^{5}$ GeV, the flux becomes so low that only ground-based experiments with large apertures and long exposure times can hope to acquire a significant number of events. Such experiments exploit the atmosphere as a giant calorimeter. The incident cosmic radiation interacts with the atomic nuclei of air molecules and produces extensive air showers which spread out over large areas. Already in 1938, Auger concluded from the size of extensive air showers that the spectrum extends up to and perhaps beyond $10^{6}$ GeV [236, 237]. Nowadays substantial progress has been made in measuring the extraordinarily low flux ($\sim 1$ event ${\rm km^{-2}\, yr^{-1}}$) above $10^{10}$ GeV. Continuously running experiments using arrays of particle detectors on the ground and/or fluorescence detectors which track the cascade through the atmosphere have detected events with primary particle energies somewhat above $10^{11}$ GeV [238].

The Pierre Auger Observatory employs the two detection methods [239]. It consists of an array of about 1,600 water Cherenkov surface detectors (SD) deployed over a triangular grid of 1.5 km spacing and covering an area of 3,000 ${\rm km^2}$ [240]. A SD event is formed when at least 3 non-aligned stations selected by the local station trigger are in spatial and temporal coincidence. The ground array is overlooked by 24 fluorescence telescopes, grouped in four sites, making up the fluorescence detector (FD) [241]. The FD observes the longitudinal development of the shower in the atmosphere by detecting the fluorescence light emitted by excited nitrogen molecules and Cherenkov light induced by shower particles in air. The two detection methods have different strengths, and together allow for large statistics data samples and unrivaled control over systematic uncertainties.

The FD provides a calorimetric measurement of the primary particle energy, only weakly dependent on theoretical models. The most common strategy to determine the nature of the primary cosmic ray is to study the longitudinal shower profile of the electromagnetic component in the atmosphere. The slant depth is the amount of atmosphere penetrated by a cosmic ray shower at a given point in its development, and is customarily denoted by the symbol $X$. The value of $X$ is calculated by integrating the density of air from the point of entry of the air shower at the top of the atmosphere, along the trajectory of the shower, to the point in question. The depth of the shower maximum $X_{\rm max}$ is the position of the maximum of energy deposition per atmospheric slant depth of an extensive air shower. Lighter primaries penetrate the atmosphere deeper than heavier primaries. In addition, due to the larger number of nucleons and the larger cross section, the event-by-event fluctuations of $X_{\rm max}$ should be smaller for heavier nuclei. Therefore, the first two moments of the $X_{\rm max}$ distribution, which are the mean

$\langle X_{\rm max} \rangle$ and standard deviation $\sigma(X_{\rm max})$ provide good discriminators between different primary cosmic rays; for details see e.g. [242].

The mechanism(s) responsible for imparting an energy of more than one Joule to a single elementary particle continues to present a major enigma to high energy physics [243]. It is reasonable to assume that, in order to accelerate a proton to energy $E$ in a magnetic field $B$, the size $R$ of the accelerator must encompass the gyroradius of the particle: $R > R_{\rm gyro} \sim E/B$, i.e. the accelerating magnetic field must contain the particle's orbit. By dimensional analysis, this condition yields a maximum energy $E \sim \gamma B R$. The $\gamma$-factor has been included to allow for the possibility that we may not be at rest in the frame of the cosmic accelerator, resulting in the observation of boosted particle energies. Opportunity for particle acceleration to the highest energies is limited to dense regions where exceptional gravitational forces create relativistic particle flows. All speculations involve collapsed objects and we can therefore replace $R$ by the Schwarzschild radius $R \sim GM/c^2$ to obtain $E < \gamma B M$.

At this point a reality check is in order. Such a dimensional analysis applies to the Fermilab accelerator: 10 kilogauss fields over several kilometers (covered with a repetition rate of $10^{5}$ revolutions per second) yield 1 TeV. The argument holds because, with optimized design and perfect alignment of magnets, the accelerator reaches efficiencies matching the dimensional limit. It is highly questionable that nature can achieve this feat.

Given the microgauss magnetic field of our galaxy, no structures are large or massive enough to reach the energies of the highest energy cosmic rays. Dimensional analysis therefore limits their sources to extragalactic objects. A common speculation is that there may be relatively nearby active galactic nuclei powered by billion solar mass black holes. With kilogauss fields we reach $10^{11}$ GeV. The jets (blazars) emitted by the central black hole could reach similar energies in accelerating sub-structures boosted in our direction by a $\gamma$-factor of 10, possibly higher.

EXERCISE 9.2 (i) Derive the magnetic field strength needed to hold a charge on a circular orbit of radius $R$ given its momentum $p$. Assume that the magnetic field is uniform, that the motion of the particle is perpendicular to the magnetic field, and let $\beta \sim 1$. (ii) Given the circumference ($\sim 26.659$ km) of the LHC, determine the uniform magnetic field strength needed to keep 7 TeV protons in orbit. (iii) Using this magnetic field strength, find the size required for an LHC-like accelerator to launch particles to cosmic-ray energies $\sim 10^{11}$ GeV. Compare this with the orbits in the solar system and estimate the cost of the accelerator. (iv) Which of the following astrophysical objects are able to keep ultrahigh energy cosmic rays in orbit? Neutron stars ($R \sim 10^{-13}$ pc, $B \sim 10^{12}$ G), AGN jets ($R \sim 1$ kpc, $B \sim 10^{-5}$ G), supernova remnants ($R = 1$ pc, $B \sim 10^{-4}$ G). Consider protons and iron nuclei of energy $10^{11}$ GeV and that you are at rest in the frame of the cosmic accelerator.

The almost structureless power law spectrum spans many decades of energy, $10^{1}~{\rm GeV} < E < 10^{11}~{\rm GeV}$. A close examination of Fig. 35 reveals three major features: (i) the steepening of the spectrum dubbed the knee, centered at $10^{6.6}$ GeV [244]; (ii) a pronounced hardening of the spectrum at about $10^{9.6}$ GeV, the so-called ankle feature [245]; (iii) a cutoff around $10^{10.6}$ GeV [246, 247]. Three additional, more subtle features have been recently spotted between the knee and the ankle: a hardening of the spectrum at around $10^{7.3}$ GeV [248, 249] followed by two softenings at $\approx 10^{7.9}$ GeV [248, 249] and $\approx 10^{8.5}$ GeV [250, 251]. The latter is traditionally referred to as the second knee.

The variations of the spectral index reflect various aspects of cosmic ray production, source distribution, and propagation. The first and second knee have unequivocal explanations, as reflecting the maximum energy of Galactic magnetic confinement or acceleration capability of the sources, both of which grow linearly in the charge $Z$ of the nucleus; the first knee being where protons drop out and the second knee where the highest-$Z$ Galactic cosmic rays drop out. As the energy increases above the second knee to the ankle, the nuclear composition switches from heavy to light [252], whereas the cosmic ray arrival directions are isotropic to high accuracy throughout the entire range [253–255]. Lastly, as the energy increases above the ankle, not only does the spectrum harden significantly, but the composition gradually becomes heavier (interpreting the data using conventional extrapolations of accelerator-constrained particle physics models) [256, 257].

The observed evolution in the extragalactic cosmic ray composition and spectral index presents a complex puzzle. A pure proton composition might be compatible with the observed spectrum of extragalactic cosmic rays [258] when allowance is made for experimental uncertainties in the energy scale and the fact that the real local source distribution is not homogeneous and continuous [259] (although the sharpness of the ankle is difficult to accommodate). However, a pure proton composition is incompatible with the $X_{\rm max}$ and $\sigma(X_{\rm max})$ distributions reported by the Auger Collaboration [256, 257] unless current extrapolations of particle physics are incorrect. On the other hand, models which fit the spectrum and composition at the highest energies predict a deep gap between the end of the Galactic cosmic rays and the onset of the extragalactic cosmic rays. Models can be devised to fill this gap: fine-tuning is required to position this new population so as to just fit and fill the gap [260, 261], unless we consider interactions in the region surrounding the accelerator as illustrated in Fig. 36.

The discovery of a suppression above $10^{10.6}$ GeV was first reported by the HiRes and Auger collaborations [246, 247] and later confirmed by the Telescope Array Collaboration [263]; by now the significance is well

[Figure 36 (schematic); panel labels: source environment, EBL/CMB, detection; particle label: cosmic ray.]

FIG. 36: Sources (yellow stars) inject cosmic ray nuclei with a power law in energy into a surrounding region of radiation and turbulent magnetic fields. After propagation through this local environment and intergalactic space, these cosmic rays and their spallation products are detected at Earth. The photon energies in the source environment are characteristically of much higher energy than in the extragalactic background light (EBL) [262].

in excess of $20\sigma$ compared to a continuous power law extrapolation beyond the ankle feature [264]. This suppression is consistent with the Greisen-Zatsepin-Kuzmin (GZK) prediction that interactions with cosmic background photons will rapidly degrade cosmic ray energies [265, 266]. Intriguingly, however, there are also indications that the source of the suppression may be more complex than originally anticipated. The trend toward heavier composition above the ankle could reflect the endpoint of cosmic acceleration, with heavier nuclei dominating the composition near the end of the spectrum, which coincidentally falls off near the expected GZK cutoff region [267]. If this were the case, the suppression would constitute an imprint of the accelerator characteristics rather than energy loss in transit. It is also possible that a mixed or heavy composition is emitted from the sources, and photodisintegration of nuclei and other GZK energy losses suppress the flux.

The main reason why this impressive set of data fails to reveal the origin of the particles is undoubtedly that their directions have been scrambled by the microgauss Galactic magnetic fields. However, above $10^{10}$ GeV proton astronomy could still be possible because the arrival directions of electrically charged cosmic rays are no longer scrambled by the ambient magnetic field of our own Galaxy. Protons point back to their sources with an accuracy determined by their gyroradius in the intergalactic magnetic field $B$,

$$\theta \simeq \frac{d}{R_{\rm gyro}} = \frac{dB}{E}\,, \tag{340}$$

where $d$ is the distance to the source. Scaled to units relevant to the problem,

$$\frac{\theta}{0.1^\circ} \simeq \frac{(d/{\rm Mpc})\, (B/{\rm nG})}{E/10^{11.5}~{\rm GeV}}\,. \tag{341}$$

Speculations on the strength of the intergalactic magnetic field range from $10^{-7}$ to $10^{-9}$ G. For the distance to a nearby galaxy at 100 Mpc, the resolution may therefore be anywhere from sub-degree to nonexistent. Moreover, neutrons with energy $\gtrsim 10^{9}$ GeV have a boosted $c\tau_n$ sufficiently large to serve as Galactic messengers [268].11 The decay mean free path of a neutron is $c\, \gamma_n\, \tau_n = 9.15\, (E_n/10^{9}~{\rm GeV})$ kpc, the lifetime being boosted from its rest-frame value, $\tau_n = 886$ s, to its lab value by $\gamma_n = E_n/m_n$. It is therefore reasonable to expect that the arrival directions of the very highest energy cosmic rays may provide information on the location of their sources.12

11 Neutron astronomy from the nearby radio galaxy Centaurus A may also be possible [269].

12 For a more extensive discussion of this subject see e.g. [270].

B. Cosmic neutrinos

For a deep, sharply focused examination of the universe a telescope is needed which can observe a particle that is not much affected by the gas, dust, and swirling magnetic fields it passes on its journey. The neutrino is the best candidate. As we have seen, neutrinos constitute much of the total number of elementary particles in the universe, and these neutral, weakly-interacting particles come to us almost without any disruption straight from their sources, traveling at very close to the speed of light. A (low energy) neutrino in flight would not notice a barrier of lead fifty light years thick. When we are able to see outwards in neutrino light we will no doubt receive a wondrous new view of the universe.

EXERCISE 9.3 In 1987, the astronomical world was electrified with the news of a supernova exploding in the Large Magellanic Cloud, a companion to the Milky Way, at a distance of 150,000 ly. It was the nearest supernova to have gone off in 400 yr, and was studied in great detail. Its luminosity was enormous; the explosion released as much visible light energy in a few weeks as the Sun will emit in its entire lifetime of $10^{10}$ yr. It was easily visible to the naked eye from the Southern hemisphere. However, models of the mechanisms taking place in supernovae predict that the visible light represents only 1% of the total energy of the supernova; there is 100 times more energy emitted in the form of neutrinos, in a blast lasting only a few seconds. (i) Calculate the total amount of energy emitted by the supernova in neutrinos. Express your answer in Joules. (ii) Each neutrino has an energy of roughly $\langle E_\nu \rangle \sim 1.5 \times 10^{-12}$ J. Calculate how many neutrinos are emitted by the supernova. (This is an easy calculation, but will give you a very large number.) (iii) Kamiokande is one of the largest neutrino detectors. In 1987, it consisted of 2.140 kton of water (it has since been expanded). Calculate how many electron neutrinos should have been detected by Kamiokande if the detection efficiency at $\langle E_{\nu_e} \rangle$ is about 60% [271].

We have seen that MeV neutrinos are produced by nuclear reaction chains in the central core of stars. Moving up in energy, neutrinos would also be inevitably produced in many of the most luminous and energetic objects in the universe. Whatever the source, the machinery which accelerates cosmic rays will inevitably also produce neutrinos, guaranteeing that high energy neutrinos surely arrive to us from the cosmos.

Neutrino detectors must generally be placed deep underground, or in water, in order to escape the backgrounds caused by the inescapable rain of cosmic rays upon the atmosphere. These cosmic rays produce many muons which penetrate deeply into the earth, in even the deepest mines, but of course with ever-decreasing numbers with depth. Hence the first attempts at high energy neutrino astronomy have been initiated underwater and under ice [272].

The IceCube facility is located near the Amundsen-Scott station below the surface of the Antarctic ice sheet at the geographic South Pole [273]. The main part of the detector is the InIce array, which covers a cubic kilometer of Antarctic glacial ice instrumented with digital optical modules (DOMs) that detect Cherenkov light [274]. The DOMs are attached to km-long supply and read-out cables called strings. Each string carries 60 DOMs spaced evenly along 1 km. The full baseline design of 86 strings was completed in December 2010. In addition to the InIce array, IceCube also possesses an air shower array called IceTop which comprises 80 stations, each of which consists of two tanks of water-ice instrumented with 2 DOMs to detect Cherenkov light [275]. The hybrid observations of air showers in the InIce and IceTop arrays have mutual benefits, namely significant air shower background rejection (for neutrino studies) and an improved air shower muon detection (for cosmic ray studies).

In 2012, the IceCube Collaboration famously announced an observation of two $\sim 1$ PeV neutrinos discovered in a search for the nearly guaranteed cosmogenic neutrinos (which are expected to be produced as secondaries in the GZK chain reaction [276]) [277]. The search technique was later refined to extend the neutrino sensitivity to lower energies [278, 279], resulting in the discovery of an additional 26 neutrino candidates with energies between 50 TeV and 2 PeV, constituting a $4.1\sigma$ excess for the combined 28 events compared to expectations from neutrino and muon backgrounds generated in Earth's atmosphere [280]. Interpretation of these results, however, does not appear to be entirely straightforward. For instance, if one makes the common assumption of an unbroken $E_\nu^{-2}$ neutrino energy spectrum, then one expects to observe about 8-9 events with higher energies than the two highest energy events observed thus far. The compatibility between IceCube observations and the hypothesis of an unbroken power-law spectrum requires a rather steep spectrum, $\Phi(E_\nu) \propto E_\nu^{-2.3}$ [281]. Very recently, the IceCube results have been updated [282–284]. At the time of writing, 54 events have been reported in four years of IceCube data taking (1347 days between 2010 and 2014). The data are consistent with expectations for equal fluxes of all three neutrino flavors [285]. The best-fit power law is $E_\nu^2\, \Phi(E_\nu) = 2.2 \pm 0.7 \times 10^{-8}\, (E_\nu/100~{\rm TeV})^{-0.58}~{\rm GeV\, cm^{-2}\, s^{-1}\, sr^{-1}}$ and rejects a purely atmospheric explanation at more than $5.7\sigma$. Splitting the data into two sets, one from the northern sky and one from the southern sky, allows for a satisfactory power law fit with a different spectral index for each hemisphere. The best-fit spectral index in the northern sky is $\gamma_N = 2.0^{+0.3}_{-0.4}$, whereas in the southern sky it is $\gamma_S = 2.56 \pm 0.12$ [283]. The discrepancy with respect to a single power law corresponds to $1.1\sigma$ and may indicate that the neutrino flux is anisotropic [286, 287]. The largest concentration of events is at or near the Galactic center, within uncertainties of their reconstructed arrival directions [288–290]. There are numerous proposed explanations for the origin of the IceCube events [291]. However, considerably more data are required before the final verdict can be given.

C. Gravitational waves

Ever since Newton in the XVII century, we have learned that gravity is a force that acts immediately on an object. In Einstein's theory of general relativity, however, gravity is not a force at all, but a curvature in space [61]. In other words, the presence of a very massive body does not affect probed objects directly; it warps the space around it first and then the objects move in the curved space. Inherent in such a redefinition of gravity is the concept of gravitational waves: as massive bodies move around, disturbances in the curvature of spacetime can spread outward, much like a pebble tossed into a pond will cause waves to ripple outward from the source. Propagating at (or near) the speed of light, these disturbances do not travel through spacetime

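[Figures 37–40 (diagrams): rings of test particles drawn in the $x$–$y$ plane; Fig. 40 carries the polarization labels.]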

FIG. 37: Initial configuration of test particles on a circle of radius $L$ before a gravitational wave hits them.

FIG. 38: The effect of a plus-polarized gravitational wave on a ring of particles. The amplitude shown in the figure is roughly $h = 0.5$. Gravitational waves passing through the Earth are many billion billion times weaker than this.

FIG. 39: The effect of cross-polarized gravitational waves on a ring of particles.

FIG. 40: Two linearly independent polarizations of a gravitational wave are illustrated by displaying their effect on a ring of free particles arrayed in a plane perpendicular to the direction of the wave. The figure shows the distortions in the original circle that the wave produces if it carries the plus-polarization or the cross-polarization. In general relativity there are only 2 independent polarizations. The ones shown here are orthogonal to each other and the polarizations are transverse to the direction of the wave.

as such; the fabric of spacetime itself is oscillating!

The simplest example of a strong source of gravitational waves is a spinning neutron star with a small mountain on its surface. The mountain's mass will cause curvature of the spacetime. Its movement will stir up spacetime, much like a paddle stirring up water. The waves will spread out through the universe at the speed of light, never stopping or slowing down.

As these waves pass a distant observer, that observer will find spacetime distorted in a very particular way: distances between objects will increase and decrease rhythmically as the wave passes. To visualize this effect, consider a perfectly flat region of spacetime with a group of motionless test particles lying in a plane, as shown in Fig. 37. When a weak gravitational wave arrives, passing through the particles along a line perpendicular to the ring of radius $L$, the test particles will oscillate in a cruciform manner, as indicated in Figs. 38 and 39. The area enclosed by the test particles does not change, and there is no motion along the direction of propagation. The principal axes of the ellipse become $L + \Delta L$ and $L - \Delta L$. The amplitude of the wave, which measures the fraction of stretching or squeezing, is $h = \Delta L/L$. Of course the size of this effect will go down the farther the observer is from the source. Namely, $h \propto d^{-1}$, where $d$ is the source distance. Any gravitational waves expected to be seen on Earth will be quite small, $h \sim 10^{-20}$.

The frequency, wavelength, and speed of a gravitational wave are related through $c = \lambda \nu$. The polarization of a gravitational wave is just like polarization of a light wave, except that the polarizations of a gravitational wave are at $45^\circ$, as opposed to $90^\circ$. In other words, the effect of a cross-polarized gravitational wave ($h_\times$) on test particles would be basically the same as a wave with plus-polarization ($h_+$), but rotated by $45^\circ$. The different polarizations are summarized in Fig. 40.

In general terms, gravitational waves are radiated by very massive objects whose motion involves acceleration, provided that the motion is not perfectly spherically symmetric (like a spinning, expanding or contracting sphere) or cylindrically symmetric (like a spinning disk). For example, two objects orbiting each other in a quasi-Keplerian planar orbit will radiate [292, 293]. The power given off by a binary system of masses $M_1$ and $M_2$ separated by a distance $R$ is [294]

$$P = -\frac{32}{\pi}\, \frac{G^4}{c^5}\, \frac{(M_1 M_2)^2 (M_1 + M_2)}{R^5}\,. \tag{342}$$

For the Earth-Sun system $R$ is very large and $M_1$ and $M_2$ are relatively very small, yielding

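$$P = -\frac{32}{\pi}\, \frac{\left(6.7 \times 10^{-11}\ \mathrm{\frac{m^3}{kg\, s^2}}\right)^4 \left(6 \times 10^{24}\ \mathrm{kg} \cdot 2 \times 10^{30}\ \mathrm{kg}\right)^2 \left(6 \times 10^{24}\ \mathrm{kg} + 2 \times 10^{30}\ \mathrm{kg}\right)}{\left(3 \times 10^{8}\ \mathrm{m/s}\right)^5 \left(1.5 \times 10^{11}\ \mathrm{m}\right)^5} = -313\ \mathrm{W} . \tag{343}$$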
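The arithmetic in Eq. (343) is easy to check numerically. The sketch below evaluates the magnitude of Eq. (342) with the rounded inputs used in Eq. (343); with these inputs the result comes out at a few hundred watts, in line with the value quoted there. The numerical prefactor is left as a parameter: its default follows Eq. (342) as written in these notes (32/π), whereas the quadrupole-formula result for circular orbits is usually quoted with 32/5, which lowers the estimate to roughly 200 W. The helper names, the circular-orbit estimate of the kinetic energy, and the value adopted for the age of the universe are assumptions made only for this illustration.

```python
import math

# Rounded constants as used in Eq. (343)
G = 6.7e-11        # m^3 kg^-1 s^-2
C = 3.0e8          # m/s
M_EARTH = 6.0e24   # kg
M_SUN = 2.0e30     # kg
R_ORBIT = 1.5e11   # m

# Assumed values, not taken from the text
SECONDS_PER_YEAR = 3.156e7
AGE_UNIVERSE_YR = 1.38e10

def gw_power(m1, m2, r, prefactor=32 / math.pi):
    """Magnitude of the radiated power in Eq. (342), in watts."""
    return prefactor * G**4 / C**5 * (m1 * m2) ** 2 * (m1 + m2) / r**5

def orbital_kinetic_energy(m_small, m_big, r):
    """Kinetic energy of a light body on a circular orbit: K = G m M / (2 r)."""
    return G * m_small * m_big / (2 * r)

def decay_time_years(energy, power):
    """Crude timescale: energy reservoir divided by a constant loss rate."""
    return energy / power / SECONDS_PER_YEAR

if __name__ == "__main__":
    p = gw_power(M_EARTH, M_SUN, R_ORBIT)
    k = orbital_kinetic_energy(M_EARTH, M_SUN, R_ORBIT)
    t = decay_time_years(k, p)
    print(f"radiated power      ~ {p:.0f} W")
    print(f"orbital kinetic E   ~ {k:.2e} J")
    print(f"decay timescale     ~ {t:.1e} yr "
          f"(~{t / AGE_UNIVERSE_YR:.0e} ages of the universe)")
```

Dividing the orbital kinetic energy by a constant loss rate overestimates nothing important here: the point is only the order of magnitude, which exceeds the assumed age of the universe by a very large factor.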

Thus, the total power radiated by the Earth-Sun system in the form of gravitational waves is truly tiny compared to the total electromagnetic radiation given off by the Sun, which is about $3.86 \times 10^{26}$ W. The energy of the gravitational waves comes out of the kinetic energy of the Earth's orbit. This slow radiation from the Earth-Sun system could, in principle, steal enough energy to drop the Earth into the Sun. Note however that the kinetic energy of the Earth orbiting the Sun is about $2.7 \times 10^{33}$ J. As the gravitational radiation is given off, it takes about 300 J/s away from the orbit. At this rate, it would take many billion times more than the current age of the universe for the Earth to fall into the Sun.

Although the power radiated by the Earth-Sun system is minuscule, we can point to other sources for which the radiation should be substantial. One important example is the pair of stars (one of which is a pulsar) discovered by Hulse and Taylor [295]. The characteristics of the orbit of this binary system can be deduced from the Doppler shifting of radio signals given off by the pulsar. Each of the stars has a mass of about 1.4 $M_\odot$. Also, their orbit is about 75 times smaller than the distance between the Earth and Sun, which means the distance between the two stars is just a few times larger than the diameter of our own Sun. This combination of greater masses and smaller separation means that the energy given off by the Hulse-Taylor binary will be far greater than the energy given off by the Earth-Sun system, roughly $10^{22}$ times as much.

The information about the orbit can be used to predict just how much energy (and angular momentum) should be given off in the form of gravitational waves. As the energy is carried off, the orbit will change; the stars will draw closer to each other. This effect of drawing closer is called an inspiral, and it can be observed in the pulsar's signals. The measurements on this system were carried out over several decades, and it was shown that the changes predicted by gravitational radiation in general relativity matched the observations very well, providing the first experimental evidence for gravitational waves.

Inspirals are very important sources of gravitational waves. Any time two compact objects (white dwarfs, neutron stars, or black holes) come close to each other, they send out intense gravitational waves. As the objects come closer and closer to each other (that is, as $R$ becomes smaller and smaller), the gravitational waves become more and more intense. At some point these waves should become so intense that they can be directly detected by their effect on objects on the Earth. This direct detection is the goal of several large experiments around the world.

The great challenge of this type of detection, though, is the extraordinarily small effect the waves would produce on a detector. The amplitude of any wave will fall off as the inverse of the distance from the source. Thus, even waves from extreme systems like merging binary black holes die out to very small amplitude by the time they reach the Earth. For example, the amplitude of waves given off by the Hulse-Taylor binary as seen on Earth would be roughly $h \approx 10^{-26}$. However, some gravitational waves passing the Earth could have somewhat larger amplitudes, $h \approx 10^{-20}$ [292, 293]. For an object 1 m in length, this means that its ends would move by $10^{-20}$ m relative to each other. This distance is about a billionth of the width of a typical atom.

A simple device to detect this motion is the laser interferometer, with separate masses placed many hundreds of meters to several kilometers apart acting as two ends of a bar. Ground-based interferometers are now operating and taking data. The most sensitive is the Laser Interferometer Gravitational Wave Observatory (LIGO) [296]. This is actually a set of three devices: one in Livingston, Louisiana; the other two (essentially on top of each other) in Hanford, Washington. Each consists of two light storage arms which are 2 to 4 km in length. These are at 90° angles to each other, and consist of large vacuum tubes running the entire 4 kilometers. A passing gravitational wave will then slightly stretch one arm as it shortens the other. This is precisely the motion to which an interferometer is most sensitive.

On September 14, 2015 at 09:50:45 UTC gravitational waves were detected by both of the twin LIGO detectors [297]. The waves originated in the collision and merger of two black holes (with 29 and 36 $M_\odot$)

approximately 400 Mpc from Earth. About 3 times the mass of the sun was converted into gravitational waves in a fraction of a second, with a peak power output about 50 times that of the whole visible universe. This detection inaugurates a new era of astronomy in which gravitational waves are tools for studying the most mysterious and exotic objects in the universe.

EXERCISE 9.4 (i) Estimate the power radiated in gravitational waves by a neutron star of $M_\star = 1.4 M_\odot$ orbiting a black hole of $M_{\rm BH} = 20 M_\odot$, assuming the orbital radius is $R = 6 G M_{\rm BH}/c^2$. (ii) If the kinetic energy of the neutron star orbiting the black hole is about $7 \times 10^{47}$ J, how much time will it take the neutron star to fall into the black hole?

D. Looking ahead

The recent observation of a diffuse astrophysical flux of high energy neutrinos and the direct detection of gravitational waves represent the first light in the nascent field of multimessenger astronomy. The search for correlations in the different data samples has already started [298–301]. Thus far, there are no excesses beyond those expected at random.

An in-depth exploration of the neutrino universe requires a next-generation IceCube detector. IceCube-Gen2 is based upon the robust design of the current detector [302]. The goal for this new observatory is to deliver statistically significant samples of very high energy astrophysical neutrinos, in the $10^6\ {\rm GeV} \lesssim E_\nu \lesssim 10^9\ {\rm GeV}$ range, and yield hundreds of neutrinos across all flavors at energies above 100 TeV. This will enable detailed spectral studies, significant point source detections, and new discoveries. Companion experiments in the deep Mediterranean are moving into construction phase, and the space-based CHerenkov from Astrophysical Neutrinos Telescope (CHANT) is in the R&D phase [303].

Resolving the fundamental questions of UHECR composition and origins, and investigating particle physics above accelerator energies, will require both enhanced experimental techniques implemented at the existing observatories and a significant increase in exposure to catch the exceedingly rare highest energy events. In the very near future the upgrade of the Pierre Auger Observatory, named Auger Prime, will allow: (i) a precise reconstruction of the mass-dependent energy spectrum; (ii) the identification of primaries, event-by-event, up to the highest energies; (iii) a systematic study of arrival direction(s) of an enhanced proton data sample [304]. Even before we know the results from Auger Prime it seems clear that still larger aperture observatories with much better energy and $X_{\rm max}$ resolution will be called for in order to measure the spectra and composition distribution of individual sources. It is inspiring to note that some 5 million UHECRs above about $5.5 \times 10^{10}$ GeV strike the Earth's atmosphere each year, from which we currently collect only about 50 or so with present observatories. In this sense, there exists some 5 orders of magnitude room for improvement! It may well be that the best hope to make inroads in this area is to take the search for UHECR sources into space, from which a huge volume of atmosphere can be viewed using the fluorescence technique. To this end several pathfinder efforts are underway to develop the requisite technologies. For example, in 2017 a NASA/CNES supported mission to fly a super-pressure stratospheric balloon with a fluorescence detector will take place. Such balloons can fly for hundreds of days, and may observe the first air showers from above. Eventually these technologies may lead to a permanently orbiting satellite to detect UHECRs. An optimist might even imagine an eventual constellation of satellites to tap the remaining 5 orders of magnitude of UHECR "luminosity", accessing naturally occurring particle beams at energies far in excess of those available to terrestrial colliders with an event rate opening up a new window on beyond-the-standard-model phenomena.

Acknowledgments

I'm thankful to Michael Unger for entertaining discussions and Walter Lewin for a very thorough reading of the notes and insightful comments. I'd also like to thank Heinz Andernach for helpful remarks. This work has been supported by U.S. National Science Foundation (NSF) CAREER Award PHY1053663 and by the National Aeronautics and Space Administration (NASA) Grant No. NNX13AH52G.

[1] G. Galilei, Sidereus Nuncius, (T. Baglioni, Republic of Venice, 1610).
[2] G. Galilei, Dialogues concerning two sciences, (1638). Reprinted in On the Shoulders of Giants: The Great Works of Physics and Astronomy, (Ed. S. Hawking, Running Press, Philadelphia, 2002) ISBN 0-7624-1348-4; p.399.
[3] N. Copernicus, De revolutionibus orbium coelestium, (1543). Reprinted in On the Shoulders of Giants: The Great Works of Physics and Astronomy, (Ed. S. Hawking, Running Press, Philadelphia, 2002) ISBN 0-7624-1348-4; p.7.
[4] J. Kepler, Harmonices Mundi (Johann Planck, Linz, Austria, 1619). Reprinted in On the Shoulders of Giants: The Great Works of Physics and Astronomy, (Ed. S. Hawking, Running Press, Philadelphia, 2002) ISBN 0-7624-1348-4; p.635.
[5] I. Newton, Philosophiæ Naturalis Principia Mathematica, (1687). Reprinted in On the Shoulders of Giants: The Great Works of Physics and Astronomy, (Ed. S. Hawking, Running Press, Philadelphia, 2002) ISBN 0-7624-1348-4; p.733.

[6] J. Beringer et al. [Particle Data Group Collaboration], Phys. Rev. D 86, 010001 (2012). doi:10.1103/PhysRevD.86.010001
[7] T. Wright, An Original Theory or New Hypothesis of the Universe, (H. Chapelle, London, 1750).
[8] C. Messier, Catalogue des Nébuleuses & des amas d'Étoiles, 1781. K. G. Jones, Messier's nebulae and star clusters, (Cambridge University Press, 1991) ISBN 0-521-37079-5.
[9] I. Kant, Allgemeine Naturgeschichte und Theorie des Himmels, (Germany, 1755).
[10] E. Hubble, The Realm of Nebulae, (Yale University Press, New Haven, 1936; reprinted by Dover Publications, Inc., New York, 1958).
[11] M. Planck, Verh. d. deutsch. phys. Ges. 2, 202 (1900); Verh. d. deutsch. phys. Ges. 2, 237 (1900); Annalen Phys. 4, 553 (1901).
[12] G. B. Rybicki and A. P. Lightman, Radiative Processes in Astrophysics, (John Wiley & Sons, Massachusetts, 1979) ISBN 978-0-471-82759-7.
[13] L. A. Anchordoqui, arXiv:1512.04361 [physics.pop-ph].
[14] J. Stefan, Wiener Ber. 79, 391 (1879).
[15] L. Boltzmann, Annalen Phys. 22, 291 (1884).
[16] M. Kachelriess, A concise introduction to astrophysics, lectures given at the Institutt for fysikk NTNU, 2011.
[17] W. Wien, Annalen Phys. 52, 132 (1894).
[18] For further details see e.g., H. Karttunen, P. Kröger, H. Oja, M. Poutanen, K. J. , Fundamental Astronomy, (4th Edition, Springer-Verlag Berlin Heidelberg New York, 2003).
[19] E. Hertzsprung, Astron. Nachr. 196, 201 (1913); H. N. Russell, Science 37, 651 (1913).
[20] Y. L. Shirley, Fundamentals of Astronomy, lectures given at the University of Arizona, 2010.
[21] C. Doppler, Abh. Königl. Böhm. Ges. Wiss. 2, 465 (1843).
[22] S. Weinberg, The First Three Minutes: A Modern View of the Origin of the Universe, (BasicBooks, New York, 1993) ISBN 0-465-02437-8.
[23] J. Fraunhofer, Determination of the refractive and color-dispersing power of different types of glass, in relation to the improvement of achromatic telescopes, Memoirs of the Royal Academy of Sciences in Munich 5, 193 (1814-1815); see especially pages 202-205 and the plate following page 226.
[24] W. Huggins, Philos. Trans. Roy. Soc. London 158, 529 (1868) doi:10.1098/rstl.1868.0022
[25] S. Sarkar, Lecture Notes on Special Relativity, lectures given at Oxford University, 2002.
[26] H. A. Lorentz, Proc. R. Neth. Acad. Arts Sci. 6, 809 (1904).
[27] L. A. Anchordoqui, arXiv:1509.08868 [physics.pop-ph].
[28] T. Mazeh et al., Astrophys. J. 532, L55 (2000) doi:10.1086/312558 [astro-ph/0001284].
[29] D. Queloz, A. Eggenberger, M. Mayor, C. Perrier, J. L. Beuzit, D. Naef, J. P. Sivan and S. Udry, Astron. Astrophys. 359, L13 (2000) [astro-ph/0006213].
[30] R. A. Wittenmyer et al., Astrophys. J. 632, 1157 (2005) doi:10.1086/433176 [astro-ph/0504579].
[31] G. W. Henry, G. W. Marcy, R. P. Butler and S. S. Vogt, Astrophys. J. 529, L41 (2000). doi:10.1086/312458
[32] D. Charbonneau, T. M. Brown, D. W. Latham and M. Mayor, Astrophys. J. 529, L45 (2000) doi:10.1086/312457 [astro-ph/9911436].
[33] H. A. Bethe, Phys. Rev. 55, 434 (1939).
[34] S. Chandrasekhar, Mon. Not. Roy. Astron. Soc. 95, 207 (1935).
[35] M. S. Longair, (Cambridge University Press, UK, 2011) ISBN 978-0-521-75618-1.
[36] J. R. Oppenheimer and G. M. Volkoff, Phys. Rev. 55, 374 (1939).
[37] L. I. Sedov, J. App. Math. Mech. 10, 241 (1946).
[38] G. I. Taylor, Proc. Roy. Soc. 201, 159 (1950) doi:10.1098/rspa.1950.0049.
[39] G. I. Taylor, Proc. Roy. Soc. 201, 175 (1950) doi:10.1098/rspa.1950.0050.
[40] A. Hewish, S. J. Bell, J. D. H. Pilkington, P. F. Scott, and R. A. Nature 217, 709 (1968) doi:10.1038/217709a0.
[41] T. Gold, Nature 218, 731 (1968) doi:10.1038/218731a0.
[42] P. E. Boynton, E. J. Groth III, R. B. Partridge, and D. T. Wilkinson, Astrophys. J. 157, L197 (1969).
[43] J. R. Oppenheimer and H. Snyder, Phys. Rev. 56, 455 (1939).
[44] R. Penrose, Phys. Rev. Lett. 14, 57 (1965).
[45] S. Hawking, Phys. Rev. Lett. 15, 689 (1965).
[46] S. Hawking, Proc. Roy. Soc. Lond. A 294, 511 (1966).
[47] S. Hawking, Proc. Roy. Soc. Lond. A 295, 490 (1966).
[48] S. Hawking, Proc. Roy. Soc. Lond. A 300, 187 (1967).
[49] S. W. Hawking and R. Penrose, Proc. Roy. Soc. Lond. A 314, 529 (1970).
[50] K. F. Gauss, General investigations of curved surfaces of 1827 and 1825, (C. S. Robinson & Co., University Press Princeton, N. J., 1902).
[51] J. Bólyai, Appendix: Explaining the absolute true of space, published as an appendix to the essay by his father F. Bólyai, An attempt to introduce youth to the fundamentals of pure science, elementary and advanced, by a clear and proper method (Maros Vásárhely, Transilvania, 1832).
[52] N. I. Lobachevsky, Kasanski Vestnik (Kazan Messenger), Feb-Mar, 178 (1829); April, 228 (1829); Nov-Dec, 227 (1829); Mar-Apr, 251 (1830); Jul-Aug, 571 (1830).
[53] F. W. Bessel; communicated by J. F. W. Herschel, Mon. Not. Roy. Astron. Soc. 6, 136 (1844).
[54] A. Clark, communicated by T. H. Safford, The observed motions of the companion of Sirius (Cambridge: Welch, Bigelow, and Company, MA, 1863).
[55] W. S. Adams, Publications of the Astronomical Society of the Pacific 27, 236 (1915) doi:10.1086/122440.
[56] W. Pauli, Z. Phys. 31, 765 (1925).
[57] W. Heisenberg, Z. Phys. 43, 172 (1927).
[58] A. Einstein, Annalen Phys. 17, 891 (1905) [Annalen Phys. 14, 194 (2005)].
[59] H. Minkowski, Physikalische Zeitschrift 10, 104 (1909).
[60] K. Schwarzschild, Sitzungsber. Preuss. Akad. Wiss. Berlin (Math. Phys.) 1916, 189 (1916) [physics/9905030].
[61] A. Einstein, Annalen Phys. 49, 769 (1916) [Annalen Phys. 14, 517 (2005)]. doi:10.1002/andp.200590044
[62] E. Kretschmann, Annalen Phys. 53, 575 (1917).
[63] S. Weinberg, Gravitation and Cosmology (John Wiley & Sons, New York, 1972) ISBN 0-471-92567-5
[64] C. W. Misner, K. S. Thorne and J. A. Wheeler, Gravitation, (W. H. Freeman, San Francisco, 1973) ISBN 978-0-7167-0344-0.
[65] F. W. Dyson, A. S. Eddington, and C. Davidson, Phil. Trans. Roy. Soc. 220A, 291 (1920).
[66] D. Adams, Hitchhiker's Guide to the Galaxy: Life, the Universe and Everything, (Harmony Books, NY, 1982) ISBN 0-345-39182-9; see chapter 17.
[67] A. G. W. , Nature 229, 178 (1971).
[68] R. E. Wilson, Astrophys. J. 170, 529 (1971).

[69] R. Giacconi, P. Gorenstein, H. Gursky, J. R. Waters, Astrophys. J. 148, L119 (1967)
[70] M. Oda, P. Gorenstein, H. Gursky, E. Kellogg, E. Schreier, H. Tananbaum, R. Giacconi, Astrophys. J. 166, L1 (1971).
[71] B. L. Webster and P. Murdin, Nature 235, 37 (1972).
[72] C. T. Bolton, Nature 235, 271 (1972).
[73] J. A. Petterson, Astrophys. J. 224, 625 (1978).
[74] A. M. Stirling, R. E. Spencer, C. de la Force, M. A. Garrett, R. P. Fender and R. N. Ogley, Mon. Not. Roy. Astron. Soc. 327, 1273 (2001) doi:10.1046/j.1365-8711.2001.04821.x [astro-ph/0107192].
[75] J. J. Thomson, Conduction of electricity through gases (Cambridge University Press, Cambridge, 1906).
[76] A. S. Eddington, The internal constitution of the stars (Cambridge University Press, Cambridge, 1926).
[77] J. Biteau, PhD thesis, 2013, pastel-00822242.
[78] C. M. Urry and P. Padovani, Publ. Astron. Soc. Pac. 107, 803 (1995) doi:10.1086/133630 [astro-ph/9506063].
[79] C. D. Dermer and B. Giebels, arXiv:1602.06592 [astro-ph.HE].
[80] B. G. Piner, D. Bhattarai, P. G. Edwards and D. L. Jones, Astrophys. J. 640, 196 (2006) doi:10.1086/500006 [astro-ph/0511664].
[81] J. P. L. de Cheseaux, Traité de la Comète (Lausanne, 1774), pp. 223 ff; reprinted in The Bowl of Night, by F. P. Dickson (MIT Press, Cambridge, 1968) Appendix II.
[82] H. W. M. Olbers, Bode's Jahrbuch, 111 (1826); reprinted by Dickson, op. cit., Appendix I.
[83] E. Hubble, Proc. Nat. Acad. Sci. 15, 168 (1929).
[84] W. L. Freedman et al. [HST Collaboration], Astrophys. J. 553, 47 (2001) doi:10.1086/320638 [astro-ph/0012376].
[85] http://www.sdss.org/iotw/archive.html
[86] C. F. Chyba, J. R. Gott, and A. Spitkovsky, The Universe, lectures given at Princeton University, 2009 - 2010.
[87] E. A. Milne, Z. Astrophysik 6, 1 (1933).
[88] B. Ryden, Introduction to cosmology, (Addison-Wesley, San Francisco, USA, 2003) ISBN 978-0805389128
[89] A. Einstein, Sitzungsber. Preuss. Akad. Wiss. Berlin (Math. Phys.) 1917, 142 (1917).
[90] A. Friedmann, Z. Phys. 10, 377 (1922).
[91] A. Friedmann, Z. Phys. 21, 326 (1924).
[92] H. P. Robertson, Astrophys. J. 82, 284 (1935).
[93] H. P. Robertson, Astrophys. J. 83, 187, 257 (1936).
[94] A. G. Walker, Proc. Lond. Math. Soc. (2) 42, 90 (1936).
[95] V. Mukhanov, Physical Foundations of Cosmology, (Cambridge University Press, UK, 2005) ISBN: 978-0-521-56398-7
[96] N. Pogson, Mon. Not. Roy. Astron. Soc. 17, 12 (1856).
[97] A. G. Riess et al. [Supernova Search Team Collaboration], Astron. J. 116, 1009 (1998) doi:10.1086/300499 [astro-ph/9805201].
[98] S. Perlmutter et al. [Supernova Cosmology Project Collaboration], Astrophys. J. 517, 565 (1999) doi:10.1086/307221 [astro-ph/9812133].
[99] M. Hamuy et al., Astron. J. 106, 2392 (1993).
[100] M. Hamuy, M. M. Phillips, J. Maza, N. B. Suntzeff, R. A. Schommer and R. Aviles, Astron. J. 109, 1 (1995). doi:10.1086/117251
[101] S. Perlmutter, Phys. Today, April 2003.
[102] N. A. Bahcall, J. P. Ostriker, S. Perlmutter and P. J. Steinhardt, Science 284, 1481 (1999) doi:10.1126/science.284.5419.1481 [astro-ph/9906463].
[103] A. A. Penzias and R. W. Wilson, Astrophys. J. 142, 419 (1965).
[104] R. H. Dicke, P. J. E. Peebles, P. G. Roll and D. T. Wilkinson, Astrophys. J. 142, 414 (1965). doi:10.1086/148306
[105] G. F. Smoot, astro-ph/9705101.
[106] J. C. Mather et al., Astrophys. J. 420, 439 (1994).
[107] C. L. Bennett et al. [WMAP Collaboration], Astrophys. J. Suppl. 208, 20 (2013) [arXiv:1212.5225 [astro-ph.CO]].
[108] R. Adam et al. [Planck Collaboration], arXiv:1502.01582 [astro-ph.CO].
[109] S. Weinberg, Cosmology, (Oxford University Press, UK, 2008) ISBN 978-0-19-852682-7.
[110] S. Dodelson, Modern Cosmology, (Academic Press, Elsevier, Amsterdam, 2003) ISBN 978-0-12-219141-1.
[111] L. A. Anchordoqui and T. C. Paul, Mathematical models of physics problems (, New York, 2013) ISBN 978-1-62618-600-2.
[112] P. B. Denton, L. A. Anchordoqui, A. A. Berlind, M. Richardson, and T. J. Weiler (for the JEM-EUSO Collaboration), J. Phys. Conf. Ser. 531, 012004 (2014) doi:10.1088/1742-6596/531/1/012004 [arXiv:1401.5757 [astro-ph.IM]].
[113] G. Hinshaw et al. [WMAP Collaboration], Astrophys. J. Suppl. 180, 225 (2009) doi:10.1088/0067-0049/180/2/225 [arXiv:0803.0732 [astro-ph]].
[114] C. H. Lineweaver, ASP Conf. Ser. 126, 185 (1997) [astro-ph/9702042].
[115] V. C. Rubin and W. K. Ford, Jr., Astrophys. J. 159, 379 (1970). doi:10.1086/150317
[116] V. C. Rubin, N. Thonnard and W. K. Ford, Jr., Astrophys. J. 238, 471 (1980). doi:10.1086/158003
[117] V. C. Rubin, D. Burstein, W. K. Ford, Jr. and N. Thonnard, Astrophys. J. 289, 81 (1985). doi:10.1086/162866
[118] F. Zwicky, Helv. Phys. Acta 6, 110 (1933).
[119] D. Clowe, M. Bradac, A. H. Gonzalez, M. Markevitch, S. W. Randall, C. Jones and D. Zaritsky, Astrophys. J. 648, L109 (2006) doi:10.1086/508162 [astro-ph/0608407].
[120] J. L. Feng, Ann. Rev. Astron. Astrophys. 48, 495 (2010) doi:10.1146/annurev-astro-082708-101659 [arXiv:1003.0904 [astro-ph.CO]].
[121] P. A. R. Ade et al. [Planck Collaboration], arXiv:1502.01589 [astro-ph.CO].
[122] A. G. Riess et al., Astrophys. J. 730, 119 (2011) Erratum: [Astrophys. J. 732, 129 (2011)] doi:10.1088/0004-637X/732/2/129, 10.1088/0004-637X/730/2/119 [arXiv:1103.2976 [astro-ph.CO]].
[123] R. A. Knop et al. [Supernova Cosmology Project Collaboration], Astrophys. J. 598, 102 (2003) [arXiv:astro-ph/0309368].
[124] S. W. Allen, R. W. Schmidt and A. C. Fabian, Mon. Not. Roy. Astron. Soc. 334, L11 (2002) [arXiv:astro-ph/0205007].
[125] A. E. Lange et al. [Boomerang Collaboration], Phys. Rev. D 63, 042001 (2001) [arXiv:astro-ph/0005004].
[126] A. Balbi et al., Astrophys. J. 545, L1 (2000) [Erratum-ibid. 558, L145 (2001)] [arXiv:astro-ph/0005124].
[127] S. M. Carroll, W. H. Press and E. L. Turner, Ann. Rev. Astron. Astrophys. 30, 499 (1992).
[128] B. S. Meyer and D. N. Schramm, Astrophys. J. 311, 406 (1986).
[129] G. Aldering et al. [SNAP Collaboration], arXiv:astro-ph/0209550.
[130] F. Halzen and A. D. Martin, Quarks and leptons: An introductory course In modern particle physics, (John Wiley & Sons, New York, 1984) ISBN 0-471-88741-2
[131] V. D. Barger and R. J. N. Phillips, Collider physics, Front.

Phys. 71, 1 (1991) ISBN 0-201-14945-1
[132] C. Quigg, Gauge Theories of the strong, weak, and electromagnetic interactions, Front. Phys. 56, 1 (1983) ISBN 978-0805360202
[133] L. Anchordoqui and F. Halzen, arXiv:0906.1271 [physics.ed-ph].
[134] K. A. Olive et al. [Particle Data Group Collaboration], Chin. Phys. C 38, 090001 (2014). doi:10.1088/1674-1137/38/9/090001
[135] H. Fritzsch, M. Gell-Mann and H. Leutwyler, Phys. Lett. B 47, 365 (1973).
[136] M. Gell-Mann, CTSL-20, TID-12608.
[137] Y. Ne'eman, Nucl. Phys. 26, 222 (1961).
[138] M. Gell-Mann, Phys. Lett. 8, 214 (1964).
[139] S. N. Bose, Z. Phys. 26, 178 (1924).
[140] A. Einstein, Sitzungsber. Preuss. Akad. Wiss. Berlin (Math. Phys.) 22, 261 (1924); 1, 3 (1925); 3, 18 (1925).
[141] E. Fermi, Rend. Lincei 3, 145 (1926); Z. Phys. 36, 902 (1926).
[142] P. A. M. Dirac, Proc. R. Soc. Lond. Ser. A 112, 661 (1926).
[143] D. J. Gross and F. Wilczek, Phys. Rev. Lett. 30, 1343 (1973).
[144] H. D. Politzer, Phys. Rev. Lett. 30, 1346 (1973).
[145] J. S. Schwinger, Phys. Rev. 74, 1439 (1948).
[146] J. S. Schwinger, Phys. Rev. 75, 651 (1948). doi:10.1103/PhysRev.75.651
[147] S. Tomonaga, Prog. Theor. Phys. 1, 27 (1946).
[148] R. P. Feynman, Rev. Mod. Phys. 20, 367 (1948).
[149] F. J. Dyson, Phys. Rev. 75, 486 (1949).
[150] F. J. Dyson, Phys. Rev. 75, 1736 (1949).
[151] R. P. Feynman, Phys. Rev. 80, 440 (1950). doi:10.1103/PhysRev.80.440
[152] J. S. Schwinger, Phys. Rev. 73, 416 (1948).
[153] S. L. Glashow, Nucl. Phys. 22, 579 (1961).
[154] S. Weinberg, Phys. Rev. Lett. 19, 1264 (1967).
[155] A. Salam, Conf. Proc. C 680519, 367 (1968).
[156] P. W. Higgs, Phys. Rev. Lett. 13, 508 (1964).
[157] F. Englert and R. Brout, Phys. Rev. Lett. 13, 321 (1964).
[158] G. Aad et al. [ATLAS Collaboration], Phys. Lett. B 710, 49 (2012) [arXiv:1202.1408 [hep-ex]].
[159] S. Chatrchyan et al. [CMS Collaboration], Phys. Lett. B 710, 26 (2012) [arXiv:1202.1488 [hep-ex]].
[160] ATLAS Collaboration, Search for resonances decaying to photon pairs in 3.2 fb$^{-1}$ of pp collisions at $\sqrt{s}$ = 13 TeV with the ATLAS detector, ATLAS-CONF-2015-081.
[161] CMS Collaboration, Search for new physics in high mass diphoton events in proton-proton collisions at 13 TeV, CMS-PAS-EXO-15-004.
[162] A. Strumia, arXiv:1605.09401 [hep-ph]; and references therein.
[163] M. Delmastro [on behalf of the ATLAS Collaboration], Diphoton searches in ATLAS, 51st Rencontres de Moriond (Electroweak session) 17 May 2016, La Thuile (Italy).
[164] P. Musella [on behalf of the CMS Collaboration], Search for high mass diphoton resonances at CMS, 51st Rencontres de Moriond (Electroweak session) 17 May 2016, La Thuile (Italy).
[165] CMS Collaboration, Search for new physics in high mass diphoton events in 3.3 fb$^{-1}$ of proton-proton collisions at $\sqrt{s}$ = 13 TeV and combined interpretation of searches at 8 TeV and 13 TeV, CMS-PAS-EXO-16-018.
[166] E. W. Kolb and M. S. Turner, The Early Universe, Front. Phys. 69, 1 (1990). ISBN 0-201-11603-0
[167] K. A. Olive, CERN Yellow Report CERN-2010-002, 149-196 [arXiv:1005.3955 [hep-ph]].
[168] A. H. Guth, Phys. Rev. D 23, 347 (1981). doi:10.1103/PhysRevD.23.347
[169] D. Baumann, doi:10.1142/9789814327183_0010 arXiv:0907.5424 [hep-th].
[170] A. Riotto, hep-ph/0210162.
[171] C. H. Lineweaver, astro-ph/0305179.
[172] A. D. Sakharov, Pisma Zh. Eksp. Teor. Fiz. 5, 32 (1967) [JETP Lett. 5, 24 (1967)] [Sov. Phys. Usp. 34, 392 (1991)] [Usp. Fiz. Nauk 161, 61 (1991)]. doi:10.1070/PU1991v034n05ABEH002497
[173] C. Brust, D. E. Kaplan and M. T. Walters, JHEP 1312, 058 (2013) doi:10.1007/JHEP12(2013)058 [arXiv:1303.5379 [hep-ph]].
[174] A. Bazavov et al., Phys. Rev. D 80, 014504 (2009) doi:10.1103/PhysRevD.80.014504 [arXiv:0903.4379 [hep-lat]].
[175] L. A. Anchordoqui and H. Goldberg, Phys. Rev. Lett. 108, 081805 (2012) doi:10.1103/PhysRevLett.108.081805 [arXiv:1111.7264 [hep-ph]].
[176] M. Laine and Y. Schroder, Phys. Rev. D 73, 085009 (2006) doi:10.1103/PhysRevD.73.085009 [hep-ph/0603048].
[177] G. Steigman, B. Dasgupta and J. F. Beacom, Phys. Rev. D 86, 023506 (2012) doi:10.1103/PhysRevD.86.023506 [arXiv:1204.3622 [hep-ph]].
[178] L. A. Anchordoqui, H. Goldberg and B. Vlcek, arXiv:1305.0146 [astro-ph.CO].
[179] R. A. Alpher, J. W. Follin and R. C. Herman, Phys. Rev. 92, 1347 (1953). doi:10.1103/PhysRev.92.1347
[180] Ya. B. Zel'dovich, Adv. Astron. Astrophys. 3, 241 (1965).
[181] Ya. B. Zel'dovich, Sov. Phys. Usp. 9, 602 (1967).
[182] B. W. Lee and S. Weinberg, Phys. Rev. Lett. 39, 165 (1977). doi:10.1103/PhysRevLett.39.165
[183] G. Steigman, D. N. Schramm and J. E. Gunn, Phys. Lett. B 66, 202 (1977). doi:10.1016/0370-2693(77)90176-9
[184] G. Steigman, K. A. Olive, D. N. Schramm and M. S. Turner, Phys. Lett. B 176, 33 (1986). doi:10.1016/0370-2693(86)90920-2
[185] D. A. Dicus, E. W. Kolb, A. M. Gleeson, E. C. G. Sudarshan, V. L. Teplitz and M. S. Turner, Phys. Rev. D 26, 2694 (1982). doi:10.1103/PhysRevD.26.2694
[186] S. Dodelson and M. S. Turner, Phys. Rev. D 46, 3372 (1992). doi:10.1103/PhysRevD.46.3372
[187] G. Mangano, G. Miele, S. Pastor and M. Peloso, Phys. Lett. B 534, 8 (2002) doi:10.1016/S0370-2693(02)01622-2 [astro-ph/0111408].
[188] G. Mangano, G. Miele, S. Pastor, T. Pinto, O. Pisanti and P. D. Serpico, Nucl. Phys. B 729, 221 (2005) doi:10.1016/j.nuclphysb.2005.09.041 [hep-ph/0506164].
[189] S. Sarkar, Rept. Prog. Phys. 59, 1493 (1996) doi:10.1088/0034-4885/59/12/001 [hep-ph/9602260].
[190] K. A. Olive, G. Steigman and T. P. Walker, Phys. Rept. 333, 389 (2000) doi:10.1016/S0370-1573(00)00031-4 [astro-ph/9905320].
[191] G. Gamow, Phys. Rev. 70, 572 (1946). doi:10.1103/PhysRev.70.572
[192] R. A. Alpher, H. Bethe and G. Gamow, Phys. Rev. 73, 803 (1948). doi:10.1103/PhysRev.73.803
[193] G. Gamow, Rev. Mod. Phys. 21, 367 (1949). doi:10.1103/RevModPhys.21.367
[194] Y. I. Izotov, T. X. Thuan and G. Stasinska, Astrophys. J. 662, 15 (2007) doi:10.1086/513601 [astro-ph/0702072 [ASTRO-PH]].
[195] M. Peimbert, V. Luridiana and A. Peimbert, Astrophys.

J. 666, 636 (2007) doi:10.1086/520571 [astro-ph/0701580].
[196] G. Steigman, Ann. Rev. Nucl. Part. Sci. 57, 463 (2007) doi:10.1146/annurev.nucl.56.080805.140437 [arXiv:0712.1100 [astro-ph]].
[197] V. Simha and G. Steigman, JCAP 0806, 016 (2008) doi:10.1088/1475-7516/2008/06/016 [arXiv:0803.3465 [astro-ph]].
[198] Y. I. Izotov and T. X. Thuan, Astrophys. J. 710, L67 (2010) doi:10.1088/2041-8205/710/1/L67 [arXiv:1001.4440 [astro-ph.CO]].
[199] E. Aver, K. A. Olive and E. D. Skillman, JCAP 1103, 043 (2011) doi:10.1088/1475-7516/2011/03/043 [arXiv:1012.2385 [astro-ph.CO]].
[200] E. Aver, K. A. Olive and E. D. Skillman, JCAP 1005, 003 (2010) doi:10.1088/1475-7516/2010/05/003 [arXiv:1001.5218 [astro-ph.CO]].
[201] Y. I. Izotov, T. X. Thuan and N. G. Guseva, Mon. Not. Roy. Astron. Soc. 445, no. 1, 778 (2014) doi:10.1093/mnras/stu1771 [arXiv:1408.6953 [astro-ph.CO]].
[202] P. A. R. Ade et al. [Planck Collaboration], Astron. Astrophys. 571, A16 (2014) doi:10.1051/0004-6361/201321591 [arXiv:1303.5076 [astro-ph.CO]].
[203] L. A. Anchordoqui, H. Goldberg and G. Steigman, Phys. Lett. B 718, 1162 (2013) doi:10.1016/j.physletb.2012.12.019 [arXiv:1211.0186 [hep-ph]].
[204] S. W. Hawking, Nature 248, 30 (1974).
[205] S. W. Hawking, Commun. Math. Phys. 43, 199 (1975).
[206] J. B. Hartle and S. W. Hawking, Phys. Rev. D 13, 2188 (1976).
[207] L. Parker, Phys. Rev. D 12, 1519 (1975).
[208] R. M. Wald, Commun. Math. Phys. 45, 9 (1975).
[209] S. W. Hawking, Phys. Rev. D 14, 2460 (1976).
[210] D. N. Page, Phys. Rev. D 13, 198 (1976).
[211] D. N. Page and S. W. Hawking, Astrophys. J. 206, 1 (1976).
[212] S. W. Hawking, Phys. Rev. D 13, 191 (1976).
[213] N. Arkani-Hamed, S. Dimopoulos and G. R. Dvali, Phys. Lett. B 429, 263 (1998) doi:10.1016/S0370-2693(98)00466-3 [hep-ph/9803315].
[214] T. Banks and W. Fischler, hep-th/9906038.
[215] S. Dimopoulos and G. L. Landsberg, Phys. Rev. Lett. 87, 161602 (2001) doi:10.1103/PhysRevLett.87.161602 [hep-ph/0106295].
[216] S. B. Giddings and S. D. Thomas, Phys. Rev. D 65, 056010 (2002) doi:10.1103/PhysRevD.65.056010 [hep-ph/0106219].
[217] J. L. Feng and A. D. Shapere, Phys. Rev. Lett. 88, 021303 (2002) doi:10.1103/PhysRevLett.88.021303 [hep-ph/0109106].
[218] L. A. Anchordoqui, J. L. Feng, H. Goldberg and A. D. Shapere, Phys. Rev. D 65, 124027 (2002) doi:10.1103/PhysRevD.65.124027 [hep-ph/0112247].
[219] R. Emparan, G. T. Horowitz and R. C. Myers, Phys. Rev. Lett. 85, 499 (2000) doi:10.1103/PhysRevLett.85.499 [hep-th/0003118].
[220] L. A. Anchordoqui, J. L. Feng, H. Goldberg and A. D. Shapere, Phys. Lett. B 594, 363 (2004) doi:10.1016/j.physletb.2004.05.051 [hep-ph/0311365].
[221] L. Anchordoqui and H. Goldberg, Phys. Rev. D 67, 064010 (2003) doi:10.1103/PhysRevD.67.064010 [hep-ph/0209337].
[222] R. C. Myers and M. J. Perry, Annals Phys. 172, 304 (1986). doi:10.1016/0003-4916(86)90186-7
[223] A. Chamblin, F. and G. C. Nayak, Phys. Rev. D 69, 065010 (2004) doi:10.1103/PhysRevD.69.065010 [hep-ph/0301239].
[224] I. Antoniadis, N. Arkani-Hamed, S. Dimopoulos and G. R. Dvali, Phys. Lett. B 436, 257 (1998) doi:10.1016/S0370-2693(98)00860-0 [hep-ph/9804398].
[225] C. M. Harris and P. Kanti, JHEP 0310, 014 (2003) doi:10.1088/1126-6708/2003/10/014 [hep-ph/0309054].
[226] W. L. Kraushaar, G. W. Clark, G. P. Garmire, R. Borken, P. Higbie, C. Leong, and T. Thorsos, Astrophys. J. 177, 341 (1972).
[227] C. E. Fichtel, R. C. Hartman, D. A. Kniffen, D. J. Thomson, H. Ogelman, M. E. Ozel, T. Turner, and G. F. Bignami, Astrophys. J. 198, 163 (1975).
[228] P. Sreekumar et al. [EGRET Collaboration], Astrophys. J. 494, 523 (1998) doi:10.1086/305222 [astro-ph/9709257].
[229] R. C. Hartman et al. [EGRET Collaboration], Astrophys. J. Suppl. 123, 79 (1999).
[230] M. Ackermann et al. [Fermi-LAT Collaboration], Astrophys. J. 799, 86 (2015) doi:10.1088/0004-637X/799/1/86 [arXiv:1410.3696 [astro-ph.HE]].
[231] M. G. Aartsen et al. [IceCube Collaboration], Phys. Rev. D 91, no. 2, 022001 (2015) doi:10.1103/PhysRevD.91.022001 [arXiv:1410.1749 [astro-ph.HE]].
[232] L. A. Anchordoqui, H. Goldberg, T. C. Paul, L. H. M. da Silva and B. J. Vlcek, Phys. Rev. D 90, no. 12, 123010 (2014) doi:10.1103/PhysRevD.90.123010 [arXiv:1410.0348 [astro-ph.HE]].
[233] For further details see e.g., F. A. Aharonian, Very high energy cosmic gamma radiation: A critical window on the extreme universe, (Singapore: World Scientific Publishing, 2004) ISBN 981-02-4573-4.
[234] J. G. Learned and K. Mannheim, Ann. Rev. Nucl. Part. Sci. 50, 679 (2000).
[235] V. F. Hess, Phys. Z. 13, 1804 (1912).
[236] P. Auger, R. Maze, T. Grivet-Meyer, Comptes Rendus 206, 1721 (1938).
[237] P. Auger, P. Ehrenfest, R. Maze, J. Daudin, Robley, and A. Fréon, Rev. Mod. Phys. 11, 288 (1939).
[238] D. J. et al., Astrophys. J. 441, 144 (1995).
[239] A. Aab et al. [Pierre Auger Collaboration], Nucl. Instrum. Meth. A 798, 172 (2015) doi:10.1016/j.nima.2015.06.058 [arXiv:1502.01323 [astro-ph.IM]].
[240] J. Abraham et al. [Pierre Auger Collaboration], Nucl. Instrum. Meth. A 613, 29 (2010) doi:10.1016/j.nima.2009.11.018 [arXiv:1111.6764 [astro-ph.IM]].
[241] J. Abraham et al. [Pierre Auger Collaboration], Nucl. Instrum. Meth. A 620, 227 (2010) doi:10.1016/j.nima.2010.04.023 [arXiv:0907.4282 [astro-ph.IM]].
[242] L. Anchordoqui, M. T. Dova, A. G. Mariazzi, T. McCauley, T. C. Paul, S. Reucroft and J. Swain, Annals Phys. 314, 145 (2004) doi:10.1016/j.aop.2004.07.003 [hep-ph/0407020].
[243] D. F. Torres and L. A. Anchordoqui, Rept. Prog. Phys. 67, 1663 (2004) doi:10.1088/0034-4885/67/9/R03 [astro-ph/0402371].
[244] T. Antoni et al. [KASCADE Collaboration], Astropart. Phys. 24, 1 (2005) doi:10.1016/j.astropartphys.2005.04.001 [astro-ph/0505413].
[245] D. J. Bird et al. [HiRes Collaboration], Phys. Rev. Lett. 71, 3401 (1993). doi:10.1103/PhysRevLett.71.3401
[246] R. U. Abbasi et al. [HiRes Collaboration], Phys. Rev. Lett. 100, 101101 (2008) doi:10.1103/PhysRevLett.100.101101

[astro-ph/0703099]. doi:10.1016/j.astropartphys.2010.12.008 [arXiv:0907.5194 [247] J. Abraham et al. [Pierre Auger Collabora- [astro-ph.HE]]. tion], Phys. Rev. Lett. 101, 061101 (2008) [268] L. A. Anchordoqui, H. Goldberg, F. Halzen doi:10.1103/PhysRevLett.101.061101 [arXiv:0806.4302 and T. J. Weiler, Phys. Lett. B 593, 42 (2004) [astro-ph]]. doi:10.1016/j.physletb.2004.04.054 [astro-ph/0311002]. [248] W. D. Apel et al., Astropart. Phys. 36, 183 (2012). [269] L. A. Anchordoqui, H. Goldberg and doi:10.1016/j.astropartphys.2012.05.023 T. J. Weiler, Phys. Rev. Lett. 87, 081101 (2001) [249] M. G. Aartsen et al. [IceCube Collaboration], Phys. Rev. D doi:10.1103/PhysRevLett.87.081101 [astro-ph/0103043]. 88, no. 4, 042004 (2013) doi:10.1103/PhysRevD.88.042004 [270] L. A. Anchordoqui, doi:10.5170/CERN-2013-003.303 [arXiv:1307.3795 [astro-ph.HE]]. arXiv:1104.0509 [hep-ph]. [250] T. Abu-Zayyad et al. [HiRes-MIA Collaboration], As- [271] D. N. Schramm, Comments Nucl. Part. Phys. 17, no. 5, trophys. J. 557, 686 (2001) doi:10.1086/322240 [astro- 239 (1987). ph/0010652]. [272] L. A. Anchordoqui and T. Montaruli, [251] D. R. Bergman and J. W. Belz, J. Phys. G 34, R359 (2007) Ann. Rev. Nucl. Part. Sci. 60, 129 (2010) doi:10.1088/0954-3899/34/10/R01 [arXiv:0704.3721 [astro- doi:10.1146/annurev.nucl.012809.104551 ph]]. [arXiv:0912.1035 [astro-ph.HE]]. [252] K. H. Kampert and M. Unger, Astropart. Phys. [273] A. Achterberg et al. [IceCube Collabo- 35, 660 (2012) doi:10.1016/j.astropartphys.2012.02.004 ration], Astropart. Phys. 26, 155 (2006) [arXiv:1201.0018 [astro-ph.HE]]. doi:10.1016/j.astropartphys.2006.06.007 [astro- [253] P. Abreu et al. [Pierre Auger Collabo- ph/0604450]. ration], Astropart. Phys. 34, 627 (2011) [274] R. Abbasi et al. [IceCube Collaboration], Nucl. Instrum. doi:10.1016/j.astropartphys.2010.12.007 [arXiv:1103.2721 Meth. A 601, 294 (2009) doi:10.1016/j.nima.2009.01.001 [astro-ph.HE]]. [arXiv:0810.4930 [physics.ins-det]]. [254] P. Abreu et al. 
[Pierre Auger Collaboration], Astrophys. [275] R. Abbasi et al. [IceCube Collaboration], Nucl. Instrum. J. Suppl. 203 (2012) 34 doi:10.1088/0067-0049/203/2/34 Meth. A 700, 188 (2013) doi:10.1016/j.nima.2012.10.067 [arXiv:1210.3736 [astro-ph.HE]]. [arXiv:1207.6326 [astro-ph.IM]]. [255] A. Aab et al. [Pierre Auger Collaboration], Astrophys. [276] V. S. Berezinsky and G. T. Zatsepin, Phys. Lett. B 28, 423 J. 802, no. 2, 111 (2015) doi:10.1088/0004-637X/802/2/111 (1969). doi:10.1016/0370-2693(69)90341-4 [arXiv:1411.6953 [astro-ph.HE]]. [277] M. G. Aartsen et al. [IceCube Collabora- [256] A. Aab et al. [Pierre Auger Collaboration], Phys. Rev. D tion], Phys. Rev. Lett. 111, 021103 (2013) 90, no. 12, 122005 (2014) doi:10.1103/PhysRevD.90.122005 doi:10.1103/PhysRevLett.111.021103 [arXiv:1304.5356 [arXiv:1409.4809 [astro-ph.HE]]. [astro-ph.HE]]. [257] A. Aab et al. [Pierre Auger Collaboration], Phys. Rev. D [278] S. Schonert, T. K. Gaisser, E. Resconi and O. Schulz, Phys. 90, no. 12, 122006 (2014) doi:10.1103/PhysRevD.90.122006 Rev.D 79, 043009 (2009) doi:10.1103/PhysRevD.79.043009 [arXiv:1409.5083 [astro-ph.HE]]. [arXiv:0812.4308 [astro-ph]]. [258] V. Berezinsky, A. Z. Gazizov and S. I. Grigorieva, Phys. [279] T. K. Gaisser, K. Jero, A. Karle and J. van San- Rev.D 74, 043005 (2006) doi:10.1103/PhysRevD.74.043005 ten, Phys. Rev. D 90, no. 2, 023009 (2014) [hep-ph/0204357]. doi:10.1103/PhysRevD.90.023009 [arXiv:1405.0525 [259] M. Ahlers, L. A. Anchordoqui and A. M. Tay- [astro-ph.HE]]. lor, Phys. Rev. D 87, no. 2, 023004 (2013) [280] M. G. Aartsen et al. [IceCube Collaboration], Sci- doi:10.1103/PhysRevD.87.023004 [arXiv:1209.5427 ence 342, 1242856 (2013) doi:10.1126/science.1242856 [astro-ph.HE]]. [arXiv:1311.5238 [astro-ph.HE]]. [260] R. Aloisio, V. Berezinsky and P. Blasi, JCAP 1410, [281] L. A. Anchordoqui, H. Goldberg, M. H. Lynch, no. 10, 020 (2014) doi:10.1088/1475-7516/2014/10/020 A. V. Olinto, T. C. Paul and T. J. Weiler, Phys. Rev. 
D [arXiv:1312.7459 [astro-ph.HE]]. 89, no. 8, 083003 (2014) doi:10.1103/PhysRevD.89.083003 [261] G. Giacinti, M. Kachelrie and D. V.Semikoz, Phys. Rev. D [arXiv:1306.5021 [astro-ph.HE]]. 91, no. 8, 083009 (2015) doi:10.1103/PhysRevD.91.083009 [282] M. G. Aartsen et al. [IceCube Collabora- [arXiv:1502.01608 [astro-ph.HE]]. tion], Phys. Rev. Lett. 113, 101101 (2014) [262] M. Unger, G. R. Farrar and L. A. Anchordo- doi:10.1103/PhysRevLett.113.101101 [arXiv:1405.5303 qui, Phys. Rev. D 92, no. 12, 123001 (2015) [astro-ph.HE]]. doi:10.1103/PhysRevD.92.123001 [arXiv:1505.02153 [283] M. G. Aartsen et al. [IceCube Collaboration], Astrophys. [astro-ph.HE]]. J. 809, no. 1, 98 (2015) doi:10.1088/0004-637X/809/1/98 [263] T. Abu-Zayyad et al. [Telescope Array Collaboration], As- [arXiv:1507.03991 [astro-ph.HE]]. trophys. J. 768, L1 (2013) doi:10.1088/2041-8205/768/1/L1 [284] M. G. Aartsen et al. [IceCube Collaboration], [arXiv:1205.5067 [astro-ph.HE]]. arXiv:1510.05223 [astro-ph.HE]. [264] J. Abraham et al. [Pierre Auger Collaboration], Phys. [285] M. G. Aartsen et al. [IceCube Collaboration], Lett. B 685, 239 (2010) doi:10.1016/j.physletb.2010.02.013 Phys. Rev. Lett. 114, no. 17, 171102 (2015) [arXiv:1002.1975 [astro-ph.HE]]. doi:10.1103/PhysRevLett.114.171102 [arXiv:1502.03376 [265] K. Greisen, Phys. Rev. Lett. 16, 748 (1966). [astro-ph.HE]]. doi:10.1103/PhysRevLett.16.748 [286] A. Neronov and D. V. Semikoz, Astropart. Phys. [266] G. T. Zatsepin and V. A. Kuzmin, JETP Lett. 4, 78 (1966) 75, 60 (2016) doi:10.1016/j.astropartphys.2015.11.002 [Pisma Zh. Eksp. Teor. Fiz. 4, 114 (1966)]. [arXiv:1509.03522 [astro-ph.HE]]. [267] R. Aloisio, V. Berezinsky and A. Gaz- [287] A. Neronov and D. Semikoz, Phys. Rev. D 93, izov, Astropart. Phys. 34, 620 (2011) no. 12, 123002 (2016) doi:10.1103/PhysRevD.93.123002 71

[arXiv:1603.06733 [astro-ph.HE]]. arXiv:1412.5106 [astro-ph.HE]. [288] S. Razzaque, Phys. Rev. D 88, 081302 (2013) [303] A. Neronov, D. V. Semikoz, L. A. Anchordoqui, J. Adams doi:10.1103/PhysRevD.88.081302 [arXiv:1309.2756 and A. V. Olinto, arXiv:1606.03629 [astro-ph.IM]. [astro-ph.HE]]. [304] A. Aab et al. [Pierre Auger Collaboration], [289] Y. Bai, A. J. Barger, V. Barger, R. Lu, A. D. Peter- arXiv:1604.03637 [astro-ph.IM]. son and J. Salvado, Phys. Rev. D 90, no. 6, 063012 [305] J. Bradley Phil. Trans. 35, 637, (1727 - 1728) (2014) doi:10.1103/PhysRevD.90.063012 [arXiv:1407.2243 doi:10.1098/rstl.1727.0064. [astro-ph.HE]]. [306] M. Unger, High energy astrophysics, lectures given at New [290] L. A. Anchordoqui, arXiv:1606.01816 [astro-ph.HE]. York University, 2015. [291] L. A. Anchordoqui et al., JHEAp 1-2, 1 (2014) [307] C.-P. Ma, Astro 161, lectures given at the University of doi:10.1016/j.jheap.2014.01.001 [arXiv:1312.6587 [astro- California, Berkeley. ph.HE]]. [308] J. A. Peacock, Cosmological Physics, (Cambridge Univer- [292] K. S. Thorne, Rev. Mod. Phys. 52, 285 (1980). sity Press, U.K., 1999) ISBN: 0-521-42270-1. doi:10.1103/RevModPhys.52.285 [309] P. B. Denton and T. J. Weiler, Astrophys. J. 802, no. 1, 25 [293] K. S. Thorne, Rev. Mod. Phys. 52, 299 (1980). (2015) doi:10.1088/0004-637X/802/1/25 [arXiv:1409.0883 doi:10.1103/RevModPhys.52.299 [astro-ph.HE]]. [294] P. C. Peters and J. Mathews, Phys. Rev. 131, 435 (1963). [310] S. B. Giddings and M. L. Mangano, Phys. Rev. doi:10.1103/PhysRev.131.435 D 78, 035009 (2008) doi:10.1103/PhysRevD.78.035009 [295] R. A. Hulse and J. H. Taylor, Astrophys. J. 195, L51 (1975). [arXiv:0806.3381 [hep-ph]]. [296] A. Abramovici et al., Science 256, 325 (1992). [311] S. B. Giddings and M. L. Mangano, arXiv:0808.4087 [hep- [297] B. P. Abbott et al. [LIGO Scientific and Virgo Collab- ph]. orations], Phys. Rev. Lett. 116, no. 6, 061102 (2016) [312] A. M. Hillas, Ann. Rev. Astron. Astrophys. 22, 425 (1984). 
doi:10.1103/PhysRevLett.116.061102 [arXiv:1602.03837 doi:10.1146/annurev.aa.22.090184.002233 [gr-qc]]. [313] R. M. Bionta et al., Phys. Rev. Lett. 58, 1494 (1987). [298] M. G. Aartsen et al. [IceCube and LIGO Scientific and [314] K. Hirata et al. [KAMIOKANDE-II Collaboration], Phys. VIRGO Collaborations], Phys. Rev. D 90, no. 10, 102002 Rev. Lett. 58, 1490 (1987); (2014) doi:10.1103/PhysRevD.90.102002 [arXiv:1407.1042 [315] J. N. Bahcall, A. Dar and T. Piran, Nature 326, 135 (1987). [astro-ph.HE]]. doi:10.1038/326135a0 [299] M. G. Aartsen et al. [IceCube and Pierre Auger [316] M. D. Kruskal, Phys. Rev. 119, 1743 (1960). and Telescope Array Collaborations], JCAP 1601, doi:10.1103/PhysRev.119.1743 no. 01, 037 (2016) doi:10.1088/1475-7516/2016/01/037 [317] G. ’t Hooft, Introduction to the theory of black holes, lectures [arXiv:1511.09408 [astro-ph.HE]]. given at Utrecht University, 2009. [300] M. Ackermann et al. [Fermi-LAT Collaboration], [318] A. Einstein and N. Rosen, Phys. Rev. 48, 73 (1935). arXiv:1602.04488 [astro-ph.HE]. doi:10.1103/PhysRev.48.73 [301] S. Adrian-Martinez et al. [ANTARES and Ice- [319] R. W. Fuller and J. A. Wheeler, Phys. Rev. 128, 919 (1962). Cube and LIGO Scientific and Virgo Collaborations], doi:10.1103/PhysRev.128.919 arXiv:1602.05411 [astro-ph.HE]. [302] M. G. Aartsen et al. [IceCube Collaboration],

Answers and Comments on the Exercises

1.1 (i) This is a simple application of Kepler’s third law. For $a = 30.066$ AU, (23) gives 164.85 yr. The answer is given to five significant figures, the same number as we have for the semi-major axis. (ii) The exact same calculation for Pluto gives 248.1 yr, to four significant figures. (iii) The ratio of orbital periods is 248/164.8 = 1.505 to four significant figures. This is quite close (within 0.3%) to a ratio of 3 : 2. That is, every time Pluto makes two orbits around the Sun, Neptune makes three orbits. These resonances are actually quite common in the Solar System. It turns out that many of the gaps in Saturn’s rings are due to resonances with the various moons of Saturn, and more complicated resonances explain some of the stunning detailed features seen in those rings. (iv) The eccentricity of the orbit is very close to zero, so we are not surprised that the aphelion distance, $a(1 + e) = 30.4$ AU, is very close to the semi-major axis. Here, the number of significant figures is subtle. You might think that the eccentricity is known to only a single significant figure, so that the aphelion should be given to the same significance. In fact, what counts in this calculation is the quantity $1 + e$, which has three significant figures. (v) The perihelion is $a(1 - e) = 39.48\,(1 - 0.250)$ AU = 29.61 AU, and the aphelion, similarly, is $a(1 + e) = 49.35$ AU. The perihelion distance of Pluto is less than the aphelion distance of Neptune, so indeed, Pluto is sometimes a bit closer to the Sun than is Neptune. It only gets a little inside Neptune’s orbit, and it turns out that it was last inside Neptune’s orbit from 1979 to 1999.
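The arithmetic of this answer can be checked with a few lines of Python. This is only a sketch: it uses Kepler's third law in solar units, $(T/{\rm yr})^2 = (a/{\rm AU})^3$, neglects the planet-to-Sun mass ratio, and the function names are illustrative rather than taken from the lectures.

```python
# Kepler's third law in solar units: (T / yr)^2 = (a / AU)^3.
# The planet mass is neglected, so the last digit can differ slightly
# from a calculation that keeps the m/M correction.

def period_years(a_au: float) -> float:
    """Orbital period in years for a semi-major axis given in AU."""
    return a_au ** 1.5


def apsides_au(a_au: float, e: float) -> tuple:
    """Return (perihelion, aphelion) distances in AU."""
    return a_au * (1.0 - e), a_au * (1.0 + e)


t_neptune = period_years(30.066)
t_pluto = period_years(39.48)
ratio = t_pluto / t_neptune
peri_pluto, aph_pluto = apsides_au(39.48, 0.250)

print(f"Neptune period: {t_neptune:.2f} yr")
print(f"Pluto period:   {t_pluto:.1f} yr")
print(f"Period ratio:   {ratio:.3f}")
print(f"Pluto perihelion/aphelion: {peri_pluto:.2f} / {aph_pluto:.2f} AU")
```

The period ratio comes out close to 3 : 2, and Pluto's perihelion falls inside the 30.4 AU aphelion quoted for Neptune.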

1.2 This is another application of Kepler’s third law; we have a period (24 hours) and want to find the radius of the orbit. However, note that this is not an orbit around the Sun, and so Kepler’s third law in its original form is not valid. Rather, we can use Newton’s form of Kepler’s third law:

$$a^3 = \frac{G M_\oplus}{4\pi^2}\, T^2 , \tag{344}$$

where $M_\oplus$ is the mass of the Earth and we will approximate $T \approx 90{,}000$ s. Thus $a \sim 4.2 \times 10^7$ m. Are we done? Well, we were asked for the distance from the Earth’s surface, whereas what we have calculated is the distance from the Earth’s center. So we need to subtract from this the radius of the Earth, 6371 km, leaving roughly 35,629 km. Besides, we are asked to express this number in Earth radii; dividing by 6371 km gives about 5.6 Earth radii.
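As a cross-check, the same estimate can be run numerically. The sketch below assumes the standard value $G M_\oplus \approx 3.986 \times 10^{14}~{\rm m^3\,s^{-2}}$ (not quoted in the text) and uses $T = 86{,}400$ s instead of the rounded 90,000 s, so the digits differ slightly from the rounded numbers above.

```python
import math

G_M_EARTH = 3.986e14  # m^3 s^-2, assumed standard value of G * M_earth
R_EARTH = 6.371e6     # m, Earth radius used in the text


def orbital_radius(period_s: float, gm: float = G_M_EARTH) -> float:
    """Semi-major axis (m) from Newton's form of Kepler's third law."""
    return (gm * period_s ** 2 / (4.0 * math.pi ** 2)) ** (1.0 / 3.0)


a_geo = orbital_radius(86_400.0)          # distance from the Earth's center
altitude = a_geo - R_EARTH                # distance from the surface
altitude_in_radii = altitude / R_EARTH

print(f"a = {a_geo:.3e} m")
print(f"altitude = {altitude / 1e3:.0f} km = {altitude_in_radii:.2f} Earth radii")
```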

1.3 (i) The radius of the orbit was $R \approx 6371~{\rm km} + 200~{\rm km} \approx 6600$ km. The circumference of this circular orbit is $2\pi R = 4.2 \times 10^4$ km. So, the station has done $8.6 \times 10^4$ orbits. Now, how long does each orbit take? This is the inverse of the problem we have done in exercise 1.2. We know the semi-major axis of the orbit (the radius for a circular orbit), and can use Kepler’s law applied to the Earth to find the period:

$$T_{\rm Mir} = \left(\frac{4\pi^2}{G M_\oplus}\right)^{1/2} a^{3/2} \approx 5{,}400~{\rm s} = 90~{\rm minutes}. \tag{345}$$

So, if every orbit takes roughly 90 minutes, $8.6 \times 10^4$ orbits take $4.6 \times 10^8$ s, or 15 yr, rounding off to the needed precision. (ii) We found that the period of each orbit was close to 90 minutes. Therefore, it made 16 turns per day. Now, what would an orbit of 20 revolutions per day look like? Let’s find the radius of this orbit. The period of this orbit is shorter than the 90 minutes of Mir by a factor of 16/20 (that is, 72 minutes). We can find its orbital radius the hard way by using the full Newton’s version of Kepler’s law again, or else, we can remember that Kepler’s law holds for both orbits around the Earth, so we can find the ratio of both expressions. Namely:

$$\left(\frac{T_{\rm new\,sat}}{T_{\rm Mir}}\right)^2 = \left(\frac{a_{\rm new\,sat}}{a_{\rm Mir}}\right)^3 \;\Rightarrow\; a_{\rm new\,sat} = a_{\rm Mir} \left(\frac{T_{\rm new\,sat}}{T_{\rm Mir}}\right)^{2/3} = 0.86\, a_{\rm Mir} = 5{,}700~{\rm km} . \tag{346}$$

This would have been a nice circular orbit, except for the fact that it is 700 km underground.
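Both steps, the period from (345) and the ratio argument of (346), can be sketched in Python. The value of $G M_\oplus$ is an assumed standard constant and the variable names are illustrative.

```python
import math

G_M_EARTH = 3.986e14  # m^3 s^-2, assumed standard value of G * M_earth
R_EARTH_KM = 6371.0


def orbital_period(a_m: float, gm: float = G_M_EARTH) -> float:
    """Period (s) of a circular orbit of radius a_m (m), as in (345)."""
    return 2.0 * math.pi * math.sqrt(a_m ** 3 / gm)


a_mir_km = 6600.0
t_mir = orbital_period(a_mir_km * 1e3)

# Ratio method of (346): a scales as T^(2/3) for orbits around the same body.
period_ratio = 16.0 / 20.0
a_new_km = a_mir_km * period_ratio ** (2.0 / 3.0)

print(f"T_Mir = {t_mir:.0f} s = {t_mir / 60:.0f} min")
print(f"a_new = {a_new_km:.0f} km, i.e. {R_EARTH_KM - a_new_km:.0f} km below the surface")
```

The ratio method avoids the constants altogether, which is why only the period ratio enters the second step.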

2.1 The Earth radius is $R_\oplus \simeq 6378$ km and we consider the two measurements separated by 784 km in latitude. Then the two cities are separated by $7^\circ$ in latitude. If we interpret the $7^\circ$ separation in terms of stellar parallax then the distance to the Sun would be $d \approx 784~{\rm km}/\tan 7^\circ \sim 6{,}385$ km. If we interpret night and day as the Sun revolving around a flat disk, then the Sun will crash into the Earth.

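2.2 (i) The solar constant is $\mathcal{F}_\odot = 1.3 \times 10^3~{\rm W/m^2}$. (ii) The absolute luminosity is $L_\odot = 3.7 \times 10^{26}$ W. (iii) The Sun-Mars distance is $D = 1.524$ AU. Then

$$F = \mathcal{F}_\odot \left(\frac{d_{\rm Sun-Earth}}{d_{\rm Sun-Mars}}\right)^2 = 590~{\rm W\,m^{-2}} . \tag{347}$$

(iv) The power is $P = F A_{\rm solar\ panels}\, \epsilon = 767$ W, where $A_{\rm solar\ panels} = 1.3~{\rm m^2}$ and $\epsilon = 0.2$ is the efficiency.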

2.3 Let $D_1$ be the distance between Mercury and Saturn when they are closest to each other and $D_2$ the distance when they are farthest apart. (i) Then the distance from Mercury to the Sun is $D_{MS} = (D_2 - D_1)/2 = 3.2~{\rm lm} \simeq 0.385$ AU. (ii) The distance between Saturn and the Sun is $D_{SS} = D_1 + D_{MS} = 79.5~{\rm lm} \simeq 9.57$ AU. NASA’s MESSENGER spacecraft slammed into the surface of Mercury on 30 April 2015, bringing a groundbreaking mission to a dramatic end. The probe crashed at 3:26 pm red-sox time (1926 GMT), gouging a new crater into Mercury’s heavily pockmarked surface. This violent demise was inevitable for MESSENGER, which had been orbiting Mercury since March 2011 and had run out of fuel. The 10-foot-wide (3 meters) spacecraft was traveling about 8,750 mph (14,080 km/h) at the time of impact, and it likely created a smoking hole in the ground about 52 feet (16 m) wide in Mercury’s northern terrain. No observers or instruments witnessed the crash, which occurred on the opposite side of Mercury from Earth. Cassini is the fourth space probe to visit Saturn and the first to enter orbit, and its mission is ongoing as of 2016. It has studied the planet and its many natural satellites since arriving there in 2004.

2.4 The observed luminosity from the Sun when it is not eclipsed is $\pi R_\odot^2 \sigma T_\odot^4$. When Jupiter passes in front of the Sun, it blocks an area of size $\pi r_J^2$, and the observed luminosity decreases to $\pi (R_\odot^2 - r_J^2)\sigma T_\odot^4$, where $r_J = 71{,}492$ km. The fractional decrease in the apparent surface brightness is

$$\frac{\Delta I}{I} = \frac{\pi (R_\odot^2 - r_J^2)\,\sigma T_\odot^4}{\pi R_\odot^2\, \sigma T_\odot^4} - 1 = -\frac{r_J^2}{R_\odot^2} \approx -0.01 . \tag{348}$$


The eclipse only reduces the brightness by about 1%.
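A one-line numerical check of (348), assuming a nominal solar radius of $6.957 \times 10^5$ km (a value not quoted in the text):

```python
R_SUN_KM = 6.957e5        # assumed nominal solar radius
R_JUPITER_KM = 71_492.0   # Jupiter radius quoted in the text


def transit_depth(r_planet: float, r_star: float) -> float:
    """Fractional dimming when an opaque disk crosses a uniform stellar disk."""
    return (r_planet / r_star) ** 2


depth = transit_depth(R_JUPITER_KM, R_SUN_KM)
print(f"fractional decrease = {depth:.4f}")
```

The sketch treats the stellar disk as uniformly bright, as the derivation does; limb darkening is ignored.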

2.5 (i) The range of distances consistent with the measured parallax angle is 1/0.006 pc < D < 1/0.004 pc, or equivalently 167 pc < D < 250 pc. (ii) The faintest stars that can be detected with the HST have apparent brightnesses which are $4 \times 10^{21}$ fainter than the Sun. This implies that the HST apparent brightness lower threshold is $I_{\rm th} = 2.5 \times 10^{-22}\, I_\odot$. For a star $\star$ like the Sun, $L_\star = L_\odot$. Hence using (36) and the ratio method the distance to the star is found to be $d_L = d_\odot \sqrt{I_\odot/I_{\rm th}} = 6.3 \times 10^{10}~{\rm AU} = 306~{\rm kpc} = 10^6$ ly, where $d_\odot = 1$ AU is the Earth-Sun distance and 1 pc = 206265 AU = 3.262 ly. (iii) For Cepheids, $L_C = 2 \times 10^4\, L_\odot$. Because we want to calculate the limiting distance for observing this object we take $I_C = I_{\rm th}$ and so $d_L = d_\odot \sqrt{(L_C I_\odot)/(L_\odot I_{\rm th})} = 8.94 \times 10^{12}~{\rm AU} = 43.4~{\rm Mpc} = 1.41 \times 10^8$ ly.

2.6 (i) As the Earth moves, the direction to which we point to Eris changes. In order to do calculations, we need to know how fast the Earth travels around the Sun. This can be done simply by remembering that the Earth travels the circumference of the Earth’s orbit in the time of one year, thus the speed is given by the distance divided by the time,

$$v \approx \frac{2\pi \times 1.5 \times 10^8~{\rm km}}{3 \times 10^7~{\rm s}} = 30~{\rm km/s}, \tag{349}$$

to one significant figure (we made the standard approximation that $\pi \approx 3$). At this speed, how far does Earth move in 5 hours (i.e. 18,000 s)? Rounding to 20,000 s, the answer is $\ell = 600{,}000$ km. The parallax diagram is a long skinny triangle, with the 600,000 km of the Earth’s path at the base, the distance d to Eris its length, and a $7.5''$ angle at its apex. If we measure angles in radians and if the angles are small, then there is a very simple relation between the long side d and short side $\ell$ of long skinny triangles: $\theta = \ell/d$. Now, since $7.5'' \approx 4 \times 10^{-5}$ radians, the distance to Eris is $d = \ell/\theta \approx 1.5 \times 10^{10}$ km. There are $1.5 \times 10^8$ km in an AU, so the distance to Eris is 100 AU. (For this problem, we can take the distance from the Sun to Eris, and from the Earth to Eris, as essentially the same.) The semi-major axis of Pluto’s orbit is 40 AU; Eris is quite a bit more distant. Interestingly, Eris is on a highly elliptical orbit, e = 0.44 (much larger than that of any of the planets in the Solar System) and is currently very close to aphelion. Its perihelion is actually within the orbit of Pluto, but it will not be there for another 280 years, given its 560-year orbital period. (ii) The brightness of the Sun as seen at Eris is given by the inverse square law: $I = L_\odot/(4\pi d^2)$. This is the amount of energy per unit time per unit area received at Eris. The cross sectional area of Eris is $\pi r^2$, and thus the total energy per unit time falling on Eris is the product of the apparent brightness and that area, namely: $L_\odot r^2/(4d^2)$. However, only a fraction a of that light is reflected, the rest is absorbed. So our final answer is: $r^2 a L_\odot/(4d^2)$. (iii) What we have calculated in part (ii) is the luminosity (in reflected light, at least), of Eris. We are observing it a distance d away (again, we are taking the approximation that the distance from the Sun to Eris, and from Eris to the Earth, are essentially the same).
So its brightness just follows from the inverse square law, namely:

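$$I_{\rm Eris} = \left(\frac{r}{2d}\right)^2 a L_\odot\, \frac{1}{4\pi d^2} = \frac{r^2 a L_\odot}{16\pi d^4} . \tag{350}$$

The brightness of a distant asteroid falls off as the inverse fourth power of the distance. This is why these distant guys took so long to be discovered; they are really faint. (iv) Let’s solve (350) for r, with b the measured brightness of Eris:

$$r = 4 d^2 \sqrt{\frac{\pi b}{a L_\odot}} \sim 1.5 \times 10^6~{\rm m} . \tag{351}$$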

This is seriously large, larger than Pluto itself. The discovery of Eris (named after the goddess of strife and discord in Greek mythology) set off a controversy in the astronomical community about whether it should be called a planet, and what the definition of a planet is. Reams have been written on this subject. The basic problem is that the concept of a planet has evolved as we have learned more, and we now realize that things that might conceivably be called planets now fall into a variety of categories:

• terrestrial planets, relatively small rocky objects in the inner solar system, including Mercury, Venus, Earth and Mars;

• gas giants, much much larger and more massive bodies in the outer part of the solar system, including Jupiter, Saturn, Uranus and Neptune (many have argued that Uranus and Neptune should be in their own category of ice giants as they are actually mostly frozen gas, not vaporous gas like Jupiter and Saturn);

• dwarf planets, some found in the main asteroid belt between Mars and Jupiter, but a class to which Pluto, Eris, and other recent discoveries belong.

And this list does not yet include the massive moons of the Earth, Jupiter, and Saturn (the largest of which are considerably larger than Pluto), or the planets discovered around other stars. The term planet is now too broad to allow a single, all-encompassing clean definition, and our field has become richer with the discovery of Eris and its brethren. (v) The diameter of Eris is $D_{\rm Eris} \approx 3{,}000$ km, and the distance is d = 100 AU. Imagining the triangle covering the diameter of Eris on one end, and Earth at the other vertex, the angular size of Eris is then

$$\theta = D_{\rm Eris}/d = 2 \times 10^{-7} \approx 40~{\rm milliarcseconds} , \tag{352}$$

where we used the conversion 1 radian $\approx$ 200,000 arcseconds. This is at the very limit of the resolution of HST, so you can resolve it (in fact, HST did). (vi) This is a simple application of Newton’s form of Kepler’s third law, which relates the period and semi-major axis of an orbiting body to the mass of the object it orbits. Here we are given the period and the semi-major axis, and we need to calculate the mass. Solving for the mass gives:

$$M = \frac{4\pi^2 a^3}{G\, T^2} . \tag{353}$$

However, do we have the semi-major axis? What we were given is the angle the semi-major axis subtends in our HST images. Another opportunity to use the small-angle formula. Consider a very long skinny triangle, with length given by the distance from the Earth to Eris (100 AU) and interior angle $0.53''$; we want to find the base of the triangle. It is:

$$a = \theta d = 0.53~{\rm arcsec} \times \frac{1~{\rm radian}}{2 \times 10^5~{\rm arcsec}} \times 1.5 \times 10^{10}~{\rm km} = 37{,}000~{\rm km} . \tag{354}$$

If we plug this into (353) we find:

$$M = \frac{4\pi^2 \times (3.7 \times 10^7~{\rm m})^3}{6.674 \times 10^{-11}~{\rm m^3\, s^{-2}\, kg^{-1}} \times (15.8~{\rm days} \times 86{,}400~{\rm s/day})^2} = 1.6 \times 10^{22}~{\rm kg} . \tag{355}$$

This is remarkable: Eris is a bit more massive than is Pluto.
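The evaluation of (353) with the numbers of (355) can be reproduced with a short Python sketch; the function name is illustrative.

```python
import math

G = 6.674e-11  # m^3 s^-2 kg^-1, as used in (355)


def central_mass(a_m: float, period_s: float) -> float:
    """Mass (kg) of the central body from Newton's form of Kepler's third law."""
    return 4.0 * math.pi ** 2 * a_m ** 3 / (G * period_s ** 2)


a_moon = 3.7e7                 # m, semi-major axis from (354)
t_moon = 15.8 * 86_400.0       # s, orbital period of the moon
m_eris = central_mass(a_moon, t_moon)

print(f"M_Eris = {m_eris:.2e} kg")
```

Because the mass scales as $a^3$, a 5% uncertainty in the small-angle estimate of a translates into roughly a 15% uncertainty in the mass.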

2.7 (i) The luminosity is not isotropic, because the solid angle subtended by the blackbody from different directions will be different, and the surface brightness should be constant, so each observer will see a different flux depending on the direction. (ii) The maximum flux will be seen along the z-axis, because that is where the subtended solid angle is greatest. The solid angle is $\Omega = \pi (a/d_L)^2$, because the blackbody appears as a circular disk of angular radius $a/d_L \ll 1$. In addition, using $a \ll d_L$, the flux is simply $F = \int I\, d\Omega$ (because we can approximate $\cos\theta \simeq 1$, where $\theta$ is the angle between each point and the center of the observed object, whenever the object observed has an angular size much smaller than 1 radian). Using $I = B = \sigma T^4/\pi$ at the surface of the blackbody, $F = I\Omega = \sigma T^4 (a/d_L)^2$. (iii) The minimum flux will be seen along any direction along the equator, where the subtended solid angle will be smallest. Now, the projected image of the ellipsoid is an ellipse, with solid angle $\Omega = \pi ab/d_L^2$, so just as before, $F = I\Omega = \sigma T^4 ab/d_L^2$. (iv) Everyone sees the same apparent surface brightness, $I = \sigma T^4/\pi$. (v) The total luminosity is the area times $\sigma T^4$. All we need is to find the area of the ellipsoid, which can be done for example by dividing the ellipsoid into thin rings of radius r parallel to the x-y plane. For the area A we find:

$$A = 2 \times \int_0^a 2\pi r\, dr \left[1 + \left(\frac{dz}{dr}\right)^2\right]^{1/2} , \tag{356}$$

where $z = b\,(1 - r^2/a^2)^{1/2}$, and

$$L = (\sigma T^4)\, 4\pi \int_0^a dr\, r \left[\frac{a^2 - r^2 (1 - q^2)}{a^2 - r^2}\right]^{1/2} , \tag{357}$$

where $q = b/a$. If you have the patience, the integral can actually be solved. (vi) The galaxy is different because it is made of individual stars, and each star is spherical (or even if they are oblate because they are rotating, they should have their spin axes randomly oriented and uncorrelated). The condition $N R^2 \ll ab$ guarantees that stars do not block each other, so the flux observed is the sum of the flux from each star and the luminosity is isotropic. This is true also for any optically thin gas, where each atom emits isotropically. Therefore, the flux is the same for all observers along different directions:

$$F = \frac{L}{4\pi d_L^2} , \tag{358}$$

where L is the total luminosity of the galaxy, and the apparent surface brightness is different. If all the luminosity is contained within the oblate ellipsoid, the average apparent surface brightness within the projected ellipsoid as seen by the observer on the z-axis is

$$I = F/\Omega = \frac{L}{4\pi^2 a^2} , \tag{359}$$

and for the observer on the equator,

$$I = F/\Omega = \frac{L}{4\pi^2 ab} . \tag{360}$$

The apparent surface brightness is higher when the galaxy is seen edge-on because the surface density of stars is greater, since the line of sight crosses a greater pathlength through the galaxy. (vii) The answer would be modified because stars would block each other so some stars would be occulted. This would be a really compact galaxy since its mean surface brightness would be similar to that of the Sun, and the stellar atmospheres would be actually heated by the other stars in the galaxy significantly. The galaxy would not be stable because every star would on average have a physical collision with another star once every orbit (a good recipe for making a big black hole and some fireworks, more on this below).
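For readers without the patience to solve the area integral of part (v) by hand, here is a hedged numerical sketch. It uses the substitution $r = a \sin\phi$ (our choice, not from the text), which removes the integrable singularity at $r = a$ and turns the integral into $A = 4\pi a^2 \int_0^{\pi/2} \sin\phi\, [1 - (1 - q^2)\sin^2\phi]^{1/2}\, d\phi$. The result is compared with the standard closed form for the surface area of an oblate spheroid.

```python
import math


def spheroid_area_numeric(a: float, b: float, n: int = 20_000) -> float:
    """Midpoint-rule evaluation of the area integral after r = a sin(phi)."""
    q2 = (b / a) ** 2
    h = (math.pi / 2.0) / n
    total = 0.0
    for k in range(n):
        s = math.sin((k + 0.5) * h)
        total += s * math.sqrt(1.0 - (1.0 - q2) * s * s)
    return 4.0 * math.pi * a * a * total * h


def spheroid_area_closed(a: float, b: float) -> float:
    """Closed form for an oblate spheroid with b < a (e is the eccentricity)."""
    e = math.sqrt(1.0 - (b / a) ** 2)
    return 2.0 * math.pi * a * a + math.pi * b * b / e * math.log((1.0 + e) / (1.0 - e))


area_num = spheroid_area_numeric(1.0, 0.5)
area_exact = spheroid_area_closed(1.0, 0.5)
sphere_check = spheroid_area_numeric(1.0, 1.0)   # should reduce to 4*pi

print(area_num, area_exact, sphere_check)
```

Multiplying the area by $\sigma T^4$ then gives the total luminosity.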

2.8 Taking the log10 of (34) we have

$$\log_{10} L = \log_{10}(4\pi R^2 \sigma T^4) = \log_{10}(4\pi\sigma) + \log_{10} R^2 + \log_{10} T^4 = \log_{10}(4\pi\sigma) + 2 \log_{10} R + 4 \log_{10} T . \tag{361}$$

If R is constant then $\log_{10}(4\pi\sigma) + 2\log_{10} R$ = constant. Then, for constant R, the slope of the $\log_{10} L$ vs. $\log_{10} T$ plot is 4, but since in the HR diagram T is plotted increasing to the left the slope is $-4$.

3.1 From the inverse Lorentz transformation (42) we get

$$\tan\theta = \frac{\sin\theta'}{\gamma(\beta - \cos\theta')} . \tag{362}$$

Using the identity $1 + \tan^2\theta = \sec^2\theta$ it is straightforward to obtain (44).

3.2 The distribution in the S system is given by

$$\frac{dN}{d\Omega} = \frac{dN}{d\Omega'}\, \frac{d\Omega'}{d\Omega} , \tag{363}$$

so we need $d\cos\theta'/d\cos\theta$. Invert (44) to obtain

$$\cos\theta' = \frac{\beta + \cos\theta}{\beta\cos\theta + 1} \quad {\rm and\ so} \quad \frac{dN}{d\Omega} = \frac{\kappa\,(1 - \beta^2)}{(1 + \beta\cos\theta)^2} . \tag{364}$$

This is for a source moving away from O. To get the result for motion towards O, just replace $\beta \to -\beta$.

3.3 To show that for $v \ll c$ the Doppler shift in wavelength is approximately v/c we use the binomial expansion:

$$\lambda' = \lambda\, (1 + v/c)^{1/2} (1 - v/c)^{-1/2} \approx \lambda \left[1 + \frac{v}{2c} + \mathcal{O}\!\left(\frac{v^2}{c^2}\right)\right] \left[1 - \left(-\frac{v}{2c}\right) + \mathcal{O}\!\left(\frac{v^2}{c^2}\right)\right] \approx \lambda\, [1 + v/c + \mathcal{O}(v^2/c^2)] , \tag{365}$$

and so $\Delta\lambda/\lambda \approx v/c$.

3.4 The wavelengths from single electron energy level transitions are inversely proportional to the square of the atomic number of the nucleus. Therefore, the lines from singly-ionized helium are usually one fourth the wavelength of the corresponding hydrogen lines. Because of their redshift, the lines have 4 times their usual wavelength (i.e., $\lambda' = 4\lambda$) and so $4\lambda = \lambda \sqrt{(1 + v/c)/(1 - v/c)} \Rightarrow v = 0.88\, c$.

3.5 Let

$$p^\mu = \left(\frac{h\nu}{c},\, -\frac{h\nu}{c}\cos\theta,\, -\frac{h\nu}{c}\sin\theta,\, 0\right) \tag{366}$$

be the momentum 4-vector for the photon as seen in S and

$$p'^\mu = \left(\frac{h\nu'}{c},\, -\frac{h\nu'}{c}\cos\theta',\, -\frac{h\nu'}{c}\sin\theta',\, 0\right) \tag{367}$$

in S′. Do the direct Lorentz transformation from $S \to S'$ to get $\nu'\cos\theta' = \nu\gamma(\cos\theta + \beta)$, $\nu'\sin\theta' = \nu\sin\theta$, and $\nu' = \nu\gamma(1 + \beta\cos\theta)$. Use the last relation to eliminate $\nu$ and $\nu'$ giving

$$\cos\theta' = \frac{\beta + \cos\theta}{1 + \beta\cos\theta} \quad {\rm and} \quad \sin\theta' = \frac{\sin\theta}{\gamma(1 + \beta\cos\theta)} . \tag{368}$$

For small $\beta$, this gives $\cos\theta' \approx \cos\theta + \beta\sin^2\theta$. Now, using $\theta' = \theta - \alpha$, since $\alpha$ is small we have $\cos(\theta - \alpha) = \cos\theta + \alpha\sin\theta$, and so $\alpha \approx \beta\sin\theta$. This is in agreement with the data of Bradley [305], a result which caused problems for the æther theory of electromagnetic waves (for details see e.g. [27]).
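Two of the results above lend themselves to a quick numerical check. The sketch below is illustrative only: it evaluates the exact aberration angle from (368) and compares it with the small-$\beta$ estimate $\alpha \approx \beta\sin\theta$, and it inverts the relativistic Doppler formula of exercise 3.4. The value $\beta = 10^{-4}$ corresponds to the Earth's orbital speed of about 30 km/s; the function names are ours.

```python
import math


def aberration_angle(theta: float, beta: float) -> float:
    """Exact alpha = theta - theta' from the transformation (368)."""
    gamma = 1.0 / math.sqrt(1.0 - beta ** 2)
    denom = 1.0 + beta * math.cos(theta)
    cos_p = (beta + math.cos(theta)) / denom
    sin_p = math.sin(theta) / (gamma * denom)
    return theta - math.atan2(sin_p, cos_p)


def beta_from_wavelength_ratio(ratio: float) -> float:
    """Solve ratio = sqrt((1 + beta) / (1 - beta)) for beta."""
    return (ratio ** 2 - 1.0) / (ratio ** 2 + 1.0)


beta_earth = 1.0e-4
theta = math.radians(60.0)
alpha_exact = aberration_angle(theta, beta_earth)
alpha_approx = beta_earth * math.sin(theta)
alpha_max_arcsec = aberration_angle(math.pi / 2.0, beta_earth) * 206_265.0

beta_helium = beta_from_wavelength_ratio(4.0)

print(alpha_exact, alpha_approx, alpha_max_arcsec, beta_helium)
```

At $\theta = 90^\circ$ the aberration angle comes out at about 20 arcseconds.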

3.6 The universality of Newton’s law of gravitation tells us that all the equations and conclusions derived for the Sun and Earth interaction also hold for any system consisting of a star and a single planet orbiting around it. In particular, the period T of the planet is the same as the period of the star: the time it takes each of them to complete one orbit around their common center-of-mass. The period is the easiest parameter to determine, since precisely what is detected is a periodic motion of the star. (At least this is true in theory; in practice, determining the period from a finite set of observations can prove tricky.) The other quantity that we are usually able to estimate fairly accurately is the mass M of the star, based on the spectral type and luminosity of the star. From (23) it follows that the ratio $T^2/a^3$ is not exactly the same for all planets, but is very close to $4\pi^2/(GM)$, since the ratio m/M is, almost by definition of a planet, very close to zero. (i) As a consequence, the first fact about the unseen planet that we can infer immediately from the knowledge of M and T is its distance from the star, $a = [(G M T^2)/(4\pi^2)]^{1/3} = 0.046$ AU. (ii) In the simplest case of a nearly circular orbit, the planet describes a circle of radius a at constant velocity v, and the star describes a circle with constant velocity V, both orbits having period T. Then $vT = 2\pi a$, and since T determines a, by Kepler’s third law, we also have the velocity v of the planet. Then, from (3) we conclude that $vm = VM$ so that we could determine the mass m of the planet, if we knew the value of V. If the plane of the orbit contained our line of sight, then V would simply be the maximal radial velocity. In general, if i denotes the angle of inclination between the normal to the plane of the orbit and our line of sight to the star, then the maximal radial velocity would be $K = V \sin i$, and hence we can deduce the quantity $m \sin i = KM/v$. Using the measured value of i we get $m = 0.63\, M_J$, where $M_J = 1.898 \times 10^{27}$ kg is the mass of Jupiter. (iii) From (348) it follows that $r = \sqrt{0.02\, R^2} \approx 1.5\, r_J$.

4.1 The flux density of neutrinos at Earth is

$$F_\nu = \frac{10^{38}~{\rm neutrinos/s}}{4\pi d^2} = 3.5 \times 10^{10}~\frac{\rm neutrinos}{\rm cm^2\, s} , \tag{369}$$

where d is the Sun-Earth distance. Thus, the number of neutrinos passing through the brain per second is

$$\mathcal{F}_\nu = F_\nu\, A_{\rm brain} = 3.5 \times 10^{10}~\frac{\rm neutrinos}{\rm cm^2\, s}\; \frac{\pi D_{\rm brain}^2}{4} \approx 6.2 \times 10^{12}~\frac{\rm neutrinos}{\rm s} , \tag{370}$$

where we have assumed that the diameter of the brain is $D_{\rm brain} \simeq 15$ cm.

4.2 (i) The equation of hydrostatic support is (63). For a constant density, we can set $M(r) = 4\rho_0 \pi r^3/3$. Separation of variables and integration from the center to the surface (where $P = P_s = 0$) yields

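$$\int_{P_c}^{P_s} dP = -P_c = -\int_0^R \frac{4\pi G\rho_0 r^3}{3}\,\frac{\rho_0}{r^2}\,dr = -\frac{4\pi}{3}\,G\rho_0^2\int_0^R r\,dr = -\frac{4\pi}{6}\,G\rho_0^2R^2 . \tag{371}$$

Substituting $\rho_0$ by $M/V = M/(4\pi R^3/3)$ we obtain the result

$$P_c = \frac{3}{8\pi}\,G\,\frac{M^2}{R^4} . \tag{372}$$

(ii) The mass within a radius $r$ is

$$M(r) = \int_0^r \rho(x)\,4\pi x^2\,dx = 4\pi\rho_c\left(\frac{1}{3}\,r^3 - \frac{1}{4}\,\frac{r^4}{R}\right) . \tag{373}$$

For $r = R$ the total mass of the star is found to be $M = \pi\rho_c R^3/3$ and therefore we can express the central density in terms of $M$ and $R$,

$$\rho_c = \frac{3}{\pi}\,\frac{M}{R^3}, \quad \text{with which} \quad M(r) = 12M\left(\frac{1}{3}\,\xi^3 - \frac{1}{4}\,\xi^4\right) , \tag{374}$$

where $\xi = r/R$ is the scaled radius. The integral of the equation of hydrostatic support is then

$$P_c = 12\,\frac{3}{\pi}\,G\,\frac{M^2}{R^4}\int_0^1 \frac{\left(\frac{1}{3}\xi^3 - \frac{1}{4}\xi^4\right)(1-\xi)}{\xi^2}\,d\xi = \frac{5}{4\pi}\,G\,\frac{M^2}{R^4} . \tag{375}$$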
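The two central pressures (372) and (375) can be cross-checked numerically. The sketch below is only an illustration, not part of the original solution: it integrates the hydrostatic equation with a midpoint rule in units $G = R = 1$, for a constant profile and for a profile falling linearly to zero at the surface (the form implied by (373)). The function name `central_pressure_coefficient` and the step count are choices made here.

```python
import math


def central_pressure_coefficient(density, steps=100_000):
    """P_c in units of G M^2 / R^4 for a density profile rho(xi), xi = r/R.

    Integrates dP/dr = -G M(r) rho(r) / r^2 with the midpoint rule, setting
    G = R = 1; the overall normalisation of the density cancels in the ratio.
    """
    d_xi = 1.0 / steps
    enclosed_mass = 0.0   # mass inside the inner edge of the current shell
    pressure = 0.0
    for k in range(steps):
        xi = (k + 0.5) * d_xi
        shell_mass = 4.0 * math.pi * density(xi) * xi**2 * d_xi
        mass_at_midpoint = enclosed_mass + 0.5 * shell_mass
        pressure += mass_at_midpoint * density(xi) / xi**2 * d_xi
        enclosed_mass += shell_mass
    return pressure / enclosed_mass**2


coef_constant = central_pressure_coefficient(lambda xi: 1.0)
coef_linear = central_pressure_coefficient(lambda xi: 1.0 - xi)
print(coef_constant, 3.0 / (8.0 * math.pi))   # compare with (372)
print(coef_linear, 5.0 / (4.0 * math.pi))     # compare with (375)
```

The numerical coefficients should agree with $3/(8\pi)$ and $5/(4\pi)$ to well within the discretisation error.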

4.3 From (73) we have

$$P = \frac{1}{3}\,n m v^2 = \frac{2}{3}\,n \langle E \rangle , \tag{376}$$

where $n$ is the particle density. For a non-relativistic degenerate electron gas we have

$$P = \frac{2}{3}\,\frac{\rho}{\mu_e m_p}\,\langle E \rangle = \frac{1}{20}\left(\frac{3}{\pi}\right)^{2/3}\frac{h^2}{m_e}\left(\frac{\rho}{\mu_e m_p}\right)^{5/3} , \tag{377}$$

where $\rho$ is the mass density. Equating the two relations and solving for $\langle E \rangle$ gives

$$\langle E \rangle = \frac{3}{40}\left(\frac{3}{\pi}\right)^{2/3}\frac{h^2}{m_e}\left(\frac{\rho}{\mu_e m_p}\right)^{2/3} . \tag{378}$$

Using the numerical value of the density of Sirius B we obtain $\langle E \rangle = 155.27$ keV. This corresponds to a Lorentz factor $\gamma = 1.30$ and $\beta = 0.64$. The electrons are thus mildly relativistic. The non-relativistic approximation agrees with the full relativistic result to an accuracy of 20% (note that the derivation of the equation of state uses the electron momentum). For larger densities the non-relativistic equation of state is surely not appropriate and we need to use the relativistic one.
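The Lorentz factor and velocity quoted in 4.3 follow from the mean energy in two lines. A minimal sketch, assuming that $\langle E \rangle$ is the mean kinetic energy per electron, so that $\gamma = 1 + \langle E \rangle/(m_e c^2)$; the constant names are illustrative.

```python
import math

ELECTRON_REST_ENERGY_KEV = 510.999   # m_e c^2
MEAN_KINETIC_ENERGY_KEV = 155.27     # <E> quoted in the text for Sirius B

# Assumption: <E> is kinetic energy, so the total energy is (gamma) m_e c^2.
gamma = 1.0 + MEAN_KINETIC_ENERGY_KEV / ELECTRON_REST_ENERGY_KEV
beta = math.sqrt(1.0 - 1.0 / gamma**2)
print(f"gamma = {gamma:.2f}, beta = {beta:.2f}")   # gamma = 1.30, beta = 0.64
```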

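4.4 By setting $P_c = P$ we obtain

$$\frac{\alpha}{\pi}\,G\,\frac{M^2}{R^4} = \frac{1}{8}\left(\frac{3}{\pi}\right)^{1/3} hc \left(\frac{\rho}{\mu_e m_p}\right)^{4/3} = \frac{1}{8}\left(\frac{3}{\pi}\right)^{1/3} hc \left(\frac{3}{4\pi\mu_e m_p}\right)^{4/3}\frac{M^{4/3}}{R^4} , \tag{379}$$

with $\alpha = 3/8$ and $5/4$ for constant and linear density, respectively. Solving for $M$ yields

$$M_{\rm Ch} = \alpha^{-3/2}\,\frac{9}{256\pi}\sqrt{\frac{3}{2}}\left(\frac{hc}{G}\right)^{3/2}\left(\frac{1}{\mu_e m_p}\right)^{2} . \tag{380}$$

This evaluates numerically to $M_{\rm Ch}^{\rm const} = 0.44\,M_\odot$ and $M_{\rm Ch}^{\rm linear} = 0.07\,M_\odot$. For a constant density, the value is about a factor of 3 smaller than the exact result.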
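The numerical values of (380) can be reproduced with the short sketch below. It assumes $\mu_e = 2$ (two nucleons per electron), which is not stated explicitly in this solution but reproduces the quoted numbers; the physical constants are standard values and the function name is illustrative.

```python
import math

H_PLANCK = 6.62607015e-34   # J s
C_LIGHT = 2.99792458e8      # m/s
G_NEWTON = 6.67430e-11      # m^3 kg^-1 s^-2
M_PROTON = 1.67262192e-27   # kg
M_SUN = 1.989e30            # kg
MU_E = 2.0                  # assumption: nucleons per electron


def chandrasekhar_mass(alpha, mu_e=MU_E):
    """Evaluate (380) in kg for a given central-pressure coefficient alpha."""
    prefactor = 9.0 / (256.0 * math.pi) * math.sqrt(1.5)
    return (alpha ** -1.5 * prefactor
            * (H_PLANCK * C_LIGHT / G_NEWTON) ** 1.5
            / (mu_e * M_PROTON) ** 2)


mass_constant = chandrasekhar_mass(3.0 / 8.0) / M_SUN   # constant density
mass_linear = chandrasekhar_mass(5.0 / 4.0) / M_SUN     # linear density
print(f"{mass_constant:.2f} {mass_linear:.2f}")          # 0.44 0.07
```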

4.5 A type Ia supernova explosion is 10 billion times more luminous than the Sun (for a few days). Using the result of exercise 2.5 we write $D = d_\odot \sqrt{(L_{\rm Ia}\,I_\odot)/(L_\odot\,I_{\rm th})} = 6.32 \times 10^{15}$ AU $\approx 30.6$ Gpc $\approx 10^{11}$ ly, where $I_{\rm th}$ is the limiting brightness for detection with HST.

4.6 (i) The radius of the blast wave can be read off the figures taking into account the height of the tower. Note that the shock wave is not at the border of the fireball, but at the end of the compression layer that is growing with time (seen as a faint layer in e.g. the figure at 0.053 s). Using a ruler we estimate the numbers given in Table V, with an estimated precision of 6 m (corresponding to the 1/16th inch sub-division of the ruler). (ii) The Sedov–Taylor expansion is described by (98). In a log-log plot this corresponds to a line $\log_{10} r = a + b \log_{10} t$, with $a = \frac{1}{5}\log_{10}(E/\rho_1)$ and $b = 2/5$. The fitted slope, $b = 0.42$, is indeed close to the value expected for the Sedov–Taylor phase of $2/5 = 0.4$ (least-square fitting of a power law with error bars yields $0.40 \pm 0.02$, i.e. perfect compatibility within one standard deviation). The best value of $a$ is the one minimizing the squared distances to the data points:

$$\frac{d}{da}\left\{\sum_{i=1}^{N}\left[\log_{10}(r_i) - a - b\log_{10}(t_i)\right]^2\right\} = -2\sum_{i=1}^{N}\left[\log_{10}(r_i) - a - b\log_{10}(t_i)\right] . \tag{381}$$

Setting the derivative to zero leads to

$$a = \frac{1}{N}\left[\sum_i \log_{10}(r_i) - \frac{2}{5}\sum_i \log_{10}(t_i)\right] = 2.77 . \tag{382}$$

```python
import numpy as np
import matplotlib.pyplot as plt

# Data arrays assumed to hold the Table V values: time (s), shock radius (m)
x = np.array([0.006, 0.016, 0.025, 0.053, 0.062, 0.090])
y = np.array([75.0, 108.0, 138.0, 200.0, 206.0, 218.0])
xLog, yLog = np.log10(x), np.log10(y)

# In [4]: best intercept for the fixed slope b = 2/5
a = (np.sum(yLog) - 2 / 5 * np.sum(xLog)) / len(yLog)
print("a=", a)  # original notebook output: a= 2.77687821514

# In [5]: superimpose this result with the data points
plt.figure()
plt.errorbar(x, y, yerr=6, fmt='o')
plt.xlim(0, 0.1)
xFit = np.arange(0.001, 0.1, 0.001)
yFit = 10**a * xFit**(2 / 5)
plt.plot(xFit, yFit)
plt.xlabel('t [s]')
plt.ylabel('r [m]')
plt.show()
```

TABLE V: Expansion of the shock front as a function of time.

    time (s)    shock radius (m)
    0.006        75
    0.016       108
    0.025       138
    0.053       200
    0.062       206
    0.090       218

FIG. 41: Expansion of the shock front as a function of time [306].

The results of the fit are shown in Fig. 41. Solving $a$ for $E$ yields $E = 10^{5a}\rho_1$. The density of air at 1,100 m above sea level is about 1.1 kg/m$^3$ and 1 ton TNT equivalent is $4.184 \times 10^9$ J. Therefore, $E = 7.78 \times 10^{13}$ J = 18,595 ton TNT. The official yield estimate of Trinity is 16,800 ton TNT $< E <$ 23,700 ton TNT [39]. Note that the Sedov–Taylor expression was derived on the basis of a spherical explosion, whereas in this case the blast wave expands hemispherically. Following the original reasoning by Taylor [39], we have implicitly assumed “. . . that it may be justifiable to assume that most of the energy associated with the part of the blast wave which strikes the ground is absorbed there.”
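Because $E = 10^{5a}\rho_1$ depends exponentially on the intercept, the yield is sensitive to how $a$ is rounded. The sketch below, which uses only the Table V values and the constants quoted in the text, compares the yield for the rounded intercept $a = 2.77$ of (382) with the yield for the unrounded least-squares intercept. The function names are illustrative; the unrounded value comes out at roughly 20 kilotons, still inside the official range.

```python
import math

# Table V: time after detonation (s) and shock radius (m)
TIMES = [0.006, 0.016, 0.025, 0.053, 0.062, 0.090]
RADII = [75.0, 108.0, 138.0, 200.0, 206.0, 218.0]

AIR_DENSITY = 1.1              # kg/m^3, value quoted in the text
JOULES_PER_TON_TNT = 4.184e9
SLOPE = 2.0 / 5.0              # Sedov-Taylor exponent, held fixed


def fit_intercept(times, radii, slope=SLOPE):
    """Least-squares intercept a of log10 r = a + slope * log10 t."""
    log_r = sum(math.log10(r) for r in radii)
    log_t = sum(math.log10(t) for t in times)
    return (log_r - slope * log_t) / len(radii)


def yield_tons(intercept, density=AIR_DENSITY):
    """Blast energy E = 10**(5 a) * rho, expressed in tons of TNT."""
    return 10.0 ** (5.0 * intercept) * density / JOULES_PER_TON_TNT


a_fit = fit_intercept(TIMES, RADII)
tons_rounded = yield_tons(2.77)      # intercept rounded as in (382)
tons_unrounded = yield_tons(a_fit)   # full-precision intercept
print(f"a = {a_fit:.4f}")
print(f"E(a = 2.77) = {tons_rounded:.0f} ton TNT")
print(f"E(a = {a_fit:.4f}) = {tons_unrounded:.0f} ton TNT")
```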

5.1 (i) Differentiating $\vec\sigma(u, v)$ with respect to $u$ and $v$ yields

$$\frac{\partial\vec\sigma}{\partial u}(u, v) \equiv \vec\sigma_u(u, v) = \begin{pmatrix} -\sin u \sin v \\ \cos u \sin v \\ 0 \end{pmatrix} \quad \text{and} \quad \frac{\partial\vec\sigma}{\partial v}(u, v) \equiv \vec\sigma_v(u, v) = \begin{pmatrix} \cos u \cos v \\ \sin u \cos v \\ -\sin v \end{pmatrix} . \tag{383}$$

The coefficients of the first fundamental form may be found by taking the dot product of the partial derivatives

$$E = \vec\sigma_u \cdot \vec\sigma_u = \sin^2 v, \quad F = \vec\sigma_u \cdot \vec\sigma_v = 0, \quad G = \vec\sigma_v \cdot \vec\sigma_v = 1 . \tag{384}$$

The line element may be expressed in terms of the coefficients of the first fundamental form as $ds^2 = \sin^2 v\,du^2 + dv^2$. (ii) The surface area is given by

$$A = \int_0^{\pi}\!\!\int_0^{2\pi} \sqrt{EG - F^2}\,du\,dv = \int_0^{\pi}\!\!\int_0^{2\pi} \sin v\,du\,dv = 2\pi\,(-\cos v)\Big|_0^{\pi} = 4\pi . \tag{385}$$

The coefficients of the second fundamental form are

$$e = \vec\sigma_{uu} \cdot \hat n = \sin^2 v, \quad f = \vec\sigma_{uv} \cdot \hat n = 0, \quad g = \vec\sigma_{vv} \cdot \hat n = 1 . \tag{386}$$

(iii) The Gaussian curvature is

$$K = \frac{\det {\rm II}}{\det {\rm I}} = \frac{eg - f^2}{EG - F^2} = 1 . \tag{387}$$

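5.2 (i) The coefficients of the first fundamental form are

$$E = \tanh^2 u, \quad F = 0, \quad G = {\rm sech}^2 u . \tag{388}$$

The line element is $ds^2 = \tanh^2 u\,du^2 + {\rm sech}^2 u\,dv^2$. (ii) The surface area is

$$A = 2\int_0^{2\pi}\!\!\int_0^{\infty} {\rm sech}\,u\,\tanh u\,du\,dv = 4\pi , \tag{389}$$

which is exactly that of the sphere. (iii) The coefficients of the second fundamental form are

$$e = -{\rm sech}\,u\,\tanh u, \quad f = 0, \quad g = {\rm sech}\,u\,\tanh u . \tag{390}$$

The Gaussian curvature is $K = -1$.

5.3 We have

$$\frac{d}{dt}\|\dot\gamma\|^2 = \frac{d}{dt}\left(\dot{\vec\gamma} \cdot \dot{\vec\gamma}\right) = 2\,\ddot{\vec\gamma} \cdot \dot{\vec\gamma} . \tag{391}$$

Since $\gamma$ is a geodesic, $\ddot{\vec\gamma}$ is perpendicular to the tangent plane, which contains $\dot{\vec\gamma}$. Hence, $\ddot{\vec\gamma} \cdot \dot{\vec\gamma} = 0$. Subsequently, $d\|\dot\gamma\|^2/dt = 0$. Therefore, the speed $\|\dot\gamma\|$ is constant.

5.4 The tangent plane is spanned by $\vec\sigma_u$ and $\vec\sigma_v$. By definition the curve $\gamma$ is a geodesic if $\ddot{\vec\gamma} \cdot \vec\sigma_u = \ddot{\vec\gamma} \cdot \vec\sigma_v = 0$. Since $\dot{\vec\gamma} = \dot u\,\vec\sigma_u + \dot v\,\vec\sigma_v$, it follows that $\ddot{\vec\gamma} \cdot \vec\sigma_u = 0$ becomes

$$\left[\frac{d}{dt}\left(\dot u\,\vec\sigma_u + \dot v\,\vec\sigma_v\right)\right] \cdot \vec\sigma_u = 0 . \tag{392}$$

We rewrite the left hand side of the above equation:

$$\begin{aligned} \left[\frac{d}{dt}\left(\dot u\,\vec\sigma_u + \dot v\,\vec\sigma_v\right)\right] \cdot \vec\sigma_u &= \frac{d}{dt}\left[\left(\dot u\,\vec\sigma_u + \dot v\,\vec\sigma_v\right) \cdot \vec\sigma_u\right] - \left(\dot u\,\vec\sigma_u + \dot v\,\vec\sigma_v\right) \cdot \frac{d\vec\sigma_u}{dt} \\ &= \frac{d}{dt}\left(E\dot u + F\dot v\right) - \left(\dot u\,\vec\sigma_u + \dot v\,\vec\sigma_v\right) \cdot \left(\dot u\,\vec\sigma_{uu} + \dot v\,\vec\sigma_{uv}\right) \\ &= \frac{d}{dt}\left(E\dot u + F\dot v\right) - \left[\dot u^2\,(\vec\sigma_u \cdot \vec\sigma_{uu}) + \dot u\dot v\,(\vec\sigma_u \cdot \vec\sigma_{uv} + \vec\sigma_v \cdot \vec\sigma_{uu}) + \dot v^2\,(\vec\sigma_v \cdot \vec\sigma_{uv})\right] . \end{aligned} \tag{393}$$

We have that

$$\vec\sigma_u \cdot \vec\sigma_{uu} = \frac{1}{2}\frac{\partial}{\partial u}\left(\vec\sigma_u \cdot \vec\sigma_u\right) = \frac{1}{2}E_u, \quad \vec\sigma_v \cdot \vec\sigma_{uv} = \frac{1}{2}G_u, \quad \vec\sigma_u \cdot \vec\sigma_{uv} + \vec\sigma_v \cdot \vec\sigma_{uu} = F_u . \tag{394}$$

Substituting them into (393), we obtain

$$\left[\frac{d}{dt}\left(\dot u\,\vec\sigma_u + \dot v\,\vec\sigma_v\right)\right] \cdot \vec\sigma_u = \frac{d}{dt}\left(E\dot u + F\dot v\right) - \frac{1}{2}\left(E_u\dot u^2 + 2F_u\dot u\dot v + G_u\dot v^2\right) . \tag{395}$$

This establishes the first differential equation (113). Similarly, (114) can be established from

$$\left[\frac{d}{dt}\left(\dot u\,\vec\sigma_u + \dot v\,\vec\sigma_v\right)\right] \cdot \vec\sigma_v = 0 . \tag{396}$$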

5.5 For the parametrization in (115) the first fundamental form is found to be $ds^2 = d\theta^2 + \cos^2\theta\,d\phi^2$, with $E = 1$, $F = 0$, and $G = \cos^2\theta$. We restrict to unit-speed curves $\gamma(t) = \vec\sigma(\theta(t), \phi(t))$, so that

$$E\dot\theta^2 + 2F\dot\theta\dot\phi + G\dot\phi^2 = \dot\theta^2 + \dot\phi^2\cos^2\theta = 1 . \tag{397}$$

If $\gamma$ is a geodesic, then (114) is satisfied. Here (114) reduces to

$$\frac{d}{dt}\left(\dot\phi\cos^2\theta\right) = 0, \quad \text{or equivalently} \quad \dot\phi\cos^2\theta = \zeta , \tag{398}$$

where $\zeta$ is a constant. There are two cases: (i) $\zeta = 0$; then $\dot\phi = 0$. In this case, $\phi$ is constant and $\gamma$ is part of a meridian. (ii) $\zeta \neq 0$. Substituting (398) into the unit-speed condition (397), we have

$$\dot\theta^2 = 1 - \frac{\zeta^2}{\cos^2\theta} . \tag{399}$$

Combining the above with (398), along the geodesic it holds that

$$\left(\frac{d\phi}{d\theta}\right)^2 = \frac{\dot\phi^2}{\dot\theta^2} = \frac{1}{\cos^2\theta\,(\cos^2\theta/\zeta^2 - 1)} . \tag{400}$$

Integrating the derivative $d\phi/d\theta$ gives

$$\phi - \phi_0 = \pm\int \frac{d\theta}{\cos\theta\sqrt{\cos^2\theta/\zeta^2 - 1}} , \tag{401}$$

where $\phi_0$ is a constant. The substitution $u = \tan\theta$ yields

$$\phi - \phi_0 = \pm\int \frac{du}{\sqrt{\zeta^{-2} - 1 - u^2}} = \pm\sin^{-1}\left(\frac{u}{\sqrt{\zeta^{-2} - 1}}\right) , \tag{402}$$

which leads to

$$\tan\theta = \pm\sin(\phi - \phi_0)\sqrt{\zeta^{-2} - 1} . \tag{403}$$

Multiplying both sides of the above equation by $\cos\theta$ gives

$$\sin\theta = \pm\sqrt{\zeta^{-2} - 1}\,\left(\cos\phi_0\cos\theta\sin\phi - \sin\phi_0\cos\theta\cos\phi\right) . \tag{404}$$

Since $\vec\sigma(\theta, \phi) = (x, y, z)$, we have

$$z = \mp\left(\sin\phi_0\sqrt{\zeta^{-2} - 1}\right) x \pm \left(\cos\phi_0\sqrt{\zeta^{-2} - 1}\right) y . \tag{405}$$

Clearly, $z = 0$ when $x = y = 0$. Therefore, $\gamma$ is contained in the intersection of $S^2$ with a plane through the center of the sphere. Hence it is part of a great circle.
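As a numerical sanity check of the step from (403) to (405), the sketch below samples points that satisfy (403) with the upper sign and evaluates how far they lie from the plane (405). It assumes that the parametrization (115) is $\vec\sigma(\theta, \phi) = (\cos\theta\cos\phi, \cos\theta\sin\phi, \sin\theta)$, which is consistent with $E = 1$ and $G = \cos^2\theta$ but is not written out in this solution; the values of $\zeta$ and $\phi_0$ are arbitrary.

```python
import math


def geodesic_point(phi, zeta, phi0):
    """Point on the unit sphere satisfying (403) with the upper sign."""
    k = math.sqrt(zeta ** -2 - 1.0)
    theta = math.atan(k * math.sin(phi - phi0))
    # Assumed parametrization: (cos(theta)cos(phi), cos(theta)sin(phi), sin(theta))
    return (math.cos(theta) * math.cos(phi),
            math.cos(theta) * math.sin(phi),
            math.sin(theta))


def plane_residual(point, zeta, phi0):
    """Left side minus right side of (405), upper sign; zero on the plane."""
    k = math.sqrt(zeta ** -2 - 1.0)
    x, y, z = point
    return z + k * math.sin(phi0) * x - k * math.cos(phi0) * y


ZETA, PHI0 = 0.6, 0.3   # arbitrary illustrative constants with 0 < zeta < 1
residuals = [plane_residual(geodesic_point(0.01 * j, ZETA, PHI0), ZETA, PHI0)
             for j in range(629)]
max_residual = max(abs(r) for r in residuals)
print(max_residual)   # should vanish up to rounding error
```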

5.6 (i) (132) gives the rate at which a clock at radius $r$ ticks relative to one infinitely far away. Here we are asked to compare the rate of a clock at radius $r$ relative to one at the radius of the Earth (i.e., at the Earth’s surface). We can think about this by considering the rate of each of these clocks relative to a distant clock. That is, the clock on the Earth’s surface has a rate a factor

$$\sqrt{1 - \frac{2GM_\oplus}{c^2}\,\frac{1}{R_\oplus}} \tag{406}$$

slower than the distant clock, while the clock at radius $r$ has a rate

$$\sqrt{1 - \frac{2GM_\oplus}{c^2}\,\frac{1}{r}} . \tag{407}$$

Note that both these expressions are less than one, but because $r > R_\oplus$, the stationary clock at radius $r$ ticks faster than that at the Earth’s surface. Indeed, the relative rate of the two is just the ratio of these two expressions

$$\sqrt{\left(1 - \frac{2GM_\oplus}{c^2}\,\frac{1}{r}\right)\left(1 - \frac{2GM_\oplus}{c^2}\,\frac{1}{R_\oplus}\right)^{-1}} . \tag{408}$$

Again, the expression in (408) is greater than 1. (ii) Circular motion at speed $v$ at a radius $r$ gives rise to an acceleration $v^2/r$, which we know is due to gravity. Thus if the astronaut has a mass $m$, Newton’s second law yields

$$\frac{GM_\oplus m}{r^2} = m\,\frac{v^2}{r}, \quad \text{or solving for } v \text{ gives} \quad v = \sqrt{\frac{GM_\oplus}{r}} . \tag{409}$$

The time dilation in special relativity is due to the by now familiar factor $(1 - v^2/c^2)^{1/2}$, which gives

$$\sqrt{1 - \frac{GM_\oplus}{rc^2}} . \tag{410}$$

Note again how similar this looks to the expression above for time dilation due to gravity. Again, this is the rate at which an orbiting clock at radius $r$ ticks relative to a stationary clock at the same radius. (iii) In (408), we calculated the ratio of rates of stationary clocks at radius $r$ and $R_\oplus$ (due to general relativity), while in (410), we calculated the ratio of the rates of an orbiting clock at radius $r$ to a stationary clock at the same radius. Therefore, the ratio of the rate of an orbiting clock at radius $r$ to a stationary one on the ground is simply the product of these two results; namely,

$$\sqrt{\left(1 - \frac{2GM_\oplus}{rc^2}\right)\left(1 - \frac{2GM_\oplus}{R_\oplus c^2}\right)^{-1}\left(1 - \frac{GM_\oplus}{rc^2}\right)} . \tag{411}$$

(iv) We now simplify (411). We will do this in pieces; starting from (408) we can write:

$$\sqrt{\left(1 - \frac{2GM_\oplus}{c^2}\,\frac{1}{r}\right)\left(1 - \frac{2GM_\oplus}{c^2}\,\frac{1}{R_\oplus}\right)^{-1}} \approx \sqrt{\left(1 - \frac{2GM_\oplus}{c^2}\,\frac{1}{r}\right)\left(1 + \frac{2GM_\oplus}{c^2}\,\frac{1}{R_\oplus} + \cdots\right)} . \tag{412}$$

We then use $(1 - x)(1 - y) \approx 1 - (x + y)$ to re-write (412) as

$$\sqrt{1 - \frac{GM_\oplus}{c^2}\left(\frac{2}{r} - \frac{2}{R_\oplus}\right)} . \tag{413}$$

This then gets multiplied by (410), yielding

$$\sqrt{1 - \frac{GM_\oplus}{c^2}\left(\frac{3}{r} - \frac{2}{R_\oplus}\right)} \approx 1 - \frac{GM_\oplus}{2c^2}\left(\frac{3}{r} - \frac{2}{R_\oplus}\right) . \tag{414}$$

However, we do need to justify the use of the various approximations we have made. We dealt with a variety of expressions of the form $1 - x$; in every case $x$ is of the form $GM_\oplus/(rc^2)$. The smallest $r$ we considered, and therefore the largest value of the expression $GM_\oplus/(rc^2)$, is $r = R_\oplus$. So we plug in numbers at $r = R_\oplus$ to obtain

$$\frac{GM_\oplus}{R_\oplus c^2} = \frac{2 \times 10^{-10}\ {\rm m^3\,s^{-2}\,kg^{-1}} \times 6 \times 10^{24}\ {\rm kg}}{3 \times 6.4 \times 10^{6}\ {\rm m} \times (3 \times 10^{8}\ {\rm m/s})^2} \approx 7 \times 10^{-10} , \tag{415}$$

which is indeed a number much smaller than 1. (v) We are asked to find the radius at which (414) is equal to unity. This clearly holds when $3/r - 2/R_\oplus = 0$, or $r = 1.5R_\oplus$. Given the radius of the Earth is 6,400 km, the critical radius $r = 1.5R_\oplus$ is at a distance of 9,600 km from the Earth’s center, or 3,200 km above the Earth’s surface. Now, (414) is less than 1 for $r < 1.5R_\oplus$, and so astronauts on the space shuttle age less than those staying home.

5.7 (i) The Schwarzschild radius of a black hole of mass $M$ is $R_{\rm Sch} = 2GM/c^2$. The volume of a sphere of this radius is just the familiar $4\pi R_{\rm Sch}^3/3$. The density is the mass divided by the volume, giving:

$$\rho_{\rm BH} = M \times \left[\frac{4}{3}\pi\left(\frac{2GM}{c^2}\right)^3\right]^{-1} = \frac{3c^6}{32\pi G^3M^2} . \tag{416}$$

The more massive the black hole, the smaller the density. Thus there is a mass at which the black hole has the density of paper, which is what we are trying to figure out. (ii) The density is the mass per unit volume. If we can figure out the volume of a square meter of paper (whose mass we know, 75 g), we can calculate its density. The volume of a piece of paper is its area times its thickness. The thickness is 0.1 mm, or $10^{-4}$ m, and so the volume of a square meter of paper is $10^{-4}$ m$^3$. Therefore, the density of paper is

$$\rho = \frac{7.5 \times 10^{-2}\ {\rm kg}}{10^{-4}\ {\rm m^3}} = 7.5 \times 10^{2}\ {\rm kg/m^3} , \tag{417}$$

similar to (but slightly less than) the density of water (remember, paper is made of wood, and wood floats in water). (iii) Here we equate (416) with (417) and solve for the mass

$$M = \sqrt{\frac{3c^6}{32\pi G^3\rho}} \approx 3 \times 10^{38}\ {\rm kg} , \tag{418}$$

where we have made all the usual approximations of $\pi = 3$, $3^2 = 10$, and so on. We need to express this in solar masses, so we divide by $M_\odot = 2 \times 10^{30}$ kg to obtain $M \approx 1.5 \times 10^8 M_\odot$. A black hole 150 million times the mass of the Sun has the same density as a piece of paper. We know the Hitchhiker’s Guide to the Galaxy is science fiction, but do such incredibly massive black holes actually exist? Indeed they do: the cores of massive galaxies (including our own Milky Way) do contain such enormous black holes. Actually, the most massive such black hole known to exist is in the core of a particularly luminous galaxy known as M87, with a mass of 3 billion solar masses. We still have to calculate the Schwarzschild radius of a black hole. We could plug into the formula for a Schwarzschild radius and calculate away, but here we outline a simpler approach. We know the Schwarzschild radius is proportional to the mass of a black hole, and we happen to remember that a $1\,M_\odot$ black hole has a Schwarzschild radius of 3 km, (129). So a 150 million $M_\odot$ black hole has a Schwarzschild radius 150 million times larger, or $4.5 \times 10^8$ km. We are asked to express this in terms of AU; 1 AU $= 1.5 \times 10^8$ km, so the Schwarzschild radius of such a black hole is 3 AU. (iv) We know the entire mass of the black hole. If we can calculate the mass of a single piece of paper, the ratio of the two gives the total number of pages. So we now turn to calculate the mass of a single piece of paper. We know that a square meter of paper has a mass of 75 g. How many square meters is a standard-size sheet? One inch is 2.5 cm $= 2.5 \times 10^{-2}$ m. So $8.5 \times 11\ {\rm inch^2} \approx 100\ {\rm inch^2} \approx 6 \times 10^{-2}\ {\rm m^2}$. Thus the mass is

$$\text{Mass of a piece of paper} = 7.5 \times 10^{-2}\ {\rm kg/m^2} \times 6 \times 10^{-2}\ {\rm m^2} \approx 5 \times 10^{-3}\ {\rm kg} . \tag{419}$$

That is, a piece of paper weighs about 5 g. We divide this into the mass we calculated above:

$$\text{Number of sheets of paper} = \frac{\text{Mass of rule book}}{\text{Mass per page}} = \frac{3 \times 10^{38}\ {\rm kg}}{5 \times 10^{-3}\ {\rm kg/page}} = 6 \times 10^{40}\ \text{pages} . \tag{420}$$

Strictly speaking, if the rule book is printed on both sides of the page, we should multiply the above result by a factor of two. That is one seriously long set of rules! Finally, note that because the density of a more massive black hole is smaller (416), the mass and number of pages of the Brockian Ultra Cricket rule book as given by (420) is really just a lower limit. That is, if the rule book were even larger than what we have just calculated, it would still collapse into a black hole.
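The order-of-magnitude arithmetic in 5.7 can be repeated without the rounding shortcuts. The following sketch evaluates (416)–(420) with standard constants; the paper properties are the ones quoted in the text, while the constant names and the exact inch-to-metre conversion are choices made here.

```python
import math

G_NEWTON = 6.674e-11      # m^3 kg^-1 s^-2
C_LIGHT = 2.998e8         # m/s
M_SUN = 1.989e30          # kg
AU = 1.496e11             # m
INCH = 0.0254             # m

PAPER_AREAL_DENSITY = 0.075   # kg per square meter (75 g)
PAPER_THICKNESS = 1.0e-4      # m (0.1 mm)

paper_density = PAPER_AREAL_DENSITY / PAPER_THICKNESS            # eq. (417)
bh_mass = math.sqrt(3.0 * C_LIGHT**6
                    / (32.0 * math.pi * G_NEWTON**3 * paper_density))  # eq. (418)
schwarzschild_radius_au = 2.0 * G_NEWTON * bh_mass / C_LIGHT**2 / AU
sheet_mass = PAPER_AREAL_DENSITY * (8.5 * INCH) * (11.0 * INCH)  # eq. (419)
pages = bh_mass / sheet_mass                                     # eq. (420)

print(f"M = {bh_mass:.2e} kg = {bh_mass / M_SUN:.2e} M_sun")
print(f"R_Sch = {schwarzschild_radius_au:.2f} AU")
print(f"pages = {pages:.1e}")
```

The unrounded results should land close to the estimates in the text: a few times $10^{38}$ kg, about 3 AU, and several times $10^{40}$ pages.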

5.8 (i) The Schwarzschild radius of a $3M_\odot$ black hole is $R_{\rm Sch} = 2GM/c^2 = 9$ km. If you remember that for a $1\,M_\odot$ black hole the Schwarzschild radius is 3 km, you can scale from there. (ii) Using Newton’s law of gravitation, we can write the difference in gravitational forces acting on two bodies of mass $m$ which are located at distances $r_1$ and $r_2$ from the massive body of mass $M$

$$\delta F \equiv F_1 - F_2 = \frac{GmM}{r_1^2} - \frac{GmM}{r_2^2} = GmM\left(\frac{1}{r_1^2} - \frac{1}{r_2^2}\right) . \tag{421}$$

We are interested in the difference in gravitational forces in two locations that are close to each other, since the height of the person falling into the black hole is small compared to the Schwarzschild radius. We take $r_2 = r_1 + \Delta$, where $\Delta \ll r_1$. Now, we simplify (421); dropping the subscript 1 we have

$$\delta F = GmM\left[\frac{1}{r^2} - \frac{1}{(r + \Delta)^2}\right] = GmM\,\frac{(r + \Delta)^2 - r^2}{r^2(r + \Delta)^2} = GmM\,\frac{r^2 + 2r\Delta + \Delta^2 - r^2}{r^2(r + \Delta)^2} \approx GmM\,\frac{2\Delta}{r^3} . \tag{422}$$

In obtaining this expression, we used the approximations $r + \Delta = r(1 + \Delta/r) \approx r$ and $2r\Delta + \Delta^2 = 2r\Delta[1 + \Delta/(2r)] \approx 2r\Delta$, because $\Delta \ll r$. Next, we use (422) to find the distance $r_{\rm crit}$ from the black hole where the relative stretching force between your head and your legs is equal to some critical force

$$\delta F_{\rm crit} = GmM\,\frac{2\Delta}{r_{\rm crit}^3} \quad \text{and so} \quad r_{\rm crit} = \left(\frac{2GmM\Delta}{\delta F_{\rm crit}}\right)^{1/3} . \tag{423}$$

Finally, we can plug in the numbers. The mass of the black hole is $M = 3M_\odot = 6 \times 10^{30}$ kg. The mass of the body is $m = 70$ kg, and $\delta F_{\rm crit} = 10$ kN. The critical radius is then $r_{\rm crit} \approx 2000$ km or, recalling that the Schwarzschild radius for a $3M_\odot$ black hole is $R_{\rm Sch} \sim 10$ km, we have $r_{\rm crit} \sim 200R_{\rm Sch}$. Note that in this case a significant amount of stretching occurs already relatively far from the black hole. (iii) The force with which the metal plate is pulling on you is given by Newton’s second law, $F = m_{\rm sp}\,g$, where $m_{\rm sp}$ is the mass of the steel plate, and $g \sim 10$ m/s$^2$ is the gravitational acceleration on the Earth. Thus $m_{\rm sp} = 10\ {\rm kN}/(10\ {\rm m/s^2}) = 1,000$ kg, or 1 ton. If you still have a hard time imagining how much weight 10 kN is, this is the weight of a typical car. So, imagine attaching a car to your feet: not pleasant. Most likely this is enough to kill or at the very least severely disable a person. (iv) We can apply the formula for $r_{\rm crit}$ from (423). Now the mass of the black hole is $1.3 \times 10^6$ times larger, so the radius increases by $(1.3 \times 10^6)^{1/3} \approx 100$ times. The answer is then $r_{\rm crit} = 2 \times 10^5$ km. In terms of the Schwarzschild radii, remember that $R_{\rm Sch}$ is linearly proportional to the mass. For a 4 million solar mass black hole, the Schwarzschild radius is then $1.3 \times 10^6 \times 10\ {\rm km} \approx 10^7$ km and so $r_{\rm crit} \approx 2 \times 10^{-2}R_{\rm Sch}$. Since $r_{\rm crit} < R_{\rm Sch}$, the “spaghettification” happens inside the Schwarzschild radius. (v) As we saw in (iv), the radius at which the tidal force reaches 10 kN grows as the third root of the mass of the black hole, but the Schwarzschild radius grows linearly with the mass of the hole. In part (ii) for the $3M_\odot$ hole the critical radius was outside $R_{\rm Sch}$, while in part (iv) for the $4 \times 10^6 M_\odot$ hole the critical radius was inside. Thus, there should be a minimum mass of the hole, at which $r_{\rm crit} = R_{\rm Sch}$, i.e., we can just barely pass through the horizon before getting fatally stretched. We find this mass setting $r_{\rm crit} = R_{\rm Sch}$, which leads to

$$\left(\frac{2GmM_{\rm min}\Delta}{\delta F_{\rm crit}}\right)^{1/3} = \frac{2GM_{\rm min}}{c^2} , \tag{424}$$

and so

$$M_{\rm min} = \frac{c^3}{2G}\left(\frac{m\Delta}{\delta F_{\rm crit}}\right)^{1/2} = 2 \times 10^{34}\ {\rm kg} = 10^4 M_\odot . \tag{425}$$

So, if you fall into a $10^4 M_\odot$ black hole, you will be killed right as you go through the horizon. If the black hole is more massive, then you can go through the horizon while still alive, and enjoy the sights! Sadly, you will not have much time to enjoy the view anyway, because you will be crushed by the singularity in 0.01 s for this $10^4 M_\odot$ black hole. This time is proportional to the mass of the hole.
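The numbers in 5.8 follow from (423) and (425) once a body length $\Delta$ is chosen. The sketch below assumes $\Delta = 1$ m, a value that is not stated in this solution but reproduces the quoted results to the stated precision; function and constant names are illustrative.

```python
import math

G_NEWTON = 6.674e-11    # m^3 kg^-1 s^-2
C_LIGHT = 2.998e8       # m/s
M_SUN = 1.989e30        # kg

BODY_MASS = 70.0        # kg
BODY_LENGTH = 1.0       # m, assumed value of Delta
CRITICAL_FORCE = 1.0e4  # N (10 kN)


def critical_radius(bh_mass):
    """Radius at which the tidal stretching force reaches CRITICAL_FORCE, eq. (423)."""
    return (2.0 * G_NEWTON * BODY_MASS * bh_mass * BODY_LENGTH
            / CRITICAL_FORCE) ** (1.0 / 3.0)


def schwarzschild_radius(bh_mass):
    return 2.0 * G_NEWTON * bh_mass / C_LIGHT**2


# Mass at which the critical radius coincides with the horizon, eq. (425)
minimum_mass = (C_LIGHT**3 / (2.0 * G_NEWTON)
                * math.sqrt(BODY_MASS * BODY_LENGTH / CRITICAL_FORCE))

for solar_masses in (3.0, 4.0e6):
    mass = solar_masses * M_SUN
    ratio = critical_radius(mass) / schwarzschild_radius(mass)
    print(f"{solar_masses:.0e} M_sun: r_crit = {critical_radius(mass) / 1e3:.3g} km,"
          f" r_crit/R_Sch = {ratio:.3g}")
print(f"M_min = {minimum_mass:.2e} kg = {minimum_mass / M_SUN:.2e} M_sun")
```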

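5.9 (i) The Schwarzschild metric is given by (123). Firstly, we have $dt = 0$, $dr = 0$, $d\theta = 0$, and $\theta = \pi/2$, and so (123) simplifies to

$$ds^2 = r^2\sin^2\theta\,d\phi^2 = r^2 d\phi^2 . \tag{426}$$

Now, to find the circumference we can integrate this function over $0 \leq \phi \leq 2\pi$,

$$C = \int_0^{2\pi} R\,d\phi = 2\pi R . \tag{427}$$

(ii) Secondly, we have $dt = 0$, $d\theta = 0$, and $d\phi = 0$, and so (123) simplifies to

$$ds^2 = \left(1 - \frac{2GM}{rc^2}\right)^{-1} dr^2 \quad \rightarrow \quad ds = \left(1 - \frac{2GM}{rc^2}\right)^{-1/2} dr . \tag{428}$$

This can also be rewritten as

$$R_{\rm phys} = \int_0^R \left(1 - \frac{2GM}{rc^2}\right)^{-1/2} dr . \tag{429}$$

If we multiply both top and bottom by $rc^2$ and divide top and bottom by $2GM$ we get an expression of the form

$$R_{\rm phys} = \int_0^R \sqrt{\frac{rc^2/(2GM)}{rc^2/(2GM) - 1}}\ dr . \tag{430}$$

Now, let

$$\xi = \frac{rc^2}{2GM} \quad \Rightarrow \quad \frac{2GM}{c^2}\,d\xi = dr . \tag{431}$$

(430) can be made to look like (133) and (134) by multiplying the denominator by $-1$ and taking the absolute value of this function; namely

$$R_{\rm phys} = \frac{2GM}{c^2}\int_0^{\alpha}\sqrt{\left|\frac{\xi}{1 - \xi}\right|}\ d\xi = \frac{2GM}{c^2}\left[\int_0^{1}\sqrt{\frac{\xi}{1 - \xi}}\ d\xi + \int_1^{\alpha}\sqrt{\frac{\xi}{\xi - 1}}\ d\xi\right] . \tag{432}$$

It follows that

$$R_{\rm phys} = \frac{2GM}{c^2}\left[\frac{\pi}{2} + \ln\left(\sqrt{\alpha - 1} + \sqrt{\alpha}\right) + \sqrt{\alpha - 1}\,\sqrt{\alpha}\right] , \tag{433}$$

where $\alpha = Rc^2/(2GM)$; equivalently,

$$R_{\rm phys} = \frac{2GM}{c^2}\left[\frac{\pi}{2} + \ln\left(\sqrt{\frac{Rc^2}{2GM} - 1} + \sqrt{\frac{Rc^2}{2GM}}\right) + \sqrt{\frac{Rc^2}{2GM} - 1}\,\sqrt{\frac{Rc^2}{2GM}}\right] . \tag{434}$$

(iii) Finally, we use the answers of (i) and (ii) to compute $\Pi$ where

$$C = 2\Pi R_{\rm phys} . \tag{435}$$

Using (427) we find

$$2\pi R = 2\Pi R_{\rm phys} = 2\Pi\,\frac{2GM}{c^2}\left[\frac{\pi}{2} + \ln\left(\sqrt{\frac{Rc^2}{2GM} - 1} + \sqrt{\frac{Rc^2}{2GM}}\right) + \sqrt{\frac{Rc^2}{2GM} - 1}\,\sqrt{\frac{Rc^2}{2GM}}\right] , \tag{436}$$

and solving for $\Pi$ we have

$$\Pi = \pi\,\frac{Rc^2}{2GM}\left[\frac{\pi}{2} + \ln\left(\sqrt{\frac{Rc^2}{2GM} - 1} + \sqrt{\frac{Rc^2}{2GM}}\right) + \sqrt{\frac{Rc^2}{2GM} - 1}\,\sqrt{\frac{Rc^2}{2GM}}\right]^{-1} , \tag{437}$$

which can also be written as

$$\Pi = \pi\xi\left[\frac{\pi}{2} + \ln\left(\sqrt{\xi - 1} + \sqrt{\xi}\right) + \sqrt{\xi - 1}\,\sqrt{\xi}\right]^{-1} . \tag{438}$$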
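Equation (438) is easy to tabulate. The sketch below evaluates $\Pi(\xi)$ at a few radii outside the horizon ($\xi \geq 1$) and illustrates how slowly it approaches the flat-space value $\pi$; the function name `capital_pi` is a label chosen here.

```python
import math


def capital_pi(xi):
    """Ratio C / (2 R_phys) from eq. (438), valid for xi = R / R_Sch >= 1."""
    if xi < 1.0:
        raise ValueError("eq. (438) is written for xi >= 1")
    bracket = (math.pi / 2.0
               + math.log(math.sqrt(xi - 1.0) + math.sqrt(xi))
               + math.sqrt(xi - 1.0) * math.sqrt(xi))
    return math.pi * xi / bracket


sample_radii = (1.0, 10.0, 100.0, 1000.0)
values = [capital_pi(xi) for xi in sample_radii]
for xi, value in zip(sample_radii, values):
    print(f"xi = {xi:7.1f}   Pi = {value:.4f}")
```

At the horizon the bracket reduces to $\pi/2$, so $\Pi = 2$ there, and the values increase monotonically towards $\pi$ as $\xi$ grows.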

(iv) A plot of $\Pi$ versus $R/R_{\rm Sch}$ is shown in Fig. 42. We can see that $\Pi$ approaches the value of $\pi$ measured in a flat space, as $\xi \to \infty$.

FIG. 42: $\Pi$ is not a constant [307]. [Plot of $\Pi$ versus $\xi \equiv R/r_s$ for $1 \leq \xi \leq 1000$, with a logarithmic horizontal axis.]

5.10 (i) From Fig. 10 we estimate an initial position of about 32 ly at a time 2002.12 and 45 ly at 2002.73. The apparent velocity is therefore $v_{\rm app} \approx 13\ {\rm ly}/0.61\ {\rm yr} = 21\ {\rm ly/yr} = 21c$, which is in agreement with the value $v_{\rm app} = (25.6 \pm 4.4)c$ given in [80] reporting this measurement. The apparent velocity of the blob is thus highly superluminal. (ii) The light emitted at point $A$ at time $t_{i,1}$ will reach the observer located at a distance $d_1$ at time $t_1 = d_1/c$; see Fig. 43. The blob travels with “true” velocity $v$ from $A$ to $B$, a distance $H$, which takes a time

$$\Delta t_{A \to B} = t_{i,2} - t_{i,1} = \frac{1}{v}\,\frac{L}{\sin\theta} , \tag{439}$$

where the only hypothesis is that the signal travels at the speed of light $c$. The remaining distance to the observer is $d_2 = d_1 - L/\tan\theta$ and therefore the light from position $B$ will arrive at

$$t_2 = \frac{1}{v}\,\frac{L}{\sin\theta} + \frac{1}{c}\left(d_1 - \frac{L}{\tan\theta}\right) . \tag{440}$$

The time difference between the signals from $A$ and $B$ is

$$\Delta t = t_2 - t_1 = \frac{1}{v}\,\frac{L}{\sin\theta} - \frac{1}{c}\,\frac{L}{\tan\theta} = L\,\frac{c - v\cos\theta}{vc\sin\theta} = L\,\frac{1 - \beta\cos\theta}{v\sin\theta} , \tag{441}$$

where $\beta = v/c$, and the apparent transverse velocity is therefore

$$\beta_{\rm app} = \frac{1}{c}\,\frac{L}{\Delta t} = \frac{\beta\sin\theta}{1 - \beta\cos\theta} . \tag{442}$$

(iii) Using $x = \tan(\theta/2)$, (442) can be re-written with standard trigonometry as $\beta_{\rm app} = 2\beta x/[(1 - \beta) + (1 + \beta)x^2]$ so that after a bit of algebra it follows that

$$v_{\rm app} = kc \quad \Leftrightarrow \quad \left[(1 + \beta)x - \beta/k\right]^2 = \left(\beta/k - \gamma^{-1}\right)\left(\beta/k + \gamma^{-1}\right) , \tag{443}$$

where $\gamma = (1 - \beta^2)^{-1/2}$ is the Lorentz factor. The left hand term is positive and the equation in $x$ admits at least one solution if $\beta/k \geq \gamma^{-1}$. Superluminal motion $v_{\rm app} \geq c$ (i.e. $k \geq 1$) is then possible as long as $\gamma\beta \geq \beta_{\rm app}$. A direct consequence of the previous equation is $\gamma\beta \geq \beta_{\rm app}$, which proves that, even for moderate superluminal motions, the true velocity is relativistic. The angle that maximizes the apparent transverse velocity can be found by differentiating $v_{\rm app}$ and solving for $\theta_{\rm max}$; we have

$$\frac{dv_{\rm app}}{d\theta} = \frac{\beta\cos\theta}{1 - \beta\cos\theta} - \frac{\beta^2\sin^2\theta}{(1 - \beta\cos\theta)^2} = 0 \quad \Rightarrow \quad \cos\theta_{\rm max} = \beta . \tag{444}$$

The maximum apparent transverse velocity is therefore

$$\beta_{\rm app}^{\rm max} = \frac{\beta\sin\theta_{\rm max}}{1 - \beta\cos\theta_{\rm max}} = \frac{\beta\sqrt{1 - \cos^2\theta_{\rm max}}}{1 - \beta\cos\theta_{\rm max}} = \frac{\beta\sqrt{1 - \beta^2}}{1 - \beta^2} = \beta\gamma . \tag{445}$$

Therefore,

$$\beta_{\rm app}^{\rm max} = \beta\gamma = \gamma\sqrt{1 - 1/\gamma^2} \quad \Rightarrow \quad \gamma = \sqrt{1 + \beta_{\rm app}^2} , \tag{446}$$

that is to say the plasma blob moves with a highly relativistic Lorentz factor of at least $\sim 21$.

FIG. 43: The situation in exercise 5.10. [Schematic view of the motion of an emitting region along a direction at angle $\theta$ from the line of sight. The signal emitted at $t_{i,1}$ (resp. $t_{i,2}$) travels for $d_1/c$ (resp. $d_2/c$) before arriving at $t_1$ (resp. $t_2$). A displacement $L$ is observed between $t_1$ and $t_2$, for an effectively travelled distance $H$.]

6.1 (i) Imagine a circle with radius $x$ around the observer. A fraction $s(x)$, $0 \leq s(x) \leq 1$, is covered by trees. Then we’ll move a distance $dx$ outward, and draw another circle. There are $2\pi n x\,dx$ trees growing in the annulus limited by these two circles. They hide a distance $2\pi x n D\,dx$, or a fraction $nD\,dx$ of the perimeter of the circle. Since a fraction $s(x)$ was already hidden, the contribution is only $[1 - s(x)]\,nD\,dx$. We get

$$s(x + dx) = s(x) + [1 - s(x)]\,n\,D\,dx , \tag{447}$$

which gives a differential equation for $s$:

$$\frac{ds(x)}{dx} = [1 - s(x)]\,n\,D . \tag{448}$$

This is a separable equation which can be integrated:

$$\int_0^{s}\frac{ds}{1 - s} = \int_0^{x} n\,D\,dx . \tag{449}$$

TABLE VI: Redshifts of the four galaxies in exercise 6.2.

    Galaxy   λ first line (Å)   λ second line (Å)   z first line   z second line   z average
    1        4100               4135                0.042          0.041           0.042
    2        4145               4185                0.053          0.054           0.053
    3        4215               4255                0.071          0.072           0.071
    4        4318               4360                0.097          0.098           0.098

This yields the solution

$$s(x) = 1 - e^{-nDx} . \tag{450}$$

This is the probability that in a random direction we can see at most to a distance $x$. This function is a cumulative probability distribution. It is as if we have compressed the 2-dimensional forest into an imaginary 1-dimensional structure, with a characteristic mean free path. The corresponding probability density is its derivative $ds/dx$. The mean free path $\lambda$ is the expectation of this distribution

$$\lambda = \int_0^{\infty} x\,\frac{ds(x)}{dx}\,dx = \frac{1}{nD} . \tag{451}$$

For example, if there are 2000 trees per hectare, and each trunk is 10 cm thick, we can see to a distance of 50 m, on average. (ii) The result can be easily generalized into 3 dimensions. Assume there are $n$ stars per unit volume, and each has a diameter $D$ and a surface $A = \pi D^2$ perpendicular to the line of sight. Then we have

$$s(x) = 1 - e^{-nAx} , \tag{452}$$

where $\lambda = (nA)^{-1}$. For example, if there were one sun per cubic parsec, the mean free path would be $1.6 \times 10^{14}$ pc. If the universe were infinitely old and infinite in size, the line of sight would eventually meet a stellar surface in any direction, although we could see very far indeed.
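The forest example of part (i) can be checked directly: the closed form (451) gives the mean free path, and a numerical integration of $x\,ds/dx$ with $s(x)$ from (450) should give the same number. The sketch uses the tree density and trunk diameter quoted in the text; the integration cutoff and step are arbitrary choices made here.

```python
import math

TREES_PER_HECTARE = 2000.0
TRUNK_DIAMETER = 0.10                        # m

n = TREES_PER_HECTARE / 1.0e4                # trees per square meter
mean_free_path = 1.0 / (n * TRUNK_DIAMETER)  # eq. (451)

# Midpoint-rule evaluation of the expectation value in (451), truncated at
# 40 mean free paths where the integrand is negligible.
step = 0.01                                  # m
num_steps = int(40.0 * mean_free_path / step)
numerical_mean = 0.0
for k in range(num_steps):
    x = (k + 0.5) * step
    density = n * TRUNK_DIAMETER * math.exp(-n * TRUNK_DIAMETER * x)  # ds/dx
    numerical_mean += x * density * step

print(f"closed form: {mean_free_path:.1f} m, numerical: {numerical_mean:.3f} m")
```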

6.2 (i) The relation between luminosity, distance, and brightness is given by the inverse-square law, namely

$$\text{brightness} = \frac{\text{luminosity}}{4\pi\ \text{distance}^2} . \tag{453}$$

Here we are given the luminosity of each galaxy (the four are the same, namely $4 \times 10^{37}$ J/s), and the brightness, in units of Joules/meters$^2$/second. Solving for the distance gives

$$\text{distance} = \left(\frac{\text{luminosity}}{4\pi\ \text{brightness}}\right)^{1/2} , \tag{454}$$

and so we find: galaxy #1, distance $= 6.5 \times 10^{24}$ m = 210 Mpc; galaxy #2, distance $= 8.4 \times 10^{24}$ m = 270 Mpc; galaxy #3, distance $= 1.1 \times 10^{25}$ m = 360 Mpc; galaxy #4, distance $= 1.5 \times 10^{25}$ m = 490 Mpc. (ii) The redshift is given by $z = (\lambda - \lambda_0)/\lambda_0$, where $\lambda_0 = 3935$ Å and 3970 Å for the two calcium lines. The measured wavelengths for each of the two lines in each of the galaxies, the corresponding redshift from each of the lines, and the average redshift are given in Table VI. (iii) The redshift is equal to the velocity of recession divided by the speed of light. So we can calculate the velocity of recession as the redshift times the speed of light. The Hubble constant is given in Table VII. The four galaxies give consistent values of the Hubble constant, at about 60 km/s/Mpc. Not identical to the modern value of 70 km/s/Mpc, but close. That seemed quite straightforward; so why is there such controversy over the exact value of the Hubble constant? The difficult point is getting an independent measurement of the luminosity of each galaxy. The problem stated that each of the galaxies has the same luminosity as the Milky Way. That is only approximately true; the numbers were adjusted somewhat to make this come out with a reasonable value for $H_0$.
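The chain from wavelengths to the Hubble constant in parts (ii) and (iii) can be sketched as follows. The wavelengths are those of Table VI and the distances are the rounded values from part (i); unlike the tables, the sketch keeps the redshifts unrounded, so the resulting values differ slightly in the last digit. The dictionary layout and variable names are illustrative.

```python
C_KM_S = 299_792.458                  # speed of light in km/s
REST_WAVELENGTHS = (3935.0, 3970.0)   # calcium lines, in Angstrom

# galaxy label: ((observed wavelengths in Angstrom), distance in Mpc)
GALAXIES = {
    1: ((4100.0, 4135.0), 210.0),
    2: ((4145.0, 4185.0), 270.0),
    3: ((4215.0, 4255.0), 360.0),
    4: ((4318.0, 4360.0), 490.0),
}

hubble = {}
for label, (observed, distance_mpc) in GALAXIES.items():
    redshifts = [(obs - rest) / rest
                 for obs, rest in zip(observed, REST_WAVELENGTHS)]
    z_mean = sum(redshifts) / len(redshifts)
    velocity = z_mean * C_KM_S        # km/s, valid for small redshift
    hubble[label] = velocity / distance_mpc
    print(f"galaxy {label}: z = {z_mean:.3f}, v = {velocity:.0f} km/s,"
          f" H0 = {hubble[label]:.1f} km/s/Mpc")
```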

TABLE VII: Determination of the Hubble constant.

    Galaxy   Redshift   Velocity (km/s)   Distance (Mpc)   H0 (km/s/Mpc)
    1        0.042      12600             210              60
    2        0.053      15900             270              59
    3        0.071      21300             360              59
    4        0.098      29400             490              59

6.3 (i) In the “local Universe” approximation, where we pretend that cosmological redshifts are Doppler shifts and it is a good approximation to pretend that galaxies which are getting more distant from us due to the expansion of space are flying away from us at a given velocity, we can use Hubble’s law. Then for the closer galaxy, we get $H_0 = 580\ {\rm km/s}/35\ {\rm Mly} = 17\ {\rm km\,s^{-1}\,Mly^{-1}}$. For the farther galaxy, we get $H_0 = 25,400\ {\rm km/s}/1,100\ {\rm Mly} = 23\ {\rm km\,s^{-1}\,Mly^{-1}}$. (ii) The calculation from the more distant galaxy. Reason: peculiar velocities are always a few hundred km/s. They are random, so they could be anywhere from minus a few hundred to plus a few hundred. Potentially, this could be a large fraction of the 580 km/s of the closer galaxy. However, it will be a small fraction of the 25,400 km/s of the more distant galaxy. It is noteworthy that closer galaxies tend to be brighter and therefore the measurements are less likely to suffer from observational errors. While this is true, the exercise presents numbers to (at least approximately) equivalent significant figures in both cases. It may well have taken a lot more telescope time and effort to get the numbers on the more distant galaxy, but you have them. The peculiar velocity issue, however, is an intrinsic effect that perfect observations cannot get around. We will always have to deal with galaxies moving about in the universe even if we have amazing data. (iii) We use the value of $H_0$ derived from the distant galaxy to calculate the recession velocity for the closer galaxy, $v = H_0 d = 805$ km/s, which implies the peculiar velocity of the nearby galaxy is $v_{\rm pec} = -220$ km/s, that is 220 km/s toward us. (iv) Assuming that $-220$ km/s of the 25,400 km/s we observed for the more distant galaxy were due to peculiar velocity, from (144) we have $H_0 = 25,620\ {\rm km/s}/1,100\ {\rm Mly} = 23.3$ km/s/Mly. Note that 1,100 Mly only has two significant figures and so the difference is smaller than the precision of our measurement. This is a specific illustration of why, given that all galaxies will tend to have peculiar velocities of a few hundred km/s, more distant galaxies give you a more reliable estimate of the Hubble constant.

6.4 The Hubble flow $v = H_0 r$ induces the flux $vn$ through the surface $4\pi r^2$ of a sphere with radius $r$, and thus particles cross this surface at the rate $4\pi r^2 v n$. These particles escape from the sphere containing the total number of particles $N = Vn$. Hence $\dot N = -4\pi r^2 v n = 4\pi r^3\dot n/3$, or $\dot n = -3vn/r = -3H_0 n$.

6.5 At the final time of the invasion, $t$, the invaders are at proper distance $d_p$, and therefore comoving distance $r = d_p/a(t)$; see (191). (i) For a flat space, the proper volume is obviously the usual one in Euclidean geometry,

$$V = \frac{4\pi}{3}\,d_p^3 . \tag{455}$$

(ii) In a closed universe, and if $R$ is the comoving radius of curvature, the proper area of a sphere at comoving coordinate $r$ is $4\pi a^2(t) R^2\sin^2(r/R)$, and the proper distance between two spheres at $r$ and $r + dr$ is just $a(t)\,dr$, as obtained from the FRW metric. Therefore, the proper volume of each spherical shell between $r$ and $r + dr$ is $4\pi a^3(t) R^2\sin^2(r/R)\,dr$, and the proper volume of the sphere is

$$V = \int_0^r 4\pi a^3(t) R^2\sin^2(r/R)\,dr = 4\pi a^3(t) R^3\int_0^{r/R}\sin^2(r/R)\,d(r/R) = 4\pi a^3 R^3\left[\frac{d_p}{2aR} - \frac{\sin(2d_p/(aR))}{4}\right] . \tag{456}$$

(iii) In an open universe, the calculation is just like for the closed universe but with the substitution of $\sin(r/R)$ by $\sinh(r/R)$,

$$V = 4\pi a^3(t) R^3\int_0^{r/R}\sinh^2(r/R)\,d(r/R) = 4\pi a^3 R^3\left[\frac{\sinh(2d_p/(aR))}{4} - \frac{d_p}{2aR}\right] . \tag{457}$$
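A quick way to build confidence in (456) and (457) is to check their limits. In the sketch below the argument `curvature_radius` stands for the product $a(t)R$ (a name introduced here): for $d_p \ll aR$ both curved-space volumes should reduce to the Euclidean result (455), with the closed case slightly smaller and the open case slightly larger, and for $d_p = \pi aR$ the closed volume should equal the full volume $2\pi^2 (aR)^3$ of a 3-sphere.

```python
import math


def volume_flat(d_p):
    """Euclidean volume, eq. (455)."""
    return 4.0 * math.pi / 3.0 * d_p ** 3


def volume_closed(d_p, curvature_radius):
    """Eq. (456); curvature_radius is the product a(t) R."""
    x = d_p / curvature_radius
    return 4.0 * math.pi * curvature_radius ** 3 * (x / 2.0 - math.sin(2.0 * x) / 4.0)


def volume_open(d_p, curvature_radius):
    """Eq. (457); curvature_radius is the product a(t) R."""
    x = d_p / curvature_radius
    return 4.0 * math.pi * curvature_radius ** 3 * (math.sinh(2.0 * x) / 4.0 - x / 2.0)


A_R = 100.0     # arbitrary curvature scale
D_SMALL = 1.0   # proper distance much smaller than A_R
ratio_closed = volume_closed(D_SMALL, A_R) / volume_flat(D_SMALL)
ratio_open = volume_open(D_SMALL, A_R) / volume_flat(D_SMALL)
full_closed = volume_closed(math.pi * A_R, A_R)
print(ratio_closed, ratio_open, full_closed / (2.0 * math.pi ** 2 * A_R ** 3))
```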

6.6 For the case when the universe contains only matter with negligible pressure, the energy density changes as $\rho_m(t) = \rho_{m,0}/a^3(t)$. Multiplying (158) by $a^2(t)$, we have

\begin{equation} \dot a^2 = \frac{8\pi G \rho_{m,0}}{3c^2 a} - \frac{c^2}{R_0^2} \, . \tag{458} \end{equation}

Now, using $\dot a = da/dt = (da/d\theta)(d\theta/dt)$, we find that the left-hand side of (458) is

\begin{equation} \dot a^2 = \frac{c^2}{R_0^2} \frac{\sin^2\theta}{(1-\cos\theta)^2} = \frac{c^2}{R_0^2} \frac{1+\cos\theta}{1-\cos\theta} \, , \tag{459} \end{equation}

where the last equality follows from $\sin^2\theta = 1 - \cos^2\theta = (1-\cos\theta)(1+\cos\theta)$, and the right-hand side of (458) is

\begin{equation} \frac{8\pi G \rho_{m,0}}{3c^2 a} - \frac{c^2}{R_0^2} = \frac{c^2}{R_0^2} \left( \frac{2}{1-\cos\theta} - 1 \right) = \frac{c^2}{R_0^2} \frac{1+\cos\theta}{1-\cos\theta} \, . \tag{460} \end{equation}

So the two sides of (458) are indeed equal, confirming that this parametric solution given as $a(\theta)$ and $t(\theta)$ is indeed a solution of Friedmann's equation. (ii) The maximum value of $a$ occurs at $\theta = \pi$, and is

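\begin{equation} a_{\rm max} = \frac{8\pi G \rho_{m,0} R_0^2}{3c^4} \, . \tag{461} \end{equation}

(iii) Correspondingly, the maximum value of the proper radius of curvature is

\begin{equation} a_{\rm max} R_0 = \frac{8\pi G \rho_{m,0} R_0^3}{3c^4} \, . \tag{462} \end{equation}

(iv) The age of the universe at $\theta = \pi$ is

\begin{equation} t_{\rm max} = \frac{4\pi^2 G \rho_{m,0} R_0^3}{3c^5} \, . \tag{463} \end{equation}

(v) The big crunch happens when $\theta = 2\pi$, that is at twice the time of maximum expansion, and we then have

\begin{equation} t_{\rm crunch} = \frac{8\pi^2 G \rho_{m,0} R_0^3}{3c^5} \, . \tag{464} \end{equation}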
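The check carried out analytically in (459) and (460) can also be done numerically. The sketch below works in units with $c = R_0 = 1$ and assumes the cycloid parametrization $a = A(1-\cos\theta)$, $t = A(\theta - \sin\theta)$, which is consistent with (461) and (463); the value of `A` is an arbitrary illustrative choice standing for $4\pi G\rho_{m,0}R_0^2/(3c^4)$.

```python
import math

A = 0.7  # hypothetical value of 4*pi*G*rho_m0*R0**2/(3*c**4), with c = R0 = 1

def a_of(theta):
    """Scale factor on the cycloid branch (closed, matter-only model)."""
    return A * (1.0 - math.cos(theta))

def t_of(theta):
    """Cosmic time on the cycloid branch, in units of R0/c."""
    return A * (theta - math.sin(theta))

def adot_squared(theta, h=1e-6):
    """(da/dt)^2 from central differences in the parameter theta."""
    da = a_of(theta + h) - a_of(theta - h)
    dt = t_of(theta + h) - t_of(theta - h)
    return (da / dt) ** 2

def friedmann_rhs(theta):
    """Right-hand side of (458) in these units: 2A/a - 1."""
    return 2.0 * A / a_of(theta) - 1.0

for theta in (0.5, 1.0, 2.0, 3.0, 4.0, 5.0):
    print(theta, adot_squared(theta), friedmann_rhs(theta))
```

In these units the maximum scale factor is `a_of(math.pi)`, equal to `2 * A`, and `t_of(2 * math.pi)` is twice `t_of(math.pi)`, in line with (461), (463) and (464).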

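6.7 Multiplying (158) by $a^2(t)$, we have

\begin{equation} \dot a^2 = \frac{8\pi G \rho_{m,0}}{3c^2 a} + \frac{c^2}{R_0^2} \, . \tag{465} \end{equation}

Now, using the relations

\begin{equation} \sin(ix) = \frac{e^{i(ix)} - e^{-i(ix)}}{2i} = \frac{e^{-x} - e^{x}}{2i} = -\frac{e^{x} - e^{-x}}{2i} = -\frac{1}{i}\sinh x = i \sinh x \, , \tag{466} \end{equation}

\begin{equation} \cos(ix) = \frac{e^{i(ix)} + e^{-i(ix)}}{2} = \frac{e^{-x} + e^{x}}{2} = \cosh x \, , \tag{467} \end{equation}

\begin{equation} \cosh^2 x - \sinh^2 x = \cos^2(ix) - [\sin(ix)/i]^2 = \cos^2(ix) + \sin^2(ix) = 1 \, , \tag{468} \end{equation}

we can rewrite (459) as

\begin{equation} \dot a^2 = \frac{c^2}{R_0^2} \frac{\sinh^2\theta}{(\cosh\theta - 1)^2} = \frac{c^2}{R_0^2} \frac{\cosh\theta + 1}{\cosh\theta - 1} \, , \tag{469} \end{equation}

and the right-hand side of (465) is

\begin{equation} \frac{8\pi G \rho_{m,0}}{3c^2 a} + \frac{c^2}{R_0^2} = \frac{c^2}{R_0^2} \left( \frac{2}{\cosh\theta - 1} + 1 \right) = \frac{c^2}{R_0^2} \frac{\cosh\theta + 1}{\cosh\theta - 1} \, . \tag{470} \end{equation}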

[Figure 44: plot of $a(t)/\mathcal{A}$ versus $ct/(2\pi\mathcal{A})$.]

FIG. 44: The time dependence of the scale factor for open, closed and critical matter-dominated cosmological models. The upper line corresponds to $k = -1$, the middle line to the flat $k = 0$ model, and the lowest line to the recollapsing closed $k = +1$ universe. The log scale is designed to bring out the early-time behaviour, although it obscures the fact that the closed model is a symmetric cycloid on a linear plot of $a$ against $t$. We have set $\mathcal{A} = 4\pi G\rho_{m,0} R_0^3/(3c^2)$ [308].

The two sides of (465) are indeed equal, confirming that this parametric solution given as $a(\theta)$ and $t(\theta)$ is indeed a solution of Friedmann's equation. (ii) The comparison of the scale factors in (185), (193), and (194), corresponding to solutions with $k = 0$, $k = 1$, and $k = -1$, respectively, is exhibited in Fig. 44.

6.8 (i) For the model with $\Omega_{m,0} = 1$, the comoving distance is

\begin{equation} r = c \int_0^z \frac{dz}{H(z)} = \frac{c}{H_0} \int_0^z \frac{dz}{(1+z)^{3/2}} = \frac{2c}{H_0} \left( 1 - \frac{1}{\sqrt{1+z}} \right) . \tag{471} \end{equation}

The comoving distance to the horizon, at $a = 0$ or $z = \infty$, is $r = 2c/H_0$. (ii) For this model, half the comoving distance to the horizon is $r = c/H_0$, and the redshift at which the comoving distance has this value is obtained as:

\begin{equation} 1 - \frac{1}{\sqrt{1+z}} = \frac{1}{2} \;\Rightarrow\; z = 3 \, . \tag{472} \end{equation}

(iii) For this same model, and at $z = 3$, the age of the universe is obtained from

\begin{equation} t(z) = \int_z^\infty \frac{dz}{(1+z) H(z)} = \frac{2}{3 H_0} \frac{1}{(1+z)^{3/2}} \, . \tag{473} \end{equation}

The present age of the universe is of course $t_0 = 2/(3H_0)$, and so the ratio of the age at $z = 3$ to its present age is just

\begin{equation} \frac{t(z=3)}{t_0} = \frac{1}{(1+z)^{3/2}} = \frac{1}{8} \, . \tag{474} \end{equation}

(iv) From the same equation as above, we find

\begin{equation} \frac{t(z)}{t_0} = \frac{1}{(1+z)^{3/2}} = \frac{1}{2} \;\Rightarrow\; z = 2^{2/3} - 1 = 0.5874 \, . \tag{475} \end{equation}

Note that all these equations are of course valid only for the specific model that is flat and contains only matter, with Ωm,0 = 1.
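The closed forms (471) to (475) are easy to evaluate directly. The following sketch works in units of $c/H_0$ for distances and $1/H_0$ for times, and adds a midpoint-rule integration (an illustrative cross-check, not part of the original solution) to confirm (471).

```python
def comoving_distance(z):
    """Closed form (471) for Omega_m0 = 1, in units of c/H0."""
    return 2.0 * (1.0 - 1.0 / (1.0 + z) ** 0.5)

def comoving_distance_numeric(z, n=10_000):
    """Midpoint-rule evaluation of the integral in (471), in units of c/H0."""
    dz = z / n
    return sum(dz / (1.0 + (i + 0.5) * dz) ** 1.5 for i in range(n))

def age(z):
    """Closed form (473) for the age at redshift z, in units of 1/H0."""
    return 2.0 / 3.0 / (1.0 + z) ** 1.5

z_half_age = 2.0 ** (2.0 / 3.0) - 1.0   # redshift of half the present age, (475)

print(comoving_distance(3.0))            # half the horizon distance 2c/H0
print(age(3.0) / age(0.0))               # ratio in (474)
print(z_half_age, age(z_half_age) / age(0.0))
```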

7.1 The flux of one of these objects is $\mathcal{F} = L/(4\pi d_L^2)$, and its angular size is $\theta = \ell/d_A$. Hence, the apparent surface brightness is $I \propto \mathcal{F}/\theta^2$, or

\begin{equation} I = {\rm constant} \times \frac{\mathcal{F}}{\theta^2} = {\rm constant} \times \frac{d_A^2}{d_L^2} = {\rm constant} \times (1+z)^{-4} \, , \tag{476} \end{equation}

where in the last equality we have used (226). Note that $L$, $\ell$, and $4\pi$ are constants and so can be absorbed in the constant of proportionality. Therefore, the apparent surface brightness $I$ will always decrease with redshift as $(1+z)^{-4}$ compared to the intrinsic surface brightness $B$, without any dependence on the cosmological model.

7.2 (i) The number density of photons per unit frequency is equal to the energy density per unit frequency divided by $h\nu$, or

\begin{equation} n_\nu \, d\nu = \frac{8\pi \nu^2 \, d\nu}{c^3 \{ \exp[h\nu/(kT)] - 1 \}} \, . \tag{477} \end{equation}

The total number density is found by integrating over frequency, which gives

\begin{equation} n = \frac{8\pi}{c^3} \int \frac{\nu^2 \, d\nu}{\exp[h\nu/(kT)] - 1} \, . \tag{478} \end{equation}

Substituting $x = h\nu/(kT)$, we find

\begin{equation} n = \frac{8\pi}{c^3} \left( \frac{kT}{h} \right)^3 \int \frac{x^2 \, dx}{e^x - 1} \, . \tag{479} \end{equation}

For $T_0 = 2.725$ K, we find $n = 410.4~{\rm cm^{-3}}$. (ii) The current density of baryons must then be

\begin{equation} n_b = 5.5 \times 10^{-10} \times 410.4~{\rm cm^{-3}} = 2.25 \times 10^{-7}~{\rm cm^{-3}} \, . \tag{480} \end{equation}

(iii) Every baryon weighs approximately as much as a proton (this is not exact because, for example, a helium nucleus weighs a little less than 4 protons because of its binding energy, but the difference is rather small). So the density of baryons is $n_b m_p = 3.78 \times 10^{-31}~{\rm g/cm^3}$. The critical density is $3H_0^2/(8\pi G) = 9.2 \times 10^{-30}~{\rm g/cm^3}$, so $\Omega_b = 0.041$.
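The numbers in (479) and (480) can be reproduced as follows. The sketch uses the standard result that the dimensionless integral in (479) equals $2\zeta(3)$; the baryon-to-photon ratio and the critical density are the values quoted in the solution, and the variable names are illustrative.

```python
import math

h = 6.62607015e-34     # Planck constant, J s
k = 1.380649e-23       # Boltzmann constant, J/K
c = 2.99792458e8       # speed of light, m/s
zeta3 = 1.2020569      # Riemann zeta(3); the integral in (479) equals 2*zeta(3)
m_p_g = 1.6726e-24     # proton mass, g

T0 = 2.725             # CMB temperature, K
eta = 5.5e-10          # baryon-to-photon ratio used in (480)
rho_crit = 9.2e-30     # critical density quoted in the text, g/cm^3

n_gamma_cm3 = 8.0 * math.pi * (k * T0 / (h * c)) ** 3 * 2.0 * zeta3 * 1e-6
n_b_cm3 = eta * n_gamma_cm3
rho_b = n_b_cm3 * m_p_g          # g/cm^3
Omega_b = rho_b / rho_crit

print(f"n_gamma = {n_gamma_cm3:.1f} cm^-3")
print(f"n_b = {n_b_cm3:.3e} cm^-3, Omega_b = {Omega_b:.3f}")
```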

7.3 To analyze the measurement of the motion of our own galaxy through the CMB, it is useful to consider the density $N_\gamma(\vec p)$ of photons in phase space, defined by specifying that there are $N_\gamma(\vec p)\, d^3p$ photons of each polarization (right or left circularly polarized) per unit spatial volume in a momentum-space volume $d^3p$ centered at $\vec p$. Since $|\vec p| = h\nu/c$ and $4\pi h^3 \nu^2 d\nu/c^3$ is the momentum-space volume between frequencies $\nu$ and $\nu + d\nu$, (477) gives

\begin{equation} N_\gamma(\vec p) = \frac{1}{2} \frac{n_T(c|\vec p|/h)}{4\pi h^3 \nu^2/c^3} = \frac{1}{h^3} \frac{1}{\exp[|\vec p| c/(kT)] - 1} \, , \tag{481} \end{equation}

where $n_T$ is the number density of photons in equilibrium with matter at temperature $T$ at photon frequency between $\nu$ and $\nu + d\nu$, and the factor 1/2 takes account of the fact that $n_T$ includes both possible polarization states. This is of course the density that would be measured by an observer at rest in the radiation background. The phase space volume is Lorentz invariant, and the number of photons is also Lorentz invariant, so $N_\gamma$ is a scalar, in the sense that a Lorentz transformation to a coordinate system moving with respect to the radiation background that takes $\vec p$ to $\vec p\,'$ also takes $N_\gamma$ to $N'_\gamma$, where

\begin{equation} N'_\gamma(\vec p\,') = N_\gamma(\vec p) \, . \tag{482} \end{equation}

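If the Earth is moving in the $x$-direction with a velocity (in units of $c$) of $\beta$, and we take $\vec p$ to be the photon momentum in the frame at rest in the CMB and $\vec p\,'$ to be the photon momentum measured on Earth, then from (42) it follows that

\begin{equation} |\vec p\,'| = \gamma (1 - \beta\cos\theta)\, |\vec p| \, , \tag{483} \end{equation}

where $\theta$ is the angle between $\vec p$ and the $x$-axis. Thus

\begin{equation} N'_\gamma(\vec p\,') = \frac{1}{h^3} \frac{1}{\exp[|\vec p\,'| c/(kT')] - 1} \, , \tag{484} \end{equation}

where the temperature is a function of the angle between the direction of the photon and the Earth's velocity

\begin{equation} T' = T_0 \gamma (1 - \beta\cos\theta) \, . \tag{485} \end{equation}

In (485) the angle is that of the photon momentum in the CMB frame. Rewritten in terms of the angle between the direction of observation and the Earth's velocity, measured in the Earth frame and again denoted by $\theta$, the factor $\gamma(1-\beta\cos\theta)$ becomes $[\gamma(1-\beta\cos\theta)]^{-1}$. This means that the temperature $T(\theta)$ observed in the direction $\theta$ is given in terms of the average temperature $T_0$ by

\begin{equation} \begin{aligned} T(\theta) &= T_0 \frac{\sqrt{1-\beta^2}}{1-\beta\cos\theta} = T_0 \left(1-\beta^2\right)^{1/2} \left(1-\beta\cos\theta\right)^{-1} \\ &\approx T_0 \left(1 - \beta^2/2 + \cdots\right) \left(1 + \beta\cos\theta + \beta^2\cos^2\theta + \cdots\right) \\ &\approx T_0 \left[1 + \beta\cos\theta + \beta^2\left(\cos^2\theta - 1/2\right) + \cdots\right] . \end{aligned} \tag{486} \end{equation}

Using the trigonometric relation $\cos^2 x = (1 + \cos 2x)/2$ we obtain (240). The motion of the observer (us) gives rise to both a dipole and other, higher order corrections. The observed dipole anisotropy implies that [109]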

\begin{equation} \vec v_\odot - \vec v_{\rm CMB} = 370 \pm 10~{\rm km/s} \ \ {\rm towards} \ \ \phi = 267.7 \pm 0.8^\circ , \ \theta = 48.2 \pm 0.5^\circ \, , \tag{487} \end{equation}

where $\theta$ is the colatitude (polar angle), in the range $0 \leq \theta \leq \pi$, and $\phi$ is the longitude (azimuth), in the range $0 \leq \phi \leq 2\pi$. Allowing for the Sun's motion in the Galaxy and the motion of the Galaxy within the Local Group, this implies that the Local Group is moving with

\begin{equation} \vec v_{\rm LG} - \vec v_{\rm CMB} \approx 600~{\rm km/s} \ \ {\rm towards} \ \ \phi = 268^\circ , \ \theta = 27^\circ \, . \tag{488} \end{equation}

This "peculiar" motion is subtracted from the measured CMB radiation, after which the intrinsic anisotropy is isolated (Fig. 23) and revealed to be about a few parts in $10^5$. Even though minuscule, these primordial perturbations provided the seeds for the structure of the Universe.
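To get a feeling for the size of the terms in (486), the sketch below compares the exact expression with the second-order expansion for the speed quoted in (487), and evaluates the dipole amplitude $T_0\beta$. The value $T_0 = 2.725$ K is the one used in 7.2; the function names are illustrative.

```python
import math

c_km_s = 299_792.458   # speed of light, km/s
T0 = 2.725             # mean CMB temperature, K
v = 370.0              # speed relative to the CMB frame from (487), km/s
beta = v / c_km_s

def T_exact(theta):
    """Exact angular dependence, first equality of (486)."""
    return T0 * math.sqrt(1.0 - beta ** 2) / (1.0 - beta * math.cos(theta))

def T_series(theta):
    """Expansion of (486) to second order in beta."""
    c1 = math.cos(theta)
    return T0 * (1.0 + beta * c1 + beta ** 2 * (c1 ** 2 - 0.5))

dipole_mK = 1e3 * T0 * beta   # amplitude of the cos(theta) term, in mK

print(f"beta = {beta:.3e}, dipole amplitude = {dipole_mK:.2f} mK")
print(max(abs(T_exact(t) - T_series(t)) for t in (0.0, 1.0, 2.0, math.pi)))
```

The dipole comes out at a few mK, while the quadratic term is smaller by a further factor of order $\beta \sim 10^{-3}$.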

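7.4 For a discrete set of directions in the sky, the normalized intensity function is $I(\Omega) = \frac{1}{N} \sum_{i=1}^{N} \delta(\vec u_i, \Omega)$. The spherical harmonic coefficients can be written as

\begin{equation} \bar a_l^m = \frac{1}{N} \sum_{i=1}^{N} Y_l^{m*}(\vec u_i) \, , \tag{489} \end{equation}

where $\vec u_i$ is the unit vector to the $i$th direction, $1 \leq i \leq N$. Next, we construct an estimate of the corresponding power spectrum by squaring the $\bar a_l^m$'s followed by a sum over $m$:

\begin{equation} \bar C_l \equiv \frac{1}{2l+1} \sum_{|m| \leq l} |\bar a_l^m|^2 = \frac{1}{N^2 (2l+1)} \sum_{|m| \leq l} \left| \sum_{i=1}^{N} Y_l^{m*}(\vec u_i) \right|^2 . \tag{490} \end{equation}

Because all the sums are finite they can be rearranged and expanded to

\begin{equation} \bar C_l = \frac{1}{N^2 (2l+1)} \sum_{i=1}^{N} \sum_{|m| \leq l} |Y_l^m(\vec u_i)|^2 + \frac{2}{N^2 (2l+1)} \sum_{i < j} \sum_{|m| \leq l} Y_l^{m*}(\vec u_i)\, Y_l^m(\vec u_j) \, . \tag{491} \end{equation}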
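For reference, the addition theorem for spherical harmonics, a standard identity that is assumed here to be the relation labeled (492), reads as follows for two unit vectors $\vec x$ and $\vec y$:

```latex
\begin{equation}
P_l(\vec x \cdot \vec y) = \frac{4\pi}{2l+1} \sum_{|m| \leq l} Y_l^{m*}(\vec x)\, Y_l^{m}(\vec y) \, .
\end{equation}
```

Here $P_l$ is the Legendre polynomial of degree $l$. Since the left-hand side is real, the sum over $m$ of the cross terms in the expansion of $\bar C_l$ is real as well.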

Now, since $P_l(1) = 1$ we set the unit direction vectors $\vec x$ and $\vec y$ to be equal in (492) to obtain

\begin{equation} \frac{2l+1}{4\pi} = \sum_{|m| \leq l} |Y_l^m(\vec x)|^2 \, . \tag{493} \end{equation}

Combining (491), (492), and (493) leads to

\begin{equation} \bar C_l = \frac{1}{4\pi N} + \frac{1}{2\pi N^2} \sum_{i < j} P_l(\vec u_i \cdot \vec u_j) \, . \tag{494} \end{equation}

Experimentally, only $\bar a_l^m$ and $\bar C_l$ can be measured, but these are estimates of their continuous counterparts $a_l^m$ and $C_l$, respectively. Therefore, since inner products are invariant under rotations, it follows that the $C_l$ are also invariant under rotations [309].

7.5 (i) The circumference of a circle of radius $a$ is $2\pi a$, so the orbital speed is the circumference divided by the period $T$:

\begin{equation} v = \frac{2\pi a}{T} = \frac{2\pi a}{\sqrt{4\pi^2 a^3/(GM)}} = \sqrt{\frac{GM}{a}} \, . \tag{495} \end{equation}

(ii) According to Birkhoff's theorem, the orbit about a mass distributed within a sphere is the same as if the mass were all concentrated at the center of the sphere. So, we can use the velocity formula derived in (i), and invert it to obtain the mass enclosed by an orbit:

\begin{equation} M(a) = \frac{v^2 a}{G} \, . \tag{496} \end{equation}

The enclosed mass at 8 kpc is then $M(8~{\rm kpc}) \approx 2 \times 10^{41}~{\rm kg} = 10^{11} M_\odot$. (iii) If the mass enclosed by the orbit stays at $10^{11} M_\odot$ as the radius increases, which follows from the fact that the Sun is at the edge of the luminous galaxy, then at different radii the velocity given by (496) will decrease with the square root of the distance. At 30 kpc, it is $v \approx 110$ km/s. At 100 kpc, we have $v \approx 60$ km/s. (iv) Let us look again at (496), which says that if the orbital velocity stays the same, the mass enclosed will increase linearly with $a$ as the radius of the orbit grows. We already calculated the mass enclosed by the 8 kpc orbit in part (ii). So, at 30 kpc, the mass enclosed will be 30 kpc/8 kpc times larger, or $3.8 \times 10^{11} M_\odot$. At 100 kpc, the mass enclosed will be 100 kpc/8 kpc times larger, or $1.3 \times 10^{12} M_\odot$. (v) We see that the mass of the gravitating matter increases linearly with radius, and exceeds by more than a factor of 10 the mass of the luminous matter (e.g. stars and gas). We thus infer that the outer halo of the galaxy is dominated by invisible dark matter.
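Parts (ii) to (iv) can be retraced numerically with (495) and (496). The sketch below assumes an orbital speed of 220 km/s at 8 kpc, which is not stated in this solution and is only an illustrative input; with it the enclosed mass comes out close to the rounded $10^{11} M_\odot$ used above.

```python
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
KPC = 3.086e19       # kiloparsec, m
M_SUN = 1.989e30     # solar mass, kg
v_sun = 220e3        # assumed orbital speed at 8 kpc, m/s (illustrative)

def enclosed_mass(v, a):
    """Mass inside a circular orbit of radius a and speed v, (496)."""
    return v ** 2 * a / G

def keplerian_speed(M, a):
    """Orbital speed around a fixed enclosed mass M, (495)."""
    return (G * M / a) ** 0.5

M8 = enclosed_mass(v_sun, 8 * KPC)

# (iii) all mass inside 8 kpc: the speed falls off as 1/sqrt(a)
v30 = keplerian_speed(M8, 30 * KPC)
v100 = keplerian_speed(M8, 100 * KPC)

# (iv) flat rotation curve: the enclosed mass grows linearly with a
M30_flat = enclosed_mass(v_sun, 30 * KPC)
M100_flat = enclosed_mass(v_sun, 100 * KPC)

print(f"M(8 kpc) = {M8:.2e} kg = {M8 / M_SUN:.2e} Msun")
print(f"v(30 kpc) = {v30 / 1e3:.0f} km/s, v(100 kpc) = {v100 / 1e3:.0f} km/s")
print(f"flat curve: M(30) = {M30_flat / M_SUN:.2e}, M(100) = {M100_flat / M_SUN:.2e} Msun")
```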

7.6 (i) The total mass inside $R$ is obtained from $GM(R)/R = v_c^2(R)$. The answer can of course be found by substituting the value of $G$ and everything else in your favorite system of units, and if you are lucky enough not to make any mistake you may even get the right answer. Often, it is faster and safer to work it out using proportionality, comparing to an example that you know and love. What could this example be but the Earth moving around the Sun? For the Earth, with $M_\odot$ and an orbit of 1 AU, the velocity is $30~{\rm km\,s^{-1}}$ (if you did not know how fast the Earth moves around the Sun, this is a good number to remember). So, the mass inside radius $R$ is

\begin{equation} M(R) = 8 \times 10^{10} M_\odot \, . \tag{497} \end{equation}

(ii) If the density at $R$ is $\rho_0$, then the density at any other radius $r$ is $\rho_0 (R/r)^2$, so the mass inside $R$ is

\begin{equation} M(R) = 4\pi \int_0^R dr\, r^2 \rho_0 \left( \frac{R}{r} \right)^2 = 4\pi \rho_0 R^3 \, . \tag{498} \end{equation}

Hence the density at $R$ is

\begin{equation} \rho_0 = \frac{M(R)}{4\pi R^3} = 0.51~m_p~{\rm cm^{-3}} \, . \tag{499} \end{equation}

The result is most easily computed remembering that the solar mass contains $1.19 \times 10^{57}$ proton masses (another useful number to remember), and a parsec is $3.086 \times 10^{16}$ m. (iii) The density is

\begin{equation} \rho_\Lambda = \Omega_\Lambda \frac{3H_0^2}{8\pi G} = 5.5 \times 10^{-6}\, \Omega_\Lambda~m_p~{\rm cm^{-3}} = 4 \times 10^{-6}~m_p~{\rm cm^{-3}} \, . \tag{500} \end{equation}

(iv) Because the dark energy is spread out uniformly, whereas the dark matter and baryonic matter are highly concentrated in the inner parts of galaxies, the density of dark energy is very small compared to the density of matter inside the radius of the solar orbit in the Milky Way. The dark energy therefore must have a tiny dynamical effect.

7.7 The relation between the emitted $T_{\rm em}$ and the observed $T_{\rm obs}$ is

\begin{equation} T_{\rm em} = T_{\rm obs} (1+z) = \frac{2.9 \times 10^{-3}~{\rm m\,K}}{\lambda_{\rm max}^{\rm obs}} (1+z) \, , \tag{501} \end{equation}

where in the last equality we used Wien's displacement law [17]. For $\lambda_{\rm max}^{\rm obs} = 180~\mu$m and $z = 2$, we have $T_{\rm em} \simeq 48$ K. If we did not account for redshift, we would have thought the galaxy was only at 16 K.
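A minimal sketch of the evaluation of (501), with the Wien constant rounded as in the text; the function name is illustrative.

```python
WIEN_B = 2.9e-3   # Wien displacement constant, m K (rounded as in the text)

def temperature_from_peak(lambda_obs_m, z=0.0):
    """Blackbody temperature at emission from the observed peak wavelength, (501)."""
    return WIEN_B / lambda_obs_m * (1.0 + z)

T_em = temperature_from_peak(180e-6, z=2.0)   # with the redshift correction
T_naive = temperature_from_peak(180e-6)       # ignoring the redshift

print(f"T_em = {T_em:.1f} K, uncorrected = {T_naive:.1f} K")
```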

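7.8 In the benchmark model, at the present moment, the ratio of the vacuum energy density to the energy density in matter is

\begin{equation} \frac{\rho_\Lambda}{\rho_{m,0}} = \frac{\Omega_\Lambda}{\Omega_{m,0}} \approx 2.3 \, . \tag{502} \end{equation}

In the past, however, when the scale factor was smaller, the ratio of densities was

\begin{equation} \frac{\rho_\Lambda}{\rho_m(a)} = \frac{\rho_\Lambda}{\rho_{m,0}/a^3} = \frac{\rho_\Lambda}{\rho_{m,0}}\, a^3 \, . \tag{503} \end{equation}

If the universe has been expanding from an initial very dense state, at some moment in the past the energy densities of matter and $\Lambda$ must have been equal. This moment of matter-$\Lambda$ equality occurred when

\begin{equation} a_{m\Lambda}^3 = \frac{\Omega_{m,0}}{\Omega_\Lambda} = \frac{\Omega_{m,0}}{1 - \Omega_{m,0}} \approx 0.43 \, , \tag{504} \end{equation}

where we have used the normalization $a_0 = 1$ for the present. Next, we generalize (205) to write the age of the universe at any redshift $z$, for a flat model with matter and a cosmological constant,

\begin{equation} t(z) = \frac{1}{H_0} \int_z^\infty \frac{dz}{(1+z) \sqrt{\Omega_{m,0}(1+z)^3 + \Omega_\Lambda}} = \frac{1}{H_0 \sqrt{\Omega_\Lambda}} \int_z^\infty \frac{dz}{(1+z) \sqrt{1 + (\Omega_{m,0}/\Omega_\Lambda)(1+z)^3}} \, . \tag{505} \end{equation}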

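With the change of variables $y = \sqrt{1 + (\Omega_{m,0}/\Omega_\Lambda)(1+z)^3}$ we find $2y\,dy = 3(y^2 - 1)\,dz/(1+z)$ and so

\begin{equation} t = \frac{2}{3H_0 \sqrt{\Omega_\Lambda}} \int_y^\infty \frac{dy}{y^2 - 1} \, . \tag{506} \end{equation}

The integral can be solved analytically as:

\begin{equation} \int \frac{dy}{y^2 - 1} = -\frac{1}{2} \ln \left[ \frac{y+1}{y-1} \right] , \tag{507} \end{equation}

which yields for $t$

\begin{equation} \begin{aligned} t &= \frac{2}{3H_0 \sqrt{\Omega_\Lambda}} \left. \ln \left( \frac{1}{\sqrt{y^2-1}} + \frac{y}{\sqrt{y^2-1}} \right) \right|_{\infty}^{\sqrt{1 + \Omega_{m,0}(1+z)^3/\Omega_\Lambda}} \\ &= \frac{2}{3H_0 \sqrt{\Omega_\Lambda}} \ln \left[ \sqrt{\frac{\Omega_\Lambda}{\Omega_{m,0}(1+z)^3}} + \sqrt{1 + \frac{\Omega_\Lambda}{\Omega_{m,0}(1+z)^3}} \right] . \end{aligned} \tag{508} \end{equation}

Using (203) and (504) we can rewrite (508) as

\begin{equation} H_0 t = \frac{2}{3\sqrt{1 - \Omega_{m,0}}} \ln \left[ (a/a_{m\Lambda})^{3/2} + \sqrt{1 + (a/a_{m\Lambda})^3} \right] = \frac{2}{3\sqrt{1 - \Omega_{m,0}}} \ln A \, , \tag{509} \end{equation}

and so

\begin{equation} H_0 t_0 = \frac{2}{3\sqrt{1 - \Omega_{m,0}}} \ln A_0 \, , \tag{510} \end{equation}

where we have defined

\begin{equation} A_0 = a_{m\Lambda}^{-3/2} + \sqrt{1 + a_{m\Lambda}^{-3}} \simeq 3.35 \, . \tag{511} \end{equation}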

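Now, we want to find the value of $a$ for which $t = t_0/2$. This implies

\begin{equation} \ln A = \frac{1}{2} \ln A_0 \, , \tag{512} \end{equation}

where

\begin{equation} A = x + \sqrt{1 + x^2} \quad {\rm with} \quad x = \left( \frac{a}{a_{m\Lambda}} \right)^{3/2} . \tag{513} \end{equation}

Equation (512) implies $A = \sqrt{A_0}$ and so

\begin{equation} x + \sqrt{1 + x^2} = \sqrt{A_0} \;\Rightarrow\; 1 + x^2 = A_0 - 2\sqrt{A_0}\, x + x^2 \;\Rightarrow\; x = \frac{A_0 - 1}{2\sqrt{A_0}} \simeq 0.64 \, , \tag{514} \end{equation}

yielding

\begin{equation} a = a_{m\Lambda}\, x^{2/3} = 0.56 \quad {\rm and} \quad z = \frac{1}{a} - 1 = 0.78 \, . \tag{515} \end{equation}

We see that the redshift at which the age of the universe was half the present age is larger in this benchmark model than in the model with $\Omega_{m,0} = 1$, see (475). This is because in the benchmark model, which contains vacuum energy, the universe has started to accelerate recently, roughly since the epoch at $a = a_{m\Lambda}$. The universe took a longer time to expand to $a = 0.56$ and then it picked up speed again in its expansion up to the present $a_0 = 1$.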
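The chain (504), (509), (511), (514) and (515) can be evaluated in a few lines. The sketch assumes $\Omega_{m,0} = 0.3$ for the benchmark model, a value inferred from the ratios quoted in (502) and (504) rather than stated in this solution; the function and variable names are illustrative.

```python
import math

Omega_m = 0.3                 # assumed benchmark value, inferred from (502) and (504)
Omega_L = 1.0 - Omega_m
a_mL = (Omega_m / Omega_L) ** (1.0 / 3.0)   # matter-Lambda equality, (504)

def H0_t(a):
    """Age of the flat matter + Lambda model at scale factor a, in units of 1/H0, (509)."""
    x = (a / a_mL) ** 1.5
    return 2.0 / (3.0 * math.sqrt(Omega_L)) * math.log(x + math.sqrt(1.0 + x * x))

A0 = a_mL ** -1.5 + math.sqrt(1.0 + a_mL ** -3)    # (511)
x_half = (A0 - 1.0) / (2.0 * math.sqrt(A0))        # (514)
a_half = a_mL * x_half ** (2.0 / 3.0)              # (515)
z_half = 1.0 / a_half - 1.0

print(f"A0 = {A0:.2f}, x = {x_half:.2f}, a = {a_half:.2f}, z = {z_half:.2f}")
print(f"H0*t0 = {H0_t(1.0):.3f}, H0*t(a_half) = {H0_t(a_half):.3f}")
```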

8.1 We have seen in (282) that $s_\gamma \propto T^3$, so that $S_\gamma \propto V T^3$. For a reversible adiabatic expansion, the entropy of the (non-interacting) CMB photons remains unchanged. Hence, when $V$ doubles, $T$ will decrease by a factor $2^{-1/3}$. So after $10^{10}$ yr the average temperature of the blackbody will become $\langle T \rangle = 2.2$ K.

8.2 Since during inflation the Hubble rate is constant,

\begin{equation} \Omega - 1 = \frac{kc^2}{a^2 H^2 R_0^2} \propto a^{-2} \, . \tag{516} \end{equation}

On the other hand, (290) suggests that to reproduce today's observed value $\Omega_0 - 1 \sim 1$ the initial value at the beginning of the radiation-dominated phase must be $|\Omega - 1| \sim 10^{-54}$. Since we identify the beginning of the radiation-dominated phase with the end of inflation we require

\begin{equation} |\Omega - 1|_{t = t_f} \sim 10^{-54} \, . \tag{517} \end{equation}

During inflation

\begin{equation} \frac{|\Omega - 1|_{t = t_f}}{|\Omega - 1|_{t = t_i}} = \left( \frac{a_i}{a_f} \right)^2 = e^{-2H\Delta t} \, . \tag{518} \end{equation}

Taking $|\Omega - 1|_{t = t_i}$ of order unity, it is enough to require that $\Delta t \gtrsim 60/H$ to solve the flatness problem. Thus, inflation ameliorates the fine-tuning problem, by explaining a tiny number $\mathcal{O}(10^{-54})$ with a number $\mathcal{O}(60)$.
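The required number of e-folds follows from (517) and (518) in one line. As a worked step, taking $|\Omega - 1|_{t=t_i} \sim 1$:

```latex
\begin{equation}
e^{-2H\Delta t} \lesssim 10^{-54}
\quad\Longrightarrow\quad
H \Delta t \gtrsim \frac{54}{2} \ln 10 \approx 62 \, ,
\end{equation}
```

which is the origin of the round figure of 60 e-folds.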

8.3 The effective number of neutrinos and antineutrinos is $g_{\nu_L} = 6$ and the temperature of the cosmic neutrino background is $T_\nu = 0.7\, T_\gamma \approx 1.9$ K. Now, from (269) we have

\begin{equation} n_\nu = \frac{3}{4} \frac{\zeta(3)}{\pi^2}\, g_{\nu_L} \left( \frac{kT_\nu}{\hbar c} \right)^3 \approx 45.63 \left( \frac{T_\nu}{\rm K} \right)^3 \approx 313~{\rm neutrinos/cm^3} \, . \tag{519} \end{equation}

If neutrinos saturate the dark matter density the upper bound on the neutrino mass is then

\begin{equation} m_\nu < \frac{0.26\, \rho_c}{n_\nu} \approx \frac{2.6 \times 10^{-27}~{\rm kg/m^3}}{3.13 \times 10^{8}~{\rm neutrino/m^3}} \approx 10^{-35}\, \frac{\rm kg}{\rm neutrino} \times \frac{9.38 \times 10^{8}~{\rm eV}/c^2}{1.67 \times 10^{-27}~{\rm kg}} \sim 5.6~{\rm eV}/c^2 \, , \tag{520} \end{equation}

where we have used $m_p = 938~{\rm MeV}/c^2 = 1.67 \times 10^{-27}$ kg to obtain the result in natural units.

8.4 If we change the difference between the proton and neutron mass to $\Delta m = 0.129$ MeV while all other parameters remain the same, then the freeze-out of the neutron abundance occurs at the same temperature $T^{\rm FO}_{n/N} = 0.75$ MeV. Therefore, the neutron abundance freezes out at $n_n/n_p = e^{-0.1293/0.75} = 0.84$. If there were no neutron decays and all neutrons combined to form helium, the maximum primordial $^4$He abundance would then be

\begin{equation} Y_p^{\rm max} = \frac{2 n_n}{n_n + n_p} = 0.91 \, . \tag{521} \end{equation}

Note that neutrons would in fact not decay; they would be stable, because the difference with the mass of the proton would be less than the mass of the electron. It would be rather unfortunate if the neutron had a mass so close to the proton mass: almost all the matter in the universe would have turned to helium in the beginning of the universe, and main-sequence stars would not live very long with the very small amount of hydrogen they would have left. The Sun would live for less than 1 billion years and the planet Earth would not have had enough time to sustain life on it for us to be here now.
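The freeze-out ratio and the helium mass fraction of 8.4 follow from two lines of arithmetic. The sketch below also evaluates the same expressions for the measured mass difference of 1.293 MeV, added here only for comparison; as in (521), neutron decay is ignored in both cases.

```python
import math

T_FO = 0.75   # freeze-out temperature used in the text, MeV

def neutron_to_proton(delta_m_MeV, T_MeV=T_FO):
    """Equilibrium ratio n_n/n_p at freeze-out."""
    return math.exp(-delta_m_MeV / T_MeV)

def helium_mass_fraction(ratio):
    """Maximum 4He mass fraction if every neutron ends up in helium, (521)."""
    return 2.0 * ratio / (1.0 + ratio)

ratio_small = neutron_to_proton(0.1293)    # hypothetical mass difference of 8.4
ratio_real = neutron_to_proton(1.293)      # measured mass difference, for comparison

print(f"dm = 0.1293 MeV: n_n/n_p = {ratio_small:.2f}, Y_max = {helium_mass_fraction(ratio_small):.2f}")
print(f"dm = 1.293 MeV:  n_n/n_p = {ratio_real:.2f}, Y_max = {helium_mass_fraction(ratio_real):.2f}")
```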

8.5 (i) From (282), the energy density of photons at the time of BBN was

\begin{equation} \rho_{\gamma,{\rm BBN}} = 0.66\, \frac{(kT_{\rm BBN})^4}{(\hbar c)^3} = 7.56 \times 10^{20}~{\rm J/m^3} \, , \tag{522} \end{equation}

where $T_{\rm BBN} \approx 10^9$ K. Note that in reality we should also account for the neutrinos, but Gamow did not know much about the three families of neutrinos and their interactions. A better estimate of the energy density at BBN goes as follows. The effective number of neutrinos and antineutrinos is 6, or 3 times the effective number of species of photons. On the other hand, the fourth power of the neutrino temperature is less than the fourth power of the photon temperature by a factor of $3^{-4/3}$. Thus the ratio of the energy density of neutrinos and antineutrinos to that of photons is

\begin{equation} \rho_\nu/\rho_\gamma = 3^{-4/3} \cdot 3 = 0.7 \, . \tag{523} \end{equation}

Hence the total energy density after electron-positron annihilation is

\begin{equation} \rho_{\rm BBN} \simeq \rho_{\nu,{\rm BBN}} + \rho_{\gamma,{\rm BBN}} = 1.7\, \rho_{\gamma,{\rm BBN}} \simeq 1.3 \times 10^{21}~{\rm J/m^3} \, . \tag{524} \end{equation}

(ii) Since the universe was radiation dominated, the critical density at BBN had to be equal to this radiation density, so

\begin{equation} \frac{3c^2 H^2}{8\pi G} = \rho_{\rm BBN} \, . \tag{525} \end{equation}

This gives a Hubble parameter at the time of BBN in Gamow's radiation dominated universe of $H = 2.17 \times 10^{-3}~{\rm s^{-1}}$. (iii) Readjusting (187), the time for BBN is found to be

\begin{equation} t_{{\rm BBN},G} = \frac{1}{2H} = 231~{\rm s} \, . \tag{526} \end{equation}

(iv) For a present age $t_0 \approx 10^{10}$ yr, the temperature is given by

\begin{equation} \frac{3c^2 H_0^2}{8\pi G} = \frac{3c^2}{32\pi G t_0^2} = 0.66\, \frac{(kT_{0,G})^4}{(\hbar c)^3} \, , \tag{527} \end{equation}

which gives $T_{0,G} = 27$ K. Note that this temperature actually depends only on $t_0$ and on the assumption of a flat, radiation-dominated universe; it does not depend on $T_{\rm BBN}$. (v) If the universe changed from being radiation dominated to matter dominated at some redshift $z_{\rm eq}$, then at the present time the matter density is greater than the radiation density by a factor $1 + z_{\rm eq}$; so $\rho_{\rm rad} = \rho_m c^2/(1 + z_{\rm eq})$. In a flat universe with only matter and radiation, the total density has to be equal to the critical density, therefore $\rho_{\rm rad} + \rho_m c^2 = \rho_{\rm rad}(2 + z_{\rm eq}) = \rho_c$. So,

\begin{equation} \rho_{\rm rad} = \frac{3H_0^2 c^2}{8\pi G} \frac{1}{2 + z_{\rm eq}} = 0.66\, \frac{(kT_0)^4}{(\hbar c)^3} \, , \tag{528} \end{equation}

and the radiation temperature is smaller by a factor $(2 + z_{\rm eq})^{-1/4}$.
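The numbers of parts (i) to (iv) can be reproduced with the radiation constant $a_{\rm rad} = \pi^2 k^4/(15\hbar^3 c^3)$, which is the coefficient written as $0.66\,k^4/(\hbar c)^3$ in (522). In this sketch the Hubble rate is computed from the photon density (522) alone, which is what reproduces the quoted value of $H$; the variable names are illustrative.

```python
import math

G = 6.674e-11       # gravitational constant, m^3 kg^-1 s^-2
c = 2.998e8         # speed of light, m/s
a_rad = 7.566e-16   # radiation constant, J m^-3 K^-4
YEAR = 3.156e7      # s

T_bbn = 1e9                                   # K
rho_gamma = a_rad * T_bbn ** 4                # (522), J/m^3
H_bbn = math.sqrt(8.0 * math.pi * G * rho_gamma / (3.0 * c ** 2))   # from (525)
t_bbn = 1.0 / (2.0 * H_bbn)                   # (526), s

t0 = 1e10 * YEAR                              # present age, s
rho_0 = 3.0 * c ** 2 / (32.0 * math.pi * G * t0 ** 2)   # left-hand side of (527)
T0_gamow = (rho_0 / a_rad) ** 0.25            # K

def T0_with_matter(z_eq):
    """Present temperature if matter dominates since z_eq, following (528)."""
    H0 = 1.0 / (2.0 * t0)   # same illustrative H0 as in (527)
    rho_c = 3.0 * c ** 2 * H0 ** 2 / (8.0 * math.pi * G)
    return (rho_c / (2.0 + z_eq) / a_rad) ** 0.25

print(f"rho_gamma = {rho_gamma:.2e} J/m^3, H = {H_bbn:.2e} 1/s, t = {t_bbn:.0f} s")
print(f"T0 (Gamow) = {T0_gamow:.1f} K")
```

Calling `T0_with_matter(z_eq)` shows the suppression by $(2 + z_{\rm eq})^{-1/4}$ relative to the radiation-only estimate.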

8.6 The effective number of neutrino species contributing to r.d.o.f. can be written as

\begin{equation} N_{\rm eff} = 3 \left[ 1 + \left( \frac{T_{\nu_R}}{T_{\nu_L}} \right)^4 \right] . \tag{529} \end{equation}

Taking into account the isentropic heating of the rest of the plasma between the $\nu_R$ decoupling temperature $T_{\nu_R}^{\rm dec}$ and the end of the reheating phase,

\begin{equation} \delta N_\nu = 3 \left( \frac{g(T_{\nu_L}^{\rm dec})}{g(T_{\nu_R}^{\rm dec})} \right)^{4/3} , \tag{530} \end{equation}

where $T_{\nu_L}^{\rm dec}$ is the temperature at the end of the reheating phase (when left-handed neutrinos decouple), and we have taken $N_{\rm eff} = 3 + \delta N_\nu$. To be consistent with Planck data at $1\sigma$ we require $N_{\rm eff} < 3.68$. We take $g(T_{\nu_L}^{\rm dec}) = 10.75$, reflecting $(e_L^- + e_R^+ + e_R^- + e_L^+ + \nu_{eL} + \bar\nu_{eR} + \nu_{\mu L} + \bar\nu_{\mu R} + \nu_{\tau L} + \bar\nu_{\tau R} + \gamma_L + \gamma_R)$. From (530) the allowable range is $g(T_{\nu_R}^{\rm dec}) > 33$. This is achieved for $r(T_{\nu_R}^{\rm dec}) > 0.29$. Using (295) this can be translated into a decoupling temperature: $T_{\nu_R}^{\rm dec} > 185$ MeV.

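8.7 At a given time, the rate of decrease in the BH mass is just the total power radiated

\begin{equation} \frac{d\dot M_{\rm BH}}{dQ} = - \sum_i g_i\, \frac{\sigma_s}{8\pi^2}\, Q^3 \left[ \exp\left( \frac{Q}{T_{\rm BH}} \right) - (-1)^{2s} \right]^{-1} . \tag{531} \end{equation}

Integration of (531) leads to

\begin{equation} \dot M_{\rm BH} = - \sum_i g_i\, B_\pm\, \frac{\Gamma_s}{8\pi^2}\, \Gamma(4)\, \zeta(4)\, T_{\rm BH}^4\, A_{4 \subset 4+n} \, . \tag{532} \end{equation}

The net change of the BH mass is therefore

\begin{equation} \frac{dM_{\rm BH}}{dt} = \left. \frac{dM_{\rm BH}}{dt} \right|_{\rm accr} + \left. \frac{dM_{\rm BH}}{dt} \right|_{\rm evap} . \tag{533} \end{equation}

Substituting $M_{\rm BH} \sim \sqrt{\hat s}$ into (532), where $\sqrt{\hat s}$ is the center-of-mass energy of the constituents of the protons (quarks and gluons), a rather lengthy but straightforward calculation shows that $dM/dt > 0 \Leftrightarrow \epsilon > 10^{10}~{\rm GeV/fm^3}$, where $\epsilon$ is the energy density of the surrounding matter. Note that the energy density of partonic matter produced at the LHC is more than 7 orders of magnitude smaller. (ii) Since the ratio of degrees of freedom for gauge bosons, quarks and leptons is 29:72:18 (the Higgs boson is not included), from (532) we obtain a rough estimate of the mean lifetime,

\begin{equation} \tau_{\rm BH} \approx 1.67 \times 10^{-27} \left( \frac{M_{\rm BH}}{M_*} \right)^{9/7} \left( \frac{\rm TeV}{M_*} \right) {\rm s} \, . \tag{534} \end{equation}

Then (534) indicates that black holes that could be produced at the LHC would evaporate instantaneously into visible quanta. For further thoughts on this subject see [310, 311].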

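9.1 (i) For a steady state, $\partial n/\partial t = 0$ and

\begin{equation} \frac{\partial^2 n}{\partial z^2} = - \frac{Q_0}{D}\, \delta(z) \, . \tag{535} \end{equation}

Integration yields

\begin{equation} \frac{\partial n}{\partial z} = A - \frac{Q_0}{D}\, \Theta(z) \, , \tag{536} \end{equation}

where $A$ is an integration constant and $\Theta(z)$ the Heaviside step function (see Appendix F). A second integration leads to

\begin{equation} n(z) = B + Az - \frac{Q_0}{D}\, z\, \Theta(z) \, , \tag{537} \end{equation}

where $B$ is an integration constant. From $n(-H) = 0$ we can conclude that $B = AH$ and so

\begin{equation} n(z) = AH + Az - \frac{Q_0}{D}\, z\, \Theta(z) \, . \tag{538} \end{equation}

On the other hand, $n(+H) = 0$ yields

\begin{equation} 2AH - \frac{Q_0}{D}\, H = 0 \tag{539} \end{equation}

or

\begin{equation} A = \frac{1}{2} \frac{Q_0}{D} \, . \tag{540} \end{equation}

Then the particle density in the range $-H \leq z \leq H$ is

\begin{equation} n(z) = \frac{1}{2} \frac{Q_0}{D} (H + z) - \frac{Q_0}{D}\, z\, \Theta(z) \, , \tag{541} \end{equation}

which can be rewritten as

\begin{equation} n(z) = \frac{1}{2} \frac{Q_0}{D} (H - |z|) \, . \tag{542} \end{equation}

(ii) The column density is

\begin{equation} N = \int_{-H}^{+H} n(z)\, dz = 2 \int_0^H \frac{1}{2} \frac{Q_0}{D} (H - z)\, dz = \frac{Q_0 H^2}{2D} \, . \tag{543} \end{equation}

Using $N = Q_0 \tau_{\rm res}$ we have

\begin{equation} \tau_{\rm res} = \frac{H^2}{2D} \;\Rightarrow\; D = \frac{H^2}{2\tau_{\rm res}} \, . \tag{544} \end{equation}

Using $D = \beta c \lambda/3$ the mean free path is

\begin{equation} \lambda = \frac{3}{2} \frac{H^2}{\beta c\, \tau_{\rm res}} \, . \tag{545} \end{equation}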
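As a numerical illustration of (544) and (545), the sketch below evaluates the diffusion coefficient and the mean free path for a halo half-thickness of 500 pc, a residence time of $10^7$ yr and $\beta = 1$; the unit-conversion constants are rounded and the variable names are illustrative.

```python
PC = 3.086e16      # parsec, m
YEAR = 3.156e7     # year, s
C = 2.998e8        # speed of light, m/s

H = 500.0 * PC     # half-thickness of the confinement region, m
tau_res = 1e7 * YEAR
beta = 1.0

D = H ** 2 / (2.0 * tau_res)          # diffusion coefficient, (544), m^2/s
lam = 3.0 * D / (beta * C)            # mean free path, (545), m
lam_pc = lam / PC

print(f"D = {D:.2e} m^2/s = {D * 1e4:.2e} cm^2/s")
print(f"lambda = {lam_pc:.2f} pc")
```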

For $H = 500$ pc, $\tau_{\rm res} = 10^7$ yr and $\beta \sim 1$ the mean free path is about 0.1 pc.

9.2 (i) The equation of motion is

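\begin{equation} \frac{d\vec p}{dt} = \vec F = \frac{d}{dt} (\gamma m \vec v) = Ze\, (\vec v \times \vec B) \, , \tag{546} \end{equation}

where $e$ is the elementary charge and $Z$ is the charge number. The acceleration in a magnetic field is always perpendicular to the velocity, $\vec v \perp \dot{\vec v} = \vec a$, and hence $\dot\gamma = 0$. Therefore

\begin{equation} \gamma m \dot{\vec v} = Ze\, (\vec v \times \vec B) \, . \tag{547} \end{equation}

For $\vec v \perp \vec B$ we can write down the component-wise differential equations, which read

\begin{equation} \dot v_x = \frac{Ze}{\gamma m}\, v_y B \quad {\rm and} \quad \dot v_y = - \frac{Ze}{\gamma m}\, v_x B \, . \tag{548} \end{equation}

The solution is

\begin{equation} v_x = v \sin\left( \frac{ZeB}{\gamma m}\, t \right) \quad {\rm and} \quad v_y = v \cos\left( \frac{ZeB}{\gamma m}\, t \right) , \tag{549} \end{equation}

which leads to

\begin{equation} x = - \frac{v \gamma m}{ZeB} \cos\left( \frac{ZeB}{\gamma m}\, t \right) \quad {\rm and} \quad y = \frac{v \gamma m}{ZeB} \sin\left( \frac{ZeB}{\gamma m}\, t \right) . \tag{550} \end{equation}

The radius is therefore

\begin{equation} R = \sqrt{x^2 + y^2} = \frac{v \gamma m}{ZeB} \approx \frac{c \gamma m}{ZeB} \, , \tag{551} \end{equation}

where in the last step we set $v \approx c$. (ii) For a given radius $R$, the magnetic field strength can thus be expressed as

\begin{equation} B = \frac{c \gamma m}{ZeR} \, . \tag{552} \end{equation}

For $R \simeq 27~{\rm km}/(2\pi)$ and $c \gamma m_p = E/c$ we can calculate the average magnetic field at the LHC, $B_{\rm LHC} \approx 5.43$ T. Note that in reality the particles in a collider are not in a uniform magnetic field; rather, the collider ring is composed of alternating sections for bending, accelerating and focusing the particles. Therefore the actual magnetic field strengths needed are larger than the one calculated above. At the LHC, the magnets produce a field of 8.7 Tesla. (iii) Useful formulae for the radius of the orbit of a particle in a magnetic field can be obtained by introducing $E = \gamma m c^2$ and evaluating the numerical constants, which gives the rule of thumb for particle physics detectors

\begin{equation} R = 3.3~{\rm m}\, \frac{E/{\rm GeV}}{Z\, (B/{\rm T})} \, , \tag{553} \end{equation}

and the rule of thumb for cosmic ray acceleration (sometimes called the Hillas criterion [312])

\begin{equation} R = 1.1~{\rm kpc}\, \frac{E/{\rm EeV}}{Z\, (B/\mu{\rm G})} \, . \tag{554} \end{equation}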
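The rules of thumb (553) and (554) both follow from $R = E/(ZecB)$ for an ultra-relativistic particle. The sketch below evaluates this expression directly; the beam energy of 7 TeV used for the LHC is an assumption made for illustration (it is not quoted in this solution), and the function names are hypothetical.

```python
import math

C = 2.99792458e8    # speed of light, m/s
KPC = 3.086e19      # kiloparsec, m

def larmor_radius_m(E_GeV, B_T, Z=1):
    """Gyroradius R = E/(Z e c B) for v ~ c; with E in eV the charge e cancels."""
    return E_GeV * 1e9 / (Z * C * B_T)

def field_for_radius_T(E_GeV, R_m, Z=1):
    """Magnetic field needed to keep energy E on a circle of radius R, (552)."""
    return E_GeV * 1e9 / (Z * C * R_m)

R_lhc = 27e3 / (2.0 * math.pi)                # ring radius from a 27 km circumference
B_lhc = field_for_radius_T(7000.0, R_lhc)     # assumed 7 TeV protons
R_big = larmor_radius_m(1e11, B_lhc)          # ring needed for 1e11 GeV at the same field
R_cr_kpc = larmor_radius_m(1e9, 1e-10) / KPC  # 1 EeV proton in 1 microgauss (1e-10 T)

print(f"B_LHC (average) = {B_lhc:.2f} T")
print(f"R(1 GeV, 1 T) = {larmor_radius_m(1.0, 1.0):.2f} m")
print(f"R(1e11 GeV) = {R_big:.2e} m, R(1 EeV, 1 uG) = {R_cr_kpc:.2f} kpc")
```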

The radius of a collider, with the average magnetic field of the LHC, that is expected to launch particles to $10^{11}$ GeV would be $6 \times 10^{10}$ m. This radius is comparable to the Sun-Mercury distance, which is $5.76 \times 10^{10}$ m (see exercise 2.3). Hence, such a collider would be priceless! (iv) The maximum attainable energies in the given astrophysical objects are: for neutron stars, $E^{p}_{\rm max} \sim 10^{11}$ GeV and $E^{^{56}{\rm Fe}}_{\rm max} \sim 2.6 \times 10^{12}$ GeV; for AGN jets, $E^{p}_{\rm max} \sim 10^{10}$ GeV and $E^{^{56}{\rm Fe}}_{\rm max} \sim 2.6 \times 10^{11}$ GeV; for supernova remnants, $E^{p}_{\rm max} \sim 10^{7}$ GeV and $E^{^{56}{\rm Fe}}_{\rm max} \sim 2.6 \times 10^{8}$ GeV.

9.3 (i) We are told that the energy emitted by the supernova in visible light is equal to that emitted by the Sun in $10^{10}$ yr. We can look up the luminosity of the Sun (energy emitted per second), and simply multiply by the $10^{10}$ yr,

\begin{equation} \text{Total energy emitted in visible light} = 4 \times 10^{26}~{\rm J/s} \times 10^{10}~{\rm yr} \times \frac{3 \times 10^{7}~{\rm s}}{1~{\rm yr}} \approx 10^{44}~{\rm J} \, . \tag{555} \end{equation}

The energy associated with the neutrinos is 100 times larger still, namely $10^{46}$ J. (ii) If each neutrino has an energy of $\langle E_\nu \rangle \sim 1.5 \times 10^{-12}$ J, the total number of neutrinos emitted by the star is $\sim 7 \times 10^{57}$. (iii) These neutrinos are emitted essentially all at once, and thereafter, travelling at the speed of light, they expand into a huge spherical shell of ever-increasing radius. Thus, by the time they impinge on the Earth, they are spread out over a spherical shell of radius 150,000 ly. The number density on the shell is:

\begin{equation} \frac{\rm Number}{\rm Surface~Area} = \frac{7 \times 10^{57}~{\rm neutrinos}}{4\pi\, (1.5 \times 10^{5}~{\rm ly} \times 10^{16}~{\rm m/ly})^2} \approx 2.5 \times 10^{14}~{\rm neutrinos/m^2} \, . \tag{556} \end{equation}

That is, every square meter on the Earth's surface was peppered with 250 trillion neutrinos from the supernova. The detector has 2.14 kton of water ($N_{\rm target} \sim 1.28 \times 10^{33}$ free target nucleons) and so using the average cross section for weak interactions we have

\begin{equation} \frac{1}{3}\, \frac{\rm Number}{\rm Surface~Area}\, \sigma_{\rm weak}\, N_{\rm target} = \frac{1}{3} \cdot 2.5 \times 10^{14} \cdot 5 \times 10^{-48} \left( \frac{E_\nu}{\rm MeV} \right)^2 \times 1.28 \times 10^{33} \sim 46~\text{electron neutrinos} \, . \tag{557} \end{equation}

When the discovery of the supernova was first announced, Bahcall, Dar, and Piran (BDP) immediately realized the possibility that Kamiokande could have detected the neutrinos from it. They locked themselves in their office, took the phone off the hook, did essentially the calculation that you have just done, and sent a paper off to Nature, all within 24 hours. They wanted to make a prediction about the neutrinos, untainted by any news that the neutrinos actually were found. Indeed, a few days later, the news of the IMB [313] and Kamiokande [314] detections came out. The two detectors in deep mines recorded a total of 19 neutrino interactions over a span of 13 seconds. The BDP paper was published on 1987 March 12 (the supernova itself went off on February 23), and has the following understated but triumphal final sentence: "Note added in proof: Since this paper was received on 2 March, the neutrino burst was found by the Kamiokande experimental group, with properties generally consistent with the calculated expectations" [315]. Making a rough correction for the 60% efficiency reduces the expected number of events to 28, within about a factor of 2 of the actual detection.
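The chain of estimates (555) to (557) can be retraced as follows. The sketch uses the rounded inputs of the solution, so the results differ from the quoted ones at the level of the rounding. The cross-section coefficient is taken in m$^2$, which is what the arithmetic of (557) implies; the variable names are illustrative.

```python
import math

L_SUN = 4e26             # solar luminosity, J/s
YEAR = 3e7               # s (rounded as in the text)
LY = 1e16                # light year, m (rounded as in the text)
MEV = 1.602e-13          # J

E_visible = L_SUN * 1e10 * YEAR          # (555), about 1e44 J
E_neutrinos = 1e46                       # 100 times the rounded visible energy, J
E_nu = 1.5e-12                           # mean neutrino energy, J
N_nu = E_neutrinos / E_nu                # total number of neutrinos

distance = 1.5e5 * LY
fluence = N_nu / (4.0 * math.pi * distance ** 2)    # (556), neutrinos per m^2

N_target = 1.28e33                       # target nucleons in 2.14 kton of water
sigma = 5e-48 * (E_nu / MEV) ** 2        # assumed units: m^2
events = fluence * sigma * N_target / 3.0           # (557)
events_detected = 0.6 * events           # rough 60% efficiency correction

print(f"E_visible = {E_visible:.1e} J, N_nu = {N_nu:.1e}")
print(f"fluence = {fluence:.2e} per m^2")
print(f"expected events = {events:.0f}, after efficiency = {events_detected:.0f}")
```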

47 9.4 Substituting M? and MBH in (342) we obtain P = 2.4 10 W. At this emission rate the neutron star will fall into the black hole in t 2.9 s. × ≈ 5.1 Kepler’s laws

2. The line connecting the Sun and a planet sweeps out equal area in equal time.

3. The “Harmonic Law” states the squared orbital period P of planets measured in years equals to the third power of their major axis measured in astronomical units, (P/yr)2 = (a/AU)3.

Kepler noted already that his laws describe also the motion of the Saturn moons, if in the Harmonic Law the appropriate units are used. Newton used later the Harmonic Law to derive the 1/r2 dependence of the gravitational force. We will follow the opposite way and discuss how Kepler’s laws follow from Newton’s law for gravitation. Ellipses: An ellipse may be defined by the condition r + r′ =2a, i.e. as the set of points with a constant sum 2a of the distances r and r′ to the two focal points F and F ′. Additionally to its major axis a, an ellipse is characterized either by its minor axis b or its eccentricity e. The latter two quantities can be connected by considering the 2 2 2 two points at the end of the minor axis b:Thenr = r′ = a and r = b +(ae) or

b2 = a2(1 e2) . (5.1) 101 − Consider a differential area dA on the surface of a sphere, in the form of a thin ring centered about the s b r′ symmetry axis. This ring can be thought of as the inter- r section of the spherical surface with two cones, one of half-angle ϑ, and other of half-angle ϑ + dϑ. The width ϑ of this ring is Rdϑ, and the radius of the ring is R sin ϑ. aaeThe differential solid angle is then Fs′ Fs dA (2πR sin ϑ)(Rdϑ) dΩ = = = 2π sin ϑdϑ. (B1) R2 R2

FIG. 45: An ellipse is defined by the condition $r + r' = 2a$, which describes the set of points with a constant sum $2a$ of the distances $r$ and $r'$ to the two focal points $F$ and $F'$ [16].

Appendix A: Properties of the ellipse

An ellipse is the set of all points for which the sum of the distances from two fixed points (foci) is constant, see Fig. 45. In addition to its semi-major axis $a$, an ellipse is characterized either by its semi-minor axis $b$ or by its eccentricity $e$. The latter two quantities can be connected by considering the two points at the ends of the minor axis, for which $r = r' = a$ and $r^2 = b^2 + (ae)^2$, or

$$ b^2 = a^2 (1 - e^2) \,. \tag{A1} $$

Any point of an ellipse can be specified by the distance $r$ to one of its focal points and an angle $\vartheta$ that is measured counter-clockwise beginning from the major axis. The two foci are separated by a distance $2ae$. From Fig. 45, applying the law of cosines and using $\cos(\pi - \vartheta) = -\cos\vartheta$, we have

$$ r'^2 = r^2 + (2ae)^2 + 2 r \,(2ae) \cos\vartheta \,. \tag{A2} $$

Eliminating $r'$ with the help of $r + r' = 2a$ and solving for $r$ we obtain

$$ r = \frac{a (1 - e^2)}{1 + e \cos\vartheta} \,. \tag{A3} $$

Appendix B: Geometry of radiation

The solid angle $\Omega$ is the two-dimensional analog of the conventional one-dimensional angle $\vartheta$. Just as the angle $\vartheta$ is defined as the distance along a circle divided by the radius of that circle, so the solid angle $\Omega$ is analogously defined as the area on the surface of a sphere divided by the radius squared of that sphere. The units for $\vartheta$ and $\Omega$ are radians (rad) and steradians (sr), respectively, although it should be noted that both of these measures of angle have no actual dimensions. Since the total surface area of a sphere of radius $R$ is $4\pi R^2$, the total solid angle in one sphere is $4\pi$ sr.

The solid angle inside a cone of half-angle $\vartheta_c$ can be determined by integrating

$$ \Omega = \int d\Omega = 2\pi \int_0^{\vartheta_c} \sin\vartheta \, d\vartheta = -2\pi \cos\vartheta \,\Big|_0^{\vartheta_c} = 2\pi (1 - \cos\vartheta_c) \,. \tag{B2} $$

It is often of interest to consider the small-angle approximation, where $\vartheta_c \ll 1$. In this limit, $\cos\vartheta_c \approx 1 - \vartheta_c^2/2$. Therefore, the solid angle of a cone with small half-angle $\vartheta_c$ is $\Omega \approx \pi \vartheta_c^2$.

Appendix C: Conservation of mass and momentum

Consider a fluid with local density $\rho(t, x, y, z)$ and local velocity $\vec{u}(t, x, y, z)$. Consider a control volume $V$ (not necessarily small, not necessarily rectangular) which has boundary $S$. The total mass in this volume is

$$ M = \int \rho \, dV \,. \tag{C1} $$

The rate-of-change of this mass is just

$$ \frac{\partial M}{\partial t} = \int \frac{\partial \rho}{\partial t} \, dV \,. \tag{C2} $$

The only way such a change can occur is by stuff flowing across the boundary, so

$$ \frac{\partial M}{\partial t} = - \int \rho \, \vec{u} \cdot d\vec{S} \,, \tag{C3} $$

where $d\vec{S}$ points along the outward normal of $S$, so that an outflow decreases $M$. We can change the surface integral into a volume integral using Green's theorem (the divergence theorem), to obtain

$$ \frac{\partial M}{\partial t} = - \int \vec{\nabla} \cdot (\rho \vec{u}) \, dV \,. \tag{C4} $$

Note that (C2) and (C4) must be equal no matter what volume $V$ we choose, so the integrands must be pointwise equal. This gives us an expression for the local conservation of mass

$$ \frac{\partial \rho}{\partial t} + \vec{\nabla} \cdot (\rho \vec{u}) = 0 \,, \tag{C5} $$

which is sometimes called the continuity equation.
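The content of the continuity equation (C5) can be illustrated numerically. The following is a minimal Python sketch, not part of the original derivation: it assumes a one-dimensional periodic grid, a constant velocity u > 0, and a first-order upwind discretization. The function names `advect_upwind` and `total_mass` and all numerical values are hypothetical choices made for illustration. Because each cell changes only by the difference of the fluxes through its two faces, the discrete analog of (C3), the total mass stays constant up to rounding.

```python
def advect_upwind(rho, u, dx, dt, steps):
    """Advance d(rho)/dt + d(rho*u)/dx = 0 on a periodic 1D grid.

    Assumes a constant velocity u > 0 and u*dt/dx <= 1 (illustrative setup).
    """
    rho = list(rho)
    n = len(rho)
    for _ in range(steps):
        # Upwind flux through the left face of cell i: u * rho[i-1]
        # (rho[-1] wraps around, which implements the periodic boundary).
        flux = [u * rho[i - 1] for i in range(n)]
        # Conservative update: cell i gains the flux entering through its
        # left face and loses the flux leaving through its right face.
        rho = [rho[i] - dt / dx * (flux[(i + 1) % n] - flux[i]) for i in range(n)]
    return rho


def total_mass(rho, dx):
    """Discrete version of (C1): sum of density times cell size."""
    return sum(rho) * dx


# A density bump on a uniform background, advected for 100 steps.
dx, dt, u = 0.02, 0.01, 1.0
rho0 = [1.5 if 10 <= i < 20 else 1.0 for i in range(50)]
rho1 = advect_upwind(rho0, u, dx, dt, steps=100)
print(total_mass(rho0, dx), total_mass(rho1, dx))
```

The bump moves and spreads out, but the two printed masses agree to rounding accuracy, since the face fluxes cancel pairwise when summed over all cells.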

We can go through the same process for momentum instead of mass. We use $\Pi$ to represent momentum, to avoid conflict with $P$ which represents pressure. The total momentum in the control volume is

$$ \Pi_i = \int \rho \, u_i \, dV \,, \tag{C6} $$

where the index $i$ runs over the three components of the momentum. The rate-of-change thereof is just

$$ \frac{\partial \Pi_i}{\partial t} = \int \frac{\partial (\rho u_i)}{\partial t} \, dV \,. \tag{C7} $$

We (temporarily) assume that there are no applied forces (i.e. no gravity) and no pressure (e.g. a fluid of non-interacting dust particles). We also assume viscous forces are negligible. Then, the only way a momentum-change can occur is by momentum flowing across the boundary,

$$ \frac{\partial \Pi_i}{\partial t} = - \int (\rho u_i) \, \vec{u} \cdot d\vec{S} = - \int \rho \, u_i \, u^j \, dS_j \,, \tag{C8} $$

with $d\vec{S}$ along the outward normal. We are expressing dot products using the Einstein summation convention, i.e. implied summation over repeated dummy indices, such as $j$ in the previous expression. We can change the surface integral into a volume integral using Green's theorem (the divergence theorem), to obtain

$$ \frac{\partial \Pi_i}{\partial t} = - \int \nabla_j (\rho \, u_i \, u^j) \, dV \,. \tag{C9} $$

Note that (C7) and (C9) must be equal no matter what volume $V$ we choose, so the integrands must be pointwise equal. This gives us an expression for the local conservation of momentum,

$$ \frac{\partial (\rho u_i)}{\partial t} = - \nabla_j (\rho \, u_i \, u^j) \,. \tag{C10} $$

We can understand this equation as follows: each component of the momentum-density $\rho u_i$ (for each $i$ separately) obeys a local conservation law. There are strong parallels between (C5) and (C10). Note that the $\nabla_j$ operator on the right-hand-side is differentiating two velocities ($u_i$ and $u^j$), only one of which undergoes dot-product summation (namely summation over $j$). Using vector component notation (such as $\nabla_j u^j$) is a bit less elegant than using pure vector notation (such as $\vec{\nabla} \cdot \vec{u}$), but in this case it makes things clearer.

We now consider the effect of pressure. It contributes a force on the particles in the control volume, namely

$$ F_i = - \int P \, dS_i = - \int \nabla_i P \, dV \,. \tag{C11} $$

A uniform gravitational field contributes another force, namely

$$ F_i = \int \rho \, g_i \, dV \,. \tag{C12} $$

These forces contribute to changing the momentum, by the second law of motion:

$$ \frac{d \Pi'_i}{dt} = F_i \,. \tag{C13} $$

Note the tricky notation: we write $d/dt$ rather than $\partial/\partial t$, and $\Pi'$ rather than $\Pi$, to remind ourselves that the three laws of motion apply to particles, not to the control volume itself. The rate-of-change of $\Pi$, the momentum in the control volume, contains the Newtonian contributions, (C11) and (C12) via (C13), plus the flow contributions (C10).

Combining all the contributions we obtain the main result, Euler's equation of motion:

$$ \frac{\partial (\rho u_i)}{\partial t} + \nabla_j (\rho \, u_i \, u^j) = - \nabla_i P + \rho \, g_i \,. \tag{C14} $$

One sometimes encounters other ways of expressing the same equation of motion. Rather than emphasizing the momentum, we might want to emphasize the velocity. This is not a conserved quantity, but sometimes it is easier to visualize and/or easier to measure. If we expand the left-hand-side we have

$$ \rho \frac{\partial u_i}{\partial t} + u_i \frac{\partial \rho}{\partial t} + u_i \nabla_j (\rho u^j) + \rho \, u^j \nabla_j u_i = \rho \, g_i - \nabla_i P \,, \tag{C15} $$

where the second and third terms cancel because of conservation of mass (C5), leaving us with

$$ \rho \frac{\partial u_i}{\partial t} + \rho \, u^j \nabla_j u_i = - \nabla_i P + \rho \, g_i \,. \tag{C16} $$

Converting from component notation to vector notation, we obtain

$$ \rho \frac{\partial \vec{u}}{\partial t} + \rho \left( \vec{u} \cdot \vec{\nabla} \right) \vec{u} = - \vec{\nabla} P + \rho \, \vec{g} \,. \tag{C17} $$

If we now consider a plane-parallel ($\partial/\partial y = 0$, $\partial/\partial z = 0$, $\partial/\partial x = d/dx$) steady-state ($\partial/\partial t = 0$) flow and we ignore gravity, (C5) and (C17) become

$$ \frac{d}{dx} (\rho u) = 0 \,, \tag{C18} $$

and

$$ u \frac{du}{dx} = - \frac{1}{\rho} \frac{dP}{dx} \,, \tag{C19} $$

respectively. (C18) immediately gives

$$ \rho u = \text{constant} \;\rightarrow\; \rho_1 u_1 = \rho_2 u_2 \,. \tag{C20} $$

Using

$$ \begin{aligned} \frac{d}{dx} (\rho u^2) &= 2 \rho u \frac{du}{dx} + u^2 \frac{d\rho}{dx} \\ &= \rho u \frac{du}{dx} + u \left( \rho \frac{du}{dx} + u \frac{d\rho}{dx} \right) \\ &= \rho u \frac{du}{dx} + u \frac{d}{dx} (\rho u) \\ &= \rho u \frac{du}{dx} \,, \end{aligned} \tag{C21} $$

(C19) can be rewritten as

$$ \rho u \frac{du}{dx} + \frac{dP}{dx} = \frac{d}{dx} (\rho u^2 + P) = 0 \,. \tag{C22} $$

This leads to

$$ \rho u^2 + P = \text{constant} \;\rightarrow\; \rho_1 u_1^2 + P_1 = \rho_2 u_2^2 + P_2 \,. \tag{C23} $$
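As a worked illustration of how the two conserved fluxes (C20) and (C23) connect two states of a steady plane-parallel flow, here is a minimal Python sketch. The function name `downstream_state` and the numerical values are hypothetical. The downstream speed u2 is supplied by hand as an assumption, because mass and momentum conservation alone do not determine it; an additional relation, such as an energy equation, would be needed to close the system.

```python
def downstream_state(rho1, u1, P1, u2):
    """Return (rho2, P2) for a steady 1D flow, given the upstream state
    (rho1, u1, P1) and an assumed downstream speed u2."""
    mass_flux = rho1 * u1                  # rho * u is constant, (C20)
    rho2 = mass_flux / u2
    momentum_flux = rho1 * u1**2 + P1      # rho * u^2 + P is constant, (C23)
    P2 = momentum_flux - rho2 * u2**2
    return rho2, P2


# Arbitrary units: the flow is slowed from u1 = 4 to u2 = 1.
rho2, P2 = downstream_state(rho1=1.0, u1=4.0, P1=1.0, u2=1.0)
print(rho2, P2)  # 4.0 13.0
```

Slowing the flow by a factor of four compresses it by the same factor, and the momentum flux lost by the bulk motion reappears as pressure.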

Appendix D: Kruskal coordinates

One elegant coordinate substitution is the replacement of $r$ and $t$ by the Kruskal coordinates $x$ and $y$, which are defined by the following two equations [316]

$$ x y = \left( \frac{r}{2M} - 1 \right) e^{r/(2M)} \tag{D1} $$

and

$$ x / y = e^{t/(2M)} \,. \tag{D2} $$

The angular coordinates $\theta$ and $\phi$ are kept the same. Hereafter, we adopt geometrodynamic units $G = c = 1$. By taking the logarithm of (D1) and (D2), and partially differentiating with respect to $x$ and $y$, we read off

$$ \frac{dx}{x} + \frac{dy}{y} = \frac{dr}{r - 2M} + \frac{dr}{2M} = \frac{dr}{2M (1 - 2M/r)} \tag{D3} $$

and

$$ \frac{dx}{x} - \frac{dy}{y} = \frac{dt}{2M} \,. \tag{D4} $$

FIG. 46: Left. The black hole in the Schwarzschild coordinates $(t, r)$. The event horizon is at $r = 2M$. Right. Kruskal coordinates. Here, the coordinates of the horizon are at $x = 0$ and at $y = 0$. The orientation of the local lightcones is indicated. Thin red lines are the time = constant lines in the physical part of spacetime [317].

The Schwarzschild metric (123) is now given by

$$ ds^2 = - 16 M^2 \left( 1 - \frac{2M}{r} \right) \frac{dx \, dy}{x y} - r^2 d\Omega^2 = - \frac{32 M^3}{r} \, e^{-r/(2M)} \, dx \, dy - r^2 d\Omega^2 \,. \tag{D5} $$

Note that, in the last expression, the zero and the pole at $r = 2M$ have cancelled out. The function $r(x, y)$ can be obtained by inverting the algebraic expression (D1) and is regular in the entire region $xy > -1$. In particular, nothing special seems to happen on the two lines $x = 0$ and $y = 0$. Apparently, there is no physical singularity or curvature singularity at $r \to 2M$. We do notice that the line $x = 0$, with $\theta$ and $\phi$ both constant, is lightlike, since two neighboring points on that line obey $dx = d\theta = d\phi = 0$, and this implies that $ds^2 = 0$, regardless of the value of $dy$. Likewise, the line $y = 0$ is lightlike. Indeed, we can also read off from the original expression (123) that if $r = 2M$, the lines with constant $\theta$ and $\phi$ are lightlike, as $ds^2 = 0$ regardless of the value of $dt$. The line $y = 0$ is called the future horizon and the line $x = 0$ is the past horizon.

Another important point to highlight is that (D2) attaches a real value to the time $t$ when $x$ and $y$ both have the same sign, as is the case in the region marked I in Fig. 46, but if $xy < 0$, as in region II, the coordinate $t$ gets an imaginary part. This means that region II is not part of our universe. Actually, $t$ does not serve as a time coordinate there, but as a space coordinate, since there $dt^2$ enters with a negative sign in the metric (123); $r$ is then the time coordinate.

Even if we restrict ourselves to the regions where $t$ is real, we find that, in general, every point $(r, t)$ in the physical region of spacetime is mapped onto two points in the $(x, y)$ plane: the points $(x, y)$ and $(-x, -y)$ are mapped onto the same point $(r, t)$. This leads to the picture of a black hole being a "wormhole" connecting our universe to another universe, or perhaps another region of the spacetime of our universe [318]. However, there are no timelike or lightlike paths connecting these two universes [319]. If this is a wormhole at all, it is a purely spacelike one.

Appendix E: Geometry of $S^3$ and $H^3$

Herein we provide a geometric interpretation of the hyper-sphere $S^3$ and the hyperbolic hyper-plane $H^3$. The explanation given herein builds upon the content of the exquisite book by Kolb and Turner [166].

We begin by studying the familiar two-dimensional surfaces. To visualize the two sphere it is convenient to introduce an extra fictitious spatial dimension and to embed this two-dimensional curved space in a three-dimensional Euclidean space with cartesian coordinates $x_1, x_2, x_3$. The equation of the two sphere $S^2$ of radius $R$ is

$$ x_1^2 + x_2^2 + x_3^2 = R^2 \,. \tag{E1} $$

The line element in the three-dimensional Euclidean space is

$$ ds^2 = dx_1^2 + dx_2^2 + dx_3^2 \,. \tag{E2} $$

If $x_3$ is taken as the fictitious third spatial coordinate, it can be eliminated from $ds^2$ by the use of (E1)

$$ ds^2 = dx_1^2 + dx_2^2 + \frac{(x_1 dx_1 + x_2 dx_2)^2}{R^2 - x_1^2 - x_2^2} \,. \tag{E3} $$

Now, introduce the coordinates $\varrho$ and $\theta$ defined in terms of $x_1$ and $x_2$ by

$$ x_1 = \varrho \cos\theta \quad \text{and} \quad x_2 = \varrho \sin\theta \,. \tag{E4} $$

Physically, $\varrho$ and $\theta$ correspond to polar coordinates in the $x_3$-plane; $x_3^2 = R^2 - \varrho^2$. In terms of the new coordinates, (E3) becomes

$$ ds^2 = \frac{R^2 \, d\varrho^2}{R^2 - \varrho^2} + \varrho^2 d\theta^2 \,. \tag{E5} $$

Note the similarity between this metric and the spatial hypersurface with $k = 1$ in (164).

Another convenient coordinate system for the two sphere is that specified by the usual polar and azimuthal angles $(\theta, \phi)$ of spherical coordinates, related to $x_i$ by

$$ x_1 = R \sin\theta \cos\phi \,, \quad x_2 = R \sin\theta \sin\phi \,, \quad x_3 = R \cos\theta \,. \tag{E6} $$

In terms of these coordinates, (E2) becomes

$$ ds^2 = R^2 \left[ d\theta^2 + \sin^2\theta \, d\phi^2 \right] \,. \tag{E7} $$

This form makes manifest the fact that the space is the two sphere of radius $R$.

The equivalent formulas for a space of constant negative curvature can be obtained with the replacement $R \to iR$ in (E1). The metric corresponding to the form of (E5) for the negative curvature case is

$$ ds^2 = \frac{R^2 \, d\varrho^2}{R^2 + \varrho^2} + \varrho^2 d\theta^2 \,, \tag{E8} $$

and the metric in the form corresponding to (E7) is

$$ ds^2 = R^2 \left[ d\theta^2 + \sinh^2\theta \, d\phi^2 \right] \,. \tag{E9} $$

The embedding of the hyperbolic plane $H^2$ in a Euclidean space requires three fictitious extra dimensions, and such an embedding is of little use in visualizing the geometry. While $H^2$ cannot be globally embedded in $\mathbb{R}^3$, it can be partially represented by the pseudosphere (112).

The generalization of the two-dimensional models discussed above to three spatial dimensions is trivial. For the three sphere $S^3$ a fictitious fourth spatial dimension is introduced, and in cartesian coordinates the three sphere is defined by $R^2 = x_1^2 + x_2^2 + x_3^2 + x_4^2$. The spatial metric of a four-dimensional Euclidean space is $ds^2 = dx_1^2 + dx_2^2 + dx_3^2 + dx_4^2$. The fictitious coordinate can be removed to give

$$ ds^2 = dx_1^2 + dx_2^2 + dx_3^2 + \frac{(x_1 dx_1 + x_2 dx_2 + x_3 dx_3)^2}{R^2 - x_1^2 - x_2^2 - x_3^2} \,. \tag{E10} $$

In terms of coordinates $x_1 = \varrho \sin\theta \cos\phi$, $x_2 = \varrho \sin\theta \sin\phi$, $x_3 = \varrho \cos\theta$, the metric is given by the spatial part of (164) with $k = 1$. In terms of a coordinate system that employs the three angular coordinates $(\chi, \theta, \phi)$ of a four-dimensional spherical coordinate system, $x_1 = R \sin\chi \sin\theta \cos\phi$, $x_2 = R \sin\chi \sin\theta \sin\phi$, $x_3 = R \sin\chi \cos\theta$, $x_4 = R \cos\chi$, the metric is given by

$$ ds^2 = R^2 \left[ d\chi^2 + \sin^2\chi \left( d\theta^2 + \sin^2\theta \, d\phi^2 \right) \right] \,. \tag{E11} $$

The substitution $\chi = r/R$ leads to the spatial part of (166) with $k = 1$.

As in the two-dimensional example, the three-dimensional open model is obtained by the replacement $R \to iR$, which gives the metric in the form (164) with $k = -1$, or in the form (166) with $\sin\chi \to \sinh\chi$. Again the space is unbounded and $R$ sets the curvature scale. Embedding $H^3$ in a Euclidean space requires four fictitious extra dimensions.

Appendix F: Dirac Delta Function

Dirac's delta function is defined by the following property

$$ \delta(t) = \begin{cases} 0 & t \neq 0 \\ \infty & t = 0 \end{cases} \,, \tag{F1} $$

with

$$ \int_{t_1}^{t_2} dt \, \delta(t) = 1 \tag{F2} $$

if $0 \in [t_1, t_2]$ (and zero otherwise). It is "infinitely peaked" at $t = 0$, with a total area of unity. You can view this function as the limit of a Gaussian

$$ \delta(t) = \lim_{\sigma \to 0} \frac{1}{\sqrt{2\pi} \, \sigma} \, e^{-t^2/(2\sigma^2)} \,, \tag{F3} $$

or of a Lorentzian

$$ \delta(t) = \lim_{\epsilon \to 0} \frac{1}{\pi} \frac{\epsilon}{t^2 + \epsilon^2} \,. \tag{F4} $$

The important property of the delta function is the following relation

$$ \int dt \, f(t) \, \delta(t) = f(0) \,, \tag{F5} $$

which is valid for any test function $f(t)$ that is bounded and differentiable to any order, and which vanishes outside a finite range. This is easy to see. First of all, $\delta(t)$ vanishes everywhere except $t = 0$. Therefore, it does not matter what values the function $f(t)$ takes except at $t = 0$. You can then say $f(t) \, \delta(t) = f(0) \, \delta(t)$. Then $f(0)$ can be pulled outside the integral because it does not depend on $t$, and you obtain the right-hand-side. This equation can easily be generalized to

$$ \int dt \, f(t) \, \delta(t - t_0) = f(t_0) \,. \tag{F6} $$

Mathematically, the delta function is not a function, because it is "too singular." Instead, it is said to be a "distribution." It is a generalized idea of functions, but can be used only inside integrals. In fact, $\int dt \, \delta(t)$ can be regarded as an "operator" which pulls out the value of a test function at zero. Put this way, it sounds perfectly legitimate and well-defined. But as long as it is understood that the delta function is eventually integrated, we can use it as if it is a function.

The step (Heaviside) function,

$$ \Theta(x) = \begin{cases} 1 & x \geq 0 \\ 0 & x < 0 \end{cases} \,, \tag{F7} $$

is the "primitive" (at least in symbolic form) of $\delta(x)$. Equivalently, $\Theta'(x)$ has the symbolic limit $\delta(x)$, as we show next. For any given test function $f(x)$, integration by parts leads to

$$ \int_{-\infty}^{+\infty} \Theta'(x) \, f(x) \, dx = - \int_{-\infty}^{+\infty} \Theta(x) \, f'(x) \, dx = - \int_0^{\infty} f'(x) \, dx = f(0) \,; \tag{F8} $$

therefore $\Theta'(x) = \delta(x)$.
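The sifting property (F6) can be checked numerically by replacing the delta function with the Gaussian of (F3) at a small but finite width. The following is a minimal Python sketch under stated assumptions: the function names `gaussian_delta` and `sift`, the trapezoidal quadrature, the integration window of ten widths on either side of $t_0$, and the test function $\cos t$ are all illustrative choices, not part of the original text.

```python
import math


def gaussian_delta(t, sigma):
    """Finite-width Gaussian approximation to delta(t), as in (F3)."""
    return math.exp(-t * t / (2.0 * sigma * sigma)) / (math.sqrt(2.0 * math.pi) * sigma)


def sift(f, t0, sigma, half_width=10.0, n=20001):
    """Trapezoidal estimate of the integral of f(t) * delta_sigma(t - t0)
    over the window t0 - half_width*sigma ... t0 + half_width*sigma."""
    a = t0 - half_width * sigma
    h = 2.0 * half_width * sigma / (n - 1)
    total = 0.0
    for k in range(n):
        t = a + k * h
        weight = 0.5 if k in (0, n - 1) else 1.0  # trapezoid end-point weights
        total += weight * f(t) * gaussian_delta(t - t0, sigma)
    return total * h


# The estimate should approach f(t0) = cos(0.3) as sigma shrinks.
for sigma in (0.1, 0.01, 0.001):
    print(sigma, sift(math.cos, 0.3, sigma), math.cos(0.3))
```

As the width decreases, the printed estimates approach $\cos(0.3)$, in line with (F6); at finite width the residual difference reflects the smearing of $f$ over the width of the Gaussian.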