Lectures on Astronomy, Astrophysics, and Cosmology
Luis A. Anchordoqui Department of Physics and Astronomy, Lehman College, City University of New York, NY 10468, USA Department of Physics, Graduate Center, City University of New York, 365 Fifth Avenue, NY 10016, USA Department of Astrophysics, American Museum of Natural History, Central Park West 79 St., NY 10024, USA (Dated: Spring 2016) This is a written version of a series of lectures aimed at undergraduate students in astrophysics/particle theory/particle experiment. We summarize the important progress made in recent years towards understanding high energy astrophysical processes and we survey the state of the art regarding the concordance model of cosmology.
I. ACROSS THE UNIVERSE changed after Galileo’s first telescopic observations: we no longer place ourselves at the center and we view the A look at the night sky provides a strong impression of universe as vastly larger [1–3]. a changeless universe. We know that clouds drift across In the early 1600s, Kepler proposed three laws that the Moon, the sky rotates around the polar star, and on described the motion of planets in a sun-centered solar longer times, the Moon itself grows and shrinks and the system [4]. The laws are: Moon and planets move against the background of stars. 1. Planets orbit the Sun in ellipses, with the Sun in Of course we know that these are merely local phenom- one of the two focuses. ena caused by motions within our solar system. Far beyond the planets, the stars appear motionless. Herein 2. The line connecting the Sun and a planet sweeps we are going to see that this impression of changeless- out equal area in equal time. ness is illusory. 3. The harmonic law states the squared orbital period of planets measured in years equals to the third A. Nani gigantum humeris insidentes powerT of their major axis measured in astronomical units, ( /yr)2 = (a/AU)3. T According to the ancient cosmological belief, the stars, Newton used later the harmonic law to derive the 1/r2 except for a few that appeared to move (the planets), dependence of the gravitational force [5]. We will fol- where fixed on a sphere beyond the last planet; see Fig. 1. low the opposite way and discuss how Kepler’s laws The universe was self contained and we, here on Earth, follow from Newton’s law for gravitation. We begin by were at its center. Our view of the universe dramatically recalling how a two-body problem can be reduced to a one-body problem in the case of a central force. Denot- ing the position and the masses of the two objects by mi and ri, with i = 1, 2 the equations of motion are found to be
m ~r¨ = f ( ~r ~r )(~r r ) , (1) 1 1 − | 1 − 2| 1 − 2 and
m ~r¨ = + f ( ~r ~r )(~r r ) . (2) 2 2 | 1 − 2| 1 − 2 In other words, the center-of-mass (c.m.) of the system arXiv:0706.1988v3 [physics.ed-ph] 15 Jun 2016 m ~r + m ~r R~ = 1 1 2 2 . (3) m1 + m2
moves freely. Now, multiplying (1) by m2 and (2) by m1 and substracting the two equation we obtain
µ~r¨ = f (r)~r , (4)
where
m1m2 FIG. 1: Celestial spheres of ancient cosmology. µ = . (5) m1 + m2 2
We can then solve a one-body problem for the reduced Since rˆ is a unit vector, we have rˆ rˆ = 1 and d(rˆ rˆ)/dt = 0, · · mass µ moving with the distance r = ~r1 ~r2 in the hence gravitational field of the mass M = m + m| . − | 1 2 drˆ We can now derive the second law (a.k.a. the area ~a ~L = GMµ . (14) law). Consider the movement of a body under the influ- × dt ence of a central force (4). Since ~r ~r = 0, the vectorial Since ~L and GMµ are constant, we can write this as multiplication of (4) by ~r leads to × d ~ d µ~r ~r¨ = 0 , (6) (~v L) = (GMµrˆ) . (15) × dt × dt that looks already similar to a conservation law. Since Integration of (15) leads to
d ~ ~ ~ (~r ~r˙) = ~r˙ ~r˙ + ~r ~r¨ , (7) v L = GMµrˆ + C , (16) dt × × × × where the integration constant C~ is a constant vector. the first term in the right-hand-side is zero and we obtain Taking now the dot product with ~r, we have the conservation of angular momentum ~L = µ~r ~r˙ for × the motion in a cental potential ~r (~v ~L) = GMµrrˆ rˆ + ~r C~ . (17) · × · · d d µ~r ~r¨ = (µ~r ~r˙) = ~L = 0 . (8) Applying next the identity A~ (B~ C~) = (A~ B~) C~, it × dt × dt follows · × × · There are two immediate consequences: First, the mo- (~r ~v) ~L = GMµr + rC cos ϑ tion is always in the plane perpendicular to ~L. Second, × · ! C cos ϑ the area swept out by the vector ~r is = GMµr 1 + , (18) GMµ 1 1 ~ ~ ~ ~ dA = r v dt = dL , (9) where ϑ is the angle between ~r and C~. Expressing ~r ~v 2 × 2µ × as ~L/µ, defining e = C/(GM) and solving for r, we obtain and thus also constant. finally the equation for a conic section, which is Kepler’s We now turn to demonstrate the first law. We intro- first law: duce the unit vector rˆ = ~r/r and rewrite the definition of ~ L2/µ2 the angular momentum L as r = . (19) GM(1 + e cos ϑ) d ~L = µ~r ~r˙ = µrrˆ (rrˆ) × × dt Using (A3) we obtain angular momentum ! drˆ drˆ p = µrrˆ r˙rˆ + r = µr2rˆ . (10) L = µ GMa(1 e2) . (20) × dt × dt − To obtain the harmonic law we integrate the second The first term in the parenthesis vanishes, because of rˆ law in the form of (9) over one orbital period , rˆ = 0. Next we take the cross product of the gravitational× T acceleration, L A = πab = . (21) 2µT GM ~a = rˆ , (11) − r2 Squaring and solving for , it follows T with the angular momentum (abµ)2 2 = 4π2 . (22) ! 2 GM drˆ T L ~a ~L = rˆ µr2rˆ × − r2 × × dt Using (A1) and (20) for the angular momentum L, we ! drˆ obtain Kepler’s harmonic law, = GMµrˆ rˆ , (12) − × × dt 4π2 2 = a3 . (23) 11 2 2 T G(m1 + m2) where G = 6.674 10− N m kg− [6]. The identity from × vector analysis, A~ (B~ C~) = (A~ C~)B~ (A~ B~)C~, leads to × × · − · " ! # EXERCISE 1.1 The planet Neptune, the most distant drˆ drˆ gas giant from the Sun, orbits with a semimajor axis ~a ~L = GMµ rˆ rˆ (rˆ rˆ) . (13) × − · dt − · dt a = 30.066 AU and an eccentricity e = 0.01. Pluto, the 3 next large world out from the Sun (though much smaller about 4.2 ly away. Therefore, the nearest star is 10,000 than Neptune) orbits with a = 39.48 AU and e = 0.250. times farther from us that the outer reach of the solar (i) To correct number of significant figures given the system. precision of the data in this exercise, how many years does it take Neptune to orbit the Sun? (ii) How many years does it take Pluto to orbit the Sun? (iii) Take the B. Stars and galaxies ratio of the two orbital periods you calculated in parts (i) and (ii). You will see that it is very close to the ratio On clear moonless nights, thousands of stars with of two small integers; which integers are these? Thus varying degrees of brightness can be seen, as well as the two planets regularly come close to one another, the long cloudy strip known as the Milky Way. Galileo in the same part of their orbits, which allows them to first observed with his telescope that the Milky Way is have a maximum gravitational influence on each other’s comprised of countless numbers of individual stars. A orbits. This is an example of an orbital resonance (other half century later Wright suggested that the Milky Way examples in the solar system can be found among the was a flat disc of stars extending to great distances in a moons of Jupiter, and between the moons and various plane, which we call the Galaxy [7]. features of the rings of Saturn). (iv) What is the aphelion Our Galaxy has a diameter of 100,000 ly and a thick- distance of Neptune’s orbit? Express your answer in ness of roughly 2,000 ly. It has a bulging central nucleus AU. (v) What are the perihelion and aphelion distances and spiral arms. Our Sun, which seems to be just an- of Plutos orbit? Is Pluto always farther from the Sun other star, is located half way from the Galactic center than Neptune? to the edge, some 26, 000 ly from the center. The Sun orbits the Galactic center approximately once every 250 EXERCISE 1.2 A satellite in geosynchronous orbit million years or so, so its speed is (GEO) orbits the Earth once every day. A satellite in geostationary orbit (GSO) is a satellite in a circular 2π 26, 000 1013 km v = × = 200 km/s . (25) GEO in the Earth’s equatorial plane. Therefore, from 2.5 108 yr 3.156 107 s/yr the point of view of an observer on Earth’s surface, a × × satellite in GSO seems always to hover in the same point The total mass of all the stars in the Galaxy can be esti- in the sky. For example, the satellites used for satellite mated using the orbital data of the Sun about the center TV are in GSO so that satellite dishes can be stationary of the Galaxy. To do so, assume that most of the mass and need not track their motion through the sky. Take is concentrated near the center of the Galaxy and that a look; you will notice all satellite dishes on people’s the Sun and the solar system (of total mass m) move in houses point towards the Equator, that is South. How a circular orbit around the center of the Galaxy (of total far above Earth’s equator (i.e., above the Earth’s surface) mass M), is a satellite in GSO? Express your answer in kilometers, GMm v2 and in Earth radii. = m , (26) r2 r EXERCISE 1.3 The space station Mir traveled 3.6 where a = v2/r is the centripetal acceleration. All in all, billion kilometers during its life. Its circular orbit was 200 km above the surface of the Earth. (i) How many r v2 M = 2 1041 kg . (27) years was it in orbit? (ii) How many times did Mir circle G ≈ × the Earth per day (i.e., 24 hours)? (iii) Can you put a satellite into such an orbit that it circles the Earth 20 Assuming all the stars in the Galaxy are similar to our M 30 times per day? Sun ( 2 10 kg), we conclude that there are roughly 10≈11 stars× in the Galaxy. The astronomical distances are so large that we specify In addition to stars both within and outside the Milky them in terms of the time it takes the light to travel a given Way, we can see with a telescope many faint cloudy patches in the sky which were once all referred to as distance. For example, one light second = 3 108m = nebulae (Latin for clouds). A few of these, such as those in 300, 000 km, one light minute = 1.8 107 km,× and one light year × the constellations of Andromeda and Orion, can actually be discerned with the naked eye on a clear night. In the 1 ly = 9.46 1015 m 1013 km. (24) XVII and XVIII centuries, astronomers found that these × ≈ objects were getting in the way of the search for comets. For specifying distances to the Sun and the Moon, we In 1781, in order to provide a convenient list of objects not usually use meters or kilometers, but we could spec- to look at while hunting for comets, Messier published ify them in terms of light. The Earth-Moon distance is a celebrated catalogue [8]. Nowadays astronomers still 384,000 km, which is 1.28 ls. The Earth-Sun distance is refer to the 103 objects in this catalog by their Messier 150, 000, 000 km; this is equal to 8.3 lm. Far out in the numbers, e.g., the Andromeda Nebula is M31. solar system, Pluto is about 6 109 km from the Sun, or Even in Messier’s time it was clear that these extended 4 × 6 10− ly. The nearest star to us, Proxima Centauri, is objects are not all the same. Some are star clusters, × 4 groups of stars which are so numerous that they ap- peared to be a cloud. Others are glowing clouds of gas or dust and it is for these that we now mainly reserve the word nebula. Most fascinating are those that belong to a third category: they often have fairly regular ellip- tical shapes and seem to be a great distance beyond the Galaxy. Kant seems to have been the first to suggest that these latter might be circular discs, but appear elliptical because we see them at an angle, and are faint because they are so distant [9]. At first it was not universally accepted that these objects were extragalactic (i.e. out- side our Galaxy). The very large telescopes constructed in the XX century revealed that individual stars could be resolved within these extragalactic objects and that many contain spiral arms. Hubble did much of this ob- servational work in the 1920’s using the 2.5 m telescope on Mt. Wilson near Los Angeles, California. Hubble demostrated that these objects were indeed extragalac- FIG. 2: The parallax method of measuring a star’s distance. tic because of their great distances [10]. The distance to our nearest spiral galaxy, Andromeda, is over 2 million ly, a distance 20 times greater than the diameter of our or about 15 ly. Galaxy. It seemed logical that these nebulae must be Distances to stars are often specified in terms of paral- galaxies similar to ours. Today it is thought that there lax angles given in seconds of arc: 1 second (1”) is 1/60 are roughly 4 1010 galaxies in the observable universe – × of a minute (1’) of arc, which is 1/60 of a degree, so 1” that is, as many galaxies as there are stars in the Galaxy. = 1/3600 of a degree. The distance is then specified in parsecs (meaning parallax angle in seconds of arc), where the parsec is defined as 1/φ with φ in seconds. For ex- II. DISTANCE MEASUREMENTS 5 ample, if φ = 6 10− ◦, we would say the the star is at a distance D = 4.5× pc. We have been talking about the vast distance of the The angular resolution of the Hubble Space Telescope objects in the universe. We now turn to discuss different (HST) is about 1/20 arcs. With HST one can measure methods to estimate these distances. parallaxes of about 2 milli arc seconds (e.g., 1223 Sgr). This corresponds to a distance of about 500 pc. A. Stellar Parallax Besides, there are stars with radio emission for which observations from the Very Long Baseline Array (VLBA) allow accurate parallax measurements beyond 500 pc. One basic method to measure distances to nearby stars For example, parallax measurements of Sco X-1 are employs simple geometry and stellar parallax. Parallax 0.36 0.04 milli arc seconds which puts it at a distance of is the apparent displacement of an object because of a 2.8 kpc.± Parallax can be used to determine the distance change in the observer’s point of view. One way to see to stars as far away as about 3 kpc from Earth. Beyond how this effect works is to hold your hand out in front that distance, parallax angles are two small to measure of you and look at it with your left eye closed, then and more subtle techniques must be employed. your right eye closed. Your hand will appear to move against the background. By stellar parallax we mean EXERCISE 2.1 One of the first people to make a very the apparent motion of a star against the background accurate measurement of the circumference of the Earth of more distant stars, due to Earth’s motion around the was Eratosthenes, a Greek philosopher who lived in Sun; see Fig. 2. The sighting angle of a star relative Alexandria around 250 B.C. He was told that on a cer- to the plane of Earth’s orbit (usually indicated by ) θ tain day during the summer (June 21) in a town called can be determined at two different times of the year Syene, which was 4900 stadia (1 stadia = 0.16 kilometers) separated by six months. Since we know the distance d to the south of Alexandria, the sunlight shown directly from the Earth to the Sun, we can determine the distance down the well shafts so that you could see all the way to D to the star. For example, if the angle of a given θ the bottom. Eratosthenes knew that the sun was never star is measured to be 89 99994 the parallax angle is . ◦, quite high enough in the sky to see the bottom of wells p = 0 00006 From trigonometry, tan = d D, and φ . ◦. φ / in Alexandria and he was able to calculate that in fact since≡ the distance to the Sun is d = 1 5 108 km the . it was about 7 degrees too low. Knowing that the sun distance to the star is × was 7 degrees lower at its highpoint in Alexandria than d d 1.5 108 km in Syene and assuming that the sun’s rays were paral- D = = × = 1.5 1014 km , (28) tan φ ≈ φ 1 10 6 × lel when they hit the Earth, Eratosthenes was able to × − 1 Continuous radiation from stars 5
emitted by a star is found to be
Z Z 3 ∞ 2π ∞ x dx F d B kT 4 T4 dΩ dΩ = π ν ν = 2 3 ( ) x = σ , (32) 0 c h 0 e 1 ϑ ϑ✓ − where x = hν/(kT), dA dA 2π5k4 erg cos ϑdA coscos✓dAϑdA 5 σ = 2 3 = 5.670 10− 2 4 (33) Figure 1.3: Left: A detector with surface element dA on Earth measuring radiation coming 15c h × cm K s FIG. 3:fromLeft. a directionA detector with zenith with angle ϑ surface. Right: An element imaginarydA detectoron onEarth the surface of a star measuring radiation emitted in the direction ϑ. is the Stefan-Boltzmann constant [14, 15], and where we measuring radiation coming from a direction with zenith angle R 3 x 1 4 used ∞ x [e 1]− dx = π /15. ϑ (left). Right. An imaginary detector of area dA on the surface 0 − Theof a Kirchho star measuringff-Planck distribution radiation contains emitted as its two in thelimiti directionng cases Wien’sθ [16]. law for high- A useful parameter for a star or galaxy is its luminosity. frequencies, hν kT, and the Rayleigh-Jeans law for low-frequencies hν kT.Inthe ≫ ≪ former limit, x = hν/(kT) 1, and we can neglect the 1 in the denominator of the PlanckThe total luminosity L of a star is given by the product ≫ − function, of its surface area and the radiation emitted per area 2hν3 calculate the circumferenceBν 2 exp( of thehν/kT Earth) . using a simple (1.10) ≈ c − L = 4πR2σT4 . (34) Thusproportion: the number of photons C/4900 with stadia energy h=ν much360 larger degrees than kT/ 7is degrees. exponentially This suppressed. In thegives opposite an limit, answerx = hν of/(kT 252,000) 1, and stadia ex 1=(1+ or 40,320x ...) km,1 x. which Hence Planck’s ≪ − − − ≈ Careful analyses of nearby stars have shown that the constantis veryh disappears close to from today’s the expression measurements for Bν,iftheenergy ofh 40,030ν of a single km. photon As- is small compared to the thermal energy kT and one obtains, absolute luminosity for most of the stars depends on the sume the Earth is flat and determine the parallax angle 2ν2kT mass: the more massive the star, the greater the luminosity. that can explain this phenomenon.B . Are the results con- (1.11) ν ≈ c2 Consider a thick spherical source of radius R, with sistent with the hypothesis that the Earth is flat? The Rayleigh-Jeans law shows up as straight lines left from the maxima of Bν in Fig. 1.4. constant intensity along the surface, say a star. An ob- server at a distance r sees the spherical source as a disk 1.3.2 Wien’s displacement law of angular radius ϑ = R/r. Note that since the source is B. Stellar luminosity c We note from Fig. 1.4 two important properties of Bν: Firstly, Bν as function of the frequencyoptically thick the observer only sees the surface of the ν has a single maximum. Secondly, Bν as function of the temperature T is a monotonicallysphere. Because the intensity is constant over the surface increasing function for all frequencies: If T1 >T2,thenBν(T1) >Bν(T2) for all ν. Both propertiesIn 1900, follow directly Planck from found taking the empirically derivative with therespect distribution to ν and T . In the formerthere is a symmetry along the ϕ direction such that the c2 case, we look for the maximum of f(ν)= 2h Bν as function of ν.Hencewehavetofindthesolid solid angle is given by dΩ = 2π sin ϑdϑ. By looking " ! # 1 zeros of f ′(ν), 3 − 2hν hν hν at Fig. 3 it is straightforward to see that the flux observed Bν 3(edνx =1) x expxexp=0 with 1x = dν. (29) (1.12) − −c2 kT − kT at r is given by The equation ex(3 x) = 3 has to be solved numerically and has the solution x 2.821. − ≈ Thus the intensity of thermal radiation is maximal for x 2.821 = hν /(kT) or Z Z ϑc describing the amount of energy emittedmax ≈ intomax the fre- F(r) = I cos ϑdΩ = 2πI sin ϑ cos ϑdϑ cT νmax quency interval [0ν,.50K ν+ cmdν] and or the solid5.9 angle1010Hz/dKΩ. per unit (1.13) 0 time and areaνmax by≈ a body in thermalT ≈ equilibrium× [11]. The 2 0 2 2 = πI cos ϑ = πI sin ϑc = πI(R/r) . (35) intrinsic (or surface) brightness Bν depends only on the ϑc 14 temperature T of the blackbody (apart from the natural k c h B At the surface of the star R = r and we recover (32). Very constants , and ). The dimension of ν in the cgs 2 system of units is far away, r R, and (35) yields F = πϑc I = IΩsource; see Appendix B. The validity of the inverse-square law erg F 1/r2 at a distance r > R outside of the star relies [B ] = . (30) ∝ ν Hz cm2 s sr on the assumptions that no radiation is absorbed and that relativistic effects can be neglected. The later con- In general the amount of energy per frequency interval dition requires in particular that the relative velocity of [ν, ν + dν] and solid angle dΩ crossing the perpendicular observer and source is small compared to the speed of area A per time is called the specific (or differential) light. All in all, the total (integrated) flux at the surface ⊥ intensity [12] of the Earth from a given astronomical object with total luminosity L is found to be dE I = ; (31) ν dνdΩdA dt L Fobserved @ Earth = = , (36) ⊥ F 4πd2 see Fig. 3. For the special case of the blackbody radiation, L the specific intensity at the emission surface is given by where dL is the distance to the object. the Planck distribution, Iν = Bν. Stars are fairly good Another important parameter of a star is its surface approximations of blackbodies. temperature, which can be determined from the spec- Integrating (29) over all frequencies and possible solid trum of electromagnetic frequencies it emits. The wave- angles gives the emitted flux F per surface area A. The an- length at the peak of the spectrum, λmax, is related to the gular integral consists of the solid angle dΩ = sin θdθdφ temperature by Wien’s displacement law [17] and the factor cos θ taking into account that only the 3 perpendicular area A = A cos θ is visible [13]. The flux λmaxT = 2.9 10− m K . (37) ⊥ × 6
We can now use Wien’s law and the Steffan-Boltzmann rays. What is (i) the observed flux from the Sun equation (power output or luminosity AT4) to deter- and (ii) its absolute luminosity L . (iii) What is theF mine the temperature and the relative size∝ of a star. Sup- average Solar flux density measured at Mars? (iv) If the pose that the distance from Earth to two nearby stars approximate efficiency of the solar panels (with area of can be reasonably estimated, and that their apparent lu- 1.3 m2) on the Martian rover Spirit is 20%, then how minosities suggest the two stars have about the same many Watts could the fully illuminated panels generate? absolute luminosity, L. The spectrum of one of the stars peaks at about 700 nm (so it is reddish). The spectrum of EXERCISE 2.3 Suppose the MESSENGER spacecraft, the other peaks at about 350 nm (bluish). Using Wien’s while orbiting Mercury, decided to communicate with law, the temperature of the reddish star is Tr 4140 K. the Cassini probe, now exploring Saturn and its moons. ' The temperature of the bluish star will be double because When Mercury is closest to Saturn in their orbits, it its peak wavelength is half, Tb 8280 K. The power ra- takes 76.3 minutes for the radio signals from Mercury ' diated per unit of area from a star is proportional to the to reach Saturn. A little more than half a mercurian fourth power of the Kelvin temperature (34). Now the year later, when the 2 planets are furthest apart in their temperature of the bluish star is double that of the redish orbits, it takes 82.7 minutes. (i) What is the distance star, so the bluish must radiate 16 times as much energy between Mercury and the Sun? Give answers in both per unit area. But we are given that they have the same light-minutes and astronomical units. Assume that the luminosity, so the surface area of the blue star must be planets have circular orbits. (ii) What is the distance 1/16 that of the red one. Since the surface area is 4πR2, between Saturn and the Sun? we conclude that the radius of the redish star is 4 times larger than the radius of the bluish star (and its volume EXERCISE 2.4 The photometric method to search for 64 times larger) [18]. extrasolar planets is based on the detection of stellar An important astronomical discovery, made around brightness variations, which result from the transit of a 1900, was that for most of the stars, the color is related planet across a star’s disk. If a planet passes in front of to the absolute luminosity and therefore to the mass. a star, the star will be partially eclipsed and its light will A useful way to present this relationship is by the so- be dimmed. Determine the reduction in the apparent called Hertzsprung-Russell (HR) diagram [19]. On the surface brightness I when Jupiter passes in front of the HR diagram, the horizontal axis shows the temperature Sun. T, whereas the vertical axis the luminosity L, each star is represented by a point on the diagram shown in Fig. 4. EXERCISE 2.5 The angular resolution of a telescope Most of the stars fall along the diagonal band termed the (or other optical system) is a measure of the smallest main sequence. Starting at the lowest right, we find the details which can be seen. Because of the distorting coolest stars, redish in color; they are the least luminous effects of earth’s atmosphere, the best angular resolution and therefore low in mass. Further up towards the left which can be achieved by optical telescopes from earth’s we find hotter and more luminous stars that are whitish surface is normally about 1 arcs. This is why much like our Sun. Still farther up we find more massive and clearer images can be obtained from space. The angular more luminous stars, bluish in color. There are also stars resolution of the HST is about 0.05 arcs, and the smallest that fall outside the main sequence. Above and to the angle that can be measured accurately with HST is right we find extremely large stars, with high luminosity actually a fraction of one resolution element. (i) Cepheid but with low (redish) color temperature: these are called variable stars are very important distance indicators red giants. At the lower left, there are a few stars of low because they have large and well-known luminosities. luminosity but with high temperature: these are white What is the distance of a Cepheid variable star whose dwarfs. parallax angle is measured to be 0.005 0.001 arcs? Suppose that a detailed study of a certain star suggests (ii) The faintest stars that can be detected with± the HST that it most likely fits on the main sequence of the HR have apparent brightnesses which are 4 1021 times 12 2 × diagram. The observed flux is = 1 10− W m− , and fainter than the Sun. How far away could a star like F × the peak wavelength of its spectrum is λmax 600 nm. the Sun be, and still be detected with the HST? Express We can first find the temperature using Wien’s≈ law your answer in light years. (iii) How far away could a and then estimate the absolute luminosity using the Cepheid variable with 20,000 times the luminosity of HR diagram; namely, T 4800 K. A star on the main the Sun be, and still be detected with the HST? Express sequence of the HR diagram≈ at this temperature has your answer in light years. absolute luminosity of about L 1026 W. Then, using ≈ 18 (36) we can estimate its distance from us, dL = 3 10 m EXERCISE 2.6 × The discovery of the dwarf planet or equivalently 300 ly. Eris in 2005 threw the astronomical community into a tizzy and made international headlines; it is slightly EXERCISE 2.2 About 1350 J of energy strikes the larger than Pluto and brought up interesting questions atmosphere of the Earth from the Sun per second about what the definition of a planet is. Eventually, per square meter of area at right angle to the Sun’s this resulted in the controversial demotion of Pluto AST 250 Spring 2010 HOMEWORK #5 Due Friday March 26
(1) Develop you own mnemonic for the modern stellar spectral sequence: O B A F G K M L T Y. Be creative! I’ll read a few in class.
(2) Look up the spectral types of the following stars (the primary stars if it is a binary) and order them by (a) effective temperature and (b) luminosity: Sun, Sirius, Betlegeuse, Aldebaran, and Barnard’s Star. (N.B. don’t just look up Teff and L. Understand the ordering based on spectral type. There could be a similar question on the exam).
(3) Estimate the mass of main sequence stars with twice the luminosity of the Sun and with half the luminosity of the Sun. What is the dominant nucleosynthesis process in the cores of these stars?
(4) Calculate the Schwarzchild radius for a star the mass of the Sun.
(5) (a) The Hertzsprung-Russell diagram is usually plotted in logarithmic coordinates (log L vs. log Teff with temperature increasing to the left). Mathematically derive the slope of a line of constant radius in the logarithmic H-R diagram. (b) Order the stars in problem 2 by stellar radii.
7
FIG. 4: HR diagram. The vertical axis depicts the inherent brightness of a star, and the horizontal axis the surface temperature increasing from right to left [20].
from the 9th planet of the Solar System to just one of a amount of sunlight reflected by Eris per unit time (i.e., number of dwarf planets. Throughout, assume that Eris its luminosity in reflected light); express your answer is spherical and is observed at opposition (i.e., the Earth in terms of d, r, the albedo a, and the luminosity of lies on the straight line connecting the Sun and Eris). the Sun L . (iii) We detect only a tiny fraction of this (i) In five hours, Eris is observed to move 7.5 arcseconds light reflected by Eris. Calculate the brightness, via the relative to the background stars as seen from Earth. inverse square law, of Eris as perceived here on Earth. 16 Because Eris is much further from the Sun than is the (iv) The measured brightness of Eris is 2.4 10− Joules 2 1 × Earth, it is moving quite a bit slower around the Sun meters− second− . Use this information to determine than the Earth, so this apparent motion on the sky is the radius r of Eris. This is what led to the controversy essentially entirely parallax due to the Earth’s motion. of what a planet is: if Pluto is considered a planet, then Calculate the speed with which the Earth goes around certainly Eris should be as well. We further elaborate the Sun, in kilometers/second. Use this information and about this controversy in the solution of the exercise. the small-angle formula to calculate the distance from (v) Calculate the angular size of Eris (i.e., the angle the the Earth to Eris. Express your result in AU. Compare diameter of Eris makes on the sky). Compare this to with the semi-major axis of Pluto’s orbit (which you will the resolution of the HST; will you be able to resolve need to look up). (ii) Eris shines in two ways: from its Eris (i.e., will it look like a point of light or a finite-size reflected light from the Sun (which will be mostly visible object in a telescope)? (vi) You go ahead and observe light), and from its blackbody radiation from absorbed Eris with Hubble, and find that it has a moon orbiting it. sunlight (which will mostly come out as infrared light). Observations with Hubble show that this moon (called The albedo of Eris (i.e., the fraction of the sunlight Dysnomia) makes an almost circular orbit around Eris incident on Eris that is reflected) is very high, about with a period of 15.8 Earth days. The semi-major axis 85%. This suggests that Eris is covered by a layer of of the orbit subtends an angle of 0.5300 as seen from shiny ice; spectroscopy tells us that the ice is composed Earth. Calculate the semi-major axis in kilometers, and of frozen methane, CH4. Derive an expression for calculate the mass of Eris in kilograms. Compare with the brightness of Eris which depends on its distance the mass of Pluto (1.3 1022 kg). Is Eris more massive? from the Sun d, and its radius r. First, calculate the × 8
EXERCISE 2.7 A perfect blackbody at temperature T makes use of a familiar property of any sort of wave has the shape of an oblate ellipsoid, its surface being motion, known as Doppler effect [21]. given by the equation When we observe a sound or light wave from a source at rest, the time between the arrival wave crests at our x2 y2 z2 + + = 1 , (38) instruments is the same as the time between crests as they a2 a2 b2 leave the source. However, if the source is moving away with a > b. (i) Is the luminosity of the blackbody from us, the time between arrivals of successive wave isotropic? Why? (ii) Consider an observer at a distance crests is increased over the time between their departures from the source, because each crest has a little farther to dL from the blackbody, with dL a. What is the direc- tion of the observer for which the maximum amount of go on its journey to us than the crest before. The time between crests is just the wavelength divided by the flux will be observed (keeping the distance dL fixed)? Calculate what this maximum flux is. (iii) Repeat the speed of the wave, so a wave sent out by a source moving same exercise for the direction for which the minimum away from us will appear to have a longer wavelength than if the source were at rest. Likewise, if the source is flux will be observed, for fixed dL. (iv) If the two observers who see the maximum and minimum flux moving toward us, the time between arrivals of the wave crests is decreased because each successive crest has a from distance dL can resolve the blackbody, what is the apparent brightness, I, that each one will measure? shorter distance to go, and the waves appear to have a (v) Write down an expression for the total luminosity shorter wavelength. A nice analogy was put forward emitted by the black body as a function of a, b and by Weinberg [22]. He compared the situation with a T. (vi) Now, consider a galaxy with a perfectly oblate travelling man that has to send a letter home regularly shape, which contains only a large number N of stars, once a week during his travels: while he is travelling and no gas or dust. To make it simple, assume that all away from home, each successive letter will have a little stars have radius R and surface temperature T. Answer farther to go than the one before, so his letters will arrive again the questions (i-v) for the galaxy, assuming a little more than a week apart; on the homeward leg NR2 ab. Are there any differences from the case of a of his journey, each succesive letter will have a shorter blackbody? Explain why. (vii) Imagine that there were distance to travel, so they will arrive more frequently a very compact galaxy that did not obey the condition than once a week. NR2 ab. Would the answer to the previous question The Doppler effect began to be of enormous impor- be modified? Do you think such a galaxy could be stable? tance to astronomy in 1968, when it was applied to the study of individual spectral lines. In 1815, Fraunhofer EXERCISE 2.8 The HR diagram is usually plotted in first realized that when light from the Sun is allowed to logarithmic coordinates (log L vs. log T, with the tem- pass through a slit and then through a glass prism, the perature increasing to the left). Derive the slope of a line resulting spectrum of colors is crossed with hundreds of of constant radius in the logarithmic HR diagram. dark lines, each one an image of the slit [23]. The dark lines were always found at the same colors, each corre- sponding to a definite wavelength of light. The same III. DOPPLER EFFECT dark spectral lines were also found in the same posi- tion in the spectrum of the Moon and brighter stars. It There is observational evidence that stars move at was soon realized that these dark lines are produced by speeds ranging up to a few hundred kilometers per the selective absorption of light of certain definite wave- second, so in a year a fast moving star might travel lengths, as light passes from the hot surface of a star 1010 km. This is 103 times less than the distance to the through its cooler outer atmosphere. Each line is due to closest∼ star, so their apparent position in the sky changes absorption of light by a specific chemical element, so it very slowly. For example, the relatively fast moving became possible to determine that the elements on the star known as Barnard’s star is at a distance of about Sun, such as sodium, iron, magnesium, calcium, and 56 1012 km; it moves across the line of sight at about chromium, are the same as those found on Earth. 89 km× /s, and in consequence its apparent position shifts In 1868, Sir Huggins was able to show that the dark (so-called “proper motion”) in one year by an angle of lines in the spectra of some of the brighter stars are 0.0029 degrees. The HST has measured proper motions shifted slightly to the red or the blue from their normal as low as about 1 milli arc second per year. In the radio position in the spectrum of the Sun [24]. He correctly (VLBA), relative motions can be measured to an accu- interpreted this as a Doppler shift, due to the motion of racy of about 0.2 milli arc second per year. The apparent the star away from or toward the Earth. For example, the position in the sky of the more distant stars changes so wavelength of every dark line in the spectrum of the star slowly that their proper motion cannot be detected with Capella is longer than the wavelength of the correspond- even the most patient observation. However, the rate ing dark line in the spectrum of the Sun by 0.01%, this of approach or recession of a luminous body in the line shift to the red indicates that Capella is receding from of sight can be measured much more accurately than its us at 0.01% c (i.e., the radial velocity of Capella is about motion at right angles to the line of sight. The technique 30 km/s). 9
There are three special cases: (i) θ0 = 0, which gives p ν = ν (1 β)/(1 + β) . (45) 0 −
In the non-relativistic limit we have ν = ν0(1 β). This corresponds to a source moving away from the− observer. Note that θ = 0. (ii) θ0 = π, which gives p ν = ν (1 + β)/(1 β) . (46) FIG. 5: A source of light waves moving to the right, relative 0 − S v to observers in the frame, with velocity . The frequency is Here the source is moving towards the observer. Note higher for observers on the right, and lower for observers on the left [25]. that θ = π. (iii) θ0 = π/2, which gives
ν = ν0γ . (47) S S Consider two inertial frames, and 0, moving with This last is the transverse Doppler effect – a second v relative velocity as shown in Fig. 5. Assume a light order relativistic effect. It can be thought of as arising source (e.g. a star) at rest in S0 emits light of frequency from the dilation of time in the moving frame. ν0 at an angle θ0 with respect to the observer O0. Let ! hν hν hν EXERCISE 3.2 Suppose light is emitted isotropically pµ = , cos θ, sin θ, 0 (39) in a star’s rest frame S , i.e. dN/dΩ = κ, where dN is c − c − c 0 0 the number of photons in the solid angle dΩ0 and κ is a be the momentum 4-vector for the photon as seen in S constant. What is the angular distribution in the inertial and frame S? ! µ hν0 hν0 hν0 EXERCISE 3.3 Show that for v c, the Doppler shift p = , cos θ0, sin θ0, 0 (40) 0 c − c − c in wavelength is in S0. To get the 4-momentum relation from S0 S, λ0 λ v → − . (48) apply the inverse Lorentz transformation [26] λ ≈ c " !# hν hν0 hν0 To avoid confusion, it should be kept in mind that λ = γ + β cos θ c c − c 0 denotes the wavelength of the light if observed near ! the place and time of emission, and thus presumably hν hν hν cos θ = γ 0 cos θ + β 0 take the values measured when the same atomic tran- − c − c 0 c sition occurs in terrestrial laboratories, while λ0 is the hν hν0 wavelength of the light observed after its long journey sin θ = sin θ0 . (41) c c to us. If λ0 λ > 0 then λ0 > λ and we speak of a red- − shift; if λ0 λ < 0 then λ0 < λ, and we speak of a blueshift. The first expression gives −
ν = ν0γ(1 β cos θ0) , (42) EXERCISE 3.4 Through some coincidence, the Balmer − lines from single ionized helium in a distant star happen which is the relativistic Doppler formula. to overlap with the Balmer lines from hydrogen in the For observational astronomy (42) is not useful because Sun. How fast is that star receding from us? [Hint: both ν0 and θ0 refer to the star’s frame, not that of the ob- the wavelengths from single-electron energy level server. Apply instead the direct Lorentz transformation transitions are inversely proportional to the square of S S0 to the photon energy to obtain the atomic number of the nucleus.] → ν = γν(1 + β cos θ) . (43) 0 EXERCISE 3.5 Stellar aberration is the apparent
This equation gives ν0 in terms of quantities measured motion of a star due to rotation of the Earth about the by the observer. It is sometimes written in terms of Sun. Consider an incoming photon from a star with µ wavelengths: λ = λ0γ(1 + β cos θ). (For details see 4-momentum p . Let S be the Sun’s frame and S0 the e.g. [27].) Earth frame moving with velocity v as shown in Fig. 6. Define the angle of aberration α by θ0 = θ α and show that α β sin θ. − EXERCISE 3.1 Consider the inertial frames S and S0 ≈ shown in Fig. 5. Use the inverse Lorentz transformation to show that the relation between angles is given by EXERCISE 3.6 HD 209458 is a star in the constellation Pegasus very similar to our Sun (M = 1.1M and R =
β cos θ0 1.1R ), located at a distance of about 150 ly. In 1999, two cos θ = − . (44) β cos θ 1 teams working independently discovered an extrasolar 0 − 10
FIG. 6: Schematic representation of stellar aberration [25]. planet orbiting the star using the so-called radial velocity A. Stellar nucleosynthesis planet search method [28, 29]. Note that a star with a planet must move in its own small orbit in response to There is a general consensus that stars are born when the planet’s gravity. This leads to variations in the speed gaseous clouds (mostly hydrogen) contract due to the with which the star moves toward or away from Earth, pull of gravity. A huge gas cloud might fragment into i.e. the variations are in the radial velocity of the star with numerous contracting masses, each mass centered in an respect to Earth. The radial velocity can be deduced from area where the density is only slightly greater than at the displacement in the parent star’s spectral lines due to nearby points. Once such globules formed, gravity would the Doppler shift. If a planet orbits the star, one should cause each to contract in towards its center-of-mass. As have a periodic change in that rate, except for the extreme the particles of such protostar accelerate inward, their case in which the plane of the orbit is perpendicular to kinetic energy increases. When the kinetic energy is our line of sight. Herein we assume that the motions sufficiently high, the Coulomb repulsion between the of the Earth relative to the Sun have already been taken positive charges is not strong enough to keep hydrogen into account, as well as any long-term steady change of nuclei appart, and nuclear fussion can take place. In distance between the star and the sun, which appears as a star like our Sun, the “burning” of hydrogen occurs a median line for the periodic variation in radial velocity when four protons fuse to form a helium nucleus, with due to the star’s wobble caused by the orbiting planet. the release of γ rays, positrons and neutrinos.1 The observed Doppler shift velocity of HD 209458 is The energy output of our Sun is believed to be due found to be K = V sin i = 82.7 1.3 m/s, where i = ± principally to the following sequence of fusion reactions: 87.1◦ 0.2◦ is the inclination of the planet’s orbit to the line perpendicular± to the line-of-sight. [30]. Soon after 1 1 2 + 1H +1H 1H + 2 e + 2 νe (0.42 MeV) , (49) the discovery, separate teams were able to detect a transit → of the planet across the surface of the star making it the 1 2 3 first known transiting extrasolar planet [31, 32]. The 1H +1H 2He + γ (5.49 MeV) , (50) planet received the designation HD 209458b. Because → the planet transits the star, the star is dimmed by about and 2% every 3.52447 0.00029 days. Tests allowing for a ± 3He +3He 4He +1H +1H (12.86 MeV) , (51) non-circular Keplerian orbit for HD 209458 resulted in 2 2 →2 1 1 an eccentricity indistinguishable from zero: e = 0.016 ± where the energy released for each reaction (given in 0.018. Consider the simplest case of a nearly circular parentheses) equals the difference in mass (times c2) be- orbit and find: (i) the distance from the planet to the tween the initial and final states. Such a released energy star; (ii) the mass m of the planet; (iii) the radius r of the is carried off by the outgoing particles. The net effect planet. of this sequence, which is called the pp-cycle, is for four 4 protons to combine to form one 2He nucleus, plus two positrons, two neutrinos, and two gamma rays:
IV. STELLAR EVOLUTION 1 4 + 4 H He + 2e + 2νe + 2γ . (52) 1 →2 Note that it takes two of each of the first two reactions The stars appear unchanging. Night after night the 3 heavens reveal no significant variations. Indeed, on hu- to produce the two 2He for the third reaction. So the man time scales, the vast majority of stars change very little. Consequently, we cannot follow any but the tini- est part of the life cycle of any given star since they live 1 for ages vastly greater than ours. Nonetheless, herein The word “burn” is put in quotation marks because these high- temperature fusion reactions occur via a nuclear process, and must we will follow the process of stellar evolution from the not be confused with ordinary burning in air, which is a chemical birth to the death of a star, as we have theoretically re- reaction, occurring at the atomic level (and at a much lower temper- constructed it. ature). 11 total energy released for the net reaction is 24.7 MeV. or + However, each of the two e quickly annihilates with dM(r) an electron to produce 2m c2 = 1.02 MeV; so the total = 4πr2ρ(r) . (60) e dr energy released is 26.7 MeV. The first reaction, the formation of deuterium from two protons, has very low An important application of (60) is to express physical probability, and the infrequency of that reaction serves quantities not as function of the radius r but of the en- to limit the rate at which the Sun produces energy. closed mass M(r). This facilitates the computation of the These reactions requiere a temperature of about 107 K, stellar properties as function of time, because the mass corresponding to an average kinetic energy (kT) of 1 keV. of a star remains nearly constant during its evolution, while the stellar radius can change considerably. EXERCISE 4.1 Approximately 1038 neutrinos are A radial-symmetric mass distribution M(r) produces produced by the pp chain in the Sun every second. according Gauss law the same gravitational acceleration, Calculate the number of neutrinos from the Sun that are as if it would be concentrated at the center r = 0. There- passing through your brain every second. fore the gravitational acceleration produced by M(r) is GM(r) In more massive stars, it is more likely that the energy g r ( ) = 2 . (61) output comes principally from the carbon (or CNO) cy- − r cle, which comprises the following sequence of reactions: If the star is in equilibrium, this acceleration is balanced by a pressure gradient from the center of the star to its 12 1 13 C + H N + γ , (53) surface. Since pressure is defined as force per area, P = 6 1 → 7 F/A, a pressure change along the distance dr corresponds to an increment 13N 13C + e+ + ν , (54) 7 → 6 dF = dAP (P + dP)dA − 13C +1H 14N + γ , (55) = dAdP = ρ(r)dAdr a(r) (62) 6 1 → 7 − |{z} − | {z } |{z} force mass acceleration 14N +1H 15O + γ , (56) 7 1 → 8 of the force F produced by the pressure gradient dP. For increasing r, the gradient dP < 0 and the resulting force 15O 15N + e+ + ν , (57) dF is positive and therefore directed outward. Hydro- 8 → 7 static equilibrium, g(r) = a(r), requires then − 15 1 12 4 dP GM(r) ρ(r) 7N +1H 6C +2He . (58) = ρ(r)g(r) = . (63) → dr − r2 It is easily seen that no carbon is consumed in this cycle If the pressure gradient and gravity do not balance each (see first and last equations) and that the net effect is the other, the layer at position r is accelerated, same as the pp cycle. The theory of the pp cycle and the carbon cycle as the source of energy for the Sun and the GM(r) 1 dP a(r) = + . (64) stars was first worked out by Bethe in 1939 [33]. r2 ρ(r) dr The fusion reactions take place primarily in the core of the star, where T is sufficiently high. (The surface In general, we need an equation of state, P = P(ρ, T, Yi), temperature is of course much lower, on the order of that connects the pressure P with the density ρ, the (not a few thousand K.) The tremendous release of energy yet) known temperature T and the chemical composition in these fusion reactions produces an outward pressure Yi of the star. For an estimate of the central pressure Pc = sufficient to halt the inward gravitational contraction; P(0) of a star in hydrostatic equilibrium, we integrate and our protostar, now really a young star, stabilizes in (63) and obtain with P(R) 0, ≈ the main sequence. Z R Z M dP M To a good approximation the stellar structure on the P dr G dM c = = 4 , (65) main sequence can be described by a spherically sym- 0 dr 0 4πr metric system in hydrostatic equilibrium. This requires where we used the continuity equation (60) to substitute that rotation, convection, magnetic fields, and other ef- dr = dM/(4πr2ρ) by dM. If we replace furthermore r by fects that break rotational symmetry have only a minor the stellar radius R r, we obtain a lower limit for the influence on the star. This assumption is in most cases central pressure, ≥ very well justified. Z M We denote by M(r) the mass enclosed inside a sphere M P G dM with radius r and density ρ(r) c = 4 0 4πr Z r Z M M M2 M r dr r 2 r G dM ( ) = 4π 0 0 ρ( 0) (59) > 4 = 4 . (66) 0 0 4πR 8πR 12
Inserting values for the Sun, it follows in the stellar core increases to a large number. Then in its core there will be many beryllium-8 nuclei that can fuse 2 !2 4 M 8 M R with another helium nucleus to form carbon-12, which Pc > = 4 10 bar . (67) 8πR4 × M R is stable:
The value obtained integrating the hydrostatic equation 4He +8 Be 12 C + γ (7.367 MeV) . (69) 11 2 4 6 using the “solar standard model” is Pc = 2.48 10 bar, → i.e. a factor 500 larger. × The net energy release of the triple-α process is 7.273 MeV. Further fusion reactions are possible, with 4 12 16 EXERCISE 4.2 Calculate the central pressure Pc of 2He fusing with 6C to form 8O. Stars spend approxi- a star in hydrostatic equilibrium as a function of its mately a few thousand to 1 billion years as a red giant. mass M and radius R for (i) a constant mass density, Eventually, the helium in the core runs out and fusion ρ(r) = ρ0 and (ii) a linearily decreasing mass density, stops. Stars with 0.4M < M < 4M are fated to end
ρ(r) = ρc[1 (r/R)]. up as spheres of carbon and oxygen. Only stars with − M > 4M become hot enough for fusion of carbon and 20 24 Exactly where the star falls along the main sequence oxygen to occur and higher Z elements like 10Ne or 12Mg depends on its mass. The more massive the star, the can be made. further up (and to the left) it falls in the HR diagram. As massive (M > 8M ) red supergiants age, they pro- To reach the main sequence requires perhaps 30 million duce “onion layers” of heavier and heavier elements in years and the star is expected to remain there 10 billion their interiors. A star of this mass can contract under years (1010 yr). Although most of stars are billions of gravity and heat up even further, (T = 5 109 K), pro- 56 56 × years old, there is evidence that stars are actually being ducing nuclei as heavy as 26Fe and 28Ni. However, the born at this moment in the Eagle Nebula. average binding energy per nucleon begins to decrease As hydrogen fuses to form helium, the helium that is beyond the iron group of isotopes. Thus, the formation formed is denser and tends to accumulate in the cen- of heavy nuclei from lighter ones by fusion ends at the tral core where it was formed. As the core of helium iron group. Further fusion would require energy, rather grows, hydrogen continues to fuse in a shell around it. than release it. As a consequence, a core of iron builds When much of the hydrogen within the core has been up in the centers of massive supergiants. consumed, the production of energy decreases at the A star’s lifetime as a giant or supergiant is shorter center and is no longer sufficient to prevent the huge than its main sequence lifetime (about 1/10 as long). As gravitational forces from once again causing the core to the star’s core becomes hotter, and the fusion reactions contract and heat up. The hydrogen in the shell around powering it become less efficient, each new fusion fuel the core then fuses even more fiercely because of the rise is used up in a shorter time. For example, the stages in in temperature, causing the outer envelope of the star the life of a 25M star are as follows:: hydrogen fusion to expand and to cool. The surface temperature thus re- lasts 7 million years, hellium fusion lasts 500,000 years, duced, produces a spectrum of light that peaks at longer carbon fusion lasts 600 years, neon fusion lasts 1 year, wavelength (reddish). By this time the star has left the oxygen fusion lasts 6 months, and sillicon fusion lasts 1 main sequence. It has become redder, and as it has grown day. The star core in now pure iron. The process of cre- in size, it has become more luminous. Therefore, it will ating heavier nuclei from lighter ones, or by absorption have moved to the right and upward on the HR diagram. of neutrons at higher Z (more on this below) is called As it moves upward, it enters the red giant stage. This nucleosynthesis. model then explains the origin of red giants as a natural step in stellar evolution. Our Sun, for example, has been on the main sequence for about four and a half billion B. White dwarfs and Chandrasekhar limit years. It will probably remain there another 4 or 5 billion years. When our Sun leaves the main sequence, it is ex- At a distance of 2.6 pc Sirius is the fifth closest stellar pected to grow in size (as it becomes a red giant) until it system to the Sun. It is the brightest star in the Earth’s occupies all the volume out to roughly the present orbit night sky. Analyzing the motions of Sirius from 1833 of the planet Mercury. to 1844, Bessel concluded that it had an unseen com- If the star is like our Sun, or larger, further fusion panion, with an orbital period T 50 yr [53]. In 1862, can occur. As the star’s outer envelope expands, its Clark discovered this companion,∼ Sirius B, at the time core is shrinking and heating up. When the temperature of maximal separation of the two components of the reaches about 108 K, even helium nuclei, in spite of their binary system (i.e. at apastron) [54]. Complementary greater charge and hence greater electrical repulsion, can follow up observations showed that the mass of Sirius then reach each other and undergo fusion: B equals approximately that of the Sun, M M . Sir- ius B’s peculiar properties were not established≈ until the 4He +4 He 8 Be + γ ( 91.8 keV) . (68) 2 2 →4 − next apastron by Adams [55]. He noted that its high Once beryllium-8 is produced a little faster than it decays temperature (T 25, 000 K) together with its small lu- 17 ' 26 (half-life is 6.7 10− s), the number of beryllium-8 nuclei minosity (L = 3.84 10 W) require an extremely small × × 13 radius and thus a large density. From Stefan-Boltzmann (74) implies P ρ5/3, where ρ is the density. For relativis- law we have tic particles, we∝ can obtain an estimate for the pressure !1/2 !2 inserting v = c, R L T 2 = 10− . (70) 4/3 R L T ≈ P ncp c}n , (75) ≈ ≈ 6 which implies P ρ4/3. It may be worth noting at Hence, the mean density of Sirius B is a factor 10 higher ∝ than that of the Sun; more precisely, ρ = 2 106 g/cm3. this juncture that (i) both the non-relativistic and the × relativistic pressure laws are polytropic equations of A lower limit for the central pressure of Sirius B fol- γ lows from (67) state, P = Kρ ; (ii) a non-relativistic degenerate Fermi gas has the same adiabatic index (γ = 5/3) as an 2 M 16 ideal gas, whereas a relativistic degenerate Fermi gas Pc > = 4 10 bar . (71) 8πR4 × has the same adiabatic index (γ = 4/3) as radiation; (iii) Assuming the pressure is dominated by an ideal gas the in the non-relativistic limit the pressure is inversely P m central temperature is found to be proportional to the fermion mass, 1/ , and so for non-relativistic systems the degeneracy∝ will first Pc 2 9 become important to electrons. Tc = 10 Tc, 10 K . (72) nk ∼ ≈ EXERCISE 4.3 Estimate the average energy of elec- For such a high Tc, the temperature gradient dT/dr in Sirius B would be a factor 104 larger than in the Sun. This trons in Sirius B from the equation of state for non- would in turn require a larger luminosity and a larger relativistic degenerate fermion gas, energy production rate than that of main sequence stars. 2 2/3 2 (3π ) } 5/3 Stars like Sirius B are called white dwarfs. They have P = n , (76) 5 m very long cooling times, because of their small surface lu- minosity. This type of stars is rather numerous. The mass and calculate the Lorentz factor of the electrons. Give density of main-sequence stars in the solar neighbor- a short qualitative statement about the validity of the hood is 0.04M /pc3 compared to 0.015M /pc3 in white non-relativistic equation of state for white dwarfs with dwarfs. The typical mass of white dwarfs lies in the a density of Sirius B and beyond. range 0.4 . M/M . 1, peaking at 0.6M . No further fusion energy can be obtained inside a white dwarf. The Next, we compute the pressure of a degenerate non- star loses internal energy by radiation, decreasing in tem- relativistic electron gas inside Sirius B and check if it is perature and becoming dimmer until its light goes out. consistent with the lower limit for the central pressure For a classical gas, P = nkT, and thus in the limit of zero derived in (71). The only bit of information needed is the temperature, the pressure inside a star also goes to zero. value of ne, which can be written in terms of the density How can a star be stabilized after the fusion processes of the star, the atomic mass of the ions making up the and thus energy production stopped? The solution to star, and the number of protons in the ions (assuming this puzzle is that the main source of pressure in such the star is neutral): compact stars has a different origin. ρ ne = (77) According to Pauli’s exclusion principle no two µe mp fermions can occupy the same quantum state [56]. In where µe A/Z is the average number of nucleon per statistical mechanics, Heisenberg’s uncertainty princi- ≡ ple ∆x∆p } [57] together with Pauli’s principle imply free electron. For metal-poor stars µe = 2, and so from ≥ 1 that each phase-space volume, }− dx dp, can only be oc- (74) we obtain cupied by one fermionic state. h2n5/3 A (relativistic or non-relativistic) particle in a box of P e 3 ≈ me volume L collides per time interval ∆t = L/vx once with !5/3 the yz-side of the box, if the x component of its velocity (1.05 1027 erg s)2 106 g/cm3 × is vx. Thereby it exerts the force Fx = ∆px/∆t = pxvx/L. ≈ 9.11 10 28 g 2 1.67 10 24 g The pressure produced by N particles is then P = F/A = × − × × − 23 2 Npxvx/(LA) = npxvx. For an isotropic distribution, with 10 dyn/cm . (78) 2 2 2 2 2 ≈ v = v + v + v = 3 v , we have 2 h i h xi h yi h zi h xi Since 106 dyn/cm = 1 bar, we have P = 1017 bar, which P = 1 nvp . (73) is consistent with the lower limit derived in (71). 3 We can now relate the mass of the star to its radius 1/3 1/3 Now, if we take ∆x = n− and ∆p }/∆x }n , by combining the lower limit on the central pressure ≈ ≈ 2 4 combined with the non-relativistic expression v = p/m, Pc GM /R and the polytropic equation of state P = the pressure of a degenerate fermion gas is found to be Kρ5∼/3 K(M/R3)5/3 = KM5/3/R5. It follows that ∼ }2n5/3 GM2 KM5/3 P nvp . (74) = , (79) ≈ ≈ m R4 R5 14 or equivalently pressures of exercise 4.1 equal to the relativistic degen- erate electron pressure, M(10 12)/6 1 R = − = . (80) K 1/3 (3π2)1/3}c KM P = n4/3 . (87) 4 If the small differences in chemical composition can be neglected, then there is unique relation between the mass Compare the estimates with the exact limit. and the radius of white dwarfs. Since the star’s radius decreases with increasing mass, there must be a maximal The critical size can be determined by imposing two mass allowed. conditions: that the gas becomes relativistic, Ukin . 2 To derive this maximal mass we first assume the Nmec , and N = Nmax, pressure can be described by a non-relativistic degen- 4/3 erate Fermi gas. The total kinetic energy of the star is 2 c}Nmax 2 3 1/3 Nmaxmec & . (88) Ukin = Np /(2me), where n N/R and p }n . Thus R ∼ ∼ }2n2/3 }2N(3+2)/3 }2N5/3 This leads to U N kin 2 = 2 . (81) ∼ 2me ∼ 2meR 2meR !1/2 2 c} c} mec & , (89) For the potentail gravitational energy, we use the ap- R Gm2 2 N proximation Upot = αGM /R, with α = 1. Hence or equivalently }2N5/3 GM2 U R U U ( ) = kin + pot 2 . (82) !1/2 ∼ 2meR − R } c} R 8 & 2 5 10 cm . (90) mec Gm ∼ × For small R, the positive term dominates and so there N exists a stable minimum R for each M. min which is in agreement with the radii found for white However, if the Fermi gas inside the star becomes rel- dwarf stars. ativistic, then Ukin = Ncp, or
c}N4/3 U Nc}n1/3 (83) C. Supernovae kin ∼ ∼ R and Supernovae are massive explosions that take place at the end of a star’s life cycle. They can be triggered by 4/3 2 c}N GM one of two basic mechanisms: (I) the sudden re-ignition U(R) = Ukin + Upot . (84) ∼ R − R of nuclear fusion in a degenerate star, or (II) the sudden gravitational collapse of the massive star’s core. Now both terms scale like 1/R. For a fixed chemical In a type I supernova, a degenerate white dwarf ac- composition, the ratio N/M remains constant. Therefore, cumulates sufficient material from a binary companion, if M is increased the negative term increases faster than either through accretion or via a merger. This material the first one. This implies there exists a critical M so that raise its core temperature to then trigger runaway U becomes negative, and can be made arbitrary small by nuclear fusion, completely disrupting the star. Since the decreasing the radius of the star: the star collapses. This white dwarf stars explode crossing the Chandrasekhar critical mass is called Chandrasekhar mass M . It can Ch limit, M > M , the release total energy should not vary be obtained by solving (84) for U = 0. Using M = NNmN 4/3 2 2 so much. Thus one may wonder if they are possible we have c}N = GN m , or, with mN mp, max max N ' standard candles. 3/2 !3 c} M EXERCISE 4.5 Type Ia supernovae have been ob- N Pl 57 max 2 2 10 . (85) ∼ Gm ∼ mp ∼ × served in some distant galaxies. They have well-known p 10 luminosities and at their peak LIa 10 L . Hence, ≈ This leads to we can use them as standard candles to measure the distances to very remote galaxies. How far away could MCh = Nmaxmp 1.5M . (86) a type Ia supernova be, and still be detected with HST? ∼ The Chandrasekhar mass derived “professionally” is In type II supernovae the core of a M & 8M star found to be MCh 1.46M [35]. undergoes sudden gravitational collapse. These stars ' have an onion-like structure with a degenerate iron core. EXERCISE 4.4 Derive approximate Chandrasekhar When the core is completely fused to iron, no further mass limits in units of solar mass by setting the central processes releasing energy are possible. Instead, high 10 Point explosion
10 Point explosion The sudden release of a large amount of energy E into a background fluid of density ⇢1 creates a strong explosion, characterized by a strongwhere shockP is wave the postshock (a ‘blast pressure. wave’) To find this pressure, we need to recall the jump emanating from the point where the energy was released.conditions Such explosionsacross a shock. occur If the for shock moves to the right with velocity v1 = v(t), then in the rest-frame of the shock the background gas streams with velocity v to example in astrophysics in the form of supernova explosions. 15 1 the left, and comes out of the shock with a higher density ⇢2, higher pressure P2, and with a lower velocity v2.
u2 u1
The Rankine-Hugonoit relations for the shock tell us
FIG. 7: Left. The sudden release of a large amount of energy into a background⇢ fluid ofv density ρ11creates a strong2 spherical shock 1 = 2 = + (10.3) But how fast will thewave, shock emanating wave from travel the point and where what the energy is left was behind? released. Right. TheJump problem conditions⇢ v of across +1 normal( shock+1) waves.2 If the shock moves to the right with velocity u , then in the rest-frame of the shock the background2 1 gas streams with velocity u = u to sh M 1 − sh the point explosion is alsothe left, known and comes as outSedov-Taylor of the shock with explosion, a higherwhere density afterρ2, higher the pressure two scientistsP2, and with a lower velocity u2. Conservation of momentum requires P + ρ u2 = P + ρ u2, see Appendix C. For the case at hand, P P and so Pv1 ρ u2. that first solved it by analytic (and1 in1 part1 2 numerical)2 2 means in the context1 of2 = 2 ∼ 1 1 (10.4) M c1 atomic bomb explosions. Today, the problem can provideis the a Mach useful number test of to the validate shock. For a strong explosion, the sound-speed of the a hydrodynamical numericalenergy collisions scheme, break because apart iron aninto helium analyticbackground and even- solution mediumThe for released is negligibly it can energy small, be goes so mainly that the into Mach neutrinos number (99%), will tend to infinity tually into protons and neutrons, in this limit. Forkinetic the pressure, energy (1%); the Rankine-Hugonoitonly 0.01% into photons. relation is computed which can then be compared to numerical results. Also,Much the of problem the modeling of supernova explosions and 56 4 2 Fe 13 He + 4 n (91) their remnants derivesP2 2 from the nuclear1 bomb research serves as a good example to demonstrate26 → the2 power of dimensional analysis and= M (10.5) program. WheneverP1 a supernova +1 goes+1 off a large amount scale-free solutions. and of energy E is injected into the “ambient medium” of 2 As the backgrounduniform pressure density is Pρ1 .= In⇢ the1c1/ initial , we phase then ofobtain the expansion in the limit of a strong 4 p n 1 2He 2 + 2 . shock: (92) → the impact of the external medium2 will be small, because the mass of the ambient medium2⇢1v1 that is overrun and This removes the thermal energy necessary to provide P2 (10.6) 10.1 A rough estimatepressure support and the star collapses. When the star taken along is still small' compared +1 with the ejecta mass. begings to contract the density increasesWith and the this free postshockThe supernovapressure, we remnant can now is estimate said to expand the thermal adiabatically. energy in the shocked electrons are forced together with protons tobubble: form neu- After some time a strong spherical shock front (a “blast Let’s begin by deriving an order of magnitude estimate for the radiuswave”)R expands(t)ofthe into the ambient medium, and5 the mass trons via inverse beta decay, 3 2 3 R swept upE bytherm the outwardlyP2R ⇢ moving1v R shock⇢1 significantly (10.7) shock as a function of time. The mass of the swept up material is of order M(t⇠) ⇠ 1 ⇠ t2 3 e− + p n + νe ; (93) exceeds the mass of⇠ the initial ejecta, see Fig. 7. The ⇢1R (t). The fluid velocity behind the shock→ will be of orderThis suggests the mean that radial the thermal velocity energy2 is of the same order as the kinetic energy, ram pressure, P2 ρ1ush, of the matter that enters the of the shock, v(t) R(teven)/t. though We further neutrinos expectdo not interact easilyand with scales matter, in theshock same wave fashion is much with∼ time. larger Hence than the also ambient for the pressure total energy E, which ⇠ at these extremely high densities, they exertis a a conserved tremen- quantity,P1 of the we upstream expect medium, and any radiated energy is dous outward pressure. The outer layers fall inward 2 5 much smaller than the explosion energyR5E. This regime, when the iron1 core collapses, formingR anR enormously during whichE the= energyE + E remains constant⇢ is known as (10.8) 2 3 kin therm ⇠ 1 t2 denseEkin neutronMv star [36]. If⇢M1R. MCh2 ,= then⇢1 the2 core stops the Sedov–Taylor(10.1) phase [37–39]. The mass of the swept collapsing⇠ because2 the⇠ neutrons startt gettingSolvingt packed for too the radiusup materialR(t), weis of get order the expectedM(t) ρ dependencer3(t), where r is the ∼ 1 tightly. Note that MCh as derived in (86) is valid for both radius of the shock. The fluid velocity1 behind the shock 2 What about the thermalneutrons energy and electrons, in the since bubble the stellar created mass is in by both the explosion?will be of order This the mean radialEt velocity5 of the shock, cases given by the sum of the nucleon masses, only the R(t) (10.9) should be of order ush(t) r(t)/t and so the/ kinetic⇢1 energy is main source of pressure (electrons3 or neutrons) differs. ∼ ✓ ◆ m 1 r2 r5 The critical sizeE followstherm from (90)PV by substituting e with E(10.2)Mu2 r3 kin = sh ρ1 2 = ρ1 2 . (95) mN, ⇠ 2 2 ∼ t t !1/2 What about the thermal energy in the bubble created by } c} 2 R & 3 105 cm . (94) the explosion? This should be of order m c 2 N GmN ∼ × 3 r5 E = P1V P r3 ρ u2 r3 ρ . (96) Since already Sirius B was difficult to detect, the ques- therm 2 2 ∼ 2 ∼ 1 sh ∼ 1 t2 tion arises if and how these extremely small stars can be observed. When core density reaches nuclear density, This suggests that the thermal energy is of the same order the equation of state stiffens suddenly and the infalling as the kinetic energy, and scales in the same fashion with material is “reflected.” Both the neutrino outburst and time. Therefore the outer layers that crash into the core and rebound r5 E = E + E ρ , (97) cause the entire star outside the core to be blown apart. kin therm ∼ 1 t2 16 yielding leave it. Because no light escapes after the star reaches this infinite density, it is called a black hole. !1/5 Et2 r(t) . (98) ∼ ρ 1 V. WARPING SPACETIME The expanding shock wave slows as it expands A hunter is tracking a bear. Starting at his camp, he !1/5 !1/2 walks one mile due south. Then the bear changes direc- 2 E 2 E 3/2 ush = = r− . (99) tion and the hunter follows it due east. After one mile, 5 ρ t3 5 ρ 1 1 the hunter loses the bear’s track. He turns north and This means that the blask wave decelerates and dis- walks for another mile, at which point he arrives back at sapears after some time. The expanding supernova his camp. What was the color of the bear? remnant then passes from its Taylor-Sedov phase to its An odd question. Not only is the color of the bear “snowplow” phase. During the snowplow phase, the unrelated to the rest of the question, but how can the matter of the ambient interstellar medium is swept up hunter walk south, east and north, and then arrive back by the expanding dense shell, just as snow is swept up at his camp? This certainly does not work everywhere by a coasting snowplow. on Earth, but it does if you start at the North pole. There- fore the color of the bear has to be white. A surprising EXERCISE 4.6 Estimate the energy of the first observation is that the triangle described by the hunter’s detonation of a nuclear weapon (code name Trinity) path has two right angles in the two bottom corners, and from the time dependence of the radius of its shock so the sum of all three angles is greater than 180◦. This wave. Photographs of the early stage of the explosion implies the metric space is curved. are shown in Fig. 8. The device was placed on the top What is meant by a curved space? Before answering of a tower, h = 30 m and the explosion took place at this question, we recall that our normal method of view- about 1100 m above sea level. (i) Explain the origin ing the world is via Euclidean plane geometry, where of the thin layer above the bright “fireball” that can the line element of the n-dimensional space is given by be seen in the last three pictures (t 0.053 s). Is the ≥ Xn shock front behind or ahead of this layer? Read the ds2 = dx2 . (100) radius of the shock front from the figures and plot it i i=1 as a function of time after the explosion. The time and length scale are indicated in the lables of the figures. Non-Euclidean geometries which involve curved spaces (ii) Fit (by eye or numerical regression) a line to the have been independently imagined by Gauss [50], radius vs. time dependence of the shock front in a Bolyai´ [51], and Lobachevsky [52]. To understand the log-log representation, ln(r) = a + b ln(t). Verifiy that b is idea of a metric space herein we will greatly simplify compatible with a Sedov-Taylor expansion. Then fix b the discussion by considering only 2-dimensional sur- to the theoretical expectation, re-evaluate a and estimate faces. For 2-dimensional metric spaces, the so-called first the energy of the bomb in tons of TNT equivalent. [Hint: and second fundamental forms of differential geometry ignore the initial (short) phase of free expansion.] uniquely determine how to measure lengths, areas and angles on a surface, and how to describe the shape of a If the final mass of a neutron star is less than MCh its parameterized surface. subsequent evolution is thought to be similar to that of a white dwarf. In 1967, an unusual object emitting a radio signal with period T = 1.377 s was detected at the A. 2-dimensional metric spaces Mullard Radio Astronomy Observatory. By its very na- ture the object was called “pulsar.” Only one year later, The parameterization of a surface maps points (u, v) Gold argued that pulsars are rotating neutron stars [41]. in the domain to points ~σ(u, v) in space: He predicted an increase on the pulsar period because of electromagnetic energy losses. The slow-down of the x(u, v) Crab pulsar was indeed discovered in 1969 [42]. ~ u v y(u, v) σ( , ) = . (101) If the mass of the neutron star is greater than MCh, z(u, v) then the star collapses under gravity, overcoming even the neutron exclusion principle [43]. The star eventually Differential geometry is the local analysis of how small collapses to the point of zero volume and infinite density, changes in position (u, v) in the domain affect the position creating what is known as a “singularity” [44–49]. As the on the surface ~σ(u, v), the first derivatives ~σu(u, v) and density increases, the paths of light rays emitted from ~σv(u, v), and the surface normal nˆ(u, v). the star are bent and eventually wrapped irrevocably The first derivatives, ~σu(u, v) and ~σv(u, v), are vectors around the star. Any emitted photon is trapped into that span the tangent plane to the surface at point ~σ(u, v). an orbit by the intense gravitational field; it will never The surface normal at point ~σ is defined as the unit vector 17
FIG. 8: Trinity test of July 16, 1945. Figures also available at http://cosmo.nyu.edu/~mu495/HEA15/trinity/
2 18 normal to the tangent plane at point ~σ and is computed can be used to characterize the local shape of the folded using the cross product of the partial derivatives of the surface. surface parameterization, The concept of curvature, while intuitive for a plane curve (the reciprocal of the radius of curvature), requires ~σu ~σv nˆ(~σ) = × . (102) a more comprehensive definition for a surface. Through ~σu ~σv a point on a surface any number of curves may be drawn || × || with each having a different curvature at the point. We The tangent vectors and the surface normal define an have seen that at any point on a surface we can find nˆ orthogonal coordinate system at point ~σ(u, v) on the sur- which is at right angles to the surface; planes containing face, which is the framework for describing the local the normal vector are called normal planes. The inter- shape of the surface. section of a normal plane and the surface will form a Geometrically, d~σ is a differential vector quantity that curve called a normal section and the curvature of this is tangent to the surface in the direction defined by du curve is the normal curvature κ. For most points on and dv. The first fundamental form, I, which measures most surfaces, different sections will have different cur- the distance of neighboring points on the surface with vatures; the minimum and maximum values of these are parameters (u, v) and (u+du, v+dv), is given by the inner called the principal curvatures, denoted by κ1 and κ2. product of d~σ with itself The Gaussian curvature is defined by the product of the two principal curvatures K = κ κ . It may be calculated I ds2 = d~σ d~σ = (~σ du + ~σ dv) (~σ du + ~σ dv) 1 2 u v u v using the first and second fundamental coefficients. At ≡ · 2 · 2 = (~σu ~σu)du + 2(~σu ~σv)dudv + (~σv ~σv)dv · · · each grid point where these values are known two ma- = Edu2 + 2Fdudv + Gdv2 , (103) trices are defined. The matrix of the first fundamental form, where E, F and G are the first fundamental coefficients. ! The coefficients have some remarkable properties. For EF I = , (108) example, they can be used to calculate the surface area. FG Namely, the area bounded by four vertices ~σ(u, v), ~σ(u + δu, v), ~σ(u, v + δv), ~σ(u + δu, v + δv) can be expressed in and the matrix of the second fundamental form, terms of the first fundamental form with the assistance ! of Lagrange identity e f II = . (109) n 1 n n n f g X− X X X 2 2 2 (aibj ajbi) = a b − k k i=1 j=i+1 k=1 k=1 The Gaussian curvature is given by
n 2 X det II akbk , (104) K = . (110) − det I k=1 which applies to any two sets a , a , , an and As an illustration, consier a half-cylinder of radius R { 1 2 ··· } b1, b2, , bn of real numbers. The classical area ele- oriented along the x axis. At a particular point on the ment{ is··· found} to be surface, the scalar curvature can have different values depending on direction. In the direction of the half- 2 δA = ~σu δu ~σv δv = √EG F δu δv , (105) cylinder’s axis (parallel to the x axis), the surface has zero | × | − scalar curvature, κ = 0. This is the smallest curvature or in differential form value at any point on the surface, and therefore κ1 is in this direction. For a curve on the half-cylinder’s surface dA = √EG F2 du dv . (106) − parallel to the (y, z) plane, the cylinder has uniform scalar Note that the expression under the square root in (106) curvature. In fact this curvature is the greatest possible on the surface, so that κ = 1/R is in this direction. For is precisely ~σu ~σv and so it is strictly positive at the 2 regular points.| × | a curve on the surface not in one of these directions, the The key to the second fundamental form, II, is the scalar curvature is greater than κ1 and less than κ2. The unit normal vector. The second fundamental form coef- Gaussian curvature is K = 0. ficients at a given point in the parametric uv-plane are 2-dimensional metric spaces can be classified accord- given by the projections of the second partial deriva- ing to the Gaussian curvature into elliptic (K > 0), flat tives of ~σ at that point onto the normal vector and can (K = 0), and hyperbolic (K < 0). Triangles which lie on be computed with the aid of the dot product as follows: the surface of an elliptic geometry will have a sum of e = ~σuu nˆ, f = ~σuv nˆ, and g = ~σvv nˆ. The second angles which is greater than 180◦. Triangles which lie fundamental· form, · · on the surface of an hyperbolic geometry will have a sum of angles which is less than 180◦. II = e du2 + 2 f du dv + g dv2 , (107) 19
EXERCISE 5.1 The unit sphere can be parametrized patch under the parametrization as cos θ cos φ cos u sin v ~σ(θ, φ) = cos θ sin φ . (115) ~ u v sin u sin v σ( , ) = (111) sin θ cos v [Hint: A great circle (a.k.a. orthodrome) of a sphere is where (u, v) [0, 2π) [0, π]. (i) Find the distance of ∈ × the intersection of the sphere and a plane which passes neighboring points on the surface with parameters (u, v) through the center point of the sphere.] and (u + du, v + dv), a.k.a. the line element ds2. (ii) Find the surface area. (iii) Find the Gaussian curvature. The scalar curvature (or Ricci scalar) is the simplest curvature invariant of an n-dimensional hypersurface. EXERCISE 5.2 The tractrix is a curve with the follow- To each point on the hypersurface, it assigns a single ing nice interpretation: Suppose a dog-owner takes his real number determined by the intrinsic geometry of pet along as he goes for a walk “down” the y-axis. He the hypersurface near that point. It provides one way of starts from the origin, with his dog initially standing on measuring the degree to which the geometry determined the x-axis at a distance r away from the owner. Then by a given metric might differ from that of ordinary Eu- the tractrix is the path followed by the dog if he “fol- clidean n-space. In two dimensions, the scalar curvature lows his owner unwillingly”, i.e., if he constantly pulls is twice the Gaussian curvature, R = 2K, and completely against the leash, keeping it tight. This means mathe- characterizes the curvature of a surface. In more than matically that the leash is always tangent to the path of two dimensions, however, the curvature of hypersur- the dog, so that the length of the tangent segment from faces involves more than one functionally independent the tractrix to the y-axis has constant length r. The trac- quantity. trix has a well-known surface of revolution called the pseudosphere which, for r = 1, can be parametrized as B. Schwarzschild metric sechu cos v ~ u v sechu sin v σ( , ) = , (112) u tanhu Consider a freely falling spacecraft in the gravitational − field of a radially symmetric mass distribution with total with u ( , ) and v [0, 2π). (i) Find the line mass M. Because the spacecraft is freely falling, no ef- element.∈ (ii)−∞ Find∞ the surface∈ area. (iii) Find the fects of gravity are felt inside. Then, the spacetime coor- Gaussian curvature. dinates from r should be valid inside the spacecraft. → ∞ Let us call these coordinates Σ~ (t , x , y , z ), with x A curve γ with parametr t on a surface ~σ(u, v) is called parallel and y , z transversal∞ to movement.∞ ∞ ∞ ∞ The space-∞ a geodesic if at every point γ(t) the acceleration vector craft has velocity∞ ∞v at the distance r from the mass M, γ~¨(t) is either zero or parallel to its unit normal nˆ. measured in the coordinate system Σ~ = (t, r, θ, φ) in which the mass M is at rest at r = 0. As long as the EXERCISE 5.3 Show that a geodesic γ(t) on a surface gravitational field is weak, to first order approximation ~σ has constant speed. that the laws of special relativity hold [58], and we can use a Lorentz transformation [26] to relate Σ~ at rest and EXERCISE 5.4 A curve γ on a surface ~σ is a geodesic Σ~ moving with v = βc. We will define shortly what if and only if for any part (t) ~(u(t) v(t)) contained ∞ γ = σ , “weak” means in this context. For the moment, we pre- in a surface patch ~, the following two equations are σ sume that effects of gravity are small if the velocity of satisfied: the spacecraft, which was at rest a r , is still small v c. Should this be the case, we have→ ∞ d 1 2 2 (Eu˙ + Fv˙) = (Euu˙ + 2Fuu˙v˙ + Guv˙ ) , (113) dt 2 q dt = dt 1 β2 ∞ − d 1 2 2 dr (Fu˙ + Gv˙) = (Evu˙ + 2Fvu˙v˙ + Gvv˙ ) , (114) dx = p dt 2 ∞ 1 β2 − where Edu2 +2Fdudv+Gdv2 is the first fundamental form dy = r dθ ∞ of ~σ. (113) and (114) are called the geodesic equations. dz = r sin θ dφ . (116) They are nonlinear and solvable analytically on rare ∞ occasions only. The infinitesimal distance between two spacetime events is given by the Minkowskian line element [59] EXERCISE 5.5 Show that if γ is a geodesic on the unit 2 2 µ ν 2 2 2 2 2 sphere S , then γ is part of a great circle. Consider the ds = gµνdx dx = c dt dx dy dz , (117) ∞ − ∞ − ∞ − ∞ 20 which, for the case at hand, becomes The time intervals dτ(r0) and dτ(r) are different and thus the time measured by clocks at different distances r from dr2 ds2 = (1 β2)c2dt2 + r2(dθ2 + sin2 dφ2) . (118) the mass M will differ too. In particular, the time τ − − 1 β2 measured by an observer at infinity will pass faster than∞ − the time experienced in a gravitational field, Herein we follow the notation of [27]: Greek indices (µ, ν, ) run from 0 to 3 and Latin indices (i, j, ) from τ(r) ··· ··· τ = < τ(r) . (125) 1 to 3. ∞ √1 2α/r We now turn to determine β from measurable quanti- − ties of the system: M and r. Consider the energy of the Since frequencies are inversely proportional to time, the spacecraft with rest mass m, frequency or energy of a photon traveling from r to r0 will be affected by the gravitational field as 2 GγmM (γ 1)mc = 0 , (119) r − − r ν(r0) 1 2α/r = − . (126) where the first term is the kinetic energy and the sec- ν(r) 1 2αr0 − ond the Newtonian expression for the potential energy. Therefore, an observer at r0 will receive photons, Note that here we have made the crucial assumption that → ∞ gravity couples not only to the mass of the spacecraft but which were emitted with frquency ν by a source at posi- 2 tion r, redhsifted to frequency ν , also to its total energy. Dividing by γmc gives ∞ ! r 1 GM 2GM 1 = 0 . (120) ν = 1 ν(r) . (127) − γ − rc2 ∞ − rc2
Introducing α = GM/c2 we can re-write (120) as Note that the photon frequency is redshifted by the grav- itational field. The size of this effect is of order Φ/c2, q α where Φ = GM/r is the Newtonian gravitational po- 1 β2 = 1 , (121) − − − r tential. We are now in position to specify more pre- cisely what weak gravitational fields means. As long as 2 1/2 2 where γ = (1 β )− . (121) leads to Φ /c 1, the deviation of − | | 2α α2 2α 2GM Φ(r) 1 β2 = 1 + 1 ; (122) g00 = 1 1 2 (128) − − r r2 ≈ − r − rc2 ≈ − c2 in the last step, we neglected the term (α/r)2, since we from the Minkowski value g00 = 1 is small, and Newto- attempt only at an approximation for large distances, nian gravity is a sufficient approximation. where gravity is still weak. Inserting this expression into What is the meaning of r = 2α? At (118), we obtain the metric describing the gravitational 2GM M field produced by a radially symmetric mass distribu- R = = 3 km , (129) Sch c2 M tion, 1 the Schwarzschild coordinate system (123) becomes ill- 2α 2α − ds2 = 1 c2dt2 1 dr2 r2dΩ2 , (123) defined. However, this does not mean necessarily that at − r − − r − r = RSch physical quantities like tidal forces become in- 2 2 2 2 finite. As a matter of fact, all scalar invariants are finite, where dΩ = dθ + sin θdφ . Wickedly, this agrees with 6 the exact result found by Schwarzschild [60] by solv- e.g. R = 0 and K = 12RSch/r . Here R is the Ricci scalar ing Einstein’s vacuum field equations of general relativ- and K the Kretschmann scalar [62], a quadratic scalar in- ity [61]. variant used to find the true singularities of a spacetime. As in special relativity, the line element ds2 determines The Schwarzschild’s scalar invariants can only be found the time and spatial distance between two spacetime by long and troublesome calculation that is beyond the events. The time measured by an observer in the instan- scope of this course; for a comprehensive discussion see taneous rest frame, known as the proper time dτ, is given e.g. [63, 64]. Before proceeding we emphasize again that, by dτ = ds/c [27]. In particular, the time difference be- whether or not the singularity is moved to the origin, tween two events at the same point is obtained by setting only depends on the coordinate frame used, and has no dxi = 0. If we choose two static observers at the position physical significance whatsoever; see Appendix D for an example. r and r0, then we find with dr = dφ = dθ = 0, If the gravitating mass is concentrated inside a radius p s smaller than RSch then we cannot obtain any information dτ(r) g00(r) dt g00(r) = = . (124) about what is going on inside RSch, and we say r = RSch d r p g r τ( 0) g00(r0) dt 00( 0) defines an event horizon. An object smaller than its 21
Schwarzschild radius, is called a black hole. In Newto- at a rate that is nian gravity, only the enclosed mass M(r) of a spherically s symmetric system contributes to the gravitational poten- r 2 1 2GM vescape tial outside r. Therefore, we conclude the Sun is not a 1 ⊕ = 1 (132) − c2 r − c2 black hole, becasue for all values of r the enclosed mass 2 is M(r) < rc /(2G). The Schwarzschild black hole is fully times as fast as one located far away from the Earth characterized by its mass M. To understand this better, (i.e. at r ). Note how much this expression looks we consider next what happens to a photon crossing the like the equivalent→ ∞ expression from special relativity event horizon as seen from an observer at r . 2 → ∞ for time dilation. Here R is the radius of the Earth, Light rays are characterized by ds = 0. Consider a and M is its mass. Using⊕ (132) calculate the rate at light ray traveling in the radial direction, that is to say which⊕ a stationary clock at a radius r (for r > R ) will dφ = dθ = 0. The Schwarzschild metric (123) becomes tick relative to one at the surface of the Earth. Is⊕ your dr 2α rate greater or less than 1? If greater than 1, this means = 1 c . (130) dt − r the high altitude clock at r > R ticks faster than one on the surface; if less than one,⊕ this means the high As seen from far away a light ray approaching a massive altitude clock ticks slower than one on the surface. star will travel slower and slower as it comes closer to (ii) Now consider an astronaut orbiting at r > R . What the Schwarzschild radius. In fact, for an observer at in- is her orbital velocity as a function of r? Because⊕ she is finity the signal will reach r = RSch only asymptotically, moving with respect to a stationary observer at radius for t . Similarly, the communication with a freely → ∞ r, special relativity says that her clock is ticking slower. falling spacecraft becomes impossible as it reaches Calculate the ratio of the rate her clock ticks to that of a r = RSch. A more detailed analysis shows that indeed, stationary observer at radius r. (Note that for circular as seen from infinity, no signal can cross the surface motion, the acceleration in the spaceship travelling in at r = R . The factors (1 2α/r) in (123) control the Sch − a circle is not zero, so the spaceship is not in a single bending of light, a phenomenon known as gravitational frame of inertia.) (iii) Determine an expression for the lensing. The first observation of light deflection was ratio of the rate at which the orbiting astronaut’s clock performed by noting the change in position of stars as ticks to a stationary clock on the surface of the Earth, they passed near the Sun on the celestial sphere. The as a function of the radius r at which she orbits. You observations were performed in May 1919 during a total may ignore the small velocity of the clock on the surface solar eclipse, so that the stars near the Sun (at that time of the Earth due to the Earth’s rotation. (iv) Using in the constellation Taurus) could be observed [65]. 1 √1 x 1 x/2 + , (1 x)− 1 + x + , and (1 −x)(1≈ y)−= 1 x ···y + xy− 1 ≈(x + y), all valid··· for EXERCISE 5.6 In addition to the time dilation due to x − 1 and− y 1,− derive− an expression≈ − of the form 1 δ an object moving at a finite speed that we have learned for the relative rate of a clicking clock on the surface− about in special relativity, we have seen that there is of the Earth and the orbiting astronaut. Demonstrate an effect in general relativity, termed “gravitational red- that δ 1. (v) Calculate the radius r at which the clock shift,” caused by gravity itself. To understand this latter of the orbiting astronaut ticks at the same rate as a effect, consider a photon escaping from the Earth’s sur- stationary one on the surface of the Earth; express your face to infinity. It loses energy as it climbs out of the result in Earth radii and kilometers. Will an astronaut Earth’s gravitational well. As its energy E is related to orbiting at a smaller radius age more or less than one its frequency ν by Planck’s formula E = hν, its frequency who stayed home? Thus, do astronauts on the Space must therefore also be reduced, so observers at a great Shuttle (orbiting 300 km above the Earth’s surface) age distance r must see clocks on the surface ticking at more or less than one staying home? a lower frequency→ ∞ as well. Therefore an astronaut orbit- ing the Earth ages differently from an astronomer sitting EXERCISE 5.7 “A full set of rules [of Brockian Ultra still far from the Earth for two reasons; the effect of grav- Cricket, as played in the higher dimensions] is so ity, and the time dilation due to motion. In this problem, massively complicated that the only time they were you will calculate both these effects, and determine their all bound together in a single volume they underwent relative importance. (i) The escape speed from an object gravitational collapse and became a black hole” [66]. of mass M if you are a distance r from it is given by A quote like this is crying out for a calculation. In r 2GM this problem, we will answer Adams challenge, and vescape = . (131) determine just how complicated these rules actually r are. An object will collapse into a black hole when That is, if you are moving this fast, you will not fall back its radius is equal to the radius of a black hole of the to the object, but will escape its gravitational field en- same mass; under these conditions, the escape speed tirely. Schwarzschild’s solution to Einstein’s field equa- at its surface is the speed of light (which is in fact the tions of general relativity shows that a stationary, non- defining characteristic of a black hole). We can rephrase moving clock at a radius r R from the Earth will tick the above to say that an object will collapse into a black ≥ ⊕ 22 hole when its density is equal to the density of a black the coordinate distance R (assume R > RSch and take the hole of the same mass. (i) Derive an expression for the absolute value of grr). [Hint: The following facts may be density of a black hole of mass M. Treat the volume helpfull: of the black hole as the volume of a sphere of radius Z r given by the Schwarzschild radius. As the mass of a 1 ξ π dξ = , (133) black hole gets larger, does the density grow or shrink? 1 ξ 2 (ii) Determine the density of the paper making up the 0 − Cricket rule book, in units of kilograms per cubic meter. and Standard paper has a surface density of 75 g per square Z r meter, and a thickness of 0.1 mm. (iii) Calculate the α ξ √ √ √ √ mass (in solar masses), and radius (in AU) of the black dξ = ln( α 1 + α) + α 1 α , (134) 1 1 ξ − − hole with density equal to that of paper. (iv) How many − pages long is the Brockian Ultra Cricket rule book? where α > 1 is constant.] (iii) Now use your answers to Assume the pages are standard size (8.500 1100). For part (i) and part (ii) to compute Π where C = 2ΠRphys. × 3 calculational simplicity, treat the book as spherical (a (iv) Plot Π as a function of ξ R/RSch for ξ [1, 10 ] (use common approximation in this kind of problem). What log axes for the x axis). What≡ happens with∈Π as ξ ? if the rule book were even longer than you have just → ∞ calculated? Would it still collapse into a black hole? C. Eddington luminosity and black hole growth EXERCISE 5.8 Black holes provide the ultimate lab- oratory for studying strong-field gravitational physics. Binary X-ray sources are places to find strong black The tides near black holes can be so extreme that a hole candidates [67, 68]. A companion star is a perfect process informally called “spaghettification” occurs in source of infalling material for a black hole. As the matter which a body falling towards a black hole is strongly falls or is pulled towards the black hole, it gains kinetic stretched due to the difference in gravitational force at energy, heats up and is squeezed by tidal forces. The different locations along the body (this is called a tidal heating ionizes the atoms, and when the atoms reach a effect). In the following, imagine that you are falling few million degrees Kelvin, they emit X-rays. The X- into a 3M black hole. (i) What is the Schwarzschild rays are sent off into space before the matter crosses the radius of this black hole (in km)? (ii) You are 1.5 m tall event horizon, and so we can detect this X-ray emission. and 70 kg in mass and are falling feet first. At what Another sign of the presence of a black hole is random distance from the black hole would the gravitational variation of emitted X-rays. The infalling matter that force on your feet exceed the gravitational force on your emits X-rays does not fall into the black hole at a steady head by 10 kN? Express this distance in km and in rate, but rather more sporadically, which causes an ob- Schwarzschild radii of the black hole. (iii) To appreciate servable variation in X-ray intensity. Additionally, if the if this amount of force is enough to “spaghettify” and X-ray source is in a binary system, the X-rays will be kill you, imagine that you are suspended from a ceiling periodically cut off as the source is eclipsed by the com- of your room (on Earth) with a steel plate tied to your panion star. feet. Calculate the mass of the plate (in kg) that will Cygnus X-1 is one of the strongest X-ray sources we give you a nice tug of 10 kN (you can ignore the weight can detect from Earth [69] and the first widely thought to of your body here). Do you think this pull will kill be a black hole, after the detection of its rapid X-ray vari- you? (iv) Now consider a trip toward the supermassive ability [70] and the identification of its optical countem- black hole at the center of our Galaxy, which has an part with the blue supergiant star HDE 226868 [71, 72]. estimated mass of 4 million M . How does this change The X-ray emission is powered mainly by accretion from the distance at which you will be “spaghettified” by the the strong stellar wind from HDE 226868 [73]. While the differential gravity force of 10 kN? Express your answer disk of accreting matter is incredibly bright on its own, in km and in Schwarzschild radii of the black hole. Cygnus X-1 has another source of light: a pair of jets per- (v) Find the smallest mass of the black hole for which pendicular to the disk erupt from the black hole carrying you would not die by “spaghettification” before falling part of the infalling material away into the interstellar within its event horizon. space [74]. Consider a steady spherically symmetrical accretion. EXERCISE 5.9 In the Schwarzschild metric r is a co- We assume the accreting material to be mainly hydro- moving coordinate, not a real physical distance. Rather, gen and to be fully ionized. Under these circumstances, integrals over ds constitute physical distances. In the fol- the radiation exerts a force mainly on the free electrons lowing we will take slices of spacetime at a constant time through Thomson scattering, since the scattering cross 2 (dt = 0). (i) Compute the physical circumference, C, at section for protons is a factor (me/mp) smaller, where 4 a given coordinate distance R from the center of a black me/mp = 5 10− is the ratio of the electron and proton × 1 2 hole of mass M at θ = π/2. (ii) Compute the physical masses [75]. If F is the radiant energy flux (erg s− cm− ) 25 2 distance R from the center of the black hole out to and σT = 6.7 10− cm is the Thomson cross section, phys × 23 then the outward radial force on each electron equals the EXERCISE 5.10 The pictures in Fig. 10 show a time rate at which it absorbs momentum, sequence of radio observations of the quasar 0827+243. The core of the quasar is the bright object at a distance σTF Fout = . (135) of 0 ly and a fainter blob of plasma is moving away c from it. (i) What is the apparent velocity of the motion The attractive electrostatic Coulomb force between the of the plasma blob? (ii) Derive the apparent transverse electrons and protons means that as they move out the velocity of an object ejected from a source at velocity v at electrons drag the protons with them. In effect, the radi- an angle θ with respect to the line of sight between the ation pushes out electron-proton pairs against the total source and the observer. (iii) Which angle maximizes gravitational force the apparent transverse velocity? What is accordingly the minimal Lorentz-factor of the plasma blob observed GM in 0827+243? F = (m + m ) (136) in r2 p e acting on each pair at a radial distance r from the center. VI. EXPANSION OF THE UNIVERSE 1 If the luminosity of the accreting source is L (erg s− ), we have The observations that we will discuss in this section L reveal that the universe is in a state of violent explosion, F = (137) 4πr2 in which the galaxies are rushing appart at speeds ap- proaching the speed of light. Moreover, we can extrapo- by spherical symmetry, so the net inward force on an late this explosion backwards in time and conclude that electron-proton pair is all the galaxies must have been much closer at the same time in the past – so close, in fact, that neither galaxies LσT 1 Fnet = GMmp . (138) nor stars nor even atoms or atomic nuclei could have − 4πc r2 had a separate existence. There is a limiting luminosity for which this expression vanishes, called the Eddington limit [76] A. Hubble’s law ! 4πGMmp 38 M 1 LEdd = 1.3 10 erg s− . (139) σT ' × M The XVI century finally saw what came to be a water- shed in the development of Cosmology. In 1543 Coper- At greater luminosities the outward pressure of radiation nicus published his treatise “De Revolutionibus Orbium would exceed the inward gravitational attraction and Celestium” (The Revolution of Celestial Spheres) where accretion would be halted. a new view of the world is presented: the heliocentric Active galactic nuclei (AGNs) are galaxies that model [3]. harbor compact masses at the center exhibiting intense It is hard to underestimate the importance of this work: non-thermal emission that is often variable, which it challenged the age long views of the way the uni- indicates small sizes (light months to light years). The verse worked and the preponderance of the Earth and, luminosity of an accreting black hole is proportional to by extension, of human beings. The realization that the rate at which it is gaining mass. Under favorable we, our planet, and indeed our solar system (and even conditions, the accretion leads to the formation of a our Galaxy) are quite common in the heavens and re- highly relativistic collimated jet. The formation of the produced by myriads of planetary systems, provided a jet is not well constrained, but it is thought to change sobering (though unsettling) view of the universe. All from magnetic-field-dominated near the central engine the reassurances of the cosmology of the Middle Ages to particle (electron and positron, or ions and electrons) were gone, and a new view of the world, less secure and dominated beyond pc distances. The AGN taxonomy, comfortable, came into being. Despite these “problems” controlled by the dichotomy between radio-quiet and and the many critics the model attracted, the system radio-loud classes, is represented in Fig. 9. The appear- was soon accepted by the best minds of the time such as ance of an AGN depends crucially on the orientation Galileo. of the observer with respect to the symmetry axis of The simplest and most ancient of all astronomical ob- the accretion disk [78]. In this scheme, the difference servations is that the sky grows dark when the Sun goes between radio-loud and radio-quiet AGN depends on down. This fact was first noted by Kepler, who, in the the presence or absence of radio-emitting jets powered XVII century, used it as evidence for a finite universe. In by the central nucleus, which in turn may be speculated the XIX century, when the idea of an unending, unchang- to depend on: (i) black hole rotation; (ii) low power or ing space filled with stars like the Sun was widspread high power, as determined by the mass-accretion rate in consequence of the Copernican revolution, the ques- 2 Mc˙ /LEdd [79]. tion of the dark night sky became a problem. To clearly ascertain this problem, we recall that if absorption is 1.1. QUASARS, AGN AND BLAZARS 27
added: a dusty torus or a wrapped disk obscuring the light of type 2 objects. The unification scheme that has emerged combining these ingredi- ents (black hole, disk, jet, torus and clouds) is usually attributed to Antonucci (1993) and Urry & Padovani (1995). As shown in Fig. 6, it is based on orientation effects compared to the line of sight.
24 pastel-00822242, version 1 - 14 May 2013
FIG. 9: UnificationFigure scheme of AGN. 6. TheUnification acronyms for the scheme different sub-classes of AGN. of AGN The are as acronyms follows: Fanaroff-Riley radio galaxies (FR I/II), narrow line radio galaxy (NLRG), broad line radio galaxy (BLRG), radio-loud quasar (RLQ), radio quiet quasar (RQQ), flat spectrumfor radio the quasar diff (FSRQ),erent and sub-classes Sefeyrt galaxies (Sy of 1/2) AGN [77]. are given in Fig. 2. Adapted from Urry & Padovani (1995) . neglected, the aparent luminosity of a star of absolute (1744) [81] and Olbers (1826) [82] postulated the exis- luminosity L at a distance r will be b = L/4πr2. If the tence of an interstellar medium that absorbs the light number density of such stars is a constant n, then the from very distant stars responsible for the divergence number of stars at distances r between r and r + dr is of the integral in (140). However, this resolution of dN = 4πnr2dr, so the total radiant energy density due to the paradox is unsatisfactory, because in an eternal all stars is universe the temperature of the interstellar medium Z Z would have to rise until the medium was in thermal ∞ L b dN n r2dr equilibrium with the starlight, in which case it would ρs = = 2 4π 0 4πr be emitting as much energy as it absorbs, and hence Z ∞ could not reduce the average radiant energy density. = Ln dr . (140) 0 The stars themselves are of course opaque, and totally block out the light from sufficiently distant sources, The integral diverges, leading to an infinite energy den- but if this is the resolution of the so-called “Olbers sity of starlight! paradox” then every line of segment must terminate In order to avoid this paradox, both de Cheseaux´ 202 PINER ET AL. Vol. 640
25
In 1929, Hubble discovered that the spectral lines of galaxies were shifted towards the red by an amount pro- portional to their distances [83]. If the redshift is due to the Doppler effect, this means that the galaxies move away from each other with velocities proportional to their separations. The importance of this observation is that it is just what we should predict according to the simplest possible picture of the flow of matter in an ex- panding universe. The redshift parameter is defined as the traditional shift in wavelength of a photon emitted by a distant galaxy at time tem and observed on Earth today λ ν z = obs 1 = em 1, (141) λem − νobs − Although measuring a galaxy’s redshift is relatively easy, and can be done with high precision, measuring its dis- tance is difficult. Hubble knew z for nearly 50 galaxies, but had estimated distances for only 20 of them. Nev- ertheless, from a plot of redshift versus distance (repro- duced in Fig. 11) he found the famous linear relation now known as the Hubble’s law: H z = 0 r , (142) c
where H0 is a constant (now called the Hubble con- stant). Since in the study of Hubble all the redshift were small, z < 0.04, he was able to use the classical non- relativistic realtion for small velocities (v c). From (48) the Doppler redshift is z v/c and Hubble’s law takes the form ≈
v = H0 r . (143)
Since the Hubble constant H0 can be found by dividing velocity by distance, it is customarily written in the 1 1 rather baroque units of km s− Mpc− . From Fig. 11 it 1 1 follows that H0 = 500 km s− Mpc− . However, it turned out that Hubble was severely underestimating the Fig. 5.—Distances from the core of Gaussian component centers as a function distances to galaxies. In Fig. 12 we show a more recent of time. The lines are the least-squares fits to outward motion with constant speed. determination of the Hubble constant from nearby galaxies, using HST data [84]. By combining results for component C2 in 0827+243, however, has been established of different research groups, the present day Hubble with a high degree of confidence. +5 1 1 expansion rate is H0 = 70 3 km s− Mpc− . − 4. DISCUSSION EXERCISE 6.2 The Sloan Digital Sky Survey (SDSS) Fig. 6.—Mosaic of images of 0827+243 at 22 GHz. The bright feature moves Inspection of Table 3 shows that different Gaussian compo- is a survey that mapped positions and distances of a nents in the same source have different apparent speeds. There approximatelyFIG. 10: Mosaic15 lt-yr inof 0.6 images yr (source of frame), 0827 for+243 an apparent at 22 GHz speed of [80]. about 25c. Only four of the six epochs are shown to prevent overlapping of images. million galaxies using a dedicated 2.5 m telescope in are two possible origins of these different apparent speeds. The 1 The peak flux densities at the four epochs are 1.2, 1.6, 1.2, and 1.1 Jy beamÀ , New Mexico [85]. In this exercise, you will use data first possibility is that the components move with different pat- respectively. Images have been rotated 25 clockwise and restored with a cir- tern speeds that are not necessarily equal to the bulk speed. Lister cular 0.5 mas beam. Model component C3 is at the center of the bright jet from this survey to calculate H0. In Fig. 13 we show (2006) concluded from a correlation of apparent speeds from atfeature. the surface of a star, so the whole sky should have the spectrum of a star in our galaxy and spectra of four the 2 cm and MOJAVE surveys with other source properties that a temperature equal to that at the surface of a typical star. distant galaxies, as measured by the SDSS. For each of various pattern speeds are present in the jets, but that the fastest the galaxies, we indicate the measured brightness in EXERCISE 6.1 (i) In a forest there are n trees per units of Joules per square meter per second. Assume hectare, evenly spaced. The thickness of each trunk is that each of them has the same luminosity as that of the 11 37 D. What is the mean distance that you have an unob- Milky Way (LMW = 10 L , or LMW = 4 10 J/s). (i) De- structed view into the woods, i.e. the mean free path? termine the distance to each of the four× galaxies, using (ii) How is this related to the Olbers paradox? the inverse-square law relation between brightness and 16 CHAPTER 2. FUNDAMENTAL OBSERVATIONS
26
FIG.Figure 11:2.4: Hubble’sEdwin Hubble’s originaloriginal plot ofplot theof relationthe relation betweenbetween redshiftredshift (vertical(vertical axis) axis)and anddistance distance(horizon (horizontaltal axis). Note axis).that Notethe thatvertical inaxis the actually plots cz rather than z – and that the units are accidentally written verticalas km rather axisthan he actuallykm/s. (from plotsHubblecz rather1929, thanProc.z,Nat. and thatAcad. theSci., units15, are accidentally written as km rather than km/s [83]. 2.3.168) REDSHIFT PROPORTIONAL TO DISTANCE 17
Figure forFIG. Problem 13: 4. Spectra measured by the SDSS [86].
velocity of recession for each galaxy, and in each case use the distances to estimate the Hubble constant, in 4 units of kilometers per second per Megaparsec. You will not get identical results from each of the galaxies, due to measurement uncertainties (but they should all be in the same ballpark), so average the results of the four galaxies to get your final answer.
Now a point worth noting at this juncture is that galax- ies do not follow Hubble’s law exactly. In addition to the FIG.Figure 12:2.5: AA moremore modernmodern v versionersion of ofHubble’s Hubble’splot, plot,showing showingcz versuscz expansion of the universe, galaxy motions are affected by distance. In this case, the galaxy distances have been determined using versusCepheid distance.variable stars Inas thisstandard case, thecandles, galaxyas describ distancesed in Chapter have been6. (from de- the gravity of specific, nearby structures, such as the pull terminedFreedman, usinget al. 2001, CepheidApJ, 553, variable47) stars as standard candles [84]. of the Milky Way and Andromeda galaxies on each other. Each galaxy therefore has a peculiar velocity, where pe- velocity away from Earth. Since the values of z in Hubble’s analysis were all culiar is used in the sense of “individual,” or “specific to small (z < 0.04), he was able to use the classical, nonrelativistic relation for itself.” Thus, the recession velocity of a galaxy is really luminosity.the Doppler shift, Expressz = v/c, yourwhere v answersis the radial bothvelocit iny of metersthe light andsource in megaparsecs,(in this case, a galaxy). andIn giveterpreting twothe significantredshifts as Doppler figures.shifts,(ii)Hubble’sThe v = H d + v , (144) spectrumlaw takes the ofform each of these objects shows a pair of strong 0 pec absorption lines of calcium,v = H which0r . have rest wavelength(2.6) where vpec is the peculiar velocity of the galaxy Since the Hubble constant H0 can be found by dividing velocity by distance, λ = 3935 Å and 3970 Å, respectively. The wavelengths1 1 it0is customarily written in the rather baroque units of km s− Mpc− . When along the line of sight. If peculiar velocities could ofHubble thesefirst linesdiscov inered theHubble’s galaxiesLaw, he havethough beent that shiftedthe numerical to longervalue of have any value, then this would make Hubble’s law 1 1 wavelengthsthe Hubble constan (i.e.,t was redshifted),H0 = 500 km s− Mp byc− the(see expansionFigure 2.4). Ho ofwev theer, useless. However, peculiar velocities are typically universe.it turned out Asthat a guide,Hubble w theas sev spectrumerely underestimating of a starthe likedistances the Sunto galaxies. only about 300 km/s, and they very rarely exceed is shownFigure 2.5 insho thews a uppermore recen panel;t determination the calciumof the Hubble linesconstan are att 1000 km/s. Hubble’s law therefore becomes accurate zerofrom nearb redshift.y galaxies, Measureusing data theobtained redshiftby (appropriately of each galaxy.enough) Thatthe for galaxies that are far away, when H0d is much is, calculate the fractional change in wavelength of the larger than 1000 km/s. Furthermore, we can often calcium lines. [Hint: The tricky part here is to make sure estimate what a galaxy’s peculiar velocity will be by you are identifying the right lines as calcium. In each looking at the nearby structures that will be pulling on it. case, they are a close pair; for Galaxy #2, they are the prominent absorption dips between 4100 Åand 4200 Å. EXERCISE 6.3 Suppose we observer two galaxies, one Measure the redshift for both of the calcium lines in at a distance of 35 Mly with a radial velocity of 580 km/s, each galaxy (in each case the two lines should give the and another at a distance of 1, 100 Mly with a radial ve- same redshift, of course!). Give your final redshift to locity of 25, 400 km/s. (i) Calculate the Hubble constant two significant figures. Do the intermediate steps of the for each of these two observations. (ii) Which of the calculation without rounding; rounding too early can two calculations would you consider to be more trust- result in errors. (iii) Given the redshifts, calculate the worthy? Why? (iii) Estimate the peculiar velocity of 18 CHAPTER 2. FUNDAMENTAL OBSER27 VATIONS the closer galaxy. (iv) If the more distant galaxy had this same peculiar velocity, how would that change your 2 calculated value of the Hubble constant? r We would expect intuitively that at any given time the 23 universe ought to look the same to observers in all typical r12 galaxies, and in whatever direction they look. (Hereafter 3 we will use the label “typical” to indicate galaxies that do not have any large peculiar motion of their own, but are simply carried along with the general cosmic flow r31 of galaxies.) This hypothesis is so natural (at least since Copernicus) that it has been called the cosmological prin- 1 ciple by Milne [87]. As applied to the galaxies themselves, the cosmologi-Figure 2.6:FIG.A 14:triangle A triangledefined definedby three by threegalaxies galaxiesin ina auniformly uniformly expanding expanding universe [88]. cal principle requires that an observer in a typical galaxyuniverse. should see all the other galaxies moving with the same pattern of velocities, whatever typical galaxy the ob- server happens to be riding in. It is a direct mathematicalHubble SpaceIf galaxiesTelescop aree. The currentlybest curren movingt estimate away fromof eachthe Hubble other, constant, consequence of this principle that the relative speedcombining of thisthe impliesresults of theydiff wereerent closerresearc togetherh groups, in theis past. Con- any two galaxies must be proportional to the distance sider a pair of galaxies currently separated by a distance between them, just as found by Hubble. To see this r, with a velocity v = H0r relative1 to each1 other. If there H0 = 70 7 km s− Mpc− . (2.7) consider three typical galaxies at positions ~r1, ~r2, and ~r3. are no forces acting to accelerate± or decelerate their rela- They define the triangle shown in Fig. 14, with sides of tive motion, then their velocity is constant, and the time This is the value for the Hubble constant that I will use in the remainder of length that has elapsed since they were in contact is this book. Cosmological innocents sometimesr exclaim,1 when first encountering Hub- r12 ~r1 ~r2 tH = = H− , (148) ≡ | − | ble’s Law, “Surely it must be a violationv 0 of the cosmological principle to r23 ~r2 ~r3 ≡ | − | have all those distant galaxies moving away from us! It looks as if we are r31 ~r3 ~r1 . (145) independent of the current separation r between galax- ≡ | − | at a special location in the1universe – the point away from which all other ies. The time H0− is generally referred to as the Hub- galaxies are fleeing.” In fact, what we1 see here1 in our Galaxy is exactly what In a homogeneous and uniform expanding universe the ble time. For H 70 km s− Mpc− , the Hubble time is shape of the triangle is preserved as the galaxiesy moveou would exp1 ect to see in ≈a universe which is undergoing homogeneous and H0− 14 Gyr. If the relative velocities of galaxies have away from each other. Maintaining the correct relativeisotropic expansion.been≈ constantWe insee thedistan past,t thengalaxies one Hubblemoving timeaway ago,from allus; but ob- lengths for the sides of the triangle requires an expansionservers in anthey galaxiesother galaxy in thew universeould also weresee distan crammedt galaxies togethermo intoving away from law of the form them. a small volume. To see onThea more observationmathematical of galaxylev redshiftsel what pointswe mean naturallyby homogeneous, to r12(t) = a(t) r12(t0) isotropic expansion,a big bang descriptionconsider three forgalaxies the evolutionat positions of the universe.!r , !r , and !r . They r (t) = a(t) r (t ) A big bang model could be broadly defined as a1 model2 3 23 23 0 define a triangle (Figure 2.6) with sides of length r31(t) = a(t) r31(t0) , (146) in which the universe expands from an initially highly dense state to its current low-density state. The Hubble where a(t) is a scale factor, which is totally independent time of 14 Gyr is comparabler12 !r1 to!r the2 ages computed for (2.8) ∼ ≡ | − | of location or direction. The scale factor a(t) tells us how the oldest known starsr23 in the universe.!r2 !r3 This rough equiv- (2.9) the expansion (or possibly contraction) of the universe alence is reassuring. However,≡ | − the| age of the universe r31 !r3 !r1 . (2.10) depends on time. At any time t, an observer in galaxy (i.e the time elapsed since≡ its| original− | highly dense state) #1 will see the other galaxies receding with a speed is not necessarily exactly equal to tH. On the one hand, if gravity working on matter is the only force at work on dr a˙ large scales, then the attractive force of gravity will act v (t) = 12 = a˙ r (t ) = r (t) 12 dt 12 0 a 12 to slow down the expansion. If this were the case, the dr31 a˙ universe was expanding more rapidly in the past than v31(t) = = ar˙ 31(t0) = r31(t) . (147) H 1 dt a it is now, and the universe is younger than 0− . On the other hand, if the energy density of the universe is dom- You can easily demonstrate that an observer in galaxy inated by a cosmological constant Λ (more on this later), #2 or galaxy #3 will find the same linear relation be- then the dominant gravitational force is repulsive, and 1 tween observed recession speed and distance, with a˙/a the universe may be older than H0− . playing the role of the Hubble constant. Since this ar- The horizon distance is defined as the greatest dis- gument can be applied to any trio of galaxies, it implies tance a photon can travel during the age of the universe. that in any universe where the distribution of galaxies The Hubble distance, H = c/H0 4.3 Gpc, provides a is undergoing homogeneous, isotropic expansion, the natural distance scale.R However,≈ just as the age of the 1 velocity-distance relation takes the linear form v = Hr, universe is roughly equal to H0− in most big bang mod- with H = a˙/a. els, with the exact value depending on the expansion 22 CHAPTER 16 COSMOLOGY
EXAMPLE 16.2 Critical Density of the Universe We can estimate the critical mass density of the Universe, Using H ϭ 23 ϫ 10Ϫ3 m/(s · lightyear), where 1 light- 15 Ϫ11 2 2 c, using classical energy considerations. The result turns year ϭ 9.46 ϫ 10 m and G ϭ 6.67 ϫ 10 N · m /kg , out to be in agreement with the rigorous predictions of yields a present value of the critical density c ϭ 1.1 ϫ general relativity because of the simplifying assumption 10Ϫ26 kg/m3. As the mass of a hydrogen atom is 1.67 ϫ Ϫ27 that the mass of the Universe is uniformly distributed. 10 kg, c corresponds to about 7 hydrogen atoms per cubic meter, an incredibly low density. Solution Figure 16.16 shows a large section of the Uni- verse with radius R with the critical density, containing a 28 total mass M, where M consists of the total mass of matter 2 history of theplus universe, the effective one mass horizon of radiation is roughly with energy equal E, E/c . A ~v galaxy of mass m and speed v at R will just escape to infin- v to c/H0, with theity with exact zero value, speed if again, the sum depending of its kinetic on energy the and m expansion history.gravitational potential energy is zero. Thus, Before proceeding any further, two qualifications have 1 2 GmM to be attached to the cosmologicalEtotal ϭ 0 ϭ K ϩ principle.U ϭ 2 mv Ϫ First, it is R ⇢ ,M obviously not true on small scales – we are in a Galaxy m 4 3 1 2 Gm 3R c which belongs to a small2 mv ϭ local group of other galaxies, R which in turn lies near the enormousR cluster of galaxies 2 8G 2 in Virgo. In fact, of the 33v galaxiesϭ R in Messier’sc catalogue, almost half are in one small part3 of the sky, the constella- tion of Virgo. TheBecause cosmological the galaxy of principle, mass m obeys if at the all Hubble valid, law, comes into playv ϭ onlyHR, the when preceding we viewequation the becomes universe on a scale at least as large as the distance between clusters Figure 16.16 (Example 16.2) A galaxy escaping from a 8G 3H 2 FIG.large 15: Spherical cluster contained region of within galaxies radius with R. Onlya larger the radius mass than H 2 ϭ or ϭ of galaxies, or about 100 million3 c light years.c 8 Second,G the distancewithin R slows between the mass clusters m. of galaxies, but smaller radius in using the cosmological principle to derive the rela- than any distance characterizing the universe as a whole. tion of proportionality between galactic velocities and distances, we suppose the usual rule for adding v c. This, of course, was not a problem for Hubble in 1929, 16.6 FREIDMANN MODELS AND THE AGE as none of the galaxies he studied then had a speed any- OF THE UNIVERSEB. Friedmann-Robertson-Walker cosmologies where near the speed of light. Nevertheless, it is im- portant to stress that when one thinks aboutFreidmann really’ larges work established the foundation for describing the time evolu- tion of the Universe based on general relativity. General relativity must be distances characteristic of the universe, as a whole, one In 1917 Einstein presented a model of the universe used in cosmological calculations because it correctly describes gravity, the must work in a theoretical framework capable of dealing based on his theory of general relativity [89]. It de- with velocities approaching the speed ofmost light. important force determining the Universe’s structure, over immense cos- mological distances. Newtonianscribes a geometricallytheory can lead symmetricto errors when (spherical) applied to space the with Note how Hubble’s law ties in with Olbers’ paradox. finite volume but no boundary. In accordance with the 1 Universe as a whole because it assumes that the force of gravity is always attrac- If the universe is of finite age, t H− , then the night H ∼ 0 tive and is instantaneouslycosmological transmitted. principle, Although the Freidmann model is homogeneous did consider and sky can be dark, even if the universe is infinitelymodels both large, with andisotropic. without Einstein It is also’s repulsive static: form the volumeof gravity of (cosmologi- the space does because light from distant galaxies has notcal yetconstant), had time it is easiestnot to change. see the general In order form to of obtain Big Bang a static behavior model, without Einstein to reach us. Galaxy surveys tell us thatintroducing the luminosity repulsive introducedgravitational forces a new at repulsive this point. force in his equations. The density of galaxies in the local universe is Freidmann found threesize of types this of cosmological time-dependent term universes, is given which by may the be cosmo- described in terms of the universal expansion scaling factor a(t). Figure 16.17 8 3 logical constant Λ. Einstein presented his model before nL 2 10 L Mpc− . shows a(t )(the(149) separation between galaxies) as a function of time for the ≈ × the redshifts of the galaxies were known, and taking the three cases labeled openuniverse universe, to fl beat universe, static was and then closed a reasonable universe. Note assumption. that By terrestrial standards, the universe isa( nott) alone a well-lit has a value of zero at the lower-left corner of the graph, not t, and When the expansion of the universe was discovered, this place; this luminosity density is equivalentthat to the a single three 40curves start at different times in the past in order to give the argument in favor of a cosmological constant vanished. watt light bulb within a sphere 1 AU insame radius. scaling If factor the at the present time, denoted t 0. Open universes have less Einstein himself later called it the biggest blunder of his horizon distance is c/H , then the totalmass flux and ofenergy light than that needed to halt the expansion. They start with a scale H 0 life. Nevertheless, the most recent observations seem to we receive from allR the≈ stars from all the galaxiesfactor of zero within and grow without limit, any given galaxy approaching a limiting indicate that a non-zero cosmological constant has to be the horizon will be present. Z H CopyrightR 2005 cThomson Learning,11 Inc. All Rights2 Reserved. Fgal nL dr nL 9 10 L Mpc− In 1922, Friedmann [90, 91] studied the cosmological ≈ 0 ∼ H0 ∼ × solutions of Einstein equations. If Λ = 0, only evolv- 11 2 2 10− L AU− . (150) ing, expanding or contracting models of the universe ∼ × are possible. The general relativistic derivation of the By the cosmological principle, this is the total flux of law of expansion for the Friedmann models will not be starlight you would expect at any randomly located spot given here. It is interesting that the existence of three in the universe. Comparing this to the flux we receive types of models and their law of expansion can be de- from the Sun, rived from purely Newtonian considerations, with re- sults in complete agreement with the relativistic treat- L 2 F = 0.08L AU− , (151) ment. Moreover, the essential character of the motion 4π AU2 ≈ can be obtained from a simple energy argument, which
10 we discuss next. we find that Fgal/F 3 10− . Thus, the total flux of starlight at a randomly ∼ selected× location in the universe Consider a spherical region of galaxies of radius R. is less than a billionth the flux of light we receive from (For the purposes of this calculation we must take R to the Sun here on Earth. For the entire universe to be as be larger than the distance between clusters of galaxies, well-lit as the Earth, it would have to be over a billion but smaller than any distance characterizing the universe times older than it is; and you would have to keep the as a whole, as shown in Fig. 15. We also assume Λ = 0.) stars shining during all that time. The mass of this sphere is its volume times the cosmic 29 mass density, the equation derived from general relativity [63]. For k = 0, the value of H fixes the so-called critical density as 4 π R3 M = ρm . (152) 3H2c2 3 ρ(k = 0) ρc = . (159) ≡ 8πG We can now consider the motion of a galaxy of mass m at the edge of the spherical region. According to Hub- Since we know the current value of the Hubble parame- ble’s law, the velocity of the galaxy is v = HR, and its ter to within 10%, we can compute the current value of hide corresponding kinetic energy the critical density to within 20%. We usually this uncertainty by introducing h,
1 2 1 2 2 1 1 K = mv = mH R . (153) H0 = 100 h km s− Mpc− , (160) 2 2 such that In a spherical distribution of matter, the gravitational 11 2 3 force on a given spherical shell depends only on the ρc,0 = 2.77 10 h M /Mpc × mass inside the shell. The potential energy at the edge 29 2 3 = 1.88 10− h g/cm of the sphere is × 5 2 3 = 1.05 10− h GeV/cm . (161) 2 × GMm 4πmR ρmG +0.05 U = = . (154) Note that since h 0.70 0.03 a flat universe requires an − R − 3 energy density of ≈ 10 protons− per cubic meter. ∼ Hence, the total energy is The expansion of the universe can be compared to the motion of a mass launched vertically from the surface
1 2 2 4π 2 of a celestial body. The form of the orbit depends on E = K + U = mH R Gm R ρm . (155) 2 − 3 the initial energy. In order to compute the complete or- bit, the mass of the main body and the initial velocity which has to remain constant as the universe expands. have to be known. In cosmology, the corresponding Likewise, parameters are the mean density and the Hubble con- stant. On the one hand, if the density exceeds the critial 2E 2 8π = H Gρm . (156) density, the expansion of any spherical region will turn mR2 − 3 to a contraction and it will collapse to a point. This Since we assume that the universe is homogeneous, H corresponds to the closed Friedmann model. On the other hand, if ρm < ρc, the ever-expanding hyperbolic and ρm cannot be functions of R. Thus, the left-hand- side of (156) cannot depend on the chosen distance R to model is obtained. These three models of the universe the coordinate center. However, the value of 2E/(mR2) are called the standard models. They are the simplest is time-dependent, because the distance between us and relativistic cosmological models for Λ = 0. Models with the galaxy will change as the universe expands. Since Λ , 0 are mathematically more complicated, but show the mass m of our test galaxy is arbitrary, we can choose the same behaviour. The simple Newtonian treatment it such that 2E/(mc2) = 1 holds at an arbitrary moment of the expansion problem is possible because Newtonian | | mechanics is approximately valid in small regions of the as long as E , 0. For different times, the left-hand-side scales as R 2 and thus we can rewrite (156) as universe. However, although the resulting equations − are formally similar, the interpretation of the quantities a˙ 2 8π kc2 involved is not the same as in the relativistic context. = Gρm . (157) The global geometry of Friedmann models can only be a 3 − a2R2 0 understood within the general theory of relativity [63]. Next, we define the abundance Ω of the different play- Note that because E is constant, k is constant too. Ac- i ers in cosmology as their energy density relative to ρ . tually, k = 0, 1 is generally known as the curvature c For example, the dimensionless mass density parameter constant. Throughout± the subscripted “0”s indicate that is found to be quantities (which in general evolve with time) are to be 2 evaluated at present epoch. Finally, we account for the ρmc 8πG Ωm = = 2 ρm . (162) equivalence of mass and energy by including not only ρc 3H 2 the mass but also the energy density, ρ = ρmc + and so (157) becomes ··· For simplicity, for the moment we will keep considering scenarios with Λ = 0, but we advance the reader that a˙ 2 8π ρ kc2 H2 G c2 = 2 2 . (158) Λ ≡ a 3 c − a2R ΩΛ = . (163) 0 3H2 which is Friedmann equation (without cosmological con- Now, what about our universe? On a large scale stant) in the Newtonian limit. (158) agrees exactly with what is the overall curvature of the universe? Does it 30 have positive curvature, negative curvature, or is it flat? There is a caveat to the statement that the expansion of a By solving Einstein equations, Robertson [92, 93] and homogeneous universe is adiabatic: when particles anni- Walker [94], showed that the three hypersurfaces of con- hilate, such as electrons and positrons, this adds heat and stant curvature (the hyper-sphere, the hyper-plane, and makes the expansion temporarily non-adiabatic. This the hyper-pseudosphere) are indeed possible geometries matters at some specific epochs in the very early uni- for a homegeneous and isotropic universe undergoing verse. expansion. The metric they derived, independently of For a sphere of comoving radius R0, each other, is called the Friedmann-Robertson-Walker 4 (FRW) metric. The line element is most generally writ- V = π R3 a3(t) , (169) ten in the form 3 0 " # d%2 and so ds2 = c2dt2 a2(t) + %2dΩ2 , (164) − 1 k%2/R2 a˙ − V˙ = 4π R3 a2 a˙ = 3 V . (170) 0 a where dΩ2 = dθ2+sin2 θdφ2. It is easily seen that the spa- Since U = ρV, tial component of the FRW metric consists of the spatial metric for a uniformly curved space of radius R, scaled a˙ a t U˙ = ρ˙V + ρV˙ = V ρ˙ + 3 ρ . (171) by the square of the scale factor ( ). If the universe had a a positive curvature k = 1, then the universe would be closed, or finite in volume. This would not mean that the Substituting (170) and (171) into (168) we have stars and galaxies extended out to a certain boundary, be- a˙ a˙ yond which there is empty space. There is no boundary V ρ˙ + 3 ρ + 3 P = 0 (172) or edge in such a universe. If a particle were to move in a a a straight line in a particular direction, it would eventually and thus return to the starting point – perhaps eons of time later. On the other hand, if the curvature of the space was zero a˙ ρ˙ = 3 ρ + P . (173) k = 0 or negative k = 1, the universe would be open. It − a could just go on forever.− Using the substitution This fluid equation describes the evolution of energy den- sity in an expanding universe. It tells us that the ex- R sin(r/R) for k = +1 pansion decreases the energy density both by dilution % = S (r) = r for k = 0 ; (165) and by the work required to expand a gas with pressure k R sinh(r/R) for k = 1 P 0. − ≥To solve this equation, we need an additional equation the FRW line element can be rewritten as of state relating P and ρ. Suppose we write this in the form h i ds2 = c2dt2 a2(t) dr2 + S2(r) dΩ2 ; (166) − k P = wρ . (174) see Appendix E for details. In principle, w could change with time, but we will as- The time variable t in the FRW metric is the cosmolog- sume that any time derivatives of w are negligible com- ical proper time, called the cosmic time for short, and is pared to time derivatives of ρ. This is reasonable if the the time measured by an observer who sees the universe equation of state is determined by “microphysics” that expanding uniformly around him. The spatial variables is not directly tied to the expansion of the universe. The (%, θ, φ) or (r, θ, φ) are called the comoving coordinates fluid equation then implies of a point in space. If the expansion of the universe is perfectly homogeneous and isotropic, the comoving ρ˙ a˙ = 3(1 + w) , (175) coordinates of any point remain constant with time. ρ − a Todescribe the time evolution of the scale factor a(t) we need an additional equation describing how the energy with solution content of the universe ρ is affected by expansion. The ρ a 3(1+w) first law of thermodynamics, = − . (176) ρ0 a0 dU = TdS PdV, (167) − The pressure in a gas is determined by the thermal mo- with dQ = 0 (no heat exchange to the outside, since no tion of its constituents. For non-relativistic matter (a.k.a. outside exists) becomes cosmological dust), P mv2 v dU dV w dU = PdV + P = 0 . (168) = 2 2 1 , (177) − ⇒ dt dt ρ ∼ mc ∼ c 31 where v is the thermal velocity of particles with mass and substitutte from the fluid equation m. To a near-perfect approximation w = 0, implying 3 a ρm a− . Light, or more generally any highly relativistic ρ˙ = 3(ρ + P) (183) particle,∝ has an associated pressure (radiation pressure). a˙ − Pressure is defined as the momentum transfer onto a to obtain the acceleration equation perfectly reflecting wall per unit time and per unit area. Consider an isotropic distribution of photons (or another a¨ 4πG kind of particle) moving with the speed of light. The mo- = (ρ + 3P) . (184) a − 3c2 mentum of a photon is given in terms of its energy as p = E/c = hν/c. Consider now an area element dA of We see that if ρ and P are positive, the expansion of the the wall; the momentum transferred to it per unit time universe decelerates. Higher P produces stronger decel- is given by the momentum transfer per photon, times eration for given ρ, e.g., a radiation-dominated universe the number of photons hitting the area dA per unit time. decelerates faster than a matter-dominated universe. We will assume for the moment that all photons have In the remainder of this section, we consider a flat uni- the same frequency. If θ denotes the direction of a pho- verse, i.e., k = 0. It is easily seen that for non-relativistic ton relative to the normal of the wall, the momentum matter, the solution to Friedmann equation (158) is given component perpendicular to the wall before scattering by is p = p cos θ, and after scattering p = p cos θ; the ⊥ ⊥ − 2/3 2 two other momentum components are unchanged by t ρ0 ρ0t a t t 0 the reflection. Thus, the momentum transfer per pho- ( ) = and ρ( ) = 3 = 2 , (185) t0 a t ton scattering is ∆p = 2p cos θ. The number of photons scattering per unit time within the area dA is given by with the number density of photons, n times the area element 2 1 dA, times the thickness of the layer from which photons t0 = , (186) arrive at the wall per unit time. The latter is given by 3 H0 c cos θ, since only the perpendicular velocity component where we have used (159). Following the same steps for a brings them closer to the wall. Putting these terms to- bizarre universe, which is dominated today by radiation gether, we find for the momentum transfer to the wall pressure, yields the solution per unit time per unit area the expression 1/2 2 2 t ρ0 ρ0t P(θ) = 2hν n cos θ . (178) a t t 0 ( ) = and ρ( ) = 4 = 2 . (187) t0 a t Averaging this expression over a half-sphere (only pho- tons moving towards the wall can hit it) then yields From this simple exercise we can picture the the time evolution of the universe as follows. In the early 1 1 universe all matter is relativistic and radiation pressure P = hνn = ρ . (179) dominates: a(t) t1/2, ρ t 2, and ρ a 3 t 3/2. 3 3 rad − m − − The density of radiation∝ then∝ falls more∝ quickly∝ than Then for radition, w = 1/3, implying ρ a 4. This be- that of dust. On the other hand, when dust dominates: rad − 2/3 2 4 8/3 ∝ a(t) t , ρm t− , and ρ a− t , hence dust havior also follows from a simple argument: the number ∝ ∝ rad ∝ ∝ 3 domination increases. density of photons falls as n a− , and the energy per 1 ∝ photon falls as hν a− because of cosmological redshift (more on this below).∝ EXERCISE 6.4 Using the Hubble flow v = H0r show Next, we obtain an expression for the acceleration of that the expansion of the universe changes the particle number density according to n˙ = 3H n. the universe. If we multiply our standard version of the − 0 Friedmann equation by a2, we get In closing, we discuss how to measure distances in the 8πG kc2 FRW spacetime. Consider a galaxy which is far away a˙2 = ρa2 . (180) from us, sufficiently far away that we may ignore the 3c2 − R2 0 small scale perturbations of spacetime and adopt the FRW line element. In an expanding universe, the dis- Take the time derivative of (180) tance between two objects is increasing with time. Thus, 8πG if we want to assign a spatial distance between two ob- 2a˙a¨ = ρ˙a2 + 2ρaa˙ . (181) jects, we must specify the time t at which the distance 3c2 is the correct one. Suppose that you are at the origin, divide by 2aa˙ and that the galaxy which you are observing is at a co- moving coordinate position (r, θ, φ). We define a proper a¨ 4πG a distance, as the distance between two events A and B in = ρ˙ + 2ρ , (182) a 3c2 a˙ a reference frame for which they occur simultaneously 32
(tA = tB). In other words, the proper distance dp(t) be- EXERCISE 6.6 Consider a positively curved universe tween two points in spacetime is equal to the length of (k = 1), in which the sole contribution to the energy the spatial geodesic between them when the scale factor density comes from non-relativistic matter. In this case 3 is fixed at the value a(t). The proper distance between the energy density has the dependence ρm = ρm,0/a . the observer and galaxy can be found using the FRW (i) Write down Friedmann equation for this universe and metric at a fixed time t, show that the parametric solution, h i 2 2 2 2 2 2 ds = a (t) dr + Sk(r) dΩ . (188) 4πGρm,0R a(θ) = 0 (1 cos θ) , 3c4 − Along the spatial geodesic between the observer and 3 4πGρm,0R galaxy, the angle (θ, φ) is constant, and thus t(θ) = 0 (θ sin θ) , (193) 3c5 − ds = a(t) dr . (189) satisfies the Friedmann equation. Here θ is a dimen- Likewise, using spatial variables (%, θ, φ) we have sionless parameter that runs from 0 to 2π, and R0 is the present radius of curvature if we have normalized 2 1/2 ds = a(t)[1 k(%/R) ]− dr (190) the scale factor at present to a(t0) = 1. (ii) What is amax, − the maximum possible scale factor for this universe? The proper distance dp is found by integrating over the (iii) What is the maximum value that the physical radius radial comoving coordinate r of curvature (aR0) reaches? (iv) What is the age of the universe when this maximum radius is reached? Z r (v) What is tcrunch, the time at which the universe dp = a(t) dr = a(t) r , (191) 0 undergoes a big crunch (that is a recollapse to a = 0)? [Hint: Recall that a˙ = da/dt = da/dθ dθ/dt.] or using (165) EXERCISE 6.7 Consider a positively curved universe k 1/2 1 √k R k − sin− ( %/ ) for = +1 (k = 1), in which the sole contribution to the energy d = a(t) % for k = 0 . (192) − p density comes from non-relativistic matter, and so the en- 1/2 1 3 k − sinh− ( √ k %/R) for k = 1 ergy density has the dependence ρ = ρ /a . (ii) Write | | | | − m m,0 down Friedmann equation for this universe and show In a flat universe, the proper distance to an object that the parametric solution, is just its coordinate distance, dp(t) = a(t)%. Because 1 1 2 sin− (x) > x and sinh− (x) < x, in a closed universe 4πGρm,0R a(θ) = 0 (cosh θ 1) , (k > 0) the proper distance to an object is greater than its 3c4 − coordinate distance, while in an open universe (k < 0) 3 4πGρm,0R the proper distance to an object is less than its coordinate t(θ) = 0 (sinh θ θ) , (194) distance. 3c5 − satisfies the Friedmann equation. (ii) Compare the time EXERCISE 6.5 A civilization that wants to conquer dependence of the scale factor for open, closed and the universe, which is homogeneous and isotropic, and critical matter-dominated cosmological models in a hence is described by the FRW metric, is getting ready log-log plot. to send out soldiers in all directions to invade all the universe out to a proper distance dp. Every soldier leaves the galaxy where the civilization was born, and travels through the universe with its spaceship along a C. Age and size of the Universe geodesic, out to a distance dp from the original galaxy. At the end of the invasion, which occurs at a fixed time t, all the soldiers stand on a spherical surface at a In special (and general) relativity the propagation of proper distance dp from their original galaxy. The total light is along a null geodesic (ds = 0). If we place the volume that has been invaded is the volume inside this observer at the origin (% = 0), and we choose a radial spherical surface. What is the total volume invaded? null geodesic (dθ = dφ = 0), we have Answer this question for the following three cases: (i) A flat metric (k = 0). (ii) A closed metric (k = +1) cdt d% = , (195) with radius of curvature R at the cosmic time t when a(t) ±[1 k(%/R)2]1/2 − the invaded volume and the proper distance dp are measured. (iii) An open metric (k = 1) with radius where + is for the emitted light ray and the is for a re- of curvature R at the cosmic time t when− the invaded ceived one. Imagine now that one crest of the− light wave volume and the proper distance dp are measured. was emitted at time tem at distance %em, and received at the origin %0 = 0 at t0, and that the next wave crest was 33 Distances in Cosmology
) or equivalently us Today distance,d0 em t ∆tem a(tem) t0 photon arrives = . (202) 0 ∆t a(t ) t 0 0 0 (
c The time interval between successive wave crests is the
= inverse of the frequency of the light wave, related to its wavelength by the relation c = λν. Hence, from (141) the LT
d redshift is
λ0 a0
Time z = 1 = 1 ; (203) λem − a(tem) − i.e., the redshift of a galaxy expresses how much the scale factor has changed since the light was emitted. The light detected today was emitted at some time t photon emitted em tem and, according to (203), there is a one-to-one correspon- em dence between z and tem. Therefore, the redshift z can Emission distance ,d Light travel distance em be used instead of time t to parametrize the history of the universe. A given z corresponds to a time when our FIG. 16: Cosmological redshift. universe was 1 + z times smaller than now. Generally, the expressions for a(t) are rather compli- cated and one cannot directly invert (203) to express the cosmic time t tem in terms of the redshift parameter emitted at tem + ∆tem and received at t0 + ∆t0; see Fig. 16. ≡ The two waves satisfy the relations: z. It is useful, therefore, to derive a general integral expression for t(z). Differentiating (203) we obtain
Z t0 Z %0 dt 1 d% a0 = p (196) dz = a˙(t)dt = (1 + z)H(t)dt , (204) a t c 2 2 tem ( ) − %em 1 k(%/R) −a (t) − − and from which follows that Z Z t t Z ∞ dz 0+∆ 0 dt 1 %0 d% t = . (205) = p . (197) z H(z)(1 + z) a t c 2 tem+∆tem ( ) − %em 1 k(%/R) − A constant of integration has been chosen here so that Now, substract (196) from (197) z corresponds to the initial moment of t = 0. →To ∞ obtain the expression for the Hubble parameter H Z t0+∆t0 Z t0 dt dt in terms of z and the present values of H0 and Ωm,0, it is = 0 (198) convenient to write the Friedmann equation (158) in the t +∆t a(t) − t a(t) em em em form and expand 2 kc ρm(z) H2 z z 2 H2 ( ) + 2 2 (1 + ) = Ωm,0 0 , (206) Z t0+∆t0 Z t0 Z t0+∆t0 a R ρm dt dt dt 0 0 ,0 = + a t a t a t tem+∆tem ( ) tem ( ) t0 ( ) where the definitions in (162) and (203) have been used. Z t t em+∆ em dt At z = 0, this equation reduces to (199) − t a(t) 2 em kc 2 = (Ωm 1)H , (207) 2 2 ,0 − 0 to obtain a0R0
Z t t Z t t 0+∆ 0 dt em+∆ em dt allowing us to express the current value of a0R0 in a = . (200) spatially curved universe (k , 0) in terms of H0 and t a(t) t a(t) 0 em Ωm,0. Taking this into account, we obtain Any change in a(t) during the time intervals between q 2 successive wave crests can be safely neglected, so that H(z) = H0 (1 Ωm,0)(1 + z) + Ωm,0 ρm(z)/ρm,0 − a(t) is a constant with respect to the time integration. q 2 3 Consequently, = H0 (1 Ωm,0)(1 + z) + Ωm,0(1 + z) . (208) − ∆t ∆t We can now complete our program by finding an ex- em = 0 , (201) a(tem) a(t0) pression for the comoving radial distance coordinate r as The Horizon 34
in the radiation and matter-dominated eras, so there is a Radius of horizon. observable universe The proper distance from the origin to %h is given by Z %h d% d (t) a(t) h = 2 1/2 0 [1 k(%/R) ] Z t − cdt = a(t) 0 . (212) 0 a(t0)
For k = 0, using (185) and (187) we obtain dh = 2ct in the radiation-dominated era, and dh(t) = 3ct in the matter- dominated era. Now, substituting (186) into (185) we have 3 2/3 a(t) = H t (213) 2 0 and so from (203) it follows that FIG. 17: Cosmological horizon. 2 1 t (214) = 3/2 . 3 H0 (1 + z) a function of the reshift z. Since photons travel on null For the matter-dominated era, the proper horizon dis- geodesics of zero proper time, we see directly from the tance is metric (166) that 2c d (215) h = 3/2 . Z Z Z H0 (1 + z) cdt dt dz r = = c (1 + z)dz = c , (209) For a flat universe with Ω = 1, we find that at present − a(t) − dz H(z) m,0 time, 28 1 1 with H(z) given by (208). d = 2c/H = 1.85 10 h− cm = 6 h− Gpc . (216) h,0 0 × As the universe expands and ages, an observer at any Note that because a0 = 1, we have %h,0 = dh,0. point is able to see increasingly distant objects as the light from them has time to arrive, see Fig. 17. This means EXERCISE 6.8 Consider a flat model containing only that, as time progresses, increasingly larger regions of matter, with Ωm,0 = 1, and present Hubble constant H0. (i) What is the comoving distance to the horizon (z = )? the universe come into causal contact with the observer. ∞ The proper distance to the furthest observable point (the (ii) What is the redshift at which the comoving distance is half that to the horizon? (iii) What is the ratio of the age particle horizon) at time t is the “horizon distance”, dh(t). Again we return to the FRW metric, placing an ob- of the universe at that redshift, to its present age? (iv) At server at the origin (% = 0) and letting the particle horizon which redshift did the universe have half its present age? for this observer at time t be located at radial coordinate In closing, we show that Hubble’s law is indeed an distance %h. This means that a photon emitted at t = 0 at approximation for small redshift by using a Taylor ex- %h will reach the observer at the origin at time t. Recalling photons move along null geodesics (ds = 0) and consid- pansion of a(t), ering only radially traveling photons (dθ = dφ = 0), we 1 a(t) = a(t ) + (t t )a˙(t ) + (t t )2a¨(t ) + find 0 − 0 0 2 − 0 0 ··· Z t Z %h 1 2 2 dt0 1 d% = a(t ) 1 + (t t )H (t t ) q H + , (210) 0 0 0 0 0 0 = 2 1/2 , − − 2 − ··· 0 a(t0) c 0 [1 k(%/R) ] 2 − where q0 a¨(t0)a(t0)/a˙ (t0) is the deceleration param- yielding eter (it is named≡ − “deceleration” because historically, an accelerating universe was considered unlikely). If the h R t i sin c dt0/a(t0) for k = +1 expansion is slowing down, a¨ < 0 and q0 > 0. For not too 0 R t large time-differences, we can use the Taylor expansion %h = c dt /a(t ) for k = 0 . (211) 0 0 0 of a(t) and write h R t i sinh c dt0/a(t0) for k = 1 0 − 1 a(t) 1 z = 1 + (t t0)H0 . (217) If the scale factor evolves with time as a(t) = tα, with − ≈ 1 + z a(t0) ≈ − α > 1, we can see that the time integral in (211) diverges Hence Hubble’s law, z = (t0 t)H0 = d/cH0, is valid as as we approach t = 0. This would imply that the whole long as z H (t t) 1.− Deviations from its linear 0 0 − universe is in causal contact. However, α = 1/2 and 2/3 form arises for z & 1 and can be used to determine q0. 35 2.5 Kinematic tests 61 2.5 Kinematic tests 63 ϕ0 = const (✓0, 0) θ0 = const ∆θ
l (t0, %0 = 0) ` ∆θ ✓ observer χ = 0 χ (t1,em%1) t = t0 ϕ0 tem (✓0 +θ0 + ∆θ✓, 0) Fig. 2.11. FIG. 18: Extended object of given transverse size ` at comoving tem propagate along radial geodesics and arrive today with an apparent angular distanceseparation%1 from!θ. The the proper observer size of the object, [95].l, is equal to the interval between the emission events at the endpoints:
2 l !s a(tem) #(χem) !θ, (2.68) = − = ! D.as obtained Angular from metric diameter (2.2). The angle and subtended luminosity by the object isdistances then l l !θ , (2.69) z = 5 4 z = a(tem) #(χem) = a(η χem) #(χem) / The angular diameter distance0 − to an object is defined in termswhere of we thehave used object’s the fact that actualthe physical timesize,tem corresponds`, and toθ thethe conformal angular Fig. 2.12. time ηem η0 χem. If the object is close to us, that is, χem η0, then size of the object= − as viewed from earth.≪ Consider a light FIG. 19: For a flat univrese filled with dust dA(z) has a maximum a(η0 χem) a(η0) , #(χem) χem, directionsat z = in5 the/4, skycorresponding differs; this temperature to the redshift difference at which depends objects on the of angular a source of size ` at % =− %1 ≈and t = t1≈subtending an angle and separation.given The proper power size spectrum` will issubtend observed the to haveminimum a series angle of peaks∆θ ason the the angular ∆θ at the origin (% = 0, t = t0) as shown in Fig. 18. The l l separationsky. At is varied redshifts fromz large> 5 to/4 small objects scales. of The a given “first properacoustic sizepeak”` iswill roughly proper distance ` between!θ the two. ends of the object is ≈ a(η0) χem = D determinedappear by bigger the sound on the horizon sky atwith recombination, increasing thez [95]. maximum distance that a related to ∆θ by, We see that in this case !θ is inversely proportional to the distance, as expected. sound wave in the baryon–radiation fluid can have propagated by recombination. However, if the object is located far away, namely, close to the particle horizon, This sound horizon serves as a standard ruler of length l H 1(z ). Recombin- then η χem η , and s − r 0 − ≪ 0 ` ∼ ∆θ = . (218) ationand occurs consider at redshift thezr FRW1100. metricSince !0z asr being1, we centredcan set χem on(zr ) theχp in a(η0 χem) a(η0) , a#((χtem1))%1 # χp const. ≃ ≫ 1 = − ≪ → = (2.70) and in a dust-dominated universe, where # χp 2(a0 H0!0)− (see (2.9)), " # source. However, because of homogeneity,= the comov- The angular size of the object, we obtain ing distance between the source and the observer %1 is the We now define the angular diameterl distance ! " !θ , zr H0!0 1 1/2 1/2 ∝ a(η0 χem) same as we would calculate when1/2 we place the origin at − $θr zr− !0 0.87◦!0 . (2.73) ` our location. The≃ 2H photons(zr ) ≃ 2 from the≃ source are therefore dA = (219) ∆θ passing through a sphere, on3 which1/2 we sit, of proper We have substituted here H /H(zr ) ! z − , as follows from (2.61). Note that 20 2 ≃ 0 r surface area 4πa0%1. However, the redshift still affects 3/2 so that in Euclidean space, the corresponding angular size would be $θr tr /t0 zr− , the flux density in four further! " ways: (i) photon≃ energies≈ or about 1000 times smaller. %1 are redshifted, reducing the flux density by a factor 1+z; d = a(t )% = . (220) The remarkable aspect of this result is that the angular diameter depends directly A 1 1 (ii) photon arrival rates are time dilated, reducing the 1 + z only on !0, which determines the spatial curvature, and is not very sensitive to flux density by a further factor 1 + z; (iii) opposing this, In analogy with (210) we write other parameters. As we will see in Chapter 9, this is true not only for a dust- dominatedthe bandwidth universe, as considereddν is reduced here, but by for a a factor very wide 1 + rangez, which of cosmological in- Z t Z creases the energy flux per unit bandwidth by one power 1 dt 1 %1 d% models, containing multiple matter components. Hence, measuring the angular = , (221) scaleof of 1the+ firstz; (iv) acousticfinally, peak has the emerged observed as the photons leading and at most frequency direct method a(t) c [1 k(%/R)2]1/2 0 0 − for determiningν0 were emitted the spatialat curvature. frequency Our best (1 evidence+ z)ν0. that Overall, the universe the is flux spatially flat (density!0 1), as is predicted the luminosity by inflation, at comes frequency from this (1 test.+ z)ν , divided From an examination point of view, only proficiency in = 0 by the total area, divided by (1 + z): the k = 0 case will be expected. Hence,
Z t1 Z z Lν([1 + z]ν0) dt dz ν(ν0) = %1 = c = c , (222) F 2 2 4πa0%1(r)(1 + z) 0 a(t) 0 H(z) L (ν ) = ν 0 , (224) where in the last equality we used (209). Then, for a 4πa2%2(1 + z)1+α flat universe filled with dust, the angular diameter as a 0 1 z function of is where the second expression assumes a power-law spec- 3/2 α `H0 (1 + z) trum L ν− . We can integrate over ν0 to obtain the ∆θ(z) = . (223) corresponding∝ total or bolometric formulae 2c (1 + z)1/2 1 − L At low redshifts (z 1), the angular diameter decreases = . (225) F 2 2 2 in inverse proportion to z, reaches a minimum at z = 5/4, 4πa0%1(1 + z) and then scales as z for z 1; see Fig. 19 Perhaps the most important relation for observational The luminosity distance dL is defined to satisfy the rela- cosmology is that between the monochromatic flux den- tion (36). Thus, sity and luminosity. Start by assuming isotropic emis- 2 sion, so that the photons emitted by the source pass with dL = (1 + z)%1 = (1 + z) dA , (226) a uniform flux density through any sphere surround- ing the source. We can now make a shift of the origin, where we have taken a0 = 1. 36
26 VII. THE FORCE AWAKENS Figure 3. Observed magnitude 0.0001 Supernova Cosmology versus redshift is plotted for Project 12,13 well-measuresd distant and 24 High-Z Supernova (in the inset) nearby7 type Ia su- 0.001 Search pernovae. For clarity, measure- 22 pty Hamuy et al. Independent cosmological observations have un- Em ments at the same redshift are 0.01 0 combined. At redshifts beyond 20 masked the presence of some unknown formz of= 0.1 energy (distances greater than 9 0.2 0.4 0.6 1 rc about 10 light-years), the cos- 0.1 18 density, related to otherwise empty space, whichmological predictions ap- (indi-
with vacuum energy Mass density
cated by the curves) begin to BRIGHTNESS RELATIVE 16 pears to dominate the recent gravitational dynamicsdiverge, depending of on the as- 1 sumed cosmic densities of 14 the universe and yields a stage of cosmic acceleration.mass and vacuum energy. The 0.01 0.02 0.04 0.1 without vacuum energy red curves represent models We still have no solid clues as to the nature ofwith such zero vacuum dark energy and VED MAGNITUDE 22 Accelerating energy (or perhaps more accurately dark pressure).mass densities The ranging from the universe critical density rc down to zero (an empty cosmos). The best fit OBSER 21 Decelerating cosmological constant is the simplest possible(blue form line) assumes of a mass universe
density of about rc /3 plus a dark energy because it is constant in both spacevacuum energy and density twice 20 that large—implying an accel- 0.2 0.4 0.6 1.0 time, and provides a good fit to the experimentalerating data cosmic as expansion. REDSHIFT z of today. In this section we will discuss the many obser- 0.8 0.7 0.6 0.5 vations that probes the dark energy and we will describe LINEAR SCALE OF THE UNIVERSE RELATIVE TO TODAY the generalities of the concordance model of cosmology lowed up. This approach also made it possible to use the By the end of the year, the error bars began to tighten, with Λ , 0. Hubble Space Telescope forFIG. follow-up 20: light-curve Observed observa- magnitudeas both groups (and now submitted relative papers brightness) with a few more versus su- tions, because we could specifyredshift in advance is the plotted one-square- for well-measuredpernovae, showing evidence distant for much [97, less 98] than and the(in ex- the degree patch of sky in which our wide-field imager would pected slowing of the cosmic expansion.9–11 This was be- find its catch of supernovae.inset) Such specificity nearby is a [99, require- 100]ginning SNe Ia. to be For a problem clarity, for measurements the simplest inflationary at the ment for advance scheduling of the HST. By now, the models with a universe dominated by its mass content. A. Supernova Cosmology Berkeley team, had grown sameto include redshift some dozen arecollabo- combined.Finally, Atat the redshifts beginning of beyond1998, the twoz groups= 0. 1pre- (dis- rators around the world, and was called Supernova Cos- sented the results shown in figure 3.12,13 mology Project (SCP). tances greaterthe than about 109 ly), the cosmological predictions (indicated by the curves)What’s begin wrong with to diverge, faint supernovae? depending on the Acommunity effort The faintness—or distance—of the high-redshift super- The expansion history of the cosmos canMeanwhile, be deter- the whole supernovaassumed community cosmic was making densitiesnovae in of figure mass 3 was and a dramatic vacuum surprise. energy. In the simplest The red mined using as a “standard candle” any distinguishableprogress with the understanding of relatively nearby su- pernovae. Mario Hamuy andcurves coworkers represent at Cerro Tololo modelsExploding with White zero vacuumDwarfs energy and mass took a major step forward by finding and studying many class of astronomical objects of known intrinsic bright- plausible, though unconfirmed, scenario would explain nearby (low-redshift) type densitiesIa supernovae. ranging7 The resulting from ρc down to zero (an empty cosmos). how all type Ia supernovae come to be so much alike, beautiful data set of 38 supernova light curves (some A ness that can be identified over a wide distance range. As The best fit (blue line)given assumes the varied range a mass of stars densitythey start from. of A lightweight about ρc/3 shown in figure 1) made it possible to check and improve star like the Sun uses up its nuclear fuel in 5 or 10 billion the light from such beacons travels to Earthon through the results of an Branch andplus Phillips, a vacuumshowing that energy type density twice that large, implying an 6,7 years. It then shrinks to an Earth-sized ember, a white dwarf, Ia peak brightness could be standardized. with its mass (mostly carbon and oxygen) supported against expanding universe, the cosmic expansion stretchesThe new supernovae-on-demand not accelerating techniques cosmic that per- expansionfurther collapse [101, by electron 102]. degeneracy pressure. Then it mitted systematic study of distant supernovae and the im- begins to quietly fade away. only the distances between galaxy clusters, butprovedalso understanding the of brightness variations among But the story can have a more dramatic finale if the white nearby type Ia’s spurred the community to redouble its ef- dwarf is in a close binary orbit with a large star that is still very wavelengths of the photons en route. Theforts. recorded A second collaboration, called the High-Z Supernova actively burning its nuclear fuel. If conditions of proximity Search and led by Brian of Schmidt magnitude of Australia’s Mountm + 1.and The relative apparent mass are right, there magnitude, will be a steady streamm, in of the redshift and brightness of each these candlesStromlo thus Observatory, pro- was formed at the end of 1994. The material from the active star slowly accreting onto the white team includesd many veteranband, supernovax, experts. is defined The two asdwarf. Over millions of years, the dwarf’s mass builds up vide a measurement of the total integrated exansionrival teams raced of each other over the next few years—oc- until it reaches the critical mass (near the Chandrasekhar the universe since the time the light was emitted.casionally A covering col- for each other with observations when limit, about 1.4 solar masses) that triggers a runaway ther- one of us had bad weather—as we all worked feverishlymx tomxmonuclear= 2 explosion—a.5 log type( xIa/ supernova.x ) , (227) find and study the guaranteed on-demand batches of ,0 This slow, relentless10 approach to a, 0sudden cataclysmic lection of such measurements, over a sufficientsupernovae. range of − conclusion− at a characteristicF massF erases most of the orig- At the beginning of 1997, the SCP team presented the inal differences among the progenitor stars. Thus the light distances, would yield an entire historical recordresults forof our first the seven high-redshiftwhere supernovae.x is the8 These observedcurves (see figure flux 1) and in spectra the of band all type Iax supernovae, whereas universe’s expansion. first results demonstrated them cosmologicalF analysis tech- are remarkably similar. The differences we do occasionally niques from beginning to end. xThey,0 and were suggestivex,0 are of an a referencesee presumably reflect magnitude, variations on the andcommonreference theme— F including differences, from one progenitor star to the next, Type Ia supernovae (SNe Ia) are the best cosmologicalexpansion slowing down at fluxabout the in rate the expected same for the band x, respectively. A difference in simplest inflationary Big Bang models, but with error bars of accretion and rotation rates, or different carbon-to-oxy- gen ratios. yard sticks in the market. They are precise distancestill too large indi- to permit definitemagnitudes, conclusions. ∆m = m1 m2, can then be converted to a − ∆m cators because they have a uniform intrinsic56 brightnessApril 2003 Physics Todayrelative brightness as I2/I1 2.512 . http://www.physicstoday.org due to the similarity of the triggering white dwarf mass In Fig. 20 we show the observed≈ magnitude (and rel- (i.e., MCh = M ) and consequently the amount of nu- ative brightness) versus redshift for well-measured dis- clear fuel available to burn. This makes SNe Ia the best tant and (in the inset) nearby SNe Ia. The faintness (or at least most practical) example of “standardizable (or distance) of the high-redshift supernovae in Fig. 20 candles” in the distant universe. comes as a dramatic surprise. In the (simplest) stan- Before proceeding, we pause to present some nota- dard cosmological models described in Sec. VI B, the tion. The apparent magnitude (m) of a celestial object expansion history of the cosmos is determined entirely is a number that is a measure of its apparent bright- by its mass density. The greater the density, the more ness as seen by an observer on Earth. The smaller the the expansion is slowed by gravity. Thus, in the past, a number, the brighter a star appears. The scale used to high-mass-density universe would have been expanding indicate magnitude originates in the Hellenistic practice much faster than it does today. So one should not have of dividing stars visible to the naked eye into six magni- to look far back in time to especially distant (faint) su- tudes. The brightest stars in the night sky were said to pernovae to find a given integrated expansion (redshift). be of first magnitude (m = 1), whereas the faintest were Conversely, in a low-mass-density universe one would of sixth magnitude (m = 6), which is the limit of human have to look farther back. But there is a limit to how low visual perception (without the aid of a telescope). In the mean mass density could be. After all, we are here, 1856, Pogson formalized the system by defining a first and the stars and galaxies are here. All that mass surely magnitude star as a star that is 100 times as bright as a puts a lower limit on how far-that is, to what level of sixth-magnitude star, thereby establishing a logarithmic faintness we must look to find a given redshift. How- scale still in use today [96]. This implies that a star of ever, the high-redshift supernovae in Fig. 20 are fainter magnitude m is 1001/5 2.512 times as bright as a star than would be expected even for an empty cosmos. ' 37
Eventual collapse RELATIVE BRIGHTNESS OF SUPERNOVAE Eternal expansion Figure 4. The history of cosmic
}
Y expansion,and as measuredΩrad by theis the density fraction of relativistic matter
1 high-redshift supernovae (the black
0.1
0.01 (radiation). We might note in passing that the quantity 1.5 0.001 } data points), assuming flat cosmic 0.0001 geometry. The2 scale2 factor2 R 2of the universe iskc taken/( toa beR 10 atH pres-0) is sometimes referred to as Ωk. This usage ent, so it equalsis unfortunate, 1/(1 + z). The because it encourages us to think of
TIVE TO TODA curves in the blue shaded region represent curvaturecosmological models as in a contribution to the energy density of the which the accelerating effect of 1.0 0 vacuum energyuniverse, eventually over- which is incorrect. comes the decelerating effect of the mass density. These curves as- s
te z sume vacuum energy densities ra le s ranging from 0.95 r (top curve) e e 0.5 EXERCISEc 7.1 Imagine a class of astronomical objects cc t a a down to 0.4 r . In the yellow en r c th le shaded region, the curves repre- 0.5 s, e 1 that are both standard candles and standard yardsticks. te c ra e sent models in which the cosmic ele d 1.5 ec s REDSHIFT expansionIn is always other decelerating words, we know both their luminosities L and t d y 2 irs a due to high mass density. They as- f w on l 3 i a sume mass densities ranging (left to apparent ns their physical sizes `. Recall that the brightness a r p o right) from 0.8 rc up to 1.4 rc. In x . E . fact, for theI oflast two an curves, object the ex- is its flux on Earth divided by its angular
. LINEAR SCALE OF UNIVERSE RELA 0.0 pansion eventually halts and re- 2 –20 –10 0 +10 verses intoarea, a cosmic or collapse. solid angle on the sky, i.e. I = /θ , where θ the BILLIONS OF YEARS FROM TODAY angular size. How does the apparent brightnessF depend on redshift for a general cosmological model, for these FIG. 21: The history of cosmic expansion, as measured by the cosmological models, the expansion history of the cosmos as the recent measurements of theobjects cosmic microwave with back- fixed L and `? high-redshiftis determinedsupernovae entirely by its mass (the density. black The greater data points),the ground assuming strongly indicate, flat we can say quantitatively that cosmicdensity, geometry. the more the The expansion scale is slowed factor bya gravity.of the Thus, universeabout 70% is taken of the total tobe energy density is vacuum energy in the past, a high-mass-density universe would have been and 30% is mass. In units of the critical density rc, one 1 at present,expanding much so it faster equals than 1it/ does(1+ today.z). TheSo one curves should- inusually the bluewrites this shaded result as n’t have to look far back in time to especially distant (faint) W ! r /r " 0.7 and W ! r /r " 0.3. regionsupernovae represent to find cosmological a given integrated expansion models (redshift). in which the acceleratingL L c m m c B. Cosmic Microwave Background effect ofConversely, vacuum in energya low-mass-density eventually universe overcomes one would Why the not decelerating a cosmological constant? have to look farther back. But there is a limit to how low The story might stop right here with a happy ending—a effectthe of mean the mass mass density density. could be.These After all, curveswe are here, assume and complete vacuum physics energy model of the cosmic expansion—were it the stars and galaxies are here. All that mass surely puts not for a chorus of complaints from theThe particle cosmic theorists. microwave background (CMB) radiation densitiesa lower ranging limit on how from far—that 0.95 is,ρ to (topwhat level curve) of faint- down to 0.4 ρ . In the c The standard c model of particle was physicsdiscovered has no natural in 1964 by Penzias and Wilson, using an yellowness—we shaded must lookregion, to find the a given curves redshift. represent The high- place models for a vacuum in which energy density of the modest magni- redshift supernovae in figure 3 are, however, fainter than tude required by the astrophysicalantenna data. The simplest built es- for satellite communication [103]. The ra- the cosmicwould be expected expansion even for is an always empty cosmos. deceleratingtimates due to would high predict mass a vacuum energy 10120 times greater. If these data are correct, the obvious implication is (In supersymmetric models, it’s “only”diation 1055 times was greater.) acting as a source of excess noise (or “static”) density.that the They simplest assume cosmological mass model densities must be too ranging simple. (leftSo enormous to right) a L would from have engendered an acceleration The next simplest model might be one that Einstein en- so rapid that stars and galaxies couldin thenever have radio formed. receiver. Eventually, it became obvious 0.8 ρc up to 1.4 ρc. In fact, for the last two curves, the expansion tertained for a time. Believing the universe to be static, he Therefore it has long been assumed that there must be eventuallytentatively halts introduced and into reverses the equations into of ageneral cosmic rela- collapsesome underlying [101]. symmetry that preciselythat the cancels source the vac- of noise was actually a signal that was tivity an expansionary term he called the “cosmological uum energy. Now, however, the supernova data appear to constant” (L) that would compete against gravitational col- require that such a cancellation wouldcoming have to fromleave a re- outside the Galaxy. Precise measurements lapse. After Hubble’s discovery of the cosmic expansion, mainder of about one part in 10120were. That degree made of fine tun- at wavelength = 7 35 cm. The intensity Einstein famously rejected L as his “greatest blunder.” In ing is most unappealing. λ . later years, L came to be identified with the zero-point The cosmological constant modelof requires this radiation yet another was found not to vary by day or night Ifvacuum these energy data of all are quantum correct, fields. the obviousfine implication tuning. In the cosmic is expansion, mass density be- that theIt turns three out simplestthat invoking a models cosmological of constant cosmology al- comes ever introduced more dilute. Since theor end time of inflation, of theit has year, nor to depend on the direction to a lows us to fit the supernova data quite well. (Perhaps there fallen by very many orders of magnitude. But the vacuum was more insight in Einstein’s blunder than in the best ef- in Sec. VI B must be too simple. The nextenergy todensity simplest rL, a property of emptyprecision space itself, of stays better than 1%. Almost immediately after forts of ordinary mortals.) In 1995, my SCP colleague Ariel constant. It seems a remarkable and implausible coinci- modelGoobar includes and I had found an that, expansionary with a sample of type term Ia su- indence the that equation the mass density, of justits in the detection present epoch, it is was concluded that this radiation comes pernovae spread over a sufficiently wide range of dis- within a factor of 2 of the vacuum energy density. motiontances, driven it would be bypossible the to separate cosmological out the competing constantGiven Λ these, which two fine-tuningfrom coincidences, the universeit seems as a whole: a blackbody emission of effects of the mean mass density and the vacuum-energy likely that the standard model is missing some funda- competes14 against gravitational collapse. The best fit to hot, dense gas (temperature T 3000 K, peak wave- density. mental physics. Perhaps we need some new kind of accel- ∼ The best fit to the 1998 supernova data (see figures 3 erating energy—a “dark energy” that,length unlike λL, maxis not con-1000 nm) redshifted by a factor of 1000 to the 1998and 4) implies supernova that, in the data present shown epoch, the invacuum Figs. en- 20stant. and Borrowing 21 implies from the example of the putative ergy density r is larger than the energy density attribut- “inflaton” field that is thought to have triggered inflation,∼ that, in the presentL epoch, the vacuum energy density ρ λmax 1 mm and T 3 K [104]. A compilation of exper- 2 theorists are proposing dynamicalΛ scalar-field models and able to mass (rmc ). Therefore, the cosmic expansion is now ∼ ∼ is largeraccelerating. than If the universe energy has no density large-scale curvature, attributableother even to more mass exoticρ malternatives. imental to a cosmological measurements con- in the range 0.03 cm . λ . 75 cm
Therefore,http://www.physicstoday.org the cosmic expansion is now accelerating. April 2003revealed Physics Today an accurate57 blackbody spectrum, see Fig. 22. To accommodate SNe Ia data we must add an addi- Actually, according to the FIRAS (Far InfraRed Abso- tional term into the Friedmann equation (158), lute Spectrometer) instrument aboard the COBE (Cosmic Background Explorer) satellite, which measured a temper- 8π ρ kc2 Λc2 ature of T = 2.726 0.010 K, the CMB is the most perfect H2 = G + . (228) 0 ± 3 c2 − a2R2 3 blackbody ever seen [106]. 0 The CMB photons we see today interacted with matter The Λ term also modifies the acceleration equation (184), for the last time some 380 kyr after the bang. Photon which becomes decoupling occurs when the temperature has dropped
2 to a point where there are no longer enough high energy a¨ Λc 4πG 1 + = (ρ + 3P) , (229) photons to keep hydrogen ionized: H γ / e−p . This a 3 − 3c2 era is known as recombination, even though the atomic and H(z) in (208) is now given by constituents had never been combined prior. The ion- ization potential of hydrogen is 13.6 eV (i.e., T 105 K), n ∼ 3 4 but recombination occurs at Trec 3000 K. This is H(z) = H0 Ωm,0(1 + z) + Ωrad,0(1 + z) + ΩΛ ∼ 10 because the low baryon to photon ratio, η 5 10− , o1/2 ≈ × 2 allows the high energy tail of the Planck distribution + (1 Ω0)(1 + z) , (230) − to keep the comparatively small number of hydrogen where atoms ionized until this much lower temperature.
Ω = Ωm + Ωrad + ΩΛ , (231) EXERCISE 7.2 (i) For blackbody radiation, the energy 2 38
Wavelength (cm) 10 1.0 0.1 10−17
10−18 ) 1 −
Hz −19 1 10 − 2.726 K blackbody sr 2 − −20 10 FIRAS COBE satellite W m
( DMR COBE satellite
⌫
ν UBC sounding rocket I B 10−21 LBL-Italy White Mt. & South Pole Princeton ground & balloon Cyanogen optical 10−22 FIG. 23: The CMB over the entire sky, color-coded to represent 110 100 1000 differences in temperature from the average 2.726 K: the color Frequency (GHz) scale ranges from +300 µK (red) to 300 µK (dark blue), repre- senting slightly hotter and colder spots (and also variations in Figure 1. FIG.Precise 22: Themeasurements CMB blackbody of the CMB spectrum spectrum. as confirmed The line represen by mea-ts a 2.73density.) K Results are from the WMAP satellite [107] and the blackbody,surements which describes over athe broad spectrum range very of wavelengths well, especially [105]. around the peak of inten-Planck mission [108]. sity. The spectrum is less well constrained at frequencies of3GHzandbelow(10cm and longer wavelengths). (References for this figure are at the end of this section under “CMB Spectrum References.”) density per unit frequency is given by temperature anisotropies in the CMB are interpreted as a 8πhν3dν snapshot of the early stages of this growth, which even- u dν = . (232) ν c3[exp(hν/kT) 1] tually resulted in the formation of galaxies [109, 110]. Wavelength (cm) − The full sky CMB temperature anisotropy map, as 30030 3 0.3 0.03 Since3.5 the energy of one photon is hν, the number density measured by the Wilkinson Microwave Anisotropy of photons is given by the same expression above Probe (WMAP) [107] and the Planck mission [108], is divided by hν. Calculate the present density of photons shown in Fig. 23. It is convenient to expand the differ- in the3.0 universe, knowing that the CMB temperature is ence ∆T(nˆ) between the CMB temperature observed in T0 2.726 K. [Hint: you will find it useful to know that a direction given by the unit vector nˆ = (θ, φ) and the R '2 x x dx/(e 1) 2.404.] (ii) If deuterium measurements present mean value T0 of the temperature in spherical − ' 10 require2.5 a baryon to photon ratio of η = 5.5 10− , what harmonics must the current density of baryons be? (iii)× Assuming that the Hubble constant is H Planck 70 km s 1 Mpc 1, X∞ X 0 = − − ∆T(nˆ) T(nˆ) T = a Y , (233) Temperature (K) Compton y 0 lm lm calculate what Ωb is. ≡ − 2.0 l=0 m l Chemical potential µ | |≤ Free-free Before the recombination epoch the universe was an where opaque “fog” of free electrons and became transparent to 1.5 Z photons0.1 afterwards.1 Therefore,10 when we100 look at the 1000 sky 1 T = d2nˆ T(nˆ) , (234) in any direction, we canFrequency expect to(GHz) see photons that orig- 0 4π inated in the “last-scattering surface.” This hypothesis Figure 2. hasThe been shapes tested of expected, very precisely but so far by unobserved, the observed CMB distribu-distortions, resulting from energy-releasing processes at different epochs. tion of the CMB; see Fig. 23. The large photon-to-nucleon Z ratio implies that it is very unlikely for the CMB to be alm = ∆T(nˆ) Ylm(nˆ) dΩ , (235) produced in astrophysical processes such as the absorp- tion and re-emission of starlight by cold dust, or the and where Ω denotes the solid angle parametrized by the absorption or emission by plasmas. Before the recombi- pair (θ, φ). The set Ylm is complete and orthonormal, nation epoch, Compton scattering tightly coupled pho- obeying { } tons to electrons, which in turn coupled to protons via electomagnetic interactions. As a consequence, photons Z and nucleons in the early universe behaved as a single dΩ Yl1m1 (Ω) Yl2m2 (Ω) = δl1l2 δm1m2 . (236) “photon-nucleon fluid” in a gravitational potential well created by primeval variations in the density of matter. Since ∆T(nˆ) is real, we are interested in the real-valued, Outward pressure from photons, acting against the in- orthonormal Ylm’s, defined by ward force of gravity, set up acoustic oscillations that propagated through the photon-nucleon fluid, exactly l Pm(x)( √2 cos(mφ)) m > 0 like sound waves in air. The frequencies of these oscilla- Y (θ, φ) = N(l, m) P (x) m = 0 , tions are now seen imprinted on the CMB temperature lm l l fluctuations. Gravity caused the primordial density per- Pm(x)( √2 sin(mφ)) m < 0 turbations across the universe to grow with time. The (237) 39 where Fig. 26. The angular scale corresponding to the particle s horizon size is the boundary between super- and sub- (2l + 1)(l m)! horizon scales. The size of a causally connected region N(l, m) = − (238) 4π (l + m)! on the surface of last scattering is important because it determines the size over which astrophysical processes is a normalization-factor, can occur. Normal physical processes can act coherently only over sizes smaller than the particle horizon. The rel- (1 x2)m/2 dm+l ative size of peaks and locations of the power spectrum Pm(x) = − (x2 1)l (239) l 2ll! dxm+l − gives information about cosmological parameters [114]. In Fig. 25 we show the influence of several cosmolog- l is the associated Legendre polynomial, Pl = Pm=0 is the ical parameters on the power spectrum. For historical Legendre polynomial, and x cos θ; for further details reasons, the quantity usually used in the multipole rep- ≡ see e.g. [111]. resentation is The lowest multipole is the l = 0 monopole, equal " #1/2 to the average full-sky flux and is fixed by normal- l(l + 1) ∆T Cl . (242) ization (234). The higher multipoles (l 1) and their ≡ 2π ≥ amplitudes alm correspond to anisotropies. A nonzero m corresponds to 2 m longitudinal “slices” ( m nodal As an illustration, we sketch how to use the power meridians). There are| l|+1 m latitudinal “zones”| | (l m spectrum to determine the curvature of space. At re- nodal latitudes). In Fig.−| 24 we| show the partitioning−| | combination the universe is already matter-dominated, described by some low multipole moments. so we can substitute zls 1100 into (215) to give an esti- mate of the horizon distance' at the CMB epoch EXERCISE 7.3 At every point in the sky,one observes a 2c blackbody spectrum, with temperature T(θ). The largest dh,ls = 0.23 Mpc . (243) H (1 + z)3/2 ≈ anisotropy is in the l = 1 (dipole) first spherical har- 0 monic, with amplitude 3.355 0.008 mK [113]. The dipole This is the linear diameter of the largest causally con- ± is interpreted to be the result of the Doppler shift caused nected region observed for the CMB, `ls. Therefore, sub- by the solar system motion relative to the nearly isotropic stituting (243) into (223) we find today’s angular diame- blackbody field, as broadly confirmed by measurements ter of this region in the sky of the radial velocities of local galaxies. Show that the 1 motion of an observer with velocity β = v/c relative to θ = = 0.03 1.8◦ . (244) an isotropic Planckian radiation field of temperature T (1 + z)1/2 1 ≈ 0 − produces a Doppler-shifted temperature pattern The reason for this “causality problem” is that the uni- " 2 # verse expands slower than light travels. Namely, as we β 3 T(θ) T0 1 + β cos θ + cos(2θ) + (β ) . (240) have seen, when the age of the universe increases the ≈ 2 O part observable to us increases linearly, ct, while the scale factor increases only with t2/3 (or t∝1/2). Thus we see more and more regions that were never in causal It is easily seen that the alm coefficients are frame- contact for a radiation or matter-dominated universe. dependent. Note that a simple rotation in the φ coor- We note that the sound horizon has approximately the dinate will change the sin φ, cos φ part of the spherical same angular size, because of v c/ √3. The sound harmonic for m 0 and a rotation in the θ coordinate s , horizon serves as a ruler at fixed redshift∼ z to measure will change the associated Legendre polynomial part for ls the geometry of spacetime. Moreover, the fluid of pho- l 0. So only the ` = m = 0 monopole coefficient is co- , tons and nucleons performs acoustic oscillations with its ordinate independent. To combat this problem, we use fundamental frequency connected to the sound horizon the power spectrum defined by plus higher harmonics. The relative size of peaks and locations then gives information about cosmological pa- Xl 1 2 rameters. The first panel of Fig. 25 shows that, for a flat Cl a . (241) ≡ 2l + 1 lm universe (Ω 1, the first peak sits at θ 1 as we have m= l tot ◦ − found in our simple≈ estimate (244). In≈ Fig. 25 we dis- A brief Cl initiation is provided in Fig. 26. play a compilation of measurements of the CMB angular power spectrum. The data agree with high significance EXERCISE 7.4 Show that the power spectrum Cl is with models when they input dark energy as providing invariant under rotations. 70% of the energy in the universe, and when the total energy≈ density ρ equals the critical density. The data To get a rough understanding of the power spec- also indicate that the amount of normal baryonic matter trum we can divide up the multipole representation in the universe Ωb is only 4% of the critical density. What into super-horizon and sub-horizon regions as shown in is the other 96%? 40
FIG. 24: Nodal lines separating excess and deficit regions of sky for various (l, m) pairs. The top row shows the (0, 0) monopole, and the partition of the sky into two dipoles, (1, 0) and (1, 1). The middle row shows the quadrupoles (2, 0), (2, 1), and (2, 2). The bottom row shows the l = 3 partitions, (3, 0), (3, 1), (3, 2), and (3, 3) [112]. 16.5 Inflation
optical light in white and orange, and the CDM map 100 (a) Curvature (b) Dark Energy (drwan up using data on gravitational lensing from
80 Magellan and European Space Observatory telescopes at Paranal) in blue. Galaxy clusters contain not only the
K) 60 µ
( galaxies ( 2% of the mass), but also intergalactic plasma
T ∼ ∆ ( 10% of the mass), and (assuming the null hypothesis) 40 CDM∼ ( 88% of the mass). Over time, the gravitational ∼ 20 attraction of all these parts naturally push all the parts Ωtot ΩΛ to be spatially coincident. If two galaxy clusters were to 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 collide/merge, we will observe each part of the cluster to 100 (c) Baryons (d) Matter behave differently. Galaxies will behave as collisionless particles but the plasma will experience ram pressure. 80 Throughout the collision of two clusters, the galaxies will then become separated from the plasma. This is K) 60 µ ( seen clearly in the Bullet Cluster, which is undergoing T ∆ 40 a high-velocity (around 4500 km/s) merger, evident from the spatial distribution of the hot, X-ray emitting 20 gas. The galaxies of both concentrations are spatially Ωbh2 Ωmh2
0.02 0.04 0.06 0.1 0.2 0.3 0.4 0.5 separated from the (purple) X-ray emitting plasma. 10 100 1000 10 100 1000 The CDM clump (blue), revealed by the weak-lensing l l map, is coincident with the collisionless galaxies, but lies ahead of the collisional gas. As the two clusters FIG. 25: Influence of several cosmological parameters on the cross, the intergalactic plasma in each cluster interacts angular power spectrum of the CMB [16]. Figure 16.4: The influence of several cosmological parameters on the angularwith power the plasma spectrum in the other cluster and slows down. of the CMB. However, the dark matter in each cluster does not interact at all, passing right through without disruption. There is a strong astrophysical evidence for a signif- This difference in interaction causes the CDM to sail is the distance a photonicant travelled amount freelyof nonluminous after its matter last scat intering the universe at tls. Thusahead the of the maximal hot plasma, separating each cluster into two components: CDM (and colissionless galaxies) in angular separation of areferred causally to connected as cold dark points matter is (CDM). with l = Forct example,ls(1 + zls) observations of the rotation of galaxies suggest that they the lead and the hot interstellar plasma lagging behind. rotate as they had(1 considerably + zls)tls more mass than we can What might this nonluminous matter in the universe see [115–117].ϑ = Similarly, observations0.02 of1◦ the motions be? We do not(16.13) know yet. It cannot be made of ordinary t0 ≈ ≈ of galaxies within clusters also suggest they have (baryonic) matter, so it must consist of some other sort The reason for thisconsiderably “causality problem” more mass is than that can the be seen universe [118].expands The of slower elementary than particle light [120] . travels: As the age of themost universe compelling increases, evidence the for part CDM obser is thatvable observed to us at increases linearly, ct, ∝ while the scale factor increasesthe Bullet Clusteronly with [119].t2/ In3 Fig.or ( 28t1/ we2). show Thus a composite we see more andEXERCISE more regions 7.5 We will examine galaxy rotation curves image of the Bullet Cluster (1E 0657-558) that shows the and show that they imply the existence of dark mat- that were never in causalX-ray contact light detected for a radiation by Chandra or matter-do in purple,minated (an image universe.ter. (i) Recall that the orbital period is given by 2 2 3 T The sound horizon hasfrom approximately Magellan and the the Hubble same space angular telescope size, b of)ecause the of vs= 4πc/a √/GM3.. The Write down an expression that relates T ≈ exact size depends among other on the cosmological model: The sound horizon serves as a ruler at fixed redshift zls to measure the geometry of space-time. Moreover, the fluid of pho- tons and nucleons performs acoustic oscillations with its fundamental frequency connected to the sound horizon plus higher harmonics. The relative size of peaks and locations gives information about cosmological parameters. Figure 16.4 shows the influence of several cos- mological parameters on the angular power spectrum as function of ℓ π/ϑ. The first panel ∼ shows that the first peak sits indeed at ℓ 100 (or ϑ 1 ) for a flat Universe, as we have ≈ ∼ ◦ found in our simple estimate (16.13). Observations by the WMAP satellite confirm with high significance the value for Ωb from BBN, for ΩΛ from type Ia supernovae, and that we live in a flat Universe.
16.5 Inflation
Shortcomings of the standard big-bang model Causality or horizon problem: why are even causally disconnected regions of the universe • homogeneous, as we discussed for CMB?
137 • the composition of the Universe: f(⌦B, ⌦CDM, ⌦HDM, ⌦⌫, ⌦ ) • the origin of structure What makes these parameters even more important and what makes CMB- cosmology such a hot subject is that in the near future measurements of the CMB angular power spectrum will determine these parameters with the unprecedented 41 precision of a few % (Jungman et al. 1996). Maps Power Spectra Doppler Peaks
δρ C l l 110 100 1000 l
C l Nothing
l(l+1)C Sachs-Wolfe Plateau
δφ = constant radial averaging 110 100 1000 l + diffusion damping
Cl 1 0.1 degrees super horizon sub horizon 110 100 1000 l
Figure 2. Simple Maps and their Power Spectra.Ifafull-skyCMBmap Figure 4. Simplified CMB Power Spectrum. The CMB power spectrum has only aLeft dipole panel.(top), it’s power spectrum is a delta function at ` =1. Ifamap can be crudely divided into three regions. The Sachs-Wolfe Plateau caused by the FIG. 26: Illustrative sky maps and their angular power spectra. If a full-sky CMB map has only a dipole (top), its has only temperature fluctuations on an angular scale of ⇠ 7 (middle) then all of scale independence of gravitational potential fluctuations which dominate the spec- power spectrum is a delta function at l = 1. If a map has only temperature fluctuations on an angular scale of 7◦ (middle) then the power is at ` ⇠ 10. If all the hot and cold spots are even smaller (bottom) then trum at large super-horizon scales. The horizon is the angular∼ scale corresponding allthe of power the is power at high `. is at l 10. If all the hot and cold spots are even smaller (bottom) then the power is at high l. Right panel ∼ to ctdec where c is the speed of light and tdec is the age of the Universe at decou- Simplified CMB power spectrum. The CMB power spectrum can be crudelypling. The divided Doppler peaks into on three scales slightly regions. smaller The than Sachs-Wolfe the horizon are due Plateau to caused by the scale independence of gravitational potential fluctuationsresonant which acoustic dominateoscillations analogous the spectrum to mellifluous bathroom at large singing super-horizon (see Figure 3. What is the CMB power spectrum? 8). At smaller scales there is nothing because the finite thickness of the surface of scales. The horizon is the angular scale corresponding to ctls. The Dopplerlast scattering peaks averages on scales small scale slightly fluctuations smaller along the than line of the sight. horizon Di↵usion are Similardue to theresonant way sines acoustic and cosines oscillations. are used in Fourier At smaller decompositions scales there of arbi- is nothingdamping because (photons the di finite↵using thicknessout of small scale of thefluctuations) surface also of suppresses last scattering power traryaverages functions small on flat scale space, sphericalfluctuations harmonics along can the be used line to of make sight. decompo- Diffusion dampingon these scales. (photons diffusing out of small scale fluctuations) . sitionsalso of suppresses arbitrary functions power on on the these sphere. scales Thus [114]. the CMB temperature maps 5
Angular scale (degrees) in a sphere around the center of the Milky Way with a 8 radius equal to 8 kpc? (iii) Assume that the Milky Way is made up of only luminous matter (stars) and that the Sun is at the edge of the galaxy (not quite true, but close). What would you predict the orbital velocity to be for a
) star 30 kpc from the center? and for 100 kpc? (iv) Obser-
K vations show that galaxy rotation curves are flat: stars µ
( move at the same orbital velocity no matter how far they
T are from the center. How much mass is actually con-