1
JOHAN A.M. BLEEKER*, JOHANNES GEISS** AND MARTIN C.E. HUBER*** The century of space science
In the course of its brilliant evolution through the twentieth of our Universe have changed the perception of the world century, space science brought spectacular results and led around us, for scientists as well as for the general public. to fundamental scientific findings. In the early quest for For about a quarter of a century after the end of World higher altitude, cosmic rays were discovered by Viktor Hess War II, in what we might call the early epoch of space sci- during a balloon flight in 1912. Following the second ence, experiments in space were regarded as daring but World War, sounding rockets were used, starting in 1946, rather extravagant. But in the course of the 1970s, the meth- to study the structure of the terrestrial atmosphere – the ods and techniques of space science began to enter the threshold of space. Two landmarks of the space age stand mainstream of science, and by the end of the century, space out: the launch in 1957 of the first satellite, Sputnik 1, science had become a natural complement, and often an which sensed the near-Earth environment at orbital altitude, integral part of study in many fields of science. and in 1969 the first landing by humans on an extraterres- The present two volumes of The Century of Space trial body, the Moon. Today, sophisticated space probes Science tell the story of space science as it evolved during explore distant worlds, and space telescopes look back in the twentieth century. The origins of space research, the time towards the early Universe. enabling technology for space transportation and the ‘early The scientific exploration of space was at the origin and, epoch’ of space science mentioned above, are addressed in indeed, the motivation for our first ventures away from the Chapters 2–14. The chapters in the main body of this Earth. It is well known, however, that rockets – the enabling Reference Work then describe the development and results tools of space research – were developed with support from of the scientific topics that have benefited extensively from the military, and that the ‘space race’, later on, served as a space investigations. A chronology of the space age and a substitute battleground during the Cold War. Today, fortu- list of space science missions appear as appendices. nately, the adversities of those years have given way to We have had to restrict the number of fields covered, and peaceful global collaboration that would have been techno- have therefore concentrated on those topics with the most logically and ideologically impossible during most of the comprehensive record in space, namely the exploration of twentieth century. the Solar System, astronomy and gravitational physics. We The experiences of the astronauts on the Moon, their pic- note, however, that space science – far from being a coher- tures of the Earth from above, the visits of unmanned space- ent discipline itself – has by now become a widely used craft and robots to the distant planets and their satellites, and method that complements in an essential way the investiga- the probing of the largest distances and the earliest epochs tive tools of an impressive range of initially laboratory- and ground-based research fields. * SRON – National Institute for Space Research, Utrecht, We have also included three chapters on Earth science. The Netherlands ** ISSI – International Space Science Institute, Bern, Switzerland This field is important not only because of its scientific *** ESA – European Space Agency, Paris, France achievements, but also because, from early on in the space
Johan A.M. Bleeker, Johannes Geiss and Martin C.E. Huber, The Century of Space Science, 3–22 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 4 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER age, it started to raise humanity’s awareness of the fragility EARTH SYSTEM SCIENCE of their home planet – ‘spaceship Earth’ – which protects them from the hostile cosmic environment. It is appropriate Space-borne observational platforms gave us, for the first therefore to include here an image of ‘the blue planet in time, the means to survey the many features of our planet space’, photographed during one of the Apollo missions rapidly and effectively from a global perspective. This global (Figure 1). view of the Earth from space – together with the maturation Rather than use this chapter to summarize the contents of distinct Earth-science disciplines, such as geophysics of of the rest of the book, we instead highlight a number of the solid Earth, oceanography, and atmospheric dynamics results which demonstrate how space science has made and chemistry, as well as the recognition of the human role in major contributions to our knowledge, and sometimes has global change – has stimulated a new approach for studying even forced us to change our concepts of nature. In outlining our home world, what we might call Earth System science. these results, we start with Earth System science and then In this approach the Earth System is studied in the context of proceed – roughly following the historical development – a related set of interacting processes rather than as a collec- from the terrestrial environment and the Solar System to the tion of individual components. Interactions among oceans, Milky Way, and then to extragalactic space and cosmology. ice, land-masses, the atmosphere and biological systems are (Note, however, that the sequence chosen for the topics pre- significant but also very complex. The transport of energy sented in the main body of this work is the opposite: it and material within and among these subsystems occurs on a begins with gravitational physics and cosmology and ends global scale across a wide range of time-scales. Observations with Earth science.) from space are indispensable for present and future research,
Figure 1 The Blue Planet, with the dry lunar surface in the foreground. THE CENTURY OF SPACE SCIENCE 5 whose ultimate goal is an understanding of the processes responsible for the evolution of the Earth on all time-scales. The importance of a synoptic view of planet Earth is convincingly demonstrated by the global monitoring of stratospheric ozone (Figure 2). Ozone provides crucial pro- tection against hard ultraviolet radiation from the Sun. The mapping and monitoring of the evolution of the Antarctic ozone hole has helped us to develop and validate represen- tative models for ozone depletion and restoration in the upper atmosphere.
PLASMAS IN SPACE: MAGNETOSPHERE AND HELIOSPHERE
The International Geophysical Year (1957/58) was the first worldwide coordinated effort in space science. Its goal was to understand the aurorae and to map the variation of the Earth’s magnetic field. Space science has indeed clarified to a large extent the relation between the two. Sounding rock- ets on suborbital trajectories, satellites in various orbits together with ground-based observations of geomagnetic field variations were instrumental in revealing the processes that cause auroral phenomena. These early efforts immedi- ately led to the discovery of a magnetically trapped, colli- sionless particle population, namely the radiation belts and the terrestrial magnetosphere, including its extended magne- totail (Figure 3). It was found that what is responsible for the aurorae (Figure 4) is a medium-energy particle component being injected – during magnetic substorms – from the magnetotail into the auroral zone, rather than, as had been believed, trapped high-energy particles in the radiation belts. Space in the terrestrial neighbourhood was also exploited as a natural laboratory: in situ measurements led to the dis- covery of discontinuities in the collisionless space plasma and uncovered the phenomena of magnetic reconnection and shock acceleration, both fundamental processes throughout the cosmos. The terrestrial magnetosphere later became the prototype for modelling other planetary magnetospheres and the magnetospheres of neutron stars, pulsars and rotating black holes. The heliosphere is formed, because the corona of the Sun is in a state of expansion, forming a continuously flow- ing solar wind. Because of its expansion into three dimen- sions, the solar wind pressure is reduced until it can no longer overcome the ambient interstellar plasma pressure. A termination shock, formed at an estimated heliocentric distance of 80–100 AU, converts the supersonic wind into a Figure 2 Assimilated total ozone maps showing the evolution subsonic flow. of the Antarctic ozone hole between September 2000 and January 2001. The charged particles in the external interstellar gas can- not penetrate the heliopause, the boundary of the heliosphere. However, interstellar grains and neutral atoms do enter, and 6 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER
Figure 3 The terrestrial magnetosphere, which results from the interaction of the solar wind with the magnetic field of the Earth, was completely unknown before the advent of the space age. This schematic drawing gives an impression of the complexity of the interaction. they have been found and investigated by spacecraft in the Evidence gathered from Lunar exploration by manned space between the planets and throughout the heliosphere, and unmanned spacecraft shows that water of crystalliza- allowing a direct analysis of physical, chemical and isotopic tion and hydrogen-containing minerals are absent from properties of the matter in the local interstellar cloud. Lunar material, and that other volatile elements, such as A rich variety of heliospheric phenomena have been carbon and nitrogen, are virtually absent as well (Figure 6). discovered, and the underlying processes are now generally Furthermore, neither in situ observations by astronauts nor well understood. And the heliosphere has become a para- pictures taken on the lunar surface or from lunar orbit digm for the interaction of a main sequence star with the revealed any traces of past water flows or sedimentation interstellar medium. (Figure 7). The extremely high temperature of the material ejected by a giant collision can readily explain this com- plete lack of volatile material. THE MOON AND THE TERRESTRIAL PLANETS It is also remarkable that the Moon has much less iron than does the Earth or meteorites. This is indeed what we The origin of the Earth–Moon system, has long been an would expect if the giant impact occurred after the Earth enigma. Moons orbit many planets in the Solar System but, had formed its iron core and if the impact ejected mainly with the exception of those of Earth and Pluto, they are tiny in material from the Earth’s crust and mantle. Fractionation- comparison with the planet they accompany. The satellite sys- corrected abundances of the three isotopes of oxygen also tems of the giant planets, Jupiter and Saturn, consisting of point to a terrestrial origin of the Moon: they are identical more than ten moons each were probably formed from in all Lunar and terrestrial samples, yet differ from those of circumplanetary disks at the time of the planets’ formation. Martian rocks and all classes of meteorites. The exceptional relative size of our Moon points to a different The aggregate evidence in favour of the giant impact the- genesis. Among the various hypotheses of the Lunar origin, ory has become so strong that, at the end of the twentieth there is only one – the giant impact theory – that explains all century, it is generally accepted as the proper explanation for the relevant observations (Figure 5). the origin of the Moon. THE CENTURY OF SPACE SCIENCE 7 The The aurora borealis as seen from the ground and from the Space Shuttle (note the silhouetted tail fin and engine shrouds), and pictures of aurorae on Jupiter and Saturn, as observed by the Hubble Space Telescope. the Hubble Space by and Saturn, as observed Jupiter Figure Figure 4 8 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER
Figure 5 All evidence indicates that our Moon was created by a collision between Earth and a Mars-sized object that had formed in a nearby orbit. This painting by William K. Hartmann depicts the situation five hours after the collision.
The Apollo programme and unmanned lunar orbiters and melting in its early history. Later, between about 600 and landers have given us a remarkably complete picture of the 1500 million years after the birth of the Moon, iron- and evolution of the Moon as a geological, geophysical and geo- titanium-rich lava rose from below and filled the large chemical entity. Its anorthositic crust indicates complete basins that had been excavated a few hundred million years
Figure 6 A thin section of basaltic rock collected by Neil A. Armstrong and Edwin E. Aldrin in the Lunar Mare Tranquillitatis. The Moon is virtually free of hydrogen, carbon, nitrogen and other volatile elements; accordingly, the minerals in this sample of lunar rock do not contain these elements. THE CENTURY OF SPACE SCIENCE 9
Figure 7 The dry Lunar landscape in the Mare Imbrium. Like everywhere else on the Moon, there is no trace of past water flows. The rocks are dry, devoid of hydrogen, carbon and nitrogen, although nearby there is a valley that at times in the past had been considered to be a dry riverbed. earlier by large impacts during the last stages of accretion in Solar System have important similarities: their sizes, chemical the Solar System. The lava-filled basins are the dark basaltic compositions and distances from the Sun are rather similar, planes on the Moon that were called maria by Galileo. and so comparative studies can further our understanding of The other surface features are younger and are mostly their origin and evolution. Moreover, we note that these three due to more recent impacts, because by about 3 billion years are the only objects in the Universe for which, by the end of ago the interior of the Moon had cooled to the point where the twentieth century, detailed geological surveys had been geological activity came to a standstill, and the surface of conducted and of which we possess rock samples for labora- the Moon was only occasionally changed by impacts. On tory analysis. Earth, the intrinsic geological activity goes on to this day. It should be stressed, however, that comparative plane- Impacts have only a small influence on shaping surface fea- tology is not just a matter of transferring experience and tures of our planet. However, towards the end of the twenti- information from Earth science to planetology. Quite the eth century it became more and more apparent that impacts contrary: much has been learned about the origin and the play a major role in the evolution of life. early history of the Earth from comparisons with the Moon. The space age has also completely transformed plane- The methods used to derive the relative abundances of chem- tary science. This field has evolved from a branch of astron- ical elements in the Earth and other planetary bodies were omy into a multidisciplinary field that now embraces developed by comparing results of chemical analyses of astronomy, geology, geophysics, geochronology, geochem- rocks from the Earth, Moon and meteorites, including some istry and atmospheric science. meteorites from Mars. Comparisons of the molecular and Concepts developed over centuries of Earth science can be isotopic compositions of the extremely different atmospheres extended to those objects not too dissimilar from the Earth, of Venus, Earth, the Moon and Mars have also improved our namely the terrestrial planets and the Moon. Although an knowledge of fundamental atmospheric processes such as interplanetary traveller would perceive Earth, Moon and Mars outgassing and loss by escape, and of factors that determine (Figure 8) as entirely different worlds, these members of the planetary climates, such as the greenhouse effect. And our 10 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER
Figure 8 Topographic map of Mars. Low levels are shown in blue, as a reminder that in the past there was probably extensive flooding on Mars. understanding of the role of impacts in surface geology and Venus, on the other hand, is so thick that surface features can in biological evolution on Earth has been greatly advanced be sensed only by radio and radar techniques (Figure 10). by the study of craters and other surface features of the Similarly on Earth, radar can ‘see through’ the clouds. terrestrial planets and the Moon. Information about the interiors of planets is obtained The technique of remote sensing from orbit permits us to from seismic data and from measurements of gravity, heat investigate not only the atmospheres but also the surfaces of flow and magnetic field. Strong magnetic fields, implying the Moon and the planets, including the Earth. Moreover, an ongoing dynamo effect inside the Earth, Jupiter and depending on the thickness of the atmosphere, the surface other large planets, have been observed. Magnetic field chemistry and mineralogy of these bodies can be probed by measurements now also indicate that Mars possessed a using parts of the electromagnetic spectrum other than the dynamo in its early history. visible, for example gamma rays for Mars and X-rays and The exploration of Mars gained momentum towards the gamma rays for the Moon (Figure 9). The atmosphere of end of the twentieth century, largely because indications of
Figure 9 A compositional map of the Lunar surface as determined by the Apollo 15 gamma-ray spectrometer. High concentrations of radioactive elements (red and yellow regions) are found in the maria and other regions on the nearside. THE CENTURY OF SPACE SCIENCE 11
Figure 10 A three-dimensional perspective view of the surface of Venus, showing the western Eistla region with the volcanic peaks of Gula Mons (left, 3 km high) and Sif Mons (right, 2 km high). The image was obtained by use of synthetic aperture radar (SAR) data combined with radar altimetry, both from the US-American Magellan mission. The hues are based on colour images recorded by the Soviet Venera 13 and 14 probes. extensive water flows at some undetermined earlier epoch then freezes upon exposure. Changes in the orientation of were discovered. These observations and findings in mete- this satellite’s magnetic axis with time are, in fact, consistent orites of Martian origin greatly increased interest in the with a conductive liquid-water layer less than 100 km below question of whether life, or at least extinct life, could be the surface. Europa’s icy shell is therefore thought to cover a found on Mars. salt-rich ocean, in which life might exist. Saturn’s largest satellite, Titan, also evoked considerable THE OUTER SOLAR SYSTEM interest (Figure 12). The temperature structure of its atmos- phere and the abundance of nitrogen and hydrocarbons are Pictures and other data transmitted from probes that have reminiscent of the early Earth. Titan, however, has not visited the outer planets and their environs, and also obser- evolved because of the low temperature and the lack of liq- vations with space telescopes, have revealed a fantastic vari- uid water at 10 AU from the Sun. ety of surfaces, atmospheric phenomena, ring systems, and Io’s plasma torus. Shepherd moons, which constrain thin rings of shards circling planets, were found as well. The THE ORIGIN OF THE SOLAR SYSTEM, larger moons, in particular, showed traces of geologically METEORITES AND COMETS recent changes on their surfaces and even current activity. The most dramatic example is the ongoing activity of Io, The question of the origin of the Solar System remained the innermost of Jupiter’s Galilean moons (Figure 11). This largely a domain of physicists well into the twentieth activity not only shapes the surface of this satellite, it also century. It was concluded that the process starts with the gives rise to the famous sulphur-rich Io torus, which in gravitational collapse of a cloud of gas and dust, and is fol- turn is a major source of ions for the huge magnetosphere lowed by the formation of a disk from which the Sun and of Jupiter. Given Io’s size, which is similar to that of the planets evolved. When chemical arguments were intro- Earth’s Moon, internal energy sources could not possibly duced, meteorite research became important in studying the drive the strong sulphur volcanism that exists in the present origin and evolution of the Solar System. From isotope epoch. The answer to the puzzle is the tidal force exerted by abundance measurements in terrestrial and meteorite sam- Jupiter, which has over 300 times the mass of the Earth. ples, it was found that the age of the Earth and of the Solar Europa, though farther from Jupiter than Io, still appears System was 4.5 billion years. to be affected by this tidal force. Evidence of mobile ‘ice In 1986, at the time of the return of Comet Halley, a fleet rafts’ on the surface of Europa indicates fluid flow. Tidal of spacecraft was despatched to encounter this celestial body stress may exceed the tensile strength of the ice, lead to and its environment. The Giotto spacecraft, in particular, went cracks in the ice and expose the liquid underneath, which deep into the coma, obtaining the first detailed picture of a 12 JOHAN BLEEKER, JOHANNES GEISS AND MARTIN HUBER
Figure 11 Left: the giant planet Jupiter with two of its Galilean moons, Io (reddish) and Europa (white). While these and other moons of Jupiter have nearly the same size as our Moon, they are tiny in comparison with their parent planet. Right: close-ups of Io and Europa, both of which exhibit geological activity that stems from powerful tidal forces exerted by massive Jupiter.
Figure 12 A collage of Saturn and its moons: Dione (foreground) and (from left to right) Thea, Enceladus, Tethys, Mimas and Titan. THE CENTURY OF SPACE SCIENCE 13
presence of astronauts onboard Skylab made it possible to use photographic film and return it to Earth for developing.) With access to a halo orbit around the L1 Lagrangian point on the Earth–Sun line, it became possible to combine remote- sensing observations with in-situ solar-wind measurements. The Solar and Heliospheric Observatory, placed in such an orbit, went a step further. It also enabled space observations by the method of helioseismology, and thus provided a detailed look into the solar interior as well (Figure 14). A complete picture of the structure and dynamics of the solar interior, the processes that maintain the high temperatures in the corona and the processes of solar-wind acceleration is thus emerging from this observatory. The riddle of whether the ‘missing’ solar neutrino flux at Earth should be explained by some exotic behaviour of the energy-generating solar core or by adapting the theory of elementary particles is now resolved, since the solar core does not have any unexpected properties. It is currently thought that the electron neutrinos generated in the core change their flavour while passing through the Sun and on their way from the Sun to the Earth. Consequently, since Figure 13 The nucleus of Comet Halley imaged by the the first large neutrino detectors on Earth were only capable Halley Multicolor Camera on board Giotto. of detecting electron neutrinos, the solar neutrino flux appeared lower since some of them had become µ- or - cometary nucleus and determining the composition of the gas neutrinos. and dust grains in the coma (Figure 13). This confirmed and Measurements by the Solar and Heliospheric Observatory considerably refined the so-called dirty snowball hypothesis, also confirmed that the solar ‘constant’ varies with the solar and also showed that comets have indeed preserved a wealth cycle. The solar irradiance was found to increase by about of virtually unchanged interstellar material, much like the 0.1% between solar minimum and maximum. material from which the Solar System was formed. Moreover, we now know that the Sun’s magnetic field supplies the energy that heats the corona. Waves which had previously been thought to contribute to the heating of the THE SUN corona were identified as the accelerating agent – through the ion-cyclotron mechanism – for the fast solar wind The Earth, its climate and the life it supports are all subjected (Figure 15) that emerges from coronal holes (i.e. from open to and governed by solar radiation and its subtle variations. magnetic field configurations). The astronomer, on the other hand, looks upon the Sun as a Coronal mass ejections, a phenomenon discovered by Rosetta Stone. Being the dominant object in the sky, it is Skylab, can now be followed far into interplanetary space, easily accessible to observations and has become a proving ground for advanced observing techniques. Indeed the first space experiments that went beyond investigating the ter- restrial atmosphere and ionosphere were devoted to record- ing solar ultraviolet spectra. Among the early astronomical satellites were the series of Orbiting Solar Observatories, followed in 1973 by the pioneering Apollo Telescope Mount on Skylab. From this first, albeit short-lived, space station the outer solar atmos- phere and the extension of the outer corona into the heliosphere could be investigated by a multiwavelength complement of imaging, spectroscopic and coronagraphic telescopes having focal lengths that enabled observations Figure 14 Temperature and rotation in the solar interior. down to arc-second resolution. (At a time when photo- The images show the deviations of these properties, as electric imaging detectors were not readily available, the derived from SOHO measurements, from current models. 14 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER
Figure 15 Dopplergram showing the (blueshifted) source Figure 16 A coronal mass ejection propagating into the regions of the fast solar wind, namely the boundaries and heliosphere. Stars are visible in this coronagraph image boundary intersections of magnetic network cells in a coronal whose field of view is thirty solar diameters wide. hole. and also as they propagate towards the Earth (Figure 16). water in circumstellar and interstellar environments. Water This provides a useful means of predicting disturbances in possesses a very large number of strong rotational transi- the Earth’s environment – a range of phenomena which tions in the far infrared, which suggest that it can act as a have been collectively termed ‘space weather’. Given the major or even dominant coolant in shocks and circumstellar increasing dependence of our civilization on communica- outflows. In particular, the Infrared Space Observatory tion, navigation and Earth-observing satellites – all vulnera- unveiled a host of water-emitting cosmic sites. Among them ble to major disturbances in the space environment – space are the asymptotic giant branch (AGB) stars, confirming weather forecasting is becoming another important service that water molecules are the dominant coolants of the stel- whose roots lie in space science. lar winds emanating from these stars (Figure 17). Moreover, the Infrared Space Observatory revealed, for the first time in a circumstellar medium, polyacetylenic THE MILKY WAY chains, including C4H2 and C6H2, and benzene (C6H6). From these observations it appears that carbon-rich protoplanetary The impact of the space age on the evolution of, and progress nebulae are capable of producing prebiotic matter in space in astronomical research has been huge, since the full breadth (Figure 18). of the electromagnetic spectrum and primary cosmic-ray par- While our picture of stellar evolution has been estab- ticle population became available as information carriers for lished with the aid of ground-based observations, progress cosmic diagnostics. Space observatories exploring the in the study of star formation has come to depend increas- infrared, ultraviolet, X-ray and gamma-ray wavebands have ingly on observations from space. Diagnosing the process revealed a dynamic and violent Universe harbouring a zoo of of star formation requires penetration into the interiors of exotic objects. Inevitably, we have to limit ourselves here to protostellar clouds, provided by measurements at infrared selecting a few scientific highlights from space-borne astron- and microwave frequencies. Observations from space have omy and obviously they reflect a subjective choice. played a crucial role in improving our understanding of star Although ground-based radio astronomy has unveiled a formation, notably in the form of spectroscopic measure- variety of molecular species, a major asset among the sci- ments from the Infrared Space Observatory, and astrometric entific harvest of space-borne spectroscopy in the infrared measurements and imaging from Hipparcos and the Hubble and far-infrared regions has been the molecular signature of Space Telescope, respectively. THE CENTURY OF SPACE SCIENCE 15
Figure 17 Water acts as a coolant in the stellar winds of AGB stars.
Figure 18 Carbon-rich protoplanetary nebulae as organic chemistry factories in space.
Observationally, it has now been confirmed that practically cate a great diversity of morphology in planetary systems: all low-mass stars like our Sun are born via the formation of a Jupiter-like planets are also found much closer to the cen- disk, which arises from the angular momentum conservation tral star than in our Solar System and they are, most likely, of the gravitationally collapsing protostellar cloud. The final formed first. Earth-like planets are believed to form later, dimensions of these disks are comparable to the size of our through the aggregation of a large number of planetesimals Solar System, and they have a mass approximately ten times a few kilometres in size. the mass of our ‘heaviest’ planet, Jupiter, which implies the Observations from space have not only helped to identify presence of sufficient material to form a planetary system. the early stages in the life of a star, but have also revealed Disks of planetary material orbiting newborn stars are the most extreme remnants of stars, namely stellar-mass called protoplanetary disks. The Hubble Space Telescope black holes. The strongest evidence for the existence of has recently obtained high-resolution images of such disks such black holes comes from observations of compact X-ray around young stars in the Orion Nebula, where they show binary sources in which the orbital velocity of the normal up as dark silhouettes in images obtained in visible light companion star can be measured from the Doppler shift of (Figure 19). Ground-based observations of exoplanets indi- its characteristic spectral lines at optical frequencies. 16 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER
Using Kepler’s laws, a quantity called the mass function, hole. So far about ten black hole systems (i.e. objects with f(M), of the system can be determined from measure- f(M) greater than 3M᭪) have been found in our Galaxy. The ments of the orbital period of the compact binary and the largest mass function was found for the recurrent nova V404 semi-amplitude of the radial velocity. This mass function is Cygni, a low-mass X-ray binary with a 6.5-day periodicity, the observational lower limit of the presumed black hole for which f(M) = 6.26 (Figure 20). mass. Fundamental physical arguments show that, among A remarkable achievement of space-borne astronomy very compact objects, only a black hole can have a mass in concerns astrometry, the oldest, most classical subdiscipline excess of 3 solar masses (M᭪), so if mass function values of astronomy: measuring precise positions, parallaxes and above 3M᭪ are found, the compact source has to be a black proper motions of stars to study the Galaxy’s structure and kinematics. The Hipparcos satellite has boosted this field by providing positions of nearly 120 000 stars to milli-arc- second accuracy, and established accurate distances for tens of thousands of stars. In fact, Hipparcos pushed the effective range for parallax measurements from 30 pc out to 300 pc. Accurate distances yield accurate luminosities for all kinds of stars, and consequently theories of stellar evolution can be tested much more rigorously (Figure 21). The kinematic data provided by Hipparcos through the measurement of proper motions coupled with accurate distances allows an in-depth assessment of the dynamic evolution of the Milky Way system. Among other things, these dynamical studies will display the interplay between gravity and pressure and the role of instabilities in our Galaxy. As mentioned in the opening paragraph of this chapter, the discovery of cosmic rays may be regarded as the first result of space research. Although this discovery took place Figure 19 Hubble Space Telescope picture showing disks in 1912, the fact that the primary radiation consists of of planetary material orbiting around young stars in the Orion charged particles coming from the depths of space became Nebula. The size of the solar system is indicated by the lower generally accepted around the middle of the century. Cosmic image in the lower left corner. rays observed at the surface of the Earth are mostly secondary
Figure 20 Radial velocity curve of the black hole system V404 Cygni. THE CENTURY OF SPACE SCIENCE 17
The modulation of cosmic rays by the reversal of the solar magnetic field in the course of its 22-year cycle has proved to be an important tool for investigating heliospheric processes and dimensions. It has been sug- gested that the modulation of cosmic rays may also influence – via ionization and nucleation in the troposphere – the formation of low-lying clouds, and may therefore represent a coupling mechanism between solar activity and terrestrial climate variations.
THE GRAVITATING UNIVERSE AND COSMOLOGY
The notion that most of the mass of the Universe is in a form we cannot see is among the most striking discoveries of contemporary science. As early as the 1930s, Fritz Zwicky pointed out that the visible mass we can observe is not sufficient to explain the motions of galaxies in clusters. The nature of this invisible mass, known as dark matter, is under intense discussion and investigation. Despite its invisibility, it can be detected through the effects of its gravitational field, which allows us to probe its distribution in and around galaxies and in galaxy clusters. Giant clusters of galaxies form in gravitational potential wells dominated by dark matter that extends well beyond Figure 21 Hertzsprung–Russell diagram, reflecting stellar the observed galaxy population. However, the dark matter evolution, for the 16631 single stars from the Hipparcos content and distribution can be probed by observing the hot Catalogue whose distances and colour magnitudes are known X-ray emitting intracluster gas that is gravitationally bound to within 10% and 0.025 mag, respectively. The colour indi- by the total cluster mass. Space-borne observations by cates the number of stars in each pixel of the diagram. X-ray telescopes play a pivotal role in furthering this research (Figure 22). An alternative method of probing the large-scale distribu- particles. Therefore measurements had to be made from tion and concentrations of gravitating matter in the Universe space to detect the primary radiation. (In the 1930s and is provided by gravitational lenses (Figure 23). Extragalactic 1940s, cosmic rays were the main source of particles for the gravitational lensing provides us with an ‘optical bench’ emerging field of elementary particle physics. Positrons, whose length is comparable to the radius of the observable muons, pions, kaons and hyperons were discovered as sec- Universe. And here a dissimilarity with traditional astro- ondary particles produced in the atmosphere by cosmic rays, nomical observations emerges most significantly: this and their lifetimes and modes of decay were determined.) method probes all mass – not only ‘the visible 10% of the Three principal sources of cosmic rays were identified iceberg’ (i.e. the luminous content of the Universe). namely (i) high-energy particles originating in the Galaxy, Confirmation that the enigmatic gamma-ray bursts are (ii) solar particles accelerated by shocks created by solar objects in the remote Universe, rather than transient sources flares, coronal mass ejections and the co-rotating interac- lying in the close galactic vicinity, came from observations tion regions, and (iii) the so-called anomalous component made from space. The location of such events in distant of cosmic rays. The latter are accelerated pick-up ions pro- galaxies, through the optical identification of the X-ray duced from the neutral interstellar gas flowing through the afterglows detected by the BeppoSAX X-ray satellite, heliosphere. Towards the end of the twentieth century, cos- showed that gamma-ray bursts are the most powerful mic-ray research, in conjunction with gamma-ray and X-ray sources of explosive energy released in the Universe since astronomy, became more and more important for localizing its very creation in the hot big bang. and studying violent processes in the Galaxy. Supernova rem- The amount of explosive energy release is currently best nants were identified as the main source of galactic cosmic explained by the so-called fireball model. This postulates a rays. However, it is not yet clear to what extent extragalactic cataclysmic event in which the gamma-ray emission arises sources contribute to cosmic rays of the highest energies. from internal shocks generated in a relativistically expanding, 18 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER
Figure 22 XMM-Newton image of the hot X-ray emitting gas in the Coma Cluster of galaxies.
Figure 23 Although gravitational lenses have been discovered from the ground, the sharper pictures obtained from space permit a refined interpretation of the information provided by the deflected light. THE CENTURY OF SPACE SCIENCE 19
Figure 24 Hypernova 1998bw. optically thick plasma cloud travelling at more than 99.99% of the velocity of light. Gamma-ray burst sources would thus seem to produce the most extreme form of cosmic acceleration. Some gamma-ray bursts appear to be associ- ated with a peculiar type of highly luminous supernova, a so-called hypernova, implying the explosive death of a very massive star in which a black hole is formed in the gravita- tionally collapsing core (Figure 24). The earliest picture of the Universe so far has been obtained by observing in the microwave region. The cos- mic microwave background radiation (i.e. the isotropic high-frequency radio emission) was discovered in 1964 by Figure 25 The microwave temperature structure of the sky. Arno Penzias and Robert Wilson, who used a ground-based antenna. It turned out that the early Universe was very smooth. The radiation in question was last scattered from produced in the big bang, but that carbon and all the heavier the universal primordial plasma when the Universe had an elements are synthesized in the interiors of stars. Spallation age of roughly 300 000 years. by cosmic rays contributes significantly only for some of The Cosmic Background Explorer satellite mapped the the extremely rare elements, such as lithium, beryllium and temperature distribution of the microwave background over boron (Figure 26). the entire sky with an angular resolution of several degrees The theory of stellar nucleosynthesis was almost exclu- and revealed tiny spatial fluctuations of the microwave radi- sively based on element and isotope abundances measured ation field (Figure 25). This implied the existence of slight on or from the ground, namely in meteorites and in the density perturbations, the first solid observational evidence solar photosphere. But deriving the primordial abundance confirming the simplest hypothesis for the origin of large- of the isotopes of hydrogen and helium, the main products scale structure, namely that it grew out of tiny primordial of the big bang, required measurements in space as well. density perturbations. The primordial helium abundance is one of the most The first ideas about the synthesis of chemical elements important sources of information about the early Universe. Its in the early phase of the expanding Universe were devel- value derived from observation agrees, with an uncertainty of oped in the 1940s. However, the processes and locations that only a few per cent, with the theoretical prediction. This play a significant role in nucleosynthesis were identified confirms that the laws of physics derived from laboratory only in the 1950s and 1960s. It became clear that the iso- experiments can be applied without change back to the time topes of hydrogen, helium and lithium were fully or partly when the Universe was as young as one second. 20 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER
had taken place, which is described by the cosmological constant ⌳ or another form of ‘dark energy’!
OUTLOOK
Space science has brought us more than new knowledge. Communication and even peaceful collaboration continued between space scientists and engineers living on both sides of the Iron Curtain, even in the darkest times of the Cold War. And space science not only helped to transcend borders between countries and political systems: it also led to intense intellectual exchanges across entrenched confines separat- ing disciplines. As far as everyday life is concerned, we should not forget that reliability considerations and quality control for space hardware – an absolute necessity in view of the unforgiving space environment – has produced a direct payoff in the widespread use of quality assurance in indus- trial production. Figure 26 The three principal production sites of nuclei. However, the link between basic research and applied Carbon and all heavier elements are produced in stars. The science still needs to be strengthened because emphasizing big bang yields only the lightest species. For species with 3 7 applications without underpinning them by basic research mixed origin, such as He and Li, the relative proportions is, in the long run, a dead end. Without science, there is no change with time and location in our Galaxy – and elsewhere. science to apply. A case in point is the highly politicized discipline of cli- The average density of the Universe as derived from deu- mate research: understanding the Earth System is required terium and 3He measurements is less than 10% of the critical to predict its reaction to changes. Action on the Earth density, which is the density that would ‘close’ the Universe System must not be taken unless it is based on trustworthy and, as mentioned above, it is also less than the matter predictions. On the other hand, present uncertainties in density that is needed to account for the forces that keep some of the quantitative predictions must not be used as an galaxies and clusters of galaxies together. Thus, by the end excuse for political inaction. The large increase in the quan- of the twentieth century it was becoming clear that there tities of greenhouse gases in the atmosphere caused by exists in the Universe an exotic form of matter that con- human civilization is well documented, and it has undoubt- tributes more to the total density than does the visible, edly begun to affect the climate. Further research is urgently baryonic matter. Observations of the subtle inhomo- needed, not for recognizing the acute danger – which is geneities in the cosmic microwave background radiation obvious – but for predicting the kind of change that will currently indicate that the exotic matter consists largely of most severely influence life on Earth. weakly interacting particles that seem to be much more At the beginning of the twenty-first century, the possibil- massive than protons. ities for science in space are by no means exhausted. Perhaps the most striking example of an outcome of the Emerging propulsion technology such as solar sailing will century of space science is the evidence that the cosmologi- expand the range which our spacecraft can explore. And we cal constant, ⌳, of Einstein’s general theory of relativity is are just beginning to deploy ‘quiet platforms’ which pro- not zero. In 1914, Einstein had to give a non-vanishing vide an essentially acceleration-free environment – often at value to this integration constant because the observational cryogenic temperatures – and thus enable us to perform data available at the time pointed to a static Universe. tests of the general theory of relativity and, probably within Consequently, it was necessary to account for the non- a decade, long-baseline interferometry from free-flying collapse of the Universe by giving ⌳ a finite value. After spacecraft. Extending the technique of interferometry into Edwin Hubble had discovered the expansion of the space will widen the diagnostic potential of space science Universe in 1928, ⌳ seemed to have lost its meaning. Only tremendously. With laser interferometry we shall be able to towards the end of the twentieth century did space observa- set up giant antennas in space for detecting low-frequency tions of remote supernovae indicate that the Universe had gravitational waves, opening an altogether new window on expanded more slowly in an earlier epoch, and therefore the Universe and making possible further, more penetrating that a long-lasting acceleration of the Universe’s expansion tests of general relativity. THE CENTURY OF SPACE SCIENCE 21
The technique of “nulling” interferometry, will enable us special pleasure to thank Clare Bingham, Livia Iebba, Lidy to observe terrestrial planets that orbit stars other than the Negenborn and Silvia Wenger for their untiring assistance, Sun, and study the spectra of their atmospheres and thus and Rudolf Treumann for his comprehensive and wise look for signatures of non-equilibrium that may be caused counsel regarding the material reported in this chapter. by life. Interferometry will, of course, also benefit other fields of astronomy. Once available in the X-ray domain, for example, interferometers will allow us to expose the event horizons of black holes and thus probe the limits of FIGURE CREDITS the observable Universe in objects that are not at cosmolog- ical distances. Figure 1. Photograph by the crew of Apollo-11. Planetary exploration will be a central theme of the space Figure 2. Royal Netherlands Meteorological Institute (KNMI) and ESA. effort in the twenty-first century. How this field develops, The data were taken by the Global Ozone Monitoring Experiment and how fast, will depend on available technologies as well (GOME) on ESA’s ERS-2 satellite. as scientific findings. Evolution and breakthroughs in the Figure 3. From: Hultqvist, B., Øieroset, M., Paschmann, G. and Treumann, fields of transportation and robotics will affect the relative R., (eds.) (1999). Magnetospheric Plasma Sources and Losses. In ISSI roles of human landings and automated exploration, includ- Space Science Series Vol. 6, Kluwer, Dordrecht, and Space Science ing sample return. And any progress in the search for current Reviews, 88, Figure 7.1, p. 371. or extinct life on other planets or satellites will have an enor- Figure 4. Upper left, Jan Curtis; upper right, STScI; lower left, NASA; mous influence on the ways and means of planetary explo- lower right, STScI. ration. The search for life in the Universe, and more specifically in the Solar System, is on. And space will play Figure 5. Painting by William K. Hartmann. an important role in the new field of astrobiology, which has Figure 6. NASA/Johnson Space Center. the potential to lead to a new age of scientific understanding. Astrobiology is also dissolving many boundaries between Figure 7. Photograph by the Apollo 15 crew, NASA Photo AS-15 9212430. disciplines because it requires the integration of several dis- Figure 8. Courtesy NASA and the Mars Orbiter Laser Altimeter (MOLA) parate fields of science, as for example astronomy, biology, Team of the Mars Global Surveyor project. From Kallenbach, R. Geiss, J. chemistry, Earth sciences and physics. and Hartmann, W.K. (eds.) (2001). Chronology and Evolution of Mars. In The outlook for a new century is always uncertain. Yet – ISSI Space Science Series Vol. 12, Kluwer, Dordrecht and Space Science Reviews, 96, Figure 2, pp. 466–467. excepting a major catastrophe that affects the physical world, or a radical change in the mode of thought of humanity – Figure 9. NASA. there is now such a momentum in technology development Figure 10. Data from the Magellan mission, NASA picture ID P-38724 that the outlook for space science is good. It is advisable not (excerpt); http://www.jpl.nasa.gov/magellan/gif/p38724.gif to make any specific predictions: indeed, possibilities often seem unimaginable. As pointed out in Chapter 3, on the Figure 11. Left: Voyager photograph, 13 February 1979. Upper right: Galileo Orbiter Mission (NASA), Catalog No. PIA 025727 (excerpt); enabling technology for space transportation, an author inset: http://www.planetaryexploration.net/jupiter/io/index.html. Lower writing at the beginning of the twentieth century would prob- right:http://photojournal.jpl.nasa.gov/cgi-bin/PIAGenCatalogPage.pl? ably not have foreseen the atom bomb, worldwide air traffic, PIA01127. the Global Positioning System, the Internet or, indeed, the Figure 12. http://www.jpl.nasa.gov/saturn/gif/saturn_montage.gif supreme reign of the computer. In this sense, we might expect new concepts for access to space to provide new Figure 13. From: Keller, H.U., Curdt, W., Kramm, J.-R. and Thomas N. impetus to the overall space effort – including space science. (1994). In R. Reinhard, N. Longdon and B. Battrick (eds.) Images of the nucleus of Comet Halley. ESA SP-1127, ESTEC, Noordwijk, Vol. 1, Figure 61, p. 63.
Figure 14. D. Gough and P. Scherrer, Chapter 44, and Acknowledgements http://sohowww.estec.esa.nl/gallery
The editors deeply appreciate the enthusiastic collaboration Figure 15. From: Hassler, D.M., Dammasch, I.E., Lemaire, Ph., Brekke, P., of those who have contributed to this reference work: Curdt, W., Mason, H.E., Vial, J.-Cl. and Wilhelm, K. (1999). Solar Wind the pioneers of space research and today’s leading space Outflow and the Chromospheric Magnetic Network. Science, 283, 810–813. science investigators. We acknowledge the initiative of Figure 16. Courtesy of SOHO/LASCO consortium. Peter Binfield at Kluwer, and his safe guiding of the project through the difficulties encountered in the editing Figure 17. From: Barlow, M.J., Nguyen-Q-Rieu, Truong-Bach, Cernicharo, J., González-Alfonso, E., Liu, X.-W., Cox, P., Sylvester, R.J., phase. We commend Hermine Vloemans’ firm and diplo- Clegg, P.E., Griffin, M.J., Swinyard, B.M., Unger, S.J., Baluteau, J.-P., matic leadership during the production process. It is a Caux, E., Cohen, M., Cohen, R.J., Emery, R.J., Fischer, J., Furniss, I., 22 JOHAN A.M. BLEEKER, JOHANNES GEISS AND MARTIN C.E. HUBER
Glencross, W.M., Greenhouse, M.A., Gry, C., Joubert, M., Lim, T., Figure 21. http://astro.estec.esa.nl/Hipparcos/TOUR/hrdiagram2.html Lorenzetti, D., Nisini, B., Omont, A., Orfeil, R., Péquignot, D., Saraceno, P., Serra, G., Skinner, C.J., Smith, H.A., Walker, H.J., Armand, C., Figure 22. http://sci.esa.int/img/e5/26085.jpg Burgdorf, M., Ewart, D., Di Giorgio, A., Molinari, S., Price, M., Sidher, S., Texier, D. and Trams, N. (1996). The rich far-infrared water vapour Figure 23. Hubble Space Telescope Image STScl-PRC00-08, spectrum of W Hya. Astron. Astrophys. 315, L241–L244. http://astro.hi.is/lens/pict_th.html
Figure 18. From: Cernicharo, J., Heras, A.M., Tielens, A.G.G.M., Pardo, Figure 24. From: Baron, E. (1998). How big do stellar explosions get? J.R., Herpin, F., Guélin, M. and Waters, L.B.F.M. (2001). Infrared Space Nature, 395, 635–636. Observatory’s Discovery of C4H2,C6H2, and Benzene in CRL 618. Astrophysical Journal Letters, 546, L123–L126. Figure 25. Cosmic Background Explorer Team.
Figure 19. http://antwrp.gsfc.nasa.gov/apod/ap961207.html Figure 26. From: Geiss, J. and von Steiger, R. (1997). Production of light nuclei in the early Universe. In A. Wilson (ed.) Fundamental Physics in Figure 20. From: Casares, J., Charles, P.A., Naylor, T. (1992). A 6.5-day Space, ESA SP-420, ESTEC, Noordwijk, pp. 99–106, Figure 1. periodicity in the recurrent nova V404 Cygni implying the presence of a black hole. Nature 355, 614–617. 11
JAMES R. ARNOLD* The Moon before Apollo
When Galileo first turned his telescope toward the sky, he this name was not well chosen. ‘Deserts’ might have been made two major discoveries. One, the four large moons of better, ‘lava plains’ better still. In this period also the moon’s Jupiter, is justly famous. The other, less familiar, is perhaps libration (see below), causing somewhat more than half the more important historically. He saw the Moon with enough lunar surface to be visible, was first noticed and used. resolution to conclude that it is not a ‘heavenly body’ as The next step in quantitative precision came with Tobias that term was understood, made of perfect and everlasting Mayer (Kopal 1969, pp. 241Ð2), who in 1750 recorded rather heavenly stuff, but a rough, cratered object more like the accurate coordinates for 23 reference points on the lunar sur- Earth Ð a real world rather than a figment of the human face. Even so, actual maps using these data and new observa- imagination. Its study moved from the field of theology to tions of similar precision did not appear until almost a century that of natural philosophy, now called science. later (Beer and Mädler 1837). In Jules Verne’s deservedly We are approaching the 400th anniversary of that impor- famous science fiction novels De la Terre à la Lune (1865) tant milestone in intellectual history. I give here a brief and Auture de la Lune (1870), it was Beer and Mädler’s maps account of lunar studies before the first Apollo landing, in that were brought to lunar orbit by his three intrepid astro- 1969. For this period I owe a great debt to Kopal (1969), nauts. They remained the standard well into the twentieth to which the reader is referred for a more complete and century. The only notable advance in this period was in the authoritative history. Our knowledge of the Moon up to measurement of altitudes of various features using shadow the modern era divides itself naturally, I believe, into two lengths in early lunar ‘morning’ and late lunar ‘afternoon’. strands. The first to develop historically was a series of suc- The second area of research which was within the means cessively refined maps of its visible surface. The second of students of the Moon before the space age had to do with was the growth of understanding of the orbital motion of its large-scale structure and dynamics. These generally the Moon, and of its physical nature. require much more precise quantitative study than the map- Galileo’s first published maps (Kopal 1969, p. 226) were ping of the surface, and so were slower to develop as areas very crude. No known features can be identified on them. of research. It is worth noting, however, that Newton’s cal- The first maps showing features recognizable to the modern culation of the gravitational attraction of the Moon to the student were those of M.F. van Langren in 1649 (Kopal Earth, by his own account ‘agreeing pretty nearly’ with an 1969, pp. 228Ð9), quickly followed by others. The maps of inverse square law of attraction, was a critical early step in G.B. Riccioli and his colleagues (Kopal 1969, p. 235) show his development of his theory of gravitation. Copernicus and other big craters clearly for the first time. The study of the Moon’s orbit may be said to have begun More importantly, most of the feature names given by with G.D. Cassini in the mid-seventeenth century. Reasonably Riccioli still survive today. He seems to have been responsi- precise values of the Moon’s orbital motions, including the ble for calling the low-lying, dark, relatively smooth areas inclination of its orbital plane to the ecliptic plane (about maria or ‘seas’, names which are still attached to these huge 5.14¼), the inclination I of its axis of rotation to its orbital basaltic flows. Although they are low-lying and rather flat, plane (about 1.53¼), the semimajor axis, the eccentricity, and quite a good ephemeris of its motion were determined by * University of California San Diego, La Jolla, CA, USA Lagrange and Laplace in the late eighteenth century.
James R. Arnold, The Century of Space Science, 271Ð275 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 272 JAMES R. ARNOLD
Darwin (1880) and some of his contemporaries opened surface, in fact about 59%, at one time or another. There is up a new scientific frontier by calculating the history of the also a physical libration, or actual rocking of the surface, Moon’s orbit, seeking light on the question of its origin. He but it is very much smaller. concluded that the moon has been and is receding from the We are ready, I think, to move on to the modern era. Earth, due to dissipative tidal forces exerted on the Earth by I choose to start the modern era as Harold Urey, the leading the Moon. This, he said, should cause the Earth’s rotation student of the moon ‘before Apollo’ always did, with the on its axis to slow over time as a consequence of conserva- publication of a remarkable book by Ralph Baldwin (1949). tion of angular momentum. Had the Moon ever been as In the brief span of 238 pages, Baldwin covered a lot of ter- close to the Earth as a few Earth radii, the length of our day ritory. First he reviewed the history of lunar studies to that would have been only a few hours. This analysis has proven point, with an excellent bibliography. Then he gave argu- robust. ments, clear and convincing to Urey and others able to By the early twentieth century three possible scenarios understand, for some key conclusions about the Moon. Not for the origin of the moon had been proposed and explored that he stopped the arguments! As it became apparent that to the limited degree possible with the meager data avail- they might soon be settled by direct observation, they in able. What we may call Darwin’s (1880) ‘fission hypothe- fact tended to grow more intense. sis’ held that the Moon had separated from the Earth when The most interesting of debates was over the mode of the Proto-Earth’s rapid rotation caused it to deform and origin of the abundant lunar craters, and of the circular become unstable. This idea had been embellished some- maria, such as Mare Imbrium and the ‘bull’s eye’ of Mare what, and the Pacific Ocean had been suggested as the scar Orientale. Baldwin (1949) summarized the nineteenth- of the fission process. An EarthÐMoon ‘double planet’ century literature on this topic, pointing out that the gener- scheme, analogous to the formation of double stars, had its ally accepted view, then and later, was that these features advocates. Finally, a capture model, in which the Moon was were volcanic. The other chief idea, that they were formed thought of as a small planet born elsewhere in the Solar by the impact of stray bodies, had been proposed quite System, and later captured in a close passage by the Earth, early, but was not taken seriously Ð it was even considered was seen as a possibility. While many papers on lunar a curiosity Ð by most students of the subject. origin were published over the decades, all three of these The one clear statement of the case for impacts before models continued to be advocated up to and even after Baldwin’s was given by a geologist famous in his time, the Apollo missions. They were widely considered to be G.K. Gilbert (1893). Baldwin (1949) put forward three the only three possibilities. It seems now that nature knew simple points. First, almost no lunar craters had anything better, but that is another story. resembling the conical outline familiar in terrestrial volca- While the fact that the Moon’s shape is not quite spheri- noes. There are only a few small, inconspicuous domes. cal was first deduced by Laplace, the first thorough study Second, the volume of the crater walls was always compa- was done by Jeffreys (1924). The Moon was shown to have rable, within the rather large errors associated with tele- a bulge pointed toward the Earth, accounting for its syn- scopic observations, to the volume of the central hole; chronous rotation. This was consistent at that time with averaged over many craters, the agreement was close. a model in which the bulge was caused by tides raised on Finally, while the larger craters usually show central peaks, a hot early Moon when it was much closer to the Earth. in no case does a central peak rise above the level of the However, the situation is not so simple. Moments of inertia surrounding terrain. of the Moon around three mutually perpendicular axes can The discussions of this and other questions were inter- be compared using observations made from Earth: the one rupted by a major event: the launch of Sputnik 1 on 4 called A, pointing toward the center of the Earth; C, close October 1957, followed on 3 November by the larger and to the rotation axis; and B, perpendicular to the other more ambitious Sputnik 2. On 31 January 1958 the USA two. They are all slightly different. (C A)/B is the best- responded by successfully launching the very small but determined ratio, given by Urey (1952) as 0.000629. The capable Explorer 1, which yielded James Van Allen’s dis- less certain values of C A and C B are certainly different covery of the Earth’s radiation belts. It was immediately from each other, so that to a first approximation the Moon obvious that great new possibilities for space research were is a triaxial ellipsoid. This was already known by the time about to be realized. For planetary scientists the Moon was of Urey’s book as inconsistent with a simple tidal model. the inevitable first target. The ellipticity of the lunar orbit, and the inclination Following the establishment of NASA in 1958, attention of its axis of rotation, combined with the parallax due to was given to the possibility of one or more lunar missions. the Earth’s finite size, give rise to the optical librations, Urey was particularly effective in advocating such missions. or apparent rocking, of the lunar surface in latitude and The first NASA committee to explore the subject was longitude. Thus we can see more than half of the Moon’s appointed early in 1959, with him as its most prominent THE MOON BEFORE APOLLO 273 member (and the author as a new recruit). What might be The first NASA lunar mission program was named called ‘the first Moon race’ then began. The most obvious Ranger. It began badly. Rangers 1 and 2 were engineering goal was to produce images of the then still invisible farside. test missions Ð both failed. Then missions 3Ð5, carrying The mare named Moscoviensis and the large, dark- cameras and scientific instruments and intended to floored crater named Tsiolkovsky will allow younger read- approach and impact the visible face of the moon, also ers to guess or remember who won that first race. Indeed, failed (in a different way each time.) However, the experi- the Soviet Union held the lead for a long time. After two ence gained seems to have been useful. Rangers 6Ð9, mission failures, their Luna 3 mission flew by the Moon on launched in 1964Ð65, were redesigned to carry cameras 8 October 1959 and obtained images of its hidden side, only. These four succeeded in producing the first close-up marking the first successful mission to another planetary images of the Moon, in the few seconds before impact. body. The pictures released were not very clear. This These images were clear and in focus. The final frames allowed the illusion to persist in the West for a while longer showed features smaller than one meter, sometimes much that the Soviet space probes were of poor quality, or even smaller. Some of these were craters, not very different in that the images were fraudulent. A scientist working with a appearance from the larger ones seen through Earth-based US intelligence agency undertook to evaluate the case. He telescopes. This was no surprise to those who believed they gave a very entertaining talk on the results. His conclusion were caused by impact of stray bodies, small fragments of was that the pictures were retouched ‘by experts, as good as asteroids or comets. They presented more difficulties to the ours’ before publication. He was sure that the Soviet scien- volcano party. tists actually had ‘genuine pictures better than the ones they One group of images displayed parts of bright rays origi- released.’ The better pictures obtained later in fact con- nating from one of the Moon’s youngest large craters. firmed their results. They could and did name the big fea- These were full of small pits which were elongated in a tures. The most notable difference between the nearside and common direction, appearing quite unlike the large primary the farside was the near-absence of maria on the latter. craters seen through Earth-based telescopes. There could NASA’s plans for lunar missions took some time to be little doubt that they were secondary impact features, develop. They were overtaken by President Kennedy’s formed as debris from the major crater-forming event speech to Congress in May 1961, announcing to the world moved out radially from it at subsonic speeds. It was also that he was committing the US Government to ‘placing a possible to conclude definitely what had already been man on the Moon in this decade and returning him safely to inferred from measurements of temperature versus time the Earth.’ All plans were restudied and, where appropriate, on the lunar surface (Baldwin 1949, p. 10, Figure 1 and revised. A manned flight on such a schedule required not accompanying discussion), namely that the Moon was not only the development of advanced capabilities at an accel- covered with bare rock. What the Ranger cameras saw was erated pace, but also their testing by precursor missions of fine-grained material. In general the surface in all three unprecedented scope and complexity. image sequences was remarkably smooth. Nonetheless, The new objective led to the announcement of three sets landing a spacecraft without human guidance was seen to of unmanned NASA missions to the Moon. The first, the have a finite risk of failure due to striking a rock. Ranger series, was at first largely unchanged from earlier Meanwhile the Soviet program was continuing to pro- plans. It included a series of impacts on the Moon’s visible duce important results and some more ‘firsts’. Two long face, with both scientific and reconnaissance objectives. The series of spacecraft, Luna and Zond, were launched second, the Surveyor series, was to land small instrument- throughout the 1960s and even later. Perhaps the most bearing spacecraft safely on the surface, again with multi- important was Luna 9, which was the first to soft-land on ple tasks. Both these programs were to be managed by the the Moon, on 3 February 1966, before the US Surveyor Jet Propulsion Laboratory in Pasadena, California. The program had begun. It transmitted pictures back to Earth of third, the Lunar Orbiter series, was to produce a photo- its landing site in western Oceanus Procellarum. Luna 10, graphic map of the Moon, nearside and farside, as complete which followed, was the first spacecraft to be put in orbit and detailed as possible. The goal was mainly to support around the Moon. the manned missions, but the data set would also be avail- This Surveyor series had an ambitious aim. The space- able for science. Responsibility for this was assigned to the craft were to land softly and upright on the lunar surface, Langley Research Center in Hampton, Virginia. on three extended feet. They carried not only cameras but It was a bumpy road we started down then. There were also other instruments (described briefly below). Their suc- numerous failures while scientists, engineers, and managers cess rate showed impressive progress since the Ranger learned their new roles. What is important here is not the series. Of the seven spacecraft launched, only Surveyors race, but what was learned and how that led to the remark- 2 and 4 failed to achieve their landings and scientific able scientific advances of the Apollo period. objectives. 274 JAMES R. ARNOLD
Surveyor 1 landed on 2 June 1966, at a near-equatorial Anthony Turkevich (1967). It used Rutherford scattering of western site not far from the crater Flamsteed in Oceanus alpha particles produced by radioactive decay (the same Procellarum.* This spacecraft and Surveyor 3 carried vidi- process Ernest Rutherford had used to demonstrate the exis- con TV cameras capable of producing a black-and-white tence of atomic nuclei). It could identify the nuclei, and TV image of 600 lines. The cameras could be rotated 360 hence the elements from which the particles were scattered, degrees, raised and lowered, and zoomed in and out. Over using the principle of conservation of momentum. Housed 10,000 pictures of the site (and the sky) were obtained, in a box open at the bottom, the instrument was deposited covering a full lunar day cycle in each case. on a flat spot on the surface. It accumulated a spectrum of A full report of Surveyor 1’s findings is given in Jaffe energies of backscattered particles over many hours of (1967), with an accompanying photographic section. The exposure. The concentrations of all major elements (abun- coverage of surface features was of course far more dance greater than 1%) in the top layer of the soil it sampled detailed than that available from the Ranger pictures (or the were then derived from their scattered energy spectrum. earlier Soviet ones). One fear was laid to rest by both Luna 9 The Surveyor 5 results clearly identified the material at and Surveyor 1: The two spacecraft did not sink into the this mare site as a (ground up) basaltic rock, comparable in lunar surface (confirmed to be finely divided material, now general to basalts on earth. However, it was unusual in hav- christened ‘regolith’) as one scientist had warned NASA ing a high concentration of the element titanium, 6Ð8% by that it would. Again, the surface was seen to be generally weight, which was beyond what geochemists were accus- smooth with low slopes, though there were rocks that could tomed to on the Earth. The result, though it evoked a good present serious obstacles to landing spacecraft. The princi- deal of skepticism at the time, was fully confirmed when pal scientific investigator was Eugene Shoemaker, in his the Apollo 11 soil from the same area was analyzed in the first important role in the lunar program. laboratory. This was the first chemical analysis of the com- Surveyor 3 was on the whole a repeat of Surveyor 1, position of an extraterrestrial material in situ. It was fol- enhancing the database of surface images and strengthening lowed by quite a few others, and all the data showed clearly the view that this sort of scene beheld by both craft’s cam- that the Moon is highly differentiated chemically. These eras might be typical of at least the lunar mare surfaces, and later analyses provided very useful constraints on the which because they showed fewer large craters were pre- origin and history of the Moon. ferred for the Apollo manned landings. In addition to the Meanwhile, the Soviet lunar program had not been idle. TV camera, Surveyor 3 carried a soil mechanics experi- A series of Luna missions and one mission called Zond ment, essentially a powered, fist-sized bucket with a range returned important data and images, particularly of the hid- of capabilities. In the course of about 18 hours it carried out den farside of the moon (e.g. Dolginov et al. 1967, dozens of tests of soil properties, including bearing Lebedinsky et al. 1967). strengths and response to impacts. Perhaps most important, Like the other two US mission series, the Lunar Orbiters it could excavate a trench to a depth of 17 cm and display were important both as precursors of the Apollo landings, the sampled material. Shoemaker was again chiefly respon- and for the database they provided. The method used to sible for these experiments. gather the data seems dated now, when CCD cameras and Surveyors 5, 6, and 7 landed at new and interesting spots other advances have made to job much easier. But at the on the nearside, respectively the prominent eastern Mare time it was ingenious, and above all it worked. So did the Tranquillitatis (already emerging as the most likely area for spacecraft, all five times. The first launch was in August the first manned landing), Sinus Medii near the center of the 1966, the last in August 1967. Moon’s face, and the northern rim of the crater Tycho in the Cameras with long rolls of narrow strip film were used to south. The best reference for the later Surveyor missions is take thousands of pictures of the lunar surface in a pre- Jaffe (1969). Camera and soil property instruments were planned pattern, both of whole bands of the lunar surface updated for these missions, but the most interesting new and of particular candidate landing areas at higher magnifi- development was the inclusion of an alpha-scattering instru- cation. These were developed chemically to bring up the ment for chemical analysis. This instrument was designed images, and then scanned with beams of light to produce and built at the University of Chicago by a group headed by streams of intensity data from which the black-and-white images could be reconstructed back on Earth. Sequences were designed to optimize lighting, so that shadows could help to bring out detail. Many months of exposures were * The convention used then and later reversed that used by astronomers in the preceding centuries, to one in which the direction is that seen from made and recorded. Well over 90% of the lunar surface was Earth, not as before by an observer on the Moon itself. It was adopted for eventually covered during the five missions, giving rise to a the convenience of the Apollo program. After Apollo the new convention remarkably valuable database of both the near- and farsides. was adopted by the international astronomical community. Thus, for example, Mare Orientale, which in Latin means ‘Eastern Sea’ is now at the Calculations showed that the spacecraft would remain in western edge of the visible hemisphere. orbit for a long time after the imaging had been completed, 12
HERBERT FRIEDMAN† From the ionosphere to high energy astronomy – a personal experience
In the first half of the 20th century, recognition of the storms are due to the magnetic action of the Sun; or to any existence of an electrified layer of the upper atmosphere dynamic action within the Sun, or in connection with hurri- that provided a mode for radio communication over great canes in his atmosphere, or anywhere near the Sun outside.” distances on the Earth grew with little sense of the role Furthermore he held strongly “that the connection between of invisible solar radiation in creating the electrical mirror. magnetic storms and sunspots is unreal and the seeming Solar radiation was thought to be black-body in spectral agreement between the periods has been mere shape characterized by a temperature of 6000 K with a max- coincidence.” Kelvin’s enormous prestige discouraged any imum in the yellow-green, trailing off rapidly in the infrared dispute and set back solar-terrestrial research for decades. and ultraviolet. None of this spectrum could produce signif- Toward the end of the 19th century Colonel Sabine of the icant ionization of the major constituents of the atmosphere. British army monitored a network of magnetic observatories X-rays were discovered by Wilhelm Röntgen shortly throughout the empire and noted that by “a most curious before the turn of the century and Guglielmo Marconi, a coincidence” the magnitude and frequency of magnetic dis- few years later, demonstrated trans-Atlantic radio communi- turbances was synchronized with the appearance and disap- cation via a high altitude, natural electrical mirror. Several pearance of sunspots. The direction of the compass needle ingenious physicists and electrical engineers pursued the swung in regular fashion over the diurnal cycle, but at times problems of radio reflection but had few clues to the nature the movements became more intense and rapid in the auroral of the solar radiation that keyed the phenomena of ionos- zone, giving rise to the name “magnetic storms”. It was com- pheric production and variability. The true nature of solar monly believed that interplanetary space was a vacuum and ionizing radiation remained a baffling puzzle until after that auroras were excited by direct streams of particles from WW II when the availability of rockets to carry detectors Sun to Earth unimpeded by any interplanetary medium. By directly into the ionosphere finally made the studies defini- timing the appearances of auroras and magnetic storms rela- tively diagnostic. Captured German V-2 (Vengeance) rock- tive to the visible outbursts of flares on the Sun, the travel ets while being studied by propulsion engineers were also speed was calculated to be about 800 km per sec, slower than turned from “weapons into plowshares”, when they were light, but consistent with concepts of particle streams. adapted to ionospheric studies. Friedrich Gauss, the great German mathematician- To preface the story of the modern era it is interesting to physicist-astronomer, as early as 1839, related fluctuations sketch the early ideas of radio propagation science. The per- in the compass needle to the passage of electric currents at ceptions of solar-terrestrial connections had developed high altitudes. In 1882, the Scotsman, Balfour Stewart, slowly over most of the 19th century. Lord Kelvin, one of defined these currents as a great dynamo of tidal move- the most influential physicists of his time, was adamant in ments of ionized air above 100 km that were driven by solar rejecting any notion (Kelvin 1892) “that terrestrial magnetic heating. The vertical movement was only 2 or 3 km, but it was sufficient to generate a great horizontal current sheet of electricity. Observations near the geomagnetic equator indi- † Naval Research Laboratory, Washington, DC, USA. Herbert Friedman died on 9 September 2000. The editors are indebted to Herbert Gursky cated circulating systems of electric currents of opposite for helping them to put the paper and its title into the final form symmetries in the northern and southern hemispheres. At
Herbert Friedman, The Century of Space Science, 277Ð286 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 278 HERBERT FRIEDMAN the equator the currents joined to form a strong flow from a critical value. At smaller angles the waves penetrated the west to east at 1100 local time, that Sidney Chapman, in the reflecting region and escaped into space. At night, skip early 20th century, named the Equatorial Electrojet. distances were greater than during the day and greater in The young Guglielmo Marconi at age 21, in 1895, built a winter than in summer in temperate latitudes. From these demonstration wireless telegraph on his father’s estate near simple observations Hulburt calculated the height of reflec- Bologna, Italy. On December 12, 1901 he transmitted a tion and the electron density (about 500,000 electrons and simple Morse code signal from England to Newfoundland, ions per cm3 at a reflection height of about 150 km). By a distance of 2900 km to the astonishment of most scientists 1926, Hulburt and Taylor were able to publish a remarkably who could not fathom how the waves, that were thought accurate account of the diurnal variation of ionospheric to travel in straight lines, could curve over the 160 km electron density. The work was almost coincident with high bulge of the surface of the earth. In 1902, Arthur E. Appleton’s and Barnett’s 1924Ð25 studies. Kennelly proposed that the radio waves were ducted around In the early years of ionospheric research, theorists the earth by an electrically conducting layer. Almost simul- speculated about particle radiation as a possible source of taneously, Oliver Heaviside reached the same conclusion ionization of the upper atmosphere. Confronted with an and the layer came to be called the Heaviside layer. With apparent 6000 K solar spectrum it was not possible to model the above background, the stage was now set for a more interactions of electromagnetic radiation with atmospheric focused scientific attack on the nature of a reflecting layer, constituents that would lead to the required ionization. now called the ionosphere, that eventually came to be asso- Within the space of a decade after the end of WW II, how- ciated with solar X-rays. In England, ionospheric research ever, the solar spectrum was revealed from its X-ray limit was lead by Edward Appleton and his student Miles Barnett throughout the ultraviolet with instruments carried on rockets [1925]; in the United States studies were conducted by to ionospheric height. Ionizing radiation was observed in E.O. Hulburt and A. Hoyt Taylor at the Naval Research X-rays and ultraviolet from the solar corona and chromos- Laboratory [1926] and by Gregory Breit and Merle A. Tuve phere, where temperatures range from hundreds of thousands at the Carnegie Institution [1926]. Successful experiments to millions of degrees K, and every spectral interval was and interpretations were achieved almost simultaneously in matched with its absorption at a particular height range. England and the United States, both groups working from Throughout this early epoch, some theorists still favored the ground. When German V-2 rockets were captured by solar particles as the source of ionizing radiation. Con- the Americans, they moved ionospheric research into space fronted with the apparent 6000 K temperature of the solar and outraced the ground-based competition. disk (Figure 1) it wasn’t possible to model atmospheric Appleton set out to determine the height of reflection interactions with solar radiation that would lead to the of a continuous wave with the cooperation of the British required ionization. At the higher chromospheric and coro- Broadcasting Co. whom he persuaded to provide him with nal temperatures shorter-wavelength extreme ultraviolet and a continuously varying signal from London at the end of X-rays would be produced but the particle concentrations the broadcast day so that he could detect the interference pattern of ground and sky waves at Oxford. He observed the elapsed time between emission and reception of the same frequency, as the broadcast frequency was oscillated back and forth. Starting with low frequencies, Appleton probed only the lower portion of the reflecting layer; work- ing later with higher frequencies, he distinguished layered regions of reflection, that he labeled D, E, and F. For these experiments, Appleton later received the Nobel Prize. Much of the early research by NRL scientists was char- acterized by an admirable simplicity and economy of means. Hulburt and Taylor cooped the partnership of radio amateurs around the world who used vacuum tubes with power outputs of less than 50 watts and very short radio waves, less than 200 meters, to communicate around the world. Transmissions skipped over a “zone of silence” encircling the transmitter to a distance of 30 to 50 km and at the same time were received out to distances of hundreds Figure 1 Solar spectral energy distribution from 2000 A to of kilometers. Hulburt and Taylor showed that the waves 7000 A compared with 6000 K black body sunlight above the were reflected only when the angle of incidence exceeded atmosphere. FROM THE IONOSPHERE TO HIGH ENERGY ASTRONOMY – A PERSONAL EXPERIENCE 279 in the solar atmosphere were estimated to be too thin (emission measure too low) to provide high enough intensi- ties to generate an E or F region. Appleton’s early measure- ments of vertical reflections between midnight and sunrise showed that substantial concentrations of electrons or ions persisted throughout the night, contrary to the prevailing idea that charges would disappear rapidly by attachment once the source of ionization was removed. Radio scientists were thus led to believe that an important portion of the ionization might be produced by corpuscles arriving equally by both day and night. Because a charged particle stream could not readily penetrate the earth’s magnetic field, seri- ous thought was given to neutral particle streams. In 1928, Hulburt proposed that ultraviolet radiation shortward of 1230 Å might be the source of the ionosphere. By 1930 he was intrigued by a possible connection between solar ultraviolet and sunspots and their link to magnetic storms and radio fadeout. In 1935, J.H. Dellinger at the U.S. National Bureau of Standards (NBS), summarized observations of a series of sudden ionospheric disturbances over a period of six months and stressed the importance of understanding the connection with solar activity. He pro- posed a joint effort of the NBS and the solar observatory on Mt. Wilson. During the same time frame, Robert H. Goddard was developing his rocket to carry instruments to high altitudes for atmospheric research. Correspondence Figure 2 A V-2 rocket just prior to launch at the White between Hulburt and John Fleming reveals that Hulburt Sands Proving Ground in New Mexico. About 45 feet tall contemplated solar rocket astronomy to understand the and 5 feet in diameter it was fueled by 10 tons of alcohol basic physics of solar control of the ionization. He noted and liquid oxygen. The rocket is shown connected by an umbilical cable to the firing line. To service the rocket and the theoretical match between atmospheric absorption of its payload a portable ladder was brought up. Only later solar soft X-rays and the altitude of ionization and sug- was a gantry provided from which each level of the rocket gested that a good test would be to fly photographic film could be reached comfortably. A successful flight could covered with thin aluminum foil or black paper in one of reach 170 km and last for 450 sec. (NRL.) Goddard’s rockets to detect X-rays. Those early glimmerings of high altitude research with rockets were interrupted by the war but the new technolo- about 120 km (Figure 3). It appeared that a thermal corona gies of the war were soon transferred to peaceful research. at one to two million deg C made a good fit with the ioniza- In 1942, Ernst Krause in the radio division at NRL under- tion requirement of the E-region. took to develop a program of guided missiles, specifically The Lyman alpha detector and the extreme ultraviolet a new version of the German V-1 buzz bomb known as the detector showed how those radiations shaped the bottom of JB-2 and the Lark, a rocket-propelled, guided ship-to-air the ionosphere and the upper part of the reflecting E-region. missile. At the end of the war, Krause pursuaded NRL to Lyman alpha, originating in the hot solar chromosphere at commit to a substantial effort in rocket development for 10,000 K and higher, contains most of the energy in the high altitude research which led to the resurrection of V-2 extreme ultraviolet and is absorbed between 75 and 90 km rockets late in the 1940s (see Figure 2). The first generation but does not interact with any of the major constituents, of successful studies of solar X-rays and extreme ultraviolet oxygen or nitrogen, atomic or molecular. Only later on did radiation began in 1949 when my NRL group flew a set M. Nicolet, the brilliant Belgian atmospheric scientist of Geiger counters sensitive to a narrow band of X-rays point out that it could ionize nitric oxide, a trace constituent centered at about 8 A, hydrogen Lyman-alpha (1216 A), present at only 108 molecules per cm 3. with almost 100% and the Schumann region, 1425 to 1600 A. As the rocket efficiency and thus have control of D-region. Radiation in climbed to an altitude of 150 km, the detectors pointing the Schumann region produced no ionization but played a normal to the spin axis swept the sky repeatedly. X-rays very important role in shaping the high ionosphere by dis- were detected above 80 km with increasing intensity to sociating molecular oxygen. By the process of dissociative 280 HERBERT FRIEDMAN
combination of a total solar eclipse and an array of rockets launched from the deck of a ship.
SOLAR FLARES
Of all the forms of solar activity, flares are the most spectacular. A solar flare creates a strong impact on the terrestrial environment, producing prompt shortwave radio blackout that may last for two to three hours. The after- effects may persist for one or two days in the form of great auroral displays, and ionospheric and magnetic storms that seriously degrade shortwave radio communications. The new arsenal of rockets that became available late in the 1950s made it possible to plan a program of solar flare studies. Although the supply of V-2 rockets was exhausted by 1952 it was replaced by smaller Aerobees and two staged Figure 3 The first measurement of the penetration of solar rockets that mated the Deacon with a Nike booster or a X-rays into the upper atmosphere made by a V-2 rocket in Skyhook balloon. The latter combinations were particularly 1949. The 8 A X-ray signal was modulated by the spin of attractive for studies that required a form of instant rock- the rocket as the Sun came into view once each roll period. etry. Launch from shipboard at sea offered range safety. By X-rays were first detected at about 90 kilometers and sailing downwind the ship could achieve nearly zero rela- reached peak intensity at about 130 km. (U.S. Naval Research tive wind conditions for inflation and release of the balloon Laboratory.) with its suspended rocket. The well deck aboard the U.S.S. Colonial measured 392 feet by 41 feet which we could recombination, molecular oxygen controls the rate of use to store three trailer-truckloads of helium while the neutralization of F-region electron density much more broad helicopter deck above served admirably for the bal- effectively than atomic oxygen. loon operations. The lumbering ship could make a speed of The 1949 measurement of harder X-rays (1Ð8 Å) led to 15 knots which was slower than the expected drift of the several years of broad band photometry of solar X-rays that balloon at altitude. To assure radio contact with the balloon extended the range of the spectrum, primarily with the aid payload we were assigned a destroyer, the U.S.S. Perkins of simple filters of beryllium, aluminum, titanium, mylar, that could track the balloon with a speed of 28 knots. formvar, etc. serving as the window materials of the photon Finally a crew of 650 sailors was tasked to man the ship for counters. It seems in retrospect that the NRL group was the naval chase. almost alone in that decade of pioneering studies of Each day as inflation began, the polyethylene balloon, the Sun. The X-ray spectral distribution from 1 Å to 44 Å most of it draped in tight folds resembling the stem of an resembled thermal emission from a thin corona at a temper- onion, rose 100 feet above the deck, crowned by a 20 foot ature of a few million degrees. Successive measurements at bulge filled with 5000 cubic feet of helium that would intervals of months to years showed flux variations of as expand further to thirty times that volume at altitude. The much as a factor of 7 for X-rays (8Ð20 Å) over the sunspot 12-feet long Deacon rocket dangled at the end of a 100-foot cycle. Such variability was consistent with ionospheric nylon line (Figure 4). At 80,000 feet, when a flare was electron density variations in the E-region, supporting a observed in visible light, a radio command would fire the direct connection between solar X-rays and E-region. But rocket and send it upward, piercing the balloon and rushing the observed variability over a solar cycle made it clear that ahead another 50 or 60 miles through the ionosphere. the concept of X-ray emission from a spherically symmetri- An NRL proposal for a naval expedition as part of the cal solar corona was very inadequate. Instead it seemed International Geophysical Year (IGY) to launch ten Rockoons that the corona was structured in condensations, formed for solar flare studies with the support of the USS Colonial, over sunspots, that produced enhanced X-ray emission. To an LSD with a large helicopter deck, was approved by resolve the question of spatial origin would require an the Office of Naval Research. Our ship was a sea-going X-ray scan of the solar disk or an X-ray photograph. Both drydock. The operational plan was to release a Rockoon methods were successfully applied at the end of the decade. each morning on ten successive days and allow it to float The X-ray photograph was obtained with primitive pinhole at 80,000 feet until the onset of a solar flare was detected, photography (Figure 6); the scan required a very special when it would be fired. One flare was successfully observed 13
BLAIR D. SAVAGE* Early ultraviolet spectroscopy from space
The period starting around 1950 and extending into the far-UV and UV bands; for a review of the beginnings of twenty-first century will probably ultimately be remembered EUV astronomy see Bowyer (1991). This delay resulted as the golden age of discovery in observational astronomy. mostly from the fear that the interstellar H I opacity in most This is because of the simultaneous occurrence of three tech- directions would be so large that only the stars closest to the nological advances: the ability to observe the entire electro- Sun would be observable. However, the highly irregular dis- magnetic spectrum, by combining observations from the tribution of H I in the local interstellar medium and the exis- ground and from above the absorbing effects of the Earth’s tence of the local hot bubble of ionized hydrogen rendered atmosphere; the creation of efficient and high-precision these fears invalid. EUV astronomy achieved its first photo- detectors of electromagnetic radiation; and the development metric detection of a stellar source during the ApolloÐSoyuz of computers for processing and manipulating large elec- mission (Lampton et al. 1976) and the first low-resolution tronic data sets. The field of ultraviolet (UV) astronomical EUV stellar spectra were obtained several years later spectroscopy at wavelengths shorter than 3100 Å has bene- (Holberg et al. 1980) as part of the Voyager 1 mission. fited from all three of these technological revolutions and In preparing this historical overview of developments in has required access to space for its very existence. the field of UV astronomy from space, I benefited substan- In this chapter I follow the development of the field of tially from reference to various papers summarizing the state UV astronomical spectroscopy from space over the period of UV astronomy over the past 40 years. These include the from approximately 1945 to 1980, with the emphasis on reviews by Friedman (1959), Wilson and Boksenberg spectroscopy of objects located beyond the Solar System. (1969), Wilson (1970), Bless and Code (1972), Code and The discussions concentrate on scientific developments in Savage (1972), Spitzer and Jenkins (1973), Boggess and the wavelength range from ~900 to ~3100 Å. The lower limit Wilson (1987), and Brosch (1999). of ~900 Å represents the wavelength where boundÐfree opacity of neutral hydrogen in the interstellar gas becomes large, and the upper limit of 3100 Å is where observations THE OBSCURING ATMOSPHERE begin to become possible from telescopes situated on the ground. The wavelength regions of 100Ð900 Å, 900Ð1200 Å, The attenuation produced by the Earth’s atmosphere in the and 1200Ð3100 Å are usually referred to as the extreme UV as a function of wavelength in angstroms is shown in ultraviolet (EUV), far-UV, and UV regions of the spectrum, Figure 1. The solid curve gives the altitude in kilometers at respectively. I shall not review the beginnings of EUV spec- which radiation normal to the atmosphere is reduced in troscopy from above the atmosphere since those beginnings intensity by a factor of 1/e. The various molecules and were delayed by a decade compared with activities in the atoms mostly responsible for the atmospheric absorption are indicated on the curve. The atmospheric absorption that *University of Wisconsin, Madison, WI, USA rapidly sets in near 3100 Å is produced by ozone (O3) while
Blair D. Savage, The Century of Space Science, 287Ð300 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 288 BLAIR D. SAVAGE
Figure 1 This figure from Friedman (1959) shows the altitude in kilometers at which the fraction of radiation at wavelengths shorter than the visible incident on the Earth’s atmosphere is reduced by a factor of 1/e. Strong atmospheric absorption due to O3 sets in at about 3100 Å. While in the mid-UV it is possible to carry out observations from high-flying balloons, satellite altitudes are required to observe celestial sources for the UV wavelengths below ~2000 Å. (By permission of the Journal of Geophysical Research.)
absorption at shorter wavelengths is mostly from O2,O,N2, atmosphere would have several major advantages over an and N. In the wavelength range ~2000Ð3000 Å it is possible observatory on the Earth’s surface. He noted that at optical to carry out UV observations from balloons carried to wavelengths the orbiting observatory would not suffer the altitudes of 30Ð40 km (see later section on Near-UV blurring effects of the Earth’s atmosphere. Therefore the Spectroscopy from Balloons). However, rocket or satellite orbiting observatory could produce images limited only by altitudes are required to observe at UV wavelengths shorter the quality of the telescope optics and its pointing system. than ~2000 Å. Even at rocket or satellite altitudes exceeding He also pointed out that the orbiting telescope would be 300 km, the trace atomic constituents in the atmosphere can capable of observing celestial objects over the entire electro- interfere with astronomical observations. In particular, the magnetic spectrum, including spectral regions that are not hydrogen geocorona extends many Earth radii from the sur- accessible to the ground-based observer. No professional face of the Earth, and the H I scatters sunlight in the Lyman astronomers took these ideas seriously when they were first series lines producing very strong emission, particularly in discussed, and over the subsequent 20 years the only other the Lyman , , and lines at 1215.67, 1025.72, and published speculations about observatories in space that I 972.54 Å. Similarly, Earth airglow emission can be strong in am aware of appear in the science fiction literature various O I transitions. Even at the altitudes of the major (Richardson 1940). However, everything changed after modern observatories such as the Hubble Space Telescope, World War II, in part as a result of the development and there is enough residual O I in the atmosphere for terrestrial capture of the V2 rocket technology and the start of the O I absorption lines to be apparent in high-resolution spectra Cold War between the United States and the Soviet Union. of hot stellar continuum sources. The first truly serious discussions of the potential scientific importance of placing satellite observatories instrumented for UV spectroscopy above the absorbing atmosphere are found THE SCIENTIFIC IMPORTANCE OF in an internal report entitled “Advantages of an Extra- ACCESS TO SPACE Terrestrial Observatory,” written for the Rand Corporation in 1946 by Lyman Spitzer reprinted in Spitzer (1997). Spitzer’s More than 75 years ago, Oberth (1923) pointed out that report discusses the science that could be pursued with several an astronomical observatory orbiting above the Earth’s types of orbiting observatories of different sizes. In addition EARLY ULTRAVIOLET SPECTROSCOPY FROM SPACE 289 to observatories designed to observe the Sun in the UV, he spectrometer operating from 912 to 2000 Å in order to obtain discussed the scientific potential of a modest 0.25 m (10- information about the physical properties of the gas between inch) reflecting telescope designed to obtain UV spectra of the stars?” In answering the question I was intrigued by the stellar sources, and the potential value of a large reflecting possibility of using a satellite observatory to study aspects of telescope operating at optical wavelengths. the interstellar gas (the hot phase) that were difficult or The scientific programs Spitzer proposed for this instru- impossible to study from the ground. I didn’t realize at the ment included: time that this assignment was one I would continue to work on for most of my professional career. ¥ studies of the composition of planetary atmospheres ¥ measures of the structure of stellar atmospheres, includ- ing the possibility of detecting expanding atmospheres in TECHNOLOGICAL CHALLENGES the strong absorption lines of C, N, and O ¥ measures of the color temperatures of hot stars The technological challenges faced by the pioneers of UV ¥ measures of stellar bolometric magnitudes spectroscopic space astronomy were considerable. They ¥ the analysis of eclipsing binary UV light curves to included gaining reliable access to space, developing small obtain information about stellar masses and atmospheric light weight UV spectroscopic optical systems, developing properties rocket and satellite pointing and control systems, and devel- ¥ using the improved understanding of stellar atmospheric oping efficient UV sensitive detectors. The most difficult conditions from the UV studies to improve on stellar dis- challenge was to design and build complex robotic systems tance determinations that would actually operate in the space environment. ¥ measures of the composition of the interstellar gas The progression through these various technological ¥ measures of the properties of interstellar absorbing grains challenges did not always proceed smoothly. As a beginning ¥ measures of the UV spectra of supernovae in order to bet- graduate student in 1964, I witnessed the activities around ter understand the explosion processes. Princeton University involving the pioneering projects in For each of these topics Spitzer discussed how observatories UV spectroscopy through the efforts of Donald Morton and above the atmosphere could be used to make major advances Lyman Spitzer and in optical diffraction limited imagery in the given science area. For example, in the case of measur- through the Stratoscope II balloon-borne telescope program ing the composition of the interstellar gas, Spitzer pointed involving Robert Danielson and Martin Schwarzschild. I out that in interstellar space most atoms and molecules were vividly remember discussions in the halls about rockets expected to be found in their ground (lowest-energy) state. blowing up on the launch pad, pointing and control systems Therefore measures of the composition of the gas through that failed to operate, rocket parachute systems that deployed absorption line spectroscopy would require an observatory too early, and balloon flights where the 3600 kg Stratoscope operating at UV wavelengths since most absorption lines out II telescope system failed to unlatch from the vertical point- of the ground state of abundant atoms fall in the UV. Slightly ing position, or where the sealed door on the pre-cooled more than 25 years after creating this vision for the future of 0.9 m diffraction-limited primary mirror failed to open at UV astronomy in space, Spitzer pioneered UV studies of the altitude. Reflecting on all these problems and wondering interstellar medium by using the high-resolution UV spectro- how long it might take to complete an experimental thesis in graph aboard the Copernicus satellite (see the section on the space astronomy, I made an arrangement with Robert Orbiting Astronomical Observatories). Danielson and Martin Schwarzschild to pursue a theoretical Shortly after the launch of Sputnik in 1957 and the organi- PhD thesis and to briefly join the Stratoscope II team fol- zation of NASA in 1958, several papers appeared discussing lowing my graduation to continue to explore my experimen- the science that could be pursued with UV spectrometers tal interests. The lesson I learned during this period was that operating above the Earth’s atmosphere, including those of the most difficult technical challenge faced by the early pio- Spitzer and Zabriskie (1959) and Code (1960). I find the neers in space astronomy was to achieve a high level of reli- Spitzer and Zabriskie paper, entitled “Interstellar research ability for remotely controlled robotic systems operating in with a spectroscopic satellite,” particularly interesting because the space environment. Even today, it appears that this is a my professional field of interest is the interstellar medium. I difficult goal to meet on a regular basis. remember referring to that paper in 1965 when taking Professor Spitzer’s course on “Physical Processes in the Interstellar Medium.” During the course I recall a particularly SOUNDING ROCKET SPECTROMETERS interesting class assignment. Professor Spitzer asked his students, “What observations would you obtain with a 1 UV space astronomy began as an experimental field at meter satellite observatory equipped with a high-resolution about the same time as Spitzer was considering its future. 290 BLAIR D. SAVAGE
Using captured German V2 rockets which could carry a 1000 kg of equipment to 150 km altitudes, scientists work- ing at the Naval Research Laboratory (NRL) led by Richard Tousey obtained the first UV spectrum of the Sun to wave- lengths as short as 2200 Å (Baum et al. 1946). The first UV stellar photometry followed about ten years later when NRL scientists observed 59 hot stars in a 130 Å wide UV band centered on 1115 Å (Bryam et al. 1957). These early UV stellar observations involved scans of the sky with simple photometer systems from unstabilized sounding rockets. The flight of a scanning objective grating spectrometer on an unstabilized Aerobee rocket produced the first low-resolution UV stellar spectrophotometry over the 1600Ð4100 Å region at 50 Å resolution (Stecher and Milligan 1962). In this observation the dispersion direction of the spectrograph was normal to the rocket roll axis. The roll of the rocket produced the spectral scan at the slit with a photomultiplier recording the spectrum. However, to cap- Figure 2 A schematic diagram of the simple UV spectrometer ture enough photons during the short intervals for which the and fine stabilization system flown on an Aerobee rocket by stars were in the field of view, the slit needed to be wide Morton and Spitzer (1966). The fine stabilization in the spec- and the resulting resolution of 50 Å was insufficient to trometer’s dispersion direction allowed the instrument to resolve discrete stellar absorption or emission lines. obtain UV spectra of and Sco from 1260 to 1720 Å at a The first moderate-resolution spectroscopic observations resolution of 1 Å. (By permission of Donald Morton and the of stars required the development of three-axis stabilization Astrophysical Journal.) systems for pointing rocket-borne instruments at stars for long enough periods of time to record the spectra. The of the paper by Morton and Spitzer (1966) reporting the pointing control system developed by the Space General observations of first line spectra of stars in the UV, Corporation was a three-axis stabilized gas reaction system On the first two flights the attitude control system failed that could be used to orient the entire sounding rocket dur- to stabilize the rocket. On the third flight both coarse ing its free fall. This system could point the payload within and fine systems worked properly, but the parachute 3 of a desired direction and was stabilized to a limit cycle failed on re-entry so that the impact damaged the pay- jitter of 15 arc minutes. In their pioneering experiment, load beyond repair and admitted light into the film cas- Morton and Spitzer (1966) employed an additional fine sta- settes. Most of the films were totally fogged, but bilization system as part of their instrument package which underdeveloping one from the calcium fluoride camera improved the pointing in the spectrograph dispersion direc- showed wide spectra of the early B-type stars and tion to approximately 16 arc seconds. Their instrument Sco with a resolution of 1 Å. (Figure 2) consisted of two 1200 lines/mm objective plane gratings followed by f/2 Schmidt cameras with 100-mm I can recall the gloomy mood after this third flight among focal lengths and 10 fields. One Schmidt corrector was the returning Princeton scientists and engineers when they made of calcium fluoride and transmitted to 1250 Å; the discussed their totally blackened images. However, that other corrector was quartz and transmitted to 1700 Å. The mood quickly changed to one of joy when the last piece of resulting UV spectra had a dispersion of about 65 Å/mm film was underdeveloped, and revealed clear spectra of and were recorded on UV sensitive Kodak Pathe SC5 film. and Sco extending from ~1260 to 1720 Å. This small pair of UV spectrometers produced spectra with One year after the success of obtaining the first moderate- a resolution of approximately 1 Å when the fine stabiliza- resolution UV spectra of stars in Scorpius, Morton (1967) tion platform achieved its design pointing stability in the obtained high-quality spectra of six stars in Orion at 3 Å dispersion direction. resolution from 1150 to 1630 Å. Those data provided the The pioneering attempts to obtain UV spectroscopic first evidence for high-speed mass loss from hot stars in the observations of stars did not achieve instant success. form of P Cygni profiles of the Si IV and C IV stellar During these learning stages there were often problems absorption lines (Figure 3). This result must have pleased associated with making the attitude-control pointing sys- Lyman Spitzer since in his 1946 Rand Corporation study he tems work properly. For example, quoting from the abstract remarks that, “In addition the nature of unusual stellar 14
MARTIN HARWIT* The early days of infrared space astronomy
INTRODUCTION D.A. Allen’s even earlier book adds an Australian’s view- point (Allen 1975). The author hopes that contributions Infrared observations can provide a number of unique such as these may eventually be synthesized to produce perspectives on the Universe. This article concerns itself with a dispassionate, comprehensive, truly international, and the men and women who pioneered infrared space astronomy, perhaps more broadly incisive account. traces the different techniques they employed, describes the trials and tribulations they had to overcome, and lists some of the gains they achieved. In the process it also examines the The cold universe social institutions, both academic and military, that were most Much of the Universe is cold. The wavelength, , at which influential in determining the evolution of the field. stars, galaxies, and interstellar or intergalactic gases radiate While many infrared and submillimeter observations is inversely proportional to their temperature, T (Figure 1). required going into space, a large number of others were The peak emission tends to occur at 3700/T micro- carried out from the ground. The earliest successes of meters, where T is measured in kelvin (K). One micrometer infrared astronomy were largely the work of ground-based is a millionth of a meter, traditionally called a micron and astronomers. Going into space required major technological abbreviated to m (or simply in the early literature of breakthroughs that took time to materialize. A rich texture the field). Observations at infrared wavelengths from 1 of sometimes friendly, occasionally aggressive, competition to 1000 m are thus uniquely sensitive to astronomical among ground-based, balloon, airborne, rocket, and satel- sources in the temperature range from ϳ3000 K to 3 K. lite observers emerged, that persists to this day. These include the coolest stars, planets, and interplanetary As an active participant in the field, the author makes no dust, circumstellar and interstellar matter, and, at the claims to objectivity. Nor is he able, in a chapter of twenty longest wavelengths and coldest temperatures, the earliest or thirty thousand words, to give proper credit to all who known radiation emitted by the entire Universe. made significant contributions. The story told here is over- whelmingly based on personal experiences in the USA. Many of the struggles faced in the USA, however, were A dusty universe duplicated in Europe, Japan, and the Soviet Union. An excellent review written more than 20 years ago by Interstellar dust Ð microscopic particles composed of ices, J.E. Beckman and A.F.M. Moorwood (1979) may provide minerals, and common organic and inorganic materials Ð is the reader with a complementary European perspective, and abundant throughout the Universe. These particles obstruct a clear view of the cosmos at optical wavelengths. Fortunately, *Professor Emeritus of Astronomy, Cornell University, Ithaca, NY, USA; a cloud of dust that may be totally opaque in the visible or Former Director, National Air and Space Museum, Washington, DC, USA ultraviolet can be virtually transparent in the infrared. Thus,
Martin Harwit, The Century of Space Science, 301Ð330 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 302 MARTIN HARWIT
The chemical universe The infrared band contains the spectral signatures of a vari- ety of atoms, molecules, ions, and solids, at least some of which are found in any astrophysical environment. Infrared spectroscopy allows us to recognize large varieties of sub- stances, ranging from cool ices in the interstellar medium to highly excited ions in active galactic nuclei. By determin- ing the strengths of different spectral features, we obtain important, often unique insights into the chemical and physical conditions in these systems.
INFRARED ASTRONOMICAL CONSTRAINTS
The human eye ceases to respond to radiation beyond the red part of the spectrum. This limit falls at a wavelength of Figure 1 Blackbody radiation spectrum as a function of tem- ϳ0.72 m. The infrared spectral domain begins approxi- perature. As the temperature T decreases by a factor of 10, the mately at a wavelength of 0.7Ð1 m, and stretches out to wavelength of peak emission increases by the same factor and ϳ1000 m ( 1 mm). The wavelength range between the total power emitted over the entire wavelength range ϳ200 m and 1 mm is often called the submillimeter domain. 4 decreases by a factor of 10 . While the atmosphere tends to be opaque to much of the infrared wavelengths can probe regions deep in the core of infrared spectral band, observations can be carried out from a dusty galaxy that are inaccessible at shorter wavelengths. high mountain tops in a number of atmospheric windows. Dust grains are heated by short-wavelength, mainly visi- The lowest panel of Figure 2 shows the approximate widths ble and ultraviolet radiation. They absorb this energy, and and depths of the atmospheric windows at an observatory re-radiate it at infrared wavelengths. Because of this effi- like Mauna Kea, Hawaii, sited at an altitude of 4.2 km. cient conversion, the majority of the radiant energy emitted As seen from the upper panels, atmospheric absorption by dense, dusty regions, such as star-forming clouds and decreases and transmission increases considerably with occasionally entire galaxies, lies at infrared wavelengths. increasing altitude, and becomes quite high throughout Since the transparency of dust clouds increases at longer much of the infrared domain at aircraft and balloon alti- ϳ ϳ wavelengths, red light is transmitted more readily than tudes, respectively, at 14 and 28 km. blue. Stars lying behind a tenuous dust cloud thus appear For the astronomer, good atmospheric transmission is reddened. This reddening can be a useful measure of the not enough. Low atmospheric emission is also essential. amount of dust in the cloud. The atmosphere radiates powerfully throughout much of the infrared, and this emission is often orders of magnitude greater than the radiation from astronomical sources. The The early universe intensity of atmospheric emission also tends to fluctuate In the expanding Universe, the more distant an object the with time. In the near-infrared, at 1 4 m, emission greater the velocity with which it recedes from us. This cos- by OH radicals is highly variable. In the far-infrared, at mic expansion shifts starlight from distant galaxies into the around 100 m, variable atmospheric water vapor content infrared region; the more distant the source, the farther is its produces varying emissivity. Such fluctuations cause the radiation shifted into the infrared. The expansion is character- flux, or signal, from an astronomical source to be marred ized by a redshift parameter, z, where 1 z gives the ratio of by superposed noise due to randomly varying atmospheric “wavelength reaching the observer” divided by “wavelength emission. actually emitted in the distant source”:1 z received/ emitted. Poor atmospheric transmission, and strong atmospheric The most distant quasars and galaxies observed to date emission are the prime reasons for undertaking infrared have redshifts z Ͼ5, so that radiation from the center of the astronomical observations from space. However, in the visual band is shifted to beyond 3 m. Because 1 z also early 1960s when serious efforts to conduct such observa- represents the factor by which the Universe has expanded tions were beginning, many of these factors were poorly since the radiation was emitted, objects at z 5 are seen as understood. This led to parallel developments in infrared they were at an epoch when the Universe was only one- astronomy. Most astronomers worked with ground-based sixth of its present size. equipment taken to high mountain tops, while a few THE EARLY DAYS OF INFRARED SPACE ASTRONOMY 303
Figure 2 Atmospheric absorption in the infrared at altitudes of balloons, aircraft, and a mountain-top site like Mauna Kea (after Traub and Stier 1976). attempted to build the instrumentation required to carry out date back to those of Sir William Herschel (1738Ð1822), observations from above the atmosphere. who noted the thermal emission of the Sun beyond the visi- Since rocket payloads were relatively expensive, US ble spectral range, rapid progress did not occur until the ground-based observers feared that the National Aeronautics 1960s. Two factors limited serious advances: a lack of suffi- and Space Administration (NASA) or National Science ciently sensitive detectors and a lack of motivation. Foundation (NSF) support for rocket work would drain In 1838 William Herschel’s son John determined the total away funding from their own efforts. Uncertainties about power emitted by the Sun. His results, obtained by measur- the precise limitations of ground-based techniques often led ing the rise in temperature of water in a blackened vessel to acrimonious disputes between ground-based observers on which was focused sunlight, were remarkably accurate and those who wanted to launch sensitive infrared astro- given the simplicity of his apparatus. He found that the Sun nomical telescopes above the atmosphere on rockets. Time radiated with enormous power. This posed a problem. Julius and again, the more traditional astronomers bluntly recom- Robert Mayer’s principle of conservation of energy, enunci- mended that rocket-borne efforts be slashed or pared back ated in 1845, led to the realization that the source of this to an absolute minimum. Pioneering infrared rocket astron- energy had to be huge or the Sun very young, otherwise the omy was not a happy venture! Sun’s energy would long ago have been depleted. Though To understand the backdrop against which the first suc- many sought to resolve this puzzle, a satisfactory explana- cessful observations were obtained, one needs to understand tion did not emerge until the twentieth century and the not only the technical difficulties that had to be overcome, development of theories of relativity and nuclear physics. but also the considerable early successes of ground-based Thirty years after John Herschel’s monumental finding, observers who also had to surmount significant instrumen- William Huggins used a thermopile connected to a gal- tal limitations but were able to make headway more quickly vanometer in the far more difficult attempt to measure the in more modest efforts. heat emitted by stars (Huggins 1869). (A thermocouple is a device consisting of strips of dissimilar metals that generate a voltage when one end of the device is heated.) These mea- EARLY GROUND-BASED EFFORTS surements were not particularly convincing, but Huggins’s contemporary E.J. Stone developed the technique further While the earliest attempts to conduct observations of (Stone 1870). He used two thermopiles back to back, alter- astronomical sources beyond the red part of the spectrum nating the positioning of a star first on one thermopile and 304 MARTIN HARWIT then on the other, in a mode that we would today call If an astronomer could secure a particularly sensitive and “pushÐpull chopping” (Figure 3). He also calibrated his stable detector, it was possible to undertake novel infrared system against a radiating cube kept at the temperature of measurements. One observer who took good advantage of boiling water. With these steps he obtained more credible these detectors was Harold L. Johnson who, with various results. co-workers, pioneered the UltravioletÐBlueÐVisualÐRedÐ Struggles to perfect increasingly sensitive instrumenta- Infrared (UBVRI) photometric system still in wide use. The tion persisted. In the early decades of the twentieth cen- I band lies at 0.9 m. Johnson also made observations in tury, pioneering work on better radiometers enabled the K band at 2.2 m and at longer wavelengths. He set out W.W. Coblentz at the US National Bureau of Standards to to obtain accurate photometric data on several thousand report radiometric measurements of 110 stars, and a series bright stars and soon realized the importance of reddening of collaborations between Edison Pettit and S.B. Nicholson and extinction by interstellar dust (Johnson 1965). The gen- provided greater insights into planetary and lunar phenom- eral attitude of his time, however, remained that infrared ena (Coblentz 1914, Pettit and Nicholson 1922). astronomy would continue to be merely an adjunct to optical The history of infrared detector development, however, work. Stars, after all, were known to be visible objects; and was largely guided not by astronomers, but by military the Universe appeared to be largely an aggregate of stars. needs, such as “night vision” enabling warm objects to be Given the still-formidable technical difficulties, most discerned in the dark. World War II brought about the astronomers found little motivation for embarking on a development of lead sulfide (PbS) cells that were far more career in infrared astronomy. sensitive than thermopiles. In the post-war era, Peter Felgett This attitude changed with the work of Gerry Neugebauer in Britain was one of the first to use these devices to and Robert W. Leighton at Caltech, who decided to conduct measure the fluxes from bright stars (Felgett 1951). Later, an unbiased survey of the sky at 2.2 m, where the atmos- the Cold War further accelerated military development of phere is transparent and PbS is sensitive. Since the amount infrared sensors. One particularly successful detector, of observing time that would be required was great, and indium antimonide (InSb), came into use at wavelengths no telescope was available for the purpose, Leighton and out to 5 m. At longer wavelengths, the military was devel- Neugebauer constructed their own. Much of their inex- oping copper-doped germanium (Ge:Cu) and gallium- pensive instrumentation was war-surplus, but the 1.57 m doped germanium (Ge:Ga) photoconductors. All these (62-inch), f/1 parabola they conceived was made from alu- devices worked at their best only over a limited bandwidth, minum hollowed out on a lathe. The surface obtained in this 1 ϳ Ð roughly / 3. To obtain continuous coverage over a fashion was not sufficiently accurate, so they poured a layer broader bandwidth a succession of different detector mate- of epoxy into the paraboloid, spun it up around its vertical rials had to be employed, each cryogenically cooled to its axis and kept it spinning until the epoxy had set. Properly own optimum operating temperature. coated, this gave a reflecting surface good enough to pro- duce images of the order of 2 arc minute resolution. This was satisfactory, since their eight-element PbS array pair- wise scanning the sky in a pushÐpull chopping mode would have to cover 30,000 square degrees. This meant that the beam employed would, of necessity, have to be sizeable. To assure themselves of the performance of their appara- tus, Leighton and Neugebauer cross-calibrated against stars already known from Johnson’s work. But they also hoped to go further. They were no longer restricted to pointing at visible stars and might find sources that had no optical counterparts. The final results of their survey were not to be published until the late 1960s. But while the work was in progress the two researchers and one of their students discovered a num- ber of truly remarkable sources. A letter to the Astrophysi- cal Journal (Neugebauer et al. 1965) broke the news. It was Figure 3 “Push–pull chopping” of a source by switching the electrifying! field of view of the telescope between an astronomical source and its surroundings on two opposite sides. By subtracting the The motivation for such a survey is to obtain an unbiased emission from the surroundings, a first attempt is made to census of objects that emit in the 2.0Ð2.5 m atmos- correct for atmospheric emission above the telescope. pheric window [which] might reveal many potentially 15
KENNETH NORDTVEDT* Verification of general relativity: tests in the Solar System
In 1900, Isaac Newton’s worldview of gravity, space, and decades. But when the post-World War II years brought time still prevailed Ð that the gravitational force was a uni- forth a stream of new technical abilities to launch space- versal, direct, and instantaneous action-at-a-distance between craft and to send ranging signals out into the Solar System, the masses of the Universe, that bodies and light rays and other supporting technologies, experiments were car- moved through an “absolute space, in its own nature, with- ried out with ever-increasing precision to confirm and out anything external” whose geometric structure was quantitatively measure the full variety of novel phenomena rigidly Euclidean without end, and that the dynamics of in the Solar System predicted by GR. physical law unfolded with respect to an “absolute, true, Although the Newtonian model of Solar System dynam- and mathematical time (flowing) equably without relation ics had been quite successful when used to discover the to anything external” (Newton 1687). Through the twentieth planet Neptune from what seemed to be unexplained per- century that edifice was overthrown and replaced by Albert turbations in the observed motions of the planet Uranus, Einstein’s general relativity (GR) perspective Ð that gravity and when it explained the Moon’s numerous orbital “irreg- is an interaction transmitted by a causal and dynamic field ularities” which result from the competition of the Sun’s whose sources are all forms of energy, including its own gravitational influence with that of Earth on that satellite, contributions, and which then acts elsewhere upon the same; the Newtonian system still faced problems, such as a robust and that the metrical relations between the clocks, rulers, anomaly found in the observed precession rate of the major and signals throughout the cosmos are dynamic, non- axis of Mercury’s orbit, and the failure of Albert Michelson Euclidean, locationally dependent, and established by the and Edward Morley to detect any change in the speed of fields of gravity. The detailed structure of metric gravita- light passing through their interferometer as the Earth tional field components in the Solar System has in all cases changed its velocity through the cosmos. Newtonian gravity been found to match the predictions of GR in a variety of and cosmology also faced theoretical challenges. This was experiments which primarily employed radar and laser rang- brought into focus in the late nineteenth century by Ernst ing between Earth and other planets or spacecraft. Mach, physicist and positivist critic of several concepts in The first half of the twentieth century was occupied physical law, who labeled the notion of absolute time “an mainly by the construction and calculational exploration of idle metaphysical conception” (Mach 1893), and stressed Einstein’s theory which he built upon the foundations of that what is observed in nature and experiment is the behav- James Clerk Maxwell’s electromagnetic field theory, his ior of clocks, not the unfolding of “time” as such. He also own special relativity theory, and then his Equivalence asserted that it was the relative feature of local motion with Principle. Further exploration and application of the theory respect to the distant “fixed stars” of the Universe that was to a variety of gedanken experiments, temporarily beyond empirically meaningful, not the notion of absolute motion the reach of experimental test, continued in subsequent relative to “space” as such. Einstein acknowledged the influence of Mach’s ideas in his formative years. Describing * Montana State University, Bozeman, MT, USA some consequences of his new theory Ð that the inertia of a
Kenneth Nordtvedt, The Century of Space Science, 335Ð352 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 336 KENNETH NORDTVEDT mass was increased by the proximity of other matter, and that gravitational field; and that clocks should tick slower the accelerated or rotating matter induced corresponding acceler- deeper they are in a gravitational potential, by a given frac- ations or rotations of nearby inertial frames, Einstein (1922) tional rate which is independent of their internal structure. pointed out that, “We must see in (these examples) a strong Measuring these predictions of the Equivalence Principle, support for Mach’s ideas as to the relativity of all inertial and checking the principle’s foundational phenomena Ð the actions.” A variety of such Machian effects indeed inspired universality of gravitational free-fall rates Ð to the highest some of the experimental tests of GR and alternative theories achievable precision continue to be at the core of the exper- which were finally carried out in the late twentieth century. imental program to test GR in the Solar System. In line The result of these tests suggested by the Machian perspec- with the field paradigm of modern physics, if the founda- tive is a more comprehensive empirical basis for theory. tion or predictions of the Equivalence Principle were found Einstein still had Mach in mind when he derived from his to be violated, this would most likely signal the existence of theory the possibility of a topologically closed Universe a previously unseen interaction field in physical law which without spatial boundaries: “If we think these (Mach’s) ideas is not “transformed away” in freely falling laboratories. For consistently through to the end we must expect the whole the purposes of discovering any such inverse-square (or inertia, that is, the whole (gravitational) g -field, to be deter- very long Yukawa range) interaction field whose strength of mined by the matter of the Universe, and not mainly by the coupling to matter is very weak compared with the gravita- boundary conditions at infinity.” tional coupling strength, the contemporary space experi- In taking on the challenge of incorporating gravity into ments designed to test the Equivalence Principle are his special-relativity structure of physical law, Einstein unsurpassed instruments. worked to replace the instantaneous, action-at-a-distance The universality of free fall also gave Einstein an impor- force of Newtonian gravity with an interaction between sep- tant clue to the type of field which could be the basis of arated matter which was carried by a dynamical field. The gravity and to the attribute of matter to which it must cou- successful electromagnetic field theory of Maxwell was his ple. From special relativity he learned that a body’s inertia guide. This view of the field-mediated interaction has equals its total energy content: E mc2. If the gravitational triumphed throughout physical law in the twentieth century. force on bodies were to be in universal proportion to the In this paradigm, two bodies interact by a staged, causal bodies’ inertial masses, then a gravitational field’s coupling process; one body is source of a dynamical field, the field strength to bodies must also be proportional to the bodies’ then spreads out in space and time from its origins in accor- energy contents. This led to consideration of a single scalar dance with its own dynamical laws, and finally another body field (tensor of rank 0) or a second-rank tensor field g of located elsewhere is acted upon by the resulting field found 10 potentials, being symmetric in its indices and which at its location. range over the four dimensions of space and time, as trans- In the search for a gravitational field, Einstein was pro- mitters of the gravitational interaction; either field could foundly influenced by the empirical fact that gravity accel- couple rather naturally to the energy content of bodies. (By erates bodies at a rate which does not depend on their contrast, the field that transmits the electromagnetic interac- chemical composition or other internal properties Ð the rate tion between charges is a first-rank tensor field A of four is apparently universal. First noted and studied by Galileo potentials, and the other interactions of physics, nuclear and and others four hundred years ago, tested by Newton to a weak, have also been found to be based on multiplets of one-part-in-a-thousand precision by comparing motions of first-rank tensor fields.) Through the decade 1905Ð15, scalar differently composed pendulums, and tested in Einstein’s and tensor gravity were both explored. For a variety of rea- time by Roland von Eötvös to precisions of a few parts in a sons Ð empirical implications and predictions, theoretical billion with torsion balances supporting different sub- uniqueness, consistency, and completeness Ð the general stances in Earth’s gravity, this strange and apparently exact theory of relativity emerged based on a pure second-rank proportionality between the inertia of bodies and the tensor gravitational field g (r, t) (Einstein 1916). strength of gravitational force they experience led Einstein, Special relativity, by itself, suggests that the motion of still short of a theory, to make a grand hypothesis. Para- bodies interacting gravitationally should deviate in detail phrasing him: since a local laboratory falling freely in grav- from Newtonian form. Since a body’s momentum was now ity “transforms gravity away” as far as the physics in that known to contain the speed-dependent modifications of laboratory is concerned, the freely falling laboratory must special relativity, application of the law of motion in the in all phenomenological respects be locally equivalent to an presence of a force inertial frame (even though it accelerates relative to the dis- tant inertial frame). From this Equivalence Principle, p Einstein was able to predict two novel phenomena: that d d m v f dt dt 2 2 light rays should deflect downward when passing through a 1 v /c VERIFICATION OF GENERAL RELATIVITY: TESTS IN THE SOLAR SYSTEM 337 yields corrections of order v2/c2, even before considering also be found to occur in a local gravitational field of the force side of the equation or further modifications to the equivalent acceleration. (Einstein 1907) momentum of the mass. But just like electromagnetism, As one example of Einstein’s ingenious reasoning which GR’s 10 gravitational potentials produce a total force followed from this principle, consider two identically con- between bodies which includes corrections from the static structed clocks freely falling in gravity and separated by a situation (with details depending on the motion of one or small vertical distance. Light signals triggered by each tick both of the interacting objects). A further novel feature of of the higher clock are sent down to the lower clock which, GR’s interaction is its nonlinearity: the gravitational force because of its gravitational acceleration and the finite veloc- due to a sum of sources is not simply the sum of the indi- ity of light, is always receiving the signals at a Doppler- vidual forces; additional forces come into play which are shifted rate which is lower than the transmitted rate. But by proportional to the product (and higher powers) of the the Equivalence Principle, the observed phenomena should source masses. All of these modifications, fractionally char- 2 2 2 be identical to what occurs when two separated clocks at rest acterized by the factor v /c or Gm/c r, are very small in inertial space exchange signals Ð the frequency of signal throughout the Solar System, each amounting to about transmission recorded by the transmitting clock equals the 8 2.5 10 in the case of Mercury’s orbit, for example. frequency of signal as recorded by the receiving clock. This General relativity is compatible with the possible exis- will occur only for the gravitationally free-falling clocks if tence of additional, very weakly coupled long-range fields the lower clock ticks slower than the upper clock by an which have so far escaped detection. The presence of such amount needed to compensate for the Doppler shift. fields and their interactions with matter will produce phe- Unlike Newton, who had no underlying theory for the nomena which either violate the Equivalence Principle and origins and magnitudes of mass (beyond its simple additiv- metric foundations of GR, or which diverge from the pre- ity), and who therefore on empirical grounds could simply dictions of pure tensor gravity at the post-Newtonian level. adopt the equality of inertial and gravitational mass for all Testing GR can therefore also be viewed as the search for objects, Einstein had found from his special theory of rela- any “new” long-range interaction fields in physical law. tivity that a body’s inertial mass was equal to its total This chapter reflects this interpretation of the twentieth energy content, E mc2, and he was therefore led to seek a century’s achievements. theory of gravity in which the gravitational field’s coupling strength to a body was also naturally and generally propor- tional to the body’s total energy content. Achieving this THE EARLY FOUNDATIONAL YEARS took him over a decade. But more immediately, using only his Equivalence Principle, he predicted two novel phenom- The Equivalence Principle ena. Using the argument outlined in the previous paragraph, Soon after formulating his theory of special relativity in he predicted that the rate of any laboratory clock located in 1905, Einstein turned to the task of properly incorporating a gravitational potential U(r) will differ from the rate of an gravity, the other known force of that time, into physical otherwise identical clock located elsewhere at gravitational law. His goal was to replace Newton’s instantaneous action- potential U(r ): at-a-distance with a causal field theory of gravity, Ј Ј analagous to the electromagnetic theory of interaction f f m U(r) U(r ) i , U(r) G (1) between charged particles transmitted by the Maxwell 2 f c i r ri fields. He was also profoundly influenced by the apparent fact that the gravitational forces on bodies were in universal where c is the speed of light, G is Newton’s gravitational proportion to the bodies’ inertial masses. Because of the constant, and mi are the masses responsible for the gravita- resulting identity of free-fall rates at a given location, a tional potential. Using similar forms of argument, he also localised laboratory freely falling in gravity would appear concluded that a light ray’s propagation direction cˆ will be as an inertial frame, though accelerating relative to other deflected by the transverse part of any gravitational acceler- distant inertial frames. He elevated this well-confirmed fea- ation field g through which it propagates: ture of gravity to a grand hypothesis Ð his Equivalence . ˆˆ Principle: dcˆ g g cc , g(r) U(r) (2) ds c2 All local phenomena seen in a laboratory freely falling in gravity are equivalent to phenomena in a gravity-free These effects are actually closely related. If the locally inertial frame, or conversely, phenomena present in a measured speed of light is to be a universal constant, but gravity-free but accelerated frame of reference and clock rates are universally diminished in gravitational understood from the basic kinematics of that frame must potentials, then the globally viewed speed of light must also 338 KENNETH NORDTVEDT
diminish in those gravitational potentials: from mathematician and friend Marcel Grossmann, Einstein ultimately built his theory of gravity upon a dynamical sec- 2 c(r) c (1 U(r)/c ) ond-rank tensor field of 10 potentials Ð g (r, t) Ð symmet- ric in its indices and , which each range over the four with c being the speed of light where the gravitational spacetime coordinates. Finding physically acceptable and potential of local bodies is negligible. As in any medium consistent field equations for these gravitational potentials, with an inhomogeneous speed of wave propagation, this including their coupling to matter, consumed several speed of light function then results in the downward deflec- years of labor. In the finished form of the theory, the gravi- tion of light wavefronts at the rate indicated by eqn (2). tational potentials fulfill a set of second-order, nonlinear Einstein soon realized that his prediction of light deflec- partial differential field equations for which the entire tion could be experimentally tested. If the path of light stressÐenergyÐmomentum tensor of laboratory matter T is between a distant star and an observing telescope on Earth the source of gravity: passes the Sun at distance of closest approach D, the inte- grated deflection angle will be approximately (Einstein 1911) 8 G G T c4 2GM 0.84 arcsec for a grazing ray c2D where G is a tensor constructed from the ten gravitational potentials, the square of their first partial derivatives, and The angular locations of such star images would therefore their second partial derivatives with respect to the space and move away from the Sun’s location and closer to the images time coordinates. This tensor is found to be unique by the from the less distorted regions of the starfield whose light necessity to be mathematically consistent with the property did not pass so close to the Sun. Erwin Freundlich led an that its source tensor T fulfills the traditional conservation expedition to southern Russia in the late summer of 1914 to laws of energy, momentum, and angular momentum under measure the light deflection by photographing the starfield the appropriate local conditions (Einstein 1916). The theory during an eclipse of the Sun. But World War I was breaking also specified how matter and other fields respond to out, and the expedition personnel were arrested and detained the gravitational fields; for example, an atom or object of by the Russian authorities. Such an experiment was not to negligible size will move between two spacetime locations be carried out until the occurrence of the next post-war on the trajectory r(t), which gives an “extremal” value to eclipse in 1919, a delay which permitted completion of the action integral Einstein’s theory and a change in the prediction. As early as 1907, Einstein was aware of the outstanding anomaly in Newtonian Solar System dynamics, and he S g (r, t) d x d x (3) sought to account for the discrepancy in his new theory. The accumulated astronomical observations of the previous cen- performed along that trajectory; here dx c dt,dr for tury had shown that the secular precession rate of the planet 0, 1, 2, 3. The specific mass of the atom is absent from this Mercury’s orbital major axis Ð the advance of its perihelion Ð action integral because of the identity of gravitational and amounted to 574 arcsec/century. This was a quality observa- inertial mass; a geometrical interpretation of these trajecto- tion aided by the relatively large eccentricity of Mercury’s ries as the extremal paths in a curved Riemannian space- orbit (e 0.2). But only 531 arcsec/century of this precession time geometry established by the field potentials g can could be accounted for by the perturbing gravitational accel- then straightforwardly follow. erations from the other planets in the Solar System, and no In late 1915, just weeks before arriving at the final form other explanations within Newtonian gravity emerged which of his theory, Einstein succeeded in using a slightly incom- remained plausible. It seems likely that the failure during plete version of his gravitational field equations to calculate the decade 1905Ð15 of several preliminary versions of a the static, spherically symmetric tensor gravitational field gravitational field theory to naturally account for this excess in the empty space surrounding the Sun; he then used the 43 arcsec/century precession led Einstein to the further con- equation of motion which results from the action integral siderations from which general relativity was formulated. given by eqn (3) to obtain the relativistic motion of the planet Mercury in that gravity field. Having no free the- oretical parameters to adjust, his new theory nevertheless explained the anomalous 43 arcsec/century perihelion General relativity and Mercury’s anomalous advance with good accuracy. Owing to the theory’s intrinsic perihelion advance nonlinearity, the Sun’s gravitational field included an Guided by the physical insights gained from his Equivalence important correction to the Newtonian inverse-square field: Principle and special relativity theory, and with formal help it varied as the inverse third power of distance and was 16
CLIFFORD M. WILL* Verification of general relativity: strong fields and gravitational waves
At the time of the birth of general relativity (GR), experi- them. The period began with an experiment to confirm the mental confirmation was almost a side issue. Einstein did gravitational frequency shift of light (1960) and ended with calculate observable effects of general relativity, such as the the reported decrease in the orbital period of the binary pul- deflection of light, which were tested, but compared to the sar at a rate consistent with the general relativity prediction inner consistency and elegance of the theory, he regarded of gravity-wave energy loss (1979). The results all sup- such empirical questions as almost peripheral. But today, ported GR, and most alternative theories of gravity fell by experimental gravitation is a major component of the field, the wayside (for a popular review, see Will 1993a). characterized by continuing efforts to test the theory’s pre- Since 1980, the field has moved toward what might be dictions, to search for gravitational imprints of high-energy termed a Quest for Strong Gravity. The principal figure of particle interactions, and to detect gravitational waves from merit that distinguishes strong from weak gravity is the astronomical sources. quantity ϳ GM/Rc2, where G is the Newtonian gravita- The modern history of experimental relativity can be tional constant, M is the characteristic mass scale of the divided roughly into four periods: Genesis, Hibernation, a phenomenon, R is the characteristic distance scale, and c is Golden Era, and the Quest for Strong Gravity. The Genesis the speed of light. Near the event horizon of a non-rotating (1887Ð1919) comprises the period of the two great experi- black hole, or for the expanding Universe, ϳ 0.5; for neu- ments that were the foundation of relativistic physics Ð the tron stars, ϳ 0.2. These are the regimes of strong gravity. MichelsonÐMorley experiment and the Eötvös experiment Ð For the solar system, 10 5; this is the regime of weak and the two immediate confirmations of GR Ð the deflection gravity. Figure 1 displays these regimes and various phe- of light and the perihelion advance of Mercury. Following nomena of interest. Notice that the strong-gravity regime this was a period of Hibernation (1920Ð1960) during which (near the diagonal line corresponding to 0.5) encom- relatively few experiments were performed to test GR, and at passes Planck-scale physics where quantum gravity and the same time the field itself became sterile and stagnant, grand unification of the interactions are important, all the relegated to the backwaters of physics and astronomy. way to the observable universe. But beginning around 1960, astronomical discoveries Until the discovery of the binary pulsar, the empirical (quasars, pulsars, cosmic background radiation) and new foundation of general relativity rested on tests in the weak- experiments pushed GR to the forefront. Experimental field regime. In Chapter 15, Kenneth Nordtvedt Jr. surveys gravitation experienced a Golden Era (1960Ð1980) during experimental tests of general relativity primarily from the which a systematic, worldwide effort took place to under- point of view of Solar System and laboratory experiments. stand the observable predictions of GR, to compare and While most of these focus on weak-gravity effects, some contrast them with the predictions of alternative theories can be viewed as strong-gravity tests, primarily experi- of gravity, and to perform new experiments to test ments to search for the relics of Planck-scale physics in fundamental interactions, as might show up in tests of the * Washington University, St. Louis, MO, USA equivalence principle.
Clifford M. Will, The Century of Space Science, 353Ð372 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 354 CLIFFORD M. WILL
Today, much of the focus has shifted to experiments These tests mainly involve black holes, neutron stars, and which can probe the effects of strong gravitational fields cosmology, and make essential use of gravitational radia- (see Figure 1). At one extreme are the strong gravitational tion as a carrier of information about strong gravity. Even fields associated with Planck-scale physics. Will unification though the gravitational waves that bathe the Earth are of the forces, or quantization of gravity at this scale leave extraordinarily weak (and are themselves describable by observable effects accessible by experiment, even those the weak-field limit of general relativity), they often carry that themselves are performed under weak-field conditions? the imprints of strong-gravity phenomena occurring near Dramatically improved tests of the equivalence principle or black holes, neutron stars, or from the early Universe. of the “inverse square law” are being designed, to search We begin (Section 1) by defining the distinction between for or to bound the imprinted effects of Planck-scale phe- strong and weak gravity, and providing (Section 2) an intro- nomena (see Chapter 15). At the other extreme are the duction to the general relativistic description of systems strong fields associated with compact objects such as black involving compact relativistic objects and gravitational radi- holes or neutron stars or with the universe as a whole. ation. In Section 3 we focus on tests of gravitational theory Astrophysical observations and gravitational-wave detectors using stellar binary systems of compact objects, the most are being planned to explore and test GR in the strong-field, famous of these being the binary pulsar PSR 1913 16. highly dynamical regime associated with the formation and Section 4 focuses on the likely direct detection of gravita- dynamics of these objects. tional radiation by Earth-based antennae in the first decade In this chapter, we shall focus on tests of general relativ- of the twenty-first century, and the possible tests of gravita- ity involving strong gravity and gravitational radiation. tional theory that could emerge. In Section 5 we briefly discuss other tests of strong gravity. We use the standard “geometrized” units of general rela- tivity textbooks (Misner et al. 1973), in which Newton’s PRESENT 1050 gravitational constant G and the speed of light c are unity. In UNIVERSE these units, mass and energy have units of length, with the MILKY WAY basic scale of length set by the solar mass: ML 1.476 km, 1040 UNIVERSE AT and velocities are dimensionless, representing fractions of HELIUM FUSION the speed of light. In the few places where indices are used, SUN Greek indices will run over the four spacetime dimensions, 1030 GPS ORBIT {0, 1, 2, 3}, with the 0 denoting time; while Roman indices NEUTRON will run only over spatial indices {1, 2, 3}. 20 STAR Rather than provide complete references to work done 10 INSIDE BLACK HOLES in this field, we will refer the reader where possible to the PRIMORDIAL BLACK appropriate review articles and monographs, specifically 10 HOLE EVAPORATING 10 TODAY to Theory and Experiment in Gravitational Physics (Will HUMAN 1993b), hereafter referred to as TEGP. References to TEGP
MASS IN GRAMS UNIVERSE AT MEASUREMENT OF 1 END OF INFLATION will be by chapter or section, e.g. “TEGP 8.9”. NEWTON’S G THE QUANTUM GRAVITY SCALE –10 1 STRONG-FIELD SYSTEMS IN GENERAL 10 STRAND OF DNA RELATIVITY PROBED BY BEST –20 ACCELERATORS 10 1.1 Defining weak and strong gravity HYDROGEN ATOM In the Solar System, gravity is weak, in the sense that the 10–30 10 20 Newtonian gravitational potential 10–30 10–20 10–10 1 10 10 DISTANCE IN METERS ϵ Ј Ј 1 3 Ј U(x, t) (x , t) x x d x (1) Figure 1 Strong v. weak gravity. Representative phenomena in is much smaller than unity (recall G c 1) every- gravitational physics are indicated by filled circles; other illus- 2 trative phenomena where gravitation plays little role are shown where. So, too, are the quantities v and p/ , where v is a by filled squares and italics. Phenomena above the diagonal typical velocity of bodies in the Solar System, and where line are unobservable, because they take place inside black p and are typical pressure and density, respectively, inside holes. Phenomena close to the diagonal line are in the strong- Solar System bodies. In fact U ϳ v2 ϳ p/ ϳ . Throughout gravity regime. the Solar System, the metric of spacetime deviates only VERIFICATION OF GENERAL RELATIVITY: STRONG FIELDS AND GRAVITATIONAL WAVES 355
slightly from its flat-spacetime Minkowski form prior to a merger and collapse to a final stationary state. diag( 1,1,1,1). By expanding the metric about this form Binary inspiral is one of the leading candidate sources for in powers of the small parameter , one can obtain the detection by a worldwide network of laser interferometric Newtonian limit of general relativity at lowest order, and gravitational-wave observatories nearing completion. A the first “post-Newtonian”, or “1PN”, corrections to proper description of such systems requires not only Newtonian gravity. The metric in such an expansion takes equations for the motion of the binary carried to extraor- the schematic form dinarily high PN orders (at least 3.5PN), but also requires equations for the far-zone gravitational waveform mea- 2 g00 1 2U(x, t) O( ) sured at the detector, that are equally accurate to high PN orders beyond the leading “quadrupole” approximation. 3/2 g0i O( ) (2) Of course, some systems cannot be properly described gij ij O( ) by any post-Newtonian approximation because their behav- ior is fundamentally controlled by strong gravity. These where the first terms beyond the Minkowski metric and the include the imploding cores of supernovae, the final merger 2 3/2 Newtonian potential at O( ) in g00, O( ) in g0i, and O( ) of two compact objects, the quasinormal-mode vibrations in gij, respectively, are the post-Newtonian terms. It turns of neutron stars and black holes, the structure of rapidly out that, in a wide range of alternative theories of gravity, rotating neutron stars, and so on. Phenomena such as these these post-Newtonian terms vary from one theory to the must be analyzed using different techniques. Chief among next only in the values of certain numerical coefficients. these is the full solution of Einstein’s equations via numeri- By inserting arbitrary dimensionless parameters in place cal methods. This field of “numerical relativity” is a rapidly of the numerical coefficients, one obtains a parametrized growing and evolving branch of gravitational physics, framework that encompasses many theories at once, and is a whose description is beyond the scope of this article (see, powerful tool for analyzing Solar System experiments. This e.g., the articles in Marck and Lasota 1997). “parametrized post-Newtonian” framework is described in more detail in Chapter 15; see also TEGP 4. In strong-field systems, this simple 1PN approximation 1.2 Compact bodies and the strong equivalence is no longer appropriate, for several reasons: principle ¥ The system may contain strongly relativistic objects, such When dealing with the motion and gravitational-wave as neutron stars or black holes, near and inside which ϳ 1, generation by orbiting bodies, one finds a remarkable sim- and the post-Newtonian approximation breaks down. plification within general relativity. As long as the bodies Nevertheless, under some circumstances, the orbital motion are sufficiently well separated that one can ignore tidal may be such that the interbody potential and orbital veloci- interactions and other effects that depend upon the finite ties still satisfy U ϳ v2 1 so that a kind of post- extent of the bodies (such as their quadrupole and higher Newtonian approximation for the orbital motion might multipole moments), then all aspects of their orbital behav- work; however, the strong-field internal gravity of the bod- ior and gravitational-wave generation can be characterized ies could (especially in alternative theories of gravity) leave by just two parameters: mass and angular momentum. imprints on the orbital motion. Whether their internal structure is highly relativistic, as in ¥ The evolution of the system may be affected by the emis- black holes or neutron stars, or non-relativistic as in the sion of gravitational radiation. The 1PN approximation Earth and Sun, only the mass and angular momentum are does not contain the effects of gravitational radiation needed. Furthermore, both quantities are measurable in back-reaction. In the expression for the metric given in principle by examining the external gravitational field of the eqns (2), radiation back-reaction effects do not occur until bodies, and make no reference whatsoever to their interiors. 7/2 3 5/2 O( ) in g00, O( ) in g0i, and O( ) in gij. Consequently, Damour (1987) calls this the “effacement” of the bodies’ in order to describe such systems, one must carry out a internal structure. It is a consequence of the strong equiva- solution of the equations substantially beyond 1PN order, lence principle (SEP), which is satisfied by general relativ- sufficient to incorporate the leading radiation damping ity, but is violated by almost all other gravitational theories, terms at 2.5PN order. including scalarÐtensor gravity (BransÐDicke theory and its ¥ The system may be highly relativistic in its orbital generalizations). SEP states that (i) all bodies fall in an motion, so that U ϳ v2 ϳ 1 even for the interbody field external gravitational field with the same acceleration and orbital velocity. Systems like this include the late (modulo tidal interactions), (ii) the outcome of any local stage of the inspiral of binary systems of neutron stars or test experiment is independent of the velocity of the (freely black holes, driven by gravitational radiation damping, falling) apparatus, and (iii) the outcome of any local test 356 CLIFFORD M. WILL
experiment is independent of where and when in the on the internal structure of each body. For compact bodies Universe it is performed. The term “bodies” here includes such as neutron stars and black holes, these internal struc- everything from laboratory-sized objects to black holes; ture effects could be large; for example, the gravitational and the term “experiments” includes everything from elec- binding energy of a neutron star can be 40% of its total tromagnetic experiments to measurements of Newton’s G. mass. General relativity satisfies SEP because it contains one, At 1PN order, the leading manifestation of this effect is and only one, gravitational field, the spacetime metric g . a violation of the equality of acceleration of massive bodies Consider the motion of a body in a binary system, whose such as the Earth and the Moon. This effect, known as the size is small compared to the binary separation. Surround Nordtvedt effect, has been tested by lunar laser ranging (see the body by a region that is large compared to the size of Chapter 15 or TEGP 8 for further discussion). the body, yet small compared to the separation. Because of This is how the study of orbiting systems containing the general covariance of the theory, one can choose a compact objects provides strong-field tests of general rela- freely falling coordinate system which co-moves with the tivity. Even though the strong-field nature of the bodies is body, whose spacetime metric takes the Minkowski form at effaced in GR, it is not in other theories, thus any result in its outer boundary (ignoring tidal effects generated by the agreement with the predictions of GR constitutes a kind of companion). There is thus no evidence of the presence of “null” test of strong-field gravity. the companion body, and the structure of the chosen body can be obtained using the field equations of GR in this coordinate system. Far from the chosen body, the metric is 2 MOTION AND GRAVITATIONAL characterized by the mass and angular momentum (assum- RADIATION IN GENERAL RELATIVITY ing that one ignores quadrupole and higher multipole moments of the body) as measured far from the body using 2.1 Introduction orbiting test particles and gyroscopes. These asymptotically measured quantities are oblivious to the body’s internal The motion of bodies and the generation of gravitational structure. A black hole of mass m and a planet of mass m radiation are long-standing problems that date back to the would produce identical spacetimes in this outer region. first years following the publication of GR, when Einstein The geometry of this region surrounding the one body calculated the gravitational radiation emitted by a labora- must be matched to the geometry provided by the compan- tory-scale object using the linearized version of GR, and de ion body. Einstein’s equations provide consistency condi- Sitter calculated N-body equations of motion for bodies in tions for this matching that yield constraints on the motion the 1PN approximation to GR. It has at times been contro- of the bodies. These are the equations of motion. As a result versial, with disputes over such issues as whether Einstein’s the motion of two planets of mass and angular momentum equations alone imply equations of motion for bodies m1, m2, J1 and J2 is identical to that of two black holes of (Einstein, Infeld, and Hoffman demonstrated explicitly that the same mass and angular momentum (again, ignoring they do, using a matching procedure similar to the one tidal effects). described in Section 1.2), whether gravitational waves are This effacement does not occur in an alternative gravi- real or are artifacts of general covariance (Einstein waffled; tional theory like scalarÐtensor gravity. There, in addition Bondi and colleagues proved their reality rigorously in the to the spacetime metric, a scalar field is generated by 1950s), or even over algebraic errors (Einstein erred by a the masses of the bodies, and controls the local value of the factor of two in his first radiation calculation; Eddington gravitational coupling constant (i.e., G is a function of ). found the mistake). Shortly after the discovery of the binary Now, in the local frame surrounding one of the bodies in pulsar PSR 1913 16 in 1974, questions were raised about our binary system, while the metric can still be made the foundations of the “quadrupole formula” for gravita- Minkowskian far away, the scalar field will take on a value tional radiation damping (and in some quarters, even about 0 determined by the companion body. This can affect the its quantitative validity). These questions were answered in value of G inside the chosen body, alter its internal struc- part by theoretical work designed to shore up the founda- ture (specifically its gravitational binding energy), and tions of the quadrupole approximation, and in part (perhaps hence alter its mass. Effectively, each mass becomes several mostly) by the agreement between the predictions of the functions mA( ) of the value of the scalar field at its loca- quadrupole formula and the observed rate of damping of tion, and several distinct masses come into play; inertial the pulsar’s orbit (see Section 3.1). Damour (1987) gives mass, gravitational mass, “radiation” mass, and so on. The a thorough review of this subject. precise nature of the functions will depend on the body, The problem of motion and radiation has received specifically on its gravitational binding energy, and as a renewed interest since 1990, with proposals for the construc- result, the motion and gravitational radiation may depend tion of large-scale laser interferometric gravitational-wave 17
GUSTAV A. TAMMANN* The cosmological constants
The evolution of the large scale of the Universe is but one the subsequent results in ultraviolet spectroscopy (see chapter of cosmology, yet it is fundamental to all of its Chapter 13). But optical observations above the atmosphere aspects. The understanding of the large-scale evolution has were also of fundamental interest offering almost 10 times once been described as the “Search for two numbers” smaller seeing disks and consequently the detection of (Sandage 1970). The first of these numbers is the Hubble con- roughly 100 times fainter objects than from the ground. stant H0 which measures the present expansion rate. The sec- Individual stars in crowded regions would become accessi- ond number was thought to account for the gravitational ble and much greater detail would be visible in extended deceleration of the Universe. But since evidence has been objects, as first pointed out by Hermann Oberth in 1923. found for an additional acceleration, the original deceleration However, this enormous gain was offset in part by the size parameter has been split into a decelerating term m and an limitations of a telescope in space. Plans for what was to accelerating term . m is a measure of the gravitating become the 2.4 m HST for imaging and spectroscopy in the mean matter density of the Universe, while corresponds to optical and near UV developed during the 1960s and were the energy density of Einstein’s “cosmological constant” . finally approved by the US Congress in 1977. ESA joined The three quantities H0, m, and fix the expansion the venture with a 15% share, which turned out to be of age T of the Universe and determine also whether the great benefit to astronomy in Europe. The start of HST, Universe will eventually recollapse or expand forever. planned for 1983, was delayed by the Space Shuttle acci- In principle the values of H0, m, and can be mea- dent until 1990. Since then, and particularly since 1993, sured in the present Universe as well as the age t of the oldest that is, after the repair mission for the defective mirror, objects which Ð as an essential test Ð must be smaller than T. HST has not only brought spectacular progress in astro- Yet the cosmological constants are elusive. Even the nomical research, but has also stirred an enormous public concerted effort of many of the largest optical and radio tele- interest in astronomy. scopes has not led to their satisfactory determination. As a When plans were made in 1979 (Macchetto et al. 1979; consequence observations from space were added wherever Longair and Warner 1979) for future research with HST, possible. In fact the determination of the cosmological con- the author was ambivalent about its benefits for the deter- stants was a driving force for some astronomical satellites. mination of H0, being convinced that the main uncertainties Several satellites will be mentioned in the following of the time were due to statistical problems (Malmquist which have contributed to our dawning understanding of bias; Section 1.3). He was wrong by not foreseeing that the cosmological parameters, but none has been as impor- standard supernovae of type Ia are so powerful standard tant as the Hubble Space Telescope (HST). candles that they can beat all statistical selection effects, The initial motivation for astronomers to go into space and that HST can provide Cepheid distances to their nearest were the wavelengths inaccessible from the ground. The representatives and thus calibrate their true luminosity. It reward was overwhelming with the detection of X-ray was clear, however, already in 1979 that the same super- and -ray sources in the early 1970s (see Chapter 12) and novae can be observed with HST at cosmologically signifi- cant redshifts and that this is the route to determine the * Universität Basel, Switzerland cosmological parameters q0 (or m) and .
Gustav A. Tammann, The Century of Space Science, 373Ð397 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 374 GUSTAV A. TAMMANN
1 THE HUBBLE CONSTANT H0 poses no problem. The crux, however, is introduced by pecu- liar motions which are superposed on the regular Hubble Since the Big Bang space has expanded. During the early flow. They are caused by the gravitational pull of galaxies stages the expansion was governed by complex processes, and clusters of galaxies and of any additional dark matter. For but after an inflationary period the Universe turned into a instance our Local Group of galaxies is a sufficient density constant growth rate carrying with it all matter and the enhancement to have turned its original expansion into a con- emerging galaxies. As a consequence any observer in the traction. Our nearest neighbor, the Andromeda galaxy, M31, Universe has the impression that all galaxies are being car- has thus a blueshift and approaches us at about 100 km s 1. ried away, and the faster they move the larger the distance. The largest peculiar velocity we partake of is a 630 km s 1 Yet strictly speaking the distant galaxies do not recede with velocity which manifests itself as a dipole anisotropy of the any velocity, but space between the observer and the galaxy cosmic microwave background (CMB, see below). is stretched. In spite of this the spectra of nearby galaxies are The peculiar velocities cause a dilemma for the determi- interpreted as if they had recession velocities. This is permis- nation of H0. On the one hand one wants to determine it sible here because the determination of the present expan- locally to find the present value H0; on the other hand one sion rate, that is, the Hubble constant H0, must be carried out must determine it at sufficiently large distances, where the at small distances where the look-back times are short. recession velocities are significantly larger than any possible If light travels through an expanding space its waves will peculiar motions. The best compromise is to determine H0 be stretched and when it arrives at the observer it will between recession velocities of 10 000 and 30 000 km s 1. appear redder than at the time of emission. Also the spectral But the determination of the corresponding distances poses lines of the galaxies will be shifted to the red. These lines formidable problems. can be ascribed to certain neutral or ionized elements and With this requirement in mind a new strategy has devel- each appears in the laboratory at a fixed wavelength 0. oped to determine H0. The idea is to map the Hubble flow If that particular line appears in the galaxy spectrum at out to the necessary distances by means of relative dis- wavelength one defines a redshift z of the galaxy as tances of a set of galaxies, for instance from “standard can- z ( 0)/ 0. The redshift is determined by the stretch dles” of constant luminosity, and then to determine the true factor the Universe has experienced during the light travel distances of one or a few nearby galaxies of the same set. time, that is, z (R0/Re) 1, where R0 is the present radius The latter step has been greatly facilitated by HST. The of the Universe (more exactly its radius of curvature) and route is described below. Re is the radius at the time of emission. As long as z is small, say z 0.2, the redshift can also be interpreted with 1.1 The Hubble diagram of standard candles sufficient accuracy as a Doppler effect. The recession velocity v is in that case simply given by v cz, where c is The expansion of the Universe implies that the recession the velocity of light (in km s 1). velocities v increase linearly with the photometric distance. To pin down the expansion rate of the Universe one Astronomers measure, in general, photometric distances in introduces the Hubble constant H0 which is defined as terms of the distance modulus (m M), which is defined as (m M) 5log rMpc 25, where m is the apparent magni- ϵ . ϽϽ H0 R0 /R0 v/r0 (v c) (1) tude of an object and M its absolute magnitude, that is, the apparent magnitude the object would have if seen from a where v is the “recession velocity” of a galaxy and r0 is its distance of 10 pc. Objects with (nearly) constant absolute 1 1 distance (in Mpc). Thus the units of H0 are km s Mpc magnitude are called “standard candles.” The apparent (the units are not repeated in the following), but its dimen- magnitudes m of standard candles are a measure of their sion is simply (time) 1. relative distances, or more exactly of their log r values. It should be noted that H0 is constant in the sense that it A plot of log v υ. the apparent magnitude m of extra- is the same anywhere in the present Universe, but it has a galactic standard candles constitutes the so-called “Hubble different value at other times. This is obvious because the diagram.” The standard candles concentrate in a linearly distance r is constantly increased by the expansion of the expanding Universe along a straight line of slope 0.2. Sys- Universe and consequently the Hubble parameter H must tematic deviations from this line beyond 30 000 km s 1 decrease with time. are caused by the deceleration or acceleration of the Universe Equation (1) gives the false impression that the determi- (provided that the apparent magnitudes are corrected for the nation of the Hubble constant H0 is simple. One just deter- z-dependent photometric K-term). mines the redshift z of the spectrum of any given galaxy, and Since Hubble and Humason (1931), the Hubble diagram hence its recession velocity v, and divides the latter by the has provided the fundamental evidence for the linear expan- galaxy’s distance. Indeed the determination of the redshift sion of the Universe. The ever-increasing persuasive power THE COSMOLOGICAL CONSTANTS 375 of the Hubble diagram from Humason et al. (1956) to conclusions follow from this. (1) If the total observed Sandage (1966) and Kristian et al. (1978), who used first- scatter of the Hubble diagrams is read vertically as an ranked cluster galaxies as standard candles, has shattered effect of peculiar motions, a generous upper limit is set of all attempts to interpret redshifts as a non-cosmological v/v 0.05, which holds for the range of 3500 v effect. 30 000 km s 1. The (all-sky) distance-dependent variation
1.1.1 The Hubble diagram of supernovae of type Ia 12 14 16 18 20 Following an early suggestion of Kowal (1968) it has 4.5 become increasingly clear that supernovae of type Ia B (SNe Ia) are the most powerful standard candles to date. This can be physically rationalized because a SN Ia out- 4.0 burst is caused each time a white dwarf is pushed over its
Chandrasekhar limit by a mass-shedding companion. log v For 35 SNe Ia in the distance range 1200 v 30 000 3.5 km s 1 good B and V magnitudes at maximum light are known, for 29 of them also Imax magnitudes. The SNe Ia, listed by Parodi et al. (2000), are selected to have Bmax 3.0 Vmax 0.06, after correction for galactic reddening to guard against absorption within their parent galaxies and to exclude a few intrinsicly red objects with peculiar spectra. 4.5 Two blue SNe Ia with peculiar spectra at early phases V (SN 1991 T, 1995 ac) are also excluded. The selected objects of the sample all have normal spectra as far as is 4.0 known (Branch et al. 1993).
The 35 SNe Ia define Hubble diagrams in B, V, and I log v m with a very small scatter of M 0 .2. This means that SNe 3.5 Ia are standard candles to better than 20% of their lumi- nosity. In spite of this the residuals about the Hubble line correlate with second parameters. The SNe Ia with slow 3.0 decline rates m15 are somewhat brighter, where m15 is the magnitude decline during the first 15 days after B maxi- mum. The relation between magnitude residual and decline 4.5 I rate is roughly m 0.45 m15. Also the intrinsically bluer SNe Ia are somewhat brighter than their redder coun- terparts by about m p (B V), where p is 4.0 1.2Ð2.5 for I, V, and B magnitudes, respectively (for the exact relations, see Parodi et al. 2000). Corrected magni- log v corr tudes mB,V,I are obtained by reducing all SNe Ia to the 3.5 mean decline rate m15 1.2 and to the mean color B V 0m.01. corr The corrected magnitudes mB,V,I define Hubble diagrams 3.0 as shown in Figure 1. Their tightness is astounding. The 12 14 16 18 20 Cerro Tololo collaboration, to whom one owes 70% of the mcorr(max) photometry of the fiducial sample, quote a mean observa- tional error of their m values of 0m.10 and of their col- Figure 1 The Hubble diagrams in B, V (and I) for the 35 (29) max corr m SNe Ia of the fiducial sample with magnitudes m , that is, ors (B V) of 0 .05 (Hamuy et al. 1996). This alone would suffice to explain the observed scatter of 0m.12 corrected for decline rate m15 and color (B V ). Circles are m SNe Ia in spirals, squares in E/S0 galaxies. Small symbols are 0m.13. An additional error source are the corrections for SNe Ia whose observations begin eight days after B maximum galactic absorption which were adopted from Schlegel et al. or later. Solid lines are fits to the data assuming a flat universe (1998). In fact, if one excludes the nine SNe Ia with large with m 0.3 and 0.7; dashed lines are linear fits with a m galactic absorption corrections (AV 0 .2) the scatter forced slope of 0.2 (corresponding approximately to m 1.0 m decreases to 0 .11 in all three colors. Two important and 0.0). (Parodi et al. 2000.) 376 GUSTAV A. TAMMANN
of H0 must be even smaller. (2) If, however, the scatter is Cepheids are the most uncontroversial distance indica- read horizontally and if allowance is made for the observa- tors in extragalactic astronomy. Their pulsational period is a tional errors of the apparent magnitudes (and for any pecu- function of their mass which in turn correlates with the liar motions) one must conclude that the luminosity scatter luminosity. The corresponding log PÐabsolute magnitude M of blue SNe Ia, once they are homogenized in m15 and relation was discovered in 1912 by Henrietta Leavitt who color, is smaller than can be measured at present. In other studied the many Cepheids in the Large Magellanic Cloud words, they are extremely powerful standard candles. (LMC) which all appear at the same distance. The slope of Linear regressions, which correspond closely to a the log PÐM relation for different passbands is today well Universe with m 1, 0, to the data in Figure 1 determined from the LMC Cepheids. Remaining uncertain- yield ties concern the zeropoint of the relation and the effect of differences of the metallicity of the Cepheids in different corr log v 0.2m c (2) galaxies. All authors agree that the metallicity effect is small, but they do not agree on the sign of the correspond- m with cB 0.676 0.004, cV 0.673 0.004, and ing correction. Therefore an uncertainty of 0 .10 will be cI 0.616 0.004. included into the error budget. It is easy to transform eqns (1) and (2), remembering At present it is customary to fix the zeropoint by adopt- (m M) 5log rMpc 25, into ing (m M)LMC 18.50. There is growing evidence that this value should be increased by 0m.05Ð0m.10. Based on log H0 0.2M c 5 (3) observations with the International Ultraviolet Explorer (IUE) and HST, Panagia (1998) has derived a geometrical where M is the absolute magnitude of SNe Ia in the appro- distance of (m M)LMC 18.58 0.05 from the fluorescent priate passband. Thus the problem of determining the ring of SN 1987A, a peculiar type II supernova in the LMC. large-scale value of H0 is reduced, with c now being Also the calibration of galactic Cepheids with the astromet- known, to measuring the absolute magnitude M of one (or ric satellite HIPPARCOS tends to give a somewhat larger a few) nearby SNe Ia. It must be stressed that the recession distance to the LMC (Madore and Freedman 1998; Feast velocity of the nearby calibrator(s) does not enter in any 1999; Pont 1999; Groenewegen and Oudmaijer 2000) in way. The solution for H0 is therefore independent of any agreement with RR Lyr stars and other distance indicators peculiar motions. (Walker 1999; Gratton 2000; Sakai et al. 2000; Romaniello et al. 2000). Smaller LMC moduli based on red-clump stars and RR 1.1.2 Cepheids, HST, and the luminosity calibration Lyr luminosities, calibrated through statistical parallaxes, of SNe Ia have comparatively low weight. Why two eclipsing binaries With the luminosity calibration of a SNe Ia in mind, a small in the LMC give an unacceptably small modulus of 18.3, team was formed to observe with HST the Cepheids in including HST observations (Fitzpatrick et al. 2000), is not IC 4182, home of the well observed SN 1937C. The understood at present. Cepheids would give a good distance to that inconspicuous DIVA, the planned next-generation astronomic satellite, galaxy and hence fix the absolute magnitude M in the B and is expected to measure very accurate parallaxes of 30Ð40 V band. Through eqn (3) this would lead to a high-weight galactic Cepheids. This should settle for good the question determination of H0. of the zeropoint. ESA’s very ambitious GAIA project would The team consisted of Allan Sandage, Abhijit Saha, in addition measure the parallaxes of many supergiants and Lukas Labhardt, Duccio Macchetto, Nino Panagia, and the thus make a frontal attack on the distance of the LMC. author. Their application for HST observing time was For the moment we will retain the customary zeropoint received with skepticism. Was the project not too easy for with (m M)LMC 18.50 and return to the question in HST? Could it not be executed from the ground? It was a Section 1.1.3. happy incidence for the team that the mirror of the most The HST distance of the Cepheids in IC 4182 provided complex and most expensive telescope ever build produced MB,V of SN 1937C which, inserted into eqn (3), gave a rather miserable images when it was finally launched in 1990. IC low value of H0 52 9 (Saha et al. 1994). Essentially the 4182 became an ideal target. In fact the photometry of the same IC 4182 distance had been derived before from the extended stellar images on the CCD frames was easier than ground from its brightest stars (Sandage and Tammann 1982), later, after the repair mission in 1993, when the crisp stellar but the result had been criticized. Now, with the HST result images became smaller than the pixel size of the WFPC-2 the light curve of SN 1937C from archival material came camera, but, of course, offering higher resolution essential under fire. The obvious step was to calibrate other nearby for more distant galaxies. SNe Ia, which became feasible after the repair of HST. 18
K.M. GÓRSKI* AND A.J. BANDAY** COBE, dark matter and large-scale structure in the Universe
PREAMBLE 1 LARGE-SCALE STRUCTURE OF THE UNIVERSE BEFORE COBE NASA’s first dedicated cosmological space mission, the Cosmic Background Explorer (COBE), has provided com- Following closely upon the launch of the COBE satellite prehensive full-sky observations of both microwave and far- in November 1989, the January 1990 issue of “Scientific infrared frequencies. Each of the three instruments onboard Ð American” presented an article “The Cosmic Background the Far Infrared Absolute Spectrometer (FIRAS), the Explorer”, which announced the following: “NASA’s cos- Differential Microwave Radiometers (DMR), and the Diffuse mological satellite will observe a radiative relic of the Big Infrared Background Explorer (DIRBE) Ð has contributed Bang. The resulting wealth of data will be scoured for clues significantly to our fundamental understanding of the uni- to the evolution of structure in the universe”. In this article verse. FIRAS has measured the Planckian nature of the relic we describe how the outcome of this remarkable mission blackbody radiation to unprecedented accuracy, thus estab- did indeed provide important scientific results which helped lishing stringent constraints on the thermal history of the in a very major way to improve our understanding of the universe. DMR has discovered full sky structure in the tem- universe. However, to start our story at the beginning, let us perature of the blackbody radiation over a range of angular first recall what was the status of our knowledge about the scales down to 10 degrees, thus establishing stringent lim- universe before COBE, and why COBE was needed for its on the evolution of large-scale structure in the universe subsequent exploration of the universe. and providing us with our first glimpse of the initial condi- It is very well known that our modern understanding tions for structure formation. DIRBE has provided the first of the universe as a physical system rests on three major evidence for the existence of a cosmic infrared background, pillars of observational cosmology: thus providing important constraints on the integrated cos- mological history of star formation in various pregalactic ¥ Universal Expansion: On cosmological distance scales, objects, protogalaxies, and galaxies, and the subsequent astronomical objects recede from one another with veloci- conversion of starlight into infrared emission by dust. ties proportional to their separations Ð a remarkable dis- Thus COBE has become a tour de force of modern cos- covery by Hubble in the 1920s. As we now know, the mological studies; it has influenced physical cosmology universe on the largest scales is very nearly homogeneous with its results dramatically, and established a splendid and expands isotropically. legacy for the community with its unparalleled view and ¥ Primordial Nucleosynthesis: The observed universal interpretation of the multiwavelength sky. abundances of light elements proved inconsistent with the idea that they were produced in stars. This puzzle was * European Southern Observatory, Garching bei München, Germany; Warsaw University Observatory, Warszawa, Poland explained in the 1940s by Gamow, Alpher, and Bethe in ** Max-Planck-Institut für Astrophysik, Garching bei München, Germany the context of an initially hot and dense phase in the early
K. M. Górski and A. J. Banday, The Century of Space Science, 399Ð421 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 400 K.M. GÓRSKI AND A.J. BANDAY
evolution of the expanding universe. This necessarily galaxies matured sufficiently to put strong constraints on required that a residual relic thermal radiation permeated theories of galaxy and large-scale structure formation. the universe, and its temperature was theoretically ¥ Dark matter in the universe: Ever since Zwicky’s realisa- expected to be 4Ð5K. tion in the 1930s that clusters of galaxies consisted predomi- ¥ Microwave Background Radiation: The relic thermal nantly of matter in some nonluminous form, the “missing radiation of temperature 3 K was serendipitously dis- mass” or later “dark matter” problem was one of the most covered by Penzias and Wilson (1965) and immediately serious puzzles in astronomy. The following observational recognized by Dicke et al. (1965) as the “missing link” picture was built over time: astronomical objects of increasing between the primordial fire-ball of the young Big Bang size (galaxies, groups, clusters, superclusters) appear to con- universe and its present day mature phase dominated by tain more and more mass that does not manifest itself via evolved astronomical objects. This discovery delivered a luminosity, but can be detected due to its gravitational effects. direct proof of the dense and hot early stage of evolution The amount of this hidden mass is large Ð perhaps about of the universe. 20Ð30% of what is required to render the universe spatially flat. This fraction is sufficiently large that (1) it clearly These three phenomenological ingredients laid a solid exceeds the census of baryons in the universe, i.e. the dark foundation for the Hot Big Bang model, which became matter in known astronomical objects is unlikely to be com- nearly unanimously accepted as the framework for our prised entirely of ordinary matter, and (2) it invites specula- understanding of the universe at large. The Cosmic tion that perhaps the universe indeed is spatially flat, and Microwave Background (CMB) radiation itself has subse- therefore that the required dark matter content for the whole quently become the subject of increasingly vigorous theo- universe must still be larger than for individual astronomical retical and observational studies. It was recognised very objects. This finding tied in with a rich supply of theoretical rapidly following its discovery that the CMB should con- candidates for weakly interacting massive particles, which tain “fossilized” imprints of the processes which could, or could dominate the matter content of the universe, and the even had to, occur in the early universe, and that such sig- idea that perhaps the dynamically detected matter (compris- natures of past events should be pristine in comparison with ing both luminous, baryonic objects and dark material) is the clues provided to us through the studies of nearby, well more clumped than the remaining more smoothly distributed evolved astronomical objects. Hence, both the nature of the dark matter (the so called biasing effect). An alternate specu- CMB electromagnetic spectrum, and its possible deviations lation involves the idea of a cosmological constant, or vacuum from a perfectly thermal (Planckian) form, and the CMB energy density, which was originally introduced by Einstein, anisotropy, or dependence of its temperature on the direc- and thereafter enjoyed various degrees of popularity with tion of observation on the sky, very quickly became attrac- astronomers trying to determine the global properties of the tive theoretical and observational research targets, which universe. Indeed, the currently available combination of CMB were pursued relentlessly up until the COBE mission, and, anisotropy, large scale structure, and high-redshift supernovae indeed, even more so afterward. observations can be interpreted as supportive of these cosmo- It is beyond the scope of this contribution to review logical constant dominated models of the universe. systematically the steady progress of observational and theoretical cosmology from the discovery of the CMB until ¥ CMB phenomenology: Immediately after the discovery the COBE mission, but let us try to mention the essential of the CMB radiation, it was realised that the background developments. radiation should be carefully measured to search for 1. any deviations of its electromagnetic spectrum from ¥ Large-scale structure of the galaxy distribution: It has thermal, and been assessed that galaxies not only form groups and clus- 2. the expected deviations from isotropy in the angular dis- ters, but their tendency to aggregate in space extends to tribution of its temperature on the sky. even larger scales of superclusters (separated by voids), and perhaps beyond. For a long time the extreme scales of Both effects, if found, would provide invaluable clues to our detectable inhomogeneity in the 3-D galaxy distribution understanding of physical processes occurring in the early coincided with the largest scales surveyed, which rendered universe. It would be the case regarding spectral distortions a determination of the scale of transition to the expected because only significant energy releases (e.g. bulk annihilation homogeneity of the universe at large somewhat elusive. A of exotic particles at some epoch) at very high redshift could rigorous quantitative description of the galaxy distribution measurably perturb the Planckian spectrum of the CMB. It in space has been developed, including number counts, cor- would also be the case regarding CMB anisotropy because it relation functions, and power spectra. During the 1980s, could reveal to us the early predecessors of presently observ- astronomical measurements of the spatial distribution of able structures in the universe, as outlined below. COBE, DARK MATTER AND LARGE-SCALE STRUCTURE IN THE UNIVERSE 401
It was well understood that the primary dynamical factor amplitude of 0.1% of the mean, superposed on the that must have driven the evolution of structure in the uni- isotropic background (Smoot et al. 1977). This effect, how- verse was gravitational instability. To understand how the ever, was expected and explained as being dominated by presently existing structure in the universe has arisen, it is our own motion (i.e. a combination of motions of the solar necessary to postulate the existence of perturbations in system in the Galaxy in the Local Group) with respect to the primordial spatial distribution of matter, which over the rest frame defined by the CMB radiation, and, hence, a time were amplified by their self-gravity. The logical local rather than cosmological effect. Despite this apparent appeal of the proposition to use the expected minute CMB lack of a tangible result, there was ongoing and steady anisotropies to study directly the matter distribution in the progress stimulated by the increasingly stringent limits universe as it just emerged from its embryonic stage was on theoretical predictions of expected CMB anisotropy built upon the following considerations: depending on particular scenarios of evolution of structure in the universe. In fact, the search for evidence of 1. the universe is opaque beyond redshift 1000, thus one anisotropy in the CMB by the experimental community may think of CMB photons as having been emitted from the became somewhat akin to the quest for the Holy Grail. last scattering surface at the epoch corresponding to such a redshift (typically about a few hundred thousand years after ¥ Mainstream theoretical cosmology in the 1980s: The the Big Bang); hence the CMB photons, which are influ- rapid development of theoretical and observational cosmol- enced very little by the nearly transparent universe at red- ogy after the discovery of the CMB radiation resulted in shifts 1000, bring to us precious information about the very impressive support for the Hot Big Bang model as a conditions as far away and early on in the universe as we basic paradigm for understanding the universe at large. can ever probe with electromagnetic radiation; However, there were still some nagging paradoxes left 2. gravitational instability in an expanding universe is a which could not be answered simply within the framework slow process (described by power law functions of time, of the model. Among those were the following: rather than the familiar Jeans exponentials in a non-expand- ing medium), hence the amplitudes of large scale inhomo- 1. overall homogeneity and isotropy of the universe Ð why geneities observed today must have then been small but not are the very distant regions in the universe, which were never negligible; in causal contact (i.e. could not have interacted at any time 3. the temperature of the CMB radiation photons when during the history of the standard Hot Big Bang universe), so they were emitted at the epoch of last scattering was physi- similar as suggested by the apparent isotropy of the CMB? cally related to the perturbations of the matter distribution 2. flatness of the observed universe Ð why is the average via both the gravitational potential of inhomogeneities (the density of the universe so close (within a factor of a few) to so called Sachs-Wolfe effect), the emitting plasma velocity the critical density, which makes the universe spatially flat, (or the Doppler effect), and the thermal effects in gravita- just right now, when we (the human race) exist? tionally compressed or rarified plasma. Thus one is led to 3. where did the initial perturbations which seeded struc- conclude that the physical conditions at the time of the last ture formation come from? scattering of the CMB photons are necessarily “imprinted” on a sky map of CMB temperature anisotropy. All these questions could be by-passed with a reference to With such understanding firmly established, all that still the initial conditions for the evolution of the universe. This, remained to be done was to measure these CMB tempera- however, satisfied practically no-one, so when the idea of ture fluctuations. inflation appeared in the early 1980s, it was rapidly Surely, the excitement related to the opening of a win- embraced as a compelling explanation for the shortfalls of dow in the background radiation through which direct the standard cosmological model. Inflation postulates that observations of the truly embryonic stages of evolution of the evolution of the early universe is driven by a scalar the large scale structure of the universe could be attempted field, called the inflaton, which dominates the energy den- was missed by neither experimentalists nor theoreticians sity. Random regions, which get trapped in a state of false in cosmology. But the required measurements proved to be vacuum (or a local minimum of the potential energy) of the an incredibly hard task, as we can only now appreciate inflaton, end up expanding very rapidly due to an effec- in hindsight. A brief, and somewhat unjust, summary of tively negative pressure (hence the term inflation). A typical the experimental CMB anisotropy efforts that were con- inflating region ends up so big that the currently observed ducted for more than two decades up until the COBE mis- astronomical universe would be just a small fraction sion is simple: there were no detections of cosmological thereof. Hence, the explanation of homogeneity and anisotropy, only upper limits. The only exception was the isotropy of the universe that we see. Near flatness is measurement of the CMB dipole temperature pattern, at an explained because, again, our astronomical universe would 402 K.M. GÓRSKI AND A.J. BANDAY
be just a small sector of an original arbitrarily curved mani- To achieve the full benefit of space observations, the goal fold stretched to enormously large size. Density perturba- of the mission and instrument design was to ensure that tions which seeded the presently observed structures were the scientific measurements conducted by COBE were generated due to zero-point quantum fluctuations of the ultimately limited by the ability to model the various inflaton. Even though inflation is not a verified theory astrophysical components and distinguish them from the its logical appeal is huge, since it is apparently solves a cosmological information sought. This goal thus drove the number of seemingly unrelated paradoxes at the price of mission strategy, spacecraft and operations design and introducing just one new puzzle Ð what is the physical choice of instruments. nature of the inflaton? 2.1 Satellite overview It should be now clear, even with this rather sketchy description of the development of experimental and theoret- The need to minimise, control and measure the impact ical cosmology after the discovery of the CMB, that there of systematic errors led to the requirements for an all-sky were important questions about the universe which could survey, a minimal survey period of 6 months, and con- be answered with accurate measurements of the attributes straints on the amount of interference from local sources of of the relic radiation. A hard-learned lesson from early radiation such as the Earth, Sun, Moon and radio emission ground-based and balloon-borne experiments was of the from the ground. The orbit, spacecraft attitude, and instru- enormous difficulties involved with attempts to perform ment enclosures were therefore carefully selected to avoid such measurements successfully, and that the better approach direct exposure to the Earth and Sun, and maintain a stable would be a well-planned experiment in space. An opportu- thermal environment for the instruments. For COBE, nity to realise such a project was provided by NASA in depicted in Figure 1, a 900 km altitude, Sun-synchronous 1976, when it selected the Cosmic Background Explorer, orbit was selected so that the orbit, with a duration of 103 COBE, for a design study. minutes, precessed by 1 degree per day. In order to meet the scientific requirements of the mission (as will be discussed below), the spacecraft must also spin. A rate of 0.8 rpm was 2 THE COBE PROJECT adopted, with the spin axis tilted back from the orbital direction by 96 degrees, so that residual atmosphere did not Perhaps the first issue to address is the simple question, affect the instruments. The attitude of the spacecraft was why a satellite? The most succinct response to this can be controlled by inertia wheels and electromagnets, and deter- found in the proceedings of a meeting held in Copenhagen mined from Sun sensors, Earth sensors and gyroscopes. from June 25Ð29, 1979. “The Universe at Large Redshifts” contained a brief paper by Ray Weiss designed to serve as 2.2 Instrumental design an introduction to a more complete review of the COBE mission (Mather and Kelsall 1980). Nevertheless, the most Three complementary experiments were flown on the relevant issues for the reader to understand are indeed COBE spacecraft: the Far-Infrared Absolute Spectrometer recorded there. Based on the integrated experiences of the (FIRAS) designed to measure the frequency spectrum of COBE team members with ground-based, balloon-borne sky radiation from 100 microns to 1 cm (see Figure 2), the and air-borne experiments to measure the background radi- Diffuse Infrared Background Explorer (DIRBE) to map the ation, and more importantly the limitations of these plat- sky from 1 to 300 microns, and the Differential Microwave forms, the arguments in favour of a space-borne experiment Radiometers (DMR) to search for anisotropies in the CMB were then, and remain today, as follows: on scales larger than 7 degrees. Due to the authors’ involvement with the analysis and 1. Freedom from atmospheric emission and fluctuations in interpretation of the COBE-DMR data, and the aim of this that emission. contribution being the assessment of the impact of COBE’s 2. Full sky coverage with a given single instrument. results on our understanding of the large-scale structure of 3. A benign and controlled thermal environment to reduce the universe, the remaining text is focused on the DMR systematic errors. instrument and the measurements of CMB anisotropy con- 4. The ability to perform absolute primary calibration in ducted with this instrument. Those readers interested specif- flight without the necessity of windows to avoid conden- ically in the results of DIRBE and FIRAS are referred to sation of the atmosphere on calibrators and instruments. NASA GSFC www page http://space.gsfc.nasa.gov/astro/ 5. Sufficient time both to peform tests for systematic errors cobe/ and references therein. and to gain the increase in sensitivity permitted by A differential radiometer is a device which outputs a extended observation time. voltage proportional to the difference in power received by 19
HUBERT REEVES* The origin of the light elements in the early Universe
Shortly after World War II, George Gamov and his corresponding steps in the chain could be achieved, so the collaborators (Alpher et al. 1948) considered the possibility scheme was largely abandoned. that all chemical elements might have been generated by a Around the same time, Fred Hoyle and his collaborators succession of nucleon capture reactions in the cooling pri- (Burbidge et al. 1957) proposed a stellar origin for the mordial Universe. The successful prediction of the exis- chemical elements. This scheme turned out to be highly suc- tence of a fossil radiation left over from the earliest times of cessful in accounting for the chemical elements from carbon the Universe (Gamov 1946) emerged from the hypothesized to uranium (12 A 238). However, it ran into problems formation of helium in the big bang. There are two steps to with the lighter elements. Because of their small electric this argument: (1) in order to have nuclear reactions, the charges, these nuclides rapidly undergo self-destroying temperature must have been in the MeV range (1010 K), and nuclear reactions at the high temperatures of stellar interiors. (2) given the fact that a free neutron decays in approximately Other formation mechanisms were clearly needed to 15 minutes, the transformation of some but not all of the pri- account for their presence and abundances in the cosmos. mordial nucleons into helium requires the mean free path for One requirement for the selection of the formation mecha- neutron capture (and hence the cosmic density) to lie in an nism of these fragile, light nuclides is that, after their forma- appropriate range. At too low a density, the Universe would tion by appropriate nuclear phenomena, they should be be of pure hydrogen; at too high a density it would be of pure rapidly evacuated into low-temperature regions. Two different helium. It is, in fact, about one-quarter helium and three- physical processes meet this requirement: (1) BBN, where quarters hydrogen (in fractional mass). By combining these the rapid cooling of the universe preserves the newly formed numbers, Gamov and his colleagues obtained the first esti- nuclei from further thermonuclear reactions; and (2) galactic mate of the ratio of nucleons to photons (of the order of cosmic-ray (GCR) bombardment of interstellar atoms 10 10). From there, they predicted the existence of a fossil (mostly C, N and O) ejecting spallation residues (mostly Li, radiation in the millimetric range (CMB, cosmic microwave Be and B) into the cold interstellar medium (ISM). background) and estimated its present temperature (a few kelvin). The theory of big bang nucleosynthesis (BBN) is essentially based on these arguments. However, the hypothesis of a primordial origin of all the THE CASE FOR BIG BANG NUCLEOSYNTHESIS chemical elements soon ran into major difficulties. In the proposed chain of successive nucleon capture, extending The best evidence in favour of the BBN of the ligther from hydrogen all the way to uranium, the nuclear instabil- nuclides can be seen from Figures 1Ð3. Figure 1 shows the ity of aggregates containing five and eight nucleons cons- abundance of 4He as a function of the time-integrated stel- tituted a fatal flaw: it could not be seen how the lar activity, represented by the oxygen abundance in various galaxies. Although an increase in helium from approxi- * Centre National de Recherche Scientifique, Paris, France mately 20 to 30% (fractional mass) is clearly visible,
Hubert Reeves, The Century of Space Science, 423Ð440 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 424 HUBERT REEVES
0.30
0.28
0.26
0.24 He Fractional mass abundance He Fractional 4 0.22
0.20
0.18 0 50 100 150 200 250 O / H x 106
Figure 1 4He fractional mass abundances as a function of 16O in chemically unevolved galaxies (after Pagel et al. 1992).
manifesting a stellar contribution, no celestial objects are Another problem is the untangling, for each nucleus, of known to have less than 20% helium (10 times less helium the relative contribution of different production mecha- than hydrogen, in numbers of atoms). nisms: BBN, GCR spallation and stellar nucleosynthesis. A similar situation is found for 7Li. As we go backward This is most important for the isotope 7Li, for which the in time Ð by observing progressively older stars, represent- three mechanisms all contribute. For this reason, the ing the state of the ISM at the moment of their formation Ð cosmic-ray spallation processes and their resulting abun- the 7Li to hydrogen ratio decreases progressively from 10 9 dances will be reviewed in some detail. to 10 10, where it levels off for all older stars (Figure 2). In the same fashion, the deuterium to hydrogen ratio appears to increase (Figure 3) with decreasing time. These observations suggest that these light nuclides NUCLEAR PHYSICS OVERVIEW were already present in the primordial galactic gas. They point to a common origin: nuclear reactions in the hot big It is well known that many features of the universal abun- bang. Can this suggestion be made quantitative? Can the dance of chemical elements can be qualitatively understood observed abundances be accounted for in terms of detailed through a knowledge of their nuclear properties. The iron computations? This is the subject of this chapter. It will peak (at A 56) corresponds to the most stable nuclear con- appear that the answer is definitely yes, although some figurations. The secondary peaks correspond to nuclei with aspects are still obscure. so-called magic numbers of neutrons (at N 50, 82 and One important difficulty stems from the fact that the 128), and also to the light nuclei with an integral number of available observations (of galaxies, stars, interstellar gas, alpha particles (at A 12, 16, 20, 24 and 28). In this respect, planets) cannot be readily compared with the computed pri- it is most informative to begin our study of the origin of the mordial yields. Many factors have modified the abundances light elements with a brief review of their nuclear proper- of the nuclides between the big bang and the moment ties. Since the three elements lithium, beryllium and boron at which they are observed in the history of the cosmos were potentially formed in the big bang, we shall extend (e.g. the birth of the stars in which they are measured). this review up to A 12. Extrapolation from the observed data back to BBN has Two important factors play a crucial role in the nucleo- been a difficult task for many decades. synthesis of the light nuclei: their small electric charges and THE ORIGIN OF THE LIGHT ELEMENTS IN THE EARLY UNIVERSE 425
Ð8
8 (H+3He)/H
D/H
Li)/H Ð9 7 3He/H Li +
6 6 log( Ð9 ) –5
4 Ð11 Ratio ( ϫ 10
Ð9 2
Ð10 (H)]
n
(B) / Ð11
n 0
log[ Ð12 02468101214 Age (Gyr) Ð13 Deuterium evolution
3 Ð14 Figure 3 D and He abundances as functions of time from the big bang (14 Gyr ago) to the present. The value of 3He at 14 Gyr is not an observation but results from BBN calcula- Ð10 tions. The apparent variation of this nuclide at later times is well within the uncertainties (adapted from Geiss and Ð11 Gloeckler 1998). (H)]
n * Ð12 * 6 7 9 11
(Be) / comes Li at 2.0 MK, Li at 2.5 MK, Be at 3.5 MK, B at
n Ð13 5.0 MK and 10B at 5.3 MK. As a result, the light elements log[ (except 4He and, to a certain extent, 3He) cannot resist the Ð14 heat of stellar interiors. However, in view of the particle instability of 4Li and 5Li, the helium isotopes cannot be destroyed in nucleon-induced reactions, but only by Ð3 Ð2 Ð10 helium-induced reactions, resulting in an appreciably (Fe/H) higher Coulomb barrier energy and higher thermal resis- Figure 2 Li H, Be H and B H stellar abundances as a func- tance (Figure 4). tion of Fe H abundances. The logarithmic scale of the x-axis is The second important nuclear property is the large bind- in units of fractional solar iron abundance. For more data, see ing energy of 4He, due to the large pairing effect of nuclear Cunha and Smith (1999) (after Gilmore 1992). forces when the nucleons are paired four by four: neutronÐ proton; spin up, spin down. As a result, every nucleus in this mass range is quite unstable toward a rearrangement involv- ing 4He nuclei. No nucleus of mass 5 manages to be stable; the large binding energy of alpha particles two relative to the lifetimes are 10 21 s. The isotopes 8Li and 8B are beta- other nuclear arrangements. unstable with respect to 8Be, which quickly (10 16 s) breaks At BBN or stellar temperatures, the kinetic energy fac- into two alpha particles. tors are well below the Coulomb barrier. Coulomb penetra- The nuclear stability situation at a mass of 9 is deeply tion factors, proportional to the electric charge, govern the marked by the alpha stability. 9B is unstable against two destruction rate of the light nuclei by proton capture. alphas p (10 19 s), but 9Be barely escapes disintegration Because of its low Z and low nuclear stability, deuterium is (it has a very small binding energy). This weak stability of doubly fragile. It disappears at 0.5 million K (MK). Next 9Be is reflected in the fact that the endothermic Q values 426 HUBERT REEVES
corresponding to its formation by spallation reactions are 9 11 9 10 remarkably large, and the exothermic Q values correspond- –1 ing to its destruction are small. It is also reflected in the fact
6 that it has only one ‘bound’ state; all the excited states are 10 –2 unstable against particle break-up. These facts are instru- 7 mental in explaining the low natural abundance of 9Be (one 7 6 7 10 11 –3 11 of the lowest in nature). The isotopes Li, Li, B and B fare better but all remain comparatively fragile; in high- energy proton-induced reactions they all quickly break –4 p+D p+p+n down into alpha particles and other products. 6 4 10 The high binding energy of He is also responsible for ) –5 the fact that, at all but the lowest temperatures, hydrogen is
arn 4 3
(b D transformed rapidly into He. The nuclei D and He are
des –6 3 He+γ intermediate steps in this chain of reactions; very small p+D amounts of them remain at the end of the process. These og l –7 facts dominate the scenario of BBN and also of main 11 sequence stellar energy generation. –8 The influence of these nuclear properties on the forma- tion rate of the light elements is reflected in their formation 10 cross-sections of the spallation reactions resulting from the –9 bombardment of atoms of C and O by protons (Figure 5). D The link is best illustrated by phase-space arguments. In the –10 6 high-energy region, the break-up of the excited nuclei into a 9 given configuration is proportional to the number of possi- –11 ble channels, which is itself a function of the binding 7 energy, and also of the number of bound excited states for –12 the given configuration. Above 100 MeV or so, the cross- –2 –10 sections reach a plateau which is maintained all the way up log Ep (MeV) to the highest energies. As expected, the cross-sections for the formation of 9Be are the smallest, while for Li and B Figure 4 Total destruction cross-sections of light nuclides by isotopes they have higher values. protons in barns (10 24 cm 2) as a function of energy. The number close to each curve refers to the mass number of the corresponding nuclide (after Reeves 1974).
100 p + 16O p + 12C 11B
10B 11B 7Li 6 7Li Li 10B
6 10 Li 9Be
9Be Cross-section (mbarn)
1 10 MeV 100 MeV 1 GeV 10 GeV 100 GeV 10 MeV 100 MeV 1 GeV 10 GeV 100 GeV
Figure 5 Spallation cross-sections in millibarns (10 27 cm 2) for the formation of the Li, Be and B isotopes by proton-induced reactions on 12C and 16O (after Reeves 1974). 20
JEAN SURDEJ* AND JEAN-FRANÇOIS CLAESKENS* Gravitational lensing
In his theory of general relativity (GR), Einstein established 6. the size of absorbing intergalactic gas clouds, that a massive object curves spaceÐtime and that photons 7. upper limits on the density of a cosmological population move along geodesics of this curved space. Einstein’s of massive compact objects. prediction was confirmed during the solar eclipse of 1919. This not only provided a decisive, quantitative proof of the In this chapter we summarize some of the theoretical and validity of GR but also gave confirmation of the concept observational evidence supporting these claims and show that electromagnetic waves do undergo deflections in gravi- how space observations have significantly contributed in tational fields. some of the areas listed above. Atmospheric lensing effects may lead to a distortion or to The chapter is organized as follows. We first review the the formation of multiple images of a distant source, result- historical background of gravitational lensing in Section 1. ing in the perception of a ‘mirage’. They sometimes literally Unlike most other astrophysical discoveries made during deform our view of the surrounding areas. Similarly, gravita- the last century, the physics of gravitational lensing was tional lensing may perturb our view of the distant Universe understood well before the first example of a multiply- and affect our physical understanding of various classes of imaged extragalactic object was found. extragalactic objects. The great interest in gravitational lens- In Section 2 we describe the basic principles and concepts ing comes from the fact that this phenomenon can be used as of gravitational lensing. We first establish the exact form of an astrophysical and cosmological tool. Indeed, gravitational the lens equation. By making use of the Einstein deflection lensing may help in deriving: angle, we derive the expression for the angular diameter of an Einstein ring, which is also the typical angular separation 1. the distance scale of the Universe Ð via the determination between multiply-lensed images when the conditions of of the Hubble constant H0 Ð and possibly the values of perfect alignment between the observer, the lens and the other cosmological parameters (the cosmological density source are no longer fulfilled. We then set up a sufficient 0 and the cosmological constant 0), condition for an observer to see an Einstein ring, or multiple 2. the mass distribution M(r) of the lens, images, of a distant source that is located behind a deflector, 3. the extinction law in the deflector, usually located at and go on to derive, for the case of a point-mass lens model, high redshift, expressions for the expected image positions and their ampli- 4. the nature and distribution of luminous and dark matter fications. Using wavefront and ray tracing diagrams, we in the Universe, introduce the important concepts of caustics and time delays. 5. the size and structure of quasars, We then show how all the lensed image configurations observed in the Universe may be understood in terms of the relative location between the observer and the caustics * Université de Liège, Liège, Belgium associated with an asymmetric lens. Several outstanding
Jean Surdej and Jean-François Claeskens, The Century of Space Science, 441Ð469 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 442 JEAN SURDEJ AND JEAN-FRANÇOIS CLAESKENS
examples of gravitational lens systems observed with the a light ray passing near the solar limb should be deflected Hubble Space Telescope (HST) serve as illustrations. by an angle given by In Section 3, we discuss the most remarkable astrophysi- 2 cal and cosmological applications of gravitational lensing. ˆ 4GM/(c R) 1 (1) These include the independent determination of the Hubble constant H0 based upon the measurement of the time delay where G is the gravitational constant, c is the velocity of t between the observed light curves of multiply-imaged light and M and R are the mass and radius of the Sun (or of quasars. The possibility of weighing the mass of lensing any other compact lens), respectively. This deflection angle galaxies, galaxy clusters and intervening dark matter from turns out to be exactly twice the value derived by Soldner the observation of multiply-imaged distant sources, arcs and by Einstein himself in 1911; the factor of 2 merely and arclets is also reviewed. We further discuss the possibil- reflects the metric curvature. Note that in Newtonian terms ity of using gravitational lensing to determine the cosmolog- the Einstein deflection also follows if one assumes a refrac- 2 ical density 0 ( 8 G 0/3H0) and the reduced cosmological tive index n associated with the Newtonian gravitational 2 2 2 constant 0 ( c /3H0 ) of the Universe, the cosmological potential U (|U| c ) via the relation density of dark compact lenses, the size of intergalactic L 2 absorbing clouds, dust extinction laws in deflecting galaxies n 1 2U/c (2) and the structure and size of quasars. We also present the sta- The analogy between atmospheric and gravitational lensing tus of optical searches for massive astrophysical compact halo then becomes obvious. Note, however, that the bending of objects (MACHOs) based upon microlensing, the formation light rays in gravitational fields is predicted to be totally of giant luminous arcs in galaxy clusters and the observation ‘achromatic’ (see eqn (1)). of ‘weak lensing’ in the Universe. Using photographs of a stellar field taken during the The anticipated contributions of ongoing and future solar eclipse in May 1919, and six months apart, Arthur space missions such as Chandra, XMM-Newton, MAP, Eddington and his collaborators (Dyson et al. 1920) were FIRST-Herschel, Planck, NGST and GAIA are presented in able to confirm, within a 20Ð30% uncertainty, the deflec- Section 4. tion angle predicted by Einstein (Figure 1). This was the second correct prediction of GR Ð the successful interpreta- tion of the advance of Mercury’s perihelion being the first Ð 1 HISTORICAL BACKGROUND and marked the full acceptance of the work of Einstein. It seems that Eddington (1920) was the first to propose One of the consequences of the theory of GR is that light the possible formation of multiple images of a background rays are deflected in gravitational fields. Although this pre- star by the gravitational lensing effect of a foreground one diction was made in the twentieth century, speculations that (but see the finding by Renn et al. (1997)). Note, however, light rays might be bent by gravitation had been proposed that Oliver Lodge (1919) had already characterized massive much earlier. Indeed, considering that light is composed of elementary constituents, Isaac Newton suggested as early as 1704 that the gravitational field of a massive object could possibly bend light rays, just as it would alter the trajectory (c) of material particles. A century later, Laplace independently made this same suggestion. Furthermore, the astronomer Johann von Soldner (1804) at the Munich Observatory (b) found that, in the framework of Newtonian mechanics, a light ray passing near the limb of the Sun should undergo an angular deflection of 0.875 . However, because the wave description of light prevailed during the eighteenth and nine- teenth centuries, neither the conjecture of Newton nor the result of Soldner were ever taken seriously. In 1911 Albert Einstein had re-derived the latter result on the basis of the equivalence principle, unaware of Soldner’s work. (a) In the elaboration of his theory of GR, Einstein pre- dicted that a massive object curves spacetime in its vicinity O and that any free particle, massive or not (e.g. photons), Figure 1 Curved spacetime around the Sun (a) and deflection will move along geodesics of this curved space. After deriv- of light rays from two distant stars as seen by an observer (O) ing the full field equations of GR, he predicted in 1915 that during a solar eclipse (b) and six months apart (c). GRAVITATIONAL LENSING 443
Figure 2 The paths of light rays in a point-mass (PM) lens model. In accordance with eqn (1), a set of parallel rays emitted from a distant source are deflected from their original paths as they cross the deflector plane. Different observer positions are represented from where an Einstein ring (O1) or a double image (O2) may be seen. objects like the Sun as imperfect focusing lenses since they of Einstein’s notebooks. It does therefore seem that by 1936 had no real focal length; the light from a background object Einstein had totally forgotten his pioneering gravitational being mainly concentrated along a focal line of ‘infinite’ lensing work from 1912, as if he had already convinced him- length (Figure 2). In 1923, E.B. Frost, then director of the self that the idea was too speculative to have any chance of Yerkes Observatory, initiated a programme to search for empirical confirmation. multiple images of stars in the Galaxy, but it seems that Fritz Zwicky (1937a,b) was the first to realize the very these observations never really took place. Orest Chwolson high probability of identifying a gravitational lens mirage, (1924) suggested that, in the case of a perfect alignment (i.e. one composed of several distinct images of a single between an observer and two stars located at different dis- object) among extragalactic sources. He even proposed to tances, the observer should see a ring-shaped image of the use galaxies as natural cosmic telescopes to observe other- background star around the foreground one. wise too faint and distant background objects. He also Independently, at the request of Rudi W. Mandl, a Czech emphasized the possibility of determining the mass of distant electrical engineer, Einstein (1936) rediscovered the major galaxies simply by applying gravitational lens optometry characteristics (double images, the ‘Chwolson’ ring usually and, in addition, to test the theory of GR. In 1937 Zwicky referred to as the ‘Einstein’ ring, and so on) of one star lensed stated that ‘the probability that galactic nebulae which act as by another, but he was very sceptical about the possibility of gravitational lenses will be found becomes practically a cer- observing this phenomenon among stars, probably because tainty’, and he was therefore very much surprised some the expected angular separations between the lensed images twenty years later that no such lensing effects had yet been were so small. What seems remarkable is that Einstein had found with the 5 m (200-inch) Hale Telescope at Palomar first established the whole theory of the formation of multi- (Zwicky 1957). ply-lensed images (the lensing equation, formation of double After several decades of little activity, interest in the the- images by a point-like deflector, amplification of the lensed ory of gravitational lenses was revived by Klimov (1963), images, and so on) during the spring of 1912, three years Liebes (1964) and Refsdal (1964a,b, 1966a). Some of their before completing his theory of GR. This finding has recently proposed applications were particularly promising because been reported by Renn et al. (1997), who had examined some of the recent discovery of quasars by Maarten Schmidt 444 JEAN SURDEJ AND JEAN-FRANÇOIS CLAESKENS
(1963). It would indeed be much easier to prove the lensing (Section 3.5). Since 1982 a group of French astronomers origin of multiple QSO images rather than that of extended led by L. Nottale from Meudon Observatory had regularly and diffuse galaxy images, since the former ones consist of submitted observing proposals with the CanadaÐFranceÐ very distant, luminous and star-like objects. (For more Hawaii Telescope (CFHT) to search for such arcs in the centres details on quasars, see Peterson (1997).) In 1964 Sjur of rich and compact galaxy clusters, among them Abell 370. Refsdal proposed to apply geometric optics in order to esti- However, the observing programme committees were not at all mate the time delay between the arrival times of two parts of receptive to this kind of proposition. Although these predictions the same distorted wavefront at the observer; this proposal, were entirely based on the theory of GR, and had been which formed the second part of his master’s thesis, was promoted several decades earlier by Zwicky, they probably (erroneously) judged of uncertain quality by his supervisor. still looked too revolutionary to conservative observing pro- Nevertheless he succeeded in publishing his findings gramme committee members. Furthermore, during the first (Refsdal 1964a,b); the referee of these papers was Dennis international conference that was explicitly dedicated to the Sciama! Refsdal finally, obtained his Ph.D. in 1970 on the field of gravitational lensing, entitled ‘Quasars and Gra- basis of that controversial second part of his master’s thesis. vitational Lenses’ (24th Liège International Astrophysical On the basis of the strong similarity between the spectra Colloquium, 1983), some of the organizers were very sur- of quasars and of the nuclei of Class 1 Seyfert galaxies, prised by the high degree of scepticism that was still present Barnothy (1965) proposed that high-redshift quasars could among a significant number of the participants. Probably half actually be the lensed images of distant Class 1 Seyfert the audience still then considered gravitational lensing to be galactic nuclei. Sanitt (1971) criticized this view on the an unproved phenomenon, while many others thought that basis of statistical arguments. the proposed lensed monsters would remain a mere cosmic Theoretical work continued at a low level of activity curiosity, with no further scientific interest. It also seems that through the 1970s. Refsdal (1965, 1970) and Press and until the 1980s, in some institutes gravitational lensing was Gunn (1973) discussed problems on lens statistics, Bourassa not always regarded as an acceptable subject to study. and Kantowski (1975) considered extended non-symmetric Astronomers who wished to pursue the subject would not lenses (Bourassa et al. 1973) and Dyer and Roeder (1972) have received a research grant for work dedicated to gravita- derived a distanceÐredshift relation for the case of inhomo- tional lensing Ð nor would they be invited to deliver a semi- geneous universes. In spite of clear theoretical predictions, nar on the topic. We know from Refsdal, Nottale and many the interest from observers was largely absent, and no other pioneers that they had the feeling of being considered systematic search for lenses was initiated. somewhat heretic by some of their colleagues. Forty-two years after Zwicky’s prediction, the first exam- Following the pioneering detections of multiply-imaged ple of a distant quasar (Q0957 561) doubly imaged by a quasars and giant luminous arcs, the levels of observational foreground massive lensing galaxy and its attendant galaxy as well as theoretical activities have increased dramatically. cluster was serendipitously discovered by Walsh et al. (1979). More than fifty multiply-imaged quasars and an even larger This system consists of two lensed quasar images separated number of gravitational arcs have now been discovered, and by approximately 6 . Good evidence that Q0957 561 A and more than 3100 scientific publications have been written B correspond to twin lensed images of a single quasar was over the past twenty years on the subject of gravitational provided by lensing (Figure 3). Space observations, and in particular 1. the similarity between their spectra, 2. a simple lens model which could naturally account for the slight morphological differences detected between the optical and radio images of the quasar (Young et al. 1981), 3. the discovery of the lensing galaxy between the twin quasar images, made possible by the use of modern CCD imaging, 4. the observation of a time delay of 1.14 years, first reported by Vanderriest et al. (1989), between the light variations of the two quasar components.
In parallel with the discovery of several other multiply- Figure 3 The number of scientific papers published annually imaged quasars, an even stronger interest in gravitational on gravitational lensing over the past forty years. (Based on the lensing studies developed with the first identification of gravitational lensing bibliography compiled by Pospieszalska- giant luminous arcs in 1986, by two independent teams Surdej et al. (2000).) 2
ARTURO RUSSO* The space age and the origin of space research
INTRODUCTION immediate post-war period. Then the Cold War was to fuel the development of more powerful missiles, capable of The exploration of space is one of the greatest achievements delivering a nuclear weapon across the oceans. of the twentieth century. In the ideal photo album of this cen- These missiles were also capable of launching a satellite tury one could hardly avoid including such pictures as our into orbit, and a new dimension was thus added to the Cold blue planet seen from space and the human footprint on the War confrontation: the demonstration of technical superior- Moon’s surface; the close-up images of a comet’s nucleus ity in a field which could greatly affect the public imagina- and the rough landscape of Mars; the floating of astronauts tion. This was seen as being particularly important in Third in the black vacuum around a space station and the rings of World countries, which were, at that epoch, emerging from Saturn photographed from an approaching spacecraft. the colonial era and in search of ideological principles, eco- Space technologies have provided us with the possibility nomic models and international patronage. Being first in of real-time global communications, as well as with unprec- space, first on the Moon, became an important political goal edented knowledge of cosmic phenomena. Traditional for both superpowers, wanting to demonstrate the superior- research fields like astronomy underwent dramatic changes ity of their cultural values, the efficiency of their political when telescopes could be flown beyond the atmosphere; institutions, the performance of their industrial system, and others emerged brand-new when physicists could put their the strength of their armed forces. sophisticated instruments onboard satellites and space-probes. Scientists on both sides of the Iron Curtain were able to Knowledge of our own planet took a big step forward when take advantage of the opportunity offered by this new dimen- it became possible to investigate, from space, the delicate sion, building instruments which could be fitted into the equilibrium of its various components. nose-cones of sounding rockets or the payloads of spacecraft, This book is devoted to the great advances in scienti- and designing space missions which could solve old and new fic knowledge that space technologies have made possible. problems in astronomy and geophysics. A new scientific com- Scientific research, however, can hardly be considered to be munity emerged from this experience, identified less by their the main driving force in space activities. Political and mili- disciplinary affiliation than by the technical means they used. tary considerations, in fact, played the major role in making Space science, in fact, is not a well-defined research the old dream of exploring space a technical reality. To the field. It includes, in principle, any scientific investigation deliberate attempt of the Nazi regime to develop the ulti- conducted by the use of rockets, Earth-orbiting satellites mate weapon to win the war we owe the V2, the prototype and deep-space-probes. In terms of established scientific of the rockets which helped the United States and the disciplines, it covers fields as different as atmospheric Soviet Union to start successful space programmes in the physics, geophysics, plasma physics, cosmic-ray physics and the various branches of astronomy (solar, stellar and * Università degli Studi di Palermo, Italy planetary; from radio wavelengths up to the gamma-ray
Arturo Russo, The Century of Space Science, 25Ð58 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 26 ARTURO RUSSO
written in the second century, whose heroes are taken to the Moon by a terrible whirlwind that lifts their ship into space. In Lucian’s second book, entitled Icaro-Menippus, the trip to the Moon is carefully planned and eventually performed by Menippus, the hero, with the help of wings fastened to his shoulders. At the height of the Italian Rinascimento, a voyage to the Moon was conceived in 1516 by the great Italian poet Ludovico Ariosto in his epic poem Orlando Furioso. Searching for the lost mind of Orlando, Astolpho is carried to the Moon by Elijah’s chariot drawn by four red horses. He visits the Moon, where everything lost on Earth is stored in a valley, and eventually finds the paladin’s mind in a phial. A century later, in Germany, one of the fathers of modern astronomy, Johannes Kepler, told of a Moon flight in his Somnium (Dream), posthumously published in 1634. In Kepler’s dream, conceived as an allegory for astronomy which reveals the reality of celestial bodies, supernatural demons (that is, astronomical knowledge) drive the learned traveller to the Moon along the shadow of the Earth during a lunar eclipse, and then they drive him back along the Moon’s shadow during a solar eclipse. Figure 1 Astronaut’s footprint on the Moon, 1969. (Source: NASA.) region of the electromagnetic spectrum). Space sciences have also revitalized the old subdiscipline of astrometry and have helped make observational cosmology a practical and very fruitful area of research. Moreover, they have made important contributions to material sciences, biology and medicine. Each of these disciplines was dramatically reshaped by the advent of space technologies, both regarding their scientific content (such as in situ measurements of ionospheric phenomena, chemical and isotopic investiga- tion of the Moon’s soil, astronomical observations in new spectral bands) and with respect to their increasingly inter- disciplinary nature (such as the growing importance of physicists in astronomical research). This chapter presents an historical overview of the origin of space science within the framework of the major politi- cal and military events which shaped the early phase of the space age. Given the nature of this overview, it was decided to include a comprehensive bibliography at the end, rather then to insert specific references to the vast literature on the history of space into the body of the text.
DREAMS OF SPACE FLIGHT
A journey to the Moon and beyond has for centuries been the object of popular fantasies, philosophical speculations and poetic dreams. Space travel literature can be traced Figure 2 Johannes Kepler. (Source: Istituto e Museo di Storia back to Lucian of Samosota’s Vera Historia (True History), della Scienza, Florence.) THE SPACE AGE AND THE ORIGIN OF SPACE RESEARCH 27
The same theme was dealt with in those years by two works in which space travel and encounters with alien people English bishops. Francis Godwin, Bishop of Hereford, serve as witty means for conveying the author’s opinions wrote The man in the Moon: or a discourse of a voyage about mundane matters. thither under the pseudonym Domingo Gonsales. The book But in the century which saw the birth of modern science, was published in 1638 and it became very popular all over other works tried to deal with the old dream of space flight Europe. The protagonist of the book, Gonsales himself, in the light of the new astronomical knowledge (sunspots, flies to the Moon by a chair pulled by wild geese. In the Jupiter’s satellites, Saturn’s rings, mountains on the Moon, same year, John Wilkins, Bishop of Chester, published The and so on), and in accordance with the tenets of the new discovery of the man in the Moon: or a discourse tending natural philosophy. A major event was the publication, in to prove that ‘tis probable there may be another habitable 1686, of Bernard de Fontenelle’s Entretiens sur la pluralité world in that planet. Wilkins tells nothing about the means des mondes, a popular astronomy book which was widely of propulsion but he speculates that no effort would be read throughout Europe for its fascinating speculations on the required beyond the point where the influence of terrestrial nature and habitability of Solar System bodies. Fontenelle gravity ceases, which he assumes to be some thirty kilometres claimed that each planet had its own inhabitants and dis- above the Earth’s surface. cussed their likely appearance, civilization and habits. At the court of the king of France we find the most The belief that planets are inhabited by different species remarkable author of Moon tales of the seventeenth cen- of rational beings was also expressed by the great Dutch tury. This is Savinien de Cyrano Bergerac, dramatist and scientist Christian Huygens in his Cosmotheros, or conjec- poet, philosopher and swordsman, whose enormous nose tures concerning the planetary worlds, first published in found a place in the history of French literature thanks to Latin in 1698. However, if the Moon and the planets were Edmond Rostand’s famous play of 1897. Cyrano’s L’autre no longer considered as metaphorical symbols but as real monde, ou les états et empires de la Lune, posthumously worlds subject to universal physical laws, tales of space published in 1657, tells of several attempts by himself to travels had to cope with the problem of escaping the Earth’s reach the Moon. First, he tied a string of vials filled with gravitational pull. In David Russen’s Iter lunare or voyage dew around himself, so that the rays of the morning sun to the Moon (1703) a huge spring device is proposed as a could draw him upward when evaporating the dew, but he means of launching a manned vehicle towards the Moon. only arrived in Canada on that attempt. Two years later, Daniel Defoe’s The consolidator tells us He then tried to launch himself from the top of a hill that ancient people mastered the art of space flight and used by a special spring-loaded vehicle, but the vehicle crashed to travel to and from the Moon. Defoe describes in particular down into the valley. After curing the bruises by smearing an intriguing engine based on the use of hollow bodies ox marrow on his body, the hero returned to his vehicle, but filled with ‘a certain spirit’ which produced a continuous he found that some soldiers had taken possession of it and fire. The fire acted on a spring-and-wheel mechanism were tying fireworks to it in order to make it appear as a which kept two large wings in motion. flying fire dragon. He tried to stop the soldier who was Many other tales of lunar travel can be found in eigh- setting fire to the fuse, only to find himself trapped in the teenth century literature, including Samuel Brunt’s A voyage vehicle while the latter lifted off to the clouds. to Cacklogallinia (1727), whose heroes are propelled into Tiers after tiers, all the rockets were rapidly exhausted, space by intelligent birds, and Murtagh McDermot’s A trip but instead of crashing down to Earth with the vehicle, to the Moon (1728), where a whirlwind takes the hero to Cyrano continued his ascension to the sky. The waning the Moon and gunpowder is used to return him to Earth. In Moon, he explained, used to suck up the marrow of ani- Voltaire’s Micromégas (1752) we find the first extraterres- mals, and that is why his greased body was pulled upwards. trial visitor of literary history: an intelligent giant arriving A few days later he finally landed on the Moon, where he on Earth from the star Sirius after a long journey through had many adventures and philosophical discussions with its the Milky Way and the Solar System. inhabitants. After returning to Earth, the hero decided to In the nineteenth century, science fiction became a visit the Sun, and the story is told in Cyrano’s second book, well-established literary gense, nourished by the dramatic Histoire comique des états et empires du Soleil (1662). progress of astronomy and by the growing importance of While presenting the first space travel performed by science and engineering in everyday life. Romantic tales and means of rocket propulsion, Cyrano’s works do not deal philosophical speculations were gradually replaced by scien- with science, nor even science fiction. They are instead tific and technical discussions of the various aspects of space remarkable examples of philosophical novels, a literary flight. Science fiction writers, however, still lacked the most device for the author to expound his materialistic ideas and important element: a credible means to escape the Earth’s his libertine vision of human affairs. European literature of gravitational field. An antigravity material called lunarium the seventeenth and eighteenth century counts several other was devised by the University of Virginia professor George 28 ARTURO RUSSO
Tucker in his novel A voyage to the Moon with some firing of the cannon and the story is then picked up in the account of the manners and customs, science and philoso- second book. Here we follow the heroes during their space phy of the people of Morosofia and other Lunarians, pub- travel. Owing to a large meteorite passing close to their lished in 1827 under the pseudonym of Joseph Atterley. Eight spaceship, the latter is displaced out of the planned trajec- years later, Edgar Allan Poe, a former student of Tucker’s, tory and ends by orbiting around the Moon. An attempt to sent his hero Hans Pfaall on a lunar trip in a home-made steer the vehicle towards landing by the use of the rockets balloon. In 1865, the Frenchman Achille Eyraud imagined fails and the voyagers are pushed towards the Earth, safely a spaceship powered by a reaction engine based on the descending in the Pacific Ocean where they are rescued by ejection of water for his Voyage à Venus. an American ship. The recognized father of modern science fiction is Jules Verne’s stories took advantage of the most advanced scien- Verne, whose novels De la terre à la lune (1865) and tific knowledge of his times, including calculations of escape Autour de la lune (1870) became very popular all over the velocity and flight trajectories, a sound determination of the world. The former tells of the long and careful preparation best geographic location for the launch cannon (in Florida, of a voyage to the Moon, to be accomplished by three not far from the present US launch facilities), and a realistic astronauts sitting inside an artillery shell fired into space by description of weightlessness in outer space. Inspired by the a huge cannon, called Columbiad. The shell was equipped general scientific optimism of the late nineteenth century, with a special mechanism for absorbing the shock, a chemi- Verne and his followers were less interested in writing cal system for air regeneration, and rockets for soft landing Moon tales (his heroes, in fact, do not land on the Moon), on the Moon’s surface. The book ends with the successful than in discussing in literary form the possibilities for
Figure 3 Illustration from De la terre à la lune by Jules Verne. Figure 4 Illustration from Autour de la lune by Jules Verne. (Source: AKG.) (Source: AKG.) 21
RICHARD MUSHOTZKY* Clusters of galaxies
Clusters and groups of galaxies are now recognized to be with infall velocities of ~2000 km s 1 and total masses 15 65 fundamental constituents of the Universe. According to of 10 ML have a kinetic energy of 10 erg. The shocks modern theory, most of the material in the Universe that has and structures generated in the merger have an important collapsed into coherent structures lies in groups of galaxies, influence on cluster shape, luminosity and evolution and while clusters are the only entities in the Universe that may generate large fluxes of relativistic particles. These should be representative of the Universe as a whole. Thus mergers are best studied with X-ray imaging and spec- the study of groups and clusters of galaxies is of tremendous troscopy studies of the temperature, abundances and gas importance. One of the most surprising distinguishing char- density, which can reveal the entropy and pressure in these acteristics of these, the largest and most massive objects in systems. X-ray imaging is also required to distinguish the Universe, is that they are filled with hot X-ray-emitting between true mergers and the projections that one expects in gas whose mass exceeds (often by factors of 3Ð7) the mass a hierarchical universe (Henriksen and Markevitch 1996). of the galaxies in the groups and clusters. This gas is Numerical studies of large-scale structure formation enriched in heavy elements (e.g., oxygen and iron) and is show that groups are the tracers of the cosmic web, being thus the repository of most of the metals in the Universe. the connection of galaxies to clusters in a hierarchical Clusters are also exceedingly sensitive tracers of the cos- merger “tree”. Thus detailed surveys of groups will provide mological parameters of the Universe (such as its mass den- the best measure of the nature and form of large-scale 1 sity m, its age H0 , and the cosmological constant ) and structure, which in turn will provide detailed cosmological the nature of the primordial spectrum of density fluctua- information. tions. Detailed studies of clusters and their evolution as Because detailed measurements of the mass, metallicity, well as their baryonic content are among the most robust evolution, number density, and structure of groups and clus- estimators of the cosmological parameters (Bahcall et al. ters of galaxies are best performed with X-ray data, 1999) and at present are one of the strongest arguments for X-ray astronomy has become central to the study of clus- a low- universe. X-ray surveys, at present, are capable of ters. At present, X-ray data can yield the cluster mass, detecting clusters in a uniform robust way to redshifts 1.2 structure, and iron abundances out to redshifts of ~0.8 and thus provide the fundamental material for the study of (Donahue et al. 1999), measuring the abundances and dis- cluster evolution and its cosmological impact. tribution of other elements at z ~ 0.1, finding clusters at z ~ Cluster mergers are thought to be the prime mechanism 1.24 (Rosati et al. 1999), and determining the nature and of massive cluster formation in a hierarchical universe form of cluster number and mass evolution (Henry 2000) to (White and Frenk 1991) and represent the most energetic z 0.5. Because groups are much less luminous, their events in the Universe since the big bang. These mergers study so far has been limited to the local volume of space (z 0.05). Since they are extremely numerous there is * NASA Ð Goddard Space Flight Center, Greenbelt, MD, USA hope that detailed X-ray surveys optimized for the study of
Richard Mushotzky, The Century of Space Science, 473Ð498 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 474 RICHARD MUSHOTZKY groups will be able to map the cosmic web seen radiation from a collisionally dominated plasma with about in detailed numerical simulations (Cen and Ostriker 1999). a third of solar abundances (see Sarazin 1988 for a review). The hot gas in the cluster potential well will scatter While there may be significant contributions at both lower the microwave background and generate a signal whose and higher energies from synchrotron or inverse Compton strength depends on the path integral of the pressure in the radiation from relativistic particles (Sarazin 1999), the bulk cluster but is independent of distance (Sunyaev and of the radiated energy and the total energy in the cluster is Zel’dovich 1981). The ratio of the X-ray properties to the due to thermal particles. The observed range of temperatures microwave properties is distance dependent, and provides a for clusters and groups is 0.2Ð15 keV, and the luminosities direct physical means of determining the angular distance to range from 1042 to 3 1045 erg s 1 (Mushotzky and Scharf the objects and thus measuring the Hubble constant and . 1997). There is a strong correlation between cluster lumi- (Frequently the Hubble constant is given in units of h, which nosity and temperature (Mushotzky 1984). The strongest means that it is normalized to a value of 100.) The emission line complex from a plasma of kT 3 keV is the Sunyaev–Zel’dovich effect provides an independent esti- He-like 6.67 triplet from Fe XXV, and the first studies of mate of the cluster gas mass, which has confirmed the X-ray cluster abundances were only able to measure the flux from estimates and reinforces the large baryonic ratios in clusters. that line complex. In more modern work, with the ASCA Knowledge of the nature of the galaxies in clusters and (Advanced Satellite for Cosmology and Astrophysics) and their evolution with cosmic time has been vastly enhanced BeppoSax satellites, the emission from the more abundant by the imaging capabilities of the Hubble Space Telescope elements heavier than oxygen has been measured. At the (HST). The detailed Hubble images from the UV to the lower temperatures representative of “poor” clusters and near-IR have enabled the age dating of elliptical galaxies, groups the spectrum is line dominated, much of the flux the most dominant population of cluster galaxies. being emitted by a veritable sea of Fe L lines. The HST optical data, when combined with the X-ray There is very little variance in cluster metal abundances measurements of cluster evolution, indicates that clusters are for massive systems with kT 3 keV, but at lower tempera- very old, even the most massive systems having formation tures (and masses) there appears to be a wide range of epochs at z 1, and most of the stars in the galaxies forming abundances and a variation in the range of Si to Fe abun- at z 2. This is contrary to the strong predictions of a 1 dances, indicative of a different formation of heavy ele- closed universe (Turner 1999). The great age of the stars in ments as a function of cluster mass. These lower-mass cluster galaxies also indicates that most of the metals, which systems also have more entropy per unit particle than is are found in the cluster gas, were formed at high z. indicated by the numerical simulations, indicating, perhaps, The exquisite Hubble images have also enabled the robust that there has been additional energy injected into the inter- measurement of gravitational lensing of background galaxies galactic medium (IGM) other than by gravity. The amount by the foreground cluster. While the lenses were discovered of energy implied is rather large and should have significant from ground-based imaging, one requires the HST images to effects on group and cluster formation. obtain accurate cluster mass profiles. These data can even Detailed analysis of the cluster temperature and gas den- place constraints on the fine-grain distribution of dark matter sity provide, via the equation of hydrostatic equilibrium, in clusters. In addition to providing measurements of the robust estimates of the total cluster mass and gas mass. It dark matter, lensing amplifies the light from the lensed became clear at the beginning of the 1990s (White et al. galaxies, allowing detailed measurements to be made of the 1993) that the dark matter in clusters accounted for “only” properties of high-redshift galaxies (Steidel et al. 1999). ~80% of the total mass. However, in the closed 1 Detailed numerical simulations of cluster formation and Universe popular at the time, dark matter should account evolution which include the effects of dark matter and gas for ~94% of the total mass. Since clusters are thought to be have reached quite high levels of sophistication (Eke et al. fairly representative samples of the Universe, this was a 1998). These calculations indicate that the gas is almost in strong indication that either the theory of cluster formation hydrostatic equilibrium and that the clusters should follow had a serious error or that 1, with a best estimate for a set of scaling laws. One of the most important of these being 0.3, for example the ratio of the baryons in the clus- laws is the relation between gas temperature and the total ters to the dark matter compared to the cosmic ratio. mass of the cluster, including gas, galaxies, and dark mat- In the centers of a large fraction of luminous clusters the ter. Theory indicates that the total mass should scale as the X-ray-emitting gas has a cooling time, in the absence of X-ray temperature to the power 1.5 and that for massive any known sources of heat, which is considerably less than systems the X-ray temperature is an accurate estimate of the Hubble time, so one anticipates that the gas should be the cluster mass. cooling (Fabian 1994). In the most spectacular examples of The 0.2Ð15 keV X-ray emission from most clusters of this phenomenon the gas should be cooling at a rate of 1 galaxies is dominated by thermal bremsstrahlung and line 1000 ML yr . X-ray spatial and spectral measurements CLUSTERS OF GALAXIES 475 provide striking confirmation of this idea, but the eventual it was not until the 1930s (Zwicky 1937, Smith 1936) that repository of the cooled material has proven to be elusive. they were recognized as very large conglomerations of The luminous galaxies that reside at the centers of cooling galaxies at great distances. The first dynamical analysis of flows have a wide variety of unique properties in the UV, clusters showed that there must exist much more gravita- optical, IR, and radio bands which indicate that the cooling tional material than was indicated by the stellar content of flow phenomenon has important consequences not only for the galaxies in the cluster. This was probably the first dis- the hot gas but also for galaxy formation and evolution. If covery of the preponderance of dark matter in the Universe. the final repository of the cooling gas is stars, as it clearly is However, the field did not advance much until the early for some of the material (O’Connell and McNamara 1988), 1960s, for what seems, in retrospect, to have been the sim- then this process could be a major factor in massive galaxy ple reason that astronomers did not want to accept that most formation. of the material in the Universe was dark. The field has undergone an enormous change, driven With ever larger ground-based telescopes, cluster primarily by a vast improvement in technical capability research developed along the lines of measuring the num- since the first discovery of X-ray cluster emission during bers, sizes, luminosity, and spatial and velocity distribution a rocket flight in 1968 (Friedman and Byram 1967). The of the galaxies. The development of large catalogs of clus- development of spacecraft allowed the first all-sky surveys ters (Abell 1958, Zwicky and Herzog 1963) based on eye with Uhuru in 1971 and Ariel 5 in 1975 (Gursky et al. estimates of the number of galaxies per unit solid angle 1971, Villa 1976), the development of low-background pro- was a major milestone in the field. The relatively strict cri- portional counters resulted in the first X-ray spectra with teria for the Abell catalog proved to be a good guide to the Ariel 5 (Mitchell et al. 1976) and OSO 8 (Serlemitsos et al. physical reality of the objects, and it is amazing that 40 1977); and the development of imaging X-ray optics years later we are still using this catalog. Extensive optical made possible the first X-ray images, with the Einstein follow-up studies of objects in these catalogs was more Observatory (Jones et al. 1979), and the large number of time-consuming, and it was not until the early 1970s (Rood images and spectra with Rosat, ASCA, and BeppoSAX in 1974) that the first large samples of estimated cluster mass the 1990s. The development of X-ray photon-counting from measurement of the velocity distribution of the galax- CCDs combined with high-throughput thin-foil optics led ies via the use of the virial theorem were obtained. to the first high-quality spectra of numerous clusters with By the early 1970s it became clear (see the pioneering ASCA. The tremendous improvement in sensitivity, spec- paper of Rood et al. (1972), and the detailed study of the tral resolution, and spatial resolution allowed by the spec- Coma Cluster by Kent and Gunn (1982)) that clusters of tacular increase in angular resolution made possible by the galaxies were dominated by dark matter, with galaxies rep- Chandra mirror and the technology breakthrough resulting resenting less than 5% of the total mass, and that there were in the very large collecting area of XMM-Newton will push definitive patterns in their galaxy content (Dressler 1980). X-ray cluster studies to new levels of sophistication and Thus the issue of the “missing mass” or “dark matter” detail in the first decade of the new century. The spectacular became the central theme of cluster research. improvement in spectral resolution that would have been The nomenclature of the field was partially defined by brought about by the calorimeter on Astro-E will have to Abell’s survey of the northern sky for clusters based on a await a future flight of this technology, due to the failure of visual search of the Palomar Observatory Sky Survey Astro-E to reach orbit. plates. He defined a set of “richnesses” and distance classes To go beyond these three new missions will require even based on the apparent magnitudes of the galaxies and the larger collecting areas and better spectral resolution, as will number of galaxies inside a fixed “metric” radius. Abell be possible with the Constellation-X mission under study in used strict criteria to define the physical reality of a cluster. the USA. While the progress in the 1990s was truly spec- “Rich” clusters (i.e., those with many galaxies inside a tacular, we anticipate even more rapid progress in the next fixed metric (Abell) radius) had a preponderance of “early” decade. type (elliptical and S0) galaxies, while “poorer” clusters had a larger fraction of spiral galaxies. It was clear that many clusters had a rather unusual central galaxy, a cD, or HISTORICAL CONTEXT centrally dominant galaxy (Morgan and Osterbrock 1969) which is very seldom, if ever, found outside clusters. There (See Bahcall (1977) for an early review.) Clusters of galax- was also an unusual type of radio source found primarily in ies are the largest virialized structures in the Universe, with clusters, a so-called WAT, or wide-angle tailed source 13 15 masses of 10 Ð3 10 ML and sizes of 1Ð3 Mpc. They (Owen and Rudnick 1976). There were the first indications were discovered rather early in the history of modern astron- of cluster evolution (Butcher and Oemler 1978) in which omy Ð by Herschel (as noted by Lundmark 1927) Ð but distant clusters at z ~ 0.2 tend to have “bluer” galaxies than 476 RICHARD MUSHOTZKY
low-redshift clusters (to an optical astronomer elliptical obtain accurate multi-color photometry for many hundreds galaxies have rather “red” colors while spirals tend to be has vastly improved the quality of the data. At present it is bluer), but the morphology of these galaxies was unknown. the combination of X-ray, optical, and even radio and IR However, the fundamental realization (Binney and data that yields the best constraints on the physical parame- Strimpel 1978, Heisler et al. 1985) that the mass of the ters of clusters. cluster could not be precisely determined from galaxy posi- The ability of modern instrumentation to observe clus- tion and velocity distributions severely hampered further ters at all mass scales over a wide range of redshifts has progress. The mass of clusters was parametrized by the so- made them one of the prime areas of extragalactic research. called mass-to-light ratio (in solar units, so the Sun has a Their use as cosmological probes, as large samples of value of 1). This is the ratio of the two quantities that can galaxies, and even as “fair samples” of the Universe has be well estimated by optical data: the total amount of light made the study of clusters a rather exciting area of research. in the system and a dynamical estimator of the mass (most From a theoretical point of view the rather “simple” physics often the virial estimator). For most clusters the values involved in their formation and evolution makes their prop- were ~400 h 1 (the mass to light ratio depends on the dis- erties amenable to “first principles” calculation and allows tance to the cluster since the total light is proportional to a direct comparison of theory and observation. A quick flux distance2, while the virial mass estimator is propor- look at the main journals shows hundreds of papers a year tional to velocity dispersion cluster size). There were being published in this area, and the rate of increase is also technical difficulties with measuring the effective clus- rapid. ter size and the total amount of light, and with determining the velocity dispersion in the presence of sub-structure in THEORETICAL CONTEXT the cluster and foreground and background contamination (see Oegerle et al. (1989) for a status report). Physical models of clusters Since the mass-to-light ratio of “normal” galaxies is in the range of 2Ð12, this large value for clusters indicated the Modern theories of cosmology and large-scale structure are presence of large amounts of dark matter. However, mass- built on the assumption that cold dark matter (CDM) is the to-light ratio is not a basic astrophysical quantity since it main form of dark matter in the Universe and that, based on depends on the age, initial mass function, dust content, and inflation, there is an initial power density spectrum of fluc- metallicity of a stellar system as well as on the optical color tuations. This theory has a set of “free” parameters Ð the system in which it is obtained. Sensible values from 1 to 20 Hubble constant H0, the mass density in gravitating mater- can be obtained without the need for dark matter. Thus, ial m, the fraction of the closure density in cold dark mat- while these data strongly indicated the presence of dark, ter particles, the value of the cosmological constant , the non-baryonic material, its abundance and nature could not normalization of the power spectrum 8, the baryonic frac- be strongly constrained. tion (the fraction of the closure density in baryons) b, and One of the most surprising discoveries of the space age the amount of “hot” dark matter. While this is rather a large has been that most of the baryonic material in clusters is in set of free parameters, most of them are fairly well deter- the form of hot X-ray-emitting gas which is enriched in mined by observation. In this CDM theory, structure forms heavy elements to about a third of solar abundances. This in a “bottom-up” scenario in which small masses form first discovery came about in stages (see the next section), and it and larger structures such as clusters form later. However, is now realized that X-ray emission from clusters is a domi- this is a vast oversimplification, and a wide range of mass nant process and that, in some sense, clusters of galaxies scales collapse at a wide range of redshifts. For a m 1, are X-ray objects. This discovery was essentially entirely 0 universe (the paradigm before the late 1990s), clus- unpredicted and has led to fundamental change in our ters form at rather low redshifts, and Gott and Gunn (1972) understanding of the Universe. It is rather surprising to referred to the present as “the epoch of cluster formation.” realize not only that most of the material in the Universe is Because clusters are the largest virialized systems in the dark and non-baryonic, but that most of the baryons in the Universe they are at the tail end of the distribution of Universe do not shine in optical light. The anthropomorphic fluctuations, so-called 3 fluctuations. Their formation and picture that the Universe is best studied with the light visi- evolution is thus very sensitive to the cosmological para- ble to our own eyes is not only seriously in error, it drives meters (see Bahcall et al. 1999 for a recent review). Most science in the wrong directions. cosmological models, in fact, are normalized by the COBE Since the discovery of X-ray emission from clusters, spectrum of fluctuations and the abundance of massive the nature of the optical data has changed radically. The clusters Ð so-called cluster normalized models. ability of multi-fiber spectrographs to obtain velocities for Before the late 1970s theoretical work focused on hundreds of galaxies in the cluster and of CCD imagers to mechanisms for “relaxation” (so-called violent relaxation 22
RALPH A.M.J. WIJERS* Gamma-ray bursts
The only feature that all but one (and perhaps all) of the very Like Jansky (radio), Penzias and Wilson (microwave), and many proposed models have in common is that they will not be Giacconi et al. (X-rays), it was what they were not looking the explanation of -ray bursts. Unfortunately, limitations of for that started a new field: occasional flares of gamma rays time prevent me from telling you which model is the exception. lasting for just seconds, from extraterrestrial sources. Soon, (If I did so, I would suggest Black Hole ridden by Accretion as other satellites confirmed the discovery. the favorite in the race with Glitch as a dark horse if only A number of reviews have appeared that summarize the because so many different horses and jockeys are riding under that name.) Ð Mal Ruderman, Texas Conference, 1974 knowledge up to that point. Ruderman’s paper (1975) and its observational counterpart by Strong et al. (1975) give an When Mal so ended the first theory review on gamma-ray account of the initial explosion of discovery and thought. bursts (GRBs), I was on the other side of the world, practic- Higdon and Lingenfelter (1990) give a good overview of ing my fractions. But had he spoken these same words at the the pre-BATSE state of the field. The new discoveries made end of the fifth Huntsville GRB meeting in October 1999, his with the Burst and Transient Source Experiment (BATSE) favorite model (though not his reasoning) would have been on board the Compton Gamma Ray Observatory (CGRO), instantly recognized by all present as a very plausible and and the bewilderment following its overthrow of the galac- popular contender for GRBs. His dark horse is still easily tic disk neutron star paradigm, are described in the review identified with soft gamma repeaters Ð Plus ça change, plus by Fishman and Meegan (1995). The consequences of the que ça reste le même. What happened roughly in between the discovery of counterparts to GRBs and the state of the art two events is the topic of this tale. We shall see how the field at the end of the 1990s are described in reviews by Piran emerged out of the mutual distrust of the Cold War adver- (1999; emphasis on theory and early stages of the burst) saries in the 1960s, and was finally resolved by a determined and by van Paradijs et al. (2000; emphasis on observations effort of just about the entire membership of the United of afterglows Ð the later part of this chapter is based on Nations in the late 1990s. Its progress, of course, was in fits parts of that review). To the historically inclined, review and starts, interwoven with stagnation and frustration. papers are only part of the story because they reflect more Since rays are absorbed by less than 1% of the atmos- the final outcome of research than the struggle to reach that phere, they are quintessentially the domain of space science. outcome. To get a flavor of the latter throughout the history The initial motivation for launching satellites with -ray of the subject, I recommend a number of conference pro- capability was not, however, scientific. The purpose of the ceedings. Theorists’ struggles to come to terms with the US Air Force’s Vela satellite program was the detection of phenomenon in the early days are reflected in the contribu- nuclear explosions in the upper atmosphere or in space, tions to the 1974 Texas Symposium (Bergmann et al. which were prohibited by the first ever test ban treaty. 1975). Various stages of progress in the next decade are documented in the proceedings of meetings in La Jolla * State University of New York at Stony Brook, NY, USA; present (Lingenfelter et al. 1982) and Stanford (Liang and address: Universiteit van Amsterdam, The Netherlands Petrosian 1986). The first conference of the BATSE/CGRO
Ralph A.M.J. Wijers, The Century of Space Science, 499Ð528 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 500 RALPH A.M.J. WIJERS
era and the accompanying wind of change is documented in At the beginning of a new century, this field is likely to the proceedings of the first Huntsville GRB symposium explode into a major branch of astrophysics. (Paciesas and Fishman 1992). After the discovery of coun- The organization of this chapter is mostly historical, terparts, a meeting was called on short notice in Elba, Italy though where dictated by logic I complete the discussion of (26Ð27 May 1997). Its atmosphere of excitement mixed some topics when they first arise, to prevent too much sub- with astonishment and confusion is firmly etched in my division of the narrative. It begins in the days when every memory; since the organizers wisely chose not to burden gamma-ray photon had a name, and it ends with the demise the attendees with writing proceedings papers, this feeling of the CGRO, halfway between the popular and the logical will forever remain the privilege of those who were there. end of the twentieth century. I dedicate this account to Jan The excitement was still very great during what I think of van Paradijs, who taught me so much. Despite his untimely as the best meeting I ever attended: the fourth Huntsville death on 2 November 1999, his science will last well into GRB symposium (Meegan et al. 1998). the twenty-first century. At this point I should describe my own path into the field, since it may have an impact on the tone and content of this review. I was in primary school when the discovery of GRBs 1 THE EARLY DAYS was announced and remained blissfully unaware of them until some time during my PhD training in Amsterdam. I did 1.1 Coming in from the cold not start active research in the field until Bohdan Paczynski« ’s unlimited enthusiasm swept me into it in 1991. Therefore As described by others in this volume, high-energy astron- my accounts of the pre-BATSE era are entirely derived omy started in the 1950s, because it required high-altitude from the literature, and from conversations with those balloon and satellite technology to get equipment above the researchers I interacted with Ð by no means an unbiased atmosphere for long times. Even 1% of the atmosphere sample of the field. Reviews are written with the sanitizing absorbs most gamma rays, so being in space is essential censorship of 20/20 hindsight and thus do not fully reflect (though some observations and new technology tests can be the bouts of inspiration and confusion that characterize done with balloon experiments). In the history of GRBs, the ongoing research: here lies a challenge for those who were other important point to note is that the early days of satel- there to see it. Having come into GRBs by way of lite astronomy coincided with the peak of the Cold War. A Princeton also means that my allegiance in the distance practical way was needed to verify compliance with the first scale debate of the early to mid-1990s was firmly extra- test ban treaty, which prohibited the detonation of nuclear galactic. While that point of view did eventually prevail, weapons in space. Since it was possible in principle to hide this outcome cannot have been so clearly predictable as I the initial explosion (by detonation behind the Moon), one thought it was then, as is nicely illustrated by the reports of type of detector was designed to catch the MeV gamma rays the “Great Debate” on the GRB distance scale (Fishman from radioactive decays in the debris cloud that would drift 1995, Lamb 1995, Paczynski« 1995). A good perspective on into view some time after the explosion. For this purpose, the merits of such debates can be obtained from Trimble’s the US Air Force in collaboration with the Los Alamos and (1995) discussion of the ShapleyÐCurtis debate. The field Sandia Laboratories operated the Vela satellites (Figure 1; of soft gamma repeaters started as part of GRBs but has the name is derived from the Spanish velar, which means slowly split off as a separate discipline; its recent develop- “to watch over” or “to hold a vigil”). They flew in pairs on ment has been similar in rapidity and magnitude to the opposite sides of a 250 000 km diameter orbit. GRB revolution, but this has been somewhat overshadowed Clandestine nuclear explosions were not found, but by the latter. I discuss them only briefly in this chapter something else was Ð short, intense bursts of gamma rays (Section 4). from random directions in the sky. The first one on record A large part of this chapter is devoted to the events fol- dates from 2 July 1967 (Figure 1), but the discovery was lowing 28 February 1997, when the first X-ray and optical not published until 1973 (Klebesadel et al. 1973). It is counterpart to a GRB was discovered. The developments sometimes stated that this was due to the classified nature since then have been fast and furious: the first two counter- of the mission, but this is false: since the objective of the parts settled the distance debate over GRBs firmly in favor satellites was to deter violations of the treaty, their exis- of the extragalactic scale, making them the most powerful tence was well advertised ( just as shops conspicuously explosions since the Big Bang. Many issues are still not advertise their closed-circuit cameras). However, since only quite settled, and thus this part of the story has a decidedly a few people worked on the data, and did so by hand, it less finished feel. Still, the discovery that they are somehow took time for them to convince themselves that spikes in the associated with young stars and can be observed out to very highly variable -ray background were real signals and had high redshifts opens up many new avenues of development. a cosmic origin. The basic method for doing this is still GAMMA-RAY BURSTS 501
1500
1000
counts/second 500
0 –4 0–2 2468 Time (seconds)
Figure 1 The Vela satellite (here Vela 5b), used by the USA to verify compliance with a ban on nuclear explosions in space (left). The light-curve of the first GRB it ever saw, on 2 July 1967, is also shown (right). (Images courtesy of “Astronomy Picture of the Day,” at http://antwrp.gsfc.nasa.gov/apod/astropix.html.) used in modern instruments: First, one requires more than contains an interesting scribbled note to the conclusion one detector to see the signal; and second, one uses the dif- section: the reader comments that such intense flashes in ference in arrival time of the burst signal between two TeV gamma rays should be visible with the TeV telescope widely separated satellites to constrain the arrival direction at Mount Hopkins. With the kind help of the people at (Figure 2). In this way, the early researchers eventually ADS, I discovered that the owner of that copy of the convinced themselves that the transient gamma-ray events Astrophysical Journal was Professor J. Grindlay, who con- they saw did not come from the Earth or the Sun. firmed to me that he had indeed considered the issue seri- It is interesting to note that the first observational paper ously. The problem, of course, was the same as with all already had a model to discuss. A few years earlier, Colgate other counterpart searches: one did not have any means of (1968) had published his idea that a flash of gamma rays getting the very early and accurate locations required to may be produced when the shock powering a Type II super- aim a telescope at the GRB locations quickly enough. nova breaks out of the surface of a red giant. Because the Klebesadel et al. (1973) commented that there have been shock conserves energy, it becomes ever hotter as it moves no detected supernovae near the GRBs they detected, and down the density gradient, and at the point where the therefore dismissed the model. Very recent developments (Thomson) optical depth becomes unity, it emits photons have put supernovae very much back into the picture with energy comparable to the electron rest mass. Since the (Section 6.2). medium is moving towards us with a Lorentz factor of The first few facts about GRBs emerged quickly. Cline about 1500, we see these blueshifted to GeV energies, and et al. (1973) used a hard X-ray detector on the IMP 6 satel- the flash time shortened to tens of microseconds. In later lite, designed for solar flare observations, to confirm the papers (Colgate 1973, 1974), he considers cooling before discovery and show that the spectra of the bursts really did shock break-out and revises his estimates to more nearly peak in gamma rays (thus excluding the possibility that the observed burst durations and photon energies. The they were just high-energy tails of some type of X-ray scanned version of the 1974 paper held in NASA’s ADS event). A telescope with some directional capability on database (http://adsabs.harvard.edu/abstract_service.html) board OSO 7 confirmed the extrasolar origin of one burst 502 RALPH A.M.J. WIJERS
observed with Apollo 16 that has enough signal above 1 MeV to show that the high-energy part is also a power
GAMMA law (Metzger et al. 1974). The time histories are usually BURST quite spiky, with overall durations of 0.1 to 100 s and sharp A fluctuations down to 16 ms, the finest time resolution of the Velas. The locations of the bursts seem to show no signifi- cant preference for any part of the sky, except that two events are consistent with the location of the well-known accreting source Cyg X-1 (a fact that contributed consider- θ ably to Ruderman’s pronouncement on theories). It is also D12 noted that the fluence distribution for bright events is S1 S2 B N ( S) S 3/2, the expectation for sources whose density 2–3 ANNULUS C∆T does not depend on distance (often referred to as the COS θ = S3 D12 “Euclidean” distribution). Cline (1975), in the same volume, 1–2 ANNULUS confirms the high-energy end of the spectra to be a power law, and argues that the claimed flattening of the fluence THE TRIANGULATION METHOD distribution at low fluence is not significant. With hindsight, his balloon detections must have been something other than Figure 2 When two or more satellites detect a GRB, the GRBs, since the fluence distribution is now known to be arrival time difference of the signal between each pair of flatter than Euclidean at the fluence values he reports. satellites can be used to constrain the location of the burst to a Enter Ruderman, who, also in the same volume, has the narrow circle on the sky. With three or more satellites, a true task of deciding which of the models to date survive error box – sometimes as small as a few arc minutes across – scrutiny. The overview of models later given by Nemiroff can be constructed. This principle underlies the Interplanetary (1994) lists 15 models dated 1974 or earlier; most of them Network (IPN), still in operation. (Image courtesy of K. Hurley.) are discussed in Ruderman’s (1975) paper. He recognizes the limitations of theorists: “Most theoretical astrophysi- cists function well in only one or two normal modes. (Wheaton et al. 1973). Incidentally, the first GRBs for Therefore, we often tend to twist rather strenuously to con- which a cosmic origin could be established were observed vince ourselves and others that observations of new phe- with the Vela 5 and 6 series satellites, which had enough nomena fit into our chosen specialties.” He finds the timing accuracy to pin down the directions to GRBs suffi- brilliant solution of inviting as many as possible to enlist: ciently well. The burst of 2 July 1967 was detected with the “For theorists who may wish to enter this broad and grow- Vela 3 and 4 series, and is classified as the first GRB only ing field, I should point out that there are a considerable on the basis of its similarity to the later ones, not because number of combinations, for example, comets of antimatter the direction to it is known to be inconsistent with a solar or falling onto white holes, not yet claimed.” terrestrial origin. Ruderman largely dismisses the extragalactic models, in The first published Soviet observation of a GRB, using part because Colgate’s supernova model is damaged by the the Kosmos 461 satellite, dates from 1974, and is by the non-detection of supernovae coincident with known GRBs. Leningrad group (Mazets et al. 1974). Igor Mitrofanov has However, he also derives a constraint on distances from a told me that they too knew earlier that they might have black-body limit to the luminosity, assuming the observed something cosmic on their hands, probably in about 1971. photon energies are comparable to the thermal energies of the emitting particles, which virtually excludes extragalac- tic models beyond 30 Mpc. He acknowledges this to be 1.2 The earliest observations and theories wrong for some radiation mechanisms, but nonetheless the From here on the pace of discovery picked up quickly. In constraint appears to have impressed the audience. One of the review by Strong et al. (1975), in the proceedings of his acknowledged ways around the constraint is synchro- the seventh Texas Symposium, 34 bursts are listed. Their tron radiation, since for this mechanism the emitted pho- “sizes” (now called fluence or time-integrated flux) are in tons do have much lower energies than the particles that the range 10 5 10 4 erg cm 2. The spectra, where avail- emit them; of course, we now know synchrotron radiation able, show a power law at low energies and then a bending to be the dominant contributor. Among the galactic models to a steeper decline at a few hundred keV. At first this he discusses a variety of “conventional” models, namely bending is fitted with a thermal bremsstrahlung function, magnetic flares on various types of object and accretion that is, an exponential cutoff, but there is already one burst events in compact objects. He then presents a number of 23
MARTIN ELVIS* Quasars
To the naked eye the night sky is filled with stars. But outside output. They also shoot out enormous jets of fast particles, of a quite narrow slice of the whole electromagnetic spectrum travelling at 99.999% of the speed of light. These jets stay a radically different sky is seen. Over two huge frequency narrow over huge distances, easily as large as the distances ranges, each about 10 decades broad, utterly different objects between whole galaxies. Such unusual objects as quasars dominate the view. Both at the long, radio wavelengths, and should surely be found in unusual places, and indeed quasars at the high X-ray and -ray energies, the brightest things in inhabit the very centers (“nuclei”) of galaxies. We believe the sky are quasars. Only in the three decades of frequency that the origin of the strange properties of quasars is a black from the ultraviolet, through the visible band and into the hole with a huge mass, 106 to 109 times that of the Sun. infrared do stars dominate, and in the infrared much of the Here I give a personal view of quasars across these starlight is seen only indirectly, having been absorbed and 20 decades of spectrum, and across four decades of dis- re-emitted by dust around the stars. Three decades is much covery, of which I have only witnessed two myself. After broader than the octave-wide band to which our human eyes telling the history of quasar discovery, I set out seven key are sensitive but, compared with the 20 decades of frequency quasar mysteries, describe how space obsevations gradually opened up to astronomy during the space age, it is small. brought the whole quasar phenomenon into view, and then Quasars1 are truly space-age astrophysics. Of the two explain each mystery, so far as we now can. I end with a wide bands where quasars dominate, the radio sky can be few speculations on where quasar research will go next. explored quite well from the ground. To explore the high- energy sky though we have no choice but to use telescopes mounted on spacecraft. 1 A QUICK QUASAR HISTORY What are these objects that dominate the sky over most of the spectrum? In complete contrast to stars, quasars are Understanding the quasar puzzle has been a long-term extremely distant objects seen back to early cosmic times, program. There were hints of strange activity in the nuclei and are the most powerful (“luminous”) continuous2 sources of some galaxies as long ago as 1917 when Slipher found of radiation in the Universe. Moreover, the light emitted by extraordinarily broad emission lines coming from the cen- quasars, even though it reaches to the highest energy -rays ter of the galaxy NGC 1068, and the next year when Curtis we can measure, is not the whole of the quasar power found a jet pointing straight out of the center of the ellipti- cal galaxy M87 in Virgo. Quasar prehistory took another step, 25 years later, when Karl Seyfert’s 1943 PhD thesis * Harvard-Smithsonian Center for Astrophysics, Cambridge, MA, USA described a few galaxies with extremely rapid gas motions 1 For simplicity and clarity I will refer to all the types of non-stellar in their nuclei (as determined from the Doppler-shifted “activity” in galactic nuclei as “quasars,” even though the research literature carefully distinguishes high-luminosity “quasars” from lower width of their peculiar spectral emission lines; Slipher had luminosity “active galactic nuclei” (or AGNs). Evidence for any basic rejected this explanation). Just a few years later, using sur- physical difference between these types of active objects has diminished, plus World War II radar equipment, radio astronomy blos- essentially to the vanishing point. 2 For a few seconds or minutes a gamma-ray burst can outshine any somed, and for the first time found a sky not dominated by quasar. stars. In 1953 some of these new radio sources turned out to
Martin Elvis, The Century of Space Science, 529Ð548 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 530 MARTIN ELVIS come in pairs, either side of a galaxy, suggesting ejection find no strong arguments against it. The results for from the galaxy, but there was little hard evidence for this. 3C 273 and 3C 48 were published six weeks later in four The true history of quasars did not begin for another consecutive articles in Nature (Hazard, Mackay, and decade, until 1963. Hard-won precise positions of radio Shimmins 1963; Schmidt 1963; Oke 1963; Greenstein sources enabled the large optical telescopes of the day to take and Matthews 1963). spectra of whatever optical object lay at that position. Some showed the starlight of normal galaxies, but often at large So these things were not stars, they simply appeared distances (as measured by the redshifts of their spectral lines, stellar on photographs, and hence they were dubbed “quasi- using the Hubble relation between redshift and distance stellar objects” which quickly became “quasars” (Hazard that describes the expansion of the Universe) (Chapter 17). 1979). Astronomers were not prompt to accept the idea that The spectra of those with compact, “stellar”-looking, optical quasars were at cosmological distances. This was not objects, though, had a baffling series of strong, broad emis- because this distance seemed too great: after all, there was sion lines. It took three years before Maarten Schmidt real- already the example of the large redshift (z 0.46) of the ized that one of these “radio stars” was actually at a large radio galaxy 3C 295. The problem was that, if the redshifts redshift, and so Ð given the Hubble relation between redshift implied distances according to the normal Hubble law and distance Ð presumably lay at a large distance from us. (Chapter 17), then it implied a second power source must This in turn implied a prodigous luminosity. be at work in the Universe, beyond the nuclear burning at Schmidt (1990) relates how his key discovery was made: the centers of stars. That is because it is one thing is to find that a steady source is very distant and powerful, and quite The puzzle was suddenly resolved in the afternoon of another thing to find that a variable source is far away, February 5, 1963, while I was writing a brief article because then it must emit its power from a region smaller about the optical spectrum of 3C 273. Cyril Hazard had than the time it takes light to cross it. In the case of quasars written up the occultation results for publication in this means the power of a whole galaxy of stars comes from Nature and suggested that the optical observations be a region smaller than the Solar System! (r(Solar System) published in an adjacent article. While writing the manu- 50r(EarthÐSun) 50 AU 7.5 1014 cm.) Any such script, I took another look at the spectra. I noticed that source has to be more efficient than nuclear burning can be four of the six lines in the photographic spectra showed (see next section). And the first quasars found, 3C 273 and a pattern of decreasing strength and decreasing spacing 3C 48, indeed varied quite strongly. It was this variability from red to blue. For some reason, I decided to construct that had fixed the idea in astronomers’ minds that these an energy-level diagram based on these lines. sources, of stellar optical appearance, must be nearby stars. I must have made an error in the process which But the high redshifts made that position untenable. In the seemed to contradict the regular spacing pattern. 40 years since the first redshift of 0.16, quasars have contin- Slightly irritated by that, I decided to check the regular ued to be found out to larger and larger redshifts (Figure 1), spacing of the lines by taking the ratio of their wave- reaching 5.8 in 2000, implying they were already around lengths to that of the nearest line of the Balmer series. when the Universe was only 5% of its current age, or less The first ratio, that of the 5630 line to H-beta, was 1.16. than 1 billion years old. The second ratio was also 1.16. When the third ratio was 1.16 again, it was clear that I was looking at a Balmer spectrum redshifted by 0.16. I was stunned by this development: stars of magnitude 13 are not supposed to show large redshift! When I saw Jesse Greenstein minutes later in the hallway and told him what had happened, he produced a list of wavelengths of emission lines from a just completed manuscript about the spectrum of 3C 48. Being prepared to look for large red- shift, it took us only minutes to derive a redshift of 0.37. The interpretation of such large redshifts was an extra- ordinary challenge. Greenstein and I soon found that an explanation in terms of gravitational redshift was essen- tially impossible on the basis of spectroscopic arguments. We recognized that the alternative explanation in terms of Figure 1 The largest redshift of quasars as a function of time. cosmological redshifts, large distances, and enormous The current record is z 5.80. (From Fan et al. 2001.) For luminosities and energies was very speculative but could early data see Schneider et al. (1992) and Fan et al. (1999). QUASARS 531
2 SEVEN QUASAR MYSTERIES either long or short wavelengths is the feature that makes quasars dominate the sky over most of the spectrum. The huge research literature on quasars revolves around a 3. Quasars accelerate material to high velocities. The quite small number of central questions or mysteries. While widths of the emission lines reach up to 10% of the these are capable of infinite subdivision I believe there are speed of light (0.1c), although most are a smaller, but just seven basic mysteries. still considerable, 0.03c. Some quasars show absorp- It is remarkable that with the first quasar papers, already tion lines, and some others show X-ray emission lines, four of the seven quasar mysteries were laid out: implying velocities up to 0.2cÐ0.3c. Most extreme of all, to create the observed “superluminal” motions in blazars 1. Quasar luminosities are enormous, and arise from tiny requires 10, i.e., υ 0.999c (Chapter 24; Ghisellini regions. The power from quasars covers a wide range et al. 1993). Ways to produce the broad emission and ( 107) from object to object: from 1040 erg s 1 to absorption lines without using mass motions are con- 1047 erg s 1, and can compete with, and easily exceed, trived, but not impossible. Large velocities in astronomy that of a whole galaxy of 1011 stars ( 1044 erg s 1). imply deep gravitational wells (to accelerate matter, or When Mathews and Sandage (1963) found that quasars to produce a general relativistic gravitational redshift), could vary their power output by 40% in just a few except for short-lived explosive events like supernovae. months, then it seemed that all this luminosity must 4. Quasars have linear symmetry. As radio maps became come from a region similar in size to our Solar System. more detailed through the exploitation of interferomety (From the simple “light travel time argument”: a source (work which led to a Nobel prize for two pioneers, Martin cannot vary coherently in less time than a light signal Ryle and Anthony Hewish) most of the radio sources were takes to cross the source. So if a source varies in a day, it found to have a “double lobe” structure (Jennison and is no bigger than a light-day across (but beware of high Das Gupta 1953) extending well beyond the galaxies they velocities; Chapter 24).) Moreover the radio galaxies straddled. With the advent of the Very Large Array (VLA) required that quasars put out enormous power over a in the 1970s these lobes could be mapped in detail at high long time period since the total energy stored in the dynamic range revealing pairs of large (1022Ð1024 cm) giant radio lobes must be 1060 ergs (Burbidge 1959). highly structured bubbles (Figure 2). In many, but by no At this point it became a good bet that an energy source means all, cases the central galaxy had a nucleus with a quite different from, and much more efficient than, spectrum like that of a quasar. Forty years later the fine nuclear burning in stars was at work. detail of Hubble Space Telescope images showed that 2. Quasar spectra are nothing like that of starlight. Stars even the quasars that are not bright radio sources have a have more or less black-body spectra with well-defined linear symmetry. Clear cone-shaped structures were seen temperatures of a few thousand degrees, so their emission stretching over a galaxy scale (1022 cm). In high-redshift in the radio band is tiny. Instead quasars can have huge radio-loud quasars (primarily in “radio galaxies”) Hubble radio luminosities with spectra that follow a simple images showed galaxy-scale optical line emission aligned straight-line form in log (flux) vs. log (frequency), i.e., a with the radio jets. A big clue to the inner structure of “power law” from 100 MHz to 100 GHz. So being quasars has to lie in this symmetry. bright in the radio does not simply mean that they are very cold objects. Their optical spectra have no obvious From these four mysteries an outline of quasars was thermal peak either, but also have a power law shape from already becoming clear: a large mass in a small region sug- 1 m to 0.1 m (Figure 3). Confusingly at first this gested that the, then outlandish seeming, concept of a black meant that quasars were both “bluer” than stars (i.e., had hole might underlie quasar physics, while the symmetry sug- more short-wavelength emission than expected based on gested a spin axis, either of the black hole or of a rotating gas their visual brightness) and “redder” than stars (that is, the disk. A short paper by Donald Lynden-Bell (1969) put this same at long wavelengths)! The whole spectrum is not together with extraordinary elegance. He pointed out that just one power law though, since the radio and optical energy release from gravitational infall is more efficient than power laws do not join up. When X-ray spectra first nuclear burning for any mass large enough to produce quasar became available, they too had a power-law shape, and luminosities. He then outlined how to release this energy as again it did not connect with the optical or radio power radiation: any material falling onto a black hole from some laws (Figure 3). In this sense quasars have no tempera- large distance will have some angular momentum around the ture, and so were called “non-thermal” sources. That does black hole and so, unless the incoming matter is perfectly not mean though that no combination of thermal spherically symmetrical, the material will fall into a flattened processes underlies the production of the quasar contin- shape to form an orbiting disk. (This is common in astron- uum (see Section 5.2). This failure to drop off in power at omy, recall Saturn’s rings, or nascent planetary systems.) If 532 MARTIN ELVIS
Figure 2 The complex two-sided radio structure of Cygnus A as imaged by the VLA. The large features, or “lobes”, are fed by a jet of relativistic particles coming out of the central “core” (white dot), which lies at the nucleus of a large galaxy. (Perley et al. 1984; courtesy NRAO.) enough matter accumulates then some (undertermined) form to pick out from normal stars in large numbers just from of viscous friction between adjacent orbits will drag on the a pair of photographs taken through different color fil- faster moving material, moving angular momentum outward, ters. This “color selection” was used to find the quasar and allowing matter to fall inward, eventuall accreting into using the, initially quite uncertain, radio source posi- the central black hole. As the matter falls inward the gravita- tions. Hence quite a large area of sky was photographed tional potential energy is released as radiation. At each radius for each radio source. Alan Sandage (1965) first saw that the radiation will be more or less a black body, but since along with the radio-emitting blue quasar, there were each radius emits a different temperature (getting hotter many more blue “interlopers” that had nothing to do inward, as the potential gradient of the black hole steepens) with the radio source. Spectra showed that these too were so the summed emission from all radii looks nothing like a quasars, but ones which were “radio quiet.” Radio-quiet black body, in fact it can mimic a power law. quasars turn out to be 10 times more common than the These “accretion disks” have since been found in our original quasars, which are now called, with impeccable galaxy around many compact objects: white dwarfs, neutron logic, “radio-loud quasars.” The difference between radio- stars, and stellar-mass black holes (Chapter 32), beginning loud and radio-quiet quasars is found only in the radio with the discoveries of the Uhuru satellite (Giacconi and spectrum (and possibly the -ray spectrum). The rest of Gursky 1974). These smaller systems in which it was possi- the spectrum, from far-IR to hard X-rays is essentially ble to establish that a black hole was almost certainly pre- the same (Elvis et al. 1994). (A slight difference of sent made the idea of black holes and accretion disks more power-law slope in the X-rays being the only exception.) familiar and better understood, and they are now a conven- Yet the difference in the radio band is huge, a factor of tional part of astrophysics (Frank et al. 1985, 1992). Nikolai 100Ð104 in the ratio of radio to optical luminosity sep- Shakura and Rashid Sunyaev (1973) expanded on accretion arates the two classes. disk theory by cleverly hiding all the unknown physics of 6. Quasars are far less common now than they were in the viscosity in a parameter, , and working out the disk behav- distant past. This was termed quasar “evolution”.* The ior as a function of . It took only a few years of searching for fainter and * I find this an unfortunate choice. All we really mean is “change with fainter optical counterparts of radio sources up to higher time.” In biology evolution demands mutation, reproduction, and selection, and higher redshifts to discover two further mysteries: and is an active, unpredictable process that will produce different outcomes even with identical starting conditions. “Evolution” in astronomy is merely the playing out of physical processes, with no reproduction or competition 5. Radio-loud quasars are just the tip of the iceberg. The and with a predictable outcome. Chaotic systems are an exception but still blue “non-thermal” spectra of quasars made them easy involve no analogs to the biological process of evolution. 24
GABRIELE GHISELLINI* Blazars
Up to the seventeenth century, the unaided eye was the only hole gravity, it is compressed, heated, and then it radiates. receiver that humanity could use to observe the Universe. This was realized quite early (Salpeter 1964; Zeldovich Evolution was able to adapt this “instrument” to be sensitive 1964; Lynden-Bell 1969; Shakura and Sunjaev 1973). to the light of the star we happen to be orbiting, the Sun. Another major advance came with the opening of the The invention of the telescope amplified the sensitivity of X-ray window, first (in the 1960s) with rocket experiments the eye and its angular resolution, letting humanity discover, pioneered by Riccardo Giacconi, Bruno Rossi, and others, less than a century ago, that other galaxies exist, far beyond and then with the first X-ray satellites in the early 1970s. The the Milky way, and that these galaxies are moving apart: the Uhuru, Ariel 5, HEAO 1 and then Einstein satellites made Universe expands. However, all we could discover using the clear that all kinds of quasars were strong X-ray emitters: at eye and its extension, the optical telescope, regarded a tiny, the same time, it started to be believed that quasars were per- very tiny, part of the entire electromagnetic spectrum. As haps the major contributors to the cosmic diffuse X-ray back- soon as technology enabled us to open new windows, we ground, already discovered in 1962 (Giacconi et al. 1962). discovered other phenomena, other objects, and could push A third qualitative “jump” was the improvements of the our knowledge farther out in space and time. interferometric technique in the radio band, again in the The opening of the radio window made the 1960s the early 1970s. Radio telescopes in different continents, look- golden decade for astronomy, with the discovery of the ing at the same source, can resolve details as close as a few microwave background and of pulsars. The third great dis- tenths of a millisecond of arc. This enabled us to discover covery made in that decade was that of quasars (Chapter 23). that some radio-emitting quasars have spots of radio emis- The term “quasar” originally stood for “quasi-stellar radio sion which are observed to move. Sometimes this motion source.” In fact when an optical telescope is pointed towards corresponds to velocities that exceed the speed of light. the direction of these radio sources, which can be as Far from challenging special relativity, this “superluminal” extended as minutes of arc in radio maps, the resulting motion, as it is now called, was even predicted before it was optical plate shows a source which looks like a star, that observed, by Martin Rees in 1966, and corresponds to real is, an unextended, or “pointlike” object. This apparently fast motion (but slower than the velocity of light!) at an innocuous point is instead a gigantic energy plant, able to angle close to our line of sight. produce much more power than an entire galaxy like our The fourth advance came with the Compton Gamma own, in a volume which is extremely small, if compared Ray Observatory (CGRO, launched in 1991) which discov- with the dimension of the Galaxy, and comparable with our ered that the subclass of quasars called blazars are very Solar System. The process that powers the stars, thermonu- strong -ray emitters, producing at energies greater than clear reactions, is not efficient enough to power quasars. We 100 MeV more power than in the rest of their electromag- believe that at the core of these sources there is a massive netic spectrum. The properties of blazars are very extreme, black hole, with a mass between a million and a billion that and we believe that this is partly due to special relativistic of the Sun. Matter around the hole is attracted by the black effects because their emitting plasma is moving with a bulk motion at large (υ 0.99c) speeds. Blazars and gamma-ray * Osservatorio Astronomico di Brera, Merate, Italy bursts are indeed the realm of special relativity: effects that
Gabriele Ghisellini, The Century of Space Science, 549Ð560 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 550 GABRIELE GHISELLINI were of academic interest up to 10 years ago suddenly quasars). At this time (the 1980s) the LPQs (low polariza- became the keys to understand their behavior. tion quasars) were left out, according to the belief that the The discovery of quasars and blazars literally opened up polarization characteristics were peculiar and important. the scale at which we can see the Universe, since they are But this belief became progressively weaker, up to the so powerful that they can be easily seen up to great dis- discovery of the (often dominant) -ray emission in both tances. For 30 years quasars were the most distant known blazars and LPQs. Then LPQs joined the class of blazars in objects in the Universe. And studying these sources we the 1990s. Linear polarization in the radio band, when study the effects of both general and special relativity, that observed, is an important tool to diagnose the absence of is, how space and time change as measured in different any cold electron component: if present, these electrons frames. could depolarize the flux due to Faraday rotation of the polarization angle (Wardle 1977). This argument does not apply in the case of electron and positron pairs (they make the polarization angle rotate in opposite directions, and the BL LACERTAE OBJECTS AND BLAZARS depolarization effect cancels out). All blazars so far can be classified as radio-loud objects, The “strange star” BL Lacertae but the level of radio emission, with respect to the optical, The star BL in the constellation of Lacertae had been varies widely. In fact objects selected through X-ray surveys known for many years from its optical variability. When the (Extended Medium Sensitivity Survey (EMSS) (Wolter radio source VRO 42.22.01 was associated with it, people et al. 1991), the Einstein SLEW survey (Perlman et al. thought they had discovered the first radio star. But unlike 1996), and the Rosat All Sky Survey (RASS)) can be con- other stars, its optical spectrum was completely featureless sidered as radio weak, since they have a ratio between the and this puzzled astronomers. Lots of other things were radio flux and the X-ray or optical flux much smaller than peculiar about BL Lacertae. The shape of its spectrum did the blazars found in radio surveys. not look thermal, but it was a power law (that is, F( ) To classify a source as a blazar is not an easy task. ). In addition, the optical light was highly polarized, There is, however, a useful theoretical definition: There are and a high degree of polarization was taken as the signature sources characterized by a relativistic jet producing non- of synchrotron emission, produced by ultrarelativistic elec- thermal radiation. This radiation is Doppler boosted along trons spiraling along magnetic field lines. A lot of observ- the velocity direction, coincident with the jet axis. We call ing time was invested in this source to obtain deeper blazar a source whose non-thermal radiation dominates images and spectra. In the end weak absorption lines were over the isotropic components (for example, thermal emis- observed, from which a redshift of z 0.07 was obtained sion from the accretion disk and nonthermal radio emission (Oke and Gunn 1974; Miller and Hawley 1977). This estab- from extended regions). lished that BL Lacertae was not a star after all, but a type of Observationally, we have the following properties of quasar. blazars: After the discovery of this prototype, other “stars” were found with similar characteristics, which were named “BL ¥ Rapid variability Ð The flux of all blazars varies violently, Lacertae objects,” or “Lacertids” (or “BL Lacs” for short). with large amplitudes in short times. Variations of a factor At the same time, it was realized that other radio sources, of two in hours are common at X- and -ray energies, showing the broad emission lines of typical quasars, were slightly smaller variations are seen in the optical in a also extremely variable in the optical and were named single night. Some objects have varied by more than two OVVs (optically violent variables) because of this. Their orders of magnitude over a few years. There are many optical spectrum does not generally show the flattening types of variability: there are objects whose flux varies towards the UV indicating the presence of the thermal com- continuously at all frequencies, without any quiescent ponent due to the blue bump (the thermal emission from phase. There are other objects undergoing violent outburst the accretion disk) and is therefore non-thermal in origin. phases from normally quiescent states. Therefore, if an Another class of radio-loud quasars, the HPQs (high polar- object is not continuously monitored for a long time, it ization quasars) showed polarization at a level greater than may be difficult to detect its violent variability. 3%, and it was finally realized that HPQs were also opti- ¥ Emission at all wavelengths Ð Blazars are active emitters cally variable, that is, OVVs and HPQs were the same class. all across the electromagnetic spectrum, from radio to It was then found that both BL Lacertae objects and HPQs -rays. If we plot the quantity F versus their spectrum have a flat radio spectrum, at least at high frequencies, is characterized by two very broad humps. The peak above 1 GHz. These similarities led to the definition of the energies are located in the IRÐsoft X-ray band and in the general class of blazars (contraction of BL Lacs and radio MeVÐGeV (and even TeV) bands, respectively. BLAZARS 551
¥ Non-thermal continuum Ð Most of the emission (and of the source light up at different times, giving us the (false) power) has a power-law shape in restricted energy ranges impression of motion. But observations made it clear that (a mere decade or so), with weak or absent blue bump. what was observed was always a motion from the center to ¥ Flat radio spectrum Ð All known blazars, above 1 GHz, the outer parts of the source, while the “Christmas tree” have a flat radio spectrum dominated by the core of the jet. model predicts also contraction “motions.” Furthermore, if the source is moving relativistically, then its radiation is beamed in the direction of motion, as described below, and JETS AND FAST MOTIONS we indeed have strong evidence for such beaming.
We are now used to seeing beautiful radio maps which can Superluminal motion “zoom” in a radio source from the megaparsec to the parsec scale, showing a jet which remains collimated at all scales. According to the now accepted explanation of superluminal υ Often there are lobes and hot-spots of more intense radio motion, the apparent speed app c app of the radio blobs emission at the ends of the jets. There the energy accumu- which are observed to move is due to a projection effect. lates through million of years, and only a small part of it is Once the constancy of the speed of light is accepted, super- radiated away, the rest is used to make the lobe grow and luminal motion can be explained by simple geometry, with advance in the intergalactic medium. no further involvment of special relativity. In the beginning, it was not clear at all how the energy The key point is that the source, while emitting photons, could be transported from the very small center of radio is moving towards the direction of the observer. Suppose galaxies to regions million of light years away. Then Martin we take two maps of the same radio source, separated by a Rees, in his PhD thesis (1966), made the suggestion that time interval tobs. When the source emits the photons of this job could be done by matter moving relativistically in a the first map, it is located at the position A (Figure 2). jet. The spectacular prediction he made, out of his idea, was After a time tem, the source is located at B. The segment superluminal motion: blobs of radio-emitting matter appar- AB c tem c tem, and the distance of the two light ently moving at a velocity greater than the light speed. fronts in the direction of the line of sight is HC Five years later, with the completion of the first Very c tem c tem cos c tem(1 cos ). This translates Long Baseline Interferometry (VLBI), superluminal motion to a time interval tobs tem(1 cos ). The two was indeed observed in the blazar 3C 279 (Figure 1; maps show the “blob” in two different positions, whose Whitney et al. 1971). projected distance, HB, is c tem sin . We then obtain Soon after the discovery of the superluminal motion in app sin /(1 cos ). For small viewing angles υ 3C 279, 3C 273, and other sources, alternative theories (a few degrees) and for relativistic speeds ( c), app can were challenging the idea of Rees. One of the most debated be greater than c. was the so called “Christmas tree” theory: different regions Definition and importance of beaming It is believed that in blazar jets the plasma is flowing with a bulk speed c close to the speed of light. In terms of the
Figure 2 Explanation of the observed superluminal motion. When in A, a “blob” emits a photon. After a time tem, the Figure 1 The spectrum of 3C 279 at various epochs. Note source is in B and emits a second photon. The two photons the extremely variable flux at all energies, particularly in the are separated by HC c tem(1 cos ) and arrive at CGRO band (0.1–10 GeV). (Adapted from Wehrle et al. 1998 the observer separated by a time interval tobs tem and Maraschi et al. 1998.) (1 cos ). 552 GABRIELE GHISELLINI
corresponding Lorentz factor, (1 2) 1/2, we derive produce the radio emission we see. Knowing directly the values between 5 and 20. Therefore it is essential, in study- size by VLBI, one derives the particle and the photon densi- ing blazars, to take into account relativistic effects, the most ties. Therefore one can estimate the probability of interac- important of which is called the beaming effect. This is the tions between synchrotron photons and relativistic electrons, sum of three effects: predicting the flux at high energies, especially in the X-ray band. These estimates often turn out to be orders of magni- (1) Assume that in the frame co-moving with the plasma, tude greater than observed. If we now assume that instead the radiation is emitted isotropically. Then half of the source is moving towards us at relativistic speeds, the the photons are emitted in the forward directions, half fluxes we receive are enhanced, therefore less electrons are in the backward directions. An observer, however, sees needed to produce this emission, and the calculated photon the plasma moving and so will see that half of the density at the source is lower. The interaction probability photons are contained in a cone of semi-aperture angle is less than before. The estimate of the X-ray flux then equal to 1/ . This is due to the aberration of light. depends on the Doppler factor: by matching the predicted (2) Since the source of radiation is moving, we must take X-ray flux with the observed flux we can put an upper limit into account the change of frequency of the observed to the amount of beaming. It is an upper limit because the photons (Doppler effect). radio region we are observing at the VLBI may not be the (3) The rate of arrival of photons is not equal to the rate of only contributor to the X-ray flux. emission, as measured in the co-moving frame (if we run towards a source, we will collect more photons per second than if we do not run). Radiation processes These three effects also depend on the angle between In tenuous plasmas interactions between particles (which our line of sight and the velocity vector of the plasma. They depend on their density) are not efficient in sharing the conspire to make the emitting source much brighter if we energy between the particles. If the injection of energy is see it at small viewing angles (photons are concentrated in due to the acceleration of few ultrarelativistic particles, then this small solid angle, they are seen blueshifted, with an the particle distribution has no chance to become thermal enhanced rate of arrival), and much dimmer at large view- (that is, described by a Maxwellian distribution). In rarefied ing angles. Therefore the (frequency integrated) intensity I relativistic plasmas the most efficient ways to produce of a moving source depends both on its velocity and the radiation are the synchrotron and the inverse Compton viewing angle: I 4I where I is the intensity seen by a processes. The first depends on the strength of the magnetic co-moving observer, and 1/[ (1 cos )] is the so field, the second on the density of the seed photons to be called Doppler boosting factor. We can see that the intensity scattered. When these seed photons are produced by the can be enhanced by orders of magnitudes when is small synchrotron process the resulting mechanism is called the (a few degrees) and c. synchrotron self-Compton process. For 20 years this was Note that relativistic motion greatly affects the variabil- thought to be the main mechanism responsible for the ity timescales we measure. Suppose that in the co-moving observed radiation, from the radio to the X-ray band, until frame the source flares in a time t . According to special the high-energy -ray emission was discovered (e.g. relativity, this time would be seen, at Earth, longer by a fac- Ginzburg and Syrovatskii 1965; Rees 1967; Blumenthal and tor . But this is true only if we do not use photons to mea- Gould 1970; and Jones et al. 1974a,b). The origin of the sure this time! If instead we do, then we must take into high-energy -ray emission is still controversial, since the account that the source has emitted the first photons when it contribution of radiation produced externally to the jet is was located in a certain region, and the last photons when enhanced by the bulk motion (see below) and can overtake it was located in another position. Therefore the two the locally produced synchrotron radiation as a producer of bunches of photons traverse paths of different length to seed photons to be scattered (Maraschi et al. 1992; Dermer reach us. This is the Doppler effect, and contributes a factor and Schlickeiser 1993 Sikora et al. 1994; Blandford and 1 cos . The two effects combine to give t t Levinson 1995; Ghisellini and Madau 1996). It is also fair to (1 cos ) t / : for small viewing angles the mention that even the inverse Compton nature of the -ray observed time is shorter, not longer! emission is controversial: as Mannheim (1993) suggested, The enhancement of the observed radiation by beaming it could be due to the synchrotron process, due to an addi- can be tested through the following argument. If one assume tional population of ultrarelativistic electrons as a result of that the radio emission, as mapped by VLBI techniques, is photonÐmeson interactions. Independent of its nature, the produced by the synchrotron process in a stationary source, high-energy emission (in the X- and -ray bands) firmly then one can calculate how many electrons are needed to established the need for relativistic bulk motion. 25
GIUSEPPINA FABBIANO* AND MARTIN F. KESSLER** X-ray and infrared properties of normal galaxies
WHY GALAXIES FROM SPACE population peaks. The importance of widening the observing OBSERVATORIES? window on galaxies is made clear by ground-based radio and near- to mid-IR observations, which have provided a Galaxies are key components of the Universe and have been way of penetrating deeper into dusty star-forming regions; studied in astrophysics for several different and important have allowed the study of the cold ISM and thus the discov- reasons. Galaxies trace the distribution of matter and can be ery of extended, dark, massive halos in spiral galaxies; and used as standard candles to derive information on the expan- have revealed the properties of both galaxian magnetic fields sion and large-scale structure of the Universe. From these and highly energetic particles. results constraints can be set on both cosmology and cos- With space observatories, the observing window on galax- mogony scenarios. Galaxies evolve: they form, their stellar ies has been opened even further, and a whole new set of population evolves, they interact and merge. Understanding phenomena has come into view. The waveband coverage has the forces governing galaxy formation and evolution is para- been extended to the UV (International Ultraviolet Explorer, mount for our understanding of the properties of galaxies IUE; Hubble Space Telescope, HST; Extreme UltraViolet and their components. Galaxies are where primordial matter Explorer, EUVE), mid- and far-IR (Infrared Astronomical gives birth to stars, and where stars evolve and form Satellite, IRAS; Infrared Space Observatory, ISO), and X- elements, enriching the interstellar medium (ISM) from rays (Einstein; Roentgen Satellite, Rosat; Advanced Satellite whence the next stellar generations will be born. Therefore for Cosmology and Astrophysics, ASCA; BeppoSAX; our understanding of stellar evolution and chemical evolu- Chandra; XMM-Newton). Each of these wavebands has tion is intimately connected with our understanding of given astronomers the means to explore new aspects of galaxies and their evolution. galaxies, including states of extreme star formation and Most of the work on galaxies Ð be it related to cosmol- nuclear activity, as well as the different phases of the ISM ogy, galaxy formation and evolution, or present-day galaxy (cold, warm, hot, gaseous, dusty) and the latest stages of stel- properties Ð has been supported by ground-based, mostly lar evolution. In this chapter we concentrate on the impact of optical observations. While a lot has been learned this way, X-ray and IR space observatories. This is not a complete the optical window is only a small part of the entire emission review of all the work that has been done in these areas, spectrum of galaxies, and the information it provides is but is meant to give a general feel for these fields as they mostly limited to the normal stellar component, whose emis- are today. sion peaks in the optical wavelengths. Moreover, optical light is significantly absorbed by ambient dust, especially in the blue, where the emission of the massive younger stellar A BRIEF HISTORY OF X-RAY OBSERVATIONS OF GALAXIES
* Harvard-Smithsonian Center for Astrophysics, Cambridge, MA, USA ** ESA Ð European Space Research and Technology Centre, Noordwijk, The first X-ray satellite, Uhuru, was launched from Kenya The Netherlands in December 1970. It was sensitive to photons in the
Giuseppina Fabbiano and Martin F. Kessler, The Century of Space Science, 561Ð578 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 562 GIUSEPPINA FABBIANO AND MARTIN F. KESSLER
2Ð10 keV energy range. Uhuru is not an acronym; it means 39 “freedom” in Swahili, and was so named because the day of the launch was the National Holiday of Kenya. Uhuru had 38 been conceived and built by a team of physicists-turned- UHURU M31 10 sqm astronomers, led by Riccardo Giacconi at AS&E in XRB 37
Cambridge, MA. Uhuru led to the discovery of intense Chandra (0.1 sqm) x X-ray emission from compact stellar remnants (neutron 36 XMM-Newton (0.5 sqm) t~10 4 s
stars, black holes, white dwarfs) in binary systems in the SNR
Log L (ergs/s) N>100 Milky Way, as well as from active galactic nuclei (AGNs; 35 Einstein / ROSAT / ASCA (0.01-0.02 sqm) see Chapter 23), and hot plasmas in clusters of galaxies. X-ray emission from sources in the Magellanic Clouds and 34 Virgo CVO* from M31 was detected with Uhuru, but other normal galaxies escaped detection because their fluxes were below 33 2 3 the Uhuru source detection threshold (see Giacconi and 0.1 1 10 10 10 Gursky (1974) for a review of Uhuru results). Distance (Mpc) To be able to detect and study in X-rays a significant number of normal galaxies, we had to wait for the first true Figure 1 The sensitivity of different X-ray satellites to the detection of 100 counts in 10,000 s observations from typical X-ray telescope, the Einstein Observatory, also developed by galactic sources at different distances. For imaging telescopes a team led by Giacconi, including groups at the Smithsonian the collecting area is indicated. Note, however, that unless Astrophysical Observatory, Massachusetts Institute of the telescope has good enough angular resolution (Figure 3) Technology, Goddard Space Flight Center, and Columbia the sources are likely to be confused, and therefore their University. Einstein was launched by NASA in 1978 individual contributions cannot be detected. The y-axis gives (Giacconi et al. 1979). With the Einstein grazing-incidence X-ray luminosities as well as typical X-ray emitters in given mirrors, X-rays from celestial objects could be focused onto luminosity ranges, from luminous O stars to cataclysmic position-sensitive focal plane detectors. Focusing optics variables (CV), supernova remnants (SNRs) and X-ray binaries resulted in two very significant advantages: the relatively (XRBs). The x-axis is a distance scale. Two benchmark small beam (~5 to ~45 , depending on the Einstein focal distances, to M31 and the Virgo Cluster of galaxies, are plane instrument) meant that the background noise in the marked by vertical stripes. detection area was much smaller than in any of the previous non-imaging X-ray satellites, resulting in the detection of Virgo z=0.5 1 thousand-fold fainter sources (Figures 1 and 2); and the imaging capability for the first time let us see clearly the 43 t~10 4 s morphology of the X-ray sources (Figure 3). These capabil- N>100 ities were crucial for opening up the X-ray window on galaxies. Normal galaxies feature in two papers in the first 42 UHURU ever set of Einstein publications. The survey of M31, the
Andromeda Galaxy, the large spiral galaxy closest to us, x 41 revealed a population of bright point-like sources and gave us for the first time a picture of what our own Milky Way Log L (ergs/s) might look like in X-rays to an external observer (Van 40 Einstein / ROSAT / ASCA Speybroeck et al. 1979). The peculiar asymmetric extended emission of M86 in the Virgo Cluster suggested the pres- ChandraXMM-Newton ence of a hot gasous halo interacting with the hot cluster 39 10 sqm gaseous medium, which had been discovered with Uhuru 10 102 10 3 10 4 (Formal et al. 1979). The foundations of what we now know about the X-ray emission of galaxies were estab- Distance (Mpc) lished with Einstein observations (Fabbiano 1989). Figure 2 The sensitivity of different X-ray satellites to the Subsequent X-ray satellites, the GermanyÐUSA Rosat, the detection of global emission from galaxies. Galaxy luminosi- JapanÐUSA ASCA, and the ItalyÐNetherlands BeppoSAX, ties are mostly in the 1039–1041 erg s 1 range, but some E and have further expanded our knowledge of the X-ray proper- S0 galaxies can be as luminous as 1042–1043 erg s 1 (Fabbiano ties of galaxies and widened the observable X-ray window 1989). Note that a significant advance in collecting area in several ways. Rosat was more sensitive at lower energies (to 10 m2) will be needed to begin studying normal galaxies at than Einstein and had slightly better angular resolution. cosmologically relevant distances. X-RAY AND INFRARED PROPERTIES OF NORMAL GALAXIES 563
M31 Virgo z=0.5 inferior to those of Einstein and Rosat, and as a result detailed studies of individual galaxies cannot be pursued in 104 most cases. We are now at a very exciting time for X-ray astronomy. In July 1999, NASA successfully launched Chandra, the 103 third Great Observatory, following the Hubble Space Telescope and the Compton Gamma Ray Observatory. The European Space Agency (ESA) successfully followed suit 102 with XMM-Newton in December of the same year. Chandra Chandra (0".5) and XMM-Newton represent a complementary quantum leap XMM-NewtonEinstein /(15") ROSAT (5") in sensitivity, imaging capabilities, and spectral resolution Linear size (pc) 10 ASCA (~2') compared with the previous generation of X-ray observato- ries, and both are going to deepen and perhaps revolution- ize the way we look at galaxies in X-rays. 1 0.11 10101023 X-RAY PROPERTIES OF GALAXIES Distance (Mpc) Extrapolating from the results of Uhuru and subsequent Figure 3 Linear sizes (in parsecs) equivalent to the angular resolution of X-ray telescopes as a function of the galaxy dis- non-imaging X-ray missions (the UK Ariel 5; the tance. While Chandra’s sub-arcsecond resolution is needed to USÐNetherlands Astronomical Netherlands Satellite, ANS; resolve dense stellar regions in M31 (e.g., near the nucleus), and the US SAS 3 and HEAO 1), one could conclude that Einstein, Rosat, and XMM-Newton can resolve and study most not much excitement lay ahead in the X-ray observations of bright X-ray sources. At the distance of the Virgo Cluster, galaxies. This was indeed the feeling of a good number of Chandra achieves a linear resolution comparable to that of scientists in the Einstein team during the initial discussions Einstein and Rosat for M31. The individual source detection of time allocation to different observing programs. It was sensitivity of previous missions (Einstein and Rosat), as well as thought that normal galaxies would emit because of their that of XMM-Newton, is significantly impaired by confusion population of galactic sources, as happens in the Milky within the parent galaxy at the Virgo distance. The angular res- Way, and that the X-ray emission would be totally domi- olution of ASCA is significantly inferior to that of the other nated by the nuclear source in the case of AGNs. While this imaging X-ray missions. Note that 104 pc – ASCA’s linear reso- lution at Virgo – is a typical galaxy size. is true, it is only a small part of the story. Twenty years ago, when X-ray astronomers first pointed Einstein at normal galaxies, they were engaged in a very successful fishing These characteristics have led to a better understanding of expedition. The Einstein survey of normal galaxies led to the spatial morphology of the X-ray emission of galaxies, the discovery of hot ISM in E and S0 galaxies and of hot and to the detection and study of the soft X-ray-emitting gaseous outflows from starburst nuclei, and demonstrated ISM of spiral galaxies. ASCA, was the first X-ray observa- how X-ray emission is not an isolated phenomenon to be tory equipped with CCD detectors. These detectors have studied by X-ray astronomers, but is instead closely con- spectral resolution 5 to 10 times superior than previous nected to the global emission properties of galaxies (see instruments, and are able to reveal X-ray emission lines reviews by Fabbiano (1989, 1996a) and the Einstein catalog from hot plasmas. Thus for the first time a study of the and atlas of galaxies (Fabbiano et al. 1992)). metal composition of the hot ISM of galaxies could be seri- Galaxies are key objects in the study of cosmology, the life ously attempted. ASCA (and BeppoSAX) also extends the cycle of matter, and stellar evolution. X-ray observations have observing windows to energies of 10 keV, significantly given us a new key band for understanding these building higher than the 2Ð4 keV upper reaches of Einstein and blocks of the Universe, with implications ranging from the the Rosat. This allowed the study of different spectral compo- study of extreme physical situations, such as can be found in nents of the X-ray emission, peaking in different energy the proximity of black holes, or near the surface of neutron ranges. For example, a softer component from a hot ISM stars; to the interaction of galaxies and their environment; and a harder component possibly from a population of to the measure of parameters of fundamental cosmological X-ray binaries can be detected in the ASCA spectra of importance. In this chapter we discuss the unique contribu- elliptical galaxies (Matsushita et al. 1994). However, tions to our understanding of the Universe that are provided ASCA’s angular resolution (Figure 3) is ~2 , significantly by X-ray observations of galaxies. 564 GIUSEPPINA FABBIANO AND MARTIN F. KESSLER
X-ray binaries and supernova remnants telescope (~10 m2) with Chandra-like resolution will be A review of the Einstein and first Rosat results on discrete needed to sample at significant depths the luminosity func- X-ray sources in external galaxies can be found in Fabbiano tion of discrete X-ray sources in galaxies at the distance of (1995a). Although the sample of detected sources has the Virgo Cluster and beyond (Figures 1 and 3): The poten- increased since, the basic conclusions are still valid. X-ray tial of these X-ray population studies is great, with implica- observations of the Milky Way and the Magellanic Clouds tions for the study of the most massive portion of the stellar demonstrate that the majority of bright, individual X-ray mass distribution, that can be detected through remnants in sources are binary systems containing one of the compact luminous X-ray binaries, and for a systematic study of the remnants of stellar evolution: white dwarfes, neutron stars, or interplay of SNRs with the ISM in different galaxies. black holes. The X-ray emission is the result of gravitational Super-Eddington sources: Black hole accretion of the atmosphere of the companion star onto this hunting grounds compact object. Thus, the discrete X-ray source population of galaxies gives us a direct view of the end stages of stellar A surprising and interesting result of imaging observations of evolution, and of the higher-mass component of the stellar nearby galaxies has been the large number of sources population. While the study of individual sources is best detected with luminosities well above the Eddington restricted to the Milky Way, because of the much higher luminosity for solar-mass (1ML) accreting objects fluxes, external galaxies present unique advantages in the (~1038 erg s 1). The Eddington luminosity is the luminosity study of the population properties of X-ray sources. All at which the gravitational pressure of the accreting material sources within a given galaxy are at the same distance from is balanced by the radiation pressure, and it represents a nat- us, so the study of luminosity distributions is much less ural limit on the power that can be radiated by an accreting prone to error than with X-ray sources in our Galaxy, where object. Given their luminosities, which exceed those of the distance uncertainties are large. If fairly face-on galax- Milky Way sources and sources in M31, these very lumi- ies are targeted, the effect of differences in interstellar nous sources cannot be steadily accreting neutron star bina- extinction will also be much smaller than in the Milky Way ries. They can be divided into a few categories: nuclear sample, where directional effects due to the position of the sources; X-ray counterparts of young SNRs; and other Solar System in the galactic plane affect the measurements. unidentified objects. With the exception of the SNR counter- For the same reasons, systematic studies of properties of parts, and of sources that future higher-resolution observa- supernova remnants (SNRs), that can be used to explore their tions may reveal as extended complex emission regions, evolution, are best done with external galaxies. Most SNRs these sources may host intermediate-mass to massive black are spectacular, extended, expanding shells of hot plasmas holes (Fabbiano 1995a, 1998, Makishima et al. 2000). and energetic particles, created by the shock waves of the X-ray emission from massive nuclear black holes in supernova explosion and the interaction of the stellar debris nearby galaxies gives us a probe into the accretion mecha- with the surrounding ISM. An example of a study of a well- nisms and has implications for the evolution of AGNs defined sample of SNRs, for which the distance is known (or (see Chapter 23). Relatively low-activity AGN and LINER all at the same distance within a galaxy), is the diameterÐ (low-ionization nuclear emission line region) nuclei have luminosity relation, which can probe both the age of the been detected as luminous X-ray sources (in the range remnant and the host ISM (e.g., Long and Helfand 1979). ~1039 1041 erg s 1) (Fabbiano 1996b). Although these lumi- Because high-quality imaging is needed for these stud- nosities would be in the super-Eddington regime for a normal ies, Einstein and Rosat have been the two observatories that X-ray binary, for nuclear sources due to accretion onto mas- have contributed most to this field. Observations of nearby sive nuclear back holes, the accretion must be very inefficient. spiral galaxies have revealed variable sources with charac- This is certainly the case for the nuclear source in M104, teristics similar to those of Milky Way sources. The lumi- the Sombrero Galaxy (Fabbiano and Juda 1997), where the 8 nosity distributions appear to vary in different galaxies. dynamics of the nuclear area point to a 10 ML black hole A notable but not unique case is that of M81, where the (Kormendy and Richstone 1995). Given this nuclear mass, X-ray sources appear brighter than in M31 and the Milky the X-ray luminosity of the nucleus of the Sombrero Galaxy Way (Fabbiano 1995a). is ~4 10 7 that of the Eddington luminosity. While these studies open exciting new avenues, much In M33, a bright and variable nuclear source is also more sensitivity and resolving power than has been available present (e.g., Peres et al. 1989), but its nature is more mys- so far are required to extend them beyond exploratory exer- terious. It is possible that it may be a super-Eddington cises. Chandra is already making a difference, as can be accretion binary (Takano et al. 1994). seen from the images released to the public, but very long Some at least of the non-nuclear super-Eddington exposures will be needed in most cases. A large-area sources may be accretion binaries. Their luminosities 26
STEVEN L. SNOWDEN* The hot part of the interstellar medium
In the preparation of this chapter there arose issues of cooler ISM of the disk which expel hot plasma into the halo semantics. Specifically, what does the “hot part” mean in (in this chapter the “halo” of the Galaxy is defined as the reference to the interstellar medium (ISM), and just what region of space above the neutral material of the disk, and part of the hot ISM should be considered? As the original includes both ionized and neutral gas associated with the task for this chapter referred to the X-ray and EUV Galaxy and affected by its gravitational potential; this is a (extreme ultraviolet), appropriate temperatures are in the fairly loose definition which extends from a couple of hun- several 105 K to several 106 K range. In a Universe where dred parsecs to tens of kiloparsecs, or even farther, above the fundamental temperature is currently about 3 K and in a the plane). For the purposes of this chapter, a partially soci- Galaxy where most of the material in the ISM has tempera- ological definition of the hot part of the interstellar medium tures less than 104 K, any component at 106 K must have will be used: that part of the ISM which is (1) observed in been produced by fairly extreme and unusually spectacular the 0.05 to 1.0 keV band as the soft X-ray diffuse back- methods. The most common of these methods in the gen- ground (SXRB) and EUV background, (2) associated with eral galactic disk are supernovae, occasionally aided and the Milky Way disk or halo, and (3) not claimed by others abetted by the stellar winds of massive stars. However, as for studies of supernova remnants or stellar wind-blown supernova remnants (SNRs) are ably covered elsewhere in bubbles. To simplify the presentation and to acknowledge this text (see Chapter 41), the topic for this chapter required that at least the observed part originates in general from further refinement. What happens to a SNR when it slips some of the same plasmas, the EUV will be included with 1 past middle age into its dotage? All of the visual features 4 keV X-rays, despite the historical separation by NASA of that allow a SNR to stand out clearly from the background the astrophysics and astronomy programs at 0.1 keV. fade: the shock fronts (the interface region between the Furthermore, by and large this chapter will focus on the 1 expanding SNR and the undisturbed ISM) with occasion- 4 keV SXRB because it is the dominant galactic back- ally spectacular radio, optical, and X-ray emission, and the ground and, at least locally, it is the best understood. hot and relatively dense interior with bright X-ray emission. Besides, SNRs that are still emitting strongly at energies 1 As the SNR fades it also increases in size to become a more greater than 4 keV typically still stand out clearly against significant, or at least a more extensive part of the ISM. In the general background and fail consideration (3) above. essence it becomes old and boring, and fades into the Despite the observational difficulties in the study of has- woodwork; a pale ghost of its former self. been SNRs and their lack of spectacular features (e.g., the Thus the study of the hot ISM, at least in the galactic wealth of structure revealed by the initial Chandra observa- disk, is essentially a study of what happens to SNRs that tions of the more youthful Cas A and Crab Nebula SNRs), a have lost their identity, though there are exceptions. few true believers have pursued the study of the SXRB over Although the study of hot plasmas in the halo of the galaxy the last 30 years with some diligence. Like X-ray astron- may have cosmological considerations, there are ties to the omy in general, it is a field that has truly required the disk through the galactic gravitational potential and through “Century of Space Science,” as photons with energies of “chimneys,” breakouts of SNRs through the confining 0.05 to 1.0 keV travel at most millimeters in Earth’s atmos- phere at sea level. The detectors must therefore be placed * NASA Ð Goddard Space Flight Center, Greenbelt, MD, USA above the atmosphere at altitudes of 200 km. The SXRB
Steven L. Snowden, The Century of Space Science, 581Ð605 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 582 STEVEN L. SNOWDEN
required the advent of astronomical research sounding rockets in the late 1950s and early 1960s to provide the facility for its observation. Over three decades of study, the observations of the SXRB have progressed from limited regions of the sky provided by single sounding-rocket flights, to all-sky mosaics provided by multiple flights, to all-sky surveys provided by satellite-borne observatories. 1 The initial discovery of the 4 keV SXRB in the late 1960s (Bowyer et al. 1968, Henry et al. 1968, Bunner et al. 1969) was interpreted in the context of an extragalactic background. This interpretation was an obvious one sug- gested by the general negative correlation between the sur- face brightness of the SXRB and the column density of galactic neutral hydrogen (H I)*, as well as the expectation that there could possibly be extensive X-ray emission from intergalactic space as the low-energy extension of the extra- galactic background above 2 keV first observed by Giacconi et al. (1962). The negative correlation observed at 1 keV was assumed to be due to the absorption of an Figure 1 Average mean free path of EUV and soft X-ray pho- 4 tons in the Milky Way as a function of energy, along with typi- isotropic X-ray flux of distant (extragalactic or galactic cal band-response functions (detector effective areas) for a halo) origin by the foreground material of the galactic disk. sample of soft X-ray and EUV detectors. The mean-free-path The fact that the required absorption cross-sections were 3 curve assumes a constant space density of nH 0.5 cm and smaller than the theoretical values could be explained by uses Morrison and McCammon (1983) absorption cross-sec- possible clumping of the cooler ISM. The nonzero flux tions. The Be band response function is from the Wisconsin observed in the galactic plane, where the Galaxy is com- detector (Bloch et al. 1986), and has been multiplied by a fac- pletely opaque to extragalactic photons in this energy tor of 50 for display purposes. The B band response curve is range, was attributed to additional background components, from the Wisconsin survey (McCammon et al. 1983). The 1 3 of either non-cosmic (e.g., a charged-particle background) band response functions of the 4 keV and 4 keV bands are from or cosmic (e.g., unresolved galactic point sources) origin. Rosat (Snowden et al. 1995). The band response functions Thus, the first big step toward our understanding of the show the sensitivities of the respective instruments as a func- tion of energy (the larger the values, the more able they are to SXRB went, as will be shown, in somewhat the wrong detect X-rays of that energy). direction. However, also from the first step, the study of the SXRB was correctly linked to the ISM, in this case cooler components of the ISM that can absorb an X-ray flux of through the Galactic disk, one optical depth is reached after distant origin and modulate its apparent surface brightness. a path length of 100 pc, assuming a smoothly distributed 1 3 By its nature, the 4 keV SXRB is closely linked to the ISM with a space density of 0.5 H I cm . cooler ISM, whether or not it originates in the galactic disk or as an extragalactic background. The column density of H I (NH, the amount of material along a line of sight) required for EVOLUTION OF THE DATA unit optical depth for the absorption of X-rays by the ISM is shown in Figure 1 as a function of energy. In the This section will focus on the data collected over the last 1 20 2 4 keV energy range this is NH 10 cm . Even at its mini- three decades and how they influenced our understanding of mum high-galactic-latitude column density (NH 5 the SXRB. It will not cover all contributions to the field (with 19 2 10 cm ), the Milky Way provides roughly half an optical apologies to those left out) but will hopefully give a feel for depth of absorption as viewed from Earth for X-rays of extra- the evolution of the data. While some reference will be made galactic origin. At lower latitudes with longer path lengths to models for the origin of the SXRB, the detailed discussion of such models will be deferred to later in this chapter. *Astronomy has the quaint convention of using roman numerals to indi- cate the ionization state of elements. However, a single “I” indicates neutral material. (In fact, the “I” is the label for the “first spectrum” of a given ele- ment, which by convention is the spectrum of the neutral overestimate.) Initial observations and the early years Thus, H I refers to neutral hydrogen and H II refers to ionized hydrogen (otherwise known as a proton). The ion O VI referred to later in this chapter As noted above, observations of the SXRB have evolved is five-times-ionized oxygen, or O5 . from the results of a limited number of sounding-rocket THE HOT PART OF THE INTERSTELLAR MEDIUM 583 flights covering limited regions of the sky to all-sky surveys Bunner et al. (1969) presented the last of the initial triad produced by orbiting observatories. The first published of SXRB observations, and reported results which were observation was the result of a sounding-rocket scan by more-or-less consistent with the previous two. In their Bowyer et al. (1968) which observed the northern Galactic analysis they considered three separate alternatives for 1 hemisphere between the center and anticenter directions (the aspects contributing to the angular structure of the 4 keV anticenter refers to the direction in the Galactic disk which SXRB: from Earth is 180 from the galactic center), scanning over 1. The small effective absorption cross-sections were due, the galactic pole. They observed a distinct negative correla- as in Bowyer and Field (1969), to clumping of the tion between the surface brightness of the SXRB in the absorbing medium into optically thick clouds. Although 1 keV band and the column density of galactic neutral 4 they did not find the evidence for the existence of such hydrogen that they attributed to the absorption of an extra- clouds compelling, clumping of the H I into clouds of galactic background. The fitted absorption cross-section was N 1021 cm 2 satisfied the intensity-dependence con- significantly less than the model prediction from atomic H straint of the 1 keV data with absorption column density physics, a result which was left as having mostly unspeci- 4 but was still consistent with the 0.5Ð1.0 keV data. How- fied strong implications for the ISM after a discussion of a ever, even with the assumed clumping, an additional soft possible incorrect value for the He/H ratio.* Higher X-ray component was required to explain an excess of emission intensities were observed in the direction of the galactic observed above the extrapolation of the extragalactic center, which were attributed to an anomalous emission power law identified at higher energies. component and interpreted as possible evidence for a galac- 2. The observed flux in the galactic plane was due to an tic corona (a hot galactic halo). The Bowyer et al. results unknown solar or terrestrial background. This assump- should be considered in the context of the results of Gursky tion reduced the requirement for clumping of the ISM, et al. (1963) who had identified the possibility of an extra- but an additional, softer extragalactic emission compo- galactic background contribution in their data at energies nent was still required. above 2 keV, and the suggestion by Gould and Sciama 3. The observed flux in the galactic plane was due to the (1964) and others that the intergalactic medium might emit superposition of unresolved galactic Population II sources strongly at softer (lower) X-ray energies. (the older, more numerous, and less massive stars in the Henry et al. (1968) reported the SXRB observation of Galaxy, M dwarfs for example). The required source another sounding-rocket flight, and like Bowyer et al. density, 10 2 pc 3, was less than the population of (1968) they observed extensive emission at 1 keV. In their 4 known objects, and so could not be ruled out. analysis, they ruled out galactic corona emission (the den- sity of the corona would need to be unreasonably high) as While there was some disagreement about the details of well as emission from external galaxies and galactic point the observational results presented by the three groups, they sources as the origin (because of both spectral considera- were all consistent with requiring an emission component tions and the excessive magnitude of the required total in excess of the extrapolation of the extragalactic power law 1 flux), leaving the conclusion that the background arose as observed above 2 keV to explain the background at 4 keV. freeÐfree emission (radiation from interactions between The “real” diffuse background was assumed to be extra- free electrons and positive ions) from a hot, dense inter- galactic or galactic halo in origin while any residual flux galactic medium. Bowyer and Field (1969), when not from the galactic plane was assumed to be either non- explaining the differences between the Bowyer et al. (1968) cosmic contamination or from unresolved point sources. and Henry et al. (1968) results, suggested that the counts Diffuse X-ray emission from the interstellar medium of the observed in the galactic plane were due to an isotropic galactic disk was not yet considered. particle background, and that the apparent absorption cross- sections were reduced from the theoretical values by clump- ing of the absorbing medium.** The concept of a clumpy, **When the ISM is clumped into at least partially optically thick clouds X-ray-absorbing ISM would play a role of some controversy which are not resolved by either the H I or SXRB observations, the effect is to reduce the apparent ability of the material to absorb a diffuse X-ray flux. in models of the SXRB for the following 20 years. For example, if the ISM is diffusely distributed then the absorption optical depth should just be the theoretical cross-section multiplied by the observed column density. However, if the ISM is clumped into unresolved “bricks,” an X-ray would certainly be stopped by the brick, but the bricks *While the absorption column density is measured by the column den- would cover very little of the sky so most X-rays would be unaffected. sity of neutral hydrogen, NH, derived from 21 cm measurements of the ISM Once the brick is optically thick it does not absorb more X-rays if you add (21 cm is the wavelength of the neutral hydrogen electron spin-flip transi- more material to it, but more material is removed from the ISM, allowing tion), the helium associated with the hydrogen provides roughly half of the more X-rays to pass between the bricks. The effective absorption cross- 1 physical absorption at 4 keV. Therefore if the true He/H ratio is lower than section is then reduced. Besides the effect of reduced cross-sections, the assumed, the model absorption will be overestimated. absorption becomes energy independent once the bricks are optically thick. 584 STEVEN L. SNOWDEN
The 1970s intensities in the two bands remained relatively constant (with some structure) over a factor-of-3 variation in inten- The 1970s were marked by the targeting of observations to sity and associated variation in column density (Fried et al. attempt to determine the origin of the background and the 1980). Two solutions for this discrepancy were that either expanded coverage of the soft X-ray sky. A significant role the absorbing column of galactic H I was significantly was played by the decade-long sounding-rocket campaign clumped, eliminating any differential (in energy) absorp- by the University of WisconsinÐMadison, Space Physics tion, or that the emission originated in front of the H I (e.g., Laboratory under the direction of W.L. Kraushaar. One goal Sanders et al. 1977). Hayakawa et al. (1978) came to the of the campaign was to determine the fraction, if any, of the latter conclusion as well, that a majority of the diffuse observed background that was extragalactic in origin; a sec- Galactic X-ray background originated in a local region sur- ond goal was to survey the entire sky in the 0.1Ð10.0 keV rounding the Sun. They used spectral fitting of sounding- band (see below). To address the first issue, the Small rocket data with the result that little or no interstellar Magellanic Cloud (SMC) was scanned to search for the absorption is required. signature of shadowing. The use of “shadowing” here refers Another vital aspect of the SXRB became clear in the to the simple idea that a foreground object such as an H I 1970s: the observed emission most likely had a thermal ori- cloud will absorb part or all of the X-ray emission originat- gin (emission from a hot plasma dominated by collisionally ing behind it, thus casting a shadow in the light of the back- excited emission lines). For example, Williamson et al. ground source. This is analogous to the sky on a stormy day (1974) discussed several non-thermal emission mechanisms when it appears darker in directions where the clouds are and found them lacking. Synchrotron production (radiation thicker, or more starkly, when the Moon passes in front of produced by charged particles spiraling around magnetic the Sun during an eclipse. The result of the SMC study field lines) failed because the required X-ray spectral index was that at least 75% of the observed 1 keV SXRB 4 is much larger than that at radio wavelengths, and the life- originated in front of the SMC (McCammon et al. 1971). time of the required electrons ( 1013 eV in a 3 G mag- Another group, Long et al. (1975), searching for detailed netic field reasonable for the Galaxy) is less than 104 years. negative correlation between the column density of galactic The number of 250 MeV electrons required to produce H I and the surface brightness of the SXRB, presented 1 the keV SXRB by Compton scattering (collisions between results for a sounding-rocket observation in the direction of 4 high-energy electrons and photons that increases the energy the Large Magellanic Cloud. They found that greater than of the photon) off the 3 K background (the fossil radiation 90% of the observed 1 keV background originated in front 4 left over from the Big Bang) is unreasonably large as well. of the galactic H I in that direction; a direction which is rel- Thermal bremsstrahlung (freeÐfree) emission from a hot atively opaque to 1 keV X-rays of distant origin if the ISM 4 plasma could produce the required spectral contribution is not clumped. but also required too high a space density for the plasmas, With increasing sky coverage it became clearer that and thus excessive pressures, if galactic in origin (e.g., while there was certainly a general negative correlation Henry et al. 1968). The inclusion of collisionally excited between the SXRB surface brightness and the column den- line emission from a thermal plasma (e.g., Cox and Tucker sity of H I (i.e., the plane-to-pole inverse variation of N H 1969) provided a much more efficient emission mechanism and X-ray intensity), the evidence for detailed negative in the 0.1Ð1.0 keV energy range, reducing the required correlation was in general lacking (i.e., shadows cast by density for that plasma to a reasonable level for a galactic distinct clouds in the ISM). However, it should be remem- origin. Thus thermal emission from extended regions of bered that the early detectors were mechanically collimated hot plasma became the most likely candidate for the source and typically had fields of view on the order of 10Ð50 of the SXRB. A galactic thermal origin became even more square degrees (the Moon has an apparent diameter of likely with the Copernicus observations of a nearly ubiqui- about half a degree). In addition, the observation of the sky tous interstellar O VI absorption line (e.g., Jenkins 1978). during a sounding-rocket flight was limited to around five The width of the ISM absorption lines in the spectra of minutes of useful data collection, so the observations were Milky Way stars indicated that the O VI could only be also photon limited. produced by collisional excitation by a thermal plasma at An advantage of the Wisconsin data was a second, softer T 105.5 K. While a plasma at this temperature is a bit too band provided by a detector with a boron filter (the B band, 1 cool to be responsible for the observed keV emission, it is the boron K absorption edge yields a high-energy cut-off 4 consistent with what could be expected from H I-cloud of 0.188 keV, compared to the usual carbon K absorption- interfaces with a hotter plasma. De Korte et al. (1976) pre- edge cut-off of 0.284 keV for the C or 1 keV band, see 4 sented an analysis of the data from two sounding-rocket Figure 1). The theoretical effective ISM absorption cross- flights from the early 1970s. Using data from two detectors sections for the two bands differ by about a factor of 2; with markedly different window thicknesses (which creates however, the hardness ratio (or X-ray color) of the observed a similar but not as strong an effect as having a separate 27
EWINE F. VAN DISHOECK* AND ALEXANDER G.G.M. TIELENS** Space-borne observations of the life cycle of interstellar gas and dust
The gas and dust in the interstellar medium (ISM) form an were detected the diffuse interstellar bands (DIBs; Heger essential part of the evolution of galaxies, the formation of 1922, Merrill 1934), whose identification is still uncertain stars and planetary systems, and the synthesis of organic mol- after more than 75 years. Initially, inspired by the identifica- ecules that may lead to the emergence of life elsewhere in the tion of simple diatomic species in the ISM, these bands Universe. Over their lifetimes, stars return much of their were attributed to molecular absorbers. However, once mass to the ISM through winds and supernova explosions, interstellar dust was established as an important interstellar leading to a slow enrichment in heavy elements and dust that component, an origin for the DIBs in absorption by dust form the building blocks from which future generations of grains was taken for granted. Nowadays the pendulum has stars and planets are made. Stars also inject energy into the swung back Ð and for good reason Ð to molecules as the ISM via ultraviolet photons, shocks, and wind-blown stellar prime candidates for the carriers of these absorption features bubbles. Cosmic rays, x-rays, and ionizing photons influence (Snow 1995). The foundation of the theoretical study of the the ionization state of the gas, whereas shielding by gas and ISM and the physical conditions that may prevail there was dust leads to the cold, neutral phases of the ISM where mole- put forward by Arthur Eddington (1926) in his famous cules can flourish. As a result, the composition and structure Bakerian Lecture. It was also he who exclaimed that “atoms of the ISM is governed by a complex interplay of micro- are physics but molecules are chemistry” with the scarcely scopic and macroscopic processes. Understanding this life hidden message that astronomers should stay away from cycle of gas and dust in the ISM is a key problem in astro- molecules. Nowadays, despite Eddington’s warning, molec- physics, for understanding not only our own Galaxy but also ular astrophysics is a thriving field driven to a large extent the much more rapid cycling between the ISM and stars in by the wealth of molecular data that has become available the earliest star-forming galaxies in the Universe (Figure 1). over the last three decades, much of it harvested from space. Research on the ISM started early in the twentieth cen- With the detection of the 21 cm H I line by Ewen and tury with the ground-based detection of interstellar Na and Purcell (1951), the study of the ISM turned to ground-based Ca optical absorption lines (Hartmann 1904, Heger 1919) radio observations, and data thus gathered still provide a and the appearance of many dark regions on photographic wealth of information on the distribution and kinematics of surveys of the Milky Way (e.g., Barnard 1919). Definite evi- the neutral, atomic ISM in our Galaxy and other galaxies dence for the presence of interstellar dust came from obser- (e.g., Hartmann and Burton 1997). The detection of the first vations by Trumpler (1930), whereas the first interstellar molecules at radio wavelengths (Weinreb et al. 1963, molecules, CH, CH , and CN, were identified between Cheung et al. 1968) paved the way for the development of 1937 and 1941 (Swings and Rosenfeld 1937, McKellar millimeter and submillimeter wave astronomy. The ubiqui- 1940, Douglas and Herzberg 1941). Around the same time tous CO molecule was detected by Wilson et al. (1970), and a surprisingly large number of other molecules have * Rijksuniversiteit Leiden, The Netherlands since been found in dense molecular clouds (see van ** Rijksuniversiteit Groningen, The Netherlands Dishoeck and Hogerheijde (1999) for a recent overview).
Ewine F. van Dishoeck and Alexander G.G.M. Tielens, The Century of Space Science, 607Ð645 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 608 EWINE F. VAN DISHOECK AND ALEXANDER G.G.M. TIELENS
Life cycle of a low mass star
Diffuse ISM Dense dust cloud molecular cloud Red Giant phase
Comets
Low mass star is born Asteroids
Planets Cloud breaks up Planetesimals form
Figure 1 Schematic diagram of the lifecycle of dust and gas from the diffuse ISM, through star formation, to the late stages of stellar evolution. (Pendleton and Cruikshank 1994.)
Space-based ultraviolet observations of the neutral ISM Sun Ð was by William Herschel in 1800, while his son John started with small spectrometers flown on rockets, which Herschel was the first to measure the total power emitted by led to the important detection of H2 by Carruthers (1970). the Sun, in 1838, and Edward Stone that of stars, in The later Copernicus satellite (Spitzer and Jenkins 1975) 1869Ð70 (Harwit 1999). In the late 1870s Thomas Edison provided extensive new information on H2 and other inter- observed Arcturus and the solar corona in the infrared stellar lines, mostly of atoms in various ionization stages. from a chicken coop in order to win a bet (Eddy 1972). Together with the ground-based H I data, they stimulated However, despite these early successes the field of infrared the development of the two- and three-phase model of the astronomy lay dormant until the late 1960s to early 1970s, ISM in which the gas is heated by the action of the photo- when progress in infrared instrumentation truly opened the electric effect on grains and by large-scale supernova explo- infrared window of the spectrum. sions, and is cooled by line radiation from atoms (Field The 1958 discussion mentioned above went on to et al. 1969, McKee and Ostriker 1977). consider the observational difficulties of detecting a low- Because the typical temperatures of the cold, neutral intensity astronomical signal against a 300 K telluric back- ISM range from 10 to a few hundred K, most of the ther- ground and emphasized the advantage of balloon or mal emission occurs at mid- and far-infrared wavelengths, a satellite observations. This was elaborated upon in an early part of the spectrum which is largely inaccessible from the discussion of infrared radiation from interstellar grains by ground. It is therefore not surprising that much of our Stein (1967), who concluded that it was hopeless even to progress over the last 30 years or so has come from space- consider observing it through the mid-infrared telluric win- based observations in the infrared. To our knowledge, the dows. Fortunately he was not much impressed with his own earliest discussion of the importance of the infrared for the pessimistic prediction, and, together with (among others) study of the ISM goes back to an exchange between Fred Gillett, he went on to pioneer the field of mid-infrared Spitzer, Kahn, and Drake at the 3rd Symposium on spectroscopy of interstellar dust using a circular variable Cosmical Gas Dynamics (Burges and Thomas 1958). Of filter-wheel. By the early 1970s it was fully appreciated that course, the first detection of infrared radiation Ð from the the best way to determine the composition of interstellar SPACE-BORNE OBSERVATIONS OF THE LIFE CYCLE OF INTERSTELLAR GAS AND DUST 609
Table 1 Selected infrared space missions and instruments relevant to ISM research
Mission Instrument Acronym Wavelength Resolving Operational range ( m) power period
IRAS Photometers 12, 25, 60, 100 3Ð5 1983 Low Resolution Spectrometer LRS 7.7Ð22.6 20Ð60 COBE Far-InfraRed Absolute FIRAS 100Ð10 000 100Ð500 1989Ð1993 Spectrophotometer IRTS Far Infrared Line Mapper FILM 63, 158 400 1995 ISO Short Wavelength Spectrometer SWS 2.5Ð45 2000, 20 000 1995Ð1998 Long Wavelength Spectrometer LWS 43Ð197 200, 10 000 Camera CAM 2.5Ð17 3Ð20 Camera+Circular Variable Filter CAM-CVF 2.3Ð16.5 35Ð50 Photometer PHT 3.3Ð200 3 Photometer-Spectrometer PHT-S 2.5Ð5, 6Ð12 90 SWAS Heterodyne Instrument 545, 610 5 105 1999Ð
dust was through infrared spectroscopy (Gaustad 1971). neutral ISM and the formation of stars. The various phases of That was an impressive foresight and insight, as well as a the ISM considered here are summarized in Figure 2, and are change of direction in a short time. It should be remem- discussed in order from the diffuse medium to old stars. The bered that by the late 1960s and early 1970s, rocket-borne richness of the infrared wavelength region is illustrated in ultraviolet studies had revealed the ubiquitous presence of Figure 3, which shows the complete ISO Short Wavelength 2200 Å bump in the interstellar extinction curve (Stecher Spectrometer (SWS) spectrum of Orion centered at the IRc2 1965, Bless and Savage 1972), which had been successfully source (van Dishoeck et al. 1998). Its proximity and extraor- modeled in terms of small graphite grains (Gilra 1972). dinary brightness have made this source the prime target for Yet, as has become abundantly clear since then, most of the pioneering observations at infrared wavelengths advances in our knowledge of the physics and composition (e.g., Genzel and Stutzki 1989). Much of the complexity is of the dust and gas have followed advances in infrared the result of the disruption of the star-forming environment detector and heterodyne submillimeter receiver technology, by powerful outflows from the massive young star, and its and the development of airborne and space-based platforms intense ultraviolet radiation dissociating and ionizing the gas through pioneers such as Martin Harwit, Gerard Kuiper, on the cloud surfaces. Orion is the nearest and best studied Frank J. Low, Charles H. Townes, and many others (e.g., region of massive star formation in the Galaxy, and therefore Harwit et al. 1966, Kleinmann and Low 1967). The Lear also serves as a template for more distant star-forming Jet and the Kuiper Airborne Observatory (KAO) in the regions in other galaxies. The strong continuum is due to USA, and the USÐNetherlandsÐUK Infrared Astronomical thermal emission from warm dust (50Ð300 K) and peaks Satellite (IRAS), have been pivotal in developing the field, around 70 m. A wealth of superposed lines of atoms, ions, which has culminated with ESA’s recent Infrared Space and molecules is seen, which can be used to constrain the Observatory (ISO) (Table 1). Other important missions physical parameters (see Figure 8 of Genzel 1992) and assess include NASA’s Cosmic Background Explorer (COBE), the relative importance of different processes, in particular balloons, rockets, the Japanese Infrared Telescope in shocks v. ultraviolet radiation. Space (IRTS), the US Air Force project Midcourse Space limitations preclude a comprehensive review of the Space Experiment (MSX), and NASA’s Submillimeter field, and only topics in which space-based observations Wave Astronomical Satellite (SWAS). In this respect, the have played a crucial role are covered. For example, ground- future looks very promising, with the Space Infra Red based submillimeter observations of molecular clouds are Telescope Facility (SIRTF), the ASTRO-F mission, the not mentioned, but the composition of ices in such clouds, Herschel Space Observatory (formerly known as FIRST), as deduced from mid-infrared spectroscopy is. Ultraviolet and the Next Generation Space Telescope (NGST) all on observations of the atomic component of the ISM and the horizon. space-based data on ionized H II regions and continuum This chapter presents an overview of the air- and space- observations of young stars are discussed elsewhere in this borne observations that over the last 30 years have played volume. Each section starts with a brief historical review a crucial role in our understanding of the cold and dense and an introduction of the basic physics, and goes on to 610 EWINE F. VAN DISHOECK AND ALEXANDER G.G.M. TIELENS
UV starlight [C II] 158 µ m
Stellar winds Diffuse neutral ISM PAH features Far-infrared dust continuum Supernovae
[C II] 158 µm, [C I] 610 µm [O I] 63, 145 µm, [S I] 25 µm UV starlight CO high-J Molecular Clouds PAH features Shocks H 2 pure rotational and ro-vibrational lines Hydrides: H 2 O, OH, HD Far-infrared dust continuum
Ices Luminosity Circumstellar envelope Amorphous silicates embedded YSO around young star Gas-phase absorption and emission Far-infrared dust continuum
Amorphous and crystalline silicates UV radiation PAH features young star Circumstellar disk CO ro-vibrational bands, H I recombination H 2 pure rotational lines Spectral energy distribution
Amorphous and crystalline silicates PAH features IR and UV radiation Circumstellar shell CO and HCN high-J old star around old star H 2 O, OH
Figure 2 Schematic diagram of the energy input to the various phases of the ISM discussed here, and the resulting diagnostic lines at mid- and far-infrared wavelengths. Most of these features can only be observed from air- and space-borne platforms. PAH stands for polycyclic aromatic hydrocarbons or large carbonaceous molecules in general; YSO for young stellar objects.
focus on recent ISO results to illustrate the current state of Molecular Astrophysics of Stars and Galaxies (Hartquist knowledge, in particular the spectroscopic data obtained and Williams 1998), van Dishoeck and Blake (1998), van with the ISO’s Short Wavelength Spectrometer (SWS, Dishoeck and Hogerheijde (1999), Langer et al. (2000), 2.4Ð45 m) and Long Wavelength Spectrometer (LWS, Ehrenfreund and Charnley (2000), and the proceedings of 43Ð197 m) (see Table 1 for an overview). Excellent previ- IAU Symposium 197 on Astrochemistry: From Molecular ous summaries include the proceedings of the Airborne Clouds to Planetary Systems (Minh and van Dishoeck Astronomy Symposium (Haas et al. 1995), which also con- 2000). Interstellar dust has been reviewed by, for example, tains many photographs of key people in the field, and the Tielens and Allamandola (1987a,b), Whittet (1992), and proceedings of the The Diffuse Infrared Radiation and the Cox et al. (2000). IRTS (Okuda et al. 1997). Further details of ISO results are contained in Star Formation with the ISO Satellite (Yun and Liseau 1998), ISO’s View on Stellar Evolution (Waters et al. 1 THE DIFFUSE INTERSTELLAR MEDIUM 1998), Analytical Spectroscopy with ISO (Heras et al. 1997, 2000), Solid Interstellar Matter: The ISO Revolution 1.1 Very small grains (d’Hendecourt et al. 1999a), and The Universe as seen by ISO (Cox and Kessler 1999). Reviews of various aspects The infrared emission from dust in the diffuse, neutral ISM of the neutral ISM include Spitzer (1978), Habing (1988), is faint and extended, and observations have been mostly Genzel (1992), and Hollenbach and Tielens (1999). limited to what is possible with cooled telescopes in space. Recent reviews of astrochemistry can be found in The The all-sky maps by IRAS (at 12, 25, 60 and 100 m) and 28
PRISCILLA C. FRISCH* The interstellar medium of our Galaxy
Over the past century our picture of diffuse material in space the metallicity of interstellar gas, and by inference the has grown from a simple model of isolated clouds in thermal mineralogy of interstellar grains. A primary goal of equilibrium with stellar radiation fields to one of a richly interstellar medium studies has been to determine the varied composite of materials with a wide range of physical chemical composition of interstellar clouds compared to, properties and morphologies. The Solar System interacts for instance, normal Population I stars such as the Sun. with a dynamical interstellar medium. Optical, radio, and Space observations are required to observe most astronomi- UV astronomy allow us to study the clouds which form the cally interesting elements such as C, N, O, Fe, Mg, and Si, galactic environment of the Sun. The composition and distri- since the resonant ground state transitions of these atoms bution of interstellar clouds in the disk and halo tell us about fall in the ultraviolet (UV, 912Ð3000 Å). the history of elemental formation in our galaxy, and the past Gene Parker once asked me “What is an interstellar and future environment of the Solar System. cloud?” The Rashomon-like answer depends on the con- Dark lanes of dusty clouds obscuring portions of the text. Early optical data showing velocity components in Milky Way are celestial landmarks, but the realization that interstellar absorption lines led to a definition of “clouds” interstellar gas pervades space is quite recent. The twentieth as discrete kinematical units. Alternative descriptions were century opened with the discovery of a “nebulous mass” of based on the physical properties of the clouds, for example interstellar gas in the sightline towards the binary star warm diffuse intercloud material in equilibrium (Stromgren Orionis (Hartmann 1904). A series of over 40 spectra 1948, Field et al. 1969), or hot tenuous coronal gas (Spitzer showed that the Ca II K line (3933 Å ) absorption was 1956) to confine the clouds. Ground and space data now nearly stationary in wavelength, “extraordinarily weak,” show interstellar material (ISM) with densities in the range and “almost perfectly sharp,” in contrast to broader variable 10 4 atoms cm 3 to over 103 atoms cm 3, kinetic tempera- 6 stellar absorption features. Sharp stationary Na I D1 and tures 20 K Tk 10 K, and many levels of ionization. D2 lines (5890, 5896 Å ) were discovered in Ori and Within 10 pc of the Sun, we see density contrasts of over Sco by Mary Lea Heger. An explanation offered was that a 400 and temperature contrasts over 100. The distinction stationary absorbing cloud of vapor was present in space between turbulence and “clouds” has been blurred by between these binary systems and the observer. The Ca II recent high-spectral resolution data showing that low- and Na I lines constituted the primary tracer for interstellar resolution spectral data may miss over half of the velocity gas during the first half of the century. components in a sightline (Section 3.2), and that 15% of Interstellar matter (excluding dark matter) provides the mass of cold clouds is contained in tiny (AU-sized) struc- about 30Ð40% of the galactic mass density in the solar tures (Section 3.1). Are these features “clouds” or manifesta- neighborhood. Trace elements heavier than He, which form tions of a turbulent ISM? The discovery of interstellar clouds the planets, record the chemical evolution of matter in our in the galactic halo (Münch 1957) added new questions Galaxy, and provide detailed information on physical con- about the stratification of ISM in the gravitational potential ditions in interstellar clouds, represent a small proportion of of the Milky Way Galaxy, and the origin of halo gas. the interstellar atoms ( 0.15%). These same elements trace Surprisingly, interstellar gas constitutes about 98% of the diffuse material inside of the heliosphere, and subparsec * University of Chicago, IL, USA spatial variations in interstellar cloud properties near the
Priscilla C. Frisch, The Century of Space Science, 647Ð675 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 648 PRISCILLA C. FRISCH
Sun indicate the solar environment could change on time their obscuration from “fine solid grains.” Eddington noted scales 104 years. Shapley’s conjecture in 1921 that inter- that radiation with energies greater than 13.6 eV (the stellar clouds affect planetary climates no longer seems ionization potential of hydrogen) would be prevented from outlandish. entering clouds by abundant hydrogen, and concluded inter- 2 Symbols used here are N (column densities, cm ), and stellar H2 would be abundant. 3 nH (total volume density for all forms of H, cm ). Early In the early part of the century, Harlow Shapley advanced optical and 21 cm radio data were insensitive to clouds the idea that interstellar clouds were linked to terrestrial with column densities N(H) 1019 cm 2. UV observations climate shifts. He noted that the diffuse luminous and dark of trace elements can detect kinematical objects with nebulae are found throughout space, and that the Sun was N(H) 1016 cm 2. Galactic longitudes and latitudes are receding from the Orion region where dark nebulae are quoted in System II. (System II was adopted by the prominent (Shapley 1921). Shapley suggested that a past International Astronomical Union in 1958 in order to correct climate-altering encounter between the Orion molecular earlier errors in the location of the galactic center. Galactic clouds and the Solar System, would yield a 20% variation in coordinates published before 1958 are incorrect.) The local solar radiation, which if sustained for a period of time would standard of rest (LSR) velocity frame represents heliocentric alter Earth’s climate. While encounters with dense clouds as velocities transformed to the rest frame corresponding to the envisioned by Shapley are statistically improbable, encoun- mean motion of comparison stars near the Sun, where the ters with clouds of modest density ( 10 cm 3) are much comparison set is selected according to some criterion. (Most more likely and would destabilize the heliosphere and mod- LSR interstellar velocities presented in the twentieth century ify the interplanetary environment (Zank and Frisch 1999). assumed, frequently without explanation, a “standard” solar motion corresponding to a velocity of 19.7 km s 1 towards 1.1 Optical absorption lines the apex position lII 57 , bII 22 . Recent Hipparcos results give a solar motion of 13.4 km s 1 towards the apex The interstellar nature of the sharp stationary absorption direction lII 28 , bII 32 .) Radio data are usually pre- features seen in binary systems was quickly established. sented in the LSR velocity frame, where this issue is particu- Plaskett and Pearce (1930) provided a convincing discus- larly troublesome. sion that the sharp Ca II and Na lines are formed in diffuse This chapter focuses on the diffuse gas in the space space, and labeled these features “interstellar.” Observations between the stars of our Galaxy. For an eloquent summary of 1700 stars with V 10.5 mag by Otto Struve had shown of the physical properties of the ISM and data up to the that K line strengths in general increase with increasing mid-1970s, see Spitzer (1978). magnitude (and thus distance), suggesting an interstellar ori- gin. Plaskett and Pearce measured Ca II K line velocities for 250 OB stars (to within 1.8 km s 1), and found they are 1 DISCOVERING INTERSTELLAR GAS “almost exactly twice” interstellar Ca II motions expected for the galactic rotation of interstellar clouds located at half The observational and theoretical foundations of ISM space the distance to the background star. Peculiar motions within studies were formed in the first half of the twentieth the clouds were found to be equally important for line century. In 1926 Sir Arthur Stanley Eddington laid out the broadening. principles of the ionization equilibrium of atoms in space During the late 1930s, Merrill and collaborators under the influence of dilute stellar radiation fields (the surveyed the yellow Na I D1, D2 and blue Ca II H, K lines Bakerian Lecture, Eddington 1926). He evaluated the in over 400 hot, bright stars. The D2/D1 line ratios were importance of the short wavelength stellar radiation field seen to increase as total line intensity decreased, providing ( 800 Å) for cloud heating (although extreme ultraviolet an early indication of line saturation. The formula relating radiation from space was not observable in 1926), and con- absorption line equivalent width (W) and column density cluded that diffuse clouds are illuminated by a radiation (N) was found to be faulty for “deep” lines, leading to the field at a Planck temperature of 10,000 K and have kinetic development of an empirical “curve of growth” (COG) temperatures Tk 10,000 K. He found frequent collisions using the doublet ratio (Ca II H/K, Na I D1/D2) to con- would establish Maxwellian velocity distributions for elec- strain the functional dependence of W. The COG, formal- trons and ions in space. Eddington concluded that the mate- ized later by Stromgren (Section 1.3), has been used to rial creating the stationary Ca II and Na I absorption lines is derive column densities from the equivalent widths of inter- uniformly distributed, and argued that stellar dynamics stellar absorption lines for 70 years. Cloud motions were 3 implied nH 10 cm for diffuse material. He determined shown to depend on star distance, with velocityÐlongitude that in space most interstellar Ca is Ca III and most Na is plots showing a double sine pattern with an amplitude at Na II He “reluctantly” concluded that dark nebulae derive half the expected galactic rotation value (e.g., see review of THE INTERSTELLAR MEDIUM OF OUR GALAXY 649
Münch 1968). The bulk motions implied by line velocities survey provided a consistent survey of cloud kinematics. showed a chaotic or turbulent component, with dispersion Adams estimated the relative strengths of the Ca II lines by 1 vradial 5Ð10 km s , which was interpreted as moving visually estimating line strengths, which was a common “clouds” or “currents.” The Na I line strengths increased practice then; the weakest features he detected correspond with both distance and the amount of interstellar “smoke” to equivalent widths of 3mÅ. Half of the stars in the sam- (now known as dust) in the sightline (where the dust was ple were found to have complex H or K lines, while about determined by the reddening of starlight). six stars showed weak Ca II absorption at 4226 Å. About The chemical composition of interstellar clouds was 25% of the stars had weak blue/near-UV features from explored by a series of observations at Mt. Wilson, Lick, CN I, CH I, CH II, Ca I, or Fe I. Adams found 21 Ca II line 1 and the Dominion Astrophysical Observatories. Sharp inter- components with vlsr 30 km s (the velocities quoted in stellar absorption lines from Na I ( 3302.4, 3303.0, Adams (1949) should be updated with modern wave- 5895.9, 5890.0), Ca I ( 4226.7), Fe I ( 3719.9, 3859.9), lengths), and concluded the velocities represented peculiar Ti II ( 3383.8, 3242.0, 3229.2, 3073.0), K I ( 7664.9, cloud motions through the LSR. 7699.0), CH II ( 3957.7, 4232.5), CH I ( 4300.3, Blaauw used Adams’ data to determine the form of the 3890.2), and CN I ( 3875.8, 3874.0) were discovered, velocity distribution of the chaotic component of interstel- and upper limits were placed on Al I ( 3944.0), Sr II lar cloud velocities (e.g., see review by Spitzer 1968). He ( 4077.7, 4215.5), and Ba II ( 4554.0, 4934.1) lines compared Ca II component velocities with two possible (e.g., see review of Münch 1968). The sightlines towards distributions Ð a Gaussian and an exponential distribution 2 Ori and 55 Cyg were shown to be excellent for identify- of component velocities. Blaauw concluded that an expo- ing new interstellar lines (Dunham 1939). Dunham used the nential form fitted the observed velocity distributions better weak 3302 Å and strong D1, D2 Na I doublets to construct than a Gaussian form. Munch extended this analysis to halo an empirical COG for the ISM towards 2 Ori, avoiding sat- stars, with observations of Ca II and/or Na I towards uration problems with the D lines. Ionization equilibrium 112 stars in the northern hemisphere sky (l 50 Ð160 , calculated for N(Ca I)/N(Ca II) provided the interstellar elec- |z| 1Ð2 kpc; Münch 1957). He confirmed earlier studies 3 tron density, ne 0.7Ð1.4 cm . The anomalous properties of showing that bulk cloud motions indicate clouds are interstellar Ca II lines were apparent in early data. These aligned with the Orion and Perseus spiral arms, and found observations yielded N(Ca II)/N(Na I) 1.5, which was the blue-shifted absorption components from expanding inter- value expected from stellar atmosphere data at that time. stellar gas around O-star associations. Based on the Blaauw The peculiar behavior of interstellar Ca seen in these results, and reasoning that turbulence is not Gaussian so early data was a harbinger of the fundamentally different that the increased energy dissipation by supersonic turbu- properties of volatiles and refractory elements in the ISM; lence would flatten a cloud velocity distribution in compar- however, an incomplete understanding of cloud ionization ison to a Gaussian, Münch fitted the observed disk and halo prevented this recognition. When Olin Wilson showed con- cloud velocities with the form: vincingly that Ca II has a velocity distribution which is 1 peculiar in comparison to Na I, he described the results as (V) exp( V V ) 2 o “unexpected … and disappointing.” The average internal velocity distribution for Ca II was about three times larger where is the mean radial velocity, found to be 5 than for Na I (22 km s 1 v. 7.5 km s 1; Wilson 1939). 1kms 1 by Blaauw. With the exponential distribution, a Although later high-resolution spectroscopy showed that constant value for the velocity dispersion for all Na I D2 1 the “lines” observed by Wilson were blends of several com- line strengths is obtained (2 4.6 km s ), providing a ponents (Section 3.2), the velocity distribution of Ca II was better fit to velocity distributions than a Gaussian form. This still peculiar. Merrill and Wilson showed that the ratio has created what I view as a conundrum, since the internal N(Na I)/N(Ca II) varied strongly from one cloud to another, velocity distributions of clouds are found to be Gaussian in with values 0.3Ð10 (e.g., see review of Münch 1968). cold clouds, while the bulk cloud velocities measured at Walter S. Adams published an influential survey of higher resolution show an exponential distribution. the strengths and velocities of Ca II H, K, lines and weak A study of the abundance variations between individual features from CN I, CH I, CH II, Fe I, and Ca I in 300 clouds was presented in a classic paper by Routly and disk (|z| 500 pc) OB stars visible from the northern hemi- Spitzer, in 1952, which confirmed earlier indications that the sphere (Adams 1949). The higher resolution of these distribution of Ca II cloud velocities is intrinsically larger data ( 6kms 1), compared to earlier photographic data, than the distribution of Na I velocities. They found both that demonstrated that interstellar gas is concentrated in clouds b(Ca II) 1.5 b(Na I) and that Ca II component velocities with velocity separations larger than the atomic velocity are larger than for Na I. The systematic decrease of dispersion within individual clouds. The velocities in this N(Na I)/N(Ca II) with increasing cloud velocity is known as 650 PRISCILLA C. FRISCH
the “RoutlyÐSpitzer (RS) effect”, manifested as N(Na I)/ curve of growth for the main component (heliocentric 1 1 N(Ca II) 1. RS proposed that the process accelerating velocity, vhc 15 km s ) gives bDop 2.4 km s and the Ca II also decreased N(Na I)/N(Ca II), either through colli- column densities for these elements. The photoionization sional ionization of Na I during acceleration, or from rela- equilibrium of Na I, which equilibrates photoionization tive differences between Ca and Na depletions onto dust with the temperature-dependent recombination rate of Na grains in low- v. high-velocity clouds. The first discovery of and an electron (N(Na II)/N(Na I) n(Na II)/n(Na I); 3 1 ISM within 20 pc of the Sun was possible because of the Section 1.3) gives ne 0.36Ð0.54 cm if the 15 km s RS effect. Ca II was detected towards Oph (14 pc), cloud is 50 pc from Oph. If photoionization of heavy but not Na I (Munch and Unsold 1962). Recent data give elements (e.g., C) supplies electrons, densities nH 1 3 N(Na I)/N(Ca II) 0.1 in the Vlsr 8kms cloud 500Ð900 cm are found, indicating the cloud is a thin towards Oph (Welty et al. 1996). Small-scale ( 1 ) vari- sheet of thickness 0.15 pc. Later studies show Na is ations in Ca II line strengths suggested the presence of small depleted by factors of 4Ð5, raising densities and reducing clouds ( 1 pc). These data provided the first evidence of cloud thickness by the same factors. Enhanced abundances small-scale structure and shocked ISM close to the Sun. of Ca II and Ti II in the 29 km s 1 cloud identify this As frequently happens, new techniques proven in first- material as “intercloud,” which Herbig suggested is local generation instruments provide limited data, but change the (since it is seen in front of many Scorpius stars). course of science. The first high-spectral-resolution (R Lewis Hobbs initiated the high-resolution era of optical 200,000) observations of optical absorption lines did not spectroscopy with a survey of the Na I D lines at resolu- achieve the initial goal of observing the Na I D line hyper- tion 1kms 1 using a Pepsios interferometer at Lick fine components (separated by 1.0 km s 1), but instead Observatory (Hobbs 1969). Turbulence was found to domi- demonstrated cloud velocity structure that is unresolved in nate observed line widths, since cloud temperatures of 1 photographic plates. High-spectral-resolution (R 3Ð5 100 K yielded bDop 0.3 km s , v. observed Doppler 5 1 10 ) spectrometers were developed to observe the hyperfine widths of bDop 1.5 km s . The photoionization balance of Na I lines towards Cyg using the long-focal-length Na applied to N(Na I)/N(H I) yielded typical electron densi- 3 3 McNath solar telescope (Livingston and Lynds 1964), and ties ne 0.008 cm for solar abundances, or nH 20 cm using a Pepsios spectrometer at Lick Observatory (Hobbs if electrons are supplied by the photoionization of metals. 1965). Much later, after the launch of the Copernicus satel- High-resolution optical data acquired by Hobbs, his stu- lite and after high-resolution optical data were routinely dents, and others proved to be a crucial supplement to UV acquired by Hobbs and others (Section 3.2), the Na I D-line data (Section 3.2). hyperfine splitting was discovered towards Cygnus, using a Michelson interferometer (effective resolution R 1.2 Radio astronomy – the first multispectral data 500,000; Wayte et al. 1978). Turbulence in cold interstellar clouds was found to be subsonic. The 1kms 1 cloud has Radio waves provided the first multispectral window on the 1 bDop 0.38 km s and a likely temperature of 70Ð124 K, ISM. Radio emission from the Galaxy was discovered by 1 indicating turbulent velocities vt 0.26Ð0.3 km s in com- Karl Jansky in 1932 during an investigation of radio static parison to the isothermal sound speed of 0.8 km s 1. in a Bell Labs shortwave ( 15 m) radio antenna. Jansky A thorough and now classic study of the ISM in front of recognized the interstellar origin of the hiss. Jansky’s Oph (HD 149757) was performed by George Herbig, using papers inspired Grote Reber, who, working with a private the Lick Observatory 120-inch (3 m) Coudé feed spectra radio telescope located in his backyard in Wheaton, Illinois, with photographic plates (Herbig 1968). The star Oph in 1938, detected cosmic static at 2 m and confirmed has a relatively featureless bright continuum (O9.5 Vn, V Jansky’s discovery. Over the next several years, Reber used 2.6 mag, 140 pc, vsini 380 km s 1), and a rich interstellar his backyard telescope to map the northern hemisphere spectrum (E(B V) 0.32 mag) lending itself to searches radio sky at 160 MHz and 480 MHz. The advent of World for new interstellar species. It is a runaway star from the War II moved radio astronomy to the forefront of technical ScorpiusÐCentaurus association moving supersonically interest, both in the USA and Europe, and during the fol- through the ISM (space velocity 40 km s 1), and with a lowing decade technical advances from the wartime use of circumstellar H II region (5 radius). Oph also has a ram- radar supported the development of the new radio sciences. pressure confined stellar wind observed around the star Shklovsky (1960) reviews the scientific basis for the devel- through 60 m radiation from heated, swept-up interstellar oping field of radio astronomy. grains. Herbig measured absorption lines from Na I, Ca II, The spectral index of the cosmic “radio static” was K I, Ti II, Fe I, CH I, CH II, and CN I, and set limits on measured and found to increase towards lower energies, in some 25 additional species at the W 1Ð2mÅ level. The contrast to the expected increase towards higher energies ISM is contained in two dominant clouds at 15 km s 1 predicted for a blackbody (or “thermal”) source. This and 29 km s 1 (heliocentric velocities). An empirical puzzle was explained when V.L. Ginzburg calculated the 29
FRANK B. MCDONALD* AND VLADIMIR S. PTUSKIN** Galactic cosmic rays
Our Galaxy is filled with a relativistic gas of high-energy energies of the order of 10 TeV (1013 eV). At high energies protons, electrons, and heavy nuclei. The interstellar energy ( 100 TeV), the cosmic-ray intensity is so small that density of these cosmic rays is ~1 eV cm 3 Ð comparable to experiments such as large, ground-based air-shower arrays the energy density of the galactic magnetic field and of the must be used. Throughout the modern era of cosmic rays Ð thermal energy of the interstellar medium Ð with an energy defined somewhat arbitrarily as 1946 onward Ð a vital stim- spectra that extends to a maximum energy above 1020 eV. ulus has been the guidance and insight from a number of High-energy particles are also an important and distinguish- excellent theoretical groups. ing feature of radio galaxies, quasars, and active galactic Cosmic-ray studies now span an epoch of almost exactly nuclei. The direct measurement by space and balloon exper- 100 years. In this brief overview we sketch the initial gesta- iments of their charge and mass composition and energy tion of cosmic-ray astrophysics. A more complete history spectra provide information on the source regions within our of the early development of cosmic rays by J. A. Simpson Galaxy, on injection and acceleration processes, and offer a can be found in Chapter 4. At the close of the nineteenth steadily increasing understanding of cosmic-ray transport century, scientists using gold-leaf electroscopes to study the through interstellar space. Observations from radio, gamma- conductivity of gases discovered that no matter how care- ray, and X-ray astronomy define the distribution of energetic fully they isolated their electroscopes from possible sources particles throughout our Galaxy and establish their presence of radiation they still discharged at a slow rate. In 1901 in extragalactic sources. The interpretation of these observa- two groups investigated this phenomenon, J. Elster and tions by allied scientific disciplines is significantly aided H. Geitel (1901) in Germany, and C. T. R. Wilson (1901) in by the detailed study of cosmic rays near Earth while our England. Both groups concluded that some unknown understanding of the sources and of the distribution of galac- source of ionizing radiation existed. Wilson even suggested tic cosmic rays is strongly dependent on the data from these that the ionization might be “due to radiation from sources other fields. outside our atmosphere, possibly radiation like Röntgen Space experiments have played a vital role in the devel- rays or like cathode rays, but of enormously greater opment of cosmic ray astrophysics. These experiments have penetrating power.” A year later two groups in Canada, determined the detailed charge and mass composition of Ernst Rutherford and H. Lester Cooke (1903) at McGill cosmic rays for the elements from hydrogen to nickel University, and J. C. McLennan and E. F. Burton (1902), at (Z 1Ð28) up to energies of ~1 GeV, have mapped the the University of Toronto showed that 5 cm of lead reduced charge distribution of many of the even Z elements to ura- this mysterious radiation by 30%. An additional 5 tonnes of nium (Z 92) and extended the direct observation of the pig lead failed to further reduce the radiation further. energy spectra of the more abundant elements up through Nothing significant happened until 1907 when Father Theodore Wulf (1907) of the Institute of Physics of Ignatus College in Valkenburg, Holland, invented a new electro- * University of Maryland, College Park, MD, USA ** IZMIRAN Ð Institute of Terrestrial Magnetism, Ionosphere and Radio scope. Wulf’s electroscope enabled scientists to carry the Waves Propagation, Troitsk, Russia search for the origin of the mysterious radiation out of the
Frank B. McDonald and Vladimir S. Ptuskin, The Century of Space Science, 677Ð697 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 678 FRANK B. MCDONALD AND VLADIMIR S. PTUSKIN laboratory, into the mountains, atop the Eiffel Tower and, radiation Ð indicating the presence of highly relativistic ultimately, aloft in balloons. Assuming that the radiation electrons throughout our Galaxy including some discrete came from the Earth, they expected to find a rapid decrease sources as well as extragalactic sources (Pacholczyk 1970). in the radiation as they moved away from the surface. They However, because of their small abundance (~1% of the did not find the decrease they expected and in some cases intensity of cosmic ray nuclei) electrons were not directly there seemed to be evidence that the radiation actually detected in the primary cosmic radiation until 1962 (Earl increased. Intrigued by the conflicting results obtained by 1962, Vogt and Meyer 1962). These discoveries made it Wulf and his colleagues, a young Austrian nuclear phy- possible to begin constructing realistic models of the origin sicist, Viktor Hess, obtained support from the Austrian and interstellar transport of galactic cosmic rays. Imperial Academy of Sciences and the Royal Austrian Aero As early as 1934, Baade and Zwicky linked the appear- Club to conduct a series of balloon flights to study the radi- ance of supernovae with neutron star formation and cosmic- ation. Hess obtained a license to pilot balloons in order to ray generation. Fermi (1949) regarded cosmic rays as a gas reduce the size of the crew and thereby increase the altitude of relativistic charged particles moving in interstellar mag- to which he could carry his electroscopes. On 12 August netic fields. His paper laid the groundwork for the modern 1912, using the hydrogen-filled Böhmen, Hess reached theory of cosmic ray acceleration and transport. The close an altitude of 5,350 m. Carrying two hermetically sealed link between radioastronomy and cosmic rays was conclu- ion chambers, he found that the ionization rate initially sively established at the time of the Paris Symposium on decreased, but then at about 1500 m it began to rise, until at Radioastronomy in 1958 (Paris Symposium 1959). This 5,000 m it was over twice the surface rate. Hess (1912) marked the birth of cosmic-ray astrophysics. The basic concluded that the results of these observations can best be model of the origin of galactic cosmic rays was developed explained by the assumption that radiation of a very high by Ginzburg and Syrovatskii (1964). penetrating power from above enters into the atmosphere Over the last three decades the continual evolution of and partially causes, even at the lower atmospheric layers, cosmic-ray detector systems for space missions have given ionization in the enclosed instruments. us a detailed description of galactic cosmic rays and a On a voyage from Amsterdam to Java, Clay (1927) growing understanding of their source region, acceleration, observed a variation in cosmic-ray intensity with latitude and transport. with a lower intensity near the equator, thus establishing that before entering the Earth’s magnetic field, the bulk of the primary cosmic rays were charged particles. In 1930 Bruno 1. COSMIC-RAY OBSERVATIONS Rossi, using Störmer theory, showed that if the cosmic rays were predominantly of one charge or the other there should The elemental composition of cosmic rays provides infor- be an eastÐwest effect. In the spring of 1933 two American mation on chemical fractionation in the source region as groups, Thomas H. Johnson (1933) of the Bartol Research well as some insight into the nature of this region and of the Foundation and Luis Alvarez and Arthur H. Compton (1933) propagation of cosmic rays in interstellar space. Cosmic- of the University of Chicago, simultaneously and indepen- ray isotopes probe more deeply the nature of the source dently measured the eastÐwest effect. It showed the cosmic region and the timescales of the injection and initial accel- radiation to be predominantly positively charged. In a series eration. Radioactive isotopes such as 10Be, 26Al, and 36Cl of balloon flights in the late 1930s, M. Schein and his co- reveal the temporal history of cosmic rays in the disk and workers (1941) used Geiger counter telescopes interspersed halo regions. The variation of the charge and mass compo- with lead absorbers to determine that most of the primary sition with energy Ð their energy spectra Ð can be related particles were not electrons, and hence protons were most to the acceleration process and to particle transport in the plausibly the dominant constituent. Galaxy. When improved measurements are available at ultra- In 1948 research groups from the University of high energies it should be possible to determine whether Minnesota and the University of Rochester flew nuclear these particles are of galactic or extragalactic origin. At the emulsions and cloud chambers on the same high-altitude highest energies the cosmic-ray arrival direction may also Skyhook balloon flight (Freier et al. 1948) and discovered indicate the approximate direction of the most powerful the presence of heavy nuclei in the primary cosmic radia- sources. tion. Further studies by many other groups soon established Over the past four decades of the space age there has that essentially all of the elements between H and Fe were been a remarkable improvement in the ability of cosmic present in the cosmic radiation near the top of the atmos- ray experiments to measure the elemental and isotopic phere Ð including an overabundance of the light elements composition over a steadily increasing range of charges and Li, Be, and B. Then in 1950 it was found that a significant energy. Space limitations permit discussion of only the fraction of the cosmic radio emission was synchrotron more recent results in a given area. GALACTIC COSMIC RAYS 679
The most remarkable feature of cosmic rays is their E2.7J(E) energy spectra (Figure 1). From ~10 GeV to 1020 eV these spectra, over some 10 orders of magnitude variation in inten- sity show a relatively featureless power-law distribution. At Energy losses + Solar modulation energies below a few giga-electronvolts the influence of solar Bump? modulation becomes important with significant temporal J(E) ∝ E–2.7 variations at 1 AU related to the 11- and 22-year solar and J(E) ∝ E–3.0 heliomagnetic cycles. At energies of less than 40 MeV the Dip? oxygen spectra in Figure 1 show the presence of so-called Knee J(E) ∝ E–2.7 anomalous cosmic rays, discussed in Chapter 4. Those are cut-off? partially ionized interstellar atoms accelerated at the solar Anisotropy <0.1% Composition ‘‘Normal’’ Anisotropy small* Anisotropy? wind termination shock. Near 10 MeV there is a highly vari- with FIP Bias All species present* Composition? able turn-up in the ion spectrum produced by particles of 1010 1012 1014 1016 1018 10 20 solar/interplanetary origin although the acceleration up to Total Energy, E (eV) energies more than tens of giga-electronvolts was registered in some solar flare events. Figure 2 Schematic representation of cosmic ray spectrum If the differential intensity in Figure 1 is multiplied by (Axford 1994). E2.7, where E is the particle energy, then the structure in the spectrum becomes more apparent at energies 10 Gev (Figure 2). The spectral slope steepens and the energy is Above 10 TeV the composition is not well known; this is the proportional to E 3 above 1014 eV. It decreases back to the region where acceleration by supernova shocks becomes form ~E 2.7 above 1018 eV, and extends to ~3 1020 eV. difficult. It is generally assumed that the cosmic-ray “knee” The experimental limitation imposed by the current size at ~1015 eV may reflect a gradual transition in the composi- of air-shower arrays does not allow measurements of the tion to particles of increasingly higher charge. At energies cosmic-ray intensity at energies greater than 3 1020 eV. 1019 eV questions of galactic magnetic confinement lead to the assumption that these particles are of extragalactic origin. At energies above 4 1019 eV even protons should experience significant deceleration by the 3 K black-body radiation (the GreisenÐZatsepinÐKuzmin effect). At energies of 1012Ð1014 eV there are small anisotropies of ~0.1% which are thought to be due to local effects. At this time there are no meaningful anisotropies observed at higher energies except the ultra-high energies ~1018 eV (see Section 6).
1.1 The elemental composition Over the charge region Z 1Ð28 (HÐNi), cosmic-ray experi- ments in space can resolve the individual elements over an extended energy range. A summary of these data (Figure 3) shows the relative abundance of cosmic rays at ~1 AU along with the Solar System abundance (Simpson 1983) for two dif- ferent energy regimes, 70Ð280 MeV/nucleon and 1Ð2 GeV/ nucleon. It can be seen that H and He are the dominant ele- ments, constituting some 98% of the cosmic-ray ions, but are still under-abundant in the cosmic rays relative to the Solar System abundance. There is reasonably good agree- ment between the cosmic-ray and Solar System abundance data for most of the even elements particularly for C, O, Mg, and Fe. The light elements Li, Be and B, and Sc and V in Figure 1 Energy spectrum of cosmic rays measured at the the sub-iron region are greatly over-abundant when com- Earth (Jokipii 1989). pared to the Solar System abundance Ð a result of nuclear 680 FRANK B. MCDONALD AND VLADIMIR S. PTUSKIN
106 dances and plotted against the elements’ first ionization poten- tial (FIP), a well-defined fractionation effect is observed He (Figure 4). Very similar effects are also present in the relative 105 abundance of the solar coronae, solar wind, and gradual solar energetic particle events. However, particles with low FIP tend to be chemically 104 active and form stable compounds and structures such as dust grains. Meyer et al. (1997) have developed in detail the sce- O C nario in which galactic cosmic-ray ions originate predomi- 103 nantly from the gas and dust of the interstellar medium. This Ne model gives a natural explanation of the low abundance of Mg Si Fe H and He. With the presently available data, it is not possible 102 N S to choose between FIP and volatility. The maximum energy A Ca attainable by the shock acceleration of dust grains is of Ti Cr the order of 0.1 MeV/nucleon, so the “FIP v. volatility” argu- Ni 10 ment is really about the injection and initial “energization” Na Al processes. 1 Mn P Cl K 1.2 The ultraheavy nuclei (Z 30) RELATIVE ABUNDANCE –1 10 Co F These elements arise from a combination of s-processes (produced in an environment where it is more probable that V 10 –2 they undergo beta decay before adding another neutron) and r-processes (produced in a neutron-rich environment where Sc beta decay is less probable). It is expected that r-process ele- –3 Li B 10 ments will dominate in explosive nucleosynthesis such as occurs in supernova explosions. Binns et al. (1989) found over the charge range 33 z 60 that the FIP-corrected –4 10 Be observed abundance is similar to that of the Solar System abundance; above Z 60, they found an enhancement of r-process produced nuclei in the source region. 10–5 246810121416182022242628 Of particular importance is the actinide region (Z 88), NUCLEAR CHARGE NUMBER where the elements can only be produced via the r-process. Figure 3 The relative abundances of H to Ni in cosmic rays, These have been studied by an experiment flown on NASA’s solid line and in the Solar System, dashed line. All abun- Long Duration Exposure Facility (LDEF). The experiment dances normalized at Silicam-100 (Simpson 1983). consisted of an array of thick, inert, solid-state nuclear track detector stacks Ð sheets of polycarbonate with a collecting area of 10 m2. They were exposed in Earth orbit spallation in interstellar space by nuclei of higher charge. aboard the LDEF for some 69 months. From this exposure The secondary nuclei generated by these reactions with the Donnelly et al. (1999) measured an actinide/subactinide ratio interstellar gas will have essentially the same velocity as the given by incident primary nuclei and hence the same energy per nucleon. Their energy spectra tend to be steeper than those Z 88 of the primaries due to energy-dependent escape of the 0.0147 0.003 74 Z 87 higher-energy primaries from the Galaxy. It is possible to correct the 1 AU observations for the changes produced by nuclear interactions, ionization energy which is consistent within statistical errors with the abun- losses, and escape from the Galaxy in the journey of the ener- dance ratio 0.013 of propagated primordial Solar System getic nuclei through interstellar space and thus obtain an esti- material. Notice that the propagated present-day Solar mate of the source abundance of these elements. When these System material would give 0.0077 for this ratio. Donnelly source abundances are normalized by their Solar System abun- et al. (1999) found that the measured abundance is consistent 30
GERARD F. GILMORE* Stellar populations and dynamics in the Milky Way galaxy
Our Galaxy offers a unique opportunity to deduce the closest approach to such a view that is possible. An ideal important physics involved in galaxy formation from obser- astrophysicist would have astrometric eyes! vations of those old stars that were formed at the time of the Available information strongly suggests that the Galactic formation of the Milky Way, and whose present properties extreme Population II subdwarf system formed early, though contain some fossil record of the Galaxy’s history. Only in the duration of its aggregation into the proto-Galaxy remains the Milky Way galaxy and its immediate neighbour satellite unclear. This subdwarf system now forms a flattened, pres- galaxies can one obtain the true three-dimensional stellar sure-supported distribution, with axial ratio ~2:1. The thick spatial density distributions, stellar kinematics and stellar disk formed close in time to the subdwarf system, with at chemical abundances. Knowledge of how stars move and least the metal-poor tail of the thick disk being comparable in how they are distributed in space measures the Galactic age to the globular cluster system. The thick disk is probably gravitational potential, including its dark matter content, chemically and kinematically discrete from the Galactic old while knowledge of stellar kinematics, ages and chemistry disk Ð by which we mean those stars of the thin disk with age constrains the star formation and gas accretion history. greater than a few billion years Ð implying a discontinuous The scientific goal is to build on the powerful concept of Galactic history. The inner Galaxy is mostly old, almost cer- stellar populations, which basically distinguishes old, metal- tainly barred, but as yet remains to be studied in detail, espe- poor stars (Population II) from younger, more metal-rich cially in the innermost regions. Importantly, recent dynamical stars (Population I), to include the rich complexity of the analyses lead to the conclusion that there is no statistically real Universe. significant amount of non-luminous mass in the solar neigh- No single section of the electromagnetic spectrum pro- bourhood, and hence no evidence for dissipative dark matter. vides the ‘best’ view of the Galaxy. Rather, all views are We consider in turn the general questions of galaxy for- complementary. However, some views are certainly more mation which the science of stellar population studies aims representative than others. The most fundamental must be a to address, followed by discussion of star count and infrared view of the entire contents of the Galaxy. Such a view studies of Galactic structure, and astrometric and kinematic would require access to a universal property of matter that studies of the Galactic mass distribution. The primary space is independent of the state of that matter. This is provided missions which have contributed to these subjects are the by gravity, since all matter, by definition, has mass. Mass Hubble Space Telescope (HST), whose excellent spatial res- generates the gravitational potential, which in turns defines olution allows study in crowded regions, whose dark sky the size and the shape of the Galaxy. While the most reli- background allows the study of very faint objects and, most able and comprehensive, such a view is also the hardest importantly here, allows reliable image distinction between to derive. Kinematics and distance data are, however, the stars and resolved objects (galaxies); the sequence of infrared observatories, which have allowed observations through the * University of Cambridge, United Kingdom optically thick dust obscuring the inner Galaxy, Infrared
Gerard F. Gilmore, The Century of Space Science, 699Ð719 © 2001 Kluwer Academic Publishers. Printed in The Netherlands. 700 GERARD F. GILMORE
Astronomical Satellite (IRAS), Diffuse Infrared Background where k and mp are the Boltzmann constant and the mass of 6 1 Experiment (DIRBE) on the Cosmic Background Explorer the proton, respectively. Numerically, Tvirial ~10 R50 M12 K (COBE) and the Infrared Space Observatory (ISO); and the for gravitational (half-mass) radius, R, in units of 50 kpc, 12 High Precision Parallax Collecting Satellite (Hipparcos), and mass M in units of 10 ML. Since the disks of spiral arguably ESA’s most substantial contribution to astrophysics galaxies are cold, with T Tvirial, energy must have been in the twentieth century. lost. Since this lost energy was in random motions of individual particles, the only possible loss mechanism is through an inelastic collision, leading to the internal excita- THE FORMATION OF DISK GALAXIES tion of the particles, and subsequent energy loss through radiative de-excitation. Clearly, particles with small cross- Current understanding of the formation and early evolution of section per unit mass for collisions, such as stars, will not disk galaxies allows a description of the important physical dissipate their random kinetic energy efficiently, so that dis- processes at various levels of complexity and generality, and sipation must occur prior to star formation, while the galaxy illustrates what we have learned, what we are attempting still is still gaseous. The virial temperature of a typical galaxy- to learn, and the relative importances of surveys, targeted 6 sized potential well is Tgalaxy ~ 10 K, with a corresponding research projects and newer space-based studies. At one one-dimensional velocity dispersion of ~100 km s 1. extreme, one simply considers the global evolution of a gas The physical conditions in the Universe at the epoch of cloud, and assumes that mean values of relevant parameters galaxy formation (redshift ~a few), as deduced from obser- suffice for an adequate description of generic properties. vations of quasar absorption lines (the GunnÐPeterson test Alternatively, one gives up general applicability and instead for neutral hydrogen), are such that hydrogen is ionized, adopts specific numerical values for those parameters that and correspond to temperatures of the proto-galactic gas quantify the important physics, attempting a detailed con- of ~104 K, with a sound speed of only ~10 km s 1. The frontation of model predictions with observed stellar popu- collapse of this gas in Galactic potential wells will thus lations. The relation of any model prediction to detailed induce supersonic motions, and lead to both thermalization observations at a single radius in a specific galaxy clearly of energy through radiative shocks and subsequent loss of needs to be considered with some care. Mindful of this caveat, energy by cooling. It is this conversion of potential energy, we outline here the most important timescales and physical first to random kinetic energy as described by the virial the- processes that are likely to play a role in the determination orem, and then to radiation via atomic processes, the net of the observable properties of galaxies like the Milky Way. result of which is an increase in the binding energy of the system, that is termed dissipation. General models of dissipational disk galaxy formation The rate at which excited atoms can cool obviously places a fundamental limit on the amount and rate of dissi- The existence of cold, thin Galactic disks has strong pational energy loss, and hence on the maximum rate at implications for galaxy formation. To see this, consider a which a gas cloud can radiate its pressure support and col- standard picture whereby galaxies form from growing lapse. A convenient measure of this timescale is the cooling primordial density perturbations, which expand with the time of a gas cloud, which is the time for radiative processes background Universe until their self-gravity becomes domi- to remove the internal energy of the cloud. Defining the nant and they collapse upon themselves. Were there to be cooling rate per unit volume to be n2 (T ), where n is the no loss of energy in the collapse, and neglecting angular particle number density, and where the functional form of momentum, the transformation of potential energy into is determined by the relative importances of freeÐfree, thermal (kinetic) energy would lead to an equilibrium boundÐfree and boundÐbound transitions and is thus an system with final radius equal to half its size at maximum implicit function of the chemical abundance, gives expansion, supported by random motions of the constituent particles. Thus an equilibrium, purely gaseous proto-galaxy 3nkT T should have temperature t (2) cool 2 n (T ) n GMm p T T (1a) It is usually of most interest to compare this timescale with virial kR the global gravitational free-fall collapse time of a system, and a stellar proto-galaxy should equivalently have velocity which is the time it would take to collapse upon itself if dispersion there were no pressure support. This timescale depends only on the mean density of the system, and is given by