<<

SOME ASPECTS OF QUANTUM THEORY

John Scales Avery

August 22, 2020 INTRODUCTION1

I hope that this book will be of interest to students and researchers in mathematics, and theoretical chemistry. The first few chapters can be read with ease by anyone with a knowledge of calculus and differential equations. However, some later chapters, and most of the appendices, are more demanding.. Chapter 10 deals with resonance energy transfer and especially with the relativistic treatment of this phenomenon. My fascination with this topic dates back to my Ph.D. thesis work in the early 1960’s at Imperial College, which was then a part of the University of London. I had previously been working at the laboratory of Prof. Albert Szent-Gy¨orgyiand the Marine Bi- ological Laboratory at Woods Hole, Massachusetts. The problem on which we had been working was a quantum mechanical treatment of the primary process in photosynthesis, where a photon is absorbed, and its energy stabi- lized. Resonance energy transfer plays a large role in this process2. When I started my Ph.D. work in London, I decided to see whether relativistic corrections made a difference3. My calculations4 showed that while the usual non-relativistic treatment leads to transition probabilities that fall off as 1/R6, the calculated relativistic transition probabilities had a long-range component that fell off as 1/R2. Thus,if we imagine a very large sphere around an excited atom of molecule, the probability that the excitation energy will be transferred to one or another of the acceptors is independent of the size of the sphere! Is this a process that competes with spontaneous photon emission? Or is it an alternative way of treating the joint process of emission and absorption?5 Today, sixty years later, I continue to be fascinated by this question. In Chapter 10 experiments are proposed which could demonstrate that reso- nance energy transfer over macroscopic distance is possible. I am grateful to my son, Associate Professor James Emil Avery of the Institute,

1This book makes some use of my previously published book chapters. 2J.S. Avery, Z. Bay and A. Szent-Gy¨orgyi, On the Energy Transfer in Biological Sys- tems, Proc. Nat. Acad. Sci. (US), 47, 1742-1744, (1961) 3J.S. Avery, Resonance energy transfer and related phenomena, Ph.D. thesis, Imperial College of Science and Technology, University of London, (1965) 4J.S. Avery, Resonance energy transfer and spontaneous photon emission, Proc. Phys. Soc. (London) 88, Part 1, (1966) 5J.S. Avery, Use of the S-Matrix in the Relativistic Treatment of Resonance Energy Transfer. Int. J. , XXV, 79-96, (1984) University of , for his help and advice. He deserves to be listed as co-author of this book. However, I I don’t want him to be blamed for the book’s shortcomings, for example, in case the discussion section of Chapter 10 is seen to be too speculative. Besides the usual topics, the book also focuses on some aspects of quan- tum theory that have been of special interest to myself and to my son, James. Among these special areas of interest is the use of 4-dimensional hyperspher- ical harmonics in reciprocal-space quantum chemistry. We share this interest with Professor Vincenzo Aquilanti and his group at the University of Perugia in Italy6. Both James and I have made numerous research visits to Perugia, where we have enjoyed both the wonderful hospitality and great mathemat- ical knowledge of Prof. Aquilanti and his co-workers. I should mention that James has a number of important papers in which he uses hyperspherical harmonics to calculate 3-center and 4-center interelectron repulsion integrals for exponential-type basis sets (ETO’s).7 James and I are also co-authors of several books on hyperspherical harmonics.8 9 10 My interest in many-dimensional spaces brought me into contact with Professor Dudley R. Herschbach of Harvard University. I have been privileged to visit his brilliant research group many times, and to work closely with Prof. Herschbach and his colleagues for many years.11 12

6Aquilanti, V. and Avery, J., Sturmian expansions for quantum mechanical many-body problems and hyperspherical harmonics, Adv. Quant. Chem., 39 72-101, (2001) 74-center STO interelectron repulsion integrals with Coulomb Sturmians Avery, James Emil & Avery, J. S., (2018), In : Advances in Quantum Chemistry. 76, p. 133-146 8Generalized Sturmians and Atomic Spectra , by J.E. Avery and J.S. Avery, World Scientific Publishing (2006) 9Symmetry-Adapted Basis Sets , by J.S. Avery, S. Rettrup and J.E. Avery, World Scientific Publishing Co, (2012) 10Hyperspherical Harmonics and their Physical Applications, by J.E. Avery and J.S. Avery, World Scientific Publishing Co. (2017) 11Avery, J. and Herschbach, D. R., Hyperspherical Sturmian basis functions, Int. J. Quantum Chem., 41 673, (1992) 12J. Avery, D-Dimensional Hydrogenlike Orbitals, in Dimensional Scaling in Chemical Physics , D.R. Herschbach, J, Avery, and O. Goscinski editors, Kluwer Academic Publish- ers, Dordricht, Netherlands, (1992),pages 139-164 Figure 1: Professor Dudley R. Herschbach accepting the American Institute of Chemistry’s gold medal in 2011. He shared the 1986 in Chemistry for his pioneering contributions to our understanding of the mechanisms of chemical reactions. Figure 2: Professor Vincenzo Aquilanti (born in 1939). After work- ing at Harvard with Dudley Herschbach, he returned to Italy, where he became the head of the chemistry department at the University of Perugia. He and his group have done pioneering theoretical and experimental work on the mechanism of chemical reactions, using molecular beam techniques. Professor Aquilanti and his group have also developed the use of 4-dimensional hy- perspherical harmonics in momentum-space quantum theory, an interest which they share with my son James and myself. Figure 3: Associate Professor James Emil Avery of the , . He is the author of a number of important papers that uses hyperspherical harmonics to calcu- late difficult 3-center and 4-center interelectron repulsion integrals for exponential-type orbitals, and is also the co-author of several books on hyperspherical harmonics and generalized Sturmians. Contents

1 11 1.1 Rutherford’s model of the atom ...... 11 1.2 The Geiger-Marsden scattering experiment ...... 13 1.3 Rutherford’s model of the atom ...... 16 1.4 Informality, enthusiasm and speed ...... 16 1.5 Artificial transmutations of elements ...... 20

2 NIELS BOHR 23 2.1 Christian Bohr’s household ...... 23 2.2 Planck, Einstein and Bohr ...... 26 2.3 Atomic numbers ...... 31 2.4 Bohr’s Institute of Theoretical Physics ...... 32 2.5 Bohr anticipates the nuclear arms race ...... 38

3 SCHRODINGER’S¨ WAVE EQUATION 43 3.1 A wave equation for matter ...... 43 3.2 Felix Bloch’s story about Schr¨odinger...... 46 3.3 Dirac’s relativistic wave equation ...... 46 3.4 Some equations ...... 53

4 HARMONIC POLYNOMIALS AND SPHERICAL HARMONICS 55 4.1 Spherical polar coordinates ...... 55 4.2 The Laplacian operator in spherical coordinates ...... 56 4.3 Homogeneous and harmonic polynomials ...... 57 4.4 Harmonic polynomials and spherical harmonics ...... 57 4.5 An angular integration theorem ...... 61

5 THE SCHRODINGER¨ EQUATION FOR HYDROGEN 63 5.1 Separation of the equation ...... 63 5.2 Solutions to the radial equation ...... 64 5.3 Fock’s momentum-space treatment of hydrogen ...... 68 5.4 The Pauli exclusion principle and the periodic table ...... 72 5.5 Valence bond theory ...... 80

7 8 CONTENTS

5.6 Molecular orbital theory ...... 82 5.7 The Hartree-Fock-Roothaan equations ...... 95 5.8 Koopmans’ theorem ...... 97 5.9 Electron creation and annihilation operators ...... 99 5.10 Quantum chemistry and the development of ...... 100

6 PERIODIC SYSTEMS 119 6.1 The discovery of X-rays ...... 119 6.2 Bragg father and son ...... 121 6.3 J.D. Bernal and Dorothy Crowfoot Hodgkin ...... 125 6.4 The structure of DNA: Molecular ...... 128 6.5 Direct and reciprocal lattice vectors ...... 133 6.6 A H¨uckel calculation for a graphite plane ...... 135 6.7 3-dimensional crystal lattices ...... 136 6.8 Quantum treatment of electrons in crystals ...... 137 6.9 The nearly-free electron approximation ...... 137 6.10 Molecular crystals ...... 140 6.11 Periodic boundary conditions ...... 141 6.12 Homogeneous boundary conditions ...... 142 6.13 Taylor series expansion of the inter-monomer interaction ...... 144 6.14 X-ray diffraction experiments ...... 145

7 HARMONIC OSCILLATORS 147 7.1 Normal modes ...... 147 7.2 Molecular vibrations and rotations ...... 150 7.3 Commutation relations ...... 151 7.4 Phonon creation and annihilation operators ...... 152 7.5 Collections of harmonic oscillators ...... 154

8 THE 155 8.1 Lorentz invariance and 4-vectors ...... 155 8.2 The Dirac equation for an electron in an external electromagnetic potential 157 8.3 Time-independent problems ...... 158 8.4 The Dirac equation for an electron in the field of a nucleus ...... 159

9 INTERACTION BETWEEN MATTER AND RADIATION 163 9.1 Lagrangian densities for fields ...... 163 9.2 Electromagnetic potentials ...... 165 9.3 Separation of the longitudinal and transverse potentials ...... 170 9.4 Linear polarization ...... 171 9.5 Spontaneous photon emission ...... 172 9.6 Photon absorption ...... 173 9.7 Problems with field theories ...... 175 CONTENTS 9

10 RESONANCE ENERGY TRANSFER 179 10.1 Introduction ...... 179 10.2 Review of the Perrin-F¨orstertheory ...... 180 10.3 A relativistic interaction ...... 181 10.4 The Green’s function of the Helmholtz equation ...... 182 10.5 Matrix elements ...... 183 10.6 Transition probability at macroscopic separations ...... 184 10.7 Comparison with spontaneous photon emission ...... 185 10.8 The Perrin-F¨orsterregion ...... 186 10.9 A proposed experiment ...... 187 10.10 Discussion: Direct interparticle interaction ...... 189 10.11 Feynman’s first seminar at Princeton ...... 190 10.12 The absorber theory of radiation ...... 192 10.13 Karl Schwarzschild’s contributions to physics ...... 192

A THE HISTORY OF COMPUTERS 199 A.1 Pascal and Leibniz ...... 199 A.2 Jacquard and Babbage ...... 201 A.3 Harvard’s sequence-controlled calculator ...... 206 A.4 The first electronic computers ...... 207 A.5 Biosemiotics ...... 211 A.6 The Traitorous Eight ...... 214 A.7 Integrated circuits ...... 217 A.8 Moore’s law ...... 218 A.9 Self-reinforcing information accumulation ...... 221 A.10 Automation ...... 222 A.11 Neural networks ...... 226

B GROUP THEORY 239 B.1 Finite Groups ...... 239 B.2 Representations of geometrical symmetry groups ...... 240 B.3 Similarity transformations ...... 241 B.4 Characters and reducibility ...... 242 B.5 The great orthogonality theorem ...... 244 B.6 Classes ...... 248 B.7 Projection operators ...... 249 B.8 The regular representation ...... 251 B.9 Classification of basis functions ...... 252

C NON-EUCLIDIAN GEOMETRY 257 C.1 Albert Einstein and non-Euclidian geometry ...... 257 C.2 Family background ...... 257 C.3 Special relativity theory ...... 259 10 CONTENTS

C.4 General relativity ...... 260 C.5 Metric tensors ...... 262 C.6 The Laplace-Beltrami operator ...... 266 C.7 Geodesics ...... 271

D Sturmian basis sets 273 D.1 One-electron Coulomb Sturmians ...... 273 D.2 L¨owdin-orthogonalizedCoulomb Sturmians ...... 276 D.3 The Fock projection ...... 277 D.4 Generalized Sturmians and many-particle problems ...... 278 D.5 Use of generalized Sturmian basis sets to solve the many-particle Schr¨odinger equation ...... 279 D.6 Momentum-space orthonormality relations for Sturmian basis sets . . . . . 280 D.7 Sturmian expansions of d-dimensional plane waves ...... 282 D.8 An alternative expansion of a d-dimensional plane wave ...... 283

E GENERALIZED STURMIANS APPLIED TO ATOMS 285 E.1 Goscinskian configurations ...... 285 E.2 Relativistic corrections ...... 291 E.3 The large-Z approximation: Restriction of the basis set to an -block . . . 293 E.4 Electronic potential at the nucleus in the large-Z approximationR ...... 294 E.5 Core ionization energies ...... 297 E.6 Advantages and disadvantages of Goscinskian configurations ...... 299 E.7 -blocks, invariant subsets and invariant blocks ...... 300 R E.8 Invariant subsets based on subshells; Classification according to ML and Ms 306 E.9 An atom surrounded by point charges ...... 311

F MOLECULAR ORBITALS BASED ON STURMIANS 319 F.1 The one-electron secular equation ...... 319 F.2 Shibuya-Wulfman integrals and Sturmian overlap integrals evaluated in terms of hyperpherical harmonics ...... 326 F.3 Molecular calculations using the isoenergetic configurations ...... 332 (N) (N) F.4 Building Tν0ν and Sν0ν from 1-electron components ...... 334 F.5 Interelectron repulsion integrals for molecular Sturmians from hyperspheri- cal harmonics ...... 335 F.6 Many-center integrals treated by Gaussian expansions (Appendix E) . . . . 339 F.7 A pilot calculation ...... 343 F.8 Automatic generation of symmetry-adapted basis functions ...... 346 Chapter 1

ERNEST RUTHERFORD

1.1 Rutherford’s model of the atom

In 1895, the year during which Roentgen made his revolutionary discovery of X-rays, a young New Zealander named Ernest Rutherford was digging potatoes on his father’s farm, when news reached him that he had won a scholarship for advanced study in England. Throwing down his spade, Rutherford said, “That’s the last potato I’ll dig!” He postponed his marriage plans and sailed for England, where he enrolled as a research student at University. He began work at the Cavendish Laboratory, under the leadership of J.J. Thomson, the discoverer of the electron. In New Zealand, Rutherford had done pioneering work on the detection of radio waves, and he probably would have continued this work at Cambridge, if it had not been for the excitement caused by the discoveries of Roentgen and Becquerel. Remembering this period of his life, Rutherford wrote: “Few of you can realize the enormous sensation caused by the discovery of X-rays by Roentgen in 1895. It interested not only the scientific man, but also the man in the street, who was excited by the idea of seeing his own insides and his bones. Every laboratory in the world took out its old Crookes’ tubes to produce X-rays, and the Cavendish was no exception.” J.J. Thomson, who was interested in studying ions (charged atoms or molecules) in gases, soon found that gaseous ions could be produced very conveniently by means of X- rays. Rutherford abandoned his research on radio waves, and joined Thomson in this work.

“When I entered the Cavendish Laboratory”, Rutherford remembered later, “I began to work on the ionization of gases by means of X-rays. After reading the paper of Becquerel, I was curious to know whether the ions produced by the radiation from uranium were of the same nature as those produced by X-rays; and in particular, I was interested because Becquerel thought that his radiation was somehow intermediate between light and X-rays.”

“I therefore proceeded to make a systematic examination of the radiation, and I found

11 12 QUANTUM THEORY

Figure 1.1: Rutherford receiving the 1908 Nobel Prize in Chemistry. that it was of two types - one which produced intense ionization, and which was absorbed by a few centimeters of air, and the other, which produced less intense ionization, but was more penetrating. I called these alpha rays and beta rays respectively; and when, in 1898, Villard discovered a still more penetrating type of radiation, he called it gamma-radiation.” Rutherford later showed that the alpha-rays were actually ionized helium atoms thrown out at enormous velocities by the decaying uranium, and that beta-rays were high-speed electrons. The gamma-rays turned out to be electromagnetic waves, just like light waves, but of extremely short wavelength. Rutherford returned briefly to New Zealand to marry his sweetheart, Mary Newton; and then he went to Canada, where he had been offered a post as Professor of Physics at McGill University. In Canada, with the collaboration of the chemist, Frederick Soddy (1877-1956), Rutherford continued his experiments on radioactivity, and worked out a revolutionary theory of transmutation of the elements through radioactive decay. During the middle ages, alchemists had tried to change lead and mercury into gold. Later, chemists had convinced themselves that it was impossible to change one element into another. Rutherford and Soddy now claimed that radioactive decay involves a whole series of transmutations, in which one element changes into another! Returning to England as head of the physics department at University, Rutherford continued to experiment with alpha-particles. He was especially interested in the way they were deflected by thin metal foils. Rutherford and his assistant, Hans Geiger (1886-1945), found that most of the alpha-particles passed through a metal foil with only 1.2. THE GEIGER-MARSDEN SCATTERING EXPERIMENT 13 a very slight deflection, of the order of one degree.

1.2 The Geiger-Marsden scattering experiment

In 1911, a young research student named Ernest Marsden joined the group, and Rutherford had to find a project for him. What happened next, in Rutherford’s own words, was as follows: “One day, Geiger came to me and said, ‘Don’t you think that young Marsden, whom I’m training in radioactive methods, ought to begin a small research?’ Now I had thought that too, so I said, ‘Why not let him see if any alpha-particles can be scattered through a large angle?’ I may tell you in confidence that I did not believe that they would be, since we knew that the alpha-particle was a very fast, massive particle, with a great deal of energy; and you could show that if the scattering was due to the accumulated effect of a number of small scatterings, the chance of an alpha-particle’s being scattered backward was very small.” “Then I remember two or three days later, Geiger coming to me in great excitement and saying, ‘We have been able to get some of the alpha-particles coming backwards’. It was quite the most incredible event that has ever happened to me in my life. It was almost as incredible as if you fired a 15-inch shell at a piece of tissue paper and it came back and hit you.” “On consideration, I realized that this scattering backwards must be the result of a single collision, and when I made calculations, I found that it was impossible to get anything of that order of magnitude unless you took a system in which the greater part of the mass of the atom was concentrated in a minute nucleus.” “It was then that I had the idea of an atom with a minute massive center carrying a charge. I worked out mathematically what laws the scattering should obey, and found that the number of particles scattered through a given angle should be proportional to the thickness of the scattering foil, the square of the nuclear charge, and inversely proportional to the fourth power of the velocity. These deductions were later verified by Geiger and Marsden in a series of beautiful experiments.” 14 QUANTUM THEORY

Figure 1.2: The Geiger-Marsden scattering experiment. To Rutherford’s great surprise, the experiment showed that some of the alpha particles were scat- tered backwards. After treating the problem mathematically, Rutherford con- cluded that most of the mass of an atom must be concentrated in a very small, positively-charged nucleus, around which the much lighter electrons circulate in orbits. 1.2. THE GEIGER-MARSDEN SCATTERING EXPERIMENT 15

Figure 1.3: In Thomson’s model of the atom, the electrons were embedded, like raisins in a pudding, in a diffuse background of positive charge. The Geiger- Marsden experiment forced Rutherford to propose a new model to account for the observed back-scattering of alpha particles. 16 QUANTUM THEORY 1.3 Rutherford’s model of the atom

According to the model proposed by Rutherford in 1911, every atom has an extremely tiny nucleus, which contains almost all of the mass of the atom. Around this tiny but massive nucleus, Rutherford visualized light, negatively-charged electrons circulating in orbits, like planets moving around the sun. Rutherford calculated that the diameter of the whole atom had to be several thousand times as large as the diameter of the nucleus.

1.4 Informality, enthusiasm and speed

Rutherford’s model of the atom explained beautifully the scattering experiments of Geiger and Marsden, but at the same time it presented a serious difficulty: According to Maxwell’s equations, the electrons circulating in their orbits around the nucleus ought to produce electromagnetic waves. It could easily be calculated that the electrons in Rutherford’s atom ought to lose all their energy of motion to this radiation, and spiral in towards the nucleus. Thus, according to classical physics, Rutherford’s atom could not be stable. It had to collapse. Rutherford’s laboratory was like no other in the world, except J.J. Thomson’s. In fact, Rutherford had learned much about how to run a laboratory from his old teacher, Thomson. Rutherford continued Thomson’s tradition of democratic informality and cheerfulness. Like Thomson, he had a gift for infecting his students with his own powerful scientific curiosity, and his enthusiastic enjoyment of research. Thomson had also initiated a tradition for speed and ingenuity in the improvisation of experimental apparatus - the so-called “sealing-wax and string” tradition - and Rutherford continued it. Niels Bohr, after working with Rutherford, was later to continue the tradition of informality and enthusiasm at the Institute for Theoretical Physics which Bohr founded in Copenhagen in 1920. Most scientific laboratories of the time offered a great contrast to the informality, en- thusiasm, teamwork and speed of the Thomson-Rutherford-Bohr tradition. E.E. da C. Andrade, who first worked in Lenard’s laboratory at Heidelberg, and later with Ruther- ford at Manchester, has given the following description of the contrast between the two groups: “At the Heidelberg colloquium, Lenard took the chair, very much like a master with his class. He had the habit, if any aspect of his work was being treated by the speaker, of interrupting with, ‘And who did that first?’ The speaker would reply with a slight bow, ‘Herr Geheimrat, you did that first’, to which Lenard answered, ‘Yes, I did that first’.” “At the Manchester colloquium, which met on Friday afternoons, Rutherford was, as in all his relations with the research workers, the boisterous, enthusiastic, inspiring friend, undoubtedly the leader but in close community with the led, stimulating rather than commanding, ‘gingering up’, to use a favorite expression of his, his team.” Although Rutherford occasionally swore at his “lads”, his affection for them was very real. He had no son of his own, and he became a sort of father to the brilliant young men in 1.4. INFORMALITY, ENTHUSIASM AND SPEED 17

Figure 1.4: Henry Moseley (1887-1915). his laboratory. Their nickname for him was “Papa”. Such was the laboratory which Harry Moseley joined in 1910. At almost the same time, Moseley’s childhood friend, Charles Darwin (the grandson of the “right” Charles Darwin), also joined Rutherford’s team. After working on a variety of problems in radioactivity which were given to him by Rutherford, Moseley asked whether he and Charles Darwin might be allowed to study the spectra of X-rays. At first, Rutherford said no, since no one at Manchester had any experience with X-rays; “and besides”, Rutherford added with a certain amount of bias, “all science is either radioactivity or else stamp-collecting”. However, after looking more carefully at what was being discovered about X-rays, Rutherford gave his consent. In 1912, a revolutionary discovery had been made by the Munich , Max von Laue (1879-1960): It had long been known that because of its wavelike nature, white light can be broken up into the colors of the spectrum by means of a “diffraction grating” - a series of parallel lines engraved very closely together on a glass plate. For each wavelength of light, there are certain angles at which the new wavelets pro- duced by the lines of the diffraction grating reinforce each other instead of cancelling. The angles of reinforcement are different for each wavelength, and thus the different colors are separated by the grating. Max von Laue’s great idea was to do the same thing with X-rays, using a crystal as a diffraction grating. The regular lines of atoms in the crystal, von Laue reasoned, would act be fine enough to fit the tiny wavelength of the X-rays, believed to be less than one ten-millionth of a centimeter. Von Laue’s experiment, performed in 1912, had succeeded beautifully, and his new 18 QUANTUM THEORY

Figure 1.5: Sir (1887-1962), grandson of the “right” Charles Darwin. 1.4. INFORMALITY, ENTHUSIASM AND SPEED 19 technique had been taken up in England by a father and son team, Bragg (1862-1942) and William (1890-1971). The Braggs had used X-ray diffrac- tion not only to study the spectra of X-rays, but also to study the structure of crystals. Their techniques were later to become one of the most valuable research tools available for studying molecular structure. Having finally obtained Rutherford’s permission, Moseley and Darwin threw themselves into this exciting field of study. Remembering his work with Harry Moseley, Charles Darwin later wrote: “Working with Moseley was one of the most strenuous exercises I have ever undertaken. He was, without exception, the hardest worker I have ever known... There were two rules for his work: First, when you started to set up the apparatus for an experiment, you must not stop until it was set up. Second, when the apparatus was set up, you must not stop work until the experiment was done. Obeying these rules implied a most irregular life, sometimes with all-night sessions; and indeed, one of Moseley’s experteses was the knowledge of where in Manchester one could get a meal at three in the morning.” After about a year, Charles Darwin left the experiments to work on the theoretical aspects of X-ray diffraction. (He was later knighted for his distinguished contributions to theoretical physics.) Moseley continued the experiments alone, systematically studying the X-ray spectra of all the elements in the periodic system. Niels Bohr had shown that the binding energies of the allowed orbits in a hydrogen atom are equal to Rydberg’s constant , R (named after the distinguished Swedish spectroscopist, Johannes Robert Rydberg), divided by the square of an integral “quantum number”, n. He had also shown that for heavier elements, the constant, R, is equal to the square of the nuclear charge, Z, multiplied by a factor which is the same for all elements. The constant, R, could be observed in Moseley’s studies of X-ray spectra: Since X-rays are produced when electrons are knocked out of inner orbits and outer electrons fall in to replace them, Moseley could use the Planck-Einstein relationship between frequency and energy to find the energy difference between the orbits, and Bohr’s theory to relate this to R. Moseley found complete agreement with Bohr’s theory. He also found that the nuclear charge, Z, increased regularly in integral steps as he went along the rows of the periodic table: Hydrogen had Z=1, helium Z=2, lithium Z=3, and so on up to uranium with Z=92. The 92 electrons of a uranium atom made it electrically neutral, exactly balancing the charge of the nucleus. The number of electrons of an element, and hence its chemical properties, Moseley found, were determined uniquely by its nuclear charge, which Moseley called the “atomic number”. Moseley’s studies of the nuclear charges of the elements revealed that a few elements were missing. In 1922, Niels Bohr received the Nobel Prize for his quantum theory of the atom; and he was able to announce at the presentation ceremony that one of Moseley’s missing elements had been found at his institute. Moseley, however, was dead. He was one of the ten million young men whose lives were needlessly thrown away in Europe’s most tragic blunder - the First World War. 20 QUANTUM THEORY 1.5 Artificial transmutations of elements

During the First World War, Rutherford’s young men had joined the army, and he had been forced to spend most of his own time working on submarine detection. In spite of this, he had found some spare time for his scientific passion - bombarding matter with alpha particles. Helped by his laboratory steward, Kay, Rutherford had studied the effects produced when alpha particles from a radium source struck various elements. In a letter to Niels Bohr, dated December 9, 1917, Rutherford wrote: “I have got, I think, results that will ultimately have great importance. I wish that you were here to talk matters over with me. I am detecting and counting the lighter atoms set in motion by alpha particles, and the results, I think, throw a good deal of light on the character and distribution of forces near the nucleus... I am trying to break up the atom by this method. In one case, the results look promising, but a great deal of work will be required to make sure. Kay helps me, and is now an expert counter. Best wishes for a happy Christmas.” In July, 1919, Bohr was at last able to visit Manchester, and he heard the news directly from his old teacher: Rutherford had indeed produced artificial nuclear transmutations! In one of his experiments, an alpha-particle (i.e. a helium nucleus with nuclear charge 2) was absorbed by a nitrogen nucleus. Later, the compound nucleus threw out a proton with charge 1; and thus the bombarded nucleus gained one unit of charge. It moved up one place in the periodic table and became an isotope of oxygen. Bohr later wrote: “I learned in detail about his great new discovery of controlled, or so-called artificial, nuclear transmutations, by which he gave birth to what he liked to call ‘modern Alchemy’, and which in the course of time, was to give rise to such tremendous consequences as regards man’s mastery of the forces of nature.” Other scientists rushed to repeat and extend Rutherford’s experiments. Particle ac- celerators were built by E.O. Lawrence (1901-1958) in California, by J.H. van de Graff (1901-1967) at the Massachusetts Institute of Technology and by John Cockcroft (1897- 1967), working with Rutherford at the Cavendish Laboratory. These accelerators could hurl protons at energies of a million electron-volts. Thus, protons became another type of projectile which could be used to produce nuclear transmutations.

Suggestions for further reading 1. McKown, Robin. Giant of the Atom, Ernest Rutherford. Julian Messner Inc, New York, (1962). 2. McLintock, A.H. Rutherford, Sir Ernest (Baron Rutherford of Nelson, O.M., F.R.S.). An Encyclopaedia of New Zealand (1966 ed.). Te Ara - The Encyclopedia of New Zealand, (18 September 2007). 3. Heilbron, J. L. Ernest Rutherford and the Explosion of Atoms. Oxford: , (2003). 4. Dacey, James. What was Rutherford’s greatest discovery?. Physics World, (1 Septem- ber 2011). 1.5. ARTIFICIAL TRANSMUTATIONS OF ELEMENTS 21

5. Aaserud, Finn Kokowski, M. (ed.). Niels Bohr’s Mission for an ’Open World’ (PDF). Proceedings of the 2nd ICESHS. Cracow. pp. 706-709, (2006). 6. Aaserud, Finn; Heilbron, J. L. Love, Literature and the Quantum Atom: Niels Bohr’s 1913 Trilogy Revisited. Oxford: Oxford University Press, (2013). 7. Bohr, Niels. Nobel Prize Lecture: The Structure of the Atom (excerpts). In French, A. P.; Kennedy, P. J. (eds.). Niels Bohr: A Centenary Volume. Cambridge, Mas- sachusetts: Harvard University Press. pp. 91-97, (1985) [1922]. 8. Cockroft, John D. Niels Henrik David Bohr. 1885-1962. Biographical Memoirs of Fellows of the Royal Society. 9: 36-53, (1 November 1963). 9. Favrholdt, David. Niels Bohr’s Philosophical Background. Copenhagen: Munks- gaard, (1992). 10. Faye, January. Niels Bohr: His Heritage and Legacy. Dordrecht: Kluwer Academic Publishers, (1991). 11. Heilbron, John L. Bohr’s First Theories of the Atom. In French, A. P.; Kennedy, P. J. (eds.). Niels Bohr: A Centenary Volume. Cambridge, Massachusetts: Harvard University Press. pp. 33-49, (1985). 12. Heisenberg, Elisabeth. Inner Exile: Recollections of a Life With . Boston: Birkh¨auser,(1984). 13. Hilberg, Raul. The Destruction of the European Jews. 2. New Haven, Connecticut: Yale University Press, (1961). 14. Jammer, Max. The Conceptual Development of Quantum Mechanics. Los Angeles: Tomash Publishers, (1989). 15. Jones, R.V. Most Secret War. London: Hamilton, (1978). 16. Kieler, JA¸rgen.˜ Resistance Fighter: A Personal History of the Danish Resistance. Translated from the Danish by Eric Dickens. Jerusalem: Gefen Publishing House, (2007). 17. Kragh, Helge. The Theory of the Periodic System. In French, A. P.; Kennedy, P. J. (eds.). Niels Bohr: A Centenary Volume. Cambridge, Massachusetts: Harvard University Press. pp. 50-67, (1985). 18. Kragh, Helge. Niels Bohr and the quantum atom: the of atomic structure, 1913-1925. Oxford: Oxford University Press, (2012). 19. Medawar, Jean; Pyke, David. Hitler’s Gift: The True Story of the Scientists Expelled by the Nazi Regime. New York: Arcade Publishing, (2001). 20. Pais, Abraham. Niels Bohr’s Times, In Physics, Philosophy and Polity. Oxford: Clarendon Press, (1991). 21. Powers, Thomas. Heisenberg’s War: The Secret History of the German Bomb. New York: Knopf, (1993). 22. Rhodes, Richard. The Making of the Atomic Bomb. New York: Simon and Schuster, (1986). 23. Richardson, W. Mark; Wildman, Wesley J., eds. Religion and Science: History, Method, Dialogue. London, New York: Routledge, (1996). 24. Rife, Patricia. and the Dawn of the Nuclear Age. Boston: Birkh¨auser, (1999). 22 QUANTUM THEORY

25. Rozental, Stefan. Niels Bohr: His Life and Work as Seen by his Friends and Col- leagues. Amsterdam: North-Holland, (1967). 26. Stadtler, Bea; Morrison, David Beal; Martin, David Stone. : A History of Courage and Resistance. West Orange, New Jersey: Behrman House, (1995). Chapter 2

NIELS BOHR

2.1 Christian Bohr’s household

Christian Bohr (1855-1911) was appointed professor of physiology at the University of Copenhagen in 1886. In this position, he made a number of important discoveries con- nected with respiration in mammals, including what is now known as the “Bohr effect”, + i.e. the tendency of high concentrations of CO2 and of H ions to increase the efficiency of hemoglobin in releasing oxygen. Christian Bohr was also the teacher of , who later won a Nobel Prize in and Physiology. Christian Bohr’s wife, Ellen Adler Bohr, belonged to a wealthy Jewish banking family, and Niels Bohr was born in the impressive multi-story Adler mansion that still stands today near one of Copenhagen’s canals opposite the Danish Parliament. During the time that Niels and Harold Bohr were growing up, this house was the meeting place for many of Copenhagen’s leading intellectuals, and the boys were allowed to attend meetings where scientific and philosophical questions were debated. This upbringing contributed to the fact that both Niels and Harold later became famous in their respective fields, physics and mathematics. The Bohr family has produced outstanding scientists for four generations. Besides Christian, Niels and Harold Bohr, there is also Niels’ son Aage, who shared a Nobel Prize in Physics for his work on the excited states of nuclei. Aage’s sons, Vilhelm and Thomas, are also outstanding scientists. Having been brought up in a highly intellectual household, Niels Bohr’s scientific abili- ties developed early. In 1905, when Niels was 20, a gold medal competition was announced by the Royal Danish Society of Sciences and Letters. The challenge was to investigate a method for determining the surface tension of liquids. The method had been proposed earlier by Lord Raleigh, and it involved measuring the frequency of oscillations on the surface of a water jet. After working in his father’s laboratory, making his own glassware to produce elliptical water jets, and presenting his results together with a mathematical analysis, Niels Bohr won the gold medal.

23 24 QUANTUM THEORY

Figure 2.1: Christian Bohr (1855-1911), the father of Niels and Harold Bohr. He was Professor of Physiology at the University of Copenhagen. 2.1. CHRISTIAN BOHR’S HOUSEHOLD 25

Figure 2.2: Niels Bohr (1885-1952) as a young man.

Figure 2.3: Niels Bohr and his wife, Margrethe. 26 QUANTUM THEORY 2.2 Planck, Einstein and Bohr

According to the model proposed by Rutherford in 1911, every atom has an extremely tiny nucleus, which contains almost all of the mass of the atom. Around this tiny but massive nucleus, Rutherford visualized light, negatively-charged electrons circulating in orbits, like planets moving around the sun. Rutherford calculated that the diameter of the whole atom had to be several thousand times as large as the diameter of the nucleus. Rutherford’s model of the atom explained beautifully the scattering experiments of Geiger and Marsden, but at the same time it presented a serious difficulty: According to Maxwell’s equations, the electrons circulating in their orbits around the nucleus ought to produce electromagnetic waves. It could easily be calculated that the electrons in Rutherford’s atom ought to lose all their energy of motion to this radiation, and spiral in towards the nucleus. Thus, according to classical physics, Rutherford’s atom could not be stable. It had to collapse. Niels Bohr became aware of this paradox when he worked at Rutherford’s Manchester laboratory during the years 1911-1913. Bohr was not at all surprised by the failure of classical concepts when applied to Rutherford’s nuclear atom. Since he had been educated in , he was more familiar with the work of German than were his English colleagues at Manchester. In particular, Bohr had studied the work of (1858-1947) and Albert Einstein (1879-1955). Just before the turn of the century, the German physicist, Max Planck, had been studying theoretically the electromagnetic radiation coming from a small hole in an oven. The hole radiated as though it were an ideally black body. This “black body radiation” was very puzzling to the physicists of the time, since classical physics failed to explain the frequency distribution of the radiation and its dependence on the temperature of the oven. In 1901, Max Planck had discovered a formula which fitted beautifully with the exper- imental measurements of the frequency distribution of black body radiation; but in order to derive his formula, he had been forced to make a radical assumption which broke away completely from the concepts of classical physics. Planck had been forced to assume that light (or, more generally, electromagnetic radia- tion of any kind) can only be emitted or absorbed in amounts of energy which Planck called “quanta”. The amount of energy in each of these “quanta” was equal to the frequency of the light multiplied by a constant, h, which came to be known as “Planck’s constant”. This was indeed a strange assumption! It seemed to have been pulled out of thin air; and it had no relation whatever to anything that had been discovered previously in physics. The only possible justification for Planck’s quantum hypothesis was the brilliant success of his formula in explaining the puzzling frequency distribution of the black body radiation. Planck himself was greatly worried by his own radical break with classical concepts, and he spent many years trying unsuccessfully to relate his quantum hypothesis to classical physics. In 1905, Albert Einstein published a paper in the Annalen der Physik in which he applied Planck’s quantum hypothesis to the photoelectric effect. (At that time, Einstein was 25 years old, completely unknown, and working as a clerk at the Swiss Patent Office.) 2.2. PLANCK, EINSTEIN AND BOHR 27

Figure 2.4: Niels Bohr and Albert Einstein in a photo by Paul Ehrenfest. Public domain, Wikimedia Commons

The photoelectric effect was another puzzling phenomenon which could not in any way be explained by classical physics. The German physicist Lenard had discovered in 1903 that light with a frequency above a certain threshold could knock electrons out of the surface of a metal; but below the threshold frequency, nothing at all happened, no matter how long the light was allowed to shine. Using Planck’s quantum hypothesis, Einstein offered the following explanation for the photoelectric effect: A certain minimum energy was needed to overcome the attractive forces which bound the electron to the metal surface. This energy was equal to the threshold frequency multiplied by Planck’s constant. Light with a frequency equal to or higher than the threshold frequency could tear an electron out of the metal; but the quantum of energy supplied by light of a lower frequency was insufficient to overcome the attractive forces. Einstein later used Planck’s quantum formula to explain the low-temperature behavior of the specific heats of crystals, another puzzling phenomenon which defied explanation by classical physics. These contributions by Einstein were important, since without this supporting evidence it could be maintained that Planck’s quantum hypothesis was an ad hoc assumption, introduced for the sole purpose of explaining black body radiation. As a student, Niels Bohr had been profoundly impressed by the radical ideas of Planck and Einstein. In 1912, as he worked with Rutherford at Manchester, Bohr became con- vinced that the problem of saving Rutherford’s atom from collapse could only be solved by means of Planck’s quantum hypothesis. Returning to Copenhagen, Bohr continued to struggle with the problem. In 1913, he found the solution: The electrons orbiting around the nucleus of an atom had “angular momentum”. Assuming circular orbits, the angular momentum was given by the product of 28 QUANTUM THEORY

the mass and velocity of the electron, multiplied by the radius of the orbit. Bohr introduced a quantum hypothesis similar to that of Planck: He assumed that the angular momentum of an electron mevr in an allowed orbit, (multiplied by 2 pi), had to be equal to an integral multiple of Planck’s constant h. The lowest value of the integer, n=1, corresponded to the lowest allowed orbit. Thus, in Bohr’s model, the collapse of Rutherford’s atom was avoided.

mevr = n~ n = 1, 2, 3, 4, .... h ~ (2.1) ≡ 2π

Here me is the mass of the electron, v is its velocity and r is the radius of its classical orbit around the nucleus, so that mevr is the classical expression for the angular momentum of an electron in a circular orbit. In Bohr’s quantization of angular momentum, h represents Planck’s constant, which had been introduced by Max Planck to explain the frequency distribution of black body radiation. Bohr calculated that the binding energies of the various allowed electron orbits in a hydrogen atom should be a constant divided by the square of the integer n; and he calculated the value of the constant to be 13.5 electron-Volts. This value fit exactly the observed ionization energy of hydrogen. After talking with the Danish spectroscopist, H.M. Hansen, Bohr realized with joy that by combining his formula for the allowed orbital energies with the Planck-Einstein formula relating energy to frequency, he could explain the mysterious line spectrum of hydrogen.

2 1 2 1 Ze E = mev = −2 −2 r n2~2 rn = 2 Ze me 2 2 Z e me En = − 2~2n2 Z2e2m 1 1 ∆E = e (2.2) 2 2 n − n ~  1 2  When Niels Bohr published all this in 1913, his paper produced agonized cries of “foul!” from the older generation of physicists. When Lord Rayleigh’s son asked him if he had seen Bohr’s paper, Rayleigh replied: “Yes, I have looked at it; but I saw that it was of no use to me. I do not say that discoveries may not be made in that sort of way. I think very likely they may be. But it does not suit me.” However, as more and more atomic spectra and properties were explained by extensions of Niels Bohr’s theories, it became clear that Planck, Einstein and Bohr had uncovered a whole new stratum of phenomena, previously unsuspected, but of deep and fundamental importance. 2.2. PLANCK, EINSTEIN AND BOHR 29

Figure 2.5: Line spectra of hydrogen. The figure show the emission spectrum energies associated with transitions between allowed energies. These transition energies fitted perfectly with Niels Bohr’s calculations. As more and more atomic spectra and properties were explained by extensions of Niels Bohr’s theories, it became clear that Planck, Einstein and Bohr had uncovered a whole new stratum of phenomena, previously unsuspected, but of deep and fundamental importance. 30 QUANTUM THEORY

Figure 2.6: Another photo of Bohr and Einstein by Ehrenfest. 2.3. ATOMIC NUMBERS 31 2.3 Atomic numbers

Bohr’s atomic theory soon received strong support from the experiments of one of the brightest of Rutherford’s bright young men - Henry Moseley (1887-1915). Moseley came from a distinguished scientific family. Not only his father, but also both his grandfathers, had been elected to the Royal Society. After studying at Oxford, where his father had once been a professor, Moseley found it difficult to decide where to do his postgraduate work. Two laboratories attracted him: the great J.J. Thomson’s Cavendish Laboratory at Cambridge, and Rutherford’s laboratory at Manchester. Finally, he decided on Manchester, because of the revolutionary discoveries of Rutherford, who two years earlier had won the 1908 Nobel Prize for Chemistry. Rutherford’s laboratory was like no other in the world, except J.J. Thomson’s. In fact, Rutherford had learned much about how to run a laboratory from his old teacher, Thomson. Rutherford continued Thomson’s tradition of democratic informality and cheerfulness. Like Thomson, he had a gift for infecting his students with his own powerful scientific curiosity, and his enthusiastic enjoyment of research. Thomson had also initiated a tradition for speed and ingenuity in the improvisation of experimental apparatus - the so-called “sealing-wax and string” tradition - and Rutherford continued it. Niels Bohr, after working with Rutherford, was later to continue the tradition of informality and enthusiasm at the Institute for Theoretical Physics which Bohr founded in Copenhagen in 1920. Niels Bohr had shown that the binding energies of the allowed orbits in a hydrogen atom are equal to Rydberg’s constant , R (named after the distinguished Swedish spectroscopist, Johannes Robert Rydberg), divided by the square of an integral “quantum number”, n. He had also shown that for heavier elements, the constant, R, is equal to the square of the nuclear charge, Z, multiplied by a factor which is the same for all elements. The constant, R, could be observed in Moseley’s studies of X-ray spectra: Since X-rays are produced when electrons are knocked out of inner orbits and outer electrons fall in to replace them, Moseley could use the Planck-Einstein relationship between frequency and energy to find the energy difference between the orbits, and Bohr’s theory to relate this to R. Moseley found complete agreement with Bohr’s theory. He also found that the nuclear charge, Z, increased regularly in integral steps as he went along the rows of the periodic table: Hydrogen had Z=1, helium Z=2, lithium Z=3, and so on up to uranium with Z=92. The 92 electrons of a uranium atom made it electrically neutral, exactly balancing the charge of the nucleus. The number of electrons of an element, and hence its chemical properties, Moseley found, were determined uniquely by its nuclear charge, which Moseley called the “atomic number”. Moseley’s studies of the nuclear charges of the elements revealed that a few elements were missing. In 1922, Niels Bohr received the Nobel Prize for his quantum theory of the atom; and he was able to announce at the presentation ceremony that one of Moseley’s missing elements had been found at his institute. Moseley, however, was dead. He was one of the ten million young men whose lives were needlessly thrown away in Europe’s most tragic blunder - the First World War. 32 QUANTUM THEORY

Figure 2.7: Niels Bohr with his sons at their summer house in Tisvilde.

2.4 Bohr’s Institute of Theoretical Physics

In 1916, Niels Bohr was appointed professor of theoretical physics at the University of Copenhagen, a post that had been created especially for him. The following year, in 1917, he started to raise money for the construction of a new institute in which his new department could be housed. The project received large contributions from the Danish government and the Carlsberg Foundation, and from wealthy Danish businessmen. Bohr himself designed the building, which opened in 1920. During the period when Hitler’s Nazi party was coming to power in , Bohr was able to offer a refuge at his Institute of Theoretical Physics to many important physicists who could no longer remain in Germany. Those to whom Bohr gave refuge included Guido Beck, Felix Bloch, , , Otto Frisch, , Lise Meitner, , Eugene Rabinowitch, Stefan Rozental, Erich Ernst Schneider, , Arthur von Hippel and . Because of this, because of Bohr’s dynamic and inspiring presence, and because he was able to continue the tradition of informality, enthusiasm and speed which characterized J.J. Thomson’s Cavendish and Rutherford’s Manchester laboratories, Bohr’s institute became the world’s most important center for theoretical physics, especially during the 1930’s. Bohr was tirelessly energetic. He liked to discuss his ideas in dialogue with one of the bright young men at his institute, putting forward an idea, and expecting a counter- argument to be thrown back. It was like a game of ping-pong. In this way, a new idea could be tested by exploring all of its consequences. When a new scientist arrived at his institute, Bohr liked to invite the newcomer to accompany him on a two-day walking tour to his summer house in Tisvilde, about 50 2.4. BOHR’S INSTITUTE OF THEORETICAL PHYSICS 33 kilometers north of Copenhagen. In his autobiographical book “Physics and Beyond”, Werner Heisenberg describes such a two-man tour together with Bohr. This custom allowed Bohr to get to know both the personality and the potential scientific contributions of the new arrival. It also allowed Bohr to get some exercise and to keep himself in good physical condition.

The Nazi occupation of Denmark On 9 April, 1940, Nazi Germany invaded and quickly occupied Denmark. The Germans explained that their purpose was “to protect Denmark from a British invasion”. During the first three years of occupation the Germans allowed the Danish government, police force and army to exist. However, in 1943, after extensive sabotage actions by the Danish resistance movement, the German policy changed and became much harsher. Shortly after this sudden change, the Danes became aware that their Jewish population was in danger of being arrested and sent to concentration camps. Luckily it was possible for Danish citizens to organize a secret rescue operation, in which almost all members of Denmark’s Jewish community escaped to Sweden in small boats. Among them were Niels Bohr and his son Aage.

Niels and fly to England After some time in Sweden, where he helped to organize aid for Jewish refugees from Denmark, Niels Bohr and his son Aage flew to England in a small aircraft. It flew at a high altitude in order to avoid observation. Niels Bohr’s oxygen mask did not fit properly because of his unusually large head, and he became unconscious. Luckily this was noticed before anything very serious happened. 34 QUANTUM THEORY

Figure 2.8: The Institute of Theoretical Physics, established by Niels Bohr at the University of Copenhagen. Today it is known as the Niels Bohr Institute

Figure 2.9: Another view of the Niels Bohr Institute. 2.4. BOHR’S INSTITUTE OF THEORETICAL PHYSICS 35

Figure 2.10: Aage Bohr (1922-2008), one of Niels and Margrethe Bohr’s sons. Together with Ben Mottelson, he was awarded the 1975 Nobel Prize in Physics for developing a successful theory of the excited states of nuclei. 36 QUANTUM THEORY

Figure 2.11: Ben Roy Mottelson (born in 1926), who shared the 1975 Nobel Prize in Physics with Aage Bohr. Although now very old, he still comes in to work at the Niels Bohr Institute. 2.4. BOHR’S INSTITUTE OF THEORETICAL PHYSICS 37

Figure 2.12: George de Hevesy (1885-1966), co-discoverer of the element Haffnium, and pioneer of the use of radioactive tracer elements in biochem- istry. He received the Nobel Prize in Chemistry in 1943 for work which he performed at the Niels Bohr Institute. The name “Haffnium” is derived from the Latin name for Copenhagen. 38 QUANTUM THEORY 2.5 Bohr anticipates the nuclear arms race

After escaping from Denmark to Sweden in a fishing boat in 1943, Niels Bohr and his son Aage flew to England, and then to Los Alamos in the United States, where work on a nuclear bomb was in progress. In 1943, a special intelligence unit called “Aslos” had been set up to determine how far German work on a nuclear bomb had progressed. Advanced units, entering mainland Europe after D-Day, interviewed captured German scientists and found that the German program had never come near to producing a nuclear bomb. The news that the Germans would not produce atomic bombs was classified as a secret. Nevertheless, it passed through the grapevine to the scientists working on the atomic bomb project in America; and it reversed their attitude to the project. Until then, they had been worried that Hitler would be the first to produce nuclear weapons. In 1944, they began to worry instead about what the American government might do if it came to possess such weapons. At Los Alamos, Niels Bohr became the center of discussion and worry about the ethics of continued work on the bomb project. He was then 59 years old; and he was universally respected both for his pioneering work in , and for his outstandingly good character. Bohr was extremely worried because he foresaw a postwar nuclear arms race unless international control of atomic energy could be established. Consequently, as a spokesman for the younger atomic scientists, he approached both Roosevelt and Churchill to urge them to consider means by which international control might be established. Roosevelt, too, was worried about the prospect of a postwar nuclear armaments race; and he was very sympathetic towards Bohr’s proposals for international control. He sug- gested that Bohr travel to England and contact Churchill, to obtain his point of view. Churchill was desperately busy, and basically unsympathetic towards Bohr’s proposals; but on May 16, 1944, he agreed to a half-hour interview with the scientist. The meeting was a complete failure. Churchill and his scientific advisor, Lord Cherwell, spent most of the time talking with each other, so that Bohr had almost no time to present his ideas. Although he could be very persuasive in long conversations, Bohr was unable to present his thoughts briefly. He wrote and spoke in a discursive style, similar to that of Henry James. Each of his long, convoluted sentences was heavily weighted with qualifications and dependent clauses. At one point in the conversation, Churchill turned to Lord Cherwell and asked: “What’s he talking about, physics or politics?” Bohr’s low, almost whispering, way of speaking irritated Churchill. Furthermore, the two men were completely opposed in their views: Bohr was urging openness in approach- ing the Russians, with a view to establishing international control of nuclear weapons. Churchill, a defender of the old imperial order, was concerned mainly with maintaining British and American military supremacy. After the interview, Churchill became worried that Bohr would give away “atomic secrets” to the Russians; and he even suggested that Bohr be arrested. However, Lord Cherwell explained to the Prime Minister that the possibility of making atomic bombs, as well as the basic means of doing so, had been common knowledge in the international 2.5. BOHR ANTICIPATES THE NUCLEAR ARMS RACE 39 scientific community ever since 1939. After his disastrous interview with Churchill, Niels Bohr carefully prepared a memoran- dum to be presented to President Roosevelt. Realizing how much depended on its success or failure, Bohr wrote and rewrote the memorandum, sweating in the heat of Washington’s summer weather. Aage Bohr, who acted as his father’s secretary, typed the memorandum over and over, following his father’s many changes of mind. Finally, in July, 1944, Bohr’s memorandum was presented to Roosevelt. It contains the following passages: “...Quite apart from the question of how soon the weapon will be ready for use, and what role it will play in the present war, this situation raises a number of problems which call for urgent attention. Unless, indeed, some agreement about the control of the new and active materials can be obtained in due time, any temporary advantage, however great, may be outweighed by a perpetual menace to human society.” “Ever since the possibilities of releasing atomic energy on a vast scale came into sight, much thought has naturally been given to the question of control; but the further the exploration of the scientific problems is proceeding, the clearer it becomes that no kind of customary measures will suffice for this purpose, and that the terrifying prospect of a future competition between nations about a weapon of such formidable character can only be avoided by a universal agreement in true confidence...” Roosevelt was sympathetic with the ideas expressed in this memorandum. In an inter- view with Bohr, he expressed his broad agreement with the idea of international control of atomic energy. Unfortunately, the President had only a few months left to live. Roosevelt’s successor, Harry Truman, had not known about the existence of nuclear weapons before taking office, and he was cautiously feeling his way. Meanwhile, General Leslie Groves, the military commander of the Los Alamos project, was very anxious to get credit for ending World War II, rather than being blamed for wasting billions of dollars of the taxpayers’ money. It was easy for Groves to convince Truman to give the order to drop bombs on Hiroshima and Nagasaki. Thus Bohr’s efforts to prevent this tragedy failed, and the postwar nuclear arms race which he anticipated still casts a dark shadow over the future of human civilization and the biosphere.

Suggestions for further reading 1. McKown, Robin. Giant of the Atom, Ernest Rutherford. Julian Messner Inc, New York, (1962). 2. McLintock, A.H. Rutherford, Sir Ernest (Baron Rutherford of Nelson, O.M., F.R.S.). An Encyclopaedia of New Zealand (1966 ed.). Te Ara - The Encyclopedia of New Zealand, (18 September 2007). 3. Heilbron, J. L. Ernest Rutherford and the Explosion of Atoms. Oxford: Oxford University Press, (2003). 4. Dacey, James. What was Rutherford’s greatest discovery?. Physics World, (1 Septem- ber 2011). 40 QUANTUM THEORY

5. Aaserud, Finn Kokowski, M. (ed.). Niels Bohr’s Mission for an ’Open World’ (PDF). Proceedings of the 2nd ICESHS. Cracow. pp. 706-709, (2006). 6. Aaserud, Finn; Heilbron, J. L. Love, Literature and the Quantum Atom: Niels Bohr’s 1913 Trilogy Revisited. Oxford: Oxford University Press, (2013). 7. Bohr, Niels. Nobel Prize Lecture: The Structure of the Atom (excerpts). In French, A. P.; Kennedy, P. J. (eds.). Niels Bohr: A Centenary Volume. Cambridge, Mas- sachusetts: Harvard University Press. pp. 91-97, (1985) [1922]. 8. Cockroft, John D. Niels Henrik David Bohr. 1885-1962. Biographical Memoirs of Fellows of the Royal Society. 9: 36-53, (1 November 1963). 9. Favrholdt, David. Niels Bohr’s Philosophical Background. Copenhagen: Munks- gaard, (1992). 10. Faye, January. Niels Bohr: His Heritage and Legacy. Dordrecht: Kluwer Academic Publishers, (1991). 11. Heilbron, John L. Bohr’s First Theories of the Atom. In French, A. P.; Kennedy, P. J. (eds.). Niels Bohr: A Centenary Volume. Cambridge, Massachusetts: Harvard University Press. pp. 33-49, (1985). 12. Heisenberg, Elisabeth. Inner Exile: Recollections of a Life With Werner Heisenberg. Boston: Birkh¨auser,(1984). 13. Hilberg, Raul. The Destruction of the European Jews. 2. New Haven, Connecticut: Yale University Press, (1961). 14. Jammer, Max. The Conceptual Development of Quantum Mechanics. Los Angeles: Tomash Publishers, (1989). 15. Jones, R.V. Most Secret War. London: Hamilton, (1978). 16. Kieler, JA¸rgen.˜ Resistance Fighter: A Personal History of the Danish Resistance. Translated from the Danish by Eric Dickens. Jerusalem: Gefen Publishing House, (2007). 17. Kragh, Helge. The Theory of the Periodic System. In French, A. P.; Kennedy, P. J. (eds.). Niels Bohr: A Centenary Volume. Cambridge, Massachusetts: Harvard University Press. pp. 50-67, (1985). 18. Kragh, Helge. Niels Bohr and the quantum atom: the Bohr model of atomic structure, 1913-1925. Oxford: Oxford University Press, (2012). 19. Medawar, Jean; Pyke, David. Hitler’s Gift: The True Story of the Scientists Expelled by the Nazi Regime. New York: Arcade Publishing, (2001). 20. Pais, Abraham. Niels Bohr’s Times, In Physics, Philosophy and Polity. Oxford: Clarendon Press, (1991). 21. Powers, Thomas. Heisenberg’s War: The Secret History of the German Bomb. New York: Knopf, (1993). 22. Rhodes, Richard. The Making of the Atomic Bomb. New York: Simon and Schuster, (1986). 23. Richardson, W. Mark; Wildman, Wesley J., eds. Religion and Science: History, Method, Dialogue. London, New York: Routledge, (1996). 24. Rife, Patricia. Lise Meitner and the Dawn of the Nuclear Age. Boston: Birkh¨auser, (1999). 2.5. BOHR ANTICIPATES THE NUCLEAR ARMS RACE 41

25. Rozental, Stefan. Niels Bohr: His Life and Work as Seen by his Friends and Col- leagues. Amsterdam: North-Holland, (1967). 26. Stadtler, Bea; Morrison, David Beal; Martin, David Stone. The Holocaust: A History of Courage and Resistance. West Orange, New Jersey: Behrman House, (1995). 42 QUANTUM THEORY Chapter 3

SCHRODINGER’S¨ WAVE EQUATION

3.1 A wave equation for matter

In 1926, the difficulties surrounding the “old quantum theory” of Max Planck, Albert Einstein and Niels Bohr were suddenly solved, and its true meaning was understood. Two years earlier, a French aristocrat, Louis de Broglie, writing his doctoral dissertation at the Sorbonne in Paris, had proposed that very small particles, such as electrons, might exhibit wave-like properties. The ground state and higher excited states of the electron in Bohr’s model of the hydrogen atom would then be closely analogous to the fundamental tone and higher overtones of a violin string. Almost the only person to take de Broglie’s proposal seriously was Albert Einstein, who mentioned it in one of his papers. Because of Einstein’s interest, de Broglie’s matter-waves came to the attention of other physicists. The Austrian theoretician, Erwin Schr¨odinger, working at Z¨urich, searched for the underlying wave equation which de Broglie’s matter- waves obeyed. Schr¨odinger’sgifts as a mathematician were so great that it did not take him long to solve the problem. The Schr¨odingerwave equation for matter is now considered to be more basic than Newton’s equations of motion. The wave-like properties of matter are not apparent to us in our daily lives because the wave-lengths are extremely small in comparison with the sizes of objects which we can perceive. However, for very small and light particles, such as electrons moving in their orbits around the nucleus of an atom, the wave-like behavior becomes important. Schr¨odingerwas able to show that Niels Bohr’s atomic theory, including Bohr’s seem- ingly arbitrary quantization of angular momentum, can be derived by solving the wave equation for the electrons moving in the attractive field of the nucleus. The allowed orbits of Bohr’s theory correspond in Schr¨odinger’stheory to harmonics, similar to the funda- mental harmonic and higher overtones of an organ pipe or a violin string. (If Pythagoras had been living in 1926, he would have rejoiced to see the deepest mysteries of matter

43 44 SCHRODINGER’S¨ WAVE EQUATION explained in terms of harmonics!) Bohr himself believed that a complete atomic theory ought to be able to explain the chemical properties of the elements in Mendel´eev’speriodic system. Bohr’s 1913 theory failed to pass this test, but the new de Broglie-Schr¨odingertheory succeeded! Through the work of Pauli, Heitler, London, Slater, Pauling, Hund, Mulliken, H¨uckel and others, who applied Schr¨odinger’s wave equation to the solution of chemical problems, it became appar- ent that the wave equation could indeed (in principle) explain all the chemical properties of matter. Strangely, the problem of developing the fundamental quantum theory of matter was solved not once, but three times in 1926! At the University of G¨ottingenin Germany, (1882-1970) and his brilliant young students Werner Heisenberg and Pascal Jordan solved the problem in a completely different way, using matrix methods. At the same time, a theory similar to the “matrix mechanics” of Heisenberg, Born and Jordan was developed independently at Cambridge University by a 24 year old mathematical genius named Paul Adrian Maurice Dirac. At first, the Heisenberg-Born-Jordan-Dirac quantum theory seemed to be completely different from the Schr¨odingertheory; but soon the G¨ottingenmathematician David Hilbert (1862-1943) was able to show that the theories were really identical, although very differently expressed. 3.1. A WAVE EQUATION FOR MATTER 45

Figure 3.1: Bust of Erwin Schr¨odingerin the courtyard arcade of the main building, University of . 46 SCHRODINGER’S¨ WAVE EQUATION 3.2 Felix Bloch’s story about Schr¨odinger

There is an interesting story about Erwin Schr¨odinger’sderivation of his famous wave equation. According to the solid state physicist Felix Bloch, Peter Debye was chairing a symposium in Z¨urich, , at which de Broglie’s waves were being discussed. At one point during the symposium, Debye said: “Well, if there are waves associated with every particle, there must be a wave equation.” Then, turning to Schr¨odinger,he said: “You, Erwin. You’re not doing anything important at the moment. Why don’t you find the wave equation obeyed by de Broglie’s waves?” During the following weekend, the whole group started off for a skiing trip. “Come with us, Erwin!”, they said, but Schr¨odingerreplied: “No, forgive me, I think I will stay here and work.” By the end of the weekend he had derived his famous non-relativistic wave equation. He had first tried a relativistic equation (now known as the Klein-Gordon equation), but had rejected it because he believed that the equation had to be first-order in time. Later, Felix Bloch asked Peter Debye, “Aren’t you sorry that you didn’t derive the wave equation yourself, instead of giving the job to Schr¨odinger?”Debye replied wistfully, “At least I was right about the need for a wave equation, wasn’t I?”

3.3 Dirac’s relativistic wave equation

In 1928, P.A.M. Dirac derived a relativistic wave equation that was first-order in time. To do this, he made use of a set of four anticommuting matrices. Solutions to the Dirac equation in the absence of external fields also obey the Klein-Gordon equation, which is second-order in time, the equation that Schr¨odingerfirst tried and then abandoned. Dirac’s relativistic equation explained for the first time many details of the spectrum of hydrogen, but critics complained that it predicted the existence of negative energy states, and they asked, “Why don’t the positive energy electrons fall down into these states?” Dirac replied “Because the negative energy states are all occupied.” ‘But then”, the critics said, “an extremely energetic photon could create an electron-hole pair!” “Keep looking”, Dirac answered, “and you will find that it sometimes happens.” Thus, an astonishing consequence of Dirac’s relativistic wave equation was the prediction of the existence of antimatter! Years passed. Then, in 1932, the physicist Carl David Anderson observed in a cos- mic ray photographic plate an event that confirmed Dirac’s prediction of the existence of antimatter. A highly-energetic photon was annihilated, and converted into an electron- antielectron pair. The antielectron was given the name “positron”. Since that time, the antiparticles of other particles have been discovered, created in high-energy events where a photon is annihilated and a particle-antiparticle pair created. 3.3. DIRAC’S RELATIVISTIC WAVE EQUATION 47

Figure 3.2: Carl David Anderson in 1936. 48 SCHRODINGER’S¨ WAVE EQUATION

Figure 3.3: Louis Victor Pierre Raymond, duc de Broglie, (1892-1987). 3.3. DIRAC’S RELATIVISTIC WAVE EQUATION 49

Figure 3.4: Heisenberg in 1933 50 SCHRODINGER’S¨ WAVE EQUATION

Figure 3.5: P.A.M. Dirac, the greatest British physicist of the 20th century. A memorial inscribed with his relativistic wave equation stands in Westminister Cathedral, near to the statue of Newton. 3.3. DIRAC’S RELATIVISTIC WAVE EQUATION 51

Figure 3.6: Niels Bohr, Werner Heisenberg and Wolfgang Pauli, c. 1935. 52 SCHRODINGER’S¨ WAVE EQUATION

Figure 3.7: Peter Debye, (1884-1966). 3.4. SOME EQUATIONS 53 3.4 Some equations

The relativistic relationship between energy and momentum

E2 p2c2 = m2c4 (3.1) − Here E stands for energy, p for momentum, m for mass, and c for the velocity of light.

The Klein-Gordon equation

~2 ∂2 + ~2 2 ψ = m2c2ψ (3.2) −c2 ∂2t ∇   The Klein-Gordon equation can be derived from equation 3.1 by making the substitutions

~ ∂ E x4 ict → i ∂x4 ≡ ~ ∂ pj j = 1, 2, 3 (3.3) → i ∂xj

where ~ is Planck’s constant divided by 2π.

Schr¨odinger’snon-relativistic wave equation The non-relativistic relationship between energy and momentum is given by p2 E = c p2 + m2c2 + V + V m2c2 >> p2 (3.4) ≈ 2m p Schr¨odinger’snon-relativistic wave equation,

~2 2 + V ψ = Eψ (3.5) −2m∇   can be derived by making the substitutions

~ ∂ pj j = 1, 2, 3 (3.6) → i ∂xj If the ψ has time-dependence of the form

ψ(x, t) = ψ(x)eiEt/~ (3.7) then we can write ∂ψ i~ = Hψ (3.8) ∂t 54 SCHRODINGER’S¨ WAVE EQUATION where

~2 H 2 + V ≡ −2m∇   3 ∂2 2 (3.9) ∇ ≡ ∂x2 j=1 j X Suggestions for further reading 1. Akhlesh Lakhtakia. Models and Modelers of Hydrogen: Thales, Thomson, Ruther- ford, Bohr, Sommerfeld, Goudsmit, Heisenberg, Schr¨odinger, Dirac, Sallhofer. World Scientific, (1996). 2. O’Connor, John J.; Robertson, Edmund F., Erwin Schr¨odinger, MacTutor History of Mathematics archive, . 3. Halpern, Paul. Einstein’s Dice and Schr¨odinger’s Cat. Perseus Books Group, (2015). 4. Heitler, W. Erwin Schr¨odinger.1887-1961. Biographical Memoirs of Fellows of the Royal Society. 7: 221-226, (1961). 5. Farmelo, Graham. The Strangest Man: The Hidden Life of Paul Dirac, Quantum Genius. Faber and Faber, (2009). 6. Dalitz, R. H.; Peierls, R. Paul Adrien Maurice Dirac. 8 August 1902 - 20 October 1984. Biographical Memoirs of Fellows of the Royal Society. 32: 137-185, (1986). Chapter 4

HARMONIC POLYNOMIALS AND SPHERICAL HARMONICS

4.1 Spherical polar coordinates

Spherical harmonics are very useful in many branches of physics and engineering. They are especially used when the problem to be solved has spherical symmetry. In our case, we shall need spherical harmonics to solve the spherically symmetric problem of the Schr¨odinger equation for hydrogenlike atoms:

~2 e2Z2 2 ψ(x) = Eψ(x) (4.1) −2m ∇ − r  e 

Here me is the electron’s mass, e is its charge, Z is the atomic number of the atom, r is the distance between the electron and the nucleus, and ~ is Planck’s constant divided by 2π. It is convenient to solve this problem using spherical polar coordinates:

x2 + y2 + z2 = r z p cos(θ) = r y tan(ϕ) = (4.2) r The meaning of r, θ and ϕ is illustrated in the figure shown below.

55 56 QUANTUM THEORY

Figure 4.1: This figure illustrates the meaning of spherical polar coordinates. In geography, θ would represent latitude, while ϕ would represent longitude.

4.2 The Laplacian operator in spherical coordinates

The Laplacian operator, 2, expressed in terms of spherical polar coordinates, is given by ∇ ∂2 ∂2 ∂2 2 + + ∇ ≡ ∂x2 ∂y2 ∂z2 1 ∂ ∂ 1 ∂ ∂ 1 ∂2 = r2 + sin θ + (4.3) r2 ∂r ∂r r2 sin θ ∂θ ∂θ r2 sin2 θ ∂ϕ2 The details of the derivation of this equation are given in Appendix B. As we shall see, the spherical harmonics Yl,m(θ, ϕ) obey the relationship

2 l(l + 1) Yl,m(θ, ϕ) = Yl,m(θ, ϕ) (4.4) ∇ − r2 Thus, if we let

ψ(x) = R(r)Yl,m(θ, ϕ) (4.5) the Schr¨odingerequation for hydrogenlike (one electron) atoms is separable:

2 1 ∂ 2 ∂ l(l + 1) R(r)Yl,m(θ, ϕ) = r R(r)Yl,m(θ, ϕ) (4.6) ∇ r2 ∂r ∂r − r2  

Dividing both sides by Yl,m(θ, ϕ), we find that the radial part of the Schr¨odingerequation for one-electron atoms must obey the equation

~2 1 ∂ ∂ e2Z2 l(l + 1) r2 + R(r) = ER(r) (4.7) −2m r2 ∂r ∂r − r r2   4.3. HOMOGENEOUS AND HARMONIC POLYNOMIALS 57 4.3 Homogeneous and harmonic polynomials

Let us consider a 3-dimensional space whose Cartesian coordinates are x, y and z.A polynomial in these coordinates is defined to be a sum of terms in which the coordinates are raised to various powers. A polynomial is said to be homogeneous if, in each term, the sum of the powers is the same as it is in all the other terms. A homogeneous polynomial h is harmonic if it also obeys the Laplace equation: ∂2 ∂2 ∂2 2h(x, y, z) + + h(x, y, z) = 0 (4.8) ∇ ≡ ∂x2 ∂y2 ∂z2   For example, we can think of the two polynomials 2 2 f2 = x + y h = x2 y2 (4.9) 2 − Both f2 and h2 are homogeneous, but only h2 is harmonic. The reader is invited to invent a few other polynomials that illustrate this difference.

4.4 Harmonic polynomials and spherical harmonics

Harmonic polynomials are very closely related to spherical harmonics. The order l of a harmonic polynomial is the sum of the powers to which the coordinates are raised in each term. Thus, for example, the order of h2 in the previous equation is 2. For each harmonic polynomial hl, there is a spherical harmonic related to it by h (x, y, z) Y (θ, ϕ) = l (4.10) l rl The quantum number m is absent from this relationship because the spherical harmonic ∂ generated in this way is not necessarily an eigenfunction of the operator ∂ϕ . However Yl(θ, ϕ) always satisfies the relationship

2 l(l + 1) Yl(θ, ϕ) = Yl(θ, ϕ) (4.11) ∇ − r2

The proof is as follows: For any homogeneous polynomial fl of order l, 2 β β−2 β 2 r fl = β(β + 2l + 1)r fl + r fl (4.12) ∇ ∇ This relationship can be proved  directly by carrying out the differentiation: 3 2 β ∂ β−1 ∂r β ∂fl r fl = βr fs + r ∇ ∂xj ∂xj ∂xj j=1    X 3 2 β−4 2 β−2 β−2 ∂fl β ∂ fl = β(β 2)r xj fl + βr fs + 2βr xj + r 2 − ∂xj ∂x j=1 j X   β−2 β 2 = β(β + 2l + 1)r fl + r fl (4.13) ∇ 58 QUANTUM THEORY

2 If the homogeneous function in is also harmonic, then (since hl = 0) ∇ 2 β β−2 r hl = β(β + 2l + 1)r hl (4.14) ∇ In the special case where β = l, this becomes − h l(l + 1) h 2 l = l (4.15) ∇ rl − r2 rl

l or, letting Yl = hl/r ,

2 l(l + 1) Yl = Yl (4.16) ∇ − r2 which is what we wanted to prove. Hyperspherical harmonics are the d-dimensional ana- logues of spherical harmonics. Like 3.dimensional spherical harmonics, they can be gener- ated from harmonic polynomials.The 4-dimensional hyperspherical harmonics, a table of which is shown below, are of particular interest because from them, the Fourier transforms of hydrogenlike atomic orbitals can be generated, as we shall see in Chapter 5. 4.4. HARMONIC POLYNOMIALS AND SPHERICAL HARMONICS 59

xj Table 4.1: 3-dimensional spherical harmonics, uj j = 1, 2, 3 ≡ r

l m √4π Yl,m(u) √4π Yl,m(θ, ϕ)

0 0 1 1

1 1 3 (u + iu ) 3 eiϕ sin(θ) − 2 1 2 − 2 q q

1 0 √3u3 √3 cos(θ)

1 -1 3 (u iu ) 3 e−iϕ sin(θ) 2 1 − 2 2 q q

1 15 2 1 15 2iϕ 2 2 2 2 2 (u1 + iu2) 2 2 e sin (θ) q q

2 1 15 u (u + iu ) 15 eiϕ sin(θ) cos(θ) 2 3 1 2 − 2 q q

2 0 1 √5(3u2 1) 1 √5 (3 cos2(θ) 1) 2 3 − 2 −

2 -1 15 u (u iu ) 15 e−iϕ sin(θ) cos(θ) 2 3 1 − 2 2 q q

2 -2 1 15 (u iu )2 1 15 e−2iϕ sin2(θ) 2 2 1 − 2 2 2 q q 60 QUANTUM THEORY

Table 4.2: 4-dimensional hyperspherical harmonics

2 2 2 2 √3(u1 + iu2)

2 2 1 2√3u (u + iu ) − 3 1 2 λ l m √2π Yλ,l,m(u) 2 2 0 √2(u2 + u2 2u2) − 1 2 − 3 0 0 0 1 2 2 1 2√3u (u iu ) − 3 1 − 2 1 1 1 √2(u + iu ) − 1 2 2 2 2 √3(u iu )2 − 1 − 2 1 1 0 2u − 3 2 1 1 2√3 u (u + iu ) − 4 1 2 1 1 1 √2(u iu ) − 1 − 2 2 1 0 2√6 u4u3

1 0 0 2u4 2 1 1 2√3 u (u iu ) − 4 1 − 2

2 0 0 3u2 u2 u2 u2 4 − 1 − 2 − 3 4.5. AN ANGULAR INTEGRATION THEOREM 61 4.5 An angular integration theorem

Theorem Let x n1 x n2 x nd I(n) dΩ 1 2 d (4.17) ≡ r r ··· r Z       where x1, x2, . . . , xd are the Cartesian coordinates of a d-dimensional space, dΩd is the generalized solid angle, r is the hyperradius, and Re[nj] > 1, j = 1, 2, 3, ..., d. Then − d 2 1 nj + 1 I(n) = 1 + eiπnj Γ (4.18) Γ d+n 2 2 2 j=1   Y   where d

n nj (4.19) ≡ j=1 X Proof Consider the integral

∞ d ∞ 2 2 d−1 −r n1 n2 nd nj −x dr r e dΩ x x x = dxj x e j (4.20) 1 2 ··· d j 0 j=1 −∞ Z Z Y Z The right-hand side of (4.20) can be evaluated and has the form

d ∞ d nj −x2 1 iπn nj + 1 dx x e j = 1 + e j Γ (4.21) j j 2 2 j=1 Z−∞ j=1   Y Y  provided that Re[nj] > 1, j = 1, 2, 3, ..., d. The left-hand side of (4.20) can be written as −

∞ n n n 2 x 1 x 2 x d I(n) d + n dr rd+n−1e−r dΩ 1 2 d = Γ (4.22) 0 r r ··· r 2 2 Z Z         Combining equations (4.20)-(4.22), we obtain (4.18), Q.E.D.

Comments If all the nj’s are zero or positive integers, then (4.18) reduces to

d/2 d π 0 (nj 1)!! if all the nj s are even 2(n/2−1)Γ d+n − I(n) =  2 j=1 (4.23)  Y   0 otherwise   62 QUANTUM THEORY

When d = 3, (4.23) becomes

3 4π 0 n n n (nj 1)!! all nj s even x1 1 x2 2 x3 3 (n + 1)!! − dΩ3 =  j=1 (4.24) r r r  Y Z        0 otherwise  while when d = 4, (4.23) tells us that

x n1 x n2 x n3 x n4 dΩ 1 2 3 4 4 r r r r Z 2  4       4π 0 (nj 1)!! all nj s even (n + 2)!! − =  j=1 (4.25)  Y  0 otherwise  Let us now consider a general polynomial (not necessarily homogeneous) of the form:

n1 n2 nd P (x) = cn x1 x2 xd (4.26) n ··· X Then we have

n1 n2 nd n dΩd P (x) = cn dΩ x1 x2 xd = cn r I(n) (4.27) n ··· n Z X Z X It can be seen that equation (4.18) can be used to evaluate the generalized angular integral of any polynomial whatever, regardless of whether or not it is homogeneous. The utility of these relationships is very great indeed. They provide a method for developing the theory of angular momentum and hyperangular momentum which supple- ments the usual group-theoretical methods. For example, Clebsch-Gordan coefficients and generalized Clebsch-Gordan coefficients may be generated by means of (4.23), rather than through equations derived from group theory, which become quite complicated when they are generalized to d dimensions. Chapter 5

THE SCHRODINGER¨ EQUATION FOR HYDROGEN

5.1 Separation of the equation

The Schr¨odingerequation for hydrogenlike (1-electron) atoms is

~2 e2Z2 2 ψ(x) = Eψ(x) (5.1) −2m ∇ − r  e  If we let

ψ(x) = R(r)Yl,m(θ, ϕ) (5.2) the Schr¨odingerequation for hydrogenlike (one electron) atoms is separable:

2 1 ∂ 2 ∂ l(l + 1) R(r)Yl,m(θ, ϕ) = r R(r)Yl,m(θ, ϕ) (5.3) ∇ r2 ∂r ∂r − r2  

Dividing both sides by Yl,m(θ, ϕ), we find that the radial part of the Schr¨odingerequation for one-electron atoms must obey the equation

~2 1 d d e2Z2 l(l + 1) r2 + R(r) = ER(r) (5.4) −2m r2 dr dr − r r2   where we have made use of the relationship

2 l(l + 1) Yl,m(θ, ϕ) = Yl,m(θ, ϕ) (5.5) ∇ − r2 and where we have replaced partial derivatives in the radial equation by ordinary deriva- tives, since we now have a differential equation in a single variable.

63 64 QUANTUM THEORY 5.2 Solutions to the radial equation

If we let

ρ Zr (5.6) ≡ and 2E  = (5.7) Z2 then the radial equation becomes:

1 d 2 d l(l + 1) 2 ρ + +  Rn,l(ρ) = 0 (5.8) ρ2 dρ dρ − ρ2 ρ   Equation (5.8) has solutions of the form

l −ρ/(n) Rn,l(ρ) = Nn,l ρ e F [ l + 1 n 2l + 2 2ρ/n)] (5.9) − | | where

Z3/2 (l + n)! 1/2 2 l+2 N = (5.10) n,l 2(2l + 1)! (n 1 l)! n  − −    and where F [a b x] is a confluent hypergeometric function: | | ax a(a + 1)x2 a(a + 1)(a + 2)x3 F [a b x] 1 + + + + ... (5.11) | | ≡ b b(b + 1)2! b(b + 1)(b + 2)3!

The confluent hypergeometric series terminates and reduces to a polynomial when a is a negative integer. In our case this means that l + 1 n must be a negative integer, and thus, for the series to terminate, as is required for finiteness− at large values of r, l cannot exceed n 1. A table of the first few radial wave functions for hydrogenlike atoms is shown below: − 5.2. SOLUTIONS TO THE RADIAL EQUATION 65

Table 5.1: Radial wave functions for hydrogenlike atoms

n l Rnl(r)

1 0 e−ZrZ3/2

e−Zr/2Z3/2(1− Zr ) 2 0 √ 2 2

−Zr/2 3/2 2 1 e √Z Zr 2 6

 2  2e−Zr/3Z3/2 2Zr − 2Zr +1 3 0 √ 27 3 3 3

3 1 4 2 e−Zr/3Z3/2 1 Zr Zr 27 3 − 6 q 

2 2 −Zr/3 3/2 2 3 2 81 15 e Z Zr q 66 QUANTUM THEORY

Table 5.2: Atomic orbitals

n l m √4πRn,l(r)Yl,m(θ, ϕ)

1 0 0 e−ZrZ3/2

e−Zr/2Z3/2(1− Zr ) 2 0 0 √ 2 2

−Zr/2 3/2 2 1 1 e √Z Zr 3 eiϕ sin(θ) − 2 6 2 q

−Zr/2 3/2 2 1 0 e √Z Zr √3 cos(θ) 2 6

−Zr/2 3/2 2 1 -1 e √Z Zr 3 e−iϕ sin(θ) 2 6 2 q

 2  2e−Zr/3Z3/2 2Zr − 2Zr +1 3 0 0 √ 27 3 3 3

3 1 1 4 2 e−Zr/3Z3/2 1 Zr Zr 3 eiϕ sin(θ) − 27 3 − 6 2 q  q

3 1 0 4 2 e−Zr/3Z3/2 1 Zr Zr√3 cos(θ) 27 3 − 6 q 

3 1 -1 4 2 e−Zr/3Z3/2 1 Zr Zr 3 e−iϕ sin(θ) 27 3 − 6 2 q  q 5.2. SOLUTIONS TO THE RADIAL EQUATION 67

Table 5.3: Atomic orbitals (continued)

n l m √4πRn,l(r)Yl,m(θ, ϕ)

2 2 −Zr/3 3/2 2 1 15 2iϕ 2 3 2 2 81 15 e Z Zr 2 2 e sin (θ) q q

3 2 1 2 2 e−Zr/3Z3/2Zr2 15 eiϕ sin(θ) cos(θ) − 81 15 2 q q

3 2 0 2 2 e−Zr/3Z3/2Zr2 1 √5 (3 cos2(θ) 1) 81 15 2 − q

2 2 −Zr/3 3/2 2 15 −iϕ 3 2 -1 81 15 e Z Zr 2 e sin(θ) cos(θ) q q

2 2 −Zr/3 3/2 2 1 15 −2iϕ 2 3 2 -2 81 15 e Z Zr 2 2 e sin (θ) q q 68 QUANTUM THEORY 5.3 Fock’s momentum-space treatment of hydrogen

In a brilliant 1935 paper, the Russian physicist V. Fock was able to show that a rela- tionship exists between 4-dimensional hyperspherical harmonics and the solutions to the Fourier transformed Schr¨odingerfor hydrogenlike (1-electron) atoms. In direct space, the Schr¨odingerequation (in atomic units) for an electron moving in the potential V (x) is

1 2 + V (x) ψ(x) = Eψ(x) (5.12) −2∇   We can let 1 ψ(x) = d3p eip·xψt(p) (5.13) (2π)3/2 Z where 1 ψt(p) = d3x e−ip·xψ(x) (5.14) (2π)3/2 Z Substituting (5.13) into (5.12), we have

1 p2 d3p + V (x) E eip·xψt(p) = 0 (5.15) (2π)3/2 2 − Z   0 We now multiply on the left by e−ip ·x and integrate over d3x. This gives:

p02 1 E ψt(p0) = − d3p V t(p0 p) ψt(p) (5.16) 2 − (2π)3/2 −   Z which is the 1-particle Schr¨odingerequation in reciprocal space. For hydrogenlike atoms, Z V (x) = (5.17) − r so that from (??),

t 2 Z V (p) = 2 (5.18) −rπ p Letting

2E = k2 (5.19) − and combining (5.16), (5.17) and (5.18), we obtain Z 1 p02 + k2 ψt(p0) = d3p ψt(p) (5.20) π2 p0 p 2 Z | − |   5.3. FOCK’S MOMENTUM-SPACE TREATMENT OF HYDROGEN 69

Fock then made the transformation: 2kp u = 1 sin χ sin θ cos ϕ 1 k2 + p2 ≡ 2kp u = 2 sin χ sin θ sin ϕ 2 k2 + p2 ≡ 2kp u = 3 sin χ cos θ 3 k2 + p2 ≡ k2 p2 u = − cos χ (5.21) 4 k2 + p2 ≡ Here θ and ϕ are the polar angles of the vector p:

p1 = p sin θ cos ϕ

p2 = p sin θ sin ϕ

p3 = p cos θ (5.22) while k2 p2 2kp χ cos−1 − = sin−1 (5.23) ≡ k2 + p2 k2 + p2     is an angle introduced by Fock in order to transform the integral d3p into an integral over solid angle in a 4-dimensional space. Fock’s transformation maps the 3-dimensional p-space onto the surface of a unit sphere in a 4-dimensional space. It is easy to verify from (5.21) that

2 2 2 2 u1 + u2 + u3 + u4 = 1 (5.24)

From the Jacobian of the transformation from Cartesian coordinates to 4-dimensional hyperspherical coordinates, one finds that the element of solid angle in the 4-dimensional space is given by

dΩ = sin2 χ sin θ dχdθdϕ 2kp 2 = sin θ dχdθdϕ (5.25) k2 + p2   Comparing this with

d3p = p2dp sin θ dθdϕ (5.26) and making use of the fact that dχ 2k = (5.27) dp k2 + p2 70 QUANTUM THEORY

we have

2k 3 dΩ = d3p k2 + p2   k2 + p2 3 d3p = dΩ (5.28) 2k   Also, from (5.21), we have:

4k2 p p0 = · (k2 + p2)(k2 + p02)(u u0 u u0 ) · − 4 4 1 4k2 1 = (5.29) p p0 2 (k2 + p2)(k2 + p02) u u0 2 | − | | − | Inserting (5.28) and (5.29) into (5.20), we obtain:

2 2 2 2 Z (k + p ) p02 + k2 ψt(p0) = dΩ ψt(p) (5.30) 2kπ2 u0 u 2 Z | − |   We now let 4k5/2 ψt(p) = ϕ(Ω) (5.31) (k2 + p2)2

(As shown in Section 5.3 below, the factor 4k5/2 in the numerator is needed to normalize ψt(p)). Equation (5.30) then takes on the simple form

Z 1 ϕ(Ω0) = dΩ ϕ(Ω) (5.32) 2kπ2 u0 u 2 Z | − | From equation (??), with d = 4 and α = d/2 1 = 1, we have − ∞ 1 = C1(u u0) (5.33) u0 u 2 λ · λ | − | X=0 so that (5.32) becomes

∞ Z ϕ(Ω0) = dΩ C1(u u0) ϕ(Ω) (5.34) 2kπ2 λ · λ X=0 Z Remembering equation (??) we can rewrite this in the form

∞ Z ϕ(Ω0) = K O [ϕ(Ω)] (5.35) 2kπ2 λ λ λ X=0 5.3. FOCK’S MOMENTUM-SPACE TREATMENT OF HYDROGEN 71

For d = 4, I(0) 2π2 K = = (5.36) λ λ + 1 λ + 1 so that equation (5.35) becomes: ∞ Z 1 ϕ(Ω0) = O [ϕ(Ω0)] (5.37) k λ + 1 λ λ X=0 If ϕ(Ω) is an eigenfunction of Λ2. so that 0 Oλ0 [ϕ(Ω )] = δλ0λϕ(Ω) (5.38) then (5.37) will be satisfied provided that Z = 1 (5.39) k(λ + 1) or, from (5.19), k2 Z2 Z2 E = = = λ = 0, 1, 2... n = 1, 2, 3... (5.40) − 2 −2(λ + 1)2 −2n2 where we have made the identification λ + 1 = n. We can see that Fock’s treatment gives the usual energy levels for hydrogenlike atoms. For the transformed wave function ϕ(Ω), any 4-dimensional hyperspherical harmonic will do, but for most applications, it is convenient to use hyperspherical harmonics of the type shown in Table 2.1. Thus we obtain the Fourier transformed hydrogenlike orbitals: 5/2 t 4k ψ (p) = Yn− ,l,m(Ω ) M(p)Yn− ,l,m(Ω ) n,l,m (k2 + p2)2 1 4 ≡ 1 4 4k5/2 M(p) (5.41) ≡ (k2 + p2)2 For the first few values of n, l and m, 5.41 yields: 2√2 k5/2 ψt (p) = 1,0,0 (k2 + p2)2π 4√2 k5/2(k2 p2) ψt (p) = − 2,0,0 (k2 + p2)3π 8ik7/2(p ip ) ψt (p) = 1 − 2 2,1,−1 − (k2 + p2)3π 8i√2 k7/2 ψt (p) = 2,1,0 −(k2 + p2)π 8ik7/2(p + ip ) ψt (p) = 1 2 2,1,1 (k2 + p2)3π ...... (5.42) 72 QUANTUM THEORY

To see how Fock’s reciprocal space solutions to the hydrogenlike wave equation are related to the familiar hydrogenlike orbitals, we can make a table of hydrogenlike orbitals with Z/n replaced by the constant k. The radial functions become

0 3/2 −kr R1,0(r) = 2k e R0 (r) = 2k3/2(1 kr)e−kr 2,0 − 3/2 0 2k −kr R , (r) = kr e 2 1 √3 2k2r2 R0 (r) = 2k3/2 1 2kr + e−kr 3,0 − 3   ...... (5.43) and so on, and the corresponding wave functions will be

0 χn,l,m(x) = Rn,l(r)Yl,m(Ω3) (5.44)

As you can verify, taking the Fourier transforms of the wave functions defined by equations (5.43) and (5.44), and making the substitutions shown in equation (5.21), we obtain the Fourier transformed solutions of V. Fock, equation (5.41). But this set of solutions is not quite the same as a set of familiar hydrogenlike orbitals because Z/n is everywhere replaced by the constant k. A set of Fock’s solutions corresponding to a particular value of k is called a set of Coulomb Sturmians. Such a set obeys a potential-weighted orthonormality relation, as we will discuss in detail in Chapters 6 and 7.

5.4 The Pauli exclusion principle and the periodic ta- ble

Bohr himself believed that a complete atomic theory ought to be able to explain the chem- ical properties of the elements in Mendel´eev’speriodic system. Bohr’s 1913 theory failed to pass this test, but the new de Broglie-Schr¨odingertheory succeeded! Through the work of Pauli, Heitler, London, Slater, Pauling, Hund, Mulliken, H¨uckel and others, who applied Schr¨odinger’swave equation to the solution of chemical problems, it became apparent that the wave equation could indeed (in principle) explain all the chemical properties of matter.

The solutions to Schr¨odinger’swave equation for an electron moving in the field of a nucleus are called atomic orbitals, and the first few of them are shown in Figure 11.6.They are analogous to the harmonics of a violin string or an organ pipe, except the they are three-dimensional. The electron had been shown to have a magnetic moment, and in a magnetic field, it was found to orient itself either in the direction of an applied magnetic field, or in the opposite direction - either “spin-up” or “spin-down”. This effect could be observed in the splitting of the lines in atomic spectra in the presence of an applied 5.4. THE PAULI EXCLUSION PRINCIPLE AND THE PERIODIC TABLE 73

magnetic field. The “spin” and magnetic moment of electrons were completely explained in 1928 by P.A.M. Dirac’s relativistic wave equation. Meanwhile, the Austrian physicist Wolfgang Pauli proposed his famous exclusion prin- ciple, which explained the periodic table and the chemical properties of the elements. According to the Pauli exclusion principle, in the lowest energy state of an atom, the elec- trons fill the atomic orbitals in the order (1s), (2p), (3d), ... Two electrons are allowed in each linearly independent orbital, one with spin up and the other with spin down. This leads to the following electron configurations for the elements:

Hydrogen; (1s); very active metal; valence=1 • Helium; (1s)2; noble (inert) gas; valence=0 • Lithium; (1s)2(2p)1; very active metal; valence=1 • Beryllium; (1s)2(2p)2; metal; valence=2 • Boron; (1s)2(2p)3; less active metal; valence=3 • Carbon; (1s)2(2p)4; intermediate; valence=4 • Nitrogen; (1s)2(2p)5; less active nonmetal; valence=5 • Oxygen; (1s)2(2p)6; nonmetal; valence=6 • Fluorine; (1s)2(2p)7; very active nonmetal; valence=7 • Neon; (1s)2(2p)8; noble gas; valence=0 • Sodium; (1s)2(2p)8(2s)1; very active metal; valence=1 • In chemical reactions, the metals tend to give away their outer-shell electrons, while the non-metals tend to accept electrons. The most active metals, hydrogen, lithium, sodium, potassium, rubidium and cesium, all have a single electron in their outer shell, and they tend to give this electron away. The most active nonmetals, fluorine, chlorine, bromine and iodine, all are missing a single electron to complete their outer shell. We can notice that common table salt, is a cubic crystal structure formed from Na+ ions and Cl− ions. When it is dissolved in water, the sodium-chloride crystal dissociates into Na+ ions, complexed with water molecules and Cl− ions, also forming complexes with water. We see here the strong tendency of very active metals to give up their outer shell electron and to form positive ions, while very active nonmetals have an equally strong tendency to form negative ions. Helium, neon, argon, krypton, and radon, all with completely filled outer shell, are unreactive noble gases, with no tendency at all to give away or accept electrons or to form ions.

The Hartree-Fock equations The application of the Schr¨odinger equation to our understanding of chemical reactivity and the periodic table was made quantitative through the work of Douglas Hartree (1897- 1958) and Vladimir A. Fock (1898-1974). Douglas Hartree was born in Cambridge, England, where his father was a professor of engineering at Cambridge University and his mother was the mayor of the city. In his work on the electronic structure of atoms, Hartree visualized the electrons moving 74 QUANTUM THEORY

Figure 5.1: Atomic orbitals. 5.4. THE PAULI EXCLUSION PRINCIPLE AND THE PERIODIC TABLE 75

Figure 5.2: The periodic table of the elements. 76 QUANTUM THEORY in both the attractive field of the atomic nucleus and in a repulsive potential produced collectively by all the electrons. Hartree’s method for treating this problem was to make an initial guess of the size of the atomic orbitals (Figure 11.6) occupied by the electrons. He then calculated the repulsive potential that would result, and combined it with the nuclear attraction potential. Solving the Schr¨odingerequation for the an electron moving in this new potential, he obtained a set of improved atomic orbitals, and from these he could calculate an improved total potential. He continued to iterate this process until the change resulting from successive iterations became very small, at which point he described the electrical field in which the electrons moved as being self-consistent. Hartree called his procedure the Self-Consistent-Field (or SCF) Method. He published his first results in 1927, only a year after Schr¨odinger’sdiscovery of his wave equation. The Russian physicist Vladimir A. Fock was able to refine Hartree’s method by postulat- ing that the total electronic wave function of an atom or molecule had to be antisymmetric with respect to the exchange of the coordinates of any two electrons in the system. When spin was included in the wave function, this requirement led in a natural way to the exclu- sion principle postulated by Wolfgang Pauli. When combined with Hartree’s SCF method, Fock’s antisymmetry requirement led to more accurate results and better agreement be- tween theory and experiment. However, the Hartree-Fock SCF equations were much more difficult to solve. Later Clemens C.J. Roothaan (1918-2019) converted the Hartree-Fock equations into a matrix form suitable for solution by digital computers. The method in use today is thus known as the Hartree-Fock-Roothaan SCF Method. When applied to molecules, it is called the Hartree-Fock-Roothaan LCAO SCF Method. The LCAO in the name stands for the fact that molecular orbitals are represented as Linear Combinations of Atomic Orbitals. 5.4. THE PAULI EXCLUSION PRINCIPLE AND THE PERIODIC TABLE 77

Figure 5.3: Wolfgang Pauli (1900-1958). 78 QUANTUM THEORY

Figure 5.4: Douglas Hartree (1897-1958). 5.4. THE PAULI EXCLUSION PRINCIPLE AND THE PERIODIC TABLE 79

Figure 5.5: Vladimir A. Fock (1898-1974). 80 QUANTUM THEORY 5.5 Valence bond theory

Lewis structures In 1916, G.N. Lewis of the University of California, Berkeley, proposed a theory of chemical bonding in which a pair of electrons, one donated by each of the bonding atoms, together form a covalent bond. For example, in the hydrogen molecule, H2, one electron is con- tributed by each of the two hydrogen atoms. Another example of a Lewis structure is the − NO2 ion shown in Figure 15.2. The outer-shell electrons that do not contribute to bonding are represented by pairs of dots and are called lone pairs.

Heitler-London theory The next step in valence bond theory was taken in 1927 by Walter Heitler and Fritz London, who used Erwin Schr¨odinger’swave equation and Wolfgang Pauli’s exclusion principle to study the covalent bonding of the hydrogen molecule.

Linus Pauling’s contributions Linus Pauling developed these ideas further by introducing the key concepts of resonance and orbital hybridization. Pauling’s two famous books, Introduction to Quantum Mechan- ics, With Applications to Chemistry (with E. Bright Wilson, 1935), and The Nature of the Chemical Bond (1939) were extremely important and influential, as was Charles Coulson’s Valence (1952). 5.5. VALENCE BOND THEORY 81

Figure 5.6: Gilbert N. Lewis (1875-1946). He was nominated for the Nobel Prize in Chemistry 41 times, but never won it.

− Figure 5.7: The NO2 ion, an example of a Lewis structure. 82 QUANTUM THEORY 5.6 Molecular orbital theory

In molecular orbital theory, atomic orbitals shown in in Figure 11.6 are used to build up a representation of the orbital of an electron moving in a molecule. For example, Figure 15.3 shows a schematic diagram of the LCAO (Linear Combination of Atomic Orbitals) model of bonding in the H2 molecule. When two hydrogen atoms approach each other, the two electrons move in an attractive potential produced by both nuclei. Thus the range of their motion is enlarged. According to the Pauli exclusion principle, both electrons are allowed to move in this enlarged region, if they have opposite spins. The electronic wave function in the enlarged region of motion is called a a molecular orbital. In the LCAO approximation, molecular orbitals are built up from atomic orbitals cen- tered on the individual atoms of a molecule. For example, suppose that in the H2 molecule we denote the positions of the two nuclei by X1 and X2. Then we can approximate the molecular orbitals φσ(x) and φσ∗ (x) by symmetric and antisymmetric combinations of the two atomic orbitals, χ s(x X ) and χ s(x X ): 1 − 1 1 − 2 φσ(x) = N [χ1s(x X1) + χ1s(x X2)] 0 − − φσ∗ (x) = N [χ s(x X ) χ s(x X )] (5.45) 1 − 1 − 1 − 2 where N and N 0 are normalizing constants. The symmetric combination is called a bonding orbital, and in the hydrogen molecule ground state it is doubly occupied, the two electrons having opposite spin quantum numbers. In the ground state of H2, the antibonding orbital, φσ∗ (x), is unoccupied, as is illustrated schematically in Figure 15.3.

Quantum biochemistry Erich H¨uckel devised an extremely simple semiemperical approximation for treating the highest filled and lowest empty molecular orbitals of flat organic molecules such as benzene, napthaline, pyradine, guanine, cytosine, etc. Solution to H¨uclel’sequations only required the diagonalization of a small matrix, and this could be done even before the advent of electronic computers. During World War II, the French scientist Alberte Pullman sat in a basement room in Paris diagonalizing H¨uckel matrices with a desk-top mechanical calculator, while her husband Bernard drove a tank for the Free French forces in North Africa. After the war Alberte and Bernard Pullman published a pioneering book entitled Quantum Biochemistry (1963), which opened a new field of research. Meanwhile, the great Hungarian-American biochemist and physiologist Albert Szent-Gy¨orgyi,published a book entitled Introduction to a Submolecular Biology (1960) in which he explored the biochemical roles of charge donors and charge acceptors. Charge donors are molecules whose highest filled molecular orbitals are relatively high in energy while charge acceptors are molecules are relatively low in energy. A typical programmable minicomputer or “microprocessor”, manufactured in the 1970’s, could have 30,000 circuit elements, all of which were contained on a single chip. By 1989, more than a million transistors were being placed on a single chip; and by 2000, the number reached 42,000,000. 5.6. MOLECULAR ORBITAL THEORY 83

Figure 5.8: A schematic diagram of the LCAO (Linear Combination of Atomic Orbitals) model of bonding in the H2 molecule.

As a result of miniaturization and parallelization, the speed of computers rose expo- nentially. In 1960, the fastest computers could perform a hundred thousand elementary operations in a second. By 1970, the fastest computers took less than a second to per- form a million such operations. In 1987, a massively parallel , with 566 parallel processors, called GFll was designed to perform 11 billion floating-point operations per second (flops). By 2002 the fastest computer performed 40 at teraflops, making use of 5120 parallel CPU’s. Computer disk storage has also undergone a remarkable development. In 1987, the magnetic disk storage being produced could store 20 million bits of information per square inch; and even higher densities could be achieved by optical storage devices. Storage density has until followed a law similar to Moore’s law. In the 1970’s and 1980’s, computer networks were set up linking machines in various parts of the world. It became possible (for example) for a scientist in Europe to perform a calculation interactively on a computer in the United States just as though the distant machine were in the same room; and two or more computers could be linked for perform- ing large calculations. It also became possible to exchange programs, data, letters and manuscripts very rapidly through the computer networks. The exchange of large quantities of information through computer networks was made easier by the introduction of fiber optics cables. By 1986, 250,000 miles of such cables had been installed in the United States. If a ray of light, propagating in a medium with a large refractive index, strikes the surface of the medium at a grazing angle, then the ray undergoes total internal reflection. This phenomenon is utilized in fiber optics: A light signal can propagate through a long, hairlike glass fiber, following the bends of the fiber without 84 QUANTUM THEORY

Figure 5.9: Robert Mulliken (1896-1986). His contributions to molecular orbital theory won him the 1966 Nobel Prize in Chemistry. 5.6. MOLECULAR ORBITAL THEORY 85

Figure 5.10: Erich H¨uckel (1896-1980). He developed a very simple semiemper- ical approximation for treating the highest filled and lowest empty molecular orbitals of flat organic molecules. 86 QUANTUM THEORY

Figure 5.11: Alberte Pullman (1920-2011).

Figure 5.12: Alberte Pullman with her husband, Bernard. 5.6. MOLECULAR ORBITAL THEORY 87

Figure 5.13: Linus Pauling (1901-1994). The New Scientist called him one of the 20 most important scientists in history. He was awarded the Nobel Prize in Chemistry in 1954 and the Nobel Peace Prize in 1962. 88 QUANTUM THEORY

Figure 5.14: Two-time Nobel Prize winner Linus Pauling, Research Professor at the Linus Pauling Institute of Science and Medicine, Palo Alto, California, and E. Bright Wilson, Jr., Professor Emeritus of Chemistry at Harvard University, provide a readily understandable study of “wave mechanics,” discussing the Schr¨odingerwave equation and the problems which can be solved with it. The book was first published in 1935, and it is still in use today. 5.6. MOLECULAR ORBITAL THEORY 89

Figure 5.15: Linus Pauling’s famous book, The Nature of the Chemical Bond and the Structure of Molecules and Crystals. Cornell University Press, 1939. 90 QUANTUM THEORY

Figure 5.16: Clemens C.J. Roothaan (1918-2019). His thesis supervisor, Robert Mulliken, said of him, “I tried to induce Roothaan to do his Ph.D. thesis on H¨uckel-type calculations on substituted benzenes. But after carrying out some very good calculations on these he revolted against the H¨uckel method, threw his excellent calculations out the window, and for his thesis developed entirely independently his now well known all-electron LCAO SCF self-consistent-field method for the calculation of atomic and molecular wave functions, now ap- propriately referred to, I believe, as the Hartree-Fock-Roothaan method”. 5.6. MOLECULAR ORBITAL THEORY 91

Figure 5.17: Charles Coulson (1910-1974). He was the leader of important quan- tum chemistry groups, first at Kings College London, and then later at Oxford University. He was a very tall man, and also a religious one, with a strong sense of duty. On the day of his death in 1974, he worked until the last minute. His last words were, “I think I can’t do any more”. His book, Valence, greatly influenced the development of quantum chemistry. 92 QUANTUM THEORY

Figure 5.18: Per-Olov L¨owdin(1916-2000). Prof. L¨owdinfounded two impor- tant quantum chemistry groups, one at the University of Uppsala in Sweden, and another at the University of Florida in Gainsville. Many scientists were in- troduced to quantum chemistry at his famous Uppsala summer schools, which often included mountain-climbing expeditions. Prof. L¨owdinalso founded two important journals, The International Journal of Quantum Chemistry and Advances in Quantum Chemistry. He invented a number of mathematical techniques much used in the field, for example his method for symmetrical orthogonalization of basis functions. losing intensity because of total internal reflection. However, before fiber optics could be used for information transmission over long distances, a technological breakthrough in glass manufacture was needed, since the clearest glass available in 1940 was opaque in lengths more than 10 m. Through studies of the microscopic properties of glasses, the problem of absorption was overcome. By 1987, devices were being manufactured commercially that were capable of transmitting information through fiber-optic cables at the rate of 1.7 billion bits per second. 5.6. MOLECULAR ORBITAL THEORY 93

Figure 5.19: Tjalling C. Koopmans (1910-1985). Born in the Netherlands he studied mathematics and physics at the University of Utrecht. After publish- ing his famous theorem in quantum physics, he switched his interest to appli- cations of mathematics in economics. In 1975, he shared the Nobel Memorial Prize in Economics for his work in that field. Koopmans’ theorem states that if we neglect the readjustment of orbitals resulting from ionization, the en- ergy needed to remove an electron from the kth molecular orbital is just the eigenvalue of the Fock operator corresponding to that orbital. 94 QUANTUM THEORY

Figure 5.20: Prof. Clemens C. J. Roothaan in the Com- putation Center (ca. 1964). As computers gained more and more speed and memory capacity, quantum chemists like Prof. Roothaan have become able to use progressively more accurate calculational methods. Prof. Roothaan was my teacher in group theory when I studied at the University of Chicago. I later had the privilege of knowing him quite well as the result of meeting him at many quantum chemistry conferences. He and his brother were both in the Dutch resistance movement against the Nazis during World War II. Although his brother was killed, Prof. Roothaan survived, and lived an extremely long and productive life. He lived to be 101. I remember talking with him when he was a very old man. He told me about a new and powerful computer chip that he was developing. 5.7. THE HARTREE-FOCK-ROOTHAAN EQUATIONS 95 5.7 The Hartree-Fock-Roothaan equations

In the Hartree-Fock-Roothaan method, the wave function is built up from a linear combi- nation of atomic spin-orbitals χa, located on the various atoms in the molecule:

M

Φi = χaCa,i (5.46) a=1 X The Fock operator is defined by the equation

N 2 c ∗ e F (1) H (1) + dτ2Φs(2) (1 P1,2)Φs(2) (5.47) ≡ r , − s=1 1 2 X Z

Here dτ2 indicates integration over the space and spin coordinates of electron 2, while Hc(1) is the core Hamiltonian as a function of the space and spin coordinates of electron R 1: 2 c ~ 2 H (1) = 1 + V (1) (5.48) −2me ∇

The operator P1,2 exchanges the space and spin coordinates of electrons 1 and 2:

P , Φs(2)Φi(1) Φs(1)Φi(2) (5.49) 1 2 ≡ When the Fock operator acts on a filled molecular spin-orbital, it generates a linear com- bination of filled molecular spin-orbitals:

N

F (1)Φj(1) = Φi(1)λi,j (5.50) i=1 X where N is the number of electrons in the molecule, and where

dτ1Φi(1)F (1)Φj(1) = λi,j (5.51) Z is a matrix representation of the Fock operator based on the filled molecular spin-orbitals. We can bring the Fock operator into a diagonal form by means of a unitary trnsformation. Letting k, k = 1, 2, 3, ,N be the diagonal elements, we then have: ···

F (1) k Φk(1) = 0 k = 1, 2, 3, ,N (5.52) { − } ··· These simultaneous equations are called the Hartree-Fock equations. The Fock operator F (1)can be expressed in the form

N c F (1) = H (1) + [Js(1) Ks(1)] (5.53) − s=1 X 96 QUANTUM THEORY

where 2 c ~ 2 H (1) 1 + V (1) ≡ −2me ∇ 2 ∗ e Js(1)Φk(2) dτ2Φs(2) Φs(2)Φk(1) ≡ r1,2 Z 2 ∗ e Ks(1)Φk(2) dτ Φ (2) Φs(1)Φk(2) (5.54) ≡ 2 s r Z 1,2 Roothaan’s equations In his famous Ph.D. thesis, Clemens Roothaan applied the Hartree-Fock equations to the case where molecular spin-orbitals are represented as linear combinations of atomic spin- orbitals: M

Φk(1) = χa(1)Cb,k (5.55) b X=1 ∗ Inserting this into the Hartree-Fock equations, multiplying from the left by χa(1) and integrating over the space and spin coordinates of electron 1, we obtain M ∗ dτ χ (1) [F (1) k] χb(1)Ca,k = 0 (5.56) 1 a − b X=1 Z or in matrix form M

[Fa,b kSa,b] Cb,k = 0 (5.57) − b X=1 where

∗ Fa,b dτ χ (1)F (1)χb(1) ≡ 1 a Z ∗ Sa,b dτ χ (1)χb(1) (5.58) ≡ 1 a Z The matrix Sa,b is a matrix of overlap integrals, and it must be taken into account because the atomic spin-orbitals are not necessarily othhonormal. Roothaan expressed the Fock matrix Fa,b in the form

c Fa,b = Ha,b + Pc,dΓ(ab)cd (5.59) c,d X where 2 ∗ ∗ e Γ(ab)cd dτ1 dτ2χa(1)χc (2) χd(2)χb(1) ≡ r1,2 Z Z 2 ∗ ∗ e dτ dτ χ (1)χ (2) χd(1)χb(2) (5.60) − 1 2 a c r Z Z 1,2 5.8. KOOPMANS’ THEOREM 97

The density matrix Pc,d is defined by the relationship

M ∗ Pc,d νiC Cd,i (5.61) ≡ c,i i=1 X where

1 filled spin orbitals νi (5.62) ≡   0 empty spin orbitals  In Roothaan’s method, one begins by evaluating and storing the the M M M M × × × dimensional matrix of integrals Γ(ab)cd. An initial guess of the coefficients Ca,i is then used c to evaluate the core Hamiltonian, overlap and Fock matrices Ha,b, Sa,b and Fa,b. By solving he matrix form of the Hartree-Fock equations, new and improved values of the coefficients are obtained. The process is repeated many times until it (hopefully) converges to an accurate self-consistent solution.

5.8 Koopmans’ theorem

A Hartree-Fock-Roothaan calculation gives us a ground state N-electron wave function ∆0 which is totally antisymmetric with respect to the exchange of the space and spin coordinates of any two electrons. The physicist J.C. Slater introduced a way of writing such a totally antisymmetric wave function as a determinant in the atomic spin-orbitals.

χµ(x ) χµ0 (x ) χµ00 (x ) 1 1 1 ··· 1 χµ(x2) χµ0 (x2) χµ00 (x2) ∆ = χ χ 0 χ 00 ··· (5.63) 0 µ µ µ χ (x ) χ 0 (x ) χ 00 (x ) | · · · | ≡ √N! µ 3 µ 3 µ 3 . . . ··· . . .

involving atomic orbitals of the form

αi ms = 1/2 χµ(xi) χn,l,m,ms (xi) Rn,l(ri)Yl,m(θi, φi) (5.64) ≡ ≡ βi ms = 1/2  −

In fact, the Hartree-Fock equations can be derived by assuming that the ground-state wave function has this form, and then using a variational principle to minimize the energy of the ground state (see, for example, ([Avery, 1976])). The ground state energy then has the 98 QUANTUM THEORY

form:

∗ E = dτ dτ dτN ∆ H∆ 0 1 2 ··· 0 0 ZN Z Z ∗ c = dτ1Φs(1)H (1)Φs(1) s=1 X Z N N 2 1 ∗ ∗ e + dτ1 dτ2Φs(1)Φt (2) Φt(2)Φs(1) 2 r , s=1 t=1 1 2 X X Z Z N N 2 1 ∗ ∗ e dτ1 dτ2Φs(1)Φt (2) Φt(1)Φs(2) (5.65) − 2 r , s=1 t=1 1 2 X X Z Z The ground-state energy can also be expressed in terms of the core Hamiltonian matrix, the Fock matrix, and the density matrix, based on atomic orbitals:

M M 1 E = P Hc + F (5.66) 0 2 a,b a,b a,b a=1 b=1 X X  Since M, the number of basis functions, is larger than N, the number of electrons, solutions to the Hartree-Fock-Roothaan equations give us a number of states greater than the number of electrons. The lowest N of the resulting molecular orbitals are interpreted as being filled, while those of higher energy are seen as empty or “virtual” orbitals. If we lift an electron from a filled orbital to a virtual orbital,we change the occupation numbers, and hence we also change the density matrix. The resulting change in energy is given by

0 c 1 0 0 (∆E)ν →ν0 = (P Pa,b)H + (P P Pa,bPc,d)Γ ab cd (5.67) i i a,b − a,b 2 a,b c,d − ( ) a,b " c,d # X X In the special case where

0 ν = νi δi,k (5.68) i − we are removing an electron from the molecule, in other words, turning it into a positive ion with a single electronic charge. The approximate change in energy is then

∆Ek = k (5.69) − where

∗ c k dτ Φ (1)H (1)Φk(1) ≡ 1 k Z N 2 1 ∗ ∗ e + dτ1 dτ2Φk(1)Φt (2) Φt(2)Φs(1) 2 r , t=1 1 2 X Z Z N 2 1 ∗ ∗ e dτ1 dτ2Φk(1)Φt (2) Φt(1)Φs(2) (5.70) − 2 r , t=1 1 2 X Z Z 5.9. ELECTRON CREATION AND ANNIHILATION OPERATORS 99 is the eigenvalue of the Fock matrix belonging to the kth spin-orbital. This result is known as Koopmans’ theorem.

5.9 Electron creation and annihilation operators

We shall now consider an alternative way of representing many-electron wave functions. We † † † define a set of “electron creation operators”, b1, b2, b3, corresponding to the 1-electron spin-orbitals χ , χ , χ , . In this new notation, the Slater··· determinant 1 2 3 ···

χµ(x ) χµ0 (x ) χµ00 (x ) 1 1 1 ··· 1 χµ(x2) χµ0 (x2) χµ00 (x2) ∆ = χ χ 0 χ 00 ··· (5.71) ν µ µ µ χ (x ) χ 0 (x ) χ 00 (x ) | · · · | ≡ √N! µ 3 µ 3 µ 3 . . . ··· . . .

is represented by † † † ∆ν = b , b 0 , b 00 , 0 (5.72) | i µ µ µ · · · | i

We also introduce a set of “electron annihilation operators”, b1, b2, b3, corresponding to the same set of 1-electron spin-orbitals. When one of these operators acts··· on an N-electron state, it produces an (N 1)-electron state. We also impose the condition that when any electron annihilation operator− acts on the “vacuum state” 0 , it gives zero. | i

bµ 0 = 0 (5.73) | i for all values of µ. The adjoint of this equation has the form

0 b† = 0 (5.74) h | µ The many-electron wave functions expressed in terms of electron creation and annihilation operators will be totally antisymmetric with respect to the exchange of the space and spin coordinates of any two electrons provided that these operators obey the following “anticomutation relations”:

† † bi bj + bjbi = δi,j † † † † bi bj + bjbi = 0

bibj + bjbi = 0 (5.75)

The vacuum state is assumed to be normalized, so that

0 0 = 1 (5.76) h | i The adjoint of a many-electron state is represented by

∆ν = 0 , bµ00 , bµ0 , bµ (5.77) h | h | · · · 100 QUANTUM THEORY

Then, using the anticomutation relations to move the annihilation operators over to the right, where they annihilate the vacuum state, we find that

† † † ∆ν ∆ν = 0 , bµ00 , bµ0 , bµ, b , b 0 , b 00 , 0 h | i h | · · · µ µ µ · · · | i † † = 0 , bµ00 , bµ0 , b 0 , b 00 , 0 h | · · · µ µ · · · | i † = 0 , bµ00 , b 00 , 0 h | · · · µ · · · | i ...... = 0 0 = 1 (5.78) h | i so that the many-electron state is properly normalized.

1-electron and 2-electron operators In our new notation, 1-electron are represented by

† = fi,jbi bj (5.79) F i,j X where

∗ fi,j dτ χ (1)f(1)χj(1) (5.80) ≡ 1 i Z Examples of 1-electron operators are the core Hamiltonian or externally applied electric or magnetic fields. By contrast, 2-electron operators involve the interactions of electrons with each other, an example being the Coulomb repulsion between electrons. In our new notation, 2-electron operators have the form 1 = g b†b†b b (5.81) 2 st|uv s t u v G s,t,u,v X where

∗ ∗ gst|uv = dτ1 dτ2χs(1)χt (2)g(1, 2)χu(2)χv(1) (5.82) Z Z 5.10 Quantum chemistry and the development of com- puters

Some personal memories of early computers I hope that readers will forgive me if I tell them of my own personal memories of early computers: 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 101

When I arrived at Imperial College (then part of the University of London) in 1962, I worked with a crystallographic group that using the Mercury computer at University College to do the calculations needed to arrive at molecular structures. This gave me the chance to use Mercury to do quantum chemical calculations. I always went over to University College with the crystallographers at night, because time on the computer was so expensive that we could only afford to use it at night. I would make a bed for myself out of three rather hard chairs in a row and would try to sleep. At 3 AM or 4 AM they would wake me up and would say “Now it’s your turn”. Mercury was as big as a house, but could do far less than a modern laptop. It had 50,000 or so vacuum tubes which required cooling. The cooling system sometimes broke down, and one or another of the vacuum tubes sometimes failed, so one had to be grateful for the periods when Mercury was working. Our programs were written on punched tape in a language called CHLF3. (The letters stood for Cambridge, London, Harwell and Farnsborough, the four places that had Mercurys). After we had read the paper tape into the computer, the program was converted into a magnetic form on a rapidly rotating drum, and then checked against the original input. If it did not check, we had a so-called “drum parity”, which meant that we had to stop the computer and restart it by hand, using a bewildering array of manual controls. After finishing the work on Mercury at 6 AM or so, I would walk home, passing through the almost-deserted streets of Soho, and seeing pale-faced teenagers who had been up all night, high on amphetamines. They were sitting on the pavement near an underground station, waiting for it to open. After we had used Mercury for two years or so, IBM gave Imperial College one of their early computers. Using this was much better. Programs for the IBM machine were written on punched cards. We just went over to the machine with our punched cards and stood in line to have them read into the computer. Then a few minutes later we were handed a printout of the output. The IBM was much better than the machines that were available in eastern Europe, and for this reason I was contacted by Janos Ladik and his group at the Hungarian Academy of Science, who proposed a collaboration. We worked together for several years, calculating the electronic structure of a number of polypeptides and polynucleotides. In 1965, Janos Ladik invited me to attend a meeting of quantum theorists and computer scientists from both East and West, held at a town on the Hungarian Puszta, the great Hungarian plain east of Budapest. Both Charles Coulson and Per-Olov L¨owdin were there, as well as many scientists from the eastern side of the Iron Curtain. At the meeting, Enrico Clementi spoke about computer programs that he had developed for performing ab-initio1 calculation of the electronic structure of molecules. Clementi was an important IBM scientist, and he had his own laboratory with a large computer which he could use as he liked. The programs that he described to us took hundreds of hours to complete an electronic structure calculation on a single molecule.

1ab-initio is a Latin expression meaning “from the beginning”. Such programs are completely free of input parameters based on experiments. 102 QUANTUM THEORY

Figure 5.21: Enrico Clementi (born 1931) explained to us that microminiatur- ization would soon make computers hundreds of times faster, smaller and less expensive. He was completely right.

In the question period after Clementi’s lecture, someone from the audience said: “It’s all right for you, Clementi. You can use hundreds of hours on a single calculation if you want to, because you are sitting at IBM with your own dedicated computer. But what about the rest of us? What good are these programs to us?” Clementi answered: “In a few years, computers will be hundreds of times faster, and they will also be cheaper.” The audience asked: “And how will this happen?”. Clementi answered: “Through microminiaturization.” He was completely right. That was exactly what happened.

The invention of transistors; Microelectronics The problem of unreliable vacuum tubes was solved in 1948 by John Bardeen, William Shockley and Walter Brattain of the Bell Telephone Laboratories. Application of quantum theory to solids had lead to an understanding of the electrical properties of crystals. Like atoms, crystals were found to have allowed and forbidden energy levels. The allowed energy levels for an electron in a crystal were known to form bands, i.e., some energy ranges with many allowed states (allowed bands), and other energy ranges with none (forbidden bands). The lowest allowed bands were occupied by electrons, while higher bands were empty. The highest filled band was called the “valence band”, and the lowest empty band was called the “conduction band”. According to quantum theory, whenever the valence band of a crystal is only partly filled, the crystal is a conductor of electricity; but if the valence band is completely filled with electrons, the crystal is an electrical insulator. (A completely filled band is analogous to a room so packed with people that none of them can move.) 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 103

In addition to conductors and insulators, quantum theory predicted the existence of “semiconductors” - crystals where the valence band is completely filled with electrons, but where the energy gap between the conduction band and the valence band is very small. For example, crystals of the elements silicon and germanium are semiconductors. For such a crystal, thermal energy is sometimes enough to lift an electron from the valence band to the conduction band. Bardeen, Shockley and Brattain found ways to control the conductivity of germanium crystals by injecting electrons into the conduction band, or alternatively by removing elec- trons from the valence band. They could do this by “doping” the crystals with appropriate impurities, or by injecting electrons with a special electrode. The semiconducting crystals whose conductivity was controlled in this way could be used as electronic valves, in place of vacuum tubes. By the 1960’s, replacement of vacuum tubes by transistors in electronic computers had led not only to an enormous increase in reliability and a great reduction in cost, but also to an enormous increase in speed. It was found that the limiting factor in computer speed was the time needed for an electrical signal to propagate from one part of the central processing unit to another. Since electrical impulses propagate with the speed of light, this time is extremely small; but nevertheless, it is the limiting factor in the speed of electronic computers.

Integrated circuits In order to reduce the propagation time, computer designers tried to make the central processing units very small; and the result was the development of integrated circuits and microelectronics. (Another motive for miniaturization of electronics came from the requirements of space exploration.) Integrated circuits were developed in which single circuit elements were not manufac- tured separately. Instead, the whole circuit was made at one time. An integrated circuit is a sandwich-like structure, with conducting, resisting and insulating layers interspersed with layers of germanium or silicon, “doped ” with appropriate impurities. At the start of the manufacturing process, an engineer makes a large drawing of each layer. For example, the drawing of a conducting layer would contain pathways which fill the role played by wires in a conventional circuit, while the remainder of the layer would consist of areas destined to be etched away by acid. The next step is to reduce the size of the drawing and to multiply it photographically. The pattern of the layer is thus repeated many times, like the design on a piece of wallpaper. The multiplied and reduced drawing is then focused through a reversed microscope onto the surface to be etched. Successive layers are built up by evaporating or depositing thin films of the appropriate substances onto the surface of a silicon or germanium wafer. If the layer being made is to be conducting, the surface would consist of an extremely thin layer of copper, covered with a photosensitive layer called a “photoresist”. On those portions of the surface receiving light from the pattern, the photoresist becomes insoluble, while on those areas not receiving 104 QUANTUM THEORY light, the photoresist can be washed away. The surface is then etched with acid, which removes the copper from those areas not protected by photoresist. Each successive layer of a wafer is made in this way, and finally the wafer is cut into tiny “chips”, each of which corresponds to one unit of the wallpaper- like pattern. Although the area of a chip may be much smaller than a square centimeter, the chip can contain an extremely complex circuit. A typical programmable minicomputer or “microprocessor”, manufactured during the 1970’s, could have 30,000 circuit elements, all of which were contained on a single chip. By 1986, more than a million transistors were being placed on a single chip. As a result of miniaturization, the speed of computers rose steadily. In 1960, the fastest computers could perform a hundred thousand elementary operations in a second. By 1970, the fastest computers took less than a second to perform a million such operations. In 1987, a computer called GF11 was designed to perform 11 billion floating-point operations (flops) per second. GF11 (Gigaflop 11) is a scientific parallel-processing machine constructed by IBM. Approximately ten floating-point operations are needed for each machine instruction. Thus GF11 runs at the rate of approximately a thousand million instructions per second (1,100 MIPS). The high speed achieved by parallel-processing machines results from dividing a job into many sub-jobs on which a large number of processing units can work simultaneously. Computer memories have also undergone a remarkable development. In 1987, the magnetic disc memories being produced could store 20 million bits of information per square inch; and even higher densities could be achieved by optical storage devices. (A “bit” is the unit of information. For example, the number 25, written in the binary system, is 11001. To specify this 5-digit binary number requires 5 bits of information. To specify an n-digit binary number requires n bits of information. Eight bits make a “byte”.) In the 1970’s and 1980’s, computer networks were set up linking machines in various parts of the world. It became possible (for example) for a scientist in Europe to perform a calculation interactively on a computer in the United States just as though the distant machine were in the same room; and two or more computers could be linked for perform- ing large calculations. It also became possible to exchange programs, data, letters and manuscripts very rapidly through the computer networks.

Suggestions for further reading 1. Akhlesh Lakhtakia. Models and Modelers of Hydrogen: Thales, Thomson, Ruther- ford, Bohr, Sommerfeld, Goudsmit, Heisenberg, Schr¨odinger, Dirac, Sallhofer. World Scientific, (1996). 2. O’Connor, John J.; Robertson, Edmund F., Erwin Schr¨odinger, MacTutor History of Mathematics archive, University of St Andrews. 3. Halpern, Paul. Einstein’s Dice and Schr¨odinger’s Cat. Perseus Books Group, (2015). 4. Heitler, W. Erwin Schr¨odinger.1887-1961. Biographical Memoirs of Fellows of the Royal Society. 7: 221-226, (1961). 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 105

5. Farmelo, Graham. The Strangest Man: The Hidden Life of Paul Dirac, Quantum Genius. Faber and Faber, (2009). 6. Dalitz, R. H.; Peierls, R. Paul Adrien Maurice Dirac. 8 August 1902 - 20 October 1984. Biographical Memoirs of Fellows of the Royal Society. 32: 137-185, (1986). 7. Douglas, Sandy, Some Memories of EDSAC 1: 1950-1952, Ann. Hist. Comp., Vol. 1, No. 2, 1980, pp. 98-99, 208. 8. Medwick, Paul A., Douglas Hartree and Early Computations in Quantum Mechanics, Ann. Hist. Comp., Vol. 10, No. 2, 1988, pp. 105-112. 9. Hartree, D. R., Calculating Machines-Recent and Prospective Developments, Cam- bridge Univ. Press, Cambridge, UK, 1947. 10. Hartree, D. R., Automatic Calculating Machines, Mathematical Gazette, Vol. 34, Dec. 1950, pp. 241-252. 11. Graham, L. (1982). The reception of Einstein’s ideas: Two examples from contrasting political cultures. In Holton, G. and Elkana, Y. (Eds.) Albert Einstein: Historical and cultural perspectives. Princeton, NJ: Princeton UP, pp. 107-136 12. Fock, V. A. (1964). The Theory of Space, Time and Gravitation. Macmillan. 13. Lewis, G. N. (1916), The Atom and the Molecule, J. Am. Chem. Soc., 38 (4): 762-85, 14. Miburo, Barnabe B. (1993), Simplified Lewis Structure Drawing for Non-science Ma- jors, J. Chem. Educ., 75 (3): 317 15. Coffey, Patrick (2008). Cathedrals of Science: The Personalities and Rivalries That Made Modern Chemistry. Oxford University Press. 16. Davenport, Derek A. (1996). The Many Lives of Linus Pauling: A Review of Reviews. Journal of Chemical Education. 73 (9): A210. 17. Hargittai, IstvA¡n˜ (2000). Hargittai, Magdolna (ed.). Candid science: conversations with famous chemists (Reprinted ed.). London: Imperial College Press. 18. Marinacci, Barbara, ed. (1995). Linus Pauling: In His Own Words; Selected Writ- ings, Speeches, and Interviews. Introduction by Linus Pauling. New York: Simon & Schuster. 19. Sturchio, Jeffrey L. (April 6, 1987). Linus C. Pauling, Transcript of an Inter- view Conducted by Jeffrey L. Sturchio in Denver, Colorado on 6 April 1987 (PDF). Philadelphia, PA: Chemical Heritage Foundation. 20. Avery, John, The Quantum Theory of Atoms, Molecules and Photons, McGraw-Hill, (1972). 21. Avery, John, Creation and Annihilation Operators, McGraw-Hill, (1976). 22. Avery, John, Dahl, J.P., and Hansen, A.E., editors, Understanding Molecular Prop- erties, Reidel, Dordricht, (1987). 23. Avery, John, Hyperspherical Harmonics; Applictions in Quantum Theory, Kluwer Academic Publishers, Dordricht, (1989). 24. Avery, John, editor, The Chemical Bond, Special Issue of the International Journal of Quantum Chemistry, Volume 58, (1996). 25. Avery, John, Hyperspherical Harmonics and Generalized Sturmians, Kluwer Aca- demic Publishers, Dordrecht, Netherlands, (2000). 106 QUANTUM THEORY

26. Avery, James and Avery, John, Generalized Sturmians and Atomic Spectra, World Scientific, (2000). 27. Avery, John, Rettrup, Sten and Avery, James, Symmetry Adapted Basis Functions. World Scientific, (2012). 28. Avery, James and Avery, John, Hyperspherical Harmonics and Their Physical Appli- cations, World Scientific, (2017). 29. H. Babbage, Babbages Calculating Engines: A Collection of Papers by Henry Prevost Babbage, MIT Press, (1984). 30. A.M. Turing, The Enigma of Intelligence, Burnett, London (1983). 31. Ft. Penrose, The Emperor’s New Mind: Concerning Computers, Minds, and the Laws of Physics, Oxford University Press, (1989). 32. S. Wolfram, A New Kind of Science, Wolfram Media, Champaign IL, (2002). 33. A.M. Turing, On computable numbers, with an application to the Entscheidungsprob- lem, Proc. Lond. Math. Soc. Ser 2, 42, (1937). Reprinted in M. David Ed., The Undecidable, Raven Press, Hewlett N.Y., (1965). 34. N. Metropolis, J. Howlett, and Gian-Carlo Rota (editors), A History of Computing in the Twentieth Century, Academic Press (1980). 35. J. Shurkin, Engines of the Mind: A History of Computers, W.W. Norten, (1984). 36. J. Palfreman and D. Swade, The Dream Machine: Exploring the Computer Age, BBC Press (UK), (1991). 37. T.J. Watson, Jr. and P. Petre, Father, Son, and Co., Bantam Books, New York, (1991). 38. A. Hodges, : The Enegma, Simon and Schuster, (1983). 39. H.H. Goldstein, The Computer from Pascal to Von Neumann, Princeton University Press, (1972). 40. C.J. Bashe, L.R. Johnson, J.H. Palmer, and E.W. Pugh, IBM’s Early Computers, Vol. 3 in the History of Computing Series, MIT Press, (1986). 41. K.D. Fishman, The Computer Establishment, McGraw-Hill, (1982). 42. S. Levy, Hackers, Doubleday, (1984). 43. S. Franklin, Artificial Minds, MIT Press, (1997). 44. P. Freiberger and M. Swaine, Fire in the Valley: The Making of the Personal Com- puter, Osborne/MeGraw-Hill, (1984). 45. R.X. Cringely, Accidental Empires, Addison-Wesley, (1992). 46. R. Randell editor, The Origins of Digital Computers, Selected Papers, Springer- Verlag, New York (1973). 47. H. Lukoff, From Dits to Bits, Robotics Press, (1979). 48. D.E. Lundstrom, A Few Good Men from Univac, MIT Press, (1987). 49. D. Rutland, Why Computers Are Computers (The SWAC and the PC), Wren Pub- lishers, (1995). 50. P.E. Ceruzzi, Reckoners: The Prehistory of the Digital Computer, from Relays to the Stored Program Concept, 1935-1945, Greenwood Press, Westport, (1983) 51. S.G. Nash, A History of Scientific Computing, Adison-Wesley, Reading Mass., (1990). 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 107

52. P.E. Ceruzzi, Crossing the divide: Architectural issues and the emer- gence of stored programme computers, 1935-1953, IEEE Annals of the History of Computing, 19, 5-12, January-March (1997). 53. P.E. Ceruzzi, A History of Modern Computing, MIT Press, Cambridge MA, (1998). 54. K. Zuse, Some remarks on the history of computing in Germany, in A History of Computing in the 20th Century, N. Metropolis et al. editors, 611-627, Academic Press, New York, (1980). 55. A.R. Mackintosh, The First Electronic Computer, Physics Today, March, (1987). 56. S.H. Hollingdale and G.C. Tootil, Electronic Computers, Penguin Books Ltd. (1970). 57. A. Hodges, Alan Turing: The Enegma, Simon and Schuster, New York, (1983). 58. A. Turing, On computable numbers with reference to the Entscheidungsproblem, Jour- nal of the London Mathematical Society, II, 2. 42, 230-265 (1937). 59. J. von Neumann, The Computer and the Brain, Yale University Press, (1958). 60. I.E. Sutherland, Microelectronics and computer science, Scientific American, 210-228, September (1977). 61. W. Aspray, and the Origins of Modern Computing, M.I.T. Press, Cambridge MA, (1990, 2nd ed. 1992). 62. W. Aspray, The history of computing within the history of information technology, History and Technology, 11, 7-19 (1994). 63. G.F. Luger, Computation and Intelligence: Collected Readings, MIT Press, (1995). 64. Z.W. Pylyshyn, Computation and Cognition: Towards a Foundation for Cognitive Science, MIT Press, (1986). 65. D.E. Shasha and C. Lazere, Out of Their Minds: The Creators of Computer Science, Copernicus, New York, (1995). 66. W. Aspray, An annotated bibliography of secondary sources on the history of software, Annals of the History of Computing 9, 291-243 (1988). 67. R. Kurzweil, The Age of Intelligent Machines, MIT Press, (1992). 68. S.L. Garfinkel and H. Abelson, eds., Architects of the Information Society: Thirty- Five Years of the Laboratory for Computer Sciences at MIT, MIT Press, (1999). 69. J. Haugeland, Artificial Intelligence: The Very Idea, MIT Press, (1989). 70. M.A. Boden, Artificial Intelligence in Psychology: Interdisciplinary Essays, MIT Press, (1989). 71. J.W. Cortada, A Bibliographic Guide to the History of Computer Applications, 1950- 1990, Greenwood Press, Westport Conn., (1996). 72. M. Campbell-Kelly and W. Aspry, Computer: A History of the Information Machine, Basic Books, New York, (1996). 73. B.I. Blum and K. Duncan, editors, A History of Medical Informatics, ACM Press, New York, (1990). 74. J.-C. Guedon, La Planete Cyber, Internet et Cyberspace, Gallimard, (1996). 75. S. Augarten, Bit by Bit: An Illustrated History of Computers, Unwin, London, (1985). 76. N. Wiener, Cybernetics; or Control and Communication in the Animal and the Ma- chine, The Technology Press, John Wiley and Sons, New York, (1948). 77. W.R. Ashby, An Introduction to Cybernetics, Chapman and Hall, London, (1956). 108 QUANTUM THEORY

78. M.A. Arbib, A partial survey of cybernetics in eastern Europe and the Soviet Union, Behavioral Sci., 11, 193-216, (1966). 79. A. Rosenblueth, N. Weiner and J. Bigelow, Behavior, purpose and teleology, Phil. Soc. 10 (1), 18-24 (1943). 80. N. Weiner and A. Rosenblueth, Conduction of impulses in cardiac muscle, Arch. Inst. Cardiol. Mex., 16, 205-265 (1946). 81. H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social systems. Transactions of sixth-tenth conferences, Josiah J. Macy Jr. Foundation, New York, (1950- 1954). 82. W.S. McCulloch and W. Pitts, A logical calculus of ideas immanent in nervous ac- tivity, Bull. Math. Biophys., 5, 115-133 (1943). 83. W.S. McCulloch, An Account of the First Three Conferences on Teleological Mecha- nisms, Josiah Macy Jr. Foundation, (1947). 84. G.A. Miller, Languages and Communication, McGraw-Hill, New York, (1951). 85. G.A. Miller, Statistical behavioristics and sequences of responses, Psychol. Rev. 56, 6 (1949). 86. G. Bateson, Bali - the value system of a steady state, in M. Fortes, editor, Social Struc- ture Studies Presented to A.R. Radcliffe-Brown, Clarendon Press, Oxford, (1949). 87. G. Bateson, Communication, the Social Matrix of Psychiatry, Norton, (1951). 88. G. Bateson, Steps to an Ecology of Mind, Chandler, San Francisco, (1972). 89. G. Bateson, Communication et Societe, Seuil, Paris, (1988). 90. S. Heims, Gregory Bateson and the mathematicians: From interdisciplinary interac- tions to societal functions, J. History Behavioral Sci., 13, 141-159 (1977). 91. S. Heims, John von Neumann and Norbert Wiener. From Mathematics to the Tech- nology of Life and Death, MIT Press, Cambridge MA, (1980). 92. S. Heims, The Cybernetics Group, MIT Press, Cambridge MA, (1991). 93. G. van de Vijver, New Perspectives on Cybernetics (Self-Organization, Autonomy and Connectionism), Kluwer, Dordrecht, (1992). 94. A. Bavelas, A mathematical model for group structures, Appl. Anthrop. 7 (3), 16 (1948). 95. P. de Latil, La Pensee Artificielle - Introduction a la Cybernetique, Gallimard, Paris, (1953). 96. L.K. Frank, G.E. Hutchinson, W.K. Livingston, W.S. McCulloch and N. Wiener, Teleological Mechanisms, Ann. N.Y. Acad. Sci. 50, 187- 277 (1948). 97. H. von Foerster, Quantum theory of memory, in H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social systems. Trans- actions of the sixth conferences, Josiah J. Macy Jr. Foundation, New York, (1950). 98. H. von Foerster, Observing Systems, Intersystems Publications, California, (1984). 99. H. von Foerster, Understanding Understanding: Essays on Cybernetics and Cogni- tion, Springer, New York, (2002). 100. M. Newborn, Kasparov vs. Deep Blue: Computer Chess Comes of age, Springer Verlag, (1996). 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 109

101. K.M. Colby, Artificial Paranoia: A Computer Simulation of the Paranoid Process, Pergamon Press, New York, (1975). 102. J.Z. Young, Discrimination and learning in the octopus, in H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social sys- tems. Transactions of the ninth conference, Josiah J. Macy Jr. Foundation, New York, (1953). 103. M.J. Apter and L. Wolpert, Cybernetics and development. I. Infor- mation theory, J. Theor. Biol. 8, 244-257 (1965). 104. H. Atlan, L’Organization Biologique et la Theorie de I’Information, Hermann, Paris, (1972). 105. H. Atlan, On a formal definition of organization, J. Theor. Biol. 45, 295-304 (1974). 106. H. Atlan, Organization du vivant, information et auto-organization, in Volume Sym- posium 1986 de l’Encylopediea Universalis, pp. 355-361, Paris, (1986). 107. E.R. Kandel, Nerve cells and behavior, Scientific American, 223, 57-70, July, (1970). 108. E.R. Kandel, Small systems of neurons, Scientific American, 241 no.3, 66-76 (1979). 109. A.K. Katchalsky et al., Dynamic patterns of brain cell assemblies, Neurosciences Res. Prog. Bull., 12 no.1, (1974). 110. G.E. Moore, Cramming more components onto integrated circuits, Electronics, April 19, (1965). 111. P. Gelsinger, P. Gargini, G. Parker and A. Yu, Microprocessors circa 2000, IEEE Spectrum, October, (1989). 112. P. Baron, On distributed communications networks, IEEE Trans. Comm. Systems, March (1964). 113. V.G. Cerf and R.E. Khan, A protocol for packet network intercommunication, Trans. Comm. Tech. COM-22, V5, 627-641, May (1974). 114. L. Kleinrock, Communication Nets: Stochastic Message Flow and Delay, McGraw- Hill, New York, (1964). 115. L. Kleinrock, Queueing Systems: Vol. II, Computer Applications, Wiley, New York, (1976). 116. R. Kahn, editor, Special Issue on Packet Communication Networks, Proc. IEEE, 66, November, (1978). 117. L.G. Roberts, The evolution of packet switching, Proc. of the IEEE 66, 1307-13, (1978). 118. J. Abbate, The electrical century: Inventing the web, Proc. IEEE 87, November, (1999). 119. J. Abbate, Inventing the Internet, MIT Press, Cambridge MA, (1999). 120. J.C. McDonald, editor, Fundamentals of Digital Switching, 2nd Edition, Plenum, New York, (1990). 121. B. Metcalfe, Packet Communication, Peer-to-Peer Communication, San Jose Calif, (1996). 122. T. Berners-Lee, The Original Design and Ultimate Destiny of the World Wide Web by its Inventor, Harper San Francisco, (1999). 110 QUANTUM THEORY

123. J. Clark, Netscape Time: The Making of the Billion-Dollar Start-Up That Took On Microsoft, St. Martin’s Press, New York, (1999). 124. J. Wallace, Overdrive: Bill Gates and the Race to Control Cyberspace, Wiley, New York, (1997). 125. P. Cunningham and F. Froschl, The Electronic Business Revolution, Springer Verlag, New York, (1999). 126. J.L. McKenny, Waves of Change: Business Evolution Through Information Technol- ogy, Harvard Business School Press, (1995). 127. M.A. Cosumano, Competing on Internet Time: Lessons From Netscape and Its Battle with Microsoft, Free Press, New York, (1998). 128. F.J. Dyson, The Sun, the Genome and the Internet: Tools of Scientific Revolutions, Oxford University Press, (1999). 129. L. Bruno, Fiber Optimism: Nortel, Lucent and Cisco are battling to win the high- stakes fiber-optics game, Red Herring, June (2000). 130. N. Cochrane, We’re insatiable: Now it’s 20 million million bytes a day, Melbourne Age, January 15, (2001). 131. K.G. Coffman and A.N. Odlyzko, The size and growth rate of the Internet, First Monday, October, (1998). 132. C.A. Eldering, M.L. Sylla, and J.A. Eisenach, Is there a Moore’s law for bandwidth?, IEEE Comm. Mag., 2-7, October, (1999). 133. G. Gilder, Fiber keeps its promise: Get ready, bandwidth will triple each year for the next 25 years, Forbes, April 7, (1997). 134. A.M. Noll, Does data traffic exceed voice traffic?, Comm. ACM, 121- 124, June, (1999). 135. B. St. Arnaud, J. Coulter, J. Fitchett, and S. Mokbel, Architectural and engineering issues for building an optical Internet, Proc. Soc. Optical Eng. (1998). 136. M. Weisner, The computer for the 21st century, Scientific American, September, (1991). 137. R. Wright, Three Scientists and Their Gods, Time Books, (1988). 138. S. Nora and A. Mine, The Computerization of Society, MIT Press, (1981). 139. T. Forester, Computers in the Human Context: Information Theory, Productivity, and People, MIT Press, (1989). 140. J. Hoffmeyer, Some semiotic aspects of the psycho-physical relation: the endo-exosemiotic boundary, in Biosemiotics. The Semiotic Web, T.A. Sebeok and J. Umiker-Sebeok, editors, Mouton de Gruyter, Berlin/New York, (1991). 141. J. Hoffmeyer, The swarming cyberspace of the body, Cybernetics and Human Know- ing, 3(1), 1-10 (1995). 142. J. Hoffmeyer, Signs of Meaning in the Universe, Indiana University Press, Bloom- ington IN, (1996). 143. J. Hoffmeyer, Biosemiotics: Towards a new synthesis in biology, European J. Semiotic Stud. 9(2), 355-376 (1997). 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 111

144. J. Hoffmeyer and C. Emmeche, Code-duality and the semiotics of nature, in On Semiotic Modeling, M. Anderson and F. Merrell, editors, Mouton de Gruyter, New York, (1991). 145. C. Emmeche and J. Hoffmeyer, From language to nature - The semiotic metaphor in biology, Semiotica, 84, 1-42 (1991). 146. C. Emmeche, The biosemiotics of emergent properties in a pluralist ontology, in Semiosis, Evolution, Energy: Towards a Reconceptualization of the Sign, E. Taborsky, editor, Shaker Verlag, Aachen, (1999). 147. S. Brier, Information and consciousness: A critique of the mechanistic concept of information, in Cybernetics and Human Knowing, 1(2/3), 71-94 (1992). 148. S. Brier, Ciber-Semiotics: Second-order cybernetics and the semiotics of C.S. Peirce, Proceedings from the Second European Congress on System Science, , October 5-8, 1993, AFCET, (1993). 149. S. Brier, A cybernetic and semiotic view on a Galilean theory of psychology, Cyber- netics and Human Knowing, 2 (2), 31-46 (1993). 150. S. Brier, Cybersemiotics: A suggestion for a transdisciplinary frame- work for de- scription of observing, anticipatory, and meaning producing systems, in D.M. Dubois, editor, Computing Anticipatory Systems, CASYS - First International Conference, Liege, Belgium 1997, AIP Conference Proceedings no. 437, (1997). 151. S. Oyama, The Ontogeny of Information, Cambridge University Press, (1985). 152. J. Hoffmeyer, The swarming cyberspace of the body, Cybernetics and Human Know- ing, 3(1), 1-10 (1995). 153. J.L. Casti and A. Karlqvist, editors, Complexity, Language, and Life: Mathematical Approaches, Springer, Berlin, (1985). 154. H. Maturana and F. Varla, Autopoiesis and Cognition: The Realization of the Living, Reidel, London, (1980). 155. J. Mingers, Self-Producing Systems: Implications and Application of Autopoiesis, Plenum Press, New York, (1995). 156. J. Buchler, editor, Philosophical Writings of Peirce: Selected and Edited with an Introduction by Justus Buchler, Dover Publications, New York, (1955). 157. T.L. Short, Peirce’s semiotic theory of the self, Semiotica, 91 (1/2), 109-131 (1992). 158. J. von Uexk¨ull, Umwelt und Innenwelt der Tiere. 2. verm, und verb. Aufl., Springer, Berlin, (1921). 159. J. von Uexk¨ull,The theory of meaning, Semiotica, 42(1), 25-87 (1982 [1940]). 160. T. von Uexk¨ull, Introduction: Meaning and science in Jacob von Uexkull’s concept of biology, Semiotica, 42, 1-24 (1982). 161. T. von Uexk¨ull, Medicine and semiotics, Semiotica, 61 , 201-217 (1986). 162. G. Bateson, Form, substance, and difference. Nineteenth Annual Korzybski Memorial Lecture, (1970). Reprinted in G. Bateson, Steps to an Ecology of Mind, Balentine Books, New York, (1972), pp. 448-464. 163. G. Bateson, Mind and Nature: A Necessary Unity, Bantam Books, New York, (1980). 164. G. Bateson, Sacred Unity: Further Steps to an Ecology of Mind, Harper Collins, New York, (1991). 112 QUANTUM THEORY

165. J. Ruesch and G. Bateson, Communication, Norton, New York, (1987). 166. E.F. Yates, Semiotics as a bridge between information (biology) and dynamics (physics), Recherches Semiotiques/Semiotic Inquiry 5, 347- 360 (1985). 167. T.A. Sebeok, Communication in animals and men, Language, 39, 448-466 (1963). 168. T.A. Sebeok, The Sign and its Masters, University of Texas Press, (1979). 169. P. Bouissac, Ecology of semiotic space: Competition, exploitation, and the evolution of arbitrary signs, Am. J. Semiotics, 10, 145-166 (1972). 170. F. Varla, Autopoiesis: A Theory of Living Organization, North Holland, New York, (1986). 171. R. Posner, K. Robins and T.A. Sebeok, editors, Semiotics: A Handbook of the Sign- Theoretic Foundations of Nature and Culture, Walter de Gruyter, Berlin, (1992). 172. R. Paton, The ecologies of hereditary information, Cybernetics and Human Knowing, 5(4), 31-44 (1998). 173. T. Stonier, Information and the Internal Structure of the Universe, Springer, Berlin, (1990). 174. T. Stonier, Information and Meaning: An Evolutionary Perspective, Springer, Berlin, (1997). 175. N. Metropolis, J. Howlett, and Gian-Carlo Rota (editors), A History of Computing in the Twentieth Century, Academic Press (1980). 176. S.H. Hollingdale and G.C. Tootil, Electronic Computers, Penguin Books Ltd. (1970). 177. Alan Turing, The Enigma of Intelligence, Burnett, London (1983). 178. R. Randell (editor), The Origins of Digital Computers, Selected Papers, Springer- Verlag, New York (1973). 179. Allan R. Mackintosh, The First Electronic Computer, Physics Today, March, (1987). 180. N. Metropolis, J. Howlett, and Gian-Carlo Rota (editors), A History of Computing in the Twentieth Century, Academic Press (1980). 181. S.H. Hollingdale and G.C. Tootil, Electronic Computers, Penguin Books Ltd. (1970). 182. R. Randell (editor), The Origins of Digital Computers, Selected Papers, Springer- Verlag, New York (1973). 183. Allan R. Mackintosh, The First Electronic Computer, Physics Today, March, (1987). 184. H. Babbage, Babbages Calculating Engines: A Collection of Papers by Henry Prevost Babbage, MIT Press, (1984). 185. A.M. Turing, The Enigma of Intelligence, Burnett, London (1983). 186. Ft. Penrose, The Emperor’s New Mind: Concerning Computers, Minds, and the Laws of Physics, Oxford University Press, (1989). 187. S. Wolfram, A New Kind of Science, Wolfram Media, Champaign IL, (2002). 188. A.M. Turing, On computable numbers, with an application to the Entscheidungsprob- lem, Proc. Lond. Math. Soc. Ser 2, 42, (1937). Reprinted in M. David Ed., The Undecidable, Raven Press, Hewlett N.Y., (1965). 189. N. Metropolis, J. Howlett, and Gian-Carlo Rota (editors), A History of Computing in the Twentieth Century, Academic Press (1980). 190. J. Shurkin, Engines of the Mind: A History of Computers, W.W. Norten, (1984). 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 113

191. J. Palfreman and D. Swade, The Dream Machine: Exploring the Computer Age, BBC Press (UK), (1991). 192. T.J. Watson, Jr. and P. Petre, Father, Son, and Co., Bantam Books, New York, (1991). 193. A. Hodges, Alan Turing: The Enegma, Simon and Schuster, (1983). 194. H.H. Goldstein, The Computer from Pascal to Von Neumann, Princeton University Press, (1972). 195. C.J. Bashe, L.R. Johnson, J.H. Palmer, and E.W. Pugh, IBM’s Early Computers, Vol. 3 in the History of Computing Series, MIT Press, (1986). 196. K.D. Fishman, The Computer Establishment, McGraw-Hill, (1982). 197. S. Levy, Hackers, Doubleday, (1984). 198. S. Franklin, Artificial Minds, MIT Press, (1997). 199. P. Freiberger and M. Swaine, Fire in the Valley: The Making of the Personal Com- puter, Osborne/MeGraw-Hill, (1984). 200. R.X. Cringely, Accidental Empires, Addison-Wesley, (1992). 201. R. Randell editor, The Origins of Digital Computers, Selected Papers, Springer- Verlag, New York (1973). 202. H. Lukoff, From Dits to Bits, Robotics Press, (1979). 203. D.E. Lundstrom, A Few Good Men from Univac, MIT Press, (1987). 204. D. Rutland, Why Computers Are Computers (The SWAC and the PC), Wren Publishers, (1995). 205. P.E. Ceruzzi, Reckoners: The Prehistory of the Digital Computer, from Relays to the Stored Program Concept, 1935-1945, Greenwood Press, Westport, (1983) 206. S.G. Nash, A History of Scientific Computing, Adison-Wesley, Reading Mass., (1990). 207. P.E. Ceruzzi, Crossing the divide: Architectural issues and the emer- gence of stored programme computers, 1935-1953, IEEE Annals of the History of Computing, 19, 5-12, January-March (1997). 208. P.E. Ceruzzi, A History of Modern Computing, MIT Press, Cambridge MA, (1998). 209. K. Zuse, Some remarks on the history of computing in Germany, in A History of Computing in the 20th Century, N. Metropolis et al. editors, 611-627, Academic Press, New York, (1980). 210. A.R. Mackintosh, The First Electronic Computer, Physics Today, March, (1987). 211. S.H. Hollingdale and G.C. Tootil, Electronic Computers, Penguin Books Ltd. (1970). 212. A. Hodges, Alan Turing: The Enegma, Simon and Schuster, New York, (1983). 213. A. Turing, On computable numbers with reference to the Entscheidungsproblem, Jour- nal of the London Mathematical Society, II, 2. 42, 230-265 (1937). 214. J. von Neumann, The Computer and the Brain, Yale University Press, (1958). 215. I.E. Sutherland, Microelectronics and computer science, Scientific American, 210-228, September (1977). 216. W. Aspray, John von Neumann and the Origins of Modern Computing, M.I.T. Press, Cambridge MA, (1990, 2nd ed. 1992). 217. W. Aspray, The history of computing within the history of information technology, History and Technology, 11, 7-19 (1994). 114 QUANTUM THEORY

218. G.F. Luger, Computation and Intelligence: Collected Readings, MIT Press, (1995). 219. Z.W. Pylyshyn, Computation and Cognition: Towards a Foundation for Cognitive Science, MIT Press, (1986). 220. D.E. Shasha and C. Lazere, Out of Their Minds: The Creators of Computer Science, Copernicus, New York, (1995). 221. W. Aspray, An annotated bibliography of secondary sources on the history of software, Annals of the History of Computing 9, 291-243 (1988). 222. R. Kurzweil, The Age of Intelligent Machines, MIT Press, (1992). 223. S.L. Garfinkel and H. Abelson, eds., Architects of the Information Society: Thirty- Five Years of the Laboratory for Computer Sciences at MIT, MIT Press, (1999). 224. J. Haugeland, Artificial Intelligence: The Very Idea, MIT Press, (1989). 225. M.A. Boden, Artificial Intelligence in Psychology: Interdisciplinary Essays, MIT Press, (1989). 226. J.W. Cortada, A Bibliographic Guide to the History of Computer Applications, 1950- 1990, Greenwood Press, Westport Conn., (1996). 227. M. Campbell-Kelly and W. Aspry, Computer: A History of the Information Machine, Basic Books, New York, (1996). 228. B.I. Blum and K. Duncan, editors, A History of Medical Informatics, ACM Press, New York, (1990). 229. J.-C. Guedon, La Planete Cyber, Internet et Cyberspace, Gallimard, (1996). 230. S. Augarten, Bit by Bit: An Illustrated History of Computers, Unwin, London, (1985). 231. N. Wiener, Cybernetics; or Control and Communication in the Animal and the Ma- chine, The Technology Press, John Wiley and Sons, New York, (1948). 232. W.R. Ashby, An Introduction to Cybernetics, Chapman and Hall, London, (1956). 233. M.A. Arbib, A partial survey of cybernetics in eastern Europe and the Soviet Union, Behavioral Sci., 11, 193-216, (1966). 234. A. Rosenblueth, N. Weiner and J. Bigelow, Behavior, purpose and teleology, Phil. Soc. 10 (1), 18-24 (1943). 235. N. Weiner and A. Rosenblueth, Conduction of impulses in cardiac muscle, Arch. Inst. Cardiol. Mex., 16, 205-265 (1946). 236. H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social systems. Transactions of sixth-tenth conferences, Josiah J. Macy Jr. Foundation, New York, (1950- 1954). 237. W.S. McCulloch and W. Pitts, A logical calculus of ideas immanent in nervous ac- tivity, Bull. Math. Biophys., 5, 115-133 (1943). 238. W.S. McCulloch, An Account of the First Three Conferences on Teleological Mecha- nisms, Josiah Macy Jr. Foundation, (1947). 239. G.A. Miller, Languages and Communication, McGraw-Hill, New York, (1951). 240. G.A. Miller, Statistical behavioristics and sequences of responses, Psychol. Rev. 56, 6 (1949). 241. G. Bateson, Bali - the value system of a steady state, in M. Fortes, editor, Social Struc- ture Studies Presented to A.R. Radcliffe-Brown, Clarendon Press, Oxford, (1949). 242. G. Bateson, Communication, the Social Matrix of Psychiatry, Norton, (1951). 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 115

243. G. Bateson, Steps to an Ecology of Mind, Chandler, San Francisco, (1972). 244. G. Bateson, Communication et Societe, Seuil, Paris, (1988). 245. S. Heims, Gregory Bateson and the mathematicians: From interdisciplinary interac- tions to societal functions, J. History Behavioral Sci., 13, 141-159 (1977). 246. S. Heims, John von Neumann and Norbert Wiener. From Mathematics to the Tech- nology of Life and Death, MIT Press, Cambridge MA, (1980). 247. S. Heims, The Cybernetics Group, MIT Press, Cambridge MA, (1991). 248. G. van de Vijver, New Perspectives on Cybernetics (Self-Organization, Autonomy and Connectionism), Kluwer, Dordrecht, (1992). 249. A. Bavelas, A mathematical model for group structures, Appl. Anthrop. 7 (3), 16 (1948). 250. P. de Latil, La Pensee Artificielle - Introduction a la Cybernetique, Gallimard, Paris, (1953). 251. L.K. Frank, G.E. Hutchinson, W.K. Livingston, W.S. McCulloch and N. Wiener, Teleological Mechanisms, Ann. N.Y. Acad. Sci. 50, 187- 277 (1948). 252. H. von Foerster, Quantum theory of memory, in H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social systems. Trans- actions of the sixth conferences, Josiah J. Macy Jr. Foundation, New York, (1950). 253. H. von Foerster, Observing Systems, Intersystems Publications, California, (1984). 254. H. von Foerster, Understanding Understanding: Essays on Cybernetics and Cogni- tion, Springer, New York, (2002). 255. M. Newborn, Kasparov vs. Deep Blue: Computer Chess Comes of age, Springer Verlag, (1996). 256. K.M. Colby, Artificial Paranoia: A Computer Simulation of the Paranoid Process, Pergamon Press, New York, (1975). 257. J.Z. Young, Discrimination and learning in the octopus, in H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social sys- tems. Transactions of the ninth conference, Josiah J. Macy Jr. Foundation, New York, (1953). 258. M.J. Apter and L. Wolpert, Cybernetics and development. I. Infor- mation theory, J. Theor. Biol. 8, 244-257 (1965). 259. H. Atlan, L’Organization Biologique et la Theorie de I’Information, Hermann, Paris, (1972). 260. H. Atlan, On a formal definition of organization, J. Theor. Biol. 45, 295-304 (1974). 261. H. Atlan, Organization du vivant, information et auto-organization, in Volume Sym- posium 1986 de l’Encylopediea Universalis, pp. 355-361, Paris, (1986). 262. E.R. Kandel, Nerve cells and behavior, Scientific American, 223, 57-70, July, (1970). 263. E.R. Kandel, Small systems of neurons, Scientific American, 241 no.3, 66-76 (1979). 264. A.K. Katchalsky et al., Dynamic patterns of brain cell assemblies, Neurosciences Res. Prog. Bull., 12 no.1, (1974). 265. G.E. Moore, Cramming more components onto integrated circuits, Electronics, April 19, (1965). 116 QUANTUM THEORY

266. P. Gelsinger, P. Gargini, G. Parker and A. Yu, Microprocessors circa 2000, IEEE Spectrum, October, (1989). 267. P. Baron, On distributed communications networks, IEEE Trans. Comm. Systems, March (1964). 268. V.G. Cerf and R.E. Khan, A protocol for packet network intercommunication, Trans. Comm. Tech. COM-22, V5, 627-641, May (1974). 269. L. Kleinrock, Communication Nets: Stochastic Message Flow and Delay, McGraw- Hill, New York, (1964). 270. L. Kleinrock, Queueing Systems: Vol. II, Computer Applications, Wiley, New York, (1976). 271. R. Kahn, editor, Special Issue on Packet Communication Networks, Proc. IEEE, 66, November, (1978). 272. L.G. Roberts, The evolution of packet switching, Proc. of the IEEE 66, 1307-13, (1978). 273. J. Abbate, The electrical century: Inventing the web, Proc. IEEE 87, November, (1999). 274. J. Abbate, Inventing the Internet, MIT Press, Cambridge MA, (1999). 275. J.C. McDonald, editor, Fundamentals of Digital Switching, 2nd Edition, Plenum, New York, (1990). 276. B. Metcalfe, Packet Communication, Peer-to-Peer Communication, San Jose Calif, (1996). 277. T. Berners-Lee, The Original Design and Ultimate Destiny of the World Wide Web by its Inventor, Harper San Francisco, (1999). 278. J. Clark, Netscape Time: The Making of the Billion-Dollar Start-Up That Took On Microsoft, St. Martin’s Press, New York, (1999). 279. J. Wallace, Overdrive: Bill Gates and the Race to Control Cyberspace, Wiley, New York, (1997). 280. P. Cunningham and F. Froschl, The Electronic Business Revolution, Springer Verlag, New York, (1999). 281. J.L. McKenny, Waves of Change: Business Evolution Through Information Tech- nology, Harvard Business School Press, (1995). 282. M.A. Cosumano, Competing on Internet Time: Lessons From Netscape and Its Battle with Microsoft, Free Press, New York, (1998). 283. F.J. Dyson, The Sun, the Genome and the Internet: Tools of Scientific Revolutions, Oxford University Press, (1999). 284. L. Bruno, Fiber Optimism: Nortel, Lucent and Cisco are battling to win the high- stakes fiber-optics game, Red Herring, June (2000). 285. N. Cochrane, We’re insatiable: Now it’s 20 million million bytes a day, Melbourne Age, January 15, (2001). 286. K.G. Coffman and A.N. Odlyzko, The size and growth rate of the Internet, First Monday, October, (1998). 287. C.A. Eldering, M.L. Sylla, and J.A. Eisenach, Is there a Moore’s law for bandwidth?, IEEE Comm. Mag., 2-7, October, (1999). 5.10. QUANTUM CHEMISTRY AND THE DEVELOPMENT OF COMPUTERS 117

288. G. Gilder, Fiber keeps its promise: Get ready, bandwidth will triple each year for the next 25 years, Forbes, April 7, (1997). 289. A.M. Noll, Does data traffic exceed voice traffic?, Comm. ACM, 121- 124, June, (1999). 290. B. St. Arnaud, J. Coulter, J. Fitchett, and S. Mokbel, Architectural and engineering issues for building an optical Internet, Proc. Soc. Optical Eng. (1998). 291. M. Weisner, The computer for the 21st century, Scientific American, September, (1991). 292. R. Wright, Three Scientists and Their Gods, Time Books, (1988). 293. S. Nora and A. Mine, The Computerization of Society, MIT Press, (1981). 294. T. Forester, Computers in the Human Context: Information Theory, Productivity, and People, MIT Press, (1989). 118 QUANTUM THEORY Chapter 6

PERIODIC SYSTEMS

6.1 The discovery of X-rays

In 1895, while the work leading to the discovery of the electron was still going on, a second revolutionary discovery was made. In the autumn of that year, Wilhelm Konrad Roentgen (1845-1923), the head of the department of physics at the University of W¨urtzburgin Bavaria, was working with a discharge tube, repeating some of the experiments of Crookes.

Roentgen was especially interested in the luminescence of certain materials when they were struck by cathode rays. He darkened the room, and turned on the high voltage. As the current surged across the tube, a flash of light came from an entirely different part of the room! To Roentgen’s astonishment, he found that a piece of paper which he had coated with barium platinocyanide was glowing brightly, even though it was so far away from the discharge tube that the cathode rays could not possibly reach it! Roentgen turned off the tube, and the light from the coated paper disappeared. He turned on the tube again, and the bright glow on the screen reappeared. He carried the coated screen into the next room. Still it glowed! Again he turned off the tube, and again the screen stopped glowing. Roentgen realized that he had discovered something completely strange and new. Radiation of some kind was coming from his discharge tube, but the new kind of radiation could penetrate opaque matter! Years later, when someone asked Roentgen what he thought when he discovered X-rays, he replied: “I didn’t think. I experimented!” During the next seven weeks he experimented like a madman; and when he finally announced his discovery in December, 1895, he was able to report all of the most important properties of X-rays, including their ability to ionize gases and the fact that they cannot be deflected by electric or magnetic fields. Roentgen correctly believed X-rays to be electromagnetic waves, just like light waves, but with very much shorter wavelength. It turned out that X-rays were produced by electrons from the cathode of the discharge tube. These electrons were accelerated by the strong electric field as they passed across the tube from the cathode (the negative terminal) to the anode (the positive terminal).

119 120 QUANTUM THEORY

Figure 6.1: Wilhelm Konrad Roentgen (1845-1923). Wellcome Images. 6.2. BRAGG FATHER AND SON 121

Figure 6.2: X-ray photograph by W.K. Roentgen. Wellcome Images.

They struck the platinum anode with very high velocity, knocking electrons out of the inner parts of the platinum atoms. As the outer electrons fell inward to replace these lost inner electrons, electromagnetic waves of very high frequency were emitted. On January 23, 1896, Roentgen gave the first public lecture on X-rays; and in this lecture he demonstrated to his audience that X-ray photographs could be used for med- ical diagnosis. When Roentgen called for a volunteer from the audience, the 79 year old physiologist, Rudolf von K¨olliker stepped up to the platform, and an X-ray photograph was taken of the old man’s hand. The photograph, still in existence, shows the bones beautifully. Wild enthusiasm for Roentgen’s discovery swept across Europe and America, and soon many laboratories were experimenting with X-rays. The excitement about X-rays led indirectly to a third revolutionary discovery - radioactivity.

6.2 Bragg father and son

Sir William Henry Bragg (1862-1942) graduated from Trinity College, Cambridge Univer- sity. with first class honors in mathematics in 1885. In that year, at the age of 25, he was appointed Professor of Mathematics and Experimental Physics at the University of Adelaide in Australia. In 1908, Bragg returned to England after 23 years in Australia. 122 QUANTUM THEORY

Figure 6.3: Max von Laue (1879-1960). 6.2. BRAGG FATHER AND SON 123

Figure 6.4: Sir William Henry Bragg (1862-1942). He and his son, Lawrence Bragg shared the 1915 Nobel Prize in Physics “for their services to the analysis of crystal structure by means of X-rays”. He studied with J.J. Thomson at Cambridge University after having won a scholarship to Trinity College in 1885. X-ray crystallography, pioneered by Bragg and his son, has proved to be enormously important both in chemistry and in biology. It has allowed us to understand the structure of both organic and inorganic molecules, and initiated the science of molecular biology. 124 QUANTUM THEORY

Figure 6.5: Sir William Lawrence Bragg (1890-1971). 6.3. J.D. BERNAL AND DOROTHY CROWFOOT HODGKIN 125

Figure 6.6: Bragg’s law of diffraction. Two beams with identical wavelength and phase approach a crystalline solid and are scattered off two different atoms within it. The lower beam traverses an extra length of 2d sin θ. Constructive interference occurs when this length is equal to an integer multiple of the wavelength of the radiation.

6.3 J.D. Bernal and Dorothy Crowfoot Hodgkin

In England, J.D. Bernal and Dorothy Crowfoot Hodgkin pioneered the application of X- ray diffraction methods to the study of complex biological molecules. In 1949, Hodgkin determined the structure of penicillin; and in 1955, she followed this with the structure of vitamin B12. In 1960, Max Perutz and John C. Kendrew obtained the structures of the blood proteins myoglobin and hemoglobin. This was an impressive achievement for the Cambridge crystallographers, since the hemoglobin molecule contains roughly 12,000 atoms. The structure obtained by Perutz and Kendrew showed that hemoglobin is a long chain of amino acids, folded into a globular shape, like a small, crumpled ball of yarn. They found that the amino acids with an affinity for water were on the outside of the globular molecule; while the amino acids for which contact with water was energetically unfavorable were hidden on the inside. Perutz and Kendrew deduced that the conformation of the protein - the way in which the chain of amino acids folded into a 3-dimensional structure - was determined by the sequence of amino acids in the chain. In 1966, D.C. Phillips and his co-workers at the Royal Institution in London found the crystallographic structure of the enzyme lysozyme (an egg-white protein which breaks down the cell walls of certain bacteria). Again, the structure showed a long chain of amino acids, folded into a roughly globular shape. The amino acids with hydrophilic groups were on the outside, in contact with water, while those with hydrophobic groups were on the inside. The structure of lysozyme exhibited clearly an active site, where sugar molecules of bacterial cell walls were drawn into a mouth-like opening and stressed by electrostatic forces, so that bonds between the sugars could easily be broken. 126 QUANTUM THEORY

Figure 6.7: J.D. Bernal, (1901-1971). 6.3. J.D. BERNAL AND DOROTHY CROWFOOT HODGKIN 127

Figure 6.8: Dorothy Crowfoot Hodgkin (1910-1994). She and her mentor J.D Bernal were great pioneers in the application of X-ray crystallogrography to determination of the structure of biological molecules, such as proteins. She was awarded the Nobel Prize in Chemistry in 1964. 128 QUANTUM THEORY

Figure 6.9: Sir John C. Kendrew (1917-1997) with an unfinished model of myo- globin. He shared the 1962 Nobel Prize in Chemistry with Peruz, Crick, Wat- son and Wilkins.

6.4 The structure of DNA: Molecular biology

The discovery of the molecular structure of DNA was an event of enormous importance for genetics, and for biology in general. The structure was a revelation! The giant, helical DNA molecule was like a twisted ladder: Two long, twisted sugar-phosphate backbones formed the outside of the ladder, while the rungs were formed by the base pairs, A, T, G and C. The base adenine (A) could only be paired with thymine (T), while guanine (G) fit only with cytosine (C). Each base pair was weakly joined in the center by hydrogen bonds - in other words, there was a weak point in the center of each rung of the ladder - but the bases were strongly attached to the sugar-phosphate backbone. In their 1953 paper, Crick and Watson wrote: ”It has not escaped our notice that the specific pairing we have postulated suggests a possible copying mechanism for genetic material”. Indeed, a sudden blaze of understanding illuminated the inner workings of heredity, and of life itself. If the weak hydrogen bonds in the center of each rung were broken, the ladderlike DNA macromolecule could split down the center and divide into two single strands. Each single strand would then become a template for the formation of a new double-stranded molecule. Because of the specific pairing of the bases in the Watson-Crick model of DNA, the two strands had to be complementary. T had to be paired with A, and G with C. Therefore, if the sequence of bases on one strand was (for example) TTTGCTAAAGGTGAACCA... , then the other strand necessarily had to have the sequence AAACGATTTCCACTTGGT... The Watson-Crick model of DNA made it seem certain that all the genetic information 6.4. THE STRUCTURE OF DNA: MOLECULAR BIOLOGY 129 needed for producing a new individual is coded into the long, thin, double-stranded DNA molecule of the cell nucleus, written in a four-letter language whose letters are the bases, adenine, thymine, guanine and cytosine. In 1956, George Emil Palade of the Rockefeller Institute used electron microscopy to study subcellular particles rich in RNA (ribosomes). Ribosomes were found to consist of two subunits - a smaller subunit, with a molecular weight one million times the weight of a hydrogen atom, and a larger subunit with twice this weight. It was shown by means of radioactive tracers that a newly synthesized protein molecule is attached temporarily to a ribosome, but neither of the two subunits of the ribosome seemed to act as a template for protein synthesis. Instead, Palade and his coworkers found that genetic information is carried from DNA to the ribosome by a messenger RNA molecule (mRNA). Electron microscopy revealed that mRNA passes through the ribo- some like a punched computer tape passing through a tape-reader. It was found that the adapter molecules, whose existence Crick had postulated, were smaller molecules of RNA; and these were given the name “transfer RNA” (tRNA). It was shown that, as an mRNA molecule passes through a ribosome, amino acids attached to complementary tRNA adaptor molecules are added to the growing protein chain. The solution of the DNA structure in 1953 initiated a new kind of biology - molecular biology. This new discipline made use of recently-discovered physical techniques - X- ray diffraction, electron microscopy, electrophoresis, chromatography, ultracentrifugation, radioactive tracer techniques, autoradiography, electron spin resonance, nuclear magnetic resonance and ultraviolet spectroscopy. In the 1960’s and 1970’s, molecular biology became the most exciting and rapidly-growing branch of science. Since DNA was known to carry the genetic message, coded into the sequence of the four nucleotide bases, A, T, G and C, and since proteins were known to be composed of specific sequences of the twenty amino acids, it was logical to suppose that the amino acid sequence in a protein was determined by the base sequence of DNA. The information somehow had to be read from the DNA and used in the biosynthesis of the protein. It was known that, in addition to DNA, cells also contain a similar, but not quite identical, polynucleotide called ribonucleic acid (RNA). The sugar-phosphate backbone of RNA was known to differ slightly from that of DNA; and in RNA, the nucleotide thymine (T) was replaced by a chemically similar nucleotide, uracil (U). Furthermore, while DNA was found only in cell nuclei, RNA was found both in cell nuclei and in the cytoplasm of cells, where protein synthesis takes place. Evidence accumulated indicating that genetic information is first transcribed from DNA to RNA, and afterwards translated from RNA into the amino acid sequence of proteins. The crystallographic determination of the structures of DNA and RNA and proteins opened the way for a new field of science, Molecular Biology, now one of the most rapidly developing of all research fields. 130 QUANTUM THEORY

Figure 6.10: Maurice Wilkins (1916-2004). He applied to DNA the X-ray diffrac- tion methods pioneered by Dorothy Hodgkin. It was his work, and that of Rosalind Franklin, together with Linus Pauling’s model-building methods, that enabled Crick and Watson to correctly solve the structure of DNA. He shared the 1962 Nobel Prize in Physiology or Medicine with them. 6.4. THE STRUCTURE OF DNA: MOLECULAR BIOLOGY 131

Figure 6.11: Rosalind Franklin (1920-1958). It was one of her high-quality diffraction photographs, taken in Maurice Wilkins’ laboratory, that proved to be critical for the DNA structure. She might have shared the Nobel Prize with Wilkins, Crick and Watson, but before this could be considered by the committee, she died of ovarian cancer. 132 QUANTUM THEORY

Figure 6.12: Francis Crick (1916-2004) and James Dewey Watson (born 1928) at the Cavendish Laboratory with their model of DNA. After their discovery of the structure of DNA, it became clear that it was this molecule that carried genetic information between generations. 6.5. DIRECT AND RECIPROCAL LATTICE VECTORS 133 6.5 Direct and reciprocal lattice vectors

In order to understand crystal structure, X-ray diffraction, and the quantum mechanical treatment of crystals, we can begin by considering a 2-dimensional lattice points in space defined by the relationship

Rn = n a + n a n = 0, 1, 2, 3, n = 0, 1, 2, 3, (6.1) 1 1 2 2 1 ± ± ± ··· 2 ± ± ± ···

Here a1 and a2 are vectors defining the unit cell of the 2-dimensional system in direct space. For example, the substance graphite, from which pencil leads are made, consists of planar arrays in which carbon atoms are tightly bound together in linked hexagonal arrays. These planes are much more loosely bound together, so it makes sense to treat each graphite plane by itself. In the case of the 2-dimensional lattice of a graphite plane, the vectors a1 and a2 are given by

3 √3 a1 = d, d (2 − 2 )

a2 = 0, √3d (6.2) n o Here d is a small length defining the size of the unit cell. We can also define a reciprocal lattice, whose points in momentum-space are given by the relationships

Gm = m b + m b m = 0, 1, 2, 3, m = 0, 1, 2, 3, (6.3) 1 1 2 2 1 ± ± ± ··· 2 ± ± ± ··· We would like the function f(r) defined by the Fourier series

iGm·r f(r) = fme (6.4) m X to have the periodicity of the direct lattice. In other words, we would like this Fourier series to have the property:

iGm·(r+Rn) f(r + Rn) = fme = f(r) (6.5) m X regardless of the values of the coefficients fm. For this to be the case, we require that

eiGm·Rn = 1 (6.6) a relationship that holds only when

Gm Rn = 2πN N = 0, 1, 2, 3, (6.7) · ± ± ± ···

Remembering the definitions of Gm and Rn, we have

(m b + m b ) (n a + n a ) = 2πN N = 0, 1, 2, 3, (6.8) 1 1 2 2 · 1 1 2 2 ± ± ± ··· 134 QUANTUM THEORY

Figure 6.13: The arrangement of carbon atoms, tightly bound together in a graphite plane, like a giant planar molecule.

If

bi aj = δi,j (6.9) · then the desired relationship will be fulfilled, since any product of integers is also an integer. We are now able to determine the vectors b1 and b2. In the example of the graphite plane, the conditions

b a = 1 1 · 1 b a = 0 1 · 2 b a = 1 2 · 2 b a = 0 (6.10) 2 · 1 can be solved to yield

2 b = , 0 1 3d   1 1 b2 = , (6.11) 3d √3 d   6.6. A HUCKEL¨ CALCULATION FOR A GRAPHITE PLANE 135

Figure 6.14: In a crystal of graphite, the planes interact with each other very weakly, so it makes sense to treat each plane by itself. 6.6 A H¨uckel calculation for a graphite plane

We can introduce a basis functions with three indices, n , n2, j . Here n and n2 are the | 1 i 1 integers the specify a particular direct lattice vector, Rn = n1a1 + n2a2 that indicates the position of a particular unit cell. Each unit cell of a graphite plane contains 2 carbon atoms and the index j = 1, 2 indicates on which of these the basis function is located. In the H¨uckel approximation, the matrix elements of the Hamiltonian are given by α diagonal elements

0 0 0  n1, n2, j H n1, n2, j =  β nearest neighbors (6.12) h | | i   0 otherwise  Then   ik·(n1a1+n2a2) Hj0,j(k) = 0, 0, j H n1, n2, j e n ,n h | | i X1 2 α β 1 + eik·a1 + eik·(a1+a2) = (6.13)    β 1 + eik·a1 + eik·(a1+a2) α   Solving the quadratic equation

det H 0, (k) E (k)δ 0, = 0 (6.14) | j j − b j j| 136 QUANTUM THEORY

we find two bands of allowed energy levels:

E±(k) = α β 3 + 2 cos(k a ) + 2 cos(k a ) + 2 cos(k (a + a )) (6.15) ± | | { · 1 · 2 · 1 2 } The allowed values of k can be determined by imposing periodic boundary conditions. Suppose that we impose periodic boundary conditions at the edges of a parallelogram in the graphite plane, and that the edges are determined by the vector na1 and na2. Then the parallelogram will contain N = n2 unit cells. The periodic boundary conditions require that

eink·a1 = 1 eink·a2 = 1 (6.16)

These boundary conditions will be satisfied if 2π b b k = m 1 + m 2 n 1 − 2 2 − 2     m = 1, 2, 3, , n 1 ··· m = 1, 2, 3, , n (6.17) 2 ···

Other values of m1 and m2 are possible, but they do not lead to any new independent solutions.

6.7 3-dimensional crystal lattices

The discussion given above can be extended to 3-dimensional crystal lattices with only a slight increase in complexity. For 3-dimensional lattices, the direct lattice vectors are given by

Rn = n a + n a + n a nj = 0, 1, 2, 3, j = 1, 2, 3 (6.18) 1 1 2 2 3 3 ± ± ± ··· While the reciprocal lattice vectors are

Gm = m b + m b + m b mj = 0, 1, 2, 3, j = 1, 2, 3 (6.19) 1 1 2 2 3 3 ± ± ± ··· The relationship

bi aj = δi,j (6.20) · will be fulfilled if a a b = 2π 2 × 3 1 a a a 1 · 2 × 3 a3 a1 b2 = 2π × a1 a2 a3 a· ×a b = 2π 1 × 2 (6.21) 3 a a a 1 · 2 × 3 6.8. QUANTUM TREATMENT OF ELECTRONS IN CRYSTALS 137 where the vector product of two vectors A and B is defined by the relationship: A B A B A B ,A B A B ,A B A B (6.22) × ≡ { 2 3 − 3 2 3 1 − 1 3 1 2 − 2 1} The sum of any two direct lattice vectors Rn and Rn0 is a direct lattice vector, and the difference between them is also a direct lattice vector. Similarly, if we take the sum or difference between two reciprocal lattice vectors, Gm and Gm0 . we obtain a reciprocal lattice vector. These relationships follow from the fact that sums of integers and differences between integers are always integers.

6.8 Quantum treatment of electrons in crystals

The electrons in a crystal move in a periodic potential of the form

iGm·r V (r) = Vme (6.23) m X where the Fourier coefficients of the potential, Vm depend on the nature of the crystal, and where the vectors Gm are the appropriate reciprocal lattice vectors. We now introduce a set of basis functions of the form:

ik·r ψk(r) = uk(r)e (6.24) where uk(r) has the periodicity of the crystal lattice, and can therefore be represented by the Fourier series:

iG 0 ·r uk(r) = um0 e m (6.25) m0 X 6.9 The nearly-free electron approximation

In this approximation we let the basis functions be plane waves, so that 1 u (r) = k √ 1V ψ (r) = eik·r (6.26) k √ V where is the volume of the crystal. Then the matrix elements of the potential between basis functionsV are given by

3 ∗ 0 d x ψ 0 (r)V (r)ψk(r) k V k k ≡ h | | i Z 1 0 = d3x ei(k−k )·rV (r) V Z V 0 = m d3x ei(k−k +Gm)·r (6.27) m X V Z 138 QUANTUM THEORY

Non-zero matrix elements occur only when k k0 is exactly equal to a reciprocal lattice vector. We can use perturbation theory to treat− the electrons in a crystal in the nearly-free electron approximation. Letting 0 1 ψk(r) = ψ (r) + ψ (r) + (6.28) k k ··· where 1 ψ0(r) = eik·r (6.29) k √ V If the perturbation produced by the potential is small, then the zeroth-order energy is given by 2 2 2 0 ~ ~ k Ek = k k = (6.30) 2me · 2me just as would be the case for completely free electrons. If we take only the first-order correction into account, the wave function is given approximately by 0 1 ψk(r) = ψk(r) + ψk(r) 0 1 k V k 1 0 = eik·r + h | | i eik ·r (6.31) √ k02 k2 √ k06 k V X= − V while the energy becomes 0 1 2 Ek = Ek + Ek + Ek 2 2 0 0 ~ k 2me k V k k V k = + k V k + 2 h | |02 ih |2 | i (6.32) 2me h | | i k k ~ k06 k X= − 0 Using the Fourier series expansion of V (r) and remembering that eik ·r is required to obey periodic boundary conditions, we can evaluate matrix elements of the potential:

0 0 1 3 i(k−k +Gm)·r k V k = Vm d x e h | | i m V X Z 0 = Vmδk −k, Gm (6.33) m X The matrix elements are zero unless k0 k is exactly equal to a reciprocal lattice vector. Substituting this result into our expression− for the energies, we obtain: 0 1 2 Ek = Ek + Ek + Ek 2 2 2 ~ k 2me Vm = + V0 + 2 | | (6.34) 2me (k Gm) (k Gm) k k ~ G 6 Xm=0 − · − − · The denominator of the second term vanishes when

(k Gm) (k Gm) k k = 0 (6.35) − · − − · This condition will be fulfilled whenever a vector k touches a plane perpendicularly bisecting a reciprocal lattice vector. Such a plane is defined to be the boundary of a Brillion zone. 6.9. THE NEARLY-FREE ELECTRON APPROXIMATION 139

Figure 6.15: The first, second, third and fourth Brillouin zones in a plane of a simple cubic reciprocal lattice.

Figure 6.16: The first Brillouin zone in the reciprocal lattice of a face-centered crystal.

The band gap at the edge of a Brillion zone As we approach the boundary of a Brillion zone (i.e as we approach a plane perpendicularly bisecting a reciprocal lattice vector), the effect of the periodic potential V (r) becomes more and more pronounced. Finally, very near to the boundary, the perturbation series fails to converge. ear to the boundary, we can approximate the wave function by a linear combination of two nearly degenerate zeroth-order wave functions: 1 1 ψ (r) = C eik·r + C eik·(r+Gm) (6.36) k,m 1 √ 2 √ V V Then 2 ~ 2 1 ik·r 1 ik·(r+Gm) + V (r) Ek.m C1 e + C2 e = 0 (6.37) −2me ∇ − √ √    V V  Multiplying from the left by e−ik·r and integrating over the volume of the crystal, and then doing the same with eik·(r+Gm), we obtain two simultaneous equations which will have a 140 QUANTUM THEORY

solution provided that the secular determinant vanishes

2 2 ~ k + V0 Ek,m Vm 2me −

det = 0 (6.38)

2 2 ~ k Vm + V0 Ek,m 2me −

Thus, finally, we obtain the energy at the Brillouin zone boundary

~2k2 Ek,m = + V0 Vm (6.39) 2me ±

and we see that the band gap is 2Vm.

6.10 Molecular crystals

A molecular crystal is a weakly-bound periodic arrangement of tightly-bound constituent molecules. Examples are crystals of benzene or naphthalene. The low melting point of benzene reflects the small amount of energy required to separate the benzene molecules from one another. A much larger amount of energy would be needed to break the covalent bonds of all the benzene rings. The wave function of the electrons in a molecular crystal can be built from products of the wave functions of the individual molecules. Suppose that we have found wave functions representing the ground state and an excited state of an isolated constituent molecule. We might do this by means of the Hartree-Fock SCF method, in which case the wave function of the isolated molecule would be antisymmetrized. In molecular crystals, overlap of wave the wave functions of neighboring molecules can be neglected, so antisummetrization over the entire crystal is unnecessary. The basis functions used to build up the electronic wave function of a molecular crystal in the Frenkel exciton picture can be simple products of wave functions of the constituent molecules. 0 We can separate the Hamiltonian of the crystal into two parts, H0 and H , where H0 represents the Hamiltonian of the individual molecules, while H0 represents the interaction between them. Our basis set will consist of eigenfunctions of H0. Let

0 = ∆ (1)∆ (2) ∆ (N) (6.40) | i 0 0 ··· 0

represent the eigenfunction of H0 with all N subunits in their ground state, and let

n = ∆ (1)∆ (2) ∆ζ (n) ∆ (N) (6.41) | i 0 0 ··· ··· 0

represent the eigenfunction of H0 with all the subunits in their ground states except the nth subunit, which is in its ζth excited state. In the simplest approximation, we can say 6.11. PERIODIC BOUNDARY CONDITIONS 141 that the effect of H0 is to hybridize the N degenerate states of the form shown in equation 0 (6.41). We can try to build up eigenfunctions of the complete Hamiltonian, H = H0 + H from linear combinations of these basis functions:

N k = n n k (6.42) | i | ih | i n=1 X In order that k should be an eigenfunction of H, we require that | i N 0 0 n H n n k = Ek n k (6.43) h | | ih | i h | i n=1 X Inserting the trial solution 1 n k = eik·an (6.44) h | i √N into equation (6.43), we obtain

N 0 ik·(an−a 0 ) Ek = n H n e n (6.45) h | | i n=1 X because of the translational symmetry of the crystal, the lattice sum shown in equation (6.45) depends only on the relative distance, an0 an, except at the surfaces of the crystal, where special boundary conditions must be imposed.− Written out in full, the wave function of the ζ-band is

N 1 ik·an k, ζ = ∆ (1)∆ (2) ∆ζ (n) ∆ (N) e (6.46) | i √ 0 0 ··· ··· 0 N n=1 X Here k is called the “exciton wave number”, while ζ is called the “band index”. States of this kind were introduced by J. Frenkel and R. Peierls to explain the optical properties of molecular crystals, and they have become known as “Frenkel exciton states”. A Frenkel exciton state is an excited state of the entire crystal. All of the electrons are involved, and the excited state is delocalized, although each electron is localized on its own monomeric subunit. The Frenkel exciton picture can be used to describe collective excited states of polymers, provided that they have translational symmetry, and also to describe arrays of atoms, if the atoms are sufficiently separated so that orbital overlap between them can be neglected.

6.11 Periodic boundary conditions

In solid state theory, periodic boundary conditions (sometimes called “Born-von K´arman boundary conditions”) are often used. These boundary conditions restrict the allowed 142 QUANTUM THEORY

values of the wave number k. Suppose, for example, we are using the Frenkel exciton picture to describe the collective excited states of a polymer with N subunits. Then the wave number k is not a vector, but reduces simply to a number, k. If we require that

eika1 = eikaN+1 (6.47)

This is the same as requiring that

eikaN+1 = eik(aN+1−a1) = eikL = 1 (6.48) eika1 where L is the length of the polymer. This condition will be fulfilled if 2πj k = j = 1, 2, ,N (6.49) L ··· No new linearly independent states are produced if higher values of the integer j are used. For a cubic crystal consisting of N 3 molecules, periodic boundary conditions require that

2πjx kx = jx = 1, 2, ,N L ··· 2πjy ky = jy = 1, 2, ,N L ··· 2πjz kz = jz = 1, 2, ,N (6.50) L ··· assuming that the crystal is also cubic in shape, so that each side has length L

(6.51)

6.12 Homogeneous boundary conditions

For cases where only interactions between nearest neighbors are important, another type of boundary condition (called “homogeneous”) is often used. Homogeneous boundary condi- tions are especially appropriate for small systems, such as small polymers. For an oligomer, where only nearest-neighbor interactions are important, the set of secular equations to be solved have the form

(α Ek) 1 k + β 2 k = 0 − h | i h | i β 1 k + (α Ek) 2 k + β 3 k = 0 h | i − h | i h | i β 2 k + (α Ek) 3 k + β 4 k = 0 h | i − h | i h | i β 3 k + (α Ek) 4 k + β 5 k = 0 h | i − h | i h | i β 4 k + (α Ek) 5 k + β 6 k = 0 h | i − .h | i h | i . . . . .

β N 1 k + (α Ek) N k = 0 (6.52) h − | i − h | i 6.12. HOMOGENEOUS BOUNDARY CONDITIONS 143

where β represents the nearest-neighbor interaction energy, while α is the energy of an isolated monomer. The symmetry of this set of secular equations is spoiled by the lack of coupling to anything at the two ends of the polymer. We can, however, restore the symmetry by adding a fictitious monomer at each end so that the equations become

β 0 k + (α Ek) 1 k + β 2 k = 0 h | i − h | i h | i β 1 k + (α Ek) 2 k + β 3 k = 0 h | i − h | i h | i β 2 k + (α Ek) 3 k + β 4 k = 0 h | i − h | i h | i β 3 k + (α Ek) 4 k + β 5 k = 0 h | i − h | i h | i β 4 k + (α Ek) 5 k + β 6 k = 0 h | i − .h | i h | i . . . . .

β N 1 k + (α Ek) N k + β N + 1 k = 0 (6.53) h − | i − h | i h | i and requiring that 0 k = N + 1 k = 0 (6.54) h | i h | i The set of secular equations (6.53), together with the boundary conditions (6.54), have normalized solutions of the form 1 n k = sin(nkd) (6.55) h | i rN + 1 where

d = an an (6.56) +1 − is the lattice spacing. The homogeneous boundary conditions require that (N + 1)kd = π (6.57) so that we obtain N linearly-independent states, with π 2π Nπ kd = , , , (6.58) N + 1 N + 1 ··· N + 1 Notice that regardless of whether we use periodic boundary conditions or homogeneous boundary conditions, we obtain the correct number of linearly independent eigenstates of the total Hamiltonian. We start with a set of N-fold degenerate basis functions, and the inter-monomeric interactions split these into a band of N exciton states. For a polymer treated with homogeneous boundary conditions, the exciton energies are

Ek = α + 2β cos(kd) (6.59) This expression also holds for a polymer treated with periodic boundary conditions, but the allowed values of k are slightly different, being given by (6.49) rather than (6.58). Similar results are obtained in the case of 3-dimensional crystals. 144 QUANTUM THEORY 6.13 Taylor series expansion of the inter-monomer in- teraction

The unperturbed Hamiltonian of a molecular crystal or polymer can be written in the form

N

H0 = H(n) (6.60) n=1 X where

H(n)∆ (n) = E ∆ (n) n = 1, 2, ,N 0 0 0 ··· H(n)∆ζ (n) = Eζ ∆ζ (n) n = 1, 2, ,N (6.61) ··· These eigenstates if the unperturbed Hamiltonian are orthonormal, and so we have the relationships

∗ dτn∆0(n)∆0(n) = 1 Z ∗ dτn∆ζ (n)∆ζ (n) = 1 Z ∗ dτn∆ζ (n)∆0(n) = 0 (6.62) Z We now introduce the 6-fold Taylor series expansion

1 ∂ ∂ 1 = 1 + (xi an) + 1 + (xj an0 ) + xi xj − ∂an ··· − ∂an0 ··· an an0 | − |     | − | = (6.63)

We then obtain, as the leading nonzero term in the inter-monomer interaction:

Dn Dn0 3(Dn R)(Dn0 R) n0 H0 n = · · · (6.64) h | | i R3 − R5 where

R an an0 (6.65) ≡ − and

∗ Dn e dτn∆ζ (n)(xj an)∆0(n) (6.66) ≡ j − X Z

the sum being taken over all the electrons in the nth monomer. The quantity Dn is called the “transition dipole moment”. 6.14. X-RAY DIFFRACTION EXPERIMENTS 145 6.14 X-ray diffraction experiments

It can be shown that if the electron density ρ(r) in a crystal is represented by the Fourier series:

iGm·r ρ(r) = Fme (6.67) m X then the intensity of an X-ray with wave number k scattered into a state with wave number k0 is given by

0 2 I(k k ) Fm (6.68) − ∼ | | The phase problem

2 In diffraction experiments, Fm is not measured, but only a quantity proportional to Fm , | | and thus Fm is determined only up to a constant of proportionality and a phase factor. A common method for overcoming this difficulty is to guess the atomic positions in a crystal, and then assume that the atoms are relatively unaffected by the presence of their neighbors. In this way, a trial electron denstiy distribution ρ(r) can be calculated, and hence the Fourier coefficients Fm. The calculated diffraction intensities thus generated can be compared with those experimentally observed. The atomic positions in the model are then changed slightly, and the comparison process is repeated and refined until a good fit is obtained. Very similar considerations also hold for electron diffraction experiments and diffraction experiments.

Suggestions for further reading 1. Nitske, Robert W., The Life of W. C. R¨ontgen,Discoverer of the X-Ray, University of Arizona Press, 1971. 2. Glasser, Otto (1933). Wilhelm Conrad R¨ontgenand the Early History of the Roentgen Rays. London: John Bale, Sons and Danielsson, Ltd 3. Neil W. Ashcroft and N. David Mermin, Solid State Physics (Harcourt: Orlando, 1976). 4. Bragg W (1913). The Diffraction of Short Electromagnetic Waves by a Crystal. Proceedings of the Cambridge Philosophical Society. 17: 43-57. 5. H. Lodish, A. Berk, S.L. Zipursky, P. Matsudaira, D. Baltimore, and J. Darnell, Molecular Cell Biology, 4th Edition, W.H. Freeman, New York, (2000). 6. Lily Kay, Who Wrote the Book of Life? A History of the Genetic Code, Stanford University Press, Stanford CA, (2000). 7. Sahotra Sarkar (editor), The Philosophy and History of Molecular Bi- ology, Kluwer Academic Publishers, Boston, (1996). 8. James D. Watson et al. Molecular Biology of the Gene, 4th Edition, Benjamin- Cummings, (1988). 146 QUANTUM THEORY

9. J.S. Fruton, Proteins, Enzymes, and Genes, Yale University Press, New Haven, (1999). 10. S.E. Lauria, Life, the Unfinished Experiment, Charles Scribner’s Sons, New York (1973). 11. A. Lwoff, Biological Order, MIT Press, Cambridge MA, (1962). 12. James D. Watson, The Double Helix, Athenium, New York (1968). 13. F. Crick, The genetic code, Scientific American, 202, 66-74 (1962). 14. F. Crick, Central dogma of molecular biology, Nature, 227, 561-563 (1970). 15. David Freifelder (editor), Recombinant DNA, Readings from the Scientific American, W.H. Freeman and Co. (1978). 16. James D. Watson, John Tooze and David T. Kurtz, Recombinant DNA, A Short Course, W.H. Freeman, New York (1983). Chapter 7

HARMONIC OSCILLATORS

7.1 Normal modes

Let us consider the small vibrations of a classical system of particles about the equilibrium positions. Suppose that the kinetic energy of the system is given by

d d 1 dxi dxj T = m δ (7.1) 2 i i,j dt dt i=1 j=1 X X while the leading term in a Taylor series expansion of the potential energy has the form

d d 1 V = V xixj (7.2) 2 i,j i=1 j=1 X X The coordinates x1, x2 , xd, which represent small displacements from the equilibrium positions of the particles,··· are by no means the most convenient ones for solving the equa- tions of motion of the system. We can bring the kinetic energy into a more convenient form by going over to the mass-weighted coordinates defined by

i i X √mi x i = 1, 2, , d (7.3) ≡ ··· In terms of these coordinates, the kinetic energy has the form

d d 1 dXi dXj T = δ (7.4) 2 i,j dt dt i=1 j=1 X X while the potential energy becomes

d d 1 V V = i,j XiXj (7.5) 2 mimj i=1 j=1 √ X X 147 148 QUANTUM THEORY

The mass-weighted coordinates are still not the most convenient ones that we can find, since the potential energy matrix Vi,j may contain off-diagonal terms. and we would like to get rid of these. We can find a unitary transformation which diagonalizes Vi,j/√mimj by solving the secular equations

d Vi,j (k)δi,j Uj,k = 0 (7.6) mimj − V j=1 √ X   Having performed the diagonalization, we can express the potential energy and the kinetic energy of the system in terms of the normal coordinates defined by

d d k i i q = X Ui,k = √mi x Ui,k (7.7) i=1 i=1 X X When we do this, the kinetic energy retains its diagonal form because of the unitarity of Uj,k:

d 1 dqk 2 T = (7.8) 2 dt k X=1   but the off-diagonal terms in the potential energy disappear:

d 1 V = (k)(qk)2 (7.9) 2 V k X=1 From (7.8) and (7.9) we can see that the Lagrangian of the system can be written in the form d

L = T V = Lk (7.10) − k X=1 where

k 2 1 dq k 2 Lk = (k)(q ) (7.11) 2 dt − V "  # The canonically conjugate momentum paired with the coordinate qk is defined in mechanics to be ∂L dqk p = = (7.12) k ∂q˙k dt he Hamiltonian of the system can be written in the form

d

H = T + V = Hk (7.13) k X=1 7.1. NORMAL MODES 149

where 1 H = p2 + ω2q2 (7.14) k 2 k k k  and

ωk = (k) (7.15) V In other words, when the Hamiltonian whichp represents small vibrations of a classical sys- tem is expressed in terms of the normal coordinates (or normal modes), it reduces to a sum of simple harmonic oscillator Hamiltonians. The normal coordinates are found by di- agonalizing the mass-weighted potential energy matrix. The harmonic oscillator frequency of each is found by taking the square root of the corresponding eigenvalue of the mass- weighted potential energy matrix.

To illustrate this procedure, we can think of a system, whose Lagrangian is given by

d d i j 1 dx dx i j m δi,j Vi,jx x (7.16) 2 dt dt − i=1 j=1 X X   where

2κ i = j Vi,j = (7.17) κ i = j 1 (− ± This Lagrangian corresponds to a linear system of point masses, each joined elastically to the next. Then the secular equations (7.6) have the form

κUk− ,k + [2κ (k)] Uk,k κUk ,k = 0 k = 2, . . . , d 1 (7.18) − 1 − V − +1 − The trial solution

2 Uj,k = sin(jka) (7.19) rd + 1 makes all of the secular equations redundant, All of them redundantly require that

(k) = κ [1 cos(ka)] (7.20) V − Imposing homogeneous boundary conditions (i.e clamping the two ends of the line) restricts the allowed values of k, and we must have

π 2π πd k = , , , (7.21) (d + 1)a (d + 1)a ··· (d + 1)a 150 QUANTUM THEORY

where (d + 1)a is the length of the chain. The frequency spectrum of the normal modes is given by

(k) 2κ [1 cos(2ka)] ωk = V = − (7.22) r m r m In terms of the normal mode coordinates and their time derivatives, the Lagrangian of the system becomes

2 1 dqk 2 L = (ωkqk) (7.23) 2 dt − k " # X   which can be recognized as a sum of harmonic oscillator Lagrangians.

7.2 Molecular vibrations and rotations

In the simplest possible approximation, we can regard a molecule (or a cluster in a non- melted state) as a collection of point masses held together by springlike bonds. When we calculate the normal modes of such a system, we always find that there are six zero- frequency modes. Three of these correspond to the degrees of freedom associated with translation of the whole system, and three with rotation. Let us use the symbols Rs to represent the equilibrium position of the atom s, and xs to represent the displacement of the atom from its equilibrium position. Then in our simple model, the classical potential energy of the molecule can be written in the form

N N 1 2 V = kst ( xs + Rs xt Rt Rs Rt ) (7.24) 2 | − − | − | − | t>s s=1 X X

Here kst represents the force constant of the “spring” which connects atom s with atom t. Let us also introduce the notation

Rst Rs Rt ≡ − xst xs xt (7.25) ≡ −

Then, if we assume that xst << Rst and expand V in a Taylor series, we obtain the leading term | | | |

N N kst 2 V 2 (Rst xst) ≈ Rst · t>s s=1 | | XNXN 1 3 3 Vs,µ t,νxs,µxt,ν (7.26) ≡ 2 ; t>s s=1 µ=1 ν=1 X X X X 7.3. COMMUTATION RELATIONS 151

where

xs (xs , xs , xs ) (7.27) ≡ 1 2 3 By diagonalizing the mass-weighted potential energy matrix V s,µ;t,ν (7.28) √msmt we can find the normal modes of the system, and as mentioned, six of them will be zero- frequency modes corresponding to translations and rotations of the entire system.

7.3 Commutation relations

Let us consider a simple harmonic oscillator whose Hamiltonian is given by 1 H = p2 + ω2q2 (7.29) 2 We can go over to a quantum treatment of the harmonic oscillator by letting let ∂ p = i~ (7.30) − ∂q so that

[p, q] pq qp = i~ (7.31) ≡ − − Thus ω2 ω2 [H, p] = q2, p = (q [q, p] + [q, p] q) = i~ω2q (7.32) 2 2 and similarly   1 1 [H, q] = p2, q = (p [p, q] + [p, q] q) = i~p (7.33) 2 2   Now suppose that n is an eigenfunction of H, so that | i

H n = En n (7.34) | i | i If we act on such an eigenfunction with an operator of the form p iωq, then we can show by means of the commutation relations that the result is also an eigenfunction± of H:

H(p + iωq) n = [H, p] + iω[H, q] + (p + iωq)H n | i { 2 } | i = i~ω q + iω( i~p) + (p + iωq)En n − | i = (En + ~ω)(p + iωq) n (7.35) { } | i 152 QUANTUM THEORY

and

H(p iωq) n = [H, p] iω[H, q] + (p iωq)H n − | i { 2 − − } | i = i~ω q iω( i~p) + (p iωq)En n − − − | i = (En ~ω)(p iωq) n (7.36) { − − } | i These equations show that (p iωq) n is an eigenfunction of H corresponding to the raised ± | i or lowered energy En ~ω. The operator p + iωq is thus a “raising operator” (sometimes called a “creation operator”).± When it acts on an eigenfunction of the harmonic oscillator Hamiltonian H, it produces another eigenfunction of H, whose energy is raised by an amount ~ω. Similarly, p iωq can be thought of as a “lowering operator”, or ”annihilation operator”. Acting on an− eigenfunction of H. it produces another eigenfunction, whose eigenvalue is lowered by an amount ~ω. If we continue to act on an eigenfunction of H with the lowering operator p iωq, we must eventually reach the ground state, a state whose energy cannot be lowered− further. Since it cannot lower the energy of the ground state, the lowering operator must give zero when it acts on it. Thus we have the relationship:

(p iωq) 0 = 0 (7.37) − | i where 0 represents the ground state. If we next act on this equation from the left with the raising| i operator, we obtain:

(p + iωq)(p iωq) 0 = p2 + ω2q2 + iω[p, q] 0 − | i | i = (2H ~ω) 0 − | i  = (2E0 ~ω) 0 = 0 (7.38) − | i and thus 1 E0 = ~ω (7.39) 2 Combining this with our previous results, we obtain the entire energy spectrum of a quan- tum harmonic oscillator: 1 En = n + ~ω n = 0, 1, 2, 3, (7.40) 2 ···   7.4 Phonon creation and annihilation operators

It is convenient to define a normalized raising operator, which we shall call a “creation operator”:

† a iN(p + iωq) (7.41) ≡ − 7.4. PHONON CREATION AND ANNIHILATION OPERATORS 153 and a normalized lowering operator, which we shall call an “annihilation operator”:

a iN(p iωq) (7.42) ≡ − The normalizing constant, N, is chosen in such a way that a† n = √n + 1 n + 1 (7.43) | i | i and

a n + 1 = √n + 1 n (7.44) | i | i Then

aa† n = √n + 1a n + 1 = (n + 1) n | i | i | i = N2(p iωq)(p + iωq) n − | i = N2 2H + iω[p, q] n { } | i = N2(n + 1)2~ω n (7.45) | i Solving for the normalization constant, we obtain 1 N = (7.46) √2~ω Thus i a† = − (p + iωq) √2~ω i a = (p iωq) (7.47) √2~ω − We can also express p and q in terms of the creation and annihilation operators:

~ω † p = i (a a) r 2 − ~ω † q = i (a + a) (7.48) − r 2 We can see from these relationships that the creation and annihilation operators obey the commutation rules:

[a, a†] = 1 (7.49)

[a, a] = 0 (7.50)

[a†, a†] = 0 (7.51) 154 QUANTUM THEORY

The operator a†a is called the “number operator”, since it is easy to show from the rela- tionships given above that

a†a n = n n (7.52) | i | i We can also note that the harmonic oscillator Hamiltonian can be written in the form:

† 1 H = ~ω a a + (7.53) 2   7.5 Collections of harmonic oscillators

Let us now consider a system whose Hamiltonian is a sum of simple harmonic oscillator Hamiltonians:

2 2 2 † 1 H = Hk = pk + ωkqk = ~ωk akak + (7.54) 2 k k k   X X  X The commutation relationships for the momenta and coordinates of the system are

[pk, qk0 ] = i~δk,k0 (7.55) −

[pk, pk0 ] = 0 (7.56)

[qk, qk0 ] = 0 (7.57) while those of the creation and annihilation operators are

† [ak, ak0 ] = δk,k0 (7.58)

† † [ak, ak0 ] = 0 (7.59)

[ak, ak0 ] = 0 (7.60)

The eigenfunctions of the total Hamiltonian are products of simple harmonic oscillator eigenfunctions. If we use the symbol n1, n2, n3, to represent such an eigenfunction, then we can write: | · · · i

H n , n , n , = En .n .n ,··· n , n , n , | 1 2 3 · · · i 1 2 3 | 1 2 3 · · · i 1 = ~ωk nk + n1, n2, n3, (7.61) 2 | · · · i k X   Chapter 8

THE DIRAC EQUATION

8.1 Lorentz invariance and 4-vectors

Albert Einstein’s special theory of relativity was built on the negative result of the Michaelson- Morley experiment, an experiment that attempted to measure the absolute velocity of the earth through space. Einstein boldly postulated that no experiment whatever can measure absolute motion, that is to say, according to his postulate it is impossible for an observer to know whether he is in a state of rest or in a state of uniform motion. All inertial frames are equivalent. Einstein’s postulate has been amply confirmed by experiment, and today it is one of the basic principles of modern physics. The equivalence of all inertial frames can be expressed in another way: Every funda- mental physical law must exhibit symmetry between the space and time coordinates in such a way that ict enters on the same footing as the Cartesian coordinates x, y and z. (Here i √ 1, while c is the velocity of light, and t is the time.) In relativistic theory, space and≡ time− combine to form a pseudo-Euclidean space-time continuum (Minkowski space). A transformation from one inertial frame to another (a Lorentz transformation) corresponds to a rotation in this space, and such a transformation must leave all fundamental physical laws invariant in form. Every physical quantity that is represented by a 3-component vector in non-relativistic theory has a 4th component in the relativistic 4-dimensional space-time continuum. Thus, for example, the position vector x = (x, y, z) in 3-dimensional space has a 4th component in relativistic theory: xλ = (x, y, z, ict) = (x, ict) (8.1) while the vector potential A = (Ax,Ay,Az) in electromagnetic theory is the space compo- nent of a 4-vector, whose 4th component is i multiplied by the electrostatic potential φ:

Aλ = (Ax,Ay,Az, iφ) = (A, iφ) (8.2)

Similarly, the current density vector j = (jx, jy, jz) is the space-component of a 4-vector

jλ = (jx, jy, jz, icρ) = (j, icρ) (8.3)

155 156 QUANTUM THEORY whose time-component is ic multiplied by the charge density ρ. (Throughout this chapter we will represent 3-vectors by writing them in bold-face letters. Thus jλ = (j, icρ) means that the first three components of the 4-vector jλ are given by j = (jx, jy, jz), while the 4th component is icρ.) The gradient operator ∂ also becomes the space-component of a 4-vector in relativistic theory:

∂ ∂ ∂ ∂ i ∂ ∂λ , , , = ∂, (8.4) ≡ ∂x ∂x ∂x ∂x −c ∂t  1 2 3 4    while the Laplacian operator is replaced by the d’Alembertian operator:

4 4 2 2 ∂  ∂λ (8.5) ≡ ≡ ∂xλ λ λ X=1 X=1 an operator which exhibits the required space-time symmetry, so that its form is the same in all inertial frames. In relativistic electrodynamics, the electric field vector E and the magnetic field vector H are components of an antisymmetric tensor Fλ0,λ, which is related to Aλ by

0 z y i x H −H − E z 0 x i y Fλ0,λ ∂λ0 Aλ ∂λAλ0 =  −H H − E  (8.6) ≡ − y x 0 i z H −H − E  i x i y i z 0   E E E    The 4-vector Aλ, which represents the electromagnetic potential, is related to the 4-vector representing current density by 4π Aλ = jλ (8.7)  − c

When both the current density jλ and the electromagnetic potential 4-vector Aλ are independent of time, equation (8.7) reduces to:

2 4π Aλ(x ) = jλ(x ) (8.8) ∇1 1 − c 1 which has the Green’s function solution

1 1 A (x ) = d3x j (x ) (8.9) λ 1 c 2 x x λ 2 Z | 1 − 2| We can see that (8.9) is a solution to (8.8) because

1 2 = 4πδ3(x x ) 4πδ(x x )δ(y y )δ(z z ) (8.10) ∇1 x x − 1 − 2 ≡ − 1 − 2 1 − 2 1 − 2 | 1 − 2| 8.2. THE DIRAC EQUATION FOR AN ELECTRON IN AN EXTERNAL ELECTROMAGNETIC POTENTIAL157 and therefore

2 1 3 2 1 Aλ(x ) = d x jλ(x ) ∇1 1 c 2 ∇1 x x 2 Z | 1 − 2| 4π 3 3 = d x δ (x x ) jλ(x ) − c 2 1 − 2 2 Z 4π = jλ(x ) − c 1 (8.11)

The subscript 1 on the Laplacian operator means that the operator is acting on the coor- dinates of the field-point x1 rather than on the source-point, x2. Because of charge conservation, the current density 4-vector obeys the condition

4

∂λjλ = 0 (8.12) λ X=1 Since the current density is related to the electromagnetic potential 4-vector through (8.7), it is natural to work in the Lorentz gauge, where a similar condition is imposed on Aλ:

4

∂λAλ = 0 (8.13) λ X=1 Equations (8.7) and (8.13) are Maxwell’s equations in a vacuum, written in a form that makes the space-time symmetry apparent.

8.2 The Dirac equation for an electron in an external electromagnetic potential

P.A.M. Dirac’s relativistic wave equation for an electron moving in an external potential Aλ can be written in the form:

4 i γλ ∂λ Aλ + c χµ = 0 (8.14) − c "λ # X=1   where atomic units are used and where the γλ’s are 4 4 matrices: × 0 0 0 i 0 0 0 1 0 0 i −0 0 0 1− 0 γ = γ = (8.15) 1  0 i −0 0  2  0 1 0 0   i 0 0 0   1 0 0 0     −      158 QUANTUM THEORY

0 0 i 0 1 0 0 0 0 0− 0 i 0 1 0 0 γ = γ = (8.16) 3  i 0 0 0  4  0 0 1 0  −  0 i 0 0   0 0 0 1   −   −  In atomic units, the electron rest-mass is equal to 1, and Planck’s constant divided by 2π is also equal to 1, while the velocity of light has a value equal to the reciprocal of the fine structure constant: m0 = 1 ~ = 1 c = 137.036 (8.17)

From the definitions of the γλ’s, it follows that they anticommute:

γλ0 γλ + γλγλ0 = 2Iδλ0,λ (8.18)

In equation (8.18), I is a 4 4 unit matrix. Solutions to the 1-electron Dirac equation are 4-component spinors. ×

8.3 Time-independent problems

In the special case where the external electromagnetic potential 4-vector Aλ is independent of time, it is convenient to write the Dirac equation (8.14) in a different form, where we introduce the notation α = iγ γ γ γ (8.19) 0 0 ≡ 4 From equations (8.15), (8.16) and (8.19) it follows that the components of the 3-vector α can be written in block form as 0 σ α = j j = 1, 2, 3 (8.20) j σ 0  j  where, in the off-diagonal blocks, σj, j = 1, 2, 3 are the 2 2 Pauli spin matrices: × 0 1 0 i 1 0 σ = , σ = , σ = (8.21) 1 1 0 2 i −0 3 0 1      −  For time-independent problems, the Dirac equation for a single electron can then be written in the form: [H µ] χµ(x) = 0 (8.22) − where i H = icα ∂ A(x) + Iφ(x) + γ c2 (8.23) − · − c 0   is the Dirac Hamiltonian of an electron moving in a constant external electromagnetic potential, µ is the 1-electron energy, and χµ(x) is the 4-component time-independent 8.4. THE DIRAC EQUATION FOR AN ELECTRON IN THE FIELD OF A NUCLEUS159

spinor of the electron. The kinetic energy term in the Dirac Hamilton is given by

0 0 ∂3 ∂− 0 0 ∂ ∂ icα ∂ = ic  + − 3  (8.24) − · − ∂3 ∂− 0 0  ∂ ∂ 0 0   + − 3    where

∂± ∂ i∂ (8.25) ≡ 1 ± 2 Similarly, the part of the Dirac Hamiltonian involving potentials is

φ 0 A3 A− 0 φ −A −A α A + Iφ =  − + 3  (8.26) − · A3 A− φ 0 − −  A A 0 φ   − + 3    where

A± A iA (8.27) ≡ 1 ± 2

8.4 The Dirac equation for an electron in the field of a nucleus

When A(x) = 0, and φ(x) = Z/r, equation (8.22) reduces to −

Z 2 icα ∂ + γ c µ χµ(x) = 0 (8.28) − · − r 0 −   which is the Dirac equation for an electron moving in the attractive electrostatic potential of a nucleus with charge Z. Equation (8.28) can be solved exactly, and the solutions have the form ignjl(r)Ωj,l,M (θ, ϕ) χµ(x) = χnjlM (x) = (8.29)   fnjl(r)Ωj, j−l,M (θ, ϕ) − 2   Examples are shown in equations (??) and (??). In equation (8.29), the angular function Ωj,l,M (θ, ϕ) is a two-component “spherical spinor”, which is an eigenfunction of orbital an- gular momentum corresponding to the quantum number l, total angular momentum (or- bital plus spin) with quantum number j, and the z-component of total angular momentum, with quantum number M. The spherical spinors are built up from spherical harmonics and 2-component spinors by combining them with the appropriate Clebsch-Gordan coefficients in such a way as to produce eigenfunctions of total angular momentum. The Clebsch- Gordan coefficients that enter are different, depending on whether j = l + 1 or j = l 1 . 2 − 2 160 QUANTUM THEORY

1 When j = l + 2 ,

1  l + M + 2  Yl,M− 1 (θ, ϕ) s 2l + 1 2   Ωj,l,M (θ, ϕ) =   (8.30)    1   l M + 2   − Yl,M+ 1 (θ, ϕ)   s 2l + 1 2          while when j = l 1 , − 2

1  l M + 2  − Yl,M− 1 (θ, ϕ) −s 2l + 1 2   Ωj,l,M (θ, ϕ) =   (8.31)    1   l + M + 2   Yl,M+ 1 (θ, ϕ)   s 2l + 1 2          The radial function gnjl(r) is much larger than fnjl(r). The large and small radial functions are defined respectively by

γ−1 −Zr/n¯ gnjl(r) = r e (W (r) W (r)) (8.32) N 1 − 2 and 2 c nj γ−1 −Zr/n¯ fnjl(r) = 2 − r e (W1(r) + W2(r)) (8.33) N sc + nj where 1 2Zr W (r) nrF j + n + 1 2γ + 1 1 ≡ 2 − n¯  

1 2Zr W (r) (¯n κ)F j + n 2γ + 1 2 ≡ − 2 − n¯  

(8.34)

with (j + 1 ) j = l + 1 − 2 2 κ (8.35)  ≡ j + 1 j = l 1  2 − 2  1 2 Z 2 γ j + (8.36) ≡ 2 − c s    8.4. THE DIRAC EQUATION FOR AN ELECTRON IN THE FIELD OF A NUCLEUS161

1 nr n j (8.37) ≡ − − 2 and

2 1 n¯ n 2nr(j + γ) (8.38) ≡ r − 2 − Just as in the definition of the non-relativistic hydrogenlike orbitals, F (a b ζ) is a confluent hypergeometric function: | |

a a(a + 1) F (a b ζ) 1 + ζ + ζ2 + (8.39) | | ≡ b b(b + 1)2! ···

When Z 137, the 1-electron energies  c2 nj = (8.40) 2 1 + Z c(γ+nr) r   162 QUANTUM THEORY Chapter 9

INTERACTION BETWEEN MATTER AND RADIATION

9.1 Lagrangian densities for fields

Let us now turn our attention to the problem of how to treat continuous systems, or fields, using the Lagrangian formalism. We shall begin by considering a 3-dimensional space, with coordinates x, y and z, and later we will generalize to spaces of higher dimension and also to spaces where the metric is non-Euclidean. We have just been discussing normal modes for small vibrations of a classical system of particles about the equilibrium positions. We might also wish to discuss the normal modes of a violin string or a drumhead, or the harmonics of sound waves inside a closed container. In these examples, the medium can be regarded as continuous as long as we are considering low frequencies, but at high frequencies, where the wavelength is comparable to the distance between the particles of which the system is composed, this picture breaks down. But at low frequencies, the displacement from equilibrium can be treated as a field. There are other fields that one might wish to consider, such as electromagnetic fields. What we now wish to discuss is the problem of how to start with a wave equation in classical mechanics, and how to pass over to a quantum mechanical treatment of the system. In order to do this, it is useful to introduce the concept of a Lagrangian density, , This is sometimes defined as the kinetic energy per unit volume of space, minus theL density of potential energy. However, in some cases, for example when the Lagrangian must be invariant under a Lorentz transformation, this definition cannot be used, and we must instead define the Lagrangian density as that which gives the correct wave equations. The usual Lagrangian of the system is the Lagrangian density integrated over the whole volume of the system.

L = dx dy dz (9.1) L ZZZ The variational principle from which the equations of motion can be derived can be written

163 164 QUANTUM THEORY

in the form

δL dt = δ dx dy dz dt = 0 (9.2) L Z ZZZZ Suppose that the continuous system is described by a number of fields, φ1, φ2, , φn, which are continuous functions of the space and time coordinates x, y, z and t. For example,··· φ1, φ2, might represent the components of the electromagnetic field. Suppose also that the Lagrangian··· density can be expressed in terms of the fields and their first derivatives with respect to the space and time coordinates, so that j = 1, 2, , n ∂φj ··· = φj, µ µ = 1, 2, 3, 4 (9.3) L L ∂x  4    x = ict If we change one of the fields by a small amount, the resulting change in the Lagrangian density is given by

4 ∂ ∂ ∂(δφi) δ = L δφi + L µ µ (9.4) L ∂φi ∂(∂φi/∂x ) ∂x µ=1 X The variational principle then becomes

4 ∂ ∂ ∂(δφi) δL dt = L + L µ µ dx dy dz dt (9.5) ∂φi ∂(∂φi/∂x ) ∂x " µ=1 # Z ZZZZ X The second term on the right-hand side of (9.5) can be integrated by parts in a manner analogous to equations (??)-(??). Thus, for example, we can write

∂ ∂(δφi) L dx dy dz dt ∂(∂φ /∂x) ∂x ZZZZ i ∂ xb = L δφi dy dz dt ∂(∂φi/∂x) ZZZ xa ∂ ∂ L δφi dx dy dz dt (9.6) − ∂x ∂(∂φ /∂x) ZZZZ i The first term to the right of the = sign is required to vanish because the variation must be zero at the end points. In this way, we can rewrite the variational principle (9.2) in the form: ∂ 4 ∂ ∂ δL dt = L µ L µ δφi dx dy dz dt = 0 (9.7) ∂φi − ∂x ∂(∂φi/∂x ) " µ=1 # Z ZZZZ X Since the variation δφi is arbitrary, (9.7) can be fulfilled only if

∂ 4 ∂ ∂ L µ L µ = 0 (9.8) ∂φi − ∂x ∂(∂φi/∂x ) µ=1 X 9.2. ELECTROMAGNETIC POTENTIALS 165

Thus if we are able to write down the Lagrangian density of a continuous system, the the variational principle leads us to the equations of motion (9.8). Conversely, knowledge of the equations of motion of a continuous system can be used to infer the Lagrangian density from which they are derived. The canonical conjugate of the field φi is called the momentum density, and is defined as ∂ πi L (9.9) ≡ ∂(∂φi/∂t) and the Hamiltonian density , corresponding to the Lagrangian density , is given by H L n ∂φi = πi (9.10) H ∂t − L i=1 X Notice that equation (9.8) is invariant under a Lorenz transformation. but equation (9.10) is not, since the time coordinate is singled out for special treatment. This corresponds to the fact that energy, by itself, is not a Lorenz-invariant quantity, but only the 4th component of a 4-vector. It is possible to perform approximately relativistic calculations using the Hamiltonian formulation, but the Lagrangian formalism should be used rather than the Hamiltonian in cases where Lorentz invariance is important. Having found the equations of motion (9.8), we can next solve them, subject to the boundary conditions, and find solutions corresponding to pure harmonic oscillations. The fields are then expressed as sums of these harmonic modes. When the sums are substituted into the Lagrangian L = dx dy dz, then L can be recognized as a sum of simple harmonic oscillator Lagrangians.L The quantization of these can proceed in the standard way. RRR 9.2 Electromagnetic potentials

Let us now attempt to apply these methods to the electromagnetic potential 4-vector, Aλ. This is a case where the definition of the Lagrangian density as the density of kinetic energy minus the density of potential energy cannot be used. We must instead use the concept of the Lagrangian density as a quantity which leads, through the variational principle, to the correct wave equations. In the case of the electromagnetic potentials, we know that they obey the wave equation

2 4π Aλ = jλ jλ = 1, 2, 3, 4 (9.11)  − c where

4 ∂2 1 ∂2 2 = 2 x4 = ict (9.12)  ≡ ∂xµ2 ∇ − c2 ∂t2 µ=1 X We must invent a Lagrangian density that leads to the wave equation (9.11). The way to do this is to try all possibilities, starting with the simplest ones, always remembering 166 QUANTUM THEORY

that the forms must be Lorentz invariant. The Lagrangian density which actually leads to equation (9.11) is

4 4 2 4 1 ∂Aλ 4π = + Aλjλ (9.13) L −2 ∂xµ c λ µ=1 λ X=1 X   X=1 From (9.8) we have:

∂ 4 ∂ ∂ L µ L µ = 0 (9.14) ∂Aλ − ∂x ∂(∂Aλ/∂x ) µ=1 X With the trial Lagrangian density of equation (9.13), we have ∂ 4π L = jλ ∂Aλ c 4 4 ∂ ∂ ∂ ∂Aλ 2 µ L µ = µ µ =  Aλ (9.15) − ∂x ∂(∂Aλ/∂x ) ∂x ∂x µ=1 µ=1 X X Substituting these results into the Euler-Lagrange equations (9.14) gives 4π j + 2A = 0 (9.16) c λ  λ which is just the wave equation that we wanted to reproduce. The next step is to solve the wave equation and find the harmonic normal modes of the system. However, we can see that if we include the term involving the current density, the solutions will not necessarily be harmonic. Therefore we omit the interaction term, hoping to include it later by means of perturbation theory. This leaves us with the homogeneous wave equation:

2  Aλ = 0 (9.17) which has harmonic solutions of the form

ik·x ∗ −ik·x Ak,λ = ak,λ(t)e + a (t)e (9.18) N k,λ Here the values of k are restricted by periodic boundary conditions imposed at the faces of a box of normalization of volume V . is a constant which we will choose later in N ∗ a convenient way. ak,λ and its complex conjugate ak,λ are Fourier coefficients which are harmonic functions of time. Substituting (9.18) into (9.17), we can see that Ak,λ will be a solution, provided that ∂2a k,λ + ω2a = 0 ∂t k k,λ ∂2a∗ k,λ + ω2a∗ = 0 (9.19) ∂t k k,λ 9.2. ELECTROMAGNETIC POTENTIALS 167

where ωk = k /c. Having found harmonic solutions to the wave equation, we must ex- pand the potentials| | in terms of these, and substitute them back into the Lagrangian. We then hope to be able to recognize the Lagrangian as a sum of simple harmonic oscillator Lagrangians. Thus, expanding the potentials in terms of their normal modes, we write

0 ik·x ∗ −ik·x Aλ(x, t) = ak,λe + a e (9.20) N k,λ k X  In (9.20), the sum runs over only half of k-space for the following reason: By writing the harmonic solutions in the form (9.18), (i.e as a function plus its complex conjugate), we ensure that they shall be real. However, we do not want the solution corresponding to positive k to repeat that corresponding to k. Therefore we restrict the sum to half of k-space, and we denote this with a prime on− the summation. From (9.20), we have

0 ∂Aλ ik·x ∗ −ik·x = ikµak,λe ikµa e µ = 1, 2, 3 (9.21) ∂xµ N − k,λ k X  and

0 ∗ ∂Aλ ∂ak,λ ∂ak,λ = N eik·x + e−ik·x (9.22) ∂x4 ic ∂t ∂t k X   Let us now calculate the Hamiltonian density associated with the electromagnetic po- tentials. According to our earlier discussion, theH Hamiltonian density is given by

4 ∂Aλ ∂ = L H ∂t ∂(∂Aλ/∂t) − L λ=1   X 0 4 3 2 2 4 1 ∂Aλ ∂Aλ 4π = jλAλ 2 ∂xµ − ∂x4 − c λ "µ=1 # λ X=1 X     X=1 = + + (9.23) Hradiation Hinteraction ≡ Hr Hi

Now we know that the Fourier coefficients ak,λ depend harmonically on time, so that

∂ak,λ = ikcak,λ ∂t ± ∂a∗ k,λ = ikca∗ (9.24) ∂t ∓ k,λ Also we know that the box-normalized functions eik·x/√V obey the orthonormality rela- tions

1 3 i(k−k0)·x d x e = δ 0 (9.25) V k,k Z 168 QUANTUM THEORY

Substituting (9.25) and (9.24) into (9.23), we obtain the Hamiltonian for the free electro- magnetic field in the form

0 4 3 2 ∗ ∗ H = d x = V ak,λa + a ak,λ (9.26) r Hr N k,λ k,λ Z k λ=1 X X  ∗ ∗ Notice that the terms proportional to ak,λak,λ and ak,λak,λ do not occur in Hr because the sum over k is restricted to one half of k-space. With the substitutions 1 ak,λ = ( ipk,λ + kcqk,λ) √2kc − ∗ 1 ak,λ = (ipk,λ + kcqk,λ) (9.27) √2kc

Hr becomes

0 V 4 H = 2k p2 + k2c2q2 (9.28) r c N k,λ k,λ k λ=1 X X  This can be identified as the Hamiltonian of a collection of harmonic oscillators, provided that c = (9.29) N 2kV r If we make the identification a∗ a† , and choose the value of given by (9.29), then k,λ → k,λ N Hr takes on the form

0 1 4 H = kc a a† + a† a (9.30) r 2 k,λ k,λ k,λ k,λ k λ X X=1   This can be recognized as the Hamiltonian of a collection of harmonic oscillators with † frequencies ωk = kc. In the quantum treatment of such a system, ak,λ and ak,λ correspond to creation and annihilation operators, and they obey the commutation relations

† ak,λ, ak,λ = δk,k0 δλ,λ0 h † † i ak,λ, ak,λ = 0

h[ak,λ, ak,λi] = 0 (9.31)

(See equations (??)-(??)). Expanding the potentials in terms of the normal modes of the system, we have

0 c A = a eik·x + a† e−ik·x (9.32) λ 2kV k,λ k,λ r k X   9.2. ELECTROMAGNETIC POTENTIALS 169

The interaction Hamiltonian, expressed in terms of the photon creation and annihilation operators, then becomes:

4 4π 3 H = d x Aλ jλ i − c λ=1 Z 0 X4 4π 3 ik·x † 3 −ik·x = ak,λ d x e jλ + a d x e jλ (9.33) − √ k,λ k λ 2ωkV X X=1  Z Z  The potential Aλ is a 4-vector

Aλ = A, iϕ (9.34) { } of which the first three (spacelike) components are the vector potential, which is related to the magnetic field, while the last (timelike) component is the scalar or electrostatic potential. The current density jλ is also a 4-vector:

jλ = j, iρ (9.35) { } is also a 4-vector. Its first three components form the current density vector in 3-dimensional space, while the 4th component represents the charge density. For an electron in the state ψ satisfying the Dirac equation, its contribution to the current density is given by j = ecψ∗αψ (9.36) where the three components of the vector α are the Dirac matrices representing the velocity operator: 0 0 0 1 0 0 1 0 α = (9.37) 1  0 1 0 0   1 0 0 0    0 0 0 i 0 0 i −0 α = (9.38) 2  0 i 0 0  −  i 0 0 0     0 0 1 0  0 0 0 1 α = (9.39) 3  1 0 0− 0   0 1 0 0   −  In the non-relativistic limit, this reduces approximately to

e ∗ jN.R. ψ pψ (9.40) → me For an electron in the state ψ, its contribution to the charge density ρ is given by ρ = eψ∗ψ (9.41) 170 QUANTUM THEORY 9.3 Separation of the longitudinal and transverse po- tentials

The approximate Hamiltonian of the electron-photon system The sum of the Hamiltonian that represents the radiation field by itself and the Hamiltonian representing the interaction between matter and radiation can be written in the form:

4

Hr + Hi = Hλ (9.42) λ X=1 where

3 2 2 3 1 ∂Aλ ∂A4 1 Hλ = d x jλAλ (9.43) 8π ∂xµ − ∂x − c " (µ=1 4 ) # Z X     Thus the Hamiltonian can be split up into four parts, each of which depends on only one of the potentials. This means that the potentials would be independent of each other, except that the components of the 4-current jλ are not independent, but obey a conservation law. For most physical and chemical applications, it is convenient to treat the vector potential A in the way that we have outlined above, but to treat the scalar potential ϕ in a different way. The classical wave equation for the scalar potential is 4π 2A = j (9.44)  4 c 4 or 1 ∂2 2 ϕ = 4πρ = 4πeψ∗ψ (9.45) ∇ − c2 ∂t2 − −   If we are in the near neighborhood of a charge distribution, the time-dependence of ϕ can usually be neglected in comparison with 2ϕ, and we have the approximate relationship ∇ 2ϕ = 4πρ (9.46) ∇ − which has the solution ρ(x0) ϕ(x) = d3x0 (9.47) x x0 Z | − | We now introduce an approximation which is useful for practical calculations in atomic or molecular physics. We divide the charged particles in the universe into two parts: (1) the small system that we intend to study, and (2) all the rest. We assume that in some approx- imation, the vector potential in our small system can be neglected in comparison with the scalar potential. Conversely, we assume that for the interaction of our small system with 9.4. LINEAR POLARIZATION 171

the distant remainder of the universe, the scalar potential ϕ can be neglected in compari- son with the vector potential A. In this approximation, the Hamiltonian representing the interaction of the particles in our small system with other charges, both inside and outside it, is

1 3 3 Hi = d x j A + d x ρϕ (9.48) − c · Z Z where j, ρ and ϕ belong to the small system that we are studying, while the vector potential A belongs to the universe outside. We can write the approximate interaction Hamiltonian Hi in the form

3 2πhc 3 ik·x ∗ † 3 −ik·x ∗ Hi = ak,λ d x e ψ αλψ + a d x e ψ αλψ − V k k,λ k λ=1 r X X  Z Z  ψ∗(x)ψ(x)ψ∗(x0)ψ(x0) + e2 d3x d3x0 (9.49) x x0 Z Z | − | 9.4 Linear polarization

The equations

2 4π Aλ = jλ λ = 1, 2, 3, 4 (9.50)  − c are equivalent to Maxwell’s equations for free space, provided that we impose the subsidiary condition

4 ∂Aλ = 0 x4 ict (9.51) ∂xλ ≡ λ X=1 This is called the “Lorenz condition”, and it defines the “gauge” in which we will be working. The Lorenz condition is a natural one to impose, since the potentials Aλ arise from 4-currents which obey the conservation law:

4 ∂j λ = 0 (9.52) ∂xλ λ X=1 When A4 = 0, the Lorenz condition reduces to A = 0 (9.53) ∇ · which requires the divergence of the vector potential to vanish. Let us consider a case where A4 = 0, and where A represents a plane wave propagating through space with wave number k. Then we can write A in the form

i(k·x−kct) A = A0Re u e (9.54)  172 QUANTUM THEORY

where A0 is a constant and u is a unit vector in the direction of A. Then the Lorenz condition requires that A = A Re k u ei(k·x−kct) = 0 (9.55) ∇ · 0 · which will be satisfied for all values of x if  k u = 0 (9.56) · In other words, the unit vector u must be perpendicular to the direction of propagation of the wave. The plane monochromatic wave described by these equations is said to be linearly polarized, with polarization vector u. For a fixed value of k, there are two linearly independent polarization vectors, u1 and u2, which are perpendicular to each other, and also perpendicular to k. For example. if the z-axis is chosen to be in the direction of k, then u1 and u2 can be chosen to be in the directions of the x-axis and y-axis respectively.

9.5 Spontaneous photon emission

Using the electron creation and annihilation operator notation introduced in Section 5.9 of this book, we can write the Hamiltonians of the electrons in the small system that we are studying and the Hamiltonian of the radiation field by itself in the form

H0 = Hr + He 1 1 1 = hck a† a + + Hc b†b + g b†b†b b (9.57) 2 k,λ k,λ 2 r,s r s 2 r,s|t,u r s t u k,λ r,s r,s,t,u X   X X We can also introduce a number of states which are eigenfunctions of H0:

H A = EA A 0| i | i H B = EB B 0| i | i H0 C = EC C |. i . .| i . . . (9.58) We would like to calculate the probability per unit time that the interaction Hamiltonian will induce a transition between two of these eigenstates, A and B , accompanied by the spontaneous emission of a photon. To do so, we start by| representingi | i B in the form: | i 1 † † B = bf biak,λ A (9.59) | i nk,λ + 1 | i In other words, B differs from A byp having an extra photon in the mode k, λ. and by | i | i the transition χi χf . We can check that B is properly normalized by calculating its scalar product with→ its adjunct state | i

1 † B = A ak.λbi bf (9.60) h | nk,λ + 1 h | p 9.6. PHOTON ABSORPTION 173

Making use of the relations

† † ak,λa A = (1 + a ak,λ) A = (1 + nk,λ) A (9.61) k,λ| i k,λ | i | i and

† bi bi A = νi A = A † | i | i | i bf b A = (1 νf ) A = A (9.62) f | i − | i | i

(where νi and νf are occupation numbers. equal to 1 or 0 depending on whether the spin-orbital is occupied or unoccupied) we obtain 1 + n B B = k,λ = 1 (9.63) 2 h | i ( 1 + nk,λ) so that B is properly normalized. Here nkp,λ is the number of photons in the mode k.λ in the initial| i state A , and since we are treating spontaneous photon emission, this number is zero. Letting | i

2π~c 3 ik·x ∗ Mr,s(k, λ) e d x e χ (x)αλχs(x) (9.64) ≡ − V k r r Z the transition matrix element of the interaction Hamiltonian then becomes:

† 0 0 † † 0 0 † † A Hi B = A b bs ak0,λ0 Mr,s(k , λ ) + a 0 0 M (k , λ ) b bia A h | | i h | r k ,λ r,s f k,λ| i r,s k0,λ0 X X n o † † = A brbsbf bi A Mr,s(k, λ) r,s h | | i X = Mi,f (k, λ) (9.65)

For a full derivation of the rate of spontaneous photon emission, see [Avery, 1976]. For cases where the transition from the excited state to the ground state is dipole-allowed, i.e. when

3 ∗ d x χ (x) x χf (x) = 0 (9.66) i 6 Z one finds excited state lifetimes of the order of 10−8 seconds, the decay of the excited state being due to spontaneous photon emission.

9.6 Photon absorption

Suppose that the initial state A of the electron-photon system contains nk,λ photons in the mode kλ, so that | i

† a ak,λ A = nk,λ A (9.67) k,λ | i | i 174 QUANTUM THEORY

Now let us consider a transition from this state to the state 1 † B = bf biak,λ A (9.68) | i √nk,λ | i which differs from A by having one fewer photon in the mode kλ, and by the transition | i i f. The factor 1/√nk,λ is needed to make B properly normalized. The matrix element of→ the interaction Hamiltonian between the two| i states is given by

1 † A Hi B = A Hi bf biak,λ A h | | i √nk,λ h | | | i

1 † 0 0 † † 0 0 † = A brbs ak0λ0 Mr,s(k , λ ) + ak0,λ0 Mr,s(k , λ ) bf biak,λ A nk,λ h | | i √ r,s k0,λ0 X X n o 1 † † = A ak,λak,λ A Mi,f (k, λ) √nk,λ h | | i † = √nk,λ Mi,f (k, λ) (9.69) where

† 2π~c 3 −ik·x ∗ M (k, λ) e d x e χ (x)αλχs(x) (9.70) r,s ≡ − V k r r Z Using Fermi’s “Golden Rule Number Two” from perturbation theory, we obtain the prob- ability per unit time for the transition A B : | i → | i 1 2π † 2 = nk,λ Mi,f (k, λ) ρ(B) (9.71) τ ~ | | where ρ(B) is the density of final states. The number of photons in the mode kλ is nk,λ/V , where V is the volume of normalization. Therefore, since the photons move with velocity c, the flux of photons in this mode is nk,λc/V . The criss-section σ for the absorption of a photon is defined the be the probability per unit time for absorption, divided by the flux. Therefore, in the non-relativistic limit, we obtain the cross-section 2 2 2 2 e ~ ~c −ik·x ∗ ∂ σ = (2π) dτ e χ u χf ρ(B) (9.72) c m c k i · ∂x ~   e  Z  

Here σ represents the cross-section for the absorption of a photon with wave number k and polarization vector u, accompanied by the 1-electron transition i f. Similarly, the cross.section for absorption of a left- or right-circularly polarized photon→ is given by 2 2 2 2 e ~ ~c −ik·x ∗ ∂ σ± = (2π) dτ e χ (u iu ) χf ρ(B) c m c 2k i 1 ± 2 · ∂x ~   e  Z  

(9.73)

If χf corresponds to a bound state with a finite lifetime Γ, the density of final states can be represented by the function ~Γ ρ(B) = 2 2 (9.74) π (EA EB) + (~Γ) { − } 9.7. PROBLEMS WITH FIELD THEORIES 175

0.30

0.25

0.20

0.15

0.10

0.05

-10 -5 5 10

Figure 9.1: In this figure, the density of final states ρ(B) is plotted as a function of ~Γ on a scale where EA EB = 1. | − | 9.7 Problems with field theories

Quantum electrodynamics was pioneered very largely by the great British physicist P.A.M. Dirac, and yet Dirac was dissatisfied with the theory as it stood because of logical inconsis- tencies which had to be overcome by “renormalization”, a procedure that Dirac regarded as a “stop-gap measure”. There is a remarkable contrast between the enormous calculational success of quantum electrodynamics, and the logical inconsistencies that worried Dirac. One serious problem with quantum field theories is the infinite self-energies that they predict. Another difficulty is connected with Haag’s theorem. In an article entitled “Haag’s Theorem as a Reason to Reconsider Direct-Action Theories”[Kastner, 2015a], R.E. Kastner writes: “Haag showed that the interacting field demands an inequivalent representation from that of the free field; the vacuum states of the two fields cannot be defined in the same representation. This result presents a serious problem for the basic mathematical consistency of quantum field theories, and has led to much discussion... Haag presented a more general and formal result in which the infinite degrees of freedom of the quantized field can be seen as the actual source of the problem.[203]”. In their famous 1949 paper, J.A. Wheeler and R.P. Feynman say “...action at a distance must have a close connection with field theory. But never does it consider the action of a charge on itself. The theory of direct interparticle action is equivalent, not to the usual field theory, but to a modified or adjunct field theory..” In this adjunct field theory, which incorporates the concept of direct interparticle interaction , the logical difficulties of the usual theory are removed, for example the difficulties connected with giving the electromagnetic field an infinite number of degrees of freedom, but the calculational power of the usual theory is retained. 176 QUANTUM THEORY

Figure 9.2: Paul Adrien Maurice Dirac (1902-1984). Although he was the prin- cipal founder of quantum electrodynamics, he was dissatisfied with the theory as it stood. Dirac once said that renormalization is just a stop gap procedure, and there had to occur a fundamental change in our ideas. 9.7. PROBLEMS WITH FIELD THEORIES 177

Figure 9.3: Rudolf Haag (1922-2016). He was the author of a theorem criticizing all field theories. In one form of Haag’s theorem, the root of the problem is seen to be the fact that field theories lead to an infinite number of degrees of freedom. Haag was awarded the Max Planck medal in 1970 and the Henri Poincar´ePrize of the International Association of Mathematical Physics in 1997. 178 QUANTUM THEORY Chapter 10

RESONANCE ENERGY TRANSFER

10.1 Introduction

The phenomenon of resonance energy transfer or “sensitized fluorescence” was discovered in 1922 by G. Cario and J. Franck [Cario and Franck, 1923]. They exposed a mixture of mercury and thallium vapors to a frequency of light which could only be absorbed by the mercury, and observed in the fluorescence a frequency which only the thallium could emit. An initial theoretical explanation of this phenomenon on the basis of classical cou- pled oscillator theory was put forward by J. Perrin in 1926. Later F. Perrin, Kallman and London, developed quantum mechanical theories to explain sensitized fluorescence. These theories were improved by Th. F¨orster[Scholes, 2003]. Much experimental work has also been done in this field, and interest is high today because resonance energy transfer has proved to be a very useful tool in nonoscience [Chou and Dennis, 2015], [Wegner, 2015], [Kusku and Akan, 2011] and biophysics [Avery, Bay and Szent-Gy¨orgyi,1961] [Pearlstein, 2002]. In the present chapter, we extend the early theories by considering relativistic effects. We replace the Coulomb interaction by a relativistic interaction, as was done in early pa- pers by one of us [Avery, 1966], [Avery, 1984]. Similar results were obtained by Gomberoff and Power [Gomberoff and Power, 1966]. More recently, very important theoretical work applying relativistic theory to resonance energy transfer was carried out by D.L. An- drews and his co.workers [Andrews and Demidov, 1999] [Juzeliunas and Andrews, 2000] [Anderson, 2008] [Daniels et al., 2003] [Bradshaw and Andrews, 2008]. The theory presented here is a quantum mechanical direct-interaction theory. In other words it does not involve photons, and thus it avoids problems such as the infinities asso- ciated self-interaction energies, and the consequences of Haag’s theorem that appear when photons are regarded as an infinite assembly of harmonic oscillators. In 1949, J.A. Wheeler and R.P. Feynman published a famous paper [Wheeler and Feynman, 1949], in which they showed that the classical behavior of a system of charged particles can be correctly ex- plained by a direct interparticle interaction principle. They were motivated by the desire

179 180 QUANTUM THEORY to avoid worrying infinities and other problems in the usual formulation of quantum electro- dynamics, [Kastner, 2015a], [Kastmer, 2015b], [Grundler, pdf], [Lazarovici, prepring, pdf], [Klaczynski, 2015]. The Wheeler-Feynman action principle is time-symmetric. In other words, it makes use of both retarded and advanced potentials in a symmetrical way.

10.2 Review of the Perrin-F¨orstertheory

In the Perrin-F¨orstertheory of resonance energy transfer (also called sensitized fluores- cence), one considers two atoms or molecules, located respectively at the points a1 and a2 so that they are separated by the distance R a1 a2. This distance is assumed to be large enough so that overlap between the electronic≡ wave− functions of the two systems can be neglected. If it were not for the perturbation due to the interaction of the electrons of the first system with those of the second, the product of the electronic wave functions of the two systems, ψr(1)ψs(2), would be a stationary state, i.e. an eigenfunction of the total Hamiltonian. However. there is a perturbation term due to the Coulomb interaction of the electrons on one system with those on the other, and this perturbation can induce a transition to another state, which we can denote by ψt(1)ψu(2). The perturbation matrix element is given by:

e2 H0 = dτ dτ ψ†(1)ψ†(2) ψ (1)ψ (2) (10.1) rs|tu 1 2 r s x x t u i,j i j Z Z X | − | Here xi indicates the position of electron i on system 1, while xj is the position of electron j in system 2. Expanding 1/ xi xj in a six-fold Taylor series about the centers of the | − | two systems, a1 and a2, one obtains for the first non-zero term the interaction between the transition dipole moments of the two systems. dr,t ds,u 3(dr,t R)(ds,u R) H0 e2 1 · 2 1 · 2 · (10.2) rs|tu ≈ R3 − R5   where

r,t † d dτ ψ (1)r ψt(1) 1 ≡ 1 r 1 Z s,u † d dτ ψ (2)r ψu(2) (10.3) 2 ≡ 2 s 2 Z and

r xi a 1 ≡ − 1 r xj a (10.4) 2 ≡ − 2 From equation (10.2), one can see that the perturbation inducing the transition is propor- tional to 1/R3. When first-order perturbation theory is used to calculate the transition 10.3. A RELATIVISTIC INTERACTION 181

probability, the perturbation matrix element is squared, and it is thus proportional to 6 1/R . In F¨orster’sversion of the theory, a distance R0 is defined to be the distance at which the probability for resonance transfer of electronic excitation energy between two systems is the same as the probability, 1/τ, of spontaneous photon emission by the initially excited system. Thus, F¨orsterwrote: 1 1 R 6 = 0 (10.5) τ τ R s→a   Typical values of R0 are of the order of magnitude 50 Angstr¨oms.˚

10.3 A relativistic interaction

It is interesting to try to extend the Perrin-F¨orstertheory by replacing the Coulomb interac- tion by its relativistic counterpart.[Avery, 1966], [Avery, 1984], [Andrews and Demidov, 1999]. We start by recalling the definition of the contravatiant anticommuting Dirac matrices:

1 0 0 0 0 1 0 0 γ0 = (10.6)  0 0 1 0  −  0 0 0 1   −   0 0 0 1  0 0 1 0 γ1 = (10.7)  0 1 0 0  −  1 0 0 0   −   0 0 0 i  0 0 i −0 γ2 = (10.8)  0 i 0 0   i 0 0 0   −   0 0 1 0  0 0 0 0 1 γ3 = (10.9)  1 0 0− 0  −  0 1 0 0    We next introduce the second-order electron-electron S-matrix[204] defined by

3 2 4 4 µ µ Srs|tu e d x d x ψ¯r(1)γ (1)ψs(1)G(1, 2)ψ¯t(2)γ (2)ψu(2) (10.10) ≡ 1 2 µ=0 X Z Z where d4x = d3xdt and ψ¯ ψ†γ0, and where G(1,2) is the Green’s function for the d’Alembertian operator: ≡ 1 eik·(x1−x2)−iω(t1−t2) G(1, 2) d4k (10.11) ≡ 4π3 k k (ω/c)2 i Z · − − 182 QUANTUM THEORY

and d4k = d3kdω. We now let

µ r,t i(ωr−ωt)t1 ecψ¯r(1)γ (1)ψt(1) j (x )e ≡ µ 1 µ s,u i(ωs−ωu)t2 ecψ¯s(2)γ (2)ψu(1) j (x )e (10.12) ≡ µ 2 Then [204]

Srs|tu 3 3 3 r,t i(ωr−ωt)t1 s,u i(ωs−ωu)t2 = d x1dt1 d x2dt2 jµ (x1)e G(1, 2)jµ (x2)e µ=0 X Z Z = 2πδ(ωr ωt + ωs ωu)Urs|tu (10.13) − − − where[204]

3 ik|x1−x2| 1 3 3 rt e su Urs|tu = d x d x j (x ) j (x ) −c2 1 2 µ 1 x x µ 2 µ=0 1 2 X Z Z | − | 1 3 d3x d3x jrt(x ) (1, 2)jsu(x ) (10.14) ≡ −c2 1 2 µ 1 G µ 2 µ=0 X Z Z and ω k rt (10.15) ≡ c 10.4 The Green’s function of the Helmholtz equation

The function 1 eik|x1−x2| e−ik|x1−x2| (1, 2) + (10.16) G ≡ 2 x x x x | 1 − 2| | 1 − 2|  is the Green’s function of the Helmholtz equation and it obeys the relationships [ 2 + k2] (1, 2) = [ 2 + k2] (1, 2) = 4πδ(x x ) (10.17) ∇1 G ∇2 G 1 − 2 The significance of this function is as follows: Suppose that the current 4-vector and the electromagnetic potential 4-vector are simple harmonic functions of time: ikct jµ(x, ict) = jµ(x)e µ = 1, 2, 3, 4 ikct Aµ(x, ict) = Aµ(x)e (10.18) In that case, the wave equation relating the potentials to the source current becomes

2 2 1 + k Aµ(x, ict) = jµ(x, ict) µ = 1, 2, 3, 4 (10.19) ∇ 4π Making use of the 3-dimensional symmetric form of Green’s theorem and equation (10.17), we have

3 Aµ(1) = d x 1, 2)jµ(2) (10.20) 2 G( Z 10.5. MATRIX ELEMENTS 183 10.5 Matrix elements

We wish to evaluate matrix elements of the form

3 ¯ µ 3 † 0 µ d x1ψr(1)γ (1)ψt(1) = d x1ψr(1)γ (1)γ (1)ψt(1) (10.21) Z Z where

γ0γ0 = I 0 1 γ γ = α1 0 2 γ γ = α2 0 3 γ γ = α3 (10.22) where 1 0 0 0 0 1 1 0 I = (10.23)  0 0 1 0   0 0 0 1    is the identity matrix, and where the components of the 3-vector α are given by

0 0 0 1 0 0 1 0 α = (10.24) 1  0 1 0 0   1 0 0 0      0 0 0 i 0 0 i −0 α = (10.25) 2  0 i 0 0  −  i 0 0 0      0 0 1 0 0 0 0 1 α = (10.26) 3  1 0 0− 0   0 1 0 0   −  The matrix elements of the Dirac operator α can be related to the transition dipole mo- ments by the commutation relation:

~c [H , d ] = α(j) j = 1, 2 (10.27) j j i

where Hj is the Dirac Hamiltonian operator of the jth electron.

~c ∂ 2 Hj = α(j) eA(xj) eφ(xj) + βjmc (10.28) · i ∂x − −  j  184 QUANTUM THEORY

Then

3 † † d x ψ (1) α(1) ψt(1) = dτ ψ (1) (H r r H ) ψt(1) 1 r 1 r 1 1 − 1 1 Z Z i(Er Et) † = − dτ1 ψr(1) r1 ψt(1) ~c Zr,t = i(ωr ωt)d (10.29) − 1 with a similar expression for j = 2. Finally, with the help of equation (E.43), we can obtain the relations:

† r,t dτ1 ψr(1) α(1) ψt(1) = ikd1 Z † s,u dτ ψ (2) α(2) ψu(2) = ikd (10.30) 2 s − 2 Z where

Er Et Es Eu k − − (10.31) ≡ ~c ≈ ~c The sign difference between the first and second parts of equation (10.30) comes from the fact that one system is losing excitation energy, while the other is gaining it.

10.6 Transition probability at macroscopic separations

When the sensitizer and the acceptor are separated by a macroscopic distance, the non- relativistic probability of resonance energy transfer between them becomes vanishingly small. However, in a relativistic calculation, the terms involving α lead to a transition probability which falls off only as 1/R2. At macroscopic sensitizer-acceptor separations, we can make the approximation

x x a a R (10.32) 1 − 2 ≈ 1 − 2 ≡ so that the Green’s function of the Helmholtz equation becomes

1 (eikR + e−ikR) cos(kR) (1, 2) (10.33) G ≈ 2 R ≡ R Then equation (10.14) becomes

3 1 cos(kR) 3 rt 3 su Urs|tu = d x j (x ) d x j (x ) (10.34) −c2 R 1 µ 1 2 µ 2 µ=0 X Z  Z  10.7. COMPARISON WITH SPONTANEOUS PHOTON EMISSION 185

From equations (10.12) and (10.22), we have

3 rt 3 † d x1 j0 (x1) = ec d x1 ψr(x1)I(1)ψt(x1) = 0 Z Z 3 rt 3 † d x1 j1 (x1) = ec d x1 ψr(x1)α1(1)ψt(x1) Z Z 3 rt 3 † d x1 j2 (x1) = ec d x1 ψr(x1)α2(1)ψt(x1) Z Z 3 rt 3 † d x1 j3 (x1) = ec d x1 ψr(x1)α3(1)ψt(x1) (10.35) Z Z With the help of equation (10.30) we then have

3 r,t 3 † d x1 jµ (x1) = ec d x1 ψr(x1)αµ(1)ψt(x1)

Z Z r,t = ikec d1,µ µ = 1, 2, 3 (10.36)

s,u The expressions for jµ (x2) are similar. Thus, at macroscopic sensitizer-acceptor separa- tions, we have

3 1 cos(kR) 3 rt 3 su Urs|tu = d x j (x ) d x j (x ) −c2 R 1 µ 1 2 µ 2 µ=0 X Z  Z  cos(kR) e2k2 dr,t ds,u (10.37) ≈ − 1 · 2 R Then

2 2 r,t s,u 2 2 (e k d1 d2 cos(kR)) Urs|tu · (10.38) | | ≈ R2 falls off only as 1/R2. Now suppose that the sensitizer is at the center of a large sphere of radius R, on the surface of which acceptors are uniformly distributed. Then the prob- ability of resonance energy transfer from the electronicly excited sensitizer to one or an- other of the acceptors will be independent of the radius of the sphere. This shows that the sensitizer can lose its excitation energy to acceptors over macroscopic distances. A much more detailed calculation leading to the prediction of resonance energy transfer over macroscopic distances has been carried through by G. Juzeliunas and D.L. Andrews [Andrews and Demidov, 1999].

10.7 Comparison with spontaneous photon emission

Is long-range resonance energy transfer a process that competes with spontaneous photon emission? Or is it another way of describing the same process? In the usual theory of spontaneous photon emission, the presence of acceptors is not needed. The spontaneous 186 QUANTUM THEORY

photon emission would take place even if the remainder of space were entirely empty. At the end of this article, we propose an experimental test. The presence of acceptors may also be involved in the recently observed violation of Planck’s law ([Thompsen et. al., 2018]). We are working on a longer paper which will explore these questions.

10.8 The Perrin-F¨orsterregion

We have, until now, been discussing the relativistic calculation of resonance energy transfer probabilities at very large sensitizer-acceptor separations, where xi xj R and kR 1. − ≈  Let us now turn to the region of F¨orster’s R0, where we have kR 1 and k xi xj 1. We must ask whether the relativistic calculation in the short-range limit| gives− us|  the Perrin-F¨orster’sresult, which has been confirmed by so many experiments. In this region,

e2(1 α(i) α(j))eik|xi−xj | e2(1 α(i) α(j)) − · − · x x x x i,j i j → i,j i j X | − | X | − | e2 e2α(i) α(j) = · (10.39) x x x x i,j i j − i,j i j X | − | X | − | The first term in the last line of equation (E.46) is that which gives rise to the Perrin- F¨orsterresults. Our task now is to find matrix elements of the perturbation Hamiltonian due to the second term, and to show that when kR 1, the contribution of this term is small compared to the contribution of the first term. Let

00 2 † † (α(1) α(2)) H1,2 e dτ1 dτ2 ψr(1)ψs(2) · ψt(1)ψu(2) (10.40) ≡ − xi xj Z Z | − | Making use of equation (E.41), we obtain for the leading non-zero term in the Taylor series 00 expansion of H in terms of 1/ xi xj : 1,2 | − | e2(ikdr,t) ( ikds,u) H00 1 · − 2 + ... 1,2 ≈ − R (dr,t) (ds,u) = e2k2 1 · 2 + ... (10.41) − R The ratio between this term and the leading Perrin-F¨orsterterm in equation (10.2) is

00 r,t s,u r,t s,u H , d d d d 1 2 e2k2 1 · 2 / e2 1 · 2 H0 ≈ − R R3 1,2     R3 = k2 = k2R2 (10.42) − R −   00 so that when kR 1, the term H1,2 can be neglected in comparison with the Perrin-F¨orster term. Thus, in this region, the relativistic treatment closely approximates the well-verified non-relativistic Perrin-F¨orsterresult shown in equations (10.2) and (10.5). 10.9. A PROPOSED EXPERIMENT 187 10.9 A proposed experiment

Figure 10.1: We propose the experiment illustrated in this figure for detecting resonance energy transfer over macroscopic distances. The central rectangle represents a glass slide, whose thickness (for example 0.2 cm.) is a macroscopic distance. On one side, the slide is coated with sensitizer molecules, while on the other side it is partially coated with acceptor molecules. If we illuminate the slide from the sensitizer side with a frequency of light which only the sensitizer molecules can absorb, and observe the fluorescence of the sensitizer from the same side, we can answer the question of whether resonance energy transfer can take place over macroscopic distances. If the transfer takes place, the presence of the acceptors on the opposite side of the slide will reduce the observed fluorescence of the sensitizer.

In the proposed experiment, sensitizer and acceptor molecules are chosen in such a way that the emission maximum of the sensitizers corresponds to the absorption maximum of the acceptors. Let ω1 be the frequency of light corresponding to the absorption maximum of the sensitizer molecules, while ω2 ω3 is the frequency corresponding both to the emission maximum of the sensitizers and≈ the absorption maximum of the acceptors. The slide is illuminated from the sensitizer side with light of frequency ω1, and radiation at the frequency ω2 is observed from the sensitizer side, both on portions of the slide which are coated with acceptors and portions where the acceptors are not present. If resonance energy transfer over macroscopic distances is taking place, the fluorescence of the sensitizer will be damped by the presence of acceptors on the opposite side of the slide.

Figure 10.2: This figure shows an alternative arrangement, where the 2mm. gap is empty. Above the gap is a thin glass slide partially coated with accepter molecules. Below the gap is another thin glass slide, completely coated with sensitizer molecules. 188 QUANTUM THEORY

Experiments on long-distance resonance energy transfer, for example [205] [207] [208], [209], [210], [211], have detected departures from the 1/R6 distance dependence predicted Perrin-F¨orstertheory, but until now only deviations in the nanometer range have been detected We propose the following set of measurements, which aim at the detection of resonance energy transfer over a macroscopic distance: A glass slide roughly 2 mm. in thickness (a macroscopic distance) is uniformly coated on one side with sensitizer molecules, as shown in Figure 1, while the opposite side of the slide is partially coated with acceptor molecules. The sensitizer and acceptor molecules are chosen in such a way that their absorption and emission bands lie in the visual range of the spectrum, a range in which the glass slide is transparent. Furthermore, the sensitizer and acceptor molecules should be chosen in such a way the the Stokes-shifted emission band of the sensitizers overlaps maximally with the absorption band of the acceptors. The slide is illuminated from the sensitizer side with a frequency of light, ω1, which only the sensitizers can absorb, and, from the same side, the emission is observed at a frequency, ω2, at which the sensitizer molecules fluoresce. If this fluorescence is damped on the portion of the slide coated with acceptors, resonance energy transfer over macroscopic distances will have been demonstrated. The trivial process, where a photon emitted by the sensitizer, is absorbed by the acceptor, Stokes-shifted, and then fluoresced, could explain an enhancement of intensity in the frequency region near the acceptor fluorescence maximum, but not the loss of intensity in the frequency region near the fluorescence maximum of of the sensitizer. No explanation besides resonance energy transfer can be devised to explain such a loss.

The arrangement shown in Figure 2 The measurements should be repeated with the experimental arrangement shown in Figure 2. In this case, it is possible to perform the following initial experiments for the purpose of calibration:

1. The following calibration experiments are performed on the slide with the acceptor molecules by itself: The slide is illuminated from the acceptor side with light of constant intensity I1, with frequencies varying over the entire relevant range. The intensity of emitted light, I2, is measured at various points along the length of the slide from the coated side, and the ratio I1/I2 is recorded, again with frequencies varying over the entire range.

2. Another calibration experiment can be performed on the slide coated with sensitizer molecules by itself: The slide is illuminated with a constant intensity, I0, from the glass side, with frequencies running over the entire range. The corresponding fluo- rescence intensities, I2, are measured from the sensitizer side, and the ratio I0/I2 is recorded, again with frequencies varying over the entire range. Finally, the fluores- cence intensity, I3, is measured from the glass side, and the ratio, I0/I3. is recorded at all frequency combinations. 10.10. DISCUSSION: DIRECT INTERPARTICLE INTERACTION 189

Having performed these initial calibration experiments, we are ready for the sensitized fluorescence experiment: The two slides are placed together with a 2 mm. gap, as shown in Figure 2. We illuminate the slide with constant intensity, I0, from the sensitizer side, with frequencies running over the entire range, and we record the fluorescence intensity, I3, from the sensitizer side, at various points on the slide, at frequencies combinations running over the entire range. If we see a loss of intensity of I3 at the frequency range corresponding to the fluorescence maximum of the sensitizer, localized at the points of the side which are nearest to the acceptor molecules, this can be interpreted as evidence of resonance energy transfer to the acceptors. If desired, the experiment can be simplified. The initial calibration experiments can be omitted if the absorption maximum of the sensitizers, ω1, is known. The two slides, arranged as shown in Figure 2 can be illuminated from the sensitizer side with intensity, I0, and frequency ω1. The fluorescence intensity, I3, can then be measured at all frequencies, at various points, from the sensitizer side. Loss of intensity in the frequency range corre- sponding to the fluorescence maximum of the sensitizer, at points on the slide where the sensitizers and acceptors are close, can still be interpreted as evidence of resonance energy transfer. To explain this loss of sensitizer fluorescence intensity, one has to imagine that before the sensitizer could fluoresce, it transferred its excitation energy to an acceptor. As mentioned above, the trivial process might be invoked to explain an enhancement in intensity in the frequency region near to the fluorescence maximum of the absorber, but it cannot explain a loss of intensity in the frequency region near the fluorescence maximum of the sensitizers in those portions of the slide where the sensitizers and acceptors are close together. If observed, such a loss could only be explained by resonance energy transfer.

10.10 Discussion: Direct interparticle interaction

Despite the great successes of quantum field theories, they contain some logical incon- sistencies. P.A.M. Dirac himself, the principal founder of quantum electrodynamics, was dissatisfied with the theory as it stood because of these problems. Dirac once said that renormalization is just a stop gap procedure, and there had to occur a fundamental change in our ideas. One serious problem with quantum field theories is the infinite self-energies that they predict. Another difficulty is connected with Haag’s theorem. In an article entitled “Haag’s Theorem as a Reason to Reconsider Direct-Action Theories”[Kastner, 2015a], R.E. Kastner writes: “Haag showed that the interacting field demands an inequivalent representation from that of the free field; the vacuum states of the two fields cannot be defined in the same representation. This result presents a serious problem for the basic mathematical consistency of quantum field theories, and has led to much discussion... Haag presented a more general and formal result in which the infinite degrees of freedom of the quantized field can be seen as the actual source of the problem.[203]”. In their famous 1949 paper, J.A. Wheeler and R.P. Feynman say “...action at a distance must have a close connection with field theory. But never does it consider the action 190 QUANTUM THEORY of a charge on itself. The theory of direct interparticle action is equivalent, not to the usual field theory, but to a modified or adjunct field theory..” In this adjunct field theory, which incorporates the concept of direct interparticle interaction , the logical difficulties of the usual theory are removed, for example the difficulties connected with giving the electromagnetic field an infinite number of degrees of freedom, but the calculational power of the usual theory is retained.

10.11 Feynman’s first seminar at Princeton

After graduating from MIT with a B.Sc. in physics (having published two papers while still an undergraduate), Richard Feynman entered Princeton University as a graduate student. Here is Wikipedia’s account of the first seminar that he gave there:

“Attendees at Feynman’s first seminar, which was on the classical version of the Wheeler- Feynman absorber theory, included Albert Einstein, Wolfgang Pauli, and John von Neu- mann. Pauli made the prescient comment that the theory would be extremely difficult to quantize, and Einstein said that one might try to apply this method to gravity in general relativity, which Sir Fred Hoyle and Jayant Narlikar did much later as the Hoyle-Narlikar theory of gravity. Feynman received a Ph.D. from Princeton in 1942; his thesis advisor was . His doctoral thesis was titled “The Principle of Least Ac- tion in Quantum Mechanics”. Feynman had applied the principle of stationary action to problems of quantum mechanics, inspired by a desire to quantize the Wheeler-Feynman absorber theory of electrodynamics, and laid the groundwork for the path integral formu- lation and Feynman diagrams. A key insight was that positrons behaved like electrons moving backwards in time.” James Gleick wrote: “This was Richard Feynman nearing the crest of his powers. At twenty-three ... there may now have been no physicist on earth who could match his exu- berant command over the native materials of theoretical science. It was not just a facility at mathematics (though it had become clear ... that the mathematical machinery emerg- ing in the Wheeler-Feynman collaboration was beyond Wheeler’s own ability). Feynman seemed to possess a frightening ease with the substance behind the equations, like Einstein at the same age, like the Soviet physicist - but few others.” 10.11. FEYNMAN’S FIRST SEMINAR AT PRINCETON 191

Figure 10.3: Richard Feynman (1918-1988). Feynman shared the 1965 Nobel Prize in Physics for his contributions to the development of quantum elec- trodynamics. He was an inspiring and popular lecturer, much loved by his students. Many of his lectures are available on the Internet. 192 QUANTUM THEORY 10.12 The absorber theory of radiation

Wheeler and Feynman’s 1949 paper Here are some quotations from this famous paper:

“We... propose here to go back to the great basic problem of classical physics the motion of a system of charged particles under the forces and to inquire influence of electromagnetic what description of the interactions and motions is possible’which is at the same time (1) well defined (2) economical in postulates and (3) in agreement with experience. “We conclude that these requirements are satisfied by the theory of action at a distance of Schwarzschild, Tetrode, and Fokker. In this description of nature no direct use is made of the notion of field. Each particle moves in compliance with the principle of stationary action:

µ 1/2 eaeb µ ν J = mac ( daµda ) + δ (abµab )(daνdb ) = extremum − − c a a

“However unfamiliar this direct interparticle treatment compared to the electrodynamics of Maxwell and Lorentz, it deals with the same problems, talks about the same charges, considers the interaction of the same current elements, obtains the same capacities, predicts the same inductances and yields the same physical conclusions. Consequently action at a distance must have a close connection with field theory. But never does it consider the action of a charge on itself. The theory of direct interparticle action is equivalent, not to the usual field theory, but to a modified or adjunct field theory...”

10.13 Karl Schwarzschild’s contributions to physics

Here are some quotations from the Wikipedia article about Karl Schwarzschild:

“Schwarzschild provided the first exact solution to the Einstein field equa- tions of general relativity, for the limited case of a single spherical non-rotating mass, which he accomplished in 1915, the same year that Einstein first in- troduced general relativity. The Schwarzschild solution, which makes use of Schwarzschild coordinates and the Schwarzschild metric, leads to a derivation of the Schwarzschild radius, which is the size of the event horizon of a non- rotating black hole... “Thousands of dissertations, articles, and books have since been devoted to the study of Schwarzschild’s solutions to the Einstein field equations. How- ever, although Schwarzschild’s best known work lies in the area of general 10.13. KARL SCHWARZSCHILD’S CONTRIBUTIONS TO PHYSICS 193

relativity, his research interests were extremely broad, including work in ce- lestial mechanics, observational stellar photometry, quantum mechanics, in- strumental astronomy, stellar structure, stellar statistics, Halley’s comet, and spectroscopy... “He also introduced a field free variational formulation of electrodynamics (also known as “action at distance” or “direct interparticle action”) based only on the world line of particles as 1 S = m ds + q q δ ( P P ) ds ds (10.44) i i 2 i j i j i j i Ci CiCj || || X Z ZZ

where Cα are the world lines of the particle, dsa the (vectorial) arc element along the world line. Two points on two world lines contribute to the Lagrangian (are coupled) only if they are a zero Minkowskian distance (connected by a light ray), hence the term δ ( PiPj ). The idea was further developed by Tetrode and Fokker in the 1920s and|| Wheeler|| and Feynman in the 1940s and constitutes an alternative/equivalent formulation of electrodynamics. 194 QUANTUM THEORY

Figure 10.4: Karl Schwartzschild (1873-1916). He showed his exceptional abili- ties early by publishing two papers on binary orbits in astronomy at the age of 16. Bibliography

[Cario and Franck, 1923] G. Cario and J, Frank, Z. Phys. 17, 202, (1923)

[Avery, Bay and Szent-Gy¨orgyi,1961] J.S. Avery, Z. Bay and A. Szent-Gy¨orgyi, On the Energy Transfer in Biological Systems, Proc. Nat. Acad. Sci. (US), 47, 1742-1744, (1961)

[Avery, 1966] J.S. Avery, Resonance energy transfer and spontaneous photon emission, Proc. Phys. Soc. (London) 88, Part 1, (1966)

[Avery, 1984] J.S. Avery, Use of the S-Matrix in the Relativistic Treatment of Resonance Energy Reansfer. Int. J. Quantum Chemistry, XXV, 79-96, (1984)

[Gomberoff and Power, 1966] L. Gomberoff and E.A. Power, The resonance transfer of excitation, Proc. Pys. Soc. 88, 281. (1966)

[Scholes, 2003] G.D. Scholes, Long-Range Resonance Energy Transfer in Molecular Sys- tems, Annu. Rev. Phys. Chem. 54, 57-87, (2003)

[Chou and Dennis, 2015] K.F. Chou and A.M. Dennis, F¨orsterResonance Energy Transfer Between Quantum Dot Donors and Quantum Dot Acceptors, Sensors, 2015, 15, 13288-13325, (2015)

[Wegner, 2015] K-D. Wegner, F¨orsterResonance Energy Transfer From Terbium Com- plexes to Quantum Dots for Multiplexed Immunoassays and Molecular Rulers, Doc- toral Thesis, Universit´eParis-Sud, (2015)

[Kusku and Akan, 2011] M. Kusku and O.B. Akan, A Nanoscale Communication Chan- nel with Fluorescence Resonance Energy Transfer (FRET), 1st IEEE International Workshop on Molecular and Nano Scale Communication (MoNaCom), Published in: IEEE Conference on Computer Communications Workshops (INFOCOM WK- SHPS), (2011)

[Pearlstein, 2002] R.M. Pearlstein, Photosynthetic Exciton Transfer in the 1960’s, Photo- synthesis Research 73, 119-126, (2002)

[Andrews and Demidov, 1999] D.L. Andrews and A.A. Demidov, Resonance Energy Transfer, (Wiley, 1999)

195 196 BIBLIOGRAPHY

[Juzeliunas and Andrews, 2000] G. Juzeliunas and D.L. Andrews, Quantum Electrodynam- ics of Resonance Energy Transfer, Adv. Chem. Physics, 112, (2000) [Anderson, 2008] D.L. Anderson, Editor, Encyclopedia of Applied Spectroscopy, Wiley, (2008) [Daniels et al., 2003] G.J. Daniels, R.G. Jenkens, D.S. Bradshaw, and D.L. Andrews, Res- onance Energy Transfer: The Unified Theory Revisited, J. Chem. Phys. 119, 2264- 2274, (2003) [Bradshaw and Andrews, 2008] D.S. Bradshaw and D.L. Andrews, Optically controlled res- onance energy transfer: Mechanisms and configuration for all-optical switching, J. Chem. Phys. 128, 144506, (2008) [Wheeler and Feynman, 1949] J.A. Wheeler and R.P. Feynman, Classical Electrodynamics in Terms of Direct Interparticle Interaction, Rev. Mod. Phys. 21, 3, 425-433, (1949) [Kastner, 2015a] R.E. Kastner, Haag’s Theorem as a Reason to Reconsider Direct-Action Theories, International Journal of Quantum Foundations, 1, (2), 56-64, (2015) (https://arxiv.org/pdf/1502.03814.pdf) [Kastmer, 2015b] R.E. Kastner, The Transactional Interpretation of Quantum Mechanics: The Reality of Possibility (Cambridge University Press, 2012) [Grundler, pdf] G. Gr¨undler, Remarks on Wheeler-Feynman Absorber Theory (https://arxiv.org/pdf/1501.03516.pdf) [Lazarovici, prepring, pdf] D. Lazarovici, Against Fields, (preprint, pdf availabe online), [Klaczynski, 2015] L. Klaczynski, Haag’s Theorem in Renormalisable Quantum Field Theories, Doctoral Thesis at Humboldt-Universit¨at zu Berlin, (2015), (available on line at http://www2.mathematik.hu-berlin.de/ kreimer/wp- content/uploads/PhDLutz.pdf) [Thompsen et. al., 2018] D. Thompsen et al. Hundred-fold enhancement in far-field radia- tive heat transfer over the blackbody limit, Nature, 3 September, 2018. [1] R. Haag, On quantum field theories, Matematisk-fysiske Meddelelser, 29, 12, 1955. [2] A.I. Akhiezer and V.B. Berestetski, Quantum Electrodynamics, Interscience, New York, 1965. [3] R.S. Swathi and K.L. Sebastian, Distance dependance of fluorescence resonance energy transfer, J. Chem. Soc. 121, 777-787, (2009) [4] R.S. Swathi and K.L. Sebastian, Long range resonance energy transfer from a dye molecule to graphene has (distance)-4 dependence, J. Chem Phys. 130, 086101, (2009). BIBLIOGRAPHY 197

[5] L.I. Rangel-Zamudio, D.S. Rushforth, and R.P. Van Duyne, Long-Range Reso- nance Energy Transfer from Aromatic Hydrocarbons to the Anion Radical of Tetra- cyanoquinodimethane, J. Phys. Chem. 90, 807, (1986)

[6] R. Deshmukh et al., Long-Range Resonant Energy Transfer Using Optical Topolog- ical Transitions in Metamaterials, ACS Photonics, 5, 2737-2741, (2018)

[7] K. Akulov et al., Long-Distance Resonant Energy Transfer Mediated by Hybrid Plasmonic-Photonic Modes, J. Phys. Chem. C., just-accepted manuscript, 18 June, (2018).

[8] D. Martin-Cano et al., Resonance Energy Transfer and Superradiance Mediated by Plasmonic Nanowaveguides, Nano Lett. 10 (8), 3129-3134, (2010).

[9] City College of New York, Team study breaks Forster resonant energy transfer (FRET) distance limit, Physics, Optics and Photonics, Oct. 25, (2018).

Suggestions for further reading 1. G. Cario and J, Frank, Z. Phys. 17, 202, (1923) 2. J.S. Avery, Z. Bay and A. Szent-Gy¨orgyi, On the Energy Transfer in Biological Sys- tems, Proc. Nat. Acad. Sci. (US), 47, 1742-1744, (1961) 3. J.S. Avery, Resonance energy transfer and spontaneous photon emission, Proc. Phys. Soc. (London) 88, Part 1, (1966) 4. J.S. Avery, Use of the S-Matrix in the Relativistic Treatment of Resonance Energy Transfer. Int. J. Quantum Chemistry, XXV, 79-96, (1984) 5. L. Gomberoff and E.A. Power, The resonance transfer of excitation, Proc. Pys. Soc. 88, 281. (1966) 6. G.D. Scholes, Long-Range Resonance Energy Transfer in Molecular Systems, Annu. Rev. Phys. Chem. 54, 57-87, (2003) 7. K.F. Chou and A.M. Dennis, F¨orsterResonance Energy Transfer Between Quantum Dot Donors and Quantum Dot Acceptors, Sensors, 2015, 15, 13288-13325, (2015) 8. K-D. Wegner, F¨orsterResonance Energy Transfer From Terbium Complexes to Quan- tum Dots for Multiplexed Immunoassays and Molecular Rulers, Doctoral Thesis, Uni- versit´eParis-Sud, (2015) 9. M. Kusku and O.B. Akan, A Nanoscale Communication Channel with Fluorescence Resonance Energy Transfer (FRET), 1st IEEE International Workshop on Molecular and Nano Scale Communication (MoNaCom), Published in: IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), (2011) 10. R.M. Pearlstein, Photosynthetic Exciton Transfer in the 1960’s, Photosynthesis Re- search 73, 119-126, (2002) 11. D.L. Andrews and A.A. Demidov, Resonance Energy Transfer, (Wiley, 1999) 12. G. Juzeliunas and D.L. Andrews, Quantum Electrodynamics of Resonance Energy Transfer, Adv. Chem. Physics, 112, (2000) 198 BIBLIOGRAPHY

13. D.L. Anderson, Editor, Encyclopedia of Applied Spectroscopy, Wiley, (2008) 14. G.J. Daniels, R.G. Jenkens, D.S. Bradshaw, and D.L. Andrews, Resonance Energy Transfer: The Unified Theory Revisited, J. Chem. Phys. 119, 2264-2274, (2003) 15. D.S. Bradshaw and D.L. Andrews, Optically controlled resonance energy transfer: Mechanisms and configuration for all-optical switching, J. Chem. Phys. 128, 144506, (2008) 16. J.A. Wheeler and R.P. Feynman, Classical Electrodynamics in Terms of Direct In- terparticle Interaction, Rev. Mod. Phys. 21, 3, 425-433, (1949) 17. R.E. Kastner, Haag’s Theorem as a Reason to Reconsider Direct-Action Theories, In- ternational Journal of Quantum Foundations, 1, (2), 56-64, (2015) (https://arxiv.org/pdf/1502.03814.pdf) 18. R.E. Kastner, The Transactional Interpretation of Quantum Mechanics: The Reality of Possibility (Cambridge University Press, 2012) 19. G. Gr¨undler, Remarks on Wheeler-Feynman Absorber Theory (https://arxiv.org/pdf/1501.03516.pdf) 20. D. Lazarovici, Against Fields, (preprint, pdf availabe online), 21. L. Klaczynski, Haag’s Theorem in Renormalisable Quantum Field Theories, Doctoral Thesis at Humboldt-Universit¨atzu Berlin, (2015), (available on line at http://www2.mathematik.hu- berlin.de/ kreimer/wp-content/uploads/PhDLutz.pdf) 22. D. Thompsen et al. Hundred-fold enhancement in far-field radiative heat transfer over the blackbody limit, Nature, 3 September, 2018. 23. R. Haag, On quantum field theories, Matematisk-fysiske Meddelelser, 29, 12, 1955. 24. A.I. Akhiezer and V.B. Berestetski, Quantum Electrodynamics, Interscience, New York, 1965. 25. R.S. Swathi and K.L. Sebastian, Distance dependance of fluorescence resonance en- ergy transfer, J. Chem. Soc. 121, 777-787, (2009) 26. R.S. Swathi and K.L. Sebastian, Long range resonance energy transfer from a dye molecule to graphene has (distance)-4 dependence, J. Chem Phys. 130, 086101, (2009). 27. L.I. Rangel-Zamudio, D.S. Rushforth, and R.P. Van Duyne, Long-Range Resonance Energy Transfer from Aromatic Hydrocarbons to the Anion Radical of Tetracyanoquin- odimethane, J. Phys. Chem. 90, 807, (1986) 28. R. Deshmukh et al., Long-Range Resonant Energy Transfer Using Optical Topological Transitions in Metamaterials, ACS Photonics, 5, 2737-2741, (2018) 29. K. Akulov et al., Long-Distance Resonant Energy Transfer Mediated by Hybrid Plasmonic- Photonic Modes, J. Phys. Chem. C., just-accepted manuscript, 18 June, (2018). 30. D. Martin-Cano et al., Resonance Energy Transfer and Superradiance Mediated by Plasmonic Nanowaveguides, Nano Lett. 10 (8), 3129-3134, (2010). 31. City College of New York, Team study breaks Forster resonant energy transfer (FRET) distance limit, Physics, Optics and Photonics, Oct. 25, (2018). Appendix A

THE HISTORY OF COMPUTERS

A.1 Pascal and Leibniz

If civilization survives, historians in the distant future will undoubtedly regard the inven- tion of computers as one of the most important steps in human cultural evolution - as important as the invention of writing or the invention of printing. The possibilities of ar- tificial intelligence have barely begun to be explored, but already the impact of computers on society is enormous. The invention of transisters was a crucial step in the history of computers, and this invention in turn depended on the development of quantum theory. Thus quantum theory, despite its highly abstract nature, has had an enormous impact on the modern world. The first programmable universal computers were completed in the mid-1940’s; but they had their roots in the much earlier ideas of Blaise Pascal (1623-1662), Gottfried Wilhelm Leibniz (1646-1716), Joseph Marie Jacquard (1752-1834) and Charles Babbage (1791-1871). In 1642, the distinguished French mathematician and philosopher Blaise Pascal com- pleted a working model of a machine for adding and subtracting. According to tradition, the idea for his “calculating box” came to Pascal when, as a young man of 17, he sat thinking of ways to help his father (who was a tax collector). In describing his machine, Pascal wrote: “I submit to the public a small machine of my own invention, by means of which you alone may, without any effort, perform all the operations of arithmetic, and may be relieved of the work which has often times fatigued your spirit when you have worked with the counters or with the pen.” Pascal’s machine worked by means of toothed wheels. It was much improved by Leibniz, who constructed a mechanical calculator which, besides adding and subtracting, could also multiply and divide. His first machine was completed in 1671; and Leibniz’ description of it, written in Latin, is preserved in the Royal Library at Hanover: “There are two parts of the machine, one designed for addition (and subtraction), and the other designed for multiplication (and division); and they should fit together. The adding (and subtracting) machine coincides completely with the calculating box of Pascal. Something, however,

199 200 QUANTUM THEORY

Figure A.1: Blaise Pascal (1623-1662) was a French mathematician, physicist, writer, inventor and theologian. Pascal, a child prodigy, was educated by his father, who was a tax-collector. He invented his calculating box to make his father’s work less tedious. A.2. JACQUARD AND BABBAGE 201

Figure A.2: The German mathematician, philosopher and universal genius Got- tfried Wilhelm von Leibniz (1646-1716) was a contemporary of . He invented differential and integral calculus independently, just as Newton had done many years earlier. However, Newton had not published his work on calculus, and thus a bitter controversy over priority was precipitated. When his patron, the Elector of Hanover moved to England to become George I, Leibniz was left behind because the Elector feared that the controversy would alienate the English. Leibniz extended Pascal’s calculating box so that it could perform multiplication and division. Calculators of his design were still being used in the 1960’s. must be added for the sake of multiplication...” “The wheels which represent the multiplicand are all of the same size, equal to that of the wheels of addition, and are also provided with ten teeth which, however, are movable so that at one time there should protrude 5, at another 6 teeth, etc., according to whether the multiplicand is to be represented five times or six times, etc.” “For example, the multiplicand 365 consists of three digits, 3, 6, and 5. Hence the same number of wheels is to be used. On these wheels, the multiplicand will be set if from the right wheel there protrude 5 teeth, from the middle wheel 6, and from the left wheel 3.”

A.2 Jacquard and Babbage

By 1810, calculating machines based on Leibniz’ design were being manufactured commer- cially; and mechanical calculators of a similar (if much improved) design could be found in laboratories and offices until the 1960’s. The idea of a programmable universal computer is due to the English mathematician, Charles Babbage, who was the Lucasian Professor of 202 QUANTUM THEORY

Figure A.3: Charles Babbage (1791-1871) and his analytical engine.

Mathematics at Cambridge University. (In the 17th century, Isaac Newton held this post, and in the 20th century, P.A.M. Dirac and Stephen Hawking also held it.) In 1812, Babbage conceived the idea of constructing a machine which could automat- ically produce tables of functions, provided that the functions could be approximated by polynomials. He constructed a small machine, which was able to calculate tables of quadratic functions to eight decimal places, and in 1832 he demonstrated this machine to the Royal Society and to representatives of the British government. The demonstration was so successful that Babbage secured financial support for the construction of a large machine which would tabulate sixth-order polynomials to twenty decimal places. The large machine was never completed, and twenty years later, after having spent seventeen thousand pounds on the project, the British government withdrew its support. The reason why Babbage’s large machine was never finished can be understood from the following account by Lord Moulton of a visit to the mathematician’s laboratory: “One of the sad memories of my life is a visit to the celebrated mathematician and inventor, Mr. Babbage. He was far advanced in age, but his mind was still as vigorous as ever. He took me through his workrooms.” “In the first room I saw the parts of the original Calculating Machine, which had been shown in an incomplete state many years before, and had even been put to some use. I asked him about its present form. ‘I have not finished it, because in working at it, I came on the idea of my Analytical Machine, which would do all that it was capable of doing, and much more. Indeed, the idea was so much simpler that it would have taken more work to complete the Calculating Machine than to design and construct the other in its entirety; so I turned my attention to the Analytical Machine.’” “After a few minutes talk, we went into the next workroom, where he showed me the working of the elements of the Analytical Machine. I asked if I could see it. ‘I have never completed it,’ he said, ‘because I hit upon the idea of doing the same thing by a different and far more effective method, and this rendered it useless to proceed on the old lines.’” A.2. JACQUARD AND BABBAGE 203

Figure A.4: Joseph Marie Jacquard (1752-1834) invented a loom which could be programed to produce any design by means of punched cards. News of his invention inspired Babbage to invent a universal programmable computing machine.

“Then we went into a third room. There lay scattered bits of mechanism, but I saw no trace of any working machine. Very cautiously I approached the subject, and received the dreaded answer: ‘It is not constructed yet, but I am working at it, and will take less time to construct it altogether than it would have taken to complete the Analytical Machine from the stage in which I left it.’ I took leave of the old man with a heavy heart.” Babbage’s first calculating machine was a special-purpose mechanical computer, de- signed to tabulate polynomial functions; and he abandoned this design because he had hit on the idea of a universal programmable computer. Several years earlier, the French inventor Joseph Marie Jacquard had constructed an automatic loom in which large wooden “punched cards” were used to control the warp threads. Inspired by Jacquard’s invention, Babbage planned to use punched cards to program his universal computer. (Jacquard’s looms could be programmed to weave extremely complex patterns: A portrait of the in- ventor, woven on one of his looms in Lyon, hung in Babbage’s drawing room.) One of Babbage’s frequent visitors was Augusta Ada1, Countess of Lovelace (1815- 1852), the daughter of Lord and Lady Byron. She was a mathematician of considerable abil- ity, and it is through her lucid descriptions that we know how Babbage’s never-completed Analytical Machine was to have worked.

1 The programming language ADA is named after her. 204 QUANTUM THEORY

Figure A.5: Jacquard’s loom. A.2. JACQUARD AND BABBAGE 205

Figure A.6: Lord Byron’s daughter, Augusta Ada, Countess of Lovelace (1815- 1852) was an accomplished mathematician and a frequent visitor to Babbage’s workshop. It is through her lucid description of his ideas that we know how Babbage’s universal calculating machine was to have worked. The program- ming language ADA is named after her. 206 QUANTUM THEORY A.3 Harvard’s sequence-controlled calculator

The next step towards modern computers was taken by Herman Hollerith, a statistician working for the United States Bureau of the Census. He invented electromechanical ma- chines for reading and sorting data punched onto cards. Hollerith’s machines were used to analyze the data from the 1890 United States Census. Because the Census Bureau was a very limited market, Hollerith branched out and began to manufacture similar machines for use in business and administration. His company was later bought out by Thomas J. Watson, who changed its name to International Business Machines. In 1937, Howard Aiken, of Harvard University, became interested in combining Bab- bage’s ideas with some of the techniques which had developed from Hollerith’s punched card machines. He approached the International Business Machine Corporation, the largest manufacturer of punched card equipment, with a proposal for the construction of a large, automatic, programmable calculating machine. Aiken’s machine, the Automatic Sequence Controlled Calculator (ASCC), was com- pleted in 1944 and presented to Harvard University. Based on geared wheels, in the Pascal-Leibniz-Babbage tradition, ASCC had more than three quarters of a million parts and used 500 miles of wire. ASCC was unbelievably slow by modern standards - it took three-tenths of a second to perform an addition - but it was one of the first programmable general-purpose digital computers ever completed. It remained in continuous use, day and night, for fifteen years.

Figure A.7: The Automatic Sequence-Controlled Calculator ASCC can still be seen by visitors at Harvard’s science building and cafeteria. A.4. THE FIRST ELECTRONIC COMPUTERS 207 A.4 The first electronic computers

In the ASCC, binary numbers were represented by relays, which could be either on or off. The on position represented 1, while the off position represented 0, these being the only two digits required to represent numbers in the binary (base 2) system. Electromechanical calculators similar to ASCC were developed independently by Konrad Zuse in Germany and by George R. Stibitz at the Bell Telephone Laboratory.

Electronic digital computers In 1937, the English mathematician A.M. Turing published an important article in the Proceedings of the London Mathematical Society in which envisioned a type of calculating machine consisting of a long row of cells (the “tape”), a reading and writing head, and a set of instructions specifying the way in which the head should move the tape and modify the state and “color” of the cells on the tape. According to a hypothesis which came to be known as the “Church-Turing hypothesis”, the type of computer proposed by Turing was capable of performing every possible type of calculation. In other words, the Turing machine could function as a universal computer. In 1943, a group of English engineers, inspired by the ideas of Alan Turing and those of the mathematician M.H.A. Newman, completed the electronic digital computer Colossus. Colossus was the first large-scale electronic computer. It was used to break the German Enigma code; and it thus affected the course of World War II. In 1946, ENIAC (Electronic Numerical Integrator and Calculator) became operational. This general-purpose computer, designed by J.P. Eckert and J.W. Mauchley of the Uni- versity of Pennsylvania, contained 18,000 vacuum tubes, one or another of which was often out of order. However, during the periods when all its vacuum tubes were working, an electronic computer like Colossus or ENIAC could shoot ahead of an electromechanical machine (such as ASCC) like a hare outdistancing a tortoise. During the summer of 1946, a course on “The Theory and Techniques of Electronic Digital Computers” was given at the University of Pennsylvania. The ideas put forward in this course had been worked out by a group of mathematicians and engineers headed by J.P. Eckert, J.W. Mauchley and John von Neumann, and these ideas very much influenced all subsequent computer design.

Cybernetics The word “Cybernetics”, was coined by the American mathematician Norbert Wiener (1894-1964) and his colleagues, who defined it as “the entire field of control and commu- nication theory, whether in the machine or in the animal”. Wiener derived the word from the Greek term for “steersman”. Norbert Wiener began life as a child prodigy: He entered Tufts University at the age of 11 and received his Ph.D. from Harvard at 19. He later became a professor of math- ematics at the Massachusetts Institute of Technology. In 1940, with war on the horizon, 208 QUANTUM THEORY

Figure A.8: Alan Turing (1912-1954). He is considered to be the father of theoretical computer science. During World War II, Turing’s work allowed the allies to crack the German’s code. This appreciably shortened the length of the war in Europe, and saved many lives.

Figure A.9: John von Neumann (1903-1957, right) with J. Robert Oppenheimer. In the background is an electronic digital computer. A.4. THE FIRST ELECTRONIC COMPUTERS 209

Figure A.10: MIT’s Norbert Wiener (1894-1964) coined the word “Cybernetics”, derived from a Greek word meaning “steersman”. Wiener was one of the principle organizers of the Macy Conferences.

Wiener sent a memorandum to , another MIT professor who had done pi- oneering work with analogue computers, and had afterwards become the chairman of the U.S. National Defense Research Committee. Wiener’s memorandum urged the American government to support the design and construction of electronic digital computers, which would make use of binary numbers, vacuum tubes, and rapid memories. In such machines, the memorandum emphasized, no human intervention should be required except when data was to be read into or out of the machine. Like Leo Szilard, John von Neumann, Claude Shannon and Erwin Schr¨odinger,Norbert Wiener was aware of the relation between information and entropy. In his 1948 book Cy- bernetics he wrote: “...we had to develop a statistical theory of the amount of information, in which the unit amount of information was that transmitted by a single decision between equally probable alternatives. This idea occurred at about the same time to several writers, among them the statistician R.A. Fisher, Dr. Shannon of Bell Telephone Laboratories, and the author. Fisher’s motive in studying this subject is to be found in classical statistical theory; that of Shannon in the problem of coding information; and that of the author in the problem of noise and message in electrical filters... The notion of the amount of in- formation attaches itself very naturally to a classical notion in statistical mechanics: that of entropy. Just as the amount of information in a system is a measure of its degree of organization, so the entropy of a system is a measure of its degree of disorganization; and the one is simply the negative of the other.” During World War II, Norbert Wiener developed automatic systems for control of anti- aircraft guns. His systems made use of feedback loops closely analogous to those with which animals coordinate their movements. In the early 1940’s, he was invited to attend a 210 QUANTUM THEORY

Figure A.11: Margaret Mead (1901-1978) and Gregory Bateson (1904-1980). They used the feedback loops studied by Wiener to explain many aspects of human behavior. Bateson is considered to be one of the main founders of the discipline Biosemiotics, which considers information to be the central feature of living organisms. series of monthly dinner parties organized by Arturo Rosenbluth, a professor of physiology at Harvard University. The purpose of these dinners was to promote discussions and collaborations between scientists belonging to different disciplines. The discussions which took place at these dinners made both Wiener and Rosenbluth aware of the relatedness of a set of problems that included homeostasis and feedback in biology, communication and control mechanisms in neurophysiology, social communication among animals (or humans), and control and communication involving machines. Wiener and Rosenbluth therefore tried to bring together workers in the relevant fields to try to develop common terminology and methods. Among the many people whom they contacted were the anthropologists Gregory Bateson and Margaret Mead, Howard Aiken (the designer of the Automatic Sequence Controlled Calculator), and the mathematician John von Neumann. The Josiah Macy Jr. Foundation sponsored a series of ten yearly A.5. BIOSEMIOTICS 211 meetings, which continued until 1949 and which established cybernetics as a new research discipline. It united areas of mathematics, engineering, biology, and sociology which had previously been considered unrelated. Among the most important participants (in addition to Wiener, Rosenbluth, Bateson, Mead, and von Neumann) were Heinz von Foerster, Kurt Lewin, Warren McCulloch and Walter Pitts. The Macy conferences were small and infor- mal, with an emphasis on discussion as opposed to the presentation of formal papers. A stenographic record of the last five conferences has been published, edited by von Foerster. Transcripts of the discussions give a vivid picture of the enthusiastic and creative atmo- sphere of the meetings. The participants at the Macy Conferences perceived Cybernetics as a much-needed bridge between the natural sciences and the humanities. Hence their enthusiasm. Weiner’s feedback loops and von Neumann’s theory of games were used by anthropologists Mead and Bateson to explain many aspects of human behavior.

A.5 Biosemiotics

The Oxford Dictionary of Biochemistry and Molecular Biology (Oxford University Press, 1997) defines Biosemiotics as “the study of signs, of communication, and of information in living organisms”. The biologists Claus Emmeche and K. Kull offer another definition of Biosemiotics: “biology that interprets living systems as sign systems”. The American philosopher Charles Sanders Peirce (1839-1914) is considered to be one of the founders of Semiotics (and hence also of Biosemiotics). Peirce studied philosophy and chemistry at Harvard, where his father was a professor of mathematics and astronomy. He wrote extensively on philosophical subjects, and developed a theory of signs and meaning which anticipated many of the principles of modern Semiotics. Peirce built his theory on a triad: (1) the sign, which represents (2) something to (3) somebody. For example, the sign might be a broken stick, which represents a trail to a hunter, it might be the arched back of a cat, which represents an aggressive attitude to another cat, it might be the waggle-dance of a honey bee, which represents the coordinates of a source of food to her hive-mates, or it might be a molecule of trans-10-cis-hexadecadienol, which represents irresistible sexual temptation to a male moth of the species Bombyx mori. The sign might be a sequence of nucleotide bases which represents an amino acid to the ribosome-transfer-RNA system, or it might be a cell-surface antigen which represents self or non-self to the immune system. In information technology, the sign might be the presence or absence of a pulse of voltage, which represents a binary digit to a computer. Semiotics draws our attention to the sign and to its function, and places much less emphasis on the physical object which forms the sign. This characteristic of the semiotic viewpoint has been expressed by the Danish biologist Jesper Hoffmeyer in the following words: “The sign, rather than the molecule, is the basic unit for studying life.” A second important founder of Biosemiotics was Jakob von Uexk¨ull(1864-1944). He was born in Estonia, and studied zoology at the University of Tartu. After graduation, he worked at the Institute of Physiology at the University of Heidelberg, and later at the Zoological Station in Naples. In 1907, he was given an honorary doctorate by Heidelberg 212 QUANTUM THEORY

Figure A.12: Charles Sanders Pearce (1839-1914).

Figure A.13: Jakob Johann Baron von Uexk¨ull(1964-1944). Together with Pearce and Bateson, he is one of the principle founders of Biosemiotics. A.5. BIOSEMIOTICS 213

for his studies of the physiology of muscles. Among his discoveries in this field was the first recognized instance of negative feedback in an organism. Von Uexk¨ull’slater work was concerned with the way in which animals experience the world around them. To describe the animal’s subjective perception of its environment he introduced the word Umwelt; and in 1926 he founded the Institut fur Umweltforschung at the University of Heidelberg. Von Uexk¨ullvisualized an animal - for example a mouse - as being surrounded by a world of its own - the world conveyed by its own special senses organs, and processed by its own interpretative systems. Obviously, the Umwelt will differ greatly depending on the organism. For example, bees are able to see polarized light and ultraviolet light; electric eels are able to sense their environment through their electric organs; many insects are extraordinarily sensitive to pheromones; and a dog’s Umwelt far richer in smells than that of most other animals. The Umwelt of a jellyfish is very simple, but nevertheless it exists.2 Von Uexk¨ull’sUmwelt concept can even extend to one-celled organisms, which receive chemical and tactile signals from their environment, and which are often sensitive to light. The ideas and research of Jakob von Uexk¨ullinspired the later work of the Nobel Laureate ethologist Konrad Lorenz, and thus von Uexk¨ullcan be thought of as one of the founders of ethology as well as of Biosemiotics. Indeed, ethology and Biosemiotics are closely related. Biosemiotics also values the ideas of the American anthropologist Gregory Bateson (1904-1980), who was mentioned in Chapter 7 in connection with cybernetics and with the Macy Conferences. He was married to another celebrated anthropologist, Margaret Mead, and together they applied Norbert Wiener’s insights concerning feedback mechanisms to sociology, psychology and anthropology. Bateson was the originator of a famous epigram- matic definition of information: “..a difference which makes a difference” . This definition occurs in Chapter 3 of Bateson’s book, Mind and Nature: A Necessary Unity, Bantam, (1980), and its context is as follows: “To produce news of a difference, i.e. information”, Bateson wrote, “there must be two entities... such that news of their difference can be represented as a difference inside some information-processing entity, such as a brain or, perhaps, a computer. There is a profound and unanswerable question about the nature of these two entities that between them generate the difference which becomes information by making a difference. Clearly each alone is - for the mind and perception - a non-entity, a non-being... the sound of one hand clapping. The stuff of sensation, then, is a pair of values of some variable, presented over time to a sense organ, whose response depends on the ratio between the members of the pair.”

Microelectronics The problem of unreliable vacuum tubes was solved in 1948 by John Bardeen, William Shockley and Walter Brattain of the Bell Telephone Laboratories. Application of quantum theory to solids had lead to an understanding of the electrical properties of crystals. Like atoms, crystals were found to have allowed and forbidden energy levels.

2 It is interesting to ask to what extent the concept of Umwelt can be equated to that of consciousness. To the extent that these two concepts can be equated, von Uexk¨ull’sUmweltforschung offers us the opportunity to explore the phylogenetic evolution of the phenomenon of consciousness. 214 QUANTUM THEORY

The allowed energy levels for an electron in a crystal were known to form bands, i.e., some energy ranges with many allowed states (allowed bands), and other energy ranges with none (forbidden bands). The lowest allowed bands were occupied by electrons, while higher bands were empty. The highest filled band was called the “valence band”, and the lowest empty band was called the “conduction band”. According to quantum theory, whenever the valence band of a crystal is only partly filled, the crystal is a conductor of electricity; but if the valence band is completely filled with electrons, the crystal is an electrical insulator. (A completely filled band is analogous to a room so packed with people that none of them can move.) In addition to conductors and insulators, quantum theory predicted the existence of “semiconductors” - crystals where the valence band is completely filled with electrons, but where the energy gap between the conduction band and the valence band is very small. For example, crystals of the elements silicon and germanium are semiconductors. For such a crystal, thermal energy is sometimes enough to lift an electron from the valence band to the conduction band. Bardeen, Shockley and Brattain found ways to control the conductivity of germanium crystals by injecting electrons into the conduction band, or alternatively by removing elec- trons from the valence band. They could do this by “doping” the crystals with appropriate impurities, or by injecting electrons with a special electrode. The semiconducting crystals whose conductivity was controlled in this way could be used as electronic valves, in place of vacuum tubes. By the 1960’s, replacement of vacuum tubes by transistors in electronic computers had led not only to an enormous increase in reliability and a great reduction in cost, but also to an enormous increase in speed. It was found that the limiting factor in computer speed was the time needed for an electrical signal to propagate from one part of the central processing unit to another. Since electrical impulses propagate with the speed of light, this time is extremely small; but nevertheless, it is the limiting factor in the speed of electronic computers.

A.6 The Traitorous Eight

According to the Wikipedia article on Shockley, “In 1956 Shockley moved from New Jersey to Mountain View, California to start Shock- ley Semiconductor Laboratory to live closer to his ailing mother in Palo Alto, California. The company, a division of Beckman Instruments, Inc., was the first establishment working on silicon semiconductor devices in what came to be known as Silicon Valley. “His way [of leading the group] could generally be summed up as domineering and increasingly paranoid. In one well-known incident, he claimed that a secretary’s cut thumb was the result of a malicious act and he demanded lie detector tests to find the culprit, when in reality, the secretary had simply grabbed at a door handle that happened to have an exposed tack on it for the purpose of hanging paper notes on. After he received the Nobel Prize in 1956 his demeanor changed, as evidenced in his increasingly autocratic, erratic and A.6. THE TRAITOROUS EIGHT 215

Figure A.14: William Shockley (1910-1989) shared the 1956 Nobel Prize in Physics with John Bardeen and Walter Brattain. hard-to-please management style. In late 1957, eight of Shockley’s researchers, who would come to be known as the ‘traitorous eight, resigned after Shockley decided not to continue research into silicon-based semiconductors. They went on to form Fairchild Semiconductor, a loss from which Shockley Semiconductor never recovered. Over the course of the next 20 years, more than 65 new enterprises would end up having employee connections back to Fairchild.” 216 QUANTUM THEORY

Figure A.15: The Traitorous Eight: From left to right, Gordon Moore, C. Shel- don Roberts, Eugene Kleiner, Robert Noyce, Victor Grinich, Julius Blank, Jean Hoerni and Jay Last. A.7. INTEGRATED CIRCUITS 217 A.7 Integrated circuits

In order to reduce the propagation time, computer designers tried to make the central processing units very small; and the result was the development of integrated circuits and microelectronics. (Another motive for miniaturization of electronics came from the requirements of space exploration.) Integrated circuits were developed in which single circuit elements were not manufac- tured separately. Instead, the whole circuit was made at one time. An integrated circuit is a sandwich-like structure, with conducting, resisting and insulating layers interspersed with layers of germanium or silicon, “doped ” with appropriate impurities. At the start of the manufacturing process, an engineer makes a large drawing of each layer. For example, the drawing of a conducting layer would contain pathways which fill the role played by wires in a conventional circuit, while the remainder of the layer would consist of areas destined to be etched away by acid. The next step is to reduce the size of the drawing and to multiply it photographically. The pattern of the layer is thus repeated many times, like the design on a piece of wallpaper. The multiplied and reduced drawing is then focused through a reversed microscope onto the surface to be etched. Successive layers are built up by evaporating or depositing thin films of the appropriate substances onto the surface of a silicon or germanium wafer. If the layer being made is to be conducting, the surface would consist of an extremely thin layer of copper, covered with a photosensitive layer called a “photoresist”. On those portions of the surface receiving light from the pattern, the photoresist becomes insoluble, while on those areas not receiving light, the photoresist can be washed away. The surface is then etched with acid, which removes the copper from those areas not protected by photoresist. Each successive layer of a wafer is made in this way, and finally the wafer is cut into tiny “chips”, each of which corresponds to one unit of the wallpaper- like pattern. Although the area of a chip may be much smaller than a square centimeter, the chip can contain an extremely complex circuit. A typical programmable minicomputer or “microprocessor”, manufactured during the 1970’s, could have 30,000 circuit elements, all of which were contained on a single chip. By 1986, more than a million transistors were being placed on a single chip. As a result of miniaturization, the speed of computers rose steadily. In 1960, the fastest computers could perform a hundred thousand elementary operations in a second. By 1970, the fastest computers took less than a second to perform a million such operations. In 1987, a computer called GF11 was designed to perform 11 billion floating-point operations (flops) per second. GF11 (Gigaflop 11) is a scientific parallel-processing machine constructed by IBM. Approximately ten floating-point operations are needed for each machine instruction. Thus GF11 runs at the rate of approximately a thousand million instructions per second (1,100 MIPS). The high speed achieved by parallel-processing machines results from dividing a job into many sub-jobs on which a large number of processing units can work simultaneously. 218 QUANTUM THEORY

Computer memories have also undergone a remarkable development. In 1987, the magnetic disc memories being produced could store 20 million bits of information per square inch; and even higher densities could be achieved by optical storage devices. (A “bit” is the unit of information. For example, the number 25, written in the binary system, is 11001. To specify this 5-digit binary number requires 5 bits of information. To specify an n-digit binary number requires n bits of information. Eight bits make a “byte”.) In the 1970’s and 1980’s, computer networks were set up linking machines in various parts of the world. It became possible (for example) for a scientist in Europe to perform a calculation interactively on a computer in the United States just as though the distant machine were in the same room; and two or more computers could be linked for perform- ing large calculations. It also became possible to exchange programs, data, letters and manuscripts very rapidly through the computer networks.

A.8 Moore’s law

In 1965, only four years after the first integrated circuits had been produced, Dr. Gordon E. Moore, one of the founders of Intel, made a famous prediction which has come to be known as “Moore’s Law”. He predicted that the number of transistors per integrated circuit would double every two years, and that this trend would continue through 1975. In fact, the general trend predicted by Moore has continued for a much longer time. Although the number of transistors per unit area has not continued to double every two years, the logic density (bits per unit area) has done so, and thus a modified version of Moore’s law still holds today. How much longer the trend can continue remains to be seen. Physical limits to miniaturization of transistors of the present type will soon be reached; but there is hope that further miniaturization can be achieved through “quantum dot” technology, molecular switches, and autoassembly. A typical programmable minicomputer or “microprocessor”, manufactured in the 1970’s, could have 30,000 circuit elements, all of which were contained on a single chip. By 1989, more than a million transistors were being placed on a single chip; and by 2000, the number reached 42,000,000. As a result of miniaturization and parallelization, the speed of computers rose expo- nentially. In 1960, the fastest computers could perform a hundred thousand elementary operations in a second. By 1970, the fastest computers took less than a second to per- form a million such operations. In 1987, a massively parallel computer, with 566 parallel processors, called GFll was designed to perform 11 billion floating-point operations per second (flops). By 2002 the fastest computer performed 40 at teraflops, making use of 5120 parallel CPU’s. Computer disk storage has also undergone a remarkable development. In 1987, the magnetic disk storage being produced could store 20 million bits of information per square inch; and even higher densities could be achieved by optical storage devices. Storage density has until followed a law similar to Moore’s law. In the 1970’s and 1980’s, computer networks were set up linking machines in various A.8. MOORE’S LAW 219

Figure A.16: Gordon E. Moore (born 1929), a founder of Intel and the author of Moore’s Law. In 1965 he predicted that the number of components in integrated circuits would double every year for the next 10 years”. In 1975 he predicted the this doubling would continue, but revised the doubling rate to “every two years. Astonishingly, Moore’s Law has held much longer than he, or anyone else, anticipated. 220 QUANTUM THEORY

Figure A.17: Amazingly, Moore’s Law has held much longer than he, or anyone else, anticipated. Perhaps quantum dot technologies can extend its validity even longer.

Figure A.18: A logarithmic plot of the increase in PC hard-drive capacity in gigabytes. An extrapolation of the rate of increase predicts that the individual capacity of a commercially available PC will reach 10,000 gigabytes by 2015, i.e. 10,000,000,000,000 bytes. (After Hankwang and Rentar, Wikimedia Commons) A.9. SELF-REINFORCING INFORMATION ACCUMULATION 221 parts of the world. It became possible (for example) for a scientist in Europe to perform a calculation interactively on a computer in the United States just as though the distant machine were in the same room; and two or more computers could be linked for perform- ing large calculations. It also became possible to exchange programs, data, letters and manuscripts very rapidly through the computer networks. The exchange of large quantities of information through computer networks was made easier by the introduction of fiber optics cables. By 1986, 250,000 miles of such cables had been installed in the United States. If a ray of light, propagating in a medium with a large refractive index, strikes the surface of the medium at a grazing angle, then the ray undergoes total internal reflection. This phenomenon is utilized in fiber optics: A light signal can propagate through a long, hairlike glass fiber, following the bends of the fiber without losing intensity because of total internal reflection. However, before fiber optics could be used for information transmission over long distances, a technological breakthrough in glass manufacture was needed, since the clearest glass available in 1940 was opaque in lengths more than 10 m. Through studies of the microscopic properties of glasses, the problem of absorption was overcome. By 1987, devices were being manufactured commercially that were capable of transmitting information through fiber-optic cables at the rate of 1.7 billion bits per second.

A.9 Self-reinforcing information accumulation

Humans have been living on the earth for roughly two million years (more or less, depending on where one draws the line between our human and prehuman ancestors, Table 6.1). During almost all of this,time, our ancestors lived by hunting and food-gathering. They were not at all numerous, and did not stand out conspicuously from other animals. Then, suddenly, during the brief space of ten thousand years, our species exploded in numbers from a few million to seven billion, populating all parts of the earth, and even setting foot on the moon. This population explosion, which is still going on, has been the result of dramatic cultural changes. Genetically we are almost identical with our hunter-gatherer ancestors, who lived ten thousand years ago, but cultural evolution has changed our way of life beyond recognition. Beginning with the development of speech, human cultural evolution began to accel- erate. It started to move faster with the agricultural revolution, and faster still with the invention of writing and printing. Finally, modern science has accelerated the rate of social and cultural change to a completely unprecedented speed. The growth of modern science is accelerating because knowledge feeds on itself. A new idea or a new development may lead to several other innovations, which can in turn start an avalanche of change. For example, the quantum theory of atomic structure led to the in- vention of transistors, which made high-speed digital computers possible. Computers have not only produced further developments in quantum theory; they have also revolutionized many other fields. The self-reinforcing accumulation of knowledge - the information explosion - which 222 QUANTUM THEORY characterizes modern human society is reflected not only in an explosively-growing global population, but also in the number of scientific articles published, which doubles roughly every ten years. Another example is Moore’s law - the doubling of the information density of integrated circuits every two years. Yet another example is the explosive growth of Internet traffic shown in Table 17.1. The Internet itself is the culmination of a trend towards increasing societal information exchange - the formation of a collective human consciousness. This collective consciousness preserves the observations of millions of eyes, the experiments of millions of hands, the thoughts of millions of brains; and it does not die when the individual dies.

A.10 Automation

During the last three decades, the cost of computing has decreased exponentially by be- tween twenty and thirty percent per year. Meanwhile, the computer industry has grown exponentially by twenty percent per year (faster than any other industry). The astonish- ing speed of this development has been matched by the speed with which computers have become part of the fabric of science, engineering, industry, commerce, communications, transport, publishing, education and daily life in the industrialized parts of the world. The speed, power and accuracy of computers has revolutionized many branches of science. For example, before the era of computers, the determination of a simple molecular structure by the analysis of X-ray diffraction data often took years of laborious calculation; and complicated structures were completely out of reach. In 1949, however, Dorothy Crowfoot Hodgkin used an electronic computer to work out the structure of penicillin from X-ray data. This was the first application of a computer to a biochemical problem; and it was followed by the analysis of progressively larger and more complex structures. Proteins, DNA, and finally even the detailed structures of viruses were studied through the application of computers in crystallography. The enormous amount of data needed for such studies was gathered automatically by computer-controlled diffractometers; and the final results were stored in magnetic-tape data banks, available to users through computer networks. The application of quantum theory to chemical problems is another field of science which owes its development to computers. When Erwin Schr¨odingerwrote down his wave equation in 1926, it became possible, in principle, to calculate most of the physical and chemical properties of matter. However, the solutions to the Schr¨odingerequation for many-particle systems can only be found approximately; and before the advent of computers, even approximate solutions could not be found, except for the simplest systems. When high-speed electronic digital computers became widely available in the 1960’s, it suddenly became possible to obtain solutions to the Schr¨odinger equation for systems of chemical and even biochemical interest. Quantum chemistry (pioneered by such men as J.C. Slater, R.S. Mullikin, D.R. Hartree, V. Fock, J.H. Van Vleck, L. Pauling, E.B. Wilson, P.O. L¨owdin, E. Clementi, C.J. Ballhausen and others) developed into a rapidly-growing field, as did solid state physics. Through the use of computers, it became possible to A.10. AUTOMATION 223 design new materials with desired chemical, mechanical, electrical or magnetic properties. Applying computers to the analysis of reactive scattering experiments, D. Herschbach, J. Polanyi and Y. Lee were able to achieve an understanding of the dynamics of chemical reactions. The successes of quantum chemistry led Albert Szent-Gy¨orgyi, A. and B. Pullman, H. Scheraga and others to pioneer the fields of quantum biochemistry and molecular dynam- ics. Computer programs for drug design were developed, as well as molecular-dynamics programs which allowed the conformations of proteins to be calculated from a knowledge of their amino acid sequences. Studies in quantum biochemistry have yielded insights into the mechanisms of enzyme action, photosynthesis, active transport of ions across membranes, and other biochemical processes. In medicine, computers began to be used for monitoring the vital signs of critically ill patients, for organizing the information flow within hospitals, for storing patients’ records, for literature searches, and even for differential diagnosis of diseases. The University of Pennsylvania has developed a diagnostic program called INTERNIST- 1, with a knowledge of 577 diseases and their interrelations, as well as 4,100 signs, symp- toms and patient characteristics. This program was shown to perform almost as well as an academic physician in diagnosing difficult cases. QMR (Quick Medical Reference), a microcomputer adaptation of INTERNIST-1, incorporates the diagnostic functions of the earlier program, and also offers an electronic textbook mode. Beginning in the 1960’s, computers played an increasingly important role in engineering and industry. For example, in the 1960’s, Rolls Royce Ltd. began to use computers not only to design the optimal shape of turbine blades for aircraft engines, but also to control the precision milling machines which made the blades. In this type of computer-assisted design and manufacture, no drawings were required. Furthermore, it became possible for an industry requiring a part from a subcontractor to send the machine-control instructions for its fabrication through the computer network to the subcontractor, instead of sending drawings of the part. In addition to computer-controlled machine tools, robots were also introduced. They were often used for hazardous or monotonous jobs, such as spray-painting automobiles; and they could be programmed by going through the job once manually in the programming mode. By 1987, the population of robots in the United States was between 5,000 and 7,000, while in Japan, the Industrial Robot Association reported a robot population of 80,000. Chemical industries began to use sophisticated computer programs to control and to optimize the operations of their plants. In such control systems, sensors reported cur- rent temperatures, pressures, flow rates, etc. to the computer, which then employed a mathematical model of the plant to calculate the adjustments needed to achieve optimum operating conditions. Not only industry, but also commerce, felt the effects of computerization during the postwar period. Commerce is an information-intensive activity; and in fact some of the crucial steps in the development of information-handling technology developed because of the demands of commerce: The first writing evolved from records of commercial trans- actions kept on clay tablets in the Middle East; and automatic business machines, using 224 QUANTUM THEORY punched cards, paved the way for the development of the first programmable computers. Computerization has affected wholesaling, warehousing, retailing, banking, stockmarket transactions, transportation of goods - in fact, all aspects of commerce. In wholesaling, electronic data is exchanged between companies by means of computer networks, allowing order-processing to be handled automatically; and similarly, electronic data on prices is transmitted to buyers. The key to automatic order-processing in wholesaling was standardization. In the United States, the Food Marketing Institute, the Grocery Manufacturers of America, and several other trade organizations, established the Uniform Communications System (UCS) for the grocery industry. This system specifies a standard format for data on products, prices and orders. Automatic warehouse systems were designed as early as 1958. In such systems, the goods to be stored are placed on pallets (portable platforms), which are stacked automat- ically in aisles of storage cubicles. A computer records the position of each item for later automatic retrieval. In retailing, just as in wholesaling, standardization proved to be the key requirement for automation. Items sold in supermarkets in most industrialized countries are now labeled with a standard system of machine-readable thick and thin bars known as the Universal Product Code (UPC). The left-hand digits of the code specify the manufacturer or packer of the item, while the right-hand set of digits specify the nature of the item. A final digit is included as a check, to make sure that the others were read correctly. This last digit (called a modulo check digit) is the smallest number which yields a multiple of ten when added to the sum of the previous digits. When a customer goes through a check-out line, the clerk passes the purchased items over a laser beam and photocell, thus reading the UPC code into a small embedded com- puter or microprocessor at the checkout counter, which adds the items to the customer’s bill. The microprocessor also sends the information to a central computer and inventory data base. When stocks of an item become low, the central computer generates a re- placement order. The financial book-keeping for the retailing operation is also carried out automatically by the central computer. In many places, a customer passing through the checkout counter of a supermarket is able to pay for his or her purchases by means of a plastic card with a magnetic, machine- readable identification number. The amount of the purchase is then transmitted through a computer network and deducted automatically from the customer’s bank account. If the customer pays by check, the supermarket clerk may use a special terminal to determine whether a check written by the customer has ever “bounced”. Most checks are identified by a set of numbers written in the Magnetic-Ink Character Recognition (MICR) system. In 1958, standards for the MICR system were established, and by 1963, 85 percent of all checks written in the United States were identified by MICR numbers. By 1968, almost all banks had adopted this system; and thus the administration of checking accounts was automated, as well as the complicated process by which a check, deposited anywhere in the world, returns to the payers bank. Container ships were introduced in the late 1950’s, and since that time, container sys- A.10. AUTOMATION 225 tems have increased cargo-handling speeds in ports by at least an order of magnitude. Computer networks contributed greatly to the growth of the container system of trans- portation by keeping track of the position, ownership and contents of the containers. In transportation, just as in wholesaling and retailing, standardization proved to be a necessary requirement for automation. Containers of a standard size and shape could be loaded and unloaded at ports by specialized tractors and cranes which required only a very small staff of operators. Standard formats for computerized manifests, control documents, and documents for billing and payment, were instituted by the Transportation Data Coordinating Committee, a non-profit organization supported by dues from shipping firms. In the industrialized parts of the world, almost every type of work has been made more efficient by computerization and automation. Even artists, musicians, architects and authors find themselves making increasing use of computers: Advanced computing systems, using specialized graphics chips, speed the work of architects and film animators. The author’s traditional typewriter has been replaced by a word-processor, the composer’s piano by a music synthesizer. In the Industrial Revolution of the 18th and 19th centuries, muscles were replaced by machines. Computerization represents a Second Industrial Revolution: Machines have begun to perform not only tasks which once required human muscles, but also tasks which formerly required human intelligence. In industrial societies, the mechanization of agriculture has very much reduced the fraction of the population living on farms. For example, in the United States, between 1820 and 1980, the fraction of workers engaged in agriculture fell from 72 percent to 3.1 percent. There are signs that computerization and automation will similarly reduce the number of workers needed in industry and commerce. Computerization is so recent that, at present, we can only see the beginnings of its impact; but when the Second Industrial Revolution is complete, how will it affect society? When our children finish their education, will they face technological unemployment? The initial stages of the First Industrial Revolution produced much suffering, because labor was regarded as a commodity to be bought and sold according to the laws of supply and demand, with almost no consideration for the needs of the workers. Will we repeat this mistake? Or will society learn from its earlier experience, and use the technology of automation to achieve widely-shared human happiness? The Nobel-laureate economist, Wassily W. Leontief, has made the following comment on the problem of technological unemployment: “Adam and Eve enjoyed, before they were expelled from Paradise, a high standard of living without working. After their expulsion, they and their successors were condemned to eke out a miserable existence, working from dawn to dusk. The history of technological progress over the last 200 years is essentially the story of the human species working its way slowly and steadily back into Paradise. What would happen, however, if we suddenly found ourselves in it? With all goods and services provided without work, no one would be gainfully employed. Being unemployed means receiving no wages. As a result, until ap- propriate new income policies were formulated to fit the changed technological conditions, 226 QUANTUM THEORY everyone would starve in Paradise.” To say the same thing in a slightly different way: consider what will happen when a factory which now employs a thousand workers introduces microprocessor-controlled industrial robots and reduces its work force to only fifty. What will the nine hundred and fifty redundant workers do? They will not be able to find jobs elsewhere in industry, commerce or agriculture, because all over the economic landscape, the scene will be the same. There will still be much socially useful work to be done - for example, taking care of elderly people, beautifying the cities, starting youth centers, planting forests, cleaning up pollution, building schools in developing countries, and so on. These socially beneficial goals are not commercially “profitable”. They are rather the sort of projects which gov- ernments sometimes support if they have the funds for it. However, the money needed to usefully employ the nine hundred and fifty workers will not be in the hands of the govern- ment. It will be in the hands of the factory owner who has just automated his production line. In order to make the economic system function again, either the factory owner will have to be persuaded to support socially beneficial but commercially unprofitable projects, or else an appreciable fraction of his profits will have to be transferred to the government, which will then be able to constructively re-employ the redundant workers. The future problems of automation and technological unemployment may force us to rethink some of our economic ideas. It is possible that helping young people to make a smooth transition from education to secure jobs will become one of the important respon- sibilities of governments, even in countries whose economies are based on free enterprise. If such a change does take place in the future, while at the same time socialistic countries are adopting a few of the better features of free enterprise, then one can hope that the world will become less sharply divided by contrasting economic systems.

A.11 Neural networks

Physiologists have begun to make use of insights derived from computer design in their efforts to understand the mechanism of the brain; and computer designers are beginning to construct computers modeled after neural networks. We may soon see the development of computers capable of learning complex ideas, generalization, value judgements, artistic creativity, and much else that was once thought to be uniquely characteristic of the human mind. Efforts to design such computers will undoubtedly give us a better understanding of the way in which the brain performs its astonishing functions. Much of our understanding of the nervous systems of higher animals is due to the Spanish microscopist, Ram´ony Cajal, and to the English physiologists, Alan Hodgkin and Andrew Huxley. Cajal’s work, which has been confirmed and elaborated by modern electron microscopy, showed that the central nervous system is a network of nerve cells (neurons) and threadlike fibers growing from them. Each neuron has many input fibers (dendrites), and one output fiber (the axon), which may have several branches. A.11. NEURAL NETWORKS 227

S It is possible the computers of the future will have pattern-recognition and learning abilities derived from architecture inspired by our understanding of the synapse, by Young’s model, or by other biological models. However, pattern recognition and learning can also be achieved by programming, using computers of conventional architecture. Programs already exist which allow computers to understand both handwriting and human speech; and a recent chess-playing program was able to learn by studying a large number of championship games. Having optimized its parameters by means of this learning experience, the chess- playing program was able to win against grand masters! Like nuclear physics and genesplicing, artificial intelligence presents a challenge: Will society use its new powers wisely and humanely? The computer technology of the future can liberate us from dull and repetitive work, and allow us to use our energies creatively; or it can produce unemployment and misery, depending on how we organize our society. Which will we choose?

Suggestions for further reading 1. H. Babbage, Babbages Calculating Engines: A Collection of Papers by Henry Prevost Babbage, MIT Press, (1984). 2. A.M. Turing, The Enigma of Intelligence, Burnett, London (1983). 3. Ft. Penrose, The Emperor’s New Mind: Concerning Computers, Minds, and the Laws of Physics, Oxford University Press, (1989). 4. S. Wolfram, A New Kind of Science, Wolfram Media, Champaign IL, (2002). 5. A.M. Turing, On computable numbers, with an application to the Entscheidungsprob- lem, Proc. Lond. Math. Soc. Ser 2, 42, (1937). Reprinted in M. David Ed., The Undecidable, Raven Press, Hewlett N.Y., (1965). 6. N. Metropolis, J. Howlett, and Gian-Carlo Rota (editors), A History of Computing in the Twentieth Century, Academic Press (1980). 7. J. Shurkin, Engines of the Mind: A History of Computers, W.W. Norten, (1984). 8. J. Palfreman and D. Swade, The Dream Machine: Exploring the Computer Age, BBC Press (UK), (1991). 9. T.J. Watson, Jr. and P. Petre, Father, Son, and Co., Bantam Books, New York, (1991). 10. A. Hodges, Alan Turing: The Enegma, Simon and Schuster, (1983). 11. H.H. Goldstein, The Computer from Pascal to Von Neumann, Princeton University Press, (1972). 12. C.J. Bashe, L.R. Johnson, J.H. Palmer, and E.W. Pugh, IBM’s Early Computers, Vol. 3 in the History of Computing Series, MIT Press, (1986). 13. K.D. Fishman, The Computer Establishment, McGraw-Hill, (1982). 14. S. Levy, Hackers, Doubleday, (1984). 15. S. Franklin, Artificial Minds, MIT Press, (1997). 16. P. Freiberger and M. Swaine, Fire in the Valley: The Making of the Personal Com- puter, Osborne/MeGraw-Hill, (1984). 17. R.X. Cringely, Accidental Empires, Addison-Wesley, (1992). 228 QUANTUM THEORY

18. R. Randell editor, The Origins of Digital Computers, Selected Papers, Springer- Verlag, New York (1973). 19. H. Lukoff, From Dits to Bits, Robotics Press, (1979). 20. D.E. Lundstrom, A Few Good Men from Univac, MIT Press, (1987). 21. D. Rutland, Why Computers Are Computers (The SWAC and the PC), Wren Pub- lishers, (1995). 22. P.E. Ceruzzi, Reckoners: The Prehistory of the Digital Computer, from Relays to the Stored Program Concept, 1935-1945, Greenwood Press, Westport, (1983) 23. S.G. Nash, A History of Scientific Computing, Adison-Wesley, Reading Mass., (1990). 24. P.E. Ceruzzi, Crossing the divide: Architectural issues and the emer- gence of stored programme computers, 1935-1953, IEEE Annals of the History of Computing, 19, 5-12, January-March (1997). 25. P.E. Ceruzzi, A History of Modern Computing, MIT Press, Cambridge MA, (1998). 26. K. Zuse, Some remarks on the history of computing in Germany, in A History of Computing in the 20th Century, N. Metropolis et al. editors, 611-627, Academic Press, New York, (1980). 27. A.R. Mackintosh, The First Electronic Computer, Physics Today, March, (1987). 28. S.H. Hollingdale and G.C. Tootil, Electronic Computers, Penguin Books Ltd. (1970). 29. A. Hodges, Alan Turing: The Enegma, Simon and Schuster, New York, (1983). 30. A. Turing, On computable numbers with reference to the Entscheidungsproblem, Jour- nal of the London Mathematical Society, II, 2. 42, 230-265 (1937). 31. J. von Neumann, The Computer and the Brain, Yale University Press, (1958). 32. I.E. Sutherland, Microelectronics and computer science, Scientific American, 210-228, September (1977). 33. W. Aspray, John von Neumann and the Origins of Modern Computing, M.I.T. Press, Cambridge MA, (1990, 2nd ed. 1992). 34. W. Aspray, The history of computing within the history of information technology, History and Technology, 11, 7-19 (1994). 35. G.F. Luger, Computation and Intelligence: Collected Readings, MIT Press, (1995). 36. Z.W. Pylyshyn, Computation and Cognition: Towards a Foundation for Cognitive Science, MIT Press, (1986). 37. D.E. Shasha and C. Lazere, Out of Their Minds: The Creators of Computer Science, Copernicus, New York, (1995). 38. W. Aspray, An annotated bibliography of secondary sources on the history of software, Annals of the History of Computing 9, 291-243 (1988). 39. R. Kurzweil, The Age of Intelligent Machines, MIT Press, (1992). 40. S.L. Garfinkel and H. Abelson, eds., Architects of the Information Society: Thirty- Five Years of the Laboratory for Computer Sciences at MIT, MIT Press, (1999). 41. J. Haugeland, Artificial Intelligence: The Very Idea, MIT Press, (1989). 42. M.A. Boden, Artificial Intelligence in Psychology: Interdisciplinary Essays, MIT Press, (1989). 43. J.W. Cortada, A Bibliographic Guide to the History of Computer Applications, 1950- 1990, Greenwood Press, Westport Conn., (1996). A.11. NEURAL NETWORKS 229

44. M. Campbell-Kelly and W. Aspry, Computer: A History of the Information Machine, Basic Books, New York, (1996). 45. B.I. Blum and K. Duncan, editors, A History of Medical Informatics, ACM Press, New York, (1990). 46. J.-C. Guedon, La Planete Cyber, Internet et Cyberspace, Gallimard, (1996). 47. S. Augarten, Bit by Bit: An Illustrated History of Computers, Unwin, London, (1985). 48. N. Wiener, Cybernetics; or Control and Communication in the Animal and the Ma- chine, The Technology Press, John Wiley and Sons, New York, (1948). 49. W.R. Ashby, An Introduction to Cybernetics, Chapman and Hall, London, (1956). 50. M.A. Arbib, A partial survey of cybernetics in eastern Europe and the Soviet Union, Behavioral Sci., 11, 193-216, (1966). 51. A. Rosenblueth, N. Weiner and J. Bigelow, Behavior, purpose and teleology, Phil. Soc. 10 (1), 18-24 (1943). 52. N. Weiner and A. Rosenblueth, Conduction of impulses in cardiac muscle, Arch. Inst. Cardiol. Mex., 16, 205-265 (1946). 53. H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social systems. Transactions of sixth-tenth conferences, Josiah J. Macy Jr. Foundation, New York, (1950- 1954). 54. W.S. McCulloch and W. Pitts, A logical calculus of ideas immanent in nervous ac- tivity, Bull. Math. Biophys., 5, 115-133 (1943). 55. W.S. McCulloch, An Account of the First Three Conferences on Teleological Mecha- nisms, Josiah Macy Jr. Foundation, (1947). 56. G.A. Miller, Languages and Communication, McGraw-Hill, New York, (1951). 57. G.A. Miller, Statistical behavioristics and sequences of responses, Psychol. Rev. 56, 6 (1949). 58. G. Bateson, Bali - the value system of a steady state, in M. Fortes, editor, Social Struc- ture Studies Presented to A.R. Radcliffe-Brown, Clarendon Press, Oxford, (1949). 59. G. Bateson, Communication, the Social Matrix of Psychiatry, Norton, (1951). 60. G. Bateson, Steps to an Ecology of Mind, Chandler, San Francisco, (1972). 61. G. Bateson, Communication et Societe, Seuil, Paris, (1988). 62. S. Heims, Gregory Bateson and the mathematicians: From interdisciplinary interac- tions to societal functions, J. History Behavioral Sci., 13, 141-159 (1977). 63. S. Heims, John von Neumann and Norbert Wiener. From Mathematics to the Tech- nology of Life and Death, MIT Press, Cambridge MA, (1980). 64. S. Heims, The Cybernetics Group, MIT Press, Cambridge MA, (1991). 65. G. van de Vijver, New Perspectives on Cybernetics (Self-Organization, Autonomy and Connectionism), Kluwer, Dordrecht, (1992). 66. A. Bavelas, A mathematical model for group structures, Appl. Anthrop. 7 (3), 16 (1948). 67. P. de Latil, La Pensee Artificielle - Introduction a la Cybernetique, Gallimard, Paris, (1953). 68. L.K. Frank, G.E. Hutchinson, W.K. Livingston, W.S. McCulloch and N. Wiener, Teleological Mechanisms, Ann. N.Y. Acad. Sci. 50, 187- 277 (1948). 230 QUANTUM THEORY

69. H. von Foerster, Quantum theory of memory, in H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social systems. Trans- actions of the sixth conferences, Josiah J. Macy Jr. Foundation, New York, (1950). 70. H. von Foerster, Observing Systems, Intersystems Publications, California, (1984). 71. H. von Foerster, Understanding Understanding: Essays on Cybernetics and Cogni- tion, Springer, New York, (2002). 72. M. Newborn, Kasparov vs. Deep Blue: Computer Chess Comes of age, Springer Verlag, (1996). 73. K.M. Colby, Artificial Paranoia: A Computer Simulation of the Paranoid Process, Pergamon Press, New York, (1975). 74. J.Z. Young, Discrimination and learning in the octopus, in H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social sys- tems. Transactions of the ninth conference, Josiah J. Macy Jr. Foundation, New York, (1953). 75. M.J. Apter and L. Wolpert, Cybernetics and development. I. Infor- mation theory, J. Theor. Biol. 8, 244-257 (1965). 76. H. Atlan, L’Organization Biologique et la Theorie de I’Information, Hermann, Paris, (1972). 77. H. Atlan, On a formal definition of organization, J. Theor. Biol. 45, 295-304 (1974). 78. H. Atlan, Organization du vivant, information et auto-organization, in Volume Sym- posium 1986 de l’Encylopediea Universalis, pp. 355-361, Paris, (1986). 79. E.R. Kandel, Nerve cells and behavior, Scientific American, 223, 57-70, July, (1970). 80. E.R. Kandel, Small systems of neurons, Scientific American, 241 no.3, 66-76 (1979). 81. A.K. Katchalsky et al., Dynamic patterns of brain cell assemblies, Neurosciences Res. Prog. Bull., 12 no.1, (1974). 82. G.E. Moore, Cramming more components onto integrated circuits, Electronics, April 19, (1965). 83. P. Gelsinger, P. Gargini, G. Parker and A. Yu, Microprocessors circa 2000, IEEE Spectrum, October, (1989). 84. P. Baron, On distributed communications networks, IEEE Trans. Comm. Systems, March (1964). 85. V.G. Cerf and R.E. Khan, A protocol for packet network intercommunication, Trans. Comm. Tech. COM-22, V5, 627-641, May (1974). 86. L. Kleinrock, Communication Nets: Stochastic Message Flow and Delay, McGraw- Hill, New York, (1964). 87. L. Kleinrock, Queueing Systems: Vol. II, Computer Applications, Wiley, New York, (1976). 88. R. Kahn, editor, Special Issue on Packet Communication Networks, Proc. IEEE, 66, November, (1978). 89. L.G. Roberts, The evolution of packet switching, Proc. of the IEEE 66, 1307-13, (1978). 90. J. Abbate, The electrical century: Inventing the web, Proc. IEEE 87, November, (1999). A.11. NEURAL NETWORKS 231

91. J. Abbate, Inventing the Internet, MIT Press, Cambridge MA, (1999). 92. J.C. McDonald, editor, Fundamentals of Digital Switching, 2nd Edition, Plenum, New York, (1990). 93. B. Metcalfe, Packet Communication, Peer-to-Peer Communication, San Jose Calif, (1996). 94. T. Berners-Lee, The Original Design and Ultimate Destiny of the World Wide Web by its Inventor, Harper San Francisco, (1999). 95. J. Clark, Netscape Time: The Making of the Billion-Dollar Start-Up That Took On Microsoft, St. Martin’s Press, New York, (1999). 96. J. Wallace, Overdrive: Bill Gates and the Race to Control Cyberspace, Wiley, New York, (1997). 97. P. Cunningham and F. Froschl, The Electronic Business Revolution, Springer Verlag, New York, (1999). 98. J.L. McKenny, Waves of Change: Business Evolution Through Information Technol- ogy, Harvard Business School Press, (1995). 99. M.A. Cosumano, Competing on Internet Time: Lessons From Netscape and Its Battle with Microsoft, Free Press, New York, (1998). 100. F.J. Dyson, The Sun, the Genome and the Internet: Tools of Scientific Revolutions, Oxford University Press, (1999). 101. L. Bruno, Fiber Optimism: Nortel, Lucent and Cisco are battling to win the high- stakes fiber-optics game, Red Herring, June (2000). 102. N. Cochrane, We’re insatiable: Now it’s 20 million million bytes a day, Melbourne Age, January 15, (2001). 103. K.G. Coffman and A.N. Odlyzko, The size and growth rate of the Internet, First Monday, October, (1998). 104. C.A. Eldering, M.L. Sylla, and J.A. Eisenach, Is there a Moore’s law for bandwidth?, IEEE Comm. Mag., 2-7, October, (1999). 105. G. Gilder, Fiber keeps its promise: Get ready, bandwidth will triple each year for the next 25 years, Forbes, April 7, (1997). 106. A.M. Noll, Does data traffic exceed voice traffic?, Comm. ACM, 121- 124, June, (1999). 107. B. St. Arnaud, J. Coulter, J. Fitchett, and S. Mokbel, Architectural and engineering issues for building an optical Internet, Proc. Soc. Optical Eng. (1998). 108. M. Weisner, The computer for the 21st century, Scientific American, September, (1991). 109. R. Wright, Three Scientists and Their Gods, Time Books, (1988). 110. S. Nora and A. Mine, The Computerization of Society, MIT Press, (1981). 111. T. Forester, Computers in the Human Context: Information Theory, Productivity, and People, MIT Press, (1989). 112. J. Hoffmeyer, Some semiotic aspects of the psycho-physical relation: the endo-exosemiotic boundary, in Biosemiotics. The Semiotic Web, T.A. Sebeok and J. Umiker-Sebeok, editors, Mouton de Gruyter, Berlin/New York, (1991). 232 QUANTUM THEORY

113. J. Hoffmeyer, The swarming cyberspace of the body, Cybernetics and Human Know- ing, 3(1), 1-10 (1995). 114. J. Hoffmeyer, Signs of Meaning in the Universe, Indiana University Press, Bloom- ington IN, (1996). 115. J. Hoffmeyer, Biosemiotics: Towards a new synthesis in biology, European J. Semiotic Stud. 9(2), 355-376 (1997). 116. J. Hoffmeyer and C. Emmeche, Code-duality and the semiotics of nature, in On Semiotic Modeling, M. Anderson and F. Merrell, editors, Mouton de Gruyter, New York, (1991). 117. C. Emmeche and J. Hoffmeyer, From language to nature - The semiotic metaphor in biology, Semiotica, 84, 1-42 (1991). 118. C. Emmeche, The biosemiotics of emergent properties in a pluralist ontology, in Semiosis, Evolution, Energy: Towards a Reconceptualization of the Sign, E. Taborsky, editor, Shaker Verlag, Aachen, (1999). 119. S. Brier, Information and consciousness: A critique of the mechanistic concept of information, in Cybernetics and Human Knowing, 1(2/3), 71-94 (1992). 120. S. Brier, Ciber-Semiotics: Second-order cybernetics and the semiotics of C.S. Peirce, Proceedings from the Second European Congress on System Science, Prague, October 5-8, 1993, AFCET, (1993). 121. S. Brier, A cybernetic and semiotic view on a Galilean theory of psychology, Cyber- netics and Human Knowing, 2 (2), 31-46 (1993). 122. S. Brier, Cybersemiotics: A suggestion for a transdisciplinary frame- work for de- scription of observing, anticipatory, and meaning producing systems, in D.M. Dubois, editor, Computing Anticipatory Systems, CASYS - First International Conference, Liege, Belgium 1997, AIP Conference Proceedings no. 437, (1997). 123. S. Oyama, The Ontogeny of Information, Cambridge University Press, (1985). 124. J. Hoffmeyer, The swarming cyberspace of the body, Cybernetics and Human Know- ing, 3(1), 1-10 (1995). 125. J.L. Casti and A. Karlqvist, editors, Complexity, Language, and Life: Mathematical Approaches, Springer, Berlin, (1985). 126. H. Maturana and F. Varla, Autopoiesis and Cognition: The Realization of the Living, Reidel, London, (1980). 127. J. Mingers, Self-Producing Systems: Implications and Application of Autopoiesis, Plenum Press, New York, (1995). 128. J. Buchler, editor, Philosophical Writings of Peirce: Selected and Edited with an Introduction by Justus Buchler, Dover Publications, New York, (1955). 129. T.L. Short, Peirce’s semiotic theory of the self, Semiotica, 91 (1/2), 109-131 (1992). 130. J. von Uexk¨ull, Umwelt und Innenwelt der Tiere. 2. verm, und verb. Aufl., Springer, Berlin, (1921). 131. J. von Uexk¨ull,The theory of meaning, Semiotica, 42(1), 25-87 (1982 [1940]). 132. T. von Uexk¨ull, Introduction: Meaning and science in Jacob von Uexkull’s concept of biology, Semiotica, 42, 1-24 (1982). 133. T. von Uexk¨ull, Medicine and semiotics, Semiotica, 61 , 201-217 (1986). A.11. NEURAL NETWORKS 233

134. G. Bateson, Form, substance, and difference. Nineteenth Annual Korzybski Memorial Lecture, (1970). Reprinted in G. Bateson, Steps to an Ecology of Mind, Balentine Books, New York, (1972), pp. 448-464. 135. G. Bateson, Mind and Nature: A Necessary Unity, Bantam Books, New York, (1980). 136. G. Bateson, Sacred Unity: Further Steps to an Ecology of Mind, Harper Collins, New York, (1991). 137. J. Ruesch and G. Bateson, Communication, Norton, New York, (1987). 138. E.F. Yates, Semiotics as a bridge between information (biology) and dynamics (physics), Recherches Semiotiques/Semiotic Inquiry 5, 347- 360 (1985). 139. T.A. Sebeok, Communication in animals and men, Language, 39, 448-466 (1963). 140. T.A. Sebeok, The Sign and its Masters, University of Texas Press, (1979). 141. P. Bouissac, Ecology of semiotic space: Competition, exploitation, and the evolution of arbitrary signs, Am. J. Semiotics, 10, 145-166 (1972). 142. F. Varla, Autopoiesis: A Theory of Living Organization, North Holland, New York, (1986). 143. R. Posner, K. Robins and T.A. Sebeok, editors, Semiotics: A Handbook of the Sign- Theoretic Foundations of Nature and Culture, Walter de Gruyter, Berlin, (1992). 144. R. Paton, The ecologies of hereditary information, Cybernetics and Human Knowing, 5(4), 31-44 (1998). 145. T. Stonier, Information and the Internal Structure of the Universe, Springer, Berlin, (1990). 146. T. Stonier, Information and Meaning: An Evolutionary Perspective, Springer, Berlin, (1997). 147. N. Metropolis, J. Howlett, and Gian-Carlo Rota (editors), A History of Computing in the Twentieth Century, Academic Press (1980). 148. S.H. Hollingdale and G.C. Tootil, Electronic Computers, Penguin Books Ltd. (1970). 149. Alan Turing, The Enigma of Intelligence, Burnett, London (1983). 150. R. Randell (editor), The Origins of Digital Computers, Selected Papers, Springer- Verlag, New York (1973). 151. Allan R. Mackintosh, The First Electronic Computer, Physics Today, March, (1987). 152. N. Metropolis, J. Howlett, and Gian-Carlo Rota (editors), A History of Computing in the Twentieth Century, Academic Press (1980). 153. S.H. Hollingdale and G.C. Tootil, Electronic Computers, Penguin Books Ltd. (1970). 154. R. Randell (editor), The Origins of Digital Computers, Selected Papers, Springer- Verlag, New York (1973). 155. Allan R. Mackintosh, The First Electronic Computer, Physics Today, March, (1987). 156. H. Babbage, Babbages Calculating Engines: A Collection of Papers by Henry Prevost Babbage, MIT Press, (1984). 157. A.M. Turing, The Enigma of Intelligence, Burnett, London (1983). 158. Ft. Penrose, The Emperor’s New Mind: Concerning Computers, Minds, and the Laws of Physics, Oxford University Press, (1989). 159. S. Wolfram, A New Kind of Science, Wolfram Media, Champaign IL, (2002). 234 QUANTUM THEORY

160. A.M. Turing, On computable numbers, with an application to the Entscheidungsprob- lem, Proc. Lond. Math. Soc. Ser 2, 42, (1937). Reprinted in M. David Ed., The Undecidable, Raven Press, Hewlett N.Y., (1965). 161. N. Metropolis, J. Howlett, and Gian-Carlo Rota (editors), A History of Computing in the Twentieth Century, Academic Press (1980). 162. J. Shurkin, Engines of the Mind: A History of Computers, W.W. Norten, (1984). 163. J. Palfreman and D. Swade, The Dream Machine: Exploring the Computer Age, BBC Press (UK), (1991). 164. T.J. Watson, Jr. and P. Petre, Father, Son, and Co., Bantam Books, New York, (1991). 165. A. Hodges, Alan Turing: The Enegma, Simon and Schuster, (1983). 166. H.H. Goldstein, The Computer from Pascal to Von Neumann, Princeton University Press, (1972). 167. C.J. Bashe, L.R. Johnson, J.H. Palmer, and E.W. Pugh, IBM’s Early Computers, Vol. 3 in the History of Computing Series, MIT Press, (1986). 168. K.D. Fishman, The Computer Establishment, McGraw-Hill, (1982). 169. S. Levy, Hackers, Doubleday, (1984). 170. S. Franklin, Artificial Minds, MIT Press, (1997). 171. P. Freiberger and M. Swaine, Fire in the Valley: The Making of the Personal Com- puter, Osborne/MeGraw-Hill, (1984). 172. R.X. Cringely, Accidental Empires, Addison-Wesley, (1992). 173. R. Randell editor, The Origins of Digital Computers, Selected Papers, Springer- Verlag, New York (1973). 174. H. Lukoff, From Dits to Bits, Robotics Press, (1979). 175. D.E. Lundstrom, A Few Good Men from Univac, MIT Press, (1987). 176. D. Rutland, Why Computers Are Computers (The SWAC and the PC), Wren Publishers, (1995). 177. P.E. Ceruzzi, Reckoners: The Prehistory of the Digital Computer, from Relays to the Stored Program Concept, 1935-1945, Greenwood Press, Westport, (1983) 178. S.G. Nash, A History of Scientific Computing, Adison-Wesley, Reading Mass., (1990). 179. P.E. Ceruzzi, Crossing the divide: Architectural issues and the emer- gence of stored programme computers, 1935-1953, IEEE Annals of the History of Computing, 19, 5-12, January-March (1997). 180. P.E. Ceruzzi, A History of Modern Computing, MIT Press, Cambridge MA, (1998). 181. K. Zuse, Some remarks on the history of computing in Germany, in A History of Computing in the 20th Century, N. Metropolis et al. editors, 611-627, Academic Press, New York, (1980). 182. A.R. Mackintosh, The First Electronic Computer, Physics Today, March, (1987). 183. S.H. Hollingdale and G.C. Tootil, Electronic Computers, Penguin Books Ltd. (1970). 184. A. Hodges, Alan Turing: The Enegma, Simon and Schuster, New York, (1983). 185. A. Turing, On computable numbers with reference to the Entscheidungsproblem, Jour- nal of the London Mathematical Society, II, 2. 42, 230-265 (1937). 186. J. von Neumann, The Computer and the Brain, Yale University Press, (1958). A.11. NEURAL NETWORKS 235

187. I.E. Sutherland, Microelectronics and computer science, Scientific American, 210-228, September (1977). 188. W. Aspray, John von Neumann and the Origins of Modern Computing, M.I.T. Press, Cambridge MA, (1990, 2nd ed. 1992). 189. W. Aspray, The history of computing within the history of information technology, History and Technology, 11, 7-19 (1994). 190. G.F. Luger, Computation and Intelligence: Collected Readings, MIT Press, (1995). 191. Z.W. Pylyshyn, Computation and Cognition: Towards a Foundation for Cognitive Science, MIT Press, (1986). 192. D.E. Shasha and C. Lazere, Out of Their Minds: The Creators of Computer Science, Copernicus, New York, (1995). 193. W. Aspray, An annotated bibliography of secondary sources on the history of software, Annals of the History of Computing 9, 291-243 (1988). 194. R. Kurzweil, The Age of Intelligent Machines, MIT Press, (1992). 195. S.L. Garfinkel and H. Abelson, eds., Architects of the Information Society: Thirty- Five Years of the Laboratory for Computer Sciences at MIT, MIT Press, (1999). 196. J. Haugeland, Artificial Intelligence: The Very Idea, MIT Press, (1989). 197. M.A. Boden, Artificial Intelligence in Psychology: Interdisciplinary Essays, MIT Press, (1989). 198. J.W. Cortada, A Bibliographic Guide to the History of Computer Applications, 1950- 1990, Greenwood Press, Westport Conn., (1996). 199. M. Campbell-Kelly and W. Aspry, Computer: A History of the Information Machine, Basic Books, New York, (1996). 200. B.I. Blum and K. Duncan, editors, A History of Medical Informatics, ACM Press, New York, (1990). 201. J.-C. Guedon, La Planete Cyber, Internet et Cyberspace, Gallimard, (1996). 202. S. Augarten, Bit by Bit: An Illustrated History of Computers, Unwin, London, (1985). 203. N. Wiener, Cybernetics; or Control and Communication in the Animal and the Ma- chine, The Technology Press, John Wiley and Sons, New York, (1948). 204. W.R. Ashby, An Introduction to Cybernetics, Chapman and Hall, London, (1956). 205. M.A. Arbib, A partial survey of cybernetics in eastern Europe and the Soviet Union, Behavioral Sci., 11, 193-216, (1966). 206. A. Rosenblueth, N. Weiner and J. Bigelow, Behavior, purpose and teleology, Phil. Soc. 10 (1), 18-24 (1943). 207. N. Weiner and A. Rosenblueth, Conduction of impulses in cardiac muscle, Arch. Inst. Cardiol. Mex., 16, 205-265 (1946). 208. H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social systems. Transactions of sixth-tenth conferences, Josiah J. Macy Jr. Foundation, New York, (1950- 1954). 209. W.S. McCulloch and W. Pitts, A logical calculus of ideas immanent in nervous ac- tivity, Bull. Math. Biophys., 5, 115-133 (1943). 210. W.S. McCulloch, An Account of the First Three Conferences on Teleological Mecha- nisms, Josiah Macy Jr. Foundation, (1947). 236 QUANTUM THEORY

211. G.A. Miller, Languages and Communication, McGraw-Hill, New York, (1951). 212. G.A. Miller, Statistical behavioristics and sequences of responses, Psychol. Rev. 56, 6 (1949). 213. G. Bateson, Bali - the value system of a steady state, in M. Fortes, editor, Social Struc- ture Studies Presented to A.R. Radcliffe-Brown, Clarendon Press, Oxford, (1949). 214. G. Bateson, Communication, the Social Matrix of Psychiatry, Norton, (1951). 215. G. Bateson, Steps to an Ecology of Mind, Chandler, San Francisco, (1972). 216. G. Bateson, Communication et Societe, Seuil, Paris, (1988). 217. S. Heims, Gregory Bateson and the mathematicians: From interdisciplinary interac- tions to societal functions, J. History Behavioral Sci., 13, 141-159 (1977). 218. S. Heims, John von Neumann and Norbert Wiener. From Mathematics to the Tech- nology of Life and Death, MIT Press, Cambridge MA, (1980). 219. S. Heims, The Cybernetics Group, MIT Press, Cambridge MA, (1991). 220. G. van de Vijver, New Perspectives on Cybernetics (Self-Organization, Autonomy and Connectionism), Kluwer, Dordrecht, (1992). 221. A. Bavelas, A mathematical model for group structures, Appl. Anthrop. 7 (3), 16 (1948). 222. P. de Latil, La Pensee Artificielle - Introduction a la Cybernetique, Gallimard, Paris, (1953). 223. L.K. Frank, G.E. Hutchinson, W.K. Livingston, W.S. McCulloch and N. Wiener, Teleological Mechanisms, Ann. N.Y. Acad. Sci. 50, 187- 277 (1948). 224. H. von Foerster, Quantum theory of memory, in H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social systems. Trans- actions of the sixth conferences, Josiah J. Macy Jr. Foundation, New York, (1950). 225. H. von Foerster, Observing Systems, Intersystems Publications, California, (1984). 226. H. von Foerster, Understanding Understanding: Essays on Cybernetics and Cogni- tion, Springer, New York, (2002). 227. M. Newborn, Kasparov vs. Deep Blue: Computer Chess Comes of age, Springer Verlag, (1996). 228. K.M. Colby, Artificial Paranoia: A Computer Simulation of the Paranoid Process, Pergamon Press, New York, (1975). 229. J.Z. Young, Discrimination and learning in the octopus, in H. von Foerster, editor, Cybernetics - circular, causal and feed-back mechanisms in biological and social sys- tems. Transactions of the ninth conference, Josiah J. Macy Jr. Foundation, New York, (1953). 230. M.J. Apter and L. Wolpert, Cybernetics and development. I. Infor- mation theory, J. Theor. Biol. 8, 244-257 (1965). 231. H. Atlan, L’Organization Biologique et la Theorie de I’Information, Hermann, Paris, (1972). 232. H. Atlan, On a formal definition of organization, J. Theor. Biol. 45, 295-304 (1974). 233. H. Atlan, Organization du vivant, information et auto-organization, in Volume Sym- posium 1986 de l’Encylopediea Universalis, pp. 355-361, Paris, (1986). 234. E.R. Kandel, Nerve cells and behavior, Scientific American, 223, 57-70, July, (1970). A.11. NEURAL NETWORKS 237

235. E.R. Kandel, Small systems of neurons, Scientific American, 241 no.3, 66-76 (1979). 236. A.K. Katchalsky et al., Dynamic patterns of brain cell assemblies, Neurosciences Res. Prog. Bull., 12 no.1, (1974). 237. G.E. Moore, Cramming more components onto integrated circuits, Electronics, April 19, (1965). 238. P. Gelsinger, P. Gargini, G. Parker and A. Yu, Microprocessors circa 2000, IEEE Spectrum, October, (1989). 239. P. Baron, On distributed communications networks, IEEE Trans. Comm. Systems, March (1964). 240. V.G. Cerf and R.E. Khan, A protocol for packet network intercommunication, Trans. Comm. Tech. COM-22, V5, 627-641, May (1974). 241. L. Kleinrock, Communication Nets: Stochastic Message Flow and Delay, McGraw- Hill, New York, (1964). 242. L. Kleinrock, Queueing Systems: Vol. II, Computer Applications, Wiley, New York, (1976). 243. R. Kahn, editor, Special Issue on Packet Communication Networks, Proc. IEEE, 66, November, (1978). 244. L.G. Roberts, The evolution of packet switching, Proc. of the IEEE 66, 1307-13, (1978). 245. J. Abbate, The electrical century: Inventing the web, Proc. IEEE 87, November, (1999). 246. J. Abbate, Inventing the Internet, MIT Press, Cambridge MA, (1999). 247. J.C. McDonald, editor, Fundamentals of Digital Switching, 2nd Edition, Plenum, New York, (1990). 248. B. Metcalfe, Packet Communication, Peer-to-Peer Communication, San Jose Calif, (1996). 249. T. Berners-Lee, The Original Design and Ultimate Destiny of the World Wide Web by its Inventor, Harper San Francisco, (1999). 250. J. Clark, Netscape Time: The Making of the Billion-Dollar Start-Up That Took On Microsoft, St. Martin’s Press, New York, (1999). 251. J. Wallace, Overdrive: Bill Gates and the Race to Control Cyberspace, Wiley, New York, (1997). 252. P. Cunningham and F. Froschl, The Electronic Business Revolution, Springer Verlag, New York, (1999). 253. J.L. McKenny, Waves of Change: Business Evolution Through Information Tech- nology, Harvard Business School Press, (1995). 254. M.A. Cosumano, Competing on Internet Time: Lessons From Netscape and Its Battle with Microsoft, Free Press, New York, (1998). 255. F.J. Dyson, The Sun, the Genome and the Internet: Tools of Scientific Revolutions, Oxford University Press, (1999). 256. L. Bruno, Fiber Optimism: Nortel, Lucent and Cisco are battling to win the high- stakes fiber-optics game, Red Herring, June (2000). 238 QUANTUM THEORY

257. N. Cochrane, We’re insatiable: Now it’s 20 million million bytes a day, Melbourne Age, January 15, (2001). 258. K.G. Coffman and A.N. Odlyzko, The size and growth rate of the Internet, First Monday, October, (1998). 259. C.A. Eldering, M.L. Sylla, and J.A. Eisenach, Is there a Moore’s law for bandwidth?, IEEE Comm. Mag., 2-7, October, (1999). 260. G. Gilder, Fiber keeps its promise: Get ready, bandwidth will triple each year for the next 25 years, Forbes, April 7, (1997). 261. A.M. Noll, Does data traffic exceed voice traffic?, Comm. ACM, 121- 124, June, (1999). 262. B. St. Arnaud, J. Coulter, J. Fitchett, and S. Mokbel, Architectural and engineering issues for building an optical Internet, Proc. Soc. Optical Eng. (1998). 263. M. Weisner, The computer for the 21st century, Scientific American, September, (1991). 264. R. Wright, Three Scientists and Their Gods, Time Books, (1988). 265. S. Nora and A. Mine, The Computerization of Society, MIT Press, (1981). 266. T. Forester, Computers in the Human Context: Information Theory, Productivity, and People, MIT Press, (1989). Appendix B

GROUP THEORY

B.1 Finite Groups

The definition of a finite group A finite group is defined by the following conditions:

1. If any two elements belonging to the group are multiplied together, the product is another element belonging to the group.

2. There is an identity element.

3. Each element has an inverse.

4. Multiplication of the elements is associative1, but necessarily commutative.

5. The group contains g elements, where g is a finite positive integer called the order of the group.

As a simple example, we might think of a molecule which is symmetric with respect to rotations through an angle of 2π/3 about some axis but which has no other symmetry. Then the set of geometrical operations that leave the molecule invariant form a group containing 3 elements: the identity element; a rotation through an angle 2π/3 about the axis of symmetry, and a rotation through an angle 4π/3 about the same axis. Let us denote −1 these operations respectively by E, C3, and C3 . We can easily construct a multiplication table for the group. If we do so, each element of the group will appear once and only once in any row or column of the multiplication table. This follows from the fact that AX = B has one and only one solution among the group elements. Since A−1 and B belong to the group, and since the product of any two elements belongs to the group, X = A−1B is also a uniquely-defined element. Now suppose that the element B appears more than once in the Ath row of the multiplication table. Then AX = B will have more than one

1A(BC)=(AB)C

239 240 LIVES IN MATHEMATICS

solution which is impossible. Since no element can appear more than once, each element must appear once because there are g elements and g places in the row, all of which have to be filled.

B.2 Representations of geometrical symmetry groups

The elements of a geometrical symmetry group are linear coordinate transformations. Such transformations have the form

d ∂Xi Xi = xj + bi (B.1) ∂xj j=1 X where ∂Xi/∂xj and bi are constants. Now consider a set of functions Φ1,Φ2, ..., ΦM . We can use equation (B.1) to express Φ1(x) as a function of X. If we then expand the resulting function of X in terms of the other Φn ’s, we shall obtain a relation of the form | i

Φn(x) = Φn0 (X)Dn0,n (B.2) n0 X If we denote the coordinate transformation in equation (B.1) by the symbol G, we can rewrite equations (B.1) and (B.2) in the form:

X = Gjx −1 Φn(x) Φn(G X) GjΦn(X) ≡ j ≡ = Φn0 (X)Dn0,n(G) (B.3) n0 X

In this sense, the coordinate transformation defines an operator Gj, and Dn0,n(Gj) is a matrix representing Gj. Is can easily be shown that the matrices representing a set of operators G1, G2,...,Gg in a given basis, obey the same multiplication table as the operators themselves. For example, if we know that

−1 C3C3 = E (B.4)

and that

C3Φn = Φn0 Dn0,n(C3) n0 X −1 −1 C3 Φn = Φn0 Dn0,n(C3 ) n0 X EΦn = Φn0 Dn0,n(E) (B.5) n0 X B.3. SIMILARITY TRANSFORMATIONS 241

then it follows that: −1 −1 C3C3 Φn = C3Φn0 Dn0,n(C3 ) n0 X −1 = Φn00 Dn00,n0 (C3)Dn0,n(C3 ) n00 ( n0 ) X X = EΦn = Φn00 Dn00,n(E) (B.6) n00 X so that we must have −1 Dn00,n(E) = Dn00,n0 (C3)Dn0,n(C3 ) (B.7) n0 X Thus given any set of basis functions Φ1, Φ2, ..., ΦM which mix together under the ele- ments of a group G1, G2,...,Gg, we can obtain a set of matrices Dn0,n(Gj) defined by the relationships

GjΦn = Φn0 Dn0,n(Gj) j = 1, 2, ..., g (B.8) n0 X These matrices will obey the same multiplication table as the operators G1, G2,...,Gg, and they are said to form a matrix representation of the group.

B.3 Similarity transformations

0 0 0 Now let us consider another representation, Dm0,m(Gj), based on a set of functions Φ1,Φ2, 0 ..., ΦM which are related to our original set Φ1,Φ2, ..., ΦM by the transformation: 0 Φm = ΦnSn,m n X 0 −1 Φn = ΦmSm,n (B.9) m X The primed representation is defined by the relationship 0 0 0 GjΦm = Φm0 Dm0,m(Gj) j = 1, 2, ..., g (B.10) m0 X Then from equations (B.8)-(B.10) we have 0 0 0 GjΦm = Φm0 Dm0,m(Gj) m0 X = Gj ΦnSn,m n X = Φn0 Dn0,n(Gj)Sn,m n,n0 X 0 −1 = Φm0 Sm0,n0 Dn0,n(Gj)Sn,m (B.11) m0,n,n0 X 242 LIVES IN MATHEMATICS so that we must have

0 −1 Dm0,m(Gj) = Sm0,n0 Dn0,n(Gj)Sn,m (B.12) n,n0 X or

D0 = S−1DS (B.13)

A transformation of this type, where the matrix S need not be unitary, is called a ‘similarity transformation’.

B.4 Characters and reducibility

The character χ(Gj) of the matrix Dn0,n(Gj) is defined as the sum of the diagonal elements:

χ(Gj) Dn,n(Gj) (B.14) ≡ n X We would like to show that the character of each element in a representation of a finite group is invariant under a similarity transformation. From equations (B.12) and (B.14) we have:

0 0 χ (Gj) Dm,m(Gj) ≡ m X −1 = Sm,n0 Dn0,n(Gj)Sn,m m,n,n0 X −1 = Sn,mSm,n0 Dn0,n(Gj) n,n0 m ! X X = δn0,nDn0,n(Gj) n,n0 X = Dn,n(Gj) = χ(Gj) q.e.d. (B.15) n X If two representations are connected by a similarity transformation, then they are said to be ‘equivalent’. From (B.15) it follows that when two representations are equivalent, then 0 χ (Gj) = χ(Gj) for j = 1, 2, ..., g. Sometimes it is possible by means of a similarity transformation to bring all the elements of a representation into a block-diagonal form. In other words it may be possible to bring 0 Dm0,m(Gj) into a form where the non-zero elements are confined blocks along the diagonal, the blocks being the same for all the group elements. To express the same idea differently, it is sometimes possible to go over by means of a similarity transformation from the original 0 0 0 basis set, Φ1,Φ2, ..., ΦM to a new basis set Φ1,Φ2, ..., ΦM which can be divided into two B.4. CHARACTERS AND REDUCIBILITY 243

Table A.1 Multiplication table for the group C3

−1 E C3 C3

−1 E E C3 C3

−1 C3 C3 C3 E

−1 −1 C3 C3 E C3

or more subsets, each of which mixes only with itself under the operations G1, G2,...,Gg. A representation based on two or more subsets of basis functions which mix only with themselves under the operations of the group is said to be ‘reduced’. Whenever it is possible to bring a representation into a reduced form by means of a similarity transformation, it is said to be ‘reducible’. Whenever this is not possible, the representation is said to be ‘irreducible’. 244 LIVES IN MATHEMATICS

Table A.2 Character table for the group C3

−1 E C3 C3

A 1 1 1

i(2π/3) −i(2π/3) Γc 1 e e

∗ −i(2π/3) i(2π/3) Γc 1 e e

B.5 The great orthogonality theorem

A unitary matrix is a matrix whose conjugate transpose (Hermitian adjoint) is equal to its inverse. It is always possible, by means of a similarity transformation, to bring the matrix α β representations of a finite group into unitary form. Now let Dn0,n(Gj) and Dm0,m(Gj) be two unitary irreducible representations of a finite group of order g. The great orthogonality theorem, from which much of the power of group theory is derived, then states that

g α∗ β g Dn0,n(Gj)Dm0,m(Gj) = δα,βδn0,m0 δn,m (B.16) dα j=1 X α where dα is the dimension of the matrices Dn0,n(Gj). The proof of the great orthogonality theorem depends on Schur’s lemma, which states that if A is a matrix that commutes with α every matrix Dn0,n(Gj), j = 1, 2, ..., g in a unitary irreducible representation of a finite group, then A must be a multiple of the unit matrix, i.e., if

AD(Gj) D(Gj)A = 0, j = 1, 2, ..., g (B.17) − then

A I (B.18) ∼ α The proof of Schur’s lemma is as follows: If A commutes with Dn0,n(Gj), j = 1, 2, ..., g, then so does its conjugate transpose A†. Therefore we can let A be Hermitian without loss of generality, and we can diagonalize A by means of a unitary transformation:

UAU −1 = A(d) (B.19) B.5. THE GREAT ORTHOGONALITY THEOREM 245 where A(d) is diagonal. Then

−1 (d) −1 (d) U A UD(Gj) D(Gj)U A U = 0, j = 1, 2, ..., g (B.20) − Multiplying on the left by U and on the right by U −1 then yields

(d) −1 −1 (d) A UD(Gj)U UD(Gj)U A = 0, j = 1, 2, ..., g (B.21) − Thus we can write

(d) 0 0 (d) A D (Gj) D (Gj)A = 0, j = 1, 2, ..., g (B.22) − where

0 −1 D (Gj) UD(Gj)U (B.23) ≡

(d) (d) (d) Since A is diagonal we can write An0,n = An δn0,n. Thus with the indices written out, (B.22) becomes:

(d) 0α 0α (d) A 0 δn00,n0 D 0 (Gj) D 00 0 (Gj)A δn0,n = 0, j = 1, ..., g n n ,n − n ,n n n0 X   (B.24) from which it follows that

(d) (d) 0α A 0 A D 0 (Gj) = 0, j = 1, 2, ..., g (B.25) n − n n ,n   Without loss of generality, we can choose U in such a way that repeated eigenvalues of A(d) (d) (d) are grouped together along the diagonal. Then A 00 = An implies that n 6 0α 0†α 0α −1 Dn0,n(Gj) = Dn0,n(Gj) = Dn,n0 (Gj ) = 0, j = 1, 2, ..., g (B.26)

0α Thus Dn0,n(Gj) can only have non-zero elements in the blocks that correspond to repeated eigenvalues of A(d) and it would therefore be reducible unless all of the eigenvalues are equal, which would contradict the original assumption of irreducibility. This proves Schur’s lemma.

Having demonstrated the validity of Schur’s lemma, we are now in a position to prove the great orthogonality relation. To do so we define the matrix M by the relationship

g α β −1 M D (Gj)XD (G ) (B.27) ≡ j j=1 X 246 LIVES IN MATHEMATICS where X is an arbitrary matrix of appropriate dimensions to make matrix multiplication α β possible and where D (Gj) and D (Gj) are unitary irreducible representations of the finite group. Then

g α β −1 α α β −1 β −1 D (Gi)MD (Gi ) = D (Gi)D (Gj)XD (Gj )D (Gi ) j=1 Xg α β −1 = D (Gk)XD (Gk ) = M (B.28) k X=1 where GiGj = Gk and where we have used the fact that each group element appears once and only once in every row of the multiplication table to replace the sum over j by a sum β over k. Multiplying (B.28) from the right by D (Gi) we obtain:

α β D (Gi)M = MD (Gi) i = 1, 2, ..., g (B.29)

Then, according to Schur’s lemma, M must be a multiple of the unit matrix. It may of course be a square matrix consisting entirely of zeros, since such a matrix is also a multiple of the unit matrix. Multiplying (B.29) from the left by M −1 we obtain:

−1 α β M D (Gi)M = D (Gi) i = 1, 2, ..., g (B.30) from which we can see that if M is not the null matrix, then the irreducible representations α β D (Gi) and D (Gi) must be the same, i.e., if M is not the null matrix, α = β. Let us first consider the case where M is the null matrix and where α = β. Then putting indices into (B.27) we have: 6

g dα dβ α β −1 Dn0,n(Gj)Xn,m0 Dm0,m(Gj ) = 0 (B.31) j=1 n=1 m0 X X X=1

But Xn,m0 is arbitrary, and therefore (B.31) can only hold for all cases if

g α β −1 Dn0,n(Gj)Dm0,m(Gj ) = 0 (B.32) j=1 X Now let us consider the second possibility: Suppose that α = β. Then

g β β −1 δα,βM = D (Gj)XD (Gj ) (B.33) j=1 X Putting indices into (B.33) we have

g dβ dβ α β −1 δα,βMn0,m = Dn0,n(Gj)Xn,m0 Dm0,m(Gj ) (B.34) j=1 n=1 m0 X X X=1 B.5. THE GREAT ORTHOGONALITY THEOREM 247

Taking the trace of both sides of (B.34) yields

g dβ dβ dβ β β −1 δα,βtr(M) = Dm,n(Gj)Xn,m0 Dm0,m(Gj ) j=1 m=1 n=1 m0 X X X X=1 g dβ dβ

= δn,m0 Xn,m0 = gtr(X) (B.35) j=1 n=1 m0 X X X=1 so that

g dβ dβ g α β −1 Iδα,β trX = Dn0,n(Gj)Xn,m0 Dm0,m(Gj ) (B.36) dα j=1 n=1 m0 X X X=1 where I is the identity matrix. Because X is arbitrary, this relationship can only hold in all cases if (B.16) is valid. The great orthogonality relation is very central, and almost all of the results of group theory depend upon it. For example, combining (B.16) with the definition of characters (B.14), we obtain: g g α∗ β α∗ β χ (Gj)χ (Gj) D (Gj) D (Gj) ≡ n,n m,m j=1 j=1 ( n )( m ) X Xg X X = δ δ δ = gδ (B.37) d α,β n,m n,m α,β α n m X X Equation (B.37) holds only for unitary representations, but every representation is equiva- lent to a unitary representation since it is always possible to perform a similarity transfor- mation that orthonormalizes the basis functions. Therefore, since characters are invariant under similarity transformations, the orthonormality of characters g 1 α∗ β 0 if the representations are inequivlent χ (Gj)χ (Gj) = α,β g ≡ 1 if the representations are equivlent j=1 X  (B.38) holds even for non-unitary irreducible representations. Now consider a representation Dn0,n(Gj) which may be reducible. If we reduce it by means of a similarity transformation, then in its reduced form it will be block-diagonal, each block being irreducible. Taking the trace, we find that the character of an element 0 in the reduced representation Dn0,n(Gj) is the sum of the characters of the irreducible representations of which it is composed. Thus d 0 χ(Gj) D (Gj) ≡ n,n n=1 X1 2 = χ (Gj) + χ (Gj) + ... β = nβχ (Gj) (B.39) β X 248 LIVES IN MATHEMATICS

β where nβ is the number of times that the irreducible representation D occurs among the diagonal blocks of D0. Then from (B.38) we have

g g 1 χα∗(G )χ(G ) = n χα∗(G )χβ(G ) g j j β j j j=1 β j=1 X X X = nβα,β = nα (B.40) β X This gives us a way to find out how many times a particular irreducible representation Dα occurs in a reducible representation D. According to (B.40), we just have to take the scalar product of the characters and divide by the order of the group. When we say that Dα ‘occurs’ nα times in D, we mean that it is possible by means of a similarity transformation α to bring D into block-diagonal form where D occurs nα times along the diagonal blocks. The relationship is sometimes written in the form

1 2 D = n1D + n1D + ... (B.41)

Obviously in this decomposition we do not need to distinguish between different equivalent forms of an irreducible representation Dα, since all of them have the same character, and it is possible to go from one to another by means of a similarity transformation.

B.6 Classes

Two elements of a group Gi and Gj are said to be in the same ‘class’ if there exists another element Gl in the group such that

−1 Gi = Gl GjGl (B.42)

Thus, if we start with a particular element Gj, we can generate the set of elements in the same class by keeping j fixed in (B.42) and letting Gl run through all the elements of the group. It also follows from (B.42) that we can construct an operator Mk which commutes with all the elements of the group by summing the elements of a particular class:

Mk Gj (B.43) ≡ k classX Then for an arbitrary group element Gl we have

−1 −1 Gl [Mk,Gl] = Gl [Gj,Gl] k classX −1 = G GjGl Gj l − class k X  = (Gi Gj) = 0 (B.44) − k classX B.7. PROJECTION OPERATORS 249

Equation (B.44) can hold only if [Mk,Gl] = 0. An operator, such as Mk, which commutes with every element of the group is called an ‘invariant’. If there are r classes in a group, there will be r linearly independent invariants that can be constructed in this way. For any representation of two elements Gi and Gj in the same class, it follows from (B.42) that

−1 −1 D(Gi) = D(Gl )D(Gj)D(Gl) = D(Gl) D(Gj)D(Gl) (B.45)

Thus if D(Gi) and D(Gi) represent two elements in the same class, they are connected by a similarity transformation, and therefore they have the same character. In other words, all elements in the same class have the same character. This means that in applying equation (B.40) we do not need to go through quite so much work. Instead of summing over all of the elements in the group, we can take the product of characters for a representative element in each class, multiply by the number of elements in the class, and then sum over the classes. If gk represents the number of elements in the class k, then the orthogonality relation for characters, equation (B.38), can be written in the form

r g g k χα∗(G ) k χβ(G ) = δ (B.46) g k j g k j α,β k X=1 r r α where χk is the character of a representative element in class k.

B.7 Projection operators

The great orthogonality theorem, equation (B.16), can be used to construct group-theoretical 1 1 1 2 2 2 projection operators. Suppose that the sets of functions (Φ1, Φ2, ..., Φd1 ), (Φ1, Φ2, ...Φd2 ), etc. each form the basis for an irreducible representation of a group, and that there are r0 nonequivalent irreducible representations. Then

dβ β β β GjΦn = Φn0 Dn0,n(Gj) (B.47) n0 X=1 Then from (B.16) we have

g dβ g α∗ β β α∗ β Dm,m(Gj)GjΦn = Φn0 Dm,m(Gj)Dn0,n(Gj) j=1 n0 j=1 X X=1 X dβ g β = δα,β Φn0 δm,n0 δm,n dα n0 X=1 g β = δα,β Φmδm,n (B.48) dα 250 LIVES IN MATHEMATICS

From (B.48) it follows that if we let

g α dα α∗ P D (Gj)Gj (B.49) m ≡ g m,m j=1 X then

α β β PmΦn = δα,βδm,nΦm (B.50)

α In other words, when the operator Pm defined by equation (B.49) acts on any function in 1 1 1 2 2 2 the set (Φ1, Φ2, ..., Φd1 ), (Φ1, Φ2, ...Φd2 ),..., the function is given back unchanged, provided α that m = n and α = β. Otherwise the function is annihilated. Thus, Pm is a projection operator corresponding to the mth basis function of the αth irreducible representation of α the group in a standard unitary representation. If Pm acts on an arbitrary function, it will annihilate all of it except the component that transforms like the mth basis function of Dα. A second type of group-theoretical projection operator can be defined by the relation- ship

dα g dα α α dα α∗ P P = D (Gj)Gj (B.51) ≡ m g m,m m=1 j=1 m=1 X X X which can be rewritten as g α dα α∗ P χ (Gj)Gj (B.52) ≡ g j=1 X From (B.50) it follows that

dα dα α β α β β β P Φn = PmΦn = δα,β δm,nΦm = δα,βΦn (B.53) m=1 m=1 X X When P α acts on an arbitrary function, it annihilates everything except the component which can be expressed as a linear combination of basis functions of the irreducible repre- sentation Dα. If we sum (B.53) over all of the irreducible representations of the group, we obtain

r0 r0 α β β β P Φn = δα,βΦn = Φn (B.54) α=1 α=1 X X Therefore the sum acts like the identity operator and we can write

r0 P α = E (B.55) α=1 X B.8. THE REGULAR REPRESENTATION 251

Combining (B.55) with (B.52), we obtain

g r0 dα α∗ χ (Gj)Gj = E G (B.56) g ≡ 1 j=1 α=1 X X Since the group elements G1, ..., Gg are linearly independent, equation (B.55) implies that

r0 d α χα∗(G ) = δ (B.57) g j j,1 α=1 X The character of the identity element in any representation is equal to the dimension of that representation:

α∗ α χ (E) = χ (E) = dα (B.58) Therefore, when j = 1, we obtain from (B.57) the relationship

r0 2 dα = g (B.59) α=1 X i.e., the sum of the squares of the dimensions of the irreducible representations is equal to the order of the group.

B.8 The regular representation

The ‘regular representation’ of a finite group is a reducible representation Dreg in which the basis consists of the group elements themselves: g reg GjGn = Gn0 Dn0,n(Gj) (B.60) n0 X=1 Dreg must thus be a set of g g g matrices. If we know the multiplication table for a finite group, we can construct the regular× representation. For example, the multiplication table for the group C3 is shown above. It can easily be verified that if we let 1 0 0 Dreg(E) = 0 1 0   0 0 1  0 0 1  Dreg(C ) = 1 0 0 (B.61) 3   0 1 0  0 1 0  Dreg(C−1) = 0 0 1 3   1 0 0   252 LIVES IN MATHEMATICS

then the matrices will be the regular representation of the group C3 according to the definition shown in (B.60) and the multiplication table (A.1). Since GiGj = Gj for Gi = E, it follows that the character of every group element except the identity6 element vanishes6 in the regular representation. (We can notice that this holds in the example given above.) Therefore in the case of the regular representation, equation (B.40) becomes:

g 1 1 n = χα∗(G )χreg(G ) = χα∗(E)χreg(E) = d (B.62) α g j j g α j=1 X Thus each irreducible representation of a finite group appears dα times in the regular representation. When each element of a group commutes with every other one, a group is said to be Abelian. Then from the definition of classes, (B.42), it follows that in an Abelian group, every element is in a class by itself, so that an Abelian group contains g classes, i.e. r = g. We can next ask how many non-equivalent irreducible representations an Abelian group contains. To answer this question, we remember from Schur’s lemma that the only matrix that commutes with every matrix in an irreducible representation of a group must be a multiple of the unit matrix. But in an Abelian group, all of the elements commute with each other, and therefore their irreducible representations must all be multiples of the unit matrix. This can happen only if all the irreducible representations are 1-dimensional. 0 0 Thus for an Abelian group, dα = 1, α = 1, 2, ..., r and r = g. It can be seen from the multiplication table of the group C3 that it is Abelian. In the example of C3, (B.59) becomes 1 + 1 + 1 = 3.

B.9 Classification of basis functions

We can us the group-theoretical projection operators to classify basis sets into basis func- tions for the various irreducible representations of a group. For example, we can construct the projection operators of the group C3 from the character table: 1 P 1 = E + C + C−1 3 3 3 1 P 2 = E + e−i2π/3C + ei2π/3C−1 (B.63) 3 3 3 1 P 3 = E + ei2π/3C + e−i2π/3C−1 3 3 3  Since the group C3 is Abelian, all of its irreducible representations are 1-dimensional, and hence there is no difference between projection operators of the type P α and those of α 1 2 3 the type Pn . Notice that P + P + P = E in accordance with (B.55), and that the α β α projection operators are idempotent, i.e., P P = δα,βP . All projection operators must be idempotent, since projecting out a subspace of a Hilbert space twice has the same effect as doing it once, and acting in succession with projection operators corresponding to different subspaces must yield zero. B.9. CLASSIFICATION OF BASIS FUNCTIONS 253

imϕ Now consider the set of functions Φm = e where m is an integer. We can use the projection operators of (B.63) to split the Hilbert space spanned by this set of functions into three subspaces. Using the relationships

Eeimϕ = eimϕ imϕ im(ϕ−2π/3) C3e = e (B.64) −1 imϕ im(ϕ+2π/3) C3 e = e we obtain 1 P 1eimϕ = eimϕ 1 + e−im2π/3 + eim2π/3 3 0 if m = 1, 2, 4, 5, ... = (B.65) eimϕ if m±= 0±, 3±, 6±, 9, ...  ± ± ± and similarly 0 if m + 1 = 1, 2, 4, 5, ... P 2eimϕ = eimϕ if m + 1± = 0±, 3±, 6±, 9, ...  ± ± ± 0 if m 1 = 1, 2, 4, 5, ... P 3eimϕ = (B.66) eimϕ if −m 1± = 0±, 3±, 6±, 9, ...  − ± ± ± imϕ Thus the Hilbert space spanned by the functions Φm = e is divided into three subspaces each of which consists of basis functions for one of the irreducible representations of C3. For non-Abelian groups the Hilbert space spanned by a set of basis functions can be divided α into still smaller subspaces through the use of projection operators of the type Pn defined in equation (B.49). If we wish to have names for the the two types of projection operators, α α α α we might call Pm ‘strong’ and P ‘weak’, since Pn has a stronger effect than P . Now suppose that we have divided the Hilbert space spanned by a set of basis functions α into small subspaces by means of the strong projection operators Pn , so that

α Pn Φj = pjΦj pj = 0 or 1 (B.67) We will now show that if an operator T commutes with every element of the group, then the matrix elements of T linking functions belonging to different subspaces must necessarily vanish. The proof is as follows: Since T commutes with every element of the group, and since the projection operators are constructed from group elements, we have

α [Pn ,T ] = 0 (B.68) Then

α Φj [P ,T ] Φk = (pj pk) Φj T Φk = 0 (B.69) h | n | i − h | | i

Thus if Φj and Φk belong to different subspaces when the basis set is classified by the α action of the projection operators P , i.e., if pj = pk, then Φj T Φk = 0. It follows that n 6 h | | i 254 LIVES IN MATHEMATICS

a matrix representation of the operator T will be block-diagonal if it is based on functions α that have been classified by means of the projection operators Pn , i.e. if it is based on a set of functions that satisfy (B.67). Such a basis set is said to be ‘symmetry-adapted’. We can introduce a special notation to represent fully symmetry-adapted basis func- α,n 2 tions. Let ηj be such a function. By this we indicate that the function transforms under the action| i of the group elements like nth basis function of the αth standard irre- ducible representation of the group, while the index j distinguishes between the various linearly independent functions that have this property. With this notation we can write:

α β,m β,m P η = δα,βδn,m η (B.70) n | j i | j i Using this notation, the statement that a matrix representation of the operator T based on symmetry-adapted functions will be block-diagonal can be written in the form:

α,n β,m α,n β,m η T η = δα,βδn,m η T η (B.71) h i | | j i h i | | j i The eigenvalues and eigenfunctions of T can also be expressed in this notation: T Ψα,m = λα,m Ψα,m (B.72) | κ i κ | κ i where

α,m α,m Ψκ = ηj Cj,κ (B.73) | i j | i X In other words, a set of functions all of which transform like the nth basis function of the αth irreducible representation of a group combine to form an eigenfunction of an operator T that commutes with all of the group elements. α,n We will now try to find a relationship between the degeneracy of the root λκ and α the dimension dα of the irreducible representation D . To do this, we introduce the ‘shift operator’ g α dα α∗ 0 P 0 D 0 (Gj)Gj m = m (B.74) m ,m ≡ g m ,m 6 j=1 X Then by an argument similar to (B.48) we have

g α α,m dα α∗ α,m P 0 η = D 0 (Gj)Gj η m ,m| j i g m ,m | j i j=1 X dα g α,m00 dα α∗ α = η D 0 (Gj)D 00 (Gj) | j i g m ,m m ,m m00 j=1 X=1 X dα α,m00 α,m0 = η δm00,m0 = η (B.75) | j i | j i m00 X=1 2We also introduce the Dirac notation here, since it is useful in the discussion of matrix elements. B.9. CLASSIFICATION OF BASIS FUNCTIONS 255

α where we have made use of the great orthogonality relation (B.16). Since Pm0,m is a linear combination of group elements, it must commute with T :

α Pm0,m,T = 0 (B.76)

Therefore  

α,m0 α α,m α,m0 α,m α,m0 α α,m Ψ P 0 ,T Ψ = λ λ Ψ P 0 Ψ h κ | m ,m | κ i κ − κ h κ | m ,m| κ i  α,m0 α,m   = λ λ = 0 (B.77) κ − κ   α,1 α,dα so that the roots corresponding to the dα eigenfunctions Ψκ , ..., Ψκ must be degen- erate. Such a degeneracy is called a ‘due degeneracy’ because| i it is| duei to the symmetry properties of the system. If there are other degeneracies, they are termed ‘accidental’. 256 LIVES IN MATHEMATICS Appendix C

NON-EUCLIDIAN GEOMETRY

C.1 Albert Einstein and non-Euclidian geometry

Albert Einstein’s famous general theory of relativity has a central place in modern cosmol- ogy. We will seebelow how Einstein, with the help of his friend Marcel Grossman, game to realize that non-Euclidian geometry wasexactly what he needed to turn his principle of the equivalence of gravitational and inertial mass into a quantitative theory.

C.2 Family background

Albert Einstein was born in Ulm, Germany, in 1879. He was the son of middle-class, irreligious Jewish parents, who sent him to a Catholic school. Einstein was slow in learning to speak, and at first his parents feared that he might be retarded; but by the time he was eight, his grandfather could say in a letter: “Dear Albert has been back in school for a week. I just love that boy, because you cannot imagine how good and intelligent he has become.” Remembering his boyhood, Einstein himself later wrote: “When I was 12, a little book dealing with Euclidean plane geometry came into my hands at the beginning of the school year. Here were assertions, as for example the intersection of the altitudes of a triangle in one point, which, though by no means self-evident, could nevertheless be proved with such certainty that any doubt appeared to be out of the question. The lucidity and certainty made an indescribable impression on me.” When Albert Einstein was in his teens, the factory owned by his father and uncle began to encounter hard times. The two Einstein families moved to Italy, leaving Albert alone and miserable in Munich, where he was supposed to finish his course at the gymnasium. Einstein’s classmates had given him the nickname “Beidermeier”, which means something like “Honest John”; and his tactlessness in criticizing authority soon got him into trouble. In Einstein’s words, what happened next was the following: “When I was in the seventh grade at the Lutpold Gymnasium, I was summoned by my home-room teacher, who ex- pressed the wish that I leave the school. To my remark that I had done nothing wrong, he

257 258 LIVES IN MATHEMATICS replied only, ‘Your mere presence spoils the respect of the class for me’.” Einstein left gymnasium without graduating, and followed his parents to Italy, where he spent a joyous and carefree year. He also decided to change his citizenship. “The over-emphasized military mentality of the German State was alien to me, even as a boy”, Einstein wrote later. “When my father moved to Italy, he took steps, at my request, to have me released from German citizenship, because I wanted to be a Swiss citizen.”

The financial circumstances of the Einstein family were now precarious, and it was clear that Albert would have to think seriously about a practical career. In 1896, he entered the famous Z¨urich Polytechnic Institute with the intention of becoming a teacher of math- ematics and physics. However, his undisciplined and nonconformist attitudes again got him into trouble. His mathematics professor, Hermann Minkowski (1864-1909), considered Einstein to be a “lazy dog”; and his physics professor, Heinrich Weber, who originally had gone out of his way to help Einstein, said to him in anger and exasperation: “You’re a clever fellow, but you have one fault: You won’t let anyone tell you a thing! You won’t let anyone tell you a thing!” Einstein missed most of his classes, and read only the subjects which interested him. He was interested most of all in Maxwell’s theory of electro-magnetism, a subject which was too “modern” for Weber. There were two major examinations at the Z¨urich Polytechnic Institute, and Einstein would certainly have failed them had it not been for the help of his loyal friend, the mathematician Marcel Grossman. Grossman was an excellent and conscientious student, who attended every class and took meticulous notes. With the help of these notes, Einstein managed to pass his ex- aminations; but because he had alienated Weber and the other professors who could have helped him, he found himself completely unable to get a job. In a letter to Professor F. Ostwald on behalf of his son, Einstein’s father wrote: “My son is profoundly unhappy because of his present joblessness; and every day the idea becomes more firmly implanted in his mind that he is a failure, and will not be able to find the way back again.” From this painful situation, Einstein was rescued (again!) by his friend Marcel Gross- man, whose influential father obtained for Einstein a position at the Swiss Patent Office: Technical Expert (Third Class). Anchored at last in a safe, though humble, position, Ein- stein married one of his classmates. He learned to do his work at the Patent Office very efficiently; and he used the remainder of his time on his own calculations, hiding them guiltily in a drawer when footsteps approached. In 1905, this Technical Expert (Third Class) astonished the world of science with five papers, written within a few weeks of each other, and published in the Annalen der Physik. Of these five papers, three were classics: One of these was the paper in which Einstein ap- plied Planck’s quantum hypothesis to the photoelectric effect. The second paper discussed “Brownian motion”, the zig-zag motion of small particles suspended in a liquid and hit randomly by the molecules of the liquid. This paper supplied a direct proof of the validity of atomic ideas and of Boltzmann’s kinetic theory. The third paper was destined to estab- lish Einstein’s reputation as one of the greatest physicists of all time. It was entitled “On the Electrodynamics of Moving Bodies”, and in this paper, Albert Einstein formulated his C.3. SPECIAL RELATIVITY THEORY 259 special theory of relativity. Essentially, this theory maintained that all of the fundamental laws of nature exhibit a symmetry with respect to rotations in a 4-dimensional space-time continuum.

C.3 Special relativity theory

The theory of relativity grew out of problems connected with Maxwell’s electromagnetic theory of light. Ever since the wavelike nature of light had first been demonstrated, it had been supposed that there must be some medium to carry the light waves, just as there must be some medium (for example air) to carry sound waves. A word was even invented for the medium which was supposed to carry electromagnetic waves: It was called the “ether”. By analogy with sound, it was believed that the velocity of light would depend on the velocity of the observer relative to the “ether”. However, all attempts to measure differences in the velocity of light in different directions had failed, including an especially sensitive experiment which was performed in America in 1887 by A.A. Michelson and E.W. Morley. Even if the earth had, by a coincidence, been stationary with respect to the “ether” when Michelson and Morley first performed their experiment, they should have found an “ether wind” when they repeated their experiment half a year later, with the earth at the other side of its orbit. Strangely, the observed velocity of light seemed to be completely independent of the motion of the observer! In his famous 1905 paper on relativity, Einstein made the negative result of the Michelson- Morley experiment the basis of a far-reaching principle: He asserted that no experiment whatever can tell us whether we are at rest or whether we are in a state of uniform motion. With this assumption, the Michelson-Morley experiment of course had to fail, and the measured velocity of light had to be independent of the motion of the observer. Einstein’s Principle of Special Relativity had other extremely important consequences: He soon saw that if his principle were to hold, then Newtonian mechanics would have to be modified. In fact, Einstein’s Principle of Special Relativity required that all fundamental physical laws exhibit a symmetry between space and time. The three space dimensions, and a fourth dimension, ict, had to enter every fundamental physical law in a symmetrical way. (Here i is the square root of -1, c is the velocity of light, and t is time.) When this symmetry requirement is fulfilled, a physical law is said to be “Lorentz- invariant” (in honor of the Dutch physicist H.A. Lorentz, who anticipated some of Ein- stein’s ideas). Today, we would express Einstein’s principle by saying that every funda- mental physical law must be Lorentz-invariant (i.e. symmetrical in the space and time coordinates). The law will then be independent of the motion of the observer, provided that the observer is moving uniformly. Einstein was able to show that, when properly expressed, Maxwell’s equations are already Lorentz-invariant; but Newton’s equations of motion have to be modified. When the needed modifications are made, Einstein found, then the mass of a moving particle appears to increase as it is accelerated. A particle can never be accelerated to a velocity 260 LIVES IN MATHEMATICS greater than the velocity of light; it merely becomes heavier and heavier, the added energy being converted into mass. From his 1905 theory, Einstein deduced his famous formula equating the energy of a system to its mass multiplied by the square of the velocity of light. As we shall see, his formula was soon used to explain the source of the energy produced by decaying uranium and radium; and eventually it led to the construction of the atomic bomb. Thus Einstein, a lifelong pacifist, who renounced his German citizenship as a protest against militarism, became instrumental in the construction of the most destructive weapon ever invented - a weapon which casts an ominous shadow over the future of humankind. Just as Einstein was one of the first to take Planck’s quantum hypothesis seriously, so Planck was one of the first physicists to take Einstein’s relativity seriously. Another early enthusiast for relativity was Hermann Minkowski, Einstein’s former professor of mathe- matics. Although he once had characterized Einstein as a “lazy dog”, Minkowski now contributed importantly to the mathematical formalism of Einstein’s theory; and in 1907, he published the first book on relativity. In honor of Minkowski’s contributions to relativity, the 4-dimensional space-time continuum in which we live is sometimes called “Minkowski space”. In 1908, Minkowski began a lecture to the Eightieth Congress of German Scientists and Physicians with the following words: “ From now on, space by itself, and time by itself, are destined to sink completely into the shadows; and only a kind of union of both will retain an independent existence.” Gradually, the importance of Einstein’s work began to be realized, and he was much sought after. He was first made Assistant Professor at the University of Z¨urich, then full Professor in Prague, then Professor at the Z¨urich Polytechnic Institute; and finally, in 1913, Planck and Nernst persuaded Einstein to become Director of Scientific Research at the Kaiser Wilhelm Institute in Berlin. He was at this post when the First World War broke out While many other German intellectuals produced manifestos justifying Germany’s in- vasion of Belgium, Einstein dared to write and sign an anti-war manifesto. Einstein’s manifesto appealed for cooperation and understanding among the scholars of Europe for the sake of the future; and it proposed the eventual establishment of a League of Euro- peans. During the war, Einstein remained in Berlin, doing whatever he could for the cause of peace, burying himself unhappily in his work, and trying to forget the agony of Europe, whose civilization was dying in a rain of shells, machine-gun bullets, and poison gas.

C.4 General relativity

The work into which Einstein threw himself during this period was an extension of his theory of relativity. He already had modified Newton’s equations of motion so that they exhibited the space-time symmetry required by his Principle of Special Relativity. However, Newton’s law of gravitation. remained a problem. Obviously it had to be modified, since it disagreed with his Special Theory of Relativity; C.4. GENERAL RELATIVITY 261 but how should it be changed? What principles could Einstein use in his search for a more correct law of gravitation? Certainly whatever new law he found would have to give results very close to Newton’s law, since Newton’s theory could predict the motions of the planets with almost perfect accuracy. This was the deep problem with which he struggled. In 1907, Einstein had found one of the principles which was to guide him, the Principle of Equivalence of inertial and gravitational mass. After turning Newton’s theory over and over in his mind, Einstein realized that Newton had used mass in two distinct ways: His laws of motion stated that the force acting on a body is equal to the mass of the body multiplied by its acceleration; but according to Newton, the gravitational force on a body is also proportional to its mass. In Newton’s theory, gravitational mass, by a coincidence, is equal to inertial mass; and this holds for all bodies. Einstein decided to construct a theory in which gravitational and inertial mass necessarily have to be the same. He then imagined an experimenter inside a box, unable to see anything outside it. If the box is on the surface of the earth, the person inside it will feel the pull of the earth’s gravitational field. If the experimenter drops an object, it will fall to the floor with an acceleration of 32 feet per second per second. Now suppose that the box is taken out into empty space, far away from strong gravitational fields, and accelerated by exactly 32 feet per second per second. Will the enclosed experimenter be able to tell the difference between these two situations? Certainly no difference can be detected by dropping an object, since in the accelerated box, the object will fall to the floor in exactly the same way as before. With this “thought experiment” in mind, Einstein formulated a general Principle of Equivalence: He asserted that no experiment whatever can tell an observer enclosed in a small box whether the box is being accelerated, or whether it is in a gravitational field. According to this principle, gravitation and acceleration are locally equivalent, or, to say the same thing in different words, gravitational mass and inertial mass are equivalent. Einstein soon realized that his Principle of Equivalence implied that a ray of light must be bent by a gravitational field. This conclusion followed because, to an observer in an accelerated frame, a light beam which would appear straight to a stationary observer, must necessarily appear very slightly curved. If the Principle of Equivalence held, then the same slight bending of the light ray would be observed by an experimenter in a stationary frame in a gravitational field. Another consequence of the Principle of Equivalence was that a light wave propagating upwards in a gravitational field should be very slightly shifted to the red. This followed because in an accelerated frame, the wave crests would be slightly farther apart than they normally would be, and the same must then be true for a stationary frame in a gravitational field. It seemed to Einstein that it ought to be possible to test experimentally both the gravitational bending of a light ray and the gravitational red shift. This seemed promising; but how was Einstein to proceed from the Principle of Equiva- lence to a formulation of the law of gravitation? Perhaps the theory ought to be modeled after Maxwell’s electromagnetic theory, which was a field theory, rather than an “action at a distance” theory. Part of the trouble with Newton’s law of gravitation was that it allowed a signal to be propagated instantaneously, contrary to the Principle of Special Relativity. A field theory of gravitation might cure this defect, but how was Einstein to find such a 262 LIVES IN MATHEMATICS theory? There seemed to be no way. From these troubles Albert Einstein was rescued (a third time!) by his staunch friend Marcel Grossman. By this time, Grossman had become a professor of mathematics in Z¨urich, after having written a doctoral dissertation on tensor analysis and non-Euclidean geometry, the very things that Einstein needed. The year was then 1912, and Einstein had just returned to Z¨urich as Professor of Physics at the Polytechnic Institute. For two years, Einstein and Grossman worked together; and by the time Einstein left for Berlin in 1914, the way was clear. With Grossman’s help, Einstein saw that the gravitational field could be expressed as a curvature of the 4-dimensional space-time continuum. In 1919, a British expedition, headed by Sir Arthur Eddington, sailed to a small island off the coast of West Africa. Their purpose was to test Einstein’s prediction of the bending of light in a gravitational field by observing stars close to the sun during a total eclipse. The observed bending agreed exactly with Einstein’s predictions; and as a result he became world-famous. The general public was fascinated by relativity, in spite of the abstruseness of the theory (or perhaps because of it). Einstein, the absent-minded professor, with long, uncombed hair, became a symbol of science. The world was tired of war, and wanted something else to think about. Einstein met President Harding, Winston Churchill and Charlie Chaplin; and he was invited to lunch by the Archbishop of Canterbury. Although adulated elsewhere, he was soon attacked in Germany. Many Germans, looking for an excuse for the defeat of their nation, blamed it on the pacifists and Jews; and Einstein was both these things.

C.5 Metric tensors

Let us consider a coordinate system x1, x2, , xd labelling the points in a d-dimensional space. We can label the points in a different··· way by going to a new coordinate system X1,X2, ,Xd where the new coordinates are expressed as functions of the old ones. ··· X1 = X1(x1, x2, , xd) ··· X2 = X2(x1, x2, , xd) . . . ··· . . . Xd = Xd(x1, x2, , xd) (C.1) ··· For example, (C.1) might represent a transformation from Cartesian coordinates to spher- ical polar coordinates. If we have an equation written in terms of the old coordinates, we might ask how to rewrite it in terms of the new ones. More generally, we can try to write a physical equation in such a way that it will look the same in every coordinate system. Suppose that the space is Euclidean (flat), so that in terms of the Cartesian coordinates x1, x2, , xd, the infinitesimal element of length separating two points is given by the Pythagorean··· rule:

2 i j i j ds = δi,jdx dx gi,jdx dx (C.2) ≡ C.5. METRIC TENSORS 263

(In equation (C.2) and in the remainder of this section, we use the Einstein convention, in which a sum over repeated indices is understood, although not written explicitly.) The 2 symbol gi,j which appears in the definition of the infinitesimal length ds is called the covariant metric tensor, and for Cartesian coordinates in a Euclidean space, it is just the Kronecker delta function. Using the identity

∂xi dxi = dXµ (C.3) ∂Xµ we can rewrite (C.2) as

i j 2 ∂x ∂x µ ν µ ν ds = δi,j dX dX Gµ,νdX dX (C.4) ∂Xµ ∂Xν ≡ where ∂xi ∂xj Gµ,ν gi,j (C.5) ≡ ∂Xµ ∂Xν

The quantity Gµ,ν which appears in equations (C.4) and (C.5) is the covariant metric tensor in the new coordinate system. In any space, whether Euclidean or not, the covariant metric tensor is defined by the expression which yields ds2, the square of the infinitesimal distance between two points, as in equation (C.2) or (C.4). The word tensor refers to the way in which a quantity transforms under changes in the coordinate system. The rank of a tensor is the number of indices. The covariant metric tensor is the prototype of a covariant tensor of second rank. Any physical quantity which must be transformed according to the rule

∂xi ∂xj A = a (C.6) µ,ν i,j ∂Xµ ∂Xν under the coordinate transformation x1, x2, , xd X1,X2, ,Xd is said to be a co- variant tensor of second rank. The d-component··· entity→ ··· ∂Xµ dXµ = dxi (C.7) ∂xi is the prototype of a contravariant tensor of first rank. Any quantity that transforms according to the rule ∂Xµ Aµ = ai (C.8) ∂xi is said to be a contravariant tensor of first rank (or contravariant vector). The distance element ds is the prototype of an invariant or scalar. Any quantity φ which is invariant under coordinate transformations is said to be a scalar. The gradient of a scalar

∂φ ∂xi ∂φ = (C.9) ∂Xµ ∂Xµ ∂xi 264 LIVES IN MATHEMATICS

is the prototype of a covariant tensor of first rank, or covariant vector. Any quantity which transforms according to the rule ∂xi A = a (C.10) µ ∂Xµ i is said to be a covariant vector. We can also define tensors of higher rank. For example, ∂Xµ ∂Xν ∂Xσ Aµνσ = aijk (C.11) ∂xi ∂xj ∂xk is said to be a contravariant tensor of third rank. A covariant vector and a contravariant vector can be contracted into a scalar: ∂xi ∂Xµ A Bµ = a bj = δi a bj = a bi (C.12) µ ∂Xµ ∂xj i j i i Similarly, if we contract a contravariant vector with the covariant metric tensor, we obtain a covariant vector:

ν GµνA = Aµ i gija = ai (C.13)

It is useful to define a quantity called the contravariant metric tensor, which gives the Kronecker δ-function when it is contracted with the covariant metric tensor:

µν µ G Gνσ = δσ ij i g gjk = δk ∂Xµ ∂Xν Gµν = gij (C.14) ∂xi ∂xj If we contract a covariant vector with the contravariant metric tensor, we obtain a con- travariant vector:

µν µ G Aν = A (C.15)

In a similar way, we can raise or lower the indices of a tensor of higher rank. For example, it is easy to show that

νσρ σρ GµνA = Aµ (C.16) In a Cartesian coordinate system with unit metric we are accustomed to writing the volume element as

dv = dx1dx2 dxd (C.17) ··· This is obviously unsatisfactory from the standpoint of tensor analysis, since the right- hand side of equation (C.17) appears to be a contravariant tensor of rank d (or rather a C.5. METRIC TENSORS 265

particular component of such a tensor), while the left-hand side has no indices at all. In order to write the volume element in an invariant way, the Italian mathematician Tulio Levi-Civita (1873-1941) introduced a totally antisymmetric covariant tensor of rank d. In a Cartesian coordinate system, for a flat space, the Levi-Civita tensor is given by

( 1)σ if ijkl = σ(1234 ) − ··· ··· e = (C.18) ijkl···   0 otherwise In other words, the Levi-Civita tensor is 1 if ijkl is a permutation of 1234 , with the sign depending on whether the permutation± is even··· or odd, and it is zero otherwise.··· In terms of this tensor, the volume element of equation (C.17) becomes

1 i j k l dv = eijkl···dx dx dx dx (C.19) d! ··· while in a transformed coordinate system it is

1 µ ν σ ρ dV = Eµνσρ···dX dX dX dX (C.20) d! ··· where ∂xi ∂xj ∂xk Eµνσ··· = eijk··· (C.21) ∂Xµ ∂Xν ∂Xσ ··· In this way, Levi-Civita used the formalism of tensor calculus to re-derive the previous result of the German mathematician Carl Gustav Jacobi (1804-1851), who had shown that in a curvilinear coordinate system, the volume element is given by

∂xi dV = dX1dX2 dXd (C.22) ∂Xµ ···

i µ where ∂x /∂X is the determinant of the d d square matrix of transformation coefficients from Cartesian| | coordinates to curvilinear× coordinates. This determinant is called the Jacobian of the transformation. From the relationship

∂xi ∂xj G = δ (C.23) µ,ν ∂Xµ ij ∂Xν one can show that the Jacobian ∂xi Gµν G (C.24) | | ≡ ∂Xµ ≡ | |

q p

is the square root of the determinant of the covariant metric tensor. The Jacobian is usually represented by the symbol G . Levi-Civita’s book Absolute Differential Calculus has been translated into many languages.| | It is still in print, and it remains one of the p 266 LIVES IN MATHEMATICS

best textbooks in the field, along with Schr¨odinger’s Space-Time Structure, Brillouin’s Les Tenseurs and Landau and Lifshitz’s The Classical Theory of Fields. The Jacobian, G , is the prototype of a scalar density. We can construct tensor densities by multiplying| | tensors by the Jacobian appropriate for the coordinate system. p When a tensor density is transformed to another coordinate system, the Jacobian has to be recalculated from the transformed covariant metric tensor. Tensor capacities can be constructed by dividing tensors by the Jacobian. Now consider a scalar function ψ. Its gradient is a covariant vector, and therefore

∂ψ ∂ψ Gµν = scalar (C.25) ∂Xµ ∂Xν It follows that if we let

∂ψ ∂ψ = G Gµν + κψ2 (C.26) L | | ∂Xµ ∂Xν   p where κ is a constant, then the variational principle

δ dX1dX2 dXd = 0 (C.27) ··· L ··· ZZ Z will be invariant under a curvilinear coordinate transformation. As we saw above, the Euler-Lagrange equations that follow from this variational principle are

∂ ∂ ∂ L L = 0 (C.28) ∂Xµ ∂ (∂ψ/∂Xµ) − ∂ψ

With the Lagrangian density of equation (C.26), this becomes

1 ∂ ∂ψ G Gµν = κ ψ (C.29) G ∂Xµ | | ∂Xν | | p p C.6 The Laplace-Beltrami operator

The operator

d d 1 ∂ ∂ ∆ = G Gµν (C.30) ∂Xµ | | ∂Xν µ=1 ν=1 G X X | | p p is the generalized Laplacian operator, which plays such an important role in the theory of hyperspherical harmonics, but here it is written in a form due to Eugenio Beltrami (1835- 1899), which is invariant under coordinate transformations. (In equation (C.29), we have C.6. THE LAPLACE-BELTRAMI OPERATOR 267 abandoned the Einstein convention, and have re-introduced explicit sums.) To illustrate this equation, let us consider some examples. In a d-dimensional space, we can let

1 x = r sin θ1 sin θ2... sin θd−2 cos θd−1 2 x = r sin θ1 sin θ2... sin θd−2 sin θd−1 3 x = r sin θ1 sin θ2... cos θd−2 ...... d−1 x = r sin θ1 cos θ2 d x = r cos θ1 (C.31) while

X1 = r 2 X = θ1 3 X = θ2 ...... d−1 X = θd−2 d X = θd−1 (C.32)

Then the Jacobians for various values of d are

d = 3 G = r2 sin θ | | 1 d = 4 G = r3 sin2 θ sin θ p| | 1 2 d = 5 G = r4 sin3 θ sin2 θ sin θ p| | 1 2 3 5 4 3 2 d = 6 p G = r sin θ1 sin θ2 sin θ3 sin θ4 | |. . . p . . . d−1 d−2 d−3 2 d = d G = r sin θ sin θ ... sin θd− sin θd− (C.33) | | 1 2 3 2 p The covariant metric tensor for d = 3 is

1 0 0 2 Gµ,ν = 0 r 0 (C.34)  2 2  0 0 r sin θ1   while for d = 4

1 0 0 0 0 r2 0 0 Gµ,ν =  2 2  (C.35) 0 0 r sin θ1 0  0 0 0 r2 sin2 θ sin2 θ   1 2    268 LIVES IN MATHEMATICS for d = 5 1 0 0 0 0 0 r2 0 0 0  2 2  Gµ,ν = 0 0 r sin θ1 0 0 (C.36)  0 0 0 r2 sin2 θ sin2 θ 0   1 2   0 0 0 0 r2 sin2 θ sin2 θ sin2 θ   1 2 3    The contravariant metric tensors are just their reciprocals. 1 0 0 1 µ,ν 0 0 G =  r2  (C.37) 1  0 0   2 2   r sin θ1    1 0 0 0 1 0 0 0  r2  µ,ν 1 G = 0 0 0 (C.38)  2 2   r sin θ1   1   0 0 0   2 2 2   r sin θ1 sin θ2    1 0 0 0 0 1 0 0 0 0  r2  1 0 0 0 0 µ,ν  2 2  G =  r sin θ1  (C.39)  1   0 0 0 0   2 2 2   r sin θ1 sin θ2   1   0 0 0 0   2 2 2 2   r sin θ1 sin θ2 sin θ3    and so on. Combining these results, we obtain the Laplace-Beltrami operators: ∂ Gµ,ν ∂Xν ν X ∂ 1 ∂ 1 ∂ 1 ∂ = , , , , ... (C.40) ∂r r2 ∂θ r2 sin2 θ ∂θ r2 sin2 θ sin2 θ ∂θ  1 1 2 1 2 3  For d = 3,

3 ∂ G Gµ,ν | | ∂Xν ν=1 X p ∂ 1 ∂ 1 ∂ = r2 sin θ , , (C.41) 1 ∂r r2 ∂θ r2 sin2 θ ∂θ  1 1 2  C.6. THE LAPLACE-BELTRAMI OPERATOR 269

3 3 1 ∂ ∂ G Gµ,ν ∂Xµ | | ∂Xν µ=1 ν=1 G X X | | p 1 ∂ p∂ 1 ∂ ∂ 1 ∂2 = r2 + sin θ + (C.42) 2 2 1 2 2 2 r ∂r ∂r r sin θ1 ∂θ1 ∂θ1 r sin θ1 ∂θ2

For d = 4,

4 ∂ G Gµ,ν | | ∂Xν ν=1 X p ∂ 1 ∂ 1 ∂ 1 ∂ = r3 sin2 θ sin θ , , , 1 2 ∂r r2 ∂θ r2 sin2 θ ∂θ r2 sin2 θ sin2 θ ∂θ  1 1 2 1 2 3  (C.43)

4 4 1 ∂ ∂ G Gµ,ν ∂Xµ | | ∂Xν µ=1 ν=1 G X X | | p 1 ∂ ∂ 1 ∂ ∂ = r3 p + sin2 θ 3 2 2 1 r ∂r ∂r r sin θ1 ∂θ1 ∂θ1 1 ∂ ∂ 1 ∂2 + sin θ + (C.44) 2 2 2 2 2 2 2 r sin θ1 sin θ2 ∂θ2 ∂θ2 r sin θ1 sin θ2 ∂θ3

For d = 5,

5 ∂ G Gµ,ν | | ∂Xν ν=1 X4 p3 2 = r sin θ1 sin θ2 sin θ3 ∂ 1 ∂ 1 ∂ 1 ∂ 1 ∂ , , , , × ∂r r2 ∂θ r2 sin2 θ ∂θ r2 sin2 θ sin2 θ ∂θ r2 sin2 θ sin2 θ sin2 θ ∂θ  1 1 2 1 2 3 1 2 3 4  (C.45)

5 5 1 ∂ ∂ G Gµ,ν ∂Xµ | | ∂Xν µ=1 ν=1 G X X | | p 1 ∂ ∂ 1 ∂ ∂ 1 ∂ ∂ = r4 p + sin3 θ + sin2 θ 4 2 3 1 2 2 2 2 r ∂r ∂r r sin θ1 ∂θ1 ∂θ1 r sin θ1 sin θ2 ∂θ2 ∂θ2 1 ∂ ∂ 1 ∂2 + sin θ + (C.46) 2 2 2 3 2 2 2 2 2 r sin θ1 sin θ2 sin θ3 ∂θ3 ∂θ3 r sin θ1 sin θ2 sin θ3 ∂θ4 270 LIVES IN MATHEMATICS

For general values of d, d ∂ G Gµ,ν | | ∂Xν ν=1 Xd−1p d−2 d−3 = r sin θ sin θ sin θd− 1 2 ··· 2 ∂ 1 ∂ 1 ∂ 1 ∂ , , , , 2 2 2 2 2 2 2 × ∂r r ∂θ1 r sin θ ∂θ2 ··· r sin θ sin θ sin θd− ∂θd−1  1 1 2 ··· 2  (C.47)

d d 1 ∂ ∂ G Gµ,ν ∂Xµ | | ∂Xν µ=1 ν=1 G X X | | p 1 ∂ ∂ 1 ∂ ∂ = rpd−1 + sind−2 θ d−1 2 d−2 1 r ∂r ∂r r sin θ1 ∂θ1 ∂θ1 1 ∂ ∂ + sind−3 θ + 2 2 d−3 2 r sin θ1 sin θ2 ∂θ2 ∂θ2 ··· 1 ∂ ∂ + sin θ 2 2 2 2 d−2 r sin θ sin θ sin θ sin θd− ∂θd−2 ∂θd−2 1 2 3 ··· 2 1 ∂2 + (C.48) 2 2 2 2 2 r sin θ sin θ sin θd− ∂θ 1 2 ··· 2 d−1 As we saw in equation (??), the Laplace-Beltrami operator in hyperspherical coordinates can be written as 1 ∂ ∂ Λ2 ∆ = rd−1 (C.49) rd−1 ∂r ∂r − r2 where r is the hyperradius and where Λ2 is the generalized angular momentum operator. Comparing this with the results that we have just been discussing, we can see that for d = 3, 2 2 1 ∂ ∂ 1 ∂ Λ = sin θ1 + 2 2 (C.50) − sin θ1 ∂θ1 ∂θ1 sin θ1 ∂θ2 while for d = 4,

2 1 ∂ 2 ∂ Λ = 2 sin θ1 − sin θ1 ∂θ1 ∂θ1 1 ∂ ∂ 1 ∂2 + 2 sin θ2 + 2 2 2 (C.51) sin θ1 sin θ2 ∂θ2 ∂θ2 sin θ1 sin θ2 ∂θ3 and for d = 5,

2 1 ∂ 3 ∂ 1 ∂ 2 ∂ Λ = 3 sin θ1 + 2 2 sin θ2 − sin θ1 ∂θ1 ∂θ1 sin θ1 sin θ2 ∂θ2 ∂θ2 1 ∂ ∂ 1 ∂2 + 2 2 sin θ3 + 2 2 2 2 (C.52) sin θ1 sin θ2 sin θ3 ∂θ3 ∂θ3 sin θ1 sin θ2 sin θ3 ∂θ4 C.7. GEODESICS 271

For general values of d, we have

2 1 ∂ d−2 ∂ Λ = d−2 sin θ1 − sin θ1 ∂θ1 ∂θ1 1 ∂ d−3 ∂ + 2 d−3 sin θ2 + sin θ1 sin θ2 ∂θ2 ∂θ2 ··· 1 ∂ ∂ + 2 2 2 sin θd−2 sin θ sin θ sin θ sin θd− ∂θd−2 ∂θd−2 1 2 3 ··· 2 1 ∂2 + 2 2 2 2 (C.53) sin θ sin θ sin θd− ∂θ 1 2 ··· 2 d−1 We have until now been considering spaces that are intrinsically flat, but a d-dimensional hyperspherical surface embedded in a d + 1-dimensional space has intrinsic curvature. If the hyperradius r is regarded as a constant, then the Laplace-Beltrami operator for such a surface is given by

Λ2 ∆ = (C.54) − r2

while the covariant metric tensor on the surface is

r2 0 0 0 2 2 ··· 0 r sin θ1 0 0  2 2 2 ···  Gµ,ν = 0 0 r sin θ1 sin θ2 0 (C.55)  0 0 0 r2 sin2 θ sin2 θ sin2 θ ···   1 2 3   . . . . ···   . . . .      The infinitesimal element of length on the surface, ds is given by

d d 2 µ ν ds = Gµ,νdX dX µ=1 ν=1 X X = r2 dθ2 + sin2 θ dθ2 + sin2 θ sin2 θ dθ2 + (C.56) 1 1 2 1 2 3 ···  C.7 Geodesics

In the geometry of curved spaces, geodesics play the role that straight lines play in Eu- clidean geometry. The geodesic curves are local minima of path length. The minimal geodesics between two points are the shortest paths through the curved space, and play an important role when analyzing physical systems in curved space. They can be determined 272 LIVES IN MATHEMATICS

by the variational principle

d d dXµ dXν s = ds = G ds v µ,ν ds ds uµ=1 ν=1 Z Z uX X d d t dXµ dXν = G ds = minimum (C.57) µ,ν ds ds µ=1 ν=1 Z X X The Euler-Lagrange equations which follow from this variational principle are d ∂L ∂L = 0 µ = 1, 2, , d (C.58) ds ∂(dXµ/ds) − ∂Xµ ··· with

d d dXµ dXν L = G (X) (C.59) µ,ν ds ds µ=1 ν=1 X X The Euler-Lagrange equations for geodesics can be written in the form

d2Xσ dXµ dXσ = Γσ (C.60) ds µν ds ds σ Here Γµν is a Christoffel symbol, which is related to the metric tensors by

1 ∂G ∂G ∂G Γσ = Gσρ ρµ + ρν µν (C.61) µν 2 ∂Xν ∂Xµ − ∂Xρ   In general relativity theory, the trajectories of particles are geodesics in a space-time con- tinuum, whose metric is affected by the presence of other masses. Appendix D

Sturmian basis sets

D.1 One-electron Coulomb Sturmians

Because of their completeness properties, one-electron Sturmian basis sets have long been used in theoretical atomic physics [Shull and L¨owdin,1959], [Rotenberg, 1962], [Rotenberg, 1970], [Avery, 2003], [Gasaneo et al., 2009]. Their form is identical with that of the familiar hy- drogenlike atomic orbitals, except that the factor Z/n is replaced by a constant k. The one-electron Coulomb Sturmians can be written as

χnlm(x) = Rnl(r)Ylm(θ, φ) (D.1) where Ylm is a spherical harmonic, and where the radial function has the form

l −kr Rnl(r) = nl(2kr) e F (l + 1 n 2l + 2 2kr) (D.2) N − | | Here 2k3/2 (l + n)! nl = (D.3) N (2l + 1)! n(n l 1)! s − − is a normalizing constant, while ∞ at a a(a + 1) F (a b x) xt = 1 + x + x2 + (D.4) | | ≡ t b 2b(b + 1) ··· t=0 t!b X is a confluent hypergeometric function. Coulomb Sturmian basis functions obey the fol- lowing one-electron Schr¨odingerequation (in atomic units):

1 2 nk 1 2 + k χnlm(x) = 0 (D.5) −2∇ − r 2   which is just the Schr¨odingerequation for an electron in a hydrogenlike atom with the replacement Z/n k. All of the functions in a such a basis set correspond to the same energy, → 1  = k2 (D.6) −2 273 274 LIVES IN MATHEMATICS

Table B.1: One-electron Coulomb Sturmian radial functions. If k is replaced by Z/n they are identical to the familiar hydrogenlike radial wave functions.

n l Rn,l(r)

1 0 2k3/2e−kr

2 0 2k3/2(1 kr)e−kr −

2k3/2 2 1 kr e−kr √3

2(kr)2 3 0 2k3/2 1 2kr + e−kr − 3  

2√2 kr 3 1 2k3/2 kr 1 e−kr 3 − 2  

√2 3 2 2k3/2 (kr)2 e−kr 3√5 D.1. ONE-ELECTRON COULOMB STURMIANS 275

In other words the basis set is isoenergetic. In the wave equation obeyed by the Sturmians, (D.5), the potential is weighted differently for members of the basis set corresponding to different values of n. Equation (D.5) can be written in the form:

1 2 Z 1 2 kn βn + k χnlm(x) = 0 βn = (D.7) −2∇ − r 2 Z  

The weighting factors βn are chosen in such a way as to make all of the solutions isoener- getic. All solutions correspond to the energy  = k2/2. In the Hamiltonian formulation of physics, the eigenvalues of the wave equation are− a spectrum of allowed energies, but here all of the solutions of the wave equation correspond to the same energy, and the weighting factors play the role of eigenvalues. The functions in a Coulomb Sturmian basis set can be shown to obey and obey a potential-weighted orthonormality relation: To see this, we consider two solutions, χnlm(x) and χn0l0m0 (x), obeying the equations:

1 2 1 2 nk + k χnlm(x) = χnlm(x) −2∇ 2 r   0 1 2 1 2 ∗ n k ∗ + k χ 0 0 0 (x) = χ 0 0 0 (x) (D.8) −2∇ 2 n l m r n l m  

∗ Multiplying the two equations from the left respectively by χn0l0m0 (x) and χnlm(x), inte- grating over the coordinates, and subtracting the two equations, we obtain:

0 3 ∗ 1 (n n ) d x χ 0 0 0 (x) χnlm(x) = 0 (D.9) − n l m r Z where we have also made use of the fact that (from Hermiticity)

3 ∗ 1 2 1 2 d x χ 0 0 0 (x) + k χnlm(x) n l m −2∇ 2 Z   3 1 2 1 2 ∗ d x χnlm(x) + k χ 0 0 0 (x) = 0 (D.10) − −2∇ 2 n l m Z   Thus for n = n0, the potential-weighted scalar product vanishes, and it vanishes also when l0 = l or m06 = m because of the orthogonality of the spherical harmonics. The Coulomb Sturmians6 are6 normalized in such a way that the orthonormality relation is:

3 ∗ 1 k d x χ 0 0 0 (x) χ (x) = δ 0 δ 0 δ 0 (D.11) n l m r nlm n n n l l m m Z Because of their completeness and their close relationship with Coulomb potentials, Coulomb Sturmians are widely used in atomic physics. 276 LIVES IN MATHEMATICS D.2 L¨owdin-orthogonalizedCoulomb Sturmians

The Coulomb Sturmians form a complete set in the sense that any square-integrable func- tion of x can be expanded in terms of them. For this reason, they are useful as basis functions in many applications. Sometimes it may be convenient to use Coulomb Stur- mian basis functions in a form that is orthonormalized in the conventional way. Let us denote the orthogonalized Coulomb Sturmians byχ ˜µ(x), where µ (n, l, m). This new basis set is related to the original set of Coulomb Sturmians discussed≡ above by

χ˜µ(x) = χµ(x)Wµ0,µ (D.12) µ0 X where Wµ0,µ is a transformation matrix. We wish the transformation to be such that

3 ∗ d x χ˜ 0 (x)˜χµ(x) S˜µ0,µ = δµ0,µ (D.13) µ ≡ Z Suppose that

3 ∗ d x χµ0 (x)χµ(x) = Sµ0,µ (D.14) Z Then, in matrix notation, the condition that the transformation matrix W must satisfy is

W †SW = I (D.15) where the dagger denotes the Hermitian adjoint, i.e., the conjugate transpose. Following L¨owdinand Wannier, we can choose from all the possible solutions to the matrix equation (D.15) the one for which

W † = W (D.16)

(This is sometimes called symmetrical orthogonalization.) Then (D.15) will be satisfied if

W = S−1/2 (D.17)

In order to find the square root of the overlap matrix S, we diagonalize it, take the inverse square root in the diagonal representation, and then transform back to the original rep- resentation. This gives us W = S−1/2, which we then use to perform the transformation shown in equation (D.12). D.3. THE FOCK PROJECTION 277

15

10

5

0.2 0.4 0.6 0.8 1.0

Figure D.1: A set of 15 L¨owdin-orthogonalized Coulomb Sturmians corresponding to l = 0 and k = 1. The radial parts are shown as functions of r. If an arbitrary radial function is to be expanded in terms of this set, the value of k for the set can be adjusted in such a way as to give maximum accuracy. L¨owdin-orthogonalized Coulomb Sturmians are used in the Hartree-Fock calculations of Chapter 2.

D.3 The Fock projection

Coulomb Sturmian basis functions and their Fourier transforms are related by

1 3 ip·x t χnlm(x) = d x e χnlm(p) (D.18) (2π)3 Z and by the inverse transform p

t 1 3 −ip·x χnlm(p) = d x e χnlm(x) (D.19) (2π)3 Z By projecting momentum-space ontop the surface of a 4-dimensional hypersphere, V. Fock [Fock, 1935], [Fock, 1958] was able to show that the Fourier-transformed Coulomb Stur- mians can be very simply expressed in terms of 4-dimensional hyperspherical harmonics through the relationship t χn,l,m(p) = M(p)Yn−1,l,m(uˆ) (D.20) where 4k5/2 M(p) (D.21) ≡ (k2 + p2)2 278 LIVES IN MATHEMATICS

and 2kp u = 1 1 k2 + p2 2kp u = 2 2 k2 + p2 2kp u = 3 3 k2 + p2 k2 p2 u = − (D.22) 4 k2 + p2 The 4-dimensional hyperspherical harmonics are given by [Judd, 1975], [Avery, 1989], [Avery, 2000], [Avery and Avery, 2006]

1+l Yλ,l,m(uˆ) = λ,lC (u )Yl,m(u , u , u ) (D.23) N λ−l 4 1 2 3

where Yl,m is a spherical harmonic of of the familiar type, while

λ l 2(λ + 1)(λ l)! λ,l = ( 1) i (2l)!! − (D.24) N − s π(λ + l + 1)! is a normalizing constant, and

[j/2] ( 1)tΓ(j + α t) Cα(u ) = − − (2u )j−2t (D.25) j 4 t!(j 2t)!Γ(α) 4 t=0 X − is a Gegenbauer polynomial. The first few The relationships between hyperspherical har- monics, harmonic polynomials, and harmonic projection will be discussed in Appendix C. Table 5.1 in Chapter 5 shows the first few hyperspherical harmonics.

D.4 Generalized Sturmians and many-particle prob- lems

In 1968, Osvaldo Goscinski [[Goscinski, 1968, 2003]] generalized the concept of Sturmian basis sets by considering isoenergetic sets of solutions to a many-particle Schr¨odingerequa- tion with a weighted potential:

1 ∆ + βνV (x) Eκ Φν = 0 (D.26) −2 0 − | i  

The weighting factors βν are chosen in such a way as to make all of the functions in the set correspond to the same energy, Eκ, and this energy is usually chosen to be that of the quantum mechanical state which is to be represented by a superposition of generalized D.5. USE OF GENERALIZED STURMIAN BASIS SETS TO SOLVE THE MANY-PARTICLE SCHRODINGER¨ EQUATION279

Sturmian basis functions. If the basis set is used to treat N-particle systems where the particles have different masses, the operator ∆ in the kinetic energy term is given by

N 1 2 ∆ j (D.27) ≡ mj ∇ j=1 X while if the masses are all equal, it is given by the generalized Laplacian operator:

d ∂2 ∆ (D.28) ≡ ∂x2 j=1 j X with d = 3N and x = (x1, x2, ..., xd) (D.29) Like the one-electron Coulomb Sturmians, the functions in generalized Sturmian basis sets can be shown to obey a potential-weighted orthonormality relation [Avery and Avery, 2006]:

2Eκ Φν0 V0(x) Φν = δν0,ν (D.30) h | | i βν D.5 Use of generalized Sturmian basis sets to solve the many-particle Schr¨odingerequation

If we wish to solve a many-particle Schr¨odingerequation of the form 1 ∆ + V (x) Eκ Ψκ = 0 (D.31) −2 − | i   we can approximate a solution as a superposition of generalized Sturmian basis functions

Ψκ Φν Bν,κ (D.32) | i ≈ ν | i X Substituting this superposition into the Schr¨odingerequation and remembering that each of the basis functions satisfies eq.(D.26), we obtain: 1 ∆ + V (x) E Φ B 2 κ ν ν,κ ν − − | i X   = [V (x) βνV0(x)] Φν Bν,κ 0 (D.33) ν − | i ≈ X If we multiply from the left by a conjugate function from our generalized Sturmian basis set and integrate over all coordinates, we obtain a set of secular equations from which the kinetic energy term has disappeared:

∗ Φν0 [V (x) βνV0(x)] Φν Bν,κ = 0 (D.34) ν h | − i X 280 LIVES IN MATHEMATICS

If we introduce the definition

1 ∗ Tν0,ν Φν0 V (x) Φν (D.35) ≡ −pκ h | | i where pκ 2Eκ (D.36) ≡ − and make use of the potential-weighted orthonormalityp relations (D.30), we can rewrite the secular equations in the form:

[Tν0,ν pκδν0,ν] Bν,κ = 0 (D.37) ν − X The generalized Sturmian secular equations are strikingly different from conventional Hamil- tonian secular equations in several ways:

The kinetic energy term has disappeared. • The matrix representing the approximate potential V (x) is diagonal. • 0 The roots of the secular equations are not energies, but values of the scaling parameter • 2 pκ, from which the energy can be obtained through the relationship Eκ = p /2. − κ

For Coulomb potentials, the matrix Tν0,ν is energy-independent. • D.6 Momentum-space orthonormality relations for Stur- mian basis sets

By arguments similar to those used in equations (D.8)-(D.11), a set of generalized Sturmian basis functions can be shown to obey a potential-weighted orthonormality relation in direct space 2 ∗ 2Eκ pκ dx Φ 0 (x)V (x)Φν(x) = δν0,ν = δν0,ν (D.38) ν 0 β − β Z ν ν where 2 p 2Eκ (D.39) κ ≡ − (In equation (D.38) and in the remainder of this appendix, we abandon the Dirac bra and ket notation in order to distinguish between functions of x (x , x , ..., xN ) and functions ≡ 1 2 of p (p1, p2, ..., pN )). We would now like to find the momentum-space orthonormality relations≡ obeyed by Fourier transforms of the generalized Sturmian basis set. Because the Fourier transform is unitary, the inner product of any two functions in L2 is preserved under the operation of taking their Fourier transforms, i.e.

dx f ∗(x)g(x) = dp f t∗(p)gt(p) (D.40) Z Z D.6. MOMENTUM-SPACE ORTHONORMALITY RELATIONS FOR STURMIAN BASIS SETS281

∗ ∗ Using this well-known relationship with f (x) = Φν0 (x) and g(x) = V0(x)Φν(x), we have

∗ t∗ t dx Φν0 (x)V0(x)Φν(x) = dp Φν0 (p)[V0Φν] (p) (D.41) Z Z t In order to evaluate [V0Φν] (p), we remember the Fourier convolution theorem, which states that the Fourier transform of the product of two functions is the convolution of their Fourier transforms. Thus if a and b are any two functions in L2,

1 0 1 [ab]t (p0) dx e−ip ·xa(x)b(x) = dp at(p0 p)bt(p) (D.42) ≡ (2π)d/2 (2π)d/2 − Z Z Letting a(x) = V0(x) and b(x) = Φν(x) we have

t 0 1 t 0 t [V Φν] (p ) = dp V (p p)Φ (p) (D.43) 0 (2π)d/2 0 − ν Z Since the momentum-space integral equation corresponding to (D.26) has the form 2β (p02 + p2 )Φt (p0) = ν dp V t(p0 p)Φt (p) (D.44) κ ν −(2π)d/2 0 − ν Z it follows that 2 2 t (p + pκ) t [V0Φν] (p) = Φν(p) (D.45) − 2βν Finally, substituting (D.45) into (D.41), we obtain the momentum-space orthonormality relations for a set of generalized Sturmian basis functions: 2 2 t∗ p + pκ t dp Φ 0 (p) Φ (p) = δ 0 (D.46) ν 2p2 ν ν ,ν Z  κ  Because all of the functions Φν(x) in the generalized Sturmian basis set obey equation (D.26), the potential-weighted direct space orthonormality relations shown in equation (D.38) can be rewritten in the form 2 ∗ ∆ + pκ dx Φ 0 (x) − Φν(x) = δν0,ν (D.47) ν 2p2 Z  κ  so that the momentum-space and direct-space orthonormality relations can be seen to be related to each other in a symmetrical way. These weighted orthonormality rela- d (1) d tions in L2(R ) are the usual orthonormality relations in the Sobolev space W2 (R ) (see [Weniger, 1985]). For the case of unequal masses, where d 1 ∂2 ∆ 2 (D.48) ≡ mj ∂x j=1 j X the momentum-space orthonormality relations for generalized Sturmians (D.46) takes on the slightly modified form 2 2 t∗ j pj /mj + pκ t dp Φ 0 (p) Φ (p) = δ 0 (D.49) ν 2p2 ν ν ,ν Z P κ ! 282 LIVES IN MATHEMATICS D.7 Sturmian expansions of d-dimensional plane waves

If the set of generalized Sturmian basis functions is complete in the sense of spanning the (1) d Sobolev space W2 (R ), we can use it to construct a weakly convergent expansion of a d-dimensional plane wave (valid only in the sense of distributions). Suppose that we let

p2 + p2 eip·x = κ Φt∗(p)a (x) (D.50) 2p2 ν ν κ ν   X We can then determine the unknown functions aν(x) by means of the orthonormality t∗ relations (D.46). Multiplying (D.50) on the left by Φν0 (p) and integrating over dp making use of (D.46), we obtain

ip·x t∗ dp e Φν0 (p) = δν0,νaν(x) = aν0 (x) (D.51) ν Z X so that ip·x t d/2 aν(x) = dp e Φν(p) = (2π) Φν(x) (D.52) Z Thus finally we obtain an expansion of the form p2 + p2 eip·x = (2π)d/2 κ Φt∗(p)Φ (x) (D.53) 2p2 ν ν κ ν   X (1) d If the set of generalized Sturmians Φν(x) does not span W2 (R ), equation (D.53) becomes p2 + p2 P eip·x = (2π)d/2 κ Φt∗(p)Φ (x) (D.54) 2p2 ν ν  κ  ν   X where P [eip·x] is the projection of the d-dimensional plane wave onto the subspace spanned by the set Φν(x) . For example, if we are considering a system of N electrons, with d = 3N, the generalized{ } Sturmian basis set might be antisymmetric with respect to exchange of the N electron coordinates but otherwise complete. In that case, P [eip·x] would represent the projection of the plane wave onto that part of Hilbert space corresponding to functions of x that are antisymmetric with respect to exchange of the N electron coordinates. Neither the expansion shown in equation (D.53) nor that shown in equation (D.54) is point-wise convergent. In other words, we cannot perform the sums shown on the right-hand sides of these equations and expect them to give point-wise convergent representations of the plane wave or its projection. However, the expansions are valid in the sense of distributions. For the case of unequal masses, the generalized Sturmian plane wave expansion takes on the slightly modified form

2 2 pκ + pj /mj eip·x = (2π)d/2 j Φt∗(p)Φ (x) (D.55) 2p2 ν ν P κ ! ν X D.8. AN ALTERNATIVE EXPANSION OF A D-DIMENSIONAL PLANE WAVE 283 D.8 An alternative expansion of a d-dimensional plane wave

In the Hamiltonian formulation of physics, one typically obtains sets of functions whose orthonormality relation has the form

∗ dx Φν0 (x)Φν(x) = δν0,ν (D.56) Z Such a set of basis functions might, for example be the configurations resulting from the solution of the N-electron approximate Schr¨odingerequation

1 ∆ + V (x) Eν Φν(x) = 0 (D.57) −2 0 −   with x (x1, x2, ...., xd) and d = 3N. It is interesting to notice that a d-dimensional plane wave can≡ also be expanded in terms of a basis set with orthonormality relations of the form shown in equation (D.56). To see this we write

−ip·x ∗ e = aν(p)Φν(x) (D.58) ν X

Multiplying from the left by Φν0 (x) and integrating over the coordinates, we obtain the relation

−ip·x ∗ dx e Φν0 (x) = aν(p) dx Φν(x)Φν0 (x) ν Z X Z d/2 t = aν(p)δν0,ν = aν0 (p) = (2π) Φν0 (p) ν X (D.59)

Thus we obtain the alternative expansion

−ip·x d/2 t ∗ e = (2π) Φν0 (p)Φν(x) (D.60) ν X or

ip·x d/2 t∗ e = (2π) Φν0 (p)Φν(x) (D.61) ν X The expansion (D.53) was a consequence of the weighted orthonormality relations obeyed by generalized Sturmian basis sets, while the expansion (D.61) resulted from the more conventional orthonormality relations (D.56). Both forms of the expansion are used in Chapter 8. 284 LIVES IN MATHEMATICS Appendix E

GENERALIZED STURMIANS APPLIED TO ATOMS

E.1 Goscinskian configurations

The Generalized Sturmian Method (Appendix D) is a newly-developed direct method for performing Configuration Interaction calculations on bound states [Avery, 1989]-[Avery and Avery, 2006]. It avoids the initial Hartree-Fock-Roothaan SCF calculation, and it is especially suitable for calculating large numbers of excited states of few-electron atoms or ions. When the Generalized Sturmian Method is applied to atoms or atomic ions, it is con- venient to use basis functions that are Slater determinants:

χµ(x ) χµ0 (x ) χµ00 (x ) 1 1 1 ··· 1 χµ(x2) χµ0 (x2) χµ00 (x2) Φ = χ χ 0 χ 00 ··· (E.1) ν µ µ µ χ (x ) χ 0 (x ) χ 00 (x ) | i | · · · | ≡ √N! µ 3 µ 3 µ 3 . . . ··· . . .

built from hydrogenlike atomic spin-orbitals of the form

αi ms = 1/2 χµ(xi) χn,l,m,ms (xi) Rn,l(ri)Yl,m(θi, φi) (E.2) ≡ ≡ βi ms = 1/2  −

with weighted nuclear charges Qν. In other words, the atomic spin-orbitals have the form

285 286 QUANTUM THEORY

shown in equation (5.64), with radial functions given by

3/2 −Qν r R1,0(r) = 2Qν e 3/2 Qν Qνr −Qν r/2 R2,0(r) = 1 e √2 − 2   5/2 Qν −Qν r/2 R2,1(r) = r e 2√6 3/2 2 2 2Qν 2Qνr 2Qνr −Qν r/3 R3,0(r) = 1 + e 3√3 − 3 27   ...... (E.3)

The reader will recognize these as the familiar hydrogenlike radial functions with the nu- clear charge Z replaced by Qν. If the effective charges Qν characterizing the configurations Φν are chosen in such a way that | i 1/2 2Eκ Qν = βνZ = − (E.4) 1 + 1 + 1 +  n2 n02 n002 ··· so that 2 Qν 1 1 1 Eκ = + + + (E.5) − 2 n2 n02 n02 ···   the configurations will obey the approximate N-electron Schr¨odingerequation:

N 1 2 + βνV (x) Eκ Φν = 0 (E.6) −2 ∇j 0 − | i " j=1 # X where N Z V0(x) = (E.7) − rj j=1 X is the nuclear attraction potential. In equation (E.6), the energy Eκ is kept constant for the whole basis set, while the weighting factors βν are adjusted to make the basis set isoenergetic. Thus the weighting factors βν play the role of eigenvalues in equation (E.6). This type of problem has been called the conjugate eigenvalue problem by Coul- son, Josephs, Goscinski and others [Goscinski, 1968, 2003], and it is characteristic for the equations defining generalized Sturmian basis sets (Appendix B). To see that with the special choice of weighted charges shown in equation (E.4) Φν will satisfy (E.6), we first notice that the hydrogenlike atomic orbitals with weighted| nucleari charges obey the 1-electron Schr¨odingerequation:

2 1 2 Qν Qν + χµ(xj) = 0 (E.8) −2∇j 2n2 − r  j  E.1. GOSCINSKIAN CONFIGURATIONS 287

Since the Slater determinant Φν is an antisymmetrized product of atomic orbitals, all of which obey (E.8), it follows that| i

N 2 2 1 2 Qν Qν Qν Qν Φν = + + + + + Φν −2 ∇j | i − 2n2 2n02 ··· r r ··· | i " j=1 # 1 2 X     

= [Eκ βνV (x)] Φν (E.9) − 0 | i

and thus equation (E.6) is satisfied. Each configuration Φν has its own effective nu- | i clear charge Qν. Within a particular configuration, the hydrogenlike atomic orbitals are orthonormal ∗ dτj χµ0 (xj)χµ(xj) = δµ0,µ (E.10) Z and they also obey the virial relationship

2 2 Qν Qν dτj χµ(xj) = (E.11) − | | r − n2 Z j From equations (E.6), (E.10) and (E.11), it can be shown [Avery, 2000], [Avery and Avery, 2006] that the generalized Sturmian configurations Φν obey the potential-weighted orthonor- mality relation | i ∗ 2Eκ Φν0 V0 Φν = δν0,ν (E.12) h | | i βν We next introduce the definitions

pκ 2Eκ (E.13) ≡ − and p 1 1 ν 2 + 02 + (E.14) R ≡ rn n ··· With the help of these definitions, equation (E.4) can be written in the form

pκ Qν = βνZ = (E.15) ν R The set of Sturmian configurations forms a set of isoenergetic solutions of the approximate Schr¨odingerequation (E.6), where the potential is weighted, and the weighting factors βν are chosen in such a way as to insure that all the solutions correspond to a common energy. From (E.13) we can see that their common energy Eκ is related to pκ by

2 pκ Eκ = (E.16) − 2 In previous publications we have called such atomic configurations Goscinskian configura- tions to recognize Prof. Osvaldo Goscinski’s pioneering work in generalizing the concept 288 QUANTUM THEORY

of Sturmian basis sets [Goscinski, 1968, 2003]. The non-relativistic Schr¨odingerequation of an N-electron atom has the form:

N 1 2 + V (x) Eκ Ψκ = 0 (E.17) −2 ∇j − | i " j=1 # X where 0 V (x) = V0(x) + V (x) (E.18)

0 Here V0(x) is the nuclear attraction potential shown in equation (E.7) while V (x) is the interelectron repulsion potential

N 1 V 0(x) = (E.19) r i>j ij X We can try to build up the wave function from a superposition of Goscinskian configura- tions, i.e. from a superposition of isoenergetic solutions of the approximate wave equation (E.6), where V0 is the nuclear attraction potential of the atom. Thus we write:

Ψκ Φν Cν,κ (E.20) | i ≈ ν | i X Inserting this superposition into (E.17) we have

1 ∆ + V (x) E Φ C 0 (E.21) 2 κ ν ν,κ ν − − | i ≈ X   However, each of the basis functions obeys (E.6), and therefore we can rewrite (E.21) in the form

[V (x) βνV0(x)] Φν Cν,κ 0 (E.22) ν − | i ≈ X The energy term Eκ is now nowhere to be seen, and a remark is perhaps needed here to explain what has happened to it: The configurations in our Generalized Sturmian basis set are isoenergetic. They all correspond to the same energy, Eκ, since the weighting factors βν are chosen especially to make them do so. What we have done in going from (E.21) to (E.22) is to choose this energy to be the same as that which appears in (E.21). In other words, the energy to which all the members of our basis set correspond is chosen to be equal to the energy of the state that we are trying to approximate. If we take the scalar product of (E.22) with a conjugate function from our basis set, we obtain the set of secular equations:

Φν0 [V (x) βνV0(x)] Φν Cν,κ = 0 (E.23) ν h | − | i X E.1. GOSCINSKIAN CONFIGURATIONS 289

We now introduce the definitions:

0 1 ∗ Tν0,ν Φν0 V0 Φν (E.24) ≡ −pκ h | | i and 0 1 ∗ 0 Tν0,ν Φν0 V Φν (E.25) ≡ −pκ h | | i From the potential-weighted orthonormality relations (E.12) we can see that

0 T 0 = δν0νZ ν (E.26) ν ,ν R 0 Notice that the nuclear attraction matrix Tν0,ν is both diagonal and energy-independent. 0 The interelectron repulsion matrix Tν0,ν can be evaluated using methods discussed in Ap- 0 pendix D, and it is also energy-independent. In order to see that Tν0,ν really is energy- independent, we notice that it is built up from terms of the form

1 1 3 3 0 1 0 Jµ1,µ2,µ3,µ4 = d x d x ρµ1,µ2 (x) 0 ρµ3,µ4 (x ) (E.27) pκ pκ x x Z Z | − | where densities are defined by

∗ ρµ1,µ2 (x) χµ1 (x)χµ2 (x) 0 ≡ ∗ 0 0 ρµ ,µ (x ) χ (x )χµ (x ) (E.28) 3 4 ≡ µ3 4 and where the orbitals are the hydrogenlike orbitals with weighted nuclear charge shown in equations (E.2) and (E.3). We now let

s pκx 0 ≡ 0 s pκx (E.29) ≡

Then, making the substitution Qν pκ/Rν in (E.3) we have → 3 ρµ1,µ2 (x) = pκρ˜µ1,µ2 (s) 0 3 0 ρµ3,µ4 (x ) = pκρ˜µ3,µ4 (s ) (E.30)

0 0 whereρ ˜µ1,µ2 (s) andρ ˜µ3,µ4 (s ) are pure functions of s and s respectively. Finally, noticing that 1 1 0 = 0 (E.31) pκ x x s s | − | | − | we can write

1 3 3 0 1 0 Jµ1,µ2,µ3,µ4 = d s d s ρ˜µ1,µ2 (s) 0 ρ˜µ3,µ4 (s ) (E.32) pκ s s Z Z | − | 290 QUANTUM THEORY

Since the building-blocks from which it composed are independent of pκ, the interelectron 0 repulsion matrix Tν0,ν is also independent of pκ and hence independent of energy. The 0 energy-independent interelectron repulsion matrix Tν0,ν consists of pure numbers (in atomic units) which can be evaluated once and for all and stored. With the help of equations (E.24)-(E.26), the secular equation (E.23) can be rewritten in the form:

0 pκδν0,νZ ν pκTν0,ν + βνpκδν0,νZ ν Cν,κ = 0 (E.33) ν − R − R X   Finally, using the relationship βνZ ν = pκ (E.34) R and dividing by pκ, and reversing the signs, we obtain

0 δν0,νZ ν + Tν0,ν pκδν0,ν Cν,κ = 0 (E.35) ν R − X   The Generalized Sturmian secular equation for atoms and atomics ions (E.35) differs in several remarkable ways from the secular equations that would be obtained using a Hamil- tonian method: 1. The kinetic energy term has disappeared.

2. The nuclear attraction term, δν0,νZ ν, is diagonal. R 0 3. The interelectron repulsion matrix Tν0,ν is energy-independent. It consists of dimen- sionless pure numbers.

4. Finally, the roots of the secular equations are not energies but values of the parameter pκ, which is related to the energy spectrum through equation (E.16). The parameter pκ = βνZ ν = Qν ν can be thought of as a scaling parameter, since the effective nuclear chargesR associatedR with the Goscinskian configurations are proportional to it.

5. The configurations Φν in the basis set are not fully determined until the secular equations have been| solved.i Only the form of the basis functions is known in advance, but not the scale. When the secular equation is solved, the resulting spectrum of pκ values yields not only a spectrum of energies but a nearly optimum set of basis functions for the representation of each state. The basis set for the representation of highly excited states is diffuse, while the set for representation of tightly-bound states is contracted. The step of optimizing Slater exponents for each problem is thus not needed.

0 6. Once the energy-independent interelectron repulsion matrix Tν0,ν has been constructed, the properties of an entire isoelectronic series can be calculated with almost no ad- ditional effort. E.2. RELATIVISTIC CORRECTIONS 291 E.2 Relativistic corrections

If the number of electrons N is kept constant while Z is allowed to increase, the energies calculated from the Generalized Sturmian secular equation approach those found by so- lution of the non-relativistic Schr¨odingerequation, but a relativistic correction must be added in order for the energies to approach experimental values. A crude relativistic cor-

rection can be found for a multiconfigurational state Ψκ(x) = ν Φν(x)Cνκ by calculating the ratio of the relativistic energy of the with interelectron repulsion entirely neglected to the non-relativistic energy, again with interelectron repulsion entirelyP neglected. The ratio can be written in the form E C2 Φ H Φ f (Z) = κ,rel = ν νκ ν 0 ν rel (E.36) κ 1 2h | 2| 2i Eκ,nonrel Z C P− 2 ν νκRν Here P

Φν H0 Φν rel = µ,rel µ = (n, l, m, ms) (E.37) h | | i µ∈ν X is the relativistic energy of the configuration Φν(x) with interelectron repulsion entirely neglected, while 1 Z2 1 = Z2R2 µ = (n, l, m, m ) (E.38) 2 n2 2 ν s − µ∈ν − X is the nonrelativistic energy of Φν(x). The quantity µ,rel represents the relativistic energy of a single electron moving in the attractive Coulomb potential of a nucleus with charge Z. This energy is easy to calculate exactly [Akhiezer and Berestetskii, 1965], if effects such as vacuum polarization and the Lamb shift are neglected. It is given by:

2 c 2 µ,rel = / c (E.39) 2 1 2 − Z 1 + c(γ+n−|j+1/2|)     1 2 Z 2 γ j + c = 137.036 (E.40) ≡ 2 − c s    where j is the total angular momentum (orbital plus spin) of a single electron, i.e. l 1 . ± 2 The corrected energy, fκ(Z)Eκ,nonrel, agrees closely with the experimental values of energies, especially when Z is large compared with N. The approximate relativistic correction discussed here is by no means confined to the Generalized Sturmian Method. It can be used in quantum calculations of every kind, per- formed on atoms and molecules. The assumption behind the correction is that relativistic effects are due mainly to the nuclear attraction part of the Hamiltonian, and only to a lesser extent to interelectron repulsion terms. 292 QUANTUM THEORY

Table E.1: This table shows the relativistic correction for a single electron moving in the field of a nucleus with charge Z, i.e. the relativistic energy without the rest energy, divided by the non-relativistic energy. It is interesting to notice that the correction affects the 4th significant figure of the energy for values of Z as low as 10. In all cases the effect of the relativistic correction is to increase the binding energy.

n j Z=1 Z=10 Z=20 Z=30

1 1 1.00001 1.00133 1.00538 1.01228 2

1 2 1.00002 1.00167 1.00673 1.01537 2

3 2 1.00000 1.00033 1.00133 1.00301 2

1 3 1.00001 1.00133 1.00538 1.01226 2

3 3 1.00000 1.00044 1.00178 1.00402 2

5 3 1.00000 1.00015 1.00059 1.00133 2 E.3. THE LARGE-Z APPROXIMATION: RESTRICTION OF THE BASIS SET TO AN -BLOCK293 R

-0.0555 j=52 -0.0560 j=32

-0.0565

-0.0570

-0.0575 j=12

Z 10 20 30 40 50

2 Figure E.1: This figure shows µ/Z for n=3. The non-relativistic energy is the horizontal line nonrel = 1/18, while the relativistic energies are shown by curves. − E.3 The large-Z approximation: Restriction of the basis set to an -block R If interelectron repulsion is entirely neglected, i.e. when disregarding the second term in Eq. (E.35), the calculated energies Eκ become those of a set of N completely independent electrons moving in the field of the bare nucleus:

2 2 2 2 pκ 1 2 2 Z Z Z Eκ = Z ν = 2 2 2 (E.41) − 2 −→ −2 R −2n1 − 2n2 − · · · − 2nN In the large-Z approximation, we do not neglect interelectron repulsion, but we restrict the basis set to those Goscinskian configurations that would be degenerate if interelectron repulsion were entirely neglected, i.e., we restrict the basis to a set of configurations all of which correspond to the same value of ν. In that case, the first term in (E.35) is a R multiple of the identity matrix, and the eigenvectors Cνκ are the same as those that would 0 be obtained by diagonalizing the energy-independent interelectron repulsion matrix Tν0ν, since the eigenfunctions of any matrix are unchanged by adding a multiple of the unit matrix. The simplified secular equation then becomes:

0 [Tν0ν λκδν0ν] Cνκ = 0 (E.42) ν − X The roots are shifted by an amount equal to the constant by which the identity matrix is multiplied: pκ = Z ν + λκ = Z ν λκ (E.43) R R − | | 294 QUANTUM THEORY

E €€€€€€€€€ Z2 -1.0

-1.1

-1.2

-1.3

-1.4

Z 10 20 30 40 50

Figure E.2: The ground state of the carbon-like isoelectronic series, calculated in the large-Z approximation. The energies divided by Z2 are shown as functions of Z. Experimental values are indicated by dots, while the energies calculated from equation (E.44) are shown as curves. The lower (solid) curve, which approaches the experimental values with increasing Z, has been corrected for relativistic effects. The upper (dashed) curve is uncorrected. and the energies become 1 2 Eκ = (Z ν λκ ) (E.44) −2 R − | | With the relativistic correction of equation (E.36), this becomes

1 2 Eκ = f(Z) (Z ν λκ ) (E.45) − 2 R − | |

Since the roots λκ are always negative, we may use the form λκ in place of λκ to make explicit the fact that interelectron repulsion reduces the binding−| energies,| as of course it must. The roots λκ are pure numbers that can be calculated once and for all and stored. From these roots, a great deal of information about atomic states can be found with very little effort.

E.4 Electronic potential at the nucleus in the large-Z approximation

The electronic potential ϕ(x1) is related to the electronic density distribution by ρ(x0 ) ϕ(x ) = d3x0 1 (E.46) 1 1 x x0 Z | 1 − 1| E.4. ELECTRONIC POTENTIAL AT THE NUCLEUS IN THE LARGE-Z APPROXIMATION295

If the coordinate system is centered on the nucleus, the electronic potential at the nucleus is then given by ρ(x0 ) ϕ(0) = d3x0 1 (E.47) 1 x0 Z | 1| But the electron density corresponding to the state Ψκ is defined as

3 3 ∗ ρ(x ) = N ds d x ds d xN dsN Ψ (x)Ψκ(x) (E.48) 1 1 2 2 ··· κ Z Z Z Z Z where the integral is taken over the spin coordinate of the first electron and over the space

and spin coordinates of all the other electrons. The wave function Ψκ(x) = ν Φκ(x)Bνκ is a linear combination of Goscinskian configurations. Thus the density is given by P ∗ ρ(x1) = ρν0ν(x1)BνκBνκ (E.49) ν0,ν X where

3 3 ∗ ρν0ν(x ) = N ds d x ds d xN dsN Φ 0 (x)Φν(x) 1 1 2 2 ··· ν Z Z Z Z Z 0 if ν0 and ν differ by 2 or more orbitals

 ∗ 0 0 =  χµ0 (x1)χµ(x1) if ν and ν differ only by µ µ  →  N 2 0 i=1 χµi (x1) if ν = ν  | |  (E.50)  P In equation (E.50) we have made use of the fact that within an -block, the atomic spin- orbitals are orthonormal. R Within the framework of the large-Z approximation we have

∗ ∗ ∗ dx Ψκ(x)V0(x)Ψκ(x) = Bν0κBνκ dx Φν0 (x)V0(x)Φν(x) ν0 ν Z X X Z p2 = κ B 2 (E.51) β νκ − ν ν | | X In the second step above, we make use of the potential weighted orthonormality relation 2 (E.12). Further, since Bνκ = 1, equation (E.51) reduces to ν | | 2 P ∗ pκ dx Ψ (x)V (x)Ψκ(x) = = pκZ ν (E.52) κ 0 −β − R Z ν This result can be used to express the electronic potential at the nucleus in a very simple form. Combining (E.47) and (E.48), we obtain 1 ϕ(0) = N dx Ψ∗ (x)Ψ (x) (E.53) x κ κ Z | 1| 296 QUANTUM THEORY

jH0L

100 Z HN - 10L Z HN - 2L 4Z+ €€€€€€€€€€€€€€€€€€€€€€€€€€€€€€€€€€ 2Z+ €€€€€€€€€€€€€€€€€€€€€€€€€€€€€€ 9 4 80 ZN 2 QΝRΝ

60

40

20

0 N 0 5 10 15 20

Figure E.3: When interelectron repulsion is entirely neglected, the electronic 2 potential at the nucleus is given by Z ν , which is exactly piecewise linear in N. The effect of interelectron repulsionR is to decrease ϕ(0) and to make the dependence only approximately piecewise linear. The figure shows ϕ(0) ne- glecting interelectron repulsion (upper values) and including it (lower values). The dots are calculated from the electronic densities of the ground state wave functions, whereas the lines are the closed form expressions found in equations (E.58) and (E.56).

From the definition of V0, equation (E.7), and from the fact that each term in the sum in (E.7) gives the same contribution, we have

1 ∗ ϕ(0) = dx Ψ (x)V (x)Ψκ(x) (E.54) −Z κ 0 Z Combining equations (E.54) and (E.52) we obtain the extremely simple result:

ϕ(0) = pκ ν (E.55) R which can alternatively be written in the form:

2 ϕ(0) = Z ν λκ ν (E.56) R − | |R or in a third form: 2 ϕ(0) = Qν ν (E.57) R since Qν = Z λκ / ν. From equations (E.55)-(E.57) it follows that for an isonuclear series, the electronic− | | potentialR at the nucleus depends on N in an approximately piecewise linear way. For example, let us consider the isonuclear series where Z = 18. Keeping the E.5. CORE IONIZATION ENERGIES 297

nuclear charge Z constant at this value, we begin to add electrons. For the ground state we have: N N 2 1 ≤ 2 1 1 1  2 N−2 ν 2 + 2 + + 2 =  1 + 4 2 N 10 (E.58) R ≡ n1 n2 ··· nN  ≤ ≤  2 + 8 + N−10 10 N 18 1 4 9 ≤ ≤   E.5 Core ionization energies

The large-Z approximation can be used to calculate core-ionization energies, i.e. the energies required to remove an electron from the inner shell of an atom. From (E.44) we can see that this energy will be given by

1 2 0 0 2 ∆E = (Z ν λκ ) (Z ν λ ) (E.59) 2 R − | | − R − | κ|   where the unprimed quantities refer to the original ground state, while the primed quan- tities refer to the core-ionized states. Since

2 02 ν ν = 1 (E.60) R − R Equation (E.59) can be written in the form

2 2 0 2 Z 0 0 λκ λκ ∆E = Z [ ν λ ν λκ ] + | | − | | (E.61) − 2 R | κ| − R | | 2 Thus we can see that within the framework of the large-Z approximation, the quantity ∆E Z2/2 is linear in Z for an isoelectronic series. This quantity represents the contribu- tion− of interelectron repulsion to the core ionization energy, since if interelectron repulsion is completely neglected, the core ionization energy is given by ∆E = Z2/2. Core ionization energies calculated from equations (E.59)-(E.61) are shown in Figures E.4 through E.6. 298 QUANTUM THEORY

Z2 DE - €€€€€€€€€ 2 0 Z 10 20 30 40

-20 N = 2

-40

-60

-80

N = 10 -100

-120 N = 18

Figure E.4: For isoelectronic series, equation (E.61) indicates that within the large-Z approximation, the quantity ∆E Z2/2 is exactly linear in Z, as is illustrated above. ∆E is the core ionization− energy.

DE

160

150

140

130

120

110

N 5 10 15

Figure E.5: For isonuclear series, the dependence of the core ionization energy on N is approximately piecewise linear. Whenever a new shell starts to fill, the slope of the line changes. The dots in the figure were calculated using equation (E.61), where it is not obvious that the dependence ought to be approximately piecewise linear. However, equations (E.58) and (E.56) can give us some insight into the approximately piecewise linear relationship. E.6. ADVANTAGES AND DISADVANTAGES OF GOSCINSKIAN CONFIGURATIONS299

DE

140

120 Z = 18

100

80

60

40 Z = 10 20

N 2 4 6 8 10 12

Figure E.6: This figure shows the dependence of the core-ionization energy on both N and Z for the filling of the n = 2 shell. Points with N > Z are omit- ted because the large-Z approximation cannot be used for these points. The energies are in Hartrees. E.6 Advantages and disadvantages of Goscinskian con- figurations

We seen that when V0(x) is chosen to be the the Coulomb attraction of the bare nucleus, the approximate Schr¨odingerequation

N N 1 2 Qν j + βνV0(x) Eκ Φν = 0 βνV0(x) = (E.62) −2 ∇ − | i − rj " j=1 # j=1 X X can be solved exactly using configurations composed of hydrogenlike spin-orbitals with the especially chosen weighted charges Qν shown in equation (E.4). There is no need to calculate the weighting factors βν. These are obtained automatically when the secular equation is solved. Nor is there a need to normalize the configurations. This is also achieved automatically. Thus the choice of V0(x) as the potential of the bare nucleus has many advantages; but it also has disadvantages. Just as is the case in perturbation theory, convergence is most rapid if V0(x) is chosen to be as close as possible to the actual potential. By choosing V0(x) to be the Coulomb attraction of the bare nucleus, we have neglected interelectron repulsion. This is why the Generalized Sturmian Method with Goscinskian configurations works best when the number of electrons in an atom or ion is small, and why it works especially well when Z >> N, i.e. when the Coulomb attraction of the nucleus dominates over the effects of interelectron repulsion. To extend the range of applicability of the method to atoms and ions with large values of N, we would need to choose a V0(x) which included some of the effects of interelec- 300 QUANTUM THEORY

tron repulsion. For example, we could let it be the Hartree potential. The approximate Schr¨odingerequation (E.6) can always be solved provided that it is separable, and it is separable whenever the approximate potential has the form

N

V0(x) = v(xj) (E.63) j=1 X The separated form of (E.6) becomes:

1 2 + βνv(xj) ζ ϕζ (xj) = 0 (E.64) −2∇j −  

where the weighting factors βν must be chosen in such a way that

ζ = Eκ (E.65) ζ∈ν X

If the spin-orbitals ϕζ (xj) satisfy (E.64), then configurations of the form

ϕζ (x ) ϕζ0 (x ) ϕζ00 (x ) 1 1 1 ··· 1 ϕζ (x2) ϕζ0 (x2) ϕζ00 (x2) Φ = ϕ ϕ 0 ϕ 00 ··· (E.66) ν ζ ζ ζ ϕ (x ) ϕ 0 (x ) ϕ 00 (x ) | i | · · · | ≡ √N! ζ 3 ζ 3 ζ 3 . . . ··· . . .

will satisfy the approximate Schr¨odingerequation (E.6). Some of the neatness of the Generalized Sturmian Method with Goscinskian configurations is certainly lost by choosing a V0(x) that includes effects of interelectron repulsion, but it could be worth paying this price in order to extend the method to atoms and atomic ions with larger values of N. We are at present exploring these possibilities, and some work in this direction is also being done by Prof. Gustavo Gasaneo and his group in Argentina.

E.7 -blocks, invariant subsets and invariant blocks R To tie the discussion of this chapter in with the general principles discussed in Chapter 1, we identify T with the operator whose roots and eigenfunctions we wish to study. The group of symmetry operations that leave the nuclear attraction and interelectron repulsion matrix of an atom invariantG consists of rotations of the entire system about the nucleus, together with reflections and inversions that do not affect the interelectron distances. These operations do not affect the radial parts of the atomic orbitals from which the Goscinskian configurations are constructed, nor do they affect the spin. Thus the set of configurations, all of which are characterized by the same value of

1 1 1 ν 2 + 02 + 002 + (E.67) R ≡ rn n n ··· E.7. -BLOCKS, INVARIANT SUBSETS AND INVARIANT BLOCKS 301 R i.e. configurations all of which are built from hydrogenlike atomic spin-orbitals with a particular set of principal quantum numbers (n, n0, n00,...), is closed under , and it cor- responds to an invariant subset as discussed in Chapter 1. The block of TG0 based on it corresponds to an invariant block. As expected, the eigenfunctions of interelectron repul- sion matrix for the -blocks are the symmetry-adapted basis functions that we desire. In Chapter 1, we mentionedR that when the roots of an invariant block are degenerate, then in order to take full advantage of the symmetry of the problem, we need to add an extremely small perturbation which will slightly remove the degeneracy. In the present case, this slight perturbation is given by

Tp = aLz + bSzF (E.68)

where a and b are two very small irrational numbers. (They are chosen to be irrational in order to avoid accidental degeneracies.) When this small perturbation is added to 0 0 T , the degeneracy is slightly removed. The eigenfunctions of T + Tp for an -block are then Russell-Saunders states, i.e. they are simultaneous eigenfunctions of theR total 2 2 angular momentum operator L , its z-component Lz, the total spin operator S , and its z-component Sz. We can ask how many linearly independent configurations there are in a ground-state -block. The answer is that when the Pauli principle is taken into account, R the number of configurations mk in an -block is given by the binomial coefficient R

ns ns! = mk (E.69) Nv Nv!(ns Nv)! ≡   − where ns is the number of atomic spin-orbitals in the highest filled shell, while Nv is the number of valence electrons. For the ground-state configurations illustrated in Tables 4.3 and 4.4, Nv is the number of valence electrons. For example, for the ground state of lithium, we are starting to fill the n = 2 shell, for which ns=8. Since there is only one valence electron, Nv=1 and

ns 8 8! mk for lithium ground state: = = = 8 (E.70) Nv 1 1!(8 1)!     − Tables 4.3 and 4.4 show the roots of T 0 and their corresponding spectral terms for the first row of the periodic table. For the lithiumlike isoelectronic series, one of the roots corresponds to a 2S state and the other corresponds to a 2P state. We now remember that the total degeneracy of a Russell-Saunders state is given by (2L + 1) (2S + 1), where L and S are respectively the quantum numbers of total orbital angular× momentum and total spin. The 2S state is thus 2-fold degenerate, while the 2P state is 6-fold degenerate. By diagonalizing the ground-state -block for lithium, we thus obtain 8 Russell-Saunders states, the same as the number ofR Pauli-allowed states in the block. With a little effort, the reader can verify that the number of Russell-Saunders states shown in Tables 4.3 and 4.4 corresponds to the number of Pauli-allowed configurations given by the binomial coefficient in equation (E.69). Of course for the lithium ground state, the construction of 302 QUANTUM THEORY

0 Table E.2: Eigenvalues of the 2-electron interelectron repulsion matrix Tν0,ν for S=1, 0 MS=1, n = 2 and n =3, 4, 5.

n0=3 n0 = 4 n0=5 λκ term λκ term λκ term | | | | | | .108252 3S .077484 3S .056075 3S .134734 3P .087582 3P .065019 3P .135408 3D .090845 3D .061128 3P .138421 3P .093401 3P .063370 3D .155155 3F .099235 3F .067758 3F .160439 3P .099991 3P .067934 3P .165613 3D .104253 3D .070494 3D .168814 3S .106271 3D .071269 3D .173917 3D .107976 3S .072413 3F .186893 3P .108188 3F .072857 3S .111210 3G .073295 3G .111264 3F .073588 3G .113313 3P .073920 3F .114381 3D .074306 3G .074578 3H .074963 3F .075173 3P .075545 3D

Russell-Saunders states is trivial, while for beryllium it can be accomplished with the help of Clebsch-Gordan coefficients. However for carbon, where

ns 8 8! mk for carbon ground state: = = = 70 (E.71) Nv 4 4!(8 4)!     − the construction of the Russell-Saunders states is non-trivial. An indication of the nature of the states obtained is given in Table 4.5. As can easily be verified, the sum of the degeneracies shown in the table is equal to 70. E.7. -BLOCKS, INVARIANT SUBSETS AND INVARIANT BLOCKS 303 R

Table E.3: Roots of the ground state -block of the interelectron repulsion matrix for the Li-like, Be-like, B-like and C-like isoelectronicR series.

Li-like Be-like B-like C-like λκ term λκ term λκ term λκ term | | | | | | | | 0.681870 2S 0.986172 1S 1.40355 2P 1.88151 3P

0.729017 2P 1.02720 3P 1.44095 4P 1.89369 1D

1.06426 1P 1.47134 2D 1.90681 1S

1.09169 3P 1.49042 2S 1.91623 5S

1.10503 1D 1.49395 2P 1.995141 3D

1.13246 1S 1.52129 4S 1.96359 3P

1.54037 2D 1.98389 3S

1.55726 2P 1.98524 1D

1.99742 1P

2.04342 3P

2.05560 1D

2.07900 1S 304 QUANTUM THEORY

0 Table E.4: Roots of the ground state -block of the interelectron repulsion matrix Tν0ν for the N-like, O-like, F-like and Ne-likeR isoelectronic series.

N-like O-like F-like Ne-like λκ term λκ term λκ term λκ term | | | | | | | | 2.41491 4S 3.02641 3P 3.68415 2P 4.38541 1S

2.43246 2D 3.03769 1D 3.78926 2S

2.44111 2P 3.05065 1S

2.49314 4P 3.11850 3P

2.52109 2D 3.14982 1P

2.53864 2S 3.24065 1S

2.54189 2P

2.61775 2P E.7. -BLOCKS, INVARIANT SUBSETS AND INVARIANT BLOCKS 305 R

0 Table E.5: Eigenvalues of T 0 for the carbon-like ν = √3 block. ν ,ν R

λκ term degen. configuration | |

1.88151 3P 9 .994467(1s)2(2s)2(2p)2+.105047(1s)2(2p)4

1.89369 1D 5 .994467(1s)2(2s)2(2p)2–.105047(1s)2(2p)4

1.90681 1S 1 .979686(1s)2(2s)2(2p)2+.200537(1s)2(2p)4

1.91623 5S 5 (1s)2(2s)(2p)3

1.95141 3D 15 (1s)2(2s)(2p)3

1.96359 3P 9 (1s)2(2s)(2p)3

1.98389 3S 3 (1s)2(2s)(2p)3

1.98524 1D 5 (1s)2(2s)(2p)3

1.99742 1P 3 (1s)2(2s)(2p)3

2.04342 3P 9 .105047(1s)2(2s)2(2p)2–.994467(1s)2(2p)4

2.05560 1D 5 .105047(1s)2(2s)2(2p)2+.994467(1s)2(2p)4

2.07900 1S 1 .200537(1s)2(2s)2(2p)2–.979686(1s)2(2p)4 306 QUANTUM THEORY E.8 Invariant subsets based on subshells; Classifica- tion according to ML and Ms When we are dealing with configurations corresponding to excited states of an atom, the number of Pauli-allowed states in an -block may be very large, and we can ask whether our large basis set of Goscinskian configurationsR W can be divided into smaller invariant subsets Wk. A little thought reveals that this is indeed possible. The symmetry operations 0 that leave T invariant do not affect the radial parts Rn,l(r) of the atomic orbitals, so sets of configurations built from the subshell sets ((n, l), (n0l0), (n00, l00),...) will be invariant subsets of our large basis set W under the operations of . The results will then differ slightly from those obtained in the large-Z approximation,G which allows mixing between subshells. As an example of an invariant subset based on subshells we can consider the set of configurations corresponding to (1s)(2d)2 for the lithium-like isoelectronic series:

2 10 (1s)(3d)2 = 90 (E.72) 1 × 2     The invariant subset contains 90 configurations. Diagonalization of the 90 90 block yields the Russell Saunders states shown in Table 4.5. × We can also pick subsets Wk characterized by particular eigenvalues of Sz and Lz. These reductions in the size of the invariant subsets and the invariant blocks make it feasible to generate symmetry-adapted basis sets automatically also in the case of highly excited configurations. The use of symmetry-adapted basis sets leads to accurate calculations as is illustrated in Tables 4.7-4.9. E.8. INVARIANT SUBSETS BASED ON SUBSHELLS; CLASSIFICATION ACCORDING TO ML AND MS307

Table E.6: This table shows the multiplets generated by diagonalizing the energy-invariant interelectron repulsion matrix T 0 for the 90 90 block the Hamiltonian based on neutral × 2 lithium configurations corresponding to (1s)(3d) , with Rν = √11/3. The reader can verify that the sum of the degeneracies of the multiplets is 90. The energies shown are for neutral lithium.

λκ term degen. energy | |

.270978 4F 28 –4.63798

.271649 2F 14 –4.63594

.278128 2D 10 –4.63526

.278998 4P 12 –4.61623

.279669 2P 6 –4.61359

.281871 2G 18 –4.61155

.297850 2S 2 –4.55650 308 QUANTUM THEORY

Table E.7: 3S excited state energies calculated with 78 Goscinskians, using the crude relativistic correction described in the text. The calculation of similar tables for 1P, 3P, 1D, 3D, doubly excited autoionizing states, etc., is equally easy, rapid, and of comparable accuracy. Tables are given in Chapter 4 in [Avery and Avery, 2006], but may easily be reproduced using our programs, as shown in Tutorial 1 on [Avery and Avery, 2006a].

He Li+ Be2+ B3+ C4+ N5+

1s2s 3S –2.1737 –5.1085 –9.2957 –14.735 –21.427 –29.373 expt. –2.1750 –5.1109 –9.2983 –14.738 –21.429 –29.375 1s3s 3S –2.0683 –4.7509 –8.5459 –13.454 –19.476 –26.612 expt. –2.0685 –4.7522 –8.5480 –13.457 –19.478 –26.614 1s4s 3S –2.0364 –4.6365 –8.2999 –13.027 –18.820 –25.678 expt. –2.0363 –4.6373 –8.3015 –13.030 –18.822 –25.680 1s5s 3S –2.0226 –4.5859 –8.1896 –12.835 –18.522 –25.253 expt. –2.0224 –4.5862 –8.1905 –18.524 –25.254 1s6s 3S –2.0154 –4.5591 –8.1309 –12.732 –18.363 –25.024 expt. –2.0152 –4.5592 –18.364 1s7s 3S –2.0112 –4.5432 –8.096 –12.67 –18.267 –24.888 expt. –2.0109 –4.5431 –18.268 1s8s 3S –2.0085 –4.5330 –8.0736 –12.631 –18.206 –24.799 expt. –2.0082 –4.5328 –18.206 1s9s 3S –2.0067 –4.5261 –8.0583 –12.604 –18.164 –24.739 expt. -2.0064 1s10s 3S –2.0051 –4.5212 –8.0475 –12.585 –18.134 –24.696 expt. –2.0051 E.8. INVARIANT SUBSETS BASED ON SUBSHELLS; CLASSIFICATION ACCORDING TO ML AND MS309

Table E.8: 1S ground and excited state energies (in Hartrees) for the 2-electron isoelec- tronic series. The basis set used consisted of 592 generalized Sturmians of the Goscinski type, using the crude relativistic correction described in the text. The whole table was computed approximately a second. Experimental values are taken from the NIST tables [National Institute for Standards and Technology (NIST)] (http://physics.nist.gov/asd).

He Li+ Be2+ B3+ C4+

1s2 1S –2.8956 –7.2716 –13.649 –22.028 –32.412 expt. –2.9034 –7.2798 –13.657 –22.035 –32.416 1s2s 1S –2.1441 –5.0350 –9.1768 –14.571 –21.218 expt. –2.1458 –5.0410 –9.1860 –14.582 –21.230 1s3s 1S –2.0607 –4.7303 –8.5112 –13.405 –19.414 expt. –2.0611 –4.7339 –8.5183 –13.415 –19.425 1s4s 1S –2.0333 –4.6280 –8.2844 –13.005 –18.791 expt. –2.0334 –4.6299 –8.2891 –18.800 1s5s 1S –2.0221 –4.5858 –8.1889 –12.833 –18.520 expt. –2.0210 –4.5825 –18.513 1s6s 1S –2.0147 –4.5579 –8.129 –12.729 –18.359 expt. –2.0144 –4.5571 1s7s 3S –2.0109 –4.5426 –8.0951 –12.67 –18.281 expt. –2.0104 –4.5418 1s8s 3S –2.0083 –4.5326 –8.0732 –12.641 –18.26 expt. –2.0079 1s9s 3S –2.0065 –4.5258 –8.0583 –12.626 –18.203 expt. –2.0062 1s10s 3S –2.0049 –4.521 –8.0476 –12.602 –18.162 expt. –2.0050 310 QUANTUM THEORY

Table E.9: 1D excited state energies for the 2-electron isoelectronic series, compared with experimental values taken from the NIST tables.

He Li+ Be2+ B3+ C4+ N5+

He Li+ Be2+ B3+ C4+ N5+

1s3d 1D –2.0555 –4.7218 –8.4990 –13.388 –19.387 –26.498 expt. –2.0554 –4.7225 –8.5012 –13.392 –19.396 –26.514 1s4d 1D –2.0312 –4.6246 –8.2801 –12.998 –18.779 –25.622 expt. –2.0311 –4.6252 –8.2824 –13.003 –18.788 –25.639 1s5d 1D –2.0200 –4.5797 –8.1790 –12.818 –18.497 –25.217 expt. –2.0198 –4.5801 –8.1807 –18.507 –25.234 1s6d 1D –2.0139 –4.5554 –8.1242 –12.721 –18.345 –24.997 expt. –2.0137 –4.5557 –18.354 1s7d 1D –2.0102 –4.5407 –8.0912 –12.662 –18.253 –24.865 expt. –2.0100 –4.5409 –18.262 1s8d 1D –2.0078 –4.5311 –8.0699 –12.624 –18.194 –24.779 expt. –2.0076 –4.5314 –18.202 1s9d 1D –2.0062 –4.5246 –8.0552 –12.598 –18.153 –24.720 expt. –2.0060 1s10d 1D –2.0050 –4.5199 –8.0447 –12.579 –18.124 –24.678 expt. –2.0048 1s11d 1D –2.0041 –4.5165 –8.0370 –12.566 –18.102 –24.647 expt. –2.0035 1s12d 1D –2.0032 –4.5139 –8.0311 –12.555 –18.086 –24.624 expt. –2.0033 E.9. AN ATOM SURROUNDED BY POINT CHARGES 311

E €€€€€€€€€ Z2

-0.58

-0.60

-0.62

-0.64

Z 10 20 30 40 50

Figure E.7: Here the lowest 3S energies of the heliumlike isoelectronic series are divided by Z2. The lower line is corrected for relativistic effects. The dots are experimental values. E.9 An atom surrounded by point charges

For a heavy atom surrounded by lighter atoms (ligands), the effect of the ligands is some- times approximated by considering the atom to be surrounded by an array of point charges with an appropriate symmetry. The potential experienced by such an atom is 0 00 V (x) = V0(x) + V (x) + V (x) (E.73) where V0 is the nuclear attraction potential N Z0 V0(x) = (E.74) − ri i=1 X and V 0 is the interelectron repulsion potential N N 1 V 0(x) = (E.75) rij j>i i=1 X X 00 while V (the “crystal field”) expresses the effect of a set of charges qa located at the positions Xa: N q V 00(x) = a (E.76) − xi Xa i=1 a X X | − | The generalized Sturmian secular equation analogous to (E.35) then becomes

0 00 δν0,νZ ν + Tν0,ν + Tν0,ν pκδν0,ν Cν,κ = 0 (E.77) ν R − X   312 QUANTUM THEORY where 00 1 ∗ 00 Tν0,ν Φν0 V Φν (E.78) ≡ −pκ h | | i 00 In order to evaluate Tν0,ν we must first calculate one-electron matrix elements of the form

00 3 ∗ qa vµ1,µ2 d x χµ1 (x)χµ2 (x) ≡ x Xa Z a | − | X l 3 ∗ r< = qa d xi χ (x)χµ (x) Pl(xˆ Xˆ a) µ1 2 rl+1 · a l Z  > a X X ∞ l 2 r< = qa dr r l+1 Rn1,l1 (r)Rn2,l2 (r) a l 0 r> a X X Z   ∗ dΩ Y (xˆ)Yl ,m (xˆ)Pl(xˆ Xˆ a) (E.79) × l1,m1 2 2 · Z where

l l+1 l r /Ra r < Ra r< l+1 (E.80) r> a ≡  l l+1    Ra/r Ra < r

If the points Xa are all equidistant from the central atom and if the charges are all equal, then equation (E.79) can be rewritten in the form:

4πq ∞ rl v00 = a dr r2 < R (r)R (r) µ1,µ2 2l + 1 l+1 n1,l1 n2,l2 a l 0 r> a X X Z   ∗ ∗ Y (Xˆ a) dΩ Y (xˆ)Yl ,m (xˆ)Yl,m(xˆ) × l,m l1,m1 2 2 Z ∗ ∗ Fl Y (Xˆ a) dΩ Y (xˆ)Yl ,m (xˆ)Yl,m(xˆ) ≡ l,m l1,m1 2 2 l a X X Z (E.81) where

∞ l 4πqa 2 r< Fl dr r Rn ,l (r)Rn ,l (r) ≡ 2l + 1 rl+1 1 1 2 2 Z0  > a (E.82)

00 00 Once we are in possession of vµ1,µ2 , the matrix element Tν1,ν2 can be evaluated by means of the generalized Slater-Condon rules discussed in Appendix D, Section D.1. For fixed 00 angular geometry, the matrix Tν1,ν2 turns out to be a function of the parameters

sa pκRa pκ Xa (E.83) ≡ ≡ | | E.9. AN ATOM SURROUNDED BY POINT CHARGES 313

where Xa is the distance of the ath charge qa from the central atom. The generalized Sturmian| secular| equations (E.77) can be solved as follows: We begin by picking values of sa The secular equation (E.77) is then solved, yielding roots pκ for the ground state and the excited states. The corresponding values of Xa are then known. This can be | | repeated for a number of sa values, yielding solutions as functions of the distances Xa . As an example, we can consider an atom surrounded by 8 equal point charges q at| the| positions

Xˆ = (+1, +1, +1)/√3 Xˆ = (+1, 1, 1)/√3 1 5 − − Xˆ = ( 1, +1, +1)/√3 Xˆ = ( 1, +1, 1)/√3 2 − 6 − − Xˆ = (+1, 1, +1)/√3 Xˆ = ( 1, 1, +1)/√3 3 − 7 − − Xˆ = (+1, +1, 1)/√3 Xˆ = ( 1, 1, 1)/√3 (E.84) 4 − 8 − − − which has cubic symmetry. We can also discuss the effect of a square-planar array

Xˆ 1 = (+1, 0, 0) Xˆ 3 = (0, +1, 0) Xˆ = ( 1, 0, 0) Xˆ = (0, 1, 0) (E.85) 2 − 4 − a linear array:

Xˆ = (0, 0, +1) Xˆ = (0, 0, 1) (E.86) 1 2 − a trigonal array:

Xˆ = (2√2/3, 0, 1/3) 1 − Xˆ = ( √2/3, √6/3, 1/3) 2 − − Xˆ = ( √2/3, √6/3, 1/3) (E.87) 3 − − − and a tetrahedral array

Xˆ 1 = (0, 0, 1) Xˆ = (2√2/3, 0, 1/3) 2 − Xˆ = ( √2/3, 2/3, 1/3) 3 − − Xˆ = ( √2/3, 2/3, 1/3) (E.88) 4 − −p − p The presence of a symmetrical set of charges surrounding a central atom reduces the symmetry of the system: Instead of being invariant under the the full group of rotations about the central atom (as well as rotations in spin space) the system is now invariant only under a restricted group of rotations. However, subsets of configurations that are closed under the operations of the full rotation group, are also closed under the more restricted group of rotations. Therefore, if we are willing to have invariant subsets of configurations that are larger than minimal, we can choose them almost as though the 314 QUANTUM THEORY

perturbing surrounding charges were not there. Thus the invariant subsets in the perturbed case can be established by following the prescription:

ν = (a; b) 0 0 00 00 a = (MS, (n, l), (n , l ), (n , l ),...) 0 0 00 00 b = ((m, ms), (m , ms), (m , ms ),...)

Wa = ( Φa,b , Φa,b , Φa,b ,...) (E.89) | 1 i | 2 i | 3 i The symmetry-adapted basis functions found by diagonalizing the invariant blocks can be used as basis functions for a more ambitious configuration interaction calculation. For example, let us think of a neutral lithium atom, surrounded by a square planar array of point charges. To make the example more specific, let us think of the set of configurations characterized by 10 (1s)2(3d) (E.90) 1   This set has 10 members, and they can be used as a basis for solving the generalized Sturmian secular equations in the presence of the square-planar array of point charges. In the absence of the charges, the 10 configurations are members of a 2D multiplet, and in the large-Z approximation its energy is

1 2 1 2 Eκ = (ZRν λ ) = (√19 .58281) = 7.1294 (E.91) −2 − | | −2 − − This corresponds to the effective charge

pκ ZRν λ Q = = − | | = 2.5989 (E.92) Rν Rν where 1 1 1 √19 Rν = 2 + 2 + 2 = λ = .58281 (E.93) r1 1 3 3 | | In order to calculate T 00 and find the effect of the point charges (which we treat here as a small perturbation), we need to evaluate the radial integrals:

∞ l .19193 l = 0 2 r< 2 dr r l+1 R3,2(r) = .10363 l = 2 0 r> | |  Z  a  .06826 l = 4 (E.94) 

with Xa = 5 and | | 7/2 2Q 2 2 −Qr/3 R3,2(r) = r e (E.95) 81 r15 E.9. AN ATOM SURROUNDED BY POINT CHARGES 315

The angular integrals needed for constructing T 00 are

∗ ˆ dΩ Y2,m1 (xˆ)Y2,m2 (xˆ)P0(xˆ Xa) = 4δm1,m2 a · X Z ∗ ˆ dΩ Y2,m1 (xˆ)Y2,m2 (xˆ)P1(xˆ Xa) = 0 a · X Z ∗ ˆ dΩ Y2,m1 (xˆ)Y2,m2 (xˆ)P3(xˆ Xa) = 0 (E.96) a · X Z and

∗ ˆ dΩ Y2,m1 (xˆ)Y2,m2 (xˆ)P2(xˆ Xa) a · X Z 4 0 0 0 0 7   2  0 0 0 0     − 7       4  =  0 0 0 0  (E.97)  − 7       2   0 0 0 0   − 7       4     0 0 0 0   7    while

∗ ˆ dΩ Y2,m1 (xˆ)Y2,m2 (xˆ)P4(xˆ Xa) a · X Z 1 5 0 0 0 14 6   2  0 0 0 0     − 7       3  =  0 0 0 0  (E.98)  7       2   0 0 0 0   − 7       5 1     0 0 0   6 14    316 QUANTUM THEORY

B1

B2

A1 E

2 Figure E.8: This diagram shows schematically the splitting of the D1/2 multiplet of neutral lithium discussed above in the field of a square-planar array of point 2 charges. The D−1/2 multiplet which is similarly split is not shown. Besides being split by the presence of the charges, the multiplet is also displaced in energy, as is the lithium ground state.

T2 g

Eg

Figure E.9: This figure shows the same multiplet as Figure 4.8, but the splitting is due to an octohedral array of point charges. E.9. AN ATOM SURROUNDED BY POINT CHARGES 317

3.0

2.5

2.0

1.5

1.0

0.5

0.0 -3 -2 -1 0 1 2 3

Figure E.10: This figure shows the angular dependence of one of the T1u orbitals produced when degenerate f-orbitals are split by the presence of an octohedral arrangement of point charges. The T1u irreducible representation of the group Oh is a 3-dimensional representation. The other two components are shown in Figure 4.11. 318 QUANTUM THEORY

3.0

2.5

2.0

1.5

1.0

0.5

0.0 -3 -2 -1 0 1 2 3

3.0

2.5

2.0

1.5

1.0

0.5

0.0 -3 -2 -1 0 1 2 3

Figure E.11: This figure shows the angular dependence of the remaining two T1u orbitals. Both in this figure and in Figure 4.10, the vertical axis represents θ and the horizontal axis represents φ. Appendix F

MOLECULAR ORBITALS BASED ON STURMIANS

F.1 The one-electron secular equation

Molecular orbitals may be represented as superpositions of Coulomb Sturmian basis func- tions (Appendix D) centered on the nuclei of a molecule. These basis functions are an exam- ple of Exponential-Type Orbitals (ETO’s) [Harris and Michels, 1967], [Pinchon and Hoggan 2007], [Pinchon and Hoggan, 2009], [Weatherford, 1982], and calculations using them can poten- tially be much more accurate than calculations based on Gaussians. Gaussian basis functions have serious drawbacks, since very many of them are needed to approximate the molecular orbitals, and since the cusp at the nucleus is never adequately represented. Furthermore, Gaussian basis functions cannot accurately represent the expo- nential decay of the orbitals at large distances from the nuclei. Thus while the mainstream effort of quantum chemistry today follows the path of Gaussian technology, there exists a small group of researchers who struggle with the difficult mathematical problems involved in using exponential-type orbitals (ETO’s) as basis functions, and we hope that the present chapter will make a contribution to this effort. We will see that the automatic scaling prop- erties associated with the Generalized Sturmian Method have advantages also in the case of molecules, and that molecular orbitals based on many-center Coulomb Sturmians have ad- vantages over other ETO’s with respect to the ease of evaluation of interelectron repulsion integrals. We will first consider the use of Coulomb Sturmian basis functions located on the different atoms of a molecule to solve the 1-electron molecular Schr¨odingerequation, an endeavor which was pioneered by C.E, Wulfman, B. Judd, T. Koga, V. Aquilanti and others [Shibuya and Wulfman, 1965], [Wulfman, 2011], [Judd, 1975], [Koga et al., 1984]- [Koga et al., 1991], [Aquilanti et al., 1996],[Aquilanti et al., 1997]. These authors solved the Schr¨odingerequation in momentum space, but here we will use a direct-space treatment to reach the same results. In this approach to molecular orbital theory, we search for

319 320 QUANTUM THEORY

solutions to the one-electron Schr¨odingerequation

1 2 + v(x) ζ ϕζ (x) = 0 (F.1) −2∇ −   where v(x) is the Coulomb attraction potential of the nuclei: Z v(x) = a (F.2) x X − a a X | − | cWe will approximate the molecular orbitals ϕζ (x) by superpositions of Coulomb Sturmian atomic orbitals centered on the various atoms of the molecule. To do this it is convenient to introduce a notation where τ stands for a set of four indices, the first three being the quantum numbers of a one-electron Coulomb Sturmian basis function of the type discussed in Appendix B, while the final index, a, is the index of the nucleus on which the atomic orbital is centered: τ (n, l, m, a) (F.3) ≡ In this notation we can write

χτ (x) χnlm(x Xa) (F.4) ≡ − A molecular orbital is then represented by a superposition of the form

ϕζ (x) = χnlm(x Xa)Cτ,ζ χτ (x)Cτ,ζ (F.5) − ≡ nlma τ X X The normalization condition for the molecular orbitals is

3 ∗ ∗ 1 = d xj ϕζ (xj)ϕζ (xj) = Cτ 0,ζ mτ 0,τ Cτ,ζ (F.6) τ 0 τ Z X X where 3 ∗ mτ 0,τ d xj χ 0 (xj)χτ (xj) (F.7) ≡ τ Z is the matrix of many-center Sturmian overlap integrals. The matrix mτ 0,τ may be evaluated using the properties of hyperspherical harmonics, and we will discuss below the details of how this may be done. Coulomb Sturmian basis functions are discussed in detail in Appendix B. They have exactly the same form as the familiar hydrogenlike atomic orbitals,

χnlm(x) = Rn,l(r)Ylm(θ, φ) (F.8)

except in the radial part, Rn,l(r), the factor Z/n is replaced by a constant, k. The first few Coulomb Sturmian radial functions are 3/2 −kr R1,0(r) = 2k e 3/2 −kr R , (r) = 2k (1 kr)e 2 0 − 1 3/2 −kr R2,1(r) = 2k kr e √3 2 2 3/3 2k r −kr R , (r) = 2k 1 2kr + e (F.9) 3 1 − 3   F.1. THE ONE-ELECTRON SECULAR EQUATION 321

The reader can verify that these are precisely the same as hydrogenlike atomic orbitals with the replacement Z/n k. We now substitute the superposition (F.5) into the one-electron Schr¨odingerequation→ (F.1). This gives us:

1 2 1 2 + k + v(x) χnlm(x Xa)Cτ,ζ −2∇ 2 − nlma X   1 1 2 + k2 + v(x) χ (x)C = 0 (F.10) 2 2 τ τ,ζ ≡ τ − ∇ X   with 1 2 ζ k (F.11) ≡ −2 where each of the Coulomb Sturmian atomic orbitals χτ (x) χnlm(x Xa) obeys a one- electron Schr¨odingerequation of the form ≡ −

1 2 1 2 nk + k χnlm(x Xa) = 0 (F.12) −2∇ 2 − x Xa −  | − | Taking the scalar product of (F.10) with a conjugate function in our basis set, we obtain

2 3 ∗ 1 2 k d xχ 0 (x) + + v(x) χ (x)C = 0 (F.13) τ 2 2 τ τ,ζ τ − ∇ X Z   With the notation 1 3 ∗ Wτ 0,τ d xχ 0 (x)v(x)χτ (x) (F.14) ≡ −k τ Z and 2 1 3 ∗ 1 2 k Sτ 0,τ d xχ 0 (x) + χτ (x) ≡ k2 τ −2∇ 2 Z   n 3 ∗ 1 = d xχτ 0 (x) χτ (x) (F.15) k x Xa Z | − | we obtain a secular equation of the form

[Wτ 0,τ kSτ 0,τ ] Cτ,ζ = 0 (F.16) τ − X We can call Wτ 0,τ the Wulfman integrals to honor the pioneering contributions of Prof. C.E. Wulfman. The integrals Sτ 0,τ are called Shibuya-Wulfman integrals [Avery, 1989], [Avery, 2000], [Avery and Avery, 2006], [Judd, 1975], [Shibuya and Wulfman, 1965], and methods for their evaluation are discussed below. It can be shown [Koga and Matsuhashi, 1987] that the matrix elements of the many-center potential Wτ 0,τ can be expressed in terms of the Shibuya-Wulfman integrals by means of the sum rule n0n Wτ 0,τ = Kτ 0,τ 00 Kτ 00,τ (F.17) Za0 Za r τ 00 X 322 QUANTUM THEORY

where Za0 Za 0 S 0 Kτ ,τ 0 τ ,τ (F.18) ≡ r n n With the help of this sum rule, the secular equations (F.16) can be rewritten in the form

0 Kτ 0,τ 00 Kτ 00,τ kKτ 0,τ C = 0 (F.19) − τ,ζ τ " τ 00 # X X with

0 Za Cτ,ζ = Cτ,ζ (F.20) r n Now suppose that we have solved the secular equation

0 [Kτ 0,τ kδτ 0,τ ] Cτ,ζ (F.21) τ − X 0 The values of k and Cτ,ζ thus obtained will also be solutions to (F.19). To see this, we perform the sum over τ 00 in (F.19), making use of (F.21):

0 Kτ 0,τ 00 Kτ 00,τ kKτ 0,τ C − τ,ζ τ " τ 00 # X X 0 0 = k Kτ 0,τ 00 δτ 00,τ C kδτ 0,τ C τ,ζ − τ,ζ τ " τ 00 # X X 0 = k [Kτ 0,τ kδτ 0,τ ] Cτ,ζ = 0 (F.22) τ − X Since we have two forms of the molecular Sturmian secular equation, (F.16) and (F.21), one might ask which form is the best. The answer is that if the number of basis functions used is small, (F.16) give the most accurate results. However, particularly for small values of the parameter S, equation (F.21) suffers from problems of overcompleteness when the number of basis functions is increased. By contrast, as Monkhorst and Jeziorski have pointed out [Monkhorst and Jeziorski, 1979], equation (F.16) has no such problems, and therefore it is the method of choice when the basis set used is very large or when S is small. We can call the matrix Kτ 0,τ the Koga matrix to honor the contributions of Prof. T. Koga and his group. F.1. THE ONE-ELECTRON SECULAR EQUATION 323

E

-0.5

-1.0

-1.5

-2.0 R 0 2 4 6 8

+ Figure F.1: Energies of the ground state and excited states of H2 , calculated by solving equation (F.21). The energies are expressed in Hartrees and are given as a function of the internuclear separation R, expressed in Bohrs.

E

-0.1

-0.2

-0.3

-0.4

-0.5

-0.6

-0.7

-0.8 R 0 2 4 6 8

+ Figure F.2: A closer view of the excited state energies of H2 . In the united-atom limit, these energies approach those of the excited states of He+, i.e., 4/(2n2). 324 QUANTUM THEORY

4

2

0

-2

-4

-4 -2 0 2 4

+ Figure F.3: The ground state molecular orbital of the H2 ion at nuclear separa- tion R=1.21702 Bohrs (S=2, k=1.64335). In the united-atom limit, k=2.

4

2

0

-2

-4

-4 -2 0 2 4

Figure F.4: The same state at nuclear separation 2.98216 Bohrs (S=4, k=1.34131). F.1. THE ONE-ELECTRON SECULAR EQUATION 325

4

2

0

-2

-4

-4 -2 0 2 4

Figure F.5: Here the internuclear distance has been increased to 5.13325 Bohrs (S=6, k=1.16885).

4

2

0

-2

-4

-4 -2 0 2 4

Figure F.6: The same state with nuclear separation 7.50577 Bohrs (S=8, k=1.06585). As the nuclear separation increases, k approaches 1. 326 QUANTUM THEORY F.2 Shibuya-Wulfman integrals and Sturmian over- lap integrals evaluated in terms of hyperpherical harmonics

The Shibuya-Wulfman integrals Sτ 0,τ defined by equation (F.15) as well as the molecular Sturmian overlap integrals

3 ∗ mτ 0,τ d x χ 0 (x)χτ (x) (F.23) ≡ τ Z can conveniently be evaluated in reciprocal space. Let us first consider the Sturmian overlap integrals. Using the fact that

3 ∗ 3 t∗ t mτ 0,τ d x χ 0 (x)χτ (x) = d p χ 0 (p)χ (p) (F.24) ≡ τ τ τ Z Z where, if we let

µ (n, l, m) (F.25) ≡ while τ (n, l, m, a), then ≡

t 1 3 −ip·x −ip·Xa t χ (p) d x e χτ (x) = e χ (p) τ ≡ (2π)3/2 µ Z 1 t∗ 3 ip·x ∗ ip·Xa0 t∗ χ 0 (p) d x e χ 0 (x) = e χ (p) (F.26) τ ≡ (2π)3/2 τ µ Z We thus obtain

3 ip·R t∗ t mτ 0,τ = d p e χµ0 (p)χµ(p) (F.27) Z where

R Xa0 Xa (F.28) ≡ − We now make use of V. Fock’s relationship [Fock, 1935], [Fock, 1958]

t χµ(p) = M(p)Yn−1.l.m(uˆ) (F.29) where 4k5/2 M(p) (F.30) ≡ (k2 + p2)2 In equation (F.29), uˆ is a 4-dimensional unit vector that defines Fock’s projection of mo- mentum space onto the surface of a 4-dimensional hypersphere. 2kp 2kp 2kp k2 p2 uˆ = (u , u , u , u ) = 1 , 2 , 3 , − (F.31) 1 2 3 4 k2 + p2 k2 + p2 k2 + p2 k2 + p2   F.2. SHIBUYA-WULFMAN INTEGRALS AND STURMIAN OVERLAP INTEGRALS EVALUATED IN TERMS OF HYPERPHERICAL HARMONICS327

while Yµ(uˆ) is a 4-dimensional hyperspherical harmonic defined by:

1+l Yλ,l,m(uˆ) = λ,lC (u )Yl,m(u , u , u ) (F.32) N λ−l 4 1 2 3 where

λ l 2(λ + 1)(λ l)! λ,l = ( 1) i (2l)!! − (F.33) N − s π(λ + l + 1)! is a normalizing constant while

[j/2] ( 1)tΓ(n + α t) Cα(u ) = − − (2u )j−2t (F.34) j 4 t!(j 2t)!Γ(α) 4 t=0 X − is a Gegenbauer polynomial, and where Yl,m is a familiar 3-dimensional spherical harmonic. The first few hyperspherical harmonics are shown in Table 5.1. The index λ corresponds t to n 1 so that the Fourier transform of χ , , (x) is given by χ (p) = M(p)Y , , (uˆ) = − 1 0 0 1,0,0 0 0 0 M(p)/(√2π), and so on. Substituting (F.29) into (F.27), we obtain

3 ip·R 2 ∗ mτ 0,τ = d p e M(p) Yn0−1,l0,m0 (uˆ)Yn−1,l,m(uˆ) Z 3 ip·R 2 ∗ d p e M(p) Y 0 (uˆ)Yµ(uˆ) (F.35) ≡ µ Z (Here, and throughout the book, a unit vector is indicated by a “hat”.) One can show [Avery, 1989] that the Shibuya-Wulfman integrals can be written in a similar form:

3 3 ip·R 2k ∗ S 0 = d p e Y 0 (uˆ)Y (uˆ) (F.36) τ ,τ k2 + p2 µ µ Z   One can also show [Caligiana, 2003] that

2k 3 d3p eip·R Y ∗(uˆ) = (2π)3/2f (S)Y (Sˆ) (F.37) k2 + p2 µ n,l l,m Z  

where Yl,m is an ordinary 3-dimensional spherical harmonic and where

S = Sx,Sy,Sz kR S = k R (F.38) { } ≡ | |

The function fn,l(S) is defined by

3/2 1 (n l)(n + l + 1) k fn,l Rn,l − Rn+1,l ≡ − 2s n(n + 1) 1 (n + l)(n l 1) − − Rn− ,l (F.39) −2 n(n 1) 1 s − 328 QUANTUM THEORY

where Rn,l is the radial function of the Coulomb Sturmians given in equation (F.9), and where Rn− ,l 0 if l > n 1 (F.40) 1 ≡ − Similarly, one can show [Caligiana, 2003] that

3 ip·R 2 3/2 d p e M(p) Yµ(uˆ) = (2π) gn,l(S)Yl,m(Sˆ) (F.41) Z where

1 (n l)(n + l + 1) gn,l fn,l − fn+1,l ≡ − 2s n(n + 1) 1 (n + l)(n l 1) − − fn− ,l (F.42) −2 n(n 1) 1 s − where we define fn− ,l 0 if l > n 1 (F.43) 1 ≡ − The first few values of fn,l(S) and gn,l(S) are shown in Table 5.2. Equations (F.37) and (F.41) are respectively identical with the Shibuya Wulfman in- tegrals and the molecular Sturmian overlap integrals except that they contain only one 4-dimensional hyperspherical harmonic instead of a product of two. Thus the problem of evaluating both Sτ 0,τ and mτ 0,τ reduces to the problem of evaluating the coefficients

∗ ∗ cµ00;µ0,µ = dΩ4Yµ00 (uˆ)Yµ0 (uˆ)Yµ(uˆ) (F.44) Z These coefficients can readily be pre-evaluated once and for all using the hyperangular integration theorems discussed in Appendix C, and they can be stored as a large but very sparse matrix. We then obtain the relationships:

∗ Yµ0 (uˆ)Yµ(uˆ) = Yµ00 (uˆ)cµ00;µ0,µ (F.45) µ00 X

3/2 Sτ 0,τ = (2π) Yl00,m00 (Sˆ)fn00,l00 (S)cµ00;µ0,µ (F.46) µ00 X and 3/2 mτ 0,τ = (2π) Yl00,m00 (Sˆ)gn00,l00 (S)cµ00;µ0,µ (F.47) µ00 X Similar methods can be used to calculate the Wulfman integrals Wτ 0,τ [Avery, 2000]. The first few Shibuya-Wulfman integrals are shown in Table 5.3. We can notice that when S = 0 the diagonal elements are 1, while the off-diagonal elements vanish. The first few displaced Coulomb Sturmian overlap integrals mτ 0,τ are shown in Table 5.4. F.2. SHIBUYA-WULFMAN INTEGRALS AND STURMIAN OVERLAP INTEGRALS EVALUATED IN TERMS OF HYPERPHERICAL HARMONICS329

Table F.1: gn,l(S) and fn,l(S), where S k Xa0 Xa . The functions gn,l(S) and fn,l(S) appear respectively in the two-center overlap≡ | integrals− and| the Shibuya-Wulfman integrals.

n l gn,l(S) fn,l(S)

S 2 e− 3 + 3S + S S 1 0 e− (1 + S) 3 

1 S 2 3 2 S 2 2 0 e− 3 + 3S + 2S + S e− S −6 −3 

S 2 S e− S 3 + 3S + S 2e S(1 + S) 2 1 − 6√3  3√3

1 S 4 1 S 2 3 0 e− S e− S ( 2 + S) 15 3 −

S 3 S 2 e− S (1 + S) e− S(1 + S S ) 3 1 − − 15√2 3√2

e SS2(3 + 3S + S2) e SS2(1 + S) 3 2 − − 15√10 3√10 330 QUANTUM THEORY

Table F.2: This table shows the first few Shibuya-Wulfman integrals Sτ 0,τ , as functions of S k(Xa0 Xa), with S S and S (S sin θ cos φ, S sin θ sin φ, S cos θ). The integrals were≡ generated− by means of≡ equation | | (F.46).≡

τ 0 τ = (1, 0, 0, a) τ = (2, 0, 0, a)

2 (1, 0, 0, a0) e−S(1 + S) e−SS2 −3

2 1 (2, 0, 0, a0) e−SS2 e−S(3 + 3S 2S2 + S3) −3 3 −

√2 1 (2, 1, 1, a0) e−SS(1 + S) sin θ eiφ e−SS( 1 S + S2) sin θ eiφ − − 3 3√2 − −

2 1 (2, 1, 0, a0) e−SS(1 + S) cos θ e−SS( 1 S + S2) cos θ −3 3 − −

√2 1 (2, 1, 1, a0) e−SS(1 + S) sin θ e−iφ e−SS( 1 S + S2) sin θ e−iφ 3 −3√2 − − F.2. SHIBUYA-WULFMAN INTEGRALS AND STURMIAN OVERLAP INTEGRALS EVALUATED IN TERMS OF HYPERPHERICAL HARMONICS331

3 ∗ Table F.3: The first few overlap integrals mτ 0,τ d xχτ 0 (x)χτ (x) between displaced Coulomb Sturmians. The definitions of S, θ and≡φ are the same as in Table 5.3. The R integrals were evaluated by means of equation (F.47).

τ 0 τ = (1, 0, 0, a) τ = (2, 0, 0, a)

1 1 (1, 0, 0, a0) e−S(3 + 3S + S2) e−S(3 + 3S + 2S2 + S3) 3 −6

1 1 (2, 0, 0, a0) e−S(3 + 3S + 2S2 + S3) e−S(15 + 15S + 5S2 + S4) −6 15

1 1 (2, 1, 1, a0) e−SS(3 + 3S + S2) sin θ eiφ e−SS3(1 + S) sin θ eiφ − −6√2 15√2

1 1 (2, 1, 0, a0) e−SS(3 + 3S + S2) cos θ e−SS3(1 + S) cos θ −6 15

1 1 (2, 1, 1, a0) e−SS(3 + 3S + S2) sin θ e−iφ e−SS3(1 + S) sin θ e−iφ 6√2 −15√2 332 QUANTUM THEORY F.3 Molecular calculations using the isoenergetic con- figurations

We now introduce N-electron configurations which are Slater determinants of the form:

ϕζ (x ) ϕζ0 (x ) ϕζ00 (x ) 1 1 1 ··· 1 ϕζ (x2) ϕζ0 (x2) ϕζ00 (x2) Φ = ϕ ϕ 0 ϕ 00 ··· (F.48) ν ζ ζ ζ ϕ (x ) ϕ 0 (x ) ϕ 00 (x ) | i | · · · | ≡ √N! ζ 3 ζ 3 ζ 3 . . . ··· . . .

where the molecular spin-orbitals ϕ (x) satisfy ζ 1 k2 Z 2 + + β v(x ) ϕ (x ) = 0 v(x ) = a (F.49) 2 j 2 ν j ζ j j x X − ∇ a j a   X | − | Since the individual molecular orbitals satisfy (F.49), the configurations Φν are solutions to the separable N-electron equation: | i

N 2 1 2 k + + βνv(xj) Φν = 0 (F.50) −2∇j 2 | i j=1 X   which can also be written in the form:

N 2 1 2 k + + βνV (x) Φν −2∇j 2 0 | i " j=1 # X   N 1 2 = + βνV (x) Eκ Φν = 0 (F.51) −2 ∇j 0 − | i " j=1 # X where x (x , x ,..., xN ) and ≡ 1 2 N k2 Nk2 Eκ = = (F.52) − 2 − 2 j=1 X and where N N Za V0(x) v(xj) = (F.53) ≡ xj Xa j=1 j=1 a X X X | − | Comparing (F.51) with (E.6), we can see that they are the same. Thus the isoenergetic solutions to the approximate N-electron Schr¨odinger equation (F.51) form a generalized Sturmian basis set. We would like to use these configurations to build up solutions to the N-electron Schr¨odingerequation

N 2 1 2 k + + V (x) Ψκ = 0 (F.54) −2∇j 2 | i " j=1 # X   F.3. MOLECULAR CALCULATIONS USING THE ISOENERGETIC CONFIGURATIONS333 with N 1 V (x) = V (x) + (F.55) 0 r i>j ij X Thus we write Ψκ Φν Bνκ (F.56) | i ≈ ν | i X Substituting this into the N-electron Schr¨odinger equation, and taking the scalar product with a conjugate configuration, we obtain the secular equations:

N 2 1 2 k Φν0 + + V (x) Φν Bνκ = 0 (F.57) h | −2∇j 2 | i ν " j=1 # X X   We now introduce a k-independent matrix representing the total potential based on the configurations Φν : | i (N) 1 T 0 Φν0 V (x) Φν (F.58) ν ν ≡ −k h | | i and another k-independent matrix

N 2 (N) 1 ∗ 1 2 k S 0 Φ 0 + Φν (F.59) ν ν ≡ k2 h ν | −2∇j 2 | i j=1 X   In terms of these matrices, the secular equations become:

(N) (N) Tν0ν kSν0ν Bνκ = 0 (F.60) ν − X h i Nk2 Solving equation (F.60), we obtain k for each state κ and thus the energy Eκ = . For − 2 a given state κ, the value of k then determines the weighting factors βν1 , βν2 ,... needed to make each configuration Φν1 , Φν2 ,... correspond to the same energy Eκ. | i | i (N) (N) In order to build the N-electron matrices Tν0ν and Sν0ν and solve equation (F.60), we must first obtain the coefficients Cτζ by solving (F.16) or (F.21). In the case of diatomic molecules, we begin by picking a value of the parameter S = kR, where R is the interatomic distance and k is the exponent of the Coulomb Sturmian basis set. Neither R nor k is known at this point, but only their product S. As we shall see below, for the diatomic case, all of the integrals involved in equations (F.16) and (F.21) are pure functions of S. Having chosen S, we can thus solve the one-electron secular equations and obtain the coefficients Cτζ and the spectrum of ratios k/βν. We are then able to solve equation F.60, which gives 2 us a spectrum of k-values, and thus energies Nk /2, and the eigenvectors Bνκ. From a k-value, we also get the unscaled distance R =−S/k. We repeat the procedure for a range of S-values and interpolate to find the solutions as functions of R. In the case of polyatomic molecules, one can choose a set of angles between the nuclei; these are left fixed under scaling of the coordinate system. The procedure is then similar to that described for the diatomic case. 334 QUANTUM THEORY

F.4 Building T (N) and S(N) from 1-electron compo- ν0ν ν0ν nents

We have already discussed how the matrix of many-center Sturmian overlap integrals

3 ∗ mτ 0τ d xj χ 0 (xj)χτ (xj) (F.61) ≡ τ Z may be evaluated using the properties of hyperspherical harmonics (F.47). The matrix mτ 0τ is needed in order to normalize the molecular orbitals ϕζ (xj), the normalization condition on the coefficients Cτ,ζ being

3 ∗ ∗ 1 = d xj ϕζ (xj)ϕζ (xj) = Cτ 0,ζ mτ 0,τ Cτ,ζ (F.62) τ 0 τ Z X X Having performed the normalization, we then need to transform the nuclear attraction matrix elements Wτ 0τ to a representation based on the molecular orbitals:

3 ∗ ∗ v˜ζ0ζ d xj ϕ 0 (xj)v(xj)ϕζ (xj) = k C 0 0 Wτ 0τ Cτζ (F.63) ≡ ζ − τ ζ τ 0 τ Z X X

Once we are in possession of the 1-electron matrix elementsv ˜ζ0ζ , we can evaluate

0,(N) 1 ∗ T 0 Φ 0 V (x) Φν (F.64) ν ν ≡ −k h ν | 0 | i by means of the Slater-Condon rules. Because of the potential-weighted orthonormality relations obeyed by generalized Sturmian basis sets (Appendix B), we expect the matrix 0,(N) Tν0ν to be diagonal. We next transform the matrix of Shibuya-Wulfman integrals to a representation based on the molecular orbitals:

∗ S˜ζ0ζ C 0 0 τ 0τ Cτζ (F.65) ≡ τ ζ § τ 0 τ X X (N) From these 1-electron matrix elements, the N-electron matrix Sν0ν can be constructed with the help of the Slater-Condon rules. Finally we must deal with the difficult term

N 0(N) 1 ∗ 1 T 0 Φ 0 Φ (F.66) ν ν k ν r ν ≡ − h | i>j ij | i X which is a k-independent matrix representing the effects of interelectron repulsion, and this will be discussed in the next section. F.5. INTERELECTRON REPULSION INTEGRALS FOR MOLECULAR STURMIANS FROM HYPERSPHERICAL HARMONICS335 F.5 Interelectron repulsion integrals for molecular Stur- mians from hyperspherical harmonics

We will now show that just as the theory of hyperspherical harmonics can be used to facilitate the calculation of Shibuya-Wulfman and molecular Sturmian overlap integrals, it also provides a method for very rapid calculation of the most important interelectron repulsion integrals involving molecular Sturmians. We again make use of momentum space: 0 Let ρµ ,µ (x Xa) and ρµ ,µ (x Xa0 ) be two electron density distributions, centered re- 1 2 − 3 4 − spectively on nuclei at the positions Xa and Xa0 . Then the interelectron repulsion between them is given by the integral:

3 3 0 1 0 Jµ ,µ ,µ ,µ = d x d x ρµ ,µ (x Xa) ρµ ,µ (x Xa0 ) (F.67) 1 2 3 4 1 2 − x x0 3 4 − Z Z | − | If we introduce the Fourier transform representation

1 1 1 0 = d3p e−ip·(x−x ) (F.68) x x0 2π2 p2 | − | Z

we can rewrite Jµ1,µ2,µ3,µ4 in the form

3 1 ip·R t t Jµ ,µ ,µ ,µ = 4π d p e ρ (p)ρ ( p) (F.69) 1 2 3 4 p2 µ1,µ2 µ3,µ4 − Z where R = Xa0 Xa and − 1 ρt (p) = d3x ρ (x)e−ip·x (F.70) µi,µj (2π)3/2 µi,µj Z

Now let Rn,l(2r) be a Coulomb Sturmian radial function with r replaced by 2r, and we let g(r) be any function of r. From the completeness property of Sturmian basis sets we know that it is possible to make an expansion of the form

g(r) = anRn,l(2r) (F.71) n X and from the potential-weighted orthonormality relations (D.11) it follows that the expan- sion coefficients will be given by

n ∞ a = dr r R (2r)g(r) (F.72) n 2 n,l Z0 We now let the density be composed of a product of two Coulomb Sturmian basis functions:

∗ ∗ ρµ1,µ2 (x) = χµ1 (x)χµ2 (x) = Rn1,l1 (r)Rn2,l2 (r)Yl1,m1 (xˆ)Yl2,m2 (xˆ) (F.73) 336 QUANTUM THEORY

If we make the expansion

ρµ ,µ (x) = Rn00,l00 (2r)Yl00,m00 (xˆ) µ00 µ ,µ χµ00 (2x) µ00 µ ,µ (F.74) 1 2 C ; 1 2 ≡ C ; 1 2 µ00 µ00 X X then the coefficients in the expansion will be given by

n00 ∞ µ00 µ ,µ = dr r Rn00,l00 (2r)Rn ,l (r)Rn ,l (r) C ; 1 2 2 1 1 2 2 Z0 ∗ ∗ dΩ Y 00 00 (xˆ)Y (xˆ)Yl ,m (xˆ) (F.75) × 3 l ,m l1,m1 2 2 Z

Like the coefficients cµ00;µ0,µ, the coefficients µ00;µ0,µ form a large but very sparse matrix which can be pre-calculated once and for allC and stored. The series in (F.74) terminates and the expansion is exact. Making use of the relationships (F.73) and (F.74), we obtain the result

3 3 0 1 0 Jµ ,µ ,µ ,µ = d x d x ρµ ,µ (x Xa) ρµ ,µ (x Xa0 ) 1 2 3 4 1 2 − x x0 3 4 − Z Z | − | 0 0 = Jµ ,µCµ ,µ1,µ2 Cµ,µ3,µ4 (F.76) µ0,µ X where

3 1 ip·R t t Jµ0,µ = 4π d p e ρ 0 (p)ρ ( p) (F.77) p2 µ µ − Z and where

ρµ0 (x) = Rn0,l0 (2r)Yl0,m0 (xˆ)

ρµ(x) = Rn,l(2r)Yl,m(xˆ) (F.78)

Then, making use of Fock’s relationship we have:

t ρ 0 (p) = M˜ (p)Yn0− ,l0,m0 (wˆ ) M˜ (p)Yµ0 (wˆ ) µ 1 ≡ t l l ρ ( p) = ( 1) M˜ (p)Yn− ,l,m(wˆ ) ( 1) M˜ (p)Yµ(wˆ ) (F.79) µ − − 1 ≡ − where M˜ (p) and wˆ are similar to M(p) and uˆ, but with double the k value.

4kp 4kp 4kp 4k2 p2 wˆ = (w , w , w , w ) = 1 , 2 , 3 , − (F.80) 1 2 3 4 4k2 + p2 4k2 + p2 4k2 + p2 4k2 + p2   and 4(2k)5/2 M˜ (p) (F.81) ≡ (4k2 + p2)2 F.5. INTERELECTRON REPULSION INTEGRALS FOR MOLECULAR STURMIANS FROM HYPERSPHERICAL HARMONICS337

Then

3 1 ip·R 2 l Jµ0,µ = 4π d p e M˜ (p)( 1) Yµ0 (wˆ )Yµ(wˆ ) (F.82) p2 − Z l Since we know how to represent the product ( 1) Yµ0 (wˆ )Yµ(wˆ ) in terms of Yµ00 (wˆ ), we can − express the matrix Jµ0,µ in terms of a single vector, Jµ00 : Let

l ∗ c˜µ00 µ0,µ ( 1) dΩ Y 00 (wˆ )Yµ0 (wˆ )Yµ(wˆ ) (F.83) ; ≡ − 4 µ Z Then

Jµ0,µ = Jµ00 c˜µ00;µ0,µ (F.84) µ00 X where

3 1 ip·R 2 Jµ 4π d p e M˜ (p)Yµ(wˆ ) (F.85) ≡ p2 Z

The coefficientsc ˜µ00;µ0,µ differ slightly from the coefficients cµ00;µ0,µ, but they too form a large but very sparse matrix that can be pre-calculated and stored. We must now evaluate Jµ. To do so, it is convenient to introduce the notation

t −l t ρ (p) = M˜ (p)Yµ(wˆ ) i R (p)Yl,m(pˆ) (F.86) µ ≡ n,l where

∞ t 2 2 R (p) dr r jl(pr)Rn,l(2r) (F.87) n,l ≡ π r Z0 Then, expanding the plane wave in terms of spherical harmonics and spherical Bessel functions and integrating over the solid angle in momentum space, we have: 1 J = 4π d3p eip·RM˜ 2(p)Y (wˆ ) µ p2 µ Z ∞ 2 ˆ ˜ t = (4π) Yl,m(S) dp M(p)Rn,l(p)jl(pR) (F.88) Z0 The radial p-integrals in equation (F.88) are simple enough to be evaluated exactly by Mathematica, and they depend only on n and l. They can conveniently be stored as interpolation functions. It is also convenient to initialize by performing the sum shown in equation (F.84). This sum, and the sums required for the evaluation of Jµ1,µ2,µ3,µ4 from Jµ0,µ can be performed very rapidly because of the sparseness of the coefficients. 338 QUANTUM THEORY

2

1

5 10 15 20

-1

-2

∞ ˜ t Figure F.7: The integrals 0 dp M(p)Rn,l(p)jl(pR)/k of equation (F.88) are shown here plotted as functions of S kR. There are 105 functions, corresponding R n = 1, 2,..., 14 and l = 0, 1, . . . , n ≡1. −

1.0

0.5

0.0 5 10 15 20

-0.5

-1.0

∞ ˜ t Figure F.8: The integrals 0 dp M(p)Rn,l(p)jl(pR)/k shown in more detail. For small values of S the integrals are proportional to Sl, while for large values R they are proportional to 1/Sl+1. F.6. MANY-CENTER INTEGRALS TREATED BY GAUSSIAN EXPANSIONS (APPENDIX E)339 F.6 Many-center integrals treated by Gaussian ex- pansions (Appendix E)

One also needs to calculate 3-center and 4-center integrals of the form 1 J = d3x d3x0 χ∗ (x)χ (x) χ∗ (x0)χ (x0) (F.89) τ1,τ2,τ3τ4 τ1 τ2 x x0 τ3 τ4 Z Z | − |

where the centers Xa1 ,..., Xa4 may in general be 4 different points. Even in this difficult case, molecular Sturmian basis functions have very marked advantages. One can show

(Appendix E) that Jτ1,τ2,τ3τ4 /k is independent of k. They may therefore be calculated once and for all and stored. One can also show (Appendix E) that the Coulomb Sturmian atomic orbitals can be expressed in terms of Gaussian expansions, where the Gaussian exponents αi are universals that need never be changed despite changes in scaling due changes in the value of k. The coefficients γ0,i, γ1,i,... in the following expression

2 j −s −αis s e γj,ie s kr (F.90) ≈ i ≡ X are also universals, and they too need never be changed, despite changes in scaling. We make the expansion

2 3/2 −αi|kx−kXa| m χτ (x) = χn,l,m(x Xa) k Γn,l,i e Rl (kx kXa) − ≈ i − X (F.91)

where the coefficients Γn,l,i are defined by the relationship

2 2l + 1 −l −αis R˜n,l(s)s Γn,l,ie (F.92) 4π ≈ r i X ˜ 3/2 m with Rn,l(s) Rn,l(r)/k and where Rl is a regular solid harmonic (Appendix E). Figure 5.9 shows the≡ Gaussian expansion of e−s e−kr, while the table shows coefficients in the expansion. ≡ 340 QUANTUM THEORY

Exp@-sD 1.0

0.8

0.6

0.4

0.2

s 1 2 3 4 5 6

2 −s −αis Figure F.9: This figure shows the Gaussian expansion e i γ0,ie , using the 10 coef- ficients and exponents shown in Table 5.5. The expansion≈ is reasonably accurate throughout most of the range, but it fails to produce the sharp cusp nearPs = 0.

s² Exp@-sD

0.5

0.4

0.3

0.2

0.1

s 1 2 3 4 5 6

2 2 −s −αis Figure F.10: Here we see the Gaussian expansion s e i γ2,ie . As in Figure 5.9, the expansion is compared with the exact function.≈ P F.6. MANY-CENTER INTEGRALS TREATED BY GAUSSIAN EXPANSIONS (APPENDIX E)341

Table F.4: Universal coefficients for Gaussian expansions of Coulomb Sturmians: They are 2 j −s −αis used in the relationship s e = i γj,ie , where s kr. When k changes with scaling, the Gaussian expansion changes scale automatically. ≡ P

i αi γ0,i γ1,i γ2,i

1 5.12 0.474589 0.456553 0.011253 − −

2 2.56 0.409842 0.420846 0.135640 − −

3 1.28 0.522704 0.461490 0.030952 − −

4 0.64 0.028869 0.157189 0.390496 − −

5 0.32 0.237377 0.008340 0.284720 −

6 0.16 0.074194 0.248277 0.001174

7 0.08 0.039810 0.147977 0.631545

8 0.04 0.001091 0.025882 0.224411 −

9 0.02 0.000808 0.001018 0.000462 −

10 0.01 0.000129 0.000170 0.000468 − 342 QUANTUM THEORY

0.5

0.4

0.3

0.2

0.1

5 10 15 20

Figure F.11: Interelectron repulsion integrals Jτ1,τ2,τ3,τ4 /k for diatomic molecules for n = (2, 2, 2, 2) and l = (0, 0, 0, 0) as functions of S = kR. The lowest curve shows a the results when (a1, a2, a3, a4) = (S.0, S, 0). The next higher curve shows the case where (a1, a2, a3, a4) = (S.S, S, 0). The highest curve, which is the (a1, a2, a3, a4) = (S.S, 0, 0) case, is compared with 1/S, which it approaches asymptoticly as S becomes large. F.7. A PILOT CALCULATION 343 F.7 A pilot calculation

We have made a small pilot calculation on the dissociation of the hydrogen molecule, using a very restricted basis set. In the calculation shown in Figure 5.12, the ground state wave function changes character as a function of the internuclear separation, R. As R 0, the wave function becomes more and more dominated by a configuration which is built from→ two gerade molecular orbitals. But as the molecule dissociates, the wave function becomes the linear combination of configurations representing two isolated neutral hydrogen atoms, each with its own electron, and the total energy corresponds to that of two isolated hydrogen atoms. 344 QUANTUM THEORY

0.5

2 4 6 8 10 -0.5

-1.0

-1.5

-2.0

-2.5

-3.0

Figure F.12: This figure shows the results of a preliminary calculation on the dissociation of the hydrogen molecule using a very restricted basis set. Energies are shown in Hartrees as functions of the internuclear separation, measured in Bohrs. The lowest curve shows the ground-state electronic energy by itself, without internuclear repulsion. The two upper curves show the ground state and first excited singlet state electronic energies with nuclear repulsion added, i.e. the total energies of the two states. The calculated equilibrium bond length is 1.41 Bohrs, which can be compared with the experimental value, 1.40 Bohrs. It can be seen from the figure that at a separation of 5 Bohrs or more, the molecule is completely dissociated, and in fact the calculated wave function at that internuclear separation corresponds to two neutral hydrogen atoms, each with its own electron, while the total energy corresponds to that of two isolated hydrogen atoms. F.7. A PILOT CALCULATION 345

EZ2 0.0 R 2 4 6 8 10 12 14

-0.5

-1.0

-1.5

-2.0

Figure F.13: This figure shows ground-state energies divided by Z2 for the 2- electron isoelectronic series for homonuclear diatomic molecules, Z being the nuclear charges. The energies in Hartrees are shown as functions of the in- teratomic distance R, measured in Bohrs. The dotted curves are electronic energies alone, while the solid curves also include internuclear repulsion. For both the solid and dotted curves the lowest curve corresponds to Z=1, the next lowest to Z = 2, and so on. As in Figure 5.12, a very restricted basis set was used for the calculation. 346 QUANTUM THEORY F.8 Automatic generation of symmetry-adapted basis functions

In Chapter 4, Section 4.2, we discussed a large-Z approximation that could be used when the Generalized Sturmian method is applied to atoms. In the large-Z approximation, the basis set used to treat a particular state is restricted to the set of configurations that become degenerate if interelectron repulsion is completely neglected. We also saw in Chapter 4 that such a set of states could be used as an invariant subset, and thus be used to generate symmetry-adapted basis functions for more accurate calculations where the large-Z approximation is abandoned. If we compare equation (F.51) with equation (E.6) of Chapter 4, we can see that the two equations are closely analogous. In each case, the configuration Φν satisfies | i an approximate Schr¨odingerequation with a weighted potential V0, the weighting factors being chosen in such a way as to make all of the configurations in the basis set isoenergetic. Thus both the Goscinskian configurations of Chapter 4 and the isoenergetic configurations of Chapter 5 are examples of generalized Sturmians, as discussed in Appendix B. Therefore it is interesting to ask whether something analogous to the large-Z approximation exists in the case of molecules. What happens if we decide to use as a basis only those molecular configurations that become degenerate if we completely neglect interelectron repulsion? Let us suppose that this degeneracy is not accidental, but is a due degeneracy. It then follows that any set of configurations that become degenerate if interelectron repulsion is completely neglected is closed under the operations of the symmetry group of the molecule, and it can be used as an invariant subset for the automatic generation of symmetry- adapted basis functions needed in a large and accurate calculation. The method can thus be summarized as follows:

(0,N) 1 (N) 0 1. Construct the matrices Tν0ν k Φν V0(x) Φν and Sν0ν , based on configurations that are solutions to (F.51). These≡ − willh | already| bei diagonal, as was discussed above.

(0,N) (N) 2. Search for configurations corresponding to the same value of Tν,ν /Sν,ν . Such a set of degenerate configurations is an invariant subset provided that the search has been sufficiently complete.

3. For each invariant subset, solve equation (F.60). The eigenfunctions will then be the symmetry-adapted configurations needed for a larger and more accurate calculation. INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, VOL. XXV, 79-96 (1984)

Use of the S-Matrix in the Relativistic Treatment of Resonance Energy Transfer

JOHN AVERY Department of Physical Chemistry, H. C. 0rsted Institute, Unioersity of Copenhagen, Copenhagen, Denmark

Abstract The S-matrix formalism is used to treat the phenomenon of resonance energy transfer (sensitized fluorescence). It is shown that for dipole-allowed transitions and short sensitizer-acceptor separations, the relativistic treatment yields the same result as the nonrelativistic Perrin-Forster theory. For large sensitizer-acceptor separations, long-range coupling terms appear in the relativistic treatment. Resonance energy transfer through these long-range coupling terms is compared with spontaneous photon emission, and direct-interaction theories of electromagnetism are discussed. In the Appendix it is shown that the relativistic theory predicts resonance transfer of triplet excitation energy through the spin-spin coupling term in the Breit interaction.

Experimental Discovery of Resonance Energy Transfer and Early Theories The phenomenon of resonance energy transfer was discovered experimentally in 1923 by Cario and Franck [l], who exposed a mixture of mercury and thallium vapors to a frequency of light which only the mercury could absorb. In the emission spectrum they observed a frequency of light which only the thallium could emit, which demonstrated that energy had been transferred from one species to the other. A theory for the mechanism of energy transfer in the Cario-Franck experi- ment was first proposed by Perrin [2,3], and later improved by Perrin, Forster, and others [4-91. In the Perrin-Forster theory of resonance energy transfer, the system is divided into two subunits, and its initial state is represented by a wave function of the form:

l*i)=l$c(1) $g(2)), (1) where 4*(1)represents subunit 1 in the excited state 5, and $,(2) represents subunit 2 in the ground state. (For example, subunit 1 might be a mercury atom, while subunit 2 might represent a thallium atom.) The two subunits are assumed to be separated by a distance R which is large enough so that exchange can be neglected. In the final state of the total system, which is represented by a wave function of the form a transfer of excitation energy has taken place, so that subunit 1 is in its ground state g', while subunit 2 is in an excited state 5'.

Q 1984 John Wiley and Sons, Inc. CCC 0020-7608/84/010079-18$04.O0 80 AVEKY

The Hamiltonian of the total system can be written in the form: H =Ho+H’, (3) where the unperturbed Hamiltonian Ho = HI + H* (4) is the sum of the Hamiltonians of the isolated subunits 1 and 2, and where

represents the Coulomb interaction between the charged particles i of subunit 1 and the charged particles j of subunit 2, qi and qi being the respective charges. In the Perrin-Forster theory of resonance energy transfer, ITi) and /Ti) are eigenfunctions of Ho,and a transition between them is induced by H’. In order to calculate the probability per unit time of the transition i -f as a function of the separation R, one makes a bipolar expansion of l/lxi -xil around the center- of-mass positions Rt and Rz of the two subsystems:

Because of the orthogonality relationships

the matrix elements of the first three terms in Eq. (6) vanish, and the leading term in the matrix element for the transition is the dipole-diple term:

If we let RESONANCE ENERGY TRANSFER 81 and

and if we note that

where then Eqs. (5) and (9) yield:

Since the matrix element for the transition falls off as l/R3, the transition probability per unit time falls off as 1/R6. Because of this, one sometimes expresses the Perrin-Forster rate constant for resonance energy transfer in the form:

where T~ is the time needed for spontaneous emission of a photon, and Ro is the separation of 1 and 2 for resonance energy transfer which is equal to the probability for spontaneous photon emission. Typically, for dipole-allowed transitions, Rois of the order of 50 A. The validity of the Perrin-Forster theory for dipole-allowed resonance energy transfer over distances of this order has been amply confirmed by experiments [ 11-14].

Relativistic Treatment of Resonance Energy Transfer Let us now try to improve the Perrin-Forster theory by taking relativistic effects into account [15, 161. To do this, we begin with the second-order electron-electron S-matrix [ 171:

where

d4xj= d3xjdtj, (18) and where G(1,2) is the Green’s function for the d’Alembertian operator: 1 d4k expik. (x1-x2)-iw(fl-f2) G(W=~ J k . k - (w/c)~- is 82 AVEKY

Here e is a real, positive infinitesimal number which is allowed to go to zero after all the integrations have been performed, and d4k = d3kdo. In Eq. (16), b;, b;, bf, and 6, are electron creation and annihilation operators obeying the anticommutation relations:

b;b, i-bjb: = 0,

The electron creation and annihilation operators in Eq. (20) refer to spin orbitals which are eigenstates of the Dirac operator in a local potential (for example, the effective potential of an atom or molecule), and we are thus working in an independent particle approximation. In Eqs. (17) and (18) and throughout the remainder of this paper, a sum over p running from 1 to 4 is implied whenever the index p is repeated. The matrices y: and y: are defined as in Ref. 18, with the superscripts indicating that they act, respectively, on the spin coordinates of electrons 1 and 2. We also use the definition &r =i@:y4, where the dagger means “conjugate transpose,” and in general we shall follow the notation of Ref. 18. The matrix element in Eqs. (17) and (18) can be brought into another form by performing integrations over dt2,d4k, and dtl:If we assume that the orbitals @, and depend harmonically on time, then we can write:

e&(2) r:at(2) =iZ(x2) exp (ioSft2?, (21) where oSf= w, -or and i;(x2?=e&(x2) Y:wx~), (22) and similarly we can write:

ecW1)Y:Q,~(~) =i:”(x1)exp (iorUt1), (23) where

C’(xl)=ecGr(xl)r:~~(xl). Then, integrating over dt2,d4k, and dtl,we obtain [17]:

where

and where the transition currents i;(xl) and jE(x2)are defined by Eqs. (22) and (24). The effective interaction energy matrix element U,s~t,is similar to the matrix element of a Coulomb interaction, except that the transition densities RESONANCE ENERGY TRANSFER 83 are replaced by a scalar product of transition four-currents, and the interaction contains a retardation factor, exp (ilustlr12/c). In order to apply Eqs. (16)-(26) to the phenomenon of resonance energy transfer, let us represent the initial state of a system composed of two subunits by a wave function of the form: I%)= I$*(U &(2))=bmbflI$d(1) $g(2))=bmb"l%) (27) and let us represent the final state by the wave function

I*f) = I$g'(l) $*'(2)) = b:bkl$g'(l) $g(2)) = blbklq0). (28) Equations (27) and (28) have the same meaning as Eqs. (1) and (2). In the initial state of the system, subunit 1 is in a singly excited state $,(1) which differs from the ground state by the excitation of an electron from the orbital n to the orbital rn, while subunit 2 is in its ground state $g(2). In the final state, the situation is reversed, and subunit 2 is excited while subunit 1 is in its ground state. The transition probability is proportional to l(qilS/9f)l2.Combining Eqs. (16), (27), and (28), and making use of the anticommutation relations (20), we obtain: (Vilslqf)=Smklln -Smklnl =Smklln. (29) In Eq. (29) we have used the fact that the two subunits are assumed to be separated by a distance R = IR1 - R21 which is large enough so that the exchange term can be neglected. Let us try to find the dependence of Umklln on the separation R. We can make a bipolar expansion of (1/r12)exp (i/wkl1rl2/c)about the points R1 and R2. If we let K = lWklI/C, then the expansion analogous to Eq. (6) is

In the nonrelativistic theory of resonance energy transfer, the first three terms in the bipolar expansion [Eq. (6)] vanish because of the orthonormality relations [Eqs. (7) and (8)]. By contrast, the first three terms in Eq. (30) all give rise to nonvanishing contributions to the matrix element Umklln. Substituting Eq. (30) into Eq. (26) we obtain:

a a + d3x1(x1-R1).-j,"" (x,) d3x2(x2-R2).-j~(x~) J aRi aR2 iKR +. . .)L R' 84 AVEKY

To see that the first term in Eq. (31)does not vanish, we note that

The fourth component vanishes, since yiyi = 1 and, if m # n,

However, the space part of the four-vector d'xl jF"(xl)does not vanish, since

Thus we find that in the relativistic theory of resonance energy transfer, there is a nonzero term of the form:

This term represents magnetic coupling between the two subsystems, and it is remarkable because it falls off only as 1/R. Other terms proportional to 1/R arise when the differential operators a/aR, and d/dR2 act exclusively on the retardation factor eiKR rather than on the denominator of eiKR/R.Thus, for example,

a2 -=-(eiKR eiKR R,R,(3 -it& - K 2R2) &,,( 1 - ~KR)- (36) aR1,aR2, R R R2 ).

The term proportional to K~ in Eq. (41) falls off only as 1/R. If we substitute Eq. (36)into the fourth term of Eq. (31),we obtain: RESONANCE ENERGY TRANSFER 85

If the transitions m +rz and k +I are dipole allowed, then the I*. =4 term predominates over the I*. = 1, 2, and 3 terms, and Eq. (37) becomes:

J

C'K'(D~ . R)(D~.R) eiKR w- -+. . . (38) KR,>I R2 R where D1and D2 are the transition dipole moments: Dl=e I d3x1 @hdxl-Rl)@fl(xl), (39) D2= e I d3x2 @;(x2)(x2- R2)@,(xZ). With the help of the commutation relation:

we can express Eq. (35) in the form:

=C~K'(D~. D~), (41) with D, and D2 defined by Eq. (39). Thus when R is large compared with 1/~=h/2.rr = c/lwmn1, then the leading term in the multipole expansion is

In the opposite case, when R is small compared with 1/~,substitution of Eq. (36) into Eq. (31) yields:

(Di * Dz) - 3(Di. R)(Dz * R) (43) Umklln R3 R5 Comparing this with Eq. (14), we can see that the relativistic treatment of resonance energy transfer gives the same result as the Perrin-Forster theory for dipole-allowed transitions and small values of R. For large values of R, the relativistic treatment of the dipole-allowed case yields a coupling term [Eq. (42)] that falls off as 1/R.

Probability Per Unit Time for Resonance Energy Transfer Let us now turn to the problem of calculating the probability per unit time for resonance energy transfer. In the nonrelativistic case, the matrix element of 86 AVERY

the perturbation inducing the transition lUi)+ lUf) has a leading term given by Eq. (14). In first-order time-dependent perturbation theory, the probability per unit time that the transition will take place is given by laf12/t where

so that

[exp (iwift)- ~][exp(-iwift) - 11 Id= 7 2 (45) 0 if We must now remember that the system is not completely isolated, but is coupled with its surroundings. This coupling can be taken into account by saying that the states \qi)and lqf)are not exactly stationary, but that there are certain unspecified interactions with the environment other than H'. These interactions collectively produce, on the average, r transitions per unit time which will take the system out of the state lqi)or the state lPf)and into some other state. This coupling of the system with its environment can be taken into account in our simplified perturbation model by saying that the states 1qi)and are split by the interaction into a group of states. The density of these states (i.e., the number of states per energy interval) is given by a Lorentzian distribution:

This type of distribution corresponds, for example, to the measured line shape for atomic spectra. r is the radiation half-life for spectra of isolated atoms in a rarified gas. In a gas at atmospheric pressure, where collision broadening pre- dominates, l/r corresponds to the average time between collisions. The factor r/.rr in the numerator in Eq. (45) normalizes the distribution in such a way that m I-, dwp(w)= 1. (47) We must now calculate the probability of transition from one of the group of states into which \Ti)is split by the interaction with its environment to one of the group of states into which ITf)is split. Then we must sum these probabilities over the whole set of final states, and finally we must sum over the whole set of initial states. In other words, we must evaluate:

{exp[i(w -w')t]-l}{exp [-i(w -w')t]-l} (48) (w -wy RESONANCE ENERGY TRANSFER 87

Evaluating the double integral in Eq. (48)by means of contour integration, we obtain:

This represents the probability that after an interval t the system will have undergone a transition from the initial state \qi)to the final state Thus, in the special case where wi = wf, the rate at which the transition takes place becomes:

Using Eq. (14) and taking the z axis in the direction of R,we can write for the nonrelativistic case:

1 = -(D:,D + D :,D ;y + 4D7,D ;z - 201.0 lzD2xD2z R6 ;.

- 201 yD1 DzyDzz + D 1 ,D I yD2xDz ). (51) If our manifold of final states includes states which represent resonance energy transfer to various randomly oriented absorber molecules, we must average over the orientations of D2 with respect to R. Doing this, and averaging also over the orientations of D1with respect to R,we obtain:

For dipole-allowed transitions in organic dyes, such as acriflavine and rhodamine B (two dyes commonly used in studying resonance energy transfer), if the transition dipole moment is expressed in the form D = eL, where e is the charge of an electron, then L is a length whose order of magnitude is a few Angstroms. If we let r = 1013- /s and T~ = lo-' s and if we rewrite Eq. (52)in the form:

then

Now let us consider what effect the relativistic corrections have on the rate of resonance energy transfer for dipole-allowed transitions. If we replace H; by the matrix element of the effective interaction energy, Umklln[Eqs. (31), 88 AVERY

(42), and (43)], then we find that for KR<< 1 the rate of resonance energy transfer is given by Eqs. (53) and (54), in agreement with the Perrin-Forster theory. However, when KR-- 1, the relativistic effects begin to become important, and when KR >> 1, they predominate. Combining Eqs. (42) and (50) and averaging over orientations of the dipole moments, we obtain an average transition rate for the region where KR >> 1:

The R dependence of the average rate of resonance energy transfer for dipole- allowed transitions is illustrated in Figure 1.

Long-Range Resonance Energy Transfer From Eq. (55) we can see that when KR >> 1, the probability for resonance transfer of excitation energy from an excited atom or molecule to an absorber

\ -12 I I \r I -2 -1 0 1 2 log (xR) - Figure 1. Dependence of the averaged rate of resonance energy transfer on the sensitizer-acceptor separation R for dipole-allowed transitions. For small separ- ations, relativistic corrections to the Perrin-Forster theory are unimportant, and the transition rate falls off as l/Rh.When R = 1/~=A/2.rr, relativistic corrections become important, and for larger separations, they predominate. When KR >> 1, the averaged transition rate falls off as 1/R2. (-) Relativistic transfer rate; (---) transfer rate in the nonrelativistic Perrin-Farster theory. RESONANCE ENERGY TRANSFER 89 is proportional to SIR2.Let us now imagine an excited atom placed at the center of a sphere of radius R, where R > S/K. Let us also imagine that on the surface of the sphere there are a number of similar atoms in their ground states. According to the argument outlined above, the probability for resonance energy transfer to one or another of the atoms on the sphere will fall off as SIR2.On the other hand, the number of absorber atoms which we can place on the sphere without changing the number per unit area will increase as R2. Thus, if we keep the number per unit area constant, the probability of resonance energy transfer to one or another of the absorbers will be independent of R. For example, suppose that the transition dipole moment of the atoms which we are considering is given by D = eL, and suppose that the number of absorber atoms per unit area on the surface of the sphere is 1/4?rL2.Then the total number on the surface of the sphere will be N =4?rR2/4?rL2=R2/L2.The ratio of the rate of resonance energy transfer to one or another of the absorbers to the rate of spontaneous photon emission will have the order of magnitude [from Eq. (SS)]:

where X = 1/K. If we let T~ = lO-’s, r- l0l3- /s, and A - 1000 A, Eq. (56)gives the order of magnitude of the ratio as approximately unity. We can see from this argument that a theory of resonance energy transfer which takes relativistic effects into account leads to the surprising prediction that resonance energy transfer can take place over macroscopic distances! (It should be remembered, however, that the simple first-order perturbation theory which we have used here is inadequate in an important respect since it is unable to take into account the screening of one absorber by another. In order to describe this screening, it would be necessary to treat resonance energy transfer in a manner analogous to the treatment of delocalization of an initially localized Frenkel exciton state [19-2 11.)

Possible Experiments to Observe Long-Range Energy Transfer The Perrin-Forster-type of resonance energy transfer (sometimes called “sensitized fluorescence”) is usually studied in aqueous solutions of organic dyes, such as acridine orange, acriflavine, rhodamine B, etc. These dyes have the advantage of large transition dipole moments. They also have a high fluorescence efficiency, i.e., it seldom happens that an excited dye molecule loses all of its energy to vibrational modes. A little energy is always lost 1221, and therefore the absorption maximum does not coincide with the fluorescence maximum. Resonance energy transfer is most likely when the fluorescence maximum of the excited molecule corresponds to the absorption maximum of the absorbers. In the usual experiments, two dyes, a “sensitizer” S and an “acceptor” A are chosen such that this resonance condition is fulfilled, and they are mixed together in solution. The solution is irradiated with light whose frequency corresponds to the absorption maximum of S, and the fluorescence spectrum is observed. 90 AVERY

It is found that the quantum efficiency of the fluorescence of S is increased by the presence of A, and that the fluorescence efficiency of A is increased (“sensit- ized”) by the presence of S. In moderately concentrated solutions, where the average S-A distance is of the order of 50 A, the deduced rate of energy transfer is far too great to be explained by emission and reabsorption of photons. The difficulty with this type of experiment as a means of detecting long-range resonance energy transfer is that the total observed fluorescence is due to the combined effect of a great many S and A molecules. These molecules are randomly distributed throughout the solution. The environment of some sensitizer molecules includes an acceptor at very close range, whereas other sensitizers have no acceptor nearby. One of Forster’s important achievements is the deduction of a l/R6 dependence for the transfer rate from a statistical analysis [23] of the concentration dependence of sensitized fluorescence in dye solutions. However, the indirectness and difficulty of such an analysis have led other authors to hope that experiments could be performed in which the S-A would be constant and measurable. Zwick and Kuhn [24] achieved such an experiment by absorbing the S and A molecules on opposite sides of thin layers of barium stearate, using a technique developed by Langmuir and Schaefer [25]. The dyes used by Zwick and Kuhn were acriflavine and rhodamine B. The absorption and fluorescence maxima of both these dyes are in the visible range, whereas barium stearate is transparent in the visible, having its absorption maximum in the ultraviolet. The layer of stearate was therefore optically inert and served only to separate the S monolayer from the A monolayer. Zwick and Kuhn measured the quantum fluorescence efficiency of the acceptor as a function of film thickness, and they found a deduced transfer rate in agreement with a 1/R6dependence up to a film thickness of 100 A. The Zwick-Kuhn technique seems better suited to an attempt to measure long-range resonance energy transfer than an experiment carried out in solution. In a solution, the extremely strong short-range transfer between molecules which are accidentally close together forms a background which would mask the weaker long-range effects. On the other hand, for two layers separated by an inert film, the S-A distance is always greater than the thickness of the film. Therefore one might attempt an experiment of the following kind: Consider a sandwich-like arrangement with monolayers of S and A separated by a layer of a transparent medium as shown in Figure 2. The whole transparent film is covered on one side with the sensitizer dye. On the other side of the film, only half the area is covered with the acceptor. The film is illuminated from the S side with monochro- matic light at the absorption peak of S, and the fluorescence efficiency of S is observed from the S side as a function of film thickness. If resonance energy transfer from S to A is competing with emission of a photon by S, then the fluorescence of S, observed from the S side, will be quenched in the region of the film where A is present on the opposite side. If this quenching is observed, then resonance energy transfer can be inferred, since the presence of A on the opposite side could not decrease the fluorescence of S by any other mechanism. Such an experiment has not yet been performed (see Figs. 2 and 3). RESONANCE ENERGY TRANSFER 91

A

S

Figure 2. Arrangement of sensitizer and acceptor layers on opposite sides of a thin film of barium stearate in the experiments performed by Zwick and Kuhn [24]. These authors measured the rate of energy transfer from S to A up to film thicknesses of 100 A. Experiments at larger film thicknesses have not yet been performed.

Direct-Interaction Theories of Electrodynamics We saw above that a theory of resonance energy transfer which includes relativistic effects leads to the very surprising prediction that the rate of resonance energy transfer over macroscopic distances can be comparable to the rate of spontaneous photon emission. We must now ask whether this predicted long- range resonance energy transfer is a phenomenon which competes with spon- taneous photon emission, or whether it is just another way of describing the spontaneous emission of a photon and its subsequent absorption by the acceptor. Suppose that we choose the second answer and maintain that long-range resonance energy transfer is just another way of describing spontaneous photon

t w

Figure 3. Absorption maxima (w, and w3)and fluorescence maxima (w2 and ud) for experiments on resonance energy transfer performed with organic dye molecules. Because of the loss of energy to vibrational modes, the fluorescence maxima of both the sensitizer and acceptor are shifted to the red with respect to the absorption maxima. The rate of transfer is greatest when the fluorescence maximum of the sensitizers coincides with the absorption maximum of the acceptors, i.e., when w2= w3. In our simplified theory, r represents the halfwidth of the bands. 92 AVERY emission. Then we should note an important difference between the resonance transfer description and the conventional one. In the conventional description, no absorber need be present in order for the photon to be emitted. For example, in the standard cosmological description of the early stages of the universe [26], all the matter is imagined to have been initially concentrated in a small region of space. Subsequently, the matter exploded out from this position, but with a velocity less than the velocity of light. Thus, unless space is closed, it is divided into two regions-a central region containing matter and an outer region which is entirely empty. According to the conventional description of spontaneous photon emission, photons must be streaming out from the central region into the totally empty space beyond. On the other hand, if absorbers are necessary for emission of radiation, then at all times during the evolution of the universe the radiation energy would be retained within the region occupied by the matter. Thus the two pictures lead to very different cosmological models, which could, in principle, be distinguished experimentally. We should also note another difference between the resonance energy transfer description of radiation and the conventional one. In the conventional descrip- tion, the radiation field has an infinite number of degrees of freedom, while in the direct-interaction picture, the field has no more degrees of freedom than the particles which generate it. The infinite number of degrees of freedom which the electromagnetic field has in the conventional picture lead to an embarrassing infinity in the zero-point energy of the field, and this infinity is avoided in the direct-interaction picture. For this reason, a number of authors have explored the possibility of formulating quantum electrodynamics in terms of a direct interaction [27-32, 35-42]. Interest in direct-interaction formulations of elec- trodynamics has also been motivated by the desire of quantum physicists and chemists to have a fully relativistic method for calculating the bound states of atoms and molecules. For example, Roothaan has recently approached the problem from this direction [34]. I hope that the present paper will contribute to the current interest in direct-interaction theories of quantum electrodynamics.

Appendix: Resonance Transfer of Triplet Excitation Energy through Spin-Spin Coupling Terms in the Breit Interaction In the main part of this paper, we have discussed resonance energy transfer for cases where both the sensitizer and acceptor undergo dipole-allowed transi- tions. In this appendix, we shall treat triplet energy transfer. This is a case in which the sensitizer and acceptor undergo dipole-forbidden transitions involving a spin flip. A theory of triplet energy transfer has been developed by Dexter for cases where the sensitizer and acceptor are close enough to each other so that there is some overlap of the wave functions [9]. However, in this appendix we shall confine our attention to cases where the sensitizer-acceptor separation R is large enough so that overlap between the wave functions can be neglected. We shall RESONANCE ENERGY TRANSFER 93 see that in this case, the spin-spin coupling terms in the Breit interaction produce a small probability for triplet energy transfer. Although the rate of triplet energy transfer induced by this coupling is small, we shall see that it is large enough to be observed during the very long lifetimes which the triplet states of aromatic hydrocarbons exhibit when the molecules are frozen in rigid matrices. In order to discuss the spin-spin coupling, it is convenient to expand the matrix element of the effective interaction energy U,I,, [Eq. (26)] in powers of l/c, retaining terms up to 1/c2, and solving for the small components of the four-component Dirac spinors in terms of the large components. If we do this, we obtain [17]

where U(e’is the Breit interaction operator in the Pauli approximation:

In Eq. (A2), r=xl-xz, while u1 and u2 are three vectors whose components are the Pauli spin matrices ux,cry, and uz.In Eq. (Al), 4r,&, 4,, and 4, are two-component electron spin orbitals. The term

ul uz 3(ul * r)(u2 r) H” = (&)2 (7-r5 >. in Eq. (A2) is responsible for the mechanism of triplet resonance energy transfer which will be discussed in this appendix. Let us introduce a notation where It, 1) denotes a spin state where electron 1 has its spin pointing parallel to the z axis, while the spin of electron 2 is antiparallel. If IJ, T) represents the opposite situation, while (1, denotes the adjoint of IL, t),then 94 AVERY

The matrix element responsible for resonance transfer of triplet excitation energy then becomes:

[ d3x1[ d3x2~T(xI)~T(~~)H"~~(xz)~~(x~)

where we have made a bipolar expansion of H"(r)around the sensitizer and acceptor centers as in Eq. (6).For triplet-singlet transitions, the integrals involv- ing the space parts of the electron orbitals are not equal to zero because the triplet and singlet orbitals are eigenfunctions of different Fock operators, and hence not orthogonal. Letting R = (X, Y,Z) we have

and similarly, while so that

and

(A101 Averaging over orientations and setting

we obtain the order of magnitude for the transition rate [Eq. (49)] as

--_-1 9 eh - (~~-.f)- 5 ( 2mc )2hf;;". RESONANCE ENERGY TRANSFER 95

Equation (A12) can be rewritten in the form:

where, with T= 1OZ3-/sand ro= 10 s, Rl=lOA. 1414) Resonance transfer of triplet excitation energy over distances of this magnitude has been observed experimentally by Gay [33].

Acknowledgments The author is extremely grateful to Lektor Lars Erik Lundberg and Professor Jens Peder Dahl for stimulating discussions. He is also very grateful to Professors D. P. Craig, E. A. Power, P. 0. Lowdin, 0. Goscinski, and J. Jortner for their advice and encouragement.

Bibliography

[l] G. Cario and J. Franck, Z. Phys. 17,202 (1923). [2] J. Perrin, Proces de la Conseil de Chimie Solvay (Gauthier and Villar, Paris, 1925), p. 322. [3] J. Perrin, C.R. Acad. Sci. 184, 1097 (1927). [4] H. Kallmann and F. London, Z. Phys. Chem. B2,207 (1928). [5] F. Perrin, Ann. Phys. 17,283 (1932). [6] Th. Forster, Naturwissenschaften 33, 166 (1946). [7] Th. Forster, Z. Naturwiss. A 4, 321 (1949). [8] S. I. Vavilov, J. Phys. (USSR) 7, 141 (1943). [9] D. L. Dexter, J. Chem. Phys. 21, 836 (1953). [lo] J. Franck and R. Livingston, Rev. Mod. Phys. 21, 505 (1949). [ll] S. I. Vavilov, Microstructure of Light (Verlag, Berlin, 1954). [12] E. J. Bowen and B. Brocklehurst, Trans. Faraday SOC.49, 1131 (1953). [13] A. N. Terenin and V. L. Ermolaev, Usp. Fiz. Nauk. 58, 37 (1953). [ 141 Th. Forster, Fluoreszinz Organischer Verbindungen (Vandenhoeck and Ruprecht, Gottingen, 195 1). [15] J. Avery, Proc. Phys. SOC.(London) 88, 1 (1966). [16] R. R. McLone and E. A. Power, Mathematica 11, 91 (1964). [17] A. I. Akhiezer and V. B. Berestetskii, Quantum Electrodynamics (Interscience, New York, 1965). [ 181 J. Avery, Creation and Annihilation Operators (McGraw-Hill, New York, 1976). [19] J. Avery, R. Mason, and L. G. Augenstein, Biopolym. Symp. 1, 259 (1964). [20] J. Avery and R. Mason, J. Phys. Chem. 69, 784 (1965). [21] J. Avery, in Electronic Aspects ofBiochernistry, B. Pullman, Ed. (Academic, New York, 1964). [22] J. Avery, The Quantum Theory of Atoms, Molecules and Photons (McGraw-Hill, New York, 1972), pp. 251-253. [23] J. Avery and J. C. Packer, in The Triplet State, A. B. Zahlan, Ed. (Cambridge U.P., Cambridge, 1967). [24] M. M. Zwick and H. Kuhn, Z. Naturforsch. A 17,411 (1962). [25] I. Langmuir and V. J. Schaefer, J. Am. Chem. SOC. 59, 1406 (1937). [26] S. Weinberg, The First Three Minutes (Bantam, New York, 1979). [27] J. A. Wheeler and R. P. Feynman, Rev. Mod. Phys. 17, 157 (1945). [28] J. A. Wheeler and R. P. Feynman, Rev. Mod. Phys. 21, 425 (1949). 96 AVERY

[29] G. Ludwig, Z. Naturforsch. A 5, 637 (1950). [30] G. Siissmann, Z. Phys. 131,629 (1952). [31] G. N. Lewis, Proc. Natl. Acad. Sci. USA 12, 22 (1926). [32] L. E. Lundberg, Commun. Math. Phys. 50, 103 (1976). [33] H. Gay, “Studies of phosphorescence and energy transfer between triplet states in aromatic hydrocarbons, Ph.D. Thesis, Imperial College of Science and Technology, London, 1964. [34] C. C. J. Roothaan, in The Uncertainty Principle and Foundations of Quantum Mechanics, A Fifty Year Survey-A Tribute to Werner Heisenberg, W. C. Price and S. S. Chissick, Eds. (Wiley, London, 1977). [3S] F. Hoyle and J. V. Narlikar, Ann. Phys. 54, 207 (1969); 62, 44 (1971). [36] P. C. W. Davies, Proc. Cambr. Philos. SOC.68, 751 (1970). [37] P. C. W. Davies, J. Phys. A 4, 836 (1971); 5, 1025 (1972). [38] H. J. Groenwold, K. Ned. Akad. Wet. Versl. Gewone Vergad. Afd. Natuur. Kd. 52, 226 (1949); 53,414, 610 (1950). [39] J. A. Wheeler, “Pregeometry: Motivations and prospects,” in Quantum Theory and Gravitation, A. R. Marlow, Ed. (Academic, New York, 1980). [40] S. Weinberg, Gravitation and Cosmology: Principles and Applications of the General Theory of Relativity (Wiley, New York, 1972), p. 619. [41] L.-E. Lundberg, Commun. Math. Phys. 50, 103 (1976). [42] L.-E. Lundberg, preprint (1982). [43] E. A. Power, Phys. Rev. A 10, 756 (1974). [44] E. A. Power, J. Chem. Phys. 46,4297 (1967). [4S] E. A. Power, W. J. Meath, and J. 0. Hirshfelder, Phys. Rev. Lett. 17, 799 (1966). [46] L. Gomberoff and E. A. Power, Proc Phys. SOC.88, 281 (1966). [47] L. Gomberoff, Proc. R. SOC.London Ser. A 295,476 (1966). [48] R. R. Mclone and E. A. Power, Proc. R. SOC.London Ser. A 286,573 (1965). Einstein A-coefficients for acridine orange in various solvents

March 3, 2019

Definition of the Einstein A-coefficients The Wikipedia article on the Einstein A and B coefficients defines the A coefficients as follows: “Spontaneous emission is the process by which an electron ‘spontaneously’ (i.e. without any outside influence) decays from a higher energy level to a lower one. 1 The process is described by the Einstein coefficient A21 sec− , which gives the proba- bility per unit time that an electron in state 2 with energy E2 will decay spontaneously to state 1 with energy E , emitting a photon with an energy E E = hν. Due to 1 2 − 1 the energy-time uncertainty principle, the transition actually produces photons within a narrow range of frequencies called the spectral linewidth. If ni is the number density of atoms in state i , then the change in the number density of atoms in state 2 per unit time due to spontaneous emission will be

dn 2 = A n (1) dt − 21 2  spontaneous The same process results in increasing of the population of the state 1:

dn 1 = A n (2) dt 21 2  spontaneous Tables of Einstein A-coefficients An article by P.M. Meti et. al. gives the following values for the Einstein A-coefficients of acridine orange hemi zinc salt in various solvents as functions of concentration. These tables refer to the transition from the first electronically excited state of acridine orange to the ground state.

1 Table 1: Acridine orange in ethanol solvent

1 concentration (M) A-coefficient, sec−

5 6 1 10− 2.5 10 × ×

6 6 2 10− 6.9 10 × ×

6 6 3 10− 4.6 10 × ×

6 6 4 10− 4.14 10 × ×

Table 2: Acridine orange in propanol solvent

1 concentration (M) A-coefficient, sec−

5 6 1 10− 6.9 10 × ×

6 6 2 10− 1.61 10 × ×

6 6 3 10− 1.65 10 × ×

6 6 4 10− 8.6 10 × ×

2 Table 3: Acridine orange in butanol solvent

1 concentration (M) A-coefficient, sec−

5 5 1 10− 4.6 10 × ×

6 5 2 10− 1.15 10 × ×

6 5 3 10− 1.03 10 × ×

6 5 4 10− 7.47 10 × ×

Table 4: Acridine orange in decanol solvent

1 concentration (M) A-coefficient, sec−

5 5 1 10− 4.6 10 × ×

6 6 2 10− 3.2 10 × ×

6 6 3 10− 2.07 10 × ×

6 5 4 10− 9.6 10 × ×

3 Derivation of the transition dipole moment from the Einstein A- coefficient Here is an excerpt from Notes.pdf:

In the dipole approximation, the average rate of spontaneous photon emmission is given by 1 1 e2 1 e2 = k3D2c = (kD)2kc (3) τ 3 c 3 c 0 ~  ~  where 2π E k = = i (4) λ ~c is the wave number of the emitted light, λ being the wavelength and Ei the energy difference between the Stokes-shifted initial excited state and the ground state. For acridine orange, the maximum emission takes place at a wavelength of λ 550 nanometers = 5500 Angstroms (5) ≈ Then 2π ω = kc 3 108m/sec 1 ≈ 5.5 10 7m ×  −  × 15 1 = 3.43 10 sec− (6) × For an organic molecule such as acridine orange, the transition dipole moment, D, can be approximated by the size of the molecule, let us say 1 nanometer. With this very rough estimate, we can write 2 2 2 2 2πD 2π 4 k D 1.3 10− (7) ≡ λ ≈ 550 ≈ ×     a pure number. The fine structure constant is also a pure number. Putting all these elements together, we obtain a very rough estimate for the rate of spontaneous emission of an organic dye molecule such as acridine orange: 1 1 e2 = k3D2c τ 3 c 0 ~  1 1 4 15 1 1.3 10− 3.43 10 sec− ≈ 3 137 × ×   9 1 10 sec−  (8) ≈ For 2 molar acridine orange in ethanol, the rate of spontaneous photon emission de- 6 1 rived from the Einstein A-cofficient is 6.9 10 sec− . Thus with this solvent and × concentration, equation (1) becomes

dn2 6 1 = 6.9 10 n sec− (9) dt − × 2  spontaneous

4 Figure 1: Acridine orange hemi zinc salt.

where n2 is the number of acridine orange molecules per unit volume. In the special case where there is only one molecule present, equations (8) and (9) can be compared. We find from the ratio that

6 2 6.9 10 4 6 (kD) = × 1.3 10− 10− (10) 109 × × ≈   

5 370 QUANTUM THEORY Bibliography

[1] Abragam, A. and Bleany B., Electron Paramagnetic Resonance of Transition Ions, Clarendon, 1970. [Akhiezer and Berestetskii, 1965] Akhiezer, A.I. and Berestetskii, V.B., Quantum Elec- trodynamics, Interscience, New York, 1965. [2] Anderson, R.W.; Aquilanti, V.; Cavalli, S. and Grossi, G., Steriodirected states in molecular dynamics: a discrete basis for the quantum mechanical scattering matrix, J. Phys. Chem., 95 8184, 1991. [Aquilanti et al., 1996] Aquilanti, V., Cavalli, S., De Fazio, D, and Grossi, G. Hyperangular momentum: Applications to atomic and molecular science, in New Methods in Quantum Theory, Tsipis, C.A., Popov, V.S., Herschbach, D.R., and Avery, J.S., Eds., Kluwer, Dordrecht, 1996. [3] Aquilanti, V.; Cavalli, S. and Grossi, G., Hyperspherical coordinates for molecular dynamics by the method of trees and the mapping of potential-energy surfaces for triatomic systems, J. Chem. Phys., 85 1362, 1986. [4] Aquilanti, V.; Grossi, G.; Lagan´a,A.; Pelikan, E., and Klar, H., A decoupling scheme for a 3-body problem treated by expansions into hyperspherical harmonics. The hydrogen molecular ion, Lett. Nuovo Cimento, 41 541, 1984. [5] Aquilanti, V.; Grossi, G., and Lagan´a,A., On hyperspherical mapping and harmonic expansions for potential energy surfaces J. Chem. Phys., 76 1587-8, 1982. [6] Aquilanti, V. and Cavalli, S., Hyperspherical analysis of kinetic paths for elementary chemical reactions and their angular momentum dependence, Chem. Phys. Lett., 141 309, 1987. [7] Aquilanti, V.; Cavalli, S.; Grossi, G., and Anderson, R.W., Representation in Hy- perspherical and Related Coordinates of the potential-energy surface for triatomic reactions, J. Chem. Soc. Faraday Trans., 86 1681, 1990. [8] Aquilanti, V.; Benevente, L.; Grossi, G. and Vecchiocattivi, F., Coupling schemes for atom-diatom interactions, and an adiabatic decoupling treatment of rotational temperature effects on glory scattering, J. Chem. Phys., 89 751-761, 1988.

371 372 BIBLIOGRAPHY

[9] Aquilanti, V. and Grossi, G., Angular momentum coupling schemes in the quantum mechanical treatment of P-state atom collisions J. Chem. Phys., 73 1165-1172, 1980. [10] Aquilanti, V. and Grossi, G., Discrete representations by artificial quantization in the quantum mechanics of anisotropic interactions, Lett. Nuovo Cimento, 42 157, 1985. [11] Aquilanti, V.; Cavalli, S., and Grossi, G., Discrete analogs of spherical harmon- ics and their use in quantum mechanics: The hyperquantization algorithm, Theor. Chem. Acta, 79 283, 1991. [12] Aquilanti, V. and Cavalli, S., Discrete Analogs of Hyperspherical Harmonics and their use for the Quantum Mechanical Three Body Problem, Few Body Systems, Suppl., 6 573, 1992. [13] Aquilanti, V. Cavalli, S. and De Fazio, D., Angular and hyperangular momentum coupling coefficients as Hahn polynomials, J. Phys. Chem., 99 15694, 1995. [14] Aquilanti, V., Cavalli, S., Coletti, C. and Grossi, G., Alternative Sturmian bases and momentum space orbitals; an application to the hydrogen molecular ion, Chem. Phys., 209 405, 1996. [Aquilanti et al., 1997] Aquilanti, V., Cavalli, S. and Coletti, C., The d-dimensional hy- drogen atom; hyperspherical harmonics as momentum space orbitals and alternative Sturmian basis sets, Chem. Phys., 214 1, 1997. [15] Aquilanti, V., and Avery, J., Generalized potential harmonics and contracted stur- mians, Chem. Phys. Letters, 267 1, 1997. [16] Aquilanti, V., Cavalli, S., Coletti, C., Di Domenico, D., and Grossi, G., Hyperspher- ical harmonics as Sturmian orbitals in momentum space: a systematic approach to the few-body Coulomb problem, Int. Rev. Phys. Chem., 20 673, 2001. [17] Aquilanti, V. and Caligiana, A., Sturmian approach to one-electron many-center systems: integrals and iteration schemes, Chem. Phys. Letters, 366 157 2002. [18] Aquilanti, V., Caligiana A., and Cavalli, S., Hydrogenic elliptic orbitals, coulomb sturmian sets. Recoupling coefficients among alternative bases, Int. J. Quantum Chem., 92 99, 2003. [19] Aquilanti, V., Caligiana, A., Cavalli, S., and Coletti, C., Hydrogenic orbitals in momentum space and hyperspherical harmonics. Elliptic sturmian basis sets, Int. J. Quantum Chem., 92 212, 2003. [20] Aquilanti, V., and Caligiana, A., Sturmian orbitals in Quantum Chemistry: an introduction, in Fundemental World of Quantum Chemistry: A Tribute to the Memory of P.O. L¨owdin,I, Br¨andas,E.J. and Kryachko, E.S., Eds., Kluwer, Dordrecht, 297, 2003. BIBLIOGRAPHY 373

[21] Aquilanti, V. and Avery, J., Sturmian expansions for quantum mechanical many- body problems and hyperspherical harmonics, Adv. Quant. Chem., 39 72-101, 2001.

[22] Atkins, P.W., Child, M.S. and Phillips, C.S.G., Tables for Group Theory, Ox- ford, 1970.

[Avery, 1976] Avery, J., Creation and Annihilation Operators, McGraw-Hill, 1976.

[Avery and Ørmen, 1980] Avery, J. and Ørmen, P.J., Transferable integrals in a deforma- tion density approach to molecular orbital calculations; IV. Evaluation of angular integrals by a vector-pairing method, Int. J. Quantum Chem., 18 953-971, 1980.

[Avery, 1989] Avery, J., Hyperspherical Harmonics; Applications in Quantum Theory, Kluwer Academic Publishers, Dordrecht, 1989.

[23] Avery, J., Hyperspherical Sturmian basis functions in reciprocal space, in New Methods in Quantum Theory, Tsipis, C.A., Popov, V.S., Herschbach, D.R., and Avery, J.S., Eds., Kluwer, Dordrecht, 1996.

[24] Avery, J. and Antonsen, F., A new approach to the quantum mechanics of atoms and small molecules, Int. J. Quantum Chem., Symp., 23 159, 1989.

[25] Avery, J. and Antonsen, F., Iteration of the Schr¨odingerequation, starting with Hartree-Fock wave functions, Int. J. Quantum Chem., 42 87, 1992.

[26] Avery, J. and Antonsen, F., Theor. Chim. Acta, 85 33, 1993.

[27] Avery, J. and Herschbach, D. R., Hyperspherical Sturmian basis functions, Int. J. Quantum Chem., 41 673, 1992.

[28] Avery, J. and Wen, Z.-Y., A Formulation of the quantum mechanical many-body in terms of hyperspherical coordinates, Int. J. Quantum Chem., 25 1069, 1984.

[29] Avery, J., Correlation in iterated solutions of the momentum-space Schr¨odinger equation, Chem. Phys. Lett., 138 (6) 520-4, 1987.

[30] Avery, J., Hyperspherical harmonics; Some properties and applications, in Con- ceptual Trends in Quantum Chemistry, Kryachko, E.S., and Calais, J.L., Eds, Kluwer, Dordrecht, 1994.

[31] Avery, J., Hansen, T.B., Wang, M. and Antonsen, F., Sturmian basis sets in mo- mentum space, Int. J. Quantum Chem., 57 401, 1996.

[32] Avery, J., and Hansen, T.B., A momentum-space picture of the chemical bond Int. J. Quantum Chem., 60 201, 1996.

[33] Avery, J., Many-particle Sturmians, J. Math. Chem., 21 285, 1997. 374 BIBLIOGRAPHY

[34] Avery, J. and Antonsen, F., Relativistic sturmian basis functions, J. Math. Chem., 24 175, 1998.

[35] Avery, J., A formula for angular and hyperangular integration, J. Math. Chem., 24 169, 1998.

[Avery, 1999] Avery, J., Many-electron Sturmians applied to atoms and ions, J. Mol. Struct., 458 1, 1999.

[Avery, 1999a] Avery, J., Many-electron Sturmians as an alternative to the SCF-CI Method, Adv. Quantum Chem., 31 201, 1999.

[Avery, 2000] Avery, J., Hyperspherical Harmonics and Generalized Sturmians, Kluwer Academic Publishers, Dordrecht, Netherlandsf, 2000.

[36] Avery, J. Selected applications of hyperspherical harmonics in quantum theory, J. Phys. Chem.. 97 2406, 1993.

[37] Avery, J. and Antonsen, F., Evaluation of angular integrals by harmonic projection, Theor. Chim. Acta., 85 33, 1993.

[38] Avery, J., Fock transforms in reciprocal-space quantum theory, J. Math. Chem., 15 233, 1994.

[39] Aquilanti V. and Avery, J., Generalized potential harmonics and contracted Stur- mians, Chem. Phys. Letters, 267 1-8, 1997.

[40] Avery, J., Many-electron Sturmians applied to atoms and ions, J. Mol. Struct. (Theochem), 458 1-9, 1999.

[41] Avery, J., A formula for angular and hyperangular integration, J. Math. Chem., 24 169-174, 1998.

[42] Avery, J. and Sauer, S., Many-electron Sturmians applied to molecules, in Quan- tum Systems in Chemistry and Physics, Volume 1, Hern´andez-Laguna, A., Maruani, J., McWeeney, R. and Wilson, S, Eds., Kluwer Academic Publishers, 2000.

[43] Avery, J. and Coletti, C., Generalized Sturmians applied to atoms in strong external fields, J. Math. Chem., 27 43-51 2000.

[44] Avery, J. and Coletti, C., Many-electron Sturmians applied to atoms and ions in strong external fields, in New Trends in Quantum Systems in Chemistry and Physics, Marauani, J. et al., Eds., 77-93, Kluwer Academic Publishers, 2001.

[45] Avery, J. and Shim, R., Core ionization energies of atoms and molecules calculated using the Generalized Sturmian Method, Int. J. Quantum Chem., 79 1-7, 2000. BIBLIOGRAPHY 375

[46] Avery, J., The Generalized Sturmian Method and inelastic scattering of fast elec- trons, J. Math. Chem., 27, 279-292, 2000. [47] Avery, J., Sturmian methods in quantum theory, in Proc. Workshop on Concepts in Chemical Physics, Billing, G.D. and Henriksen, N., Eds., Danish Technical University, 2001. [Avery, 2003] Avery, J., Sturmians, in Handbook of Molecular Physics and Quan- tum Chemistry, Wilson, S., Ed., Wiley, Chichester, 2003. [48] Avery, J., Harmonic polynomials, hyperspherical harmonics, and Sturmians, in Fundemental World of Quantum Chemistry, A Tribute Volume to the Memory of Per-Olov L¨owdin, Br¨andas,E.J. and Kryachko, E.S., Eds., Kluwer Academic Publishers, Dordrecht, 261-296, 2003. [49] Avery, J. and Avery, J., The Generalized Sturmian Method for calculating spectra of atoms and ions, J. of Math. Chem., 33 145-162, 2003. [50] Avery, J. and Avery, J., Kramers pairs in configuration interaction, Adv. Quant. Chem., 43 185-206, 2003. [51] Avery, J., Many-center Coulomb Sturmians and Shibuya-Wulfman integrals, Int. J. Quantum Chem., 100 121-130, 2003. [52] Avery, J., Avery J. and Goscinski, O., Natural orbitals from generalized Sturmian calculations, Adv. Quant. Chem., 43 207-16, 2003. [Avery, Avery, Aquilanti and Caligiana] Avery, J., Avery, J., Aquilanti, V., and Caligiana, A., Atomic densities, polarizabilities and natural orbitals derived from generalized Sturmian calculations, Adv. Quant. Chem., 47 156-173, 2004. [Avery and Avery, 2004] Avery, J. and Avery, J., Generalized Sturmian solutions for many-particle Schr¨odingerequations, J. Phys. Chem. A, 41 8848, 2004. [Avery and Avery, 2006] Avery, J. and Avery, J., Generalized Sturmians and Atomic Spectra, World Scientific, 2006. [Avery and Avery, 2006a] Avery, J. and Avery, J., The Generalized Sturmian Library, http://sturmian.kvante.org, 2006a. [53] Avery, J. and Avery, J., Atomic core-ionization energies; approximately piecewise- linear relationships, J. Math. Chem., 46 164-181, 2009. [Avery, 2009] Avery, J., Harmonic polynomials, hyperspherical harmonics and atomic spec- tra, J. Comp. Appl. Math., 2009. [Avery and Avery, 2009] Avery, J., and Avery, J., Can Coulomb Sturmians be used as a basis for N-electron molecular calculations?, J. Phys. Chem A,, 113 14565, 2009. 376 BIBLIOGRAPHY

[Avery, 2011] Avery, J.E., New Computational Methods in the Quantum Theory of Nanostructures, Ph.D. Thesis, University of Copenhagen, 2011.

[54] Balhausen, C.J., Introduction to Ligand Field Theory, McGraw-Hill, 1962.

[55] Ballot, L. and Farbre de la Ripelle, M., Application of the hyperspherical formalism to trinucleon bound-state problems, Ann. Phys., 127 62, 1980.

[56] Bandar, M. and Itzyksen, C., Group theory and the H atom, Rev. Mod. Phys., 38 330, 1966.

[57] Bang, J.M. and Vaagen, J.S., The Sturmian expansion: a well-depth-method for orbitals in a deformed potential, Z. Phys. A, 297 223-36, 1980.

[58] Bang, J.M., Gareev, F.G., Pinkston, W.T. and Vaagen, J.S., One- and two-nucleon overlap functions in nuclear physics Phys. Rep., 125 253-399, 1985.

[59] Baryudin, L. E. and Telnov, D. A., Sturmian expansion of the electron density deformation for 3d-metal ions in electric field, Vestn. Leningr. Univ., Ser 4: Fiz., Khim., 1 83-6, 1991.

[60] Baretty, Reinaldo; Ishikawa, Yasuyuki; and Nieves, Jose F., Momentum space ap- proach to relativistic atomic structure calculations, Int. J. Quantum Chem., Quan- tum Chem. Symp., 20 109-17, 1986.

[61] Biedenharn, L.C. and Louck, J.D., Angular Momentum in Quantum Physics, Addison Wesley, Reading, Mass, 1981.

[62] Biedenharn, L.C. and Louck, J.D., The Racah-Wigner Algebra in Quantum Theory, Addison Wesley, Reading, Mass, 1981.

[63] Bishop, D.B., Group Theory and Chemistry, Courier Dover Publications, 1993.

[64] Blinder, S.M., On Green’s functions, propagators, and Sturmians for the nonrel- ativistic Coulomb problem, Int. J. Quantum Chem., Quantum Chem. Symp., 18 293-307, 1984.

[65] Bransden, B.H.; Noble, C.J.; and Hewitt, R.N., On the reduction of momentum space scattering equations to Fredholm form, J. Phys. B: At., Mol., Opt. Phys., 26 2487-99, 1993.

[66] Brandsen, B.H., and Joachim, C.J., Physics of Atoms and Molecules; 2nd Ed., Prentace-Hall, 2003.

[67] Brink, D.M. and Satchler, D.R., Angular Momentum, Clarendon, Oxford, 1968.

[Caligiana, 2003] Caligiana. A., Sturmian Orbitals in Quantum Chemistry, Ph.D. thesis, Department of Chemistry, University of Perugia, Italy, October, 2003. BIBLIOGRAPHY 377

[68] Coleman, A.J., The Symmetric Group Made Easy, Adv. Quantum Chem., 4 83-108, 1968.

[Coletti, 1998] Coletti, C. Struttura Atomica e Moleculare Come Tottura della Simmetria Ipersferica, Ph.D. thesis, Department of Chemistry, University of Perugia, Italy, 1998.

[69] Condon, E.U. and Shortley, G.H., Theory of Atomic Spectra, Cambridge, 1953.

[70] Cotton, F.A., Chemical Applications of Group Theory, Interscience, 1963.

[71] Coulson, C.A., Momentum distribution in molecular systems. I. Single bond. III. Bonds of higher order, Proc. Camb. Phys. Soc., 37 74, 1941.

[72] Coulson, C.A. and Duncanson, W.E., Momentum distribution in molecular systems. II. C and C-H bond, Proc. Camb. Phys. Soc., 37 67, 1941.

[73] Coxeter, H.S.M. et al. Editors, Mathematical Expositions, Volume 21: The Collected Papers of Alfred Young, 1873-1940, University of Toronto Press, Toronto, 1977.

[74] Dahl, J.P., The Wigner function, Physica A, 114 439, 1982.

[75] Dahl, J.P., On the group of translations and inversions of phase space and the Wigner function, Phys. Scripta, 25 499-503, 1982.

[76] Dahl, J.P., Dynamical equations for the Wigner functions, in Energy Storage and Redistribution in Molecules, Hinze, J., Ed., Plenum, New York, 1983.

[77] Dahl, J.P., The phase-space representation of quantum mechanics and the Bohr- Heisenberg correspondence principle, in Semiclassical Description of Atomic and Nuclear Collisions, Bang, J. and De Boer, J., Eds., North Holland, Amster- dam, 1985.

[78] Dahl, J.P., The dual nature of phase-space representations, in Classical and Quan- tum Systems, Doebner, H.D. and Schroeck, F., Jr., Eds., World Scientific, Singa- pore, 1993.

[79] Dahl, J.P., A phase space essay, in Conceptual Trends in Quantum Chemistry, Kryachko, E.S. and Calais, J.L., Eds., Kluwer Academic Publishers, Dordrecht, Netherlands, 1994.

[80] Dahl, J.P. and Springborg, M., The Morse oscillator in position space, momentum space, and phase space J. Chem. Phys., 88 4535-47, 1988.

[81] Dahl, J.P. and Springborg, M., Wigner’s phase-space function and atomic structure. I. The hydrogen atom , J. Mol. Phys., 47 1001, 1982. 378 BIBLIOGRAPHY

[82] Das, G.P.; Ghosh, S.K.; and Sahni, V.C., On the correlation energy density func- tional in momentum space, Solid State Commun., 65 (7) 719-21, 1988.

[83] Dinesha, K.V., Rettrup, S., and Sarma, C.R., An Indexing Scheme for Spin-Free Configurations of Electrons, Int. J. Quantum Chem., 34 445-455, 1988.

[84] Dinesha, K.V., Sarma, C.R, and Rettrup, S., Group Theoretical Techniques and the Many-Electron Problem, Adv. Quantum Chem., 14 125-168, P. 1981.

[85] Englefield, M.J., Group theory and the Coulomb problem, Wiley-Interscience, New York, 1972.

[Fock, 1935] Fock, V.A., Zur Theorie des Wasserstoffatoms, Z. Phys., 98 145, 1935.

[Fock, 1958] Fock, V.A., Hydrogen atoms and non-Euclidian geometry, Kgl. Norske Viden- skab. Forh., 31 (1958) 138.

[86] Fonseca, A.C. and Pena, M.T., Rotational-invariant Sturmian-Faddeev ansatz for the solution of hydrogen molecular ion (H2+): a general approach to molecular three- body problems, Phys. Rev. A: Gen. Phys., 38 4967-84, 1988.

[87] Friis-Jensen, B., Cooper, D.L., and Rettrup, S., Normalization of Projected Spin Eigenfunctions, J. Math. Chem., 22 249-254, 1997.

[88] Friis-Jensen, B., Cooper, D.L., and Rettrup, S., Spin-Coupled Calculations based on Projected Spin Eigenfunctions, Theor. Chem. Acc., 99 64-67, 1998.

[89] Friis-Jensen, B., and Rettrup, S., A Spin-Free Approach for Evaluation of Electronic Matrix Elements using Character Operators of N , Int. J. Quantum Chem., 60, 983- 991, 1996. S

[90] Friis-Jensen, B., Rettrup, S., and Sarma, C.R. An Indexing Scheme for Classes of N : Partitions of N, Int. J. Quantum Chem., 64 421-426, 1997. S [Gasaneo et al., 2009] Gasaneo, G. et al., Theory of hyperspherical Sturmians for three- body reactions, J. Phys. Chem. A, 113, 14573-14582, 2009.

[91] Gasaneo, G. et al., Multivariable hypergeometric solutions for three charged particles, J. Phys. B, 30 L265-L271, 1997.

[92] Gasaneo, G., and Mmacek, J.H., Hyperspherical adiabatic eigenfunctions for zero- range potentials, J. Phys. B., 35 2239-2250, 2002.

[93] Gasaneo, G., and Colavecchia, F.D., Two-body Coulomb wave functions as a kernel for three alternative integral transformations , J. Phys. A, 36 8443-8462, 2003.

[94] Gerratt, J., General theory of spin-coupled wave functions for atoms and molecules, Adv. At. Mol. Phys., 7 141-221, 1971. BIBLIOGRAPHY 379

[Goscinski, 1968, 2003] Goscinski, O., Upper and lower bounds and the conjugate eigen- value problem, Adv. Quantum Chem., 41 51-85, 2003.

[Gradshteyn and Ryshik, 1965] Gradshteyn, I.S. and Ryshik, I.M., Tables of Integrals, Series and Products, Academic Press, New York, 1965.

[95] Griffeth, J.S., The Theory of Transition Metal Ions, Cambridge, 1964.

[96] Gruber, B., Editor, Symmetries in Science, VI: From the Rotation Group to Quantum Algebras, Plenum, New York, 1993.

[97] Guldberg, A., Rettrup, S., Bendazzoli, G.L., and Palmieri, P., A New Symmetric Group Programme for Direct Configuration Interaction Studies in Molecules, Int. J. Quantum Chem. (Symp.), 21 513-521, 1987.

[98] Hall, G.G., Applied Group Theory, Longmans, 1967.

[99] Hammermesh, M., Group Theory and Its Applications to Physical Prob- lems, Addison-Wesley, 1964.

[100] Hansen, T.B., The many-center one-electron problem in momentum space, Thesis, Chemical Institute, University of Copenhagen, 1998.

[Harris and Michels, 1967] Harris, F.E. and Michels, H.H., Evaluation of molecular inte- grals for Slater-type orbitals, Adv. Chem. Phys., 13 205, 1967.

[101] Hartt, K. and Yidana, P.V.A., Analytic Sturmian functions and convergence of separable expansions, Phys. Rev. C: Nucl. Phys., 36 475-84, 1987.

[102] Heine, V., Group Theory in Quantum Mechanics, Pergamon, 1960.

[103] Herschbach, D. R., Dimensional interpolation for two-electron atoms, J. Chem. Phys., 84 838, 1986.

[104] Herschbach, D. R., Avery, J. and Goscinski, O., Eds., Dimensional Scaling in Chemical Physics, Kluwer, Dordrecht, 1993.

[105] Hochstrasser, R.M., Molecular Aspects of Symmetry, Benjamin, 1966.

[106] Hollas, J.M., Symmetry in Molecules, Chapman, 1972.

[107] Jaff´e,H.H. and Ochrin M., Symmetry in Chemistry, Wily, 1965.

[Judd, 1975] Judd, B.R., Angular Momentum Theory for Diatomic Molecules, Academic Press, New York, 1975.

[108] Judd, B.R., Operator Techniques in Atomic Spectroscopy, McGraw-Hill, 1963. 380 BIBLIOGRAPHY

[109] Judd, B.R., Second Quantization and Atomic Spectroscopy, Johns Hopkins Press, 1967.

[110] Kadolkar, C., Sarma, C.R., and Rettrup, S., Configuration Interaction Studies Using Biorthogonal Approach to VB Basis, Int. J. Quantum Chem., 53 183-187, 1995.

[111] Kaplan, I.G., Symmetry of Many-Electron Systems, Academic Press, 1975.

[112] Karule, E. and Pratt, R.H., Transformed Coulomb Green function Sturmian expan- sion, J. Phys. B: At., Mol. Opt. Phys., 24 1585-91, 1991.

[113] Knox, R.S. and Gold, A., Symmetry in the Solid State, Benjamon, 1964.

[Koga et al., 1984] Koga, T. and Murai, Takeshi, Energy-density relations in momentum space. III. Variational aspect, Theor. Chim. Acta, 65 311-16, 1984.

[114] Koga, T., Direct solution of the H(1s) H+ long-range interaction problem in momentum space, J. Chem. Phys., 82 2022,− 1985.

[115] Koga, T. and Matsumoto, S., An exact solution of the interaction problem between two ground-state hydrogen atoms, J. Chem. Phys., 82 5127, 1985.

[116] Koga, T. and Kawaai, R., One-electron diatomics in momentum space. II. Second and third iterated LCAO solutions J. Chem. Phys., 84 5651-4, 1986.

[117] Koga, T. and Matsuhashi, T., One-electron diatomics in momentum space. III. Nonvariational method for single-center expansion, J. Chem. Phys., 87 1677-80, 1987.

[Koga and Matsuhashi, 1987] Koga, T. and Matsuhashi, T., Sum rules for nuclear attrac- tion integrals over hydrogenic orbitals, J. Chem. Phys., 87 4696-9, 1987.

[118] Koga, T. and Matsuhashi, T., One-electron diatomics in momentum space. V. Non- variational LCAO approach, J. Chem. Phys., 89 983, 1988.

[119] Koga, T.; Yamamoto, Y., and Matsuhashi, T., One-electron diatomics in momen- tum space. IV. Floating single-center expansion, J. Chem. Phys., 88 6675-6, 1988.

[120] Koga, T. and Ougihara, T., One-electron diatomics in momentum space. VI. Non- variational approach to excited states, J. Chem. Phys., 91 1092-5, 1989.

[Koga et al., 1991] Koga, T.; Horiguchi, T. and Ishikawa, Y., One-electron diatomics in momentum space. VII. Nonvariational approach to ground and excited states of heteronuclear systems, J. Chem. Phys., 95 1086-9, 1991.

[121] Kupperman, A. and Hypes, P.G., 3-dimensional quantum mechanical reactive scat- tering using symmetrized hyperspherical coordinates, J. Chem. Phys., 84 5962, 1986. BIBLIOGRAPHY 381

[122] la Cour Jansen, T., Rettrup, S., Sarma, C.R., and Snijders, J.G. On the Evaluation of Spin-Orbit Coupling Matrix Elements in a Spin Adapted Basis, Int. J. Quantum Chem., 73 23-27, 1999. [123] Leech, J.W. and Newman, D.J., How to Use Groups, Methuen, 1969. [124] Lin, C.D., Analytical channel functions for 2-electron atoms in hyperspherical coor- dinates, Phys. Rev. A, 23 1585, 1981. [125] Linderberg, J. and Ohrn,¨ Y., Kinetic energy functional in hyperspherical coordi- nates, Int. J. Quantum Chem., 27 273, 1985. [126] Louck, J.D., Generalized orbital angular momentum and the n-fold degenerate quan- tum mechanical oscillator, J. Mol. Spectr., 4 298, 1960. [127] Loebl, E.M., Editor, Group Theory and Its Applications, Academic Press, 1968. [128] Lomont, J.S., Applications of Finite Group Theory, Academic Press, 1959. [129] Manakov, N.L.; Rapoport, L.P. and Zapryagaev, S.A., Sturmian expansions of the relativistic Coulomb Green function, Phys. Lett. A, 43 139-40, 1973. [130] Maquet, Alfred; M., Philippe and Veniard, Valerie, On the Coulomb Sturmian basis, NATO ASI Ser, Ser C, 271 (Numer. Determ. Electron. Struct. At., Diat. Polyat. Mol.), 295-9, 1989. [131] McWeeny, R., and Coulson, C.A., The computation of wave functions in momentum space. I. The helium atom, Proc. Phys. Soc. (London) A, 62 509, 1949. [132] McWeeny, R., The computation of wave functions in momentum space. II. The hydrogen molecule ion, Proc. Phys. Soc. (London) A, 62 509, 1949. [133] McWeeny, R., Methods of Molecular Quantum Mechanics, (second edition), Academic Press, 1992. [134] McWeeny, R., Symmetry, Pergamon, 1963. [135] Meijer, P.H.E. and Bauer, E., Group Theory, North Holland, 1962. [Michels, 1981] Michels, M.A.J., Evaluation of angular integrals in rotational invariants, Int. J. Quantum Chem., 20 951-952, 1981. [136] Monkhorst, H.J. and Harris, F.E., Accurate calculation of Fourier transform of two-center Slater orbital products, Int. J. Quantum Chem., 6 601, 1972. [Monkhorst and Jeziorski, 1979] Monkhorst, H.J. and Jeziorski, B., No linear dependence or many-center integral problems in momentum space quantum chemistry, J. Chem. Phys., 71 5268-9, 1979. 382 BIBLIOGRAPHY

[137] Moore, C.E., Atomic Energy Levels; Circular of the National Bureau of Standards 467, Superintendent of Documents, U.S. Government Printing Office, Washington 25 D.C., 1949.

[138] Morse P.M. and Feshbach, H., Methods of Theoretical Physics, McGraw Hill, 1953.

[139] Motenberg, M., Bivens, R., Metropolis, N., and Wooten, J.K. Jr. The 3-j and 6-j Symbols, Technology Press, MIT, Cambridge Mass., 1959.

[140] Murnagham, F.D. The Theory of Group Representations, Johns Hopkins Press, Baltimore, 1938.

[141] National Institute for Fusion Science (NIFS), Atomic and Molecular Databases: Data for Autoionizing States, http://dprose.nifs.ac.jp/DB/Auto/

[National Institute for Standards and Technology (NIST)] National Institute of Standards and Technology (NIST), NIST Atomic Spectra Database, http://physics.nist.gov/asd .

[National Institute of Standards and Technology, 2002] National Institute of Standards and Technology (NIST), CODATA Recommended Values of the Fundamental Physical Constants: 2002, http://physics.nist.gov/cuu/Constants/

[142] Navasa, J. and Tsoucaris, G., Molecular wave functions in momentum space, Phys. Rev. A, 24 683, 1981.

[143] Ojha, P.C., The Jacobi-matrix method in parabolic coordinates: expansion of Coulomb functions in parabolic Sturmians, J. Math. Phys. (N Y), 28 392-6, 1987.

[144] Orchin, M. and Jaff´e,H.H., Symmetry, Orbitals and Spectra, Wily, 1971.

[145] Orgel, L.E., An Introduction to Transition Metal Chemistry Ligand Field Theory, Methuen, 1960.

[146] Paldus, J., Coupled Cluster Methods, in Wilson, S., Editor, Handbook of Molec- ular Physics and Quantum Chemistry, Wiley, 2003.

[147] Paldus, J., Dynamical Groups, in Drake, G.W.F. Editor, Atomic, Molecular and Optical Physics Handbook, American Institute of Physics, New York, 1996.

[148] Paldus, J., and Gould, M.D., Unitary Group Approach to Reduced Density Matrices, II, Theor. Chim. Acta, 86 83-96, 1993.

[149] Paldus, J., Rettrup, S., and Sarma, C.R., Clifford algebra realization of Rumer-Weyl basis, J. Mol. Structure (Theochem), 199 85-101, 1989. BIBLIOGRAPHY 383

[150] Palmieri, P., and Rettrup, S., Teoria Del Gruppo Simmetrico Per Il Calcolo Degli Elementi Di Matrice E Per La Soluzione Delle Equazioni CI, in Scuola di Chimica Teorica : Basi Teoriche ed Applicationi Numeriche, Moccia, R. and Pisani, C., Eds., Polo Editorale Chimico, Milano, 197-276, 1990.

[151] Palmieri, P., Tarroni, R., Mitrushenkov, A.O., and Rettrup, S., Efficient Truncation Strategies for Multi Reference Configuration Interaction Molecular Energies and Properties, J. Chem. Phys., 109 7085-7091, 1998.

[152] Pauncz, R., The Symmetric Group in Quantum Chemistry, CRC Press, Boca Raton, 1995.

[153] Petrochen, M.I. and Trifanov, E.D., Group Theory in Quantum Mechanics, Iliffe, 1969.

[Pinchon and Hoggan 2007] Pinchon, D. and Hoggan, P.E., Rotation matrices for real spherical harmonics: general rotations of atomic orbitals in space-fixed axes, J. Phys. A., 40 1597-16100, 2007.

[Pinchon and Hoggan, 2009] Pinchon, D. and Hoggan, P.E., Gaussian approximation of exponential type orbitals based on B functions, Int. J. Quantum Chem., 109 135- 148, 2009.

[154] Rettrup, S., A recursive formula for Young’s orthogonal representation, Chem. Phys. Lett., 47 59-60, 1977.

[155] Rettrup, S., Many-body correlations using unitary groups, in The Unitary Group for the Evaluation of Electronic Energy Matrix Elements, Lecture Notes in Chemistry Vol. 22, 108-118, Hinze, J., Ed., Springer-Verlag, Berlin, 1981.

[156] Rettrup, S., An Iterative Method for Calculating Several of the Extreme Eigensolu- tions of Large Real Non-Symmetric Matrices, J. Comp. Phys., 45 100-107, 1982.

[157] Rettrup, S., Direct evaluation of spin representation matrices and ordering of per- mutation group elements, Int. J. Quantum Chem., 29 119-128, 1986.

[158] Rettrup, S., The permutation group in many-electron theory, in Conceptual Trends in Quantum Chemistry III, Kryachko, E.S., and Calais, J.L., Eds, Kluwer, Dordrecht, 225-238, 1997.

[Rettrup, 2003] Rettrup, S., Lecture Notes on Quantum Chemistry, Department of Chemistry, University of Copenhagen, 2003.

[159] Rettrup, S., Alternative graphical representation of determinential many-electron states, Int. J. Quantum Chem., 106 2511-2517, 2006. 384 BIBLIOGRAPHY

[160] Rettrup, S., Bendazzoli, G.L., Evangelisti, S., and Palmieri, P., A symmetric group approach to the calculation of electronic correlation effects in molecules, in Under- standing Molecular Properties, 533-546, Avery, J.S., Dahl, J.P., and Hansen, Aa.E., Eds., Reidel Publ. Co., Dordrecht, Holland, 1987.

[161] Rettrup, S., and Chisholm, C.D.H., On R(4) Symmetries in atomic structure, The- oret. Chim. Acta (Berl.), 57 209-218, 1980.

[162] Rettrup, S., and Pauncz, R., Representations of the symmetric group generated by projected spin functions: a graphical approach, Int. J. Quantum Chem., 60 91-98, 1996.

[163] Rettrup, S., and Sarma, C.R., A new program for CI calculations in molecules, Theoret. Chim. Acta (Berl.), 46 73-76, 1977.

[164] Rettrup, S., and Sarma, C.R., A Programmable procedure for generating many- particle states, Phys. Letters, 75A 181-182, 1980.

[165] Rettrup, S., Sarma, C.R., and Dahl, J.P., Molecular point group adaptation of spin- free configurations, Int. J. Quantum Chem., 22 127-148, 1982.

[166] Rettrup, S., Thorsteinsson, T., and Sarma, C.R., A graphical approach to configura- tion interaction studies in molecules using determinants of non-orthogonal orbitals, Int. J. Quantum Chem., 40 709-717, 1991.

[167] Robinson, G. de B., Representation Theory of Symmetric Groups, University of Toronto Press, Toronto, 1961.

[168] Rohwedder, B. and Englert, B.G., Semiclassical quantization in momentum space, Phys. Rev. A: At., Mol., Opt. Phys., 49 2340-6, 1994.

[169] Rose, M.E., Elementary Theory of Angular Momentum, Wiley, 1957.

[Rotenberg, 1962] Rotenberg, M., Ann. Phys. (New York), 19 262, 1962.

[Rotenberg, 1970] Rotenberg, M., Theory and application of Sturmian functions, Adv. At. Mol. Phys., 6 233-68, 1970.

[170] Royer, A., Wigner function as the expectation value of a parity operator, Phys. Rev. A, 15 449-450, 1977.

[171] Rudin, W., Fourier Analysis on Groups, Interscience, New York, 1962.

[172] Rutherford, D.E., Substitutional Analysis, Edinburgh University Press, Edin- burgh, 1948. BIBLIOGRAPHY 385

[173] Sarma, C.R., Ahsan, M.A.H., and Rettrup, S., A graphical approach to permutation group representations for many-electron systems, Int. J. Quantum Chem., 58 637- 643, 1996.

[174] Sarma, C.R., Palmieri, P., and Rettrup, S., A graphical scheme for addressing de- terminants in large scale configuration interaction treatments, Mol. Phys., 98 1851- 1856, 2000.

[175] Sarma, C.R., Ravi Shankar, N., and Rettrup, S., A graphical scheme for representing many-electron spin-free configurations, Int. J. Quantum Chem., 86 417-421, 2002.

[176] Sarma, C.R., and Rettrup, S., A programmable spin-free method for configuration interaction, Theoret. Chim. Acta (Berl.), 46 63-72, 1977.

[177] Sarma, C.R., and Rettrup, S., A programme for the study of many-body correlations: matrix elements of the generators of U(n), J. Phys. A: Math. Gen., 13 2267-2274, 1980.

[178] Sch¨onlandD.S., Molecular Symmetry, Van Norstrand, 1965.

[Shibuya and Wulfman, 1965] Shibuya, T. and Wulfman, C.E., Molecular orbitals in mo- mentum space, Proc. Roy. Soc. A, 286 (1965) 376.

[Shull and L¨owdin,1959] Shull, H. and L¨owdin,P.-O., Superposition of configurations and natural spin-orbitals. Applications to the He problem, J. Chem. Phys., 30 617, 1959

[179] Sinanoglu, O., and Bruekner, K., Three Approaches to Electron Correlation in Atoms, Yale University Press, 1970.

[180] Slater, J.C., Quantum Theory of Atomic Structure, McGraw-Hill, 1960.

[181] Smith, F.T., Generalized angular momentum in many-body collisions, Phys. Rev., 120 1058, 1960.

[182] Smith, F.T., A symmetric representation for three-body problems. I. Motion in a plane, J. Math. Phys., 3 735, 1962.

[183] Smith, F.T., Participation of vibration in exchange reactions, J. Chem. Phys., 31 1352-9, 1959.

[184] Smith, V.H. Jr., Density functional theory and local potential approximations from momentum space considerations, in Local Density Approximations in Quan- tum Chemistry and Solid State Physics, Plenum, New York, N. Y, Dahl, J.P. and Avery, J.,Eds., 1-19, 82, 1984.

[185] Springborg, M. and Dahl, J.P., Wigner’s phase-space function and atomic structure, Phys. Rev. A, 36 1050-62, 1987. 386 BIBLIOGRAPHY

[Sommer-Larsen, 1990] Sommer-Larsen. P., Ph.D. Thesis, University of Copenhagen, 1990.

[186] Sugano, S., Tanabe, Y. and Kamimura, H., Multiplets of Transition Metal Ions in Crystals, Academic Press, 1960.

[187] Suzuki, H., Electronic Absorption Spectra and Geometry of Organic Molecules, Academic Press, 1967.

[188] Thorsteinsson, T., and Rettrup, S., Configuration interaction studies of elec- tronic structure, in Supercomputing Annual Report 1997, J. Wasniewski, Ed., UNI C, The Danish Computing Centre for Research and Education, Lyngby, 1998. • [189] Tinkham, M., Group Theory and Quantum Mechanics, McGraw-Hill, 1964.

[190] Urch, D.S., Orbitals and Symmetry, Penguin, 1970.

[Vilenkin, 1968] Vilenkin, N.K., Special Functions and the Theory of Group Rep- resentations, American Mathematical Society, Providence, R.I., 1968.

[191] Watanabe, H., Operator Methods in Ligand Field Theory, Prentace Hall, 1966.

[Weatherford, 1982] Weatherford, C.A., and Jones, H.W., Editors, International Con- ference on ETO Multicenter Integrals, Reidel, Dordrecht, 1982.

[Weyl, 1950] Weyl, H., The Theory of Groups in Quantum Mechanics, Dover, 1950.

[Weyl, 1939] Weyl, H., The Classical Groups, Their Invariants and Representa- tions, Princeton University Press, 1939.

[192] Wen, Z.-Y. and Avery, J., Some properties of hyperspherical harmonics, J. Math. Phys., 26 396, 1985.

[Weniger, 1985] Weniger, E.J., Weakly convergent expansions of a plane wave and their use in Fourier integrals, J. Math. Phys., 26 276, 1985.

[193] Weniger, E.J. and Steinborn, E.O., The Fourier transforms of some exponential- type basis functions and their relevance for multicenter problems, J. Chem. Phys., 78 6121, 1983.

[194] Weniger, E.J.; Grotendorst, J., and Steinborn, E.O., Unified analytical treatment of overlap, two-center nuclear attraction, and Coulomb integrals of B functions via the Fourier transform method, Phys. Rev. A, 33 3688, 1986.

[195] Wigner, E.P., Group Theory and Its Application to the Quantum Mechan- ics of Atomic Spectra, Academic Press, 1959. BIBLIOGRAPHY 387

[196] Wilson, S., Editor, Handbook of Molecular Physics and Quantum Chem- istry, Wiley, 2003.

[197] Wulfman, C.E., Semiquantitative united-atom treatment and the shape of triatomic molecules, J. Chem. Phys., 31 381, 1959.

[198] Wulfman, C.E., Dynamical groups in atomic and molecular physics, in Group The- ory and its Applications, Loebel, E.M. Ed., Academic Press, 1971.

[199] Wulfman, C.E., Approximate dynamical symmetry of two-electron atoms, Chem. Phys. Lett., 23 370-372, 1973.

[200] Wulfman, C.E., On the space of eigenvectors in molecular quantum mechanics, Int. J. Quantum Chem., 49 185, 1994.

[Wulfman, 2011] Wulfman, C.E., Dynamical Symmetry, World Scientific, 2011.

[201] Yurtsever, E.; Yilmaz, O., and Shillady, D.D., Sturmian basis matrix solution of vibrational potentials. Chem. Phys. Lett., 85 111-116, 1982.

[202] Yurtsever, E., Franck-Condon integrals over a Sturmian basis. An application to photoelectron spectra of molecular hydrogen and molecular nitrogen, Chem. Phys. Lett., 91 21-26, 1982.

[Cario and Franck, 1923] G. Cario and J, Frank, Z. Phys. 17, 202, (1923)

[Avery, Bay and Szent-Gy¨orgyi,1961] J.S. Avery, Z. Bay and A. Szent-Gy¨orgyi, On the Energy Transfer in Biological Systems, Proc. Nat. Acad. Sci. (US), 47, 1742-1744, (1961)

[Avery, 1966] J.S. Avery, Resonance energy transfer and spontaneous photon emission, Proc. Phys. Soc. (London) 88, Part 1, (1966)

[Avery, 1984] J.S. Avery, Use of the S-Matrix in the Relativistic Treatment of Resonance Energy Reansfer. Int. J. Quantum Chemistry, XXV, 79-96, (1984)

[Gomberoff and Power, 1966] L. Gomberoff and E.A. Power, The resonance transfer of excitation, Proc. Pys. Soc. 88, 281. (1966)

[Scholes, 2003] G.D. Scholes, Long-Range Resonance Energy Transfer in Molecular Sys- tems, Annu. Rev. Phys. Chem. 54, 57-87, (2003)

[Chou and Dennis, 2015] K.F. Chou and A.M. Dennis, F¨orsterResonance Energy Transfer Between Quantum Dot Donors and Quantum Dot Acceptors, Sensors, 2015, 15, 13288-13325, (2015) 388 BIBLIOGRAPHY

[Wegner, 2015] K-D. Wegner, F¨orsterResonance Energy Transfer From Terbium Com- plexes to Quantum Dots for Multiplexed Immunoassays and Molecular Rulers, Doc- toral Thesis, Universit´eParis-Sud, (2015)

[Kusku and Akan, 2011] M. Kusku and O.B. Akan, A Nanoscale Communication Chan- nel with Fluorescence Resonance Energy Transfer (FRET), 1st IEEE International Workshop on Molecular and Nano Scale Communication (MoNaCom), Published in: IEEE Conference on Computer Communications Workshops (INFOCOM WK- SHPS), (2011)

[Pearlstein, 2002] R.M. Pearlstein, Photosynthetic Exciton Transfer in the 1960’s, Photo- synthesis Research 73, 119-126, (2002)

[Andrews and Demidov, 1999] D.L. Andrews and A.A. Demidov, Resonance Energy Transfer, (Wiley, 1999)

[Juzeliunas and Andrews, 2000] G. Juzeliunas and D.L. Andrews, Quantum Electrodynam- ics of Resonance Energy Transfer, Adv. Chem. Physics, 112, (2000)

[Anderson, 2008] D.L. Anderson, Editor, Encyclopedia of Applied Spectroscopy, Wiley, (2008)

[Daniels et al., 2003] G.J. Daniels, R.G. Jenkens, D.S. Bradshaw, and D.L. Andrews, Res- onance Energy Transfer: The Unified Theory Revisited, J. Chem. Phys. 119, 2264- 2274, (2003)

[Bradshaw and Andrews, 2008] D.S. Bradshaw and D.L. Andrews, Optically controlled res- onance energy transfer: Mechanisms and configuration for all-optical switching, J. Chem. Phys. 128, 144506, (2008)

[Wheeler and Feynman, 1949] J.A. Wheeler and R.P. Feynman, Classical Electrodynamics in Terms of Direct Interparticle Interaction, Rev. Mod. Phys. 21, 3, 425-433, (1949)

[Kastner, 2015a] R.E. Kastner, Haag’s Theorem as a Reason to Reconsider Direct-Action Theories, International Journal of Quantum Foundations, 1, (2), 56-64, (2015) (https://arxiv.org/pdf/1502.03814.pdf)

[Kastmer, 2015b] R.E. Kastner, The Transactional Interpretation of Quantum Mechanics: The Reality of Possibility (Cambridge University Press, 2012)

[Grundler, pdf] G. Gr¨undler, Remarks on Wheeler-Feynman Absorber Theory (https://arxiv.org/pdf/1501.03516.pdf)

[Lazarovici, prepring, pdf] D. Lazarovici, Against Fields, (preprint, pdf availabe online), BIBLIOGRAPHY 389

[Klaczynski, 2015] L. Klaczynski, Haag’s Theorem in Renormalisable Quantum Field Theories, Doctoral Thesis at Humboldt-Universit¨at zu Berlin, (2015), (available on line at http://www2.mathematik.hu-berlin.de/ kreimer/wp- content/uploads/PhDLutz.pdf)

[Thompsen et. al., 2018] D. Thompsen et al. Hundred-fold enhancement in far-field radia- tive heat transfer over the blackbody limit, Nature, 3 September, 2018.

[203] R. Haag, On quantum field theories, Matematisk-fysiske Meddelelser, 29, 12, 1955.

[204] A.I. Akhiezer and V.B. Berestetski, Quantum Electrodynamics, Interscience, New York, 1965.

[205] R.S. Swathi and K.L. Sebastian, Distance dependance of fluorescence resonance energy transfer, J. Chem. Soc. 121, 777-787, (2009)

[206] R.S. Swathi and K.L. Sebastian, Long range resonance energy transfer from a dye molecule to graphene has (distance)-4 dependence, J. Chem Phys. 130, 086101, (2009).

[207] L.I. Rangel-Zamudio, D.S. Rushforth, and R.P. Van Duyne, Long-Range Reso- nance Energy Transfer from Aromatic Hydrocarbons to the Anion Radical of Tetra- cyanoquinodimethane, J. Phys. Chem. 90, 807, (1986)

[208] R. Deshmukh et al., Long-Range Resonant Energy Transfer Using Optical Topolog- ical Transitions in Metamaterials, ACS Photonics, 5, 2737-2741, (2018)

[209] K. Akulov et al., Long-Distance Resonant Energy Transfer Mediated by Hybrid Plasmonic-Photonic Modes, J. Phys. Chem. C., just-accepted manuscript, 18 June, (2018).

[210] D. Martin-Cano et al., Resonance Energy Transfer and Superradiance Mediated by Plasmonic Nanowaveguides, Nano Lett. 10 (8), 3129-3134, (2010).

[211] City College of New York, Team study breaks Forster resonant energy transfer (FRET) distance limit, Physics, Optics and Photonics, Oct. 25, (2018). Index

Ab-initio calculations, 101 Arms race, danger of, 38 Abelian groups, 252 Artificial intelligence, 199, 226 Absolute velocity, 155 Artificial transmutation of elements, 20 Accelerated development, 221 Artistic creativity, 226 Accelerators, 20 As big as a house, 101 Accidental degeneracies, 254 ASCC calculator, 206, 210 Active site, 125 Aslos mission, 38 Active transport, 223 Atom, model of, 16, 26 ADA programming language, 203 Atomic bomb, 260 Adler, Ellen, 23 Atomic numbers, 31 Agriculture, 225 Atomic orbitals, 67, 72 Aiken, Howard, 206, 210 Atomic spectra, 28 Aircraft engines, 223 Atomic spin-orbitals, 301 Albert Einstein, 257 Atomic units, 157, 158, 279, 291 Alchemy, 12, 20 Augusta Ada, 203 Allowed energy bands, 102, 214 Austria, 43 Allowed orbits, 19, 31 Autoassembly, 218 Alpha particles, 13, 20 Automatic warehouses, 224 Alpha rays, 12 Automation, 222, 225 America, 259 Autoradiography, 129 Amino acids, 125 Axons, 226 Analytical engine, 202 B-like isoelectronic series, 293 Analytical machine, 202 Babbage, Charles, 199, 201, 202 Anderson, Carl David, 46 Bacterial cell wall, 125 Angular momentum, 27, 43 Ballhausen, Carl J., 222 Anode, 119 Band structure of crystals, 102, 214 Anti-war manifesto, 260 Banking, 224 Antibonding orbitals, 82 Bardeen, John, 102, 213, 215 Anticommutation, 158 Bare nucleus, 293 Antimatter, 46 Base pairs, 128 Antisymmetric functions, 282 Basis functions, 242, 252, 305 Antisymmetry requirement, 76 Bateson, Gregory, 210, 213 Approximate N-electron Schr¨odingerequa- Be-like isoelectronic series, 293 tion, 287, 290 Beck, Guido, 32 Approximate Schr¨odingerequation, 346 Becquerel, Henri, 11

390 INDEX 391

Bell Telephone Laboratories, 102, 207, 209, Broglie, Louis de, 43, 46 213 Brownian motion, 258, 259 Beltrami, Eugenio, 267 Bush, Vannevar, 207 Bending of a light ray, 261 Bytes, 104, 218 Berkeley, 80 Bernal, J.D., 125 C-like isoelectronic series, 293 Beryllium, 73 Cajal, Ram`ony, 226 Beta rays, 12 Calculating box, 199 Binary digit, 211 Calculations took hundreds of hours, 101 Binary numbers, 104, 207, 218 Cambridge University, 11, 44, 76, 121 Binding energies, 19, 31 Canada, 12 Binomial coefficient, 301 Canonically conjugate momentum, 148 Biosemiotics, 211 Carbon, 73 Biosynthesis of proteins, 129 Carbon-like ground state, 302 Bits, 104, 218 Cartesian coordinates, 263 Bits per unit area, 218 Cathode rays, 119 Black body radiation, 26 Cavendish Laboratory, 11 Bloch, Felex, 32 Central processing unit, 103, 214 Bloch, Felix, 46 Central processing units, 82, 218 Block-diagonal form, 242, 247 Chain of point masses, 149 Bohr contacts Roosevelt and Churchill, 38 Chains of subgroups, 278 Bohr effect, 23 Character tables, 252 Bohr’s atomic theory, 31, 43 Characters, 242, 247, 251 Bohr’s escape to Sweden, 33 Charge acceptors, 82 Bohr, Aage, 23, 33, 38, 39 Charge conservation, 157 Bohr, Christian, 23 Charge density, 155 Bohr, Harold, 23 Charge donors, 82 Bohr, Margrethe, 23 Chemical bonds, 44, 72 Bohr, Niels, 16, 20, 23, 26, 33, 38 Chemical industries, 223 Bohr, Thomas, 23 Chemistry, 19, 222 Bohr, Vilhelm, 23 Cherwell, Lord, 38 Boltzman’s kinetic theory, 259 Chess-playing program, 227 Bombyx mori, 211 Chips, 104, 217 Bonding orbitals, 82 Choice of V0, 300 Born, Max, 44 Christoffel symbol, 272 Boron, 73 Chromatography, 129 Boundary conditions, 166 Church-Turing hypothesis, 207 Box normalization, 166 Churchill, Winston, 38 Bragg angle, 121 Classes, 248 Bragg, William Henry, 19, 121 Classification of basis functions, 252 Bragg, William Lawrence, 19, 121 Classified basis functions, 253 Brain, mechanism of, 226 Clebsch-Gordan coefficients, 160 Brattain, Walter, 102, 213, 215 Clementi, Enrico, 97, 101, 222 392 INDEX

Cockcroft, Sir John, 20 Crystal fields, 311 Collapse of Rutherford’s atom, 27 Crystallography, 222 Collection of harmonic oscillators, 168 Crystals, 17, 102, 213 Collective consciousness, 222 Cubic field, 313 Colossus, 207 Cultural evolution, 199, 221, 226 Commutation, 254 Current density 3-vector, 155 Complementarity, 129 Current density 4-vector, 156, 157 Completeness, 282 Curvature of space, 262 Computer disc storage, 83, 218 Curvature, intrinsic, 271 Computer memories, 104, 218 Curved spaces, 272 Computer networks, 83, 104, 218, 224 Curvilinear coordinate transformation, 266 Computer-assisted design, 223 Cybernetics, 207, 209, 213 Computerization of commerce, 223 Cytoplasm, 129 Computers, 199 Conduction band, 103, 214 D’Alembertian operator, 156 Conduction bands, 102, 214 Danish resistance movement, 33 Conductor, 102, 214 Darwin, Sir Charles Galton, 17 Configuration Interaction, 285 Data banks, 222 Configurations, 332 Debye, Peter, 46 Confluent hypergeometric function, 64, 161, Definition of a finite group, 239 273 Degeneracy, 254, 278 Conjugate eigenvalue problem, 273 Degenerate configurations, 346 Conjugate function, 290 Dendrites, 226 Conjugate transpose, 244 Denmark’s Jewish community escapes, 33 Consciousness, 211 Densities, 335 Container ships, 225 Derivation of the secular equations, 287 Continuous media, 163 Detection of radio waves, 11 Continuous systems, 164 Diagnosis, 223 Contraction of tensors, 264 Diatomic molecules, 322, 333 Contravariant metric tensor, 263, 268 Differential and integral calculus, 201 Contravariant tensors, 263 Diffraction grating, 17 Control and communication, 209 Diffraction of X-rays, 17 Coordinate transformations, 240 Diffractometers, 222 Copenhagen, 16, 31 Digital computers, 76 Coulomb Sturmian basis functions, 277 Dirac alpha matrices, 158 Coulomb Sturmians, 273, 319, 320 Dirac equation, 157, 158 Coulson, Charles, 80 Dirac equation for a hydrogenlike atom, 159 Countess of Lovelace, 203 Dirac gamma matrices, 158 Covariant metric tensor, 263, 267 Dirac Hamiltonian, 158 Covariant tensors, 263 Dirac notation, 254 Creation and annihilation operators, 168 Dirac spinors, 158 Crookes’ tubes, 11 Dirac’s relativistic wave equation, 46 Crookes, Sir William, 119 Dirac, P.A.M, 202 INDEX 393

Dirac, P.A.M., 44, 157 Electronic digital computers, 207 Discharge tube, 119 Electronic valves, 103, 214 DNA, 129 Electrons, 27 Doping, 103, 214, 217 Electrophoresis, 129 Dorothy Crowfoot Hodgkin, 125 Electrostatic forces, 125 Double-stranded DNA, 128 Electrostatic potential, 155 Drum parity, 101 Elements, 12, 19 Due degeneracies, 254, 346 Embedded computer, 224 Dynamical symmetry, 278 Emmeche, Claus, 211 Dynamics of reactions, 223 Energy-independent matrix, 290 England, 13 Eastern Europe, 101 ENIAC digital computer, 207 Effective charge, 286 Entropy, 209 Ehrenfest, Paul, 27 Equilibrium position, 147 Eigenvalues of Lz, 306 Equivalence of inertial frames, 155 Eigenvalues of Sz, 306 Equivalent representations, 242 Eigenvalues of the interelectron repulsion ma- Etched away by acid, 103, 217 trix, 305 Ether, 259 Einstein convention, 263, 267 Ethics, 38 Einstein in Italy, 258 Ethology, 211 Einstein’s family, 258 ETO’s, 319 Einstein’s letter to Freud, 262 Euclidean geometry, 257 Einstein, A., 155 Euclidian space, 262 Einstein, Albert, 16, 26, 27, 43 Euler-Lagrange equations, 266, 272 Ekert, J.P., 207 Excited configurations, 306 Elector of Hanover, 201 Excited states, 305 Electric field vector, 156 Expansion of products, 335 Electric organs, 211 Exponential factor, 321 Electrodynamics of Moving Bodies, 259 Exponential-Type Orbitals, 319 Electromagnetic fields, 164 External electromagnetic potential, 157, 158 Electromagnetic potential 4-vector, 156, 157 Electromagnetic potentials, 165 F-like isoelectronic series, 302 Electromagnetic radiation, 26 Fairchild Semiconductor, 215 Electromagnetic theory, 155 Feedback in biology, 209 Electromagnetic waves, 119 Few-electron atoms, 305 Electromagnetism, 259 Fiber optics, 83, 221 Electromechanical calculators, 207 Field-point, 157 Electron exchange, 282 Fields, 163 Electron microscopy, 129 Film animation, 225 Electron rest mass, 158 Financial book-keeping, 224 Electron spin resonance, 129 Fine structure constant, 158 Electron-hole pair, 46 Finite-dimensional groups, 239 Electronic data, 224 Fisher, R.A., 209 394 INDEX

Flat organic molecules, 82 Genetic material, 128 Floating-point operations, 104, 217 Geodesics, 272 Flops, 82, 104, 217, 218 German nuclear program, 38 Fluorine, 73 Germanium, 103, 214 Fock projection, 277, 278, 326, 337 Germany, 17, 119 Fock’s transformation, 69 Gigaflop 11, 104, 217 Fock’s treatment, 68 Gold medal competition, 23 Fock, V., 97, 222, 277, 278, 326 Goscinski, O., 278, 286 Fock, Vladimir A., 73 Goscinskian configurations, 286, 287, 293, Foerster, Heinz von, 210 306, 311 Forbidden energy bands, 102, 214 Goscinskian configurations, advantages and Force constant of a spring, 150 disadvantages, 299 Four-vectors, 155 Graphics chips, 225 Fourier coefficients, 166 Gravitation, 260 Fourier convolution theorem, 281 Great orthogonality theorem, 244, 245, 247, Fourier transform representation, 335 249, 254 Fourier transformed hydrogenlike orbitals, 71 Grossman, Marcel, 258, 262 Fourier transforms, 277, 278, 280 Group elements, 251 Franck, James, 32 Group representations, 240 Franklin, Rosalind, 129 Group-theoretical projection, 249 Free electromagnetic field, 168 Groves, General L., 39 Frequency distribution, 26 Frequency spectrum, 150 H¨uckel, Erich, 82 Frisch, Otto, 32 Hamiltonian, 290 Hamiltonian density, 165 G¨ottingen,44 Hansen, H.M., 28 Gamma rays, 12 Hardware, 226 Gasaneo, Gustavo, 300 Harmonic normal modes, 166 Gaussian basis functions, 319 Harmonic oscillator Lagrangians, 150, 167 Gaussian expansions, 339 Harmonics, 43 Gegenbauer polynomials, 278, 327 Hartree potential, 300 Geiger, Hans, 13 Hartree, D.R., 222 Geiger-Marsden experiment, 13 Hartree, Douglas, 73 General relativity, 272 Hartree, Douglas R., 97 Generalized angular integral, 62 Hartree, William, 97 Generalized Shibuya-Wulfman matrix, 333 Hartree-Fock-Roothaan equations, 285 Generalized Sturmian basis set, 279 Hartree-Fock-Roothaan Method, 76 Generalized Sturmian Method, 285, 305, 346 Hartrees, 291 Generalized Sturmian secular equations, 287, Hawking, Steven, 202 290 Heisenberg, Werner, 33, 44, 46 Generalized Sturmians, 278–281 Heitler, Walter, 80 Generalized Wigner coefficients, 328 Heitler-London theory, 80 Genetic evolution, 221 Helium, 12, 73 INDEX 395

Hemoglobin, 23, 125 Income policies, 225 Hermann Minkowski, 258 Increasingly paranoid, 214 Hermitian adjoint, 244 Independent electrons, 293 Herschbach, Dudley, 223 Inertial frames, 155 Hevesy, George de, 32, 38 Infinitesimal element of length, 271 Highest filled orbitals, 82 Infinitessimal length, 263 Hilbert space, 252, 253 Informality, enthusiasm and speed, 16 Hilbert, David, 44 Information, 104, 218 Hippel. Arthur von, 32 Information accumulation, 221 Hodgkin, Alan, 226 Information explosion, 221 Hodgkin, Dorothy, 222 Information technology, 199 Hodgkin, Dorothy Crowfoot, 125 Inner product, 280 Hoffmeyer, Jesper, 211 Institute for Theoretical Physics, 31 Hollerith, Herman, 206 Insulator, 102, 214 Homeostasis, 209 Insulin, 125 Homogeneous functions, 58 Integral equation, 281 Homogeneous wave equation, 166 Integrated circuits, 103, 217 Hospitals, 223 Integration by parts, 164 Hundreds of times faster and cheaper, 102 Intel, 218 Huperspherical coordinates, 267 Interactive calculations, 104, 218 Huxley, Andrew, 226 Interelectron repulsion, 288, 293, 300, 346 Hydrogen, 73 Interelectron repulsion integrals, 335, 336 Hydrogen bonds, 128 Interelectron repulsion matrix, 293, 302, 305 Hydrogen molecule, 82 Internet, 222 Hydrogen spectrum, 28 INTERNIST-1, 223 Hydrogenlike orbitals, 273, 321 Intrinsic curvature, 271 Hydrogenlike Schr¨odingerequation, 273, 321 Invariance, 164 Hydrogenlike spin-orbitals, 286, 287, 300 Invariant blocks, 300, 306 Hydrogenlike wave equation, 286 Invariant subsets, 300, 306, 346 Hydrophilic groups, 125 Invariants, 248 Hydrophobic groups, 125 Invention of computers, 199 Hyperangular integration, 328 Invention of transistors, 102 Hypergeometric functions, 161 Hypersphere, 277, 326 Inventory data base, 224 Hyperspherical harmonics, 59, 60, 62, 66, Inverse, 239, 244 278, 326, 327, 335, 336 Inverse transform, 277 Inversions, 300 IBM Corporation, 104, 206, 217 Ionization, 12 IBM machine, 101 Ions, 73 Idempotents, 252 Ions in gases, 11 Identity element, 239, 251 Irreducible representations, 244, 252 Imperial College, 101 Isoenergetic basis sets, 278 Impurities, 103, 217 Isoenergetic configurations, 332, 346 396 INDEX

Jacobi, Carl Gustav, 265 Lewis, G.N., 80 Jacobian, 265 Lie detector test, 215 Jacobians, 267 Ligands, 311 Jacquard’s loom, 203 Linear coordinate transformations, 240 Jacquard, Joseph Marie, 203 Linear field, 313 Jordan, Pascal, 44 Linus Pauling, 82 Literature searches, 223 Kaiser Wilhelm Institute, 260 Lithium, 73 Kendrew, J.C, 125 Lithium-like ground state, 301 Kendrew, J.C., 125 Lithiumlike isoelectronic series, 293 Kinetic energy density, 163 Logic density, 218 Kinetic energy operator, 287 London, Fritz, 80 Kinetic energy term, 279, 280 Lord and Lady Byron, 203 King George I, 201 Lord Moulton, 202 Klein-Gordon equation, 46 Lorentz gauge, 157 Koga, T., 321 Lorentz invariance, 155, 163, 165, 259 Krogh, August, 23 Lorentz transformation, 155, 165 Kronecker delta, 263 Lorentz, H.A., 259 Kull, K., 211 Lorenz, Konrad, 211 Los Alamos, 38 L¨owdin,Per-Olov, 222 Lovelace, Countess of, 203 Ladik, Janos, 101 Lowest empty orbitals, 82 Lagrangian densities, 163 Lysozyme, 125 Lagrangian density, 164 Laplace-Beltrami operator, 266 Machine instructions, 104, 217 Laplacian operator, 156, 157 Macy Conferences, 213 Large and small components, 160 Macy Foundation, 210 Large-Z approximation, 293, 346 Magnetic disc memories, 104, 218 Laser, 224 Magnetic field vector, 156 Laue, Max von, 17 Magnetic ink, 224 Lawrence, E.O., 20 Magnetic moment of electrons, 73 LCAO approximation, 82 Magnetic splitting of levels, 73 LCAO SCF Method, 76 Manchester, 20, 31 Learning by computers, 226 Manchester University, 13 Lee, Yuan, 223 Many-center Coulomb Sturmians, 319 Legendre polynomials, 278, 327 Many-center potentials, 321 Leibniz, Gottfried Wilhelm, 199, 201 Many-particle problems, 278 Leontief, Wassily W., 225 Marcel Grossman, 258, 262 Levi, Hilde, 32 Marsden, Ernest, 13 Levi-Civita tensor, 265 Mass defect, 260 Levi-Civita, Tulio, 265 Mass increases with velocity, 260 Lewin, Kurt, 210 Mass-weighted coordinates, 147 Lewis structures, 80 Mass-weighted potential energy matrix, 148 INDEX 397

Mathematica, 337 Momentum space orthonormality, 280 Matrix mechanics, 44 Momentum-space, 69, 277 Matrix representation of operators, 253 Momentum-space orthonormality, 280, 281 Matter waves, 43 Moore’a law, 218 Mauchley. J.W., 207 Moore’s law, 218, 221 Maurice Wilkins, 129 Moore, Gordon, 218 Maxwell’s equations, 16, 26, 157, 259, 260 Moore, Gordon C., 215 McCulloch, Warren, 210 Moore, Gordon E., 218 McGill University, 12 Morley, E.W., 259 MCIR system, 224 Moseley’s law, 19 Mead, Margaret, 210, 213 Moseley, Harry, 17 Mechanization of agriculture, 225 Moseley, Henry, 31 Medicine, 223 Motion of the nucleus, 287 Meitner, Lise, 32 Mottelson, Ben Roy, 38 Memories, 104, 218 Mullikin, R.S., 222 Mercury computer, 101 Multiplication, 199 Messenger RNA (mRNA), 129 Multiplication and division, 201 Metric tensors, 262 Multiplication of group elements, 239 Michaelson-Morley experiment, 155 Multiplication table, 241, 251 Michelson, A.A., 259 Music synthesizers, 225 Michelson-Morley experiment, 259 Myoglobin, 125 Microelectronics, 103, 213, 217 Microminiaturization, 101 N-like isoelectronic series, 302 Microprocessors, 82, 104, 217, 218 Nature of the Chemical Bond, 80 Microscope, 103, 217, 226 Nazi occupation of Denmark, 33 Military mentality, 258 Ne-like isoelectronic series, 302 Miniaturization, 82, 103, 217, 218 Negative energy states, 46 Minicomputer, 104, 217 Negative feedback, 211 Minimal geodesics, 272 Neon, 73 Minkowski space, 155, 157, 260 Nerve cells, 226 Minkowski, Hermann, 260 Nervous system, 226 MIPS, 104, 217 Networks, 104, 218 Mixed tensors, 263 Neumann, John von, 207, 209 Molecular biology, 129 Neural networks, 226 Molecular calculations, 332 Neurophysiology, 209 Molecular dynamics, 223 New Zealand, 12 Molecular orbital theory, 82 Newman, M.H.A., 207 Molecular orbitals, 319 Newton’s equations of motion, 43, 260 Molecular structure, 19 Newton, Isaac, 201 Molecular Sturmians, 335 Newtonian mechanics, 259, 261 Molecular vibrations and rotations, 150 Niels Bohr Institute, 16, 38 Molecules, 320 Nitrogen, 73 Momentum space, 277, 326, 335 Nobel Prize in Chemistry, 80 398 INDEX

Non-Abelian groups, 252 Pauli spin matrices, 158 Non-Euclidean geometry, 262 Pauli, Wolfgang, 46, 73, 76, 80 Non-relativistic wave equation, 288 Pauli-allowed states, 306 Nonequivalent irreducible representations, 249 Pauling, Linus, 80, 82, 222 Nonhomogeneous polynomials, 62 PC hard-drive capacity, 218 Normal coordinates, 148, 150 Peirce, Charles Sanders, 211 Normal modes, 147, 151, 163, 166 Periodic system, 19 Normalization, 289 Periodic table, 44, 72 Normalizing constant, 278, 327 Personal memories of early computers, 100 Nuclear arms race, 38 Perturbation, 301 Nuclear attraction, 288 Perutz, Max, 125 Nuclear attraction integrals, 322 Phillips, D.C., 125 Nuclear attraction matrix, 290 Photoelectric effect, 27, 258 Nuclear attraction matrix diagonal and energy-Photon annihilation operators, 169 independent, 305 Photon creation operators, 169 Nuclear charge, 13, 20 Photoresist, 103, 217 Nuclear charge, weighted, 286 Photosynthesis, 223 Nuclear charges, 311 Physics and Beyond, 33 Nuclear magnetic resonance, 129 Pitts, Walter, 210 Nucleic acids, 129 Placzek, George, 32 Nucleus, 13 Planck’s constant, 26, 158 Null matrix, 245 Planck’s quantum hypothesis, 26, 259, 260 Planck, Max, 16, 26, 260 O-like isoelectronic series, 302 Planck-Einstein formula, 19, 28, 31 One-center expansions, 311 Plane wave, Sturmian expansion, 282 Optical storage devices, 104, 218 Point charges, 311 Optimal basis set, 305 Poison gas, 260 Orbital angular momentum, 301 Polanyi, J., 223 Orbital hybridization, 80 Polarized light, 211 Order of a group, 239 Polyatomic molecules, 333 Orthonormal sets, 278 Polynucleotides, 129 Orthonormality, 287 Polypeptides and polynucleotides, 101 Orthonormality relations, 167 Population explosion, 221 Oxygen, 73 Position vector, 155 Positron, 46 Pacifism, 262 Potential, 288 Palade, George Emil, 129 Potential energy density, 163 Pale-faced teenagers, 101 Potential-weighted Orthonormality, 287 Parallel-processing, 104, 217 Potential-weighted orthonormality, 275, 279, Parallelization, 82, 218 280, 289, 335 Pascal, Blaise, 199 Pre-calculated integrals, 337 Pauli exclusion principle, 72, 80, 82 Pre-calculated matrices, 336 Pauli principle, 301 Principal quantum number, 287 INDEX 399

Principal quantum numbers, 300 Rayleigh, Lord, 28 Principle of Equivalence, 261 Reactive scattering, 223 Product of characters, 249 Reciprocal space, 326 Products of Coulomb Sturmians, 335 Red shift, 261 Projection operator, 282 Reducibility, 242 Projection operators, 249, 252 Reducible representations, 247 Protein chain, 129 Reflections, 300 Protein structure, 125 Regular representation, 251 Protein synthesis, 129 Relativistic 1-electron energies, 161 Protons, 20 Relativistic effects, 155 Pseudo-Euclidean space-time continuum, 155 Relativistic electrodynamics, 156 Pseudo-rotation, 155 Relativistic hydrogenlike orbitals, 160 Pullman, Alberte, 82, 223 Relativistic wave equation, 157 Pullman, Bernard, 82, 223 Relativity theory, 258 Punched cards, 101, 203 Removal of degeneracy, 301 Pythagoras, 43 Repeated eigenvalues, 245 Replacement orders, 224 Quanta, 26 Representations of groups, 240 Quantization of angular momentum, 27, 43 Resonance, 80 Quantum Biochemistry, 82 Rest energy, 158 Quantum biochemistry, 223 Retailing, 224 Quantum chemical calculations, 101 Ribonucleic acid, 129 Quantum chemistry, 222 Ribosomes, 129 Quantum dot technology, 218 RNA, 129 Quantum dots, 218 RNA and ribosomes, 129 Quantum hypothesis, 26 Robots, 223, 226 Quantum numbers, 19, 31, 287 Rockefeller Institute, 129 Quantum theory, 44, 102, 213, 221 Roentgen, Wilhelm Konrad, 119 Quick Medical Reference, 223 Rolls Royce Ltd., 223 Roosevelt, Franklin, 38 R-blocks, 300, 306 Roothaan, Clemens C.J., 97 Rabinowitch, Eugene, 32 Roothaan, Clemens, C.J., 76, 82 Radial functions, 273, 320 Rosalind Franklin, 129 Radial functions, relativistic, 160 Rosenbluth, Arturo, 209 Radial wave functions, 65 Rotations, 150, 239, 300 Radioactive tracer techniques, 129 Royal Institution, 121 Radioactive tracers, 129 Royal Institution, London, 125 Radioactive transmutation, 12 Royal Society, 202 Radioactivity, 20, 121, 260 Rozental, Stefan, 32 Radium, 20, 260 Russell-Saunders states, 301 Raleigh, Lord, 23 Rutherford’s model of the atom, 13, 16, 26 Rank of a tensor, 263 Rutherford, Lord, 11, 20, 31 Rapidly rotating drum, 101 Rydberg, Johannes, 19, 31 400 INDEX

Scalar density, 266 Slater, J.C., 222 Scalar products, 247 Slater-Condon rules, 287 Scalars, 264 Sleeping on three chairs, 101 Scaling of the coordinate system, 333 Small displacements from equilibrium, 147 Scaling parameter, 280, 287, 305 Sobolev spaces, 281, 282 Scattering, 13 Social communication, 209 SCF calculations, 285 Socially beneficial projects, 226 Scheraga, Herald, 223 Soddy, Frederick, 12 Schneider, Erich Ernst, 32 Sodium, 73 Schr¨odingerequation, 279 Software, 226 Schr¨odingerequation in reciprocal space, 68 Soho, 101 Schr¨odinger’swave equation, 73 Solid angle element, 69 Schr¨odinger,Erwin, 209 Sorbonne, 43 Schr¨odingerequation, 43 Source-point, 157 Schr¨odinger,Erwin, 43, 222 Space exploration, 103, 217 Schur’s lemma, 244, 245, 252 Space-component, 156 Second Industrial Revolution, 225 Space-time continuum, 259, 272 Secular equation, 319 Space-time symmetry, 155–157, 259, 260 Secular equations, 148, 287, 290, 333 Spain, 226 Self-Consistent Field method, 97 Sparse matrices, 336 Self-consistent field method, 76 Special relativity, 258, 259 Self-reinforcing accumulation, 221 Special theory of relativity, 155 Semiconductors, 103, 214 Specific heats, 27 Semiemperical approximations, 82 Spectra of X-rays, 17 Semiotics, 211 Spectral lines, 28 Sensation, 213 Spectrum of energies, 305 Sensors, 223 Speed of computers, 104, 217 Shannon, Claude, 209 Speed of light, 103, 214, 259, 260 Shells, 301 Spherical harmonics, 160, 273, 286, 320, 327 Shibuya-Wulfman integrals, 326, 327 Spherical polar coordinates, 262 Shift operator, 254 Spherical spinor, 160 Shockley, William, 102, 213, 214 Spin, 73, 82 Sigmund Freud, 262 Spin-orbit coupling, 287 Sign systems, 211 Spin-spin coupling, 287 Silicon, 103, 214 Spinors, 4-component, 158 Silicon Valley, 214 Spray-painting, 223 Similarity transformation, 244 Square matrix, 245 Similarity transformations, 241 Square-planar field, 313 Simple harmonic oscillators, 149 Stability of atoms, 16, 26 Single-stranded DNA, 128 Standard irreducible representation, 254 Sixth-order polynomials, 202 Standard set, 278 Slater determinants, 285, 286, 332 Standard tree, 267 Slater exponents, 305 Standard unitary representation, 249 INDEX 401

Standardization, 224, 225 Thin films, 103, 217 Stibitz, George R., 207 Thomson’s model of the atom, 13 Stockmarket, 224 Thomson, J.J., 11, 31 Storage density, 83, 218 Thought experiment, 261 Strong projection operators, 252, 253 Thymine, 129 Structure of DNA, 128 Time-component, 156 Structure of proteins, 125 Time-independent Dirac equation, 158 Sturmian basis sets, 278, 280 Time-independent solution, 156 Sturmian expansion of a plane wave, 282 Tisvilde, 33 Sturmian overlap integrals, 326 Total angular momentum, 160 Sturmian radial function, 335 Total internal reflection, 83, 221 Sturmian secular equations, 280, 287, 290, Total potential, 333 321 Total spin operator, 301 Sturmians, 319 Trace, 247 Subcellular particles, 129 Traitorous eight, 214 Subgroups, 278 Transfer RNA, 129 Subshell sets, 306 Transformation laws, 263 Subspaces, 252 Transistors, 104, 217, 221 Sugar-phosphate backbone, 128, 129 Transistors, invention of, 102 Supermarkets, 224 Translations, 150 Surface tension measurement, 23 Transmutation of elements, 12 Swiss Patent Office, 258 Transportation, 224 Switzerland, 27, 43 Trial Lagrangian density, 166 Symmetry group, 346 Trigonal field, 313 Symmetry of molecules, 239 Triplet states, 302 Symmetry-adapted basis functions, 346 Truman, Harry, 39 Symmetry-adapted basis sets, 253, 306 Tubes sometimes failed, 101 System of N electrons, 282 Turing, Alan, 207 Szent-Gy¨orgyi,Albert, 82 Two-center densities, 339 Szent-Gy¨orgyi,Albert, 223 Szilard, Leo, 209 Uexk¨ull,Jakob von, 211 Ultracentrifugation, 129 Tables of quadratic functions, 202 Ultraviolet light, 211 Taylor series expansion, 147 Ultraviolet spectroscopy, 129 Technological unemployment, 225 Umwelt, 211 Teller. Edward, 32 Unemployment, 226 Templates, 128 Uniform motion, 155 Tensor analysis, 262 Unit matrix, 252, 293 Tensor capacities, 266 Unitarity, 241 Tensor densities, 266 Unitary representations, 247 Tensors, 263 Universal coefficients, 339 Tetrahedral field, 313 Universal computer, 207 Theory of Games, 210 Universal integrals, 337 402 INDEX

Universal product code, 224 Wilkins, Maurice, 129 Universal programmable computer, 203 Wilson, E. Bright, 80, 222 University of Adelaide, 121 Word-processors, 225 University of California, 80 World War I, 19, 31 University of Copenhagen, 23 Writing, 226 Uracil, 129 Wulfman, C.E., 321 Uranium, 11, 260 X-ray crystallography, 19, 121 Vacuum tubes, 207 X-ray diffraction, 129, 222 Vacuum tubes required cooling, 101 X-rays, 11, 119 Valance band, 103, 214 X-rays, spectra of, 17 Valence, 80 Z¨urich Polytechnic Institute, 258 Valence bands, 102, 214 Z¨urich, 43 Valence bond theory, 80 Zero-frequency modes, 151 Valence electrons, 301 Zuse, Konrad, 207 Van de Graff, J.H., 20 Van Vleck, J.H., 222 Variational optimization, 305 Variational principles, 164 Vector potential, 155 Velocity of light, 158 Vibrations, 150 Virial theorem, 287 Vitamin B12, 125 Volume element, 264 Von Laue, Max, 121

Wafers, 103, 217 Waggle dance, 211 Watson-Crick model, 128 Wave equation, 166, 222 Wave equations, 163 Wave lengths, 163 Wave mechanics, 43 Wave theory of light, 17, 119, 259 Weak projection operators, 252 Weighted nuclear charge, 285, 286 Weighted potential, 278, 280, 346 Weighting factors, 300 Weisskopf, Victor, 32 Wholesaling, 224 Why War?, 262 Wiener, Norbert, 207, 209, 213