8083.9789814340489-tp.indd 1 8/29/11 4:59 PM This page intentionally left blank Bernard H Lavenda Universita’ degli Studi di Camerino, Italy
World Scientific
NEW JERSEY • LONDON • SINGAPORE • BEIJING • SHANGHAI • HONG KONG • TAIPEI • CHENNAI
8083.9789814340489-tp.indd 2 8/29/11 4:59 PM Published by World Scientific Publishing Co. Pte. Ltd. 5 Toh Tuck Link, Singapore 596224 USA office: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601 UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE
British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library.
A NEW PERSPECTIVE ON RELATIVITY An Odyssey in Non-Euclidean Geometries Copyright © 2012 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permission from the Publisher.
For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from the publisher.
ISBN-13 978-981-4340-48-9 ISBN-10 981-4340-48-0
Typeset by Stallion Press Email: [email protected]
Printed in Singapore.
YeeSern - A New Perspective on Relativity.pmd1 10/14/2011, 9:08 AM Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
In memory of Franco Fraschetti (1924–2009)
v Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
This page intentionally left blank Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
Preface
Electrodynamics was the next oasis after thermodynamics which saw a confluence of physicists and mathematicians, many of whom had been protagonists in thermodynamics. Just as thermodynamics had an offspring, quantum theory,so too did electrodynamics, namely the theory of relativity. While a single name can be attached to the origins of thermodynamics, Sadi Carnot, and that of its offspring, Max Planck, no such simplicity exists in electrodynamics and relativity. Relativity is as much about physics as it is about the human beings, and their failings, that made it. Every physics student will have heard of Maxwell’s equations but will he have also heard of Weber’s force? The stu- dent may have heard of Weber and Gauss for the units named after them, but not about their championing of Ampère’s law which threatened the supremacy of Newton’s inverse square law. The names of Helmholtz, Clau- sius and Boltzmann may be familiar from thermodynamics and statistical thermodynamics but much less known for their theories of electromag- netism. Every student of mathematics will have heard the names of Gauss and Riemann, but will he also know of their fundamental contributions to electromagnetism? Who were Abraham, Heaviside, Larmor, Liénard, Lorenz, Ritz, Schwarzschild, and Voigt? Why have their names been struck from the annals of electromagnetism? We are familiar with the priority disputes between Kelvin and Clausius in thermodynamics, but not with those in electromagnetism and relativity. A student of physics may have heard the name of Lorentz, because of his law of force and transformation, but not at the same level of Einstein. And Poincaré is known for just about everything else than his principle of relativity. The history of electromagnetism and relativity has been rewritten and in a very unflattering way.
vii Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
viii A New Perspective on Relativity
By the modern historical account of electromagnetism and relativity, there were winners and losers. Maxwell is said to have triumphed over Weber and Gauss, in formulating a field theory of electromagnetism, and over Lorenz and Riemann in the formulation of his displacement current, Einstein’s absolute speed of light prevailed over Ritz’s ballistic theory of emission, Lorentz’s supremacy over Abraham and Bucherer in devising a model of the electron whose expressions for the variation of mass, momen- tum, and energy with velocity were later to be adopted in toto by relativity as a model for all matter, whether charged or not, and Einstein’s seniority in stating the principles of relativity though they were previously enunciated by Poincaré. Why were the experimentalists, Ives and Essen, so vehemently opposed to relativity? Ives viewed his verification of the second-order Doppler shift as a clear demonstration that a moving clock runs slow by the same factor that was predicted by Larmor and Lorentz, and not as a vindication of time dilatation in special relativity. Essen, who built the first cesium clock, queried what happens to the lost ticks when more ticks are transmitted than are received, independent of whether two clocks are approaching or receding from one another? Essen went so far as to query relativity as a “joke or swindle?” Most if not all monographs on relativity do not touch on these ques- tions. Not so with O’Rahilly’s Electromagnetics written in 1938. Not everyone will agree with his dispraise of Maxwell’s displacement current, or his over appraisal of Ritz, but much of what he says could not be truer today: There is far more authoritarianism in science that physicists are aware or at least publicly acknowledge. Anybody with a scientific reputation would today hesitate to criticize Einstein, except by way of outdoing him in cosmological speculations.
Essen expressed similar views Students are told that the theory (relativity) must be accepted although they cannot be expected to understand it...The theory is so rigidly held that young scientists who have any regard for their careers dare not openly express their doubts.
Whether there is any truth in the allegations I will leave to the reader. But what I plan to do is to present relativity from a ‘new’ point of view that treats, known and unknown, relativistic phenomena from different per- spectives. I put ‘new’ in quotation marks because the approach is really not new, but was suggested by Kaluza and Variˇcak over a century ago. What Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
Preface ix
is ‘new,’ I believe, is the wealth of physical phenomena that can be drawn from the non-Euclidean geometrical perspective. This monograph is nei- ther intended an historical account of relativity nor an essay in constructive criticism of it. Arecurring theme is that motion causes deformity and this can, under certain circumstances, catapult us into non-Euclidean spaces. It was also an exciting exercise to see where non-Euclidean geometries could be found, but were not appreciated as such. There are at least two eye-catching rela- tions: The product of two longitudinal Doppler shifts is the square root of the cross-ratio, and whose logarithm is hyperbolic distance, and the Beltrami metric in polar coordinates is the exact expression for the metric for the uniformly rotating disc. Gravitational phenomena rather than being a manifestation of warped space-time can be accounted for by a varying index of refraction in an inhomogeneous medium that modifies Fermat’s principle of least time. The reader will find old and new things alike — but the ‘old’ with a new interpretation. I don’t expect that everything is true to 100 percent, some things will have to be changed, modified or clarified, but, I do believe that this is a very fruitful approach that has led to the questioning of many fundamental aspects of relativity. According to Riemann, physics is the search for a geometric mani- fold upon which physical processes occur. The line element of constant curvature, 1 dx2 , (R) + 1 2 1 4 α x
appearing in his Habilitation Dissertation, when written in polar coordinates is precisely the metric for a uniformly rotating disc with constant negative curvature, α<0. When charge is added, it becomes the Liénard expression for the rate of energy loss due to radiation. The role of the longitudinal Doppler shift means that space and time do not appear separately but only in a ratio, as a homogeneous coor- dinate. It is the difference in longitudinal Doppler shifts that is responsi- ble for the slowing down of clocks in relative motion. Einstein elevated c, the velocity of light in vacuo, to a universal constant. The fact that c is a constant, even to observers in relative motion, is tantamount to making it a unit of measurement — one which is necessary for the existence of a Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
x A New Perspective on Relativity
non-Euclidean geometry. So raising c to a universal constant, as Essen has pointed out, meant that the definition of unit length or time, or both, had to be abandoned. Relativity will thus unfold in a hyperbolic space of velocities that is entirely consonant with the relativistic addition of velocities. Following the historical route spreads the honors of discovery of rel- ativity more evenly. Poincaré had arrived at the postulates of relativity at least five years before Einstein, but “because he did not fully appreciate the status of both postulates” is no argument to deny him credit. To deny Poincaré his primary role in developing the theory of relativ- ity because he held onto the aether concept is to deny Carnot the credit for discovering his principle because he still believed in caloric theory.It would never have passed my mind to say that Boltzmann’s principle is incom- plete because it deals with only part of a probability distribution, being a very large number instead of a proper fraction, whereas I have shown that the entropy is the potential of law of error for which the most probable value is the average value of the measurements, that I have detracted any credit from Boltzmann. And which average is considered most probable will determine the form of the entropy. What is incomprehensible was Poincaré’s need to ‘adjust’ the laws of physics so as to preserve Euclidean geometry, and Einstein’s later con- currence with him. Was Euclidean geometry superior to non-Euclidean geometries to which Poincaré made so many outstanding contributions? Why couldn’t Poincaré connect with his fractional linear transformations which preserve certain geometric properties and define a new concept of length in hyperbolic geometry with Lorentz transformations which he did so much work on? Historians of science make much ado over the tortuous path that Einstein followed to arrive at his field equations of general relativity — taking for granted that they are the final solution to the gravitational problem. Little progress has been made since Einstein wrote down his equations almost a century ago, and what the general theory proposes has still to be collaborated by observation. Singularities, black holes, and gravitational waves have, as yet, to be confirmed. Why time warps and what constitutes emptiness is left to be explained. How can a gravitational field exist in the absence of matter and all other physical fields? Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
Preface xi
Young Maxwell gave a very interesting example of an optical instru- ment, for which the optical length of any curve in the object space is equal to that of its image, by expressing Fermat’s principle for the extremal path of a ray in terms of a varying index of refraction and a flat metric. The varying index of refraction had the exact same form as the coefficient in (R) for positive curvature (α>0). This gave me the idea that an optico- gravitational approach might prove useful in which a non-constant index of refraction would mimic a varying gravitational field while the flat metric would include the centrifugal potential. That gravitational and centrifugal forces appeared in different parts led me to question the equivalence prin- ciple whereby a gravitational field can be annulled by acceleration. The distortions that we observe due to motion is the result of our Euclidean rulers and clocks. Inhabitants of hyperbolic space would see no changes in the measuring devices since they change along with them. All gedanken experiments using local observers would lead to null results. Paradoxes exist because the phenomena which give rise to them are not understood. Everyone would agree that emission theories are dead, but to say that the velocity on the outward journey is c + v, and the velocity on the return journey is c − v, where v is the velocity of the aether in the Michelson–Morley experiment is truly contradictory. To explain the null result a contraction hypothesis in the direction of the motion was assumed, yet the only contraction that arises from the Doppler shift is a second-order one in the direction normal to the motion. The journey has been a long one for me. Along the way I have gotten to know a lot of people through their writings. I can feel the eccentricity and biting sarcasm of Oliver Heaviside, who without a formal education, took on the establishment with his unwavering faith in Maxwell; the youthful enthusiasm of Walther Ritz for his science, the credit that was denied him, and the tragedy of his short and painful life; the nonchalance by which Poincaré added hypotheses to theories, his wavering afterthoughts about them, and his humility that led him to uphold Euclidean geometry after all the work he did in bringing hyperbolic geometry into the mainstream of mathematics — but not physics; the quarrelsome and critical Abraham, who was denied the credit he justly deserved, and whose death was also tragic; the mild mannered, cautious and pragmatic approach of Lorentz, and, finally, the enigmatic figure of Einstein, who, more often than not, Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
xii A New Perspective on Relativity
contradicted his own principles. It has also shown me other sides to people whom I thought I knew.The openness to explore all avenues, no matter how distasteful, that Planck exercised in his approach to blackbody radiation is now contrasted to his opinionated view that non-Euclidean geometries was ‘child’s play,’ in comparison to the demands that relativity make on the mind. But is it?
Trevignano Romano Bernard H. Lavenda March 2011 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
Contents
Preface vii
List of Figures xxi
1. Introduction 1 1.1 Einstein’s Impact on Twentieth Century Physics .... 1 1.1.1 The author(s) of relativity ...... 2 1.1.2 Models of the electron ...... 26 1.1.3 Appropriation of Lorentz’s theory of the electron by relativity ...... 27 1.2 Physicists versus Mathematicians ...... 30 1.2.1 Gauss’s lost discoveries ...... 31 1.2.2 Poincaré’s missed opportunities ...... 35 1.3 Exclusion of Non-Euclidean Geometries from Relativity ...... 41 References ...... 47
2. Which Geometry? 51 2.1 Physics or Geometry ...... 51 2.1.1 The heated plane ...... 51 2.2 Geometry of Complex Numbers ...... 57 2.2.1 Properties of complex numbers ...... 57 2.2.2 Inversion ...... 58 2.2.3 Maxwell’s ‘fish-eye’: An example of inversion from elliptic geometry ...... 61
xiii Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
xiv A New Perspective on Relativity
2.2.4 The cross-ratio ...... 67 2.2.5 The Möbius transform ...... 72 2.3 Geodesics ...... 76 2.4 Models of the Hyperbolic Plane and Their Properties ...... 80 2.5 A Brief History of Hyperbolic Geometry ...... 88 References ...... 107
3. A Brief History of Light, Electromagnetism and Gravity 109 3.1 The Drag Coefficient: A Clash Between Absolute and Relative Velocities ...... 109 3.2 Michelson–Morley Null Result: Is Contraction Real? ...... 112 3.3 Radar Signaling versus Continuous Frequencies . . . 117 3.4 Ives–Stilwell Non-Null Result: Variation of Clock Rate with Motion ...... 118 3.5 The Legacy of Nineteenth Century English Physics . . 122 3.5.1 Pressure of radiation ...... 122 3.5.2 Poynting’s derivation of E = mc2 ...... 123 3.5.3 Larmor’s attempt at the velocity composition law via Fresnel’s drag ...... 124 3.6 Gone with the Aether ...... 127 3.6.1 Elastic solid versus Maxwell’s equations .... 127 3.6.2 The index of refraction ...... 133 3.7 Motion Causes Bodily Distortion ...... 137 3.7.1 Optical effect: Double diffraction experiments ...... 137 3.7.2 Trouton–Noble null mechanical effect ...... 138 3.7.3 Anisotropy of mass ...... 140 3.7.4 e/m measurements of the transverse mass . . . 149 3.8 Modeling Gravitation ...... 156 3.8.1 Maxwellian gravitation ...... 156 3.8.2 Ritzian gravitation ...... 163 References ...... 174 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
Contents xv
4. Electromagnetic Radiation 177 4.1 Spooky Actions-at-a-Distance versus Wiggly Continuous Fields ...... 177 4.1.1 Irreversibility from a reversible theory ..... 181 4.1.2 From fields to particles ...... 184 4.1.3 Absolute versus relative motion ...... 186 4.1.4 Faster than the speed of light ...... 189 4.2 Relativistic Mass ...... 192 4.2.1 Gedanken experiments ...... 194 4.2.2 From Weber to Einstein ...... 197 4.2.3 Maxwell on Gauss and Weber ...... 200 4.2.4 Ritz’s electrodynamic theory of emission .... 208 4.3 Radiation by an Accelerating Electron ...... 212 4.3.1 What does the radiation reaction force measure? ...... 212 4.3.2 Constant rate of energy loss in hyperbolic velocity space ...... 217 4.3.3 Radiation at uniform acceleration ...... 220 4.3.4 Curvatures: Turning and twisting ...... 225 4.3.5 Advanced potentials as perpetual motion machines ...... 229 References ...... 232
5. The Origins of Mass 235 5.1 Introduction ...... 235 5.2 From Motional to Static Deformation ...... 236 5.2.1 Potential theory ...... 237 5.3 Gravitational Mass ...... 243 5.3.1 Attraction of a rod: Increase in mass with broadside motion ...... 243 5.3.2 Attraction of a spheroid on a point in its axis of revolution: Forces of attraction as minimal curves of convex bodies ...... 245 5.4 Electromagnetic Mass ...... 249 5.4.1 What does the ratio e/m measure? ...... 255 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
xvi A New Perspective on Relativity
5.4.2 Models of the electron ...... 262 5.4.3 Thomson’s relation between charges in motion and their mass ...... 263 5.4.4 Oblate versus prolate spheroids ...... 265 5.5 Minimal Curves for Convex Bodies in Elliptic and Hyperbolic Spaces ...... 275 5.6 The Tractrix ...... 280 5.7 Rigid Motions: Hyperbolic Lorentz Transforms and Elliptic Rotations ...... 283 5.8 The Elliptic Geometry of an Oblate Spheroid ...... 287 5.9 Matter and Energy ...... 289 References ...... 298
6. Thermodynamics of Relativity 301 6.1 Does the Inertia of a Body Depend on its Heat Content? ...... 301 6.2 Poincaré Stress and the Missing Mass ...... 303 6.3 Lorentz Transforms from the Velocity Composition Law ...... 308 6.4 Density Transformations and the Field Picture ..... 315 6.5 Relativistic Virial ...... 323 6.6 Which Pressure? ...... 325 6.7 Thermodynamics from Bessel Functions ...... 327 6.7.1 Boltzmann’s law via modified Bessel functions ...... 328 6.7.2 Asymptotic probability densities ...... 334 References ...... 338
7. General Relativity in a Non-Euclidean Geometrical Setting 341 7.1 Centrifugal versus Gravitational Forces ...... 341 7.2 Gravitational Effects on the Propagation of Light . . . 344 7.2.1 From Doppler to gravitational shifts ...... 344 7.2.2 Shapiro effect via Fermat’s principle ...... 346 7.3 Optico-gravitational Phenomena ...... 348 7.4 The Models ...... 361 7.5 General Relativity versus Non-Euclidean Metrics . . . 367 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
Contents xvii
7.6 The Mechanics of Diffraction ...... 375 7.6.1 Gravitational shift of spectral lines ...... 378 7.6.2 The deflection of light ...... 379 7.6.3 Advance of the perihelion ...... 381 References ...... 383
8. Relativity of Hyperbolic Space 385 8.1 Hyperbolic Geometry and the Birth of Relativity . . . 385 8.2 Doppler Generation of Möbius Transformations .... 388 8.3 Geometry of Doppler and Aberration Phenomena ...... 393 8.4 Kinematics: The Radar Method of Signaling ...... 398 8.4.1 Constant relative velocity: Geometric-arithmetic mean inequality ...... 398 8.4.2 Constant relative acceleration ...... 401 8.5 Comparison with General Relativity ...... 407 8.6 Hyperbolic Geometry of Relativity ...... 410 8.7 Coordinates in the Hyperbolic Plane ...... 415 8.8 Limiting Case of a Lambert Quadrilateral: Uniform Acceleration ...... 419 8.9 Additivity of the Recession and Distance in Hubble’s Law ...... 421 References ...... 423
9. Nonequivalence of Gravitation and Acceleration 425 9.1 The Uniformly Rotating Disc in Einstein’s Development of General Relativity ...... 425 9.2 The Sagnac Effect ...... 434 9.3 Generalizations of the Sagnac Effect ...... 439 9.4 The Principle of Equivalence ...... 443 9.5 Fermat’s Principle of Least Time and Hyperbolic Geometry ...... 449 9.6 The Rotating Disc ...... 453 9.7 The FitzGerald–Lorentz Contraction via the Triangle Defect ...... 464 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
xviii A New Perspective on Relativity
9.8 Hyperbolic Nature of the Electromagnetic Field and the Poincaré Stress ...... 468 9.9 The Terrell–Weinstein Effect and the Angle of Parallelism ...... 470 9.10 Hyperbolic Geometries with Non-Constant Curvature 473 9.10.1 The heated disc revisited ...... 473 9.10.2 A matter of curvature ...... 476 9.10.3 Schwarzschild’s metric: How a nobody became a one-body ...... 478 9.10.4 Schwarzschild’s metric: The inside story .... 482 9.11 Cosmological Models ...... 484 9.11.1 The general projective metric in the plane . . . 484 9.11.2 The expanding Minkowski universe ...... 490 9.11.3 Event horizons ...... 492 9.11.4 Newtonian dynamics discovers the ‘big bang’ ...... 496 References ...... 498
10. Aberration and Radiation Pressure in the Klein and Poincaré Models 501 10.1 Angular Defect and its Relation to Aberration and Thomas Precession ...... 501 10.2 From the Klein to the Poincaré Model ...... 509 10.3 Aberration versus Radiation Pressure on a Moving Mirror ...... 512 10.3.1 Aberration and the angle of parallelism ..... 512 10.3.2 Reflection from a moving mirror ...... 514 10.4 Electromagnetic Radiation Pressure ...... 515 10.5 Angle of Parallelism and the Vanishing of the Radiation Pressure ...... 522 10.6 Transverse Doppler Shifts as Experimental Evidence for the Angle of Parallelism ...... 525 References ...... 526
11. The Inertia of Polarization 529 11.1 Polarization and Relativity ...... 529 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
Contents xix
11.1.1 A history of polarization and some of its physical consequences ...... 529 11.1.2 Spin ...... 540 11.1.3 Angular momentum ...... 543 11.1.4 Elastic strain ...... 545 11.1.5 Plane waves ...... 550 11.1.6 Spherical waves ...... 553 11.1.7 β-decay and parity violation ...... 554 11.2 Stokes Parameters and Their Physical Interpretations ...... 560 11.3 Poincaré’s Representation and Spherical Geometry . . 568 11.3.1 Isospin and the electroweak interaction ..... 572 11.4 Polarization of Mass ...... 577 11.4.1 Mass and momentum ...... 577 11.4.2 Relativistic space-time paths: An example of mass polarization ...... 585 11.5 Mass in Maxwell’s Theory and Beyond ...... 590 11.5.1 A model of radiation ...... 590 11.5.2 Enter mass: Proca’s equations ...... 600 11.5.3 Proca’s approach to superconductivity ..... 607 11.5.4 Phase and mass ...... 617 11.5.5 Compressional electromagnetic waves: Helmholtz’s theory ...... 620 11.5.6 Directed electromagnetic waves ...... 627 11.6 Relativistic Stokes Parameters ...... 631 11.6.1 Weyl and Dirac versus Stokes ...... 631 11.6.2 Origin of the zero helicity state ...... 640 11.6.3 Lamb shift and left-hand elliptical polarization ...... 648 References ...... 654
Index 657 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
This page intentionally left blank Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
List of Figures
1.1 A tiling of the hyperbolic plane by curvilinear triangles that form right-angled pentagons...... 37 2.1 A bug’s life in the heated disk; ‘hot’ in the center and ‘cold’ on the disc...... 52 2.2 Construction of the point of inversion P...... 59 2.3 Circle of inversion for constructing the inverse P with respect to P ...... 60 2.4 Maxwell’s “fish-eye.” ...... 63 2.5 The magnification of the inner product as it is projected stereographically onto the Euclidean plane...... 64 2.6 In the case of inversion both the point and its image are on the same ray emanating from the center of the disc H...... 66 2.7 It appears that rulers get longer as they are moved further from the origin. However, the elliptic distance from x to y is exactly the same as that from X to Y...... 67 2.8 A tiling of the plane...... 69 2.9 Calculation of cross-ratio and perspectivity...... 70 2.10 The four points u, a, c, v and a, d, x , y from point p have the same angles, hence, have the same cross-ratio. This also is true for c, b, w, z and d, b, x , y ...... 72 2.11 Derivation of Snell’s law...... 77 2.12 Angle of parallelism...... 78 2.13 The number of lines passing through P that are hyperparallel to the line g are infinite. The lines h1 and h2 are limiting parallel to g, while the others are hyperparallel to g...... 79
xxi Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
xxii A New Perspective on Relativity
2.14 Surfaces of negative constant curvature that are mapped onto part of the hyperbolic plane. The middle figure is the mapping of a pseudosphere that produces horocycles as dashed curves...... 83 2.15 The ratio of concentric limiting arcs depends only on the distance between them...... 84 2.16 Using Euclidean geometry to derive the angle of parallelism by considering concentric limiting arcs...... 85 2.17 A right triangle in hyperbolic space: As P increases without limit the angle tends to the angle of parallelism which is a function only of d...... 88 2.18 The parallax of a star...... 89 2.19 Tractrix and pseudosphere as its surface of revolution. .... 90 2.20 Minkowski’s vision of space-time...... 94 2.21 Projection of the hyperboloid onto the plane...... 97 2.22 Geodesics determined by planes cutting the hyperboloid and passing through the center...... 99 2.23 Cayley’s calculation of distance in the projective disc model. 100 2.24 The Poincaré disc model as a stereographic projection from the south pole S of the bottom sheet...... 101 2.25 Beltrami’s double mapping of Klein and his hyperbolic disc model onto the Poincaré disc model...... 102 2.26 The combined vertical orthogonal projection upwards and the stereographic projection downwards...... 102 2.27 Geodesics consist of arcs that cut the disc, , orthogonally. . 104 3.1 Fizeau’s aether-drag apparatus with mirrors placed on corners to reflect light...... 111 3.2 Monochromatic, yellow light is split by a mirror into two beams. 113 3.3 Second-order wavelength shifts plotted as a function of first-order shifts...... 121 3.4 Trouton–Noble experiment to search for effect of Earth moving through aether...... 139 3.5 Planes formed from a moving trihedron...... 146 3.6 Thomson’s apparatus for determining the ratio e/m for cathode rays...... 150 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
List of Figures xxiii
3.7 The points on the parabola refer to electrons deflected by parallel and anti-parallel (left side) fields...... 152 3.8 Elliptical orbit of Mercury showing the excess rotation of the major axis...... 165 4.1 The configuration for calculating the retarded scalar potential. 182 4.2 Orientation of two circuit elements ds and ds ...... 201 4.3 Frenet frame field for a trajectory of the motion...... 225 5.1 Stellar aberration: (a) A telescope at rest, and (b) a telescope aimed at the same star but in relative motion...... 236 5.2 The potential of a homogeneous rod...... 238 5.3 A rod AB has length 2 with O as its center. The attracted point P with an element of mass dm at a distance r from it. r1 and r2 are the lines joining P to the ends of the rod at A and B. ... 239 5.4 Family of ellipses and orthogonal confocal hyperbolas. . . . 243 5.5 Attraction of a circular disc on its axis...... 245 5.6 A figure of revolution...... 246 5.7 The ratio of charge to mass as a function of the relativity veloc- ity. The sloping curve is the ratio determined by Abraham while the horizontal curve results from Lorentz’s formula. . 257 5.8 The orientation of the fields in Bucherer’s experiment. .... 258 5.9 (a) Oblate ellipsoid with a = b > c; (b) prolate ellipsoid with a = b < c...... 268 5.10 The caustic circle of radius c separates the bright (periodic) region a > c from the shadow (exponential) region, a < c. . . 274 5.11 The perimeter L consists of the two half-lines that are tangent to the circle and the arc length between them...... 277 5.12 A circle inscribed in an n-gon...... 278 5.13 A regular n-gon inscribed in a circle...... 280 5.14 Newton’s tractrix...... 282 7.1 The set-up for the Shapiro effect...... 346 7.2 Rays tangent to a circular caustic of radius l...... 364 7.3 Sector inscribed in a triangle...... 365 7.4 Newton’s tractrix again...... 367 7.5 The stereographic projection of a point on the sphere P onto the plane at point Q...... 368 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
xxiv A New Perspective on Relativity
7.6 Comparison of the Newtonian potential (a) with that of the Schwarzschild potential (b)...... 372 7.7 Geodesic curves that cut the rim of the hyperbolic plane orthogonally are arcs of a circle whose center O lies outside the disc...... 375 8.1 Circles of inversion...... 391 8.2 A more detailed description of the circle containing the fixed points v1 and λ which are uniform states of motion at relative velocities u and 2u/(1 + u2). The Möbius automorphism of the disc may be considered as a composition of two hyperbolic rotations: A rotation of π about the hyperbolic midpoint between the origin and λ, and a rotation about the origin. The maximum angle φ is determined by the angle of parallelism, , beyond which no motion can occur...... 392 8.3 Extension of hyperbolic trigonometry to general triangles. . 394 8.4 Hyperbolic velocity triangle...... 395 8.5 A Lambert quadrilateral in velocity space consisting of three right-angles and one acute angle...... 416 8.6 A Lambert quadrilateral comprised of complementary segments where the ‘fourth vertex’ is an ideal point...... 420 9.1 The Sagnac Interferometer as originally depicted in his 1913 article...... 436 9.2 Disc cut out of hemisphere at an angle ϑ...... 440 9.3 Gamow’s [62] depiction of Einstein’s gedanken experiment showing the equivalence between acceleration and gravity...... 443 9.4 The angle of parallelism between two bounding parallels connected by the geodesic curve γ...... 451 9.5 Geometric characterization of the metric density...... 455 9.6 Geometric set-up for stellar aberration...... 459 9.7 Fokker’s [65] visualization of fitting errors when objects are placed on curved surfaces. The left and right sides correspond to negative and positive curvature, respectively...... 465 9.8 Hyperbolic right triangle inscribed in a unit disc...... 466 9.9 Interpretation of the variables of the two metrics which are the radii of the elliptic plane...... 486 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
List of Figures xxv
9.10 The three possible scenarios of closed, flat and open universes. The freckles are the galaxies which are more or less evenly distributed...... 492 9.11 The fates of the universe...... 498 10.1 A segment H of a horocycle with center at infinity with angles of parallelism ...... 505 10.2 Angle of parallelism with transversal perpendicular to one of the parallel lines...... 506 10.3 Poincaré’s projections of the Beltrami model vertically into the southern hemisphere and stereographically back onto the equator...... 510 10.4 Klein model where vertical sections of the hemisphere are projected into straight lines. Geodesics retain their straightness at the cost of not being conformal...... 511 10.5 Radiation falling obliquely on a mirror of length AB...... 517 10.6 The Poincaré half-plane model of measuring distances. . . . 524 11.1 Spherical right triangle for scheme (II)...... 538 11.2 Hyperbolic right triangle related to the scheme (III)...... 539 11.3 Weak β-decay of the neutron. In Fermi’s theory this occurs at a single point where the emission of an electron-antineutrino pair is analogous to electromagnetic photon emission. .... 555 11.4 The decay of polarized cobalt...... 556 11.5 The decay plane of cobalt 60...... 557 11.6 The spherical coordinates used to describe the orientation of spin...... 559 11.7 The Poincaré sphere is the parametrization of the Stokes parameters in elliptic geometry...... 563 11.8 The polarization ellipse swept out by the electric field vector which is enclosed by a rectangle of sides 2a and 2b. The transformation to new electric vector components Ex and Ey consists in a counter-clockwise rotation about the angle ψ. . 564 11.9 Complex plane representation of polarized states...... 566 11.10 Stereographic projection of the complex plane onto the Poincaré sphere...... 568 11.11 The scattering of a neutrino and antineutrino emits a Z0 boson which decays into W bosons...... 575 Aug. 26, 2011 11:17 SPI-B1197 A New Perspective on Relativity b1197-fm
xxvi A New Perspective on Relativity
11.12 V and S interactions rotate toward one another as the electron velocity decreases...... 580 11.13 A short vertical antenna...... 591 11.14 The configuration of electric and magnetic fields on the surface of a sphere. P is Poynting’s vector showing the direction of radiation. In any small portion, a spherical wave cannot be distinguished from a plane wave...... 595 11.15 The polar plots of the spherical harmonics. Maxwell’s equations prohibit the middle radiation pattern...... 597 11.16 The diagrams of the original and deformed paths of integration with the pole at r =∞as if it were at a finite distance...... 640 11.17 A right-spherical triangle...... 644 11.18 A right-spherical triangle traced out by an orbiting electron. 646 11.19 Zeeman splitting: light path parallel (perpendicular) to field results in a doublet (triplet)...... 648 11.20 The conventional explanation of the Lamb shift as the shielding of the electron’s charge by virtual electron-positron pairs that are produced by the vacuum when acted upon by an electric field...... 651 11.21 Splitting of energy levels of a hydrogen-like atom (not drawn to scale). All shifts are left-hand elliptical polarizations. . . . 653 Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Chapter 1 Introduction
Planck made two great discoveries in his lifetime: the energy quantum and Einstein [Miller 81]
1.1 Einstein’s Impact on Twentieth Century Physics
When one mentions the word ‘relativity’ the name Albert Einstein springs to mind. So it is quite natural to ask what was Einstein’s contribution to the theory of relativity, in particular, and to twentieth century physics, in general. Biographers and historians of science run great lengths to rewrite history. Undoubtedly,Abraham Pais’s [82] book, Subtle is the Lord, is the defini- tive biography of Einstein; it attempts to go beneath the surface and gives mathematical details of his achievements. A case of mention, which will serve only for illustration, is the photoelectric effect. Pais tells us that Einstein proposed Emax = hν − P, where ν is the fre- quency of the incident (monochromatic) radiation and P is the work func- tion — the energy needed for an electron to escape the surface. He pointed out that [this equation] explains Lenard’s observation of the light intensity independence of the electron energy. Pais, then goes on to say that first
E [sic Emax] should vary linearly with ν. Second, the slope of the (E, ν) plot is a universal constant, independent of the nature of the irradiated material. Third, the value of the slope was predicted to be Planck’s constant determined from the radiation law. None of this was known then.
This gives the impression that Einstein singlehandedly discovered the photoelectric law. This is certainly inaccurate. Just listen to what J. J. Thomson [28] had to say on the subject: It was at first uncertain whether the energy or the velocity was a linear function of the frequency...Hughes, and Richardson and Compton were however able to
1 Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
2 A New Perspective on Relativity
show that the former law was correct... The relation between maximum energy 1 2 = ν − and the frequency can be written in the form 2 mv k V0e, where V0 is a potential characteristic of the substance. Einstein suggested that k was equal to h, Planck’s constant. [italics added]
Pais asks “What about the variation of the photoelectron energy with light frequency? One increases with the other; nothing more was known in 1905.” So it is not true that “At the time Einstein proposed his heuristic principle, no one knew how E depended on ν beyond the fact that one increases with the other.” ...And this was the reason for Einstein’s Nobel Prize.
1.1.1 The author(s) of relativity
Referring to the second edition of Edmund Whittaker’s book, History of the Theory of Relativity, Pais writes Forty years latter, a revised edition of this book came out. At that time Whittaker also published a second volume dealing with the period from 1910 to 1926. His treatment of the special theory of relativity in the latter volume shows how well the author’s lack of physical insight matches his ignorance of the literature. I would have refrained from commenting on his treatment of special relativity were it not for the fact that his book has raised questions in many minds about the priorities in the discovery of this theory. Whittaker’s opinion on this point is best conveyed by the title of his chapter on this subject: ‘The Relativity Theory of Poincaré and Lorentz.’
Whittaker ignited the priority debate by saying In the autumn of the same year, in the same volume of the Annalen der Physik as his paper on Brownian motion, Einstein published a paper which set forth the relativity theory of Poincaré and Lorentz with some amplifications, and which attracted much attention. He asserted as a fundamental principle the constancy of the speed of light, i.e. that the velocity of light in vacuo is the same for all systems of reference which are moving relatively to each other: the assertion which at the time was widely accepted, but has been severely criticized by later writers. In this paper Einstein gave the modifications which must now be introduced into the formulae for aberration and the Doppler effect.
Except for the ‘severe criticism,’ which we shall address in Sec. 4.2.1, Whit- taker’s appraisal is balanced. Pais’s criticism that “as late as 1909 Poincaré did not know that the contraction of rods is a consequence of the two Einstein postulates,” and that “Poincaré therefore did not understand one of the most basic traits of special relativity” is an attempt to discredit Poincaré in favor of Einstein. In fact, there have been conscientious attempts at demonstrating Poincaré’s ignorance of special relativity. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 3
The stalwarts of Einstein, Gerald Holton [88] and Arthur Miller [81] have been joined by John Norton [04] and Michel Janssen [02]. There has been a growing support of Poincaré, by the French, Jules Leveugle [94], Christian Marchal, and Anatoly Logunov [01], a member of the Russian Academy of Sciences. It is, however, of general consensus that Poincaré arrived at the two postulates first — by at least ten years — but that “he did not fully appreciate the status of both postulates” [Goldberg 67].Appre- ciation is fully in the mind of the beholder. There is a similar debate about who ‘discovered’ general relativity, was it Einstein or David Hilbert? These debates make sense if the theories are correct, unique and compelling — and most of all the results they bear. In this book we will argue that they are not unique. It is also very dangerous when historians of science enter the fray, for they have no means of judging the correctness of the theories. However, since it makes interesting reading we will indulge and present the pros and cons of each camp. Why then all the appeal for Einstein’s special theory of relativity? Probably because the two predictions of the theory were found to have practical applications to everyday life. The slowing down of clocks as a result of motion should also apply to all other physical, chemical and bio- logical phenomena. The apparently inescapable conclusions that a twin who goes on a space trip at a speed near that of light returns to earth to find his twin has aged more than he has, and the decrease in frequency of an atomic oscillator on a moving body with the increase in mass on the moving body which is converted into radiation, all have resulted in paradoxes. All this means that the physics of the problems have as yet to be under- stood. Just listen to the words of the eminent physicist Victor Weisskopf[60]: We all believe that, according to special relativity, an object in motion appears to be contracted in the direction of motion by a factor [1 − (v/c)2]1/2. A passenger in a fast space ship, looking out the window, so it seemed to us, would see spherical objects contracted into ellipsoids.
Commenting on James Terrell’s paper on the “Invisibility of the Lorentz contraction” in 1960, Weisskopf concludes: ... is most remarkable that these simple and important facts of the relativistic appearance of objects have not been noticed for 55 years.
It is well to recognize that what appears as to be a firmly established phenomenon keeps popping up in different guises. It is the same type Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
4 A New Perspective on Relativity
of remarks that the space contraction is a ‘psychological’ state of mind, and not a ‘real’ physical effect, that prompted Einstein to reply: The question of whether the Lorentz contraction is real or not is misleading. It is not ‘real’ insofar as it does not exist for an observer moving with the object.
Here, Einstein definitely committed himself to the ‘reality’ of the Lorentz contraction.
1.1.1.1 Einstein’s retraction of these two postulates and the existence of the aether The cornerstones of relativity are the equivalence of all inertial frames, and the speed of light is a constant in all directions in vacuo. These pos- tulates were also those of Poincaré who uttered them at least seven years prior to Einstein. So what makes Einstein’s postulates superior to those of Poincaré? Stanley Goldberg [67] andArthur Miller [73] tell us that Poincaré’s [04] statements the laws of physical phenomena must be the same for a stationary observer as for an observer carried along in a uniform motion of translation; so that we have not and cannot have any means of discerning whether or not we are carried along in such a motion,
and no velocity can surpass that of light,
were elevated to “a priori postulates” [Goldberg 67] which “stood at the head of his theory.” These postulates also carry the name of Einstein. Why then would Einstein ever think of retracting them? If time dilatation and space contraction due to motion are actual pro- cesses then there is no symmetry between observers in different inertial frames. The first postulate of relativity is therefore violated [Essen 71]. Einstein used gedanken experiments which is an oxymoron. Consider what Einstein [16] has to say about a pair of local observers on a rotating disc: By a familiar result of the special theory of relativity the clock at the circumference — judged by K — goes more slowly than the other because the former is in motion and the latter is at rest. An observer at the common origin of coordinates capable of observing the clock at the circumference by means of light would therefore see it lagging behind the clock beside him. As he will not make up his mind to let the velocity of light along the path in question depend explicitly on the time, he will Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 5
interpret his observations as showing that the clock at the circumference ‘really’ goes more slowly than the clock at the origin.
First the uniformly rotating disc is not an inertial system so the special theory does not apply. Second, local observers cannot discern any changes to their clocks or rulers as to where they are on the disc because they shrink or expand with them. It is only to us Euclideans that these variations are perceptible. If the velocity of light is independent of the velocity of its source, how then can the outward journey of a light signal to an observer moving at velocity v be c + v, on its return it travels with a velocity c − v? Although this violates the second postulate, such assertions appear in the expression for the elapsed time of sending out a light signal from one point to another and back again in the Michelson–Morley experiment whose null result they hope to explain. They also appear alongside Einstein’s relativistic velocity composition law in his famous 1905 paper “On the Electrodynamics of Moving Bodies.” Also in that paper is his ‘definition’ of the velocity of light as the ratio of “light path” to the “time interval.” But we are not allowed to measure the path of the light ray and determine the time it took, for c has been elevated to a universal constant! “How can two units of measurement be made constant by definition?” Essen queries. In his first attempt to explain the bending of rays in a gravitational field, Einstein [11] claims For measuring time at a place which, relative to the origin of the coordinates, has a gravitation potential , we must employ a clock which — when removed to the origin of coordinates — goes (1 + /c2) times more slowly than the clock used for measuring time at the origin of coordinates. If we call the velocity of light at the origin of coordinates c0, then the velocity of light c at a place with the gravitational potential will be given by the relation = + c c0 1 . c2
The principle of the constancy of the velocity of light holds good according to this theory in a different form from that which usually underlies the ordinary theory of light. [italics added]
On the contrary, this violates the second postulate which makes no refer- ence to inertial nor non-inertial frames.And is his equation a cubic equation for determining c? Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
6 A New Perspective on Relativity
It did not take Max Abraham [12] long to point this out stating that Einstein had given “the death blow to relativity,” by retracting the invariance of c. Abraham said he warned “repeatedly against the siren song of this theory...[and] that its originator has now convinced himself of its untenability.” What Abraham objected most to was that even if rela- tivity could be salvaged, at least in part, it could never provide a “complete world picture,” because it excludes, by its very nature, gravity. Einstein also uses the same Doppler expression for the frequency shift. The Doppler shift is caused by the motion of the source with respect to the observer. “There is, therefore, no logical reason why it should be caused by the gravitational potential, which is assumed to be equivalent to the acceleration times distance” [Essen 71]. Thus Einstein is proposing another mechanism for the shift of spectral lines that employs accelerative motion rather than the relative motion of source and receiver. Does the acceleration of a locomotive cause a shift in the frequency of its whistle? or is it due to its velocity with respect to an observer on a stationary platform? But no, Einstein has replaced the product of acceleration and distance with the gravitational potential — which is static! Just where a clock is in a gravitational field will change its frequency. This is neither a shift caused by velocity nor acceleration. Everyone would agree that Einstein removed the aether. Whereas Hertz considered the aether to be dragged along with the motion of a body, Lorentz considered the aether to be immobile, a reference frame for an observer truly at rest. On the occasion of a visit to Leyden in 1920, Ein- stein [22a] had this to say about the aether: ... the whole change in the conception of the aether which the special theory of relativity brought about, consisted in taking away from the aether its last mechanical quality,namely,its immobility. ...according to the general theory of relativity space is endowed with physical qualities; in this sense, therefore, there exists an aether. ... space without aether is unthinkable; for in such a space there not only would be no propagation of light, but also no possibility of the existence for standards of space and time (measuring rods and clocks), nor therefore any space time intervals in the physical sense. But this aether may not be thought of as endowed with the quality characteristic of ponderable media, as consisting of parts which may be tracked through time. The idea of motion may not be applied to it.
Essentially what Einstein is saying that what was not good for special relativity is good for general relativity for “We know that [the new aether] determines the metrical relations in the space-time continuum.” How is it Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 7
needed for the propagation of light signals and yet has not the character- istics of a medium? Einstein’s real problem is with rotations for “Newton might no less well have called his absolute space ‘aether;’ what is essential is merely that besides observable objects, another thing, which is not per- ceptible, must be looked upon as real, to enable acceleration or rotation to be looked upon as something real.” This is five years after Einstein’s formulation of general relativity, and his desire is to unite the gravitational and electromagnetic fields into “one unified conformation” that would enable “the contrast between aether and matter [to] fade away, and, through the general theory of relativity, the whole of physics would become a complete system of thought.” The search for that utopia was to occupy Einstein for the remainder of his life.
1.1.1.2 Which mass? In Lorentz’s theory two masses result depending on how Newton’s law is expressed, i.e. d F = (mv), dt or
F = ma,
where a is the acceleration. Both forms of the force law coincide when the mass is independent of the velocity, but not so when it is a function of the velocity. If the force is perpendicular to the velocity there results the transverse mass,
m0 mt = √ , (1 − β2) while if parallel to the velocity there results the longitudinal mass, m m = 0 . l (1 − β2)3/2 While it is true that a larger force is required to produce an acceleration in the direction of the motion than when it is perpendicular to the motion, it “is unfortunate that the concept of two masses was ever developed, for the [second] form of Newton’s law is now recognized as the correct one” [Stranathan 42]. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
8 A New Perspective on Relativity
In the early days of relativity the relativistic mass was written m = 4 2 = 2 4 3 E/c , and not m E/c . Einstein was aloof to the factor of 3 — which was a consequence of the Lorentz transform on energy — but not to there being two masses. According to Einstein [05] “with a different definition of the force and acceleration we would obtain different numerical values for the masses; this shows that we must proceed with great caution when comparing different theories of the motion of the electron.” Apart from ‘numerical’ differences, Kaufmann’s experiments identified the mass as the transverse mass, but this did not prevent Einstein [06a] to propose an experimental method to determine the ratio of the transverse to the longitudinal mass. According to Einstein the ratio of the transverse to longitudinal mass would be given by the ratio of the electric force, eE, to the potential, V, “at which the shadow-forming rays get deflected,” i.e.
mt ρ Ex = , ml 2 V
where ρ is the radius of curvature of the shadow-forming rays and Ex is the electric field in the x-direction. As the ‘definition’ of the longitudinal mass, ml, Einstein takes 1 kinetic energy = m v2. 2 l It would be very difficult for Einstein to get this energy as a nonrelativistic approximation of a relativistic expression for the kinetic energy. Einstein’s contention that A change of trajectory evidently is produced by a proportional change of the field only at electron velocities at which the ratio of the transverse to longitudinal mass is noticeably different from unity
is at odds with his assumption of the validity of the equation of motion,
d2x m0 =−eEx, dt2 which holds “if the square of the velocity of the electrons is very small compared to the square of the velocity of light.” The mass of the electron m0 is not specified as to whether it is the transverse or longitudinal mass, or a combination of the two. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 9
This example shows that Einstein was not attached to his relativity theory as he is made out to be. Why is it that the same types of contradic- tions and incertitudes found in Poincaré’s statements are used as proof as to his limitations as a physicist, while there is never mention of them in Einstein’s case?
1.1.1.3 Conspiracy theories In order to defend the supremacy of German science, David Hilbert, with the help of Hermann Minkowski and Emil Wiechert, set out to deny Poincaré the authorship of relativity. Hilbert was the last in a long line of illustrious Göttingen mathematicians who sought to retain the domi- nance of the University which boasted of the likes of Carl Friedrich Gauss, Bernhard Riemann and Felix Klein. Whereas there existed a friendly com- petition between Felix Klein and Poincaré [Stillwell 89], Hilbert’s prede- cessor, there was jealousy between Hilbert and Poincaré, which was only exasperated when Poincaré won the Bolyai prize in mathematics for the year 1905. Ironic as it may be, János Bolyai was the co-inventor of hyper- bolic geometry, and the rivalry between Klein and Poincaré had to do with the development of that geometry. As the story goes, Arnold Sommerfeld [04], an ex-assistant of Klein’s, Gustav Herglotz and Wiechert were working on superluminal electrons during the fall of 1904 through the spring of 1905. In the summer months of 1905, beginning on the notorious date of the 5th of June, the Göttingen mathematicians organized seminars on the ‘theory of electrons,’ in which there was a session on superluminal electrons chaired by Wiechert on the 24th of July. The date of the 5th of June coincided with Poincaré’s [05] presentation of his paper, “Sur la dynamique de l’électron,” to the French Academy of Sciences. The printed paper was published and sent out to all correspon- dents of the Academy that Friday, the 9th of June. The earliest it could have arrived in Göttingen was Saturday the 10th, or given postal delays it would have arrived no latter than the following Tuesday, the 13th of June.a In that
aThese dates are reasonable since the other German physics bi-monthly journal, Fortschrift der Physik had a synopsis of the Poincaré paper in its 30th of June issue. Given the publication delay, it would make the 10th of June arrival date of the Comptes Rendus issue more likely. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
10 A New Perspective on Relativity
paper Poincaré supposedly declared that no material body can go faster than the velocity of light in vacuum, and this threw a wrench into the works of the Göttingen school [Marchal]. However, this is nothing different than what Poincaré [98] had been saying since 1898 when he postulated the invariance of light in vacuo to all observers, whether they are stationary or in motion. Or, to what Poincaré reiterated in 1904: “from these results, if they are confirmed would arise a new mechanics [in which] no velocity could surpass that of light.” So the all-important date of the publication date of 5th of June to the proponents of the conspiracy theory [Leveugle 04] is a red herring for it said only what he had said before on the limiting velocity of light. Moreover, there was a continual boycott of Poincaré’s relativity work in such prestigious German journals as Annalen der Physik. Consequently, there was no con- tingency for the appearance of Einstein’s paper when it did. But let us continue. So the plot was hatched that some German, of minor importance and one who was willing to take the risks of plagiarism, had to be found that would reproduce Poincaré’s results without his name. Now Minkowski knew of Einstein since he had been his student at the ETHb from 1896– 1900. Einstein was also in contact with Planck, since Einstein’s summary of the work appearing in other journals for the Beiblätter zu der Annalen der Physik earned him a small income. In fact, there is one review of Einstein of a paper by A. Ponsot “Heat in the displacement of the equilibrium of a cap- illary system,” that appeared in the Comptes Rendu 140 just 325 pages before Poincaré’s June 5th paper. To make matters worse, an article by Weiss, which appeared in the same issue of Comptes Rendu, was summarized in the November issue of the Supplement, but not for Poincaré’s paper. Neither that paper nor its longer extension that was published in the Rendiconti del Circolo Matematico di Palermo [06] were ever summarized in the Beiblätter. Surely, these papers would have caught the eye of Planck, who was running the Annalen, and was known to be in correspondence with Einstein not only in this connection, but, also with regard to questions on quanta. Einstein had also published some papers on the foundations
bThe Eidgenössische Technische Hochschule (ETH) was then known as the Eid- genössische Polytechnikum; the name was officially changed in 1911. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 11
of thermodynamics during the years 1902–1903 in the Annalen whose sim- ilarity with those of J. Willard Gibbs was “quite amazing” even to Max Born [51]. Thus, the relativity paper was supposedly prepared by the Göttingen mathematicians and signed by Einstein who submitted it for publication at the end of June, arriving at the offices of the Annalen on the 30th of June. Einstein was an outsider, being considered a thermodynamicist, with a lot to gain and little, if nothing, to lose. The paper fails to mention either Lorentz or Poincaré, and, for that matter, contains no references at all. If there was a referee for the paper,c other than Planck himself, it would have been obvious that the transformation of the electrodynamic quantities went under the name of Lorentz, with Lorentz’s parameter k(v) replaced by Einstein’s ϕ(v), both ultimately set equal to 1, and the relativis- tic addition law had already been written down by Poincaré as a conse- quence of the Lorentz transform in his 1905 paper on “Sur la dynamique de l’électron.” Although Einstein derives the relativistic composition law in the same way as Poincaré, he provides a new generalization when the composition of Lorentz transformations are in different planes, for that also involves rotations. It has been claimed that there was no connection between Lorentz and Einstein for Einstein gets the wrong expression for the transverse mass in his “Electrodynamics of moving bodies,” while Lorentz errs when he subjects the electric current to a Lorentz transforma- tion [Ohanian 08]. But, it is clear from his method of derivation from the Lorentz force, that Einstein’s error was a typo. Einstein’s paper appeared in the 26th of September issue of the Annalen, and Planck lost no time in orga- nizing a symposium on his paper that November, which, in the words of von Laue, was “unforgettable.” Not all is conjecture, certain things are known. First, Poincaré worked in friendly competition with Klein in studying universal coverings of sur- faces. What initiated Poincaré on his studies of hyperbolic geometry was
cApparently the paper was handled by Wilhelm Röntgen, a member of the Kurato- rium of the Annalen, who gave it to his young Russian assistant, Abraham Joffe [Auf- fray 99]. Joffe noted that the author was known to the Annalen, and recommended publication. That an experimental physicist should have handled the paper, and not the only theoretician on the Kuratorium — Planck — would have made such a refer- ring procedure extremely dubious. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
12 A New Perspective on Relativity
an 1882 letter of Klein to Poincaré who informed him of previous work by Schwarz. Second, it was Klein who brought Hilbert to Göttingen. When criticized about his choice, Klein responded “I want the most difficult of all.” Third, Klein was known to pass on important letters and scien- tific material to Hilbert. Fourth, since Klein and Poincaré were on good terms and in contact, it would be unthinkable that Klein did not know of Poincaré’s work on relativity, and that Klein would have passed this on to Hilbert. Fifth, there was a lack of “kindred spirit” [Gray 07] between Poincaré and Hilbert from their first meeting in Paris in 1885. Sixth, Poincaré was “unusually open about his sources,” [Gray 07] and non-polemical, while Hilbert had a tremendous will who thought every problem was solv- able. Lastly, Poincaré’s work on relativity was actively boycotted in Ger- many, and later in France thanks to Paul Langevin. Thus, it is unthinkable that Hilbert was in the dark about relativity theory prior to 1905. His col- league, Minkowski, became interested in electrodynamics through reading Lorentz’s papers. According to C. Reid, in “Hilbert,” Hilbert conducted a joint seminar with Minkowski.Ayear after their study,in 1905, they decided to dedicate the seminar to a topic in physics: the electrodynamics of moving bodies. Hilbert was often quoted as saying “physics is too important to be left to the physicists.” What is truly unbelievable that the discover of rela- tivity and two models of hyperbolic geometry would not even once think there was a relation between the two. Everything else is conjecture, even Einstein’s supposed receipt of the latest issue of Volume CXL of Comptes Rendus, vested as a reviewer for the Beiblätter, on Monday the 12th of June in the Berne Patent Office. Undoubtedly, that would have created a dire urgency to finish his article on the electrodynamics of a moving body [Auf- fray 99]. But wherever the real truth may lie, there cannot be any doubt that Planck played a decisive role in Einstein’s rise to fame. The behavior of Langevin to a fellow countryman is even more baf- fling when we realize that he was the first French physicist to learn of the “new mechanics” of Poincaré, which would later be known as relativity,but without the name of its author. Langevin had accompanied Poincaré to the Saint-Louis Congress of 1904 where he presented his principle of relativ- ity. It is hardly admissible that Langevin was not familiar of all Poincaré’s publications especially when Poincaré [06] dedicated a whole section of Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 13
his 1906 article in the Rendiconti to him, entitling it “Langevin Waves,” and stating Langevin has put forth a particularly elegant formulation of the formulas which define the electromagnetic field produced by the motion of a single electron.
Yet, in his obituary column of Poincaré, Langevin fails to note Poincaré’s priority over Einstein’s writing Einstein has rendered the things clearer by underlining the new notions of space and time which correspond to a group totally different than the conserved transforma- tions of rational mechanics, and asserting the generality of the principle of relativ- ity and admitting that no experimental procedure could ascertain the translational movement of a system by measurements made on its interior. He has succeeded in giving definitive form to the Lorentz group and has indicated the relations that exist between the same quantity simultaneously made on each of two systems in relative movement. Henri Poincaré arrived at the same equations in the same time following a differ- ent route, his attention being directed to the imperfect form which the formulas for the transformation had been given by Lorentz. Familiar with the theory of groups, he was preoccupied to find the invariants of the transformation, elements which are unaltered and thanks to which it is possible to pronounce all the laws of physics in a form independent of the reference system; he sought the form that these laws must have in order to satisfy the principle of relativity.
This could not have appeared in a more appropriate place: Revue de Méta- physique et de Morales! Another priority feud also erupted between Einstein and Hilbert over general relativity in November 1915. It ended with the publication of papers with the unpretentious titles of “The foundation of the general theory of relativity,” by Einstein, and “The foundations of physics,” by Hilbert. His- torians of science make Einstein’s theory the ultimate theory of gravita- tion with titles like “How Einstein found his field equations,” [Norton 84], and “Lost in the tensors: Einstein’s struggle with covariance principles” [Earman & Glymour 78]. In the opinion of O’Rahilly [38], “Einstein’s the- ory, which delights every aesthetically minded mathematician, is a much less grandiose affair as judged and assessed by the physicist.” He points out that WaltherRitz arrived at prediction of a perihelion advance of the planets in 1908. We will use his same force equation to show he could have obtained the other predictions of general relativity in Sec. 3.8.2. Furthermore, the same experimental tests of these equations can be obtained with far more simplicity, as we shall see in Chapter 7. The proponents of the conspiracy Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
14 A New Perspective on Relativity
theory claim that Einstein’s conciliatory letter of December to Hilbert may be due, in part, for the favor that Hilbert did for him ten years earlier. The defenders of Einstein belittle Poincaré for his “lack of insight into certain aspects of the physics involved” [Goldberg 67]. The same can be said of Einstein; in a much quoted letter to Carl Seelig on the occasion of the 50th ‘anniversary’ of relativity, Einstein writes: The new feature was the realization of the fact that the bearing of the Lorentz- transformations transcended their connection with Maxwell’s equations and was concerned with the nature of space and time in general. A further result was that the Lorentz invariance is a general condition for any physical theory. This was for me of particular importance because I had already previously found that Maxwell’s theory did not account for the micro-structure of radiation and could therefore have no general validity.
In a letter to von Laue in 1952, Einstein elaborated what he meant by a “second type” of radiation pressure: one has to assume that there exists a second type of radiation pressure, not deriv- able from Maxwell’s theory, corresponding to the assumption that radiation energy consists of indivisible point-like localized quanta of energy hν (and of momentum hν/c, c = velocity of light), which are reflected undivided. The way of looking at the problem showed in a drastic and direct way that a type of immediate reality has to be ascribed to Planck’s quanta, that radiation, must, therefore, possess a kind of molecular structure as far as energy is concerned, which of course contradicts Maxwell’s theory.
Maxwell’s equations together with the Lorentz force satisfy the Lorentz transform so it is difficult to see that the transformation is more general than what it transforms. In addition, the discovery of Planck’s radiation law did not contradict the Stefan–Boltzmann radiation law, nor provide a new type of radiation. Here, Einstein is confusing macro- scopic laws with the underlying microscopic processes that are entirely compatible with those laws when the former are averaged over all fre- quencies of radiation. Consequently, there is no second type of radiation pressure. What the conspiracy theories have in common with their opponents is the presumption that the end result is correct. What authority did Poincaré’s June paper of 1905 have for dashing the efforts of Sommerfeld’s investiga- tions on superluminal electrons? Weber was no stranger to superluminal particles nor was Heaviside. In all the years preceding that paper, there was no authority bearing down upon them even though the mathemat- ical structure of relativity had been set in place. What was supposedly Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 15
new about Einstein’s paper was the liberation of space and time from an electromagnetic framework, as he claimed in his letter to Seelig. But is this true?
1.1.1.4 Space-time in Einstein’s world The conventional way of rebuffing the conspiracy theories is “to show the nature of Poincaré’s ideas and approach that prevented him from pro- ducing what Einstein achieved” [Cerf 06]. Einstein was not so unread as he would have us believe for he used Poincaré’s method — radar signal- ing — in discussing simultaneous events, and falls into the same trap as Poincaré did. Poincaré asks us to consider two observers, A and B, who are equipped with clocks that can be synchronized with the aid of light signals. B sends a signal to A marking down the time instant in which it is sent. A, on the other hand, resets his clock to that instant in time when he receives the signal. Poincaré realized that such a synchronization would introduce an error because it takes a time t for light to travel between B and A. That is, A’s clock would be behind B’s clock by a time t = d/c, where d is the distance between B and A. This error, according to Poincaré is easy to correct: Let A send a light signal to B. Since light travels at the same speed in both directions, B’s clock will be behind A’s by the same time t. Therefore, in order to synchronize their clocks it is necessary for A and B to take the arithmetic mean of the times arrived at in this way. This is also Einstein’s result. Certainly the definition of the velocity v = d/t seems innocuous enough. But, as Louis Essen [71] has pointed out it is possible to define the units of any two of these terms. Normally, one measures distance in meters and time in seconds so the velocity is meters per second. But making the velocity of light constant “in all directions and to all observers whether sta- tionary or in relative motion” is tantamount to making c a unit of measure- ment, or what will turn out to be an absolute constant. According to Essen, “the definition of the unit of length or of time must be abandoned; or, to meet Einstein’s two conditions, it is convenient to abandon both units.” The two conditions that Essen is referring to is the dilatation of time and the contraction of length. There is no new physical theory, but, “simply a new system of units in which c is constant” so that either time or length Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
16 A New Perspective on Relativity
or both must be a function of c such that their ratio, d/t, gives c. This is not what Louis de Broglie [51] had to say: Poincaré did not take the decisive step. He left to Einstein the glory of having perceived all the consequences of the principle of relativity and, in particular, of having clarified through a deeply searching critique of the measures of length and duration, the physical nature of the connection established between space and time by the principle of relativity.
So by elevating the velocity of light to a universal constant, Einstein implied that the geometry of relativity was no longer Euclidean. The number c is an absolute constant for hyperbolic geometry that depends for its value on the choice of the unit of measurement. To the local observers there is no such thing as time dilatation nor length contrac- tion. These distortions are due to our Euclidean perspective. It is all a question of ‘frame of reference.’
Poincaré after having written down his relativistic law of the com- position of velocities should have realized that the only function which could satisfy such a law is the hyperbolic tangent, which is the straight line segment in Lobachevsky (velocity) space. Thus, time and space have no separate meaning, but only their ratio does. Consider Einstein’s two postulates which he enunciated in 1905:
(i) The same laws of electrodynamics and optics will be valid for all frames of reference for which the equations of mechanics hold. (ii) Light is always propagated in empty space with a definite velocity c, which is independent of the state of motion of the emitting body.
Match them against Poincaré’s first two postulates as he pronounced them in 1904:
(i) The laws of physical phenomena should be the same whether for an observer fixed, or for an observer carried along in a uniform move- ment of translation; so that we could not have any means of discerning whether or not we are carried along in such a motion; (ii) Light has a constant velocity and in particular that its velocity is the same in all directions. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 17
Now Poincaré introduces a third postulate, which Pais makes the following comment: The new mechanics, Poincaré said, is based on three hypotheses. The first of these is that bodies cannot attain velocities larger than the velocity of light. The second is (I use modern language) that the laws of physics shall be the same in all inertial frames. So far so good. Then Poincaré introduces a third hypothesis. ‘One needs to make still a third hypothesis, much more surprising, much more difficult to accept, one which is of much hindrance to what we are currently used to. A body in translational motion suffers a deformation in the direction in which it is displaced... However strange it may appear to us, one must admit that the third hypothesis is perfectly verified.’ It is evident that as late as 1909 Poincaré did not know that the contractions of rods is a consequence of the two Einstein postulates. Poincaré therefore did not understand one of the most basic traits of special relativity.
Whether or not rods contract or rotate when in motion will be dis- cussed in Sec. 9.9, but it appears that Pais is reading much too much into what Poincaré said as to what he actually did. In Sec. 4 of “Sur la dynamique de l’électron” published in 1905, entitled “The Lorentz transformation and the principle of least action,” Poincaré shows that both time dilatation and space contraction follow directly from the Lorentz transformations. By the Lorentz transformation, δx = γl(δx − βct), δy = lδy, δz = lδz, δt = γl(δt − βδx/c),
it follows that for measurements made on a body at the same moment, δt = 0, in an inertial system moving with a relative velocity β = v/c along − the x-axis, the body undergoes contraction by a factor γ 1 when viewed in the unprimed frame when we set l = 1. It is therefore very strange that Poincaré would reintroduce this as a third hypothesis when it is a con- sequence of Lorentz’s transformation which he accepts unreservedly. As Poincaré was prone to writing popular articles and books he may have thought that the contraction of rods were sure to catch the imagination of the layman. The problem is in the interpretation of what is meant by the second postulate regarding the constancy of light, which is usually interpreted as the velocity of light relative to an observer, whether he be stationary or moving at a velocity v. Thus, instead of obtaining values c + v or c − v for the velocity of light, for an observer moving at ±v relative to the source, one would always ‘measure’ c. A frequency would therefore not undergo a Doppler shift, contrary to what occurs. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
18 A New Perspective on Relativity
According to Einstein’s prescription, the time taken for a light signal to complete a ‘back-and-forth’ journey over a distance d is the arithmetic average of the two 1 1 1 c t = d + = d . 2 c + v c − v c2 − v2
We are thus forced to conclude that instead of obtaining the velocity c,we get the velocity c(1 − v2/c2), which differs from the former in the presence of a second order term, −v2/c2. Rather, if we use the relativistic velocities (c + v)/(1 + v/c) and (c − v)/(1 − v/c), we obtain 1 1 + v/c 1 − v/c t = d + = d/c, 2 c + v c − v
and the second-order effect disappears, just as it would in the Michelson– Morley experiment [cf. Sec. 3.2]. It is not as Einstein claims: “The quotient [distance by time] is, in agreement with experience, a universal constant c, the velocity of light in empty space.” The ‘experience’ is the transmission of signals back and forth, like those envisioned by Poincaré. In this setting, the ‘principle’ of the constancy of light is untenable [Ives 51]. The velocities of light in the out and back directions co and cb will, in general, be different. If the distance traversed by the light signal is d, the total time for the outward and backward journey is, according to Einstein, 1 d d co + c d t = + = d . (1.1.1) 2 co cb cocd 2
But, according to the principle of relativity, there should be no difference in the velocities of light in the outward and backward directions, so that this principle decrees
d t = . (1.1.2) c
Equating (1.1.1) and (1.1.2) yields [Ives 51]
(co + c )/2 1 b = , cocb c Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 19
which can easily be rearranged to read: √ √ c = 2 (cocb) ≤ + 1. (1.1.3) (cocb) (co cb) The inequality in (1.1.3) follows from the arithmetic-geometric mean = = inequality which becomes an equality only when co cb c. Thus, if there are no superluminal velocities, the latter case must hold, for if not, one of the two velocities, co or cb must be greater than c. Asimilar situation occurs for the inhomogeneous dispersion equation of a wave [cf. Sec. 11.5.6], 2 = 2 2 + 2 ω c κ ω0,
where ω and κ are the frequency and wave number, and ω0 is the critical frequency below which the wave becomes attenuated. Differentiation of the dispersion equation gives
ωdω = c2κ dκ.
Introducing the definitions of phase and group velocities, u = ω/κ and w = dω/dκ, it becomes apparent that u > c implies w < c [Brillouin 60]. Since uw = c2, the equivalence of the two velocities requires the critical frequency to vanish and so restores the isotropy of space. Einstein [05] uses absolute velocities to show that two observers trav- eling at velocities ±v would not find that their clocks are synchronous while those at rest would declare them so. He considers light emitted at A at time tA to be reflected at B at time tB which arrives back at A at time tA. If d is the distance between A and B, the time for the outward and return journeys are d t − t = , B A c + v and
d t − t = , A B c − v respectively. Since these are not the same, Einstein concludes that what seems simultaneous from a position at rest is not true when in relative motion. But, in order to do so, Einstein is using absolute velocities: the velocity on the outward journey is c + v, and the velocity of the return Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
20 A New Perspective on Relativity
journey is, c − v, and so violates his second postulate. If the relativistic law of the composition of velocities is used, instead, the total times for outward and return journeys become the same, which is what is found to within the limits of experimental error [Essen 71]. Einstein then attempts to associate physical phenomena with the fact that clocks in motion run slower than their stationary counterparts, and rods contract when in motion in comparison with identical rods at rest. He considers what is tantamount to the Lorentz transformations, as a rotation through an imaginary angle, θ, x = x cosh θ − ct sinh θ, ct = ct cosh θ − x sinh θ, at the origin of the system in motion so that x = 0. He thus obtains √ x/t = c tanh θ, t = t (1 − v2/c2) = t/ cosh θ. (1.1.4)
He then concludes that clocks transported to a point will run slower by 1 2 2 an amount 2 tv /c with respect to stationary clocks at that point, which is valid up to second-order terms. Rather, what Einstein should have noticed is that − 1 1 + v/c θ = tanh 1 v/c = ln 2 1 − v/c is the relative distance in a hyperbolic velocity space whose ‘radius of curvature’ is c. Space and time have lost their separate identities, and only appear in the ratio v = x/t whose hyperbolic measure is θ = v¯/c. The role of c is that of an absolute constant, whose numerical value will depend on the arbitrary choice of a unit segment. By raising the velocity of light to a universal constant, Einstein implied that the space is no longer Euclidean. Euclidean geometry needs standards of length and time; in this sense Euclidean geometry is relative. In terms of meters and seconds, the speed of light is 3×108 m/s. If there was no Bureau of Standards we would have no way of defining what a meter or second is. Not so in Lobachevskian geometry where angles determine the sides of the triangle. In Lobachevskian geometry lengths are absolute as well as angles. The ‘radius of curvature’ c is no longer an upper limit to the velocities, but, rather, defines the unit of measurement. Lobachevskian geometries with different values of c will not be congruent. As c approaches infinity, Lobachevskian formulas go over into their Euclidean counterparts. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 21
The exponential distance, 1/2 ¯ 1 + v/c ν ev/c = = , (1.1.5) 1 − v/c ν is the ordinary longitudinal Doppler factor for a shift in the frequency, ν , due to a moving source at velocity v. In the Euclidean limit, θ ≈ x/ct and (1.1.5) reduces to the usual Doppler formula [Variˇcak 10]: ν = ν(1 + v/c).
It is undoubtedly for this reason that both Einstein and Planck found non- Euclidean geometries distasteful. For as Planck remarked [98] It need scarcely be emphasized that this new conception of the idea of time makes the most serious demands upon the capacity of abstraction and projective power of the physicist. It surpasses in boldness everything previously suggested in spec- ulative natural phenomena and even in the philosophical theories of knowledge: non-Euclidean geometry is child’s play in comparison. And, moreover, the princi- ple of relativity, unlike non-Euclidean geometry, which only comes seriously into consideration in pure mathematics, undoubtedly possesses a real physical signifi- cance. The revolution introduced by this principle into the physical conceptions of the world is only to be compared in extent and depth with that brought about by the introduction of the Copernican system of the universe.
Prescinding Planck’s degrading remarks concerning non-Euclidean geometries, we can safely conclude that
the distortion effects due to the spatial contraction and time dilatation of moving objects can be perceived by an observer using a Euclidean metric and clock. To local observers in hyperbolic space, there is no possible way of discerning these distortions because their rulers and clocks shrink or expand with them. All the ‘peculiar consequences’ are based on the issue of ‘frame of reference.’
What is truly tragic is that Poincaré never realized that his models of non-Euclidean geometries were pertinent to relativity.According to Arthur Miller [73] For a scientist of Poincaré’s talents the awareness of Lorentz’s theory should have been the impetus for the discovery of relativity. Poincaré seemed to have all the req- uisite concepts for a relativity theory: a discussion of the various null experiments to first and second order accuracy in v/c; a discussion of the role of the speed of light in length measurements; the correct relativistic transformation equations for the electromagnetic field and the charge density; a relativistically invariant action Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
22 A New Perspective on Relativity
principle; the correct relativistic equation for the addition of velocities; the concept of the Lorentz group; a rudimentary of the four-vector formalism and of four-dimensional space; a correct relativistic kinematics...[italics added]
so what went wrong? Miller claims that “his relativity was to be an induc- tive one with the laws of electromagnetism as the basis of all of physics.” This, according to Miller, prevented him from grasping the “universal applicability of the principle of relativity and therefore the importance of the constancy of the velocity of light in all inertial frames.” In other words, the equations are right but the deductions are wrong. One can deduce what he likes from the equations as long as it is compatible with experiment. While Miller [81] acknowledges that both Poincaré and Einstein, “simultaneously and independently,” derived the relativistic addition law for velocities, “only Einstein’s view could achieve its full potential.” He further claims that Poincaré never proved “the independence of the veloc- ity of light from its source....” These assertions have no justification at all: Poincaré did not have to prove anything, the velocity addition law negates ballistic theories. It is also not true that “Lorentz’s theory contained special hypotheses for this purpose.” No special hypotheses are needed since the velocity addition law is a direct outcome of the Lorentz transformations. Here is a clear intent to disparage Poincaré. And where is the experimental verification of Einstein’s theory as opposed to Poincaré’s? Or, maybe, Poincaré just did not go far enough? According to Scribner [64] the whole of the kinematical part of Einstein’s 1905 paper could have been rewritten in terms of aether theory. So accord- ing to him, the aether would play the role of the caloric in Carnot’s theory which, by careful use, did not invalidate his results. Carnot never ‘closed’ his cycle for that would have meant equating the heat absorbed at the hot reservoir with the heat rejected at the cold reservoir since, according to caloric theory, heat had to be conserved. Where Einstein puts into quotation marks “stationary” as opposed to “moving” it does not imply a physical difference because one is relative to the other. Moreover, the distinctions between “real” and “apparent” must likewise be abandoned. If there is no distinction between the two, then why should Einstein have taken exception to Variˇcak’s remark that Einstein’s “contraction is, so to speak, only a psychological and not a physical fact.” This brought an immediate reaction from Einstein to the effect that Variˇcak’s Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 23
note “must not remain unanswered because of the confusion that it could bring about.” After all these years has the confusion been abated? To condemn Lorentz and Poincaré for their belief in the aether is absurd. The aether for them was the caloric for Carnot. But did the caloric invalidate Carnot’s principle? And if Carnot has his principle, why does Poincaré not have his? Carnot’s principle still stands when the scaffolding of caloric theory falls. Another analogy associates Poincaré to Weber, and Einstein to Max- well. Weber needed charges as the seat of electrical force, while Maxwell needed the aether as the medium in which his waves propagate. Maxwell’s circuital equations make no reference to charges as the carriers of electricity. Miller [73] asserts that Poincaré did not realize “in a universal rela- tivity theory the basic role is played by the energy and momentum instead of the force.” But it was Lorentz’s force that was able to bridge Maxwell’s macroscopic field equations with the microscopic world of charges and currents. It is clear that Poincaré did not want to enter into polemics with Einstein. And Einstein, on his part, admits that his work was preceded by Poincaré. After a critical remark made by Planck on Einstein’s first derivation of m = E/c2, to the effect that it is valid to first-order only, the following year Einstein [06b] makes another attempt. In this study he proposes to show that this condition is both necessary and sufficient for the law of momentum, which maintains invariant the center of grav- ity, citing Poincaré’s 1900 paper in the Lorentz Festschrift. He then goes on to say Although the elementary formal considerations to justify this assertion are already contained essentially in a paper of Poincaré, I have felt, for reasons of clarity, not to avail myself of that paper.
Even though Einstein clearly admits to Poincaré’s priority no one seems to have taken notice of it. On July the 5th 1909, Mittag-Leffler, editor of Acta Mathematica writes to Poincaré to solicit a paper on relativity writing You know without doubt Minkowski’s Space and Time published after his death, and also the ideas of Einstein and Lorentz on the same problem. Now, Fredholm tells me that you have reached the similar ideas before these other authors in which you Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
24 A New Perspective on Relativity
express yourself in a less philosophical, but more mathematical, manner. Would you write me a paper on this subject ...in a comprehensible language that even the simple geometer would understand.
Poincaré never responded. Then there was the letter of recommendation of Poincaré’s to Weiss at the ETH where he considers Einstein as one of the most original minds that I have met. I don’t dare to say that his predictions will be confirmed by experiment, insofar as it will one day be possible.
Notwithstanding, Einstein writes in November 1911 that “Poincaré was in general simply antagonistic.” Relativity was probably just a word to him, since it was he who postulated the ‘principle of relativity.’ But it is true that Poincaré looked to experimental confirmation for his princi- ple. Be that as it may, what is truly incomprehensible is Poincaré’s lack of appreciation of the velocity addition law, for that should have put him on the track of introducing hyperbolic geometry. Then the distortions in space and time could be explained as the distortion we Euclideans observe when looking into another world governed by the axioms of hyperbolic geometry. To the end of his life, Poincaré maintained that Euclidean geom- etry is the stage where nature enacts her play, never once occurring to him that his mathematical investigations would have some role in that enactment. Now Poincaré was more than familiar with Lorentz’s contraction of electrons when they are in motion. He even added the additional, non- electromagnetic, energy necessary to keep the charge on the surface of the electron from flying off in all directions. The contraction of bodies is likened to the inhabitants of this strange world becoming smaller and smaller as they approach the boundary.The absolute constant needed for such a geom- etry would be the speed of light which would determine the radius of cur- vature of this world. In retrospect, it is unbelievable how Poincaré could have missed all this. It is also said that Poincaré was using the principle of relativity as a fact of nature, to be disproved if there is one experiment that can invalidate it. This is not much different than the second law of thermodynamics. In fact when Kaufmann’s measurements of the specific charge initially tended Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 25
to favor the Abraham model of the electron [cf. Sec. 5.4.1], Poincaré [54] appears to have lost faith in his principle for [Kaufmann’s] experiments have given grounds to the Abraham theory. The princi- ple of relativity may well not have been the rigorous value which has been attributed to it.
Kaufmann’s experiments were set-up to discriminate between various models proposed for the dependency of the mass of the electron on its speed. And if the Lorentz model had been found wanting, Einstein had much more to lose since his generalization of Lorentz’s electron theory to all of matter would certainly have been its death knell. Einstein had this to say in his Jahrbuch [07] article: It should also be mentioned that Abraham’s and Bucherer’s theories of the motion of the electron yield curves that are significantly closer to the observed curve than the curve obtained from the theory of relativity. However, the probability that their theories are correct is rather small, in my opinion, because their basic assumptions concerning the dimensions of the moving electron are not suggested by theoretical systems that encompass larger complexes of phenomena.
The last sentence is opaque, for what do the dimensions of a moving electron share with larger complexes of phenomena? And how are both related with Kaufmann’s deflection measurements? Einstein may not have liked Abraham’s model, but Abraham did because, according to him, it was based on common sense. It must be remembered that Lorentz’s theory of the electron was also a model. According to Born and von Laue, Abraham will be remembered for his unflinching belief in “the absolute aether, his field equations, his rigid electron just as a youth loves his first flame, whose memory no later experience can extinguish.” But how rigid could Abraham’s electron be if the electrostatic energy depended on its contraction when in motion? That is everyone will agree that “Abraham took his electron to be a rigid spherical shell that maintained its spherical shape once set in motion... [yet] a sphere in the unprimed coordinate system becomes, in the primed system, an ellipsoid of revolu- tion” [Cushing 81]. The unprimed system is related to the prime system by a dilation factor, equal to the inverse FitzGerald–Lorentz contraction, which elongates one of the axes into the major axis of the prolate ellipsoid. In the Lorentz model, one of the axes is shortened by the contraction factor so that an oblate ellipsoid results. In fact, as we shall see in Sec. 5.4.4, that Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
26 A New Perspective on Relativity
the models of Abraham and Lorentz are two sides of the same coin, which are related in the same way that hyperbolic geometry is related to elliptic geometry, or a prolate ellipsoid to an oblate ellipsoid. If we take Einstein’s [Northrop 59] remark: If you want to find out anything about theoretical physicists, about the methods they use, I advise you to stick closely to one principle: don’t listen to their words, fix your attention on their deeds.
at face value, then according to Einstein’s own admission, there is no dif- ference between the Poincaré–Lorentz theory and his. Whether the mass comes from a specific model of an electron in motion, or from general prin- ciples which makes no use of the fact that the particle is charged or not, they merge into the exact same formula for the dependence of mass on speed.
1.1.2 Models of the electron
At the beginning of the twentieth century several models of the electron were proposed that were subsequently put to the test by Kaufmann’s exper- iments involving the deflection of fast moving electrons by electric and magnetic fields. The two prime contenders were the Abraham and Lorentz models. If mass of the electron were of purely electromagnetic origin, it should fly apart because the negative charges on the surface would repel one another. There is a consensus of opinion that it was for this reason Abraham chose a rigid model of an electron which would not see the accu- mulation of charge that a deformed sphere would. Miller [81] contends that Abraham “chose a rigid electron because a deformable one would explode, owing to the enormous repulsive forces between its constituent elements of charge.” Even a spherical electron would prove unstable without some other type of binding forces. In that case, “the electromagnetic foundations would be excluded from the out- set,” according to Abraham. In order to calculate the electrostatic energy Abraham needed an expression for the capacitance for an ellipsoid of revolution. This he found in an 1897 paper by Searle. The last thing he had to do was to postulate a dependence of the semimajor axis of rev- olution upon the relative velocity β = v/c. ‘Rigid’ though the electron may be, Abraham evaluated the electrostatic energy in the primed system where a sphere of radius a turns into a cigar-shaped prolate ellipsoid with Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 27 √ semimajor axis a/ (1−β2). So Abraham’s rigid electron was not so rigid as he might have thought for the total electromagnetic energy he found was proportional to [Bucherer 04]: 1 1 + β ln − β. 2 1 − β This expression happens to be the difference between the measures of dis- tance in hyperbolic and Euclidean velocity spaces. When the radius of cur- vature, c, becomes infinite, the total electromagnetic energy will vanish, and we return to Euclidean space. So Abraham’s total electromagnetic energy was a measure of the distance into hyperbolic space which depended on the magnitude of the electron’s velocity. Abraham’s model fell into disrepute, and even Abraham abandoned it in latter editions of his second volume of Theorie der Elektrizität. However his electron turns out to be a cigar-shaped, prolate ellipsoid when in motion, while Lorentz’s was a pancake-shaped, oblate ellipsoid. So the two models were complementary to one another; the former belonging to hyperbolic velocity space while the latter to elliptic velocity space, with the transition between the two being made by ‘inverting’ the semimajor and semiminor axes.
1.1.3 Appropriation of Lorentz’s theory of the electron by relativity
Another historian of science, Russell McCormmach [70], claims that: Einstein recognized that not only electromagnetic concepts, but the mass and kinetic energy concepts, too, had to be changed. Entirely in keeping with his goal of finding common concepts for mechanics and electromagnetism, he deduced from the elec- tron theory elements of a revised mechanics. In his 1905 paper he showed that all mass, charged or otherwise, varies with motion and satisfies the formulas he derived for the longitudinal and transverse masses of the electron. He also found a new kinetic energy formula applying to electrons and molecules alike. And he argued that no particle, charged or uncharged, can travel at a speed greater than that of light since otherwise its kinetic energy becomes infinite. He first derived these non-Newtonian mechanical conclusions for electrons only. He extended them from electrons to material particles on the grounds that any material particle can be turned into an electron by the addition of charge “no matter how small.” It is curi- ous to speak of adding an indefinitely small charge, since the charge of an electron is finite. Einstein could speak this way because he was concerned solely with the “electromagnetic basis of Lorentzian electrodynamics and optics of moving bodies” [italics added]. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
28 A New Perspective on Relativity
The argument that takes us from electrodynamic mass to mass in general is the following. Kaufmann and others have deflected cathode rays by electric and magnetic fields to find the ratio of charge to mass. This ratio was found to change with velocity. If charge is invariant, then it must be the mass in the ratio that increases with the particle’s velocity. These measurements cannot be used to confirm that all the mass of the electron is electromagnetic in nature. The reason is that “Einstein’s theory of relativity shows that mass as such, regardless of its origin, must depend on the velocity in a way described by Lorentz’s formula” [italics added] [Born 62]. In a collection dedicated to Einstein, Dirac [86] in 1980 observed In one aspect Einstein went much farther than Lorentz, Poincaré and others, namely in assuming that the Lorentz transforms should be applicable in all of physics, and not only in the case of phenomena related to electrodynamics. Any physical force, that may be introduced in the future, must be consistent with Lorentz transforms.
According to J. J. Thomson [28],
Einstein has shown that to conform with the√ principles of Relativity mass must − 2 2 vary with velocity according to the law m0/ (1 v /c ). This is a test imposed by Relativity on any theory of mass. We see that it is satisfied by the conception that the whole of the mass is electrical in origin, and this conception is the only one yet advanced which gives a physical explanation of the dependence of mass on velocity.
So this would necessarily rule out the existence of neutral matter, and, in fact, this is what Einstein [05] says when he remarks that charge “no matter how small” can be added to any ponderable body. The dependencies of mass upon motion arose from the assumption that bodies underwent contraction in the direction of their motion. This follows directly from the nature of the Lorentz transformation. From the geometry of the body one could determine the energy, W, and momentum, G, since the two are related by
dW = vdG,
in a single dimension. Then since G = mv, the expression for the increment in the energy becomes
dW = v2dm + mv dv. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 29
Introducing dW = c2dm, and integrating lead to √ 2 m/m0 = 1/ (1 − β ), (1.1.6)
where m0 is a constant of integration, and β = v/c, the relative velocity. Expression (1.1.6) was derived by Gilbert N. Lewis in 1908. The same proof was adopted by Philipp Lenard, a staunch anti-relativist, in his Über Aether und Uräther who attributes it to Hassenöhrl’s [09] derivation of radia- tion pressure. The only verification of a dependency of mass upon velocity at that time was Kaufmann’s experiments on canal rays. Kaufmann was able to measure the ratio e/m, and assuming that the charge is constant, all the variation of this ratio must be attributed to the mass. The mass of the negative particle contains both electromagnetic and non-electromagnetic contributions. However, Lewis contended that what- ever its origin is mass remains mass so that “it matters not what the sup- posed origin of this mass may be. Equation (1.1.6) should therefore be directly applicable to the experiments of Kaufmann.” But an accelerating electron radiates, and the radiative force is missing from dG. This did not trouble Lewis, and he went on to compare the observed value of the relative velocity with that calculated from (1.1.6). His results are given in the follow- ing table.
m/m0 β (observed) β (calculated)
10 0 1.34 0.73 0.67 1.37 0.75 0.69 1.42 0.78 0.71 1.47 0.80 0.73 1.54 0.83 0.76 1.65 0.86 0.80 1.73 0.88 0.82 2.05 0.93 0.88 2.14 0.95 0.89 2.42 0.96 0.91 Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
30 A New Perspective on Relativity
Although the calculated and observed values of the relative veloci- ties follow the same monotonic trend, the latter are between 6–8% larger. Lewis believed that this was within the limits of experimental error in Kauf- mann’s experiments. While Kaufmann claimed a higher degree of accuracy is necessary, Lewis believed that notwithstanding the extreme care and delicacy with which the observations are made, it seems almost incredible that measurements of this character, which con- sisted in the determination of the minute displacement of a somewhat hazy spot on a photographic plate, could have been determined with the precision claimed.
So what is Lewis comparing his results to? Kaufmann’s initial results agreed better with the expression, m 3 1 1 + β2 1 + β = ln − 2 1 , m0 4 β 2β 1 − β
derived from Abraham’s model rather than (1.1.6), which coincides with the Lorentz model, but which has been “derived from strikingly different principles.” Why neutral matter should be subject to the deflection by the electromagnetic fields in Kaufmann’s set-up is not broached. But, Lewis considers that the mass of a positively charged particle emanating from a radioactive source would be a good test-particle because it consists of mainly ‘ponderable’ matter with a very small ‘electromagnetic’ mass. Lewis believed that his non-Newtonian mechanics revived the parti- cle nature of light. From the fact that the mass, according to (1.1.6), becomes infinite as the velocity approaches that of light, it follows that “a beam of light has mass, momentum and energy, and is traveling at the velocity of light would have no energy, momentum, or mass if it were at rest....” This is almost two decades before Lewis [26] was to coin the name ‘photon’ in a paper entitled “The conservation of photons.” The paper was quickly forgotten, but the name stuck.
1.2 Physicists versus Mathematicians
In attempting to unravel the priority rights to the unification of light and electricity we can appreciate a remarkable confluence of physi- cists and mathematicians in one single arena that was never to repeat itself. On the physics side there were André-Marie Ampère, Ludwig Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 31
Boltzmann, Rudolf Clausius, Michael Faraday, Hermann von Helmholtz, James Clerk Maxwell, and Wilhelm Weber, while on the mathematics side there were Carl Friedrich Gauss and Bernhard Riemann, and those that should have been there, but were not: János Bolyai and Nicolai Ivanovitch Lobachevsky. To Ampère credit must go to the fall of the universal validity of New- ton’s inverse square law as a means by which particles interact with one another at a distance. Today, Ampère is remembered as a unit, rather than as the discoverer of that law, and contemporary treatises on electromag- netism present the alternative formulation of Jean-Baptiste Biot and Félix Savart. Although both laws of force coincide when the circuit is closed, they differ on the values that the force takes between two elements of current when open. That the interaction of persisting direct (galvanic) currents needed an angular-dependent force was loathed and scorned at. Surely, magnetism cannot be the result of the motion of charged particles. Odd as it may seem, like many of the French physics community, Biot rejected Ampère’s discovery outright. Since the angular dependencies vanish when electric currents appear in complete circuits, it seemed as extra baggage to many, including Maxwell, who reasoned in continuous fields which could store energy and media (i.e. the aether) in which waves could propagate in. Yet, it was Ampère’s attempt that would initiate a search for a molecular understand- ing of what electricity is and how it works.
1.2.1 Gauss’s lost discoveries It may take very long before I make public my investigations on this issue; in fact, this may not happen in my lifetime for I fear the ‘clamor of the Boeotians.’ Gauss in a letter to Bessel in 1829 on his newly discovered geometry.
Gauss’s seal was a tree but with only seven fruits; his motto read “few, but ripe.” Such was, in effect, an appraisal of Gauss’s scientific accomplish- ments. Gauss had an aversion for debate, and, probably, a psychological problem of being criticized by people inferior to him, like the Boeotians of Greece who were dull and ignorant. Ampère’s discovery would have finished in oblivion had it not caught the eye of Gauss. By 1828 Gauss was resolved to test Ampère’s angle law when he came into contact with a young physicist, Wilhelm Weber. With no Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
32 A New Perspective on Relativity
surprise, Weber was offered a professorship at Göttingen three years later, and an intense collaboration between the two began. According to his 1846 monograph, Weber was out to measure a force of one current on the other. This was something not contemplated by Ampère who was satisfied to making static, or what he called ‘equilibrium,’ measurements. When Weber was ready to present his results, he shied away from a discussion of the angular force because he knew it would cause commotion. A letter from Gauss persuaded him otherwise, and insisted that further progress was needed to find a “constructible representation of how the propagation of the electrodynamic interaction occurs.” Weber accepted Fechner’s model in which opposite charges are mov- ing in opposite directions, and interpreted Ampère’s angular force in terms of the force arising from relative motion, depending not only on their rel- ative velocities but also on their accelerations. In so doing, Weber can thus be considered to be the first relativist! The anomaly in Ampère’s law, where there appears a diminution of the force at a certain angle, now appeared as a diminution of the force at a certain speed. That constant later became known as Weber’s constant, and in a series of experiments carried out with Rudolf Kohlrasch it was found to be the speed of light, increased by a fac- tor of the square root of 2. Present at these experiments was Riemann, and Riemann was later to present his own ideas on the matter. In the 1858 paper, “A contribution to electrodynamics,” that was read but not published until after Riemann’s death, Riemann states I have found that the electrodynamic actions of galvanic currents may be explained by assuming that the action of one electrical mass on the rest is not instantaneous, but is propagated to them with a constant velocity which, within the limits of observation, is equal to that of light.
Although he errs referring to φ =−4πρ as Poisson’s law, instead of ∇2φ =−4πρ, Riemann surely did not merit the wrath that Clausius bestowed upon him. Riemann proposes a law of force similar to that of Weber, where the accelerations along the radial coordinate connecting the two particles are replaced by the accelerations projected onto the coordi- nate axes, and advocates the use of retarded potentials instead of a scalar potential. In his Treatise, Maxwell cites Clausius’s criticisms as proof of the unsoundness of Riemann’s paper. Surely, Maxwell had no need of Clausius’s help, so it was probably used to avoid direct criticism. Moreover, Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 33
Clausius’s criticisms are completely unfounded, and what Maxwell found wanting in Weber’s electrokinetic potential actually applies to Clausius’s expression. Whereas Clausius had some grounds for his priority dispute with Kelvin when it came to the second law, here he has none. Weber’s formulation,which today is all but forgotten, held sway in Germany until Heinrich Hertz [93], Helmholtz’s former assistant, verified experimentally the propagation of electromagnetic waves and showed that they had all the characteristics of light. Helmholtz then crowned Maxwell’s theory, and went even a step further by generalizing it to include longitu- dinal waves, if ever there would be a need of them [cf. Sec. 11.5.5]. Gauss played a fundamental role in bridging the transition from Ampère to Weber. Moreover, Maxwell’s formulation of a wave equation, from his circuit equations, in which electromagnetic disturbances prop- agate at the speed of light, was undoubtedly what Gauss thought was as an oversimplification of the problem. The complexity of the interac- tions in Ampère’s hypothesis persuaded him that it was not as simple as writing down a wave equation for a wave propagating at the speed of light. This will not be the only time Gauss loses out on a fundamental discovery. Gauss’s letters are more telling than his publications, and if it had not been for his reluctance to publish he would have certainly been the discov- erer of what we now know as hyperbolic geometry. Gauss wrote another famous letter, this time to Taurinus in 1824, again reluctant to publish his findings. This is what he said: ◦ ...that the sum of the angles cannot be less than 180 ; this is the critical point, the reef on which all the wrecks occur...I have pondered it for over thirty years, and I do not believe that anyone can have given it more thought...than I, though I have never published anything on it. The assumption that the sum of three angles is less ◦ than 180 leads to a curious geometry, quite different from ours (the Euclidean), but thoroughly consistent...
Gauss is, in fact, referring to hyperbolic geometry, and it is another of his lost discoveries. The credit went instead to Bolyai junior and Lobachevsky. In 1831, Gauss was moved to publish his findings, as it appears in a letter to Schumacher: I have begun to write down during the last few weeks some of my own meditations, a part of which I have never previously put in writing, so that already I have had to think it all through anew three or four times. But I wished this not to perish with me. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
34 A New Perspective on Relativity
But it was too late, before Gauss could finish his paper, a copy of Bolyai’s Appendix arrived. Gauss’s reply to Wolfgang Bolyai senior unveils his disappointment: If I commenced by saying that I am unable to praise this work, you would certainly be surprised for a moment. But I cannot say otherwise. To praise it, would be to praise myself. Indeed the whole contents of the work, the path taken by your son, the results to which he is led, coincide almost entirely with my meditations, which have occupied my mind partly for the last thirty or thirty-five years. So I remained quite stupefied...it was my idea to write down all this later so that at least it should not perish with me. It is therefore a pleasant surprise for me that I am spared the trouble, and I am very glad that it is just the son of my old friend, who takes precedence of me in such a remarkable manner.
Even more mysterious is why Gauss failed to help the younger Bolyai gain recognition for his work. Was it out of jealousy or Gauss’s extreme prudence? Another person who was looking to the stars for confirmation that two intersecting lines can be parallel to another line was Lobachevsky. He, like Gauss, considered geometry on the same status of electrodynam- ics, that is, a science founded on experimental fact. Lobachevsky fully realized that deviations from Euclidean geometry would be exceedingly small, and, therefore, would need astronomical observations. Just as Gauss attempted to measure the angles of a triangle formed by three mountain- tops, Lobachevsky claimed that astronomical distances would be necessary to show that the sum of the angles of a triangle was less than two right angles. In 1831 Gauss deduced from the axiom that two lines through a given point can be parallel to a third line that the circumference of a circle is 2πR sinh r/R, where R is an absolute constant. By simply replacing R by iR, he obtained 2πR sin r/R, or the circumference of a circle of radius r on the sphere. The former will be crucial to the geometrical interpretation of the uniformly rotating disc that had occupied so much of Einstein’s thoughts. And we will see in Sec. 9.11 that Gauss’s expression for the hyperbolic circumference is what modern cosmologists confuse with the expansion factor of the universe. The first person to show that there was a complete correspondence between circular and hyperbolic functions was Taurinus in 1826, who was in Gauss’s small list of correspondents on geometrical matters. Although this lent credibility to hyperbolic geometry, neither Taurinus nor Gauss Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 35
felt confident hyperbolic geometry was self-consistent. In 1827 Gauss came within a hair’s breadth of what would latter be known as the Gauss–Bonnet theorem. This theorem shows that the surfaces of negative curvature pro- duce a geometry in which the angular defect is proportional to the area. Gauss was cognizant that a pseudosphere was such a surface, and Gauss’s student Minding latter showed that hyperbolic formulas for triangles are valid on the pseudosphere. But, a pseudosphere is not a plane, like the Euclidean plane, because it is infinite only in one direction. The exten- sion of the pseudosphere to a real hyperbolic plane came much later with Eugenio Beltrami’s exposition in 1868. So it was not clear to Gauss and his associates what this new geometry was, and, if, in fact, it was logically consistent. Gauss dabbled in many areas of physics and mathematics, and it would appear that his interests in electricity and non-Euclidean geome- tries are entirely disjoint. Who would have thought that these two lost discoveries might be connected in some way? Surely Poincaré did not and it is even more incredible because he developed two models of hyperbolic geometry that would have made the handwriting on the wall unmistakable to read.
1.2.2 Poincaré’s missed opportunities
Jules-Henri Poincaré began his career as a mathematician, and, undoubt- edly, became interested in physics because of the courses he gave at the Sorbonne. Poincaré was not a geometer by trade, but made a miraculous discovery that the Bolyai–Lobachevsky geometry which the geometers, Beltrami and Klein, were trying to construct already existed in mainstream mathematicians [Stillwell 96]. The tragedy is that he failed to see what he called a Fuchsian group was the same type of transform that Lorentz was using in relativity, and that he would be commenting on the latter without any recognition of the former.
1.2.2.1 From Fuchsian groups to Lorentz transforms Poincaré’s first encounter with hyperbolic geometry came when he was trying to understand the periodicity occurring in solutions to particular differential equations. The single periodicities of trigonometric functions Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
36 A New Perspective on Relativity
were well-known, and so too the double periodicities of elliptic functions. Double periodicity can be best characterized by tessellations consisting of parallelograms in the complex Euclidean plane whose vertices are multi- ples of the doubly periodic points. Poincaré found a new type of periodic function, which he called ‘Fuch- sian,’ after the mathematician Lazarus Immanuel Fuchs who first discov- ered them.d The periodic function is invariant under a group of substitu- tions of the form az + b z → , (1.2.1) cz + d for which ad − bc = 0, for otherwise it would result in a lack-luster con- stant mapping. Poincaré wanted to study this group of transformations by the same type of tessellations that elliptic functions could be charac- terized in the complex Euclidean plane. Only now the tessellation con- sists of curvilinear triangles in a disc, shown in Fig. 1.1, which Poincaré obtained from earlier work by Schwarz in 1872. The curvilinear triangles form right-angled pentagons which are mapped onto themselves by the linear fractional transformation, (1.2.1). As Poincaré tells us Just at the time I left Caen, where I was living, to go on a geological excursion ... we entered an omnibus to go some place or other. At the moment I put my foot on the step the idea came to me, without anything in my former thoughts seeming to have paved the way for it, that the transformation I had used to define Fuchsian functions were identical with those of non-Euclidean geometry.
The linear fractional transformations, (1.2.1), can be used to define a new concept of length for which the cells of the tessellation are all of equal size. The resulting geometry is precisely that of Bolyai–Lobachevsky which, through Klein’s renaming in 1871, has come to be known as hyperbolic geometry. If c = b and d = a, then the fractional linear transformation (1.2.1) becomes the distance-preserving and orientation-preserving map, with a2 − b2 = 1, of Poincaré’s conformal disc model of the hyperbolic plane D2-isometrics. What Poincaré failed to realize is that by interpreting z as the linear fractional transformation (1.2.1), with a = cosh and b = sinh , becomes precisely the transformation he named in honor of Lorentz, where
dAfter Klein informed Poincaré in May 1880 that there were groups of linear frac- tional transformations, other than those of Fuchs, Poincaré named them ‘groupes kleinéens,’ to the chagrin of Klein. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 37
Fig. 1.1. A tiling of the hyperbolic plane by curvilinear triangles that form right- angled pentagons.
the sides of any curvilinear triangle in Fig. 1.1 are proportional to the hyper- bolic measures of the three velocities in three different reference frames. Had Poincaré recognized this, it would have changed his mind about the ‘convenience’ of Euclidean geometry, and would have brought hyperbolic geometry into mainstream relativity. That is, given three bodies moving with velocities u1, u2 and u3, the corresponding triangle with curvilinear sides has as its vertices the points u1, u2 and u3. The relative velocities will correspond to the sides of the triangle and the angles between the velocities will add up to some- thing less than two right angles. It should also be appreciated that the square of the relative velocity is invariant under (1.2.1). Suppose that w is a relative velocity formed from the composition of u and v, then if these velocities are replaced by the velocities u and v relative to some other frame, the value of w will be unaffected by the change. In other words, the square of the relative velocity w is invariant under a Lorentz transformation. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
38 A New Perspective on Relativity
However, it never dawned on Poincaré that these curvilinear-shaped triangles might be relativistic velocity triangles for he kept mathematics and physics well separated in his mind. For he considered ... the axioms of geometry ... are only definitions in disguise. What then are we to think of the question: Is Euclidean geometry true? We might as well ask if the metric system is true and if the old weights and measures are false; if Cartesian coordinates are true and polar coordinates false. One geometry cannot be more true than another: it can only be more convenient.
Convenience was certainly not the answer.e
1.2.2.2 An author of E = mc2 Unquestionably the most famous formula in all of physics, its origins lie elsewhere than in Einstein’s [05b] paper “Does the inertia of a body depend upon its energy content?” John Henry Poynting [07] derived a relation between energy and mass from the radiation pressure around the turn of the twentieth century.Friedrich Hassenöhrl [04] obtained the effective mass 4 2 = 4 of blackbody radiation as 3 ε/c , where ε hν. The same factor of 3 was found by Comstock [08] from his electromagnetic analysis, and represents the sum of the energy and the work done by compression, the latter being equal to one-third of the energy in the ultrarelativistic limit. The sum of the two quantities is the enthalpy, as was first clearly stated by Planck [07], so in Einstein’s title ‘heat content,’ or enthalpy, should replace ‘energy content.’ Once again we find evidence of Poincaré’s priority in the derivation of the famous formula, and, as we have mentioned, Einstein’s recognition of it [cf. p. 23]. In the second edition of his text, Électricité et Optique, Poincaré [01] treats the problem of the recoil due to a body’s radiation. He considers the emission of radiation in a single direction, and in order to maintain fixed the center of gravity, the body recoils like an ‘artillery cannon’ (pièce d’artillerie). According to the theory of Lorentz, the amount of the recoil will not be negligible. Suppose, says Poincaré, that the artillery piece has a mass of 1 kg, and the radiation that is sent in one direction at the velocity
eStrangely, we find Einstein [22b] uttering the same words: “For if contradictions between theory and experience manifest themselves, we should rather decide to change physical laws than to change axiomatic Euclidean geometry.” Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 39
of light has an energy of three million Joules. Then, according to Poincaré, it will recoil a distance of 1 cm. Actually, the relation between ‘electromagnetic momentum’ and Poynting’s vector appears in a 1895 paper by Lorentz, which was com- mented and elaborated upon by Poincaré [00] in 1900. He derives the expression between the momentum density, G, and the energy flux, S,as
G = S/c2. (1.2.2)
Even earlier in 1893, J. J. Thomson refers to ‘the momentum’ arising from the motion of his Faraday tubes. It is only later that Abraham [03] intro- duced the term ‘electromagnetic momentum.’ Pauli [58] unjustly attributes (1.2.2) to Planck [07] as a theorem regarding the equivalence between momentum density and the energy flux density. According to Pauli, This theorem can be considered as an extended version of the principle of the equivalence of mass and energy.Whereas the principle only refers to the total energy, the theorem has also something to say on the localization of momentum and energy.
Since the magnitude of the energy flux, S = Ec, (1.2.2) becomes:
mv = E/c.
Then introducing m = 103 grams, E = 3×1013 ergs, and c = 3×1010 cm/sec, Poincaré finds v = 1 cm/sec for the recoil speed. Thus, Poincaré derived E = Gc, and if G is the momentum of radiation, G = mc, so that m = E/c2 is the mass equivalent to the energy of radiation. Poincaré was infatuated with the break-down of Newton’s third law, the equality between action and reaction, in his new mechanics. In a follow- up paper entitled, “The theory of Lorentz and the principle of reaction,” Poincaré [00] considers electromagnetic energy as a ‘fictitious fluid’ (fluide fictif) with a mass E/c2. The corresponding momentum is the mass of this fluid times c. Since the mass of this fictitious fluid was ‘destructible’ for it could reappear in other guises, it prevented him from identifying the fictitious fluid with a real fluid. What Poincaré could not rationalize became ‘fictitious’ to him. The lack of conservation of the fictitious mass prevented Poincaré from identifying it with real mass, which had to be conserved under all circumstances. What is conserved, however, is the inertia associated with the radiation that has produced the recoil of the artillery cannon. It is the Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
40 A New Perspective on Relativity
difference between the initial mass and what is radiated that is equal to the change in energy of the system. Ives [52] showed that m − m = m = E/c2, (1.2.3) where m is the change in mass after radiation, and E/c2 is the mass of the radiant energy, which follows directly from Poincaré’s 1904 relativity principle. The difference between the Doppler shift in the frequency due to a source moving toward and away from a fixed observer is: 1 1 + β 1/2 1 − β 1/2 νv ν = ν − = √ . (1.2.4) 2 1 − β 1 + β c (1 − β2)
The frequency shift becomes a nonlinear function of the velocity, just like the expression for the relativistic momentum. But here there is no mass present! The relation between frequency and energy was known at the time; it is given by Planck’s law, E = hν, so that (1.2.4) could be written as Ev h ν/c = √ = G, c2 (1 − β2) where G is momentum imparted to the artillery piece due to recoil. It is given by m G = √ v, (1 − β2) if (1.2.3) holds. The derivation is thus split into two parts: A nonrelativistic relation between mass and energy, (1.2.3), which depends only on the cen- tral frequency, ν, and a relativistic part that relates the size of the shift to the velocity, according to (1.2.4). It is through the difference in the Doppler shifts that the momentum acquires nonlinear dependency upon the velocity,
1 ¯ −¯ β (ev/c − e v/c) = sinh (v¯/c) = √ , (1.2.5) 2 (1 − β2) where v¯ is the hyperbolic measure of the velocity whose Euclidean measure is v. Equation (1.2.5) also indicates that c is the absolute constant of velocity Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 41
space. If we multiply (1.2.5) through by πc, it becomes Gauss’s expression for the semi-perimeter of a non-Euclidean circle of radius v¯, and absolute constant c, that he wrote in a letter to Schumacher in 1831 [cf. Eq. (9.11.24)]. Where is the mass dependence on velocity?
The Doppler shifts refer to a shift in frequency, the frequency is related to an energy, the energy is related to mass; that is, the mass equivalent of radiation. In fact, the attributed nonlinear dependence of mass on its speed, (1.2.4), can be obtained without mentioning mass at all!
Poincaré was ever so close to developing a true theory of relativity, but ultimately could not break loose of the classical bonds which held him. It is even a greater tragedy that he could not bridge the gap between his mathematical studies on non-Euclidean geometries and relativity that could have unified his lifelong achievements.
1.3 Exclusion of Non-Euclidean Geometries from Relativity
Neither Whittaker, nor Pais, gave any reference to the potential role that non-Euclidean geometries could have played in relativity. Pais pays lit- tle tribute to Hermann Minkowski other than saying that Einstein had a change in heart; rather than considering the transcription of his theory into tensorial form as ‘superfluous learnedness’ (überflüssige Gelehrsamkeit), he later claimed it was essential in order to bridge the gap from his special to general theories. Minkowski, in his November 1907 address to the Göttingen Mathe- matical Society, began with the words “The world in space and time is, in a certain sense, a four-dimensional non-Euclidean manifold” [cf. p. 37]. The invariance of the hyperboloid of space-time from the Lorentz transform was identified as a pseudosphere of imaginary radius, or a surface of negative, constant curvature. It is plain from Whittaker’s formulas that the Lorentz transformation consists of a rotation through an imaginary angle. Poincaré too viewed the Lorentz transformation as a rotation in four- dimensional space-time about an imaginary angle and that the ratio of the space to time transformations gave the relativistic law of velocity addition. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
42 A New Perspective on Relativity
But, he could not bring himself to identify the velocity as a line element in Lobachevsky space. Edwin Wilson, who was J. Willard Gibbs’s last doctoral student, and Lewis [12] felt the need to introduce a non-Euclidean geometry for rota- tions, but not for translations. They assumed, however, that Euclid’s fifth postulate (the parallel postulate) held, and, therefore, excluded hyper- bolic geometry from the outset, even though their space-time rotations are through an imaginary angle. Had they realized that their non-Euclidean geometry was hyperbolic they would have retracted the statement that “Through any point on a given line one and only one parallel (non- intersecting) line can be drawn.” It would have also saved them the trouble of inventing a new geome- try for the space-time manifold of relativity. They do, in fact, disagree with Poincaré that it is, however, inconsistent with the philosophic spirit of our time to draw a sharp distinction between that which is real and that which is convenient, and it would be dogmatic to assert that no discoveries of physics might render so convenient as to be almost imperative the modification or extension of our present system of geometry.
Neither their plea nor paper had a sequel. In the last of his eight lectures, delivered at Columbia University in 1909, we listened to Planck’s animosity toward non-Euclidean geometries. Although blown up, and completely out of proportion, Planck was making a statement that he does not want any infringement on the special theory of relativity by mathematicians. Where would this infringement come from? From nowhere else than the Göttingen school of mathematicians, notably Felix Klein. The Hungarian Academy of Science established the Bolyai Prize in mathematics in 1905. The commission was made up of two Hungarians and two foreigners, Gaston Darboux and Klein. The contenders were none other than Poincaré and Hilbert. Although the prize went to Poincaré, his old friend Klein refused to present him with it citing ill health. According to Leveugle [04] it would have meant that Klein had to publicly admit Poincaré’s priority over Einstein to the principle of relativity, and the group of transformations that has become known as the Lorentz group, a name coined by Poincaré in honor of his old friend. This would not have been Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 43
received well by the Göttingen school for not only did Hilbert come in at second place, it would have been a debacle of all their efforts to retain relativity as a German creation. Arnold Sommerfeld, a former assistant to Klein, showed in 1909 that the famous addition theorem of velocities, to which Einstein’s name was now attached, was identical to the double angle formula for the hyperbolic tangent. The velocity parallelogram closes only at low speeds. This was the first demonstration that hyperbolic geometry def- initely had a role in relativity, and its Euclidean limit emerged at low speeds. Now Sommerfeld would surely have known that the hyperbolic tan- gent is the straight line segment in Lobachevsky’s non-Euclidean geometry. Acknowledgment of his former supervisor’s interest in relativity surfaced in the revision of Pauli’s [58, Footnote 111] authoritative Mathematical Ency- lopedia article on relativity where he wrote: This connection with the Bolyai–Lobachevsky geometry can be briefly described in the following way (this had not been noticed by Variˇcak): If one interprets dx1, dx2, dx3, dx4 as homogeneous coordinates in a three-dimensional projective space, then the invariance of the equation (dx1)2 + (dx2)2 + (dx3)2 − (dx4)2 = 0 amounts to introducing a Cayley system of measurement, based on a real conic section. The rest follows from the well-known arguments of Klein.
Sommerfeld just could not resist rewriting the history of relativity. He changed Minkowski’s opinion of the role Einstein had in formulating the principle of relativity. Quite inappropriately he inserted a phrase praising Einstein for having used the Michelson experiment to show that a state of absolute rest, where the immobile aether would reside, has no effect on physical phenomena [Pyenson 85]. He also exchanged the role of Einstein as the clarifier with that as the originator of the principle of relativity.f A much more earnest attempt to draw hyperbolic geometry into the mainstream of relativity was made by Vladimir Variˇcak. Variˇcak says that
fAnd Sommerfeld’s revisions did not stop at relativity. Writing in the obituary col- umn of the recently deceased Marion von Smolukowski, Sommerfeld lauds Einstein for his audacious assault on the derivation of the coefficient of diffusion in Brow- nian motion, “without stopping to bother about the details of the process.” Von Laue, writing in his History of Physics clearly states that Smolukowski developed a statistical theory of Brownian motion in 1904 “to which Einstein gave definitive form (1905).” Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
44 A New Perspective on Relativity
even before he heard Minkowski’s 1907 talk, he noticed the profound anal- ogy between hyperbolic geometry and relativity.At low velocities, the laws of mechanics reduce to those of Newton, just as Lobachevskian geome- try reduces to that of Euclidean geometry when the radius of curvature becomes very large. To Variˇcak, the Lorentz contraction appears as a defor- mation of lengths, just as the line segment of Lobachevskian geometry is bowed. Taking the line element of the half-plane model of hyperbolic geom- etry, Variˇcak says that it cannot be moved around without deformation. Thus, he queries whether the Lorentz contraction can be understood as an anisotropy of the (hyperbolic) space itself. Variˇcak also appreciates that in relativity the velocity parallelogram does not close; hence, it does not exist, and must be replaced by hyperbolic addition, which is the double angle formula of the hyperbolic tangent. Relativity abandons the absolute, but does introduce an absolute velocity, c; this corresponds to the absolute constant in the Lobachevsky velocity space. Owing to the fact that an inhabitant of the hyperbolic plane would see no distortion to his rulers as he moves about because his rulers would shrink or expand with him, Variˇcak questions the reality of the Lorentz transform. To Variˇcak, the “contraction is, so to speak, only a psychological and not a physical fact.” Although known non-Euclidean geometries were not entertained by Einstein, Variˇcak’s formulation should have raised eye- brows. But it did not. The only thing that it would do, by questioning the reality of the space contraction, would be to cause confusion, and this pro- voked a response by Einstein himself. But whose confusion did he abate? Apart from optical applications referring to the Doppler shift and aberration, which were already contained in Einstein’s 1905 paper in a different form, Variˇcak produced no new physical relations or new insights into old ones. These factors led to the demise of the hyperbolic approach to relativity, as far as physicists were concerned. However, there was an isolated incident in 1910, where Theodor Kaluza [10] draws an analogy between a uniformly rotating disc and Lobachevskian geometry. Kaluza writes the line element as
r2 dϕ 2 1 + dr,(*) 1 ± r2 dr Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 45
which at constant radius becomes
r2 dϕ. (**) 1 ± r2
If Kaluza wants to show that the circumference of a hyperbolic circle is greater than its Euclidean counterpart, he has to choose the negative sign in expression (*), bring out the dr from under the square root, and remove the square in the numerator of (**). Apart from these typos, and the fact that the first factor in (*) had to be divided by (1 − r2)2, Kaluza was the first to draw attention to the fact that the hyperbolic metric of constant curvature describes exactly a uniformly rotating disc. The paper was stillborn. Another unexplainable event is that Einstein entered into a mathe- matical collaboration with his old friend, Marcel Grossmann, to develop a Riemannian theory of general relativity. Grossmann was an expert in non-Euclidean geometries; so why did he not set Einstein on the track of looking at known non-Euclidean metrics instead of putting him on the track of Riemannian geometry? Probably Einstein wanted the general the- ory to reduce to Minkowski’s metric in the absence of gravity which meant that the components of the metric tensor reduce to constants. But that meant he was fixing the propagation of gravitational interactions at the speed of light. Grossmann is, however, usually remembered for having led Einstein astray in rejecting the Ricci tensor as the gravitational tensor [Norton 84]. In order for it to reproduce correctly the curvature of ‘space-time,’ the coefficients would have to be (nonlinear) functions of space, and maybe even of time. According to Einstein the Riemannian metric should play the role of the gravitational field. Curvature would be a manifestation of the presence of mass–energy so that if he could find a curvature tensor, comprising of the components of the metric tensor, then by setting it equal to a putative energy–momentum tensor he could find the components of the metric tensor, and thereby determine the line element. Such an equation would combine time and space with energy and momentum. The rest is history and has been too amply described by historians of science. Since the metric has ten components, the search was on for a curvature tensor with the same number of components. The contraction of the Riemann–Christoffel tensor into the Ricci tensor, Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
46 A New Perspective on Relativity
having ten components, seemed initially as a good bet to be set equal the energy–momentum tensor. Setting the Ricci tensor equal to zero was made a condition for the emptiness of space. It constitutes Einstein’s law of grav- itation, and as Dirac [75] tells us ‘Empty’ here means that there is no matter present and no physical fields except the gravitational field. [italics added] The gravitational field does not disturb the emptiness. Other fields do.
So gravity can act where matter and radiation are not! When the field is not empty, setting the Ricci tensor equal to the energy–momentum tensor leads to inconsistencies insofar as energy– momentum is not conserved. If the Ricci tensor vanishes then so do all that is related to it, like the scalar, or total, curvature. Einstein found that if he subtracted one-half the curvature-invariant from the Ricci tensor and set it equal to the energy–momentum tensor, then energy–momentum would be conserved. The equipment needed to carry out the program involves, curvi- linear coordinates, parallel displacement, Christoffel symbols, covariant differentiation, Bianchi relations, the Ricci tensor and its contraction, plus a knowledge of what the energy–momentum tensor is. The only outstand- ing solution is known as the Schwarzschild metric, in which the metric is constructed on solving the ‘outer’ and ‘inner’ solutions [cf. Secs. 9.10.3 and 9.10.4].All the known tests of general relativity are independent of the time- component of the metric, except for the gravitational shift of spectral lines, which is independent of the spatial component. The latter was predicted by Einstein in 1911, prior to his general theory of relativity. However, it does not follow from the Doppler shift so Einstein was either uncannily lucky, or the true explanation lies elsewhere. Viewed from a pseudo-Euclidean point of view, there is a clear distinction between special and general relativity. Within the hyperbolic framework, this separation between inertial and noninertial ones becomes blurred. This is because the uniformly rotating disc is, as Stachel [89] claims, the missing link to Einstein’s general theory. That the Beltrami metric describes exactly the uniformly rotating disc, means that hyperbolic geom- etry is also the framework for noninertial systems. We have already seen Planck’s hostility to non-Euclidean geometries. There was also Wilhelm Wien, Planck’s assistant editor of the Annalen, who Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 47
insisted that relativity has “no direct point of contact with non-Euclidean geometry,” and Arnold Sommerfeld who considered the reinterpretation of relativity in terms of non-Euclidean geometry could “be hardly recom- mended.” Authoritarianism carried the day and non-Euclidean geometry was shelved for good. It is the purpose of this monograph to show that non-Euclidean geometries make inroads into relativistic phenomena and warrant our attention.
References [Abraham 03] M. Abraham, “Prinzipien der Dynamik des Elektrons,” Ann. der Phys. 10 (1903) 105–179. [Abraham 12] M. Abraham, “Relativität und Gravitation. Erwiderung auf eine Bemerkung des Herrn A. Einstein,” Ann. der Phys. 38 (1912) 1056–1058. [Auffray 99] J.-P. Auffray, Einstein et Poincaré: Sur des Traces de la Relativité (Le Pommier, Paris, 1999), pp. 131, 133. [Born 51] M. Born, “Physics in my generation, the last fifty years,” Nature 268 (1951) 625. [Born 62] M. Born, Einstein’s Theory of Relativity (Dover, New York, 1962), p. 278. [Brillouin 60] L. Brillouin, Wave Propagation and Group Velocity (Academic Press, New York, 1960), p. 143. [Bucherer 04] A. H. Bucherer, Mathematische Einführung in die Elektronentheorie (Teubner, Leipzig, 1904), p. 50, Eq. (91a). [Cerf 06] R. Cerf, “Dismissing renewed attempts to deny Einstein the discovery of special relativity,” Am. J. Phys. 74 (2006) 818–824. [Comstock 08] D. F. Comstock, “The relation of mass to energy,” Phil. Mag. 15 (1908) 1–21. [Cushing 81] J. T. Cushing, “Electromagnetic mass, relativity, and the Kaufmann experiments,” Am. J. Phys. 49 (1981) 1133–1149. [de Broglie 51] L. de Broglie, Savants et Découvertes (Albin Michel, Paris, 1951), p. 50. [Dirac 75] P. A. M. Dirac, General Theory of Relativity (Wiley-Interscience, New York, 1975), p. 25. [Dirac 86] P. A. M. Dirac, Collection Dedicated to Einstein, 1982-3 (Nauka, Moscow, 1986), p. 218. [Earman & Glymour 78] J. Earman and C. Glymour, “Lost in the tensors: Ein- stein’s struggles with covariance principles” Stud. Hist. Phil. Sci. 9 (1978) 251–278. [Einstein 05a] A. Einstein, “On the electrodynamics of moving bodies,” Ann. der Phys. 17 (1905); transl. in W. Perrett and G. B. Jeffrey, The Principle of Rela- tivity (Methuen, London, 1923). [Einstein 05b] “Does the inertia of a body depend upon its energy content?,” Ann. der Phys. 18 (1905) 639–641; translated in The Collected Papers of Albert Einstein: The Swiss Years, Vol. 2 (Princeton U. P., Princeton NJ, 1989), pp. 172–174. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
48 A New Perspective on Relativity
[Einstein 06a] A. Einstein, “On a method for the determination of the ratio of the transverse and longitudinal mass of the electron,” Ann. der Phys. 21 (1906) 583–586; translated in The Collected Papers of Albert Einstein, Vol. 2 (Princeton U. P., Princeton NJ, 1989), pp. 207–210. [Einstein 06b] A. Einstein, “Le principe de conservation du mouvement du centre de gravité ed l’inertie de l’energie,” Ann. der Phys. 20 (1906) 627–633. [Einstein 07] A. Einstein, “On the relativity principle and the conclusions drawn from it,” Jahrbuch der Radioaktivität und Elektronik 4 (1907) 411–462; trans- lated in The Collected Papers of Albert Einstein, Vol. 2 (Princeton U. P., Princeton NJ, 1989), pp. 252–311. [Einstein 11] A. Einstein, “On the influence of gravitation on the propagation of light,” Ann. der Phys. 35 (1911); translated in W. Perrett and G. B. Jeffrey, The Principle of Relativity (Methuen, London, 1923), pp. 99–108. [Einstein 16] A. Einstein, “The foundation of the general theory of relativity,” Ann. der Phys. 49 (1916); translated in W. Perrett and G. B. Jeffrey, The Principle of Relativity (Methuen, London, 1923), pp. 111–173. [Einstein 22a] A. Einstein, “Aether and the theory of relativity,” translated in G. B. Jeffrey and W. Perrett, Sidelights on Relativity (E. P. Dutton, New York, 1922), pp. 1–24. [Einstein 22b] A. Einstein, “Geometry and experience,” translated in G. B. Jeffrey and W. Perrett, Sidelights on Relativity (E. P. Dutton, New York, 1922), pp. 27–56. [Essen 71] L. Essen, The Special Theory of Relativity: A Critical Analysis (Clarendon Press, Oxford, 1971). [Goldberg 67] S. Goldberg, “Henri Poincaré and Einstein’s theory of relativity,” Am. J. Phys. 35 (1967) 934–944. [Gray 07] J. Gray, Worlds Out of Nothing (Springer, London, 2007), p. 252. [Hassenöhrl 04] F. Hassenöhrl, “Zur Theorie der Strahlung in bewegten Körpern,” Ann. der Phys. 320 (1904) 344–370; Berichtigung, ibid. 321 589–592. [Hassenöhrl 09] F. Hassenöhrl, “Bericht über dei Trägheit der Energie,” Jahrbuch der Radioactivität 6 (1909) 485–502. [Hertz 93] H. Hertz, Electric Waves (Macmillan, London, 1893). [Holton 88] G. Holton, Thematic Origins of Scientific Thought (Harvard U. P., Cambridge MA, 1988). [Ives 51] H. Ives, “Revisions of the Lorentz transformations,” Proc. Am. Phil. Soc. 95 (1951) 125–131. [Ives 52] H. Ives, “Derivation of the mass–energy relation,” J. Opt. Soc. Am. 42 (1952) 540–543. [Janssen 02] M. Janssen, “Reconsidering a scientific revolution: The case of Einstein versus Lorentz,” Phys. Perspect. 4 (2002) 424–446. [Kaluza 10] Th. Kaluza, “Zur Relativitätstheorie,” Physik Zeitschr. XI (1910) 977–978. [Leveugle 94] J. Leveugle, “Poincaré et la relativité,” La Jaune et la Rouge 494 (1994) 31–51. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
Introduction 49
[Leveugle 04] J. Leveugle, La Relativité, Poincaré et Einstein, Planck, Hilbert. Histoire Véridique del la Théorie de la Relativité (l’Harmattan, Paris, 2004). [Lewis 08] G. N. Lewis, “Arevision of the fundamental laws of matter and energy,” Phil. Mag. 16 (1908) 705–717. [Lewis 26] G. N. Lewis, “The conservation of photons,” Nature 118 (1926) 874–875. [Logunov 2001] A. A. Logunov, On the Articles by Henri Poincaré “On the dynamics of the electron”, 3rd ed. (Dubna, 2001). [Marchal] C. Marchal, “Poincaré, Einstein and the relativity: A surprising secret,” (http:// www.cosmosaf.iap,fr/Poincare.htm). [McCormmach 70] R. McCormmach, “Einstein, Lorentz and the electromagnetic view of Nature,” Hist. Studies Phys. Scis. 2 (1970) 41–87. [Miller 73] A. I. Miller, “A study of Henri Poincaré’s ‘Sur la Dynamique de l’Électron,”’ Arch. His. Exact Sci. 10 (1973) 207–328. [Miller 81] A. I. Miller, Albert Einstein’s Special Theory of Relativity (Addison- Wesley, Reading MA, 1981), p. 254. [Northrop 59] F. C. Northrop, “Einsteins’s conception of science,” in ed. P. A. Schillip, Albert Einstein Philosopher-Scientist, Vol. II, (Harper Torchbooks, New York, 1959), p. 388. [Norton 84] J. Norton, “How Einstein found his field equations,” Stud. Hist. Phil. Sci. 14 (1984) 253–284. [Norton 04] J. D. Norton, “Einstein’s investigations of Galilean covariant electro- dynamics prior to 1905,” Arch. His. Exact Sci. 59 (2004) 45–105. [Ohanian 08] H. C. Ohanian, Einstein’s Mistakes: The Human Failings of Genius (W. W. Norton & Co., New York, 2008), p. 84. [Pais 82] A. Pais, Subtle is the Lord (Oxford U. P., Oxford, 1982), p. 381. [Pauli 58] W. Pauli, Theory of Relativity (Dover, New York, 1958), p. 125. [Planck 07] M. Planck, “Zur Dynamik bewegter Systeme,” Berliner Sitzungsberichte Erster Halbband (29) (1907) 542–570; see also, B. H. Lavenda, “Does the inertia of a body depend on its heat content?,” Naturwissenschaften 89 (2002) 329–337. [Planck 98] M. Planck, Eight Lectures on Theoretical Physics (Dover, New York, 1998), p. 120. [Poincaré 98] H. Poincaré, “La mesure du temps,” Rev. Mét. Mor. 6 (1898) 371–384. [Poincaré 00] H. Poincaré, “The theory of Lorentz and the principle of reaction,” Arch. Néderland. Sci. 5 (1900) 252–278. [Poincaré 01] H. Poincaré, Électricité et Optique: La Lumière et les Théories Électrody- namiques, 2 ed. (Carré et Naud, Paris, 1901), p. 453. [Poincaré 04] H. Poincaré, “L’état actuel et l’avenir de la physique mathema- tique,” Bulletin des sciences mathématiques 28 (1904) 302–324; translation “The principles of mathematical physics,” Congress of Arts and Sci- ence, Universal Exposition, St. Louis, 1904 Vol. 1, (1905) pp. 604–622 (http://www.archive.org/details/ congressofartssc01inte). [Poincaré 05] H. Poincaré, “Sur la dynamique de l’électron,” Comptes Rend. Acad. Sci. Paris 140 (1905) 1504–1508. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch01
50 A New Perspective on Relativity
[Poincaré 06] H. Poincaré, “Sur la dynamique de l’électron,” Rend. Circ. Mat. Palermo 21 (1906) 129–175. [Poincaré 52] H. Poincaré, Science and Hypothesis (Dover, New York, 1952), pp. 70–71; translated from the French edition, 1902. [Poincaré 54] H. Poincaré, Oeuvres (Gauthier-Villars, Paris, 1954), p. 572. [Poynting 07] J. H. Poynting, The Pressure of Light, 13th Boyle Lecture delivered 30/05/1906 (Henry Frowde, London, 1907); (Soc. Promo. Christ. Know., London, 1910). [Pyenson 85] L. Pyenson, The Young Einstein: The Advent of Relativity (Adam Hilger, Bristol, 1985). [Schribner 64] C. Scribner, Jr, “Henri Poincaré and the principle of relativity,” Am. J. Phys. 32 (1964) 672–678. [Sommerfeld 04] A. Sommerfeld, “Überlichtgeschwindigkeitsteilchen,” K. Akad. Wet. Amsterdam Proc. 8 (1904) 346 (translated from Verslag v. d. Gewone Ver- gadering d. Wis-en Natuurkundige Afd. 26/11/1904, Dl. XIII); Nachr. Wiss. Göttingen 25/02/1904, 201–235. [Stachel 89] J. Stachel, “The rigidly rotating disc as the ‘missing link’ in the history of general relativity,” in Einstein and the History of General Relativity, eds. D. Howard and J. Stachel (Birhaüser, Basel, 1989). [Stillwell 89] J. Stillwell, Mathematics and Its History (Springer, New York, 1989), p. 311. [Stillwell 96] J. Stillwell, Sources of Hyperbolic Geometry (Am. Math. Soc., Providence RI, 1996), p. 113. [Stranathan 42] J. D. Stranathan, The ‘Particles’ of Modern Physics (Blakiston, Philadelphia, 1942), p. 137. [Thomson 28] J. J. Thomson and G. P. Thomson, Conduction of Electricity Through Gases, 3rd ed. (Cambridge U. P., Cambridge, 1928), p. 439. [Variˇcak 10] V. Variˇcak, “Application of Lobachevskian geometry in the theory of relativity,” Physikalische Zeitschrift 11 (1910) 93–96. [Walter 99] S. Walter, “The non-Euclidean style of Minkowskian relativity,” in J. Gray, ed. The Symbolic Universe (Oxford U. P., Oxford, 1999), pp. 91–127. [Weisskopf 60] V.F. Weisskopf, “The visual appearance of rapidly moving objects,” Phys. Today, Sept. 1960, 24–27. [Whitakker 53] E. Whittaker, A History of the Theories of Aether and Electricity, Vol. II The Modern Theories 1900–1926 (Thomas Nelson & Sons, London, 1953), p. 38. [Wilson & Lewis 12] E. B. Wilson and G. N. Lewis, “The space-time manifold of rel- ativity. The non-Euclidean geometry of mechanics and electrodynamics,” Proc. Am. Acad. Arts and Sci. 48 (1912) 387–507. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Chapter 2 Which Geometry?
2.1 Physics or Geometry 2.1.1 The heated plane
In La Science et l’Hypothèse Henri Poincaré [68] argued for the ‘passivity’ of physical space. Since all measurements involve both physical and geo- metrical assumptions, Poincaré considered it meaningless to ask whether space was Euclidean or non-Euclidean. We might try to measure the sum of the angles of a triangle formed by three hill tops and check to determine ◦ whether their sum was greater or less than 180 . In fact, Gauss attempted such a measurement. He measured the sum of the angles of a triangle formed by the three peaks of Broken, Hohehangen and Inselsberg. The sides of the triangle were 69, 85 and ◦ 197 km. Gauss determined that the sum exceeded 180 by 14 85. How- ever, to the chagrin of Gauss, the experiment was inconclusive since the experimental error was greater than the excess he found. In fact, the sum ◦ could have as well as been less than 180 . The triangle was too small, since as Gauss realized, the defect is proportional to its area, and only a big tri- angle, of astronomical proportions, could be used to settle the question of whether the geometry of the universe is Euclidean or not. Poincaré was more indecisive in that he reasoned that any defect which could be revealed could equally as well be the consequence of the fact that light rays do not always travel in straight paths. It is this type of reasoning that was used against Poincaré, and from being denied the dis- covery of relativity. For we have seen in 1.2.2 that many of the concepts that were attributed to Einstein rightly belong to Poincaré, such as the veloc- ity addition theorem, for which uniform motion is undetectable as far as physical laws are concerned, and the axiom that nothing can travel faster
51 Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
52 A New Perspective on Relativity
than light. His willingness to change a physical law so as to suit Euclidean geometry is responsible for his secondary role in twentieth century science. But not a word was muttered when Einstein [22] was found agreeing with Poincaré’s philosophy. To make his point, Poincaré considered an imaginary universe in the interior of a sphere of radius R. In such a universe, at any point p, its temperature would be given by T(p) = k(R2 − r2), where k is a positive constant and r is the Euclidean distance from the center of the sphere to the point p in question. He also assumed that the linear dimensions of the body vary with the temperature at the point where the body is found so that as one moves from the center of the body to the surface he becomes colder and contracts. In fact, it would take him an infinite amount of time to reach the surface. Even worse he cannot detect his shrinkage because the measuring sticks he uses shrink along with him. To our traveler, the universe appears infinite. We know that in Euclidean space the shortest path between two points is a straight line. But, because of the shrinkage, these geodesics, which are by definition the paths of shortest distance between pairs of points in , are not straight lines, but are curves bent inward toward the center of . Actually, they are circular arcs that cut the boundary normally. This is shown in Fig. 2.1 where the bug’s right legs are shorter than his left so even though he thinks he is traveling in a straight path, the unequal lengths of his legs cause him to follow a circular arc AB.
Fig. 2.1. A bug’s life in the heated disk; ‘hot’ in the center and ‘cold’ on the disc. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 53
However, to the bug, his right legs do not appear to be shorter than his left legs because his measuring tools also contract as things get colder. But, to us Euclideans, it appears that the bug’s right legs are shorter than his left legs because we are using Euclidean measuring sticks.
So even though we owe this model of hyperbolic geometry to Poincaré, he failed to find it physically attractive. The question he posed “Which geometry is correct?” was answered by him with another question: “Which geometry is more convenient?” And unhesitatingly Poincaré clung to Euclidean geometry as the true geometry which Nature chooses. So that if we find a discrepancy between a physical law and Euclidean geometry we must be willing to change the former so as to preserve the latter. In effect, Poincaré was debasing his models of hyperbolic geometry, along with those of Beltrami and Klein, as having no physical relevance. Suppose that we have to deal with a rather large metal sheet which is not at a uniform temperature. Take one edge of the sheet and label it the x-axis, and consider its normal y-axis to vary with temperature in the following way:
1 T = by − , p
where b and p are constants. Suppose also that the metallic sheet is fixed in such a way that it cannot bend or buckle. Lastly, we are given a measuring stick made of another metal whose thermal coefficient of expansion is p. How can we use this stick to determine the nature of the geometry of the sheet? It would be better to have a measuring rod whose coefficient of ther- mal expansion were zero, but not having one we are left to make measure- ments with this imperfect rod. We therefore inquire how to make consistent measurements. There are two ways:
(i) At some standard temperature, which we take to be zero degrees cel- sius, the measuring stick has a length ds. But because there will be points with higher temperatures, the true length at any point (x, y) will be
ds = (1 + pT)ds = pby ds. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
54 A New Perspective on Relativity
This choice allows us to maintain our Euclidean measure on the surface by allowing for a temperature correction factor pby. This is our physical law which allows us to preserve the Euclidean nature of the geometry. (ii) We make measurements without taking into account the changes in the length of the rod. Here, we clearly rule out that there are changes in matter due to heat variations, and look to the geometry to make the necessary modifications.
If we opt for the second choice we realize that the x-axis represents absolute cold, which corresponds to a line at infinity. If we try to make measurements using a rod parallel to the y-axis we find that the stick will become shorter and shorter as it approaches the x-axis, so that it will appear as a line at infinity because it is infinitely far away. The prime interest of a geometer is to create an object, such as a tri- angle, made up of the measuring sticks, that when moved over the surface remains congruent. We shall refer to such displacements as motions, of which we will be interested primarily in infinitesimal ones. But, we must first determine how we measure distance, or define a metric for the space. Wantingto keep as close as possible with a Euclidean measure we might try: √ pby ds = (dx2 + dy2).
If we agree to a choice of units where pb = 1, then √ (dx2 + dy2) ds = . (2.1.1) y This ‘distance’ increases without limit as y → 0. For x constant, the ‘distance’ along vertical lines increases exponentially in comparison with its Euclidean counterpart. For example, the adjacent distances between = 1 1 = y 1, 2 , 4 , ...at x 0 are all equal. We now want the invariance property of this metric to determine the permissible motions. Consider, for instance, a point transformation:
x = x (x, y), y = y (x, y). (2.1.2)
We want this point transformation to conserve distance; the condition is: dx 2 + dy 2 dx2 + dy2 = . (2.1.3) y2 y2 Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 55
Obviously, this implies the invariance of distance, but we want something more. We want it also to preserve angles, meaning we want it to be confor- mal. For later use, observe that 1 ∂x 2 ∂y 2 1 + = , y 2 ∂x ∂x y2 ∂x ∂x ∂y ∂y + = 0, (2.1.4) ∂x ∂y ∂x ∂y 1 ∂x 2 ∂y 2 1 + = . y 2 ∂y ∂y y2
Now consider two infinitesimal displacements, (d1x, d1y) and (d2x, d2y) drawn from the point (x, y) and making an angle θ, and the cor- responding ones (d1x , d1y ) and (d2x , d2y ) drawn from (x , y ) making a corresponding angle θ . In order for the transformation to be conformal, we require the cosines of the two angles,
+ d1xd2x d1yd2y y2 cos θ = √ 2+ 2 2+ 2 , (2.1.5a) d1x d1y d2x d2y y2 y2
and + d1x d2x d1y d2y y 2 cos θ = √ 2+ 2 2+ 2 , (2.1.5b) d1x d1y d2x d2y y2 y2
to be equal, where we have divided numerator and denominator by y2 and y 2, respectively, in order to be able to use (2.1.3). That is, on account of (2.1.3) the denominators in (2.1.5a) and (2.1.5b) are equal so it remains only to show that the condition,
1 1 (d1x d2x + d1y d2y ) = (d1xd2x + d1yd2y), (2.1.6) y2 y2
holds. If we introduce, dx ∂x ∂y ∂y dx = dx + dy, dy = dx + dy, ∂x ∂y ∂x ∂y Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
56 A New Perspective on Relativity
into the left-hand side of (2.1.6) and use (2.1.4) it becomes evident that the left side coincides with the right side thereby establishing the conformality of the point transformation (2.1.2). Regarding motions, it is easily seen that magnification x → ax, with a ≥ 0, and translation x → x + s, with s real, are two possible motions. It is well-known that in a two-dimensional space, like the one we are consider- ing, if there exist two independent motions then there must be a third. This third is called inversion and it states that if there are two points connected by a straight line to the origin of a circle whose circumference divides the two points then the product of the distances that the two points are from the origin is equal to the square of the radius. Inversion introduces the notion of anti-congruence. The basic motions are most easily expressed in terms of complex vari- ables z = x + iy and w = x + iy , viz.
translation: w = z + s (s ∈ R) magnification: w = az (a ≥ 0) (2.1.7) inversion: w = 1/z¯,
where z¯ is the complex conjugate of z. These three independent motions imply that any two-dimensional object in the space may be shifted, magnified and turned inside-out and still remain congruent if the number of inversions is even, or anti-congruent if the number of inversions is odd. These motions give the object complete freedom of movement. If we generalize the concept of inversion to include the product of an inversion in the unit circle, a translation by an amount c,
z w = , cz + 1
and another inversion then we can construct a generic displacement as a product of the fundamental motions involving an even num- ber of inversions. Such a generalized displacement will have the form [cf. (1.2.1)]:
az + b w = , (2.1.8) cz + d Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 57
where a, b, c, d are real numbers and the determinant = ad − bc > 0. The linear fractional transformation is known as a Möbius transform, and it will play a prominent role in what follows. The Möbius transform (2.1.8) can be obtained by the following ele- mentary motions [Archbold 70]:
z1 = cz, (magnification)
z2 = z1 + d, (translation)
z3 = 1/z2, (inversion) z = z , 4 c 3
and so a w = − z , c 4
which is (2.1.8). The displacement of any object in our space requires knowing the position of the object, and the alignment of a particular direction with that of an arbitrarily chosen direction in the space. For this to be accom- plished we need three parameters, and the corresponding group is a three- parameter group. An object enjoying free mobility in such a space is a congruent space and its geometry is a congruent geometry. We will now dig deeper into the notions of these motions by transferring to the complex plane.
2.2 Geometry of Complex Numbers 2.2.1 Properties of complex numbers = + An ordered√ pair (x, y) is called a complex number, z x iy. The modulus of z is r = (x2 + y2). The number θ, defined by cos θ = x/r and sin θ = y/r is called the amplitude or argument (arg)ofz. In terms of r and θ the complex number can be expressed as z = reiθ = r( cos θ + i sin θ), and de Moivre’s theorem follows:
(cos θ + i sin θ)n = cos nθ + i sin nθ. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
58 A New Perspective on Relativity
If z1 and z2 are any two complex numbers then
|z1z2|=|z1|·|z2|,
|z1/z2|=|z1|/|z2|,
arg (z1z2) = arg z1 + arg z2,
arg (z1/z2) = arg z1 − arg z2.
The last two properties recall the property of logarithms, which we shall shortly return to. Moreover, the product of a complex number z and its complex con- jugate z¯ is zz¯ =|z|2. The square of the absolute value of the sum of two complex numbers is:
2 ¯ ¯ ¯ ¯ ¯ ¯ |z1 + z2| = (z1 + z2)(z1 + z2) = z1z1 + z2z2 + z1z2 + z1z2 2 2 ¯ ¯ =|z1| +|z2| + z1z2 + z1z2 2 ¯ 2 =|z1| + 2Re(z1z2) +|z2| 2 2 ≤|z|1 + 2|z1||z2|+|z2| = (|z1|+|z2|) ,
¯ ¯ since Re(z1z2) ≤|z1z2|=|z1||z2|. Taking the positive square roots and observing that |z|=|z¯| gives the triangle inequality:
| | | | | | z1 + z2 ≤ z1 + z2 . (2.2.1)
2.2.2 Inversion
The property of inversion can be stated as: If a circle of radius R has a center O and two points P and P are inverse with respect to the circle then the following conditions must hold:
(i) O, P, P lie on the same straight line; (ii) O does not lie between P and P ; (iii) OP · OP = r2.
To find the point of inversion P we construct a semicircle with diameter OP .IfQ is the point of intersection of this semicircle with the circle whose origin is 0 then P will be the foot perpendicular from Q to OP . This is a Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 59
Fig. 2.2. Construction of the point of inversion P.
consequence of the fact that OQP is a right triangle as shown in Fig. 2.2. Because OPQ will also be a right triangle,
OP r cos ϑ = = , r OP and, consequently,
OP · OP = r2; (2.2.2)
O then is the center of inversion and the circle is called the circle of inversion. These conditions can be simply stated as: If P and P are represented by the complex number z and w, then
(i) arg w = arg z, (ii) |w|=r2/|z|,
where property (i) takes both properties (i) and (ii) of the above. Given the point of inversion P we may calculate the coordinates of P , and vice-versa. From Fig. 2.3 it is apparent that the right triangles whose hypoteneuses are OP and OP are similar so that
x x = . (2.2.3) y y
Then since (2.2.2) holds,
(x2 + y2)(x 2 + y 2) = r2. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
60 A New Perspective on Relativity
Fig. 2.3. Circle of inversion for constructing the inverse P with respect to P .
Introducing the value of y in (2.2.3) we have
y 2 r4 x2 + x2 · = , x2 x2 + y2
or
x r2 y r2 x = , y = . x2 + y2 x2 + y2
These are the coordinates of the interior points which are fully symmetric to the exterior points,
2 2 xr yr x = , y = . x2 + y2 x2 + y2
Although the method of inversion has found extensive use in elec- trostatics, apparently introduced by Lord Kelvin, it seems to be relatively unknown in other branches of science. On closer inspection, however, it appears to have been employed for the first time in optics in a completely novel way by, the then twenty-three year old, Maxwell [Born & Wolf 59]. Since it combines Fermat’s principle of least time, which we will need later on in Chapter 7, and inversion, we will now turn to a discussion of it. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 61
2.2.3 Maxwell’s ‘fish-eye’: An example of inversion from elliptic geometry
Light emitted by a point source at P0 will propagate in a medium of index of refraction η(x, y, z). Although an infinite number of rays have been emitted by our point source, only a finite number will be found to pass through any other point in the medium, with the exception of a point P1 through which an infinite number of rays pass. Such a point is said to be a stigmatic, or sharp, image of P0. An optical instrument which images stymatically in three-dimensions is referred to as absolute. To every point P0 in the object space there corre- sponds a stigmatic image P1 in the image space. These points in the two spaces are said to be conjugate to one another. It was precisely Maxwell, in 1858, who proved that for an absolute instrument the optical length of any curve in the object space is equal to the optical length of its image, provided both spaces are homogeneous. Maxwell provides us with a simple example of an absolute instrument in a medium which is characterized by a refractive index,
1 η(r) = η0, (2.2.4) 1 + (r/a)2
where r denotes the distance from a fixed point O, and η0 and a are con- stants. It is commonly referred to as Maxwell’s ‘fish-eye’ which he first studied in 1854. According to Fermat’s principle, light will propagate between any two points in such a way as to minimize (or at least to extremize) its travel time. In a system of varying index of refraction, (2.2.4), the true path will render the optical length,
√ I = η(r)ds = η(r) (dr2 + r2dϕ2)
√ √ (dr2 + r2dϕ2) (1 + r2ϕ 2)dr = η0 = η0 , (2.2.5) 1 + r2/a2 1 + r2/a2
an extremum, where the prime indicates differentiation with respect to r. Calling the integrand , and noting that ϕ is a cyclic coordinate, i.e. ϕ is absent but its derivative is not, we immediately obtain a first integral of Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
62 A New Perspective on Relativity
the motion, ∂ η(r)r2ϕ = √ = c = const. ∂ϕ (1 + r2ϕ2) Solving for ϕ ,weget
r dr ϕ = c √ , (η2(r)r2 − c2) on integrating. To perform the integral it will be convenient to set ρ = r/a and κ = c/aη0. For then we find:
ρ κ(1 + ρ2)dρ ϕ − ϕ0 = √ ρ (ρ2 − κ2(1 + ρ2)2)
ρ 2 d − κ ρ − 1 = sin 1 √ dρ, dρ (1 − 4κ2) ρ
and, consequently, by inverting,
c r2 − a2 sin (ϕ − α) = √ , (2.2.6) 2 2 − 2 (a η0 4c ) ar where α is a constant of integration. Expression (2.2.6) is the equation of a circle in polar coordinates. All rays through the fixed point, P0(r0, ϕ0), must be as such to keep the ratio, r2 − a2 r2 − a2 = 0 , r sin (ϕ − α) r0 sin (ϕ0 − α)
constant. The fixed point P1(r1, ϕ1) must also satisfy this ratio for whatever α may be, and this leads to the conditions
2 r0r1 = a , ϕ1 = π + ϕ0. (2.2.7)
All rays from a point P0 meet at P1 which lies on a line connecting P0 to O. The 2 points P0 and P1 lie on opposite sides of O such that OP0 · OP1 = a . Conse- quently, Maxwell’s fish eye is an absolute instrument where the image is an inversion since the first condition in (2.2.7) is the condition for inversion, (2.2.2). Only this time O is between the two points instead of condition 2 above. For ϕ = α and ϕ = π + α, r = a and each ray emanating from a fixed point P0 intersects the circle r = a normally.All Euclidean circles orthogonal Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 63
Fig. 2.4. Maxwell’s “fish-eye.”
to the rim of the circle of radius r = a are the routes of geodesics in the elliptic plane E. The rays emanating from any fixed point P0 and coalescing at P1, which lies on the line OP0, are geodesics, or paths of shortest distance between the two points. Arcs of a circle replace straight lines in the elliptic plane. We shall return to this point shortly. To transform the Eq. (2.2.6) from polar to Cartesian coordinates we set x = r cos ϕ and y = r sin ϕ. We then obtain c y cos α − x sin α = √ (x2 + y2 − a2), 2 2 − 2 a (a η0 4c ) a4 (x − b sin α)2 + (y + b cos α)2 = a2 + b2 = , (2.2.8) 4κ2 √ = 2 2 − 2 where b (a/2c) (a η0 4c ). According to the theorem of chords, all chords passing through a fixed interior point, in this case O, are divided 2 into two parts whose lengths have constant product: OP0 · OP1 = a .W thus have to set b = 0, so that the radius of the circle of inversion is exactly a = 2c/η0. If we do not distinguish between the flat metric and the index of refraction in (2.2.5), then we can write the metric as
dr2 + r2dϕ2 ˜2 = η2 2 = ds (r)ds 2 2 , (2.2.9) (1 + (r/r0) ) Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
64 A New Perspective on Relativity
Fig. 2.5. The magnification of the inner product as it is projected stereographically onto the Euclidean plane.
where we set the absolute constant a = r0. A simple way to get new geo- metric structures is to distort old ones. The stereographic projection of the dot product of the tangent vectors x˜ and y˜ at a point p on the surface of a sphere S projects onto the Euclidean plane at a point q where x and y are the tangent vectors, as shown in Fig. 2.5. The relation between their inner products is given by the stereographic inner product distortion [O’Neill 66]
x2 + y2 x · y = + x˜ ·˜y 1 2 , (2.2.10) r0 and so transforms the Euclidean plane into the stereographic plane with con- 2 stant, positive curvature, 1/r0. To rationalize (2.2.10), we consider the inverse map of a plane onto a sphere, from a horizontal plane of height r0 onto a sphere of radius r0. This is given by the projection along the radius (x, y, z) → (λx, λy, λz), where = 2+ 2+ 2 = 2 z r0, the domain of the plane, and (λx) (λy) (λr0) r0, the codomain of the sphere. Solving for λ results in: r λ = √ 0 , (2.2.11) 2 + 2 + 2 (r0 x y ) and, consequently, the stereographic inner product can be written as
λx · λy =˜x ·˜y,
which is again (2.2.10). Stereographic projection was one of the topics covered by Riemann in his 1854 lecture for his Habilitation. Although he discusses a space of posi- 2 tive, constant, curvature, 1/r0, he was undoubtedly aware of what happens Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 65
when r0 becomes imaginary. In such an event, the inner product (2.2.10) 2 + 2 2 makes sense only when we restrict it to a disc x y < r0. Inside this region, the two-dimensional space is one of negative, constant, curvature. We shall come back to this in our discussion of the Poincaré disc model in Sec. 9.5. Stereographic projection possesses two very remarkable properties:
(i) Circles on the sphere are mapped into lines or circles in the plane. (ii) Angles are preserved: the angle formed from two intersecting circles on the sphere is the same as the angle formed from intersecting lines or circles in the plane that correspond to the former under stereographic projection.
Therefore, by sacrificing straight line geodesics we have been able to preserve angles so the stereographic projection is a conformal map of the surface. Maxwell, unwittingly, discovered that his expression for the refractive index, (2.2.4), was the dilatation factor in
ds˜ = η(r)ds, (2.2.12)
in which the infinitesimal shape on the surface is represented in the map by a similar shape that differs from the original one only in size. The one on the stereographic plane is just η times bigger, and the index of refraction is the stereographic magnification factor! This was indeed a big fish to fry for the 23 year-old: he obtained an image as an inversion using stereographic projection. We will use the metric (2.2.12) in Chapter 7 to derive the tests of general relativity by identifying physically the index of refraction, η, which is the magnification factor of the flat metric, ds. The essential point is that both the point source P0 and its image point P1 are on different collinear rays emanating from O, whereas in the case of inversion, a circle P0 and its image P1 are on the same ray emanating from O. This is guaranteed by the form of the index of refraction (2.2.4). As we have just mentioned, we can get a surface of negative curva- − 2 = 2 ture, 1/r0 1/(ir0) by allowing the radius of the sphere to take on the imaginary value ir0. Instead of the index of refraction (2.2.4) we now have
1 η = η (r) 2 0, (2.2.13) 1 − (r/r0) Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
66 A New Perspective on Relativity
which obviously limits us to a (hyperbolic) disc r < r0, which is the absolute constant of the space. Following the same procedure as before, we now find the equation of the circle
− 2 + + 2 = 2 − 2 (x β sin α) (y β cos α) β r0, (2.2.14) √ = 2 2 + 2 where β (r0/2c) (r0η0 4c ). The circle of inversion, C, has a center at (β, α), and its distance from the center of the hyperbolic disc is β. But, in order that (2.2.14) describe a circle, β>r0, as can be seen in Fig. 2.6, the center of inversion must lie outside the hyperbolic plane, H. Thus, its center does not separate the source P0 and its image P1 along a common line uniting the three points, and, as a consequence P0 and P1 will not lie on geodesics arcs of a circle. So, it was not at all fortuitous that young Maxwell chose the form (2.2.4) for the index of refraction, and not (2.2.13). In fact, as we approach the rim of H, which does not belong to H, the index of refraction (2.2.13) becomes infinite. Since the velocity of propaga- tion is inversely proportional to the index of refraction, it will become very small in the limit. Clocks slow down and rulers shrink as they approach the rim when viewed from our Euclidean perspective. We might expect that this shrinking of rulers and slowing down of clocks to have something to do with space contraction and time dilatation. This ‘shrinkage’ of rulers, and ‘slowing down’ of clocks is in direct con- trast as to what happens in the stereographic, or elliptic, plane of constant
Fig. 2.6. In the case of inversion both the point and its image are on the same ray emanating from the center of the disc H. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 67
Fig. 2.7. It appears that rulers get longer as they are moved further from the origin. However, the elliptic distance from x to y is exactly the same as that from X to Y.
positive curvature. The projection of points in the northern hemisphere X and Y are much further away from the origin than projections from the southern hemisphere, x and y, as shown in Fig. 2.7. The index of refraction (2.2.4) becomes smaller and smaller the farther we move away from the origin. This means that the velocity increases and clocks speed up, while rulers get longer as they move farther away from the origin. Large circles in the stereographic plane have very small stereographic arc- length since they correspond to small circles about the north pole of the sphere S. In consideration of the relationship between hyperbolic and elliptic spaces we might expect phenomena such as time contraction and space dilatation to be characteristic of elliptic spaces when viewed from our Euclidean perspective. To an inhabitant of the plane, he would measure the same distance between x and y in Fig. 2.7, as he would measure between X and Y.
2.2.4 The cross-ratio
Now consider a circular arc, defined by arg[(z − z1)/(z − z2)]=const. In addition let there be two fixed points on the arc P1 and P2 with P lying between them. We let P vary such that the angle, measured in radians, ∠P1P2 = θ is constant. If P, P1, P2 are represented respectively by z, z1, z2 the necessary and sufficient condition that the angle remains constant, as P is varied, is: z − z1 arg (z − z1) − arg (z − z2) = arg = θ. z − z2 Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
68 A New Perspective on Relativity
If 0 <θ<π, the locus of P is the arc of a circle with endpoints P1 and P2. For the particular values θ = π/2, the locus of P is a semi-circle, while for θ = π it is the segment P1P2. Generalizing to four points lying on an arc we have: If points P3 and P4 lie on an arc whose endpoints are P1 and P2, then z − z z − z arg 3 1 = arg 4 1 , z3 − z2 z4 − z2 or z − z z − z arg 3 1 · 4 2 = 0, z3 − z2 z4 − z1
where the zi’s represent the Pi’s. But, this can only be if the number, z − z z − z 3 1 · 4 2 , z3 − z2 z4 − z1 is real and positive. This number is the cross-ratio of the four numbers z1, z2, z3, z4. Alternatively, if P3 and P4 lie outside of the arc segment P1P2, then z − z z − z arg 3 1 · 4 2 = π, z3 − z2 z4 − z1 and the corresponding cross-ratio is a negative real number. The cross-ratio finds its origins in renaissance art where artists found it necessary to give depth to their two-dimensional drawings. If the points A, B, C, D lie on a line and the pairs of points A, B separate C, D then the cross-ratio, AC AD {A, B|C, D}= , BC BD is positive, while if they do not then the cross-ratio is negative. The cross-ratio of four points is the minimum number of points that is invariant under projection. A correspondence between two straight lines such that for all cor- responding quadruples, A, B, C, D and A , B , C , D , their cross-ratios are equal, {A, B|C, D}={A , B |C , D } is called a projective correspondence. Since the ordinary projection of a line onto a line preserves the cross-ratio, it is an example of a projective correspondence. Such a correspondence is said to be perspective. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 69
A perspective is merely a realistic representation of spatial depth on a plane.Yet,the method for correct perspective was awarded to the Florentine painter, Brunelleschi at the beginning of the fifteenth century.Alberti solved a special case, known as costruzione legittima, whereby nonhorizontal floor tiles are lined up on a base line with ever-progressing smaller tiles placed behind them and letting them converge to a vanishing point on the horizon as in Fig. 2.8. The development of projective geometry followed, mainly through the work of Desargues, with the introduction of ‘vanishing’ points, or points at infinity where parallels meet, and transformations which change lengths and angles, i.e. projections. But, if length and angles are not invariant under projection, what is? Since it is possible to project any three points on a line onto any three others, this cannot be an invariant. The smallest num- ber of points which is invariant is four, and the cross-ratio is a projective invariant. Following the proof given by Möbius in 1827 that the cross-ratio is a projective invariant, we consider four points on a line A, B, C, D, and a point O not lying on the line as in Fig. 2.9. Drop a normal onto the line and let δ be its length. By computing the area of the triangles OCA, OCB, ODA and ODB, first using the height δ and the bases AB, BC, DA, and DB, and then using the bases OA, and OB, and expressing the height in
Fig. 2.8. A tiling of the plane. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
70 A New Perspective on Relativity
Fig. 2.9. Calculation of cross-ratio and perspectivity.
terms of the sines of the angle at O,wefind
1 1 δ · CA = area OAC = OA · OC sin ∠COA, 2 2 1 1 δ · CB = area OCB = OB · OC sin ∠COB, 2 2 1 1 δ · DA = area ODA = OA · OD sin ∠DOA, 2 2 1 1 δ · DB = area ODB = OB · OD sin ∠DOB, 2 2 Taking the ratio of the first and second pairs, and dividing the former by the latter results in CA DA sin ∠COA sin ∠DOA = . CB DB sin ∠COB sin ∠DOB Observing that any other four points A , B , C , D in perspective with the original points A, B, C, D, with the same external point O, will have the same central angle at O, shown in Fig. 2.9, and, consequently, will have the same cross-ratio. Projective transformations, or collineations as they are sometimes referred to, can map parallel lines onto intersecting lines thereby providing a sense of depth, like the converging parallel lines in Fig. 2.8. In order to define a projective transformation, we must add points ‘at infinity.’ These are necessary in order to insure the one-to-one correspondence that arises in connection with the central projection of a plane onto a plane in which Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 71
some of the points of the first plane have no images. The straight lines of one that intersect the plane correspond to points of intersection with the plane, while those lines parallel to the plane are new points, called points at infinity where parallels meet. For when the straight line that intersects the plane becomes closer and closer to a parallel line, its point of intersection recedes to infinity. The Euclidean plane is transformed into the projective plane by the addition of points at infinity. The logarithm of the cross-ratio measures hyperbolic distance. Because of the logarithmic form it would not satisfy the triangle inequality, (2.2.1). It is well-known that logarithmic equations of state in thermody- namics [Lavenda 09], and logarithmic measures of divergence in informa- tion theory [Kullback 59], have all the topological requisites of a distance except that of the triangle inequality. However, a remarkable property of the cross-product enables the hyperbolic distance to satisfy the triangle inequality, and, therefore, be considered as a bona fide distance. Consider four collinear points with a and b between x and y with b between a and x. The cross-ratio,
{a, b|x, y} > 1,
unless a = b.Ifd is some other interior point,
{a, d|x, y}·{d, b|x, y}={a, b|x, y}, (2.2.15)
and the cross-ratio is associative. Since the distance is the logarithm of the cross-ratios, it is precisely this last property that would lead one to believe that the triangle inequality cannot be satisfied. But wait a moment. What happens if we shorten the interval, say to some x lying between b and x. It can be shown that [Buseman & Kelly 53]:
{a, b|x , y} > {a, b|x, y}. (2.2.16)
So anytime we shorten the interval we increase the cross-ratio, and, conse- quently, the distance from a to b is also increased. To establish the triangle inequality consult Fig. 2.10. The perspectivity of the lines uv and xy from the pole, p, and the inequality (2.2.16) give
{a, c|u, v}={a, d|x , y }≥{a, d|x, y}. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
72 A New Perspective on Relativity
Fig. 2.10. The four points u, a, c, v and a, d, x ,y from point p have the same angles, hence, have the same cross-ratio. This also is true for c, b, w, z and d, b, x ,y .
Likewise, the perspectivity of wz and xy, together with inequality (2.2.16), give {c, b|w, z}={d, b|x , y }≥{d, b|x, y}.
Taking the product of the two inequalities, and using property (2.2.15), result in
{a, c|u, v}·{c, b|w, z}≥{a, d|x, y}·{d, b|x, y}={a, b|x, y}.
Finally, forming the hyperbolic distances by taking the logarithm of both sides yields the triangle inequality,
h(a, c) + h(c, b) ≥ h(a, b), (2.2.17)
for the hyperbolic distance as the logarithm of the cross-ratio.
2.2.5 The Möbius transform
The properties of the Möbius transform that we discuss here will be used in Chapter 8, especially in Sec. 8.2. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 73
2.2.5.1 Invariance of the cross-ratio We will now show that the Möbius transform leaves the cross-ratio invari- ant. It is this property that Poincaré used, to show that all the cells of the tessellations in the hyperbolic plane are of equal size. Take all four numbers z1, z2, z3, z4 to be different, and cz + d = 0 for any of them. If
azi + b wi = , czi + d
i = 1, ..., 4, the wi are all different and their differences are given by:
w1 − w2 = (z1 − z2)/(cz1 + d)(cz2 + d),
w2 − w3 = (z2 − z3)/(cz2 + d)(cz3 + d),
with denoting, again, the determinant. Dividing the first by the second, w − w z − z cz + d 1 2 = 1 2 · 3 . w2 − w3 z2 − z3 cz1 + d Likewise, w − w z − z cz + d 1 4 = 1 4 · 3 , w4 − w3 z4 − z3 cz1 + d and again dividing the first by the second, w − w w − w z − z z − z 1 2 1 4 = 1 2 1 4 . w2 − w3 w4 − w3 z2 − z3 z4 − z3 The left- and right-hand sides are the cross-ratios of four numbers, and they are equal. This shows that the Möbius transform preserves cross-ratios.
2.2.5.2 Fixed points A fixed point occurs when w = z. Fixed points are, therefore, determined by the equation
cz2 + (d − a)z − b = 0.
It is not difficult to see that the only Möbius transform with more than two fixed points is the identity transform. For if a = d and b = c = 0, every point is fixed. The transformation reduces to w = z, or the identity Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
74 A New Perspective on Relativity
transformation. Now, if a = d, b = 0, and c = 0, the quadratic has one root, i.e. ∞, which is the only fixed point. Further, if c = 0, a = d, and b = 0, the quadratic has distinct roots, b/(a − d) and ∞. Now, assume that c = 0 and δ is either of the square roots of the discriminant, (a − d)2 + 4bc.Ifδ = 0, the quadratic has two distinct roots, (a − d ± δ)/2c. Rather, if δ = 0, the roots coalesce to a single fixed point (a − d)/2c. Hence, we have shown that there cannot be more than two fixed points of a Möbius transformation.
2.2.5.3 Associativity The Möbius transformation is also associative, just like the cross-ratio, (2.2.15). That is, if T1 transforms z1 into z2 and T2 transforms z2 into z3, then the product T1T2 transforms z1 into z3. Let T1 be the Möbius transform, w = (az + b)/(cz + d), and T2 be the transform, w = (Az + B)/(Cz + D), then their product, T1T2 is defined as A[(az + b)/(cz + d)]+B w = , C[(az + b)/(cz + d)]+D which has the same form (Aa + Bc)z + (Ab + Bd) w = . (Ca + Dc)z + (Cb + Dd)
Since the determinant is the product of determinants 12 = (AD − BC) (ad − bc), and does not vanish, it makes T1T2 also a Möbius transformation. A special product transformation will be of importance in our further developments; that is, when the product T1T2 = I, the identity transfor- mation. The identity w = z will result only when the following conditions are met
Aa + Bc = Cb + Dd, Ab + Bd = 0, Ca + Dc = 0.
This will happen only when the ratios A : B : C : D are the same as d : − b : −c : a. Then there is a unique transform T2 which has the Möbius transform, dz − b w = . −cz + a Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 75
−1 It is the inverse to T1, and is written as T1 . Hence, T1 transforms z1 into z2, −1 + = and T1 transforms z2 back into z1. Moreover, if it happens that a d 0, T1 −1 and T1 are the same, and T1 is called involutory, since it is its own inverse.
2.2.5.4 Transformations for which the unit circle is invariant These transformations are particularly interesting for they correspond to the Lorentz transformations in relativity. Considering complex coordinates on the unit disc, a Lorentz transformation corresponds to a Möbius trans- formation, az + c¯ w = , (2.2.18) cz + a¯ for which |a¯| > |c¯|, so that their ratio, |c¯/a¯| will be a point in the interior of the unit disc. This is a necessary and sufficient condition that w maps the interior of the unit disc onto its interior [Schwerdtfeger 62]. A Möbius transform which transforms three distinct points of a unit circle into three other distinct points of the circle it must, obviously, trans- form the unit circle into itself since if z is a circle or a line, so too will be w. If the Möbius transform, w = (az+b)/(cz+d) transforms the unit circle into itself, |w|=1, implying |az + b|=|cz + d|. The latter condition must be the same as |z|=1. Now, the condition |az + b|=|cz + d| is the same as
|z + b/a|=|c/a|·|z + d/c|.
This is the equation of a circle having a pair of inverse points −b/a and −d/c. We know that the two inverse points must be of the form z = 1/z¯, ¯ which implies b/a = c¯/d. As a special case we can set b = c¯ and d = a¯. For then, the Möbius transform which carries the unit circle into itself will be of the form (2.2.18). Inverting it we get aw¯ − c¯ z = . −cw + a The family of circles |z|=κ>0, where κ is real, is transformed into the coaxial circle: |az¯ − c¯|=κ|cz − a|. Coaxial circles are a family of circles such that any pair has the same radical axis. The radical axis is the line passing through the two points of intersection of a pair of circles, as the line PQ in Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
76 A New Perspective on Relativity
Fig. 8.1. The origin, which is a circle of radius κ = 0, is transformed into c¯/a¯, which lies inside the unit circle when |c| < |a|, and outside of it when |c| > |a|. The condition that the determinant must not vanish, ad − bc = 0, prohibits the case |c|=|a|.
2.3 Geodesics
Returning to our hyperbolic model of the heated plane, we have
√ 1/2 p2 (dx2 + dy2) p2 1 dy 2 s = = 1 + dx, (2.3.1) p1 y p1 y dx
as the distance between two points p1 and p2. The pre-factor has the form of a varying index of refraction. For if we suppose that at the Earth’s surface y = 0, the index of refraction η(y) will be a function of height y only. The propagation time, τ along a ray connecting two endpoints p1 and p2 will be given by Fermat’s principle of least time:
p 2 √ cτ = η(y) (1 + y 2)dx, (2.3.2) p1 where the prime denotes differentiation with respect to the independent variable, x. The product cτ is known as the optical path length, where c is the velocity of light in vacuum. According to Fermat’s principle of least time, the optical path length is stationary for the true ray path. In terms of the integrand of (2.3.2), √ (y, y ) = η(y) (1 + y 2), (2.3.3)
the Euler–Lagrange equation can be written as
∂ − y = C, (2.3.4) ∂y where C = const is a first integral of the motion. Explicitly, the Euler– Lagrange equation (2.3.4) is η √ = C. (2.3.5) (1 + y2) So, the constant C is the value of the index of refraction where the ray becomes horizontal. The angle, θ, formed between the tangent to the ray Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 77
Fig. 2.11. Derivation of Snell’s law.
and the normal to the ray, shown in Fig. 2.11, is given by
√ √ θ = arc sin(dx/ (dx2 + dy2)) = arc sin(1/ (1 + y 2)),
so that the Euler–Lagrange equation (2.3.5) coincides with Snell’s law,
η(y) sin θ = C. (2.3.6)
According to Snell’s law, the sines of the angles which the incident θi and transmitted θt rays make with the normal to an interface between two different media are proportional, i.e.
sin θ i = η, (2.3.7) sin θt
where η is the relative index of refraction of the two media. Expression (2.3.6) generalizes Snell’s law to the case where the index of refraction is a function of the height. Ordinarily, the index of refraction decreases with altitude, and this is borne out by the heated plane model since upon comparing terms in the integrands of (2.3.1) and (2.3.2) we find η = 1/y, and, consequently, dη/dy < 0. The true ray that will connect the two points will be concave: Light minimizes its propagation time by arching its path upwards between the endpoints, like a cat ready to attack. As a result, objects do not appear to be where they are but are a little bit lower than our line of sight. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
78 A New Perspective on Relativity
In contrast, the index of refraction will be an increasing function of height in inversion layers where mirages are formed. The ray is now convex and the images will be higher than our line of sight. In both cases, these distortions are caused by the non-Euclidean nature of the geometry. This will be a recurrent theme throughout. Even without making the integral (2.3.1) stationary we can get some remarkable properties about hyperbolic geometry, undoubtedly the most important of which is the angle of parallelism. Transforming to polar coor- dinates, and considering the arc γ to increase the angle from α to π/2weget
√ √ (dx2 + dy2) π/2 (r 2 + r2) = dθ γ y α r sin θ
π/2 dθ ≥ =−ln tan (α/2), (2.3.8) α sin θ
where the prime now stands for differentiation with respect to the inde- pendent variable, θ. If we set (2.3.8) proportional to the minimum distance from a point P, using the perpendicular distance d, to a line , as shown in Fig. 2.12, we have one of the most remarkable formulas in all of mathematics. The number α of radians in the angle of parallelism depends only on the distance d from P to Q and not on the particular line , or the particular point P. The formula was discovered independently by J. Bolyai and N. Lobachevsky. In Euclidean geometry the rays emanating at P must coincide, so α, which is usually written as (d) in the literature, is always a right angle. However, under Lobachevsky’s postulate these lines are distinct and the angle (d) is necessarily acute. It is a function only of the hyperbolic distance d.
Fig. 2.12. Angle of parallelism. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 79
The Euler–Lagrange equation which renders the integral (2.3.1) sta- tionary is:
2 1 + y y + = 0. y
The solution to this equation is a family of circles
(x − a)2 + y2 = c2,
whose centers lie on the x-axis, where a and c are constants of integration. Restricting ourselves to the half-circles located in the upper half-plane will give us the Poincaré half-plane model. The semi-circles will be the geodesics of our hyperbolic space. If the plane were Euclidean, we could draw only one line through any given point parallel to a given straight line. This is Euclid’s fifth postulate. In this plane there would be only one geodesic through a given point that would be parallel to another given geodesic. Not so in our heated plane! Because the geodesics are semi-circles, all geodesics through a point P not lying on the geodesic g in Fig. 2.13 are parallel to g, even h1 and h2, which are tangent to it at points U and V, because those points have been excluded by considering them infinitely far away, i.e. points at infinity. To any student of geometry, this smacks of Lobachevsky geometry, who only claimed that there exist two lines parallel to a given line through a given point not on the line. However, this does not mean that he did not rec- ognize that there were infinitely many non-intersecting lines. His parallel
Fig. 2.13. The number of lines passing through P that are hyperparallel to the line g are infinite. The lines h1 and h2 are limiting parallel to g, while the others are hyperparallel to g. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
80 A New Perspective on Relativity
property is what is now usually referred to as ‘asymptotically parallel’ or ‘horoparallel.’ Here, Lobachevsky’s statement is due to the peculiar nature of the Poincaré half-plane model. Nevertheless, the half-plane model illustrates the hallmark of Lobachevskian geometry: the sum of the angles of a triangle are less than two right angles. The heated plane model also illustrates other√ properties of projective geometry. We substitute the positive square root, [c2 − (x − a)2] for y in (2.3.1) and determine the distance s between two points x1 and x2 as
x2 c s = dx 2 − − 2 x c (x a) 1 x2 = 1 1 + 1 + − − + dx 2 x c x a c x a 1 1 x − (a − c) (a + c) − x = ln 2 · 1 . (2.3.9) 2 (a + c) − x2 x1 − (a − c)
Now, let us define two other x-coordinates, x3, x4 with x4 > x3, as the points where the geodesics intersect the x-axis, i.e. x3 = a − c and x4 = a + c. Substituting these values into (2.3.9) results in 1 x − x x − x s = ln 2 3 · 4 1 . (2.3.10) 2 x4 − x2 x1 − x3 This is precisely the logarithm of the cross-ratio, x − x x − x 2 3 · 4 1 , x4 − x2 x1 − x3
of four ordered points, x1, x2, x3, x4. For fixed endpoints x3 and x4, (2.3.10) is the hyperbolic distance between x1 and x2, which we know by (2.2.17) satisfies the triangle inequality.
2.4 Models of the Hyperbolic Plane and Their Properties
In the half-plane model, studied at the beginning√ of this chapter, we found it equipped with the distance function ds = (dx2 + dy2)/y. This is one model of the hyperbolic plane because anything with the same metric is also a viable model of the hyperbolic plane. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 81
In search of these other models, we take our cue from Euclidean and spherical geometries where equivalent metrics, or isometries, are found = + by using complex functions.√ In terms of the complex number, z x iy, the distance ds = (dx2 + dy2)/y becomes |dz|/Im z. The map from the half-plane to the disc is iz + 1 −iw + 1 w = ,orz = , z + i w − i so that |dz| iw + 1 −iw + 1 ds = = d Im Im z w − i w − i 2|dw| (1 − iw)(w¯ + i) = Im |w − i|2 |w − i|2 2|dw| = . (2.4.1) 1 −|w|2 Thus, we have two models already of the hyperbolic plane:
• the upper half-plane model with distance ds =|dz|/Im z, where the ‘lines’ are semi-circles perpendicular to the real axis, as in Fig. 2.13, and angles which are the same as Euclidean angles; and • the open disc model with metric, (2.4.1), and ‘lines’ that are circular arcs orthogonal to the boundary, as shown in Fig. 2.6, with angles the same as Euclidean angles.
The reason for conformality of the Poincaré disc model is that it took two inversions to go from the half-plane to the disc. We will soon meet yet another disc model which straightens out the circular arcs at the cost of losing angle invariance. The attribute of having a circle at infinity as a natural boundary is that the points on the disc are actually located at infinity as our inhabitants of the unit disc, whom we shall refer affectionately to as ‘Poincarites,’ know.a The distance from the origin to any point tends to infinity as the point tends to 1. Lines, or rather circular arcs, which have a common point on the circle
aThe circle at infinity will take on a physical vest when it is identified as the limit of the inner solution to the Schwarzschild metric in Sec. 9.10.3. The name ‘Poincarites’ was probably first used by Needham [97]. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
82 A New Perspective on Relativity
at infinity are known as asymptotic lines. The point where they meet is not a point belonging to the lines, but, rather, a limit point because points are at ‘infinity.’ In contrast, ultraparallels are circular arcs which cut the circle at infinity but have no common point. The distinction between these two lines in the unit disc is far from academic. A product of reflections in ultraparallel lines constitutes a trans- lation, whereas a product of reflections in asymptotic lines is a ‘limit’ rota- tion. Limit rotations on the disc are circles tangent to the circle at infinity, and are known as horocycles, or ‘limit’ cycles. An amazing find was that a horocycle, or a horosphere in three-dimensions, is a circle at infinity that obeys Euclidean, and not a hyperbolic, geometry. This discovery was made by Wachter, a student of Gauss, way back in 1816. It will allow us to use Euclidean geometry to determine the properties of the hyperbolic plane, notably the angle of parallelism and the necessity of introducing an abso- lute constant, or a unit of measure, which is completely foreign to Euclidean geometry. The horocycle, or circle whose center is at infinity, i.e. on the unit disc, is most clearly seen considering the ‘pseudosphere.’ Of all the mappings of constant negative curvature on the unit disc, it is only the middle figure in Fig. 2.14, which has been adapted from Klein’s 1928 book on non-Euclidean geometry, that shows the horocyles as dashed lines. The solid lines are the image of one turn of the covering of the pseudosphere. All three mappings show that surfaces of constant negative curvature are mapped only onto part of the disc. We postpone a discussion of some of the remarkable properties of the pseudosphere, which is to hyperbolic geometry what the plane and sphere are to Euclidean and elliptic geometry, respectively, and use the following property of horocycles to derive the angle of parallelism.
The ratio of any two concentric limiting arcs cut by radii depends only on the distance between them and not on their size or where they are located in the hyperbolic plane.
It is by no means an understatement to say that all the trigonometric relations of hyperbolic geometry follow from the fact that the ratio of con- centric limiting arcs l and m, with l > m, intercepted between two radii is Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 83
Fig. 2.14. Surfaces of negative constant curvature that are mapped onto part of the hyperbolic plane. The middle figure is the mapping of a pseudosphere that produces horocycles as dashed curves.
given by
l/m = ea/κ, (2.4.2)
where a is the distance between the arcs and κ is a positive (absolute) con- stant. In Fig. 2.15 the arcs l, m, and n are cut by two radii. The distance between the first two is a, while the distance between the second and third arcs is b. We know that the ratios depend on the distance between them but Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
84 A New Perspective on Relativity
Fig. 2.15. The ratio of concentric limiting arcs depends only on the distance between them.
we do not know the functional form. That is,
l/m = f(a), m/n = f(b), l/n = f(a + b),
where f is a positive, and increasing function. These relations suggest the functional relation,
f(a) · f(b) = f(a + b).
Such a functional relation can only be satisfied by an exponential function. The transfer from any exponential g > 1 to the constant e entails introduc- ing an absolute constant κ such that
ga = (eln g)a = ea/κ,
and on account that g > 1, κ>0. If we go back to the pseudosphere, we find that for any two points on its surface, the following remarkable relationship holds
x2 + y2 <κ2.
When we go to plot these points on a Euclidean plane, as in Fig. 2.14, they are constrained to lie within a circle of radius κ. All points on the entire pseudosphere are thus constrained to lie within a circle of radius κ on a Euclidean plane. This radius is called the radius of curvature,orspace constant, and is an absolutely determined length. It is the analog of the radius of a sphere in spherical geometry under the transform κ → iκ. And although it is absolutely determined, its magnitude will depend upon the units chosen. Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
Which Geometry? 85
Fig. 2.16. Using Euclidean geometry to derive the angle of parallelism by consid- ering concentric limiting arcs.
The arc lengths C B and CB belong to two concentric horocycles in Fig. 2.16. The angle at C is a right angle so that by Euclidean geometry B C = l sin β and AC = l cos β. The distance between the concentric arcs is
l l eb/κ = eCC /κ = = = csc β. BC l sin β Now, the ratio of the concentric limiting arcs l + AC and l is [Kulczycki 61] l + AC ea/κ eb/κ = = 1 + cos β, l and consequently, 1 + cos β ea/κ = = cot(β/2). sin β Denoting β = (a) as the angle of parallelism, which can only depend on the hyperbolic distance a, we obtain the Bolyai–Lobachevsky formula,
(a) − tan = e a/κ. (2.4.3) 2
Euclidean geometry can be used in the hyperbolic plane to derive non- Euclidean results by considering the properties of concentric horocy- cles, and from their property (2.4.2) all the trigonometric formulas of hyperbolic geometry follow.
Now consider M1(x1, y1) and M2(x2, y2) as any two points on a horo- cycle lying in the unit disc. Let M(x, y) be any point on the arc M1M2. With Aug. 26, 2011 11:16 SPI-B1197 A New Perspective on Relativity b1197-ch02
86 A New Perspective on Relativity
M1M/MM2 = λ, the relation between the coordinates are known to be x + λx y + λy x = 1 2 , y = 1 2 . (2.4.4) 1 + λ 1 + λ
The values λ1 and λ2 will be the roots of the unit disc,
x2 + y2 = 1, (2.4.5)
that is when M coincides with either P or Q on the boundary. Introducing (2.4.4) into (2.4.5),
2 2 (x1 + λx2) + (y1 + λy2) − (1 + λ) = 0,
and defining
= 2 + 2 − = 2 + 2 − 11 x1 y1 1, 22 x2 y2 1,