Ray Optics for Imaging Systems Course Notes for IMGS-321 11 December 2013
Roger Easton Chester F. Carlson Center for Imaging Science Rochester Institute of Technology 54 Lomb Memorial Drive Rochester, NY 14623 1-585-475-5969 [email protected]
December 11, 2013 Contents
Preface ix 0.1References:...... 1
1 Introduction 1 1.1 Models of Light and Propagation ...... 2 1.1.1 Raymodeloflight(“geometricaloptics”)...... 2 1.1.2 Wavemodeloflight(“physicaloptics”):...... 2 1.1.3 Photonmodeloflight(“quantumoptics”):...... 3
2 Ray (Geometric) Optics 5 2.1Whatisanimagingsystem?...... 5 2.1.1 SimplestImagingSystem—PinholeinAbsorber...... 5 2.2First-OrderOptics...... 6 2.3Third-OrderOptics...... 9 2.3.1 Higher-OrderApproximations...... 10 2.4NotationsandSignConventions...... 10 2.4.1 NatureofObjectsandImages:...... 11 2.5HumanEye...... 13 2.6PrincipleofLeastTime...... 13 2.7 Fermat’s Principle for Reflection...... 14 2.7.1 PlaneMirrors...... 17 2.8Fermat’sPrincipleforRefraction:...... 18 2.8.1 Dispersion...... 19 2.8.2 RefractiveConstantsforGlasses...... 21 2.9ImageFormationintheRayModel...... 24 2.9.1 RefractionataSphericalSurface...... 24 2.9.2 ImagingwithSphericalMirrors...... 27 2.10First-OrderImagingwithThinLenses...... 28 2.10.1ExamplesofThinLenses...... 30 2.10.2SphericalMirror...... 32 2.11 Image Magnifications...... 32 2.11.1 Transverse Magnification:...... 32 2.11.2 Longitudinal Magnification:...... 33 2.11.3 Angular Magnification...... 34 2.12SingleThinLenses...... 35 2.12.1PositiveLens...... 35 2.12.2NegativeLens...... 36 2.12.3MeniscusLenses...... 36 2.12.4 Simple Microscope (magnifier,“magnifyingglass,”“loupe”)...... 37 2.13SystemsofThinLenses...... 41 2.13.1Two-LensSystem...... 41 2.13.2 Effective(Equivalent)FocalLength...... 43
v vi CONTENTS
2.13.3SummaryofDistancesforTwo-LensSystem...... 48 2.13.4 “EffectivePower”ofTwo-LensSystem...... 48 2.13.5 Lenses in Contact: t =0...... 49 2.13.6 Positive Lenses Separated by t
3 Tracing Rays Through Optical Systems 95 3.1ParaxialRayTracingEquations...... 95 3.1.1 ParaxialRefraction...... 96 3.1.2 ParaxialTransfer...... 97 3.1.3 LinearityoftheParaxialRefractionandTransferEquations...... 98 3.1.4 ParaxialRayTracing...... 98 3.2MatrixFormulationofParaxialRayTracing...... 100 3.2.1 RefractionMatrix...... 101 3.2.2 RayTransferMatrix...... 102 3.2.3 “Vertex-to-VertexMatrix”forSystem...... 104 3.2.4 Example1:SystemofTwoPositiveThinLenses...... 105 3.2.5 Example2:TelephotoLens...... 108
3.2.6 VV0 DerivedFromTwoRays...... 109 3.3Object-to-Image(Conjugate)Matrix...... M 110 3.3.1 Matrix of the “Relaxed” Eye (focused at ) ...... 114 3.4Vertex-VertexMatricesofSimpleImagingSystems...... ∞ 115 3.4.1 Magnifier(“magnifyingglass,”“loupe”)...... 115 3.4.2 GalileanTelescopeofThinLenses...... 116 3.4.3 KeplerianTelescopeofThinLenses...... 117 3.4.4 ThickLenses...... 117 3.4.5 Microscope...... 121 3.5 Image Location and Magnification...... 122 3.6MarginalandChiefRaysfortheSystem...... 122 3.6.1 ExamplesofMarginalandChiefRaysforSystems...... 123 CONTENTS vii
4 Depth of Field and Depth of Focus 141 4.0.2 ExamplesofDepthofFieldfromVideoandFilm...... 143 4.1 Criterion for “Acceptable Blur” ...... 149 4.2DepthofFieldviaRayleigh’sQuarter-WaveRule...... 152 4.3HyperfocalDistance...... 156 4.4MethodsforIncreasingDepthofField...... 156 4.5 Sidebar: Transverse Magnificationvs.FocalLength...... 157
5Aberrations 161 5.1ChromaticAberration...... 161 5.2Third-OrderOptics,MonochromaticAberrations...... 165 5.2.1 NamesofAberrations...... 173 5.2.2 Aberration Coefficients...... 174 5.2.3 Fourth-Order(Third-OrderRay)Aberrations:...... 181 5.2.4 ZernikePolynomials...... 190 5.3 Structural Aberration Coefficients...... 193 5.4OpticalImagingSystemsandSampling...... 193 5.5OpticalSystem“RulesofThumb”...... 193 Preface This book is intended to introduce the mathematical tools that can be applied to model and predict the action of optical imaging systems.
ix
0.1 REFERENCES: 1 0.1 References:
Many references exist for the subject of wave optics, some from the point of view of physics and many others from the subdiscipline of optics. Unfortunately, relatively few from either camp concentrate on the aspects that are most relevant to imaging.
Useful Optics Texts: [P3] (the three) Pedrottis, Introduction to Optics, Pearson Prentice-Hall, 2007. [G] Gaskill, Jack D., Linear Systems, Fourier Transforms, and Optics, John Wiley, 1978. [JG] Goodman, Joseph, Introduction to Fourier Optics, Third Edition, Roberts & Company, 2005. [H] Eugene Hecht, Optics, 4th Edition, Addison-Wesley, 2002. [PON] Reynolds, DeVelis, Parrent, Thompson, The New Physical Optics Notebook, SPIE, 1989. [BW] Max Born and Emil Wolf, Principles of Optics, 7th Expanded Edition, Cambridge University Press, 2005. [GF] Grant R. Fowles, Introduction to Modern Optics (Second Edition), Dover Publications, 1975. [RHW]RobertH.Webb,Elementary Wave Optics, Dover Publications, 1997. [FLS] R. Feynman, R. Leighton, M. Sands, The Feynman Lectures on Physics,Addison- Wesley, 1964. [KF] M.V. Klein and T.E. Furtak, Optics, Second Edition, Wiley, 1986 [JW] F. Jenkins and H. White, Fundamentals of Optics, 4th Edition, McGraw-Hill, 1976. [NP] A. Nussbaum and R. Phillips, Contemporary Optics for Scientists and Engineers, Prentice-Hall, 1976. [I] K. Iizuka, Engineering Optics, Springer-Verlag, 1985. [FBS] D. Falk, D. Brill, and D. Stork, Seeing the Light, Harper and Row, 1986. Lawrence Mertz, Transformations in Optics, John Wiley & Sons, 1965.
Physics Texts with useful discussions: [HR] D. Halliday and R. Resnick, Physics, 3rd Edition, Wiley, 1978. [C] F. Crawford, Waves, Berkeley Physics Series Vol. III, McGraw-Hill, 1968. John D. Jackson, Classical Electrodynamics, Third Edition, Wiley, 1998, §6. Feynman, Leighton, and Sands, Lectures on Physics, particularly Volume 1.§25-§33 and Vol- ume II §32-§33
Curriculum: Geometrical Optics and Imaging
1. Models for light propagation
(a) ray model (“geometric optics”) (b) wave model (“physical optics”) (c) photon model (quantum optics)
2. First-order optics
(a) third-order optics, aberrations (b) higher-order approximations
3. Sign conventions for distances and angles
(a) Nature of objects and images (real and virtual) 2 Preface
4.Humaneye 5. Refractive index
(a) Optical path length (b) Fermat’s principle of least time (P3 §2.2, H §4.5, BW §3.3)
(c) Snell’s law for reflection: θ2 = θ1 − i. plane mirrors
(d) Snell’s law for refraction: n1 sin [θ1]=n2 sin [θ2] i. plane interface between two media (e) Dispersion (variation in n with λ) i. relationship between mean refractive index and dispersion ii.crownandflint glasses (f) Dispersing prisms
6. Refraction at a Spherical Surface
(a) Paraxial approximation, imaging equation (b)Reflection at a spherical surface
7. Imaging with thin lenses
(a) Imaging equation in terms of object and image distances and focal length (b)system“power” (c) spherical mirrors (d) object/image conjugates (e) Image magnifications i. Transverse magnification ii. Longitudinal magnification iii. Angular magnification (f)Singlethinlenses i. positive lens ii. negative lens iii. meniscus lens iv.simplemicroscope (g) Systems of thin lenses i.lensesincontact ii.effective focal length and power of two-lens system iii. focal and principal points iv. afocal systems (telescopes) v. eyeglasses vi. compound microscopes vii. Newtonian form of imaging equation viii. telephoto lens ix. Stops and pupils A.aperturestop B. entrance and exit pupils 0.1 REFERENCES: 3
C. field stop (h) Marginal and chief (principal) rays i. telecentricity
8. Tracing rays through optical systems
(a) paraxial ray tracing equations i. paraxial refractiontransfer ii. paraxial transfer iii. linearity of equations (b) matrix formulation of paraxial ray tracing i. refraction matrix ii. transfer matrix iii. Lagrangian invariant iv. vertex-to-vertex matrix for imaging system v. object-to-image (conjugate) matrix vi. matrix for eye model (c) Examples of imaging system matrices i. magnifier ii. Galilean telescope iii. Keplerian telescope iv.thicklens v.microscope (d) image location and magnification (e)Depthoffield and depth of focus i.examplesfromfilm and video ii. criterion for “acceptable blur” iii.depthoffield via Rayleigh’s quarter-wave rule iv. hyperfocal distance v. methods for increasing depth of field vi. transverse magnification vs. focal length (f) Aberrations i. Chromatic aberration A. achromatic doublet B. apochromatic triplet ii. Third-Order (Seidel) Aberrations A. spherical aberration (relation to defocus) B.coma C. astigmatism D.distortion E.curvatureoffield F. piston error
9. Computed Ray Tracing, OSLOTM
Chapter 1
Introduction
The obvious first question to consider is “what is optics” (or perhaps “what are optics?” heh, heh). Onereasonabledefinition of optics is the application of physical principles and observed phenomena to manipulate “light” in useful ways. This presupposes the definition of “light,” which I specify as electromagnetic radiation of any “color,” temporal frequency, and wavelength. This is more general than the definition put forth by humanocentrics (e.g., color scientists), but is much more reasonable in our field, where we want to take advantage of all measureable radiation to learn information about objects that emit, reflect, refract, or otherwise modify radiation. The definition in imaging is somewhat narrower: the application of the properties if materials and of light to form “images,” which are “recognizable (though approximate) replicas of the spatial and spectral distribution of light reflected, transmitted, and/or emitted by an object.” To design optical image-forming systems, we must model the propagation of light from the object (source) to the optic, the action of the optic on the incident light distribution, and finally propagation from the optic to the sensor. The last step of conversion of the spatial (and possibly spectral) distribution of incident light into measurable physical and/or chemical changes in some mediumbythesensor,isoutsidethescopeofthisdiscussion. We hope to find a mathematical model of optical imaging as a “system,” where an output dis- tribution g is created from an input object distribution f by the action of an imaging system , e.g., g [x, y, λ]= f [x, y, z, λ] . We generally use this model to (try to) solve the inverse imagingO problem by inferringO{ the input object} from the output image and knowledge of the system. The task may be difficult or even impossible; it is easy to see one difficulty because most sensors measure only a 2-D distribution of monochromatic light and therefore cannot possibly recover the three spatial dimensions of a realistic object from a single image.
Schematic of an optical system that acts on an input with three spatial dimensions, time, and wavelength f [x, y, z, t, λ] to produce a 2-D monochrome (gray scale) image g [x0,y0].
1 2 CHAPTER 1 INTRODUCTION 1.1 Models of Light and Propagation
To be able even to write down, let alone solve, the imaging equation(s) for optical systems, we need to specify the mathematical model of light that will describe its behavior as it propagates and interacts with input objects, optical systems, and output sensors. To simplify the descriptions in the different contexts, three physical models for light and its interactions are used that are (loosely speaking) distinguished by the physical scale of the phenomena:
1.1.1 Ray model of light (“geometrical optics”) macroscopic-scale phenomena (e.g., reflection, refraction)
1.(a) light propagates as RAYS that travel in straight lines until encountering an change in properties of a medium or an interface between media. Except to differentiate the color of light, the wavelength λ and temporal frequency ν of the light are assumed to be zero and infinity, respectively (λ 0,ν ), which means that there are no effects due to diffraction; → →∞ (b) uses Fermat’s principle of least time to derive Snell’s law, which describes the phenomena of reflection and refraction; (c) useful for designing imaging systems (to locate the images and determine their magnifi- cations) (d) calculations for modeling the behavior of optical systems (lenses and/or mirrors) are (relatively) simple and may be easily implemented in software; (e)thequality of images from the system is assessed in terms of aberrations of the optical system, which describe deviations of the image from ideal behavior.
1.1.2 Wave model of light (“physical optics”): 1. microscopic-scale phenomena (diffraction/interference, reflection, refraction, refractive index, ...)
(a) considers light (electromagnetic radiation) to propagate as WAVES ; (b) propagation and interaction of light are described by Maxwell’s equations; 8 1 (c) light propagates with velocity c in vacuum c / 3 10 ms− and velocity v (e) the oscillation frequency ν0 of waves emitted by a particular light source is constant regardless of medium and is related to the vacuum wavelength λ0 via: λ0 ν0 = c · (f) the ratio of the propagation velocities in vacuum and in a medium is the index of refraction of the medium: c n ≡ v (g) the wavelength of the wave in a medium is shorter the “vacuum wavelength” λ0 via: λ λ = 0 medium n (h) wave optics explains the image-forming phenomena of reflection, refraction, diffraction (and interference, which is really just another name for diffraction) and the phenomena of polarization and dispersion that affect the quality of images; 1.1 MODELS OF LIGHT AND PROPAGATION 3 (i) mathematical calculations in wave optics are more “complicated” than those in ray optics and often not easy to implement in computers. For example, it is difficult to evaluate the exact form of light after propagating a short distance from the source; (j) uses the Huygens-Fresnel principle to derive the mathematical model for propagation of light, which if often divided into three regions: i. linear, shift-invariant model in the Rayleigh-Sommerfeld diffraction region (valid everywhere) ii. linear, shift-invariant approximation in the near field for propagation by a “suffi- ciently large” distance from the source (Fresnel diffraction) iii. linear, shift-variant approximation in the far field for propagation to “very large” distances from the source (Fraunhofer diffraction); (k) wave/physical optics is useful for assessing the quality of the images produced by systems. 1.1.3 Photon model of light (“quantum optics”): atomic-scale phenomena (emission and absorption of radiation) 1.(a) light is composed of PHOTONS with both wave and particle characteristics; (b) used to explain/analyze the physical interaction of light and matter, such as emission by sources (e.g., lasers), and the photoelectric effect in sensors; c E h (c) Fundamental relationships: E0 = hν0 = h and momentum p = = ,whereh is λ0 c λ0 Planck’s constant: 34 15 h = 6.626 10− Js= 4.136 10− eV s ∼ × ∼ × Phenomena described by the ray and wave models are most relevant to imaging, though the quantum model is vital for understanding the properties and artifacts of light sensing. You probably have seen some consideration of ray optics in undergraduate physics, and any such experience will be useful in this course. The most common treatments of optics consider rays first because the mathematical models and calculations are simpler. However, the preparation of linear systems you just had makes it possible and even desirable to consider the wave model first by applying the concepts of the impulse response and transfer function; these may significantly simplify the concepts and calculations. There are several goals to be reached by the conclusion of this discussion; we want to have the capabilities to do several things: locate the image(s) of an object generated by the lens, mirror, or system of lenses and/or • mirrors; determine the “character” (real or virtual) and the size(s) (i.e., the transverse magnification) • of the image(s); determine the “field of view” of the imaging system, i.e., the angular subtense of the object • that is imaged; determinetherangeofdistancesinthescenefromtheopticalsystemthatappearstobe“in • focus” (the depth of field); determine the capability of the optics to distinguish closely spaced objects — this is the “spatial • resolution” of the system (often specified in terms of measurements from the “point spread function”orthe“modulation transfer function” = “MTF,” which are optical analogues of the “impulse response” and “transfer function” that are considered in the course on Fourier methods); 4 CHAPTER 1 INTRODUCTION understand the constraints on system performance due to the properties of materials used in the • imaging system, such as the variation in refractive index of glass with wavelength (dispersion) Much of this discussion (especially about depth of field and spatial resolution) will benefitfrom concepts derived in the course on Fourier methods, but we must also be aware of the limitations in these concepts due to nonlinearities and/or shift-variant properties of the optical system. Chapter 2 Ray (Geometric) Optics Ray optics (commonly, though unfortunately, called “geometric optics”) uses the model of light as a ray to evaluate the locations and properties of images created by systems of lenses and/or mirrors. It does not consider any effects due to the wave model of light, such as interference or diffraction (which are actually just different words for the same phenomenon: “interference” considers few light sources and “diffraction” considers an infinite number, or just “many”). The subject of ray optics may be subdivided into categories of “first-order,” “third-order,” and even higher-order optical computations. It also cannot explain other wave-propagation phenomena, such as total internal reflection. 2.1 What is an imaging system? As a simple definition, we may consider an imaging system to map the distribution of the input “object” to a “similar” distribution at the output “image” (where the meaning of “similar” is to be determined). Often the input and output amplitudes are represented in different units. For example, the input often is electromagnetic radiation with units of, say, watts per unit area, while the output may be a transparent negative emulsion measured in dimensionless units of “density” or “transmit- tance.” In other words, the system often changes the form of the energy; it is a “transducer.” In the ray model, we can think of the imaging system as “selecting” and/or “redirecting” rays of light to map the energy onto the image sensor. The “selection” or “redirection” process uses some type of physical interaction between light and matter to remap the energy emitted or modified by the object onto the sensor. Among the more obvious physical interactions in our experience are refraction and reflection, but these are not the only, nor even the simplest, possible mechanisms. The very simplest interaction between light and matter is absorption, where the light energy is transferred to matter and “disappears” (of course, it does not really “vanish,” but most often is converted into heat in the matter, but it is no longer available to create an image, so it may as well have “disappeared.” We can use an absorber to create the simplest imaging system: the pinhole camera 2.1.1 Simplest Imaging System — Pinhole in Absorber Consider a 3-D volume of space that contains the object. Occasionally, a ray of light emitted (or reflected) from a location in the volume is selected by the pinhole and reaches the sensor. every point in space is “in focus” on the sensor transverse magnification Mt determined by relative distances z2 MT = −z1 negative sign means image is inverted 5 6 CHAPTER 2 RAY (GEOMETRIC) OPTICS The number of rays from the object that actually reach the image is small. The interaction with the sensor requires the quantum model of discrete energy packets, so the number of packets is small if the hole diameter is small. If the object is a uniformly emitting planar source, the numbers of packets measured from different locations in the field are different (Poisson statistics); these numerical variations in what should be identical measurements appear as “noise.” The metric of noise is determined by the mean value μ of the signal and the variation about that mean, which is described by the standard deviation σ. The signal-to-noise ratio is a dimensionless quantity that may be defined many ways, but we’ll use a simple definition that will suit this purpose μ μ SNR = = √μ ≡ σ √μ More photons leads to larger signals (μ ) and larger standard deviation (σ ), but mean increases ↑ ↑ faster than the variance σ = √μ, so the SNR is better statistics and less relative noise “Quality” of image depends on diameter d0 of pinhole. Improve statistics by increasing the number of photons. Larger dose or larger pinhole. The “blur” quality of the image is better for smaller pinhole because less uncertainty in ray path. How to improve? Longer exposure time multiple pinholes Depth of field Redirect rays: reflective pinholes Reflection Refraction Diffraction (wave property), e.g., holography 2.2 First-Order Optics Of most concern to us will be “first-order,” “paraxial ” or “Gaussian” optics, where the angles of light rays measured relative to the optical axis are assumed to be small, so that the ray heights remain small as the rays propagate down the optical axis, which is the source of another common term of “paraxial optics,” meaning that the ray remains near the optical axis. In cases such that the ray angle θ ∼= 0, then we can approximate trigonometric functions by the first terms in their power-series expansions (the “Taylor series” ): 0 1 2 2 n (x x0) (x x0) df (x x0) d f 1 d f n f [x]= − f [x0]+ − + − + + (x x0) + 0! · 1! dx 2! dx2 ··· n! · dxn · − ··· Ã ¯x=x0 ! Ã ¯x=x0 ! ¯x=x0 n ¯ ¯ ¯ ∞ (x x0) ¯ ¯ ¯ = − f (n) [x ] ¯ ¯ ¯ n! 0 n=0 · X If the base value and the derivatives are evaluated at the origin, we have a “Maclaurin series:” ∞ 1 f [x]= f (n) [0] xn n! n=0 · X 2.2 FIRST-ORDER OPTICS 7 The Maclaurin series for the sine is: ∞ 1 dn sin [θ]= (sin [θ]) θn n! · dθn · n=0 ¯θ=0 X ¯ 1 0 1 ¯ 1 1 2 1 3 1 4 sin [θ]= sin [0] θ + (+¯ cos [0]) θ + ( sin [0]) θ + ( cos [0]) θ + (+ sin [0]) θ + 0! · · 1! · · 2! · − · 3! · − · 4! · · ··· θ3 θ5 =0+θ +0 +0+ − 3! 5! − ··· θ3 θ5 = θ + − 3! 5! − ··· θ3 θ5 = θ + − 6 120 − ··· Note that only odd powers of θ arepresentintheseriesforsin [θ], because the sine is an odd (antisymmetric) function that satisfies the condition sin [ θ]= sin [+θ]. − − The corresponding series for the even (or symmetric) cosine includes only even powers of θ: 2 4 2n θ θ ∞ θ cos [θ]=1 + = ( 1)n 2! 4! (2n)! − − ··· n=0 − X = lim cos [θ] =1 θ 0 ⇒ ∼= { } θ2 = cos [θ] 1 ⇒ ≡ − 2 So the approximation of the cosine with two terms is the difference of a constant and a parabola. The series for the (odd, antisymmetric) tangent is less commonly known and includes only the odd powers of θ: 3 2n θ 2 5 ∞ 2n 2 1 2n 1 tan [θ]=θ + + θ + = 2 − B2n θ − = lim tan [θ] = θ 3 15 ··· (2n)! ⇒ θ=0 { } n=0 ¡ ¢ ∼ X ¡ ¢ th where B isbthe Bernoulli number. The first-, third-, and fifth-order series approximations for the tangent are: π tan [θ] = θ for > θ 0 ∼ 2 | | ' θ3 tan [θ] = θ + ∼ 3 θ3 2 tan [θ] = θ + + θ5 ∼ 3 15 The validity of these approximations is perhaps more obvious from the graphs, where we can see that sin [θ] / θ and tan [θ] ' θ for small positive values of θ. 8 CHAPTER 2 RAY (GEOMETRIC) OPTICS 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 theta Comparison of θ (black), sin [θ] (red), and tan [θ] (blue) for 0 θ +0.5 radians, showing that ≤ ≤ sin [θ] / θ and tan [θ] ' θ over this domain. The corresponding first-order approximation to the cosine is the unit constant lim cos [θ] =1 θ 0 { } → 1.2 1.1 1.0 0.9 0.8 0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 0.16 0.18 0.20 theta The first-order approximation to cos [θ] (red) compared to the unit constant (black), showing that the two are very similar for small values of θ. The advantage of the first-order approxmation is that evaluation of the ray heights and angles becomes simple because of the proportionality. 2.3 THIRD-ORDER OPTICS 9 2.3 Third-Order Optics It likely is obvious from the definition of first-order optics that “third-order” optics includes the second term in the expansions: θ3 θ3 sin [θ] = θ = θ ∼ − 3! − 6 θ3 tan [θ] = θ + ∼ 3 θ2 θ2 cos [θ] = 1 =1 ∼ − 2! − 2 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 theta Comparison third-order approximations of sin [θ] (red), and tan [θ] (blue) to the linear term θ (black) . Note that the third-order approximation for the cosine is a biased parabola: 1.2 1.1 1.0 0.9 0.8 0.0 0.1 0.2 0.3 0.4 0.5 theta 2 cos [θ] (black) and its third-order approximation as 1 θ (red). − 2 10 CHAPTER 2 RAY (GEOMETRIC) OPTICS The results for ray angles using third-order optics will differ from those of first-order optics; these differences lead to image aberrations. 2.3.1 Higher-Order Approximations We clearly can add additional terms to the power series that will increase the accuracy of any calculations at the cost of significantly more complexity. 2.4 Notations and Sign Conventions One of the simplest and most difficult aspects of ray optics is the set of conventions to be adopted for all of the quantities to be measured. As in many aspects of optics, there are competing choices for conventions that have their own distinct advantages, but that lead to different equations for image locations, etc. We are going to use the directed distance convention, where distances are positive if measured from left to right. The problem becomes remembering which are the points measured “from” and “to,” respectively. The figure shows sign conventions for the different quantities. Note that in all cases, light travels from left to right in all media with positive refractive index (n>0),so the distances are positive if measured in the same direction of light travel and negative if measured in the other direction. Sign conventions for distances, heights, angles, and curvatures. The distance is positive if measured from left to right; the height is positive if the endpoint is above the axis; the angle from the axis or from a normal is positive if measured in the counterclockwise direction (positive θ); and the curvature is positive if its center is to the right of the vertex (intersection of the surface and the optical axis). Now consider the example in the figure where an optical system forms acts on a red “object” (the upright red arrow) located at the object point labeled by O to produce an “image” at O0.The horizontal black line is the line of symmetry of the optical system and is calle the “optical axis.” 2.4 NOTATIONS AND SIGN CONVENTIONS 11 Sign conventions for a specificcase:theobjectheightatO is positive, while the image height at O0 is negative. The angle θ of the (blue) ray from the base of the object to the (green) first surface is positive. The radius of curvature R of the first surface is positive. The front and rear surfaces of the optical system are shown in green; their intersections with the optical axis are the vertices of the system. The object space includes all features to the left of the vertex V that is closer to the object, so V is the object-space vertex of the imaging system. Similarly, the image space includes all features to the right of the vertex V0 that is closer to the image O0, so V0 is the image-space vertex. The ray shown in blue from the object O to the green optical surface makes an angle θ measured from the optical axis to the ray; since this angle is measured counterclockwise, it is a positive angle θ>0. The image-space ray from V0 to O0 measured from theaxisisaclockwiseangle,soθ0 < 0. The front surface of the optical system has a radius of curvature R that is measured from the vertex to the center of curvature, i.e. R =VC, where the overscored pair of letters denotes the distance from the first feature to the second. In this case, the distance from V to C is measured from left ot right, so VC R > 0. In the same manner, the distance from the rear vertex V0 to ≡ its center of curvature C0 is measured from right to left, so R0 V0C0 < 0; R0 is negative in this example. ≡ Two other features are shown in the figure that we have not yet described, one each in object and image space. F and F0 are object-space and image-space focal points, respectively. They are endpoints of the object-space and image-space focal lengths; the other endpoints are either the vertices (if the lenses are “thin”) or the principal points (which we shall label as H and H0, respectively). That discussion will have to wait until later. We will often have the need to propagate a light ray through an optical system consisting of asetofdifferent thin lenses or a set of surfaces separated by different media. The cascade of calculations requires distances measured from the object to the lens or front surface and from lens or back surface to the image. The need to express multiple distances will be addressed by both subscripts and “primed” notation, depending on context, where the “unprimed” notation will refer to the distance before the lens or surface and the “primed” notation to that after. When multiple surfaces are needed, the first will be denoted by the subscript “1,” the next by “2,” etc. Notation can also be a problem. The two different lower-case Greek letters for “phi” (straight φ and cursive ϕ)willbeusedindifferent ways: φ represents the “power” of a lens or surface and is 1 measured in reciprocal length, most commonly reciprocal meters m− , which is named the diopter. The cursive phi (ϕ) will be used to represent an angle, and therefore is dimensionless. The cursive letter f is used to represent a function, e.g., f [x, y, t], whereas the “straight” letter f will be used to denote the focal length with dimensions of length. This means that: 1 φ = f 2.4.1 Nature of Objects and Images: 1. Real Object: Rays incident on the lens are diverging from the source; the object distance is positive 12 CHAPTER 2 RAY (GEOMETRIC) OPTICS 2. Virtual Object: Rays into the lens are converging toward the “source” located “behind” the lens; object distance is negative 3. Real Image: Rays emerging from the lens are converging toward the image; image distance is positive 4. Virtual Image: Rays emerging from the lens are diverging, so that the “image” is behind the lens and the image distance is negative 2.5 HUMAN EYE 13 2.5 Human Eye Since this course considers optics of imaging systems, and since the images generated by many optical systems are viewed by human eyes, we need to at least introduce the optics of the eye; we will consider it in more detail when we trace rays through the “standard” eye model later. The optics of the human eye include the curved surface (the “cornea,” which exhibits most of the power of the system) and a deformable lens. The system is intended to form an image on the retina, which is a fixed distance from the cornea. The lens is deformed by action of ciliary muscles to change the plane that is viewed “in focus.” When the muscles are relaxed, the lens is “flatter,” i.e., the radii of curvature of the surfaces are larger. To view an object “close up,” the focal length of the eye lens must be shortened by making the lens shape more spherical. This is accomplished by tightening the ciliary muscles (which is the reason why your eyes get tired after an extended time of viewing objects up close). If the retina is located “too far” from the cornea, so that the image is “in front” of the retina when the muscles are relaxed, then the eye sees a “blurry” image of distant objects, but nearby objects may be well focused. This is the condition of “nearsightedness” or “myopia.” If the retina is “too close” to the cornea, the image is focused behind it and the eye sees distant objects more sharply (“hyperopia” or “farsightedness.”) 2.6 Principle of Least Time The mathematical model of ray optics is based on a principle stated by Fermat. Long before that, Hero of Alexandria hypothesized a model of light propagation that could be called the principle of least distance: A ray of light traveling between two arbitrary points traverses the shortest possible path in space. (Hero of Alexandria) This statement applies to reflection and transmission through homogeneous media (i.e., the medium is characterized by a single index of refraction). However, Hero’s principle is not valid if the object and observation points are located in different media (as is the normal situation for refraction) or if multiple media are present between the points. In 1657, Pierre Fermat modified Hero’s statement to formulate the principle of least time (which actually works): A light ray travels the path that requires the least time to traverse. (Fermat) The laws of reflection and refraction may be easily derived from Fermat’s principle. A moving ray 14 CHAPTER 2 RAY (GEOMETRIC) OPTICS (or car, bullet, or baseball) traveling a distance s at a velocity v requires t seconds: s t = v If the ray travels at different velocities for different increments of distance, the total travel time is the summation over the different distances and different velocities: M s t = m v m=1 m X c If we define the velocity of a light ray in a medium of index n to be v = . then: n M M sm 1 t = = (nmsm) c c c m=1 m=1 ≡ X nm X ³ ´ where the optical path length is defined: M (nmsm) m=1 ≡ X For a single medium, the optical path length is: