research papers
Aberration-free aspherical lens shape for shortening the focal distance of an already convergent beam
ISSN 1600-5775 John P. Sutter* and Lucia Alianelli
Diamond Light Source Ltd, Chilton, Didcot, Oxfordshire OX11 0DE, UK. *Correspondence e-mail: [email protected]
Received 25 May 2017 Accepted 14 August 2017 The shapes of single lens surfaces capable of focusing divergent and collimated beams without aberration have already been calculated. However, nanofocusing compound refractive lenses (CRLs) require many consecutive lens surfaces. Edited by M. Yabashi, RIKEN SPring-8 Center, Here a theoretical example of an X-ray nanofocusing CRL with 48 consecutive Japan surfaces is studied. The surfaces on the downstream end of this CRL accept X-rays that are already converging toward a focus, and refract them toward a Keywords: X-ray; lens; oval; nanofocusing; aberration. new focal point that is closer to the surface. This case, so far missing from the literature, is treated here. The ideal surface for aberration-free focusing of a convergent incident beam is found by analytical computation and by ray tracing to be one sheet of a Cartesian oval. An ‘X-ray approximation’ of the Cartesian oval is worked out for the case of small change in index of refraction across the lens surface. The paraxial approximation of this surface is described. These results will assist the development of large-aperture CRLs for nanofocusing.
1. Introduction Compound refractive lenses (CRLs) have been used to focus X-ray beams since Snigirev et al. (1996) demonstrated that the extremely weak refraction of X-rays by a single lens surface could be reinforced by lining up a series of lenses. As the X-ray focal spot size has been brought down below 1 mm, more lenses have been necessary to achieve the very short focal lengths required. Because the absorption of X-rays in the lens material is generally significant, it thus becomes critical to design the CRL with the shortest length possible for the given focal length in order to minimize the thickness of the refrac- tive material through which the X-rays must pass. In recent years, designs for novel nanofocusing lenses have been proposed. X-ray refractive lenses will deliver ideally focused beam of nanometer size if the following conditions can be satisfied: (1) The lens material should not introduce unwanted scat- tering. (2) The fabrication process should not introduce shape errors or roughness above a certain threshold. (3) Absorption should be minimized in order to increase the lens effective aperture. (4) The lens designs should not introduce geometrical aberrations. In response to the first three of these conditions, planar micro-fabrication methods including electron beam litho- graphy, silicon etch and LIGA have successfully been used to fabricate planar parabolic CRLs, single-element parabolic kinoform lenses (Aristov et al., 2000) and single-element elliptical kinoform lenses (Evans-Lutterodt et al., 2003) from
1120 https://doi.org/10.1107/S1600577517011808 J. Synchrotron Rad. (2017). 24, 1120–1136 research papers silicon. However, the best focus that can be obtained is CRL requires a much larger number of lenses because of the strongly dependent on material and X-ray energy. A colli- small refractive power and the short focal length. Evans- mating–focusing pair of elliptical silicon kinoform lenses with Lutterodt et al. (2007), in their demonstration of the ability of a focal length of 75 mm has successfully focused 8 keV kinoform lenses to exceed the numerical aperture set by the photons from an undulator source of 45 5 mm full width at critical angle, used Fermat’s theorem to calculate the ideal half-maximum (FWHM) into a spot of 225 nm FWHM shapes of their four lenses, but explicitly described the shape (Alianelli et al., 2011). On the other hand, although silicon of only the first lens (an ellipse). Sanchez del Rio & Alianelli lenses can be fabricated with very high accuracy, they are too (2012) pointed out the general answer, known for centuries, absorbing to deliver a focused beam below 100 nm at energy that the ideal shape of a lens surface for focusing a point values below 12 keV, unless kinoform lenses with extremely source to a point image is not a conic section (a curve small sidewalls can be manufactured. As a result, diamond has described by a second-degree polynomial). Rather, it is a come to be viewed as a useful material for CRLs. Diamond, Cartesian oval, which is a type of quartic curve (i.e. a curve along with beryllium and boron, is one of the ideal candidates described by a fourth-degree, or quartic, polynomial). An to make X-ray lenses due to good refractive power, low array of sections of Cartesian ovals is therefore one possible absorption and excellent thermal properties. Planar refractive solution to the task of designing an X-ray lens for single-digit lenses made from diamond were demonstrated by No¨hammer nanometer focusing. Previous authors in X-ray optics have not et al. (2003). Fox et al. (2014) have focused an X-ray beam of calculated analytical solutions (which do exist), but instead 15 keV to a 230 nm spot using a microcrystalline diamond lens, relied on numerical calculations of the roots without asking and an X-ray beam of 11 keV to a 210 nm spot using a how well conditioned the quartic polynomial is; that is, how nanocrystalline diamond lens. Designs of diamond CRLs stable the roots of the polynomial are against small changes in proposed by Alianelli et al. (2016) would potentially be its coefficients. However, in this paper it will be shown that capable of focusing down to 50 nm beam sizes. Many technical finite numerical precision can cause errors in the calculation of problems remain to be solved in the machining of diamond; the Cartesian oval when the change in refractive index across however, our current aim is to provide an ideal lens design the lens surface becomes very small, as is usually the case with to be used when the technological issues are overcome. We X-rays. Moreover, it has not been made explicit in the litera- assume that technology in both reductive and additive tech- ture when it is reasonable to approximate the ideal Cartesian niques will advance in the coming decade and that X-ray oval with various conic sections (ellipses, hyperbolas or refractive lenses with details of several tens of nanometers will parabolas). As a result, for the sake of rigor, the authors have be fabricated. This will make X-ray refractive optics more considered it worthwhile to find the analytical solutions competitive than they are today for the ultra-short focal explicitly. This has not been done before in any recent papers. lengths. When that happens, lens designs that do not introduce Finally, Alianelli et al. (2015) state that no analytical solution aberrations will be crucial. exists for a lens surface that accepts an incident beam The aim of this paper is to define a nanofocusing CRL with converging to a point and that focuses this beam to another the largest possible aperture that can be achieved without point closer to the lens surface. This paper will concentrate on introducing aberrations to the focus. The determination of the that very case and will show that in fact such a lens surface can ideal shape of each lens surface becomes more critical as the be described by a Cartesian oval. The existence of such desired aperture grows. Suzuki (2004) states that the ideal lens solutions removes the necessity of using pairs of lens surfaces surface for focusing a plane wave is an ellipsoid, although in of which the first slightly focuses the beam and the second fact this is true only if the index of refraction increases as the collimates the beam again. X-rays cross the surface [see Sanchez del Rio & Alianelli Schroer & Lengeler (2005) proposed the construction of (2012) and references therein, as well as x2.4 of this paper]. ‘adiabatically’ focusing CRLs, in which the aperture of each The same author also proposes the use of two ellipsoidal lens follows the width of the X-ray beam as the beam lenses for point-to-point focusing, but a nanofocusing X-ray converges to its focus. Fig. 1 displays a schematic of an adia-
Figure 1 Schematic drawing of an adiabatically focusing CRL for X-rays. The X-ray beam runs along the central axis from left to right. Ai , Ri and Li are, respectively, the geometrical aperture, the radius at the apex and the length along the beam direction of the ith lens surface. If the lens surfaces are 2 assumed to be parabolic as in Schroer & Lengeler (2005), Li = Ai =ð8RiÞ. q1i and q2i are, respectively, the distance of the object and the distance of the image of the ith lens surface from that surface’s apex.
J. Synchrotron Rad. (2017). 24, 1120–1136 Sutter and Alianelli Aberration-free aspherical lens shape 1121 research papers
Table 1 following treatment. The loss of numerical precision when Lens surfaces proposed for a diamond nanofocusing CRL for X-rays of using the exact analytical solutions of the Cartesian oval at energy 15 keV. small will be avoided by an approximation of the quartic The distance from the apex of the last lens surface to the final focal plane is Cartesian oval equation to lowest order in . The cubic 11.017 mm. i is the place of each surface in the CRL. R is the radius at the i equation resulting from this ‘X-ray approximation’ will be apex. fi is the focal length. Ai is the geometrical aperture. q1i is the downstream distance of the object of lens surface i. (A negative value of q1i means that the shown to be numerically stable. The approximate cubic and object of lens surface i is upstream.) q2i is the downstream distance of the the exact quartic equation will be shown to agree when the image of lens surface i. The distance between the apices of consecutive lens latter is numerically tractable. As in Sanchez del Rio & surfaces is 0.005 mm. For legibility, Ri and Ai are rounded to four significant digits, and fi, q1i and q2i are rounded to three decimal places. Alianelli (2012), conic approximations will be made to the ideal surface in the paraxial case. The results of this paper iRi (mm) fi (mm) Ai (mm) q1i (mm) q2i (mm) agree with theirs in showing that Cartesian ovals, even when 1 0.05 15484.670 0.075 46999.864 23092.905 calculated by the cubic ‘X-ray’ approximation, introduce no 2 0.045 13936.203 0.07425 23092.900 8691.196 3 0.0405 12542.583 0.07351 8691.191 5133.801 detectable aberrations, and that elliptical or hyperbolic lens 4 0.03645 11288.325 0.07277 5133.796 3528.896 surfaces introduce less aberration than the usually used 5 0.03281 10159.492 0.07204 3528.891 2619.136 parabolic surfaces. However, this paper will also demonstrate 6 0.02952 9143.543 0.07132 2619.131 2035.944 7 0.02657 8229.189 0.07061 2035.939 1632.140 that at sufficiently high apertures even the elliptical or 8 0.02391 7406.270 0.06990 1632.135 1337.408 hyperbolic approximation will produce visible tails in the focal 9 0.02152 6665.643 0.06921 1337.403 1113.907 spot. This is especially true for surface 48, the final surface and 10 0.01937 5999.078 0.06851 1113.902 939.463 11 0.01743 5399.171 0.06783 939.458 800.220 the one with the smallest focal length, where the elliptical and 12 0.01569 4859.254 0.06715 800.215 687.069 hyperbolic approximation produces tails in the focus at an 13 0.01412 4373.328 0.06648 687.064 593.780 aperture only slightly larger than that given in Table 1. 14 0.01271 3935.995 0.06581 593.775 515.941 15 0.01144 3542.396 0.06516 515.936 450.345 At the end of this treatment, the focal spot profiles calcu- 16 0.01029 3188.156 0.06450 450.340 394.601 lated by ray tracing for the compound refractive lens (CRL) of 17 0.009265 2869.341 0.06386 394.596 346.891 Table 1 will be compared with the diffraction broadening that 18 0.008339 2582.407 0.06322 346.886 305.808 19 0.007505 2324.166 0.06259 305.803 270.245 inevitably results from the limited aperture. The absorption in 20 0.006754 2091.749 0.06196 270.240 239.322 the lens material limits the passage of X-rays through a CRL 21 0.006079 1882.574 0.06134 239.317 212.325 to an effective aperture A that is smaller than the geome- 22 0.005471 1694.317 0.06073 212.320 188.677 eff 23 0.004924 1524.885 0.06012 188.672 167.898 trical aperture. This in turn restricts the numerical aperture 24 0.004432 1372.397 0.05952 167.893 149.592 (NA) of a CRL of N surfaces to a value Aeff /(2q2N), where q2N 25 0.003988 1235.157 0.05893 149.587 133.428 is the distance from the last (Nth) surface to the final focus. 26 0.003590 1111.641 0.05834 133.423 119.125 27 0.003231 1000.477 0.05775 119.120 106.446 Lengeler et al. (1999) derive a FWHM of 0.75 /(2NA) for the 28 0.002908 900.429 0.05718 106.441 95.189 Airy disk at the focal spot. This yields the diffraction broad- 29 0.002617 810.387 0.05660 95.184 85.179 ening and hence the spatial resolving power of the CRL. 30 0.002355 729.348 0.05604 85.174 76.268 31 0.002120 656.413 0.05548 76.263 68.325 Formulas for the effective aperture of a CRL in which all lens 32 0.001908 590.772 0.05492 68.320 61.238 surfaces are identical have been derived by Lengeler et al. 33 0.001717 531.695 0.05437 61.233 54.909 (1998, 1999), and Schroer & Lengeler (2005) have derived 34 0.001545 478.525 0.05383 54.904 49.253 35 0.001391 430.673 0.05329 49.248 44.194 formulas for the effective aperture of an adiabatically focusing 36 0.001252 387.605 0.05276 44.189 39.667 CRL. Very recently Kohn (2017) has re-examined the calcu- 37 0.001126 348.845 0.05223 39.662 35.613 lation of the effective aperture, surveying the various defini- 38 0.001014 313.960 0.05171 35.608 31.981 39 0.0009124 282.564 0.05119 31.976 28.725 tions appearing in the literature and distinguishing carefully 40 0.0008212 254.308 0.05068 28.720 25.806 between one-dimensionally and two-dimensionally focusing 41 0.0007390 228.877 0.05017 25.801 23.187 CRLs. In this paper the effective aperture and the numerical 42 0.0006651 205.989 0.04967 23.182 20.837 43 0.0005986 185.390 0.04917 20.832 18.728 aperture will be estimated numerically by ray tracing, taking 44 0.0005388 166.851 0.04868 18.723 16.834 full account of the absorption to which each ray is subjected 45 0.0004849 150.166 0.04820 16.829 15.133 along the path from the source to the focus. For simplicity, the 46 0.0004364 135.150 0.04771 15.128 13.605 47 0.0003928 121.635 0.04724 13.600 12.232 lens surfaces will all be assumed to be one-dimensionally 48 0.0003535 109.471 0.04676 12.227 10.999 focusing parabolic cylinders. It will be shown that the parabola is an adequate approximation to the ideal Cartesian oval within the effective aperture of the CRL in Table 1. batic CRL. A potential example of such a CRL, which would be made from diamond, is given in Table 1. This example 2. Principles treats X-rays of energy 15 keV, for which diamond has an index of refraction 1 where = 3.23 10 6. The very 2.1. Definitions and derivation of ideal lens surface small difference between the index of refraction of diamond The first task of this article is to calculate the exact surface and that of vacuum is typical for X-ray lenses. Calculations of yðxÞ of a lens that bends an already convergent bundle of rays surfaces 2, 24 and 48 of Table 1 will be demonstrated in the into a new bundle of rays converging toward a closer focus.
1122 Sutter and Alianelli Aberration-free aspherical lens shape J. Synchrotron Rad. (2017). 24, 1120–1136 research papers
One can rearrange this to find an expression for the surface slope y0ðxÞ, which will be useful for design calculations once a solution for yðxÞ has been obtained, 0 no no n 2 1=2 2 1=2 y0ðxÞ¼ x x2 þ q þ yðxÞ x2 þ q þ yðxÞ n 1 2 . no 1=2 2 2 q1 þ yðxÞ x þ q2 þ yðxÞ 0 no n 2 1=2 q þ yðxÞ x2 þ q þ yðxÞ : ð7Þ n 2 1 This is a nonlinear differential equation, but nevertheless it can be solved by noticing that the numerators of each fraction of equation (6) are the derivatives of that fraction’s denomi- nator. A simple variable substitution thus presents itself, Figure 2 Schematic drawing of surface yðxÞ (to be calculated in this paper) across 2 1 V ðxÞ¼x2 þ q þ yðxÞ ! V 0ðxÞ¼x þ y0ðxÞ q þ yðxÞ ; which incident rays in a medium of refractive index n converging to a 1 1 2 1 1 focus F are refracted into rays in a medium of refractive index n0 1 ð8Þ converging to a new closer focus F2. q1 and q2 are, respectively, the distances of F and F from the coordinate origin O along the central axis, 1 2 which is parallel to y^. x^ and y^ are the coordinate unit vectors. P is an 2 1 ^ 2 0 0 arbitrary point along yðxÞ. ^tP and NP are, respectively, the unit tangent V2ðxÞ¼x þ q2 þ yðxÞ ! V2ðxÞ¼x þ y ðxÞ q2 þ yðxÞ : ^ ^ 0 2 and the unit inward normal to yðxÞ at P. kP and kP are, respectively, the unit wavevector of the incident ray and the refracted ray passing through ð9Þ 0 P. P is the angle of the incident ray to the inward normal at P, and P is the angle of the refracted ray to the inward normal at P. As the initial condition, one may set yðx ¼ 0Þ = 0 as shown in 2 2 Fig. 2. In that case, V1ðx ¼ 0Þ = q1 and V2ðx ¼ 0Þ = q2 . Inte- gration of both sides of equation (6) starting from x = 0 can The required quantities are labelled and defined in Fig. 2. then be written According to Snell’s Law taken for the rays at an arbitrary Zx Zx point P, 1 V 0ðsÞ 1 n0 V 0ðsÞ 1 2 1=2 ds ¼ 1=2 ds; ð10Þ sin n0 2 V ðsÞ 2 n V ðsÞ P ¼ : ð1Þ 0 1 0 2 sin 0 n P 0 where s is a dummy variable. Now, V1ðsÞ ds =dV1 and Let the coordinate vector of P be ½x ^ þ yðxÞ^ . Fig. 2 shows 0 x y V2ðsÞ ds =dV2, allowing equation (10) to be rewritten in the that very simple form x x^ q þ yðxÞ y^ 2 2 2 2 k^ ¼ no1 ; ð2Þ x þ½ qZ1þyðxÞ x þ½ qZ2þyðxÞ P 1=2 1 dV 1 n0 dV 2 2 ffiffiffiffiffi1 ffiffiffiffiffi2 x þ q1 þ yðxÞ p ¼ p : ð11Þ 2 V1 2 n V2 q2 q2 1 2 x x^ q þ yðxÞ y^ ^ 0 no2 The integrals on both sides of equation (11) are elementary kP ¼ 1=2 ; ð3Þ 2 2 x þ q2 þ yðxÞ and yield the result nono 1=2 n0 1=2 n0 x2 þ q þ yðxÞ 2 q ¼ x2 þ q þ yðxÞ 2 q : ^ þ y0ðxÞ^ 1 1 2 2 ^ x y n n tP ¼ 1=2 : ð4Þ 1 þ ½ y0ðxÞ 2 ð12Þ It is also seen in Fig. 2 that Equation (12) describes a Cartesian oval with the two foci F1 and F2 shown in Fig. 2. Note that the distances of any point ^ 2 2 1=2 sin P kP ^tP on yðxÞ from F1 and F2 are r1 = fx þ½q1 þ yðxÞ g and r2 = ¼ : ð5Þ 2 1=2 0 ^ 0 fx2 þ½q þ yðxÞ g , respectively. Equation (12) can then be sin P kP ^tP 2 written in the standard form for a Cartesian oval (Weisstein, Substitution of equations (1)–(4) into (5) yields the first-order 2016), ordinary differential equation 0 1 n0 n0 n0 r1 r2 ¼ q1 q2 if q1 q2 > 0; 0 0 0 n n n x þ y ðxÞ q1 þ yðxÞ n B x þ y ðxÞ q2 þ yðxÞ C ð Þ no@noA 0 0 0 13 1=2 ¼ 1=2 : ð6Þ n n n 2 2 n 2 2 r r ¼ q q if q q < 0: x þ q1 þ yðxÞ x þ q2 þ yðxÞ n 2 1 n 2 1 1 n 2
J. Synchrotron Rad. (2017). 24, 1120–1136 Sutter and Alianelli Aberration-free aspherical lens shape 1123 research papers
0 The case ½q1 ðn =nÞq2 = 0 may be of some interest, and is surface. It was thus decided to solve equation (17) for yðxÞ physically achievable for the problem of this article (q1 > q2) explicitly. Writing equation (17) in powers of y yields if n0 > n. In this case, equation (12) can be squared and re- "#"#"# arranged into 2 n0 2 n0 2 n0 2 &' 4 3 0 2 2 0 2 2 1 y þ 41 q1 q2 y 2 q1 ðÞn =n q2 q1 ðÞn =n q2 n n n x þ yðxÞþ 2 ¼ 2 ; ð14Þ ( "#"#"# 0 0 2 1 ðÞn =n 1 ðÞn =n 0 2 0 2 0 2 n 2 n 2 n 2 0 2 0 0 þ 21 x þ 41 q1 q2 and q1 ðn =nÞ q2 = ðn =nÞ½ðn =nÞ 1 q2 < 0. This is a circle n n n of radius centred at ð0; Þ, where ) 2 n0 n0 2 2 q ðÞn0=n q ðn0=nÞ q þ 4 1 q1q2 y ¼ 1 2 ¼ q ¼ 1 : ð15Þ n n 1 ðÞn0=n 2 1 þðn0=nÞ 2 1 þðn0=nÞ ( "# "# n0 2 n0 2 þ 41 q q x2 n 1 n 2 ) n0 n0 n0 þ 8 1 q q q q y 2.2. Closed-form solutions of ideal lens surface n n 1 2 1 n 2 "# ( 2.2.1. Derivation of algebraic equation. By adding q to 2 1 n0 2 n0 n0 ÀÁ both sides of equation (12) and then squaring the equation, 4 2 2 þ 1 x þ 4 q1 þ q2 one obtains n n n "# ) ! 2 2 0 2 n 2 x þ q1 þ yðxÞ ¼ þ 1 þ q q x ¼ 0: ð18Þ n 1 2 0 2no n 2 2 x þ q2 þ yðxÞ n 0 0 no n n 2 1=2 The calculation of yðxÞ therefore amounts to finding the roots þ 2 q q x2 þ q þ yðxÞ n 1 n 2 2 of the quartic polynomial equation (18) for any x. As a quartic polynomial equation with real coefficients, equation (18) is n0 2 þ q1 q2 : ð16Þ guaranteed to have four solutions, of which n – all four may be real, or – two may be real, while the other two are complex and conjugates of each other, or Rearranging this equation to put the radical alone on one side, – all four may be complex, forming two pairs of complex then squaring it again, yields conjugates. 0 4 No more than one of these roots can satisfy the original 4 n 4 q þ yðxÞ þ q þ yðxÞ equation of the Cartesian oval, equation (12). To be physically 1 n 2 significant, that root must be real. If equation (18) produces no 0 2 n 2 2 real root that satisfies equation (12) when calculated at some 2 q þ yðxÞ q þ yðxÞ n 1 2 particular x, the ideal lens surface does not exist at that x.This ()"# 0 2 0 2 raises the possibility that the ideal lens surface may be n n 2 þ 21 x2 q q q þ yðxÞ bounded; that is, it has a maximum achievable aperture. n 1 n 2 1 2.2.2. Calculation of roots of quartic polynomial equation. ()"# 0 2 0 2 0 2 Analytical procedures for calculating the roots of cubic and n n n 2 2 1 x2 þ q q q þ yðxÞ n n 1 n 2 2 quartic equations were worked out in the 16th century. ("#"# Nonetheless, explicit solutions of such equations are published 2 n0 2 n0 2 n0 2 so rarely that a detailed description of the method will be þ 1 x4 21þ q q x2 n n 1 n 2 useful for the reader. Note that the procedure for quartic ) equations includes the determination of one root of a cubic n0 4 equation. Many standard mathematical texts explain the þ q q ¼ 0: ð17Þ 1 n 2 solution of cubic and quartic equations; Weisstein (2016) has been followed closely here. Equation (17) is a quartic polynomial equation in both x and y. The first step in solving a general quartic equation It can be written as a quadratic equation in x2, since no odd ay4 þ by3 þ cy2 þ dy þ e = 0 is the application of a coordinate powers of x appear in it, and by using the quadratic formula a transformation to a new variable y given by closed-form expression of xðyÞ can be calculated. However, b the inversion of this function to obtain yðxÞ is difficult, and yðxÞ y ¼ y þ ; ð19Þ would be far more useful for design calculations of the lens 4a
1124 Sutter and Alianelli Aberration-free aspherical lens shape J. Synchrotron Rad. (2017). 24, 1120–1136 research papers
"# 4 and the division of both sides of the general equation by a such n0 2 ÀÁ that one obtains a ‘depressed quartic’, that is, a quartic with r ¼ 1 Rx2 4 2 n no cubic term, in y . This has the form y þ py þ qy þ r =0, "#("# 4 4 where n0 2 n0 2 ¼ 1 1 x4 n n 2 "#"# 1 3b 2 p ¼ þ c ; n0 2 n0 2 a 8a 21 1 þ 2 q2x2 n n 1 "#"# 2 0 2 0 2 0 2 n n n 2 2 2 1 2 þ q2x 1 b3 bc n n n q ¼ þ d ; ð20Þ "#"# a 8a2 2a 2 n0 n0 2 n0 n0 2 þ 4 1 1 þ þ q q x2 n n n n 1 2 "# "# n0 2 n0 n0 n0 2 1 3b4 b2c bd þ 1 4 q4 4 1 3 q3q r ¼ þ þ e : 1 1 2 3 2 n n n n a 256a 16a 4a "# 0 2 0 0 2 0 3 0 4 n n n n n 2 2 þ 2 2 4 5 4 þ 2 q1q2 By substituting the coefficients of equation (18) into equations n n n n n "# (19) and (20), one obtains the coordinate transformation, 5 2 n0 n0 n0 þ 4 3 þ q q3 n n n 1 2 0 2 "#) q1 ðÞn =n q2 y ¼ y ð21Þ n0 6 n0 2 1 ðÞn0=n 2 þ 4 þ q4 : ð24Þ n n 2 and the depressed equation in y , which has the following The strategy now is to add to both sides of the depressed coefficients, equation a quantity ð y 2u þ u2=4Þ, where u is a real quantity that will be determined shortly. Knowing that "# 4 2 2 2 2 2 ð y þ y u þ u =4Þ = ð y þ u=2Þ , one obtains from the n0 2 ÀÁ p ¼ 1 Px2 depressed equation the following, n 2 "# ( "# "# 2 1 2 1 2 2 2 2 2 2 y þ u ¼ ðÞu p y qy þ u r : ð25Þ n0 n0 n0 2 4 ¼ 1 21 x2 21þ 2 q2 n n n 1 "# The left side of equation (25) is thus a perfect square. Notice n0 2 n0 2 that the right side of equation (25) is a quadratic equation. 2 2 þ q2 n n 2 Therefore it too will be a perfect square if u can be chosen to "# ) make its two roots equal; that is, if its discriminant D equals 2 n0 n0 n0 zero, þ 4 1 þ þ q q ; ð22Þ n n n 1 2 1 D ¼ q2 4ðÞu p u2 r 4 ÀÁ ¼ u3 pu2 4ru þ 4pr q2 ¼ 0: ð26Þ "# 3 Equation (26) is known as the ‘resolvent cubic’. As a cubic n0 2 equation with real coefficients, it is guaranteed to have three q ¼ 1 Q n roots, of which either one or all will be real. The analytical "#( 3 2 3 solution of a general cubic equation u þ fu þ gu þ h =0 n0 2 n0 2 n0 4 begins with the calculation of two quantities A and B, of which ¼ 1 8 q3 8 q3 n n 1 n 2 the general formula is shown on the left, and the value for the resolvent cubic is shown on the right, 0 2 0 n n 2 8 1 þ 2 q1q2 2 n n 3g f 4 1 2 ) A ¼ ¼ r p ; 9 3 9 n0 3 n0 ð27Þ þ 8 2 þ q q2 ; ð23Þ 9fg 27h 2f 3 4 1 1 n n 1 2 B ¼ ¼ pr þ q2 þ p3: 54 3 2 27
J. Synchrotron Rad. (2017). 24, 1120–1136 Sutter and Alianelli Aberration-free aspherical lens shape 1125 research papers
"# ( 3 2 2 2 1=2 The discriminant of a general cubic equation is Dc = A þ B . ÀÁ n0 1 ÀÁ 4 ÀÁ 1 ÀÁ The next step depends on the value of D . u x2 ¼ 1 Px2 þ 2 Rx2 þ P2 x2 c 1 n 3 3 9 (i) Dc > 0. One root of the cubic equation is real and the "#) ! other two are complex conjugates. Theffiffiffiffiffiffi real root is 1 4 PxðÞ2 RxðÞþ2 1 Q2 þ 1 P3ðÞx2 p 1=3 3 2 27 ð1=3Þ f þ S þ T, where S = ðB þ D Þ and T = cos arccos 3=2 : pffiffiffiffiffiffi c 3 4 2 1 2 2 1=3 3 RxðÞþ9 P ðÞx ðB Dc Þ . For the resolvent cubic, the real root u1 is ð32Þ "# 2 ÀÁ 0 2 ÀÁ ÀÁ ÀÁ Note that in all these cases one may write 2 1 n 2 2 2 u1 x ¼ 1 Px þ Sx þ Tx ; "# 3 n 2 2 ÀÁ n0 ÀÁ "# 2 2 2 u1 x ¼ 1 U1 x : n0 2 4 ÀÁÀÁ 1 1 ÀÁ n S ¼ 1 Px2 Rx2 þ Q2 þ P3 x2 n 3 2 27 Substitution of u into equation (25) then yields a perfect ( 1 4 ÀÁ 1 ÀÁ3 square on both sides, 2 2 2 þ Rx P x 2 2 3 9 2 1 q ) ! y þ u1 ¼ ðÞu1 p y : ð33Þ 1=2 1=3 2 2ðÞu p 4 ÀÁÀÁ 1 1 ÀÁ2 1 þ Px2 Rx2 þ Q2 þ P3 x2 ; 3 2 27 If u1 > p, then equation (33) falls into two cases. In the first, "# one simply equates the square root of both sides, 2 0 2 ÀÁÀÁ ÀÁ n 4 2 2 1 2 1 3 2 1 q T ¼ 1 Px Rx þ Q þ P x y 2 þ u ¼þðÞu p 1=2 y : ð34Þ n 3 2 27 2 1 1 2ðÞu p ( 1 3 4 ÀÁ 1 ÀÁ Equation (34) is a quadratic equation in y . Its two solutions Rx2 P2 x2 3 9 are ) ! 1=2 2 1=2 1=3 ÀÁÀÁ ÀÁ 1 1=2 1 2q 4 2 2 1 2 1 3 2 y ¼ ðÞu p ðÞ u þ p þ Px Rx þ Q þ P x : ð28Þ I 2 1 2 1 ðÞu p 1=2 3 2 27 "#( 1 1 0 2 ÀÁ ÀÁ n 1 2 2 1=2 ¼ 1 U1 x Px ð35Þ (ii) Dc ¼ 0. All roots of the cubic equation are real and at n 2 ! ) least two are equal. S and T in equation (28) are then equal. 1=2 1 ÀÁ ÀÁ 2Q Thus, for the resolvent cubic, 2 2 U1 x þ Px 1=2 : 2 U ðÞ x2 PxðÞ2 "# 1 2 ÀÁ 1 n0 2 ÀÁ ÀÁ In the second case, one equates the square root of the left side u x2 ¼ 1 Px2 þ 2Sx2 1 3 n of equation (33) with the negative of the square root of the "# 2 right side, so that n0 2 S ¼ 1 ð29Þ 1 1=2 q y 2 þ u ¼ ðÞu p y : ð36Þ n 1 1 ðÞ &' 2 2 u1 p 4 ÀÁÀÁ 1 1 ÀÁ 1=3 Px2 Rx2 þ Q2 þ P3 x2 : Equation (36), like equation (34), is a quadratic equation in y . 3 2 27 Its two solutions are 1 1 2q 1=2 (iii) D < 0. All roots of the cubic equation are real and y ¼ ðÞu p 1=2 ðÞþu þ p c II 2 1 2 1 ðÞu p 1=2 unequal. In this case, an angle ’ is defined such that "# 1 1 0 2 ÀÁ ÀÁ n 1 1=2 ¼ 1 U x2 Px2 ð37Þ B n 2 1 ’ ¼ arccos pffiffiffiffiffiffiffiffiffi : ð30Þ 3 ()! A 1=2 ÀÁ ÀÁ 1 2 2 2Q U1 x þ Px þ : 2 2 2 1=2 One of the real roots of the general cubic equation is then U1ðÞ x PxðÞ given by y I and y II are the four solutions of the depressed quartic. The explicit expressions in P, Q and U1 were calculated pffiffiffiffiffiffiffiffi ’ 1 0 2 u ¼ 2 A cos f; ð31Þ assuming that 1 ðn =nÞ > 0; however, the same expressions 1 3 3 are also valid if 1 ðn0=nÞ2 < 0. The only change is that the
explicit expression for y I would appear like that for y II in which for the resolvent cubic yields equation (37), and the explicit equation for y II would appear
1126 Sutter and Alianelli Aberration-free aspherical lens shape J. Synchrotron Rad. (2017). 24, 1120–1136 research papers
0 like that for y I in equation (35). Because this does not affect zero at x = 0, because then the constant (y ) term vanishes but the solution, it will not be mentioned further. the linear (y1) term does not.
The solutions of the original quartic equation (18) are easily If u1 = p, the formulas for the roots become somewhat obtained from equations (35) and (37) by using equation (21), simpler, "# "# "# "# 1 1 0 2 0 2 n0 2 n0 2 n n yI ¼ 1 q1 q2 yI ¼ 1 q1 q2 n n n n ! no ÀÁ ÀÁ ÀÁ ÀÁ ÀÁ 1=2 1 2 2 1=2 1 2 2 2 2 1=2 þ U1 x Px ð38Þ 2Px 2 P x 4Rx ; ð40Þ 2 ()! 2 1=2 1 ÀÁ ÀÁ 2Q "# "# 2 2 1 U1 x þ Px ; 0 2 0 2 2 2 2 1=2 n n U1ðÞ x PxðÞ y ¼ 1 q q II n 1 n 2 ! "# "# noÀÁ ÀÁ ÀÁ 1 1 1=2 1=2 n0 2 n0 2 2Px2 þ 2 P2 x2 4Rx2 : ð41Þ y ¼ 1 q q 2 II n 1 n 2 ÀÁ ÀÁ These roots must also be checked to determine which one 1 2 2 1=2 U1 x Px ð39Þ fulfills equation (12) for the Cartesian oval. 2 ()! 1=2 If u < p, then equation (33) can have a real solution only ÀÁ ÀÁ 1 1 2 2 2Q if both of the squared factors are equal to zero. This would U1 x þ Px þ : 2 2 2 1=2 2 U1ðÞ x PxðÞ require that y þ u1=2=0andy q=½2ðu1 pÞ = 0 simulta- neously. If that is not possible for the values of p, q and u1 In principle, any of these roots could be the one that satisfies calculated above, then no real solution exists. the original equation for the Cartesian oval, equation (12). Fig. 3 shows a set of examples of the exact solutions in The simplest way to find this root is to evaluate equations (38) equations (38) and (39) for various values of n0=n. The solu- and (39) at x = 0. The root that equals zero there is the correct tions clearly appear in two disconnected sheets, one inner one. Equation (18) will have one and only one root equal to and one outer. However, only one of these sheets satisfies
Figure 3 0 Examples of solutions of the Cartesian oval equation (18) at various values of n =n for a representative set of values for q1 (23.0929 m) and q2 (8.6912 m). The solutions ‘yðxÞ’ were calculated by using the exact equations (38) and (39) for x > 0; note that yð xÞ = yðxÞ. The solutions ‘xðyÞ’ were calculated by solving equation (18) as a quadratic equation in x2 with y-dependent coefficients [see equation (45)]. The top row demonstrates three cases in which n0=n > 1 and the bottom row demonstrates three cases in which n0=n < 1. In each row, n0=n ! 1 from left to right. The sheet labelled ‘surface of lens’ is the one that fulfills the original lens equation (12).
J. Synchrotron Rad. (2017). 24, 1120–1136 Sutter and Alianelli Aberration-free aspherical lens shape 1127 research papers the original lens equation (12). If n0=n > 1, it is the inner sheet; [For the X-ray case where n0=n ’ 1, this condition reduces 0 0 if n =n < 1, it is the outer sheet. It is evident that as n =n ! 1 approximately to j2ð"yÞ=ðq1 q2Þj 1.] The binomial the outer sheet becomes very much larger than the inner theorem yields sheet. How large the outer sheet becomes at x = 0 can be 1=2 1 1 1 5 estimated as follows. First, define the following quantities from ðÞ1 þ z ’ 1 þ z z2 þ z3 z4 þ ...: ð46Þ 2 8 16 128 equations (22)–(24), Hence 1 ð1 þ zÞ1=2 ’ ð1=2Þz þð1=8Þz2 ð1=16Þz3 plus 2 P0 ¼ lim PxðÞ¼ ¼ 0 6ðÞq1 q2 ; ð42Þ higher-order terms that will be discussed later. Note that this n0=n!1 has no term in z0. Substituting this into equation (45) and 3 summing terms with the same power of ð"yÞ on the right-hand Q0 ¼ lim Q ¼ 8ðÞq1 q2 ; ð43Þ n0=n!1 side, one obtains the approximate equation ðÞ"x 2 ’ C ðÞþ"y C ðÞ"y 2 þ C ðÞ"y 3: ð47Þ R ¼ lim RxðÞ¼ ¼ 0 3ðÞq q 4: ð44Þ 1 2 3 0 0 1 2 n =n!1 For the linear term on the right-hand side, one obtains Substitution of these values into equation (27) yields A =0 21 ðÞ n0=n ðÞn0=n 2 þ ðÞn0=n 3 q q and B = 0. Thus the discriminant D of the resolvent cubic C ¼ 1 2 : ð48Þ c 1 0 ðÞn =n q1 q2 is zero, and from equation (29) one can define U10 = 0 limn0=n!1U1ðx ¼ 0Þ = ð1=3ÞP0. Because P0 <0,U10 > P0 and As n =n ! 1, this quantity becomes very small as the terms 2 equations (38) and (39) apply. Letting " =1 ðn0=nÞ in the numerator almost cancel out. Equation (48) therefore and recalling the initial assumption q1 > q2, one finds three becomes subject to numerical errors caused by limited preci- solutions that remain bounded while the fourth solution sion. However, remembering that in the X-ray case ðn0=nÞ = 1 yII ðx ¼ 0Þ diverges as 4" ðq1 q2Þ. This sensitivity of the 1 þ where j j is much less than 1, one can make a power fourth root on the exact value of " makes the quartic equation series expansion of C1 in . The lowest term of this power (18) ill-conditioned. Therefore the numerical evaluation of series is equations (38) and (39) is very sensitive to roundoff errors 4q1q2 2 caused by the limited precision in cases in which " is very C1 ’ : ð49Þ ðÞq1 q2 small, even though the equations themselves remain theore- tically exact. Such cases are not only common but normal in For the quadratic term on the right-hand side of equation (47), X-ray optics, for which = ðn0=nÞ 1 generally has a magni- one obtains 5 ( "# tude on the order of 10 or less. An approximation that n0 n0 2 n0 n0 can capture the three bounded roots of equation (18) with C ¼ 1 q3 2 1 q2q 2 n n 1 n n 1 2 high accuracy while ignoring the divergent root is therefore "# ) justified. n0 n0 n0 2 2 1 q q2 þ 1 q3 n n 1 2 n 2 2.3. The ‘X-ray approximation’ to the ideal lens surface . 0 3 Equation (18) can be rewritten as a quadratic equation in x2 n q1 q2 : ð50Þ simply by rearranging terms. The quadratic formula can then n 2 be applied to determine an equation for x in terms of y, Like C , C also approaches zero as ðn0=nÞ!1. The lowest "# 1 2 n0 2 term of the power series expansion of C2 in terms of is ðÞ"x 2 ¼ ðÞ"y 2 q q ðÞ"y 2 1 2 2ðÞq þ q n C ’ 1 2 : ð51Þ 2 ðÞq q n0 n0 n0 1 2 þ 2 q q q q n n 1 2 1 n 2 For the cubic term on the right-hand side of equation (47), one 0 ()1 1=2 obtains @ 2ðÞq1 q2 ðÞ"y A 1 1 : ð45Þ n0 q ðÞn0=n q ðÞq q 3 0 2 1 2 1 2 ðÞn =n q1 q2 C3 ¼ ; ð52Þ n 0 5 ðÞn =n q1 q2 The ‘ ’ accounts for the two roots of any quadratic equation. whose power series in terms of is found simply by setting =0, However, if the plus sign is chosen, the resulting equation cannot be satisfied by the condition yðx ¼ 0Þ = 0 as is required. 1 C3 ’ : ð53Þ Therefore only the minus sign yields a useful set of solutions ðÞq1 q2 for the lens surface. Finally one can calculate the lowest term in the power series Equation (45) is exact. However, the radical can be expansion of ", expanded by using the binomial theorem if 0 2 n 0 2 " ¼ 1 ’ 2 : ð54Þ z ¼ 2ðÞq1 q2 ðÞ"y ðÞn =n q1 q2 1: n
1128 Sutter and Alianelli Aberration-free aspherical lens shape J. Synchrotron Rad. (2017). 24, 1120–1136 research papers
Substituting equations (49), (51), (53) and (54) into equation where BXR0 = BXRðx ¼ 0Þ. From this one finds that (47) and keeping only the lowest-order terms in yields 2 2 2 2 16 2 3 16 xD0þxD0 ¼ BXR0 þ AXR ¼ DXR0; 2 2 2q1q2 3 2ðÞq1 þ q2 3 2 2 3 3 2 2 x ¼ y y y þ ... ðÞq1 q2 ðÞq1 q2 ðÞq q ðÞq q ðÞq q 1 2 1 2 1 2 ð62Þ ð55Þ where DXR0 = DXRðx ¼ 0Þ. Inspection of equation (60) shows It is justified to keep all terms up to cubic on the right-hand 2 2 that DXR0 < 0 and that therefore xD0þxD0 < 0, which proves side of equation (55) because all are multiplied by the same 2 2 that xD0þ and xD0 have opposite signs. Equation (61) shows power of . The neglected higher-order terms Yn (n 4) on 2 2 that, if =ðq1 q2Þ > 0, xD0þ > 0 and xD0 < 0, since the right-hand side of equation (55), which arise from the 3=2 2 ð AXRÞ > 0. Likewise, if =ðq1 q2Þ < 0, xD0þ < 0 and binomial expansion in equation (46), are given to lowest order 2 xD0 > 0. Only the positive squared x can yield real values of x; in by the negative squared x is discarded. There are thus two values n of x at which the discriminant D ðxÞ =0: 2n 1 1=2 2 y XR Yn ¼ 2 ðÞq1 q2 : ð56Þ n q q (i) =ðq1 q2Þ > 0. 1 2 1=2 In the X-ray approximation, these terms diminish rapidly with 3=2 1=2 x ¼ x ¼ 2 B þ ðÞA increasing n. Therefore the fourth-order term is already much D0 q q XR0 XR 1 & 2 less than the cubic term included in the right-hand side of 1=2 1 equation (55), and higher-order terms are smaller still. This ¼ 2 ðÞq þ q 9q q 2ðÞq þ q 2 q q 54 1 2 1 2 1 2 justifies the neglect of terms beyond the cubic in equation (55). 1 2 ' ÀÁ1=2 1 3=2 In standard form, equation (55) is þ q2 q q þ q2 : ð63Þ 93=2 1 1 2 2 3 2 ðÞq1 q2 2 y þ ðÞq1 þ q2 y þ ðÞq1q2 y þ x ¼ 0: ð57Þ 2 (ii) =ðq1 q2Þ < 0. This equation can be solved analytically. When x = 0 the roots 1=2 1=2 are trivial: 0, q and q . The solutions for general x can be x ¼ x ¼ 2 B þ ðÞA 3=2 1 2 D0 q q XR0 XR determined by the same methods used to calculate the resol- 1 & 2 1=2 vent cubic of the exact equation. The discriminant of equation 1 ¼ 2 ðÞq þ q 9q q 2ðÞq þ q 2 (57) is D ¼ A3 þ B2 , where q q 54 1 2 1 2 1 2 XR XR XR 1 2 ' ÀÁ ÀÁ1=2 1 1 3=2 A ¼ q2 q q þ q2 ; ð58Þ þ q2 q q þ q2 : ð64Þ XR 9 1 1 2 2 93=2 1 1 2 2