<<

the abdus salam international centre for theoretical physics

Optical path function

Anna Bianco SINCROTRONE TRIESTE

School on Synchrotron Radiation and Applications in memory of J.C. Fuggle and L. Fonda 8-26 May 2006 Miramare – Trieste, Italy Outline

•Main properties of SR, brilliance

•Main properties of VUV, EUV and soft x-rays mirrors and gratings

• Conserving brilliance up to the experiment

• Determining optical paths from Fermat’s principle: general theory of aberrations Main properties of Synchrotron Radiation

• Very broad and continuous spectral range, from infrared up to soft and hard x-rays

• High intensity

• The emitted radiation is highly collimated and emanates from a very small source: the electron beam

• Pulse time structure

• High degree of polarization Spectral range

4-30eV 250eV - several keV 300-40nm 30-250eV 40-5nm

D.Attwood, “Soft x-rays and extreme ultraviolet radiation”, Cambridge University Press, 1999 Brilliance

photon flux σ 1 Brilliance = σ σ I x z x′σ ′z BW

I = electron current in the storage ring σ xσ z = transverse area from which SR is emitted σ ′xσ z′ = solid angle into which SR is emitted ∆E BW = spectral bandwidth, usually: = 0 . 1 % E

z Source size Solid angle σ σ x x z σ’xσ’z SR brilliance at ELETTRA Why is brilliance important? (1)

photon flux σ 1 Brilliance = σ σ I x z ′xσ z′BW more flux Æ more signal for the experiment

But why combining the flux with geometrical factors?

Liouville’s theorem: for an optical system the occupied phase space volume cannot be decreased along the optical path

(without loosing photons) Æ (σσ’)final ≥ (σσ’)initial Example : a focusing beam

f σ’z σz σ’ f z σz

Optical element z’ z’

σ’ σ’ f z z z z

σz σ f Liouville’s theorem: (σσ’)final ≥ (σσ’)initial z Meaning of x’,z’

• transverse momenta p , p can be expressed in terms of the direction cosines: x z z

p x = p cos α p = p cos γ G z γ p • defining: x'= cosα z'= cosγ α β y and assuming: p y >> p x , p z Æ p ≅ py , the transverse momenta are proportional to x ' , z ' : x

px = py x' pz = py z'

z

pz z’ y py Why is brilliance important? (2)

Liouville’s theorem: (σσ’)final ≥ (σσ’)initial Æ to focus a beam in a small spot (which is needed for achieving energy and/or spatial resolution) one must accept an increase in the beam divergence

High beam divergence along the beamline: Æ large optical devices Æ high costs and low optical qualities With a not brilliant source the spot size can be made small only reducing the photon flux.

The high brilliance of the radiation source allows the development of monochromators with high energy resolution and high throughput and gives also the possibility to image a beam down to a very small spot on the sample with high intensity. The beamline

The researcher needs at his experiment a certain number of photons/second into a phase volume of some particular characteristics. Moreover, these photons have to be monochromatized.

The beamline: • is the means of bringing radiation from the source to the experiment transforming the phase volume in a controlled way: it demagnifies, monochromatizes and refocuses the source onto a sample • must preserve the excellent qualities of the radiation source VUV, EUV and soft x-rays We restrict ourselves to photon energies from 10 to 2000 eV.

4-30eV 250eV - several keV 300-40nm 30-250eV 40-5nm These regions are very interesting because are characterized by the presence of the absorption edges of most low and intermediate Z elements Æ photons with these energies are a very sensitive tool for elemental and chemical identification But… these regions are difficult to access. Ultra-high vacuum

VUV, EUV and soft x-rays have a high degree of absorption in all materials Æ No windows Æ The entire optical system must be kept under vacuum

Ultrahigh vacuum conditions (P=1-2x10-9 mbar) are required: • Not to disturb the storage ring and the experiment • To avoid photon absorption in air • To protect the optical surfaces from contamination (especially from carbon) Grazing incidence

Strong absorption of radiation by all materials: Æ no lenses: the only optical elements that can work are mirrors and diffraction gratings, used in reflection

Reflectivities drop down fast with the increasing of the grazing incidence angle Æ only reflective optics at grazing incidence angles (1-2 degrees) Focusing properties The meridional or tangential plane contains the central incidence ray and the normal to the surface. The sagittal plane is the plane perpendicular to the tangential plane and containing the normal to the surface. Paraboloid Rays traveling parallel to the symmetry axis OX are all focused to a point A. Conversely, the parabola collimates rays emanating from the focus A.

Line equation: Y 2 = 4aX

Paraboloid equation: Y 2 + Z 2 = 4aX where: a = f cos 2 ϑ

Position of the pole P: ϑ ϑ2 Xϑo = a tan

Yo = 2a tanϑ ϑ Paraboloid equation: ϑ x2 sin2 + y2 cos2 + z 2 − 2xy sin cos − 4 axsecϑ = 0

J.B. West and H.A. Padmore, Optical Engineering, 1987 Ellipsoid X 2 Y 2 Line equation: + = 1 a 2 b2 Ellipsoid equation: X 2 Y 2 Z 2 2 + 2 + 2 = 1 a ϑ b b r + r' where: a = ; b = a 1− e2 2 ϑ 1 e = r 2 + r'2 −2rr'cos(2ϑ ) 2a ϑ Rays from one focus F will always be perfectly focused to the second focus F . 1 ϑ 2 ⎛ sin2 1 ⎞ ⎛ cos2 ⎞ z 2 ⎛ 4 f cos ⎞ ⎡2sin e2 − sin 2 ϑ ⎤ x2 ⎜ + ⎟ + y 2 ⎜ ⎟ + − x − xy = 0 ⎜ 2 2 ⎟ ⎜ 2 ⎟ 2 ⎜ 2 ⎟ ⎢ 2 ⎥ ⎝ b a ⎠ ⎝ b ⎠ b ⎝ b ⎠ ⎣⎢ b ⎦⎥ −1 ⎛ 1 1 ⎞ where: f = ⎜ + ⎟ ⎝ r r' ⎠ J.B. West and H.A. Padmore, Optical Engineering, 1987 Toroid (1)

The bicycle tyre toroid is generated by rotating a circle of radius ρ in an arc of radius R. In general, two non-coincident focii are produced: one in the meridional plane and one in the sagittal plane

Tangential focus: ⎛ 1 1 ⎞ cosϑ 1 ⎜ + ⎟ = ⎝ r r't ⎠ 2 R

Sagittal focus: ⎛ 1 1 ⎞ 1 1 ⎜ + ⎟ ϑ = ⎝ r r's ⎠ 2cos ρ

ρ Stigmatic image: = cos 2 ϑ R J.B. West and H.A. Padmore, Optical Engineering, 1987 Toroid (2) ρ 2 2 2 2 2 x + y + z = 2Rx − 2R(R − ) + 2(R − ρ) (R − x) + y S x N S A

z TK SM K T

O TL M N y L

For ρ=R Æ spherical mirror A stigmatic image can only be obtained at normal incidence. For a vertical deflecting spherical mirror at grazing incidence the horizontal sagittal focus is always further away from the mirror than the vertical tangential focus. The mirror only weakly focusses in the sagittal direction. Gratings The separates the different components of the spectrum by redirecting the radiation by an amount which depends upon the wavelength.

Outside, Incident k=-2 negative wavelength λ α β k=-1 orders k=2 k=1 k=0 Grating normal Inside, positive sinα + sin β = Nkλ orders N=1/d is the groove density, k is the order of diffraction (±1,±2,...) VUV, EUV and soft x-rays beamline

Basic elements:

• mirrors to focus, deflect and filter

• gratings to diffract

• slits to spatially select the radiation

Optical elements have to preserve the quality (brilliance) of the radiation Conserving brilliance

Brilliance decreases because of:

• roughness and slope errors on optical surfaces

• thermal deformations of optical elements due to heat load produced by the high power radiation

• aberrations of optical elements

In the following we will consider OEs with theoretical surface shapes. Perfect imaging and aberrations An ideal optical element is able to perform perfect imaging if all the rays originating from a single object point cross at a single image point.

Deviations from perfect imaging are called aberrations. Aberrations theory Image quality is essential for achieving high energy and spatial resolution Æ knowledge of aberrations theory is necessary It shows what the different aberration terms are and how they play a role in the image formation Æ it teaches how aberrations can be reduced

Goal: understand in general terms how to treat mathematically the focusing properties of a concave optical element. We will study the case of a grating.

The general theory of aberrations of diffraction gratings applies Fermat’s principle to derive expressions for the aberration coefficients. Fermat’s principle

Light-rays choose their paths to minimize the optical length B G ∫ n(r)dl B A G n ( r ) : index of refraction of the medium dl : line segment along the path A

A more accurate statement: a -ray going from A to B must traverse an optical path length which is stationary with respect to small variations of that path Theory of conventional diffraction gratings For a classical grating with rectilinear grooves parallel to z with constant spacing d, the optical path length is: z F = AP +PB +kN λ y y P B O r’ zb α β r x

za A where λ is the wavelength of the diffracted light, k is the order of diffraction (±1,±2,...), N=1/d is the groove density Perfect focus condition (1)

Let us consider some number of light rays starting from A and impinging on the grating at different points P. Fermat’s principle states that if the point A is to be imaged at the point B, then all the optical path lengths from A via the grating surface to B will be the same. z B is the point of a perfect focus P y if: O B ∂F ∂F r’ =0 =0 α β ∂y ∂z r x for any pair of (y,z )

A Perfect focus condition (2)

Equations:

∂F ∂F F = AP +PB +kN λ y =0 =0 for any pair of ( y,z ) + ∂y ∂z

can be used to decide on the required characteristics of the diffraction grating: •the shape of the surface •the grooves density •the object and image distances Aberrated image

∂F ∂F In general, and are functions of y and z and can not be made zero for any y,z ∂y ∂z

Æ when the point P wanders over the grating surface, diffracted rays fall on slightly different points on the focal plane and an aberrated image is formed z •B0: gaussian image, y produced by the central ray P B • B: ray diffracted by the O generic point P on the r ’ z B grating surface α o b0 0 • Aberrations: displacements β0 r x of B with respect to B0

za A Grating surface

The grating surface may in general be described by a series expansion:

z ∞ ∞ y i j P x = ∑∑aij y z ij=00= B O r’ β α a00= a10= a01= 0 because of the choice x of origin r j = even if the xy plane is a symmetry plane A

Giving suitable values to the coefficients aij’s we obtain the expressions for the various geometrical surfaces. aij coefficients (1) 1 1 1 1 a = ; a = ; a = ; a = ; Toroid 02 ρ2 20 2R 22 4R 2 40 8R 3 1 a = ; a = 0; a = 0 04 8ρ 3 12 30 ρ Sphere, cylinder and plane are special cases of toroid: R=ρ Æ sphere R= ∞ Æ cylinder ϑ R=ρ= ∞ Æ plane ϑ ϑ 2 1 cos ϑ3sin a02 = ; a20 =ϑ ; a22 = ; Paraboloid 4 f cos 4 f ϑ 32 f 3 cos ϑ ϑ tan ϑ sin cos a = − ; a = − 12 8 f 2 30 8 f 2 ϑ 5sin 2 cos sin 2 a = ; a = 40 64 f 3 04 64 f 3 cos3 ϑ aij coefficients (2) ϑ ϑ Ellipsoid ϑ ϑ1 cos b2 ⎡sin2 1 ⎤ a02 = ; a20ϑ= ; a04 = 3 3 ϑ ⎢ 2 + 2 ⎥; 4 f cos 4 f 64ϑf cos b a ϑ ⎣ ⎦

tan 2 2 sin 2 2 a12 = 2 ϑ e −sinϑ ; a30 = 2 e −sin ; 8 f cos ϑ 8 f ϑ 2ϑ 2 2 2 b ⎡5sin cos 5sinϑ 1 ⎤ a40 = 3 3 ⎢ 2 − 2 + 2 ⎥; 64 f cos ϑ b a a ⎣ ϑ ⎦ sin2 ⎡3 b2 ⎛ cos2 ϑ ⎞⎤ a = cos2 − ⎜1− ⎟ 22 3 3 ⎢ 2 ⎜ ⎟⎥ 16 f cos ⎣2 a ⎝ 2 ⎠⎦ −1 ⎡1 1 ⎤ where f = + ⎣⎢r r′⎦⎥

http://xdb.lbl.gov/Section4/Sec_4-3Extended.pdf Optical path function (1)

z F = AP +PB +kN λ y P (x,y,z) y

B(x ,y ,z ) 2 2 2 O b b b ()()() r’ AP= xa − x + ya − y + za − z β α 2 2 2 PB= ()()()xb − x + yb − y + zb − z r x β

α < 0; β > 0 xa = r cosα ya = r sinα x = r′ cos y = r′ sin β A(xa,ya,za) b b Optical path function (2)

i j F = ∑ Fijk y z ijk 1 1 1 = F + yF + zF + y 2 F + z 2 F + y 3 F 000 100 011 2 200 2 020 2 300 1 1 1 1 + yz 2 F + y 4 F + y 2 z 2 F + z 4 F 2 120 8 400 4 220 8 040 1 1 1 + yzF + αyF + y 2 F + y 2 zF + ... 111 2 102 4 202 2 211 β k k Fijk = za Cijk ( ,r) + zb Cijk ( ,r') + Nkλfijk

⎧1 when ijk =100 fijk = ⎨ ⎩0 otherwise Perfect focus condition (3)

∂F ∂F =0 =0 for any pair of (y,z) ∂y ∂z

Fijk =0 for all ijk ≠ (000 )

i j Each term F ijk y z in the series (except F000 and F100) represents a particular type of aberration α Fijk coefficientsβ (1) ′ F000 = r + r α β for r,r’ >> za,zb F100 = Nkλ − (sinαα+ sin β ) α β ⎛ cos2 cos2 ⎞ ⎜ ⎟ F200 = ⎜ + α ⎟ − 2a20 ()cos + cos ⎝ r r′ ⎠ β α 1 1 F = + − 2a ()cos + cos β 020 r r′ 02 α β α ⎡T (r, )⎤ ⎡T (r′, )⎤ β F300 = sin + sin β − 2a30 ()cos + cos ⎣⎢ r ⎦⎥ ⎣⎢ r′ ⎦⎥ α ⎡S(r, )⎤ ⎡S(r′, )⎤ F120 = ⎢ α ⎥sin + ⎢ ⎥sin − 2a12 ()cos + cos β ⎣ r ⎦ α⎣ r′ ⎦ α cos 2 1 where T (r, ) = − 2a cos α and S (r, ) = − 2a cos α r 20 r 02 and analogous expressions for T (r′, β ) and S (r′, β ) α α α α α β Fijk coefficientsβ (2) β β β ⎡α4T(r, )⎤ ⎡T 2 (r, )⎤ ⎡4T(r′, )⎤ ⎡T 2 (r′, )⎤ F = sin 2 − + α sin 2 − 400 ⎢ 2 ⎥ ⎢ ⎥ ⎢ 2 ⎥ ⎢ ⎥ ⎣ r α⎦ ⎣ r ⎦ ⎣ r′ β ⎦ ⎣ r′ ⎦ β ⎡sin cos sin cos ⎤ 2 ⎡1 1 ⎤ -8a30 ⎢ + ⎥ −8a40 ()cos + cos + 4a20 ⎢ + ⎥ ⎣ r r′ β⎦ ⎣r r′⎦ α α α ⎡2S(r, )⎤ 2 ⎡2S(r′, )⎤ β 2 ⎡T(r, )S(r, )⎤ β⎡T(r′, )S(r′, )⎤ F220 = sin + sin − − ⎣⎢ r 2 ⎦⎥ ⎣⎢ r′2 ⎦⎥ ⎣⎢ α r ⎦⎥ ⎣⎢ βr′ ⎦⎥ α ⎡1 1 ⎤ ⎡sin cos βsin cos ⎤ + 4a20a02 + − 4a22 α()cos + cos − 4a12 + ⎢ ′⎥ ⎢ β′ ⎥ ⎣r r ⎦ β ⎣ r r ⎦ α ⎡1 1 ⎤ ⎡S 2 (r, )⎤ ⎡S 2 (r′, β )⎤ 2 () F040 = 4a02 ⎢ + ⎥ −8a04 cos + cos − ⎢ ⎥ − ⎢ ⎥ ⎣r r′⎦ ⎣ r ⎦ ⎣ r′ ⎦ α Fijk coefficients (3) β

αza zb F011 = − − r r′ β

za sin zb sin F111 = − 2 − 2 r α r′ z 2 sin z 2 sin F = a + b α 102 r 2 r′2 β 2 α 2 2 2 ⎛ za ⎞ ⎡2sin α ⎤ ⎛ zb ⎞ ⎡2sin ⎤ F202 = ⎜ ⎟ ⎢ −T()r, ⎥ + ⎜ ⎟ ⎢ −T(rβ′, )⎥ ⎝ r ⎠ ⎣ r ⎦ ⎝ r′ ⎠ ⎣ r′ ⎦ β z ⎡ 2sin 2 ⎤ z ⎡ 2sin 2 β ⎤ a b ′ F211 = 2 ⎢T(r, ) − ⎥ + 2 ⎢T (r , ) − ⎥ r ⎣ r ⎦ r′ ⎣ r′ ⎦ Gaussian image point (1)

⎛∂F⎞ ⎛∂F⎞ If we apply Fermat’s principle to the central ray: ⎜ ⎟ =0 ⎜ ⎟ =0 ⎝ ∂y ⎠ ⎝ ∂z ⎠y=0,z=0 α y=0,z=0 β grating equation F100 = 0 sin + sin 0 = Nkλ z z law of magnification a = − b0 F011 = 0 r r′ in the sagittal direction α 0 β

The tangential focal distance r’0 is obtained by setting: ⎛ cos2 cos2 ⎞ α ⎜ + 0 ⎟ − 2a ()cos + cos β = 0 tangential focusing F200 = 0 ⎜ ⎟ 20 0 ⎝ r r0′ ⎠

The three above equations determine the Gaussian image point B0(r’0,β0,zb0) Gaussian image point (2)

z y P B z tan−1 b0 O r' 0 B0 r ’ α 0 zb0 β0 x r z tan −1 a r za A Sagittal focusing

While the second order aberration term F200 governs the tangential focusing, the second order term F020 governs the sagittal focusing:

1 1 α F020 = 0 + − 2a ()cos + cos β = 0 sagittal focusing r r′ 02

Example: toroidal mirror 1 1 Substitutinga = ; a = in F = 0; F = 0 02 2 ρ 20 2R 200 020 and imposing α = -β = θ

⎛ 1 1 ⎞ cosϑ 1 ⎛ 1 1 ⎞ 1 1 ⎜ + ⎟ = ⎜ + ⎟ ϑ = ⎝ r rt '⎠ 2 R ⎝ r rs '⎠ 2cos ρ Aberrations terms

Most important imaging errors:

F200 defocus F020 astigmatism F300 primary coma (aperture defect) F120 astigmatic coma F400 F220 F040 spherical aberration

There is an ambiguity in the naming of the aberrations in the grazing incidence case! Ray aberrations (1)

The generic ray starting from A will arrive at the focal plane at a point B displaced from the Gaussian image point B0 by the ray aberrations ∆yb and ∆zb: z ′ y ∆y r0 ∂F b ∆yb = ∆zb P cos β 0 ∂y B O ∂F r ’ z B0 ∆z = r′ α o b0 b 0 ∂z β0 x r

za A Ray aberrations (2)

Substituting the expansion of F , the ray aberrations for each aberration type can be calculated separately:

ijk r0′ i−1 j ∆yb = Fijk i y z cos β 0

ijk i j−1 ∆zb = r0′ Fijk y j z

Provided the aberrations are not too large, they are additive: they may either reinforce or cancel.

ijk ijk ∆yb = ∑ ∆yb ∆zb = ∑ ∆zb ijk ijk Aberrated image

Example of footprint on the grating:

2w=44mm z 2l=2mm

y

ijk ijk Substituting y=±w and z=±l in the ray aberrations ∆yb and ∆zb , we evaluate the contributions of the rays which are more distant from the pole of the grating

Æ size (∆yb * ∆zb) of the resulting aberrated image Defocus and coma contributions

The defocus contribution is linear in the ruled length (± w) of the grating, the error in the dispersive direction is symmetric about the Gaussian image point:

200 r0′ ∆yb (±w) = ± F200 2 w cos β0

The coma contribution is proportional to w2 giving a dispersive error which only occurs on one side of the Gaussian image point for rays from both the top and the bottom of the grating (y=±w):

300 r0′ 2 ∆yb (±w) = F300 3 w cos β0 Comparison ray trace - aberration calculations

Example

z z

B B

F040

F020 B0 y B0 y F200 F300

Ray trace simple tells us that the Aberration-based calculations ray arrives in a certain point specify the different contributions Aberrations contribution to resolution λ λ ⎛β∂ ⎞ ∆ = ⎜ ⎟ β∆ ⎝ ∂ ⎠ β α =const cos = ∆β Nk ∆y λ cos β ∆y Substituting: ∆β = b Æ ∆ = b r′ Nk r′ r′ ∂F 1 ∂F ∆y = 0 ∆λ = Substituting: b Æ Nk ∂y cos β 0 ∂y

1 i−1 j ∆λ = ∑ Fijk i y z Nk ijk Aberration theory: conclusions

∂F ∂F = 0 = 0 • Perfect focus condition: ∂ y ∂ z for each pair (y,z) Æ all the coefficients Fijk must be zero

• Non-zero values for the coefficients Fijk lead to displacements of the rays arriving in the image plane from the ideal Gaussian image point.

• We have found the expressions for these rays displacements and the corresponding contributions to wavelength resolution. In this way the impact on the imaging and energy resolution properties of a given grating can be evaluated.

• By a proper choice of the grating shape, groove density, object and image distances, the sum of the aberrations may be reduced to a minimum. References (1)

• D.Attwood, “Soft x-rays and extreme ultraviolet radiation”, Cambridge University Press, 1999

• D.Attwood, “Challenges for utilization of the new Synchrotron facilities”, Nucl. Instr.and Meth.A291, 1-7, 1990

• B.W.Batterman and D.H.Bilderback, “X-Ray Monocromators and Mirrors” in “Handbook on Synchrotron Radiation”, Vol.3, G.S.Brown and D.E.Moncton, Editors, North Holland, 1991, chapter 4

• W.Gudat and C.Kunz, “Instrumentation for Spectroscopy and Other Applications”, in “Syncrotron Radiation”, “Topics in Current Physics”, Vol.10, C.Kunz, Editor, Springer-Verlag, 1979, chapter 3

• M.Howells, “Gratings and monochromators”, Section 4.3 in “X-Ray Data Booklet”, Lawrence Berkeley National Laboratory, Berkeley, 2001

• M.Howells, “Vacuum Ultra Violet Monochromators”, Nucl. Instrum. and Meth. 172, 123-131, 1980

• M.C. Hutley, “Diffraction Gratings”, Academic Press, 1982

• R.L. Johnson, “Grating Monochromators and Optics for the VUV and Soft-X-Ray Region” in “Handbook on Synchrotron Radiation”, Vol.1, E.E.Koch, Editor, North Holland, 1983, chapter 3 References (2)

• C.Kunz and J.Voss, “Scientific progress and improvement of optics in the VUV range”, Rev. Sci.Instrum. 66 (2), 1995

• G.Margaritondo, Y.Hwu and G.Tromba, “Synchrotron light: from basics to and coherence related applications”, in: “Syncrotron Radiation:Fundamentals, Methodologies and Applications”, Conference Proceedings, Vol. 82, S.Mobilio and G.Vlaic, Editors, S.Margherita di Pula, 2001

• W.R.McKinney, M.Howells, H.Padmore, “Aberration analysis calculations for Synchrotron radiation beamline design”, in “Gratings and Grating Monochromators for Synchrotron Radiation”, W.R. McKinney, C. A. Palmer, Eds., Proc. SPIE Vol.3150, 1997

• A.G. Michette, “Optical Systems for X Rays”, Plenum Press, 1986

• W.B.Peatman, “Gratings, mirrors and slits”, Gordon and Breach Science Publishers, 1997

• J.B. West and H.A. Padmore, “Optical Engineering” in “Handbook on Synchrotron Radiation”, Vol.2, G.V.Marr, Editor, North Holland, 1987, chapter 2

• G.P.Williams, “Monocromator Systems”, in “Synchrotron Radiation Research: Advances in Surface and Interface Science”,Vol.2, R.Z.Bachrach, Editor, Plenum Press, 1992, chapter 9