1
Parallel FDTD modelling of nonlocality in plasmonics Joshua Baxter, Antonino Cala` Lesina, and Lora Ramunno
Abstract—As nanofabrication techniques become more precise, expected spread of charge density near plasmonic surfaces as with ever smaller feature sizes, the ability to model nonlocal well as predict the blueshift in plasmon resonance with de- effects in plasmonics becomes increasingly important. While creasing particle size [13], [17], [19]. More recently, the effects nonlocal models based on hydrodynamics have been implemented using various computational electromagnetics techniques, the of electron diffusion were incorporated within a generalized finite-difference time-domain (FDTD) version has remained elu- nonlocal optical response (GNOR) model, correctly predicting sive. Here we present a comprehensive FDTD implementation of a broadened line-shape that was observed experimentally [20]. nonlocal hydrodynamics, including for parallel computing. As a Nonlocal hydrodynamic models have been implemented us- sub-nanometer step size is required to resolve nonlocal effects, a ing several computational electromagnetic methods including parallel implementation makes the computational cost of nonlocal FDTD more affordable. We first validate our algorithms for small the finite element method [21], the discontinuous Galerkin spherical metallic particles, and find that nonlocality smears out time domain method [22], and the boundary element method staircasing artifacts at metal surfaces, increasing the accuracy [23]. A finite-difference time-domain (FDTD) implementation over local models. We find this also for a larger nanostructure would be advantageous due to the wide-spread use of FDTD in with sharp extrusions. The large size of this simulation, where the photonics community, its relative ease of implementation, nonlocal effects are clearly present, highlights the importance and impact of a parallel implementation in FDTD. its broadband capabilities, and its ability to produce time- domain movies. Only two attempts have been made [24], Index Terms—FDTD, plasmonics, nonlocality, hydrodynamic [25], where one is based on an erroneous approximation, plasma model, GNOR, parallel computing neither consider high performance computing and parallel implementations, and neither include electron diffusion. A cor- I.INTRODUCTION rect and comprehensive FDTD implementation has remained ABRICATING objects with nanoscale precision is pos- unreported. F sible due to significant progress in nanofabrication tech- In this paper, we present a parallel FDTD implementation of niques over the last few decades [1]. As a result, plasmonic nonlocal hydrodynamics, including a GNOR implementation. nanostructures and metamaterials [2] are having tremendous A parallel implementation in FDTD is especially important impact in many fields including biosensing [3], quantum for simulating nonlocality, as the grid cell size needs to cryptography [4], nonlinear optics [5], photovoltaics [6], light be smaller than the Fermi wavelength λF ∼ 0.5 nm to emitting devices [7], and precision medicine [8]. capture the spread of electron density [26]. This can require Numerical modelling in plasmonics is typically based on a large amount of memory, and can present a prohibitive optical models for bulk permittivity. These models – such as computational load. High-performance computing represents the Drude model for free electron response, and the Lorentz a viable solution. The FDTD rectangular-meshing scheme, and critical points models for bound electron response [9], where a field update at one location requires fields only in [10] – are based on the local response approximation (LRA), adjacent Yee-cells [27], lends itself well to parallel computing; which assumes that the induced polarization or current at a indeed, message passing interface (MPI)-based FDTD solvers given location depends only on the electromagnetic field at that are well reported for LRA models [28]. Furthermore, high- same location. While appropriate for many applications, the performance computing is becoming more accessible with Drude model is insufficient for modelling plasmonic structures cloud computing, computing consortiums, increased high- smaller than 10 nm [11]–[14], as well as those containing performance computing investments, and new, higher-level sharp features or nanoscale gaps [15], [16]. The charge density parallel programming languages such as Chapel [29]. arXiv:2005.06997v1 [physics.optics] 14 May 2020 near the surface of plasmonic objects is spread over a finite The structure of this paper is as follows. We review nonlocal thickness on the order of the a few angstroms [16], [17], and hydrodynamics in Section II, including the most common thus for small features, cannot be treated as localized at the version without electron diffusion, as well as GNOR, which surface, as is implicit in LRA models. does include electron diffusion. In Section III, we derive The hydrodynamic plasma model treats the conduction band from the nonlocal models FDTD update equations for the electrons as a free electron gas [5], [12], [16]–[18], accounting polarization field via the auxiliary differential equation (ADE) for free electron fluctuations via a pressure term. Unlike the method. In Section IV, we discuss the implementation of the Drude model, it does not make an LRA and thus nonlocality nonlocal FDTD update equations for parallel computing within is incorporated. It has been found to correctly predict the a MPI framework. In Section V, we test our implementa- tions by simulating the optical response of small metallic J. Baxter, A. Cala` Lesina, and L. Ramunno are with the Department of Physics and Center for Research in Photonics, University of Ottawa, Ottawa, nanospheres and comparing our results to analytic solutions Ontaria, Canada e-mail: [email protected]. and experimental results. We find an unexpected benefit of 2
2 2 nonlocal versus LRA modelling: a marked decrease in stair- where β = 1/3vF and vF is the Fermi velocity. The free casing artifacts at the metal boundary. Due to the rectangular electron density fluctuations are accounted for in the last term discretization inherent in most FDTD approaches, fields can of Eq. 5. This is often referred to as the pressure term, and build up in an unphysical manner at plasmonic surfaces, and is responsible for the known blue shift in the surface plasmon this can be particularly problematic in applications that rely on resonance with decreasing particle size [13], [17]. Along with plasmonic near-field enhancement [30]. The incorporation of this formula for the velocity field, we require the continuity nonlocality significantly decreases this unphysical field build equation given by up. In Section VI, we simulate the response from a spherical nanoparticle containing sharp extrusions as a demonstration of ∂n = −∇ · (nv). (6) a large-scale simulation that requires both parallel computing ∂t and nonlocal modelling. We find that despite the nanoparticle From here we consider two approaches. The first is the most being larger, the sharp extrusions exhibit plasmonic features widely used, and assumes a current density given by J = not accessible to the LRA. Finally, in Section VII we give −env. As we use a polarization field formulation in this paper, concluding remarks. ∂PNL we set J = ∂t , where we have defined PNL to be the (nonlocal) free electron polarization field. The velocity field II.HYDRODYNAMICMODELSFORNONLOCALITY 1 ∂PNL is thus v = − ne ∂t , and Eq. 6 becomes The most general response of a linear optical material to incident radiation is described by 1 n = n + ∇ · P , (7) 0 e NL Z where n is the equilibrium free electron density. From Eq. 5, D(r, ω) = ε ε(r, r0, ω)E(r0, ω)dr0, (1) 0 0 we then obtain where the dielectric function of the material, ε(r, r0, ω), is non- 2 local when the displacement field D at one location depends ∂ PNL ∂PNL 2 2 + γ = ε0ωpE + β ∇(∇ · PNL), (8) on the electric field E at other locations. In many applications, ∂t2 ∂t q 2 it is appropriate to use a local response approximation (LRA), e n0 where ωp = . We call Eq. 8 the “nonlocal Drude wherein ε(r, r0, ω) = δ(r − r0)ε(ω). In plasmonic modelling, mε0 model” because it consists of the classical LRA Drude model the interaction of the electric field and the free electron plasma plus one additional term that gives rise to nonlocality (i.e., the is typically described by the Drude model, which employs the term proportional to β2). LRA, where the dielectric function reduces to [9] The model represented by Eq. 8 differs from those used in the two previous nonlocal FDTD works. In Ref. [24], the 2 ωp gradient-divergence term was simplified to a Laplacian, and ε(ω) = 1 − , (2) ω2 + iγω this resulted in spurious, non-physical resonances [18]. In Ref. [23], a current density formulation is used. A benefit to using where ω is the plasma frequency, γ is a collisional damping p a polarization field formulation is that it gives ready access to rate, and i is the imaginary unit. the free electron density via Gauss’s Law, as we demonstrate The hydrodynamic plasma model [17] goes beyond the in Section V. LRA, more accurately describing spatial-temporal free elec- The second approach we consider includes electron dif- tron dynamics via fusion, which was also found to play an important role in the optical response of small metallic particles [20]. This has ∂v e 1 δG[n] + (v · ∇)v = − (E + v × B) − γv − ∇ , (3) been described by the generalized nonlocal optical response ∂t m m δn (GNOR) model, where diffusion is considered by modifying 1 ∂PG where v is the velocity field of the free electron plasma, E the expression for the velocity field via v = − en ( ∂t + and B are the electric and magnetic fields, and n is the free D∇(∇ · PG)), where D is the diffusion coefficient and where electron density. The energy functional G[n] considers the we have denoted the nonlocal GNOR free electron polarization internal kinetic energy of the electron gas and is usually taken field by PG. GNOR correctly predicts both the blueshift in to be the Thomas-Fermi functional, giving the plasmon resonance frequency as well as the broadening of the absorption peak with decreasing particle size. A thorough
2 2 discussion of the nonlocal Drude and GNOR models is given δG[n] h 3 3 = n , (4) in Ref. [17]. δn 2m 8π Using in Eqs. 5 and 6 the GNOR definition for the velocity where h is Plancks constant. As we are only considering the field, we obtain the following time-domain nonlocal-diffusive linear nonlocal response in this paper, we neglect the nonlinear hydrodynamics model for the polarization field, which we terms (magnetic-Lorentz and convection) in Eq. 3 giving hereafter refer to as the “GNOR” model:
∂v e β2 ∂2P ∂P ∂ + γv = − E − ∇n, (5) G +γ G = ε ω2E+η∇(∇·P )+D ∇(∇·P ), (9) ∂t m n ∂t2 ∂t 0 p G ∂t G 3
2 where η = β + Dγ. We start by deriving f4 from the nonlocal Drude model. It is from Eqs. 8 and 9 that we derive in the next section Using central differencing, we discretize Eq. 8 centered at time our FDTD update equations for implementing the two different n∆t, to obtain models of the free electron response of a plasmonic material, one accounting for nonlocality only (Eq. 8), and the other Pn+1 − 2Pn + Pn−1 Pn+1 − Pn−1 nonlocality with diffusion (Eq. 9). NL NL NL + γ NL NL To properly model the optical response of many plasmonic ∆t2 2∆t 2 n 2 n materials, one must also include the contribution of bound = ε0ωpE + β ∇(∇ · PNL). (11) electrons. We employ the LRA-based N-critical points model that assumes a susceptibility of the form Further, using weighted central averaging as discussed in Ref. [30], we set En = (En−1 + 2En + En+1)/4 to obtain χCP (ω) = (ε∞ − 1) N eiφp e−iφp n+1 n n−1 n−1 n n+1 X P = D1PNL + D2P + D3(E + 2E + E ) + ApΩp + , (10) NL NL Ωp − ω − iΓp Ωp + ω + iΓp n p=1 + DNL∇(∇ · PNL), (12) where ε∞ is the infinite frequency permittivity, and where Ap, Ωp, Γp, and φp are fitting parameters. This model can be readily transformed to the time-domain for FDTD implemen- tation, as described in detail in Ref. [31]. 2 D1 = 2 , (13) Dd∆t ! III.UPDATE EQUATIONS FOR NONLOCAL FDTD 1 γ 1 D2 = − 2 , (14) We derive in this section the FDTD update equations for Dd 2∆t ∆t the time-domain nonlocal Drude and GNOR models (Eqs. 8 2 ε0ωp and 9, respectively), using the ADE method. We discretize D3 = , (15) our domain via the Yee cell [27] where the electric fields are 4Dd 2 collocated with the polarization fields in time and space; the β DNL = , (16) Yee cell positions of the electric, magnetic, and polarization Dd ! fields used in this paper are listed in Table I. We denote the γ 1 Dd = + . (17) free electron polarization field by Pf (which signifies either 2∆t ∆t2 PNL or PG), and the bound electron polarization field by PCP . n The FDTD update algorithm at time = n∆t, where ∆t is As the x, y, and z components of ∇(∇ · PNL) must be the time step size, consists of updating (in order) the: collocated with the x, y, and z components of PNL, we have n+1/2 n−1/2 n (1) magnetic field H = f1(H , E ), n+1 n n n n+1/2 (2) electric field E = f2(E , Pf , PCP , H ), i+1/2,j,k i+1/2,j,k n+1 n+1 n ! (3) bound charge polarization PCP = f3(E , PCP ), n ∂ ∂Px ∂Py ∂Pz n+1 n+1 n ∇(∇·PNL) = + + (4) free charge polarization Pf = f4(E , Pf ), ∂x ∂x ∂y ∂z x x via update equations f , f , f , and f , whose form and i+3/2,j,k i+1/2,j,k i−1/2,j,k 1 2 3 4 P − 2P + P required inputs depend on the model from which they are ≈ x x x ∆x2 derived. The equation f1 is the magnetic field update derived P i+1,j+1/2,k − P i+1,j−1/2,k − P i,j+1/2,k + P i,j−1/2,k through discretization of the Maxwell-Faraday equation [32]; + y y y y we do not derive this here because in plasmonic simulations it ∆x∆y is typically unchanged from the vacuum equation. In what P i+1,j,k+1/2 − P i+1,j,k−1/2 − P i,j,k+1/2 + P i,j,k−1/2 + z z z z , follows, we present the electric field update equation f2 ∆x∆z (derived from the Maxwell-Ampere` law), the bound charge (18a) polarization update equation f3 (derived from Eq. 10), and the free charge polarization update f4 (derived from Eq. 8 for i,j+1/2,k ! i,j+1/2,k the nonlocal Drude model, and Eq. 9 for the GNOR model). ∂ ∂P ∂P ∂P ∇(∇·Pn ) = x + y + z Though we present f4 for two nonlocal models, one must NL ∂y ∂x ∂y ∂z chose which to use – they cannot be used simultaneously. y y P i,j+3/2,k − 2P i,j+1/2,k + P i,j−1/2,k ≈ y y y ∆y2 TABLE I i+1/2,j+1,k i−1/2,j+1,k i+1/2,j,k i−1/2,j,k YEE CELLPOSITIONSOFTHEELECTRIC, MAGNETIC, AND POLARIZATION P − P − P + P + x x x x FIELDS ∆x∆y Electric Magnetic Polarization i,j+1,k+1/2 i,j+1,k−1/2 i,j,k+1/2 i,j,k−1/2 Pz − Pz − Pz + Pz Ex,Hx,Px i+1/2, j, k i, j+1/2, k+1/2 i+1/2, j, k + , Ey,Hy,Py i, j+1/2, k i+1/2, j, k+1/2 i, j+1/2, k ∆y∆z Ez,Hz,Pz i, j, k+1/2 i+1/2, j+1/2, k i, j, k+1/2 (18b) 4
where
i,j,k+1/2 ! i,j,k+1/2 ∂ ∂P ∂P ∂P 2 2 ! n x y z 1 2 Ωp + Γp ∇(∇·PNL) = + + C = − (24) ∂z ∂x ∂y ∂z 1p C ∆t2 2 z z p i,j,k+3/2 i,j,k+1/2 i,j,k−1/2 2 2 ! Pz − 2Pz + Pz Ω + Γ ≈ 1 Γp 1 p p 2 C2p = − 2 − (25) ∆z Cp ∆t ∆t 4 i+1/2,j,k+1 i−1/2,j,k+1 i+1/2,j,k i−1/2,j,k Px − Px − Px + Px C ε A Ω sin φ + C = 4p − 0 p p p (26) ∆x∆z 3p 2 2∆tC i,j+1/2,k+1 i,j−1/2,k+1 i,j+1/2,k i,j−1/2,k p Py − Py − Py + Py ε A Ω (Ω cos φ − Γ sin φ ) + , C = 0 p p p p p p (27) ∆y∆z 4p C (18c) p C4p ε0ApΩp sin φp C5p = + (28) 2 2∆tCp 2 2 where, for brevity, the components of PNL are written as Pa Γ 1 Ω + Γ C = p + + p p , (29) for a = x, y, z. p ∆t ∆t2 4 For the GNOR model, the f4 update equation includes Eqs. 12-18 (with β2 replaced by η) along with a suitable discretization for the last term of Eq. 9, the only term that where p denotes the pth critical point. does not appear in the nonlocal Drude model. As this term n+1 cannot be achieved via central differencing, we use an alternate Next we turn to the f2 update equation for E . Ampre’s second order finite difference scheme given by law is discretized in time to give
∂ 3∇(∇ · Pn ) − 4∇(∇ · Pn−1) + ∇(∇ · Pn−2) ∇(∇·Pn ) ≈ G G G . n+1 n n+1 n N n+1 n G E − E Pf − Pf X (PCP,p − PCP,p) ∂t 2∆t ε ε + + (19) 0 ∞ ∆t ∆t ∆t p=1 = ∇ × Hn+1/2. (30)
The GNOR f4 update then becomes
For the nonlocal Drude model we set Pf = PNL, and plug
n+1 n n−1 n−1 n n+1 Eq. 12 into Eq. 30 to obtain PG = D1PG + D2PG + D3(E + 2E + E ) n + DG1∇(∇ · PG) n n−1 n−2 + DG2(3∇(∇ · PG) − 4∇(∇ · PG ) + ∇(∇ · PG )), (20) N ! n+1 X E ε0ε∞ + D3 + C3p p=1 where N ! n+1/2 X n = ∆t(∇ × H ) + ε0ε∞ − 2D3 + C4p E η p=1 DG1 = , (21) Dd N ! D X n−1 n − D3 + C5p E − (D1 − 1)P DG2 = . (22) NL 2∆tDd p=1 N X n n−1 − (C1p − 1)PCP,p − D2PNL p=1 The f3 update equation for the bound charge density is derived from the critical points model (Eq. 10) in Ref. [31]. N X n−1 n We state it here: − C2pPCP,p − DNL∇(∇ · PNL). p=1 (31)
n+1 n n−1 n+1 n n−1 PCP,p = C1pPCP,p+C2pPCP,p+C3pE +C4pE +C5pE (23) For the GNOR model we set Pf = PG, and plug Eq. 20 into 5
Eq. 30 to obtain TABLE II SUBDOMAIN UPDATE SCHEME FOR DIFFERENT FIELD COMPONENTS.
N ! Component x dimension y dimension z dimension n+1 X Ex and Px 0 → Nx − 1 1 → Ny 1 → Nz E ε0ε∞ + D3 + C3p Ey and Py 1 → Nx 0 → Ny − 1 1 → Nz p=1 Ez and Pz 1 → Nx 1 → Ny 0 → Nz − 1 N ! Hx 1 → Nx 0 → Ny − 1 0 → Nz − 1 n+1/2 X n = ∆t(∇ × H ) + ε0ε∞ − 2D3 + C4p E Hy 0 → Nx − 1 1 → Ny 0 → Nz − 1 p=1 Hz 0 → Nx − 1 0 → Ny − 1 1 → Nz N ! X − D + C En−1 − (D − 1)Pn 3 5p 1 G � and � field updates � field updates p=1 � , � �! ! ! �! �! N �", �" �" X n n−1 − (C1p − 1)PCP,p − D2PG p=1
N � X n−1 n − C2pPCP,p − DG1∇(∇ · PG) � p=1 0 0 n n−1 n−2 0 � 0 � − DG2(3∇(∇ · PG) − 4∇(∇ · PG ) + ∇(∇ · PG )). " " (32)