Electromagnetic Particle-in-Cell Algorithms on Unstructured Meshes for Kinetic Plasma Simulations
Dissertation
Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University
By
Dong-Yeop Na, M.S.
Graduate Program in Electrical and Computer Engineering
The Ohio State University
2018
Dissertation Committee:
Prof. Fernando L. Teixeira, Advisor Prof. Kubilay Sertel Prof. Robert Lee c Copyright by
Dong-Yeop Na
2018 Abstract
Plasma is a significantly ionized gas composed of a large number of charged parti- cles such as electrons and ions. A distinct feature of plasmas is the collective interac- tion among charged particles. In general, the optimal approach used for modeling a plasma system depends on its characteristic (temporal and spatial) scales. Among var- ious kinds of plasmas, collisionsless plasmas correspond to those where the collisional frequency is much smaller than the frequency of interests (e.g. plasma frequency) and the mean free path is much longer than the characteristic length scales (e.g. Debye length).
Collisionless plasmas consisting of kinetic space charge particles interacting with electromagnetic fields are well-described by Maxwell-Vlasov equations. Electromag- netic particle-in-cell (EM-PIC) algorithms solve Maxwell-Vlasov systems on a com- putational mesh by employing coarse-grained superparticle. The concept of super- particle, which may represent millions of physical charged particles (coarse-graining of the phase space), facilitates the realization of computer simulations for under- scaled kinetic plasma systems mimicking the physics of real kinetic plasma systems.
In this dissertation, we present an EM-PIC algorithm on general (irregular) meshes based on discrete exterior calculus (DEC) and Whitney forms. DEC and Whitney forms are utilized for consistent discretization of Maxwells equation on general ir- regular meshes. The proposed EM-PIC algorithm employs a mixed finite-element
ii time-domain (FETD) field solver which yields a symplectic integrator satisfying en-
ergy conservation. Importantly, we employ Whitney-forms-based gather and scatter
schemes to obtain exact charge conservation from first principles, which had been a
long-standing challenge for PIC algorithms on irregular meshes.
Several further contributions are made in this dissertation: (i) We develop a local and explicit EM-PIC on unstructured grids using sparse approximate inverse (SPAI) strategy and study macro- and microscopic residual errors in motions of charged par- ticles affected by the approximate inverse errors. (ii) We extend the present EM-PIC algorithm to the relativistic regime with several relativistic particle-pushers and com- pare their performance. (iii) We implement a secondary electron emission (SEE) processor based on probabilistic Furman-Pivi model and numerically investigate mul- tipactor effects that are resonant electron discharges from conducting surfaces by external RF fields. (iv) We diagnose numerical Cherenkov radiation, which is a detri- mental effect frequently found in EM-PIC simulations involving relativistic plasma beams, for the present EM-PIC algorithm on general meshes. (v) We extend the
FETD field solver for the solution of Maxwell’s equations in circularly symmetric or body-of-revolution (BOR) geometries. (vi) Lastly, we combine the EM-PIC algo- rithm with the BOR-FETD field solver for the efficient analysis of vacuum electronic devices (VED).
iii Dedicated to my beloved wife Da-Young and my family
iv Acknowledgments
First and foremost, I would like to express my sincere gratitude to my advisor,
Prof. Fernando L. Teixeira, for the support, encouragement, and guidance during the years of my graduate study. It has been a great honor and privilege to work with him.
His passion and immense knowledge in electromagnetics, mathematics, and physics, and kindness and commitment to his students will always inspire me.
Besides, I would like to thank Dr. Yuri A. Omelchenko and Prof. Ben-Hur V.
Borges for their helpful discussions and suggestions.
My special appreciation also goes to the members of my doctoral committee, Prof.
Kubilay Sertel and Prof. Robert Lee, for insightful comments.
I would like to thank to many of ESL colleagues, past and present, Haksu Moon,
WoonGi Yeo, Jungwhan Park, Carlos A. Viteri, Cagdas Gunes, Daniel O. Acero, and
Julio L. Nicolini, and my friends, Yun-Shik Hahn, Chunghyun Lee, Jongchan Choi,
Kyoung-Ho Jeong, and Huyngjun Kim.
I wish to thank my family for their constant support and unconditional love.
Last but not least, I would like to share this accomplishment with my beloved wife, Da-Young, and sincerely appreciate her her encouragement, support, and love.
v Financial support from National Science Foundation grant ECCS-1305838, De- fense Threat Reduction Agency grant HDTRA1-18-1-0050, Ohio Supercomputer Cen- ter grants PAS-0061 and PAS-0110, and The Ohio State University Presidential Fel- lowship Program are gratefully acknowledged.
vi Vita
March 30, 1987 ...... Born - Seoul, Korea
Feburary, 2012 ...... B.S. in Electrical and Computer Eng., Ajou University, Suwon, Korea July, 2014 ...... M.S. in Electrical and Computer Eng., Ajou University, Suwon, Korea August, 2014-May, 2017 ...... Graduate Research Associate, ElectroScience Laboratory, The Ohio State University, USA May, 2017-May, 2018 ...... Presidential Fellowship Program, The Ohio State University, USA May, 2018-August, 2018 ...... Graduate Research Associate, ElectroScience Laboratory, The Ohio State University, USA August, 2018-present ...... Graduate Teaching Associate, Electrical and Computer Eng., The Ohio State University, USA
Publications
Jounral Publications
Dong-Yeop Na, Haksu Moon, Yuri A. Omelchenko, Fernando L. Teixeira, “Local, explicit, and charge-conserving electromagnetic particle-in-cell algorithm on unstruc- tured grids,” IEEE Trans. Plasma Sci., 44 (2016) 1353–1362.
Dong-Yeop Na, Yuri A. Omelchenko, Haksu Moon, Ben-Hur V. Borges, Fernando L. Teixeira, “Axisymmetric charge-conservative electromagnetic particle simulation algorithm on unstructured grids: Application to microwave vacuum electronic De- vices,” J. Comput. Phys., 346 (2017) 295–317.
vii Dong-Yeop Na, Haksu Moon, Yuri A. Omelchenko, Fernando L. Teixeira, “Rel- ativistic extension of a charge-conservative finite element solver for time-dependent Maxwell-Vlasov equations,” Phys. Plasmas, 25 (2018) 013109.
Dong-Yeop Na, Ben-Hur V. Borges, Fernando L. Teixeira, “Finite element time- domain body-of-revolution Maxwell solver based on discrete exterior calculus,” J. Comput. Phys., 376 (2017) 249–275.
Conference publications
Dong-Yeop Na, Fernando L. Teixeira, Yuri A. Omelchenko, “Charge-conserving relativistic PIC algorithm on unstructured grids,” 2016 USNC-URSI National Radio Science Meeting, Boulder, CO, Jan. 6-9, 2016.
Dong-Yeop Na, Fernando L. Teixeira, H. Moon, Yuri A. Omelchenko, “Full-wave FETD-based PIC algorithm with local explicit update,” 2016 IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, Fajardo, PR, June 26-July 1, 2016.
Dong-Yeop Na, Fernando L. Teixeira, Yuri A. Omelchenko, “Unstructured-grid and conservative electromagnetic particle-in-cell: application to micromachined slow- wave structures,” 2016 IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, Fajardo, PR, June 26-July 1, 2016.
Dong-Yeop Na, Yuri A. Omelchenko, Fernando L. Teixeira, “An efficient algorithm for simulation of plasma beam high-power microwave sources,” 2017 IEEE MTT-S International Microwave Symposium, Honolulu, HI, June 4-9, 2017.
Dong-Yeop Na, Fernando L. Teixeira, Ben-Hur V. Borges, “Finite-element time- domain solver for axisymmetric devices based on discrete exterior calculus and trans- formation optics,” 2017 SBMO/IEEE MTT-S International Microwave and Opto- electronics Conference, Aguas de Lindoia, Brazil, Aug. 27-30, 2017.
Dong-Yeop Na, Yuri A. Omelchenko, Fernando L. Teixeira, “Irregular-grid-based particle-in-cell simulations of resonant electron discharges with probabilistic secondary electron emission model,” 2017 XXXIInd General Assembly and Scientific Sympo- sium of the International Union of Radio Science, Montreal, QC, Canada, August 19-26, 2017.
viii Dong-Yeop Na, Yuri A. Omelchenko, Fernando L. Teixeira, “Discretization of Maxwell-vlasov equations based on discrete exterior calculus,” 2017 XXXIInd General Assembly and Scientific Symposium of the International Union of Radio Science, Montreal, QC, Canada, August 19-26, 2017.
Dong-Yeop Na, Julio L. Nicolini, Robert Lee, Ben-Hur V. Borges, Yuri A. Omelchenko, Fernando L. Teixeira, “Diagnosis of Numerical Cherenkov Instability in Plasma Simu- lations on General Mesh,” Computational Aspects of Time Dependent Electromagnetic Wave Problems in Complex Materials, The Institute of Computational and Experi- mental Research in Mathematics (ICERM), Providence, RI, June 24-29, 2018.
Dong-Yeop Na, Fernando L. Teixeira, Yuri A. Omelchenko, “Dispersion Analy- sis of Electron Bernstein Waves in Magnetized Warm Plasmas by Finite Element Particle-in-Cell Modeling,” 2018 IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, Boston, MA, July 8-13, 2018.
Dong-Yeop Na, Fernando L. Teixeira, Yuri A. Omelchenko, “Numerical Cherenkov Radiation Effects from Grid Dispersion in Finite Element Particle-in-Cell Simulations of Relativistic Electron Beams,” 2018 IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, Boston, MA, July 8-13, 2018.
Fields of Study
Major Field: Electrical and Computer Engineering
Studies in: Electromagnetic theory Computational electromagnetics Antennas Mathematics
ix Table of Contents
Page
Abstract ...... ii
Dedication ...... iv
Acknowledgments ...... v
Vita ...... vii
List of Tables ...... xiv
List of Figures ...... xvi
1. Introduction ...... 1
1.1 Background and motivation ...... 1 1.2 Contribution of this dissertation ...... 6 1.3 Organization of this dissertation ...... 7
2. Local, Explicit, and Charge-conserving EM-PIC on Unstructured Mesh . 11
2.1 Explicit FETD-PIC Algorithm ...... 13 2.1.1 Mixed E − B FETD scheme ...... 14 2.1.2 Gather-scatter and particle pusher steps ...... 16 2.1.3 Discrete continuity equation ...... 17 2.2 Sparse Approximate Inverse (SPAI) strategy ...... 19 2.2.1 Discrete Gauss’ law ...... 20 2.3 Numerical Results ...... 21 2.3.1 Single-particle trajectories ...... 21 2.3.2 Plasma ball expansion ...... 27 2.3.3 Electron beam in a vacuum diode ...... 30 2.3.4 Electron Bernstein waves ...... 33
x 2.4 Conclusion ...... 35
3. Relativitic Extension of Particle-Pusher ...... 36
3.1 Particle-pushers in the relativistic regime ...... 37 3.1.1 Relativistic Boris pusher ...... 38 3.1.2 Vay pusher ...... 40 3.1.3 Higuera-Cary pusher ...... 41 3.2 Numerical results ...... 41 3.2.1 Synchrocyclotron ...... 41 3.2.2 Harmonic oscillations in Lorentz-boosted frame ...... 44 3.2.3 Relativistic Bernstein Modes in Magnetized Pair-Plasma . . 47 3.3 Conclusion ...... 55
4. Multipactor ...... 57
4.1 Irregular-Grid EM-PIC Algorithm integrated with Furman-Pivi model 60 4.2 Charge-conserving scatter near conducting surface ...... 61 4.3 Furman-Pivi SEE model implementation ...... 62 4.4 Numerical Results and Discussion ...... 63 4.4.1 Verification of SEE model in EM-PIC simulations ...... 63 4.4.2 Multipactor on copper versus stainless steel surfaces . . . . 66 4.4.3 Surface treatment effects ...... 68 4.4.4 Multipactor susceptibility to RF voltage amplitude . . . . . 70 4.4.5 Multipactor saturation effects ...... 72 4.5 Conclusion ...... 75
5. Numerical Cherenkov Radiation and Grid Dispersion Effects ...... 79
5.1 Numerical Cherenkov Radiation in the FDTD-based EM-PIC Algo- rithm ...... 81 5.2 Numerical Cherenkov Radiation in finite-element-based EM-PIC Al- gorithms ...... 85 5.2.1 SQ Mesh ...... 88 5.2.2 Triangular-element-based FE meshes ...... 92 5.3 Numerical Experiments ...... 100 5.4 Conclusion ...... 109
6. Finite-Element Time-Domain Body-of-Revolution Maxwell-Solver . . . . 111
6.1 Formulation ...... 113 6.1.1 Exploration of transformation optics (TO) concepts . . . . . 113 6.1.2 Field decomposition ...... 115
xi 6.1.3 Mixed FE time-domain BOR solver ...... 116 6.1.4 Symmetry axis singularity treatment ...... 123 6.2 Numerical Examples ...... 125 6.2.1 Cylindrical cavity ...... 126 6.2.2 Logging-while-drilling sensor simulation ...... 129 6.3 Conclusion ...... 137
7. Axisymmetric Electromagnetic Particle-in-Cell Algorithm: Application to Microwave Vacuum Electronic Devices ...... 140
7.1 Spatial dimensionality reduction ...... 145 7.1.1 Exterior calculus representation of Maxwell’s equations . . . 145 7.1.2 Cylindrical axisymmetry constraints ...... 146 7.1.3 Modified Hodge star operator ...... 147 7.2 Validation ...... 150 7.2.1 Metallic hollow cylindrical cavity ...... 150 7.2.2 Space-charge-limited (SCL) cylindrical diode ...... 152 7.3 Numerical examples ...... 156 7.3.1 Relativistic backward-wave oscillator (BWO) ...... 158 7.4 Conclusion ...... 166
Appendices 168
A. Basics of Plasmas ...... 168
A.1 Fundamental parameters ...... 168 A.2 Quasi-neutrality in plasma ...... 170 A.3 Plasma oscillation ...... 171 A.4 Collisions in plasmas ...... 172
B. Kinetic Plasma Description ...... 174
B.1 Plasma kinetic equation ...... 174 B.2 Vlasov equation for collisionless plasmas ...... 178 B.3 Superparticle: Coarse-grained f (x, v, t) ...... 179 B.4 Maxwell-Vlasov or Poisson-Vlasov systems ...... 181
C. Discrete Exterior Caclulus (DEC) ...... 183
C.1 Whitney forms ...... 183 C.2 Pairing operation ...... 184 C.3 Generalized Stokes’ theorem ...... 184
xii C.4 Discretization of Maxwell’s equation ...... 184 C.4.1 Cartesian coordinates case ...... 184 C.4.2 Body-of-revolution case ...... 185 C.5 Incidence Matrices ...... 187 C.6 Discrete Hodge matrix ...... 189 C.7 Barycentric dual lattice relations ...... 193
D. Cartesian-like PML implementation ...... 195
E. Stability Conditions ...... 201
Bibliography ...... 203
xiii List of Tables
Table Page
2.1 Number of elements in Meshes 1, 2, and 3 ...... 22
2.2 Convention used for particle trajectory visualization...... 24
3.1 Verification of discrete Gauss’ law for the non-relativistic case (Fig. 3.2a)...... 45
3.2 Verification of discrete Gauss’ law for the relativistic case without syn- chronization (Fig. 3.2b)...... 46
3.3 Verification of discrete Gauss’ law for the relativistic case with syn- chronization (Fig. 3.2c)...... 46
4.1 Multipactor simulation parameters for the parallel waveguide in Fig. 4.7a. 69
4.2 Triangularly-grooved surface parameters...... 70
4.3 Mesh parameters...... 70
4.4 Spectral amplitude of output voltage signals for high-order harmonics. 73
5.1 Basic meshes properties...... 102
6.1 Maximum time-step intervals for various cases in the simulation of cylindrical metallic cavity...... 127
6.2 Eigenfrequencies for the cylindrical cavity and normalized errors be- tween numerical and analytic results...... 132
7.1 Estimation of the run time of EM-PIC simulations based on FETD and FDTD at each time-update...... 149
xiv 7.2 Resonant frequencies for axisymmetric cavity modes and normalized errors between numerical and analytic works...... 153
7.3 Mesh information for different SCSWS cases ...... 163
xv List of Figures
Figure Page
2.1 Basic steps in a EM-PIC algorithm. On unstructured meshes, conven- tional field solvers are implicit, requiring the solution of a (large) linear system at each time step...... 14
2.2 Charge-conserving gather and scatter steps [1]. (a) Interpolation of E and B at the position of the particle by edge-based (left) and face- based degrees of freedom contributions (right) (weighted by the Whit- ney functions) in the gather step. (b) Exact charge-conserving scatter scheme. The sum of the two colored areas in the left, representing the magnitude of the edge currents, is equal to the blue area in the left, representing the charge variation at node 1 during one time step. . . 16
2.3 Relative position difference (RPD) of the various test particles w.r.t. the standard particle placed at the origin, in a polar diagram where the radial distance is represented in logarithmic scale...... 23
2.4 Results for a circular particle trajectory on 3 different meshes. (a) (b) (c) Particle trajectory histories. (d) (e) (f) RPDs versus time for the four test particles. (g) (h) (i) Normalized RPD bands for the four test particles...... 25
2.5 Results for a trajectory with drift. (a) (b) (c) Particle trajectory his- tory. (d) (e) (f) RPDs versus time for the four test particles. (g) (h) (i) Normalized RPD bands for the four test particles...... 27
2.6 Radial current versus radius coordinate for the expanding plasma at time step n = 9 × 104 using the LU-based implicit fields solver and the SPAI-based explicit field solver with k = 2, 4, and 6...... 28
xvi 2.7 (a) Normalized residuals of the discrete continuity equation for the plasma ball expansion example using different field solvers, at t = 2×104∆t. (b) Similar results for the discrete Gauss’ law. (c) Averaged normalized residuals for the discrete Gauss’ law versus time step index. 29
2.8 Results for the accelerated electron beam at t = 6×104∆t. (a) (b) Par- ticle distribution snapshot from charge-conserving EM-PIC algorithms using an LU-based implicit solver and a SPAI-based (k = 2) explicit solver, respectively . (c) Particle distribution snapshot from a con- ventional (non-charge conserving on the unstructured grid) EM-PIC algorithm with an LU-based implicit solver. (d) (e) (f) Corresponding electric-field profile distributions...... 31
2.9 Number density and average velocity of particles across a transversal section of the electron beam at t = 3 × 103∆t, after steady-state has been reached...... 32
2.10 Simulated ω × k dispersion diagram for the X mode propagation and
for electron Bernstein waves in a magnetized warm plasma. Here ωpe is the plasma frequency and ∆x is the grid spacing, chosen uniform. The analytical results are indicated by the red dots in the diagram. Note that the use of a charge-conserving scatter step in PIC algorithm as described in [1] reduces the numerical noise and yields cleaner spectral bands in the numerically generated band diagrams. In addition, a charge-conserving scatter step mitigates the spurious DC field cause by spurious charge accumulation on the grid nodes, as observed at the bottom of the zoomed plots. Overall, a very good agreement is observed between the numerical and the analytical results...... 34
3.1 (a) Cyclotron configuration. (b) Computational domain, where the blue vertical strip indicates the region where an external longitudinal RF electric field is applied. The DC magnetic field is applied in the whole computational region except for the RF acceleration gap (red). 42
3.2 Electron trajectories on a cyclotron: (a) Non-relativistic, (b) Relativis- tic, unsynchronized, and (c) Relativistic, synchronized...... 43
3.3 Orbital frequency and relativistic factor for the case shown in Fig. 3.2c. 44
3.4 Comparison of electron velocity magnitudes of the three cases shown in Fig. 3.2...... 45
xvii 3.5 Motion of harmonic oscillator of a single positron inverse-Lorentz- transformed into Laboratory frame...... 47
3.6 Dispersion relations for classical (non-relativistic) electron Bernstein modes of PIC results (Parula colormap) and analytic predictions [2] (dashed red line)...... 48
3.7 An isotropic 2D Maxwell-Boltzmann-J¨uttnervelocity distribution, f0 (p) for η = 1/20: (a) Speed distribution and (b) relativistic velocity dis- tribution...... 51
3.8 Dispersion relations for plasma waves propagating in magnetized rel- ativistic pair-plasma for η = 1/20: Comparison of PIC results and analytic prediction...... 52
3.9 Normalized residuals versus nodal index for (a) discrete continuity equation (DCE) and (b) discrete Gauss law (DGL)...... 53
4.1 Schematic illustration of a typical SEE process in an irregular-grid-based EM-PIC simulation. Note that electric current densities by the primary or secondaries are deposited on red- or blue-highlighted edges, respectively. . 58
4.2 Comparison of simulation and experimental results for SEE on copper [(a) and (b)] and stainless steel [(c) and (d))] surfaces. Figures (a) and (c) illustrate SEY δ versus the primary incident energy. Figures (b) and (d) show the emitted-energy spectrum dδ/dE...... 59
4.3 Geometrical illustration of exact charge conservation on irregular grids for a primary impact (also applicable for secondary electrons emitted on the opposite way) at PEC surfaces during ∆t. Plot (a) depicts the charge vari- ation rate at jth node. Plot (b) depicts the divergence of current on jth node, which is equal to the sum of ith and kth currents...... 61
4.4 Angular dependence of δ on a copper surface...... 65
xviii 4.5 PIC results for probabilistic SEE model. (a) Superparticle population versus time (RF voltage periods). (b) and (c) Snapshots of particle’s trajectories for copper and stainless steel cases, respectively. These trajectory snapshots
are taken during four successive half-periods of the RF signal, i.e.: t/TRF ∈ (0, 0.5), t/TRF ∈ (0.5, 1), t/TRF ∈ (1, 1.5), and t/TRF ∈ (1.5, 2), where TRF = 1/fRF = 0.96 [ns]...... 65
4.6 Particle trajectory snapshots on the phase space. The coordinate axes rep- resent x/10 [m], y [m], and the normalized speed of the particles (|vp| /20c). Each plot corresponds to a half-period of the RF signal, as in Fig. 4.5. . . 66
4.7 Multipactor in parallel plate waveguides. (a) Schematics of the problem ge- ometry. (b) Flat surface waveguide meshing. (c) Triangular-grooved waveg- uide meshing...... 68
4.8 RF voltage amplitude susceptibility at fRFDpp = 4 [GHz·mm] for flat and grooved copper surfaces...... 71
4.9 RF voltage cycle versus population amplification, An for both surfaces at a VRF = 1, 143.16 V...... 72
4.10 Output signals for both surfaces (a) in time-domain and (b) frequency domain. 74
4.11 Particle position snapshots taken over a half RF period during the saturation regime. The RF voltage period is 0.5 ns. Plots (a)-(f) are for the flat surface and plots (g)-(l) are for the grooved surfaces...... 76
4.12 External-field and self-field snapshots taken over a half period during the saturation regime. The RF voltage period is 0.5 ns. Plots (a)-(f) are for the flat surface and plots (g)-(l) are for the grooved surfaces...... 77
4.13 Snapshots of vy [m/s] versus y [m] taken over a half RF period during the saturation regime. The RF voltage period is 0.5 ns. Plots (a)-(f) are for the flat surface and plots (g)-(l) are for the grooved surfaces...... 77
4.14 Snapshots of vx [m/s] versus y [m] taken over a half RF period during the saturation regime. The RF voltage period is 0.5 ns. Plots (a)-(f) are for the flat surface and plots (g)-(l) are for the grooved surfaces...... 78
xix 5.1 Numerical grid dispersion of the 2-D Yee’s FDTD scheme on a struc- tured mesh. (a) The red color surface represents the dispersion diagram of the normalized frequency ω∆t/π versus the normalized numerical wavenumber κ˜h in radians. The olive color surface represents the light cone. The contour levels at the bottom represent the normalized phase errors (with respect to the color bar). (b) Wavenumber magnitude ver- sus frequency for different wave propagation angles with respect to the o o x axis, φp ∈ [0 , 45 ]...... 82
5.2 Analytic NCR predictions on a structured FDTD grid for a bulk beam
velocity vb = 0.9c. (a) 3-D numerical dispersion diagrams (in red) and beam planes (fundamental plane in green and aliased beams in transparent yellow). (b) Trajectories of NCR solutions projected onto the 2-D κ˜-space...... 84
5.3 Schematic illustration of the four types of mesh considered in this study. (a) Square regular (SQ) elements in both FDTD and FETD, (b) right-angle triangular (RAT) elements in FETD, (c) isosceles tri- angular (ISOT) elements in FETD, and (d) highly-irregular triangular (HIGT) elements in FETD...... 86
5.4 Schematic of SQ mesh. There are two characteristic edges (A and B) directed along the y and x and colored in red and blue, respectively. . 88
5.5 Numerical grid dispersion for the FETD scheme on the SQ mesh. (a) The red color surface represents the dispersion diagram of the normal- ized frequency ω∆t/π versus the normalized numerical wavenumber κ˜h in radians. The olive color surface represents the light cone. The contour levels at the bottom represent the normalized phase errors (with respect to the color bar). Note that the normalized phase error is always negative in this case because of a slightly faster-than-light numerical phase velocity. (b) Projected dispersion curves for different o o wave propagation angles with respect to the x axis φp ∈ [0 , 45 ]. . . . 91
5.6 Analytic prediction of NCR for the FETD algorithm on the SQ mesh
when vb = 0.9c. (a) 3-D dispersion diagram. (b) NCR solution con- tours projected onto the first Brillouin zone in the κ˜-space...... 92
xx 5.7 A periodically-arranged triangular grid. It has three characteristic edges denoted by A, B, and C. Labels inside circles denote global facet indexes and labels inside rectangles and pentagons denote local edge and node indexes, respectively...... 93
5.8 Numerical grid dispersion for the FETD scheme on the RAT mesh with the CFL number equal to one. Unlike the FDTD or FETD-SQ cases, this diagram exhibits an additional (upper) dispersion band. (a) The red (lower band) and blue (upper band) color surfaces represent the dispersion diagram of the normalized frequency ω∆t/π versus the nor- malized numerical wavenumber κ˜h in radians. The olive color surface represents the light cone. The contour levels at the bottom repre- sent the normalized phase errors (with respect to the color bar). (b) Projected dispersion curves for different wave propagation angles with o o respect to the x axis φp ∈ [−45 , 45 ]...... 96
5.9 (a) The vector proxy of a Whitney 1-form associated with the edge −→ AB on a triangular mesh. (b) Tangential component along edge. (c) Normal component to the edge direction...... 97
5.10 Analytic prediction of NCR for the FETD-based EM-PIC scheme on
the RAT mesh assuming a plasma beam with bulk velocity vb = 0.9c. (a) Dispersion diagram. (b) NCR solution contours projected onto the first Brillouin zone in the κ˜-space...... 98
5.11 Numerical grid dispersion for the FETD scheme on the ISOT mesh with CFL number equal to one. Unlike the FDTD or FETD-SQ cases, this diagram exhibits an additional (upper) dispersion band. (a) The red (lower band) and blue (upper band) color surfaces represent the dispersion diagram of the normalized frequency ω∆t/π versus the nor- malized numerical wavenumber κ˜h in radians. The olive color surface represents the light cone. The contour levels at the bottom and top represent the normalized phase errors (with respect to the color bar). (b) Projected dispersion curves for different wave propagation angles o o with respect to the x axis φp ∈ [26.57 , 90 ]...... 98
5.12 Analytic prediction of NCR for the FETD-based EM-PIC scheme on
the ISOT mesh assuming a plasma beam with bulk velocity vb = 0.9c. (a) Dispersion diagram. (b) NCR solution contours projected onto the first Brillouin zone in the κ˜-space...... 99
xxi 5.13 Initial velocity distributions for a relativistic pair plasma beam with
bulk velocity vb = 0.9c (γb ≈ 2.3). (a) Phase space in the beam rest frame. (b) Velocity distribution in the beam rest frame. (c) Phase space in the laboratory frame. (d) Velocity distribution in the labora- tory frame...... 101
5.14 (a) HIGT mesh. (b) Histogram of the edge lengths. (c) Histogram of the triangular element angles...... 102
5.15 B field amplitude distribution (log scale) over the first Brillouin zone in the κ˜-space as measured from EM-PIC simulation snapshots at 47 µs. (a) and (c) plots correspond to FDTD- and FETD-based EM-PIC sim- ulations on the SQ mesh, respectively. In (b) and (d), the analytical predictions are superimposed to the numerical results...... 104
5.16 B field amplitude distribution (log scale) over the first Brillouin zone in the κ˜-space as measured from EM-PIC simulation snapshots at 47 µs. (a) and (c) plots correspond to FETD-based EM-PIC simulations on the RAT and ISOT meshes, respectively. In (b) and (d), the analytical predictions are superimposed to the numerical results...... 105
5.17 B field amplitude distribution (log scale) over the first Brillouin zone in the κ˜-space as measured from FETD-based EM-PIC simulation snap- shots at 47 µs on the HIGT mesh...... 107
5.18 The qualitative comparison of the B field amplitude distribution (log scale) on the κ˜-space between FDTD and FETD-HIGT cases. (a)
shows the spectral amplitude of B versusκ ˜yh at some fixed values of κ˜yh and vice-versa in (b)...... 107
5.19 Evolution of the magnetic energy Wm due to NCR on various meshes. 108
5.20 Snaphots of the magnetic field distribution resulting from EM-PIC simulations of a single electron-positron pair moving relativistically. The snapshots are taken at 75.2 ns, 112.8 ns, and 150.4 ns, as indicated. The results correspond to: (a-c) FDTD-based EM-PIC simulation on SQ mesh , (d-f) FETD-based EM-PIC simulation on SQ mesh, (g- i) FETD-based EM-PIC simulation on the RAT mesh, (j-l) FETD- based EM-PIC simulation on ISOT mesh, (m-o) FETD-based EM-PIC simulation on HIGT mesh...... 110
xxii 6.1 Depiction of an axisymmetric structure...... 113
6.2 (2+1) setup for fields on (a) primal and (b) dual meshes at the meridian plane. The vertical axis is ρ and the horizontal axis is z...... 116
6.3 Vector proxies of various degrees of Whitney forms on the mesh: (a) (1) (2) (0) (RWG) Wj , (b) Wk , (c) Wi , and (d) Wj . Note that tj is a unit vector tangential to j−th edge and parallel to its direction and nk is a unit vector normal to k−th face...... 119
6.4 Field boundary conditions on the primal mesh for the TEφ field with (a) perfect magnetic conductor (m = 0) and (b) perfect electric conductor (m 6= 0) and for the TMφ field with (c) perfect magnetic conductor (m 6= 0) and (d) perfect electric conductor (m = 0). Dashed lines indicate Dirichlet boundary condition, for example edges on the z axis representing a perfect electric conductor boundary for TEφ field in (b), or nodes on the z axis representing a perfect electric conductor boundary for the TMφ field in (d)...... 124
6.5 Schematic view of the simulated cylindrical cavity with perfect electric conductor (PEC) walls. The cavity dimensions are a = 0.5 m and h =1m...... 127
6.6 Normalized spectral amplitude for E, showing the eigenfrequencies of the cavity. Black solid lines correspond to the present FETD-BOR result. Red solid and blue dashed lines are analytic predictions for the
TEmnp and TMmnp eigenfrequencies, respectively...... 129
6.7 Transient snapshots for Ez inside the cylindrical cavity at (a) 1.0024 [µs], (b) 1.0028 [µs], (c) 1.0032 [µs], and (d) 1.0036 [µs]...... 130
6.8 Transient snapshots for Bz inside the cylindrical cavity at (a) 1.0024 [µs], (b) 1.0028 [µs], (c) 1.0032 [µs], and (d) 1.0036 [µs]...... 131
6.9 Logging-while-drilling sensor problem geometry (from inner to outer features): metallic mandrel, transmit (Tx) and receive (Rx) coil an- tennas, mud-filled borehole, and adjacent geological formation. . . . . 133
xxiii 6.10 Logging-while-drilling sensor responses. (a) First scenario: the con- ductivity of the adjacent geological formation is varied. (b) Second scenario: the sensor moves downward through a borehole surrounded by a geological formation with three horizontal layers...... 135
6.11 Computed (a) AR and (b) PD (in deg.) by a logging-while-drilling sensor surrounded by homogeneous geological formations with different conductivities. This corresponds to the first scenario in Fig. 6.10. The results from the present algorithm are compared against FDTD and NMM results [3] (see more details in the main text)...... 136
6.12 Computed PD (deg.) between the two receivers of the logging-while- drilling sensor versus the z position of the transmitter coil antenna. This corresponds to the second scenario in Fig. 6.10. The results from the present algorithm are compared against FDTD and NMM results [3] (see more details in the main text)...... 137
6.13 Electric field distribution during the half period for zTx = (a) −50 inch, (b) −25 inch, (c) 5 inch, (d) 25 inch, (e) 50, and (f) 70 inch. Note that
zTx = 0 at the interface between first (5 S/m) and second (0.0005 S/m) formations...... 138
7.1 Schematics of two examples of axisymmetric vacuum electronic devices. (a) Backward-wave oscillator producing bunching effects on an electron beam. Wall ripples are designed to support slow-wave modes in the device. (b) Space-charge-limited cylindrical vacuum diode...... 142
7.2 A charged ring travels inside an axisymmetric object bounded by PEC: (a) a 3D view, (b) the meridian plane...... 146
7.3 The original problem shown in Fig. 7.2 is replaced by an equivalent 2D problem in the meridian plane as depicted above, which considers TEφ-polarized EM fields on Cartesian space with an artificial inhomo- geneous medium. The variable coloring serves to stress the dependency of the artificial medium parameters on ρ...... 149
7.4 Snapshots for electric field distribution at 2 µs. Note that RGB colors and white arrows indicate magnitudes and vectors of the electric fields, respectively...... 151
7.5 Spectrum for resonant cavity modes from 1 MHz to 1 GHz...... 152
xxiv 7.6 Schematics for divergent and convergent flows in the cylindrical diode. 154
7.7 Space-charge-limited current density for various Lz/ρo and comparison between present EM-PIC simulations and KARAT by [4]...... 155
7.8 Electric field intensity of self- and external fields at the instant of vir- tual cathode formation...... 156
7.9 Shematics of backward-wave oscillator with an instant particle distri- bution snapshots at t = 21.50 ns...... 157
7.10 Electric potential distribution (contour plots) and corresponding elec- tric fields (vector plots) between the cathode and the anode...... 157
7.11 A zoomed-in region of four rightmost corrugations of Fig. 7.9 with RGB color scales reflecting particle velocities...... 159
7.12 Phase-space plot at 24.00 ns...... 160
7.13 A snapshot of steady-state self-fields (76.00 ns)...... 160
7.14 Output signal analysis in (a) time and (b) frequency domains. . . . . 161
7.15 Verification of charge conservation at nodes along time (at time-steps of 7.5 × 104, 9 × 104, 12 × 104) by testing NR levels of (a) DCE and (b) DGL...... 162
7.16 3D velocity plots for an electron beam with the BFS magnetic field of 0.5T...... 162
7.17 SCSWS boundary profiles for all cases...... 163
7.18 Field signal at the output port in (a) SCSWS and (b) staircased SC- SWS in the time domain...... 164
7.19 Normalized spectral amplitude at the output port in SCSWS and stair- cased SCSWS...... 165
7.20 Dispersion relations from “cold tests”...... 166
xxv C.1 Example (primal) unstructured mesh...... 187
C.2 Incidence matrices for (a) curl [Dcurl] and (b) gradient [Dgrad] operators for the mesh in Fig. C.1...... 188
C.3 Sparsity patterns for discrete Hodge matrices corresponding to the toy 0→0 1→1 −11→1 mesh depicted in Fig. C.1: (a) [?] , (b) [?] , (c) ?µ , and 2→2 (d) [?µ−1 ] ...... 192
xxvi Chapter 1: Introduction
1.1 Background and motivation
Plasma is a significantly ionized gas, known as the fourth state of matter, com- posed of a large number of charged particles such as electrons and ions [5]. A distinct feature in characterizing most plasmas originates from collective interactions among all charged particles through the long-range behavior of Coulomb forces [6] rather than binary interactions or hard collisions (between every two particles) which dominate molecular dynamics of neutral gases [7]. At low densities, plasmas behave classically and its underlying dynamics includes particle kinematics and electromagnetism.
In general, the approach used for modeling a plasma system depends on its char- acteristic (temporal and spatial) scales [8]. The simplest one is magnetohydrodynam- ics (MHD), which is computationally efficient, based on the assumption of plasmas behaving like fluids [5], but only captures large-scale phenomena, and some of the physics such waves and instabilities are not described. The most accurate model is of course to microscopically account for the dynamics of all charged particles. This is impractical though since, as noted, usual plasmas consist of large numbers of charged particles.
1 Among various kinds of plasmas, collisionsless plasmas correspond to those where the strong binary Coulomb collisions are almost negligible for their description [6,
9, 10]. This occurs if the collisional frequency is much smaller than the frequency of interest (e.g. plasma frequency) and the mean free path is much longer than the characteristic length scale (e.g. Debye length). The main focus of this dissertation will be on the study of collisionless plasmas.
The behavior of collisionless plasmas is governed by Vlasov equation which de- scribes nonlinear evolution of the phase space distribution function, viz. the num- ber density over the 6-dimensional phase space (position and momentum) [9]. The
Maxwell-Vlasov system is the combination of the Vlasov equation with Maxwell’s equations in a multi-physical system involving (1) Maxwell’s equation, (2) Newton’s law of motions, and (3) Lorentz force acting on each particle [10]. In this system each particle will be tracked in 3-dimensional Euclidean space in response to Lorentz forces. In modeling collisionless plasmas, a coarse-graining of the phase distribu- tion function (relatively macroscopic treatment) is employed to make the number of simulated particles not too large. In this case, superparticles are employed, each representing typically several millions of actual charged particles [11–13].
Electromagnetic particle-in-cell (EM-PIC) algorithm is a numerical approach to solve the Maxwell-Vlasov system by temporally tracking all superparticles over the
Euclidean space [11–13]. From their kinetic movement, the algorithm calculates their equivalent currents with the (direct or Galerkin) projection onto a grid (cell complex) which reconstructs the original problem domain. Subsequently, EM fields driven by the currents are to be solved by applying conventional computational electromagnetic
(CEM) techniques, specifically, discrete counterparts of EM fields are updated on the
2 grid. Then, the updated discrete fields are interpolated at the superparticles’ posi- tions so as to evaluate Lorentz forces acting on superparticles. Finally superparticles are accelerated and pushed to the new positions by solving Lorentz force equation and Newton’s law of motion. The above describes a fundamental cycle that EM-PIC algorithm conducts at each time step and this is repeated through the desired simula- tion time window. The four steps in each cycle are called scatter, field-solver, gather, and particle-pushers, respectively [11–13].
Most previous EM-PIC algorithms have employed a structured grid with the use of the finite-difference time-domain (FDTD) algorithm [11–14] or the pseudo-spectral time-domain (PSTD) algorithm [15] and here demonstrated successful performances on various practical applications. Apart from the historical origin, the main reasons to use the structured grids are that (1) its formulation and implementation is rather simple but robust enough, (2) it is relatively straightforward to introduce finite size superparticles (shape factors) onto the grid that can alleviate some numerical arti- facts [12], (3) discrete charge conservation can be achieved for arbitrary orders of shape factors [16–18], and (4) superparticles can be easily tracked along the grid.
Nevertheless, structured grids present two fundamental drawbacks: (1) staircasing errors and (2) poor numerical dispersion properties [19, 20]. The former severely de- grades the geometric fidelity while modeling realistic devices that may include curved and slanted boundaries. In addition, local mesh refinement (to capture locally find features) is hampered. Furthermore, it becomes difficult to accurately model sec- ondary electron emission process from curved surfaces. As a result, structured grids necessitate the use of special treatments such as ad-hoc cut cell methods or conformal
finite-difference approaches [21] that may violate energy and charge conservation.
3 A natural alternative is to use unstructured grids based on the finite-element method (FEM). Such grids are devoid from staircasing errors and provide better per- formance w.r.t. numerical grid dispersions [22]. Moreover, unstructured grids enable a greater degree of space adaptivity using mesh refinement techniques. Conventional
FEM to solve for electromagnetic fields [23] are mostly based on vector wave equation in the frequency domain and implemented using either a weighted residual method or variational principle. In this dissertation, we shall utilize FEM applied for transient plasma problems on the time-domain. In this case, the time-varying Maxwell’s curl equations are discretized based on compatible discretization principles, yielding the so called mixed E −B finite-element time-domain (FETD) scheme [24–26]. In order to do that, the discrete exterior calculus of differential forms shall be utilized, shedding light on clearer geometrical meaning for all Maxwell dynamic variables hidden behind vector calculus [27–36].
A long-standing challenge for EM-PIC simulations on unstructured grids has been violation of charge conservation which requires a posterior corrections based on costly
Poisson’s solvers. Based on compatible discretization principles, a novel EM-PIC method on unstructured grids has been proposed in [1] which makes use of Whitney forms for the scatter and gather algorithms, guaranteeing exact charge conservation from first principles.
Nevertheless, there are still important challenges when using unstructured grids such as: (1) the resulting Maxwell field solver is implicit on the time domain, requiring a sequential linear solver at each time step and (2) A full analysis of the grid numerical dispersion remains necessary to evaluate grid-heating-effects and numerical Cherenkov instabilities in unstructured grids.
4 In this dissertation, we first develop a local and explicit EM-PIC on unstructured grids using a sparse approximate inverse (SPAI) strategy. We study the perturba- tions in the motion of charged particles induced by the approximate inverse error. In addition, we extend the EM-PIC algorithm on unstructured grids to the relativistic regime using several types of relativistic particle-pushers (Boris, Vay, and Higuera-
Cary pushers [37–39]). Their performance is compared analytically and numerically.
We implement realistic particle boundary conditions for secondary electron emission
(SEE) based on the probabilistic Furman-Pivi model [40] and study multipactor ef- fects associated to avalanched electrons resonant with external RF voltages frequently observed in high power microwave applications. In addition, we investigate numerical
Cherenkov radiation (NCR) or instability, which is a detrimental effect frequently found in EM-PIC simulations involving relativistic plasma beams.
Another motivation of this dissertations is in the development of the EM-PIC solvers in circularly symmetric or body-of-revolution (BOR) geometries, which is important a plethora of applications involving analysis and design of high power mi- crowave devices, directed energy devices and other applications. In the cylindrical coordinate system, azimuthal field variations can be described by eigenmodal ex- pansions, where the modal field solutions is reduced a 2-dimensional problem in the meridian ρz-plane. In this dissertation, we shall explore transformation optics (TO) principles [26, 41–45] to map the original 3-D BOR problem to a 2-D equivalent one in the meridian ρz-plane based on a Cartesian coordinate system where cylindrical metric is fully embedded into the constitutive properties of an effective inhomoge- neous and anisotropic medium that fills the domain. On the meridian plane, the
fields are decomposed into TEφ and TMφ polarizations. In this way, a Cartesian
5 2-D FETD code can be easily retrofitted to this problem with no modifications nec- essary except to accommodate the presence of the artificial medium. We validate the algorithm against analytic solutions for resonant fields in cylindrical cavities and pseudo-analytical solutions for the radiated fields by cylindrically symmetric antennas in layered media.
We combine the EM-PIC algorithm with the BOR-FETD scheme into an axisym- metric EM-PIC algorithm optimized for the analysis of vacuum electronic devices
(VED) [46–50]. These typically employ corrugated cylindrical or coaxial waveguides, called slow-wave structure (SWS), that interact with an energetic electron beam to produce high power microwaves. We use the algorithm to investigate the physical performance of VEDs designed to harness particle bunching effects arising from the coherent (resonance) Cherenkov electron beam interactions within micro-machined
SWSs.
1.2 Contribution of this dissertation
Main contributions of this dissertation are:
• Integration of a local and explicit FETD scheme with the sparse approximate in-
verse (SPAI) strategy with the previous charge conservative EM-PIC algorithm
on unstructured grids and investigation of the approximate inversion error in-
fluencing on the motion of charged particles.
• Extension of the algorithm to the relativistic regime with Boris, Vay, and
Higuera-Cary particle-pushers and comparison of their relative performance.
6 • Numerical analysis of parallel-plate multipactor effects based on probabilistic
Furman-Pivi model for the estimation of secondary electron emission process.
• Evaluation of numerical Cherenkov radiations (or instabilities) present in rela-
tivistic EM-PIC simulations with a generalized grid dispersion analysis account-
ing for different mesh element shapes.
• Development of a new FETD Maxwell solver for the general analysis of body-
of-revolution (BOR) geometries based on transformation optics concepts.
• Development of the EM-PIC algorithm optimized for the analysis of axisymmet-
ric vacuum electronic devices such as cylindrical vacuum diodes and backward-
wave oscillators.
More details for each contribution are presented in the next subsection.
1.3 Organization of this dissertation
This dissertation is organized as follows.
In Chapter 2, we present a charge-conserving EM-PIC algorithm on unstructured grids based on a FETD methodology with explicit field update, i.e., requiring no linear solver [51]. The proposed explicit EM-PIC algorithm attains charge conservation from
first principles by representing fields, currents, and charges by differential forms of various degrees, following the methodology put forth in [1]. The need for a linear solver is obviated by constructing a SPAI for the FE system matrix, which also preserves the locality (sparsity) of the algorithm. We analyze in detail the residual error caused by SPAI on the motions of charged particles and beam trajectories and
7 show that this error is several order of magnitude smaller than the inherent error caused by the spatial and temporal discretizations.
Accurate modeling of relativistic particle motions is essential for physical predic- tions in many problems involving vacuum electronic devices, particle accelerators, and relativistic plasmas. In Chapter 3, we extend the local, explicit, and charge-conserving
FETD-PIC algorithm to the relativistic regime by implementing and comparing three relativistic particle-pushers: (relativistic) Boris, Vay, and Higuera-Cary [52]. We illus- trate the application of the proposed relativistic FETD-PIC algorithm for the analysis of particle cyclotron motion at relativistic speeds, harmonic particle oscillation in the
Lorentz-boosted frame, and relativistic Bernstein modes in magnetized charge-neutral
(pair) plasmas.
In Chapter 4, we combine a novel FE-based EM-PIC algorithm for the solution of Maxwell-Vlasov equations on unstructured grids together with the Furman-Pivi probabilistic model governing the secondary electron emission (SEE) process [53].
The Furman-Pivi probabilistic model [40] is based on a broad phenomenological fit to experiment data to obtain accurate simulations of SEE process (rather than a conventional monoenergetic one). The algorithm is suited for the analysis of reso- nant electron discharging phenomena (multipactor effects) in high-power RF devices since the use of unstructured grids enables local mesh refinement and simulation of complex geometries with minimal geometrical defeaturing. We apply the algorithm to model multipactor effects on waveguides with flat or corrugated walls and contrast the evolution of the electron population in various cases and investigate the respective saturation process arising from self-field counterbalance effects.
8 In Chapter 5, we investigate numerical Cherenkov radiation (NCR) or instabil- ity which is a detrimental effect frequently found in EM-PIC simulations involv- ing relativistic plasma beams [54]. NCR is caused by spurious coupling between electromagnetic-field modes and multiple beam resonances. This coupling may result from the slowdown of poorly-resolved waves due to numerical (grid) dispersion and from aliasing mechanisms. NCR has been studied in the past for finite-difference- based EM-PIC algorithms on regular (structured) meshes with rectangular elements.
In this chapter, we extend the analysis of NCR to finite-element-based EM-PIC al- gorithms implemented on unstructured meshes. The influence of different mesh ele- ment shapes and mesh layouts on NCR is studied. Analytic predictions are compared against results from FE-based EM-PIC simulations of relativistic plasma beams on various mesh types.
In Chapter 6, we present a FETD Maxwell solver for the analysis of BOR ge- ometries based on discrete exterior calculus (DEC) of differential forms and TO con- cepts [55]. We explore TO principles to map the original 3-D BOR problem to a
2-D one in the meridian ρz-plane based on a Cartesian coordinate system where the cylindrical metric is fully embedded into the constitutive properties of an effec- tive inhomogeneous and anisotropic medium that fills the domain. The proposed solver uses a (TEφ, TMφ) field decomposition and an appropriate set of DEC-based basis functions on an irregular grid discretizing the meridian plane. A symplectic time discretization based on a leap-frog scheme is applied to obtain the full-discrete marching-on-time algorithm. We validate the algorithm by comparing the numeri- cal results against analytical solutions for resonant fields in cylindrical cavities and
9 against pseudo-analytical solutions for fields radiated by cylindrically symmetric an- tennas in layered media.
In Chapter 7, we present a charge-conservative EM-PIC algorithm optimized for the analysis of cylindrically-shaped VEDs, which typically employ corrugated cylin- drical or coaxial waveguides, called slow-wave structure (SWS), with an energetic electron plasma beam to produce high power microwaves [56]. Present Maxwell field solver is a specific version of the BOR-FETD scheme, viz. only accounting for only the zeroth azimuthal eigenmode, combined with the Cartesian EM-PIC algorithm. The previous advances including charge conservation, local and explicit field update, rela- tivistic extension of particle-pusher, and the BOR-FETD scheme, have made possible this work, which is motivated by the demand to accurately capture realistic physics of beam-SWS interactions in complex geometry devices. The algorithm is validated considering cylindrical cavity and space-charge-limited cylindrical diode problems.
We use the algorithm to investigate the physical performance of VEDs designed to harness particle bunching effects arising from the coherent (resonance) Cherenkov electron beam interactions within micro-machined slow wave structures.
10 Chapter 2: Local, Explicit, and Charge-conserving EM-PIC on Unstructured Mesh
In the past few decades, electromagnetic particle-in-cell (EM-PIC) algorithms coupled to time-dependent Maxwell’s equations [11, 13, 57] have been applied to a variety of problems involving charged particles and beam-wave interaction, including plasma-based accelerators [58–61], inertial confinement fusion [62], and vacuum elec- tronic devices [46,63]. Historically, EM-PIC codes have been using regular grids and
finite-difference approaches [14], such as the celebrated Yee’s finite-difference time- domain (FDTD) algorithm [64]. However, complex geometries involving curved (such as conformal cathodes and curved waveguide sections) or very fine geometrical fea- tures cannot be accurately modeled by regular grids because of ensuing ‘staircase’
(step-cell) effects [65]. Although many studies have been done to ameliorate staircase errors in finite-differences, including the use of conformal finite-differences [21, 66], heterogeneous grids [67], and subgridding [68, 69], the most general solution to this problem is to employ irregular, unstructured grids (meshes). The finite-element (FE) method is a better option in this case because it is naturally suited for such type of grids. In addition, FE also enables a greater degree of space-adaptivity (using mesh refinement techniques) in a systematic fashion and can also be applied for transient problems using FE time-domain (FETD) algorithms [27,70].
11 However, existing FE-based EM-PIC codes based on unstructured grids have three important drawbacks. First, FE-based EM-PIC algorithms tend to numerically vio- late charge conservation due to the fact that the continuity equation leaves residuals at the discrete level on unstructured grids. Past efforts to enforce charge conserva- tion have included adding a posterior correction steps by Poisson’s solvers [14] or pseudo-currents [71]. However, the former approach requires a time-consuming linear solver at each time step and the latter introduces a diffusion parameter that may alter the physics. A recent charge-conserving PIC algorithm based on second-order vector wave equation for the electric field that does not require introduction of correction terms is described in [72,73]. However, the solution space of the second-order vector wave equation in the time-domain includes spurious solutions with secular growth of the form t∇φ, which are not physical admissible solutions to Maxwell’s equations and can pollute the numerical results [1,74,75]. More recently, a novel gather-scatter algorithm with exact charge conservation on unstructured grids was described in [1], based on concepts borrowed from differential geometry [30,35] and discrete differential forms [28, 76]. Charge-conserving PIC algorithms were also developed under similar tenets in [77,78]. A second challenge for unstructured-grid EM-PIC algorithms is that the field solver is implicit, i.e., it requires the repeated solution of a linear system of equations sequentially at each time step [27,79]. Finally, a third challenge (shared by
FDTD-based algorithms as well) is that their performance is hindered by the global
Courant stability bounds on time steps used to advance fields and particles.
In order to overcome the second challenge noted above, a sparse inverse approx- imation (SPAI) strategy for unstructured meshes [26, 33] is incorporated here into an explicit FETD-based EM-PIC algorithm with exact charge-conserving properties
12 developed in [1]. For a given mesh, the resulting SPAI explicit solver obtains an approximation for the inverse of the FE system matrix based on (powers of) the sparsity pattern of the original FE system matrix. This is done once-and-for-all for any given mesh i.e., independently from any field excitation and particle distribution, and decoupled from the field update. The SPAI explicit solver is easily parallelizable and produces exponential convergence of the approximate inverse matrix to the ex- act inverse matrix as the density (sparsity) of the former is increased (reduced) [33].
Importantly, since sparsity is retained, the algorithm remains local [26]. The explicit and sparse nature of the resulting EM-PIC algorithm enable integration with asyn- chronous time stepping techniques [80–82] designed to overcome the third challenge indicated above. We investigate in detail here the effect of the approximate inverse on the particle dynamics by comparing particle trajectories computed with the new proposed algorithm against analytical solutions (when available) and a conventional implicit EM-PIC algorithm employing a direct LU-solver. We show that the error caused by the SPAI approximation is several order of magnitude smaller than inherent space and time discretization errors.
2.1 Explicit FETD-PIC Algorithm
A typical EM-PIC algorithm consists of four basic steps [1]: (1) field solver (con- sisting of electric and/or magnetic field updates from Maxwell’s equations), (2) gather step (fields interpolation at each particle position), (3) scatter (assigning currents to grid edges and charges to grid nodes from the particle positions and velocities), and
(4) particle acceleration and push (governed by Lorentz force and Newton’s law of
13 B update ( Faraday’s law )
Gather
Particle Acceleration & Push (Lorentz force / Newton’s law of motion)
Scatter
E update ( Ampere’s law ) Implicit
Figure 2.1: Basic steps in a EM-PIC algorithm. On unstructured meshes, conven- tional field solvers are implicit, requiring the solution of a (large) linear system at each time step.
motion). These four steps are sequentially performed at each time step, as illustrated in Fig. 2.1.
2.1.1 Mixed E − B FETD scheme
In the language of differential forms for the electromagnetic field [83], the electric
field E and the (Hodge dual of the) current density ?J are represented as 1-forms, and the magnetic flux density B is represented as a 2-form [24]. On a mesh, 1-forms and
2-forms are associated to mesh edges and facets, respectively [30, 35]. Accordingly,
14 in order to discretize Maxwell’s equations, the FETD algorithm expands E and ?J in terms of Whitney 1-forms associated with edges of the mesh, and B in terms of
Whitney 2 forms associated with faces of the mesh [1,24].
Next, using the generalized Stoke’s theorem to obtain semi-discrete equations fol- lowed by a leap-frog discretization in time (second-order symplectic time integration), the following full-discrete FETD scheme is obtained [1,33]:
n+ 1 n− 1 n [B] 2 = [B] 2 − ∆t [Dcurl] · [E] (2.1)
1 1 n+1 n T n+ 2 n+ 2 [?] · [E] = [?] · [E] + ∆t [Dcurl] · [?µ−1 ] · [B] − [J] . (2.2)
where ∆t is the time step increment, the superscript n denotes the time step index,
and [B], [E], and [J] are column vectors representing B on each face, and E and ?J
on each edge, respectively. In addition, [Dcurl] is the incidence matrix representing
the discrete exterior derivative (or, equivalently, the discrete curl operator distilled
from the metric, that is, with elements in the set {−1, 0, 1}) on the mesh [30,33], and
[?] and [?µ−1 ] are discrete Hodge (mass) matrices whose elements are given by the
volume integrals [33,76]
Z (1) (1) [?]J,j = WJ · Wj dΩ (2.3) Ω Z −1 (2) (2) [?µ−1 ]K,k = µ WK · Wk dΩ (2.4) Ω
(1) (2) where Wj , j = 1,...,N1 and Wk , k = 1,...,N2 are the vector proxies of Whitney
1- and 2-forms [30] that span the set of N1 edges and N2 faces of the mesh, respectively. T h ˜ i It can be shown that [Dcurl] = Dcurl , the incidence matrix on the dual mesh [1,30,
35,84]. Eqs. (1) and (2) constitute an implicit field solver because [?] is nondiagonal:
in order to update the electric field from eq. (2) it is necessary to solve a large linear
15 ( ) + 𝑛𝑛 1 𝑛𝑛 1 1 ( ) 1 𝑝𝑝 𝑛𝑛+ 𝑛𝑛− edge 𝔼𝔼 𝑊𝑊1 ⃗𝑟𝑟 ( ) 2 2 2 𝔹𝔹 𝑖𝑖 𝔹𝔹 𝑖𝑖 2 𝑛𝑛 𝑛𝑛 1 𝑛𝑛 𝑊𝑊𝑖𝑖 ⃗𝑟𝑟𝑝𝑝 1 3 3 𝑝𝑝 𝑛𝑛 𝔼𝔼 𝑊𝑊 ⃗𝑟𝑟 edge 𝑝𝑝 edge 𝑛𝑛 face ⃗𝑟𝑟 ⃗𝑟𝑟𝑝𝑝 ( ) 2 3 𝑖𝑖 𝑛𝑛 1 𝑛𝑛 𝔼𝔼 2 𝑊𝑊2 ⃗𝑟𝑟𝑝𝑝 (a)
node
2
1 𝑛𝑛+ 2 1 edge 𝕁𝕁 𝑛𝑛+1 𝑛𝑛 𝑛𝑛+1 ℚ 1 − ℚ 1 𝑛𝑛+1 𝑝𝑝 𝑝𝑝 edge 1 ⃗𝑟𝑟 edge ⃗𝑟𝑟
2 𝑛𝑛 3 node 𝑛𝑛 ⃗𝑟𝑟𝑝𝑝 ⃗𝑟𝑟𝑝𝑝 1 1 𝑛𝑛+ 2 node 𝕁𝕁 2 (b) 3
Figure 2.2: Charge-conserving gather and scatter steps [1]. (a) Interpolation of E and B at the position of the particle by edge-based (left) and face-based degrees of freedom contributions (right) (weighted by the Whitney functions) in the gather step. (b) Exact charge-conserving scatter scheme. The sum of the two colored areas in the left, representing the magnitude of the edge currents, is equal to the blue area in the left, representing the charge variation at node 1 during one time step.
system of equations at every time step. The explicit scheme proposed here is detailed in Chapter 2.2 below.
2.1.2 Gather-scatter and particle pusher steps
In the gather step, Whitney forms are used to determine the electric and magnetic
field values at the position of each particle, as depicted schematically in Fig. 2.2a.
n n+ 1 n− 1 Specifically, from the values of [E] on edges and [B] 2 and [B] 2 on faces, vector
16 proxies of Whitney forms are used to interpolate En(x) and Bn(x) at any ambient
point x, and in particular at the charged particles’ locations, by
N1 n X n (1) E (x, n∆t) ≡ E (x) = Ej Wj (x) (2.5) j=1
N2 X 1 n+ 1 n− 1 (2) B (x, n∆t) ≡ Bn(x) = 2 + 2 W (x) (2.6) 2 Bk Bk k k=1
1 n n n+ 2 where Ej denotes the j-th element of the column vector [E] and likewise for Bk 1 n− 2 and Bk . This is illustrated schematically in Fig. 2.2a. In the scatter step, we compute the particle current densities mapped to the edges of the mesh, i.e. to the
n+ 1 mesh-based quantity [J] 2 , for incorporation back into the field solver. We adopt here the charge-conserving scatter for unstructured grids recently proposed in [1]. By
n n+1 referring to Fig. 4.3, given the initial xp and final xp locations of a particle p with charge qp during a time step ∆t, the associated current flowing along edge 1 is written as
n+1 Z xp n qp (1) qp n n+1 n+1 n J1 = W1 (x) · dl = λ1(xp )λ2(xp ) − λ1(xp )λ2(xp ) (2.7) ∆t n ∆t xp
where λ1(x) and λ2(x) are the barycentric coordinates of point x w.r.t vertices 1 and 2
respectively (the boundary points of edge 1 in consideration). Analogous assignments
follow for the other edges of the mesh.
2.1.3 Discrete continuity equation
As demonstrated in [1], the above scatter algorithm yields exact charge conserva-
tion at the discrete level because the variation of the charge at any node of the mesh
exactly matches the total current flowing in or out of that particular node. In other
17 words, the discrete continuity equation (DCE) below holds
n+1 n h i n+ 1 [Q] − [Q] D˜ · [ ] 2 + = 0 (2.8) div J ∆t
h ˜ i where Ddiv is the incidence matrix associated to the discrete divergence operator
T n in the dual mesh, which is also equal to [Dgrad] [1, 30, 35, 84], and [Q] denotes the column vector with the charge associated to each node of the mesh1. Note that the
nodal charge at any node i us obtained from the sum of the nearby particle charges
weighted by their corresponding barycentric coordinates w.r.t. at that particular
node, that is
n X n Qi = qpλi(xp ). (2.9) p
Barycentric coordinates can be identified as Whitney 0-forms associated to a par-
(0) n n ticular node i, i.e. Wi (xp ) = λi(xp ) [30, 35]. We provide a geometrical illus- tration of (2.8) in Fig. 4.3. From eq. (2.9), the charge variation at node 1 due
n+1 n to a charged particle movement during ∆t is proportional to λ1(xp ) − λ1(xp ). This quantity is represented by the blue-colored area in Fig. 4.3. At the same
time, from eq. (2.7), the current flowing along edge 1 is associated with the factor
n n+1 n+1 n λ1(xp )λ2(xp ) − λ1(xp )λ2(xp ), which is equal to the red-colored area in Fig. 4.3. A similar factor is present for edge 2 which is indicated by the green-colored area.
From the area equivalences, it is clear that the sum of the current flow out of node 1
along edges 1 and 2 is equal to the charge variation on node 1.
The particle push step computes the Lorentz force acting on each charged parti-
cle given the (interpolated) electric and magnetic fields at the particle location and
1 h i T h i T The equivalence between D˜div and [Dgrad] , and similarly between D˜curl and [Dcurl] is up to a sign, depending on the relative orientation chosen for the primal and dual meshes [30].
18 its velocity, and applies Newton’s force law to accelerate the particle. This step is
implemented here by extending the particle push described in [1] to the relativistic
regime based on the methodology put forth in [38].
2.2 Sparse Approximate Inverse (SPAI) strategy
As noted above, a linear solve (implicit time-update) is required in (2.2) due to
n+1 the presence of [?] multiplying the unknown [E] on the l.h.s. Naively, this linear
−1 solve could be avoided by pre-multiplying both sides of (2.2) by [?] , leading to
n+1 n −1 h i n+ 1 n+ 1 ˜ 2 2 [E] = [E] + ∆t [?] · Dcurl · [?µ−1 ] · [B] − [J] . (2.10)
−1 This multiplication is, of course, wholly impractical for large problems because [?] is dense and such a direct inversion is computationally very costly and scales poorly
−1 with size. Even for relatively small problems, the fact that [?] is dense makes the algorithm non-local and unsuited for asynchronous time-update algorithms [80].
Instead, to obtain an explicit and local field update algorithm, we explore the fact
−1 that, in the continuum, not only ? but also ? is a strictly local operator [26,
−1 76, 85]. This indicates that, although dense, [?] should be well approximated
−1 by a sparse approximate inverse (SPAI), which we denote [?]a . Each column of
−1 [?]a can be obtained independently (and in parallel fashion) once a suitable sparsity
−1 pattern for [?]a is chosen. Since the sparsity pattern of [?] encodes nearest-neighbor
−1 k edge adjacency, good candidates for the sparsity pattern of [?]a are [?] for k = 1, 2,..., which would encode k-nearest neighbor adjacency among edges (with larger
k providing better accuracy but denser matrices). A parallel algorithm for computing
−1 [?]a along these lines is detailed in [33], where it is also shown that the Frobenius
19 −1 −1 norm of the difference matrix k [?]a − [?] kF has exponential convergence to zero for increasing k.
−1 Once [?]a is precomputed, the explicit and local SPAI-based field update simply writes
n+1 n −1 h i n+ 1 n+ 1 ˜ 2 2 [E] = [E] + ∆t [?]a · Dcurl · [?µ−1 ] · [B] − [J] . (2.11)
2.2.1 Discrete Gauss’ law
h ˜ i h ˜ i By premultiplying both sides of (2.11) by Ddiv ·[?]a, where Ddiv is the incidence matrix representing the discrete divergence operator on the dual grid, and using the h i ˜ ∗ identity Ddiv · [Dcurl] = 0 [30,35,84], we obtain
h i n+1 h i n h i n+ 1 ˜ ˜ ˜ 2 Ddiv · [?]a · [E] = Ddiv · [?]a · [E] + ∆t Ddiv · [J] . (2.12)
This last equation can be rearranged as
n+1 n ! h i [E] − [E] h i n+ 1 D˜ · [? ] · = − D˜ · [ ] 2 , (2.13) div a ∆t div J which, using (2.8), can be rewritten as ! h i [ ]n+1 − [ ]n [ ]n+1 − [ ]n D˜ · [? ] · E E = Q Q . (2.14) div a ∆t ∆t
Eq. (2.14) implies that residuals of the discrete Gauss’ law (DGL) at any two succes- sive time steps remain the same, in other words
h ˜ i n+1 n+1 h ˜ i n n Ddiv · [?]a · [E] − [Q] = Ddiv · [?]a · [E] − [Q] , (2.15) and by induction,
h ˜ i n n h ˜ i 0 0 Ddiv · [?]a · [E] − [Q] = Ddiv · [?]a · [E] − [Q] (2.16) | {z } | {z } resn res0
20 h ˜ i 0 0 for all n, so that if initial conditions have Ddiv · [?]a · [E] = [Q] , then the DGL is verified for all time steps.
In the next Section, we analyze the error incurred by the above SPAI approx-
imation to obtain an explicit field solver for EM-PIC simulations on unstructured
grids.
2.3 Numerical Results
In order to investigate the error caused by the SPAI-based explicit solver in EM-
PIC simulations, we consider in this Section examples involving single charged particle
trajectories, a plasma ball expansion, and an accelerated electron beam.
2.3.1 Single-particle trajectories
Typical PIC simulations comprise an ensemble of superparticles effecting a coarse-
graining of the phase-space. As such, instantaneous errors in individual particle trajec-
tories may not always be relevant when computing grid-averaged physical quantities.
Nevertheless, it is of interest to examine the secular trends on the particle trajectory
discrepancies.
We investigate the motion of a single charged particle initially positioned at the
ext ext origin in the presence of an external magnetic field Bz and electric field Ey . In this case, the exact solution can be written as [86]
ext ext vy,0 vx,0 qpEy qpEy vy,0 x (t) = cos ωct + + 2 sin ωct − 2 t + (2.17) ωc ωc mpωc mpωc ωc ext ext vy,0 vx,0 qpEy qpEy vy,0 y (t) = sin ωct − + 2 cos ωct + 2 + (2.18) ωc ωc mpωc mpωc ωc
where vx,0 and vy,0 are the initial velocity components.
21 Table 2.1: Number of elements in Meshes 1, 2, and 3
Mesh 1 Mesh 2 Mesh 3 Edge # 951 2168 6036 Face # 610 1408 3960 Node # 342 761 2077 ∆lav [m] 0.1160 0.0590 0.0300
We examine two types of single-particle trajectories. The first corresponds a pure cyclotron motion (Bz 6= 0 and Ey = 0) and the second includes a drift motion as well
−15 (Bz 6= 0 and Ey 6= 0). We assume a superparticle with qp = −1.6×10 [C] and mass
−27 8 mp = 9.1×10 [kg]. In both cases, the initial velocity is set equal to 2×10 [m/s]. We consider three unstructured meshes labeled, from coarsest to finest, as 1, 2, and 3, all of which discretize the domain Ω = {(x, y) ∈ [0, 1]2}. Table 2.1 provides information about the number of elements and other properties of the meshes considered. The parameter ∆lav indicates the average edge length, which roughly halves for each mesh index increment.
The boundaries of the solution domain are truncated using a perfectly matched layer (PML) [74, 75]. The time increment is chosen as ∆t as 10, 5, and 2.5 [ps] for meshes 1, 2, and 3, respectively, and the simulation is terminated at t = 150 [ns].
An implicit solver based on LU decomposition is used as reference. Charged particle trajectories calculated by using such LU solver are referred to standard tra- jectories. On the other hand, particle trajectories obtained by the SPAI-based explicit
field solver are designated as test trajectories. The effect of the inverse approximation error can be quantified by examining the discrepancy between standard and test tra- jectories. This discrepancy can be further compared to the discrepancy in particles’
22 𝒏𝒏 𝐑𝐑𝐑𝐑𝐑𝐑𝐭𝐭𝐭𝐭𝐭𝐭𝐭𝐭𝟏𝟏
𝒏𝒏 𝐭𝐭𝐭𝐭𝐭𝐭𝐭𝐭𝟐𝟐 𝒏𝒏 𝐑𝐑𝐑𝐑𝐑𝐑 𝐭𝐭𝐭𝐭𝐭𝐭𝐭𝐭𝟒𝟒 𝐑𝐑𝐑𝐑𝐑𝐑 𝒏𝒏 𝐑𝐑𝐑𝐑𝐑𝐑𝐭𝐭𝐭𝐭𝐭𝐭𝐭𝐭𝟑𝟑
(a)
Figure 2.3: Relative position difference (RPD) of the various test particles w.r.t. the standard particle placed at the origin, in a polar diagram where the radial distance is represented in logarithmic scale.
trajectories between that result from the LU-based solver and the analytic solution, which measures the inherent numerical (space and time) discretization error.
To quantify the error, we define the relative position difference (RPD), which is the ratio of the magnitude of the difference between the standard and test position vectors at certain time step n to the total travel length of the standard particle up to time step n, i.e.,
n n n xp,test − xp,std |d | j testj RPDn = = (2.19) testj Pn i i−1 i=1 xp,std − xp,std Lstd where RPDn is the RPD for the j-th test particle at time instant n, and xi and testj p,std xi are the standard and test particle position, respectively, at time step i. p,testj
23 Table 2.2: Convention used for particle trajectory visualization.
solver test particle number symbol used analytical sol. 1 SPAI k = 2 2 + SPAI k = 4 3 × SPAI k = 6 4
For visualization purposes, we plot the RPD in a polar graph as shown in Fig. 2.3,
with the radial coordinate represented in a logarithmic scale. The standard trajectory
points computed by the implicit LU-based solver are indicated by 4 and placed at the origin of the RPD for all times. The symbols , +, ×, and represent, in turn, the relative position of test particles’ 1, 2, 3, and 4 w.r.t. to standard trajectory points, as given by the vector dn /L . As summarized in Table 2.2, these four testj std sets of points correspond, respectively, to the exact trajectory points obtained via an analytic solution and to the trajectory points obtained using the SPAI-based explicit
field solver with k = 2, 4, and 6.
24 (a) (b) (c)
-4 -4 -4 10 10 10
space and time discretization error -8 -8 -8 10 10 10
-12 -12 -12 10 10 10 RPD [a.u.] RPD [a.u.] RPD RPD [a.u.] RPD
-16 Analytic -16 Analytic -16 Analytic 10 10 10 inverse SPAI w/ k=2 SPAI w/ k=2 SPAI w/ k=2
approx. SPAI w/ k=4 SPAI w/ k=4 SPAI w/ k=4
-20 error SPAI w/ k=6 -20 SPAI w/ k=6 -20 SPAI w/ k=6 10 10 10 2 4 6 8 10 12 14 2 4 6 8 10 12 14 2 4 6 8 10 12 14 -8 time [sec] -8 time [sec] -8 time [sec] x 10 x 10 x 10 (d) (e) (f)
Analytic Analytic Analytic SPAI w/ k=2 SPAI w/ k=2 SPAI w/ k=2 SPAI w/ k=4 SPAI w/ k=4 SPAI w/ k=4 SPAI w/ k=6 SPAI w/ k=6 SPAI w/ k=6
(g) (h) (i)
Figure 2.4: Results for a circular particle trajectory on 3 different meshes. (a) (b) (c) Particle trajectory histories. (d) (e) (f) RPDs versus time for the four test particles. (g) (h) (i) Normalized RPD bands for the four test particles.
25 Oscillatory motion
ext −3 2 ext In this case Bz = 5.085 × 10 [Wb/m ] and Ey = 0 [V/m] so that a pure
2 cyclotron motion with angular frequency ωc = 6.05 × 10 [rad/s] results. Fig. 2.4 illustrates the result of the SCP test for the circular trajectory. Figs. 2.4a, 2.4b, and 2.4c illustrate the trajectory of the SCP for Meshes 1, 2, and 3, respectively.
Figs. 2.4d, 2.4e, and 2.4f show the RPDs for four test particles on each mesh. It is seen that RPDs for the analytic test particle is very large (several orders of magnitude) compared to the RPDs of the EM-PIC simulation with SPAI-based explicit field solver for k = 2, 4, and 6. We note again that the RPD for the analytic test particle arises from space and time discretization errors, whereas the other RPDs are due solely to the inverse approximation error. Therefore, these results indicate that inverse approximation error is negligible compared to the other inherent numerical errors.
We also note, as expected, that the RPD due to the discretization error decreases as the mesh is progressively refined (curve with in Figs. 2.4d, 2.4e, and 2.4f). On the other hand, the RPD due to the inverse approximation error remains fairly constant across the different meshes
(curves with +, ×, and in Figs. 2.4d, 2.4e, and 2.4f). Examining these figures, it is also observed that the error decreases as the parameter k increases.
Fig. 2.4g, 2.4h, and 2.4i show the RPD bands normalized by the analytic test par- ticle’s RPD (i.e. setting the RPD of the analytical result to unity radius in the plot).
In all cases, the normalized RPD bands rotate around the origin (LU-decomposition implicit solution) around nearly circular orbits. Such normalized RPD bands for test particles 2, 3, and 4 become larger as mesh is refined since the space and time discretization errors decrease, as noted above.
26
-2 Analytic
10 SPAI w/ k=2 SPAI w/ k=4 Analytic SPAI w/ k=6 -6 10 SPAI w/ k=2 SPAI w/ k=4 SPAI w/ k=6 -10 10 RPD [a.u.] RPD
-14 10
-18 10 0 1 2 3 -7 time [sec] x 10 (a) (b) (c)
Figure 2.5: Results for a trajectory with drift. (a) (b) (c) Particle trajectory history. (d) (e) (f) RPDs versus time for the four test particles. (g) (h) (i) Normalized RPD bands for the four test particles.
E × B drift motion
ext −3 2 ext 3 In this case, we set Bz = 5.085×10 [Wb/m ] and Ey = −5×10 [V/m]. This add a drift motion to the trajectory of the particle, as seen in Fig. 2.5a. We consider
mesh 3 result only, for brevity. The RPD data is shown in Fig. 2.5b and Fig. 2.5c.
Similar to the pure circular trajectory case, the RPDs for different k are very small
compared to analytic RPD. It is again seen that the bands converge to the center of
the circle, which stands for the position of the standard particle, as k increases.
2.3.2 Plasma ball expansion
In the next example, we consider the simulation of an expanding plasma ball.
We consider 5 × 104 superparticles, each representing 200 electrons, initially placed uniformly within a circle of 0.5 [m] radius centered at the origin. At t = 0 positive and negative charged particles overlap, with net zero charge everywhere. Negative particles are initialized with Maxwellian distribution with thermal velocity |vth| =
0.1×c [m/s]. Positive charged are assumed with zero velocity at all times. The initial
27
] 0 2
/m -0.2
A/m] = 9,000 µ µA -0.4 𝑡𝑡 ∆𝑡𝑡
-0.6
-0.8 SPAI-PIC w/ k=2 -1 SPAI-PIC w/ k=4
radial currentradial density [ SPAI-PIC w/ k=6 -1.2 LUD-PIC radial current density [
0 1 2 3 4 5 radiusradius[m] [m]
Figure 2.6: Radial current versus radius coordinate for the expanding plasma at time step n = 9×104 using the LU-based implicit fields solver and the SPAI-based explicit field solver with k = 2, 4, and 6.
4 −3 density of particles is n ≈ 6.37 × 10 [m ] and the Debye length is λD ≈ 0.663 [m],
5 resulting on a plasma parameter Λ = 4πnλd ≈ 2.34 × 10 . The unstructured mesh used in this simulation has 1880 faces, 2884 edges, and 1005 nodes. A PML is used to truncate the solution domain. A time step increment ∆t = 5 [ps] is used, and the simulation is terminated at 10 [ns].
Fig. 2.6 shows the radial current density from the plasma expansion at t = 9 ×
103∆t as a function of the radial coordinate computed by implicit LU-based and explicit SPAI-based field solvers with k = 2, 4, and 6. The picture in the inset of
Fig. 2.6 shows a snapshot of the particle distribution at t = 9 × 103∆t. There is no discernible difference in the current density profile among the results shown in
Fig. 2.6.
28 nodal index, i nodal index, i (a) (b) (c)
Figure 2.7: (a) Normalized residuals of the discrete continuity equation for the plasma ball expansion example using different field solvers, at t = 2 × 104∆t. (b) Similar results for the discrete Gauss’ law. (c) Averaged normalized residuals for the discrete Gauss’ law versus time step index.
In order to check charge conservation, we plot in Fig. 2.7a the normalized residual
(NR) for DCE (2.8) and DGL (2.14). These residuals are evaluated for each time
1 step n + 2 or n and node i, and defined as
1 n+1 n n+ 2 Qi − Qi NRDCE = 1 + (2.20) i h i n+ 1 PN1 ˜ 2 ∆t j=1 Ddiv [J]j i,j n n Qi NRDGLi = 1 − h i (2.21) PN0 ˜ PN1 n j=1 Ddiv k=1 [?]aj,k [E]k i,j
1 n+ 2 where N0 denotes the total number of nodes in the mesh. Fig. 2.7a shows |NRDCEi | 1 n+ 2 at n=20, 000 versus the nodal index for different solvers. It is seen |NRDCEi | is fairly low, about 10−13, in all cases. The small noise above the double-precision floor 10−15
can be attributed from arithmetic round-off errors in the scatter process. Fig. 2.7b
n shows a similar plot now for |NRDGLi |, which is very close to the double-precision floor. In order to verify that residual levels of the DGL are maintained by (2.16)
n during the time-update, we also plot |NRDGLi | averaged across all nodes of the mesh,
29 n PN0 n i.e. |NRDGL|ave = i=1 NRDGLi /N0 as a function of the time step n in Fig. 2.7c. As
n seen, |NRDGL|ave has nearly constant values close to the double-precision floor, with only a very small increase due to cumulative round-off error.
2.3.3 Electron beam in a vacuum diode
In order to further verify charge conservation and stability for long-time simula- tions, we simulate next an electron beam accelerated by a vacuum diode. The domain
Ω = {(x, y) ∈ [0, 1]2} has lateral walls representing anode and cathode surfaces with potential difference set as 1.5×105 [V]. The top and bottom boundaries of the domain are truncated by a PML. The unstructured mesh has 2301 faces, 3524 edges, and 1224 nodes. The time step interval is set to ∆t = 270 [ps], and the simulation is run up to
16.2 [µs]. Each superparticle used in the simulation represents 50×106 electrons. For the thermionic emission of electrons from the cathode at the left boundary, a slow ini- tial mean velocity of 104 [m/s] is assumed for the electrons. Fig. 2.8 presents snapshots of the particle distribution and the self-field (electric) profile. Fig. 2.8a and Fig. 2.8d show the field and particle distribution for the charge-conserving EM-PIC algorithm with LU-based implicit field solver. Fig. 2.8b and Fig. 2.8e show the field and particle distribution for the charge-conserving EM-PIC algorithm with SPAI-based (k = 2) explicit field solver. Finally, Fig. 2.8c and Fig. 2.8f show the field and particle dis- tribution for an EM-PIC with LU-based implicit field solver and conventional gather step (non-charge-conserving on an unstructured grid) where edge currents are ob- tained from the straightforward projection of the instantaneous product qv, summed
(1) over all particles, onto the edge element Wj , i.e.
30 (a) (b) (c)
(d) (e) (f)
Figure 2.8: Results for the accelerated electron beam at t = 6 × 104∆t. (a) (b) Particle distribution snapshot from charge-conserving EM-PIC algorithms using an LU-based implicit solver and a SPAI-based (k = 2) explicit solver, respectively . (c) Particle distribution snapshot from a conventional (non-charge conserving on the unstructured grid) EM-PIC algorithm with an LU-based implicit solver. (d) (e) (f) Corresponding electric-field profile distributions.
1 1 1 n+ 2 X n+ 2 (1) n+ 2 Jj = qpvp · Wj xp (2.22) p
1 n+ 2 n+1 n where xp = (xp + xp )/2. In the latter case, violation of the continuity equation produced spurious bunching of the charges into strips of higher density. In addition, the self field is highly asymmetric and randomly oriented near the beam center. These
31 0 10 1 = 3,000 SPAI w/ k=2 SPAI w/ k=4 𝑡𝑡 ∆𝑡𝑡 SPAI w/ k=6 0.75 𝑐𝑐 LUD −3 m -1 10 0.5 density
0.25 average velocity/
-2 10 0 -0.5 0 0.5
𝑥𝑥 𝑚𝑚 Figure 2.9: Number density and average velocity of particles across a transversal section of the electron beam at t = 3 × 103∆t, after steady-state has been reached.
spurious effects are not present in either the implicit and explicit charge-conserving
simulations.
Fig. 2.9 shows the average particle density and the average velocity of particles
across a transverse section of the beam versus the longitudinal direction x along the beam at time step n = 3000, for the charge-conserving algorithm with LU-based implicit solver and with SPAI-based explicit solver using k = 2, 4, 6. As expected, the number density of particles monotonically decreases as the average velocity of particles increases, keeping a uniform current flow in steady-state across x. There is an excellent agreement among all these cases, indicating the robustness of the
SPAI-based explicit solver.
32 2.3.4 Electron Bernstein waves
Electron Bernstein waves are instrumental for many applications such as plasma heating, driving plasma currents, and temperature measurement diagnostics [2]. Such waves are present in over-dense plasmas otherwise inaccessible to electromagnetic
(EM) electron cyclotron waves. Because electron Bernstein wave propagation is only possible inside the magnetized warm plasma, mode conversion from EM waves in- cluding ordinary (O) or extraordinary (X) modes [2] should be performed.
Here, we analyze dispersion characteristics of electron Bernstein waves propagat- ing in the magnetized warm plasma by using the proposed FETD-PIC algorithm on irregular grids. It is shown that the use of non-charge-conserving scatter algorithms in
FETD-PIC simulations induces a spurious static (self-)field due to charge deposition on the grid and, as a result, produces more noisy spectral bands. In contrast, the proposed charge-conserving FETD-PIC solver [1,51,56] is shown to produce sharper spectral bands with less noise.
Magnetized warm plasmas can support two types of waves both propagating and polarized in a direction perpendicular to the stationary magnetic field: (i) X mode and (ii) electron Bernstein waves. In what follows, we compare the dispersion relations for the X mode and electron Bernstein waves obtained analytically and numerically by means of FETD-PIC simulations.
We assume a z-directed stationary magnetic field and electron Bernstein wave propagation along x, with the same conditions as used in [87]. Consider a magne-
20 −3 tized warm plasma with electron density ne = 2.4 × 10 [m ] and static applied magnetic field B~ = 5.13ˆz [T]. The electrons have initial random distribution over
[0.0005, 0.012] × [0, 0.000025] and Maxwellian distribution for the thermal velocity
33 Figure 2.10: Simulated ω × k dispersion diagram for the X mode propagation and for electron Bernstein waves in a magnetized warm plasma. Here ωpe is the plasma frequency and ∆x is the grid spacing, chosen uniform. The analytical results are indicated by the red dots in the diagram. Note that the use of a charge-conserving scatter step in PIC algorithm as described in [1] reduces the numerical noise and yields cleaner spectral bands in the numerically generated band diagrams. In addition, a charge-conserving scatter step mitigates the spurious DC field cause by spurious charge accumulation on the grid nodes, as observed at the bottom of the zoomed plots. Overall, a very good agreement is observed between the numerical and the analytical results.
with |~vth| = 0.07c. We set the total number of (macro-)particles in the simulation equal to 13,800, corresponding to a scaling factor of 5 × 109. The motion of ions (of mass mi) is neglected since mi/me ≈ 1, 838, where me is the electron mass. Also, we
11 11 have ωpe = 8.7 × 10 [rad/s] and ωce = 9.0 × 10 [rad/s], where ωpe is the plasma frequency [rad/s] and ωce is the gyrofrequency [rad/s]. Using the FETD-PIC simula- tion data, we perform a Fourier analysis of electric field sampled in space and time to obtain the dispersion relation ω(kx) for the X mode and for the electron Bernstein wave.
Fig. 2.10 shows the dispersion relations computed analytically and numerically.
The reference analytical result for this problem is obtained from [2]. It can be seen
34 that the X mode is dominant for small kx, but as kx increases the electron Bern- stein wave becomes dominant. The two close-in views compare results from charge- conserving and non charge-conserving EM-PIC simulations. In the latter case, a strong spurious static (self-)field is produced, which perturbs the particle trajectories and is evidenced by the noisy spectral bands. The absolute spectral resolution is affected by the time step increments employed in the EM-PIC simulation. With this in mind, we have chosen identical increments for both simulations.
2.4 Conclusion
We have developed a EM-PIC algorithm suited for unstructured grids that com- bines a local explicit field solver with a charge-conserving scatter-gather scheme. A sparse approximate inverse is pre-computed to obviate the need for a linear solver at each time step and to retain the local nature of the algorithm. Excellent agreement was verified between EM-PIC simulations utilizing the proposed field solver and a con- ventional (implicit) field solver based on a LU-solver. The explicit and local nature of the proposed EM-PIC algorithm makes it suitable for integration with asynchronous time stepping techniques as well.
35 Chapter 3: Relativitic Extension of Particle-Pusher
Particle-in-cell (PIC) algorithms [11–13, 88, 89] have been a very successful tool in many scientific and engineering applications such as electron accelerators [59, 60,
90], laser-plasma interactions [88, 91–94], astrophysics [95, 96], vacuum electronic de- vices [56, 97, 98] and semiconductor devices [99–102]. In many cases, the particles of interest are often in the relativistic regime and the relevant physical phenomena need to be described by taking into account fully relativistic effects. Relativistic PIC algorithms can be found in a variety of references [90–94,99,103–106].
In this chapter, the previously developed charge conserving FETD PIC algorithm developed in [1, 51] for time-dependent Maxwell-Vlasov equations is extended to the relativistic regime. In particular, we integrate Boris [107], Vay [38], and Higuera-
Cary [108] relativistic pushers in the conservative PIC-FETD algorithm for solving time-dependent Maxwell-Vlasov equations and provide a brief comparison among them. Several examples such as particle cyclotron motion, harmonic particle oscil- lation in the Lorentz-boosted frame, and relativistic Bernstein modes in magnetized charge-neutral (pair) plasmas are presented for validation. We adopt MKS units throughout this work.
36 3.1 Particle-pushers in the relativistic regime
In the particle update step, the particle mass is modified to account for relativistic effects such that
dr u p = p , (3.1) dt γp dup q = [E (rp, t) + vp × B (rp, t)] , (3.2) dt m0 where up = γpvp, vp is the velocity of the p-th particle, and γp is its relativistic factor
−2 2 2 defined as γp = 1 − |vp| /c . Using the central-differences to approximate the time derivatives, Eqs. (3.1) and (3.2) are discretized as
1 n+1 n n+ 2 rp − rp up = 1 , (3.3) ∆t n+ 2 γp n+ 1 n− 1 2 2 up − up q n n q n u¯p n = Ep + v¯p × Bp = Ep + × Bp , (3.4) ∆t m0 m0 γ¯p
1 where v¯p is the mean particle velocity between the n ± 2 time steps, which can also be approximated as u¯p/γ¯p with u¯p =γ ¯pv¯p.
In the non-relativistic case, γp → 1, v¯p can be chosen based on the midpoint rule, 1 1 n n n+ 2 n− 2 viz. v¯p = vp = vp + vp /2, to obtain updated phase coordinates explicitly. In this case, the (non-relativistic) Boris algorithm is typically used not only due to its computationally-efficient velocity update obtained by separating irrotational (electric) and rotational (magnetic) forces but also because of its long-term numerical stability.
The latter property essentially means that, in spite of not being symplectic, the non- relativistic Boris algorithm preserves phase-space volume such that it provides energy conservation bounded within a finite interval. Note that every symplectic integrator guarantees phase-space volume-preservation but not vice-versa. In contrast, in the
37 relativistic regime v¯p should be carefully determined to accurately model the kinetics of high-energy particles. Next, we examine in detail three different relativistic pushers proposed by Boris, Vay, and Higuera-Cary.
3.1.1 Relativistic Boris pusher
The main tenet of the relativistic Boris pusher is basically similar to the non- relativistic-Boris-pusher, viz. separation of irrotational and rotational forces [89].
Importantly, it averages v¯p as
1 1 n+ 2 n n− 2 n v up − p + v up + p ¯v = (3.5) p,B 2 q 2 2 n n where v (u) = u/ 1 + |u| /c , p = αEp , and α = q∆t/2m0. The particle velocity update in the relativistic Boris pusher follows the procedure below [89]
1 − n− 2 n uB = up + p , (3.6)
0 − − uB = uB + uB × tB, (3.7)
+ − 0 uB = uB + uB × sB, (3.8)
1 n+ 2 + n up = uB + p , (3.9)
− 0 + where uB, uB, and uB are auxiliary vectors and the subscript B refers to the Boris
n 2 n n algorithm. In addition, tB = βp /γ¯p,B, sB = 2tB/ (1 + |tB| ), and βp = αBp . The
factorγ ¯p,B is computed as
q q − 2 2 + 2 2 γ¯p,B = 1 + |uB| /c = 1 + |uB| /c , (3.10)
n and to obtain Bp , we set:
1 n+ 1 n− 1 Bn = B 2 + B 2 . (3.11) p 2 p p
38 Note that the relativistic Boris pusher has two variants: with and without correc-
tion. The relativistic Boris pusher without correction uses tB as defined above. The