arXiv:1711.10993v1 [cond-mat.mtrl-sci] 29 Nov 2017 in ihsutrn hehl nry o rtu re- activation tritium neutron low low energy, and threshold expan- tention sputtering thermal high high of point, coefficient sion, melting low high conductivity, their thermal to years, ap- promis- thanks of past very candidates, design as the ing the emerged During have is materials energy Tungsten-based safe components. and reactor clean propriate of source a as igesl-nesiil n vacancies defects, and smallest self-interstitials the of single characteristics elucidate and to properties crucial the been have calculations quest, of this principles In first models irradiation. predictive under build evolution magnetic microstructure to and fundamental as electronic such is as well defects, properties, as these proper- structure of atomic transport nature their and the de- mechanical Understanding produce their ties. will alter which that levels fects radiation high sustain will ftenme fKh-hmobtl,ie tis it i.e. power orbitals, third Kohn-Sham the of as number pre- scales the More system of cost the approaches. computational to ab-initio the respect other cisely, DFT with to scaling compared particular, size In better much and approach. a precision this offers between by balance offered good efficiency the calcula- to mechanical thanks quantum tions principles first for method Theory tional with studied be can defects few methods. a these just with clusters only otn F acltosaeuulyol oei this cubic the in reduce to done possible only still is and usually it Fortunately, atoms, are regime. hundred calculations some DFT to routine sizes system accessible the iersaigDTcluain o ag ugtnsystems Tungsten large for calculations DFT scaling Linear n ftebgcalne oad h s fFusion of use the towards challenges big the of One vrteps eae onSa K)DniyFunc- Density (KS) Kohn-Sham decades past the Over eetees hsihrn ui cln tl limits still scaling cubic inherent this Nevertheless, tpa Mohr, Stephan 2 est arx nti ae eso ht nefiieelectr finite once that, show we the paper of temperat version this finite scaling a In forlinear as developed large matrix. metals, were are to density methods that applied problems scaling straightforwardly tackle linear be recent to these the allowed as atoms, has hundred However, scali methods some cubic DFT to intrinsic size ing the system While accessible applications. the of range wide hnlresprel r required. appro are electronic an supercells the such large investigating that when for believe approaches We key novel a towards Reactors. plays Fusion which Tungsten, for of bulk number materials on the based to examples respect prototype with linearly scales that treatment aoaoyo tmcadSldSaePyis onl Unive Cornell , State Solid and Atomic of Laboratory 4,5 est ucinlTer DT a eoetequasi-stand the become has (DFT) Theory Functional Density .INTRODUCTION I. 1 acln uecmuigCne BC,C od ioa29, Girona Jordi C/ (BSC), Center Supercomputing Barcelona DT a eoetems popular most the become has (DFT) 1 acEixarch, Marc 4 nv rnbeAps NCMM L INAC-MEM, Alpes, Grenoble Univ. 3 CE,P.Luı opns2,000Breoa Spain Barcelona, 08010 23, Llu´ıs Companys Pg. ICREA, 5 E,IA-E,L INAC-MEM, CEA, 1 2,3 hs materials These . BigDFT u pt now, to up But . 1 aiiinAmsler, Maximilian Dtd oebr3,2017) 30, November (Dated: oei bet xli hslclt opoieacomputatio a provide to locality this exploit to able is code O ( N 3 ). i,F300Geol,France Grenoble, F-38000 Sim, basis n prahsle ntelclt ftedniymatrix density the scal- of linear locality that the the of time F in foundation computation lies The a approaches large. to ing as leads twice system only a is in atoms of opeiyo F aclto yuigs-aldlinear so-called using by approaches scaling calculation DFT of complexity httepoete ftedniymti tapoint a at matrix density the of properties the that edol npoints on only pend nedi a esonta h arxelements matrix the that shown be can it indeed ea xoetal ihtedistance the with exponentially decay aosadmtl tfiietemperatures finite at metals and lators e ovrosDTcdsta xii uhalinear a such exhibit that instance steadily codes for the as DFT algorithm, with has scaling various Together this to supercomputers, 7. today’s led and of 6 capacity Refs. increasing calcu- in DFT found scaling be linear perform lations to approaches of ety size. system with linearly a scales reach that eventually quan- algorithm to a allows DFT this describe system, completely mechanical to tum enough is matrix density quest etto ae nadrc reeeg minimization workload computational sys- energy the metallic reducing for thereby free tems, calculations direct perform to a allows technique on based mentation lso eue cln ehd o easi h lit- exam- the few in a metals are for there methods Still scaling metal- erature reduced for of missing. demonstrations ples are abundant exhibiting systems and systems to lic gap, applied finite only a usually are DFT ing can calculations 45. scale Ref. large in for found methods be and codes ture or 2 ( i,F300Geol,France Grenoble, F-38000 Sim, ev .Mantsinen, J. Mervi hs hoeia onain aeldt ag vari- large a to led have foundations theoretical These oee,teevrosipeettoso ierscal- linear of implementations various these However, BigDFT r , r tutr fsc aeil,i particular in materials, such of structure ′ go F a o ogtm limiting time long a for was DFT of ng .Tes-aldnasgtdesprinciple nearsightedness so-called The ). 35–37 14–30 46 tm famtli ytm eprovide We system. metallic a of atoms r snee oesr oaiyo the of locality ensure to needed is ure ihrsetto respect With . st,Ihc,NwYr 45,USA 14853, York New Ithaca, rsity, rgeswt epc olna scal- linear to respect with progress c ih epi pnn h path the opening in help might ach nuaos hycno,i general, in cannot, they insulators, oei nigsf n long-lasting and safe finding in role SIESTA , nctmeauei mlyd the employed, is temperature onic noeve vrlna cln ehd can methods scaling linear over overview an ; 42–44 r o biii iuain o a for simulations ab-initio for ard ymn reso magnitudes. of orders many by r noeve fpplreetoi struc- electronic popular of overview an ; 83 acln,Spain Barcelona, 08034 6,7 r enn htduln h number the doubling that meaning , ′ 38 sn notmzdlocal optimized an using nalclzdrgo around region localized a in ,3 1, , † Quickstep n ug Genovese Luigi and ONETEP ONETEP 39 , ∗ eetimple- recent a , | r OPENMX nal 9–13 − r ic the Since . 31–34 ′ ,5 4, | o insu- for 8 F , r states ( 47 Con- and , r r 40,41 In . , de- r ′ ) 2

OPENMX, there is an implementation of a divide-and- namely the density matrix, is represented in a separable conquer approach within a Krylov subspace that allows way via a set of localized and adaptive basis functions to perform linear scaling calculations for metals; how- {φα(r)}, from now on also called “support functions”, as ever the method seems to require a careful tuning before 41 ′ αβ ′ it can be applied to large systems . Furthermore it has F (r, r )= φα(r)K φα(r ) , (1) been shown that the approach by Suryanarayana et. al.48, Xα,β which calculates the electronic charge density and the to- K tal energy directly by performing Gauss quadratures over where the matrix denotes the “density kernel”. From this expression we can easily get the electronic charge the spectrum of the Hamiltonian and in this way is capa- ρ r F r, r ble to reach a linear complexity, also works for metallic density as ( ) = ( ) and use it for the construction of the Kohn-Sham Hamiltonian, systems49. These pioneering examples demonstrate that, even if 1 H[ρ]= − ∇2 + V [ρ]+ V , (2) their application is possible, reduced scaling approaches 2 KS P SP for large metallic systems have to be considered under a ′ ρ(r ) r′ different perspective. As for such systems the number of where the Kohn-Sham potential VKS[ρ] = |r−r′| d + degrees of freedom is very large and the electronic struc- VXC[ρ] contains the Hartree and exchange-correlationR 50 ture is complicated, first-principles calculation become a potential, and VP SP denotes the that useful tool only when complementing other approaches, is used to describe the union of the nuclei and core elec- like for instance force fields, which are unable to provide trons. This Hamiltonian operator gives rise to the Hamil- quantum-mechanical information. When such kind of in- tonian matrix H, defined as formation is required, like for example when studying the arrangement of close to a defective region, in- Hαβ = φα(r)H(r)φβ (r)dr , (3) vestigation techniques like the ones presented above are Z or utmost importance. which can then be used to determine a new density ker- BigDFT In this paper we report on the capabilities of nel; methods to do so will be discussed later. to perform reduced and eventually also linear scaling In order to obtain a linear scaling behavior, it is calculations for a metallic system at finite temperature. necessary to employ a set of localized support func- Whereas previous publications have highlighted in de- tions that eventually lead to sparse matrices. What tail the accuracy, efficiency and linear scaling capabili- distinguishes BigDFT from other similar approaches ties of this code for systems with a finite HOMO-LUMO 43,44 is the special set of localized support functions that gap , metals have so far not been considered. In it uses. These are expanded in an underlying basis this paper we demonstrate that the basic algorithm of set of Daubechies wavelets51 and are optimized in-situ. BigDFT remains stable also for systems with vanishing Daubechies offer the outstanding property of be- gap and thus allows to routinely perform accurate linear ing at the same time orthogonal, systematic and exhibit- scaling calculation for large metallic systems without the ing compact support. The in-situ optimization, together need of additional adjustments. As specific example we with an imposed approximative orthonormality, results have chosen Tungsten due to its relevance for finding safe in a set of quasi-systematic support functions offering a and long-lasting materials for fusion reactors. very high precision. This allows to work with a mini- The outline of this paper is as follows: In Sec. II we mal set of support functions, meaning that only very few focus on the theoretical background, with Sec. II A sum- functions per atom are necessary to obtain a very high marizing the principles of the linear scaling version of accuracy. Obviously, this in-situ optimization comes at BigDFT BigDFT , and Sec. IIB discussing how can some cost compared to an approach working with a fixed mitigate the challenges arising for metallic systems. In set of non-optimized support functions. However, in the Sec. III we then present numerical results, with Sec. IIIA latter case we would require a much larger set to obtain demonstrating the precision that we obtain with the lin- the same precision52, and all matrix operations in the BigDFT ear scaling version of and Sec. IIIB showing subspace of the support functions, whose scaling is cu- various performance figures. bic in the worst case, would become considerably more costly. Apart from that, the use of a minimal basis set also has additional advantages, as for instance an easy II. THEORY and accurate fragment identification and associated pop- ulation analysis for large systems52, which can be used A. Overview of the algorithm for a reliable effective electrostatic embedding53.

The detailed implementation of the linear scaling algo- rithm of BigDFT has been presented in detail in Refs. 43 B. Advantages of BigDFT for metallic systems and 44. Here we will give a brief overview over the most important concepts. DFT calculations for metallic systems are a challeng- The central quantity on which the algorithm is based, ing task. Due to the non-zero density of states at the 3

1000 1 Fermi energy, the occupation of the eigenstates around density of states that energy value can easily jump between occupied and 900 Fermi function empty during the self-consistency cycle, leading to a phe- 800 0.8 nomenon called “charge sloshing”. A solution to this problem is to introduce a finite electronic temperature, 700 leading to the grand-canonical extension of DFT as de- 600 0.6 54 rived by Mermin . In such a setup, the occupations 500 are smoothed out around the Fermi level and the self- consistency cycle becomes more stable. 400 0.4 In the context of linear scaling approaches, the intro- 300 duction of a finite temperature has the additional advan- 200 0.2 tage that it intensifies the decay properties of the density density of states (arbitrary units) matrix, as mentioned in Sec. I, and thus justifies the ex- 100 ploitation of the nearsightedness principle. Nevertheless, 0 0 linear scaling calculations for metals remain very chal- -1 -0.5 0 0.5 1 lenging. First of all, the used electronic temperatures energy (Hartree) must not be too large — otherwise one would change the physics in a too drastic way — and thus the density FIG. 1. Density of states for the 11x11x11 supercell of bcc Tungsten, containing 2662 atoms. The spectral width is very matrix decays much slower compared to finite gap sys- small, which is a direct consequence of the special properties tems. Moreover, the vanishing gap complicates the calcu- of the support functions used by BigDFT. lation of the density kernel. In BigDFT, we use for this task the CheSS library55 — one of the building-blocks BigDFT 10000 of the program suite — that offers several dif- AO with confinement ferent solvers. In the Fermi Operator Expansion (FOE) AO without confinement method14,15, which is the linear scaling solver available within CheSS, one has to approximate the function that assigns the occupation numbers — typically the Fermi function — with Chebyshev polynomials. Obviously this 1000 method is most efficient if the degree of the polynomial expansion is small. This is the case if first the spectral width of the involved matrices is very small, and second if polynomial degree the Fermi function that must be approximated with the polynomials is smooth. Unfortunately the latter condi- tion is violated for metals, since — even when using a 100 small finite temperature — the Fermi function that must be approximated exhibits a sharp drop at the Fermi en- 0.001 0.01 ergy and we thus have to approximate a rather step-like kT (atomic units) function. As a consequence, it is questionable whether the FOE method can be used in practice for calculations FIG. 2. Polynomial degree used by the FOE method within with metals. CheSS as a function of the electronic temperature, for two set of atomic orbitals (AO). The set that was obtained by solving Fortunately it turns out that the special properties of the atomic Schr¨odinger equation with a confining potential the support functions used by BigDFT lead to such a leads to considerably lower values due to its smaller spectral small spectral width that FOE can still be used for metal- width. lic systems. In Fig. 1 we show the density of states for the 11x11x11 supercell of body centered cubic (bcc) Tung- sten, containing 2662 atoms. As can be seen, the spec- in Ref. 43, a confining potential is used in order to prop- tral width is even smaller than the default [−1, 1] interval erly localize the support functions during the optimiza- for the Chebyshev polynomials. In this way the polyno- tion, and this confinement also seems to help in reducing mial degree required to accurately represent the Fermi the spectral width. In Fig. 2 we show the polynomial function can be kept reasonably small even for metallic degree used by CheSS as function of the temperature systems. for two sets of atomic orbitals. Both were obtained by This small spectral width is a direct consequence of solving the Schr¨odinger equation for the isolated atom two concurrent elements. First of all, the usage of PSP within the pseudopotential approach, but in one case we helps by eliminating the need for the treatments of core additionally added the confining potential. In this latter electrons whilst smoothening the behavior of the valence setup, the resulting set of atomic orbitals exhibits a much KS orbitals close to the nuclei. The second important smaller spectral width (36.7eV) compared to the case point is the special way in which BigDFT optimizes without confinement (186.4 eV), leading to much smaller the support functions. As is explained in more detail polynomial degrees. 4

-208.30 Nevertheless it is important to note that FOE, BigDFT linear DIAG BigDFT linear FOE even though it can be used very efficiently within BigDFT cubic 40 Abinit cubic BigDFT, is an O(N) method designed for very large -208.35 systems. For intermediate system size, alternative 20 solvers within CheSS, such as diagonalization using LA- -208.40 PACK56/ScaLAPACK57 or PEXSI58, might thus be more efficient. The first method does not exploit the -208.45 0 sparsity of the matrices and thus exhibits a cubic scal- 3/2 pressure (GPa)

ing, whereas the second one scales as O(N), O(N ) and energy per atom (eV) -208.50 O(N 2) for one-, two- and three-dimensional systems, re- -20 spectively. Still, FOE is the method of choice in the limit -208.55 of very large systems since it is the only solver which -40 scales strictly linearly with system size. -208.60 30 31 32 33 34 35 36 37 38 volume per minimal cell (Angstroem3) III. TESTS AND CONSIDERATIONS FIG. 3. Plots of the energy (left axis) and the pressure (right axis) as a function of the cell volume, for the four se- In order to demonstrate the accuracy and performance tups described in Sec. III A 1. The linear scaling version of of the linear scaling version of BigDFT for metallic sys- BigDFT yields results that are consistent with those of the tems, we focus on one specific system, namely bcc Tung- two traditional cubic scaling approaches. sten. All runs were performed using a grid spacing of at most 0.38 atomic units, the exchange-correlation part was described by the PBE functional59, and the Krack correctly described. In Fig. 3 we compare the DoS of the HGH pseudopotential60 was used. As we are interested reference calculation with the cubic version of BigDFT in systems requiring the usage of very large supercells, and the one obtained with the linear version with diago- we did not considered k-points in our calculations. Nev- nalization. As can be seen, both setups yield an identical ertheless, we still choose as test-bed for our approach a DoS for the occupied states. For the unoccupied ones, the bulk-like system that can be easily simulated via k-points linear version of BigDFT shows some deviations. How- and small supercells, in order to verify the accuracy of ever, this is not surprising, since the optimization of the our linear scaling approach. support functions only takes into account the occupied states, and a good accuracy can thus only be expected for the latter. However, as shown in Ref. 65 it is possible A. Accuracy of the linear scaling version to include extra states in the optimization of the support functions in case that the user is interested in low-energy 1. Energy versus volume conduction states. Overall, we see that, thanks to the in-situ optimization of the support functions, the linear As a first test we demonstrate that the linear scaling scaling version of BigDFT is able to correctly reproduce version can accurately calculate the equation of state re- the electronic structure of a metallic system. lating energy and volume. To this end, we scaled the lattice vectors of the Tungsten system by ±4% around its equilibrium value. We compare four different setups: B. Performance (1)/(2) the linear scaling version of BigDFT using a 9x9x9 supercell (containing 1458 atoms) and no k-points, 1. Scaling with system size using as solver both diagonalization (DIAG) and FOE; (3) the cubic scaling version of BigDFT using the 2- As anticipated, we expect that DFT calculations of atoms unit cell and a 9x9x9 k-point mesh; (4) the same metallic systems at large scales will be time-consuming setup as (3), but run with the Abinit code61–64. In Fig. 3 compared to similar simulations for insulators. In Fig. 5 we compare the energy-vs-volume curves for all four se- we show the total runtime as a function of the number of tups. In the same figure we also show, for all four setups, atoms in the system, going from the 4x4x4 supercell (128 the variation of the pressure as a function of the volume. atoms) up to the 12x12x12 supercell (3456 atoms). As As can bee seen from this test, the linear scaling approach can be seen, each of the three approaches that we com- correctly determines the optimal lattice parameter. pare — cubic, linear with diagonalization and linear with FOE — is characterized by a typical system size at which the method outperforms the other ones. For the small 2. Density of states systems, the cubic approach is clearly the fastest one. Above about 500 atoms, the linear approach using diag- As a second test we compare the density of states onalization becomes the method of choice, since the cu- (DoS) in order to verify that the electronic structure is bic scaling of the diagonalization exhibits a rather small 5

BigDFT cubic 100 100 BigDFT linear DIAG

EF 80 80

60 60 Percent

40 40 number of states (arbitrary units)

20 20 Unknown Flib LowLevel Communications Density Kernel Hamiltonian 0 0 -10 -5 0 5 432 686 energy (eV) 1024 1458 2000 2662 3456

FIG. 4. Density of states for the cubic scaling version of FIG. 6. Portions of the overall walltime spent in the different BigDFT and the linear scaling version with diagonalization. sections of the code for some of the FOE runs of Fig. 5. The The energies were shifted such that the Fermi energy lies at overall behavior of the code is similar over a wide range of zero. Up to the Fermi energy, the linear scaling version of system sizes, indicated by the number of atoms on the x-axis. BigDFT yields results that are consistent with those of the The FOE operations to construct the density kernel take a traditional cubic scaling approach. little less than 40% of the overall walltime.

cubic 14 cubic fit linear DIAG cubic fit ditional cubic scaling approach. However, the inspection linear FOE 12 linear fit of the overall walltime clearly shows that the application to metals is much heavier compared to insulators. Con- 10 sidering the CPU-minutes per atom, which can be used

8 as a metric to quantify the computational workload for O(N) codes44, and comparing with values obtained for 6

walltime (hours) systems such as organic of light atoms, reveals that the latter run up to two order of magnitude faster 4 (!) on the same platform.

2 Nonetheless, this behavior is not related to the absence of a gap, but is due to the the unbiased nature of the 0 0 500 1000 1500 2000 2500 3000 3500 description, which requires support functions optimized number of atoms in-situ. The reasons for this claim are explained in the following. In Fig. 6 we show the percentage of the time FIG. 5. Scaling of the total runtime as a function of the sys- spent in the different sections of the code for the FOE tem size, for the cubic approach, the linear approach with runs of Fig. 5. We see that about 40% of the time is diagonalization, and the linear approach with FOE. The cal- spent in the application of the KS Hamiltonian, 40% in culations were performed at the Γ point, and the runs were the determination of the density matrix and some 20% in performed in parallel, using 9600 cores (800 MPI tasks with 12 OpenMP threads). communications, and this over a wide range of number of atoms. All these calculations converged in about 15 iterations of the combined self-consistent optimization of the support functions and the density kernel. This fact prefactor. For system sizes beyond about 1000 atoms, shows, on the one hand, that FOE calculations of the however, this cubic scaling starts to become a dominant kernel are not necessarily a bottleneck, even when higher part, and thus the truly linear scaling approach with FOE polynomial degrees are needed, i.e. systems without a becomes the fastest option. gap can be calculated efficiently with our method. On the other hand, it will be of much interest to work with pre-optimized basis functions that exhibit a similar accu- 2. Considerations about the computational cost racy, in the same spirit as the fragment-based approach that has been employed using molecular fragments with We have demonstrated that the linear scaling version BigDFT65,66; this would provide results in only one (in- of BigDFT can offer an unbiased and unconstrained de- stead of 15) iterations and therefore lead to calculations scription of metallic systems and is thus capable of yield- running more than one order of magnitude faster. Work ing results that are of the same quality as those of a tra- is ongoing in this direction. 6

IV. CONCLUSIONS AND OUTLOOK tential of quantum-mechanical investigations of metallic systems at such sizes will only be possible if they are con- sidered as complementary investigation techniques along- In this work we have demonstrated, by applying our side other approaches at this scale. code BigDFT to the case of large Tungsten systems, that it is possible to perform accurate and efficient linear scaling DFT calculations for metals with this code. Even V. ACKNOWLEDGMENTS though the linear scaling version of BigDFT was de- signed — as most other codes — for insulating systems, We acknowledge valuable discussions with Mar´ıaJos´e the obtained performance — considering also the very Caturla and Chu-Chun Fu. S.M. acknowledges support high accuracy of the resulting description — for metal- from the MaX project, which has received funding from lic systems is excellent. We have shown that the results the European Unions Horizon 2020 Research and In- obtained with the linear scaling version of BigDFT are novation Programme under Grant Agreements 676598. of equal quality as those obtained with traditional cu- M.A. acknowledges support from the Novartis Univer- bic scaling approaches, but the reduced scaling allows to sit¨at Basel Excellence Scholarship for Life Sciences and tackle much larger systems. The crossover point between the Swiss National Science Foundation (P300P2-158407, the cubic and linear scaling treatment lies at about 500 P300P2-174475). We gratefully acknowledge the com- atoms. puting resources on Marconi-Fusion under the EURO- Thanks to these achievements, the possibility of ad- fusion project BigDFT4F, from the Swiss National Su- dressing the challenge of unbiased first-principles investi- percomputing Center in Lugano (project s700), the Ex- gation for systems with such a large degree of complexity treme Science and Engineering Discovery Environment opens up new interesting opportunities, as now more re- (XSEDE) (which is supported by National Science Foun- alistic conditions, like for instance lower concentrations dation grant number OCI-1053575), the Bridges system of supercell defects, can be considered. Nevertheless it at the Pittsburgh Supercomputing Center (PSC) (which has to be pointed out that, despite the very good per- is supported by NSF award number ACI-1445606), the formance offered by our approach, calculations like the Quest high performance computing facility at Northwest- one presented remain extremely challenging from a first- ern University, and the National Energy Research Scien- principles point of view. Therefore, exploiting the full po- tific Computing Center (DOE: DE-AC02-05CH11231).

[email protected] 9 Jacques Des Cloizeaux, “Energy bands and projection op- † [email protected] erators in a : Analytic and asymptotic properties,” 1 Owais A. Waseem and Ho Jin Ryu, “Tungsten- Phys. Rev. 135, A685–A697 (1964). based composites for nuclear fusion applications,” in 10 W. Kohn and J. R. Onffroy, “Wannier functions in a simple Nuclear Material Performance, edited by Rehab O. Abdel nonperiodic system,” Phys. Rev. B 8, 2485–2495 (1973). Rahman and Hosam El-Din Mostafa Saleh (InTech, Rijeka, 11 J. J. Rehr and W. Kohn, “Wannier functions in crystals 2016) Chap. 07. with surfaces,” Phys. Rev. B 10, 448–455 (1974). 2 D. Nguyen-Manh, A. P. Horsfield, and S. L. Dudarev, 12 S. Goedecker, “Decay properties of the “Self-interstitial atom defects in bcc transition metals: finite-temperature density matrix in metals,” Group-specific trends,” Phys. Rev. B 73, 020101 (2006). Phys. Rev. B 58, 3501–3502 (1998). 3 Lisa Ventelon, F. Willaime, Chu-Chun Fu, M. Heran, and 13 Sohrab Ismail-Beigi and T. A. Arias, “Locality of the den- I. Ginoux, “Ab initio investigation of radiation defects sity matrix in metals, semiconductors, and insulators,” in tungsten: Structure of self-interstitials and specificity Phys. Rev. Lett. 82, 2127–2130 (1999). of di-vacancies compared to other bcc transition met- 14 S. Goedecker and L. Colombo, “Efficient Linear Scal- als,” Journal of Nuclear Materials 425, 16 – 21 (2012), ing Algorithm for Tight-Binding ,” microstructure Properties of Irradiated Materials. Phys. Rev. Lett. 73, 122–125 (1994). 4 P. Hohenberg and W. Kohn, “Inhomogeneous 15 S. Goedecker and M. Teter, “Tight-binding gas,” Phys. Rev. 136, B864 (1964). electronic-structure calculations and tight-binding 5 W. Kohn and L. J. Sham, “Self-consistent equa- molecular dynamics with localized orbitals,” tions including exchange and correlation effects,” Phys. Rev. B 51, 9455–9464 (1995). Phys. Rev. 140, A1133 (1965). 16 S. Goedecker, “Low Complexity Algo- 6 Stefan Goedecker, “Linear scaling electronic structure rithms for Electronic Structure Calculations,” methods,” Rev. Mod. Phys. 71, 1085 (1999). J. Comput. Phys. 118, 261 – 268 (1995). 7 D R Bowler and T Miyazaki, “O(n) meth- 17 Otto F. Sankey, David A. Drabold, and An- ods in electronic structure calculations,” drew Gibson, “Projected random vectors and the re- Reports on Progress in Physics 75, 036503 (2012). cursion method in the electronic-structure problem,” 8 W. Kohn, “Density functional and density matrix Phys. Rev. B 50, 1376–1381 (1994). method scaling linearly with the number of atoms,” 18 Uwe Stephan and David A. Drabold, “Order-N Phys. Rev. Lett. 76, 3168–3171 (1996). projection method for first-principles computations 7

of electronic quantities and Wannier functions,” scale ab initio calculations: the conquest code,” Phys. Rev. B 57, 6391–6407 (1998). Phys. Status Solidi B 243, 989–1000 (2006). 19 Weitao Yang, “Direct calculation of elec- 37 D R Bowler and T Miyazaki, “Calculations tron density in density-functional theory,” for millions of atoms with density functional Phys. Rev. Lett. 66, 1438–1441 (1991). theory: linear scaling shows its potential.” 20 Weitao Yang, “A local projection method for the linear J. Phys.: Condens. Matter 22, 074207 (2010). combination of atomic orbital implementation of density- 38 Jos´e M Soler, Emilio Artacho, Julian D Gale, Al- functional theory,” J. Chem. Phys. 94, 1208–1214 (1991). berto Garc´ıa, Javier Junquera, Pablo Ordej´on, 21 Weitao Yang and Tai-Sung Lee, “A density- and Daniel S´anchez-Portal, “The siesta method matrix divide-and-conquer approach for elec- for ab initio order- n materials simulation,” tronic structure calculations of large molecules,” Journal of Physics: Condensed Matter 14, 2745 (2002). J. Chem. Phys. 103, 5674–5678 (1995). 39 Joost VandeVondele, Matthias Krack, Fawzi Mohamed, 22 X.-P. Li, R. W. Nunes, and David Vanderbilt, “Density- Michele Parrinello, Thomas Chassaing, and J¨urg Hutter, matrix electronic-structure method with linear system-size “Quickstep: Fast and accurate density functional calcula- scaling,” Phys. Rev. B 47, 10891–10894 (1993). tions using a mixed and plane waves approach,” 23 R. McWeeny, “Some Recent Advances in Density Matrix Comput. Phys. Commun. 167, 103–128 (2005). Theory,” Rev. Mod. Phys. 32, 335–369 (1960). 40 T. Ozaki and H. Kino, “Efficient projec- 24 Francesco Mauri, Giulia Galli, and Roberto tor expansion for the ab initio lcao method,” Car, “Orbital formulation for electronic-structure Phys. Rev. B 72, 045121 (2005). calculations with linear system-size scaling,” 41 Taisuke Ozaki, “O(n) krylov-subspace method for Phys. Rev. B 47, 9973–9976 (1993). large-scale ab initio electronic structure calculations,” 25 Pablo Ordej´on, David A. Drabold, Matthew P. Grumbach, Phys. Rev. B 74, 245101 (2006). and Richard M. Martin, “Unconstrained minimization ap- 42 Luigi Genovese, Alexey Neelov, Stefan Goedecker, Thierry proach for electronic computations that scales linearly with Deutsch, Seyed Alireza Ghasemi, Alexander Willand, system size,” Phys. Rev. B 48, 14646–14649 (1993). Damien Caliste, Oded Zilberberg, Mark Rayson, Anders 26 Pablo Ordej´on, David A. Drabold, Richard M. Mar- Bergman, and Reinhold Schneider, “Daubechies wavelets tin, and Matthew P. Grumbach, “Linear system-size as a basis set for density functional pseudopotential calcu- scaling methods for electronic-structure calculations,” lations.” J. Chem. Phys. 129, 014109 (2008). Phys. Rev. B 51, 1456–1476 (1995). 43 Stephan Mohr, Laura E. Ratcliff, Paul Boulanger, Luigi 27 Francesco Mauri and Giulia Galli, “Electronic- Genovese, Damien Caliste, Thierry Deutsch, and Stefan structure calculations and molecular-dynamics Goedecker, “Daubechies wavelets for linear scaling density simulations with linear system-size scaling,” functional theory,” J. Chem. Phys. 140, 204110 (2014), Phys. Rev. B 50, 4316–4326 (1994). http://dx.doi.org/10.1063/1.4871876. 28 Jeongnim Kim, Francesco Mauri, and Giulia Galli, “Total- 44 Stephan Mohr, Laura E. Ratcliff, Luigi Genovese, energy global optimizations using nonorthogonal localized Damien Caliste, Paul Boulanger, Stefan Goedecker, orbitals,” Phys. Rev. B 52, 1640–1648 (1995). and Thierry Deutsch, “Accurate and efficient linear 29 W. Hierse and E. B. Stechel, “Order-N methods scaling dft calculations with universal applicability,” in self-consistent density-functional calculations,” Phys. Chem. Chem. Phys. 17, 31360–31370 (2015). Phys. Rev. B 50, 17811–17819 (1994). 45 Laura E. Ratcliff, Stephan Mohr, Georg Huhs, Thierry 30 E. Hern´andez and M. J. Gillan, “Self-consistent Deutsch, Michel Masella, and Luigi Genovese, “Chal- first-principles technique with linear scaling,” lenges in large scale quantum mechanical calculations,” Phys. Rev. B 51, 10157–10160 (1995). Wiley Interdiscip. Rev.-Comput. Mol. Sci. 7, e1290 (2017), 31 Chris-Kriton Skylaris, Peter D. Haynes, Arash A. Mostofi, e1290. and Mike C. Payne, “Introducing : Linear-scaling 46 Jolyon Aarons, Misbah Sarwar, David Thompsett, density functional simulations on parallel computers,” and Chris-Kriton Skylaris, “Perspective: Meth- J. Chem. Phys. 122, 084119 (2005), http://dx.doi.org/10.1063/1.1839852ods for. large-scale density functional calculations on 32 Peter D. Haynes, Chris-Kriton Skylaris, Arash A. Mostofi, metallic systems,” J. Chem. Phys. 145, 220901 (2016), and Mike C. Payne, “Onetep: linear-scaling density- http://dx.doi.org/10.1063/1.4972007. functional theory with local orbitals and plane waves,” 47 lvaro Ruiz-Serrano and Chris-Kriton Skylaris, “A Phys. Status Solidi B 243, 2489–2499 (2006). variational method for density functional the- 33 A. A. Mostofi, P. D. Haynes, C. K. Skylaris, and M. C. ory calculations on metallic systems with thou- Payne, “Onetep: linear-scaling density-functional the- sands of atoms,” J. Chem. Phys. 139, 054107 (2013), ory with plane-waves,” Mol. Simul. 33, 551–555 (2007), http://dx.doi.org/10.1063/1.4817001. http://dx.doi.org/10.1080/08927020600932801. 48 Phanish Suryanarayana, Kaushik Bhat- 34 Chris-Kriton Skylaris, Peter D Haynes, Arash A tacharya, and Michael Ortiz, “Coarse- Mostofi, and Mike C Payne, “Recent progress graining kohnsham density functional theory,” in linear-scaling density functional calculations with Journal of the Mechanics and Physics of Solids 61, 38 – 60 (2013). plane waves and : the onetep code,” 49 Mauricio Ponga, Pilar Ariza, Michael Ortiz, and Kaushik J. Phys.: Condens. Matter 20, 064209 (2008). Bhattacharya, “Linear scaling dft for defects in metals,” in 35 D. R. Bowler, I. J. Bush, and M. J. Gillan, “Practical TMS 2014: 143rd Annual Meeting & Exhibition: Annual Meeting Supplemental Proceedings methods for ab initio calculations on thousands of atoms,” (Springer International Publishing, Cham, 2016) pp. 265– Int. J. Quantum Chem. 77, 831–842 (2000). 272. 36 D. R. Bowler, R. Choudhury, M. J. Gillan, 50 Alex Willand, Yaroslav O Kvashnin, Luigi Gen- and T. Miyazaki, “Recent progress with large- ovese, Alvaro´ V´azquez-Mayagoitia, Arpan Krishna 8

Deb, Ali Sadeghi, Thierry Deutsch, and Ste- G. Zerah, F. Jollet, M. Torrent, A. Roy, M. Mikami, Ph. fan Goedecker, “Norm-conserving pseudopotentials with Ghosez, J.-Y. Raty, and D.C. Allan, “First-principles com- chemical accuracy compared to all-electron calculations,” putation of material properties: the {ABINIT} J. Chem. Phys. 138, 104109 (2013). project,” Comput. Mater. Sci 25, 478 – 492 (2002). 51 Ingrid Daubechies, Ten lectures on wavelets (Society for 62 X Gonze, G Rignanese, M Verstraete, J Betiken, Y Pouil- Industrial and Applied Mathematics, Philadelphia, 1992). lon, R Caracas, F Jollet, M Torrent, G Zerah, M Mikami, 52 Stephan Mohr, Michel Masella, Laura E. Ratcliff, P Ghosez, M Veithen, J-Y Raty, V Olevano, F Bruneval, and Luigi Genovese, “Complexity reduction in large L Reining, R Godby, G Onida, D Hamann, and D Al- quantum systems: Fragment identification and popu- lan, “A brief introduction to the software package,” lation analysis via a local optimized minimal basis,” Z. Kristallogr. 220, 558–562 (2005). J. Chem. Theory Comput. 13, 4079–4088 (2017), pMID: 63 X. Gonze, B. Amadon, P.-M. Anglade, J.-M. Beuken, 28732165, http://dx.doi.org/10.1021/acs.jctc.7b00291. F. Bottin, P. Boulanger, F. Bruneval, D. Caliste, R. Cara- 53 Stephan Mohr, Michel Masella, Laura E. Ratcliff, and cas, M. Ct;, T. Deutsch, L. Genovese, Ph. Ghosez, M. Gi- Luigi Genovese, “Complexity reduction in large quan- antomassi, S. Goedecker, D.R. Hamann, P. Hermet, F. Jol- tum systems: Reliable electrostatic embedding for mul- let, G. Jomard, S. Leroux, M. Mancini, S. Mazevet, M.J.T. tiscale approaches via optimized minimal basis functions,” Oliveira, G. Onida, Y. Pouillon, T. Rangel, G.-M. Rig- (2017), arXiv:1703.06906. nanese, D. Sangalli, R. Shaltaf, M. Torrent, M.J. Ver- 54 N. David Mermin, “Thermal proper- straete, G. Zerah, and J.W. Zwanziger, “Abinit: First- ties of the inhomogeneous electron gas,” principles approach to material and nanosystem proper- Phys. Rev. 137, A1441–A1443 (1965). ties,” Comput. Phys. Commun. 180, 2582 – 2615 (2009). 55 Stephan Mohr, William Dawson, Michael Wagner, Damien 64 X.Gonze, F.Jollet, F.Abreu Araujo, D.Adams, B.Amadon, Caliste, Takahito Nakajima, and Luigi Genovese, “Ef- T.Applencourt, C.Audouze, J.-M.Beuken, J.Bieder, ficient computation of sparse matrix functions for large- A.Bokhanchuk, E.Bousquet, F.Bruneval, D.Caliste, scale electronic structure calculations: The chess library,” M.Cote, F.Dahm, F.Da Pieve, M.Delaveau, M.Di Gen- J. Chem. Theory Comput. 13, 4684–4698 (2017), pMID: nar ando B.Dorado, C.Espejo, G.Geneste, L.Genovese, 28873312, http://dx.doi.org/10.1021/acs.jctc.7b00348. A.Gerossier, M.Giantomassi, Y.Gillet, D.R.Hamann, 56 E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel, L.He, G.Jomard, J.Laflamme Janssen, S.Le Roux, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A.Levitt, A.Lherbier, F.Liu, I.Lukacevic, A.Martin, A. McKenney, and D. Sorensen, LAPACK Users’ Guide, C.Martins, M.J.T.Oliveira, S.Ponce, Y.Pouillon, 3rd ed. (Society for Industrial and Applied Mathematics, T.Rangel, G.-M.Rignanese, A.H.Romero, B.Rousseau, Philadelphia, PA, 1999). O.Rubel, A.A.Shukri, M.Stankovski, M.Torrent, M.J.Van 57 L. S. Blackford, J. Choi, A. Cleary, E. D’Azevedo, J. Dem- Setten, B.Van Troeye, M.J.Verstraete, D.Waroquier, mel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, J.Wiktor, B.Xue, A.Zhou, and J.W.Zwanziger, “Re- A. Petitet, K. Stanley, D. Walker, and R. C. Whaley, cent developments in the abinit software package,” ScaLAPACK Users’ Guide (Society for Industrial and Ap- Comput. Phys. Commun. 205, 106 (2016). plied Mathematics, Philadelphia, PA, 1997). 65 Laura E Ratcliff, Luigi Genovese, Stephan 58 Lin Lin, Mohan Chen, Chao Yang, and Lixin He, Mohr, and Thierry Deutsch, “Fragment ap- “Accelerating atomic orbital-based electronic structure proach to constrained density functional the- calculation via pole expansion and selected inversion,” ory calculations using Daubechies wavelets,” J. Phys.: Condens. Matter 25, 295501 (2013). The Journal of Chemical Physics 142, 234105 (2015). 59 John P. Perdew, Kieron Burke, and Matthias Ernzer- 66 Laura E. Ratcliff, Luca Grisanti, Luigi Genovese, hof, “Generalized gradient approximation made simple,” Thierry Deutsch, Tobias Neumann, Denis Danilov, Phys. Rev. Lett. 77, 3865 (1996). Wolfgang Wenzel, David Beljonne, and J´erˆome 60 M. Krack, “Pseudopotentials for h to kr optimized Cornil, “Toward fast and accurate evaluation of for gradient-corrected exchange-correlation functionals,” charge on-site energies and transfer integrals in Theoretical Chemistry Accounts 114, 145–152 (2005). supramolecular architectures using linear constrained 61 X. Gonze, J.-M. Beuken, R. Caracas, F. Detraux, density functional theory (CDFT)-based methods,” M. Fuchs, G.-M. Rignanese, L. Sindic, M. Verstraete, Journal of Chemical Theory and Computation 11, 2077–2086 (2015).