arXiv:2011.08560v1 [cond-mat.dis-nn] 17 Nov 2020 ue ls otetasto,suiso pngassys- spin-glass 40 of to studies up sizes transition, tempera- of the at tems to method close this simple Using a tures in free- than simulation. the efficiently Metropolis of more structure much rugged landscape differ- the energy the explore Thereby, can decorrelate. the replicas to through ent thus freely travel and to space val- temperature phase free-energy high deep at explore and fully leys replicas to tempera- the ex- temperature enables different low intervals procedure the at This time between attempted. certain are replicas tempera- tures After the different of at changes performed. (replicas) system are the tures simulations of Metropolis copies where of method [3–5] ensembles. (PT) broad-energy pering of term of the range in wide subsumed can be that a problem all been this tackling have developments huge There algorithmic the overcome and to barriers. states) sufficient (metastable not free-energy is minima energy local thermal the simula- in The stuck ergodicity. get dra- broken tions effectively fails landscape the free-energy it of rugged the because temperatures with in systems low for weight At matically statistical ensemble. their canonical to according figurations the task. of low- challenging the very investigation a in the phase systems temperature such renders of poly- glasses, properties folding thermodynamical spin e.g., dy- or systems, the en- physical mers problem of many This down in phase. slowing countered low-temperature massive the in from namics suffer [1] scape o rudsaesace ytm faot10 about of systems searches ground-state For n ot al 91]mto hc rcessimilarly proceeds which method [9–13] Carlo Monte ing systems of investigation landscape. free-energy the rugged the in with probably simplicity: method it its employed and makes is set which most temperature performance PT suitable good of a exhibits advantage needs only great algorithm The the [8]. sible ∗ ‡ † [email protected] [email protected] [email protected] n omnyepoe ehdi h aallTem- Parallel the is method employed commonly One con- sample to designed is [2] algorithm Metropolis The land- free-energy rugged with systems of Simulations nte eetdvlpeti h ouainAnneal- Population the is development recent Another fma on-rptmscnb civdfralltiesize lattice all for achieved be modifi can this times by round-trip that en mean show We in of histograms case. multicanonical power-law-shaped standard simulating the by modified is 1 the method, multicanonical esuytebmdlEwrsAdro pngascomparing glass spin Edwards-Anderson bimodal the study We ntttfu hoeicePyi,Universit¨at Leipzig, Physik, f¨ur Theoretische Institut .INTRODUCTION I. 3 pn aebe eotd[,7]. [6, reported been have spins ofltHsormTcnqe o pnGlasses Spin for Techniques Histogram Nonflat ai M¨uller,Fabio /k esml n aalltmeig oa prahweetee the where approach an to tempering, parallel and -ensemble ∗ Dtd oebr1,2020) 18, November (Dated: tfnSchnabel, Stefan 3 r fea- are thsbe oe ydffrn eerhr htti en- the this is that improvement suggested researchers One 1 different optimal. by not is noted semble However, been energy. same has the in with histogram it simu- energies flat the possible a method all yielding this visit probability to In up applied 20]. set been [19, is Refs. lation already rugged in has glasses with It spin systems to of too. simulation landscape, the free-energy which in simu- transitions well phase the first-order performs for with systems designed of algorithm lation established well other for computing parallel of method. because real- use any efficient disorder play, the different into allows many come izations simulating of not necessity does how- the advantage systems, this disordered For ever, massively for implementation. suitability its parallel to is main algorithm able The this not complexity. of additional advantage is the it to more 15] due remains however, cumbersome 12, optimization, [10, Its glasses PT. outperform spin for optimizing method of the attempts simple the to Despite evalua- contrast Annealing. the in Simulated observables permits thermodynamic This of equilibrium. tion simulation thermal the at temperature kept the is lowering and after replicas replicas population of the of population of resampling big intermediate a introducing by on an- gradually The performed is schedule. is system annealing nealing an the to as according down [14] cooled Annealing Simulated to ytm oee,i u mlmnain ntecs of case the in implementation, our underlying in the However, of simu- system. the independently of diminish times should round-trip lations the that the on suggest effort of region simulation this the nature concentrating of and the bottlenecks simulation the and the identifying work the automatically of that for algorithm performance in round- improved considered the The of models energy. behavior in scaling times the trip model improves Ising it ferromagnetic which the method for to The applied energy. others in among the trips is an maximize round uses to performed order method of in number The diffusivity local [22]. the of Ref. estimator in proposed was gorithm phase. low-temperature the which the histograms than towards energy grow often in descrip- more resulting this region region, out, high-energy low-energy point the authors samples the integrated tion As the of states. inverse of the density is distribution sampling the /k h utcnncl(UA ehd[61]i an- is [16–18] method (MUCA) Multicanonical The nte o-aaercotmzto fteMC al- MUCA the of optimization non-parametric Another † esml yHseb n tnhob 2] where [21], Stinchcombe and Hesselbo by -ensemble n ofadJanke Wolfhard and P 311 48 epi,Germany Leipzig, 04081 231101, IPF ae noconsideration. into taken s ainasgicn pe-pi terms in speed-up significant a cation ryisedo a itgasa in as histograms flat of instead ergy salse ehd,nml the namely methods, established ‡ nsemble 2
11 the three-dimensional (3D) bimodal Edwards-Anderson 10 5 10 (EA) spin glass [23], the round-trip times did not sys- flat PSH(E) 1/k 4 tematically improve with this method. Instead the simu- PT 10 lation got stuck for some of the considered samples, ren- 9 power law 10 3 dering a comparison to the other methods impossible. 10 ) ) E ( In this work we present a different approach: we pre- E 2 ( 10 scribe parametric profiles for the histograms of the simu- SH H E
7 P lation and adjust the simulation weights accordingly. As 10 ∆ 1 g 10 E for the three previous MUCA variants, it requires the − g 0 knowledge of the underlying density of states, but it is E 10 much more flexible. The profiles are all chosen to be 5 10 shifted power laws having two free parameters. −1000 −800 −600 −400 −200 0 As an example we consider the 3D bimodal EA spin E glass. This is one of the simplest models exhibiting a rugged free-energy landscape and is also interesting from FIG. 1. The recorded histograms H(E) of the different meth- the point of view of an optimization problem where find- ods and the profile function PSH(E) for one disorder realiza- ing ground states of hard disorder realizations is NP- tion of linear lattice size L = 8. The dotted and the dashed hard [24]. Despite the exponential growth of the compu- vertical lines indicate the position of the ground-state energy Eg and the position of the pole of the power law (5), respec- tational resources fundamental questions regarding the tively. nature of the spin-glass phase still remain. For the progress in understanding the open questions the devel- opment of new methods and an improvement of the ex- criterion with an energy dependent weight function isting methods is crucial. The rest of the paper is organized as follows. In Sec. II W (Enew) Pacc = min 1, , (2) the spin-glass model and the simulation methods are ex- W (E ) old plained. The direct comparison of the round-trip times of the individual methods is performed in Sec. III. The where the weight function is proportional to the inverse of the density of states Ω(E), framework of extreme-value statistics is introduced in Sec. IV. In Sec. V benchmarks for the global comparison W (E) ∝ Ω−1(E). (3) are discussed and the different methods are compared in terms of those benchmarks. The results are summarized For the MUCA simulations Ω(E) has to be sufficiently in Sec. VI. well-known a priori for each disorder realization. An es- timator for it can, for instance, be obtained by means of the Wang-Landau algorithm [25] or, as in this work, by II. MODEL AND EMPLOYED METHODS other iterative procedures which are explained, e.g., in Ref. [26]. This ensemble produces histograms which are We take into consideration the 3D bimodal EA model flat in energy and is, therefore, often also referred to as whose Hamiltonian takes the form “flat histogram method”. A straightforward generalization of the flat histogram H = − Jij SiSj, (1) method are the nonflat histogram methods. If the sim- Xhiji ulation weights for the flat MUCA method are multi- plied with the desired energy dependent shape (or profile) where the bonds Jij and the spins Si can take values ±1. function PSH(E) The sum runs over all neighboring spins in the simple- −1 cubic lattice with periodic boundary conditions. W (E) ∝ Ω (E)PSH(E), (4) Due to the disordered nature of spin glasses the study has to take into account a sufficiently large set of disor- the resulting histograms will be shaped according to der realizations on which the averaged quantities can be PSH(E). In this work all the profiles are shifted power computed. In this case one disorder realization consists laws of the form α of a set of 3V couplings Jij which are either positive or E 3 P (E, ∆E, α)= +1 , (5) negative unity with a probability of 50%, where V = L SH ∆E − E is number of spins in a lattice of linear lattice size L. The g disorder realizations are generated prior to the simulation where the exponent α < 0 and ∆E > 0 is the position and then kept fixed for all times (quenched disorder). As of the pole relative to the ground-state energy Eg of the an adequate set of disorder realizations 4000 samples with respective spin-glass realization. In this parametrization L = 3 and L = 4 are generated and 5000, 6000, and 4000 the power laws are normalized to unity at E = 0. samples of size L =5, 6, and 8, respectively. In Fig. 1 the recorded histograms of the different meth- The method which we adapted is the well-established ods are displayed on a logarithmic y-scale for one disor- MUCA method [17] employing a generalized Metropolis der realization with L = 8. In contrast to flat MUCA all 3 methods have in common that the distribution of sam- ergodicity and apply it to spin glasses and the traveling pled states grows towards the ground-state energy. The salesman problem [27]. recorded histogram of nonflat MUCA matches perfectly Since for the above mentioned methods the density of the imposed profile and its histogram in the ground-state states is the only needed input it was determined only region is similar to that of PT. We are convinced that this once to high accuracy employing the iterative procedure feature which among the existing methods is strongest adapted from Ref. [26] but with power-law shaped distri- for PT enhances the ability of sampling the low-energy butions in energy. In this case, and generally when the region and especially the ability of finding low-energy ground-state energy of the system is not known, a priori states of investigated systems. There are different possi- the profile function has to be adapted whenever a lower ble choices of functional forms which enhance the sam- energy is found. pling of the low-energy region and even stepwise defined Lastly, the PT method being probably the most em- function could be employed and might even yield better ployed algorithm for spin-glass simulations, is included results. We chose a power law because the two involved in the comparison. The ensemble in this case is defined parameters allow for a good adaptation but the tuning of by a set of M temperatures {Ti, i =1, ..., M}. For each the parameters in the two-dimensional parameter space temperature Ti a Metropolis simulation of a copy of the remains feasible. system (replica) is performed. The temperatures of the For the above parametrization we found a fixed param- replicas i and j are allowed to exchange configuration eter set namely α = −3.6 and ∆E = 96 which indepen- according to dently of the lattice size yielded the shortest mean round- 1 1 ( − )(Ej −Ei) ex Tj Ti trip times, among the considered profiles. Subsequently Pij = min 1,e , (8) we will refer to the nonflat MUCA setting with the power- law shape belonging to this parameter set just as power- where Ei and Ej are the energies of replica i and j and law (PL) setting or nonflat MUCA method. While the kB = 1. This prescription allows for fast decorrelation overall best results are obtained with this parameter set, when a replica travels to high temperature and the explo- we want to point out that an improvement compared to ration of the local minima at low temperatures. Among flat MUCA was visible for each of the considered param- the vast choice of different PT protocols available [28] eter sets. The parametrization with a fixed offset from we opted for the constant exchange rate protocol with the ground-state energy yields different relative distribu- acceptance rates between 40% and 60% [29]. For all sim- tions depending on the ground-state energy encountered ulations the maximal temperature was chosen to be well in the respective disorder realization. The value of the above the critical temperature, Tmax > 3 > Tc ≈ 1. profile function at the ground-state energy is given by The exchange rates were imposed on each individual dis- order realization in an initial equilibration run during α which the temperatures were modified accordingly. The 1 P (E , ∆E, α)= . (6) number of replicas was set to M = 7, 7, 12, 14, and 20 SH g Eg 1 − ∆E ! for L = 3, 4, 5, 6, and 8, respectively. We note that the choice of the temperature set is crucial for the PT algo- The sampling at the ground-state energy compared to rithm and also provides the possibility of optimizations zero energy is thus enhanced by a factor of ≈ 13 for a as for example in Ref. [30]. However, in this work we disorder realization with L = 4 and a typical ground- rather limit ourselves to a well established protocol for state energy of ≈ −100. For a sample with L = 8 and PT focusing on the optimization of the nonflat histogram typical ground-state energy of ≈ −900 instead it is en- technique. hanced by a factor of ≈ 4500. Due to this feature this parametrization of the profile function does not require any adjustments of the parameters in the system sizes III. COMPARISON OF THE ROUND-TRIP which we considered. Presumably such a profile will also TIMES yield good results for larger systems, although we cannot be certain. The observable taken into account for this study is the Next the 1/k-ensemble [21] is considered which is de- round-trip time. For all methods except PT and each fined by setting the simulation weights equal to the in- disorder realization it is defined as the time needed by the verse of the integrated density of states up to the energy simulation to travel from the highest energy (E ≈ 0) to of the respective bin the ground-state energy and back. For PT, instead, the −1 round trip is measured between the ground-state energy E W (E) ∝ 1/k = dE′Ω(E′) . (7) and an energy typical for a canonical ensemble with a 1/k temperature well above the freezing point of the disorder Eg ! Z realization [31][32]. This time can be taken as an upper Here, a first-order Taylor expansion of lnΩ at E leads bound of the autocorrelation time of the energy of the ′ ′ to W (E) ≈ W1/k(E) if PSH(E) = d ln Ω(E )/dE |E′=E. respective disorder realization at the ground state. We This prescription again relies on the knowledge of the want to stress that the energies we refer to as ground- density of states. The authors of Ref. [21] stress its robust state energies are the lowest encountered energies and 4
4 4 4 10 L = 4 10 L = 4 10 L = 4 ) ) ) i i i ( ( 103 103 ( 103 PT PT flat τ τ
τ