
Chapter 1 Simulated Annealing

Alexander G. Nikolaev and Sheldon H. Jacobson

Abstract Simulated annealing is a well-studied local search metaheuristic used to address discrete and, to a lesser extent, continuous optimization problems. The key feature of simulated annealing is that it provides a mechanism to escape local optima by allowing hill-climbing moves (i.e., moves which worsen the objective function value) in hopes of finding a global optimum. A brief history of simulated annealing is presented, including a review of its application to discrete, continuous, and multi-objective optimization problems. Asymptotic convergence and finite-time performance theory for simulated annealing are reviewed. Other local search algorithms are discussed in terms of their relationship to simulated annealing. The chapter also presents practical guidelines for the implementation of simulated annealing in terms of cooling schedules, neighborhood functions, and appropriate applications.

1.1 Background Survey

Simulated annealing is a local search algorithm (metaheuristic) capable of escaping from local optima. Its ease of implementation, its convergence properties, and its use of hill-climbing moves to escape local optima have made it a popular technique over the past two decades. It is typically used to address discrete and, to a lesser extent, continuous optimization problems. Survey articles that provide a good overview of simulated annealing's theoretical development and domains of application include [46, 55, 75, 90, 120, 144].

Alexander G. Nikolaev
Industrial and Systems Engineering, University at Buffalo, Buffalo, NY 14260-2050
e-mail: [email protected]

Sheldon H. Jacobson
Department of Computer Science, University of Illinois, Urbana, IL 61801-2302, USA
e-mail: [email protected]

Aarts and Korst [1] and van Laarhoven and Aarts [155] devote entire books to the subject. Aarts and Lenstra [2] dedicate a chapter to simulated annealing in their book on local search algorithms for discrete optimization problems.

1.1.1 History and Motivation

Simulated annealing is so named because of its analogy to the process of physical annealing with solids, in which a crystalline solid is heated and then allowed to cool very slowly until it achieves its most regular possible crystal lattice configuration (i.e., its minimum lattice energy state), and thus is free of crystal defects. If the cooling schedule is sufficiently slow, the final configuration results in a solid with superior structural integrity. Simulated annealing establishes the connection between this type of thermodynamic behavior and the search for global minima for a discrete optimization problem. Furthermore, it provides an algorithmic means for exploiting such a connection.

At each iteration of a simulated annealing algorithm applied to a discrete optimization problem, the values for two solutions (the current solution and a newly selected solution) are compared. Improving solutions are always accepted, while a fraction of non-improving (inferior) solutions are accepted in the hope of escaping local optima in search of global optima. The probability of accepting non-improving solutions depends on a temperature parameter, which is typically non-increasing with each iteration of the algorithm.

The key algorithmic feature of simulated annealing is that it provides a means to escape local optima by allowing hill-climbing moves (i.e., moves which worsen the objective function value). As the temperature parameter is decreased to zero, hill-climbing moves occur less frequently, and the solution distribution associated with the inhomogeneous Markov chain that models the behavior of the algorithm converges to a form in which all the probability is concentrated on the set of globally optimal solutions (provided that the algorithm is convergent; otherwise the algorithm will converge to a local optimum, which may or may not be globally optimal).

1.1.2 Definition of Terms

To describe the specific features of a simulated annealing algorithm for discrete optimization problems, several definitions are needed. Let Ω be the solution space (i.e., the set of all possible solutions). Let f : Ω → ℜ be an objective function defined on the solution space. The goal is to find a global minimum, ω∗ (i.e., ω∗ ∈ Ω such that f(ω∗) ≤ f(ω) for all ω ∈ Ω). The objective function must be bounded to ensure that ω∗ exists. Define N(ω) to be the neighborhood function for ω ∈ Ω. Therefore, associated with every solution, ω ∈ Ω, are neighboring solutions, N(ω), that can be reached in a single iteration of a local search algorithm.
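As a concrete illustration of these terms, the following minimal Python sketch sets up a solution space Ω, an objective function f, and a bit-flip neighborhood function N(ω) for a toy 0-1 selection problem. The problem data (n, values, weights, capacity) and the penalty constant are illustrative assumptions, not anything prescribed by the chapter.

import itertools

n = 4
values = [4, 3, 2, 5]          # hypothetical item values
weights = [2, 1, 3, 4]         # hypothetical item weights
capacity = 6

def f(omega):
    # Objective to minimize: negative total value, with a penalty for
    # exceeding the capacity (so infeasible solutions are unattractive).
    total_w = sum(w for w, x in zip(weights, omega) if x)
    total_v = sum(v for v, x in zip(values, omega) if x)
    penalty = 100 * max(0, total_w - capacity)
    return -total_v + penalty

def N(omega):
    # Neighborhood function: all solutions reachable by flipping one bit.
    neighbors = []
    for i in range(len(omega)):
        nb = list(omega)
        nb[i] = 1 - nb[i]
        neighbors.append(tuple(nb))
    return neighbors

Omega = list(itertools.product([0, 1], repeat=n))   # full (tiny) solution space
omega_star = min(Omega, key=f)                      # global minimum, by enumeration
print(omega_star, f(omega_star), N(omega_star))

For any realistic problem Ω is far too large to enumerate, which is precisely why a local search such as simulated annealing only ever works with the current solution and its neighborhood.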

Simulated annealing starts with an initial solution ω ∈ Ω. A neighboring solution ω′ ∈ N(ω) is then generated (either randomly or using some pre-specified rule). Simulated annealing is based on the Metropolis acceptance criterion [101], which models how a thermodynamic system moves from the current solution (state) ω ∈ Ω to a candidate solution ω′ ∈ N(ω), in which the energy content is being minimized. The candidate solution, ω′, is accepted as the current solution based on the acceptance probability

\[
P\{\text{Accept } \omega' \text{ as next solution}\} =
\begin{cases}
\exp[-(f(\omega') - f(\omega))/t_k] & \text{if } f(\omega') - f(\omega) > 0,\\
1 & \text{if } f(\omega') - f(\omega) \le 0.
\end{cases}
\tag{1.1}
\]

Define t_k as the temperature parameter at (outer loop) iteration k, such that

\[
t_k > 0 \ \text{for all } k \quad \text{and} \quad \lim_{k \to \infty} t_k = 0.
\tag{1.2}
\]

This acceptance probability is the basic element of the search mechanism in simulated annealing. If the temperature is reduced sufficiently slowly, then the system can reach an equilibrium (steady state) at each iteration k. Let f(ω) and f(ω′) denote the energies (objective function values) associated with solutions ω ∈ Ω and ω′ ∈ N(ω), respectively. This equilibrium follows the Boltzmann distribution, which can be described as the probability of the system being in state ω ∈ Ω with energy f(ω) at temperature T such that

\[
P\{\text{System is in state } \omega \text{ at temperature } T\} =
\frac{\exp(-f(\omega)/t_k)}{\sum_{\omega' \in \Omega} \exp(-f(\omega')/t_k)}.
\tag{1.3}
\]
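To make Equation (1.3) concrete, the short sketch below evaluates the Boltzmann distribution on a five-solution space and prints it for a decreasing sequence of temperatures; the solution labels and objective values are illustrative assumptions. As t_k falls, the probability mass visibly concentrates on the two global minima, which is exactly the limiting behavior the convergence theory formalizes.

import math

f = {"a": 3.0, "b": 1.0, "c": 1.0, "d": 2.0, "e": 5.0}   # two global minima: b and c

def boltzmann(f, t):
    # Return P{system is in state w at temperature t} for every w (Eq. 1.3).
    weights = {w: math.exp(-fw / t) for w, fw in f.items()}
    z = sum(weights.values())
    return {w: weight / z for w, weight in weights.items()}

for t in (10.0, 1.0, 0.1, 0.01):
    dist = boltzmann(f, t)
    print(t, {w: round(p, 3) for w, p in dist.items()})
# As t decreases, the distribution approaches 0.5/0.5 on the two global minima.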

If the probability of generating a candidate solution ω′ from the neighbors of solution ω ∈ Ω is g_k(ω,ω′), where

\[
\sum_{\omega' \in N(\omega)} g_k(\omega,\omega') = 1, \quad \text{for all } \omega \in \Omega,\ k = 1,2,\ldots,
\tag{1.4}
\]

then a non-negative square matrix P_k can be defined with transition probabilities

\[
P_k(\omega,\omega') =
\begin{cases}
g_k(\omega,\omega')\exp(-\Delta_{\omega,\omega'}/t_k) & \omega' \in N(\omega),\ \omega' \neq \omega,\\
0 & \omega' \notin N(\omega),\ \omega' \neq \omega,\\
1 - \sum_{\omega'' \in N(\omega),\, \omega'' \neq \omega} P_k(\omega,\omega'') & \omega' = \omega
\end{cases}
\tag{1.5}
\]

for all solutions ω ∈ Ω and all iterations k = 1,2,..., and with Δ_{ω,ω′} ≡ f(ω′) − f(ω). These transition probabilities define a sequence of solutions generated from an inhomogeneous Markov chain [120]. Note that boldface type indicates matrix/vector notation, and all vectors are row vectors.

1.1.3 Statement of Algorithm

Simulated annealing is outlined in pseudo-code (see [46]):

Select an initial solution ω ∈ Ω
Set the temperature change counter k = 0
Select a temperature cooling schedule, t_k
Select an initial temperature T = t_0 ≥ 0
Select a repetition schedule, M_k, that defines the number of iterations executed at each temperature, t_k
Repeat
    Set repetition counter m = 0
    Repeat
        Generate a solution ω′ ∈ N(ω)
        Calculate Δ_{ω,ω′} = f(ω′) − f(ω)
        If Δ_{ω,ω′} ≤ 0, then ω ← ω′
        If Δ_{ω,ω′} > 0, then ω ← ω′ with probability exp(−Δ_{ω,ω′}/t_k)
        m ← m + 1
    Until m = M_k
    k ← k + 1
Until stopping criterion is met

This simulated annealing formulation results in M_0 + M_1 + ··· + M_k total iterations being executed, where k corresponds to the value for t_k at which some stopping criterion is met (for example, a pre-specified total number of iterations has been executed or a solution of a certain quality has been found). In addition, if M_k = 1 for all k, then the temperature changes at each iteration.
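The following Python sketch is a direct transcription of the pseudo-code above: the outer/inner loop structure, the acceptance test, and the cooling and repetition schedules mirror the statement of the algorithm. The specific objective, neighborhood, and geometric cooling rule in the usage example are illustrative assumptions and are not part of the algorithm itself.

import math
import random

def simulated_annealing(f, neighbor, omega0, t_schedule, M_schedule, K):
    # f           -- objective function to minimize
    # neighbor    -- returns a randomly generated element of N(omega)
    # omega0      -- initial solution
    # t_schedule  -- t_schedule(k) gives the temperature t_k at outer loop k
    # M_schedule  -- M_schedule(k) gives the repetition count M_k
    # K           -- stopping criterion: number of outer loop iterations
    omega = omega0
    best = omega
    for k in range(K):
        tk = t_schedule(k)
        for _ in range(M_schedule(k)):
            cand = neighbor(omega)
            delta = f(cand) - f(omega)
            if delta <= 0 or random.random() < math.exp(-delta / tk):
                omega = cand                     # hill-climbing moves allowed when delta > 0
            if f(omega) < f(best):
                best = omega                     # keep the best solution seen so far
    return best

# Illustrative use on a one-dimensional integer problem (all names are assumptions).
if __name__ == "__main__":
    f = lambda x: (x - 7) ** 2                      # global minimum at x = 7
    neighbor = lambda x: x + random.choice([-1, 1]) # N(x) = {x - 1, x + 1}
    t_schedule = lambda k: 10.0 * (0.9 ** k)        # geometric cooling (a common heuristic)
    M_schedule = lambda k: 20                       # fixed inner loop length
    print(simulated_annealing(f, neighbor, 50, t_schedule, M_schedule, K=100))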

1.1.4 Discrete Versus Continuous Problems

The majority of the theoretical developments and application work with simulated annealing has been for discrete optimization problems. However, simulated annealing has also been used as a tool to address problems in the continuous domain. There is considerable interest in using simulated annealing for global optimization over regions containing several local and global minima (due to an inherent non-linearity of objective functions). Fabian [48] studies the performance of simulated annealing methods for finding a global minimum of a given objective function. Bohachevsky et al. [15] propose a generalized simulated annealing algorithm for function optimization for use in statistical applications, and Locatelli [96] presents a proof of convergence for the algorithm. Optimization of continuous functions involves finding a candidate solution by picking a direction from the current (incumbent) solution and a step size to take in this direction, and evaluating the function at the new (candidate) location. If the function value of this candidate location is an improvement over the function value of the incumbent location, then the candidate becomes the incumbent. This migration through local minima in search of a global minimum continues until the global minimum is found or some termination criteria are reached.

Belisle [12] presents a special simulated annealing algorithm for global optimization, which uses a heuristically motivated cooling schedule. This algorithm is easy to implement and provides a reasonable alternative to existing methods. Belisle et al. [13] discuss convergence properties of simulated annealing algorithms applied to continuous functions and apply these results to hit-and-run algorithms used in global optimization. The presented convergence properties are consistent with those presented in [72] and provide a good contrast between convergence in probability and (the stronger) almost sure convergence. This work is further extended in [166] to an improved hit-and-run algorithm used for global optimization.

Fleischer and Jacobson [57] propose cybernetic optimization by simulated annealing as a method of parallel processing that accelerates the convergence of simulated annealing to the global optima. This theory is extended by Fleischer [56] into the continuous domain by applying probabilistic feedback control to the generation of candidate solutions. The probabilistic feedback control method of generating candidate solutions effectively accelerates convergence to a global optimum using parallel simulated annealing on continuous variable problems.

Locatelli [94] presents convergence properties for a class of simulated annealing algorithms for continuous global optimization by removing the restriction that the next candidate point must be generated according to a probability distribution whose support is the whole feasible set. A study on simulated annealing algorithms for globally minimizing functions of multiple continuous variables is conducted by Siarry et al. [131]. The study focuses on how high dimensionality can be addressed using variable discretization and addresses the design and implementation issues for several complementary stopping criteria. Convergence results and criteria for simulated annealing applied to continuous global optimization problems are also provided in [163] and [95]. A general-purpose simulated annealing algorithm that solves mixed integer linear programs is introduced by Kiatsupaibul and Smith [88].
The simulated annealing algorithm is constructed using a Markov chain sampling algorithm to generate uniformly distributed points on an arbitrary bounded region of a high-dimensional integer lattice. They show that the algorithm converges in probability to a global optimum. Romeijn et al. [119] study a simulated annealing algorithm that uses a reflection generator for mixed integer/continuous global optimization problems. Locatelli [96] establishes the convergence of simulated annealing algorithms for continuous global optimization and an upper bound for the expected first hitting time, i.e., the expected number of iterations before reaching the global optimum value within accuracy ε.
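The candidate-generation scheme described above (pick a direction from the incumbent, take a step of a given size, and evaluate the new point) can be sketched as follows. This is a generic illustration under assumed choices for the step size, the geometric temperature decay, and the test function; it is not any one of the published algorithms cited above.

import math
import random

def random_direction(dim):
    # Uniform random direction on the unit sphere.
    v = [random.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def continuous_sa(f, x0, t0=1.0, alpha=0.95, step=0.5, iters=2000):
    x, t = list(x0), t0
    for _ in range(iters):
        d = random_direction(len(x))
        cand = [xi + step * di for xi, di in zip(x, d)]   # step from the incumbent
        delta = f(cand) - f(x)
        if delta <= 0 or random.random() < math.exp(-delta / t):
            x = cand
        t = max(alpha * t, 1e-8)   # keep the temperature strictly positive
    return x

# Example: a multimodal two-dimensional test function (an assumption).
f = lambda p: p[0] ** 2 + p[1] ** 2 + 10 * math.sin(p[0]) ** 2
print(continuous_sa(f, [4.0, -3.0]))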

1.1.5 Single-objective Versus Multi-objective Problems

Originally used as an optimization tool for combinatorial optimization problems, simulated annealing has recently been adapted to address multi-objective problems

(see [144]). Its framework is easy to implement and simulated annealing-based algorithms are capable of producing a Pareto set of solutions in a single run with very little computational cost. Additionally, its performance is not influenced by the shape of the Pareto set, which is a concern for mathematical programming techniques.

The first multi-objective version of simulated annealing was proposed by Serafini [128, 129]. The method closely follows the guidelines of regular single-objective simulated annealing and uses a modification of the solution acceptance criteria in the original algorithm. Various alternative criteria have been investigated, with the objective to increase the probability of accepting non-dominated solutions. A special selection rule produced by the combination of several criteria has been proposed in order to concentrate the search almost exclusively on the non-dominated solutions. This idea has also been used by Ulungu and Teghem [152] and Serafini [130], with the latter utilizing a target-vector approach to solve a bi-objective optimization problem. Ulungu et al. [154] propose a complete MOSA algorithm, where a weighted aggregating function is used to evaluate and compare the obtained solutions. The MOSA algorithm works with only one current solution but keeps a record of the population of non-dominated solutions found during the search. A further improved, interactive version of MOSA is presented in [153] and is referred to as the UMOSA method.

Suppapitnarm and Parks [145] propose a different simulated annealing-based approach to tackle multi-objective problems (the SMOSA method). At each iteration, the algorithm searches based on one solution, and the annealing process adjusts the temperature adaptively, using the objective function value of the obtained solution in each of the multiple objectives. An archive is created to store all the non-dominated solutions. The idea of introducing a “new-acceptance” probability formulation based on an annealing schedule with multiple temperatures (one for each objective) has also been proposed. The acceptance probability of a “new solution” depends on whether or not it is added to the set of potentially Pareto-optimal solutions. If it is added to the Pareto set, then it is accepted as the current solution with probability equal to 1. Otherwise, a multi-objective acceptance rule is used.

Czyzak et al. [36] and Czyzak and Jaszkiewicz [37] propose another way to adapt simulated annealing to a multi-objective context, which combines the ideas of unicriterion simulated annealing and genetic algorithms to provide efficient solutions for a multi-criteria shortest path problem. A classical neighborhood search has been used to explore the population of solutions, with the weight for each objective function adjusted in each iteration. This technique increases the probability of escaping local optima, in a way similar to other multi-objective metaheuristics.

Suman [141–143] proposes different simulated annealing-based approaches to tackle constrained multi-objective optimization problems. In [142], a comparative analysis of five simulated annealing algorithms is conducted for the system reliability optimization problem. The goal of these methods is to generate a set of solutions that are a good approximation to the entire set of efficient (non-dominated or Pareto-optimal) solutions over a fixed time period.

Villalobos-Arias et al. [159, 160] study asymptotic convergence of simulated annealing algorithms for multi-objective optimization problems in comparison with other algorithms such as an Artificial Immune System and a General Evolutionary Algorithm. Tekinalp and Karsli [146] present a simulated annealing algorithm for continuous multi-objective optimization that has an adaptive cooling schedule and uses a population of fitness functions to accurately generate the Pareto front. Whenever an improvement with a fitness function is encountered, the trial point is accepted and the temperature parameters associated with the improving fitness functions are cooled down. In addition to well-known linear fitness functions, special elliptic and ellipsoidal fitness functions, suitable for the generation of non-convex fronts, are used.
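Several of the approaches above maintain an archive of the non-dominated solutions encountered during the search. The sketch below shows the generic archive bookkeeping involved (a Pareto dominance test plus an archive update); it illustrates the common idea only, not any specific published MOSA variant, and the sample objective vectors are assumptions.

def dominates(a, b):
    # True if objective vector a Pareto-dominates b (minimization of every objective).
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def update_archive(archive, candidate):
    # Try to add `candidate` (an objective vector) to the archive of
    # non-dominated solutions; report whether it was itself non-dominated.
    if any(dominates(kept, candidate) for kept in archive):
        return archive, False
    archive = [kept for kept in archive if not dominates(candidate, kept)]
    archive.append(candidate)
    return archive, True

archive = []
for point in [(3, 5), (4, 4), (2, 6), (3, 3), (5, 1)]:
    archive, added = update_archive(archive, point)
print(archive)   # the mutually non-dominated points

In an archive-based multi-objective simulated annealing run, a candidate that enters the archive is typically accepted as the current solution outright, while dominated candidates fall back to a temperature-dependent acceptance rule.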

1.2 Convergence Results

1.2.1 Asymptotic Performance

Asymptotic convergence results for simulated annealing have typically taken one of two directions: the algorithm has been modeled either as a sequence of homogeneous Markov chains or as a single inhomogeneous Markov chain.

1.2.1.1 Homogeneous Markov Chain Approach

The homogeneous Markov chain approach [3, 49, 70, 84, 85, 97, 104, 123] assumes that each temperature t_k is held constant for a sufficient number of iterations m such that the stochastic matrix P_k can reach its stationary (steady-state) distribution π_k. Note that in the interest of simplifying notation, the inner loop index m is suppressed. However, the index k should be interpreted as the double index k,m, where a sequence of m = 1,2,...,M_k simulated annealing iterations occur for each fixed k. The existence of a stationary distribution at each iteration k follows from Theorem 1. (Note: To ensure that Theorem 1 is consistent with the simulated annealing algorithm depicted in Section 1.1.3, without loss of generality, let t_k be a function only of each outer loop iteration k, and let the respective number of inner loop iterations M_k and outer loop iterations k each approach infinity.)

Theorem 1 Let P_k(ω,ω′) be the probability of moving from solution ω to solution ω′ in one inner iteration at outer loop k, and let P_k^{(m)}(ω,ω′) be the probability of going from solution ω to solution ω′ in m inner loops. If the Markov chain associated with P_k^{(m)}(ω,ω′) is irreducible and aperiodic with finitely many solutions, then lim_{m→∞} P_k^{(m)}(ω,ω′) = π_k(ω′) exists for all ω,ω′ ∈ Ω and iterations k. Furthermore, π_k(ω′) is the unique strictly positive solution of

\[
\pi_k(\omega') = \sum_{\omega \in \Omega} \pi_k(\omega) P_k(\omega,\omega'), \quad \text{for all } \omega' \in \Omega,
\tag{1.6}
\]

and

\[
\sum_{\omega \in \Omega} \pi_k(\omega) = 1.
\tag{1.7}
\]

Proof See [33].

The key requirements for the existence of the stationary distributions and for the convergence of the sequence of π_k vectors include the following:

1. transition matrix irreducibility (for every finite outer loop k, the transition matrix can assign a path of non-zero probability between any two solutions ω,ω′ ∈ Ω),
2. aperiodicity (starting at solution ω′ ∈ Ω, it is possible to return to ω′ with period 1; see [78]),
3. a non-zero stationary probability distribution, as the number of outer loops k approaches infinity.

Note that all simulated annealing proofs of convergence in the literature based on homogeneous Markov chain theory, either explicitly or implicitly, use the sufficient condition of reversibility (also called detailed balance; see [122]) defined as

\[
\pi_k(\omega) P_k(\omega,\omega') = \pi_k(\omega') P_k(\omega',\omega), \quad \text{for all } \omega,\omega' \in \Omega \text{ and all iterations } k.
\tag{1.8}
\]
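The reversibility condition (1.8) can be checked numerically on a toy instance. The sketch below builds a one-step transition matrix from a symmetric generation probability and the Metropolis acceptance rule (1.1), then verifies that the Boltzmann distribution (1.3) satisfies detailed balance; the four-solution ring neighborhood, objective values, and fixed temperature are illustrative assumptions.

import math

f = [0.0, 2.0, 1.0, 3.0]          # objective values for solutions 0..3 (assumed)
n = len(f)
N = {w: [(w - 1) % n, (w + 1) % n] for w in range(n)}   # ring neighborhood
t = 1.5                            # a fixed temperature t_k

def accept(w, wp):
    # Metropolis acceptance probability (1.1).
    return min(1.0, math.exp(-(f[wp] - f[w]) / t))

P = [[0.0] * n for _ in range(n)]
for w in range(n):
    for wp in N[w]:
        P[w][wp] = (1.0 / len(N[w])) * accept(w, wp)   # symmetric generation g = 1/|N(w)|
    P[w][w] = 1.0 - sum(P[w][wp] for wp in N[w])       # self-loop so the row sums to 1

z = sum(math.exp(-fw / t) for fw in f)
pi = [math.exp(-fw / t) / z for fw in f]               # Boltzmann distribution (1.3)

for w in range(n):
    for wp in range(n):
        assert abs(pi[w] * P[w][wp] - pi[wp] * P[wp][w]) < 1e-9   # Eq. (1.8)
print("detailed balance holds; stationary distribution:", [round(p, 3) for p in pi])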

Reversibility is a sufficient condition for a unique solution to exist for Equations (1.6) and (1.7) at each outer loop iteration k. A necessary condition for reversibility is multiplicativity. That is, for any three solutions ω, ω′, ω′′ ∈ Ω such that f(ω) ≤ f(ω′) ≤ f(ω′′) and for all iterations k,

\[
\kappa_k(\Delta_{\omega,\omega''}) = \kappa_k(\Delta_{\omega,\omega'})\,\kappa_k(\Delta_{\omega',\omega''}),
\tag{1.9}
\]

where κ_k(Δ_{ω,ω′}) is the probability of accepting the transition from solution ω to solution ω′ at outer loop iteration k. Reversibility is enforced by assuming conditions of symmetry on the solution generation probabilities g_k and either by directly expressing the acceptance probability using an exponential form or by requiring the multiplicative condition in Equation (1.9).

The homogeneous Markov chain proofs of convergence in the literature (implicitly or explicitly) require the condition in Equation (1.9) to hold for the acceptance function and then address the sufficient conditions on the solution generation matrix P_k. For example, the original homogeneous proofs of convergence [3, 97] require the multiplicative condition for the acceptance function, and then assume that the solution generation matrix is symmetric and constant for all outer loop iterations k. Rossier et al. [123] partition the solution space into blocks composed of neighboring solutions of equal objective function value and then require that only the solution generation probabilities be symmetric between these blocks. Rossier et al. then express the acceptance function as a ratio of the stationary distribution probabilities. Faigle and Schrader [51] and Faigle and Kern [49] use a graph theoretic approach to relax the solution generation matrix symmetry condition.

However, they require that the solution acceptance probability function satisfies Equation (1.9). Granville et al. [70] propose a simulated annealing procedure for filtering binary images, where the acceptance function is based on the probability of the current solution, instead of the change in objective function value. The probability function that Granville et al. [70] present for accepting a candidate solution at (outer loop) iteration k is based on the ratio of the stationary probability of the incumbent solution from iteration k − 1 versus the stationary probability of an initial solution (which is based on a maximum likelihood estimate). The acceptance probability is

\[
\xi_k = q\,\pi_0(\omega)\big/\pi_{k-1}^{\varphi(k)}(\omega),
\tag{1.10}
\]

where q = inf_{ω∈Ω} π(ω)/sup_{ω′∈Ω} π(ω′) (q must also be estimated), and φ(k) is a slowly increasing function. Therefore, the probability of a solution transition does not consider the objective function value of the candidate solution. Granville et al. [70] provide a proof of asymptotic convergence of this approach, but note that the proof methodology does not show that the set of globally optimal solutions are asymptotically uniformly distributed.

Simulated annealing and the homogeneous convergence theory are based on the work of Metropolis et al. [101], which addresses problems in equilibrium statistical mechanics [74]. To see this relationship, consider a system in thermal equilibrium with its surroundings, in solution (state) S with energy F(S). The probability density in phase space of the point representing S is proportional to

\[
\exp(-F(S)/bT),
\tag{1.11}
\]

where b is the Boltzmann constant, and T is the absolute temperature of the surroundings. Therefore the proportion of time that the system spends in solution S is proportional to Equation (1.11) (see [74]); hence the equilibrium probability density for all S ∈ Ω is

\[
\pi_S = \frac{\exp(-F(S)/bT)}{\int \exp(-F(S)/bT)\,dS}.
\tag{1.12}
\]

The expectation of any valid solution function f(S) is thus

\[
E[f] = \frac{\int f(S)\exp(-F(S)/bT)\,dS}{\int \exp(-F(S)/bT)\,dS}.
\tag{1.13}
\]

Unfortunately, for many solution functions, Equation (1.13) cannot be evaluated analytically. Hammersley and Handscomb [74] note that one could theoretically use naive Monte Carlo techniques to estimate the value of the two integrals in Equation (1.13). However, this often fails in practice since the exponential factor means that a significant portion of the integrals is concentrated in a very small region of the solution space Ω. This problem can be overcome using importance sampling (see [18], Chapter 2), by generating solutions with the probability density in Equation (1.12). This approach would also seem to fail, because of the integral in the denominator of Equation (1.12). However, Metropolis et al. [101] solve this problem by first discretizing the solution space, such that the integrals in Equations (1.12) and (1.13) are replaced by summations over the set of discrete solutions ω ∈ Ω, and then by constructing an irreducible, aperiodic Markov chain with transition probabilities P(ω,ω′) such that

\[
\pi(\omega') = \sum_{\omega \in \Omega} \pi(\omega) P(\omega,\omega') \quad \text{for all } \omega' \in \Omega,
\tag{1.14}
\]

where

\[
\pi(\omega) = \frac{\exp(-F(\omega)/bT)}{\sum_{\omega' \in \Omega} \exp(-F(\omega')/bT)} \quad \text{for all } \omega \in \Omega.
\tag{1.15}
\]

Note that to compute the equilibrium distribution π, the denominator of Equation (1.13) (a normalizing constant) does not need to be calculated. Instead, only the ratios π(ω′)/π(ω) need to be computed and a transition matrix P defined that satisfies Equation (1.14). Hammersley and Handscomb [74] show that Metropolis et al. [101] accomplish this by defining P as the product of symmetric solution generation probabilities g(ω,ω′) and the equilibrium ratios π(ω′)/π(ω),

\[
P(\omega,\omega') =
\begin{cases}
g(\omega,\omega')\,\pi(\omega')/\pi(\omega) & \text{if } \pi(\omega')/\pi(\omega) < 1,\ \omega' \neq \omega,\\
g(\omega,\omega') & \text{if } \pi(\omega')/\pi(\omega) \ge 1,\ \omega' \neq \omega,\\
g(\omega,\omega) + \Delta & \text{if } \omega' = \omega
\end{cases}
\tag{1.16}
\]

with $\Delta = \sum_{\omega' \in \Omega,\, \pi(\omega') < \pi(\omega)} g(\omega,\omega')\,(1 - \pi(\omega')/\pi(\omega))$, where

\[
g(\omega,\omega') \ge 0, \quad \sum_{\omega' \in \Omega} g(\omega,\omega') = 1, \quad \text{and} \quad g(\omega,\omega') = g(\omega',\omega) \quad \text{for all } \omega,\omega' \in \Omega.
\tag{1.17}
\]

The use of stationary probability ratios to define the solution acceptance probabilities, combined with symmetric solution generation probabilities, enables Metropolis et al. [101] to use the reversibility condition in Equation (1.8) to show that Equations (1.16) and (1.17) satisfy Equation (1.14).

Homogeneous proofs of convergence for simulated annealing become more difficult to establish when the reversibility condition is not satisfied. Note that the existence of a unique stationary distribution for each outer loop iteration k is easily shown by specifying that each transition matrix P_k be irreducible and aperiodic. On the other hand, it becomes very difficult to derive an explicit closed-form expression for each stationary distribution π_k that remains analytically tractable as the problem's solution space becomes large. One can no longer use Equation (1.8) to describe each stationary distribution, since, in general, the multiplicative condition is not met. Instead, one must directly solve the system of equations formed with Equations (1.6) and (1.7). For example, in [38], Davis attempts to obtain a closed-form expression for π_k by using Cramer's rule and rewriting Equations (1.6) and (1.7) as

\[
\pi_k(\mathbf{I} - \mathbf{P}_k) = \mathbf{0}
\tag{1.18}
\]

and

\[
\pi_k \mathbf{e}^{T} = 1,
\tag{1.19}
\]

respectively, where boldface type indicates vector/matrix notation, I is the identity matrix, and e^T is a column vector of ones. Note that the card(Ω) × card(Ω) transition matrix P_k associated with Equation (1.18) is of rank card(Ω) − 1 [33]. Therefore, by deleting any one equation from Equation (1.18) and substituting Equation (1.19), the result is the set of card(Ω) linearly independent equations

\[
\pi_k(\mathbf{I} - \mathbf{P}_k)^{[i]} = \mathbf{e}_i,
\tag{1.20}
\]

where the square matrix (I − P_k)^{[i]} is obtained by substituting the ith column of matrix (I − P_k) with a column vector of ones. The vector e_i is a row vector of zeroes, except for a one in the ith position. Since (I − P_k)^{[i]} is of full rank, its determinant (written as det((I − P_k)^{[i]})) is non-zero. Define (I − P_k)^{ω} to be the same matrix as (I − P_k) except that the elements of the ωth row of (I − P_k) are replaced by the vector e_ω. Therefore, for all iterations k,

\[
\pi_k(\omega) = \frac{\det\!\big((\mathbf{I} - \mathbf{P}_k)^{[i]\,\omega}\big)}{\det\!\big((\mathbf{I} - \mathbf{P}_k)^{[i]}\big)}, \quad \text{for all } \omega \in \Omega.
\tag{1.21}
\]

In [38], an attempt is made to solve Equation (1.21) for each ω ∈ Ω via a multivariate Taylor series expansion of each determinant, but the method failed to produce a closed-form analytical expression.

Overall, the difficulty of explicitly expressing the stationary distributions for large solution spaces, combined with bounding the transition matrix condition number for large k, suggests that it is very difficult to prove asymptotic convergence of the simulated annealing algorithm by treating Equations (1.5) and (1.6) as a linear algebra problem.

Lundy and Mees [97] note that for each fixed outer loop iteration k, convergence to the solution equilibrium probability distribution vector π_k (in terms of the Euclidean distance between P_k^{(m)} and π_k, as m → +∞) is geometric since the solution space is finite, and the convergence factor is given by the second largest eigenvalue of the transition matrix P_k. This result is based on a standard convergence theorem for irreducible, aperiodic homogeneous Markov chains (see [33]). Note that a large solution space precludes practical calculation of this eigenvalue. Lundy and Mees [97] conjecture that when the temperature t_k is near zero, the second largest eigenvalue will be close to one for problems with local optima, and thus convergence to the equilibrium distribution will be very slow (recall that the dominant eigenvalue for P_k is 1, with algebraic multiplicity 1 [78]). Lundy and Mees [97] use the conjecture to justify why simulated annealing should be initiated with a relatively high temperature. For an overview of current methods for assessing non-asymptotic rates of convergence for general homogeneous Markov chains, see [121].

The assumption of stationarity for each outer loop iteration k limits practical application of homogeneous Markov chain theory. Romeo and Sangiovanni-Vincentelli [120] show that if equilibrium (for a Markov chain that satisfies the reversibility condition) is reached in a finite number of steps, then it can be achieved in one step. Thus, Romeo and Sangiovanni-Vincentelli [120] conjecture that there is essentially no hope for the most used versions of simulated annealing to reach equilibrium in a finite number of iterations.

1.2.1.2 Inhomogeneous Markov Chain Approach

The second convergence approach for simulated annealing is based on inhomogeneous Markov chain theory [10, 65, 104]. In this approach, the Markov chain need not reach a stationary distribution (e.g., the simulated annealing inner loop need not be infinitely long) for each outer loop k. On the other hand, an infinite sequence of (outer loop) iterations k must still be examined, with the condition that the temperature parameter t_k cool sufficiently slowly. The proof given by Mitra et al. [104] is based on satisfying the inhomogeneous Markov chain conditions of weak and strong ergodicity [78, 127]. The proof requires four conditions:

1. The inhomogeneous simulated annealing Markov chain must be weakly ergodic (i.e., dependence on the initial solution vanishes in the limit).
2. An eigenvector π_k with eigenvalue 1 must exist such that Equations (1.6) and (1.7) hold for every iteration k.
3. The Markov chain must be strongly ergodic (i.e., the Markov chain must be weakly ergodic and the sequence of eigenvectors π_k must converge to a limiting form), i.e.,
\[
\sum_{k=0}^{\infty} \lVert \pi_k - \pi_{k+1} \rVert < +\infty.
\tag{1.22}
\]
4. The sequence of eigenvectors must converge to a form where all probability mass is concentrated on the set of globally optimal solutions ω∗. Therefore,

\[
\lim_{k \to \infty} \pi_k = \pi^{\mathrm{opt}},
\tag{1.23}
\]

where π^opt is the equilibrium distribution with only global optima having probabilities greater than 0. (Note that weak and strong ergodicity are equivalent for homogeneous Markov chain theory.)

Mitra et al. [104] satisfy condition 1 (weak ergodicity) by first forming a lower bound on the probability of reaching any solution from any local minimum and then showing that this bound does not approach zero too quickly. For example, they define the lower bound for the simulated annealing transition probabilities in Equation (1.5) as

\[
P_k^{(m)}(\omega,\omega') \ge w^{m} \exp(-m\Delta_L/t_{km-1}),
\tag{1.24}
\]

for any integer k greater than or equal to some fixed integer k_0, where m is the number of transitions needed to reach any solution from any solution of non-maximal objective function value, w > 0 is a lower bound on the one-step solution generation probabilities, Δ_L is the maximum one-step increase in objective function value between any two solutions, and t_{km−1} is a temperature at iteration km − 1. Mitra et al. [104] show that the Markov chain is weakly ergodic if for any fixed integer k_0

\[
\sum_{k=k_0}^{\infty} \exp(-m\Delta_L/t_{km-1}) = +\infty.
\tag{1.25}
\]

Therefore, weak ergodicity is obtained if the temperature t_k is reduced sufficiently slowly to zero such that Equation (1.25) is satisfied. In general, the (infinite) sequence of temperatures {t_k}, k = 1,2,..., must satisfy

\[
t_k \ge \frac{\beta}{\log(k)},
\tag{1.26}
\]

where lim_{k→∞} t_k = 0, β is a problem-dependent constant, and k is the number of iterations. Mitra et al. [104] show that conditions (2), (3), and (4) are satisfied by using the homogeneous Markov chain theory developed for the transition probabilities in Equation (1.5), provided that the solution generation function is symmetric.

Romeo and Sangiovanni-Vincentelli [120] note that while the logarithmic cooling schedule in Equation (1.26) is a sufficient convergence condition, there are only a few values for β which make the logarithmic rule also necessary. Furthermore, there exists a unique choice for β which makes the logarithmic rule both necessary and sufficient for the convergence of simulated annealing to the set of global optima. In [72], Hajek was the first to show that the logarithmic cooling schedule (Equation (1.26)) is both necessary and sufficient, by developing a tight lower bound for β, namely the depth of the deepest local minimum which is not a global minimum, under a weak reversibility assumption (note that Hajek requires the depth of global optima to be infinitely large). Hajek defines a Markov chain to be weakly reversible if, for any pair of solutions ω,ω′ ∈ Ω and for any non-negative real number h, ω′ is reachable at height h from ω if and only if ω is reachable at height h from ω′. Note that Hajek [72] does not attempt to satisfy the conditions of weak and strong ergodicity, but rather uses a more general probabilistic approach to develop a lower bound on the probability of escaping local, but not global, optima. Connors and Kumar [35] substantiate the necessary and sufficient conditions in Hajek [72] using the orders of recurrence,

\[
B_i \equiv \sup\Big\{ x \ge 0 : \sum_{k=0}^{\infty} \exp(-x/t_k)\,\pi_k(i) = +\infty \Big\} \quad \text{for all } i \in \Omega.
\tag{1.27}
\]

Connors and Kumar [35] show that these orders of recurrence quantify the asymptotic behavior of each solution's probability in the solution distribution. The key result is that the simulated annealing inhomogeneous Markov chain converges in a Cesaro sense to the set of solutions having the largest recurrence orders. Borkar [16] improves this convergence result by using a convergence/oscillation dichotomy result for martingales. Tsitsiklis [150] uses bounds and estimates for singularly perturbed, approximately stationary Markov chains to develop a convergence theory that subsumes the condition of weak reversibility in [72]. Note that Tsitsiklis [150] defines N(h) ⊂ Ω as the set of all local minima (in terms of objective function value) of depth h + 1 or more. Therefore β is the smallest h such that all local (but not global) minima have depth h or less. Tsitsiklis conjectures that without some form of reversibility, there does not exist any h such that the global optima are contained in the set of local optima. Note that in [16, 28, 30, 35, 72, 104], the multiplicative condition (1.9) is required (either explicitly or implicitly) for the proofs of convergence.

Anily and Federgruen [10] use perturbation analysis techniques (e.g., see [102]) to prove convergence of a particular stochastic hill-climbing algorithm by bounding the deviations of the sequence of stationary distributions of the particular hill-climbing algorithm against the sequence of known stationary distributions corresponding to a simulated annealing algorithm. In general, this convergence proof approach is only useful for a restrictive class of simulated annealing algorithms, since the transition matrix condition number grows exponentially as the number of iterations k becomes large. Anily and Federgruen [10] also present a proof of convergence for simulated annealing with general acceptance probability functions. Using inhomogeneous Markov chain theory, they prove convergence under the following necessary and sufficient conditions:

1. The acceptance probability function must, for any iteration k, allow any hill-climbing transition to occur with positive probability.
2. The acceptance probability function must be bounded and asymptotically monotone, with limit zero for hill-climbing solution transitions.
3. In the limit, the stationary probability distribution must have 0 probability mass for every non-globally optimal solution.
4. The probability of escaping from any locally (but not globally) optimal solution must not approach 0 too quickly.

Anily and Federgruen [10] use condition (3) to relax the acceptance function multiplicative condition (1.9). However, in practice, condition (3) would be very difficult to check without assuming that Equation (1.9) holds. Condition (4) provides the necessary condition for the rate that the probability of hill-climbing transitions approaches 0. Condition (4) is expressed quantitatively as follows: let t_k be defined by Equation (1.2) and define the minimum one-step acceptance probability as

\[
a_k = \min_{\omega \in \Omega,\, \omega' \in N(\omega)} a_{t_k}(\omega,\omega').
\tag{1.28}
\]

Define the set of local optima L ⊂ Ω such that ω ∈ L implies that f(ω) ≤ f(ω′) for all ω′ ∈ N(ω), and let

\[
a_k^{*} = \min_{\omega \in L,\, \omega' \in N(\omega)\setminus L} a_{t_k}(\omega,\omega').
\tag{1.29}
\]

Finally, let any solution ω′ ∈ Ω be reachable from any solution ω ∈ Ω in q transitions or less. If (non-globally) locally optimal solutions exist,

\[
\sum_{k=1}^{\infty} (a_k^{*})^{q} = +\infty,
\tag{1.30}
\]

and conditions (1), (2), and (3) hold, then the simulated annealing algorithm will asymptotically converge to the set of global optima with probability 1. However, if (non-globally) locally optimal solutions exist and

\[
\sum_{k=1}^{\infty} a_k^{*} < +\infty,
\tag{1.31}
\]

then the simulated annealing algorithm will not always converge to the set of global optima with probability 1. Johnson and Jacobson [85] relax the sufficient conditions found in [10] by using a path argument between global optima and local (but not global) optima. Yao and Li [165] and Yao [164] also discuss simulated annealing algorithms with general acceptance probabilities, though their primary contribution is with respect to general neighborhood generation distributions. In [126], Schuur provides a description of acceptance functions ensuring the convergence of the associated simulated annealing algorithm to the set of global optima.

The inhomogeneous proof concept is stronger than the homogeneous approach in that it provides necessary conditions for the rate of convergence, but its asymptotic nature suggests that practical implementation may not be feasible. Romeo and Sangiovanni-Vincentelli [120] note that "there is no reason to believe that truncating the logarithmic temperature sequence would yield a good configuration, since the tail of the sequence is the essential ingredient in the proof." In addition, the logarithmic cooling schedule dictates a very slow rate of convergence. Therefore, most recent work has focused on methods of improving simulated annealing's finite-time behavior and modifying or blending the algorithm with other search methods such as genetic algorithms [92], tabu search [66], or both [59].
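The practical implication of the logarithmic schedule (1.26) is easy to see numerically. The sketch below compares it with a geometric schedule of the kind commonly used in implementations; the constant β and the geometric parameters are illustrative assumptions, chosen only to show how slowly the logarithmic rule cools.

import math

beta = 5.0                 # problem-dependent constant in Eq. (1.26) (assumed)
t0, alpha = 5.0, 0.95      # geometric schedule t_k = t0 * alpha**k (a common heuristic)

def t_log(k):
    return beta / math.log(k + 2)     # shifted so the schedule is defined at k = 0

def t_geo(k):
    return t0 * alpha ** k

for k in (0, 10, 100, 1000, 10**6):
    print(k, round(t_log(k), 4), round(t_geo(k), 10))
# After a million iterations the logarithmic schedule is still above 0.3,
# while the geometric schedule has long since frozen the search.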

1.2.2 Finite-Time Performance

Over the past decade, a growing body of work has been devoted to the finite-time behavior of simulated annealing. Jacobson and Yucesan [82] present necessary and sufficient (asymptotic) convergence conditions for generalized hill-climbing algorithms that include simulated annealing as a special case. They also introduce new performance measures that can be used to evaluate and compare both convergent and non-convergent generalized hill-climbing algorithms with random restart local search [79]. Such a comparison provides insights into both asymptotic and finite-time performance of discrete optimization algorithms. For example, they use the global visit probability to evaluate the performance of simulated annealing using random restart local search as a benchmark. These results suggest that random restart local search may outperform simulated annealing provided that a sufficiently large number of restarts are executed. In [60], Fox notes that such a result comparing random restart local search with simulated annealing can be true, but only if both the number of accepted and rejected moves are counted. A clever example provided in [60] illustrates this point and shows that comparing random restart local search and simulated annealing may not be prudent. In [59] and [61], Fox presents modifications of simulated annealing that circumvent the counting issue described in [60], hence yielding superior performing simulated annealing algorithm implementations. The primary value of using simulated annealing may therefore be for finite-time executions that obtain near-optimal solutions reasonably quickly. This, in turn, suggests that studying the finite-time behavior of simulated annealing is equally important as its asymptotic convergence.

Chiang and Chow [29] and Mazza [100] investigate the statistical properties of the first visit time to a global optimum, which provides insight into asymptotic properties of the algorithm as the outer loop counter k → +∞. In [20], Catoni investigates optimizing a finite-horizon cooling schedule to maximize the number of visits to a global optimum after a finite number of iterations. In [44], Desai focuses on finite-time performance by incorporating size-asymptotic information supplied by certain eigenvalues associated with the transition matrix. Desai [44] also provides some quantitative and qualitative information about the performance of simulated annealing after a finite number of steps, by observing that the quality of solutions is related to the number of steps that the algorithm has taken.

Srichander [133] examines the finite-time performance of simulated annealing using spectral decomposition of matrices. He proposes that an annealing schedule on the temperature is not necessary for the final solution of the simulated annealing algorithm to converge to the global minimum with probability 1. Srichander shows that initiating the simulated annealing algorithm with high initial temperatures produces an inefficient algorithm in the sense that the number of function evaluations required to obtain a global minimum is very large. A modified simulated annealing algorithm is presented with a low initial temperature and an iterative schedule on the size of the neighborhood sets that leads to a more efficient algorithm. The new algorithm is applied to a real-world example and computational performance is reported.
Fleischer and Jacobson [58] use a reverse approach to establish theoretical relationships between the finite-time performance of an algorithm and the characteristics of problem instances. They observe that the configuration space created by an instance of a discrete optimization problem determines the efficiency of simulated annealing when applied to that problem. The entropy of the Markov chain embodying simulated annealing is introduced as a measure that captures the entire topology of the configuration space associated with the particular instance of the discrete optimization problem. By observing the expected value of the final state in a simulated annealing algorithm as it relates to the entropy value of the underlying Markov chain, they present measures of performance that determine how well the simulated annealing algorithm performs in finite time. Their computational results suggest that superior finite-time performance of a simulated annealing algorithm is associated with higher entropy measures.

Nolte and Schrader [111] give a proof of the convergence of simulated annealing by applying results about rapidly mixing Markov chains. With this proof technique, it is possible to obtain better bounds for the finite-time behavior of simulated annealing than previously known.

To evaluate the expected run-time required by a simulated annealing algorithm to reach a solution of pre-specified quality, Wood et al. [161] present an approach to model and analyze a generic stochastic global optimization algorithm using a sequence of stochastic processes, culminating in a backtracking adaptive search process. The theory developed for this backtracking adaptive search procedure is then used to analyze the classic simulated annealing algorithm. In [118], Rajasekaran presents an analysis of simulated annealing that provides a time bound for convergence with very high probability. Convergence of simulated annealing in the limit then follows as a corollary to the established finite-time performance results.

1.3 Relationship to Other Local Search Algorithms

The hill-climbing strategy inherent in simulated annealing has led to the formulation of other such algorithms (e.g., threshold accepting, the noising method). Moreover, though different in how they traverse the solution space, both tabu search and genetic algorithms share with simulated annealing the objective of using local information to find global optima over solution spaces with multiple local optima.

1.3.1 Threshold Accepting

Questioning the very need for a randomized acceptance function, Dueck and Scheuer [45] and, independently, Moscato and Fontanari [106] propose the threshold accepting algorithm, where the acceptance probability function is

\[
a_k(\Delta_{\omega,\omega'}) =
\begin{cases}
1 & \text{if } Q_k \ge \Delta_{\omega,\omega'},\\
0 & \text{otherwise},
\end{cases}
\]

with Q_k defined as the threshold value at iteration k. Q_k is typically set to be a deterministic, non-increasing step function in k. Dueck and Scheuer [45] report computational results that suggest dramatic improvements in traveling salesman problem solution quality and algorithm run-time over basic simulated annealing. Moscato and Fontanari [106] report more conservative results—they conjecture that simulated annealing's probabilistic acceptance function does not play a major role in the search for near-optimal solutions.

Althofer and Koschnick [8] develop a convergence theory for threshold accepting based on the concept that simulated annealing belongs to the convex hull of threshold accepting. The idea presented in [8] is that (for a finite Q_k threshold sequence) there can exist only finitely many threshold accepting transition matrices, but simulated annealing can have infinitely many transition matrices because of the real-valued nature of the temperature at each iteration. However, every simulated annealing transition matrix for a given problem can be represented as a convex combination of the finitely many threshold accepting transition matrices. Althofer and Koschnick [8] are unable to prove that threshold accepting will asymptotically reach a global minimum, but they do prove the existence of threshold schedules that provide convergence to within an ε-neighborhood of the optimal solutions. Jacobson and Yucesan [81] prove that if the threshold value approaches 0 as k approaches infinity, then the algorithm does not converge in probability to the set of globally optimal solutions.

Hu et al. [77] modify threshold accepting to include a non-monotonic, self-tuning threshold schedule in the hope of improving the algorithm's finite-time performance. Hu et al. allow the threshold Q_k to change dynamically (either up or down), based on the perceived likelihood of being near a local minimum. These changes are accomplished using a principle they call dwindling expectation—when the algorithm fails to move to neighboring solutions, the threshold Q_k is gradually increased, in the hope of eventually escaping a local optimum. Conversely, when solution transitions are successful, the threshold is reduced, in order to explore local optima. The experimental results based on two traveling salesman problems presented in [77] showed that the proposed algorithm outperformed previous hill-climbing methods in terms of finding good solutions earlier in the optimization process.

Threshold accepting's advantages over simulated annealing lie in its ease of implementation and its generally faster execution time, due to the reduced computational effort in avoiding acceptance probability computations and the generation of random numbers [106]. However, compared to simulated annealing, relatively few threshold accepting applications are reported in the literature [93, 110, 125].
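The following sketch drops the threshold accepting rule above into the same outer/inner loop structure used for simulated annealing: a candidate is accepted whenever its deterioration does not exceed the current threshold Q_k. The objective, neighborhood, and threshold step function are illustrative assumptions.

import random

def threshold_accepting(f, neighbor, omega0, thresholds, M=50):
    omega = omega0
    best = omega
    for Qk in thresholds:                 # deterministic, non-increasing in k
        for _ in range(M):
            cand = neighbor(omega)
            if f(cand) - f(omega) <= Qk:  # accept iff Q_k >= Delta (no randomness)
                omega = cand
            if f(omega) < f(best):
                best = omega
    return best

f = lambda x: (x - 7) ** 2                # assumed toy objective
neighbor = lambda x: x + random.choice([-1, 1])
thresholds = [8, 4, 2, 1, 0]              # assumed non-increasing step function Q_k
print(threshold_accepting(f, neighbor, 50, thresholds))

Because the acceptance test is a simple comparison, no exponentials or uniform random numbers are needed for acceptance, which is the source of the speed advantage noted above.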

1.3.2 Noising Method

Charon and Hudry [23] advocate a simple descent algorithm called the noising method. The algorithm first perturbs the solution space by adding random noise to the problem's objective function values. The noise is gradually reduced to 0 during the algorithm's execution, allowing the original problem structure to reappear. Charon and Hudry provide computational results, but do not prove that the algorithm will asymptotically converge to the set of globally optimal solutions. Charon and Hudry [24] show how the noising method is a generalization of simulated annealing and threshold accepting.

Storer et al. [136] propose an optimization strategy for sequencing problems, by integrating fast, problem-specific heuristics with local search. Its key contribution is to base the definition of the search neighborhood on a problem pair (h, p), where h is a fast, known, problem-specific heuristic and p represents the problem data. By perturbing the heuristic, the problem, or both, a neighborhood of solutions is developed. This neighborhood then forms the basis for local search. The hope is that the perturbations will cluster good solutions close together, thus making it easier to perform local search.
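A rough sketch of the noising idea is plain descent on perturbed move evaluations, with the perturbation amplitude driven to zero. Note that this sketch adds noise at evaluation time rather than pre-perturbing stored objective values, and the uniform noise model, linear decay schedule, and toy problem are all illustrative assumptions rather than the published method.

import random

def noising_descent(f, neighbor, omega0, r0=5.0, rounds=20, M=100):
    omega = omega0
    for k in range(rounds):
        rate = r0 * (1 - k / (rounds - 1))            # noise amplitude shrinks to 0
        for _ in range(M):
            cand = neighbor(omega)
            noised_delta = (f(cand) - f(omega)) + random.uniform(-rate, rate)
            if noised_delta <= 0:                     # plain descent on the noised comparison
                omega = cand
    return omega

f = lambda x: (x - 7) ** 2 + 5 * (x % 3 == 0)         # assumed bumpy objective
neighbor = lambda x: x + random.choice([-1, 1])
print(noising_descent(f, neighbor, 50))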

1.3.3 Tabu Search

Tabu search [66] is a general framework for a variety of iterative local search strategies for discrete optimization. Tabu search uses the concept of memory by controlling the algorithm's execution via a dynamic list of forbidden moves. This allows the tabu search algorithm to intensify or diversify its search of a given problem's solution space in an effort to avoid entrapment in local optima. See [67] for a discussion on the convergence of tabu search algorithms.

Given that simulated annealing is completely memoryless (i.e., simulated annealing disregards all historical information gathered during the algorithm's execution), tabu search provides an alternative mechanism to hill-climb and escape local optima. Faigle and Kern [50] propose a particular tabu search algorithm called probabilistic tabu search as a meta-heuristic to help guide simulated annealing. Probabilistic tabu search attempts to capitalize on both the asymptotic optimality of simulated annealing and the memory feature of tabu search. In probabilistic tabu search, the probabilities of generating and accepting each candidate solution are set as functions of both a temperature parameter (as in simulated annealing) and information gained in previous iterations (as in tabu search). Faigle and Kern [50] are then able to prove asymptotic convergence of their particular tabu search algorithm by using methods developed for simulated annealing [49, 52]. Note that the results in [50] build upon work by Glover [67] where probabilistic tabu search was first introduced and contrasted with simulated annealing.

Vaughan and Jacobson [158] develop a framework, termed tabu guided generalized hill climbing, that uses a tabu release parameter that probabilistically accepts solutions currently on the tabu list. The presented algorithms are modeled as a set of stationary Markov chains, where the tabu list is fixed for each outer loop iteration. This framework provides practitioners with guidelines for developing tabu search strategies to use in conjunction with generalized hill-climbing algorithms that preserve some of the algorithms' known performance properties. Sufficient conditions are obtained that indicate how to design iterations for problem-specific tabu search strategies.

1.3.4 Genetic Algorithms

Genetic algorithms [92] emulate the evolutionary behavior of biological systems. They generate a sequence of populations of candidate solutions to the underlying optimization problem by using a set of genetically inspired stochastic solution transition operators to transform each population of candidate solutions into a descendent population. The three most popular transition operators are reproduction, cross-over, and mutation [38]. Davis and Principe [39] and Rudolph [124] attempt to use homogeneous finite Markov chain techniques to prove convergence of genetic algorithms [21], but are unable to develop a theory comparable in scope to that of simulated annealing.

Zolfaghari and Liang [167] undertake a comparative study of simulated annealing, genetic algorithms, and tabu search for solving binary (considering only machines and part types) machine-grouping problems of varying types (involving machine/part types, processing times, lot sizes, and machine capacities). To test the performance of the three metaheuristics, two binary performance indices and two generalized performance indices are used for binary and comprehensive machine/part grouping problems, respectively. The comparisons are made in terms of solution quality, search convergence behavior, and pre-search effort. The results indicate that simulated annealing outperforms both genetic algorithms and tabu search, particularly for large problems.

In [107], Muhlenbein presents a theoretical analysis of genetic algorithms based on population genetics. He counters the popular notion that models that mimic natural phenomena are superior to other models. The article argues that evolutionary algorithms can be inspired by nature, but do not necessarily have to copy a natural phenomenon. He addresses the behavior of transition operators and designs new genetic operators that are not necessarily related to events in nature, yet still perform well in practice.

One criticism of simulated annealing is the slow speed at which it converges. In [41], Delport combines simulated annealing with evolutionary algorithms to improve performance in terms of speed and solution quality. The benefit of this hybrid system of simulated annealing and evolutionary selection is due to the adjustments in the cooling schedule based on fast recognition of the thermal equilibrium in terms of selection intensity, which results in much faster convergence of the algorithm.

Sullivan and Jacobson [139, 140] link genetic algorithms with simulated annealing using generalized hill-climbing algorithms [80]. They first link genetic algorithms to ordinal hill-climbing algorithms, which can then be used, through their formulation within the generalized hill-climbing algorithm framework, to form a bridge with simulated annealing. Though genetic algorithms have proven to be effective for addressing intractable discrete optimization problems and can be classified as a type of hill-climbing approach, their link with generalized hill-climbing algorithms (through the ordinal hill-climbing formulation) provides a means to establish well-defined relationships with other generalized hill-climbing algorithms (like simulated annealing and threshold accepting). They also present two formulations of genetic algorithms that provide a first step toward developing a bridge between genetic algorithms and other local search strategies like simulated annealing.

1.3.5 Generalized Hill-Climbing Algorithms

Generalized hill-climbing algorithms (GHC) (see [80]) provide a framework for modeling local search algorithms used to address intractable discrete optimization problems. All generalized hill-climbing algorithms have the same basic structure, but can be tailored to a specific instance of a problem by changing the hill-climbing random variable (which is used to accept or reject inferior solutions) and neighborhood functions. Generalized hill-climbing algorithms are described in pseudo-code form:

Select an initial solution ω ∈ Ω
Set the outer loop counter bound K and the inner loop counter bounds M_k, k = 1,2,...,K
Define a set of hill-climbing (random) variables R_k : Ω × Ω → (−∞,+∞), k = 1,2,...,K
Set the iteration indices k = m = 1
Repeat while k ≤ K
    Repeat while m ≤ M_k
        Generate a solution ω′ ∈ N(ω)
        Calculate Δ_{ω,ω′} = f(ω′) − f(ω)
        If R_k(ω,ω′) ≥ Δ_{ω,ω′}, then ω ← ω′
        m ← m + 1
    Until m = M_k
    m ← 1, k ← k + 1
Until k = K

Note that the outer and inner loop bounds, K and M_k, k = 1,2,...,K, respectively, may all be fixed, or K can be fixed with the M_k, k = 1,2,...,K, defined as random variables whose values are determined by the solution at the end of each set of inner loop iterations satisfying some property (e.g., the solution is a local optimum).

Generalized hill-climbing algorithms can be viewed as sampling procedures over the solution space Ω. The key distinction between different generalized hill-climbing algorithm formulations is in how the sampling is performed. For example, simulated annealing produces biased samples, guided by the neighborhood function, the objective function, and the temperature parameter. More specifically, simulated annealing can be described as a generalized hill-climbing algorithm by setting the hill-climbing random variable R_k(ω,ω′) = −t_k ln(u_k), ω ∈ Ω, ω′ ∈ N(ω), k = 1,2,...,K, with the {u_k} independently and identically distributed U(0,1) random variables. To formulate Monte Carlo search as a generalized hill-climbing algorithm, set R_k(ω,ω′) = +∞, ω ∈ Ω, ω′ ∈ N(ω), k = 1,2,...,K. Deterministic local search, which accepts only neighbors of improving (lower) objective function value, can be expressed as a generalized hill-climbing algorithm with R_k(ω,ω′) = 0, ω ∈ Ω, ω′ ∈ N(ω), k = 1,2,...,K. Other algorithms that can be described using the generalized hill-climbing framework include threshold accepting, some simple forms of tabu search, and Weibull accepting. For detailed discussions of these algorithms and a description of how they fit into the generalized hill-climbing algorithm framework, see [80, 85, 139].

Measures for assessing the finite-time performance of generalized hill-climbing algorithms have been developed, including the expected number of iterations to visit a predetermined objective function value level. Jacobson et al. [83] introduce the cyclical simulated annealing algorithm and describe a procedure to estimate lower and upper bounds for the expected number of iterations to visit a near-optimal solution for this algorithm. Computational results with four traveling salesman problem instances are reported.
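The following sketch expresses the generalized hill-climbing template as a single routine parameterized by the hill-climbing random variable R_k, and instantiates the three choices named above (simulated annealing, Monte Carlo search, and deterministic local search). The toy objective, neighborhood, and cooling schedule are illustrative assumptions.

import math
import random

def ghc(f, neighbor, omega0, R, K=50, M=50):
    # R(k) is called once per inner iteration and returns a realized value of R_k.
    omega = omega0
    for k in range(1, K + 1):
        for _ in range(M):
            cand = neighbor(omega)
            delta = f(cand) - f(omega)
            if R(k) >= delta:            # accept iff R_k(omega, omega') >= Delta
                omega = cand
    return omega

f = lambda x: (x - 7) ** 2                         # assumed toy objective
neighbor = lambda x: x + random.choice([-1, 1])
t = lambda k: 10.0 * (0.9 ** k)                    # assumed cooling schedule

R_sa  = lambda k: -t(k) * math.log(1.0 - random.random())  # R_k = -t_k ln(u_k), u_k ~ U(0,1)
R_mc  = lambda k: float("inf")                             # Monte Carlo search: accept everything
R_det = lambda k: 0.0                                      # deterministic local search

for name, R in [("SA", R_sa), ("Monte Carlo", R_mc), ("descent", R_det)]:
    print(name, ghc(f, neighbor, 50, R))

With R_k = -t_k ln(u_k), the acceptance event R_k ≥ Δ occurs with probability exp(-Δ/t_k) for Δ > 0, so this choice reproduces the Metropolis acceptance criterion (1.1).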

1.4 Practical Guidelines

Implementation issues for simulated annealing can follow one of two paths—that of specifying problem-specific choices (objective function and neighborhood) and that of specifying generic choices (generation and acceptance probability functions and the cooling schedule) (see [46]). These choices often depend on the domain of each specific problem. The principal shortcoming of simulated annealing is that it often requires extensive computer time. Implementation modifications generally strive to retain simulated annealing’s asymptotic convergence character, but at reduced computer run-time. The methods discussed here are mostly heuristic.

1.4.1 Problem-Specific Choices

1.4.1.1 Objective Functions

One problem-specific choice involves the objective function specification. In [135], Stern recommends a heuristic temperature-dependent penalty function as a substitute for the actual objective function for problems where low-cost solutions have neighbors of much higher cost or in cases of degeneracy (i.e., large neighborhoods of solutions of equal, but high, costs). The original objective function surfaces as the penalty and the temperature are gradually reduced to zero. This technique is similar to the noising method presented by Charon and Hudry in [23], where the penalty function is described as noise and is reduced at each outer loop iteration of the algorithm.

One speed-up technique is to evaluate only the difference in objective functions, Δ(ω, ω′), instead of calculating both f(ω) and f(ω′). In [148], Tovey suggests several methods of probabilistically approximating Δ(ω, ω′) with surrogate functions (that are faster to evaluate than Δ(ω, ω′), but not as accurate) for cases when evaluation of Δ(ω, ω′) is expensive; this technique is referred to as the surrogate function swindle.

Straub et al. [137] improve the performance of simulated annealing on problems in chemical physics by using the continuous normal density distribution, rather than single point particles, to describe the potential energy landscape. Ma and Straub [98] report that using this distribution has the effect of smoothing the energy landscape by reducing both the number and depth of local minima.

Yan and Mukai [162] consider the case when a closed-form formula for the objective function is not available. They use a probabilistic simulation (termed the stochastic ruler method) to generate a sample objective function value for an input solution and then accept the solution if the sample objective function value falls within a predetermined bound. They also provide a proof of asymptotic convergence by extrapolating the convergence proofs for simulated annealing and analyze the rate of convergence.
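As a simple illustration of the difference-evaluation idea (an illustrative sketch, not taken from [148]; the names tour and dist are hypothetical), the change in tour length caused by a 2-opt move on a symmetric traveling salesman instance can be computed from the four affected edges alone, without recomputing the full objective:

def two_opt_delta(tour, dist, i, j):
    """Delta(omega, omega') for reversing the segment tour[i+1..j]:
    only two edges are removed and two are added, so the difference in
    tour length is obtained in constant time."""
    n = len(tour)
    a, b = tour[i], tour[(i + 1) % n]   # edge (a, b) is removed
    c, d = tour[j], tour[(j + 1) % n]   # edge (c, d) is removed
    return (dist[a][c] + dist[b][d]) - (dist[a][b] + dist[c][d])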

1.4.1.2 Neighborhoods

A key problem-specific choice concerns the neighborhood function definition. The efficiency of simulated annealing is highly influenced by the neighborhood function used [105]. The choice of neighborhood serves to enforce a topology. In [46], Eglese reports that “a neighborhood structure which imposes a ‘smooth’ topology where the local minima are shallow is preferred to a ‘bumpy’ topology where there are many deep local minima.” Solla et al. [132] and Fleischer and Jacobson [58] report similar conclusions. This also supports the result in [72] that shows that asymptotic convergence to the set of global optima depends on the depth of the local minima.

Another factor to consider when choosing neighborhood functions is the neighborhood size. No theoretical results are available, other than the necessity of reachability (in a finite number of steps) from any solution to any other solution. Cheh et al. [25] report that small neighborhoods are best, while Ogbu and Smith [113] provide evidence that larger neighborhoods result in better simulated annealing performance. Goldstein and Waterman [68] conjecture that if the neighborhood size is small compared to the total solution space cardinality, then the Markov chain cannot move around the solution space fast enough to find the minimum in a reasonable time. On the other hand, a very large neighborhood has the algorithm merely sampling randomly from a large portion of the solution space, and thus it is unable to focus on specific areas of the solution space. It is reasonable to believe that neighborhood size is heavily problem specific. For example, problems whose solution space topology is moderately insensitive to different neighborhood definitions may benefit from larger neighborhood sizes.

Concepts from information theory are used in [54] and [58] to show that the neighborhood structure can affect the information rate or total uncertainty associated with simulated annealing. In [54], Fleischer shows that simulated annealing tends to perform better as the entropy level of the associated Markov chain increases and thus conjectures that an entropy measure could be useful for predicting when simulated annealing would perform well on a given problem. However, efficient ways of estimating the entropy are needed to transform this result into a practical tool.

Triki et al. [151] present an empirical study on the efficiency of the simulated annealing algorithm, as impacted by the landscape and the choice of the neighborhood function. The experiments they conducted follow the observation that it is possible to compute the exact probability for the algorithm to reach any point in the landscape, provided that the number of solutions and the number of neighbors per solution are sufficiently small. The computational tool they developed allows one to study the influence of the tuning of all the main parameters of simulated annealing, as well as theoretical concepts such as thermodynamic equilibrium and optimal temperature decrement rules.

Bouffard and Ferland [17] propose a method to improve the simulated annealing algorithm with a variable neighborhood search to solve the resource-constrained scheduling problem. The method is compared numerically with other neighborhood search techniques: threshold accepting methods and tabu search. Furthermore, these techniques are combined with multi-start diversification strategies. The numerical results indicate that using a variable neighborhood search technique indeed improves the performance.

Another issue on neighborhood function definition addresses the solution space itself. Chardaire et al. [22] propose a method for addressing 0−1 optimization problems, in which the solution space is progressively reduced by fixing the value of strongly persistent variables (which have the same value in all optimal solutions). They isolate the persistent variables during simulated annealing’s execution by periodically estimating the expectation of the random variable (a vector of binary elements) that describes the current solution and fixing the value of those elements in the random variable that meet threshold criteria.
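To illustrate how the neighborhood function discussed in this subsection is a design choice that shapes the topology of the search, the two sketches below (hypothetical examples for permutation-encoded problems, not taken from the cited studies) define alternative neighborhoods: a pairwise swap and a segment reversal. Which of them imposes the "smoother" topology depends on the problem, as the discussion above suggests.

import random

def swap_neighbor(perm):
    """Exchange two randomly chosen positions of a permutation (given as a list)."""
    i, j = random.sample(range(len(perm)), 2)
    nbr = list(perm)
    nbr[i], nbr[j] = nbr[j], nbr[i]
    return nbr

def reversal_neighbor(perm):
    """Reverse a randomly chosen segment (a 2-opt style move), which
    induces a different topology on the same solution space."""
    i, j = sorted(random.sample(range(len(perm)), 2))
    return perm[:i] + list(reversed(perm[i:j + 1])) + perm[j + 1:]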

1.4.2 Generic Choices

1.4.2.1 Generation Probability Functions

Generation probability functions are usually chosen as uniform distributions with probabilities proportional to the size of the neighborhood. The generation probability function is typically not temperature dependent. In [59], Fox suggests that instead of blindly generating neighbors uniformly, one can adopt an intelligent generation mechanism that modifies the neighborhood and its probability distribution to accommodate search intensification or diversification, in the same spirit as the tabu search metaheuristic. Fox also notes that simulated annealing convergence theory does not preclude this idea. In [148], Tovey suggests an approach with a similar effect, called the neighborhood prejudice swindle.
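A minimal sketch of a non-uniform generation mechanism in the spirit of this idea (hypothetical names; not an implementation from [59] or [148]): move types are sampled according to weights, and shifting the weights during the run intensifies or diversifies the search.

import random

def make_generator(move_fns, weights):
    """Return a generation mechanism that applies one of several move
    functions, chosen with probability proportional to its weight."""
    def generate(solution):
        move = random.choices(move_fns, weights=weights, k=1)[0]
        return move(solution)
    return generate

# For example, favoring small swap moves 4:1 over disruptive reversal moves
# (reusing the hypothetical neighborhood sketches above):
#   generate = make_generator([swap_neighbor, reversal_neighbor], [4, 1])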

1.4.2.2 Acceptance Probability Functions

The literature reports considerable experimentation with acceptance probability functions for hill-climbing transitions. The most popular is the exponential form (1.1). Ogbu and Smith [113] consider replacing the basic simulated annealing acceptance function ak(Δ(ω, ω′)) with a geometrically decreasing form that is independent of the change in objective function value. They adopt a probabilistic-exhaustive heuristic technique in which randomly chosen neighbors of a solution are examined and all solutions that are accepted are noted, but only the last solution accepted becomes the new incumbent. The hope is that this scheme will explore a broader area of the problem solution space. Their acceptance probability function is defined for all solutions ω, ω′ ∈ Ω and for k = 1, 2, ..., K as

ak(Δ(ω, ω′)) = ak = { a1 x^(k−1)   if f(ω′) > f(ω),
                      1            otherwise,

where a1 is the initial acceptance probability value, x ∈ (0,1) is a reducing factor, and K is the number of stages (equivalent to a temperature cooling schedule). They also experiment with this method (and a neighborhood of large cardinality) on a permutation flow shop problem and report that this approach found solutions comparable to those of the basic simulated annealing algorithm in one-third the computation time.
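For comparison, the stage-dependent rule above and the exponential form can both be written as small acceptance tests (a hedged sketch with assumed parameter names, not code from [113]):

import math
import random

def staged_accept(delta, k, a1=0.9, x=0.95):
    """Geometrically decreasing acceptance: worsening moves (delta > 0) are
    accepted with probability a1 * x**(k-1), independent of the size of delta."""
    return delta <= 0 or random.random() < a1 * x ** (k - 1)

def exponential_accept(delta, t):
    """Exponential (Metropolis-style) acceptance at temperature t."""
    return delta <= 0 or random.random() < math.exp(-delta / t)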

1.4.2.3 Cooling Schedules

The simulated annealing cooling schedule is fully defined by an initial temperature, a schedule for reducing/changing the temperature, and a stopping criterion. Romeo and Sangiovanni-Vincentelli [120] note that an effective cooling schedule is essential to reducing the amount of time required by the algorithm to find an optimal solution. Therefore, much of the literature on cooling schedules (see [19, 34, 62, 112]) is devoted to this efficiency issue.

Homogeneous simulated annealing convergence theory has been used to design effective cooling schedules. Romeo and Sangiovanni-Vincentelli [120] suggest the following procedure for designing a cooling schedule:

1. Start with an initial temperature t0 for which a good approximation of the stationary distribution π_{t0} is quickly reached.
2. Reduce t0 by an amount δ(t) small enough such that π_{t0} is a good starting point to approximate π_{t0−δ(t)}.
3. Fix the temperature at a constant value during the iterations needed for the solution distribution to approximate π_{t0−δ(t)}.

Repeat the above process of cooling and iterating until no further improvement seems possible.

Generally, the initial temperature is set such that the acceptance ratio of bad moves is equal to a certain value. In [14], Ben-Ameur proposes an algorithm to compute a temperature which is compatible with a given acceptance ratio. The acceptance probability function is shown to be convex for low temperatures and concave for high temperatures, and a lower bound is provided for the number of required temperature changes based on a geometric cooling schedule.

Cooling schedules are grouped into two classes: static schedules, which must be completely specified before the algorithm begins, and adaptive schedules, which adjust the temperature's rate of decrease from information obtained during the algorithm's execution. Cooling schedules are almost always heuristic; they seek to balance moderate execution time with simulated annealing's dependence on asymptotic behavior.
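The target-acceptance-ratio idea for setting t0 can be sketched as a one-dimensional search: collect the objective increases of sampled worsening moves, then solve for the temperature at which their average acceptance probability matches the target. The sketch below is a simple heuristic in that spirit, not the algorithm of [14]; sample_deltas and target_ratio are assumed inputs.

import math

def initial_temperature(sample_deltas, target_ratio=0.8, lo=1e-6, hi=1e6, iters=60):
    """Bisect (on a log scale) for t0 such that the mean acceptance probability
    of the sampled worsening moves, mean(exp(-d / t0)), is close to target_ratio."""
    def mean_accept(t):
        return sum(math.exp(-d / t) for d in sample_deltas) / len(sample_deltas)
    for _ in range(iters):
        mid = math.sqrt(lo * hi)
        if mean_accept(mid) < target_ratio:
            lo = mid          # temperature too low: acceptance ratio too small
        else:
            hi = mid          # temperature high enough: try a lower value
    return math.sqrt(lo * hi)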

Strenski and Kirkpatrick [138] present an exact (non-heuristic) characterization of finite-length annealing schedules. They consider extremely small problems that represent features (local optima and smooth/hilly topologies) and determine the probability distribution of outcomes of the annealing process in a finite number of iterations to gain insights into some popular assumptions and intuition behind cooling schedules. Their experiments suggest that optimal cooling schedules are not monotone decreasing in temperature. They also show that for the test problem (a white noise surface), geometric and linear cooling schedules perform better than inverse logarithmic cooling schedules, when sufficient computing effort is allowed. Moreover, their experiments do not show measurable performance differences between linear and geometric cooling schedules. They also observe that geometric cooling schedules are not greatly affected by excessively high initial temperatures. The results presented suggest that even the most robust adaptive cooling schedule “produces annealing trajectories which are never in equilibrium” [138]. However, they also conclude that the transition acceptance rate is not sensitive to the degree of closeness to the equilibrium distribution.

Christoph and Hoffmann [31] also attempt to characterize optimal cooling schedules. They derive a relationship between a finite sequence of optimal temperature values (i.e., outer loops) and the number of iterations (i.e., inner loops) at each respective temperature for several small test problems to reach optimality (i.e., the minimal mean final energy). They find that this scaling behavior is of the form

xm = am ν^(−bm),    (1.32)

where a and b are scaling coefficients, xm = exp(−1/tk) is referred to as the temperature, ν is the number of inner loop iterations at temperature xm, and m is the number of outer loops at which the temperature xm is reduced. The proposed approach is to solve for the coefficients a and b based on known temperature and iteration parameter values for an optimal schedule based on several replications of the algorithm using (m × ν) iterations for each replication, and then use Equation (1.32) to interpolate the optimal cooling schedule for intermediate iterations. They however do not make any suggestions on how to efficiently solve for the necessary optimal cooling schedules for a (typically large) problem instance.

Romeo and Sangiovanni-Vincentelli [120] present a theoretical framework for evaluating the performance of the simulated annealing algorithm. They discuss annealing schedules in terms of the initial temperature T = t0, the number of inner loops for each value of tk, the decrease rate of the temperature (i.e., the cooling schedule), and the criteria for stopping the algorithm. They conclude that the theoretical results obtained thus far have not been able to explain why simulated annealing is so successful even when a diverse collection of static cooling schedule heuristics is used. Many heuristic methods are available in the literature to find optimal cooling schedules, but the effectiveness of these schedules can only be compared through experimentation. They conjecture that the neighborhood and the corresponding topology of the objective function are responsible for the behavior of the algorithm.

Cohn and Fielding [34] conduct a detailed analysis of various cooling schedules and how they affect the performance of simulated annealing. Convergent simulated annealing algorithms are often too slow in practice, whereas a number of non-convergent algorithms may be preferred for good finite-time performance. They analyze various cooling schedules and present cases where repeated independent runs using a non-convergent cooling schedule provide acceptable results in practice. They provide examples of when it is both practically and theoretically justified to use a very high, fixed temperature, or even fast cooling schedules which have a small probability of reaching global minima, and apply these cooling schedules to traveling salesman problems of various sizes. Fielding [53] computationally studies fixed temperature cooling schedules for the traveling salesman problem, the quadratic assignment problem, and the graph partitioning problem and demonstrates that a fixed temperature cooling schedule can yield superior results in locating optimal and near-optimal solutions. Orosz and Jacobson [115, 116] present finite-time performance measures for simulated annealing with fixed temperature cooling schedules. They illustrate their measures using randomly generated instances of the traveling salesman problem.
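For reference, the schedule families compared in these studies can be written as simple temperature functions of the outer-loop index k; the constants below are illustrative placeholders, not values recommended by the cited works.

import math

def geometric_schedule(t0=100.0, alpha=0.95):
    return lambda k: t0 * alpha ** k               # t_k = t0 * alpha^k

def linear_schedule(t0=100.0, step=0.5):
    return lambda k: max(t0 - step * k, 1e-9)      # t_k = t0 - step*k, kept positive

def log_schedule(c=100.0):
    return lambda k: c / math.log(k + 2)           # inverse logarithmic cooling

def fixed_schedule(t=10.0):
    return lambda k: t                             # constant temperature, as studied in [53]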
Another approach to increasing the speed of simulated annealing is to implement a two-staged simulated annealing algorithm. In two-staged simulated annealing algorithms, a fast heuristic is used to replace simulated annealing at higher temperatures, with a traditional simulated annealing algorithm implemented at lower temperatures to improve on the fast heuristic solution. In addition to implementing an intelligent cooling schedule, finding the initial temperature t0 to initialize the traditional simulated annealing algorithm is important to the success of the two-staged algorithm. Varanelli and Cohoon [157] propose a method for determining an initial temperature t0 for two-staged simulated annealing algorithms using traditional cooling schedules. They note that if t0 is too low at the beginning of the traditional simulated annealing phase, the algorithm can get trapped in an inferior solution, while if the initial temperature t0 is too high, the algorithm can waste too many iterations (and hence computing time) by accepting too many hill-climbing moves.

Azizi and Zolfaghari [11] propose two variations of simulated annealing, tested on minimum makespan job shop scheduling problems. In conventional simulated annealing, the temperature declines monotonically, providing the search with a higher transition probability at the beginning of the search and a lower probability toward the end of the search. In [11], an adaptive temperature control scheme with a tabu list is used that changes the temperature based on the number of consecutive improving moves, resulting in improved algorithm performance.

In [69], a sample adaptive simulated annealing algorithm, inspired by the idea of the Metropolis algorithm, is constructed on a finite state space. The algorithm can be viewed as a substitute for the annealing of iterative stochastic schemes. In [108], the optimal cooling schedule for simulated annealing is formulated to derive a differential equation for the time-dependent temperature T(t). Based on this equation, the long-term behavior of T(t), entropy production, and the Kullback–Leibler entropy are studied. For some simple examples, such as a many-level system and the small-scale traveling salesman problem, the explicit time dependence of the temperature is obtained.

1.4.3 Domains—Types of Problems with Examples

Over the past decade, simulated annealing has developed into a popular optimization tool. It has been used to address numerous discrete optimization problems as well as continuous variable problems. Several application articles and surveys have been published on simulated annealing. Johnson et al. [86, 87] present a series of articles on simulated annealing applied to certain well-studied discrete optimization problems. The first in the series of articles uses the graph partitioning problem to illustrate simulated annealing and highlight the effectiveness of several modifications to the basic simulated annealing algorithm. The second in the series focuses on applying lessons learned from the first article to the graph coloring and number partitioning problems. Local optimization techniques were previously thought to be unacceptable approaches to these two problems. In [87], it is also observed that for long run lengths, simulated annealing outperforms the traditional techniques used to solve graph coloring problems. However, simulated annealing did not compare well with traditional techniques on the number partitioning problem except for small problem instances. The third article in the series (not yet published) uses simulated annealing to approach the well-known traveling salesman problem.

Koulamas et al. [90] focus on simulated annealing applied to applications in production/operations management and operations research. They discuss traditional problems such as single machine, flow shop and job shop scheduling, lot sizing, and traveling salesman problems as well as non-traditional problems to include graph coloring and number partitioning. They conclude that simulated annealing is an effective tool for solving many problems in operations research and that the degree of accuracy that the algorithm achieves can be controlled by the practitioner, in terms of number of iterations and neighborhood functions (i.e., an increased number of iterations (outer loops) combined with an increased number of searches at each iteration (inner loops) can result in solutions with a higher probability of converging to the optimal solution). In [55], Fleischer discusses simulated annealing from a historical and evolutionary point of view in terms of solving difficult optimization problems. Fleischer also summarizes ongoing research and presents an application of simulated annealing to graph problems.

Steinhofel et al. [134] also address flow shop scheduling problems by applying logarithmic cooling schedules of simulated annealing-based algorithms to flow shop scheduling. In the considered problem setting, the objective is to minimize the overall completion time (called the makespan). A lower bound is derived for the number of steps that are needed to approach an optimum solution with a certain probability, based on the maximum escape depth Γ from local minima of the underlying energy landscape.

The simulated annealing algorithm has proved to be a good technique for solving difficult discrete optimization problems. In engineering optimization, simulated annealing has emerged as an alternative tool to address problems that are difficult to solve by conventional mathematical programming techniques. The algorithm's major disadvantage is that solving a complex system problem may be an extremely slow, albeit convergent process, using much more processor time than conventional algorithms.
Consequently, simulated annealing has not been widely embraced as an optimization algorithm for engineering problems. Attempts have been made to improve the performance of the algorithm either by reducing the annealing length or changing the generation and the acceptance mechanisms. However, these faster schemes, in general, do not inherit the property of escaping local minima. A more efficient way to reduce the processor time and make simulated annealing a more attractive alternative for engineering problems is to add parallelism (see [73]). However, the implementation and efficiency of parallel simulated annealing algorithms are typically problem dependent. Leite et al. [91] consider the evaluation of parallel schemes for engineering problems where the solution spaces may be very complex and highly constrained, with function evaluations varying from medium to high cost. In addition, they provide guidelines for selecting appropriate schemes for engineering problems. They also present an engineering problem with relatively low fitness evaluation cost and strong time constraints to demonstrate the lower bounds of applicability of parallel schemes.

Many signal processing applications create optimization problems with multimodal and non-smooth cost functions. Gradient methods are ineffective in these situations because of multiple local minima and the requirement to compute gradients. Chen and Luk [27] propose an adaptive simulated annealing algorithm as a viable optimization tool for addressing such difficult non-linear optimization problems. The adaptive simulated annealing algorithm maintains the advantages of simulated annealing, but converges faster. Chen and Luk demonstrate the effectiveness of adaptive simulated annealing with three signal processing applications: maximum likelihood joint channel and data estimation, infinite-impulse-response filter design, and evaluation of a minimum symbol-error-rate decision feedback equalizer. They conclude that the adaptive simulated annealing algorithm is a powerful global optimization tool for solving such signal processing problems.

Abramson et al. [4] describe the use of simulated annealing for solving the school timetabling problem. They use the scheduling problem to highlight the performance of six different cooling schedules: the basic geometric cooling schedule, a scheme that uses multiple cooling rates, geometric reheating, enhanced geometric reheating, non-monotonic cooling, and reheating as a function of cost. The basic geometric cooling schedule found in [156] is used as the baseline schedule for comparison purposes. Experimental results suggest that using multiple cooling rates for a given problem yields better quality solutions in less time than the solutions produced by a single cooling schedule. The conclusion in [4] is that the cooling scheme that uses the phase transition temperature (i.e., when sub-parts of the combinatorial optimization problem are solved) in combination with the best solution to date produces the best results.

Emden-Weinert and Proksch [47] present a study of a simulated annealing algorithm for the flight pairing subproblem in crew scheduling, which models the matching of flight segments as a preliminary phase to crew rostering. It is revealed that the algorithm run-time can be decreased and solution quality can be improved by using a problem-specific initial solution, relaxing constraints, combining simulated annealing with a problem-specific local improvement heuristic, and conducting multiple independent runs.
There is no question that simulated annealing can demand significant computational time to reach global minima. Recent attempts to use parallel computing schemes to speed up simulated annealing have provided promising results. Chu et al. [32] present a new, efficient, and highly general-purpose parallel optimization method based on simulated annealing that does not depend on the structure of the optimization problem being addressed. Their algorithm is used to analyze a network of interacting genes that control embryonic development and other fundamental biological processes. They use a two-stage procedure which monitors and pools performance statistics obtained simultaneously from all processors and then mixes states at intervals to maintain a Boltzmann-like distribution of costs. They demonstrate that their parallel simulated annealing approach leads to nearly optimal parallel efficiency for certain optimization problems. In particular, the approach is appropriate when the relative effort required to compute the cost function is large compared to the relative communication effort between parallel machines for pooling statistics and mixing states.

Chen et al. [26] implement five variants of the simulated annealing algorithm, from sequential to parallel forms, on high-performance computers and apply them to a set of standard function optimization problems in order to test their performance. The experimental results indicate that the traditional approach to parallelizing simulated annealing, namely executing algorithm runs simultaneously on multiple communicating processors, does not enjoy much success in solving hard problem instances. A divide-and-conquer decomposition strategy used to traverse the search space sometimes finds the global optimum function value, but frequently results in high computing times as the problem size increases. A hybrid version of a genetic algorithm combined with simulated annealing has proven to be most efficient.

Alrefaei and Andradottir [7] present a modified simulated annealing algorithm with a constant temperature to address discrete optimization problems and use two approaches to estimate an optimal solution to the problem. One approach estimates the optimal solution based on the state most visited versus the state last visited, while the other approach uses the best average estimated objective function value to estimate the optimal solution. Both approaches are guaranteed to converge almost surely to the set of global optimal solutions under mild conditions. They compare the performance of the modified simulated annealing algorithm to other forms of simulated annealing used to solve discrete optimization problems.

Creating effective neighborhood functions or neighborhood generation mechanisms is a critical element in designing efficient and effective simulated annealing algorithms for discrete optimization problems. Tian et al. [147] investigate the application of simulated annealing to discrete optimization problems with a permutation property, such as the traveling salesman problem, the flow shop scheduling problem, and the quadratic assignment problem. They focus on the neighborhood function of the discrete optimization problem and, in particular, the generation mechanism for the algorithm used to address the problem.
They introduce six types of perturbation schemes for generating random permutation solutions and prove that each scheme satisfies asymptotic convergence requirements. The results of the experimental evaluations on the traveling salesman problem, the flow shop scheduling problem, and the quadratic assignment problem suggest that the efficiencies of the perturbation schemes are different for each problem type and solution space. Tian et al. conclude that with the proper perturbation scheme, simulated annealing can produce efficient solutions to different discrete optimization problems that possess a permutation property.

In [5], Ahmed proposes a modification of the simulated annealing algorithm for solving discrete problems where the objective function is stochastic and can be evaluated only through Monte Carlo simulation (i.e., the objective function cannot be computed exactly or such an evaluation is computationally expensive). In this modification, the temperature is held constant, and the Metropolis criterion depends on whether the objective function values indicate a statistically significant difference at each iteration. The obtained algorithm compares favorably with three previously proposed methods (see [6, 63, 71]).

Research also continues on the application of simulated annealing to the optimization of continuous functions. Continuous global optimization is defined as the problem of finding points on a bounded subset of ℜn where some real-valued function f assumes its optimal (maximal or minimal) value. The application of simulated annealing to continuous optimization generally falls into two classes. The first approach closely follows the original idea presented by Kirkpatrick et al. [89], where the algorithm mimics the physical annealing process. The second approach describes the annealing process with Langevin equations, where the global minimum is found by solving a set of stochastic differential equations (see [9]). Gemen and Hwang [64] prove that continuous optimization algorithms based on Langevin equations converge to the global optima. Dekkers and Aarts [40] propose a third stochastic approach to address global optimization based on simulated annealing, which is similar to the formulation of simulated annealing applied to discrete optimization problems. They extend the mathematical formulation of simulated annealing to continuous optimization problems and prove asymptotic convergence to the set of global optima based on the equilibrium distribution of Markov chains. They also discuss an implementation of the proposed algorithm and compare its performance with other well-known algorithms on a standard set of test functions from the literature.

Tsallis and Stariolo [149] discuss and illustrate a new stochastic algorithm (generalized simulated annealing) for computationally finding the global minimum of a given (not necessarily convex) energy/cost function defined on a continuous multidimensional space. This algorithm covers, as particular cases, the so-called classical (Boltzmann machine) and fast (Cauchy machine) simulated annealings and turns out to be quicker than both. This method, which has been widely used in many fields as a global optimization tool, is composed of three parts: the visiting distribution, the accepting rule, and the cooling schedule. The most complicated of these is the visiting distribution.
Although Tsallis and Stariolo [149] did provide a heuristic algorithm to generate a random number for the visiting distribution, empirical simulations have shown that it is inappropriate. Deng et al. [43] propose an alternative method of generating random numbers based on the results from [99]. Nishimori and Inoue [109] prove the weak ergodicity of the inhomogeneous Markov process generated by the generalized transition probability of Tsallis and Stariolo [149] under power-law decay of the temperature. Del Moral and Miclo [42] study the convergence of the generalized simulated annealing with time-inhomogeneous communication cost functions. This study is based on the use of log-Sobolev inequalities and semigroup techniques in the spirit of a previous article by one of the authors.

1.5 Summary

Simulated annealing optimization algorithms have been well studied in the literature. These algorithms can be used to address single- as well as multi-objective optimization problems. They have been applied in various fields like process system engineering, operations research, and smart materials. Recent work on simulated annealing primarily involves ad hoc techniques, adaptive cooling schedules, and the development of hybrid algorithms.

The popularity and flexibility of simulated annealing have spawned several new annealing algorithms. Pepper et al. [117] introduce demon algorithms and test them on the traveling salesman problem. Ohlmann et al. [114] introduce another variant of simulated annealing termed compressed annealing. They incorporate the concepts of pressure and volume, in addition to temperature, to address discrete optimization problems with relaxed constraints. They also introduce a primal/dual metaheuristic by simultaneously adjusting temperature and pressure in the algorithm. In [76], Herault presents rescaled simulated annealing, which is designed for combinatorial problems where the available computational effort is limited. This generalization performs the rescaling of the energies of the states that are candidates for a transition, before applying the Metropolis criterion. A direct consequence of this rescaling is an acceleration of the algorithm's convergence, by avoiding dives and escaping from high-energy local minima. Mingjun and Huanwen [103] propose chaos simulated annealing with chaotic initialization and chaotic sequences replacing the Gaussian distribution. These features improve the rate of convergence and are efficient and easy to implement.

Once a very popular approach to solving hard combinatorial problems, simulated annealing is now taking a backseat and giving way to new algorithms and heuristics designed to better exploit the unique properties and features of problems. However, simulated annealing continues to be widely used, given its simplicity and ease of implementation. Moreover, its simple structure is often incorporated and blended with other metaheuristics. It also remains one of the most analyzed metaheuristics, which underlines the importance and usefulness of its basic idea.

Acknowledgments This work is supported in part by the Air Force Office of Scientific Research (FA9550-07-1-0232). The authors wish to thank the anonymous referees for their feedback on this chapter.

References

1. Aarts, E.H.L., Korst, J.: Simulated Annealing and Boltzmann Machines: A Stochastic Approach to Combinatorial Optimization and Neural Computing. Wiley, Chichester (1989) 2. Aarts, E.H.L., Lenstra, J.K.: Local Search in Combinatorial Optimization. Wiley, Chichester (1997) 3. Aarts, E.H.L., van Laarhoven, P.J.M.: Statistical cooling: A general approach to combinato- rial optimization problems. Phillips J. Res. 40, 193–226 (1985) 4. Abramson, D., Krishnamoorthy, M., Dang, H.: Simulated annealing cooling schedules for the school timetabling problem. Asia-Pac. J. Oper. Res. 16, 1–22 (1999) 5. Ahmed, M.A.: A modification of the simulated annealing algorithm for discrete stochastic optimization. Eng. Optim. 39(6), 701–714 (2007) 6. Alkhamis, T.M., Ahmed, M.A., Tuan, V.K.: Simulated annealing for discrete optimization with estimation. Eur. J. Oper. Res. 116, 530–544 (1999) 7. Alrefaei, M.H., Andradottir, S.: A simulated annealing algorithm with constant temperature for discrete stochastic optimization. Manage. Sci. 45, 748–764 (1999) 8. Althofer, I., Koschnick, K.U.: On the convergence of threshold accepting. Appl. Math. Optim. 24, 183–195 (1991) 9. Aluffi-Pentini, F., Parisi, V., Zirilli, F.: Global optimization and stochastic differential equa- tions. J. Optim. Theory Appl. 47, 1–16 (1985) 10. Anily, S., Federgruen, A.: Simulated annealing methods with general acceptance probabili- ties. J. Appl. Probab. 24, 657–667 (1987) 11. Azizi, N., Zolfaghari, S.: Adaptive temperature control for simulated annealing: A compar- ative study. Comput. Oper. Res. 31(14), 2439–2451 (2004) 12. Belisle, C.J.P.: Convergence theorems for a class of simulated annealing algorithms on RD. J. Appl. Probab. 29, 885–895 (1992) 13. Belisle, C.J.P., Romeijn, H.E., Smith, R.L.: Hit-and-run algorithms for generating multivari- ate distributions. Math. Oper. Res. 18, 255–266 (1993) 14. Ben-Ameur, W.: Computing the initial temperature of simulated annealing. Comput. Optim. Appl. 29, 369–385 (2004) 15. Bohachevsky, I.O., Johnson, M.E., Stein, M.L.: Generalized simulated annealing for func- tion optimization. Technometrics 28, 209–217 (1986) 16. Borkar, V.S.: Pathwise recurrence orders and simulated annealing. J. Appl. Probab. 29, 472–476 (1992) 17. Bouffard, V., Ferland, J.: Improving simulated annealing with variable neighborhood search to solve resource-constrained scheduling problem. J. Sched. 10, 375–386 (2007) 18. Bratley, P., Fox, B.L., Schrage, L.: A guide to simulation. Springer, New York, NY (1987) 19. Cardoso, M.F., Salcedo, R.L., de Azevedo, S.F.: Nonequilibrium simulated annealing: A faster approach to combinatorial minimization. Ind. Eng. Chem. Res. 33, 1908–1918 (1994) 20. Catoni, O.: Metropolis, simulated annealing, and Iterated energy transformation algorithms: Theory and experiments. J. Complex. 12, 595–623 (1996) 21. Cerf, R.: Asymptotic convergence of genetic algorithms. Adv. Appl. Probab. 30, 521–550 (1998) 22. Chardaire, P., Lutton, J.L., Sutter, A.: Thermostatistical persistency: A powerful improving concept for simulated annealing algorithms. Eur. J. Oper. Res. 86, 565–579 (1995) 23. Charon, I., Hudry, O.: The noising method - a new method for combinatorial optimization. Oper. Res. Lett. 14, 133–137 (1993) 24. Charon, I., Hudry, O.: The Noising Methods - a generalization of some metaheuristics. Eur. J. Oper. Res. 135, 86–101 (2001) 25. Cheh, K.M., Goldberg, J.B., Askin, R.G.: A note on the effect of neighborhood-structure in simulated annealing. Comput. Oper. Res. 18, 537–547 (1991) 26. 
Chen, D., Lee, C., Park, C., Mendes, P.: Parallelizing simulated annealing algorithms based on high-performance computer. J. Global Optim. 39, 261–289 (2007)

27. Chen, S., Luk, B.L.: Adaptive simulated annealing for optimization in signal processing applications. Signal Process. 79, 117–128 (1999) 28. Chiang, T.S., Chow, Y.S.: On the convergence rate of annealing processes. SIAM J. Control Optim. 26, 1455–1470 (1988) 29. Chiang, T.S., Chow, Y.Y.: A limit-theorem for a class of inhomogeneous markov-processes. Ann. Probab. 17, 1483–1502 (1989) 30. Chiang, T.S., Chow, Y.Y.: The asymptotic-behavior of simulated annealing processes with absorption. SIAM J. Control. Optim. 32, 1247–1265 (1994) 31. Christoph, M., Hoffmann, K.H.: Scaling behavior of optimal simulated annealing schedules. J. Phys. A - Math. Gen. 26, 3267–3277 (1993) 32. Chu, K.W., Deng, Y.F., Reinitz, J.: Parallel simulated annealing by mixing of states. J. Com- put. Phys. 148, 646–662 (1999) 33. Cinlar, E.: Introduction to Stochastic Processes. Prentice-Hall, Englewood Cliffs, NJ (1974) 34. Cohn, H., Fielding, M.: Simulated annealing: Searching for an optimal temperature sched- ule. SIAM J. Optim. 9, 779–802 (1999) 35. Connors, D.P., Kumar, P.R.: Simulated annealing type markov-chains and their order balance-equations. SIAM J. Control. Optim. 27, 1440–1461 (1989) 36. Czyzak, P., Hapke, M., Jaszkiewicz, A.: Application of the Pareto-Simulated Annealing to the Multiple Criteria Shortest Path Problem, Technical Report, Politechnika Poznanska Instytut Informatyki, Poland (1994) 37. Czyzak, P., Jaszkiewicz, A.: Pareto simulated annealing a metaheuristic technique for multiple-objective combinatorial optimization. J. Multicriteria Decis. Anal. 7, 34–47 (1998) 38. Davis, T.E.: Toward an extrapolation of the simulated annealing convergence theory onto the simple genetic algorithm. Doctoral Dissertation, University of Florida, Gainesville, FL (1991) 39. Davis, T.E., Principe, J.C.: A simulated annealing like convergence theory for the simple ge- netic algorithm. Proceedings of the Fourth International Conference on Genetic Algorithms in San Diego, CA, pp. 174–181. Morgan Kaufmann, San Francisco, CA (1991) 40. Dekkers, A., Aarts, E.: Global Optimization and Simulated Annealing, Math. Program. 50, 367–393 (1991) 41. Delport, V.: Parallel simulated annealing and evolutionary selection for combinatorial opti- misation. Electron. Lett. 34, 758–759 (1998) 42. Del Moral, P., Miclo, L.: On the convergence and applications of generalized simulated annealing. SIAM J. Control. Optim. 37(4), 1222–1250 (1999) 43. Deng, J., Chen, H., Chang, C., Yang, Z.: A superior random number generator for visiting distribution in GSA. Int. J. Comput. Math. 81(1), 103–120 (2004) 44. Desai, M.P.: Some results characterizing the finite time behaviour of the simulated annealing algorithm. Sadhana-Acad. Proc. Eng. Sci. 24, 317–337 (1999) 45. Dueck, G., Scheuer, T.: Threshold accepting - a general-purpose optimization algorithm appearing superior to simulated annealing. J. Comput. Phys. 90, 161–175 (1990) 46. Eglese, R.W.: Simulated annealing: A tool for operational research. Eur. J. Oper. Res. 46, 271–281 (1990) 47. Emden-Weinert, T., Proksch, M.: Best practice simulated annealing for the airline crew scheduling problem. J. Heuristics 5, 419–436 (1999) 48. Fabian, V.: Simulated annealing simulated. Comput. Math. Appl. 33, 81–94 (1997) 49. Faigle, U., Kern, W.: Note on the convergence of simulated annealing algorithms. SIAM J. Control. Optim. 29, 153–159 (1991) 50. Faigle, U., Kern, W.: Some convergence results for probabilistic tabu search. ORSA J. Com- put. 4, 32–37 (1992) 51. 
Faigle, U., Schrader, R.: On the convergence of stationary distributions in simulated annealing algorithms. Inf. Process. Lett. 27, 189–194 (1988) 52. Faigle, U., Schrader, R.: Simulated annealing - a case-study. Angew. Inform. 30(6), 259–263 (1988)

53. Fielding, M.: Simulated annealing with an optimal fixed temperature. SIAM J. Optim. 11, 289–307 (2000) 54. Fleischer, M.A.: Assessing the performance of the simulated annealing algorithm using information theory. Doctoral Dissertation, Department of Operations Research, Case Western Reserve University, Clevelend, Ohio (1993) 55. Fleischer, M.A.: Simulated annealing: Past, present, and future. In: Alexopoulos, C., Kang, K., Lilegdon, W.R., Goldsman, D., (eds.) Proceedings of the 1995 Winter Simula- tion Conference, pp. 155–161. IEEE Press, Arlington, Virginia (1995) 56. Fleischer, M.A.: Generalized cybernetic optimization: Solving continuous variable prob- lems, In: Voss, S., Martello, S., Roucairol, C., Ibrahim, H., Osman, I.H., (eds.) Meta- heuristics: Advances and Trends in Local Search Paradigms for Optimization, pp. 403–418. Kluwer (1999) 57. Fleischer, M.A., Jacobson, S.H.: Cybernetic optimization by simulated annealing: An implementation of parallel processing using probabilistic feedback control, In: Osman, I.H., Kelly, J.P., (eds.) Meta-heuristics: Theory and applications, pp. 249–264. Kluwer (1996) 58. Fleischer, M.A., Jacobson, S.H.: Information theory and the finite-time behavior of the sim- ulated annealing algorithm: Experimental results. INFORMS J. Comput. 11, 35–43 (1999) 59. Fox, B.L.: Integrating and accelerating tabu search, simulated annealing, and genetic algo- rithms. Ann. Oper. Res. 41, 47–67 (1993) 60. Fox, B.L.: Random restarting versus simulated annealing. Comput. Math. Appl. 27, 33–35 (1994) 61. Fox, B.L.: Faster Simulated Annealing. SIAM J. Optim. 5, 485–505 (1995) 62. Fox, B.L., Heine, G.W.: Simulated Annealing with Overrides, Technical, Department of Mathematics, University of Colorado, Denver, Colorado (1993) 63. Gelfand, S.B., Mitter, S.K.: Simulated annealing with noisy or imprecise energy measure- ments. J. Optim. Theory. Appl. 62, 49–62 (1989) 64. Gemen, S., Hwang, C.R.: Diffusions for global optimization. SIAM J. Control. Optim. 24, 1031–1043 (1986) 65. Gidas, B.: Nonstationary Markov Chains and Convergence of the Annealing Algorithm, J. Stat. Phys. 39, 73–131 (1985) 66. Glover, F.: Tabu search for nonlinear and parametric optimization (with Links to Genetic Algorithms). Discrete Appl. Math. 49, 231–255 (1994) 67. Glover, F., Hanafi, S.: Tabu search and finite convergence. Discrete Appl. Math. 119(1–2), 3–36 (2002) 68. Goldstein, L., Waterman, M.: Neighborhood size in the simulated annealing algorithm. Am. J. Math. Manage. Sci. 8, 409–423 (1988) 69. Gong, G., Liu, Y., Quin, M.: An adaptive simulated annealing algorithm. Stoch. Processes. Appl. 94, 95–103 (2001) 70. Granville, V., Krivanek, M., Rasson, J.P.: Simulated annealing - a proof of convergence. IEEE Trans. Pattern Anal. Mach. Intell. 16, 652–656 (1994) 71. Gutjahr, W.J., Pflug, G.C.: Simulated annealing for noisy cost functions. J. Global Optim. 8, 1–13 (1996) 72. Hajek, B.: Cooling schedules for optimal annealing. Math. Oper. Res. 13, 311–329 (1988) 73. Hamma, B., Viitanen, S., Torn, A.: Parallel continuous simulated annealing for global opti- mization. Optim. Methods. Softw. 13, 95–116 (2000) 74. Hammersley, J.M., Handscomb, D.C.: Monte Carlo Methods, Methuen, Wiley, London, New York (1964) 75. Henderson, D., Jacobson, S.H., Johnson, A.W.: Handbook of Metaheuristics, Kluwer, Boston, MA (2003) 76. Herault, L.: Rescaled simulated annealing - accelerating convergence of simulated annealing by rescaling the state energies. J. Heuristics, 6, 215–252 (2000) 77. 
Hu, T.C., Kahing, A.B., Tsao, C.W.A.: Old bachelor acceptance: A new class of non-monotone threshold accepting methods. ORSA J. Comput. 7, 417–425 (1995)

78. Isaacson, D.L., Madsen, R.W.: Markov Chains, Theory and Applications, Wiley, New York (1976) 79. Jacobson, S.H.: Analyzing the performance of local search algorithms using generalized hill climbing algorithms. In: Hansen, P., Ribeiro C.C. (eds.) Chapter 20 in Essays and Surveys on Metaheuristics, pp. 441–467. Kluwer, Norwell, MA (2002) 80. Jacobson, S.H., Sullivan, K.A., Johnson, A.W.: Discrete manufacturing process design opti- mization using computer simulation and generalized hill climbing algorithms. Eng. Optim. 31, 247–260 (1998) 81. Jacobson, S.H., Yucesan, E.: Global optimization performance measures for generalized hill climbing algorithms. J. Global Optim. 29(2), 173–190 (2004) 82. Jacobson, S.H., Yucesan, E.: Analyzing the performance of generalized hill climbing algo- rithms. J. Heuristics 10(4), 387–405 (2004) 83. Jacobson, S.H., Hall, S.N., McLay, L.A., Orosz, J.E.: Performance analysis of cyclical sim- ulated annealing algorithms. Methodol. Comput. Appl. Probab. 7, 183–201 (2005) 84. Johnson, A.W., Jacobson, S.H.: A class of convergent generalized hill climbing algorithms. Appl. Math. Comput. 125(2–3), 359–373 (2002a) 85. Johnson, A.W., Jacobson, S.H.: On the convergence of generalized hill climbing algorithms. Discrete Appl. Math., 119(1–2), 37–57 (2002b) 86. Johnson, D.S., Aragon, C.R., McGeoch, L.A., Schevon, C.: Optimization by simulated an- nealing - an experimental evaluation; Part 1, graph partitioning. Oper. Res., 37, 865–892 (1989) 87. Johnson, D.S., Aragon, C.R., McGeoch, L.A., Schevon, C.: Optimization by simulated an- nealing - an experimental evaluation; Part 2, graph-coloring and number partitioning. Ope. Res., 39, 378–406 (1991) 88. Kiatsupaibul, S., Smith, R.L.: A General Purpose Simulated Annealing Algorithm for Integer , Technical Report, Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan (2000) 89. Kirkpatrick, S., Gelatt, Jr., C.D., Vecchi, M.P.: Optimization by simulated annealing. Science, 220, 671–680 (1983) 90. Koulamas, C., Antony, S.R., Jaen, R.: A survey of simulated annealing applications to operations- research problems. OMEGA-Int. J. Manage. Sci. 22, 41–56 (1994) 91. Leite, J.P.B., Topping, B.H.V.: Parallel simulated annealing for structural optimization. Comput. Struct. 73, 545–564 (1999) 92. Liepins, G.E., Hilliard, M.R.: Genetic algorithms: Foundations and applications. Ann. Oper. Res. 21, 31–58 (1989) 93. Lin, C.K.Y., Haley, K.B., Sparks, C.: A comparative study of both standard and adaptive ver- sions of threshold accepting and simulated annealing algorithms in three scheduling prob- lems. Eur. J. Oper. Res. 83, 330–346 (1995) 94. Locatelli, M.: Convergence properties of simulated annealing for continuous global opti- mization. J. Appl. Probab. 33, 1127–1140 (1996) 95. Locatelli, M.: Simulated annealing algorithms for continuous global optimization: Conver- gence conditions. J. Optim. Theory. Appl. 104, 121–133 (2000) 96. Locatelli, M.: Convergence and first hitting time of simulated annealing algorithms for con- tinuous global optimization. Math. Methods. Oper. Res. 54, 171–199 (2001) 97. Lundy, M., Mees, A.: Convergence of an annealing algorithm. Math. Program. 34, 111–124 (1986) 98. Ma, J., Straub, J.E.: Simulated annealing using the classical density distribution. J. Chem. Phy. 101, 533–541 (1994) 99. Mantegna, R.N.: Fast, accurate algorithm for numerical simulation of Levy stable stochastic processes. Phys. Rev. E, 49(5), 4677–4683 (1994) 100. 
Mazza, C.: Parallel simulated annealing, Random Struct. Algorithms, 3, 139–148 (1992) 101. Metropolis, N., Rosenbluth, A., Rosenbluth, M., Teller, A., Teller, E.: Equation of state calculations by fast computing machines. J. Chem. Phys., 21, 1087–1092 (1953)

102. Meyer, C.D.: The condition of a finite markov chain and perturbation bounds for the limiting probabilities. SIAM J. Algebraic. Discrete Methods 1, 273–283 (1980) 103. Mingjun, J., Huanwen, T.: Application of chaos in simulated annealing. Chaos, Solitions. Fractals 21, 933–941 (2003) 104. Mitra, D., Romeo, F., Sangiovanni-Vincentelli, A.L.: Convergence and finite time behavior of simulated annealing. Adv. Appl. Probab. 18, 747–771 (1986) 105. Moscato, P.: An introduction to population approaches for optimization and hierarchical objective functions: A discussion on the role of tabu search. Ann. Oper. Res. 41, 85–121 (1993) 106. Moscato, P., Fontanari, J.F.: Convergence and finite-time behavior of simulated annealing. Adv. Appl. Probab. 18, 747–771 (1990) 107. Muhlenbein, H.: Genetic algorithms, In: Aarts, E., Lenstra, J.K., (eds.) Local search in com- binatorial optimization, pp. 137–172. Wiley, New York, NY (1997) 108. Munakata, T., Nakamura, Y.: Temperature control for simulated annealing. Phys. Rev. E Stat. Nonlin. and Soft Matter Phys. 64(4II), 461271–461275 (2001) 109. Nishimori, H., Inoue, J.: Convergence of simulated annealing using the generalized transi- tion probability. J. Phys. A, 31, 5661–5672 (1998) 110. Nissen, V., Paul, H.: A modification of threshold accepting and its application to the quadratic assignment problem. OR Spektrum 17, 205–210 (1995) 111. Nolte, A., Schrader R.: A note on finite time behavior of simulated annealing. Math. Oper. Res. 25(3), 476–484 (2000) 112. Nourani, Y., Andresen, B.: A comparison of simulated annealing cooling strategies. J. Phys. A-Math. Gen. 31, 8373–8385 (1998) 113. Ogbu, F.A., Smith, D.K.: The application of the simulated annealing algorithm to the solution of the N/M/Cmax flowshop problem. Comput. Oper. Res. 17, 243–253 (1990) 114. Ohlmann, J.W., Bean, J.C., Henderson, S.G.: Convergence in probability of compressed annealing. Math. Oper. Res. 29(4), 837–860 (2004) 115. Orosz, J.E., Jacobson, S.H.: Finite-time performance analysis of static simulated annealing algorithms. Comput. Optim. Appl. 21, 21–53 (2002a) 116. Orosz, J.E., Jacobson, S.H.: Analysis of static simulated annealing algorithms. J. Optim. Theory. Appl. 115(1), 165–182 (2002b) 117. Pepper, J.W., Golden, B.L., Wasil, E.A.: Solving the traveling salesman problem with annealing-based Heuristics: A computational study. IEEE Trans. Syst. Manufacturing and Cybernetics, Part A: Syst. Humans, 32(1), 72–77 (2002) 118. Rajasekaran, S.: On simulated annealing and nested annealing. J. Global Optim. 16, 43–56 (2000) 119. Romeijn, H.E., Zabinsky, Z.B., Graesser, D.L., Noegi, S.: New reflection generator for sim- ulated annealing in mixed-integer/continuous global optimization. J. Optim. Theory. Appl. 101, 403–427 (1999) 120. Romeo, F., Sangiovanni-Vincentelli, A.: A theoretical framework for simulated annealing. Algorithmica 6, 302–345 (1991) 121. Rosenthal, J.S.: Convergence rates for markov chains. SIAM Rev. 37, 387–405 (1995) 122. Ross, S.M.: Stochastic processes, J Wiley, New York, NY (1996) 123. Rossier, Y., Troyon, M., Liebling, T.M.: Probabilistic exchange algorithms and euclidean traveling salesman problems. OR Spektrum 8, 151–164 (1986) 124. Rudolph, G.: Convergence analysis of cononical genetic algorithms. IEEE Trans. Neural Net. Special Issue on Evolutional Computing, 5, 96–101 (1994) 125. Scheermesser, T., Bryngdahl, O.: Threshold accepting for constrained half-toning. Opt. Commun. 115, 13–18 (1995) 126. 
Schuur, P.C.: Classification of acceptance criteria for the simulated annealing algorithm. Math. Oper. Res. 22, 266–275 (1997) 127. Seneta, E.: Non-Negative Matrices and Markov Chains, Springer, New York, NY (1981)

128. Serafini, P.: Mathematics of Multiobjective Optimization, p. 289. CISM Courses and Lectures, Springer, Berlin (1985) 129. Serafini, P.: Simulated Annealing for Multiple Objective Optimization Problems, Proceedings of the Tenth International Conference on Multiple Criteria Decision Making, pp. 87–96, Taipei (1992) 130. Serafini, P.: Simulated Annealing for Multiple Objective Optimization Problems, Multiple Criteria Decision Making. Expand and Enrich the Domains of Thinking and Application pp. 283–292, Springer, Berlin, (1994) 131. Siarry, P., Berthiau, G., Durbin, F., Haussy, J.: Enhanced simulated annealing for globally minimizing functions of many-continuous variables. ACM Trans. Math. Softw. 23, 209–228 (1997) 132. Solla, S.A., Sorkin, G.B., White, S.R.: Configuration space analysis for optimization prob- lems. In: Bienenstock, E., Fogelmansoulie, F., Weisbuch, G. (eds.) Disordered Systems and Biological Organization, pp. 283–292. Springer, New York (1986) 133. Srichander, R.: Efficient schedules for simulated annealing. Eng. Optim. 24, 161–176 (1995) 134. Steinhofel, K., Albrecht, A., Wong, C.K.: The convergence of stochastic algorithms solving flow shop scheduling. Theor. Comput. Sci. 285, 101–117 (2002) 135. Stern, J.M.: Simulated annealing with a temperature dependent penalty function, ORSA J. Comput. 4, 311–319 (1992) 136. Storer, R.H., Wu, S.D., Vaccari, R.: New Search Spaces for Sequencing Problems with Application to Job Shop Scheduling, Manage. Sci. 38, 1495–1509 (1992) 137. Straub, J.E., Ma, J., Amara, P.: Simulated annealing using coarse grained classical dynam- ics: Smouuchowski Dynamics in the Gaussian Density Approximation. J. Chem. Phys. 103, 1574–1581 (1995) 138. Strenski, P.N., Kirkpatrick, S.: Analysis of finite length annealing schedules. Algorithmica, 6, 346–366 (1991) 139. Sullivan, K.A., Jacobson, S.H.: Ordinal hill climbing algorithms for discrete manufacturing process design optimization problems. Discrete Event Dyn. Syst. 10, 307–324 (2000) 140. Sullivan, K.A., Jacobson, S.H.: A convergence analysis of generalized hill climbing algo- rithms. IEEE Trans. Automatic Control 46, 1288–1293 (2001) 141. Suman, B.: Multiobjective simulated annealing a metaheuristic technique for multiobjective optimization of a constrained problem. Found. Comput. Decis. Sci., 27, 171–191 (2002) 142. Suman, B.: Simulated annealing based multiobjective algorithm and their application for system reliability. Eng. Optim., 35, 391–416 (2003) 143. Suman, B.: Self-stopping PDMOSA and performance measure in simulated annealing based multiobjective optimization algorithms. Comput. Chem. Eng. 29, 1131–1147 (2005) 144. Suman, B., Kumar, P.: A survey of simulated annealing as a tool for single and multiobjective optimization. J. Oper. Res. Soc. 57, 1143–1160 (2006) 145. Suppapitnarm, A., Parks, T.: Simulated Annealing: An Alternative Approach to True Mul- tiobjective Optimization, Genetic and Evolutionary Computation Conference, Conference Workshop Program pp. 406–407, Orlando, FL (1999) 146. Tekinalp, O., Karsli, G.: A new multiobjective simulated annealing algorithm. J. Global Optim. 39, 49–77 (2007) 147. Tian, P., Ma, J., Zhang, D.M.: Application of the simulated annealing algorithm to the com- binatorial optimisation problem with permutation property: An investigation of generation mechanism. Eur. J. Oper. Res. 118, 81–94 (1999) 148. Tovey, C.A.: Simulated simulated annealing. Am. J. Math. Manage. Sci., 8, 389–407 (1988) 149. Tsallis, C., Stariolo, D.A.: Generalized simulated annealing. 
Physica A, 233, 395–406 (1996) 150. Tsitsiklis, J.N.: Markov chains with rare transitions and simulated annealing. Math. Oper. Res. 14, 70–90 (1989) 151. Triki, E., Collette, Y., Siarry, P.: A theoretical study on the behavior of simulated annealing leading to a new cooling schedule. Eur. J. Oper. Res. 166, 77–92 (2005)

152. Ulungu, L.E., Teghem, J.: Multiobjective combinatorial optimization problems: A survey. J. Multicriteria Decis. Anal. 3, 83–104 (1994) 153. Ulungu, L.E., Teghem, J., Ost, C.: Interactive simulated annealing in a multiobjective frame- work: Application to an industrial problem. J. Oper. Res. Soc. 49, 1044–1050 (1998) 154. Ulungu, L.E., Teghem, J., Fortemps, P.H., Tuyttens, D.: MOSA method: A tool for solv- ing multiobjective combinatorial optimization problems. J. Multicriteria Decis. Anal., 8, 221–236 (1999) 155. van Laarhoven, P.J.M.: Theoretical and Computational Aspects of Simulated Annealing, Centrum voor Wiskunde en Informatica, Amsterdam, Netherlands (1988) 156. van Laarhoven, P.J.M., Aarts, E.H.L.: Simulated annealing: Theory and applications, D. Reidel; Kluwer, Dordrecht, Boston, Norwell, MA (1987) 157. Varanelli, J.M., Cohoon, J.P.: A fast method for generalized starting temperature determi- nation in homogeneous two-stage simulated annealing systems. Comput. Oper. Res. 26, 481–503 (1999) 158. Vaughan, D., Jacobson, S.H.: Tabu guided generalized hill climbing algorithms. Methodol. Comput. Appl. Probab. 6, 343–354 (2004) 159. Villalobos-Arias, M., Coello, C.A.C., Hernandez-Lerma, O.: Foundations of genetic algo- rithms. Lecture Notes in Comput. Sci. 3469, 95–111 (2005) 160. Villalobos-Arias, M., Coello, C.A.C., Hernandez-Lerma, O.: Asymptotic Convergence of a Simulated Annealing Algorithm for Multiobjective Optimization Problems, Math. Methods. Oper. Res., 64, 353–362 (2006) 161. Wood, G.R., Alexander, D.L.J., Bulger, D.W.: J. Global Optim., 22, 271–284 (2002) 162. Yan, D., Mukai, H.: Stochastic discrete optimization. SIAM J. Control Optim., 30, 594–612 (1992) 163. Yang, R.L.: Convergence of the simulated annealing algorithm for continuous global opti- mization. J. Optim. Theory. Appl. 104, 691–716 (2000) 164. Yao, X.: A new simulated annealing algorithm. Int. J. Comput. Math. 56, 161–168 (1995) 165. Yao, X., Li, G.: General simulated annealing. J. Comput. Sci. Tech. 6, 329–338 (1991) 166. Zabinsky, Z.B., Smith, R.L., McDonald, J.F., Romeijn, H.E., Kaufman, D.E.: Improving hit-and-run for global optimization. J. Global Optim. 3, 171–192 (1993) 167. Zolfaghari, S., Liang, M.: Comparative study of simulated annealing, genetic algorithms and tabu Search for solving binary and comprehensive machine-grouping problems. Int. J. Prod. Resour. 40(9), 2141–2158 (2002)