The Cross-Entropy Method for Continuous Multi-Extremal Optimization

Total Page:16

File Type:pdf, Size:1020Kb

The Cross-Entropy Method for Continuous Multi-Extremal Optimization The Cross-Entropy Method for Continuous Multi-extremal Optimization Dirk P. Kroese Sergey Porotsky Department of Mathematics Optimata Ltd. The University of Queensland 11 Tuval St., Ramat Gan, 52522 Brisbane 4072, Australia Israel. Reuven Y. Rubinstein Faculty of Industrial Engineering and Management Technion, Haifa Israel Abstract In recent years, the cross-entropy method has been successfully applied to a wide range of discrete optimization tasks. In this paper we consider the cross-entropy method in the context of continuous optimization. We demonstrate the effectiveness of the cross-entropy method for solving difficult continuous multi-extremal optimization problems, including those with non- linear constraints. Key words: Cross-entropy, continuous optimization, multi-extremal objective function, dynamic smoothing, constrained optimization, nonlinear constraints, acceptance{rejection, penalty function. 1 1 Introduction The cross-entropy (CE) method (Rubinstein and Kroese, 2004) was motivated by Rubinstein (1997), where an adaptive variance minimization algorithm for estimating probabilities of rare events for stochastic networks was presented. It was modified in Rubinstein (1999) to solve combinatorial optimization problems. The main idea behind using CE for continuous multi-extremal optimization is the same as the one for combinatorial optimization, namely to first associate with each optimization problem a rare event estimation problem | the so-called associated stochastic problem (ASP) | and then to tackle this ASP efficiently by an adaptive algorithm. The principle outcome of this approach is the construction of a random sequence of solutions which converges probabilistically to the optimal or near-optimal solution. As soon as the ASP is defined, the CE method involves the following two iterative phases: 1. Generation of a sample of random data (trajectories, vectors, etc.) according to a specified random mechanism. 2. Updating the parameters of the random mechanism, typically parameters of pdfs, on the basis of the data, to produce a \better" sample in the next iteration. The CE method has been successfully applied to a variety of problems in combinatorial op- timization and rare-event estimation, the latter with both light- and heavy-tailed distributions. Applications areas include buffer allocation, queueing models of telecommunication systems, neu- ral computation, control and navigation, DNA sequence alignment, signal processing, scheduling, vehicle routing, reinforcement learning, project management and reliability systems. References and more details on applications and theory can be found in the CE tutorial (de Boer et al., 2004) and the recent CE monograph (Rubinstein and Kroese, 2004). It is important to note that the CE method deals successfully with both deterministic problems, such as the traveling salesman problem, and noisy (i.e., simulation-based) problems, such as the buffer allocation problem. The goal of this paper is to demonstrate the quite accurate performance of the CE method for solving difficult continuous multi-extremal problems, both constrained and unconstrained. In particular we show numerically that typically it finds the optimal (or near-optimal) solution very fast and provides more accurate results than those reported in the literature. 2 It is out of the scope of this paper to review the huge literature on analytical/deterministic methods for solving optimization problem. Many of these methods are based on gradient or pseudo- gradient techniques. Examples are Line Search, Gradient Descent, Newton-Type Methods, Variable Metric, Conjugate Gradient etc. The main drawback of gradient-based methods is that they, by their nature, do not cope well with optimization problems that have non-convex objective functions and/or many local optima. Such multi-extremal continuous optimization problems arise abundantly in applications. A popular and convenient approach to these type of problems is to use random search techniques. The basic idea behind such methods is to systematically partition the feasible region into smaller subregions and then to move from one subregion to another based on information obtained by random search. Well-known examples include simulated annealing (Aarts and Korst, 1989), threshold acceptance, (Dueck and Scheur, 1990), genetic algorithms (Goldberg, 1989), tabu search (Glover and Laguna, 1993), ant colony method (Dorigo et al., 1996), and the stochastic comparison method (Gong et al., 1992). The rest of this paper is organized as follows. Section 2 presents an overview of the CE method. In section 3 we give the main algorithm for continuous multi-extremal optimization using multi- dimensional normal sampling with independent components. A particular emphasis is put on the issue of different updating procedures for the parameters of the normal pdf, so-called fixed and dynamic smoothing. In Section 4 we present numerical results with the CE Algorithm for both unconstrained and constrained programs. We demonstrate the high accuracy of CE using two approaches to constrained optimization: the acceptance-rejection approach and the penalty approach. 2 The Main CE Algorithm for Optimization The main idea of the CE method for optimization can be stated as follows: Suppose we wish to maximize some \performance" function S(x) over all elements/states x in some set X . Let us denote the maximum by γ∗, thus γ∗ = max S(x) : (1) x2X To proceed with CE, we first randomize our deterministic problem by defining a family of pdfs 3 ff(·; v); v 2 Vg on the set X . Next, we associate with (1) the estimation of P E `(γ) = u(S(X) > γ) = uIfS(X)>γg; (2) the so-called associated stochastic problem (ASP). Here, X is a random vector with pdf f(·; u), for some u 2 V (for example, X could be a normal random vector) and γ is a known or unknown parameter. Note that there are in fact two possible estimation problems associated with (2). For a given γ we can estimate `, or alternatively, for a given ` we can estimate γ, the root of (2). Let us consider the problem of estimating ` for a certain γ close to γ ∗. Then, typically fS(X) > γg is a rare event, and estimation of ` is a non-trivial problem. The CE method solves this efficiently by making adaptive changes to the probability density function according to the Kullback-Leibler CE, thus creating a sequence f(·; u); f(·; v1); f(·; v2); : : : of pdfs that are \steered" in the direction of the theoretically optimal density f(·; v∗) corresponding to the degenerate density at an optimal point. In fact, the CE method generates a sequence of tuples f(γt; vt)g, which converges quickly to a small neighborhood of the optimal tuple (γ∗; v∗). More specifically, we initialize by setting −2 v0 = u, choosing a not very small quantity %, say % = 10 , and then we proceed as follows: 1. Adaptive updating of γt. For a fixed vt−1, let γt be the (1 − %)-quantile of S(X) under vt−1. That is, γt satisfies P vt−1 (S(X) > γt) > %; (3) P vt−1 (S(X) 6 γt) > 1 − %; (4) where X ∼ f(·; vt−1). A simple estimator of γt, denoted γt, can be obtained by drawing a random sample X1; : : : ; XN from f(·; vt−1) and evaluating theb sample (1 − %)-quantile of the performances as γt = S(d(1−%)Ne): (5) b 2. Adaptive updating of vt. For fixed γt and vt−1, derive vt from the solution of the program Ev X maxv D(v) = maxv t−1 IfS( )>γtg ln f(X; v) : (6) The stochastic counterpart of (6) is as follows: for fixed γt and vt−1 (the estimate of vt−1), derive vt from the following program b b N b 1 max D(v) = max I X ln f(X ; v) : (7) v v N fS( i)>γtg i Xi=1 b b 4 Remark 2.1 (Smoothed Updating) Instead of updating the parameter vector v directly via the solution of (7) we use the following smoothed version vt = αv~t + (1 − α)vt−1; 8 i = 1; : : : n; (8) b b where v~t is the parameter vector obtained from the solution of (7), and α is called the smoothing parameter, with 0:7 < α ≤ 1. Clearly, for α = 1 we have our original updating rule. The reason for using the smoothed (8) instead of the original updating rule is twofold: (a) to smooth out the values of vt, (b) to reduce the probability that some component vt;i of vt will be zero or one at the first few iterations.b This is particularly important when vt is abvectorb or matrix of probabilities. Note that for 0 < α ≤ 1 we always have that vt;i > 0, whileb for α = 1 one might have (even at the first iterations) that either vt;i = 0 or vt;i =b1 for some indices i. As result, the algorithm will converge to a wrong solution. b b Thus, the main CE optimization algorithm, which includes smoothed updating of parameter vector v can be summarized as follows. Algorithm 1 Generic CE Algorithm for Optimization 1. Choose some v0. Set t = 1. b 2. Generate a sample X1; : : : ; XN from the density f(·; vt−1) and compute the sample (1 − %)- quantile γt of the performances according to (5). b b 3. Use the same sample X1; : : : ; XN and solve the stochastic program (7). Denote the solution by vt. e 4. Apply (8) to smooth out the vector vt. e 5. Repeat steps 2{4 until a pre-specified stopping criterion is met. 3 Continuous Multi-Extremal Optimization Here we apply the CE Algorithm 1 to continuous multi-extremal optimization problems (1), for both (a) unconstrained and (b) constrained problems with non-linear boundaries. We assume n henceforth that each x = (x1; : : : ; xn) is a real-valued vector and that X is a subset of R . 5 (a) The Unconstrained Case In this case generation of a random vector X = (X1; : : : ; Xn) 2 X is straightforward.
Recommended publications
  • On Modularity - NP-Completeness and Beyond?
    On Modularity - NP-Completeness and Beyond? Ulrik Brandes1, Daniel Delling2, Marco Gaertler2, Robert G¨orke2, Martin Hoefer1, Zoran Nikoloski3, and Dorothea Wagner2 1 Department of Computer & Information Science, University of Konstanz, {brandes,hoefer}@inf.uni-konstanz.de 2 Faculty of Informatics, Universit¨atKarlsruhe (TH), {delling,gaertler,rgoerke,wagner}@informatik.uni-karlsruhe.de 3 Department of Applied Mathematics, Faculty of Mathematics and Physics, Charles University, Prague, [email protected] Abstract. Modularity is a recently introduced quality measure for graph clusterings. It has im- mediately received considerable attention in several disciplines, and in particular in the complex systems literature, although its properties are not well understood. We here present first results on the computational and analytical properties of modularity. The complexity status of modularity maximization is resolved showing that the corresponding decision version is NP-complete in the strong sense. We also give a formulation as an Integer Linear Program (ILP) to facilitate exact optimization, and provide results on the approximation factor of the commonly used greedy al- gorithm. Completing our investigation, we characterize clusterings with maximum modularity for several graph families. 1 Introduction Graph clustering is a fundamental problem in the analysis of relational data. Although studied for decades, it applied in many settings, it is now popularly referred to as the problem of partitioning networks into communities. In this line of research, a novel graph clustering index called modularity has been proposed recently [1]. The rapidly growing interest in this measure prompted a series of follow-up studies on various applications and possible adjustments (see, e.g., [2,3,4,5,6]).
    [Show full text]
  • On Metaheuristic Optimization Motivated by the Immune System
    Applied Mathematics, 2014, 5, 318-326 Published Online January 2014 (http://www.scirp.org/journal/am) http://dx.doi.org/10.4236/am.2014.52032 On Metaheuristic Optimization Motivated by the Immune System Mohammed Fathy Elettreby1,2*, Elsayd Ahmed2, Houari Boumedien Khenous1 1Department of Mathematics, Faculty of Science, King Khalid University, Abha, KSA 2Department of Mathematics, Faculty of Science, Mansoura University, Mansoura, Egypt Email: *[email protected] Received November 16, 2013; revised December 16, 2013; accepted December 23, 2013 Copyright © 2014 Mohammed Fathy Elettreby et al. This is an open access article distributed under the Creative Commons Attribu- tion License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. In accordance of the Creative Commons Attribution License all Copyrights © 2014 are reserved for SCIRP and the owner of the intellectual property Mohammed Fathy Elettreby et al. All Copyright © 2014 are guarded by law and by SCIRP as a guardian. ABSTRACT In this paper, we modify the general-purpose heuristic method called extremal optimization. We compare our results with the results of Boettcher and Percus [1]. Then, some multiobjective optimization problems are solved by using methods motivated by the immune system. KEYWORDS Multiobjective Optimization; Extremal Optimization; Immunememory and Metaheuristic 1. Introduction Multiobjective optimization problems (MOPs) [2] are existing in many situations in nature. Most realistic opti- mization problems require the simultaneous optimization of more than one objective function. In this case, it is unlikely that the different objectives would be optimized by the same alternative parameter choices. Hence, some trade-off between the criteria is needed to ensure a satisfactory problem.
    [Show full text]
  • Arxiv:Cond-Mat/0104214V1
    Extremal Optimization for Graph Partitioning Stefan Boettcher∗ Physics Department, Emory University, Atlanta, Georgia 30322, USA Allon G. Percus† Computer & Computational Sciences Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA (Dated: August 4, 2021) Extremal optimization is a new general-purpose method for approximating solu- tions to hard optimization problems. We study the method in detail by way of the NP-hard graph partitioning problem. We discuss the scaling behavior of extremal optimization, focusing on the convergence of the average run as a function of run- time and system size. The method has a single free parameter, which we determine numerically and justify using a simple argument. Our numerical results demonstrate that on random graphs, extremal optimization maintains consistent accuracy for in- creasing system sizes, with an approximation error decreasing over runtime roughly as a power law t−0.4. On geometrically structured graphs, the scaling of results from the average run suggests that these are far from optimal, with large fluctuations between individual trials. But when only the best runs are considered, results con- sistent with theoretical arguments are recovered. PACS number(s): 02.60.Pn, 05.65.+b, 75.10.Nr, 64.60.Cn. arXiv:cond-mat/0104214v1 [cond-mat.stat-mech] 12 Apr 2001 ∗Electronic address: [email protected] †Electronic address: [email protected] 2 I. INTRODUCTION Optimizing a system of many variables with respect to some cost function is a task frequently encountered in physics. The determination of ground-state configurations in disordered materials [1, 2, 3, 4] and of fast-folding protein conformations [5] are but two examples.
    [Show full text]
  • Extremal Optimization of Graph Partitioning at the Percolation
    Extremal Optimization of Graph Partitioning at the Percolation Threshold Stefan Boettcher1,2 1Physics Department, Emory University, Atlanta, Georgia 30322, USA 2Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545, USA (May 31, 2018) The benefits of a recently proposed method to approximate hard optimization problems are demon- strated on the graph partitioning problem. The performance of this new method, called Extremal Optimization, is compared to Simulated Annealing in extensive numerical simulations. While gener- ally a complex (NP-hard) problem, the optimization of the graph partitions is particularly difficult for sparse graphs with average connectivities near the percolation threshold. At this threshold, the relative error of Simulated Annealing for large graphs is found to diverge relative to Extremal Opti- mization at equalized runtime. On the other hand, Extremal Optimization, based on the extremal dynamics of self-organized critical systems, reproduces known results about optimal partitions at this critical point quite well. I. INTRODUCTION this critical point, the performance of SA markedly de- teriorates while EO produces only small errors. The optimization of systems with many degrees of free- This paper is organized as follows: In the next section dom with respect to some cost function is a frequently en- we describe the philosophy behind the EO method, in countered task in physics and beyond [1]. In cases where Sec. III we introduce the graph partitioning problem, the relation between individual components of the sys- and Sec. IV we present the algorithms and the results tem is frustrated [2], such a cost function often exhibits obtained in our numerical comparison of SA and EO, a complex “landscape” [3] over the space of all configu- followed by conclusions in Sec.
    [Show full text]
  • Rank-Constrained Optimization: Algorithms and Applications
    Rank-Constrained Optimization: Algorithms and Applications Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Chuangchuang Sun, BS Graduate Program in Aeronautical and Astronautical Engineering The Ohio State University 2018 Dissertation Committee: Ran Dai, Advisor Mrinal Kumar Manoj Srinivasan Wei Zhang c Copyright by Chuangchuang Sun 2018 Abstract Matrix rank related optimization problems comprise rank-constrained optimization problems (RCOPs) and rank minimization problems, which involve the matrix rank function in the objective or the constraints. General RCOPs are defined to optimize a convex objective function subject to a set of convex constraints and rank constraints on unknown rectangular matrices. RCOPs have received extensive attention due to their wide applications in signal processing, model reduction, and system identification, just to name a few. Noteworthily, The general/nonconvex quadratically constrained quadratic programming (QCQP) problem with wide applications is a special category of RCOP. On the other hand, when the rank function appears in the objective of an optimization problem, it turns out to be a rank minimization problem (RMP), classified as another category of nonconvex optimization. Applications of RMPs have been found in a variety of areas, such as matrix completion, control system analysis and design, and machine learning. At first, RMP and RCOPs are not isolated. Though we can transform a RMP into a RCOP by introducing a slack variable, the reformulated RCOP, however, has an unknown upper bound for the rank function. That is a big shortcoming. Provided that a given matrix upper bounded is a prerequisite for most of the algorithms when solving RCOPs, we first fill this gap by demonstrating that RMPs can be equivalently transformed into RCOPs by introducing a quadratic matrix equality constraint.
    [Show full text]
  • Continuous Extremal Optimization for Lennard-Jones Clusters
    PHYSICAL REVIEW E 72, 016702 ͑2005͒ Continuous extremal optimization for Lennard-Jones clusters Tao Zhou,1 Wen-Jie Bai,2 Long-Jiu Cheng,2 and Bing-Hong Wang1,* 1Nonlinear Science Center and Department of Modern Physics, University of Science and Technology of China, Hefei Anhui, 230026, China 2Department of Chemistry, University of Science and Technology of China, Hefei Anhui, 230026, China ͑Received 17 November 2004; revised manuscript received 19 April 2005; published 6 July 2005͒ We explore a general-purpose heuristic algorithm for finding high-quality solutions to continuous optimiza- tion problems. The method, called continuous extremal optimization ͑CEO͒, can be considered as an extension of extremal optimization and consists of two components, one which is responsible for global searching and the other which is responsible for local searching. The CEO’s performance proves competitive with some more elaborate stochastic optimization procedures such as simulated annealing, genetic algorithms, and so on. We demonstrate it on a well-known continuous optimization problem: the Lennard-Jones cluster optimization problem. DOI: 10.1103/PhysRevE.72.016702 PACS number͑s͒: 02.60.Pn, 36.40.Ϫc, 05.65.ϩb I. INTRODUCTION rank each individual according to its fitness value so that the least-fit one is in the top. Use ki to denote the individual i’s The optimization of a system with many degrees of free- rank; clearly, the least-fit one is of rank 1. Choose one indi- dom with respect to some cost function is a frequently en- ͑ ͒ vidual j that will be changed with the probability P kj , and countered task in physics and beyond.
    [Show full text]
  • A New Hybrid BA ABC Algorithm for Global Optimization Problems
    mathematics Article A New Hybrid BA_ABC Algorithm for Global Optimization Problems Gülnur Yildizdan 1,* and Ömer Kaan Baykan 2 1 Kulu Vocational School, Selcuk University, Kulu, 42770 Konya, Turkey 2 Department of Computer Engineering, Faculty of Engineering and Natural Sciences, Konya Technical University, 42250 Konya, Turkey; [email protected] * Correspondence: [email protected] Received: 1 September 2020; Accepted: 21 September 2020; Published: 12 October 2020 Abstract: Bat Algorithm (BA) and Artificial Bee Colony Algorithm (ABC) are frequently used in solving global optimization problems. Many new algorithms in the literature are obtained by modifying these algorithms for both constrained and unconstrained optimization problems or using them in a hybrid manner with different algorithms. Although successful algorithms have been proposed, BA’s performance declines in complex and large-scale problems are still an ongoing problem. The inadequate global search capability of the BA resulting from its algorithm structure is the major cause of this problem. In this study, firstly, inertia weight was added to the speed formula to improve the search capability of the BA. Then, a new algorithm that operates in a hybrid manner with the ABC algorithm, whose diversity and global search capability is stronger than the BA, was proposed. The performance of the proposed algorithm (BA_ABC) was examined in four different test groups, including classic benchmark functions, CEC2005 small-scale test functions, CEC2010 large-scale test functions, and classical engineering design problems. The BA_ABC results were compared with different algorithms in the literature and current versions of the BA for each test group. The results were interpreted with the help of statistical tests.
    [Show full text]
  • Improviesed Genetic Algorithm for Fuzzy Overlapping Community Detection in Social Networks
    International Journal of Advanced Computational Engineering and Networking, ISSN: 2320-2106, Volume-5, Issue-9, Sep.-2017 http://iraj.in IMPROVIESED GENETIC ALGORITHM FOR FUZZY OVERLAPPING COMMUNITY DETECTION IN SOCIAL NETWORKS 1HARISH KUMAR SHAKYA, 2KULDEEP SINGH, 3BHASKAR BISWAS 123Indian Institute of Technology, Varanasi, (UP) India, India Email: [email protected],[email protected], [email protected] Abstract: Community structure identification is an important task in social network analysis. Social communications exist with some social situation and communities are a fundamental form of social contexts. Social network is application of web mining and web mining is also an application of data mining. Social network is a type of structure made up of a set of social actors like as persons or organizations, sets of pair ties, and other interaction socially between actors. In recent scenario community detection in social networks is a very hot and dynamic area of research. In this paper, we have used improvised genetic algorithm for community detection in social networks, we used the combination of roulette wheel selection and square quadratic knapsack problem. We have executed the experiment on different datasets i.e. Zachary’s karate club [31], American college football [39], Dolphin social network [32] and many more. All are verified and well known datasets in the research world of social network analysis. An experimental result shows the improvement on convergence rate of proposed algorithm and discovered communities are highly inclined towards quality. Index term: community detection, genetic algorithm, roulette wheel selection, I. INTRODUCTION optimization problem, which involves a quality function first defined by Newman and Girvan [3], A social network is a branch of data mining which involves finding some structure or pattern amongst the set of individuals, groups and organizations.
    [Show full text]
  • Nature Based Algorithms Have Been Used Succesifully For
    Anais do I WORCAP, INPE, São José dos Campos, 25 de Outubro de 2001, p. 115-117 Extremal Dynamics Applied to Function Optimization Fabiano Luis de Sousa*,1, Fernando Manuel Ramos**,1 (1) Área de Computação Científica Laboratório Associado de Computação e Matemática Aplicada Instituto Nacional de Pesquisas Espaciais (INPE) (*) Doutorado, email: [email protected]; (**) Orientador Abstract In this article is introduced the Generalized Extremal Optimization algorithm. Based on the dynamics of self-organized criticality, it is intended to be used in complex inverse design problems, where traditional gradient based optimization methods are not efficient. Preliminary results from a set of test functions show that this algorithm can be competitive to other stochastic methods such as the genetic algorithms. Key words: Extremal dynamics, optimization, self-organized criticality. Introduction Recently, Boettcher and Percus [1] have proposed a new optimization algorithm based on a simplified model of natural selection developed by Bak and Sneppen [2]. Evolution in this model is driven by a extremal process that shows characteristics of self-organized criticality (SOC). Boettcher and Percus [1] have adapted the evolutionary model of Bak and Sneppel [2] to tackle hard problems in combinatorial optimization, calling their algorithm Extremal Optimization (EO). Although algorithms such as Simulated Annealing (SA), Genetic Algorithms (GAs) and the EO are inspired by natural processes, their practical implementation to optimization problems shares a common feature: the search for the optimal is done through a stochastic process that is “guided” by the setting of adjustable parameters. Since the proper setting of these parameters are very important to the performance of the algorithms, it is highly desirable that they have few of such parameters, so that the cost of finding the best set to a given optimization problem does not become a costly task in itself.
    [Show full text]
  • Extremal Optimization at the Phase Transition of the 3-Coloring Problem
    PHYSICAL REVIEW E 69, 066703 (2004) Extremal optimization at the phase transition of the three-coloring problem Stefan Boettcher* Department of Physics, Emory University, Atlanta, Georgia 30322, USA Allon G. Percus† Computer & Computational Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA and UCLA Institute for Pure and Applied Mathematics, Los Angeles, California 90049, USA (Received 10 February 2004; published 24 June 2004) We investigate the phase transition in vertex coloring on random graphs, using the extremal optimization heuristic. Three-coloring is among the hardest combinatorial optimization problems and is equivalent to a 3-state anti-ferromagnetic Potts model. Like many other such optimization problems, it has been shown to exhibit a phase transition in its ground state behavior under variation of a system parameter: the graph’s mean vertex degree. This phase transition is often associated with the instances of highest complexity. We use extremal optimization to measure the ground state cost and the “backbone,” an order parameter related to ground state overlap, averaged over a large number of instances near the transition for random graphs of size n up to 512. For these graphs, benchmarks show that extremal optimization reaches ground states and explores a sufficient number of them to give the correct backbone value after about O͑n3.5͒ update steps. Finite size ␣ ͑ ͒ scaling yields a critical mean degree value c =4.703 28 . Furthermore, the exploration of the degenerate ground states indicates that the backbone order parameter, measuring the constrainedness of the problem, exhibits a first-order phase transition. DOI: 10.1103/PhysRevE.69.066703 PACS number(s): 02.60.Pn, 05.10.Ϫa, 75.10.Nr I.
    [Show full text]
  • Studies on Extremal Optimization and Its Applications in Solving Real World Optimization Problems
    Proceedings of the 2007 IEEE Symposium on Foundations of Computational Intelligence (FOCI 2007) 1 Studies on Extremal Optimization and Its Applications in Solving Real World Optimization Problems Yong-Zai Lu*, IEEE Fellow, Min-Rong Chen, Yu-Wang Chen Department of Automation Shanghai Jiaotong University Shanghai, China Abstract— Recently, a local-search heuristic algorithm called of surviving. Species in this model is located on the sites of a Extremal Optimization (EO) has been proposed and successfully lattice. Each species is assigned a fitness value randomly with applied in some NP-hard combinatorial optimization problems. uniform distribution. At each update step, the worst adapted This paper presents an investigation on the fundamentals of EO species is always forced to mutate. The change in the fitness of with its applications in discrete and numerical optimization problems. The EO was originally developed from the fundamental the worst adapted species will cause the alteration of the fitness of statistic physics. However, in this study we also explore the landscape of its neighbors. This means that the fitness values of mechanism of EO from all three aspects: statistical physics, the species around the worst one will also be changed randomly, biological evolution or co-evolution and ecosystem. Furthermore, even if they are well adapted. After a number of iterations, the we introduce our contributions to the applications of EO in solving system evolves to a highly correlated state known as SOC. In traveling salesman problem (TSP) and production scheduling, and that state, almost all species have fitness values above a certain multi-objective optimization problems with novel perspective in discrete and continuous search spaces, respectively.
    [Show full text]
  • Gumbel-Softmax-Based Optimization
    Li et al. Comput Soc Netw (2021) 8:5 https://doi.org/10.1186/s40649‑021‑00086‑z RESEARCH Open Access Gumbel‑softmax‑based optimization: a simple general framework for optimization problems on graphs Yaoxin Li1, Jing Liu1, Guozheng Lin1, Yueyuan Hou2, Muyun Mou1 and Jiang Zhang1* *Correspondence: [email protected] Abstract 1 School of Systems Science, In computer science, there exist a large number of optimization problems defned on Beijing Normal University, No.19, Xinjiekouwai graphs, that is to fnd a best node state confguration or a network structure, such that St, Haidian District, the designed objective function is optimized under some constraints. However, these Beijing 100875, People’s problems are notorious for their hardness to solve, because most of them are NP-hard Republic of China Full list of author information or NP-complete. Although traditional general methods such as simulated annealing is available at the end of the (SA), genetic algorithms (GA), and so forth have been devised to these hard problems, article their accuracy and time consumption are not satisfying in practice. In this work, we proposed a simple, fast, and general algorithm framework based on advanced auto- matic diferentiation technique empowered by deep learning frameworks. By introduc- ing Gumbel-softmax technique, we can optimize the objective function directly by gradient descent algorithm regardless of the discrete nature of variables. We also intro- duce evolution strategy to parallel version of our algorithm. We test our algorithm on four representative optimization problems on graph including modularity optimization from network science, Sherrington–Kirkpatrick (SK) model from statistical physics, maxi- mum independent set (MIS) and minimum vertex cover (MVC) problem from combina- torial optimization on graph, and Infuence Maximization problem from computational social science.
    [Show full text]