Genetic Algorithm As Machine Learning for Profiles Recognition

Total Page:16

File Type:pdf, Size:1020Kb

Genetic Algorithm As Machine Learning for Profiles Recognition Genetic Algorithm as Machine Learning for Profiles Recognition Yann Carbonne1 and Christelle Jacob2 1University of Technology of Troyes, 12 rue Marie Curie, Troyes, France 2Altran Technologies, 4 avenue Didier Daurat, Blagnac, France Keywords: Genetic Algorithm, Machine Learning, Natural Language Processing, Profiles Recognition, Clustering. Abstract: Persons are often asked to provide information about themselves. These data are very heterogeneous and result in as many “profiles” as contexts. Sorting a large amount of profiles from different contexts and assigning them back to a specific individual is quite a difficult problem. Semantic processing and machine learning are key tools to achieve this goal. This paper describes a framework to address this issue by means of concepts and algorithms selected from different Artificial Intelligence fields. Indeed, a Vector Space Model is customized to first transpose semantic information into a mathematical model. Then, this model goes through a Genetic Algorithm (GA) which is used as a supervised learning algorithm for training a computer to determine how much two profiles are similar. Amongst the GAs, this study introduces a new reproduction method (Best Together), and compare it to some usual ones (Wheel, Binary Tournament).This paper also evaluates the accuracy of the GAs predictions for profiles clustering with the computation of a similarity score, as well as its ability to classify two profiles are similar or non-similar. We believe that the overall methodology can be used for any kind of sources using profiles and, more generally, for similar data recognition. 1 INTRODUCTION different social networks. Even if existing solutions, such as the use of Vector Space Models (VSM) for For several years, we have witnessed the exponential information retrieval (Salton, 1968), inspired our growth of data worldwide. According to the experts, study, they are not straight related. 90% of world data had been generated over the last The problem is to identify the same real person two years. Human cannot handle this large amount of between different profiles from different sources. data, hence machine comes in the foreground for To do so, the objective is to teach a computer to processing and extracting meaningful information automatically answer the question: “Are these two from them. profiles about the same real person?”. Just like it In this paper, we will focus on a special kind of would be for a human, the teaching will be split in data: those concerning people. These data can be very two phases. During a first phase, the computer will heterogeneous due to the diversity of their origin. use a human-made set of data to train. Within this Data comes from several sources: public (social training set, for each possible combination of profiles, media, forums, etc.) or private (employee database, the question above had been answered. The training customer database, etc.). should be done with various profiles from different Despite their diversity, collected data are sources to be relevant. After the training phase, the processed the same way: each user (a real person) is computer will be able to predict a similarity score matched with one or several profiles. A profile could between two profiles. The performances will be contain global information (city, gender …) or determined through the analysis of predefined criteria specific information (work history …). The for predictions. information volume could also be dense or sparse. In this study, we investigate how to determine a This paper differs from existing studies about person profile using a combination of natural profiles recognition in social networks (Rawashdeh & language processing, genetic algorithm and machine Ralescu, 2014) because it does not focus on similarity learning. In addition, we propose a new reproduction between profiles within a social network but between mechanism, named here Best Together (BT). The 157 Carbonne, Y. and Jacob, C.. Genetic Algorithm as Machine Learning for Profiles Recognition. In Proceedings of the 7th International Joint Conference on Computational Intelligence (IJCCI 2015) - Volume 1: ECTA, pages 157-166 ISBN: 978-989-758-157-1 Copyright c 2015 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved ECTA 2015 - 7th International Conference on Evolutionary Computation Theory and Applications new reproduction is compared with other methods (1) M = Nx ∪ Ny such as Wheel and Binary Tournament. Results indicate BT as a promising strategy for profile Whenever transposing a vector into a VSM, the recognition. vector has a value 0 on its non-existing dimensions. This paper is organized as follows: Section 2 Illustration to the use of VSM with an example : introduces how to convert a profile (a set of semantic considering two profiles P and P : information) to a mathematic model, understandable 1 2 for a computer. Section 3 describes the overview of Table 2: Contents for profiles P1 and P2. the genetic algorithm used to train the computer for profile recognition. Section 4 analyses the results of P1 P2 the prediction made by our model and it will be Foo firstname Foo firstname followed by our conclusions. Bar lastname Bar lastname Google organisation Horses like Paris city Google like 2 MATHEMATICS DESIGN FOR For the purpose of this example, the weighting for PROFILES each label are: 2.1 Representation of a Profile Table 3: Weighting example. First of all, let us introduce the mathematical model Label Weight used for profiles representation. A profile will be firstname 0.7 considered as a set of labelled semantic information. lastname 0.8 For example: organisation 0.4 city 0.5 Table 1: Profile example. like 0.1 Information Label The associated VSM, with the vector V1 for P1 and Foo firstname V2 for P2, will be: Bar lastname Paris city Table 4: VSM for V1 and V2. beer like Dimension V1 V2 In order to be able to compare several profiles on Foo 0.7 0.7 a same frame of references, we will use the Vector Bar 0.8 0.8 Space Models (VSM) of semantics. Google 0.4 0.1 Indeed, computers have difficulty understanding Paris 0.5 0 semantic information but VSM provides a solution Horses 0 0.1 for this problem. They have widely been used in different fields as a recent survey highlights (Turney The advantage of this representation is to keep the and Pantel, 2010). In particular, they have been semantic information in the forefront of the successfully used in the field of Machine Learning for mathematic analysis. In the example above, both classification (Dasarathy, 1991) and also for profiles have the information “Google” but for one, clustering (A. K. Jain, 1999). this information is labelled as “organisation” and for the other, it is labelled as “like”. Even with these Within a VSM, each profile Px will be transposed as a vector V with N dimensions which are all the different labels, this model will consider the x x information important but at a different scale. By information in the profile Px. For example, the profile above will be a vector with the dimensions: “Foo”, intuition, we would like to set the label “organisation” “Bar”, “Paris” and “beer”. at a higher value than the label “like” because the former is usually more relevant to distinct two The value of Vx in a specific dimension δ will be a weighting from the label matched with the profiles than the later. We acquired this intuition information δ. through our experience and we would like the The VSM consists in creating a new vector space computer to get the same “intuition” for any kind of labels. of M dimensions. For two vectors Vx and Vy, the dimension M is set as: 158 Genetic Algorithm as Machine Learning for Profiles Recognition 2.2 Similarity Function GA can also be used as Machine Learning (ML) algorithm and has been shown to be efficient in this Once the VSM for two vectors Vx and Vy is created, purpose (Goldberg, 1989). The idea behind is that the usual method is to compute their similarity with natural-like algorithms can demonstrate, in some the cosine (Turney and Pantel, 2010): cases in the ML field, a higher efficiency compared to human-designed algorithms (Kluwer Academic similarity (V ,V ) = cos (α) (2) x y Publishers, 2001). Indeed, actual evolutionary where α is the angle between Vx and Vy. processes have succeeded to solve highly complex This similarity rate gives a good clue of how close problems, as proved through probabilistic arguments two vectors are within their vector space, therefore (Moorhead and Kaplan, 1967). how similar two profiles are. In our case, GA will be used to determine an But there is a human intuition in profiles adequate set of weighting for each label present in a recognition that is missing with this computation. training set. Our training set is composed with Sometimes, two profiles do not contain enough similarities between profiles, two profiles are either relevant information to evaluate their similarity. For similar (output = 1) or not similar (output = 0). example, if two profiles like the same singer and these profiles only contain this information, it is not enough 3.1 Genetic Representation to determine they both concern the same person. Through experimentation, we noticed that the norm The genotype for each chromosome of the population of a vector is a good metric to evaluate the relevance will be the group of all labels in the training set. Each of a profile. Therefore, the similarity rate is smoothed label is defined as a gene and the weighting for a with the average norm of the two vectors: specific label is the allele of the linked gene. The similarity(Vx,Vy) = cos(α) * (‖Vx‖ + ‖Vy‖) / 2 (3) weighting is a value in [0,1], it could be translated as the relevance of a label and it reaches its best at value where ‖V‖ is the Euclidean norm for the vector V.
Recommended publications
  • Metaheuristics1
    METAHEURISTICS1 Kenneth Sörensen University of Antwerp, Belgium Fred Glover University of Colorado and OptTek Systems, Inc., USA 1 Definition A metaheuristic is a high-level problem-independent algorithmic framework that provides a set of guidelines or strategies to develop heuristic optimization algorithms (Sörensen and Glover, To appear). Notable examples of metaheuristics include genetic/evolutionary algorithms, tabu search, simulated annealing, and ant colony optimization, although many more exist. A problem-specific implementation of a heuristic optimization algorithm according to the guidelines expressed in a metaheuristic framework is also referred to as a metaheuristic. The term was coined by Glover (1986) and combines the Greek prefix meta- (metá, beyond in the sense of high-level) with heuristic (from the Greek heuriskein or euriskein, to search). Metaheuristic algorithms, i.e., optimization methods designed according to the strategies laid out in a metaheuristic framework, are — as the name suggests — always heuristic in nature. This fact distinguishes them from exact methods, that do come with a proof that the optimal solution will be found in a finite (although often prohibitively large) amount of time. Metaheuristics are therefore developed specifically to find a solution that is “good enough” in a computing time that is “small enough”. As a result, they are not subject to combinatorial explosion – the phenomenon where the computing time required to find the optimal solution of NP- hard problems increases as an exponential function of the problem size. Metaheuristics have been demonstrated by the scientific community to be a viable, and often superior, alternative to more traditional (exact) methods of mixed- integer optimization such as branch and bound and dynamic programming.
    [Show full text]
  • Removing the Genetics from the Standard Genetic Algorithm
    Removing the Genetics from the Standard Genetic Algorithm Shumeet Baluja Rich Caruana School of Computer Science School of Computer Science Carnegie Mellon University Carnegie Mellon University Pittsburgh, PA 15213 Pittsburgh, PA 15213 [email protected] [email protected] Abstract from both parents into their respective positions in a mem- ber of the subsequent generation. Due to random factors involved in producing “children” chromosomes, the chil- We present an abstraction of the genetic dren may, or may not, have higher fitness values than their algorithm (GA), termed population-based parents. Nevertheless, because of the selective pressure incremental learning (PBIL), that explicitly applied through a number of generations, the overall trend maintains the statistics contained in a GA’s is towards higher fitness chromosomes. Mutations are population, but which abstracts away the used to help preserve diversity in the population. Muta- crossover operator and redefines the role of tions introduce random changes into the chromosomes. A the population. This results in PBIL being good overview of GAs can be found in [Goldberg, 1989] simpler, both computationally and theoreti- [De Jong, 1975]. cally, than the GA. Empirical results reported elsewhere show that PBIL is faster Although there has recently been some controversy in the and more effective than the GA on a large set GA community as to whether GAs should be used for of commonly used benchmark problems. static function optimization, a large amount of research Here we present results on a problem custom has been, and continues to be, conducted in this direction. designed to benefit both from the GA’s cross- [De Jong, 1992] claims that the GA is not a function opti- over operator and from its use of a popula- mizer, and that typical GAs which are used for function tion.
    [Show full text]
  • Genetic Algorithm: Reviews, Implementations, and Applications
    Paper— Genetic Algorithm: Reviews, Implementation and Applications Genetic Algorithm: Reviews, Implementations, and Applications Tanweer Alam() Faculty of Computer and Information Systems, Islamic University of Madinah, Saudi Arabia [email protected] Shamimul Qamar Computer Engineering Department, King Khalid University, Abha, Saudi Arabia Amit Dixit Department of ECE, Quantum School of Technology, Roorkee, India Mohamed Benaida Faculty of Computer and Information Systems, Islamic University of Madinah, Saudi Arabia How to cite this article? Tanweer Alam. Shamimul Qamar. Amit Dixit. Mohamed Benaida. " Genetic Al- gorithm: Reviews, Implementations, and Applications.", International Journal of Engineering Pedagogy (iJEP). 2020. Abstract—Nowadays genetic algorithm (GA) is greatly used in engineering ped- agogy as an adaptive technique to learn and solve complex problems and issues. It is a meta-heuristic approach that is used to solve hybrid computation chal- lenges. GA utilizes selection, crossover, and mutation operators to effectively manage the searching system strategy. This algorithm is derived from natural se- lection and genetics concepts. GA is an intelligent use of random search sup- ported with historical data to contribute the search in an area of the improved outcome within a coverage framework. Such algorithms are widely used for maintaining high-quality reactions to optimize issues and problems investigation. These techniques are recognized to be somewhat of a statistical investigation pro- cess to search for a suitable solution or prevent an accurate strategy for challenges in optimization or searches. These techniques have been produced from natu- ral selection or genetics principles. For random testing, historical information is provided with intelligent enslavement to continue moving the search out from the area of improved features for processing of the outcomes.
    [Show full text]
  • A Sequential Hybridization of Genetic Algorithm and Particle Swarm Optimization for the Optimal Reactive Power Flow
    sustainability Article A Sequential Hybridization of Genetic Algorithm and Particle Swarm Optimization for the Optimal Reactive Power Flow Imene Cherki 1,* , Abdelkader Chaker 1, Zohra Djidar 1, Naima Khalfallah 1 and Fadela Benzergua 2 1 SCAMRE Laboratory, ENPO-MA National Polytechnic School of Oran Maurice Audin, Oran 31000, Algeria 2 Departments of Electrical Engineering, University of Science and Technology of Oran Mohamed Bodiaf, Oran 31000, Algeria * Correspondence: [email protected] Received: 21 June 2019; Accepted: 12 July 2019; Published: 16 July 2019 Abstract: In this paper, the problem of the Optimal Reactive Power Flow (ORPF) in the Algerian Western Network with 102 nodes is solved by the sequential hybridization of metaheuristics methods, which consists of the combination of both the Genetic Algorithm (GA) and the Particle Swarm Optimization (PSO). The aim of this optimization appears in the minimization of the power losses while keeping the voltage, the generated power, and the transformation ratio of the transformers within their real limits. The results obtained from this method are compared to those obtained from the two methods on populations used separately. It seems that the hybridization method gives good minimizations of the power losses in comparison to those obtained from GA and PSO, individually, considered. However, the hybrid method seems to be faster than the PSO but slower than GA. Keywords: reactive power flow; metaheuristic methods; metaheuristic hybridization; genetic algorithm; particles swarms; electrical network 1. Introduction The objective of any company producing and distributing electrical energy is to ensure that the required power is available at all points and at all times.
    [Show full text]
  • An Improved Adaptive Genetic Algorithm for Two-Dimensional Rectangular Packing Problem
    applied sciences Article An Improved Adaptive Genetic Algorithm for Two-Dimensional Rectangular Packing Problem Yi-Bo Li 1, Hong-Bao Sang 1,* , Xiang Xiong 1 and Yu-Rou Li 2 1 State Key Laboratory of Precision Measuring Technology and Instruments, Tianjin University, Tianjin 300072, China; [email protected] (Y.-B.L.); [email protected] (X.X.) 2 Canterbury School, 101 Aspetuck Ave, New Milford, CT 06776, USA; [email protected] * Correspondence: [email protected] Abstract: This paper proposes the hybrid adaptive genetic algorithm (HAGA) as an improved method for solving the NP-hard two-dimensional rectangular packing problem to maximize the filling rate of a rectangular sheet. The packing sequence and rotation state are encoded in a two- stage approach, and the initial population is constructed from random generation by a combination of sorting rules. After using the sort-based method as an improved selection operator for the hybrid adaptive genetic algorithm, the crossover probability and mutation probability are adjusted adaptively according to the joint action of individual fitness from the local perspective and the global perspective of population evolution. The approach not only can obtain differential performance for individuals but also deals with the impact of dynamic changes on population evolution to quickly find a further improved solution. The heuristic placement algorithm decodes the rectangular packing sequence and addresses the two-dimensional rectangular packing problem through continuous iterative optimization. The computational results of a wide range of benchmark instances from zero- waste to non-zero-waste problems show that the HAGA outperforms those of two adaptive genetic algorithms from the related literature.
    [Show full text]
  • Chapter 12 Gene Selection and Sample Classification Using a Genetic Algorithm/K-Nearest Neighbor Method
    Chapter 12 Gene Selection and Sample Classification Using a Genetic Algorithm/k-Nearest Neighbor Method Leping Li and Clarice R. Weinberg Biostatistics Branch, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA, e-mail: {li3,weinberg}@niehs.nih.gov 1. INTRODUCTION Advances in microarray technology have made it possible to study the global gene expression patterns of tens of thousands of genes in parallel (Brown and Botstein, 1999; Lipshutz et al., 1999). Such large scale expression profiling has been used to compare gene expressions in normal and transformed human cells in several tumors (Alon et al., 1999; Gloub et al., 1999; Alizadeh et al., 2000; Perou et al., 2000; Bhattacharjee et al., 2001; Ramaswamy et al., 2001; van’t Veer et al., 2002) and cells under different conditions or environments (Ooi et al., 2001; Raghuraman et al., 2001; Wyrick and Young, 2002). The goals of these experiments are to identify differentially expressed genes, gene-gene interaction networks, and/or expression patterns that may be used to predict class membership for unknown samples. Among these applications, class prediction has recently received a great deal of attention. Supervised class prediction first identifies a set of discriminative genes that differentiate different categories of samples, e.g., tumor versus normal, or chemically exposed versus unexposed, using a learning set with known classification. The selected set of discriminative genes is subsequently used to predict the category of unknown samples. This method promises both refined diagnosis of disease subtypes, including markers for prognosis and better targeted treatment, and improved understanding of disease and toxicity processes at the cellular level.
    [Show full text]
  • A Note on Evolutionary Algorithms and Its Applications
    Bhargava, S. (2013). A Note on Evolutionary Algorithms and Its Applications. Adults Learning Mathematics: An International Journal, 8(1), 31-45 A Note on Evolutionary Algorithms and Its Applications Shifali Bhargava Dept. of Mathematics, B.S.A. College, Mathura (U.P)- India. <[email protected]> Abstract This paper introduces evolutionary algorithms with its applications in multi-objective optimization. Here elitist and non-elitist multiobjective evolutionary algorithms are discussed with their advantages and disadvantages. We also discuss constrained multiobjective evolutionary algorithms and their applications in various areas. Key words: evolutionary algorithms, multi-objective optimization, pareto-optimality, elitist. Introduction The term evolutionary algorithm (EA) stands for a class of stochastic optimization methods that simulate the process of natural evolution. The origins of EAs can be traced back to the late 1950s, and since the 1970’s several evolutionary methodologies have been proposed, mainly genetic algorithms, evolutionary programming, and evolution strategies. All of these approaches operate on a set of candidate solutions. Using strong simplifications, this set is subsequently modified by the two basic principles of evolution: selection and variation. Selection represents the competition for resources among living beings. Some are better than others and more likely to survive and to reproduce their genetic information. In evolutionary algorithms, natural selection is simulated by a stochastic selection process. Each solution is given a chance to reproduce a certain number of times, dependent on their quality. Thereby, quality is assessed by evaluating the individuals and assigning them scalar fitness values. The other principle, variation, imitates natural capability of creating “new” living beings by means of recombination and mutation.
    [Show full text]
  • Genetic Algorithm by Ben Shedlofsky Executive Summary the Genetic
    Genetic Algorithm By Ben Shedlofsky Executive Summary The Genetic algorithm is a way using computers to figure out how a population evolves genetically. These understandings of how a population evolves towards a more healthy population; as well as how it evolves through breeding purposes. Each generation develops a new offspring and some offspring converge towards a healthier population and some don’t at all. This algorithm helps figure out in mathematical terms how a population breeds through a computer simulation. The major problem was understanding step by step how a Genetic algorithm works and how it doesn’t work. The other problem was figuring out to present it to the audience without the audience not understanding what I presented or some magic show of how it works. In a computational format the Genetic Algorithm uses binary or base 2 instead of using base 10. This makes it a little harder to read and understand from a business end user. Once you understand that there are just 1’s and 0’s instead of 0 thru 9 digits, you start to recognize that this uses a computational system to derive populations. Breeding for each generation happens either by two parents or by one parent. The two parents are called crossover and usually happen the most. While the one parent or mutation usually happen very low percentage. This is a very similar makeup to a biological population like monkey breeding amongst themselves or single cell organisms breeding amongst themselves too. Each one of us has genes or trait that makes us who we are and these differences in our genes or traits make us unique to the individual.
    [Show full text]
  • Self-Adaptation of Mutation Operator and Probability for Permutation Representations in Genetic Algorithms
    Self-Adaptation of Mutation Operator and Probability for Permutation Representations in Genetic Algorithms Martin Serpell [email protected] Department of Computer Science, University of the West of England, Bristol, BS161QY, United Kingdom James E. Smith [email protected] Department of Computer Science, University of the West of England, Bristol, BS161QY, United Kingdom Abstract The choice of mutation rate is a vital factor in the success of any genetic algorithm (GA), and for permutation representations this is compounded by the availability of several alternative mutation operators. It is now well understood that there is no one “optimal choice”; rather, the situation changes per problem instance and during evolution. This paper examines whether this choice can be left to the processes of evolution via self- adaptation, thus removing this nontrivial task from the GA user and reducing the risk of poor performance arising from (inadvertent) inappropriate decisions. Self-adaptation has been proven successful for mutation step sizes in the continuous domain, and for the probability of applying bitwise mutation to binary encodings; here we examine whether this can translate to the choice and parameterisation of mutation operators for permutation encodings. We examine one method for adapting the choice of operator during runtime, and several different methods for adapting the rate at which the chosen operator is applied. In order to evaluate these algorithms, we have used a range of benchmark TSP prob- lems. Of course this paper is not intended to present a state of the art in TSP solvers; rather, we use this well known problem as typical of many that require a permutation encoding, where our results indicate that self-adaptation can prove beneficial.
    [Show full text]
  • A Paradigm for Genetically Breeding Populations of Computer Programs to Solve Problems
    Computer Science Department June 1990 GENETIC PROGRAMMING: A PARADIGM FOR GENETICALLY BREEDING POPULATIONS OF COMPUTER PROGRAMS TO SOLVE PROBLEMS John R. Koza ([email protected]) Computer Science Department Stanford University Margaret Jacks Hall Stanford, CA 94305 TABLE OF CONTENTS 1............ INTRODUCTION AND OVERVIEW 1 1.1. ........EXAMPLES OF PROBLEMS REQUIRING DISCOVERY OF A COMPUTER PROGRAM 1 1.2. ........SOLVING PROBLEMS REQUIRING DISCOVERY OF A COMPUTER PROGRAM 3 2............ BACKGROUND ON GENETIC ALGORITHMS 6 3............ THE ÒGENETIC PROGRAMMINGÓ PARADIGM 8 3.1. ........THE STRUCTURES UNDERGOING ADAPTATION 8 3.2. ........THE SEARCH SPACE 10 3.3. ........THE INITIAL STRUCTURES 10 3.4. ........THE FITNESS FUNCTION 10 3.5. ........THE OPERATIONS THAT MODIFY THE STRUCTURES 11 3.5.1....... THE FITNESS PROPORTIONATE REPRODUCTION OPERATION 11 3.5.2....... THE CROSSOVER (RECOMBINATION) OPERATION 12 3.6. ........THE STATE OF THE SYSTEM 13 3.7. ........IDENTIFYING THE RESULTS AND TERMINATING THE ALGORITHM 14 3.8. ........THE PARAMETERS THAT CONTROL THE ALGORITHM 14 4............ EXPERIMENTAL RESULTS 16 4.1. ........MACHINE LEARNING OF A FUNCTION 16 4.1.1....... BOOLEAN 11-MULTIPLEXER FUNCTION 16 4.1.2 ......THE BOOLEAN 6-MULTIPLEXER AND 3-MULTIPLEXER FUNCTIONS 24 4.1.3 ......NON-RANDOMNESS OF THESE RESULTS 24 4.2. ........PLANNING 27 4.2.1. .....BLOCK STACKING 27 4.2.2. .....CORRECTLY STACKING BLOCKS 29 4.2.3....... EFFICIENTLY STACKING BLOCKS 31 4.2.4. .....A PARSIMONIOUS EXPRESSION FOR STACKING BLOCKS 32 4.2.5. .....ARTIFICIAL ANT - TRAVERSING THE JOHN MUIR TRAIL 33 4.3. ........SYMBOLIC FUNCTION IDENTIFICATION 35 4.3.1....... SEQUENCE INDUCTION 35 4.3.1.1...
    [Show full text]
  • Self-Adaptation Mechanism to Control
    International Journal of Computer Science & Information Technology (IJCSIT) Vol 3, No 4, August 2011 SELF -ADAPTATION MECHANISM TO CONTROL THE DIVERSITY OF THE POPULATION IN GENETIC ALGORITHM Chaiwat Jassadapakorn 1 and Prabhas Chongstitvatana 2 Department of Computer Engineering, Chulalongkorn University, Bangkok, Thailand [email protected], [email protected] ABSTRACT One of the problems in applying Genetic Algorithm is that there is some situation where the evolutionary process converges too fast to a solution which causes it to be trapped in local optima. To overcome this problem, a proper diversity in the candidate solutions must be determined. Most existing diversity- maintenance mechanisms require a problem specific knowledge to setup parameters properly. This work proposes a method to control diversity of the population without explicit parameter setting. A self- adaptation mechanism is proposed based on the competition of preference characteristic in mating. It can adapt the population toward proper diversity for the problems. The experiments are carried out to measure the effectiveness of the proposed method based on nine well-known test problems. The performance of the adaptive method is comparable to traditional Genetic Algorithm with the best parameter setting. KEYWORDS Genetic Algorithm, Population Diversity, Diversity Control 1. INTRODUCTION Genetic Algorithm (GA) is a probabilistic search and optimization algorithm. The GA begins with a random population -- a set of solutions. A solution (or an individual) is represented by a fixed-length binary string. A solution is assigned a fitness value that indicates the quality of solution. The high-quality solutions are more likely to be selected to perform solution recombination.
    [Show full text]
  • Evolution Strategies
    Evolution Strategies Nikolaus Hansen, Dirk V. Arnold and Anne Auger February 11, 2015 1 Contents 1 Overview 3 2 Main Principles 4 2.1 (µ/ρ +; λ) Notation for Selection and Recombination.......................5 2.2 Two Algorithm Templates......................................6 2.3 Recombination Operators......................................7 2.4 Mutation Operators.........................................8 3 Parameter Control 9 3.1 The 1/5th Success Rule....................................... 11 3.2 Self-Adaptation........................................... 11 3.3 Derandomized Self-Adaptation................................... 12 3.4 Non-Local Derandomized Step-Size Control (CSA)........................ 12 3.5 Addressing Dependencies Between Variables............................ 14 3.6 Covariance Matrix Adaptation (CMA)............................... 14 3.7 Natural Evolution Strategies.................................... 15 3.8 Further Aspects............................................ 18 4 Theory 19 4.1 Lower Runtime Bounds....................................... 20 4.2 Progress Rates............................................ 21 4.2.1 (1+1)-ES on Sphere Functions............................... 22 4.2.2 (µ/µ, λ)-ES on Sphere Functions.............................. 22 4.2.3 (µ/µ, λ)-ES on Noisy Sphere Functions........................... 24 4.2.4 Cumulative Step-Size Adaptation.............................. 24 4.2.5 Parabolic Ridge Functions.................................. 25 4.2.6 Cigar Functions.......................................
    [Show full text]