Probabilistic Graph Programs for Randomised and Evolutionary Algorithms
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Reinforcement Learning in Supervised Problem DomainsTechnische Universität München Fakultät für Informatik Lehrstuhl VI – Echtzeitsysteme und Robotik reinforcement learning in supervised problem domains Thomas F. Rückstieß Vollständiger Abdruck der von der Fakultät für Informatik der Technischen Universität München zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften (Dr. rer. nat.) genehmigten Dissertation. Vorsitzender: Univ.-Prof. Dr. Daniel Cremers Prüfer der Dissertation 1. Univ.-Prof. Dr. Patrick van der Smagt 2. Univ.-Prof. Dr. Hans Jürgen Schmidhuber Die Dissertation wurde am 30. 06. 2015 bei der Technischen Universität München eingereicht und durch die Fakultät für Informatik am 18. 09. 2015 angenommen. Thomas Rückstieß: Reinforcement Learning in Supervised Problem Domains © 2015 email: [email protected] ABSTRACT Despite continuous advances in computing technology, today’s brute for- ce data processing approaches may not provide the necessary advantage to win the race against the ever-growing amount of data that can be wit- nessed over the last decades. In this thesis, we discuss novel methods and algorithms that are capable of directing attention to relevant details and analysing it in sequence to overcome the processing bottleneck and to keep up with this data explosion. In the first of three parts, a novel exploration technique for Policy Gradi- ent Reinforcement Learning is presented which replaces traditional ad- ditive random exploration with state-dependent exploration, exploring on a higher, more strategic level. We will show how this new exploration method converges faster and finds better global solutions than random exploration can. The second part of this thesis will introduce the concept of “data con- sumption” and discuss means to minimise it in supervised learning tasks by deriving classification as a sequential decision process and ma- king it accessible to Reinforcement Learning methods.
- 
												  A Comprehensive Case Study: an Examination of Machine Learning and Connectionist AlgorithmsA Comprehensive Case Study: An Examination of Machine Learning and Connectionist Algorithms A Thesis Presented to the Department of Computer Science Brigham Young University In Partial Fulfillment of the Requirements for the Degree Master of Science Frederick Zarndt [email protected] June 1995 Contents 1. Introduction . 1 2. Databases . 6 2.1 Database Names . 7 2.2 Database Statistics . 11 3. Learning Models . 20 3.1 Decision Trees . 20 3.2 Nearest Neighbor . 21 3.3 Statistical . 21 3.4 Rule Based . 22 3.6 Neural Networks . 22 4. Method . 24 4.1 Cross-Validation . 24 4.2 Consistency of Training/Testing Sets and Presentation Order . 26 4.3 Attribute Discretization . 26 4.4 Unknown Attribute Values . 28 4.5 Learning Model Parameters . 30 5. Results . 31 6. Limitations . 52 7. Conclusion . 53 Appendix A: Standard Database Format (SDF) . 55 Appendix B: Tools . 58 B.1 Database Tools . 58 B.1.1 verify . 58 B.1.2 verify.ksh . 58 B.1.3 dsstats . 58 B.1.4 dsstats.ksh . 59 B.1.5 xlate . 59 B.2 Test Tools . 61 B.2.1 xval.<algorithm> . 61 B.2.2 test-ml . 66 B.2.3 test-conn . 67 B.3 Result Extraction Tools . 68 B.3.1 average . 68 B.3.2 summarize . 68 Appendix C: Database Generators . 69 Appendix D. Learning Model Parameters . 70 D.1 Decision Trees . 70 D.2 Nearest Neighbor . 70 D.3 Statistical . 71 D.4 Rule Based . 71 D.5 Neural Networks . 72 Appendix E. Database Descriptions . 80 Bibliography . 85 A Comprehensive Case Study: An Examination of Machine Learning and Connectionist Algorithms Frederick Zarndt Department of Computer Science M.S.
- 
												  On the Parameterization of Cartesian Genetic ProgrammingOn the Parameterization of Cartesian Genetic Programming Paul Kaufmann Roman Kalkreuth Gutenberg School of Management & Economics Department of Computer Science Johannes Gutenberg University, Mainz, Germany Dortmund Technical University, Germany [email protected] [email protected] Abstract—In this work, we present a detailed analysis of the (1 + 4) Evolutionary Strategy (ES) that prefers off-spring Cartesian Genetic Programming (CGP) parametrization of the individuals over parents of the same fitness. The consequence selection scheme (µ + λ), and the levels back parameter l. We of the selection preference is that inactive genes that have been also investigate CGP’s mutation operator by decomposing it into a self-recombination, node function mutation, and inactive gene touched by the randomization operator do not contribute to a randomization operators. We perform experiments in the Boolean change of the fitness and therefore propagate always to the and symbolic regression domains with which we contribute to next generation. This circumstance allows a steady influx of the knowledge about efficient parametrization of two essential random genetic material, and we assume that it supports the parameters of CGP and the mutation operator. self-recombination operator to explore close and distant parts Index Terms—Cartesian Genetic Programming of the search space, i.e., increase its search horizon. The search horizon of the self-recombination operator can I. INTRODUCTION also be improved using another inner mechanism of CGP. Genetic Programming (GP) can be considered as nature- Goldman and Punch showed that active genes are distributed inspired search heuristic, which allows for the automatic unevenly in a single-line CGP model [15].
- 
												  Dynamic Page Based Crossover in Linear Genetic ProgrammingDynamic Page Based Crossover in Linear Genetic Programming M.I. Heywood, A.N. Zincir-Heywood Abstract. Page-based Linear Genetic Programming (GP) is take the form of a ‘linear’ list of instructions [5-9]. proposed in which individuals are described in terms of a Execution of an individual therefore mimics the process of number of pages. Pages are expressed in terms of a fixed program execution normally associated with a simple number of instructions, constant for all individuals in the register machine as opposed to traversing a tree structure population. Pairwise crossover results in the swapping of (leaves representing an input, the root node the output). single pages, thus individuals are of a fixed number of instructions. Head-to-head comparison with Tree structured Each instruction is defined in terms of an opcode and GP and block-based Linear GP indicates that the page-based operand, and modifies the contents of internal registers, approach evolves succinct solutions without penalizing memory and program counter. generalization ability. The second component of interest is the crossover operator. Biologically, crossover is not ‘blind,’ Keywords: Genetic Programming, Homologous Crossover, Linear chromosomes exist as distinct pairs, each with a matching Structures, Benchmarking. homologous partner [10]. Thus, only when chromosome sequences are aligned may crossover take place; the entire process being referred to as meiosis [10]. Until recently, I. INTRODUCTION however, crossover as applied in GP has been blind. Typically, the stochastic nature of crossover results in A Darwinism perspective on natural selection implies that a individuals whose instruction count continues to increase set of individuals compete for a finite set of resources, with with generation without a corresponding improvement to individuals surviving more frequently when they performance.
- 
												  Outline of Machine LearningOutline of machine learning The following outline is provided as an overview of and topical guide to machine learning: Machine learning – subfield of computer science[1] (more particularly soft computing) that evolved from the study of pattern recognition and computational learning theory in artificial intelligence.[1] In 1959, Arthur Samuel defined machine learning as a "Field of study that gives computers the ability to learn without being explicitly programmed".[2] Machine learning explores the study and construction of algorithms that can learn from and make predictions on data.[3] Such algorithms operate by building a model from an example training set of input observations in order to make data-driven predictions or decisions expressed as outputs, rather than following strictly static program instructions. Contents What type of thing is machine learning? Branches of machine learning Subfields of machine learning Cross-disciplinary fields involving machine learning Applications of machine learning Machine learning hardware Machine learning tools Machine learning frameworks Machine learning libraries Machine learning algorithms Machine learning methods Dimensionality reduction Ensemble learning Meta learning Reinforcement learning Supervised learning Unsupervised learning Semi-supervised learning Deep learning Other machine learning methods and problems Machine learning research History of machine learning Machine learning projects Machine learning organizations Machine learning conferences and workshops Machine learning publications
- 
												  A Survey of Bio Inspired Optimization AlgorithmsInternational Journal of Soft Computing and Engineering (IJSCE) ISSN: 2231-2307, Volume-2, Issue-2, May 2012 A Survey of Bio inspired Optimization Algorithms Binitha S, S Siva Sathya example for optimization, because if we closely examine Abstract—Nature is of course a great and immense source of each and every features or phenomenon in nature it always inspiration for solving hard and complex problems in computer find the optimal strategy, still addressing complex interaction science since it exhibits extremely diverse, dynamic, robust, among organisms ranging from microorganism to fully complex and fascinating phenomenon. It always finds the optimal solution to solve its problem maintaining perfect fledged human beings, balancing the ecosystem, maintaining balance among its components. This is the thrust behind bio diversity, adaptation, physical phenomenon like river inspired computing. Nature inspired algorithms are meta formation, forest fire ,cloud, rain .etc..Even though the heuristics that mimics the nature for solving optimization strategy behind the solution is simple the results are amazing. problems opening a new era in computation .For the past Nature is the best teacher and its designs and capabilities are decades ,numerous research efforts has been concentrated in extremely enormous and mysterious that researchers are this particular area. Still being young and the results being very amazing, broadens the scope and viability of Bio Inspired trying to mimic nature in technology. Also the two fields have Algorithms (BIAs) exploring new areas of application and more a much stronger connection since, it seems entirely reasonable opportunities in computing. This paper presents a broad that new or persistent problems in computer science could overview of biologically inspired optimization algorithms, have a lot in common with problems nature has encountered grouped by the biological field that inspired each and the areas and resolved long ago.
- 
												  Retaining Experience and Growing SolutionsRetaining Experience and Growing Solutions Robyn Ffrancon1 Abstract Generally, when genetic programming (GP) is used for function synthesis any valuable experience gained by the system is lost from one problem to the next, even when the problems are closely related. With the aim of developing a system which retains beneficial experience from problem to problem, this paper introduces the novel Node-by-Node Growth Solver (NNGS) algorithm which features a component, called the controller, which can be adapted and improved for use across a set of related problems. NNGS grows a single solution tree from root to leaves. Using semantic backpropagation and acting locally on each node in turn, the algorithm employs the controller to assign subsequent child nodes until a fully formed solution is generated. The aim of this paper is to pave a path towards the use of a neural network as the controller component and also, separately, towards the use of meta-GP as a mechanism for improving the controller component. A proof-of-concept controller is discussed which demonstrates the success and potential of the NNGS algorithm. In this case, the controller constitutes a set of hand written rules which can be used to deterministically and greedily solve standard Boolean function synthesis benchmarks. Even before employing machine learning to improve the controller, the algorithm vastly outperforms other well known recent algorithms on run times, maintains comparable solution sizes, and has a 100% success rate on all Boolean function synthesis benchmarks tested so far. Keywords Genetic Programming — Semantic Backpropagation — Local Search — Meta-GP — Neural Networks 1TAO team, INRIA Saclay, Univ.
- 
												  Probabilistic Graph Programs for Randomised and Evolutionary AlgorithmsProbabilistic Graph Programs for Randomised and Evolutionary Algorithms Timothy Atkinson?, Detlef Plump, and Susan Stepney Department of Computer Science, University of York, UK ftja511,detlef.plump,[email protected] Abstract. We extend the graph programming language GP 2 with prob- abilistic constructs: (1) choosing rules according to user-defined probabil- ities and (2) choosing rule matches uniformly at random. We demonstrate these features with graph programs for randomised and evolutionary al- gorithms. First, we implement Karger's minimum cut algorithm, which contracts randomly selected edges; the program finds a minimum cut with high probability. Second, we generate random graphs according to the G(n; p) model. Third, we apply probabilistic graph programming to evolutionary algorithms working on graphs; we benchmark odd-parity digital circuit problems and show that our approach significantly out- performs the established approach of Cartesian Genetic Programming. 1 Introduction GP 2 is a rule-based graph programming language which frees programmers from handling low-level data structures for graphs. The language comes with a concise formal semantics and aims to support formal reasoning on programs; see, for example, [22, 19, 11]. The semantics of GP 2 is nondeterministic in two respects: to execute a rule set fr1; : : : ; rng on a host graph G, any of the rules applicable to G can be picked and applied; and to apply a rule r, any of the valid matches of r's left-hand side in the host graph can be chosen. GP 2's compiler [4] has been designed by prioritising speed over completeness, thus it simply chooses the first applicable rule in textual order and the first match that is found.
- 
											Parametrizing Cartesian Genetic Programming: an Empirical StudyParametrizing Cartesian Genetic Programming: An Empirical Study Paul Kaufmann1 and Roman Kalkreuth2 1 Department of Computer Science, Paderborn University, Germany 2 Department of Computer Science, University of Dortmund, Germany Abstract. Since its introduction two decades ago, the way researchers parameterized and optimized Cartesian Genetic Programming (CGP) remained almost unchanged. In this work we investigate non-standard parameterizations and optimization algorithms for CGP. We show that the conventional way of using CGP, i.e. configuring it as a single line optimized by an (1+4) Evolutionary Strategies-style search scheme, is a very good choice but that rectangular CGP geometries and more elab- orate metaheuristics, such as Simulated Annealing, can lead to faster convergence rates. 1 Introduction Almost two decades ago Miller, Thompson, Kalganova, and Fogarty presented first publications on CGP|an encoding model inspired by the two-dimensional array of functional nodes connected by feed-forward wires of an Field Pro- grammable Gate Array (FPGA) device [6, 1]. CGP has multiple pivotal advan- tages: { CGP comprises an inherent mechanism for the design of simple hierarchical functions. While in many optimization systems such a mechanism has to be implemented explicitly, in CGP multiple feed-forwards wires may originate from the same output of a functional node. This property can be very useful for the evolution of goal functions that may benefit from repetitive inner structures. { The maximal size of encoded solutions is bound, saving CGP to some extent from \bloat" that is characteristic to Genetic Programming (GP). { CGP offers an implicit way of propagating redundant information through- out the generations. This mechanism can be used as a source of randomness and a memory for evolutionary artifacts.
- 
												  The Hierarchical Fair Competition (HFC) Framework for Sustainable Evolutionary AlgorithmsThe Hierarchical Fair Competition (HFC) Framework for Sustainable Evolutionary Algorithms Jianjun Hu [email protected] Department of Computer Science and Engineering, Erik Goodman [email protected] Kisung Seo [email protected] Zhun Fan [email protected] Department of Electrical and Computer Engineering, Rondal Rosenberg [email protected] Department of Mechanical Engineering, Michigan State University, East Lansing, MI, 48823, USA Abstract Many current Evolutionary Algorithms (EAs) suffer from a tendency to converge pre- maturely or stagnate without progress for complex problems. This may be due to the loss of or failure to discover certain valuable genetic material or the loss of the capa- bility to discover new genetic material before convergence has limited the algorithm’s ability to search widely. In this paper, the Hierarchical Fair Competition (HFC) model, including several variants, is proposed as a generic framework for sustainable evo- lutionary search by transforming the convergent nature of the current EA framework into a non-convergent search process. That is, the structure of HFC does not allow the convergence of the population to the vicinity of any set of optimal or locally optimal solutions. The sustainable search capability of HFC is achieved by ensuring a contin- uous supply and the incorporation of genetic material in a hierarchical manner, and by culturing and maintaining, but continually renewing, populations of individuals of intermediate fitness levels. HFC employs an assembly-line structure in which subpop- ulations are hierarchically organized into different fitness levels, reducing the selection pressure within each subpopulation while maintaining the global selection pressure to help ensure the exploitation of the good genetic material found.
- 
												  On Evolutionary Synthesis of Compact Polymorphic Combinational Circuits⋆On Evolutionary Synthesis of Compact Polymorphic Combinational Circuits⋆ ZBYSEKˇ GAJDA1†,LUKA´ Sˇ SEKANINA1‡ Faculty of Information Technology, Brno University of Technology, Czech Republic Polymorphic gates are unconventional circuit components that are not supported by existing synthesis tools. This article presents new methods for synthesis of polymorphic circuits. Proposed methods, based on polymorphic binary decision diagrams and polymorphic multiplexing, extend the ordinary circuit represen- tations with the aim of including polymorphic gates. In order to reduce the number of gates in circuits synthesized using pro- posed methods, an evolutionary optimization based on Cartesian Genetic Programming (CGP) is implemented. The implemen- tations of polymorphic circuits optimized by CGP represent the best known solutions if the number of gates is considered as the decision criterion. Key words: polymorphic circuit, digital circuit synthesis, evolutionary computing, genetic programming 1 INTRODUCTION Polymorphic electronics was introduced by A. Stoica’s group at NASA Jet Propulsion Laboratory as a new class of electronic devices that exhibit a new ⋆ This is authors’ manuscript of the paper: Gajda Zbysek, Sekanina Lukas: On Evolutionary Synthesis of Compact Polymorphic Combinational Circuits. Journal of Multiple-Valued Logic and Soft Computing, Vol. 17, No. 6, 2011, p. 607-631. For the final version, see Old City Pub- lishing, Inc. at http://www.oldcitypublishing.com/MVLSC/MVLSCcontents/MVLSCv17n5- 6contents.html † email: [email protected] ‡ email: [email protected] 1 style of (re)configuration [28]. Polymorphic gates play the central role in the polymorphic electronics. A polymorphic gate is capable of switching among two or more logic functions. However, selection of the function is performed unconventionally.
- 
												  Reversing Operators for Semantic BackpropagationEcole´ Polytechnique Masters Thesis Reversing Operators for Semantic Backpropagation Author: Supervisor: Robyn Ffrancon Marc Schoenauer A thesis submitted in fulfillment of the requirements for the degree of Masters in Complex Systems Science This work was conducted during a 6 month internship at TAO team, INRIA, Saclay, France June 2015 ECOLE´ POLYTECHNIQUE Abstract Complex Systems Science Masters Reversing Operators for Semantic Backpropagation by Robyn Ffrancon Boolean function synthesis problems have served as some of the most well studied bench- marks within Genetic Programming (GP). Recently, these problems have been addressed using Semantic Backpropagation (SB) which was introduced in GP so as to take into account the semantics (outputs over all fitness cases) of a GP tree at all intermediate states of the program execution, i.e. at each node of the tree. The mappings chosen for reversing the operators used within a GP tree are crucially important to SB. This thesis describes the work done in designing and testing three novel SB algorithms for solving Boolean and Finite Algebra function synthesis problems. These algorithms generally perform significantly better than other well known algorithms on run times and solution sizes. Furthermore, the third algorithms is deterministic, a property which makes it unique within the domain. Acknowledgements I wish to thank my internship supervisor Marc Schoenauer for interesting discussions and excellent supervision during my six month masters internship at INRIA, Saclay, France where this work