Adaptive Evaluation of Non-Strict Programs

Total Page:16

File Type:pdf, Size:1020Kb

Adaptive Evaluation of Non-Strict Programs UCAM-CL-TR-730 Technical Report ISSN 1476-2986 Number 730 Computer Laboratory Adaptive evaluation of non-strict programs Robert J. Ennals August 2008 15 JJ Thomson Avenue Cambridge CB3 0FD United Kingdom phone +44 1223 763500 http://www.cl.cam.ac.uk/ c 2008 Robert J. Ennals This technical report is based on a dissertation submitted June 2004 by the author for the degree of Doctor of Philosophy to the University of Cambridge, Kings College. Technical reports published by the University of Cambridge Computer Laboratory are freely available via the Internet: http://www.cl.cam.ac.uk/techreports/ ISSN 1476-2986 Abstract Most popular programming languages are strict. In a strict language, the binding of a variable to an expression coincides with the evaluation of the expression. Non-strict languages attempt to make life easier for programmers by decoupling expression binding and expression evaluation. In a non-strict language, a variable can be bound to an unevaluated expression, and such expressions can be passed around just like values in a strict language. This separation allows the programmer to declare a variable at the point that makes most logical sense, rather than at the point at which its value is known to be needed. Non-strict languages are usually evaluated using a technique called Lazy Evaluation. Lazy Evaluation will only evaluate an expression when its value is known to be needed. While Lazy Evaluation minimises the total number of expressions evaluated, it imposes a considerable bookkeeping overhead, and has unpredictable space behaviour. In this thesis, we present a new evaluation strategy which we call Optimistic Evaluation. Optimistic Evaluation blends lazy and eager evaluation under the guidance of an online pro- filer. The online profiler observes the running program and decides which expressions should be evaluated lazily, and which should be evaluated eagerly. We show that the worst case per- formance of Optimistic Evaluation relative to Lazy Evaluation can be bounded with an upper bound chosen by the user. Increasing this upper bound allows the profiler to take greater risks and potentially achieve better average performance. This thesis describes both the theory and practice of Optimistic Evaluation. We start by giving an overview of Optimistic Evaluation. We go on to present formal model, which we use to justify our design. We then detail how we have implemented Optimistic Evaluation as part of an industrial-strength compiler. Finally, we provide experimental results to back up our claims. 3 Acknowledgments First of all, I would like to thank my supervisor, Simon Peyton Jones, who has provided in- valuable support, inspiration, and advice. Enormous thanks must go to Alan Mycroft, who has acted as my lab supervisor. I am also extremely grateful for the support of Microsoft Research, who have provided me not only with funding and equipment, but also with a supervisor, and a building full of interesting people. I would also like to thank Jan-Willem Maessen and Dave Scott, who provided suggestions on this thesis, and Manuel Chakravarty, Karl-Filip Fax´en, Fergus Henderson, Neil Johnson, Alan Lawrence, Simon Marlow, Greg Morisett, Nick Nethercote, Nenrik Nilsson, Andy Pitts, Norman Ramsey, John Reppy, Richard Sharp, and Jeremy Singer, who have provided useful suggestions during the course of this work. This work builds on top of years of work that many people have put into the Glasgow Haskell Compiler. I would like to thank all the people at Glasgow, at Microsoft, and elsewhere, who have worked on GHC over the years. Special thanks must go to Simon Marlow, who was always able to help me when I ran into deep dark corners in the GHC runtime system. 4 Contents 1 Introduction 9 1.1 LazyEvaluation.................................. 9 1.2 MixingEagerandLazyEvaluation . .... 10 1.3 OptimisticEvaluation. ... 12 1.4 StructureofthisThesis . ... 13 1.5 Contributions ................................... 13 I Overview 14 2 Optimistic Evaluation 15 2.1 SwitchableLetExpressions. .... 16 2.2 Abortion...................................... 17 2.3 ChunkyEvaluation ................................ 17 2.4 ProbabilityofaNestedSpeculationbeingUsed . ......... 19 2.5 ExceptionsandErrors.. .. .. .. .. .. .. .. .. 20 2.6 UnsafeInput/Output .............................. 20 3 Online Profiling 22 3.1 TheNeedforProfiling .............................. 22 3.2 Goodness ..................................... 23 3.3 CalculatingWastedWork . .. 24 3.4 BurstProfiling................................... 28 5 6 CONTENTS II Theory 30 4 Semantics 31 4.1 ASimpleLanguage................................ 32 4.2 OperationalSemantics . .. 33 4.3 DenotationalSemantics. ... 40 4.4 Soundness..................................... 46 5 A Cost Model for Non-Strict Evaluation 48 5.1 WhyWeNeedaCostModel ........................... 49 5.2 CostGraphs.................................... 51 5.3 ProducingaCostViewforaProgram . ... 60 5.4 RulesforEvaluation .............................. 62 5.5 AnOperationalSemantics . .. 66 5.6 DenotationalMeaningsofOperationalStates . ......... 71 6 Deriving an Online Profiler 75 6.1 CategorisingWork ................................ 76 6.2 Blame....................................... 80 6.3 BoundingWorstCasePerformance. .... 86 6.4 BurstProfiling................................... 89 III Implementation 95 7 The GHC Execution Model 96 7.1 TheHeap ..................................... 97 7.2 TheStack ..................................... 100 7.3 EvaluatingExpressions . 103 7.4 ImplementingtheEvaluationRules. ...... 105 7.5 OtherDetails ................................... 112 8 Switchable Let Expressions 113 8.1 SpeculativeEvaluationofaLet. ..... 114 8.2 FlatSpeculation................................. 119 8.3 Semi-Tagging................................... 122 8.4 ProblemswithLazyBlackholing . 125 CONTENTS 7 9 Online Profiling 128 9.1 RuntimeStateforProfiling . 128 9.2 ImplementingtheEvaluationRules. ...... 132 9.3 HeapProfiling................................... 137 9.4 FurtherDetails .................................. 139 10 Abortion 143 10.1WhentoAbort .................................. 143 10.2HowtoAbort ................................... 145 11 Debugging 149 11.1 HowtheDarkSideDoIt ............................. 150 11.2 FailingtoDebugLazyPrograms . 151 11.3 EliminatingTailCallElimination. ........ 151 11.4 OptimisticEvaluation. 152 11.5HsDebug ..................................... 153 IV Conclusions 154 12 Results 155 12.1 HowtheTestswereCarriedOut . 156 12.2Performance.................................... 157 12.3Profiling...................................... 159 12.4Metrics ...................................... 167 12.5Semi-Tagging ................................... 172 12.6HeapUsage .................................... 173 12.7Miscellanea .................................... 176 13 Related Work 181 13.1 StaticHybridStrategies. ..... 182 13.2EagerHaskell ................................... 187 13.3 Speculative Evaluation for Multiprocessor Parallelism.............. 189 13.4 SpeculationforUniprocessorParallelism . .......... 191 13.5ModelsofCost .................................. 193 13.6Profiling...................................... 196 13.7 DebuggersforNon-StrictLanguages . ....... 197 13.8 ReducingSpaceUsage . .. .. .. .. .. .. .. 199 13.9 OtherApproachestoEvaluation . ..... 200 8 CONTENTS 14 Conclusions 202 A Proof that Meaning is Preserved 205 A.1 EvaluationPreservesMeaning . 205 A.2 AbortionPreservesMeaning . 208 B Proof that Costed Meaning is Preserved 211 B.1 Lemmas...................................... 211 B.2 ProofofSoundnessforEvaluationRules . ....... 212 B.3 ProofofSoundnessforAbortionRules. ...... 214 C Proof that programWork predicts workDone 216 C.1 Preliminaries ................................... 216 C.2 ProofforEvaluationTransitions . ...... 217 C.3 ProofforAbortionTransitions . ..... 220 D Example HsDebug Debugging Logs 221 CHAPTER 1 Introduction Avoiding work is sometimes a waste of time. Non-strict languages are elegant, but they are also slow. Optimistic Evaluation is an imple- mentation technique that makes them faster. It does this by using a mixture of Lazy Evaluation and Eager Evaluation, with the exact blend decided by an online profiler. 1.1 Lazy Evaluation Lazy Evaluation is like leaving the washing-up until later. If you are never going to use your plates again then you can save considerable time by not washing them up. In the extreme case you may even avoid washing up a plate that is so dirty that it would have taken an infinite amount of time to get clean. However, if you are going to need to use your plates again, then leaving the washing up until later will waste time. By the time you eventually need to use your plates, the food on them will have stuck fast to the plate and will take longer to wash off. You will also have to make space in your kitchen to hold all the washing up that you have not yet done. Non-strict languages aim to make life easier for programmers by removing the need for a programmer to decide if and when an expression should be evaluated. If a programmer were to write the following: let x = E in E0 9 10 Chapter 1. Introduction then a strict language such as C [KR88] or ML [MTHM97] would evaluate the let expression eagerly. It would evaluate the E to a value, bind x to this value, and then evaluate E0. By contrast, a non-strict language such as Haskell [PHA+99] or Clean [BvEvLP87, NSvEP91] will not require that E be evaluated until x is known to be needed. Non-strict languages are usu-
Recommended publications
  • Comparative Studies of Programming Languages; Course Lecture Notes
    Comparative Studies of Programming Languages, COMP6411 Lecture Notes, Revision 1.9 Joey Paquet Serguei A. Mokhov (Eds.) August 5, 2010 arXiv:1007.2123v6 [cs.PL] 4 Aug 2010 2 Preface Lecture notes for the Comparative Studies of Programming Languages course, COMP6411, taught at the Department of Computer Science and Software Engineering, Faculty of Engineering and Computer Science, Concordia University, Montreal, QC, Canada. These notes include a compiled book of primarily related articles from the Wikipedia, the Free Encyclopedia [24], as well as Comparative Programming Languages book [7] and other resources, including our own. The original notes were compiled by Dr. Paquet [14] 3 4 Contents 1 Brief History and Genealogy of Programming Languages 7 1.1 Introduction . 7 1.1.1 Subreferences . 7 1.2 History . 7 1.2.1 Pre-computer era . 7 1.2.2 Subreferences . 8 1.2.3 Early computer era . 8 1.2.4 Subreferences . 8 1.2.5 Modern/Structured programming languages . 9 1.3 References . 19 2 Programming Paradigms 21 2.1 Introduction . 21 2.2 History . 21 2.2.1 Low-level: binary, assembly . 21 2.2.2 Procedural programming . 22 2.2.3 Object-oriented programming . 23 2.2.4 Declarative programming . 27 3 Program Evaluation 33 3.1 Program analysis and translation phases . 33 3.1.1 Front end . 33 3.1.2 Back end . 34 3.2 Compilation vs. interpretation . 34 3.2.1 Compilation . 34 3.2.2 Interpretation . 36 3.2.3 Subreferences . 37 3.3 Type System . 38 3.3.1 Type checking . 38 3.4 Memory management .
    [Show full text]
  • Application and Interpretation
    Programming Languages: Application and Interpretation Shriram Krishnamurthi Brown University Copyright c 2003, Shriram Krishnamurthi This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. If you create a derivative work, please include the version information below in your attribution. This book is available free-of-cost from the author’s Web site. This version was generated on 2007-04-26. ii Preface The book is the textbook for the programming languages course at Brown University, which is taken pri- marily by third and fourth year undergraduates and beginning graduate (both MS and PhD) students. It seems very accessible to smart second year students too, and indeed those are some of my most successful students. The book has been used at over a dozen other universities as a primary or secondary text. The book’s material is worth one undergraduate course worth of credit. This book is the fruit of a vision for teaching programming languages by integrating the “two cultures” that have evolved in its pedagogy. One culture is based on interpreters, while the other emphasizes a survey of languages. Each approach has significant advantages but also huge drawbacks. The interpreter method writes programs to learn concepts, and has its heart the fundamental belief that by teaching the computer to execute a concept we more thoroughly learn it ourselves. While this reasoning is internally consistent, it fails to recognize that understanding definitions does not imply we understand consequences of those definitions. For instance, the difference between strict and lazy evaluation, or between static and dynamic scope, is only a few lines of interpreter code, but the consequences of these choices is enormous.
    [Show full text]
  • Haskell Communities and Activities Report
    Haskell Communities and Activities Report http://tinyurl.com/haskcar Thirty Fourth Edition — May 2018 Mihai Maruseac (ed.) Chris Allen Christopher Anand Moritz Angermann Francesco Ariis Heinrich Apfelmus Gershom Bazerman Doug Beardsley Jost Berthold Ingo Blechschmidt Sasa Bogicevic Emanuel Borsboom Jan Bracker Jeroen Bransen Joachim Breitner Rudy Braquehais Björn Buckwalter Erik de Castro Lopo Manuel M. T. Chakravarty Eitan Chatav Olaf Chitil Alberto Gómez Corona Nils Dallmeyer Tobias Dammers Kei Davis Dimitri DeFigueiredo Richard Eisenberg Maarten Faddegon Dennis Felsing Olle Fredriksson Phil Freeman Marc Fontaine PÁLI Gábor János Michał J. Gajda Ben Gamari Michael Georgoulopoulos Andrew Gill Mikhail Glushenkov Mark Grebe Gabor Greif Adam Gundry Jennifer Hackett Jurriaan Hage Martin Handley Bastiaan Heeren Sylvain Henry Joey Hess Kei Hibino Guillaume Hoffmann Graham Hutton Nicu Ionita Judah Jacobson Patrik Jansson Wanqiang Jiang Dzianis Kabanau Nikos Karagiannidis Anton Kholomiov Oleg Kiselyov Ivan Krišto Yasuaki Kudo Harendra Kumar Rob Leslie David Lettier Ben Lippmeier Andres Löh Rita Loogen Tim Matthews Simon Michael Andrey Mokhov Dino Morelli Damian Nadales Henrik Nilsson Wisnu Adi Nurcahyo Ulf Norell Ivan Perez Jens Petersen Sibi Prabakaran Bryan Richter Herbert Valerio Riedel Alexey Radkov Vaibhav Sagar Kareem Salah Michael Schröder Christian Höner zu Siederdissen Ben Sima Jeremy Singer Gideon Sireling Erik Sjöström Chris Smith Michael Snoyman David Sorokin Lennart Spitzner Yuriy Syrovetskiy Jonathan Thaler Henk-Jan van Tuyl Tillmann Vogt Michael Walker Li-yao Xia Kazu Yamamoto Yuji Yamamoto Brent Yorgey Christina Zeller Marco Zocca Preface This is the 34th edition of the Haskell Communities and Activities Report. This report has 148 entries, 5 more than in the previous edition.
    [Show full text]
  • Pure Quick Reference
    Pure Quick Reference Albert Graf¨ April 15, 2018 Abstract This is a quick reference guide to the Pure programming language for the impa- tient. It briefly summarizes all important language constructs and gives a few basic examples, so that seasoned programmers can pick up the language at a glance and start hacking away as quickly as possible. Copyright © 2009-2018 Albert Gräf. Permission is granted to copy, distribute and/or modify this docu- ment under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation. See http://www.gnu.org/copyleft/fdl.html. The latest version of this document can be found at https://agraef.github.io/pure-lang/quickref/pure-quickref.pdf. Contents 1 Introduction 3 1.1 Background and Recommended Reading . 3 1.2 Getting Started . 4 2 Lexical Matters 5 3 Expressions 6 3.1 Primary Expressions . 7 3.2 Function Applications . 9 3.3 Operators . 11 3.4 Predefined Operators . 13 3.5 Patterns . 15 3.6 Conditional Expressions . 18 3.7 Block Expressions . 19 3.8 Lexical Scoping . 21 3.9 Comprehensions . 22 3.10 Special Forms . 23 1 4 Definitions 26 4.1 The Global Scope . 26 4.2 Rule Syntax . 27 4.3 Function Definitions . 29 4.4 Variable Definitions . 33 4.5 Constant Definitions . 33 4.6 Type Definitions . 34 4.7 Macro Definitions . 37 5 Programs and Modules 38 5.1 Modules . 38 5.2 Namespaces . 39 5.3 Private Symbols . 42 5.4 Hierarchical Namespaces . 42 6 C Interface 43 7 The Interpreter 45 7.1 Running the Interpreter .
    [Show full text]
  • Rxjs in Action
    Paul P. Daniels Luis Atencio FOREWORD BY Ben Lesh MANNING www.allitebooks.com RxJS in Action www.allitebooks.com www.allitebooks.com RxJS in Action COVERS RXJS 5 PAUL P. DANIELS LUIS ATENCIO FOREWORD BY BEN LESH MANNING SHELTER ISLAND www.allitebooks.com For online information and ordering of this and other Manning books, please visit www.manning.com. The publisher offers discounts on this book when ordered in quantity. For more information, please contact Special Sales Department Manning Publications Co. 20 Baldwin Road PO Box 761 Shelter Island, NY 11964 Email: [email protected] ©2017 by Manning Publications Co. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by means electronic, mechanical, photocopying, or otherwise, without prior written permission of the publisher. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in the book, and Manning Publications was aware of a trademark claim, the designations have been printed in initial caps or all caps. Recognizing the importance of preserving what has been written, it is Manning’s policy to have the books we publish printed on acid-free paper, and we exert our best efforts to that end. Recognizing also our responsibility to conserve the resources of our planet, Manning books are printed on paper that is at least 15 percent recycled and processed without the use of elemental chlorine. Manning Publications Co. Development
    [Show full text]
  • Functional Programming and the Lambda Calculus
    Functional Programming and the Lambda Calculus Stephen A. Edwards Columbia University Fall 2008 Functional vs. Imperative Imperative programming concerned with “how.” Functional programming concerned with “what.” Based on the mathematics of the lambda calculus (Church as opposed to Turing). “Programming without variables” It is elegant and a difficult setting in which to create subtle bugs. It’s a cult: once you catch the functional bug, you never escape. Referential transparency The main (good) property of functional programming is referential transparency. Every expression denotes a single value. The value cannot be changed by evaluating an expression or by sharing it between different parts of the program. No references to global data; there is no global data. There are no side-effects, unlike in referentially opaque languages. The Lambda Calculus Fancy name for rules about how to represent and evaluate expressions with unnamed functions. Theoretical underpinning of functional languages. Side-effect free. Very different from the Turing model of a store with evolving state. O’Caml: The Lambda Calculus: fun x ­> 2 * x λx . ∗ 2 x English: The function of x that returns the product of two and x Grammar of Lambda Expressions expr → constant | variable-name | expr expr | (expr) | λ variable-name . expr Constants are numbers; variable names are identifiers and operators. Somebody asked, “does a language needs to have a large syntax to be powerful?” Bound and Unbound Variables In λx . ∗ 2 x, x is a bound variable. Think of it as a formal parameter to a function. “∗ 2 x” is the body. The body can be any valid lambda expression, including another unnnamed function.
    [Show full text]
  • Brno University of Technology Just-In-Time
    BRNO UNIVERSITY OF TECHNOLOGY VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ FACULTY OF INFORMATION TECHNOLOGY FAKULTA INFORMAČNÍCH TECHNOLOGIÍ DEPARTMENT OF INTELLIGENT SYSTEMS ÚSTAV INTELIGENTNÍCH SYSTÉMŮ JUST-IN-TIME COMPILATION OF THE DEPENDENTLY-TYPED LAMBDA CALCULUS JUST-IN-TIME PŘEKLAD ZÁVISLE TYPOVANÉHO LAMBDA KALKULU MASTER’S THESIS DIPLOMOVÁ PRÁCE AUTHOR Bc. JAKUB ZÁRYBNICKÝ AUTOR PRÁCE SUPERVISOR Ing. ONDŘEJ LENGÁL, Ph.D. VEDOUCÍ PRÁCE BRNO 2021 Brno University of Technology Faculty of Information Technology Department of Intelligent Systems (DITS) Academic year 2020/2021 Master's Thesis Specification Student: Zárybnický Jakub, Bc. Programme: Information Technology Field of Intelligent Systems study: Title: Just-in-Time Compilation of Dependently-Typed Lambda Calculus Category: Compiler Construction Assignment: 1. Investigate dependent types, simply-typed and dependently-typed lambda calculus, and their evaluation models (push/enter, eval/apply). 2. Get familiar with the Graal virtual machine and the Truffle language implementation framework. 3. Create a parser and an interpreter for a selected language based on dependently-typed lambda calculus. 4. Propose a method of normalization-by-evaluation for dependent types and implement it for the selected language. 5. Create a just-in-time (JIT) compiler for the language using the Truffle API. 6. Compare the runtime characteristics of the interpreter and the JIT compiler, evaluate the results. Recommended literature: https://www.graalvm.org/ Löh, Andres, Conor McBride, and Wouter Swierstra. "A tutorial implementation of a dependently typed lambda calculus." Fundamenta Informaticae 21 (2001): 1001-1031. Marlow, Simon, and Simon Peyton Jones. "Making a fast curry: push/enter vs. eval/apply for higher-order languages." Journal of Functional Programming 16.4-5 (2006): 415-449.
    [Show full text]
  • An Introduction to Probabilistic Programming Arxiv:1809.10756V1
    An Introduction to Probabilistic Programming Jan-Willem van de Meent College of Computer and Information Science Northeastern University [email protected] Brooks Paige Alan Turing Institute University of Cambridge [email protected] Hongseok Yang School of Computing KAIST [email protected] Frank Wood Department of Computer Science University of British Columbia [email protected] arXiv:1809.10756v1 [stat.ML] 27 Sep 2018 Contents Abstract1 Acknowledgements3 1 Introduction8 1.1 Model-based Reasoning . 10 1.2 Probabilistic Programming . 21 1.3 Example Applications . 26 1.4 A First Probabilistic Program . 29 2 A Probabilistic Programming Language Without Recursion 31 2.1 Syntax . 32 2.2 Syntactic Sugar . 37 2.3 Examples . 42 2.4 A Simple Purely Deterministic Language . 48 3 Graph-Based Inference 51 3.1 Compilation to a Graphical Model . 51 3.2 Evaluating the Density . 66 3.3 Gibbs Sampling . 74 3.4 Hamiltonian Monte Carlo . 80 3.5 Compilation to a Factor Graph . 89 3.6 Expectation Propagation . 94 4 Evaluation-Based Inference I 102 4.1 Likelihood Weighting . 105 4.2 Metropolis-Hastings . 116 4.3 Sequential Monte Carlo . 125 4.4 Black Box Variational Inference . 131 5 A Probabilistic Programming Language With Recursion 138 5.1 Syntax . 142 5.2 Syntactic sugar . 143 5.3 Examples . 144 6 Evaluation-Based Inference II 155 6.1 Explicit separation of model and inference code . 156 6.2 Addressing Transformation . 161 6.3 Continuation-Passing-Style Transformation . 165 6.4 Message Interface Implementation . 171 6.5 Likelihood Weighting . 175 6.6 Metropolis-Hastings .
    [Show full text]
  • LEAGER PROGRAMMING Abstract
    LEAGER PROGRAMMING Anthony Chyr UUCS-19-002 School of Computing University of Utah Salt Lake City, UT 84112 USA February 26, 2019 Abstract Leager programming, a portmanteau of “lazy” and “eager” or “limit” and “eager,” is an evaluation strategy that mixes lazy evaluation and eager evaluation. This evaluation strategy allows iterators to precompute the next value in a separate thread, storing the result in a cache until it is needed by the caller. Leager programming often takes the form of an iterator, which alone allows data to be prefetched, and when chained together can be used to form concurrent pipelines. We found a dramatic reduction in latency on par with code written with asynchronous callbacks, while making minimal modifications to the initial sequential code. LEAGER PROGRAMMING by Anthony Chyr A senior thesis submitted to the faculty of The University of Utah in partial fulfillment of the requirements for the degree of Bachelor of Computer Science School of Computing The University of Utah May 2019 Approved: / / Matthew Flatt H. James de St. Germain Supervisor Director of Undergraduate Studies School of Computing / Ross Whitaker Diretor School of Computing Copyright c Anthony Chyr 2019 All Rights Reserved ABSTRACT Leager programming, a portmanteau of “lazy” and “eager” or “limit” and “eager,” is an evaluation strategy that mixes lazy evaluation and eager evaluation. This evaluation strategy allows iterators to precompute the next value in a separate thread, storing the result in a cache until it is needed by the caller. Leager programming often takes the form of an iterator, which alone allows data to be prefetched, and when chained together can be used to form concurrent pipelines.
    [Show full text]
  • A Distributed Execution Engine Supporting Data-Dependent Control flow
    A distributed execution engine supporting data-dependent control flow Derek Gordon Murray University of Cambridge Computer Laboratory King’s College July 2011 This dissertation is submitted for the degree of Doctor of Philosophy Declaration This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration except where specifically indicated in the text. This dissertation does not exceed the regulation length of 60,000 words, including tables and footnotes. A distributed execution engine supporting data-dependent control flow Derek G. Murray Summary In computer science, data-dependent control flow is the fundamental concept that enables a machine to change its behaviour on the basis of intermediate results. This ability increases the computational power of a machine, because it enables the machine to execute iterative or recursive algorithms. In such algorithms, the amount of work is unbounded a priori, and must be determined by evaluating a fixpoint condition on successive intermediate results. For example, in the von Neumann architecture—upon which almost all modern computers are based—these algorithms can be programmed using a conditional branch instruction. A distributed execution engine is a system that runs on a network of computers, and provides the illusion of a single, reliable machine that provides a large aggregate amount of compu- tational and I/O performance. Although each individual computer in these systems is a von Neumann machine capable of data-dependent control flow, the effective computational power of a distributed execution engine is determined by the expressiveness of the execution model that describes distributed computations.
    [Show full text]
  • Name Synopsis Options Description
    Pure(1) Pure(1) NAME pure − the Pure interpreter SYNOPSIS pure [options ...] [script ...] [-- args ...] pure [options ...] -x script [args ...] OPTIONS --help, -h Print help message and exit. -i Force interactive mode (read commands from stdin). -Idirectory Add a directory to be searched for included source scripts. -Ldirectory Add a directory to be searched for dynamic libraries. --noediting Do not use readline for command-line editing. --noprelude, -n Do not load the prelude. --norc Do not run the interactive startup files. -q Quiet startup (suppresses sign-on message in interactive mode). -v[level] Set verbosity level. See belowfor details. --version Print version information and exit. -x Execute script with givencommand line arguments. -- Stop option processing and pass the remaining command line arguments in the argv variable. DESCRIPTION Pure is a modern-style functional programming language based on term rewriting. Pure programs are basi- cally collections of equational rules used to evaluate expressions in a symbolic fashion by reducing them to normal form. A brief overviewofthe language can be found in the PURE OVERVIEW section below. (In case you’re wondering, the name ‘‘Pure’’actually refers to the adjective.But you can also write it as ‘‘PURE’’and takethis as a recursive acronym for the ‘‘Pure Universal Rewriting Engine’’.) pure is the Pure interpreter.The interpreter has an LLVM backend which JIT-compiles Pure programs to machine code, hence programs run blazingly fast and interfacing to C modules is easy,while the interpreter still provides a convenient, fully interactive environment for running Pure scripts and evaluating expres- sions. Pure programs (a.k.a.
    [Show full text]
  • Optimising Compilers
    Computer Laboratory Optimising Compilers Computer Science Tripos Part II Timothy Jones Lent 2019 [email protected] http://www.cl.cam.ac.uk/Teaching/1819/OptComp/ Learning Guide The course as lectured proceeds fairly evenly through these notes, with 7 lectures devoted to part A, 5 to part B and 3 or 4 devoted to parts C and D. Part A mainly consists of analy- sis/transformation pairs on flowgraphs whereas part B consists of more sophisticated analyses (typically on representations nearer to source languages) where typically a general framework and an instantiation are given. Part C consists of an introduction to instruction scheduling and part D an introduction to decompilation and reverse engineering. One can see part A as intermediate-code to intermediate-code optimisation, part B as (al- ready typed if necessary) parse-tree to parse-tree optimisation and part C as target-code to target-code optimisation. Part D is concerned with the reverse process. Rough contents of each lecture are: Lecture 1: Introduction, flowgraphs, call graphs, basic blocks, types of analysis Lecture 2: (Transformation) Unreachable-code elimination Lecture 3: (Analysis) Live variable analysis Lecture 4: (Analysis) Available expressions Lecture 5: (Transformation) Uses of LVA Lecture 6: (Continuation) Register allocation by colouring Lecture 7: (Transformation) Uses of Avail; Code motion Lecture 8: Static Single Assignment; Strength reduction Lecture 9: (Framework) Abstract interpretation Lecture 10: (Instance) Strictness analysis Lecture 11: (Framework) Constraint-based analysis; (Instance) Control-flow analysis (for λ-terms) Lecture 12: (Framework) Inference-based program analysis Lecture 13: (Instance) Effect systems Lecture 13a: Points-to and alias analysis Lecture 14: Instruction scheduling Lecture 15: Same continued, slop Lecture 16: Decompilation.
    [Show full text]