Adapting a Constraint-Based Compiler Tool to a New VLIW Architecture

Total Page:16

File Type:pdf, Size:1020Kb

Adapting a Constraint-Based Compiler Tool to a New VLIW Architecture IT 19 060 Examensarbete 30 hp September 2019 Adapting a Constraint-Based Compiler Tool to a New VLIW Architecture Martin Kjellin Institutionen för informationsteknologi Department of Information Technology Abstract Adapting a Constraint-Based Compiler Tool to a New VLIW Architecture Martin Kjellin Teknisk- naturvetenskaplig fakultet UTH-enheten The compiler tool Unison uses combinatorial optimisation to perform integrated register allocation and instruction scheduling, and is designed to be possible to adapt Besöksadress: to different processor architectures. Black Arrow is a VLIW (very long instruction Ångströmlaboratoriet Lägerhyddsvägen 1 word) architecture designed for signal processing within the field of mobile Hus 4, Plan 0 communication technology. Black Arrow has structural traits that complicate the register allocation and instruction scheduling process. Most important of these is that Postadress: functionally equivalent execution units within the processor are not topologically Box 536 751 21 Uppsala equivalent, since their distances to other execution units vary. This means that an instruction schedule must indicate not only in what cycle each instruction is to be Telefon: executed, but also the execution unit that will execute it. This thesis presents and 018 – 471 30 03 evaluates a version of Unison that has been adapted to Black Arrow. A central point Telefax: in the adaptation is how the problem of assigning each Black Arrow instruction to a 018 – 471 30 00 specific execution unit is solved through the use of Unison’s ability to choose between several alternative instructions implementing the same operation. The Hemsida: evaluation shows that the Black Arrow version of Unison, when optimising for http://www.teknat.uu.se/student minimal execution time and a variant of minimal code size, is able to find optimal solutions to the register allocation and instruction scheduling problems for several common integer algorithms, within reasonable time. Handledare: Karl-Filip Faxén Ämnesgranskare: Pierre Flener Examinator: Mats Daniels IT 19 060 Tryckt av: Reprocentralen ITC Contents Acknowledgements 7 1 Introduction 9 1.1 Purpose . 9 1.2 Structure . 10 2 Black Arrow 11 2.1 Datapath . 11 2.2 Instruction Set . 14 2.3 Bach and Kudzul-C . 14 3 Combinatorial Optimisation 17 3.1 Constraint Programming . 17 3.2 Lazy Clause Generation . 18 4 Unison 21 4.1 Code Import . 21 4.2 Linearisation . 22 4.3 Extension with Optional Copy Operations . 23 4.4 Augmentation with Alternative Temporaries . 24 4.5 Constraint Model . 26 4.6 Solving . 30 5 Target Description 33 5.1 Instruction Set . 33 5.2 Choosing Execution Unit . 35 5.3 Optional Copy Operations . 36 5.4 Global Interconnects . 37 5.5 Resources . 39 5.6 Latencies and Serialisation . 39 5.7 Read and Write Capabilities . 40 6 Toolchain 43 6.1 Preprocessing . 43 6.2 Unison’s processing . 45 6.3 Postprocessing . 45 7 Evaluation 49 7.1 Test Programs . 49 7.2 Test Setup . 50 5 7.3 Optimising for Minimal Execution Time . 52 7.4 Optimising for Minimal Code Size . 56 7.4.1 Optimising with Constant-Size Bundles . 57 7.4.2 Optimising with Varying-Size Bundles . 59 7.5 Discussion . 62 8 Related Work 65 8.1 Combinatorial Optimisation Approaches . 65 8.2 Cluster Allocation . 66 9 Future Work 67 9.1 Instruction Set . 67 9.2 Choice of Execution Units . 68 9.3 Further Investigations . 69 Bibliography 71 6 Acknowledgements This thesis has only one author, but several others have given invaluable help during the long process of writing it and performing the investigation reported in it. My supervisor Karl-Filip Faxén has guided me in the world of processors and compilers and helped me understand how Black Arrow works and how it can be modelled. Fur- thermore, we have had many interesting discussions about other important subjects, such as politics and music. Pierre Flener has been a very interested and encourag- ing reviewer and has given me valuable advice on how to improve the thesis text. Roberto Castañeda Lozano has willingly helped me understand all important details of Unison. His helpfulness is even more remarkable considering that he has no formal responsibility for this thesis. Roberto was also the one who proposed using Unison’s mechanism for alternative instructions to assign instructions to the execution units of Black Arrow, an idea that is very important in this thesis. Per Starbäck made the LATEX document class stpthesis, which was used to give the thesis a beautiful layout. I am most grateful to them all! Last but not least, I would like to thank Sofia Nordin, who, apart from being a very dear friend, was the one who helped me get in touch with Karl-Filip and thus with the idea of the investigation presented in the thesis. 7 8 1 Introduction A compiler consists of two main parts: a front-end and a back-end. While the front-end analyses the source program that is about to be compiled and produces an intermediate representation of it, the back-end produces a target program (often in some assem- bly language) from the intermediate representation [1, p. 4]. Two important tasks performed by a compiler back-end are register allocation and instruction scheduling [1, pp. 505–506]. Register allocation is the act of assigning temporaries (program variables) to registers or memory at each point in the program, while instruction scheduling is the act of ordering the instructions of the program to achieve high throughput (number of instructions executed per time unit). These tasks can be per- formed using heuristic or systematic methods. Given enough time, the latter methods produce optimal register allocations and instruction schedules, in relation to some objective such as minimal code size or minimal (estimated) execution time for the compiled program. The instruction scheduling performed by the compiler is especially important for so-called VLIW (very long instruction word) architectures. The defining characteristic of such an architecture is that each instruction word encodes all the operations to be issued in a single cycle. This means that the compiler, during instruction scheduling, must decide which operations are to be issued in parallel, instead of relying on hardware to do this at runtime [1, p. 710]. 1.1 Purpose The work presented in this thesis has been performed at RISE SICS, a government- owned Swedish research institute within applied information and communication technology. The compiler tool Unison [12], which has been developed in collaboration between RISE SICS, the communication technology company Ericsson, and KTH Royal Institute of Technology, uses combinatorial optimisation to perform integrated register allocation and instruction scheduling. Unison can be integrated with the back- end of the compiler LLVM [32] and is designed to be possible to adapt to different processor architectures. In another collaboration between RISE SICS and Ericsson, a new VLIW architec- ture called Black Arrow has been developed [22, 23]. Black Arrow, which is intended to be used for signal processing within the field of mobile communication technology, is a programmable accelerator, that is, a processor designed to provide as much as possible of both the flexibility of a digital signal processor (DSP) and the performance of a hardware accelerator (a processor specialised in efficiently calculating certain mathematical functions). As part of the Black Arrow project, an architecture-specific compiler called Bach (Black Arrow Compiler in Haskell), which is not based on 9 methods for combinatorial optimisation, has been developed [22]. When compared to LLVM’s default algorithms for register allocation and in- struction scheduling, Unison has already been shown to produce code of similar or better quality for a few architectures, among these the VLIW architecture Hexagon [12]. However, the Black Arrow architecture has structural traits that complicate the instruction scheduling and register allocation problems, and it has not been studied before if Unison can handle these traits and, if so, produce better code than Bach. The purpose of this study is therefore to: • Investigate if Unison can be adapted to support the Black Arrow architecture. • If so, evaluate three aspects of the performance of the adapted version of Unison: – The quality of the produced code (in terms of code size and estimated execution time). – The speed of the instruction scheduling and register allocation process. – The impact on quality and speed of using different solvers, techniques and strategies for combinatorial optimisation. As will be shown, it is indeed possible to adapt Unison to the Black Arrow architecture, so that the tool can find optimal solutions to the instruction scheduling and register allocation problems for Black Arrow, in a way that provides reasonable processing times for several common integer algorithms, when the optimisation concerns minimal execution time and a variant of minimal code size. However, the evaluation also shows that other programs and another code-size optimisation criterion make the optimisation significantly harder for Unison. 1.2 Structure As a necessary background to the adaptation of Unison for the Black Arrow archi- tecture, this introduction is followed by descriptions of the Black Arrow architecture (Chapter 2), two techniques for combinatorial optimisation that can be used by Unison (Chapter 3), and Unison’s instruction scheduling and register allocation process (Chap- ter 4). After these background chapters, the target description that adapts Unison to Black Arrow and that has been developed as part of this study is presented in Chapter 5. The practical process of transforming a program representation produced by Bach to one where Unison has performed instruction scheduling and register allocation is described in Chapter 6, before the adapted version of Unison is evaluated in Chapter 7. The thesis ends with a discussion of some related work in Chapter 8 and suggestions for future work in Chapter 9. 10 2 Black Arrow The most recent version of the Black Arrow architecture is (in September 2019) version 1.0 [23].
Recommended publications
  • Causal Commutative Arrows Revisited
    Causal Commutative Arrows Revisited Jeremy Yallop Hai Liu University of Cambridge, UK Intel Labs, USA [email protected] [email protected] Abstract init which construct terms of an overloaded type arr. Most of the Causal commutative arrows (CCA) extend arrows with additional code listings in this paper uses these combinators, which are more constructs and laws that make them suitable for modelling domains convenient for defining instances, in place of the notation; we refer such as functional reactive programming, differential equations and the reader to Paterson (2001) for the details of the desugaring. synchronous dataflow. Earlier work has revealed that a syntactic Unfortunately, speed does not always follow succinctness. Al- transformation of CCA computations into normal form can result in though arrows in poetry are a byword for swiftness, arrows in pro- significant performance improvements, sometimes increasing the grams can introduce significant overhead. Continuing with the ex- speed of programs by orders of magnitude. In this work we refor- ample above, in order to run exp, we must instantiate the abstract mulate the normalization as a type class instance and derive op- arrow with a concrete implementation, such as the causal stream timized observation functions via a specialization to stream trans- transformer SF (Liu et al. 2009) that forms the basis of signal func- formers to demonstrate that the same dramatic improvements can tions in the Yampa domain-specific language for functional reactive be achieved without leaving the language. programming (Hudak et al. 2003): newtype SF a b = SF {unSF :: a → (b, SF a b)} Categories and Subject Descriptors D.1.1 [Programming tech- niques]: Applicative (Functional) Programming (The accompanying instances for SF , which define the arrow operators, appear on page 6.) Keywords arrows, stream transformers, optimization, equational Instantiating exp with SF brings an unpleasant surprise: the reasoning, type classes program runs orders of magnitude slower than an equivalent pro- gram that does not use arrows.
    [Show full text]
  • An Exploration of the Effects of Enhanced Compiler Error Messages for Computer Programming Novices
    Technological University Dublin ARROW@TU Dublin Theses LTTC Programme Outputs 2015-11 An Exploration Of The Effects Of Enhanced Compiler Error Messages For Computer Programming Novices Brett A. Becker Technological University Dublin Follow this and additional works at: https://arrow.tudublin.ie/ltcdis Part of the Computer and Systems Architecture Commons, and the Educational Methods Commons Recommended Citation Becker, B. (2015) An exploration of the effects of enhanced compiler error messages for computer programming novices. Thesis submitted to Technologicl University Dublin in part fulfilment of the requirements for the award of Masters (M.A.) in Higher Education, November 2015. This Theses, Masters is brought to you for free and open access by the LTTC Programme Outputs at ARROW@TU Dublin. It has been accepted for inclusion in Theses by an authorized administrator of ARROW@TU Dublin. For more information, please contact [email protected], [email protected]. This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 4.0 License An Exploration of the Effects of Enhanced Compiler Error Messages for Computer Programming Novices A thesis submitted to Dublin Institute of Technology in part fulfilment of the requirements for the award of Masters (M.A.) in Higher Education by Brett A. Becker November 2015 Supervisor: Dr Claire McDonnell Learning Teaching and Technology Centre, Dublin Institute of Technology Declaration I certify that this thesis which I now submit for examination for the award of Masters (M.A.) in Higher Education is entirely my own work and has not been taken from the work of others, save and to the extent that such work has been cited and acknowledged within the text of my own work.
    [Show full text]
  • Directing Javascript with Arrows
    Directing JavaScript with Arrows Khoo Yit Phang Michael Hicks Jeffrey S. Foster Vibha Sazawal University of Maryland, College Park {khooyp,mwh,jfoster,vibha}@cs.umd.edu Abstract callback with a (short) timeout. Unfortunately, this style of event- JavaScript programmers make extensive use of event-driven pro- driven programming is tedious, error-prone, and hampers reuse. gramming to help build responsive web applications. However, The callback sequencing code is strewn throughout the program, standard approaches to sequencing events are messy, and often and very often each callback must hard-code the names of the next lead to code that is difficult to understand and maintain. We have events and callbacks in the chain. found that arrows, a generalization of monads, are an elegant solu- To combat this problem, many researchers and practitioners tion to this problem. Arrows allow us to easily write asynchronous have developed libraries to ease the construction of rich and highly programs in small, modular units of code, and flexibly compose interactive web applications. Examples include jQuery (jquery. them in many different ways, while nicely abstracting the details of com), Prototype (prototypejs.org), YUI (developer.yahoo. asynchronous program composition. In this paper, we present Ar- com/yui), MochiKit (mochikit.com), and Dojo (dojotoolkit. rowlets, a new JavaScript library that offers arrows to the everyday org). These libraries generally provide high-level APIs for com- JavaScript programmer. We show how to use Arrowlets to construct mon features, e.g., drag-and-drop, animation, and network resource a variety of state machines, including state machines that branch loading, as well as to handle API differences between browsers.
    [Show full text]
  • The Glasgow Haskell Compiler User's Guide, Version 4.08
    The Glasgow Haskell Compiler User's Guide, Version 4.08 The GHC Team The Glasgow Haskell Compiler User's Guide, Version 4.08 by The GHC Team Table of Contents The Glasgow Haskell Compiler License ........................................................................................... 9 1. Introduction to GHC ....................................................................................................................10 1.1. The (batch) compilation system components.....................................................................10 1.2. What really happens when I “compile” a Haskell program? .............................................11 1.3. Meta-information: Web sites, mailing lists, etc. ................................................................11 1.4. GHC version numbering policy .........................................................................................12 1.5. Release notes for version 4.08 (July 2000) ........................................................................13 1.5.1. User-visible compiler changes...............................................................................13 1.5.2. User-visible library changes ..................................................................................14 1.5.3. Internal changes.....................................................................................................14 2. Installing from binary distributions............................................................................................16 2.1. Installing on Unix-a-likes...................................................................................................16
    [Show full text]
  • SOME USEFUL STRUCTURES for CATEGORICAL APPROACH for PROGRAM BEHAVIOR in Computer Science, Where We Often Use More Complex Structures Not Expressible by Sets
    JIOS, VOL. 35, NO. 1 (2011) SUBMITTED 02/11; ACCEPTED 03/11 UDC 004.423.45 [ SomeSome useful Useful structures Structures for categorical for Categorical approach Approach for program for Programbehavior Behavior Viliam Slodicákˇ [email protected] Department of Computers and Informatics Faculty of Electrical Engineering and Informatics Technical university of Košice Letná 9, 042 00 Košice Slovak Republic Abstract Using of category theory in computer science has extremely grown in the last decade. Categories allow us to express mathematical structures in unified way. Algebras are used for constructing basic structures used in computer programs. A program can be considered as an element of the initial algebra arising from the used programming language. In our contribution we formulate two ways of expressing algebras in categories. We also construct the codomain functor from the arrow category of algebras into the base category of sets which objects are also the carrier-sets of the algebras. This functor expresses the relation between algebras and carrier-sets. Keywords: Algebra, arrow category, monad, Kleisli category, codomain functor 1. Introduction Knowing and proving of the expected behavior of complex program systems is very important and actual rôle. It carries the time and cost savings: in mathematics [8] or in practical applications of economical character. The aim of programming is to construct such correct programs and program systems that during their execution provide expected behavior [7]. A program can be considered as an element of the initial algebra arising from the used programming language [14]. Algebraic structures and number systems are widely used in computer science.
    [Show full text]
  • The Bluej Environment Reference Manual
    The BlueJ Environment Reference Manual Version 2.0 for BlueJ Version 2.0 Kasper Fisker Michael Kölling Mærsk Mc-Kinney Moller Institute University of Southern Denmark HOW DO I ... ? – INTRODUCTION ........................................................................................................ 1 ABOUT THIS DOCUMENT................................................................................................................................. 1 RELATED DOCUMENTS.................................................................................................................................... 1 REPORT AN ERROR.......................................................................................................................................... 1 USING THE CONTEXT MENU........................................................................................................................... 1 1 PROJECTS.......................................................................................................................................... 2 1.1 CREATE A NEW PROJECT................................................................................................................... 2 1.2 OPEN A PROJECT ............................................................................................................................... 2 1.3 FIND OUT WHAT A PROJECT DOES .................................................................................................... 2 1.4 COPY A PROJECT ..............................................................................................................................
    [Show full text]
  • Companion Object
    1. Functional Programming Functional Programming What is functional programming? What is it good for? Why to learn it? Functional Programming Programs as a composition of pure functions. Pure function? Deterministic. Same result for the same inputs. Produces no observable side eects. fun mergeAccounts(acc1: Account, acc2: Account): Account = Account(acc1.balance + acc2.balance) val mergedAcc = mergeAccounts(Account(500), Account(1000)) Target platform: JVM Running on Arrow v. 0.11.0-SNAPSHOT Running on Kotlin v. 1.3.50 Referential Transparency Function can be replaced with its corresponding value without changing the program's behavior. Based on the substitution model. Referential Transparency Better reasoning about program behavior. Pure functions referentially transparent. Substitution model Replace a named reference (function) by its value. Based on equational reasoning. Unlocks local reasoning. fun add(a: Int, b: Int): Int = a + b val two = add(1, 1) // could be replaced by 2 everywhere in the program val equal = (two == 1 + 1) Target platform: JVM Running on Arrow v. 0.11.0-SNAPSHOT Running on Kotlin v. 1.3.50 Substitution model Same example, but with two Accounts. fun mergeAccounts(acc1: Account, acc2: Account): Account = Account(acc1.balance + acc2.balance) val mergedAcc = mergeAccounts(Account(100), Account(200)) // could be replaced by Account(300) everywhere // in the program val equal = (mergedAcc == Account(300)) Target platform: JVM Running on Arrow v. 0.11.0-SNAPSHOT Running on Kotlin v. 1.3.50 Side eect? Function tries to access or modify the external world. val accountDB = AccountDB() fun mergeAccounts(acc1: Account, acc2: Account): Account { val merged = Account(acc1.balance + acc2.balance) accountDB.remove(listOf(acc1, acc2)) // side effect! accountDB.save(merged) // side effect! return merged } val merged = mergeAccounts(Account(100), Account(400)) Target platform: JVM Running on Arrow v.
    [Show full text]
  • It's All About Morphisms
    It’s All About Morphisms Uberto Barbini @ramtop https://medium.com/@ramtop About me OOP programmer Agile TDD Functional PRogramming Finance Kotlin Industry Blog: https://medium.com/@ ramtop Twitter: @ramtop #morphisms#VoxxedVienna @ramtop Map of this presentation Monoid Category Monad Functor Natural Transformation Yoneda Applicative Morphisms all the way down... #morphisms#VoxxedVienna @ramtop I don’t care about Monads, why should I? Neither do I what I care about is y el is to define system behaviour ec Pr #morphisms#VoxxedVienna @ramtop This presentation will be a success if most of you will not fall asleep #morphisms#VoxxedVienna @ramtop This presentation will be a success if You will consider that Functional Programming is about transformations and preserving properties. Not (only) lambdas and flatmap #morphisms#VoxxedVienna @ramtop What is this Category thingy? Invented in 1940s “with the goal of understanding the processes that preserve mathematical structure.” “Category Theory is about relation between things” “General abstract nonsense” #morphisms#VoxxedVienna @ramtop Once upon a time there was a Category of Stuffed Toys and a Category of Tigers... #morphisms#VoxxedVienna @ramtop A Category is defined in 5 steps: 1) A collection of Objects #morphisms#VoxxedVienna @ramtop A Category is defined in 5 steps: 2) A collection of Arrows #morphisms#VoxxedVienna @ramtop A Category is defined in 5 steps: 3) Each Arrow works on 2 Objects #morphisms#VoxxedVienna @ramtop A Category is defined in 5 steps: 4) Arrows can be combined #morphisms#VoxxedVienna
    [Show full text]
  • Mechanized Reasoning During Compilation in the Glasgow Haskell Compiler
    HERMIT: Mechanized Reasoning during Compilation in the Glasgow Haskell Compiler By Andrew Farmer Submitted to the graduate degree program in Electrical Engineering and Computer Science and the Graduate Faculty of the University of Kansas in partial fulfillment of the requirements for the degree of Doctor of Philosophy. Chairperson Dr. Andy Gill Dr. Perry Alexander Dr. Prasad Kulkarni Dr. James Miller Dr. Christopher Depcik Date Defended: April 30, 2015 The Dissertation Committee for Andrew Farmer certifies that this is the approved version of the following dissertation: HERMIT: Mechanized Reasoning during Compilation in the Glasgow Haskell Compiler Chairperson Dr. Andy Gill Date approved: ii Abstract It is difficult to write programs which are both correct and fast. A promising approach, functional programming, is based on the idea of using pure, mathematical functions to construct programs. With effort, it is possible to establish a connection between a specification written in a functional language, which has been proven correct, and a fast implementation, via program transformation. When practiced in the functional programming community, this style of reasoning is still typ- ically performed by hand, by either modifying the source code or using pen-and-paper. Unfortu- nately, performing such semi-formal reasoning by directly modifying the source code often obfus- cates the program, and pen-and-paper reasoning becomes outdated as the program changes over time. Even so, this semi-formal reasoning prevails because formal reasoning is time-consuming, and requires considerable expertise. Formal reasoning tools often only work for a subset of the target language, or require programs to be implemented in a custom language for reasoning.
    [Show full text]
  • Download Article (PDF)
    Cent. Eur. J. Comp. Sci. • 4(3) • 2014 • 96-106 DOI: 10.2478/s13537-014-0220-7 Central European Journal of Computer Science Categorical structures as expressing tool for differential calculus Research Article William Steingartner1∗, Davorka Radaković2† 1 Department of Computers and Informatics, Faculty of El. Engineering and Informatics Technical University of Košice, Letná 9, 04200 Košice, Slovakia 2 Department of Mathematics and Informatics, Faculty of Sciences University of Novi Sad, Trg D. Obradovića 4, 21000 Novi Sad, Serbia Received 28 February 2014; accepted 01 September 2014 Abstract: Category is a mathematical structure consisting of objects and morphisms between objects with some specific properties. Categories examine in abstract way the properties of particular mathematical concepts by formalizing them as collections of objects and morphisms. Categorical structures are widely used in computer science for ex- act mathematical modeling. This paper highlights the most typical use of categories for constructing the model of part of differential calculus by using special category named arrow category; and codomain and domain functors. Keywords: arrow category• category theory • categorical model• codomain functor• derivative © Versita sp. z o.o. 1. Introduction Category theory is an area of study in mathematics that examines in abstract way the properties of particular math- ematical concepts by formalizing them as collections of objects and arrows (called morphisms, although this term also has a specific, non category-theoretical meaning), where these collections satisfy some basic conditions [1–3]. Category theory has been developed over the last fifty years, and it has concerned with the study of algebraic structures [4, 5]. Many significant areas of mathematics and informatics can be formalized by using categories.
    [Show full text]
  • Session Arrows: a Session-Type Based Framework for Parallel Code Generation
    MENG INDIVIDUAL PROJECT IMPERIAL COLLEGE LONDON DEPARTMENT OF COMPUTING Session Arrows: A Session-Type Based Framework For Parallel Code Generation Supervisor: Prof. Nobuko Yoshida Author: Dr. David Castro-Perez Shuhao Zhang Second Marker: Dr. Iain Phillips June 19, 2019 Abstract Parallel code is notorious for its difficulties in writing, verification and maintenance. However, it is of increasing importance, following the end of Moore’s law. Modern pro- grammers are expected to utilize the power of multi-core CPUs and face the challenges brought by parallel programs. This project builds an embedded framework in Haskell to generate parallel code. Combining the power of multiparty session types with parallel computation, we create a session typed monadic language as the middle layer and use Arrow, a general interface to computation as an abstraction layer on top of the language. With the help of the Arrow interface, we convert the data-flow of the computation to communication and generate parallel code according to the communication pattern between participants involved in the computation. Thanks to the addition of session types, not only the generated code is guaranteed to be deadlock-free, but also we gain a set of local types so that it is possible to reason about the communication structure of the parallel computation. In order to show that the framework is as expressive as usual programming lan- guages, we write several common parallel computation patterns and three algorithms to benchmark using our framework. They demonstrate that users can express computa- tion similar to traditional sequential code and gain, for free, high-performance parallel code in low-level target languages such as C.
    [Show full text]
  • An Ode to Arrows
    An Ode to Arrows Hai Liu Paul Hudak Department of Computer Science Yale University New Haven, CT 06520, U.S.A. {hai.liu,paul.hudak}@yale.edu Abstract. We study a number of embedded DSLs for autonomous or- dinary differential equations (autonomous ODEs) in Haskell. A naive implementation based on the lazy tower of derivatives is straightforward but has serious time and space leaks due to the loss of sharing when handling cyclic and infinite data structures. In seeking a solution to fix this problem, we explore a number of DSLs ranging from shallow to deep embeddings, and middle-grounds in between. We advocate a so- lution based on arrows, an abstract notion of computation that offers both a succinct representation and an effective implementation. Arrows are ubiquitous in their combinator style that happens to capture both sharing and recursion elegantly. We further relate our arrow-based DSL to a more constrained form of arrows called causal commutative arrows, the normalization of which leads to a staged compilation technique im- proving ODE performance by orders of magnitude. 1 Introduction Consider the following stream representation of the “lazy tower of derivatives” [9] in Haskell: data D a = D {val :: a, der :: D a } deriving (Eq, Show) 0 Mathematically it represents an infinite sequence of derivatives f(t0), f (t0), 00 (n) f (t0), ... , f (t0), ... for a function f that is continuously differentiable at some value t0. This representation has been used frequently in a technique called Functional Automatic Differentiation The usual trick in Haskell is to make D a an instance of the Num and Fractional type classes, and overload the mathemat- ical operators to simultaneously work on all values in the tower of derivatives: instance Num a ⇒ Num (D a) where D x x 0 + D y y0 = D (x + y)(x 0 + y0) u@(D x x 0) ∗ v@(D y y0) = D (x ∗ y)(x 0 ∗ v + u ∗ y0) negate (D x x 0) = D (−x)(−x 0) ..
    [Show full text]