Extensible Programming with First-Class Cases

Extensible Programming with First-Class Cases Matthias Blume Umut A. Acar Wonseok Chae Toyota Technological Institute at Chicago {blume,umut,wchae}@tti-c.org Abstract mediate language. This translation is formalized as part of our type We present language mechanisms for polymorphic, extensible system. records and their exact dual, polymorphic sums with extensible As in any dual construction, the introduction form of the primal first-class cases. These features make it possible to easily extend corresponds to the elimination form of the dual. Thus, elimination existing code with new cases. In fact, such extensions do not re- forms of sums (e.g., match or case) correspond to introduction quire any changes to code that adheres to a particular programming forms of records. In particular, record extension (an introduction style. Using that style, individual extensions can be written inde- form) corresponds to extension of cases (an elimination form). This duality motivates making cases first-class values as opposed pendently and later be composed to form larger components. These 1 language mechanisms provide a solution to the expression problem. to mere syntactic form. With cases being first-class and extensible, We study the proposed mechanisms in the context of an im- one can use the usual mechanisms of functional abstraction in a plicitly typed, purely functional language PolyR. We give a type style of programming that facilitates composable extensions. We system for the language and provide rules for a 2-phase transforma- show examples for this later in the introduction and in Section 2. tion: first into an explicitly typed λ-calculus with record polymor- Our polymorphic sums provide a notion of type refinement sim- phism, and finally to efficient index-passing code. The first phase ilar to data sorts of Freeman and Pfenning [9] and give rise to a sim- eliminates sums and cases by taking advantage of the duality with ple programming pattern facilitating composable extensions. Com- records. posable extensions can be used as a principled approach to solv- We implement a version of PolyR extended with imperative fea- ing the well-known expression problem described by Wadler [30]. tures and pattern matching—we call this language MLPolyR. Pro- There have been many attempts at solving the expression problem, grams in MLPolyR require no type annotations—the implemen- most of them in an object-oriented context [28, 22, 3, 16, 27, 6, 8, tation employs a reconstruction algorithm to infer all types. The 33, 18, 4, 29, 34]. Garrigue shows an approach based on polymor- compiler generates machine code (currently for PowerPC) and op- phic variants which is similar to our proposal, but somewhat less timizes the representation of sums by eliminating closures gener- general [11]. ated by the dual construction. Composable record extension and its dual Categories and Subject Descriptors D.3.3 [Programming Lan- guages]: Language Constructs and Features—Polymorphism To understand the underlying mechanism, it is instructive to first look at an example. In MLPolyR we write { a = 1, ... = r } General Terms Design, Languages to create a new record which extends record r with a new field a. Since records are first-class values, we can abstract over the Keywords duality, first-class cases, records, sums record being extended and obtain a function add a that extends any argument record (as long as it does not already contain a) with 1. Introduction a field a. Such a function can be thought of as the “difference” between its result and its argument: In this paper we study language mechanisms for polymorphic ex- fun add a r = { a = 1, ... = r } tensible records and their duals: polymorphic sums with a mecha- Here the difference consists of a field labeled a of type int and nism for adding new cases to existing code handling such sums. We value 1. The type of function add a is inferred as:2 present a type system based on a straightforward application of row val add a: { β } → { a: int, β } polymorphism [26] and incorporate it into a dialect of ML called We can write similar functions add b and add c of types MLPolyR. Our compiler for MLPolyR provides efficient type re- { β } → { b:bool, β } and { β } → { c:string, β } construction of principal types by using a variant of the well-known which add fields b and c respectively: algorithm W [19]. The key technical insight to fully general type fun add b r = { b = true, ... = r } inference is to keep separate type constructors for sums and cases during type inference. Taking advantage of duality, sums and cases fun add c r = { c = "hello", ... = r } can be eliminated later by translation into an explicitly typed inter- We can then “add up” record differences represented by add a, add b, add c by composing these functions: fun add ab r = add a (add b r) fun add bc r = add b (add c r) The inferred types are: Permission to make digital or hard copies of all or part of this work for personal or val add ab: { β } → { a: int, b: bool, β } classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation val add bc: { β } → { b: bool, c: string, β } on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. 1 ICFP’06 September 16–21, 2006, Portland, Oregon, USA. In Standard ML [20], the corresponding syntactic form is known as match. Copyright c 2006 ACM 1-59593-309-3/06/0009. $5.00. 2 We omit the universal quantifier and the kind of the row variable β. Finally, we can create actual records by “adding” differences to the fun kv2kb kv = λv.‘App(kv,[v]) empty record: fun kb2kv kb = withfresh(λxr.‘Lam([xr],kb(‘Var xr))) val a = add a {} fun cvt app(e,~e,k ) = let val ab = add ab {} v fun lc([],kb) = kb([]) val bc = add bc {} | lc(e::~e,kb) = pc(e,~e,λ(v, ~v).kb(v::~v)) When translated to the dual, extensibility of records becomes exten- and pc(e,~e,kb) = cvt(e,λv.lc(~e,λ~v.kb(v,~v))) sibility of code. Here is a function representing the difference be- in pc(e,~e,λ(v, ~v).‘App(v,kv::~v)) end tween two code fragments, one of which can handle case ‘A while the other, represented by the argument c, cannot: and cvt lam(~x,e) = withfresh (λxk. fun add A c = cases ‘A () => print "A" ‘Lam(xk::~x,cvt(e,kv2kb(‘Var xk)))) default: c and cvt(e,kb) = match e with Note that function add A corresponds to add a of the dual. The cases ‘Con i ⇒ kb(‘Con i) type inferred for add A is: | ‘Var x ⇒ kb(‘Var x) val add A: (<β> ,→ ()) → (<‘A of (), β> ,→ ()) | ‘Lam(~x,e)⇒ kb(cvt lam(~x,e)) Here a type hρi ,→ τ denotes the type of first-class cases where | ‘App(e,~e)⇒ cvt app(e,~e,kb2kv(kb)) hρi is the sum type that is being handled and τ is the result. One fun convert e = cvt lam([],e) can think of ,→ as an alternative function arrow whose elimination form will be discussed below. Examples for functions add B and Figure 1. A simple CPS-converter. When converting ‘App, the contin- add C (corresponding to add b and add c in the dual) are: fun add B c = cases ‘B () => print "B" uation builder kb must be turned into a syntactic value kv that can be passed as an additional argument; this k is a new ‘Lam with a single parameter x default: c v r and a body obtained by applying k to xr (see function kb2kv). Conversely, fun add C c = cases ‘C () => print "C" b to convert a ‘Lam, an extra formal argument xk representing the continu- default: c ation is added. The corresponding kb simply constructs an ‘App of xk to As in the dual, we can now compose difference functions to obtain whatever argument v is given to kb (see function kv2kb). For clarity, the larger differences: code for ‘App and ‘Lam has been separated out into functions cvt app and fun add AB c = add A (add B c) cvt lam; cvt app uses helper functions lc and pc to recursively convert fun add BC c = add B (add C c) the operator e and all operands ~e. By applying a difference to the empty case nocases we obtain case values: val case A = add A nocases Following Appel [1], the conversion can be performed in val case AB = add AB nocases essentially linear time using an approach that could be called val case BC = add BC nocases “continuation-builder passing style” where the converter function These values can be used in a match form. The match construct is cvt (see Figure 1) receives the expression e to be converted and kb, the elimination form for the case arrow ,→. The following expres- the continuation builder, which represents the context in which e sion will cause "B" to be printed: appeared within the original expression. Once the converter comes match ‘B () with case BC up with a syntactic value v for the result of e, kb is invoked on this The previous examples demonstrate how functional record exten- v in order to produce the CPS expression representing e’s original sion in the primal corresponds to code extension in the dual. This context. forms the basis for our proposed solution to the expression problem.

Extensible Programming with First-Class Cases

Programming Languages

Chapter 5 Names, Bindings, and Scopes

Metaobject Protocols: Why We Want Them and What Else They Can Do

Introduction to Programming in Lisp

ECSS-E-TM-40-07 Volume 2A 25 January 2011

Chapter 2 Basics of Scanning And

Typescript Language Specification

A System of Constructor Classes: Overloading and Implicit Higher-Order Polymorphism

Lecture Notes on Types for Part II of the Computer Science Tripos

Python Programming

Programming Language

Open C++ Tutorial∗