From Structures and Functors to Modules and Units

Scott Owens    Matthew Flatt
University of Utah
{sowens,mflatt}@cs.utah.edu

Abstract

Component programming techniques encourage abstraction and reuse through external linking. Some parts of a program, however, must use concrete, internally specified references, so a pure component system is not a sufficient mechanism for structuring programs. We present the combination of a static, internally-linked module system and a purely abstractive component system. The latter extends our previous model of typed units to properly account for translucency and sharing. We also show how units and modules can express an SML-style system of structures and functors, and we explore the consequences for recursive structures and functors.

Categories and Subject Descriptors D.3.3 [Programming Languages]: Language Constructs and Features - Modules, packages

General Terms Design, Languages

Keywords Module, component, structure, functor, unit

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.
ICFP'06 September 16–21, 2006, Portland, Oregon, USA.
Copyright © 2006 ACM 1-59593-309-3/06/0009. $5.00.

1. Introduction

A complete module system for a typed functional language must support a wide range of features for managing types and organizing programs, including generativity, translucency, and the sharing of types, as well as nesting, recursive dependency, and flexible composition of the modules themselves. In particular, module composition should support two distinct linking mechanisms: internal linking and external linking.

Internal linking supports definite reference to a specific implementation of an interface. Internally linked module systems are common; examples include Java's packages and classes, ML's structures, and Haskell's modules. In each of these systems, two modules are linked when one directly mentions the name of another, either with a dotted path (ModuleName.x) or with an import statement (import ModuleName).

External linking supports parameterized reference to an arbitrary implementation of an interface. ML's module system includes a functor construct that supports external linking. A functor declares an interface over which its definitions are parameterized; the parameterization is resolved outside of the functor with a functor application, in analogy to function application. A functor application supplies a structure that satisfies the functor's declared interface, and it produces another structure. Thus, the functor and structure constructs are tied closely together by the functor application linking mechanism.

Unlike functions, functors cannot be recursively defined, which keeps them from expressing patterns where mutually dependent program features are split across component [30] boundaries. We argue that this results from the close coupling of the internal and external linking mechanisms. Each structure produced by a functor application is potentially accessible with a direct link, which means that its contents must be fully defined, even if it is not intended to be the final result of a sequence of applications.

This paper presents a module system based on our experience developing PLT Scheme. The system supports both internal linking and recursive external linking, and it uses a different construct for each. We call the structure-like construct module, and the functor-like component construct unit. The coupling between modules and units is looser than between structures and functors; external linkages between units can be resolved incrementally without producing intervening modules. Loose coupling is crucial in allowing units to naturally support compositions that contain cyclic linkages. It also permits us to address compilation in the module layer, and place compilation management functionality there without complicating the component layer.

The unit construct extends our previous model of units [13] to support cooperation with modules, and to handle translucent and opaque type imports and exports. We explain our module-unit separation, and relate its expressive power to that of the SML module system. In the process, we offer an alternate semantics for the SML module system through compilation into module and unit. For SML programmers, our alternate view may offer insights into enabling mutual recursion among SML functors. For non-SML programmers, our alternate semantics hopefully complements the existing semantics of SML and of units to foster a deeper understanding of key module and component system technologies.

Section 2 introduces units and modules and shows how they complement each other. Section 3 contains several examples of the expressiveness of units. Section 4 relates units and modules to functors and structures, and gives a translation from a model of generative structures and functors into modules and units. Section 5 discusses necessary extensions to our model for practical programming. The type system for units (Section 6) adapts ideas from type systems for SML modules, in particular manifest types [20], to support sharing and translucency. Section 7 presents a reduction semantics and soundness theorem.

2. Modules and Units

Figures 1, 2, and 3 present the syntax of our language of modules and units over a simply-typed core language that includes sum and product types, value definitions, and type definitions. The module system comprises the module and path constructs, and the unit system comprises the unit and compound expressions and the invoke definition.

    m, t, x = module, type, and value names
    name    = m | t | x
    ident   = m^i | t^i | x^i
    i ranges over an infinite set of stamps
    Pm      = m^i | Pm.m | Pm.m̂
    Px      = x^i | Pm.x | Pm.x̂
    Pt      = t^i | Pm.t | Pm.t̂

Figure 1. Names, identifiers, and paths

    Ty = int | Ty + Ty | Ty × Ty | Ty → Ty | Pt
       | unitT Γ (name∗ → name∗)
    K  = ⋆
    Γ  = ε | Γ, B
    B  = x^i:Ty | t^i:K | t^i:K=Ty | m^i:M
    M  = Γ provide name∗

Figure 2. Type grammar

    Prog = M∗
    M    = module m^i provide name∗ = Ds
         | module m^i:Γ provide name∗ = Ds
    Ds   = ε | D Ds
    D    = x^i:Ty=E | t^i:K=Ty | invoke E as m^i:Γ | M
    E    = ℤ | Px | injl E | injr E | (E, E) | π1 E | π2 E
         | case E of E + E | λx^i:Ty.E | E E
         | unit import Γ export Γ. Ds
         | compound import Γ export Γ link U∗ where L∗
    U    = E:import Γ export Γ
    L    = ident ← ident

Figure 3. Term grammar

We present the language without inference, so each value and type definition comes with an annotation of its type (Ty) or kind (K), respectively. The expressions (E) of the language are typical of a simply-typed λ-calculus, with the addition of the unit system. Thus, the language has only one kind, and it has types for integer, sum, product, function, and unit values. A type can also contain a path to a type definition.

2.1 Modules

A program (Prog) is a sequence of modules (M), each of which contains a sequence (Ds) of definitions (D). Each definition's scope extends from the immediately following definition to the end of the module. Definitions are accessed from outside of the module with a path to either a value (Px) or to a type (Pt). For example, the path m1.n.t refers to the definition of type t inside of module n, which is itself inside of module m1.

As our form for managing internal linking, modules have two roles. First, module definitions can be nested for fine-grained management of name grouping and lexical scoping. Second, top-level modules naturally form the boundaries of separate compilation, because every dependency on another top-level module is explicit in the body of the module. To facilitate separate compilation for modules, we restrict a program's top-level to module definitions only and require that the dependency relation between modules be acyclic.

2.2 Units

Component programming emphasizes independent development and deployment of components, as well as component reuse. Units support independent deployment with external linking, which lets each of a component's dependencies be satisfied at its point of deployment, instead of at its point of implementation. Recursive linking forms an important part of independent deployment, because any two units can be linked, as long as they satisfy the appropriate interfaces.

Independent deployment also requires independent or compositional compilation of components (not merely separate compilation as for modules). Each of a component's indirect references must have a signature that describes the compile-time information for the reference, so that the unit can be compiled without any knowledge of other units that might be used to satisfy the import. Since all of a unit's imported values are supplied externally, and their types are part of the compile-time signature, units are naturally first-class values, similar to functions. Unit linking is an expression, similar to function application; consequently, units support dynamic, run-time linking.

A unit expression creates an atomic unit, with explicit imports and exports described by contexts (Γ). The unit's body must define each exported binding (B), and its body can refer to any imported binding. Each imported or exported binding can specify a value, opaque type, translucent type, or module. Module bindings support the grouping of imported and exported values, types, and modules. The type of a unit includes a single context that contains the unit's imported and exported bindings, and it also includes a listing of which of these bindings are imports and which are exports. The single context can interleave imported and exported bindings, allowing the types of imported values to refer to types exported from the unit.
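The behaviour described for units (first-class values, linking as an expression, and support for cyclic linkages) can be approximated in a short Python sketch. This is only a loose, untyped analogy and not the calculus of the paper: the names `even_unit`, `odd_unit`, `compound`, and `invoke` are hypothetical stand-ins chosen to echo the constructs above, a unit is modelled as a function from an import lookup to a dictionary of exports, and linking matches imports to exports by name instead of using explicit link clauses.

```python
from typing import Any, Callable, Dict

# Assumption for illustration: a "unit" is a function that receives a lookup
# for its imports and returns a dict holding its exported definitions.
Imports = Callable[[str], Any]
Unit = Callable[[Imports], Dict[str, Any]]


def even_unit(imports: Imports) -> Dict[str, Any]:
    """Imports `odd`, exports `even`."""
    def even(n: int) -> bool:
        # The import is resolved when `even` runs, not when the unit is linked,
        # which is what lets two units depend on each other.
        return True if n == 0 else imports("odd")(n - 1)
    return {"even": even}


def odd_unit(imports: Imports) -> Dict[str, Any]:
    """Imports `even`, exports `odd`."""
    def odd(n: int) -> bool:
        return False if n == 0 else imports("even")(n - 1)
    return {"odd": odd}


def compound(*units: Unit) -> Unit:
    """Link units into a new unit; names no sibling defines remain imports."""
    def linked(outer_imports: Imports) -> Dict[str, Any]:
        exports: Dict[str, Any] = {}

        def lookup(name: str) -> Any:
            # Prefer a definition exported by a linked sibling,
            # otherwise defer to whatever is supplied from outside.
            if name in exports:
                return exports[name]
            return outer_imports(name)

        for unit in units:
            exports.update(unit(lookup))
        return exports
    return linked


def invoke(unit: Unit) -> Dict[str, Any]:
    """Run a unit that should have no remaining imports."""
    def no_imports(name: str) -> Any:
        raise LookupError(f"unsatisfied import: {name}")
    return unit(no_imports)


# Cyclic linkage: each unit's import is satisfied by the other's export.
program = invoke(compound(even_unit, odd_unit))
print(program["even"](10))
print(program["odd"](7))
```

Because `compound` returns another unit, linking can proceed in stages: `compound(compound(even_unit), odd_unit)` leaves `odd` unresolved in the inner result and satisfies it in the outer one, without ever producing a fully defined intermediate module.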
