Automated Techniques for Creation and Maintenance of Typescript Declaration Files

Total Page:16

File Type:pdf, Size:1020Kb

Automated Techniques for Creation and Maintenance of Typescript Declaration Files Automated Techniques for Creation and Maintenance of TypeScript Declaration Files Erik Krogh Kristensen PhD Dissertation Department of Computer Science Aarhus University Denmark Automated Techniques for Creation and Maintenance of TypeScript Declaration Files A Dissertation Presented to the Faculty of Science and Technology of Aarhus University in Partial Fulfillment of the Requirements for the PhD Degree by Erik Krogh Kristensen August 8, 2019 Abstract JavaScript was initially a scripting language for creating small interactive web pages. However, massive applications, both web pages and server ap- plications, are increasingly being developed using JavaScript. The dynamic nature of the JavaScript language makes it hard to create proper tooling with features such as auto-completion and code-navigation. TypeScript is a JavaScript superset that adds, on top of JavaScript, an optional static type system, which facilitates features such as auto-completion, code navigation, and detection of type errors. However, many TypeScript applications still use untyped libraries written in JavaScript. Developers or users of these JavaScript libraries can choose to write TypeScript declaration files, which provide API models of the libraries and are used to type-check TypeScript applications. These declaration files are, however, written manually and of- ten not by the original authors of the library, which leads to mistakes that can misguide TypeScript application developers and ultimately cause bugs. The goal of this thesis is to design automated techniques for assisting in the development of TypeScript declaration files. This thesis identifies several challenges faced by developers of TypeScript declaration files and tackles these challenges using techniques from programming language re- search. Type inference is used to create new, and update existing, decla- ration files. Automated testing is used to detect errors in declaration files. Finally, data-flow analysis and a novel concept of reasonably most-general- clients are used to verify the absence of errors in declaration files. Each of the techniques is used to improve the quality of real-world declaration files. i Resumé JavaScript startede som et scriptingsprog til at lave små interaktive hjemmesider. Dog udvikles større applikationer, både hjemmesider og andre applikationer, i stigende grad ved brug af JavaScript. JavaScript- sprogets dynamiske karakter gør det vanskeligt at skabe fornuftige udviklingsmiljøer med funktioner som automatisk kode-fuldendelse og kode-navigation. TypeScript er et superset af JavaScript, der udover JavaScript tilføjer et valgfrit system af statiske typer, der muliggør funktioner som automatisk kode-fuldendelse, kode-navigation og detektion af typefejl. Mange TypeScript-applikationer bruger dog stadig ikke-typede biblioteker skrevet i JavaScript. Udviklere eller brugere af disse JavaScript-biblioteker kan vælge at skrive TypeScript-deklarationsfiler, der leverer API-modeller af bibliotekerne og bruges til at kontrollere typerne i TypeScript-applikationer. Disse deklarationsfiler er dog skrevet manuelt og ofte ikke af de originale forfattere af biblioteket, hvilket kan medføre fejltagelser, der kan vildlede TypeScript-applikationsudviklere og i sidste ende forårsage fejl. Målet med denne afhandling er at designe automatiserede teknikker til at assistere i udviklingen af TypeScript-deklarationsfiler. Denne afhandling identificerer flere udfordringer, som udviklere af TypeScript- deklarationsfiler står overfor, og håndterer disse udfordringer ved hjælp af teknikker fra programmeringssprogforskning. Type inferens bruges til at oprette nye og opdatere eksisterende deklarationsfiler. Automatisk testning bruges til at opdage fejl i deklarationsfiler. Endeligt bruges datastrømningsanalyse og et nyt koncept af rimelige-mest-generelle klienter til at verificere fraværet af fejl i deklarationsfiler. Hver af teknikkerne bruges til at forbedre kvaliteten af faktisk benyttede deklarationsfiler. iii Acknowledgments I would like to thank the following people for their support and guidance during my Ph.D. studies. Anders Møller for being an attentive and hard-working supervisor, for helping me navigate the world of academia, for letting me run with my ideas, and for significantly improving the quality of my writing. Esben An- dreasen for all the excellent office discussions we had. Gianluca Mezzetti for being a coding guru without whom this project would have taken longer and been worse. Søren Eller Thomsen for dragging me to the coffee machine for a refill and otherwise interrupting my work. The rest of the Programming Languages research group and various people from Aarhus University for providing an excellent work and study environment with room for learn- ing, discussions, and beer. Eric Bodden and his research group for the great time I had while visiting the Heinz Nixdorf Institut. Finally, my family for supporting me despite not being sure what I really do. Erik Krogh Kristensen, Aarhus, August 8, 2019. v Contents Abstract i Resumé iii Acknowledgments v Contents vii I Overview 1 1 Introduction 3 1.1 Challenges . 7 1.2 Thesis statement . 8 1.3 Contributions . 8 1.4 Outline . 9 2 JavaScript and TypeScript 11 2.1 JavaScript . 11 2.2 TypeScript . 19 2.3 TypeScript Declaration Files . 30 2.4 Errors in TypeScript Declaration Files . 31 3 Automated Program Analysis 35 3.1 Type Analysis . 35 3.2 Unification-Based Type Inference . 36 3.3 Subtyping-Based Type Inference . 40 3.4 Data-Flow Analysis . 44 3.5 Dynamic Analysis . 50 4 Conclusion 55 4.1 Future work . 57 vii viii CONTENTS II Publications 59 5 Inference and Evolution of TypeScript Declaration Files 61 5.1 Abstract . 61 5.2 Introduction . 61 5.3 Motivating Examples . 63 5.4 TSinfer: Inference of Initial Type Declarations . 66 5.5 TSevolve: Evolution of Type Declarations . 69 5.6 Implementation . 69 5.7 Experimental Evaluation . 73 5.8 Related Work . 82 5.9 Conclusion . 82 6 Type Test Scripts for TypeScript Testing 83 6.1 Abstract . 83 6.2 Introduction . 84 6.3 Motivating Examples . 86 6.4 Basic Approach . 89 6.5 Challenges and Design Choices . 93 6.6 Soundness and (Conditional) Completeness . 102 6.7 Experimental Evaluation . 104 6.8 Related Work . 114 6.9 Conclusion . 116 6.10 Appendix . 117 7 Reasonably-Most-General Clients for JavaScript Library Analysis 119 7.1 Abstract . 119 7.2 Introduction . 119 7.3 Motivating example . 122 7.4 Reasonably-Most-General Clients . 125 7.5 Abstract Domains in Static Type Analysis . 130 7.6 Using RMGCs in Static Type Analysis . 130 7.7 Evaluation . 136 7.8 Evaluation on larger benchmarks . 142 7.9 Related Work . 144 7.10 Conclusion . 145 Bibliography 147 Part I Overview 1 Chapter 1 Introduction Optionally typed languages, which mix static and dynamic typing, are in- creasingly being used by developers, with languages such as TypeScript, Python, and Groovy seeing an increase in contributors within a year on GitHub of 90%, 50%, and 40% respectively [4]. Many developers prefer to use TypeScript or Python with both being in the top 3 most loved languages according to Stack Overflow [7]. These optionally typed languages allow programmers to mix dynami- cally typed code, where type checking is deferred until runtime, with stati- cally typed code, which can be type-checked before the program is executed. Some of the patterns used in dynamic programming languages can be very hard to express in statically typed languages [30], however, the tool support for these dynamic languages often lacks compared to statically typed lan- guages. By mixing dynamic and static types, an optionally typed language gives programmers the option of using the programming style that fits the best when solving a problem. Developers writing typed applications in optionally typed languages of- ten want to use existing untyped libraries, and in languages such as Type- Script or Python typed API models describing the behavior of the untyped libraries are often available. These API models facilitate type checking of typed applications that use the untyped libraries, and the API models en- able precise auto-completion and code-navigation in IDEs. However, the API models are ignored at runtime, where the untyped library is executed without any regard for the types in the API models. For TypeScript and Python, these API models are called declaration files and stub files, respec- tively. Both languages have official repositories containing API models for a large collection of libraries.1 1For Python this collection is called typeshed [15], and for TypeScript the collection is called DefinitelyTyped [14]. 3 4 CHAPTER 1. INTRODUCTION Some optionally typed languages are also gradually typed. A gradually typed language adds, on top of being optionally typed, type checks at the boundaries between statically and dynamically typed code [102, 103]. These type checks detect at runtime whether the value of a variable matches the type annotation for the variable. Programmers can, due to the type checks, trust type annotations on variables to be correct at runtime, and that no type errors will happen inside fully annotated code where dynamic types are not used. Most optionally typed languages, including all previously mentioned, are not gradually typed, as these languages do not insert type checks on all boundaries between statically and dynamically typed code. What sets a gradually typed language apart from languages such as Groovy and Dart, which has type checks at variable assignments, is that a gradually typed language can track blame on higher-order functions,
Recommended publications
  • Enabling RUST for UEFI Firmware
    presented by Enabling RUST for UEFI Firmware UEFI 2020 Virtual Plugfest August 20, 2020 Jiewen Yao & Vincent Zimmer, Intel Corporation www.uefi.org 1 Jiewen Yao • Jiewen Yao is a principal engineer in the Intel Architecture, Graphics, and Software Group. He has been engaged as a firmware developer for over 15 years. He is a member of the UEFI Security sub team, and the TCG PC Client sub working group. www.uefi.org 2 Vincent Zimmer • Vincent Zimmer is a senior principal engineer in the Intel Architecture, Graphics, and Software Group. He has been engaged as a firmware developer for over 25 years and leads the UEFI Security sub team. www.uefi.org 3 Agenda • EDKII Security Summary • RUST Language • Enabling RUST for EDKII • Summary / Call to Action www.uefi.org 4 EDKII Security Summary www.uefi.org 5 BIOS Memory Issue in Hack Conf www.uefi.org 6 BIOS Security Bug Top Issue Open Source Close Source Buffer Overflow/ 50% 38% Integer Overflow SMM 7% 18% Variable 8% 5% Register Lock 3% 10% www.uefi.org 7 Vulnerabilities in C/C++ Source: Trends, challenges, and strategic shifts in the software vulnerability mitigation landscape – Microsoft, Bluehat IL 2019 www.uefi.org 8 Firmware as Software • Many software issues are also firmware issues. – Buffer Overflow – Integer Overflow – Uninitialized Variable • Software mitigation can be used for firmware mitigation. – (See next page) www.uefi.org 9 3 Levels of Prevention Prevention Method EDKII Open Source Example Eliminate Reduce Attack Surface SMI Handler Profile Vulnerability Static Analysis / Dynamic
    [Show full text]
  • Beyond BIOS Developing with the Unified Extensible Firmware Interface
    Digital Edition Digital Editions of selected Intel Press books are in addition to and complement the printed books. Click the icon to access information on other essential books for Developers and IT Professionals Visit our website at www.intel.com/intelpress Beyond BIOS Developing with the Unified Extensible Firmware Interface Second Edition Vincent Zimmer Michael Rothman Suresh Marisetty Copyright © 2010 Intel Corporation. All rights reserved. ISBN 13 978-1-934053-29-4 This publication is designed to provide accurate and authoritative information in regard to the subject matter covered. It is sold with the understanding that the publisher is not engaged in professional services. If professional advice or other expert assistance is required, the services of a competent professional person should be sought. Intel Corporation may have patents or pending patent applications, trademarks, copyrights, or other intellectual property rights that relate to the presented subject matter. The furnishing of documents and other materials and information does not provide any license, express or implied, by estoppel or otherwise, to any such patents, trademarks, copyrights, or other intellectual property rights. Intel may make changes to specifications, product descriptions, and plans at any time, without notice. Fictitious names of companies, products, people, characters, and/or data mentioned herein are not intended to represent any real individual, company, product, or event. Intel products are not intended for use in medical, life saving, life sustaining, critical control or safety systems, or in nuclear facility applications. Intel, the Intel logo, Celeron, Intel Centrino, Intel NetBurst, Intel Xeon, Itanium, Pentium, MMX, and VTune are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.
    [Show full text]
  • Precise Null Pointer Analysis Through Global Value Numbering
    Precise Null Pointer Analysis Through Global Value Numbering Ankush Das1 and Akash Lal2 1 Carnegie Mellon University, Pittsburgh, PA, USA 2 Microsoft Research, Bangalore, India Abstract. Precise analysis of pointer information plays an important role in many static analysis tools. The precision, however, must be bal- anced against the scalability of the analysis. This paper focusses on improving the precision of standard context and flow insensitive alias analysis algorithms at a low scalability cost. In particular, we present a semantics-preserving program transformation that drastically improves the precision of existing analyses when deciding if a pointer can alias Null. Our program transformation is based on Global Value Number- ing, a scheme inspired from compiler optimization literature. It allows even a flow-insensitive analysis to make use of branch conditions such as checking if a pointer is Null and gain precision. We perform experiments on real-world code and show that the transformation improves precision (in terms of the number of dereferences proved safe) from 86.56% to 98.05%, while incurring a small overhead in the running time. Keywords: Alias Analysis, Global Value Numbering, Static Single As- signment, Null Pointer Analysis 1 Introduction Detecting and eliminating null-pointer exceptions is an important step towards developing reliable systems. Static analysis tools that look for null-pointer ex- ceptions typically employ techniques based on alias analysis to detect possible aliasing between pointers. Two pointer-valued variables are said to alias if they hold the same memory location during runtime. Statically, aliasing can be de- cided in two ways: (a) may-alias [1], where two pointers are said to may-alias if they can point to the same memory location under some possible execution, and (b) must-alias [27], where two pointers are said to must-alias if they always point to the same memory location under all possible executions.
    [Show full text]
  • Practical C/C++ Programming Part II
    Practical C/C++ programming Part II Wei Feinstein HPC User Services Louisiana State University 7/11/18 Practical C/C++ programming II 1 Topics • Pointers in C • Use in functions • Use in arrays • Use in dynamic memory allocation • Introduction to C++ • Changes from C to C++ • C++ classes and objects • Polymorphism • Templates • Inheritance • Introduction to Standard Template Library (STL) 7/11/18 Practical C/C++ programming II 2 What is a pointer? • A pointer is essentially a variable whose value is the address of another variable. • Pointer “points” to a specific part of the memory. • Important concept in C programming language. Not recommended in C++, yet understanding of pointer is necessary in Object Oriented Programming • How to define pointers? int *i_ptr; /* pointer to an integer */ double *d_ptr; /* pointer to a double */ float *f_ptr; /* pointer to a float */ char *ch_ptr; /* pointer to a character */ int **p_ptr; /* pointer to an integer pointer */ 7/11/18 Practical C/C++ programming II 3 Pointer Operations (a) Define a pointer variable. (b) Assign the address of a variable to a pointer. & /* "address of" operator */ (c) Access the value pointed by the pointer by dereferencing * /* “dereferencing" operator */ Examples: int a = 6; int *ptr; ptr = &a; /* pointer p point to a */ *ptr = 10; /* dereference pointer p reassign a value*/ var_name var_address var_value ptr 0x22aac0 0xXXXX a 0xXXXX 6 7/11/18 Practical C/C++ programming II 4 Pointer Example int b = 17; int *p; /* initialize pointer p */ p = &b; /*pointed addr and value,
    [Show full text]
  • Coverity Static Analysis
    Coverity Static Analysis Quickly find and fix Overview critical security and Coverity® gives you the speed, ease of use, accuracy, industry standards compliance, and quality issues as you scalability that you need to develop high-quality, secure applications. Coverity identifies code critical software quality defects and security vulnerabilities in code as it’s written, early in the development process when it’s least costly and easiest to fix. Precise actionable remediation advice and context-specific eLearning help your developers understand how to fix their prioritized issues quickly, without having to become security experts. Coverity Benefits seamlessly integrates automated security testing into your CI/CD pipelines and supports your existing development tools and workflows. Choose where and how to do your • Get improved visibility into development: on-premises or in the cloud with the Polaris Software Integrity Platform™ security risk. Cross-product (SaaS), a highly scalable, cloud-based application security platform. Coverity supports 22 reporting provides a holistic, more languages and over 70 frameworks and templates. complete view of a project’s risk using best-in-class AppSec tools. Coverity includes Rapid Scan, a fast, lightweight static analysis engine optimized • Deployment flexibility. You for cloud-native applications and Infrastructure-as-Code (IaC). Rapid Scan runs decide which set of projects to do automatically, without additional configuration, with every Coverity scan and can also AppSec testing for: on-premises be run as part of full CI builds with conventional scan completion times. Rapid Scan can or in the cloud. also be deployed as a standalone scan engine in Code Sight™ or via the command line • Shift security testing left.
    [Show full text]
  • Declare Property Class Accept Null C
    Declare Property Class Accept Null C Woesome and nontechnical Joshuah planned inveterately and pull-outs his frontiers sourly and daftly. Unquiet Bernard fly-by very instructively while Rick remains ectotrophic and chastened. Sometimes stereoscopic Addie unnaturalizes her acorns proportionally, but unlidded Cat invert heinously or orientalizes rancorously. Even experts are accepted types and positional parameters and references, and assigns to. Use HasValue property to check has value is assigned to nullable type sometimes not Static Nullable class is a. Thank you declare more freely, declare property class accept null c is useful to provide only one dispiriting aspect of. Here we're defining a suggest that extracts all non-nullable property keys from plant type. By disabling cookies to accept all? The car variable inside counter Person class shouldn't be declared type grass and. Use JSDoc type. Any class property name, null safe code token stream of properties and corresponds to accept a class! Why death concept of properties came into C The decline because not two reasons If the members of a class are private then select another class in C. JavaScript Properties of variables with null or undefined. Type cup type as should pickle the null value variable the pastry of the variable to which null. CS31 Intro to C Structs and Pointers. Using the New Null Conditional Operator in C 6 InformIT. Your extra bet is to get themselves the good group of initializing all variables when you disabled them. This class that null values can declare variables declared inside an exception handling is nullable context switches a varargs in.
    [Show full text]
  • Mixed Messages: Measuring Conformance and Non-Interference in Typescript
    Mixed Messages: Measuring Conformance and Non-Interference in TypeScript Jack Williams1, J. Garrett Morris2, Philip Wadler3, and Jakub Zalewski4 1 University of Edinburgh, Edinburgh, Scotland [email protected], 2 University of Edinburgh, Edinburgh, Scotland [email protected] 3 University of Edinburgh, Edinburgh, Scotland [email protected] 4 University of Edinburgh, Edinburgh, Scotland [email protected] Abstract TypeScript participates in the recent trend among programming languages to support gradual typing. The DefinitelyTyped Repository for TypeScript supplies type definitions for over 2000 popular JavaScript libraries. However, there is no guarantee that implementations conform to their corresponding declarations. We present a practical evaluation of gradual typing for TypeScript. We have developed a tool for use with TypeScript, based on the polymorphic blame calculus, for monitoring JavaScript libraries and TypeScript clients against the TypeScript definition. We apply our tool, TypeScript TPD, to those libraries in the DefinitelyTyped Repository which had adequate test code to use. Of the 122 libraries we checked, 62 had cases where either the library or its tests failed to conform to the declaration. Gradual typing should satisfy non-interference. Monitoring a program should never change its behaviour, except to raise a type error should a value not conform to its declared type. However, our experience also suggests serious technical concerns with the use of the JavaScript proxy mechanism for enforcing contracts. Of the 122 libraries we checked, 22 had cases where the library or its tests violated non-interference. 1998 ACM Subject Classification D.2.5 [Software Engineering]: Testing and Debugging Keywords and phrases Gradual Typing, TypeScript, JavaScript, Proxies Digital Object Identifier 10.4230/LIPIcs.ECOOP.2017.39 1 Introduction We have good news and we have bad news.
    [Show full text]
  • Chapter 9 Pointers
    Chapter 9 Pointers Objectives ❏ To understand the concept and use of pointers ❏ To be able to declare, define, and initialize pointers ❏ To write programs that access data through pointers ❏ To use pointers as parameters and return types ❏ To understand pointer compatibility, especially regarding pointers to pointers ❏ To understand the role of quality in software engineering Computer Science: A Structured Programming Approach Using C 1 FIGURE 9-1 Derived Types Computer Science: A Structured Programming Approach Using C 2 9-1 Introduction A pointer is a constant or variable that contains an address that can be used to access data. Pointers are built on the basic concept of pointer constants. Topics discussed in this section: Pointer Constants Pointer Values Pointer Variables Accessing Variables Through Pointers Pointer Declaration and Definition Declaration versus Redirection Initialization of Pointer Variables Computer Science: A Structured Programming Approach Using C 3 FIGURE 9-2 Character Constants and Variables Computer Science: A Structured Programming Approach Using C 4 FIGURE 9-3 Pointer Constants Computer Science: A Structured Programming Approach Using C 5 Note Pointer constants, drawn from the set of addresses for a computer, exist by themselves. We cannot change them; we can only use them. Computer Science: A Structured Programming Approach Using C 6 Note An address expression, one of the expression types in the unary expression category, consists of an ampersand (&) and a variable name. Computer Science: A Structured Programming Approach Using C 7 FIGURE 9-4 Print Character Addresses Computer Science: A Structured Programming Approach Using C 8 Note A variable’s address is the first byte occupied by the variable.
    [Show full text]
  • A Name for the Null Pointer: Nullptr (Revision 4) 1. the Problem, And
    Doc No: SC22/WG21/N2431 = J16/07-0301 Date: 2007-10-02 Project: JTC1.22.32 References: Revision of SC22/WG21/N2214 = J16/07-0074 Reply to: Herb Sutter Bjarne Stroustrup Microsoft Corp. Computer Science Dept. 1 Microsoft Way Texas A&M University, TAMU 3112 Redmond WA USA 98052 College Station TX USA 77843-3112 Email: [email protected] Email: [email protected] A name for the null pointer: nullptr (revision 4) 1. The Problem, and Current Workarounds The current C++ standard provides the special rule that 0 is both an integer constant and a null pointer constant. From [C++03] clause 4.10: A null pointer constant is an integral constant expression (expr.const) rvalue of integer type that evaluates to zero. A null pointer constant can be converted to a pointer type; the result is the null pointer value of that type and is distinguishable from every other value of pointer to object or pointer to function type. Two null pointer values of the same type shall compare equal. The conversion of a null pointer constant to a pointer to cv-qualified type is a single conver- sion, and not the sequence of a pointer conversion followed by a qualification conversion (conv.qual). This formulation is based on the original K&R C definition and differs from the definition in C89 and C99. The C standard [C99] says (clause 6.3.2.3): An integer constant expression with the value 0, or such an expression cast to type void *, is called a null pointer constant.[55] If a null pointer constant is converted to a pointer type, the resulting pointer, called a null pointer, is guaranteed to compare unequal to a pointer to any object or function.
    [Show full text]
  • Object Oriented Programming in Objective-C 2501ICT/7421ICT Nathan
    Subclasses, Access Control, and Class Methods Advanced Topics Object Oriented Programming in Objective-C 2501ICT/7421ICT Nathan René Hexel School of Information and Communication Technology Griffith University Semester 1, 2012 René Hexel Object Oriented Programming in Objective-C Subclasses, Access Control, and Class Methods Advanced Topics Outline 1 Subclasses, Access Control, and Class Methods Subclasses and Access Control Class Methods 2 Advanced Topics Memory Management Strings René Hexel Object Oriented Programming in Objective-C Subclasses, Access Control, and Class Methods Subclasses and Access Control Advanced Topics Class Methods Objective-C Subclasses Objective-C Subclasses René Hexel Object Oriented Programming in Objective-C Subclasses, Access Control, and Class Methods Subclasses and Access Control Advanced Topics Class Methods Subclasses in Objective-C Classes can extend other classes @interface AClass: NSObject every class should extend at least NSObject, the root class to subclass a different class, replace NSObject with the class you want to extend self references the current object super references the parent class for method invocations René Hexel Object Oriented Programming in Objective-C Subclasses, Access Control, and Class Methods Subclasses and Access Control Advanced Topics Class Methods Creating Subclasses: Point3D Parent Class: Point.h Child Class: Point3D.h #import <Foundation/Foundation.h> #import "Point.h" @interface Point: NSObject @interface Point3D: Point { { int x; // member variables int z; // add z dimension
    [Show full text]
  • Towards Support for Non-Null Types and Non-Null-By- Default in Java 1
    Towards Support for Non-null Types and Non-null-by- default in Java Patrice Chalin Dept. of Computer Science and Software Engineering, Dependable Software Research Group, Concordia University Montréal, Québec, Canada [email protected] Abstract. This paper begins with a survey of current programming language support for non-null types and annotations, with a particular focus on extensions to Java. With the advent of Java 5 annotations, we note a marked increase in the availability of tools that can statically detect potential null dereferences. For such tools to be truly effective, they require that developers annotate declarations with nullity modifiers. Unfortunately, it has been our experience in specifying moderately large code bases, that the use of non-null annotations is more labor intensive than it should be. Hence, backed by an empirical study, we recently outlined a proposal for adapting the Java Modeling Language (JML) to support a non-null-by-default semantics. This paper extends our earlier work by providing a detailed description of the changes made to JML in support of the new default. Finally, in relation to our survey, we justify the claim that JML is one of the most fully developed solutions for bringing support for non-null annotations/types to Java. Keywords: non-null annotations, non-null types, Java, JML, non-null by default. 1 Introduction Null pointer exceptions are among the most common faults raised by components written in mainstream imperative languages. Developers increasingly have at their disposal tools that can detect possible null dereferences (among other things) by means of static analysis of component source.
    [Show full text]
  • Assignment 3 Pointers and Program Analysis
    Assignment 3 Pointers and Program Analysis 15-411: Compiler Design Rob Arnold (rdarnold@andrew) and Eugene Marinelli (emarinel@andrew) Due: Thursday, October 9, 2008 (1:30 pm) Reminder: Assignments are individual assignments, not done in pairs. The work must be all your own. You may hand in a handwritten solution or a printout of a typeset solution at the beginning of lecture on Thursday, October 9. Please read the late policy for written assignments on the course web page. If you decide not to typeset your answers, make sure the text and pictures are legible and clear. Problem 1 [15 points] As discussed in class, constant propagation can be performed by first computing reaching definitions. However, it is also possible to perform constant propagation directly as a forward dataflow analysis. In this problem, we will explore a sort of constant propagation for pointer values, keeping track of some information about the value that a pointer variable x may have. We write val(l, x, V ) if the value of the pointer variable x at line l in the programs may be V , where V is either NULL or !NULL. This means we don’t track the precise value (which would be impossible), but only a sound approximation of the possible values of x at line l. As in Lectures 4 and 5, we mean the value of x before the statement at line l is executed. The val predicate is seeded at assignments according to the following rules. l : x ← e l : x ← NULL e 6= NULL l : x ← alloc(k) succ(l, l0) succ(l, l0) succ(l, l0) val(l0, x, NULL) val(l0, x, NULL) val(l0, x, !NULL) val(l0, x, !NULL) In the middle rule we have two conclusions, because we cannot predict the value of e in this simple analysis and so x may be null or it may not be null.
    [Show full text]