Reasoning About Programs Need: Definition of Works/Correct: a Specification (And Bugs) but Programs Fail All the Time

Good programs, broken programs? Goal: program works (does not fail) Reasoning about Programs Need: definition of works/correct: a specification (and bugs) But programs fail all the time. Why? A brief interlude on 1. Misuse of your code: caller did not meet assumptions specifications, assertions, and debugging 2. Errors in your code: mistake causes wrong computation 3. Unpredictable external problems: • Out of memory, missing file, network down, … • Plan for these problems, fail gracefully. 4. Wrong or ambiguous specification, implemented correctly Largely based on material from University of Washington CSE 331 A Bug's Life, ca. 1947 A Bug's Life Defect: a mistake in the code Think 10 per 1000 lines of industry code. We're human. -- Grace Hopper Error: incorrect computation Because of defect, but not guaranteed to be visible Failure: observable error -- program violates its specification Crash, wrong output, unresponsive, corrupt data, etc. Time / code distance between stages varies: • tiny (<second to minutes / one line of code) • or enormous (years to decades to never / millons of lines of code) "How to build correct code" Testing 1. Design and Verify Make correctness more likely or provable from the start. • Can show that a program has an error. 2. Program Defensively • Can show a point where an error causes a failure. Plan for defects and errors. • Cannot show the error that caused the failure. • make testing more likely to reveal errors as failures • Cannot show the defect that caused the error. • make debugging failures easier 3. Test and Validate Try to cause failures. • Can improve confidence that the sorts of errors/failures • provide evidence of defects/errors targeted by the tests are less likely in programs similar • or increase confidence of their absence to the tests. 4. DeBug • Cannot show absence of defects/errors/failures. Determine the cause of a failure. • Unless you can test all possible behaviors exhaustively. (Hard! Slow! Avoid!) Solve inverse problem. Usually intractable for interesting programs. (without running them) Why reason about programs statically? Reasoning about programs “Today a usual technique is to make a program and then to • Reason about a single program execution. • Concrete, dynamic: be the machine, run the program. test it. While program testing can be a very effective way to • Test or debug: important, but "too late." show the presence of bugs, it is hopelessly inadequate for showing their absence. The only effective way to raise the • Reason about all possible executions of a program. • Abstract, static: consider all possible paths at once. confidence level of a program significantly is to give a • Usually to prevent broken programs. convincing proof of its correctness. ” • Hard for whole programs, easier if program uses clean, modular abstractions. -- Edsger Dijkstra • Many compromises in between. Forward Reasoning Forward: careful witH assignment Suppose we initially know (or assume) w > 0 // we know: nothing w = x+y; // w > 0 // we know: w == x + y x = 17; x = 4; // w > 0, x == 17 y = 42; // we know: w == old x + y, x == 4 // w > 0, x == 17, y == 42 // must update other facts too... z = w + x + y; y = 3; // w > 0, x == 17, y == 42, z > 59 // we know: w == old x + old y, … // x == 4, y == 3 Then we know various things after, e.g., z > 59 // we do NOT know: w == x + y == 7 Backward Reasoning Reasoning Forward and Backward If we want z < 0 at the end Forward: • Determine what assumptions imply. // w + 17 + 42 < 0 • Ensure an invariant is maintained. x = 17; • Invariant = property that is always true // w + x + 42 < 0 y = 42; // w + x + y < 0 Backward: z = w + x + y; • Determine sufficient conditions. // z < 0 • For a desired result: What assumptions are needed for correctness? • For an undesired result: Then we need to start with w < -59 What assumptions will trigger an error/bug? Reasoning Forward and Backward Precondition and Postcondition Forward: Precondition: “assumption” before some code • Simulate code on many inputs at once. • Learn many facts about code's behavior, // pre: w < -59 • some of which may be irrelevant. x = 17; Backward: // post: w + x < -42 • Show how each part of code affects the end result. • More useful in many contexts (research, practice) Postcondition: “what holds” after some code • Closely linked with debugging If you satisfy the precondition, then you are guaranteed the postcondition. Conditionals, forward. Conditionals, backward. // pre: initial assumptions // pre: (C, X) or (!C, Y) if(...) { if(C) { // pre: && condition true // pre: X: weakest such that ... // post: X ... // post: Z } else { } else { // pre: && condition false // pre: Y: weakest such that ... // post: Y ... // post: Z } } // either branch could have executed // either branch could have executed // post: X || Y // post: need Z Weakest precondition: the minimal assumption under which the postcondition is guaranteed to be true. Conditional, backward Is static reasoning enougH? // 9. pre: x <= -3 or (3 <= x, x < 5) or 8 <= x • // 8. pre: (x <= -3, x < 5) or (3 <= x, x < 5) Can learn things about the program we have. // or 8 <= x • Basis for human proofs, limited automated // 7. pre: (x < 5, (x <= -3 or 3 <= x)) // or 8 <= x reasoning. // 6. pre: (x < 5, 9 <= x*x) or 8 <= x • Compilers check types, do correct optimizations. // 5. pre: (x < 5, 9 <= x*x) or (5 <= x, 8 <= x) if (x < 5) { • Many static program analysis techniques // 4. pre: 9 <= x*x x = x*x; • Proving entire program correct is HARD! // 2. post: 9 <= x } else { // 3. pre: 8 <= x x = x+1; // 2. post: 9 <= x • Should also write down things we expect to be true } // 1. post: 9 <= x -4 -3 -2 -1 0 1 2 3 4 5 6 7 8 9 "How to build correct code" What to do wHen things go wrong 1. Design and Verify Early, informative failures Make correctness more likely or provable from the start. 2. Program Defensively Goal 1: Give information about the problem Plan for defects and errors. • To the programmer – descriptive error message • make testing more likely to reveal errors as failures • To the client code: exception, return value, etc. • make debugging failures easier 3. Test and Validate Goal 2: Prevent harm Try to cause failures. Whatever you do, do it early: before small error causes big problems • provide evidence of defects/errors Abort: alert human, cleanup, log the error, etc. • or increase confidence of their absence Re-try if safe: problem might be transient 4. DeBug Skip a subcomputation if safe: just keep going Determine the cause of a failure. Fix the problem? Usually infeasible to repair automatically (Hard! Slow! Avoid!) Solve inverse problem. There are two ways of constructing a software design: Defend your code One way is to make it so simple that there are obviously no deficiencies, 1. Make errors impossible with type safety, memory safety (not C!). and the other way is to make it so complicated that there are no obvious deficiencies. 2. Do not introduce defects, make reasoning easy with simple code. • KISS = Keep It Simple, Stupid The first method is far more difficult. 3. Make errors immediately visible with assertions. -- Sir Anthony Hoare, Turing Award winner • Reduce distance from error to failure 4. Debug (last resort!): find defect starting from failure Debugging is twice as hard as writing the code in the • Easiest in modular programs with good specs, test suites, assertions • Use scientific method to gain information. first place. Therefore, if you write the code as cleverly as possible, you are, by definition, not smart enough to debug it. • Analogy to health/medicine: -- Brian Kernighan, author of The C Programming Language book, much more wellness/prevention vs. diagnosis/treatment Defensive programming, testing Square root witH assertion // requires: x >= 0 Check: // returns: approximation to square root of x • Precondition and Postcondition double sqrt(double x) { • Representation invariant assert(x >= 0.0); • Other properties that should be true double result; ... compute square root ... Check statically via reasoning and tools assert(absValue(result*result – x) < 0.0001); Check dynamically via assertions return result; assert(index >= 0); } assert(array != null); assert(size % 2 == 0); Write assertions as you write code Write many tests and run them often Don’t go to sea witHout your lifejacket! When not to use assertions Don't check for user input errors with assertions. User errors are Finally, it is absurd to make elaborate security checks on debugging runs, expected situations that programs must handle. when no trust is put in the results, and then remove them in production // assert(!isEmpty(zipCode)); // XX NO XX runs, when an erroneous result could be expensive or disastrous. What would we think of a sailing enthusiast who wears his lifejacket when if (isEmpty(zipCode)) { training on dry land, but takes it off as soon as he goes to sea? handleUserError(...); Hints on Programming Language Design } -- C.A.R. Hoare Don’t clutter code with useless, distracting repetition x = y + 1; // assert(x == y + 1); // XX NO XX Don’t perform side effects, won’t happen if assertions disabled. // assert(array[i]++ != 42); // XX NO XX array[i]++; // part of the program logic assert(array[i] != 42); printf(array[i]); Last Resort: Principled Debugging Principled Debugging 1. Find small, repeatable test case that produces the failure 2. Narrow down location and proximate cause Observe • Scientific Method: observe, hypothesize, experiment, analyze Form Hypothesis • Keep a record 3. Fix the defect (and test the fix!) Design

Reasoning About Programs Need: Definition of Works/Correct: a Specification (And Bugs) but Programs Fail All the Time

The AWK Programming Language

The Evolution of the Unix Time-Sharing System*

Computer Science Undergraduate Program

Oral History of Brian Kernighan

USENIX April 06

The AWK Manual Edition 1.0 December 1995

The A-Z of Programming Languages: AWK

The A-Z of Programming Languages (Interviews with Programming Language Creators)

Operating Systems

COMSW 1003-1 Introduction to Computer Programming in C

The Development of the C Languageߤ

A Research UNIX Reader: Annotated Excerpts from the Programmer's