Intermittent Computation Without Hardware Support Or Programmer

Intermittent Computation without Hardware Support or Programmer Intervention Joel Van Der Woude, Sandia National Laboratories; Matthew Hicks, University of Michigan https://www.usenix.org/conference/osdi16/technical-sessions/presentation/vanderwoude This paper is included in the Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’16). November 2–4, 2016 • Savannah, GA, USA ISBN 978-1-931971-33-1 Open access to the Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation is sponsored by USENIX. Intermittent Computation without Hardware Support or Programmer Intervention Joel Van Der Woude Matthew Hicks Sandia National Laboratories∗ University of Michigan Abstract rapid changes drive us closer to the realization of smart As computation scales downward in area, the limi- dust [20], enabling applications where the cost and size tations imposed by the batteries required to power that of computation had previously been prohibitive. We are computation become more pronounced. Thus, many fu- rapidly approaching a world where computers are not ture devices will forgo batteries and harvest energy from just your laptop or smart phone, but are integral parts their environment. Harvested energy, with its frequent your clothing [47], home [9], or even groceries [4]. power cycles, is at odds with current models of long- Unfortunately, while the smaller size and lower cost of running computation. microcontrollers enables new applications, their ubiqui- To enable the correct execution of long-running appli- tous adoption is limited by the form factor and expense of cations on harvested energy—without requiring special- batteries. Batteries take up an increasing amount of space purpose hardware or programmer intervention—we pro- and weight in an embedded system and require special pose Ratchet. Ratchet is a compiler that adds lightweight thought to placement in order to facilitate replacing bat- checkpoints to unmodified programs that allow exist- teries when they die [20]. In addition, while Moore’s ing programs to execute across power cycles correctly. Law drives the development of more powerful comput- Ratchet leverages the idea of idempotency, decompos- ers, batteries have not kept pace with the scaling of ing programs into a continuous stream of re-executable computation [18]. Embedded systems designers attempt sections connected by lightweight checkpoints, stored to address this growing gap by leveraging increasingly in non-volatile memory. We implement Ratchet on top power-efficient processors and design practices [49]. Un- of LLVM, targeted at embedded systems with high- fortunately, these advances have hit a wall—the battery performance non-volatile main memory. Using eight wall; enabling a dramatic change in computing necessi- embedded systems benchmarks, we show that Ratchet tates moving to batteryless devices. correctly stretches program execution across frequent, Batteryless devices, instead of getting energy from the random power cycles. Experimental results show that power grid or a battery, harvest their energy from their Ratchet enables a range of existing programs to run on environment (e.g., sunlight or radio waves). In fact, intermittent power, with total run-time overhead averag- the first wave of energy harvesting devices is available ing below 60%—comparable to approaches that require today [4, 50, 9]. These first generation devices prove hardware support or programmer intervention. that it is possible to compute on harvested energy. This 1 Introduction affords system designers the novel opportunity to remove a major cost and limitation to system scaling. Improvements in the design and development of com- Unfortunately, while harvested energy represents an puting hardware have driven hardware size and cost to opportunity for system designers, it represents a chal- rapidly shrink as performance improves. While early lenge for software developers. The challenge comes computers took up entire rooms, emerging computers are from the nature of harvested energy: energy harvest- millimeter-scale devices with the hopes of widespread ing provides insufficient power to perform long-running, deployment for sensor network applications [23]. These continuous, computation [37]. This results in frequent ∗Work completed while at the University of Michigan power losses, forcing a program to restart from the be- USENIX Association 12th USENIX Symposium on Operating Systems Design and Implementation 17 wasting energy, the power monitoring hardware itself consumes power. Even simple power monitoring circuits (think 1-bit voltage level detector) consume power up to 33% of the power of modern ultra-low-power microcontrollers [5, 43]. An alternative approach, as taken by DINO [26], is to forgo specialized hardware, instead, placing the burden on the programmer to reason about the possible outcomes of frequent, random, power failures. DINO requires that programmers divide programs into a series of checkpoint-connected tasks. These tasks then use data versioning to ensure that power cycles do not violate memory consistency. Smaller tasks increase the like- lihood of eventual completion, at the cost of increased overhead. Larger tasks result in fewer checkpoints, but Figure 1: Energy harvesting devices replace batteries with a small energy storage capacitor. This is the ideal charge and decay of that risk never completing execution. Thus, the burden is on capacitor when given a square wave input power source. The voltage the programmer to implement—for all control flows— across the capacitor must be above v trig for program computation correct-sized tasks given the program and the expected to take place as that is the minimum energy required to store the largest possible checkpoint in the worst-case environmental and device operating environment. Note that even small changes conditions. The energy expended going from v on to v trig and in the program or operating environment can change from v trig to v off is wasted in what is called the guard band. dramatically the optimal task structure for a program. The guard band is essential for correctness in one-time checkpointing systems [39, 17, 3]. In real systems, the charge and discharge rate Our goal is to answer the question: What can be is chaotic, depending on many variables, including temperature and done without requiring hardware modifications or bur- device orientation. dening the programmer? To answer this question, we propose leveraging information available to the compiler ginning, in hopes of more abundant power next time. to preserve memory consistency without input from the While leveraging faster non-volatile memory technolo- programmer or specialized hardware. We draw upon the gies might seem like an easy way to avoid the problems wealth of research in fault tolerance [31, 24, 21, 51, 25, associated with these frequent power cycles, previous 13] and static analysis [7, 22] and construct Ratchet, work exposes the inconsistent states that can result from a compiler that is able to decompose unmodified pro- power cycles when using these technologies [26, 38]. grams into a series of re-executable sections, as shown Previous research attempts to address the problem of in Figure 2. Using static analysis, the compiler can intermittent computation using two broad approaches: separate code into idempotent sections—i.e., sequences rely on specialized hardware [39, 29, 3, 17, 27] or require of code that can be re-executed without entering a state the programmer to reason about the effects of common inconsistent with program semantics. The compiler case power failures on mixed-volatility systems [26]. identifies idempotent sections by looking for loads and Hardware-based solutions, rely on a single checkpoint stores to non-volatile memory and then enforcing that no that gets saved just before power runs out. It is critical section contains a write after read (WAR) to the same for correctness that every bit of work gets saved by the address. By decomposing programs down to a series checkpoint and that there is no (non-checkpointed) work of re-executable sections and gluing them together with after a checkpoint [38]. On the other hand, taking a checkpoints of volatile state, Ratchet supports existing, checkpoint too early wastes energy after the checkpoint arbitrary-length programs, no matter the power source. that could be used to perform meaningful work. This Ratchet shifts the burden of reasoning about the effects tradeoff mandates specialized hardware to measure avail- of intermittent computation and mixed-volatility away able power and predict when power is likely to fail. Due from the programmer to the compiler—without relying to the intermittent nature of harvested energy, predict- on hardware support. ing power failure is a risky venture that requires large We implement Ratchet as a set of modifications to the guard bands to accommodate a range of environmental LLVM compiler [22], targeting energy harvesting plat- conditions and hardware variances. Figure 1 depicts how forms that use the ARM architecture [2] and have wholly guard-bands waste energy that could otherwise be used non-volatile main memory. We also implement an ARM- to make forward progress. In addition to the guard-bands based energy-harvesting simulator that simulates power 18 12th USENIX Symposium on Operating Systems Design and Implementation USENIX Association failures at frequencies experienced by existing energy harvesting devices. To benchmark Ratchet,

Intermittent Computation Without Hardware Support Or Programmer

JTAG Programmer Overview for Hercules-Based Microcontrollers

Smart Battery Charging Programmer Software User Manual Smart Battery Charging Programmer Software User Manual

Intel Quartus Prime Pro Edition User Guide: Programmer Send Feedback

PHP Programmer Location: North Las Vegas NV, USA

Dissertation Submitted in Partial Fulﬁllment of the Requirements for The

PHP Programmer Analyst Markham, ON

The PHP Programmer's Guide to Secure Code

FUJITSU FLASH MCU Programmer for FR Specifications Version 1.9 1 September 2004 Software Version Number: V01L10 ©2002 FUJITSU LIMITED Printed in Japan

CSC 272 - Software II: Principles of Programming Languages

How to Become a Programmer Everything (Non-Technical) You Need to Know to Start Making Money Writing Code Rob Walling V1.0

AVR Programming Methods

PIC32 Flash Programming Specification