Protecting Bare-Metal Embedded Systems with Privilege Overlays

Abraham A. Clements∗, Naif Saleh Almakhdhub†, Khaled S. Saab‡, Prashast Srivastava†, Jinkyu Koo†, Saurabh Bagchi†, Mathias Payer†
∗Purdue University and Sandia National Laboratories, [email protected]
†Purdue University, fnalmakhd, srivas41, kooj, [email protected], [email protected]
‡Georgia Institute of Technology, [email protected]

Abstract—Embedded systems are ubiquitous in every aspect of modern life. As the Internet of Things expands, our dependence on these systems increases. Many of these interconnected systems are and will be low-cost bare-metal systems, executing without an operating system. Bare-metal systems rarely employ any security protection mechanisms, and their development assumptions (unrestricted access to all memory and instructions) and constraints (runtime, energy, and memory) make applying protections challenging.

To address these challenges we present EPOXY, an LLVM-based embedded compiler. We apply a novel technique, called privilege overlaying, wherein operations requiring privileged execution are identified and only these operations execute in privileged mode. This provides the foundation on which code-integrity, adapted control-flow hijacking defenses, and protections for sensitive IO are applied. We also design fine-grained randomization schemes that work within the constraints of bare-metal systems to provide further protection against control-flow and data corruption attacks.

These defenses prevent code injection attacks and ROP attacks from scaling across large sets of devices. We evaluate the performance of our combined defense mechanisms for a suite of 75 benchmarks and 3 real-world IoT applications. Our results for the application case studies show that EPOXY has, on average, a 1.8% increase in execution time and a 0.5% increase in energy usage.

I. INTRODUCTION

Embedded devices are ubiquitous. With more than 9 billion embedded processors in use today, the number of devices has surpassed the number of humans. With the rise of the “Internet of Things”, the number of embedded devices and their connectivity is exploding. These “things” include Amazon’s Dash button, utility smart meters, smart locks, and smart TVs. Many of these devices are low cost with software running directly on the hardware, known as “bare-metal systems”. In such systems, the application runs as privileged low-level software with direct access to the processor and peripherals, without going through intervening operating system software layers. These bare-metal systems satisfy strict runtime guarantees on extremely constrained hardware platforms with a few KBs of memory, a few MBs of Flash, and low CPU speed to minimize power and cost.

With increasing network connectivity, ensuring the security of these systems is critical [21, 51]. In 2016, hijacked smart devices like CCTV cameras and digital video recorders launched the largest distributed denial of service (DDoS) attack to date [39]. The criticality of security for embedded systems extends beyond smart things. Micro-controllers executing bare-metal software have been embedded so deeply into systems that their existence is often overlooked, e.g., in network cards [26], hard drive controllers [57], and SD memory cards [17]. We rely on these systems to provide secure and reliable computation, communication, and data storage. Yet, they are built with security paradigms that have been obsolete for several decades.

Embedded systems largely lack protection against code injection, control-flow hijack, and data corruption attacks. Desktop systems, as surveyed in [53], employ many defenses against these attacks such as: Data Execution Prevention (DEP), stack protections (e.g., stack canaries [22], separate return stacks [31], and SafeStack [40]), diversification [49, 41], ASLR, Control-Flow Integrity [9, 18], or Code-Pointer Integrity (CPI) [40]. Consequently, attacks on desktop-class systems became harder and often highly program dependent.

Achieving known security properties from desktop systems on embedded systems poses fundamental design challenges. First, a single program is responsible for hardware configuration, inputs, outputs, and application logic. Thus, the program must be allowed to access all hardware resources and to execute all instructions (e.g., configuring memory permissions). This causes a fundamental tension with best security practices, which require restricting access to some resources. Second, bare-metal systems have strict constraints on runtime, energy usage, and memory usage. This requires all protections to be lightweight across these dimensions. Third, embedded systems are purpose-built devices. As such, they have application-specific security needs. For example, an IO register on one system may unlock a lock while on a different system, it may control an LED used for debugging. Clearly the former is a security-sensitive operation while the latter is not. Such application-specific requirements should be supported in a manner that does not require the developer to make intrusive changes within her application code. Combined, these challenges have meant that security protections against code injection, control-flow hijack, and data corruption attacks are simply left out of bare-metal systems.

As an illustrative example, consider the application of DEP to bare-metal systems. DEP, which enforces W ⊕ X on all memory regions, is applied on desktops using a Memory Management Unit (MMU), which is not present on micro-controllers. However, many modern micro-controllers have a peripheral called the Memory Protection Unit (MPU) that can enforce read, write, and execute permissions on regions of the physical memory. At first glance, it may appear that DEP can be achieved in a straightforward manner through the use of the MPU. Unfortunately, we find that this is not the case: the MPU protection can be easily disabled, because there is no isolation of privileges. Thus, a vulnerability anywhere in the program can write the MPU’s control register to disable it. The struggles existing embedded OSs have in using an MPU for security protection, even for well-known protections such as DEP, testify to the challenges of using it correctly. FreeRTOS [1], a popular operating system for low-end micro-controllers, leaves its stacks and RAM writable and executable. By FreeRTOS’s own admission, the MPU port is seldom used and is not well maintained [3]. This was evidenced by multiple releases in 2016 where MPU support did not even compile [8, 2].

To address all of these challenges, we developed EPOXY (Embedded Privilege Overlay on X hardware with Y software), a compiler that brings both generic and system-specific protections to bare-metal applications. This compiler adds additional passes to a traditional LLVM cross-compilation flow, as shown in Figure 1. These passes add protection against code injection, control-flow hijack and data corruption attacks, and direct manipulation of IO. Central to our design is a lightweight privilege overlay, which solves the dichotomy of allowing the program developer to assume access to all instructions and memory but restrict access at runtime. To do this, EPOXY reduces execution privileges of the entire application. Then, using static analysis, only instructions requiring elevated privileges are added to the privilege overlay to enable privileges just prior to their execution. EPOXY draws its inputs from a security configuration file, thus decoupling the implementation of security decisions from application design and achieving all the security protections without any application code modification. Combined, these protections provide application-specific security for bare-metal systems that are essential on modern [...]

Fig. 1. The compilation work flow for an application using EPOXY. Our modifications are shown in shaded regions. [Diagram labels: App Src, HAL Src, Stdlib Src, Clang, GCC, LLVM Bitcode, Stdlib, LLVM Linker Plugin, Passes (SafeStack, Diversification, Privilege Overlaying), GNU Linker, Linker Script, Options, Backend, Bin.]

[...] EPOXY on 75 benchmark applications and three representative IoT applications that each stress different sub-systems. Our performance results for execution time, power usage, and memory usage show that our techniques work within the constraints of bare-metal applications. Overheads for the benchmarks average 1.6% for runtime and 1.1% for energy. For the IoT applications, the average overhead is 1.8% for runtime, and 0.5% for energy. We evaluate the effectiveness of our diversification techniques, using a Return Oriented Programming (ROP) compiler [52] that finds ROP-based exploits. For our three IoT applications, using 1,000 different binaries of each, no gadget survives across more than 107 binaries. This implies that an adversary cannot reverse engineer a single binary and create a ROP chain with a single gadget that scales beyond a small fraction of devices.

In summary, this work: (1) identifies the essential components needed to apply proven security techniques to bare-metal systems; (2) implements them as a transparent runtime privilege overlay, without modifying existing source code; (3) provides state-of-the-art protections (stack protections and diversification of code and data regions) for bare-metal systems within the strict requirements of run-time, memory size, and power usage; (4) demonstrates that these techniques are effective from a security standpoint on bare-metal systems. Simply put, EPOXY brings bare-metal
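
The privilege-overlay idea described in the introduction (reduce privileges for the whole application, then elevate only around the operations that need them) can be illustrated with a small host-side model. This is a minimal sketch under stated assumptions, not EPOXY's implementation: the class SimulatedCore, its methods, and the single "MPU enable" flag are hypothetical names invented for illustration, and a real system would perform the elevation in hardware rather than with a Python context manager.

```python
from contextlib import contextmanager


class PrivilegeFault(Exception):
    """Raised when unprivileged code touches a privileged resource."""


class SimulatedCore:
    """Toy model of a core with one privilege bit and an MPU enable flag.

    All names here are hypothetical; this is not a real hardware API.
    """

    def __init__(self):
        self.privileged = True    # bare-metal code normally starts privileged
        self.mpu_enabled = False

    def write_mpu_enable(self, value):
        # Models a privileged-only register: unprivileged writes fault.
        if not self.privileged:
            raise PrivilegeFault("unprivileged write to MPU control")
        self.mpu_enabled = value

    def drop_privileges(self):
        self.privileged = False

    @contextmanager
    def privilege_overlay(self):
        """Elevate for one identified operation, then restore the old level."""
        previous = self.privileged
        self.privileged = True
        try:
            yield
        finally:
            self.privileged = previous


def run_demo():
    core = SimulatedCore()
    core.write_mpu_enable(True)   # startup configuration, still privileged
    core.drop_privileges()        # the rest of the application is unprivileged

    # A write from ordinary (possibly compromised) code now faults.
    try:
        core.write_mpu_enable(False)
        blocked = False
    except PrivilegeFault:
        blocked = True

    # An operation placed in the overlay is elevated only for its duration.
    with core.privilege_overlay():
        core.write_mpu_enable(True)

    return blocked, core.mpu_enabled, core.privileged
```

In this model, the attempt to disable the MPU from unprivileged code is rejected, while the one operation wrapped in the overlay succeeds and the core returns to unprivileged execution afterwards. The sketch only captures the control-flow shape of the idea; which operations belong in the overlay is, per the text, decided by static analysis in the compiler.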
