SIN2: Stealth Infection on Neural Network – A Low-cost Agile Neural Trojan Attack Methodology
Tao Liu∗, Wujie Wen∗ and Yier Jin†
∗Florida International University, †University of Florida
∗{tliu023, wwen}@fiu.edu, †[email protected]fl.edu

Abstract—Deep Neural Network (DNN) has recently become the "de facto" technique to drive the artificial intelligence (AI) industry. However, many security issues also emerge as DNN-based intelligent systems become increasingly prevalent. Existing DNN security studies, such as adversarial attacks and poisoning attacks, are usually narrowly conducted at the software algorithm level, with misclassification as their primary goal. The more realistic system-level attacks introduced by the emerging intelligent service supply chain, e.g. third-party cloud-based machine learning as a service (MLaaS) along with the portable DNN computing engine, have never been discussed. In this work, we propose a low-cost modular methodology–Stealth Infection on Neural Network, namely "SIN2", to demonstrate the novel and practical intelligent supply chain triggered neural Trojan attacks. Our "SIN2" leverages the attacking opportunities built upon both the static neural network model and the underlying dynamic runtime system of the neural computing framework through a set of neural Trojaning techniques. We implement a variety of neural Trojan attacks in a Linux sandbox by following the proposed "SIN2". Experimental results show that our modular design can rapidly produce and trigger various Trojan attacks that can easily evade existing defenses.

I. INTRODUCTION

Deep Neural Network (DNN) is now infiltrating a wide range of real-world applications like image recognition, natural language processing, self-driving cars, etc. [1]. However, the ever-increasing computation and storage requirements of state-of-the-art DNN models significantly challenge the accessibility of DNN-based intelligent services on many resource-constrained platforms such as smart devices and drones. For instance, the classification of the cutting-edge ResNet [2] involves excessive memory accesses and computations over ∼150-million parameters within 152 neural layers, while its training typically takes many weeks even on expensive multi-GPU clusters. Such a concern has motivated tremendous investments to explore affordable intelligent computing systems and business models. The emerging machine learning as a service (MLaaS) offers off-the-shelf intelligence services through cloud-based neural computing frameworks [3]. Meanwhile, many tiny hardware-accelerated deep learning systems (DLS), e.g. the Intel Movidius Neural Compute Stick (NCS) [4], have been emerging as mature consumer electronic products, drastically lowering the entering barrier and incubating the supply chain of intelligent services: commercial neural models can be trained by neural intellectual property (NIP) vendors or individuals, and then distributed and eventually consumed by end users through their leased or purchased products.

However, such a new business model also brings ever-increasing security concerns. Recent studies show that attackers can easily mislead the decisions of a normally trained DNN model by exploiting specific vulnerabilities of classifiers through carefully manipulated inputs (adversarial attacks) [5]. Meanwhile, DNN models can also be contaminated at the training stage, leading to undesirable inference results at the testing stage (poisoning attacks) [6]. However, these algorithmic attacks place "misclassification" as their primary adversarial goal, which apparently neglects the back-doors inside neural computing frameworks.

In this work, we target the DNN attacking problem from a specific perspective on the intelligent supply chain – untrusted individuals or NIP vendors may stealthily infect legitimate neural network models with malicious payloads to conduct practical neural Trojan attacks on the end user side without harming the quality of intelligent services. Our major contributions can be summarized as follows: 1) We develop a low-cost Trojan insertion framework, namely "SIN2", to facilitate the novel and practical intelligent supply chain triggered neural Trojan attack; 2) We propose a bundle of Trojan insertion techniques for agile and practical Trojan attacks; 3) We validate our proposed SIN2 methodology with realistic malwares and security engines and demonstrate a variety of neural Trojan attacks in a prototype Linux sandbox. Experimental results show that our modular design can bypass defensive detection and precisely trigger the neural Trojan within the intelligent system.

This work is supported by the 2016-2017 Collaborative Seed Award Program of Florida Center for Cybersecurity (FC2).
II. BACKGROUND

DNN introduces multiple layers with complex structures to model a high-level abstraction of the data [7], and exhibits high effectiveness in cognitive applications by leveraging the deep cascaded structure [2]. The convolutional layer extracts sufficient feature maps from the inputs by applying kernel-based convolutions. The pooling layer performs a downsampling operation (through max or mean pooling) along the spatial dimensions for a volume reduction, and the fully-connected layer further computes the class scores based on the final weighted results and the non-linear activation functions.

Fig. 1 depicts an overview of a commercial intelligent system, which incorporates two integrated components: the neural network model and the neural computing framework. As shown in Fig. 1(a), the neural network model consists of a comprehensive layer topology and the associated parameters (or weights) in each layer. Fig. 1(b) presents the generalized architecture of the neural computing framework, including the runtime system and the computing substrate. We define the runtime system as a middleware that can drive various computing substrates (i.e. CPU, GPU, FPGA, ASIC, etc.) to conduct neural processing (i.e. Convolution, Activation, Pooling, Softmax, etc.) through an integrated application programming interface (API).
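For concreteness, the following is a minimal numpy sketch of the cascaded layer computation described above (convolution, activation, pooling, fully-connected layer and softmax); the input size, kernel and class count are illustrative assumptions rather than the model used in this paper.

import numpy as np

def conv2d(x, k):
    # valid 2-D cross-correlation of a single-channel input with one kernel
    h, w = x.shape[0] - k.shape[0] + 1, x.shape[1] - k.shape[1] + 1
    return np.array([[np.sum(x[i:i + k.shape[0], j:j + k.shape[1]] * k)
                      for j in range(w)] for i in range(h)])

def max_pool(x, s=2):
    # non-overlapping s-by-s max pooling along the spatial dimensions
    h, w = x.shape[0] // s, x.shape[1] // s
    return x[:h * s, :w * s].reshape(h, s, w, s).max(axis=(1, 3))

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

x = np.random.rand(8, 8)                                  # toy single-channel input
fmap = np.maximum(conv2d(x, np.random.rand(3, 3)), 0)     # convolution + ReLU activation
flat = max_pool(fmap).flatten()                           # pooling, then flatten
W, b = np.random.rand(3, flat.size), np.random.rand(3)    # fully-connected layer (3 classes)
print(softmax(W @ flat + b))                              # probability-based classification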

[Fig. 1: The architecture of commercial intelligent system. (a) Neural Network Model – the deep cascaded layer structure (Convolution 1, Pooling 1, Convolution 2, Pooling 2, Flatten, Fully Connected, Softmax) and probability-based classification (e.g. Lotus 27.5%, Lilium 56.25%, Narcissus 16.25%). (b) Neural Computing Framework – computing substrate (GPU, FPGA, ASIC; MLaaS, DLS) and runtime system with built-in neural processing functions (Convolution, Pooling, Activation, Softmax) for system integration and runtime support.]

Computing substrates are dedicated hardware that can execute and accelerate DNN computation. For example, Jouppi et al. [8] demonstrate the Tensor Processing Unit (TPU), which is now integrated with Google Cloud to provide on-demand MLaaS [9]. Barry et al. [10] present the Vision Processing Unit (VPU) housed inside the Intel Movidius NCS [4], which runs real-time neural processing directly from the USB device.

III. SIN2 METHODOLOGY

A. Attack Model

We assume the attacker (untrusted individuals or NIP vendors) can provide neural network computing services to the victim (end users). To lower the cost, the victim will directly consume the services by leasing or purchasing the commercial intelligent system, which includes the neural computing framework (computing substrate and runtime system) and the pre-trained neural network model. As an intelligent service provider, the attacker possesses full knowledge of the runtime system and the neural network model. The purpose of the neural Trojan attack is to threaten the victim by performing more diversified malicious payloads on top of the original misclassification in such an intelligent service. The proposed approach creates the neural Trojan by exploiting the vulnerabilities of the specific architecture of the intelligent system.

Attack Vector. We design two types of attack vectors to facilitate neural Trojan activation and payload extraction: 1) through the legitimate input (e.g. a normal image) provided by the victim; 2) through the illegitimate input (e.g. a special pattern) selected by the attacker during follow-up service on the pretext of "model upgrading" or "troubleshooting".

B. The Overview of SIN2 Methodology

The attacker first trains a clean neural network model to fulfill the intelligent service required by the victim. The binary codes of malicious payloads are then injected into the original neural network model by replacing the LSBs of carefully selected weight parameters through the proposed embedding technique. Meanwhile, the attacker places awaken functions such as triggering and extracting into the regular neural processing (e.g. convolution, pooling, softmax, etc.). These functions will later be used to "awaken" the neural Trojan once the selected inputs are involved in the regular neural processing. The neural Trojan, i.e. the infected neural network model and runtime system, is delivered to the victim through the intelligent supply chain, disguised as a normal service. The victim will directly consume the service (e.g. performing an image recognition task) with legitimate inputs without compromising the user experience (i.e. the expected classification accuracy). Once the neural Trojan has been triggered, malicious payloads are extracted from the infected neural network model and executed through the runtime system on the victim side.
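As a minimal illustration of why the LSB replacement above need not compromise the user experience, the sketch below (the weight value and the 2-bit width are illustrative assumptions, not values from the paper) overwrites the two least significant bits of one float32 weight and shows that the numerical perturbation is negligible.

import struct

def set_lsbs(weight, bits, width):
    # reinterpret the float32 weight as a 32-bit integer and overwrite its low-order bits
    raw = struct.unpack("<I", struct.pack("<f", weight))[0]
    raw = (raw & ~((1 << width) - 1)) | (bits & ((1 << width) - 1))
    return struct.unpack("<f", struct.pack("<I", raw))[0]

w = 0.04217                       # an arbitrary weight value
print(w, set_lsbs(w, 0b10, 2))    # the infected weight differs only in the lowest mantissa bits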
IV. ATTACK DESIGN

A. The Objective of Attack Design

A successful attack should satisfy the following constraints. Confidentiality: the neural Trojan should not impact the quality of intelligent services and should evade existing security detections on the victim side, since DNN parameter manipulation during payload embedding can easily degrade the classification accuracy of an intelligent system. Integrity: the extracted payloads must be structurally executable when the attacker activates the neural Trojan; securing the integrity of the payloads during embedding and extracting is therefore essential to exert the attack. Efficiency: a practical neural Trojan attack should be very efficient; ideal Trojan insertion methods should maximize the efficiency of payload embedding, e.g. more payloads but a smaller number of modified parameters for a DNN model. Following these unique design constraints, we present the technical details of our SIN2 methodology.

B. The Embedding and Extracting Technique

Embedding (extracting) the binary payloads into (from) the neural network model is analogous to digital steganography [11]. However, steganography cannot be directly applied to payload embedding in a neural network model, because improper binary data injection into, or replacement of, DNN parameters can easily degrade the quality of service (e.g. lower the classification accuracy) and thus harm the attack confidentiality. Therefore, we first explore the redundancy space, i.e. the largest capacity of removable bits in the weight parameters without accuracy degradation, for several mainstream DNN models [2], [12]–[14]. As shown in Fig. 2, though the bit widths required to achieve the full accuracy of each individual model are quite different, all models eventually reach the upper bound of inference accuracy at a much smaller bit width (∼16-bit) instead of the original 32-bit, thus ensuring the user experience.

[Fig. 2: Inference accuracy affected by different fix-point bit width (AlexNet, GoogLeNet, VGG-16, ResNet-50; accuracy normalized to the 76% TOP-1 accuracy of ResNet-50 on ImageNet; fix-point bit widths from 1 to 14).]

Algorithm 1 further presents the details of the proposed embedding (extracting) technique. We use the index "layer-parameter-[start bit-end bit]" to indicate an embedded binary block in the neural network model. Note that payload extracting follows the reversed operation of the embedding process.

Algorithm 1: Payloads embedding and extracting
  // w: substitution bit width
  // e: end index (number of involved parameters)
  // s: splitter for multiple payloads if applicable
  // P: binary payloads
  // F: binary neural network model; FL-M-[a-b] denotes bits a–b of parameter M in layer L
  Embedding(w, P, F):
    e ← sizeof(P)/w                                        // calculate the number of involved parameters
    F1-1-[0-7] ← (w)b                                      // store configurations: the bit width ...
    F1-2-[0-15] ← (e)b[16-31]; F1-3-[0-15] ← (e)b[0-15]    // ... and the end index (high/low halves)
    L ← 1; M ← 4; W ← w − 1                                // layer, parameter and bit-range indices
    for i = 1; i ≤ e; i++ do
      FL-M-[0-W] ← P[(w×(i−1)) - (w×i−1)]                  // bit-wise substitution of payload bits
      M = length(FL) ? L++; M ← 1 : M++                    // layer-wise: advance to the next layer when full
  // Extracting follows the reversed operation of Embedding.
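The following is a minimal, self-contained Python sketch of the same bit-substitution idea as Algorithm 1, assuming a flat float32 parameter array, a fixed start index and a benign stand-in payload; it does not reproduce the paper's exact parameter indexing, and the helper names are hypothetical.

import numpy as np

def embed(params, payload, w, start):
    # substitute the w least significant bits of consecutive parameters with payload bits
    bits = ''.join(format(b, '08b') for b in payload)
    bits += '0' * (-len(bits) % w)                        # pad to a multiple of w
    raw = params.view(np.uint32).copy()
    for i in range(0, len(bits), w):
        j = start + i // w
        raw[j] = (raw[j] & ~np.uint32((1 << w) - 1)) | np.uint32(int(bits[i:i + w], 2))
    return raw.view(np.float32), len(bits) // w           # infected parameters, #parameters used

def extract(params, w, start, count, nbytes):
    # reversed operation: read back the low-order bits and reassemble the payload bytes
    raw = params.view(np.uint32)
    bits = ''.join(format(int(raw[start + k]) & ((1 << w) - 1), f'0{w}b') for k in range(count))
    return bytes(int(bits[i:i + 8], 2) for i in range(0, nbytes * 8, 8))

weights = np.random.rand(64).astype(np.float32)           # stand-in for one layer's parameters
payload = b'PAYLOAD'                                      # harmless stand-in for the binary payload
infected, used = embed(weights, payload, w=2, start=4)
assert extract(infected, w=2, start=4, count=used, nbytes=len(payload)) == payload
print(np.max(np.abs(infected - weights)))                 # the perturbation stays negligible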
C. Triggering and Executing Through Neural Processing

Once the malicious data package has penetrated into the static DNN model, our next step is to activate the neural Trojan by leveraging the attack vector and the runtime system. Fig. 3 shows the detailed procedures of triggering and executing.

[Fig. 3: Illustration of the triggering function based on activations in neural processing – the pre-activation outputs {net_j} of the fully connected output layer (l), produced by a legitimate or illegitimate input, are compared against the recorded values {rec_j} held in the runtime system; on a key-lock match the payloads are extracted and passed through the API to the runtime system/OS.]

To build the trigger event, the selected key input "A" is sent to the infected neural network model, the associated {net_j}^key ∈ R in the output layer are recorded as the lock "B", i.e. {rec_j}^lock (= {net_j}^key), and stored in the runtime system. During inference, once the key-lock pairing(s) is matched, the trigger will extract the embedded payloads from the weight parameters and execute them through the built-in API in the runtime system.
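A minimal sketch of the key-lock triggering idea in Fig. 3, assuming a single fully-connected output layer: the pre-softmax activations produced by the attacker's key input are recorded as the lock, and the wrapped softmax fires when the same activations reappear at inference. The layer, key pattern and tolerance are illustrative assumptions, and payload extraction/execution is deliberately omitted.

import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((10, 784)).astype(np.float32)    # stand-in output layer (layer l)
key = rng.random(784, dtype=np.float32)                  # attacker-selected "illegitimate" key input
lock = W @ key                                           # {rec_j}: recorded and stored in the runtime system

def softmax_with_trigger(net):
    # wrapper placed around the regular Softmax processing function
    if np.allclose(net, lock, atol=1e-5):                # key-lock pairing matched
        print('trigger: extract the embedded payloads')  # extraction would start here (omitted)
    e = np.exp(net - net.max())
    return e / e.sum()

softmax_with_trigger(W @ rng.random(784, dtype=np.float32))  # legitimate input: no trigger
softmax_with_trigger(W @ key)                                # key input: the trigger fires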

V. SECURITY ANALYSIS AND DEMONSTRATION

Real-world malwares and anti-malware engines are selected to validate the proposed neural Trojan insertion techniques. The uncovered malware candidates are embedded into the DNN model individually by following our proposed embedding algorithm. We then submit both the uncovered and the embedded malware samples to multiple security engines [15] for malware detection. The uncovered one represents the malware in its original format.

Confidentiality, Integrity and Efficiency. As shown in TABLE I, all uncovered samples have been successfully detected at different rates, e.g. 7.5%∼90%, by 40 different mainstream security engines. In particular, the uncovered sample "ZeusVM" demonstrates the lowest detection rate (7.5%) among all uncovered candidates, due to its obfuscated structure: its malicious code has been concealed in an image through traditional steganography [16]. However, our embedded malwares easily bypass all the detections regardless of existing defense mechanisms like signature-based and heuristic detections [17].

TABLE I: Anti-malware detection on selected malware samples (host model: input-512-256-128-10, model size 2213KB, dataset Fashion-MNIST).
Malware Sample | Asprox | Bladabindi | Destover | Dropper | Kovter | Nsis | Stuxnet | ZeusVM | ZeusVM-decrypted
Size | 91KB | 105KB | 90KB | 1601KB | 422KB | 1746KB | 25KB | 54KB | 405KB
Detection Rate (%) – Uncovered | 72.5 | 75 | 77.5 | 52.5 | 67.5 | 65 | 87.5 | 7.5 | 90
Detection Rate (%) – Embedded | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0

Besides, as shown in TABLE II, the integrity of the payloads is well preserved through our proposed embedding and extracting algorithms, thus maintaining their executable structures. Our embedding technique is also more efficient than the existing training-based watermarking approach [6], i.e. ∼5s per payload vs. ∼2500s per digit watermark (see TABLE II). The reason is that our method can support any new payload (or new malware) embedding simply based on the same parameters trained from the clean neural network model, while the watermarking approach relies on new model parameters obtained from additional trainings whenever there is a new digit watermark to be embedded.

TABLE II: Validations on integrity, efficiency and triggering.
Integrity of Payloads – Sample | Uncovered MD5 | Extracted MD5
Asprox | D062D420E2AC73B0211AFE30063807FA | D062D420E2AC73B0211AFE30063807FA
Bladabindi | 5A559B6D223C79F3736DC52794636CFD | 5A559B6D223C79F3736DC52794636CFD
Destover | E904BF93403C0FB08B9683A9E858C73E | E904BF93403C0FB08B9683A9E858C73E
··· | ··· | ···
Efficiency – Payload Binary: 0110 0110 0010 1000 0010 1001 0111 1011 0010 0000 0110 0110 0111 1100 0110 0110 0010 0110 0111 1101 0011 1011 0110 0110
Time Consumed | Embedding through Bit Substitution: ∼5s per payload | Watermarking through Training [6]: ∼2500s per watermark

Demonstration. We implement a proof-of-concept neural Trojan attack prototyped in a Linux sandbox by following the SIN2 methodology. TABLE III shows the detailed setting of our simulation environment. The neural computing framework "Torch" [18] is adopted to provide the visual recognition services through the infected neural model. A "fork bomb" [19], i.e. a self-replicating malware that drains the process resources, is selected as the neural Trojan payload for conducting a DoS attack. As shown in TABLE III, 48 parameters in the neural network model are selected to embed the 96-bit payload binary code, i.e. from "1-4-[0-1]" to "1-51-[0-1]", with a 2-bit substitution bit width. The specific "illegitimate" input in Fig. 3 is selected to generate the Key-Lock pairing for triggering the neural Trojan.

TABLE III: Simulation environment and configurations.
Simulation Environment – Sandbox: Intel(R) Core(TM) i7-6850K (using 6 of 12 cores), Ubuntu Server 16.04.3 LTS | Runtime System: Torch [18] | API: LuaJIT (C/CUDA backbone) | Dataset: Fashion-MNIST (70K × 32 × 32 samples)
SIN2 Configuration – substitution bit width: 2-bit (parameter 1-1-[0-5]: 0 0010) | start: parameter 1-4-[0-1] | end: parameter 1-51-[0-1] | parameter 1-2-[0-15]: 0000 0000 | parameter 1-3-[0-15]: 0011 0000

As Fig. 4 shows, the infected neural network model has been loaded in "Torch" for the inference service. Two iterations are performed in our demonstration – the former batch only consists of normal inputs while the latter one includes the key. At the second batch, the neural Trojan is triggered by the key, i.e. the 101st input, during the last Softmax through our placed triggering function. The embedded "fork bomb" is then extracted and reconstructed from the infected model through the extracting algorithm, and eventually executed in the runtime system to "freeze" the intelligent service. Meanwhile, no more sessions can be established due to the exhausted CPU resources (only half of the cores are assigned to the sandbox), leading to a successful DoS attack.

[Fig. 4: Demonstration on neural Trojan DoS attack – the intelligent application session and other sessions before and after the neural Trojan has been triggered.]
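A minimal sketch of the integrity check behind TABLE II, assuming the uncovered and extracted payloads are available as files (the file names are hypothetical placeholders): the two MD5 digests must match for the payload to remain structurally executable.

import hashlib

def md5_of(path):
    with open(path, "rb") as f:
        return hashlib.md5(f.read()).hexdigest()

assert md5_of("sample_uncovered.bin") == md5_of("sample_extracted.bin")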
VI. CONCLUSION AND FUTURE WORK

As the fast-growing machine learning industry is subject to ever-increasing security challenges, we for the first time discover the vulnerabilities and potential threats introduced by the emerging intelligent supply chain. In this work, a low-cost modular methodology, namely "SIN2", is proposed to facilitate a novel and practical neural Trojan attack without compromising the quality of intelligent services. Based on a synthetic design built upon the neural network model and the neural processing paradigm, malicious payloads can be safely distributed through the provided intelligent services and precisely activated on target while satisfying confidentiality, integrity and efficiency, indicating more flexible and practical attacking strategies. In our future work, we will continue the research on diversified neural Trojan attacks, e.g. hardware-based neural Trojan attacks. Countermeasure techniques such as "enhanced heuristic detection", "neural model morphism" and "neural behavior monitoring" will be explored to mitigate various emerging neural Trojan attacks.

REFERENCES

[1] C. Szegedy, "An overview of deep learning," AITP 2016, 2016.
[2] K. He et al., "Deep residual learning for image recognition," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 770–778.
[3] Amazon, "Amazon machine learning," https://aws.amazon.com/machine-learning/.
[4] Intel, "Movidius neural compute stick," https://newsroom.intel.com/news/intel-democratizes-deep-learning-application-development-launch-movidius-neural-compute-stick/.
[5] I. J. Goodfellow et al., "Explaining and harnessing adversarial examples," arXiv preprint arXiv:1412.6572, 2014.
[6] Y. Uchida et al., "Embedding watermarks into deep neural networks," in Proceedings of the 2017 ACM International Conference on Multimedia Retrieval. ACM, 2017, pp. 269–277.
[7] G. E. Hinton et al., "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504–507, 2006.
[8] N. P. Jouppi et al., "In-datacenter performance analysis of a tensor processing unit," arXiv preprint arXiv:1704.04760, 2017.
[9] M. Abadi et al., "Tensorflow: Large-scale machine learning on heterogeneous distributed systems," arXiv preprint arXiv:1603.04467, 2016.
[10] B. Barry et al., "Always-on vision processing unit for mobile applications," IEEE Micro, vol. 35, no. 2, pp. 56–66, 2015.
[11] S. Katzenbeisser et al., Information Hiding Techniques for Steganography and Digital Watermarking. Artech House, 2000.
[12] A. Krizhevsky et al., "Imagenet classification with deep convolutional neural networks," in Advances in Neural Information Processing Systems, 2012, pp. 1097–1105.
[13] C. Szegedy et al., "Going deeper with convolutions," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp. 1–9.
[14] K. Simonyan et al., "Very deep convolutional networks for large-scale image recognition," arXiv preprint arXiv:1409.1556, 2014.
[15] Metadefender, "Multiple security engines," http://www.metadefender.com/#!/scan-file/.
[16] XyliBox, "Zeusvm and steganography," http://www.xylibox.com/2014/04/zeusvm-and-steganography.html/.
[17] N. Idika et al., "A survey of malware detection techniques," Purdue University, vol. 48, 2007.
[18] R. Collobert et al., "Torch7: A matlab-like environment for machine learning," in BigLearn, NIPS Workshop, no. EPFL-CONF-192376, 2011.
[19] MalwareWiki, "Fork bomb," http://malware.wikia.com/wiki/Fork_Bomb/.