What Is Reinforcement Learning?

Deep Learning and Reinforcement Learning Workflows in AI
Valerie Leung
© 2015 The MathWorks, Inc.

Why MATLAB for Artificial Intelligence?

Artificial Intelligence
Development of computer programs to perform tasks that normally require human intelligence

AI Applications
▪ Object Classification
▪ Speech Recognition
▪ Predictive Maintenance
▪ Signal Classification
▪ Automated Driving
▪ Stock Market Prediction

Machine Learning and Deep Learning
▪ Machine Learning
– Unsupervised Learning [No Labeled Data]: Clustering
– Supervised Learning [Labeled Data]: Classification, Regression
– Machine learning typically involves feature extraction
▪ Deep Learning
– Deep learning typically does not involve feature extraction

Deep Learning Uses a Neural Network Architecture
Input Layer, Hidden Layers (n), Output Layer

Deep Learning Datatypes
Image, Signal, Text, Numeric

Deep Learning Workflow
▪ Prepare Data
– Data access and preprocessing
– Ground truth labeling
▪ Train Model
– Model design, hyperparameter tuning
– Model exchange across frameworks
– Hardware-accelerated training
▪ Deploy
– Multiplatform code generation (CPU, GPU)
– Edge deployment
– Enterprise deployment

Why MATLAB for AI Tasks?
▪ Increased productivity with interactive tools
▪ Generate simulation data for complex models and systems
▪ Ease of deployment and scaling to various platforms
▪ Full AI workflows that cannot be easily replicated by other toolchains

Why MATLAB for AI Tasks?
▪ Increased productivity with interactive tools: Labeling, Training, Model Exchange
▪ Full AI workflows that cannot be easily replicated by other toolchains

Labeling for deep learning is repetitive, tedious, and time-consuming… but necessary

Partially automate ground truth labeling with Apps
App for ground truth labeling dedicated to automotive applications

Applications developed using labeled data

User Story – Veoneer (Autoliv)
▪ Automotive:
– Software and hardware for active safety, autonomous driving, occupant protection, and brake control
▪ Application:
– Build radar sensor
– Check accuracy using LiDAR-based verification
▪ Used MATLAB to semi-automate labeling and tracking of 3D LiDAR point clouds

Manual labeling of 25 events took over 20 minutes. After automation with MATLAB tools, it took 5 minutes.

Design deep networks interactively
▪ Design network with Deep Network Designer
▪ Check for errors with Network Analyzer

Transfer Learning with Pre-trained Models
SqueezeNet, MobileNetV2, AlexNet, Xception, GoogLeNet, ResNet-18, VGG-16, Inception-v3, ResNet-50, Inception-ResNet-v2, VGG-19, DenseNet-201, ResNet-101

Import & Export Models Between Frameworks
▪ TensorFlow-Keras Model Importer
▪ Caffe Model Importer
▪ ONNX Model Converter

Model Exchange with MATLAB
Frameworks shown: PyTorch, TensorFlow-Keras, Caffe2, MXNet, ONNX, MATLAB, Core ML, Caffe, CNTK, (…)
ONNX = Open Neural Network Exchange

Why MATLAB for AI Tasks?
▪ Generate simulation data for complex models and systems: Reinforcement Learning
▪ Full AI workflows that cannot be easily replicated by other toolchains

Reinforcement Learning vs Machine Learning vs Deep Learning
▪ Machine Learning
– Unsupervised Learning [No Labeled Data]: Clustering
– Supervised Learning [Labeled Data]: Classification, Regression
– Reinforcement Learning [Interaction Data]: Decision Making, Controls
▪ Deep Learning
Reinforcement learning learns through trial and error, i.e. through interaction. It's about learning a behavior or accomplishing a task.

What is Reinforcement Learning?
Reinforcement learning is a type of machine learning that trains an agent through repeated trial-and-error interactions with an environment, using a reward system to maximize success.

A Practical Example of Reinforcement Learning
Training a Self-Driving Car
Diagram labels: AGENT, ENVIRONMENT, OBSERVATION, ACTION, REWARD
► Vehicle's computer learns how to drive [agent]
► using sensor readings from LIDAR, cameras [observation]
► that represent road conditions, vehicle position [environment]
► by generating steering, braking, throttle commands [action]
► to avoid collisions and lane deviation [reward].
The goal of reinforcement learning is for the agent to find an optimal algorithm for performing a task.

Deep Networks are commonly found in the agent, because they can model complex problems.
AGENT
• Turn left
• Turn right
• Brake
• Accelerate

Reinforcement Learning Workflow
▪ Generate Data
– Scenario Design
– Simulation-based data generation
▪ Train Model
– Reinforcement learning
– Training agent to perform task
– Developing reward system to optimize performance
▪ Deployment
– Multiplatform code generation (CPU, GPU)
– Edge deployment
– Enterprise deployment
Simulink – generate data for dynamic systems (planes, cars, robots, etc.)
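
The agent, observation, action, and reward loop described in the self-driving example can be sketched in a few lines of code. The following is a minimal illustration only, not the method used in the presentation: it assumes a hypothetical toy road with five lateral positions, a deterministic environment, and a lookup table (tabular Q-learning) standing in for the deep network that the slides place inside the agent. All names (`step`, `train`, `greedy_policy`) and all parameter values are assumptions made for this sketch.

```python
import random

# Hypothetical toy road: lateral positions 0..4, lane center at 2.
# Positions 0 and 4 are off the road and count as a collision.
N_POSITIONS = 5
CENTER = 2
ACTIONS = (-1, 0, 1)  # steer left, hold, steer right


def step(position, action):
    """Environment: apply a steering action, return (observation, reward, done)."""
    next_position = max(0, min(N_POSITIONS - 1, position + action))
    if next_position in (0, N_POSITIONS - 1):
        return next_position, -10.0, True  # collision ends the episode
    reward = 1.0 if next_position == CENTER else -1.0  # penalize lane deviation
    return next_position, reward, False


def train(episodes=2000, max_steps=20, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Agent: tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_POSITIONS) for a in ACTIONS}
    for _ in range(episodes):
        state = rng.choice((1, 2, 3))  # start somewhere on the road
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore (trial and error)
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])  # exploit
            next_state, reward, done = step(state, action)
            # Value of the best follow-up action; zero after a collision.
            best_next = 0.0 if done else max(q[(next_state, a)] for a in ACTIONS)
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Best learned action for each on-road position."""
    return {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in (1, 2, 3)}


if __name__ == "__main__":
    print(greedy_policy(train()))
```

With these settings the learned policy should steer toward the lane center from either side and hold course once centered. A table works here only because the toy problem has three on-road states; for observations such as camera or LIDAR data, the table would be replaced by a function approximator such as a deep network.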

Scaling up deep learning in parallel and in the cloud
▪ Hardware: Multicore CPU, GPU, Cloud
▪ Products: MATLAB + Parallel Computing Toolbox, MATLAB Parallel Server
Run thousands of simulations in parallel with MATLAB Parallel Server to save hours of training time.

MATLAB and Simulink for Reinforcement Learning
▪ Reinforcement learning is a dynamic process
▪ MATLAB and Simulink virtual models allow you to simulate conditions that are difficult or dangerous to emulate in the real world
▪ Suitable for:
– Control-based problems, e.g. automated driving (lane keep assist, adaptive cruise control), robotics, etc.
– Decision-making problems, e.g. financial trading, games, etc.

Why MATLAB for AI Tasks?
▪ Ease of deployment and scaling to various platforms: Code Generation, Embedded Devices, Enterprise Systems
▪ Full AI workflows that cannot be easily replicated by other toolchains

Deployment and Scaling for AI
Embedded Devices, Enterprise Systems

Deploying Deep Learning Models for Inference
Deep Learning Networks, deployed with:
▪ Intel MKL-DNN Library
▪ ARM Compute Library
▪ NVIDIA TensorRT & cuDNN Libraries

Benchmark of GPU Coder
▪ Compared: TensorFlow, MXNet, GPU Coder, PyTorch
▪ Test setup: Intel® Xeon® CPU 3.6 GHz
– NVIDIA libraries: CUDA 10, cuDNN 7
– Frameworks: TensorFlow 1.13.0, MXNet 1.4.0, PyTorch 1.0.0

Musashi Seimitsu Industry Co., Ltd.
Detect Abnormalities in Automotive Parts
MATLAB use in project:
▪ Preprocessing of captured images
▪ Image annotation for training
▪ Deep learning-based analysis
– Various transfer learning methods (Combinations of CNN models, Classifiers)
– Estimation of defect area using Class Activation Map (CAM)
– Abnormality/defect classification
▪ Deployment to NVIDIA Jetson using GPU Coder
Automated visual inspection of 1.3 million bevel gears per month

Why MATLAB for AI Tasks?
▪ Increased productivity with interactive tools
▪ Generate simulation data for complex models and systems
▪ Ease of deployment and scaling to various platforms
▪ Full AI workflows that cannot be easily replicated by other toolchains
