
Feb 2020 @qualcomm_tech

Making AI ubiquitous
Qualcomm Technologies, Inc.

Devices, machines, and things are becoming more intelligent

• Perception: Hear, see, monitor, and observe
• Reasoning: Learn, infer context, and anticipate
• Action: Act intuitively, interact naturally, and protect privacy

Offering new capabilities to enrich our lives

A world where virtually everyone and everything is intelligently connected

The intelligent wireless edge

On-device AI, processing, sensing, vision,… augmented by edge cloud

Process data closest to the source to scale for massive amount of data and connected things

Edge cloud:
• Distributed autonomy
• New experiences
• Processing over
• Customized/local value
• Private/public networks

On-device:
• Privacy/security
• Immediacy
• Efficiency
• Reliability
• Personalization

Process data at the source to scale AI and make sense of a digitized world

• Past: Cloud-centric AI. AI training and AI inference in the central cloud
• Today: Partially-distributed AI. Power-efficient on-device AI inference
• Future: Fully-distributed AI. With lifelong on-device learning

On-device intelligence is paramount
Process data closest to the source, complement the cloud

• Privacy
• Reliability
• Low latency
• Efficient use of network bandwidth

On-device intelligence is quickly gaining momentum
Key segments are expected to see full AI attach rates by 2025

• AI attach rate: 10% in 2018, 100% in 2025
• Segments: Mobile, Automotive, XR, PCs/Tablets, Smart speakers

Source: Tractica, 2019

Mobile is becoming the pervasive AI platform
~7.8 billion cumulative unit shipments forecast between 2018–2022

Source: IDC, Aug. '18

Mobile scale changes everything
Bringing AI to the masses

• Superior scale
• Rapid replacement cycles
• Integrated/optimized technologies

Segments: Smartphones, Mobile computing, Wearables, Extended reality, Automotive, Smart homes, Smart cities, Healthcare, Industrial IoT, Networking

AI offers enhanced experiences and new capabilities for

• True personal assistance
• Superior photography
• Extended battery life
• Natural user interfaces
• Enhanced connectivity
• Enhanced security

A new development paradigm where things repeatedly improve

AI will drive transformation across industries

Boundless mobile XR experiences

Shaping the future of transportation
• Personalized driver settings
• Driver awareness monitoring
• Greater autonomous capabilities

Powering the factory of the future

AI for IoT across the home, industrial/enterprise, and Smart Cities

• Autonomous manufacturing and robotics
• Smart security for home and enterprise
• Smart displays and speakers
• Smarter agriculture
• More efficient use of energy and utilities
• Home hubs and smart appliances
• Sustainable cities and infrastructure
• Digitized logistics and retail

Power and thermal efficiency are essential for on-device AI

The challenge of AI workloads:
• Very compute intensive
• Large, complicated neural network models
• Complex concurrencies
• Real-time
• Always-on

Constrained mobile environment:
• Must be thermally efficient for sleek, ultra-light designs
• Requires long battery life for all-day use
• Storage/Memory bandwidth limitations

Making power efficient AI pervasive
Focusing on high performance HW/SW and optimized network design

Efficient hardware:
• Developing heterogeneous compute to run demanding neural networks at low power and within thermal limits
• Selecting the right compute block for the right task

Algorithmic advancements:
• Algorithmic research that benefits from state-of-the-art deep neural networks
• Optimization for space and runtime efficiency

Tools:
• Software accelerated run-time for deep learning
• SDK/development frameworks

Our AI leadership
Over a decade of cutting-edge AI R&D, speeding up commercialization and enabling scale

Timeline axis: 2007, 2009, 2013, 2015, 2016, 2017, 2018, 2019, 2020

Research and ecosystem milestones shown on the timeline:
• Qualcomm Research initiates first AI project
• Research in spiking neural networks
• Research artificial neural processing architectures
• Deep-learning based AlexNet wins ImageNet competition
• Investment and collaboration with Brain Corp
• Completed Brain Corp joint research
• Acquired EuVision
• Research face detection with deep learning
• MWC demo showcasing photo sorting and handwriting recognition
• Opened Qualcomm Research Netherlands
• Opened joint research lab with University of Amsterdam
• Acquired Scyfer
• Collaboration on TensorFlow
• Announced Facebook Caffe2 support
• Brain Corp raises $114M
• Qualcomm Technologies ships ONNX (supported by partners including Facebook)
• Qualcomm® Artificial Intelligence Research initiated
• Mobile AI Enablement Center in Taiwan to open
• Power efficiency gains through compression, quantization, and compilation
• Qualcomm Technologies researchers win best paper at ICLR
• Gauge equivariant CNNs

Products shown on the timeline:
• Qualcomm® Neural Processing SDK
• Snapdragon 660, 630
• Qualcomm® Vision Intelligence Platform
• Snapdragon 665, 730, 730G
• Snapdragon 710
• 3rd Gen Snapdragon Automotive Cockpit
• Qualcomm® Snapdragon Ride Platform
• Qualcomm® Cloud AI 100
• Qualcomm® QCS400 (first audio SoC)
• 1st Gen Qualcomm AI Engine (Qualcomm® Snapdragon™ 820 Mobile Platform)
• 2nd Gen AI Engine (Snapdragon 835)
• 3rd Gen AI Engine (Snapdragon 845)
• 4th Gen AI Engine (Snapdragon 855)
• 5th Gen AI Engine (Snapdragon 865)

Consistent AI R&D investment is the foundation for product leadership

Qualcomm Artificial Intelligence Research is an initiative of Qualcomm Technologies, Inc. Qualcomm Snapdragon, Qualcomm Neural Processing SDK, Qualcomm Vision Intelligence Platform, Qualcomm AI Engine, Qualcomm Cloud AI, Qualcomm Snapdragon Ride, and Qualcomm QCS400 are products of Qualcomm Technologies, Inc. and/or its subsidiaries.

Leading research and development across the entire spectrum of AI

From fundamental research to applied research:
• G-CNN
• Deep transfer learning
• Neural network compression
• Graph and kernel optimization
• Machine learning training tools
• Deep learning for graphics
• Bayesian combinatorial optimization
• Deep generative models
• Neural network quantization
• Voice UI
• Compute in memory
• Source compression
• CV DL for new sensors
• Hybrid reinforcement learning
• Fingerprint
• Video recognition & prediction
• Bayesian distributed learning
• Hardware-aware deep learning
• Power management

Can we apply foundational mathematics of physics, like quantum field theory, to deep learning?

G-CNN Video

Applying foundational mathematics of physics, like quantum field theory, to deep learning

Today’s deep learning: Traditional CNNs
• Produce state-of-the-art results but do not generalize to input transformations like rotations
• Translation works; rotation doesn’t work
• Convolutional neural networks would need to be retrained with new rotated images to determine a new set of parameters, like filter weights

Tomorrow’s deep learning: Gauge Equivariant CNNs
• No matter how you rotate or move the object, the generalized model will still identify the object
• Rotated objects and images applicable to drones, robots, cars, fisheye-lens cameras, VR, AR, ...
• Generalized CNNs (G-CNN): Gauge equivariant CNN, Group, and Steerable CNN, pioneered by Qualcomm AI Research, do not need to be retrained

Advancing fundamental AI research, such as generalized CNNs

Unifying framework:
• Gauge equivariant CNNs unify special cases like Group CNNs and Steerable CNNs, all pioneered by Qualcomm AI Research
• G-CNN can generalize models for different symmetries; traditional CNNs must be retrained

Equivariance:
• No matter how you rotate or move the object, it will still be identified
• Robust performance, faster training, and fewer training examples required

Broad societal benefits:
• Use cases like drones, robots, cars, XR, fisheye lenses, 3D gaming, …
• But also areas like state-of-the-art accuracy on climate pattern segmentation

Generalized geometry:
• Traditional CNNs work well on narrow field-of-view cameras, but fail on, e.g., fish-eye cameras
• G-CNN can analyze image data on any curved space, from flat to spherical
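To make the equivariance claim concrete, here is a toy sketch in plain Python. It is not Qualcomm's gauge equivariant CNN: it only covers the simplest group-CNN idea, 90-degree rotations on a square grid with wrap-around borders. All function names are illustrative assumptions. It checks that an ordinary filter commutes with translation but not with rotation, and that taking the maximum over the filter in all four orientations restores rotation equivariance.

```python
def shift(img, dy, dx):
    """Circularly translate a square image by (dy, dx) pixels."""
    n = len(img)
    return [[img[(i - dy) % n][(j - dx) % n] for j in range(n)] for i in range(n)]


def rot90(m):
    """Rotate a square grid by 90 degrees counter-clockwise."""
    n = len(m)
    return [[m[j][n - 1 - i] for j in range(n)] for i in range(n)]


def corr(img, k):
    """3x3 cross-correlation with wrap-around borders (a plain CNN filter)."""
    n = len(img)
    return [[sum(k[a + 1][b + 1] * img[(i + a) % n][(j + b) % n]
                 for a in (-1, 0, 1) for b in (-1, 0, 1))
             for j in range(n)] for i in range(n)]


def c4_response(img, k):
    """Pixel-wise max over the filter applied in all four 90-degree orientations."""
    maps = []
    for _ in range(4):
        maps.append(corr(img, k))
        k = rot90(k)
    n = len(img)
    return [[max(m[i][j] for m in maps) for j in range(n)] for i in range(n)]


# Toy input: one bright pixel, and a vertical-gradient filter.
img = [[0, 0, 0, 0],
       [0, 9, 0, 0],
       [0, 0, 0, 0],
       [0, 0, 0, 0]]
k = [[-1, -1, -1],
     [0, 0, 0],
     [1, 1, 1]]

# Filtering a shifted image equals shifting the filtered image.
translation_ok = corr(shift(img, 1, 2), k) == shift(corr(img, k), 1, 2)
# The same does not hold for rotation with a single fixed filter.
rotation_plain_ok = corr(rot90(img), k) == rot90(corr(img, k))
# Pooling over the four filter orientations makes the response rotate with the input.
rotation_c4_ok = c4_response(rot90(img), k) == rot90(c4_response(img, k))

print(translation_ok, rotation_plain_ok, rotation_c4_ok)
```

The third check works because rotating the input only permutes the four oriented responses, so their pixel-wise maximum rotates along with the image. No retraining or extra rotated examples are involved.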

Pioneering deep learning research in generalized CNNs

New input data → Trained neural network model → Inference output

Applying AI to optimize AI models through automated techniques:
• Compression: Learning to prune model while keeping desired accuracy
• Quantization: Learning to reduce bit-precision while keeping desired accuracy
• Compilation: Learning to compile AI models for efficient hardware execution

• Hardware awareness
• AI acceleration (scalar, vector, tensor)
• Acceleration research, such as compute-in-memory
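As a rough illustration of the compression item above, the sketch below applies plain magnitude pruning to a small weight vector. This is a generic textbook technique chosen for brevity, not the learned compression method the slides refer to. The function name and the example weights are assumptions made for this example.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude."""
    n_prune = int(len(weights) * sparsity)
    # Indices ordered by ascending |w|; the first n_prune entries are removed.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


weights = [0.8, -0.05, 0.3, 0.01, -0.6, 0.02, 0.4, -0.03]
inputs = [1.0] * len(weights)

pruned = magnitude_prune(weights, sparsity=0.5)

# Compare a single neuron's output before and after pruning.
dense_out = sum(w * x for w, x in zip(weights, inputs))
pruned_out = sum(w * x for w, x in zip(pruned, inputs))

print(pruned)
print(abs(dense_out - pruned_out))
```

Half of the weights are removed, yet the output moves only slightly because the dropped weights were small. A real pipeline would measure accuracy on validation data and usually fine-tune after pruning.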

Advancing AI research to increase power efficiency
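The quantization idea, reducing bit-precision while keeping accuracy, can be sketched with simple symmetric post-training quantization from 32-bit floats to 8-bit integers. This is a generic illustration under assumed names and example values, not the learned quantization approach described in the slides.

```python
import struct


def quantize_int8(values):
    """Symmetric per-tensor quantization of floats to the int8 range [-127, 127]."""
    scale = max(abs(v) for v in values) / 127 or 1.0  # 1.0 guards an all-zero input
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale


def dequantize(q, scale):
    """Map the integers back to approximate float values."""
    return [x * scale for x in q]


weights = [-1.27, -0.5, 0.0, 0.25, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
max_error = max(abs(w - r) for w, r in zip(weights, restored))

# Storage per tensor: 4 bytes per FP32 value versus 1 byte per INT8 value.
fp32_bytes = struct.calcsize("f") * len(weights)
int8_bytes = struct.calcsize("b") * len(weights)

print(q, max_error, fp32_bytes // int8_bytes)
```

Each weight takes a quarter of the storage, and the round-trip error stays within half a quantization step. Whether model accuracy survives depends on the network and must be checked on real data.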

Recent examples:
• 3x compression with less than 1% loss in accuracy (1)
• >4x perf. per watt improvement from savings in memory and compute (2)
• 4x performance improvement over TensorFlow Lite (3)

1: With both Bayesian compression and spatial SVD with ResNet18 as baseline.
2: For a quantized INT8 model vs a FP32 model that is not quantized.
3: On average improvement of tested AI models.

Qualcomm® Artificial Intelligence Engine
The hardware and software components for efficient on-device machine learning

5th Gen AI Engine

• Mobile Apps
• NN frameworks: Cognitive Toolkit
• Software, runtime frameworks: Qualcomm® Neural Processing SDK, TensorFlow Lite, Google NN API
• Software, libraries: Qualcomm® Math Libraries, OpenCL, Hexagon NN
• Cores: Qualcomm® Hexagon™ DSP (scalar, vector, tensor), Qualcomm® Kryo™ CPU, Qualcomm® Adreno™ GPU

Qualcomm Math Libraries, Qualcomm Artificial Intelligence Engine, Qualcomm Kryo, Qualcomm Adreno, Qualcomm Hexagon, and Qualcomm Neural Processing SDK are products of Qualcomm Technologies, Inc. and/or its subsidiaries.

5th gen Qualcomm AI Engine: AI highlights
15 TOPS

• Adreno 650: 2x higher TOPS; new AI mixed precision 16-bit and 32-bit FP instructions
• Hexagon 698: 4x higher TOPS; up to 35% power savings; deep learning bandwidth compression
• LP-DDR5 memory: 30% more bandwidth
• Qualcomm Neural Processing SDK: new features and improvements
• NNAPI support: 3x number of accelerated operators; optimized for Google ASR and Google Lens
• Hexagon NN Direct: provide developers direct access to Hexagon
• AI model efficiency toolkit: data free quantization; quantization aware training; model compression

Qualcomm® Cloud AI 100

• Built on 7nm
• >350 TOPS Peak AI Performance
• Sampling 2nd half of 2019

Qualcomm® Neural Processing SDK
Software accelerated runtime for the execution of deep neural networks on device

Efficient execution on Snapdragon
• Takes advantage of Snapdragon capabilities
• Runtime and libraries accelerate deep neural net processing on all engines: CPU, GPU, and DSP with HVX and HTA (Qualcomm® Kryo™ CPU, Qualcomm® Adreno™ GPU, Qualcomm® Hexagon™ DSP)

Model framework/Network support
• Convolutional neural networks and LSTMs
• Support for Caffe/Caffe2, TensorFlow, and user/developer defined layers

Optimization/Debugging tools
• Offline network conversion tools
• Debug and analyze network performance

Offline conversion tools · Analyze performance · Sample code · Ease of integration
• API and SDK documentation with sample code
• Ease of integration into customer applications
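Running a network on "all engines" implies that an application has to decide which engine to use on a given device. The sketch below shows one common pattern, a preference list with fallback. It does not use the Qualcomm Neural Processing SDK API: the function name, the runtime labels, and the preference order are hypothetical and chosen only for illustration.

```python
# Hypothetical preference order (an assumption, not taken from the SDK).
PREFERRED_ORDER = ("DSP", "GPU", "CPU")


def pick_runtime(available, supported_by_model, order=PREFERRED_ORDER):
    """Return the first runtime in `order` that the device has and the model supports."""
    for runtime in order:
        if runtime in available and runtime in supported_by_model:
            return runtime
    raise RuntimeError("no common runtime for this device and model")


# Example: the device exposes no DSP, so the choice falls back to the GPU.
choice = pick_runtime(available={"CPU", "GPU"},
                      supported_by_model={"CPU", "GPU", "DSP"})
print(choice)
```

Keeping the order as a parameter lets an application trade latency against power per use case instead of hard-coding one policy.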

Available at: developer.qualcomm.com

Qualcomm Kryo, Qualcomm Adreno and Qualcomm Hexagon are products of Qualcomm Technologies, Inc.

Qualcomm AI Engine
Frameworks · OS · Ecosystem · Features · Devices

Frameworks: Cognitive Toolkit

Features:
• Noise Suppression
• Super Resolution
• Night Shot
• Face Recognition
• Speech Recognition
• Object Detection
• Video Segmentation
• Bokeh

Uniquely positioned to power the intelligently connected future

• Foundational R&D
• 5G + AI technology leadership
• Systems design expertise
• Advanced silicon
• Ecosystem investment (Qualcomm Ventures AI Fund)

Intelligence is becoming more distributed, with power-efficient on-device AI complementing the cloud

Mobile is democratizing AI and bringing it to new frontiers

Qualcomm Technologies is well positioned to provide superior AI solutions and make AI ubiquitous

Questions? Connect with Us

www.qualcomm.com/ai
www.qualcomm.com/news/onq (blog)
@qualcomm_tech
http://www.youtube.com/playlist?list=PL8AD95E4F585237C1&feature=plcp
http://www.slideshare.net/qualcommwirelessevolution

Thank you!

For more information, visit us at: www.qualcomm.com & www.qualcomm.com/blog

Nothing in these materials is an offer to sell any of the components or devices referenced herein.

©2018 Qualcomm Technologies, Inc. and/or its affiliated companies. All Rights Reserved.

Qualcomm, Snapdragon, Hexagon, Adreno, and Kryo are trademarks of Qualcomm Incorporated, registered in the United States and other countries. Other products and brand names may be trademarks or registered trademarks of their respective owners.

References in this presentation to “Qualcomm” may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable. Qualcomm Incorporated includes Qualcomm’s licensing business, QTL, and the vast majority of its patent portfolio. Qualcomm Technologies, Inc., a wholly-owned subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of Qualcomm’s engineering, research and development functions, and substantially all of its product and services businesses, including its business, QCT.