BRKIOT-2394.Pdf

Unlocking the Mystery of Machine Learning and Big Data Analytics Robert Barton Jerome Henry Distinguished Architect Principal Engineer @MrRobbarto Office of the CTAO CCIE #6660 @wirelessccie CCDE #2013::6 CCIE #24750 CWNE #45 BRKIOT-2394 Cisco Webex Teams Questions? Use Cisco Webex Teams to chat with the speaker after the session How 1 Find this session in the Cisco Events Mobile App 2 Click “Join the Discussion” 3 Install Webex Teams or go directly to the team space 4 Enter messages/questions in the team space BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 3 Tuesday, Jan. 28th Monday, Jan. 27th Wednesday, Jan. 29th BRKIOT-2600 BRKIOT-2213 16:45 Enabling OT-IT collaboration by 17:00 From Zero to IOx Hero transforming traditional industrial TECIOT-2400 networks to modern IoT Architectures IoT Fundamentals 08:45 BRKIOT-1618 Bootcamp 14:45 Industrial IoT Network Management PSOIOT-1156 16:00 using Cisco Industrial Network Director Securing Industrial – A Deep Dive. Networks: Introduction to Cisco Cyber Vision PSOIOT-2155 Enhancing the Commuter 13:30 BRKIOT-1775 Experience - Service Wireless technologies and 14:30 BRKIOT-2698 BRKIOT-1520 Provider WiFi at the Use Cases in Industrial IOT Industrial IoT Routing – Connectivity 12:15 Cisco Remote & Mobile Asset speed of Trains and Beyond Solutions PSOIOT-2197 Cisco Innovates Autonomous 14:00 TECIOT-2000 Vehicles & Roadways w/ IoT BRKIOT-2497 BRKIOT-2900 Understanding Cisco's 14:30 IoT Solutions for Smart Cities and 11:00 Automating the Network of Internet Of Things (IOT) BRKIOT-2108 Communities Industrial Automation Solutions Connected Factory Architecture Theory and 11:00 Practice PSOIOT-2100 BRKIOT-1291 Unlock New Market 16:15 Opening Keynote 09:00 08:30 Opportunities with LoRaWAN for IOT Enterprises Embedded Cisco services Technologies IOT IOT IOT Track #CLEMEA www.ciscolive.com/emea/learn/technology-tracks.html Cisco Live Thursday, Jan. 30th Celebration 18:30 Friday, Jan. 31st Guest Keynote 17:00 BRKIOT-2548 BRKIOT-2100 Cisco Distributed 08:30 IoT and Intent-Based Networking Automation Solutions Solutions for Smart Cities and Connected Roadways 11:30 BRKIOT-2225 BRKIOT-3511 BRKIOT-2003 A security design for enabling IoT gateway scalable 09:45 Digital Building Theory & Practice deployment with Cisco Industry 4.0 Kinetic Gateway Management Module (GMM) BRKIOT-2204 BRKIOT-2394 Leveraging industrial BRKIOT-2526 Unlocking the Mystery of Machine 09:00 device visibility and 11:15 Wi-Fi Technology in Learning and Big Data operational intent to 14:45 Industrial IoT inform security policies and controls PSOIOT-2400 Bringing IT and OT together PSOIOT-1151 to drive business benefits Achieving business 13:15 outcomes using IoT 13:30 solutions IOT IOT IOT Track #CLEMEA www.ciscolive.com/emea/learn/technology-tracks.html Agenda 1. An introduction to Data Analytics and AI/ML • Principles 2. Fundamentals of Machine Learning • Principles • Supervised learning • Unsupervised learning • Deep learning (neural networks) 3. Applications of AI/ML at Cisco – IoT and Beyond • Data Analytics for IoT • Data Center • Collaboration • Security BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 6 The Purpose of This Presentation • AI/ML and Big Data are among the most hyped technologies today . • This presentation will teach you how AI/ML and Big Data Analytics really work • This presentation will give network engineers a foundation in these technologies to build network architectures for data analysis . especially in the world of IoT © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public An Introduction to Big Data Analytics and Machine Learning Data Analytics Enables Insights • Big data and analytics entails not just a change in the volume of data, but also a change in the way we work with the data. • Traditional data warehouse tools do not uncover connections within those data sets – we need data analytics with Machine Learning! Automation for Faster Reveal Make Data Focus on Results Hidden Patterns Driven Decisions Important Things BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 9 The Magic of Machine Learning Data is the Rocket Fuel for Machine Learning – Andrew Ng Machine Learning Definitions Traditional, Rules-based Arthur Samuel (1959) Live data Field of study that gives computers the Output ability to learn without being explicitly Program programmed Tom Mitchell (1997) Machine Learning / AI A computer program is said to learn if Training data Program its performance at a task T, as (Model) measured by a performance P, Output improves with experience E © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public So, What is Big Data ? (per Wikipedia definition) Big Data is the term for a collection of data sets so large and complex that it becomes difficult to process using traditional data processing tools © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public Comparing the Function of Traditional Data Management Systems (Structured Data) OLTP Business OLTP ETL EDW Intelligence Extract, OLTP Transform, Enterprise Load Data Online Transactional Warehouse Processing Systems BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 12 Comparing Structured and Unstructured Data Customer Name ID Phone Email Irene 14213 408-923- [email protected] 1242 Walter 62342 514-231- [email protected] 2315 Miyuki 12344 416-231- [email protected] 2341 Bank Account Account Balance Transactions Number Credit Debit 234123 $1,232 $123 423142 $5,231 $611 521231 $50 $512 BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 13 Unstructured vs. Structured Data • Unstructured data is a poor fit for traditional relational databases 10,000 •More than 90% is unstructured data UNSTRUCTURED DATA •Quantity doubles every 2 years •Most unstructured data is neither stored nor analyzed! (IN BILLIONS) GB of Data STRUCTURED DATA 0 2005 2010 2015 Source: Cloudera BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 14 3Vs of Big Data Big Data refers to distributed computing architectures specifically aimed at the “3 V’s” of data: Volume, Velocity, and Variety VOLUME VELOCITY VARIETY BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 15 Big Data Management Systems NoSQL (Not Only SQL) Massive Parallel Fast key-value Processing (MPP) store/retrieve in real time Shared-nothing Structured and Unstructured Structured Data (SQL) Hadoop Distributed batch, query, and processing Unstructured Data BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 16 The Hadoop Stack Top-Level Interfaces ETLs Viz Tools BI Tools Mid-Level ZOO Many PIG HIVE Abstractions KEEPER more . (Projects) Distributed Data MapReduce / YARN Processing The foundational Hadoop File System (HDFS) file system BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 17 Basics of the Hadoop File System (HDFS) Master Node Name Node Scalability of data management and processing Data is linear with the addition of new slave nodes Data Data Slave Nodes . Data Node Data Node Data Node BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 18 Basics of the Hadoop File System (HDFS) Master Node Name Node Data Data Data . Slave Nodes Data Node Data Node Data Node BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 19 Analyzing the Data (MapReduce) Master Node Batch processing of Name Node Java API Application the application Job job (the job sits Job Tracker in the queue) Task Tracker Task Tracker Task Tracker Slave Nodes . Data Node Data Node Data Node BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 20 MapReduce: Job Trackers and Task Trackers Master Node Batch processing of Name Node Java API Application the application Job job (the job sits Job Tracker in the queue) Task Tracker Task Tracker Task Tracker Slave Nodes . Data Node Data Node Data Node BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 21 MapReduce: Job Trackers and Task Trackers Master Node Name Node Application Job Job Tracker Output Cisco Cloudera CVD: https://www.cisco.com/c/en/us/td/docs/u nified_computing/ucs/UCS_CVDs/Cisco _UCS_Integrated_Infrastructure_for_Big _Data_with_Cloudera_28node.html Task Tracker Task Tracker Task Tracker . Data Node Data Node Data Node BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 22 Making Sense of Data – Fundamental Concepts of Machine Learning Machine learning will decide which IoT ‘things’ survive - Jaques Touillon, Airboxlab BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 24 Machine Learning Map Artificial Intelligence Any technology that allows a computing system to mimic human thinking functions Example 1 Example 2 If XYZ: do blah else: do blih BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 25 Machine Learning Map Artificial Intelligence Machine Learning An AI technology where all the rules are not set in the program, but are learned Example while the program is used BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 26 ML Applications Lip Reading Deep Blue Watson Deep Mind Playing Music WaveNet (TTS) Counting people Recognition NLP Translation Self-driving Computer Networks And Many more: CRM, Healthcare, Personal Assistants, …. BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 27 McCullogh & Pitts - 1943 Artificial Intelligence Binary threshold neuron Von Neumann used this logic when designing the “universal computer” If sum > threshold, output 1 Send weighted inputs Output 0 otherwise BRKIOT-2394 © 2020 Cisco and/or its affiliates. All rights reserved. Cisco Public 28 In the 1950s, ML Research Expanded • 1962, Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms, Frank Rosenblatt When I show these shapes to the camera This IBM 704 computaah can say “it’s a triangle” (you will see why this works soon) BRKIOT-2394 © 2020 Cisco and/or its affiliates.

BRKIOT-2394.Pdf

Supermicro GPU Solutions Optimized for NVIDIA Nvlink

Identificação De Textos Em Imagens CAPTCHA Utilizando Conceitos De

High Performance Computing and AI Solutions Portfolio

Vikas Sindhwani Google May 17-19, 2016 2016 Summer School on Signal Processing and Machine Learning for Big Data

Warned That Big, Messy AI Systems Would Generate Racist, Unfair Results

Gmail Smart Compose: Real-Time Assisted Writing

The Machine Learning Journey with Google

Google's 'Project Nightingale' Gathers Personal Health Data

NVIDIA Gpudirect RDMA (GDR) Host Memory Host Memory • Pipeline Through Host for Large Msg 2 IB IB 4 2

Dell EMC Poweredge C4140 Technical Guide

HPE Apollo 6500 Gen10 System Overview

The Computer That Could Be Smarter Than Us Cognitive Computing