Oracle Machine Learning
Total Page:16
File Type:pdf, Size:1020Kb
Oracle Machine Learning Platform Overview Move the algorithms, not the data! Mark Hornick, Senior Director, Product Management Data Science and Machine Learning Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle. Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 2 Agenda • Oracle Machine Learning family of products • Supporting multiple personas • OML component details • Enabling Oracle Applications use of ML Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 3 Oracle Machine Learning OML4SQL OML Notebooks Oracle Advanced Analytics with Apache Zeppelin on SQL API Autonomous Database OML4R Oracle Data Miner Oracle R Enterprise Oracle SQL Developer extension R API OML4Py* OML4Spark Python API Oracle R Advanced Analytics for Hadoop OML Microservices** Supporting Oracle Applications Image, Text, Scoring, Deployment, Model Management * Coming soon ** For use through Oracle Application only Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 4 New Branding Corresponding Products and Components Oracle Advanced Analytics (OAA) and Oracle Machine Learning (OML) Oracle R Advanced Analytics for Hadoop (ORAAH) Oracle Machine Learning for SQL (OML4SQL) Oracle Advanced Analytics (OAA) / Oracle Data Mining (ODM) Oracle Machine Learning for R (OML4R) Oracle R Enterprise (ORE) Oracle Machine Learning for Python (OML4Py) Oracle Machine Learning for Python (OML4Py) Oracle Machine Learning for Spark (OML4Spark) Oracle R Advanced Analytics for Hadoop (ORAAH) Oracle Machine Learning Notebooks previously known as Oracle Machine Learning Oracle Data Miner (ODMr) Note: The official price list product names have not changed. Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 5 OML empowers Enterprise Users Data Scientists Business and Executives Oracle Data Analysts Machine Learning Application and DBA and IT Dashboard Professionals Developers Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 6 Data Scientists Data Scientists •Popular data science languages: Business and Executives R, Python, SQL Oracle Data Analysts •Augment with 3rd party packages •Scalability and performance Machine •Automation-enhanced productivity Learning •Greater enterprise collaboration •Integrate and analyze data across the enterprise Application and DBA and IT Dashboard Professionals Developers Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 7 Business and Data Analysts •Expand analytical tool set with ML Data Scientists •Enable non-ML experts with AutoML •Leverage domain knowledge for better results •Collaborate with Data Scientists and IT Business and Executives Oracle Data Analysts Machine Learning Application and DBA and IT Dashboard Professionals Developers Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 8 DBA and IT Professionals Data Scientists Business and Executives Oracle Data Analysts Machine Learning • Even greater value from Oracle investment • Support scalability and performance • Simpler, streamlined infrastructure Application and DBA and IT • Maintain data security, backup, recovery Dashboard Professionals Developers • Use SQL, expand to Python and R • Leverage Database and Big Data sources Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 9 Application and Dashboard Developers Data Scientists Business and Executives Oracle Data Analysts Machine • Realize intelligent solutions faster through Oracle stack integration Learning • Easily uptake data scientists’ R, Python, and SQL scripts and rapidly deploy solutions Application and DBA and IT Dashboard • Embed ML in applications and dashboards Professionals using SQL, REST, and SODA APIs Developers Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | Executives • Benefit from world-class data Data Scientists management technology and support • Democratize ML across the enterprise to enable better data-driven decisions Business and • Deploy solutions faster to realize ROI Executives Oracle Data Analysts Machine Learning Application and DBA and IT Dashboard Professionals Developers Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | Vision: Autonomous Database evolves into Autonomous Data Platform Data Ingest Data Big Analytics Data Machine Application Learning Development From essentially a database cloud service… …to a tightly integrated data platform Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 12 Vision: Manage and Analyze Cross-Platform Data with Oracle Machine Learning Data Lake Oracle Database Select API Cloud or Reach broader Oracle Object User Interface, e.g. Options On-premises Data Sources Storage Amazon OML Notebooks REST S3 Popular R Azure Blob OML4R Oracle Cloud SQL Storage IDEs Popular Python NoSQL OML4Python Oracle Big Data SQL Databases IDEs Kafka SQL Developer OML4SQL Streams Big Data OML4Spark Service (HDFS) Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 13 Oracle Machine Learning Algorithms and Analytics CLASSIFICATION REGRESSION FEATURE EXTRACTION – Naïve Bayes – Linear Model – Principal Comp Analysis (PCA) – Logistic Regression (GLM) – Generalized Linear Model – Non-negative Matrix Factorization – Decision Tree – Support Vector Machine (SVM) – Singular Value Decomposition (SVD) – Random Forest – Stepwise Linear regression – Explicit Semantic Analysis (ESA) – Neural Network – Neural Network – Support Vector Machine – LASSO TEXT MINING SUPPORT – Explicit Semantic Analysis – Algorithms support text columns ATTRIBUTE IMPORTANCE – Tokenization and theme extraction CLUSTERING – Minimum Description Length – Explicit Semantic Analysis (ESA) for – Hierarchical K-Means – Principal Component Analysis (PCA) document similarity – Hierarchical O-Cluster – Unsupervised Pair-wise KL Div – Expectation Maximization (EM) – CUR decomposition for row & AI STATISTICAL FUNCTIONS – Basic statistics: min, max, ANOMALY DETECTION ASSOCIATION RULES median, stdev, t-test, F-test, Pearson’s, – One-Class SVM – A priori/ market basket Chi-Sq, ANOVA, etc. TIME SERIES PREDICTIVE QUERIES R AND PYTHON PACKAGES – Forecasting - Exponential Smoothing – Predict, cluster, detect, features – Third-party R and Python Packages – Includes popular models through Embedded Execution e.g. Holt-Winters with trends, SQL ANALYTICS – Spark MLlib algorithm integration seasonality, irregularity, missing data – SQL Windows – SQL Patterns – SQL Aggregates Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 14 OML algorithms differentiators Feature In-Database Spark No data movement to separate analytical engines Wide range of ML techniques supported Native Native and Spark MLlib High performance from parallel, distributed execution Spark 2-based. Use all nodes from Hadoop cluster Greater scalability from improved memory utilization Handle narrow, wide, and sparse data Plus star schema, nested data Automation Data preparation, text mining, partitioned models, AutoML Multiple language interfaces SQL, R, Python R, Java High performance R Formula for R Language Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | Oracle Machine Learning Notebooks for Autonomous Database Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | 1 6 Oracle Machine Learning Notebooks Autonomous Database as a Data Science Platform • Collaborative UI – Based on Apache Zeppelin – Supports data scientists, data analysts, application developers, DBAs – Easy sharing of notebooks and templates with permissions, versioning, and execution scheduling • Included with Autonomous Database – Automatically provisioned, managed, backed up – In-database machine learning algorithms and analytics functions via OML4SQL, and soon to be augmented with Python and R Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | Oracle Machine Learning for SQL (OML4SQL) for Python (OML4Py) for R (OML4R) Empower SQL users with immediate access to ML in Oracle Database and Oracle Autonomous Database Empower data scientists with open source environments Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | Traditional Analytics and Data Source Interaction Read/Write files using built-in tool capabilities Data Source ? read Flat Files extract / export export load Data source connectivity packages • Access latency Deployment • Paradigm shift: e.g., R/Python Data Access Language R/Python Ad hoc • Memory limitation – data size, in-memory processing cron job • Single threaded • Issues for backup, recovery, security • Ad hoc production deployment Copyright © 2019, Oracle and/or its affiliates. All rights reserved. | Oracle Machine Learning for SQL Component of ADB and Oracle Advanced Analytics option to Oracle Database • Use in-database parallel and distributed OML Notebooks SQL Interfaces machine learning algorithms from SQL SQL*Plus SQLDeveloper and PL/SQL … • ML models as first class database objects • Export / import models across databases • Batch and real-time scoring with explanatory predictive