April 5, 2018
Db2 on Cloud: Accelerate Development and Testing on the Cloud
Simon Lightstone, Offering Manager, Db2 on Cloud (simonlig@ca.ibm.com)

Db2 Tech Talk series: Host and Today’s Speaker
Lynn Chou: Db2 Tech Talk Host; Community Marketing Manager, Core Databases and IBM Analytics Champions
Simon Lightstone: Speaker; IBM Offering Manager, Db2 on Cloud
Noah Kuttler: Speaker; IBM Product Marketing Manager, Db2 on Cloud

Safe Harbor Statement
Copyright © IBM Corporation 2018. All rights reserved.
U.S. Government Users Restricted Rights - Use, duplication, or disclosure restricted by GSA ADP Schedule Contract with IBM Corporation
THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL PURPOSES ONLY. WHILE EFFORTS WERE MADE TO VERIFY THE COMPLETENESS AND ACCURACY OF THE INFORMATION CONTAINED IN THIS PRESENTATION, IT IS PROVIDED “AS IS” WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED. IN ADDITION, THIS INFORMATION IS BASED ON CURRENT THINKING REGARDING TRENDS AND DIRECTIONS, WHICH ARE SUBJECT TO CHANGE BY IBM WITHOUT NOTICE. FUNCTION DESCRIBED HEREIN MAY NEVER BE DELIVERED BY IBM. IBM SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION. NOTHING CONTAINED IN THIS PRESENTATION IS INTENDED TO, NOR SHALL HAVE THE EFFECT OF, CREATING ANY WARRANTIES OR REPRESENTATIONS FROM IBM (OR ITS SUPPLIERS OR LICENSORS), OR ALTERING THE TERMS AND CONDITIONS OF ANY AGREEMENT OR LICENSE GOVERNING THE USE OF IBM PRODUCTS AND/OR SOFTWARE.
IBM, the IBM logo, ibm.com and DB2 are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was published. Such trademarks may also be registered or common law trademarks in other countries. A current list of IBM trademarks is available on the Web at “Copyright and trademark information” at www.ibm.com/legal/copytrade.shtml
Contents
1. Introduction
2. Db2 Cloud Family
   Very quick overview
   Flex Architecture
3. Notable Addons: DSX and more
   DSX with DSM based Db2 data
4. APIs
5. Tutorial Samples
6. Closing and Feedback
Think 2018 / DOC ID 8290 / March 19, 2018 / © 2018 IBM Corporation

Next Era of Db2 and Developers for Cloud
Deployment Options
[Diagram: deployment options from On Premises to On Cloud, arranged along an axis from CONTROL to SIMPLICITY: Appliance, Software, Container, Hosted DBaaS (Db2 Hosted), Managed DBaaS (Db2 on Cloud)]
“Nearly half of large enterprises will have hybrid cloud deployments by the end of 2017” (Gartner, Special Report “Private Cloud Matures, Hybrid Cloud Is Next.”)
The Common SQL Engine enables flexibility and optimizes customer investments as hybrid data architectures evolve
Built on a common SQL engine enabling true hybrid integration, choice, and portability

Managed or hosted public Cloud DBaaS: Db2 on Cloud, Db2 Hosted
Managed public Cloud DWaaS: Db2 Warehouse on Cloud
SW defined warehouse on-premises or in cloud: Db2 Warehouse
Custom deployable database SW: Db2
Open source Hadoop with Hortonworks: Db2 Big SQL
Dedicated analytics appliance: Integrated Analytics System

• Application Agility: Write once, run anywhere. One ISV product certification for all platforms
• Operational compatibility: Reuse operational and housekeeping procedures
• Licensing: Flexible entitlements for business agility and cost-optimization
• Integration: Common Data Virtualization capabilities for query federation and data movement
• Standardized analytics: Common programming model for in-DB analytics
• Common Skills: One skill set for all deployments
Drive higher efficiencies and portfolio rationalization
SQL guidance: http://ibm.biz/CommonSQL

A deeper dive into the components of the Common SQL Engine
Built on a common SQL engine enabling true hybrid integration, choice, and portability

Managed or hosted public Cloud DBaaS: Db2 on Cloud, Db2 Hosted
Managed public Cloud DWaaS: Db2 Warehouse on Cloud
SW defined warehouse on-premises or in cloud: Db2 Warehouse
Custom deployable database SW: Db2
Open source Hadoop with Hortonworks: Db2 Big SQL
Dedicated analytics appliance: Integrated Analytics System
Open Source and other Commercial databases: MySQL, PostgreSQL, Parquet (Db2 Event Store)

Data services for SQL/NoSQL and REST APIs with secure data transfer: Accept, parse/validate SQL (including Oracle and PostgreSQL), and process/return data
Statement optimization: Leverage statistics on schema, data, and distribution to make better query decisions and improve over time
Identify and manage workloads meeting complex business goals: Identification of work, management, monitoring and continuous improvement
Security and accessibility of data: Encryption of data in transit and at rest, with authorized access to data only to those who are entitled
Virtualization via query federation: Common SQL via Query Federation. Connect to 3rd party and Open Source data sources and objects, expanding your reach

Management with a Single Pane of Glass
Monitor, Administer, Manage, and Tune your Environment
§ A single pane of glass
§ Simple to install and setup
§ Scales to 100s of databases
§ Historical analysis: minutes, hours, days
§ Smart alerts with recommendations

IBM Supports Your Data Access Patterns
Multiple access Styles and Database Languages
§ SQL – Structured Query Language
  • Rich support of the SQL language
§ JSON – Storage and manipulation of JSON records within a relational database
§ REST – OData standards based access to data
§ pureXML – Storage and manipulation of XML documents
  • Supports XQuery and SQL/XML functions
§ SQL Procedural Language (SQL PL)
  • It can be used to implement procedural logic in SQL statements
§ PL/SQL – Procedural Language/Structured Query Language
  • Reduces the complexity of enabling existing PL/SQL solutions so that they will work with the Db2 data server
§ RDF Graph Store – Native support for graph triples

New: Db2 on Cloud Free Plan (“Lite”)
§ Deploys instantly in 1 click.
§ No tools to download. SQL editor is built in.
§ No time limits.
§ Great for prototyping.
§ Get started using the IBM Cloud. https://ibm.biz/freedb2

A Simple “Download and Go” experience!
Up, Running, and Productive in Minutes
§ Marketplace or Discovery Portal – single point of download
§ Download platform-specific executable (Mac, Windows, or Linux)
§ Small download, less than 15 minutes from “I want” to “I got”
§ 3 clicks and userID/pwd input only

Db2 Family Cloud Offerings: Flex Plans, Precise Performance Plans, Free Tiers
“Flex” Plans Let You Scale on the Cloud
Pain-free scaling with sliderbars
§ Db2 on Cloud Flex
  • Scale your app without downtime
  • Scaling typically takes < 20 minutes
  • Fully managed, 99.99% uptime SLA
§ Db2 Warehouse on Cloud Flex
  • Scale analytics workloads easily
  • MPP cluster with BLU in-memory acceleration
  • Fully managed, 45 min scale time. Up to 48 TB
§ Db2 Hosted Flex
  • Unmanaged Db2: IBM hosts it; you manage it
  • Full root access

§ Lower development costs
§ Scale down over weekends
§ Scale up before running tests
§ Deploy & scale via API

Db2 on Cloud – Plans
Fully managed, secure, dedicated single tenant environments with built-in Oracle compatibility

Precise Performance Plans (fixed price, balanced configurations)

                       PP 500         PP 1400        PP 10000
Server Type:           Virtual        Bare Metal     Bare Metal
# Cores:               2 (virtual)    12             48
Memory:                8 GB           128 GB         1 TB
Storage:               500 GB HDD     1.4 TB SSD     11 TB SSD
Monthly Price (USD)
  Non-HA:              $250           $4,000         $18,000
  HA:                  $500           $8,000         $36,000

Flex Plan (elastic scaling of compute & storage)

Server Type:           Virtual
# Cores:               1 - 32 (virtual)
Memory:                4 - 128 GB
Storage:               2 GB - 4 TB
Monthly Price (USD)
  Non-HA:              $189 for:
                       • 1 core
                       • 4 GB RAM
                       • 2 GB storage
                       • 2 million IOs
                       Scale beyond that for:
                       • $52 per core & 4 GB RAM
                       • $1 per GB storage
                       • $0.20 per 1 million IOs
  HA:                  2x non-HA price (except IOs – IOs only charged for on primary server)
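To make the Flex price list concrete, here is a minimal Python sketch of a monthly price estimate. It uses only the figures quoted on the slide. The function name and the assumption that charges add up linearly beyond the base bundle are illustrative, not taken from IBM's pricing documentation, so treat the result as a rough estimate.

```python
# Figures quoted on the plans slide (USD per month, 2018).
BASE_PRICE = 189.0            # covers 1 core, 4 GB RAM, 2 GB storage, 2 million IOs
PRICE_PER_EXTRA_CORE = 52.0   # each extra core comes with 4 GB RAM
PRICE_PER_EXTRA_GB = 1.0      # storage beyond the included 2 GB
PRICE_PER_MILLION_IOS = 0.20  # IOs beyond the included 2 million


def flex_monthly_price(cores, storage_gb, million_ios, ha=False):
    """Estimate a Flex plan monthly price (hypothetical helper).

    Assumption: charges are linear beyond the base bundle, and the HA
    price doubles everything except the IO charge, as the slide states
    that IOs are only charged on the primary server.
    """
    if not 1 <= cores <= 32:
        raise ValueError("Flex plan offers 1 to 32 virtual cores")
    if not 2 <= storage_gb <= 4000:
        raise ValueError("Flex plan offers 2 GB to 4 TB of storage")
    if million_ios < 0:
        raise ValueError("IO count cannot be negative")

    compute_and_storage = (
        BASE_PRICE
        + PRICE_PER_EXTRA_CORE * (cores - 1)
        + PRICE_PER_EXTRA_GB * (storage_gb - 2)
    )
    io_charge = PRICE_PER_MILLION_IOS * max(0, million_ios - 2)
    if ha:
        compute_and_storage *= 2
    return round(compute_and_storage + io_charge, 2)
```

For example, `flex_monthly_price(4, 100, 12)` prices a 4-core, 100 GB instance with 12 million IOs, and passing `ha=True` applies the 2x rule to everything but the IO charge.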
High Availability (HA) configurations include:
• Primary/standby pair with synchronous replication
• Automatic failover
• Single IP address for simplified client access to cluster
• Reduced maintenance time using rolling updates

Flex Plan – Independently Scale Compute & Storage
§ Compute resource (cores/memory) can be scaled both up and down
  - Outage typically less than 15 minutes (reduced outage with HA plan)
§ Compute scaling levels – # virtual cores (RAM):
  1 (4 GB), 2 (8 GB), 4 (16 GB), 8 (32 GB), 16 (64 GB), 32 (128 GB)
§ Storage can scale up only – can’t be scaled down
§ Storage scaling levels:
  - Starts with 100 GB local SSD storage, followed by addition of up to two 2 TB portable storage volumes (PSVs)
  2 GB, 10 GB, 25 GB, 50 GB, 100 GB, 250 GB, 500 GB, 1 TB, 2 TB, 4 TB
  (a PSV is added when scaling past 100 GB and again when scaling past 2 TB)
  - Growth within a volume does not require an outage; adding PSV does (reduced outage with HA Plan)

Flex Plan – Architecture
Cores and Memory (IBM Cloud VM): from 1 Core / 4 GB to 32 Core / 128 GB

Data Storage (LUKS Encrypted):
  Local SSD (100 GB – data, 200 GB Active Logs): 2 - 100 GB
  2 TB Portable Storage (added after 100 GB): 250 GB - 2 TB
  2 TB Portable Storage (added after 2000 GB): 4 TB
  (Note: 100 GB for OS and 1 TB PSV in base for OS/Metadata not shown)
Storage controlled at more granular levels by using Journaled disk quotas (2 GB – 4000 GB)
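The storage tiers above can be sketched as a small planning helper that reports whether a requested storage change adds a portable storage volume (PSV), and so is expected to need an outage. This is only an illustration of the rules on the slides. The function names are hypothetical, and it assumes 1 TB is treated as 1000 GB, matching the slide's "added after 2000 GB" and "2 GB – 4000 GB" wording.

```python
# Storage scaling levels from the slide, in GB (assumes 1 TB == 1000 GB).
STORAGE_LEVELS_GB = (2, 10, 25, 50, 100, 250, 500, 1000, 2000, 4000)
LOCAL_SSD_LIMIT_GB = 100    # up to here, data lives on the local SSD
FIRST_PSV_LIMIT_GB = 2000   # up to here, one 2 TB PSV is enough


def psv_count(storage_gb):
    """Number of portable storage volumes needed at a given level."""
    if storage_gb <= LOCAL_SSD_LIMIT_GB:
        return 0
    if storage_gb <= FIRST_PSV_LIMIT_GB:
        return 1
    return 2


def plan_storage_scaling(current_gb, target_gb):
    """Describe a storage scaling step (hypothetical helper)."""
    for level in (current_gb, target_gb):
        if level not in STORAGE_LEVELS_GB:
            raise ValueError(f"{level} GB is not a listed scaling level")
    if target_gb < current_gb:
        raise ValueError("storage can scale up only")

    psvs_added = psv_count(target_gb) - psv_count(current_gb)
    # Growth within a volume needs no outage; adding a PSV does.
    return {"psvs_added": psvs_added, "outage_expected": psvs_added > 0}
```

Under these assumptions, going from 50 GB to 100 GB stays on the local SSD, while going from 100 GB to 250 GB adds the first PSV.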
Recovery: Object Storage (Swift currently)
Backups and, for HA, archive logs and load copy data are uploaded to object storage

Scaling an Instance of the Flex Plan
Scaling an Instance of the Flex Plan (cont.)

Db2 on Cloud – High Availability Configuration
§ Reduced outage times for unplanned outages and maintenance – 99.99% uptime SLA
§ Configuration includes both a primary server plus a warm standby server
§ Log shipping keeps standby up-to-date for fast failovers
§ Simplified client configuration – floating IP address always points to current primary
§ Not logged operations that compromise data recoverability are blocked
§ Reduced maintenance times due to rolling updates

[Diagram: Db2 on Cloud High Availability Configuration, with a Primary Server and a Standby Server]
Automatic Client Reroute
  • Clients transparently connect to standby when primary fails
Integrated Cluster Management
  • Health of servers monitored
  • Automated takeover
Transaction Log Record Shipping
  • Keeps standby current

Loading Data into Db2 on Cloud is Easy!
§ Cloud-based data services include
  - Lift
  - Data Connect
  - Data Refinery (part of WDP)
§ Console of Db2 managed services include
  - Load from remote client files
  - Load from cloud object storage (IBM Cloud Swift and Amazon S3)
§ ETL tools such as DataStage
§ Remote Db2 load/import/ingest utilities
§ Data Conversion Workbench for data migration
§ Large data transfer services such as
  - Data Transfer Service
  - Mass Data Movement Service

See load section of Knowledge Center for further details on load options for Db2 cloud services: http://www-01.ibm.com/support/knowledgecenter/SS6NHC/com.ibm.swg.im.dashdb.doc/learn_how/loaddata_overview.html

Db2 on Cloud Service Offerings – Accelerating Success
Jumpstart
• 50 hours to be used within 90 days of purchase
• Services for hand holding, not doing
• Guidance on proven practices for Db2 on Cloud, “how to” focus
• Advice, guidance, assistance

Accelerator
• 50 hours to be used within the first year
• Use against scoped projects; clients can choose between
  • Pre-Scoped Project
  • Custom Scope Project
• Projects where IBM does the work

Db2 on Cloud – Service Level Agreements (SLA)
§ SLA is an agreement to compensate a customer if documented levels of uptime are not met
  - Credited against a future invoice for the service
§ Downtime is measured from the time the customer reports the event until the time the service is restored
§ As described in the Service Description document, downtime does not include time related to:
  - A scheduled or announced maintenance outage (i.e. planned outages do not count)
  - Causes beyond IBM's control
  - Problems with Client or third party content or technology, designs or instructions

High Availability Plans:
Availability during a contracted month    Compensation (% of monthly fee) towards future invoice
>= 99.99%                                 0%
>= 99% and < 99.99%                       10%
< 99%                                     25%

Non-High Availability Plans:
Availability during a contracted month    Compensation (% of monthly fee) towards future invoice
>= 99.5%                                  0%
>= 99% and < 99.5%                        10%
< 99%                                     25%
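The two compensation tables can be read as a simple tier lookup. The Python sketch below encodes them as they appear on the slide. The function names are illustrative, and the Service Description documents remain the authoritative source for how availability and credits are actually calculated.

```python
def sla_credit_percent(availability_pct, ha):
    """Return the credit (% of monthly fee) for a contracted month.

    Thresholds are copied from the slide's tables: HA plans owe no
    credit at >= 99.99% availability, non-HA plans at >= 99.5%.
    """
    if not 0 <= availability_pct <= 100:
        raise ValueError("availability must be a percentage between 0 and 100")

    no_credit_threshold = 99.99 if ha else 99.5
    if availability_pct >= no_credit_threshold:
        return 0
    if availability_pct >= 99.0:
        return 10
    return 25


def sla_credit_amount(monthly_fee, availability_pct, ha):
    """Credit towards a future invoice, in the currency of monthly_fee."""
    percent = sla_credit_percent(availability_pct, ha)
    return round(monthly_fee * percent / 100, 2)
```

Note that the same measured availability can fall into different tiers: 99.5% earns a 10% credit on an HA plan but no credit on a non-HA plan.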
§ Details can be found in the Service Description (SD) documents (always use the most recent version of the SD available):
  - Pay-as-you-go: http://www-03.ibm.com/software/sla/sladb.nsf/pdf/7519-07/$file/i126-7519-07_09-2017_en_US.pdf
  - IBM/BP sales: http://www-03.ibm.com/software/sla/sladb.nsf/pdf/7579-04/$file/i126-7579-04_06-2017_en_US.pdf

IBM Data Server Manager Natively Supports Db2 in the Cloud
IBM Data Server Manager is a critical, integrated part of the hybrid IBM Data Store value proposition. It is the console for getting up and running, for management, and for problem determination across Db2, Db2 on Cloud, Db2 Warehouse on Cloud, Db2 Warehouse and Big SQL.
IBM Data Server Manager is more than a tool for monitoring Db2. It is a set of common services and capabilities for Enterprise and Application DBAs, Data Engineers and Analysts and Developers.
It's the center of the Hybrid Enterprise.
By establishing a connection to a Db2 Warehouse on Cloud or a Db2 on Cloud database, Data Server Manager automatically reconfigures its feature set to complement the selected database type.

Data Science Experience
Integrating Analytic Components into One Desktop Environment
§ The Data Science Experience (DSX) enables data scientists to master the art of data science in a number of environments
  - On-prem, Private cloud, Public Cloud
  - Simplified single user application
§ DSX provides data scientists access to
  - Commonly used tools for data scientists such as
    - Jupyter notebooks
    - RStudio
§ Numerous open source and commercial analytics libraries
§ Constantly updated resources
§ An environment that facilitates team work and collaboration

RStudio and R Integration
Making Analytics Easy
§ RStudio makes R easier to use
  - It includes a code editor, debugging & visualization tools
  - Syntax highlighting, code completion, and smart indentation
  - Quickly jump to function definitions
§ Execute R code directly from the source editor
§ Easily manage multiple working directories using projects
§ Integrated R help and documentation
§ Interactive debugger to diagnose and fix errors quickly
§ Extensive package development tools

Data Server Manager: Core Capabilities
§ Monitoring and Alerts
  - Proactive alerts
  - Overview Dashboards and drilldown
  - Real-time and historical performance data
  - Client Monitoring
  - HADR and pureScale
§ Query Tuning
  - Advisors to identify tuning opportunities
  - Capture SQL and workloads
  - Explain query access paths
  - Improve statistics quality, improve database and query design
§ Database Administration and Use
  - Build, format, explain and run SQL statements and download as Excel
  - Perform or schedule routine admin
  - Manage privileges, maintenance and availability
  - Connect to remote data sources
  - Create and manage database objects
§ Configuration Management and History
  - Keep an inventory of database objects and a history of change
  - Registry variables, configuration settings, custom key/value settings to help with diagnosis

APIs and REST
Remote Mobile and Open access to IBM Data Stores
§ What is the problem? Mobile (Android, Windows, iOS) and other open application models need access to IBM DB environments
§ What is our solution? Robust APIs provide a broad set of capabilities. In addition, Data Server Gateway for OData supports open standards based connectivity to IBM Data Server environments

REST API
Exceptional power to automate your apps. Power to fully automate
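The scaling call that this slide lists as a `cf update-service` command can also be assembled from a script, which avoids hand-quoting the JSON. The Python sketch below only builds the command. The parameter names `ram_gb`, `email` and `scale_time` are taken from the slide's example, their exact meaning is not documented there, and the helper name and the example.com address are hypothetical.

```python
import json


def build_scale_command(service_name, ram_gb, email, scale_time=0):
    """Build the argument list for a Flex plan scaling request.

    Mirrors the slide's example:
        cf update-service <service> -c '<json parameters>'
    The parameter keys come from that example; this helper itself is
    only an illustration and does not call the cf CLI.
    """
    params = {"ram_gb": ram_gb, "email": email, "scale_time": scale_time}
    # Compact separators reproduce the slide's JSON without extra spaces.
    payload = json.dumps(params, separators=(",", ":"))
    return ["cf", "update-service", service_name, "-c", payload]


# Hypothetical usage: scale the "flextest" service to 8 GB of RAM.
command = build_scale_command("flextest", 8, "user@example.com")
```

Passing the resulting list to `subprocess.run(command, check=True)` would hand the JSON to the CLI as a single argument, so no shell quoting is needed. That step is left out here because it requires a logged-in `cf` session.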
§ REST API for SQL
§ API for Deployment
§ API for Destroy
  • Note: Applies to free and paid
§ API for complex authentication and adding/deleting users.
  • Great for multi-tenant applications.
§ API for scaling
  • cf update-service flextest -c '{"ram_gb":8,"email":"[email protected]","scale_time":0}'

Data Server Gateway for OData
Remote Mobile and Open access to IBM Data Stores
§ What is the problem? Mobile (Android, Windows, iOS) and other open application models need DBA access to IBM DB environments
§ What is our solution? Data Server Gateway for OData supports open standards based connectivity to IBM Data Server environments
§ Why OData? OData (Open Data Protocol) is an ISO/IEC approved, OASIS standard that defines a set of best practices for building and consuming RESTful APIs
§ What’s included?
  • Publish OData V4 REST API endpoints/services for selected database tables
  • Support for most data types
  • Support for CRUD operations
  • Get, Post, Put (update a full row), Patch (update selected columns), Delete
  • JSON and XML payloads
  • API persistence
  • Authentication and Authorization
  • SSL Support
[Diagram labels: Application including Mobile or other Open Model; OData Gateway*]
* DS Driver supports OData Gateway access to IBM Data Stores

IBM Data Studio
An integrated, modular environment for DB development and administration
ibm.biz/IBMDataStudio
§ Supports IBM Db2, Db2 Warehouse, Db2 for z/OS, Db2 for i, Informix and Big SQL
§ Enables developers and administrators to create and manage heterogeneous database environments for increased productivity.
§ Streamline database development with advanced query validation, object management, procedure development, deploy and debug.
§ Improved collaboration through the open source integrated Eclipse environment
§ Support for popular source code management and team collaboration tools like Git
§ pureQuery integration for application acceleration
§ Data Studio available at no charge
§ Support entitlement provided through Db2

Deep dive tutorials
Using Python and DSX with Db2

Making Magic with Jupyter
• Db2 Magic Commands
• Secure and persistent database connections
• One command from SQL to DataFrame
• SQL Formatting in Cell
• Graphing