D A T 2 0 2 - R What's new in Aurora

Tony Petrossian GM Amazon Aurora

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Aurora Enterprise at open source price

Delivered as a managed service

Drop-in compatibility with MySQL and PostgreSQL

Simplicity and cost-effectiveness of open-source

Throughput and availability of commercial databases Amazon Aurora Simple pay-as-you-go pricing 4

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Database layers

SQL

Transactions Multiple layers Caching of processing all in a single Logging & Storage engine Aurora decouples storage and query processing

SQL Database Transactions node

Caching Amazon Aurora Storage Processing

Shared storage volume

nodes Storage Storage Scale-out, distributed storage processing architecture

Purpose-built log-structured distributed Availability Zone 1 Availability Zone 2 Availability Zone 3 storage system designed for databases SQL SQL SQL Storage volume is striped across hundreds Transactions Transactions Transactions

of storage nodes distributed over 3 Caching Caching Caching Instancenodes different Availability Zones

Six copies of data, two copies in each Availability Zone to protect against AZ+1 Shared storage volume failures

Data is written in 10 GB “protection nodes Storage groups”, growing automatically when needed 8

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora: Why re-imagine the RDBMS

Customers Aurora: Why re-imagine the RDBMS Aurora: Why re-imagine the RDBMS

Applications operating 24x7 and scaling to unimaginable levels © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora distributed storage provides: Redo log processing Instant crash recovery Fault-tolerant and self-healing storage Fast database cloning

Amazon Aurora Database backtrack Database snapshots Delivered as Continuous backups and point-in-time restore a managed Storage automatic scaling independent of compute service Read and write scalability Warm cache on database restart Low latency replication Aurora distributed storage provides:

Redo log processing

Instant crash recovery

Fault-tolerant and self-healing storage

Fast database cloning Amazon aurora Database backtrack

Database snapshots

Continuous backups and point-in-time restore transactions SQL Storage Auto Scaling independent of compute

Read and write scalability

Warm cache on database restart

Low latency replication Aurora

MySQL and PostgreSQL-compatible relational database built for the cloud Performance and availability of commercial-grade databases at 1/10th the cost

Performance Availability Highly secure Fully managed and scalability and durability

5x throughput of standard Fault-tolerant, self-healing Network isolation, Managed by Amazon RDS: MySQL and 3x of standard storage; six copies of data encryption at No server provisioning, software PostgreSQL; scale-out up to across three Availability Zones; rest/transit, compliance patching, setup, configuration, 15 read replicas continuous backup to and assurance programs or backups 16

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora Global Database Faster DR and enhanced data locality

Promote remote readers to a master for faster cross-region disaster recovery (DR)

Bring data close to your customer’s applications in different Regions

Subsecond data replication cross-Region Aurora Global Database Faster DR and enhanced data locality Northern Virginia (Secondary Region)

Oregon Ohio R R (Secondary Region) (Primary Region)

M R R Storage Inbound Inbound replication R R Storage

Ireland Storage

Outbound replication (Secondary Region) Inbound Inbound replication

High throughput: Up to 200K writes/sec R R Low replica lag: < 1-sec cross-Region lag

Fast recovery: < 1-min. downtime after Region unavailability Storage Inbound Inbound replication Support for multiple secondary Regions Support for in-place conversion to Global Database Fast cross-account database cloning Analytics

Dev/test Create a copy of a database without applications duplicate storage costs • Creation of a clone is nearly instantaneous— Clone we don’t copy data Production • Data copy happens only on write—when applications original and cloned volume data differ Clone

Typical use cases • Clone a production DB to run tests • Reorganize a database • Run analytics workloads • Save a point in time snapshot for analysis without impacting production system Production database

https://aws.amazon.com/blogs/aws/amazon-aurora-fast-database-cloning/ Aurora Serverless for PostgreSQL and MySQL

Application

Request routers

Scalable DB capacity Warm pool of instances

DB storage Challenges with integrating machine learning (ML) with your database

Select and train the model

Create application code to read data from the database

Query and format the data for the ML algorithm

Call an ML service to run the algorithm

Format the output Amazon Aurora ML Simple, optimized, and secure Aurora, Amazon SageMaker, and Amazon Comprehend (in preview) integration

ML predictions Integration with Familiar SQL Low-latency, Security & on relational data Amazon language, no real time governance SageMaker & ML expertise Amazon Comprehend Aurora optimized ML query processing

Select * from user_feedback where aws_comprehend.detect _sentiment(review_text, ‘EN’)' = ‘POSITIVE'"

user_feedback ID Feedback 1 Great product! Good job Mediocre I didn’t like it Loved it Terrible service 50 Great service Amazon RDS Proxy (preview) Fully managed, highly available database proxy for Amazon RDS

Supports a large number of application connections Applications

RDS Database Deployed across multiple AZs and fails over without Instance losing a connection RDS Proxy Integrates with AWS Secrets Manager and IAM

Get started with a few clicks in the console

Connection Pooling Preview: Aurora MySQL and RDS MySQL Coming soon: Aurora PostgreSQL and RDS PostgreSQL What is Performance Insights

• Analyze and tune Database Performance • Database Load is determined by Average Active • Available through AWS Management Console Sessions (AAS) and AWS API SDK • Categorized data by Wait Events, SQL, Hosts, • Set up alarms for key issues and Users • SQL statistics for queries new! SQL statistics in Aurora Database Activity Streams DAS architecture

AWS Cloud

Amazon Kinesis Users Partner Database Alerts Amazon RDS Security Apps Database Instance /Replica

Amazon Kinesis Data Firehose Aurora read scaling options

15 promotable read replicas per cluster

Application Application Application BI/reporting Auto scaling to automatically add servers servers servers application and remove replicas server

Physical replication across Regions Read/write Read only (Aurora Global Database)

Database Read Logical (binlog) replication to any Primary replica server Asynchronous MySQL database replication Aurora read replicas

Read/write endpoint Read endpoint

Read Master replica

Shared distributed storage volume Aurora MySQL multi-master

Read-write end-point Read-write end-point

Master Master

Shared distributed storage volume

Continuous Availability with Multi-Master

- https://aws.amazon.com/blogs/database/building-highly-available--applications- using-amazon-aurora-mmsr/ Federated Query for Amazon Athena (preview) Run SQL queries on data spanning multiple data stores

Run SQL queries on relational, non-relational, object, or custom data sources; in the cloud or on premises

Open-source connectors for common data sources Amazon S3/ S3 Glacier

Amazon Redshift Build connectors to custom data sources

Amazon ElastiCache Run connectors in AWS Lambda: No servers to manage Amazon Aurora

Amazon DynamoDB

Amazon DocumentDB federated query (preview)

Queries on RDS and Aurora PostgreSQL databases

JDBC/ODBC Analytics on live data without data movement

Unified analytics across data warehouse, data lake, and operational databases

Flexible and easy way to ingest data

Performant and secure access to data Related breakouts

DAT309 Amazon Aurora storage demystified: How it all works DAT321 Deep dive on Amazon Aurora with MySQL compatibility DAT328 Deep dive on Amazon Aurora with PostgreSQL compatibility DAT350 Migrating open-source databases from Amazon EC2 to Amazon Aurora PostgreSQL DAT207-R What’s new in Amazon RDS DAT382 Amazon Aurora Multi-Master: Scaling out database write performance DAT404 Amazon Aurora Multi-Master: Scaling out database write performance Learn databases with AWS Training and Certification Resources created by the experts at AWS to help you build and validate database skills

25+ free digital training courses cover topics and services related to databases, including: • Amazon Aurora • • Amazon DocumentDB • Amazon DynamoDB • Amazon ElastiCache • Amazon Redshift • Amazon RDS Validate expertise with the new AWS Certified Database - Specialty beta exam

Visit aws.training

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank you!

Tony Petrossian [email protected]

© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.