D A T 2 0 2 - R What's new in Amazon Aurora
Tony Petrossian GM Amazon Aurora Amazon Web Services
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Aurora Enterprise database at open source price
Delivered as a managed service
Drop-in compatibility with MySQL and PostgreSQL
Simplicity and cost-effectiveness of open-source databases
Throughput and availability of commercial databases Amazon Aurora Simple pay-as-you-go pricing 4
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Database layers
SQL
Transactions Multiple layers Caching of processing all in a single Logging & Storage engine Aurora decouples storage and query processing
SQL Database Transactions node
Caching Amazon Aurora Storage Processing
Shared storage volume
nodes Storage Storage Scale-out, distributed storage processing architecture
Purpose-built log-structured distributed Availability Zone 1 Availability Zone 2 Availability Zone 3 storage system designed for databases SQL SQL SQL Storage volume is striped across hundreds Transactions Transactions Transactions
of storage nodes distributed over 3 Caching Caching Caching Instancenodes different Availability Zones
Six copies of data, two copies in each Availability Zone to protect against AZ+1 Shared storage volume failures
Data is written in 10 GB “protection nodes Storage groups”, growing automatically when needed 8
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora: Why re-imagine the RDBMS
Customers Aurora: Why re-imagine the RDBMS Aurora: Why re-imagine the RDBMS
Applications operating 24x7 and scaling to unimaginable levels © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora distributed storage provides: Redo log processing Instant crash recovery Fault-tolerant and self-healing storage Fast database cloning
Amazon Aurora Database backtrack Database snapshots Delivered as Continuous backups and point-in-time restore a managed Storage automatic scaling independent of compute service Read and write scalability Warm cache on database restart Low latency replication Aurora distributed storage provides:
Redo log processing
Instant crash recovery
Fault-tolerant and self-healing storage
Fast database cloning Amazon aurora Database backtrack
Database snapshots
Continuous backups and point-in-time restore transactions SQL Storage Auto Scaling independent of compute
Read and write scalability
Warm cache on database restart
Low latency replication Aurora
MySQL and PostgreSQL-compatible relational database built for the cloud Performance and availability of commercial-grade databases at 1/10th the cost
Performance Availability Highly secure Fully managed and scalability and durability
5x throughput of standard Fault-tolerant, self-healing Network isolation, Managed by Amazon RDS: MySQL and 3x of standard storage; six copies of data encryption at No server provisioning, software PostgreSQL; scale-out up to across three Availability Zones; rest/transit, compliance patching, setup, configuration, 15 read replicas continuous backup to Amazon S3 and assurance programs or backups 16
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora Global Database Faster DR and enhanced data locality
Promote remote readers to a master for faster cross-region disaster recovery (DR)
Bring data close to your customer’s applications in different Regions
Subsecond data replication cross-Region Aurora Global Database Faster DR and enhanced data locality Northern Virginia (Secondary Region)
Oregon Ohio R R (Secondary Region) (Primary Region)
M R R Storage Inbound Inbound replication R R Storage
Ireland Storage
Outbound replication (Secondary Region) Inbound Inbound replication
High throughput: Up to 200K writes/sec R R Low replica lag: < 1-sec cross-Region lag
Fast recovery: < 1-min. downtime after Region unavailability Storage Inbound Inbound replication Support for multiple secondary Regions Support for in-place conversion to Global Database Fast cross-account database cloning Analytics
Dev/test Create a copy of a database without applications duplicate storage costs • Creation of a clone is nearly instantaneous— Clone we don’t copy data Production • Data copy happens only on write—when applications original and cloned volume data differ Clone
Typical use cases • Clone a production DB to run tests • Reorganize a database • Run analytics workloads • Save a point in time snapshot for analysis without impacting production system Production database
https://aws.amazon.com/blogs/aws/amazon-aurora-fast-database-cloning/ Aurora Serverless for PostgreSQL and MySQL
Application
Request routers
Scalable DB capacity Warm pool of instances
DB storage Challenges with integrating machine learning (ML) with your database
Select and train the model
Create application code to read data from the database
Query and format the data for the ML algorithm
Call an ML service to run the algorithm
Format the output Amazon Aurora ML Simple, optimized, and secure Aurora, Amazon SageMaker, and Amazon Comprehend (in preview) integration
ML predictions Integration with Familiar SQL Low-latency, Security & on relational data Amazon language, no real time governance SageMaker & ML expertise Amazon Comprehend Aurora optimized ML query processing
Select * from user_feedback where aws_comprehend.detect _sentiment(review_text, ‘EN’)' = ‘POSITIVE'"
user_feedback ID Feedback 1 Great product! Good job Mediocre I didn’t like it Loved it Terrible service 50 Great service Amazon RDS Proxy (preview) Fully managed, highly available database proxy for Amazon RDS
Supports a large number of application connections Applications
RDS Database Deployed across multiple AZs and fails over without Instance losing a connection RDS Proxy Integrates with AWS Secrets Manager and IAM
Get started with a few clicks in the console
Connection Pooling Preview: Aurora MySQL and RDS MySQL Coming soon: Aurora PostgreSQL and RDS PostgreSQL What is Performance Insights
• Analyze and tune Database Performance • Database Load is determined by Average Active • Available through AWS Management Console Sessions (AAS) and AWS API SDK • Categorized data by Wait Events, SQL, Hosts, • Set up alarms for key issues and Users • SQL statistics for queries new! SQL statistics in Aurora Database Activity Streams DAS architecture
AWS Cloud
Amazon Kinesis Users Partner Database Alerts Amazon RDS Security Apps Database Instance /Replica
Amazon Kinesis Data Firehose Aurora read scaling options
15 promotable read replicas per cluster
Application Application Application BI/reporting Auto scaling to automatically add servers servers servers application and remove replicas server
Physical replication across Regions Read/write Read only (Aurora Global Database)
Database Read Logical (binlog) replication to any Primary replica server Asynchronous MySQL database replication Aurora read replicas
Read/write endpoint Read endpoint
Read Master replica
Shared distributed storage volume Aurora MySQL multi-master
Read-write end-point Read-write end-point
Master Master
Shared distributed storage volume
Continuous Availability with Multi-Master
- https://aws.amazon.com/blogs/database/building-highly-available-mysql-applications- using-amazon-aurora-mmsr/ Federated Query for Amazon Athena (preview) Run SQL queries on data spanning multiple data stores
Run SQL queries on relational, non-relational, object, or custom data sources; in the cloud or on premises
Open-source connectors for common data sources Amazon S3/ S3 Glacier
Amazon Redshift Build connectors to custom data sources
Amazon ElastiCache Run connectors in AWS Lambda: No servers to manage Amazon Aurora
Amazon DynamoDB
Amazon DocumentDB Amazon Redshift federated query (preview)
Queries on RDS and Aurora PostgreSQL databases
JDBC/ODBC Analytics on live data without data movement
Unified analytics across data warehouse, data lake, and operational databases
Flexible and easy way to ingest data
Performant and secure access to data Related breakouts
DAT309 Amazon Aurora storage demystified: How it all works DAT321 Deep dive on Amazon Aurora with MySQL compatibility DAT328 Deep dive on Amazon Aurora with PostgreSQL compatibility DAT350 Migrating open-source databases from Amazon EC2 to Amazon Aurora PostgreSQL DAT207-R What’s new in Amazon RDS DAT382 Amazon Aurora Multi-Master: Scaling out database write performance DAT404 Amazon Aurora Multi-Master: Scaling out database write performance Learn databases with AWS Training and Certification Resources created by the experts at AWS to help you build and validate database skills
25+ free digital training courses cover topics and services related to databases, including: • Amazon Aurora • Amazon Neptune • Amazon DocumentDB • Amazon DynamoDB • Amazon ElastiCache • Amazon Redshift • Amazon RDS Validate expertise with the new AWS Certified Database - Specialty beta exam
Visit aws.training
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank you!
Tony Petrossian [email protected]
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.