Amazon Aurora
Total Page:16
File Type:pdf, Size:1020Kb
D A T 2 0 2 - R What's new in Amazon Aurora Tony Petrossian GM Amazon Aurora Amazon Web Services © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Aurora Enterprise database at open source price Delivered as a managed service Drop-in compatibility with MySQL and PostgreSQL Simplicity and cost-effectiveness of open-source databases Throughput and availability of commercial databases Amazon Aurora Simple pay-as-you-go pricing 4 © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Database layers SQL Transactions Multiple layers Caching of processing all in a single Logging & Storage engine Aurora decouples storage and query processing SQL Database Transactions node Caching Amazon Aurora Storage Processing Shared storage volume nodes Storage Storage Scale-out, distributed storage processing architecture Purpose-built log-structured distributed Availability Zone 1 Availability Zone 2 Availability Zone 3 storage system designed for databases SQL SQL SQL Storage volume is striped across hundreds Transactions Transactions Transactions of storage nodes distributed over 3 Caching Caching Caching Instancenodes different Availability Zones Six copies of data, two copies in each Availability Zone to protect against AZ+1 Shared storage volume failures Data is written in 10 GB “protection nodes Storage groups”, growing automatically when needed 8 © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora: Why re-imagine the RDBMS Customers Aurora: Why re-imagine the RDBMS Aurora: Why re-imagine the RDBMS Applications operating 24x7 and scaling to unimaginable levels © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora distributed storage provides: Redo log processing Instant crash recovery Fault-tolerant and self-healing storage Fast database cloning Amazon Aurora Database backtrack Database snapshots Delivered as Continuous backups and point-in-time restore a managed Storage automatic scaling independent of compute service Read and write scalability Warm cache on database restart Low latency replication Aurora distributed storage provides: Redo log processing Instant crash recovery Fault-tolerant and self-healing storage Fast database cloning Amazon aurora Database backtrack Database snapshots Continuous backups and point-in-time restore transactions SQL Storage Auto Scaling independent of compute Read and write scalability Warm cache on database restart Low latency replication Aurora MySQL and PostgreSQL-compatible relational database built for the cloud Performance and availability of commercial-grade databases at 1/10th the cost Performance Availability Highly secure Fully managed and scalability and durability 5x throughput of standard Fault-tolerant, self-healing Network isolation, Managed by Amazon RDS: MySQL and 3x of standard storage; six copies of data encryption at No server provisioning, software PostgreSQL; scale-out up to across three Availability Zones; rest/transit, compliance patching, setup, configuration, 15 read replicas continuous backup to Amazon S3 and assurance programs or backups 16 © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Aurora Global Database Faster DR and enhanced data locality Promote remote readers to a master for faster cross-region disaster recovery (DR) Bring data close to your customer’s applications in different Regions Subsecond data replication cross-Region Aurora Global Database Faster DR and enhanced data locality Northern Virginia (Secondary Region) Oregon Ohio R R (Secondary Region) (Primary Region) M R R Storage Inbound Inbound replication R R Storage Ireland Storage Outbound replication (Secondary Region) Inbound Inbound replication High throughput: Up to 200K writes/sec R R Low replica lag: < 1-sec cross-Region lag Fast recovery: < 1-min. downtime after Region unavailability Storage Inbound Inbound replication Support for multiple secondary Regions Support for in-place conversion to Global Database Fast cross-account database cloning Analytics Dev/test Create a copy of a database without applications duplicate storage costs • Creation of a clone is nearly instantaneous— Clone we don’t copy data Production • Data copy happens only on write—when applications original and cloned volume data differ Clone Typical use cases • Clone a production DB to run tests • Reorganize a database • Run analytics workloads • Save a point in time snapshot for analysis without impacting production system Production database https://aws.amazon.com/blogs/aws/amazon-aurora-fast-database-cloning/ Aurora Serverless for PostgreSQL and MySQL Application Request routers Scalable DB capacity Warm pool of instances DB storage Challenges with integrating machine learning (ML) with your database Select and train the model Create application code to read data from the database Query and format the data for the ML algorithm Call an ML service to run the algorithm Format the output Amazon Aurora ML Simple, optimized, and secure Aurora, Amazon SageMaker, and Amazon Comprehend (in preview) integration ML predictions Integration with Familiar SQL Low-latency, Security & on relational data Amazon language, no real time governance SageMaker & ML expertise Amazon Comprehend Aurora optimized ML query processing Select * from user_feedback where aws_comprehend.detect _sentiment(review_text, ‘EN’)' = ‘POSITIVE'" user_feedback ID Feedback 1 Great product! Good job Mediocre I didn’t like it Loved it Terrible service 50 Great service Amazon RDS Proxy (preview) Fully managed, highly available database proxy for Amazon RDS Supports a large number of application connections Applications RDS Database Deployed across multiple AZs and fails over without Instance losing a connection RDS Proxy Integrates with AWS Secrets Manager and IAM Get started with a few clicks in the console Connection Pooling Preview: Aurora MySQL and RDS MySQL Coming soon: Aurora PostgreSQL and RDS PostgreSQL What is Performance Insights • Analyze and tune Database Performance • Database Load is determined by Average Active • Available through AWS Management Console Sessions (AAS) and AWS API SDK • Categorized data by Wait Events, SQL, Hosts, • Set up alarms for key issues and Users • SQL statistics for queries new! SQL statistics in Aurora Database Activity Streams DAS architecture AWS Cloud Amazon Kinesis Users Partner Database Alerts Amazon RDS Security Apps Database Instance /Replica Amazon Kinesis Data Firehose Aurora read scaling options 15 promotable read replicas per cluster Application Application Application BI/reporting Auto scaling to automatically add servers servers servers application and remove replicas server Physical replication across Regions Read/write Read only (Aurora Global Database) Database Read Logical (binlog) replication to any Primary replica server Asynchronous MySQL database replication Aurora read replicas Read/write endpoint Read endpoint Read Master replica Shared distributed storage volume Aurora MySQL multi-master Read-write end-point Read-write end-point Master Master Shared distributed storage volume Continuous Availability with Multi-Master - https://aws.amazon.com/blogs/database/building-highly-available-mysql-applications- using-amazon-aurora-mmsr/ Federated Query for Amazon Athena (preview) Run SQL queries on data spanning multiple data stores Run SQL queries on relational, non-relational, object, or custom data sources; in the cloud or on premises Open-source connectors for common data sources Amazon S3/ S3 Glacier Amazon Redshift Build connectors to custom data sources Amazon ElastiCache Run connectors in AWS Lambda: No servers to manage Amazon Aurora Amazon DynamoDB Amazon DocumentDB Amazon Redshift federated query (preview) Queries on RDS and Aurora PostgreSQL databases JDBC/ODBC Analytics on live data without data movement Unified analytics across data warehouse, data lake, and operational databases Flexible and easy way to ingest data Performant and secure access to data Related breakouts DAT309 Amazon Aurora storage demystified: How it all works DAT321 Deep dive on Amazon Aurora with MySQL compatibility DAT328 Deep dive on Amazon Aurora with PostgreSQL compatibility DAT350 Migrating open-source databases from Amazon EC2 to Amazon Aurora PostgreSQL DAT207-R What’s new in Amazon RDS DAT382 Amazon Aurora Multi-Master: Scaling out database write performance DAT404 Amazon Aurora Multi-Master: Scaling out database write performance Learn databases with AWS Training and Certification Resources created by the experts at AWS to help you build and validate database skills 25+ free digital training courses cover topics and services related to databases, including: • Amazon Aurora • Amazon Neptune • Amazon DocumentDB • Amazon DynamoDB • Amazon ElastiCache • Amazon Redshift • Amazon RDS Validate expertise with the new AWS Certified Database - Specialty beta exam Visit aws.training © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank you! Tony Petrossian [email protected] © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved..