AWS Cloud Storage Services
Ian Perez Ponce Sr. Business Development Manager, Amazon Web Services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Learning objectives
ü Establish a foundational understanding of core AWS storage services – including Amazon Simple Storage Service (Amazon S3), Amazon Glacier, Amazon Elastic Block Store (Amazon EBS), and Amazon Elastic File System (Amazon EFS)
ü Learn how data transfer services such as AWS Snowball, Snowball Edge, and AWS Snowmobile, plus hybrid storage solutions such as AWS Storage Gateway can simplify large scale data ingest and egress
ü Hear of new storage features and offerings along with leading use case examples from a mix of customers
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What’s driving storage relevance?
Data Trust Frameworks
Natural Information Language Assets Processing
Internet Of Things
Artificial Intelligence
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. IN SHORT…
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EVERYTHING.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS cloud storage is core
Gives you unique scale... Yielding bigger insights... Greatest reliability Most big data & data lake Broad security and compliance deployments Diverse portfolio Most managed databases Fastest innovation Easiest data warehousing Singular query-in-place analytics
Building on or migrating Data matters at Helping you innovate an application to AWS… any scale faster... Advanced developer tools Artificial Intelligence & Experienced consulting and support Machine Learning Methodical migration services IoT The most data movement services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Complete set of building blocks
Data movement Data security
Hybrid Storage and management
Data Discovery and Streaming Data Protection Data Visualization File Sync Block File Serverless Computing Storage Storage WAN Acceleration Automation
Private Networks Audit Trails
Monitoring and Metrics 3rd Party Applications Object Archival Storage Storage Access Controls
Physical Appliances Encryption
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. The broadest range of storage services
Data movement Data security
AWS Storage Gateways and management
Amazon Kinesis Firehose Amazon Macie
AWS QuickSight EFS File Sync Amazon Amazon AWS Lambda EBS EFS Online S3 Transfer Acceleration AWS CloudFormation
AWS Direct Connect AWS CloudTrail
AWS CloudWatch 3rd Party Applications Amazon Amazon S3 Glacier AWS IAM
AWS Snow Family AWS KMS Offline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS storage customers
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What are customers building?
Backup and Active Data Lakes Compliance Databases & Enterprise Restore Archive & IoT Analytics Applications
Non-disruptive Media workflows 400% faster Industry Bespoke database Integrated with queries certifications lift-and-shift major vendors Easy place to start Tape replacement Built for Lockable with audit Tailored Hadoop Fully managed Integrated with all Public Sector, streaming data trails workloads infrastructure major vendors FinServ, Healthcare/Life Data visualization Secure Lift-and-shift Sciences migrations
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Where can we help you start?
Backup and Active Data Lakes Compliance Databases & Enterprise Restore Archive & IoT Analytics Applications
S3 & Glacier S3 and the S3, Athena and S3 EC2 and EBS EFS S-IA tier Redshift Spectrum Storage Gateway Glacier and EMR, Redshift EBS Glacier (with Bulk EFS the Vault Lock Snow family and Expedited feature retrieval tiers) Storage Gateway EFS (hybrid)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Object Storage
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon S3 by the numbers
One of first three AWS Services 52 Availability Zones 18 Regions (2006) (12 more coming in 2018/19) (4 more coming in 2018/19)
99.999999999% Millions of requests Trillions of Durability per second objects
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Object storage classes
S3 Standard - S3 One Zone - S3 Standard Glacier Infrequent Access Infrequent Access
Active data 30 day min duration 30 day min duration Archive data Millisecond access Millisecond access Millisecond access Minutes to Hours Min 3 AZs Min 3 AZs Min 1 AZ Min 3 AZs $0.023 $0.0125 $0.01 $0.004
Automated Lifecycle Policies
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Pricing is per GB per month in the US East (N. Virginia) region How do AWS object storage classes differ in design?
S3 Standard S3 Standard-IA S3 One Zone-IA Glacier
Availability Zone
Availability Zone
Availability Zone Availability Zone AWS Region AWS Region
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Understanding Durability
Standard IA Glacier
Two copies on one site Copies on two sites AWS Region
designed for designed for designed for 99.99% 99.999% 99.999999999% durability durability durability
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Value of Amazon S3 and Glacier
Durable, Available, & Scalable Security & Compliance Query In Place
Flexible Management Ecosystem
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Storage management made simple
Cross-Region Event Lifecycle Policies Object Tags Replication Notifications
Amazon CloudWatch Amazon S3 Storage Class AWS CloudTrail Request Metrics Inventory Analysis Data Events
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon S3 Security, Encryption & Compliance The broadest set of tools in the industry
Security Encryption Compliance • IAM and Bucket Policies • Encryption in transit with TLS • PCI-DSS • Access Control Lists • SSE-S3 – Amazon S3 manages • HIPAA/HITECH • Audit logging with CloudTrail data & keys • FedRAMP & Alerts with CloudWatch • SSE-C – Customer managed keys • FISMA • Secure CloudFormation • SSE-KMS – Master keys in KMS • EU Data Protection templates • CSE – 100% Customer managed Directive • Amazon Macie • Default Bucket Encryption • S3 Console Permission Checks • Encryption Status in Inventory
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Cross-region Replication Automatically replicate data to any other AWS Region
• Replicate by object, bucket, or prefix • Support for SSE-KMS encrypted objects • Ownership overwrite ü Change the object owner in the destination region
Region A Region B Cross-region connectivity
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Introducing S3 and Glacier Select for query-in-place
Issue standard SQL queries against objects and archives (in-place) You no longer have to retrieve entire object
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Do more with your data in place
Data Lake Machine Learning IoT Storage Storage & AI Storage
• Athena • AWS IoT • Rekognition • Redshift Spectrum • Greengrass • LEX • QuickSight • Other IoT sensors • Polly • EMR • MXNet & TensorFlow
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Lakes
Security-as-a-Service for 4000 customers using 25PB and growing 110% per year “AWS storage is fully redundant, multi-region, Co-lo simply not agile enough or cost effective more secure, and faster at less than half the cost.” • Built an S3 data lake and avoided $1.6M CAPEX - in the first year alone • Stress-tested 100x larger load with zero CAPEX - Paul Fisher Technical fellow • 4x better “I/O per $” ratio • Gained new insights into their customers through S3 data management capabilities
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Active Archive
“We have 20 petabytes of content on On-premises infrastructure took weeks to AWS, the equivalent of more than produce customer content 800,000 hours of video, available on Needed performant, secure, our platform. We can only move all economical media distribution solution that content around the world with the scalability we’re getting on the AWS • Amazon S3 and Glacier delivered $5.4M Cloud. “ improved TCO over on-premises
• - Andy Shenkler Content delivery improved to < 1 hour Chief Solutions and Technology Officer • Workflow pipelines now highly parallel and elastic
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Regulatory Compliance
Controls and enforces over 1M daily social media “What would it have cost us to build posts for corporate customers a WORM data store, Required this unregulated social media get it certified for SEC Rule 17(a)-4(f) content in their email archive offering and CFTC Rule 1.31 (b)-(c), and then scale it?” • Built fully compliant archive/purge workflows using Amazon S3 and Glacier
- Rich Sutton • Created a compliant two-step legal hold with vault-level tags and Glacier Vault Lock VP of Engineering
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Block Storage
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Elastic Block Store
Persistent Reliable Scalable
Highly available Built-in backup Fully elastic: (99.999%) options (snapshots) expand or change multi-AZ design on the fly Provisioned IOPS Move between EC2 Optimize based on instances latency, throughput or cost
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EC2 instance store i2/i3 d2
EBS SSD-backed volumes gp2 io1 AWS block storage offerings EBS HDD-backed volumes st1 sc1
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What is Amazon EC2 instance store?
EC2 instances • Local to instance
• Non-persistent data store
• Data not replicated (by default) Instance Store • No snapshot support or • SSD or HDD
Physical Host
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EBS volume types: General Purpose SSD
Baseline: 100 to 10,000 IOPS; 3 IOPS per GiB
Burst: 3,000 IOPS (for volumes up to 1,000 GiB)
Throughput: Up to 160 MiB/s
gp2 Latency: Single-digit ms General Purpose SSD Capacity: 1 GiB to 16 TiB
Great for boot volumes, low-latency applications, and bursty databases
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EBS volume types: Provisioned IOPS
Baseline: 100 to 32,000 IOPS
Throughput: Up to 500 MiB/s
Latency: Single-digit ms
io1 Capacity: 4 GiB to 16 TiB Provisioned IOPS
Ideal for critical applications and databases with sustained IOPS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EBS volume types: Throughput Provisioned
Baseline: 40 MiB/s per TiB up to 500 MiB/s
Burst: 250 MiB/s per TiB up to 500 MiB/s
Capacity: 500 GiB to 16 TiB st1 Throughput Optimized HDD
Ideal for large-block, high-throughput sequential workloads
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EBS volume types: Throughput Provisioned
Baseline: 12 MiB/s per TB up to 192 MiB/s
Burst: 80 MiB/s per TB up to 250 MiB/s
Capacity: 500 GiB to 16 TiB sc1 Cold HDD
Ideal for sequential throughput workloads, such as logging and backup
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Choosing an EBS volume type
What is more important to your workload?
or IOPS Throughput?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Choosing an EBS volume type
IOPS Throughput is more important is more important > 80,000 ≤ 80,000 Small, random I/O Large, sequential I/O
Latency? Aggregate throughput? < 1 ms Single-digit ms ≤ 1,750 MB/s > 1,750 MB/s
Which is more important? Which is more important? Cost Performance Cost Performance
i3 d2
gp2 io1 sc1 st1 © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Elastic Volumes
• Simple; Flexible; Non-disruptive; Automated • Modify the configuration of live volumes attached to instances • Dynamically increase size, tune performance, and change the type of existing and new current generation volumes • No downtime, no performance impact. • You can automate changes using CloudWatch with Lambda or CloudFormation • No need to plan ahead, provision what you need today and change the configuration as business needs change.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EBS snapshots
EBS Replica volume
Availability Availability Zone Zone
Amazon EBS snapshot S3
• Point-in-time backup of modified volume blocks • Stored in S3, accessed via EBS APIs • Subsequent snapshots are incremental • Deleting snapshot will only remove data exclusive to that snapshot
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Database and Analytics
Threat analysis company ingesting “AWS storage completely changed our and analyzing 50TB daily business operations, time to market and manpower. EBS volumes cut our cluster Right-sizing clusters cost weeks and lost data indexing times from weeks to hours. Moving data into S3 saved us 95% and our data lake • Saved 95% re-architecting to a “hot” index on now outperforms our clusters—the harder we EBS with an analytics data lake on S3 push it the faster it gets for extremely large datasets. We simply could not do this • EBS shortened indexing times from weeks to anywhere else.” hours while cutting OPEX • Now getting consistent 1-3s search response - Gene Stevens times across 5PB of growing data in S3 CTO and co-founder • Managing 1 billion S3 objects and 2,500 instances with just six engineers
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon File Storage
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Elastic File System
Simple Elastic Scalable
Fully managed file Scale up and down Consistent, scalable storage (NFS) automatically performance Seamless Lowest TCO Highly available integration and durable Secure
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Designed for a wide spectrum of needs
Analytics Web serving Dev tooling Media workflows Content management Home directories Database backups Container storage Enterprise apps and messaging Scale-out jobs Metadata-intensive jobs
High throughput and parallel I/O Low latency and serial I/O
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Before EFS… costs of DIY file storage
Amazon EBS File server volume costs a - Storage AZ Clients volumes Inter-AZ data Amazon EC2 File server transfer costs b - instance costs
AZ Storage volumes
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon EFS architecture a - AZ NFS clients Mount target b - AZ NFS clients Mount target DNS Name DNS
c Amazon EFS - File System AZ NFS clients Mount target
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Security model and features
Control Control file and Control administrative Encrypt network traffic directory access access (API access) data at rest
using Amazon VPC security using POSIX using AWS IAM using keys groups and network ACLs permissions (action-level and managed in resource-level AWS KMS permissions)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Access EFS from multiple sources
Amazon EC2
Corporate data center Amazon EFS
VMware Cloud on AWS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon EFS economics
No minimum commitments No need to provision No other fees, charges, or up-front fees storage in advance or billing dimensions
$0.30/GB-Month (US Regions) $0.33/GB-Month (EU Ireland) $0.36/GB-Month (EU Frankfurt) $0.36/GB-Month (AP Sydney)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EFS Customers
Database backups Media and entertainment workflows Enterprise applications
The picture can't be displayed. pict ure can' t be The picture can't be displayed. disp laye d.
Developer tools Home directories Container storage
Web serving + content management Big data and analytics
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Enterprise applications
Newly acquired streaming media product “Good, fast, and cheap. We picked two depended on a local file server and got all three with EFS. It gave us the Had to launch at global scale in 90 days – with agility to deliver a new product on minimal changes schedule, eliminated scale and performance concerns, and operates • DIY was too complex and took too long below our OPEX expectations.” • Lift-and-shift to EFS took 2 hours • EFS with EC2 auto-scaling met global scale - Chris DeAcosta agility needs Sr. director software engineering • Seamless integration between partner application and existing AWS systems • Post-mortem TCO analysis showed that EFS was still the best choice
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Migration and Hybrid Storage Tiering
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Moving your data into, out of and across the platform
Networks Roads Hybrid
AWS Amazon S3 Amazon EFS Amazon AWS Snow APN AWS Direct Transfer File Kinesis family competency Storage Connect Acceleration Sync Firehose (Snowball, Snowball partners Gateways Edge, Snowmobile)
A private Up to 300% Up to 5x faster file Capture, trans- Secure, physical Integrations Hybrid storage that connection faster transfers transfers than form, & load transport between 3rd party seamlessly connects between your data into and out of open source tools. streaming data appliances that vendors and AWS on-premises center, office, or S3. Ideal when Ideal for migrating into S3 for use move up to services. Ideal for applications to AWS colocation working with data into EFS or with Amazon Exabytes of data leveraging existing storage. Ideal for environment and long geographic moving between business into and out of software licenses backup, DR, AWS distances cloud file systems intelligence and AWS and skills bursting, tiering or analytics tools migration
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS Snow Family
AWS Snowball AWS Snowball Edge AWS Snowmobile • 80 TB capacity/10 G network • 100 TB capacity/10GE+ • Exabyte-scale 45ft container • Data encryption end-to-end networking • Data encryption end-to-end • Rugged 8.5 G impact case • Compute and storage for • Dedicated security personnel hybrid/edge workloads • Rain and dust resistant • GPS tracking, alarm • Data encryption end-to-end monitoring, 24/7 • Rugged 8.5 G impact case surveillance, and optional • Rain and dust resistant additional security • Rack-mountable, clusterable
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Storage Gateway enables hybrid storage solutions Use standard storage protocols to access AWS storage services
Customer Premises
Amazon S3 Application Direct servers NFS Connect Amazon Glacier iSCSI
Enterprise VTL Internet Amazon EBS storage File Volume snapshots Tape Backup Amazon AWS servers Amazon CloudWatch KMS VPC AWS AWS CloudTrail IAM
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Backup and restore
Canada’s largest biotech firm Data sovereignty required local hot files “It made no sense to keep buying and tape archives in each of 10 global offices big disk siloes, especially as we opened up new global offices, and now we can • AWS Volume Gateway eliminated 50-hour recover in the cloud from a snapshot if backup windows and tape archive systems we ever had to.” • Cut on-premises storage CAPEX 40%; dropped RTO from 48 hours to 10 minutes • Meets cloud strategy while retaining local - Adam Leggett ownership and data sovereignty IT manager • Enabled datacenter exit in next 12 months
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon EFS File Sync Sync data from existing NFS file systems into EFS file systems
Simple Fast Secure Set up and manage easily Up to 5x faster than Encrypted, parallel from the AWS Console standard Linux copy tools data transfer
• File systems from on-premises to EFS Use EFS File Sync to copy… • DIY in-cloud file systems to EFS • EFS file systems between AWS Regions
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon S3 Transfer Acceleration
Accelerated Transfer Public Internet
Up to 300% faster 171% on average Time [hrs]
Rio De Warsaw New York Atlanta Madrid Virginia Melbourne Paris Los Seattle Tokyo Singapore Janeiro Angeles 500 GB upload from clients in these locations to a bucket in Singapore
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What’s new in AWS Storage?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Recently announced storage features Storage S3 & Glacier EBS EFS Gateway Snow Family • Support for • VTL (Tape Amazon Glacier Gateway) is and S3 One Zone- available in Asia Infrequent Access Pacific (Singapore) to Amazon Region
April CloudWatch storage metrics • S3 Select is now Generally Available • Copying encrypted • HIPAA-eligible • VTL (Tape • Snowball Edge snapshots under service Gateway) expands now available in custom CMK now backup application Asia Pacific completes faster • Available in Asia support with (Singapore) region with less storage Pacific (Seoul) NovaStor
May region DataCenter
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. More on Why AWS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Why AWS storage?
The best reliability and largest scale The most secure, The most complete portfolio compliant, and auditable
The most data movement choices More than twice the partners The most comprehensive support and consulting
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. The most reliable and largest scale
“…the scale at which AWS operates its public cloud storage services dwarfs the other vendors in this Magic Quadrant.”
- Gartner Magic Quadrant for Public Cloud Storage Services, Worldwide Raj Bala, Arun Chandrasekaran, John McArthur, July 24, 2017
For example: Amazon S3 holds trillions of objects and OBJECTS regularly peaks at millions of requests per second
© 2018, Amazon Web Services, Inc.TIME or its affiliates. All rights reserved. AWS offers the most ways to move data to/from the cloud
Networks Roads Hybrid
AWS Amazon S3 Amazon EFS Amazon AWS Snow APN AWS Direct Transfer File Kinesis family competency Storage Connect Acceleration Sync Firehose (Snowball, Snowball partners Gateways Edge, Snowmobile)
A private Up to 300% Up to 5x faster file Capture, trans- Secure, physical Integrations Hybrid storage that connection faster transfers transfers than form, & load transport between 3rd party seamlessly connects between your data into and out of open source tools. streaming data appliances that vendors and AWS on-premises center, office, or S3. Ideal when Ideal for migrating into S3 for use move up to services. Ideal for applications to AWS colocation working with data into EFS or with Amazon Exabytes of data leveraging existing storage. Ideal for environment and long geographic moving between business into and out of software licenses backup, DR, AWS distances cloud file systems intelligence and AWS and skills bursting, tiering or analytics tools migration
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Twice as many partnerships
Primary Storage Backup and Restore Archive
Disaster Recovery Analytics Enterprise Applications Complete partner list at https://aws.amazon.com/backup-recovery/partner-solutions/
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Experienced training, support and consulting
Escalation: Operation: Customization: AWS Support Plans AWS Training AWS Professional Services Support packages for Helping organizations Supplemental, specialized environments that are: adopt AWS through: experience and skills:
• Experimental • Digital courses • APN Consulting Partners • Production • Classroom training • AWS ProServe • Business-critical • Certification exams • AWS managed services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Call to action… Learning paths to AWS Storage
https://aws.amazon.com/training/path-storage/ © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Please complete the session survey in the summit mobile app.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Submit Session Feedback
1. Tap the Schedule icon. 2. Select the session 3. Tap Session Evaluation you attended. to submit your feedback. Thank you