Big Data Analytics Options on AWS: AWS Whitepaper
Copyright © Amazon Web Services, Inc. and/or its affiliates. All rights reserved.

Amazon's trademarks and trade dress may not be used in connection with any product or service that is not Amazon's, in any manner that is likely to cause confusion among customers, or in any manner that disparages or discredits Amazon. All other trademarks not owned by Amazon are the property of their respective owners, who may or may not be affiliated with, connected to, or sponsored by Amazon.

Table of Contents

Abstract
Introduction
The AWS advantage in big data analytics
    Amazon Kinesis
        Ideal usage patterns
        Cost model
        Performance
        Durability and availability
        Scalability and elasticity
        Interfaces
        Anti-patterns
    Amazon MSK
        Ideal usage patterns
        Cost model
        Performance
        Durability and availability
        Scalability and elasticity
        Interfaces
        Anti-patterns
    AWS Lambda
        Ideal usage patterns
        Cost model
        Performance
        Durability and availability
        Scalability and elasticity
        Interfaces
        Anti-patterns
    Amazon EMR
        Ideal usage patterns
        Cost model
        Performance
        Durability and availability
        Scalability and elasticity
        Interfaces
        Anti-patterns
    AWS Glue
        Ideal usage patterns
        Cost model
        Performance
        Durability and availability
        Scalability and elasticity
        Interfaces
        Anti-patterns
    AWS Lake Formation
        Ideal usage patterns
        Cost model
        Performance
        Durability and availability
        Scalability and elasticity
        Interfaces
        Anti-patterns
    Amazon Machine Learning
        Ideal usage patterns
        Cost model
        Performance
        Durability and availability
        Scalability and elasticity
        Interfaces
        Anti-patterns
    Amazon DynamoDB
        Ideal usage patterns
        Cost model
        Performance
        Durability and availability
        Scalability and elasticity
        Interfaces
        Anti-patterns