Big Data & Cloud
Total Page:16
File Type:pdf, Size:1020Kb
Big Data & Cloud 4th European Summit on the Future Internet António Miguel Ferreira, CEO, Lunacloud Aveiro, 13 to 14th June 2013 ? About About Cloud use case 1 Cloud use case 2 Big Data Lunacloud is a cloud infrastructure and platform services provider (IaaS + PaaS), with datacenters in the UK, Portugal, France and Russia (July 2013). About Cloud use case 1 Cloud use case 2 Big Data 1. The Cloud is a more efficient way of using IT resources. 2. High-end compute & storage resources are now widely available through the global Internet. 3. A whole new set of challenges may be addressed quickly and with lower costs. About Cloud use case 1 Cloud use case 2 Big Data Cloud use case 1 IT research About Cloud use case 1 Cloud use case 2 Big Data Altoros Product engineering in areas such as implementation of NoSQL and NewSQL storage systems, Hadoop distributed computing, etc. Offices in Silicon Valley (Sunnyvale, California), Norway, Denmark, Switzerland, UK, Eastern Europe (Minsk, Belarus) and South America (Buenos Aires, Santa Fe, Argentina). Challenge #1: NoSQL benchmarking Testing of different NoSQL databases against various types of workloads: Cassandra, MongoDB, Riak, Couchbase, MySQL Cluster, and Hbase. Yahoo! Cloud Serving Benchmark (YCSB) used to evaluate performance. Infrastructure needs 120 virtual machines with a total of 960 GB RAM + 920 CPU Cores + 12 TB local storage ... For only 15 days! Traditional IT capex = 100.000 € Cloud IaaS cost = 8.000 € About Cloud use case 1 Cloud use case 2 Big Data Altoros Product engineering in areas such as implementation of NoSQL and NewSQL storage systems, Hadoop distributed computing, etc. Offices in Silicon Valley (Sunnyvale, California), Norway, Denmark, Switzerland, UK, Eastern Europe (Minsk, Belarus) and South America (Buenos Aires, Santa Fe, Argentina). Challenge #2: Hadoop benchmarking Testing different Hadoop distribution packages to assess their performance, ease of use, functionality, etc: Apache Hadoop; CDH, Cloudera's Distribution, including Apache Hadoop; HDP, Hortonworks Data Platform; MapR M3 Edition; Intel Hadoop; Pivotal HD. TeraSort benchmark to measure performance. Infrastructure needs 250 virtual machines with a total of 2000 GB RAM + 2000 CPU Cores + 25TB local storage ... For only 25 days! Traditional IT capex = 200.000 € Cloud IaaS cost = 28.000 € About Cloud use case 1 Cloud use case 2 Big Data Concepts – Use case 1: 1.Immediate availability 2.Elasticity 3.Pay per use About Cloud use case 1 Cloud use case 2 Big Data Cloud use case 2 Entertainment About Cloud use case 1 Cloud use case 2 Big Data Music Stage Social network for independent bands and musicians. Challenge: Multimedia storage 10’s or 100’s of thousands of bands and artists want to upload 10’s or 100’s of musics and videos, each with 4 to 30 MB in size. Need for unlimited storage space with no upfront investment. Infrastructure needs Up to 100.000 bands x (100 musics + 10 videos) = 60TB, spread across the world Growing from 1 band to 100.000 bands, without worrying about storage, resilience and geographical coverage. About Cloud use case 1 Cloud use case 2 Big Data Concepts – Use case 2 1.Elasticity 2.On-demand self-service (API) 3.Broad network access About Cloud use case 1 Cloud use case 2 Big Data Big Data NoSQL – Our case About Cloud use case 1 Cloud use case 2 Big Data Cloud Storage Unlimited virtual disk accessible through a web interface or an API. Applications can use cloud storage to place any objects. Data is replicated in at least 3 different physical stores. Built over a NoSQL Cassandra database. Cloud Mongo MongoDB as a Service, with single or replicated instances, provisioned through a point-and-click web interface. It allows developers of applications to focus on code, not database management. About Cloud use case 1 Cloud use case 2 Big Data The future of the Cloud In our view IaaS is a commodity, a utility service, that is best when infrastructure is closer to customers, in nearby datacenters. PaaS is a differentiator and will accelerate innovation. SaaS is where most of the activity and innovation takes place. Thank you Email [email protected] Web www.lunacloud.com Twitter @lunacloud.