From Cloud Computing to Sky Computing

Ion Stoica and Scott Shenker
UC Berkeley

Abstract

We consider the future of cloud computing and ask how we might guide it towards a more coherent service we call sky computing. The barriers are more economic than technical, and we propose reciprocal peering as a key enabling step.

ACM Reference Format:
Ion Stoica and Scott Shenker. 2021. From Cloud Computing to Sky Computing. In Workshop on Hot Topics in Operating Systems (HotOS '21), May 31-June 2, 2021, Ann Arbor, MI, USA. ACM, New York, NY, USA, 7 pages. https://doi.org/10.1145/3458336.3465302

1 Introduction

In 1961, John McCarthy outlined what he saw as the future of computing (we regretfully note the gendered language):

"computation may someday be organized as a public utility, just as the telephone system is a public utility. We can envisage computer service companies whose subscribers are connected to them [...]. Each subscriber needs to pay only for the capacity that he actually uses, but he has access to all programming languages characteristic of a very large system."

As a prediction about technology, McCarthy's vision was remarkably prescient. His short description accurately depicts what we now call cloud computing, where users have access to massive amounts of computation and storage and are charged only for the resources they use. In contrast, his prediction about economics was far off the mark: today, in the United States, even telephone service is no longer a public utility but is instead delivered by a competing set of providers. However, while not a utility, phone service is largely a commodity, providing a uniform user experience: no matter what provider you use, you can reach anyone, and switching providers is relatively easy; you can even keep your number when switching.

Returning to McCarthy's prediction, what concerns us here is not that there is no single public computing utility, but that cloud computing is not an undifferentiated commodity. In contrast to telephony, the cloud computing market has evolved away from commoditization, with cloud providers striving to differentiate themselves through proprietary services. In this paper we suggest steps we can take to overcome this differentiation and help create a more commoditized version of cloud computing, which we call Sky computing. Before doing so, we briefly summarize the history of cloud computing to provide context for the rest of the paper.

2 Historical Context

The NSF high-performance computing (HPC) initiative in the 1980s provided an early glimpse of the utility computing vision, though limited to the HPC community. However, the advent of personal computers delayed this vision by putting the focus on smaller-scale computing available to all, rather than high-performance computing for a smaller community. The personal computer industry was started by hobbyists who wanted their own machines to tinker with, and was fuelled by Moore's Law, which enabled personal computers to keep up with rapidly increasing user demands.

The emergence of the Internet quickly made several popular services accessible worldwide, including e-mail, bulletin board systems, and games. The advent of the World Wide Web ignited an explosion of new services such as search, on-line retail, and eventually social media. To cope with the resulting scale, these services had to build datacenters and design their own complex distributed systems. This path was followed by many companies – such as Yahoo!, Google, eBay, and Amazon – but the onus of creating large-scale, service-specific infrastructures became a barrier to entry in the Internet services market, because these steps required huge investments that most could not afford.

This changed in 2006 when Amazon launched S3 and EC2, kicking off the age of cloud computing by democratizing access to compute and storage and promoting the "pay-as-you-go" business model. This, together with the end of Moore's Law (which made it quite expensive to build and scale services in on-premise clusters), led to a renewed push towards building what could have become utility computing. However, commercial trends have pushed us in a different direction.

In the early years, when Amazon was dominant in the cloud, it set the de facto standard for cloud computing. However, in the past decade several competitors have emerged in this market. According to a recent report [6], AWS now owns just 32% of the market, followed by Microsoft with 19%, Google with 7%, and Alibaba with 6%, with the other clouds (such as IBM and Oracle) splitting the remaining 37% of the cloud market (due to roundoff errors these figures add up to 101%). This competition has led to lower prices and an increasing array of products and services. For instance, AWS alone offers more than 175 products and services [8]. However, many of these services are proprietary, and these proprietary services are one of the main ways cloud providers differentiate themselves. For example, each cloud has its own APIs for managing clusters, its own version of an object store, its own version of a data warehouse, its own serverless offering, and so on. Applications developed on one cloud often do not work on a different cloud without substantial changes, in the same way that an application written for Microsoft Windows requires substantial changes to work on Mac OS. Thus, competition in the cloud market has led us away from the vision of utility computing.
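To make the divergence concrete, the following sketch writes the same object to two clouds through their respective Python SDKs (boto3 for AWS, google-cloud-storage for GCP); the bucket names and credentials setup are hypothetical placeholders, not part of the paper. Even this trivial operation goes through provider-specific clients, authentication schemes, and error types, and richer services such as data warehouses or serverless offerings diverge far more.

```python
# Writing one object via two proprietary APIs: a minimal sketch of the
# portability gap described above. Bucket names are hypothetical, and
# credentials are assumed to be configured in the environment.

import boto3                        # AWS SDK for Python
from google.cloud import storage    # Google Cloud Storage SDK

def put_object_aws(bucket: str, key: str, data: bytes) -> None:
    # AWS: client construction and call shape are S3-specific.
    s3 = boto3.client("s3")
    s3.put_object(Bucket=bucket, Key=key, Body=data)

def put_object_gcp(bucket: str, key: str, data: bytes) -> None:
    # GCP: a different client, object model, and method vocabulary.
    client = storage.Client()
    client.bucket(bucket).blob(key).upload_from_string(data)

if __name__ == "__main__":
    payload = b"hello, sky"
    put_object_aws("my-aws-bucket", "greeting.txt", payload)  # hypothetical bucket
    put_object_gcp("my-gcp-bucket", "greeting.txt", payload)  # hypothetical bucket
```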
Of course, there have been many calls for standardizing cloud computing [28, 39], but these have had little impact on the increasing drive towards cloud differentiation. The business models of current cloud providers are built around attracting, and then retaining, customers, and this goes against offering a purely commoditized service. The question we address, then, is: how do we make progress towards the goal of utility computing?

Our proposal is called Sky computing, to connote that we are trying to look beyond individual clouds. However, we are not the first to use the term "Sky computing". Starting in 2009, several papers have proposed designs with this name [30, 34, 35]. However, these papers focus on particular technical solutions, such as running middleware (e.g., Nimbus) on a cross-cloud Infrastructure-as-a-Service platform, and target specific workloads such as high-performance computing (HPC). This paper takes a broader view of Sky computing, proposing it as the universal software platform for future applications, and considering how technical trends and the [...]

[...] accommodate diversity? In 1972 Robert Kahn proposed open-architecture networking [32], which advocated a universal interoperability layer that could allow any two networks to interconnect. This became the Internetworking Protocol (IP). IP and the protocols above it were all that was needed in the ARPAnet, which was a single coherent network. However, as the ARPAnet grew into the Internet, and the Internet grew to include separate Autonomous Systems (ASes, which are independently run networks), the Internet needed some way for packets in one network to reach another. This was not about low-level compatibility in network technologies; it was a question of routing: how can the network successfully direct packets over a set of independently run networks?

The Border Gateway Protocol (BGP) was invented to solve this problem, and is now the "glue" that makes the set of independently run networks appear as a coherent whole to users. This was a technical solution, but an economic problem remained: when networks interconnect, does money change hands, and if so, who pays whom? A set of business practices has emerged for these "peering" or interconnection agreements. For instance, customers pay providers, but two networks of similar size and capability might do payment-free peering. These peering arrangements are technically straightforward, but bring up thorny policy questions, such as: can provider A charge Internet service B for carrying B's traffic, even if B is not directly connected to A? Thus, issues of unfair competition arise in these agreements (and are related to questions of network neutrality).
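The economics of these agreements can be caricatured in a few lines. The sketch below is a toy model of our own construction, not any real operator's policy: networks of comparable size peer settlement-free, while otherwise the smaller party buys transit from the larger; the similarity threshold is an illustrative assumption.

```python
# Toy model of the peering business practices sketched above.
# The threshold and terms are illustrative assumptions, not real policy.

from dataclasses import dataclass

@dataclass
class Network:
    name: str
    traffic_tbps: float  # rough proxy for size and capability

def interconnection_terms(a: Network, b: Network, similar: float = 2.0) -> str:
    """Return who pays whom when networks a and b interconnect."""
    big, small = (a, b) if a.traffic_tbps >= b.traffic_tbps else (b, a)
    if big.traffic_tbps <= similar * small.traffic_tbps:
        # Comparable size and capability: payment-free peering.
        return f"{a.name} and {b.name}: settlement-free peering"
    # Otherwise the smaller network is effectively a customer of the larger.
    return f"{small.name} pays {big.name} for transit"

print(interconnection_terms(Network("AS-A", 10.0), Network("AS-B", 8.0)))
print(interconnection_terms(Network("AS-A", 10.0), Network("AS-C", 0.5)))
```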
Thus, there were three key design decisions that allowed the Internet to provide a uniform interface to a huge infrastructure made out of heterogeneous technologies (from Ethernet to ATM to wireless) and competing companies. The first is a "compatibility" layer that masks technological heterogeneity. The second is interdomain routing, which glues the Internet together, making it appear as one network to end users. The third is a set of economic agreements, forming what we will call a "peering" layer, that allow competing networks to collaborate in creating a uniform network.

What lessons does this teach us about the cloud? To fulfil the vision of utility computing, applications should be able to run on any cloud provider (i.e., write-once, run-anywhere).
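A cloud "compatibility layer" in this spirit could be as simple as programming against one interface with per-provider implementations behind it. The minimal sketch below is our own illustration (the ObjectStore interface and class names are invented for this example, reusing the real SDK calls shown earlier), covering only object storage:

```python
# A minimal sketch of a compatibility layer for object storage:
# applications code against ObjectStore and run unchanged on either cloud.
# Interface and class names are our own illustration; bucket names are
# hypothetical, and credentials are assumed to be configured.

from abc import ABC, abstractmethod

class ObjectStore(ABC):
    @abstractmethod
    def put(self, bucket: str, key: str, data: bytes) -> None: ...

class S3Store(ObjectStore):
    def put(self, bucket: str, key: str, data: bytes) -> None:
        import boto3
        boto3.client("s3").put_object(Bucket=bucket, Key=key, Body=data)

class GCSStore(ObjectStore):
    def put(self, bucket: str, key: str, data: bytes) -> None:
        from google.cloud import storage
        storage.Client().bucket(bucket).blob(key).upload_from_string(data)

def save_report(store: ObjectStore, bucket: str) -> None:
    # Application code is provider-agnostic: write-once, run-anywhere.
    store.put(bucket, "report.txt", b"quarterly numbers")

save_report(S3Store(), "my-aws-bucket")   # hypothetical buckets; choosing the
save_report(GCSStore(), "my-gcp-bucket")  # provider is now a deployment choice
```

A real compatibility layer would of course have to span far more than object storage (cluster management, data warehouses, serverless, and so on), but the structural idea is the same: the provider-specific surface is confined behind a uniform interface.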