Anycast on the Move: ALookatMobileAnycastPerformance
Sarah Wassermann John P. Rula Fabi´an E. Bustamante Pedro Casas Inria Paris Northwestern University Northwestern University AIT Vienna [email protected] [email protected] [email protected] [email protected]
Abstract—The appeal and clear operational and economic ben- With IP anycast, services advertise a single IP address from efits of anycast to service providers have motivated a number of many physical locations (anycast sites) and clients’ requests recent experimental studies on its potential performance impact are directed, based on BGP routing policies, to a “nearby” for end users. For CDNs on mobile networks, in particular, anycast provides a simpler alternative to existing routing systems replica [4]. The approach is being used for redirecting clients challenged by a growing, complex, and commonly opaque cellular in a range of applications, from naming (e.g., root servers and infrastructure. This paper presents the first analysis of anycast top-level domain resolvers) to content delivery. By lettingBGP performance for mobile users. In particular, our evaluation control request routes, anycast routing obviates the need for focuses on two distinct anycast services, both providing part of fine-grained infrastructure or client information. BGP routing the DNS Root zone and together covering all major geographical regions. Our results show that mobile clients tend to be routed also offers some degree of robustness, adapting to changes to suboptimal replicas in terms of geographical distance, more in service or network availability, and allows for some policy frequently while on a cellular connection than on WiFi, with control. The benefits of anycast to service providers have mo- asignificantimpactonlatency.Wefindthatthisisnotsimply tivated a number of recent experimental studies on its potential an issue of lacking better alternatives, and that the problem performance impact for end users. Prior analyses have shown is not specific to particular geographic areas or autonomous systems. We close with a first analysis of the root causes of this that anycast routing can be suboptimal [5], unstable [6], [7], phenomenon and describe some of the major classes of anycast and seemingly chaotic [8], [9], as routing policies have not anomalies revealed during our study, additionally including a only technical motivations, but could be dictated by political or systematic approach to automatically detect such anomalies commercial reasons. Routing changes can silently shift traffic without any sort of training or annotated measurements. We from one site to another with a consequent loss of shared release our datasets to the networking community. state and potential performance impact [10]. Yet, despite the growing dominance of mobile Internet access, no previous I. INTRODUCTION studies have evaluated the effectiveness of anycast for mobile The impressive growth in mobile devices and use has led users, including its routing behavior and performance. to an unprecedented amount of cellular traffic. The number of In this paper, we take a first look at anycast performance mobile subscriptions has grown rapidly in just a few years, for mobile users. In particular, our evaluation focuses on two surpassing 7.8 billion in the third quarter of 2017 [1]. Today, distinct anycast services, K- and F-Root DNS, each providing users spend most of their time browsing on their mobile part the DNS Root zone and together covering all major phones, more than on any other device. In the United States, geographical regions. Similarly to previous work [9], [11], for instance, the average smartphone user spends 2x−3x more [12], we use the geographic distance as our evaluation metric. hours (87 hours per month) on her mobile device than she Geographic distance is a useful metric to estimate RTT [13], does on desktop machines (34 hours per month) [2]. and IP-to-location mapping techniques such as GeoPing [14] Most of this content is delivered to end users by content and Constraint-Based Geolocation (CBG) [15] rely on the delivery networks (CDNs). CDNs deploy servers around the correlation between network delay and geographic distance. world and redirect clients to nearby replicas to improve In the specific case of wireless networks hosting users on the performance and reliability. The process of replica selection – move, we consider geographic distance to be a more suitable the mapping of each client to a close replica – is key to CDN proximity metric than delay. Indeed, delay is influenced by performance. Most commonly, it has relied on a DNS-based multiple factors which can vary greatly, even if the user does approach pioneered by Akamai. not move much. Note that, while we are interested in the Traditional replica selection systems, however, are being impact and behavior of delay towards user-requested content, challenged by the rapid growth, increased complexity, and geographic distance provides a more stable ground truth to common opacity of cellular infrastructure [3]. Anycast offers use as baseline for the comparison of measurements. We show mobile CDNs an alternative approach. that mobile clients tend to be routed to suboptimal replicas in terms of geographical distance, more frequently while on This work has been carried out while Sarah Wassermann was an MSc. acellularconnectionthanonWiFi,withasignificantimpact student at the University of Li`ege. on perceived service performance. In the following section, we provide background on cellular and K-Root. We selected these services since they are both networks and anycast routing and review prior work. We widely replicated, covering together all geographic areas (with describe our methodology and datasets for measuring anycast approximately 60 servers for K-Root and nearly 140 for F- routing for mobile end users in Section III, and present our Root) and both with publicly available site locations and findings in Section IV. We further investigate the causes unicast IP addresses. The latter point is key to let us evaluate of aberrant anycast behavior for mobile users in Section V the performance of anycast routing relative to its “optimal”(in and present a systematic approach to identify such anomalies terms of unicast) site location. in an unsupervised manner through clustering techniques in We collected active measurements from geographically dis- Section VI. Finally, we conclude in Section VII. tributed clients on both cellular and WiFi networks from September 2016 to April 2017, using the ALICE engine1. II. ANYCAST AND MOBILE USERS ALICE conducts mobile network measurements by executing Previous efforts have focused on techniques for character- asmall,self-containedexperimentscript,runapproximately izing anycast deployment [16], enumerating [17]–[19], and every hour. In each experiment, for each of the two root DNS geolocating servers based on latency measurements [17], [18]. services, clients launched ping and traceroute measurements Cicalese et al. [16] presented the first Internet-wide anycast towards the root server’s anycast address, as well as towards census and showed that major players in the Internet ecosys- five unicast addresses of the analyzed DNS service determined tem are relying on anycast, even though only a small part of to be the geographically closest to the client. Target unicast the IPv4 space has so far been anycasted. addresses were selected based on the origin country of each Calder et al. [20] analyzed the performance of anycast in client’s IP address, as reported by whois. the Bing CDN. The authors showed that, although anycast Note that, besides our previous justification for choosing performed well in general, about 20% of the clients were geographical distance over latency as selection criteria, we redirected to a suboptimal replica, with a negative impact on consider the closest unicast replicas in terms of geographical performance. Kuipers et al. [9] reported a related study on the distance and not latency also due to practical concerns. Indeed, performance of the K-Root DNS service using the RIPE Atlas we cannot know the closest unicast replicas in terms of framework [21]. Like Calder et al., they observed that anycast latency without flooding the user’s connection with latency was suboptimal for multiple probes with, for instance, 46% measurements towards all available unicast replicas before of them being redirected to a suboptimal DNS server, both in starting every new experiment, which would be impractical terms of latency and geographical distance. Li and Spring [22] and highly invasive. Moreover, a previous study [9] shows claimed that the main causes for this behavior are Tier-1 ASes, that geographically distant replicas generally result in poorer which forward almost all requests systematically to the same latency, further supporting our methodology. replica, irrespectively of the user’s location. We collected data from mobile devices while clients were Addressing the flaws of anycast remains a major challenge. connected to either WiFi or cellular networks. We thus divide Schmidt et al. [5] analyzed the impact of the number of our data into a set containing experiments launched from anycast sites on the performance of this paradigm in the cellular networks (CELL) and another one including the DNS infrastructure. The authors concluded that increasing experiments issued from WiFi (WIFI). As measurements were the number of sites does not solve the suboptimal mapping collected opportunistically, based on connection availability observed nor, counterintuitively, reduces clients’ latencies. and resource usage, the number of experiments in each of In [8], Bellis et al. described their use of RIPE Atlas to detect these sets is not the same. and address issues related to F-Root, such as the redirectionof CELL dataset. CELL includes more than 20,000 experi- requests to replicas in different continents (e.g., from a probe ments, issued from 151 different clients. Our cellular users in Europe to a server in Atlanta). were scattered across nearly 40 different countries, with 70% Zarifis et al. [23] studied the impact of the Internet topology of them located in the United States, Greece, Brazil, and and routing on the performance perceived by mobile users, and France. Furthermore, the analyzed clients were hosted in a revealed that a significant fraction of Internet paths are inflated, range of major ASes, including AS 29247 (GR-Cosmote, with a non-negligible performance penalty. The authors inves- Greece), AS 26599 (Vivo, Brazil), and AS 22394 (Verizon tigated the root causes for route inflation and found the lack Wireless, US). of carrier ingress points to be one of the main reasons. Our work builds on approaches and insights from many of WIFI dataset. WIFI encompasses three times as many these past efforts. Yet, to the best of our knowledge, this is the experiments as CELL, issued from 251 clients. Clients were first work focused on the performance of anycast in mobile located in nearly 50 different countries around the world with networks. 70% of them in Greece and the United States. Some of the most active clients (i.e., those that have launched the most III. METHODOLOGY AND DATASETS experiments) were hosted in AS 6799 (OTENET-GR, Greece), This section describes our datasets and measurement AS 9121 (TTNet, Turkey), and AS 7922 (COMCAST, US). methodology for analyzing anycast from mobile devices. We focus our analysis on two major root DNS services: F-Root 1http://aqualab.cs.northwestern.edu/projects/261-alice 0.8 0.9 0.7 0.8 0.6 0.7 0.5 0.6 0.5 0.4 CDF CDF 0.4 0.3 0.3 0.2 Cell 0.2 Cell 0.1 WiFi 0.1 WiFi 0 0 0 1000 2000 3000 4000 0 30 60 90 120 150 180 210 240 270 300 D (km) Latency (ms) client → anycast client → anycast (a) Distance to K-Root. (a) Latency to K-Root. 0.8 0.9 0.7 0.8 0.6 0.7 0.5 0.6 0.5 0.4 CDF CDF 0.4 0.3 0.3 0.2 Cell 0.2 Cell 0.1 WiFi 0.1 WiFi 0 0 0 400 800 1200 1600 2000 0 20 40 60 80 100 120 140 160 180 200 D (km) Latency (ms) client → anycast client → anycast (b) Distance to F-Root. (b) Latency to F-Root. Fig. 1: Travel distance from clients to anycast servers. Fig. 2: Latency from clients to anycast servers.
Atotalof125clientslaunchedexperimentsfromboth the experiments carried out on a WiFi network, whereas this cellular and WiFi networks. holds for merely 50% of the experiments issued from cellular We are making both the CELL and WIFI datasets publicly clients. While we can observe the same phenomenon for F- available on GitHub2. Root on Figure 1(b), the differences are not as striking as for the K-Root service. The difference in number and geographic IV. FINDINGS AND OBSERVATIONS distribution between F- and K-Root services explains the In this section, we present key observations from our measured differences in the distances traveled by a client analysis of mobile anycast performance. In particular, our request on a cellular or WiFi connection. Still, clients on results suggest that mobile clients are most of the time routed cellular networks travel farther to their assigned replicas(with towards suboptimal replicas in terms of geographical distance potential latency implications). (Section IV-A), more frequently while on a cellular connection than on WiFi, and that the additional distance traveled by re- B. Impact on Latency quests has a significant impact on performance (Section IV-B). We analyze the scope of this phenomenon (Section IV-C) and Does the additional distance traveled by users’ requests re- the existence of better alternatives (Section IV-D). sult in worse user performance? We evaluate the impact of the distance between clients and their replicas in terms of latency. A. The Travel Distance Problem Figure 2 shows the distribution of their latencies. Indeed, the figure reveals that latencies are higher for the measurements Our study is partially motivated by anecdotal evidence of carried out on cellular networks than for the ones performed long geographic distances between mobile clients and their on WiFi. Again, the difference is more pronounced for K-Root assigned anycast DNS servers. To characterize these distances, (Figure 2(a)) than for F-root (Figure 2(b)). A comparison of given that anycast IP addresses cannot be geolocated [24], we both figures shows that, while the distribution of latencies is geolocate the penultimate hop on the path from a client to similar for K-Root and F-Root on WiFi, we see significantly the anycast server and estimate its location using the Akamai worse performance for K-Root compared to F-Root on cellular EdgeScape service3.Weusethisandtheclientlocationto networks: 90% of the experiments output a RTT lower than compute the geographic distance between a client and her as- 200 milliseconds for F-Root, but only 70% for K-Root DNS. signed anycast server. For each experiment, the client recorded her anonymized geographic location to a 10 km2 area. Figure 1 Other factors beyond distance traveled, such as signal shows the distribution of these distances, in kilometers, for strength and radio access latency, could result in performance both WiFi and cellular users. degradations for clients on cellular networks. To understand Figure 1(a) presents the travel distance between clients the importance of geographic distance on observed latency, we look at the correlation between distance and minimum and their K-Root anycast servers (Dclient→anycast). We see a RTT for cellular measurements. Figure 3, a scatter plot of significant difference between cell and WiFi: Dclient→anycast is smaller than 4,000 kilometers for approximately 75% of this correlation, clearly shows that, while not the only factor, the distance between cellular clients and their assigned anycast 2https://github.com/SAWassermann/mobile-anycast servers is indeed an important aspect to take into account when 3https://www.akamai.com/us/en/products/web-performance it comes to performance optimization. We observed the same 500 1 450 0.9 400 0.8 350 0.7 300 250 0.6 200 0.5
CDF AS 36180 (US) 0.4 150 AS 26599 (BR) Min RTT (ms) 100 K-Root 0.3 AS 12430 (ES) 50 F-Root 0.2 AS 23243 (GT) 0 0.1 AS 22394 (US) 0 3000 6000 9000 12000 15000 0 D (km) client → anycast 0 2000 4000 6000 8000 10000 12000 14000 D (km) Fig. 3: Correlation between geographic distance and minimum client → anycast RTT. (a) Dclient→anycast observed for K-Root. 1 0.9 behavior for WiFi clients; the graph is not included due to 0.8 space constraints. 0.7 0.6 0.5 C. Where Does the Travel Distance Problem Occur? AS 26599 (BR) CDF 0.4 AS 36180 (US) Is it possible that the observed issues are limited to some 0.3 AS 20057 (US) specific geographical regions or a few particularly mismanaged 0.2 AS 22394 (US) 0.1 AS 21928 (US) ASes? To explore this, we now focus our analysis on five 0 ASes that provide the most measurements for clients in cellular 0 2500 5000 7500 10000 12500 15000 D (km) networks. This leaves us with 2,310 measurements issued from client → anycast (b) Dclient→anycast observed for F-Root. 16 users for K-Root and 1,860 measurements retrieved from Fig. 4: Dclient→anycast in top 5 ASes (w.r.t. number of 15 mobile clients for F-Root DNS. measurements). Figure 4 presents the distributions of the distances traveled to anycast servers for clients in these ASes. While we see large variability within and across ASes, particularly in US- 1 0.9 based networks potentially covering larger geographic areas, 0.8 there are no clear patterns to argue for the identified problem 0.7 to be a region- or AS-specific issue. 0.6 0.5 AS 36180 (US) As noted before, F-Root DNS replicas seem to be closer CDF 0.4 AS 26599 (BR) to their clients, regardless of the network. Indeed, for 80% of 0.3 AS 12430 (ES) 0.2 the launched measurements towards F-Root anycast servers, AS 23243 (GT) 0.1 AS 22394 (US) D → is less than approximately 4,300 kilometers, 0 client anycast 0 1250 2500 3750 5000 6250 7500 8750 10000 while this distance is larger than 8,500 kilometers for 20% of δ (km) client the measurements issued towards K-Root servers. (a) δclient observed for K-Root. D. Do Cellular Clients Have a Closer Option? 1 0.9 Another possible explanation for the long distances would 0.8 0.7 be that these cellular clients are simply located far away 0.6 from all the available replicas. Further analysis shows that 0.5 AS 36180 (US) CDF this is not necessarily the case. We compute the distance 0.4 AS 22394 (US) 0.3 AS 20057 (US) between cellular clients and both their assigned anycast server 0.2 AS 21928 (US) 0.1 AS 26599 (BR) (Dclient→anycast)andtheirgeographicallyclosestunicast 0 replica (Dclient→unicast). We compute the additional distance 0 2000 4000 6000 8000 10000 12000 14000 δ (km) traveled by anycast as the difference between these two client (b) δ observed for F-Root. distances (i.e., Dclient→anycast − Dclient→unicast), which we client Fig. 5: δclient in top 5 ASes (w.r.t. number of measurements). denote by δclient. Figure 5 presents the additional distance traveled by anycast requests, δclient,forthemeasurementslaunchedfromthefive Particularly clear examples for the K-Root problem are ASes providing the most measurements for K- and F-Root clients residing in the United States, but being routed towards DNS. We can easily infer from this graph that our clients anycast servers located in London and Tehran. This is even are most of the time routed to a suboptimal replica and that more surprising as there are at least seven K-Root servers this issue does not seem to be specific to one AS or region. distributed across the United States, with four hosted on the Nevertheless, comparing Figures 5(a) and 5(b) leads us to East Coast and two on the West Coast. Finally, we observe conclude that anycast routing is significantly worse for K-Root that, in each of the examined ASes, some mobile clients are than for F-Root, even though the mappings for F-Root are far redirected to the nearest anycast F-Root server, while the ideal from optimal. case never occurs for the K-Root service. V. W HY TRAVELING SO FAR?AFIRST LOOK INTO o] vö >} ö]}vW }êö}vU D W't >} ö]}vW >}ê vP o êU CELLULAR ANOMALIES v« êö êö]v ö]}v E å êö hv] êö êö]v ö]}v In the following paragraphs, we explore some of the causes ,}â /W ^E ,}â /W ^E ÌrÒ Wå]¿ ö å êê ê r r ÌrÒ Wå]¿ ö å êê ê r r of anycast routing anomalies for cellular clients. In particular, Ú ÓÏÙXÌÙÒXÌÚÏXÌÙÓ Ú ÚÌ >}ê vP o êU Ú ÓÏÙXÌÙÒXÌÚÏXÌÙÓ Ú ÚÌ >}ê vP o êU Û ÓÏÙXÌÙÒXÌÚÏXÌÙÌ Ú ÚÌ >}ê vP o êU Û ÓÏÙXÌÙÒXÌÚÏXÌÙÌ Ú ÚÌ >}ê vP o êU we focus on the measurements with a value of δclient higher Ù Ú XÌÓÒXÓÙX Ì Ú ÚÌ ^ v :}ê U Ù Ú XÌÓÒXÓÙX Ò Ú ÚÌ ^ v :}ê U than 1,000 kilometers. This corresponds to more than 80% of ı Ú XÌÓÒXÔÏXÓÔÏ Ú ÚÌ ^ v :}ê U ı Ú XÌÓÒXÔÌXÌÒ Ú ÚÌ ^ v :}ê U the experiments for K-Root and to 70% of the experiments for ÌÏ Ú XÌÓÒXÔÌXÓÌÙ Ú ÚÌ v¿ åU K ÌÏ ÚÔXÌ ÚXÓÚXÓÒÔ ÓÏı r F-Root. We refer to these measurements as suboptimal anycast ÌÌ Ú XÌÓÒXÓıXÌÙ Ú ÚÌ Z] P}U /> ÌÓ ÚÔXÓÔ XÓÒ XÌ Ó ÓÏı ^ oö > l ]ö«U hd ÌÓ Ú XÌÓÒXÓıXÓÏÙ Ú ÚÌ E ¡ z}ålU Ez ÌÒ ÓÏıXÔÔXÓÌ XÓÒÏ ÌÌÏÛÌ ^öX ' }åP U hd measurements. ÌÔ Ú XÌÓÒXÓıXÌÓÛ Ú ÚÌ >}v }vU h<