Scheduling Latency-Sensitive Applications in Edge Computing

Vincenzo Scoca¹, Atakan Aral², Ivona Brandic², Rocco De Nicola¹ and Rafael Brundo Uriarte¹
¹IMT School for Advanced Studies Lucca, Italy
²Vienna University of Technology, Austria

Keywords: Edge Computing, Scheduling, Latency-Sensitive Services, Live Video Streaming, Resource Selection.

Abstract: Edge computing is an emerging technology that aims to include latency-sensitive and data-intensive applications, such as mobile or IoT services, in the cloud ecosystem by placing computational resources at the edge of the network. Close proximity to the producers and consumers of data brings significant benefits in latency and bandwidth. However, edge resources are, by definition, limited in comparison to their cloud counterparts; thus, a trade-off exists between deploying a service closest to its users and avoiding resource overload. We propose a score-based edge service scheduling algorithm that evaluates both the network and computational capabilities of edge nodes and outputs the maximum-scoring mapping between services and resources. Our extensive simulation, based on a live video streaming service, demonstrates significant improvements in both network delay and service time. Additionally, we compare edge computing technology with state-of-the-art cloud computing and content delivery network solutions in the context of latency-sensitive and data-intensive applications. Our results show that edge computing enhanced with the suggested scheduling algorithm is a viable solution for achieving high responsiveness in deploying such applications.

1 INTRODUCTION

The enhancement of smart devices and network technologies has enabled new internet-based services with strict end-to-end latency requirements (Bilal and Erbad, 2017), for example, 360-degree video and online gaming. Edge Computing is a promising solution for the provision of latency-sensitive applications, since it offers a set of enabling technologies that moves both computation and storage capabilities (i.e., micro cloud data centers) to the edge of the network (Shi et al., 2016). This change allows the deployment of services close to end users, which reduces their response time.

To effectively exploit the advantages of Edge infrastructure, service instances should be scheduled on the node which best fits the service requirements, taking into account contextual knowledge, i.e., user, application and network information (Wang et al., 2017). Indeed, defining an optimal placement not only maximizes user experience by reducing end-to-end latency, but also optimizes network traffic by reducing bandwidth consumption and the related costs, and improves energy efficiency by reducing the need to offload tasks to the cloud, which incurs higher energy consumption compared to micro data centers.

Currently, the few existing scheduling solutions for Edge Computing are related to the Internet-of-Things, mostly focusing on the response time between service components rather than on the optimization of user-related metrics such as the end-to-end service response time (Skarlat et al., 2017). Scheduling approaches devised for similar paradigms, in particular clouds and content delivery networks (CDNs), are constrained by infrastructure limitations which make them unfit to meet the requirements of this type of latency-sensitive service. Specifically, cloud solutions are characterised by large distances between the cloud data centers and the final users, incurring high network delay that considerably degrades the service quality experienced by end users. Moreover, due to the high amount of traffic generated by this kind of service, there is a considerable risk of network congestion as well as of bandwidth waste (Wang et al., 2017). In the case of CDNs, any computational task must be carried out in the cloud, and the distributed network of servers located in close proximity to users is only used for disseminating output data (e.g., web objects, rich media, etc.). However, these servers are still several hops away from final users, and they need to execute requests on cloud servers; therefore, they do not considerably reduce the network delay when the offloaded task is computationally heavy (Bilal and Erbad, 2017).

In this paper, we propose a novel score-based edge scheduling framework specifically designed for latency-sensitive services in Edge. The proposed technique is latency, bandwidth, and resource aware. It schedules each service instance on the VM type whose computing and network capabilities can optimize the service response time experienced by end users. First, the eligibility of the VM types available on the edge nodes to host a given service is evaluated, based on the VM network and resource capabilities; then, the services are scheduled on the most suitable VMs according to their eligibility scores, guaranteeing optimal service quality. We validate our approach on a live video streaming service and evaluate how different deployment solutions, namely Cloud, CDN and Edge, affect the user response time in a latency-sensitive and data-intensive scenario. The obtained results show that edge-based solutions effectively improve user response time, highlighting the need for a suitable scheduling algorithm to obtain optimized results. Moreover, we compare the performance of our approach with (Duong et al., 2012), a well-known latency-sensitive scheduling algorithm for clouds, which aims to minimize the overall delay experienced by end users by reducing the users' request waiting time, without explicitly considering network latency. The results show that edge scheduling algorithms must also take network latency into account to minimize the overall service delay experienced by end users.

Overall, the main contributions of this paper are: (i) a novel Edge scheduling framework for latency-sensitive services; (ii) the performance evaluation of different deployment solutions, namely cloud, CDN and Edge, for latency-sensitive services, which shows the advantages of Edge computing in this context; and (iii) the evaluation of the proposed framework in comparison to a solution for latency-sensitive services in clouds, which shows the benefits of using an algorithm specifically devised for this architecture.

The rest of this paper is organised as follows. In the next section, we present the related works. In Sec. 3 we provide a motivating example discussing the benefit of an edge-based deployment approach for latency-sensitive applications such as a live video streaming service. In Sec. 4 we present the scheduling approach we devised. In Sec. 5 we describe the experimental setup used for the evaluation of our approach together with the results we obtained, whereas conclusions and future works are reported in Sec. 6.

2 RELATED WORKS

The effectiveness of edge technologies has already been proved in many use cases, as reported by Wang et al. in (Wang et al., 2017); however, there are still open research challenges to be tackled, such as service scheduling on edge nodes. Recent works such as (Zhao et al., 2015; Guo et al., 2016; Mao et al., 2016) face the edge computation offloading problem, that is, the decision of scheduling a task on the mobile device or on the local/internet cloud. Nevertheless, these works do not take into account the actual service scheduling between edge nodes.

In the area of Fog Computing, the authors of (Skarlat et al., 2017) defined a scheduling approach for the placement of service modules on fog nodes, focusing only on the optimization of the response time among components, without taking into account the end-to-end service time. Aazam et al. (Aazam and Huh, 2015), instead, defined a resource estimation and pricing model for IoT, which estimates the amount of resources to be allocated for a given service, again without providing a solution for the selection of the node where to allocate the service.

The scheduling of latency-sensitive services, instead, has been widely studied in the Cloud. For example, VM placement solutions for distributed clouds have been developed in (Papagianni et al., 2013; Aral and Ovatman, 2016). These approaches seek an optimal VM placement on physical nodes which minimizes the network latency among them. However, these mapping methodologies rely only on service requirements and physical infrastructure knowledge, without considering user-related information, such as geolocalization, which is a critical feature for the scheduling process in edge. Hence, they are not directly adaptable to our problem. In the context of single cloud providers, the authors of (Piao and Yan, 2010; Zeng et al., 2014) developed service component placement methodologies that, similarly to the distributed scenario described before, optimize the service placement among the servers of a single data center, minimizing the transfer time between service components. Therefore, also these approaches do not take into account user information, which would make them suitable for service scheduling in edge.

The authors of (Duong et al., 2012), however, integrated user information in the service allocation methodology they proposed. Indeed, they defined a provisioning algorithm based on queuing theory to identify the number of VMs to be deployed in order to minimize the user waiting time. However, this approach, intended for IaaS clouds, only defines the number of VMs needed to cope with the incoming load and still misses VM placement policies, which would lead to sub-optimal results in the case of edge computing.

3 MOTIVATING EXAMPLE

Live video streaming services allow streaming a video, recorded by any device equipped with a camera, in nearly real-time to a large number of mobile or desktop viewers globally. Currently, due to the advances in networking technologies, these services are becoming very popular and are attracting the attention of big companies such as Twitter, Facebook and Google, which have developed their own live video streaming services. To better understand the challenges underlying this kind of service, let us first quickly describe the live streaming service workflow depicted in Fig. 1.

Figure 1: Live video streaming workflow.

The first step is the encoding of the input video in a high quality stream, using either the local camera encoder or an external one installed in a remote server. Afterwards, the encoded video is given as input to the transcoding operations, which create streams in different resolutions and bitrates on the fly. This process is fundamental in order to reach a very broad audience. Indeed, creating new streams at various bitrates and resolutions allows an adaptive distribution of the video content according to the device used to access the stream and to the bandwidth conditions, guaranteeing a high quality of experience for many users. The multiple streams output by the transcoding process are then processed by a media server responsible for packaging the video streams according to the specifications (e.g., chunk length, codecs and container) defined by the supported streaming protocols (e.g., HLS, HDS, etc.). The video chunks are then ready to be delivered directly to end users, according to their location, bandwidth and the streaming protocol implemented by the media player they use, or to be first disseminated on multiple servers closer to them.

Although the service workflow may look quite simple, all the steps described are characterized by critical issues that affect the overall user engagement with the video stream. Factors impacting user engagement, as reported in (Dobrian et al., 2011), are the join time (i.e., the duration from when the player initiates a connection to a video server till the time the playback starts), the buffering ratio (i.e., the percentage of the total streaming session time spent in buffering) and the video quality. All these metrics are directly affected by the transcoding and distribution steps of the streaming workflow, which strictly depend on the service deployment solution adopted. Currently, streaming service providers adopt cloud-based deployment solutions in order to face the high and quickly changing processing resource and bandwidth requirements. In the simplest cloud-based architecture, as depicted in Fig. 3, the incoming video stream is usually encoded in a high quality video locally and then uploaded to a cloud server for the transcoding and packaging operations.

Figure 3: Cloud-based solution for live streaming services.

In this way, exploiting the elasticity provided by the cloud infrastructure, it is possible to dynamically manage the resource needs according to the current service load. However, this deployment solution is not the best approach to maintain a low buffering ratio and a high video quality. Whereas the extensive computation required by the transcoding operations can be managed according to the current needs, the best-effort delivery of the packets from a cloud server to the final users can still incur a downgraded video experience. Indeed, the path between the users and the stream origin server in the cloud usually involves multiple hops, and the probability of experiencing link congestion along the route is therefore high. Congested links introduce high network delay and a discontinuous packet delivery rate, causing a drop in the video quality.

To face these network issues, the currently most widely adopted solution is the use of content delivery networks (CDNs) for the distribution of the streaming contents. In this scenario, as depicted in Fig. 4, all the transcoding and packaging operations are still executed on the cloud, but the media contents are then distributed over a network of cache servers located closer to end users. In this way, user requests are automatically directed to the closest cache node, which forwards to the cloud only the requests whose content is not already available on that node. Therefore, bandwidth consumption is improved and the risk of network congestion is lower, reducing the buffering ratio while increasing the video quality experienced.

Figure 4: Delivery networks for streaming media contents.

Nevertheless, CDNs have originally been designed for the distribution of static contents, and adopting them for the distribution of dynamic contents such as live video streams is not straightforward.


The main issues arise from the volume and the requirements of live video streaming users. Indeed, video content accounts for 70% of the whole Internet traffic (Bilal and Erbad, 2017), which is much higher than the traffic generated by other web applications. Moreover, users demand high quality video, instant start-up times and a low buffering ratio, requirements which bring a high demand for bandwidth, whereas CDNs aim to minimize costs and may not meet these demanding requirements. Finally, a CDN-based distribution can easily become too costly as the number of streams and viewers increases, due to the high bandwidth costs.

To face the issues of the approaches described above, the set of innovative technologies introduced by the Edge paradigm may provide the solution needed. Indeed, edge solutions can be used to tackle various challenges currently faced by media and video streaming applications in order to improve the service quality experienced by end users. A possible edge-based video streaming platform is depicted in Fig. 2.

Figure 2: Edge-based platform for live video streaming services.

With edge computing, the incoming video is first encoded in high quality, either with the camera encoder or with an encoder installed in the closest edge node, and then distributed over a set of edge nodes responsible for both the video transcoding and the content distribution. In this way, the video is encoded on the fly and delivered to the viewer, removing all the delay introduced by fetching the video content from the central cloud. We also assume that there are different transcoder configurations related to the different device types (e.g., smartphone, tablet, IPTV, etc.), each transcoding the input video into multiple streams suitable for a given specific device. Therefore, even though edge nodes cannot provide the same computing power as cloud nodes, the distribution of individual encoders, each for a specific type of device, makes the computational requirements of the transcoding operation suitable for edge nodes.

Overall, the main advantages of this approach are the reduction of the end-to-end latency and of the bandwidth consumption. Indeed, the strict proximity of end users to the nodes delivering the video content reduces the delay, due to both a shorter physical distance and a reduced probability of facing network congestion, since the number of links and hops along the route is smaller compared to the cloud- and CDN-based approaches. Moreover, bringing the transcoding process to the edge further contributes to lowering the delay, since the video is processed and packaged on that node without the need of fetching missing content from an origin server in a cloud data center.

4 A SCHEDULING FRAMEWORK FOR EDGE COMPUTING

In this section, we describe the main components of a novel scheduling framework, depicted in Fig. 5, for latency-sensitive services in Edge computing.

4.1 System Overview

A cloud provider extends the cloud infrastructure with a set of edge nodes deployed at the edge of the network. These nodes are geographically distributed according to the model described in (Hu et al., 2015), in proximity of multi Radio Access Technology (RAT) base stations (BSs), which provide access to the core network to both user equipment and edge nodes. We assume that each node is a micro data center that, leveraging virtualization technologies, grants access to its resources by means of virtual machines (VMs). Therefore, each node defines a virtualized environment that allows the deployment of a service instance within an individual VM. Since different services have different computing requirements, in this scenario we assume that a node can provide different types of VMs. To provide edge services, the application service provider (ASP) requests from the provider the instantiation of a service in a VM compatible with the service requirements. Since in our work we consider only latency-sensitive services, the main service requirement the provider has to guarantee is the service response time, that is, the time elapsed between a user sending a request and receiving a reply. In this scenario, the network delay and the processing time are the main factors that affect this metric. The former refers to the time necessary for a user request to travel from the user device to the edge node hosting the service, and it is determined by the user-node physical distance, the queuing and processing delay of each hop along the network route, and the route's available bandwidth. The request processing time is strictly related to the VM and service specifications and refers to the time needed to elaborate a request. In this context, with different VM and service specifications, a further analysis of the computing performance is needed to select a VM type that can optimize this metric.

Our goal is to design a scheduling framework that takes into account the edge computing characteristics to maximize the service quality experienced by the end users of a latency-sensitive application. To that end, we defined a scheduling framework, described in Alg. 1, that first evaluates both the network and the computational capabilities of the edge nodes for each incoming service s, calculating for each VM type a quality score denoting its eligibility to host s, and then schedules the services while maximizing the total overall quality score of the chosen VMs, thus optimizing the service quality for the end users.

4.2 VM Evaluation

4.2.1 Low-latency Path Detection

Predicting the network delay between users and edge nodes provides useful insights to understand whether a given node can meet the network delay requirements of the service to be scheduled and, at runtime, to select the communication routes that minimize the delay. Since several monitoring techniques are available for network latency estimation, for example, sending probes between nodes and then measuring the latency (i.e., active monitoring) or capturing information about network paths directly from the network devices (i.e., passive monitoring) (Yu et al., 2015), we assume that the network delays between edge nodes can be measured, and we use them in our scheduling approach.

We model our edge network as a weighted graph where each node denotes an edge data center in the network, a link between two nodes represents the actual connection between the data centers, and the link weight describes the estimated network latency between them. We then assume that each user connects to the closest BS, and we group together the users connecting to the same BS (line 2). For each user group g, we detect the lowest-latency path from g to every node n by applying the Dijkstra algorithm on the graph modelling our network, considering as source node the one representing the edge data center co-located with the BS of the group (line 5). This set of low-latency paths is used in the VM evaluation process described in the next section.
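As an illustration of this step, the following Python sketch computes the lowest-latency paths of one user group with Dijkstra's algorithm. The adjacency-dictionary representation and all names (lowest_latency_paths, bs_node) are our own illustrative assumptions, not the authors' implementation.

    import heapq

    def lowest_latency_paths(graph, source):
        """Dijkstra over the weighted graph of edge data centers.

        graph: {node: [(neighbor, latency), ...]} -- link weights are the
        estimated network latencies between data centers (Sec. 4.2.1).
        source: the node co-located with the BS of the user group.
        Returns the minimum cumulative latency from source to every node.
        """
        dist = {source: 0.0}
        heap = [(0.0, source)]
        while heap:
            d, node = heapq.heappop(heap)
            if d > dist.get(node, float("inf")):
                continue  # stale heap entry, node already settled with lower latency
            for neighbor, latency in graph.get(node, []):
                nd = d + latency
                if nd < dist.get(neighbor, float("inf")):
                    dist[neighbor] = nd
                    heapq.heappush(heap, (nd, neighbor))
        return dist

    # One run per user group (Alg. 1, line 5), with the group's BS node as source:
    # paths = {g: lowest_latency_paths(graph, bs_node[g]) for g in groups}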
4.2.2 Network and Resource Evaluation

To identify the most suitable VM for a given service, we defined an evaluation process which computes, for each type of VM, a quality score q_v combining a connectivity score q_{n,l}, a bandwidth score q_{n,bw} and a resource score q_{v,res}.

The connectivity score q_{n,l} ∈ [0,1] assesses the quality of the connectivity of a VM v by evaluating the quality of the network routes connecting the user groups to the node n running v. The input data for this process is the set of low-latency paths computed in Sec. 4.2.1. For each network path connecting a given user group g to the node n, we evaluate the delay against the network delay requirements by computing a quality score q_{n,l,g} ∈ [0,1] using a utility function previously defined by the provider (line 7). The output of this path-based evaluation is a set of quality scores {q_{n,l,g_1}, q_{n,l,g_2}, ..., q_{n,l,g_n}}, which is used to compute the final connectivity score q_{n,l} as the mean of these scores weighted by the number of users belonging to each group (line 9).

The bandwidth score q_{n,bw} ∈ [0,1] assesses the available bandwidth of the node n running the given VM v. The available bandwidth on the paths connecting the users to n is one of the main factors affecting the overall service quality, as already motivated in Sec. 3; therefore, information about the node capacity can improve the service scheduling. We compute a per-path bandwidth score q_{n,bw,g} ∈ [0,1], assessing the quality of the bandwidth of each low-latency route in input (line 8). Similarly to the latency evaluation process, we compute the final bandwidth quality score q_{n,bw} as the average of the single-path quality scores q_{n,bw,g_i} weighted by the number of users in each group (line 10).


Figure 5: Scheduling framework.

The VM resource evaluation is carried out by measuring the service load that a given VM type v can handle and the expected overall service load. Indeed, for latency-sensitive applications, underestimating the computational resources necessary for executing the service may considerably increase the service processing time and the overall end-to-end response time. Our approach therefore evaluates the computational resources of a VM v by computing a score q_{v,res} ∈ [0,1] by means of a utility function defined by the provider, comparing the number of user requests w̃ that v can handle with the overall number of expected requests w (line 12).

Finally, the overall quality score q_v ∈ [0,1] of a given VM type is calculated as the harmonic mean of the connectivity, bandwidth and resource quality scores (line 13). While the harmonic mean behaves like the arithmetic mean by giving equal weight to the scores when they are similar, it favours the smaller value as the gap between them increases. This ensures that VMs with a very high and a very low score are penalized, which emphasises the need for sufficient network and computational resources at the same time. The evaluation output is the set Q = {q_{v_1}, ..., q_{v_k}} of VM type quality scores that will be used by the scheduler for service deployment (line 14).

4.3 Scheduling Approach

Given the set S of services, the provider schedules each service instance s ∈ S on the VM type v ∈ V which can guarantee an enhanced end-user service quality. We compute for each v a quality score q_{s,v} using the approach described in Sec. 4.2.2, which is based on the VM computing specifications and the network capabilities of the node hosting that VM. Then, the service instances are scheduled so as to maximize the overall quality of the selected VMs, guaranteeing an enhanced service quality to end users.

We model the optimisation problem as a binary integer linear programming problem, as reported in the formulation below. The binary variables x_{s,v} model the placement of a service s on the VM type v, and assume value 1 only when the service is actually scheduled on that VM. The coefficients q_{s,v} are the VM quality scores computed by the VM evaluation process previously described, which measure the suitability of the VM type v for hosting the service s. The objective function (I) aims to maximize the quality of the final scheduling, assigning to each service s the most suitable VM:

    maximize_x̄  Σ_{s∈S} Σ_{v∈V} q_{s,v} x_{s,v}        (I)

    s.t.  Σ_{v∈V} x_{s,v} = 1    ∀s ∈ S                 (II)

          Σ_{s∈S} x_{s,v} ≤ k_v   ∀v ∈ V                (III)

where x_{s,v} = 1 if s is scheduled on v, and x_{s,v} = 0 otherwise. Each service has to be scheduled on exactly one VM, as expressed in (II), and the number of times a VM type is assigned to new services cannot exceed the number k_v of available VM instances, as imposed by (III).
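To make the formulation concrete, the sketch below expresses (I)-(III) with the open-source PuLP modeller. PuLP is our choice for illustration (the paper does not name a solver), and all identifiers are hypothetical; any off-the-shelf ILP solver, or even a heuristic, could be substituted without changing the formulation itself.

    import pulp

    def schedule(services, vm_types, q, k):
        """Maximize the total quality score subject to (II) and (III).

        services: list of service ids; vm_types: list of VM type ids;
        q[(s, v)]: quality score of VM type v for service s (Sec. 4.2.2);
        k[v]: number of available instances of VM type v.
        """
        prob = pulp.LpProblem("edge_scheduling", pulp.LpMaximize)
        x = {(s, v): pulp.LpVariable(f"x_{s}_{v}", cat="Binary")
             for s in services for v in vm_types}
        # Objective (I): total quality of the chosen VMs.
        prob += pulp.lpSum(q[s, v] * x[s, v] for s in services for v in vm_types)
        # Constraint (II): each service is placed on exactly one VM type.
        for s in services:
            prob += pulp.lpSum(x[s, v] for v in vm_types) == 1
        # Constraint (III): do not exceed the available instances per VM type.
        for v in vm_types:
            prob += pulp.lpSum(x[s, v] for s in services) <= k[v]
        prob.solve()
        return {s: v for (s, v), var in x.items() if var.value() == 1}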

5 VALIDATION OF OUR APPROACH

In this section we present the experiments we carried out to analyse the effectiveness of the proposed framework in terms of the service quality experienced by end users.

5.1 Experimental Setup

The experiments evaluate how the different deployment solutions affect the live video streaming service time experienced by end users, and analyse the impact on its components, namely the network delay and the request processing time, for different numbers of users joining the input video stream. A comprehensive description of the different scenarios considered in our experiments is provided in Sec. 5.2.


We simulated these scenarios using the EdgeCloudSim simulator (Sonmez et al., 2017). This simulator extends the CloudSim toolkit (Calheiros et al., 2011), providing extra functionalities for the modelling of networking resources and edge nodes, and thus allowing an accurate simulation of real edge infrastructures. The results obtained not only show the benefit of the edge platform in terms of service time, but also denote the need for an edge-specific scheduling solution to obtain optimal results.

Algorithm 1: Latency-Sensitive Scheduling.
  Data: Service s to be scheduled
  Data: Latency constraint t requested by the service provider
  Data: Users' locations U = {u_1, u_2, ..., u_m}
  Data: Nodes N = {n_1, n_2, ..., n_k}
  Data: Nodes' locations L = {l_1, l_2, ..., l_k}
  Data: VM types VM = {v_{i,1}, v_{i,2}, ..., v_{i,q}, ∀i ∈ N}
  Data: Array Q of quality scores for each VM v ∈ VM
  Result: The VM vm_s scheduled for s
   1  begin
   2    Define a set of user groups G = {g_i : |u_j − l_i| < ε, ∀u_j ∈ U, ∀l_i ∈ L};
   3    forall n ∈ N do
   4      forall g ∈ G do
   5        Estimate the network path p_{g,n} with the lowest latency t̃_{g,n} between g and n;
   6        Estimate the available bandwidth b̃w_{g,n} between the users in g and the node n, and the required bandwidth bw_g on p_{g,n};
   7        Compute the latency score q_{n,l,g} = LatencyScore(t, t̃_{g,n});
   8        Compute the bandwidth score q_{n,bw,g} = BandwidthScore(bw_g, b̃w_{g,n});
   9      q_{n,l} = (Σ_{i=1}^{n} |g_i| q_{n,l,g_i}) / (Σ_{i=1}^{n} |g_i|);
  10      q_{n,bw} = (Σ_{i=1}^{n} |g_i| q_{n,bw,g_i}) / (Σ_{i=1}^{n} |g_i|);
  11      forall v ∈ n do
  12        q_{v,res} = ComputingScore(w, w̃_v);
  13        Compute the quality score Q[q_v] = 3 / (1/q_{v,res} + 1/q_{n,l} + 1/q_{n,bw});
  14    vm_s = Scheduler(Q);
  15    return vm_s;
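As a complement to the pseudocode, a minimal Python sketch of the score-combination steps (Alg. 1, lines 9, 10 and 13) could look as follows. The data layout and names are our own assumptions, and all sub-scores are assumed strictly positive.

    def vm_quality_scores(group_sizes, latency_scores, bandwidth_scores,
                          resource_scores):
        """Combine per-group scores into per-VM quality scores for one node n.

        group_sizes[g]: number of users in group g;
        latency_scores[g], bandwidth_scores[g]: q_{n,l,g} and q_{n,bw,g}
        computed by the utility functions (Alg. 1, lines 7-8);
        resource_scores[v]: q_{v,res} for each VM type v on node n (line 12).
        """
        total = sum(group_sizes.values())
        # Lines 9-10: means weighted by the number of users in each group.
        q_l = sum(group_sizes[g] * latency_scores[g] for g in group_sizes) / total
        q_bw = sum(group_sizes[g] * bandwidth_scores[g] for g in group_sizes) / total
        # Line 13: harmonic mean, penalizing a VM whose sub-scores diverge.
        return {v: 3.0 / (1.0 / q_res + 1.0 / q_l + 1.0 / q_bw)
                for v, q_res in resource_scores.items()}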
5.2 Scenarios

We defined a set of simulation scenarios in which we analysed the impact of different deployment solutions and scheduling policies on the service quality experienced by end users, in the context of live video streaming services. In the remainder of this section we provide a detailed presentation of the network design and scheduling approach of each scenario devised.

Cloud. We consider a centralized deployment solution where both the video processing and the content distribution are carried out on a cloud data center. Essentially, given an incoming video to be streamed, all the encoding/transcoding operations are executed on a cloud data center. User requests to access the live stream are also directly forwarded to the cloud server responsible for the stream distribution.

We modelled the cloud resources as a set of infinite VM instances whose specifications are taken from the Amazon m2.4xlarge and are reported in Tab. 1. The user-data center connection has been defined as a single link, whose capacity has been fixed at 1000 Mbps, representing the aggregation of the single links connecting the user access BSs to the cloud. We also defined a communication delay δ_{u,c} = 0.09s, which models the delay related to the queuing and processing operations and to the physical distance characterizing the network route between the user u and the cloud data center c. This value has been estimated by averaging the latency of ICMP requests between hosts in Europe and Amazon Web Service instances in the same time zone.

Content Delivery Network (CDN). We defined a two-tier network architecture, characterized by an origin server in a massive cloud data center and 10 geographically distributed replica servers. In this scenario the encoding/transcoding operations are executed on the cloud servers, while the content is distributed among the replica nodes. Users are randomly distributed in proximity of all replica nodes, and the content distribution follows the schema designed in (Pathan and Buyya, 2007). Therefore, a user request is first redirected to the closest replica server and, if the content is already cached there, it is directly returned; otherwise, the request is forwarded to the cloud server to fetch the content.

In this scenario the origin and replica servers have different purposes and thus different hardware configurations. We assumed that CDN (replica) nodes are small data centers located only in strategic points, such as ISP points of presence (PoPs) or internet exchange points (IXPs).


CDN servers' access bandwidths are distributed as a Pareto distribution with mean value µ = 500 Mbps, and each user-server connection is characterized by a communication delay δ_{v,s} = 0.013s. Similarly to the cloud scenario, δ_{v,s} measures the expected delay due to the physical characteristics (e.g., number of hops and distance) of the route between the host and the closest CDN server. We modelled the connection between the CDN servers and the cloud origin server as a high capacity link with an average bandwidth of µ = 750 Mbps, since we assumed that they are directly connected to the ISP backbone. We also defined a link communication delay δ_{e,c} = 0.03s, modelling the delay along the path between the CDN and the origin server based on the number of hops and the physical distance between them. Moreover, since the main purpose of CDN networks is to deliver content, the VM instances used in this scenario are storage optimized. Therefore, we used the Amazon i3.large specifications, as reported in Tab. 1.

Edge. In this scenario, according to the edge-based deployment solution described in Sec. 3, the service is entirely deployed on the edge nodes. Therefore, both the encoding/transcoding and the distribution operations are executed on the edge nodes, whereas the cloud has only management functionalities.

The designed network infrastructure is composed of 20 edge nodes, each co-located with a BS. The computing power of each node is rather limited. Each node provides 2 types of VMs, namely the Amazon m1.large and m1.xlarge instance types, but, due to the limited physical resources, only 10 instances can actually be instantiated on each node. The access bandwidth of each edge node has instead been modelled as a Pareto distribution with average value µ = 375 Mbps. In this scenario we also assume that the distance between edge nodes is small, in the order of 20 km, allowing the deployment of high speed inter-node connections through either dedicated links or single-hop connections. Thus, we modelled the inter-node bandwidth capacity also as a Pareto distribution, with mean value µ = 400 Mbps. The communication delay between the nodes i and j has been modelled, instead, by means of a uniform distribution U[0.006, 0.009].

In this scenario, we assume that users can access the video stream with 3 different types of devices, namely smartphone, laptop and tablet. Therefore, according to the edge-based service deployment described in Sec. 3, we assume that 3 different lightweight streaming engines (i.e., packages responsible for the encoding/transcoding operations and the content distribution) have to be deployed, one for each device type. Each instance is responsible for transcoding the video input into multiple bit rates and resolutions suitable for a given device. We used two different approaches for the scheduling of the instances, namely Cloud-based and Edge-based, described below.

Algorithm 2: FIXED provisioning algorithm.
  Data: Set of QoS requirements offered by the ASP: S = {QoS(x), x ≥ 0}
  Data: Estimated mean inter-arrival time: 1/λ
  Data: Estimated mean request duration: 1/µ
  Result: Number of VMs to acquire: V
   1  begin
   2    calculate the smallest number V such that λ/(Vµ) ≤ 1;
   3    t = min(QoS(x)), QoS(x) ∈ S;
   4    P(w(r_i) > t) = 1;
   5    ρ = λ/(Vµ);
   6    while P(w(r_i) > t) > p do
   7      calculate Π_W = ((Vρ)^V / (V! (1−ρ))) · (Σ_{n=0}^{V−1} (Vρ)^n / n! + (Vρ)^V / (V! (1−ρ)))^{−1};
   8      calculate P(w(r_i) > t) = Π_W · e^{−Vµ(1−ρ)t};
   9      if P(w(r_i) > t) > p then
  10        V = V + 1;
  11    return V;

In the Edge-based scenario, we schedule the services using the approach presented in Sec. 4.3. For the VM evaluation process, described in Sec. 4.2.2, we defined three utility functions, reported in Eq. 1, Eq. 2 and Eq. 3, for the evaluation of the network delay, the available bandwidth and the VM resources, respectively:

    u_{v,δ}(δ_{g,v}, δ̃) = S(1 − δ_{g,v}/δ̃)        (1)

    u_{v,B}(B_{g,v}, B̃) = S(B_{g,v}/B̃ − 1)         (2)

    u_{v,RPS}(RPS_v, W̃) = S(RPS_v/W̃ − 1)           (3)

For the evaluation of the network route delay between a user group g and a VM v, we compute a utility score using a sigmoid function S, which takes as input the route delay δ_{g,v} and the delay requirement δ̃ = 50 ms. The utility function in Eq. 1 evaluates the margin between the actual delay δ_{g,v} and the requirement δ̃, returning a score that gets closer to 1 the more the current delay falls below the requirement; otherwise, low values close to 0 are returned.
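A small Python sketch of these utility functions follows; the sigmoid steepness and the function names are illustrative assumptions, since the paper only states that a provider-defined sigmoid S is used.

    import math

    def S(z, steepness=8.0):
        # Generic sigmoid squashing a margin z into (0, 1); the steepness
        # value is an illustrative assumption, not given in the paper.
        return 1.0 / (1.0 + math.exp(-steepness * z))

    def delay_utility(delay, delay_req=0.050):
        # Eq. 1: close to 1 when the route delay is well below the 50 ms requirement.
        return S(1.0 - delay / delay_req)

    def bandwidth_utility(available_bw, needed_bw):
        # Eq. 2: close to 1 when the available bandwidth exceeds the group's needs.
        return S(available_bw / needed_bw - 1.0)

    def resource_utility(requests_per_sec, expected_load):
        # Eq. 3: close to 1 when the VM can serve more requests than expected.
        return S(requests_per_sec / expected_load - 1.0)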


Bandwidth and resource evaluation follow the same approach defined for the delay evaluation. Essentially, given the predicted available bandwidth B_{g,v} on the network route from g to v, the utility function in Eq. 2 returns a value that gets closer to 1 the more B_{g,v} exceeds the bandwidth B̃ needed by the number of users in g. The resource evaluation is achieved, instead, by comparing the computing capability of v, expressed in terms of requests per second RPS_v, with the expected number of requests W̃ generated by the expected workload (Eq. 3). Finally, the scheduling is performed by the scheduling approach in Sec. 4.3.

In the Cloud-based approach, instead, we adopted a cloud scheduling approach (Duong et al., 2012) that estimates the number of streaming engine instances needed to minimize the user waiting time. This approach aims to minimize the processing time by deploying a number of instances based on the expected workload. Alg. 2 shows the pseudo-code of this approach. It defines an M/M/V queuing model to determine the number of VMs to be instantiated in advance to guarantee a user waiting time compliant with the requirements. Three inputs are required: (i) the estimated mean inter-arrival time of requests 1/λ, (ii) the estimated mean duration of requests 1/µ and (iii) the target probability p that a request has to wait more than its QoS requirement. The first step of the algorithm (line 2) computes the initial number of VMs V such that the system utilization is lower than one. Then, the smallest waiting time t to be guaranteed by the service provider is determined (line 3). Afterwards, the algorithm increases the number of VMs to be instantiated until the probability that a new service request w(r_i) has to wait more than t, P(w(r_i) > t), is lower than the threshold defined by p. Finally, for each streaming engine type the number of replicas is computed, and the replicas are randomly distributed among the edge nodes.

Similarly to the CDN scenario, users are randomly spread in proximity of all edge nodes. A user request is first sent to the BS co-located with the closest edge node and then forwarded to the node containing the VM which hosts the encoder related to the type of device the user adopted to access the video stream.
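For reference, a runnable Python sketch of this provisioning scheme (Alg. 2), under our reading of the M/M/V model, is given below; it uses the standard Erlang C waiting probability, and the parameter names and example values are illustrative.

    import math

    def erlang_c(V, rho):
        # Probability that an arriving request must wait in an M/M/V queue
        # (Erlang C formula); rho = lam / (V * mu) is the server utilization.
        a = V * rho  # offered load in Erlangs
        term = a ** V / (math.factorial(V) * (1.0 - rho))
        return term / (sum(a ** n / math.factorial(n) for n in range(V)) + term)

    def fixed_provisioning(lam, mu, t, p):
        """Smallest V with P(wait > t) <= p, for arrival rate lam and service rate mu."""
        V = max(1, math.ceil(lam / mu))  # utilization below one (Alg. 2, line 2)
        if lam / (V * mu) >= 1.0:
            V += 1
        while True:
            rho = lam / (V * mu)
            # Alg. 2, lines 7-8: P(w(r_i) > t) = Pi_W * exp(-V*mu*(1-rho)*t)
            p_wait = erlang_c(V, rho) * math.exp(-V * mu * (1.0 - rho) * t)
            if p_wait <= p:
                return V
            V += 1

    # Example (hypothetical values): 10 requests/s, 0.5 s mean duration,
    # at most 5% of requests waiting longer than 0.2 s:
    # V = fixed_provisioning(lam=10.0, mu=2.0, t=0.2, p=0.05)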
5.3 Experimental Results

The results in Fig. 6(a) show that deploying a service on the edge achieves a considerable network delay reduction with respect to all the other deployment solutions. The highest reduction is around 4.5-fold with respect to cloud, and the lowest is more than 2-fold with respect to CDN. Moreover, this network delay reduction is not guaranteed by the platform alone; a suitable service scheduling algorithm is also necessary, as shown in Fig. 7(a). Indeed, scheduling algorithms that do not take into account the joint information of network conditions, user requirements and workload only partially exploit the advantages introduced by the edge infrastructure, leading to suboptimal results.

Additionally, processing time represents another critical factor for the edge platform, as depicted in Fig. 6(b) and Fig. 7(b). The values plotted represent the time spent by the streaming engines to package the video content to be delivered. These results show that the edge scenario yields the highest processing time, due to the limited computational resources provided by the edge nodes. The processing time in the cloud scenario is, obviously, the lowest among the three, owing to the high computing power of cloud resources. In the CDN scenario, instead, we obtain smaller values than in the edge scenario, since distributing the requests over multiple replica servers avoids server overloading. Moreover, the limited edge node resources not only produce the highest processing time, but may also become the system bottleneck as the load increases. The edge results in Fig. 6(b) and Fig. 7(b), describing the processing time obtained using our approach, show this increasing trend as the number of input devices grows. Essentially, in our approach, where each streaming engine instance has to process all the incoming requests from the associated user devices, we experience a faster rise of the processing time. Therefore, to reduce this high processing time in the edge scenario, the provider can horizontally/vertically scale the servers in the edge data centers or create new edge data centers in the overloaded locations. Results in Fig. 7(b) confirm this assumption, showing that predicting the number of VM instances for each streaming engine in advance enhances the performance, reducing the average processing time needed to elaborate each user request.

However, due to the significant network delay reduction provided by the edge-based solution, the higher processing time does not affect the overall service time as long as the processing capacity of the servers is not saturated, as depicted in Fig. 6(c) and Fig. 7(c). For very high numbers of users, instead, the higher processing time of edge introduces a system bottleneck that considerably affects the final service time experienced by end users (Fig. 7(c)). Nevertheless, the results in Fig. 7(c) confirm the need for scheduling solutions that take edge-specific features into account, and for an infrastructure whose processing capacity is compatible with the number of users in the edge data centers, in order to obtain good performance in the edge scenario.


Table 1: Virtual machine specifications.

                   m1.large   m1.xlarge   m2.4xlarge   i3.large
  CPUs             4          8           26           2
  CPU MIPS         2400       4800        20000        2400
  RAM (GB)         8          16          70           15
  Storage (GB)     2x420      4x420       2x840        unlimited
  Price (USD/h)    0.17       0.35        0.98         0.15

Figure 6: Average network delay (a), average processing time (b), and average service time (c) experienced by end users in the scenarios described in Sec. 5.1.

Figure 7: Comparison of the average network delay (a), average processing time (b), and average service time (c) in the edge scenario using a cloud scheduler and the edge scheduler we developed.

6 CONCLUSIONS

Edge Computing is a new computing paradigm which allows the efficient deployment of latency-sensitive services, in order to meet the strict requirements characterizing these services. However, an efficient service scheduling within the edge nodes is vital to fulfil these requirements. Therefore, in this work we focus on the edge service scheduling problem and propose a service-driven approach that aims to maximize the service quality experienced by end users by deploying services on the most suitable VMs in terms of computational and network resources.

We propose a two-stage score-based algorithm that first evaluates the eligibility of each available VM type to host a given service, assigning to each VM type a quality score, and then schedules the services in order to maximize the total score of the chosen VMs, guaranteeing higher service quality for end users.

We evaluate the performance of different deployment solutions, namely edge, CDN, and cloud, for latency-sensitive applications. The results suggest


that edge is the best solution in this context. Then, to validate our framework, we compare the average response time, processing time and network delay experienced by the users of our approach with those of a related work; our solution showed a much better performance. It is also clear that the promised benefits of edge computing can only be achieved when effective scheduling algorithms, which consider its peculiar features, are implemented.

We believe our work will bring more research and industry attention to this challenge and leverage the adoption of edge computing. As future work, we plan to enhance our algorithm by decentralizing the optimization, adding another decision output for the number of instances of services, and taking vertical and horizontal scaling into consideration.

ACKNOWLEDGEMENTS

This work has been supported by the Haley project (Holistic Energy Efficient Hybrid Clouds) as part of the TU Vienna Distinguished Young Scientist Award 2011, by the Rucon project (Runtime Control in Multi Clouds), FWF Y 904 START-Programm 2015, and by the Italian National Interuniversity Consortium for Informatics (CINI).

REFERENCES

Aazam, M. and Huh, E.-N. (2015). Fog computing micro datacenter based dynamic resource estimation and pricing model for IoT. In Advanced Information Networking and Applications (AINA), 2015 IEEE 29th International Conference on, pages 687–694. IEEE.

Aral, A. and Ovatman, T. (2016). Network-aware embedding of virtual machine clusters onto federated cloud infrastructure. Journal of Systems and Software, 120:89–104.

Bilal, K. and Erbad, A. (2017). Edge computing for interactive media and video streaming. In Fog and Mobile Edge Computing (FMEC), 2017 Second International Conference on, pages 68–73. IEEE.

Calheiros, R. N., Ranjan, R., Beloglazov, A., De Rose, C. A., and Buyya, R. (2011). CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Software: Practice and Experience, 41(1).

Dobrian, F., Sekar, V., Awan, A., Stoica, I., Joseph, D., Ganjam, A., Zhan, J., and Zhang, H. (2011). Understanding the impact of video quality on user engagement. In ACM SIGCOMM Computer Communication Review, volume 41, pages 362–373. ACM.

Duong, T. N. B., Li, X., Goh, R. S. M., Tang, X., and Cai, W. (2012). QoS-aware revenue-cost optimization for latency-sensitive services in IaaS clouds. In Distributed Simulation and Real Time Applications (DS-RT), 2012 IEEE/ACM 16th International Symposium on, pages 11–18. IEEE.

Guo, X., Singh, R., Zhao, T., and Niu, Z. (2016). An index based task assignment policy for achieving optimal power-delay tradeoff in edge cloud systems. In Communications (ICC), 2016 IEEE International Conference on, pages 1–7. IEEE.

Hu, Y. C., Patel, M., Sabella, D., Sprecher, N., and Young, V. (2015). Mobile edge computing—a key technology towards 5G. ETSI White Paper, 11(11):1–16.

Mao, Y., Zhang, J., and Letaief, K. B. (2016). Dynamic computation offloading for mobile-edge computing with energy harvesting devices. IEEE Journal on Selected Areas in Communications, 34(12):3590–3605.

Papagianni, C., Leivadeas, A., Papavassiliou, S., Maglaris, V., Cervello-Pastor, C., and Monje, A. (2013). On the optimal allocation of virtual resources in cloud computing networks. IEEE Transactions on Computers, 62(6):1060–1071.

Pathan, A.-M. K. and Buyya, R. (2007). A taxonomy and survey of content delivery networks. Grid Computing and Distributed Systems Laboratory, University of Melbourne, Technical Report, 4.

Piao, J. T. and Yan, J. (2010). A network-aware virtual machine placement and migration approach in cloud computing. In Grid and Cooperative Computing (GCC), 2010 9th International Conference on, pages 87–92. IEEE.

Shi, W., Cao, J., Zhang, Q., Li, Y., and Xu, L. (2016). Edge computing: Vision and challenges. IEEE Internet of Things Journal, 3(5):637–646.

Skarlat, O., Nardelli, M., Schulte, S., and Dustdar, S. (2017). Towards QoS-aware fog service placement. In Fog and Edge Computing (ICFEC), 2017 IEEE 1st International Conference on, pages 89–96. IEEE.

Sonmez, C., Ozgovde, A., and Ersoy, C. (2017). EdgeCloudSim: An environment for performance evaluation of edge computing systems. In Fog and Mobile Edge Computing (FMEC), 2017 Second International Conference on, pages 39–44. IEEE.

Wang, S., Zhang, X., Zhang, Y., Wang, L., Yang, J., and Wang, W. (2017). A survey on mobile edge networks: Convergence of computing, caching and communications. IEEE Access, 5:6757–6779.

Yu, C., Lumezanu, C., Sharma, A., Xu, Q., Jiang, G., and Madhyastha, H. V. (2015). Software-defined latency monitoring in data center networks. In International Conference on Passive and Active Network Measurement, pages 360–372. Springer.

Zeng, L., Veeravalli, B., and Wei, Q. (2014). Space4time: Optimization latency-sensitive content service in cloud. Journal of Network and Computer Applications, 41:358–368.

Zhao, T., Zhou, S., Guo, X., Zhao, Y., and Niu, Z. (2015). A cooperative scheduling scheme of local cloud and internet cloud for delay-aware mobile cloud computing. In Globecom Workshops (GC Wkshps), 2015 IEEE, pages 1–6. IEEE.
