Fair Scheduling in Input-Queued Switches Under Inadmissible Traffic

Neha Kumar, Rong Pan, Devavrat Shah
Departments of EE & CS, Stanford University
{nehak, rong, devavrat}@stanford.edu

IEEE Communications Society, Globecom 2004, p. 1713. 0-7803-8794-5/04/$20.00 © 2004 IEEE

Abstract: In recent years, several high-throughput low-delay scheduling algorithms have been designed for input-queued (IQ) switches, assuming admissible traffic. In this paper, we focus on queueing systems that violate admissibility criteria. We show that in a single-server system with multiple queues, the Longest Queue First (LQF) policy precludes a fair allocation of service rates¹. We also describe the duality shared by LQF's rate allocation and a fair rate allocation. In general, we demonstrate that the rate allocation performed by the Maximum Weight Matching (MWM) scheduling algorithm in overloaded IQ switches is unfair. We attribute this to the lack of coordination between admission control and scheduling, and propose fair scheduling algorithms that minimize delay for non-overloaded queues.

Keywords: Congestion Control, Quality of Service and Scheduling, Stochastic Processes and Queueing Theory, Switches and Switching, Resource Allocation

I. INTRODUCTION

The input-queued (IQ) switch architecture is widely used in high-speed switching. It is the preferred choice because its memory bandwidth requirements are low compared to those of output-queued and shared-memory architectures.

In an $N \times N$ IQ switch, we assume fixed-size cells (packets). Each input has $N$ FIFO virtual output queues (VOQs), one for each output². Packets queue up at the inputs, arriving at input $i$ for output $j$ at an average rate $\lambda_{ij}$. In each time slot, at most one packet can arrive at each input and at most one can be transferred to an output. Consider these conditions for $\Lambda = [\lambda_{ij}]$:

$$\sum_{j=1}^{N} \lambda_{ij} < 1 \;\; \forall i, \qquad \sum_{i=1}^{N} \lambda_{ij} < 1 \;\; \forall j$$

The first condition is enforced by the line-rate constraints at the inputs. Incoming traffic is called admissible when the second condition is satisfied and inadmissible otherwise. The switch scheduling problem reduces to a matching problem in a weighted bipartite graph with $N$ inputs and $N$ outputs³.

A. Background and Motivation

The primary performance metric of an IQ switch scheduling algorithm is the throughput it delivers. The Maximum Weight Matching (MWM) algorithm has been shown to achieve 100% throughput when arriving traffic is admissible and obeys the Strong Law of Large Numbers. Practical heuristics to approximate MWM [7] [4] have also been proposed. In a single-server queueing system, Longest Queue First (LQF) is also known to achieve 100% throughput. These results, however, assume admissible traffic conditions, whereas in practice traffic is frequently inadmissible. Herein lies our motivation for this paper, where we study scheduling policies for inadmissible traffic conditions in IQ switches.

Under admissible traffic, stable scheduling algorithms grant every flow its desired service, and the need for fairness in rate allocation does not arise⁴. Under inadmissible traffic, not all flows can receive their desired service. We observe the rate allocations performed by LQF and MWM in such a scenario, and prove that they lack fairness. This motivates our search for a scheduling policy that performs a fair rate allocation under inadmissible traffic. We now summarize our results.

B. Outline and Results

In section II, we formalize the notion of fairness that we will use. We then commence our study of IQ switch scheduling under inadmissible traffic conditions by observing the performance of LQF in a system with multiple queues and an over-subscribed server in section III. As an interesting side observation, we also present the duality shared by the rate allocations determined by LQF and Max-Min fairness. In section IV, we provide Fair-LQF, an algorithm that incorporates fairness into LQF to perform a provably fair rate allocation.

In section V, we extend our study to the $N \times N$ switch and observe the rate allocation performed by MWM. We show that it lacks fairness, and propose the Fair-MWM algorithm in section VI, conjecturing that this too performs a fair rate allocation. Our conclusions follow in section VII.

II. FAIRNESS IN SCHEDULING

To define a fair rate allocation for flows, we use the established notion of Max-Min fairness [1].

Definition 1 (Max-Min Fairness) Let there be $n$ flows arriving at a server of capacity $C$ with rates $\lambda_1, \cdots, \lambda_n$ respectively. A rate allocation $r = (r_1, \cdots, r_n)$ is called Max-Min fair iff
(i) $\sum_{i=1}^{n} r_i \le C$, $r_i \le \lambda_i$, and
(ii) any $r_i$ can be increased only by reducing $r_j$ s.t. $r_j \le r_i$.

Definition 2 (Fairness in a Switch) There exist $N^2$ flows in an $N \times N$ switch. We call $R = [r_{ij}]$ a fair allocation for $\Lambda = [\lambda_{ij}]$ iff it is Max-Min fair for every output.

¹ The formal definition of fairness is provided later.
² The Virtual Output Queueing architecture improves performance by preventing Head-of-Line blocking [5].
³ In this paper, we take the weight of the edge $(i, j)$ as the size of $VOQ_{ij}$.
⁴ In this paper, each VOQ defines one flow. This can easily be generalized to accommodate the arrival of multiple flows at an input that are intended for the same output.

III. LONGEST QUEUE FIRST AND FAIRNESS

We first consider a system of $N$ queues and a server of unit capacity. The arrival rate for queue $i$, $1 \le i \le N$, is denoted by $\lambda_i$, and the cumulative number of arrivals until time $n$ is denoted by $A_i(n)$, where $A_i(0) = 0$. We assume that the arrivals obey the Strong Law of Large Numbers (SLLN), which states:

$$\lim_{n \to \infty} \frac{A_i(n)}{n} = \lambda_i \quad \forall i \;\; \text{w.p. } 1 \tag{1}$$

Let $Q_i(n)$ denote the size of queue $i$ at time $n$, $D_i(n)$ the number of departures from queue $i$ until time $n$ ($D_i(0) = 0$), and $T_i(n)$ the number of time slots queue $i$ is scheduled for service in $[0, n]$. LQF always serves the longest queue, breaking ties arbitrarily. For $n \ge 0$, the equations below describe queue sizes, departures and the busyness of the server over time.

$$Q_i(n) = Q_i(0) + A_i(n) - D_i(n)$$

$$D_i(n) = D_i(n-1) + \big(T_i(n) - T_i(n-1)\big)\,\mathbf{1}\big(Q_i(n-1) > 0\big)$$

$$T_i(\cdot) \text{ is non-decreasing and } \sum_{i=1}^{N} T_i(n) = n$$

The dynamics of the system are completely described by $S(n) = (Q_i(n), D_i(n), T_i(n))_{i=1}^{N}$.

A. Fluid Model for a Queue

We employ the fluid model technique to study LQF [3]. To do this, we define a sequence of systems indexed by $r = 1, 2, 3, \ldots$ with the associated description vectors $S^r(t) = (Q_i^r(t), D_i^r(t), T_i^r(t))_{i=1}^{N}$. The relationship between $S^r(t)$ and $S(\cdot)$ is described by:

$$S^r(t) = S(rt)/r$$

The fluid model solution of the system is characterized by $\bar{S}(t)$ such that:

$$\bar{S}(t) = \lim_{r \to \infty} S^r(t)$$

$S^r(t)$ is a non-deterministic quantity. We would like to show that the limiting quantity $\bar{S}(t)$ is a deterministic solution to a set of differential equations. We proceed by showing the existence of a limiting quantity and its characterization.

If for all $i$, $|D_i^r(t) - D_i^r(t')| \le |t - t'|$, $|T_i^r(t) - T_i^r(t')| \le |t - t'|$, and $\lim_{r \to \infty} |A_i^r(t) - \lambda_i t| = 0$, then $\bar{S}(t)$ exists [2] and satisfies the fluid model equations⁵ as follows:

$$\bar{Q}_i(t) = \bar{Q}_i(0) + \lambda_i t - \bar{D}_i(t) \tag{2}$$

$d\bar{D}_i(t)$ … $d\bar{T}_i(t)$ …

B. Rate Allocation under LQF

When traffic is inadmissible, queues that do not receive their desired service grow unboundedly. Since LQF is a non-idling policy, it services a queue in every time slot. For large enough $t$, $r_i(t) = T_i(t)/t$ denotes the service rate allocated to queue $i$ by LQF. The fluid model equations (2) – (5) characterize LQF's rate allocation for arrival rates $\lambda_1, \cdots, \lambda_N$.

For admissible traffic, techniques such as Lyapunov analysis may be used to show that LQF services every queue at its arrival rate. For inadmissible traffic, we state the following theorem to describe LQF's rate allocation:

Theorem 1: Let $\lambda_i$ represent the arrival rate for queue $i$, $1 \le i \le N$. Assume $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_N \ge \lambda_{N+1} = 0$. Let $r_i(t)$ represent the rate allocated by LQF to queue $i$ in time slot $t$. Then $\forall t \ge 0$:
(i) $0 \le r_k(t) \le \lambda_k$ s.t. $\sum_k r_k(t) = 1$
(ii) Let $1 \le l \le N$ be the smallest index such that:

$$\Delta = \Big(\sum_{i=1}^{l} \lambda_i - 1\Big)/l > \lambda_{l+1}$$

Then $\forall k$, $r_k(t) = (\lambda_k - \Delta)^+$. That is, all queues served at a positive rate grow at the rate $\Delta$, while the rest grow at their respective arrival rates⁶.

[Figure 1: two panels, (a) LQF and (b) Max-Min Fair, each showing water of unit volume poured onto solid steps under gravity.]
Fig. 1. Comparison of LQF and Max-Min Fairness.

C. LQF vs. Max-Min Fairness

We momentarily digress to present an interesting duality that exists between the rates allocated by LQF and those allocated by Max-Min fairness. It is illustrated in Figure 1, where the shaded area represents a solid surface and the rest is empty space. There are $N$ steps such that step $i$ has depth $\lambda_i$ and unit width, i.e. there are $N$ columns with volumes $\lambda_1, \cdots, \lambda_N$ respectively, and column $i$ represents queue $i$ with arrival rate $\lambda_i$.

We begin our experiment by pouring water (of unit volume) through the hole.
