Quality and User Experience (2018) 3:4 https://doi.org/10.1007/s41233-018-0017-x

RESEARCH ARTICLE

A new QoE fairness index for QoE management

Tobias Hoßfeld1,5 · Lea Skorin‑Kapov2 · Poul E. Heegaard3 · Martín Varela4

Received: 7 December 2016 © The Author(s) 2018. This article is an open access publication

Abstract The user-centric management of networks and services focuses on the (QoE) as perceived by the end user. In general, the goal is to maximize (or at least ensure an acceptable) QoE, while ensuring fairness among users, e.g., in terms of resource allocation and scheduling in shared systems. A problem arising in this context is that the notions of fairness commonly applied in the QoS domain do not translate well to the QoE domain. We have recently proposed a QoE fairness index F, which solves these issues. In this paper, we provide a detailed rationale for it, along with a thorough com- parison of the proposed index and its properties against the most widely used QoS fairness indices, showing its advantages. We furthermore explore the potential uses of the index, in the context of QoE management and describe future research lines on this topic.

Keywords Quality of experience (QoE) · Quality of service (QoS) · Fairness · Fairness index

Introduction the QoE of diferent services is often diferent given the same network conditions; i.e., the way in which QoS can Quality of Experience (QoE) is “the degree of delight or be mapped to QoE is service-specifc. For example, voice annoyance of the user of an application or service” [26]. services can usually withstand higher loss rates than video It is generally accepted that the quality experienced by a streaming services, but are in turn more sensitive to large user of a networked service is dependent, in a non-trivial delays. Hence, given a network condition with certain QoS and often non-linear way, on the network’s QoS. Moreover, characteristics, the QoE experienced by users of diferent services can vary signifcantly. From the point of view of fairness, as we will see, we need not concern ourselves with * Tobias Hoßfeld the diferent aspects of how QoS afects QoE for diferent tobias.hossfeld@uni‑due.de; services, but rather how diferent users’ expectations, in tobias.hossfeld@uni‑wuerzburg.de terms of QoE, are afected by the underlying QoS: are most Lea Skorin‑Kapov (or all) users receiving similar quality levels, regardless of Lea.Skorin‑[email protected] the services they use? Poul E. Heegaard Standards such as ETSI TS 102 250-1 V2. 2.1 [11] spec- [email protected] ify how to compute various QoS metrics and highlight the Martín Varela need to consider customer QoE targets. However, fairness [email protected] aspects are not considered. From a network operator’s point 1 Modeling of Adaptive Systems, University of Duisburg- of view, QoE is an important aspect in keeping customers Essen, Essen, Germany satisfed, e.g., decreasing churn. This has lead to a number of 2 Faculty of Electrical Engineering and Computing, University mechanisms for QoE-driven network resource management, of Zagreb, Zagreb, Croatia aimed at maintaining quality above a certain threshold for 3 Department of Information Security and Communication every user (or in some proposals, “premium” users, at least). Technology, NTNU, Norwegian University of Science An issue common to all those eforts is that of dividing the and Technology, Trondheim, Norway available resources among users so as to maintain a satisfed 4 Oulo, Finland customer base. In this paper, we explore (in depth) a notion 5 Chair of Communication Networks, University of Würzburg, of QoE fairness, frst introduced in our previous work [15], Würzburg, Germany

Vol.:(0123456789)1 3 4 Page 2 of 23 Quality and User Experience (2018) 3:4

Table 1 QoE fairness, just like QoE, can be considered at diferent time scales, and its applicability can vary according to them Time scale Duration Interpretation Example: web QoE Example video QoE Related network metrics

Instantaneous Tens of ms In-session Not applicable Video frame Short term Seconds In-session Web objects, Single page DASH segment, single scene Avg. throughput, latency Mid term Minutes Single-session Web session Video scene, short clips Aggregated over time Long term Hours, days Multi-session Commonly visited sites Several episodes Aggregated over time

We note that, while very diferent QoE functions are needed to estimate QoE at these varying time scales, the notion of fairness remains unchanged (although its impact may be signifcantly diferent) to quantify the degree to which the users sharing a network, measures such as second-order statistics (variance, standard and using a variety of services on it, achieve commensurate deviation, coefcient of variation), entropy-based measures, QoE. In this paper, we expound upon both the concept of and diference to optimal solution, e.g. [6]. A key issue is QoE fairness and the proposed QoE fairness index. We show defning what is considered to be “fair”, and then designing that the QoS-fair methods of resource distribution among and evaluating various scheduling policies in terms of fair- users do not, in general, result in QoE-fair systems, even ness. For example, while proportional fairness relates to the when considering a single-service scenario, and therefore, idea that it is fair for jobs to receive response times propor- that QoE fairness needs to be considered explicitly when tional to their service times, temporal fairness respects the evaluating the performance of management schemes. We seniority of customers and the frst-come-frst-serve policy. further illustrate the diferences between QoS fairness and Wierman [46] provides an overview and comparison of vari- QoE fairness indices by means of concrete case studies. ous scheduling policies which are focused on guaranteeing The remainder of this paper is structured as follows. equitable response times to all job sizes. “Background and related work: notion of fairness and its Avi-Itzhak et al. [1] address the applicability of various applications” provides background on the notion of fairness fairness measures for diferent applications involving queue in shared systems and the networking domain, and discusses scheduling, such as call centers, supermarkets, banks, etc. the move to considering fairness from a user perspective. A is inherently linked to some kind of The move from QoS to QoE management and the motiva- performance objective, such as minimizing waiting times tion for considering QoE fairness are then further discussed or maximizing the amount of allocated resources. A com- in “From QoS to QoE management” . “QoE fairness index” monly studied trade-of when considering diferent resource specifes the requested properties of a fairness index, while allocation optimization objectives is that between efciency “Relative standard deviation and Jain’s fairness index” intro- and fairness [3, 24]. Moreover, a key question that arises duces the commonly used relative standard deviation (RSD) is at which granularity level should fairness be quantifed and Jain’s index. The properties of Jain’s index are further and measured [1]. Related to diferent granularity levels elaborated in “Issues with Jain’s index for QoE fairness”. is also the question of the time scales at which fairness is “Defning a QoE fairness index” presents the QoE fairness calculated, with most QoS fairness index measures used in index we proposed [15], and the rationale behind it. We pro- the literature (such as max–min fairness and Jain’s fairness vide an example of its application for web QoE and video index [20]) refecting long-term average system fairness. streaming QoE to demonstrate its relevance for benchmark- In contrast, a system may be considered short-term fair [9] ing and system design in “Application of the QoE fairness if for N competing hosts, the relative probability of each index”. Finally, “Conclusions and discussion” concludes this host accessing a shared resource is 1/N in any short inter- work and discusses further research issues. val. Deng et al. [9] further note that while short-term fair- ness implies long-term fairness, long-term fairness may not ensure short-term fairness. Background and related work: notion Table 1 gives example time scales and session-related of fairness and its applications elements for which performance measures may be derived. The time scale at which to compute fairness may be dif- Notion of fairness in shared environments ferent when targeting QoE fairness as opposed to targeting QoS fairness. In the context of QoE fairness, the fairness Fairness in shared systems has been widely studied as calculation time scale is linked to the time scale on which an important system performance metric, with diferent QoE is actually measured (or estimated), typically short- or application areas and lack of a universal metric. In gen- mid-term. eral, approaches quantifying fairness have relied mainly on

1 3 Quality and User Experience (2018) 3:4 Page 3 of 23 4

Notion of fairness in networking allocated for entity i. Jain’s index is primarily for assessing long-term fairness (e.g., averaged per user, session), but can In networking, fairness in resource allocation and scheduling also evaluate short-term fairness by considering the sliding is either linked to sharing resources evenly among the enti- window average of xi . Jain’s fairness index has also been ties, or scaling the utility function of an entity in proportion used to improve so-called transient fairness in the context to others. Flow based resource sharing, e.g., max–min fair- of congestion control, when computing the optimal initial ness, is the fundament of the design of TCP and fair queu- shaping rate for new fows entering a mobile network (rather ing scheduling approaches [8, 36]. A resource allocation is than using a fxed value and/or Slow Start-like method) [35]. said to be max–min fair if the bit rate of one fow cannot be Apart from Jain’s index, other indexes which (partly) increased without decreasing the bit rate of a fow that has a measure the fairness of shared resources are: variation, smaller bit rate. This defnition puts emphasis on maintain- coefcient of variation, and the ratio between the maxi- ing high values for the smallest rates, even though this may mum access share by a host and the minimum access share be at the expense of network inefciency [25]. (max–min index, [34]). In a more general and utility-driven approach, propor- tional fairness was introduced in the seminal works by Fairness from the user’s perspective Kelly [22] and Kelly et al. [23]. Questioning the notion of max–min fairness, Kelly et al. [23] argue that bandwidth While QoS fairness has been well established in the net- sharing should be driven by the objective of maximizing working community, less focus has been put on considering overall utility of fows, assuming logarithmic utility func- fairness from a truly user-oriented perspective. Following tions. Weighted proportional fairness is further defned as Kelly’s theoretical notion of weighted proportional fairness, the scaling of an entity’s utility function relative to others, Briscoe [5] sharply criticized fow-rate fairness, and argued such that the entities will allocate fow rates so that the cost that fairness should be considered from the point of view of they cause will equal the weight they choose [22]. congestion costs (cost fairness) or user benefts. He states An alternative bandwidth-sharing approach is that of that if fairness is defned between fows, then users can sim- -fairness and the associated utility maximization. Mo and ply create more fows to get a larger resource allocation. Walrand [28] propose this as a decoupled fairness criteria, Moreover, fow fairness is defned instantaneously, and has which each user can use to achieve fairness without con- no necessary relation to real-world fairness over time. In sidering the behavior of other users. Bonald and Proutiere other words, Briscoe’s criticism of fow-level fairness leads [4] introduce the notion of balanced fairness, referring to to the notion that fairness should be considered at a higher allocations for which the steady state distribution is insensi- level, where real-world entities are considered, such as peo- tive to any trafc characteristics except the trafc intensities. ple or organizations. They note that this insensitivity property does not hold for Following this perspective, recent papers have argued that utility-based allocations such as max–min and proportional a QoS fair system is not necessarily QoE fair, e.g., Mansy fairness, where an optimal allocation process depends on et al. [27], given the lack of consideration of service QoE detailed trafc characteristics such as fow arrival process models. Such models specify the relationships between user- and fow size distribution. level QoE and various application-layer performance indica- Any resource scheduling allocation between diferent tors (e.g., fle loading times, video re-bufering) or infuence entities (users, applications, fows/sessions, bitstreams) must factors such as device capabilities, context of use, network have a notion of fairness. For example, according to the Gen- and system requirements, user preferences, etc. eral Processor Sharing Model (GPS), each host is assigned As an example we consider QoE fairness in the context a fair portion of a shared resource for any time interval [36]. of bottleneck link sharing among adaptive video streams, While GPS has a binary outcome (a system is either fair or where the on/of nature of fows results in inaccurate client- not), other metrics (such as the max–min fairness index) side bandwidth estimation and leads to a potential unfair quantify the fairness level when the system is not perfectly resource demand [13, 27, 37]. fair [9]. A QoS fairness index should thus refect the distance De Cicco et al. [7] propose a client-side algorithm which between the actual and the idealised allocation. The fairness avoids on/of behavior until reaching the highest possible should be relative to the resource unit xi which is allocated to playback quality. However, while they focus on QoS fair- entity i relative to the other entities. Various measures have ness, the approach still faces such problems as heterogene- been proposed, both for measuring short- and long-term fair- ous user devices, thus the issue of achieving QoE fairness ness, as discussed previously. remains. Georgopoulos et al. [13] proposed an OpenFlow- The most frequently used QoS fairness metric is Jain’s assisted system that allocates network resources among index [20], which approximates the ratio between the squares competing adaptive video streams originating from hetero- of the frst and second order moments of the resources xi geneous clients, so as to achieve user-level (QoE) fairness.

1 3 4 Page 4 of 23 Quality and User Experience (2018) 3:4

The allocation utilizes utility functions relating bitrate to QoE, whereby the quality metric used to evaluate QoE is the B. evaluate subjecvly test condions on a objectively measured Structural Similarity Index (SSIM). rang scale, e.g. 5- point MOS scale They evaluate their system against other systems by compar- A. Subjects C. test condions, e.g. certain web pages with pre-defined page load me t in a test laboratory D. Based on subjecve results, e.g. Mean Opinion ing mean achieved QoE and QoE variance. Scores per test condion, a QoE model is derived Resultsfrom Mansy et al. [27] also argue that typical fow-rate (QoS) QoE Model: Mapping funcon literature between QoS andQoE, e.g. Q(t) fairness ignores user-level fairness and is ultimately unfair, 6. Model is used to evaluate observed QoS, e.g. PLT, in a system. PLT x_iof user iis mapped to QoE value y_i. 3. Contents are delivered thus proposing a QoE fairness metric in the range [0; 1] (shared resources). based on Jain’s fairness index. Their metric considers a set System of QoE values corresponding to bitrate allocation, calculated 1. users in the system, e.g. downloading a web page 2. System may have a shared taking into account factors such as user screen size, resolu- 5. Page Load Times (PLT) boleneck, as in our are measured objecvely at measurement study. Resources tion, and viewing distance. Further, Petrangeli et al. [37] the end user site. may be shared among users. 6. Fairness of system is evaluated by Measurement study incorporate the notion of maximizing fairness, expressed considering all QoE values y_i. in Secon3 as the standard deviation of clients’ QoE, into a novel rate adaptation algorithm for adaptive streaming. Villa and Fig. 1 Heegaard [45] specify a ‘perceived fairness metric’ as the Illustration of QoE management diference between the worst and best performing stream- ing sessions in terms of average number of rate reductions optimized QoE-driven network resource allocation, often in (i.e., discrimination events) per minute. This is however an the context of a concrete service. We focus instead on a fair- application-level (and application-specifc) QoS metric, and ness index independent of the underlying service and QoE not a general QoE fairness index. model used. We have defned a generic QoE fairness index Going beyond relating QoE to allocated bitrate, Gabale to serve, e.g., as a benchmark when comparing diferent et al. [12] measure video-delivery QoE in terms of the num- resource management techniques in terms of their fairness ber and duration of playout stalls, with the objective of fairly across users and services (Fig. 1). distributing stalls across clients. Mu et al. [31] propose a In the following section, we further elaborate on the moti- solution for achieving user-level fairness of adaptive video vation of going from QoS to QoE management, and on the streaming, exploiting video quality, switching impact, and need to consider QoE fairness in that context. cost efciency as fairness metrics. QoE fairness is computed based on calculation of the relative standard deviation (coef- fcient of variation) of QoE values. In their work on com- From QoS to QoE management puting a benchmark QoE-optimal adaptation strategy for adaptive video streaming, Hoßfeld et al. [16] use Jain’s fair- ness index to show that QoE can be shared in a fair manner A general view on fairness among multiple competing streams. It is clear that many approaches use application-level QoS In other areas such as ethics and ecnonomics, fairness does metrics (like number of stalls, video bitrate, video quality not, in general, relate to utility, but rather to how resources switches) and use measures such as Jain’s fairness index or are distributed among actors. We note in particular that a coefcient of variation to evaluate systems in terms of QoE better system is not necessarily fairer, and neither is a fairer fairness, e.g., [16, 21, 27, 42]. In the remainder of the paper system necessarily better. Utility and fairness are orthogonal (“Relative standard deviation and Jain’s fairness index” and concepts. “Issues with Jain’s index for QoE fairness”), we will argue For a simplifed (and light-hearted) view on the orthogo- that these measures are not necessarily suitable for QoE nality between fairness and utility, we could draw an anal- fairness. ogy to the cold-war era superpowers and their economic models. In the Soviet model, there was an emphasis on fair- Application of fairness index: (benchmarking ness, but the overall quality of life (QoL) was low (i.e., most of) QoE management in resource constrained everyone had similarly low QoL). In the American model, environments the emphasis was on quality of life, but only for those who could achieve it on their own, leading to higher average An important consideration is the applicability of a QoE QoL, but much lower fairness (QoL was much more vari- fairness index, for example in the context of scheduling, able across sectors of the population). While the economic resource assignment, optimization, etc. and societal merits of each approach are arguably not set- For the most part, approaches discussed in the previ- tled, we can draw a parallel to the notions presented in this ous sections aim to exploit the notion of QoE fairness for paper, namely that the overall QoE achieved on a system

1 3 Quality and User Experience (2018) 3:4 Page 5 of 23 4 is not directly related to how fair the system is, and vice- the advantage factor). If the operator were to consider QoE 1 versa. Depending on the goals and context of whomever fairness instead, the resource distribution could result in Um is in charge of managing the quality (in the context of this and Ul receiving bm < b ≤ bl ≤ RMAX , respecively, and their paper, an ISP, for instance), the relative weight of each can corresponding Q(bm) and Q(bl) values being closer together be valued diferently. (i.e., more QoE-fair). Depending on the relationship between the bi values and RMAX , both users could even experience Why QoE management over QoS management? their maximal possible quality. The use of QoE models to solve this resource allocation Our main working assumption is as follows: network opera- problem allows the operator to be “closer” to the users’ tors strive to maintain their users sufciently satisfed with needs in terms of service quality. their service that they will not churn, while simultaneously trying to maximize their margins. There are diferent ways On the need for QoE fairness in which an operator can go about this (e.g, lower prices, higher speeds, bundled services), but conceptually, they all Besides keeping their users sufciently satisfed, operators lead to a notion of utility, or perceived value that the users may care about doing so in a fair manner. Whereas in many derive from their network connection. cases users will not be aware of the quality experienced by Operators have a limited resource budget, and how they other users, there are several contexts in which they may allocate it will have a (possibly large) impact on the users’ (e.g., shared activities, applications involving social media), utility. One option, for example, would be to ensure that the and this can become a relevant factor, so distributing the network capacity is distributed evenly across users. How- resources in a “fair way” can be a smart business practice ever, it is easy to see that this fails if users have applica- for operators. As discussed in above, what is fair in the QoS tions with diferent QoS requirements. While the allocation domain, may not be fair in the QoE domain, and so a notion may seem reasonable from the QoS point of view, it fails to of QoE fairness becomes necessary. We note that this applies account for the users’ utility, which will vary with the appli- not only to scenarios where there are multiple diferent ser- cation or service under use. In this context, QoE provides a vices involved, but also in scenarios where a single service reasonable proxy measure for utility, and if the operator were is considered. In what follows, we focus on these single- to take QoE into account instead of QoS, a better distribution service scenarios, but the contributions presented herein of its resources could be achieved (for instance, assigning hold also for multi-service scenarios as well, provided that more bandwidth to users who are watching video than to QoE models for those services are available and comparable, those who are just browsing the web; or providing expedited which to the best of our knowledge is still an open problem.2 forwarding for users of real-time services such as VoIP or video-conferencing). QoE fairness and QoE management Let us consider a hypothetical scenario to illustrate the difference between QoS-based management and QoE- As mentioned above, QoE management problems often based management, as well as between QoS fairness and revolve around maximizing some measure of QoE (the Mean QoE fairness. We assume a video service delivered using Opinion Score in the simplest case, but ideally something HTTP Adaptive Streaming (HAS), with an associated QoE like the percentage of users who rate the service above a cer- model Q that takes into account the device on which the tain threshold). The literature further advocates the need to user is accessing the content (that is, like in the E-model for consider ensuring fairness among users, in particular related voice, mobile devices have a so-called “advantage factor”, to QoE fairness [13, 27, 37]. This generally leads to a multi- that considers e.g., convenience of use alongside device- objective optimization problem, typically, maximizing QoE specifc limitations, such as screen resolution). As the sim- subject to some fairness constraints. This can be approached plest scenario, we consider two users Ul and Um , accessing in several ways, such as: the service (from a laptop and mobile phone, respectively) over a shared link with capacity C < 2RMAX , where RMAX is the bitrate of the highest-quality video representation available. Now, doing a QoS-fair distribution of resources 1 This assumes a mechanism for determining the type of device each would result in both Ul and Um having the same available user is using, which could be implemented e.g., via suitable APIs, or bandwidth b < RMAX . However, given the diferent devices by monitoring whether users are making requests to the mobile ver- being used by each user, their QoE, as per Q, could be sig- sion of the HAS service. 2 I.e. given two models for diferent services, using the same scale, nifcantly diferent, with Um receiving higher QoE (due to it is not clear whether equal output values from them correspond to equal QoE for users of each service. In the case of diferent scales, this is even further complicated.

1 3 4 Page 6 of 23 Quality and User Experience (2018) 3:4

• A two-step approach, maximizing frst the average QoE, QoE model x1 with a second step to solve for maximum fairness while Q(x ) 1 scope of paper maintaining the previously determined average quality y1 level. QoS parameters QoE model y2 QoE Fairness Index • An approach based on utility functions, where the opti- x2 Q(x ) F (y) mization targets (e.g., cost minimization, average quality 2 y maximization, fairness maximization) are combined into k a utility function.

QoE model We posit that in a QoE management context, we generally xk Q(xk) do not want to confound QoE fairness from overall QoE, e.g., by using utility functions. By treating QoE and QoE fairness as orthogonal goals, the operator can decide the Fig. 2 Scope of the paper: QoE fairness index. Please note that the correct trade-ofs and decide about the relevance of fairness. notion and the variables frequently used in this article are summa- This is easier from a practical point of view if there are two rized in Table 6 separate metrics for both concepts. Our aim is to provide a X into a QoE value y. For user i the corresponding QoS means for the provider to implement QoE management by parameters are xi , with QoE value yi. considering the various aspects independently, according to ↦ ; . their particular situation. The existence of a QoE fairness Q ∶ X y = Q(X)∈[L H] (1) metric does not imply that operators must use only QoE fair We note that Q does not need to be monotonic. assignment of resources, but should they need to, they have a L and H are the lower and uppper bounds of the QoE well-founded metric at their disposal. For further discussions scale, respectively, e.g., L = 1 (‘bad quality’) and H = 5 related to QoE management, diferent QoE metrics and fair- (‘excellent quality’) when using a 5-point absolute category ness, the interested reader is referred to Hoßfeld et al. [18]. rating scale. As an example for a QoE model, y = Q(x) is the mean opinion score (MOS) value corresponding to QoS x. In the literature, those QoE models are often derived by QoE fairness index subjective user studies, and typically only the MOS is used. But other QoE metrics (like median, quantiles, etc.) may be QoE models and QoE fairness especially of interest for service providers [14], which may be refected by the mapping function Q. We have proposed a QoE fairness index [15], F(Y), which enables us to assess the fairness of a provided service, for Desirable properties of a QoE fairness index which we assume that we have a set of QoE values (Y) pro- duced by a QoE model (about whose particulars we need As mentioned in the introduction, the most commonly used not worry) mapping a set of QoS parameters x to a unique quantifcation of (QoS) fairness is Jain’s fairness index [20]. QoE estimate y. It was designed for QoS fairness with the properties as intro- In resource management, network and service providers duced in the following section. We briefy explain those already use a notion of fairness at the QoS level, striving to properties and interpret them for QoE fairness. Please note allocate a fair share of resources (e.g., bandwidth) to each that we discuss the properties in detail for Jain’s fairness segment/session/user. However, as we will discuss in this index applied to QoE as well as our proposed index in “Prop- article, the notion of QoS fairness and fair share resource erties of the QoE fairness index F” and validate the indices allocation will in general not provide QoE fairness, and a with respect to these properties. A fairness index F(Y) maps new QoE fairness index is required in order to assess the the QoE values Y to a single scalar value. Y can be both a fairness at the QoE level. random variable and a set of samples in the following. Thus, Figure 2 shows a generic QoE model that maps the QoS F should have the following properties: factors (x) and other infuencing factors to a QoE value (y). For the notion of the QoE fairness metric, it is not important (a) Population size independence: it should be applicable which time-scale is considered by the QoE model. The fair- to any number of users. If the QoE values emerging ness index is independent of the time-scale. We consider a in the system follow a certain distribution Y, then the shared system of users consuming a certain service. For each actual number of users should not afect the fairness service there is a set of QoS parameters (of various key QoS index.Let Yx be the set of x samples of the RV Y. We infuence factors on QoE) given in a vector x. There exists demand: If Yn ∼ Y and Ym ∼ Y , then F(Yn)=F(Ym) , a mapping function Q taking the QoS parameters in the set

1 3 Quality and User Experience (2018) 3:4 Page 7 of 23 4

even if n ≠ m . For example, the absolute diference Please note that those properties are claimed in literature by n D = i=1 Yi − E[Y] from the average QoE E[Y] is a Jain et al. [20]. The main motivation for properties (c)–(e) measure∑ for the diversity of QoE values in the system. is to have an intuitive metric which provides continuous However, the more users n are in the system the larger values in the interval [0; 1]. The fairness index values are the value of D may get. Hence, such a metric is not con- thus comparable across systems. The (b) ‘scale and met- venient to quantify QoE fairness. Also the sum of Y or ric independence’ is crucial. In QoE assessment, diferent the standard error of Y depend on the sample size and rating scales are commonly used to assess quality, such as hence violate this property, while the expected value 5-point, 7-point, 11-point or continuous scales, which have and standard deviation fulfl it. diferent performance in terms of discriminatory power and (b) Scale and metric independence: the unit of measure- reliability, and also difer in assessment time and ease of use ment should not matter (for QoE this means independ- by the subjects. As examples, Tominaga et al. [43] discuss ent of L and H values). The main intention of the for- diferent rating scales for mobile video, Huynh-Thu et al. mulation of this property is the fact that the unit does [19] for high-defnition video, and Möller [29] for speech not infuence the fairness index. For example, it does quality. Moreover, the used QoE models may operate on not matter if kpbs or Mbps is used when considering diferent scales. For example, the transmission rating scale network throughput. For Jain’s index, the measurement of the E-model quantifes the quality of speech transmission scale requires to be a ratio scale with a clearly defned on a rating scale from 0, … , 100 . This scale is extended to zero point. On such a ratio scale, scale and metric inde- 0, … , 129 to consider wideband transmission [30]. Conse- pendence can be formulated as F(aY)=F(Y) for a > 0 . quently, we argue that a QoE fairness index needs to quantify However, QoE is measured on a category or interval fairness independent of the underlying scale. scale, see also “Relative standard deviation on an inter- When we want to quantify the QoE fairness we need an val scale”. Therefore, scale and metric independence index with the same properties as above, plus the following: means that the fairness index is the same when the QoE values are linearly transformed (to another inter- (f) QoE level independence: the fairness index is inde- val scale). We demand: F(aY + b)=F(Y) for a ≠ 0 and pendent of QoE level, whether system achieves good or any b. Please note that a negative value of a means that bad QoE. As discussed in “From QoS to QoE manage- the interpretation of the QoE values is inverted. Instead ment”, overall QoE and QoE fairness are orthogonal of the degree or delight of the user Y, the annoyance or concepts, and thus we want the QoE fairness index to dissatisfaction is expressed by −Y. be independent of the overall QoE of the system. We (c) Boundedness: the fairness index should be bounded demand: The fairness statistic F(Y) shall be independ- (without loss of generality it is set to be between 0 and ent of the sample mean E[Y] . The theorem by Basu 1). A bounded fairness index enables comparison of [2] shows that sample variance and standard deviation diferent sets of QoE values (e.g., from diferent appli- fulfll this property and are independent from the sam- cations) if the fairness index is mapped on the same ple mean. Therefore, we can concretize this property value range. We demand: F(Y)∈[0;1]. using the variance of QoE values. We demand: Given (d) Continuity: the fairness index should take continuous two systems with Var[Y1]=Var[Y2] and E[Y1] ≠ E[Y2] , values and changes in resource allocation should then F(Y1)=F(Y2) . A simple example for the rational change the index (e.g., the max–min ratio does not sat- of this property is as follows. Let us assume that all isfy this since it considers only the max and the min, users experience a fair QoE (3 on a 5-point scale). The ℝ and not values of xi in between). We demand: F(Y)∈ system is totally fair. If all users experience a good QoE ≠ � � ≠ � and F(Y) F(Y ) if Yi = Yi but ∃j ∶ Yj Yj . Please (4 on a 5-point scale), the system is obviously better, but the system is not fairer. Please note that a shift in note that the continuity allows to discriminate systems. QoE (i.e., changing the QoE level) without changing Although a discrete fairness index may be also useful the dispersion of QoE values around the mean does not in practice, the discriminative power of a continous afect the fairness index. index is benefcal in QoE management. (e) Intuition: the fairness index should be intuitive: high It is important to notice that as per property (f) above, F value if fair ( F(Y)=1 is “perfect” fairness), and low does not consider the absolute (average) QoE values across value if unfair ( F(Y)=0 , if possible, is totally unfair). all users in a system, i.e., it is independent of the QoE level. F(Y)=1 means that all users get the same QoE. The Thus, QoE managment solutions can separate between QoE most unfair system is when half of the users obtain the fairness and overall QoE of the system. Of course, QoE man- best quality and the other half get the worst quality. agement solutions will consider the overall QoE, see “From QoS to QoE management”.

1 3 4 Page 8 of 23 Quality and User Experience (2018) 3:4

Additional properties derived from desirable properties

(g) Deviation symmetric: the fairness index should only depend on the absolute value of the deviation from the mean value, not whether it is positive or negative. This property follows from (b). When considering the dis- tribution Y of QoE values, the fipped distribution Y (i.e., refection in a line parallel to the y-axis in the � middle of the QoE scale) is simply Y =−Y + L + H . � Thus, F(Y)=F(Y ) due to property (b) with a =−1 and b = L + H . Deviation symmetry can also be seen � 2 from property (f). Var[Y ]=(−1) Var[Y]=Var[Y] , and Fig. 3 � Illustration of property (f) that QoE level is independent from hence F(Y )=F(Y). QoE fairness for more fexiblity in QoE management (h) Valid for multi-applications: the fairness index should reflect the cross-application fairness (and not only As an example: we regard the fairness of a system between users of the same application). Property (h) (I) with an average QoE value ȳ = 4 on a 5-point ACR requires that a set of suitable QoE models exists for the scale and 50% of users with y = 3.5 and 50% with y = 4.5 , applications considered. If the QoE models fulfll this, and a system (II) with an average QoE ȳ = 2 with 50% of then the fairness index fulflls this property too. QoE users with y = 1.5 and 50% with y = 2.5 to have the same and QoE models are application specifc, and how to fairness. compare QoE values from diferent applications is a As another example, a (bad) system with an average separate and challenging topic that is outside the scope QoE ȳ = 1.4 , with 10% of users with y = 5 and 90% with of this paper. y = 1 is more fair than a system with an average QoE ȳ = 4 with 50% of users with y = 3 and 50% with y = 5 . This In an axiomatic theory of fairness in network resource property makes QoE fairness orthogonal to the overall allocation Lan et al. [24] demand similar properties: con- QoE of the system, which allows to objectively bench- tinuity, independence of the number of users, homogene- mark systems with respect to both aspects and to evaluate ity F(Y)=F(aY) (which follows from scale and metric the possible trade-of between QoE fairness and overall independence). Monotonicity is demanded which means QoE (see “Application of the QoE fairness index” for an that for n = 2 users, the fairness measure is monotonically example). Figure 3 visualizes QoE level indepence from increasing as the absolute diference between the two ele- QoE fairness. ments shrinks to zero. This is refected in property (e). We would like to highlight that property (b) Scale and The fairness index converges towards F = 1 if the users metric independence and property (f) QoE level independ- perceive the same QoE and the diference in QoE is zero. ence are key features. Since QoE is given on arbitrary inter- val scales, any linear transformation must not infuence the fairness index. To have a higher fexibility in QoE manage- ment, QoE level independence is necessary. This allows to Relative standard deviation and Jain’s mimic combined utility functions with relevance factors fairness index (e.g., for fairness, costs, overall QoE) defned by the pro- vider. The utility values are then easily derived as U(Y, FY ) . Arguably, the two most common indexes used in literature The other features (population size independence, bounded- for quantifying QoE fairness are the relative standard devi- ness, continuity, intuition) are desired to have a mathemati- ation (RSD) and Jain’s fairness index. They rely on sec- cally “nice” metric, which is intuitive and easy to interpret. ond-order moments of the QoE values Y (a random varia- We also note that in Hoßfeld et al. [15], we demanded ble resulting from mapping the QoS parameters X, another additional properties (deviation symmetry and validity for random variable, with the QoE model Q; Y = Q(X) ) in multi-applications) for a QoE fairness metric. However, after a system to numerically express the dispersion of QoE receiving feedback from the reviewers of this, we carefully values across users. analyzed those properties and revised them. In particular, we found that those properties follow from the set of properties above: (a)–(f). We discuss these derived properties below.

1 3 Quality and User Experience (2018) 3:4 Page 9 of 23 4

Relative standard deviation (RSD) with standard deviation

= 1 (H − L) The relative standard deviation c (also referred to as coef- u 2 (11) fcient of variation) is the standard deviation = Std[Y] of the QoE values normalized by the average QoE = E[Y]. and relative standard deviation Std[Y] H − L c = = . cu = . (12)  E[Y] (2) H + L

On a 5-point scale with L = 1 and H = 5 , then u = 3 , Given that �>0 then RSD c ≥ 0 . Using RSD as a fair- 2 = 2 c = = 0.67 ness index, then a low c represents a fair system while a u , and u 3 for the most unfair system. Apply- c = 2∕ 5 = 0.89 higher c refers to a more unfair system. Maximum fairness ing the same parameters to Eq. (9) gives max = 5∕3 c > c √ is achieved when c = 0 , i.e., all users experience the same (for max ), and hence max u . Generally, it can be proven that cmax > cu when H > L , which means that using QoE.Minimum fairness is obtained when RSD is cmax . Let RSD as a fairness index will never rate the most unfair sys- us consider the maximum standard deviation max for an tem (in our notion) as the most unfair system. observed average QoE . The max is obtained when a ratio p of users get the minimum QoE L and 1 − p get the maxi- This is illustrated in Fig. 4 which shows the maximum () c ( ) mum H. Then, the average QoE value is = pL +(1 − p)H , standard deviation ( max ) and RSD ( max ) as a function which can be transformed to of average QoE ( ) on a 5-point scale. It can be observed that the maximum RSD cmax is not achieved for the most H − unfair system at = 3 but at max = 1.67. p = . (3) H − L Thus we conclude that the RSD is not an intuitive fairness measure, as the most unfair system (Eq. 12) does not reach The maximum standard deviation max follows as the maximum RSD (Eq. 9).3 From Eq. (9), we further see 2 =(H − L) (1 − p)p. that the bounds of RSD depend on the actual rating scale. max (4) L = 0 √ In case of , the RSD, however, is not bounded and vio- Replacing p with Eq. (3) then we get max as a function of : lates property (c) ‘boundedness’. RSD also trivially violates property (f) ‘QoE level independence’, as the RSD depends ()= ( − L)(H − ) max (5) on the average QoE value (Eq. 2). √ and the maximum RSD cmax as a function of Furthermore, RSD does not fulfll property (g) ‘Deviation symmetric’ which is demonstrated in two simple scenarios, ( − L)(H − ) cf. Table 2. In scenario (A), 90% of users experience best c ( )= . max √ (6) QoE and 10% experience worst QoE. In scenario (B), the opposite ratio is observed, i.e. 10% of users experience best The cmax reaches its maximum value for satisfying: QoE and 90% experience worst QoE. Clearly, scenario A c () leads to better QoE than scenario B, however, both systems max = 0  (7) reveal the same unfairness. Nevertheless, the RSD is difer- ent in both scenarios, i.e. cA ≠ cB , and leads to very diferent which gives results. The RSD is not deviation symmetric. The RSD is further scale dependent and violates prop- 2HL = erty (b). A linear transformation of the QoE values Y will max H + L (8) also lead to diferent RSD values. We defne T(Y) as lin- and ear transformation with parameters a and b. For example, a =−b = 1 4 when normalizing QoE values from [1; 5] to 1 H − L [0; 1]. cmax = . 2 HL (9) T(Y)=aY + b. (13) √ For the linearly transformed QoE values, we observe a Let us return to using c as a fairness index. Intuitively, we dependency of the RSD value on the transformation. regard the most unfair system to be when half of the users experience lowest QoE L and half of the users experience E[T(Y)] = a + b, (14) highest QoE H, e.g., p = 0.5 in Eq. (3) and hence

= 1 (H + L) u 2 (10) 3 It is cu ≤ cmax , as H ≥ L.

1 3 4 Page 10 of 23 Quality and User Experience (2018) 3:4

Table 2 Illustrative scenario id Best QoE (%) Worst QoE Avg. Std. RSD J F and fairness indexes (%)

QoE values with L = 1 and H = 5 (A) 90 10 4.6 1.2 0.26 0.94 0.40 (B) 10 90 1.4 1.2 0.86 0.58 0.40 Normalized QoE values with L = 0 and H = 1 (C) 90 10 0.9 0.30 0.33 0.90 0.40 (D) 10 90 0.10 0.30 3.00 0.10 0.40

The RSD c and Jain’s fairness index J depend on the average QoE value. As a consequence, RSD and J are scale dependent and return diferent values after normalization of QoE values to [0; 1]

2 Jain’s index takes continuous values in [0; 1]. The maxi- (3.00; 2.00) mum fairness ( Jmax = 1 ) is reached for the minimum stand- ard deviation ( min = 0 ). If we consider the most unfair 1.5 e scenario with maximum standard deviation max (Eq. 7), we will expect that the fairness index reaches its minimum. 1 (1.67; 0.89) Substitute cu from Eq. (12) in Eq. (17) then (3.00; 0.67) 1 (H + L)2 maximum valu J = = . 0.5 u 1 + c2 2(H2 + L2) (18) max. Std. u max max. RSD c max Thus, in the maximum unfair scenario, a nonzero value 0 12345Ju > 0 is observed. Therefore, Jain’s index is not an intui- average QoE tive fairness measure, as the most unfair system does not reach the minimum value, i.e. Ju > Jmin when H > L . In fact, Fig. 4 The maximum standard deviation max and coefcient of the maximum RSD (Eq. 9) leads to the minimum possible c variation max depending on the average QoE value is plotted. We value Jmin. observe that the curve for max is symmetric – in contrast to cmax . The unfairest scenario ( 50% experience worst quality L and best quality 4HL 3 Jmin = . H, respectively, leading to = ) is well captured by max , but not by (H + L)2 (19) cmax due to the normalization by the observed (average) QoE level This means that the minimum value Jmin depends on the bounds of the value range [L; H]. If L = 0 , then Jmin = 0 . On Std[T(Y)] = a , J = 5 ≈ 0.56 (15) a 5-point scale, min 9 . Although J is bounded in a [0; 1], the lower bound Jmin is determined by the value range cT = . [J ;1] a + b (16) [L; H] and the bounds are min . Property (c) is partly fulflled. Table 2 illustrates the scale dependency numerically. Sce- nario (C) and (D) are equivalent to Scenario (A) and (B), however, a normalized scaled is used instead of a 5-point Issues with Jain’s index for QoE fairness scale. It is cA ≠ cC and cB ≠ cD , respectively. (We note that index F will be introduced in detail later on in “Defning a In the following, we demonstrate that Jain’s fairness index QoE fairness index”, with descriptions left out at this point violates several desirable properties introduced in “Desirable for the sake of readability.) properties of a QoE fairness index”.4 We further illustrate severe issues for its application in the QoE domain. Jain’s fairness index J

The well-known Jain’s fairness index J can be applied to 4 We have already seen in “Relative standard deviation (RSD)” that QoE values Y and is a function of the RSD c. the RSD violates several desirable properties. Since J and RSD are inversely proportional, cf. Eq. (17), this implied that J violates some 1 E[Y]2 J = = . of those properties too. In “Issues with Jain’s index for QoE fairness”, 1 + c2 E[Y2] (17) we will visualize those violations for J to make the reader aware of how severe these violations are.

1 3 Quality and User Experience (2018) 3:4 Page 11 of 23 4

1 1

J : [1;5] 0.9 0.9 F : [L;H] x

0.8 J : [1;10] 0.8 J : [0;H] F : [L;H] 0.7 J : [0;H] 0.7 fairness inde fairness index J : [1;5] 0.6 J : [1;10] 0.6 J : [0;H] J : [0;H] F: [L;H] F : [L;H] 0.5 0.5 00.2 0.40.6 0.81 00.2 0.4 0.6 0.8 1 standard deviation average QoE

Fig. 5 Property (b) ‘Scale Independence’: Jain’s fairness index is Fig. 6 J is very sensitive to QoE values close to zero. In contrast, violating scale independence. It matters if scores are linearly trans- the proposed fairness index F is QoE level independent. A constant ∗ formed. In contrast, the proposed fairness index F is scale independ- standard deviation = ∕4 = 0.025 is assumed on a normalized ent scale [0; 1]

Scale and metric dependency of J dependence. WHere do I see this?. If L = 0 , J is very sensi- tive to QoE values close to zero as depicted in Fig. 6. The scale dependency of J is caused by the dependency of We consider here a constant standard deviation = 0.1 the RSD on the actual scale, as shown in Eq. (16). To be on the 5-point MOS scale and vary the average QoE value . more precise, a linear transformation T(Y) of the QoE values Such a small is reached when 50% of the users get max- impacts J. imum QoE 5 and 50% get a QoE of 4.8. This is also reached Figure 5 highlights the dependency of J when using dif- when 50% of the users get minimum QoE 1 and 50% get a ferent QoE domains with varying L and H. On a 5-point QoE of 1.2. Another scenario leading to the same = 0.1 scale [1; 5], the same average QoE level = 2 is considered is the following. 99.9375% obtain QoE 5 and the remaining and only the standard deviation of the QoE values is var- 0.0625% obtain QoE 1. ied. The QoE values are then transformed to diferent rating However, Jain’s index is varying from 0.5 to 1, as depicted scales [L; H]. It can be seen from Fig. 5 that Jain’s fairness in Fig. 6 which shows the fairness index J for normalized ∗ index is not scale independent. QoE values for varying and constant . Due to the nor- ∗ Please note that Jain’s index was developed for measures malization, = ∕4 = 0.025 see Eq. (15), we observe that on a ratio scale (like bandwidth or waiting times), which J is clearly sensitive to close to zero. A small shift of the means L = 0 . In that case (ratio scale), Jain’s index fulflls average QoE signifcantly decreases J. the scale independence. It does not matter e.g., if band- Furthermore, Jain’s index is not able to capture fair- width is measured in kbit or MByte. Only in the case of ness when higher values on the scale mean a lower QoE. L = 0 , Jain’s index is scale independent. We observe that An example considers the following quality degradation J is equal for any H, if L = 0 . Let us assume = Std[Y0;1] scale. 0—no degradation, 1—perceptible but not annoying, and = E[Y0;1] when considering the QoE values on a [0; 1] 2—slightly annoying, 3—annoying, 4—very annoying, 5— scale. Now we consider the linearly transformed values on extremely annoying. Let us consider that n − 1 users expe- the scale [L; H], i.e. rience the best quality 0 and 1 user obtains a 1. Then, the average QoE is E[Y]=1∕n , while the coefcient of variation T(Y)=(H − L)Y + L (20) follows as cY = (1∕n) . Then J = 1∕n . In the limit, J con- a = H − L b = L c with and in Eq. (13). The RSD T of the verges towards lim√n→∞ 1∕n = 0 . Hence, in the best and fair- L = 0 transformed values (Eq. 16) is equal to c, if . est scenario, J quantifes the scenario as completely unfair. a c = c ⇔ = ⇔ b = 0. T a + b  (21) QoE level dependence of J

However, in the QoE domain the most common scale is From Fig. 6, we further see that Jain’s fairness index is QoE L = 1 H = 5 the 5-point MOS scale with and . When using level dependent. A more explicit visualization is provided in normalized QoE values in [0; 1], Jain’s fairness index Fig. 7 which clearly illustrates the QoE level dependence of

1 3 4 Page 12 of 23 Quality and User Experience (2018) 3:4

1 1

J 0.95 0.95 =2 F: p=0.1 =3 0.9 =4 0.9 F: p=0.3 0.85 fairness index F: p=0.1 0.85

Jain's Fairness Index avg. QoE = 2 0.8 F: p=0.3 avg. QoE = 3 J: p=0.1 avg. QoE = 4 J: p=0.3 0.8 0.75 0 0.2 0.4 0.6 0.81 -1 -0.5 00.5 1 standard deviation discrepancy

Fig. 7 Property (f) ‘QoE Level Independence’: On a 5-point scale, Fig. 8 Property (g) ‘Deviation Symmetry’: In this scenario, a ratio of Jain’s fairness index is computed for varying standard deviation . p users experience 2 and 1 − p experience 2 + . We observe that J Each curve refects an average QoE value = 2, 3, 4 . The curves are violates property (g) ‘Deviation symmetric’ and is less sensitive than not overlapping and depend on . Jain’s fairness index J is QoE level F dependent – in contrast to F

1 J 5 J QoE fairness F . In particular, is plotted against the standard deviation Jain's index J for diferent average QoE values = 2, 3, 4. 0.8 The three diferent curves for the average QoE are not x overlapping, although the same standard deviation from the 0.6 mean value is observed. For higher average QoE values, the same standard deviation leads to higher fairness. For = 4 0.4 and = 2 , we observe a value of Jain’s fairness index about fairness inde 0.95 and 0.8 for = 1. 0.2

Deviation asymmetry of J 0 00.2 0.4 0.6 0.81 The desired property (g) ‘Deviation Symmetry’ means that ratio of users experiencing QoE H the fairness index should only depend on the absolute value of the deviation from the mean value, not whether it is posi- Fig. 9 Property (g) ‘Deviation Symmetry’: two user groups are con- tive or negative. sidered in this illustration. A ratio p of users gets maximum QoE H = 1 and 1 − p gets minimum QoE L = 0 . Jain’s fairness index fol- Therefore, a scenario is considered in which a ratio of lows as J = p , see Eq. (27). The proposed fairness index F is derived 2 p users experience 2 and 1 − p experience 2 + . Figure 8 as F = 1 − 2 = 1 − 2 p − p , see Eq. (25). If 50% of the users get plots now Jain’s fairness index J against the discrepancy 0 and 50% get maximum√ QoE, then this is maximum unfair, F = 0 . J = p = 1∕2 ∈ [−1;1] between the two user classes. We observe that However, Jain’s index is not deviation symmetric, as the resulting curves for p = 0.1 and p = 0.3 are not symmetric at = 0. Another illustration is provided in Fig. 9. Normalized 2 2 E[Y ]=py (23) QoE values are considered with L = 0 and H = 1 . Again, p 2 2 two user classes are considered. A ratio of users obtains Var[Y]=y (p − p ) (24) QoE y and 1 − p obtain worst QoE L. In that scenario, we obtain the following statistical measures and fairness index Std[Y]=y p − p2, (25) J. √ 1−p E[Y]=py cY = , (26) (22) p

J = p. (27) J 5 The QoE level dependence of J is visible in Figs. 6 and 7 as well as This scenario provides an intuitive meaning for . Relating from Eq. (17). the interpretation to QoS and resource allocation, a ratio

1 3 Quality and User Experience (2018) 3:4 Page 13 of 23 4 of p users share the resource and obtain a rating of y > 0 ; Defning a QoE fairness index the other 1 − p users receive zero resources which means no service, hence a rating of L = 0 . If p = 100% of users Before presenting the formal defnition of F, we briefy get y = H , then J = 1 (maximal). If p = 100% of users get sketch the rationale behind it. After the defnition, we dis- y = L = 0 , then J = 0 (minimal). However, J = 0 represents cuss its properties, and compare it to Jain’s index. a totally unfair system, but all users get the same (bad) rat- ing. Jain’s fairness index is not intuitive when applied to Rationale for a QoE fairness index QoE. Similar thoughts for other values p and q = 1 − p (cf. Table 2) show that Jain’s fairness index violates the devia- Jain’s fairness index is not applicable as a QoE fairness tion symmetry. index, as it violates some of the desired properties as speci- fed in “Desirable properties of a QoE fairness index”. It is Relative standard deviation on an interval scale a reasonable approach to only consider the standard devia- tions without relating them to mean values when defning A major concern of the application of Jain’s fairness index QoE fairness. The standard deviation of the QoE values in the QoE domain is the typical interval scale of the QoE Y quantifes the dispersion of the users’ QoE in a system. function Q in Eq. (1). The RSD may not have any meaning There exists a maximum standard deviation of the QoE for data on an interval scale. For the computation of an RSD, value Y over the bounded value domain [L; H].6 The maxi- 1 a ratio scale is required which contains a natural zero value, mum max is obtained when 2 of the users experience L and like ‘no waiting time’ ≡ 0s. H, respectively. In that case, the average QoE value is However, the MOS scale typically used in QoE models E[Y]=(L + H)∕2 is not a ratio scale. There is no meaningful zero value on (28) the QoE scales: ‘zero’ would mean ‘no QoE’—which is and the maximum second order moment is not defned. Hence, the RSD of QoE values—and therefore 2 2 2 E[Y ]=(L + H )∕2. (29) Jain’s index—have no meaning for QoE values. The MOS scale can be considered as an interval scale as concluded by Then, the maximum standard deviation is Norman [32]. Therefore, it is required to use other statistics 1 = (H − L) . (like the standard deviation) to measure the deviation from max 2 (30) the mean. The average QoE is diferent from the MOS, as users in Remark For QoS Fairness, the usage of relative standard the system are experiencing diferent conditions resulting Q x deviation as in Jain’s fairness index is very reasonable. An into certain QoE values ( ). When computing a MOS, all example for a QoS measure is bandwidth which measures subjects experience the same test condition and the average the bandwidth of a user on a ratio scale with a meaningful over all user ratings is derived. zero value (‘no bandwidth’). A new QoE fairness index F However, Jain’s fairness index may also be difcult to F interpret if the data is measured on a ratio scale—which We defne the fairness index as a linear transformation of the standard deviation of Y to [0; 1]. The observed is allows to compute the RSD.Consider the following exam- ple. The QoS measure is delay, e.g., web page load time, normalized with the maximal standard deviation max and which measures a duration on a ratio scale with a meaningful measures the degree of unfairness. Hence, the diference ∕ zero value (‘no delay’). However, in that case, Jain’s index between 1 (indicating perfect fairness) and max is defned leads to counterintuitive results. In a scenario, where 100% as fairness index. J = 0 2 of users get no delay, . Figure 9 can be re-interpreted F = 1 − = 1 − . (31) when considering the ratio p of users experiences a delay of max H − L 1 [s], while 1 − p experience no delay. Thus, for QoS meas- ures like delays, Jain’s index cannot be directly applied to We note that F can also be interpreted in another way. The quantify QoS fairness. QoE values are normalized to the QoE domain [0; 1], Y − L Y∗ = . H − L (32)

6 The maximum standard deviation has already been derived in Eq. (7) in “Relative standard deviation (RSD)”. For readability and due to its importance for deriving F, the maximum standard deviation is repeated here.

1 3 4 Page 14 of 23 Quality and User Experience (2018) 3:4

2σ Table 3 Illustration of Jain’s J and QoE Fairness Index F for various scenarios and their distributions Y, L = 1, H = 5 Scenario Description J F

123451 All users experience 1 1.00 1.00 µ 2 50% experience 1 and 50% experience 2 0.90 0.75 3 50% experience 1 and 50% experience 3 0.80 0.50 Fig. 10 Illustration of the new QoE fairness index F (solid hatch) on 4 50% experience 1 and 50% experience 4 0.74 0.25 a typical 5-point scale with length H − L with = 3.5 and = 0.7 5 50% experience 1 and 50% experience 5 0.69 0.00 6 50% experience 2 and 50% experience 4 0.90 0.50 Then, the standard deviation is 7 50% experience 2.9 and 50% experience 4.9 0.94 0.50 8 Uniform distribution Y ∼ U(L;H). 0.75 0.42 ∗ = Std[Y∗]= H − L (33) = 1 and the maximum standard deviation is max 2 . Then, the ness measure is independent of L and H. To be more fairness index follows as precise, any linear transformation T(Y)=aY + b of the Y 2 QoE values does not change the value of the fairness F = 1 − 2 ∗ = 1 − . H − L (34) index. For the transformed values we obtain 2Std[T(Y)] Thus, (I) normalizing the standard deviation by the maxi- FT(Y) = 1 − T H T L 1 − ( )− ( ) mum possible standard deviation ( ) or (II) normal- max 2aStd[Y] 2Std[Y] ∗ ∗ ∗ = 1 − 1 − = FY . izing the QoE values Y with and max result both in the (aH + b)−(aL + b) H − L same F ∈[0;1]. (35) Figure 10 illustrates the meaning of the fairness index Hence, F is scale independent (which is also indicated F. A certain fraction of the QoE domain [L; H] is covered in Table 2). by the standard deviation around the average QoE in (c) Boundedness—F is bounded between 0 and 1. both directions. The size of the interval [ − , + ] is 2 (d) Continuity—F takes continuous values in [0; 1]. refects how unfair the QoE values are distributed over the (e) Intuitive—F is intuitive. The maximum fairness QoE domain. Accordingly, the fairness index F is the size of ∗ Fmax = 1 is for minimum standard deviation ( = 0 ). the complement of this interval, i.e. (1 − 2 ), normalized The minimum fairness Fmin = 0 is found when standard by the size of the QoE rating domain H − L. deviation is at its maximum; this happens in the most unfair scenario (50% of users get L and 50% get H). Defnition The QoE Fairness Index F is defned as the lin- 2 Any fairness value F can also be interpreted as follows ear transformation F = 1 − over the QoE Y of all users H−L when considering normalized QoE values. (A) Half consuming a service. A system is absolutely QoE fair when of the users get maximum QoE H = 1 and the other all users receive the same QoE value. half gets QoE y. Then, F = y . (B) Half of the users get minimum QoE L = 0 . The other half get QoE y. Then, F = 1 − y Properties of the QoE fairness index F . The equations are provided in Table 5. Exemplary numerical values are provided in Table 3. (g) Deviation symmetric—F does not depend on the abso- The proposed fairness index, F, needs to fulfll the proper- lute value of the deviation from the mean value, not ties as introduced in “Desirable properties of a QoE fairness whether it is positive or negative. This is clear from the index”. In the following, the properties are revisited and ana- defnition of F and visualized in Figs. 8 and 9. lyzed with respect to F. (f) QoE level independence—F is independent of the actual QoE level, whether the system achieves good (a) Population size independence—F is applicable to any or bad QoE. This is also clear from the defnition of F, number N of users in the system. The value of F is since F only depends on the deviation from the mean. independent of N. Figure 6 visualizes the QoE level independence. A con- (b) Scale and metric independence—The unit of measure- stant standard deviation is assumed while the average ment should not matter. In the context of QoE, the fair- QoE is varied. Since F is independent from , F is a

1 3 Quality and User Experience (2018) 3:4 Page 15 of 23 4

Table 4 Qualitative comparison of fairness indexes of L and H. Although J is bounded between [0; 1], its mini- J L H Property RSD c Jain’s J Fair. F mum value min depends on and . In this section, we will show in real numerical examples of YouTube QoE, those (a) Population size independent X X X limitations are severe. In particular, we show that J is not (b) Scale and metric independent – – X very sensitive and hardly discriminates fairness in diferent (c) Boundedness – (X) X scenarios. (d) Continuity X X X As example, HTTP video streaming and the impact of (e) Intuitive – (–) X video stalls on video QoE is considered. Hoßfeld et al. [17] (g) Deviation symmetric – – X provide a QoE model for non-adaptive video streaming in (f) QoE level independent – – X terms of MOS on a 5-point scale for N stalls and a total stall- (h) Valid for multi-applications X X X ing time T. In our numerical results, we assume T = NL with an average stall duration of L = 1s. Q(N)=3.5e−0.15T−0.19N + 1.5 = 3.5e−0.34N + 1.5 constant value which only depends on (and the QoE (36) Q∗ ∈[0;1] value range [L; H]). The normalized QoE function is (h) Valid for multi-applications—The index should refect Q(N)−L Q∗(N)= the cross-application fairness (and not only between H − L (37) users of the same application). The index should also be applicable to diferent applications. This property is with H = 5 and L = 1. respected by F, provided that the QoE mapping func- In order to illustrate the diferences between the QoE fair- tion Q allows to have comparable QoE values. Further, ness indexes, we consider N as a random variable describing F can be applied to any application, as it is based on the the number of stalls experienced by the users in the sys- deviation of the QoE values. The same is also true for J tem. We assume that N follows a binomial distribution, i.e. and RSD. However, literature also suggests other fair- N ∼ Binomial(K, p) . In this example, we assume K = 10 ness metrics which are only defned for a single appli- and vary p. Please note that it is not important for the illus- 8 cation and use case, e.g. Cofano et al. [6] as discussed tration which distribution is assumed. The probability pn in “Fairness from the user’s perspective”. that a user experiences n stalls is then n p = pK(1 − p)n−K . We observe that the defnition proposed fulflls the prop- n K  (38) erties outlined in “Desirable properties of a QoE fairness index”. The QoE fairness index F refects the system per- From Eq. (41), the key statistics can be derived. spective of fairness and quantifes the fairness of the entire ∞ system across all users.7 E[N]= npn = Kp (39) n=0 Qualitative comparison of fairness indexes

Std[N]= Kp(1 − p) A summary of the comparison between F and Jain’s index J (40) √ as well as the RSD is provided in Table 4. All three indexes 1 − p are population size independent, valid for multi-applications cN = . (41) and return positive continuous values. While F fulflls all √ Kp desirable properties, J and the RSD violate key properties: √ (g) ‘deviation symmetric’, see Fig. 8; (f) ‘QoE level inde- QoS fairness with respect to the stalls can therefore be pendence’, see Eq. (17) or Fig. 7; (f) ‘Scale and metric inde- derived via Jain’s index. pendence’, see Fig. 5. 1 Kp In the list of properties it is stated that the fairness index J = = N 2 Kp +(1 − p) (42) should be intuitive. This means for instance that the index 1 + cN should be at its minimum when the users’ experience (at least their scores) is maximally diferent. This holds for fair- ness index F, but not for J since this depends on the values

8 Realistic scenarios and measurement traces are provided in “Appli- 7 F must not be confused with standard deviation of user ratings in a cation of the QoE fairness index” to demonstrate the application of subjective study (for a system under test) i.e. the user rating diversity. the QoE fairness index F.

1 3 4 Page 16 of 23 Quality and User Experience (2018) 3:4

5 0.8 1 Avg. QoE 4.5 Avg. QoE Std. QoE 0.8 RSD 0.6 4 E E 0.6 3.5 Std. QoE 0.4 3 0.4

RSD c fairness index average Qo 2.5 0.2 second order Qo 0.2 QoEfairness F =1 – V/100 Jain’s index J =1/2 2 Jonascale[100;200] 0 1.5 0 020406080 100 0246810 QoE value V average number of stalls

Fig. 13 Comparison of J and F. A simple scenario is considered Fig. 11 Qualitative comparison between J and F. We assume a bino- with a continuous rating scale from [0; 100]. 50% of users experi- mial distribution of the number N of stalls and observe accordingly ence worst quality ( L = 0 ) and the other 50% experience QoE V. The those (frst and second order) QoE values dashed line indicates when the QoE scale is shifted to [100; 200]

1 depending on the average number of stalls E[N]=Kp . Based on E[N] , the parameter p = E[N]∕K of the binomial distribu- 0.8 tion is derived. We clearly observe the exponential decay of J : QoE [1;5] 5 the QoE model Q when considering the average QoE. The J : QoE [0;1] 0.6 1 standard deviation and the RSD show however a diferent Fairness Index F behavior. 0.4 The main observations can be seen in Fig. 12. Firstly, J : QoS fairness index J : QoS N J N QoS fairness N is diferent than QoE fairness quantifed J : QoE [1;5] 5 0.2 by F and J. In particular, the QoS fairness approaches zero J : QoE [0;1] 1 (i.e. completely unfair system) when the average number Fairness F 0 of stalls approaches zero. We further see the sensitivity of 0246810 Jain’s fairness index for values close to zero. QoE fairness mean number of stalls depicts a diferent behavior. In case of no stalling, all users get best QoE and the variance of the QoE values diminishes. Fig. 12 Qualitative comparison between J and F. Three key observa- Hence, the QoE fairness indexes are 1. With increasing num- tions are found. (1) QoS Fairness is diferent from QoE Fairness— ber of stalls in the system, the standard deviation increases independent of the QoE fairness index. (2) Jain’s index for QoE fair- ness depends on the rating scale ([1; 5] vs. [0; 1]). Finally, (3) J is and hence fairness decreases until a certain threshold. After less sensitive than F that threshold, the variance decreases and more users sufer. Hence, the QoE fairness increases again. In contrast, QoS fairness shows here a monotonic behavior. Due to the QoE mapping function Q, analytical expressions Secondly, Jain’s fairness index depends on the scale. The for the average QoE = E[Q(N)] and the standard deviation curves for J5 and J1 difer (a) when using the QoE function of the QoE values = Std[Q(N)] are rather bulky and omit- Q on a 5-point scale difers and (b) when using normalized ∗ ted here. It has to be noted that QoE values Q . Thirdly, Jain’s fairness index applied to QoE F ∞ values is less sensitive than the fairness index and does J E[Q(N)] = Q(n)p ≠ Q(E[N]) . not allow to clearly discriminate fairness issues. From 5 or n (43) J n=0 1 , one might conclude that the system is more or less fair. However, F clearly depicts that for certain scenarios (around We numerically compute the QoE fairness indexes F and E[N]=1.5 stalls) the system leads to unfairness. Jain’s index on a 5-point scale with Q(N) as well as for nor- Figure 13 summarizes those issues in a simple scenario. ∗ malized QoE values Q (N) . We refer to them as J5 = JQ(N) Half of the users experience worst quality ( L = 0 ) and the and J1 = JQ∗(N). other half experience QoE V. Jain’s index is a constant value Figure 11 shows the numerical values for the average J = 1∕2 independent of V. Thus, in the scenario where all QoE, the standard deviation and the RSD of the QoE values users obtain the same value V = 0 , J does not indicate that

1 3 Quality and User Experience (2018) 3:4 Page 17 of 23 4 the system is fair. In the unfairest scenario, where half of 5 1 the users get best QoE V = 100 , the same fairness index is obtained as in the fair scenario. J does not have intui- 0.8 4 tive meaning when applied to QoE to express fairness. In contrast, if the rating scale is shifted by 100, i.e., L = 100 0.6 and H = 200 , J is not constant anymore, but always leads 3 J ≥ 0.9 Qo E to values (cf. dashed line). Again, the rating scale 0.4 dependence of J leads to severe diferences. In contrast, the std.dev. Qo E 95% quantile F 2 proposed QoE fairness index distinguishes clearly the dif- avg. QoE 0.2 ferent scenarios refected by V. Independent of the underly- 5% quantile std. QoE ing QoE scale, the unfairest scenario leads to V = 0 , while 1 0 the fairest scenario leads to V = 1. 0 0.2 0.4 0.6 0.81 offered load

Application of the QoE fairness index Fig. 14 Case study ‘Web QoE’: Overall QoE and standard deviation Y of the system depending on the ofered load The goal of this section is to show how the proposed QoE fairness index F can be applied. As a result of the numeri- cal examples, we show that a QoS fair system is QoE unfair In the context of web browsing QoE, we have the following (case study: Web QoE for M/D/1-PS). In addition, we show M/D/1-PS model. A web server is considered where all users how to design a system in which a provider may trade-of share the server’s capacity C equally, i.e., processor sharing between fairness and overall performance (case study: HTTP model. All users download the same web page of constant adaptive streaming QoE). size b. User web page requests are modeled with a Poisson process and request rate . Thus, the shared environment is Case study: web browsing QoE modeled as M/D/1-PS queueing system. The ofered load =  b = b∗ in an M/D/1‑processor sharing system of the system is C . The expected value T of the download time T is derived The analytical M/D/1-PS sharing system is well understood by Ott [33] and depends on the fle size b, and describes a perfect QoS fair system which is neverthe- b = . less QoE unfair. Literature has shown that the processor T 1 −  (45) sharing (PS) model captures well the characteristics of a system with a single shared bottleneck, see the survey by Thus, M/D/1-PS is proportional fair. 9 Roberts [39]. For M/D/1-PS, the RSD cT of the download time T In the processor sharing model, the bandwidth C of the depends only on the offered load and is derived by bottleneck (i.e., the QoS resource) is perfectly fairly shared Shalmon [41]. n among the users in the system, i.e., each user receives 1 C n c2 = [2 − 2 − 2(1 − )e ]≈ / . The QoS resource is instantaneously adjusted when T 2 (46) the number of users changes. Thus, perfect QoS fairness is considered here on an instantaneous time scale, as depicted Thus, we can directly compute Jain’s fairness index JT for in Table 1. The M/D/1-PS system leads also to proportional the download time. (QoS) fairness as discussed in “Background and related Reichl et al. [38] observe a logarithmic relationship work: notion of fairness and its applications”. The propor- between waiting time and QoE which is formulated as the tional fairness [23] means (in case of deterministic service WQL hypothesis by Egger et al. [10]. The QoE of web requirements) that the expected service time E[T] is propor- browsing is derived as a logarithmic function of the page tional to the requirements of the service . load time t. E[T]∝ (44) Q(t)=−a log(t)+b. (47) The QoE domain ranges from L = 1 to H = 5 and we assume 9 a = 1 b = 4 In practice, the fair share of resources is not perfectly achieved and [10]. due to imperfections in the transport protocol. In Hoßfeld et al. [15], The page load times T in the M/D/1-PS system are a ran- measurement results are provided for the same system on web QoE dom variable. For computing QoE fairness F, we need to which show the same characteristics in terms of fairness. In this arti- derive the standard deviation Std[Q(T)] of the QoE values cle, we focus on the analytical model as it is well known and under- stood and generalizes the measurement results. Q(T). Further, Jain’s fairness index is applied to Q(T).

1 3 4 Page 18 of 23 Quality and User Experience (2018) 3:4

1 5 Case study: HTTP adaptive streaming QoE

0.9 As a second case study, HTTP adaptive streaming (HAS) 4

x is considered to demonstrate the comparison of differ- 0.8 ent approaches with respect to QoE fairness but also with 3 respect to overall QoE. When a provider has to decide which mechanism to use in practice, the (possible) trade-of

0.7 avg. Qo E

fairness inde F between QoE fairness and overall QoE may be considered. J (QoE) 2 HAS allows the video player to dynamically adjust the 0.6 J (QoS) T video bitrate according to the current network situation. avg. QoE Thereby, HAS tries to overcome video stalling at the cost 0.5 1 0 0.2 0.4 0.6 0.8 1 of reduced video bitrate and lower video quality. However, offered load from a QoE perspective, stalling is the dominating QoE infuence factor. For the interested reader, Seufert et al. [40] Fig. 15 Case study ‘Web QoE’: The fairness indexes are compared provides a comprehensive survey on HAS QoE and HAS for diferent load situations. QoS fairness in terms of Jain’s fairness technology. index of sojourn times is diferent than QoE fairness However, when multiple HAS clients are competing for shared network resources, they may negatively infuence each other in terms of QoE [37]. Thus, a QoE fairness issue Figure 14 depicts the QoE behavior of the system depend- may arise due to the HAS adaptation algorithm. To this end, ing on the ofered load. With increasing load, the download several developed strategies can be found in the literature, times increase and the QoE sufers (left Y-axis, see quan- which try to optimize QoE while maintaining QoE fairness tiles and average QoE values). The standard deviation of among the users. the QoE values shows a non-linear behavior. For very low In particular, Petrangeli et al. [37] developed a QoE- loads, the users arrive often at an empty system and every- driven rate adaptation heuristic (’FINEAS’) and evaluated one experiences the same QoS and QoE. Due to the random diferent mechanisms in terms of QoE.10 In their context, N arrival of users, some users share the capacity with others users are consuming an HTTP adaptive streaming service which happens more often with increasing load. At a certain which uses one concrete HAS mechanism. In the system, load, it is well known that the download times exponentially network bandwidth is the scarce resource and the users (to increase [33] resulting in smaller QoE diferences across be more precise: the HAS mechanisms) are competing. The users (but at a low overall QoE). Note that approaching the goal of the study by Petrangeli et al. [37] is to identify the → overload situation ( 1 ), all users experience the same HAS mechanisms which leads to the best overall QoE and a poor quality ( lim →1 Q(t)=L , but the system is perfectly fair system. In the paper, the average QoE and the standard fair ( lim →1 F = 1 ), if terribly under-performing. deviation of the QoE values over the N are reported for the Figure 15 illustrates the diferent fairness indexes. Jain’s HAS mechanisms. Since the results rely on simulations and fairness index J leads to diferent results and conclusions are repeated several times, there are also confdence intervals than F. In case of low load ( �<0.4 ), J suggests a perfect specifed for the average and the standard deviation. With QoE fair system. However, when looking at the standard the confdence interval of the standard deviation, [ 1; 2] , deviations of the QoE in Fig. 14, we see already stronger we may also derive a confdence interval for the fairness discrepancies between users. J is not capturing this prop- index F for a given signifcance level . Let us consider the erly, since the average QoE is high for this load. We further probability that the real standard deviation lies within the see again that J is not very sensitive. The minimal fairness bounds of the confdence interval. value is about 0.8. In contrast, our proposed fairness metric P ≤ ≤ = 1 − . properly refects the variances in QoE. F is more sensitive 1 2 (48)  and identifes fairness issues even in the low load scenarios. This can be transformed as follows. Please note that the less- F drops close to 0.5 which properly refects that the sys- than sign changes. tem leads to 0.5 max = 1 . We further observe that there is 2 2 P 1 − 1 ≥ F ≥ 1 − 2 = 1 −  a strong discrepancy between QoS fairness (expressed by H − L H − L (49) using Jain’s fairness index JT of download times T converg- ing to 0.5) and QoE fairness. Thus, the fairness index F gives the possibility to clearly 10 For details about the HAS algorithms (FINEAS, Q-L., FESTIVE, identify under which conditions and in which scenarios fair- MILLER, MSS), the interested reader is referred to Petrangeli et al. ness issues arise. [37]. For our purposes, the detailed description is not relevant.

1 3 Quality and User Experience (2018) 3:4 Page 19 of 23 4

1 1

0.9 0.8 F

0.8 x 0.6 0.7

0.4

fairness index 0.6 fairness inde E

0.5 0.2 RSD

Miller FESTIV Q-L. MSS FINEAS Jain F 2 2.5 33.5 0 avg. QoE MSSFINEASQ-L. FESTIVEMiller

Fig. 16 Case study ‘Video streaming QoE’: Diferent mechanisms Fig. 18 Case study ‘Video streaming QoE’: Diferent fairness indexes from literature are compared in terms of QoE and fairness. The lead to diferent values. We observe that Jain’s index is not very sen- results by Petrangeli et al. [37] are reevaluated to quantify F sitive, leading to high fairness values

1 =0.10 =0.30 =0.70 0.7 0.7 1 FINEAS Relevance of fairness 0.6 0.6 0.8 changes decision 0.8 0.5 0.5 Q-L. FESTIVE 0.6 0.6 0.4 0.4 valu e

utilit y 0.3 0.3 MSS 0.4 0.4 MSS Miller 0.2 0.2 FINEAS 0.2 Q-L. 0.1 0.1 FESTIVE 0.2 Miller 00.2 0.4 0.6 0.81 0 0 0 importance of fairness F Jain FJain FJain

Fig. 17 Case study ‘Video streaming QoE’: For the decision Fig. 19 Case study ‘Video streaming QoE’: Diferent fairness indexes which algorithm to use in practice, a value function is defned lead to diferent values and conclusions for operators. We consider (1 − )Q∗ + ⋅ F which allows to quantify the relevance of fairness diferent fairness relevance factors . FINEAS and Miller always lead for the decision to best and worst results, respectively. However, the order of the oth- ers changes depending on F and J

P(F1 ≥ F ≥ F2)=1 − . (50) weighted sum of the average QoE and the fairness depending F This leads to the confdence interval for . on a parameter specifying the relevance of fairness. Thus, a ∗ 2 2 value function is defned. For example, v =(1 − )Q + F . [1 − 2 ;1 − 1 ]. ∗ H − L H − L (51) Thereby, we use the normalized QoE values Q to have the fairness index and the average QoE in the interval [0; 1].11 Figure 16 shows the numerical results of the strategies con- This allows for an intuitive meaning of the relevance param- cerning the average QoE and the fairness index F. It can be eter. From Fig. 17, we observe that the FESTIVE approach clearly seen that the FINEAS strategy outperforms the other may be preferred instead of MSS if fairness is as important strategies as it is better in terms of average QoE and fairness. as average QoE ( ≥ 0.5 ). We would like to emphasize that However, it remains unclear if the HAS adaptation strategy other fairness indexes (Jain or RSD) lead to other values MSS is for example better than FESTIVE. MSS leads to higher average QoE but a lower fairness. A provider needs to decide how relevant fairness is. 11 Thus, there may be a trade-of between fairness and QoE. Instead of defning a relevance parameter for fairness, one might also defne weights for each component. This may also overcome the In Fig. 17 we sketch this more clearly. A provider may use a normalization of the QoE values.

1 3 4 Page 20 of 23 Quality and User Experience (2018) 3:4 and change the outcome of an operator’s decision. Figure 18 QoE fairness from the operator’s perspective shows (again) that Jain’s index is not able to discriminate the fairness properly (cf. Fig. 12)—here across mechanisms— Throughout the paper, we have considered fairness from the and also sufers like the RSD sufers from being scale and operator’s perspective. In fact, for most of today’s common metric dependent. Since J always leads to high fairness val- applications usage scenarios, users are not necessarily aware ues, the utility function would not consider fairness appro- of whether they receive a fair QoE. The question arises, then, priately and mainly put weight on overall QoE. Figure 19 why should operators care about fairness? shows the diferent outcomes. In case of little relevance We posit that QoE fairness can help operators obtain better of fairness ( = 0.1 ), the fairness index has only a minor overall customer satisfaction, if used, e.g., as a secondary ser- impact, as desired and defned. For higher relevance, it can vice optimization objective. For instance, when optimizing be seen that the order of mechanisms changes between F and a service for average QoE, it is common to fnd a family of J, i.e. leading to diferent conclusions for operators. solutions with similar results QoE-wise, but varying fairness In practice, this value function may be more complex and levels. In such cases, choosing the fairer solution (with some also include other aspects such as costs. This is however out additional constraints in terms of acceptance thresholds) will of scope for this article. The basic intention is to highlight often result in more satisfed users, and lower churn levels. that for system design choice, the overall QoE and fairness These optimizations become more important in multi- needs to be evaluated and considered. Accordingly, the rel- application scenarios, where the relative value of each QoS evance of fairness needs to be decided. metric (e.g., throughput, latency) can vary wildly in terms of QoE. Assuming that the QoE models produce compara- ble results, QoE fairness may imply very diferent resource management policies than just trying to give each user a fair Conclusions and discussion share of the network’s capacity. This type of application, however, remains contingent on the development of QoE The motivation for defning a fairness index comes primar- models whose results can be compared across applications. ily from the operator’s perspective, as QoE fairness measures This is still an open problem. can be used to drive resource allocation mechanisms aimed at For some application scenarios, where social interaction maximizing the satisfed customer base. The application of a happens in parallel to the service delivery (e.g., real-time com- QoE fairness metric is manifold, ranging from QoE manage- menting on social networks for a live stream), or where the ment mechanisms and system optimization to benchmarking use of diferent devices can have an impact on both resource diferent resource management techniques. We have introduced requirements and perceived quality (as exemplifed in “From a defnition for a QoE fairness index, and showed that QoE fair- QoS to QoE management”), then fairness can be more relevant ness does not, due to the nature of QoS to QoE mappings for for individual applications, as an unequal share of resources most services, necessarily follow from QoS fairness. We argue leading to unfairness may be observed by the users themselves, that commonly used QoS fairness metrics such as Jain’s fairness and QoS-fair approaches can result in sub-optimal QoE. index are not suitable for quantifying QoE fairness, despite being Another possible interpretation of QoE fairness would used for that purpose in the literature. Our proposed metric ful- be in terms of societal welfare, with the understanding that flls a number of desirable qualities, and it is intuitively simple a more fair distribution of QoE would make for a larger to understand. We illustrate its use with an example use case for number of satisfed (or even happy) users. This may also be Web QoE modeled as a function of page loading times. Another relevant in the case of non-commercial scenarios, such as use case is the selection of an HTTP adaptive streaming mecha- city- or coop-owned networks. nism which may be guided by the overall video QoE as well as the QoE fairness. QoE fairness says nothing about how good the system is and thus needs to be considered together with, and Further work most likely subordinated to, the achieved overall QoE in system design. We emphasize that the proposed QoE fairness index is The notion of QoE fairness defned in this paper leads us to just a means for benchmarking or designing systems, and may some interesting questions, which may be the basis of future be used as an extra tool for operators to make better informed research endeavours. decisions concerning QoE management. Remark The proposed fairness index is a generic concept • QoE Fairness-aware service optimization: whereby which may also be applied to other domains than QoE (like QoE management mechanisms consider fairness as an QoS) or for diferent purposes. It can be applied to any data optimization target (e.g., in a two-step optimization on an interval scale with a clearly defned bounded value approach, optimizing frst for overall QoE, and then range. for fairness).

1 3 Quality and User Experience (2018) 3:4 Page 21 of 23 4

• The comparability of QoE results for diferent applica- Table 5 Diferent statistical measures illustrating various properties tions: QoE fairness is most interesting for operators of QoE fairness indexes when QoE can be compared across services. This is Scenario description for user classes A and B not (necessarily) the case with current QoE models, Class A YA = 0 YA = 1 YA = 0 YA = 1 and further studies are needed in this direction. This 1 5 research line is also relevant to service pricing and Class B YB = YB = YB = YB = QoE-aware SLA defnitions, as addressed for example Scale [0; 1] [0; 1] [0; 1] [1; 5] by Varela et al. [44]. Scenario S1 S2 S3 S4 • The concept of QoE fairness can be more relevant for E[Y] 1 1 1 1 long-term contexts than for short-term ones. A provider 2 2 ( + ) 2 3 Var[Y] 1 2 1 1 2 1 may need to ensure that its customers are equally served 4 4 ( − ) 4 4 over one month, while short-term variances may be Std[Y] 1 1 1 1 2 2 ( − ) 2 2 1− 2 accepted by the users. In a rough sense, some providers c[Y] 1 1+ 1 3 are implementing this with reduced data speeds at the 1 2 1 9 J 1 (1+ ) end of a month when a certain data volume is consumed. 2 2 1+ 2 2 13 F 1 − This may lead to a certain kind of long-term fairness, see 0 0 also “Notion of fairness in shared environments”. How- A system is considered which leads to two QoE classes A and B for ever, most QoE models only produce short-term QoE its users with QoE values YA and YB . Both classes are observed with p = p = 1 estimates, and are not suited for other usages. same probability A B 2 . Scenario ‘S3’ and ‘S4’ lead to maxi- • Related to the item above, service pricing is also rel- mum unfairness evant to the notion of QoE fairness (as in, is the utility perceived by the user commensurate to the price they Table 6 Notation and variables frequently used pay?), yet it is not really considered in QoE models. On this point, price and fairness can be closely related. Our General variables proposed metric is price-independent as long as the QoE Y QoE values of the users in a system, Y ∈[L; H] models consider price (and consequently users’ expecta- L Lower bound of the QoE value range tions), but this is not usually the case. H Upper bound of the QoE value range Y∗ Y∗ = Y−L ∈[0;1] Normalized QoE values H−L T(Y) Linear transformation of QoE values, Acknowledgements This work emerged from the First International T(Y)=aY + b Symposium on Quality of Life (QoL-2016) in Zagreb, Croatia, in Key measures March 2016. This work was partly funded by Deutsche Forschun- = E[Y] gsgemeinschaft (DFG) under Grants HO 4770/1-2 (DFG OekoN​ Average QoE across all uers Std Y 2 2 et); the Croatian Science Foundation, project no. UIP-2014-09-5605 = [ ] Standard deviation, Std[Y]= E[Y ]−E[Y] (Q-MANIC); the NTNU QUAM Research​ Lab (Quantitative modelling c Relative standard deviation (RSD),√ c = ∕ of dependability and performance). The authors would like to thank 1 J J = = E[Y]2∕E[Y2] Stefano Petrangeli and Steven Latre for providing the data [37] as used Jain’s fairness index, 1+c2 in “Case study: HTTP adaptive streaming QoE”. F F = 1 − 2 New fairness index, H−L Most unfair system Compliance with ethical standards Yu QoE values in most unfair system u Average QoE in most unfair system, Conflict of interest On behalf of all authors, the corresponding author, 1 H L u = 2 ( + ) Tobias Hoßfeld states that there is no confict of interest. H−L u c = Stand- RSD, u H+L ard deviation, Open Access This article is distributed under the terms of the Crea- 1 H L c u = 2 ( − ) u tive Commons Attribution 4.0 International License (http://creat​iveco​ (H+L)2 Ju J = mmons.org/licen​ ses/by/4.0/​ ), which permits unrestricted use, distribu- Jain’s index, u 2(H2+L2) tion, and reproduction in any medium, provided you give appropriate Fu Fairness index, Fu = 0 credit to the original author(s) and the source, provide a link to the Maximum and minimum values of fairness indexes Creative Commons license, and indicate if changes were made. cmin 0, if = 0 1 H−L cmax 2 HL 4√HL Jmin (H+L)2 Appendix Jmax 1, if = 0 Fmin 0 for most unfair scenario See Tables 5 and 6. Fmax 1, if = 0

1 3 4 Page 22 of 23 Quality and User Experience (2018) 3:4

References 21. Jiang J, Sekar V, Zhang H (2014) Improving fairness, efciency, and stability in http-based adaptive video streaming with festive. IEEE/ACM Trans Netw 22(1):326–340 1. Avi-Itzhak B, Levy H, Raz D (2008) Quantifying fairness in 22. Kelly F (1997) Charging and rate control for elastic trafc. Eur queuing systems: Principles, approaches, and applicability. Prob Trans Telecommun 8(1) (1997) Eng Inf Sci 22(4):495–517 23. Kelly FP, Maulloo AK, Tan DKH (1998) Rate control for com- 2. Basu D (1955) On statistics independent of a complete sufcient munication networks: shadow prices, proportional fairness and statistic. Sankhyā Indian J Stat (1933–1960) 15(4):377–380 stability. J Oper Res Soc 49(3):237–252 3. Bertsimas D, Farias VF, Trichakis N (2012) On the efciency- 24. Lan T et al (2010) An axiomatic theory of fairness in net- fairness trade-of. Manag Sci 58(12):2234–2250 work resource allocation. In: Proceedings of IEEE INFO- 4. Bonald T, Proutiere A (2003) Insensitive bandwidth sharing in COM, San Diego, CA, pp 1–9. https​://doi.org/10.1109/INFCO​ data networks. Queueing Syst 44(1):69–100 M.2010.54619​11 5. Briscoe B (2007) Flow rate fairness: dismantling a religion. 25. Le Boudec J-Y (2005) Rate adaptation, congestion control and ACM SIGCOMM Comput Commun Rev 37(2):63–74 fairness: a tutorial. https​://moodl​e.epf.ch/fle.php/523/CC_ 6. Cofano G et al (2016) Design and experimental evaluation of Tutor​ial/cc.pdf network-assisted strategies for HTTP adaptive streaming. In: 26. Le Callet P Möller S, Perkis A et al (2013) Qualinet white paper Proceedings of the 7th international conference on multimedia on defnitions of quality of experience. In: European network systems, ACM, p 3 on quality of experience in multimedia systems and services 7. De Cicco L et al (2013) Elastic: a client-side controller for (COST Action IC 1003), March 2013 dynamic adaptive streaming over http (dash). In: 20th Interna- 27. Mansy A, Fayed M, Ammar M (2015) Network-layer fairness for tional packet video workshop, IEEE, pp 1–8 adaptive video streams. In: IFIP Networking conference (IFIP 8. Demers A, Keshav S, Shenker S (1989) Analysis and simulation networking). Toulouse, France, May 2015 of a fair queueing algorithm. ACM SIGCOMM Comput Com- 28. Mo J, Walrand J (2000) Fair end-to-end window-based conges- mun Rev 19(4):1–12 tion control. IEEE/ACM Trans Netw (ToN) 8(5):556–567 9. Deng J, Han YS, Liang B (2009) Fairness index based on vari- 29. Möller S (2012) Assessment and prediction of speech qual- ational distance. In: GLOBECOM, pp 1–6 ity in telecommunications. Springer Science+Business Media 10. Egger S et al (2012) “Time is bandwidth”? Narrowing the gap Dordrecht between subjective time perception and quality of experience. 30. Möller S et al (2011) Speech quality estimation: models and In: IEEE int. conference on communications (ICC), Ottawa, trends. IEEE Signal Process Mag 28(6):18–28 Canada, June 2012 31. Mu M et al (2015) User-level fairness delivered: network 11. ETSI TS 102 250-1 V2. 2.1. (2011) QoS aspects for popular resource allocation for adaptive video streaming. In: IEEE 23rd services in mobile networks. In: Speech and multimedia trans- international symposium on quality of service (IWQoS), IEEE, mission quality (STQ) 2011-04 pp 85–94 12. Gabale V et al (2012) InSite: QoE-aware video delivery from 32. Norman G (2010) Likert scales, levels of measurement and the cloud data centers. In: Quality of service (IWQoS), 2012 IEEE laws of statistics. Adv Health Sci Educ 15(5):625–632 20th international workshop on, IEEE, pp 1–9 33. Ott TJ (1984) The sojourn-time distribution in the M/G/1 queue 13. Georgopoulos P et al (2013) Towards network-wide QoE fair- with processor sharing. J Appl Prob 21(2):360–378 ness using openfow-assisted adaptive video streaming. In: 34. Ozugur T et al (1998) Balanced media access methods for wire- ACM SIGCOMM workshop on Future human-centric multi- less networks. In: Proceedings of the 4th annual ACM/IEEE media networking, Hong Kong, Aug. 2013 international conference on Mobile computing and networking, 14. Hoßfeld T, Heegaard PE, Varela M (2015) QoE beyond the ACM, pp 21–32 MOS: added value using quantiles and distributions. In: Seventh 35. Palyi PL, Racz S, Nadas S (2008) Fairness-optimal initial shap- int. workshop on quality of multimedia experience (QoMEX), ing rate for HSDPA transport control. In: Costa Navarino, Greece, June 2015 Communication systems, 2008. ICCS 2008. 11th IEEE singa- 15. Hoßfeld T et al (2017) Defnition of QoE fairness in shared pore international conference on, IEEE, pp 1415–1421 systems. IEEE Commun Lett 21(1):184–187 36. Parekh AK, Gallager RG (1993) A generalized processor shar- 16. Hoßfeld T et al (2015) Identifying QoE optimal adaptation of ing approach to fow control in integrated services networks: the HTTP adaptive streaming based on subjective studies. Comput single-node case. IEEE/ACM Trans Netw (ToN) 1(3):344–357 Netw 81:320–332 37. Petrangeli S et al (2016) QoE-driven rate adaptation heuristic 17. Hoßfeld T et al (2013) Internet video delivery in You-Tube: for fair adaptive video streaming. ACM Trans Multimedia Com- from trafc measurements to quality of experience. In: Biersack put Commun Appl 12(2):28 E, Callegari C, Matijasevic M (eds) Data trafc monitoring and 38. Reichl P et al (2010) The logarithmic nature of QoE and the analysis: from measurement, classifcation and anomaly detec- role of the Weber–Fechner law in QoE assessment. In: IEEE tion to quality of experience. Computer communications and int. conference on communications (ICC). Cape Town, South networks series. Springer, Berlin, Heidelberg Africa, May 2010 18. Hoßfeld T et al (2017) No silver bullet: QoE metrics, QoE fair- 39. Roberts JW (2004) A survey on statistical bandwidth sharing. ness, and user diversity in the context of QoE management. In: Comput Netw 45(3):319–332 2017 Ninth international conference on quality of multimedia 40. Michael S et al (2015) A survey on quality of experience of http experience (QoMEX), IEEE, pp 1–6 adaptive streaming. IEEE Commun Surv Tutor 17(1):469–492 19. Huynh-Thu Q et al (2011) Study of rating scales for subjective 41. Shalmon M (2007) Explicit formulas for the variance of condi- quality assessment of high-defnition video. IEEE Trans Broad- tioned sojourn times in M/D/1-PS. Oper Res Lett 35(4):463–466 cast 57(1):1–14 42. Taboada I et al (2013) QoE-aware optimization of multimedia 20. Jain R, Chiu D-M, Hawe WR (1984) A quantitative measure fow scheduling. Comput Commun 36(15):1629–1638 of fairness and discrimination for resource allocation in shared 43. Tominaga T et al (2010) Performance comparisons of subjective computer system, vol 38. Eastern Research Laboratory, Digital quality assessment methods for mobile video. In: 2010 Second Equipment Corporation, Hudson

1 3 Quality and User Experience (2018) 3:4 Page 23 of 23 4

international workshop on quality of multimedia experience 45. Villa B, Heegaard PE (2012) Improving perceived fairness and (QoMEX), June 2010, pp 82– 87 QoE for adaptive video streams. In: Proc. ICNS 2012, pp 149–158 44. Varela M et al (2015) Experience level agreements (ELA): 46. Wierman A (2007) Fairness and classifcations. ACM SIGMET- the challenges of selling QoE to the user. In: Proc. of the ICC RICS Perform Eval Rev 34(4):4–12 2015 workshops, IEEE, pp 1741–1746. https://doi.org/10.1109/​ ICCW.2015.72474​32

1 3