A dynamic rate adaptation algorithm using WB E-model for voice traffic over LTE network

Duy-Huy Nguyen and Hang Nguyen Department of Wireless Network and Multimedia Services Institut Mines-Telecom, Telecom SudParis Samovar Laboratory, UMR 5157, CNRS, Evry, France {duy huy.nguyen, hang.nguyen}@telecom-sudparis.eu

Abstract—This paper presents a dynamic adaptation algorithm When voice traffic is transmitted over LTE network, the of joint source-channel code rate for enhancing voice transmis- voice signal firstly is compressed at Application layer by sion over LTE network. In order to assess the speech quality, we AMR-WB , and then it is packetized into RTP payload. use the Wideband(WB) E-model. In this model, both end-to-end delay and packet loss are taken into account. The goal of this When this payload goes through each layer, it is packetized paper is to find out the best suboptimal solution for improving into the corresponding packet and the header is added. In order voice traffic over LTE network with some constraints on allowed to protect the voice packet when it is delivered over a noisy maximum end-to-end delay and allowed maximum packet loss. channel, some error correcting technologies are included. The The best suboptimal choice is channel code rate corresponding to Forward Error Correction (FEC) channel code is widely used each mode of the AMR-WB codec that minimizes redundant bits generated by channel coding with an acceptable MOS reduction. in LTE network for data channels is Turbo code. Channel Besides, this algorithm can be integrated with rate control in coding reduces Bit Error Rate (BER), so that the speech AMR-WB codec to offer the required mode of LTE network. quality will be improved. Channel coding encodes a k-bits Our results show that the MOS degradation is not significant, block into a n-bits codeword, thus, the number of redundant but the percent of reduced redundant bits to be very considerable. bits equal to n−k, and so that, the code rate is k/n. This means This will requires less bandwidth, so that, more mobile users can be served. The algorithm has simple computational operations, that the higher channel code rate, the higher speech quality, it can be applied to real-time voice communications. but this leads to the longer delay and the higher redundancy. Index Terms—AMR-WB, Wideband E-model, VoIP, VoLTE, Therefore, there needs to be a tradeoff between speech quality Source-Channel code rate, Adaptive algorithm and channel code rate. There are several authors who have proposed techniques to I.INTRODUCTION improve the speech quality delivered over a noisy channel. In present market of mobile communication in the world, Examples include the works of [3], [4], [5], [6], [7], and [8]. 3GPP LTE is developing strongly and is deployed by the But the closest works related to our paper are represented most communications operators. LTE network is based on All- in [3], [7], and [8]. The authors in [3] present a dynamic IP network and does not support circuit switching method joint source channel coding rate adaptation algorithm for VoIP which is utilized to provide the voice call service in 3G using AMR codec. The algorithm computes the optimal rates networks. So that, in order to support voice service over LTE allocated to each frame for a set given QoS constraints. The network, additional technology has to be included. Voice over aim of their paper is to find the tradeoff between packet loss LTE network (VoLTE) service was developed to supply voice recovery and end-to-end delay to maximize perceived speech and video communication and Short Message Service (SMS) quality. In [7], the authors propose an optimization issue for on the LTE network [1]. According to [1], there are two supplying unequal error protection of speech frames according types of voice traffic over LTE network, those are VoLTE to their importance. An optimization framework for identifying and VoIP. VoLTE was launched in 2012, and at present, the optimal joint source-channel code rate of each voice frame many mobile network operators in the world provide VoLTE based on the frame perceptual importance is proposed in [8]. service. VoLTE is VoIP (Voice over Internet Protocol) based In that paper, the quality of the received speech signal is multimedia service in which voice call and video conference maximized. services can be supplied. VoLTE is signalling protocol that The aim of this paper is to extend results in [3] in context of enables carrying of voice packets and is guaranteed of given voice traffic over LTE network. Besides, in stead of finding the QoS information by LTE network operator [2]. Otherwise, tradeoff between packet loss recovery and end-to-end delay to VoIP relies on the internet which is done on a “best effort” maximize the perceived speech quality, our proposal focuses basis to deliver voice packets. It is said that VoLTE is basically on finding out the compromise between source code rate a subset of VoIP. It is enhanced VoIP over specific access and channel code rate to minimize the number of redundant technology (LTE) with given QoS information, but it is still bits generated by channel coding with an acceptable Mean VoIP. Opinion Score (MOS) degradation. We would like to offer

1 an other point of view of choosing the channel code rate LTE network for voice compression and decompression. It is corresponding to each mode of AMR-WB codec for voice detailed described in [11]. AMR-WB codec uses a sampling traffic over LTE network. In order to assess speech quality, we rate of 16 kHz, which covers 50-7000 Hz audio bandwidth. use the Wideband E-model [9]. In this model, the transmission It has 9 different codec modes (from mode 0 to mode 8) rating factor (Rwb) is used as a measure of subjective quality. corresponding to 9 source bit rates in range from 6.6 Kb/s This factor is then mapped to the corresponding MOS. In this to 23.85 Kb/s. Each of them generates encoded 20 ms speech paper, a suboptimal solution for joint source-channel code rate frame and switches among them every 20 ms. The bits in adaptation is proposed. The aim of this solution is to find the encoded speech frame to be ordered according to their out the most suitable channel code rate for each mode of subjective importance. These bits are divided into three classes AMR-WB codec and for entire modes of AMR-WB codec. with reducing perceptual importance: Class A, Class B and It takes into account some constraints on maximum allowed Class C. Total bits of each class depends on codec mode. In end-to-end delay and maximum permitted packet loss rate this study, we consider the same level of error protection for for voice traffic over LTE network. We do not consider the these three classes. Thus, the bits of these classes are equally perceived speech quality as the first target to optimize. We protected by channel coding. want to present a suboptimal solution for the tradeoff between In LTE network, AMR-WB codec is configured into 3 speech quality and redundancy caused by channel coding. This configurations [12] as follows: means that our algorithm find out the suboptimal solution for • Configuration A (Config-WB-Code 0): 6.6, 8.85, and minimizing the redundant bits generated by channel coding 12.65 Kb/s (Mandatory multi-rate configuration). with an acceptable MOS reduction. The rest of this paper is • Configuration B (Config-WB-Code 2): 6.6, 8.85, 12.65, organized as follows: Overview of voice transmission over and 15.85 Kb/s. LTE network is described in section II. In section III, we • Configuration C (Config-WB-Code 4): 6.6, 8.85, 12.65, present the proposed algorithm. The simulation results and and 23.85 Kb/s. performance evaluation of the proposed algorithm are analysed These configurations are used to simplify the negotiation of in section IV. The conclusion and future work is represented bit rate between the user equipment and the base station, thus in section V. will simplify the implementation and testing. The remaining II.VOICE TRAFFIC OVER LTE NETWORK:OVERVIEW bit rates can still be used for other purposes in mobile networks. In order to choose a bit rate, the receiver measures A. Voice traffic protocol stack layers quality of radio channel. The quality indicator (QI) is used In LTE network, the speech frame is packetized sequen- for this purpose. It is defined as an equivalent carrier-to- tially with network protocols, including Real-time Transport interference (C/I) ratio. The C/I ratio then compared to a set Protocol (RTP), User Datagram Protocol (UDP) and Internet of predefined thresholds to decide which mode to be used. Protocol (IP). And then, it will be encapsulated with other Switching among modes in a configuration depend on the radio protocols as Packet Data Convergence Protocol (PDCP), rate control algorithm in AMR-WB codec. The criterion for Radio Control Link (RLC) and Mac Access Control (MAC). mode switching is threshold value of C/I ratio. These threshold All of these protocols will add their headers into the packetized values depend on the channel condition, frequency hopping speech packet. The sizes of these protocols headers as follows: scheme, network configuration and other factors. Furthermore, RTP - 12 bytes, UDP - 8 bytes, IP - 20 bytes (with IPv4) and network conditions change over time, so that, even well- 40 bytes (with IPv6), PDCP - 1 byte, RLC - 1 byte, and MAC selected adaption thresholds will not be best. - 1 byte. In order to decrease data overhead of above protocols head- C. LTE channel coding ers when the speech packet transmitted over a radio channel, One of the crucial issues of digital communication is error Robust Header Compression (RoHC) is used. This will save correction. When data is transmitted over a noisy channel, it bandwidth and enhance voice transmission over LTE network. will be distorted by noise. Thus, the protection of data over RoHC compresses IP header from 40 bytes (with IPv4) and 60 noisy channel is mandatory. This so called is channel coding. bytes (with IPv6) down to 1 to 3 bytes [10]. For voice traffic In LTE network, encoders are used in channel coding includ- in LTE network, HARQ (Hybrid Automatic Repeat Request) ing: Block code used for CRC (Cyclic Redundancy Code) technique is used at MAC layer for retransmission if FEC and HARQ, Convolutional code used for control channels, fails error correction. Each speech packet will be retransmitted with data channels, Turbo code is used. Turbo code is an at least from one to three times. The retransmission times enhanced Convolutional code. It is a Parallel Concatenated depends on error correction or the configured maximum times Convolutional Code (PCCC) with two eight-state constituent of retransmission. encoders and one turbo code internal interleaver, with a coding rate of 1/3 [13]. Standard turbo code rates are 1/3, 3/4, and B. Speech Codec 4/5, where code rate 1/3 is the original code rate. Turbo code Voice traffic in LTE uses Adaptive Multi-Rate Wideband rate is chosen based on CQI (Channel Quality Indicator) index (AMR-WB) as a vocoder. AMR-WB codec is a speech codec [14]. CQI index includes 16 values, where value of 0 is not which has been developed by ETSI and applied in the 3GPP used. Values of CQI index from 1 to 6, 7 to 9, and 10 to 15

2 TABLE I are corresponding to Turbo code rates of 1/3, 3/4, and 4/5. R-FACTOR AND MOS WITH CORRESPONDING USER SATISFACTION Each CQI index is mapped to a SINR (Signal-to-Interference- plus-Noise Ratio) value. SINR also has 15 values (from -6.7 Rx User satisfaction MOS 90 ≤ R < 100 dB to 22.7 dB) [15] counted by the receiver and sent to the x Very satisfied 4.3-5.0 80 ≤ Rx < 90 Satisfied 4.0-4.3 transmitter. Channel coding will map from SINR value to 70 ≤ Rx < 80 Some users dissatisfied 3.6-4.0 corresponding CQI index, and then chooses the corresponding 60 ≤ Rx < 70 Many users dissatisfied 3.1-3.6 channel code rate. 50 ≤ Rx < 60 Nearly all users dissatisfied 2.6-3.1 R < 50 Not recommended < 2.6 In Turbo Encoder, each information bit stream is a k-bits x block. This block size in range from 40 to 6144 bits. So each block can include one or several speech packets. A k-bits block be encoded into a n-bits codeword. So that, the number of redundant bits of LTE channel coding is n−k bits and channel code rate is k/n. D. Wideband E-model: Speech quality assessment for Wide- band Audio Wideband E-model is a computational model developed and standardized by ITU-T [9]. It is used to estimate the MOS for wideband audio quality. The output of the model is R-factor. The values of this R-factor in range from 0 to 129. And then, it is mapped to the MOS. The R-factor in Wideband E-model is defined as follows:

Rwb = R0,wb − Is,wb − Id,wb − Ie,eff,wb + A (1) Fig. 1. Ie,wb vs. Packet loss for nine modes of AMR-WB codec Where:

• R0,wb: The basic signal-to-noise ratio; • Is,wb: The simultaneous impairment factor, it is the In order to compute the Rwb factor, we have to count sum of all impairments which may occur more or less the values of Id,wb and Ie,eff,wb factors. The Id,wb factor is simultaneously with the voice transmission. In this model, determined by the following equation [17]: this factor is set to 0; Id,wb = 0.0024 × De2e + 0.11 × (De2e − 177.3)× • I : The delay impairment factor, representing all im- (4) d,wb H(De2e − 177.3) pairments due to delay of voice signals; In which: H(x) is the Heavyside function: • Ie,eff,wb: The equipment impairment factor, capturing the effect of signal distortion due to low bit rates of the codec  0, ifx < 0 H(x) = (5) and packet losses of random distribution; 1, otherwise • A: The advantage factor, capturing the fact that some users can accept a reduction of quality due to the mobility In equation (4), De2e represents the total end-to-end delay of cellular networks. In this model, this factor is set to 0. (or mouth-to-ear delay) of speech packet. It will be described detail in section III. The I is determined according to In above factors, I and I are affected by end- e,eff,wb d,wb e,eff,wb packet loss. In this study, we estimate packet loss probability to-end delay and packet loss, respectively, while R and 0,wb at the receiver, after FEC. The output bits of AMR-WB codec I do not depend on network performance. The R factor s,wb wb will be encoded by channel coding (Turbo code). In fact, the is translated into the MOS as follows [9]: bits in class A, class B and class C can be encoded with R = R /1.29 x wb different channel code rates. In addition, 8-bits CRC code is • For Rx < 0: MOS = 1 applied to protect class A bits. In figure 1, the packet loss • For 0 ≤ Rx ≤ 100: represents the average rate of speech frames for which CRC MOS = 1 + 0.035 × R + 7 × 10−6 × R × check fails in class A bits. It is determined for each mode of x x (2) (Rx − 60) × (100 − Rx) AMR-WB codec. According to [18], Ie,eff,wb is determined as follows: • For Rx > 100: MOS = 4.5 Ppl Ie,eff,wb = Ie,wb + (129 − Ie,wb) × (6) Rwb factor is mapped to the MOS using the above equation, Ppl+Bpl and then, the MOS is mapped to the level of satisfaction of In which: I : The respective impairment factor without the users, see table I. According to [16], with the wideband e,wb any packet loss. P : Packet loss rate. B : A codec-specific audio, the value of R factor in equation (1) equal to 129. pl pl 0,wb factor which characterizes its robustness against packet loss. Thus, equation (1) can be rewritten as follows:

Rwb = 129 − Id,wb − Ie,eff,wb (3)

3 III.THEPROPOSEDALGORITHMFORMINIMIZING mined as the following equation: REDUNDANTBITSGENERATEDBYCHANNELCODING De2e = k × T × f + la + (n − k + 1) × (T × f × Rs+ P 1 P As mentioned above, in the Wideband E-model, the value Hoverhead) × + (Qh + Ph) + 2 × T + Drohc+ Bh of the MOS depends on both impairment Id,wb and Ie,eff,wb h h factors and they have direct relationship with end-to-end Dharq (8) delay and packet loss. If increasing end-to-end delay leads With D : The delay of RoHC processing time at PDCP to decreasing the MOS while reducing packet loss leads to rohc layer. According to [21], RoHC should not noticeably add to increasing the MOS. Therefore, finding out the suboptimal the end-to-end delay and according to [22], this delay is not joint source-channel code rate solutions is very essential. In very significant, approximately 10 - 67 µs/packet to compress this paper, we offer another viewpoint of choosing the suitable and 12 - 51 µs/packet to decompress RoHC packets. Thus, channel code rate corresponding to each mode of AMR-WB in this study, we consider the delay caused by RoHC equal codec for minimizing the number of redundant bits generated to 0. D : The delay due to retransmission at MAC layer by channel coding with an acceptable MOS reduction. harq by HARQ. Each voice packet is retransmitted at least one A. The calculation of the delay impairment factor times. According to [23], because the RTT (round trip time) In order to calculate this factor, we have to compute the of HARQ is fixed and because of the higher priority for end-to-end delay. According to [19], the end-to-end delay can retransmissions, the HARQ delay is normally within 10 ms. be counted as follows: Normally, FEC also can cause the delay. However, a number of authors have pointed out that FEC does not introduce any De2e = Denc + Dnetwork + Dplay (7) delay unless there is packet loss. So that, in this study, we do not mention the delay caused by FEC. After calculating the Where: end-to-end delay, we can obtain the impairment factor Id,wb • Denc: The delay time caused by encoding and packetizing from equation (4). at the AMR-WB encoder, Denc = k×T ×f +la +Dpack With: T : The speech frame size, T = 20 ms. f: The B. The calculation of the equipment impairment factor number of frames of a speech packet. la: The look-ahead In order to compute this factor, we have to count the packet delay, la = 5 ms for all modes of AMR-WB codec. Dpack: loss rate after FEC schemes try to recover errors. We assume The packetization delay for grouping f frames into one that the estimates for the packet loss rate Ppl on the end-to- speech packet, i.e. Dpack = (f − 1) × T . end network path is available at time an adaptation decision is • Dnetwork: The sum of transmission delay, propagation being made. In order count the Ppl, we assume a random loss delay and queuing delay at each hop h in the network model, the relationship between the parameters (k, n) of FEC path from the transmitter to the receiver. The transmission schemes, “raw” packet loss rate on the end-to-end network delay (Th) is computed using the following equation [19]: path pr and the packet loss rate Ppl is described as follows P 1 Th = (n − k + 1) × (T × f × Rs + Hoverhead) × Bh [21]: h n   In which: Rs: AMR-WB bit rate before channel coding. P n i n−i i Ppl = × pr × (1 − pr) × n (9) Hoverhead: The number of overhead bits introduced by i=n−k+1 i RTP/UDP/IPv6/PDCP/RLC/MAC headers and 24 bits Equation (9) shows that, we can attain the current pr from introduced by CRC at Physical layer. Bh: The bandwidth the given measurement Ppl and a pair of (k, n). This pr value at hop h. is counted once per adaptation period and is utilized in the The propagation delay (Ph) depends on the distance from proposed algorithm as shown in the next subsection. Figure 1 the source to destination. It is negligible if within a local shows Ie,wb versus packet loss(%) after FEC. Thus, we can area. For intra-continental calls, the propagation delay is obtain Ie,wb by referring to figure 1. in the order of 30 ms and for inter-continental calls, it can be as large as 100 ms [19]. It is clear that for a given C. The proposed algorithm voice connection, the only random component of voice In this algorithm, we solve the problem of choosing suitable delay (that is the only source of jitter) consists of queuing source and channel code rate within some constraints on max- P delay in the network [19], Q = Qh. imum allowed end-to-end delay, maximum permitted packet h loss for minimizing the number of redundant bits generated • Dplay: The playback delay, voice packets are usually delayed in a jitter buffer and the fixed playback delay by channel coding with an acceptable MOS reduction. This must be equal to at least two speech frame length [20], algorithm is located at the transmitting side. The inputs of the algorithm as shown in figure 2 including: i.e. Dplay = 2 × T . From above analysis, we get the end-to-end delay deter- • QoS information: As path packet loss Ppl, path bandwidth Bh and congestion Qh. These values can obtain by QoS estimation module used in the network. • QoS constraints: Maximum allowed end-to-end delay Dmax and maximum permitted packet loss rate Pmax.

4 Algorithm 1: A dynamic rate adaptation algorithm using WB E-model Step 1: For all Rs ∈ {mode 0,..., mode 8}: For k = 1 to k0 For n = k + 1 to n0 • Compute the end-to-end delay using equation (8). • Compute the packet loss rate using equation (9).

Step 2: Find all triples (Rs,i, ki, ni), i = 1, ..., t meeting the conditions in equation (10). Step 3: Compute Id,wb, Ie,eff,wb, Rwb, and MOS using the equation (4), (6), (3), (2), respectively. Step 4: Find the highest MOS of each AMR-WB codec mode corresponding to each pair of (kbest, nbest). Fig. 2. A proposed simple framework of voice traffic over LTE network Step 5: Find the suboptimal selection for each AMR-WB codec mode for reducing redundant bits of channel coding: For k = 1 to k0 For n = k + 1 to n0 The outputs of the algorithm will be a decision on the choice • Find the last pair of (k, n) which has the highest MOS, then mark ksubopt = k of source code rate Rs and channel code rate Rc. Besides the For k = 1 to ksubopt most suboptimal choice of channel code rate for every AMR- For n = 1 to n0 WB codec mode, the algorithm also offers the best suboptimal if n > k then • Find the first pair of (k, n) which has the highest MOS, then mark choice of channel code rate for all of AMR-WB codec modes. nsubopt = n. In order to describe the algorithm, we start by setting two the • Find the suboptimal MOS corresponding to each pair of (ksubopt, nsubopt). following constraints: Step 6: Let S = (Rs,i, ki, ni), i = 1, ..., 9 denote the set of nine suboptimal solutions for nine AMR-WB codec modes. Finding the best suboptimal solution  D ≤ D in set S which has the highest value of Gr and has the lowest value of MOSr . e2e max (10) Ppl ≤ Pmax For each mode in AMR-WB codec modes, we change k and n over ranges k = 1, 2, ..., k0 and n = k + 1, ..., n0 where k0 and n0 are maximum available values of channel code rate in LTE network (n0 > k0). In order to find the most suitable suboptimal pair of (ksubopt, nsubopt) for all values of Rs, we define the MOS reduction (MOSr, %) and the percent of decreased redundant bits (Gr, %) as follows:

MOS(kbest,nbest)−MOS(ksubopt,nsubopt) MOSr = × 100 MOS(kbest,nbest) (11) (nbest−kbest)−(nsubopt−ksubopt) Gr = × 100 (12) nbest−kbest

We see that if the larger MOSr, the lower speech quality, and the higher reduced redundant bits, the higher BER (Bit Error Rate) and this leads to the lower speech quality. So Fig. 3. MOS vs. AMR-WB mode and channel code rate when pr is fixed that, in order to find the tradeoff between MOSr and Gr, equal to 10% we propose the criteria for this as follows: In the pairs of (ksubopt, nsubopt) of all modes, we find a pair which has the maximum value of Gr meeting MOSr ≤ 1% to ensure that IV. SIMULATION RESULTS AND PERFORMANCE this MOS is very close to the highest MOS. In the case there EVALUATION are many the same maximum values of Gr, the algorithm In order to simulate the algorithm, we assume simulation will choose a pair of (ksubopt, nsubopt) which has the lowest parameters as follows: value of the MOSr. This is the best suboptimal solution for • The number of speech frames per packet: f = 1 minimizing the redundant bits generated by channel coding • The network path includes 15 hops, with 13 hops are the with an acceptable MOS reduction. The steps of the proposed fast core network links: 10 hops at 622 Mb/s, and 3 hops algorithm is described as follows: at 1.8 Gb/s, and two are in the eNodeB cell at 50.4 Mb/s The proposed algorithm will find the suboptimal solutions and 25 Mb/s, respectively. for each mode of AMR-WB codec and the best suboptimal • P = 0.06 ms per hop. Q is random between 0 and 1 solution on entire modes of AMR-WB codec. This solution h h ms meets some constraints on the allowed maximum end-to- −2 • D = 150 ms. P = 10 end delay, the allowed maximum packet loss, and ensures max max • k = 4 and n = 5 the tradeoff between the MOS reduction and the percent of 0 0 reduced redundant bits generated by channel coding. Figure 3 and table II show the results in the case “raw” packet loss is fixed equal to 10%. This demonstrates that when the

5 value of MOSr(0.45%) with the suboptimal MOS equal to 4.43 while the best MOS equal to 4.45.

V. CONCLUSION In this paper, we present an adaptive algorithm for dy- namic joint source-channel code rate for voice traffic over LTE network. The proposed algorithm permits choosing a suboptimal solution for minimizing the number of redundant bits generated by channel coding. The output of the algorithm is a value pair of source code rate and channel code rate resulting in minimizing the number of redundant bits generated by channel coding with an acceptable MOS reduction based on some constraints on the maximum allowed end-to-end delay and maximum permitted packet loss. The simulation results Fig. 4. MOS vs. AMR-WB mode and channel code rate when pr is randomly changed in range of 0..15% show that in the case of the fixed “raw” packet loss, AMR-WB codec mode 7 offers the suboptimal solution with channel code TABLE II rate equal to 3/5. Otherwise, when the varied “raw” packet THE DETAILED RESULTS OF FIGURE 3 loss, AMR-WB codec mode 6 offers the suboptimal solution Mode kbest nbest MOSbest ksubopt nsubopt MOSsubopt MOSr(%) Gr(%) with channel code rate equal to 3/4. The simulation results 0 1 4 3.51 3 5 3.47 1.14 33.33 1 1 4 3.98 3 5 3.95 0.75 33.33 also demonstrate that the proposed algorithm always finds out 2 1 4 4.33 3 5 4.31 0.46 33.33 3 1 4 4.35 3 5 4.33 0.46 33.33 the suboptimal solution which meets some constraints on given 4 1 4 4.40 3 5 4.39 0.23 33.33 5 1 4 4.43 3 5 4.42 0.23 33.33 QoS information for the tradeoff between speech quality and 6 1 4 4.45 3 5 4.43 0.45 33.33 7 1 4 4.48 3 5 4.47 0.22 33.33 the number of redundant bits generated by channel coding. 8 1 4 4.42 3 5 4.40 0.45 33.33 This means there is a slightly reduction of MOS (≤ 1%) when TABLE III compared to the best MOS while we can obtain the percent THE DETAILED RESULTS OF FIGURE 4 of decreased redundant bits generated by channel coding up to 66.67%. This will lead to saving the bandwidth and so that Mode kbest nbest MOSbest ksubopt nsubopt MOSsubopt MOSr(%) Gr(%) 0 1 4 3.51 3 5 3.47 1.14 33.33 the system can server more mobile users at the same time. 1 1 5 3.99 2 4 3.97 0.50 50.00 2 1 4 4.34 4 5 4.30 0.92 66.67 However, in this study, we still do not consider the constraint 3 1 5 4.35 2 4 4.34 0.23 50.00 4 1 5 4.40 2 4 4.40 0.00 50.00 on bandwidth as well as the effect of network jitter to end-to- 5 1 5 4.43 2 4 4.43 0.00 50.00 6 1 4 4.45 3 4 4.43 0.45 66.67 end delay and packet loss. They might bring new directions 7 1 4 4.48 3 5 4.47 0.22 33.33 8 1 4 4.42 3 5 4.40 0.45 33.33 for the future work.

REFERENCES “raw” packet loss is fixed, the best MOS depends on only a [1] Jonghwan Hyun, Jian Li, ChaeTae Im, Jae-Hyoung Yoo, and James Won- Ki Hong. A volte traffic classification method in network. In Network specific channel code rate, and this is similar to the suboptimal Operations and Management Symposium (APNOMS), 2014 16th Asia- MOS. Figure 3 also shows that with the “raw” packet loss Pacific, pages 1–6. IEEE, 2014. is fixed equal to 10%, all of AMR-WB codec modes obtain [2] 3GPP. Policy and charging control architecture. TS 23.203, 3rd Generation Partnership Project (3GPP), 06 2011. the best MOS with channel code rate equal to 1/4, and the [3] Johnny Matta, Christine Pepin,´ Khosrow Lashkari, and Ravi Jain. A suboptimal MOS with channel code rate equal to 3/5. The source and channel rate adaptation algorithm for amr in voip using the results show that the suboptimal MOS is very close to the best emodel. In Proceedings of the 13th international workshop on Network and operating systems support for digital audio and video, pages 92–99. MOS while the redundant bits generated by channel coding ACM, 2003. decreased significantly. In this case, the Gr of all triples have [4] J Makinen and J Vainio. Source signal based rate adaptation for gsm amr speech codec. In Information Technology: Coding and Computing, the same value, so that, the triple (Rs,7, 3, 5) is the best 2004. Proceedings. ITCC 2004. International Conference on, volume 2, suboptimal solution because it has the lowest value of MOSr pages 308–313. IEEE, 2004. with the suboptimal MOS equal to 4.47 while the best MOS [5] Yicheng Huang, Jari Korhonen, and Ye Wang. Optimization of source equal to 4.48. Figure 4 and table III show the results in the and channel coding for . In Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on, pages 4–pp. IEEE, case the “raw” packet loss is varied in an instant. This result 2005. demonstrates that when the “raw” packet loss is changed, the [6] Fariza Sabrina and Jean-Marc Valin. Priority based dynamic rate best MOS depends on only two specific channel code rates, control for voip traffic. In Global Conference, 2009. GLOBECOM 2009. IEEE, pages 1–8. IEEE, 2009. while the suboptimal MOS depends on four specific channel [7] Mingyu Chen and Manohar N Murthi. Optimized unequal error protec- code rates. The results also show that there are 2 triples which tion for voice over ip. In Acoustics, Speech, and Signal Processing, 2004. Proceedings.(ICASSP’04). IEEE International Conference on, volume 5, have the same highest value of Gr, those are two triples of pages V–865. IEEE, 2004. (Rs,2, 4, 5) and (Rs,6, 3, 4) which meet the conditions. In this [8] A Khalifeh and Homayoun Yousefi’zadeh. Optimal rate adaptation for case, the algorithm will choose the triple (Rs,6, 3, 4) as the voip over wireless tandem links. In MILITARY COMMUNICATIONS best suboptimal solution because this triple has the lowest CONFERENCE, 2010-MILCOM 2010, pages 287–292. IEEE, 2010.

6 [9] ITUT Rec. G. 107.1, wideband e-model. Int. Telecomm. Union, Geneva, [17] Cristian Olariu, Micheal O Foghlu, Philip Perry, and Liam Murphy. Voip 2011. quality monitoring in lte . In Integrated Network Management [10] Carsten Bormann, C Burmeister, M Degermark, et al. Robust header (IM), 2011 IFIP/IEEE International Symposium on, pages 501–508. compression (rohc). Technical report, RFC 3095, June, 2001. IEEE, 2011. [11] ITUT Rec. G. 722.2 (2003) wideband coding of speech at around [18] Sebastian Moller, Alexander Raake, Nobuhiko Kitawaki, Akira Taka- 16kbit/s using adaptive multi-rate wideband (amr-wb). International hashi, and Marcel Waltermann. Impairment factor framework for wide- union, Geneve, Switzerland, 2003. band speech . Audio, Speech, and Language Processing, IEEE [12] Wikipedia. Adaptive multi-rate wideband — Wikipedia, the Transactions on, 14(6):1969–1976, 2006. free encyclopedia. http://en.wikipedia.org/w/index.php?title=Adaptive% [19] Mansour J Karam and Fouad A Tobagi. Analysis of delay and delay 20Multi-Rate%20Wideband&oldid=650770013, 2015. [Online; ac- jitter of voice traffic in the internet. Computer Networks, 40(6):711–726, cessed 16-March-2015]. 2002. [13] 3GPP. Evolved Universal Terrestrial Radio Access (E-UTRA); Mul- [20] Syed A Ahson and Mohammad Ilyas. VoIP handbook: applications, tiplexing and channel coding. TS 36.212, 3rd Generation Partnership technologies, reliability, and security. CRC Press, 2008. Project (3GPP), 10 2010. [21] Bernard Sklar. Digital communications, volume 2. Prentice Hall NJ, [14] 3GPP. Evolved Universal Terrestrial Radio Access (E-UTRA); Physical 2001. layer procedures. TS 36.213, 3rd Generation Partnership Project (3GPP), [22] Esa Piri, Jarno Pinola, Frerk Fitzek, and Kostas Pentikousis. Rohc 10 2010. and aggregated voip over fixed wimax: an empirical evaluation. In [15] Amitabha Ghosh and Rapeepat Ratasuk. Essentials of LTE and LTE-A. Computers and Communications, 2008. ISCC 2008. IEEE Symposium Cambridge University Press, 2011. on, pages 1141–1146. IEEE, 2008. [16] Liyun Pang and Laszlo Bosz¨ ormenyi.¨ Adaptive jitter buffer based on [23] Yikang Xiang, Jijun Luo, and Christian Hartmann. Inter-cell interference quality optimization under bursty packet loss. International Journal On mitigation through flexible resource reuse in ofdma based communica- Advances in Telecommunications, 5(1 and 2):1–10, 2012. tion networks. In European wireless, pages 1–4, 2007.

7