<<

arXiv:1801.02066v2 [cs.IT] 10 Aug 2018 edefine We oevr evcsin services Moreover, h hogpt o evcsin services For throughput. the is hthst emtwt aec oeac of tolerance latency a with met be to has that bits) ihtetm ttercie o rcsigcmuig to o category processing/computing, second for by The denoted delivery. receiver is of services deadline the alon overall place, at the taken meet set time has be that the delay can queueing with parameter be for The has account data scheduler. also the to the until by time transmitted the to fully refers tolerance latency the o n service any For rtctgr,dntdby denoted category, first o efrac assessment. performance for dual eiiiyi ohdmnin eadn rqec n time and frequency regarding dimensions named both in flexibility loainpolm eadestatblt n propose and tractability scalability resour with address 2-D algorithm We resulting the problem. consider allocation we [16], knowledge. structures our frame of best dimensions the resour time to and addressed of frequency been sizes yet the flexible both along with units allocation [9]– resource new selective not allocatio is dimensions resource frequency [15], and flexible time the Although both along [10]. [9], challenges tutrst rbessc smlidmninlK M similar multi-dimensional weighted yields as or such dimension problems notion frequency to domai structures general time-frequency the the the along in within Optimization allocation interva fall resource time flexible works transmission of These scalable A [2]–[7]. the [1]. is (TTIs) services latency of pro lower while variety efficiency ing large resource higher a for support solution promising to required is tems etn h eurmnso iso rtclservices. critical flexibility mission an enhancement of adopting capacity requirements for the of domains meeting frequency advantages and time algorit significant both optimization show Numeri duality. scalable Lagrangian results a and programming propose linear NP-hardn on and the based problem, serv prove We of and the requirements. presence of domain of in types domain, frequency different time with in in structure numerology frame variable flexible with allocation osdrabs tto n w aeoiso evcs The services. of categories two and station base a Consider ae n3P ees o clbenmrlge and numerologies scalable for release 3GPP on Based h fhgnrto 5)o ieescmuiain sys- communications wireless of (5G) generation fifth The Abstract eoreOtmzto ihFeil Numerology Flexible with Optimization Resource n rm tutr o eeoeeu Services Heterogeneous for Structure Frame and pc fotmzto.W hnpoienmrclresults numerical provide then We optimization. of space rma nee rgamn on fve,frequency view, of point programming integer an from -iesoa 2D eoreallocation resource (2-D) 2-dimensional W xlr h oeta fotmzn resource optimizing of potential the explore —We K = ATCHING K k ( ℓ K ∈ ) I S II. K ∪ .I I. { K lei.you K ( ( ℓ ( c ( ) ℓ NTRODUCTION 8.Rsuc piiainadopting optimization Resource [8]. ) YSTEM tlzn ohthe both utilizing , c ) K eoeisdt eadby demand data its denote , o hc h agti omaximize to is target the which for , ) r roiie vrtoein those over prioritized are ( h e falservices. all of set the ℓ ) } a titltnyrequirement. latency strict has , 3 i.us [email protected] @it.uu.se eateto cec n ehooy ikoigUniversi Link¨oping Technology, and Science of Department 1 K eateto nomto ehooy psl University Uppsala Technology, Information of Department M e You Lei ( c ODEL ) ulbfe sassumed. is buffer full , 1 iLiao Qi , primal 2 oi elLb,Sutat Germany Stuttgart, Labs, Bell Nokia oe new poses , NAPSACK pc and space τ a not has , k 2 Here, . ioasPappas Nikolaos , q K k vid- ices ( hm (in cal ess an c en ce ce in n, ls ) d g n f . , o each For ihoeo utpebok,adec lc a easge to assigned be can servi A block resource. each the blocks and of that blocks, unit Note three basic multiple service. with a or is allocation one grid resource with the of of illustration rectangle An A 1. Figure rvd h uoiladsuc oei EEDtPr [18]. DataPort IEEE in but code details source we the and omit addition, tutorial we space, the In consecutiv limited provide more to complexity). or Due high one TTI. per as the symbols overhead control to the consider due modele also paper not is this between numerologies ICI of in (the types inter- numerology different of with the neighbor type subbands same the account the between with into (ICI) subbands interference takes approximat inter-channel and model CP, the from the on This depending on mapping (ISI) based [17]. symbol-interference the throughput in predefine the to model we profiles parameters of channel and configuration multipath effect the B.2.1-4], nine block, the of Table per account [16, total throughput a achieved into assume the take we duration. compute symbol shall To and guardband. length, rate (characterized CP this range duration), Moreover, TTI frequency and and SCS span by time the including eorecngrto fnmrlg n rm tutr as structure frame and t to blocks numerology refer of we configuration hereafter a simplicity, resource with For respectively length. domain One time CP S and supported. configured adjacent frequency of is the set in aggregation a TTIs extended involves and slot service of a case if to in allocation slots symbols resource multiple 12 or (or or supported, symbols symbols CP), 14 1-13 with TTI with A slot mini-slot frame. one one a within of slots consist of can number stru by frame characterized radio t is The (a.k.a. ture symbols). length the (CP) between prefix interval” cyclic “guard and (SCS) spacing carrier K os power, noise sindt service to assigned ( n5 e ai N) ueooyi endb sub- by defined is numerology a (NR), radio new 5G In osdrtepolmo aiiigtettltruhu for throughput total the maximizing of problem the Consider ie h hne rfie h rnmsinpwr n the and power, transmission the profile, channel the Given c ) ujc oltnyadtedmn osrit for constraints demand the and latency to subject , I.P III. n osdracniaeset candidate a consider and , { ioasppa,di.yuan nikolaos.pappas, b 3 ROBLEM Frequency Yuan Di and , B ∈ r b,k h civdtruhu nblock on throughput achieved the , eed ntecngrto fblock of configuration the on depends lc 1 Block lc 4 Block k F 1 y Sweden ty, RUAINAND ORMULATION Sweden , ( and k K ∈ 4 3 r ifrn lcswt h aeshape. same the with blocks different are

sdntdby denoted is )

Time lc 2 Block } @liu.se B fbok,seFgr 1. Figure see blocks, of

T lc 3 Block RACTABILITY r b,k ye fblocks. of types . ei allocated is ce b pt one to up if K CSs b ( ing ℓ he he es c- b is ) d e 1 , . 2

We use basic unit to refer to the minimum unit of resource in A. Block Assignment the time-frequency domain in the problem formulation. The We denote by matrix u of size |B|×|K| the utility matrix for set of basic units is denoted by I. We let a = 1 if block b,i all pairs of blocks and services. An element u represents the b includes basic unit i, otherwise a = 0. Recall that a b,k b,i utility of a block-service pair (b, k) (b ∈B and k ∈ K). Block block refers to a rectangular shape located at some specific assignments for K(ℓ) and K(c) are treated separately. The location in the resource grid, see Figure 1. Taking the figure former is performed first because of the latency requirement. as an example, there are 20 × 4 = 80 basic units. Hence I = {1,..., 80}. Block 1 is of shape 1×4, and for this block, Algorithm 1 BA(S,u). The input consists of a block-service a = 1 for four elements of I. Placing this 1 × 4 shape at 1,i assignment S which may be empty, and a utility matrix u. Set all possible positions of the grid, we obtain all blocks and the S is augmented and then returned by the algorithm. We remark corresponding a-values for this particular shape. Doing so for that b overlaps with b′ if and only if ∃i ∈ I a + a ′ > 1. all shapes generates the set of all candidate blocks B. As it can b,i b ,i be seen from Figure 1, for each block, the a-values are fully 1: repeat determined by the position of the block and the numerical 2: Remove from B the blocks in S and those overlapping indexing of the basic units, and the complexity of doing with the blocks in S. ′ ′ ′ ′ one mapping equals the size of I, i.e., |I|. The complexity 3: (b , k ) ← argmaxb∈B,k∈K(ℓ) ub,k, S ←S∪{(b , k )}. (ℓ) (ℓ) ′ for obtaining all the mappings is O(|B||I|). One possible 4: if qk′ is met: K ← K \{k }. (ℓ) implementation of performing the mapping is detailed in IEEE 5: until K = φ or B = φ (ℓ) DataPort [18]. This mapping is done in the pre-processing 6: if K 6= φ: (ℓ) stage once (not every TTI), and it is totally decoupled from 7: The demand of the users left in K cannot be met. any service to be scheduled, as it only concerns the two sets 8: repeat (ℓ) (c) B and I. 9: Lines 2–3 with the notation K replaced by K . The optimization task is to select blocks for each service, 10: until B = φ such that the latency and demand requirements are met for K(ℓ), without overlapping among chosen blocks. We use The operations in Lines 2 and 4 cost O(1) by hash-map optimization variable xb,k ∈{1, 0} to indicate whether block implementations. By sorting the utilities in advance (which b (b ∈ B) is assigned to the service k (k ∈ K). A block b is costs O |B||K(ℓ)| log |B||K(ℓ)| ), along with inspecting the (ℓ) infeasible for k (k ∈ K ), if the ending time of b exceeds τk. other algorithm operations, one can conclude that the overall  This is modeled by setting rb,k =0. The problem is formulated complexity is of O (|B||K| log (|B||K|)). below. The two sets of constraints impose the demand for K(ℓ) and block non-overlapping, respectively. B. Utility Estimation by LP Relaxation [P0] max rb,kxb,k (1a) x∈{0,1} b (c) One way to compute the utility matrix u is to solve the LP X∈B k∈KX (ℓ) relaxation of P0 and to use the LP optimum xLP. s.t. rb,kxb,k ≥ qk, k ∈ K (1b) b∈B X [P0-LP] max rb,kxb,k s.t. (1b), (1c) (2a) (1c) 0≤x≤1 ab,ixb,k ≤ 1, i ∈ I b∈B k∈K(c) k b X X X∈K X∈B We denote by uLP = xLP the LP-based utility. Also, xLP can Theorem 1: P0 is NP-hard. be used for initialization: S = {(b, k): uLP,b,k ≥ ρ,b ∈B, k ∈ Proof: We construct a polynomial-time reduction from K} with ρ being a threshold. the PARTITION PROBLEM (PP) for a set of integers {d1,...,dn}. The task is to determine whether or not there is a partition such that the two subsets have equal sum of C. Utility Estimation by LD n i=1 di/2, where the numerator is assumed to be even. We define a single TTI size and multiple blocks that take the shape By relaxing the constraints (1c) of P0 with Lagrangian Pof one basic unit. There are two services, denoted by kℓ and multiplier λi (i ∈ I), the Lagrangian is defined as follows: kc, in K(ℓ) and K(c), respectively. The latency parameter of kℓ n equals that of the TTI size, and the demand equals i=1 di/2. L(x, λ)= rb,kxb,k+ λi 1 − ab,ixb,k Moreover, rb,1 = rb,2 = db,b = 1,...,n. By construction, b∈B k∈K(c) i∈I b∈B k∈K ! (1c) has no effect. Next, one can observe that partitioningP the n X X X X X basic units into two subsets, each providing a total throughput The LD function is defined in (3). n of i di/2, is equivalent to a feasible solution to PP. In =1 [P1] g(λ)= max L(x, λ) s.t.(1b) (3) addition, this can occur if and only if the objective function x∈{0,1} P c n defined for k reaches i=1 di/2. Hence the conclusion. Accordingly, we have the LD problem: IV. PROBLEM SOLVING [P0-LD] min g(λ). (4) We propose a sub-optimal but low-complexity algorithm, λ≥0 consisting in performing assignment of blocks to services, based on utility values generated from linear programming We define αb = i∈I λiab,i. Problem P1 decomposes for (LP) relaxation and the Lagrangian dual (LD). K(c) and K(ℓ): The constraints (1b) are only for K(ℓ). There- P 3 fore, solving P1 amounts to solving the two problems P2 and kHz, µ =0, 1, 2 ...) originate from Release 15 [20, Table 4.2- P3 for K(c) and K(ℓ) respectively, shown below. 1]. Note that Release 15 also specifies subcarrier spacing up to 240 kHz. However, by [21, Table I], a TTI of 0.125 ms [P2] max (r − α )x (5a) x b,k b b,k meets all the worst-case transmission latencies for the listed b (c) X∈B k∈KX 5G ultra-reliable low-latency communication configurations. s.t. xb,k ≤ 1, b ∈B (5b) Parameter settings are given in Table I. We test (c) our algorithm for a set of candidate thresholds ρ ∈ k∈KX (c) {0.05, 0.5,..., 0.95} among which the one achieves the best xb,k ∈{0, 1}, b ∈B, k ∈ K (5c) objective is selected. The maximum sub-gradient iterations is set to 200. While calculating the block rates, the impact on capacity due to guardband is included by following the model [P3] min αbxb,k (6a) x∈{0,1} in [14]. The rate reduction due to control overhead follows that b (ℓ) X∈B k∈KX in [22], where two symbols per TTI constitute the overhead. (ℓ) s.t. rb,kxb,k ≥ qk, k ∈ K (6b) We emphasize on accurate assessment in terms of optimality, b X∈B that is, how much does the proposed algorithm perform with respect to global optimum. We use the global optimum Note constraints (5b) are not present in P0, though these are obtained by solving the integer programming problem (1) via (c) implied by (1c) for services in K . Computing the optimum a solver. This is not a scalable method. The purpose here of P2 is straightforward. Each block b is allocated to the is for benchmarking, to demonstrate that our low-complexity service argmaxk rb,k − αb with rb,k − αb > 0. algorithm has little loss in optimality. The number of users as (ℓ) Problem P3 further decomposes to K problems P3[k] well as the bandwidth is chosen such that the global optimum (k ∈ K(ℓ)), each with objective max α x , constraints b∈B b b,k can be obtained with reasonable amount of computing effort. b∈B rb,kxb,k ≥ qk, and binary variables x b,k (b ∈B). Similarly, in view of the computational effort of obtaining P global optimum for benchmarking, we do not include all TTI P [P3[k]] min αbxb,k (7a) x∈{0,1} sizes that are permitted by 5G NR [20]. b X∈B s.t. rb,kxb,k ≥ qk (7b) Table I b SIMULATION PARAMETERS. X∈B Each P3[k] can be reformulated as a KNAPSACK PROBLEM, Parameter Value and optimally solved by dynamic programming. Number of users 10 with |K(ℓ)| = |K(c)| = 5 The dual problem P0-LD can be solved using a sub- Time-frequency domain 2 ms and 2 MHz x(h) SNR range [5, 30] (dB) gradient method [19]. Denote by LD the LD solution in Demand q {16, 32, 64, 128, 256, 512} (kbps) the hth iteration of the sub-gradient method. We let uLD = Latency tolerance τ {0.25, 0.5, 1.0, 1.5, 2} (ms) x(h) Threshold (Section IV-B) ρ ∈ {0.05, 0.1,..., 0.95} h LD to be the LD-based utility. P Figure 2 shows the average bit rate of services in K(c) with D. Algorithm Implementation respect to the latency tolerance τ of K(ℓ). For the non-flexible structures, Shape 1, Shape 2, and Shape 3 are used separately. In addition to BA(S,uLP ) and BA(S,uLD ), we consider al- Each of these structures is referred to in the format of “TTI- gorithm “LP+LD” that returns the best solution of BA(S,uLP ) SCS” (e.g. 0.25ms-30kHz means a shape of a fixed TTI of and BA(S,uLD ). We remark that BA(S,uLD ) is quite flexible in terms of computational effort, as one can use accumulated 0.25 ms and a fixed SCS of 30 kHz). The flexible structure significantly outperforms the non- uLD before full convergence. Overall, the algorithm scales well. Moreover, if necessary, the service sets can be decom- flexible ones. The system tends to benefit more from flexible posed into subsets, and the algorithm can be applies to one structure when the latency tolerance becomes more stringent. subset at a time to further reduce complexity. Note that our algorithm with flexible structure is near-optimal. Among the three non-flexible schemes, 0.25ms-30kHz out- performs the other two. This result is related to that, in the V. NUMERICAL RESULTS optimization problem, the throughput of the services in K(c), is The use of flexible numerology is expected to outperform subject to latency constraints of services in K(ℓ). For resource fixed numerology. The purpose of performance evaluation is to allocation, blocks of 0.5ms-15kHz and 0.125ms-60kHz have examine the amount of improvement, which is of significance low flexibility in the time and frequency domains, respectively. in particular as the control channel overhead for supporting The former does not have many choices in meeting the latency the flexible structure is accounted for. The result also tell how requirements of K(ℓ), leading to poor throughput for K(c). well the proposed algorithm is suited for the flexible structure. The latter is not efficient on the frequency domain (due to Comparing to LTE that applies a fixed SCS of 15 kHz and frequency selectivity), though this inefficiency is mitigated TTI of 1.0 ms, we consider four shapes, Shape 1, Shape 2, when the latency tolerance is high, as more choices become Shape 3, and Shape 4, with SCS being 15 kHz, 30 kHz, 60 available in the time domain. Blocks of 0.25ms-30kHz strikes kHz, and 60 kHz, CP 4.7 µs, 2.3 µs 1.2 µs, and 4.17 µs, and a balance between short TTI size (to meet latency-constrained the number of symbols 7, 7, 7, and 6, respectively. The TTI services) and flexibility in the frequency domain. durations of the four shapes are 0.5 ms, 0.25 ms, 0.125 ms, Without showing by the figures, we remark that the problem and 0.125 ms, respectively. The numerologies (∆f =2µ × 15 feasibility of the three non-flexible schemes is very sensitive 4

to the latency tolerance. This issue is alleviated by the flexible ACKNOWLEDGEMENT structure. In comparison to the related work [23] considering This work has been partially supported by European Union advantages of flexible numerology, our results emphasize the H2020 MSCA projects ACT5G (643002) and DECADE significance of block-service assignment optimization. (645705), and the Center for Industrial Information Tech- nology (CENIIT). The work of the first author was partly accomplished while he was at Link¨oping University, Sweden. 0.125ms-60kHz 0.5ms-15kHz 0.25ms-30kHz

(kbps) 600 Flexible (LP+LD) Flexible (Optimal) REFERENCES ) c ( [1] B. Soret, P. Mogensen, K. I. Pedersen, and M. C. Aguayo-Torres, K “Fundamental tradeoffs among reliability, latency and throughput in 400 cellular networks,” in IEEE Globecom Workshops, 2014, pp. 1391–1396. [2] K. Pedersen, F. Frederiksen, G. Berardinelli, and P. Mogensen, “A flexible frame structure for 5G wide area,” in IEEE VTC-Fall, 2015, 200 pp. 1–5. [3] G. Pocovi, B. Soret, K. I. Pedersen, and P. Mogensen, “MAC layer enhancements for ultra-reliable low-latency communications in cellular networks,” in IEEE ICC Workshops, 2017, pp. 1005–1010. Bit rate per user in 0 [4] K. Pedersen, G. Pocovi, J. Steiner, and S. Khosravirad, “Punctured 0.25 0.5 1.0 1.5 2.0 scheduling for critical low latency data on a shared channel with mobile broadband,” in IEEE VTC-Fall, 2017, pp. 1005–1010. Latency tolerance τ of K(ℓ) (ms) [5] Q. Liao, P. Baracca, D. Lopez-Perez, and L. G. Giordano, “Resource scheduling for mixed traffic types with scalable TTI in dynamic TDD systems,” in IEEE Globecom Workshops, 2016, pp. 1–7. (c) Figure 2. The figure shows bit rate of K with respect to latency tolerance [6] E. Fountoulakis, N. Pappas, Q. Liao, V. Suryaprakash, and D. Yuan, (ℓ) (ℓ) τ of K . The bit rate demand of K equals 128 kbps. All the three cases “An examination of the benefits of scalable TTI for heterogeneous traffic of non-flexible structures are solved to optimality. When τ = 0.25, using the management in 5G networks,” in WiOpt, 2017, pp. 1–6. block 0.5ms-15kHz results in infeasibility. [7] A. Anand, G. de Veciana, and S. Shakkottai, “Joint scheduling of URLLC and eMBB traffic in 5G wireless networks,” arXiv.org, Dec. Figure 3 shows the optimality gaps as function of the 2017. (ℓ) r [8] Y. Zhang and C. Leung, “Resource allocation in an OFDM-based demand of K . Here we also include BA(φ, ), which uses cognitive radio system,” IEEE Transactions on Communications, vol. 57, the throughput of each block-service pair as the utility. One no. 7, pp. 1928–1931, 2009. can observe that in general the optimality gap increases with [9] T. Wang, H. Feng, and B. Hu, “Two-dimensional resource allocation for OFDMA system,” in IEEE ICC Workshops, 2008, pp. 1–5. the user demand. Meanwhile, searching in the dual space [10] Y. Ben-Shimol, I. Kitroser, and Y. Dinitz, “Two-dimensional mapping for computing block-service utilities in most cases leads to for wireless OFDMA systems,” IEEE Transactions on Broadcasting, significantly better results than considering the LP relaxation. vol. 52, no. 3, pp. 388–396, 2006. r u [11] L. Zhang, A. Ijaz, P. Xiao, A. Quddus, and R. Tafazolli, “Subband fil- With high user demand, BA(φ, ) and BA(S, LP ) are clearly tered multi-carrier systems for multi-service wireless communications,” inferior to the others. Basically, using LD for utility estimation IEEE Transactions on Wireless Communications, vol. 16, no. 3, pp. significantly reduces the optimality gap. In addition, combin- 1893–1907, 2017. u u [12] A. A. Zaidi, R. Baldemair, H. Tullberg, H. Bjorkegren, L. Sundstrom, ing BA(S, LP ),BA(S, LD ) leads to further optimality gap J. Medbo, C. Kilinc, and I. D. Silva, “Waveform and numerology to sup- reduction, indicating that LP and LD are complementary to port 5G services and requirements,” IEEE Communications Magazine, each other. Overall, the gap of LP+LD is below 10%. vol. 54, no. 11, pp. 90–98, 2016. [13] P. Guan, D. Wu, T. Tian, J. Zhou, X. Zhang, L. Gu, A. Benjebbour, M. Iwabuchi, and Y. Kishiyama, “5G field trials: OFDM-based wave- forms and mixed numerologies,” IEEE Journal on Selected Areas in r u 0.3 BA(φ, ) BA(S, LP) Communications, vol. 35, no. 6, pp. 1234–1243, 2017. BA(S,uLD) LP+LD [14] A. Yazar and H. Arslan, “A flexibility metric and optimization methods for mixed numerologies in 5G and beyond,” IEEE Access, vol. 6, pp. 3755–3764, 2018. [15] A. Ijaz, L. Zhang, M. Grau, A. Mohamed, S. Vural, A. U. Quddus, M. A. 0.2 Imran, C. H. Foh, and R. Tafazolli, “Enabling massive IoT in 5G and beyond systems: PHY radio frame design considerations,” IEEE Access, vol. 4, pp. 3322–3339, 2016.

Optimality gap [16] 3GPP. (2017, Apr.) TS 36.101, Evolved Universal Terrestrial Radio 0.1 Access (E-UTRA); User Equipment (UE) radio transmission and reception. Release 14. [Online]. Available: http://www.3gpp.org [17] M. Batariere, K. Baum, and T. P. Krauss, “Cyclic prefix length analysis for 4G OFDM systems,” in IEEE VTC-Fall, 2004, pp. 543–547. 16 32 64 128 256 512 [18] L. You, Q. Liao, N. Pappas, and D. Yuan, “2D resource allocation,” 2018. [Online]. Available: http://dx.doi.org/10.21227/ch8e-x385 K(ℓ) Bit rate demand of (kbps) [19] S. Sen and H. D. Sherali, “A class of convergent primal-dual subgradient algorithms for decomposable convex programs,” Mathematical Program- Figure 3. The figure shows the optimality gaps of the solutions obtained. For ming, vol. 35, no. 3, pp. 279–297, 1986. instance, “0.1” in the y-axis in Figure 3 means that the relative deviation to [20] 3GPP TS 38.211, “NR; Physical channels and modulation,” Tech. Rep. the optimum is 10% on average. V15.1.0, 2018. [21] J. Sachs, G. Wikstrom, T. Dudda, R. Baldemair, and K. Kittichokechai, “5G radio network design for ultra-reliable low-latency communication,” IEEE Network, vol. 32, no. 2, pp. 24–31, March 2018. [22] H. Miao and M. Faerber, “Physical downlink control channel for 5G VI. CONCLUSION new radio,” in European Conference on Networks and Communications, 2017, pp. 1–5. We suggest that combining a flexible numerology and frame [23] X. Zhang, M. Jia, L. Chen, J. Ma, and J. Qiu, “Filtered-OFDM - enabler structure serves as a promising option for spectral efficiency. for flexible waveform in the 5th generation cellular networks,” in IEEE Utilizing LP and LD enables efficient problem solving. GLOBECOM, 2015, pp. 1–6.