On the Impact of Spatial Correlation and Precoder Design on the Performance of MIMO Systems with Space-Time Coding

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

April 19 - 24, Taipei, Taiwan, 2009

c 2009 IEEE. Published in the IEEE 2009 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), scheduled for April 19 - 24, 2009 in Taipei, Taiwan. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works, must be obtained from the IEEE. Contact: Manager, Copyrights and Permissions / IEEE Service Center / 445 Hoes Lane / P.O. Box 1331 / Piscataway, NJ 08855-1331, USA. Telephone: + Intl. 908-562-3966.

EMIL BJORNSON,¨ BJORN¨ OTTERSTEN, AND EDUARD JORSWIECK

Stockholm 2009

KTH Royal Institute of Technology ACCESS Linnaeus Center Signal Processing Lab

IR-EE-SB 2009:002, Revised version with minor corrections ON THE IMPACT OF SPATIAL CORRELATION AND PRECODER DESIGN ON THE PERFORMANCE OF MIMO SYSTEMS WITH SPACE-TIME CODING

Emil Bjornson,¨ Bjorn¨ Ottersten Eduard Jorswieck

ACCESS Linnaeus Center Chair of Communication Theory Signal Processing Lab Communications Laboratory Royal Institute of Technology (KTH) Dresden University of Technology {emil.bjornson,bjorn.ottersten}@ee.kth.se [email protected]

ABSTRACT In this paper, we consider the SER performance of spatially cor- related multiple-input multiple-output (MIMO) systems with OST- The symbol error performance of spatially correlated multi-antenna BCs. The influence of and correlation on the SER will be systems is analyzed herein. When the transmitter only has statistical analyzed in terms of Schur-convexity [8]. Previously, the Chernoff channel information, the use of space-time block codes still permits bound on the SER has been analyzed in this manner assuming a cer- spatial multiplexing and mitigation of . The statistical infor- tain type of modulation [9]. The error performance was shown to mation can be used for precoding to optimize some quality measure. decrease with increasing correlation in most scenarios, except when Herein, we analyze the performance in terms of the symbol error rate an SER minimizing precoder was employed; then, the performance (SER). It is shown analytically that spatial correlation at the receiver improves as the correlation increases at the transmitter side. Herein, decreases the performance both without precoding and with an SER the results of [9] are generalized to cover a much larger class of mod- minimizing precoder. Without precoding, correlation should also be ulation schemes and to consider the exact value of the SER. avoided at the transmitter side, but with an SER minimizing precoder the performance is actually improved by increasing spatial correla- First, an introduction to the problem and to the mathematical tion at the transmitter. The structure of the optimized precoder is technique of majorization will be given. Then, we will review the analyzed and the asymptotic properties at high and low SNRs are SER expressions for PAM, PSK, and QAM constellations, and show characterized and illustrated numerically. how they all share the same general structure. This result will be exploited in the analysis of the SER, where we consider the case Index Terms— Linear Precoding, Majorization, MIMO Sys- of space-time coded transmissions without precoding and with SER tems, Orthogonal Space-Time Block Codes, Symbol Error Rate. minimizing precoding. The impact of spatial correlation on the SER will be derived analytically and then illustrated numerically. 1. INTRODUCTION 1.1. Notation The use of multiple antennas at the transmitter and receiver sides in wireless communication systems has the potential of dramatically Vectors and matrices are denoted with boldface in lower and upper increasing the throughput in environments with sufficient scattering. case, respectively. The of two matrices X and With full channel state information (CSI) available at both the trans- Y is denoted X ⊗ Y. The vector space of dimension n with non- n mitter and receiver sides, low complexity receivers exist that can negative and real-valued elements is denoted R+. The Frobenius realize this throughput increase [1]. Unfortunately, full CSI at the norm of a matrix X is denoted kXk. transmitter is unrealistic in many fast fading scenarios since it would require a prohibitive feedback load. The long-term channel statistics 2. SYSTEM MODEL AND PRELIMINARIES can, on the other hand, usually be considered as known since they vary much slower than the channel realization. When required, in- We consider a correlated Rayleigh flat-fading channel with n trans- stantaneous CSI can also be achieved at the receiver from training T mit antennas and n receive antennas. The channel is represented signalling [2]. R by the matrix H ∈ nR×nT and is assumed to follow the Kronecker The primary use of multiple antennas is to increase the reliabil- C model ity by mitigating fading and to increase throughput by transmitting H = R1/2HR1/2, several data streams in parallel, cf. [3]. When the transmitter is com- R e T (1) nR×nR nT ×nT pletely unaware of the channel, one common way of achieving these where RR ∈ C and RT ∈ C are the positive semi- two goals is to use orthogonal space-time block codes (OSTBCs) definite correlation matrices at the receiver and transmitter side, re- n ×n [4]. Herein, we assume that the transmitter has statistical CSI, which spectively. He ∈ C R T has i.i.d. complex Gaussian elements with makes it possible to adapt the coding to the channel statistics. Linear zero-mean and unit variance. The receive and transmit correlation precoding of OSTBCs was proposed in [5], and has recently been matrices are arbitrarily spatially correlated. analyzed in [6] and [7] with respect to the symbol error rate (SER). The transmission takes place using OSTBCs that code K sym- K bols over T symbols slots (i.e., the coding rate is T ). Let s = This work is supported in part by the FP6 project Cooperative and Op- T K portunistic Communications in Wireless Networks (COOPCOM), Project [s1, . . . , sK ] ∈ C represent these K data symbols, where each 2 Number: FP6-033533. Bjorn¨ Ottersten is also with securityandtrust.lu, Uni- symbol si ∈ A has the average power E{|si| } = γ and belongs to versity of Luxembourg. the constellation set A (different constellations will be considered). B×T These symbols are coded in an OSTBC matrix C(s) ∈ C that 2.2. Definitions of Majorization and Schur-Convexity fulfills the orthogonality property C(s)C(s)H = ksk2I, see [4] for details. The spatial coding dimension is B and a linear precoder The spatial correlation can be measured in the distribution of eigen- n ×B H values of the correlation matrices; low correlation is represented W ∈ C T with the power constraint tr(WW ) = 1 is used to project the code into advantageous spatial directions [5]. Observe by eigenvalues that are almost identical, while high correlation that the precoder is normalized such that the average transmit power means that a few eigenvalues are dominating. Herein, we assume allocated per symbol is E{kWC(s)k2}/k = γ. Under these as- that all eigenvalues are ordered in non-decreasing order and we nR×T will use the notion of majorization to compare systems [8]. If sumptions, the received signal Y ∈ C is T T x = [x1, . . . , xM ] and y = [y1, . . . , yM ] are two non-negative vectors, then we say that x majorizes y if Y = HWC(s) + N, (2) m m M M X X X X where the power of the system has been normalized such that the xk ≥ yk, m=1,...,M − 1, and xk = yk. n ×T additive white noise N ∈ C R has i.i.d. complex Gaussian ele- k=1 k=1 k=1 k=1 ments with zero-mean and unit variance. As shown in [6, 10], the use This property is denoted x  y. If x and y contain eigenvalues, of OSTBCs makes it possible to decompose (2) into K independent then x  y corresponds to that x is more spatially correlated than y. and virtual single-antenna systems as Majorization only provides a partial order of vectors, but is still very

0 0 powerful due to its connection to certain order-preserving functions: yk = kHWksk + nk, k = 1,...,K, (3) A function f(·) is said to be Schur-convex if f(x) ≥ f(y) for all x and y, such that x  y, and Schur-concave if f(x) ≤ f(y). 0 where nk is complex Gaussian with zero-mean and unit variance. 3. ANALYSIS OF THE SYMBOL ERROR RATE 2.1. Expressions for the Symbol Error Rate In this section, we will analyze the impact of spatial correlation on Herein, the performance measure will be the SER; that is, the proba- the system performance. The analysis is divided into two cases: bility that the receiver makes an error in the detection of received without precoding and with SER minimizing precoding. Observe symbols. The SER depends strongly on the signal-to-noise ratio that all analytic results in this paper are derived for arbitrary func- (SNR) and the type of symbol constellation. Next, we will present tions of the type introduced in Definition 1 (and linear combinations SER expressions for some commonly considered constellations, but of them). The three SER functions in (5) are just examples of func- first we introduce a definition. tions of this type, so the results can potentially be used for other functions (either corresponding to SERs or something else). H T In this section, we let the eigenvalues of the correlation matrices Definition 1. Let Φ RR ⊗(WW RT ) and define the function , nT RT and RR be gathered in non-decreasing order in λT ∈ R+ and λ ∈ nR , respectively. 1 Z b dθ R R+ Fa,b(g) , g ≥ 0, b ≥ a. (4) , π det I + γg Φ a sin2(θ) 3.1. Schur-Convexity Without Precoding

We will consider the SER for three different types of constel- First, we consider the case without linear precoding, represented by 1 3 √ B = nT and W = n I. Under these circumstances, the follow- lations: PAM, PSK, and QAM. Let gPAM , M 2−1 , gPSK , T 2 π 3 ing theorem and its corollary show that the SER always increases sin ( ), and gQAM . Then, the exact SER of the system M , 2(M−1) with the spatial correlation (i.e., the performance is degraded). in (3) was derived in [11, 6] and can be expressed as T RT Theorem 1. Consider the function Fa,b(g) with Φ RR ⊗ . 2(M −1) , nT SERPAM = F0, π (gPAM), This function is Schur-convex with respect to λT when λR is fixed M 2 and Schur-convex with respect to λR when λT is fixed. SERPSK = F0, M−1 π(gPSK), (5) √M Proof. The proof follows by analyzing the integrand of (4) in a sim- 4( M −1) √  SERQAM = F0, π (gQAM) + MF π , π (gQAM) , ilar way as in the proof of Theorem 1 in [9]. M 4 4 2 Corollary 1. The SERs in (5) with M-PAM, M-PSK, and M-QAM for M-PAM, M-PSK, and M-QAM constellations, respectively. are all Schur-convex with respect to the receive correlation for fixed Observe that the integral in Definition 1 is the main building stone transmit correlation, and vice versa. in the SER expressions. The determinant in (4) can be expressed as 3.2. Schur-Convexity With Optimal Precoding nT nR   γg  Y γg det I + Φ = 1 + λi(Φ) , (6) Next, we consider the case when the precoder W is chosen to mini- sin2(θ) sin2(θ) i=1 mize the SER. For this purpose, we define the following function.

H T where λi(Φ) denotes the ith largest eigenvalue of Φ. Hence, we Definition 2. Let Φ , RR ⊗(WW RT ) and define the function conclude that the eigenvalues of Φ (and not the eigenvectors) deter- Ga,b,c,d(g) , min C1Fa,b(g) + C2Fc,d(g), (7) mines the SER. In the next sections we will analyze how the system W, tr(WWH )=1 performance depends on the spatial correlation and thus we focus on comparing systems with different eigenvalue distributions. where C1 and C2 are non-negative constants, b ≥ a, and d ≥ c.

1 Observe that all the SER expressions in (5) can be expressed p

n 1 as Ga,b,c,d(g) when SER minimizing precoding has been applied. p 0.8 2 Let the eigenvalue decomposition of the transmit correlation matrix p H 3 be RT = UT ΛT UT , where the diagonal matrix ΛT contains the p 0.6 4 eigenvalues in non-decreasing order and the unitary matrix UT con- tains the corresponding eigenvectors. It is known that the precoder 0.4 that gives Ga,b,c,d(g) (i.e., the SER minimizing precoder) can be ex- nT ×B pressed as W = UT ∆, where ∆ ∈ C is a rectangular diago- 0.2 H H nal matrix [6]. Hence, we have that WW = UT ΛW UT , where Normalized power allocatio H 0 ΛW , ∆∆ = diag(p1, . . . , pB , 0,..., 0) represents the power 5 10 15 20 25 30 35 40 45 50 assigned to different transmit eigenmodes. The following lemma SNR (dB) gives the asymptotically optimal precoders at low and high SNRs (the latter was proved in [6] in the case of correlation with full rank). Fig. 1. The SER minimizing (normalized) power allocation ΛW = diag(p1, p2, p3, p4) as a function of the SNR. The transmitter and Lemma 1. Consider the function Ga,b,c,d(g) and let the optimal precoder be denoted W. Let the SNR be represented by γ and the receiver are equipped with four antennas, with a fixed correlation H of 0.5 between adjacent antennas in each array [12]. let RT = UT ΛT UT be the eigenvalue decomposition of RT , where the diagonal matrix ΛT contains the eigenvalues in non- decreasing order and the unitary matrix UT contains the corre- Proof. The expression in (8) is the Chernoff bound on Fa,b(g). The sponding eigenvectors. Then, W is given by W = UT ∆Ue , where Schur-concavity follows from Schur’s condition [8], Lemma 1, and U ∈ B×B is an arbitrary unitary matrix and Λ ∆∆H is e C W , a Maclaurin expansion that is valid for certain γ. diagonal and has rank(ΛW ) ≤ B. The optimal power allocation at low SNR is ΛW = diag(1, 0,..., 0) (i.e., selective alloca- Next, the results of this section will be illustrated numerically tion to the strongest eigenmode), while the allocation is ΛW = and it will become apparent that the SER is (at least approximately) diag( 1 ,..., 1 , 0,..., 0) at high SNR (i.e., equal allocation to the Be Be Schur-concave with respect to the transmit correlation for a much ˜ B dominating eigenmodes) where Be = min(B, rank(RT )). larger SNR region than the one stated in Lemma 2. For an SER minimizing precoder, the following theorem and its 4. NUMERICAL EXAMPLES corollary show how the SER behaves with respect to the spatial cor- relation. The asymptotic results of Lemma 1 play an important role In this section, the precoder properties of Lemma 1 and the Schur- since the Schur-convexity properties change with the SNR. convexity results of Corollary 2 will be illustrated numerically. We consider a system where the transmitter and the receiver are Theorem 2. The function Ga,b,c,d(g) is Schur-convex with respect equipped with four antennas each. The antenna correlation follows to λR (when λT is fixed) independently of the SNR. The function is the exponential model [12], which in principle models a uniform Schur-convex with respect to λT (when λR is fixed) at high SNR, while it is Schur-concave at low SNR. linear array (ULA) with the correlation between adjacent antenna elements as a parameter. An OSTBC with rate 3/4 and a spatial Proof. The proof follows from Schur’s condition [8] and Lemma 1 coding dimension of 4 is used, for example the one proposed in by differentiation of Ga,b,c,d(g) and some identification. [4], and the symbol constellation is 16-QAM. For normalization purposes, we let tr(RT ) = tr(RR) = 1. Thus, the transmit power Corollary 2. Consider the SERs in (5) with M-PAM, M-PSK, and (per data symbol) γ also equals the received SNR (with equal power M-QAM when an SER minimizing linear precoder is used. These allocation) and the transmitted SNR. function are Schur-convex with respect to the receive correlation (for Let the power allocation of the SER minimizing precoder be de- fixed transmit correlation). They are also Schur-convex with respect noted ΛW = diag(p1, p2, p3, p4), where pk is applied to the kth to the transmit correlation (for fixed received correlation) at high strongest transmit eigenmode. In Fig. 1, the SER minimizing power SNR, but Schur-concave at low SNR. allocation is given as a function of the SNR (in terms of γ). The antenna correlation is fixed at 0.5 at both sides, which corresponds From Corollary 2, we draw the conclusion that even with an to the eigenvalue distribution [0.5214, 0.2500, 0.1349, 0.0938]T . In optimal precoder, correlation at the receiver side will always degrade line with Lemma 1, we observe that all power is allocated to the the performance. The combination of SER minimizing precoding dominating eigenmode at low SNRs and that the system approaches and spatial correlation at the transmitter side will however improve equal power allocation at high SNRs. In between these extremes, the the performance at low SNR, while correlation still might be bad at optimal power allocation has the typical waterfilling behavior. high SNR. Thus, it is of practical importance to quantify what low Next, we keep the transmit correlation fixed at 0.5, while the SNR actually means in this context. An indication is given by the correlation between adjacent receive antennas is changed between following lemma that treats an upper bound on Fa,b(g). 0 and 1 (i.e., completely uncorrelated and correlated, respectively). The SER with SER minimizing precoding is shown in Fig. 2 at dif- Lemma 2. Consider the following upper bound on Fa,b(g): ferent SNRs. It is seen that the SER is Schur-convex with respect to 1 b − a the receive correlation eigenvalues at all SNRs, which confirms the Fa,b(g) ≤ . (8) π det I + γgΦ results of Corollary 2. Finally, we keep the receive correlation fixed at 0.5, while the The expression on the right hand side of (8) is Schur-concave with transmit correlation is changed from the completely uncorrelated to respect to the transmit correlation (for fixed receive correlation) for the completely correlated case. The SER with SER minimizing pre- all SNR γ such that γ ≤ 1 . coding is shown in Fig. 3 at different SNRs. At 0, 15, and 25 dB, the gtr(RT )tr(RR) 0 SER is Schur-concave (i.e., decreases with the spatial correlation). 10 0 dB 15 dB At very high SNR (e.g., 35 dB), the SER is Schur-convex, while

there is a transisition interval when the SER first increases with the e 25 dB correlation and then decreases again. −5 10 From Lemma 2, we expect the low SNR-region (where p1 = 1 30 dB and the SER is Schur-concave with respect to the transmit correla- tion) to include γ ≤ 10 dB. In comparsion with the numerical ex-

Symbol Error Rat −10 35 dB amples, this condition seems rather strict. In Fig. 1, selective power 10 allocation (i.e, beamforming) is optimal for γ ≤ 17 dB, and it is seen in Fig. 3 that the SER is Schur-concave for SNRs up to at least 25 dB. An important observation is that when the SER transistions 0 0.2 0.4 0.6 0.8 1 from a Schur-concave function to a Schur-convex function, it has Correlation Between Adjacent Receive Antennas −6 already reached such low values (below 10 ) that the dependence Fig. 2. The SER as a function of the correlation between adjacent on the spatial correlation is negligible. Thus, we conclude that the receive antennas in a four-antenna array [12]. The transmitter has SER is Schur-concave at low to medium SNRs and approximately four antennas with the corresponding fixed correlation of 0.5. The Schur-concave at high SNRs. Schur-convexity properties of the SER are shown at different SNRs.

0 5. CONCLUSIONS 10 0 dB 15 dB

In this paper, the performance of spatially correlated Kronecker- e 25 dB −5 structured MIMO systems with orthogonal space-time block cod- 10 ing has been analyzed in terms of the symbol error rate. The re- 30 dB sults cover the exact SER for M-PAM, M-PSK, and M-QAM, but also a wider range of functions that may correspond the SERs for

Symbol Error Rat −10 other constellations. Using majorization and the notion of Schur- 10 35 dB convexity, it has been shown analytically that the SER is Schur- convex with respect to the spatial correlation at the receiver side (i.e., the error increases with the correlation). This holds both in the case 0 0.2 0.4 0.6 0.8 1 Correlation Between Adjacent Transmit Antennas without precoding and with an SER minimizing linear precoder. The SER is also Schur-convex with respect to the transmit correlation Fig. 3. The SER as a function of the correlation between adjacent without precoding, while transmit correlation actually improves the transmit antennas in a four-antenna array [12]. The receiver has four performance when an SER minimizing linear precoder is used. Ana- antennas with the corresponding fixed correlation of 0.5. The Schur- lytically, the SER is Schur-concave with respect to the transmit cor- convexity properties of the SER are shown at different SNRs. relation at low SNRs (i.e., the error decreases with increasing trans- mit correlation) while it still becomes Schur-convex at high SNRs. An approximate lower bound on the SNR region of Schur-concavity [6] A. Hjørungnes and D. Gesbert, “Precoding of orthogonal has been derived, but as illustrated numerically, the SER is actually space-time block codes in arbitrarily correlated MIMO chan- approximately Schur-concave at all SNRs. nels: Iterative and closed-form solutions,” IEEE Trans. Wire- The SER minimizing precoder scales the eigenvalues of the less Commun., vol. 6, pp. 1072–1082, 2007. transmit correlation matrix, and the statistical waterfilling behavior [7] K.T. Phan, S.A. Vorobyov, and C. Tellambura, “Precoder of these scaling factors has been illustrated numerically. Finally, it design for space-time coded MIMO systems with correlated has been shown that selective and equal power allocation is asymp- channels,” in Proceedings of Canadian Con- totically optimal at low and high SNRs, respectively. ference on Electrical and Computer Engineering, 2007. [8] E. Jorswieck and H. Boche, “Majorization and matrix- 6. REFERENCES monotone functions in wireless communications,” Foundations and Trends in Communication and , vol. 3, [1] E. Telatar, “Capacity of multi-antenna Gaussian channels,” Eu- pp. 553–701, 2007. ropean Trans. Telecom., vol. 10, pp. 585–595, 1999. [9] E. Bjornson,¨ P. Devarakota, S. Medawar, and E. Jorswieck, [2] E. Bjornson¨ and B. Ottersten, “Training-based Bayesian “Schur-convexity of the symbol error rate in correlated MIMO MIMO channel and channel norm estimation,” in Proc. IEEE systems with precoding and space-time coding,” in Proc. ICASSP’09, 2009. Nordic Radio Science and Commun. (RVK’08), 2008. [3] M. Vu and A. Paulraj, “MIMO wireless linear precoding,” [10] S. Sandhu and A. Paulraj, “Space-time block codes: A capacity IEEE Signal Processing Magazine, vol. 24, no. 5, pp. 86–105, perspective,” IEEE Commun. Lett., vol. 4, pp. 384–386, 2000. 2007. [11] M.K. Simon and M. Alouini, “A unified approach to the per- [4] V. Tarokh, H. Jafarkhani, and A.R. Calderbank, “Space-time formance analysis of digital communication over generalized block codes from orthogonal designs,” IEEE Trans. Inf. The- fading channels,” Proc. IEEE, vol. 86, pp. 1860–1877, 1998. ory, vol. 45, pp. 1456–1467, 1999. [12] S.L. Loyka, “Channel capacity of MIMO architecture using [5] G. Jongren,¨ M. Skoglund, and B. Ottersten, “Combining beam- the exponential correlation matrix,” IEEE Commun. Lett., vol. forming and orthogonal space-time block coding,” IEEE Trans. 5, pp. 369–371, 2001. Inf. Theory, vol. 48, pp. 611–627, 2002.