<<

EURASIP Journal on Applied Signal Processing 2002:3, 316–329 c 2002 Hindawi Publishing Corporation

Rayleigh Multi-Antenna Channels

Alex Grant Institute for Telecommunications Research, University of South Australia, Mawson Lakes Boulevard, Mawson Lakes, SA 5095, Australia Email: [email protected]

Received 29 May 2001

Information theoretic properties of flat fading channels with multiple antennas are investigated. Perfect channel knowledge at the receiver is assumed. Expressions for maximum information rates and outage probabilities are derived. The advantages of transmitter channel knowledge are determined and a critical threshold is found beyond which such channel knowledge gains very little. Asymptotic expressions for the error exponent are found. For the case of transmit diversity closed form expressions for the error exponent and cutoff rate are given. The use of orthogonal modulating signals is shown to be asymptotically optimal in terms of information rate. Keywords and phrases: space-time channels, transmit diversity, information theory, error exponents.

1. INTRODUCTION receivers such as the decorrelator and MMSE filter [10]; (b) mitigation of fading effects by averaging over the spatial access to data networks such as the Internet is ex- properties of the fading process. This is a dual of interleav- pected to be an area of rapid growth for mobile communi- ing techniques which average over the temporal properties cations. High user densities will require very high speed low of the fading process; (c) increased link margins by simply delay links in order to support emerging Internet applica- collecting more of the transmitted energy at the receiver. tions such as voice and video. Even in the low mobility in- Recently, it has been realized that coordinated use of di- door environment, the deleterious effects of fading and the versity can be achieved through the use of space-time codes. need for very low transmit power combine to cause prob- Rather than relying solely on array processing of uncoded lems for transmissions. Regardless of advanced coding transmissions, forward error correction codes which add re- techniquessuchasturbo-codes[1], channel capacity remains dundancy in both the temporal and spatial domains are de- an unmovable barrier (without yet considering inefficien- signed specifically for channels with multiple transmit and cies introduced by higher layer network overheads). Without receive antennas. There are currently two main approaches changing the channel itself not much can be done. Fortu- to realizing the capacity potential of these channels: coordi- nately, increasing the number of antennas at both the base nated space-time codes and layered space-time codes. and mobile stations accomplishes exactly that, resulting in Coordinated space-time block codes [11, 12, 13]andtrel- channels with higher capacity. Such systems can theoretically lis codes [14, 15, 16, 17, 18] are designed for coordinated use increase capacity by up to a factor equaling the number of in space and time. The data is encoded using multidimen- transmit and receive antennas in the array [2, 3, 4, 5, 6]. sional codes that span the transmit array. Such codes are effi- Spatial diversity has been proposed for support of cient for small arrays, and can achieve within 3 dB of the 90% very high rate data users within third generation wide- outage capacity rate calculated in [3]. The other approach band Code-Division Multiple Access (CDMA) systems such uses layered space-time codes [19, 20, 21], where the chan- as cdma2000 [7]. Using multiple antennas, these systems nel is decomposed into parallel single-input, single-output achieve gains in link quality and therefore capacity. Clas- channels. sically, diversity has been exploited through the use of ei- Increasing the number of antennas and using space-time ther beam-steering (for antenna arrays with correlated el- codes involves both a physical and computational complex- ements), or through diversity combining (for independent ity/performance trade-off. Simple schemes such as multi- antenna arrays) [8, 9]. Use of these array processing tech- carrier transmit diversity or array processing techniques, niques can achieve any combination of the following: (a) re- such as maximum ratio combining may be easily imple- duction of multiple access interference through the “nulling” mented. How is their performance limited, when compared of strong interferers. Such techniques are complementary to to powerful space-time codes? What is the advantage of chan- (and share the mathematical formulations of) multiple-user nel knowledge at the transmitter? In this paper, we attempt Rayleigh Fading Multi-Antenna Channels 317 to answer some of these questions from an information the- matrix. For a X,E[X]andvarX are its ex- oretic point of view. We do not consider the design of space- pectation and variance. time codes. The paper is organized in the following way. The space- 2. THE SPACE-TIME CHANNEL time channel model, along with the Rayleigh fading statistical model are introduced in Section 2. Consider a point-to-point communication link with t trans- Under the assumption of fast Rayleigh fading and per- mit antennas and r receive antennas. Throughout the paper fect channel knowledge at the receiver, various channel ca- we refer to m = min{r, t} and n = max{r, t}. We let the y j ≤ j ≤ r pacity computations are carried out in Section 3.1.Inpartic- match-filtered received signal k, at antenna 1 at ular, we find simple bounds on capacity for the case where time k be given by the transmitter has no channel knowledge. These bounds are used to further investigate the rate of growth of capacity with t L−1 y j = xi f i,j nj , the number of antennas. An asymptotic (in the number of k k−l k,l + k (1) users) expression for the multiple access space-time channel i=1 l=0 capacity is shown to be independent of the number of trans- xi ≤ i ≤ t where k is the signal transmitted from antenna 1 mit antennas, indicating that for large numbers of users mul- k f i,j j l tiuser diversity dominates transmit diversity. We then pro- at time ; k,l is the response at antenna at time to an i k nj vide bounds on the outage capacity for slow fading Rayleigh impulse applied to antenna at time ;and k is a discrete- channels. These bounds provide additional support to a con- time white Gaussian process, independent over time j j j j n n = σ2 n n = j = j jecture of Telatar concerning the optimal power distribution. and antennas, with E[ k ¯k] and E[ k ¯k ] 0for In Section 3.2, we consider perfect channel knowledge and/or k = k. The integer L>0 is the (maximum) delay at the transmitter. Asymptotic expressions for capacity are spread of the fading channel. given, and the advantage of using channel knowledge at the The information theoretic analyses presented so far in the transmitter is quantified. We find a transmit power thresh- literature have concentrated on the flat fading case, we like- old beyond which transmitter channel knowledge yields only wise restrict our present scope. For flat fading, the impulse a marginal capacity advantage. We also consider a subop- f i,j response sequences k become scalars. We simplify (1)ob- timal approach, sometimes referred to as downlink beam- taining a linear algebraic model [2]. At each symbol interval, forming, in which the transmitter uses only the most pow- the received vector y ∈ Cr depends on the transmitted vector erful eigenchannel. Expressions for capacity and outage ca- x ∈ Ct according to pacity are given for this case. In Section 4, we give asymptotic expressions for a ran- y = Hx + n. (2) dom coding error exponent. In Section 5, we consider trans- mit diversity (using only a single receive antenna). In particu- Element yj is the matched-filter output from antenna j, lar, we derive a closed form expression for the error exponent while xi is the signal transmitted from antenna i. The ma- ff r×t and show that the cut-o rate increases logarithmically with trix H ∈ C has as elements Hji, which is the complex the number of antennas. We also show that asymptotically, gain between transmit antenna i and receive antenna j.The the use of orthogonal carriers for each transmit antenna is vector n contains i.i.d. circularly symmetric Gaussian noise ∗ optimal. samples [22, page 134], E[nn ] = σ2Ir . We place the follow- Throughout, we make extensive use of results from the ing power constraint on the transmitted signal (independent theory of random matrices. Of particular interest are Wishart of t), matrices, which model certain of the Rayleigh chan- nel. We make use of both the large systems approach, now fa- E [x∗x] ≤ P. (3) miliar in the analysis of multiple-input multiple-output sys- tems, as well as using results for finite dimension matrices. We denote the signal-to-noise ratio (SNR) as γ = P/σ 2. We collect in Appendix A several definitions and known re- sults for reference. Several lemmas used throughout the pa- Definition 1 (Rayleigh channel). Randomly select the en- per are also proved in Appendix A. Finally, in Appendix B we tries of H, independently of x and n as follows. Let Hji, give for reference definitions and some results concerning hy- i = 1, 2,...,t, j = 1, 2,...,r be i.i.d., zero mean Gaussian pergeometric functions of matrix argument, which are useful with independent real and imaginary parts, each with vari- in certain matrix variate distribution calculations. ance 1/2. We use the following notations. The vector x ∈ Cn is a column vector with complex entries xi, i = 1, 2,...,n. The complex entries of H are independent with uni- m×n Likewise A ∈ C is a matrix with complex entries Aij, formly distributed phase and Rayleigh distributed magni- i = 1,...,m, j = 1,...,n. The superscripts , ∗ and −1de- tudes, modelling the scenario in which the antenna separa- note, respectively, transposition, Hermitian adjoint, and ma- tion is large enough to ensure independent fading. A ma- trix inverse. By A > 0 we indicate that A is positive definite. trix H selected according to Definition 1 is matrix normal Likewise A > X means A − X > 0, det A is the determinant, distributed (i.e., H ∼ ᏺ(r, t) according to Definition A.1). ∗ tr A the trace and etr A = exp(tr A), and In is the n×n identity The matrix product HH will be of great interest and has 318 EURASIP Journal on Applied Signal Processing the , HH∗ ∼ ᐃ(r, t)byDefinition A.2. concept of forming an average capacity is somewhat less Appendix A describes some useful properties of Wishart ma- straightforward, since the transmitter has the additional op- trices that we will use throughout the paper. Appendix B tion of optimizing the power allocation over time as well as summarizes some results concerning hypergeometric func- over the eigenvalues (while maintaining the required average tions of matrix argument that are required for certain calcu- power restriction). We do not consider average capacities for lations concerning Wishart matrices. this case. The channel matrix may either be fixed, or time vary- ing. In the time varying case, we can further distinguish 3.1. Channel unknown at transmitter between fast variations (compared to the data transmission Now concentrate on the case in which only the receiver per- rate), which allow us to compute average information rates, fectly knows the time varying Rayleigh channel matrix H. and slowly varying channels, for which outage probability is The capacity of the fast time varying Rayleigh channel was more appropriate. We assume throughout that the channel found by Telatar [2]. We rewrite this capacity using an alter- matrix is known perfectly at the receiver. We consider both native form in the following theorem. the case where the channel is unknown at the transmitter and the case where it is known perfectly. The theory for the Theorem 1. At each symbol interval, let H be selected ac- case in which the channel matrix is unknown at both ter- cording to Definition 1, and assume that H is known to the minals is still in development. It is known that the capacity receiver, but not at the transmitter. Let m = min{r, t} and gains to be achieved depend upon the coherence time of the n = max{r, t}. Then the capacity of the channel (2) is given channel [23, 24, 25]. Special space-time codes have also been by designed to operate in the absence of channel state informa-   γ  tion [26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36]. C γ, t, r = ∗ ( ) E log det I + t HH  ∞   (m − 1)! γ = x e−xxn−m (7) 3. CAPACITIES n − log 1+ t ( 1)! 0 In this section, we try to gain some insight into the scaling    × Lm−1 x 2 − Lm x Lm−2 x dx, of capacity with the number of transmit and receive anten- n−m+1( ) n−m+1( ) n−m+1( ) nas. We are particularly interested in quantifying the capacity Lα k increase available through the availability of channel knowl- where k is the generalized Laguerre polynomial of order [38]. edge at the transmitter. We assume throughout that the re- Capacity is achieved for x circularly symmetric zero mean com- ∗ = P/t ceiver has perfect channel knowledge. plex Gaussian vector with E[xx ] It . Consider the channel (2), with average transmit power Proof. The expression follows from [2,Theorem2],us- constraint (3). Restrict xi, i = 1, 2,...,tto be i.i.d. zero mean Gaussian variables, each with variance P/t. Then the maxi- ing [39, equation (8.974.1)]. mum transmission rate is given by [2]as Although straightforward to numerically compute, (7)  γ  does not give much information about the scaling of capac- I γ, r, t, = ∗ . r t ( H) log2 det Ir + t HH bit/channel use (4) ity with and . Using limiting arguments, it is by now well known, and often quoted that in the case of r = t = k that This is simply an achievable rate, rather than the capacity, C(γ, k, k) is linear in k. We now give convenient upper and since no optimization over the source distribution is per- lower bounds to (7). formed. Such optimization would require knowledge of H at the transmitter. In the case of fast Rayleigh fading according Theorem 2. The capacity (7) is upper and lower bounded ac- to Definition 1, the channel capacity is given by C(γ, r, t) = cording to E[I(γ, r, t, H)] and is achieved for the i.i.d. Gaussian input γ t distribution just described. n−m C(γ, t, r) ≤ m log +logm! + log Lm − , (8) Given perfect channel knowledge at the transmitter, t γ channel capacity takes a different form, and is achieved by γ m−1 water filling on the parallel orthogonal channels defined C γ, t, r ≥ m ψ n − i , ( ) log t + ( ) (9) by the singular vectors of H,see,forexample,[37,Theo- i=0 rem 8.5.1]. Let λ1,...,λm be the m nonzero eigenvalues of HH∗. Then the capacity is given parametrically [2]by where ψ is Euler’s digamma function [39, equation (8.36)].  Proof. Inequality (8) follows directly from Jensen’s inequality CWF = log µλi, (5) −1 and Theorem A.4. The lower bound follows from det(I+A) ≥ i:λi ≤µ  det A for Hermitian A and Lemma A.2. −1 P = µ − λi . (6) −1 i:λi ≤µ It is clear from the method of proof that the upper and lower bounds (8)and(9)aretightasγ →∞. In fact, ex- Note that in the case of transmitter channel knowledge, the panding the digamma as ψ(z) = log z − (2z)−1 − θ(12z)−2,for Rayleigh Fading Multi-Antenna Channels 319 some 0 <θ<1, we may further approximate (9)by 3.1.1 Multiple users  Now consider a K user multiple access channel, in which the γ n m m − , channel from each user to the common receiver is an in- log t +log ! + log m c (10) dependent space-time channel. We suppose each user has t transmit antennas and that there are r receive antennas. At where c ≈ 0.577 is the Euler-Mascheroni constant [40, Sec- k = , ,...,K each symbol interval, user 1 2 transmits a vector tion 3.1.9.4]. Now t (k) x[k] ∈ C by sending Pk/t xi over antenna i = 1, 2,...,t ∗ such that E[x[k] x[k]] ≤ 1, corresponding to a transmit  r Pk ∈ C n−m t n power constraint .Thereceivedvectory ,observed√ lim Lm − = , (11) γ→∞ γ m at the output of the r receive antennas is y = H Px + n, where x = (x[1], x[2],...,x[K]) and H ∈ Cr×Kt contains the ffi ≤ k ≤ K hence we expect polynomial tightness of the upper and lower fading coe cients. For a given integer 1 , the ele- H bounds in γ. ment j,kt+i−1 is the complex gain between transmit antenna i k j Kt×Kt The lower bound is however poor when simultaneously of user and receive antenna .The diagonal matrix = P ,P ,...,P /t ∈ Cr γ t and t ≈ r.Thisproblemmaybesomewhatoffset by P diag( 1It 2It K It) .Thevectorn contains ∗ = σ2 noting that C is monotonic in both n and m, and hence the i.i.d. zero-mean Gaussian noise samples, E[nn ] I.We lower bound near n = m may be improved by using smaller assume that H is known at the receiver, but not at the trans- values of either n or m. mitter. It is therefore not unreasonable to use (8) as an approx- Passing to the band-limited case via the usual arguments, imation for the purposes of determining the rate of growth suppose that the total bandwidth available for transmission KW W KW of capacity with t and r, since for sufficient SNR, the upper is ,where is some fixed quantity ( should be less and lower bounds will display the same gross behavior. Three than the coherence bandwidth of the channel for the flat fad- cases are of interest: (a) fix t and let r →∞,(b)fixr and let ing assumption to be realistic). Then the sum capacity of this t →∞, and (c) fix t/r and let both t,r →∞. channelisgivenby Considering case (a), we hold t constant and send r large ∗ HPH to see that C(P, H) = KW log det Ir + bit/s. (15) KWσ2

γ r! γ If H is independently selected at each symbol interval accord- C(γ, t, r) ≈ t log +log −→ t log + t log r (12) t (r − t)! t ing to Definition 1, the capacity of the resulting channel is found by taking the expectation of (15) with respect to the r / r − t 1 results in C/r → log γ,which β is independent of . Asymptotically, there is no benefit from 1 t r The idea for this theorem arose out of discussions with S. Hanly at Mel- increasing beyond . bourne University. 320 EURASIP Journal on Applied Signal Processing

R −1 number of transmit antennas comes at the price of decreas- Pout(R, Q) ≤ Pr(X0,tr Q˜ =P det Q I , which leaves only a minimization over m. 3.1.2 Outage probability This upper bound lends additional support to the con- So far, we have concentrated on the fast time varying jecture made in [2] that the outage probability is minimized Rayleigh channel. Now consider the slowly varying channel, by using a uniform power distribution over some subset of for which outage probability has been proposed as the mea- the antennas. sure of interest [2, 3]. For a given information rate R and t × t > P R, = diagonal matrix Q 0, define out( Q) Pr(log det(Ir + 3.2. Channel known at transmitter HQH∗)

Pout(R, P) = inf Pout(R, W). (17) ter have perfect channel knowledge. We are interested in two Q:Q>0,tr Q≤P cases, firstly we compute a large systems limit for the water- filling capacity C and establish the optimal power profile. We consider H chosen once for all time according to WF Secondly, we consider a simple suboptimum case, in which our Rayleigh channel definition. Now the distribution transmitter selection diversity is used, that is, the transmit- of log det(I + W) is unknown. Apart from the mean ter simply transmits using all its power along the maximum (Theorem A.4), the moments of this quantity do not have eigenvector. In this second case, we compute the large sys- a simple expression (although in limiting cases they are tems capacity and the outage probability. Without loss of known [41]). The following theorem demonstrates the type generality, we let σ2 = 1, that is, γ = P. All of the results in of difficulty one encounters. this section will be obtained as large systems limits and rely m, n →∞ Theorem 4. Theoutageprobabilityisupperbounded on the convergence of the Wishart spectrum as in such a way that m/n → β ≤ 1. sR Pout(R, P) ≤ inf 2F0 (s,n; Q) e , (18) Given a particular eigenvalue distribution, the capacity s>0 Q>0,tr Q≤P may be found using (5). It is however a well-known fact that for a wide class of random matrices, the normalized eigen- where mFn (·; ·; ·) is a hypergeometric function of matrix argu- value distribution converges to a nonrandom limit as the ma- ment as defined in Appendix B. trices are taken large. We use this fact to determine the large systems capacity limit. Proof. Now W  HQH∗ ∼ ᐃ(m, n, Q)and   Theorem 6. Let m ≤ n approach infinity such that m/n → P (R, Q) = Pr log det(I + W) 0 by Chernoff’s bound. Now consider C b(β) WF −→ µλ dF λ ,  m log ( ) (22)   max{a(β),1/µ} E det(I + W)−s = p(W)det(I + W)−sdW  b(β)  − −n P −→ µ − λ 1 dF λ , det Q   ( ) ( ) (23) = F − − −1 {a β , /µ} Γ n s 0 ( W;etr) Q W max ( ) 1 m ( ) W>0 × det Wn−mdW, where F(λ), a(β),andb(β) are defined in Theorem A.7. (20) Proof. The proof follows from [37, Chapter 8] (using the where the second line results from (B.7) and the Wishart den- eigenvectors of H as kernels, rather than the Fourier kernel), sity (A.3). The result is then obtained through application and the convergence in distribution of the empirical eigen- of the Laplace transform recursion for the hypergeometric value distribution to F(λ)[42]. functions (B.5). According to this water-filling approach, if the transmit The following is a more computable bound. power P is such that µ−1 >a(β), the transmitter uses only a fraction F(µ−1) of the eigenchannels. If µ−1 ≤ a(β)ittransmits Theorem 5. Theoutageprobabilityisupperbounded using all the parallel channels. In this latter case, we can find   R−m log(P/t) a closed form expression for the capacity. Pout(R, P) ≤ Pr X Proof. The proof follows from det(I + A) det A and the CWF 1 1 − β 1 ∼ ᐃ m, n, −→ log P + + log − 1. (24) distribution of det W for W ( Q), (A.3), which gives m 1 − β β 1 − β Rayleigh Fading Multi-Antenna Channels 321

1 . m CWF(Pcrit(β),β) 1 025 5 1.02 4 20 dB 1.015

3 15 dB /C WF (nats) C 1.01

/m 10 dB 2 WF C 5dB 1.005 1 0dB 1 0.20.40.60.81 β 00.20.40.60.81 β Figure 2: Relative capacity gain from water-filling along the locus P = P Figure 1: Asymptotic water-filling capacity. of crit.

2.75 Proof. Under the assumption µ−1 ≤ a(β), the limit (22)be- . comes 2 5  2.25 C b(β) WF −→ log µ + log λdF(λ) m /m 2 a(β)

(25) WF 1 − β 1 C 1.75 = log µ + log − 1, β 1 − β 1.5 where the first line is due to integrating over the entire sup- 1.25 port of λ and the second line is from [42, Corollary 5.1]. It remains to find µ.Nowfrom(23), −10 0 10 20 P,dB     b β λ − a β b β − λ 1 ( ) ( ) ( ) Figure 3: Relative capacity gain from water-filling at β = 1. P −→ µ − dλ. (26) πβ λ2 2 a(β) of transmitter channel knowledge may not cost too much. This integral is of the form [39, equation (2.267.2)] and after For P>P , the capacity advantage for using transmitter straightforward calculation is found to be 1/(1 − β). crit channel knowledge is at most about 2%. P

Pcrit (left    of dot-dashed line), or for large SNR that there is only a 2 C γ −→ γm β small advantage in CWF/m to be obtained from water-filling. ( ) log 1+ 1+ a.s. (27) This is investigated further in Figure 2, where the ratio of the water-filling capacity to the corresponding limiting value of This indicates that by transmitting along the maximum C is shown for P = Pcrit(β). This loss will be even less for eigenvector, performance superior to the unfaded AWGN higher power levels (to the left of the dot-dashed line on channel is obtained. Note however that the capacity increase Figure 1). The relative difference, upper bounded by approxi- is only logarithmic with min{t,r}, as compared to the lin- mately 1.022 at β ≈ 0.1 is preserved through the scaling by m. ear capacity increase available through the use of all transmit This result indicates that in certain scenarios that the absence antennas, even in the absence of channel knowledge at the 322 EURASIP Journal on Applied Signal Processing transmitter. For finite systems, the following theorem may which is well known for the to be given be used to compute outage probabilities for the selection di- by versity approach. s2σ2 R M(s) = exp µs + . (34) Theorem 8. Let α = (e − 1)/P, then 2

Γm (m) nm Taking the supremum on both sides of (33) and using (34) Pout(R, P) = α 1F1 (n; n + m; −αI) . (28) Γm (n + m) gives the result. P R, P = Λ α Proof. From Theorem A.5, out( ) ( ). µ(ρ) may be calculated using [2, equation (13)], or may be lower bounded using [48, Theorem 3]. 4. ERROR EXPONENTS In addition to maximum information transmission rates, it 5. TRANSMIT DIVERSITY is interesting to consider the error exponent [37]forspace- In this section, we consider transmit diversity, r = 1, t>1. time channels. The error exponent gives some indication of From (4), it is easy to see that how difficult it may be to achieve a certain bit error rate. In this section, we derive a limiting expression for the error ex-   γ  ponent. Telatar [2] gives the following lower bound to the C = E log 1+ X , (35) 2t error exponent. where X is a chi-squared random variable with 2t degrees of Lemma 1. The probability of error averaged over all randomly freedom.2 As t increases, C → log(1+γ) (since X → 2t by the selected (N,NR) block codes is bounded law of large numbers), which is as if all the power were trans-    mitted over a single, nonfaded link. By increasing the num- Pe ≤ exp − N max E0(ρ) − ρR , (29) ff 0≤ρ≤1 ber of antennas, the e ects of the fading may be completely removed. In practice, only a small number of transmit anten- where nas (typically two or three for a wide range of γ) are required (see [2, Figure 3]). −ρ γ ∗ Increasing the number of transmit antennas however has E (ρ) = − log E det Ir + HH . (30) 0 t(1 + ρ) the desirable effect of increasing the error exponent, as we now show.

Theorem 10. Let r = 1, then Reliability functions for fading channels with correlated t =   multiple antennas have also been considered in [45]for 1, t ρ t t ρ see also [46]. E ρ = − (1 + ) Ψ t,t − ρ (1 + ) , 0( ) log γ +1 ; γ The following theorem uses the theory of random deter- minants to give a limiting expression for the error exponent. (36)

Theorem 9. Let t/r be fixed. The following limiting relation where Ψ is the confluent hypergeometric function [39, Sec- holds. tion 9.2]. Furthermore, letting CG(γ) = log(1+γ) be the Gaus- sian channel capacity with SNR γ we have the following limit- σ2 ρ ρ2 ( ) ing expression for the error exponent: sup E0(ρ) = µ(ρ)ρ − , (31) t→∞ 2  γ  γ  γ ∗ C − R, ≤ R ≤ C − , where, for convenience A(ρ) = I + γ/(t(1 + ρ))HH and  G if 0 G 2 2 2(γ +2) lim Er = t→∞  γ γ ρC − ρR(ρ), otherwise, µ(ρ) = lim E [log det A] = lim C ,t,r , G 1+ρ t→∞ t→∞ 1+ρ (32) (37) σ2(ρ) = lim var log det A. t→∞ where for the second case, CG(γ/2) − (γ/2(γ +2))

7 0.4 30 dB 6 5 25 dB 0.3 20 dB Er 4 AWGN 0.2 (nats)

Diversity 0 3 15 dB R 2 10 dB 0.1 1 5dB 0dB 0 0.10.20.30.40.50.60.7 246810 R (nats) t

Figure 4: Error exponent, γ = 0dB. Figure 5: Cut-off rate versus number of antennas.

Proof. From (30), we have Proof. Equation (41) is obtained from (39) via the identity Ψ α, α z = ezΓ − α, z ( ; ) (1 ). γ −ρ E0(ρ) = − log E 1+ X , (39) For t>1 the cut-off rate may be lower bounded as fol- 2t(1 + ρ) lows: X t   where is a chi-squared random variable with 2 degrees of γ 1 Ψ R ≥ log 1+ 1 − . (43) freedom. Using an integral representation for [39,equa- 0 2 t tion (9.211.4)], we get (36) after some algebraic manipula- tions. This bound is tight as t →∞or γ →∞.HenceR0 increases t X → t As increases, 2 in probability we have at least as the logarithm of t/(t − 1). Figure 5 shows R0 (solid t γ line) and the lower bound (43) (dashed) plotted versus for lim E (ρ) = ρC , (40) various SNR. t→∞ 0 G 1+ρ 5.1. Orthogonal transmit diversity which, after some manipulations (along the lines of [37, Sec- tion 7.4]) proves (37). It is interesting to consider the use of mutually orthogo- nal transmit waveforms for each transmit antenna,3 as pro- Figure 4 compares the error exponents for the additive posed for cdma2000 [7, Section 3.2.1.1.5]. We continue to white Gaussian noise channel [37, equation (7.4.33)] and the consider transmit diversity only (r = 1), and assume code- limiting t →∞transmit diversity channel with Rayleigh fad- books with i.i.d. circularly symmetric complex Gaussian let- ing (37). The signal-to-noise ratio is zero dB. Further numer- ters. This means that for a given channel, transmission rates ical investigations have shown that the gap in the error expo- are upper bounded by I as given by (4). If the channel is ran- nents only disappears as γ → 0. It is interesting to note that dom, according to Definition 1, the transmission rate is up- t →∞ C → C γ although results in G( ) there is an inherent per bounded by C as given by (7). complexity penalty to be paid (at least in terms of the error Examples of orthogonal waveforms include (a) time di- exponent). vision, in which the scalar input is successively transmit- Closed form determination of Er for finite t using (36) ted from each antenna at disjoint time intervals; (b) fre- seems difficult, although numerical methods can be used. We quency division (multi-carrier transmit diversity [7, Sec- ff R = E therefore concentrate on the cut-o rate, 0 0(1), which tion 3.2.1.1.5.1]), in which the signal for each antenna gives the error exponent up the critical rate, and a lower is modulated by an orthogonal frequency carrier; and (c) bound beyond that. Substituting ρ = 1 into the expressions code division (direct-sequence transmit diversity [7, Sec- of Theorem 10, we obtain the following corollary. tion 3.2.1.1.5.2]), where each antenna is modulated by (for example) an orthogonal Walsh code. ff Corollary 2. The cut-o rate is given by It is easy to see (e.g., through application of the data pro-

γ 2t 2t cessing theorem [49]) that restriction to orthogonal carriers R (t,γ) = t log − − log Γ 1 − t, , (41) can only decrease the maximum possible mutual informa- 0 2t γ γ tion, when compared to the use of unconstrained space-time  ∞ α−1 −t codes. In this section, we show that use of a sufficiently large where Γ(α, z) = z x e dt is the incomplete gamma func- tion. Furthermore,   γ 3What counts is that the waveforms are orthogonal at the receiver, for lim R (t,γ) = log 1+ . (42) t→∞ 0 2 example, flat fading and no Doppler spread. 324 EURASIP Journal on Applied Signal Processing number of transmit antennas makes this penalty insignifi- (8.214.1)] that cant. This may be interesting since use of orthogonal carriers ∞ xk should result in simple receiver structures [11, 12]. Narula et x = −x ,x<. Ei( ) c+log( )+ k · k 0 (47) al. [6] have found the average mutual information (assuming k=1 ! a Gaussian codebook) in the case of orthogonal signaling. Using this identity, and performing some simple rearrange- Theorem 11. Let H be a fixed matrix, known at the receiver. ments, we can write Then the average mutual information under the assumption of   ∞ − /γ k orthogonal transmit waveforms is given by 1/γ 1/γ ( 1 ) E [I⊥] − e log(γ) = −e c+ . (48)    k · k! t  2 k=1 1 Hi P I⊥ = log 1+ . (44) t 2 σ2 γ →∞ i=1 Taking limits as on both sides yields the desired result, since 1/γ → 0, forcing the exponential term to unity, and Furthermore, for this channel I⊥ ≤ I with equality if and only each element inside the sum to zero. if |Hi| = |Hj | for all i, j = 1, 2,...,t.

Hence for any given channel, the use of either multi- 6. CONCLUSION carrier transmit diversity, or orthogonal direct-sequence In this paper, we have given several calculations regarding the transmit diversity, while convenient from an implementa- capacity and error exponents of multiple antenna Gaussian tion point of view, incurs a loss, when compared to space- channels. We used simple bounds to determine the scaling time coding. Since this is true of any given channel, the aver- of capacity with the number of antennas. In particular, we ages over random channel selection (e.g., Definition 1) also found that if the number of transmit and receive antennas I ≤ C obey E[ ⊥] E[ ]. The following result gives an expression are increased proportionately, a linear increase is obtained, I for E[ ⊥] and shows its limiting behavior for large signal-to- and the rate of this increase was determined. We further dis- noise ratios. covered that asymptotically, there is no capacity advantage for the number of transmit antennas to exceed the number Theorem 12. Let H be randomly selected every symbol interval of receive antennas. In the case that the number of transmit according to Definition 1,andletH be known at the receiver. antennas is held constant, we found that capacity increases Then the maximum transmission rate, under the assumption of 1/γ logarithmically with the number of receive antennas. In the orthogonal transmit diversity is given by E[I⊥] = −e Ei(−1/γ) ∞ reverse situation, holding the number of receive antennas x = e−x/x dx nats/channel use, where Ei( ) −x is the exponential fixed, increasing the number of transmit antennas results in integral function. Furthermore, convergence to a fixed capacity. A simple calculation showed   that for multiple-access space-time channels that capacity is lim log 1+γ − E [I⊥] = c, (45) γ→∞ dominated by multiuser diversity, rather than space diversity. For large numbers of users, there is no total capacity advan- where c ≈ 0.577 is the Euler-Mascheroni constant. E[I⊥] → tage in using multiple antennas. We also gave bounds on the log(1 + γ) in the sense that their ratio tends to unity. outage probability for nonergodic channels and discovered that a constant power allocation over some subset of anten- r = t C → It is easy to verify that for 1 and increasing , nas minimized one of these bounds. γ log(1 + ). Hence even though a rate penalty is incurred for Large systems limits were calculated for the capacity in using orthogonal transmit diversity, this penalty disappears the case of perfect channel knowledge at the transmitter. For as the signal-to-noise ratio and the number of antennas is power levels above a certain critical threshold, this limit can increased. be easily calculated in closed form. We found numerically Proof. From (44), Definition 1, and the linearity of the ex- that for power levels greater than this same critical threshold that there is very little to be gained by the water-filling ap- pectation operator, we have that E[I⊥] = E[log(1 + γX)], where X is an exponentially distributed4 random vari- proach, although for smaller powers we found that capacity able [22]. Hence may be increased significantly. We also considered briefly the  use of selection diversity at the transmitter and found that ∞ this simple strategy outperforms a single antenna link with I = γx e−x dx E [ ⊥] log(1 + ) no fading. 0 (46) The remainder of the paper was devoted to calculation of = −e1/γ − 1 , Ei γ error exponents. In particular, we found a limiting expres- sion for E0(ρ), and a closed form expression for the case of according to [39, equation (4.337(2))], which is the first part transmit diversity (r = 1). We further showed that for r = 1 of the theorem. Continuing, we note from [39,equation the cut-off rate increases logarithmically with the number of transmit antennas. Although capacity is near to maximum for a small number of antennas (typically 2 or 3), use of ad- 4 χ2 ditional antennas improves performance through increase of Or equivalently, a 2 variable. Rayleigh Fading Multi-Antenna Channels 325 the error exponent. We have also shown that use of a suf- The following theorem gives the distribution of a complex ficiently large number of transmit antennas offsets the rate Wishart matrix. The proof of this theorem follows [53,Theo- penalty suffered due to orthogonal transmissions from each rem 3.2.3], using however the complex hypergeometric func- antenna, such as those described for cdma2000. tions, as outlined in Appendix B.

Theorem A.2 (Wishart distribution). Let W ∼ ᐃ(m, n). APPENDICES Then A. WISHART MATRICES Γm (m) n P(W < X) = det X 1F1 (n; n + m; −X) . (A.4) Wishart matrices are of interest in statistics and signal pro- Γm (n + m) cessing. We now review some facts concerning Wishart ma- trices, and derive some results concerning expectations of Theorem A.3 (Ordered eigenvalue distribution). Let W ∼ certain determinantal forms and further investigate the dis- ᐃ(m, n) with ordered eigenvalues λ1 ≥ λ2 ≥···≥λm. Then tribution and asymptotics of the largest eigenvalue. Books m m− providing an introduction to matrix variate distributions   π ( 1) p λ1,λ2,...,λm = are [50, 51, 52, 53]. These texts concentrate on real variables. Γm (n) Γm (m)   Although the corresponding complex cases that we require m m    n−m 2 are easily developed, they are noticeably absent from these × exp λi λi λi − λj . references. i=1 i=1 i

In fact Theorem A.1 may be used as the definition of the m−1 Wishart matrix, from which the statement of Definition A.2 E [log det W] = ψ(n − i), i=0 may be proved as a theorem. The ordering of definition and (A.8) theorem however become less important in the light of [53, m−1 = ψ n − i , Theorem 3.3.3], where it is proved that a matrix W with var log det W ( ) i= the Wishart distribution may always be factored W = XX∗, 0 where X ∼ ᏺ(m, n). It is useful however to remember that where ψ(x) = Γ(x)/Γ(x) is Euler’s digamma function [39, there are other matrix forms besides Definition A.2 that lead equation (8.36)]. to the Wishart distribution. We can also consider Wishart matrices with arbitrary co- The following lemma is from [56, Section 37]. variance structure. For A ∈ Cm×m nonsingular and Σ = AA∗, the product W = AXX∗A∗ is Wishart ᐃ(m, n, Σ)withdensity Lemma A.3. For α ∈{1, 2,...,n},letA(α) be the principal ∈ Cn×n   submatrix [57, page 17] formed from A by deleting n −1 − n−m p(W) = det Σ Γm (n) etr(−Σ 1W)detW . (A.3) rows and columns not indexed by the elements of α.Thenfor 326 EURASIP Journal on Applied Signal Processing any n × n matrix A function Λ(x) = Pr(λ0 be the largest eigenvalue of W. Then the distribution obvious texts. Rayleigh Fading Multi-Antenna Channels 327

r   k> κ = k ,k ,...,k p iλif ki >li for ing Laplace transforms [70, equation (2.1)], starting from the first index i where the partitions differ. Now let y1,...,yp k kp F X = etr X. (B.4) p y 1 ···y 0 0 be variables. Then we say that the monomial 1 p is k kp l lp κ y 1 ···y y 1 ···y of order and that 1 p is of higher order than 1 p Note that we have modified Herz’s definition to be suitable if κ>λ. The degree of a monomial in p variables is the sum for Hermitian matrices (rather than symmetric). of degrees of the individual variables. The degree of a poly- Definition B.4 (hypergeometric function of matrix argu- nomial is the maximum degree of the monomials making up ment). Let Z be a Hermitian p × p matrix. The hypergeo- the polynomial. metric function of matrix argument is defined iteratively by Define Vk to be the vector space of symmetric homoge- k p 5 V nous polynomials of degree in variables. Further let κ F a ,...,a ,c b ,...,b − −1 V κ m+1 n ( 1 m ; 1 n; Z ) be the subspace of k defined by polynomials of order . c  Then Vk is the direct sum of the irreducible invariant sub- = det Z F a ,...,a b ,...,b − Γ c etr(XZ)m n ( 1 m; 1 n; X) spaces Vκ. m ( ) X>0 Zonal polynomials were introduced by James [65, 66, 67] × det(X)c−mdX, and Constantine [68]. A concise definition of these polyno- mials for real symmetric matrix argument is given in [53]. (B.5) We define the polynomials on Hermitian matrices. − mFn (a ,...,am; b ,...,bn,c; −X 1) V +1 1 1 Definition B.2 (zonal polynomial). Let k be defined on the m−cΓ c p × p = det X m ( ) eigenvalues of a Hermitian matrix X. Then the polyno- n k (2πi) mial (tr X) ∈ Vk has a unique decomposition into polyno-  Cκ ∈ Vκ −1 mials (X) according to × etr(XZ)mFn (a1,...,am; b1,...,bn; −Z )   > k = C . Z 0 (tr X) κ (X) (B.1) −c κ × det(Z) dZ. (B.6) k The zonal polynomial Cκ (X) is the component of (tr X) in Vκ. Another particularly simple and useful form is

−a A general formula for the coefficients of zonal polyno- 1F0 (a; X) = det(I − X) . (B.7) mials has not been found. A recurrence for the coefficients may be found in [51]. An easily proved useful identity is The power series representation given below is due to Con- k Cκ (αX) = α Cκ (X). stantine [68]. The following constant occurs frequently in integrals p × p concerning zonal polynomials. Note that this differs from Theorem B.1. Let X be a Hermitian matrix. The hyper- the usual definition (e.g., [51, pages 247–248]), since we are geometric function of matrix argument can be written as considering zonal polynomials on the space of Hermitian mFn (a ,...,am; b ,...,bn; X) matrices. 1 1 ∞  a ··· a C = ( 1)κ ( m)κ κ (X). (B.8) Definition B.3 (generalized hypergeometric coefficient). Let b ··· bn j k=0 κ ( 1)κ ( )κ ! κ be a partition of k. Then REFERENCES p   a = a − j , ( )κ +1 kj (B.2) [1] C. Berrou and A. Glavieux, “Near optimum error correcting j=1 coding and decoding: turbo-codes,” IEEE Trans. Commun., vol. 44, no. 10, pp. 1261–1271, 1996. where (a)n = a(a+1) ···(a+n−1) is the Pochhammer symbol. [2] I. Emre Telatar, “Capacity of multi-antenna Gaussian chan- nels,” European Trans. Telecomm., vol. 10, no. 6, pp. 585–595, The zonal polynomials of the identity matrix are 1999. κ r [3] G. J. Foschini and M. J. Gans, “On limits of wireless commu- known [68, 69]. Suppose has exactly nonzero parts. Then nications in a fading environment when using multiple an- tennas,” Wireless Personal Communications,vol.6,no.3,pp. 311–335, 1998. 5A polynomial that is unchanged by permutation of variable subscripts [4] J. H. Winters, J. Salz, and G. D. Gitlin, “The capacity of wire- and such that every term has degree k. less communication systems can be substantially increased 328 EURASIP Journal on Applied Signal Processing

by the use of antenna diversity,” in Conf. Inform. Sci. Syst., orandum, Bell Laboratories, Lucent Technologies, April 2000, Princeton, NJ, USA, 1992. http://mars.bell-labs.com. [5] J. H. Winters, J. Salz, and R. Gitlin, “The impact of antenna [25] L. Zheng and D. Tse, “Sphere packing in the Grass- diversity on the capacity of wireless communication systems,” man manifold: a geometric approach to the noncoher- IEEE Trans. Commun., vol. 42, no. 2-4, pp. 1740–1751, 1994. ent multi-antenna channel,” in Proc. IEEE Int. Symp. In- [6]A.Narula,M.D.Trott,andG.W.Wornell, “Performance form. Theory, Sorrento, Italy, June 2000, also submit- limits of coded diversity methods for transmitter antenna ar- ted to IEEE Transactions on Information Theory.Preprint, rays,” IEEE Transactions on Information Theory,vol.45,no.7, http://degas.eecs.berkeley.edu/dtse/pub.html. pp. 2418–2433, 1999. [26] B. Hochwald and W. Sweldens, “Differential unitary space- [7] S. Dennett, “The cdma2000 ITU-R RTT candidate submis- time ,” IEEE Trans. Communications, vol. 48, no. sion (0.18),” Technical report, TIA, July 1998. 12, pp. 2041–2052, 2000. [8] W. C. Jakes, Microwave Mobile Communications,JohnWiley [27] T. L. Marzetta, B. Hochwald, and B. Hassibi, “Space-time au- and Sons, New York, NY, USA, 1974. tocoding: arbitrarily reliable communication in a single fad- [9]D.H.JohnsonandD.E.Dudgeon, Array Signal Processing: ing interval,” in Proc. IEEE Int. Symp. Inform. Theory,Sor- Concepts and Techniques, Prentice Hall, NJ, USA, 1993. rento, Italy, June 2000, also submitted to IEEE Transactions on [10] S. Verdu,´ Multiuser Detection, Cambridge University Press, Information Theory.Preprint,http://mars.bell-labs.com. Cambridge, UK, 1998. [28] D. Agrawal, T. J. Richardson, and R. Urbanke, “Multiple- [11] S. M. Alamouti, “A simple transmit diversity technique for antenna signal constellations for fading channels,” in Proc. wireless communications,” IEEE Journal on Selected Areas in IEEE Int. Symp. Inform. Theory, Sorrento, Italy, June 2000, Communications, vol. 16, no. 8, pp. 1451–1458, 1998. also submitted to IEEE Transactions on Information Theory. [12] V. Tarokh, H. Jafarkhani, and A. R. Calderbank, “Space-time Preprint, http://mars.bell-labs.com. block codes from orthogonal designs,” IEEE Transactions on [29] B. M. Hochwald and T. L. Marzetta, “Unitary space- Information Theory, vol. 45, no. 5, pp. 1456–1467, 1999. time modulation for multiple-antenna communications in [13] V. Tarokh, H. Jafarkhani, and A. R. Calderbank, “Space-time Rayleigh flat fading,” IEEE Transactions on Information The- block coding for wireless communications: performance re- ory, vol. 46, no. 2, pp. 543–564, 2000. sults,” IEEE Journal on Selected Areas in Communications,vol. [30] B. M. Hochwald, T. L. Marzetta, T. J. Richardson, W. Sweldens, 17, no. 3, pp. 451–460, 1999. and R. Urbanke, “Systematic design of unitary space-time [14] V. Tarokh, N. Seshadri, and A. R. Calderbank, “Space-time constellations,” IEEE Transactions on Information Theory, vol. codes for high data rate wireless communication: perfor- 46, no. 6, pp. 1962–1973, 2000. mance criterion and code construction,” IEEE Transactions [31] B. L. Hughes, “Extensions to the theory of differential space- on Information Theory, vol. 44, no. 2, pp. 744–765, 1998. time modulation,” in Proc. IEEE Int. Symp. Inform. Theory, [15] A. F. Naguib, V. Tarokh, N. Seshadri, and A. R. Calderbank, “A Sorrento, Italy, June 2000, also submitted to IEEE Transactions space-time coding modem for high-data-rate wireless com- on Information Theory. munications,” IEEE Journal on Selected Areas in Communica- [32] B. L. Hughes, “Differential space-time modulation,” IEEE tions, vol. 16, no. 8, pp. 1459–1478, 1998. Transactions on Information Theory, vol. 46, no. 7, pp. 2567– [16] D. Agrawal, V. Tarokh, A. Naguib, and N. Seshadri, “Space- 2578, 2000. time coded OFDM for high data-rate wireless communication [33] B. L. Hughes, “Optimal space-time constellations from over wideband channels,” in Proc. 48th IEEE Vehicular Tech- groups,” submitted to IEEE Transactions on Information The- nology Conference, vol. 3, pp. 2232–2236, Ottawa, Canada, ory, 2000. May 1998. [34] B. L. Hughes, “Space-time coding without channel estima- [17] V. Tarokh, A. Naguib, N. Seshadri, and A. R. Calderbank, tion,” in Conf. Inform. Sci. Sys., Johns Hopkins University, “Combined array processing and space-time coding,” IEEE Baltimore, Md, USA, March 1999. Transactions on Information Theory, vol. 45, no. 4, pp. 1121– [35] B. L. Hughes, “Differential space-time modulation,” in IEEE 1128, 1999. Wireless Communications and Networking Conference,vol.1, [18] S. Baro, G. Bauch, and A. Hansmann, “Improved codes for pp. 145–149, New Orleans, La, USA, September 1999. space-time trellis-coded modulation,” IEEE Communications [36] B. L. Hughes, “Further results on differential space-time mod- Letters, vol. 4, no. 1, pp. 20–22, 2000. ulation,” in IEEE Sensor Array and Multichannel Signal Pro- [19] G. J. Foschini, “Layered space-time architecture for wireless cessing Workshop, Cambridge, Mass, USA, March 2000. communication in a fading environment when using multi- [37] R. G. Gallager, Information Theory and Reliable Communica- element antennas,” Bell Labs Technical Journal, vol. 1, no. 2, tion, Jonh Wiley and Sons, New York, NY, USA, 1968. pp. 41–159, 1996. [38] G. Szego,¨ Orthogonal Polynomials, American Mathematical [20] P.W. Wonianski, G. J. Foschini, G. D. Golden, and R. A. Valen- Society, Providence, Rhode Island, 1939. zuela, “V-BLAST: an architecture for realizing very high data [39] I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series and rates over rich scattering wireless channels,” in ISSSE,pp. Products, Academic Press, London, 5th edition, 1994. 295–300, 1998. [40] I. N. Bronshtein and K. A. Semendyayev, Handbook of Math- [21] H. El Gamal and A. R. Hammons, “The layered space-time ar- ematics, Springer-Verlag, Berlin, Germany, 3rd edition, 1997. chitecture: a new perspective,” submitted to IEEE Transactions [41] V. L. Girko, Theory of Random Determinants,vol.45ofMath- on Information Theory, 2000. ematics and Its Applications (Soviet Series), Kluwer Academic, [22] A. Papoulis, Probability, Random Variables and Stochastic Pro- Dordrecht, The Netherlands, 1990. cesses, McGraw-Hill, New York, NY, USA, 2nd edition, 1984. [42] D. Jonsson, “Some limit theorems for the eigenvalues of a [23] T. L. Marzetta and B. M. Hochwald, “Capacity of a mobile sample covariance matrix,” J. Multivariate Anal., vol. 12, no. multiple-antenna communication link in Rayleigh flat fad- 1, pp. 1–38, 1982. ing,” IEEE Transactions on Information Theory,vol.45,no. [43] 3GPP, “Multiplexing and channel coding (FDD) TS 25.212,” 1, pp. 139–157, 1999. Technical report, 3GPP, April 1999, http://www.3gpp.org. [24] B. Hassibi and B. M. Hochwald, “How much training is [44] D. Gerlach and A. Paulraj, “Adaptive transmitting antenna needed in multiple-antenna wireless links?,” Technical mem- arrays with feedback,” IEEE Signal Processing Letters, vol. 1, Rayleigh Fading Multi-Antenna Channels 329

no. 10, pp. 150–152, 1994. [68] A. G. Constantine, “Some noncentral distribution problems [45] W. K. M. Ahmed and P. J. McLane, “Error exponents for two- in multivariate analysis,” Ann. Math. Statist., vol. 34, pp. dimensional time correlated flat fading channels with space 1270–1285, 1963. diversity and channel estimation,” in Proc. IEEE Int. Symp. [69] A. T. James, “Distribution of matrix variate and latent roots Inform. Theory, p. 240, Ulm, Germany, 29 June–4 July 1997. derivedfromnormalsamples,” Ann. Math. Statist., vol. 35, [46] W. K. M. Ahmed, Information theoretic reliability function pp. 475–501, 1964. for flat fading channels, Ph.D. thesis, Queen’s University, [70] C. Herz, “Bessel functions of matrix argument,” Ann. Math., Kingston, Ontario, Canada, September 1997. vol. 61, no. 3, pp. 474–523, 1955. [47] V. L. Girko, “The for random determi- nants,” Theory of Probability and Its Applications,vol.24,no. 4, pp. 729–740, 1979. Alex Grant received the B.E. and Ph.D. de- [48] A. J. Grant and P. D. Alexander, “Random sequence multi- grees from the University of South Australia sets for synchronous code-division multiple-access channels,” in 1993 and 1996, respectively. In 1997, he IEEE Transactions on Information Theory,vol.44,no.7,pp. was a post-doctoral research fellow at the 2832–2836, 1998. Laboratory for Signal and Information Pro- [49] T. M. Cover and J. A. Thomas, Elements of Information Theory, cessing, Swiss Federal Institute of Technol- John Wiley and Sons, New York, NY, USA, 1991. ogy (ETH), Zurich.¨ Since March 1998 he [50] T. W. Anderson, An Introduction to Multivariate Statistical has been with the Institute for Telecommu- Analysis, John Wiley and Sons, New York, NY, USA, 1958. nication Research, where he is now a Se- [51] R. J. Muirhead, Aspects of Multivariate Statistical Theory, John nior Lecturer. Alex’s research interests in- Wiley and Sons, New York, NY, USA, 1982. clude information theory and its applications to multiple user and [52] M. Mehta, Random Matrices, Academic Press, San Diego, other communications problems. Alex is Leader of the Coding and Calif, USA, 2nd edition, 1991. Information Theory Research Group at ITR and serves as Chair- [53] A. K. Gupta and D. K. Nagar, Matrix Variate Distributions, man for the Australian Chapter of the IEEE Information Theory vol. 104 of Monographs and Surveys in Pure and Applied Math- ematics, Chapman and Hall/CRC, Boca Raton, 2000. Society. [54] N. R. Goodman, “Statistical analysis based on a certain mul- tivariate complex Gaussian distribution (an introduction),” Ann. Math. Statist., vol. 34, no. 1, pp. 152–177, 1963. [55] A. Edelman, Eigenvalues and condition numbers of random matrices, Ph.D. thesis, M. I. T. Press, Cambridge, Mass, USA, May 1989. [56] A. C. Aitken, Determinants and Matrices, Oliver and Boyd, Edinburgh, 9th edition, 1956. [57]R.A.HornandC.R.Johnson, Matrix Analysis,OxfordUni- versity Press, Cambridge, New York, NY, USA, 1985. [58] A. Edelman, “Eigenvalues and condition numbers of random matrices,” SIAM J. Matrix Anal. Appl., vol. 9, no. 4, pp. 543– 560, 1988. [59] T. Sugiyama, “On the distribution of the largest latent root and corresponding latent vector for principal component analysis,” Ann. Math. Statist., vol. 37, pp. 995–1001, 1966. [60] T. Sugiyama, “On the distribution of the largest latent root of the covariance matrix,” Ann. Math. Statist., vol. 38, pp. 1148– 1151, 1967. [61] T. Sugiyama, “Approximation for the distribution of the largest latent root of a Wishart matrix,” Austral. J. Statist., vol. 14, no. 1, pp. 17–24, 1972. [62] P. R. Krishnaiah and T. C. Chang, “On the exact distributions of the extreme roots of the Wishart and MANOVA matrices,” J. Multivariate Anal., vol. 1, pp. 108–117, 1971. [63] Z. D. Bai and Y. Q. Yin, “Limit of the smallest eigenvalue of a large dimensional sample covariance matrix,” Ann. Probab., vol. 21, no. 3, pp. 1275–1294, 1993. [64] Z. D. Bai, J. W. Silverstein, and Y. Q. Yin, “A note on the largest eigenvalue of a large dimensional sample covariance matrix,” J. Multivariate Anal., vol. 26, no. 2, pp. 166–168, 1988. [65] A. T. James, “The distribution of the latent roots of the co- variance matrix,” Ann. Math. Statist., vol. 31, pp. 151–158, 1960. [66] A. T. James, “The distribution of noncentral means with known covariance matrix,” Ann. Math. Statist., vol. 32, pp. 874–882, 1961. [67] A. T. James, “Zonal polynomials of the real positive definite symmetric matrices,” Ann. Math., vol. 74, no. 3, pp. 456–469, 1961.